Query psy13819
Match_columns 323
No_of_seqs 322 out of 2017
Neff 6.4
Searched_HMMs 46136
Date Fri Aug 16 21:16:18 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13819.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13819hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03059 beta-galactosidase; P 100.0 5.1E-70 1.1E-74 566.5 22.8 242 66-322 20-267 (840)
2 PF01301 Glyco_hydro_35: Glyco 100.0 1.5E-69 3.3E-74 519.8 21.6 238 82-322 1-242 (319)
3 KOG0496|consensus 100.0 1.6E-63 3.4E-68 501.5 21.3 238 73-322 17-257 (649)
4 COG1874 LacA Beta-galactosidas 100.0 3.6E-32 7.8E-37 280.0 13.9 168 77-251 2-184 (673)
5 PF02449 Glyco_hydro_42: Beta- 99.9 2.5E-24 5.4E-29 210.6 10.5 139 98-252 3-162 (374)
6 KOG0496|consensus 99.8 3.1E-20 6.6E-25 188.5 4.9 64 1-65 577-640 (649)
7 PF02836 Glyco_hydro_2_C: Glyc 99.8 8.4E-17 1.8E-21 152.7 21.2 202 77-317 2-212 (298)
8 PLN03059 beta-galactosidase; P 99.8 3.5E-19 7.5E-24 186.9 4.7 62 1-64 640-728 (840)
9 PRK10150 beta-D-glucuronidase; 99.7 1.2E-15 2.6E-20 158.2 22.8 166 67-263 271-449 (604)
10 PRK10340 ebgA cryptic beta-D-g 99.6 1.1E-13 2.4E-18 150.9 21.1 197 66-312 312-514 (1021)
11 PRK09525 lacZ beta-D-galactosi 99.6 1.9E-13 4.1E-18 149.1 21.8 155 67-263 329-489 (1027)
12 PF00150 Cellulase: Cellulase 99.5 7.1E-13 1.5E-17 122.6 16.1 156 85-262 3-170 (281)
13 COG3250 LacZ Beta-galactosidas 99.4 1.5E-12 3.3E-17 138.2 15.6 126 66-234 278-409 (808)
14 TIGR03356 BGL beta-galactosida 99.4 3.9E-13 8.4E-18 134.3 8.2 119 95-227 49-168 (427)
15 PLN02998 beta-glucosidase 99.4 3E-13 6.5E-18 137.3 5.0 115 105-227 82-197 (497)
16 PLN02814 beta-glucosidase 99.4 3.2E-13 7E-18 137.3 5.0 120 95-227 72-192 (504)
17 PRK09593 arb 6-phospho-beta-gl 99.4 4.9E-13 1.1E-17 135.3 5.4 118 95-225 68-187 (478)
18 PRK09589 celA 6-phospho-beta-g 99.4 4.2E-13 9.1E-18 135.7 4.9 118 95-225 62-181 (476)
19 PRK15014 6-phospho-beta-glucos 99.3 2E-12 4.3E-17 130.9 8.6 117 95-224 64-182 (477)
20 PLN02849 beta-glucosidase 99.3 5.6E-13 1.2E-17 135.5 4.6 115 105-227 79-194 (503)
21 PRK13511 6-phospho-beta-galact 99.3 2.1E-12 4.6E-17 130.5 7.9 117 95-226 49-166 (469)
22 TIGR01233 lacG 6-phospho-beta- 99.3 3.5E-12 7.7E-17 128.7 8.4 117 95-226 48-165 (467)
23 PF00232 Glyco_hydro_1: Glycos 99.3 4.2E-13 9.2E-18 135.0 0.0 126 95-235 53-180 (455)
24 PRK09852 cryptic 6-phospho-bet 99.3 1.9E-12 4.1E-17 130.8 4.7 119 95-226 66-186 (474)
25 COG2723 BglB Beta-glucosidase/ 99.2 1.4E-11 3.1E-16 122.7 7.1 121 105-234 59-181 (460)
26 PF03198 Glyco_hydro_72: Gluca 98.8 1.5E-07 3.2E-12 90.0 17.0 201 72-311 9-246 (314)
27 smart00633 Glyco_10 Glycosyl h 98.8 6.1E-08 1.3E-12 90.4 12.7 161 128-313 3-192 (254)
28 KOG0626|consensus 98.8 7.4E-09 1.6E-13 104.4 6.5 119 106-233 92-213 (524)
29 PF13364 BetaGal_dom4_5: Beta- 98.7 1.3E-08 2.8E-13 83.8 4.5 45 2-46 60-104 (111)
30 PF13204 DUF4038: Protein of u 98.5 1.5E-06 3.3E-11 82.8 13.1 157 80-263 2-184 (289)
31 PLN02801 beta-amylase 98.3 4.5E-06 9.7E-11 84.1 10.6 84 104-190 36-127 (517)
32 PLN02803 beta-amylase 98.3 4.4E-06 9.5E-11 84.5 10.5 84 104-190 106-197 (548)
33 PLN00197 beta-amylase; Provisi 98.3 4.4E-06 9.6E-11 84.8 10.5 83 104-190 126-217 (573)
34 PLN02161 beta-amylase 98.3 5.3E-06 1.2E-10 83.5 10.8 84 104-190 116-207 (531)
35 PLN02705 beta-amylase 98.3 4.5E-06 9.7E-11 85.4 10.3 83 104-190 267-358 (681)
36 PF07745 Glyco_hydro_53: Glyco 98.2 2.4E-05 5.2E-10 76.1 14.4 179 108-311 27-237 (332)
37 PLN02905 beta-amylase 98.2 5.8E-06 1.2E-10 84.9 10.5 83 104-190 285-376 (702)
38 PF01373 Glyco_hydro_14: Glyco 98.2 2.5E-06 5.5E-11 84.2 7.2 115 106-229 17-153 (402)
39 COG3693 XynA Beta-1,4-xylanase 98.2 3.7E-05 8E-10 73.8 14.2 175 114-311 55-258 (345)
40 PF00331 Glyco_hydro_10: Glyco 98.2 1.3E-05 2.9E-10 77.4 11.1 197 92-311 11-241 (320)
41 PF14488 DUF4434: Domain of un 98.2 5.6E-05 1.2E-09 66.6 13.7 131 100-253 15-151 (166)
42 COG2730 BglC Endoglucanase [Ca 98.1 2.2E-05 4.8E-10 78.2 10.0 114 108-234 76-193 (407)
43 COG3934 Endo-beta-mannanase [C 97.4 0.00014 3E-09 73.0 4.4 173 82-267 3-186 (587)
44 COG3867 Arabinogalactan endo-1 97.1 0.0033 7.1E-08 60.1 9.1 116 107-234 65-183 (403)
45 PF14871 GHL6: Hypothetical gl 96.6 0.016 3.5E-07 49.2 9.4 97 109-211 4-122 (132)
46 KOG2230|consensus 96.4 0.016 3.5E-07 59.6 9.1 113 80-231 327-444 (867)
47 PF01229 Glyco_hydro_39: Glyco 96.2 0.015 3.3E-07 59.3 8.1 61 102-165 36-105 (486)
48 PF02638 DUF187: Glycosyl hydr 95.7 0.09 2E-06 50.8 10.4 117 103-229 17-161 (311)
49 PRK09936 hypothetical protein; 95.3 0.44 9.6E-06 45.6 13.2 59 100-164 33-92 (296)
50 smart00642 Aamy Alpha-amylase 95.1 0.05 1.1E-06 47.8 5.8 63 105-167 19-93 (166)
51 COG3623 SgaU Putative L-xylulo 94.8 1.1 2.4E-05 41.8 14.0 40 274-314 220-260 (287)
52 TIGR01515 branching_enzym alph 94.7 0.5 1.1E-05 49.8 13.2 56 108-164 159-226 (613)
53 PF02055 Glyco_hydro_30: O-Gly 93.8 3.3 7.2E-05 42.7 16.5 254 40-314 32-340 (496)
54 PRK05402 glycogen branching en 93.8 1.6 3.4E-05 47.0 14.7 56 109-164 269-335 (726)
55 PF14587 Glyco_hydr_30_2: O-Gl 93.8 2.3 5.1E-05 42.3 14.7 176 73-267 3-229 (384)
56 COG1649 Uncharacterized protei 93.5 0.44 9.5E-06 47.9 9.3 86 103-189 62-163 (418)
57 PRK14705 glycogen branching en 93.2 3 6.5E-05 47.4 16.2 56 109-164 770-835 (1224)
58 PF00128 Alpha-amylase: Alpha 93.0 0.1 2.2E-06 48.2 3.8 58 107-164 6-72 (316)
59 PRK12568 glycogen branching en 92.8 3.8 8.2E-05 44.2 15.6 56 109-166 274-341 (730)
60 PRK14706 glycogen branching en 92.7 1.9 4.2E-05 45.7 13.2 51 111-164 174-237 (639)
61 COG0296 GlgB 1,4-alpha-glucan 92.4 1.5 3.2E-05 46.4 11.7 55 104-163 164-233 (628)
62 PRK09441 cytoplasmic alpha-amy 92.1 0.25 5.3E-06 50.4 5.5 57 108-164 25-101 (479)
63 PF05913 DUF871: Bacterial pro 91.9 0.25 5.4E-06 48.8 5.1 72 93-170 2-73 (357)
64 cd06565 GH20_GcnA-like Glycosy 91.7 5.7 0.00012 38.1 14.1 57 104-164 16-80 (301)
65 PLN02447 1,4-alpha-glucan-bran 91.7 8.1 0.00018 41.9 16.5 61 105-166 251-322 (758)
66 TIGR01531 glyc_debranch glycog 91.2 1.5 3.1E-05 50.2 10.6 117 81-204 103-239 (1464)
67 PRK12313 glycogen branching en 91.2 0.4 8.6E-06 50.7 6.0 53 109-164 174-240 (633)
68 COG5309 Exo-beta-1,3-glucanase 90.7 2.9 6.2E-05 39.9 10.5 115 104-266 62-180 (305)
69 TIGR02402 trehalose_TreZ malto 90.3 0.48 1E-05 49.2 5.6 55 107-164 113-180 (542)
70 PF01261 AP_endonuc_2: Xylose 89.9 2.7 5.9E-05 36.4 9.4 128 111-265 1-135 (213)
71 cd02742 GH20_hexosaminidase Be 89.9 9.3 0.0002 36.6 13.7 142 102-265 13-193 (303)
72 PF12876 Cellulase-like: Sugar 88.8 0.96 2.1E-05 35.2 5.0 46 217-262 5-62 (88)
73 cd06568 GH20_SpHex_like A subg 88.4 10 0.00022 36.9 13.0 68 95-164 10-95 (329)
74 cd06563 GH20_chitobiase-like T 88.3 20 0.00044 35.1 15.1 65 95-164 10-106 (357)
75 cd06564 GH20_DspB_LnbB-like Gl 88.3 10 0.00022 36.6 12.8 145 93-258 7-196 (326)
76 PRK10785 maltodextrin glucosid 88.2 1 2.2E-05 47.4 6.3 55 107-164 181-246 (598)
77 TIGR02104 pulA_typeI pullulana 88.1 0.86 1.9E-05 47.9 5.7 55 109-164 168-249 (605)
78 PLN02960 alpha-amylase 88.0 0.97 2.1E-05 49.4 6.0 57 108-164 420-486 (897)
79 PRK09856 fructoselysine 3-epim 87.8 22 0.00049 32.7 14.8 53 105-162 13-65 (275)
80 TIGR00542 hxl6Piso_put hexulos 87.8 23 0.0005 32.9 15.2 58 104-164 15-73 (279)
81 PF03659 Glyco_hydro_71: Glyco 87.6 2.3 5.1E-05 42.4 8.1 89 103-225 15-103 (386)
82 PF02065 Melibiase: Melibiase; 87.5 5 0.00011 40.2 10.4 115 96-211 49-181 (394)
83 PRK13210 putative L-xylulose 5 87.0 25 0.00054 32.5 14.3 54 105-161 16-69 (284)
84 PRK01060 endonuclease IV; Prov 86.6 19 0.00042 33.3 13.3 52 105-160 12-63 (281)
85 PRK10933 trehalose-6-phosphate 86.3 1.4 3E-05 45.9 5.9 57 105-164 33-101 (551)
86 PRK13398 3-deoxy-7-phosphohept 86.1 4.5 9.8E-05 38.3 8.8 74 83-164 21-98 (266)
87 PF13200 DUF4015: Putative gly 85.9 2 4.2E-05 41.8 6.3 110 104-214 12-135 (316)
88 COG3589 Uncharacterized conser 85.7 1.8 3.9E-05 42.4 5.9 73 93-171 4-76 (360)
89 PF02679 ComA: (2R)-phospho-3- 85.7 1.4 3.1E-05 41.2 5.1 52 104-165 83-134 (244)
90 TIGR02403 trehalose_treC alpha 85.4 1.7 3.7E-05 45.2 6.0 60 105-166 27-97 (543)
91 PRK09505 malS alpha-amylase; R 85.0 1.8 4E-05 46.3 6.1 58 107-164 232-312 (683)
92 PF02837 Glyco_hydro_2_N: Glyc 84.8 1.2 2.7E-05 38.0 4.0 40 5-46 96-136 (167)
93 TIGR03849 arch_ComA phosphosul 84.8 4.1 9E-05 38.0 7.6 51 105-165 71-121 (237)
94 TIGR03234 OH-pyruv-isom hydrox 84.6 28 0.00061 31.7 13.2 43 106-162 15-57 (254)
95 smart00812 Alpha_L_fucos Alpha 84.4 8.7 0.00019 38.3 10.3 106 99-214 78-191 (384)
96 TIGR02103 pullul_strch alpha-1 83.8 19 0.00041 39.9 13.3 21 144-164 404-424 (898)
97 PLN02361 alpha-amylase 83.1 2.4 5.3E-05 42.5 5.8 57 108-164 32-96 (401)
98 TIGR02456 treS_nterm trehalose 83.1 2.3 5E-05 44.0 5.8 58 105-164 28-96 (539)
99 PF07488 Glyco_hydro_67M: Glyc 82.4 15 0.00032 35.8 10.4 137 104-262 56-194 (328)
100 cd04908 ACT_Bt0572_1 N-termina 82.2 6 0.00013 28.6 6.3 55 104-162 12-66 (66)
101 TIGR02100 glgX_debranch glycog 81.9 2.1 4.6E-05 45.9 5.1 55 110-164 189-265 (688)
102 cd06593 GH31_xylosidase_YicI Y 81.7 6.4 0.00014 37.5 7.9 70 102-171 21-93 (308)
103 PRK03705 glycogen debranching 80.7 2.5 5.5E-05 45.0 5.2 55 110-164 184-262 (658)
104 cd06562 GH20_HexA_HexB-like Be 79.9 63 0.0014 31.6 15.0 68 95-164 10-90 (348)
105 PRK14510 putative bifunctional 79.7 2.9 6.3E-05 47.7 5.5 56 109-164 191-267 (1221)
106 PF14307 Glyco_tran_WbsX: Glyc 79.4 41 0.00088 32.8 12.8 132 103-259 56-192 (345)
107 PLN00196 alpha-amylase; Provis 79.3 4.5 9.8E-05 40.9 6.2 57 108-164 47-112 (428)
108 cd06592 GH31_glucosidase_KIAA1 79.1 6 0.00013 37.9 6.8 68 101-171 26-97 (303)
109 TIGR02401 trehalose_TreY malto 79.1 4.2 9E-05 44.5 6.2 63 104-166 15-87 (825)
110 PRK14511 maltooligosyl trehalo 78.4 4.2 9.1E-05 44.7 6.0 61 104-167 19-92 (879)
111 PRK12677 xylose isomerase; Pro 78.0 31 0.00066 34.4 11.6 101 104-226 30-134 (384)
112 PRK14507 putative bifunctional 77.7 4.1 8.8E-05 47.8 5.9 61 103-166 756-829 (1693)
113 cd00019 AP2Ec AP endonuclease 77.7 60 0.0013 30.1 15.0 57 102-163 8-65 (279)
114 COG1099 Predicted metal-depend 76.8 18 0.00039 33.8 8.7 116 108-263 14-135 (254)
115 PF01261 AP_endonuc_2: Xylose 76.6 33 0.00071 29.5 10.3 120 105-252 27-153 (213)
116 PF14701 hDGE_amylase: glucano 76.6 17 0.00037 36.8 9.4 81 103-190 20-117 (423)
117 PRK12331 oxaloacetate decarbox 76.6 89 0.0019 31.9 14.6 56 97-164 88-143 (448)
118 TIGR02102 pullulan_Gpos pullul 76.5 4.6 0.0001 45.6 5.8 21 144-164 555-575 (1111)
119 PF13199 Glyco_hydro_66: Glyco 75.3 23 0.0005 37.2 10.2 198 104-309 117-361 (559)
120 COG2342 Predicted extracellula 74.2 68 0.0015 30.9 12.1 183 108-311 33-251 (300)
121 cd06591 GH31_xylosidase_XylS X 74.0 12 0.00027 36.0 7.5 67 103-170 22-92 (319)
122 COG1874 LacA Beta-galactosidas 74.0 1.9 4.1E-05 46.0 1.9 49 1-50 544-592 (673)
123 PRK09856 fructoselysine 3-epim 72.8 5.2 0.00011 37.0 4.4 56 105-164 90-149 (275)
124 cd06570 GH20_chitobiase-like_1 72.4 32 0.00069 33.3 9.8 68 95-164 10-88 (311)
125 cd06599 GH31_glycosidase_Aec37 72.3 14 0.00031 35.5 7.4 68 104-171 28-100 (317)
126 PF01791 DeoC: DeoC/LacD famil 72.2 1.6 3.5E-05 40.0 0.8 54 108-164 79-132 (236)
127 PLN02877 alpha-amylase/limit d 72.0 7.1 0.00015 43.4 5.8 21 144-164 466-486 (970)
128 cd06589 GH31 The enzymes of gl 71.4 10 0.00022 35.4 6.1 65 103-168 22-90 (265)
129 TIGR00677 fadh2_euk methylenet 71.3 35 0.00076 32.5 9.7 108 92-214 131-251 (281)
130 PRK09875 putative hydrolase; P 71.3 84 0.0018 30.1 12.4 88 76-183 8-95 (292)
131 PRK08227 autoinducer 2 aldolas 69.3 50 0.0011 31.3 10.2 63 94-162 83-145 (264)
132 PLN02784 alpha-amylase 69.1 9.9 0.00022 41.8 6.0 57 108-164 524-588 (894)
133 cd06602 GH31_MGAM_SI_GAA This 68.7 13 0.00028 36.3 6.3 68 103-171 22-93 (339)
134 PF05089 NAGLU: Alpha-N-acetyl 68.7 21 0.00046 35.0 7.7 127 103-247 17-198 (333)
135 COG1523 PulA Type II secretory 68.6 8.2 0.00018 41.5 5.2 55 110-164 205-285 (697)
136 cd06598 GH31_transferase_CtsZ 67.7 22 0.00048 34.2 7.6 69 103-171 22-97 (317)
137 TIGR02455 TreS_stutzeri trehal 67.2 16 0.00035 39.0 7.0 72 110-181 79-175 (688)
138 TIGR02631 xylA_Arthro xylose i 66.8 36 0.00079 33.9 9.1 53 103-162 30-86 (382)
139 COG1306 Uncharacterized conser 66.7 13 0.00028 36.2 5.6 56 104-164 76-144 (400)
140 smart00481 POLIIIAc DNA polyme 66.1 22 0.00048 25.6 5.7 46 106-164 16-61 (67)
141 KOG0259|consensus 66.0 11 0.00023 37.8 5.1 58 102-163 180-238 (447)
142 COG2159 Predicted metal-depend 65.8 41 0.00088 32.2 9.0 69 90-167 98-167 (293)
143 PF00728 Glyco_hydro_20: Glyco 65.5 24 0.00053 33.8 7.5 63 102-164 15-93 (351)
144 smart00518 AP2Ec AP endonuclea 65.2 1.1E+02 0.0024 28.0 14.1 55 107-167 12-67 (273)
145 PRK13210 putative L-xylulose 5 64.7 10 0.00022 35.1 4.5 59 105-164 94-153 (284)
146 PF01120 Alpha_L_fucos: Alpha- 64.4 1.4E+02 0.003 29.1 12.6 140 109-262 95-243 (346)
147 TIGR03234 OH-pyruv-isom hydrox 64.3 8.9 0.00019 35.1 4.0 60 105-164 84-143 (254)
148 PRK14040 oxaloacetate decarbox 64.3 1.8E+02 0.004 30.8 14.1 54 97-162 89-142 (593)
149 PRK12858 tagatose 1,6-diphosph 64.1 9 0.0002 37.6 4.2 66 96-164 98-163 (340)
150 PRK08673 3-deoxy-7-phosphohept 63.6 27 0.00058 34.3 7.4 75 82-164 86-164 (335)
151 cd06600 GH31_MGAM-like This fa 63.6 19 0.00041 34.7 6.3 67 103-170 22-90 (317)
152 cd00311 TIM Triosephosphate is 62.2 18 0.00039 33.8 5.6 49 111-165 77-125 (242)
153 PF08531 Bac_rhamnosid_N: Alph 62.1 30 0.00064 30.3 6.8 44 4-50 13-66 (172)
154 PRK09997 hydroxypyruvate isome 61.4 11 0.00025 34.6 4.2 60 105-164 85-144 (258)
155 cd07937 DRE_TIM_PC_TC_5S Pyruv 61.2 25 0.00055 33.2 6.6 51 101-163 87-137 (275)
156 PF14307 Glyco_tran_WbsX: Glyc 61.1 1.7E+02 0.0036 28.5 13.4 44 79-125 150-195 (345)
157 cd06603 GH31_GANC_GANAB_alpha 60.9 28 0.00061 33.8 7.0 68 103-171 22-91 (339)
158 COG3684 LacD Tagatose-1,6-bisp 60.9 10 0.00022 36.0 3.7 61 101-164 107-167 (306)
159 cd04882 ACT_Bt0572_2 C-termina 60.7 21 0.00045 25.0 4.6 55 104-160 10-64 (65)
160 PRK13209 L-xylulose 5-phosphat 60.4 1.4E+02 0.0031 27.5 13.8 54 105-161 21-74 (283)
161 TIGR00587 nfo apurinic endonuc 60.2 73 0.0016 29.8 9.5 58 107-168 13-72 (274)
162 PRK09997 hydroxypyruvate isome 59.6 1.4E+02 0.0031 27.3 13.8 42 107-162 17-58 (258)
163 COG0366 AmyA Glycosidases [Car 59.4 16 0.00034 36.5 5.1 54 108-163 32-96 (505)
164 KOG0470|consensus 59.1 12 0.00026 40.2 4.2 57 108-164 258-331 (757)
165 PRK09282 pyruvate carboxylase 57.9 2.6E+02 0.0055 29.7 14.6 56 97-164 88-143 (592)
166 PRK11572 copper homeostasis pr 57.2 16 0.00035 34.3 4.4 62 91-162 59-120 (248)
167 TIGR00433 bioB biotin syntheta 57.0 17 0.00037 34.1 4.6 52 108-163 123-177 (296)
168 TIGR01108 oadA oxaloacetate de 56.6 2.7E+02 0.0058 29.5 14.7 54 99-164 85-138 (582)
169 cd06595 GH31_xylosidase_XylS-l 56.5 45 0.00097 31.7 7.5 67 103-169 23-99 (292)
170 PRK14582 pgaB outer membrane N 56.3 1.2E+02 0.0027 32.6 11.3 76 87-169 311-406 (671)
171 PRK14042 pyruvate carboxylase 56.0 2.8E+02 0.006 29.5 14.1 53 99-163 90-142 (596)
172 cd01299 Met_dep_hydrolase_A Me 55.8 34 0.00074 32.5 6.6 60 104-164 119-180 (342)
173 TIGR00542 hxl6Piso_put hexulos 55.0 19 0.00042 33.5 4.6 56 105-164 94-153 (279)
174 PRK09989 hypothetical protein; 54.9 26 0.00057 32.2 5.5 43 106-162 16-58 (258)
175 PF14683 CBM-like: Polysacchar 54.6 12 0.00027 32.8 3.0 37 8-45 96-148 (167)
176 cd06597 GH31_transferase_CtsY 54.2 44 0.00095 32.6 7.1 68 103-170 22-111 (340)
177 cd02871 GH18_chitinase_D-like 54.0 1.5E+02 0.0033 28.3 10.8 84 145-257 61-148 (312)
178 PLN03036 glutamine synthetase; 53.9 1.8E+02 0.0039 29.6 11.5 66 106-177 231-308 (432)
179 PRK00042 tpiA triosephosphate 53.2 30 0.00066 32.5 5.6 49 111-165 79-127 (250)
180 PLN02561 triosephosphate isome 52.9 31 0.00067 32.5 5.6 49 111-165 81-129 (253)
181 cd06604 GH31_glucosidase_II_Ma 52.8 39 0.00085 32.7 6.5 68 103-171 22-91 (339)
182 PRK14566 triosephosphate isome 52.5 43 0.00093 31.7 6.5 49 111-165 88-136 (260)
183 PF13380 CoA_binding_2: CoA bi 51.6 43 0.00092 27.3 5.6 44 102-161 63-106 (116)
184 PRK15492 triosephosphate isome 50.6 35 0.00077 32.2 5.6 49 111-165 87-135 (260)
185 COG1735 Php Predicted metal-de 50.5 97 0.0021 30.2 8.5 94 71-183 13-109 (316)
186 KOG0805|consensus 50.5 59 0.0013 30.9 6.9 79 144-231 37-122 (337)
187 PF01055 Glyco_hydro_31: Glyco 49.5 50 0.0011 33.0 6.8 70 103-173 41-112 (441)
188 cd06594 GH31_glucosidase_YihQ 49.1 72 0.0016 30.7 7.7 69 103-171 21-98 (317)
189 PRK08645 bifunctional homocyst 48.8 87 0.0019 33.2 8.7 113 86-214 459-579 (612)
190 PTZ00333 triosephosphate isome 48.3 42 0.00091 31.6 5.7 49 111-165 82-130 (255)
191 PRK13209 L-xylulose 5-phosphat 47.1 26 0.00057 32.5 4.2 57 106-164 100-158 (283)
192 PF01487 DHquinase_I: Type I 3 46.8 36 0.00079 30.7 5.0 63 95-166 118-184 (224)
193 PRK09432 metF 5,10-methylenete 46.6 1.1E+02 0.0024 29.4 8.4 89 110-214 168-266 (296)
194 TIGR02635 RhaI_grampos L-rhamn 46.5 3.1E+02 0.0068 27.4 14.1 121 104-252 39-166 (378)
195 PRK02412 aroD 3-dehydroquinate 46.5 62 0.0013 30.2 6.5 31 95-125 140-172 (253)
196 cd06601 GH31_lyase_GLase GLase 45.7 2.6E+02 0.0057 27.2 11.0 66 103-169 22-89 (332)
197 TIGR00419 tim triosephosphate 45.3 49 0.0011 30.2 5.5 44 111-164 74-117 (205)
198 COG3661 AguA Alpha-glucuronida 45.2 2.8E+02 0.0062 28.6 11.1 113 105-234 183-295 (684)
199 cd06545 GH18_3CO4_chitinase Th 45.1 2.2E+02 0.0047 26.1 10.0 31 135-165 36-67 (253)
200 COG1809 (2R)-phospho-3-sulfola 45.1 1.1E+02 0.0023 28.7 7.5 63 93-165 78-140 (258)
201 cd00019 AP2Ec AP endonuclease 45.0 18 0.0004 33.5 2.8 59 105-164 85-143 (279)
202 KOG3293|consensus 45.0 50 0.0011 27.8 4.9 74 31-107 2-76 (134)
203 KOG0622|consensus 44.2 50 0.0011 33.5 5.7 68 102-179 190-258 (448)
204 cd02877 GH18_hevamine_XipI_cla 44.0 2.9E+02 0.0063 26.2 12.9 48 118-165 24-80 (280)
205 PRK14565 triosephosphate isome 44.0 2.7E+02 0.0059 25.9 11.9 49 111-165 78-126 (237)
206 PLN02284 glutamine synthetase 43.9 3.3E+02 0.0071 26.8 11.5 52 111-167 176-239 (354)
207 PRK14567 triosephosphate isome 43.7 53 0.0011 31.0 5.6 49 111-165 78-126 (253)
208 PRK09250 fructose-bisphosphate 43.5 37 0.00081 33.5 4.7 89 109-217 150-238 (348)
209 cd06569 GH20_Sm-chitobiase-lik 43.0 94 0.002 31.6 7.7 64 95-163 14-116 (445)
210 cd00958 DhnA Class I fructose- 42.9 60 0.0013 29.4 5.8 63 96-164 66-129 (235)
211 PF07755 DUF1611: Protein of u 42.5 23 0.00049 34.3 3.0 62 90-166 35-97 (301)
212 COG1448 TyrB Aspartate/tyrosin 42.2 27 0.00058 34.9 3.5 61 89-164 173-242 (396)
213 PF03932 CutC: CutC family; I 42.1 21 0.00045 32.5 2.5 59 94-162 61-119 (201)
214 PRK13396 3-deoxy-7-phosphohept 41.9 1.3E+02 0.0028 29.8 8.2 74 83-164 94-172 (352)
215 COG1979 Uncharacterized oxidor 41.8 1.1E+02 0.0023 30.4 7.4 62 74-165 32-93 (384)
216 PRK12595 bifunctional 3-deoxy- 41.3 1.2E+02 0.0026 30.0 7.9 74 83-164 112-189 (360)
217 PRK09057 coproporphyrinogen II 41.3 82 0.0018 31.1 6.8 52 108-164 104-160 (380)
218 PF03102 NeuB: NeuB family; I 41.2 46 0.00099 31.1 4.7 65 101-165 52-121 (241)
219 PF02228 Gag_p19: Major core p 41.1 11 0.00024 29.3 0.5 38 104-158 21-58 (92)
220 PRK13125 trpA tryptophan synth 40.8 57 0.0012 30.1 5.4 47 108-165 91-137 (244)
221 PRK12399 tagatose 1,6-diphosph 40.7 47 0.001 32.5 4.8 62 101-164 101-162 (324)
222 PF07071 DUF1341: Protein of u 40.7 64 0.0014 29.6 5.4 72 107-217 137-211 (218)
223 COG1891 Uncharacterized protei 40.6 12 0.00026 33.7 0.7 48 113-163 139-186 (235)
224 TIGR03128 RuMP_HxlA 3-hexulose 40.2 71 0.0015 28.2 5.7 41 110-163 68-108 (206)
225 PRK08195 4-hyroxy-2-oxovalerat 40.1 67 0.0014 31.4 5.9 45 108-164 91-135 (337)
226 cd00003 PNPsynthase Pyridoxine 40.0 58 0.0013 30.4 5.1 49 104-170 109-158 (234)
227 PRK10658 putative alpha-glucos 39.9 98 0.0021 33.2 7.6 68 104-171 282-352 (665)
228 cd07944 DRE_TIM_HOA_like 4-hyd 39.9 54 0.0012 30.8 5.1 45 108-164 85-129 (266)
229 cd06416 GH25_Lys1-like Lys-1 i 39.4 1.2E+02 0.0025 26.8 7.0 84 95-183 56-156 (196)
230 TIGR01698 PUNP purine nucleoti 39.2 52 0.0011 30.7 4.8 41 84-124 47-88 (237)
231 KOG0471|consensus 39.1 48 0.001 34.6 5.0 64 107-170 42-114 (545)
232 PF10566 Glyco_hydro_97: Glyco 39.0 66 0.0014 30.7 5.5 62 102-164 29-93 (273)
233 PRK10397 lipoprotein; Provisio 38.5 41 0.00088 28.6 3.5 19 16-34 34-52 (137)
234 PF02606 LpxK: Tetraacyldisacc 38.4 91 0.002 30.4 6.5 63 84-164 224-286 (326)
235 COG1453 Predicted oxidoreducta 38.3 1.7E+02 0.0038 29.3 8.4 57 103-183 32-90 (391)
236 TIGR01211 ELP3 histone acetylt 38.2 63 0.0014 33.6 5.7 63 93-164 195-262 (522)
237 TIGR01232 lacD tagatose 1,6-di 38.1 36 0.00077 33.3 3.6 52 110-164 111-163 (325)
238 PRK06852 aldolase; Validated 38.1 70 0.0015 31.0 5.6 48 111-164 121-174 (304)
239 PF12891 Glyco_hydro_44: Glyco 38.0 74 0.0016 29.8 5.5 86 144-234 24-137 (239)
240 PRK09739 hypothetical protein; 37.9 1.1E+02 0.0023 27.1 6.4 76 89-164 5-88 (199)
241 COG5520 O-Glycosyl hydrolase [ 37.9 2.5E+02 0.0053 28.3 9.2 116 116-252 77-204 (433)
242 KOG2233|consensus 37.7 1.3E+02 0.0027 31.5 7.4 153 102-268 75-292 (666)
243 PRK04161 tagatose 1,6-diphosph 37.6 54 0.0012 32.1 4.7 62 101-164 103-164 (329)
244 cd00537 MTHFR Methylenetetrahy 37.3 1.8E+02 0.0039 27.1 8.2 91 109-214 151-250 (274)
245 PF05706 CDKN3: Cyclin-depende 37.2 1.5E+02 0.0032 26.4 7.0 50 105-164 58-107 (168)
246 cd07943 DRE_TIM_HOA 4-hydroxy- 37.1 70 0.0015 29.8 5.4 45 108-164 88-132 (263)
247 PRK09989 hypothetical protein; 37.0 48 0.001 30.4 4.2 59 106-164 86-144 (258)
248 TIGR00676 fadh2 5,10-methylene 36.9 2.4E+02 0.0052 26.4 9.0 109 91-214 126-247 (272)
249 PRK05265 pyridoxine 5'-phospha 36.7 70 0.0015 30.0 5.1 49 104-170 112-161 (239)
250 COG0662 {ManC} Mannose-6-phosp 36.4 60 0.0013 26.8 4.3 41 5-50 66-110 (127)
251 PRK10569 NAD(P)H-dependent FMN 36.4 92 0.002 27.7 5.8 73 89-164 2-75 (191)
252 KOG3698|consensus 36.4 2.5E+02 0.0054 30.0 9.4 72 85-164 11-94 (891)
253 COG0149 TpiA Triosephosphate i 36.3 77 0.0017 29.9 5.4 49 111-165 81-129 (251)
254 cd04726 KGPDC_HPS 3-Keto-L-gul 36.2 89 0.0019 27.3 5.7 45 109-166 68-113 (202)
255 TIGR03217 4OH_2_O_val_ald 4-hy 35.8 82 0.0018 30.7 5.8 45 108-164 90-134 (333)
256 PRK13762 tRNA-modifying enzyme 35.6 1.5E+02 0.0032 28.7 7.5 43 130-181 127-173 (322)
257 PRK08208 coproporphyrinogen II 35.4 49 0.0011 33.2 4.3 50 108-161 141-194 (430)
258 KOG1065|consensus 35.4 62 0.0014 35.4 5.2 66 105-173 311-380 (805)
259 TIGR00539 hemN_rel putative ox 35.1 66 0.0014 31.4 5.0 52 108-163 100-156 (360)
260 PRK07432 5'-methylthioadenosin 35.1 39 0.00084 32.5 3.3 39 84-124 43-85 (290)
261 PF02126 PTE: Phosphotriestera 35.0 2.3E+02 0.0049 27.4 8.6 62 105-183 38-99 (308)
262 PTZ00372 endonuclease 4-like p 34.9 90 0.002 31.6 6.0 83 82-165 150-240 (413)
263 PRK07379 coproporphyrinogen II 34.7 78 0.0017 31.5 5.5 52 108-164 115-172 (400)
264 PRK01045 ispH 4-hydroxy-3-meth 34.7 1.1E+02 0.0025 29.4 6.5 70 87-169 30-100 (298)
265 COG1324 CutA Uncharacterized p 34.5 78 0.0017 25.9 4.5 36 207-249 67-102 (104)
266 TIGR01361 DAHP_synth_Bsub phos 34.5 1.1E+02 0.0024 28.8 6.2 74 83-164 19-96 (260)
267 TIGR03471 HpnJ hopanoid biosyn 34.4 42 0.00091 34.0 3.6 55 108-164 287-343 (472)
268 PRK06703 flavodoxin; Provision 34.4 2.7E+02 0.0059 23.1 9.4 100 85-213 46-148 (151)
269 PLN02429 triosephosphate isome 34.2 88 0.0019 30.5 5.6 49 111-165 140-188 (315)
270 PRK05660 HemN family oxidoredu 34.2 51 0.0011 32.5 4.1 50 108-161 107-160 (378)
271 cd07944 DRE_TIM_HOA_like 4-hyd 33.9 97 0.0021 29.1 5.8 66 100-165 15-81 (266)
272 cd02874 GH18_CFLE_spore_hydrol 33.6 2.9E+02 0.0062 26.1 9.1 43 117-165 24-66 (313)
273 PRK12330 oxaloacetate decarbox 33.4 92 0.002 32.3 5.9 52 99-162 91-142 (499)
274 PF00121 TIM: Triosephosphate 33.4 33 0.00072 32.0 2.5 49 111-165 77-125 (244)
275 cd06542 GH18_EndoS-like Endo-b 33.2 3.8E+02 0.0082 24.4 10.0 85 144-255 51-142 (255)
276 cd04886 ACT_ThrD-II-like C-ter 33.2 1.6E+02 0.0036 20.2 7.0 58 104-161 9-72 (73)
277 PF04909 Amidohydro_2: Amidohy 33.2 1E+02 0.0022 27.6 5.6 148 104-268 83-236 (273)
278 PRK10340 ebgA cryptic beta-D-g 33.0 41 0.0009 37.8 3.6 36 7-45 139-175 (1021)
279 TIGR00559 pdxJ pyridoxine 5'-p 33.0 88 0.0019 29.3 5.2 48 105-170 110-158 (237)
280 PLN02231 alanine transaminase 32.4 2.1E+02 0.0045 29.8 8.4 58 102-163 253-310 (534)
281 COG3142 CutC Uncharacterized p 32.3 43 0.00092 31.3 2.9 63 90-162 58-120 (241)
282 KOG1411|consensus 32.0 59 0.0013 32.4 4.0 184 5-234 129-326 (427)
283 PRK00870 haloalkane dehalogena 31.8 1.3E+02 0.0028 27.8 6.3 68 86-159 45-115 (302)
284 cd06416 GH25_Lys1-like Lys-1 i 31.6 3.6E+02 0.0077 23.7 10.7 122 108-268 12-136 (196)
285 PRK15108 biotin synthase; Prov 31.6 77 0.0017 31.0 4.8 49 108-161 136-188 (345)
286 PLN02540 methylenetetrahydrofo 31.6 1.5E+02 0.0033 31.3 7.1 94 106-214 154-259 (565)
287 cd04883 ACT_AcuB C-terminal AC 31.6 1.8E+02 0.0039 20.6 5.8 53 105-161 13-69 (72)
288 PLN02607 1-aminocyclopropane-1 31.4 2.4E+02 0.0051 28.5 8.5 60 101-164 181-240 (447)
289 PF08821 CGGC: CGGC domain; I 31.2 1.4E+02 0.003 24.4 5.5 57 102-164 49-106 (107)
290 cd06547 GH85_ENGase Endo-beta- 31.1 1.2E+02 0.0026 29.7 6.1 115 121-263 32-149 (339)
291 PF01075 Glyco_transf_9: Glyco 31.0 48 0.001 29.8 3.1 74 90-166 108-194 (247)
292 COG1306 Uncharacterized conser 30.8 2.5E+02 0.0055 27.5 7.9 97 102-214 190-297 (400)
293 COG2876 AroA 3-deoxy-D-arabino 30.7 1.9E+02 0.0041 27.7 7.0 57 104-164 58-116 (286)
294 PF01183 Glyco_hydro_25: Glyco 30.7 3.3E+02 0.0072 23.4 8.3 116 111-264 12-129 (181)
295 COG0413 PanB Ketopantoate hydr 30.6 1.6E+02 0.0035 28.1 6.5 65 79-165 74-138 (268)
296 TIGR03581 EF_0839 conserved hy 30.6 1.1E+02 0.0024 28.4 5.3 73 106-217 136-211 (236)
297 COG0602 NrdG Organic radical a 30.4 2.5E+02 0.0055 25.4 7.7 77 87-181 21-114 (212)
298 PF09587 PGA_cap: Bacterial ca 30.4 4.2E+02 0.0092 24.1 9.6 81 81-168 122-227 (250)
299 PRK08508 biotin synthase; Prov 30.4 86 0.0019 29.6 4.8 48 108-160 102-153 (279)
300 PF02057 Glyco_hydro_59: Glyco 30.3 4.3E+02 0.0094 28.5 10.3 134 109-268 65-215 (669)
301 COG3664 XynB Beta-xylosidase [ 30.2 1.1E+02 0.0024 31.0 5.6 133 111-267 11-144 (428)
302 cd08183 Fe-ADH2 Iron-containin 30.2 1.2E+02 0.0026 29.7 6.0 62 87-165 22-83 (374)
303 PF10566 Glyco_hydro_97: Glyco 30.2 72 0.0016 30.5 4.2 48 107-162 108-155 (273)
304 PF13987 YedD: YedD-like prote 30.2 16 0.00036 29.7 -0.1 19 16-34 10-28 (111)
305 PRK09058 coproporphyrinogen II 30.0 93 0.002 31.5 5.3 52 108-164 163-220 (449)
306 PRK06294 coproporphyrinogen II 29.9 1.1E+02 0.0025 30.0 5.7 53 108-164 103-160 (370)
307 cd08185 Fe-ADH1 Iron-containin 29.4 2E+02 0.0043 28.3 7.4 68 87-166 25-92 (380)
308 PLN02763 hydrolase, hydrolyzin 29.3 2E+02 0.0042 32.5 7.9 67 104-171 200-268 (978)
309 PRK10426 alpha-glucosidase; Pr 29.3 1.7E+02 0.0038 31.1 7.3 66 105-170 221-295 (635)
310 PRK09525 lacZ beta-D-galactosi 28.9 37 0.0008 38.3 2.3 37 6-46 150-188 (1027)
311 PRK15108 biotin synthase; Prov 28.8 1.9E+02 0.004 28.3 7.0 57 102-165 76-132 (345)
312 COG2179 Predicted hydrolase of 28.7 1.4E+02 0.003 26.7 5.4 45 110-163 19-68 (175)
313 PF11347 DUF3148: Protein of u 28.4 1.3E+02 0.0028 22.4 4.4 61 23-89 2-62 (63)
314 PRK10422 lipopolysaccharide co 28.3 1.8E+02 0.0038 28.0 6.7 63 101-166 198-273 (352)
315 PRK08446 coproporphyrinogen II 28.0 1.3E+02 0.0028 29.3 5.7 52 108-164 98-155 (350)
316 PRK08599 coproporphyrinogen II 27.9 69 0.0015 31.4 3.9 49 108-159 100-151 (377)
317 PRK14905 triosephosphate isome 27.9 1.2E+02 0.0027 29.9 5.6 50 111-166 88-137 (355)
318 cd04740 DHOD_1B_like Dihydroor 27.9 1.8E+02 0.0039 27.3 6.5 62 103-165 100-162 (296)
319 PRK07823 5'-methylthioadenosin 27.8 93 0.002 29.5 4.5 57 84-140 44-104 (264)
320 PLN02389 biotin synthase 27.5 95 0.0021 30.9 4.7 50 108-162 178-231 (379)
321 PF04914 DltD_C: DltD C-termin 27.2 57 0.0012 27.6 2.7 24 143-166 35-58 (130)
322 PRK05904 coproporphyrinogen II 27.2 1.2E+02 0.0026 29.7 5.4 52 108-164 103-160 (353)
323 PF02811 PHP: PHP domain; Int 26.6 1.8E+02 0.0039 24.2 5.8 47 105-164 16-62 (175)
324 PRK05628 coproporphyrinogen II 26.5 1.3E+02 0.0028 29.4 5.5 53 108-164 108-165 (375)
325 cd03174 DRE_TIM_metallolyase D 26.5 1E+02 0.0022 28.0 4.5 55 108-164 77-135 (265)
326 PRK04302 triosephosphate isome 26.5 1.4E+02 0.0031 27.0 5.4 47 109-165 76-122 (223)
327 PF03740 PdxJ: Pyridoxal phosp 26.4 72 0.0016 29.9 3.4 49 104-170 110-159 (239)
328 PTZ00377 alanine aminotransfer 26.4 3.4E+02 0.0073 27.4 8.6 59 102-164 200-258 (481)
329 PF11261 IRF-2BP1_2: Interfero 26.4 46 0.001 23.8 1.6 30 127-156 19-49 (54)
330 PLN02450 1-aminocyclopropane-1 26.3 1.9E+02 0.004 29.3 6.7 58 103-164 174-231 (468)
331 PTZ00372 endonuclease 4-like p 26.3 7E+02 0.015 25.3 13.7 55 108-168 144-202 (413)
332 TIGR02171 Fb_sc_TIGR02171 Fibr 26.3 2.5E+02 0.0054 31.4 7.8 71 144-235 808-879 (912)
333 cd08190 HOT Hydroxyacid-oxoaci 26.2 2.4E+02 0.0052 28.2 7.4 65 88-165 24-88 (414)
334 COG0076 GadB Glutamate decarbo 26.1 95 0.0021 31.8 4.5 71 83-164 152-244 (460)
335 PLN02389 biotin synthase 26.1 1.8E+02 0.0038 29.0 6.3 61 100-165 114-174 (379)
336 TIGR03101 hydr2_PEP hydrolase, 26.0 1.4E+02 0.0031 28.0 5.5 70 90-159 28-99 (266)
337 cd01019 ZnuA Zinc binding prot 26.0 1.3E+02 0.0028 28.4 5.1 57 86-164 178-235 (286)
338 TIGR03567 FMN_reduc_SsuE FMN r 25.9 2.4E+02 0.0051 24.3 6.5 82 90-181 2-84 (171)
339 PF03170 BcsB: Bacterial cellu 25.7 1.6E+02 0.0036 30.8 6.4 42 7-49 363-410 (605)
340 COG1082 IolE Sugar phosphate i 25.6 1.6E+02 0.0034 26.8 5.6 61 104-164 83-146 (274)
341 PRK12360 4-hydroxy-3-methylbut 25.6 1.8E+02 0.0038 27.9 6.0 73 87-169 31-103 (281)
342 COG2108 Uncharacterized conser 25.4 1.2E+02 0.0026 29.8 4.8 48 108-164 124-171 (353)
343 COG1663 LpxK Tetraacyldisaccha 25.2 1.3E+02 0.0029 29.6 5.1 59 84-158 230-288 (336)
344 cd01017 AdcA Metal binding pro 25.1 1.2E+02 0.0026 28.4 4.8 58 85-163 169-226 (282)
345 cd06546 GH18_CTS3_chitinase GH 25.0 3.8E+02 0.0083 24.9 8.1 82 146-256 61-142 (256)
346 TIGR02026 BchE magnesium-proto 24.8 81 0.0018 32.3 3.8 51 108-163 287-342 (497)
347 TIGR03551 F420_cofH 7,8-dideme 24.6 65 0.0014 31.3 2.9 51 108-162 141-197 (343)
348 PLN03244 alpha-amylase; Provis 24.5 59 0.0013 35.7 2.8 47 119-165 401-462 (872)
349 smart00518 AP2Ec AP endonuclea 24.2 1.8E+02 0.0039 26.6 5.7 72 90-162 27-102 (273)
350 PRK07094 biotin synthase; Prov 24.2 73 0.0016 30.4 3.2 51 108-161 129-182 (323)
351 PRK09860 putative alcohol dehy 24.1 2.6E+02 0.0057 27.6 7.1 66 88-166 32-97 (383)
352 cd07939 DRE_TIM_NifV Streptomy 24.1 1.1E+02 0.0023 28.4 4.2 56 108-164 72-130 (259)
353 PRK10150 beta-D-glucuronidase; 24.1 54 0.0012 34.4 2.4 40 6-48 94-134 (604)
354 PRK08883 ribulose-phosphate 3- 24.1 2.1E+02 0.0045 26.2 6.0 41 110-164 73-113 (220)
355 PLN02587 L-galactose dehydroge 24.0 2E+02 0.0043 27.3 6.1 19 146-164 187-205 (314)
356 cd08171 GlyDH-like2 Glycerol d 24.0 1.4E+02 0.003 29.0 5.1 65 87-165 22-86 (345)
357 PRK13962 bifunctional phosphog 23.9 1.5E+02 0.0033 31.8 5.7 49 111-165 474-522 (645)
358 cd04724 Tryptophan_synthase_al 23.9 1.7E+02 0.0036 27.0 5.4 43 108-164 94-136 (242)
359 KOG4039|consensus 23.9 1.8E+02 0.0039 26.6 5.3 67 99-170 103-172 (238)
360 PRK15447 putative protease; Pr 23.8 2.4E+02 0.0052 26.9 6.6 59 100-164 10-68 (301)
361 cd01018 ZntC Metal binding pro 23.7 1.2E+02 0.0025 28.3 4.3 55 86-163 169-223 (266)
362 PF05378 Hydant_A_N: Hydantoin 23.7 2.6E+02 0.0055 24.6 6.3 33 102-134 131-163 (176)
363 cd08181 PPD-like 1,3-propanedi 23.4 2.6E+02 0.0056 27.2 6.9 67 87-165 25-91 (357)
364 cd03789 GT1_LPS_heptosyltransf 23.3 1.8E+02 0.004 26.7 5.6 76 90-168 124-211 (279)
365 PTZ00445 p36-lilke protein; Pr 23.2 1.7E+02 0.0037 27.1 5.1 62 103-164 27-98 (219)
366 PRK09249 coproporphyrinogen II 23.1 90 0.0019 31.6 3.7 47 108-158 151-201 (453)
367 PF00282 Pyridoxal_deC: Pyrido 23.1 1.8E+02 0.004 28.6 5.8 72 86-164 139-231 (373)
368 PLN02376 1-aminocyclopropane-1 23.1 2.3E+02 0.005 29.0 6.7 59 101-163 180-238 (496)
369 CHL00200 trpA tryptophan synth 23.0 1.3E+02 0.0028 28.4 4.5 44 107-164 108-151 (263)
370 PRK13347 coproporphyrinogen II 23.0 1.4E+02 0.0031 30.2 5.1 47 108-159 152-203 (453)
371 cd08194 Fe-ADH6 Iron-containin 22.9 3E+02 0.0064 27.0 7.2 67 87-166 23-89 (375)
372 cd07493 Peptidases_S8_9 Peptid 22.8 2.3E+02 0.0051 25.8 6.2 62 104-165 90-155 (261)
373 PRK14041 oxaloacetate decarbox 22.7 8.5E+02 0.018 25.0 14.5 50 102-163 92-141 (467)
374 PLN02229 alpha-galactosidase 22.6 1.8E+02 0.0038 29.7 5.6 72 99-170 74-156 (427)
375 PRK06756 flavodoxin; Provision 22.6 4.4E+02 0.0096 21.7 8.8 99 87-212 49-147 (148)
376 TIGR02201 heptsyl_trn_III lipo 22.5 1.9E+02 0.0042 27.5 5.7 65 99-166 194-271 (344)
377 TIGR01210 conserved hypothetic 22.5 1.2E+02 0.0027 29.1 4.4 62 108-178 117-180 (313)
378 cd00384 ALAD_PBGS Porphobilino 22.5 7.1E+02 0.015 24.4 9.4 32 133-165 41-72 (314)
379 cd01311 PDC_hydrolase 2-pyrone 22.5 2E+02 0.0043 26.5 5.7 50 108-166 83-132 (263)
380 cd01335 Radical_SAM Radical SA 22.4 1.1E+02 0.0023 25.4 3.6 57 108-164 88-145 (204)
381 TIGR03566 FMN_reduc_MsuE FMN r 22.4 2.9E+02 0.0064 23.7 6.4 83 90-181 2-87 (174)
382 cd00502 DHQase_I Type I 3-dehy 22.3 1.2E+02 0.0025 27.5 4.0 30 95-124 118-149 (225)
383 COG0219 CspR Predicted rRNA me 22.3 68 0.0015 28.1 2.3 18 173-190 104-121 (155)
384 PRK08931 5'-methylthioadenosin 22.3 1.2E+02 0.0025 29.3 4.1 39 84-124 43-85 (289)
385 TIGR01370 cysRS possible cyste 22.3 5.5E+02 0.012 25.0 8.8 154 150-313 87-274 (315)
386 PF00218 IGPS: Indole-3-glycer 22.1 2.2E+02 0.0048 26.8 5.9 102 40-164 46-165 (254)
387 cd06831 PLPDE_III_ODC_like_AZI 22.0 1.6E+02 0.0034 29.4 5.1 67 100-178 145-214 (394)
388 PRK06920 dnaE DNA polymerase I 22.0 1.6E+02 0.0034 33.8 5.5 51 105-168 19-69 (1107)
389 PRK03204 haloalkane dehalogena 21.9 2.3E+02 0.0049 26.2 6.0 67 86-159 33-101 (286)
390 PF06230 DUF1009: Protein of u 21.8 2.1E+02 0.0044 26.4 5.4 42 108-161 172-213 (214)
391 cd03334 Fab1_TCP TCP-1 like do 21.7 2.9E+02 0.0063 25.8 6.6 61 87-164 86-159 (261)
392 PF15631 Imm-NTF2-2: NTF2 fold 21.5 55 0.0012 24.6 1.3 30 69-98 19-49 (66)
393 TIGR02090 LEU1_arch isopropylm 21.4 1.2E+02 0.0025 29.9 4.0 56 108-164 74-132 (363)
394 PRK06582 coproporphyrinogen II 21.3 2.7E+02 0.0059 27.6 6.6 53 108-164 111-167 (390)
395 TIGR00538 hemN oxygen-independ 21.3 95 0.002 31.4 3.5 47 108-159 151-202 (455)
396 TIGR02690 resist_ArsH arsenica 21.3 6.4E+02 0.014 23.1 12.1 80 89-179 28-107 (219)
397 PRK05799 coproporphyrinogen II 21.2 1.6E+02 0.0034 28.8 4.9 48 108-159 99-150 (374)
398 PRK08564 5'-methylthioadenosin 21.1 1.4E+02 0.0031 28.2 4.4 41 84-124 47-89 (267)
399 PRK07279 dnaE DNA polymerase I 21.0 1.8E+02 0.004 33.0 5.8 47 105-164 18-64 (1034)
400 cd06525 GH25_Lyc-like Lyc mura 21.0 54 0.0012 28.7 1.4 88 94-186 54-148 (184)
401 PF02548 Pantoate_transf: Keto 20.9 3.8E+02 0.0082 25.5 7.1 56 102-183 92-147 (261)
402 PRK09532 DNA polymerase III su 20.9 1.9E+02 0.0042 32.1 5.9 46 106-164 20-65 (874)
403 PF11340 DUF3142: Protein of u 20.8 3.6E+02 0.0078 24.2 6.6 57 202-262 23-79 (181)
404 PLN02880 tyrosine decarboxylas 20.7 2.2E+02 0.0047 29.3 6.0 54 103-163 222-275 (490)
405 PF04476 DUF556: Protein of un 20.6 1.8E+02 0.004 27.2 4.9 49 111-162 137-185 (235)
406 cd01137 PsaA Metal binding pro 20.5 1.6E+02 0.0036 27.7 4.7 55 87-162 177-231 (287)
407 COG3623 SgaU Putative L-xylulo 20.5 2.1E+02 0.0046 27.1 5.2 56 105-163 96-154 (287)
408 TIGR01752 flav_long flavodoxin 20.5 5.4E+02 0.012 21.9 9.5 116 87-211 44-165 (167)
409 PF00326 Peptidase_S9: Prolyl 20.5 2.3E+02 0.0049 24.7 5.4 65 86-151 143-207 (213)
410 cd04909 ACT_PDH-BS C-terminal 20.4 2.3E+02 0.005 20.0 4.5 55 105-160 13-69 (69)
411 TIGR03569 NeuB_NnaB N-acetylne 20.4 8.1E+02 0.018 23.9 11.0 26 101-126 72-97 (329)
412 cd08551 Fe-ADH iron-containing 20.4 3.8E+02 0.0081 26.1 7.4 66 88-166 24-89 (370)
413 cd04739 DHOD_like Dihydroorota 20.4 2.9E+02 0.0062 26.7 6.4 64 102-167 109-173 (325)
414 PRK05898 dnaE DNA polymerase I 20.4 1.9E+02 0.0042 32.5 5.8 47 105-164 18-64 (971)
415 PF08194 DIM: DIM protein; In 20.3 57 0.0012 21.6 1.0 11 6-16 23-33 (36)
416 TIGR03099 dCO2ase_PEP1 pyridox 20.1 7.3E+02 0.016 24.2 9.4 68 98-165 164-232 (398)
417 PRK01076 L-rhamnose isomerase; 20.0 8.5E+02 0.018 24.8 9.6 104 86-204 52-161 (419)
No 1
>PLN03059 beta-galactosidase; Provisional
Probab=100.00 E-value=5.1e-70 Score=566.47 Aligned_cols=242 Identities=26% Similarity=0.457 Sum_probs=220.1
Q ss_pred ccccccceEEEEecCCeEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccch
Q psy13819 66 RRARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRD 145 (323)
Q Consensus 66 ~~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~d 145 (323)
+.+.......|+||+++|+|||||++|+||++||||+||++|+|+|+|||++|+|||+|||+||.|||+||+|||+|.+|
T Consensus 20 ~~~~~~~~~~v~~d~~~f~idG~p~~i~sG~iHY~R~~p~~W~d~L~k~Ka~GlNtV~tYV~Wn~HEp~~G~~dF~G~~D 99 (840)
T PLN03059 20 SSWVSHGSASVSYDHRAFIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGNYYFEDRYD 99 (840)
T ss_pred hhhhccceeEEEEeCCEEEECCEEEEEEEeCcccCcCCHHHHHHHHHHHHHcCCCeEEEEecccccCCCCCeeeccchHH
Confidence 34455556678999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHH--hhhccCCCcE
Q psy13819 146 IEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQ--KYLYGNDRPI 223 (323)
Q Consensus 146 l~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~--~~~~~n~g~V 223 (323)
|++||++|+|.||+|||||||||||||++||+|.||++ .|+|.+|++||.|++++++|+++|+++|+ ++++++||||
T Consensus 100 L~~Fl~la~e~GLyvilRpGPYIcAEw~~GGlP~WL~~-~~~i~~Rs~d~~fl~~v~~~~~~l~~~l~~~~l~~~~GGPI 178 (840)
T PLN03059 100 LVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKY-VPGIEFRTDNGPFKAAMQKFTEKIVDMMKSEKLFEPQGGPI 178 (840)
T ss_pred HHHHHHHHHHcCCEEEecCCcceeeeecCCCCchhhhc-CCCcccccCCHHHHHHHHHHHHHHHHHHhhcceeecCCCcE
Confidence 99999999999999999999999999999999999987 69999999999999999999999999996 6889999999
Q ss_pred EEEccCCCcCCCC----CChHHHHHHHHHHHHHhcCCceEEEeeCCCCccccccccCceEEeecCCCCCchhHHHHHHHh
Q psy13819 224 ILVQVENEYGSDA----ECDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAYLRCTVDGVYSTVDFTVFKDVNVSFQAQRT 299 (323)
Q Consensus 224 i~~QieNE~g~~~----~~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~~~~~~~g~~~~~nfg~~~~~~~~~~~~r~ 299 (323)
||+|||||||+|. .+|++|++||++++++ .|++||+||||++. ++++++.|+| | ...+.|..
T Consensus 179 ImvQIENEYGs~~~~~~~~d~~Yl~~l~~~~~~-~Gi~VPl~t~dg~~------~~~~v~~t~N-g---~~~~~f~~--- 244 (840)
T PLN03059 179 ILSQIENEYGPVEWEIGAPGKAYTKWAADMAVK-LGTGVPWVMCKQED------APDPVIDTCN-G---FYCENFKP--- 244 (840)
T ss_pred EEEEecccccceecccCcchHHHHHHHHHHHHH-cCCCcceEECCCCC------CCccceecCC-C---chhhhccc---
Confidence 9999999999973 5689999999999999 59999999999874 3578998999 3 22344443
Q ss_pred hCCCCCeEEEEecccccccccCC
Q psy13819 300 RAPQGPLVNAEFEFFPMLLWAGM 322 (323)
Q Consensus 300 ~~p~~Plm~~E~~~gw~~~W~~~ 322 (323)
..+.+|+|+||||+|||++||+.
T Consensus 245 ~~~~~P~m~tE~w~GWf~~wG~~ 267 (840)
T PLN03059 245 NKDYKPKMWTEAWTGWYTEFGGA 267 (840)
T ss_pred CCCCCCcEEeccCchhHhhcCCC
Confidence 23457999999999999999973
No 2
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=100.00 E-value=1.5e-69 Score=519.84 Aligned_cols=238 Identities=43% Similarity=0.792 Sum_probs=200.3
Q ss_pred eEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 82 TFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 82 ~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
+|+|||||++|+||++||+|+|+++|+++|+|||++|+|||++||+||.|||+||+|||+|.+||++||++|+|+||+||
T Consensus 1 ~~~~~g~~~~~~~Ge~hy~r~p~~~W~~~l~k~ka~G~n~v~~yv~W~~he~~~g~~df~g~~dl~~f~~~a~~~gl~vi 80 (319)
T PF01301_consen 1 SFLIDGKPFFILSGEFHYFRIPPEYWRDRLQKMKAAGLNTVSTYVPWNLHEPEEGQFDFTGNRDLDRFLDLAQENGLYVI 80 (319)
T ss_dssp CEEETTEEE-EEEEEE-GGGS-GGGHHHHHHHHHHTT-SEEEEE--HHHHSSBTTB---SGGG-HHHHHHHHHHTT-EEE
T ss_pred CeEECCEEEEEEEeeeccccCChhHHHHHHHHHHhCCcceEEEeccccccCCCCCcccccchhhHHHHHHHHHHcCcEEE
Confidence 58999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHH
Q psy13819 162 LRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPA 241 (323)
Q Consensus 162 lr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~ 241 (323)
|||||||||||++||+|.||.+ .+++.+|++||.|++++++|+++|+++|+++|+++||||||+|||||||.+ .++++
T Consensus 81 lrpGpyi~aE~~~gG~P~Wl~~-~~~~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~GGpII~vQvENEyg~~-~~~~~ 158 (319)
T PF01301_consen 81 LRPGPYICAEWDNGGLPAWLLR-KPDIRLRTNDPPFLEAVERWYRALAKIIKPLQYTNGGPIIMVQVENEYGSY-GTDRA 158 (319)
T ss_dssp EEEES---TTBGGGG--GGGGG-STTS-SSSS-HHHHHHHHHHHHHHHHHHGGGBGGGTSSEEEEEESSSGGCT-SS-HH
T ss_pred ecccceecccccchhhhhhhhc-cccccccccchhHHHHHHHHHHHHHHHHHhhhhcCCCceehhhhhhhhCCC-cccHh
Confidence 9999999999999999999998 579999999999999999999999999999999999999999999999965 56999
Q ss_pred HHHHHHHHHHHhcCCc-eEEEeeCCCCcccccc-ccCc--eEEeecCCCCCchhHHHHHHHhhCCCCCeEEEEecccccc
Q psy13819 242 HAVWLRDLLRTYVQDK-AVLYSTDGAFDAYLRC-TVDG--VYSTVDFTVFKDVNVSFQAQRTRAPQGPLVNAEFEFFPML 317 (323)
Q Consensus 242 Y~~~l~~~~~~~~g~~-~~l~t~dg~~~~~~~~-~~~g--~~~~~nfg~~~~~~~~~~~~r~~~p~~Plm~~E~~~gw~~ 317 (323)
|++.|++++++. +++ ++++|+|+++.....+ +++| ++++.+|++.+.+.+.|..+++++|++|+|++|||+|||+
T Consensus 159 Y~~~l~~~~~~~-g~~~~~~~t~d~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~p~~P~~~~E~~~Gwf~ 237 (319)
T PF01301_consen 159 YMEALKDAYRDW-GIDPVLLYTTDGPWGSWLPDGGLPGADIYATDNFPPGDNPDEYFGDQRSFQPNQPLMCTEFWGGWFD 237 (319)
T ss_dssp HHHHHHHHHHHT-T-SSSBEEEEESSSHCCHCCC-TTTGSCEEEEEETTTSSHHHHHHHHHHHHTTS--EEEEEESS---
T ss_pred HHHHHHHHHHHh-hCccceeeccCCCcccccccCCCCcceEEeccccCCCchHHHHHhhhhhcCCCCCeEEEEecccccc
Confidence 999999999996 554 7899999998777777 8899 9999999987666789999999999999999999999999
Q ss_pred cccCC
Q psy13819 318 LWAGM 322 (323)
Q Consensus 318 ~W~~~ 322 (323)
+||.-
T Consensus 238 ~WG~~ 242 (319)
T PF01301_consen 238 HWGGP 242 (319)
T ss_dssp BTTS-
T ss_pred ccCCC
Confidence 99973
No 3
>KOG0496|consensus
Probab=100.00 E-value=1.6e-63 Score=501.51 Aligned_cols=238 Identities=32% Similarity=0.608 Sum_probs=218.4
Q ss_pred eEEEEecCCeEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHH
Q psy13819 73 TFAIDLAGDTFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQL 152 (323)
Q Consensus 73 ~~~v~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~ 152 (323)
++.|+|++++|++||+|++++||++||||++|++|+++|+++|++|+|+|+|||+||.|||+||+|||+|+.||.+||++
T Consensus 17 ~~~v~yd~~~~~idG~r~~~isGsIHY~R~~pe~W~~~i~k~k~~Gln~IqtYVfWn~Hep~~g~y~FsG~~DlvkFikl 96 (649)
T KOG0496|consen 17 SFNVTYDKRSLLIDGQRFILISGSIHYPRSTPEMWPDLIKKAKAGGLNVIQTYVFWNLHEPSPGKYDFSGRYDLVKFIKL 96 (649)
T ss_pred eeEEeccccceeecCCeeEEEEeccccccCChhhhHHHHHHHHhcCCceeeeeeecccccCCCCcccccchhHHHHHHHH
Confidence 78899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCc
Q psy13819 153 AVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEY 232 (323)
Q Consensus 153 a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~ 232 (323)
|++.||+|+||+||||||||++||+|.||.. .|++.+|++|+.|++++++|+++|++++++++++|||||||.||||||
T Consensus 97 ~~~~GLyv~LRiGPyIcaEw~~GG~P~wL~~-~pg~~~Rt~nepfk~~~~~~~~~iv~~mk~L~~~qGGPIIl~QIENEY 175 (649)
T KOG0496|consen 97 IHKAGLYVILRIGPYICAEWNFGGLPWWLRN-VPGIVFRTDNEPFKAEMERWTTKIVPMMKKLFASQGGPIILVQIENEY 175 (649)
T ss_pred HHHCCeEEEecCCCeEEecccCCCcchhhhh-CCceEEecCChHHHHHHHHHHHHHHHHHHHHHhhcCCCEEEEEeechh
Confidence 9999999999999999999999999977776 799999999999999999999999999999999999999999999999
Q ss_pred CCCCCChHHHHHHHHHHHHHhcCCceEEEeeCCCCcccccc-ccCceEEeecCCCC--CchhHHHHHHHhhCCCCCeEEE
Q psy13819 233 GSDAECDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAYLRC-TVDGVYSTVDFTVF--KDVNVSFQAQRTRAPQGPLVNA 309 (323)
Q Consensus 233 g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~~~~-~~~g~~~~~nfg~~--~~~~~~~~~~r~~~p~~Plm~~ 309 (323)
| +|.+++++..+++++-+++|+++++....++.| +.+ +.+++-. +.....+..++..+|++|+|+|
T Consensus 176 G-------~~~~~~~~~~k~y~~w~a~m~~~l~~gvpw~mCk~~d----apd~~in~cng~~c~~~f~~pn~~~kP~~wt 244 (649)
T KOG0496|consen 176 G-------NYLRALGAEGKSYLKWAAVLATSLGTGVPWVMCKQDD----APDPGINTCNGFYCGDTFKRPNSPNKPLVWT 244 (649)
T ss_pred h-------HHHHHHHHHHHHhhccceEEEEecCCCCceeEecCCC----CCCccccccCCccchhhhccCCCCCCCceec
Confidence 8 677888899999889899999999998888999 332 2233211 2344456667788999999999
Q ss_pred EecccccccccCC
Q psy13819 310 EFEFFPMLLWAGM 322 (323)
Q Consensus 310 E~~~gw~~~W~~~ 322 (323)
|+|||||++||+.
T Consensus 245 E~wtgwf~~wGg~ 257 (649)
T KOG0496|consen 245 ENWTGWFTHWGGP 257 (649)
T ss_pred ccccchhhhhCCC
Confidence 9999999999985
No 4
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=99.97 E-value=3.6e-32 Score=279.95 Aligned_cols=168 Identities=29% Similarity=0.496 Sum_probs=144.0
Q ss_pred EecCCeEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeecccccchHHHHHHHHHH
Q psy13819 77 DLAGDTFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVE 155 (323)
Q Consensus 77 ~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e 155 (323)
.++..++.+||+|++++||++||+|+|++.|.+||++||++|+|+|++ ||+|+.|||++|+|||+ ..|.. |+++|++
T Consensus 2 ~~~~~~~~~dg~~~~l~gG~y~p~~~p~~~w~ddl~~mk~~G~N~V~ig~faW~~~eP~eG~fdf~-~~D~~-~l~~a~~ 79 (673)
T COG1874 2 SYDGYSFIRDGRRILLYGGDYYPERWPRETWMDDLRKMKALGLNTVRIGYFAWNLHEPEEGKFDFT-WLDEI-FLERAYK 79 (673)
T ss_pred cccccceeeCCceeEEeccccChHHCCHHHHHHHHHHHHHhCCCeeEeeeEEeeccCccccccCcc-cchHH-HHHHHHh
Confidence 457889999999999999999999999999999999999999999999 99999999999999999 45566 9999999
Q ss_pred CCCeEEEccCC-ccccccCCCCCcccccccCCCee---------ecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEE
Q psy13819 156 EDLYILLRPGP-FICGKRDFGGFPPWLLKVAPDIL---------LRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIIL 225 (323)
Q Consensus 156 ~GL~vilr~GP-yi~aEw~~gg~P~Wl~~~~p~~~---------~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~ 225 (323)
.||+|||+||| ..|++|...++|+||.. ++... ...+++-|++++.+.++.+.+ +++++|++||+
T Consensus 80 ~Gl~vil~t~P~g~~P~Wl~~~~PeiL~~-~~~~~~~~~g~r~~~~~~~~~Yr~~~~~i~~~ire----r~~~~~~~v~~ 154 (673)
T COG1874 80 AGLYVILRTGPTGAPPAWLAKKYPEILAV-DENGRVRSDGARENICPVSPVYREYLDRILQQIRE----RLYGNGPAVIT 154 (673)
T ss_pred cCceEEEecCCCCCCchHHhcCChhheEe-cCCCcccCCCcccccccccHHHHHHHHHHHHHHHH----HHhccCCceeE
Confidence 99999999999 88999999999999877 44322 225677788888875555444 45899999999
Q ss_pred EccCCCcCC-CCC---ChHHHHHHHHHHHH
Q psy13819 226 VQVENEYGS-DAE---CDPAHAVWLRDLLR 251 (323)
Q Consensus 226 ~QieNE~g~-~~~---~~~~Y~~~l~~~~~ 251 (323)
||++||||+ +|. |...+..||++.+-
T Consensus 155 w~~dneY~~~~~~~~~~~~~f~~wLk~~yg 184 (673)
T COG1874 155 WQNDNEYGGHPCYCDYCQAAFRLWLKKGYG 184 (673)
T ss_pred EEccCccCCccccccccHHHHHHHHHhCcc
Confidence 999999999 343 45777888887773
No 5
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=99.91 E-value=2.5e-24 Score=210.60 Aligned_cols=139 Identities=28% Similarity=0.514 Sum_probs=113.6
Q ss_pred cCCCCChhcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCC
Q psy13819 98 HYFRAPPGRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGG 176 (323)
Q Consensus 98 Hy~r~p~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg 176 (323)
++..+|++.|++||++||++|+|+||+ .+.|+.+||+||+|||+. |+++|++|+++||+|||++. ...
T Consensus 3 ~pe~~~~e~~~~d~~~m~~~G~n~vri~~~~W~~lEP~eG~ydF~~---lD~~l~~a~~~Gi~viL~~~--------~~~ 71 (374)
T PF02449_consen 3 YPEQWPEEEWEEDLRLMKEAGFNTVRIGEFSWSWLEPEEGQYDFSW---LDRVLDLAAKHGIKVILGTP--------TAA 71 (374)
T ss_dssp -GGGS-CCHHHHHHHHHHHHT-SEEEE-CCEHHHH-SBTTB---HH---HHHHHHHHHCTT-EEEEEEC--------TTT
T ss_pred CcccCCHHHHHHHHHHHHHcCCCEEEEEEechhhccCCCCeeecHH---HHHHHHHHHhccCeEEEEec--------ccc
Confidence 445788999999999999999999997 899999999999999999 78889999999999999884 577
Q ss_pred CcccccccCCCeee----------------cCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC-CCCC-
Q psy13819 177 FPPWLLKVAPDILL----------------RQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS-DAEC- 238 (323)
Q Consensus 177 ~P~Wl~~~~p~~~~----------------R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~-~~~~- 238 (323)
.|.||.+++|++.. ..++|.|++++++++++|+++ |++++.|++|||+||++. .+.|
T Consensus 72 ~P~Wl~~~~Pe~~~~~~~g~~~~~g~~~~~~~~~p~yr~~~~~~~~~l~~~-----y~~~p~vi~~~i~NE~~~~~~~~~ 146 (374)
T PF02449_consen 72 PPAWLYDKYPEILPVDADGRRRGFGSRQHYCPNSPAYREYARRFIRALAER-----YGDHPAVIGWQIDNEPGYHRCYSP 146 (374)
T ss_dssp S-HHHHCCSGCCC-B-TTTSBEECCCSTT-HCCHHHHHHHHHHHHHHHHHH-----HTTTTTEEEEEECCSTTCTS--SH
T ss_pred cccchhhhcccccccCCCCCcCccCCccccchhHHHHHHHHHHHHHHHHhh-----ccccceEEEEEeccccCcCcCCCh
Confidence 99999887887653 156899999999999999999 889999999999999987 4544
Q ss_pred --hHHHHHHHHHHHHH
Q psy13819 239 --DPAHAVWLRDLLRT 252 (323)
Q Consensus 239 --~~~Y~~~l~~~~~~ 252 (323)
.++|.+||+++|..
T Consensus 147 ~~~~~f~~wLk~kY~t 162 (374)
T PF02449_consen 147 ACQAAFRQWLKEKYGT 162 (374)
T ss_dssp HHHHHHHHHHHHHHSS
T ss_pred HHHHHHHHHHHHHhCC
Confidence 47899999999873
No 6
>KOG0496|consensus
Probab=99.80 E-value=3.1e-20 Score=188.53 Aligned_cols=64 Identities=36% Similarity=0.732 Sum_probs=60.8
Q ss_pred CCCCceeEEEEcCcccccccCCCCCeeEEEeccccccCCcceEEEEEEeeCCCCccccchhhhhh
Q psy13819 1 MRRWSKGVLFANGRHWGRYWSEVGPQYSLFVPEEFLKVGTNRITIFELTRAPDKYDVDFVDKISQ 65 (323)
Q Consensus 1 ~~~~~kg~~~vng~~lGryw~~~gpq~~lyvP~~~l~~g~N~i~i~e~~~~~~~~~i~~~~~~i~ 65 (323)
|.|||||+|||||+||||||++.|||++||||++|||++.|.|||||+++ +++..|+|+++++.
T Consensus 577 m~g~GKG~vwVNG~niGRYW~~~G~Q~~yhvPr~~Lk~~~N~lvvfEee~-~~p~~i~~~~~~~~ 640 (649)
T KOG0496|consen 577 MNGWGKGQVWVNGQNIGRYWPSFGPQRTYHVPRSWLKPSGNLLVVFEEEG-GDPNGISFVTRPVL 640 (649)
T ss_pred cCCCcceEEEECCcccccccCCCCCceEEECcHHHhCcCCceEEEEEecc-CCCccceEEEeEee
Confidence 78999999999999999999989999999999999999999999999999 77889999998865
No 7
>PF02836 Glyco_hydro_2_C: Glycosyl hydrolases family 2, TIM barrel domain; InterPro: IPR006103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. Beta-galactosidase from E. coli has a TIM-barrel-like core surrounded by four other largely beta domains [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3CMG_A 3FN9_C 1YQ2_A 3K4D_B 3LPG_B 3LPF_A 3K4A_B 3K46_B 3GM8_A 3DEC_A ....
Probab=99.76 E-value=8.4e-17 Score=152.70 Aligned_cols=202 Identities=19% Similarity=0.242 Sum_probs=128.9
Q ss_pred EecCCeEEeCCeeeEEeeeeecCC------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHH
Q psy13819 77 DLAGDTFRLNEDPFQFVSGSFHYF------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFL 150 (323)
Q Consensus 77 ~~~~~~~~~dGk~~~i~~g~~Hy~------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl 150 (323)
...++.|+|||||+.|.|.+.|.. .+|.+.++.||++||++|+|+||+ .|.|. -.+|+
T Consensus 2 ev~~~~~~lNGk~~~l~Gv~~h~~~~~~g~a~~~~~~~~d~~l~k~~G~N~iR~-----~h~p~-----------~~~~~ 65 (298)
T PF02836_consen 2 EVKDGGFYLNGKPIFLRGVNRHQDYPGLGRAMPDEAMERDLELMKEMGFNAIRT-----HHYPP-----------SPRFY 65 (298)
T ss_dssp EEETTEEEETTEEE-EEEEEE-S-BTTTBT---HHHHHHHHHHHHHTT-SEEEE-----TTS-------------SHHHH
T ss_pred EEECCEEEECCEEEEEEEEeeCcCcccccccCCHHHHHHHHHHHHhcCcceEEc-----ccccC-----------cHHHH
Confidence 347999999999999999999963 367899999999999999999999 77776 47889
Q ss_pred HHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCC
Q psy13819 151 QLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVEN 230 (323)
Q Consensus 151 ~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieN 230 (323)
++|.+.||.|+..+. . .+.-.|-... .......+|.+.+.+.+-+++++.+ ..|||+||+|-+.|
T Consensus 66 ~~cD~~GilV~~e~~-~-------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~-----~~NHPSIi~W~~gN 130 (298)
T PF02836_consen 66 DLCDELGILVWQEIP-L-------EGHGSWQDFG--NCNYDADDPEFRENAEQELREMVRR-----DRNHPSIIMWSLGN 130 (298)
T ss_dssp HHHHHHT-EEEEE-S---------BSCTSSSSTS--CTSCTTTSGGHHHHHHHHHHHHHHH-----HTT-TTEEEEEEEE
T ss_pred HHHhhcCCEEEEecc-c-------cccCccccCC--ccccCCCCHHHHHHHHHHHHHHHHc-----CcCcCchheeecCc
Confidence 999999999998762 1 1111121110 0123356899999999999999998 88999999999999
Q ss_pred CcCCCCCChHHHHHHHHHHHHHhcCCceEEEeeCCCCccccccccCceEEeecCCCC-CchhHHHHHHHh--hCCCCCeE
Q psy13819 231 EYGSDAECDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAYLRCTVDGVYSTVDFTVF-KDVNVSFQAQRT--RAPQGPLV 307 (323)
Q Consensus 231 E~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~~~~~~~g~~~~~nfg~~-~~~~~~~~~~r~--~~p~~Plm 307 (323)
|. ....+++.|.+++++......+.++++. ....+.......+. ..... ...+..-..+.. ..+++|++
T Consensus 131 E~-----~~~~~~~~l~~~~k~~DptRpv~~~~~~-~~~~~~~~~~~~~~--~~y~~~~~~~~~~~~~~~~~~~~~kP~i 202 (298)
T PF02836_consen 131 ES-----DYREFLKELYDLVKKLDPTRPVTYASNG-WDPYVDDIIFDIYS--GWYNGYGDPEDFEKYLEDWYKYPDKPII 202 (298)
T ss_dssp SS-----HHHHHHHHHHHHHHHH-TTSEEEEETGT-SGGSTSSCEECSET--TTSSSCCHHHHHHHHHHHHHHHCTS-EE
T ss_pred cC-----ccccchhHHHHHHHhcCCCCceeecccc-cccccccccccccc--cccCCcccHHHHHHHHHhccccCCCCeE
Confidence 99 2577888899999986555544455452 21112110000110 11111 122222223332 46899999
Q ss_pred EEEecccccc
Q psy13819 308 NAEFEFFPML 317 (323)
Q Consensus 308 ~~E~~~gw~~ 317 (323)
.+||-...+.
T Consensus 203 ~sEyg~~~~~ 212 (298)
T PF02836_consen 203 ISEYGADAYN 212 (298)
T ss_dssp EEEESEBBSS
T ss_pred ehhccccccc
Confidence 9999766554
No 8
>PLN03059 beta-galactosidase; Provisional
Probab=99.76 E-value=3.5e-19 Score=186.91 Aligned_cols=62 Identities=27% Similarity=0.546 Sum_probs=57.1
Q ss_pred CCCCceeEEEEcCcccccccCCC--------------------------CCeeEEE-eccccccCCcceEEEEEEeeCCC
Q psy13819 1 MRRWSKGVLFANGRHWGRYWSEV--------------------------GPQYSLF-VPEEFLKVGTNRITIFELTRAPD 53 (323)
Q Consensus 1 ~~~~~kg~~~vng~~lGryw~~~--------------------------gpq~~ly-vP~~~l~~g~N~i~i~e~~~~~~ 53 (323)
|++||||+|||||+||||||+ + ||||+|| ||++|||+|+|+|||||+++ ++
T Consensus 640 m~gmGKG~aWVNG~nIGRYW~-~~a~~~gC~~c~y~g~~~~~kc~~~cggP~q~lYHVPr~~Lk~g~N~lViFEe~g-g~ 717 (840)
T PLN03059 640 MSSMGKGQIWINGQSIGRHWP-AYTAHGSCNGCNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLIVFEEWG-GN 717 (840)
T ss_pred cccCCCeeEEECCcccccccc-cccccCCCccccccccccchhhhccCCCceeEEEeCcHHHhccCCceEEEEEecC-CC
Confidence 789999999999999999995 5 9999999 99999999999999999998 67
Q ss_pred Cccccchhhhh
Q psy13819 54 KYDVDFVDKIS 64 (323)
Q Consensus 54 ~~~i~~~~~~i 64 (323)
+..|+|..+.+
T Consensus 718 p~~I~~~~~~~ 728 (840)
T PLN03059 718 PAGISLVKRTT 728 (840)
T ss_pred CCceEEEEeec
Confidence 88999877553
No 9
>PRK10150 beta-D-glucuronidase; Provisional
Probab=99.71 E-value=1.2e-15 Score=158.19 Aligned_cols=166 Identities=20% Similarity=0.118 Sum_probs=122.2
Q ss_pred cccccceEEEEecCCeEEeCCeeeEEeeeeecCC------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc
Q psy13819 67 RARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHYF------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY 140 (323)
Q Consensus 67 ~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf 140 (323)
...+.|++++ +++.|+|||||+++.|.+.|.. .++.+.++.|+++||++|+|+||+ .|-|.
T Consensus 271 ~~~GfR~i~~--~~~~f~lNG~pv~lrG~~~h~~~~~~G~a~~~~~~~~d~~l~K~~G~N~vR~-----sh~p~------ 337 (604)
T PRK10150 271 LRFGIRSVAV--KGGQFLINGKPFYFKGFGKHEDADIRGKGLDEVLNVHDHNLMKWIGANSFRT-----SHYPY------ 337 (604)
T ss_pred eeeEEEEEEE--eCCEEEECCEEEEEEeeeccCCCCccCCcCCHHHHHHHHHHHHHCCCCEEEe-----ccCCC------
Confidence 4566787777 6789999999999999999853 256788999999999999999999 57775
Q ss_pred cccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccc-------ccCCCeeecCCCHhHHHHHHHHHHHHHHHHH
Q psy13819 141 DGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLL-------KVAPDILLRQNHPVYQRYVTRWFQELFPRIQ 213 (323)
Q Consensus 141 ~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~-------~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~ 213 (323)
-++|+++|.|.||+|+.... ..|+..|.. ...+.......+|.+.+...+-+++++.+
T Consensus 338 -----~~~~~~~cD~~GllV~~E~p--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~mv~r-- 402 (604)
T PRK10150 338 -----SEEMLDLADRHGIVVIDETP--------AVGLNLSFGAGLEAGNKPKETYSEEAVNGETQQAHLQAIRELIAR-- 402 (604)
T ss_pred -----CHHHHHHHHhcCcEEEEecc--------cccccccccccccccccccccccccccchhHHHHHHHHHHHHHHh--
Confidence 36789999999999998762 112222221 10122122234678888888888888888
Q ss_pred hhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceEEEee
Q psy13819 214 KYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAVLYST 263 (323)
Q Consensus 214 ~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~ 263 (323)
..|||+||+|.+.||..........+.+.|.+.+++......+.+.+
T Consensus 403 ---~~NHPSIi~Ws~gNE~~~~~~~~~~~~~~l~~~~k~~DptR~vt~~~ 449 (604)
T PRK10150 403 ---DKNHPSVVMWSIANEPASREQGAREYFAPLAELTRKLDPTRPVTCVN 449 (604)
T ss_pred ---ccCCceEEEEeeccCCCccchhHHHHHHHHHHHHHhhCCCCceEEEe
Confidence 89999999999999976432234678888888888864333344443
No 10
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=99.58 E-value=1.1e-13 Score=150.93 Aligned_cols=197 Identities=17% Similarity=0.160 Sum_probs=132.1
Q ss_pred ccccccceEEEEecCCeEEeCCeeeEEeeeeecCC------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeec
Q psy13819 66 RRARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHYF------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYH 139 (323)
Q Consensus 66 ~~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fd 139 (323)
....+.|++++ +++.|+|||||++|.|++.|.. .++++.+++||++||++|+|+||+ .|-|.
T Consensus 312 ~~~~GfR~iei--~~~~f~lNGkpi~lrGvnrh~~~p~~G~a~~~e~~~~dl~lmK~~g~NavR~-----sHyP~----- 379 (1021)
T PRK10340 312 PQRVGFRDIKV--RDGLFWINNRYVKLHGVNRHDNDHRKGRAVGMDRVEKDIQLMKQHNINSVRT-----AHYPN----- 379 (1021)
T ss_pred EeeeEEEEEEE--ECCEEEECCEEEEEEEeecCCCCcccCccCCHHHHHHHHHHHHHCCCCEEEe-----cCCCC-----
Confidence 35677888887 5789999999999999998843 257899999999999999999999 57776
Q ss_pred ccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccC
Q psy13819 140 YDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGN 219 (323)
Q Consensus 140 f~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n 219 (323)
-.+|+++|.|.||+|+-.. |..|. |++ . . .+...-.++|.+.++..+-+++++.+ .+|
T Consensus 380 ------~~~fydlcDe~GllV~dE~-~~e~~-----g~~--~-~--~~~~~~~~~p~~~~~~~~~~~~mV~R-----drN 437 (1021)
T PRK10340 380 ------DPRFYELCDIYGLFVMAET-DVESH-----GFA--N-V--GDISRITDDPQWEKVYVDRIVRHIHA-----QKN 437 (1021)
T ss_pred ------CHHHHHHHHHCCCEEEECC-ccccc-----Ccc--c-c--cccccccCCHHHHHHHHHHHHHHHHh-----CCC
Confidence 4688999999999999876 33222 221 0 0 01111235777877777777888888 899
Q ss_pred CCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceEEEeeCCCCccccccccCceEEeecCCCCCchhHHHHHHHh
Q psy13819 220 DRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAYLRCTVDGVYSTVDFTVFKDVNVSFQAQRT 299 (323)
Q Consensus 220 ~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~~~~~~~g~~~~~nfg~~~~~~~~~~~~r~ 299 (323)
||+||+|-+.||.+.. . ..+.+.+.+++......+.|..|... ..-++++ ...+ ..+ ......+
T Consensus 438 HPSIi~WslGNE~~~g----~-~~~~~~~~~k~~DptR~v~~~~~~~~------~~~Dv~~-~~Y~---~~~-~~~~~~~ 501 (1021)
T PRK10340 438 HPSIIIWSLGNESGYG----C-NIRAMYHAAKALDDTRLVHYEEDRDA------EVVDVIS-TMYT---RVE-LMNEFGE 501 (1021)
T ss_pred CCEEEEEECccCcccc----H-HHHHHHHHHHHhCCCceEEeCCCcCc------cccceec-cccC---CHH-HHHHHHh
Confidence 9999999999998641 2 23567777777544333333222111 1123332 1221 111 1222333
Q ss_pred hCCCCCeEEEEec
Q psy13819 300 RAPQGPLVNAEFE 312 (323)
Q Consensus 300 ~~p~~Plm~~E~~ 312 (323)
..+++|++.+||=
T Consensus 502 ~~~~kP~i~~Ey~ 514 (1021)
T PRK10340 502 YPHPKPRILCEYA 514 (1021)
T ss_pred CCCCCcEEEEchH
Confidence 3457899999984
No 11
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=99.57 E-value=1.9e-13 Score=149.13 Aligned_cols=155 Identities=18% Similarity=0.208 Sum_probs=115.9
Q ss_pred cccccceEEEEecCCeEEeCCeeeEEeeeeecCC------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc
Q psy13819 67 RARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHYF------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY 140 (323)
Q Consensus 67 ~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf 140 (323)
...+.|++++ +++.|+|||||+.|.|.+.|.. +++++.+++||++||++|+|+||+ .|-|.
T Consensus 329 ~~~GfR~iei--~~~~f~LNGkpi~lrGvn~h~~~p~~G~a~t~e~~~~di~lmK~~g~NaVR~-----sHyP~------ 395 (1027)
T PRK09525 329 YDVGFRKVEI--ENGLLKLNGKPLLIRGVNRHEHHPEHGQVMDEETMVQDILLMKQHNFNAVRC-----SHYPN------ 395 (1027)
T ss_pred eeEEEEEEEE--ECCEEEECCEEEEEEEeEccccCcccCccCCHHHHHHHHHHHHHCCCCEEEe-----cCCCC------
Confidence 3567787777 6789999999999999999942 467899999999999999999999 57776
Q ss_pred cccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCC
Q psy13819 141 DGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGND 220 (323)
Q Consensus 141 ~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~ 220 (323)
-.+|+++|.|.||+|+-... . | .+|-.|. . . -.++|.|.+++.+-+++++.+ .+||
T Consensus 396 -----~p~fydlcDe~GilV~dE~~-~---e-~hg~~~~---~-~-----~~~dp~~~~~~~~~~~~mV~R-----drNH 451 (1027)
T PRK09525 396 -----HPLWYELCDRYGLYVVDEAN-I---E-THGMVPM---N-R-----LSDDPRWLPAMSERVTRMVQR-----DRNH 451 (1027)
T ss_pred -----CHHHHHHHHHcCCEEEEecC-c---c-ccCCccc---c-C-----CCCCHHHHHHHHHHHHHHHHh-----CCCC
Confidence 46889999999999998862 1 1 1111121 1 0 135688888888878888888 8999
Q ss_pred CcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceEEEee
Q psy13819 221 RPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAVLYST 263 (323)
Q Consensus 221 g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~ 263 (323)
|+||+|-+.||.+.. ...+.+.+.+++......+.|..
T Consensus 452 PSIi~WSlgNE~~~g-----~~~~~l~~~~k~~DptRpV~y~~ 489 (1027)
T PRK09525 452 PSIIIWSLGNESGHG-----ANHDALYRWIKSNDPSRPVQYEG 489 (1027)
T ss_pred CEEEEEeCccCCCcC-----hhHHHHHHHHHhhCCCCcEEECC
Confidence 999999999998642 22345666666654444455554
No 12
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=99.50 E-value=7.1e-13 Score=122.60 Aligned_cols=156 Identities=18% Similarity=0.305 Sum_probs=113.6
Q ss_pred eCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcc-cCCC-eecccccchHHHHHHHHHHCCCeEEE
Q psy13819 85 LNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHE-VHPG-HYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 85 ~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hE-P~~G-~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
.+|+++.+.|.+.|+.. +..-+++++.||++|+|+||+.|.|...+ |.|+ .++-+..+.|+++|+.|+++||+|||
T Consensus 3 ~~G~~v~~~G~n~~w~~--~~~~~~~~~~~~~~G~n~VRi~v~~~~~~~~~~~~~~~~~~~~~ld~~v~~a~~~gi~vil 80 (281)
T PF00150_consen 3 QNGKPVNWRGFNTHWYN--PSITEADFDQLKALGFNTVRIPVGWEAYQEPNPGYNYDETYLARLDRIVDAAQAYGIYVIL 80 (281)
T ss_dssp TTSEBEEEEEEEETTSG--GGSHHHHHHHHHHTTESEEEEEEESTSTSTTSTTTSBTHHHHHHHHHHHHHHHHTT-EEEE
T ss_pred CCCCeEEeeeeecccCC--CCCHHHHHHHHHHCCCCEEEeCCCHHHhcCCCCCccccHHHHHHHHHHHHHHHhCCCeEEE
Confidence 47999999999999321 22789999999999999999999996655 6776 48888899999999999999999999
Q ss_pred ccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCC----
Q psy13819 163 RPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAEC---- 238 (323)
Q Consensus 163 r~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~---- 238 (323)
.+. ..|.|...... -...+...+...++++.++++ |++.++|++++|-||+......
T Consensus 81 d~h----------~~~~w~~~~~~----~~~~~~~~~~~~~~~~~la~~-----y~~~~~v~~~el~NEP~~~~~~~~w~ 141 (281)
T PF00150_consen 81 DLH----------NAPGWANGGDG----YGNNDTAQAWFKSFWRALAKR-----YKDNPPVVGWELWNEPNGGNDDANWN 141 (281)
T ss_dssp EEE----------ESTTCSSSTST----TTTHHHHHHHHHHHHHHHHHH-----HTTTTTTEEEESSSSGCSTTSTTTTS
T ss_pred Eec----------cCccccccccc----cccchhhHHHHHhhhhhhccc-----cCCCCcEEEEEecCCccccCCccccc
Confidence 872 23777433111 012333455566677777777 7778899999999999875321
Q ss_pred -----h-HHHHHHHHHHHHHhcCCceEEEe
Q psy13819 239 -----D-PAHAVWLRDLLRTYVQDKAVLYS 262 (323)
Q Consensus 239 -----~-~~Y~~~l~~~~~~~~g~~~~l~t 262 (323)
. .++.+.+.+.+|+. +.+.+++.
T Consensus 142 ~~~~~~~~~~~~~~~~~Ir~~-~~~~~i~~ 170 (281)
T PF00150_consen 142 AQNPADWQDWYQRAIDAIRAA-DPNHLIIV 170 (281)
T ss_dssp HHHTHHHHHHHHHHHHHHHHT-TSSSEEEE
T ss_pred cccchhhhhHHHHHHHHHHhc-CCcceeec
Confidence 1 45666667777774 55444433
No 13
>COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]
Probab=99.44 E-value=1.5e-12 Score=138.16 Aligned_cols=126 Identities=22% Similarity=0.269 Sum_probs=105.2
Q ss_pred ccccccceEEEEecCCeEEeCCeeeEEeeeeecCC-----CCC-hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeec
Q psy13819 66 RRARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHYF-----RAP-PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYH 139 (323)
Q Consensus 66 ~~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~-----r~p-~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fd 139 (323)
....+-|+++| .++.|.|||||+++.|.+.|.+ +.. .+..++||++||++|+|+|||. |-|+
T Consensus 278 ~~~iGfR~iei--~~~~~~iNGkpvf~kGvnrHe~~~~~G~~~~~~~~~~dl~lmk~~n~N~vRts-----HyP~----- 345 (808)
T COG3250 278 ALRIGFRTVEI--KDGLLLINGKPVFIRGVNRHEDDPILGRVTDEDAMERDLKLMKEANMNSVRTS-----HYPN----- 345 (808)
T ss_pred EeeeccEEEEE--ECCeEEECCeEEEEeeeecccCCCccccccCHHHHHHHHHHHHHcCCCEEEec-----CCCC-----
Confidence 35567788777 5679999999999999999975 333 4559999999999999999994 9998
Q ss_pred ccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccC
Q psy13819 140 YDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGN 219 (323)
Q Consensus 140 f~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n 219 (323)
=..|+++|.+.||+||-.+ ++ +| .|.| +++.+++.+..=+++++++ .+|
T Consensus 346 ------~~~~ydLcDelGllV~~Ea-~~---~~--~~~~--------------~~~~~~k~~~~~i~~mver-----~kn 394 (808)
T COG3250 346 ------SEEFYDLCDELGLLVIDEA-MI---ET--HGMP--------------DDPEWRKEVSEEVRRMVER-----DRN 394 (808)
T ss_pred ------CHHHHHHHHHhCcEEEEec-ch---hh--cCCC--------------CCcchhHHHHHHHHHHHHh-----ccC
Confidence 5789999999999999987 11 11 1222 5788999999999999999 889
Q ss_pred CCcEEEEccCCCcCC
Q psy13819 220 DRPIILVQVENEYGS 234 (323)
Q Consensus 220 ~g~Vi~~QieNE~g~ 234 (323)
||+||+|-+.||.|.
T Consensus 395 HPSIiiWs~gNE~~~ 409 (808)
T COG3250 395 HPSIIIWSLGNESGH 409 (808)
T ss_pred CCcEEEEeccccccC
Confidence 999999999999764
No 14
>TIGR03356 BGL beta-galactosidase.
Probab=99.41 E-value=3.9e-13 Score=134.31 Aligned_cols=119 Identities=15% Similarity=0.145 Sum_probs=99.4
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
++-|| ..|++|+++||++|+|++|+.|.|++++|+ +|++|.++.+.++++|+.|.++||.+|+.. .
T Consensus 49 a~d~y-----~~y~eDi~l~~~~G~~~~R~si~Wsri~p~g~~~~n~~~~~~y~~~i~~l~~~gi~pivtL--------~ 115 (427)
T TIGR03356 49 ACDHY-----HRYEEDVALMKELGVDAYRFSIAWPRIFPEGTGPVNPKGLDFYDRLVDELLEAGIEPFVTL--------Y 115 (427)
T ss_pred cccHH-----HhHHHHHHHHHHcCCCeEEcccchhhcccCCCCCcCHHHHHHHHHHHHHHHHcCCeeEEee--------c
Confidence 45566 678999999999999999999999999999 799999999999999999999999999999 5
Q ss_pred CCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEc
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQ 227 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Q 227 (323)
|..+|.||.++ .++.-|..-..|.++++..++++..+++.+..-|+++++++.
T Consensus 116 Hfd~P~~l~~~-gGw~~~~~~~~f~~ya~~~~~~~~d~v~~w~t~NEp~~~~~~ 168 (427)
T TIGR03356 116 HWDLPQALEDR-GGWLNRDTAEWFAEYAAVVAERLGDRVKHWITLNEPWCSAFL 168 (427)
T ss_pred cCCccHHHHhc-CCCCChHHHHHHHHHHHHHHHHhCCcCCEEEEecCcceeccc
Confidence 78899999874 676655555666777777676666666667777888877654
No 15
>PLN02998 beta-glucosidase
Probab=99.38 E-value=3e-13 Score=137.33 Aligned_cols=115 Identities=13% Similarity=0.153 Sum_probs=107.0
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
..|++|+++||++|+|+.|+.|.|++++|. .|.+|-+|.+.++++|+.+.|+||.++|.. .|..+|.||..
T Consensus 82 hry~EDi~lmk~lG~~~YRfSIsWsRI~P~G~g~vN~~gl~~Y~~lid~L~~~GIeP~VTL--------~H~dlP~~L~~ 153 (497)
T PLN02998 82 HKYKEDVKLMADMGLEAYRFSISWSRLLPSGRGPINPKGLQYYNNLIDELITHGIQPHVTL--------HHFDLPQALED 153 (497)
T ss_pred HhhHHHHHHHHHcCCCeEEeeccHHhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCceEEEe--------cCCCCCHHHHH
Confidence 678999999999999999999999999997 678999999999999999999999999999 68899999988
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEc
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQ 227 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Q 227 (323)
++.++.-|..-..|.++++..++++..+|+.+..-|+|++++..
T Consensus 154 ~yGGW~n~~~v~~F~~YA~~~~~~fgdrVk~WiT~NEP~~~~~~ 197 (497)
T PLN02998 154 EYGGWLSQEIVRDFTAYADTCFKEFGDRVSHWTTINEVNVFALG 197 (497)
T ss_pred hhCCcCCchHHHHHHHHHHHHHHHhcCcCCEEEEccCcchhhhc
Confidence 66899988888999999999999999999999999999987653
No 16
>PLN02814 beta-glucosidase
Probab=99.37 E-value=3.2e-13 Score=137.29 Aligned_cols=120 Identities=13% Similarity=0.117 Sum_probs=109.7
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
++-|| ..|++|+++||++|+|+.|+.|.|++++|. +|.+|-+|.+.++++|+.|.++||.++|.. .
T Consensus 72 a~D~Y-----hry~EDI~L~k~lG~~ayRfSIsWsRI~P~G~g~~N~~Gl~fY~~lId~l~~~GI~P~VTL--------~ 138 (504)
T PLN02814 72 ASDGY-----HKYKEDVKLMAEMGLESFRFSISWSRLIPNGRGLINPKGLLFYKNLIKELRSHGIEPHVTL--------Y 138 (504)
T ss_pred cccHH-----HhhHHHHHHHHHcCCCEEEEeccHhhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCceEEEe--------c
Confidence 44555 678999999999999999999999999997 688999999999999999999999999999 6
Q ss_pred CCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEc
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQ 227 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Q 227 (323)
|..+|.||.+++.++.-|..-..|.++++..++++..+|+.+..-|+|++++..
T Consensus 139 H~dlP~~L~~~yGGW~n~~~i~~F~~YA~~~f~~fgdrVk~WiT~NEP~~~~~~ 192 (504)
T PLN02814 139 HYDLPQSLEDEYGGWINRKIIEDFTAFADVCFREFGEDVKLWTTINEATIFAIG 192 (504)
T ss_pred CCCCCHHHHHhcCCcCChhHHHHHHHHHHHHHHHhCCcCCEEEeccccchhhhc
Confidence 889999999867899999888999999999999999999999999999977653
No 17
>PRK09593 arb 6-phospho-beta-glucosidase; Reviewed
Probab=99.36 E-value=4.9e-13 Score=135.31 Aligned_cols=118 Identities=14% Similarity=0.089 Sum_probs=108.2
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR 172 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw 172 (323)
++-|| ..|++|+++||++|+|+.|+.|+|++++|. +|++|-+|++.++++|+.|.++||.++|.+
T Consensus 68 a~d~Y-----hry~eDi~Lm~~lG~~aYRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lId~L~~~GI~P~VTL-------- 134 (478)
T PRK09593 68 AIDMY-----HHYKEDIALFAEMGFKTYRMSIAWTRIFPKGDELEPNEAGLQFYEDIFKECHKYGIEPLVTI-------- 134 (478)
T ss_pred ccchH-----HhhHHHHHHHHHcCCCEEEEecchhhcccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe--------
Confidence 55566 678999999999999999999999999998 667999999999999999999999999999
Q ss_pred CCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEE
Q psy13819 173 DFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIIL 225 (323)
Q Consensus 173 ~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~ 225 (323)
.|..+|.||..++.++.-|..-..|.++++..++++..+|+.+..-|++++++
T Consensus 135 ~H~dlP~~L~~~~GGW~n~~~v~~F~~YA~~~~~~fgdrVk~WiT~NEP~~~~ 187 (478)
T PRK09593 135 THFDCPMHLIEEYGGWRNRKMVGFYERLCRTLFTRYKGLVKYWLTFNEINMIL 187 (478)
T ss_pred cccCCCHHHHhhcCCCCChHHHHHHHHHHHHHHHHhcCcCCEEEeecchhhhh
Confidence 68899999987668898888888999999999999999999999999998765
No 18
>PRK09589 celA 6-phospho-beta-glucosidase; Reviewed
Probab=99.36 E-value=4.2e-13 Score=135.73 Aligned_cols=118 Identities=10% Similarity=0.097 Sum_probs=108.3
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR 172 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw 172 (323)
++-|| ..+++|+++||++|+|+.|+.|.|++++|. +|++|-+|++.++++|+.|.++||.++|.+
T Consensus 62 a~D~Y-----hry~eDi~Lm~~lG~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~L~~~GI~P~VTL-------- 128 (476)
T PRK09589 62 AIDFY-----HRYKEDIALFAEMGFKCFRTSIAWTRIFPQGDELEPNEEGLQFYDDLFDECLKQGIEPVVTL-------- 128 (476)
T ss_pred cccHH-----HhhHHHHHHHHHcCCCEEEeccchhhcCcCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe--------
Confidence 45566 678999999999999999999999999998 567999999999999999999999999999
Q ss_pred CCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEE
Q psy13819 173 DFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIIL 225 (323)
Q Consensus 173 ~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~ 225 (323)
.|..+|.||..++.++.-|..-..|.++++..++++..+|+.+..-|++++++
T Consensus 129 ~H~dlP~~L~~~yGGW~n~~~i~~F~~YA~~~f~~fgdrVk~WiT~NEp~~~~ 181 (476)
T PRK09589 129 SHFEMPYHLVTEYGGWRNRKLIDFFVRFAEVVFTRYKDKVKYWMTFNEINNQA 181 (476)
T ss_pred cCCCCCHHHHHhcCCcCChHHHHHHHHHHHHHHHHhcCCCCEEEEecchhhhh
Confidence 68899999988678999888889999999999999999999999999998764
No 19
>PRK15014 6-phospho-beta-glucosidase BglA; Provisional
Probab=99.34 E-value=2e-12 Score=130.87 Aligned_cols=117 Identities=10% Similarity=0.150 Sum_probs=106.3
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR 172 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw 172 (323)
++-|| ..|++|+++||++|+|+.|+.|.|++++|. +|++|-+|.+.++++|+.|.++||.++|.+
T Consensus 64 A~D~Y-----hry~EDI~Lm~elG~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~l~~~GI~P~vTL-------- 130 (477)
T PRK15014 64 AVDFY-----GHYKEDIKLFAEMGFKCFRTSIAWTRIFPKGDEAQPNEEGLKFYDDMFDELLKYNIEPVITL-------- 130 (477)
T ss_pred ccCcc-----cccHHHHHHHHHcCCCEEEecccceeeccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe--------
Confidence 56677 567999999999999999999999999998 577999999999999999999999999999
Q ss_pred CCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEE
Q psy13819 173 DFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPII 224 (323)
Q Consensus 173 ~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi 224 (323)
.|..+|.||..++.++.-|..-..|.++++..++++..+|+.+..-|+++++
T Consensus 131 ~H~dlP~~L~~~yGGW~n~~~~~~F~~Ya~~~f~~fgdrVk~WiT~NEp~~~ 182 (477)
T PRK15014 131 SHFEMPLHLVQQYGSWTNRKVVDFFVRFAEVVFERYKHKVKYWMTFNEINNQ 182 (477)
T ss_pred eCCCCCHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcCCEEEEecCcccc
Confidence 6899999998866888888778889999999999999999999889999875
No 20
>PLN02849 beta-glucosidase
Probab=99.34 E-value=5.6e-13 Score=135.52 Aligned_cols=115 Identities=16% Similarity=0.175 Sum_probs=106.7
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCC-CeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHP-GHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~-G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
..+++|+++||++|+|+.|+.|.|++++|.. |.+|-+|.+.++++|+.|.++||.++|.. .|..+|.||.+
T Consensus 79 hrY~eDI~Lm~~lG~~aYRfSIsWsRI~P~G~g~vN~~gl~fY~~lid~l~~~GI~P~VTL--------~H~dlP~~L~~ 150 (503)
T PLN02849 79 HKYKEDVKLMVETGLDAFRFSISWSRLIPNGRGSVNPKGLQFYKNFIQELVKHGIEPHVTL--------FHYDHPQYLED 150 (503)
T ss_pred HhHHHHHHHHHHcCCCeEEEeccHHhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCeEEEee--------cCCCCcHHHHH
Confidence 6789999999999999999999999999974 78999999999999999999999999999 68899999988
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEc
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQ 227 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Q 227 (323)
++.++.-|..-..|.++++..++++..+|+.+..-|+|++++..
T Consensus 151 ~yGGW~nr~~v~~F~~YA~~~f~~fgDrVk~WiT~NEP~~~~~~ 194 (503)
T PLN02849 151 DYGGWINRRIIKDFTAYADVCFREFGNHVKFWTTINEANIFTIG 194 (503)
T ss_pred hcCCcCCchHHHHHHHHHHHHHHHhcCcCCEEEEecchhhhhhc
Confidence 66899989888999999999999999999999999999987553
No 21
>PRK13511 6-phospho-beta-galactosidase; Provisional
Probab=99.33 E-value=2.1e-12 Score=130.47 Aligned_cols=117 Identities=17% Similarity=0.150 Sum_probs=106.1
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
++-|| ..+++|+++||++|+|+.|+.|+|++++|. .|.+|-+|.+.++++|+.|.++||.++|.+ .
T Consensus 49 a~d~Y-----~ry~eDi~L~~~lG~~~yRfSIsWsRI~P~G~g~vN~~gl~~Y~~lid~l~~~GI~P~VTL--------~ 115 (469)
T PRK13511 49 ASDFY-----HRYPEDLKLAEEFGVNGIRISIAWSRIFPDGYGEVNPKGVEYYHRLFAECHKRHVEPFVTL--------H 115 (469)
T ss_pred ccchh-----hhhHHHHHHHHHhCCCEEEeeccHhhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEe--------c
Confidence 55666 567999999999999999999999999997 578999999999999999999999999999 6
Q ss_pred CCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEE
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILV 226 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~ 226 (323)
|..+|.||.+ ..++.-|..-..|.++++..++++.. |+.+..-|+|++++.
T Consensus 116 H~dlP~~L~~-~GGW~n~~~v~~F~~YA~~~~~~fgd-Vk~W~T~NEP~~~~~ 166 (469)
T PRK13511 116 HFDTPEALHS-NGDWLNRENIDHFVRYAEFCFEEFPE-VKYWTTFNEIGPIGD 166 (469)
T ss_pred CCCCcHHHHH-cCCCCCHHHHHHHHHHHHHHHHHhCC-CCEEEEccchhhhhh
Confidence 8899999997 47888888888999999999999999 999988999987753
No 22
>TIGR01233 lacG 6-phospho-beta-galactosidase. This enzyme is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation.
Probab=99.31 E-value=3.5e-12 Score=128.75 Aligned_cols=117 Identities=15% Similarity=0.130 Sum_probs=105.5
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
++-|| ..+++|+++||++|+|+.|+.|+|++++|. +|.+|-+|.+.++++|+.|.++||.++|.. .
T Consensus 48 a~d~y-----hry~eDi~L~~~lG~~~yRfSIsWsRI~P~g~~~~N~~gl~~Y~~lid~l~~~GI~P~VTL--------~ 114 (467)
T TIGR01233 48 ASDFY-----HKYPVDLELAEEYGVNGIRISIAWSRIFPTGYGEVNEKGVEFYHKLFAECHKRHVEPFVTL--------H 114 (467)
T ss_pred cCchh-----hhHHHHHHHHHHcCCCEEEEecchhhccCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEec--------c
Confidence 56676 668999999999999999999999999997 688999999999999999999999999999 6
Q ss_pred CCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEE
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILV 226 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~ 226 (323)
|..+|.||.+ ..++..|..-..|.++++..++++.. |+.+..-|+|++++.
T Consensus 115 H~dlP~~L~~-~GGW~n~~~v~~F~~YA~~~f~~fgd-Vk~WiT~NEP~~~~~ 165 (467)
T TIGR01233 115 HFDTPEALHS-NGDFLNRENIEHFIDYAAFCFEEFPE-VNYWTTFNEIGPIGD 165 (467)
T ss_pred CCCCcHHHHH-cCCCCCHHHHHHHHHHHHHHHHHhCC-CCEEEEecchhhhhh
Confidence 8899999987 48888888888999999999999986 888888899887654
No 23
>PF00232 Glyco_hydro_1: Glycosyl hydrolase family 1; InterPro: IPR001360 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 1 GH1 from CAZY comprises enzymes with a number of known activities; beta-glucosidase (3.2.1.21 from EC); beta-galactosidase (3.2.1.23 from EC); 6-phospho-beta-galactosidase (3.2.1.85 from EC); 6-phospho-beta-glucosidase (3.2.1.86 from EC); lactase-phlorizin hydrolase (3.2.1.62 from EC), (3.2.1.108 from EC); beta-mannosidase (3.2.1.25 from EC); myrosinase (3.2.1.147 from EC). ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1QVB_A 3AHY_D 2E9L_A 2ZOX_A 2JFE_X 2E9M_A 3FIZ_A 3FIY_A 3CMJ_A 3FJ0_A ....
Probab=99.29 E-value=4.2e-13 Score=134.97 Aligned_cols=126 Identities=18% Similarity=0.257 Sum_probs=99.1
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR 172 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw 172 (323)
++-|| ..|++||++||++|+|+.|+.|+|++++|. +|++|-++.+.++++|+.+.++||.+||.+
T Consensus 53 a~d~y-----~~y~eDi~l~~~lg~~~yRfsi~W~Ri~P~g~~g~~n~~~~~~Y~~~i~~l~~~gi~P~vtL-------- 119 (455)
T PF00232_consen 53 ACDHY-----HRYKEDIALMKELGVNAYRFSISWSRIFPDGFEGKVNEEGLDFYRDLIDELLENGIEPIVTL-------- 119 (455)
T ss_dssp TTGHH-----HHHHHHHHHHHHHT-SEEEEE--HHHHSTTSSSSSS-HHHHHHHHHHHHHHHHTT-EEEEEE--------
T ss_pred cccch-----hhhhHHHHHHHhhccceeeeecchhheeecccccccCHhHhhhhHHHHHHHHhhccceeeee--------
Confidence 45565 789999999999999999999999999999 699999999999999999999999999999
Q ss_pred CCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCC
Q psy13819 173 DFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSD 235 (323)
Q Consensus 173 ~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~ 235 (323)
.|..+|.||.+ +.++..|..-..|.++++..++++..+|+-+..-|++++.+.+ ..-.|.+
T Consensus 120 ~H~~~P~~l~~-~ggw~~~~~~~~F~~Ya~~~~~~~gd~V~~w~T~NEp~~~~~~-~y~~g~~ 180 (455)
T PF00232_consen 120 YHFDLPLWLED-YGGWLNRETVDWFARYAEFVFERFGDRVKYWITFNEPNVFALL-GYLYGGF 180 (455)
T ss_dssp ESS--BHHHHH-HTGGGSTHHHHHHHHHHHHHHHHHTTTBSEEEEEETHHHHHHH-HHTSSSS
T ss_pred eecccccceee-cccccCHHHHHHHHHHHHHHHHHhCCCcceEEeccccceeecc-ccccccc
Confidence 68999999998 6787777677777777777777777777777777787776554 3333433
No 24
>PRK09852 cryptic 6-phospho-beta-glucosidase; Provisional
Probab=99.29 E-value=1.9e-12 Score=130.84 Aligned_cols=119 Identities=15% Similarity=0.114 Sum_probs=107.4
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR 172 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw 172 (323)
++-|| ..+++|+++||++|+|+.|+.+.|++++|. ++++|-++.+.++++|+.|.++||.++|.+
T Consensus 66 A~D~Y-----hry~eDi~l~~~lG~~~yR~si~WsRi~P~g~~~~~n~~~~~~Y~~~i~~l~~~gi~p~VtL-------- 132 (474)
T PRK09852 66 AIDFY-----HRYKEDIALMAEMGFKVFRTSIAWSRLFPQGDELTPNQQGIAFYRSVFEECKKYGIEPLVTL-------- 132 (474)
T ss_pred cCchh-----hhhHHHHHHHHHcCCCeEEeeceeeeeeeCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe--------
Confidence 55566 668999999999999999999999999998 567899999999999999999999999999
Q ss_pred CCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEE
Q psy13819 173 DFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILV 226 (323)
Q Consensus 173 ~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~ 226 (323)
.|..+|.||..++.++.-|..-..|.++++..++++..+|+.++.-|++++++.
T Consensus 133 ~H~~~P~~l~~~~GGW~~~~~~~~F~~ya~~~~~~fgd~Vk~WiTfNEPn~~~~ 186 (474)
T PRK09852 133 CHFDVPMHLVTEYGSWRNRKMVEFFSRYARTCFEAFDGLVKYWLTFNEINIMLH 186 (474)
T ss_pred eCCCCCHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhcCcCCeEEeecchhhhhc
Confidence 689999999876688888888889999999999999999999999999997754
No 25
>COG2723 BglB Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]
Probab=99.23 E-value=1.4e-11 Score=122.69 Aligned_cols=121 Identities=17% Similarity=0.254 Sum_probs=107.5
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCC--eecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPG--HYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLL 182 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G--~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~ 182 (323)
..+++|+++||+||+|+.||.|.|+++-|.++ +.|-.|++.++++++.|.++||..++.+ .|..+|.||.
T Consensus 59 hrYkeDi~L~~emG~~~~R~SI~WsRIfP~g~~~e~N~~gl~fY~~l~del~~~gIep~vTL--------~Hfd~P~~L~ 130 (460)
T COG2723 59 HRYKEDIALAKEMGLNAFRTSIEWSRIFPNGDGGEVNEKGLRFYDRLFDELKARGIEPFVTL--------YHFDLPLWLQ 130 (460)
T ss_pred hhhHHHHHHHHHcCCCEEEeeeeEEEeecCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEe--------cccCCcHHHh
Confidence 56799999999999999999999999999764 5999999999999999999999999999 6889999999
Q ss_pred ccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC
Q psy13819 183 KVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS 234 (323)
Q Consensus 183 ~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~ 234 (323)
+++.++..|..-..|.++++..++++..+|+.+.+-|+++|+.. ...-++.
T Consensus 131 ~~ygGW~nR~~i~~F~~ya~~vf~~f~dkVk~W~TFNE~n~~~~-~~y~~~~ 181 (460)
T COG2723 131 KPYGGWENRETVDAFARYAATVFERFGDKVKYWFTFNEPNVVVE-LGYLYGG 181 (460)
T ss_pred hccCCccCHHHHHHHHHHHHHHHHHhcCcceEEEEecchhhhhc-ccccccc
Confidence 97679999988899999999999999999999988899887655 2344443
No 26
>PF03198 Glyco_hydro_72: Glucanosyltransferase; InterPro: IPR004886 This family is a group of yeast glycolipid proteins anchored to the membrane. It includes Candida albicans (Yeast) pH-regulated protein, which is required for apical growth and plays a role in morphogenesis and Saccharomyces cerevisiae glycolipid anchored surface protein.; PDB: 2W61_A 2W62_A 2W63_A.
Probab=98.85 E-value=1.5e-07 Score=89.99 Aligned_cols=201 Identities=13% Similarity=0.156 Sum_probs=97.1
Q ss_pred ceEEEEecCCeEE--eCCeeeEEeeeeecCCCC-----------ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCee
Q psy13819 72 RTFAIDLAGDTFR--LNEDPFQFVSGSFHYFRA-----------PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHY 138 (323)
Q Consensus 72 ~~~~v~~~~~~~~--~dGk~~~i~~g~~Hy~r~-----------p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~f 138 (323)
.+++| .++.|. .+|++|+|.|....+--. .++.|++|+..||++|+||||+| .+.|.
T Consensus 9 ~pI~i--kG~kff~~~~g~~F~ikGVaYQp~~~~~~~~~~DPLad~~~C~rDi~~l~~LgiNtIRVY----~vdp~---- 78 (314)
T PF03198_consen 9 PPIEI--KGNKFFYSKNGTRFFIKGVAYQPGGSSEPSNYIDPLADPEACKRDIPLLKELGINTIRVY----SVDPS---- 78 (314)
T ss_dssp --EEE--ETTEEEETTT--B--EEEEE----------SS--GGG-HHHHHHHHHHHHHHT-SEEEES-------TT----
T ss_pred CCEEE--ECCEeEECCCCCEEEEeeEEcccCCCCCCccCcCcccCHHHHHHhHHHHHHcCCCEEEEE----EeCCC----
Confidence 34444 788888 899999999987765322 26789999999999999999997 44554
Q ss_pred cccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhcc
Q psy13819 139 HYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYG 218 (323)
Q Consensus 139 df~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~ 218 (323)
.|.+.+++++++.|||||+..+.- ..=+.+..| ..+-+..++++.... ++. +.
T Consensus 79 -----~nHd~CM~~~~~aGIYvi~Dl~~p----------~~sI~r~~P---~~sw~~~l~~~~~~v----id~-----fa 131 (314)
T PF03198_consen 79 -----KNHDECMSAFADAGIYVILDLNTP----------NGSINRSDP---APSWNTDLLDRYFAV----IDA-----FA 131 (314)
T ss_dssp -----S--HHHHHHHHHTT-EEEEES-BT----------TBS--TTS---------HHHHHHHHHH----HHH-----HT
T ss_pred -----CCHHHHHHHHHhCCCEEEEecCCC----------CccccCCCC---cCCCCHHHHHHHHHH----HHH-----hc
Confidence 467899999999999999988421 001111111 011223333333333 333 34
Q ss_pred CCCcEEEEccCCCcCCCC--CCh----HHHHHHHHHHHHHhcCC---ceEEEeeCCCC-----ccccccccC----ceEE
Q psy13819 219 NDRPIILVQVENEYGSDA--ECD----PAHAVWLRDLLRTYVQD---KAVLYSTDGAF-----DAYLRCTVD----GVYS 280 (323)
Q Consensus 219 n~g~Vi~~QieNE~g~~~--~~~----~~Y~~~l~~~~~~~~g~---~~~l~t~dg~~-----~~~~~~~~~----g~~~ 280 (323)
+.++++++-+.||.-.-. ... ++..|.+|+=.++. +. +|=.-++|... ..++.|.-+ +++
T Consensus 132 ~Y~N~LgFf~GNEVin~~~~t~aap~vKAavRD~K~Yi~~~-~~R~IPVGYsaaD~~~~r~~~a~Yl~Cg~~~~~iDf~- 209 (314)
T PF03198_consen 132 KYDNTLGFFAGNEVINDASNTNAAPYVKAAVRDMKAYIKSK-GYRSIPVGYSAADDAEIRQDLANYLNCGDDDERIDFF- 209 (314)
T ss_dssp T-TTEEEEEEEESSS-STT-GGGHHHHHHHHHHHHHHHHHS-SS----EEEEE---TTTHHHHHHHTTBTT-----S-E-
T ss_pred cCCceEEEEecceeecCCCCcccHHHHHHHHHHHHHHHHhc-CCCCCceeEEccCChhHHHHHHHHhcCCCccccccee-
Confidence 458999999999986432 112 44444444444442 33 33333446553 134667222 122
Q ss_pred eecCCC---CCchhH-HHHHH-HhhCC-CCCeEEEEe
Q psy13819 281 TVDFTV---FKDVNV-SFQAQ-RTRAP-QGPLVNAEF 311 (323)
Q Consensus 281 ~~nfg~---~~~~~~-~~~~~-r~~~p-~~Plm~~E~ 311 (323)
.+|-.. .+.... -++.+ +.+.. .-|++-+||
T Consensus 210 g~N~Y~WCg~Stf~~SGy~~l~~~f~~y~vPvffSEy 246 (314)
T PF03198_consen 210 GLNSYEWCGDSTFETSGYDRLTKEFSNYSVPVFFSEY 246 (314)
T ss_dssp EEEE----SS--HHHHSHHHHHHHHTT-SS-EEEEEE
T ss_pred eeccceecCCCccccccHHHHHHHhhCCCCCeEEccc
Confidence 122111 122222 13333 34432 679999998
No 27
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=98.81 E-value=6.1e-08 Score=90.39 Aligned_cols=161 Identities=16% Similarity=0.201 Sum_probs=106.6
Q ss_pred CCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHH
Q psy13819 128 WRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQE 207 (323)
Q Consensus 128 W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~ 207 (323)
|...||++|+|||+. ++++++.|+++||.|. ..+. -|.. ..|.|+... + .+...++..+|+++
T Consensus 3 W~~~ep~~G~~n~~~---~D~~~~~a~~~gi~v~--gH~l---~W~~-~~P~W~~~~-~-------~~~~~~~~~~~i~~ 65 (254)
T smart00633 3 WDSTEPSRGQFNFSG---ADAIVNFAKENGIKVR--GHTL---VWHS-QTPDWVFNL-S-------KETLLARLENHIKT 65 (254)
T ss_pred cccccCCCCccChHH---HHHHHHHHHHCCCEEE--EEEE---eecc-cCCHhhhcC-C-------HHHHHHHHHHHHHH
Confidence 899999999999998 7778999999999983 2222 2433 689998752 2 45678899999999
Q ss_pred HHHHHHhhhccCCCcEEEEccCCCcCCCC-----------CChHHHHHHHHHHHHHhcCCceEEEeeCCCCccc------
Q psy13819 208 LFPRIQKYLYGNDRPIILVQVENEYGSDA-----------ECDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAY------ 270 (323)
Q Consensus 208 l~~~i~~~~~~n~g~Vi~~QieNE~g~~~-----------~~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~------ 270 (323)
++.+ |+ +.|..|+|-||.-... ..+.+|+...-+.+|+. ..++.|+.+|..-...
T Consensus 66 v~~r-----y~--g~i~~wdV~NE~~~~~~~~~~~~~w~~~~G~~~i~~af~~ar~~-~P~a~l~~Ndy~~~~~~~k~~~ 137 (254)
T smart00633 66 VVGR-----YK--GKIYAWDVVNEALHDNGSGLRRSVWYQILGEDYIEKAFRYAREA-DPDAKLFYNDYNTEEPNAKRQA 137 (254)
T ss_pred HHHH-----hC--CcceEEEEeeecccCCCcccccchHHHhcChHHHHHHHHHHHHh-CCCCEEEEeccCCcCccHHHHH
Confidence 9999 64 5689999999964321 01247888888888885 5678888887542111
Q ss_pred ----ccc-ccCce-EEeecCCCC-----CchhHHHHHHHhhCC-CCCeEEEEecc
Q psy13819 271 ----LRC-TVDGV-YSTVDFTVF-----KDVNVSFQAQRTRAP-QGPLVNAEFEF 313 (323)
Q Consensus 271 ----~~~-~~~g~-~~~~nfg~~-----~~~~~~~~~~r~~~p-~~Plm~~E~~~ 313 (323)
++. .-.|+ +..+.+... .+.+...+.++++.. +.|+++||+=.
T Consensus 138 ~~~~v~~l~~~g~~iDgiGlQ~H~~~~~~~~~~~~~~l~~~~~~g~pi~iTE~dv 192 (254)
T smart00633 138 IYELVKKLKAKGVPIDGIGLQSHLSLGSPNIAEIRAALDRFASLGLEIQITELDI 192 (254)
T ss_pred HHHHHHHHHHCCCccceeeeeeeecCCCCCHHHHHHHHHHHHHcCCceEEEEeec
Confidence 000 00121 333332211 134455556655543 78999999843
No 28
>KOG0626|consensus
Probab=98.79 E-value=7.4e-09 Score=104.43 Aligned_cols=119 Identities=14% Similarity=0.184 Sum_probs=108.4
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCC---CeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHP---GHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLL 182 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~---G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~ 182 (323)
.+++|+++||++|+++.|+.|.|+++=|.- +..+-+|.+-+..+|+...++||.++|.. -|..+|.+|.
T Consensus 92 ~ykeDv~Lmk~lgv~afRFSIsWSRIlP~G~~~~gVN~~Gi~fY~~LI~eL~~nGI~P~VTL--------fHwDlPq~Le 163 (524)
T KOG0626|consen 92 RYKEDVKLMKELGVDAFRFSISWSRILPNGRLTGGVNEAGIQFYNNLIDELLANGIEPFVTL--------FHWDLPQALE 163 (524)
T ss_pred hhHHHHHHHHHcCCCeEEEEeehHhhCCCCCcCCCcCHHHHHHHHHHHHHHHHcCCeEEEEE--------ecCCCCHHHH
Confidence 679999999999999999999999999983 46899999999999999999999999998 5889999999
Q ss_pred ccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcC
Q psy13819 183 KVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYG 233 (323)
Q Consensus 183 ~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g 233 (323)
.++.+++.+..-.+|+++++-.|+++..+|+-+..-|+|.|++++ ....|
T Consensus 164 DeYgGwLn~~ivedF~~yA~~CF~~fGDrVK~WiT~NEP~v~s~~-gY~~G 213 (524)
T KOG0626|consen 164 DEYGGWLNPEIVEDFRDYADLCFQEFGDRVKHWITFNEPNVFSIG-GYDTG 213 (524)
T ss_pred HHhccccCHHHHHHHHHHHHHHHHHhcccceeeEEecccceeeee-hhccC
Confidence 888899988888999999999999999999999999999999887 44333
No 29
>PF13364 BetaGal_dom4_5: Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=98.71 E-value=1.3e-08 Score=83.77 Aligned_cols=45 Identities=31% Similarity=0.663 Sum_probs=37.7
Q ss_pred CCCceeEEEEcCcccccccCCCCCeeEEEeccccccCCcceEEEE
Q psy13819 2 RRWSKGVLFANGRHWGRYWSEVGPQYSLFVPEEFLKVGTNRITIF 46 (323)
Q Consensus 2 ~~~~kg~~~vng~~lGryw~~~gpq~~lyvP~~~l~~g~N~i~i~ 46 (323)
....++.|||||++|||||+.+|||.++.||..+|+.++|.|+|+
T Consensus 60 g~~~~~~vwVNG~~~G~~~~~~g~q~tf~~p~~il~~~n~v~~vl 104 (111)
T PF13364_consen 60 GNAFRASVWVNGWFLGSYWPGIGPQTTFSVPAGILKYGNNVLVVL 104 (111)
T ss_dssp STTEEEEEEETTEEEEEEETTTECCEEEEE-BTTBTTCEEEEEEE
T ss_pred CCceEEEEEECCEEeeeecCCCCccEEEEeCceeecCCCEEEEEE
Confidence 356789999999999999988999999999999999985554444
No 30
>PF13204 DUF4038: Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=98.51 E-value=1.5e-06 Score=82.81 Aligned_cols=157 Identities=19% Similarity=0.355 Sum_probs=91.7
Q ss_pred CCeEE-eCCeeeEEeeeeecCC---CCChhcHHHHHHHHHHhCCCEEEEEE-e-CCCc--------ccC----CCeeccc
Q psy13819 80 GDTFR-LNEDPFQFVSGSFHYF---RAPPGRWCWIMRAMRAAGLNALSTYV-E-WRSH--------EVH----PGHYHYD 141 (323)
Q Consensus 80 ~~~~~-~dGk~~~i~~g~~Hy~---r~p~~~W~~~l~~mk~~G~N~Vr~yv-~-W~~h--------EP~----~G~fdf~ 141 (323)
++.|. -||+||+.++ .-.+. +...++|+.-|+..|+-|||+|++-+ + |... .|- ++.+||+
T Consensus 2 ~r~f~~~dG~Pff~lg-dT~W~~~~~~~~~e~~~yL~~r~~qgFN~iq~~~l~~~~~~~~~n~~~~~~~~~~~~~~~d~~ 80 (289)
T PF13204_consen 2 GRHFVYADGTPFFWLG-DTAWSLFHRLTREEWEQYLDTRKEQGFNVIQMNVLPQWDGYNTPNRYGFAPFPDEDPGQFDFT 80 (289)
T ss_dssp SSSEEETTS-B--EEE-EE-TTHHHH--HHHHHHHHHHHHHTT--EEEEES-SSSS-B----TTS-BS-SSTT------T
T ss_pred CceEecCCCCEEeehh-HHHHHHhhCCCHHHHHHHHHHHHHCCCCEEEEEeCCCcccccccccCCCcCCCCCCccccCCC
Confidence 34565 8999999998 44442 56789999999999999999999933 2 2222 121 1236666
Q ss_pred c-----cchHHHHHHHHHHCCCeEEEccC---CccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHH
Q psy13819 142 G-----HRDIEHFLQLAVEEDLYILLRPG---PFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQ 213 (323)
Q Consensus 142 g-----~~dl~~fl~~a~e~GL~vilr~G---Pyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~ 213 (323)
. .+++++.|+.|.+.||.+-|-|- ||.-+ .|-.. +. .-=.+.+++|.+.|++|
T Consensus 81 ~~N~~YF~~~d~~i~~a~~~Gi~~~lv~~wg~~~~~~--------~Wg~~--~~-------~m~~e~~~~Y~~yv~~R-- 141 (289)
T PF13204_consen 81 RPNPAYFDHLDRRIEKANELGIEAALVPFWGCPYVPG--------TWGFG--PN-------IMPPENAERYGRYVVAR-- 141 (289)
T ss_dssp T----HHHHHHHHHHHHHHTT-EEEEESS-HHHHH-----------------TT-------SS-HHHHHHHHHHHHHH--
T ss_pred CCCHHHHHHHHHHHHHHHHCCCeEEEEEEECCccccc--------ccccc--cc-------CCCHHHHHHHHHHHHHH--
Confidence 4 57999999999999999866441 12111 23221 00 01156889999999999
Q ss_pred hhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceEEEee
Q psy13819 214 KYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAVLYST 263 (323)
Q Consensus 214 ~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~ 263 (323)
+..-++|| |-+.||+ .......++.+.+.+.+++..... |.|.
T Consensus 142 ---y~~~~Nvi-W~l~gd~-~~~~~~~~~w~~~~~~i~~~dp~~--L~T~ 184 (289)
T PF13204_consen 142 ---YGAYPNVI-WILGGDY-FDTEKTRADWDAMARGIKENDPYQ--LITI 184 (289)
T ss_dssp ---HTT-SSEE-EEEESSS---TTSSHHHHHHHHHHHHHH--SS---EEE
T ss_pred ---HhcCCCCE-EEecCcc-CCCCcCHHHHHHHHHHHHhhCCCC--cEEE
Confidence 66667777 7799999 333446788888888888864444 4443
No 31
>PLN02801 beta-amylase
Probab=98.29 E-value=4.5e-06 Score=84.08 Aligned_cols=84 Identities=20% Similarity=0.334 Sum_probs=65.6
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccc----cCCCCCc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGK----RDFGGFP 178 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aE----w~~gg~P 178 (323)
++..+..|+++|++|+..|.+.|-|..+|.+ |++|||++ +.++++++++.||++.+-..=.-||. ..+=-+|
T Consensus 36 ~~~l~~~L~~LK~~GVdGVmvDVWWGiVE~~~P~~YdWsg---Y~~l~~mvr~~GLKlq~vmSFHqCGGNVGD~~~IpLP 112 (517)
T PLN02801 36 EEGLEKQLKRLKEAGVDGVMVDVWWGIVESKGPKQYDWSA---YRSLFELVQSFGLKIQAIMSFHQCGGNVGDAVNIPIP 112 (517)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeccCCCCccCcHH---HHHHHHHHHHcCCeEEEEEEecccCCCCCCcccccCC
Confidence 5678999999999999999999999999985 99999999 88889999999999765443233442 1111379
Q ss_pred ccccc---cCCCeee
Q psy13819 179 PWLLK---VAPDILL 190 (323)
Q Consensus 179 ~Wl~~---~~p~~~~ 190 (323)
.|+.+ ++|++.+
T Consensus 113 ~WV~~~g~~~pDi~f 127 (517)
T PLN02801 113 QWVRDVGDSDPDIFY 127 (517)
T ss_pred HHHHHhhccCCCcee
Confidence 99865 3566553
No 32
>PLN02803 beta-amylase
Probab=98.29 E-value=4.4e-06 Score=84.52 Aligned_cols=84 Identities=24% Similarity=0.465 Sum_probs=65.5
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc----CCCCCc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR----DFGGFP 178 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw----~~gg~P 178 (323)
++..+..|+++|++|+..|.+.|-|..+|.+ |++|||++ +.++++++++.||++.+-..=.-||.= .+=-+|
T Consensus 106 ~~~l~~~L~~LK~~GVdGVmvDVWWGiVE~~~p~~YdWsg---Y~~l~~mvr~~GLKlq~vmSFHqCGGNVGD~~~IpLP 182 (548)
T PLN02803 106 PRAMNASLMALRSAGVEGVMVDAWWGLVEKDGPMKYNWEG---YAELVQMVQKHGLKLQVVMSFHQCGGNVGDSCSIPLP 182 (548)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEeeeeeeccCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEecccCCCCCCcccccCC
Confidence 4567889999999999999999999999995 99999999 888899999999997654432334420 001389
Q ss_pred ccccc---cCCCeee
Q psy13819 179 PWLLK---VAPDILL 190 (323)
Q Consensus 179 ~Wl~~---~~p~~~~ 190 (323)
.|+.+ ++|++.+
T Consensus 183 ~WV~e~~~~~pDi~f 197 (548)
T PLN02803 183 PWVLEEMSKNPDLVY 197 (548)
T ss_pred HHHHHhhhcCCCceE
Confidence 99865 3577654
No 33
>PLN00197 beta-amylase; Provisional
Probab=98.28 E-value=4.4e-06 Score=84.76 Aligned_cols=83 Identities=24% Similarity=0.493 Sum_probs=66.0
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCccc-CCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCC-----CC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEV-HPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFG-----GF 177 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP-~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~g-----g~ 177 (323)
++..+..|+.+|++|+..|.+.|-|..+|. .|++|||++ +.++++++++.||++.+-..=.-||. +-| -+
T Consensus 126 ~~~l~~~L~~LK~~GVdGVmvDvWWGiVE~~~p~~YdWsg---Y~~L~~mvr~~GLKlq~VmSFHqCGG-NVGD~~~IpL 201 (573)
T PLN00197 126 RKAMKASLQALKSAGVEGIMMDVWWGLVERESPGVYNWGG---YNELLEMAKRHGLKVQAVMSFHQCGG-NVGDSCTIPL 201 (573)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeccCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEecccCC-CCCCcccccC
Confidence 567899999999999999999999999999 699999999 88889999999999665443233442 111 38
Q ss_pred cccccc---cCCCeee
Q psy13819 178 PPWLLK---VAPDILL 190 (323)
Q Consensus 178 P~Wl~~---~~p~~~~ 190 (323)
|.|+.+ ++|++.+
T Consensus 202 P~WV~~~g~~dpDiff 217 (573)
T PLN00197 202 PKWVVEEVDKDPDLAY 217 (573)
T ss_pred CHHHHHhhccCCCcee
Confidence 999865 3577654
No 34
>PLN02161 beta-amylase
Probab=98.27 E-value=5.3e-06 Score=83.55 Aligned_cols=84 Identities=19% Similarity=0.251 Sum_probs=65.6
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCccc-CCCeecccccchHHHHHHHHHHCCCeEEEccCCcccccc----CCCCCc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEV-HPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKR----DFGGFP 178 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP-~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw----~~gg~P 178 (323)
++..+..|+++|++|+..|.+.|-|..+|. .|++|||++ +.++++++++.||++.+-..=.-|+.= ..=-+|
T Consensus 116 ~~al~~~L~~LK~~GVdGVmvDVWWGiVE~~~p~~YdWsg---Y~~l~~mvr~~GLKlq~vmSFHqCGGNvGd~~~IpLP 192 (531)
T PLN02161 116 LKALTVSLKALKLAGVHGIAVEVWWGIVERFSPLEFKWSL---YEELFRLISEAGLKLHVALCFHSNMHLFGGKGGISLP 192 (531)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEeeeeeeecCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEecccCCCCCCccCccCC
Confidence 456788999999999999999999999999 699999999 888899999999997654432334321 111379
Q ss_pred ccccc---cCCCeee
Q psy13819 179 PWLLK---VAPDILL 190 (323)
Q Consensus 179 ~Wl~~---~~p~~~~ 190 (323)
.|+.+ ++|++.+
T Consensus 193 ~WV~~~g~~~pDi~f 207 (531)
T PLN02161 193 LWIREIGDVNKDIYY 207 (531)
T ss_pred HHHHhhhccCCCceE
Confidence 99865 3577654
No 35
>PLN02705 beta-amylase
Probab=98.27 E-value=4.5e-06 Score=85.44 Aligned_cols=83 Identities=18% Similarity=0.335 Sum_probs=65.8
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCccc-CCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCC-----CC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEV-HPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFG-----GF 177 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP-~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~g-----g~ 177 (323)
++..+..|+++|++|+..|.+.|-|..+|. .|++|||++ +.++++++++.||++.+-..=.-||. +-| -+
T Consensus 267 ~~al~a~L~aLK~aGVdGVmvDVWWGiVE~~~P~~YdWsg---Y~~L~~mvr~~GLKlqvVmSFHqCGG-NVGD~~~IPL 342 (681)
T PLN02705 267 PEGVRQELSHMKSLNVDGVVVDCWWGIVEGWNPQKYVWSG---YRELFNIIREFKLKLQVVMAFHEYGG-NASGNVMISL 342 (681)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeEeecCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEeeccCC-CCCCcccccC
Confidence 577899999999999999999999999999 599999999 88889999999999655443233442 111 38
Q ss_pred cccccc---cCCCeee
Q psy13819 178 PPWLLK---VAPDILL 190 (323)
Q Consensus 178 P~Wl~~---~~p~~~~ 190 (323)
|.|+.+ ++|++.+
T Consensus 343 P~WV~e~g~~nPDiff 358 (681)
T PLN02705 343 PQWVLEIGKDNQDIFF 358 (681)
T ss_pred CHHHHHhcccCCCcee
Confidence 999864 3577653
No 36
>PF07745 Glyco_hydro_53: Glycosyl hydrolase family 53; InterPro: IPR011683 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This domain is found in family 53 of the glycosyl hydrolase classification []. These enzymes are endo-1,4- beta-galactanases (3.2.1.89 from EC). The structure of this domain is known [] and has a TIM barrel fold.; GO: 0015926 glucosidase activity; PDB: 1HJQ_A 1HJS_A 1HJU_B 1FHL_A 1FOB_A 2GFT_A 1UR4_B 1UR0_A 1R8L_B 2CCR_A ....
Probab=98.25 E-value=2.4e-05 Score=76.06 Aligned_cols=179 Identities=15% Similarity=0.166 Sum_probs=96.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCC-CeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCC
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHP-GHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAP 186 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~-G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p 186 (323)
+|.++.||+.|+|+||+-| | +.|.. |..|.+ +..++.+.|+++||+|+|.+. | -..|- +|
T Consensus 27 ~d~~~ilk~~G~N~vRlRv-w--v~P~~~g~~~~~---~~~~~akrak~~Gm~vlldfH-Y---------SD~Wa---DP 87 (332)
T PF07745_consen 27 KDLFQILKDHGVNAVRLRV-W--VNPYDGGYNDLE---DVIALAKRAKAAGMKVLLDFH-Y---------SDFWA---DP 87 (332)
T ss_dssp --HHHHHHHTT--EEEEEE----SS-TTTTTTSHH---HHHHHHHHHHHTT-EEEEEE--S---------SSS-----BT
T ss_pred CCHHHHHHhcCCCeEEEEe-c--cCCcccccCCHH---HHHHHHHHHHHCCCeEEEeec-c---------cCCCC---CC
Confidence 5789999999999999988 4 45665 555554 467778889999999999983 2 11232 12
Q ss_pred Ceee------cCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC--C---CC--ChH---HHHHHHHHHH
Q psy13819 187 DILL------RQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS--D---AE--CDP---AHAVWLRDLL 250 (323)
Q Consensus 187 ~~~~------R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~--~---~~--~~~---~Y~~~l~~~~ 250 (323)
+... -.+.+...+++..|.+.++..++. +|-.+-|+||.||... . +. .-. .++..-.+++
T Consensus 88 g~Q~~P~aW~~~~~~~l~~~v~~yT~~vl~~l~~----~G~~pd~VQVGNEin~Gmlwp~g~~~~~~~~a~ll~ag~~AV 163 (332)
T PF07745_consen 88 GKQNKPAAWANLSFDQLAKAVYDYTKDVLQALKA----AGVTPDMVQVGNEINNGMLWPDGKPSNWDNLAKLLNAGIKAV 163 (332)
T ss_dssp TB-B--TTCTSSSHHHHHHHHHHHHHHHHHHHHH----TT--ESEEEESSSGGGESTBTTTCTT-HHHHHHHHHHHHHHH
T ss_pred CCCCCCccCCCCCHHHHHHHHHHHHHHHHHHHHH----CCCCccEEEeCccccccccCcCCCccCHHHHHHHHHHHHHHH
Confidence 2111 124466788999999999999764 3445669999999653 2 11 113 3444445666
Q ss_pred HHhcCCceEEEeeCCCCcc-----cccc-ccC----ceEEeecCCCC--Cchh---HHHHHHHhhCCCCCeEEEEe
Q psy13819 251 RTYVQDKAVLYSTDGAFDA-----YLRC-TVD----GVYSTVDFTVF--KDVN---VSFQAQRTRAPQGPLVNAEF 311 (323)
Q Consensus 251 ~~~~g~~~~l~t~dg~~~~-----~~~~-~~~----g~~~~~nfg~~--~~~~---~~~~~~r~~~p~~Plm~~E~ 311 (323)
|+.....-++...+.+.+. ++.. .-. +++ .+.++|. ...+ ..++.+++.. ++|+|++|.
T Consensus 164 r~~~p~~kV~lH~~~~~~~~~~~~~f~~l~~~g~d~Dvi-GlSyYP~w~~~l~~l~~~l~~l~~ry-~K~V~V~Et 237 (332)
T PF07745_consen 164 REVDPNIKVMLHLANGGDNDLYRWFFDNLKAAGVDFDVI-GLSYYPFWHGTLEDLKNNLNDLASRY-GKPVMVVET 237 (332)
T ss_dssp HTHSSTSEEEEEES-TTSHHHHHHHHHHHHHTTGG-SEE-EEEE-STTST-HHHHHHHHHHHHHHH-T-EEEEEEE
T ss_pred HhcCCCCcEEEEECCCCchHHHHHHHHHHHhcCCCcceE-EEecCCCCcchHHHHHHHHHHHHHHh-CCeeEEEec
Confidence 6642222244444444432 1111 111 233 3344442 1233 3344442222 689999996
No 37
>PLN02905 beta-amylase
Probab=98.25 E-value=5.8e-06 Score=84.89 Aligned_cols=83 Identities=22% Similarity=0.501 Sum_probs=65.6
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCccc-CCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCC-----CC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEV-HPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFG-----GF 177 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP-~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~g-----g~ 177 (323)
++..+..|+++|++|+..|.+.|-|..+|. .|++|||++ +.++++++++.||++.+-..=.-||. +-| -+
T Consensus 285 ~~al~a~L~aLK~aGVdGVmvDVWWGiVE~~gP~~YdWsg---Y~~L~~mvr~~GLKlqvVMSFHqCGG-NVGD~~~IPL 360 (702)
T PLN02905 285 PDGLLKQLRILKSINVDGVKVDCWWGIVEAHAPQEYNWNG---YKRLFQMVRELKLKLQVVMSFHECGG-NVGDDVCIPL 360 (702)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeecCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEecccCC-CCCCcccccC
Confidence 456789999999999999999999999999 699999999 88889999999999665443233442 111 38
Q ss_pred cccccc---cCCCeee
Q psy13819 178 PPWLLK---VAPDILL 190 (323)
Q Consensus 178 P~Wl~~---~~p~~~~ 190 (323)
|.|+.+ ++|++.+
T Consensus 361 P~WV~e~g~~nPDiff 376 (702)
T PLN02905 361 PHWVAEIGRSNPDIFF 376 (702)
T ss_pred CHHHHHhhhcCCCceE
Confidence 999865 3577653
No 38
>PF01373 Glyco_hydro_14: Glycosyl hydrolase family 14; InterPro: IPR001554 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 14 GH14 from CAZY comprises enzymes with only one known activity; beta-amylase (3.2.1.2 from EC). A Glu residue has been proposed as a catalytic residue, but it is not known if it is the nucleophile or the proton donor. Beta-amylase [, ] is an enzyme that hydrolyses 1,4-alpha-glucosidic linkages in starch-type polysaccharide substrates so as to remove successive maltose units from the non-reducing ends of the chains. Beta-amylase is present in certain bacteria as well as in plants. Three highly conserved sequence regions are found in all known beta-amylases. The first of these regions is located in the N-terminal section of the enzymes and contains an aspartate which is known [] to be involved in the catalytic mechanism. The second, located in a more central location, is centred around a glutamate which is also involved [] in the catalytic mechanism. The 3D structure of a complex of soybean beta-amylase with an inhibitor (alpha-cyclodextrin) has been determined to 3.0A resolution by X-ray diffraction []. The enzyme folds into large and small domains: the large domain has a (beta alpha)8 super-secondary structural core, while the smaller is formed from two long loops extending from the beta-3 and beta-4 strands of the (beta alpha)8 fold []. The interface of the two domains, together with shorter loops from the (beta alpha)8 core, form a deep cleft, in which the inhibitor binds []. Two maltose molecules also bind in the cleft, one sharing a binding site with alpha-cyclodextrin, and the other sitting more deeply in the cleft [].; GO: 0016161 beta-amylase activity, 0000272 polysaccharide catabolic process; PDB: 1FA2_A 2DQX_A 1WDP_A 1UKP_C 1BYC_A 1BYA_A 1Q6C_A 1V3I_A 1BTC_A 1BYB_A ....
Probab=98.23 E-value=2.5e-06 Score=84.16 Aligned_cols=115 Identities=19% Similarity=0.364 Sum_probs=75.3
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCcccc----ccCCCCCccc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICG----KRDFGGFPPW 180 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~a----Ew~~gg~P~W 180 (323)
..+..|+++|++|+..|.+.|-|..+|.+ |++|||++ ++++++++++.||++.+-+.=.-|+ ...+=-+|.|
T Consensus 17 ~~~~~L~~LK~~GV~GVmvdvWWGiVE~~~p~~ydWs~---Y~~l~~~vr~~GLk~~~vmsfH~cGgNvgD~~~IpLP~W 93 (402)
T PF01373_consen 17 ALEAQLRALKSAGVDGVMVDVWWGIVEGEGPQQYDWSG---YRELFEMVRDAGLKLQVVMSFHQCGGNVGDDCNIPLPSW 93 (402)
T ss_dssp HHHHHHHHHHHTTEEEEEEEEEHHHHTGSSTTB---HH---HHHHHHHHHHTT-EEEEEEE-S-BSSSTTSSSEB-S-HH
T ss_pred HHHHHHHHHHHcCCcEEEEEeEeeeeccCCCCccCcHH---HHHHHHHHHHcCCeEEEEEeeecCCCCCCCccCCcCCHH
Confidence 56889999999999999999999999998 99999999 8888999999999976544334443 2222347999
Q ss_pred ccc---cCCCeeec--------------CCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccC
Q psy13819 181 LLK---VAPDILLR--------------QNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVE 229 (323)
Q Consensus 181 l~~---~~p~~~~R--------------~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Qie 229 (323)
+.+ ++ ++.+. ..... ++....|++.+....+++. +.|..+||.
T Consensus 94 v~~~~~~~-di~ytd~~G~rn~E~lSp~~~grt-~~~Y~dfm~sF~~~f~~~~----~~I~~I~vg 153 (402)
T PF01373_consen 94 VWEIGKKD-DIFYTDRSGNRNKEYLSPVLDGRT-LQCYSDFMRSFRDNFSDYL----STITEIQVG 153 (402)
T ss_dssp HHHHHHHS-GGEEE-TTS-EEEEEE-CTBTTBC-HHHHHHHHHHHHHHCHHHH----TGEEEEEE-
T ss_pred HHhccccC-CcEEECCCCCcCcceeecccCCch-HHHHHHHHHHHHHHHHHHH----hhheEEEec
Confidence 864 22 43321 11122 5555666666666666654 467777653
No 39
>COG3693 XynA Beta-1,4-xylanase [Carbohydrate transport and metabolism]
Probab=98.20 E-value=3.7e-05 Score=73.78 Aligned_cols=175 Identities=15% Similarity=0.209 Sum_probs=120.2
Q ss_pred HHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCC
Q psy13819 114 MRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQN 193 (323)
Q Consensus 114 mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~ 193 (323)
.|+.+.=|-+...=|+.+||++|.|+|+. -+++.+.|+++||.+- -.+.| |.+ -.|.|+...+ -+
T Consensus 55 ~re~n~iTpenemKwe~i~p~~G~f~Fe~---AD~ia~FAr~h~m~lh--GHtLv---W~~-q~P~W~~~~e------~~ 119 (345)
T COG3693 55 ARECNQITPENEMKWEAIEPERGRFNFEA---ADAIANFARKHNMPLH--GHTLV---WHS-QVPDWLFGDE------LS 119 (345)
T ss_pred HhhhcccccccccccccccCCCCccCccc---hHHHHHHHHHcCCeec--cceee---ecc-cCCchhhccc------cC
Confidence 55555556666888999999999999999 5667999999999643 22222 333 5788987621 34
Q ss_pred CHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC----C-------CCChHHHHHHHHHHHHHhcCCceEEEe
Q psy13819 194 HPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS----D-------AECDPAHAVWLRDLLRTYVQDKAVLYS 262 (323)
Q Consensus 194 ~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~----~-------~~~~~~Y~~~l~~~~~~~~g~~~~l~t 262 (323)
-+..++.+++++..++.| |+ +.|+.|.|-||.=. + +....+|+++.=..+|+ ...+-.||-
T Consensus 120 ~~~~~~~~e~hI~tV~~r-----Yk--g~~~sWDVVNE~vdd~g~~R~s~w~~~~~gpd~I~~aF~~Are-adP~AkL~~ 191 (345)
T COG3693 120 KEALAKMVEEHIKTVVGR-----YK--GSVASWDVVNEAVDDQGSLRRSAWYDGGTGPDYIKLAFHIARE-ADPDAKLVI 191 (345)
T ss_pred hHHHHHHHHHHHHHHHHh-----cc--CceeEEEecccccCCCchhhhhhhhccCCccHHHHHHHHHHHh-hCCCceEEe
Confidence 578999999999999999 75 34999999999622 1 12347899999999988 477789999
Q ss_pred eCCCCccc-------------ccc---ccCceEEeecCCC-CCchhHHHHHHHhhCC-CCCeEEEEe
Q psy13819 263 TDGAFDAY-------------LRC---TVDGVYSTVDFTV-FKDVNVSFQAQRTRAP-QGPLVNAEF 311 (323)
Q Consensus 263 ~dg~~~~~-------------~~~---~~~g~~~~~nfg~-~~~~~~~~~~~r~~~p-~~Plm~~E~ 311 (323)
+|..-... |+. .+.|+=--..|+. ....+.....++++.. +=|.|+||+
T Consensus 192 NDY~ie~~~~kr~~~~nlI~~LkekG~pIDgiG~QsH~~~~~~~~~~~~~a~~~~~k~Gl~i~VTEL 258 (345)
T COG3693 192 NDYSIEGNPAKRNYVLNLIEELKEKGAPIDGIGIQSHFSGDGPSIEKMRAALLKFSKLGLPIYVTEL 258 (345)
T ss_pred ecccccCChHHHHHHHHHHHHHHHCCCCccceeeeeeecCCCCCHHHHHHHHHHHhhcCCCceEEEe
Confidence 98842110 111 1333221222321 1345566666666655 459999997
No 40
>PF00331 Glyco_hydro_10: Glycosyl hydrolase family 10; InterPro: IPR001000 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 10 GH10 from CAZY comprises enzymes with a number of known activities; xylanase (3.2.1.8 from EC); endo-1,3-beta-xylanase (3.2.1.32 from EC); cellobiohydrolase (3.2.1.91 from EC). These enzymes were formerly known as cellulase family F. The microbial degradation of cellulose and xylans requires several types of enzymes such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) [, ]. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family F [] or as the glycosyl hydrolases family 10 []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1UQZ_A 1UQY_A 1UR2_A 1UR1_A 2CNC_A 1OD8_A 1E0W_A 1E0V_A 1V0M_A 1E0X_B ....
Probab=98.18 E-value=1.3e-05 Score=77.36 Aligned_cols=197 Identities=16% Similarity=0.215 Sum_probs=129.0
Q ss_pred EeeeeecCCCCChhcHHHHHHHHHHhCCCEEEE--EEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccc
Q psy13819 92 FVSGSFHYFRAPPGRWCWIMRAMRAAGLNALST--YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFIC 169 (323)
Q Consensus 92 i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~--yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~ 169 (323)
.+|.+++...+..+. ..+.+-...||.+.. ..-|...||++|+|||+. .+++++.|+++||.|--.+ .
T Consensus 11 ~~G~av~~~~~~~~~---~~~~~~~~~Fn~~t~eN~~Kw~~~e~~~g~~~~~~---~D~~~~~a~~~g~~vrGH~--L-- 80 (320)
T PF00331_consen 11 PFGAAVNAQQLEDDP---RYRELFAKHFNSVTPENEMKWGSIEPEPGRFNFES---ADAILDWARENGIKVRGHT--L-- 80 (320)
T ss_dssp EEEEEEBGGGHTHHH---HHHHHHHHH-SEEEESSTTSHHHHESBTTBEE-HH---HHHHHHHHHHTT-EEEEEE--E--
T ss_pred CEEEEechhHcCCcH---HHHHHHHHhCCeeeeccccchhhhcCCCCccCccc---hhHHHHHHHhcCcceeeee--E--
Confidence 678888876554332 466666678999999 589999999999999987 7788999999999875322 1
Q ss_pred cccCCCCCcccccccCCCeeecCCC---HhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCC-------C--
Q psy13819 170 GKRDFGGFPPWLLKVAPDILLRQNH---PVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDA-------E-- 237 (323)
Q Consensus 170 aEw~~gg~P~Wl~~~~p~~~~R~~~---~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~-------~-- 237 (323)
-|. ...|.|+... +. .+. +.+.+.+++++++++.+ |++.|.|.+|.|-||.=... .
T Consensus 81 -vW~-~~~P~w~~~~-~~----~~~~~~~~~~~~l~~~I~~v~~~-----y~~~g~i~~WDVvNE~i~~~~~~~~~r~~~ 148 (320)
T PF00331_consen 81 -VWH-SQTPDWVFNL-AN----GSPDEKEELRARLENHIKTVVTR-----YKDKGRIYAWDVVNEAIDDDGNPGGLRDSP 148 (320)
T ss_dssp -EES-SSS-HHHHTS-TT----SSBHHHHHHHHHHHHHHHHHHHH-----TTTTTTESEEEEEES-B-TTSSSSSBCTSH
T ss_pred -EEc-ccccceeeec-cC----CCcccHHHHHHHHHHHHHHHHhH-----hccccceEEEEEeeecccCCCccccccCCh
Confidence 132 3789999872 10 111 24899999999999999 76668999999999963321 0
Q ss_pred ----ChHHHHHHHHHHHHHhcCCceEEEeeCCCCcccccc----------ccCce-EEeecC----CCCCchhHHHHHHH
Q psy13819 238 ----CDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAYLRC----------TVDGV-YSTVDF----TVFKDVNVSFQAQR 298 (323)
Q Consensus 238 ----~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~~~~----------~~~g~-~~~~nf----g~~~~~~~~~~~~r 298 (323)
.+.+|+..+-+.+++. -.++.||-+|..-...-+. .-.|+ +..+.+ ......+..-+.|+
T Consensus 149 ~~~~lG~~yi~~aF~~A~~~-~P~a~L~~NDy~~~~~~k~~~~~~lv~~l~~~gvpIdgIG~Q~H~~~~~~~~~i~~~l~ 227 (320)
T PF00331_consen 149 WYDALGPDYIADAFRAAREA-DPNAKLFYNDYNIESPAKRDAYLNLVKDLKARGVPIDGIGLQSHFDAGYPPEQIWNALD 227 (320)
T ss_dssp HHHHHTTCHHHHHHHHHHHH-HTTSEEEEEESSTTSTHHHHHHHHHHHHHHHTTHCS-EEEEEEEEETTSSHHHHHHHHH
T ss_pred hhhcccHhHHHHHHHHHHHh-CCCcEEEeccccccchHHHHHHHHHHHHHHhCCCccceechhhccCCCCCHHHHHHHHH
Confidence 1246888777888875 4478999998765322110 01122 222322 11122556666777
Q ss_pred hhCC-CCCeEEEEe
Q psy13819 299 TRAP-QGPLVNAEF 311 (323)
Q Consensus 299 ~~~p-~~Plm~~E~ 311 (323)
++.. +-|+.+||+
T Consensus 228 ~~~~~Gl~i~ITEl 241 (320)
T PF00331_consen 228 RFASLGLPIHITEL 241 (320)
T ss_dssp HHHTTTSEEEEEEE
T ss_pred HHHHcCCceEEEee
Confidence 6654 679999998
No 41
>PF14488 DUF4434: Domain of unknown function (DUF4434)
Probab=98.16 E-value=5.6e-05 Score=66.58 Aligned_cols=131 Identities=18% Similarity=0.178 Sum_probs=86.8
Q ss_pred CCCChhcHHHHHHHHHHhCCCEEEE-EEeCCCc--ccC---CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 100 FRAPPGRWCWIMRAMRAAGLNALST-YVEWRSH--EVH---PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 100 ~r~p~~~W~~~l~~mk~~G~N~Vr~-yv~W~~h--EP~---~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
-.++++.|+++++.||++|+++|=+ .+....- .|. ++.|.-...+-|+.+|++|++.||+|++..+
T Consensus 15 ~~~~~~~W~~~~~~m~~~GidtlIlq~~~~~~~~~yps~~~~~~~~~~~~d~l~~~L~~A~~~Gmkv~~Gl~-------- 86 (166)
T PF14488_consen 15 QNWTPAQWREEFRAMKAIGIDTLILQWTGYGGFAFYPSKLSPGGFYMPPVDLLEMILDAADKYGMKVFVGLY-------- 86 (166)
T ss_pred cCCCHHHHHHHHHHHHHcCCcEEEEEEeecCCcccCCccccCccccCCcccHHHHHHHHHHHcCCEEEEeCC--------
Confidence 4788999999999999999999965 2222111 122 1223333456799999999999999999883
Q ss_pred CCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHh
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTY 253 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~ 253 (323)
--|.|....++ ..-.+...+..+++.++ |+++++.-+|=|-+|...+.....++.+.|.+.+++.
T Consensus 87 --~~~~~w~~~~~--------~~~~~~~~~v~~el~~~-----yg~h~sf~GWYip~E~~~~~~~~~~~~~~l~~~lk~~ 151 (166)
T PF14488_consen 87 --FDPDYWDQGDL--------DWEAERNKQVADELWQR-----YGHHPSFYGWYIPYEIDDYNWNAPERFALLGKYLKQI 151 (166)
T ss_pred --CCchhhhccCH--------HHHHHHHHHHHHHHHHH-----HcCCCCCceEEEecccCCcccchHHHHHHHHHHHHHh
Confidence 12233332111 11233334445555554 8889999999999999766444566777777777664
No 42
>COG2730 BglC Endoglucanase [Carbohydrate transport and metabolism]
Probab=98.05 E-value=2.2e-05 Score=78.25 Aligned_cols=114 Identities=13% Similarity=0.117 Sum_probs=76.8
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC----CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH----PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~----~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
++++..||.+|||+||+++.|..+++- |...+-+-..-|++.|+.|++.||+|++...=+-++. .+-=..|...
T Consensus 76 ~~~~~~ik~~G~n~VRiPi~~~~~~~~~~~~p~~~~~~~~~~ld~~I~~a~~~gi~V~iD~H~~~~~~--~~~~~s~~~~ 153 (407)
T COG2730 76 EEDFDQIKSAGFNAVRIPIGYWALQATDGDNPYLIGLTQLKILDEAINWAKKLGIYVLIDLHGYPGGN--NGHEHSGYTS 153 (407)
T ss_pred hhHHHHHHHcCCcEEEcccchhhhhccCCCCCCeecchHHHHHHHHHHHHHhcCeeEEEEecccCCCC--CCcCcccccc
Confidence 899999999999999999996555553 4444233333688889999999999999863221110 0001123222
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS 234 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~ 234 (323)
.++. .....++..+.++.++.+ +++...||++|+-||+-.
T Consensus 154 ~~~~------~~~~~~~~~~~w~~ia~~-----f~~~~~VIg~~~~NEP~~ 193 (407)
T COG2730 154 DYKE------ENENVEATIDIWKFIANR-----FKNYDTVIGFELINEPNG 193 (407)
T ss_pred cccc------cchhHHHHHHHHHHHHHh-----ccCCCceeeeeeecCCcc
Confidence 1211 234456666677777777 777889999999999985
No 43
>COG3934 Endo-beta-mannanase [Carbohydrate transport and metabolism]
Probab=97.42 E-value=0.00014 Score=72.99 Aligned_cols=173 Identities=19% Similarity=0.225 Sum_probs=120.9
Q ss_pred eEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCccc---CCCeec-ccccchHHHHHHHHHHCC
Q psy13819 82 TFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEV---HPGHYH-YDGHRDIEHFLQLAVEED 157 (323)
Q Consensus 82 ~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP---~~G~fd-f~g~~dl~~fl~~a~e~G 157 (323)
.|.++++++..++..--++++..++.+++++.|+.+|++++|+.+. .-|+ ++|.=+ -++..-++.|++.|.+++
T Consensus 3 ~F~Lg~n~wprIanikmw~~~~~~ei~~dle~a~~vg~k~lR~fiL--DgEdc~d~~G~~na~s~~~y~~~fla~a~~l~ 80 (587)
T COG3934 3 VFALGLNRWPRIANIKMWPAIGNREIKADLEPAGFVGVKDLRLFIL--DGEDCRDKEGYRNAGSNVWYAAWFLAPAGYLD 80 (587)
T ss_pred eEEeccccchhhhhhhHHHHhhhhhhhcccccccCccceeEEEEEe--cCcchhhhhceecccccHHHHHHHhhhcccCc
Confidence 4788889998888877788888889999999999999999999743 2244 233222 234556899999999999
Q ss_pred CeEEEccCCccccccCCCCCc---ccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC
Q psy13819 158 LYILLRPGPFICGKRDFGGFP---PWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS 234 (323)
Q Consensus 158 L~vilr~GPyi~aEw~~gg~P---~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~ 234 (323)
|+|+++. |.+-=+.||.- .|.-...|+-.. .|+.++...++|++.|++. |+.++.|.+|-+-||.-+
T Consensus 81 lkvlitl---ivg~~hmgg~Nw~Ipwag~~~pdn~i--yD~k~~~~~kkyvedlVk~-----yk~~ptI~gw~l~Ne~lv 150 (587)
T COG3934 81 LKVLITL---IVGLKHMGGTNWRIPWAGEQSPDNVI--YDPKFRGPGKKYVEDLVKP-----YKLDPTIAGWALRNEPLV 150 (587)
T ss_pred ceEEEEE---eecccccCcceeEeecCCCCCccccc--cchhhcccHHHHHHHHhhh-----hccChHHHHHHhcCCccc
Confidence 9998875 33322334431 244332344332 3678888888999998886 888999999999999433
Q ss_pred C-CCChHHHHHHHHHHHH--HhcCCceEEEee-CCCC
Q psy13819 235 D-AECDPAHAVWLRDLLR--TYVQDKAVLYST-DGAF 267 (323)
Q Consensus 235 ~-~~~~~~Y~~~l~~~~~--~~~g~~~~l~t~-dg~~ 267 (323)
- ...+..+++|++.++- +.++.. +++++ |...
T Consensus 151 ~~p~s~N~f~~w~~emy~yiK~ldd~-hlvsvGD~~s 186 (587)
T COG3934 151 EAPISVNNFWDWSGEMYAYIKWLDDG-HLVSVGDPAS 186 (587)
T ss_pred cccCChhHHHHHHHHHHHHhhccCCC-CeeecCCcCC
Confidence 1 1346889999999863 334433 45554 5443
No 44
>COG3867 Arabinogalactan endo-1,4-beta-galactosidase [Carbohydrate transport and metabolism]
Probab=97.05 E-value=0.0033 Score=60.11 Aligned_cols=116 Identities=19% Similarity=0.236 Sum_probs=71.9
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHH---HHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQL---AVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~---a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
=+|.|+.+|+.|+|.||+-| |+.---..|.=--.|..|+.+.+++ |+..||+|++.+. | .++..=|+- .
T Consensus 65 ~qD~~~iLK~~GvNyvRlRv-wndP~dsngn~yggGnnD~~k~ieiakRAk~~GmKVl~dFH-Y----SDfwaDPak--Q 136 (403)
T COG3867 65 RQDALQILKNHGVNYVRLRV-WNDPYDSNGNGYGGGNNDLKKAIEIAKRAKNLGMKVLLDFH-Y----SDFWADPAK--Q 136 (403)
T ss_pred HHHHHHHHHHcCcCeEEEEE-ecCCccCCCCccCCCcchHHHHHHHHHHHHhcCcEEEeecc-c----hhhccChhh--c
Confidence 37889999999999999965 5554333333333456788877776 5568999999883 1 122111221 1
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS 234 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~ 234 (323)
+.|..--..+-+...+++-.|.+..+..+++ +|=-+=|+||.||-..
T Consensus 137 ~kPkaW~~l~fe~lk~avy~yTk~~l~~m~~----eGi~pdmVQVGNEtn~ 183 (403)
T COG3867 137 KKPKAWENLNFEQLKKAVYSYTKYVLTTMKK----EGILPDMVQVGNETNG 183 (403)
T ss_pred CCcHHhhhcCHHHHHHHHHHHHHHHHHHHHH----cCCCccceEeccccCC
Confidence 1122111234455677777888888887553 2333459999999754
No 45
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=96.64 E-value=0.016 Score=49.16 Aligned_cols=97 Identities=12% Similarity=0.263 Sum_probs=65.0
Q ss_pred HHHHHHHHhCCCEEEEEEe-------C--CCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcc
Q psy13819 109 WIMRAMRAAGLNALSTYVE-------W--RSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPP 179 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~yv~-------W--~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~ 179 (323)
+-++.+|++|+|+|.++.- | ..+.+.|+- + .+=|.++++.|++.||+|+++...- --++...-.|.
T Consensus 4 ~~~~~lk~~~v~si~i~a~~h~g~ayYPt~~~~~hp~L---~-~Dllge~v~a~h~~Girv~ay~~~~-~d~~~~~~HPe 78 (132)
T PF14871_consen 4 QFVDTLKEAHVNSITIFAKCHGGYAYYPTKVGPRHPGL---K-RDLLGEQVEACHERGIRVPAYFDFS-WDEDAAERHPE 78 (132)
T ss_pred HHHHHHHHhCCCEEEEEcccccEEEEccCCCCcCCCCC---C-cCHHHHHHHHHHHCCCEEEEEEeee-cChHHHHhCCc
Confidence 4578899999999998332 2 334444543 1 2346888999999999999988533 22333345799
Q ss_pred cccccCCC-------------eeecCCCHhHHHHHHHHHHHHHHH
Q psy13819 180 WLLKVAPD-------------ILLRQNHPVYQRYVTRWFQELFPR 211 (323)
Q Consensus 180 Wl~~~~p~-------------~~~R~~~~~f~~~~~~~~~~l~~~ 211 (323)
|+... ++ +..-..+..|++.+..-+++++.+
T Consensus 79 W~~~~-~~G~~~~~~~~~~~~~~~~c~ns~Y~e~~~~~i~Ei~~~ 122 (132)
T PF14871_consen 79 WFVRD-ADGRPMRGERFGYPGWYTCCLNSPYREFLLEQIREILDR 122 (132)
T ss_pred eeeEC-CCCCCcCCCCcCCCCceecCCCccHHHHHHHHHHHHHHc
Confidence 98762 22 112244567888888888888776
No 46
>KOG2230|consensus
Probab=96.40 E-value=0.016 Score=59.63 Aligned_cols=113 Identities=18% Similarity=0.281 Sum_probs=85.7
Q ss_pred CCeEEeCCeeeEEeeeeecCC-----CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHH
Q psy13819 80 GDTFRLNEDPFQFVSGSFHYF-----RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAV 154 (323)
Q Consensus 80 ~~~~~~dGk~~~i~~g~~Hy~-----r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~ 154 (323)
+..|.+||.|+++.+++.-+. |...+.-+-.|+.++++|.|++|++ .. |. -.-+.|.++|.
T Consensus 327 nfyfkin~~pvflkg~nwip~s~f~dr~t~~~~~~LL~Sv~e~~MN~lRVW---GG-----Gv------YEsd~FY~lad 392 (867)
T KOG2230|consen 327 NFYFKINDEPVFLKGTNWIPVSMFRDRENIAKTEFLLDSVAEVGMNMLRVW---GG-----GV------YESDYFYQLAD 392 (867)
T ss_pred eeEEEEcCcEEEeecCCccChHHHHhhHHHHHHHHHHHHHHHhCcceEEEe---cC-----cc------ccchhHHHHhh
Confidence 456899999999999876653 3445566778999999999999993 32 23 33467899999
Q ss_pred HCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCC
Q psy13819 155 EEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENE 231 (323)
Q Consensus 155 e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE 231 (323)
+.||.|.-..- +.||- +| .+..|++.++.=++.-+.| .+.|++||.|-=.||
T Consensus 393 ~lGilVWQD~M-FACAl-------------YP------t~~eFl~sv~eEV~yn~~R-----ls~HpSviIfsgNNE 444 (867)
T KOG2230|consen 393 SLGILVWQDMM-FACAL-------------YP------TNDEFLSSVREEVRYNAMR-----LSHHPSVIIFSGNNE 444 (867)
T ss_pred hccceehhhhH-HHhhc-------------cc------CcHHHHHHHHHHHHHHHHh-----hccCCeEEEEeCCCc
Confidence 99998876652 44552 33 5789999999888888888 567899999876665
No 47
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 39 GH39 from CAZY comprises enzymes with several known activities; alpha-L-iduronidase (3.2.1.76 from EC); beta-xylosidase (3.2.1.37 from EC). The most highly conserved regions in these enzymes are located in their N-terminal sections. These contain a glutamic acid residue which, on the basis of similarities with other families of glycosyl hydrolases [], probably acts as the proton donor in their catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BS9_D 2BFG_E 1W91_B 1UHV_D 1PX8_A.
Probab=96.23 E-value=0.015 Score=59.35 Aligned_cols=61 Identities=16% Similarity=0.283 Sum_probs=39.1
Q ss_pred CChhcHHHHHHHHH-HhCCCEEEEE-Ee---CCCc-c-cCCC--eecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 102 APPGRWCWIMRAMR-AAGLNALSTY-VE---WRSH-E-VHPG--HYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 102 ~p~~~W~~~l~~mk-~~G~N~Vr~y-v~---W~~h-E-P~~G--~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.-++.|+..|+.++ ++||..||+. ++ -... + ...| .|||+. +++++|...++||++++..|
T Consensus 36 ~l~~~~q~~l~~~~~~~gf~yvR~h~l~~ddm~~~~~~~~~~~~~Ynf~~---lD~i~D~l~~~g~~P~vel~ 105 (486)
T PF01229_consen 36 LLRADWQEQLRELQEELGFRYVRFHGLFSDDMMVYSESDEDGIPPYNFTY---LDQILDFLLENGLKPFVELG 105 (486)
T ss_dssp GGBHHHHHHHHHHHCCS--SEEEES-TTSTTTT-EEEEETTEEEEE--HH---HHHHHHHHHHCT-EEEEEE-
T ss_pred HhhHHHHHHHHHHHhccCceEEEEEeeccCchhhccccccCCCCcCChHH---HHHHHHHHHHcCCEEEEEEE
Confidence 34678999999887 6899999992 22 0111 1 1233 399998 77779999999999998875
No 48
>PF02638 DUF187: Glycosyl hydrolase like GH101; InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=95.71 E-value=0.09 Score=50.77 Aligned_cols=117 Identities=17% Similarity=0.251 Sum_probs=71.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCC-------CcccCCCeec-----ccccchHHHHHHHHHHCCCeEEEccCCc-cc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWR-------SHEVHPGHYH-----YDGHRDIEHFLQLAVEEDLYILLRPGPF-IC 169 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~-------~hEP~~G~fd-----f~g~~dl~~fl~~a~e~GL~vilr~GPy-i~ 169 (323)
.++.-++.|++++++|+|+|=+-|-+. ..+|..+... -.|.|=|+.+|+.|+++||.|...+ .+ ..
T Consensus 17 ~~~~~~~~l~~l~~~~~N~V~~qVr~~gda~Y~S~~~p~s~~~~g~~~~~pg~DpL~~~I~eaHkrGlevHAW~-~~~~~ 95 (311)
T PF02638_consen 17 SKEQIDEMLDDLKSAGFNAVFVQVRPRGDALYPSDIEPWSGYLTGKQGKDPGFDPLEFMIEEAHKRGLEVHAWF-RVGFN 95 (311)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEEEEeCcEEEecccccccccccCCCCCCCCCccHHHHHHHHHHHcCCEEEEEE-EeecC
Confidence 367889999999999999998877663 3444221110 0134559999999999999998765 11 10
Q ss_pred cc-c--CCCCCcccccccCCCeeec------------CCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccC
Q psy13819 170 GK-R--DFGGFPPWLLKVAPDILLR------------QNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVE 229 (323)
Q Consensus 170 aE-w--~~gg~P~Wl~~~~p~~~~R------------~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Qie 229 (323)
+. . -....|.|+..+.++.... ..+|.-+++..+.+++++++ | .|=++|++
T Consensus 96 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~lnP~~PeVr~~i~~~v~Eiv~~-----Y----dvDGIhlD 161 (311)
T PF02638_consen 96 APDVSHILKKHPEWFAVNHPGWVRTYEDANGGYYWLNPGHPEVRDYIIDIVKEIVKN-----Y----DVDGIHLD 161 (311)
T ss_pred CCchhhhhhcCchhheecCCCceeecccCCCCceEECCCCHHHHHHHHHHHHHHHhc-----C----CCCeEEec
Confidence 00 0 0123577876434543322 34566666666666666655 4 35566666
No 49
>PRK09936 hypothetical protein; Provisional
Probab=95.28 E-value=0.44 Score=45.62 Aligned_cols=59 Identities=19% Similarity=0.396 Sum_probs=46.7
Q ss_pred CCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccc-hHHHHHHHHHHCCCeEEEcc
Q psy13819 100 FRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHR-DIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 100 ~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~-dl~~fl~~a~e~GL~vilr~ 164 (323)
.+++++.|++.++.+++.|++|+=+ -|..- |.=||.+.+ -|.+.++.|++.||+|+|..
T Consensus 33 ~~~~~~qWq~~~~~~~~~G~~tLiv--QWt~y----G~~~fg~~~g~La~~l~~A~~~Gl~v~vGL 92 (296)
T PRK09936 33 SQVTDTQWQGLWSQLRLQGFDTLVV--QWTRY----GDADFGGQRGWLAKRLAAAQQAGLKLVVGL 92 (296)
T ss_pred CCCCHHHHHHHHHHHHHcCCcEEEE--Eeeec----cCCCcccchHHHHHHHHHHHHcCCEEEEcc
Confidence 4688999999999999999999866 34333 111776543 57888999999999999976
No 50
>smart00642 Aamy Alpha-amylase domain.
Probab=95.06 E-value=0.05 Score=47.76 Aligned_cols=63 Identities=11% Similarity=0.110 Sum_probs=44.5
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCccc--C-----CCee-----cccccchHHHHHHHHHHCCCeEEEccCCc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEV--H-----PGHY-----HYDGHRDIEHFLQLAVEEDLYILLRPGPF 167 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP--~-----~G~f-----df~g~~dl~~fl~~a~e~GL~vilr~GPy 167 (323)
+.+.+.|.-+|++|+|+|.+.-....... . +..| .|...++++++++.|+++||+||+..=+-
T Consensus 19 ~gi~~~l~yl~~lG~~~I~l~Pi~~~~~~~~~~~gY~~~d~~~i~~~~Gt~~d~~~lv~~~h~~Gi~vilD~V~N 93 (166)
T smart00642 19 QGIIEKLDYLKDLGVTAIWLSPIFESPQGYPSYHGYDISDYKQIDPRFGTMEDFKELVDAAHARGIKVILDVVIN 93 (166)
T ss_pred HHHHHHHHHHHHCCCCEEEECcceeCCCCCCCCCCcCccccCCCCcccCCHHHHHHHHHHHHHCCCEEEEEECCC
Confidence 45667788899999999999444433321 1 1111 23445789999999999999999987443
No 51
>COG3623 SgaU Putative L-xylulose-5-phosphate 3-epimerase [Carbohydrate transport and metabolism]
Probab=94.82 E-value=1.1 Score=41.80 Aligned_cols=40 Identities=23% Similarity=0.259 Sum_probs=32.3
Q ss_pred ccCceEEeecCCCCC-chhHHHHHHHhhCCCCCeEEEEeccc
Q psy13819 274 TVDGVYSTVDFTVFK-DVNVSFQAQRTRAPQGPLVNAEFEFF 314 (323)
Q Consensus 274 ~~~g~~~~~nfg~~~-~~~~~~~~~r~~~p~~Plm~~E~~~g 314 (323)
+.||.+--+-||.+. +....|+.|++....+|++ -|.|+-
T Consensus 220 ~~~GqFrdvpfGeG~Vdf~~~f~~lk~~ny~gpfL-IEMWse 260 (287)
T COG3623 220 TSPGQFRDVPFGEGCVDFEECFKTLKQLNYRGPFL-IEMWSE 260 (287)
T ss_pred cCCCccccCCcCCcchhHHHHHHHHHHhCCCCcee-hhhhhh
Confidence 478888878888653 6788999999999999954 588875
No 52
>TIGR01515 branching_enzym alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase. A sequence from Arabidopsis thaliana, GP|9294564, scores just above trusted, but appears either to contain corrupt sequence or, more likely, to be a pseudogene as some of the conserved catalytic residues common to the alpha amylase family are not conserved here.
Probab=94.73 E-value=0.5 Score=49.82 Aligned_cols=56 Identities=11% Similarity=0.125 Sum_probs=40.1
Q ss_pred HHHH-HHHHHhCCCEEEE-EEeCCCcccCCCee----------cccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIM-RAMRAAGLNALST-YVEWRSHEVHPGHY----------HYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l-~~mk~~G~N~Vr~-yv~W~~hEP~~G~f----------df~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.| .-+|++|+|+|.+ +|+.......= -| .|...+++.++++.|+++||+|||..
T Consensus 159 ~~~l~dyl~~LGvt~i~L~Pi~e~~~~~~w-GY~~~~y~~~~~~~Gt~~dlk~lV~~~H~~Gi~VilD~ 226 (613)
T TIGR01515 159 ADQLIPYVKELGFTHIELLPVAEHPFDGSW-GYQVTGYYAPTSRFGTPDDFMYFVDACHQAGIGVILDW 226 (613)
T ss_pred HHHHHHHHHHcCCCEEEECCcccCCCCCCC-CCCcccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 3454 7889999999999 77653211100 12 23345789999999999999999985
No 53
>PF02055 Glyco_hydro_30: O-Glycosyl hydrolase family 30; InterPro: IPR001139 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 30 GH30 from CAZY comprises enzymes with only one known activity; glucosylceramidase (3.2.1.45 from EC). Family 30 encompasses the mammalian glucosylceramidases. Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase), cleaves the glucosidic bonds of glucosylceramide and synthetic beta-glucosides []. Any one of over 50 different mutations in the gene of glucocerebrosidase have been found to affect activity of this hydrolase, producing variants of Gaucher disease, the most prevalent lysosomal storage disease [, ].; GO: 0004348 glucosylceramidase activity, 0006665 sphingolipid metabolic process, 0007040 lysosome organization, 0005764 lysosome; PDB: 2VT0_B 1NOF_A 2Y24_A 2WCG_B 2J25_A 3GXM_D 1Y7V_B 2NT0_C 3GXF_C 3GXD_A ....
Probab=93.79 E-value=3.3 Score=42.69 Aligned_cols=254 Identities=14% Similarity=0.168 Sum_probs=121.3
Q ss_pred cceEEEEEEeeCCCCccccchhhhhh-ccccccceEEEEecCCeEEeCCeeeEEeeeeecC------CCCChhcHHHHHH
Q psy13819 40 TNRITIFELTRAPDKYDVDFVDKISQ-RRARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHY------FRAPPGRWCWIMR 112 (323)
Q Consensus 40 ~N~i~i~e~~~~~~~~~i~~~~~~i~-~~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy------~r~p~~~W~~~l~ 112 (323)
..+..++.....+ ..++.....+. ......-+++|+.+...-.|+| +||++-- .+++++.=++.|+
T Consensus 32 ~g~~~~y~ts~~g--~rl~~~~~~~~~~~~~~~~~i~id~~~~yQ~i~G-----FGga~Tdasa~~l~~l~~~~r~~ll~ 104 (496)
T PF02055_consen 32 AGQAAVYTTSKSG--ARLKQSVGNITNSDPSSSVTITIDPSTTYQTIDG-----FGGAFTDASAYNLQKLSEEQRDELLR 104 (496)
T ss_dssp TTEEEEEEEETTS---ESEEEEEEEBSS--SSSEEEEEEEEEEEEE--E-----EEEE--HHHHHHHHTS-HHHHHHHHH
T ss_pred CCeEEEEEECCcc--chhhhhhhcccccccccceeeeecccccceEEEE-----EeeeHHHHHHHHHHhCCHHHHHHHHH
Confidence 4567777655422 22222211111 1222334555654444445555 8888752 3455443333333
Q ss_pred HH---HHhCCCEEEEEEeCCCcccC-------CCeec---cccc-ch---HHHHHHHHHH--CCCeEEEccCCccccccC
Q psy13819 113 AM---RAAGLNALSTYVEWRSHEVH-------PGHYH---YDGH-RD---IEHFLQLAVE--EDLYILLRPGPFICGKRD 173 (323)
Q Consensus 113 ~m---k~~G~N~Vr~yv~W~~hEP~-------~G~fd---f~g~-~d---l~~fl~~a~e--~GL~vilr~GPyi~aEw~ 173 (323)
.+ +-+|+|.+|+.+-=+..... |+.|+ |+-. +| ...+|+.|.+ .+|+++.-| |
T Consensus 105 ~~F~~~G~g~s~~R~pIgssDfs~~~Yty~d~~~D~~l~~Fs~~~~d~~~~ip~ik~a~~~~~~lki~aSp-------W- 176 (496)
T PF02055_consen 105 SLFSEDGIGYSLLRVPIGSSDFSTRPYTYDDVPGDFNLSNFSIAREDKKYKIPLIKEALAINPNLKIFASP-------W- 176 (496)
T ss_dssp HHHSTTTT---EEEEEES--SSSSS---ST-STTHTTTTT---HHHHHTTHHHHHHHHHHHHTT-EEEEEE-------S-
T ss_pred HHhhcCCceEEEEEeeccCcCCcCCcccccCCCCCCccccCCccccchhhHHHHHHHHHHhCCCcEEEEec-------C-
Confidence 33 44899999997743222211 23222 1111 12 3467777666 358888877 3
Q ss_pred CCCCcccccccCC---Ceeec-CCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC-------CCCC----
Q psy13819 174 FGGFPPWLLKVAP---DILLR-QNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS-------DAEC---- 238 (323)
Q Consensus 174 ~gg~P~Wl~~~~p---~~~~R-~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~-------~~~~---- 238 (323)
..|.|+..+.. .-.++ ...+.|.+...+|+.+.++..+ ++|=+|-++-+.||+.. +..|
T Consensus 177 --SpP~WMKtn~~~~g~g~l~g~~~~~y~~~yA~Y~vkfi~aY~----~~GI~i~aiT~QNEP~~~~~~~~~~~s~~~t~ 250 (496)
T PF02055_consen 177 --SPPAWMKTNGSMNGGGSLKGSLGDEYYQAYADYFVKFIQAYK----KEGIPIWAITPQNEPDNGSDPNYPWPSMGWTP 250 (496)
T ss_dssp -----GGGBTTSSSCSS-BBSCGTTSHHHHHHHHHHHHHHHHHH----CTT--ESEEESSSSCCGGGSTT-SSC--B--H
T ss_pred --CCCHHHccCCcCcCCCccCCCCCchhHHHHHHHHHHHHHHHH----HCCCCeEEEeccCCCCCCCCCCCCCCcCCCCH
Confidence 58999977421 12233 2345677777777777766633 34668889999999873 2222
Q ss_pred --hHHHHH-HHHHHHHHh-cCCceEEEeeCCCCcc---c----ccc-ccCceEEeec--CCCCCchhHHHHHHHhhCCCC
Q psy13819 239 --DPAHAV-WLRDLLRTY-VQDKAVLYSTDGAFDA---Y----LRC-TVDGVYSTVD--FTVFKDVNVSFQAQRTRAPQG 304 (323)
Q Consensus 239 --~~~Y~~-~l~~~~~~~-~g~~~~l~t~dg~~~~---~----~~~-~~~g~~~~~n--fg~~~~~~~~~~~~r~~~p~~ 304 (323)
-++|+. +|.-++++. +|.++-++..|..... + +.. ........+- .+........+....+..|++
T Consensus 251 ~~~~~Fi~~~LgP~l~~~~~g~d~kI~~~D~n~~~~~~~~~~il~d~~A~~yv~GiA~HwY~g~~~~~~l~~~h~~~P~k 330 (496)
T PF02055_consen 251 EEQADFIKNYLGPALRKAGLGKDVKILIYDHNRDNLPDYADTILNDPEAAKYVDGIAFHWYGGDPSPQALDQVHNKFPDK 330 (496)
T ss_dssp HHHHHHHHHTHHHHHHTSTT-TTSEEEEEEEEGGGTTHHHHHHHTSHHHHTTEEEEEEEETTCS-HCHHHHHHHHHSTTS
T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEEEEecCCcccchhhhhhhcChhhHhheeEEEEECCCCCchhhHHHHHHHHCCCc
Confidence 156665 477788774 3447766665432211 1 111 0001122222 222222334566667778999
Q ss_pred CeEEEEeccc
Q psy13819 305 PLVNAEFEFF 314 (323)
Q Consensus 305 Plm~~E~~~g 314 (323)
.++.||-..|
T Consensus 331 ~l~~TE~~~g 340 (496)
T PF02055_consen 331 FLLFTEACCG 340 (496)
T ss_dssp EEEEEEEESS
T ss_pred EEEeeccccC
Confidence 9999998766
No 54
>PRK05402 glycogen branching enzyme; Provisional
Probab=93.77 E-value=1.6 Score=47.02 Aligned_cols=56 Identities=11% Similarity=0.154 Sum_probs=37.6
Q ss_pred HH-HHHHHHhCCCEEEE-EEeCC----CcccCCC-----eecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 109 WI-MRAMRAAGLNALST-YVEWR----SHEVHPG-----HYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 109 ~~-l~~mk~~G~N~Vr~-yv~W~----~hEP~~G-----~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+. |.-+|++|+|+|.+ +|+=. .|--.+. .-.|...+++.+|++.|+++||+|||..
T Consensus 269 ~~l~~ylk~LGv~~i~L~Pi~e~~~~~~~GY~~~~y~ai~~~~Gt~~dfk~lV~~~H~~Gi~VilD~ 335 (726)
T PRK05402 269 DQLIPYVKEMGFTHVELLPIAEHPFDGSWGYQPTGYYAPTSRFGTPDDFRYFVDACHQAGIGVILDW 335 (726)
T ss_pred HHHHHHHHHcCCCEEEECCcccCCCCCCCCCCcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 44 47789999999999 55310 0000000 0113345789999999999999999985
No 55
>PF14587 Glyco_hydr_30_2: O-Glycosyl hydrolase family 30; PDB: 3CLW_B.
Probab=93.76 E-value=2.3 Score=42.32 Aligned_cols=176 Identities=16% Similarity=0.182 Sum_probs=87.1
Q ss_pred eEEEEecCCeEEeCCeeeEEeeeeecCCC------CChhcHHHHHHHH-----------HHhCCCEEEEEEe--------
Q psy13819 73 TFAIDLAGDTFRLNEDPFQFVSGSFHYFR------APPGRWCWIMRAM-----------RAAGLNALSTYVE-------- 127 (323)
Q Consensus 73 ~~~v~~~~~~~~~dGk~~~i~~g~~Hy~r------~p~~~W~~~l~~m-----------k~~G~N~Vr~yv~-------- 127 (323)
+++|+.+...-.++| +|++.--++ +|.+.=++.++++ |-+|||.+|.-|-
T Consensus 3 ~vtId~~~~~QtieG-----fGaS~aW~a~~~Gk~w~~~~r~~iaDlLFS~~~~~~g~p~GlGLSI~RyNIGgGs~~~~d 77 (384)
T PF14587_consen 3 SVTIDPSTTYQTIEG-----FGASDAWWANFVGKNWPEEKRNQIADLLFSTENDSNGNPKGLGLSIWRYNIGGGSAEQGD 77 (384)
T ss_dssp EEEEEEEEEEEE--E-----EEEE-TTTHHHHHHHS-HHHHHHHHHHHH---B-TTS-B-S---S-EEEE---STTTTTT
T ss_pred eEEEcCCCCceeecc-----ccHHHhHHHHHhcccCCHHHHHHHHHHhcCCCcccCCCCCCceeeeeeeccccCCccccc
Confidence 455655555556665 666665443 3322222233332 4589999999553
Q ss_pred -------CCCcc---cCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCC---e-eecCC
Q psy13819 128 -------WRSHE---VHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPD---I-LLRQN 193 (323)
Q Consensus 128 -------W~~hE---P~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~---~-~~R~~ 193 (323)
|.+.| +..|.|||+....=+.|++.|+++|+..++-+ .-..|.|+..+--. - ..-..
T Consensus 78 ~~~i~~~~rr~e~f~~~dg~yDW~~D~gQrwfL~~Ak~rGV~~f~aF---------SNSPP~~MT~NG~~~g~~~~~~NL 148 (384)
T PF14587_consen 78 SSGIRDPWRRAESFLPADGSYDWDADAGQRWFLKAAKERGVNIFEAF---------SNSPPWWMTKNGSASGGDDGSDNL 148 (384)
T ss_dssp SS--SSSTT----SB-TTS-B-TTSSHHHHHHHHHHHHTT---EEEE----------SSS-GGGSSSSSSB-S-SSS-SS
T ss_pred CccCCCcccCCccccCCCCCcCCCCCHHHHHHHHHHHHcCCCeEEEe---------ecCCCHHHhcCCCCCCCCcccccc
Confidence 32222 55789999987777889999999999988766 23578888763110 0 00112
Q ss_pred CHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCC---CC---C------hHHHHHHHHHHHHHhcCCceEEE
Q psy13819 194 HPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSD---AE---C------DPAHAVWLRDLLRTYVQDKAVLY 261 (323)
Q Consensus 194 ~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~---~~---~------~~~Y~~~l~~~~~~~~g~~~~l~ 261 (323)
.+...++...|+..++++++. .|=+|--+..-||+... +. | -.+.++.|...+++. |....+.
T Consensus 149 k~d~y~~FA~YLa~Vv~~~~~----~GI~f~~IsP~NEP~~~W~~~~QEG~~~~~~e~a~vI~~L~~~L~~~-GL~t~I~ 223 (384)
T PF14587_consen 149 KPDNYDAFADYLADVVKHYKK----WGINFDYISPFNEPQWNWAGGSQEGCHFTNEEQADVIRALDKALKKR-GLSTKIS 223 (384)
T ss_dssp -TT-HHHHHHHHHHHHHHHHC----TT--EEEEE--S-TTS-GG--SS-B----HHHHHHHHHHHHHHHHHH-T-S-EEE
T ss_pred ChhHHHHHHHHHHHHHHHHHh----cCCccceeCCcCCCCCCCCCCCcCCCCCCHHHHHHHHHHHHHHHHhc-CCCceEE
Confidence 345566677777777777532 34467777888998642 11 1 157888898888885 7666555
Q ss_pred eeCCCC
Q psy13819 262 STDGAF 267 (323)
Q Consensus 262 t~dg~~ 267 (323)
.+|...
T Consensus 224 ~~Ea~~ 229 (384)
T PF14587_consen 224 ACEAGD 229 (384)
T ss_dssp EEEESS
T ss_pred ecchhh
Confidence 555443
No 56
>COG1649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=93.49 E-value=0.44 Score=47.92 Aligned_cols=86 Identities=16% Similarity=0.223 Sum_probs=58.4
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCC--------Cccc----CCCee-cccccchHHHHHHHHHHCCCeEEEccCCccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWR--------SHEV----HPGHY-HYDGHRDIEHFLQLAVEEDLYILLRPGPFIC 169 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~--------~hEP----~~G~f-df~g~~dl~~fl~~a~e~GL~vilr~GPyi~ 169 (323)
.+..-.+.|+.++++|+|||-.-| |+ ...| .||.+ --.|.|=|...|++|++.||.|+.+.=||..
T Consensus 62 ~~~el~~~ld~l~~ln~NTv~~qV-~~~G~~lypS~~~p~s~~~~~~~~~~~g~DpLa~~I~~AHkr~l~v~aWf~~~~~ 140 (418)
T COG1649 62 QRQELKDILDDLQKLNFNTVYPQV-WNDGDALYPSAVLPWSDGLPGVLGVDPGYDPLAFVIAEAHKRGLEVHAWFNPYRM 140 (418)
T ss_pred cHHHHHHHHHHHHHcCCceeEEEE-ecCccccccccccccccCcCcccCCCCCCChHHHHHHHHHhcCCeeeechhhccc
Confidence 467888999999999999997733 22 1122 24433 2345667888899999999999998877776
Q ss_pred cccCCC---CCcccccccCCCee
Q psy13819 170 GKRDFG---GFPPWLLKVAPDIL 189 (323)
Q Consensus 170 aEw~~g---g~P~Wl~~~~p~~~ 189 (323)
|--..- -.|.|+..+.|+..
T Consensus 141 a~~~s~~~~~~p~~~~~~~~~~~ 163 (418)
T COG1649 141 APPTSPLTKRHPHWLTTKRPGWV 163 (418)
T ss_pred CCCCChhHhhCCCCcccCCCCeE
Confidence 642221 23667766545544
No 57
>PRK14705 glycogen branching enzyme; Provisional
Probab=93.23 E-value=3 Score=47.42 Aligned_cols=56 Identities=9% Similarity=0.159 Sum_probs=38.5
Q ss_pred HHHHHHHHhCCCEEEE-EEe-------CCCc--ccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 109 WIMRAMRAAGLNALST-YVE-------WRSH--EVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~-yv~-------W~~h--EP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.|.-+|++|+|+|.+ .|+ |..- ....=.-.|...+|+.+|++.|+++||.|||..
T Consensus 770 ~lldYlk~LGvt~IeLmPv~e~p~~~swGY~~~~y~ap~~ryGt~~dfk~lVd~~H~~GI~VILD~ 835 (1224)
T PRK14705 770 ELVDYVKWLGFTHVEFMPVAEHPFGGSWGYQVTSYFAPTSRFGHPDEFRFLVDSLHQAGIGVLLDW 835 (1224)
T ss_pred HHHHHHHHhCCCEEEECccccCCCCCCCCCCccccCCcCcccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 4468899999999999 552 3110 000001123345799999999999999999984
No 58
>PF00128 Alpha-amylase: Alpha amylase, catalytic domain; InterPro: IPR006047 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain, such as alpha-amylase, belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. The maltogenic alpha-amylase is an enzyme which catalyses hydrolysis of (1-4)-alpha-D-glucosidic linkages in polysaccharides so as to remove successive alpha-maltose residues from the non-reducing ends of the chains in the conversion of starch to maltose. Other enzymes include neopullulanase, which hydrolyses pullulan to panose, and cyclomaltodextrinase, which hydrolyses cyclodextrins. This entry represents the catalytic domain found in several protein members of this family. It has a structure consisting of an 8 stranded alpha/beta barrel that contains the active site, interrupted by a ~70 amino acid calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 3FAX_A 3FAW_A 2DH3_B 2DH2_A 1CIU_A 1A47_A 3BMW_A 3BMV_A 2FH8_A 2FH6_A ....
Probab=93.01 E-value=0.1 Score=48.18 Aligned_cols=58 Identities=16% Similarity=0.181 Sum_probs=40.9
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCC--eecc-------cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPG--HYHY-------DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G--~fdf-------~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..+.|+-+|++|+|+|.+.-.+...+..-| .-|| ...+++.++++.|+++||+||+..
T Consensus 6 i~~kLdyl~~lGv~~I~l~Pi~~~~~~~~gY~~~d~~~vd~~~Gt~~d~~~Lv~~~h~~gi~VilD~ 72 (316)
T PF00128_consen 6 IIDKLDYLKDLGVNAIWLSPIFESPNGYHGYDPSDYYAVDPRFGTMEDFKELVDAAHKRGIKVILDV 72 (316)
T ss_dssp HHHTHHHHHHHTESEEEESS-EESSSSTTTTSESEEEEESTTTBHHHHHHHHHHHHHHTTCEEEEEE
T ss_pred HHHhhHHHHHcCCCceecccccccccccccccceeeeccccccchhhhhhhhhhccccccceEEEee
Confidence 356789999999999999444332211111 2232 235799999999999999999975
No 59
>PRK12568 glycogen branching enzyme; Provisional
Probab=92.83 E-value=3.8 Score=44.23 Aligned_cols=56 Identities=11% Similarity=0.239 Sum_probs=39.4
Q ss_pred HHHHHHHHhCCCEEEE-EEe-------CCCcccCCCee----cccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 109 WIMRAMRAAGLNALST-YVE-------WRSHEVHPGHY----HYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~-yv~-------W~~hEP~~G~f----df~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
+.|.-+|++|+|+|.+ +|+ |..- | -|-| .|...+++.+|++.|+++||.|||..=|
T Consensus 274 ~ll~ylk~LGvt~I~LmPi~e~~~~~~wGY~-~-~~~~a~~~~~G~~~dfk~lV~~~H~~Gi~VIlD~V~ 341 (730)
T PRK12568 274 QLIPYVQQLGFTHIELLPITEHPFGGSWGYQ-P-LGLYAPTARHGSPDGFAQFVDACHRAGIGVILDWVS 341 (730)
T ss_pred HHHHHHHHcCCCEEEECccccCCCCCCCCCC-C-CcCCccCcccCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 4468899999999999 552 3210 0 0111 2344578999999999999999998544
No 60
>PRK14706 glycogen branching enzyme; Provisional
Probab=92.72 E-value=1.9 Score=45.75 Aligned_cols=51 Identities=14% Similarity=0.122 Sum_probs=35.6
Q ss_pred HHHHHHhCCCEEEE-EEeCCCcccCCCe--e----------cccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 111 MRAMRAAGLNALST-YVEWRSHEVHPGH--Y----------HYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 111 l~~mk~~G~N~Vr~-yv~W~~hEP~~G~--f----------df~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.-+|++|+|+|++ .|. -| |..+. | .|...+++.+|++.|+++||.|||..
T Consensus 174 ~~ylk~lG~t~velmPv~--e~-~~~~~wGY~~~~~~~~~~~~g~~~~~~~lv~~~H~~gi~VilD~ 237 (639)
T PRK14706 174 GEYVTYMGYTHVELLGVM--EH-PFDGSWGYQVTGYYAPTSRLGTPEDFKYLVNHLHGLGIGVILDW 237 (639)
T ss_pred HHHHHHcCCCEEEccchh--cC-CCCCCCCcCcccccccccccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 36789999999998 442 11 11111 1 12234789999999999999999974
No 61
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=92.39 E-value=1.5 Score=46.42 Aligned_cols=55 Identities=15% Similarity=0.235 Sum_probs=42.0
Q ss_pred hhcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeecccc--------------cchHHHHHHHHHHCCCeEEEc
Q psy13819 104 PGRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYHYDG--------------HRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fdf~g--------------~~dl~~fl~~a~e~GL~vilr 163 (323)
.+.-.+.|.-+|+||+++|++ .|. ..||.-+|.. -+|+.+|||.|+++||-|||.
T Consensus 164 ~e~a~~llpYl~elG~T~IELMPv~-----e~p~~~sWGYq~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD 233 (628)
T COG0296 164 FELAIELLPYLKELGITHIELMPVA-----EHPGDRSWGYQGTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILD 233 (628)
T ss_pred HHHHHHHhHHHHHhCCCEEEEcccc-----cCCCCCCCCCCcceeccccccCCCHHHHHHHHHHHHHcCCEEEEE
Confidence 455578899999999999999 432 2344333322 379999999999999999997
No 62
>PRK09441 cytoplasmic alpha-amylase; Reviewed
Probab=92.07 E-value=0.25 Score=50.37 Aligned_cols=57 Identities=9% Similarity=0.049 Sum_probs=39.6
Q ss_pred HHHHHHHHHhCCCEEEE-EEeCCC-----cccCC-Ceec-------------ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALST-YVEWRS-----HEVHP-GHYH-------------YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~-yv~W~~-----hEP~~-G~fd-------------f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.|+-++++|+|+|-+ +++=+. |--.+ .-|| |...+|+.++++.|+++||+||+..
T Consensus 25 ~~kldyl~~LGvtaIwl~P~~~~~~~~~~hgY~~~D~~~~~~~~~~~~id~~fGt~~dl~~Li~~~H~~Gi~vi~D~ 101 (479)
T PRK09441 25 AERAPELAEAGITAVWLPPAYKGTSGGYDVGYGVYDLFDLGEFDQKGTVRTKYGTKEELLNAIDALHENGIKVYADV 101 (479)
T ss_pred HHHHHHHHHcCCCEEEeCCCccCCCCCCCCCCCeecccccccccccCCcCcCcCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 47788899999999999 443322 11111 1121 2235789999999999999999873
No 63
>PF05913 DUF871: Bacterial protein of unknown function (DUF871); InterPro: IPR008589 This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown though a number are annotated as outer surface proteins.; PDB: 2P0O_A 1X7F_A.
Probab=91.91 E-value=0.25 Score=48.79 Aligned_cols=72 Identities=14% Similarity=0.023 Sum_probs=48.0
Q ss_pred eeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 93 VSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 93 ~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
+|=|+.....+.+...+.|++|+++|+..|=|. .|.|+... -+....+..+++.|+++||.|++...|-+..
T Consensus 2 lGiSvY~~~~~~~~~~~yi~~a~~~Gf~~iFTS----L~ipe~~~--~~~~~~~~~l~~~a~~~~~~v~~Disp~~l~ 73 (357)
T PF05913_consen 2 LGISVYPGQSSFEENKAYIEKAAKYGFKRIFTS----LHIPEDDP--EDYLERLKELLKLAKELGMEVIADISPKVLK 73 (357)
T ss_dssp EEEEE-CCCS-HHHHHHHHHHHHCTTEEEEEEE----E-----------HHHHHHHHHHHHHHCT-EEEEEE-CCHHH
T ss_pred cEEEEeCCCCCHHHHHHHHHHHHHCCCCEEECC----CCcCCCCH--HHHHHHHHHHHHHHHHCCCEEEEECCHHHHH
Confidence 344555555567899999999999999999994 88888432 2234568888999999999999999876554
No 64
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers). In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury. GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=91.72 E-value=5.7 Score=38.10 Aligned_cols=57 Identities=21% Similarity=0.279 Sum_probs=42.6
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEe----CCCcccC----CCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVE----WRSHEVH----PGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~----W~~hEP~----~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+..++.++.|...|+|.+.+++. +.. +|+ +|.|. .+++.++++.|+++||.||-..
T Consensus 16 ~~~lk~~id~ma~~k~N~l~lhl~D~f~~~~-~p~~~~~~~~yT---~~ei~ei~~yA~~~gI~vIPei 80 (301)
T cd06565 16 VSYLKKLLRLLALLGANGLLLYYEDTFPYEG-EPEVGRMRGAYT---KEEIREIDDYAAELGIEVIPLI 80 (301)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEEecceecCC-CcccccCCCCcC---HHHHHHHHHHHHHcCCEEEecC
Confidence 578899999999999999999653 322 122 12222 3578999999999999999764
No 65
>PLN02447 1,4-alpha-glucan-branching enzyme
Probab=91.71 E-value=8.1 Score=41.91 Aligned_cols=61 Identities=15% Similarity=0.120 Sum_probs=42.6
Q ss_pred hcHHHHHHHHHHhCCCEEEE-EEe-------CCCccc---CCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 105 GRWCWIMRAMRAAGLNALST-YVE-------WRSHEV---HPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~-yv~-------W~~hEP---~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
+.+++.|.-+|++|+|+|.+ .|+ |...-. .+ .-.|...+++.++++.|+++||.|||..=+
T Consensus 251 ~~~~~~L~ylk~LG~t~I~LmPi~e~~~~~~wGY~~~~~fa~-~~~~Gtp~dlk~LVd~aH~~GI~VilDvV~ 322 (758)
T PLN02447 251 EFADDVLPRIKALGYNAVQLMAIQEHAYYGSFGYHVTNFFAV-SSRSGTPEDLKYLIDKAHSLGLRVLMDVVH 322 (758)
T ss_pred HHHHHHHHHHHHcCCCEEEECCccccCCCCCCCcCcccCccc-ccccCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 44778899999999999999 331 211100 00 112344578999999999999999998533
No 66
>TIGR01531 glyc_debranch glycogen debranching enzymye. glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4--1,4-glucantransferase (EC 2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site directed mutagenesis studies in S. cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this model belong to the larger alpha-amylase family. The model covers eukaryotic proteins with a seed composed of human, nematode and yeast sequences. Yeast seed sequence is well characterized. The model is quite rigorous; either query sequence yields large bit score or it fails to hit the model altogether. There doesn't appear to be any middle ground.
Probab=91.21 E-value=1.5 Score=50.23 Aligned_cols=117 Identities=15% Similarity=0.235 Sum_probs=71.4
Q ss_pred CeEEeCC-eeeEEeeeee--cCCCC--ChhcHHHHHHHHHHhCCCEEEE-EEe-CCCcccCCCeec-------c----cc
Q psy13819 81 DTFRLNE-DPFQFVSGSF--HYFRA--PPGRWCWIMRAMRAAGLNALST-YVE-WRSHEVHPGHYH-------Y----DG 142 (323)
Q Consensus 81 ~~~~~dG-k~~~i~~g~~--Hy~r~--p~~~W~~~l~~mk~~G~N~Vr~-yv~-W~~hEP~~G~fd-------f----~g 142 (323)
=.+.||| |.+.+=|-++ ..+++ +-..|++.|+.+|++|.|+|-+ ++. =....-.-..+| | .+
T Consensus 103 P~L~i~~~~~lPl~~i~iqTvlsK~mG~~~~w~~~L~~ik~lGyN~IhftPI~~~G~SnS~Ysi~Dyl~idP~~~~~~~~ 182 (1464)
T TIGR01531 103 PMLYINADKFLPLDSIALQTVLAKLLGPLSEWEPRLRVAKEKGYNMIHFTPLQELGGSNSCYSLYDQLQLNQHFKSQKDG 182 (1464)
T ss_pred CeeEECCCcccCcCceeeeeehhhhcCCHHHHHHHHHHHHHcCCCEEEeCCCccCCCCCCCccccchhhcChhhcccCCc
Confidence 4688888 5444333333 34442 4478999999999999999998 433 110000001112 3 25
Q ss_pred cchHHHHHHHHHHC-CCeEEEccCCccccccCCCCC-cccccccCCCeeecCCCHhHHHHHHHH
Q psy13819 143 HRDIEHFLQLAVEE-DLYILLRPGPFICGKRDFGGF-PPWLLKVAPDILLRQNHPVYQRYVTRW 204 (323)
Q Consensus 143 ~~dl~~fl~~a~e~-GL~vilr~GPyi~aEw~~gg~-P~Wl~~~~p~~~~R~~~~~f~~~~~~~ 204 (323)
.+|+.++++.|++. ||++|+.. -|+|-+. -.|+.+ .|+.-....+..+++.+-..
T Consensus 183 ~~d~~~lV~~~h~~~Gm~~ilDv------V~NHTa~ds~Wl~e-HPEa~Yn~~~sP~L~~A~eL 239 (1464)
T TIGR01531 183 KNDVQALVEKLHRDWNVLSITDI------VFNHTANNSPWLLE-HPEAAYNCITSPHLRPAIVL 239 (1464)
T ss_pred HHHHHHHHHHHHHhcCCEEEEEe------eecccccCCHHHHh-ChHhhcCCCCCchhhhHHHH
Confidence 67899999999996 99999976 2344333 248876 67755554444555544433
No 67
>PRK12313 glycogen branching enzyme; Provisional
Probab=91.21 E-value=0.4 Score=50.69 Aligned_cols=53 Identities=11% Similarity=0.136 Sum_probs=38.0
Q ss_pred HH-HHHHHHhCCCEEEE-EEeCCCcccCCC--ee----------cccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 109 WI-MRAMRAAGLNALST-YVEWRSHEVHPG--HY----------HYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 109 ~~-l~~mk~~G~N~Vr~-yv~W~~hEP~~G--~f----------df~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+. |.-+|++|+|+|.+ +|+ .. |..+ -| .|...+++.+|++.|+++||+|||..
T Consensus 174 ~~ll~yl~~LGv~~i~L~Pi~-~~--~~~~~~GY~~~~y~~i~~~~Gt~~d~k~lv~~~H~~Gi~VilD~ 240 (633)
T PRK12313 174 DELIPYVKEMGYTHVEFMPLM-EH--PLDGSWGYQLTGYFAPTSRYGTPEDFMYLVDALHQNGIGVILDW 240 (633)
T ss_pred HHHHHHHHHcCCCEEEeCchh-cC--CCCCCCCCCCcCcCcCCCCCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 44 58999999999998 542 11 1000 11 23445799999999999999999984
No 68
>COG5309 Exo-beta-1,3-glucanase [Carbohydrate transport and metabolism]
Probab=90.70 E-value=2.9 Score=39.85 Aligned_cols=115 Identities=14% Similarity=0.114 Sum_probs=80.3
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
.+..+.||+.++.-+. .||+|- -|.. .|+.++.++.+.|++|+|.. |+..
T Consensus 62 a~~~~sDLe~l~~~t~-~IR~Y~-----------sDCn---~le~v~pAa~~~g~kv~lGi---------------w~td 111 (305)
T COG5309 62 ADQVASDLELLASYTH-SIRTYG-----------SDCN---TLENVLPAAEASGFKVFLGI---------------WPTD 111 (305)
T ss_pred HHHHHhHHHHhccCCc-eEEEee-----------ccch---hhhhhHHHHHhcCceEEEEE---------------eecc
Confidence 6789999999999887 999983 2333 36677999999999999977 5544
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCc--CCCCC--ChHHHHHHHHHHHHHhcCCceE
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEY--GSDAE--CDPAHAVWLRDLLRTYVQDKAV 259 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~--g~~~~--~~~~Y~~~l~~~~~~~~g~~~~ 259 (323)
+ - ...+++ .++..+ ..+..-+.|..+-|.||. +..+. .-.+|+...|.++++ +|.++|
T Consensus 112 d----------~--~~~~~~---til~ay--~~~~~~d~v~~v~VGnEal~r~~~tasql~~~I~~vrsav~~-agy~gp 173 (305)
T COG5309 112 D----------I--HDAVEK---TILSAY--LPYNGWDDVTTVTVGNEALNRNDLTASQLIEYIDDVRSAVKE-AGYDGP 173 (305)
T ss_pred c----------h--hhhHHH---HHHHHH--hccCCCCceEEEEechhhhhcCCCCHHHHHHHHHHHHHHHHh-cCCCCc
Confidence 1 0 112221 222221 124455778899999995 33322 126899999999986 699999
Q ss_pred EEeeCCC
Q psy13819 260 LYSTDGA 266 (323)
Q Consensus 260 l~t~dg~ 266 (323)
+-|.|..
T Consensus 174 V~T~dsw 180 (305)
T COG5309 174 VTTVDSW 180 (305)
T ss_pred eeecccc
Confidence 9998854
No 69
>TIGR02402 trehalose_TreZ malto-oligosyltrehalose trehalohydrolase. Members of this family are the trehalose biosynthetic enzyme malto-oligosyltrehalose trehalohydrolase, formally known as 4-alpha-D-{(1-4)-alpha-D-glucano}trehalose trehalohydrolase (EC 3.2.1.141). It is the TreZ protein of the TreYZ pathway for trehalose biosynthesis, and alternative to the OtsAB system.
Probab=90.29 E-value=0.48 Score=49.24 Aligned_cols=55 Identities=20% Similarity=0.268 Sum_probs=39.3
Q ss_pred HHHHHHHHHHhCCCEEEE-EEe-------CCC-----cccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 107 WCWIMRAMRAAGLNALST-YVE-------WRS-----HEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~-yv~-------W~~-----hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..+.|.-+|++|+|+|.+ +|+ |.. ..|. -.|...+++.++++.|+++||+|||..
T Consensus 113 i~~~l~yl~~LGv~~i~L~Pi~~~~~~~~~GY~~~~~~~~~---~~~G~~~e~k~lV~~aH~~Gi~VilD~ 180 (542)
T TIGR02402 113 AIEKLPYLADLGITAIELMPVAQFPGTRGWGYDGVLPYAPH---NAYGGPDDLKALVDAAHGLGLGVILDV 180 (542)
T ss_pred HHHhhHHHHHcCCCEEEeCccccCCCCCCCCCCccCccccc---cccCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 346789999999999999 542 110 0010 113345789999999999999999974
No 70
>PF01261 AP_endonuc_2: Xylose isomerase-like TIM barrel; InterPro: IPR012307 This TIM alpha/beta barrel structure is found in xylose isomerase (P19148 from SWISSPROT) and in endonuclease IV (P12638 from SWISSPROT, 3.1.21.2 from EC). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae []. ; PDB: 3KWS_B 3DX5_A 3CQH_B 3CQI_A 3CQK_A 3CQJ_B 2G0W_B 1DXI_A 2ZDS_D 3TVA_B ....
Probab=89.94 E-value=2.7 Score=36.41 Aligned_cols=128 Identities=13% Similarity=0.067 Sum_probs=73.0
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeee
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILL 190 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~ 190 (323)
|+.++++|+..|++........... ..+++++.+.++++||.+..-..+. .... +....
T Consensus 1 l~~~~~~G~~~vE~~~~~~~~~~~~-------~~~~~~~~~~~~~~gl~i~~~~~~~------~~~~--------~~~~~ 59 (213)
T PF01261_consen 1 LEAAAEAGFDGVELRFDDGQPWDEK-------DDEAEELRRLLEDYGLKIASLHPPT------NFWS--------PDEEN 59 (213)
T ss_dssp HHHHHHTTHSEEEEEHHHHSHHTHH-------HHHHHHHHHHHHHTTCEEEEEEEEE------SSSC--------TGTTS
T ss_pred ChHHHHcCCCEEEEecCCCcccccc-------hHHHHHHHHHHHHcCCeEEEEeccc------cccc--------ccccc
Confidence 5789999999999954433222111 3458899999999999966422111 0001 11112
Q ss_pred cCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccC--CCcCCCCC-----ChHHHHHHHHHHHHHhcCCceEEEee
Q psy13819 191 RQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVE--NEYGSDAE-----CDPAHAVWLRDLLRTYVQDKAVLYST 263 (323)
Q Consensus 191 R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Qie--NE~g~~~~-----~~~~Y~~~l~~~~~~~~g~~~~l~t~ 263 (323)
+...+. ++...+.+++.++..+.+ +...+.+... +....... .-.+.++.+.+.+.++ |+.+.+-+.
T Consensus 60 ~~~~~~-r~~~~~~~~~~i~~a~~l----g~~~i~~~~g~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~-gv~i~lE~~ 133 (213)
T PF01261_consen 60 GSANDE-REEALEYLKKAIDLAKRL----GAKYIVVHSGRYPSGPEDDTEENWERLAENLRELAEIAEEY-GVRIALENH 133 (213)
T ss_dssp TTSSSH-HHHHHHHHHHHHHHHHHH----TBSEEEEECTTESSSTTSSHHHHHHHHHHHHHHHHHHHHHH-TSEEEEE-S
T ss_pred cCcchh-hHHHHHHHHHHHHHHHHh----CCCceeecCcccccccCCCHHHHHHHHHHHHHHHHhhhhhh-cceEEEecc
Confidence 233344 778888888888877655 4556666644 22221111 0134566666676664 776666555
Q ss_pred CC
Q psy13819 264 DG 265 (323)
Q Consensus 264 dg 265 (323)
.+
T Consensus 134 ~~ 135 (213)
T PF01261_consen 134 PG 135 (213)
T ss_dssp SS
T ss_pred cC
Confidence 43
No 71
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=89.88 E-value=9.3 Score=36.56 Aligned_cols=142 Identities=15% Similarity=0.275 Sum_probs=82.4
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEe----CCCcc---c-------------CCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVE----WRSHE---V-------------HPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~----W~~hE---P-------------~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
.|.+..++.|+.|...++|++.+++. |..-- | ..|.|.- +++..+++.|+++||.||
T Consensus 13 ~~~~~lk~~id~ma~~K~N~lhlHl~D~~~~~le~~~~p~l~~~g~~~~~~~~~~~yT~---~di~elv~yA~~rgI~vi 89 (303)
T cd02742 13 LSVESIKRTIDVLARYKINTFHWHLTDDQAWRIESKKFPELAEKGGQINPRSPGGFYTY---AQLKDIIEYAAARGIEVI 89 (303)
T ss_pred cCHHHHHHHHHHHHHhCCcEEEEeeecCCCceEeeCccchhhhhcccccCCCCCCeECH---HHHHHHHHHHHHcCCEEE
Confidence 45789999999999999999999988 75431 1 1123333 468888999999999999
Q ss_pred EccCCccccccCCCCCc-ccccccCCCe---------------eecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEE
Q psy13819 162 LRPGPFICGKRDFGGFP-PWLLKVAPDI---------------LLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIIL 225 (323)
Q Consensus 162 lr~GPyi~aEw~~gg~P-~Wl~~~~p~~---------------~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~ 225 (323)
... +.-|.= +|+.. +|+. .+-..+| ++.++++.|+..+.++. ..+.|-
T Consensus 90 PEi--------D~PGH~~a~~~~-~p~l~~~~~~~~~~~~~~~~l~~~~~----~t~~fl~~l~~e~~~lf---~~~~iH 153 (303)
T cd02742 90 PEI--------DMPGHSTAFVKS-FPKLLTECYAGLKLRDVFDPLDPTLP----KGYDFLDDLFGEIAELF---PDRYLH 153 (303)
T ss_pred Eec--------cchHHHHHHHHh-CHHhccCccccCCCCCCCCccCCCCc----cHHHHHHHHHHHHHHhC---CCCeEE
Confidence 764 332322 23322 2221 1222333 34455566666655543 235554
Q ss_pred EccCCCcCCCCCCh---HHHHHHHHHHHHHhcCCceEEEeeCC
Q psy13819 226 VQVENEYGSDAECD---PAHAVWLRDLLRTYVQDKAVLYSTDG 265 (323)
Q Consensus 226 ~QieNE~g~~~~~~---~~Y~~~l~~~~~~~~g~~~~l~t~dg 265 (323)
+- .-|+-.-.... ..|++.+.+.+++. |.. ++.=.|.
T Consensus 154 iG-gDE~~~~~~~~~l~~~f~~~~~~~v~~~-g~~-~~~W~d~ 193 (303)
T cd02742 154 IG-GDEAHFKQDRKHLMSQFIQRVLDIVKKK-GKK-VIVWQDG 193 (303)
T ss_pred ec-ceecCCCCCHHHHHHHHHHHHHHHHHHc-CCe-EEEeccc
Confidence 43 33432111111 56778888888774 644 4443343
No 72
>PF12876 Cellulase-like: Sugar-binding cellulase-like; InterPro: IPR024778 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This entry represents a family of putative cellulase enzymes.; PDB: 3GYC_B.
Probab=88.78 E-value=0.96 Score=35.22 Aligned_cols=46 Identities=28% Similarity=0.315 Sum_probs=27.1
Q ss_pred ccCCCcEEEEccCCC-cCCCC----C-----ChHHHHHHHHHHHHHh--cCCceEEEe
Q psy13819 217 YGNDRPIILVQVENE-YGSDA----E-----CDPAHAVWLRDLLRTY--VQDKAVLYS 262 (323)
Q Consensus 217 ~~n~g~Vi~~QieNE-~g~~~----~-----~~~~Y~~~l~~~~~~~--~g~~~~l~t 262 (323)
+++.+.|.+|+|.|| .+... . ....|.++|+++++.. +....|+.+
T Consensus 5 ~~~~~~Il~Wdl~NE~p~~~~~~~~~~~~~~~~~~~~~~l~~~~~~iR~~dP~~pvt~ 62 (88)
T PF12876_consen 5 FGYDPRILAWDLWNEPPNNWADGYPAEWGDPKAEAYAEWLKEAFRWIRAVDPSQPVTS 62 (88)
T ss_dssp TT-GGGEEEEESSTTTT-TT-TT-TT-TT-TTSHHHHHHHHHHHHHHHTT-TTS-EE-
T ss_pred hcCCCCEEEEEeecCCCCcccccccccccchhHHHHHHHHHHHHHHHHHhCCCCcEEe
Confidence 678899999999999 66321 1 1256777777766553 344555543
No 73
>cd06568 GH20_SpHex_like A subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the N-acetylhexosaminidase from Streptomyces plicatus (SpHex). SpHex catalyzes the hydrolysis of N-acetyl-beta-hexosaminides. An Asp residue within the active site plays a critical role in substrate-assisted catalysis by orienting the 2-acetamido group and stabilizing the transition state. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Proteins belonging to this subgroup lack the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases.
Probab=88.36 E-value=10 Score=36.87 Aligned_cols=68 Identities=16% Similarity=0.192 Sum_probs=47.9
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEe----CCCcccC------CCeec--------ccccchHHHHHHHHHHC
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVE----WRSHEVH------PGHYH--------YDGHRDIEHFLQLAVEE 156 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~----W~~hEP~------~G~fd--------f~g~~dl~~fl~~a~e~ 156 (323)
.+-|+ .|.+..++.|+.|....+|++.+++. |...-+. .|.+. |=-.+++..+++-|+++
T Consensus 10 ~aR~f--~~~~~lk~~id~ma~~KlN~lhlHLtD~~~~rle~~~~P~lt~~ga~~~~~~~~~~~YT~~di~elv~yA~~r 87 (329)
T cd06568 10 VARHF--FTVAEVKRYIDLLALYKLNVLHLHLTDDQGWRIEIKSWPKLTEIGGSTEVGGGPGGYYTQEDYKDIVAYAAER 87 (329)
T ss_pred ccCCC--cCHHHHHHHHHHHHHhCCcEEEEEeecCCcceeeecCcccccccccccccCCCCCCcCCHHHHHHHHHHHHHc
Confidence 34454 37899999999999999999999883 5332111 22221 00135688889999999
Q ss_pred CCeEEEcc
Q psy13819 157 DLYILLRP 164 (323)
Q Consensus 157 GL~vilr~ 164 (323)
||.||..+
T Consensus 88 gI~vIPEi 95 (329)
T cd06568 88 HITVVPEI 95 (329)
T ss_pred CCEEEEec
Confidence 99999764
No 74
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=88.34 E-value=20 Score=35.12 Aligned_cols=65 Identities=20% Similarity=0.281 Sum_probs=47.3
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEe----CCCcc---c-------------------------CCCeecccc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVE----WRSHE---V-------------------------HPGHYHYDG 142 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~----W~~hE---P-------------------------~~G~fdf~g 142 (323)
.+-|+ +|.+..++.|+.|....+|++.+++. |..-- | ..|.|.
T Consensus 10 ~aR~f--~~~~~ik~~Id~ma~~K~N~lhlHltDdq~~rle~~~~P~Lt~~ga~~~~~~~~~~~~~~~~~~~~~~YT--- 84 (357)
T cd06563 10 VSRHF--FPVDEVKRFIDLMALYKLNVFHWHLTDDQGWRIEIKKYPKLTEVGAWRGPTEIGLPQGGGDGTPYGGFYT--- 84 (357)
T ss_pred ccccC--cCHHHHHHHHHHHHHhccceEEEeeecCCCceecccCcchhhhcccccCcccccccccccCCCccCceEC---
Confidence 44455 46899999999999999999999773 32111 1 123333
Q ss_pred cchHHHHHHHHHHCCCeEEEcc
Q psy13819 143 HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 143 ~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+|+..+++.|+++||.||..+
T Consensus 85 ~~di~eiv~yA~~rgI~VIPEI 106 (357)
T cd06563 85 QEEIREIVAYAAERGITVIPEI 106 (357)
T ss_pred HHHHHHHHHHHHHcCCEEEEec
Confidence 3568888999999999999764
No 75
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway. The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=88.26 E-value=10 Score=36.63 Aligned_cols=145 Identities=15% Similarity=0.201 Sum_probs=82.3
Q ss_pred eeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEe--CCC---cccC------------------------CCeeccccc
Q psy13819 93 VSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVE--WRS---HEVH------------------------PGHYHYDGH 143 (323)
Q Consensus 93 ~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~--W~~---hEP~------------------------~G~fdf~g~ 143 (323)
+-.+-||+ |.+..++.|+.|...++|++...+. |.. ..|. .|.|. .
T Consensus 7 LD~aR~~~--~~~~ik~~id~ma~~K~N~lhlHltD~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~~~~~~~~YT---~ 81 (326)
T cd06564 7 LDVGRKYY--SMDFLKDIIKTMSWYKMNDLQLHLNDNLIFNLDDMSTTVNNATYASDDVKSGNNYYNLTANDGYYT---K 81 (326)
T ss_pred EEccCCCC--CHHHHHHHHHHHHHcCCceEEEeecCCcccccCCCchhhhhhhhhccccccccccCCCCCCCCccc---H
Confidence 33555664 6899999999999999999998442 211 1111 11222 3
Q ss_pred chHHHHHHHHHHCCCeEEEccCCccccccCCCCC-cccccccCCCeee------------cCCCHhHHHHHHHHHHHHHH
Q psy13819 144 RDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGF-PPWLLKVAPDILL------------RQNHPVYQRYVTRWFQELFP 210 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~-P~Wl~~~~p~~~~------------R~~~~~f~~~~~~~~~~l~~ 210 (323)
+++..+++.|+++||.||-.+ +.-|. =+|+.. +|+... -..+| .+.++++.|+.
T Consensus 82 ~di~eiv~yA~~rgI~vIPEI--------D~PGH~~a~~~~-~pel~~~~~~~~~~~~~l~~~~~----~t~~f~~~l~~ 148 (326)
T cd06564 82 EEFKELIAYAKDRGVNIIPEI--------DSPGHSLAFTKA-MPELGLKNPFSKYDKDTLDISNP----EAVKFVKALFD 148 (326)
T ss_pred HHHHHHHHHHHHcCCeEeccC--------CCcHHHHHHHHh-hHHhcCCCcccCCCcccccCCCH----HHHHHHHHHHH
Confidence 578899999999999999754 33332 123322 333221 12334 45555555555
Q ss_pred HHHhhhccCCCcEEEEccCCCcCCCC-CC--hHHHHHHHHHHHHHhcCCce
Q psy13819 211 RIQKYLYGNDRPIILVQVENEYGSDA-EC--DPAHAVWLRDLLRTYVQDKA 258 (323)
Q Consensus 211 ~i~~~~~~n~g~Vi~~QieNE~g~~~-~~--~~~Y~~~l~~~~~~~~g~~~ 258 (323)
.+.+. +....+.|-+- .-|+-... .. -..|++.+.+.+++. |..+
T Consensus 149 E~~~~-f~~~~~~~HiG-gDE~~~~~~~~~~~~~f~~~~~~~v~~~-gk~~ 196 (326)
T cd06564 149 EYLDG-FNPKSDTVHIG-ADEYAGDAGYAEAFRAYVNDLAKYVKDK-GKTP 196 (326)
T ss_pred HHHHh-cCCCCCEEEec-cccccccCccHHHHHHHHHHHHHHHHHc-CCeE
Confidence 55554 32123455443 33432211 11 157888888888874 6554
No 76
>PRK10785 maltodextrin glucosidase; Provisional
Probab=88.22 E-value=1 Score=47.36 Aligned_cols=55 Identities=11% Similarity=0.116 Sum_probs=41.0
Q ss_pred HHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeec----------ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 107 WCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYH----------YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fd----------f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..+.|.-+|++|+|+|-+ +|+=+ |.--.|| |...+++.++++.|+++||+|||..
T Consensus 181 I~~kLdYL~~LGv~~I~L~Pif~s---~s~hgYd~~Dy~~iDp~~Gt~~df~~Lv~~aH~rGikVilD~ 246 (598)
T PRK10785 181 ISEKLPYLKKLGVTALYLNPIFTA---PSVHKYDTEDYRHVDPQLGGDAALLRLRHATQQRGMRLVLDG 246 (598)
T ss_pred HHHHHHHHHHcCCCEEEeCCcccC---CCCCCcCcccccccCcccCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 467899999999999999 55422 1111233 2234789999999999999999974
No 77
>TIGR02104 pulA_typeI pullulanase, type I. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family consists of pullulanases related to the subfamilies described in TIGR02102 and TIGR02103 but having a different domain architecture with shorter sequences. Members are called type I pullulanases.
Probab=88.14 E-value=0.86 Score=47.91 Aligned_cols=55 Identities=15% Similarity=0.279 Sum_probs=37.3
Q ss_pred HHHHHHHHhCCCEEEE-EEe---------------CCCc-----ccCCCeec----c--cccchHHHHHHHHHHCCCeEE
Q psy13819 109 WIMRAMRAAGLNALST-YVE---------------WRSH-----EVHPGHYH----Y--DGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~-yv~---------------W~~h-----EP~~G~fd----f--~g~~dl~~fl~~a~e~GL~vi 161 (323)
+.|.-+|++|+|+|.+ +|+ |... .|+ +.|- + +..+++.++++.|+++||+||
T Consensus 168 ~~LdyL~~LGvt~I~L~Pi~~~~~~~~~~~~~~~~wGY~~~~y~~~~-~~y~~~p~~~~~~~~efk~lV~~~H~~Gi~Vi 246 (605)
T TIGR02104 168 TGLDYLKELGVTHVQLLPVFDFAGVDEEDPNNAYNWGYDPLNYNVPE-GSYSTNPYDPATRIRELKQMIQALHENGIRVI 246 (605)
T ss_pred hHHHHHHHcCCCEEEeCCcccccccccccCCCCCCCCCCCccCCCcC-hhhhcCCCccchHHHHHHHHHHHHHHCCCEEE
Confidence 4589999999999999 443 2111 000 0010 0 013689999999999999999
Q ss_pred Ecc
Q psy13819 162 LRP 164 (323)
Q Consensus 162 lr~ 164 (323)
|..
T Consensus 247 lDv 249 (605)
T TIGR02104 247 MDV 249 (605)
T ss_pred EEE
Confidence 974
No 78
>PLN02960 alpha-amylase
Probab=88.00 E-value=0.97 Score=49.39 Aligned_cols=57 Identities=16% Similarity=0.166 Sum_probs=39.0
Q ss_pred HHHHHHHHHhCCCEEEE-EEe-------CCCcc--cCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALST-YVE-------WRSHE--VHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~-yv~-------W~~hE--P~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.|.-+|++|+|+|.+ .|. |...- ...=.-.|...+++.+|++.|+++||.|||..
T Consensus 420 e~~LdYLk~LGvt~IeLmPv~e~~~~~swGY~~~~yfa~~~~yGtp~dfk~LVd~aH~~GI~VILDv 486 (897)
T PLN02960 420 QKVLPHVKKAGYNAIQLIGVQEHKDYSSVGYKVTNFFAVSSRFGTPDDFKRLVDEAHGLGLLVFLDI 486 (897)
T ss_pred HHHHHHHHHcCCCEEEECCcccCCCCCCCCCCcccCCCcccccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 45699999999999999 543 21100 00000112335789999999999999999985
No 79
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=87.81 E-value=22 Score=32.72 Aligned_cols=53 Identities=15% Similarity=0.151 Sum_probs=36.4
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
..+++.|+.++++|++.|++...-. |+-.+ +++ ..+++++-++++++||.|..
T Consensus 13 ~~l~~~l~~~~~~G~~~vEl~~~~~-~~~~~---~~~-~~~~~~l~~~~~~~gl~v~s 65 (275)
T PRK09856 13 LPIEHAFRDASELGYDGIEIWGGRP-HAFAP---DLK-AGGIKQIKALAQTYQMPIIG 65 (275)
T ss_pred CCHHHHHHHHHHcCCCEEEEccCCc-ccccc---ccC-chHHHHHHHHHHHcCCeEEE
Confidence 3589999999999999999942111 11001 121 23577788999999999864
No 80
>TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative. This family is conserved at better than 40 % identity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.
Probab=87.78 E-value=23 Score=32.89 Aligned_cols=58 Identities=9% Similarity=0.181 Sum_probs=42.3
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-Ecc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRP 164 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~ 164 (323)
...|.+.++.++++|++.|++.+. .. ...+...+++. .++..+.+.++++||.|. +.+
T Consensus 15 ~~~~~e~l~~~~~~G~~~VEl~~~-~~-~~~~~~~~~~~-~~~~~~~~~l~~~gl~i~~~~~ 73 (279)
T TIGR00542 15 GECWLERLQLAKTCGFDFVEMSVD-ET-DDRLSRLDWSR-EQRLALVNAIIETGVRIPSMCL 73 (279)
T ss_pred CCCHHHHHHHHHHcCCCEEEEecC-Cc-cchhhccCCCH-HHHHHHHHHHHHcCCCceeeec
Confidence 367999999999999999999532 21 22234455553 457888999999999975 344
No 81
>PF03659 Glyco_hydro_71: Glycosyl hydrolase family 71 ; InterPro: IPR005197 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha-1,3-glucanases belonging to glycoside hydrolase family 71 (GH71 from CAZY).
Probab=87.63 E-value=2.3 Score=42.39 Aligned_cols=89 Identities=13% Similarity=0.277 Sum_probs=60.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLL 182 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~ 182 (323)
..+.|++|+++++++|++...+-+. ....+.. +.|...++.|++.|+++++-+ +..+...|=
T Consensus 15 t~~dw~~di~~A~~~GIDgFaLNig------~~d~~~~---~~l~~a~~AA~~~gFKlf~Sf--------D~~~~~~~~- 76 (386)
T PF03659_consen 15 TQEDWEADIRLAQAAGIDGFALNIG------SSDSWQP---DQLADAYQAAEAVGFKLFFSF--------DMNSLGPWS- 76 (386)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEecc------cCCcccH---HHHHHHHHHHHhcCCEEEEEe--------cccCCCCCC-
Confidence 6789999999999999999999553 1122222 447778999999999999987 443332221
Q ss_pred ccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEE
Q psy13819 183 KVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIIL 225 (323)
Q Consensus 183 ~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~ 225 (323)
.+.+.+++++...+=+.+. .++.|++.
T Consensus 77 ---------------~~~~~~~i~~y~~~pa~~~-~~Gkp~VS 103 (386)
T PF03659_consen 77 ---------------QDELIALIKKYAGHPAYFR-YDGKPVVS 103 (386)
T ss_pred ---------------HHHHHHHHHHHcCChhHEe-ECCeEEEE
Confidence 1466666666666644444 35666665
No 82
>PF02065 Melibiase: Melibiase; InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=87.50 E-value=5 Score=40.18 Aligned_cols=115 Identities=17% Similarity=0.272 Sum_probs=73.4
Q ss_pred eecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCC----Ceeccc--cc-chHHHHHHHHHHCCCeEEEccCCcc
Q psy13819 96 SFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHP----GHYHYD--GH-RDIEHFLQLAVEEDLYILLRPGPFI 168 (323)
Q Consensus 96 ~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~----G~fdf~--g~-~dl~~fl~~a~e~GL~vilr~GPyi 168 (323)
+-.|+.+..+...+.++.++++|++.+-+-=-|....... |.+--+ .+ +.+..+++.+++.||+.=|...|.+
T Consensus 49 ~~~~~d~~e~~i~~~a~~~~~~G~e~fviDDGW~~~r~~d~~~~GdW~~~~~kFP~Gl~~l~~~i~~~Gmk~GlW~ePe~ 128 (394)
T PF02065_consen 49 EAYYFDITEEKILELADAAAELGYEYFVIDDGWFGGRDDDNAGLGDWEPDPKKFPNGLKPLADYIHSLGMKFGLWFEPEM 128 (394)
T ss_dssp HHHTTG--HHHHHHHHHHHHHHT-SEEEE-SSSBCTESTTTSTTSBECBBTTTSTTHHHHHHHHHHHTT-EEEEEEETTE
T ss_pred cccCcCCCHHHHHHHHHHHHHhCCEEEEEcCccccccCCCcccCCceeEChhhhCCcHHHHHHHHHHCCCeEEEEecccc
Confidence 4456777788899999999999999999988886542221 333221 11 2388889999999999998888876
Q ss_pred ccc-cC-CCCCcccccccCCCe---e------ecCCCHhHHHHHHHHHHHHHHH
Q psy13819 169 CGK-RD-FGGFPPWLLKVAPDI---L------LRQNHPVYQRYVTRWFQELFPR 211 (323)
Q Consensus 169 ~aE-w~-~gg~P~Wl~~~~p~~---~------~R~~~~~f~~~~~~~~~~l~~~ 211 (323)
++. .+ .-..|.|+.. .++- . +-.++|..+++....+.+++..
T Consensus 129 v~~~S~l~~~hPdw~l~-~~~~~~~~~r~~~vLD~~~pev~~~l~~~i~~ll~~ 181 (394)
T PF02065_consen 129 VSPDSDLYREHPDWVLR-DPGRPPTLGRNQYVLDLSNPEVRDYLFEVIDRLLRE 181 (394)
T ss_dssp EESSSCHCCSSBGGBTC-CTTSE-ECBTTBEEB-TTSHHHHHHHHHHHHHHHHH
T ss_pred ccchhHHHHhCccceee-cCCCCCcCcccceEEcCCCHHHHHHHHHHHHHHHHh
Confidence 653 11 2357999876 3331 1 2245777777776666665544
No 83
>PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=86.97 E-value=25 Score=32.48 Aligned_cols=54 Identities=13% Similarity=0.188 Sum_probs=38.7
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
-.+++.++.++++|+..|++.+. ..|+ .....+++ ..++..+-+.++++||.+.
T Consensus 16 ~~~~e~~~~~~~~G~~~iEl~~~-~~~~-~~~~~~~~-~~~~~~l~~~l~~~Gl~i~ 69 (284)
T PRK13210 16 LSWEERLVFAKELGFDFVEMSVD-ESDE-RLARLDWS-KEERLSLVKAIYETGVRIP 69 (284)
T ss_pred CCHHHHHHHHHHcCCCeEEEecC-Cccc-ccccccCC-HHHHHHHHHHHHHcCCCce
Confidence 47999999999999999999643 1111 11122333 2458888999999999876
No 84
>PRK01060 endonuclease IV; Provisional
Probab=86.56 E-value=19 Score=33.30 Aligned_cols=52 Identities=13% Similarity=0.078 Sum_probs=35.4
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeE
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYI 160 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~v 160 (323)
...++-++.++++|+++|++.+.-. +.-.++.++ ..++.++-++++++||.+
T Consensus 12 ~~~~~~l~~~~~~G~d~vEl~~~~p-~~~~~~~~~---~~~~~~lk~~~~~~gl~~ 63 (281)
T PRK01060 12 GGLEGAVAEAAEIGANAFMIFTGNP-QQWKRKPLE---ELNIEAFKAACEKYGISP 63 (281)
T ss_pred CCHHHHHHHHHHcCCCEEEEECCCC-CCCcCCCCC---HHHHHHHHHHHHHcCCCC
Confidence 3488999999999999999964311 111111221 224777889999999983
No 85
>PRK10933 trehalose-6-phosphate hydrolase; Provisional
Probab=86.28 E-value=1.4 Score=45.93 Aligned_cols=57 Identities=14% Similarity=0.166 Sum_probs=41.5
Q ss_pred hcHHHHHHHHHHhCCCEEEE-EEeCCCcccC-CCee---c-------ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALST-YVEWRSHEVH-PGHY---H-------YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~-~G~f---d-------f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.-+.+.|+-+|++|+++|-+ +++-+ |. ..-| | |...+|+.++++.|+++||+||+..
T Consensus 33 ~gi~~~ldyl~~lGv~~i~l~P~~~~---~~~~~gY~~~d~~~id~~~Gt~~d~~~lv~~~h~~gi~vilD~ 101 (551)
T PRK10933 33 RGVTQRLDYLQKLGVDAIWLTPFYVS---PQVDNGYDVANYTAIDPTYGTLDDFDELVAQAKSRGIRIILDM 101 (551)
T ss_pred HHHHHhhHHHHhCCCCEEEECCCCCC---CCCCCCCCcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 34568899999999999999 54321 21 1122 2 2234689999999999999999874
No 86
>PRK13398 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=86.12 E-value=4.5 Score=38.28 Aligned_cols=74 Identities=8% Similarity=0.176 Sum_probs=56.8
Q ss_pred EEeCCeeeEEeee--eecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCC
Q psy13819 83 FRLNEDPFQFVSG--SFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDL 158 (323)
Q Consensus 83 ~~~dGk~~~i~~g--~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL 158 (323)
+.+.|+++.++.| ++| ..+.-.+-.+.+|++|+..+|.|++=++..| +.|. +...+..+-+.|++.||
T Consensus 21 ~~~g~~~~~~iaGPCsie----~~~~~~~~A~~lk~~g~~~~r~~~~kpRTs~----~s~~G~g~~gl~~l~~~~~~~Gl 92 (266)
T PRK13398 21 VVIGGEEKIIIAGPCAVE----SEEQMVKVAEKLKELGVHMLRGGAFKPRTSP----YSFQGLGEEGLKILKEVGDKYNL 92 (266)
T ss_pred EEEcCCCEEEEEeCCcCC----CHHHHHHHHHHHHHcCCCEEEEeeecCCCCC----CccCCcHHHHHHHHHHHHHHcCC
Confidence 5666668888888 444 2567788899999999999999876544433 3555 45678888999999999
Q ss_pred eEEEcc
Q psy13819 159 YILLRP 164 (323)
Q Consensus 159 ~vilr~ 164 (323)
.++..+
T Consensus 93 ~~~te~ 98 (266)
T PRK13398 93 PVVTEV 98 (266)
T ss_pred CEEEee
Confidence 999876
No 87
>PF13200 DUF4015: Putative glycosyl hydrolase domain
Probab=85.90 E-value=2 Score=41.85 Aligned_cols=110 Identities=16% Similarity=0.185 Sum_probs=67.7
Q ss_pred hhcHHHHHHHHHHhCCCEEEE-------EEeCCCcccCCCeecc--cccchHHHHHHHHHHCCCeEEEccCCccccccCC
Q psy13819 104 PGRWCWIMRAMRAAGLNALST-------YVEWRSHEVHPGHYHY--DGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDF 174 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~-------yv~W~~hEP~~G~fdf--~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~ 174 (323)
++..+..|+.+++.|+|+|=+ .+.+..-.|..-+..- ....|+.++++.++|+||++|.|.--+--. .-.
T Consensus 12 ~~~~~~~~~~i~~t~lNavVIDvKdd~G~i~y~s~~~~~~~~ga~~~~i~D~~~l~~~l~e~gIY~IARIv~FkD~-~la 90 (316)
T PF13200_consen 12 PERLDKLLDLIKRTELNAVVIDVKDDDGNITYDSQVPLAREIGAVKPYIKDLKALVKKLKEHGIYPIARIVVFKDP-VLA 90 (316)
T ss_pred HHHHHHHHHHHHhcCCceEEEEEecCCceEEecCCCchhhhcccccccccCHHHHHHHHHHCCCEEEEEEEEecCh-HHh
Confidence 467889999999999999987 3455444443322221 124789999999999999999986211100 000
Q ss_pred CCCcccccccCCCeeecC-----CCHhHHHHHHHHHHHHHHHHHh
Q psy13819 175 GGFPPWLLKVAPDILLRQ-----NHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 175 gg~P~Wl~~~~p~~~~R~-----~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
.-.|.|-.++..+-..+. --..|.+.+.+|.-.+++.++.
T Consensus 91 ~~~pe~av~~~~G~~w~d~~~~~WvnP~~~evw~Y~i~IA~Eaa~ 135 (316)
T PF13200_consen 91 EAHPEWAVKTKDGSVWRDNEGEAWVNPYSKEVWDYNIDIAKEAAK 135 (316)
T ss_pred hhChhhEEECCCCCcccCCCCCccCCCCCHHHHHHHHHHHHHHHH
Confidence 114555443222222221 1234677888888888888775
No 88
>COG3589 Uncharacterized conserved protein [Function unknown]
Probab=85.70 E-value=1.8 Score=42.36 Aligned_cols=73 Identities=15% Similarity=0.070 Sum_probs=56.7
Q ss_pred eeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 93 VSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 93 ~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
+|-++.+.|.+.+.=..-|++|...|+..|-| +.|.|++.. -....-+..+++.|++.|++||+...|-|.-+
T Consensus 4 ~GfSifp~~~~~~~~~~Yi~~~~~~Gf~~IFt----sl~~~~~~~--~~~~~~~~ell~~Anklg~~vivDvnPsil~~ 76 (360)
T COG3589 4 LGFSIFPNRSPKEKDIAYIDRMHKYGFKRIFT----SLLIPEEDA--ELYFHRFKELLKEANKLGLRVIVDVNPSILKE 76 (360)
T ss_pred eeEEeccCCCcchhHHHHHHHHHHcCccceee----ecccCCchH--HHHHHHHHHHHHHHHhcCcEEEEEcCHHHHhh
Confidence 45566667888888889999999999999999 477777531 12233477779999999999999998876654
No 89
>PF02679 ComA: (2R)-phospho-3-sulfolactate synthase (ComA); InterPro: IPR003830 Methanogenic archaea produce methane via the anaerobic reduction of acetate or single carbon compounds []. Coenzyme M (CoM; 2-mercaptoethanesulphonic acid) serves as the terminal methyl carrier for this process. Previously thought to be unique to methanogenic archaea, CoM has also been found in methylotrophic bacteria. Biosynthesis of CoM begins with the Michael addition of sulphite to phosphoenolpyruvate, forming 2-phospho-3-sulpholactate (PSL). This reaction is catalyzed by members of this family, PSL synthase (ComA) []. Subsequently, PSL is dephosphorylated by phosphosulpholactate phosphatase (ComB) to form 3-sulpholactate [], which is then converted to 3-sulphopyruvate by L-sulpholactate dehydrogenase (ComC; 1.1.1.272 from EC) []. Sulphopyruvate decarboxylase (ComDE; 4.1.1.79 from EC) converts 3-sulphopyruvate to sulphoacetaldehyde []. Reductive thiolation of sulphoacetaldehyde is the final step.; GO: 0019295 coenzyme M biosynthetic process; PDB: 1U83_A 1QWG_A.
Probab=85.67 E-value=1.4 Score=41.24 Aligned_cols=52 Identities=17% Similarity=0.300 Sum_probs=38.6
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
+...++-++.+|++||++|++. .|..+.+- ++..++|+.|+++|++|+-..|
T Consensus 83 q~~~~~yl~~~k~lGf~~IEiS---------dGti~l~~-~~r~~~I~~~~~~Gf~v~~EvG 134 (244)
T PF02679_consen 83 QGKFDEYLEECKELGFDAIEIS---------DGTIDLPE-EERLRLIRKAKEEGFKVLSEVG 134 (244)
T ss_dssp TT-HHHHHHHHHHCT-SEEEE-----------SSS---H-HHHHHHHHHHCCTTSEEEEEES
T ss_pred cChHHHHHHHHHHcCCCEEEec---------CCceeCCH-HHHHHHHHHHHHCCCEEeeccc
Confidence 5678899999999999999994 34444443 4567889999999999999986
No 90
>TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase. Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
Probab=85.37 E-value=1.7 Score=45.15 Aligned_cols=60 Identities=12% Similarity=0.111 Sum_probs=42.8
Q ss_pred hcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeec----------ccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 105 GRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYH----------YDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fd----------f~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
.-+.+.|.-+|++|+|+|-+ .++=+... ..-|| |...+++.++++.|+++||+||+..=+
T Consensus 27 ~gi~~~l~yl~~lG~~~i~l~Pi~~~~~~--~~gY~~~d~~~id~~~Gt~~~~~~lv~~ah~~gi~vilD~v~ 97 (543)
T TIGR02403 27 RGIIEKLDYLKKLGVDYIWLNPFYVSPQK--DNGYDVSDYYAINPLFGTMADFEELVSEAKKRNIKIMLDMVF 97 (543)
T ss_pred HHHHHhHHHHHHcCCCEEEECCcccCCCC--CCCCCccccCccCcccCCHHHHHHHHHHHHHCCCEEEEEECc
Confidence 45678899999999999999 44322110 11232 334578999999999999999998533
No 91
>PRK09505 malS alpha-amylase; Reviewed
Probab=84.99 E-value=1.8 Score=46.27 Aligned_cols=58 Identities=17% Similarity=0.141 Sum_probs=41.0
Q ss_pred HHHHHHHHHHhCCCEEEE-EEeCCCcccC----CC-----------e-------ecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 107 WCWIMRAMRAAGLNALST-YVEWRSHEVH----PG-----------H-------YHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~----~G-----------~-------fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
+.+.|.-+|++|+|+|-+ .++=+.|... .| . -.|...++++++++.|+++||+||+.
T Consensus 232 i~~kLdyl~~LGv~aIwlsPi~~~~~~~~~~g~~g~~~~~~yhgY~~~D~~~id~~~Gt~~dfk~Lv~~aH~~Gi~VilD 311 (683)
T PRK09505 232 LTEKLDYLQQLGVNALWISSPLEQIHGWVGGGTKGDFPHYAYHGYYTLDWTKLDANMGTEADLRTLVDEAHQRGIRILFD 311 (683)
T ss_pred HHHhhHHHHHcCCCEEEeCccccccccccccccccCCCcCCCCCCCccccccCCCCCCCHHHHHHHHHHHHHCCCEEEEE
Confidence 567899999999999998 4432322211 11 1 12334579999999999999999987
Q ss_pred c
Q psy13819 164 P 164 (323)
Q Consensus 164 ~ 164 (323)
.
T Consensus 312 ~ 312 (683)
T PRK09505 312 V 312 (683)
T ss_pred E
Confidence 3
No 92
>PF02837 Glyco_hydro_2_N: Glycosyl hydrolases family 2, sugar binding domain; InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=84.80 E-value=1.2 Score=38.02 Aligned_cols=40 Identities=18% Similarity=0.358 Sum_probs=27.9
Q ss_pred ceeEEEEcCcccccccCCCCCeeEEEeccccccCCc-ceEEEE
Q psy13819 5 SKGVLFANGRHWGRYWSEVGPQYSLFVPEEFLKVGT-NRITIF 46 (323)
Q Consensus 5 ~kg~~~vng~~lGryw~~~gpq~~lyvP~~~l~~g~-N~i~i~ 46 (323)
....|||||+.||+..... ....+=|+ +.|++|. |+|.|.
T Consensus 96 ~~a~v~vNG~~vg~~~~~~-~~~~~dIt-~~l~~g~~N~l~V~ 136 (167)
T PF02837_consen 96 YAAEVYVNGKLVGSHEGGY-TPFEFDIT-DYLKPGEENTLAVR 136 (167)
T ss_dssp SEEEEEETTEEEEEEESTT-S-EEEECG-GGSSSEEEEEEEEE
T ss_pred EeeEEEeCCeEEeeeCCCc-CCeEEeCh-hhccCCCCEEEEEE
Confidence 3678999999999976322 23333375 4799998 888873
No 93
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=84.75 E-value=4.1 Score=38.02 Aligned_cols=51 Identities=14% Similarity=0.248 Sum_probs=41.8
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
...++-++.+|++||++|++. .|..+.+- ++..++|+.++++|++|+-..|
T Consensus 71 ~~~~~Yl~~~k~lGf~~IEiS---------~G~~~i~~-~~~~rlI~~~~~~g~~v~~EvG 121 (237)
T TIGR03849 71 GKFDEYLNECDELGFEAVEIS---------DGSMEISL-EERCNLIERAKDNGFMVLSEVG 121 (237)
T ss_pred hhHHHHHHHHHHcCCCEEEEc---------CCccCCCH-HHHHHHHHHHHhCCCeEecccc
Confidence 567788889999999999993 45555543 4577899999999999999887
No 94
>TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA.
Probab=84.62 E-value=28 Score=31.74 Aligned_cols=43 Identities=16% Similarity=0.153 Sum_probs=34.7
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
.+++.+++++++|++.|++..++ ..++..+.++++++||.+..
T Consensus 15 ~l~e~~~~~~e~G~~~vEl~~~~--------------~~~~~~l~~~l~~~gl~v~~ 57 (254)
T TIGR03234 15 PFLERFAAAAQAGFTGVEYLFPY--------------DWDAEALKARLAAAGLEQVL 57 (254)
T ss_pred CHHHHHHHHHHcCCCEEEecCCc--------------cCCHHHHHHHHHHcCCeEEE
Confidence 57899999999999999995321 12477789999999999774
No 95
>smart00812 Alpha_L_fucos Alpha-L-fucosidase. O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site PUBMED:. Because the fold of proteins is better conserved than their sequences, some of the families can be grouped in 'clans'. Family 29 encompasses alpha-L-fucosidases, which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Deficiency of alpha-L-fucosidase results in the lysosomal storage disease fucosidosis.
Probab=84.43 E-value=8.7 Score=38.30 Aligned_cols=106 Identities=14% Similarity=0.171 Sum_probs=61.8
Q ss_pred CCCCChhcHHHHHHHHHHhCCCEEEE-------EEeCCCcccCCCeeccc-ccchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 99 YFRAPPGRWCWIMRAMRAAGLNALST-------YVEWRSHEVHPGHYHYD-GHRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 99 y~r~p~~~W~~~l~~mk~~G~N~Vr~-------yv~W~~hEP~~G~fdf~-g~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
+.+..++.| ++.+|++|+..|=+ +-.|+.....-..-+-. ..|=+..+.+.|+++||++-+=-.+ -
T Consensus 78 p~~fD~~~W---a~~~k~AGakY~vlTaKHHDGF~lw~S~~t~~n~~~~~pkrDiv~el~~A~rk~Glk~G~Y~S~---~ 151 (384)
T smart00812 78 AEKFDPEEW---ADLFKKAGAKYVVLTAKHHDGFCLWDSKYSNWNAVDTGPKRDLVGELADAVRKRGLKFGLYHSL---F 151 (384)
T ss_pred chhCCHHHH---HHHHHHcCCCeEEeeeeecCCccccCCCCCCCcccCCCCCcchHHHHHHHHHHcCCeEEEEcCH---H
Confidence 334556666 67889999987755 34566544332222222 3455778899999999988774433 2
Q ss_pred ccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 171 KRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 171 Ew~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
+|.+ |.|... .+.-....+.+.|.+++..+..+|.+.|..
T Consensus 152 DW~~---p~y~~~-~~~~~~~~~~~~~~~y~~~~~~Ql~ELit~ 191 (384)
T smart00812 152 DWFN---PLYAGP-TSSDEDPDNWPRFQEFVDDWLPQLRELVTR 191 (384)
T ss_pred HhCC---Cccccc-cccccccccchhHHHHHHHHHHHHHHHHhc
Confidence 5653 443221 111112234567788777777776666543
No 96
>TIGR02103 pullul_strch alpha-1,6-glucosidases, pullulanase-type. Members of this protein family include secreted (or membrane-anchored) pullulanases of Gram-negative bacteria and pullulanase-type starch debranching enzymes of plants. Both enzymes hydrolyze alpha-1,6 glycosidic linkages. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family is closely homologous to, but architecturally different from, the Gram-positive pullulanases of Gram-positive bacteria (TIGR02102).
Probab=83.77 E-value=19 Score=39.95 Aligned_cols=21 Identities=10% Similarity=0.289 Sum_probs=18.8
Q ss_pred chHHHHHHHHHHCCCeEEEcc
Q psy13819 144 RDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++.++++.|+++||+||+..
T Consensus 404 ~Efk~mV~alH~~Gi~VIlDV 424 (898)
T TIGR02103 404 KEFREMVQALNKTGLNVVMDV 424 (898)
T ss_pred HHHHHHHHHHHHCCCEEEEEe
Confidence 478999999999999999974
No 97
>PLN02361 alpha-amylase
Probab=83.10 E-value=2.4 Score=42.49 Aligned_cols=57 Identities=7% Similarity=-0.042 Sum_probs=40.7
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-CCeec-------ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYH-------YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fd-------f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.|.-++++|+++|-+.-+.....+. =...| |...++|.++++.|+++||+||+..
T Consensus 32 ~~kl~~l~~lG~t~iwl~P~~~~~~~~GY~~~d~y~~~~~~Gt~~el~~li~~~h~~gi~vi~D~ 96 (401)
T PLN02361 32 EGKVPDLAKSGFTSAWLPPPSQSLAPEGYLPQNLYSLNSAYGSEHLLKSLLRKMKQYNVRAMADI 96 (401)
T ss_pred HHHHHHHHHcCCCEEEeCCCCcCCCCCCCCcccccccCcccCCHHHHHHHHHHHHHcCCEEEEEE
Confidence 577888999999999995444332221 01222 2334789999999999999999864
No 98
>TIGR02456 treS_nterm trehalose synthase. Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PubMed:15378530).
Probab=83.05 E-value=2.3 Score=44.03 Aligned_cols=58 Identities=14% Similarity=0.161 Sum_probs=41.5
Q ss_pred hcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeec----------ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYH----------YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fd----------f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.-+.+.|.-+|++|+|+|-+ +|+ .... ...-|| |...+++.++++.|+++||+||+..
T Consensus 28 ~gi~~~Ldyl~~LGv~~i~L~Pi~-~~~~-~~~gY~~~dy~~vd~~~Gt~~df~~Lv~~ah~~Gi~vilD~ 96 (539)
T TIGR02456 28 PGLTSKLDYLKWLGVDALWLLPFF-QSPL-RDDGYDVSDYRAILPEFGTIDDFKDFVDEAHARGMRVIIDL 96 (539)
T ss_pred HHHHHhHHHHHHCCCCEEEECCCc-CCCC-CCCCCCcccccccChhhCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 46778899999999999999 433 1110 012222 2334799999999999999999963
No 99
>PF07488 Glyco_hydro_67M: Glycosyl hydrolase family 67 middle domain; InterPro: IPR011100 Alpha-glucuronidases, components of an ensemble of enzymes central to the recycling of photosynthetic biomass, remove the alpha-1,2 linked 4-O-methyl glucuronic acid from xylans. This family represents the central catalytic domain of alpha-glucuronidase [].; GO: 0046559 alpha-glucuronidase activity, 0045493 xylan catabolic process, 0005576 extracellular region; PDB: 1MQP_A 1K9E_A 1MQQ_A 1L8N_A 1K9D_A 1MQR_A 1K9F_A 1GQL_A 1GQI_B 1GQJ_B ....
Probab=82.45 E-value=15 Score=35.75 Aligned_cols=137 Identities=14% Similarity=0.210 Sum_probs=70.5
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
.+...+-.+++...|+|.|.+ |-+...+-.-.=+.+..+.++-+..+.+||+|.|..- ..|--+.||++ .
T Consensus 56 ~~R~~~YARllASiGINgvvl----NNVNa~~~~Lt~~~l~~v~~lAdvfRpYGIkv~LSvn--FasP~~lggL~----T 125 (328)
T PF07488_consen 56 LTRYRDYARLLASIGINGVVL----NNVNANPKLLTPEYLDKVARLADVFRPYGIKVYLSVN--FASPIELGGLP----T 125 (328)
T ss_dssp -HHHHHHHHHHHHTT--EEE-----S-SS--CGGGSTTTHHHHHHHHHHHHHTT-EEEEEE---TTHHHHTTS-S-----
T ss_pred hhHHHHHHHHHhhcCCceEEe----cccccChhhcCHHHHHHHHHHHHHHhhcCCEEEEEee--ccCCcccCCcC----c
Confidence 467788888999999999998 5555554333334567888999999999999998650 00111123332 1
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCC--CCChHHHHHHHHHHHHHhcCCceEEE
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSD--AECDPAHAVWLRDLLRTYVQDKAVLY 261 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~--~~~~~~Y~~~l~~~~~~~~g~~~~l~ 261 (323)
.+| .| ..+.+|+++-+..|-.+.=.=+|=++=..-|.++|-+ +..-.+=...|+++++-+ |. ++++
T Consensus 126 aDP------ld----~~V~~WW~~k~~eIY~~IPDfgGflVKAdSEGqPGP~~YgRthAdGANmlA~Al~P~-GG-~V~w 193 (328)
T PF07488_consen 126 ADP------LD----PEVRQWWKDKADEIYSAIPDFGGFLVKADSEGQPGPFTYGRTHADGANMLARALKPH-GG-IVIW 193 (328)
T ss_dssp --T------TS----HHHHHHHHHHHHHHHHH-TT--EEEE--SBTTB--GGGGT--HHHHHHHHHHHHGGG-T--EEEE
T ss_pred CCC------CC----HHHHHHHHHHHHHHHHhCCCccceEEEecCCCCCCCcccCCCchhhHHHHHHHhhcc-CC-EEEE
Confidence 122 33 4666677666666554422224445555667777753 321233345577777776 43 4544
Q ss_pred e
Q psy13819 262 S 262 (323)
Q Consensus 262 t 262 (323)
.
T Consensus 194 R 194 (328)
T PF07488_consen 194 R 194 (328)
T ss_dssp E
T ss_pred E
Confidence 4
No 100
>cd04908 ACT_Bt0572_1 N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains. Included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains as seen in the uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related ACT domains. These tandem ACT domain proteins belong to the superfamily of ACT regulatory domains.
Probab=82.22 E-value=6 Score=28.64 Aligned_cols=55 Identities=18% Similarity=0.201 Sum_probs=43.7
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
|....+.++.+.+.|+|...+|++ .++. ++.+-+.. +|.++..+..+++|..|.+
T Consensus 12 pG~La~v~~~l~~~~inI~~i~~~--~~~~-~~~~rl~~-~~~~~~~~~L~~~G~~v~~ 66 (66)
T cd04908 12 PGRLAAVTEILSEAGINIRALSIA--DTSE-FGILRLIV-SDPDKAKEALKEAGFAVKL 66 (66)
T ss_pred CChHHHHHHHHHHCCCCEEEEEEE--ecCC-CCEEEEEE-CCHHHHHHHHHHCCCEEEC
Confidence 456788899999999999999873 2333 58887766 6678999999999988753
No 101
>TIGR02100 glgX_debranch glycogen debranching enzyme GlgX. This family consists of the GlgX protein from the E. coli glycogen operon and probable equivalogs from other prokaryotic species. GlgX is not required for glycogen biosynthesis, but instead acts as a debranching enzyme for glycogen catabolism. This model distinguishes GlgX from pullanases and other related proteins that also operate on alpha-1,6-glycosidic linkages. In the wide band between the trusted and noise cutoffs are functionally similar enzymes, mostly from plants, that act similarly but usually are termed isoamylase.
Probab=81.88 E-value=2.1 Score=45.85 Aligned_cols=55 Identities=11% Similarity=0.247 Sum_probs=37.0
Q ss_pred HHHHHHHhCCCEEEE-EEeCCCccc---CCC-----ee---cc----------cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 110 IMRAMRAAGLNALST-YVEWRSHEV---HPG-----HY---HY----------DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~-yv~W~~hEP---~~G-----~f---df----------~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.|.-+|++|+|+|.+ +|+=...++ +.| -| || ...+++.++++.|+++||+|||..
T Consensus 189 ~LdyLk~LGvtaI~L~Pi~~~~~~~~~~~~~~~~ywGYd~~~y~a~d~~y~~~g~~~efk~LV~~~H~~GI~VIlDv 265 (688)
T TIGR02100 189 MIDYLKKLGVTAVELLPVHAFIDDRHLLEKGLRNYWGYNTLGFFAPEPRYLASGQVAEFKTMVRALHDAGIEVILDV 265 (688)
T ss_pred hhHHHHHcCCCEEEECCcccCCccccccccCCCCccCcCcccccccChhhcCCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 478899999999999 543111111 111 01 11 124689999999999999999984
No 102
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=81.65 E-value=6.4 Score=37.54 Aligned_cols=70 Identities=17% Similarity=0.122 Sum_probs=50.6
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
...+..++.++++|+.|+.+=.+.+-....... -+.|+|+ -.-|..++++.+++.|+++++..-|+|+..
T Consensus 21 ~~~~~v~~~~~~~~~~~iP~d~~~lD~~w~~~~~~~~f~~d~~~FPd~~~~i~~l~~~G~~~~~~~~P~i~~~ 93 (308)
T cd06593 21 YDEEEVNEFADGMRERNLPCDVIHLDCFWMKEFQWCDFEFDPDRFPDPEGMLSRLKEKGFKVCLWINPYIAQK 93 (308)
T ss_pred CCHHHHHHHHHHHHHcCCCeeEEEEecccccCCcceeeEECcccCCCHHHHHHHHHHCCCeEEEEecCCCCCC
Confidence 346778999999999996665544443333322 1345554 345899999999999999999988998753
No 103
>PRK03705 glycogen debranching enzyme; Provisional
Probab=80.70 E-value=2.5 Score=45.02 Aligned_cols=55 Identities=16% Similarity=0.288 Sum_probs=36.7
Q ss_pred HHHHHHHhCCCEEEE-EEeCCCcccCC---C-----eec----------cc-----ccchHHHHHHHHHHCCCeEEEcc
Q psy13819 110 IMRAMRAAGLNALST-YVEWRSHEVHP---G-----HYH----------YD-----GHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~-yv~W~~hEP~~---G-----~fd----------f~-----g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.|.-+|++|+|+|.+ +|+=..-++.. | -|| |. ..+++.++++.|+++||+|||..
T Consensus 184 ~LdYLk~LGvt~I~L~Pv~~~~~~~~~~~~g~~~ywGYd~~~yfa~d~~ygt~~~~~~~efk~LV~~~H~~GI~VIlDv 262 (658)
T PRK03705 184 MIAYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPLAMFALDPAYASGPETALDEFRDAVKALHKAGIEVILDV 262 (658)
T ss_pred chHHHHHcCCCEEEecCcccCCCcccccccccccccCcccccccccccccCCCCcchHHHHHHHHHHHHHCCCEEEEEE
Confidence 488999999999999 44311111110 0 011 11 12479999999999999999974
No 104
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20). The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits. Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff. Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in
Probab=79.94 E-value=63 Score=31.60 Aligned_cols=68 Identities=16% Similarity=0.176 Sum_probs=48.9
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEEe----CCCcccC------CCeecccc---cchHHHHHHHHHHCCCeEE
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVE----WRSHEVH------PGHYHYDG---HRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~----W~~hEP~------~G~fdf~g---~~dl~~fl~~a~e~GL~vi 161 (323)
.+-|+ .|.+..++.|+.|....+|++.+++. |..--+. .|.|.-.+ .+|+..+++.|+++||.||
T Consensus 10 vaR~f--~~~~~ik~~Id~ma~~KlN~lh~HltDd~~~rle~~~~P~Lt~~ga~~~~~~YT~~di~eiv~yA~~rgI~vI 87 (348)
T cd06562 10 TSRHF--LSVDSIKRTIDAMAYNKLNVLHWHITDSQSFPLESPSYPELSKKGAYSPSEVYTPEDVKEIVEYARLRGIRVI 87 (348)
T ss_pred ccccC--CCHHHHHHHHHHHHHhCCcEEEEeEEcCCCceEeeCCCchhhhccCcCCCceECHHHHHHHHHHHHHcCCEEE
Confidence 44455 46799999999999999999999864 5443221 23332111 3568888999999999999
Q ss_pred Ecc
Q psy13819 162 LRP 164 (323)
Q Consensus 162 lr~ 164 (323)
..+
T Consensus 88 PEI 90 (348)
T cd06562 88 PEI 90 (348)
T ss_pred Eec
Confidence 764
No 105
>PRK14510 putative bifunctional 4-alpha-glucanotransferase/glycogen debranching enzyme; Provisional
Probab=79.68 E-value=2.9 Score=47.65 Aligned_cols=56 Identities=14% Similarity=0.237 Sum_probs=39.0
Q ss_pred HHHHHHHHhCCCEEEE-EEeCCCcccC---CCe-----ec----------cc--ccchHHHHHHHHHHCCCeEEEcc
Q psy13819 109 WIMRAMRAAGLNALST-YVEWRSHEVH---PGH-----YH----------YD--GHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~-yv~W~~hEP~---~G~-----fd----------f~--g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..|.-+|++|+|+|.+ .|+=...|.. .|. || |. ..+++.++++.|+++||+|||..
T Consensus 191 ~~i~yLk~LGvt~I~L~Pi~~~~~~~~~~~~g~~~yWGY~~~~yfa~dp~yg~~~~~efk~lV~~~H~~GI~VILDv 267 (1221)
T PRK14510 191 EAISYLKKLGVSIVELNPIFASVDEHHLPQLGLSNYWGYNTVAFLAPDPRLAPGGEEEFAQAIKEAQSAGIAVILDV 267 (1221)
T ss_pred hhHHHHHHcCCCEEEeCCccccCcccccccccCcCcCCCCCCCCCCcChhhccCcHHHHHHHHHHHHHCCCEEEEEE
Confidence 4566999999999999 5542222211 010 22 22 55789999999999999999984
No 106
>PF14307 Glyco_tran_WbsX: Glycosyltransferase WbsX
Probab=79.41 E-value=41 Score=32.81 Aligned_cols=132 Identities=17% Similarity=0.203 Sum_probs=73.7
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHH---HCCCeEEEccCCccccccCCCCCcc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAV---EEDLYILLRPGPFICGKRDFGGFPP 179 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~---e~GL~vilr~GPyi~aEw~~gg~P~ 179 (323)
.++.++.-++++|+.|++..-+|-.|.. |..=|++-++..- +.+|...|.- .+-.|.. .
T Consensus 56 ~p~v~~~Q~~lA~~~GI~gF~~~~Ywf~-----------gk~lLe~p~~~~l~~~~~d~pFcl~W---AN~~w~~----~ 117 (345)
T PF14307_consen 56 DPEVMEKQAELAKEYGIDGFCFYHYWFN-----------GKRLLEKPLENLLASKEPDFPFCLCW---ANENWTR----R 117 (345)
T ss_pred CHHHHHHHHHHHHHhCCCEEEEEeeecC-----------CchHHHHHHHHHHhcCCCCCcEEEEE---CCChhhh----c
Confidence 3678899999999999999999988863 3333555554443 3445444432 1111211 2
Q ss_pred cccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh--hhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCc
Q psy13819 180 WLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK--YLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDK 257 (323)
Q Consensus 180 Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~--~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~ 257 (323)
|-.. ...+.+....+. .+..+++++.|++..++ ++-.+|.||+++=--... .+-+.+++.+++.+++. |.+
T Consensus 118 w~g~-~~~~l~~q~y~~-~~d~~~~~~~l~~~F~D~rYikVdGKPv~~Iy~p~~~----pd~~~~~~~wr~~a~~~-G~~ 190 (345)
T PF14307_consen 118 WDGR-NNEILIEQKYSG-EDDWKEHFRYLLPYFKDPRYIKVDGKPVFLIYRPGDI----PDIKEMIERWREEAKEA-GLP 190 (345)
T ss_pred cCCC-CccccccccCCc-hhHHHHHHHHHHHHhCCCCceeECCEEEEEEECcccc----cCHHHHHHHHHHHHHHc-CCC
Confidence 3211 122222211110 12235555666666664 444578888866422222 23578999999999984 766
Q ss_pred eE
Q psy13819 258 AV 259 (323)
Q Consensus 258 ~~ 259 (323)
-+
T Consensus 191 gi 192 (345)
T PF14307_consen 191 GI 192 (345)
T ss_pred ce
Confidence 43
No 107
>PLN00196 alpha-amylase; Provisional
Probab=79.25 E-value=4.5 Score=40.88 Aligned_cols=57 Identities=12% Similarity=0.163 Sum_probs=40.6
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC---CC-eecc-----cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH---PG-HYHY-----DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~---~G-~fdf-----~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.|.-+|++|+++|-+.-+....... +. -|+. ...+++.++++.|+++||+||+..
T Consensus 47 ~~kldyL~~LGvtaIWL~P~~~s~s~hGY~~~D~y~ld~~~fGt~~elk~Lv~~aH~~GIkVilDv 112 (428)
T PLN00196 47 MGKVDDIAAAGITHVWLPPPSHSVSEQGYMPGRLYDLDASKYGNEAQLKSLIEAFHGKGVQVIADI 112 (428)
T ss_pred HHHHHHHHHcCCCEEEeCCCCCCCCCCCCCccccCCCCcccCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 688899999999999994443332221 11 1333 223689999999999999999873
No 108
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=79.10 E-value=6 Score=37.86 Aligned_cols=68 Identities=13% Similarity=0.218 Sum_probs=52.7
Q ss_pred CCChhcHHHHHHHHHHhCCC--EEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLN--ALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N--~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
..+.+..++.++++++.|+. +|-+-..|- ..-|.|.|+ ..-|..++++..++.|+++++..=|+|+.+
T Consensus 26 ~~s~~~v~~~~~~~~~~~iP~d~i~iD~~w~---~~~g~f~~d~~~FPdp~~mi~~l~~~G~k~~l~i~P~i~~~ 97 (303)
T cd06592 26 DINQETVLNYAQEIIDNGFPNGQIEIDDNWE---TCYGDFDFDPTKFPDPKGMIDQLHDLGFRVTLWVHPFINTD 97 (303)
T ss_pred CcCHHHHHHHHHHHHHcCCCCCeEEeCCCcc---ccCCccccChhhCCCHHHHHHHHHHCCCeEEEEECCeeCCC
Confidence 45677889999999999965 555544553 345766664 356899999999999999999988998864
No 109
>TIGR02401 trehalose_TreY malto-oligosyltrehalose synthase. This enzyme, formally named (1-4)-alpha-D-glucan 1-alpha-D-glucosylmutase, is the TreY enzyme of the TreYZ pathway of trehalose biosynthesis, an alternative to the OtsAB pathway. Trehalose may be incorporated into more complex compounds but is best known as compatible solute. It is one of the most effective osmoprotectants, and unlike the various betaines does not require nitrogen for its synthesis.
Probab=79.05 E-value=4.2 Score=44.46 Aligned_cols=63 Identities=10% Similarity=0.073 Sum_probs=45.4
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCC---Ceec-------ccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHP---GHYH-------YDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~---G~fd-------f~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
-+.+.+.|.-++++|+++|-+.-.+....... ...| |.+.+++.++++.|+++||+||+..=|
T Consensus 15 f~~~~~~L~YL~~LGv~~V~lsPi~~a~~gs~hGYdv~D~~~idp~lGt~edf~~Lv~aah~~Gm~vIlDiVp 87 (825)
T TIGR02401 15 FDDAAALLPYLKSLGVSHLYLSPILTAVPGSTHGYDVVDHSEINPELGGEEGLRRLSEAARARGLGLIVDIVP 87 (825)
T ss_pred HHHHHHhhHHHHHcCCCEEEeCcCccCCCCCCCCCCCCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 35789999999999999999943332211110 1223 335789999999999999999998644
No 110
>PRK14511 maltooligosyl trehalose synthase; Provisional
Probab=78.44 E-value=4.2 Score=44.68 Aligned_cols=61 Identities=11% Similarity=0.092 Sum_probs=45.9
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCC------eecc-------cccchHHHHHHHHHHCCCeEEEccCCc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPG------HYHY-------DGHRDIEHFLQLAVEEDLYILLRPGPF 167 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G------~fdf-------~g~~dl~~fl~~a~e~GL~vilr~GPy 167 (323)
-+.+.+.+.-++++|+|+|-+.-.. +..+| ..|| .+.+++.++++.|+++||+||+..=|-
T Consensus 19 f~~~~~~l~YL~~LGis~IyLsPi~---~a~~gs~hGYdv~D~~~idp~lGt~e~f~~Lv~aah~~Gi~VIlDiV~N 92 (879)
T PRK14511 19 FDDAAELVPYFADLGVSHLYLSPIL---AARPGSTHGYDVVDHTRINPELGGEEGLRRLAAALRAHGMGLILDIVPN 92 (879)
T ss_pred HHHHHHHhHHHHHcCCCEEEECcCc---cCCCCCCCCCCcCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEeccc
Confidence 4678999999999999999994332 22222 2233 356799999999999999999986553
No 111
>PRK12677 xylose isomerase; Provisional
Probab=78.05 E-value=31 Score=34.40 Aligned_cols=101 Identities=12% Similarity=0.077 Sum_probs=62.1
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc---ccchHHHHHHHHHHCCCeEE-EccCCccccccCCCCCcc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD---GHRDIEHFLQLAVEEDLYIL-LRPGPFICGKRDFGGFPP 179 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~---g~~dl~~fl~~a~e~GL~vi-lr~GPyi~aEw~~gg~P~ 179 (323)
+-.+.+.+++++++|+..|++. .+..--|+.+ ....+.++.+++++.||.|. +.+. ....|.
T Consensus 30 ~~~~~E~v~~~a~~Gf~gVElh------~~~l~p~~~~~~~~~~~~~~lk~~l~~~GL~v~~v~~n--------~f~~p~ 95 (384)
T PRK12677 30 PLDPVEAVHKLAELGAYGVTFH------DDDLVPFGATDAERDRIIKRFKKALDETGLVVPMVTTN--------LFTHPV 95 (384)
T ss_pred CCCHHHHHHHHHHhCCCEEEec------ccccCCCCCChhhhHHHHHHHHHHHHHcCCeeEEEecC--------CCCCcc
Confidence 4467899999999999999883 1111112222 11248889999999999977 4442 111222
Q ss_pred cccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEE
Q psy13819 180 WLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILV 226 (323)
Q Consensus 180 Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~ 226 (323)
+. + ..+-+.++..++.+.+.+++.++..+.+ +-+.|+.|
T Consensus 96 ~~---~--g~lts~d~~~R~~Ai~~~~r~IdlA~eL---Ga~~Vvv~ 134 (384)
T PRK12677 96 FK---D--GAFTSNDRDVRRYALRKVLRNIDLAAEL---GAKTYVMW 134 (384)
T ss_pred cc---C--CcCCCCCHHHHHHHHHHHHHHHHHHHHh---CCCEEEEe
Confidence 21 1 1244557788888777787777765543 23446555
No 112
>PRK14507 putative bifunctional 4-alpha-glucanotransferase/malto-oligosyltrehalose synthase; Provisional
Probab=77.74 E-value=4.1 Score=47.78 Aligned_cols=61 Identities=20% Similarity=0.178 Sum_probs=46.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCC------eec-------ccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPG------HYH-------YDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G------~fd-------f~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
+-+.|.+.|.-+|++|+|+|-+.-.+. ..+| .-| |.+.+++.++++.|+++||+||+..=|
T Consensus 756 tf~~~~~~l~Yl~~LGv~~i~lsPi~~---a~~gs~hGYdv~D~~~idp~lG~~edf~~Lv~~ah~~Gi~vilDiV~ 829 (1693)
T PRK14507 756 TFADAEAILPYLAALGISHVYASPILK---ARPGSTHGYDIVDHSQINPEIGGEEGFERFCAALKAHGLGQLLDIVP 829 (1693)
T ss_pred CHHHHHHHhHHHHHcCCCEEEECCCcC---CCCCCCCCCCCCCCCccCcccCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 346799999999999999999943333 2222 222 335678999999999999999998644
No 113
>cd00019 AP2Ec AP endonuclease family 2; These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites; the alignment also contains hexulose-6-phosphate isomerases, enzymes that catalyze the epimerization of D-arabino-6-hexulose 3-phosphate to D-fructose 6-phosphate, via cleaving the phosphoesterbond with the sugar.
Probab=77.66 E-value=60 Score=30.07 Aligned_cols=57 Identities=11% Similarity=0.108 Sum_probs=39.8
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHC-CCeEEEc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEE-DLYILLR 163 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~-GL~vilr 163 (323)
.+ ..|++.++.++++|++.|++.+........ ......+++++.++++++ ++.+.+-
T Consensus 8 ~~-~~l~~~l~~a~~~G~d~vEl~~~~~~~~~~----~~~~~~~~~~l~~~~~~~~~~~i~~~ 65 (279)
T cd00019 8 AG-FGLENALKRAKEIGFDTVAMFLGNPRSWLS----RPLKKERAEKFKAIAEEGPSICLSVH 65 (279)
T ss_pred cc-ccHHHHHHHHHHcCCCEEEEEcCCCCccCC----CCCCHHHHHHHHHHHHHcCCCcEEEE
Confidence 44 789999999999999999997643211111 111336788899999999 6665553
No 114
>COG1099 Predicted metal-dependent hydrolases with the TIM-barrel fold [General function prediction only]
Probab=76.84 E-value=18 Score=33.77 Aligned_cols=116 Identities=16% Similarity=0.188 Sum_probs=70.1
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHH----HHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQ----LAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~----~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
.+||.+|+..|+.+|-+- .|-|.|-..--...+.+++++. .|.+.||++-+..| =.|
T Consensus 14 ~eDlekMa~sGI~~Vit~----AhdP~~~~~~~v~~~h~~rl~~~E~~Ra~~~Gl~~~vavG----------vHP----- 74 (254)
T COG1099 14 FEDLEKMALSGIREVITL----AHDPYPMKTAEVYLDHFRRLLGVEPERAEKAGLKLKVAVG----------VHP----- 74 (254)
T ss_pred HHHHHHHHHhChhhhhhc----ccCCCCcccHHHHHHHHHHHHccchhhHHhhCceeeEEec----------cCC-----
Confidence 689999999999999996 4555555555555555666554 49999999888775 111
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCC--CChHHHHHHHHHHHHHhcCCceEEE
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDA--ECDPAHAVWLRDLLRTYVQDKAVLY 261 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~--~~~~~Y~~~l~~~~~~~~g~~~~l~ 261 (323)
|.-.|. ++.+ +++ +.++ +. +..|+++. |.|... .....+.+.--+++++ +++++++-
T Consensus 75 -------r~iP~e-~~~~---l~~----L~~~-l~-~e~VvAiG---EiGLe~~t~~E~evf~~QL~LA~e-~dvPviVH 133 (254)
T COG1099 75 -------RAIPPE-LEEV---LEE----LEEL-LS-NEDVVAIG---EIGLEEATDEEKEVFREQLELARE-LDVPVIVH 133 (254)
T ss_pred -------CCCCch-HHHH---HHH----HHhh-cc-cCCeeEee---ecccccCCHHHHHHHHHHHHHHHH-cCCcEEEe
Confidence 112222 2222 233 3333 33 44677764 444321 1235666666677777 58888887
Q ss_pred ee
Q psy13819 262 ST 263 (323)
Q Consensus 262 t~ 263 (323)
|-
T Consensus 134 TP 135 (254)
T COG1099 134 TP 135 (254)
T ss_pred CC
Confidence 73
No 115
>PF01261 AP_endonuc_2: Xylose isomerase-like TIM barrel; InterPro: IPR012307 This TIM alpha/beta barrel structure is found in xylose isomerase (P19148 from SWISSPROT) and in endonuclease IV (P12638 from SWISSPROT, 3.1.21.2 from EC). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae []. ; PDB: 3KWS_B 3DX5_A 3CQH_B 3CQI_A 3CQK_A 3CQJ_B 2G0W_B 1DXI_A 2ZDS_D 3TVA_B ....
Probab=76.65 E-value=33 Score=29.51 Aligned_cols=120 Identities=14% Similarity=0.101 Sum_probs=68.6
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccC------CCeecccccchHHHHHHHHHHCCCeEEEc-cCCccccccCCCCC
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVH------PGHYHYDGHRDIEHFLQLAVEEDLYILLR-PGPFICGKRDFGGF 177 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~------~G~fdf~g~~dl~~fl~~a~e~GL~vilr-~GPyi~aEw~~gg~ 177 (323)
...++-.+.+++.|+..+...+.-....+. ... --...+.+.+.+++|++.|...++- +|. .
T Consensus 27 ~~~~~~~~~~~~~gl~i~~~~~~~~~~~~~~~~~~~~~~-r~~~~~~~~~~i~~a~~lg~~~i~~~~g~----------~ 95 (213)
T PF01261_consen 27 DEAEELRRLLEDYGLKIASLHPPTNFWSPDEENGSANDE-REEALEYLKKAIDLAKRLGAKYIVVHSGR----------Y 95 (213)
T ss_dssp HHHHHHHHHHHHTTCEEEEEEEEESSSCTGTTSTTSSSH-HHHHHHHHHHHHHHHHHHTBSEEEEECTT----------E
T ss_pred HHHHHHHHHHHHcCCeEEEEecccccccccccccCcchh-hHHHHHHHHHHHHHHHHhCCCceeecCcc----------c
Confidence 345677778889999977775444443331 112 2233567899999999999987654 331 1
Q ss_pred cccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHH
Q psy13819 178 PPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRT 252 (323)
Q Consensus 178 P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~ 252 (323)
+.+ .. .....-.+.+.+.++++++..+++ | +.+.+||..+........ .+.+.+++++
T Consensus 96 ~~~-~~--------~~~~~~~~~~~~~l~~l~~~a~~~-----g--v~i~lE~~~~~~~~~~~~-~~~~~~~l~~ 153 (213)
T PF01261_consen 96 PSG-PE--------DDTEENWERLAENLRELAEIAEEY-----G--VRIALENHPGPFSETPFS-VEEIYRLLEE 153 (213)
T ss_dssp SSS-TT--------SSHHHHHHHHHHHHHHHHHHHHHH-----T--SEEEEE-SSSSSSSEESS-HHHHHHHHHH
T ss_pred ccc-cC--------CCHHHHHHHHHHHHHHHHhhhhhh-----c--ceEEEecccCccccchhh-HHHHHHHHhh
Confidence 111 01 123455777777888888885543 3 234567887764321100 3445555555
No 116
>PF14701 hDGE_amylase: glucanotransferase domain of human glycogen debranching enzyme
Probab=76.63 E-value=17 Score=36.79 Aligned_cols=81 Identities=19% Similarity=0.345 Sum_probs=51.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEE-EEeCCCcccC----CCeeccc------c----cchHHHHHHHHH-HCCCeEEEccCC
Q psy13819 103 PPGRWCWIMRAMRAAGLNALST-YVEWRSHEVH----PGHYHYD------G----HRDIEHFLQLAV-EEDLYILLRPGP 166 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~----~G~fdf~------g----~~dl~~fl~~a~-e~GL~vilr~GP 166 (323)
|-+.|++.|+.++++|+|+|.+ ++-=-..... ..+..|+ + ..++.+++..++ ++||.+|...
T Consensus 20 ~~~~W~~~l~~~~~~GYNmIHftPlq~~G~S~S~YSI~Dql~~~~~~~~~~~~~~~~~v~~~v~~~~~~~~ll~~~Dv-- 97 (423)
T PF14701_consen 20 PFSDWEKHLKVISEKGYNMIHFTPLQERGESNSPYSIYDQLKFDPDFFPPGKESTFEDVKEFVKEAEKKYGLLSMTDV-- 97 (423)
T ss_pred CHhHHHHHHHHHHHcCCcEEEecccccCCCCCCCccccchhhcChhhcCCCccccHHHHHHHHHHHHHHcCceEEEEE--
Confidence 4469999999999999999998 3322111000 0111111 1 258999999985 7999988765
Q ss_pred ccccccCCCCC-cccccccCCCeee
Q psy13819 167 FICGKRDFGGF-PPWLLKVAPDILL 190 (323)
Q Consensus 167 yi~aEw~~gg~-P~Wl~~~~p~~~~ 190 (323)
-|+|-.. -.||.. .|+.-.
T Consensus 98 ----V~NHtA~nS~Wl~e-HPEagY 117 (423)
T PF14701_consen 98 ----VLNHTANNSPWLRE-HPEAGY 117 (423)
T ss_pred ----eeccCcCCChHHHh-Cccccc
Confidence 2343332 358877 787543
No 117
>PRK12331 oxaloacetate decarboxylase; Provisional
Probab=76.59 E-value=89 Score=31.90 Aligned_cols=56 Identities=7% Similarity=0.049 Sum_probs=46.0
Q ss_pred ecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 97 FHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 97 ~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..|.+.|.+.-+++++++.+.|+..||+..+-|.. +++...++.|+++|+.|.+..
T Consensus 88 ~G~~~~pddvv~~~v~~A~~~Gvd~irif~~lnd~------------~n~~~~v~~ak~~G~~v~~~i 143 (448)
T PRK12331 88 LGYRNYADDVVESFVQKSVENGIDIIRIFDALNDV------------RNLETAVKATKKAGGHAQVAI 143 (448)
T ss_pred cccccCchhhHHHHHHHHHHCCCCEEEEEEecCcH------------HHHHHHHHHHHHcCCeEEEEE
Confidence 34667788888999999999999999998876654 258888999999999876544
No 118
>TIGR02102 pullulan_Gpos pullulanase, extracellular, Gram-positive. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. In contrast, a glycogen debranching enzyme such GlgX, homologous to this family, can release glucose at alpha,1-6 linkages from glycogen first subjected to limit degradation by phosphorylase. Characterized members of this family include a surface-located pullulanase from Streptococcus pneumoniae (PubMed:11083842) and an extracellular bifunctional amylase/pullulanase with C-terminal pullulanase activity (PubMed:8798645).
Probab=76.46 E-value=4.6 Score=45.56 Aligned_cols=21 Identities=5% Similarity=0.219 Sum_probs=19.4
Q ss_pred chHHHHHHHHHHCCCeEEEcc
Q psy13819 144 RDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++.++++.|+++||.|||..
T Consensus 555 ~EfK~LV~alH~~GI~VILDV 575 (1111)
T TIGR02102 555 AEFKNLINEIHKRGMGVILDV 575 (1111)
T ss_pred HHHHHHHHHHHHCCCEEEEec
Confidence 579999999999999999985
No 119
>PF13199 Glyco_hydro_66: Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=75.32 E-value=23 Score=37.20 Aligned_cols=198 Identities=15% Similarity=0.176 Sum_probs=92.3
Q ss_pred hhcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCee---cccc---------cchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHY---HYDG---------HRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~f---df~g---------~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
++.-++.|+.|++..+|.|++ -..|.+|.|-|+.= +-++ ..-+...|+.|++.|++++.=-.=|-+-
T Consensus 117 ~~~~~~~i~~L~~yHIN~~QFYDW~~rH~~Pl~~~~~~~~~~w~D~~~r~i~~~~Vk~yI~~ah~~Gmkam~Ynmiyaa~ 196 (559)
T PF13199_consen 117 AEDIEAEIDQLNRYHINGLQFYDWMYRHHKPLPGTNGQPDQTWTDWANRQISTSTVKDYINAAHKYGMKAMAYNMIYAAN 196 (559)
T ss_dssp HHHHHHHHHHHHHTT--EEEETS--SBTTB-S-SSS-EEE-TT-TTT--EEEHHHHHHHHHHHHHTT-EEEEEEESSEEE
T ss_pred chhHHHHHHHHHhhCcCeEEEEeeccccCCcCCCCCCchhhhhhhhcCCEehHHHHHHHHHHHHHcCcceehhHhhhccc
Confidence 457789999999999999999 79999999987533 3222 1367788999999999998643212111
Q ss_pred cc--CCCCCcccccccCCCe------ee-----------cCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccC--
Q psy13819 171 KR--DFGGFPPWLLKVAPDI------LL-----------RQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVE-- 229 (323)
Q Consensus 171 Ew--~~gg~P~Wl~~~~p~~------~~-----------R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Qie-- 229 (323)
+. ..|-.|.|..-+++.- .+ =..++.++++...=+++.++.+. + ..+-+=|+.
T Consensus 197 ~~~~~~gv~~eW~ly~d~~~~~~~~~~l~~~w~s~lyl~dP~N~~WQ~yI~~q~~~~~~~~g---F---DG~hlDq~G~~ 270 (559)
T PF13199_consen 197 NNYEEDGVSPEWGLYKDDSHSNQDTYDLPDGWPSDLYLMDPGNPEWQNYIINQMNKAIQNFG---F---DGWHLDQLGNR 270 (559)
T ss_dssp TT--S--SS-GGBEEESSSBTSB-EEEETT-E--EEEEB-TT-HHHHHHHHHHHHHHHHHHT--------EEEEE-S--E
T ss_pred cCcccccCCchhhhhhccCCCccceeecCcccccceEEecCCCHHHHHHHHHHHHHHHHccC---C---ceEeeeccCCC
Confidence 11 2455688865433321 01 13466666666555555444421 1 112222332
Q ss_pred -CCcCCCC----CChHHHHHHHHHHHHHhc-CCceEEEeeCCCCcccccc--ccCceEEeecCCCCC---chhHHHHHHH
Q psy13819 230 -NEYGSDA----ECDPAHAVWLRDLLRTYV-QDKAVLYSTDGAFDAYLRC--TVDGVYSTVDFTVFK---DVNVSFQAQR 298 (323)
Q Consensus 230 -NE~g~~~----~~~~~Y~~~l~~~~~~~~-g~~~~l~t~dg~~~~~~~~--~~~g~~~~~nfg~~~---~~~~~~~~~r 298 (323)
+.|...+ .-...|...|.++-.+ + +...++-..++-....+.. ...=+|. .=+.+.+ +.....+..|
T Consensus 271 ~~~~d~~G~~i~~l~~~y~~Fi~~~K~~-~~~k~lv~N~V~~~g~~~~a~~~~~d~lY~-EvW~~~~~Y~~Lk~~i~~~r 348 (559)
T PF13199_consen 271 GTVYDYDGNKIYDLSDGYASFINAMKEA-LPDKYLVFNAVSGYGIEQIAKTSKVDFLYN-EVWDDYDTYGDLKRIIDQNR 348 (559)
T ss_dssp EEEGGTT---GGECHHHHHHHHHHHHHH-STTSEEEEB-GGGTTHHHHTT-S--SSEEE-E--SBS-BHHHHHHHHHHHH
T ss_pred CccccCCCCCchhhHHHHHHHHHHHHHh-CCCCceeeeccCccchhhhhcccccceeee-ecccccccHHHHHHHHHHHh
Confidence 1121112 1245677666666444 4 3334444444433211111 1111332 2223222 3455666677
Q ss_pred hh--CCCCCeEEE
Q psy13819 299 TR--APQGPLVNA 309 (323)
Q Consensus 299 ~~--~p~~Plm~~ 309 (323)
+. ..++++++.
T Consensus 349 ~~~~~~gk~~V~A 361 (559)
T PF13199_consen 349 KYTSSGGKSTVVA 361 (559)
T ss_dssp HHH---S--EEEE
T ss_pred hhhccccchhhhH
Confidence 77 456777764
No 120
>COG2342 Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase [Carbohydrate transport and metabolism]
Probab=74.16 E-value=68 Score=30.89 Aligned_cols=183 Identities=13% Similarity=0.153 Sum_probs=105.5
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE--ccCC------ccccccCCCCCcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL--RPGP------FICGKRDFGGFPP 179 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil--r~GP------yi~aEw~~gg~P~ 179 (323)
..++..+.+-++.++-+..+ .-|.|+-.+. ..=|+..++.|.++|. -+|- |..-.|. -+.|.
T Consensus 33 d~~~~~i~~~~f~llVVDps------~~g~~~~~~~---~eelr~~~~gg~~pIAYlsIg~ae~yR~Ywd~~w~-~~~p~ 102 (300)
T COG2342 33 DAYINEILNSPFDLLVVDPS------YCGPFNTPWT---IEELRTKADGGVKPIAYLSIGEAESYRFYWDKYWL-TGRPD 102 (300)
T ss_pred cchHHHHhcCCCcEEEEecc------ccCCCCCcCc---HHHHHHHhcCCeeEEEEEechhhhhhhhHhhhhhh-cCCcc
Confidence 35677888888888876433 3355554443 2337888999977774 3432 3334443 37899
Q ss_pred cccccCCCee----ecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCc------CCC-CC----ChHHHHH
Q psy13819 180 WLLKVAPDIL----LRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEY------GSD-AE----CDPAHAV 244 (323)
Q Consensus 180 Wl~~~~p~~~----~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~------g~~-~~----~~~~Y~~ 244 (323)
||.+.+|++. ++.=+|.+++....++++|.+. +=.| |.+ .+--.| +-. +. ...+++.
T Consensus 103 wLg~edP~W~Gny~VkYW~~eWkdii~~~l~rL~d~------GfdG-vyL-D~VD~y~Y~~~~~~~~~~~~~k~m~~~i~ 174 (300)
T COG2342 103 WLGEEDPEWPGNYAVKYWEPEWKDIIRSYLDRLIDQ------GFDG-VYL-DVVDAYWYVEWNDRETGVNAAKKMVKFIA 174 (300)
T ss_pred cccCCCCCCCCCceeeccCHHHHHHHHHHHHHHHHc------cCce-EEE-eeechHHHHHHhcccccccHHHHHHHHHH
Confidence 9988788653 6677899999998888887776 1111 211 111111 100 01 1257888
Q ss_pred HHHHHHHHhcCCceEEEeeCCCCccccccccCceEEeecC------------CCCCchhHHHHHHHhhC-CCCCeEEEEe
Q psy13819 245 WLRDLLRTYVQDKAVLYSTDGAFDAYLRCTVDGVYSTVDF------------TVFKDVNVSFQAQRTRA-PQGPLVNAEF 311 (323)
Q Consensus 245 ~l~~~~~~~~g~~~~l~t~dg~~~~~~~~~~~g~~~~~nf------------g~~~~~~~~~~~~r~~~-p~~Plm~~E~ 311 (323)
.+++..|+.... ..++-++|+.. +.....+.++-..| ++....+..-+.||+.+ .++|+++-||
T Consensus 175 ~i~~~~ra~~~~-~~Vi~qng~~l--~d~~~a~l~~~~~~~~~vE~~~~d~~~~~~~~~~~e~~Lr~l~~~G~~V~vieY 251 (300)
T COG2342 175 AIAEYARAANPL-FRVIPQNGAEL--FDADGAGLLPRLGFGVAVETVFYDDERPLESADTFEEYLRKLCRLGKPVYVIEY 251 (300)
T ss_pred HHHHHHHhcCCc-EEEEecccHhh--cCccccchhhccccceEEEEEEecCccCCCchhhHHHHHHHHHhcCCcEEEEEe
Confidence 899999885443 56666766642 22110111111111 22222233336778876 4789999998
No 121
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=74.03 E-value=12 Score=35.97 Aligned_cols=67 Identities=12% Similarity=0.181 Sum_probs=52.4
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCC--eeccc--ccchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPG--HYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G--~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
..+.-++.++++++.|+.+=.+.+-|.... ..+ .|+|+ ..-|..++++.+++.|++|++..=|+|+.
T Consensus 22 ~~~ev~~~~~~~~~~~iP~d~i~lD~~~~~-~~~~~~f~~d~~~FPdp~~mi~~L~~~G~kv~~~i~P~v~~ 92 (319)
T cd06591 22 TQEELLDVAKEYRKRGIPLDVIVQDWFYWP-KQGWGEWKFDPERFPDPKAMVRELHEMNAELMISIWPTFGP 92 (319)
T ss_pred CHHHHHHHHHHHHHhCCCccEEEEechhhc-CCCceeEEEChhhCCCHHHHHHHHHHCCCEEEEEecCCcCC
Confidence 456778999999999988877777765443 234 66665 34589999999999999999988788875
No 122
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=73.95 E-value=1.9 Score=45.97 Aligned_cols=49 Identities=18% Similarity=0.184 Sum_probs=44.0
Q ss_pred CCCCceeEEEEcCcccccccCCCCCeeEEEeccccccCCcceEEEEEEee
Q psy13819 1 MRRWSKGVLFANGRHWGRYWSEVGPQYSLFVPEEFLKVGTNRITIFELTR 50 (323)
Q Consensus 1 ~~~~~kg~~~vng~~lGryw~~~gpq~~lyvP~~~l~~g~N~i~i~e~~~ 50 (323)
++.|.+++=+.|+..||||| ..+++...+|++.++++|.+..++++..+
T Consensus 544 ~~~~~~~v~~~~~~~l~~~~-~~~~~~~~~v~~~~~~~g~~~~l~~~~~g 592 (673)
T COG1874 544 LKLWRATVDGEGGTVLARFR-EDGYAGGPAVTRRWYGGGKAYYLGFRTSG 592 (673)
T ss_pred cccceeeeeccCCeEEEEEe-ccCcCCCcccchhhhcCcceeEEEeccCc
Confidence 35677888888999999999 69999999999999999999999998765
No 123
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=72.80 E-value=5.2 Score=36.98 Aligned_cols=56 Identities=9% Similarity=0.031 Sum_probs=39.1
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCC----eecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPG----HYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G----~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.+++.++.++++|..+|.+.. .+..... .+.. -.+.+.++.+.|+++|+.+.+.+
T Consensus 90 ~~~~~~i~~a~~lGa~~i~~~~---~~~~~~~~~~~~~~~-~~~~l~~l~~~a~~~gv~l~iE~ 149 (275)
T PRK09856 90 DMIKLAMDMAKEMNAGYTLISA---AHAGYLTPPNVIWGR-LAENLSELCEYAENIGMDLILEP 149 (275)
T ss_pred HHHHHHHHHHHHhCCCEEEEcC---CCCCCCCCHHHHHHH-HHHHHHHHHHHHHHcCCEEEEec
Confidence 4667888999999999997732 2221111 1111 12468899999999999999987
No 124
>cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=72.39 E-value=32 Score=33.26 Aligned_cols=68 Identities=16% Similarity=0.160 Sum_probs=48.8
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEE----eCCCc---ccC---CCeec-ccccchHHHHHHHHHHCCCeEEEc
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYV----EWRSH---EVH---PGHYH-YDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv----~W~~h---EP~---~G~fd-f~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.+-|+ +|.+..++.|+.|....+|+...++ .|..- -|+ .|.+. +=-.+|+..+++-|+++|+.||..
T Consensus 10 ~aR~f--~~~~~ik~~Id~ma~~KlN~lh~HltDd~~~rle~~~~P~lt~~g~~~~~yT~~di~elv~yA~~rgI~vIPE 87 (311)
T cd06570 10 VSRHF--IPVAVIKRQLDAMASVKLNVFHWHLTDDQGFRIESKKYPKLQQKASDGLYYTQEQIREVVAYARDRGIRVVPE 87 (311)
T ss_pred cCCCC--cCHHHHHHHHHHHHHhCCeEEEEEEecCCCceeecCCCccccccCCCCCccCHHHHHHHHHHHHHcCCEEEEe
Confidence 34455 5689999999999999999999988 67532 122 22111 111356888899999999999976
Q ss_pred c
Q psy13819 164 P 164 (323)
Q Consensus 164 ~ 164 (323)
.
T Consensus 88 I 88 (311)
T cd06570 88 I 88 (311)
T ss_pred e
Confidence 4
No 125
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=72.32 E-value=14 Score=35.51 Aligned_cols=68 Identities=9% Similarity=0.025 Sum_probs=50.4
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccC---CCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVH---PGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~---~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
.+.-.+.++.+++.|+.+=-+.+-+....-. ...|+|. ..-|..++++..++.|+++++..=|+|+..
T Consensus 28 q~~v~~~~~~~r~~~iP~d~i~ld~~~~~~~~~~~~~f~~d~~~FPdp~~mi~~L~~~g~k~~~~i~P~i~~~ 100 (317)
T cd06599 28 QEALLEFIDKCREHDIPCDSFHLSSGYTSIEGGKRYVFNWNKDRFPDPAAFVAKFHERGIRLAPNIKPGLLQD 100 (317)
T ss_pred HHHHHHHHHHHHHcCCCeeEEEEeccccccCCCceeeeecCcccCCCHHHHHHHHHHCCCEEEEEeCCcccCC
Confidence 4567888999999999877776654333221 2245553 345899999999999999999998999763
No 126
>PF01791 DeoC: DeoC/LacD family aldolase; InterPro: IPR002915 This family includes the enzyme deoxyribose-phosphate aldolase, which is involved in nucleotide metabolism. 2-deoxy-D-ribose 5-phosphate = D-glyceraldehyde 3-phosphate + acetaldehyde The family also includes a group of related bacterial proteins of unknown function, see examples Q57843 from SWISSPROT and P76143 from SWISSPROT.; GO: 0016829 lyase activity; PDB: 2A4A_A 1VCV_B 1P1X_A 1KTN_B 1JCL_A 1JCJ_A 1MZH_A 3GKF_D 3GLC_L 3GND_N ....
Probab=72.23 E-value=1.6 Score=40.05 Aligned_cols=54 Identities=9% Similarity=0.108 Sum_probs=42.3
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
-...+.+.++|.+.|.+.+.|....+..-.+ -.+++.++.+.|++.||+||+.+
T Consensus 79 ~~~ve~A~~~GAd~vd~vi~~~~~~~~~~~~---~~~~i~~v~~~~~~~gl~vIlE~ 132 (236)
T PF01791_consen 79 VAEVEEAIRLGADEVDVVINYGALGSGNEDE---VIEEIAAVVEECHKYGLKVILEP 132 (236)
T ss_dssp HHHHHHHHHTT-SEEEEEEEHHHHHTTHHHH---HHHHHHHHHHHHHTSEEEEEEEE
T ss_pred HHHHHHHHHcCCceeeeeccccccccccHHH---HHHHHHHHHHHHhcCCcEEEEEE
Confidence 5678899999999999999997665543122 23568899999999999999984
No 127
>PLN02877 alpha-amylase/limit dextrinase
Probab=71.99 E-value=7.1 Score=43.41 Aligned_cols=21 Identities=14% Similarity=0.170 Sum_probs=18.7
Q ss_pred chHHHHHHHHHHCCCeEEEcc
Q psy13819 144 RDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+++++++.|+++||.||+..
T Consensus 466 ~efk~mV~~lH~~GI~VImDV 486 (970)
T PLN02877 466 IEFRKMVQALNRIGLRVVLDV 486 (970)
T ss_pred HHHHHHHHHHHHCCCEEEEEE
Confidence 369999999999999999974
No 128
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=71.37 E-value=10 Score=35.41 Aligned_cols=65 Identities=11% Similarity=0.200 Sum_probs=51.5
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCee--ccc--ccchHHHHHHHHHHCCCeEEEccCCcc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHY--HYD--GHRDIEHFLQLAVEEDLYILLRPGPFI 168 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~f--df~--g~~dl~~fl~~a~e~GL~vilr~GPyi 168 (323)
..+...+.++.+++.|+.+=.+.+-+..... -+.| +|+ ..-|..++++.+++.|++|++..-|+|
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~~~lD~~~~~~-~~~f~~~~d~~~Fpdp~~~i~~l~~~g~~~~~~~~P~v 90 (265)
T cd06589 22 DQDKVLEVIDGMRENDIPLDGFVLDDDYTDG-YGDFTFDWDAGKFPNPKSMIDELHDNGVKLVLWIDPYI 90 (265)
T ss_pred CHHHHHHHHHHHHHcCCCccEEEECcccccC-CceeeeecChhhCCCHHHHHHHHHHCCCEEEEEeChhH
Confidence 4667889999999999887777666665554 3566 443 456899999999999999999988887
No 129
>TIGR00677 fadh2_euk methylenetetrahydrofolate reductase, eukaryotic type. This protein is an FAD-containing flavoprotein.
Probab=71.32 E-value=35 Score=32.46 Aligned_cols=108 Identities=12% Similarity=0.107 Sum_probs=63.6
Q ss_pred EeeeeecCCCCCh-hcHHHHHHHHH---HhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCe--EEEccC
Q psy13819 92 FVSGSFHYFRAPP-GRWCWIMRAMR---AAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLY--ILLRPG 165 (323)
Q Consensus 92 i~~g~~Hy~r~p~-~~W~~~l~~mk---~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~--vilr~G 165 (323)
.+++..|+..-|. ...+.+++.|| ++|.+.+-|-++ ||.+ .+.+|++.|++.|+. ||..+-
T Consensus 131 ~igva~~Pe~Hp~~~~~~~d~~~L~~Ki~aGA~f~iTQ~~----------Fd~~---~~~~f~~~~~~~gi~~PIi~GI~ 197 (281)
T TIGR00677 131 CIGVAGYPEGHPEAESVELDLKYLKEKVDAGADFIITQLF----------YDVD---NFLKFVNDCRAIGIDCPIVPGIM 197 (281)
T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHcCCCEeeccce----------ecHH---HHHHHHHHHHHcCCCCCEEeecc
Confidence 4566666544321 22333344333 599999888322 3333 378999999999765 444444
Q ss_pred Cccc-------cccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 166 PFIC-------GKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 166 Pyi~-------aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
|... .++..--+|.|+.+..-. ...+++...+..-++..++++.+..
T Consensus 198 pi~s~~~~~~~~~~~Gi~vP~~l~~~l~~--~~~~~~~~~~~gi~~a~~~~~~l~~ 251 (281)
T TIGR00677 198 PINNYASFLRRAKWSKTKIPQEIMSRLEP--IKDDDEAVRDYGIELIVEMCQKLLA 251 (281)
T ss_pred ccCCHHHHHHHHhcCCCCCCHHHHHHHHh--ccCCHHHHHHHHHHHHHHHHHHHHH
Confidence 4311 467767789998763100 1123355566777777777777654
No 130
>PRK09875 putative hydrolase; Provisional
Probab=71.32 E-value=84 Score=30.13 Aligned_cols=88 Identities=15% Similarity=0.190 Sum_probs=55.9
Q ss_pred EEecCCeEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHH
Q psy13819 76 IDLAGDTFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVE 155 (323)
Q Consensus 76 v~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e 155 (323)
+++-+.++.++..++.- .....--..+.-.+.|+.+|++|.+|| + .+. ....-+|...+.+++++
T Consensus 8 ~tl~HEHl~~~~~~~~~---~~~~~l~~~~~~~~el~~~~~~Gg~ti-V--d~T---------~~g~GRd~~~l~~is~~ 72 (292)
T PRK09875 8 YTLAHEHLHIDLSGFKN---NVDCRLDQYAFICQEMNDLMTRGVRNV-I--EMT---------NRYMGRNAQFMLDVMRE 72 (292)
T ss_pred cceecCCeEecChhhcC---CcccccccHHHHHHHHHHHHHhCCCeE-E--ecC---------CCccCcCHHHHHHHHHH
Confidence 34456677666432221 111111124566778999999999887 2 221 11224689999999999
Q ss_pred CCCeEEEccCCccccccCCCCCcccccc
Q psy13819 156 EDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 156 ~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
-|+.+|..+|-|. ..-+|.|+..
T Consensus 73 tgv~Iv~~TG~y~-----~~~~p~~~~~ 95 (292)
T PRK09875 73 TGINVVACTGYYQ-----DAFFPEHVAT 95 (292)
T ss_pred hCCcEEEcCcCCC-----CccCCHHHhc
Confidence 9999999999773 2236788765
No 131
>PRK08227 autoinducer 2 aldolase; Validated
Probab=69.26 E-value=50 Score=31.32 Aligned_cols=63 Identities=11% Similarity=0.054 Sum_probs=44.7
Q ss_pred eeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 94 SGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 94 ~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
++..++...+.+..--.++.+-++|..+|.+.+++.. ++.-+-+.++.+..+.|++.||-+|+
T Consensus 83 ~~t~~~~~~~~~~l~~sVeeAvrlGAdAV~~~v~~Gs------~~E~~~l~~l~~v~~ea~~~G~Plla 145 (264)
T PRK08227 83 GGNSILKELSNEAVAVDMEDAVRLNACAVAAQVFIGS------EYEHQSIKNIIQLVDAGLRYGMPVMA 145 (264)
T ss_pred CCCCCCCCCCcccceecHHHHHHCCCCEEEEEEecCC------HHHHHHHHHHHHHHHHHHHhCCcEEE
Confidence 3344443333344444577788999999999999972 22234567899999999999999886
No 132
>PLN02784 alpha-amylase
Probab=69.06 E-value=9.9 Score=41.77 Aligned_cols=57 Identities=7% Similarity=-0.067 Sum_probs=40.5
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecc-------cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHY-------DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf-------~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.+.-++++|+++|-+.-+-....+. -..+|| .-.++|..+++.|+++||+||+..
T Consensus 524 ~ekldyL~~LG~taIWLpP~~~s~s~~GY~p~D~y~lds~yGT~~ELk~LI~a~H~~GIkVIlDi 588 (894)
T PLN02784 524 GEKAAELSSLGFTVVWLPPPTESVSPEGYMPKDLYNLNSRYGTIDELKDLVKSFHEVGIKVLGDA 588 (894)
T ss_pred HHHHHHHHHhCCCEEEeCCCCCCCCCCCcCcccccccCcCcCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 578889999999999994443332221 012332 224789999999999999999873
No 133
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=68.74 E-value=13 Score=36.25 Aligned_cols=68 Identities=13% Similarity=0.115 Sum_probs=53.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc--cchH--HHHHHHHHHCCCeEEEccCCccccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG--HRDI--EHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g--~~dl--~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
+.+..++.++.+++.|+.+=-+.+-+..+.. -+.|+|+. .-|. .++++.+++.|++|++..-|+|+-+
T Consensus 22 ~~~~v~~~~~~~r~~~iP~d~i~lD~~~~~~-~~~f~~d~~~FPdp~~~~mi~~L~~~G~k~~~~i~P~v~~~ 93 (339)
T cd06602 22 NVDEVKEVVENMRAAGIPLDVQWNDIDYMDR-RRDFTLDPVRFPGLKMPEFVDELHANGQHYVPILDPAISAN 93 (339)
T ss_pred CHHHHHHHHHHHHHhCCCcceEEECcccccC-ccceecccccCCCccHHHHHHHHHHCCCEEEEEEeCccccC
Confidence 3567889999999999988777666555543 36666654 4577 9999999999999999888998853
No 134
>PF05089 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain; InterPro: IPR024733 Alpha-N-acetylglucosaminidase is a lysosomal enzyme that is required for the stepwise degradation of heparan sulphate []. Mutations on the alpha-N-acetylglucosaminidase gene can lead to mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B), characterised by neurological dysfunction but relatively mild somatic manifestations []. Alpha-N-acetylglucosaminidase is composed of three domains. This entry represents the central domain, which has a tim barrel fold [].; PDB: 4A4A_A 2VC9_A 2VCC_A 2VCB_A 2VCA_A.
Probab=68.68 E-value=21 Score=34.98 Aligned_cols=127 Identities=20% Similarity=0.301 Sum_probs=68.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEE------------------------------eCCCcccCCCeecccc---------c
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYV------------------------------EWRSHEVHPGHYHYDG---------H 143 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv------------------------------~W~~hEP~~G~fdf~g---------~ 143 (323)
.=++|++.|+-|.-.|+|..-..+ +|.+..--.| |.| .
T Consensus 17 dW~rWEreIDWMALnGiNl~La~~GqEavw~~v~~~~G~t~~ei~~ff~GPA~laW~rMgNl~g---wgGPLp~~w~~~q 93 (333)
T PF05089_consen 17 DWERWEREIDWMALNGINLPLAIVGQEAVWQRVLRELGLTDEEIREFFTGPAFLAWWRMGNLQG---WGGPLPQSWIDQQ 93 (333)
T ss_dssp -HHHHHHHHHHHHHTT--EEE--TTHHHHHHHHHGGGT--HHHHHHHS--TT-HHHHHTTS--S---TT----TTHHHHH
T ss_pred CHHHHHHHHHHHHHhCCchhhhhhHHHHHHHHHHHHcCCCHHHHHHHcCCHHHHHHHHhCCccc---CCCCCCHHHHHHH
Confidence 347899999999999999875432 3333322222 222 1
Q ss_pred chH-HHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCee---------------ecCCCHhHHHHHHHHHHH
Q psy13819 144 RDI-EHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDIL---------------LRQNHPVYQRYVTRWFQE 207 (323)
Q Consensus 144 ~dl-~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~---------------~R~~~~~f~~~~~~~~~~ 207 (323)
.+| .++++..++.||..||-- -.|-+|.=+.+++|+.. +.+.+|.|.+-.+.|+++
T Consensus 94 ~~Lq~kIl~RmreLGm~PVLPa--------F~G~VP~~~~~~~P~a~i~~~~~W~~f~~~~~L~P~dplF~~i~~~F~~~ 165 (333)
T PF05089_consen 94 AELQKKILDRMRELGMTPVLPA--------FAGHVPRAFKRKYPNANITRQGNWNGFCRPYFLDPTDPLFAEIAKLFYEE 165 (333)
T ss_dssp HHHHHHHHHHHHHHT-EEEEE----------S-EE-TTHHHHSTT--EE---EETTEE--EEE-SS--HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCcccCCC--------cCCCCChHHHhcCCCCEEeeCCCcCCCCCCceeCCCCchHHHHHHHHHHH
Confidence 122 456888999999988843 13446777766666643 236799999999999999
Q ss_pred HHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHH
Q psy13819 208 LFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLR 247 (323)
Q Consensus 208 l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~ 247 (323)
..+. |+ ...+.+...=||-+.-.. +.+|+..+.
T Consensus 166 q~~~-----yG-~~~~Y~~D~FnE~~p~~~-~~~~l~~~s 198 (333)
T PF05089_consen 166 QIKL-----YG-TDHIYAADPFNEGGPPSG-DPEYLANVS 198 (333)
T ss_dssp HHHH-----H----SEEE--TTTTS---TT-S---HHHHH
T ss_pred HHHh-----cC-CCceeCCCccCCCCCCCC-chHHHHHHH
Confidence 8888 76 567888898999875432 334444433
No 135
>COG1523 PulA Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]
Probab=68.59 E-value=8.2 Score=41.51 Aligned_cols=55 Identities=16% Similarity=0.247 Sum_probs=40.5
Q ss_pred HHHHHHHhCCCEEEE-EEeCCCcccCC---C-----------------eecccc-----cchHHHHHHHHHHCCCeEEEc
Q psy13819 110 IMRAMRAAGLNALST-YVEWRSHEVHP---G-----------------HYHYDG-----HRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~-yv~W~~hEP~~---G-----------------~fdf~g-----~~dl~~fl~~a~e~GL~vilr 163 (323)
.|+-+|++|+++|++ +|+.-..++.. | .|--+. .+.+..+++.++++||-|||.
T Consensus 205 ~i~yLk~LGvtaVeLLPV~~~~~~~~l~~~gl~n~WGYdP~~fFAp~~~Yss~p~p~~~i~EfK~mV~~lHkaGI~VILD 284 (697)
T COG1523 205 IIDYLKDLGVTAVELLPVFDFYDEPHLDKSGLNNNWGYDPLNFFAPEGRYASNPEPATRIKEFKDMVKALHKAGIEVILD 284 (697)
T ss_pred HHHHHHHhCCceEEEecceEEeccccccccccccccCCCcccccCCCccccCCCCcchHHHHHHHHHHHHHHcCCEEEEE
Confidence 499999999999999 66654444443 2 222222 357888999999999999997
Q ss_pred c
Q psy13819 164 P 164 (323)
Q Consensus 164 ~ 164 (323)
.
T Consensus 285 V 285 (697)
T COG1523 285 V 285 (697)
T ss_pred E
Confidence 3
No 136
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=67.69 E-value=22 Score=34.20 Aligned_cols=69 Identities=10% Similarity=0.132 Sum_probs=51.4
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCccc-----CCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEV-----HPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP-----~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
..+...+.++.+++.|+-+=-+.+.+..+.. .-|.|+|+ ..-|..++++.++++|++|++..=|+|+.+
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~w~~~~~~~~~~~~f~wd~~~FPdp~~mi~~L~~~G~k~~~~v~P~v~~~ 97 (317)
T cd06598 22 NWQEVDDTIKTLREKDFPLDAAILDLYWFGKDIDKGHMGNLDWDRKAFPDPAGMIADLAKKGVKTIVITEPFVLKN 97 (317)
T ss_pred CHHHHHHHHHHHHHhCCCceEEEEechhhcCcccCCceeeeEeccccCCCHHHHHHHHHHcCCcEEEEEcCcccCC
Confidence 3567789999999999876666655443332 23455554 345889999999999999999888898754
No 137
>TIGR02455 TreS_stutzeri trehalose synthase, Pseudomonas stutzeri type. Trehalose synthase catalyzes a one-step conversion of maltose to trehalose. This is an alternative to the OtsAB and TreYZ pathways. This family includes a characterized example from Pseudomonas stutzeri plus very closely related sequences from other Pseudomonads. Cutoff scores are set to find a more distantly related sequence from Desulfovibrio vulgaris, likely to be functionally equivalent, between trusted and noise limits.
Probab=67.23 E-value=16 Score=38.96 Aligned_cols=72 Identities=10% Similarity=0.198 Sum_probs=50.0
Q ss_pred HHHHHHHhCCCEEEE---EEe-----CCCcccCCCeecccc---------cchHHHHHHHHHHCCCeEEEccC-------
Q psy13819 110 IMRAMRAAGLNALST---YVE-----WRSHEVHPGHYHYDG---------HRDIEHFLQLAVEEDLYILLRPG------- 165 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~---yv~-----W~~hEP~~G~fdf~g---------~~dl~~fl~~a~e~GL~vilr~G------- 165 (323)
-.+-++++|+++|-+ |-+ |...--..|-||-+. .+|++++++.|+++||+||+..=
T Consensus 79 ~wdyL~~LGV~~iwl~P~~~SGgi~g~~~tP~~D~gyDi~d~~Idp~~GT~eDf~~L~~~Ah~~G~~vi~DlVpnHTs~g 158 (688)
T TIGR02455 79 LWKALSEIGVQGIHNGPIKLSGGIRGREFTPSIDGNFDRISFDIDPLLGSEEELIQLSRMAAAHNAITIDDIIPAHTGKG 158 (688)
T ss_pred HHHHHHHhCCCEEEeCcceecccccccCCCCCCCCCCCcccCccCcccCCHHHHHHHHHHHHHCCCEEEEEeCCCCCCCC
Confidence 345688999999988 333 222222245566332 46999999999999999998742
Q ss_pred -CccccccCCCCCcccc
Q psy13819 166 -PFICGKRDFGGFPPWL 181 (323)
Q Consensus 166 -Pyi~aEw~~gg~P~Wl 181 (323)
||.-|+.+.+.+|.|.
T Consensus 159 hdF~lAr~~~~~Y~g~Y 175 (688)
T TIGR02455 159 ADFRLAELAHGDYPGLY 175 (688)
T ss_pred cchHHHhhcCCCCCCce
Confidence 3666777777777775
No 138
>TIGR02631 xylA_Arthro xylose isomerase, Arthrobacter type. This model describes a D-xylose isomerase that is also active as a D-glucose isomerase. It is tetrameric and dependent on a divalent cation Mg2+, Co2+ or Mn2+ as characterized in Arthrobacter. Members of this family differ substantially from the D-xylose isomerases of family TIGR02630.
Probab=66.78 E-value=36 Score=33.85 Aligned_cols=53 Identities=8% Similarity=0.060 Sum_probs=36.7
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEE----EeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTY----VEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~y----v~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
|+....+.+++++++|+..|++. ++|..-.. +-..+++++-++++++||.|..
T Consensus 30 ~~~~~~e~i~~la~~GfdgVE~~~~dl~P~~~~~~-------e~~~~~~~lk~~L~~~GL~v~~ 86 (382)
T TIGR02631 30 TALDPVEAVHKLAELGAYGVTFHDDDLIPFGAPPQ-------ERDQIVRRFKKALDETGLKVPM 86 (382)
T ss_pred CCcCHHHHHHHHHHhCCCEEEecccccCCCCCChh-------HHHHHHHHHHHHHHHhCCeEEE
Confidence 44567899999999999999873 22211110 0123467889999999999764
No 139
>COG1306 Uncharacterized conserved protein [Function unknown]
Probab=66.73 E-value=13 Score=36.18 Aligned_cols=56 Identities=23% Similarity=0.361 Sum_probs=41.7
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc-------------cchHHHHHHHHHHCCCeEEEcc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG-------------HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g-------------~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+..-...++.+|..|+|++-+.+ --.-|.-.|.- ..|+..+|+.|+|.||++|.|.
T Consensus 76 kk~~de~fk~ikdn~~Na~ViD~-----Kdd~G~lty~s~d~~~~~~~sv~~f~Di~~~iKkaKe~giY~IARi 144 (400)
T COG1306 76 KKRLDELFKLIKDNNINAFVIDV-----KDDYGELTYPSSDEINKYTKSVNKFKDIEPVIKKAKENGIYAIARI 144 (400)
T ss_pred hhHHHHHHHHHHhCCCCEEEEEe-----cCCCccEeccccchhhhhhhccccccccHHHHHHHHhcCeEEEEEE
Confidence 45667889999999999998743 22233333322 3478999999999999999884
No 140
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=66.14 E-value=22 Score=25.65 Aligned_cols=46 Identities=15% Similarity=0.193 Sum_probs=35.2
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..++-++.+++.|+++|-+- -|. ++.+ ..++.+.+++.||.||...
T Consensus 16 ~~~~~~~~a~~~g~~~v~iT----Dh~------~~~~---~~~~~~~~~~~gi~~i~G~ 61 (67)
T smart00481 16 SPEELVKRAKELGLKAIAIT----DHG------NLFG---AVEFYKAAKKAGIKPIIGL 61 (67)
T ss_pred CHHHHHHHHHHcCCCEEEEe----eCC------cccC---HHHHHHHHHHcCCeEEEEE
Confidence 45788999999999999983 332 3333 6678899999999988643
No 141
>KOG0259|consensus
Probab=65.95 E-value=11 Score=37.81 Aligned_cols=58 Identities=14% Similarity=0.206 Sum_probs=50.1
Q ss_pred CChhcHHHHHHHHHHh-CCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 102 APPGRWCWIMRAMRAA-GLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~-G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.|.+.|+-||.-++++ -=||+.+-|- |=+.|--++|+..- |.+++++|+++||.||..
T Consensus 180 LPe~~weIDL~~veal~DENT~Aivvi-NP~NPcGnVys~~H---L~kiae~A~klgi~vIaD 238 (447)
T KOG0259|consen 180 LPEKDWEIDLDGVEALADENTVAIVVI-NPNNPCGNVYSEDH---LKKIAETAKKLGIMVIAD 238 (447)
T ss_pred cCcccceechHHHHHhhccCeeEEEEe-CCCCCCcccccHHH---HHHHHHHHHHhCCeEEeh
Confidence 5778999999999996 8899999665 77888888988764 888899999999999975
No 142
>COG2159 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=65.83 E-value=41 Score=32.16 Aligned_cols=69 Identities=16% Similarity=0.041 Sum_probs=41.6
Q ss_pred eEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCc
Q psy13819 90 FQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPF 167 (323)
Q Consensus 90 ~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPy 167 (323)
-++++...-.++.+...-++.-+..++.|+--+.+ ......+.- ...+..+++.|+++|+-|.+..|+.
T Consensus 98 drf~~~~~v~p~~~~~a~~E~er~v~~~gf~g~~l~p~~~~~~~~---------~~~~~pi~~~a~~~gvpv~ihtG~~ 167 (293)
T COG2159 98 DRFVGFARVDPRDPEAAAEELERRVRELGFVGVKLHPVAQGFYPD---------DPRLYPIYEAAEELGVPVVIHTGAG 167 (293)
T ss_pred cceeeeeeeCCCchHHHHHHHHHHHHhcCceEEEecccccCCCCC---------ChHHHHHHHHHHHcCCCEEEEeCCC
Confidence 33444444444555223344444556689999888 322222111 1227889999999999999988753
No 143
>PF00728 Glyco_hydro_20: Glycosyl hydrolase family 20, catalytic domain; InterPro: IPR015883 Glycoside hydrolase family 20 GH20 from CAZY comprises enzymes with several known activities; beta-hexosaminidase (3.2.1.52 from EC); lacto-N-biosidase (3.2.1.140 from EC). Carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base in this family of enzymes. In the brain and other tissues, beta-hexosaminidase A degrades GM2 gangliosides; specifically, the enzyme hydrolyses terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides. There are 3 forms of beta-hexosaminidase: hexosaminidase A is a trimer, with one alpha, one beta-A and one beta-B chain; hexosaminidase B is a tetramer of two beta-A and two beta-B chains; and hexosaminidase S is a homodimer of alpha chains. The two beta chains are derived from the cleavage of a precursor. Mutations in the beta-chain lead to Sandhoff disease, a lysosomal storage disorder characterised by accumulation of GM2 ganglioside [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3RPM_A 1C7T_A 1QBA_A 1QBB_A 1C7S_A 3RCN_A 2YL8_A 2YL6_A 2YLL_A 2YL5_C ....
Probab=65.51 E-value=24 Score=33.79 Aligned_cols=63 Identities=16% Similarity=0.118 Sum_probs=42.7
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCC----Ccc---c---CCCeecc------cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWR----SHE---V---HPGHYHY------DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~----~hE---P---~~G~fdf------~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.|.+..++.|+.|...++|++..++.=+ .-- | +.|.+.- =-.+|+..+++.|+++||.||...
T Consensus 15 ~~~~~ik~~id~ma~~k~N~lhlhl~D~~~~~~~~~~~p~l~~~ga~~~~~~~~~yT~~di~~lv~yA~~~gI~VIPei 93 (351)
T PF00728_consen 15 FSVDTIKRLIDQMAYYKLNVLHLHLSDDQGFRLESKSYPELTEKGAYRPSDAGGYYTKEDIRELVAYAKERGIEVIPEI 93 (351)
T ss_dssp B-HHHHHHHHHHHHHTT-SEEEEEEESSTCB-BEBSTSTHHHHTTTESTTCTESEBEHHHHHHHHHHHHHTT-EEEEEE
T ss_pred CCHHHHHHHHHHHHHcCCcEEEEEEecCCCCccccCCCccccccCccccccccccCCHHHHHHHHHHHHHcCCceeeec
Confidence 4578899999999999999999988522 111 1 1232221 113579999999999999999753
No 144
>smart00518 AP2Ec AP endonuclease family 2. These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites
Probab=65.23 E-value=1.1e+02 Score=28.00 Aligned_cols=55 Identities=9% Similarity=0.141 Sum_probs=36.8
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCee-cccccchHHHHHHHHHHCCCeEEEccCCc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHY-HYDGHRDIEHFLQLAVEEDLYILLRPGPF 167 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~f-df~g~~dl~~fl~~a~e~GL~vilr~GPy 167 (323)
-.+.++.+.++|++.|++. ..+|..-.- +++ ..+++++-+.++++||.+.+- +||
T Consensus 12 ~~~~~~~~~~~G~~~vel~----~~~~~~~~~~~~~-~~~~~~l~~~~~~~gl~ls~h-~p~ 67 (273)
T smart00518 12 LYKAFIEAVDIGARSFQLF----LGNPRSWKGVRLS-EETAEKFKEALKENNIDVSVH-APY 67 (273)
T ss_pred HhHHHHHHHHcCCCEEEEE----CCCCCCCCCCCCC-HHHHHHHHHHHHHcCCCEEEE-CCc
Confidence 3578999999999999994 445532110 222 235778888899999986552 344
No 145
>PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=64.70 E-value=10 Score=35.13 Aligned_cols=59 Identities=7% Similarity=-0.069 Sum_probs=38.5
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc-cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY-DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf-~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.+++.++.++++|++.|++.-.-...++.... .+ ...+.+.++.++|+++|+.+.+.+
T Consensus 94 ~~~~~~i~~a~~lG~~~v~~~~~~~~~~~~~~~-~~~~~~~~l~~l~~~a~~~gv~l~lE~ 153 (284)
T PRK13210 94 EIMKKAIRLAQDLGIRTIQLAGYDVYYEEKSEE-TRQRFIEGLAWAVEQAAAAQVMLAVEI 153 (284)
T ss_pred HHHHHHHHHHHHhCCCEEEECCcccccccccHH-HHHHHHHHHHHHHHHHHHhCCEEEEEe
Confidence 456888999999999999872100000111100 11 112468888999999999999988
No 146
>PF01120 Alpha_L_fucos: Alpha-L-fucosidase; InterPro: IPR000933 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Family 29 (GH29 from CAZY) encompasses alpha-L-fucosidases (3.2.1.51 from EC) [], which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Alpha-L-fucosidase is responsible for hydrolysing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Fucosylated glycoconjugates are involved in numerous biological events, making alpha-l-fucosidases, the enzymes responsible for their processing, critically important. Deficiency in alpha-l-fucosidase activity is associated with fucosidosis, a lysosomal storage disorder characterised by rapid neurodegeneration, resulting in severe mental and motor deterioration []. The enzyme is a hexamer and displays a two-domain fold, composed of a catalytic (beta/alpha)(8)-like domain and a C-terminal beta-sandwich domain []. Drosophila melanogaster spermatozoa contains an alpha-l-fucosidase that might be involved in fertilisation by interacting with alpha-l-fucose residues on the micropyle of the eggshell []. In human sperm, membrane-associated alpha-l-fucosidase is stable for extended periods of time, which is made possible by membrane domains and compartmentalisation. These help preserve protein integrity []. ; GO: 0004560 alpha-L-fucosidase activity, 0005975 carbohydrate metabolic process; PDB: 3EYP_B 2ZX6_A 2ZWY_B 2ZX8_B 2WSP_A 2ZXA_A 2ZWZ_B 1ODU_B 1HL9_A 2ZX5_B ....
Probab=64.36 E-value=1.4e+02 Score=29.12 Aligned_cols=140 Identities=14% Similarity=0.161 Sum_probs=66.2
Q ss_pred HHHHHHHHhCCCEEEE-------EEeCCCcccCCCeeccc-ccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCccc
Q psy13819 109 WIMRAMRAAGLNALST-------YVEWRSHEVHPGHYHYD-GHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPW 180 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~-------yv~W~~hEP~~G~fdf~-g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~W 180 (323)
+-++.+|++|+.-|=+ +-.|+.-...-..-+-. +.|=+..|.+.|+++||++-+=.+|. +|.+...+.-
T Consensus 95 qW~~~ak~aGakY~VlTakHHDGF~LW~S~~t~~~v~~~~~krDiv~El~~A~rk~Glk~G~Y~S~~---dw~~~~~~~~ 171 (346)
T PF01120_consen 95 QWAKLAKDAGAKYVVLTAKHHDGFCLWPSKYTDYNVVNSGPKRDIVGELADACRKYGLKFGLYYSPW---DWHHPDYPPD 171 (346)
T ss_dssp HHHHHHHHTT-SEEEEEEE-TT--BSS--TT-SSBGGGGGGTS-HHHHHHHHHHHTT-EEEEEEESS---SCCCTTTTSS
T ss_pred HHHHHHHHcCCCEEEeehhhcCccccCCCCCCcccccCCCCCCCHHHHHHHHHHHcCCeEEEEecch---HhcCcccCCC
Confidence 3478899999998765 44565543332222221 23557788999999999988844322 5554333322
Q ss_pred ccccCCCeeecCCCHhHHHHHH-HHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceE
Q psy13819 181 LLKVAPDILLRQNHPVYQRYVT-RWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAV 259 (323)
Q Consensus 181 l~~~~p~~~~R~~~~~f~~~~~-~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~ 259 (323)
.....+. .....+.+.+++. .+..+|.+.+.. | .+-+|=+...... ....--...+.+++++. +.+++
T Consensus 172 ~~~~~~~--~~~~~~~~~~~~~~~~~~ql~EL~~~--Y--~~d~lWfDg~~~~----~~~~~~~~~~~~~i~~~-qp~~i 240 (346)
T PF01120_consen 172 EEGDENG--PADGPGNWQRYYNEYWLAQLRELLTR--Y--KPDILWFDGGWPD----PDEDWDSAELYNWIRKL-QPDVI 240 (346)
T ss_dssp CHCHHCC----HCCHHHHHHHHHHHHHHHHHHHHC--S--TESEEEEESTTSC----CCTHHHHHHHHHHHHHH-STTSE
T ss_pred ccCCccc--ccccchhhHhHhhhhhHHHHHHHHhC--C--CcceEEecCCCCc----cccccCHHHHHHHHHHh-CCeEE
Confidence 1110000 0112334444444 333333333332 3 3445544533332 11222236777788774 55555
Q ss_pred EEe
Q psy13819 260 LYS 262 (323)
Q Consensus 260 l~t 262 (323)
+..
T Consensus 241 i~~ 243 (346)
T PF01120_consen 241 INN 243 (346)
T ss_dssp EEC
T ss_pred Eec
Confidence 544
No 147
>TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA.
Probab=64.29 E-value=8.9 Score=35.09 Aligned_cols=60 Identities=15% Similarity=0.104 Sum_probs=39.5
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.+++-++.++++|..+|++......-++..-+..-...+.+.++.+.|++.|+.+.+.+
T Consensus 84 ~~~~~~i~~a~~lg~~~i~~~~g~~~~~~~~~~~~~~~~~~l~~l~~~A~~~gi~l~lE~ 143 (254)
T TIGR03234 84 EGVALAIAYARALGCPQVNCLAGKRPAGVSPEEARATLVENLRYAADALDRIGLTLLIEP 143 (254)
T ss_pred HHHHHHHHHHHHhCCCEEEECcCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCCEEEEEE
Confidence 567788999999999999873321100000001111123568888999999999999987
No 148
>PRK14040 oxaloacetate decarboxylase; Provisional
Probab=64.26 E-value=1.8e+02 Score=30.79 Aligned_cols=54 Identities=9% Similarity=0.063 Sum_probs=44.9
Q ss_pred ecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 97 FHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 97 ~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
+-|.|.|.+.-+.+++++.+.|+..||+..+.|.. +++...++.|+++|+.+..
T Consensus 89 vg~~~ypddvv~~~v~~a~~~Gid~~rifd~lnd~------------~~~~~ai~~ak~~G~~~~~ 142 (593)
T PRK14040 89 LGYRHYADDVVERFVERAVKNGMDVFRVFDAMNDP------------RNLETALKAVRKVGAHAQG 142 (593)
T ss_pred eccccCcHHHHHHHHHHHHhcCCCEEEEeeeCCcH------------HHHHHHHHHHHHcCCeEEE
Confidence 44677788888999999999999999998766653 4688899999999998643
No 149
>PRK12858 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=64.10 E-value=9 Score=37.62 Aligned_cols=66 Identities=21% Similarity=0.265 Sum_probs=46.6
Q ss_pred eecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 96 SFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 96 ~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.++ |.|...-.-..+.++++|.++|.+.++|.-.++. .-+-.-..++.++.+.|++.||-+++.+
T Consensus 98 t~~g-r~~~~~~~~sve~a~~~GAdAVk~lv~~~~d~~~--~~~~~~~~~l~rv~~ec~~~giPlllE~ 163 (340)
T PRK12858 98 TAPG-RLPDLLDNWSVRRIKEAGADAVKLLLYYRPDEDD--AINDRKHAFVERVGAECRANDIPFFLEP 163 (340)
T ss_pred CCCC-CCccccccccHHHHHHcCCCEEEEEEEeCCCcch--HHHHHHHHHHHHHHHHHHHcCCceEEEE
Confidence 3444 5554444445778999999999999999854221 1112345569999999999999998864
No 150
>PRK08673 3-deoxy-7-phosphoheptulonate synthase; Reviewed
Probab=63.63 E-value=27 Score=34.29 Aligned_cols=75 Identities=16% Similarity=0.170 Sum_probs=55.7
Q ss_pred eEEeCCeeeEEeeeeecCCCC--ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc--cchHHHHHHHHHHCC
Q psy13819 82 TFRLNEDPFQFVSGSFHYFRA--PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG--HRDIEHFLQLAVEED 157 (323)
Q Consensus 82 ~~~~dGk~~~i~~g~~Hy~r~--p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g--~~dl~~fl~~a~e~G 157 (323)
...+.|+++.++.| +.. .++.-.+-.+.+|++|.+.+|.+++= |+---|.|.| ..-|.-+.+.|++.|
T Consensus 86 ~~~ig~~~~~~IAG----PCsiEs~e~~~~~A~~lk~~ga~~~r~~~fK----pRTsp~sf~G~g~~gL~~L~~~~~~~G 157 (335)
T PRK08673 86 DVEIGGGKPVVIAG----PCSVESEEQILEIARAVKEAGAQILRGGAFK----PRTSPYSFQGLGEEGLKLLAEAREETG 157 (335)
T ss_pred CEEECCCceEEEEe----cCccCCHHHHHHHHHHHHHhchhhccCcEec----CCCCCcccccccHHHHHHHHHHHHHcC
Confidence 35666778888888 221 25677778888899999999998884 4433467765 456777778899999
Q ss_pred CeEEEcc
Q psy13819 158 LYILLRP 164 (323)
Q Consensus 158 L~vilr~ 164 (323)
|.++..+
T Consensus 158 l~v~tev 164 (335)
T PRK08673 158 LPIVTEV 164 (335)
T ss_pred CcEEEee
Confidence 9999876
No 151
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=63.59 E-value=19 Score=34.70 Aligned_cols=67 Identities=9% Similarity=0.158 Sum_probs=51.5
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
..+..++.++++++.++.+=.+.+-+.... .-+.|+|+ -.-|..++++.+++.|++|++..=|+|..
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~~~~-~~~~f~~d~~~FPdp~~~i~~l~~~g~k~~~~~~P~i~~ 90 (317)
T cd06600 22 PQDKVVEVVDIMQKEGFPYDVVFLDIHYMD-SYRLFTWDPYRFPEPKKLIDELHKRNVKLVTIVDPGIRV 90 (317)
T ss_pred CHHHHHHHHHHHHHcCCCcceEEEChhhhC-CCCceeechhcCCCHHHHHHHHHHCCCEEEEEeeccccC
Confidence 356778999999999988777766554433 23556654 34589999999999999999988888874
No 152
>cd00311 TIM Triosephosphate isomerase (TIM) is a glycolytic enzyme that catalyzes the interconversion of dihydroxyacetone phosphate and D-glyceraldehyde-3-phosphate. The reaction is very efficient and requires neither cofactors nor metal ions. TIM, usually homodimeric, but in some organisms tetrameric, is ubiqitous and conserved in function across eukaryotes, bacteria and archaea.
Probab=62.21 E-value=18 Score=33.81 Aligned_cols=49 Identities=18% Similarity=0.253 Sum_probs=39.6
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++--|.-+ .+.+.+=++.|.++||.+|++.|
T Consensus 77 ~~mL~d~G~~~vii-----GHSERR~~f~Et-~~~i~~Kv~~a~~~gl~pIvCiG 125 (242)
T cd00311 77 AEMLKDAGAKYVII-----GHSERRQYFGET-DEDVAKKVKAALEAGLTPILCVG 125 (242)
T ss_pred HHHHHHcCCCEEEe-----CcccccCcCCCC-cHHHHHHHHHHHHCCCEEEEEeC
Confidence 56899999999999 555555555555 46688889999999999999997
No 153
>PF08531 Bac_rhamnosid_N: Alpha-L-rhamnosidase N-terminal domain; InterPro: IPR013737 This domain is found in bacterial rhamnosidase A and B enzymes and is probably involved in substrate recognition. ; PDB: 2OKX_B.
Probab=62.09 E-value=30 Score=30.28 Aligned_cols=44 Identities=23% Similarity=0.382 Sum_probs=25.6
Q ss_pred CceeEEEEcCcccccccCCCCC------eeEEE----eccccccCCcceEEEEEEee
Q psy13819 4 WSKGVLFANGRHWGRYWSEVGP------QYSLF----VPEEFLKVGTNRITIFELTR 50 (323)
Q Consensus 4 ~~kg~~~vng~~lGryw~~~gp------q~~ly----vP~~~l~~g~N~i~i~e~~~ 50 (323)
-|+=.++|||+.+|+-= ..| ...+| | .++|++|.|.|-|.=..+
T Consensus 13 ~g~Y~l~vNG~~V~~~~--l~P~~t~y~~~~~Y~tyDV-t~~L~~G~N~iav~lg~g 66 (172)
T PF08531_consen 13 LGRYELYVNGERVGDGP--LAPGWTDYDKRVYYQTYDV-TPYLRPGENVIAVWLGNG 66 (172)
T ss_dssp ESEEEEEETTEEEEEE----------BTTEEEEEEEE--TTT--TTEEEEEEEEEE-
T ss_pred CeeEEEEECCEEeeCCc--cccccccCCCceEEEEEeC-hHHhCCCCCEEEEEEeCC
Confidence 45557899999999732 233 22333 4 679999999988853333
No 154
>PRK09997 hydroxypyruvate isomerase; Provisional
Probab=61.39 E-value=11 Score=34.59 Aligned_cols=60 Identities=17% Similarity=0.218 Sum_probs=39.3
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+..++.++.++++|..+|.+...+..-...+.+..-...+.+.++.+.|+++|+.+.+.+
T Consensus 85 ~~~~~~i~~a~~lga~~i~~~~g~~~~~~~~~~~~~~~~~~l~~l~~~a~~~Gv~l~lE~ 144 (258)
T PRK09997 85 DGVAAAIRYARALGNKKINCLVGKTPAGFSSEQIHATLVENLRYAANMLMKEDILLLIEP 144 (258)
T ss_pred HHHHHHHHHHHHhCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHcCCEEEEEe
Confidence 457888999999999999874322110001111111123567888899999999999987
No 155
>cd07937 DRE_TIM_PC_TC_5S Pyruvate carboxylase and Transcarboxylase 5S, carboxyltransferase domain. This family includes the carboxyltransferase domains of pyruvate carboxylase (PC) and the transcarboxylase (TC) 5S subunit. Transcarboxylase 5S is a cobalt-dependent metalloenzyme subunit of the biotin-dependent transcarboxylase multienzyme complex. Transcarboxylase 5S transfers carbon dioxide from the 1.3S biotin to pyruvate in the second of two carboxylation reactions catalyzed by TC. The first reaction involves the transfer of carbon dioxide from methylmalonyl-CoA to the 1.3S biotin, and is catalyzed by the 12S subunit. These two steps allow a carboxylate group to be transferred from oxaloacetate to propionyl-CoA to yield pyruvate and methylmalonyl-CoA. The catalytic domain of transcarboxylase 5S has a canonical TIM-barrel fold with a large C-terminal extension that forms a funnel leading to the active site. Transcarboxylase 5S forms a homodimer and there are six dimers per complex
Probab=61.22 E-value=25 Score=33.16 Aligned_cols=51 Identities=8% Similarity=0.086 Sum_probs=42.1
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
..|.+.-++++++..+.|+..||+.++-+. .+.+...++.|+++|+.|.+.
T Consensus 87 ~~p~~~~~~di~~~~~~g~~~iri~~~~~~------------~~~~~~~i~~ak~~G~~v~~~ 137 (275)
T cd07937 87 HYPDDVVELFVEKAAKNGIDIFRIFDALND------------VRNLEVAIKAVKKAGKHVEGA 137 (275)
T ss_pred CCCcHHHHHHHHHHHHcCCCEEEEeecCCh------------HHHHHHHHHHHHHCCCeEEEE
Confidence 356667799999999999999999877655 356888899999999988753
No 156
>PF14307 Glyco_tran_WbsX: Glycosyltransferase WbsX
Probab=61.07 E-value=1.7e+02 Score=28.53 Aligned_cols=44 Identities=11% Similarity=0.112 Sum_probs=29.8
Q ss_pred cCCeEEeCCeeeEEeeeeecCCCCC-hhcHHHHH-HHHHHhCCCEEEEE
Q psy13819 79 AGDTFRLNEDPFQFVSGSFHYFRAP-PGRWCWIM-RAMRAAGLNALSTY 125 (323)
Q Consensus 79 ~~~~~~~dGk~~~i~~g~~Hy~r~p-~~~W~~~l-~~mk~~G~N~Vr~y 125 (323)
|++.+.|||||++++=..-. +| ...+-+.+ +.+|++|++-+-+-
T Consensus 150 D~rYikVdGKPv~~Iy~p~~---~pd~~~~~~~wr~~a~~~G~~giyii 195 (345)
T PF14307_consen 150 DPRYIKVDGKPVFLIYRPGD---IPDIKEMIERWREEAKEAGLPGIYII 195 (345)
T ss_pred CCCceeECCEEEEEEECccc---ccCHHHHHHHHHHHHHHcCCCceEEE
Confidence 78899999999998855422 22 22333333 56688999977773
No 157
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=60.89 E-value=28 Score=33.79 Aligned_cols=68 Identities=7% Similarity=0.015 Sum_probs=52.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc--cchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG--HRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g--~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
..+.-++.++.+++.|+.+=.+.+-+... ...+.|+|+- .-|..++++.+++.|++|++..-|+|+.+
T Consensus 22 ~~~ev~~~~~~~~~~~iP~d~i~lD~~~~-~~~~~f~~d~~~FPdp~~mi~~L~~~G~k~~~~~~P~v~~~ 91 (339)
T cd06603 22 DQEDVKEVDAGFDEHDIPYDVIWLDIEHT-DGKRYFTWDKKKFPDPEKMQEKLASKGRKLVTIVDPHIKRD 91 (339)
T ss_pred CHHHHHHHHHHHHHcCCCceEEEEChHHh-CCCCceEeCcccCCCHHHHHHHHHHCCCEEEEEecCceecC
Confidence 35677899999999998887776665432 3445566543 34889999999999999999998999853
No 158
>COG3684 LacD Tagatose-1,6-bisphosphate aldolase [Carbohydrate transport and metabolism]
Probab=60.87 E-value=10 Score=36.04 Aligned_cols=61 Identities=25% Similarity=0.215 Sum_probs=46.9
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
|.|.-.=.-.-+++|+.|-+++.+-|.|..-||+ -+-.-.+-+++|...|..+||..+|.|
T Consensus 107 rlp~l~~~isa~riK~~G~~avK~Lvy~~~D~~e---~neqk~a~ierigsec~aedi~f~lE~ 167 (306)
T COG3684 107 RLPDLLRKISAKRIKEDGGDAVKFLVYYRSDEDE---INEQKLAYIERIGSECHAEDLPFFLEP 167 (306)
T ss_pred cchhhhhhhCHHHHHHhcccceEEEEEEcCCchH---HhHHHHHHHHHHHHHhhhcCCceeEee
Confidence 4443222335678999999999999999999994 333333448999999999999999988
No 159
>cd04882 ACT_Bt0572_2 C-terminal ACT domain of a novel protein composed of just two ACT domains. Included in this CD is the C-terminal ACT domain of a novel protein composed of just two ACT domains, as seen in the yet uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related proteins. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=60.69 E-value=21 Score=24.97 Aligned_cols=55 Identities=18% Similarity=0.311 Sum_probs=38.7
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeE
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYI 160 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~v 160 (323)
|..-.+.+..+.+.|+|.+.++. +.........+-+.-. +.++.++.++++|..|
T Consensus 10 pG~L~~i~~~l~~~~~nI~~i~~-~~~~~~~~~~v~~~ve-~~~~~~~~L~~~G~~v 64 (65)
T cd04882 10 PGGLHEILQILSEEGINIEYMYA-FVEKKGGKALLIFRTE-DIEKAIEVLQERGVEL 64 (65)
T ss_pred CcHHHHHHHHHHHCCCChhheEE-EccCCCCeEEEEEEeC-CHHHHHHHHHHCCceE
Confidence 45567788899999999998875 2222223445555433 3888999999999865
No 160
>PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=60.38 E-value=1.4e+02 Score=27.50 Aligned_cols=54 Identities=15% Similarity=0.239 Sum_probs=37.7
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
..|.+.++.++++|+..|++.+. ..++ .....+++. .+++.+-++++++||.+.
T Consensus 21 ~~~~e~~~~~~~~G~~~iEl~~~-~~~~-~~~~~~~~~-~~~~~l~~~l~~~gl~i~ 74 (283)
T PRK13209 21 ECWLEKLAIAKTAGFDFVEMSVD-ESDE-RLARLDWSR-EQRLALVNALVETGFRVN 74 (283)
T ss_pred CCHHHHHHHHHHcCCCeEEEecC-cccc-chhccCCCH-HHHHHHHHHHHHcCCcee
Confidence 47899999999999999999532 1110 011223332 357888999999999875
No 161
>TIGR00587 nfo apurinic endonuclease (APN1). All proteins in this family for which functions are known are 5' AP endonculeases that are used in base excision repair and the repair of abasic sites in DNA.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=60.21 E-value=73 Score=29.77 Aligned_cols=58 Identities=10% Similarity=0.041 Sum_probs=38.4
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeE--EEccCCcc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYI--LLRPGPFI 168 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~v--ilr~GPyi 168 (323)
-.+-++.++++|+++|++++. .|+--........+..+|-+.++++++.+ +.--+||.
T Consensus 13 ~~~a~~~~~~~G~~~~qif~~----~P~~w~~~~~~~~~~~~~~~~~~~~~~~~~~i~~Hapy~ 72 (274)
T TIGR00587 13 LQAAYNRAAEIGATAFMFFLK----SPRWWRRPMLEEEVIDWFKAALETNKNLSQIVLVHAPYL 72 (274)
T ss_pred HHHHHHHHHHhCCCEEEEEec----CccccCCCCCCHHHHHHHHHHHHHcCCCCcceeccCCee
Confidence 356799999999999999643 23211112222345777778899998864 55556774
No 162
>PRK09997 hydroxypyruvate isomerase; Provisional
Probab=59.59 E-value=1.4e+02 Score=27.26 Aligned_cols=42 Identities=17% Similarity=0.159 Sum_probs=34.0
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
.++.|++++++|++.|++.. |. ..+++++.++++++||.+..
T Consensus 17 l~~~l~~~a~~Gf~~VEl~~------~~--------~~~~~~~~~~l~~~gl~~~~ 58 (258)
T PRK09997 17 FLARFEKAAQCGFRGVEFMF------PY--------DYDIEELKQVLASNKLEHTL 58 (258)
T ss_pred HHHHHHHHHHhCCCEEEEcC------CC--------CCCHHHHHHHHHHcCCcEEE
Confidence 67889999999999999931 11 13588889999999999864
No 163
>COG0366 AmyA Glycosidases [Carbohydrate transport and metabolism]
Probab=59.44 E-value=16 Score=36.52 Aligned_cols=54 Identities=13% Similarity=0.225 Sum_probs=39.2
Q ss_pred HHHHHHHHHhCCCEEEE-EEeCCCcccCCCee----------cccccchHHHHHHHHHHCCCeEEEc
Q psy13819 108 CWIMRAMRAAGLNALST-YVEWRSHEVHPGHY----------HYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~f----------df~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.+.|.-+|.+|+++|=+ .++=+. ....-| .|...+|+.++++.|+++||+||+.
T Consensus 32 ~~~LdYl~~LGv~aiwl~Pi~~s~--~~~~gY~~~Dy~~id~~~Gt~~d~~~li~~~H~~gi~vi~D 96 (505)
T COG0366 32 TEKLDYLKELGVDAIWLSPIFESP--QADHGYDVSDYTKVDPHFGTEEDFKELVEEAHKRGIKVILD 96 (505)
T ss_pred HHhhhHHHHhCCCEEEeCCCCCCC--ccCCCccccchhhcCcccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 37899999999999977 222110 111111 3556789999999999999999987
No 164
>KOG0470|consensus
Probab=59.06 E-value=12 Score=40.24 Aligned_cols=57 Identities=12% Similarity=0.130 Sum_probs=37.9
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCC-cccCCC---eecccc-------------cchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRS-HEVHPG---HYHYDG-------------HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~-hEP~~G---~fdf~g-------------~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+++|..+|.+|+|+|++---=.. |+-+-. ..+|-. ..++..+++.|++.||.|+|..
T Consensus 258 eKvlphlK~LG~NaiqLmpi~Ef~~~~~s~GY~~~nFFapssrYgt~~s~~ri~efK~lVd~aHs~GI~VlLDV 331 (757)
T KOG0470|consen 258 EKVLPHLKKLGYNAIQLMPIFEFGHYYASWGYQVTNFFAPSSRYGTPESPCRINEFKELVDKAHSLGIEVLLDV 331 (757)
T ss_pred hhhhhHHHHhCccceEEeehhhhhhhhhccCcceeEeecccccccCCCcccchHHHHHHHHHHhhCCcEEehhh
Confidence 45599999999999998221122 222211 111110 2379999999999999999874
No 165
>PRK09282 pyruvate carboxylase subunit B; Validated
Probab=57.90 E-value=2.6e+02 Score=29.68 Aligned_cols=56 Identities=11% Similarity=0.095 Sum_probs=46.0
Q ss_pred ecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 97 FHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 97 ~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..|...|.+.-+.+++++.+.|+..||+..+-|.. +++...++.|+++|+.|....
T Consensus 88 ~gy~~ypd~vv~~~v~~A~~~Gvd~irif~~lnd~------------~n~~~~i~~ak~~G~~v~~~i 143 (592)
T PRK09282 88 VGYRHYPDDVVEKFVEKAAENGIDIFRIFDALNDV------------RNMEVAIKAAKKAGAHVQGTI 143 (592)
T ss_pred cccccccchhhHHHHHHHHHCCCCEEEEEEecChH------------HHHHHHHHHHHHcCCEEEEEE
Confidence 34656777888999999999999999998776663 578899999999999877443
No 166
>PRK11572 copper homeostasis protein CutC; Provisional
Probab=57.16 E-value=16 Score=34.34 Aligned_cols=62 Identities=19% Similarity=0.170 Sum_probs=46.2
Q ss_pred EEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 91 QFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 91 ~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
+..+|.+.|....-+.+.+|++.+|++|++.|-+-+. .+.|+-|.+. +.++++.|+ |+.+..
T Consensus 59 RPR~gdF~Ys~~E~~~M~~di~~~~~~GadGvV~G~L-----~~dg~vD~~~---~~~Li~~a~--~~~vTF 120 (248)
T PRK11572 59 RPRGGDFCYSDGEFAAMLEDIATVRELGFPGLVTGVL-----DVDGHVDMPR---MRKIMAAAG--PLAVTF 120 (248)
T ss_pred ecCCCCCCCCHHHHHHHHHHHHHHHHcCCCEEEEeeE-----CCCCCcCHHH---HHHHHHHhc--CCceEE
Confidence 4467889998877889999999999999999998543 3456677666 556677773 555554
No 167
>TIGR00433 bioB biotin synthetase. Catalyzes the last step of the biotin biosynthesis pathway.
Probab=56.97 E-value=17 Score=34.10 Aligned_cols=52 Identities=19% Similarity=0.181 Sum_probs=34.8
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCC---CeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHP---GHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~---G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
++.+++||++|++.|.+.+. ..-|-.+ +..++ ++..+.++.|+++|+.|...
T Consensus 123 ~e~l~~Lk~aG~~~v~i~~E-~~~~~~~~i~~~~s~---~~~~~ai~~l~~~Gi~v~~~ 177 (296)
T TIGR00433 123 PEQAKRLKDAGLDYYNHNLD-TSQEFYSNIISTHTY---DDRVDTLENAKKAGLKVCSG 177 (296)
T ss_pred HHHHHHHHHcCCCEEEEccc-CCHHHHhhccCCCCH---HHHHHHHHHHHHcCCEEEEe
Confidence 67899999999999999765 2211111 11222 23556789999999987543
No 168
>TIGR01108 oadA oxaloacetate decarboxylase alpha subunit. This model describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea. The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.
Probab=56.60 E-value=2.7e+02 Score=29.50 Aligned_cols=54 Identities=9% Similarity=0.073 Sum_probs=44.8
Q ss_pred CCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 99 YFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 99 y~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
|-..|.+.-+.+++++.+.|+..||+..+-|.. +++...++.|+++|+.|....
T Consensus 85 ~~~ypddvv~~~v~~a~~~Gvd~irif~~lnd~------------~n~~~~i~~ak~~G~~v~~~i 138 (582)
T TIGR01108 85 YRHYADDVVERFVKKAVENGMDVFRIFDALNDP------------RNLQAAIQAAKKHGAHAQGTI 138 (582)
T ss_pred cccCchhhHHHHHHHHHHCCCCEEEEEEecCcH------------HHHHHHHHHHHHcCCEEEEEE
Confidence 545667778899999999999999998877753 568899999999999877543
No 169
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=56.52 E-value=45 Score=31.67 Aligned_cols=67 Identities=15% Similarity=0.107 Sum_probs=49.5
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCccc--------CCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEV--------HPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFIC 169 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP--------~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~ 169 (323)
+.+.-++.++++++.|+-+=-+++-...|.- .-+.|+|+ ..-|..++++.+++.|+++++..=|+|.
T Consensus 23 s~~ev~~v~~~~r~~~iP~D~i~lD~dw~~~~~~~~~~~~~~~ft~d~~~FPdp~~mi~~Lh~~G~k~v~~v~P~~~ 99 (292)
T cd06595 23 SDEEYLALMDRFKKHNIPLDVLVIDMDWHVTDIPSKYGSGWTGYSWNRKLFPDPEKLLQDLHDRGLKVTLNLHPADG 99 (292)
T ss_pred CHHHHHHHHHHHHHhCCCccEEEEecccccccccccccCCcceeEEChhcCCCHHHHHHHHHHCCCEEEEEeCCCcc
Confidence 5677889999999999877666665443331 22456664 3468999999999999999987777753
No 170
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=56.33 E-value=1.2e+02 Score=32.58 Aligned_cols=76 Identities=7% Similarity=0.033 Sum_probs=49.2
Q ss_pred CeeeEEeeeeecCCCCC-----hhcHHHHHHHHHHhCCCEEEE---------------EEeCCCcccCCCeecccccchH
Q psy13819 87 EDPFQFVSGSFHYFRAP-----PGRWCWIMRAMRAAGLNALST---------------YVEWRSHEVHPGHYHYDGHRDI 146 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p-----~~~W~~~l~~mk~~G~N~Vr~---------------yv~W~~hEP~~G~fdf~g~~dl 146 (323)
..+.++++..+-|-.-| .+.-...|+.+|++|+|||=+ |++| .|=|- +-| +.+-+
T Consensus 311 ~~~~r~~h~dld~vyd~dp~qq~~~L~~lLdrlk~~G~ntV~lqafadp~gd~~~~s~yfP~-~~lp~--r~d--~f~~~ 385 (671)
T PRK14582 311 KSPQRVMHIDLDYVYDENPQQQDRNIDVLIQRVKDMQISTVYLQAFADPDGDGLVKELYFPN-RLLPM--RAD--LFNRV 385 (671)
T ss_pred CCCEEEEEeccccccCCCHHHHHHHHHHHHHHHHHcCCCEEEEEeccCCCCCccccccccCc-ccccc--ccC--CcCHH
Confidence 34566666655554322 356788999999999999965 4566 33332 222 11112
Q ss_pred HHHHHHHHHCCCeEEEccCCccc
Q psy13819 147 EHFLQLAVEEDLYILLRPGPFIC 169 (323)
Q Consensus 147 ~~fl~~a~e~GL~vilr~GPyi~ 169 (323)
...++.+.|++|..+..||-.
T Consensus 386 --aw~l~~r~~v~v~AWmp~~~~ 406 (671)
T PRK14582 386 --AWQLRTRAGVNVYAWMPVLSF 406 (671)
T ss_pred --HHHHHHhhCCEEEEeccceee
Confidence 245699999999999988854
No 171
>PRK14042 pyruvate carboxylase subunit B; Provisional
Probab=55.97 E-value=2.8e+02 Score=29.52 Aligned_cols=53 Identities=6% Similarity=0.018 Sum_probs=43.2
Q ss_pred CCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 99 YFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 99 y~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
|-.+|.+..+..++++++.|+.++|+.-+.| ..+++...++.+++.|..+...
T Consensus 90 y~~~~d~vv~~~v~~a~~~Gidv~Rifd~ln------------d~~n~~~~i~~~k~~G~~~~~~ 142 (596)
T PRK14042 90 YRNYADDVVRAFVKLAVNNGVDVFRVFDALN------------DARNLKVAIDAIKSHKKHAQGA 142 (596)
T ss_pred cccCChHHHHHHHHHHHHcCCCEEEEcccCc------------chHHHHHHHHHHHHcCCEEEEE
Confidence 5567788889999999999999999944433 3567888999999999987654
No 172
>cd01299 Met_dep_hydrolase_A Metallo-dependent hydrolases, subgroup A is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site. The vast majority of the members have a conserved metal binding site, involving four histidines and one aspartic acid residue. In the common reaction mechanism, the metal ion (or ions) deprotonate a water molecule for a nucleophilic attack on the substrate. The function of this subgroup is unknown.
Probab=55.80 E-value=34 Score=32.49 Aligned_cols=60 Identities=10% Similarity=0.126 Sum_probs=43.8
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++..++.++.+++.|.+.|.+|.....--+. ++.-.++ .+.+.+++++|+++|+.|.+-.
T Consensus 119 ~~~~~~~v~~~~~~G~~~iK~~~~g~~~~~~~~~~~~~~~-~e~l~~~~~~A~~~g~~v~~H~ 180 (342)
T cd01299 119 VEEVRAAVREQLRRGADQIKIMATGGVLSPGDPPPDTQFS-EEELRAIVDEAHKAGLYVAAHA 180 (342)
T ss_pred HHHHHHHHHHHHHhCCCEEEEeccCCcCCCCCCCcccCcC-HHHHHHHHHHHHHcCCEEEEEe
Confidence 6788999999999999999998865332221 1211122 2458888999999999988765
No 173
>TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative. This family is conserved at better than 40 % identity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.
Probab=55.01 E-value=19 Score=33.45 Aligned_cols=56 Identities=9% Similarity=0.029 Sum_probs=37.8
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCe-ec--cc-ccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGH-YH--YD-GHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~-fd--f~-g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+..++.++.++++|.++|++.- .+...+. .. +. ..+.+.++.+.|+++|+.+.+.+
T Consensus 94 ~~~~~~i~~a~~lG~~~v~~~~----~~~~~~~~~~~~~~~~~~~l~~l~~~A~~~Gv~l~lE~ 153 (279)
T TIGR00542 94 EIMEKAIQLARDLGIRTIQLAG----YDVYYEEHDEETRRRFREGLKEAVELAARAQVTLAVEI 153 (279)
T ss_pred HHHHHHHHHHHHhCCCEEEecC----cccccCcCCHHHHHHHHHHHHHHHHHHHHcCCEEEEee
Confidence 3578889999999999998731 1111110 01 11 12457788999999999999986
No 174
>PRK09989 hypothetical protein; Provisional
Probab=54.89 E-value=26 Score=32.18 Aligned_cols=43 Identities=19% Similarity=0.263 Sum_probs=33.4
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
..++.|++++++|+..|++..+|. + +.+.+.++++++||.+..
T Consensus 16 ~l~~~l~~~~~~Gfd~VEl~~~~~----------~----~~~~~~~~l~~~Gl~v~~ 58 (258)
T PRK09989 16 PFIERFAAARKAGFDAVEFLFPYD----------Y----STLQIQKQLEQNHLTLAL 58 (258)
T ss_pred CHHHHHHHHHHcCCCEEEECCccc----------C----CHHHHHHHHHHcCCcEEE
Confidence 467899999999999999943321 1 245668889999999875
No 175
>PF14683 CBM-like: Polysaccharide lyase family 4, domain III; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=54.56 E-value=12 Score=32.83 Aligned_cols=37 Identities=41% Similarity=0.686 Sum_probs=19.0
Q ss_pred EEEEcCcccccccC-CCCC------------eeEE--E-eccccccCCcceEEE
Q psy13819 8 VLFANGRHWGRYWS-EVGP------------QYSL--F-VPEEFLKVGTNRITI 45 (323)
Q Consensus 8 ~~~vng~~lGryw~-~~gp------------q~~l--y-vP~~~l~~g~N~i~i 45 (323)
.|.||| -.+..+. ..|+ ...+ + ||+..|++|.|+|.+
T Consensus 96 ~V~vNg-~~~~~~~~~~~~d~~~~r~g~~~G~~~~~~~~ipa~~L~~G~Nti~l 148 (167)
T PF14683_consen 96 QVSVNG-WSGPFPSAPFGNDNAIYRSGIHRGNYRLYEFDIPASLLKAGENTITL 148 (167)
T ss_dssp EEEETT-EE-----------S--GGGT---S---EEEEEE-TTSS-SEEEEEEE
T ss_pred EEEEcC-ccCCccccccCCCCceeeCceecccEEEEEEEEcHHHEEeccEEEEE
Confidence 688999 5555552 1111 1223 3 999999999999976
No 176
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=54.24 E-value=44 Score=32.61 Aligned_cols=68 Identities=12% Similarity=0.134 Sum_probs=48.6
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEE----------eCCCcccC---------CCeeccc---ccchHHHHHHHHHHCCCeE
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYV----------EWRSHEVH---------PGHYHYD---GHRDIEHFLQLAVEEDLYI 160 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv----------~W~~hEP~---------~G~fdf~---g~~dl~~fl~~a~e~GL~v 160 (323)
..+.-++.++++++.|+.+=-+.+ .|+.+.-- -+.++|. ..-|..++++.+++.|++|
T Consensus 22 ~~~ev~~v~~~~~~~~iP~d~i~lD~W~~~~~~~~w~d~~y~~~~~~~~~~~~~~~f~~~~~FPdp~~mi~~Lh~~G~kv 101 (340)
T cd06597 22 TQAEVMRQMDAHEEHGIPVTVVVIEQWSDEATFYVFNDAQYTPKDGGAPLSYDDFSFPVEGRWPNPKGMIDELHEQGVKV 101 (340)
T ss_pred CHHHHHHHHHHHHHcCCCeeEEEEecccCcceeeeeccchhcccccCCcceecccccCccccCCCHHHHHHHHHHCCCEE
Confidence 356778999999999998777655 24432221 1334443 2347899999999999999
Q ss_pred EEccCCcccc
Q psy13819 161 LLRPGPFICG 170 (323)
Q Consensus 161 ilr~GPyi~a 170 (323)
++..=|+|..
T Consensus 102 ~l~v~P~i~~ 111 (340)
T cd06597 102 LLWQIPIIKL 111 (340)
T ss_pred EEEecCcccc
Confidence 9988888863
No 177
>cd02871 GH18_chitinase_D-like GH18 domain of Chitinase D (ChiD). ChiD, a chitinase found in Bacillus circulans, hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine in chitin and chitodextrins. The domain architecture of ChiD includes a catalytic glycosyl hydrolase family 18 (GH18) domain, a chitin-binding domain, and a fibronectin type III domain. The chitin-binding and fibronectin type III domains are located either N-terminal or C-terminal to the catalytic domain. This family includes exochitinase Chi36 from Bacillus cereus.
Probab=53.97 E-value=1.5e+02 Score=28.27 Aligned_cols=84 Identities=14% Similarity=0.150 Sum_probs=49.0
Q ss_pred hHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEE
Q psy13819 145 DIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPII 224 (323)
Q Consensus 145 dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi 224 (323)
.+.+.++.|++.|++|++-.|- |. + . . . ..++ +..+++++.+.+.++++- +=
T Consensus 61 ~~~~~i~~~q~~G~KVllSiGG-----~~--~------~--~--~--~~~~---~~~~~fa~sl~~~~~~~g------~D 112 (312)
T cd02871 61 EFKADIKALQAKGKKVLISIGG-----AN--G------H--V--D--LNHT---AQEDNFVDSIVAIIKEYG------FD 112 (312)
T ss_pred HHHHHHHHHHHCCCEEEEEEeC-----CC--C------c--c--c--cCCH---HHHHHHHHHHHHHHHHhC------CC
Confidence 3677889999999999998861 11 1 1 0 0 1122 344566677777666542 44
Q ss_pred EEccCCCcCCCCC---Ch-HHHHHHHHHHHHHhcCCc
Q psy13819 225 LVQVENEYGSDAE---CD-PAHAVWLRDLLRTYVQDK 257 (323)
Q Consensus 225 ~~QieNE~g~~~~---~~-~~Y~~~l~~~~~~~~g~~ 257 (323)
++.|+=|++.... .+ ..|.+.|+++ |+.++.+
T Consensus 113 GiDiD~E~~~~~~~~~~~~~~~~~~lk~l-r~~~~~~ 148 (312)
T cd02871 113 GLDIDLESGSNPLNATPVITNLISALKQL-KDHYGPN 148 (312)
T ss_pred eEEEecccCCccCCcHHHHHHHHHHHHHH-HHHcCCC
Confidence 7888888865321 12 4556666554 4434543
No 178
>PLN03036 glutamine synthetase; Provisional
Probab=53.88 E-value=1.8e+02 Score=29.64 Aligned_cols=66 Identities=26% Similarity=0.349 Sum_probs=44.7
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc--------chH---HHHH-HHHHHCCCeEEEccCCccccccC
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH--------RDI---EHFL-QLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~--------~dl---~~fl-~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
.-++..+.+.++|++.-.+ .||-.||+|-+.-. |++ ..++ +.|+++|+.+-.-|=|+. ++|.
T Consensus 231 i~~~i~~a~~~~GI~Ie~~-----~~E~gpGQ~Ei~l~~~d~L~aAD~~~l~R~ivk~VA~~~Gl~ATFMPKP~~-gd~~ 304 (432)
T PLN03036 231 ISDAHYKACLYAGINISGT-----NGEVMPGQWEYQVGPSVGIDAGDHIWCSRYILERITEQAGVVLTLDPKPIE-GDWN 304 (432)
T ss_pred HHHHHHHHHHHCCCCeEEE-----EcCcCCCceEEecCCChHHHHHHHHHHHHHHHHHHHHHhCCEEEEcCCcCC-CCcC
Confidence 3345566778999999998 89999998877632 111 1112 358899999988877642 4555
Q ss_pred CCCC
Q psy13819 174 FGGF 177 (323)
Q Consensus 174 ~gg~ 177 (323)
.-|.
T Consensus 305 GSGm 308 (432)
T PLN03036 305 GAGC 308 (432)
T ss_pred CCCc
Confidence 5453
No 179
>PRK00042 tpiA triosephosphate isomerase; Provisional
Probab=53.25 E-value=30 Score=32.46 Aligned_cols=49 Identities=12% Similarity=0.178 Sum_probs=36.2
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++-.|.-+. +.+.+=++.|.++||.+|++.|
T Consensus 79 ~~mLkd~G~~~vii-----GHSERR~~f~Etd-~~v~~K~~~a~~~gl~pIvCiG 127 (250)
T PRK00042 79 AEMLKDLGVKYVII-----GHSERRQYFGETD-ELVNKKVKAALKAGLTPILCVG 127 (250)
T ss_pred HHHHHHCCCCEEEe-----CcccccCccCcCH-HHHHHHHHHHHHCCCEEEEEcC
Confidence 56899999999999 5655555555442 3344445559999999999997
No 180
>PLN02561 triosephosphate isomerase
Probab=52.90 E-value=31 Score=32.50 Aligned_cols=49 Identities=10% Similarity=0.103 Sum_probs=39.1
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++..|.-+ ...+.+=++.|.++||.+|++.|
T Consensus 81 ~~mL~d~G~~~vii-----GHSERR~~f~Et-d~~v~~Kv~~al~~gl~pIvCvG 129 (253)
T PLN02561 81 AEMLVNLGIPWVIL-----GHSERRALLGES-NEFVGDKVAYALSQGLKVIACVG 129 (253)
T ss_pred HHHHHHcCCCEEEE-----CcccccCccCCC-hHHHHHHHHHHHHCcCEEEEEcC
Confidence 56899999999999 565555666655 34567778899999999999997
No 181
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=52.82 E-value=39 Score=32.73 Aligned_cols=68 Identities=12% Similarity=0.134 Sum_probs=51.0
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
+.+..++.++++++.|+.+=.+.+-+.... .-+.|+|+ ..-|..++++.+++.|+++++..-|+|+.+
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~~~~-~~~~f~~d~~~fPdp~~m~~~l~~~g~~~~~~~~P~v~~~ 91 (339)
T cd06604 22 PEEEVREIADEFRERDIPCDAIYLDIDYMD-GYRVFTWDKERFPDPKELIKELHEQGFKVVTIIDPGVKVD 91 (339)
T ss_pred CHHHHHHHHHHHHHhCCCcceEEECchhhC-CCCceeeccccCCCHHHHHHHHHHCCCEEEEEEeCceeCC
Confidence 456778999999999987766655555443 23445554 345789999999999999999888888753
No 182
>PRK14566 triosephosphate isomerase; Provisional
Probab=52.47 E-value=43 Score=31.74 Aligned_cols=49 Identities=16% Similarity=0.151 Sum_probs=38.0
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++..|.-+ .+.+.+=++.|.++||.+|++.|
T Consensus 88 ~~mL~d~G~~~vii-----GHSERR~~f~Et-d~~v~~Kv~~al~~gl~pIvCvG 136 (260)
T PRK14566 88 GQMLKDAGCRYVII-----GHSERRRMYGET-SNIVAEKFAAAQKHGLTPILCVG 136 (260)
T ss_pred HHHHHHcCCCEEEE-----CcccccCCCCcC-HHHHHHHHHHHHHCCCEEEEEcC
Confidence 56899999999999 555555555544 23466678999999999999997
No 183
>PF13380 CoA_binding_2: CoA binding domain; PDB: 3FF4_A 2D5A_A 2D59_A 2E6U_X 1IUL_A 1IUK_A 1Y81_A 2DUW_A.
Probab=51.56 E-value=43 Score=27.35 Aligned_cols=44 Identities=25% Similarity=0.422 Sum_probs=31.4
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
.|++...+.++.+.++|+..|=+. +| + .-+++++.|+++||.++
T Consensus 63 ~~~~~~~~~v~~~~~~g~~~v~~~---------~g----~---~~~~~~~~a~~~gi~vi 106 (116)
T PF13380_consen 63 VPPDKVPEIVDEAAALGVKAVWLQ---------PG----A---ESEELIEAAREAGIRVI 106 (116)
T ss_dssp S-HHHHHHHHHHHHHHT-SEEEE----------TT----S-----HHHHHHHHHTT-EEE
T ss_pred cCHHHHHHHHHHHHHcCCCEEEEE---------cc----h---HHHHHHHHHHHcCCEEE
Confidence 678999999999999998877662 12 2 24566999999999976
No 184
>PRK15492 triosephosphate isomerase; Provisional
Probab=50.57 E-value=35 Score=32.22 Aligned_cols=49 Identities=14% Similarity=0.185 Sum_probs=38.4
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
..++|++|++.|-+ .|--++-.|.-+ ...+.+=++.|.++||.+|++.|
T Consensus 87 a~mLkd~G~~~vii-----GHSERR~~f~Et-d~~v~~Kv~~a~~~gl~pIvCiG 135 (260)
T PRK15492 87 PLMLKEIGTQLVMI-----GHSERRHKFGET-DQEENAKVLAALKHDFTTLLCVG 135 (260)
T ss_pred HHHHHHcCCCEEEE-----CccccccccCcc-hHHHHHHHHHHHHCCCEEEEEcC
Confidence 56899999999999 555555566544 34466678899999999999997
No 185
>COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only]
Probab=50.47 E-value=97 Score=30.20 Aligned_cols=94 Identities=11% Similarity=0.123 Sum_probs=57.9
Q ss_pred cceEEEEecCCeEEeCCeeeEEeeeeecCCCCChh---cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHH
Q psy13819 71 SRTFAIDLAGDTFRLNEDPFQFVSGSFHYFRAPPG---RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIE 147 (323)
Q Consensus 71 ~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~---~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~ 147 (323)
...+.+++-+.++.+|. .-+.++.-+-+-.... .-..++....+.|.+||=- +. .-.--+|..
T Consensus 13 ~~~lGvTl~HEHl~~~~--~~~~~~~~~d~~~~~~~~a~~~~e~~~~~a~Gg~TIVD--------~T----~~~~GRdv~ 78 (316)
T COG1735 13 PADLGVTLMHEHLFIDP--YEIAGGLKNDPYDEDDEVALAIAELKRLMARGGQTIVD--------AT----NIGIGRDVL 78 (316)
T ss_pred HHHccceeehhhhccch--HHHhhcCCCCcccccHHHHHHHHHHHHHHHcCCCeEee--------CC----ccccCcCHH
Confidence 33445666677777774 2222332222222222 2344566666678888754 11 112246899
Q ss_pred HHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 148 HFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 148 ~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
++.+.+++.||.++..+|.|.-+.|+ .|+..
T Consensus 79 ~m~~vs~atglnIV~~TGfy~~~~~p-----~~~~~ 109 (316)
T COG1735 79 KMRRVAEATGLNIVAATGFYKAAFHP-----EYFAL 109 (316)
T ss_pred HHHHHHHHhCCcEEEeccccccccch-----hHHhh
Confidence 99999999999999999999877654 67654
No 186
>KOG0805|consensus
Probab=50.46 E-value=59 Score=30.88 Aligned_cols=79 Identities=14% Similarity=0.239 Sum_probs=50.6
Q ss_pred chHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHH-------HHHHhhh
Q psy13819 144 RDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELF-------PRIQKYL 216 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~-------~~i~~~~ 216 (323)
+...+++..|++.|-+.+|-| |---||.|.|..- ++.+-+..+.-++..++|...-+ ++++.+.
T Consensus 37 ~K~~~~~~Eaa~~Ga~LV~fP------EAfiGGYPrg~~F---g~~~G~r~~eGR~ef~kY~a~AIev~gpEv~~l~~la 107 (337)
T KOG0805|consen 37 DKAEKYIVEAASKGAELVLFP------EAFIGGYPRGFRF---GLAVGVRNEEGRDEFRKYHASAIEVPGPEVERLAELA 107 (337)
T ss_pred HHHHHHHHHHhcCCceEEEee------hHhccCCCCccee---eEEEeecchhhhHHHHHHHHHhhcCCChHHHHHHHHh
Confidence 446788999999999999988 4456999999754 33443444555555555544332 2333343
Q ss_pred ccCCCcEEEEccCCC
Q psy13819 217 YGNDRPIILVQVENE 231 (323)
Q Consensus 217 ~~n~g~Vi~~QieNE 231 (323)
-+|.-.+++=-||-|
T Consensus 108 ~~~~v~lv~G~iEre 122 (337)
T KOG0805|consen 108 KKNNVYLVMGAIERE 122 (337)
T ss_pred hcCCeEEEEEEEecc
Confidence 455556666678887
No 187
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=49.48 E-value=50 Score=32.96 Aligned_cols=70 Identities=11% Similarity=0.256 Sum_probs=49.2
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
..+...+.++.+++.|+-+=.+.+...... ..+.|.|+ -..|...+++.+++.|+++++..-|+|+.+-.
T Consensus 41 ~~~~v~~~i~~~~~~~iP~d~~~iD~~~~~-~~~~f~~d~~~FPd~~~~~~~l~~~G~~~~~~~~P~v~~~~~ 112 (441)
T PF01055_consen 41 NQDEVREVIDRYRSNGIPLDVIWIDDDYQD-GYGDFTWDPERFPDPKQMIDELHDQGIKVVLWVHPFVSNDSP 112 (441)
T ss_dssp SHHHHHHHHHHHHHTT--EEEEEE-GGGSB-TTBTT-B-TTTTTTHHHHHHHHHHTT-EEEEEEESEEETTTT
T ss_pred CHHHHHHHHHHHHHcCCCccceeccccccc-cccccccccccccchHHHHHhHhhCCcEEEEEeecccCCCCC
Confidence 356778999999999998887766655433 34444433 23589999999999999999998899887654
No 188
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=49.13 E-value=72 Score=30.74 Aligned_cols=69 Identities=13% Similarity=0.363 Sum_probs=50.9
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEe-CCC-cccCCCe---eccc----ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVE-WRS-HEVHPGH---YHYD----GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~-W~~-hEP~~G~---fdf~----g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
+.+.-++.++++++.|+-+=-+++- |.. ++..-|. .||+ ..-|..++++..++.|++|++..=|+|+.+
T Consensus 21 s~~~v~~~~~~~~~~~iP~d~i~lddw~~~~~~~~g~~~~~~f~~d~~~FPdp~~mi~~Lh~~G~~~~~~i~P~v~~~ 98 (317)
T cd06594 21 GTDKVLEALEKARAAGVKVAGLWLQDWTGRRETSFGDRLWWNWEWDPERYPGLDELIEELKARGIRVLTYINPYLADD 98 (317)
T ss_pred CHHHHHHHHHHHHHcCCCeeEEEEccccCcccccccceeeeeeEEChhhCCCHHHHHHHHHHCCCEEEEEecCceecC
Confidence 6678899999999999887777664 532 2222231 2343 345889999999999999999888898764
No 189
>PRK08645 bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein; Reviewed
Probab=48.76 E-value=87 Score=33.16 Aligned_cols=113 Identities=12% Similarity=0.044 Sum_probs=71.2
Q ss_pred CCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 86 NEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
++++-+.+|+..++.+-+.+.=-++|++-.++|...+-|-.+ ||- +.+.+|++.|++.++.+|...-
T Consensus 459 ~~~~~f~ig~A~~P~~~~~~~d~~~L~~Ki~aGAdf~iTQ~~----------fd~---~~~~~~~~~~~~~~vpIi~GIm 525 (612)
T PRK08645 459 GKKTNFSIGGAFNPNVRNLDKEVKRLEKKIEAGADYFITQPV----------YDE---ELIEELLEATKHLGVPIFIGIM 525 (612)
T ss_pred CCCCceeeeEEeCCCCCChHHHHHHHHHHHHcCCCEEEeccc----------CCH---HHHHHHHHHHhcCCCCEEEEee
Confidence 445556778877765544332233444445689999999333 333 3488889999988888888877
Q ss_pred Ccccc--------ccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 166 PFICG--------KRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 166 Pyi~a--------Ew~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
|.... ++..--+|.|+.+..-. . .+...+++...++..++++.+.+
T Consensus 526 Pi~s~k~~~~~~~~~~Gv~vP~~l~~~l~~--~-~d~~~~~~~gv~~a~e~i~~l~~ 579 (612)
T PRK08645 526 PLVSYRNAEFLHNEVPGITLPEEIRERMRA--V-EDKEEAREEGVAIARELIDAARE 579 (612)
T ss_pred ecCCHHHHHHHHhCCCCCCCCHHHHHHHHh--c-CCchHHHHHHHHHHHHHHHHHHh
Confidence 74432 34455578888663111 1 12346777777888888887653
No 190
>PTZ00333 triosephosphate isomerase; Provisional
Probab=48.27 E-value=42 Score=31.63 Aligned_cols=49 Identities=16% Similarity=0.219 Sum_probs=40.0
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
..++|++|++.|-+ .|--++-.|.-+ .+.+.+=++.|.++||.+|++.|
T Consensus 82 ~~mL~d~G~~~vii-----GHSERR~~f~Et-d~~I~~Kv~~al~~gl~pIlCvG 130 (255)
T PTZ00333 82 AEMLKDLGINWTIL-----GHSERRQYFGET-NEIVAQKVKNALENGLKVILCIG 130 (255)
T ss_pred HHHHHHcCCCEEEE-----CcccccCcCCCC-cHHHHHHHHHHHHCCCEEEEEcC
Confidence 46899999999999 666665666554 45678888999999999999997
No 191
>PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=47.07 E-value=26 Score=32.46 Aligned_cols=57 Identities=5% Similarity=0.010 Sum_probs=37.0
Q ss_pred cHHHHHHHHHHhCCCEEEEEE--eCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYV--EWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv--~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..++.++.++++|..+|++.- .|...+ .+..++. -.+.+..+.+.|+++|+.+.+.+
T Consensus 100 ~~~~~i~~a~~lG~~~i~~~~~~~~~~~~-~~~~~~~-~~~~l~~l~~~A~~~GV~i~iE~ 158 (283)
T PRK13209 100 IMRKAIQLAQDLGIRVIQLAGYDVYYEQA-NNETRRR-FIDGLKESVELASRASVTLAFEI 158 (283)
T ss_pred HHHHHHHHHHHcCCCEEEECCcccccccc-HHHHHHH-HHHHHHHHHHHHHHhCCEEEEee
Confidence 467888999999999998720 010000 0000110 12457888999999999999988
No 192
>PF01487 DHquinase_I: Type I 3-dehydroquinase; InterPro: IPR001381 3-dehydroquinate dehydratase (4.2.1.10 from EC), or dehydroquinase, catalyzes the conversion of 3-dehydroquinate into 3-dehydroshikimate. It is the third step in the shikimate pathway for the biosynthesis of aromatic amino acids from chorismate. Two classes of dehydroquinases exist, known as types I and II. The best studied type I enzyme is from Escherichia coli (gene aroD) and related bacteria where it is a homodimeric protein. In fungi, dehydroquinase is part of a multifunctional enzyme which catalyzes five consecutive steps in the shikimate pathway. A histidine [] is involved in the catalytic mechanism.; GO: 0003855 3-dehydroquinate dehydratase activity; PDB: 2O7Q_A 2GPT_A 2O7S_A 1SFL_A 2OCZ_A 2OX1_C 1GQN_A 1QFE_B 1L9W_D 3L9C_A ....
Probab=46.77 E-value=36 Score=30.73 Aligned_cols=63 Identities=19% Similarity=0.288 Sum_probs=39.0
Q ss_pred eeecCCCCChh--cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHC-CCeEE-EccCC
Q psy13819 95 GSFHYFRAPPG--RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEE-DLYIL-LRPGP 166 (323)
Q Consensus 95 g~~Hy~r~p~~--~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~-GL~vi-lr~GP 166 (323)
++.|.|.-+|. .+.+.+++|++.|.+.|.+=+. |. +.+....+.+++..+++. +..+| +..|+
T Consensus 118 ~S~H~f~~tp~~~~l~~~~~~~~~~gadivKia~~-----~~----~~~D~~~l~~~~~~~~~~~~~p~i~~~MG~ 184 (224)
T PF01487_consen 118 LSYHDFEKTPSWEELIELLEEMQELGADIVKIAVM-----AN----SPEDVLRLLRFTKEFREEPDIPVIAISMGE 184 (224)
T ss_dssp EEEEESS---THHHHHHHHHHHHHTT-SEEEEEEE------S----SHHHHHHHHHHHHHHHHHTSSEEEEEEETG
T ss_pred EEeccCCCCCCHHHHHHHHHHHHhcCCCeEEEEec-----cC----CHHHHHHHHHHHHHHhhccCCcEEEEEcCC
Confidence 67887765544 4899999999999999999333 32 222333466666666654 67766 45653
No 193
>PRK09432 metF 5,10-methylenetetrahydrofolate reductase; Provisional
Probab=46.58 E-value=1.1e+02 Score=29.36 Aligned_cols=89 Identities=13% Similarity=0.140 Sum_probs=54.8
Q ss_pred HHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCC--CeEEEccCCccc-------cccCCCCCccc
Q psy13819 110 IMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEED--LYILLRPGPFIC-------GKRDFGGFPPW 180 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~G--L~vilr~GPyi~-------aEw~~gg~P~W 180 (323)
.|++-.++|.+.+-|-++ ||.+ .+.+|++.|++.| +.|++..-|... ++..+-.+|.|
T Consensus 168 ~Lk~K~~aGA~~~iTQ~~----------Fd~~---~~~~f~~~~~~~Gi~vPIi~GI~pi~s~~~~~~~~~~~Gv~vP~~ 234 (296)
T PRK09432 168 NLKRKVDAGANRAITQFF----------FDVE---SYLRFRDRCVSAGIDVEIVPGILPVSNFKQLKKFADMTNVRIPAW 234 (296)
T ss_pred HHHHHHHcCCCeeecccc----------cchH---HHHHHHHHHHHcCCCCCEEeeccccCCHHHHHHHHHccCCCCCHH
Confidence 344445689988887222 3333 4889999999999 556666656332 45566778999
Q ss_pred ccccCCCeeecCCC-HhHHHHHHHHHHHHHHHHHh
Q psy13819 181 LLKVAPDILLRQNH-PVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 181 l~~~~p~~~~R~~~-~~f~~~~~~~~~~l~~~i~~ 214 (323)
+.+..-. . .++ +..+++-.++..++++.+.+
T Consensus 235 l~~~l~~--~-~d~~~~~~~~Gi~~a~e~i~~L~~ 266 (296)
T PRK09432 235 MAKMFDG--L-DDDAETRKLVGASIAMDMVKILSR 266 (296)
T ss_pred HHHHHHh--c-CCCHHHHHHHHHHHHHHHHHHHHH
Confidence 8663211 1 123 34566666677777776543
No 194
>TIGR02635 RhaI_grampos L-rhamnose isomerase, Streptomyces subtype. This clade of sequences is closely related to the L-rhamnose isomerases found in Pseudomonas stutzeri and in a number of the Rhizobiales (TIGR02629). The genes of the family represented here are found in similar genomic contexts which contain genes apparently involved in rhamnose catabolism such as rhamnulose-1-phosphate aldolase (TIGR02632), sugar kinases, and sugar transporters.
Probab=46.52 E-value=3.1e+02 Score=27.36 Aligned_cols=121 Identities=14% Similarity=0.059 Sum_probs=70.3
Q ss_pred hhcHHHHHHHHHHh-CC-CEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-EccCCccccccCCCCCccc
Q psy13819 104 PGRWCWIMRAMRAA-GL-NALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRPGPFICGKRDFGGFPPW 180 (323)
Q Consensus 104 ~~~W~~~l~~mk~~-G~-N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~GPyi~aEw~~gg~P~W 180 (323)
+++..+|+.++..+ ++ -.|.+.++|.. . .|+.++.+.++++||.|. +.|+ +...|.
T Consensus 39 ~~e~~~d~~~v~~L~~~~~~v~lH~~~d~------~------~d~~~~~~~l~~~GL~v~~i~p~--------~f~~~~- 97 (378)
T TIGR02635 39 VFEKIEDAALVHRLTGICPTVALHIPWDR------V------EDYEELARYAEELGLKIGAINPN--------LFQDDD- 97 (378)
T ss_pred HHHHHHHHHHHHhhcCCCCceeeccCCcc------c------cCHHHHHHHHHHcCCceeeeeCC--------ccCCcc-
Confidence 45566777777776 33 45666666622 1 457788999999999988 7774 111121
Q ss_pred ccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEcc-CCCcCCCCC---ChHHHHHHHHHHHHH
Q psy13819 181 LLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQV-ENEYGSDAE---CDPAHAVWLRDLLRT 252 (323)
Q Consensus 181 l~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~Qi-eNE~g~~~~---~~~~Y~~~l~~~~~~ 252 (323)
++...+-..||..++.+-++.++.++.-+.+ +.+.|..|=- .-+|..+.. .-..+.+.|++++..
T Consensus 98 ----~~~GSLt~pD~~vR~~AIe~~k~~idiA~eL---Ga~~I~iW~~DG~~~~g~~~~~~a~~rl~esL~eI~~~ 166 (378)
T TIGR02635 98 ----YKFGSLTHPDKRIRRKAIDHLLECVDIAKKT---GSKDISLWLADGTNYPGQDDFRSRKDRLEESLAEVYEH 166 (378)
T ss_pred ----cCCCCCCCCCHHHHHHHHHHHHHHHHHHHHh---CCCeEEEecCCcCcCCcccCHHHHHHHHHHHHHHHHHh
Confidence 1222466678999999988888888765532 2234445511 122222211 114556666776633
No 195
>PRK02412 aroD 3-dehydroquinate dehydratase; Provisional
Probab=46.49 E-value=62 Score=30.17 Aligned_cols=31 Identities=23% Similarity=0.319 Sum_probs=26.5
Q ss_pred eeecCCCCCh--hcHHHHHHHHHHhCCCEEEEE
Q psy13819 95 GSFHYFRAPP--GRWCWIMRAMRAAGLNALSTY 125 (323)
Q Consensus 95 g~~Hy~r~p~--~~W~~~l~~mk~~G~N~Vr~y 125 (323)
+++|.|..+| +.+.+.+++|++.|.+.|.+-
T Consensus 140 ~S~H~f~~tP~~~~l~~~~~~~~~~gaDivKia 172 (253)
T PRK02412 140 LSYHDFEKTPPKEEIVERLRKMESLGADIVKIA 172 (253)
T ss_pred EeeCCCCCCcCHHHHHHHHHHHHHhCCCEEEEE
Confidence 4999987766 567889999999999999993
No 196
>cd06601 GH31_lyase_GLase GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.
Probab=45.67 E-value=2.6e+02 Score=27.21 Aligned_cols=66 Identities=9% Similarity=0.051 Sum_probs=52.1
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc--cchHHHHHHHHHHCCCeEEEccCCccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG--HRDIEHFLQLAVEEDLYILLRPGPFIC 169 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g--~~dl~~fl~~a~e~GL~vilr~GPyi~ 169 (323)
+.++.++.++++++.++-+=-+++-+..+.- -+.|.|+. .-|..++++.+++.|+++++..=|+|.
T Consensus 22 ~~~ev~~v~~~~r~~~IP~D~i~lDidy~~~-~~~Ft~d~~~FPdp~~mv~~L~~~G~klv~~i~P~i~ 89 (332)
T cd06601 22 NRSDLEEVVEGYRDNNIPLDGLHVDVDFQDN-YRTFTTNGGGFPNPKEMFDNLHNKGLKCSTNITPVIS 89 (332)
T ss_pred CHHHHHHHHHHHHHcCCCCceEEEcCchhcC-CCceeecCCCCCCHHHHHHHHHHCCCeEEEEecCcee
Confidence 4667889999999999877777777766543 36666643 457899999999999999988888887
No 197
>TIGR00419 tim triosephosphate isomerase. Triosephosphate isomerase (tim/TPIA) is the glycolytic enzyme that catalyzes the reversible interconversion of glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. The active site of the enzyme is located between residues 240-258 of the model ([AV]-Y-E-P-[LIVM]-W-[SA]-I-G-T-[GK]) with E being the active site residue. There is a slight deviation from this sequence within the archeal members of this family.
Probab=45.29 E-value=49 Score=30.16 Aligned_cols=44 Identities=14% Similarity=0.104 Sum_probs=34.9
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..++|++|++.|-+ ++-|.+ |..+. +.+=++.|.++||.+|++.
T Consensus 74 ~~mLkd~G~~~vii----GHSERR---f~Etd---i~~Kv~~a~~~gl~~IvCi 117 (205)
T TIGR00419 74 AEMLKDIGAKGTLI----NHSERR---MKLAD---IEKKIARLKELGLTSVVCT 117 (205)
T ss_pred HHHHHHcCCCEEEE----CcccCC---CCccH---HHHHHHHHHHCCCEEEEEE
Confidence 56899999999999 344554 44443 8888999999999999976
No 198
>COG3661 AguA Alpha-glucuronidase [Carbohydrate transport and metabolism]
Probab=45.22 E-value=2.8e+02 Score=28.64 Aligned_cols=113 Identities=15% Similarity=0.079 Sum_probs=68.8
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKV 184 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~ 184 (323)
+..+|-.+.+...|+|.+-+-- =|.++.+-=--+-..+.++.++-+.-+++||+|.|.. +..-|--|-.
T Consensus 183 qR~kDYAR~laSiGINg~v~NN-VNvk~~e~~lit~~fl~k~aklAdiFR~YGIK~yLsi---------nfaSP~~lGg- 251 (684)
T COG3661 183 QRMKDYARALASIGINGTVLNN-VNVKKAESYLITAPFLAKAAKLADIFRPYGIKVYLSI---------NFASPMELGG- 251 (684)
T ss_pred HHHHHHHHHHhhcCcceEEecc-cccchhhhheechHhHHHHHHHHHHhhhccceEEEEe---------ccCCccccCC-
Confidence 5667788888899999998820 0233332112333346778888899999999999876 2333433321
Q ss_pred CCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC
Q psy13819 185 APDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS 234 (323)
Q Consensus 185 ~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~ 234 (323)
+.+.| ..-+++++|+++-+..|-++.-.=||-++-..-|--+|-
T Consensus 252 -----L~TAD-PLDe~VrawWkeka~~IY~yIPDFGGFLVKAdSE~~PGP 295 (684)
T COG3661 252 -----LKTAD-PLDEAVRAWWKEKADEIYKYIPDFGGFLVKADSEGQPGP 295 (684)
T ss_pred -----cCcCC-cccHHHHHHHHHHHHHHHHhcccccceEEeccCCCCCCc
Confidence 12222 235688899998888877764333454444444444554
No 199
>cd06545 GH18_3CO4_chitinase The Bacteroides thetaiotaomicron protein represented by pdb structure 3CO4 is an uncharacterized bacterial member of the family 18 glycosyl hydrolases with homologs found in Flavobacterium, Stigmatella, and Pseudomonas.
Probab=45.13 E-value=2.2e+02 Score=26.14 Aligned_cols=31 Identities=13% Similarity=0.247 Sum_probs=23.6
Q ss_pred CCeeccc-ccchHHHHHHHHHHCCCeEEEccC
Q psy13819 135 PGHYHYD-GHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 135 ~G~fdf~-g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.|...++ ...++..+++.|++.|++|++..|
T Consensus 36 ~G~l~~~~~~~~~~~~~~~~~~~~~kvl~sig 67 (253)
T cd06545 36 NGTLNANPVRSELNSVVNAAHAHNVKILISLA 67 (253)
T ss_pred CCeEEecCcHHHHHHHHHHHHhCCCEEEEEEc
Confidence 3555553 234678889999999999999886
No 200
>COG1809 (2R)-phospho-3-sulfolactate synthase (PSL synthase, CoM biosynthesis) [Coenzyme transport and metabolism]
Probab=45.08 E-value=1.1e+02 Score=28.71 Aligned_cols=63 Identities=14% Similarity=0.231 Sum_probs=44.5
Q ss_pred eeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 93 VSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 93 ~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.||...--++....-..-|+.++++||++|+++ -|.-.-+ .....++|+.|-++|+.|.-..|
T Consensus 78 pGGtlfe~a~~~~kvdeyl~e~~~lGfe~iEIS---------~G~i~m~-~eek~~lIe~a~d~Gf~vlsEvG 140 (258)
T COG1809 78 PGGTLFEIAYSQDKVDEYLNEAKELGFEAIEIS---------NGTIPMS-TEEKCRLIERAVDEGFMVLSEVG 140 (258)
T ss_pred CCceEEEeehhcccHHHHHHHHHHcCccEEEec---------CCeeecc-hHHHHHHHHHHHhcccEEehhhc
Confidence 344333335555677888999999999999993 1222222 12367889999999999998886
No 201
>cd00019 AP2Ec AP endonuclease family 2; These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites; the alignment also contains hexulose-6-phosphate isomerases, enzymes that catalyze the epimerization of D-arabino-6-hexulose 3-phosphate to D-fructose 6-phosphate, via cleaving the phosphoesterbond with the sugar.
Probab=45.00 E-value=18 Score=33.55 Aligned_cols=59 Identities=10% Similarity=0.002 Sum_probs=39.0
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+..++-++.++++|+++|.+........+.+..+. .-.+.+.++.+.|+++|+.+.+.+
T Consensus 85 ~~~~~~i~~A~~lG~~~v~~~~g~~~~~~~~~~~~-~~~~~l~~l~~~a~~~gi~l~lEn 143 (279)
T cd00019 85 ERLKDEIERCEELGIRLLVFHPGSYLGQSKEEGLK-RVIEALNELIDKAETKGVVIALET 143 (279)
T ss_pred HHHHHHHHHHHHcCCCEEEECCCCCCCCCHHHHHH-HHHHHHHHHHHhccCCCCEEEEeC
Confidence 56788999999999999877332211001111111 113567888888999999999988
No 202
>KOG3293|consensus
Probab=44.98 E-value=50 Score=27.76 Aligned_cols=74 Identities=19% Similarity=0.220 Sum_probs=39.7
Q ss_pred eccccccCCcceEEEEEEeeCCCCccccchhhhhhccc-cccceEEEEecCCeEEeCCeeeEEeeeeecCCCCChhcH
Q psy13819 31 VPEEFLKVGTNRITIFELTRAPDKYDVDFVDKISQRRA-RMSRTFAIDLAGDTFRLNEDPFQFVSGSFHYFRAPPGRW 107 (323)
Q Consensus 31 vP~~~l~~g~N~i~i~e~~~~~~~~~i~~~~~~i~~~~-~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W 107 (323)
+|-++|+...|..+.+||+. +....=.++.-..++.- +..-.++ +-|++.|... -...|.+-.+-|-|+|.+-.
T Consensus 2 lPLsLL~~aq~~pmlvELKN-get~nGhL~~cD~wMNl~L~~Vi~t-s~Dgdkf~r~-pEcYirGttIkylri~d~ii 76 (134)
T KOG3293|consen 2 LPLSLLKTAQNHPMLVELKN-GETYNGHLVNCDNWMNLHLREVICT-SEDGDKFFRM-PECYIRGTTIKYLRIPDEII 76 (134)
T ss_pred cchhHHHhcCCCeEEEEecC-CCEecceeecchhhhhcchheeEEe-ccCCCceeec-ceeEEecceeEEEeccHHHH
Confidence 58889999999999999986 32211111111112211 1111111 1244444433 13455677888999986543
No 203
>KOG0622|consensus
Probab=44.15 E-value=50 Score=33.46 Aligned_cols=68 Identities=18% Similarity=0.231 Sum_probs=48.7
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCe-EEEccCCccccccCCCCCcc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLY-ILLRPGPFICGKRDFGGFPP 179 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~-vilr~GPyi~aEw~~gg~P~ 179 (323)
+..+.-+..|+.+|++|+|.|=+++.=..--+.+-.|.-+ ..|.+..+|++.+.|.. .+|..| ||+|.
T Consensus 190 ~~~~~~~~lLd~ak~l~lnvvGvsfHvGSgc~d~~~y~~A-i~dAr~vfd~g~e~Gf~m~~LdiG---------GGf~g 258 (448)
T KOG0622|consen 190 CSLDNCRHLLDMAKELELNVVGVSFHVGSGCTDLQAYRDA-ISDARNVFDMGAELGFEMDILDIG---------GGFPG 258 (448)
T ss_pred CCHHHHHHHHHHHHHcCceEEEEEEEecCCCCCHHHHHHH-HHHHHHHHHHHHhcCceEEEeecC---------CCCCC
Confidence 3456778899999999999999987655433333333322 34677789999999997 567885 88874
No 204
>cd02877 GH18_hevamine_XipI_class_III This conserved domain family includes xylanase inhibitor Xip-I, and the class III plant chitinases such as hevamine, concanavalin B, and PPL2, all of which have a glycosyl hydrolase family 18 (GH18) domain. Hevamine is a class III endochitinase that hydrolyzes the linear polysaccharide chains of chitin and peptidoglycan and is important for defense against pathogenic bacteria and fungi. PPL2 (Parkia platycephala lectin 2) is a class III chitinase from Parkia platycephala seeds that hydrolyzes beta(1-4) glycosidic bonds linking 2-acetoamido-2-deoxy-beta-D-glucopyranose units in chitin.
Probab=44.03 E-value=2.9e+02 Score=26.25 Aligned_cols=48 Identities=10% Similarity=0.079 Sum_probs=31.9
Q ss_pred CCCEEEEEEeCCCcccCCCeeccccc---------chHHHHHHHHHHCCCeEEEccC
Q psy13819 118 GLNALSTYVEWRSHEVHPGHYHYDGH---------RDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 118 G~N~Vr~yv~W~~hEP~~G~fdf~g~---------~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|-+.+.=..-....-..||.+. ..+.+.|+.|++.|++|+|-+|
T Consensus 24 ~~dii~i~Fl~~~~~~~~p~~n~~~~c~~~~~~~c~~~~~dI~~cq~~G~KVlLSIG 80 (280)
T cd02877 24 NYDIVNISFLNVFGSGGTPGLNFAGHCGGSTYPNCPQLGADIKHCQSKGKKVLLSIG 80 (280)
T ss_pred CccEEEEEeEcccCCCCCcccCccccCcccccccchhHHHHHHHHHHCCCEEEEEcc
Confidence 47777774432221111124666554 2689999999999999999996
No 205
>PRK14565 triosephosphate isomerase; Provisional
Probab=44.01 E-value=2.7e+02 Score=25.94 Aligned_cols=49 Identities=18% Similarity=0.295 Sum_probs=35.8
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.+-+ .|--++-.|+-+. ..+.+=++.|.++||.+|++.|
T Consensus 78 ~~mLkd~G~~~vii-----GHSERR~~f~Etd-~~V~~Kv~~al~~gl~pIvCiG 126 (237)
T PRK14565 78 AKMLKECGCSYVIL-----GHSERRSTFHETD-SDIRLKAESAIESGLIPIICVG 126 (237)
T ss_pred HHHHHHcCCCEEEE-----CcccccCcCCcCH-HHHHHHHHHHHHCCCEEEEEcC
Confidence 56899999999999 5555545554442 2233334899999999999997
No 206
>PLN02284 glutamine synthetase
Probab=43.87 E-value=3.3e+02 Score=26.81 Aligned_cols=52 Identities=21% Similarity=0.413 Sum_probs=37.2
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeeccccc--------chH---HHHHH-HHHHCCCeEEEccCCc
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH--------RDI---EHFLQ-LAVEEDLYILLRPGPF 167 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~--------~dl---~~fl~-~a~e~GL~vilr~GPy 167 (323)
.+.+.++|++.-.+ .||-.||||-+.-. |++ ..+++ .|+++|+.+-.-|=|+
T Consensus 176 ~~~l~~~Gi~ve~~-----h~E~apGQ~Ei~l~~~d~l~aAD~~~~~K~vvk~vA~~~Gl~ATFMPKP~ 239 (354)
T PLN02284 176 YKACLYAGINISGI-----NGEVMPGQWEFQVGPVVGISAGDQLWVARYILERITEIAGVVVSFDPKPI 239 (354)
T ss_pred HHHHHHCCCCeEEE-----EcCcCCCceEEEecCCcHHHHHHHHHHHHHHHHHHHHHhCCEEEEcCCCC
Confidence 34449999999888 89999998776421 111 22233 6999999998888775
No 207
>PRK14567 triosephosphate isomerase; Provisional
Probab=43.70 E-value=53 Score=30.99 Aligned_cols=49 Identities=14% Similarity=0.206 Sum_probs=38.2
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
..++|++|++.|-+ .|--++-.|.-+ ...+.+=++.|.++||.+|++.|
T Consensus 78 ~~mLkd~G~~yvii-----GHSERR~~f~Et-d~~v~~Kv~~al~~gl~pI~CiG 126 (253)
T PRK14567 78 ARMLEDIGCDYLLI-----GHSERRSLFAES-DEDVFKKLNKIIDTTITPVVCIG 126 (253)
T ss_pred HHHHHHcCCCEEEE-----CcccccCccCCC-HHHHHHHHHHHHHCCCEEEEEcC
Confidence 46899999999999 555555555544 34566778899999999999997
No 208
>PRK09250 fructose-bisphosphate aldolase; Provisional
Probab=43.51 E-value=37 Score=33.53 Aligned_cols=89 Identities=12% Similarity=-0.035 Sum_probs=58.9
Q ss_pred HHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCe
Q psy13819 109 WIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDI 188 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~ 188 (323)
-.++.+-++|..+|.+.+++.. .+...-..++.+..+.|++.||-+++..=|. + .. +.+ +.
T Consensus 150 ~sVedAlrLGAdAV~~tvy~Gs------~~E~~ml~~l~~i~~ea~~~GlPlv~~~YpR--G----~~----i~~--~~- 210 (348)
T PRK09250 150 ASVEDALRLGAVAVGATIYFGS------EESRRQIEEISEAFEEAHELGLATVLWSYLR--N----SA----FKK--DG- 210 (348)
T ss_pred ecHHHHHHCCCCEEEEEEecCC------HHHHHHHHHHHHHHHHHHHhCCCEEEEeccc--C----cc----cCC--cc-
Confidence 3477788999999999999872 2334567789999999999999998854110 0 00 001 10
Q ss_pred eecCCCHhHHHHHHHHHHHHHHHHHhhhc
Q psy13819 189 LLRQNHPVYQRYVTRWFQELFPRIQKYLY 217 (323)
Q Consensus 189 ~~R~~~~~f~~~~~~~~~~l~~~i~~~~~ 217 (323)
--..+|.+..++.+.-..|...|.+.-|
T Consensus 211 -d~~~~~d~Ia~AaRiaaELGADIVKv~y 238 (348)
T PRK09250 211 -DYHTAADLTGQANHLAATIGADIIKQKL 238 (348)
T ss_pred -cccccHHHHHHHHHHHHHHcCCEEEecC
Confidence 0023477888888877777766655434
No 209
>cd06569 GH20_Sm-chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=43.05 E-value=94 Score=31.64 Aligned_cols=64 Identities=19% Similarity=0.242 Sum_probs=46.7
Q ss_pred eeecCCCCChhcHHHHHHHHHHhCCCEEEEEE----eCCCcc---c--------------------------------CC
Q psy13819 95 GSFHYFRAPPGRWCWIMRAMRAAGLNALSTYV----EWRSHE---V--------------------------------HP 135 (323)
Q Consensus 95 g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv----~W~~hE---P--------------------------------~~ 135 (323)
.+-|++ |.+..++.|+.|....+|.+..++ .|..-- | ..
T Consensus 14 vaR~f~--~~~~ik~~Id~ma~~K~N~lHlHLtDdqgwriei~~~P~Lt~~ga~r~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (445)
T cd06569 14 VARNFH--SKETVLKLLDQMAAYKLNKLHLHLTDDEGWRLEIPGLPELTEVGAKRCHDLSETTCLLPQLGSGPDTNNSGS 91 (445)
T ss_pred ccCCCC--CHHHHHHHHHHHHHhCCceEEEEeecCCCcceeccCCchhhhcccccccccccccccccccccCcccCcccC
Confidence 445553 789999999999999999999987 353210 1 01
Q ss_pred CeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 136 GHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 136 G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
|.|. .+|+..+++.|+++||.||..
T Consensus 92 g~YT---~~di~eiv~yA~~rgI~VIPE 116 (445)
T cd06569 92 GYYS---RADYIEILKYAKARHIEVIPE 116 (445)
T ss_pred CccC---HHHHHHHHHHHHHcCCEEEEc
Confidence 1122 357888999999999999975
No 210
>cd00958 DhnA Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies.
Probab=42.89 E-value=60 Score=29.39 Aligned_cols=63 Identities=11% Similarity=-0.055 Sum_probs=45.2
Q ss_pred eecCC-CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 96 SFHYF-RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 96 ~~Hy~-r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
...++ +.+.+.....++.+.++|.+.|.+.+...... .-.-.+++.++.+.|++.|+.+|+..
T Consensus 66 ~~i~~p~~~~~~~~~~v~~a~~~Ga~~v~~~~~~~~~~------~~~~~~~i~~v~~~~~~~g~~~iie~ 129 (235)
T cd00958 66 TSLSPKDDNDKVLVASVEDAVRLGADAVGVTVYVGSEE------EREMLEELARVAAEAHKYGLPLIAWM 129 (235)
T ss_pred CCCCCCCCCchhhhcCHHHHHHCCCCEEEEEEecCCch------HHHHHHHHHHHHHHHHHcCCCEEEEE
Confidence 33343 77777777789999999999997765544221 11223468888999999999999865
No 211
>PF07755 DUF1611: Protein of unknown function (DUF1611); InterPro: IPR011669 This entry contains a number of hypothetical bacterial and archaeal proteins. The region is approximately 350 residues long. A member of this family (Q6M063 from SWISSPROT) is thought to associate with another subunit to form an H+-transporting ATPase, but no evidence has been found to support this.; PDB: 2G0T_A 2OBN_A.
Probab=42.53 E-value=23 Score=34.32 Aligned_cols=62 Identities=21% Similarity=0.237 Sum_probs=40.5
Q ss_pred eEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-EccCC
Q psy13819 90 FQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRPGP 166 (323)
Q Consensus 90 ~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~GP 166 (323)
.+++|-+..--+.| +.|++.+..+-++|+|.|.- +|+. +.|...|.++|+++|..++ +|-.|
T Consensus 35 ~liiGiA~~GG~lp-~~w~~~i~~Ai~~Gl~IvsG-----LH~~---------L~ddpel~~~A~~~g~~i~DvR~p~ 97 (301)
T PF07755_consen 35 TLIIGIAPAGGRLP-PSWRPVILEAIEAGLDIVSG-----LHDF---------LSDDPELAAAAKKNGVRIIDVRKPP 97 (301)
T ss_dssp EEEE---STTHCCH-CCHHHHHHHHHHTT-EEEE------SSS----------HCCHHHHHCCHHCCT--EEETTS--
T ss_pred EEEEecCcCCCcCC-HHHHHHHHHHHHcCCCEEec-----Chhh---------hccCHHHHHHHHHcCCeEeeccCCC
Confidence 34444444444444 89999999999999999998 6663 4567889999999999988 56533
No 212
>COG1448 TyrB Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism]
Probab=42.24 E-value=27 Score=34.94 Aligned_cols=61 Identities=16% Similarity=0.269 Sum_probs=46.1
Q ss_pred eeEEeeeeecCC---CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc------ccchHHHHHHHHHHCCCe
Q psy13819 89 PFQFVSGSFHYF---RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD------GHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 89 ~~~i~~g~~Hy~---r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~------g~~dl~~fl~~a~e~GL~ 159 (323)
-+.|++||.|.| +.++++|.+.++.+|+-|+=-+ ||+. |.+.=..-|+++.+.+..
T Consensus 173 ~vvLLH~CcHNPTG~D~t~~qW~~l~~~~~~r~lip~---------------~D~AYQGF~~GleeDa~~lR~~a~~~~~ 237 (396)
T COG1448 173 SVVLLHGCCHNPTGIDPTEEQWQELADLIKERGLIPF---------------FDIAYQGFADGLEEDAYALRLFAEVGPE 237 (396)
T ss_pred CEEEEecCCCCCCCCCCCHHHHHHHHHHHHHcCCeee---------------eehhhhhhccchHHHHHHHHHHHHhCCc
Confidence 477899999987 6889999999999998876332 3332 333345668999999998
Q ss_pred EEEcc
Q psy13819 160 ILLRP 164 (323)
Q Consensus 160 vilr~ 164 (323)
+++.-
T Consensus 238 ~lva~ 242 (396)
T COG1448 238 LLVAS 242 (396)
T ss_pred EEEEe
Confidence 87765
No 213
>PF03932 CutC: CutC family; InterPro: IPR005627 Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are between 200 and 300 amino acids in length and are found in both eukaryotes and bacteria.; GO: 0005507 copper ion binding, 0055070 copper ion homeostasis; PDB: 2BDQ_A 3IWP_I 1X8C_B 1X7I_A 1TWD_B.
Probab=42.15 E-value=21 Score=32.55 Aligned_cols=59 Identities=15% Similarity=0.192 Sum_probs=36.8
Q ss_pred eeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 94 SGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 94 ~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
+|.+.|....-+.+.+|++.+|++|.+.+-+-+ =.+.|..|-+. +.++++.|+ |+.+..
T Consensus 61 ~gdF~Ys~~E~~~M~~dI~~~~~~GadG~VfG~-----L~~dg~iD~~~---~~~Li~~a~--~~~~tF 119 (201)
T PF03932_consen 61 GGDFVYSDEEIEIMKEDIRMLRELGADGFVFGA-----LTEDGEIDEEA---LEELIEAAG--GMPVTF 119 (201)
T ss_dssp SS-S---HHHHHHHHHHHHHHHHTT-SEEEE-------BETTSSB-HHH---HHHHHHHHT--TSEEEE
T ss_pred CCCccCCHHHHHHHHHHHHHHHHcCCCeeEEEe-----ECCCCCcCHHH---HHHHHHhcC--CCeEEE
Confidence 467777666667889999999999999998832 22356666655 666677775 555544
No 214
>PRK13396 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=41.86 E-value=1.3e+02 Score=29.83 Aligned_cols=74 Identities=18% Similarity=0.180 Sum_probs=51.6
Q ss_pred EEeC-CeeeEEeeeeecCCC-C-ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc--chHHHHHHHHHHCC
Q psy13819 83 FRLN-EDPFQFVSGSFHYFR-A-PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH--RDIEHFLQLAVEED 157 (323)
Q Consensus 83 ~~~d-Gk~~~i~~g~~Hy~r-~-p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~--~dl~~fl~~a~e~G 157 (323)
..+. ++|+.++.| +. + .++.-.+.-+.+|++|.+.+|-..+= |+---|.|.|. .-|.-+-+.+++.|
T Consensus 94 v~iGg~~~l~vIAG----PCsIEs~eq~l~~A~~lk~~g~~~~r~g~~k----pRtsp~sf~G~g~~gl~~L~~~~~e~G 165 (352)
T PRK13396 94 VPFGENHPVVVVAG----PCSVENEEMIVETAKRVKAAGAKFLRGGAYK----PRTSPYAFQGHGESALELLAAAREATG 165 (352)
T ss_pred eEecCCCeEEEEEe----CCcccCHHHHHHHHHHHHHcCCCEEEeeeec----CCCCCcccCCchHHHHHHHHHHHHHcC
Confidence 4444 466778887 22 2 25677788889999999999975443 44444677653 45666667788999
Q ss_pred CeEEEcc
Q psy13819 158 LYILLRP 164 (323)
Q Consensus 158 L~vilr~ 164 (323)
|.++-.+
T Consensus 166 l~~~tev 172 (352)
T PRK13396 166 LGIITEV 172 (352)
T ss_pred CcEEEee
Confidence 9999776
No 215
>COG1979 Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family [Energy production and conversion]
Probab=41.84 E-value=1.1e+02 Score=30.41 Aligned_cols=62 Identities=19% Similarity=0.356 Sum_probs=45.8
Q ss_pred EEEEecCCeEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHH
Q psy13819 74 FAIDLAGDTFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLA 153 (323)
Q Consensus 74 ~~v~~~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a 153 (323)
.-|.|.+.++..|| ..++.+..+| |+..+++ +.+||.|- ..-+.+-+++|
T Consensus 32 VLi~YGGGSIKrnG------------------vydqV~~~Lk--g~~~~E~----~GVEPNP~------~~Tv~kaV~i~ 81 (384)
T COG1979 32 VLIVYGGGSIKKNG------------------VYDQVVEALK--GIEVIEF----GGVEPNPR------LETLMKAVEIC 81 (384)
T ss_pred EEEEecCccccccc------------------hHHHHHHHhc--CceEEEe----cCCCCCch------HHHHHHHHHHH
Confidence 34556666666654 4466677777 8888888 68999874 23377889999
Q ss_pred HHCCCeEEEccC
Q psy13819 154 VEEDLYILLRPG 165 (323)
Q Consensus 154 ~e~GL~vilr~G 165 (323)
+|+++-.+|..|
T Consensus 82 kee~idflLAVG 93 (384)
T COG1979 82 KEENIDFLLAVG 93 (384)
T ss_pred HHcCceEEEEec
Confidence 999999998875
No 216
>PRK12595 bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed
Probab=41.28 E-value=1.2e+02 Score=30.01 Aligned_cols=74 Identities=18% Similarity=0.203 Sum_probs=50.5
Q ss_pred EEeCCeeeEEeeee--ecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc--cchHHHHHHHHHHCCC
Q psy13819 83 FRLNEDPFQFVSGS--FHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG--HRDIEHFLQLAVEEDL 158 (323)
Q Consensus 83 ~~~dGk~~~i~~g~--~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g--~~dl~~fl~~a~e~GL 158 (323)
+.+.|....++.|. ++ ..+.-.+..+.+|+.|++.++-..+= |+.--|.|.| .+.+..+-+.|++.||
T Consensus 112 ~~~g~~~~~~iaGpc~iE----~~~~~~~~A~~lk~~g~~~~r~~~~k----pRtsp~~f~g~~~e~l~~L~~~~~~~Gl 183 (360)
T PRK12595 112 EVIGDGNQSFIFGPCSVE----SYEQVEAVAKALKAKGLKLLRGGAFK----PRTSPYDFQGLGVEGLKILKQVADEYGL 183 (360)
T ss_pred EEecCCCeeeEEeccccc----CHHHHHHHHHHHHHcCCcEEEccccC----CCCCCccccCCCHHHHHHHHHHHHHcCC
Confidence 45554433445453 11 25667778888999999999974332 4444456665 4678888999999999
Q ss_pred eEEEcc
Q psy13819 159 YILLRP 164 (323)
Q Consensus 159 ~vilr~ 164 (323)
.++..+
T Consensus 184 ~~~t~v 189 (360)
T PRK12595 184 AVISEI 189 (360)
T ss_pred CEEEee
Confidence 999877
No 217
>PRK09057 coproporphyrinogen III oxidase; Provisional
Probab=41.26 E-value=82 Score=31.05 Aligned_cols=52 Identities=23% Similarity=0.172 Sum_probs=33.9
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccc----cchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDG----HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g----~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.|+.||++|+|.|++.|- .- +.+-..-+ ..++.+.++.|++.+..|-+..
T Consensus 104 ~e~L~~l~~~GvnrislGvQ-----S~~d~vL~~l~R~~~~~~~~~ai~~~~~~~~~v~~dl 160 (380)
T PRK09057 104 AGRFRGYRAAGVNRVSLGVQ-----ALNDADLRFLGRLHSVAEALAAIDLAREIFPRVSFDL 160 (380)
T ss_pred HHHHHHHHHcCCCEEEEecc-----cCCHHHHHHcCCCCCHHHHHHHHHHHHHhCccEEEEe
Confidence 38899999999999999443 11 11111111 2456777888888887766654
No 218
>PF03102 NeuB: NeuB family; InterPro: IPR013132 NeuB is the prokaryotic N-acetylneuraminic acid synthase (Neu5Ac). It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only been observed in prokaryotes; eukaryotes synthesise the 9-phosphate form, Neu5Ac-9-P, and utilise ManNAc-6-P instead of ManNAc. Such eukaryotic enzymes are not present in this family []. This family also contains SpsE spore coat polysaccharide biosynthesis proteins.; GO: 0016051 carbohydrate biosynthetic process; PDB: 3G8R_B 1XUU_A 1XUZ_A 3CM4_A 2ZDR_A 1VLI_A 2WQP_A.
Probab=41.21 E-value=46 Score=31.10 Aligned_cols=65 Identities=14% Similarity=0.153 Sum_probs=35.8
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC----CCeecc-cccchHHHHHHHHHHCCCeEEEccC
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH----PGHYHY-DGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~----~G~fdf-~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
..|.+.|++..+..++.|+..+.+.+.-...+-- .-.|-- |+.-.-..+|+.+++.|..|||-+|
T Consensus 52 el~~e~~~~L~~~~~~~gi~f~stpfd~~s~d~l~~~~~~~~KIaS~dl~n~~lL~~~A~tgkPvIlSTG 121 (241)
T PF03102_consen 52 ELSEEQHKELFEYCKELGIDFFSTPFDEESVDFLEELGVPAYKIASGDLTNLPLLEYIAKTGKPVILSTG 121 (241)
T ss_dssp SS-HHHHHHHHHHHHHTT-EEEEEE-SHHHHHHHHHHT-SEEEE-GGGTT-HHHHHHHHTT-S-EEEE-T
T ss_pred cCCHHHHHHHHHHHHHcCCEEEECCCCHHHHHHHHHcCCCEEEeccccccCHHHHHHHHHhCCcEEEECC
Confidence 4788999999999999999999995332211110 011110 1111223567888888888888775
No 219
>PF02228 Gag_p19: Major core protein p19; InterPro: IPR003139 Retroviral matrix proteins (or major core proteins) are components of envelope-associated capsids, which line the inner surface of virus envelopes and are associated with viral membranes []. Matrix proteins are produced as part of Gag precursor polyproteins. During viral maturation, the Gag polyprotein is cleaved into major structural proteins by the viral protease, yielding the matrix (MA), capsid (CA), nucleocapsid (NC), and some smaller peptides. Gag-derived proteins govern the entire assembly and release of the virus particles, with matrix proteins playing key roles in Gag stability, capsid assembly, transport and budding. Although matrix proteins from different retroviruses appear to perform similar functions and can have similar structural folds, their primary sequences can be very different. This entry represents matrix proteins from delta-retroviruses such as Human T-lymphotropic virus 1 and Human T-cell leukemia virus 2 (HTLV-2), both members of the human oncovirus subclass of retroviruses [, ].; GO: 0005198 structural molecule activity, 0019013 viral nucleocapsid; PDB: 1JVR_A.
Probab=41.08 E-value=11 Score=29.26 Aligned_cols=38 Identities=18% Similarity=0.484 Sum_probs=27.9
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDL 158 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL 158 (323)
...|-.-++.+- +.||.|-.|||.. |.+||++|.|--+
T Consensus 21 ~hhWLNflQaAy--------------RL~PgPS~~DF~q---Lr~flk~alkTpv 58 (92)
T PF02228_consen 21 THHWLNFLQAAY--------------RLQPGPSSFDFHQ---LRNFLKLALKTPV 58 (92)
T ss_dssp HHHHHHHHHHHH--------------HSS---STTTHHH---HHHHHHHHHT-TT
T ss_pred HHHHHHHHHHHH--------------hcCCCCCcccHHH---HHHHHHHHHcCCe
Confidence 467888888776 6799999999987 8899999977543
No 220
>PRK13125 trpA tryptophan synthase subunit alpha; Provisional
Probab=40.82 E-value=57 Score=30.08 Aligned_cols=47 Identities=13% Similarity=0.173 Sum_probs=33.6
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+-++.++++|.+.|-+. --| + +..+++.++++.|+++||.+++...
T Consensus 91 ~~~i~~~~~~Gadgvii~-----dlp----~--e~~~~~~~~~~~~~~~Gl~~~~~v~ 137 (244)
T PRK13125 91 DNFLNMARDVGADGVLFP-----DLL----I--DYPDDLEKYVEIIKNKGLKPVFFTS 137 (244)
T ss_pred HHHHHHHHHcCCCEEEEC-----CCC----C--CcHHHHHHHHHHHHHcCCCEEEEEC
Confidence 344888999999999992 001 0 1113578899999999999887663
No 221
>PRK12399 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=40.73 E-value=47 Score=32.49 Aligned_cols=62 Identities=19% Similarity=0.239 Sum_probs=45.6
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
|.|.-.=..-.+++|++|.++|-+-+.|+--||.+ -+-.-..-++++-+.|+++||..+|.+
T Consensus 101 Rl~~ll~~wS~~rike~GadavK~Llyy~pD~~~~--in~~k~a~vervg~eC~a~dipf~lE~ 162 (324)
T PRK12399 101 RLPDCLDDWSAKRIKEEGADAVKFLLYYDVDEPDE--INEQKKAYIERIGSECVAEDIPFFLEI 162 (324)
T ss_pred CcccccchhhHHHHHHhCCCeEEEEEEECCCCCHH--HHHHHHHHHHHHHHHHHHCCCCeEEEE
Confidence 45533333457789999999999999999877762 111222348888999999999999987
No 222
>PF07071 DUF1341: Protein of unknown function (DUF1341); InterPro: IPR010763 Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.; PDB: 3NZR_D 3LM7_A 3M0Z_B 3M6Y_A 3N73_A 3MUX_A.
Probab=40.67 E-value=64 Score=29.62 Aligned_cols=72 Identities=10% Similarity=0.236 Sum_probs=41.0
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc---chHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH---RDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~---~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
-+.-++++|+||.+.|.++ .-.|. +.|....+.|.++|+++ .| +||+-
T Consensus 137 vetAiaml~dmG~~SiKff-------------Pm~Gl~~leE~~avAkA~a~~g~~l--EP---------TGGId----- 187 (218)
T PF07071_consen 137 VETAIAMLKDMGGSSIKFF-------------PMGGLKHLEELKAVAKACARNGFTL--EP---------TGGID----- 187 (218)
T ss_dssp HHHHHHHHHHTT--EEEE----------------TTTTTHHHHHHHHHHHHHCT-EE--EE---------BSS-------
T ss_pred HHHHHHHHHHcCCCeeeEe-------------ecCCcccHHHHHHHHHHHHHcCcee--CC---------cCCcC-----
Confidence 4678999999999999994 22343 45667778899999998 77 57752
Q ss_pred cCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhc
Q psy13819 184 VAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLY 217 (323)
Q Consensus 184 ~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~ 217 (323)
-..|.+-++-.+++=++++-|+.|
T Consensus 188 ----------l~N~~~I~~i~l~aGv~~viPHiY 211 (218)
T PF07071_consen 188 ----------LDNFEEIVKICLDAGVEKVIPHIY 211 (218)
T ss_dssp ----------TTTHHHHHHHHHHTT-S-B--EE-
T ss_pred ----------HHHHHHHHHHHHHcCCCeeccchh
Confidence 123444455555555566555555
No 223
>COG1891 Uncharacterized protein conserved in archaea [Function unknown]
Probab=40.62 E-value=12 Score=33.67 Aligned_cols=48 Identities=19% Similarity=0.279 Sum_probs=34.7
Q ss_pred HHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 113 AMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 113 ~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
-+.++|++.+-+.- .+.--.--|||-...+|..|+++|+++||.+-|.
T Consensus 139 vaa~ag~DvaMvDT---aiKDGkslFdfm~~e~l~eFvd~Ah~hGL~~AlA 186 (235)
T COG1891 139 VAAEAGADVAMVDT---AIKDGKSLFDFMDEEELEEFVDLAHEHGLEVALA 186 (235)
T ss_pred HHHhcCCCEEEEec---ccccchhHHhhhcHHHHHHHHHHHHHcchHHHhc
Confidence 45678888776521 1222233688888889999999999999987663
No 224
>TIGR03128 RuMP_HxlA 3-hexulose-6-phosphate synthase. at the cost of also yielding formaldehyde. These latter species tend usually have a formaldehyde-activating enzyme to attach formaldehyde to the C1 carrier tetrahydromethanopterin. In these species, the enzyme is viewed as a lyase rather than a synthase and is called D-arabino 3-hexulose 6-phosphate formaldehyde lyase. Note that there is some overlap in specificity with the Escherichia coli enzyme 3-keto-L-gulonate 6-phosphate decarboxylase.
Probab=40.25 E-value=71 Score=28.21 Aligned_cols=41 Identities=12% Similarity=0.024 Sum_probs=33.0
Q ss_pred HHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 110 IMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.++.+.++|.+.|.+..... ...+.++++.|+++|+.+++.
T Consensus 68 ~~~~~~~~Gad~i~vh~~~~-------------~~~~~~~i~~~~~~g~~~~~~ 108 (206)
T TIGR03128 68 EAEQAFAAGADIVTVLGVAD-------------DATIKGAVKAAKKHGKEVQVD 108 (206)
T ss_pred HHHHHHHcCCCEEEEeccCC-------------HHHHHHHHHHHHHcCCEEEEE
Confidence 68889999999999854421 134678899999999999986
No 225
>PRK08195 4-hyroxy-2-oxovalerate/4-hydroxy-2-oxopentanoic acid aldolase,; Validated
Probab=40.15 E-value=67 Score=31.40 Aligned_cols=45 Identities=13% Similarity=0.005 Sum_probs=36.7
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++++++.+.|+..||+.+..+.. +.+.+.++.|++.|+.|.+.+
T Consensus 91 ~~dl~~a~~~gvd~iri~~~~~e~------------~~~~~~i~~ak~~G~~v~~~l 135 (337)
T PRK08195 91 VDDLKMAYDAGVRVVRVATHCTEA------------DVSEQHIGLARELGMDTVGFL 135 (337)
T ss_pred HHHHHHHHHcCCCEEEEEEecchH------------HHHHHHHHHHHHCCCeEEEEE
Confidence 478999999999999998755442 347888999999999988755
No 226
>cd00003 PNPsynthase Pyridoxine 5'-phosphate (PNP) synthase domain; pyridoxal 5'-phosphate is the active form of vitamin B6 that acts as an essential, ubiquitous coenzyme in amino acid metabolism. In bacteria, formation of pyridoxine 5'-phosphate is a step in the biosynthesis of vitamin B6. PNP synthase, a homooctameric enzyme, catalyzes the final step in PNP biosynthesis, the condensation of 1-amino-acetone 3-phosphate and 1-deoxy-D-xylulose 5-phosphate. PNP synthase adopts a TIM barrel topology, intersubunit contacts are mediated by three ''extra'' helices, generating a tetramer of symmetric dimers with shared active sites; the open state has been proposed to accept substrates and to release products, while most of the catalytic events are likely to occur in the closed state; a hydrophilic channel running through the center of the barrel was identified as the essential structural feature that enables PNP synthase to release water molecules produced during the reaction from the closed,
Probab=40.01 E-value=58 Score=30.43 Aligned_cols=49 Identities=18% Similarity=0.251 Sum_probs=38.3
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-EccCCcccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRPGPFICG 170 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~GPyi~a 170 (323)
.+..++.++++|+.|+- |++ .++|. ...++.|++.|-..| |-+|||..+
T Consensus 109 ~~~l~~~i~~l~~~gI~-VSL-----FiDPd------------~~qi~~A~~~GAd~VELhTG~Ya~a 158 (234)
T cd00003 109 AEKLKPIIERLKDAGIR-VSL-----FIDPD------------PEQIEAAKEVGADRVELHTGPYANA 158 (234)
T ss_pred HHHHHHHHHHHHHCCCE-EEE-----EeCCC------------HHHHHHHHHhCcCEEEEechhhhcC
Confidence 35667889999999984 666 45666 456899999999876 899999765
No 227
>PRK10658 putative alpha-glucosidase; Provisional
Probab=39.93 E-value=98 Score=33.22 Aligned_cols=68 Identities=16% Similarity=0.109 Sum_probs=47.1
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
.+...+.++.+++.|+-+=-+.+-+..+... -+.|.|+ ..-|.+.+++.+++.|+++++..=|+|..+
T Consensus 282 e~~v~~~~~~~r~~~iP~d~i~lD~~w~~~~~~~~f~wd~~~FPdp~~mi~~L~~~G~k~~~~i~P~i~~~ 352 (665)
T PRK10658 282 EATVNSFIDGMAERDLPLHVFHFDCFWMKEFQWCDFEWDPRTFPDPEGMLKRLKAKGLKICVWINPYIAQK 352 (665)
T ss_pred HHHHHHHHHHHHHcCCCceEEEEchhhhcCCceeeeEEChhhCCCHHHHHHHHHHCCCEEEEeccCCcCCC
Confidence 3445677899999998765554443333321 2344443 234789999999999999999998998753
No 228
>cd07944 DRE_TIM_HOA_like 4-hydroxy-2-oxovalerate aldolase-like, N-terminal catalytic TIM barrel domain. This family of bacterial enzymes is sequence-similar to 4-hydroxy-2-oxovalerate aldolase (HOA) but its exact function is unknown. This family includes the Bacteroides vulgatus Bvu_2661 protein and belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues t
Probab=39.89 E-value=54 Score=30.81 Aligned_cols=45 Identities=2% Similarity=0.042 Sum_probs=36.4
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++++.+.+.|+..||+.+..+ ..+.+.+.++.|+++|+.|.+.+
T Consensus 85 ~~~l~~a~~~gv~~iri~~~~~------------~~~~~~~~i~~ak~~G~~v~~~~ 129 (266)
T cd07944 85 IDLLEPASGSVVDMIRVAFHKH------------EFDEALPLIKAIKEKGYEVFFNL 129 (266)
T ss_pred HHHHHHHhcCCcCEEEEecccc------------cHHHHHHHHHHHHHCCCeEEEEE
Confidence 5688899999999999977544 24557788999999999988765
No 229
>cd06416 GH25_Lys1-like Lys-1 is a lysozyme encoded by the Caenorhabditis elegans lys-1 gene. This gene is one of a several lysozyme genes upregulated upon infection by the Gram-negative bacterial pathogen Serratia marcescens. Lys-1 contains a glycosyl hydrolase family 25 (GH25) catalytic domain. This family also includes Lys-5 from Caenorhabditis elegans.
Probab=39.44 E-value=1.2e+02 Score=26.82 Aligned_cols=84 Identities=13% Similarity=0.260 Sum_probs=52.1
Q ss_pred eeecCCCCC-----hhcHHHHHHHHHHhCCC--EEEEEEeCCCcccCCCeec--ccc-cchHHHHHHHHHHCCCeEEEcc
Q psy13819 95 GSFHYFRAP-----PGRWCWIMRAMRAAGLN--ALSTYVEWRSHEVHPGHYH--YDG-HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 95 g~~Hy~r~p-----~~~W~~~l~~mk~~G~N--~Vr~yv~W~~hEP~~G~fd--f~g-~~dl~~fl~~a~e~GL~vilr~ 164 (323)
|..||++.. .++.+.-++.++..+.. .|=+. +|..++.+. .+. .+.+.+|++..+++|.++++-.
T Consensus 56 G~Yhf~~~~~~~~~~~Qa~~f~~~~~~~~~~~~~i~lD-----iE~~~~~~~~~~~~~~~~~~~f~~~~~~~G~~~~iYt 130 (196)
T cd06416 56 DVYFFPCINCCGSAAGQVQTFLQYLKANGIKYGTVWID-----IEQNPCQWSSDVASNCQFLQELVSAAKALGLKVGIYS 130 (196)
T ss_pred ceEEEecCCCCCCHHHHHHHHHHHHHhCCCceeEEEEE-----EecCCCCCcCCHHHHHHHHHHHHHHHHHhCCeEEEEc
Confidence 788987543 55777888888886543 33233 332233322 111 2457889999999999999988
Q ss_pred CCccc----cccC---CCCCcccccc
Q psy13819 165 GPFIC----GKRD---FGGFPPWLLK 183 (323)
Q Consensus 165 GPyi~----aEw~---~gg~P~Wl~~ 183 (323)
+++-. +... ....|.|+..
T Consensus 131 ~~~~w~~~~~~~~~~~~~~ypLWiA~ 156 (196)
T cd06416 131 SQYDWSQIFGSSYTCNFSSLPLWYAH 156 (196)
T ss_pred CcchhccccCCCcCCCcCCCceEecC
Confidence 77422 1111 3467889876
No 230
>TIGR01698 PUNP purine nucleotide phosphorylase. methylthioadenosine.
Probab=39.25 E-value=52 Score=30.70 Aligned_cols=41 Identities=22% Similarity=0.382 Sum_probs=32.9
Q ss_pred EeCCeeeEEeeeeecCCC-CChhcHHHHHHHHHHhCCCEEEE
Q psy13819 84 RLNEDPFQFVSGSFHYFR-APPGRWCWIMRAMRAAGLNALST 124 (323)
Q Consensus 84 ~~dGk~~~i~~g~~Hy~r-~p~~~W~~~l~~mk~~G~N~Vr~ 124 (323)
.+.|+++.++.|..|+.. .+..+-+--++.||++|+..+=.
T Consensus 47 ~l~g~~V~~l~Gr~H~yeg~~~~~v~~~i~al~~lGv~~ii~ 88 (237)
T TIGR01698 47 RIGDGPVLVLGGRTHAYEGGDARAVVHPVRTARATGAETLIL 88 (237)
T ss_pred EECCEEEEEEcCCCcccCCCcHHHhHHHHHHHHHcCCCEEEE
Confidence 578999999999999643 33445578899999999988765
No 231
>KOG0471|consensus
Probab=39.14 E-value=48 Score=34.61 Aligned_cols=64 Identities=9% Similarity=-0.005 Sum_probs=46.4
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCee--c-------ccccchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHY--H-------YDGHRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~f--d-------f~g~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
..+.|+-+|++|+++|=+.-+=...++.-|-+ | |.-.+|+..+++.+++.||++|++.-|--++
T Consensus 42 I~~kldyi~~lG~taiWisP~~~s~~~~~GY~~~d~~~l~p~fGt~edf~~Li~~~h~~gi~ii~D~viNh~~ 114 (545)
T KOG0471|consen 42 ITSKLDYIKELGFTAIWLSPFTKSSKPDFGYDASDLEQLRPRFGTEEDFKELILAMHKLGIKIIADLVINHRS 114 (545)
T ss_pred chhhhhHHHhcCCceEEeCCCcCCCHHHhccCccchhhhcccccHHHHHHHHHHHHhhcceEEEEeeccccCC
Confidence 36789999999999999954444555533321 2 2234689999999999999999987554443
No 232
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=38.99 E-value=66 Score=30.70 Aligned_cols=62 Identities=15% Similarity=0.200 Sum_probs=44.7
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc---cchHHHHHHHHHHCCCeEEEcc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG---HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g---~~dl~~fl~~a~e~GL~vilr~ 164 (323)
...+..++-++-+.++|+..|-+---|...+ ....+|++. ..|+.++++-|++.|+.|+|..
T Consensus 29 ~~t~~~k~yIDfAa~~G~eYvlvD~GW~~~~-~~~~~d~~~~~~~~dl~elv~Ya~~KgVgi~lw~ 93 (273)
T PF10566_consen 29 ATTETQKRYIDFAAEMGIEYVLVDAGWYGWE-KDDDFDFTKPIPDFDLPELVDYAKEKGVGIWLWY 93 (273)
T ss_dssp SSHHHHHHHHHHHHHTT-SEEEEBTTCCGS---TTT--TT-B-TT--HHHHHHHHHHTT-EEEEEE
T ss_pred CCHHHHHHHHHHHHHcCCCEEEecccccccc-ccccccccccCCccCHHHHHHHHHHcCCCEEEEE
Confidence 3467889999999999999999988888722 235677764 4689999999999999999865
No 233
>PRK10397 lipoprotein; Provisional
Probab=38.51 E-value=41 Score=28.58 Aligned_cols=19 Identities=42% Similarity=0.872 Sum_probs=15.1
Q ss_pred cccccCCCCCeeEEEeccc
Q psy13819 16 WGRYWSEVGPQYSLFVPEE 34 (323)
Q Consensus 16 lGryw~~~gpq~~lyvP~~ 34 (323)
+--||++.|||..|.-|..
T Consensus 34 l~GyWqs~gpq~~lvSp~A 52 (137)
T PRK10397 34 LAGYWQTKGPQRALVSPEA 52 (137)
T ss_pred ccceeeccCCcccccCccc
Confidence 3458999999998887765
No 234
>PF02606 LpxK: Tetraacyldisaccharide-1-P 4'-kinase; InterPro: IPR003758 Tetraacyldisaccharide 4'-kinase phosphorylates the 4'-position of a tetraacyldisaccharide 1-phosphate precursor (DS-1-P) of lipid A, but the enzyme has not yet been purified because of instability []. This enzyme is involved in the synthesis of lipid A portion of the bacterial lipopolysaccharide layer (LPS).; GO: 0005524 ATP binding, 0009029 tetraacyldisaccharide 4'-kinase activity, 0009245 lipid A biosynthetic process
Probab=38.41 E-value=91 Score=30.37 Aligned_cols=63 Identities=22% Similarity=0.319 Sum_probs=48.2
Q ss_pred EeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 84 RLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 84 ~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.+.|+++..++|--++ +.-++.++++|++.+.+..+ |+.+.|+. .|++.+.+.|++.| .||.
T Consensus 224 ~l~~~~v~a~sGIg~P--------~~F~~~L~~~G~~~~~~~~f-------~DHh~yt~-~dl~~l~~~a~~~~--~ilt 285 (326)
T PF02606_consen 224 PLKGKPVLAFSGIGNP--------ERFFDTLESLGIEVVGTLAF-------PDHHRYTE-QDLEKLEAEAKAAG--IILT 285 (326)
T ss_pred hccCCeeEEEEEcCCh--------HHHHHHHHHcCCeEEEeeEC-------CCCCCCCH-HHHHHHHHhhcccc--eEEe
Confidence 4789999999998887 45577888899999977433 45566664 56889999999999 5554
Q ss_pred c
Q psy13819 164 P 164 (323)
Q Consensus 164 ~ 164 (323)
+
T Consensus 286 T 286 (326)
T PF02606_consen 286 T 286 (326)
T ss_pred c
Confidence 3
No 235
>COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]
Probab=38.35 E-value=1.7e+02 Score=29.27 Aligned_cols=57 Identities=23% Similarity=0.359 Sum_probs=38.5
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCC--eEEEccCCccccccCCCCCccc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDL--YILLRPGPFICGKRDFGGFPPW 180 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL--~vilr~GPyi~aEw~~gg~P~W 180 (323)
..+.-.+-++.+=+.|+|.|.| .|..|--+ -+.+|..|-+.|- +|.|.+ -+|.|
T Consensus 32 d~~~~~~~i~~aie~GiNyidT--A~~Yh~g~-----------sE~~lgkaL~~~~Rekv~LaT-----------Klp~~ 87 (391)
T COG1453 32 DEENANETIDYAIEHGINYIDT--AWPYHGGE-----------SEEFLGKALKDGYREKVKLAT-----------KLPSW 87 (391)
T ss_pred cHHHHHHHHHHHHHcCCceEee--cccccCCC-----------chHHHHHHhhhcccceEEEEe-----------ecCCc
Confidence 4567788888888999999999 45555443 3445665555444 466665 26778
Q ss_pred ccc
Q psy13819 181 LLK 183 (323)
Q Consensus 181 l~~ 183 (323)
..+
T Consensus 88 ~~~ 90 (391)
T COG1453 88 PVK 90 (391)
T ss_pred ccc
Confidence 766
No 236
>TIGR01211 ELP3 histone acetyltransferase, ELP3 family. The Saccharomyces cerevisiae member YPL086C has been characterized in vitro as an N-terminal acetyltransferase for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases.
Probab=38.18 E-value=63 Score=33.65 Aligned_cols=63 Identities=11% Similarity=-0.024 Sum_probs=39.7
Q ss_pred eeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC-CCeeccc----ccchHHHHHHHHHHCCCeEEEcc
Q psy13819 93 VSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYD----GHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 93 ~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~----g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++-+.++-.++ ++.|+.|+++|+|.|++.|- .- +.+-..- ..+++.+.++++++.|+.|.+..
T Consensus 195 itiEtRPD~i~----~e~L~~L~~~G~~rVslGVQ-----S~~d~VL~~inRght~~~v~~Ai~~lr~~G~~v~~~L 262 (522)
T TIGR01211 195 LTIETRPDYCR----EEHIDRMLKLGATRVELGVQ-----TIYNDILERTKRGHTVRDVVEATRLLRDAGLKVVYHI 262 (522)
T ss_pred EEEEEcCCcCC----HHHHHHHHHcCCCEEEEECc-----cCCHHHHHHhCCCCCHHHHHHHHHHHHHcCCeEEEEe
Confidence 33444443333 57799999999999999432 11 1111111 12467788999999999877654
No 237
>TIGR01232 lacD tagatose 1,6-diphosphate aldolase. This family consists of Gram-positive proteins. Tagatose 1,6-diphosphate aldolase is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation.
Probab=38.13 E-value=36 Score=33.32 Aligned_cols=52 Identities=17% Similarity=0.227 Sum_probs=41.7
Q ss_pred HHHHHHHhCCCEEEEEEeCCCcccCC-CeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 110 IMRAMRAAGLNALSTYVEWRSHEVHP-GHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~yv~W~~hEP~~-G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..+++|++|.++|-+-+.|+--++.+ .+-. ..-++++-+.|+++||..+|.+
T Consensus 111 s~~rike~GadavK~Llyy~pD~~~ein~~k---~a~vervg~ec~a~dipf~lE~ 163 (325)
T TIGR01232 111 SAKRLKEQGANAVKFLLYYDVDDAEEINIQK---KAYIERIGSECVAEDIPFFLEV 163 (325)
T ss_pred cHHHHHHhCCCeEEEEEEeCCCCChHHHHHH---HHHHHHHHHHHHHCCCCeEEEE
Confidence 57889999999999999999776442 2222 2348899999999999999987
No 238
>PRK06852 aldolase; Validated
Probab=38.09 E-value=70 Score=31.01 Aligned_cols=48 Identities=8% Similarity=0.059 Sum_probs=36.6
Q ss_pred HHHHHHhC------CCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 111 MRAMRAAG------LNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 111 l~~mk~~G------~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.+-++| ..+|.+.++|. +.+..+-+.++.+..+.|++.||-+|+..
T Consensus 121 VeeAvrlG~~~~~~AdAV~v~v~~G------s~~E~~ml~~l~~v~~ea~~~GlPll~~~ 174 (304)
T PRK06852 121 VEQVVEFKENSGLNILGVGYTIYLG------SEYESEMLSEAAQIIYEAHKHGLIAVLWI 174 (304)
T ss_pred HHHHHhcCCccCCCceEEEEEEecC------CHHHHHHHHHHHHHHHHHHHhCCcEEEEe
Confidence 45555666 67999999987 23335567789999999999999988744
No 239
>PF12891 Glyco_hydro_44: Glycoside hydrolase family 44; InterPro: IPR024745 This is a family of putative bacterial glycoside hydrolases.; PDB: 3IK2_A 3ZQ9_A 2YJQ_B 2YKK_A 2YIH_A 2EEX_A 2EQD_A 2E0P_A 2E4T_A 2EO7_A ....
Probab=38.00 E-value=74 Score=29.82 Aligned_cols=86 Identities=15% Similarity=0.276 Sum_probs=45.8
Q ss_pred chHHHHHHHHHHCCCeEEEcc--CCccccccC----CCCCcccccccCC-----------Cee-----ecCCCH------
Q psy13819 144 RDIEHFLQLAVEEDLYILLRP--GPFICGKRD----FGGFPPWLLKVAP-----------DIL-----LRQNHP------ 195 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~--GPyi~aEw~----~gg~P~Wl~~~~p-----------~~~-----~R~~~~------ 195 (323)
...+.|++...+.|.+.|+.+ --|+-+.-. ..++|.|--...- +.. +-.++|
T Consensus 24 ~~~~~f~~~~~~~ga~~m~T~pm~G~Vakd~~~~~~~~~fp~~~y~~Q~~~d~~~~~~Gng~~~~~~~~~~~~P~~~~~~ 103 (239)
T PF12891_consen 24 DVADTFIDQNLAAGAYSMMTLPMIGYVAKDANSVSESESFPSWRYGPQQWFDPWNPDCGNGVKPDKTALTSNDPDTPDNP 103 (239)
T ss_dssp HHHHHHHHHHHHTT-EEEEEE--SSEEES-BSEGBGGGTSSSTTEEEBS-EETTEEEEE-SEESTSSS--SSSGGSSSSE
T ss_pred HHHHHHHHHhhhcCcceeEeecccceEecCCCCcccccCCChhhcccccccCcCcCCCCccccCCCCCCCCCCCCCCccH
Confidence 347899999999999988764 225555543 5667776433200 011 112233
Q ss_pred hHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCC
Q psy13819 196 VYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGS 234 (323)
Q Consensus 196 ~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~ 234 (323)
.|++.. +..|..+.-.. ...+.|-.|.++||+..
T Consensus 104 ~y~~ew---V~~l~~~~g~a--~~~~gvk~y~lDNEP~L 137 (239)
T PF12891_consen 104 VYMDEW---VNYLVNKYGNA--STNGGVKYYSLDNEPDL 137 (239)
T ss_dssp EEHHHH---HHHHHHHH--T--TSTTS--EEEESS-GGG
T ss_pred hHHHHH---HHHHHHHHhcc--ccCCCceEEEecCchHh
Confidence 455544 44444442221 24678999999999974
No 240
>PRK09739 hypothetical protein; Provisional
Probab=37.93 E-value=1.1e+02 Score=27.14 Aligned_cols=76 Identities=8% Similarity=-0.025 Sum_probs=45.5
Q ss_pred eeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeec---c-----cccchHHHHHHHHHHCCCeE
Q psy13819 89 PFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYH---Y-----DGHRDIEHFLQLAVEEDLYI 160 (323)
Q Consensus 89 ~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fd---f-----~g~~dl~~fl~~a~e~GL~v 160 (323)
++.++.|+.+..+.....-+.-++.+++.|..+..+.+.=....|.-+.-+ + ...+++++.++...+.+..|
T Consensus 5 kiliI~~sp~~~s~s~~l~~~~~~~~~~~g~~v~~~dL~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~AD~iV 84 (199)
T PRK09739 5 RIYLVWAHPRHDSLTAKVAEAIHQRAQERGHQVEELDLYRSGFDPVLTPEDEPDWKNPDKRYSPEVHQLYSELLEHDALV 84 (199)
T ss_pred eEEEEEcCCCCCCcHHHHHHHHHHHHHHCCCEEEEEEhhhhCCCCCCCHHHhhhhcccCCCCCHHHHHHHHHHHhCCEEE
Confidence 456677766554445556677777888888544333221011122222111 1 12467899999999999999
Q ss_pred EEcc
Q psy13819 161 LLRP 164 (323)
Q Consensus 161 ilr~ 164 (323)
|.-|
T Consensus 85 ~~~P 88 (199)
T PRK09739 85 FVFP 88 (199)
T ss_pred EECc
Confidence 9988
No 241
>COG5520 O-Glycosyl hydrolase [Cell envelope biogenesis, outer membrane]
Probab=37.89 E-value=2.5e+02 Score=28.25 Aligned_cols=116 Identities=15% Similarity=0.226 Sum_probs=60.7
Q ss_pred HhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccC---CC--eee
Q psy13819 116 AAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVA---PD--ILL 190 (323)
Q Consensus 116 ~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~---p~--~~~ 190 (323)
++|+..+|+.|-=+..-- .|.+|-.. +.| .+-+-+...|++|+..| -..|+|+.+.. .+ -++
T Consensus 77 ~lg~si~Rv~I~~ndfsl-~g~~d~w~-kel-s~Ak~~in~g~ivfASP----------WspPa~Mktt~~~ngg~~g~L 143 (433)
T COG5520 77 QLGFSILRVPIDSNDFSL-GGSADNWY-KEL-STAKSAINPGMIVFASP----------WSPPASMKTTNNRNGGNAGRL 143 (433)
T ss_pred ccCceEEEEEeccccccc-CCCcchhh-hhc-ccchhhcCCCcEEEecC----------CCCchhhhhccCcCCcccccc
Confidence 478889998653111000 02222110 011 12333667899999988 35899986631 12 123
Q ss_pred c-CCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCC---CCC---hHHHHHHHHHHHHH
Q psy13819 191 R-QNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSD---AEC---DPAHAVWLRDLLRT 252 (323)
Q Consensus 191 R-~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~---~~~---~~~Y~~~l~~~~~~ 252 (323)
| ...++|.++..+|+..+ -.||=|+-+..|.||+..- ..| ..+..+.+++-.+.
T Consensus 144 k~e~Ya~yA~~l~~fv~~m--------~~nGvnlyalSVQNEPd~~p~~d~~~wtpQe~~rF~~qyl~s 204 (433)
T COG5520 144 KYEKYADYADYLNDFVLEM--------KNNGVNLYALSVQNEPDYAPTYDWCWWTPQEELRFMRQYLAS 204 (433)
T ss_pred chhHhHHHHHHHHHHHHHH--------HhCCCceeEEeeccCCcccCCCCcccccHHHHHHHHHHhhhh
Confidence 3 23344444444443332 3468899999999998752 222 34445555554444
No 242
>KOG2233|consensus
Probab=37.70 E-value=1.3e+02 Score=31.46 Aligned_cols=153 Identities=15% Similarity=0.188 Sum_probs=93.7
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEE----EeCCCcccCCC----eec--cccc---------------------------c
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTY----VEWRSHEVHPG----HYH--YDGH---------------------------R 144 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~y----v~W~~hEP~~G----~fd--f~g~---------------------------~ 144 (323)
+.=+.|++.|+-|.-.|+|.+-.. ..|-.+--.=| +-| |+|- .
T Consensus 75 W~w~qWeR~iDWmALnGinl~la~~gQEaIWqkVf~~lgl~~eeldeyftgpAflAW~RMGNl~awgGpLs~aw~~~ql~ 154 (666)
T KOG2233|consen 75 WGWEQWEREIDWMALNGINLVLAPLGQEAIWQKVFMGLGLQREELDEYFTGPAFLAWHRMGNLHAWGGPLSPAWMLNQLL 154 (666)
T ss_pred echHHHHhHhhHHHHcCcceeeccchhHHHHHHHHHHcCCCHHHHHHhcccHHHHHHHHhcCccccCCCCCHHHHHHHHH
Confidence 346899999999999999988653 22321111000 111 2221 1
Q ss_pred hHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCee-------------------ecCCCHhHHHHHHHHH
Q psy13819 145 DIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDIL-------------------LRQNHPVYQRYVTRWF 205 (323)
Q Consensus 145 dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~-------------------~R~~~~~f~~~~~~~~ 205 (323)
=-.++|+...+.||.++|-. -.|-.|.-|.+-.|+.. +...||-|.+-...|+
T Consensus 155 LqkrIidrm~~lGmTpvLPa--------FaG~VP~al~rlfPesnf~rl~rWn~f~s~~~C~l~v~P~dplF~eIgs~Fl 226 (666)
T KOG2233|consen 155 LQKRIIDRMLELGMTPVLPA--------FAGHVPDALERLFPESNFTRLPRWNNFTSRYSCMLLVSPFDPLFQEIGSTFL 226 (666)
T ss_pred HHHHHHHHHHHcCCCccchh--------hccccHHHHHHhCchhceeeccccCCCCcceeeeEEccCCcchHHHHHHHHH
Confidence 12467899999999998854 13446777766556533 2256899999999999
Q ss_pred HHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHH-hcCCce----E----EEeeCCCCc
Q psy13819 206 QELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRT-YVQDKA----V----LYSTDGAFD 268 (323)
Q Consensus 206 ~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~-~~g~~~----~----l~t~dg~~~ 268 (323)
+.+.+. |+.-.+|....-=||.--- ..+..|++.+.+++-+ .-++|- + +||.|..+.
T Consensus 227 r~~~ke-----fG~~tniy~~DpFNE~~Pp-~sepey~~staaAiyesm~kvdknaVWllQgWlF~~d~FW~ 292 (666)
T KOG2233|consen 227 RHQIKE-----FGGVTNIYSADPFNEILPP-ESEPEYVKSTAAAIYESMKKVDKNAVWLLQGWLFTYDPFWP 292 (666)
T ss_pred HHHHHH-----hCCcccccccCcccccCCC-CCChHHHHHHHHHHHHHHhccCcceEEeeecceeecCCCCC
Confidence 998888 6544455555555554221 2368898887765443 334431 1 677887554
No 243
>PRK04161 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=37.59 E-value=54 Score=32.14 Aligned_cols=62 Identities=18% Similarity=0.238 Sum_probs=45.2
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
|.|.-.=....+++|++|.++|-+-+.|+--|+.+ -+-.-..-++++-+.|+++||..+|.+
T Consensus 103 Rl~~ll~~ws~~rike~GadavK~Llyy~pD~~~e--in~~k~a~vervg~eC~a~dipf~lE~ 164 (329)
T PRK04161 103 RLPDCLVEWSVKRLKEAGADAVKFLLYYDVDGDEE--INDQKQAYIERIGSECTAEDIPFFLEL 164 (329)
T ss_pred ccccccchhhHHHHHHhCCCeEEEEEEECCCCCHH--HHHHHHHHHHHHHHHHHHCCCCeEEEE
Confidence 44432223467889999999999999999876652 111222348888999999999999987
No 244
>cd00537 MTHFR Methylenetetrahydrofolate reductase (MTHFR). 5,10-Methylenetetrahydrofolate is reduced to 5-methyltetrahydrofolate by methylenetetrahydrofolate reductase, a cytoplasmic, NAD(P)-dependent enzyme. 5-methyltetrahydrofolate is utilized by methionine synthase to convert homocysteine to methionine. The enzymatic mechanism is a ping-pong bi-bi mechanism, in which NAD(P)+ release precedes the binding of methylenetetrahydrofolate and the acceptor is free FAD. The family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from prokaryotes and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The bacterial enzyme is a homotetramer and NADH is the preferred reductant while the eukaryotic enzyme is a homodimer and NADPH is the preferred reductant. In humans, there are several clinically significant mutations in MTHFR that result in hyperhomocysteinemia, which is a risk factor for the development of cardiovascular disease.
Probab=37.32 E-value=1.8e+02 Score=27.10 Aligned_cols=91 Identities=15% Similarity=0.149 Sum_probs=54.8
Q ss_pred HHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCC--CeEEEccCCccc-------cccCCCCCcc
Q psy13819 109 WIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEED--LYILLRPGPFIC-------GKRDFGGFPP 179 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~G--L~vilr~GPyi~-------aEw~~gg~P~ 179 (323)
+.|++=.++|.+.+-|-+. ||. +.+.+|++.|++.| +.|++..-|... +++..-++|.
T Consensus 151 ~~L~~Ki~aGA~f~iTQ~~----------fd~---~~~~~~~~~~~~~gi~vPIi~GI~p~~s~~~l~~~~~~~Gv~vP~ 217 (274)
T cd00537 151 KRLKRKVDAGADFIITQLF----------FDN---DAFLRFVDRCRAAGITVPIIPGIMPLTSYKQAKRFAKLCGVEIPD 217 (274)
T ss_pred HHHHHHHHCCCCEEeeccc----------ccH---HHHHHHHHHHHHcCCCCCEEeeccccCCHHHHHHHHHhhCCCCCH
Confidence 3344444579999999322 222 34889999999998 556766666433 3444556788
Q ss_pred cccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 180 WLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 180 Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
|+.++... ...+.....+.-.++..++++.+.+
T Consensus 218 ~~~~~l~~--~~~~~~~~~~~g~~~~~~l~~~l~~ 250 (274)
T cd00537 218 WLLERLEK--LKDDAEAVRAEGIEIAAELCDELLE 250 (274)
T ss_pred HHHHHHHh--cCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 88663110 1122344556666777777777554
No 245
>PF05706 CDKN3: Cyclin-dependent kinase inhibitor 3 (CDKN3); InterPro: IPR022778 This entry represents a domain found in cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner [], []. This domain is also found in MAP kinase phosphatase and esterases. This entry contains both eukaryotic and bacterial proteins.; GO: 0004721 phosphoprotein phosphatase activity, 0004725 protein tyrosine phosphatase activity; PDB: 1FQ1_A 1FPZ_F.
Probab=37.19 E-value=1.5e+02 Score=26.38 Aligned_cols=50 Identities=12% Similarity=-0.009 Sum_probs=30.0
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..-..||+.+|+.|+++|=+ ..|.. +.+.-+.. .+.+.++++||.++--|
T Consensus 58 RdL~~DL~~Lk~~G~~~Vvt-----l~~~~--EL~~l~Vp---~L~~~~~~~Gi~~~h~P 107 (168)
T PF05706_consen 58 RDLQADLERLKDWGAQDVVT-----LLTDH--ELARLGVP---DLGEAAQARGIAWHHLP 107 (168)
T ss_dssp B-HHHHHHHHHHTT--EEEE------S-HH--HHHHTT-T---THHHHHHHTT-EEEE--
T ss_pred chHHHHHHHHHHCCCCEEEE-----eCcHH--HHHHcCCc---cHHHHHHHcCCEEEecC
Confidence 56688999999999999977 44433 34443433 34899999999776333
No 246
>cd07943 DRE_TIM_HOA 4-hydroxy-2-oxovalerate aldolase, N-terminal catalytic TIM barrel domain. 4-hydroxy 2-ketovalerate aldolase (Also known as 4-hydroxy-2-ketovalerate aldolase and 4-hydroxy-2-oxopentanoate aldolase (HOA)) converts 4-hydroxy-2-oxopentanoate to acetaldehyde and pyruvate, the penultimate step in the meta-cleavage pathway for the degradation of phenols, cresols and catechol. This family includes the Escherichia coli MhpE aldolase, the Pseudomonas DmpG aldolase, and the Burkholderia xenovorans BphI pyruvate aldolase. In Pseudomonas, the DmpG aldolase tightly associates with a dehydrogenase (DmpF ) and is inactive without it. HOA has a canonical TIM-barrel fold with a C-terminal extension that forms a funnel leading to the active site. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate
Probab=37.12 E-value=70 Score=29.75 Aligned_cols=45 Identities=7% Similarity=-0.028 Sum_probs=37.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++++++++.|+..||+.++-+.. ..+.+.++.|++.|+.|.+.+
T Consensus 88 ~~~i~~a~~~g~~~iri~~~~s~~------------~~~~~~i~~ak~~G~~v~~~~ 132 (263)
T cd07943 88 VDDLKMAADLGVDVVRVATHCTEA------------DVSEQHIGAARKLGMDVVGFL 132 (263)
T ss_pred HHHHHHHHHcCCCEEEEEechhhH------------HHHHHHHHHHHHCCCeEEEEE
Confidence 477999999999999998876632 247788999999999987765
No 247
>PRK09989 hypothetical protein; Provisional
Probab=37.04 E-value=48 Score=30.43 Aligned_cols=59 Identities=7% Similarity=-0.088 Sum_probs=37.4
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..++-++.++++|..+|++...|-...+.+.+..-...+.+.++.+.|++.|+.+.+.+
T Consensus 86 ~l~~~i~~A~~lg~~~v~v~~g~~~~~~~~~~~~~~~~~~l~~l~~~a~~~gv~l~lE~ 144 (258)
T PRK09989 86 DIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFAPHGKRILVEA 144 (258)
T ss_pred HHHHHHHHHHHhCcCEEEECccCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCCEEEEEe
Confidence 35666777799999999764333211111111101123568888999999999988876
No 248
>TIGR00676 fadh2 5,10-methylenetetrahydrofolate reductase, prokaryotic form. This protein is an FAD-containing flavoprotein.
Probab=36.87 E-value=2.4e+02 Score=26.45 Aligned_cols=109 Identities=13% Similarity=0.098 Sum_probs=62.6
Q ss_pred EEeeeeecCCCCCh-hcHH---HHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCe--EEEcc
Q psy13819 91 QFVSGSFHYFRAPP-GRWC---WIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLY--ILLRP 164 (323)
Q Consensus 91 ~i~~g~~Hy~r~p~-~~W~---~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~--vilr~ 164 (323)
+.+++..|+.+-|. ..-+ ++|++=.++|.+.+-|-. .||.+ .+.+|++.|++.|+. |++..
T Consensus 126 f~ig~a~~Peghp~~~~~~~~~~~L~~K~~aGA~f~iTQ~----------~fd~~---~~~~~~~~~~~~gi~~PIi~Gi 192 (272)
T TIGR00676 126 FDIGVAAYPEKHPEAPNLEEDIENLKRKVDAGADYAITQL----------FFDND---DYYRFVDRCRAAGIDVPIIPGI 192 (272)
T ss_pred eeEEEEeCCCCCCCCCCHHHHHHHHHHHHHcCCCeEeecc----------ccCHH---HHHHHHHHHHHcCCCCCEeccc
Confidence 45667776554331 1222 345555578999888822 23433 378899999999765 55555
Q ss_pred CCccc-------cccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 165 GPFIC-------GKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 165 GPyi~-------aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
-|... ..+..-.+|.|+.+.... ...+....+++..++..++++.+..
T Consensus 193 ~p~~s~k~~~~~~~~~Gv~vP~~~~~~l~~--~~~~~~~~~~~gi~~~~~~~~~l~~ 247 (272)
T TIGR00676 193 MPITNFKQLLRFAERCGAEIPAWLVKRLEK--YDDDPEEVRAVGIEYATDQCEDLIA 247 (272)
T ss_pred CCcCCHHHHHHHHhccCCCCCHHHHHHHHh--cCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 55333 235556678888663110 1112245566666777777776543
No 249
>PRK05265 pyridoxine 5'-phosphate synthase; Provisional
Probab=36.70 E-value=70 Score=29.97 Aligned_cols=49 Identities=16% Similarity=0.247 Sum_probs=37.5
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-EccCCcccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRPGPFICG 170 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~GPyi~a 170 (323)
.+..++.++++|+.|+ -|++. ++|. ...++.|++.|-..| |-+|||..+
T Consensus 112 ~~~l~~~i~~L~~~gI-rVSLF-----idP~------------~~qi~~A~~~GAd~VELhTG~yA~a 161 (239)
T PRK05265 112 FDKLKPAIARLKDAGI-RVSLF-----IDPD------------PEQIEAAAEVGADRIELHTGPYADA 161 (239)
T ss_pred HHHHHHHHHHHHHCCC-EEEEE-----eCCC------------HHHHHHHHHhCcCEEEEechhhhcC
Confidence 3566788899999998 55553 3666 456899999999866 899999765
No 250
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=36.44 E-value=60 Score=26.78 Aligned_cols=41 Identities=20% Similarity=0.361 Sum_probs=31.7
Q ss_pred ceeEEEEcCcccccccCCCCCeeEEEeccc----cccCCcceEEEEEEee
Q psy13819 5 SKGVLFANGRHWGRYWSEVGPQYSLFVPEE----FLKVGTNRITIFELTR 50 (323)
Q Consensus 5 ~kg~~~vng~~lGryw~~~gpq~~lyvP~~----~l~~g~N~i~i~e~~~ 50 (323)
|.|.|.++|..+ .++|.+++|||+. +.+.|.-.|+++|...
T Consensus 66 G~g~v~~~~~~~-----~v~~gd~~~iP~g~~H~~~N~G~~~L~liei~~ 110 (127)
T COG0662 66 GTGKVTIGGEEV-----EVKAGDSVYIPAGTPHRVRNTGKIPLVLIEVQS 110 (127)
T ss_pred eEEEEEECCEEE-----EecCCCEEEECCCCcEEEEcCCCcceEEEEEec
Confidence 678899987654 5899999999987 3356666688888765
No 251
>PRK10569 NAD(P)H-dependent FMN reductase; Provisional
Probab=36.39 E-value=92 Score=27.73 Aligned_cols=73 Identities=10% Similarity=0.086 Sum_probs=48.2
Q ss_pred eeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeec-ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 89 PFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYH-YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 89 ~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fd-f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.+++|+..........-+.-.+.+++.|..+..+.+. ...++ +.-. -....++.++++..++.+-.||..|
T Consensus 2 kIl~I~GSpr~~S~t~~l~~~~~~~l~~~g~ev~~idL~--~l~~~-~~~~~~~~~~~~~~~~~~i~~AD~iIi~tP 75 (191)
T PRK10569 2 RVITLAGSPRFPSRSSALLEYAREWLNGLGVEVYHWNLQ--NFAPE-DLLYARFDSPALKTFTEQLAQADGLIVATP 75 (191)
T ss_pred EEEEEEcCCCCCChHHHHHHHHHHHHHhCCCEEEEEEcc--CCChH-HHHhccCCCHHHHHHHHHHHHCCEEEEECC
Confidence 356778877765555566677777888889777666543 22221 1000 0113478999999999999999988
No 252
>KOG3698|consensus
Probab=36.38 E-value=2.5e+02 Score=29.96 Aligned_cols=72 Identities=14% Similarity=0.213 Sum_probs=48.5
Q ss_pred eCCeeeEEeeeeec--CCC-CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCC---------eecccccchHHHHHHH
Q psy13819 85 LNEDPFQFVSGSFH--YFR-APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPG---------HYHYDGHRDIEHFLQL 152 (323)
Q Consensus 85 ~dGk~~~i~~g~~H--y~r-~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G---------~fdf~g~~dl~~fl~~ 152 (323)
.+|+|-+ ++|.+- |-| +.+|+=++.+++++.+|+|+.-. -|+.+ -|+-+-...|..+|+.
T Consensus 11 A~g~r~f-iCGVvEGFYGRPWt~EQRK~LFrrl~~~gl~tYlY-------APKDDyKHR~~WRElY~vEEa~~L~~Li~a 82 (891)
T KOG3698|consen 11 AVGNRKF-ICGVVEGFYGRPWTPEQRKHLFRRLNQLGLTTYLY-------APKDDYKHRSLWRELYNVEEATYLRNLIEA 82 (891)
T ss_pred cccccee-EEEeeccccCCCCCHHHHHHHHHHHHhcccceeee-------cccchhHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 3455544 445444 545 66899999999999999994322 33322 2444444678888999
Q ss_pred HHHCCCeEEEcc
Q psy13819 153 AVEEDLYILLRP 164 (323)
Q Consensus 153 a~e~GL~vilr~ 164 (323)
|+|+++..+-.+
T Consensus 83 Ake~~i~F~YAi 94 (891)
T KOG3698|consen 83 AKENNINFVYAI 94 (891)
T ss_pred HHhcCceEEEEc
Confidence 999999877544
No 253
>COG0149 TpiA Triosephosphate isomerase [Carbohydrate transport and metabolism]
Probab=36.30 E-value=77 Score=29.93 Aligned_cols=49 Identities=16% Similarity=0.125 Sum_probs=36.1
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
..+++++|++.|-+ -|.-++-.|+-+. ..+.+=++.|.++||.+|++.|
T Consensus 81 ~~mL~d~G~~~vii-----GHSERR~~~~E~d-~~i~~K~~aa~~~Gl~pIlCvG 129 (251)
T COG0149 81 AEMLKDLGAKYVLI-----GHSERRLYFGETD-ELIAKKVKAAKEAGLTPILCVG 129 (251)
T ss_pred HHHHHHcCCCEEEE-----Cccccccccccch-HHHHHHHHHHHHCCCeEEEEcC
Confidence 45799999999999 4444434444332 2355668999999999999987
No 254
>cd04726 KGPDC_HPS 3-Keto-L-gulonate 6-phosphate decarboxylase (KGPDC) and D-arabino-3-hexulose-6-phosphate synthase (HPS). KGPDC catalyzes the formation of L-xylulose 5-phosphate and carbon dioxide from 3-keto-L-gulonate 6-phosphate as part of the anaerobic pathway for L-ascorbate utilization in some eubacteria. HPS catalyzes the formation of D-arabino-3-hexulose-6-phosphate from D-ribulose 5-phosphate and formaldehyde in microorganisms that can use formaldehyde as a carbon source. Both catalyze reactions that involve the Mg2+-assisted formation and stabilization of 1,2-enediolate reaction intermediates.
Probab=36.18 E-value=89 Score=27.31 Aligned_cols=45 Identities=11% Similarity=0.037 Sum_probs=33.5
Q ss_pred HHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc-cCC
Q psy13819 109 WIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR-PGP 166 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr-~GP 166 (323)
..++.++++|.+.|.+. .|+. ...+.++++.|+++|+++++. ++|
T Consensus 68 ~~~~~~~~aGad~i~~h-----~~~~--------~~~~~~~i~~~~~~g~~~~v~~~~~ 113 (202)
T cd04726 68 LEAEMAFKAGADIVTVL-----GAAP--------LSTIKKAVKAAKKYGKEVQVDLIGV 113 (202)
T ss_pred HHHHHHHhcCCCEEEEE-----eeCC--------HHHHHHHHHHHHHcCCeEEEEEeCC
Confidence 35688999999999983 2221 134678899999999999875 543
No 255
>TIGR03217 4OH_2_O_val_ald 4-hydroxy-2-oxovalerate aldolase. Members of this protein family are 4-hydroxy-2-oxovalerate aldolase, also called 4-hydroxy-2-ketovalerate aldolase and 2-oxo-4-hydroxypentanoate aldolase. This enzyme, part of the pathway for the meta-cleavage of catechol, produces pyruvate and acetaldehyde. Acetaldehyde is then converted by acetaldehyde dehydrogenase (acylating) (DmpF; EC 1.2.1.10) to acetyl-CoA. The two enzymes are tightly associated.
Probab=35.78 E-value=82 Score=30.73 Aligned_cols=45 Identities=13% Similarity=0.033 Sum_probs=36.2
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++++++.+.|+..||+.+..+.. +...+.++.|++.|+.|...+
T Consensus 90 ~~dl~~a~~~gvd~iri~~~~~e~------------d~~~~~i~~ak~~G~~v~~~l 134 (333)
T TIGR03217 90 VHDLKAAYDAGARTVRVATHCTEA------------DVSEQHIGMARELGMDTVGFL 134 (333)
T ss_pred HHHHHHHHHCCCCEEEEEeccchH------------HHHHHHHHHHHHcCCeEEEEE
Confidence 468999999999999997754332 347788999999999987655
No 256
>PRK13762 tRNA-modifying enzyme; Provisional
Probab=35.56 E-value=1.5e+02 Score=28.74 Aligned_cols=43 Identities=14% Similarity=0.117 Sum_probs=30.8
Q ss_pred CcccCCCeeccccc----chHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccc
Q psy13819 130 SHEVHPGHYHYDGH----RDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWL 181 (323)
Q Consensus 130 ~hEP~~G~fdf~g~----~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl 181 (323)
..+|..+.+.++|. .++..+++.+++.|+.+.|.+ +|.+|.-+
T Consensus 127 a~~~~~v~iSl~GEPlL~p~l~eli~~~k~~Gi~~~L~T---------NG~~~e~l 173 (322)
T PRK13762 127 AMEPKHVAISLSGEPTLYPYLPELIEEFHKRGFTTFLVT---------NGTRPDVL 173 (322)
T ss_pred ccCCCEEEEeCCccccchhhHHHHHHHHHHcCCCEEEEC---------CCCCHHHH
Confidence 44455566665552 468899999999999988877 56667554
No 257
>PRK08208 coproporphyrinogen III oxidase; Validated
Probab=35.41 E-value=49 Score=33.24 Aligned_cols=50 Identities=22% Similarity=0.163 Sum_probs=33.3
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC----CCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH----PGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~----~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
++.|+.||++|+|.|++.|- ...++ =|+- . ..+++.+.++.|++.|+.++
T Consensus 141 ~e~l~~l~~~G~~rvslGvQ--S~~~~~L~~l~R~-~-~~~~~~~ai~~l~~~g~~~i 194 (430)
T PRK08208 141 AEKLALLAARGVNRLSIGVQ--SFHDSELHALHRP-Q-KRADVHQALEWIRAAGFPIL 194 (430)
T ss_pred HHHHHHHHHcCCCEEEEecc--cCCHHHHHHhCCC-C-CHHHHHHHHHHHHHcCCCeE
Confidence 57899999999999999443 11111 0110 0 12467788999999999754
No 258
>KOG1065|consensus
Probab=35.40 E-value=62 Score=35.38 Aligned_cols=66 Identities=18% Similarity=0.175 Sum_probs=45.7
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc----ccchHHHHHHHHHHCCCeEEEccCCccccccC
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD----GHRDIEHFLQLAVEEDLYILLRPGPFICGKRD 173 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~----g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~ 173 (323)
...++..+.++++|+.. .+.|+.+--..+.=||+ ...++..|++..++.|+++++.+-|+|.....
T Consensus 311 s~~~dvv~~~~~agiPl---d~~~~DiDyMd~ykDFTvd~~~fp~~~~fv~~Lh~~G~kyvliidP~is~~~~ 380 (805)
T KOG1065|consen 311 SVVRDVVENYRAAGIPL---DVIVIDIDYMDGYKDFTVDKVWFPDLKDFVDDLHARGFKYVLIIDPFISTNSS 380 (805)
T ss_pred HHHHHHHHHHHHcCCCc---ceeeeehhhhhcccceeeccccCcchHHHHHHHHhCCCeEEEEeCCccccCcc
Confidence 34579999999999982 23333333333433444 23458899999999999999888888875443
No 259
>TIGR00539 hemN_rel putative oxygen-independent coproporphyrinogen III oxidase. Experimentally determined examples of oxygen-independent coproporphyrinogen III oxidase, an enzyme that replaces HemF function under anaerobic conditions, belong to a family of proteins described by the model hemN. This model, hemN_rel, models a closely related protein, shorter at the amino end and lacking the region containing the motif PYRT[SC]YP found in members of the hemN family. Several species, including E. coli, Helicobacter pylori, Aquifex aeolicus, and Chlamydia trachomatis, have members of both this family and the E. coli hemN family. The member of this family from Bacillus subtilis was shown to complement an hemF/hemN double mutant of Salmonella typimurium and to prevent accumulation of coproporphyrinogen III under anaerobic conditions, but the exact role of this protein is still uncertain. It is found in a number of species that do not synthesize heme de novo.
Probab=35.13 E-value=66 Score=31.36 Aligned_cols=52 Identities=23% Similarity=0.332 Sum_probs=34.7
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc----cchHHHHHHHHHHCCCe-EEEc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG----HRDIEHFLQLAVEEDLY-ILLR 163 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g----~~dl~~fl~~a~e~GL~-vilr 163 (323)
++.++.||++|+|.|++.|- .. .+...+.-+ .+++.+.++.+++.|+. |-+.
T Consensus 100 ~e~l~~l~~~Gv~risiGvq--S~--~~~~l~~lgR~~~~~~~~~ai~~l~~~G~~~v~~d 156 (360)
T TIGR00539 100 AEWCKGLKGAGINRLSLGVQ--SF--RDDKLLFLGRQHSAKNIAPAIETALKSGIENISLD 156 (360)
T ss_pred HHHHHHHHHcCCCEEEEecc--cC--ChHHHHHhCCCCCHHHHHHHHHHHHHcCCCeEEEe
Confidence 57799999999999998443 11 122222222 24677889999999996 4443
No 260
>PRK07432 5'-methylthioadenosine phosphorylase; Provisional
Probab=35.09 E-value=39 Score=32.54 Aligned_cols=39 Identities=23% Similarity=0.354 Sum_probs=31.7
Q ss_pred EeCCeeeEEe--eeeecCCCCChhc--HHHHHHHHHHhCCCEEEE
Q psy13819 84 RLNEDPFQFV--SGSFHYFRAPPGR--WCWIMRAMRAAGLNALST 124 (323)
Q Consensus 84 ~~dGk~~~i~--~g~~Hy~r~p~~~--W~~~l~~mk~~G~N~Vr~ 124 (323)
.++|+++.++ .|..|+. ++.. .+.-+..||++|+..|=.
T Consensus 43 ~l~g~~v~~l~RhGr~H~y--~p~~i~~rAni~alk~lGv~~ii~ 85 (290)
T PRK07432 43 TLDGTRVAFLARHGRNHTL--LPTELPFRANIYAMKQLGVEYLIS 85 (290)
T ss_pred EECCEEEEEEECCCCCCcc--ChhhcCcHHHHHHHHHcCCCEEEE
Confidence 4789999999 9999975 4444 455699999999998866
No 261
>PF02126 PTE: Phosphotriesterase family; InterPro: IPR001559 Synonym(s): Paraoxonase, A-esterase, Aryltriphosphatase, Phosphotriesterase, Paraoxon hydrolase Bacteria such as Brevundimonas diminuta (Pseudomonas diminuta) harbour a plasmid that carries the gene for Aryldialkylphosphatase (3.1.8.1 from EC) (PTE) (also known as parathion hydrolase). This enzyme has attracted interest because of its potential use in the detoxification of chemical waste and warfare agents and its ability to degrade agricultural pesticides such as parathion. It acts specifically on synthetic organophosphate triesters and phosphorofluoridates. It does not seem to have a natural occuring substrate and may thus have optimally evolved for utilizing paraoxon. Aryldialkylphosphatase belongs to a family [, ] of enzymes that possess a binuclear zinc metal centre at their active site. The two zinc ions are coordinated by six different residues, six of which being histidines. This family so far includes, in addition to the parathion hydrolase, the following proteins: Escherichia coli protein Php, the substrate of which is not yet known. Mycobacterium tuberculosis phosphotriesterase homology protein Rv0230C. Mammalian phosphotriesterase related protein (PTER) (RPR-1). ; GO: 0008270 zinc ion binding, 0016788 hydrolase activity, acting on ester bonds, 0009056 catabolic process; PDB: 3MSR_A 3OVG_D 3K2G_C 1BF6_B 3OQE_A 3C86_A 3SO7_A 2D2G_A 2R1P_A 2D2H_A ....
Probab=34.99 E-value=2.3e+02 Score=27.43 Aligned_cols=62 Identities=18% Similarity=0.299 Sum_probs=40.4
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLK 183 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~ 183 (323)
+.-...|+.+|++|.++| +|..|- ..-+|...+.+++++-|+.||..+|=|.- .-.|.|+..
T Consensus 38 ~~~~~El~~~k~~Gg~ti--------Vd~T~~----g~GRd~~~l~~is~~tGv~II~~TG~y~~-----~~~p~~~~~ 99 (308)
T PF02126_consen 38 EAAVAELKEFKAAGGRTI--------VDATPI----GLGRDVEALREISRRTGVNIIASTGFYKE-----PFYPEWVRE 99 (308)
T ss_dssp HHHHHHHHHHHHTTEEEE--------EE--SG----GGTB-HHHHHHHHHHHT-EEEEEEEE-SG-----GCSCHHHHT
T ss_pred HHHHHHHHHHHHcCCCEE--------EecCCc----ccCcCHHHHHHHHHHhCCeEEEeCCCCcc-----ccCChhhhc
Confidence 344568899999997774 233321 23468999999999999999999985532 224667654
No 262
>PTZ00372 endonuclease 4-like protein; Provisional
Probab=34.92 E-value=90 Score=31.59 Aligned_cols=83 Identities=7% Similarity=0.043 Sum_probs=57.4
Q ss_pred eEEeCCeeeEEeeeeecCCCCC---hhcHHHHHHHHHHhCCCE--E--EEEEeCCCcccCCCeecccccchHHHHHHHHH
Q psy13819 82 TFRLNEDPFQFVSGSFHYFRAP---PGRWCWIMRAMRAAGLNA--L--STYVEWRSHEVHPGHYHYDGHRDIEHFLQLAV 154 (323)
Q Consensus 82 ~~~~dGk~~~i~~g~~Hy~r~p---~~~W~~~l~~mk~~G~N~--V--r~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~ 154 (323)
...+.+.-|+||.++-+-++.+ ++.-+.--+.+++.|++. + .....-|.--|.+..+..+ .+.+.+-|+.|.
T Consensus 150 a~~~g~~afqiF~~npr~w~~~~~~~~~~~~f~~~~~~~gi~~~~i~~HapYlINLASpd~e~rekS-v~~~~~eL~rA~ 228 (413)
T PTZ00372 150 AYNIAGQAFALFLKNQRTWNSPPLSDETIDKFKENCKKYNYDPKFILPHGSYLINLANPDKEKREKS-YDAFLDDLQRCE 228 (413)
T ss_pred HHHcCCCEEEEEcCCCccCCCCCCCHHHHHHHHHHHHHcCCCcceEEeecCceecCCCCCHHHHHHH-HHHHHHHHHHHH
Confidence 3456668899998887766544 455566667788888762 3 2222267777777777776 344778899999
Q ss_pred HCCCe-EEEccC
Q psy13819 155 EEDLY-ILLRPG 165 (323)
Q Consensus 155 e~GL~-vilr~G 165 (323)
+.|.. |++.||
T Consensus 229 ~LGa~~VV~HPG 240 (413)
T PTZ00372 229 QLGIKLYNFHPG 240 (413)
T ss_pred HcCCCEEEECCC
Confidence 99997 556786
No 263
>PRK07379 coproporphyrinogen III oxidase; Provisional
Probab=34.72 E-value=78 Score=31.51 Aligned_cols=52 Identities=19% Similarity=0.160 Sum_probs=34.2
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecccc----cchHHHHHHHHHHCCCe-EEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHYDG----HRDIEHFLQLAVEEDLY-ILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf~g----~~dl~~fl~~a~e~GL~-vilr~ 164 (323)
++.|+.+|++|+|.|++.|- .- +..-..-+ .+++.+.++.+++.|+. |-+..
T Consensus 115 ~e~l~~l~~~GvnrislGvQ-----S~~d~~L~~l~R~~~~~~~~~ai~~l~~~G~~~v~~dl 172 (400)
T PRK07379 115 LEQLQGYRSLGVNRVSLGVQ-----AFQDELLALCGRSHRVKDIFAAVDLIHQAGIENFSLDL 172 (400)
T ss_pred HHHHHHHHHCCCCEEEEEcc-----cCCHHHHHHhCCCCCHHHHHHHHHHHHHcCCCeEEEEe
Confidence 57899999999999999432 11 11111111 24677889999999997 44443
No 264
>PRK01045 ispH 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed
Probab=34.71 E-value=1.1e+02 Score=29.45 Aligned_cols=70 Identities=13% Similarity=0.150 Sum_probs=49.5
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcc-cCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHE-VHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hE-P~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
++|+..+|-=+|. ++.++++++.|+.+|.- .-| |..+..=+...--=....+.|++.|+.||-.+=
T Consensus 30 ~~~iytlG~iIHN--------~~vv~~L~~~GV~~v~~-----~~~v~~~~~ViirAHGv~~~~~~~~~~~g~~viDaTC 96 (298)
T PRK01045 30 GAPIYVRHEIVHN--------RYVVERLEKKGAIFVEE-----LDEVPDGAIVIFSAHGVSPAVREEAKERGLTVIDATC 96 (298)
T ss_pred CCCeEEEecCccC--------HHHHHHHHHCCCEEecC-----cccCCCCCEEEEeCCCCCHHHHHHHHHCCCeEEeCCC
Confidence 5678888888998 56799999999999973 222 223455555433334557888888888888777
Q ss_pred Cccc
Q psy13819 166 PFIC 169 (323)
Q Consensus 166 Pyi~ 169 (323)
|++-
T Consensus 97 P~V~ 100 (298)
T PRK01045 97 PLVT 100 (298)
T ss_pred ccch
Confidence 7654
No 265
>COG1324 CutA Uncharacterized protein involved in tolerance to divalent cations [Inorganic ion transport and metabolism]
Probab=34.52 E-value=78 Score=25.90 Aligned_cols=36 Identities=33% Similarity=0.452 Sum_probs=23.8
Q ss_pred HHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHH
Q psy13819 207 ELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDL 249 (323)
Q Consensus 207 ~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~ 249 (323)
++.++|+.+---.-|-||++.|+| | .++|++|+.+.
T Consensus 67 ~l~~~ikelHpYevPeIi~i~v~~--g-----~~eYL~Wl~~~ 102 (104)
T COG1324 67 ELIERIKELHPYEVPEIIALPVDN--G-----LPEYLEWLNEE 102 (104)
T ss_pred HHHHHHHHhCCCCCceEEEEEecc--C-----CHHHHHHHHHh
Confidence 344444432222358899999987 3 48999999864
No 266
>TIGR01361 DAHP_synth_Bsub phospho-2-dehydro-3-deoxyheptonate aldolase. The member of this family from Synechocystis PCC 6803, CcmA, was shown to be essential for carboxysome formation. However, no other candidate for this enzyme is present in that species, chorismate biosynthesis does occur, other species having this protein lack carboxysomes but appear to make chorismate, and a requirement of CcmA for carboxysome formation does not prohibit a role in chorismate biosynthesis.
Probab=34.51 E-value=1.1e+02 Score=28.75 Aligned_cols=74 Identities=15% Similarity=0.162 Sum_probs=50.2
Q ss_pred EEeCCeeeEEeeeeecCCCC--ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCC
Q psy13819 83 FRLNEDPFQFVSGSFHYFRA--PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDL 158 (323)
Q Consensus 83 ~~~dGk~~~i~~g~~Hy~r~--p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL 158 (323)
+.+.++.+.++.| +.. ..+.-.+-.+.+|+.|.+..+.+++=++..| +.|. +...|..+-+.|++.||
T Consensus 19 ~~~g~~~~~~IAG----pc~ie~~~~~~~~A~~lk~~~~k~~r~~~~KpRtsp----~s~~g~g~~gl~~l~~~~~~~Gl 90 (260)
T TIGR01361 19 VKIGEGSPIVIAG----PCSVESEEQIMETARFVKEAGAKILRGGAFKPRTSP----YSFQGLGEEGLKLLRRAADEHGL 90 (260)
T ss_pred EEEcCCcEEEEEe----CCccCCHHHHHHHHHHHHHHHHHhccCceecCCCCC----ccccccHHHHHHHHHHHHHHhCC
Confidence 3444444555666 222 2456667788888899988887666544444 3454 35668888899999999
Q ss_pred eEEEcc
Q psy13819 159 YILLRP 164 (323)
Q Consensus 159 ~vilr~ 164 (323)
.++..+
T Consensus 91 ~~~t~~ 96 (260)
T TIGR01361 91 PVVTEV 96 (260)
T ss_pred CEEEee
Confidence 999877
No 267
>TIGR03471 HpnJ hopanoid biosynthesis associated radical SAM protein HpnJ. One of the well-described hopanoid intermediates is bacteriohopanetetrol. In the conversion from hopene several reactions must occur in the side chain for which a radical mechanism might be reasonable. These include the four (presumably anaerobic) hydroxylations and a methyl shift.
Probab=34.42 E-value=42 Score=34.01 Aligned_cols=55 Identities=13% Similarity=-0.014 Sum_probs=35.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-CCeec-ccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYH-YDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fd-f~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.++.|+++|++.|.+.+. ...++ -...+ -...+++.+.++.|+++|+.+.+..
T Consensus 287 ~e~l~~l~~aG~~~v~iGiE--S~s~~~L~~~~K~~~~~~~~~~i~~~~~~Gi~v~~~~ 343 (472)
T TIGR03471 287 YETLKVMKENGLRLLLVGYE--SGDQQILKNIKKGLTVEIARRFTRDCHKLGIKVHGTF 343 (472)
T ss_pred HHHHHHHHHcCCCEEEEcCC--CCCHHHHHHhcCCCCHHHHHHHHHHHHHCCCeEEEEE
Confidence 57899999999999888432 11110 00000 0112467888999999999876543
No 268
>PRK06703 flavodoxin; Provisional
Probab=34.38 E-value=2.7e+02 Score=23.13 Aligned_cols=100 Identities=9% Similarity=-0.124 Sum_probs=58.4
Q ss_pred eCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc---ccchHHHHHHHHHHCCCeEE
Q psy13819 85 LNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD---GHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 85 ~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~---g~~dl~~fl~~a~e~GL~vi 161 (323)
+..-...+++...+-.-.+|..+.+-+..+++.-++...+.++- .+|++ .......+-+.+++.|..++
T Consensus 46 l~~~d~viigspt~~~g~~p~~~~~f~~~l~~~~l~~k~~~vfg--------~g~~~y~~~~~a~~~l~~~l~~~G~~~~ 117 (151)
T PRK06703 46 LLAYDGIILGSYTWGDGDLPYEAEDFHEDLENIDLSGKKVAVFG--------SGDTAYPLFCEAVTIFEERLVERGAELV 117 (151)
T ss_pred HhcCCcEEEEECCCCCCcCcHHHHHHHHHHhcCCCCCCEEEEEc--------cCCCChHHHHHHHHHHHHHHHHCCCEEc
Confidence 33445566655555344445667777777776666655555552 22322 12345567778899999887
Q ss_pred EccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHH
Q psy13819 162 LRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQ 213 (323)
Q Consensus 162 lr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~ 213 (323)
.++- .... .| +...-++.++.|.+++++.++
T Consensus 118 ~~~~-------~~~~--------~p------~~~~~~~~~~~~~~~~~~~~~ 148 (151)
T PRK06703 118 QEGL-------KIEL--------AP------ETDEDVEKCSNFAIAFAEKFA 148 (151)
T ss_pred ccCe-------EEec--------CC------CchhHHHHHHHHHHHHHHHHH
Confidence 7551 1111 12 124567888888888887654
No 269
>PLN02429 triosephosphate isomerase
Probab=34.22 E-value=88 Score=30.52 Aligned_cols=49 Identities=14% Similarity=0.092 Sum_probs=34.1
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++-.|.-+. +.+.+=+..|.++||.+|++.|
T Consensus 140 a~mLkd~Gv~~Vii-----GHSERR~~f~Etd-~~V~~Kv~~al~~GL~pIvCIG 188 (315)
T PLN02429 140 VEQLKDLGCKWVIL-----GHSERRHVIGEKD-EFIGKKAAYALSEGLGVIACIG 188 (315)
T ss_pred HHHHHHcCCCEEEe-----CccccCCCCCcCH-HHHHHHHHHHHHCcCEEEEEcC
Confidence 56899999999999 4544445554332 2233334449999999999997
No 270
>PRK05660 HemN family oxidoreductase; Provisional
Probab=34.22 E-value=51 Score=32.50 Aligned_cols=50 Identities=20% Similarity=0.137 Sum_probs=33.2
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc----ccchHHHHHHHHHHCCCeEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD----GHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~----g~~dl~~fl~~a~e~GL~vi 161 (323)
++.|+.||++|+|.|++.|- ... +-..+.- ..++..+.++.|++.|+..+
T Consensus 107 ~e~l~~Lk~~Gv~risiGvq--S~~--~~~L~~l~r~~~~~~~~~ai~~~~~~G~~~v 160 (378)
T PRK05660 107 ADRFVGYQRAGVNRISIGVQ--SFS--EEKLKRLGRIHGPDEAKRAAKLAQGLGLRSF 160 (378)
T ss_pred HHHHHHHHHcCCCEEEeccC--cCC--HHHHHHhCCCCCHHHHHHHHHHHHHcCCCeE
Confidence 58899999999999999443 111 1122211 12456777999999999643
No 271
>cd07944 DRE_TIM_HOA_like 4-hydroxy-2-oxovalerate aldolase-like, N-terminal catalytic TIM barrel domain. This family of bacterial enzymes is sequence-similar to 4-hydroxy-2-oxovalerate aldolase (HOA) but its exact function is unknown. This family includes the Bacteroides vulgatus Bvu_2661 protein and belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues t
Probab=33.88 E-value=97 Score=29.10 Aligned_cols=66 Identities=14% Similarity=0.130 Sum_probs=45.8
Q ss_pred CCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHH-HCCCeEEEccC
Q psy13819 100 FRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAV-EEDLYILLRPG 165 (323)
Q Consensus 100 ~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~-e~GL~vilr~G 165 (323)
.+.+.+.-.+..+.+-++|++.|++.++-...+...|..-|...+.+.++.+..+ +.-+-++++++
T Consensus 15 ~~f~~~~~~~ia~~L~~~GVd~IEvG~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 81 (266)
T cd07944 15 WDFGDEFVKAIYRALAAAGIDYVEIGYRSSPEKEFKGKSAFCDDEFLRRLLGDSKGNTKIAVMVDYG 81 (266)
T ss_pred ccCCHHHHHHHHHHHHHCCCCEEEeecCCCCccccCCCccCCCHHHHHHHHhhhccCCEEEEEECCC
Confidence 3566788899999999999999999877665444556666665555655565543 44444566664
No 272
>cd02874 GH18_CFLE_spore_hydrolase Cortical fragment-lytic enzyme (CFLE) is a peptidoglycan hydrolase involved in bacterial endospore germination. CFLE is expressed as an inactive preprotein (called SleB) in the forespore compartment of sporulating cells. SleB translocates across the forespore inner membrane and is deposited as a mature enzyme in the cortex layer of the spore. As part of a sensory mechanism capable of initiating germination, CFLE degrades a spore-specific peptidoglycan constituent called muramic-acid delta-lactam that comprises the outer cortex. CFLE has a C-terminal glycosyl hydrolase family 18 (GH18) catalytic domain as well as two N-terminal LysM peptidoglycan-binding domains. In addition to SleB, this family includes YaaH, YdhD, and YvbX from Bacillus subtilis.
Probab=33.62 E-value=2.9e+02 Score=26.12 Aligned_cols=43 Identities=19% Similarity=0.224 Sum_probs=27.9
Q ss_pred hCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 117 AGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 117 ~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
..+++|.. .|-...+ .|.. ++. ...++++.|+++|++|++..|
T Consensus 24 ~~lt~v~p--~w~~~~~-~g~~--~~~-~~~~~~~~a~~~~~kv~~~i~ 66 (313)
T cd02874 24 PYLTYIAP--FWYGVDA-DGTL--TGL-PDERLIEAAKRRGVKPLLVIT 66 (313)
T ss_pred CCCCEEEE--EEEEEcC-CCCC--CCC-CCHHHHHHHHHCCCeEEEEEe
Confidence 34555544 4545544 3543 332 246889999999999999885
No 273
>PRK12330 oxaloacetate decarboxylase; Provisional
Probab=33.43 E-value=92 Score=32.34 Aligned_cols=52 Identities=10% Similarity=0.088 Sum_probs=43.3
Q ss_pred CCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 99 YFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 99 y~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
|-.+|.+.-+.++++..+.|+..+|+..+-|.. +++...++.+++.|..+..
T Consensus 91 y~~y~ddvv~~fv~~a~~~Gidi~RIfd~lndv------------~nl~~ai~~vk~ag~~~~~ 142 (499)
T PRK12330 91 YRHYEDEVVDRFVEKSAENGMDVFRVFDALNDP------------RNLEHAMKAVKKVGKHAQG 142 (499)
T ss_pred ccCcchhHHHHHHHHHHHcCCCEEEEEecCChH------------HHHHHHHHHHHHhCCeEEE
Confidence 666777888999999999999999998777664 6688888889998887643
No 274
>PF00121 TIM: Triosephosphate isomerase; InterPro: IPR000652 Triosephosphate isomerase (5.3.1.1 from EC) (TIM) [] is the glycolytic enzyme that catalyses the reversible interconversion of glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. TIM plays an important role in several metabolic pathways and is essential for efficient energy production. It is present in eukaryotes as well as in prokaryotes. TIM is a dimer of identical subunits, each of which is made up of about 250 amino-acid residues. A glutamic acid residue is involved in the catalytic mechanism [, ]. The tertiary structure of TIM has eight beta/alpha motifs folded into a barrel structure. The TIM barrel fold occurs ubiquitously and is found in numerous other enzymes that can be involved in energy metabolism, macromolecule metabolism, or small molecule metabolism []. The sequence around the active site residue is perfectly conserved in all known TIM's. Deficiencies in TIM are associated with haemolytic anaemia coupled with a progressive, severe neurological disorder [].; GO: 0004807 triose-phosphate isomerase activity, 0008152 metabolic process; PDB: 2YPI_A 1YPI_A 1NEY_B 1NF0_B 1I45_A 7TIM_A 3YPI_B 2H6R_H 2Y63_A 1N55_A ....
Probab=33.36 E-value=33 Score=32.02 Aligned_cols=49 Identities=14% Similarity=0.173 Sum_probs=37.4
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++-.|.-+ .+.+.+=++.|.++||.+|++.|
T Consensus 77 ~~mL~d~G~~~vii-----GHSERR~~f~Et-d~~i~~Kv~~al~~gl~pIvCvG 125 (244)
T PF00121_consen 77 AEMLKDLGCKYVII-----GHSERRQYFGET-DEIINKKVKAALENGLTPIVCVG 125 (244)
T ss_dssp HHHHHHTTESEEEE-----SCHHHHHHST-B-HHHHHHHHHHHHHTT-EEEEEES
T ss_pred HHHHHHhhCCEEEe-----ccccccCccccc-cHHHHHHHHHHHHCCCEEEEEec
Confidence 57899999999999 554444444433 35688889999999999999997
No 275
>cd06542 GH18_EndoS-like Endo-beta-N-acetylglucosaminidases are bacterial chitinases that hydrolyze the chitin core of various asparagine (N)-linked glycans and glycoproteins. The endo-beta-N-acetylglucosaminidases have a glycosyl hydrolase family 18 (GH18) catalytic domain. Some members also have an additional C-terminal glycosyl hydrolase family 20 (GH20) domain while others have an N-terminal domain of unknown function (pfam08522). Members of this family include endo-beta-N-acetylglucosaminidase S (EndoS) from Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and EndoH from Flavobacterium meningosepticum, and EndoE from Enterococcus faecalis. EndoS is a secreted endoglycosidase from Streptococcus pyogenes that specifically hydrolyzes the glycan on human IgG between two core N-acetylglucosamine residues. EndoE is a secreted endoglycosidase, encoded by the ndoE gene in Enterococcus faecalis, that hydrolyzes the glycan on human RNase B.
Probab=33.22 E-value=3.8e+02 Score=24.41 Aligned_cols=85 Identities=18% Similarity=0.269 Sum_probs=47.3
Q ss_pred chHHHHHHHHHHCCCeEEEccCCccccccCCC-CCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCc
Q psy13819 144 RDIEHFLQLAVEEDLYILLRPGPFICGKRDFG-GFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRP 222 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~g-g~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~ 222 (323)
......++.+++.|++|++..|- |..+ ++ +. ..++ +..++|.+.++..+..+-+
T Consensus 51 ~~~~~~i~~l~~kG~KVl~sigg-----~~~~~~~--~~----------~~~~---~~~~~fa~~l~~~v~~ygl----- 105 (255)
T cd06542 51 TNKETYIRPLQAKGTKVLLSILG-----NHLGAGF--AN----------NLSD---AAAKAYAKAIVDTVDKYGL----- 105 (255)
T ss_pred HHHHHHHHHHhhCCCEEEEEECC-----CCCCCCc--cc----------cCCH---HHHHHHHHHHHHHHHHhCC-----
Confidence 34566788899999999998851 1111 11 00 1122 3467777777777765433
Q ss_pred EEEEccCCCcCCCC-----CCh-HHHHHHHHHHHHHhcC
Q psy13819 223 IILVQVENEYGSDA-----ECD-PAHAVWLRDLLRTYVQ 255 (323)
Q Consensus 223 Vi~~QieNE~g~~~-----~~~-~~Y~~~l~~~~~~~~g 255 (323)
=++.|+=|+.... ..+ ..|...++++ |+.++
T Consensus 106 -DGiDiD~E~~~~~~~~~~~~~~~~~~~lv~~L-r~~~~ 142 (255)
T cd06542 106 -DGVDFDDEYSGYGKNGTSQPSNEAFVRLIKEL-RKYMG 142 (255)
T ss_pred -CceEEeeeecccCCCCCCcchHHHHHHHHHHH-HHHhC
Confidence 3566777765431 113 3455444444 44455
No 276
>cd04886 ACT_ThrD-II-like C-terminal ACT domain of biodegradative (catabolic) threonine dehydratase II (ThrD-II) and other related ACT domains. This CD includes the C-terminal ACT domain of biodegradative (catabolic) threonine dehydratase II (ThrD-II) and other related ACT domains. The Escherichia coli tdcB gene product, ThrD-II, anaerobically catalyzes the pyridoxal phosphate-dependent dehydration of L-threonine and L-serine to ammonia and to alpha-ketobutyrate and pyruvate, respectively. Tetrameric ThrD-II is subject to allosteric activation by AMP, inhibition by alpha-keto acids, and catabolite inactivation by several metabolites of glycolysis and the citric acid cycle. Also included in this CD are N-terminal ACT domains present in smaller (~170 a.a.) archaeal proteins of unknown function. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=33.18 E-value=1.6e+02 Score=20.24 Aligned_cols=58 Identities=17% Similarity=0.275 Sum_probs=38.3
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCe----ecc--cccchHHHHHHHHHHCCCeEE
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGH----YHY--DGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~----fdf--~g~~dl~~fl~~a~e~GL~vi 161 (323)
|....+.++.+.+.|+|...+...=....-.+|. +.+ ....+++.+++..++.|..+.
T Consensus 9 ~G~L~~i~~~i~~~~~nI~~i~~~~~~~~~~~~~~~~~i~v~~~~~~~l~~l~~~l~~~g~~~~ 72 (73)
T cd04886 9 PGQLAKLLAVIAEAGANIIEVSHDRAFKTLPLGEVEVELTLETRGAEHIEEIIAALREAGYDVR 72 (73)
T ss_pred CChHHHHHHHHHHcCCCEEEEEEEeccCCCCCceEEEEEEEEeCCHHHHHHHHHHHHHcCCEEe
Confidence 4567888999999999999884321111001343 222 233678899999999998763
No 277
>PF04909 Amidohydro_2: Amidohydrolase; InterPro: IPR006992 These proteins are related to the metal-dependent hydrolase superfamily []. The family includes 2-amino-3-carboxymuconate-6-semialdehyde decarboxylase which converts alpha-amino-beta-carboxymuconate-epsilon- semialdehyde (ACMS) to alpha-aminomuconate semialdehyde (AMS). ACMS can be converted non-enzymatically to quinolate, a potent endogenous excitoxin of neuronal cells which is implicated in the pathogenesis of various neurodegenerative disorders. In the presence of AMCSD, ACMS is converted to AMS, a benign catabolite. 2-amino-3-(3-oxoprop-2-enyl)-but-2-enedioate = 2-aminomuconate semialdehyde + CO2. ; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 2QPX_A 4D8L_A 3K4W_I 3IRS_B 4DZI_B 3S4T_G 2GWG_B 3IJ6_A 2DVX_C 2DVT_C ....
Probab=33.16 E-value=1e+02 Score=27.62 Aligned_cols=148 Identities=15% Similarity=0.126 Sum_probs=70.6
Q ss_pred hhcHHHHHHHHH-HhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC-CccccccCCCCCc---
Q psy13819 104 PGRWCWIMRAMR-AAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG-PFICGKRDFGGFP--- 178 (323)
Q Consensus 104 ~~~W~~~l~~mk-~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G-Pyi~aEw~~gg~P--- 178 (323)
++...+.+++.. +.|+-.|+++... +.++........+++++|+|+|+-|++-+| +....+-.....|
T Consensus 83 ~~~~~~~l~~~~~~~g~~Gv~l~~~~-------~~~~~~~~~~~~~~~~~~~~~~~pv~~H~g~~~~~~~~~~~~~~~~~ 155 (273)
T PF04909_consen 83 PEDAVEELERALQELGFRGVKLHPDL-------GGFDPDDPRLDDPIFEAAEELGLPVLIHTGMTGFPDAPSDPADPEEL 155 (273)
T ss_dssp HHHHHHHHHHHHHTTTESEEEEESSE-------TTCCTTSGHCHHHHHHHHHHHT-EEEEEESHTHHHHHHHHHHHHHHH
T ss_pred chhHHHHHHHhccccceeeeEecCCC-------CccccccHHHHHHHHHHHHhhccceeeeccccchhhhhHHHHHHHHH
Confidence 445555555555 8999999985322 223333222225889999999999999875 0000000000011
Q ss_pred ccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcC-CCCCChHHHHHHHHHHHHHhcCCc
Q psy13819 179 PWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYG-SDAECDPAHAVWLRDLLRTYVQDK 257 (323)
Q Consensus 179 ~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g-~~~~~~~~Y~~~l~~~~~~~~g~~ 257 (323)
.-+..++|++++-..+-... ..++..++..++. .++|.. .+.--++ .+..........+..+++. .|.+
T Consensus 156 ~~~~~~~P~l~ii~~H~G~~---~~~~~~~~~l~~~-----~~nvy~-d~s~~~~~~~~~~~~~~~~~l~~~~~~-~g~d 225 (273)
T PF04909_consen 156 EELLERFPDLRIILAHLGGP---FPWWEEALRLLDR-----FPNVYV-DLSGIPPFWYFWPPSFDRPFLRRAVDE-FGPD 225 (273)
T ss_dssp TTHHHHSTTSEEEESGGGTT---HHHHHHHHHHHHH-----HTTEEE-ECHSHHSSEEEETTHHCHHHHHHHHHH-HTGG
T ss_pred HHHHHHhcCCeEEEecCccc---chhHHHHHHHHHh-----CCcccc-cccccccccccCcccccHHHHHHHHHH-hCCc
Confidence 11233478776655544443 3344444444332 123321 1111000 0000011223445555544 4888
Q ss_pred eEEEeeCCCCc
Q psy13819 258 AVLYSTDGAFD 268 (323)
Q Consensus 258 ~~l~t~dg~~~ 268 (323)
-+||-||.+..
T Consensus 226 rilfGSD~P~~ 236 (273)
T PF04909_consen 226 RILFGSDYPHP 236 (273)
T ss_dssp GEEEE--TTSS
T ss_pred eEEecCCCCCC
Confidence 89999998864
No 278
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=33.03 E-value=41 Score=37.85 Aligned_cols=36 Identities=19% Similarity=0.249 Sum_probs=24.1
Q ss_pred eEEEEcCcccccccCCCCCeeEEE-eccccccCCcceEEE
Q psy13819 7 GVLFANGRHWGRYWSEVGPQYSLF-VPEEFLKVGTNRITI 45 (323)
Q Consensus 7 g~~~vng~~lGryw~~~gpq~~ly-vP~~~l~~g~N~i~i 45 (323)
-.|||||+.||+.= -|=.-.-| |. ++|+.|.|+|.|
T Consensus 139 a~VwvNG~~VG~~~--g~~~pfefDIT-~~l~~G~N~LaV 175 (1021)
T PRK10340 139 FEVYVNGQYVGFSK--GSRLTAEFDIS-AMVKTGDNLLCV 175 (1021)
T ss_pred EEEEECCEEecccc--CCCccEEEEcc-hhhCCCccEEEE
Confidence 47999999999732 22222224 43 378999999876
No 279
>TIGR00559 pdxJ pyridoxine 5'-phosphate synthase. PdxJ is required in the biosynthesis of pyridoxine (vitamin B6), a precursor to the enzyme cofactor pyridoxal phosphate. ECOCYC describes the predicted reaction equation as 1-amino-propan-2-one-3-phosphate + deoxyxylulose-5-phosphate = pyridoxine-5'-phosphate. The product of that reaction is oxidized by PdxH to pyridoxal 5'-phosphate.
Probab=32.98 E-value=88 Score=29.29 Aligned_cols=48 Identities=19% Similarity=0.385 Sum_probs=0.0
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-EccCCcccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRPGPFICG 170 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~GPyi~a 170 (323)
+..++.++++|+.|+ -|++ .++|. ...++.|++.|-..| |-+|||..+
T Consensus 110 ~~l~~~i~~l~~~gI-~VSL-----FiDP~------------~~qi~~A~~~GAd~VELhTG~YA~a 158 (237)
T TIGR00559 110 DKLCELVKRFHAAGI-EVSL-----FIDAD------------KDQISAAAEVGADRIEIHTGPYANA 158 (237)
T ss_pred HHHHHHHHHHHHCCC-EEEE-----EeCCC------------HHHHHHHHHhCcCEEEEechhhhcC
No 280
>PLN02231 alanine transaminase
Probab=32.40 E-value=2.1e+02 Score=29.79 Aligned_cols=58 Identities=10% Similarity=0.079 Sum_probs=38.6
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
...+..++.++..+..|.++--+++. |-|.|.--.++-+- +.+++++|+++|+.+|..
T Consensus 253 ~d~~~Le~~l~~~~~~~~~~k~ivl~-nP~NPTG~vls~e~---l~~Iv~~a~~~~l~lI~D 310 (534)
T PLN02231 253 LEISELKKQLEDARSKGITVRALVVI-NPGNPTGQVLAEEN---QRDIVEFCKQEGLVLLAD 310 (534)
T ss_pred CCHHHHHHHHHHHhhcCCCeEEEEEe-CCCCCCCcCCCHHH---HHHHHHHHHHcCCEEEEE
Confidence 33444455555445545554334555 77888866666555 777899999999999976
No 281
>COG3142 CutC Uncharacterized protein involved in copper resistance [Inorganic ion transport and metabolism]
Probab=32.35 E-value=43 Score=31.31 Aligned_cols=63 Identities=16% Similarity=0.148 Sum_probs=44.3
Q ss_pred eEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 90 FQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 90 ~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
++..+|.+-|..-..+-+++|++++|++|+..|-+.. -...|.-|-.- ++++++.|+ ||-|..
T Consensus 58 IRPRgGdFvY~~~E~~iM~~DI~~~~~lG~~GVV~G~-----lt~dg~iD~~~---le~Li~aA~--gL~vTF 120 (241)
T COG3142 58 IRPRGGDFVYSDDELEIMLEDIRLARELGVQGVVLGA-----LTADGNIDMPR---LEKLIEAAG--GLGVTF 120 (241)
T ss_pred EecCCCCcccChHHHHHHHHHHHHHHHcCCCcEEEee-----ecCCCccCHHH---HHHHHHHcc--CCceee
Confidence 3557888888877778899999999999999999832 23345555554 445566555 555544
No 282
>KOG1411|consensus
Probab=31.97 E-value=59 Score=32.41 Aligned_cols=184 Identities=15% Similarity=0.224 Sum_probs=93.5
Q ss_pred ceeEEEEcCcccccccCCCCCeeEEEeccc-cccCCcceEEEEEEeeCC-------CC--ccccchhhhhhccccccceE
Q psy13819 5 SKGVLFANGRHWGRYWSEVGPQYSLFVPEE-FLKVGTNRITIFELTRAP-------DK--YDVDFVDKISQRRARMSRTF 74 (323)
Q Consensus 5 ~kg~~~vng~~lGryw~~~gpq~~lyvP~~-~l~~g~N~i~i~e~~~~~-------~~--~~i~~~~~~i~~~~~~~~~~ 74 (323)
|.|-.-|-+.-|-||++ +.++|+|.| |-+. --||...+.. ++ ..++|....-+....
T Consensus 129 GTGaLriga~Fl~~f~~----~~~I~ip~PTWgNh----~~if~~ag~~~~~yrYyd~~t~gld~~g~ledl~~~----- 195 (427)
T KOG1411|consen 129 GTGALRVGAEFLARFYP----SRDIYIPDPTWGNH----KNIFKDAGLPVKFYRYYDPKTRGLDFKGMLEDLGEA----- 195 (427)
T ss_pred CcchhhHHHHHHHhhcc----ccceeecCCccccc----CccccccCcceeeeeeccccccccchHHHHHHHhcC-----
Confidence 44555566666788884 889999998 4332 2244433311 01 112222111111111
Q ss_pred EEEecCCeEEeCCeeeEEeeeeecCCC-C--ChhcHHHHHHHHHHhCCC-EEEEEEeCCCcccCCCeecccccchHHHHH
Q psy13819 75 AIDLAGDTFRLNEDPFQFVSGSFHYFR-A--PPGRWCWIMRAMRAAGLN-ALSTYVEWRSHEVHPGHYHYDGHRDIEHFL 150 (323)
Q Consensus 75 ~v~~~~~~~~~dGk~~~i~~g~~Hy~r-~--p~~~W~~~l~~mk~~G~N-~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl 150 (323)
-+.-+.++.+|-|.+. + ..+.|++..+.|++..+= .+.. ...-.+-|..|- -..-+
T Consensus 196 -----------p~gs~ilLhaCaHNPTGvDPt~eqw~ki~~~~~~k~~~pffDm----AYQGfaSG~~d~-----DA~av 255 (427)
T KOG1411|consen 196 -----------PEGSIILLHACAHNPTGVDPTKEQWEKISDLIKEKNLLPFFDM----AYQGFASGDLDK-----DAQAV 255 (427)
T ss_pred -----------CCCcEEEeehhhcCCCCCCccHHHHHHHHHHhhhccccchhhh----hhcccccCCchh-----hHHHH
Confidence 1133778999999986 3 378999999999986432 1111 111122222222 23457
Q ss_pred HHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCC
Q psy13819 151 QLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVEN 230 (323)
Q Consensus 151 ~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieN 230 (323)
+++.+.|..++|.-. |. .+=|+ +-.+ .....+-+.++.-..+++.=++.++. +. |. .|||-+-+|-|
T Consensus 256 R~F~~~g~~~~laQS-yA----KNMGL--YgER-vGa~svvc~~ad~A~rV~SQlk~liR---pm-YS-nPP~hGArIv~ 322 (427)
T KOG1411|consen 256 RLFVEDGHEILLAQS-YA----KNMGL--YGER-VGALSVVCKDADEAKRVESQLKILIR---PM-YS-NPPLHGARIVA 322 (427)
T ss_pred HHHHHcCCceEeehh-hh----hhcch--hhhc-cceeEEEecCHHHHHHHHHHHHHHhc---cc-cc-CCCccchhhhh
Confidence 888888888877531 00 11111 0011 12222334555544454444443333 32 44 57788888777
Q ss_pred CcCC
Q psy13819 231 EYGS 234 (323)
Q Consensus 231 E~g~ 234 (323)
..=+
T Consensus 323 ~Il~ 326 (427)
T KOG1411|consen 323 TILS 326 (427)
T ss_pred hccC
Confidence 7543
No 283
>PRK00870 haloalkane dehalogenase; Provisional
Probab=31.76 E-value=1.3e+02 Score=27.82 Aligned_cols=68 Identities=16% Similarity=0.199 Sum_probs=42.8
Q ss_pred CCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCC---CeecccccchHHHHHHHHHHCCCe
Q psy13819 86 NEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHP---GHYHYDGHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~---G~fdf~g~~dl~~fl~~a~e~GL~ 159 (323)
+|+++.+++|- -.....|..-++.+.+.|+++|....+--.....+ ..|+++. ..+.+.+++++.|+.
T Consensus 45 ~~~~lvliHG~----~~~~~~w~~~~~~L~~~gy~vi~~Dl~G~G~S~~~~~~~~~~~~~--~a~~l~~~l~~l~~~ 115 (302)
T PRK00870 45 DGPPVLLLHGE----PSWSYLYRKMIPILAAAGHRVIAPDLIGFGRSDKPTRREDYTYAR--HVEWMRSWFEQLDLT 115 (302)
T ss_pred CCCEEEEECCC----CCchhhHHHHHHHHHhCCCEEEEECCCCCCCCCCCCCcccCCHHH--HHHHHHHHHHHcCCC
Confidence 36788888883 23356799888888888999999866643333222 2355543 233345556666774
No 284
>cd06416 GH25_Lys1-like Lys-1 is a lysozyme encoded by the Caenorhabditis elegans lys-1 gene. This gene is one of a several lysozyme genes upregulated upon infection by the Gram-negative bacterial pathogen Serratia marcescens. Lys-1 contains a glycosyl hydrolase family 25 (GH25) catalytic domain. This family also includes Lys-5 from Caenorhabditis elegans.
Probab=31.63 E-value=3.6e+02 Score=23.67 Aligned_cols=122 Identities=11% Similarity=0.057 Sum_probs=65.4
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCC
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPD 187 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~ 187 (323)
..+.+.+|+.|+..|=+-. -|-. +..| .....-++.|++.||.+ |-|. ... +.
T Consensus 12 ~i~w~~vk~~g~~fv~ika----teg~-~~~D----~~f~~n~~~A~~aGl~~----G~Yh------------f~~--~~ 64 (196)
T cd06416 12 VSTFQCLKNNGYSFAIIRA----YRSN-GSFD----PNSVTNIKNARAAGLST----DVYF------------FPC--IN 64 (196)
T ss_pred hhhhhHHHhCCceEEEEEE----EccC-CccC----hHHHHHHHHHHHcCCcc----ceEE------------Eec--CC
Confidence 3567889999988766632 1110 1122 23566689999999866 4331 111 10
Q ss_pred eeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCC---hHHHHHHHHHHHHHhcCCceEEEeeC
Q psy13819 188 ILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAEC---DPAHAVWLRDLLRTYVQDKAVLYSTD 264 (323)
Q Consensus 188 ~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~---~~~Y~~~l~~~~~~~~g~~~~l~t~d 264 (323)
+ ...-.+.++.+++.+... ....+.+.+.||-.-+..... -.+.++...+.++++ |..+.+||+-
T Consensus 65 ----~-~~~~~~Qa~~f~~~~~~~------~~~~~~i~lDiE~~~~~~~~~~~~~~~~~~~f~~~~~~~-G~~~~iYt~~ 132 (196)
T cd06416 65 ----C-CGSAAGQVQTFLQYLKAN------GIKYGTVWIDIEQNPCQWSSDVASNCQFLQELVSAAKAL-GLKVGIYSSQ 132 (196)
T ss_pred ----C-CCCHHHHHHHHHHHHHhC------CCceeEEEEEEecCCCCCcCCHHHHHHHHHHHHHHHHHh-CCeEEEEcCc
Confidence 0 012245666666665331 222334456777432221111 135566666677776 9899999975
Q ss_pred CCCc
Q psy13819 265 GAFD 268 (323)
Q Consensus 265 g~~~ 268 (323)
.-+.
T Consensus 133 ~~w~ 136 (196)
T cd06416 133 YDWS 136 (196)
T ss_pred chhc
Confidence 4443
No 285
>PRK15108 biotin synthase; Provisional
Probab=31.60 E-value=77 Score=30.99 Aligned_cols=49 Identities=12% Similarity=0.130 Sum_probs=31.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc----cccchHHHHHHHHHHCCCeEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY----DGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf----~g~~dl~~fl~~a~e~GL~vi 161 (323)
++.++++|++|++.+.+. +|..|+.|.- ...++.-+.++.|++.|+.+-
T Consensus 136 ~e~l~~LkeAGld~~n~~-----leT~p~~f~~I~~~~~~~~rl~~i~~a~~~G~~v~ 188 (345)
T PRK15108 136 ESQAQRLANAGLDYYNHN-----LDTSPEFYGNIITTRTYQERLDTLEKVRDAGIKVC 188 (345)
T ss_pred HHHHHHHHHcCCCEEeec-----cccChHhcCCCCCCCCHHHHHHHHHHHHHcCCcee
Confidence 677888889999887772 3334444421 122345566778888888653
No 286
>PLN02540 methylenetetrahydrofolate reductase
Probab=31.59 E-value=1.5e+02 Score=31.29 Aligned_cols=94 Identities=14% Similarity=0.071 Sum_probs=59.4
Q ss_pred cHHHHHHHHH---HhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCC--CeEEEccCCccc-------cccC
Q psy13819 106 RWCWIMRAMR---AAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEED--LYILLRPGPFIC-------GKRD 173 (323)
Q Consensus 106 ~W~~~l~~mk---~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~G--L~vilr~GPyi~-------aEw~ 173 (323)
..+.+++.|| ++|.+.+-|-.+ ||.+- +.+|++.|++.| +.+|...-|... +.+.
T Consensus 154 ~~~~dl~~Lk~KvdAGAdFiITQlf----------FD~d~---f~~f~~~~r~~Gi~vPIipGImPI~S~k~l~r~~~l~ 220 (565)
T PLN02540 154 AYQKDLAYLKEKVDAGADLIITQLF----------YDTDI---FLKFVNDCRQIGITCPIVPGIMPINNYKGFLRMTGFC 220 (565)
T ss_pred ChHHHHHHHHHHHHcCCCEEeeccc----------cCHHH---HHHHHHHHHhcCCCCCEEeeecccCCHHHHHHHHhcc
Confidence 3444555444 589999988322 44433 789999999999 567777777543 3466
Q ss_pred CCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
+--+|.|+.+.... ...++...++.-.++..++++.|.+
T Consensus 221 Gi~IP~~i~~rLe~--~kddde~v~~~Gieia~e~~~~L~~ 259 (565)
T PLN02540 221 KTKIPAEITAALEP--IKDNDEAVKAYGIHLGTEMCKKILA 259 (565)
T ss_pred CCcCCHHHHHHHHh--cCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 66679888763111 1234455666666777777776543
No 287
>cd04883 ACT_AcuB C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. This CD includes the C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. AcuB is putatively involved in the anaerobic catabolism of acetoin, and related proteins. Studies report the induction of AcuB by nitrate respiration and also by fermentation. Since acetoin can be secreted and later serve as a source of carbon, it has been proposed that, during anaerobic growth when other carbon sources are exhausted, the induction of the AcuB protein results in acetoin catabolism. AcuB-like proteins have two N-terminal tandem CBS domains and a single C-terminal ACT domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=31.55 E-value=1.8e+02 Score=20.59 Aligned_cols=53 Identities=13% Similarity=0.147 Sum_probs=36.7
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCC--C--eecccccchHHHHHHHHHHCCCeEE
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHP--G--HYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~--G--~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
....+.++.+.+.|+|...++.. .++.+ . .+..+. ++.+++++..++.|..|+
T Consensus 13 G~l~~i~~~l~~~~inI~~i~~~---~~~~~~~~~v~i~v~~-~~~~~~~~~L~~~G~~v~ 69 (72)
T cd04883 13 GQLADIAAIFKDRGVNIVSVLVY---PSKEEDNKILVFRVQT-MNPRPIIEDLRRAGYEVL 69 (72)
T ss_pred CHHHHHHHHHHHcCCCEEEEEEe---ccCCCCeEEEEEEEec-CCHHHHHHHHHHCCCeee
Confidence 45667788899999999999653 22222 2 334443 456688999999998664
No 288
>PLN02607 1-aminocyclopropane-1-carboxylate synthase
Probab=31.42 E-value=2.4e+02 Score=28.46 Aligned_cols=60 Identities=10% Similarity=0.148 Sum_probs=44.7
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
....+..++.++.+++-|.++=-+++. |-|.|.--.|+-+- +.++++.|+++++.+|.+-
T Consensus 181 ~~~~~~le~a~~~a~~~~~~vk~lll~-nP~NPtG~~~s~e~---l~~l~~~~~~~~i~lI~DE 240 (447)
T PLN02607 181 QVTPQALEAAYQEAEAANIRVRGVLIT-NPSNPLGATVQRSV---LEDILDFVVRKNIHLVSDE 240 (447)
T ss_pred cCCHHHHHHHHHHHHHhCCCeeEEEEe-CCCCCcCcccCHHH---HHHHHHHHHHCCCEEEEec
Confidence 455667777787777778875445655 66888766676554 7888999999999999864
No 289
>PF08821 CGGC: CGGC domain; InterPro: IPR014925 Proteins in this entry are a quite highly conserved sequence of CGGC in its central region. The region has many conserved cysteines and histidines suggestive of a zinc binding function.
Probab=31.15 E-value=1.4e+02 Score=24.36 Aligned_cols=57 Identities=9% Similarity=0.161 Sum_probs=38.6
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHC-CCeEEEcc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEE-DLYILLRP 164 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~-GL~vilr~ 164 (323)
.|.+.+...+++|+..|+.+|.+..==-...|.+ .- -+++.+.+..++. |+.||..+
T Consensus 49 Cpg~~~~~~~~~l~~~~~d~IHlssC~~~~~~~~-~C-----P~~~~~~~~I~~~~gi~VV~GT 106 (107)
T PF08821_consen 49 CPGRKLVRRIKKLKKNGADVIHLSSCMVKGNPHG-PC-----PHIDEIKKIIEEKFGIEVVEGT 106 (107)
T ss_pred CChhHHHHHHHHHHHCCCCEEEEcCCEecCCCCC-CC-----CCHHHHHHHHHHHhCCCEeeec
Confidence 4578899999999999999998832211222211 11 1267778888887 99998754
No 290
>cd06547 GH85_ENGase Endo-beta-N-acetylglucosaminidase (ENGase) hydrolyzes the N-N'-diacetylchitobiosyl core of N-glycosylproteins. The beta-1,4-glycosyl bond located between two N-acetylglucosamine residues is hydrolyzed such that N-acetylglucosamine 1 remains with the protein and N-acetylglucosamine 2 forms the reducing end of the released glycan. ENGase is a key enzyme in the processing of free oligosaccharides in the cytosol of eukaryotes. Oligosaccharides formed in the lumen of the endoplasmic reticulum are transported into the cytosol where they are catabolized by cytosolic ENGases and other enzymes, possibly to maximize the reutilization of the component sugars. ENGases have an eight-stranded alpha/beta barrel topology and are classified as a family 85 glycosyl hydrolase (GH85) domain. The GH85 ENGases are sequence-similar to the family 18 glycosyl hydrolases, also known as GH18 chitinases. An ENGase-like protein is also found in bacteria and is included in this alignment mod
Probab=31.06 E-value=1.2e+02 Score=29.72 Aligned_cols=115 Identities=18% Similarity=0.221 Sum_probs=63.0
Q ss_pred EEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHH
Q psy13819 121 ALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRY 200 (323)
Q Consensus 121 ~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~ 200 (323)
.|.+.+.|+.+--+. = =...++.|+++|++|+-.. .-||+ +-..|+..- +. ++ .+.
T Consensus 32 yvD~fvywsh~~~~i------P---p~~~idaAHknGV~Vlgti----~~e~~--~~~~~~~~l-----L~-~~---~~~ 87 (339)
T cd06547 32 YVDTFVYFSHSAVTI------P---PADWINAAHRNGVPVLGTF----IFEWT--GQVEWLEDF-----LK-KD---EDG 87 (339)
T ss_pred hhheeecccCccccC------C---CcHHHHHHHhcCCeEEEEE----EecCC--CchHHHHHH-----hc-cC---ccc
Confidence 467788887743220 0 0234899999999998543 12443 334555331 11 11 113
Q ss_pred HHHHHHHHHHHHHhhhccCCCcEEEEccCCCcC-CC-CCChHHHHHHHHHHHHHh-cCCceEEEee
Q psy13819 201 VTRWFQELFPRIQKYLYGNDRPIILVQVENEYG-SD-AECDPAHAVWLRDLLRTY-VQDKAVLYST 263 (323)
Q Consensus 201 ~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g-~~-~~~~~~Y~~~l~~~~~~~-~g~~~~l~t~ 263 (323)
..+++++|++..+.+-+ .| +.+-+||..+ .. ...-.++++.|++.+++. -+..++.|.+
T Consensus 88 ~~~~a~kLv~lak~yGf--DG--w~iN~E~~~~~~~~~~~l~~F~~~L~~~~~~~~~~~~v~WYDs 149 (339)
T cd06547 88 SFPVADKLVEVAKYYGF--DG--WLINIETELGDAEKAKRLIAFLRYLKAKLHENVPGSLVIWYDS 149 (339)
T ss_pred chHHHHHHHHHHHHhCC--Cc--eEeeeeccCCcHHHHHHHHHHHHHHHHHHhhcCCCcEEEEEec
Confidence 45667777777665543 23 4445566652 11 011156777777777653 2466788876
No 291
>PF01075 Glyco_transf_9: Glycosyltransferase family 9 (heptosyltransferase); InterPro: IPR002201 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 9 GT9 from CAZY comprises enzymes with two known activity; lipopolysaccharide N-acetylglucosaminyltransferase (2.4.1.56 from EC), heptosyltransferase (2.4 from EC). Heptosyltransferase I is thought to add L-glycero-D-manno-heptose to the inner 3-deoxy-D-manno-octulosonic acid (Kdo) residue of the lipopolysaccharide core []. Heptosyltransferase II is a glycosyltransferase involved in the synthesis of the inner core region of lipopolysaccharide []. Lipopolysaccharide is a major component of the outer leaflet of the outer membrane in Gram-negative bacteria. It is composed of three domains; lipid A, Core oligosaccharide and the O-antigen. These enzymes transfer heptose to the lipopolysaccharide core [].; GO: 0016757 transferase activity, transferring glycosyl groups, 0008152 metabolic process; PDB: 1PSW_A 2H1F_A 2GT1_A 3TOV_A 2H1H_A.
Probab=31.03 E-value=48 Score=29.80 Aligned_cols=74 Identities=19% Similarity=0.325 Sum_probs=49.1
Q ss_pred eEEeeeeecC-CCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcc--------cCCC----eecccccchHHHHHHHHHHC
Q psy13819 90 FQFVSGSFHY-FRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHE--------VHPG----HYHYDGHRDIEHFLQLAVEE 156 (323)
Q Consensus 90 ~~i~~g~~Hy-~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hE--------P~~G----~fdf~g~~dl~~fl~~a~e~ 156 (323)
+.+.-|+-+. =++|.+.|.+.++.+++.| +.+.+.|.-.| -.++ ..+..+..++..++.+.+..
T Consensus 108 i~i~~~a~~~~k~wp~e~~~~l~~~l~~~~---~~vvl~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~e~~ali~~a 184 (247)
T PF01075_consen 108 IGINPGASWPSKRWPAEKWAELIERLKERG---YRVVLLGGPEEQEKEIADQIAAGLQNPVINLAGKTSLRELAALISRA 184 (247)
T ss_dssp EEEE---SSGGGS--HHHHHHHHHHHCCCT----EEEE--SSHHHHHHHHHHHHTTHTTTTEEETTTS-HHHHHHHHHTS
T ss_pred EEEeecCCCccccCCHHHHHHHHHHHHhhC---ceEEEEccchHHHHHHHHHHHHhcccceEeecCCCCHHHHHHHHhcC
Confidence 3333344443 3689999999999999988 66678887776 1122 57787878899999999999
Q ss_pred CCeEEEccCC
Q psy13819 157 DLYILLRPGP 166 (323)
Q Consensus 157 GL~vilr~GP 166 (323)
.+.|-...||
T Consensus 185 ~~~I~~Dtg~ 194 (247)
T PF01075_consen 185 DLVIGNDTGP 194 (247)
T ss_dssp SEEEEESSHH
T ss_pred CEEEecCChH
Confidence 9988888876
No 292
>COG1306 Uncharacterized conserved protein [Function unknown]
Probab=30.83 E-value=2.5e+02 Score=27.53 Aligned_cols=97 Identities=13% Similarity=0.248 Sum_probs=63.6
Q ss_pred CChhcHHHHHHHHHH---hCCCEEEE-EEeCCCccc-------CCCeecccccchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 102 APPGRWCWIMRAMRA---AGLNALST-YVEWRSHEV-------HPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~---~G~N~Vr~-yv~W~~hEP-------~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
..+..|+--+..+|+ .|+|-|++ ||--..-.+ ...+++.+-.+.|..||..|+|+ |.|-+-.
T Consensus 190 y~~~~WeYNvtIAKEa~~fGfdEiQFDYIRFP~dg~~l~~A~~~~n~~~m~~~~Al~sfL~yArE~-l~vpIS~------ 262 (400)
T COG1306 190 YDKNLWEYNVTIAKEAAKFGFDEIQFDYIRFPADGGGLDKALNYRNTDNMTKSEALQSFLHYAREE-LEVPISA------ 262 (400)
T ss_pred cchhhhhhhHHHHHHHHHcCccceeeeEEEccCCCCchhhhhcccccccCChHHHHHHHHHHHHHh-cccceEE------
Confidence 457889876666655 59999999 887755444 24567777778899999999884 3332222
Q ss_pred ccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHh
Q psy13819 171 KRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQK 214 (323)
Q Consensus 171 Ew~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~ 214 (323)
+-.|.-.|.... +. --.+.+....+++.+.+..-|
T Consensus 263 --DIYG~nGw~~t~---~~----~GQ~~e~ls~yVDvIsPMfYP 297 (400)
T COG1306 263 --DIYGQNGWSSTD---MA----LGQFWEALSSYVDVISPMFYP 297 (400)
T ss_pred --EeecccCccCCc---ch----hhhhHHHHHhhhhhccccccc
Confidence 235666787652 11 124567777777777666544
No 293
>COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]
Probab=30.72 E-value=1.9e+02 Score=27.75 Aligned_cols=57 Identities=18% Similarity=0.209 Sum_probs=45.2
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc--chHHHHHHHHHHCCCeEEEcc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH--RDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~--~dl~~fl~~a~e~GL~vilr~ 164 (323)
.|+....-+..|++|.+.+|-. .--|+---|||.|. +.|.-.-+.++++||.|+.+.
T Consensus 58 ~E~i~~~A~~vk~~Ga~~lRGg----afKPRTSPYsFQGlge~gL~~l~~a~~~~Gl~vvtEv 116 (286)
T COG2876 58 EEQVRETAESVKAAGAKALRGG----AFKPRTSPYSFQGLGEEGLKLLKRAADETGLPVVTEV 116 (286)
T ss_pred HHHHHHHHHHHHHcchhhccCC----cCCCCCCcccccccCHHHHHHHHHHHHHcCCeeEEEe
Confidence 4677788888999999999995 44666677899875 566666777889999998765
No 294
>PF01183 Glyco_hydro_25: Glycosyl hydrolases family 25; InterPro: IPR002053 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 25 GH25 from CAZY comprises enzymes with only one known activity; lysozyme (3.2.1.17 from EC). It has been shown [, ] that a number of cell-wall lytic enzymes are evolutionary related and can be classified into a single family. Two residues, an aspartate and a glutamate, have been shown [] to be important for the catalytic activity of the Charalopsis enzyme. These residues as well as some others in their vicinity are conserved in all proteins from this family.; GO: 0003796 lysozyme activity, 0009253 peptidoglycan catabolic process, 0016998 cell wall macromolecule catabolic process; PDB: 1JFX_A 2WW5_A 2WWD_A 2WWC_A 2X8R_D 2J8F_A 1OBA_A 2IXU_A 2J8G_A 2IXV_A ....
Probab=30.65 E-value=3.3e+02 Score=23.40 Aligned_cols=116 Identities=13% Similarity=0.130 Sum_probs=59.9
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeee
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILL 190 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~ 190 (323)
++++|+.|+..|-+-.. +|..-. ...+..-++.|++.||.| |-|. +.. +
T Consensus 12 w~~~k~~gi~fviikat-------eG~~~~--D~~~~~n~~~a~~aGl~~----G~Yh------------f~~--~---- 60 (181)
T PF01183_consen 12 WQKVKAAGIDFVIIKAT-------EGTSYV--DPYFESNIKNAKAAGLPV----GAYH------------FAR--A---- 60 (181)
T ss_dssp HHHHHHTTEEEEEEEEE-------ETTTEE---TTHHHHHHHHHHTTSEE----EEEE------------E----T----
T ss_pred HHHHHHCCCCEEEEEee-------eCCCee--cchHHHHHHHHHHcCCeE----EEEE------------Eec--c----
Confidence 67789999887766432 232111 123677789999999975 3231 111 0
Q ss_pred cCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCC--cCCCCCChHHHHHHHHHHHHHhcCCceEEEeeC
Q psy13819 191 RQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENE--YGSDAECDPAHAVWLRDLLRTYVQDKAVLYSTD 264 (323)
Q Consensus 191 R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE--~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~d 264 (323)
++...-.++++.|++.+- + ...+.-.+++.+|.. .+.-.....+.++...+.+++..|..+.+|++-
T Consensus 61 -~~~~~a~~qA~~f~~~~~----~--~~~~~~~~~lD~E~~~~~~~~~~~~~~~~~~f~~~~~~~~G~~~~iY~~~ 129 (181)
T PF01183_consen 61 -TNSSDAEAQADYFLNQVK----G--GDPGDLPPALDVEDDKSNNPSKSDNTAWVKAFLDEVEKAAGYKPGIYTSK 129 (181)
T ss_dssp -TTHCHHHHHHHHHHHCTH----T--SSTSCS-EEEEE-S-GGCCSSHHHHHHHHHHHHHHHHHHCTSEEEEEEEH
T ss_pred -CCcccHHHHHHHHHHHhc----c--cCCCcceEEEeccccccCCCCHHHHHHHHHHHHHHHHHHhCCceeEeecH
Confidence 011122445544444432 1 122333357788854 111111124556666666666679988899973
No 295
>COG0413 PanB Ketopantoate hydroxymethyltransferase [Coenzyme metabolism]
Probab=30.57 E-value=1.6e+02 Score=28.06 Aligned_cols=65 Identities=22% Similarity=0.337 Sum_probs=49.8
Q ss_pred cCCeEEeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCC
Q psy13819 79 AGDTFRLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDL 158 (323)
Q Consensus 79 ~~~~~~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL 158 (323)
.++.|.+-+-||.- |.-.+.+..+.-.+.||+.|.|+|-+ +|-..+...++...+.|+
T Consensus 74 a~~~~vv~DmPF~s------y~~s~~~a~~nA~r~~ke~gA~aVKl----------------EGG~~~~~~i~~L~~~gI 131 (268)
T COG0413 74 APNAFVVADLPFGS------YEVSPEQALKNAARLMKEAGADAVKL----------------EGGEEMAETIKRLTERGI 131 (268)
T ss_pred CCCeeEEeCCCCcc------cCCCHHHHHHHHHHHHHHhCCCEEEE----------------cCCHHHHHHHHHHHHcCC
Confidence 35677777777753 33355678899999999999999998 233457888999999999
Q ss_pred eEEEccC
Q psy13819 159 YILLRPG 165 (323)
Q Consensus 159 ~vilr~G 165 (323)
-|+-..|
T Consensus 132 PV~gHiG 138 (268)
T COG0413 132 PVMGHIG 138 (268)
T ss_pred ceEEEec
Confidence 9996553
No 296
>TIGR03581 EF_0839 conserved hypothetical protein EF_0839/AHA_3917. Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.
Probab=30.57 E-value=1.1e+02 Score=28.39 Aligned_cols=73 Identities=12% Similarity=0.240 Sum_probs=48.8
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc---chHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH---RDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLL 182 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~---~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~ 182 (323)
.-+.-++++|+||.+.|.++ .-.|. +.+...-+.|.++|++ |.| +||+-
T Consensus 136 ~vetAiaml~dmG~~SiKff-------------PM~Gl~~leE~~avA~aca~~g~~--lEP---------TGGId---- 187 (236)
T TIGR03581 136 PIETAIAMLKDMGGSSVKFF-------------PMGGLKHLEEYAAVAKACAKHGFY--LEP---------TGGID---- 187 (236)
T ss_pred eHHHHHHHHHHcCCCeeeEe-------------ecCCcccHHHHHHHHHHHHHcCCc--cCC---------CCCcc----
Confidence 34678999999999999984 23343 4566777889999997 477 57752
Q ss_pred ccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhc
Q psy13819 183 KVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLY 217 (323)
Q Consensus 183 ~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~ 217 (323)
-+.|.+-++-.+++=++++-|+.|
T Consensus 188 -----------l~Nf~~I~~i~ldaGv~kviPHIY 211 (236)
T TIGR03581 188 -----------LDNFEEIVQIALDAGVEKVIPHVY 211 (236)
T ss_pred -----------HHhHHHHHHHHHHcCCCeeccccc
Confidence 134444555555555566555555
No 297
>COG0602 NrdG Organic radical activating enzymes [Posttranslational modification, protein turnover, chaperones]
Probab=30.44 E-value=2.5e+02 Score=25.45 Aligned_cols=77 Identities=22% Similarity=0.239 Sum_probs=51.9
Q ss_pred CeeeEEee--eeecC--C-------------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHH
Q psy13819 87 EDPFQFVS--GSFHY--F-------------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHF 149 (323)
Q Consensus 87 Gk~~~i~~--g~~Hy--~-------------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~f 149 (323)
|.|..++. ||.+- . ...+..-++.++++++.+...-. |.=..=||. -..++..+
T Consensus 21 Gr~~vFVR~~GC~l~C~~Cdt~~t~~~~~~~~~~~~~~~~I~~~i~~~~~~~~~--V~lTGGEP~-------~~~~l~~L 91 (212)
T COG0602 21 GRPSVFVRFAGCNLRCPGCDTKYTWDFNYGKPGTPMSADEILADIKSLGYKARG--VSLTGGEPL-------LQPNLLEL 91 (212)
T ss_pred cceeEEEEcCCCCCCCCCCCChhhhcccccCCCCccCHHHHHHHHHhcCCCcce--EEEeCCcCC-------CcccHHHH
Confidence 67777777 87772 1 12234557889999997555542 222366662 12358899
Q ss_pred HHHHHHCCCeEEEccCCccccccCCCCCcccc
Q psy13819 150 LQLAVEEDLYILLRPGPFICGKRDFGGFPPWL 181 (323)
Q Consensus 150 l~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl 181 (323)
+++.++.|+++.+.+ +|.+|.|.
T Consensus 92 l~~l~~~g~~~~lET---------ngti~~~~ 114 (212)
T COG0602 92 LELLKRLGFRIALET---------NGTIPVWT 114 (212)
T ss_pred HHHHHhCCceEEecC---------CCCccccc
Confidence 999999999999988 46677664
No 298
>PF09587 PGA_cap: Bacterial capsule synthesis protein PGA_cap; InterPro: IPR019079 CapA is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein [].
Probab=30.38 E-value=4.2e+02 Score=24.14 Aligned_cols=81 Identities=11% Similarity=0.116 Sum_probs=57.9
Q ss_pred CeEEeCCeeeEEeeeeecCCCCC-------------------------hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCC
Q psy13819 81 DTFRLNEDPFQFVSGSFHYFRAP-------------------------PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHP 135 (323)
Q Consensus 81 ~~~~~dGk~~~i~~g~~Hy~r~p-------------------------~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~ 135 (323)
..+.++|.++-++|.+....... .+...++++++| -+...|=+++.|..
T Consensus 122 ~i~~~~g~kia~l~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~r-~~~D~vIv~~HwG~----- 195 (250)
T PF09587_consen 122 AIIEVNGVKIAFLGYTDGENGYSSANGNRPYGFSYRPDKAGLNPNRPGIERIKEDIREAR-KKADVVIVSLHWGI----- 195 (250)
T ss_pred EEEEECCEEEEEEEEEcCCCCCccccccccccccccccccccccccchHHHHHHHHHHHh-cCCCEEEEEeccCC-----
Confidence 44678999999999987742110 155778888888 68999999999952
Q ss_pred CeecccccchHHHHHHHHHHCCCeEEEccCCcc
Q psy13819 136 GHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFI 168 (323)
Q Consensus 136 G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi 168 (323)
+|...=.....++.+.+-+.|.-+|+.-+|.+
T Consensus 196 -e~~~~p~~~q~~~a~~lidaGaDiIiG~HpHv 227 (250)
T PF09587_consen 196 -EYENYPTPEQRELARALIDAGADIIIGHHPHV 227 (250)
T ss_pred -CCCCCCCHHHHHHHHHHHHcCCCEEEeCCCCc
Confidence 12222233456678888889999999877654
No 299
>PRK08508 biotin synthase; Provisional
Probab=30.36 E-value=86 Score=29.56 Aligned_cols=48 Identities=13% Similarity=0.086 Sum_probs=31.6
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--c--cchHHHHHHHHHHCCCeE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--G--HRDIEHFLQLAVEEDLYI 160 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g--~~dl~~fl~~a~e~GL~v 160 (323)
++.+++||++|++.+-. .+|..+..|.-- + .++..+.++.|++.||.+
T Consensus 102 ~e~l~~Lk~aGld~~~~-----~lEt~~~~~~~i~~~~~~~~~l~~i~~a~~~Gi~v 153 (279)
T PRK08508 102 VEQLKELKKAGIFSYNH-----NLETSKEFFPKICTTHTWEERFQTCENAKEAGLGL 153 (279)
T ss_pred HHHHHHHHHcCCCEEcc-----cccchHHHhcCCCCCCCHHHHHHHHHHHHHcCCee
Confidence 78899999999999887 344443222211 1 123445778899999865
No 300
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=30.31 E-value=4.3e+02 Score=28.53 Aligned_cols=134 Identities=21% Similarity=0.245 Sum_probs=61.8
Q ss_pred HHHHHHHH----hCCCEEEEEEeC---------CCcccCCCeecccccchHHHHHHHHHH--CCCeEEEccCCccccccC
Q psy13819 109 WIMRAMRA----AGLNALSTYVEW---------RSHEVHPGHYHYDGHRDIEHFLQLAVE--EDLYILLRPGPFICGKRD 173 (323)
Q Consensus 109 ~~l~~mk~----~G~N~Vr~yv~W---------~~hEP~~G~fdf~g~~dl~~fl~~a~e--~GL~vilr~GPyi~aEw~ 173 (323)
+.|+.+=. +.++.+-+.+-= ..|+..+++-|...-... .+++.|++ -+|++..-+
T Consensus 65 qILd~LFkp~~Gasl~~lKvEiGgD~nst~GtEPshm~~~~d~n~~RGye~-~L~~eAKkrNP~ikl~~L~--------- 134 (669)
T PF02057_consen 65 QILDYLFKPNFGASLQILKVEIGGDANSTSGTEPSHMHSEDDENYFRGYEW-WLMAEAKKRNPNIKLYGLP--------- 134 (669)
T ss_dssp HHHHHHHSTTTSS--SEEEEEE--SSB-SSSB---S-SSTT---S-SSSHH-HHHHHHHHH-TT-EEEEEE---------
T ss_pred HHHHHHhCCCCCceeeEEEEEecCCCCCCCCCCCcccccccccccccChhh-hhHHHHHhhCCCCeEEEec---------
Confidence 55555522 347777766543 245555666665532222 34667776 467766544
Q ss_pred CCCCcccccccCCCeeecCCCHh-HHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHH
Q psy13819 174 FGGFPPWLLKVAPDILLRQNHPV-YQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRT 252 (323)
Q Consensus 174 ~gg~P~Wl~~~~p~~~~R~~~~~-f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~ 252 (323)
-+.|.|+.... ..-..++. ...++.+|+.-..+. + +-.|--+-++||=+. +.+|.++|++++.+
T Consensus 135 -W~~PgW~~~g~---~~~~~~~~~~a~Y~~~wl~ga~~~-----~--gl~idYvg~~NEr~~----~~~~ik~lr~~l~~ 199 (669)
T PF02057_consen 135 -WGFPGWVGNGW---NWPYDNPQLTAYYVVSWLLGAKKT-----H--GLDIDYVGIWNERGF----DVNYIKWLRKALNS 199 (669)
T ss_dssp -S-B-GGGGTTS---S-TTSSHHHHHHHHHHHHHHHHHH-----H-------EE-S-TTS-------HHHHHHHHHHHHH
T ss_pred -cCCCccccCCC---CCcccchhhhhHHHHHHHHHHHHH-----h--CCCceEechhhccCC----ChhHHHHHHHHHhh
Confidence 36899997631 01112222 233455665432222 2 334545668999863 68999999999987
Q ss_pred hcCCc-eEEEeeCCCCc
Q psy13819 253 YVQDK-AVLYSTDGAFD 268 (323)
Q Consensus 253 ~~g~~-~~l~t~dg~~~ 268 (323)
. |.. +.+..+|+.+.
T Consensus 200 ~-gy~~vkiva~D~~~~ 215 (669)
T PF02057_consen 200 N-GYNKVKIVAADNNWE 215 (669)
T ss_dssp T-T-TT-EEEEEEE-ST
T ss_pred c-cccceEEEEeCCCcc
Confidence 4 544 67788887654
No 301
>COG3664 XynB Beta-xylosidase [Carbohydrate transport and metabolism]
Probab=30.25 E-value=1.1e+02 Score=30.99 Aligned_cols=133 Identities=12% Similarity=0.115 Sum_probs=80.0
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeee
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILL 190 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~ 190 (323)
+..-++.|+|-|+.+.-|...+. .=-|+|+. ++++++...+.|+ -+++.+ .+--+-.++-. .|-.+
T Consensus 11 ~t~~~Ei~v~yi~~~~v~h~~~q-~~~~~~t~---~d~i~d~~~~~~~-~~ie~~-l~~~~l~~~~~-~wq~n------- 76 (428)
T COG3664 11 RTTDDEIQVNYIRRHGVWHVNAQ-KLFYPFTY---IDEIIDTLLDLGL-DLIELF-LIWNNLNTKEH-QWQLN------- 76 (428)
T ss_pred cchhhhhceeeehhcceeeeeec-cccCChHH---HHHHHHHHHHhcc-HHHHHh-hcccchhhhhh-hcccc-------
Confidence 33457899999999999884444 56788888 6677999999884 333331 00000011111 23222
Q ss_pred cCCCHh-HHHHHHHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceEEEeeCCCC
Q psy13819 191 RQNHPV-YQRYVTRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAVLYSTDGAF 267 (323)
Q Consensus 191 R~~~~~-f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~ 267 (323)
..+|. +.+...++++.++.++..... ...-.-+.||...+ ....+|.+....++++- -|-+..||++
T Consensus 77 -~~~~~~~~dl~~~fl~h~~~~vg~e~v----~kw~f~~~~~pn~~-ad~~eyfk~y~~~a~~~----~p~i~vg~~w 144 (428)
T COG3664 77 -VDDPKSVFDLIAAFLKHVIRRVGVEFV----RKWPFYSPNEPNLL-ADKQEYFKLYDATARQR----APSIQVGGSW 144 (428)
T ss_pred -cCCcHhHHHHHHHHHHHHHHHhChhhe----eecceeecCCCCcc-cchHHHHHHHHhhhhcc----Ccceeecccc
Confidence 33444 888888888888888543322 23344578888877 33677887777777641 2444455544
No 302
>cd08183 Fe-ADH2 Iron-containing alcohol dehydrogenases-like. Iron-containing alcohol dehydrogenases (Fe-ADH). Alcohol dehydrogenase catalyzes the reduction of acetaldehyde to alcohol with NADP as cofactor. Its activity requires iron ions. The protein structure represents a dehydroquinate synthase-like fold and is a member of the iron-activated alcohol dehydrogenase-like family. They are distinct from other alcohol dehydrogenases which contains different protein domain. Proteins of this family have not been characterized. Their specific function is unknown. They are mainly found in bacteria.
Probab=30.21 E-value=1.2e+02 Score=29.70 Aligned_cols=62 Identities=19% Similarity=0.320 Sum_probs=41.9
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
|++++++.+.... .+++..+.+++.|+....+. ..++|- ..++++.++.+++.+..+|+..|
T Consensus 22 ~~r~livtd~~~~------~~~~v~~~L~~~g~~~~~~~-----~~~~p~------~~~v~~~~~~~~~~~~D~IIaiG 83 (374)
T cd08183 22 GRRVLLVTGASSL------RAAWLIEALRAAGIEVTHVV-----VAGEPS------VELVDAAVAEARNAGCDVVIAIG 83 (374)
T ss_pred CCcEEEEECCchH------HHHHHHHHHHHcCCeEEEec-----CCCCcC------HHHHHHHHHHHHhcCCCEEEEec
Confidence 4666666664332 45666777888998764432 223332 24688889999999999999876
No 303
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=30.18 E-value=72 Score=30.46 Aligned_cols=48 Identities=15% Similarity=-0.004 Sum_probs=35.6
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
-++.|++++++|+..|-+-++=. -+....+-++++++.|+++.|.|..
T Consensus 108 ~~~~f~~~~~~Gv~GvKidF~~~--------d~Q~~v~~y~~i~~~AA~~~Lmvnf 155 (273)
T PF10566_consen 108 LDEAFKLYAKWGVKGVKIDFMDR--------DDQEMVNWYEDILEDAAEYKLMVNF 155 (273)
T ss_dssp HHHHHHHHHHCTEEEEEEE--SS--------TSHHHHHHHHHHHHHHHHTT-EEEE
T ss_pred HHHHHHHHHHcCCCEEeeCcCCC--------CCHHHHHHHHHHHHHHHHcCcEEEe
Confidence 38899999999999999954311 2333456688999999999998877
No 304
>PF13987 YedD: YedD-like protein
Probab=30.15 E-value=16 Score=29.70 Aligned_cols=19 Identities=42% Similarity=0.875 Sum_probs=15.9
Q ss_pred cccccCCCCCeeEEEeccc
Q psy13819 16 WGRYWSEVGPQYSLFVPEE 34 (323)
Q Consensus 16 lGryw~~~gpq~~lyvP~~ 34 (323)
|--||++.|||..|.-|..
T Consensus 10 L~G~WQs~GPQs~lvSpeA 28 (111)
T PF13987_consen 10 LAGYWQSKGPQSALVSPEA 28 (111)
T ss_pred cceeeeccCCcccccChhh
Confidence 4458999999999988875
No 305
>PRK09058 coproporphyrinogen III oxidase; Provisional
Probab=30.02 E-value=93 Score=31.51 Aligned_cols=52 Identities=17% Similarity=0.026 Sum_probs=33.2
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCC-Ceeccc----ccchHHHHHHHHHHCC-CeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHP-GHYHYD----GHRDIEHFLQLAVEED-LYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~-G~fdf~----g~~dl~~fl~~a~e~G-L~vilr~ 164 (323)
++.|+.||++|+|.|++.|- .-- .+-..- ...++.+.++.+++.| ..|.+..
T Consensus 163 ~e~l~~l~~aGvnRiSiGVQ-----Sf~d~vLk~lgR~~~~~~~~~~i~~l~~~g~~~v~~Dl 220 (449)
T PRK09058 163 DEKADAALDAGANRFSIGVQ-----SFNTQVRRRAGRKDDREEVLARLEELVARDRAAVVCDL 220 (449)
T ss_pred HHHHHHHHHcCCCEEEecCC-----cCCHHHHHHhCCCCCHHHHHHHHHHHHhCCCCcEEEEE
Confidence 57899999999999999432 210 000000 1245777889999999 5566654
No 306
>PRK06294 coproporphyrinogen III oxidase; Provisional
Probab=29.89 E-value=1.1e+02 Score=29.97 Aligned_cols=53 Identities=17% Similarity=0.154 Sum_probs=34.7
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc----cchHHHHHHHHHHCCCe-EEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG----HRDIEHFLQLAVEEDLY-ILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g----~~dl~~fl~~a~e~GL~-vilr~ 164 (323)
++.|+.+|++|+|.|++.|- .. .+.....-+ .+++.+.++.|++.|+. |-+..
T Consensus 103 ~~~l~~l~~~G~nrislGvQ--S~--~~~~L~~l~R~~~~~~~~~ai~~~~~~g~~~v~~Dl 160 (370)
T PRK06294 103 ESYIRALALTGINRISIGVQ--TF--DDPLLKLLGRTHSSSKAIDAVQECSEHGFSNLSIDL 160 (370)
T ss_pred HHHHHHHHHCCCCEEEEccc--cC--CHHHHHHcCCCCCHHHHHHHHHHHHHcCCCeEEEEe
Confidence 57799999999999999432 11 122222222 24577788999999996 54554
No 307
>cd08185 Fe-ADH1 Iron-containing alcohol dehydrogenases-like. Iron-containing alcohol dehydrogenases-like (ADH). Alcohol dehydrogenase catalyzes the reduction of acetaldehyde to alcohol with NADP as cofactor. Its activity requires iron ions. The protein structure represents a dehydroquinate synthase fold and is a member of the iron-containing alcohol dehydrogenase-like family. They are distinct from other alcohol dehydrogenases which contain different protein domains. Proteins of this family have not been characterized. Their specific function is unknown. They are present in bacteria and archaea.
Probab=29.38 E-value=2e+02 Score=28.26 Aligned_cols=68 Identities=18% Similarity=0.178 Sum_probs=45.7
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
||+.+++++...... ...+++..+.+++.|+....+ +..+|+|- .+++++.++++++.+..+|+..|-
T Consensus 25 g~r~livt~~~~~~~--~g~~~~v~~~L~~~~~~~~~~----~~v~~~p~------~~~v~~~~~~~~~~~~D~IiavGG 92 (380)
T cd08185 25 GKKALIVTGNGSSKK--TGYLDRVIELLKQAGVEVVVF----DKVEPNPT------TTTVMEGAALAREEGCDFVVGLGG 92 (380)
T ss_pred CCeEEEEeCCCchhh--ccHHHHHHHHHHHcCCeEEEe----CCccCCCC------HHHHHHHHHHHHHcCCCEEEEeCC
Confidence 356666666443211 245667777788888875432 46666543 455788899999999999998774
No 308
>PLN02763 hydrolase, hydrolyzing O-glycosyl compounds
Probab=29.30 E-value=2e+02 Score=32.53 Aligned_cols=67 Identities=12% Similarity=0.090 Sum_probs=50.8
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCCeEEEccCCccccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDLYILLRPGPFICGK 171 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL~vilr~GPyi~aE 171 (323)
.+..++.++.+++.|+-+=-+++-+....-. +.|.|+ ..-|..++++..++.|++++.-.-|+|..+
T Consensus 200 q~eV~eva~~fre~~IP~DvIwlDidYm~g~-~~FTwD~~rFPdP~~mv~~Lh~~G~kvv~iidPgI~~d 268 (978)
T PLN02763 200 AKRVAEIARTFREKKIPCDVVWMDIDYMDGF-RCFTFDKERFPDPKGLADDLHSIGFKAIWMLDPGIKAE 268 (978)
T ss_pred HHHHHHHHHHHHHcCCCceEEEEehhhhcCC-CceeECcccCCCHHHHHHHHHHCCCEEEEEEcCCCccC
Confidence 4567889999999999888887777665532 344443 346889999999999999887767777653
No 309
>PRK10426 alpha-glucosidase; Provisional
Probab=29.26 E-value=1.7e+02 Score=31.14 Aligned_cols=66 Identities=14% Similarity=0.242 Sum_probs=46.9
Q ss_pred hcHHHHHHHHHHhCCCEEEEEE-eCCCcccC----CCeecccc----cchHHHHHHHHHHCCCeEEEccCCcccc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYV-EWRSHEVH----PGHYHYDG----HRDIEHFLQLAVEEDLYILLRPGPFICG 170 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv-~W~~hEP~----~G~fdf~g----~~dl~~fl~~a~e~GL~vilr~GPyi~a 170 (323)
+..++.++++++.|+-+=-+++ .|...... ..-+||+. .-|.+++++..++.|+++++..=|+|+.
T Consensus 221 ~~v~~v~~~~r~~~IP~d~i~lddw~~~~~~~~g~~~~~~~~~d~~~FPdp~~mi~~L~~~G~k~v~~i~P~v~~ 295 (635)
T PRK10426 221 EVVQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGKRLMWNWKWDSERYPQLDSRIKQLNEEGIQFLGYINPYLAS 295 (635)
T ss_pred HHHHHHHHHHHHcCCCeeEEEEecccccccccccccccccceEChhhCCCHHHHHHHHHHCCCEEEEEEcCccCC
Confidence 3467889999999987766655 35432221 11224433 3578999999999999999998899886
No 310
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=28.89 E-value=37 Score=38.29 Aligned_cols=37 Identities=30% Similarity=0.529 Sum_probs=23.8
Q ss_pred eeEEEEcCcccccccCCCCCeeE-EE-eccccccCCcceEEEE
Q psy13819 6 KGVLFANGRHWGRYWSEVGPQYS-LF-VPEEFLKVGTNRITIF 46 (323)
Q Consensus 6 kg~~~vng~~lGryw~~~gpq~~-ly-vP~~~l~~g~N~i~i~ 46 (323)
.-.|||||+.+|.. .|--.. -| | .+.|++|.|+|+|-
T Consensus 150 ~a~VwvNG~~VG~~---~g~~~pfefDI-T~~l~~G~N~L~V~ 188 (1027)
T PRK09525 150 AFHLWCNGRWVGYS---QDSRLPAEFDL-SPFLRAGENRLAVM 188 (1027)
T ss_pred EEEEEECCEEEEee---cCCCceEEEEC-hhhhcCCccEEEEE
Confidence 45799999999953 231111 12 3 34689999988764
No 311
>PRK15108 biotin synthase; Provisional
Probab=28.82 E-value=1.9e+02 Score=28.33 Aligned_cols=57 Identities=18% Similarity=0.165 Sum_probs=41.3
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
++++...+..+.+++.|+..|.+-..|. +|. .-+++. +.++++.+++.|+.+++..|
T Consensus 76 ls~eEI~~~a~~~~~~G~~~i~i~~~g~--~p~--~~~~e~---i~~~i~~ik~~~i~v~~s~G 132 (345)
T PRK15108 76 MEVEQVLESARKAKAAGSTRFCMGAAWK--NPH--ERDMPY---LEQMVQGVKAMGLETCMTLG 132 (345)
T ss_pred CCHHHHHHHHHHHHHcCCCEEEEEecCC--CCC--cchHHH---HHHHHHHHHhCCCEEEEeCC
Confidence 5678888888888899999997766763 332 223444 66668888888998887776
No 312
>COG2179 Predicted hydrolase of the HAD superfamily [General function prediction only]
Probab=28.68 E-value=1.4e+02 Score=26.70 Aligned_cols=45 Identities=16% Similarity=0.409 Sum_probs=36.5
Q ss_pred HHHHHHHhCCCEEEE-----EEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 110 IMRAMRAAGLNALST-----YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~-----yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
+.+.+++.|+.+|=+ -|+|+.-+..| .+.+.++.++++|+++++-
T Consensus 19 ~~~~L~~~Gikgvi~DlDNTLv~wd~~~~tp---------e~~~W~~e~k~~gi~v~vv 68 (175)
T COG2179 19 TPDILKAHGIKGVILDLDNTLVPWDNPDATP---------ELRAWLAELKEAGIKVVVV 68 (175)
T ss_pred CHHHHHHcCCcEEEEeccCceecccCCCCCH---------HHHHHHHHHHhcCCEEEEE
Confidence 467899999999976 67888766552 4888899999999998763
No 313
>PF11347 DUF3148: Protein of unknown function (DUF3148); InterPro: IPR021495 This family of proteins has no known function.
Probab=28.37 E-value=1.3e+02 Score=22.38 Aligned_cols=61 Identities=15% Similarity=0.140 Sum_probs=33.4
Q ss_pred CCCeeEEEeccccccCCcceEEEEEEeeCCCCccccchhhhhhccccccceEEEEecCCeEEeCCee
Q psy13819 23 VGPQYSLFVPEEFLKVGTNRITIFELTRAPDKYDVDFVDKISQRRARMSRTFAIDLAGDTFRLNEDP 89 (323)
Q Consensus 23 ~gpq~~lyvP~~~l~~g~N~i~i~e~~~~~~~~~i~~~~~~i~~~~~~~~~~~v~~~~~~~~~dGk~ 89 (323)
+|-+..|-.+-+.||.-++-=++ .++..|+.-+.-.--.-.......|.+....|+||+|.
T Consensus 2 iG~~V~l~~~ppylKTAdpMPmL------Rp~dlv~~ge~G~i~~rrp~~~w~VRf~~GtfLid~~y 62 (63)
T PF11347_consen 2 IGDKVRLIEAPPYLKTADPMPML------RPPDLVKIGEVGRIVDRRPGDYWAVRFRRGTFLIDSKY 62 (63)
T ss_pred CCCeEEEeecCCceeccCCcccc------CCcCccccCCcEEEEEecCCCEEEEEEeceeEeecccc
Confidence 56677777666777764432221 11222332222221222334566788889999999875
No 314
>PRK10422 lipopolysaccharide core biosynthesis protein; Provisional
Probab=28.32 E-value=1.8e+02 Score=27.99 Aligned_cols=63 Identities=14% Similarity=0.229 Sum_probs=43.3
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcc-------------cCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHE-------------VHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hE-------------P~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
|+|.+.|.+.++.+++.|+..| +.+.-.| +.+...|..|..+|..+..+.+...+.|=...||
T Consensus 198 ~Wp~e~fa~l~~~L~~~~~~vv---l~ggp~e~e~~~~~~i~~~~~~~~~~~l~g~~sL~el~ali~~a~l~v~nDSGp 273 (352)
T PRK10422 198 CWDNDKFSAVIDALQARGYEVV---LTSGPDKDDLACVNEIAQGCQTPPVTALAGKTTFPELGALIDHAQLFIGVDSAP 273 (352)
T ss_pred CCCHHHHHHHHHHHHHCCCeEE---EEcCCChHHHHHHHHHHHhcCCCccccccCCCCHHHHHHHHHhCCEEEecCCHH
Confidence 6899999999999988787554 4444322 1123466667667777777777777777777665
No 315
>PRK08446 coproporphyrinogen III oxidase; Provisional
Probab=27.98 E-value=1.3e+02 Score=29.35 Aligned_cols=52 Identities=19% Similarity=0.208 Sum_probs=34.8
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC----CCe-ecccccchHHHHHHHHHHCCCe-EEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH----PGH-YHYDGHRDIEHFLQLAVEEDLY-ILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~----~G~-fdf~g~~dl~~fl~~a~e~GL~-vilr~ 164 (323)
++.++.++++|+|.|++.|- ...++ =|+ .+ ..++.+.++.+++.|+. |-+..
T Consensus 98 ~e~l~~l~~~GvnRiSiGvQ--S~~~~~L~~lgR~~~---~~~~~~ai~~lr~~g~~~v~iDl 155 (350)
T PRK08446 98 KAWLKGMKNLGVNRISFGVQ--SFNEDKLKFLGRIHS---QKQIIKAIENAKKAGFENISIDL 155 (350)
T ss_pred HHHHHHHHHcCCCEEEEecc--cCCHHHHHHcCCCCC---HHHHHHHHHHHHHcCCCEEEEEe
Confidence 58899999999999999543 11111 121 22 23577789999999986 43544
No 316
>PRK08599 coproporphyrinogen III oxidase; Provisional
Probab=27.94 E-value=69 Score=31.35 Aligned_cols=49 Identities=12% Similarity=0.039 Sum_probs=31.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCC---CcccCCCeecccccchHHHHHHHHHHCCCe
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWR---SHEVHPGHYHYDGHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~---~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~ 159 (323)
++.++.||++|+|.|.+.|.=- .++--....+ .+++.+.++.+++.|+.
T Consensus 100 ~e~l~~l~~~G~~rvsiGvqS~~~~~l~~l~r~~~---~~~~~~~i~~l~~~g~~ 151 (377)
T PRK08599 100 KEKLQVLKDSGVNRISLGVQTFNDELLKKIGRTHN---EEDVYEAIANAKKAGFD 151 (377)
T ss_pred HHHHHHHHHcCCCEEEEecccCCHHHHHHcCCCCC---HHHHHHHHHHHHHcCCC
Confidence 5778999999999888844310 0111011122 23577889999999976
No 317
>PRK14905 triosephosphate isomerase/PTS system glucose/sucrose-specific transporter subunit IIB; Provisional
Probab=27.88 E-value=1.2e+02 Score=29.92 Aligned_cols=50 Identities=12% Similarity=0.146 Sum_probs=37.8
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
..++|++|++.|-+ .|--++-.|.-+ ...+.+=+..|.++||.+|++.|-
T Consensus 88 ~~mL~d~G~~~vii-----GHSERR~~f~Et-d~~i~~Kv~~al~~gl~pIvCiGE 137 (355)
T PRK14905 88 PLMLKELGIELVMI-----GHSERRHVLKET-DQEENEKVLAALKHGFITLLCIGE 137 (355)
T ss_pred HHHHHHcCCCEEEE-----CcccccCccccc-HHHHHHHHHHHHHCCCEEEEEcCC
Confidence 46899999999999 555554555444 244666788999999999999973
No 318
>cd04740 DHOD_1B_like Dihydroorotate dehydrogenase (DHOD) class 1B FMN-binding domain. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively.
Probab=27.85 E-value=1.8e+02 Score=27.34 Aligned_cols=62 Identities=8% Similarity=0.149 Sum_probs=40.3
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHC-CCeEEEccC
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEE-DLYILLRPG 165 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~-GL~vilr~G 165 (323)
.++.|.+-.++++++|+..|++.+.=.... ..|..--...+.+.++++.+++. ++.|+++.+
T Consensus 100 ~~~~~~~~a~~~~~~G~d~iElN~~cP~~~-~~g~~~~~~~~~~~eiv~~vr~~~~~Pv~vKl~ 162 (296)
T cd04740 100 TVEEFVEVAEKLADAGADAIELNISCPNVK-GGGMAFGTDPEAVAEIVKAVKKATDVPVIVKLT 162 (296)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEECCCCCCC-CCcccccCCHHHHHHHHHHHHhccCCCEEEEeC
Confidence 368999999999999999999964422211 11111111224566778888877 777777764
No 319
>PRK07823 5'-methylthioadenosine phosphorylase; Validated
Probab=27.78 E-value=93 Score=29.49 Aligned_cols=57 Identities=21% Similarity=0.199 Sum_probs=38.4
Q ss_pred EeCCeeeEEe--eeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCccc--CCCeecc
Q psy13819 84 RLNEDPFQFV--SGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEV--HPGHYHY 140 (323)
Q Consensus 84 ~~dGk~~~i~--~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP--~~G~fdf 140 (323)
.++|+++.++ .|..|+.--+.-..+.-+..||++|+..|=.-=.=....| +||.+-.
T Consensus 44 ~l~g~~v~~l~RhGr~H~ye~~~i~~rani~alk~lGv~~ii~tnA~Gsln~~~~pGdlvi 104 (264)
T PRK07823 44 EVGGRRVAFLPRHGRDHEFSPHTVPYRANMWALRALGVRRVFAPCAVGSLRPELGPGTVVV 104 (264)
T ss_pred EECCEEEEEEeCCCCCCCcCCCCccchHHHHHHHHcCCCEEEEecccccCCCCCCCCCEEE
Confidence 4899999999 9999963211122677788999999988765222223333 4777665
No 320
>PLN02389 biotin synthase
Probab=27.49 E-value=95 Score=30.90 Aligned_cols=50 Identities=16% Similarity=0.096 Sum_probs=33.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc----cccchHHHHHHHHHHCCCeEEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY----DGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf----~g~~dl~~fl~~a~e~GL~vil 162 (323)
++.++++|++|++.+..- +|..|..|.- ...++..+.++.|++.||.|-.
T Consensus 178 ~E~l~~LkeAGld~~~~~-----LeTs~~~y~~i~~~~s~e~rl~ti~~a~~~Gi~v~s 231 (379)
T PLN02389 178 KEQAAQLKEAGLTAYNHN-----LDTSREYYPNVITTRSYDDRLETLEAVREAGISVCS 231 (379)
T ss_pred HHHHHHHHHcCCCEEEee-----ecCChHHhCCcCCCCCHHHHHHHHHHHHHcCCeEeE
Confidence 688999999999998873 3322222221 0112345778999999998744
No 321
>PF04914 DltD_C: DltD C-terminal region; InterPro: IPR006998 The dlt operon (dltA to dltD) of Lactobacillus rhamnosus 7469 encodes four proteins responsible for the esterification of lipoteichoic acid (LTA) by D-alanine. These esters play an important role in controlling the net anionic charge of the poly (GroP) moiety of LTA. DltA and DltC encode the D-alanine-D-alanyl carrier protein ligase (Dcl) and D-alanyl carrier protein (Dcp), respectively. Whereas the functions of DltA and DltC are defined, the functions of DltB and DltD are unknown. In vitro assays showed that DltD bound Dcp for ligation with D-alanine by Dcl in the presence of ATP. In contrast, the homologue of Dcp, the Escherichia coli acyl carrier protein (ACP), involved in fatty acid biosynthesis, was not bound to DltD and thus was not ligated with D-alanine. DltD also catalyzed the hydrolysis of the mischarged D-alanyl-ACP. The hydrophobic N-terminal sequence of DltD was required for anchoring the protein in the membrane. It is hypothesized that this membrane-associated DltD facilitates the binding of Dcp and Dcl for ligation of Dcp with D-alanine and that the resulting D-alanyl-Dcp is translocated to the primary site of D-alanylation []. These sequences contain the C-terminal region of DltD.; PDB: 3BMA_C.
Probab=27.24 E-value=57 Score=27.60 Aligned_cols=24 Identities=21% Similarity=0.339 Sum_probs=18.2
Q ss_pred cchHHHHHHHHHHCCCeEEEccCC
Q psy13819 143 HRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 143 ~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
..|++-+|+.|++.|+.|++-.-|
T Consensus 35 y~Dl~l~L~~~k~~g~~~lfVi~P 58 (130)
T PF04914_consen 35 YDDLQLLLDVCKELGIDVLFVIQP 58 (130)
T ss_dssp HHHHHHHHHHHHHTT-EEEEEE--
T ss_pred HHHHHHHHHHHHHcCCceEEEecC
Confidence 468999999999999998865533
No 322
>PRK05904 coproporphyrinogen III oxidase; Provisional
Probab=27.23 E-value=1.2e+02 Score=29.70 Aligned_cols=52 Identities=15% Similarity=0.130 Sum_probs=34.3
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-----CCeecccccchHHHHHHHHHHCCCe-EEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-----PGHYHYDGHRDIEHFLQLAVEEDLY-ILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-----~G~fdf~g~~dl~~fl~~a~e~GL~-vilr~ 164 (323)
++.|+.||++|+|.|++.|- ...++ ....+. +++.+.++.|++.|+. |-+..
T Consensus 103 ~e~l~~lk~~G~nrisiGvQ--S~~d~vL~~l~R~~~~---~~~~~ai~~lr~~G~~~v~~dl 160 (353)
T PRK05904 103 QSQINLLKKNKVNRISLGVQ--SMNNNILKQLNRTHTI---QDSKEAINLLHKNGIYNISCDF 160 (353)
T ss_pred HHHHHHHHHcCCCEEEEecc--cCCHHHHHHcCCCCCH---HHHHHHHHHHHHcCCCcEEEEE
Confidence 68899999999999888443 11111 111222 3577889999999986 44443
No 323
>PF02811 PHP: PHP domain; InterPro: IPR004013 The PHP (Polymerase and Histidinol Phosphatase) domain is a putative phosphoesterase domain. This family is often associated with an N-terminal region IPR003141 from INTERPRO.; GO: 0003824 catalytic activity; PDB: 2WJE_A 3QY8_A 2WJD_A 2WJF_A 1PB0_B 1M68_A 1M65_A 3E38_B 2W9M_A 3E0F_A ....
Probab=26.61 E-value=1.8e+02 Score=24.15 Aligned_cols=47 Identities=15% Similarity=0.244 Sum_probs=36.7
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..-++-++.+++.|+.+|-+- -| .++.+ ...+.+.+++.||.+++..
T Consensus 16 ~~~~e~v~~A~~~Gl~~i~iT----DH------~~~~~---~~~~~~~~~~~~i~vi~G~ 62 (175)
T PF02811_consen 16 DSPEEYVEQAKEKGLDAIAIT----DH------NNFAG---YPDFYKEAKKKGIKVIPGV 62 (175)
T ss_dssp SSHHHHHHHHHHTTESEEEEE----EE------TTTTT---HHHHHHHHHHTTSEEEEEE
T ss_pred CCHHHHHHHHHHcCCCEEEEc----CC------ccccc---chHHHHHHHhcCCceEEeE
Confidence 356788899999999999983 33 22333 7778999999999999865
No 324
>PRK05628 coproporphyrinogen III oxidase; Validated
Probab=26.54 E-value=1.3e+02 Score=29.44 Aligned_cols=53 Identities=19% Similarity=0.046 Sum_probs=34.2
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc----ccchHHHHHHHHHHCCCe-EEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD----GHRDIEHFLQLAVEEDLY-ILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~----g~~dl~~fl~~a~e~GL~-vilr~ 164 (323)
++.|+.|+++|+|.|++.|- ...++ .-+.- ..+++.+.++.+++.|+. |-+..
T Consensus 108 ~e~l~~l~~~G~~rvslGvQ--S~~~~--~L~~l~R~~s~~~~~~a~~~l~~~g~~~v~~dl 165 (375)
T PRK05628 108 PEFFAALRAAGFTRVSLGMQ--SAAPH--VLAVLDRTHTPGRAVAAAREARAAGFEHVNLDL 165 (375)
T ss_pred HHHHHHHHHcCCCEEEEecc--cCCHH--HHHHcCCCCCHHHHHHHHHHHHHcCCCcEEEEE
Confidence 57889999999999999543 11111 11111 123577788999999998 65543
No 325
>cd03174 DRE_TIM_metallolyase DRE-TIM metallolyase superfamily. The DRE-TIM metallolyase superfamily includes 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues that cap the core of the barrel. In addition, the catalytic site includes three invariant residues - an aspartate (D), an arginine (R), and a glutamate (E) - which is the basis for the domain name "DRE-TIM".
Probab=26.52 E-value=1e+02 Score=28.04 Aligned_cols=55 Identities=13% Similarity=0.068 Sum_probs=38.6
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCC-cccCCCeeccc---ccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRS-HEVHPGHYHYD---GHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~-hEP~~G~fdf~---g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+++++++++.|+..|++++.-+- |... ++..+ -.+.+...++.|++.|+.|.+-+
T Consensus 77 ~~~i~~a~~~g~~~i~i~~~~s~~~~~~--~~~~~~~~~~~~~~~~i~~a~~~G~~v~~~~ 135 (265)
T cd03174 77 EKGIERALEAGVDEVRIFDSASETHSRK--NLNKSREEDLENAEEAIEAAKEAGLEVEGSL 135 (265)
T ss_pred hhhHHHHHhCCcCEEEEEEecCHHHHHH--HhCCCHHHHHHHHHHHHHHHHHCCCeEEEEE
Confidence 78899999999999999876541 1100 00000 23457788999999999988765
No 326
>PRK04302 triosephosphate isomerase; Provisional
Probab=26.47 E-value=1.4e+02 Score=26.96 Aligned_cols=47 Identities=19% Similarity=0.134 Sum_probs=34.2
Q ss_pred HHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 109 WIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 109 ~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.-++.++++|++.|-+. .-|.. -.| .++.++++.|.++||.+|+..+
T Consensus 76 ~~~~~l~~~G~~~vii~----~ser~---~~~---~e~~~~v~~a~~~Gl~~I~~v~ 122 (223)
T PRK04302 76 ILPEAVKDAGAVGTLIN----HSERR---LTL---ADIEAVVERAKKLGLESVVCVN 122 (223)
T ss_pred hHHHHHHHcCCCEEEEe----ccccc---cCH---HHHHHHHHHHHHCCCeEEEEcC
Confidence 34889999999999882 11322 112 2378899999999999998764
No 327
>PF03740 PdxJ: Pyridoxal phosphate biosynthesis protein PdxJ; InterPro: IPR004569 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). Pyridoxal 5'-phosphate (PLP) is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination [, , ]. PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors []. Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy []. PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalysed reactions, enzymatic and non-enzymatic []. In Escherichia coli, the pdx genes involved in vitamin B6 have been characterised [, , ]. This entry represents PdxJ, which catalyses the condensation of 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one and 1-deoxy-D-xylulose-5-phosphate to form pyridoxine-5'-phosphate. The product of the PdxJ reaction is then oxidized by PdxH to pyridoxal 5'-phosphate.; GO: 0008615 pyridoxine biosynthetic process, 0005737 cytoplasm; PDB: 3F4N_B 3O6D_A 3O6C_A 1M5W_G 1IXQ_D 1IXP_B 1IXN_A 1HO4_C 1HO1_A 1IXO_D ....
Probab=26.44 E-value=72 Score=29.92 Aligned_cols=49 Identities=18% Similarity=0.260 Sum_probs=29.9
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE-EccCCcccc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL-LRPGPFICG 170 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi-lr~GPyi~a 170 (323)
.+...+.++++|+.|+ -|++ .++|. .+.++.|++.|...| |-+|||..+
T Consensus 110 ~~~l~~~i~~L~~~gI-rvSL-----FiDP~------------~~qi~~A~~~Gad~VELhTG~yA~a 159 (239)
T PF03740_consen 110 RDRLKPVIKRLKDAGI-RVSL-----FIDPD------------PEQIEAAKELGADRVELHTGPYANA 159 (239)
T ss_dssp HHHHHHHHHHHHHTT--EEEE-----EE-S-------------HHHHHHHHHTT-SEEEEETHHHHHH
T ss_pred HHHHHHHHHHHHhCCC-EEEE-----EeCCC------------HHHHHHHHHcCCCEEEEehhHhhhh
Confidence 3556777788888776 3333 34554 355788888888755 778877655
No 328
>PTZ00377 alanine aminotransferase; Provisional
Probab=26.42 E-value=3.4e+02 Score=27.42 Aligned_cols=59 Identities=8% Similarity=0.052 Sum_probs=37.7
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+..+..++.+...++-|-++--+++. +-|.|.--.++-+- ++++++.|+++|+.+|..=
T Consensus 200 ~d~~~l~~~l~~~~~~~~~~k~l~l~-~P~NPTG~~~s~e~---~~~i~~~a~~~~~~iI~De 258 (481)
T PTZ00377 200 LDQEELEEAYEQAVRNGITPRALVVI-NPGNPTGQVLTRDV---MEEIIKFCYEKGIVLMADE 258 (481)
T ss_pred CCHHHHHHHHHHHHhcCCCeeEEEEE-CCCCCCCcCCCHHH---HHHHHHHHHHCCCEEEEeh
Confidence 33444444444333334444335666 77888866666655 7777999999999998763
No 329
>PF11261 IRF-2BP1_2: Interferon regulatory factor 2-binding protein zinc finger; InterPro: IPR022750 IRF-2BP1 and IRF-2BP2 are nuclear transcriptional repressor proteins and can inhibit both enhancer-activated and basal transcription. They both contain N-terminal zinc finger and C-terminal RING finger domains []. This entry represents the N-terminal zinc finger domain of IRF-2BP1 and IRF-2BP2.
Probab=26.39 E-value=46 Score=23.78 Aligned_cols=30 Identities=27% Similarity=0.398 Sum_probs=25.1
Q ss_pred eCCCcccC-CCeecccccchHHHHHHHHHHC
Q psy13819 127 EWRSHEVH-PGHYHYDGHRDIEHFLQLAVEE 156 (323)
Q Consensus 127 ~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~ 156 (323)
.|...||- .|-.||+|.+.++..|+.|+..
T Consensus 19 i~df~EpVCRgCvNyEGaDrIe~vie~arq~ 49 (54)
T PF11261_consen 19 IWDFSEPVCRGCVNYEGADRIELVIESARQL 49 (54)
T ss_pred HhhccchhhhhhcCcccchhHHHHHHHHHHH
Confidence 35678887 7999999999999999988763
No 330
>PLN02450 1-aminocyclopropane-1-carboxylate synthase
Probab=26.29 E-value=1.9e+02 Score=29.34 Aligned_cols=58 Identities=2% Similarity=0.013 Sum_probs=40.7
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..+..++.++..++.|.++=-+++. |-|.|.--.|+-+- +.++++.|+++|+.+|..=
T Consensus 174 ~~~~le~~~~~~~~~~~~~k~v~l~-nP~NPTG~~~s~e~---l~~ll~~a~~~~~~iI~DE 231 (468)
T PLN02450 174 TESALEEAYQQAQKLNLKVKGVLIT-NPSNPLGTTTTRTE---LNLLVDFITAKNIHLISDE 231 (468)
T ss_pred CHHHHHHHHHHHHhcCCCeeEEEEe-cCCCCCCcccCHHH---HHHHHHHHHHCCcEEEEEc
Confidence 3445555555555556655445777 78888866777665 7788999999999999764
No 331
>PTZ00372 endonuclease 4-like protein; Provisional
Probab=26.27 E-value=7e+02 Score=25.30 Aligned_cols=55 Identities=9% Similarity=0.142 Sum_probs=38.4
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc--ccchHHHHHHHHHHCCCe--EEEccCCcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD--GHRDIEHFLQLAVEEDLY--ILLRPGPFI 168 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~--g~~dl~~fl~~a~e~GL~--vilr~GPyi 168 (323)
...++.+.+.|+|+++++. ..|+ .|... ...++++|.+.|+++||. .++--+||.
T Consensus 144 ~~a~~~a~~~g~~afqiF~----~npr--~w~~~~~~~~~~~~f~~~~~~~gi~~~~i~~HapYl 202 (413)
T PTZ00372 144 DNSPINAYNIAGQAFALFL----KNQR--TWNSPPLSDETIDKFKENCKKYNYDPKFILPHGSYL 202 (413)
T ss_pred HHHHHHHHHcCCCEEEEEc----CCCc--cCCCCCCCHHHHHHHHHHHHHcCCCcceEEeecCce
Confidence 4467889999999999963 3333 23322 135789999999999885 355566774
No 332
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=26.25 E-value=2.5e+02 Score=31.40 Aligned_cols=71 Identities=13% Similarity=0.088 Sum_probs=39.8
Q ss_pred chHHHHHHHHHHCCCeEEEccCCccccccCCCCCccccccc-CCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCc
Q psy13819 144 RDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKV-APDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRP 222 (323)
Q Consensus 144 ~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~-~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~ 222 (323)
+.|..||+.|++.|++||-..=|. -|.+-... ..---+| +..+.+.++++.+. .+.-++
T Consensus 808 ~~l~~~i~~~~~~~~~~ig~~~p~---------~p~y~~t~~fg~~g~~------rs~a~~~~~~~~~~-----~~~y~~ 867 (912)
T TIGR02171 808 NSLKAFIDETAKKGVKVIGTIFPQ---------SPGYKNTGSFGRYGPR------RSIAKKIIDSFKKM-----EKTYPH 867 (912)
T ss_pred HHHHHHHHHHHhCCCEEEEEECCC---------CCCccccCcccccCcc------hhhHHHHHHHHHHH-----HhhCCc
Confidence 568899999999999999654221 12221110 0000011 22334444443333 233578
Q ss_pred EEEEccCCCcCCC
Q psy13819 223 IILVQVENEYGSD 235 (323)
Q Consensus 223 Vi~~QieNE~g~~ 235 (323)
.|++. ||-.|.+
T Consensus 868 f~~~d-enk~g~h 879 (912)
T TIGR02171 868 FILFD-ENKDGLH 879 (912)
T ss_pred eEEEe-cCcCCcc
Confidence 89998 9999975
No 333
>cd08190 HOT Hydroxyacid-oxoacid transhydrogenase (HOT) involved in gamma-hydroxybutyrate metabolism. Hydroxyacid-oxoacid transhydrogenase (HOT), also known as D-2-hydroxyglutarate transhydrogenase. It catalyzes the conversion of gamma-hydroxybutyrate (GHB) to succinic semialdehyde (SSA), coupled to the stoichiometric conversion of alpha-ketoglutarate to D-2-hydroxyglutarate in gamma-Hydroxybutyrate catabolism. Unlike many other alcohols, which are oxidized by NAD-linked dehydrogenases, gamma-hydroxybutyrate is metabolized to succinate semialdehyde by hydroxyacid-oxoacid transhydrogenase which does not require free NAD or NADP, but instead using alpha -ketoglutarate as an acceptor, converting it to d-2-hydroxyglutarate. Alpha-ketoglutarate serves as an intermediate acceptor to regenerate NAD(P) required for the oxidation of GHB. HOT also catalyzes the reversible oxidation of a hydroxyacid obligatorily coupled to the reduction of an oxoacid, and requires no cofactor. In mammals, the HOT
Probab=26.20 E-value=2.4e+02 Score=28.18 Aligned_cols=65 Identities=14% Similarity=0.150 Sum_probs=44.6
Q ss_pred eeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 88 DPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 88 k~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
++++++++..-. + ...+++..+.+++.|+++..+ ...+|+|. .+.+.+.++.+++.+.-+|+..|
T Consensus 24 ~~vlivt~~~~~-~--~g~~~~v~~~L~~~gi~~~~f----~~v~~~p~------~~~v~~~~~~~~~~~~D~IIaiG 88 (414)
T cd08190 24 RRVCLVTDPNLA-Q--LPPVKVVLDSLEAAGINFEVY----DDVRVEPT------DESFKDAIAFAKKGQFDAFVAVG 88 (414)
T ss_pred CeEEEEECcchh-h--cchHHHHHHHHHHcCCcEEEe----CCCCCCcC------HHHHHHHHHHHHhcCCCEEEEeC
Confidence 555556553311 1 234566777788899986543 46677764 35688889999999999998876
No 334
>COG0076 GadB Glutamate decarboxylase and related PLP-dependent proteins [Amino acid transport and metabolism]
Probab=26.13 E-value=95 Score=31.76 Aligned_cols=71 Identities=15% Similarity=0.164 Sum_probs=48.4
Q ss_pred EEeCCeeeEEeeeeecCC----------------------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc
Q psy13819 83 FRLNEDPFQFVSGSFHYF----------------------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY 140 (323)
Q Consensus 83 ~~~dGk~~~i~~g~~Hy~----------------------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf 140 (323)
-.-.|+|..+.|-+.|+. |+.++.-++.+..-...| =.|- .-|..++
T Consensus 152 ~~~~~~P~ii~s~~aH~s~~Kaa~~lG~~~~~v~~~~~~~~id~~~l~~~i~~~t~~g-~vV~----------~aGtT~~ 220 (460)
T COG0076 152 GKPGGKPNIVCSETAHFSFEKAARYLGLGLRRVPTVPTDYRIDVDALEEAIDENTIGG-VVVG----------TAGTTDT 220 (460)
T ss_pred cccCCCCeEEecCcchhHHHHHHHHhCCCceeEEeccCccccCHHHHHHHHHhhccCc-eEEE----------EecCCCC
Confidence 344557878888888862 122334444455545555 1222 2488889
Q ss_pred cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 141 DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 141 ~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
...||++++-++|+++|+++.+.-
T Consensus 221 G~iDdi~~ia~ia~~~~i~lHVDA 244 (460)
T COG0076 221 GSIDDIEELADIAEEYGIWLHVDA 244 (460)
T ss_pred CccCCHHHHHHHHHHcCCcEEEEc
Confidence 899999999999999999998863
No 335
>PLN02389 biotin synthase
Probab=26.05 E-value=1.8e+02 Score=29.04 Aligned_cols=61 Identities=10% Similarity=0.113 Sum_probs=41.5
Q ss_pred CCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 100 FRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 100 ~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
...+++...+.++.+++.|+..+-+-..|...-.+|- +|+ .+.++++.+++.++.+.+..|
T Consensus 114 ~~Ls~EeIl~~a~~~~~~G~~~~~ivts~rg~~~e~~--~~e---~i~eiir~ik~~~l~i~~s~G 174 (379)
T PLN02389 114 KLMSKDDVLEAAKRAKEAGSTRFCMGAAWRDTVGRKT--NFN---QILEYVKEIRGMGMEVCCTLG 174 (379)
T ss_pred ccCCHHHHHHHHHHHHHcCCCEEEEEecccCCCCChh--HHH---HHHHHHHHHhcCCcEEEECCC
Confidence 3456778888888899999999887666642212111 233 366678888888888777666
No 336
>TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Probab=26.03 E-value=1.4e+02 Score=28.01 Aligned_cols=70 Identities=11% Similarity=0.042 Sum_probs=41.6
Q ss_pred eEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCC-cccCCCeeccc-ccchHHHHHHHHHHCCCe
Q psy13819 90 FQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRS-HEVHPGHYHYD-GHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 90 ~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~-hEP~~G~fdf~-g~~dl~~fl~~a~e~GL~ 159 (323)
+.+++|-.+..-.....|....+.+.+.|++++++...-.. -+.....++++ ..+|+...++.+++.|..
T Consensus 28 VlllHG~g~~~~~~~~~~~~la~~La~~Gy~Vl~~Dl~G~G~S~g~~~~~~~~~~~~Dv~~ai~~L~~~~~~ 99 (266)
T TIGR03101 28 VIYLPPFAEEMNKSRRMVALQARAFAAGGFGVLQIDLYGCGDSAGDFAAARWDVWKEDVAAAYRWLIEQGHP 99 (266)
T ss_pred EEEECCCcccccchhHHHHHHHHHHHHCCCEEEEECCCCCCCCCCccccCCHHHHHHHHHHHHHHHHhcCCC
Confidence 44555532211122356777788888999999999555322 22222333443 246777788888887764
No 337
>cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=25.96 E-value=1.3e+02 Score=28.44 Aligned_cols=57 Identities=11% Similarity=0.209 Sum_probs=37.7
Q ss_pred CCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCC-CcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 86 NEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWR-SHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~-~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++.+...+..+.| .+++.|++.+.+- ..+ ..||. ..+|.++++.++++++.+|+.-
T Consensus 178 ~~~~~v~~H~af~Y-------------l~~~~gl~~~~~~-~~~~~~eps--------~~~l~~l~~~ik~~~v~~If~e 235 (286)
T cd01019 178 KTKPFFVFHDAYGY-------------FEKRYGLTQAGVF-TIDPEIDPG--------AKRLAKIRKEIKEKGATCVFAE 235 (286)
T ss_pred CCCeEEEecccHHH-------------HHHHcCCceeeee-cCCCCCCCC--------HHHHHHHHHHHHHcCCcEEEec
Confidence 45556666666666 4678899877642 111 12222 4678999999999999998743
No 338
>TIGR03567 FMN_reduc_SsuE FMN reductase, SsuE family. Members of this protein family use NAD(P)H to reduce FMN and regenerate FMNH2. Members include the homodimeric, NAD(P)H-dependent enzyme SsuE from Escherichia coli, which serves as a partner to an FMNH2-dependent alkanesulfonate monooxygenase. It is induced by sulfate starvation. The NADH-dependent enzyme MsuE from Pseudomonas aeruginosa is outside the scope of this model (see model TIGR03566).
Probab=25.89 E-value=2.4e+02 Score=24.27 Aligned_cols=82 Identities=12% Similarity=0.013 Sum_probs=49.1
Q ss_pred eEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc-cccchHHHHHHHHHHCCCeEEEccCCcc
Q psy13819 90 FQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY-DGHRDIEHFLQLAVEEDLYILLRPGPFI 168 (323)
Q Consensus 90 ~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf-~g~~dl~~fl~~a~e~GL~vilr~GPyi 168 (323)
+.++.|+.+-.......-+.-.+.+++.|..+..+.+. + -|.....+. ...+++.++.+...+.+..|+..|
T Consensus 2 il~I~gS~r~~S~t~~l~~~~~~~l~~~~~~~~~idl~-~--l~~~~~~~~~~~~~~~~~l~~~i~~AD~iI~~sP---- 74 (171)
T TIGR03567 2 VLTLSGSPSTPSRSSALLRHVREALQEQGVEVDHLSVR-D--LPAEDLLFARFDSPAIKAATAQVAQADGVVVATP---- 74 (171)
T ss_pred EEEEECCCCCCChHHHHHHHHHHHHHHCCCeEEEEEec-C--CChHHhhhcCCCCHHHHHHHHHHHHCCEEEEECC----
Confidence 45667766643444556666777788878765444321 1 111111111 123578999999999999999998
Q ss_pred ccccCCCCCcccc
Q psy13819 169 CGKRDFGGFPPWL 181 (323)
Q Consensus 169 ~aEw~~gg~P~Wl 181 (323)
|| ++++|.-|
T Consensus 75 --~Y-~~sip~~L 84 (171)
T TIGR03567 75 --VY-KASYSGVL 84 (171)
T ss_pred --cc-cCCCCHHH
Confidence 44 45666543
No 339
>PF03170 BcsB: Bacterial cellulose synthase subunit; InterPro: IPR018513 An operon encoding 4 proteins required for bacterial cellulose biosynthesis (bcs) in Acetobacter xylinus (Gluconacetobacter xylinus) has been isolated via genetic complementation with strains lacking cellulose synthase activity []. Nucleotide sequence analysis showed the cellulose synthase operon to consist of 4 genes, designated bcsA, bcsB, bcsC and bcsD, all of which are required for maximal bacterial cellulose synthesis in A. xylinum. The calculated molecular mass of the protein encoded by bcsB is 85.3kDa []. BcsB encodes the catalytic subunit of cellulose synthase. The protein polymerises uridine 5'-diphosphate glucose to cellulose: UDP-glucose + (1,4-beta-D-glucosyl)(N) = UDP + (1,4-beta-D-glucosyl)(N+1). The enzyme is specifically activated by the nucleotide cyclic diguanylic acid. Sequence analysis suggests that BcsB contains several transmembrane (TM) domains, and shares a high degree of similarity with Escherichia coli YhjN.; GO: 0006011 UDP-glucose metabolic process, 0016020 membrane
Probab=25.69 E-value=1.6e+02 Score=30.79 Aligned_cols=42 Identities=24% Similarity=0.437 Sum_probs=30.5
Q ss_pred eEEEEcCcccccccCC-----CCCeeEEEeccccccCCcceEEE-EEEe
Q psy13819 7 GVLFANGRHWGRYWSE-----VGPQYSLFVPEEFLKVGTNRITI-FELT 49 (323)
Q Consensus 7 g~~~vng~~lGryw~~-----~gpq~~lyvP~~~l~~g~N~i~i-~e~~ 49 (323)
-.|.|||+.||-+=-+ ..-..++.|| ..+..|.|+|-| |++.
T Consensus 363 l~V~vNg~~i~s~~L~~~~~~~~~~~~v~iP-~~~~~~~N~l~~~f~l~ 410 (605)
T PF03170_consen 363 LTVYVNGQFIGSLPLTPADGAGFDRYTVSIP-RLLLPGRNQLQFEFDLP 410 (605)
T ss_pred EEEEECCEEEEeEECCCCCCCccceeEEecC-chhcCCCcEEEEEEEee
Confidence 3688999999875322 1245778899 999999999875 3443
No 340
>COG1082 IolE Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]
Probab=25.65 E-value=1.6e+02 Score=26.75 Aligned_cols=61 Identities=8% Similarity=-0.008 Sum_probs=42.6
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccC--CCeecc-cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVH--PGHYHY-DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~--~G~fdf-~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.+.+++.++.++++|..+|-+.......... +....+ ...+.+.++.+.|++.|+.+-+.+
T Consensus 83 ~~~~~~~i~~a~~lg~~~vv~~~g~~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~~i~l~~e~ 146 (274)
T COG1082 83 LEELKRAIELAKELGAKVVVVHPGLGAGADDPDSPEEARERWAEALEELAEIAEELGIGLALEN 146 (274)
T ss_pred HHHHHHHHHHHHHcCCCeEEeecccCCcCCCCCCCcccHHHHHHHHHHHHHHHHHhCCceEEee
Confidence 4567788999999999999875544333333 122222 235668888999999999888876
No 341
>PRK12360 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional
Probab=25.64 E-value=1.8e+02 Score=27.92 Aligned_cols=73 Identities=12% Similarity=0.112 Sum_probs=47.9
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
|+|+..+|-=+|. +..++++++.|+.+|... ..+.. |..+..=+...--=....+.|++.|+.++-.+=|
T Consensus 31 ~~~vy~lG~iVHN--------~~Vv~~L~~~Gv~~v~~~-~~~~v-~~~~~ViirAHGv~~~~~~~~~~~g~~viDaTCP 100 (281)
T PRK12360 31 GKKIYTLGPLIHN--------NQVVSDLEEKGVKTIEES-EIDSL-KEGDVVIIRSHGVSKKVYKDLKDKGLEIIDATCP 100 (281)
T ss_pred CCCeEEecCCcCC--------HHHHHHHHHCcCEEECcC-chhhC-CCCCEEEEeCCCCCHHHHHHHHHCCCeEEeCCCc
Confidence 6788888888998 667899999999999220 01111 2233555554333345578888888888877766
Q ss_pred ccc
Q psy13819 167 FIC 169 (323)
Q Consensus 167 yi~ 169 (323)
++-
T Consensus 101 ~V~ 103 (281)
T PRK12360 101 FVK 103 (281)
T ss_pred cch
Confidence 553
No 342
>COG2108 Uncharacterized conserved protein related to pyruvate formate-lyase activating enzyme [General function prediction only]
Probab=25.42 E-value=1.2e+02 Score=29.84 Aligned_cols=48 Identities=13% Similarity=0.063 Sum_probs=36.4
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.|+++.++|++-||+... +++-.....+...|..|.+.|+-|=+..
T Consensus 124 ~e~l~~L~eAGLDEIRfHp~---------~~~~~~~e~~i~~l~~A~~~g~dvG~Ei 171 (353)
T COG2108 124 EEALKALAEAGLDEIRFHPP---------RPGSKSSEKYIENLKIAKKYGMDVGVEI 171 (353)
T ss_pred HHHHHHHHhCCCCeEEecCC---------CccccccHHHHHHHHHHHHhCccceeec
Confidence 67899999999999999432 3333344457788999999999887654
No 343
>COG1663 LpxK Tetraacyldisaccharide-1-P 4'-kinase [Cell envelope biogenesis, outer membrane]
Probab=25.17 E-value=1.3e+02 Score=29.55 Aligned_cols=59 Identities=22% Similarity=0.401 Sum_probs=45.9
Q ss_pred EeCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCC
Q psy13819 84 RLNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDL 158 (323)
Q Consensus 84 ~~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL 158 (323)
...||++..++|--|+.| -++.++++|++.+.++. -|+.++|+- ++++.+.+.++..||
T Consensus 230 ~~~~~~v~afaGIg~P~r--------Ff~tL~~~g~~~~~~~~-------FpDH~~f~~-~~l~~l~~~~~~~~L 288 (336)
T COG1663 230 DLKGKRVVAFAGIGNPQR--------FFATLRNLGIQVVETLA-------FPDHYDFSA-ADLEDLAKKAQADGL 288 (336)
T ss_pred hcCCceEEEEEecCChHH--------HHHHHHHcCcceeeeec-------CCchhhccH-HHHHHHHhhhccceE
Confidence 345688999999999844 48899999999999964 478888886 457777887777333
No 344
>cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence).
Probab=25.08 E-value=1.2e+02 Score=28.42 Aligned_cols=58 Identities=19% Similarity=0.336 Sum_probs=38.8
Q ss_pred eCCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 85 LNEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 85 ~dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
+.++++...+-.+.| .+++.|++.+.+. ...++-+| ..+++.+++++++++|+++|+.
T Consensus 169 ~~~~~~v~~H~af~Y-------------~~~~~gl~~~~~~--~~~~~~ep------s~~~l~~l~~~ik~~~v~~if~ 226 (282)
T cd01017 169 AKGKTFVTQHAAFGY-------------LARRYGLKQIAIV--GVSPEVEP------SPKQLAELVEFVKKSDVKYIFF 226 (282)
T ss_pred cCCCeEEEecccHHH-------------HHHHCCCeEEecc--cCCCCCCC------CHHHHHHHHHHHHHcCCCEEEE
Confidence 356667766666666 5688899987641 11122222 2467899999999999998864
No 345
>cd06546 GH18_CTS3_chitinase GH18 domain of CTS3 (chitinase 3), an uncharacterized protein from the human fungal pathogen Coccidioides posadasii. CTS3 has a chitinase-like glycosyl hydrolase family 18 (GH18) domain; and has homologs in bacteria as well as fungi.
Probab=25.00 E-value=3.8e+02 Score=24.90 Aligned_cols=82 Identities=12% Similarity=0.095 Sum_probs=44.7
Q ss_pred HHHHHHHHHHCCCeEEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHHHhhhccCCCcEEE
Q psy13819 146 IEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRIQKYLYGNDRPIIL 225 (323)
Q Consensus 146 l~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i~~~~~~n~g~Vi~ 225 (323)
+.+-+..|++.|++|++-.| .|..+++ ..+ ..++. ..++|++.+.+.++++- +=+
T Consensus 61 ~~~~i~~~~~~g~KVllSiG-----G~~~~~f--------s~~---a~~~~---~r~~f~~s~~~~~~~~~------~DG 115 (256)
T cd06546 61 LWTELAILQSSGVKVMGMLG-----GAAPGSF--------SRL---DDDDE---DFERYYGQLRDMIRRRG------LDG 115 (256)
T ss_pred HHHHHHHHHhCCCEEEEEEC-----CCCCCCc--------ccc---cCCHH---HHHHHHHHHHHHHHHhC------CCc
Confidence 33446778999999999886 2222221 000 12222 33456666666665542 447
Q ss_pred EccCCCcCCCCCChHHHHHHHHHHHHHhcCC
Q psy13819 226 VQVENEYGSDAECDPAHAVWLRDLLRTYVQD 256 (323)
Q Consensus 226 ~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~ 256 (323)
+.|+-|+... ...|.+.++++ |+.++.
T Consensus 116 iDiDwE~p~~---~~~~~~ll~~L-r~~~~~ 142 (256)
T cd06546 116 LDLDVEEPMS---LDGIIRLIDRL-RSDFGP 142 (256)
T ss_pred eEEeeecCCC---HhHHHHHHHHH-HHHhCC
Confidence 7888888642 23455554443 443443
No 346
>TIGR02026 BchE magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase. This model respresents the cobalamin-dependent oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under anaerobic conditions. This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under anaerobic conditions (a separate enzyme, AcsF, acts under aerobic conditions). This model identifies two clades of sequences, one from photosynthetic, non-cyanobacterial bacteria and another including Synechocystis and several non-photosynthetic bacteria. The function of the Synechocystis gene is supported by gene clustering with other photosynthetic genes, so the purpose of the gene in the non-photosynthetic bacteria is uncertain. Note that homologs of this gene are not found in plants which rely solely on the aerobic cyclase.
Probab=24.81 E-value=81 Score=32.32 Aligned_cols=51 Identities=16% Similarity=0.126 Sum_probs=33.7
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-CCeecc----cccchHHHHHHHHHHCCCeEEEc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-PGHYHY----DGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-~G~fdf----~g~~dl~~fl~~a~e~GL~vilr 163 (323)
++.|++|+++|++.|.+.+. .- +-..+. ...++..+.+++|+++||.+.+.
T Consensus 287 ~ell~~l~~aG~~~v~iGiE-----S~~~~~L~~~~K~~t~~~~~~ai~~l~~~Gi~~~~~ 342 (497)
T TIGR02026 287 ADILHLYRRAGLVHISLGTE-----AAAQATLDHFRKGTTTSTNKEAIRLLRQHNILSEAQ 342 (497)
T ss_pred HHHHHHHHHhCCcEEEEccc-----cCCHHHHHHhcCCCCHHHHHHHHHHHHHCCCcEEEE
Confidence 57799999999999998433 21 111100 01235678899999999976543
No 347
>TIGR03551 F420_cofH 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit. This enzyme, together with CofG, complete the biosynthesis of 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, the chromophore of coenzyme F420. The chromophore is also used in cyanobacteria DNA photolyases.
Probab=24.60 E-value=65 Score=31.27 Aligned_cols=51 Identities=10% Similarity=0.107 Sum_probs=31.2
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCccc------CCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEV------HPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP------~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
++.+++||++|++.+-. ..-....+ .|++..+ ++..+.++.|++.||.+-.
T Consensus 141 ~e~l~~LkeAGl~~i~~-~~~E~~~~~v~~~i~~~~~~~---~~~~~~i~~a~~~Gi~v~s 197 (343)
T TIGR03551 141 EEALKRLKEAGLDSMPG-TAAEILDDEVRKVICPDKLST---AEWIEIIKTAHKLGIPTTA 197 (343)
T ss_pred HHHHHHHHHhCcccccC-cchhhcCHHHHHhcCCCCCCH---HHHHHHHHHHHHcCCcccc
Confidence 68899999999998741 00111111 2232222 2346779999999997643
No 348
>PLN03244 alpha-amylase; Provisional
Probab=24.51 E-value=59 Score=35.72 Aligned_cols=47 Identities=15% Similarity=0.167 Sum_probs=31.8
Q ss_pred CCEEEEEEeCCCcccCCCeec-c--------------cccchHHHHHHHHHHCCCeEEEccC
Q psy13819 119 LNALSTYVEWRSHEVHPGHYH-Y--------------DGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 119 ~N~Vr~yv~W~~hEP~~G~fd-f--------------~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
+-..+..|-=+..||+=|.|. | ...+|+.+||+.|+++||.|||..=
T Consensus 401 lrIYE~HvGms~~e~kv~ty~eF~~~vt~fFApssRYGTPeDLK~LVD~aH~~GI~VILDvV 462 (872)
T PLN03244 401 LRIYECHVGISGSEPKISSFEEFTEKVTNFFAASSRYGTPDDFKRLVDEAHGLGLLVFLDIV 462 (872)
T ss_pred ceEEEEEeeecCCCCCcccHHHHhhccCcccccCcccCCHHHHHHHHHHHHHCCCEEEEEec
Confidence 334444455455565555553 2 2246899999999999999999853
No 349
>smart00518 AP2Ec AP endonuclease family 2. These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites
Probab=24.23 E-value=1.8e+02 Score=26.63 Aligned_cols=72 Identities=10% Similarity=0.020 Sum_probs=43.7
Q ss_pred eEEeeeeecCC---CCChhcHHHHHHHHHHhCCCEEEE-EEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 90 FQFVSGSFHYF---RAPPGRWCWIMRAMRAAGLNALST-YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 90 ~~i~~g~~Hy~---r~p~~~W~~~l~~mk~~G~N~Vr~-yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
+.|..++.|.. +.+++..+..-+.+++.|+...-. ...-|...|.|.....+ .+.+.+.++.|++.|..+|+
T Consensus 27 vel~~~~~~~~~~~~~~~~~~~~l~~~~~~~gl~ls~h~p~~~nl~s~d~~~r~~~-~~~l~~~i~~A~~lGa~~vv 102 (273)
T smart00518 27 FQLFLGNPRSWKGVRLSEETAEKFKEALKENNIDVSVHAPYLINLASPDKEKVEKS-IERLIDEIKRCEELGIKALV 102 (273)
T ss_pred EEEECCCCCCCCCCCCCHHHHHHHHHHHHHcCCCEEEECCceecCCCCCHHHHHHH-HHHHHHHHHHHHHcCCCEEE
Confidence 33444444443 466777888788888899974322 11124444443333332 34588889999999998654
No 350
>PRK07094 biotin synthase; Provisional
Probab=24.18 E-value=73 Score=30.35 Aligned_cols=51 Identities=12% Similarity=-0.089 Sum_probs=31.3
Q ss_pred HHHHHHHHHhCCCEEEEEEeC---CCcccCCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEW---RSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W---~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
++.++.||++|++.|.+.+.= ..++---...++ ++..+.++.+++.|+.|-
T Consensus 129 ~e~l~~Lk~aG~~~v~~glEs~~~~~~~~i~~~~s~---~~~~~~i~~l~~~Gi~v~ 182 (323)
T PRK07094 129 YEEYKAWKEAGADRYLLRHETADKELYAKLHPGMSF---ENRIACLKDLKELGYEVG 182 (323)
T ss_pred HHHHHHHHHcCCCEEEeccccCCHHHHHHhCCCCCH---HHHHHHHHHHHHcCCeec
Confidence 577899999999998873320 011111111222 346677899999998753
No 351
>PRK09860 putative alcohol dehydrogenase; Provisional
Probab=24.13 E-value=2.6e+02 Score=27.59 Aligned_cols=66 Identities=12% Similarity=0.125 Sum_probs=43.8
Q ss_pred eeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 88 DPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 88 k~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
|+++++++..- .+ ...+++..+.+++.|+.++.+ +..+|+|- .+++++.++.+++.+..+|+..|-
T Consensus 32 ~~~livt~~~~-~~--~g~~~~v~~~L~~~~i~~~~f----~~v~~np~------~~~v~~~~~~~~~~~~D~IiaiGG 97 (383)
T PRK09860 32 TRTLIVTDNML-TK--LGMAGDVQKALEERNIFSVIY----DGTQPNPT------TENVAAGLKLLKENNCDSVISLGG 97 (383)
T ss_pred CEEEEEcCcch-hh--CccHHHHHHHHHHcCCeEEEe----CCCCCCcC------HHHHHHHHHHHHHcCCCEEEEeCC
Confidence 55556655311 11 246677777888999875433 45555542 455788899999999999998764
No 352
>cd07939 DRE_TIM_NifV Streptomyces rubellomurinus FrbC and related proteins, catalytic TIM barrel domain. FrbC (NifV) of Streptomyces rubellomurinus catalyzes the condensation of acetyl-CoA and alpha-ketoglutarate to form homocitrate and CoA, a reaction similar to one catalyzed by homocitrate synthase. The gene encoding FrbC is one of several genes required for the biosynthesis of FR900098, a potent antimalarial antibiotic. This protein is also required for assembly of the nitrogenase MoFe complex but its exact role is unknown. This family also includes the NifV proteins of Heliobacterium chlorum and Gluconacetobacter diazotrophicus, which appear to be orthologous to FrbC. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarbox
Probab=24.09 E-value=1.1e+02 Score=28.44 Aligned_cols=56 Identities=11% Similarity=0.108 Sum_probs=37.9
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccc---ccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYD---GHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~---g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.++++++++.|+..|+++++-+-.-- .-.++.+ ..+.+...++.|++.|+.|.+.+
T Consensus 72 ~~~v~~a~~~g~~~i~i~~~~s~~~~-~~~~~~~~~~~~~~~~~~i~~a~~~G~~v~~~~ 130 (259)
T cd07939 72 KEDIEAALRCGVTAVHISIPVSDIHL-AHKLGKDRAWVLDQLRRLVGRAKDRGLFVSVGA 130 (259)
T ss_pred HHHHHHHHhCCcCEEEEEEecCHHHH-HHHhCCCHHHHHHHHHHHHHHHHHCCCeEEEee
Confidence 56789999999999999886553210 0111211 13456788999999999877544
No 353
>PRK10150 beta-D-glucuronidase; Provisional
Probab=24.08 E-value=54 Score=34.37 Aligned_cols=40 Identities=18% Similarity=0.236 Sum_probs=23.7
Q ss_pred eeEEEEcCcccccccCCCCCeeEEEeccccccCCcc-eEEEEEE
Q psy13819 6 KGVLFANGRHWGRYWSEVGPQYSLFVPEEFLKVGTN-RITIFEL 48 (323)
Q Consensus 6 kg~~~vng~~lGryw~~~gpq~~lyvP~~~l~~g~N-~i~i~e~ 48 (323)
.-.|||||+.||+.-...-|- ++=| .+.|+.|+| +|+| ..
T Consensus 94 ~a~V~lNG~~vg~~~~~~~~f-~~DI-T~~l~~G~~n~L~V-~v 134 (604)
T PRK10150 94 YAKVWVNGQEVMEHKGGYTPF-EADI-TPYVYAGKSVRITV-CV 134 (604)
T ss_pred eEEEEECCEEeeeEcCCccce-EEeC-chhccCCCceEEEE-EE
Confidence 357999999999854211121 1114 346888876 5544 44
No 354
>PRK08883 ribulose-phosphate 3-epimerase; Provisional
Probab=24.06 E-value=2.1e+02 Score=26.21 Aligned_cols=41 Identities=20% Similarity=0.195 Sum_probs=32.1
Q ss_pred HHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 110 IMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 110 ~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
-++.+.++|.+.|.+ ..|.. .++.++|+.++++|+++-+-.
T Consensus 73 ~i~~~~~~gad~i~~-----H~Ea~---------~~~~~~l~~ik~~g~k~Glal 113 (220)
T PRK08883 73 IIPDFAKAGASMITF-----HVEAS---------EHVDRTLQLIKEHGCQAGVVL 113 (220)
T ss_pred HHHHHHHhCCCEEEE-----cccCc---------ccHHHHHHHHHHcCCcEEEEe
Confidence 367788899999999 34432 358899999999999876655
No 355
>PLN02587 L-galactose dehydrogenase
Probab=24.04 E-value=2e+02 Score=27.29 Aligned_cols=19 Identities=11% Similarity=0.223 Sum_probs=15.7
Q ss_pred HHHHHHHHHHCCCeEEEcc
Q psy13819 146 IEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 146 l~~fl~~a~e~GL~vilr~ 164 (323)
...+++.|+++|+-++..-
T Consensus 187 ~~~ll~~~~~~gi~v~a~s 205 (314)
T PLN02587 187 LEDLLPYLKSKGVGVISAS 205 (314)
T ss_pred HHHHHHHHHHcCceEEEec
Confidence 3578999999999988753
No 356
>cd08171 GlyDH-like2 Glycerol dehydrogenase-like. Glycerol dehydrogenases-like. The proteins in this family have not been characterized, but they show sequence homology with glycerol dehydrogenase. Glycerol dehydrogenases (GlyDH) is a key enzyme in the glycerol dissimilation pathway. In anaerobic conditions, many microorganisms utilize glycerol as a source of carbon through coupled oxidative and reductive pathways. One of the pathways involves the oxidation of glycerol to dihydroxyacetone with the reduction of NAD+ to NADH catalyzed by glycerol dehydrogenases. Dihydroxyacetone is then phosphorylated by dihydroxyacetone kinase and enters the glycolytic pathway for further degradation. The activity of GlyDH is zinc-dependent. The zinc ion plays a role in stabilizing an alkoxide intermediate at the active site.
Probab=23.99 E-value=1.4e+02 Score=28.97 Aligned_cols=65 Identities=9% Similarity=0.088 Sum_probs=38.7
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
|++.+++++...+ ....++..+.+++.|++.+.+. ..+++| + .+++++.++.+++.+.-+|+..|
T Consensus 22 ~~r~liv~d~~~~----~~~~~~v~~~l~~~~~~~~~~~----~~~~~p---~---~~~v~~~~~~~~~~~~d~iiavG 86 (345)
T cd08171 22 GKKVVVIGGKTAL----AAAKDKIKAALEQSGIEITDFI----WYGGES---T---YENVERLKKNPAVQEADMIFAVG 86 (345)
T ss_pred CCEEEEEeCHHHH----HHHHHHHHHHHHHCCCeEEEEE----ecCCCC---C---HHHHHHHHHHHhhcCCCEEEEeC
Confidence 4666777765433 1235555666777888765432 333333 1 34467777888888777777654
No 357
>PRK13962 bifunctional phosphoglycerate kinase/triosephosphate isomerase; Provisional
Probab=23.94 E-value=1.5e+02 Score=31.79 Aligned_cols=49 Identities=12% Similarity=0.112 Sum_probs=37.3
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.+++|++|++.|-+ .|--++..|.-+. ..+.+=++.|.++||.+|++.|
T Consensus 474 a~mLkd~G~~~vii-----GHSERR~~f~Etd-~~V~~K~~~al~~GL~pIvCVG 522 (645)
T PRK13962 474 GPMLAEIGVEYVII-----GHSERRQYFGETD-ELVNKKVLAALKAGLTPILCVG 522 (645)
T ss_pred HHHHHHcCCCEEEE-----CcccccCCcCcch-HHHHHHHHHHHHCCCEEEEEcC
Confidence 56899999999999 5655555565442 3344555999999999999997
No 358
>cd04724 Tryptophan_synthase_alpha Ttryptophan synthase (TRPS) alpha subunit (TSA). TPRS is a bifunctional tetrameric enzyme (2 alpha and 2 beta subunits) that catalyzes the last two steps of L-tryptophan biosynthesis. Alpha and beta subunit catalyze two distinct reactions which are both strongly stimulated by the formation of the complex. The alpha subunit catalyzes the cleavage of indole 3-glycerol phosphate (IGP) to indole and d-glyceraldehyde 3-phosphate (G3P). Indole is then channeled to the active site of the beta subunit, a PLP-dependent enzyme that catalyzes a replacement reaction to convert L-serine into L-tryptophan.
Probab=23.94 E-value=1.7e+02 Score=27.01 Aligned_cols=43 Identities=21% Similarity=0.222 Sum_probs=32.4
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++-+++++++|++.|-+. .. | + .++.++++.|+++|+.+++-.
T Consensus 94 ~~fi~~~~~aG~~giiip---Dl--~------~---ee~~~~~~~~~~~g~~~i~~i 136 (242)
T cd04724 94 ERFLRDAKEAGVDGLIIP---DL--P------P---EEAEEFREAAKEYGLDLIFLV 136 (242)
T ss_pred HHHHHHHHHCCCcEEEEC---CC--C------H---HHHHHHHHHHHHcCCcEEEEe
Confidence 556888999999999882 01 1 1 257889999999999877644
No 359
>KOG4039|consensus
Probab=23.87 E-value=1.8e+02 Score=26.58 Aligned_cols=67 Identities=16% Similarity=0.327 Sum_probs=38.9
Q ss_pred CCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc-chHHHHHHHHHHCCC--eEEEccCCcccc
Q psy13819 99 YFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH-RDIEHFLQLAVEEDL--YILLRPGPFICG 170 (323)
Q Consensus 99 y~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~-~dl~~fl~~a~e~GL--~vilr~GPyi~a 170 (323)
..++..++--+..+++|+-|+.++-+--+=..+-. -.|-+--. -.+ =+...|... ++|+||||..|-
T Consensus 103 fykvDhDyvl~~A~~AKe~Gck~fvLvSS~GAd~s--SrFlY~k~KGEv---E~~v~eL~F~~~~i~RPG~ll~~ 172 (238)
T KOG4039|consen 103 FYKVDHDYVLQLAQAAKEKGCKTFVLVSSAGADPS--SRFLYMKMKGEV---ERDVIELDFKHIIILRPGPLLGE 172 (238)
T ss_pred eEeechHHHHHHHHHHHhCCCeEEEEEeccCCCcc--cceeeeeccchh---hhhhhhccccEEEEecCcceecc
Confidence 44677788888899999999999877322112111 12222110 012 122334444 588999998874
No 360
>PRK15447 putative protease; Provisional
Probab=23.78 E-value=2.4e+02 Score=26.93 Aligned_cols=59 Identities=5% Similarity=-0.074 Sum_probs=41.6
Q ss_pred CCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 100 FRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 100 ~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+-||...-++-.+.+++.|.++|-+.. ..-.- +-+|+- +++.+.++.|+++|.+|.+.+
T Consensus 10 ~~~p~~~~~~~~~~~~~~gaDaVY~g~--~~~~~---R~~f~~-~~l~e~v~~~~~~gkkvyva~ 68 (301)
T PRK15447 10 YYWPKETVRDFYQRAADSPVDIVYLGE--TVCSK---RRELKV-GDWLELAERLAAAGKEVVLST 68 (301)
T ss_pred cCCCCCCHHHHHHHHHcCCCCEEEECC--ccCCC---ccCCCH-HHHHHHHHHHHHcCCEEEEEe
Confidence 357888888889999999988887731 11010 113543 568888999999999988865
No 361
>cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=23.69 E-value=1.2e+02 Score=28.28 Aligned_cols=55 Identities=20% Similarity=0.363 Sum_probs=37.6
Q ss_pred CCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 86 NEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.++.+...+-.+.| .+++.|++.+.+ ..+..||. .+++.+++++++++++++|+.
T Consensus 169 ~~~~~v~~H~af~Y-------------~~~~ygl~~~~~--~~~~~eps--------~~~l~~l~~~ik~~~v~~if~ 223 (266)
T cd01018 169 KQRAFMVYHPAWGY-------------FARDYGLTQIPI--EEEGKEPS--------PADLKRLIDLAKEKGVRVVFV 223 (266)
T ss_pred CCCeEEEECchhHH-------------HHHHcCCEEEec--CCCCCCCC--------HHHHHHHHHHHHHcCCCEEEE
Confidence 34455555555555 578889998754 22333333 567999999999999998874
No 362
>PF05378 Hydant_A_N: Hydantoinase/oxoprolinase N-terminal region; InterPro: IPR008040 This domain is found at the N terminus of the hydantoinase/oxoprolinase IPR002821 from INTERPRO family.
Probab=23.68 E-value=2.6e+02 Score=24.59 Aligned_cols=33 Identities=9% Similarity=0.142 Sum_probs=29.1
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccC
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVH 134 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~ 134 (323)
...+..+..++.+++.|+.+|-+...|+...|.
T Consensus 131 ld~~~v~~~~~~l~~~gv~avAV~~~fS~~np~ 163 (176)
T PF05378_consen 131 LDEDEVREALRELKDKGVEAVAVSLLFSYRNPE 163 (176)
T ss_pred CCHHHHHHHHHHHHhCCCCEEEEECccCCCCHH
Confidence 346789999999999999999999988888776
No 363
>cd08181 PPD-like 1,3-propanediol dehydrogenase-like (PPD). 1,3-propanediol dehydrogenase-like (PPD). This family is a member of the iron-containing alcohol dehydrogenase superfamily, and exhibits a dehydroquinate synthase-like fold. Protein sequence similarity search and other biochemical evidences suggest that they are close to the iron-containing 1,3-propanediol dehydrogenase (EC 1.1.1.202). 1,3-propanediol dehydrogenase catalyzes the oxidation of propane-1,3-diol to 3-hydroxypropanal with the simultaneous reduction of NADP+ to NADPH. The protein structure of Thermotoga maritima TM0920 gene contains one NADP+ and one iron ion.
Probab=23.44 E-value=2.6e+02 Score=27.22 Aligned_cols=67 Identities=15% Similarity=0.171 Sum_probs=43.5
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
||+.+++++..-+.+. ..+++..+.+++.|++... +...+|.|- .+++++.++.+++.+.-+|+..|
T Consensus 25 g~r~lvVt~~~~~~~~--g~~~~v~~~L~~~g~~~~~----~~~v~~~p~------~~~v~~~~~~~~~~~~D~IIavG 91 (357)
T cd08181 25 GKRALIVTGKSSAKKN--GSLDDVTKALEELGIEYEI----FDEVEENPS------LETIMEAVEIAKKFNADFVIGIG 91 (357)
T ss_pred CCEEEEEeCCchHhhc--CcHHHHHHHHHHcCCeEEE----eCCCCCCcC------HHHHHHHHHHHHhcCCCEEEEeC
Confidence 4666666664433221 2445666678888886432 245665542 34578889999999999998876
No 364
>cd03789 GT1_LPS_heptosyltransferase Lipopolysaccharide heptosyltransferase is involved in the biosynthesis of lipooligosaccharide (LOS). Lipopolysaccharide (LPS) is a major component of the outer membrane of gram-negative bacteria. LPS heptosyltransferase transfers heptose molecules from ADP-heptose to 3-deoxy-D-manno-octulosonic acid (KDO), a part of the inner core component of LPS. This family belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility.
Probab=23.32 E-value=1.8e+02 Score=26.65 Aligned_cols=76 Identities=20% Similarity=0.250 Sum_probs=52.2
Q ss_pred eEEeeeeecCC-CCChhcHHHHHHHHHHhCCCEEEEEEeCCCccc-----------CCCeecccccchHHHHHHHHHHCC
Q psy13819 90 FQFVSGSFHYF-RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEV-----------HPGHYHYDGHRDIEHFLQLAVEED 157 (323)
Q Consensus 90 ~~i~~g~~Hy~-r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP-----------~~G~fdf~g~~dl~~fl~~a~e~G 157 (323)
+.+..|+-+.. ++|.+.|.+.++.+++.|++.|-+. .-.|. .+...++.+..++..++.+.+...
T Consensus 124 i~i~~~~~~~~k~w~~~~~~~l~~~l~~~~~~ivl~g---~~~e~~~~~~i~~~~~~~~~~~~~~~~~l~e~~~li~~~~ 200 (279)
T cd03789 124 VVLPPGASGPAKRWPAERFAALADRLLARGARVVLTG---GPAERELAEEIAAALGGPRVVNLAGKTSLRELAALLARAD 200 (279)
T ss_pred EEECCCCCCccccCCHHHHHHHHHHHHHCCCEEEEEe---chhhHHHHHHHHHhcCCCccccCcCCCCHHHHHHHHHhCC
Confidence 44444444444 5788999999999998887776442 22211 234567777778888888888888
Q ss_pred CeEEEccCCcc
Q psy13819 158 LYILLRPGPFI 168 (323)
Q Consensus 158 L~vilr~GPyi 168 (323)
+.+-.+.||.-
T Consensus 201 l~I~~Dsg~~H 211 (279)
T cd03789 201 LVVTNDSGPMH 211 (279)
T ss_pred EEEeeCCHHHH
Confidence 88888777643
No 365
>PTZ00445 p36-lilke protein; Provisional
Probab=23.18 E-value=1.7e+02 Score=27.12 Aligned_cols=62 Identities=10% Similarity=0.095 Sum_probs=42.1
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCC-Ceecc---------cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHP-GHYHY---------DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~-G~fdf---------~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+.+.-+.-.+++++.|+++|-+-+-=-.+.-.- |-.+. +...++..+++.+.+.||.|++-+
T Consensus 27 ~~~~~~~~v~~L~~~GIk~Va~D~DnTlI~~HsgG~~~~~~~~~~~~~~~tpefk~~~~~l~~~~I~v~VVT 98 (219)
T PTZ00445 27 PHESADKFVDLLNECGIKVIASDFDLTMITKHSGGYIDPDNDDIRVLTSVTPDFKILGKRLKNSNIKISVVT 98 (219)
T ss_pred HHHHHHHHHHHHHHcCCeEEEecchhhhhhhhcccccCCCcchhhhhccCCHHHHHHHHHHHHCCCeEEEEE
Confidence 356667778899999999999854322222221 22222 245678899999999999987644
No 366
>PRK09249 coproporphyrinogen III oxidase; Provisional
Probab=23.14 E-value=90 Score=31.57 Aligned_cols=47 Identities=19% Similarity=0.140 Sum_probs=30.9
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc----cccchHHHHHHHHHHCCC
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY----DGHRDIEHFLQLAVEEDL 158 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf----~g~~dl~~fl~~a~e~GL 158 (323)
++.++.|+++|+|.|++.|- ...++ .-+. ...+++.+.++.+++.|+
T Consensus 151 ~e~l~~l~~aG~~risiGvq--S~~~~--~L~~l~r~~~~~~~~~ai~~l~~~G~ 201 (453)
T PRK09249 151 LEMLDALRELGFNRLSLGVQ--DFDPE--VQKAVNRIQPFEFTFALVEAARELGF 201 (453)
T ss_pred HHHHHHHHHcCCCEEEECCC--CCCHH--HHHHhCCCCCHHHHHHHHHHHHHcCC
Confidence 57899999999999998432 11111 1111 122357778999999998
No 367
>PF00282 Pyridoxal_deC: Pyridoxal-dependent decarboxylase conserved domain; InterPro: IPR002129 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). Pyridoxal 5'-phosphate (PLP) is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination [, , ]. PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors []. Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy []. PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalysed reactions, enzymatic and non-enzymatic []. A number of pyridoxal-dependent decarboxylases share regions of sequence similarity, particularly in the vicinity of a conserved lysine residue, which provides the attachment site for the pyridoxal-phosphate (PLP) group [, ]. Among these enzymes are aromatic-L-amino-acid decarboxylase (L-dopa decarboxylase or tryptophan decarboxylase), which catalyses the decarboxylation of tryptophan to tryptamine []; tyrosine decarboxylase, which converts tyrosine into tyramine; and histidine decarboxylase, which catalyses the decarboxylation of histidine to histamine []. These enzymes belong to the group II decarboxylases [, ].; GO: 0016831 carboxy-lyase activity, 0030170 pyridoxal phosphate binding, 0019752 carboxylic acid metabolic process; PDB: 3MC6_A 1XEY_A 1ES0_B 2OKK_A 2JIS_B 2QMA_A 3MAF_B 3MAD_B 3MAU_A 3MBB_A ....
Probab=23.11 E-value=1.8e+02 Score=28.61 Aligned_cols=72 Identities=14% Similarity=0.188 Sum_probs=42.1
Q ss_pred CCeeeEEeeeeecCC---------------------CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccc
Q psy13819 86 NEDPFQFVSGSFHYF---------------------RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHR 144 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~---------------------r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~ 144 (323)
.+++..+.|.+-||. |+..+.-++.|++..+.|..-+-+- ..-|..+....|
T Consensus 139 ~~~~~i~~s~~aH~S~~Kaa~~lGlg~~~I~~~~~~~md~~~L~~~l~~~~~~g~~p~~vv-------at~Gtt~~Ga~D 211 (373)
T PF00282_consen 139 IPKPVIYVSEQAHYSIEKAARILGLGVRKIPTDEDGRMDIEALEKALEKDIANGKTPFAVV-------ATAGTTNTGAID 211 (373)
T ss_dssp CSSEEEEEETTS-THHHHHHHHTTSEEEEE-BBTTSSB-HHHHHHHHHHHHHTTEEEEEEE-------EEBS-TTTSBB-
T ss_pred ccccccccccccccHHHHhcceeeeEEEEecCCcchhhhHHHhhhhhcccccccccceeee-------ccCCCccccccc
Confidence 346777788888872 2333445555566666666432221 123566666677
Q ss_pred hHHHHHHHHHHCCCeEEEcc
Q psy13819 145 DIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 145 dl~~fl~~a~e~GL~vilr~ 164 (323)
++.++.++|+++++++-|.-
T Consensus 212 ~l~~i~~i~~~~~~wlHVDa 231 (373)
T PF00282_consen 212 PLEEIADICEKYNIWLHVDA 231 (373)
T ss_dssp SHHHHHHHHHHCT-EEEEEE
T ss_pred CHHHHhhhccccceeeeecc
Confidence 88888888888888887764
No 368
>PLN02376 1-aminocyclopropane-1-carboxylate synthase
Probab=23.07 E-value=2.3e+02 Score=29.02 Aligned_cols=59 Identities=3% Similarity=0.140 Sum_probs=39.7
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
++..+..+..++.+++.+.++=-+++. |-|.|.--.++-+- +.++++.|+++|+.+|..
T Consensus 180 ~~~~~~le~a~~~a~~~~~~~k~l~l~-nP~NPTG~~~s~e~---l~~L~~~a~~~~i~lI~D 238 (496)
T PLN02376 180 KLTVDAADWAYKKAQESNKKVKGLILT-NPSNPLGTMLDKDT---LTNLVRFVTRKNIHLVVD 238 (496)
T ss_pred cCCHHHHHHHHHHHHhcCCCeeEEEEc-CCCCCCCccCCHHH---HHHHHHHHHHcCCEEEEE
Confidence 344455555555555555554335666 77888866776655 777899999999998875
No 369
>CHL00200 trpA tryptophan synthase alpha subunit; Provisional
Probab=23.05 E-value=1.3e+02 Score=28.41 Aligned_cols=44 Identities=9% Similarity=0.104 Sum_probs=33.5
Q ss_pred HHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 107 WCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 107 W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
-++-++.++++|++.|-+. -+|- .+..++++.|+++||..|.-.
T Consensus 108 ~e~F~~~~~~aGvdgviip-----DLP~---------ee~~~~~~~~~~~gi~~I~lv 151 (263)
T CHL00200 108 INKFIKKISQAGVKGLIIP-----DLPY---------EESDYLISVCNLYNIELILLI 151 (263)
T ss_pred HHHHHHHHHHcCCeEEEec-----CCCH---------HHHHHHHHHHHHcCCCEEEEE
Confidence 4667899999999999992 2221 346788999999999877544
No 370
>PRK13347 coproporphyrinogen III oxidase; Provisional
Probab=23.04 E-value=1.4e+02 Score=30.18 Aligned_cols=47 Identities=15% Similarity=0.137 Sum_probs=32.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-----CCeecccccchHHHHHHHHHHCCCe
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-----PGHYHYDGHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-----~G~fdf~g~~dl~~fl~~a~e~GL~ 159 (323)
++.++.||++|+|.|++.|- ...++ ....+ .+++.+.++.+++.|+.
T Consensus 152 ~e~l~~L~~~G~~rvsiGvQ--S~~~~vl~~l~R~~~---~~~~~~ai~~lr~~G~~ 203 (453)
T PRK13347 152 AEMLQALAALGFNRASFGVQ--DFDPQVQKAINRIQP---EEMVARAVELLRAAGFE 203 (453)
T ss_pred HHHHHHHHHcCCCEEEECCC--CCCHHHHHHhCCCCC---HHHHHHHHHHHHhcCCC
Confidence 68899999999999998442 11111 11122 23577889999999985
No 371
>cd08194 Fe-ADH6 Iron-containing alcohol dehydrogenases-like. Iron-containing alcohol dehydrogenase-like. Proteins of this family have not been characterized. Their specific function is unknown. The protein structure represents a dehydroquinate synthase-like fold and belongs to the alcohol dehydrogenase-like superfamily. They are distinct from other alcohol dehydrogenases which contain different protein domains. Alcohol dehydrogenase catalyzes the reduction of acetaldehyde to alcohol with NADP as cofactor. Its activity requires iron ions.
Probab=22.87 E-value=3e+02 Score=27.00 Aligned_cols=67 Identities=13% Similarity=0.150 Sum_probs=44.1
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
+++.+++++..=+ + ....++..+.+++.|++...+ ...+++|- .+++++.++.+++.+.-+|+..|-
T Consensus 23 ~~r~livt~~~~~-~--~g~~~~v~~~L~~~gi~~~~~----~~v~~~p~------~~~v~~~~~~~~~~~~D~IIaiGG 89 (375)
T cd08194 23 GKRPLIVTDKVMV-K--LGLVDKLTDSLKKEGIESAIF----DDVVSEPT------DESVEEGVKLAKEGGCDVIIALGG 89 (375)
T ss_pred CCeEEEEcCcchh-h--cchHHHHHHHHHHCCCeEEEE----CCCCCCcC------HHHHHHHHHHHHhcCCCEEEEeCC
Confidence 3556666653222 1 235667777888889875432 44555543 455788899999999999998763
No 372
>cd07493 Peptidases_S8_9 Peptidase S8 family domain, uncharacterized subfamily 9. This family is a member of the Peptidases S8 or Subtilases serine endo- and exo-peptidase clan. They have an Asp/His/Ser catalytic triad similar to that found in trypsin-like proteases, but do not share their three-dimensional structure and are not homologous to trypsin. The stability of subtilases may be enhanced by calcium, some members have been shown to bind up to 4 ions via binding sites with different affinity. Some members of this clan contain disulfide bonds. These enzymes can be intra- and extracellular, some function at extreme temperatures and pH values.
Probab=22.82 E-value=2.3e+02 Score=25.80 Aligned_cols=62 Identities=13% Similarity=0.270 Sum_probs=43.2
Q ss_pred hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeeccccc----chHHHHHHHHHHCCCeEEEccC
Q psy13819 104 PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGH----RDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 104 ~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~----~dl~~fl~~a~e~GL~vilr~G 165 (323)
...|...++.+.+.|++.|.+.+-+......+..|.+... ..+.+.++.|.+.|+.++...|
T Consensus 90 ~~~~~~ai~~a~~~~v~VIn~S~G~~~~~~~~~~~~~~~~~~~~~~l~~a~~~a~~~gilvv~AAG 155 (261)
T cd07493 90 EDNWVAAAEWADSLGVDIISSSLGYTTFDNPTYSYTYADMDGKTSFISRAANIAASKGMLVVNSAG 155 (261)
T ss_pred HHHHHHHHHHHHHcCCCEEEeCCCcCCCCCcccccccccccccchHHHHHHHHHHhCCeEEEEECC
Confidence 4568888888899999999986555443333222332211 3477788999999999998887
No 373
>PRK14041 oxaloacetate decarboxylase; Provisional
Probab=22.66 E-value=8.5e+02 Score=25.04 Aligned_cols=50 Identities=10% Similarity=0.135 Sum_probs=40.4
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.|.+.-+..++++.+.|+..||+..+-|. .+++.+.++.|+++|+.|...
T Consensus 92 ~~dDvv~~fv~~A~~~Gvd~irif~~lnd------------~~n~~~~i~~ak~~G~~v~~~ 141 (467)
T PRK14041 92 YADDVVELFVKKVAEYGLDIIRIFDALND------------IRNLEKSIEVAKKHGAHVQGA 141 (467)
T ss_pred ccchhhHHHHHHHHHCCcCEEEEEEeCCH------------HHHHHHHHHHHHHCCCEEEEE
Confidence 45555667799999999999999877664 457888999999999987743
No 374
>PLN02229 alpha-galactosidase
Probab=22.62 E-value=1.8e+02 Score=29.68 Aligned_cols=72 Identities=14% Similarity=0.126 Sum_probs=49.4
Q ss_pred CCCCChhcHHHHHHHH-----HHhCCCEEEEEEeCCCccc-CCCee--ccccc-chHHHHHHHHHHCCCe--EEEccCCc
Q psy13819 99 YFRAPPGRWCWIMRAM-----RAAGLNALSTYVEWRSHEV-HPGHY--HYDGH-RDIEHFLQLAVEEDLY--ILLRPGPF 167 (323)
Q Consensus 99 y~r~p~~~W~~~l~~m-----k~~G~N~Vr~yv~W~~hEP-~~G~f--df~g~-~dl~~fl~~a~e~GL~--vilr~GPy 167 (323)
+..+..+..++-.+.| |++|.+.|-+---|...+. +-|.+ |-+.+ +-+..+.+..++.||+ +-..+|..
T Consensus 74 ~~~i~E~~i~~~ad~~v~~Gl~~~Gy~yv~iDDgW~~~~rd~~G~l~~d~~rFP~G~k~ladyiH~~GlKfGIy~d~G~~ 153 (427)
T PLN02229 74 ACNINETVIKETADALVSTGLADLGYIHVNIDDCWSNLKRDSKGQLVPDPKTFPSGIKLLADYVHSKGLKLGIYSDAGVF 153 (427)
T ss_pred CcccCHHHHHHHHHHHHHhHHHhCCCEEEEEcCCcCCCCcCCCCCEEEChhhcCCcHHHHHHHHHHCCCceEEeccCCCc
Confidence 3456667777777764 9999999999888964321 12433 21111 1277779999999998 45678888
Q ss_pred ccc
Q psy13819 168 ICG 170 (323)
Q Consensus 168 i~a 170 (323)
-|+
T Consensus 154 TC~ 156 (427)
T PLN02229 154 TCQ 156 (427)
T ss_pred ccC
Confidence 886
No 375
>PRK06756 flavodoxin; Provisional
Probab=22.60 E-value=4.4e+02 Score=21.73 Aligned_cols=99 Identities=10% Similarity=-0.022 Sum_probs=57.7
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
.-...+++...|.....|..+.+-++.++...++...+.++=+.- . .|.+. -.-+..+.+.+.+.|..++..+
T Consensus 49 ~~d~vi~gspt~~~g~~p~~~~~fl~~l~~~~l~~k~~~~fgt~~-~---~y~~~-~~a~~~l~~~l~~~g~~~v~~~-- 121 (148)
T PRK06756 49 QYDGIILGAYTWGDGDLPDDFLDFYDAMDSIDLTGKKAAVFGSCD-S---AYPKY-GVAVDILIEKLQERGAAVVLEG-- 121 (148)
T ss_pred cCCeEEEEeCCCCCCCCcHHHHHHHHHHhcCCCCCCEEEEEeCCC-C---chHHH-HHHHHHHHHHHHHCCCEEcCCC--
Confidence 334567777666555555668888888877666665554431110 0 12211 1346777888999999888754
Q ss_pred ccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHHHHHHHHHH
Q psy13819 167 FICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRWFQELFPRI 212 (323)
Q Consensus 167 yi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~~~~l~~~i 212 (323)
+.. ..+. ++.-++.+..|.+++++.+
T Consensus 122 -~~~------------------~~~p-~~~d~~~~~~~~~~~~~~~ 147 (148)
T PRK06756 122 -LKV------------------ELTP-EDEDVEKCLQFGAEFVKHL 147 (148)
T ss_pred -eEE------------------ecCC-CHHHHHHHHHHHHHHHHhc
Confidence 111 1111 2344777777777776653
No 376
>TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative. This family consists of examples of the putative ADP-heptose:LPS heptosyltransferase III, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria. This enzyme may be less widely distributed than heptosyltransferases I and II.
Probab=22.54 E-value=1.9e+02 Score=27.50 Aligned_cols=65 Identities=18% Similarity=0.241 Sum_probs=46.0
Q ss_pred CCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCc--c-----------cCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 99 YFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSH--E-----------VHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 99 y~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~h--E-----------P~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
+=|+|.+.|.+.++.+.+-|+..|=+ ..-. | +.+...|+.|..+|..++.+.+...+.|=...|
T Consensus 194 ~K~Wp~e~~~~l~~~l~~~~~~ivl~---g~p~~~e~~~~~~i~~~~~~~~~~~l~g~~sL~el~ali~~a~l~Vs~DSG 270 (344)
T TIGR02201 194 FKCWDNDRFSALIDALHARGYEVVLT---SGPDKDELAMVNEIAQGCQTPRVTSLAGKLTLPQLAALIDHARLFIGVDSV 270 (344)
T ss_pred ccCCCHHHHHHHHHHHHhCCCeEEEe---cCCCHHHHHHHHHHHhhCCCCcccccCCCCCHHHHHHHHHhCCEEEecCCH
Confidence 34789999999999998777665543 3311 1 223467788877788888888888887777777
Q ss_pred C
Q psy13819 166 P 166 (323)
Q Consensus 166 P 166 (323)
|
T Consensus 271 p 271 (344)
T TIGR02201 271 P 271 (344)
T ss_pred H
Confidence 6
No 377
>TIGR01210 conserved hypothetical protein TIGR01210. This family of exclusively archaeal proteins has no characterized close homologs. Several rounds of PSI-BLAST with a stringent cutoff of 1e-8 shows apparent similarity of the central region of this family to the central regions of the oxygen-independent coproporphyrinogen III dehydrogenase HemN and to other enzymes.
Probab=22.52 E-value=1.2e+02 Score=29.07 Aligned_cols=62 Identities=13% Similarity=0.030 Sum_probs=0.0
Q ss_pred HHHHHHHHHhCCC-EEEEEEeCCCcccC-CCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCc
Q psy13819 108 CWIMRAMRAAGLN-ALSTYVEWRSHEVH-PGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFP 178 (323)
Q Consensus 108 ~~~l~~mk~~G~N-~Vr~yv~W~~hEP~-~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P 178 (323)
++.|+.||++|+| .|.+.+.=..-+-. ...=-=...+++.+.++.++++|+.|.+-. .-|+|
T Consensus 117 ~e~L~~l~~aG~~~~v~iG~ES~~d~~L~~~inKg~t~~~~~~ai~~~~~~Gi~v~~~~---------i~G~P 180 (313)
T TIGR01210 117 EEKLEELRKIGVNVEVAVGLETANDRIREKSINKGSTFEDFIRAAELARKYGAGVKAYL---------LFKPP 180 (313)
T ss_pred HHHHHHHHHcCCCEEEEEecCcCCHHHHHHhhCCCCCHHHHHHHHHHHHHcCCcEEEEE---------EecCC
No 378
>cd00384 ALAD_PBGS Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll. This reaction involves the formation of a Schiff base link between the substrate and the enzyme. PBGSs are metalloenzymes, some of which have a second, allosteric metal binding site, beside the metal ion binding site in their active site. Although PBGS is a family of homologous enzymes, its metal ion utilization at catalytic site varies between zinc and magnesium and/or potassium. PBGS can be classified into two groups based on differences in their active site metal binding site. They either contain a cysteine-rich zinc binding site (consensus DXCXCX(Y/F)X3G(H/Q)CG) or an aspartate-rich magnesium binding site (consensus DXALDX(Y/F)X3G(H/Q)DG). The cyste
Probab=22.50 E-value=7.1e+02 Score=24.37 Aligned_cols=32 Identities=13% Similarity=0.159 Sum_probs=26.4
Q ss_pred cCCCeecccccchHHHHHHHHHHCCCeEEEccC
Q psy13819 133 VHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPG 165 (323)
Q Consensus 133 P~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~G 165 (323)
.-||+|.++- +.+.+.++.+.+.||..++-+|
T Consensus 41 sMPG~~r~s~-d~l~~~~~~~~~~Gi~~v~LFg 72 (314)
T cd00384 41 SMPGVYRLSV-DSLVEEAEELADLGIRAVILFG 72 (314)
T ss_pred CCCCceeeCH-HHHHHHHHHHHHCCCCEEEEEC
Confidence 3489999985 6688999999999999777664
No 379
>cd01311 PDC_hydrolase 2-pyrone-4,6-dicarboxylic acid (PDC) hydrolase hydrolyzes PDC to yield 4-oxalomesaconic acid (OMA) or its tautomer, 4-carboxy-2-hydroxymuconic acid (CHM). This reaction is part of the protocatechuate (PCA) 4,5-cleavage pathway. PCA is one of the most important intermediate metabolites in the bacterial pathways for various phenolic compounds, including lignin, which is the most abundant aromatic material in nature.
Probab=22.49 E-value=2e+02 Score=26.47 Aligned_cols=50 Identities=12% Similarity=0.115 Sum_probs=36.3
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
.+.|+.+++.|+-.||+...+. |..|- ..+.++++.|+++|+-|.+..+|
T Consensus 83 ~~~l~~~~~~g~rGvRl~~~~~------~~~~~---~~~~~~~~~~~~~gl~v~~~~~~ 132 (263)
T cd01311 83 DAELKEMHDAGVRGVRFNFLFG------GVDNK---DELDEIAKRAAELGWHVQVYFDA 132 (263)
T ss_pred HHHHHHHHHCCCeEEEEecccC------CCCCH---HHHHHHHHHHHHcCCEEEEEeCH
Confidence 5778888999999999854321 11122 33678899999999998887654
No 380
>cd01335 Radical_SAM Radical SAM superfamily. Enzymes of this family generate radicals by combining a 4Fe-4S cluster and S-adenosylmethionine (SAM) in close proximity. They are characterized by a conserved CxxxCxxC motif, which coordinates the conserved iron-sulfur cluster. Mechanistically, they share the transfer of a single electron from the iron-sulfur cluster to SAM, which leads to its reductive cleavage to methionine and a 5'-deoxyadenosyl radical, which, in turn, abstracts a hydrogen from the appropriately positioned carbon atom. Depending on the enzyme, SAM is consumed during this process or it is restored and reused. Radical SAM enzymes catalyze steps in metabolism, DNA repair, the biosynthesis of vitamins and coenzymes, and the biosynthesis of many antibiotics. Examples are biotin synthase (BioB), lipoyl synthase (LipA), pyruvate formate-lyase (PFL), coproporphyrinogen oxidase (HemN), lysine 2,3-aminomutase (LAM), anaerobic ribonucleotide reductase (ARR), and MoaA, an enzyme o
Probab=22.44 E-value=1.1e+02 Score=25.44 Aligned_cols=57 Identities=12% Similarity=0.105 Sum_probs=34.8
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCee-cccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHY-HYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~f-df~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.++.|+++|+..|.+.+.-.......-.- .-...++..+.++.+.+.|+.+.+..
T Consensus 88 ~~~~~~l~~~g~~~i~i~le~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 145 (204)
T cd01335 88 EELLKELKELGLDGVGVSLDSGDEEVADKIRGSGESFKERLEALKELREAGLGLSTTL 145 (204)
T ss_pred HHHHHHHHhCCCceEEEEcccCCHHHHHHHhcCCcCHHHHHHHHHHHHHcCCCceEEE
Confidence 5778889999999998866543322111000 11122346677888888888766543
No 381
>TIGR03566 FMN_reduc_MsuE FMN reductase, MsuE subfamily. Members of this protein family use NAD(P)H to reduce FMN and regenerate FMNH2. Members include the NADH-dependent enzyme MsuE from Pseudomonas aeruginosa, which serves as a partner to an FMNH2-dependent alkanesulfonate monooxygenase. The NADP-dependent enzyme from E. coli is outside the scope of this model.
Probab=22.43 E-value=2.9e+02 Score=23.67 Aligned_cols=83 Identities=17% Similarity=0.133 Sum_probs=47.8
Q ss_pred eEEeeeeecCCCCChhcHHHHHHHHH-HhCCCEEEEEEeCCCcccCCCe--ecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 90 FQFVSGSFHYFRAPPGRWCWIMRAMR-AAGLNALSTYVEWRSHEVHPGH--YHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 90 ~~i~~g~~Hy~r~p~~~W~~~l~~mk-~~G~N~Vr~yv~W~~hEP~~G~--fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
+.+++|+..-.......-+.-.+.+. +.|..+..+.+ ....|.-+. +...-.+++..+.+...+.+..||..|
T Consensus 2 Il~i~GS~r~~s~t~~l~~~~~~~l~~~~g~ev~~idL--~~~~~~~~~~~~~~~~~~~~~~~~~~i~~AD~iIi~tP-- 77 (174)
T TIGR03566 2 VVGVSGSLTRPSRTLALVEALVAELAARLGISPRTIDL--ADLAPSLGGALWRSQLPPDAERILQAIESADLLVVGSP-- 77 (174)
T ss_pred EEEEECCCCCCChHHHHHHHHHHHHHHhcCCeEEEEEh--hhcChhhccccccCCCCHHHHHHHHHHHHCCEEEEECC--
Confidence 56778877743334444455555554 45765544443 122221111 111234678999999999999999988
Q ss_pred ccccccCCCCCcccc
Q psy13819 167 FICGKRDFGGFPPWL 181 (323)
Q Consensus 167 yi~aEw~~gg~P~Wl 181 (323)
|| ++++|.-|
T Consensus 78 ----~Y-~~s~~~~L 87 (174)
T TIGR03566 78 ----VY-RGSYTGLF 87 (174)
T ss_pred ----cC-cCcCcHHH
Confidence 33 45565543
No 382
>cd00502 DHQase_I Type I 3-dehydroquinase, (3-dehydroquinate dehydratase or DHQase.) Catalyzes the cis-dehydration of 3-dehydroquinate via a covalent imine intermediate to produce dehydroshikimate. Dehydroquinase is the third enzyme in the shikimate pathway, which is involved in the biosynthesis of aromatic amino acids. Type I DHQase exists as a homodimer. Type II 3-dehydroquinase also catalyzes the same overall reaction, but is unrelated in terms of sequence and structure, and utilizes a completely different reaction mechanism.
Probab=22.35 E-value=1.2e+02 Score=27.53 Aligned_cols=30 Identities=23% Similarity=0.366 Sum_probs=25.5
Q ss_pred eeecCCCCC--hhcHHHHHHHHHHhCCCEEEE
Q psy13819 95 GSFHYFRAP--PGRWCWIMRAMRAAGLNALST 124 (323)
Q Consensus 95 g~~Hy~r~p--~~~W~~~l~~mk~~G~N~Vr~ 124 (323)
++.|.|.-+ .+.|.+.+++|++.|.+.|++
T Consensus 118 ~S~H~f~~tp~~~~l~~~~~~~~~~gadivKl 149 (225)
T cd00502 118 GSYHDFSGTPSDEELVSRLEKMAALGADIVKI 149 (225)
T ss_pred EEeccCCCCcCHHHHHHHHHHHHHhCCCEEEE
Confidence 588988765 468889999999999999999
No 383
>COG0219 CspR Predicted rRNA methylase (SpoU class) [Translation, ribosomal structure and biogenesis]
Probab=22.34 E-value=68 Score=28.09 Aligned_cols=18 Identities=33% Similarity=0.580 Sum_probs=13.5
Q ss_pred CCCCCcccccccCCCeee
Q psy13819 173 DFGGFPPWLLKVAPDILL 190 (323)
Q Consensus 173 ~~gg~P~Wl~~~~p~~~~ 190 (323)
+..|+|.++.++.|.-.+
T Consensus 104 Es~GLP~~i~~~~~~~~i 121 (155)
T COG0219 104 ESRGLPEEILDAAPDRCI 121 (155)
T ss_pred CCCCCCHHHHHhCccceE
Confidence 678999999887665433
No 384
>PRK08931 5'-methylthioadenosine phosphorylase; Provisional
Probab=22.33 E-value=1.2e+02 Score=29.30 Aligned_cols=39 Identities=26% Similarity=0.489 Sum_probs=31.5
Q ss_pred EeCCeeeEEee--eeecCCCCChhc--HHHHHHHHHHhCCCEEEE
Q psy13819 84 RLNEDPFQFVS--GSFHYFRAPPGR--WCWIMRAMRAAGLNALST 124 (323)
Q Consensus 84 ~~dGk~~~i~~--g~~Hy~r~p~~~--W~~~l~~mk~~G~N~Vr~ 124 (323)
.++|+++.++. |..|+. |+.. .+.-+..||++|+..|=.
T Consensus 43 ~l~G~~V~~l~RhGr~H~y--~p~~i~~rAni~alk~lGv~~ii~ 85 (289)
T PRK08931 43 RLGGVPMVFLPRHGRGHRL--SPSDINYRANIDALKRAGVTDIVS 85 (289)
T ss_pred EECCEEEEEEeCCCCCCcc--ChHHcccHHHHHHHHHcCCCEEEE
Confidence 47999999994 999975 5443 566899999999998865
No 385
>TIGR01370 cysRS possible cysteinyl-tRNA synthetase, Methanococcus type. Assignment of this protein family as cysteinyl-tRNA synthetase is controversial, supported by PubMed:11333988 but challenged by PubMed:14679218. Members of this family from Deinococcus radiodurans (bacterial) and Methanococcus jannaschii (archaeal), species lacking a conventional cysteinyl-tRNA synthetase (Cys--tRNA ligase), have been indicated to be a novel form of that enzyme, perhaps distantly related to class I tRNA ligases. The member from Thermotoga maritima is presumed to be a second isozyme of cysteinyl-tRNA synthetase. A number of homologous but more distantly related proteins are annotated as alpha-1,4 polygalactosaminidases.
Probab=22.29 E-value=5.5e+02 Score=24.99 Aligned_cols=154 Identities=14% Similarity=0.134 Sum_probs=77.8
Q ss_pred HHHHHHCCCeEEE--ccCC------ccccccCCCCCcccccccCCCe----eecCCCHhHHHHHHHHHHHHHHHHHhhhc
Q psy13819 150 LQLAVEEDLYILL--RPGP------FICGKRDFGGFPPWLLKVAPDI----LLRQNHPVYQRYVTRWFQELFPRIQKYLY 217 (323)
Q Consensus 150 l~~a~e~GL~vil--r~GP------yi~aEw~~gg~P~Wl~~~~p~~----~~R~~~~~f~~~~~~~~~~l~~~i~~~~~ 217 (323)
|+.+++.|-+|+. -.|- |-..+|. ...|.|+....|++ .++..+|.+++.+...++.+.++ - +
T Consensus 87 i~~Lk~~g~~viaYlSvGe~E~~R~y~~~~~~-~~~~~~l~~~n~~W~g~~~vd~~~~~W~~il~~rl~~l~~k--G--f 161 (315)
T TIGR01370 87 IVRAAAAGRWPIAYLSIGAAEDYRFYWQKGWK-VNAPAWLGNEDPDWPGNYDVKYWDPEWKAIAFSYLDRVIAQ--G--F 161 (315)
T ss_pred HHHHHhCCcEEEEEEEchhccccchhhhhhhh-cCCHHHhCCCCCCCCCceeEecccHHHHHHHHHHHHHHHHc--C--C
Confidence 5666677887774 3331 1111121 14577887655554 24555677776666544443322 0 0
Q ss_pred cCC---CcEEEEccCCCcCCCCC----ChHHHHHHHHHHHHHhcCCceEEEeeCCCCcccccc-------ccCc-----e
Q psy13819 218 GND---RPIILVQVENEYGSDAE----CDPAHAVWLRDLLRTYVQDKAVLYSTDGAFDAYLRC-------TVDG-----V 278 (323)
Q Consensus 218 ~n~---g~Vi~~QieNE~g~~~~----~~~~Y~~~l~~~~~~~~g~~~~l~t~dg~~~~~~~~-------~~~g-----~ 278 (323)
+| .+|=++|..++-|.... .-.+|++.|.+.+|+. ..+.++..++|..- +.+ .+.| +
T Consensus 162 -DGvfLD~lDsy~~~~~~~~~~~~~~~~m~~~i~~Ia~~ar~~-~P~~~II~NnG~ei--l~~~~g~~~~~idgV~~Esl 237 (315)
T TIGR01370 162 -DGVYLDLIDAFEYWAENGDNRPGAAAEMIAFVCEIAAYARAQ-NPQFVIIPQNGEEL--LRDDHGGLAATVSGWAVEEL 237 (315)
T ss_pred -CeEeeccchhhhhhcccCCcchhhHHHHHHHHHHHHHHHHHH-CCCEEEEecCchhh--hhccccchhhhceEEEecce
Confidence 00 23445554444332111 1157888888888885 56677777776542 221 2222 3
Q ss_pred EEeecCCCCCc--hhHHHHHHHhhC-CCCCeEEEEecc
Q psy13819 279 YSTVDFTVFKD--VNVSFQAQRTRA-PQGPLVNAEFEF 313 (323)
Q Consensus 279 ~~~~nfg~~~~--~~~~~~~~r~~~-p~~Plm~~E~~~ 313 (323)
+. ...+..+. -+...+.|++.. .+.|++..||-.
T Consensus 238 f~-~~~~~~~e~dr~~~l~~L~~~~~~G~~Vl~IDY~~ 274 (315)
T TIGR01370 238 FY-YAANRPTEAERQRRLLALYRLWQQGKFVLTVDYVD 274 (315)
T ss_pred EE-cCCCCCCHHHHHHHHHHHHHHHHCCCcEEEEEecC
Confidence 31 11222111 122334454443 378999999965
No 386
>PF00218 IGPS: Indole-3-glycerol phosphate synthase; InterPro: IPR013798 Indole-3-glycerol phosphate synthase (4.1.1.48 from EC) (IGPS) catalyses the fourth step in the biosynthesis of tryptophan, the ring closure of 1-(2-carboxy-phenylamino)-1-deoxyribulose into indol-3-glycerol-phosphate. In some bacteria, IGPS is a single chain enzyme. In others, such as Escherichia coli, it is the N-terminal domain of a bifunctional enzyme that also catalyses N-(5'-phosphoribosyl)anthranilate isomerase (5.3.1.24 from EC) (PRAI) activity (see IPR001240 from INTERPRO), the third step of tryptophan biosynthesis. In fungi, IGPS is the central domain of a trifunctional enzyme that contains a PRAI C-terminal domain and a glutamine amidotransferase (2.4.2 from EC) (GATase) N-terminal domain (see IPR000991 from INTERPRO). A structure of the IGPS domain of the bifunctional enzyme from the mesophilic bacterium E. coli (eIGPS) has been compared with the monomeric indole-3-glycerol phosphate synthase from the hyperthermophilic archaeon Sulfolobus solfataricus (sIGPS). Both are single-domain (beta/alpha)8 barrel proteins, with one (eIGPS) or two (sIGPS) additional helices inserted before the first beta strand []. ; GO: 0004425 indole-3-glycerol-phosphate synthase activity; PDB: 1VC4_A 1PII_A 1JCM_P 1I4N_B 1J5T_A 3TSM_B 4FB7_A 3QJA_A 1JUL_A 2C3Z_A ....
Probab=22.06 E-value=2.2e+02 Score=26.78 Aligned_cols=102 Identities=20% Similarity=0.185 Sum_probs=55.2
Q ss_pred cceEEEEEEeeCCCCc--c----ccchhhhhhccccccceEEEEecCCeEEeCCeeeEEeeeeecC-------CCCC--h
Q psy13819 40 TNRITIFELTRAPDKY--D----VDFVDKISQRRARMSRTFAIDLAGDTFRLNEDPFQFVSGSFHY-------FRAP--P 104 (323)
Q Consensus 40 ~N~i~i~e~~~~~~~~--~----i~~~~~~i~~~~~~~~~~~v~~~~~~~~~dGk~~~i~~g~~Hy-------~r~p--~ 104 (323)
...-||-|.+.. .|. . ..-.+-...|...+..-+.|-.|... |+|++-+ ...| .
T Consensus 46 ~~~~vIAEiKra-SPs~G~i~~~~d~~~~a~~y~~~GA~aiSVlTe~~~----------F~Gs~~dL~~v~~~~~~PvL~ 114 (254)
T PF00218_consen 46 GRISVIAEIKRA-SPSKGDIREDFDPAEIAKAYEEAGAAAISVLTEPKF----------FGGSLEDLRAVRKAVDLPVLR 114 (254)
T ss_dssp SS-EEEEEE-SE-ETTTEESBSS-SHHHHHHHHHHTT-SEEEEE--SCC----------CHHHHHHHHHHHHHSSS-EEE
T ss_pred CCCeEEEEeecC-CCCCCccCccCCHHHHHHHHHhcCCCEEEEECCCCC----------CCCCHHHHHHHHHHhCCCccc
Confidence 345566688763 222 1 22333344555555566666555443 3444332 2233 1
Q ss_pred hcH---HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRW---CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W---~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..+ +-.+...+++|.++|-+-+ ..=..+.|..|++.|++.||-+++..
T Consensus 115 KDFIid~~QI~eA~~~GADaVLLI~------------~~L~~~~l~~l~~~a~~lGle~lVEV 165 (254)
T PF00218_consen 115 KDFIIDPYQIYEARAAGADAVLLIA------------AILSDDQLEELLELAHSLGLEALVEV 165 (254)
T ss_dssp ES---SHHHHHHHHHTT-SEEEEEG------------GGSGHHHHHHHHHHHHHTT-EEEEEE
T ss_pred ccCCCCHHHHHHHHHcCCCEeehhH------------HhCCHHHHHHHHHHHHHcCCCeEEEE
Confidence 122 3457778999999999832 22233458999999999999999987
No 387
>cd06831 PLPDE_III_ODC_like_AZI Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Ornithine Decarboxylase-like Antizyme Inhibitor. Antizyme inhibitor (AZI) is homologous to the fold type III PLP-dependent enzyme ODC but does not retain any decarboxylase activity. Like ODC, AZI is presumed to exist as a homodimer. Antizyme is a regulatory protein that binds directly to the ODC monomer to block its active site, leading to its degradation by the 26S proteasome. AZI binds to Antizyme with a higher affinity than ODC, preventing the formation of the Antizyme-ODC complex. Thus, AZI blocks the ability of Antizyme to promote ODC degradation, which leads to increased ODC enzymatic activity and polyamine levels. AZI also prevents the degradation of other proteins regulated by Antizyme, such as cyclin D1.
Probab=22.03 E-value=1.6e+02 Score=29.38 Aligned_cols=67 Identities=16% Similarity=0.245 Sum_probs=45.0
Q ss_pred CCCChhcHHHHHHHHHHhCCCEEEEEEe--CCCcccCCCeecccccchHHHHHHHHHHCCCe-EEEccCCccccccCCCC
Q psy13819 100 FRAPPGRWCWIMRAMRAAGLNALSTYVE--WRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLY-ILLRPGPFICGKRDFGG 176 (323)
Q Consensus 100 ~r~p~~~W~~~l~~mk~~G~N~Vr~yv~--W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~-vilr~GPyi~aEw~~gg 176 (323)
|=++++.+.+.++.+++.|++.+-+.+. .....++ .|.- ..++..++++++.+.|+. -+|..| ||
T Consensus 145 FGi~~~~~~~~l~~~~~~~l~~~Gih~HiGS~~~~~~--~~~~-a~~~~~~~~~~~~~~g~~l~~ldiG---------GG 212 (394)
T cd06831 145 FGTTLKNCRHLLECAKELDVQIVGVKFHVSSSCKEYQ--TYVH-ALSDARCVFDMAEEFGFKMNMLDIG---------GG 212 (394)
T ss_pred CCCCHHHHHHHHHHHHHCCCeEEEEEEECCCCCCCHH--HHHH-HHHHHHHHHHHHHHCCCCCCEEEeC---------CC
Confidence 4467889999999999999998877443 3332222 2221 122346678889998875 357785 88
Q ss_pred Cc
Q psy13819 177 FP 178 (323)
Q Consensus 177 ~P 178 (323)
+|
T Consensus 213 f~ 214 (394)
T cd06831 213 FT 214 (394)
T ss_pred cC
Confidence 87
No 388
>PRK06920 dnaE DNA polymerase III DnaE; Reviewed
Probab=21.96 E-value=1.6e+02 Score=33.76 Aligned_cols=51 Identities=8% Similarity=0.046 Sum_probs=39.1
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFI 168 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi 168 (323)
..-++-++++|++|..+|.+- |.....-..+|.+.|+++|++.|+..=-|+
T Consensus 19 ~~i~~lv~~A~~~g~~alAlT-------------Dh~~m~Ga~~F~~~a~~~gIkPIiG~e~~v 69 (1107)
T PRK06920 19 CKIDELVVRAKELGYSSLAIT-------------DENVMYGVIPFYKACKKHGIHPIIGLTASI 69 (1107)
T ss_pred CCHHHHHHHHHHCCCCEEEEE-------------eCChHhHHHHHHHHHHHcCCCEeeeeEEEE
Confidence 345888999999999999993 333333478999999999999998653333
No 389
>PRK03204 haloalkane dehalogenase; Provisional
Probab=21.94 E-value=2.3e+02 Score=26.19 Aligned_cols=67 Identities=12% Similarity=0.086 Sum_probs=36.1
Q ss_pred CCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcc--cCCCeecccccchHHHHHHHHHHCCCe
Q psy13819 86 NEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHE--VHPGHYHYDGHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hE--P~~G~fdf~g~~dl~~fl~~a~e~GL~ 159 (323)
.|.++.+++|.- .....|+..+..+++ ++++|.+... .|- ..|..++++-.+-.+.+.+++++.|+.
T Consensus 33 ~~~~iv~lHG~~----~~~~~~~~~~~~l~~-~~~vi~~D~~--G~G~S~~~~~~~~~~~~~~~~~~~~~~~~~~~ 101 (286)
T PRK03204 33 TGPPILLCHGNP----TWSFLYRDIIVALRD-RFRCVAPDYL--GFGLSERPSGFGYQIDEHARVIGEFVDHLGLD 101 (286)
T ss_pred CCCEEEEECCCC----ccHHHHHHHHHHHhC-CcEEEEECCC--CCCCCCCCCccccCHHHHHHHHHHHHHHhCCC
Confidence 367888887732 344678877777754 5888877333 332 222233332222233344556666764
No 390
>PF06230 DUF1009: Protein of unknown function (DUF1009); InterPro: IPR010415 This is a family of uncharacterised bacterial proteins.
Probab=21.81 E-value=2.1e+02 Score=26.41 Aligned_cols=42 Identities=24% Similarity=0.266 Sum_probs=32.0
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEE
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYIL 161 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vi 161 (323)
.+-++.|+++|+.++-+. .|. +-.-|-++++++|+++||.++
T Consensus 172 ~~Tv~~~~~ag~~~lave---------Ag~---tl~ld~~~~i~~Ad~~gi~i~ 213 (214)
T PF06230_consen 172 PDTVENAAEAGLAGLAVE---------AGK---TLILDREEVIALADKAGIFIV 213 (214)
T ss_pred HHHHHHHHHcCCeEEEEe---------cCc---EEEecHHHHHHHHHHcCCEEe
Confidence 467899999999999882 111 123456888999999999875
No 391
>cd03334 Fab1_TCP TCP-1 like domain of the eukaryotic phosphatidylinositol 3-phosphate (PtdIns3P) 5-kinase Fab1. Fab1p is important for vacuole size regulation, presumably by modulating PtdIns(3,5)P2 effector activity. In the human homolog p235/PIKfyve deletion of this domain leads to loss of catalytic activity. However no exact function this domain has been defined. In general, chaperonins are involved in productive folding of proteins.
Probab=21.73 E-value=2.9e+02 Score=25.79 Aligned_cols=61 Identities=13% Similarity=0.143 Sum_probs=44.0
Q ss_pred CeeeEEeeeeecCCCCC-------------hhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHH
Q psy13819 87 EDPFQFVSGSFHYFRAP-------------PGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLA 153 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p-------------~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a 153 (323)
.-++.++..++-|.+.. .++.++.++++++.|.|.|=+ .++. ++ ..++.+
T Consensus 86 n~kIlll~~~Le~~~~~~~~~~~~~~~~~E~~~l~~~v~kI~~~g~nvIl~--~k~I-------------~~--~a~~~l 148 (261)
T cd03334 86 NPRILLLQGPLEYQRVENKLLSLDPVILQEKEYLKNLVSRIVALRPDVILV--EKSV-------------SR--IAQDLL 148 (261)
T ss_pred CCcEEEEeeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHhcCCCEEEE--CCcc-------------CH--HHHHHH
Confidence 34677777888777643 456678899999999999865 2322 22 247889
Q ss_pred HHCCCeEEEcc
Q psy13819 154 VEEDLYILLRP 164 (323)
Q Consensus 154 ~e~GL~vilr~ 164 (323)
.++|+.++-+.
T Consensus 149 ~k~gI~~v~~v 159 (261)
T cd03334 149 LEAGITLVLNV 159 (261)
T ss_pred HHCCCEEEEec
Confidence 99999988777
No 392
>PF15631 Imm-NTF2-2: NTF2 fold immunity protein
Probab=21.54 E-value=55 Score=24.56 Aligned_cols=30 Identities=20% Similarity=0.507 Sum_probs=23.8
Q ss_pred cccceEEEEecCCeEEeCCe-eeEEeeeeec
Q psy13819 69 RMSRTFAIDLAGDTFRLNED-PFQFVSGSFH 98 (323)
Q Consensus 69 ~~~~~~~v~~~~~~~~~dGk-~~~i~~g~~H 98 (323)
...++|.+..+++.+.++|. |-..+||.+|
T Consensus 19 ~~ekP~~v~~~~~~WiV~Gtl~~~~~GGv~~ 49 (66)
T PF15631_consen 19 EEEKPYRVTLDGDSWIVEGTLPPGMLGGVFY 49 (66)
T ss_pred hhcCCeEEecCCCeEEEEeecCCCccCCeEE
Confidence 35677888888999999997 6677888776
No 393
>TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases. Methanogens, then should and aparrently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea which lack a threonine dehydratase (mainly Euryarchaeota) should contain both a CimA and a LeuA gene. This is true of, for example, archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in TIGR00973 and one in TIGRT00977 which may fulfill these roles. Other species which have only one hit to this model and lack threonine dehydratase are very likely LeuA enzymes.
Probab=21.43 E-value=1.2e+02 Score=29.91 Aligned_cols=56 Identities=9% Similarity=0.086 Sum_probs=39.1
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc---cchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG---HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g---~~dl~~fl~~a~e~GL~vilr~ 164 (323)
+++++++.++|+..|+++++-+..--+ -++..+. .+.+.+.++.|+++|+.|.+.+
T Consensus 74 ~~di~~a~~~g~~~i~i~~~~Sd~~~~-~~~~~~~~~~~~~~~~~i~~ak~~G~~v~~~~ 132 (363)
T TIGR02090 74 KKDIDKAIDCGVDSIHTFIATSPIHLK-YKLKKSRDEVLEKAVEAVEYAKEHGLIVEFSA 132 (363)
T ss_pred HHHHHHHHHcCcCEEEEEEcCCHHHHH-HHhCCCHHHHHHHHHHHHHHHHHcCCEEEEEE
Confidence 678999999999999998876532111 1122221 2456678899999999987654
No 394
>PRK06582 coproporphyrinogen III oxidase; Provisional
Probab=21.32 E-value=2.7e+02 Score=27.65 Aligned_cols=53 Identities=15% Similarity=0.074 Sum_probs=34.1
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc----cchHHHHHHHHHHCCCeEEEcc
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG----HRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g----~~dl~~fl~~a~e~GL~vilr~ 164 (323)
++.|+.+|++|+|-|++.|- --.+.+..+-+ .++..+.++.|++.+..|-+..
T Consensus 111 ~e~l~~l~~~GvnRiSiGvQ----S~~d~~L~~lgR~h~~~~~~~ai~~~~~~~~~v~~Dl 167 (390)
T PRK06582 111 TEKFKAFKLAGINRVSIGVQ----SLKEDDLKKLGRTHDCMQAIKTIEAANTIFPRVSFDL 167 (390)
T ss_pred HHHHHHHHHCCCCEEEEECC----cCCHHHHHHcCCCCCHHHHHHHHHHHHHhCCcEEEEe
Confidence 58899999999999999432 11122222222 2456667888888876666654
No 395
>TIGR00538 hemN oxygen-independent coproporphyrinogen III oxidase. This model represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. Several species, including E. coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity. Members of this family have a perfectly conserved motif PYRT[SC]YP in a region N-terminal to the region of homology with the related uncharacterized protein.
Probab=21.32 E-value=95 Score=31.39 Aligned_cols=47 Identities=21% Similarity=0.143 Sum_probs=31.6
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccC-----CCeecccccchHHHHHHHHHHCCCe
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVH-----PGHYHYDGHRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~-----~G~fdf~g~~dl~~fl~~a~e~GL~ 159 (323)
++.++.||++|+|.|.+.|- ...++ .+..++ +++.+.++.+++.|+.
T Consensus 151 ~e~l~~lk~~G~~risiGvq--S~~~~~l~~l~r~~~~---~~~~~ai~~l~~~G~~ 202 (455)
T TIGR00538 151 KDVIDALRDEGFNRLSFGVQ--DFNKEVQQAVNRIQPE---EMIFELMNHAREAGFT 202 (455)
T ss_pred HHHHHHHHHcCCCEEEEcCC--CCCHHHHHHhCCCCCH---HHHHHHHHHHHhcCCC
Confidence 57899999999999988432 11111 111222 3577889999999985
No 396
>TIGR02690 resist_ArsH arsenical resistance protein ArsH. Members of this protein family occur in arsenate resistance operons that include at least two different types of arsenate reductase. ArsH is not required for arsenate resistance in some systems. This family belongs to the larger family of NADPH-dependent FMN reductases (Pfam model pfam03358). The function of ArsH is not known.
Probab=21.30 E-value=6.4e+02 Score=23.10 Aligned_cols=80 Identities=13% Similarity=0.013 Sum_probs=54.2
Q ss_pred eeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCcc
Q psy13819 89 PFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFI 168 (323)
Q Consensus 89 ~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi 168 (323)
++.+++|+.-..+.....-+...+.+.+.|+.+-.+.+. . -|-+. .|.+....+.++.+..++.+-.+|+.|
T Consensus 28 kI~~I~GSlR~~S~n~~la~~~~~~~~~~g~~v~~idl~-~--lPl~~-~d~~~~p~v~~l~~~v~~ADgvii~TP---- 99 (219)
T TIGR02690 28 RILLLYGSLRERSYSRLLAEEAARLLGCEGRETRIFDPP-G--LPLPD-AAHADHPKVRELRQLSEWSEGQVWCSP---- 99 (219)
T ss_pred EEEEEECCCCCcchHHHHHHHHHHHHhhcCCEEEEeCcc-c--CCCCC-cCcccCHHHHHHHHHHHhCCEEEEeCC----
Confidence 467799999988888777777777777778876666542 1 12111 111123367888899999999999999
Q ss_pred ccccCCCCCcc
Q psy13819 169 CGKRDFGGFPP 179 (323)
Q Consensus 169 ~aEw~~gg~P~ 179 (323)
| .++++|.
T Consensus 100 --E-Yn~sipg 107 (219)
T TIGR02690 100 --E-RHGAITG 107 (219)
T ss_pred --c-cccCcCH
Confidence 4 3567764
No 397
>PRK05799 coproporphyrinogen III oxidase; Provisional
Probab=21.22 E-value=1.6e+02 Score=28.77 Aligned_cols=48 Identities=17% Similarity=0.211 Sum_probs=32.1
Q ss_pred HHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccc----cchHHHHHHHHHHCCCe
Q psy13819 108 CWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDG----HRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 108 ~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g----~~dl~~fl~~a~e~GL~ 159 (323)
++.++.++++|+|.|++.|. ...++ ..+.-+ .+++.+.++.+++.|+.
T Consensus 99 ~e~l~~l~~~G~~rvsiGvq--S~~d~--~L~~l~R~~~~~~~~~ai~~l~~~g~~ 150 (374)
T PRK05799 99 EEKLKILKSMGVNRLSIGLQ--AWQNS--LLKYLGRIHTFEEFLENYKLARKLGFN 150 (374)
T ss_pred HHHHHHHHHcCCCEEEEECc--cCCHH--HHHHcCCCCCHHHHHHHHHHHHHcCCC
Confidence 46899999999998888552 22221 111111 24677889999999985
No 398
>PRK08564 5'-methylthioadenosine phosphorylase II; Reviewed
Probab=21.12 E-value=1.4e+02 Score=28.21 Aligned_cols=41 Identities=12% Similarity=0.146 Sum_probs=29.1
Q ss_pred EeCCeeeEEe--eeeecCCCCChhcHHHHHHHHHHhCCCEEEE
Q psy13819 84 RLNEDPFQFV--SGSFHYFRAPPGRWCWIMRAMRAAGLNALST 124 (323)
Q Consensus 84 ~~dGk~~~i~--~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~ 124 (323)
.++|+++.++ .|-.|+.--+.-.-+.-+..||++|+.+|=.
T Consensus 47 ~l~g~~v~~l~RhGr~H~y~~~~i~~~a~i~aLk~LGvk~iI~ 89 (267)
T PRK08564 47 EIEGVEVAFLPRHGRGHRIPPHKINYRANIWALKELGVEWVIA 89 (267)
T ss_pred EECCEEEEEEeCCCCCcccCCccCcchHHHHHHHHCCCcEEEE
Confidence 4799999999 8999963211111356678899999987754
No 399
>PRK07279 dnaE DNA polymerase III DnaE; Reviewed
Probab=21.04 E-value=1.8e+02 Score=32.96 Aligned_cols=47 Identities=15% Similarity=0.112 Sum_probs=37.1
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
-.-++-++++|+.|..+|.+- |.....-..+|.+.|+++||++|+..
T Consensus 18 ~~i~~lv~~A~~~g~~alAlT-------------D~~~m~Ga~~F~~~a~~~gIkPIiG~ 64 (1034)
T PRK07279 18 IDLEKYVERAKELGYQTIGIM-------------DKDNLYGAYHFIEGAQKNGLQPILGL 64 (1034)
T ss_pred CCHHHHHHHHHHCCCCEEEEE-------------cCCccccHHHHHHHHHHcCCcEEEEE
Confidence 345788999999999999983 22223337899999999999999864
No 400
>cd06525 GH25_Lyc-like Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene. Lyc has a glycosyl hydrolase family 25 (GH25) catalytic domain. Endo-N-acetylmuramidases are lysozymes (also referred to as peptidoglycan hydrolases) that degrade bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues.
Probab=21.02 E-value=54 Score=28.72 Aligned_cols=88 Identities=11% Similarity=0.160 Sum_probs=50.2
Q ss_pred eeeecCCCC---ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHC-CCeEEEccCCccc
Q psy13819 94 SGSFHYFRA---PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEE-DLYILLRPGPFIC 169 (323)
Q Consensus 94 ~g~~Hy~r~---p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~-GL~vilr~GPyi~ 169 (323)
-|..||++- +.++-+.-++.++..+.+.. .|-..|...+.=.-+..+.+.+|++.++++ |-+++|=.++...
T Consensus 54 ~G~Yhy~~~~~~a~~qA~~f~~~~~~~~~~~~----~~lD~E~~~~~~~~~~~~~~~~f~~~v~~~~G~~~~iY~~~~~~ 129 (184)
T cd06525 54 VGFYHFLVGTSNPEEQAENFYNTIKGKKMDLK----PALDVEVNFGLSKDELNDYVLRFIEEFEKLSGLKVGIYTYTSFI 129 (184)
T ss_pred eEEEEEeeCCCCHHHHHHHHHHhccccCCCCC----eEEEEecCCCCCHHHHHHHHHHHHHHHHHHHCCCeEEEecHHHH
Confidence 477888763 34455666677776654421 122333332210011235678999999998 9999987776433
Q ss_pred cc-c--CCCCCcccccccCC
Q psy13819 170 GK-R--DFGGFPPWLLKVAP 186 (323)
Q Consensus 170 aE-w--~~gg~P~Wl~~~~p 186 (323)
.. + .....|.|+.+ ++
T Consensus 130 ~~~~~~~~~~~~lWiA~-Y~ 148 (184)
T cd06525 130 NNNLDSRLSSYPLWIAN-YG 148 (184)
T ss_pred HHhccccccCCCeEEEe-cc
Confidence 22 1 12346788876 44
No 401
>PF02548 Pantoate_transf: Ketopantoate hydroxymethyltransferase; InterPro: IPR003700 The panB gene from Escherichia coli encodes the first enzyme of the pantothenate biosynthesis pathway, ketopantoate hydroxymethyltransferase (KPHMT) 2.1.2.11 from EC. Fungal ketopantoate hydroxymethyltransferase is essential for the biosynthesis of coenzyme A, while the pathway intermediate 4'-phosphopantetheine is required for penicillin production [].; GO: 0003864 3-methyl-2-oxobutanoate hydroxymethyltransferase activity, 0015940 pantothenate biosynthetic process; PDB: 3VAV_G 1M3U_A 3EZ4_J 1O68_C 1O66_A 1OY0_D.
Probab=20.93 E-value=3.8e+02 Score=25.50 Aligned_cols=56 Identities=20% Similarity=0.222 Sum_probs=39.6
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCCccccccCCCCCcccc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGPFICGKRDFGGFPPWL 181 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GPyi~aEw~~gg~P~Wl 181 (323)
.+.+.-+.-.+.||+.|.++|.+. |..+....++..-+.|+-|+-..| =.|...
T Consensus 92 s~e~av~nA~rl~ke~GadaVKlE----------------Gg~~~~~~i~~l~~~GIPV~gHiG----------LtPQ~~ 145 (261)
T PF02548_consen 92 SPEQAVRNAGRLMKEAGADAVKLE----------------GGAEIAETIKALVDAGIPVMGHIG----------LTPQSV 145 (261)
T ss_dssp SHHHHHHHHHHHHHTTT-SEEEEE----------------BSGGGHHHHHHHHHTT--EEEEEE----------S-GGGH
T ss_pred CHHHHHHHHHHHHHhcCCCEEEec----------------cchhHHHHHHHHHHCCCcEEEEec----------Cchhhe
Confidence 446678889999999999999992 334466779999999999997663 357766
Q ss_pred cc
Q psy13819 182 LK 183 (323)
Q Consensus 182 ~~ 183 (323)
..
T Consensus 146 ~~ 147 (261)
T PF02548_consen 146 HQ 147 (261)
T ss_dssp HH
T ss_pred ec
Confidence 54
No 402
>PRK09532 DNA polymerase III subunit alpha; Reviewed
Probab=20.91 E-value=1.9e+02 Score=32.11 Aligned_cols=46 Identities=11% Similarity=0.145 Sum_probs=35.6
Q ss_pred cHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 106 RWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 106 ~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.-++.++++++.|+++|.+- .|.. ...+.+|.+.|++.||++|+..
T Consensus 20 ~~~elv~~A~~~G~~aiAiT----Dh~~---------~~g~~~f~~~~~~~gik~I~G~ 65 (874)
T PRK09532 20 QLPALVDRAIELGMPAIALT----DHGV---------MYGAIELLKVCRNKGIKPIIGN 65 (874)
T ss_pred CHHHHHHHHHHCCCCEEEEe----cCCC---------hhhHHHHHHHHHHcCCeEEEEE
Confidence 34688999999999999993 2322 2336788999999999999753
No 403
>PF11340 DUF3142: Protein of unknown function (DUF3142); InterPro: IPR021488 This bacterial family of proteins has no known function.
Probab=20.75 E-value=3.6e+02 Score=24.24 Aligned_cols=57 Identities=16% Similarity=0.221 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHhhhccCCCcEEEEccCCCcCCCCCChHHHHHHHHHHHHHhcCCceEEEe
Q psy13819 202 TRWFQELFPRIQKYLYGNDRPIILVQVENEYGSDAECDPAHAVWLRDLLRTYVQDKAVLYS 262 (323)
Q Consensus 202 ~~~~~~l~~~i~~~~~~n~g~Vi~~QieNE~g~~~~~~~~Y~~~l~~~~~~~~g~~~~l~t 262 (323)
+.-+.+++..++.++. .|-+|.++||+-...+. .-.+|...|+++ |+.+..+..+-.
T Consensus 23 ~~~~~~i~~~l~~W~~-~G~~v~giQIDfDa~t~--~L~~Y~~fL~~L-R~~LP~~~~LSI 79 (181)
T PF11340_consen 23 EQVLARILQLLQRWQA-AGNNVAGIQIDFDAATS--RLPAYAQFLQQL-RQRLPPDYRLSI 79 (181)
T ss_pred HHHHHHHHHHHHHHHH-cCCCceEEEEecCcccc--chHHHHHHHHHH-HHhCCCCceEee
Confidence 3445556666666654 45689999998887764 246788888777 443555554433
No 404
>PLN02880 tyrosine decarboxylase
Probab=20.71 E-value=2.2e+02 Score=29.25 Aligned_cols=54 Identities=9% Similarity=0.074 Sum_probs=38.1
Q ss_pred ChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 103 PPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 103 p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
.++.-++.|++.++.|..-+-+ =-.-|..+....++++++.++|+++|+++.|.
T Consensus 222 d~~~L~~~i~~~~~~g~~p~~v-------vataGTT~~GaiDpl~eI~~i~~~~~iwlHVD 275 (490)
T PLN02880 222 APELLSEAISTDLSSGLIPFFL-------CATVGTTSSTAVDPLLELGKIAKSNGMWFHVD 275 (490)
T ss_pred CHHHHHHHHHHHHHCCCccEEE-------EEecCCCcCcccCcHHHHHHHHHHcCCEEEEe
Confidence 3444456666666677543322 12247778888899999999999999999886
No 405
>PF04476 DUF556: Protein of unknown function (DUF556); InterPro: IPR007565 The proteins in this entry are functionally uncharacterised.
Probab=20.58 E-value=1.8e+02 Score=27.17 Aligned_cols=49 Identities=12% Similarity=0.145 Sum_probs=37.1
Q ss_pred HHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 111 MRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 111 l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
++.++++|+..+=+-- .+--...-|||-....|.+|++.|+++||.+=|
T Consensus 137 ~~~a~~aG~~gvMlDT---a~Kdg~~L~d~~~~~~L~~Fv~~ar~~gL~~aL 185 (235)
T PF04476_consen 137 PEIAAEAGFDGVMLDT---ADKDGGSLFDHLSEEELAEFVAQARAHGLMCAL 185 (235)
T ss_pred HHHHHHcCCCEEEEec---ccCCCCchhhcCCHHHHHHHHHHHHHccchhhc
Confidence 5677889998876631 222335678888888999999999999997654
No 406
>cd01137 PsaA Metal binding protein PsaA. These proteins have been shown to function as initial receptors in ABC transport of Mn2+ and as surface adhesins in some eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=20.53 E-value=1.6e+02 Score=27.73 Aligned_cols=55 Identities=15% Similarity=0.238 Sum_probs=36.7
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEE
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILL 162 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vil 162 (323)
++++...+-.+.| .+++.|++.+.+.-.=+..||. -.+|.+++++++++|+.+|+
T Consensus 177 ~~~~v~~H~af~Y-------------~~~~yGl~~~~~~~~~~~~eps--------~~~l~~l~~~ik~~~v~~if 231 (287)
T cd01137 177 KRKLVTSEGAFSY-------------FAKAYGLKEAYLWPINTEEEGT--------PKQVATLIEQVKKEKVPAVF 231 (287)
T ss_pred cCEEEEecccHHH-------------HHHHcCCeEeecccCCCCCCCC--------HHHHHHHHHHHHHhCCCEEE
Confidence 4556666666666 4678899987652110222332 56799999999999999876
No 407
>COG3623 SgaU Putative L-xylulose-5-phosphate 3-epimerase [Carbohydrate transport and metabolism]
Probab=20.48 E-value=2.1e+02 Score=27.11 Aligned_cols=56 Identities=7% Similarity=0.166 Sum_probs=36.7
Q ss_pred hcHHHHHHHHHHhCCCEEEE---EEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEc
Q psy13819 105 GRWCWIMRAMRAAGLNALST---YVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLR 163 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~---yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr 163 (323)
+-+++.+++++.+|+.||++ .|.-.-+.++--+ .| ...+..-.++|+++++..-+.
T Consensus 96 eiM~KaI~LA~dLGIRtIQLAGYDVYYE~~d~eT~~-rF--i~g~~~a~~lA~~aqV~lAvE 154 (287)
T COG3623 96 EIMEKAIQLAQDLGIRTIQLAGYDVYYEEADEETRQ-RF--IEGLKWAVELAARAQVMLAVE 154 (287)
T ss_pred HHHHHHHHHHHHhCceeEeeccceeeeccCCHHHHH-HH--HHHHHHHHHHHHhhccEEEee
Confidence 45788899999999999998 3443333332111 12 234677788999998875443
No 408
>TIGR01752 flav_long flavodoxin, long chain. Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They can act in nitrogen fixation by nitrogenase, in sulfite reduction, and light-dependent NADP+ reduction in during photosynthesis, among other roles. This model describes the long chain type, typical for nitrogen fixation but associated with pyruvate formate-lyase activation and cobalamin-dependent methionine synthase activity in E. coli.
Probab=20.46 E-value=5.4e+02 Score=21.93 Aligned_cols=116 Identities=7% Similarity=0.033 Sum_probs=64.1
Q ss_pred CeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc--cccchHHHHHHHHHHCCCeEEEcc
Q psy13819 87 EDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY--DGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 87 Gk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf--~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
.-...+++...|....+|..|.+-++.++...++...+.++=.. +...+ ...+-+..+-+++++.|..+|-..
T Consensus 44 ~~d~ii~gspty~~g~~p~~~~~fl~~l~~~~l~gk~v~~fg~g-----~~~~y~~~f~~a~~~l~~~l~~~G~~~ig~~ 118 (167)
T TIGR01752 44 AYDKLILGTPTWGVGELQEDWEDFLPTLEELDFTGKTVALFGLG-----DQEGYSETFCDGMGILYDKIKARGAKVVGFW 118 (167)
T ss_pred hCCEEEEEecCCCCCcCcHHHHHHHHHhhcCCCCCCEEEEEecC-----CCCcccHHHHHHHHHHHHHHHHcCCeEEcee
Confidence 44567788888876766788988888887766665555443111 11111 111224566777888999876543
Q ss_pred CCccccccCCCCCccccccc--CCCeee-cCCCHhH-HHHHHHHHHHHHHH
Q psy13819 165 GPFICGKRDFGGFPPWLLKV--APDILL-RQNHPVY-QRYVTRWFQELFPR 211 (323)
Q Consensus 165 GPyi~aEw~~gg~P~Wl~~~--~p~~~~-R~~~~~f-~~~~~~~~~~l~~~ 211 (323)
.|-.+++.+.+. +... .-++.+ ..+.+.. -++.++|+++|.+.
T Consensus 119 ---~~~gy~~~~~~~-~~~~~~f~gl~~~~~~~~~~~~~r~~~w~~~~~~~ 165 (167)
T TIGR01752 119 ---PTDGYHFEASKA-VRDGDKFVGLALDEDNQPDLTEERIEKWVEQIKPE 165 (167)
T ss_pred ---cCCCcccccchh-eeCCCEEEEEEecCCCchhhhHHHHHHHHHHHHHh
Confidence 344455444443 2110 001111 1333444 46778888877654
No 409
>PF00326 Peptidase_S9: Prolyl oligopeptidase family This family belongs to family S9 of the peptidase classification.; InterPro: IPR001375 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain covers the active site serine of the serine peptidases belonging to MEROPS peptidase family S9 (prolyl oligopeptidase family, clan SC). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Examples of protein families containing this domain are: Prolyl endopeptidase (3.4.21.26 from EC) (PE) (also called post-proline cleaving enzyme). PE is an enzyme that cleaves peptide bonds on the C-terminal side of prolyl residues. The sequence of PE has been obtained from a mammalian species (pig) and from bacteria (Flavobacterium meningosepticum and Aeromonas hydrophila); there is a high degree of sequence conservation between these sequences. Escherichia coli protease II (3.4.21.83 from EC) (oligopeptidase B) (gene prtB) which cleaves peptide bonds on the C-terminal side of lysyl and argininyl residues. Dipeptidyl peptidase IV (3.4.14.5 from EC) (DPP IV). DPP IV is an enzyme that removes N-terminal dipeptides sequentially from polypeptides having unsubstituted N-termini provided that the penultimate residue is proline. Saccharomyces cerevisiae (Baker's yeast) vacuolar dipeptidyl aminopeptidases A and B (DPAP A and DPAP B), encoded by the STE13 and DAP2 genes respectively. DPAP A is responsible for the proteolytic maturation of the alpha-factor precursor. Acylamino-acid-releasing enzyme (3.4.19.1 from EC) (acyl-peptide hydrolase). This enzyme catalyses the hydrolysis of the amino-terminal peptide bond of an N-acetylated protein to generate a N-acetylated amino acid and a protein with a free amino-terminus. These proteins belong to MEROPS peptidase families S9A, S9B and S9C.; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 2AJ8_D 1ORV_D 2AJB_C 2BUC_D 1ORW_D 2AJC_D 2AJD_C 2BUA_A 2HU8_B 3O4J_B ....
Probab=20.45 E-value=2.3e+02 Score=24.68 Aligned_cols=65 Identities=20% Similarity=0.211 Sum_probs=42.8
Q ss_pred CCeeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHH
Q psy13819 86 NEDPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQ 151 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~ 151 (323)
...|++|+.|+-- .++|++.=.+-.+.+++.|.++.-..++=..|.....+..-.....+.+|++
T Consensus 143 ~~~P~li~hG~~D-~~Vp~~~s~~~~~~L~~~g~~~~~~~~p~~gH~~~~~~~~~~~~~~~~~f~~ 207 (213)
T PF00326_consen 143 IKPPVLIIHGEND-PRVPPSQSLRLYNALRKAGKPVELLIFPGEGHGFGNPENRRDWYERILDFFD 207 (213)
T ss_dssp GGSEEEEEEETTB-SSSTTHHHHHHHHHHHHTTSSEEEEEETT-SSSTTSHHHHHHHHHHHHHHHH
T ss_pred CCCCEEEEccCCC-CccCHHHHHHHHHHHHhcCCCEEEEEcCcCCCCCCCchhHHHHHHHHHHHHH
Confidence 4578888988644 4789898899999999999995444666667744433333334344444444
No 410
>cd04909 ACT_PDH-BS C-terminal ACT domain of the monofunctional, NAD dependent, prephenate dehydrogenase (PDH). The C-terminal ACT domain of the monofunctional, NAD dependent, prephenate dehydrogenase (PDH) enzyme that catalyzes the formation of 4-hydroxyphenylpyruvate from prephenate, found in Bacillus subtilis (BS) and other Firmicutes, Deinococci, and Bacteroidetes. PDH is the first enzyme in the aromatic amino acid pathway specific for the biosynthesis of tyrosine. This enzyme is feedback-inhibited by tyrosine in B. subtilis and other microorganisms. Both phenylalanine and tryptophan have been shown to be inhibitors of this activity in B. subtilis. Bifunctional chorismate mutase-PDH (TyrA) enzymes such as those seen in Escherichia coli do not contain an ACT domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=20.44 E-value=2.3e+02 Score=19.96 Aligned_cols=55 Identities=13% Similarity=0.084 Sum_probs=37.2
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecc--cccchHHHHHHHHHHCCCeE
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHY--DGHRDIEHFLQLAVEEDLYI 160 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf--~g~~dl~~fl~~a~e~GL~v 160 (323)
..-.+.++.+.+.|+|...++..=.... ..+...+ +...|.+++++..++.|..|
T Consensus 13 G~L~~l~~~l~~~~i~i~~~~~~~~~~~-~~~~~~i~v~~~~~~~~~~~~L~~~G~~v 69 (69)
T cd04909 13 GVIAEVTQILGDAGISIKNIEILEIREG-IGGILRISFKTQEDRERAKEILKEAGYEV 69 (69)
T ss_pred CHHHHHHHHHHHcCCCceeeEeEEeecC-CcEEEEEEECCHHHHHHHHHHHHHcCCcC
Confidence 4566788899999999998863211001 1444444 33357899999999999764
No 411
>TIGR03569 NeuB_NnaB N-acetylneuraminate synthase. This family is a subset of the Pfam model pfam03102 and is believed to include only authentic NeuB N-acetylneuraminate (sialic acid) synthase enzymes. The majority of the genes identified by this model are observed adjacent to both the NeuA and NeuC genes which together effect the biosynthesis of CMP-N-acetylneuraminate from UDP-N-acetylglucosamine.
Probab=20.41 E-value=8.1e+02 Score=23.93 Aligned_cols=26 Identities=15% Similarity=0.056 Sum_probs=21.1
Q ss_pred CCChhcHHHHHHHHHHhCCCEEEEEE
Q psy13819 101 RAPPGRWCWIMRAMRAAGLNALSTYV 126 (323)
Q Consensus 101 r~p~~~W~~~l~~mk~~G~N~Vr~yv 126 (323)
..|.+.++...+..++.|+..+++.+
T Consensus 72 ~l~~e~~~~L~~~~~~~Gi~~~stpf 97 (329)
T TIGR03569 72 ELSEEDHRELKEYCESKGIEFLSTPF 97 (329)
T ss_pred CCCHHHHHHHHHHHHHhCCcEEEEeC
Confidence 46777788888888999999999953
No 412
>cd08551 Fe-ADH iron-containing alcohol dehydrogenases (Fe-ADH)-like. Large metal-containing alcohol dehydrogenases (ADH), known as iron-containing alcohol dehydrogenases. They contain a dehydroquinate synthase-like protein structural fold and mostly contain iron. They are distinct from other alcohol dehydrogenases which contains different protein domains. There are several distinct families of alcohol dehydrogenases: Zinc-containing long-chain alcohol dehydrogenases; insect-type, or short-chain alcohol dehydrogenases; iron-containing alcohol dehydrogenases, and others. The iron-containing family has a Rossmann fold-like topology that resembles the fold of the zinc-dependent alcohol dehydrogenases, but lacks sequence homology, and differs in strand arrangement. ADH catalyzes the reversible oxidation of alcohol to acetaldehyde with the simultaneous reduction of NAD(P)+ to NAD(P)H.
Probab=20.39 E-value=3.8e+02 Score=26.06 Aligned_cols=66 Identities=14% Similarity=0.144 Sum_probs=42.5
Q ss_pred eeeEEeeeeecCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEccCC
Q psy13819 88 DPFQFVSGSFHYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRPGP 166 (323)
Q Consensus 88 k~~~i~~g~~Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~GP 166 (323)
+++.++++..-. + ...+++..+.+++.|++.+.+ ...++.|- .+++.+.++.+++.+..+|+..|-
T Consensus 24 ~~~lvv~~~~~~-~--~~~~~~v~~~L~~~~~~~~~~----~~~~~~p~------~~~v~~~~~~~~~~~~d~IiaiGG 89 (370)
T cd08551 24 RKALIVTDPGLV-K--TGVLDKVIDSLKEAGIEVVIF----DGVEPNPT------LSNVDAAVAAYREEGCDGVIAVGG 89 (370)
T ss_pred CeEEEEeCcchh-h--CccHHHHHHHHHHcCCeEEEE----CCCCCCCC------HHHHHHHHHHHHhcCCCEEEEeCC
Confidence 555566553221 1 245666777778888875432 23444332 456888899999999999998764
No 413
>cd04739 DHOD_like Dihydroorotate dehydrogenase (DHOD) like proteins. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences and cellular location. Members of class 1 are cytosolic enzymes and multimers while class 2 enzymes are membrane associated and monomeric. The class 1 enzymes can be further divided into subtypes 1A and 1B which are homodimers and heterotetrameric proteins, respectively. This subgroup has the conserved FMN binding site, but lacks some catalytic residues and may therefore be inactive.
Probab=20.38 E-value=2.9e+02 Score=26.71 Aligned_cols=64 Identities=17% Similarity=0.290 Sum_probs=40.9
Q ss_pred CChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHC-CCeEEEccCCc
Q psy13819 102 APPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEE-DLYILLRPGPF 167 (323)
Q Consensus 102 ~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~-GL~vilr~GPy 167 (323)
..++.|.+-.++++++|++.|++-+. ...-.++.+..+-.+.+.++++.+.+. .+-|++...|+
T Consensus 109 ~~~~~~~~~a~~~~~~gad~iElN~s--~~~~~~~~~g~~~~~~~~eiv~~v~~~~~iPv~vKl~p~ 173 (325)
T cd04739 109 VSAGGWVDYARQIEEAGADALELNIY--ALPTDPDISGAEVEQRYLDILRAVKSAVTIPVAVKLSPF 173 (325)
T ss_pred CCHHHHHHHHHHHHhcCCCEEEEeCC--CCCCCCCcccchHHHHHHHHHHHHHhccCCCEEEEcCCC
Confidence 34689999999999999999999553 111123444333223355666666553 67777777665
No 414
>PRK05898 dnaE DNA polymerase III DnaE; Validated
Probab=20.36 E-value=1.9e+02 Score=32.51 Aligned_cols=47 Identities=9% Similarity=-0.060 Sum_probs=37.2
Q ss_pred hcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCeEEEcc
Q psy13819 105 GRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLYILLRP 164 (323)
Q Consensus 105 ~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~vilr~ 164 (323)
..-++-++++++.|..+|.+- |.....-..+|.+.|+++||+.|+..
T Consensus 18 ~~~~~lv~~A~e~g~~alALT-------------D~~nl~GaveF~~~ak~~gIkPIiG~ 64 (971)
T PRK05898 18 LSIDDIIKFALDNNQPYVCLT-------------DLNNLYGCIEFYDKAKAHNLIPIIGL 64 (971)
T ss_pred CCHHHHHHHHHHCCCCEEEEE-------------eCCccccHHHHHHHHHHcCCCEEEEE
Confidence 345788999999999999983 33333347899999999999999864
No 415
>PF08194 DIM: DIM protein; InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila []. This entry includes DIMs 1 to 4 and DIM23.
Probab=20.28 E-value=57 Score=21.60 Aligned_cols=11 Identities=27% Similarity=0.480 Sum_probs=9.4
Q ss_pred eeEEEEcCccc
Q psy13819 6 KGVLFANGRHW 16 (323)
Q Consensus 6 kg~~~vng~~l 16 (323)
-|.|.|||.++
T Consensus 23 pG~ViING~C~ 33 (36)
T PF08194_consen 23 PGNVIINGKCI 33 (36)
T ss_pred CCeEEECceee
Confidence 58999999875
No 416
>TIGR03099 dCO2ase_PEP1 pyridoxal-dependent decarboxylase, exosortase system type 1 associated. The sequences in this family contain the pyridoxal binding domain (pfam02784) and C-terminal sheet domain (pfam00278) of a family of Pyridoxal-dependent decarboxylases. Characterized enzymes in this family decarboxylate substrates such as ornithine, diaminopimelate and arginine. The genes of the family modeled here, with the exception of those observed in certain Burkholderia species, are all found in the context of exopolysaccharide biosynthesis loci containing the exosortase/PEP-CTERM protein sorting system. More specifically, these are characteristic of the type 1 exosortase system represented by the Genome Property GenProp0652. The substrate of these enzymes may be a precursor of the carrier or linker which is hypothesized to release the PEP-CTERM protein from the exosortase enzyme. These enzymes are apparently most closely related to the diaminopimelate decarboxylase modeled by TIGR01048
Probab=20.11 E-value=7.3e+02 Score=24.24 Aligned_cols=68 Identities=13% Similarity=0.044 Sum_probs=37.9
Q ss_pred cCCCCChhcHHHHHHHHHHhCCCEEEEEEeCCCcccCCCeecccccchHHHHHHHHHHCCCe-EEEccC
Q psy13819 98 HYFRAPPGRWCWIMRAMRAAGLNALSTYVEWRSHEVHPGHYHYDGHRDIEHFLQLAVEEDLY-ILLRPG 165 (323)
Q Consensus 98 Hy~r~p~~~W~~~l~~mk~~G~N~Vr~yv~W~~hEP~~G~fdf~g~~dl~~fl~~a~e~GL~-vilr~G 165 (323)
+-|=++++.+.+.++++++.|++.+-+.+.=......+..|.-.-.+-++.+.+.+++.|+. -++..|
T Consensus 164 srFGi~~~e~~~~~~~~~~~~l~l~Glh~h~gs~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~idiG 232 (398)
T TIGR03099 164 KQFGIDAEQVPAALAFIKAADLDFQGFHIFAGSQNLNAEAIIEAQAKTLALALRLAESAPAPVRVINIG 232 (398)
T ss_pred CcCCCCHHHHHHHHHHHHhCCCeEEEEEecccccCCCHHHHHHHHHHHHHHHHHHHHHhCCCCCEEEeC
Confidence 33446778899999999999887776643222221222212111112233446667777875 357775
No 417
>PRK01076 L-rhamnose isomerase; Provisional
Probab=20.05 E-value=8.5e+02 Score=24.77 Aligned_cols=104 Identities=12% Similarity=0.079 Sum_probs=64.5
Q ss_pred CCeeeEEeeeeecCCCCC--hhcHHHHHHHHHHh--CCCEEEEEEeCCCcccCCCeecccc--cchHHHHHHHHHHCCCe
Q psy13819 86 NEDPFQFVSGSFHYFRAP--PGRWCWIMRAMRAA--GLNALSTYVEWRSHEVHPGHYHYDG--HRDIEHFLQLAVEEDLY 159 (323)
Q Consensus 86 dGk~~~i~~g~~Hy~r~p--~~~W~~~l~~mk~~--G~N~Vr~yv~W~~hEP~~G~fdf~g--~~dl~~fl~~a~e~GL~ 159 (323)
.|.++-=+.+...|+=.| +++-.+|+.++.++ |-..|.+.++|. |.. ...|.+. -+|..++++.|+++||.
T Consensus 52 ~gt~~gGi~~tgnypG~aR~~~El~~D~~~~~~L~pg~~~vnLH~~y~--~~~-~~vdrd~~~p~~f~~w~~~Ak~~Glg 128 (419)
T PRK01076 52 EGSLTGGIQATGNYPGKARNADELRADLEKALSLIPGKHRLNLHAIYL--ESD-TPVDRDEIEPEHFKNWVEWAKENGLG 128 (419)
T ss_pred CCccccccceecCCCCCCCCHHHHHHHHHHHHHhcCCCCceeeecccc--cCC-CcccccccCcccHHHHHHHHHHcCCC
Confidence 344544455677776443 56667888888886 888999999998 211 1334333 35888999999999996
Q ss_pred EEEccCCccccccCCCCCcccccccCCCeeecCCCHhHHHHHHHH
Q psy13819 160 ILLRPGPFICGKRDFGGFPPWLLKVAPDILLRQNHPVYQRYVTRW 204 (323)
Q Consensus 160 vilr~GPyi~aEw~~gg~P~Wl~~~~p~~~~R~~~~~f~~~~~~~ 204 (323)
.=+.| +....|..- -+..+...|+..++.+-+-
T Consensus 129 lDfNp--------n~Fsh~~~k----~G~SLs~pD~~iR~fwI~H 161 (419)
T PRK01076 129 LDFNP--------TCFSHPLSA----DGFTLSHPDPEIRQFWIEH 161 (419)
T ss_pred cCcCc--------ccCCCcccc----CCCcccCCCHHHHHHHHHH
Confidence 55444 233334321 1234555667766655443
Done!