Query psy14880
Match_columns 497
No_of_seqs 213 out of 1574
Neff 6.6
Searched_HMMs 46136
Date Fri Aug 16 23:11:10 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14880.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14880hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd06562 GH20_HexA_HexB-like Be 100.0 1E-84 2.2E-89 673.4 33.7 339 27-478 1-348 (348)
2 KOG2499|consensus 100.0 1E-83 2.2E-88 660.1 30.3 377 3-497 151-542 (542)
3 cd06563 GH20_chitobiase-like T 100.0 2.9E-80 6.3E-85 642.6 31.0 341 27-467 1-357 (357)
4 cd06570 GH20_chitobiase-like_1 100.0 9E-79 2E-83 618.6 29.9 309 27-467 1-311 (311)
5 cd06569 GH20_Sm-chitobiase-lik 100.0 7.1E-78 1.5E-82 638.4 32.8 360 23-463 1-424 (445)
6 cd06568 GH20_SpHex_like A subg 100.0 3.7E-76 8.1E-81 604.6 29.2 313 27-467 1-329 (329)
7 PF00728 Glyco_hydro_20: Glyco 100.0 1.8E-71 3.9E-76 574.0 17.3 323 27-455 1-350 (351)
8 cd02742 GH20_hexosaminidase Be 100.0 8.1E-69 1.8E-73 545.5 26.5 293 29-454 1-303 (303)
9 cd06564 GH20_DspB_LnbB-like Gl 100.0 1.5E-66 3.2E-71 534.0 28.0 310 28-455 1-326 (326)
10 COG3525 Chb N-acetyl-beta-hexo 100.0 9.6E-65 2.1E-69 536.5 19.8 377 16-485 250-646 (732)
11 cd06565 GH20_GcnA-like Glycosy 100.0 1.4E-46 3.1E-51 381.9 23.8 287 29-454 1-301 (301)
12 KOG2499|consensus 97.5 9.5E-05 2.1E-09 78.3 4.6 45 87-131 234-278 (542)
13 PF02638 DUF187: Glycosyl hydr 95.7 0.19 4.1E-06 51.8 13.5 87 150-244 69-160 (311)
14 cd06562 GH20_HexA_HexB-like Be 95.6 0.0095 2.1E-07 62.3 3.4 42 90-131 57-98 (348)
15 PF10566 Glyco_hydro_97: Glyco 94.8 0.069 1.5E-06 53.9 6.7 71 147-233 69-148 (273)
16 cd06568 GH20_SpHex_like A subg 94.6 0.028 6E-07 58.4 3.5 35 96-130 68-102 (329)
17 cd06570 GH20_chitobiase-like_1 94.6 0.03 6.5E-07 57.7 3.7 34 97-130 62-95 (311)
18 cd06564 GH20_DspB_LnbB-like Gl 94.4 0.03 6.4E-07 58.0 3.2 36 95-130 74-109 (326)
19 PF00728 Glyco_hydro_20: Glyco 94.2 0.025 5.5E-07 58.5 2.2 41 90-130 57-100 (351)
20 cd06563 GH20_chitobiase-like T 94.0 0.042 9E-07 57.7 3.4 34 97-130 80-113 (357)
21 cd02742 GH20_hexosaminidase Be 94.0 0.041 8.9E-07 56.4 3.2 35 96-130 65-99 (303)
22 cd06569 GH20_Sm-chitobiase-lik 93.5 0.054 1.2E-06 58.5 3.2 34 97-130 91-124 (445)
23 cd06565 GH20_GcnA-like Glycosy 89.8 0.21 4.6E-06 51.2 2.5 32 99-130 56-87 (301)
24 COG1649 Uncharacterized protei 87.9 6.9 0.00015 42.0 12.3 156 26-243 44-204 (418)
25 PF13200 DUF4015: Putative gly 84.4 10 0.00022 39.3 11.2 77 150-234 60-136 (316)
26 PRK12568 glycogen branching en 81.2 26 0.00055 40.5 13.7 118 106-236 273-401 (730)
27 PLN02361 alpha-amylase 80.4 4.9 0.00011 43.0 7.3 84 149-233 74-169 (401)
28 PRK14705 glycogen branching en 78.4 29 0.00062 42.4 13.5 120 105-236 768-897 (1224)
29 PRK14706 glycogen branching en 77.1 31 0.00066 39.3 12.7 121 105-236 170-299 (639)
30 PF14871 GHL6: Hypothetical gl 76.7 12 0.00027 33.6 7.7 86 145-236 38-124 (132)
31 PRK05402 glycogen branching en 76.1 77 0.0017 36.7 15.8 123 101-236 263-397 (726)
32 PF02679 ComA: (2R)-phospho-3- 76.0 19 0.00041 36.0 9.4 31 23-56 7-37 (244)
33 TIGR01515 branching_enzym alph 75.7 43 0.00093 37.9 13.4 122 107-236 161-288 (613)
34 PRK12313 glycogen branching en 75.1 78 0.0017 35.9 15.4 123 103-236 170-302 (633)
35 PF02449 Glyco_hydro_42: Beta- 75.0 15 0.00032 38.6 9.0 83 146-237 42-125 (374)
36 COG0296 GlgB 1,4-alpha-glucan 74.2 36 0.00078 38.6 12.0 121 103-236 165-296 (628)
37 PF00128 Alpha-amylase: Alpha 69.1 14 0.00031 36.4 7.0 86 149-235 50-161 (316)
38 TIGR02100 glgX_debranch glycog 68.1 28 0.0006 40.0 9.7 91 143-236 233-334 (688)
39 PLN02960 alpha-amylase 67.0 83 0.0018 37.1 13.1 125 101-236 414-549 (897)
40 PLN02784 alpha-amylase 64.7 21 0.00046 41.7 7.9 30 149-179 566-595 (894)
41 COG3525 Chb N-acetyl-beta-hexo 62.5 5.8 0.00013 44.6 2.8 36 96-131 339-374 (732)
42 smart00642 Aamy Alpha-amylase 62.5 8.5 0.00018 35.9 3.6 30 149-179 68-97 (166)
43 cd06592 GH31_glucosidase_KIAA1 61.5 63 0.0014 33.0 10.1 136 99-248 26-165 (303)
44 TIGR03849 arch_ComA phosphosul 61.1 15 0.00031 36.6 5.0 22 32-56 3-24 (237)
45 PRK09441 cytoplasmic alpha-amy 60.7 27 0.00059 38.1 7.7 30 149-179 79-108 (479)
46 PLN00196 alpha-amylase; Provis 59.7 33 0.00072 37.0 8.0 32 147-179 88-119 (428)
47 cd06594 GH31_glucosidase_YihQ 59.7 29 0.00063 35.8 7.3 22 211-232 131-152 (317)
48 TIGR02402 trehalose_TreZ malto 58.1 1.5E+02 0.0033 32.9 13.1 119 104-236 112-237 (542)
49 PLN03244 alpha-amylase; Provis 56.6 47 0.001 38.7 8.6 77 148-236 438-524 (872)
50 PLN02447 1,4-alpha-glucan-bran 56.6 1.9E+02 0.004 33.8 13.5 116 106-236 254-383 (758)
51 PRK14510 putative bifunctional 53.6 67 0.0015 39.5 9.9 89 143-236 236-335 (1221)
52 cd06593 GH31_xylosidase_YicI Y 53.5 55 0.0012 33.3 8.1 122 100-233 21-147 (308)
53 PRK03705 glycogen debranching 52.7 60 0.0013 37.1 8.9 88 143-236 228-329 (658)
54 TIGR02403 trehalose_treC alpha 51.0 45 0.00098 37.1 7.5 30 149-179 73-102 (543)
55 PRK14459 ribosomal RNA large s 50.9 45 0.00098 35.4 7.1 78 74-175 242-326 (373)
56 TIGR03849 arch_ComA phosphosul 50.3 53 0.0012 32.7 7.0 103 17-173 20-122 (237)
57 cd07201 cPLA2_Grp-IVB-IVD-IVE- 47.4 49 0.0011 36.7 6.8 37 13-50 283-319 (541)
58 COG0134 TrpC Indole-3-glycerol 44.5 39 0.00084 34.0 5.0 73 147-244 139-211 (254)
59 COG0436 Aspartate/tyrosine/aro 44.5 24 0.00052 37.5 3.9 24 146-169 178-201 (393)
60 cd06600 GH31_MGAM-like This fa 43.9 1.5E+02 0.0031 30.6 9.5 22 211-232 125-146 (317)
61 cd00452 KDPG_aldolase KDPG and 42.4 37 0.00079 32.1 4.4 67 154-248 87-153 (190)
62 PLN02607 1-aminocyclopropane-1 40.6 71 0.0015 34.5 6.8 23 146-168 215-237 (447)
63 PRK14465 ribosomal RNA large s 40.3 2.7E+02 0.0058 29.3 10.8 113 14-170 210-324 (342)
64 PF01212 Beta_elim_lyase: Beta 38.9 27 0.0006 35.6 3.1 24 145-168 139-162 (290)
65 PF07555 NAGidase: beta-N-acet 38.5 85 0.0018 32.4 6.6 22 148-169 50-74 (306)
66 COG1168 MalY Bifunctional PLP- 38.3 35 0.00076 36.1 3.8 25 145-169 172-196 (388)
67 KOG0258|consensus 37.8 69 0.0015 34.3 5.8 54 102-170 200-254 (475)
68 PRK13957 indole-3-glycerol-pho 37.8 46 0.00099 33.3 4.4 38 147-184 134-171 (247)
69 TIGR02455 TreS_stutzeri trehal 36.8 38 0.00082 38.5 4.0 31 148-179 127-157 (688)
70 PF01565 FAD_binding_4: FAD bi 36.8 40 0.00087 29.6 3.5 28 149-178 9-36 (139)
71 cd06598 GH31_transferase_CtsZ 36.5 4E+02 0.0088 27.3 11.4 24 210-233 129-152 (317)
72 PRK09357 pyrC dihydroorotase; 35.0 1.2E+02 0.0026 32.1 7.5 93 41-171 156-255 (423)
73 KOG0259|consensus 35.0 37 0.0008 36.1 3.3 34 145-179 213-246 (447)
74 PRK14511 maltooligosyl trehalo 34.9 1.6E+02 0.0036 34.8 8.8 70 100-179 17-96 (879)
75 PRK14582 pgaB outer membrane N 34.6 1.7E+02 0.0036 33.7 8.7 36 211-248 433-468 (671)
76 TIGR02336 1,3-beta-galactosyl- 34.6 4.2E+02 0.0091 30.5 11.5 103 209-341 176-309 (719)
77 PF02065 Melibiase: Melibiase; 34.3 2.1E+02 0.0046 30.6 9.1 128 99-246 54-191 (394)
78 cd06603 GH31_GANC_GANAB_alpha 33.1 1.6E+02 0.0035 30.5 7.8 23 211-233 125-147 (339)
79 KOG0256|consensus 32.9 1.8E+02 0.0039 31.4 7.9 117 39-169 119-264 (471)
80 PF05913 DUF871: Bacterial pro 32.6 80 0.0017 33.3 5.5 55 102-172 13-68 (357)
81 PRK14465 ribosomal RNA large s 32.5 1.7E+02 0.0037 30.7 7.8 74 73-170 215-292 (342)
82 smart00481 POLIIIAc DNA polyme 32.4 66 0.0014 24.7 3.7 46 105-172 17-62 (67)
83 cd06602 GH31_MGAM_SI_GAA This 32.3 3.3E+02 0.0071 28.3 10.0 127 101-236 22-155 (339)
84 COG1809 (2R)-phospho-3-sulfola 32.3 2.6E+02 0.0056 27.8 8.3 101 17-168 7-107 (258)
85 PRK10933 trehalose-6-phosphate 31.8 1.3E+02 0.0027 33.7 7.1 30 149-179 79-108 (551)
86 PF14542 Acetyltransf_CG: GCN5 31.2 53 0.0012 26.5 3.1 23 148-170 39-61 (78)
87 cd06591 GH31_xylosidase_XylS X 31.1 4E+02 0.0086 27.3 10.3 20 40-59 20-39 (319)
88 TIGR02401 trehalose_TreY malto 31.0 2.1E+02 0.0045 33.8 8.8 71 100-180 13-93 (825)
89 cd06604 GH31_glucosidase_II_Ma 29.9 1.9E+02 0.0041 29.9 7.8 121 101-233 22-147 (339)
90 KOG0462|consensus 29.7 43 0.00093 37.3 2.9 32 148-179 91-138 (650)
91 PF03979 Sigma70_r1_1: Sigma-7 29.4 51 0.0011 27.0 2.7 55 103-173 7-61 (82)
92 PRK14457 ribosomal RNA large s 29.2 1E+02 0.0023 32.3 5.6 73 73-169 215-291 (345)
93 TIGR02668 moaA_archaeal probab 27.9 1.6E+02 0.0034 29.7 6.6 86 71-177 9-96 (302)
94 PF00682 HMGL-like: HMGL-like 27.7 1.5E+02 0.0033 28.6 6.3 57 103-178 108-164 (237)
95 cd03319 L-Ala-DL-Glu_epimerase 27.6 1.2E+02 0.0025 31.0 5.6 22 149-170 264-285 (316)
96 PRK14340 (dimethylallyl)adenos 27.6 87 0.0019 34.0 4.8 52 97-166 277-330 (445)
97 PF07071 DUF1341: Protein of u 27.4 76 0.0016 30.9 3.8 23 148-170 186-209 (218)
98 KOG0464|consensus 27.2 53 0.0011 35.4 2.9 30 151-180 74-116 (753)
99 PLN02231 alanine transaminase 27.1 1.7E+02 0.0038 32.4 7.2 24 146-169 286-309 (534)
100 cd07937 DRE_TIM_PC_TC_5S Pyruv 27.0 1.9E+02 0.0041 29.1 6.9 22 151-172 118-139 (275)
101 PF13199 Glyco_hydro_66: Glyco 26.9 3.8E+02 0.0082 30.2 9.7 93 146-244 165-264 (559)
102 TIGR00048 radical SAM enzyme, 26.8 2.2E+02 0.0049 29.9 7.6 75 74-172 219-297 (355)
103 PRK10785 maltodextrin glucosid 26.7 55 0.0012 36.9 3.3 30 149-179 224-253 (598)
104 TIGR02102 pullulan_Gpos pullul 26.7 3.3E+02 0.0071 33.3 9.7 80 150-236 554-635 (1111)
105 PRK14456 ribosomal RNA large s 26.7 1.6E+02 0.0035 31.2 6.6 74 74-170 238-315 (368)
106 PRK14507 putative bifunctional 26.6 2.6E+02 0.0056 35.7 9.1 71 98-178 753-833 (1693)
107 PRK01060 endonuclease IV; Prov 26.3 3.4E+02 0.0074 26.7 8.6 96 31-168 5-106 (281)
108 PRK00278 trpC indole-3-glycero 26.3 1.1E+02 0.0025 30.5 5.1 34 148-181 144-177 (260)
109 PTZ00372 endonuclease 4-like p 26.0 1.4E+02 0.0031 32.1 6.0 25 146-170 366-390 (413)
110 PF03102 NeuB: NeuB family; I 25.8 65 0.0014 32.1 3.2 25 147-171 52-76 (241)
111 PF09508 Lact_bio_phlase: Lact 25.0 5.2E+02 0.011 29.6 10.1 100 211-342 175-307 (716)
112 PF01055 Glyco_hydro_31: Glyco 24.9 2.6E+02 0.0056 29.9 7.9 135 100-248 40-180 (441)
113 cd09013 BphC-JF8_N_like N-term 24.7 68 0.0015 27.1 2.8 32 148-179 71-102 (121)
114 cd07939 DRE_TIM_NifV Streptomy 24.3 2.4E+02 0.0052 27.9 7.0 20 39-58 15-34 (259)
115 COG2984 ABC-type uncharacteriz 24.1 90 0.0019 32.4 3.9 29 142-170 166-194 (322)
116 PRK04296 thymidine kinase; Pro 24.1 80 0.0017 29.8 3.4 25 146-170 88-112 (190)
117 cd00308 enolase_like Enolase-s 23.9 1.8E+02 0.004 28.0 6.0 22 149-170 181-202 (229)
118 PRK14335 (dimethylallyl)adenos 23.9 1.3E+02 0.0028 32.7 5.3 52 97-166 287-340 (455)
119 TIGR00190 thiC thiamine biosyn 23.8 64 0.0014 34.5 2.8 23 154-178 246-268 (423)
120 TIGR03581 EF_0839 conserved hy 23.6 1.6E+02 0.0035 29.0 5.3 22 148-169 186-208 (236)
121 PF00218 IGPS: Indole-3-glycer 23.5 85 0.0018 31.5 3.5 39 146-184 140-178 (254)
122 cd03318 MLE Muconate Lactonizi 23.1 1.6E+02 0.0035 30.7 5.7 28 30-58 189-216 (365)
123 PLN02368 alanine transaminase 23.0 83 0.0018 33.6 3.6 24 146-169 225-248 (407)
124 COG3618 Predicted metal-depend 23.0 8E+02 0.017 25.1 10.3 164 30-248 53-237 (279)
125 PRK14333 (dimethylallyl)adenos 22.8 1.3E+02 0.0029 32.4 5.2 52 97-166 284-337 (448)
126 PRK15452 putative protease; Pr 22.6 3E+02 0.0066 29.9 7.8 105 86-227 29-133 (443)
127 TIGR00387 glcD glycolate oxida 22.5 78 0.0017 33.8 3.3 28 148-177 5-32 (413)
128 KOG3257|consensus 22.4 46 0.00099 31.0 1.3 23 148-170 135-158 (168)
129 PF00150 Cellulase: Cellulase 22.3 84 0.0018 30.6 3.3 57 150-236 61-117 (281)
130 PF06080 DUF938: Protein of un 22.3 50 0.0011 32.1 1.6 28 152-179 172-199 (204)
131 cd07940 DRE_TIM_IPMS 2-isoprop 22.2 2.8E+02 0.006 27.6 7.0 56 104-178 115-170 (268)
132 smart00642 Aamy Alpha-amylase 22.0 2.4E+02 0.0051 26.2 6.0 29 101-130 68-96 (166)
133 cd06840 PLPDE_III_Bif_AspK_Dap 21.9 5.8E+02 0.013 26.7 9.7 21 148-168 153-173 (368)
134 cd07265 2_3_CTD_N N-terminal d 21.8 91 0.002 26.2 3.0 32 147-178 69-102 (122)
135 COG2008 GLY1 Threonine aldolas 21.7 77 0.0017 33.2 2.9 23 145-167 144-166 (342)
136 COG3977 Alanine-alpha-ketoisov 21.7 79 0.0017 32.8 2.9 25 145-169 193-217 (417)
137 cd03327 MR_like_2 Mandelate ra 21.6 1.5E+02 0.0033 30.7 5.1 27 31-58 173-199 (341)
138 COG0277 GlcD FAD/FMN-containin 21.6 86 0.0019 33.4 3.4 30 148-179 39-68 (459)
139 KOG0256|consensus 21.4 2.4E+02 0.0051 30.6 6.4 114 31-193 197-311 (471)
140 COG0621 MiaB 2-methylthioadeni 21.3 1.4E+02 0.003 32.5 4.8 92 41-168 236-329 (437)
141 COG1217 TypA Predicted membran 21.3 56 0.0012 35.8 1.8 22 158-179 47-81 (603)
142 PRK02308 uvsE putative UV dama 21.2 4.7E+02 0.01 26.9 8.6 56 104-169 49-109 (303)
143 PRK14334 (dimethylallyl)adenos 21.2 1.3E+02 0.0029 32.4 4.7 52 98-166 267-319 (440)
144 PRK05942 aspartate aminotransf 21.2 94 0.002 32.5 3.6 23 146-168 185-207 (394)
145 PRK11194 ribosomal RNA large s 21.2 2.8E+02 0.0061 29.5 7.1 72 75-170 221-299 (372)
146 PRK09505 malS alpha-amylase; R 21.0 93 0.002 35.8 3.6 30 149-179 290-319 (683)
147 PF00155 Aminotran_1_2: Aminot 20.9 95 0.0021 31.7 3.5 24 146-169 163-186 (363)
148 PRK14454 ribosomal RNA large s 20.9 2.4E+02 0.0053 29.5 6.5 72 74-169 212-287 (342)
149 PF09711 Cas_Csn2: CRISPR-asso 20.9 1E+02 0.0022 29.6 3.3 25 145-169 140-164 (188)
150 COG2100 Predicted Fe-S oxidore 20.8 1.7E+02 0.0038 30.6 5.1 48 97-164 236-284 (414)
151 cd07943 DRE_TIM_HOA 4-hydroxy- 20.6 2.4E+02 0.0053 28.0 6.2 20 152-171 113-132 (263)
152 PRK09427 bifunctional indole-3 20.6 1.2E+02 0.0027 33.0 4.4 39 146-184 141-179 (454)
153 PF01408 GFO_IDH_MocA: Oxidore 20.6 2.6E+02 0.0056 23.4 5.7 21 148-168 97-117 (120)
154 PLN00145 tyrosine/nicotianamin 20.5 1E+02 0.0022 33.0 3.7 24 146-169 205-228 (430)
155 PRK09147 succinyldiaminopimela 20.5 97 0.0021 32.4 3.5 23 146-168 181-203 (396)
156 KOG4175|consensus 20.4 1.7E+02 0.0036 28.7 4.6 24 148-171 132-155 (268)
157 PF08671 SinI: Anti-repressor 20.4 2.5E+02 0.0055 18.8 4.2 13 148-160 17-29 (30)
158 TIGR01290 nifB nitrogenase cof 20.4 1.9E+02 0.0042 31.3 5.8 29 148-176 200-231 (442)
159 PRK09082 methionine aminotrans 20.4 95 0.0021 32.4 3.4 23 146-168 178-200 (386)
160 COG0826 Collagenase and relate 20.3 1.4E+02 0.0031 31.3 4.6 35 146-180 44-78 (347)
161 TIGR02456 treS_nterm trehalose 20.2 99 0.0022 34.3 3.6 30 149-179 74-103 (539)
162 TIGR03542 DAPAT_plant LL-diami 20.2 1.2E+02 0.0025 32.0 4.0 24 145-168 187-210 (402)
163 cd03320 OSBS o-Succinylbenzoat 20.0 1.8E+02 0.004 28.8 5.2 22 149-170 212-233 (263)
No 1
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20). The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits. Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff. Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in
Probab=100.00 E-value=1e-84 Score=673.38 Aligned_cols=339 Identities=40% Similarity=0.716 Sum_probs=301.2
Q ss_pred CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880 27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ 106 (497)
Q Consensus 27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~ 106 (497)
|++||+||||||||+|+++||++||.||.|| ||+|||||+||||||+|++.||..
T Consensus 1 f~~RG~mlDvaR~f~~~~~ik~~Id~ma~~K-------------lN~lh~HltDd~~~rle~~~~P~L------------ 55 (348)
T cd06562 1 FPHRGLLLDTSRHFLSVDSIKRTIDAMAYNK-------------LNVLHWHITDSQSFPLESPSYPEL------------ 55 (348)
T ss_pred CCccceeeeccccCCCHHHHHHHHHHHHHhC-------------CcEEEEeEEcCCCceEeeCCCchh------------
Confidence 7899999999999999999999999999999 699999999999999999988732
Q ss_pred HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880 107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR 186 (497)
Q Consensus 107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~ 186 (497)
+..|+|+++++||++||+|||+||++|||+||||||+|||+.++++++|+
T Consensus 56 ------------------------------t~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~a~~~~~p~ 105 (348)
T cd06562 56 ------------------------------SKKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPEIDTPGHTGSWGQGYPE 105 (348)
T ss_pred ------------------------------hhccCcCCCceECHHHHHHHHHHHHHcCCEEEEeccCchhhHHHHHhChh
Confidence 12456778899999999999999999999999999999999999999887
Q ss_pred CCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCC
Q psy14880 187 YGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQV 266 (497)
Q Consensus 187 ~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~ 266 (497)
+... |. ..|..++.++..++|||++|+|++|+++|++|++++|| +++|||||||
T Consensus 106 l~~~----~~--~~~~~~~~~~~~~~L~~~~~~t~~fl~~vl~E~~~lF~--~~~iHiGgDE------------------ 159 (348)
T cd06562 106 LLTG----CY--AVWRKYCPEPPCGQLNPTNPKTYDFLKTLFKEVSELFP--DKYFHLGGDE------------------ 159 (348)
T ss_pred hhCC----CC--ccccccccCCCCccccCCChhHHHHHHHHHHHHHHhcC--CcceEeecCC------------------
Confidence 6522 21 22444555666789999999999999999999999999 9999999999
Q ss_pred CCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCC
Q psy14880 267 PCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLT 346 (497)
Q Consensus 267 ~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~ 346 (497)
+..+||+++|+|+++|+++|++ ++.+|+.+|++++.++|++ +||++++|+|++.
T Consensus 160 ------------------~~~~~w~~~p~~~~~m~~~g~~----~~~~l~~~f~~~~~~~l~~----~Gk~~i~W~d~~~ 213 (348)
T cd06562 160 ------------------VNFNCWNSNPEIQKFMKKNNGT----DYSDLESYFIQRALDIVRS----LGKTPIVWEEVFD 213 (348)
T ss_pred ------------------CCCCcccCCHHHHHHHHHcCCC----CHHHHHHHHHHHHHHHHHH----cCCeEEEeeeccc
Confidence 7789999999999999999973 6889999999999999984 8999999999987
Q ss_pred CchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccccCC-CCCCChhhhhccCCCC-------CC
Q psy14880 347 DPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFWGV-TSYYRWQRVYDNLLPS-------SP 418 (497)
Q Consensus 347 ~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~-~~~~~~~~iY~~~p~~-------~~ 418 (497)
..+ ..+ +++++|++|++.. .+.+++++||+||+|+++++|||+++.+. .++.+|+++|+++|.. .+
T Consensus 214 ~~~---~~~-~~~~iv~~W~~~~--~~~~~~~~G~~vI~s~~~~~Yld~~~~~~~~~~~~~~~~y~~~p~~~~~~~~~~~ 287 (348)
T cd06562 214 NGV---YLL-PKDTIVQVWGGSD--ELKNVLAAGYKVILSSYDFWYLDCGFGGWVGPGNDWCDPYKNWPRIYSGTPEQKK 287 (348)
T ss_pred CCC---ccC-CCCeEEEECCCcH--HHHHHHHCCCCEEEeCCCcEEEeecCCCCCCCCCcHhhhhcCCCCCCCCChhhcc
Confidence 643 134 5799999999874 67899999999999999999999986321 3568999999988864 38
Q ss_pred CccceeeeeeccccCCCcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHHHHHHHHHHHCC
Q psy14880 419 LVLGGEVAMWTEYVDDQSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRFLEQRERLVEMG 478 (497)
Q Consensus 419 ~ilG~e~~lWsE~~~~~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl~~~~~~l~~~G 478 (497)
+|+|+|+|||+|++++.++++++|||++|+||++|+++. ++|++|..||..+++||.+||
T Consensus 288 ~ilG~~~~lW~E~~~~~~~~~~~~PR~~a~AE~~W~~~~~~~~~~f~~Rl~~~~~~l~~~g 348 (348)
T cd06562 288 LVLGGEACMWGEQVDDTNLDQRLWPRASALAERLWSGPSDTNLTDAEPRLVEFRCRLVRRG 348 (348)
T ss_pred cEEEEEEEeeeeeeCccceeehhhhhHHHHHHHhhCCCcCCCHHHHHHHHHHHHHHHHhCc
Confidence 999999999999999989999999999999999999876 689999999999999999998
No 2
>KOG2499|consensus
Probab=100.00 E-value=1e-83 Score=660.07 Aligned_cols=377 Identities=40% Similarity=0.697 Sum_probs=336.4
Q ss_pred chHHHHHHHH------hhCCccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEE
Q psy14880 3 SFDDYIQLWA------HFQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRW 76 (497)
Q Consensus 3 ~~~~~~~~~~------~~~~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhl 76 (497)
.++++-||-- .+.+..+.|.|.|+|.+||+|||+||||+|++.||+.||.||.-| +|+|||
T Consensus 151 glETfSqLv~~d~~~~~~~~~~~~I~D~PrF~hRGlLlDTSRhylpv~~IkrtLeaMa~nK-------------LNVlHW 217 (542)
T KOG2499|consen 151 GLETFSQLVWGDSIGGLFMIATAYIQDKPRFGHRGLLLDTSRHYLPVKVIKRTLEAMAANK-------------LNVLHW 217 (542)
T ss_pred HHHHHHHHheeccCCceEEeeeeeEeccCCCcccceEEecccceecHHHHHHHHHHHHhhh-------------hceeEE
Confidence 4678888832 455667889999999999999999999999999999999999999 599999
Q ss_pred eccCCcccccccccCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
Q psy14880 77 QMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQEL 156 (497)
Q Consensus 77 Hl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~ei 156 (497)
|++|+|+||+|+|+||.+ .++|+||+.+.||++|+++|
T Consensus 218 HivDs~SFPle~~~~PeL------------------------------------------~~kGaYs~~~vYT~eDv~ev 255 (542)
T KOG2499|consen 218 HIVDSQSFPLESPTFPEL------------------------------------------HRKGAYSPRHVYTREDVSEV 255 (542)
T ss_pred EeecCCCCccccCCchhh------------------------------------------hhcCCCCcceeecHHHHHHH
Confidence 999999999999998732 24799999999999999999
Q ss_pred HHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 157 VHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 157 v~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
|+||+.|||+|+||||+|||+++ |+| ++|++ ..++|+.++++++++.|||+++.+|+|+++++.|+.+.||
T Consensus 256 V~yarlRGIRVlpEfD~PgHt~s---Wg~--g~~~f----l~p~~~~~~~~~~~gplnP~~n~tydvls~i~~dv~evFp 326 (542)
T KOG2499|consen 256 VEYARLRGIRVLPEFDTPGHTGS---WGP--GYPDF----LTPCWSSFEVQPPFGPLNPTNNHTYDVLSEIFEDVSEVFP 326 (542)
T ss_pred HHHHHhccceeeecccCCccccc---ccC--CCCcc----cCCcccccccCCCCcCCCCCchhHHHHHHHHHHHHHHhCc
Confidence 99999999999999999999998 665 45663 2346788889999999999999999999999999999999
Q ss_pred CCCCeEEEcCCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHH
Q psy14880 237 YGDEMFHMGADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLW 316 (497)
Q Consensus 237 ~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~ 316 (497)
..+||+|||| |+..||+++|++|++|+++|++. +..++.
T Consensus 327 --~~~~HlGGDE------------------------------------V~~~CW~s~~~Iq~fM~~kGfg~---~~~~~~ 365 (542)
T KOG2499|consen 327 --DEFFHLGGDE------------------------------------VSTPCWKSNPEIQDFMRKKGFGL---DTKSLE 365 (542)
T ss_pred --HHHeecCCce------------------------------------eecccccCChHHHHHHHhCCCCc---hHHHHH
Confidence 9999999999 88999999999999999999963 556677
Q ss_pred HHHHHHHHHHHHHHhhCCCceEEEecCCCCCchhhccCCCCCcEEEEecCCCCc-hhHHHHHHcCCeEEEeCCCcccccc
Q psy14880 317 AHFQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPKRYIIQTWVPRID-PLADLLISKGYQVIISTKDAWYLDH 395 (497)
Q Consensus 317 ~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~~~~~~~~l~~~~~iv~~W~~~~~-~~~~~~~~~Gy~vI~s~~~~~Yld~ 395 (497)
.+|++.+.+++. +.+++++.|+|.+.+... + +.++|+|.|+.+.+ ....+.+.+||++|+|+..+||||+
T Consensus 366 ~~~~~~~~~i~~----s~~~~~I~Wqevf~n~~~----i-~p~tiiq~W~~~~~~~~~~k~v~~~~~~ivs~s~~wYLd~ 436 (542)
T KOG2499|consen 366 RLYIQFLLDILN----SWNKTVIVWQEVFDNKRK----I-DPRTIIQIWKIGTWYPKELKIVTKGYRFIVSNSAAWYLDH 436 (542)
T ss_pred HHHHHHHHHHHH----hcCceEEEeehhcccccc----C-CCCceeeeeccCCccHHHHHHHhccCceEEEeccceEeec
Confidence 888888878777 467899999999987643 2 35899999998752 4567888999999999999999998
Q ss_pred cccCCCCCCChhhhhccCCCCC-------CCccceeeeeeccccCCCcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880 396 GFWGVTSYYRWQRVYDNLLPSS-------PLVLGGEVAMWTEYVDDQSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF 467 (497)
Q Consensus 396 ~~~~~~~~~~~~~iY~~~p~~~-------~~ilG~e~~lWsE~~~~~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl 467 (497)
+.|+ .+|+++|+.+|.++ +.|+|||+|||+|++|+..++.++|||+.|+|||+||.++ ..+.+...||
T Consensus 437 ~~~g----~dw~~~Y~~~p~~~~g~~~~k~lVlGGE~cmWgE~vD~t~L~~RlWPRAsA~AERLWS~~~~~~~~~A~~Rl 512 (542)
T KOG2499|consen 437 IGYG----SDWRKVYNTEPLSGMGTPEQKKLVLGGEVCMWGEYVDNTNLESRLWPRASAAAERLWSNKKVSRLLDAYPRL 512 (542)
T ss_pred cccC----CChhheeeccccccCCCHHHhhheecceeeeehhhccccccccccccchhHHHHHhhcccccchHHHHHHHH
Confidence 7763 68999999999873 5799999999999999999999999999999999999655 5789999999
Q ss_pred HHHHHHHHHCCCCcccCCCCeeecCCCCCC
Q psy14880 468 LEQRERLVEMGIRAEVTTPEWCYLNDGQCR 497 (497)
Q Consensus 468 ~~~~~~l~~~Gi~~~p~~p~~c~~~~~~c~ 497 (497)
..+||||++|||.+.|+.|+||.+++++|.
T Consensus 513 ~~~RcrLv~RGi~A~p~~p~~C~~~~~~c~ 542 (542)
T KOG2499|consen 513 HLFRCRLVARGIGAQPVQPGWCLQEEGECP 542 (542)
T ss_pred HHHHHHHHhcCCCcCCCCCcccccCCCCCC
Confidence 999999999999999999999999999994
No 3
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00 E-value=2.9e-80 Score=642.62 Aligned_cols=341 Identities=24% Similarity=0.357 Sum_probs=284.8
Q ss_pred CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880 27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ 106 (497)
Q Consensus 27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~ 106 (497)
|++||+||||||||+|+++||++||.||+|| ||+|||||+||||||+|++.||.+
T Consensus 1 f~~RG~mLD~aR~f~~~~~ik~~Id~ma~~K-------------~N~lhlHltDdq~~rle~~~~P~L------------ 55 (357)
T cd06563 1 FSWRGLMLDVSRHFFPVDEVKRFIDLMALYK-------------LNVFHWHLTDDQGWRIEIKKYPKL------------ 55 (357)
T ss_pred CCccceeeeccccCcCHHHHHHHHHHHHHhc-------------cceEEEeeecCCCceecccCcchh------------
Confidence 7899999999999999999999999999999 699999999999999999999843
Q ss_pred HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880 107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR 186 (497)
Q Consensus 107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~ 186 (497)
.+.++.|+-. ....|... .+.-..+++||++|+++||+||++|||+||||||+|||+.++++++|+
T Consensus 56 --t~~ga~~~~~---~~~~~~~~---------~~~~~~~~~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~pe 121 (357)
T cd06563 56 --TEVGAWRGPT---EIGLPQGG---------GDGTPYGGFYTQEEIREIVAYAAERGITVIPEIDMPGHALAALAAYPE 121 (357)
T ss_pred --hhcccccCcc---cccccccc---------cCCCccCceECHHHHHHHHHHHHHcCCEEEEecCCchhHHHHHHhCcc
Confidence 2223333211 00011110 111134678999999999999999999999999999999999998876
Q ss_pred CCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCC
Q psy14880 187 YGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQV 266 (497)
Q Consensus 187 ~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~ 266 (497)
+.... ....++.. ...++++|||++|+|++|+++|++|++++|+ +++|||||||
T Consensus 122 l~~~~----~~~~~~~~--~~~~~~~L~~~~~~t~~f~~~ll~E~~~lF~--~~~iHiGgDE------------------ 175 (357)
T cd06563 122 LGCTG----GPGSVVSV--QGVVSNVLCPGKPETYTFLEDVLDEVAELFP--SPYIHIGGDE------------------ 175 (357)
T ss_pred ccCCC----CCCccccc--cCcCCCccCCCChhHHHHHHHHHHHHHHhCC--CCeEEEeccc------------------
Confidence 54211 11111111 1234578999999999999999999999999 9999999999
Q ss_pred CCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCC
Q psy14880 267 PCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLT 346 (497)
Q Consensus 267 ~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~ 346 (497)
+...||+++|.|+++|+++|+. +..+|+.+|++++.++|++ +|+++++|+|++.
T Consensus 176 ------------------~~~~~w~~~~~~~~~~~~~g~~----~~~~l~~~f~~~~~~~v~~----~G~~~i~W~d~~~ 229 (357)
T cd06563 176 ------------------VPKGQWEKSPACQARMKEEGLK----DEHELQSYFIKRVEKILAS----KGKKMIGWDEILE 229 (357)
T ss_pred ------------------cCCcccccCHHHHHHHHHcCCC----CHHHHHHHHHHHHHHHHHH----cCCEEEEeecccc
Confidence 7778999999999999999984 5778999999999999984 8999999999875
Q ss_pred CchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCccccccccc-------CCCCCCChhhhhccCCCC---
Q psy14880 347 DPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFW-------GVTSYYRWQRVYDNLLPS--- 416 (497)
Q Consensus 347 ~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~-------~~~~~~~~~~iY~~~p~~--- 416 (497)
.. + +++++|++|++. ..+.+++++||+||+|+++++|||+++. +..++.+|+++|+++|..
T Consensus 230 ~~------l-~~~~iv~~W~~~--~~~~~~~~~G~~vI~s~~~~~Yld~~~~~~~~~~~~~~~~~~~~~~y~~~P~~~~~ 300 (357)
T cd06563 230 GG------L-PPNATVMSWRGE--DGGIKAAKQGYDVIMSPGQYLYLDYAQSKGPDEPASWAGFNTLEKVYSFEPVPGGL 300 (357)
T ss_pred cC------C-CCCcEEEECCCc--hHHHHHHHCCCCEEEeCCCceEEecCCCCCCCCCccccCCCCHHHHhcCCCCCCCC
Confidence 42 4 589999999988 4678999999999999999999998751 135678999999999976
Q ss_pred ----CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880 417 ----SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF 467 (497)
Q Consensus 417 ----~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl 467 (497)
.++|+|+|+|||+|.+.+ +++++++|||++|+||++|++++ ++|++|..||
T Consensus 301 ~~~~~~~ilG~~~~lW~E~~~~~~~~~~~~~PR~~a~AE~~W~~~~~~~~~~f~~rl 357 (357)
T cd06563 301 TPEQAKRILGVQANLWTEYIPTPERVEYMAFPRLLALAEVAWTPPEKKDWEDFRKRL 357 (357)
T ss_pred ChhHhcCEEEEEEEecccccCCHHHHHHHHHHHHHHHHHHHhCCccCCCHHHHHhhC
Confidence 279999999999999976 78999999999999999999976 7999999886
No 4
>cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00 E-value=9e-79 Score=618.61 Aligned_cols=309 Identities=33% Similarity=0.495 Sum_probs=268.1
Q ss_pred CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880 27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ 106 (497)
Q Consensus 27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~ 106 (497)
|++||+|||+||||+|+++||++||.||.+| ||+|||||+||||||+|+++||..
T Consensus 1 f~~RG~mlD~aR~f~~~~~ik~~Id~ma~~K-------------lN~lh~HltDd~~~rle~~~~P~l------------ 55 (311)
T cd06570 1 FPWRGLLIDVSRHFIPVAVIKRQLDAMASVK-------------LNVFHWHLTDDQGFRIESKKYPKL------------ 55 (311)
T ss_pred CCccCeEEecCCCCcCHHHHHHHHHHHHHhC-------------CeEEEEEEecCCCceeecCCCccc------------
Confidence 7899999999999999999999999999999 699999999999999999988742
Q ss_pred HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880 107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR 186 (497)
Q Consensus 107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~ 186 (497)
+ ..| +.+++||++||+|||+||++|||+||||||+|||+.++++++|+
T Consensus 56 ----------------------t--------~~g--~~~~~yT~~di~elv~yA~~rgI~vIPEId~PGH~~a~~~~ype 103 (311)
T cd06570 56 ----------------------Q--------QKA--SDGLYYTQEQIREVVAYARDRGIRVVPEIDVPGHASAIAVAYPE 103 (311)
T ss_pred ----------------------c--------ccC--CCCCccCHHHHHHHHHHHHHcCCEEEEeecCccchHHHHHhCHH
Confidence 0 112 23578999999999999999999999999999999999987766
Q ss_pred CCCCCcccccCC-CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCC
Q psy14880 187 YGLGDLIVCLND-PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQ 265 (497)
Q Consensus 187 ~~~~~l~~~~~~-~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~ 265 (497)
+. .|... .....++. ..++|||++|+|++|+++|++|++++|| +++|||||||
T Consensus 104 l~-----~~~~~~~~~~~~~~--~~~~l~~~~p~t~~f~~~l~~E~~~lF~--~~~iHiGgDE----------------- 157 (311)
T cd06570 104 LA-----SGPGPYVIERGWGV--FEPLLDPTNEETYTFLDNLFGEMAELFP--DEYFHIGGDE----------------- 157 (311)
T ss_pred hc-----cCCCcccccccccc--CCCccCCCChhHHHHHHHHHHHHHHhCC--CCceEeeccC-----------------
Confidence 43 22221 11122222 2357999999999999999999999999 9999999999
Q ss_pred CCCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCC
Q psy14880 266 VPCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHL 345 (497)
Q Consensus 266 ~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~ 345 (497)
+...||+++|+|+++|+++|++ +..+|+.+|++++.++|++ +||++|+|+|++
T Consensus 158 -------------------~~~~~W~~~p~~~~~~~~~g~~----~~~~l~~~f~~~~~~~v~~----~Gk~~~~W~d~~ 210 (311)
T cd06570 158 -------------------VDPKQWNENPRIQAFMKEHGLK----DAAALQAYFNQRVEKILSK----HGKKMIGWDEVL 210 (311)
T ss_pred -------------------CCCCcccCCHHHHHHHHHcCCC----CHHHHHHHHHHHHHHHHHH----cCCeEEEecccc
Confidence 7788999999999999999984 5778999999999999884 899999999987
Q ss_pred CCchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccCCCCCCCccceee
Q psy14880 346 TDPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNLLPSSPLVLGGEV 425 (497)
Q Consensus 346 ~~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~p~~~~~ilG~e~ 425 (497)
.. .+ +++++|+.|++. ..+.+++++||++|+|++ +|||+++ +++++|+++|. |+|||+
T Consensus 211 ~~------~l-~~~~iv~~W~~~--~~~~~~~~~G~~vI~s~~--~YlD~~~-------~~~~~y~~~p~----ilG~~~ 268 (311)
T cd06570 211 HP------DL-PKNVVIQSWRGH--DSLGEAAKAGYQGILSTG--YYIDQPQ-------PAAYHYRVDPM----ILGGEA 268 (311)
T ss_pred cc------CC-CCCeEEEEeCCc--hHHHHHHHCCCCEEEech--hheeCCC-------chhheeCCCCc----EEEEEE
Confidence 42 35 579999999975 457889999999999984 8999864 35689999987 999999
Q ss_pred eeeccccCCCcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880 426 AMWTEYVDDQSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF 467 (497)
Q Consensus 426 ~lWsE~~~~~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl 467 (497)
|||||+++++++++++|||++|+||++|++++ ++|++|.+||
T Consensus 269 ~lW~E~v~~~~~~~~~~PR~~A~AE~~Ws~~~~~~~~~~~~Rl 311 (311)
T cd06570 269 TMWAELVSEETIDSRLWPRTAAIAERLWSAQDVRDEDDMYRRL 311 (311)
T ss_pred EEeecCcCHHHHHHHHhHHHHHHHHHhhCCCcCCCHHHHHhhC
Confidence 99999998789999999999999999999977 7999999886
No 5
>cd06569 GH20_Sm-chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00 E-value=7.1e-78 Score=638.43 Aligned_cols=360 Identities=21% Similarity=0.291 Sum_probs=283.0
Q ss_pred cCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCCh
Q psy14880 23 DEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTP 102 (497)
Q Consensus 23 D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~ 102 (497)
|+|+|++||+||||||||+|+++||++||.||++| ||+||||||||||||+|+++||.
T Consensus 1 D~Prf~~RGlmLDvaR~f~~~~~ik~~Id~ma~~K-------------~N~lHlHLtDdqgwriei~~~P~--------- 58 (445)
T cd06569 1 DAPRFEYRGMHLDVARNFHSKETVLKLLDQMAAYK-------------LNKLHLHLTDDEGWRLEIPGLPE--------- 58 (445)
T ss_pred CCCCCcccceeeeccCCCCCHHHHHHHHHHHHHhC-------------CceEEEEeecCCCcceeccCCch---------
Confidence 89999999999999999999999999999999999 69999999999999999999984
Q ss_pred hHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880 103 LEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW 181 (497)
Q Consensus 103 ~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l 181 (497)
+++.++.|+.... .+.-.|.+. .++ ....+.+++||++||+|||+||++|||+||||||+|||+.+++
T Consensus 59 -----Lt~~ga~r~~~~~~~~~~~~~~~-~~~-----~~~~~~~g~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~a~l 127 (445)
T cd06569 59 -----LTEVGAKRCHDLSETTCLLPQLG-SGP-----DTNNSGSGYYSRADYIEILKYAKARHIEVIPEIDMPGHARAAI 127 (445)
T ss_pred -----hhhcccccccccccccccccccc-cCc-----ccCcccCCccCHHHHHHHHHHHHHcCCEEEEccCCchhHHHHH
Confidence 3444444443211 011112211 100 0012457889999999999999999999999999999999999
Q ss_pred hhcCCCCCCCcccccCCC---CC--------Ccc-ccC-CCCCcCCCCChhHHHHHHHHHHHHHhccCC---CCCeEEEc
Q psy14880 182 QWGPRYGLGDLIVCLNDP---SW--------RAN-CIQ-PPCGQLNPINPHVYTVLKDIYGELMGMAKY---GDEMFHMG 245 (497)
Q Consensus 182 ~~~p~~~~~~l~~~~~~~---~~--------~~~-~~~-~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~---~~~~iHiG 245 (497)
++++. +||+|..+.... .+ ..+ +++ ..+++|||++|+|++|+++|++|++++|+. .+++||||
T Consensus 128 ~a~~~-~yp~l~~~g~~~~~~~~~~~d~~~~~~~~~~~~~~~~~L~p~~~~ty~fl~~vl~Ev~~lF~~~g~P~~yiHiG 206 (445)
T cd06569 128 KAMEA-RYRKLMAAGKPAEAEEYRLSDPADTSQYLSVQFYTDNVINPCMPSTYRFVDKVIDEIARMHQEAGQPLTTIHFG 206 (445)
T ss_pred Hhhhc-cchhhhccCCccccccccccCcccccccccccccccccccCCchhHHHHHHHHHHHHHHHHhccCCCcceEEEe
Confidence 87532 345554322100 00 000 111 134789999999999999999999999942 17999999
Q ss_pred CCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCCCcceeecccccCCHHHH--HHHHHcCCCCCcccHHHHHHHHHHHH
Q psy14880 246 ADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSPHLWTVFIKCWNNTPAIV--DFMLAHGKNTTFDDYIQLWAHFQNKA 323 (497)
Q Consensus 246 gDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~--~~~~~~g~~~~~~~~~~l~~~F~~~~ 323 (497)
||| +...||+++|.|+ ++|+++|+. ++.+|+.+|++++
T Consensus 207 GDE------------------------------------v~~~~W~~~~~~~~~~~~~~~~~~----~~~~l~~~f~~~v 246 (445)
T cd06569 207 GDE------------------------------------VPEGAWGGSPACKAQLFAKEGSVK----DVEDLKDYFFERV 246 (445)
T ss_pred ccc------------------------------------cCCCcccCCHHHHHHHHHHhcCCC----CHHHHHHHHHHHH
Confidence 999 7788999999999 999998874 6789999999999
Q ss_pred HHHHHHHhhCCCceEEEecCCCCCchhh--ccCCCCCcEEEEecCCCC---chhHHHHHHcCCeEEEeCCCcccccccc-
Q psy14880 324 AASLDEAVGHNRTKLIVWSSHLTDPEII--LNYLDPKRYIIQTWVPRI---DPLADLLISKGYQVIISTKDAWYLDHGF- 397 (497)
Q Consensus 324 ~~~l~~~~~~~g~~~~~W~d~~~~~~~~--~~~l~~~~~iv~~W~~~~---~~~~~~~~~~Gy~vI~s~~~~~Yld~~~- 397 (497)
.++|++ +|+++++|+|++...... +..+ +++++|++|+... ...+.+++++||+||+||++++|||+++
T Consensus 247 ~~~l~~----~Gk~~i~W~e~~~~~~~~~~~~~~-~~~~~v~~W~~~~~~~~~~~~~~~~~G~~vI~s~~~~~YlD~~~~ 321 (445)
T cd06569 247 SKILKA----HGITLAGWEDGLLGKDTTNVDGFA-TPYVWNNVWGWGYWGGEDRAYKLANKGYDVVLSNATNLYFDFPYE 321 (445)
T ss_pred HHHHHH----cCCeEEEecccccCCCcccccccC-CCCeEEEEccCCcccccHHHHHHHHCCCCEEEeCCCcEEEecCCC
Confidence 999984 799999999998764321 1124 4689999998642 3567889999999999999999999864
Q ss_pred ---------cCCCCCCChhhhhccCCCC-----------------------------CCCccceeeeeeccccCC-Ccch
Q psy14880 398 ---------WGVTSYYRWQRVYDNLLPS-----------------------------SPLVLGGEVAMWTEYVDD-QSLD 438 (497)
Q Consensus 398 ---------~~~~~~~~~~~iY~~~p~~-----------------------------~~~ilG~e~~lWsE~~~~-~~l~ 438 (497)
| ++.|.+++++|+++|.+ .++|+|+|+|||||++++ +.++
T Consensus 322 ~~~~~~g~~w-~~~~~~~~~~y~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ilG~e~~lWsE~v~~~~~l~ 400 (445)
T cd06569 322 KHPEERGYYW-AGRFVDTKKVFSFMPDNLYANAEVTRDGDPIDDTALNGKVRLTLEGPKNILGLQGQLWSETIRTDEQLE 400 (445)
T ss_pred CCCCcCCccc-ccCCCCHHHhhccCCchhhccchhhcccCccccccccccccCChhHhcceeEEEEeeeccccCCHHHhH
Confidence 3 23458999999998741 269999999999999987 7899
Q ss_pred hhhhhhHHHHHHHhcCCCCCChHHH
Q psy14880 439 GRLWPRTAAAAERLWSNPKSSSSEA 463 (497)
Q Consensus 439 ~~l~PRl~A~AEr~W~~~~~~~~~f 463 (497)
+++|||++|+||++|++.. +|.++
T Consensus 401 ~~~~PR~~A~AE~~Ws~~~-~~~~~ 424 (445)
T cd06569 401 YMVFPRLLALAERAWHKAP-WEADY 424 (445)
T ss_pred HHhhhHHHHHHHHHhcCCc-ccccc
Confidence 9999999999999999854 34444
No 6
>cd06568 GH20_SpHex_like A subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the N-acetylhexosaminidase from Streptomyces plicatus (SpHex). SpHex catalyzes the hydrolysis of N-acetyl-beta-hexosaminides. An Asp residue within the active site plays a critical role in substrate-assisted catalysis by orienting the 2-acetamido group and stabilizing the transition state. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Proteins belonging to this subgroup lack the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases.
Probab=100.00 E-value=3.7e-76 Score=604.64 Aligned_cols=313 Identities=22% Similarity=0.312 Sum_probs=259.6
Q ss_pred CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880 27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ 106 (497)
Q Consensus 27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~ 106 (497)
|++||+|||+||||+|+++||++||.||++| ||+|||||+||||||+|++.||...
T Consensus 1 f~~RG~mlD~aR~f~~~~~lk~~id~ma~~K-------------lN~lhlHLtD~~~~rle~~~~P~lt----------- 56 (329)
T cd06568 1 FAYRGLMLDVARHFFTVAEVKRYIDLLALYK-------------LNVLHLHLTDDQGWRIEIKSWPKLT----------- 56 (329)
T ss_pred CCccceeeeccCCCcCHHHHHHHHHHHHHhC-------------CcEEEEEeecCCcceeeecCccccc-----------
Confidence 7899999999999999999999999999999 6999999999999999999987431
Q ss_pred HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880 107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR 186 (497)
Q Consensus 107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~ 186 (497)
+.++.+ .+..+++++||++||+|||+||++|||+||||||+|||+.++++++|+
T Consensus 57 ---~~ga~~-----------------------~~~~~~~~~YT~~di~elv~yA~~rgI~vIPEiD~PGH~~a~~~~~p~ 110 (329)
T cd06568 57 ---EIGGST-----------------------EVGGGPGGYYTQEDYKDIVAYAAERHITVVPEIDMPGHTNAALAAYPE 110 (329)
T ss_pred ---cccccc-----------------------ccCCCCCCcCCHHHHHHHHHHHHHcCCEEEEecCCcHHHHHHHHhChh
Confidence 000000 011245788999999999999999999999999999999999988776
Q ss_pred CCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCC
Q psy14880 187 YGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQV 266 (497)
Q Consensus 187 ~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~ 266 (497)
+. |.+...-...+.++.+++|||++|+|++|+++|++|++++|| +++|||||||+
T Consensus 111 l~------~~~~~~~~~~~~~~~~~~l~~~~~~t~~fl~~v~~E~~~~f~--~~~iHiGgDE~----------------- 165 (329)
T cd06568 111 LN------CDGKAKPLYTGIEVGFSSLDVDKPTTYEFVDDVFRELAALTP--GPYIHIGGDEA----------------- 165 (329)
T ss_pred hc------cCCCCCccccccCCCCcccCCCCHHHHHHHHHHHHHHHHhCC--CCeEEEecccC-----------------
Confidence 54 322111001123445678999999999999999999999999 99999999993
Q ss_pred CCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCC
Q psy14880 267 PCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLT 346 (497)
Q Consensus 267 ~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~ 346 (497)
... ..+++.+|++++.++|++ +||++++|+|++.
T Consensus 166 -------------------~~~-----------------------~~~~~~~f~~~~~~~v~~----~Gk~~~~W~d~~~ 199 (329)
T cd06568 166 -------------------HST-----------------------PHDDYAYFVNRVRAIVAK----YGKTPVGWQEIAR 199 (329)
T ss_pred -------------------CCC-----------------------chHHHHHHHHHHHHHHHH----CCCeEEEECcccc
Confidence 211 124567899999999984 8999999999864
Q ss_pred CchhhccCCCCCcEEEEecCCCC-chhHHHHHHcCCeEEEeCCCccccccccc-----C--CCCCCChhhhhccCCCC--
Q psy14880 347 DPEIILNYLDPKRYIIQTWVPRI-DPLADLLISKGYQVIISTKDAWYLDHGFW-----G--VTSYYRWQRVYDNLLPS-- 416 (497)
Q Consensus 347 ~~~~~~~~l~~~~~iv~~W~~~~-~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~-----~--~~~~~~~~~iY~~~p~~-- 416 (497)
. .+ +++++|++|++.. ...+.+++++||++|+||++++|||+++. + ..++.+|+++|+++|..
T Consensus 200 ~------~l-~~~~iv~~W~~~~~~~~~~~~~~~G~~vI~s~~~~~YlD~~~~~~~~~~~~~~~~~~~~~~y~~~P~~~~ 272 (329)
T cd06568 200 A------DL-PAGTVAQYWSDRAPDADAAAALDKGAKVILSPADKAYLDMKYDADSPLGLTWAGPVEVREAYDWDPAAYG 272 (329)
T ss_pred c------CC-CCCeEEEECCCCCCchHHHHHHHCCCCEEEeCCCcEEEecCCCCCCCCCcccCCCCCHHHHeeeCCCCCC
Confidence 3 35 5799999999863 24678999999999999999999998641 0 13567999999999976
Q ss_pred ----CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880 417 ----SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF 467 (497)
Q Consensus 417 ----~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl 467 (497)
.++|+|+|+|||+|++++ +++++++|||++|+|||+|++++ ++|++|..||
T Consensus 273 ~~~~~~~ilG~~~~lW~E~~~~~~~~~~~~~PR~~a~AE~~Ws~~~~~~~~~f~~rl 329 (329)
T cd06568 273 PGVPDEAILGVEAPLWTETIRNLDDLEYMAFPRLAGVAEIGWSPQEARDWDDYKVRL 329 (329)
T ss_pred CcchhhCEEEEEEeecccccCCHHHHHHHHHhHHHHHHHHHhCCCcCCCHHHHHhhC
Confidence 279999999999999987 68999999999999999999976 7999999886
No 7
>PF00728 Glyco_hydro_20: Glycosyl hydrolase family 20, catalytic domain; InterPro: IPR015883 Glycoside hydrolase family 20 GH20 from CAZY comprises enzymes with several known activities; beta-hexosaminidase (3.2.1.52 from EC); lacto-N-biosidase (3.2.1.140 from EC). Carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base in this family of enzymes. In the brain and other tissues, beta-hexosaminidase A degrades GM2 gangliosides; specifically, the enzyme hydrolyses terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides. There are 3 forms of beta-hexosaminidase: hexosaminidase A is a trimer, with one alpha, one beta-A and one beta-B chain; hexosaminidase B is a tetramer of two beta-A and two beta-B chains; and hexosaminidase S is a homodimer of alpha chains. The two beta chains are derived from the cleavage of a precursor. Mutations in the beta-chain lead to Sandhoff disease, a lysosomal storage disorder characterised by accumulation of GM2 ganglioside [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3RPM_A 1C7T_A 1QBA_A 1QBB_A 1C7S_A 3RCN_A 2YL8_A 2YL6_A 2YLL_A 2YL5_C ....
Probab=100.00 E-value=1.8e-71 Score=574.04 Aligned_cols=323 Identities=35% Similarity=0.573 Sum_probs=261.7
Q ss_pred CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880 27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ 106 (497)
Q Consensus 27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~ 106 (497)
|++||+|||+||||+|+++||++||.||.+| ||+|||||+|+||||++++++|..
T Consensus 1 f~~RG~~lD~aR~~~~~~~ik~~id~ma~~k-------------~N~lhlhl~D~~~~~~~~~~~p~l------------ 55 (351)
T PF00728_consen 1 FPWRGLMLDVARHFFSVDTIKRLIDQMAYYK-------------LNVLHLHLSDDQGFRLESKSYPEL------------ 55 (351)
T ss_dssp SSEEEEEEETSSS-B-HHHHHHHHHHHHHTT--------------SEEEEEEESSTCB-BEBSTSTHH------------
T ss_pred CCccceeeccCCCCCCHHHHHHHHHHHHHcC-------------CcEEEEEEecCCCCccccCCCccc------------
Confidence 7899999999999999999999999999999 699999999999999999988731
Q ss_pred HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCC---CCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhh
Q psy14880 107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSP---AKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQW 183 (497)
Q Consensus 107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~---~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~ 183 (497)
.+ .|+++. +++||++||++||+||++|||+||||||+|||+.+++++
T Consensus 56 --~~----------------------------~ga~~~~~~~~~yT~~di~~lv~yA~~~gI~VIPeid~PGH~~~~l~~ 105 (351)
T PF00728_consen 56 --TE----------------------------KGAYRPSDAGGYYTKEDIRELVAYAKERGIEVIPEIDTPGHAEAWLKA 105 (351)
T ss_dssp --HH----------------------------TTTESTTCTESEBEHHHHHHHHHHHHHTT-EEEEEEEESSS-HHHHHH
T ss_pred --cc----------------------------cCccccccccccCCHHHHHHHHHHHHHcCCceeeeccCchHHHHHHHh
Confidence 11 122222 348999999999999999999999999999999999999
Q ss_pred cCCCCCCCcccccCCCCC-CccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcc
Q psy14880 184 GPRYGLGDLIVCLNDPSW-RANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCT 262 (497)
Q Consensus 184 ~p~~~~~~l~~~~~~~~~-~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~ 262 (497)
+|++.... |+....+ ...+..+.+++|||++|+|++|+++|++|++++|+ +++|||||||
T Consensus 106 ~p~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~~~~t~~~~~~l~~e~~~~f~--~~~iHiGgDE-------------- 166 (351)
T PF00728_consen 106 YPELGCSA---WPEDKSWPNSTCWYPDNGVLDPSNPETYEFLKDLLDEVADLFP--SKYIHIGGDE-------------- 166 (351)
T ss_dssp HHHHCCCH---TTCSSSCEEEETTSEEEEEE-TTSHHHHHHHHHHHHHHHHHHT--SSEEEEE-TS--------------
T ss_pred Cchhhccc---cccccccccccccCCCcccCCCCcHHHHHHHHHHHHHHHhhCC--CCeEEeCCcc--------------
Confidence 88765321 1111111 11122234468999999999999999999999999 9999999999
Q ss_pred cCCCCCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEec
Q psy14880 263 RQQVPCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWS 342 (497)
Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~ 342 (497)
|..+||+++|+|+++|+++|+. +..+|+.+|++++.+++++ +|+++++|+
T Consensus 167 ----------------------v~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~f~~~~~~~v~~----~g~~~~~W~ 216 (351)
T PF00728_consen 167 ----------------------VNYNCWNNSPECQAWMKQNGLT----DPNDLFQYFVNRLADIVKK----HGKKPIIWN 216 (351)
T ss_dssp ----------------------TTTHHHHCHHHHHHHHHHTTTT----CHHHHHHHHHHHHHHHHHH----TTSEEEEES
T ss_pred ----------------------cccccccCCHHHhhHHhhcCCc----hHHHHHHHHHHHHHHHHHh----cCCcEEEEc
Confidence 6778999999999999999974 5789999999999999985 899999999
Q ss_pred CCCCCchhhccCCCCCcEEEEecCCCC-chhHHHHHHcCCeEEEeCCCcccccccc--cC------CCCCCChhhhhccC
Q psy14880 343 SHLTDPEIILNYLDPKRYIIQTWVPRI-DPLADLLISKGYQVIISTKDAWYLDHGF--WG------VTSYYRWQRVYDNL 413 (497)
Q Consensus 343 d~~~~~~~~~~~l~~~~~iv~~W~~~~-~~~~~~~~~~Gy~vI~s~~~~~Yld~~~--~~------~~~~~~~~~iY~~~ 413 (497)
|++...+.. ..+ +++++|++|++.. ...+..++++||++|+++.+++|+|+++ |. ..++.+++++|+++
T Consensus 217 D~~~~~~~~-~~~-~~~~~i~~W~~~~~~~~~~~~~~~g~~vI~s~~~~~Yld~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (351)
T PF00728_consen 217 DMLDDFPDA-SLL-PKDVIIQVWNYDWGPASAQEFAKKGYKVINSPSDYLYLDPGQSPWPEEDGNYWAPFNTWSDFYNWD 294 (351)
T ss_dssp TTTTTTCCG-HCS-CTTEEEEEESSTTHHHHHHHHHHTTHEEEESTHGTGBTTTTSSTTTSSSSBHTTSTHHHHHHHHHH
T ss_pred cccccCCcc-ccc-cCCceEEEeeccccchhhhHHHHhcCcEEEcCCCceEEccCcCCCCccCccccccccccccccccc
Confidence 999876421 124 6799999999965 4568899999999999999999999876 21 11244677888877
Q ss_pred CCC-------------CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCC
Q psy14880 414 LPS-------------SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSN 455 (497)
Q Consensus 414 p~~-------------~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~ 455 (497)
|.. +++|+|+++++|+|.+.+ +.+++++|||++|+|||+|++
T Consensus 295 ~~~~~~~~~~~~~~~~~~~i~G~~~~~W~E~~~~~~~l~~~~~Pr~~A~AE~~Ws~ 350 (351)
T PF00728_consen 295 PAYFNTNSTQIDPAQKSERILGGEAALWSENIRDEEDLDYRLWPRLAALAERLWSP 350 (351)
T ss_dssp HCHCTTTCSTTTTHHHCCCEEEEEEEEETTTTTSHHHHHHHHTTHHHHHHHHHHSS
T ss_pred ccccccccchhhhhcccCCCceeEEEeecCCCCCHHHHHHHHHHHHHHHHHHHhCC
Confidence 542 148999999999999965 899999999999999999986
No 8
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=100.00 E-value=8.1e-69 Score=545.50 Aligned_cols=293 Identities=27% Similarity=0.440 Sum_probs=245.0
Q ss_pred CceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHH
Q psy14880 29 NRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQEL 108 (497)
Q Consensus 29 ~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~ 108 (497)
+||+||||||||+|+++||++||.||++| ||+|||||+||||||+|++.||..
T Consensus 1 ~RG~mlD~aR~~~~~~~lk~~id~ma~~K-------------~N~lhlHl~D~~~~~le~~~~p~l-------------- 53 (303)
T cd02742 1 IRGIMLDVSRHFLSVESIKRTIDVLARYK-------------INTFHWHLTDDQAWRIESKKFPEL-------------- 53 (303)
T ss_pred CcceeeeccccCcCHHHHHHHHHHHHHhC-------------CcEEEEeeecCCCceEeeCccchh--------------
Confidence 69999999999999999999999999999 699999999999999999988743
Q ss_pred HHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCC
Q psy14880 109 VHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYG 188 (497)
Q Consensus 109 ~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~ 188 (497)
.+ .| . ...+++++++||++||+|||+||++|||+||||||+|||+.++++++|++.
T Consensus 54 ~~----~g--------------~------~~~~~~~~~~yT~~di~elv~yA~~rgI~viPEiD~PGH~~a~~~~~p~l~ 109 (303)
T cd02742 54 AE----KG--------------G------QINPRSPGGFYTYAQLKDIIEYAAARGIEVIPEIDMPGHSTAFVKSFPKLL 109 (303)
T ss_pred hh----hc--------------c------cccCCCCCCeECHHHHHHHHHHHHHcCCEEEEeccchHHHHHHHHhCHHhc
Confidence 10 00 0 012345678899999999999999999999999999999999999888754
Q ss_pred CCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCCCC
Q psy14880 189 LGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQVPC 268 (497)
Q Consensus 189 ~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~~ 268 (497)
.+ |.....+ ....+.|||++|+|++|+++|++|++++|+ +++|||||||+
T Consensus 110 ~~----~~~~~~~-----~~~~~~l~~~~~~t~~fl~~l~~e~~~lf~--~~~iHiGgDE~------------------- 159 (303)
T cd02742 110 TE----CYAGLKL-----RDVFDPLDPTLPKGYDFLDDLFGEIAELFP--DRYLHIGGDEA------------------- 159 (303)
T ss_pred cC----ccccCCC-----CCCCCccCCCCccHHHHHHHHHHHHHHhCC--CCeEEecceec-------------------
Confidence 22 2221111 113468999999999999999999999999 99999999993
Q ss_pred cCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCCCc
Q psy14880 269 VQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLTDP 348 (497)
Q Consensus 269 ~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~~ 348 (497)
... .+..+++.+|++++.++|++ +|+++++|+|++...
T Consensus 160 -----------------~~~---------------------~~~~~l~~~f~~~~~~~v~~----~g~~~~~W~d~~~~~ 197 (303)
T cd02742 160 -----------------HFK---------------------QDRKHLMSQFIQRVLDIVKK----KGKKVIVWQDGFDKK 197 (303)
T ss_pred -----------------CCC---------------------CCHHHHHHHHHHHHHHHHHH----cCCeEEEecccccCC
Confidence 211 12457889999999999984 899999999998653
Q ss_pred hhhccCCCCCcEEEEecCCCC---chhHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccCCCC------CCC
Q psy14880 349 EIILNYLDPKRYIIQTWVPRI---DPLADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNLLPS------SPL 419 (497)
Q Consensus 349 ~~~~~~l~~~~~iv~~W~~~~---~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~p~~------~~~ 419 (497)
. .+ +++++|++|++.. ...+.+++++||+||+|++.++|+ . ..++.+++++|+++|.. .++
T Consensus 198 ~----~l-~~~~ii~~W~~~~~~~~~~~~~~~~~G~~vi~s~~~yly~---~--~~~~~~~~~~y~~~p~~~~~~~~~~~ 267 (303)
T cd02742 198 M----KL-KEDVIVQYWDYDGDKYNVELPEAAAKGFPVILSNGYYLDI---F--IDGALDARKVYKNDPLAVPTPQQKDL 267 (303)
T ss_pred C----CC-CCCeEEEEccCCCCcchHHHHHHHHCCCCEEEeCCceeee---e--CCCCCCHHHHhCCCCCCCCCcccccc
Confidence 2 35 5799999999974 246788999999999999877777 1 24568999999999875 379
Q ss_pred ccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcC
Q psy14880 420 VLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWS 454 (497)
Q Consensus 420 ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~ 454 (497)
|+|+++|||+|.+++ +.+++++|||++|+||++|+
T Consensus 268 vlG~~~~lW~E~~~~~~~~~~~~~pr~~a~AE~~Ws 303 (303)
T cd02742 268 VLGVIACLWGETVKDTKTLQYRFWPRALAVAERSWS 303 (303)
T ss_pred eEEEEEeeeccccCCHHHHHHHHHHHHHHHHHHhhC
Confidence 999999999999998 59999999999999999996
No 9
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway. The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00 E-value=1.5e-66 Score=534.01 Aligned_cols=310 Identities=22% Similarity=0.323 Sum_probs=248.9
Q ss_pred CCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHH
Q psy14880 28 HNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQE 107 (497)
Q Consensus 28 ~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~ 107 (497)
++||+|||+||||+|+++||++||.||++| ||+|||||+| +||++++.++..
T Consensus 1 ~~RG~mLD~aR~~~~~~~ik~~id~ma~~K-------------~N~lhlHltD--~~~~~~~~~p~l------------- 52 (326)
T cd06564 1 EVRGFMLDVGRKYYSMDFLKDIIKTMSWYK-------------MNDLQLHLND--NLIFNLDDMSTT------------- 52 (326)
T ss_pred CcceeEEEccCCCCCHHHHHHHHHHHHHcC-------------CceEEEeecC--CcccccCCCchh-------------
Confidence 479999999999999999999999999999 6999999999 788888877743
Q ss_pred HHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCC
Q psy14880 108 LVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRY 187 (497)
Q Consensus 108 ~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~ 187 (497)
.+.++.++-++. .|. ...++.+.+++||++||+|||+||++|||+||||||+|||+.++++++|++
T Consensus 53 -~~~~~~~~~~~~----~~~---------~~~~~~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~a~~~~~pel 118 (326)
T cd06564 53 -VNNATYASDDVK----SGN---------NYYNLTANDGYYTKEEFKELIAYAKDRGVNIIPEIDSPGHSLAFTKAMPEL 118 (326)
T ss_pred -hhhhhhhccccc----ccc---------ccCCCCCCCCcccHHHHHHHHHHHHHcCCeEeccCCCcHHHHHHHHhhHHh
Confidence 112222221111 111 113445678999999999999999999999999999999999999988876
Q ss_pred CCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCCC
Q psy14880 188 GLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQVP 267 (497)
Q Consensus 188 ~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~ 267 (497)
+.+.. . ..+..++|||++|+|++|+++|++|++++|+..+++|||||||+
T Consensus 119 ~~~~~------~------~~~~~~~l~~~~~~t~~f~~~l~~E~~~~f~~~~~~~HiGgDE~------------------ 168 (326)
T cd06564 119 GLKNP------F------SKYDKDTLDISNPEAVKFVKALFDEYLDGFNPKSDTVHIGADEY------------------ 168 (326)
T ss_pred cCCCc------c------cCCCcccccCCCHHHHHHHHHHHHHHHHhcCCCCCEEEeccccc------------------
Confidence 53211 1 11244789999999999999999999999984469999999993
Q ss_pred CcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCCC
Q psy14880 268 CVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLTD 347 (497)
Q Consensus 268 ~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~ 347 (497)
... .+..+++..|++++.++|++ +||++++|+|++..
T Consensus 169 ------------------~~~---------------------~~~~~~~~~f~~~~~~~v~~----~gk~~~~W~d~~~~ 205 (326)
T cd06564 169 ------------------AGD---------------------AGYAEAFRAYVNDLAKYVKD----KGKTPRVWGDGIYY 205 (326)
T ss_pred ------------------ccc---------------------CccHHHHHHHHHHHHHHHHH----cCCeEEEeCCcccC
Confidence 211 02356788999999999985 79999999999875
Q ss_pred chhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccCCC------------
Q psy14880 348 PEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNLLP------------ 415 (497)
Q Consensus 348 ~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~p~------------ 415 (497)
.+. ...+ +++++|++|++.. ..+.+++++||+||||++.++|+|+++.+..++.+++++|++++.
T Consensus 206 ~~~-~~~l-~~~~iv~~W~~~~-~~~~~~~~~G~~vI~s~~~~~Y~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 282 (326)
T cd06564 206 KGD-TTVL-SKDVIINYWSYGW-ADPKELLNKGYKIINTNDGYLYIVPGAGYYGDYLNTEDIYNNWTPNKFGGTNATLPE 282 (326)
T ss_pred CCC-cccC-CCCeEEEeCCCcc-cCHHHHHHCCCcEEEeCCCcEEEeCCCccCCCccCHHHHHhcCCccccCCCCccCCC
Confidence 431 1246 5799999999875 457899999999999999999999875211355779999986542
Q ss_pred CCCCccceeeeeeccccC---C-CcchhhhhhhHHHHHHHhcCC
Q psy14880 416 SSPLVLGGEVAMWTEYVD---D-QSLDGRLWPRTAAAAERLWSN 455 (497)
Q Consensus 416 ~~~~ilG~e~~lWsE~~~---~-~~l~~~l~PRl~A~AEr~W~~ 455 (497)
..++|+|+++|||+|.++ + .++++++|||++|+||++|++
T Consensus 283 ~~~~ilG~~~~lW~E~~~~~~t~~~~~~~~~pr~~a~Ae~~W~~ 326 (326)
T cd06564 283 GDPQILGGMFAIWNDDSDAGISEVDIYDRIFPALPAFAEKTWGG 326 (326)
T ss_pred CCCCcchhheeeecCCCCcCcCHHHHHHHHHHHHHHHHHHhcCC
Confidence 137999999999999997 3 789999999999999999985
No 10
>COG3525 Chb N-acetyl-beta-hexosaminidase [Carbohydrate transport and metabolism]
Probab=100.00 E-value=9.6e-65 Score=536.47 Aligned_cols=377 Identities=22% Similarity=0.268 Sum_probs=301.1
Q ss_pred CccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCC
Q psy14880 16 NKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYS 95 (497)
Q Consensus 16 ~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~ 95 (497)
.+...|.|.|||.|||+|+||||||+|++.+|++||.||.+| ||.|||||+||||||+||+++|-.
T Consensus 250 ~p~~~I~DaPRf~~rGllvDvaRqf~s~~~vk~~Id~laa~K-------------ln~~hlHLtddegwrleIk~~PkL- 315 (732)
T COG3525 250 FPAVTIVDAPRFAWRGLLVDVARQFHSTDDVKRLIDQLAAHK-------------LNVLHLHLTDDEGWRLEIKRYPKL- 315 (732)
T ss_pred eeeeecccCcccchhhhhHhhhhhcCCHHHHHHHHHHHHHhh-------------cceEEEeeccCcceeeccccCCcc-
Confidence 566668899999999999999999999999999999999999 699999999999999999999843
Q ss_pred CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcc
Q psy14880 96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPA 175 (497)
Q Consensus 96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PG 175 (497)
+..++.|| .-+.+.|.+. +|+=..+||||+++++||++||++|+|+||||||+||
T Consensus 316 -------------T~iga~R~---~de~~~Pq~g---------~~pe~~ggfytqd~~relv~yAsar~ItviPeiD~Pg 370 (732)
T COG3525 316 -------------TTIGAWRI---PDEPDLPQLG---------YGPERMGGFYTQDDIRELVAYASARQITVIPEIDMPG 370 (732)
T ss_pred -------------cccccccc---CCCcCCcccc---------cCcccccCcccHHHHHHHHHHHhhcCceecCCcCCcc
Confidence 11222232 1133334331 1111346789999999999999999999999999999
Q ss_pred cchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhh
Q psy14880 176 HAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMAL 255 (497)
Q Consensus 176 H~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~ 255 (497)
|+.+++.++|++.+..- .|..... -.+ ..+..|||+-+-+++|+++|++|+.++|| +.+|||||||
T Consensus 371 Ha~aav~A~p~~~l~~a-~~ds~~s-v~~---~~n~~ln~tl~~t~~fldkv~dEv~~lfP--s~~iHiGgDE------- 436 (732)
T COG3525 371 HARAAVVAYPDLNLGRA-DPDSYDS-VQA---YLNPVLNPTLDPTYQFLDKVLDEVADLFP--STTIHIGGDE------- 436 (732)
T ss_pred hhhhhhhhCcccccccc-CCCcchh-hhh---cccccccccccHHHHHHHHHHHHHHHhCC--cceEEeccch-------
Confidence 99999999885432111 1111100 001 12237999999999999999999999999 9999999999
Q ss_pred hhhhhcccCCCCCcCCCCCCCCCCCCcceeecccccC-CHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCC
Q psy14880 256 CRWQMCTRQQVPCVQPISGVDHLSPHLWTVFIKCWNN-TPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHN 334 (497)
Q Consensus 256 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~-~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~ 334 (497)
+..+.|+. +|.|++.|++.|++ +...|+.+|++++.+.+++ +
T Consensus 437 -----------------------------~~~~qwk~~sp~~q~l~~~~G~~----d~~~lq~~fi~q~~k~l~~----~ 479 (732)
T COG3525 437 -----------------------------FIDGQWKASSPLVQALMEKLGNK----DTFELQSYFITQVGKTLAS----K 479 (732)
T ss_pred -----------------------------hccCeeeccCHHHHHHHHHhccC----CcchhhHHHHHHHHHHHHh----c
Confidence 55678988 99999999999985 4578999999999999984 8
Q ss_pred CceEEEecCCCCCchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccc---cCC-----CCCCCh
Q psy14880 335 RTKLIVWSSHLTDPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGF---WGV-----TSYYRW 406 (497)
Q Consensus 335 g~~~~~W~d~~~~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~---~~~-----~~~~~~ 406 (497)
|++.++|+|++..... .+....+++.|++|.+.. ..-+++++||+||++|..++|||+.+ |.. ......
T Consensus 480 Gr~~igW~e~~~~~~~-~~~~~t~~~~vm~W~~~~--~ai~~akqg~dvv~tp~~~~ylD~~q~~~peepg~~~a~t~~l 556 (732)
T COG3525 480 GRRLIGWDEGAHGGDV-NGTALTANVTVMSWYGKD--KAIELAKQGYDVVLTPAQFVYLDMLQIAAPEEPGYSWATTTPL 556 (732)
T ss_pred CceEEeecchhccCCC-ccccccCceEEEEEecch--hhHHHHhhcccccccchhhhhhhhhcccccccCCCcccccccc
Confidence 9999999999876432 111112699999999874 46789999999999999999999976 210 111233
Q ss_pred hh-hhccCCCC--------CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHHHHHHHHHH
Q psy14880 407 QR-VYDNLLPS--------SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRFLEQRERLV 475 (497)
Q Consensus 407 ~~-iY~~~p~~--------~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl~~~~~~l~ 475 (497)
.+ .|..++.. .+.++|.++++|+|++.+ ..+.+++|||++|+|||+|+... +||..|..|+..+..+++
T Consensus 557 ~r~~y~~~~~g~~~~~de~~k~~~G~q~~lWse~~~~~~~f~~~vfprl~a~aEraw~p~a~~Dw~~~~~r~~~~~~l~~ 636 (732)
T COG3525 557 ERNKYAYDFAGKQPINDELAKRILGVQAALWSEHIQTRGRFEYMVFPRLAAAAERAWTPMAFNDWLYYLDRLSAQLPLLV 636 (732)
T ss_pred chhhhhhcccCcccCChHHhhhhhhhHHHHHHHHhhhhhHHHHHhcchHHHHHHhhCCchhhcchhhhhhhcchhcchhh
Confidence 44 67666543 379999999999999998 88999999999999999999865 799999999999999999
Q ss_pred HCCCCcccCC
Q psy14880 476 EMGIRAEVTT 485 (497)
Q Consensus 476 ~~Gi~~~p~~ 485 (497)
..+|.+.+.+
T Consensus 637 ~~~~~~~~~w 646 (732)
T COG3525 637 LISIPYNAWW 646 (732)
T ss_pred hhcccccccc
Confidence 8888776543
No 11
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers). In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury. GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00 E-value=1.4e-46 Score=381.90 Aligned_cols=287 Identities=17% Similarity=0.174 Sum_probs=210.0
Q ss_pred CceeeeccCC-CCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHH
Q psy14880 29 NRTKLIVWSS-HLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQE 107 (497)
Q Consensus 29 ~RG~mlDvaR-~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~ 107 (497)
.||+|||+|| +|+++++||++||.||.+| +|.||||++|. |++ +.++
T Consensus 1 ~RG~mlD~~R~~~~~~~~lk~~id~ma~~k-------------~N~l~lhl~D~--f~~--~~~p--------------- 48 (301)
T cd06565 1 FRGVHLDLKRNAVPKVSYLKKLLRLLALLG-------------ANGLLLYYEDT--FPY--EGEP--------------- 48 (301)
T ss_pred CceEEEecCCCCCCCHHHHHHHHHHHHHcC-------------CCEEEEEEecc--eec--CCCc---------------
Confidence 5999999999 9999999999999999999 69999999994 322 1111
Q ss_pred HHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCC
Q psy14880 108 LVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRY 187 (497)
Q Consensus 108 ~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~ 187 (497)
++ + ..+++||++||+||++||++|||+||||||+|||+.+++++ |+
T Consensus 49 --------------~~----------------~--~~~~~yT~~ei~ei~~yA~~~gI~vIPeid~pGH~~~~l~~-~~- 94 (301)
T cd06565 49 --------------EV----------------G--RMRGAYTKEEIREIDDYAAELGIEVIPLIQTLGHLEFILKH-PE- 94 (301)
T ss_pred --------------cc----------------c--cCCCCcCHHHHHHHHHHHHHcCCEEEecCCCHHHHHHHHhC-cc-
Confidence 11 0 11456999999999999999999999999999999998864 33
Q ss_pred CCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCCC
Q psy14880 188 GLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQVP 267 (497)
Q Consensus 188 ~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~ 267 (497)
+..+. .. +.++++|||++|+|++|+++|++|++++|+ +++|||||||+++++.
T Consensus 95 -~~~l~--~~---------~~~~~~l~~~~~~t~~fi~~li~ev~~~f~--s~~~HIG~DE~~~~g~------------- 147 (301)
T cd06565 95 -FRHLR--EV---------DDPPQTLCPGEPKTYDFIEEMIRQVLELHP--SKYIHIGMDEAYDLGR------------- 147 (301)
T ss_pred -ccccc--cc---------CCCCCccCCCChhHHHHHHHHHHHHHHhCC--CCeEEECCCcccccCC-------------
Confidence 33332 11 113478999999999999999999999999 9999999999654321
Q ss_pred CcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCCC
Q psy14880 268 CVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLTD 347 (497)
Q Consensus 268 ~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~ 347 (497)
.+ ++++.+. .+..+|+..|++++.+++++ +|+++++|+|++..
T Consensus 148 -------------------------~~----~~~~~~~----~~~~~l~~~~~~~v~~~v~~----~g~~~~~W~D~~~~ 190 (301)
T cd06565 148 -------------------------GR----SLRKHGN----LGRGELYLEHLKKVLKIIKK----RGPKPMMWDDMLRK 190 (301)
T ss_pred -------------------------CH----HHHHhcC----CCHHHHHHHHHHHHHHHHHH----cCCEEEEEhHHhcC
Confidence 11 2222232 24568999999999999984 89999999999876
Q ss_pred ---chhhccCCCCCcEEEEecCCCCch-----hHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccC----C-
Q psy14880 348 ---PEIILNYLDPKRYIIQTWVPRIDP-----LADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNL----L- 414 (497)
Q Consensus 348 ---~~~~~~~l~~~~~iv~~W~~~~~~-----~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~----p- 414 (497)
.+.....+ |+++++++|.+.... ......+.|..++++++...+--. +...+.-+.+- .
T Consensus 191 ~~~~~~~~~~l-~~~v~~~~W~y~~~~~~~~~~~~~~~~~~~~~~~~~g~~~w~~~-------~~~~~~~~~n~~~~~~~ 262 (301)
T cd06565 191 LSIEPEALSGL-PKLVTPVVWDYYADLDEHDRPIGLWKKYGSVFAVAWGASAWKGA-------TPPNDKHLENIKSWLKA 262 (301)
T ss_pred CCCChHHHhCC-CCCeEEEEecCcCCcchhhHhHHHHHHhCCCceEeeeechhccC-------CCCHHHHHHHHHHHHHH
Confidence 33333457 589999999987632 344556677777777653222110 11111111110 0
Q ss_pred CCCCCccceeeeeeccccCCCcchhhhhhhHHHHHHHhcC
Q psy14880 415 PSSPLVLGGEVAMWTEYVDDQSLDGRLWPRTAAAAERLWS 454 (497)
Q Consensus 415 ~~~~~ilG~e~~lWsE~~~~~~l~~~l~PRl~A~AEr~W~ 454 (497)
.....+.|.....|+|.... .....++|.+++.|+..|+
T Consensus 263 ~~~~~~~G~~~T~W~d~g~~-~~~~~~~p~~~~~~~~~~~ 301 (301)
T cd06565 263 AKKNGVQGILLTGWGDYGHE-AVLCELLPGLIPSLALALG 301 (301)
T ss_pred HHHCCCCEEEEEecCCCCCc-ccHHHHHHHHHHHHHHhcC
Confidence 11467899999999987654 3346899999999999985
No 12
>KOG2499|consensus
Probab=97.50 E-value=9.5e-05 Score=78.26 Aligned_cols=45 Identities=56% Similarity=0.926 Sum_probs=41.9
Q ss_pred ccccCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCC
Q psy14880 87 LLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGN 131 (497)
Q Consensus 87 e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~ 131 (497)
|+.++|+++..++||..++.++++||++|||||.+|+|.|+|+++
T Consensus 234 eL~~kGaYs~~~vYT~eDv~evV~yarlRGIRVlpEfD~PgHt~s 278 (542)
T KOG2499|consen 234 ELHRKGAYSPRHVYTREDVSEVVEYARLRGIRVLPEFDTPGHTGS 278 (542)
T ss_pred hhhhcCCCCcceeecHHHHHHHHHHHHhccceeeecccCCccccc
Confidence 455689999999999999999999999999999999999999986
No 13
>PF02638 DUF187: Glycosyl hydrolase like GH101; InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=95.72 E-value=0.19 Score=51.81 Aligned_cols=87 Identities=21% Similarity=0.233 Sum_probs=54.5
Q ss_pred HHHHHHHHHHHHHcCCEEEEecC--Ccccch-hhhhhcCCCCCCCcccccCCCCCCccc--cCCCCCcCCCCChhHHHHH
Q psy14880 150 PLEVQELVHYAKLRGIRVIFELD--APAHAG-NGWQWGPRYGLGDLIVCLNDPSWRANC--IQPPCGQLNPINPHVYTVL 224 (497)
Q Consensus 150 ~~ei~eiv~yA~~rgI~VIPEID--~PGH~~-a~l~~~p~~~~~~l~~~~~~~~~~~~~--~~~~~~~L~~~~p~t~~fl 224 (497)
.+=+..+|+-|.+|||+|.+=+. +.++.. .+.+.+|+- + +...+.|.... .....--|||++||+.+|+
T Consensus 69 ~DpL~~~I~eaHkrGlevHAW~~~~~~~~~~~~~~~~~p~~----~--~~~~~~~~~~~~~~~~~~~~lnP~~PeVr~~i 142 (311)
T PF02638_consen 69 FDPLEFMIEEAHKRGLEVHAWFRVGFNAPDVSHILKKHPEW----F--AVNHPGWVRTYEDANGGYYWLNPGHPEVRDYI 142 (311)
T ss_pred ccHHHHHHHHHHHcCCEEEEEEEeecCCCchhhhhhcCchh----h--eecCCCceeecccCCCCceEECCCCHHHHHHH
Confidence 55699999999999999999872 222221 122223321 0 00111111000 0011236999999999999
Q ss_pred HHHHHHHHhccCCCCCeEEE
Q psy14880 225 KDIYGELMGMAKYGDEMFHM 244 (497)
Q Consensus 225 ~~vl~E~~~lF~~~~~~iHi 244 (497)
.+++.|+..-++ -.=||+
T Consensus 143 ~~~v~Eiv~~Yd--vDGIhl 160 (311)
T PF02638_consen 143 IDIVKEIVKNYD--VDGIHL 160 (311)
T ss_pred HHHHHHHHhcCC--CCeEEe
Confidence 999999999888 777885
No 14
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20). The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits. Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff. Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in
Probab=95.58 E-value=0.0095 Score=62.27 Aligned_cols=42 Identities=60% Similarity=1.095 Sum_probs=37.2
Q ss_pred cCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCC
Q psy14880 90 RFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGN 131 (497)
Q Consensus 90 ~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~ 131 (497)
..|++++...||.++++++++||+.|||+|+||+|.|+|+..
T Consensus 57 ~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~a 98 (348)
T cd06562 57 KKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPEIDTPGHTGS 98 (348)
T ss_pred hccCcCCCceECHHHHHHHHHHHHHcCCEEEEeccCchhhHH
Confidence 346666778899999999999999999999999999999753
No 15
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=94.78 E-value=0.069 Score=53.95 Aligned_cols=71 Identities=14% Similarity=0.286 Sum_probs=42.0
Q ss_pred CCCHHHHHHHHHHHHHcCCEEEEecCCcc---------cchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCC
Q psy14880 147 VYTPLEVQELVHYAKLRGIRVIFELDAPA---------HAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPIN 217 (497)
Q Consensus 147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PG---------H~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~ 217 (497)
.+...+|+|||+||+++||.|+==...-+ ++..+++.+.+.|.. ++ ..+-++-.+
T Consensus 69 ~~~~~dl~elv~Ya~~KgVgi~lw~~~~~~~~~~~~~~~~~~~f~~~~~~Gv~--------------Gv--KidF~~~d~ 132 (273)
T PF10566_consen 69 PIPDFDLPELVDYAKEKGVGIWLWYHSETGGNVANLEKQLDEAFKLYAKWGVK--------------GV--KIDFMDRDD 132 (273)
T ss_dssp B-TT--HHHHHHHHHHTT-EEEEEEECCHTTBHHHHHCCHHHHHHHHHHCTEE--------------EE--EEE--SSTS
T ss_pred cCCccCHHHHHHHHHHcCCCEEEEEeCCcchhhHhHHHHHHHHHHHHHHcCCC--------------EE--eeCcCCCCC
Confidence 36668999999999999988765433322 123333332222211 11 124577789
Q ss_pred hhHHHHHHHHHHHHHh
Q psy14880 218 PHVYTVLKDIYGELMG 233 (497)
Q Consensus 218 p~t~~fl~~vl~E~~~ 233 (497)
.++.++.++|+++.++
T Consensus 133 Q~~v~~y~~i~~~AA~ 148 (273)
T PF10566_consen 133 QEMVNWYEDILEDAAE 148 (273)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 9999999999999987
No 16
>cd06568 GH20_SpHex_like A subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the N-acetylhexosaminidase from Streptomyces plicatus (SpHex). SpHex catalyzes the hydrolysis of N-acetyl-beta-hexosaminides. An Asp residue within the active site plays a critical role in substrate-assisted catalysis by orienting the 2-acetamido group and stabilizing the transition state. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Proteins belonging to this subgroup lack the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases.
Probab=94.63 E-value=0.028 Score=58.41 Aligned_cols=35 Identities=34% Similarity=0.627 Sum_probs=32.3
Q ss_pred CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
....||.++++++++||+.|||+|+||+|.|+|..
T Consensus 68 ~~~~YT~~di~elv~yA~~rgI~vIPEiD~PGH~~ 102 (329)
T cd06568 68 PGGYYTQEDYKDIVAYAAERHITVVPEIDMPGHTN 102 (329)
T ss_pred CCCcCCHHHHHHHHHHHHHcCCEEEEecCCcHHHH
Confidence 35689999999999999999999999999999964
No 17
>cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=94.61 E-value=0.03 Score=57.68 Aligned_cols=34 Identities=44% Similarity=0.762 Sum_probs=31.7
Q ss_pred CccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
+..||.++++++++||+.|||+|+||+|.|+|.+
T Consensus 62 ~~~yT~~di~elv~yA~~rgI~vIPEId~PGH~~ 95 (311)
T cd06570 62 GLYYTQEQIREVVAYARDRGIRVVPEIDVPGHAS 95 (311)
T ss_pred CCccCHHHHHHHHHHHHHcCCEEEEeecCccchH
Confidence 4579999999999999999999999999999964
No 18
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway. The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=94.44 E-value=0.03 Score=57.99 Aligned_cols=36 Identities=44% Similarity=0.652 Sum_probs=33.0
Q ss_pred CCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 95 SPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 95 ~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
.....||.++++++++||+.|||+|+||+|.|+|..
T Consensus 74 ~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~ 109 (326)
T cd06564 74 ANDGYYTKEEFKELIAYAKDRGVNIIPEIDSPGHSL 109 (326)
T ss_pred CCCCcccHHHHHHHHHHHHHcCCeEeccCCCcHHHH
Confidence 456789999999999999999999999999999964
No 19
>PF00728 Glyco_hydro_20: Glycosyl hydrolase family 20, catalytic domain; InterPro: IPR015883 Glycoside hydrolase family 20 GH20 from CAZY comprises enzymes with several known activities; beta-hexosaminidase (3.2.1.52 from EC); lacto-N-biosidase (3.2.1.140 from EC). Carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base in this family of enzymes. In the brain and other tissues, beta-hexosaminidase A degrades GM2 gangliosides; specifically, the enzyme hydrolyses terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides. There are 3 forms of beta-hexosaminidase: hexosaminidase A is a trimer, with one alpha, one beta-A and one beta-B chain; hexosaminidase B is a tetramer of two beta-A and two beta-B chains; and hexosaminidase S is a homodimer of alpha chains. The two beta chains are derived from the cleavage of a precursor. Mutations in the beta-chain lead to Sandhoff disease, a lysosomal storage disorder characterised by accumulation of GM2 ganglioside [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3RPM_A 1C7T_A 1QBA_A 1QBB_A 1C7S_A 3RCN_A 2YL8_A 2YL6_A 2YLL_A 2YL5_C ....
Probab=94.24 E-value=0.025 Score=58.52 Aligned_cols=41 Identities=51% Similarity=0.834 Sum_probs=33.6
Q ss_pred cCCCCCCCc---cCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 90 RFGAYSPAK---VYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 90 ~~g~~~~~~---~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
..|+.+... .||.++|+++++||+.|||+|+||+|.|+|.+
T Consensus 57 ~~ga~~~~~~~~~yT~~di~~lv~yA~~~gI~VIPeid~PGH~~ 100 (351)
T PF00728_consen 57 EKGAYRPSDAGGYYTKEDIRELVAYAKERGIEVIPEIDTPGHAE 100 (351)
T ss_dssp HTTTESTTCTESEBEHHHHHHHHHHHHHTT-EEEEEEEESSS-H
T ss_pred ccCccccccccccCCHHHHHHHHHHHHHcCCceeeeccCchHHH
Confidence 345555544 89999999999999999999999999999964
No 20
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=94.05 E-value=0.042 Score=57.67 Aligned_cols=34 Identities=47% Similarity=0.699 Sum_probs=31.8
Q ss_pred CccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
...||.++++++++||+.|||+|+||+|.|+|+.
T Consensus 80 ~~~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~ 113 (357)
T cd06563 80 GGFYTQEEIREIVAYAAERGITVIPEIDMPGHAL 113 (357)
T ss_pred CceECHHHHHHHHHHHHHcCCEEEEecCCchhHH
Confidence 4679999999999999999999999999999964
No 21
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=94.04 E-value=0.041 Score=56.39 Aligned_cols=35 Identities=37% Similarity=0.749 Sum_probs=32.3
Q ss_pred CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
....||+++++++++||+.||++|+||+|.|+|..
T Consensus 65 ~~~~yT~~di~elv~yA~~rgI~viPEiD~PGH~~ 99 (303)
T cd02742 65 PGGFYTYAQLKDIIEYAAARGIEVIPEIDMPGHST 99 (303)
T ss_pred CCCeECHHHHHHHHHHHHHcCCEEEEeccchHHHH
Confidence 45689999999999999999999999999999964
No 22
>cd06569 GH20_Sm-chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=93.52 E-value=0.054 Score=58.54 Aligned_cols=34 Identities=38% Similarity=0.619 Sum_probs=31.8
Q ss_pred CccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
...||+++|+++++||+.|||+|+||+|.|+|+.
T Consensus 91 ~g~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~ 124 (445)
T cd06569 91 SGYYSRADYIEILKYAKARHIEVIPEIDMPGHAR 124 (445)
T ss_pred CCccCHHHHHHHHHHHHHcCCEEEEccCCchhHH
Confidence 4579999999999999999999999999999975
No 23
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers). In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury. GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=89.84 E-value=0.21 Score=51.17 Aligned_cols=32 Identities=31% Similarity=0.431 Sum_probs=30.2
Q ss_pred cCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 99 VYTPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 99 ~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
.||.++++++++||+.||++|+|+++.|+|..
T Consensus 56 ~yT~~ei~ei~~yA~~~gI~vIPeid~pGH~~ 87 (301)
T cd06565 56 AYTKEEIREIDDYAAELGIEVIPLIQTLGHLE 87 (301)
T ss_pred CcCHHHHHHHHHHHHHcCCEEEecCCCHHHHH
Confidence 49999999999999999999999999999964
No 24
>COG1649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=87.94 E-value=6.9 Score=41.97 Aligned_cols=156 Identities=16% Similarity=0.041 Sum_probs=85.3
Q ss_pred CCCCceeeec--cCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChh
Q psy14880 26 VGHNRTKLIV--WSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPL 103 (497)
Q Consensus 26 ~~~~RG~mlD--vaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~ 103 (497)
+-+.||+=|| -+|..++.+.+++.+|.+..+.+ |++-.-..-+ .+. .|-+.
T Consensus 44 ~~eiRGvWltn~~~~v~~~~~el~~~ld~l~~ln~-------------NTv~~qV~~~--------G~~------lypS~ 96 (418)
T COG1649 44 PQEIRGVWLTNADSRVLFQRQELKDILDDLQKLNF-------------NTVYPQVWND--------GDA------LYPSA 96 (418)
T ss_pred cccceeEEEecCCCcccccHHHHHHHHHHHHHcCC-------------ceeEEEEecC--------ccc------ccccc
Confidence 3577999999 56777899999999999998863 5553333221 110 01000
Q ss_pred HHHHHHHHHHHcCceeeeecCC-CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhh
Q psy14880 104 EVQELVHYAKLRGVRVIFELDA-PAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQ 182 (497)
Q Consensus 104 ~i~~~~~y~~~rG~ri~~e~~~-p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~ 182 (497)
+.+.++. |+..+ ..++| +=+..+|+-|.+|||+|+|=++.-.=+-. .+
T Consensus 97 ---------------~~p~s~~~~~~~~-------~~~g~--------DpLa~~I~~AHkr~l~v~aWf~~~~~a~~-~s 145 (418)
T COG1649 97 ---------------VLPWSDGLPGVLG-------VDPGY--------DPLAFVIAEAHKRGLEVHAWFNPYRMAPP-TS 145 (418)
T ss_pred ---------------ccccccCcCcccC-------CCCCC--------ChHHHHHHHHHhcCCeeeechhhcccCCC-CC
Confidence 0011111 11000 01122 34899999999999999998764322111 11
Q ss_pred hcCCCCCCCcccccCCCCCCccccCC--CCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEE
Q psy14880 183 WGPRYGLGDLIVCLNDPSWRANCIQP--PCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFH 243 (497)
Q Consensus 183 ~~p~~~~~~l~~~~~~~~~~~~~~~~--~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iH 243 (497)
-.+.. .|+-. ....+.|...-... ..-.|||..|++.+|+.+++.|+..=++ ..-||
T Consensus 146 ~~~~~-~p~~~-~~~~~~~~~~~~~~~~~~~~ldPg~Pevq~~i~~lv~evV~~Yd--vDGIQ 204 (418)
T COG1649 146 PLTKR-HPHWL-TTKRPGWVYVRHQGWGKRVWLDPGIPEVQDFITSLVVEVVRNYD--VDGIQ 204 (418)
T ss_pred hhHhh-CCCCc-ccCCCCeEEEecCCceeeeEeCCCChHHHHHHHHHHHHHHhCCC--CCcee
Confidence 00000 11110 01111121110000 1346999999999999999999998777 44344
No 25
>PF13200 DUF4015: Putative glycosyl hydrolase domain
Probab=84.43 E-value=10 Score=39.27 Aligned_cols=77 Identities=21% Similarity=0.200 Sum_probs=54.2
Q ss_pred HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHH
Q psy14880 150 PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYG 229 (497)
Q Consensus 150 ~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~ 229 (497)
-.|+++|++.++++||.+|--|=+.-=...+. ..|+.+.. -.++..|.... ...-+||.++++.+.+-+|-.
T Consensus 60 i~D~~~l~~~l~e~gIY~IARIv~FkD~~la~-~~pe~av~----~~~G~~w~d~~---~~~WvnP~~~evw~Y~i~IA~ 131 (316)
T PF13200_consen 60 IKDLKALVKKLKEHGIYPIARIVVFKDPVLAE-AHPEWAVK----TKDGSVWRDNE---GEAWVNPYSKEVWDYNIDIAK 131 (316)
T ss_pred ccCHHHHHHHHHHCCCEEEEEEEEecChHHhh-hChhhEEE----CCCCCcccCCC---CCccCCCCCHHHHHHHHHHHH
Confidence 36899999999999999998887766443322 24443321 12334564321 234699999999999999999
Q ss_pred HHHhc
Q psy14880 230 ELMGM 234 (497)
Q Consensus 230 E~~~l 234 (497)
|++.+
T Consensus 132 Eaa~~ 136 (316)
T PF13200_consen 132 EAAKL 136 (316)
T ss_pred HHHHc
Confidence 99875
No 26
>PRK12568 glycogen branching enzyme; Provisional
Probab=81.19 E-value=26 Score=40.49 Aligned_cols=118 Identities=19% Similarity=0.286 Sum_probs=67.9
Q ss_pred HHHHHHHHHcCce-ee--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880 106 QELVHYAKLRGVR-VI--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW 181 (497)
Q Consensus 106 ~~~~~y~~~rG~r-i~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l 181 (497)
++++.|.+.-|+. |+ |-...|. +..|.. ...|-|+++..| |.+|++++|+.|.++||.||-.+- |.|+..--
T Consensus 273 ~~ll~ylk~LGvt~I~LmPi~e~~~--~~~wGY-~~~~~~a~~~~~G~~~dfk~lV~~~H~~Gi~VIlD~V-~nH~~~d~ 348 (730)
T PRK12568 273 EQLIPYVQQLGFTHIELLPITEHPF--GGSWGY-QPLGLYAPTARHGSPDGFAQFVDACHRAGIGVILDWV-SAHFPDDA 348 (730)
T ss_pred HHHHHHHHHcCCCEEEECccccCCC--CCCCCC-CCCcCCccCcccCCHHHHHHHHHHHHHCCCEEEEEec-cccCCccc
Confidence 3467888888884 22 2221221 122321 112334444433 899999999999999999999874 88986521
Q ss_pred hhcCCCCCCCcccc--cC-----CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 182 QWGPRYGLGDLIVC--LN-----DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 182 ~~~p~~~~~~l~~~--~~-----~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
. ....+..-... .+ ...|.. -.+|-.+|++.+|+-+.+.-.++-|.
T Consensus 349 ~--~l~~fdg~~~Ye~~d~~~g~~~~W~~-------~~~N~~~peVr~~li~~a~~Wl~eyh 401 (730)
T PRK12568 349 H--GLAQFDGAALYEHADPREGMHRDWNT-------LIYNYGRPEVTAYLLGSALEWIEHYH 401 (730)
T ss_pred c--ccccCCCccccccCCCcCCccCCCCC-------eecccCCHHHHHHHHHHHHHHHHHhC
Confidence 1 00000000000 00 011211 14699999999999998888877665
No 27
>PLN02361 alpha-amylase
Probab=80.41 E-value=4.9 Score=42.98 Aligned_cols=84 Identities=13% Similarity=0.060 Sum_probs=47.1
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc----CCC-CCC------CcccccCCCCC-CccccCCCCCcCCCC
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG----PRY-GLG------DLIVCLNDPSW-RANCIQPPCGQLNPI 216 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~----p~~-~~~------~l~~~~~~~~~-~~~~~~~~~~~L~~~ 216 (497)
|++|++++|+-|.++||+||-.+= +-|+....... ... +.+ ....|..+..+ +....-...--||-.
T Consensus 74 t~~el~~li~~~h~~gi~vi~D~V-~NH~~g~~~~~~~~y~~~~g~~~~wd~~~~~~~~~g~~~~~~~~~~~~lpDLd~~ 152 (401)
T PLN02361 74 SEHLLKSLLRKMKQYNVRAMADIV-INHRVGTTQGHGGMYNRYDGIPLPWDEHAVTSCTGGLGNRSTGDNFNGVPNIDHT 152 (401)
T ss_pred CHHHHHHHHHHHHHcCCEEEEEEc-cccccCCCCCCCCCcccCCCCcCCCCccccccccCCCCCccCCCCCccCCccCCC
Confidence 899999999999999999999863 46763211100 000 000 00001111000 000000111258889
Q ss_pred ChhHHHHHHHHHHHHHh
Q psy14880 217 NPHVYTVLKDIYGELMG 233 (497)
Q Consensus 217 ~p~t~~fl~~vl~E~~~ 233 (497)
||++.+.+++.+.-+.+
T Consensus 153 np~Vr~~l~~~~~wl~~ 169 (401)
T PLN02361 153 QHFVRKDIIGWLIWLRN 169 (401)
T ss_pred CHHHHHHHHHHHHHHHh
Confidence 99999999988876655
No 28
>PRK14705 glycogen branching enzyme; Provisional
Probab=78.38 E-value=29 Score=42.38 Aligned_cols=120 Identities=22% Similarity=0.320 Sum_probs=68.5
Q ss_pred HHHHHHHHHHcCce-ee--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhh
Q psy14880 105 VQELVHYAKLRGVR-VI--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNG 180 (497)
Q Consensus 105 i~~~~~y~~~rG~r-i~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~ 180 (497)
+++++.|.+.-|+. |+ |=...|.. ..|.+ ...+-|.++..| |.+|++++|+.|.++||.||-.+ +|+|+..-
T Consensus 768 ~~~lldYlk~LGvt~IeLmPv~e~p~~--~swGY-~~~~y~ap~~ryGt~~dfk~lVd~~H~~GI~VILD~-V~nH~~~d 843 (1224)
T PRK14705 768 AKELVDYVKWLGFTHVEFMPVAEHPFG--GSWGY-QVTSYFAPTSRFGHPDEFRFLVDSLHQAGIGVLLDW-VPAHFPKD 843 (1224)
T ss_pred HHHHHHHHHHhCCCEEEECccccCCCC--CCCCC-CccccCCcCcccCCHHHHHHHHHHHHHCCCEEEEEe-ccccCCcc
Confidence 34567888888884 32 22222221 22311 122333444444 89999999999999999999864 78887432
Q ss_pred hhhc-CCC-CCCCc--cccc--CCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 181 WQWG-PRY-GLGDL--IVCL--NDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 181 l~~~-p~~-~~~~l--~~~~--~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
.+. ... +.+.. ..+. ....|.. ..+|..++++.+|+.+.+.-.++-|.
T Consensus 844 -~~~l~~fdg~~~y~~~d~~~g~~~~Wg~-------~~fn~~~~eVr~fli~~a~~Wl~eyh 897 (1224)
T PRK14705 844 -SWALAQFDGQPLYEHADPALGEHPDWGT-------LIFDFGRTEVRNFLVANALYWLDEFH 897 (1224)
T ss_pred -hhhhhhcCCCcccccCCcccCCCCCCCC-------ceecCCCHHHHHHHHHHHHHHHHHhC
Confidence 010 000 00000 0000 0012221 24788999999999999888887665
No 29
>PRK14706 glycogen branching enzyme; Provisional
Probab=77.11 E-value=31 Score=39.31 Aligned_cols=121 Identities=19% Similarity=0.193 Sum_probs=68.1
Q ss_pred HHHHHHHHHHcCce-ee--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhh
Q psy14880 105 VQELVHYAKLRGVR-VI--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNG 180 (497)
Q Consensus 105 i~~~~~y~~~rG~r-i~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~ 180 (497)
+++++.|-+.-|+. |+ |-...|.. ..|.. ...+-|+++..| |.+|++.+|+-|.++||.||-.+ +|.|+..-
T Consensus 170 ~~~l~~ylk~lG~t~velmPv~e~~~~--~~wGY-~~~~~~~~~~~~g~~~~~~~lv~~~H~~gi~VilD~-v~nH~~~~ 245 (639)
T PRK14706 170 AHRLGEYVTYMGYTHVELLGVMEHPFD--GSWGY-QVTGYYAPTSRLGTPEDFKYLVNHLHGLGIGVILDW-VPGHFPTD 245 (639)
T ss_pred HHHHHHHHHHcCCCEEEccchhcCCCC--CCCCc-CcccccccccccCCHHHHHHHHHHHHHCCCEEEEEe-cccccCcc
Confidence 34466888888884 32 22222221 12311 112334555555 89999999999999999999876 56887532
Q ss_pred hhhcCCC-CCCCcccccCC----CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 181 WQWGPRY-GLGDLIVCLND----PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 181 l~~~p~~-~~~~l~~~~~~----~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
-...... +.+........ ..|.. -.+|..+|++.+|+.+.+.-.++-|.
T Consensus 246 ~~~l~~~dg~~~y~~~~~~~g~~~~w~~-------~~~~~~~~eVr~~l~~~~~~W~~e~~ 299 (639)
T PRK14706 246 ESGLAHFDGGPLYEYADPRKGYHYDWNT-------YIFDYGRNEVVMFLIGSALKWLQDFH 299 (639)
T ss_pred hhhhhccCCCcceeccCCcCCcCCCCCC-------cccCCCCHHHHHHHHHHHHHHHHHhC
Confidence 1000000 00000000000 11211 14788999999999999888876555
No 30
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=76.68 E-value=12 Score=33.61 Aligned_cols=86 Identities=15% Similarity=0.032 Sum_probs=55.8
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCC-CCccccCCCCCcCCCCChhHHHH
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPS-WRANCIQPPCGQLNPINPHVYTV 223 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~-~~~~~~~~~~~~L~~~~p~t~~f 223 (497)
...++++=++|+|+-|+++||+|+-=+|.--+..++ ..+|+-... ..++.+ .......+....+|++.| -.++
T Consensus 38 hp~L~~Dllge~v~a~h~~Girv~ay~~~~~d~~~~-~~HPeW~~~----~~~G~~~~~~~~~~~~~~~~c~ns~-Y~e~ 111 (132)
T PF14871_consen 38 HPGLKRDLLGEQVEACHERGIRVPAYFDFSWDEDAA-ERHPEWFVR----DADGRPMRGERFGYPGWYTCCLNSP-YREF 111 (132)
T ss_pred CCCCCcCHHHHHHHHHHHCCCEEEEEEeeecChHHH-HhCCceeeE----CCCCCCcCCCCcCCCCceecCCCcc-HHHH
Confidence 345777788999999999999999999998777665 355553211 112221 000000111234666554 5599
Q ss_pred HHHHHHHHHhccC
Q psy14880 224 LKDIYGELMGMAK 236 (497)
Q Consensus 224 l~~vl~E~~~lF~ 236 (497)
+...++|+++.++
T Consensus 112 ~~~~i~Ei~~~y~ 124 (132)
T PF14871_consen 112 LLEQIREILDRYD 124 (132)
T ss_pred HHHHHHHHHHcCC
Confidence 9999999999887
No 31
>PRK05402 glycogen branching enzyme; Provisional
Probab=76.12 E-value=77 Score=36.66 Aligned_cols=123 Identities=20% Similarity=0.243 Sum_probs=67.4
Q ss_pred ChhHHH-HHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHcCCEEEEecCCcc
Q psy14880 101 TPLEVQ-ELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAK-VYTPLEVQELVHYAKLRGIRVIFELDAPA 175 (497)
Q Consensus 101 ~~~~i~-~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~-~YT~~ei~eiv~yA~~rgI~VIPEID~PG 175 (497)
+-..+. +++.|-+.-|+.. .|-...|.. ..|.. ...+-|+.+. +=|.+|+++||+-|.++||.||=.+- |.
T Consensus 263 ~~~~i~~~l~~ylk~LGv~~i~L~Pi~e~~~~--~~~GY-~~~~y~ai~~~~Gt~~dfk~lV~~~H~~Gi~VilD~V-~N 338 (726)
T PRK05402 263 SYRELADQLIPYVKEMGFTHVELLPIAEHPFD--GSWGY-QPTGYYAPTSRFGTPDDFRYFVDACHQAGIGVILDWV-PA 338 (726)
T ss_pred CHHHHHHHHHHHHHHcCCCEEEECCcccCCCC--CCCCC-CcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEEC-CC
Confidence 334444 4568888888753 232222221 12211 1111223333 33899999999999999999998764 78
Q ss_pred cchhhhhhcCCCCCCCccccc--C-----CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 176 HAGNGWQWGPRYGLGDLIVCL--N-----DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 176 H~~a~l~~~p~~~~~~l~~~~--~-----~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
|+..-- .....+....... + ...|.. ..+|..+|++.+++.+++.-.++-|.
T Consensus 339 H~~~~~--~~~~~~~~~~~y~~~~~~~~~~~~w~~-------~~~n~~~~~v~~~l~~~~~~W~~e~~ 397 (726)
T PRK05402 339 HFPKDA--HGLARFDGTALYEHADPREGEHPDWGT-------LIFNYGRNEVRNFLVANALYWLEEFH 397 (726)
T ss_pred CCCCCc--cchhccCCCcceeccCCcCCccCCCCC-------ccccCCCHHHHHHHHHHHHHHHHHhC
Confidence 885310 0000000000000 0 001211 14788999999999998888876555
No 32
>PF02679 ComA: (2R)-phospho-3-sulfolactate synthase (ComA); InterPro: IPR003830 Methanogenic archaea produce methane via the anaerobic reduction of acetate or single carbon compounds []. Coenzyme M (CoM; 2-mercaptoethanesulphonic acid) serves as the terminal methyl carrier for this process. Previously thought to be unique to methanogenic archaea, CoM has also been found in methylotrophic bacteria. Biosynthesis of CoM begins with the Michael addition of sulphite to phosphoenolpyruvate, forming 2-phospho-3-sulpholactate (PSL). This reaction is catalyzed by members of this family, PSL synthase (ComA) []. Subsequently, PSL is dephosphorylated by phosphosulpholactate phosphatase (ComB) to form 3-sulpholactate [], which is then converted to 3-sulphopyruvate by L-sulpholactate dehydrogenase (ComC; 1.1.1.272 from EC) []. Sulphopyruvate decarboxylase (ComDE; 4.1.1.79 from EC) converts 3-sulphopyruvate to sulphoacetaldehyde []. Reductive thiolation of sulphoacetaldehyde is the final step.; GO: 0019295 coenzyme M biosynthetic process; PDB: 1U83_A 1QWG_A.
Probab=76.03 E-value=19 Score=35.99 Aligned_cols=31 Identities=10% Similarity=-0.005 Sum_probs=22.9
Q ss_pred cCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcc
Q psy14880 23 DEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCG 56 (497)
Q Consensus 23 D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~ 56 (497)
-.||-.--..|+|-+ +++..++++++.++.|
T Consensus 7 ~KPR~~GlT~v~Dkg---lg~~~~~dlLe~ag~y 37 (244)
T PF02679_consen 7 EKPRSRGLTMVIDKG---LGLRYLEDLLESAGDY 37 (244)
T ss_dssp -SS-SSS-EEEEESS-----HHHHHHHHHHHGGG
T ss_pred CCCCCCCcEEEecCC---CCHHHHHHHHHHhhhh
Confidence 457776678899988 9999999999999987
No 33
>TIGR01515 branching_enzym alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase. A sequence from Arabidopsis thaliana, GP|9294564, scores just above trusted, but appears either to contain corrupt sequence or, more likely, to be a pseudogene as some of the conserved catalytic residues common to the alpha amylase family are not conserved here.
Probab=75.70 E-value=43 Score=37.88 Aligned_cols=122 Identities=19% Similarity=0.182 Sum_probs=66.4
Q ss_pred HHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhh
Q psy14880 107 ELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQ 182 (497)
Q Consensus 107 ~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~ 182 (497)
+++.|.+.-|+.- .|-...|. ...|... ..+-|+.+..| |.+|+++||+-|.++||.||-.+ ++.|+..--.
T Consensus 161 ~l~dyl~~LGvt~i~L~Pi~e~~~--~~~wGY~-~~~y~~~~~~~Gt~~dlk~lV~~~H~~Gi~VilD~-V~NH~~~~~~ 236 (613)
T TIGR01515 161 QLIPYVKELGFTHIELLPVAEHPF--DGSWGYQ-VTGYYAPTSRFGTPDDFMYFVDACHQAGIGVILDW-VPGHFPKDDH 236 (613)
T ss_pred HHHHHHHHcCCCEEEECCcccCCC--CCCCCCC-cccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe-cccCcCCccc
Confidence 3558888878753 23222221 1122111 11223344444 78999999999999999999876 4688863211
Q ss_pred hcCCC-CCCCc-ccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 183 WGPRY-GLGDL-IVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 183 ~~p~~-~~~~l-~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
..... +.+.. ..+........++ ...+|..+|++.+|+.+++.-.++-|.
T Consensus 237 ~~~~~~~~~~y~~~~~~~~~~~~w~----~~~~~~~~~~Vr~~l~~~~~~W~~ey~ 288 (613)
T TIGR01515 237 GLAEFDGTPLYEHKDPRDGEHWDWG----TLIFDYGRPEVRNFLVANALYWAEFYH 288 (613)
T ss_pred hhhccCCCcceeccCCccCcCCCCC----CceecCCCHHHHHHHHHHHHHHHHHhC
Confidence 00000 00000 0000000000111 124788999999999999998887655
No 34
>PRK12313 glycogen branching enzyme; Provisional
Probab=75.11 E-value=78 Score=35.89 Aligned_cols=123 Identities=20% Similarity=0.244 Sum_probs=67.2
Q ss_pred hHHH-HHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880 103 LEVQ-ELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHA 177 (497)
Q Consensus 103 ~~i~-~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~ 177 (497)
..+. +++.|-+.-|+.. .|=...|.. ..|.. ...+-|+.+..| |.+|+++||+-|.++||.||-.+- |.|+
T Consensus 170 ~~~~~~ll~yl~~LGv~~i~L~Pi~~~~~~--~~~GY-~~~~y~~i~~~~Gt~~d~k~lv~~~H~~Gi~VilD~V-~nH~ 245 (633)
T PRK12313 170 RELADELIPYVKEMGYTHVEFMPLMEHPLD--GSWGY-QLTGYFAPTSRYGTPEDFMYLVDALHQNGIGVILDWV-PGHF 245 (633)
T ss_pred HHHHHHHHHHHHHcCCCEEEeCchhcCCCC--CCCCC-CCcCcCcCCCCCCCHHHHHHHHHHHHHCCCEEEEEEC-CCCC
Confidence 3434 4568888878753 232222321 11211 111222333333 899999999999999999999864 7888
Q ss_pred hhhhhhcCCCC-CCCcc-cccCC---CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 178 GNGWQWGPRYG-LGDLI-VCLND---PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 178 ~a~l~~~p~~~-~~~l~-~~~~~---~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
..--...+... .+... ..+.. ..|. ...+|-.+|++.+|+.+++.-.++-|.
T Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~-------~~~~n~~~~~vr~~l~~~~~~W~~~~~ 302 (633)
T PRK12313 246 PKDDDGLAYFDGTPLYEYQDPRRAENPDWG-------ALNFDLGKNEVRSFLISSALFWLDEYH 302 (633)
T ss_pred CCCcccccccCCCcceeecCCCCCcCCCCC-------CcccCCCCHHHHHHHHHHHHHHHHHhC
Confidence 64210000000 00000 00000 0121 125888999999999999888887655
No 35
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=74.99 E-value=15 Score=38.59 Aligned_cols=83 Identities=16% Similarity=0.196 Sum_probs=47.8
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccc-ccCCCCCCccccCCCCCcCCCCChhHHHHH
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIV-CLNDPSWRANCIQPPCGQLNPINPHVYTVL 224 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~-~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl 224 (497)
|.|.-+.+..+++.|+++||.||--+ |.++...+-. . .+|+... ..++.. ...+ .....|+.+|...+.+
T Consensus 42 G~ydF~~lD~~l~~a~~~Gi~viL~~--~~~~~P~Wl~--~-~~Pe~~~~~~~g~~-~~~g---~~~~~~~~~p~yr~~~ 112 (374)
T PF02449_consen 42 GQYDFSWLDRVLDLAAKHGIKVILGT--PTAAPPAWLY--D-KYPEILPVDADGRR-RGFG---SRQHYCPNSPAYREYA 112 (374)
T ss_dssp TB---HHHHHHHHHHHCTT-EEEEEE--CTTTS-HHHH--C-CSGCCC-B-TTTSB-EECC---CSTT-HCCHHHHHHHH
T ss_pred CeeecHHHHHHHHHHHhccCeEEEEe--cccccccchh--h-hcccccccCCCCCc-CccC---CccccchhHHHHHHHH
Confidence 34777889999999999999999644 4444443321 1 1333321 111110 0000 1134789999999999
Q ss_pred HHHHHHHHhccCC
Q psy14880 225 KDIYGELMGMAKY 237 (497)
Q Consensus 225 ~~vl~E~~~lF~~ 237 (497)
.++++++++-+.+
T Consensus 113 ~~~~~~l~~~y~~ 125 (374)
T PF02449_consen 113 RRFIRALAERYGD 125 (374)
T ss_dssp HHHHHHHHHHHTT
T ss_pred HHHHHHHHhhccc
Confidence 9999999988774
No 36
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=74.21 E-value=36 Score=38.56 Aligned_cols=121 Identities=24% Similarity=0.382 Sum_probs=70.2
Q ss_pred hHHHHHHHHHHHcCce-eeeecCCCCCCC-CCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 103 LEVQELVHYAKLRGVR-VIFELDAPAHAG-NGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 103 ~~i~~~~~y~~~rG~r-i~~e~~~p~~~~-~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
+..++++.|.+.-|+. |++ +.+..|++ .+|.. ..-|-|++...| |.+|++.+|..|-++||.||=.. +|||...
T Consensus 165 e~a~~llpYl~elG~T~IEL-MPv~e~p~~~sWGY-q~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD~-V~~HF~~ 241 (628)
T COG0296 165 ELAIELLPYLKELGITHIEL-MPVAEHPGDRSWGY-QGTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILDW-VPNHFPP 241 (628)
T ss_pred HHHHHHhHHHHHhCCCEEEE-cccccCCCCCCCCC-CcceeccccccCCCHHHHHHHHHHHHHcCCEEEEEe-cCCcCCC
Confidence 4567788899988884 431 11122222 22311 122445554444 99999999999999999999763 7999876
Q ss_pred hhhhcCCCCCCCcccccC--------CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 180 GWQWGPRYGLGDLIVCLN--------DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 180 ~l~~~p~~~~~~l~~~~~--------~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
--.....+. +.. +.. .++|.. ...|-+.+|+..|+-+-..--++-|.
T Consensus 242 d~~~L~~fd-g~~--~~e~~~~~~~~~~~Wg~-------~i~~~gr~EVR~Fll~nal~Wl~~yH 296 (628)
T COG0296 242 DGNYLARFD-GTF--LYEHEDPRRGEHTDWGT-------AIFNYGRNEVRNFLLANALYWLEEYH 296 (628)
T ss_pred CcchhhhcC-Ccc--ccccCCcccccCCCccc-------chhccCcHHHHHHHHHHHHHHHHHhC
Confidence 211100000 000 011 123321 23455699999998877766666655
No 37
>PF00128 Alpha-amylase: Alpha amylase, catalytic domain; InterPro: IPR006047 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain, such as alpha-amylase, belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. The maltogenic alpha-amylase is an enzyme which catalyses hydrolysis of (1-4)-alpha-D-glucosidic linkages in polysaccharides so as to remove successive alpha-maltose residues from the non-reducing ends of the chains in the conversion of starch to maltose. Other enzymes include neopullulanase, which hydrolyses pullulan to panose, and cyclomaltodextrinase, which hydrolyses cyclodextrins. This entry represents the catalytic domain found in several protein members of this family. It has a structure consisting of an 8 stranded alpha/beta barrel that contains the active site, interrupted by a ~70 amino acid calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 3FAX_A 3FAW_A 2DH3_B 2DH2_A 1CIU_A 1A47_A 3BMW_A 3BMV_A 2FH8_A 2FH6_A ....
Probab=69.14 E-value=14 Score=36.42 Aligned_cols=86 Identities=24% Similarity=0.359 Sum_probs=51.2
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc----C--CCCCCCcccc--------------cCCCCCCcc----
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG----P--RYGLGDLIVC--------------LNDPSWRAN---- 204 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~----p--~~~~~~l~~~--------------~~~~~~~~~---- 204 (497)
|.+|+++||+-|.+|||+||-.+- +.|+...-.+. . ...+++.... .+...|...
T Consensus 50 t~~d~~~Lv~~~h~~gi~VilD~V-~NH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (316)
T PF00128_consen 50 TMEDFKELVDAAHKRGIKVILDVV-PNHTSDDHPWFQDSLNYFDNPYSDYYYWRDGEGSPPGNWYSYFGGSNWEYDDWGD 128 (316)
T ss_dssp HHHHHHHHHHHHHHTTCEEEEEEE-TSEEETTSHHHHHHHTHTTSTTGTTBEEESBTSTTSSTBBCSTTTSSEESCHHTH
T ss_pred hhhhhhhhhhccccccceEEEeee-ccccccccccccccccccccccccceeeccccccccccccccccccccccccccc
Confidence 789999999999999999997653 44776543320 0 0001111000 001111110
Q ss_pred --ccCCCCCcCCCCChhHHHHHHHHHHHHHhcc
Q psy14880 205 --CIQPPCGQLNPINPHVYTVLKDIYGELMGMA 235 (497)
Q Consensus 205 --~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF 235 (497)
...+....||..+|++.+++.+++...++.+
T Consensus 129 ~~~~~~~~~dln~~n~~v~~~i~~~~~~w~~~g 161 (316)
T PF00128_consen 129 EYQFWSDLPDLNYENPEVREYIIDVLKFWIEEG 161 (316)
T ss_dssp CHSSSTTSEEBETTSHHHHHHHHHHHHHHHHTT
T ss_pred ccccccccchhhhhhhhhhhhhcccccchhhce
Confidence 0011123588999999999999998888643
No 38
>TIGR02100 glgX_debranch glycogen debranching enzyme GlgX. This family consists of the GlgX protein from the E. coli glycogen operon and probable equivalogs from other prokaryotic species. GlgX is not required for glycogen biosynthesis, but instead acts as a debranching enzyme for glycogen catabolism. This model distinguishes GlgX from pullanases and other related proteins that also operate on alpha-1,6-glycosidic linkages. In the wide band between the trusted and noise cutoffs are functionally similar enzymes, mostly from plants, that act similarly but usually are termed isoamylase.
Probab=68.11 E-value=28 Score=40.00 Aligned_cols=91 Identities=20% Similarity=0.282 Sum_probs=54.7
Q ss_pred CCCCCC----CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccc------ccCC-CCCCccccCCCCC
Q psy14880 143 SPAKVY----TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIV------CLND-PSWRANCIQPPCG 211 (497)
Q Consensus 143 s~~~~Y----T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~------~~~~-~~~~~~~~~~~~~ 211 (497)
.+++.| |.+|+|++|+-|.++||+||=.+ ++.|+...-...|...+..+.. -.+. ..+..+ ..-..
T Consensus 233 a~d~~y~~~g~~~efk~LV~~~H~~GI~VIlDv-V~NHt~~~~~~~~~~~~~~~d~~~yy~~~~~~~~~~~~~--~g~gn 309 (688)
T TIGR02100 233 APEPRYLASGQVAEFKTMVRALHDAGIEVILDV-VYNHTAEGNELGPTLSFRGIDNASYYRLQPDDKRYYIND--TGTGN 309 (688)
T ss_pred ccChhhcCCCCHHHHHHHHHHHHHCCCEEEEEE-CcCCccCcCCCCCcccccCCCCCcceEecCCCCceecCC--CCccc
Confidence 455566 78999999999999999999876 4578764311112111111100 0000 000000 00013
Q ss_pred cCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 212 QLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 212 ~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
.||..+|++.+++.+.+.-.+.-|.
T Consensus 310 ~ln~~~p~vr~~i~d~l~~W~~e~g 334 (688)
T TIGR02100 310 TLNLSHPRVLQMVMDSLRYWVTEMH 334 (688)
T ss_pred cccCCCHHHHHHHHHHHHHHHHHcC
Confidence 6899999999999999888876555
No 39
>PLN02960 alpha-amylase
Probab=66.97 E-value=83 Score=37.12 Aligned_cols=125 Identities=14% Similarity=0.114 Sum_probs=69.0
Q ss_pred ChhHHH-HHHHHHHHcCcee-e--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcc
Q psy14880 101 TPLEVQ-ELVHYAKLRGVRV-I--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPA 175 (497)
Q Consensus 101 ~~~~i~-~~~~y~~~rG~ri-~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PG 175 (497)
+-..+. +.+.|.+.-|+.- + |-...|.. .+|.. ...+-|+++..| |.+|+++||+-|.++||.||-.+ +|.
T Consensus 414 tf~~~~e~~LdYLk~LGvt~IeLmPv~e~~~~--~swGY-~~~~yfa~~~~yGtp~dfk~LVd~aH~~GI~VILDv-V~N 489 (897)
T PLN02960 414 SFKEFTQKVLPHVKKAGYNAIQLIGVQEHKDY--SSVGY-KVTNFFAVSSRFGTPDDFKRLVDEAHGLGLLVFLDI-VHS 489 (897)
T ss_pred CHHHHHHHHHHHHHHcCCCEEEECCcccCCCC--CCCCC-CcccCCCcccccCCHHHHHHHHHHHHHCCCEEEEEe-ccc
Confidence 334554 5678888888853 2 21111221 11211 112223444434 89999999999999999999875 678
Q ss_pred cchhhhhhc-CCC-CCCCcccccC----CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 176 HAGNGWQWG-PRY-GLGDLIVCLN----DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 176 H~~a~l~~~-p~~-~~~~l~~~~~----~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
|+..--..+ ... +.+++-.-.+ ...|. +..+|..++++..|+-+.+.-.++-|.
T Consensus 490 H~~~d~~~~L~~FDG~~~~Yf~~~~~g~~~~WG-------~~~fNy~~~eVr~fLlsna~yWl~Eyh 549 (897)
T PLN02960 490 YAAADEMVGLSLFDGSNDCYFHSGKRGHHKRWG-------TRMFKYGDHEVLHFLLSNLNWWVTEYR 549 (897)
T ss_pred ccCCccccchhhcCCCccceeecCCCCccCCCC-------CcccCCCCHHHHHHHHHHHHHHHHHHC
Confidence 876520000 000 0000000000 01221 135789999999999999888876554
No 40
>PLN02784 alpha-amylase
Probab=64.75 E-value=21 Score=41.70 Aligned_cols=30 Identities=27% Similarity=0.241 Sum_probs=26.7
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|++|++++|+-|.++||.||-.| ++-|+..
T Consensus 566 T~~ELk~LI~a~H~~GIkVIlDi-ViNH~ag 595 (894)
T PLN02784 566 TIDELKDLVKSFHEVGIKVLGDA-VLNHRCA 595 (894)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-Ccccccc
Confidence 99999999999999999999876 6777754
No 41
>COG3525 Chb N-acetyl-beta-hexosaminidase [Carbohydrate transport and metabolism]
Probab=62.54 E-value=5.8 Score=44.58 Aligned_cols=36 Identities=42% Similarity=0.608 Sum_probs=32.8
Q ss_pred CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCC
Q psy14880 96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGN 131 (497)
Q Consensus 96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~ 131 (497)
-+..||.++++++++|++-|.+-++||+|+|+|++.
T Consensus 339 ~ggfytqd~~relv~yAsar~ItviPeiD~PgHa~a 374 (732)
T COG3525 339 MGGFYTQDDIRELVAYASARQITVIPEIDMPGHARA 374 (732)
T ss_pred ccCcccHHHHHHHHHHHhhcCceecCCcCCcchhhh
Confidence 367899999999999999999999999999999754
No 42
>smart00642 Aamy Alpha-amylase domain.
Probab=62.53 E-value=8.5 Score=35.90 Aligned_cols=30 Identities=37% Similarity=0.566 Sum_probs=26.5
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|++++|+-|+++||+||-.+ +|-|+..
T Consensus 68 t~~d~~~lv~~~h~~Gi~vilD~-V~NH~~~ 97 (166)
T smart00642 68 TMEDFKELVDAAHARGIKVILDV-VINHTSD 97 (166)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-CCCCCCC
Confidence 78999999999999999999986 5677655
No 43
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=61.48 E-value=63 Score=32.99 Aligned_cols=136 Identities=14% Similarity=0.099 Sum_probs=70.3
Q ss_pred cCChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCccc
Q psy14880 99 VYTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAH 176 (497)
Q Consensus 99 ~y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH 176 (497)
..+.++|.++++-.+.+|+.+. .-+|. .| ....|.+.-+ ..+- +.+++++.-+++|+++++-|+ |+-
T Consensus 26 ~~s~~~v~~~~~~~~~~~iP~d~i~iD~------~w--~~~~g~f~~d~~~FP--dp~~mi~~l~~~G~k~~l~i~-P~i 94 (303)
T cd06592 26 DINQETVLNYAQEIIDNGFPNGQIEIDD------NW--ETCYGDFDFDPTKFP--DPKGMIDQLHDLGFRVTLWVH-PFI 94 (303)
T ss_pred CcCHHHHHHHHHHHHHcCCCCCeEEeCC------Cc--cccCCccccChhhCC--CHHHHHHHHHHCCCeEEEEEC-Cee
Confidence 4567788888877777776422 12221 11 1223333222 1232 599999999999999998776 322
Q ss_pred chhhhhhcCCCC-CCCcccccCC-CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880 177 AGNGWQWGPRYG-LGDLIVCLND-PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADE 248 (497)
Q Consensus 177 ~~a~l~~~p~~~-~~~l~~~~~~-~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE 248 (497)
... -..+.+.. .+-+..-.++ .+.........+..+|.+||++.++..+.++++..--. -..|=+=.-|
T Consensus 95 ~~~-s~~~~e~~~~g~~vk~~~g~~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~G--vdg~w~D~~E 165 (303)
T cd06592 95 NTD-SENFREAVEKGYLVSEPSGDIPALTRWWNGTAAVLDFTNPEAVDWFLSRLKSLQEKYG--IDSFKFDAGE 165 (303)
T ss_pred CCC-CHHHHhhhhCCeEEECCCCCCCcccceecCCcceEeCCCHHHHHHHHHHHHHHHHHhC--CcEEEeCCCC
Confidence 110 00111100 0111100111 11100000112457999999999999999998883223 3444444444
No 44
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=61.07 E-value=15 Score=36.59 Aligned_cols=22 Identities=14% Similarity=0.113 Sum_probs=17.7
Q ss_pred eeeccCCCCCCHHHHHHhhhhhhcc
Q psy14880 32 KLIVWSSHLTDPEIILNYLDPQTCG 56 (497)
Q Consensus 32 ~mlDvaR~~~~~~~lk~~id~ma~~ 56 (497)
.|+|-+ +++..++++++.|+.|
T Consensus 3 ~v~dkg---l~~~~~~d~Le~~g~y 24 (237)
T TIGR03849 3 MVLDKG---LPPKFVEDYLKVCGDY 24 (237)
T ss_pred eEecCC---CCHHHHHHHHHHhhhh
Confidence 355543 6999999999999987
No 45
>PRK09441 cytoplasmic alpha-amylase; Reviewed
Probab=60.70 E-value=27 Score=38.05 Aligned_cols=30 Identities=20% Similarity=0.257 Sum_probs=26.6
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|+++||+-|.+|||.||-.+ ++-|+..
T Consensus 79 t~~dl~~Li~~~H~~Gi~vi~D~-V~NH~~~ 108 (479)
T PRK09441 79 TKEELLNAIDALHENGIKVYADV-VLNHKAG 108 (479)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-CcccccC
Confidence 89999999999999999999876 5778763
No 46
>PLN00196 alpha-amylase; Provisional
Probab=59.73 E-value=33 Score=37.04 Aligned_cols=32 Identities=16% Similarity=0.250 Sum_probs=27.5
Q ss_pred CCCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 147 VYTPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
+=|++|+++||+-|.++||.||-.+ ++-|+.+
T Consensus 88 fGt~~elk~Lv~~aH~~GIkVilDv-V~NH~~~ 119 (428)
T PLN00196 88 YGNEAQLKSLIEAFHGKGVQVIADI-VINHRTA 119 (428)
T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEE-CccCccc
Confidence 4589999999999999999999875 5777764
No 47
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=59.67 E-value=29 Score=35.80 Aligned_cols=22 Identities=27% Similarity=0.475 Sum_probs=19.0
Q ss_pred CcCCCCChhHHHHHHHHHHHHH
Q psy14880 211 GQLNPINPHVYTVLKDIYGELM 232 (497)
Q Consensus 211 ~~L~~~~p~t~~fl~~vl~E~~ 232 (497)
..+|.+||++.+...+.++++.
T Consensus 131 ~~~Dftnp~a~~ww~~~~~~~~ 152 (317)
T cd06594 131 GVLDLTNPAARDWFKQVIKEML 152 (317)
T ss_pred eeeecCCHHHHHHHHHHHHHHh
Confidence 5799999999999988888763
No 48
>TIGR02402 trehalose_TreZ malto-oligosyltrehalose trehalohydrolase. Members of this family are the trehalose biosynthetic enzyme malto-oligosyltrehalose trehalohydrolase, formally known as 4-alpha-D-{(1-4)-alpha-D-glucano}trehalose trehalohydrolase (EC 3.2.1.141). It is the TreZ protein of the TreYZ pathway for trehalose biosynthesis, and alternative to the OtsAB system.
Probab=58.06 E-value=1.5e+02 Score=32.95 Aligned_cols=119 Identities=22% Similarity=0.332 Sum_probs=63.0
Q ss_pred HHHHHHHHHHHcCce---eeeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 104 EVQELVHYAKLRGVR---VIFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 104 ~i~~~~~y~~~rG~r---i~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
.|.+-+.|.+.-|+. +.|-.+.|+. ..|... ...-|..+..| |.+|+++||+-|.++||+||=.+ ++.|+..
T Consensus 112 gi~~~l~yl~~LGv~~i~L~Pi~~~~~~--~~~GY~-~~~~~~~~~~~G~~~e~k~lV~~aH~~Gi~VilD~-V~NH~~~ 187 (542)
T TIGR02402 112 AAIEKLPYLADLGITAIELMPVAQFPGT--RGWGYD-GVLPYAPHNAYGGPDDLKALVDAAHGLGLGVILDV-VYNHFGP 187 (542)
T ss_pred HHHHhhHHHHHcCCCEEEeCccccCCCC--CCCCCC-ccCccccccccCCHHHHHHHHHHHHHCCCEEEEEE-ccCCCCC
Confidence 445556777777774 3333323321 122111 11112333333 88999999999999999999876 4678753
Q ss_pred hhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCCh---hHHHHHHHHHHHHHhccC
Q psy14880 180 GWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINP---HVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 180 ~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p---~t~~fl~~vl~E~~~lF~ 236 (497)
--...+... | .-.-.....|. ..+|..+| ++.+++.+.+.-.++-|.
T Consensus 188 ~~~~~~~~~-~-y~~~~~~~~wg--------~~~n~~~~~~~~vr~~i~~~~~~W~~e~~ 237 (542)
T TIGR02402 188 EGNYLPRYA-P-YFTDRYSTPWG--------AAINFDGPGSDEVRRYILDNALYWLREYH 237 (542)
T ss_pred ccccccccC-c-cccCCCCCCCC--------CccccCCCcHHHHHHHHHHHHHHHHHHhC
Confidence 211000000 1 00000011121 24777777 888888877776665444
No 49
>PLN03244 alpha-amylase; Provisional
Probab=56.63 E-value=47 Score=38.69 Aligned_cols=77 Identities=17% Similarity=0.105 Sum_probs=51.9
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccccc------C----CCCCCccccCCCCCcCCCCC
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCL------N----DPSWRANCIQPPCGQLNPIN 217 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~------~----~~~~~~~~~~~~~~~L~~~~ 217 (497)
=|.+|++.+|.-|.++||.||-.+ +|.|+..--. .++....-.+ + ...|+ +..+|..+
T Consensus 438 GTPeDLK~LVD~aH~~GI~VILDv-V~NH~~~d~~----~GL~~fDGt~~~Yf~~~~~g~~~~WG-------s~~fnyg~ 505 (872)
T PLN03244 438 GTPDDFKRLVDEAHGLGLLVFLDI-VHSYAAADEM----VGLSLFDGSNDCYFHTGKRGHHKHWG-------TRMFKYGD 505 (872)
T ss_pred CCHHHHHHHHHHHHHCCCEEEEEe-cCccCCCccc----cchhhcCCCccceeccCCCCccCCCC-------CceecCCC
Confidence 389999999999999999999876 4788864210 0111100000 0 01121 23688999
Q ss_pred hhHHHHHHHHHHHHHhccC
Q psy14880 218 PHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 218 p~t~~fl~~vl~E~~~lF~ 236 (497)
+++..||-+.+.-.++-|.
T Consensus 506 ~EVr~FLLsna~yWleEyh 524 (872)
T PLN03244 506 LDVLHFLISNLNWWITEYQ 524 (872)
T ss_pred HHHHHHHHHHHHHHHHHhC
Confidence 9999999999988887655
No 50
>PLN02447 1,4-alpha-glucan-branching enzyme
Probab=56.62 E-value=1.9e+02 Score=33.82 Aligned_cols=116 Identities=16% Similarity=0.134 Sum_probs=66.3
Q ss_pred HHHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880 106 QELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW 181 (497)
Q Consensus 106 ~~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l 181 (497)
++.+.|.+.-|+.- .|-...|.. ..|... ..+-|.++..| |.+|+++||+-|.++||.||-.+ +|.|+..--
T Consensus 254 ~~~L~ylk~LG~t~I~LmPi~e~~~~--~~wGY~-~~~~fa~~~~~Gtp~dlk~LVd~aH~~GI~VilDv-V~nH~~~~~ 329 (758)
T PLN02447 254 DDVLPRIKALGYNAVQLMAIQEHAYY--GSFGYH-VTNFFAVSSRSGTPEDLKYLIDKAHSLGLRVLMDV-VHSHASKNT 329 (758)
T ss_pred HHHHHHHHHcCCCEEEECCccccCCC--CCCCcC-cccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe-ccccccccc
Confidence 45677888778742 222222211 112111 11222333334 78999999999999999999875 688886421
Q ss_pred hhcCCCCCCCcc---------cccCC-CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 182 QWGPRYGLGDLI---------VCLND-PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 182 ~~~p~~~~~~l~---------~~~~~-~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
. .++..+. .+.+. ..|. ...+|-.++++.+||.+.+.-.++-|.
T Consensus 330 ~----~gl~~fDg~~~~Yf~~~~~g~~~~w~-------~~~~N~~~~eVr~fLl~~~~~Wl~ey~ 383 (758)
T PLN02447 330 L----DGLNGFDGTDGSYFHSGPRGYHWLWD-------SRLFNYGNWEVLRFLLSNLRWWLEEYK 383 (758)
T ss_pred c----ccccccCCCCccccccCCCCCcCcCC-------CceecCCCHHHHHHHHHHHHHHHHHhC
Confidence 0 0111110 00000 0111 125888999999999999988887544
No 51
>PRK14510 putative bifunctional 4-alpha-glucanotransferase/glycogen debranching enzyme; Provisional
Probab=53.62 E-value=67 Score=39.45 Aligned_cols=89 Identities=17% Similarity=0.220 Sum_probs=55.0
Q ss_pred CCCCCC---CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcc------cccCC-CCCCccccCCCC-C
Q psy14880 143 SPAKVY---TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLI------VCLND-PSWRANCIQPPC-G 211 (497)
Q Consensus 143 s~~~~Y---T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~------~~~~~-~~~~~~~~~~~~-~ 211 (497)
.++..| |.+|+|++|+-|.++||+||-.+ ++-|+...-...|.++...+. ...+. ..+..+ ..+ +
T Consensus 236 a~dp~yg~~~~~efk~lV~~~H~~GI~VILDv-V~NHt~~~~~~~p~~~~~~~d~~~yy~~~~~~~~~y~~~---~G~gn 311 (1221)
T PRK14510 236 APDPRLAPGGEEEFAQAIKEAQSAGIAVILDV-VFNHTGESNHYGPTLSAYGSDNSPYYRLEPGNPKEYENW---WGCGN 311 (1221)
T ss_pred CcChhhccCcHHHHHHHHHHHHHCCCEEEEEE-ccccccCCCCCCCcccccCCCCCCceEecCCCCCcccCC---CCCCC
Confidence 455666 79999999999999999999875 567876532222222111110 00000 001000 011 3
Q ss_pred cCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 212 QLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 212 ~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
.+|..+|.+.+++.+.+.-.+. |.
T Consensus 312 ~~n~~~p~v~~~i~d~lr~Wv~-~g 335 (1221)
T PRK14510 312 LPNLERPFILRLPMDVLRSWAK-RG 335 (1221)
T ss_pred ccccCCHHHHHHHHHHHHHHHH-hC
Confidence 5777899999999999998888 65
No 52
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=53.49 E-value=55 Score=33.34 Aligned_cols=122 Identities=19% Similarity=0.302 Sum_probs=63.7
Q ss_pred CChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880 100 YTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAHA 177 (497)
Q Consensus 100 y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~ 177 (497)
++.+++.++++-.+..|+-+. ..+|+ + |......+.|.-+ ..| -+.+++++..+++||+|+.-++ |. .
T Consensus 21 ~~~~~v~~~~~~~~~~~iP~d~~~lD~-~-----w~~~~~~~~f~~d~~~F--Pd~~~~i~~l~~~G~~~~~~~~-P~-i 90 (308)
T cd06593 21 YDEEEVNEFADGMRERNLPCDVIHLDC-F-----WMKEFQWCDFEFDPDRF--PDPEGMLSRLKEKGFKVCLWIN-PY-I 90 (308)
T ss_pred CCHHHHHHHHHHHHHcCCCeeEEEEec-c-----cccCCcceeeEECcccC--CCHHHHHHHHHHCCCeEEEEec-CC-C
Confidence 467788888877777887543 22321 1 1100001122111 112 1588999999999999999776 42 1
Q ss_pred hh---hhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHh
Q psy14880 178 GN---GWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMG 233 (497)
Q Consensus 178 ~a---~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~ 233 (497)
.. ..+...+.++ +..-.++.++....-.+....+|.+||++.++.++.++++.+
T Consensus 91 ~~~~~~~~e~~~~g~--~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~ 147 (308)
T cd06593 91 AQKSPLFKEAAEKGY--LVKKPDGSVWQWDLWQPGMGIIDFTNPDACKWYKDKLKPLLD 147 (308)
T ss_pred CCCchhHHHHHHCCe--EEECCCCCeeeecccCCCcccccCCCHHHHHHHHHHHHHHHH
Confidence 10 0000000110 100001111100000123357999999999999999998876
No 53
>PRK03705 glycogen debranching enzyme; Provisional
Probab=52.66 E-value=60 Score=37.13 Aligned_cols=88 Identities=19% Similarity=0.196 Sum_probs=53.8
Q ss_pred CCCCCCC------HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccc--c----cCC--CCCCccccCC
Q psy14880 143 SPAKVYT------PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIV--C----LND--PSWRANCIQP 208 (497)
Q Consensus 143 s~~~~YT------~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~--~----~~~--~~~~~~~~~~ 208 (497)
.+++.|. .+|+|++|+-|.++||+||-.+ ++.|+...-...|...+..+.. . .+. ..|...
T Consensus 228 a~d~~ygt~~~~~~~efk~LV~~~H~~GI~VIlDv-V~NHt~~~~~~~~~~~~~~~d~~~yy~~~~~g~~~~~~g~---- 302 (658)
T PRK03705 228 ALDPAYASGPETALDEFRDAVKALHKAGIEVILDV-VFNHSAELDLDGPTLSLRGIDNRSYYWIREDGDYHNWTGC---- 302 (658)
T ss_pred ccccccCCCCcchHHHHHHHHHHHHHCCCEEEEEE-cccCccCcCCCCcchhcccCCCccceEECCCCCcCCCCCc----
Confidence 4555563 3699999999999999999876 4567764211112111100000 0 000 011110
Q ss_pred CCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 209 PCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 209 ~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
...||..+|++.+++.+++.-.++-|.
T Consensus 303 -g~~ln~~~p~Vr~~iid~l~~W~~e~g 329 (658)
T PRK03705 303 -GNTLNLSHPAVVDWAIDCLRYWVETCH 329 (658)
T ss_pred -cCcccCCCHHHHHHHHHHHHHHHHHhC
Confidence 136889999999999999998887666
No 54
>TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase. Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
Probab=51.00 E-value=45 Score=37.09 Aligned_cols=30 Identities=30% Similarity=0.483 Sum_probs=26.4
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|+++||+-|.++||+||-.+ +|-|+..
T Consensus 73 t~~~~~~lv~~ah~~gi~vilD~-v~NH~~~ 102 (543)
T TIGR02403 73 TMADFEELVSEAKKRNIKIMLDM-VFNHTST 102 (543)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-Ccccccc
Confidence 89999999999999999999876 5777753
No 55
>PRK14459 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=50.87 E-value=45 Score=35.39 Aligned_cols=78 Identities=19% Similarity=0.116 Sum_probs=51.5
Q ss_pred eEEeccCCcccccccccCCCCCCCccCChhHHHHH-HHHHHHcCceeeeecCC-CCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880 74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQEL-VHYAKLRGVRVIFELDA-PAHAGNGWQWGPRFGAYSPAKVYTPL 151 (497)
Q Consensus 74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~-~~y~~~rG~ri~~e~~~-p~~~~~~~~~~~~~g~~s~~~~YT~~ 151 (497)
+-||..|++--.--.| .++.|..+++.+. .+|++..|.++.+++.. ++. -=|.+
T Consensus 242 vSLha~d~e~R~~l~p------~n~~~~l~~ll~a~~~~~~~~grrv~ieyvLi~Gv------------------NDs~e 297 (373)
T PRK14459 242 VSLHAPDDELRDELVP------VNTRWKVDEVLDAARYYADATGRRVSIEYALIRDI------------------NDQPW 297 (373)
T ss_pred EEeCCCCHHHHHHhcC------cccCCCHHHHHHHHHHHHHHhCCEEEEEEEEeCCC------------------CCCHH
Confidence 5677777654433333 1345666666664 67777788888877752 332 16889
Q ss_pred HHHHHHHHHHHc-----CCEEEEecCCcc
Q psy14880 152 EVQELVHYAKLR-----GIRVIFELDAPA 175 (497)
Q Consensus 152 ei~eiv~yA~~r-----gI~VIPEID~PG 175 (497)
++++|+++.+.. .|.+||--.++|
T Consensus 298 ~a~~L~~llk~~~~~~~~VNLIpyNp~~~ 326 (373)
T PRK14459 298 RADLLGKKLHGRGGGWVHVNLIPLNPTPG 326 (373)
T ss_pred HHHHHHHHHhhccCCCeEEEEEccCCCCC
Confidence 999999999987 567777554443
No 56
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=50.30 E-value=53 Score=32.66 Aligned_cols=103 Identities=14% Similarity=0.072 Sum_probs=66.0
Q ss_pred ccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCC
Q psy14880 17 KAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSP 96 (497)
Q Consensus 17 ~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~ 96 (497)
+++.-+|.=.|.+-...| +|.+.|++.|+..-.+.+ ..+-+ ||-+|+
T Consensus 20 ~~g~yID~lKfg~Gt~~l------~~~~~l~eki~la~~~~V------------------~v~~G-Gtl~E~-------- 66 (237)
T TIGR03849 20 VCGDYITFVKFGWGTSAL------IDRDIVKEKIEMYKDYGI------------------KVYPG-GTLFEI-------- 66 (237)
T ss_pred HhhhheeeEEecCceEee------ccHHHHHHHHHHHHHcCC------------------eEeCC-ccHHHH--------
Confidence 344455655566544433 677889999987666653 12233 444332
Q ss_pred CccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCC
Q psy14880 97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDA 173 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~ 173 (497)
.+.+..+++-+++++.-|+..+ |+. + | .-..+.++..++|+.++++|..|+||+-.
T Consensus 67 --~~~q~~~~~Yl~~~k~lGf~~I-EiS--~--G--------------~~~i~~~~~~rlI~~~~~~g~~v~~EvG~ 122 (237)
T TIGR03849 67 --AHSKGKFDEYLNECDELGFEAV-EIS--D--G--------------SMEISLEERCNLIERAKDNGFMVLSEVGK 122 (237)
T ss_pred --HHHhhhHHHHHHHHHHcCCCEE-EEc--C--C--------------ccCCCHHHHHHHHHHHHhCCCeEeccccc
Confidence 2334567777788888888642 221 0 1 12389999999999999999999999643
No 57
>cd07201 cPLA2_Grp-IVB-IVD-IVE-IVF Group IVB, IVD, IVE, and IVF cytosolic phospholipase A2; catalytic domain; Ca-dependent. Group IVB, IVD, IVE, and IVF cPLA2 consists of two domains: the regulatory C2 domain and alpha/beta hydrolase PLA2 domain. Group IVB, IVD, IVE, and IVF cPLA2 are also referred to as cPLA2-beta, -delta, -epsilon, and -zeta respectively. cPLA2-beta is approximately 30% identical to cPLA2-alpha and it shows low enzymatic activity compared to cPLA2alpha. cPLA2-beta hydrolyzes palmitic acid from 1-[14C]palmitoyl-2-arachidonoyl-PC and arachidonic acid from 1-palmitoyl-2[14C]arachidonoyl-PC, but not from 1-O-alkyl-2[3H]arachidonoyl-PC. cPLA2-delta, -epsilon, and -zeta are approximately 45-50% identical to cPLA2-beta and 31-37% identical to cPLA2-alpha. It's possible that cPLA2-beta, -delta, -epsilon, and -zeta may have arisen by gene duplication from an ancestral gene. The catalytic domain of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bon
Probab=47.39 E-value=49 Score=36.69 Aligned_cols=37 Identities=16% Similarity=-0.016 Sum_probs=24.4
Q ss_pred hhCCccceeccCCCCCCceeeeccCCCCCCHHHHHHhh
Q psy14880 13 HFQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYL 50 (497)
Q Consensus 13 ~~~~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~i 50 (497)
+.++....|.|.|....++---|..| ..|.+.|.+++
T Consensus 283 ~~r~~~~~~~~~~~~~~~~~~~~~~~-~~p~~~~~~~~ 319 (541)
T cd07201 283 WTRDKVNDIEDEPPLPPRPPERLTTL-LTPGGPLSQAF 319 (541)
T ss_pred hccccccccccCcCcCCCCCcccccc-cCCCCHHHHHH
Confidence 44777777888887766666666555 66666655543
No 58
>COG0134 TrpC Indole-3-glycerol phosphate synthase [Amino acid transport and metabolism]
Probab=44.49 E-value=39 Score=33.99 Aligned_cols=73 Identities=16% Similarity=0.253 Sum_probs=52.5
Q ss_pred CCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHH
Q psy14880 147 VYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKD 226 (497)
Q Consensus 147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~ 226 (497)
.++.+++++|+++|.++|++++-|+..--++.-+++..+. +. -+|-.|=++++.-.+
T Consensus 139 ~L~~~~l~el~~~A~~LGm~~LVEVh~~eEl~rAl~~ga~-----iI------------------GINnRdL~tf~vdl~ 195 (254)
T COG0134 139 ALDDEQLEELVDRAHELGMEVLVEVHNEEELERALKLGAK-----II------------------GINNRDLTTLEVDLE 195 (254)
T ss_pred hcCHHHHHHHHHHHHHcCCeeEEEECCHHHHHHHHhCCCC-----EE------------------EEeCCCcchheecHH
Confidence 4899999999999999999999999999888887753321 11 023334445555555
Q ss_pred HHHHHHhccCCCCCeEEE
Q psy14880 227 IYGELMGMAKYGDEMFHM 244 (497)
Q Consensus 227 vl~E~~~lF~~~~~~iHi 244 (497)
...++++..| ...+=|
T Consensus 196 ~t~~la~~~p--~~~~~I 211 (254)
T COG0134 196 TTEKLAPLIP--KDVILI 211 (254)
T ss_pred HHHHHHhhCC--CCcEEE
Confidence 6788889999 554444
No 59
>COG0436 Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism]
Probab=44.48 E-value=24 Score=37.51 Aligned_cols=24 Identities=29% Similarity=0.436 Sum_probs=22.1
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
-.||++++++|++.|+++||-||=
T Consensus 178 av~~~~~l~~i~~~a~~~~i~ii~ 201 (393)
T COG0436 178 AVYSKEELKAIVELAREHDIIIIS 201 (393)
T ss_pred cCCCHHHHHHHHHHHHHcCeEEEE
Confidence 359999999999999999999886
No 60
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=43.92 E-value=1.5e+02 Score=30.59 Aligned_cols=22 Identities=14% Similarity=0.166 Sum_probs=19.5
Q ss_pred CcCCCCChhHHHHHHHHHHHHH
Q psy14880 211 GQLNPINPHVYTVLKDIYGELM 232 (497)
Q Consensus 211 ~~L~~~~p~t~~fl~~vl~E~~ 232 (497)
..+|.+||++.+...+.++++.
T Consensus 125 ~~~Dftnp~a~~ww~~~~~~~~ 146 (317)
T cd06600 125 VYPDFTNPDTREWWAGLFSEWL 146 (317)
T ss_pred cccCCCChHHHHHHHHHHHHHh
Confidence 3689999999999999998876
No 61
>cd00452 KDPG_aldolase KDPG and KHG aldolase. This family belongs to the class I adolases whose reaction mechanism involves Schiff base formation between a substrate carbonyl and lysine residue in the active site. 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase, is best known for its role in the Entner-Doudoroff pathway of bacteria, where it catalyzes the reversible cleavage of KDPG to pyruvate and glyceraldehyde-3-phosphate. 2-keto-4-hydroxyglutarate (KHG) aldolase, which has enzymatic specificity toward glyoxylate, forming KHG in the presence of pyruvate, and is capable of regulating glyoxylate levels in the glyoxylate bypass, an alternate pathway when bacteria are grown on acetate carbon sources.
Probab=42.36 E-value=37 Score=32.12 Aligned_cols=67 Identities=19% Similarity=0.241 Sum_probs=45.1
Q ss_pred HHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHh
Q psy14880 154 QELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMG 233 (497)
Q Consensus 154 ~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~ 233 (497)
.+++++++.+|+.+||++.+|.++..+++... +.. .+.|..+...+ .++++..
T Consensus 87 ~~~~~~~~~~~~~~i~gv~t~~e~~~A~~~Ga-----d~i------------------~~~p~~~~g~~----~~~~l~~ 139 (190)
T cd00452 87 PEVVKAANRAGIPLLPGVATPTEIMQALELGA-----DIV------------------KLFPAEAVGPA----YIKALKG 139 (190)
T ss_pred HHHHHHHHHcCCcEECCcCCHHHHHHHHHCCC-----CEE------------------EEcCCcccCHH----HHHHHHh
Confidence 57999999999999999999998877654321 111 12344443344 4455566
Q ss_pred ccCCCCCeEEEcCCc
Q psy14880 234 MAKYGDEMFHMGADE 248 (497)
Q Consensus 234 lF~~~~~~iHiGgDE 248 (497)
.||. -+.+=+||=.
T Consensus 140 ~~~~-~p~~a~GGI~ 153 (190)
T cd00452 140 PFPQ-VRFMPTGGVS 153 (190)
T ss_pred hCCC-CeEEEeCCCC
Confidence 6764 5788888865
No 62
>PLN02607 1-aminocyclopropane-1-carboxylate synthase
Probab=40.63 E-value=71 Score=34.54 Aligned_cols=23 Identities=4% Similarity=0.273 Sum_probs=21.3
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
..||++++++|++.|++++|.||
T Consensus 215 ~~~s~e~l~~l~~~~~~~~i~lI 237 (447)
T PLN02607 215 ATVQRSVLEDILDFVVRKNIHLV 237 (447)
T ss_pred cccCHHHHHHHHHHHHHCCCEEE
Confidence 45999999999999999999887
No 63
>PRK14465 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=40.28 E-value=2.7e+02 Score=29.25 Aligned_cols=113 Identities=15% Similarity=0.119 Sum_probs=73.7
Q ss_pred hCCccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhc-ccccccchhhHHhhhhceeEEeccCCcccccccccCC
Q psy14880 14 FQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTC-GTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFG 92 (497)
Q Consensus 14 ~~~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~-~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g 92 (497)
.+...+++.+.|.-+.|.-+.-++++| |++.+.+.++..+. .+ ..+-+.. +.+|.+.
T Consensus 210 ~~~~LaiSLhA~~~e~R~~l~Pi~~~~-~le~ll~al~~~~~~~~--------------r~v~iey-------vLI~GvN 267 (342)
T PRK14465 210 EPYNFAISLNHPDPNGRLQIMDIEEKF-PLEELLQAAKDFTRELK--------------RRITFEY-------VMIPGVN 267 (342)
T ss_pred cCceEEEEecCCChhhcceEeeccccC-CHHHHHHHHHHHHHHcC--------------CEEEEEE-------EEECCcc
Confidence 346778888999999999999999888 88888887775543 22 1111110 0122221
Q ss_pred CCCCCccCChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880 93 AYSPAKVYTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 93 ~~~~~~~y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPE 170 (497)
=+.++++++.++.+..+.++. .++..+ + ++-..-|++++.++.+.-+++||.|..-
T Consensus 268 -------Ds~eda~~L~~ll~~l~~kVnLIPyN~~---~------------~~~~~ps~e~i~~F~~~L~~~Gi~v~~R 324 (342)
T PRK14465 268 -------MGRENANKLVKIARSLDCKINVIPLNTE---F------------FGWRRPTDDEVAEFIMLLEPAGVPILNR 324 (342)
T ss_pred -------CCHHHHHHHHHHHhhCCCcEEEEccCCC---C------------CCCCCCCHHHHHHHHHHHHHCCCeEEEe
Confidence 235688888888876566553 233211 0 1123378999999999999999998763
No 64
>PF01212 Beta_elim_lyase: Beta-eliminating lyase; InterPro: IPR001597 This domain is found in many tryptophanases (tryptophan indole-lyase, TNase), tyrosine phenol-lyases (TPL) and threonine aldolases. It is involved in the degradation of amino acids. The glycine cleavage system is composed of four proteins: P, T, L and H. In Bacillus subtilis, the P 'protein' is an heterodimer of two subunits. The glycine cleavage system catalyses the degradation of glycine. The P protein binds the alpha-amino group of glycine through its pyridoxal phosphate cofactor; CO(2) is released and the remaining methylamine moiety is then transferred to the lipoamide cofactor of the H protein; GO: 0016829 lyase activity, 0006520 cellular amino acid metabolic process; PDB: 3PJ0_C 2C44_C 2V0Y_A 2OQX_A 2V1P_A 1AX4_B 3LWS_A 1C7G_A 1V72_A 2YHK_B ....
Probab=38.86 E-value=27 Score=35.56 Aligned_cols=24 Identities=21% Similarity=0.394 Sum_probs=17.8
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
+..||.+|+++|.++|+++||.|.
T Consensus 139 G~~~s~~el~ai~~~a~~~gl~lh 162 (290)
T PF01212_consen 139 GTVYSLEELRAISELAREHGLPLH 162 (290)
T ss_dssp SB---HHHHHHHHHHHHHHT-EEE
T ss_pred CeeCCHHHHHHHHHHHHhCceEEE
Confidence 456999999999999999998774
No 65
>PF07555 NAGidase: beta-N-acetylglucosaminidase ; InterPro: IPR011496 This family consists of both eukaryotic and prokaryotic hyaluronidases. Human Q9HAR0 from SWISSPROT is expressed during meningioma []. Clostridium perfringens, P26831 from SWISSPROT, is involved in pathogenesis and is likely to act on connectivity tissue during gas gangrene []. It catalyses the random hydrolysis of 1->4-linkages between N-acetyl-beta-D-glucosamine and D-glucuronate residues in hyaluronate.; PDB: 2WB5_B 2V5C_B 2VUR_A 2V5D_A 2YDS_A 2CBI_A 2XPK_A 2CBJ_B 2J62_A 2X0Y_A ....
Probab=38.52 E-value=85 Score=32.42 Aligned_cols=22 Identities=32% Similarity=0.603 Sum_probs=15.9
Q ss_pred CCHH---HHHHHHHHHHHcCCEEEE
Q psy14880 148 YTPL---EVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 148 YT~~---ei~eiv~yA~~rgI~VIP 169 (497)
|+.+ ++++|++.|++.||+.|=
T Consensus 50 Yp~~el~~l~~L~~~a~~~~V~Fv~ 74 (306)
T PF07555_consen 50 YPEEELAELKELADAAKANGVDFVY 74 (306)
T ss_dssp --HHHHHHHHHHHHHHHHTT-EEEE
T ss_pred CCHHHHHHHHHHHHHHHHcCCEEEE
Confidence 6655 678889999999999885
No 66
>COG1168 MalY Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]
Probab=38.32 E-value=35 Score=36.14 Aligned_cols=25 Identities=28% Similarity=0.494 Sum_probs=22.9
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
+.+.|++|++.|.+.|+++||+||-
T Consensus 172 Grvwt~eeL~~i~elc~kh~v~VIS 196 (388)
T COG1168 172 GRVWTKEELRKIAELCLRHGVRVIS 196 (388)
T ss_pred CccccHHHHHHHHHHHHHcCCEEEe
Confidence 3569999999999999999999996
No 67
>KOG0258|consensus
Probab=37.81 E-value=69 Score=34.25 Aligned_cols=54 Identities=15% Similarity=0.313 Sum_probs=37.6
Q ss_pred hhHHHHHHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880 102 PLEVQELVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 102 ~~~i~~~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPE 170 (497)
.+++++.+.-|+ .|+++..-+. .|+.+ -+..+|.+-|++|+.+|++.|+-++..
T Consensus 200 ~~el~~~~~eA~-k~i~~r~lvvINPGNP--------------TGqvls~e~ie~i~~fa~~~~l~llaD 254 (475)
T KOG0258|consen 200 VAELERSVDEAR-KGINPRALVVINPGNP--------------TGQVLSEENIEGIICFAAEEGLVLLAD 254 (475)
T ss_pred HHHHHHHHHHHh-ccCCceEEEEECCCCc--------------cchhhcHHHHHHHHHHHHHcCeEEech
Confidence 457777877777 7775321111 13321 135699999999999999999999864
No 68
>PRK13957 indole-3-glycerol-phosphate synthase; Provisional
Probab=37.78 E-value=46 Score=33.33 Aligned_cols=38 Identities=21% Similarity=0.385 Sum_probs=33.6
Q ss_pred CCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc
Q psy14880 147 VYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG 184 (497)
Q Consensus 147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~ 184 (497)
.++++++++++++|..+|++++-|+-+..-...++...
T Consensus 134 ~L~~~~l~~l~~~a~~lGle~LVEVh~~~El~~a~~~g 171 (247)
T PRK13957 134 ILTPSQIKSFLKHASSLGMDVLVEVHTEDEAKLALDCG 171 (247)
T ss_pred hCCHHHHHHHHHHHHHcCCceEEEECCHHHHHHHHhCC
Confidence 48999999999999999999999999988887776543
No 69
>TIGR02455 TreS_stutzeri trehalose synthase, Pseudomonas stutzeri type. Trehalose synthase catalyzes a one-step conversion of maltose to trehalose. This is an alternative to the OtsAB and TreYZ pathways. This family includes a characterized example from Pseudomonas stutzeri plus very closely related sequences from other Pseudomonads. Cutoff scores are set to find a more distantly related sequence from Desulfovibrio vulgaris, likely to be functionally equivalent, between trusted and noise limits.
Probab=36.83 E-value=38 Score=38.52 Aligned_cols=31 Identities=26% Similarity=0.320 Sum_probs=27.1
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
=|.+|+++|++-|.+|||.||-.+ +|.|++.
T Consensus 127 GT~eDf~~L~~~Ah~~G~~vi~Dl-VpnHTs~ 157 (688)
T TIGR02455 127 GSEEELIQLSRMAAAHNAITIDDI-IPAHTGK 157 (688)
T ss_pred CCHHHHHHHHHHHHHCCCEEEEEe-CCCCCCC
Confidence 499999999999999999999775 6788764
No 70
>PF01565 FAD_binding_4: FAD binding domain This is only a subset of the Pfam family; InterPro: IPR006094 Various enzymes use FAD as a co-factor, most of these enzymes are oxygen-dependent oxidoreductases, containing a covalently bound FAD group which is attached to a histidine via an 8-alpha-(N3-histidyl)-riboflavin linkage. One of the enzymes Vanillyl-alcohol oxidase (VAO, 1.1.3.38 from EC) has a solved structure, the alignment includes the FAD binding site, called the PP-loop, between residues 99-110 []. The FAD molecule is covalently bound in the known structure, however the residue that links to the FAD is not in the alignment. VAO catalyses the oxidation of a wide variety of substrates, ranging from aromatic amines to 4-alkylphenols. ; GO: 0008762 UDP-N-acetylmuramate dehydrogenase activity, 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZR6_A 3HSU_A 2AXR_A 3D2J_A 3D2H_A 3FW9_A 3FW8_A 3FW7_A 3GSY_A 3FWA_A ....
Probab=36.78 E-value=40 Score=29.63 Aligned_cols=28 Identities=29% Similarity=0.382 Sum_probs=23.3
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAG 178 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~ 178 (497)
|.+|+++++++|+++++.|.|- .-||..
T Consensus 9 s~~ev~~~v~~a~~~~~~v~~~--g~G~~~ 36 (139)
T PF01565_consen 9 SVEEVQAIVKFANENGVPVRVR--GGGHSW 36 (139)
T ss_dssp SHHHHHHHHHHHHHTTSEEEEE--SSSTTS
T ss_pred CHHHHHHHHHHHHHcCCcEEEE--cCCCCc
Confidence 7899999999999999999984 345543
No 71
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=36.54 E-value=4e+02 Score=27.27 Aligned_cols=24 Identities=21% Similarity=0.416 Sum_probs=20.2
Q ss_pred CCcCCCCChhHHHHHHHHHHHHHh
Q psy14880 210 CGQLNPINPHVYTVLKDIYGELMG 233 (497)
Q Consensus 210 ~~~L~~~~p~t~~fl~~vl~E~~~ 233 (497)
...+|.+||++.+...+.++++.+
T Consensus 129 ~~~~Dftnp~a~~w~~~~~~~~~~ 152 (317)
T cd06598 129 TGLIDWFDPAAQAWFHDNYKKLID 152 (317)
T ss_pred ccccCCCCHHHHHHHHHHHHHhhh
Confidence 357899999999999999988743
No 72
>PRK09357 pyrC dihydroorotase; Validated
Probab=35.03 E-value=1.2e+02 Score=32.12 Aligned_cols=93 Identities=20% Similarity=0.166 Sum_probs=54.7
Q ss_pred CCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCccccccc-------ccCCCCCCCccCChhHHHHHHHHHH
Q psy14880 41 TDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLL-------PRFGAYSPAKVYTPLEVQELVHYAK 113 (497)
Q Consensus 41 ~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~-------~~~g~~~~~~~y~~~~i~~~~~y~~ 113 (497)
.+.+.|++.++.+..+++ .+|.|..+.+--.-+. .+++.......=....+.+.+..++
T Consensus 156 ~~~~~l~~~~~~a~~~g~--------------~v~iH~ee~~~~~~~~~~~g~~~~~~~~~~~p~~~E~~~i~~~~~la~ 221 (423)
T PRK09357 156 QDARLMRRALEYAKALDL--------------LIAQHCEDPSLTEGGVMNEGEVSARLGLPGIPAVAEEVMIARDVLLAE 221 (423)
T ss_pred CCHHHHHHHHHHHHhcCC--------------EEEEeCCCHHHhhcccccCChhhHHhCCCCCCHHHHHHHHHHHHHHHH
Confidence 356788888877777653 5789998854322111 1111110000001224666677777
Q ss_pred HcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEec
Q psy14880 114 LRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFEL 171 (497)
Q Consensus 114 ~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEI 171 (497)
..|.|+... ..|-.+-.++++.|+++|+.|..|+
T Consensus 222 ~~g~~~hi~------------------------H~s~~~~~~~i~~a~~~g~~v~~e~ 255 (423)
T PRK09357 222 ATGARVHIC------------------------HVSTAGSVELIRWAKALGIKVTAEV 255 (423)
T ss_pred HHCCcEEEE------------------------eCCCHHHHHHHHHHHHcCCCEEEEe
Confidence 777664321 1455567788889999999988875
No 73
>KOG0259|consensus
Probab=35.00 E-value=37 Score=36.13 Aligned_cols=34 Identities=26% Similarity=0.337 Sum_probs=30.2
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
+.+||++-+++|.+.|+++||.||-. +.-||+-.
T Consensus 213 GnVys~~HL~kiae~A~klgi~vIaD-EVY~~~vf 246 (447)
T KOG0259|consen 213 GNVYSEDHLKKIAETAKKLGIMVIAD-EVYGHTVF 246 (447)
T ss_pred cccccHHHHHHHHHHHHHhCCeEEeh-hhcceeec
Confidence 56799999999999999999999976 78888854
No 74
>PRK14511 maltooligosyl trehalose synthase; Provisional
Probab=34.86 E-value=1.6e+02 Score=34.80 Aligned_cols=70 Identities=23% Similarity=0.299 Sum_probs=45.7
Q ss_pred CChhHHHHHHHHHHHcCceee---eecCCCCCCCCCCCCCCCCCCCCCCCCC-------CHHHHHHHHHHHHHcCCEEEE
Q psy14880 100 YTPLEVQELVHYAKLRGVRVI---FELDAPAHAGNGWQWGPRFGAYSPAKVY-------TPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 100 y~~~~i~~~~~y~~~rG~ri~---~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-------T~~ei~eiv~yA~~rgI~VIP 169 (497)
|+-.++.+.+.|-+.-|+..+ |-+..+ .++. -+|....++ |.++++++++-|+++||.||-
T Consensus 17 ~tf~~~~~~l~YL~~LGis~IyLsPi~~a~--~gs~-------hGYdv~D~~~idp~lGt~e~f~~Lv~aah~~Gi~VIl 87 (879)
T PRK14511 17 FTFDDAAELVPYFADLGVSHLYLSPILAAR--PGST-------HGYDVVDHTRINPELGGEEGLRRLAAALRAHGMGLIL 87 (879)
T ss_pred CCHHHHHHHhHHHHHcCCCEEEECcCccCC--CCCC-------CCCCcCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEE
Confidence 344678888888888888532 211111 1111 123222223 899999999999999999998
Q ss_pred ecCCcccchh
Q psy14880 170 ELDAPAHAGN 179 (497)
Q Consensus 170 EID~PGH~~a 179 (497)
.| +|-|+..
T Consensus 88 Di-V~NH~~~ 96 (879)
T PRK14511 88 DI-VPNHMAV 96 (879)
T ss_pred Ee-ccccccC
Confidence 86 6778765
No 75
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=34.56 E-value=1.7e+02 Score=33.70 Aligned_cols=36 Identities=22% Similarity=0.335 Sum_probs=29.9
Q ss_pred CcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880 211 GQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADE 248 (497)
Q Consensus 211 ~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE 248 (497)
..|+|.+|++.++|++|+.|++.-.+ -.-||+=-|=
T Consensus 433 ~rl~P~~pe~r~~i~~i~~dla~~~~--~dGilf~Dd~ 468 (671)
T PRK14582 433 RRLSPFDDRVRAQVGMLYEDLAGHAA--FDGILFHDDA 468 (671)
T ss_pred cCCCCCCHHHHHHHHHHHHHHHHhCC--CceEEecccc
Confidence 35999999999999999999999766 5667766555
No 76
>TIGR02336 1,3-beta-galactosyl-N-acetylhexosamine phosphorylase. Members of this family are found in phylogenetically diverse bacteria, including Clostridium perfringens (in the Firmicutes), Bifidobacterium longum and Propionibacterium acnes (in the Actinobacteria), and Vibrio vulnificus (in the Proteobacteria), most of which occur as mammalian pathogens or commensals. The nominal activity, 1,3-beta-galactosyl-N-acetylhexosamine phosphorylase (EC 2.4.1.211), varies somewhat from instance to instance in relative rates for closely related substrates.
Probab=34.55 E-value=4.2e+02 Score=30.49 Aligned_cols=103 Identities=16% Similarity=0.172 Sum_probs=64.9
Q ss_pred CCCcCCCCChhHHHHHHHHHHHHHhccCCC-----CCeEE---EcCCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCC
Q psy14880 209 PCGQLNPINPHVYTVLKDIYGELMGMAKYG-----DEMFH---MGADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSP 280 (497)
Q Consensus 209 ~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~-----~~~iH---iGgDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~ 280 (497)
..-.+|+..|+|.+++.+-+++.++-.|+. ..+|| +=-|| ...
T Consensus 176 h~ip~Dvr~p~t~~~~~~~l~~wl~~~p~~dVvRfTTfFy~Ftl~f~~-----------------------------~~k 226 (719)
T TIGR02336 176 HEIPFDIYHPATRKHVFDTFEQWLKDSPQTDVVRFTTFFYQFTLLFDE-----------------------------KRR 226 (719)
T ss_pred ccCCcCCcChHHHHHHHHHHHHHHHhCCCCcEEEEeeeeeeEeEEecc-----------------------------ccc
Confidence 334799999999999999999999988863 12222 22233 111
Q ss_pred CcceeecccccCC--HHHH-HHHHHcCCCCC--------------------cccHHHHHHHHHHHHHHHHHHHhhCCCce
Q psy14880 281 HLWTVFIKCWNNT--PAIV-DFMLAHGKNTT--------------------FDDYIQLWAHFQNKAAASLDEAVGHNRTK 337 (497)
Q Consensus 281 ~~~~v~~~~w~~~--p~~~-~~~~~~g~~~~--------------------~~~~~~l~~~F~~~~~~~l~~~~~~~g~~ 337 (497)
+.+ |+.-.+..| |... ++-++.|..-+ .-||.+.+..|+.+.++.+-+.+.+.||.
T Consensus 227 ek~-vDWfgY~~sVSp~ale~Fe~e~GY~l~pEd~vd~GyyN~~~rvPsk~~~D~~~F~~~fV~~~ak~lvd~~H~~Gke 305 (719)
T TIGR02336 227 EKV-VDWFGYACTVSPRALEDFEAKYGYKLRPEDFVDAGYYNSTWRVPRKQYRDWIDFLSGFVRENAKELVDMSHAAGKE 305 (719)
T ss_pred cce-eeccCcccccCHHHHHHHHHHhCCCCCHHHhhccccccCcccCCcHHHhHHHHHHHHHHHHHHHHHHHHHHhcCce
Confidence 222 333334333 5444 44445466321 13566777778888887777788889998
Q ss_pred EEEe
Q psy14880 338 LIVW 341 (497)
Q Consensus 338 ~~~W 341 (497)
.|+.
T Consensus 306 AmmF 309 (719)
T TIGR02336 306 AMMF 309 (719)
T ss_pred EEEe
Confidence 8876
No 77
>PF02065 Melibiase: Melibiase; InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=34.32 E-value=2.1e+02 Score=30.60 Aligned_cols=128 Identities=20% Similarity=0.272 Sum_probs=66.0
Q ss_pred cCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCC-----CCCCCCCC-CCCCHHHHHHHHHHHHHcCCE----EE
Q psy14880 99 VYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGP-----RFGAYSPA-KVYTPLEVQELVHYAKLRGIR----VI 168 (497)
Q Consensus 99 ~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~-----~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~----VI 168 (497)
.++.+.++++++.++.-|+.. +-+| .+|+... ..|...++ ..+ ..=++.|+++.+++|+. +-
T Consensus 54 d~~e~~i~~~a~~~~~~G~e~-fviD------DGW~~~r~~d~~~~GdW~~~~~kF-P~Gl~~l~~~i~~~Gmk~GlW~e 125 (394)
T PF02065_consen 54 DITEEKILELADAAAELGYEY-FVID------DGWFGGRDDDNAGLGDWEPDPKKF-PNGLKPLADYIHSLGMKFGLWFE 125 (394)
T ss_dssp G--HHHHHHHHHHHHHHT-SE-EEE-------SSSBCTESTTTSTTSBECBBTTTS-TTHHHHHHHHHHHTT-EEEEEEE
T ss_pred CCCHHHHHHHHHHHHHhCCEE-EEEc------CccccccCCCcccCCceeEChhhh-CCcHHHHHHHHHHCCCeEEEEec
Confidence 356778888888888888764 2232 2332110 11222111 111 12499999999999987 55
Q ss_pred EecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcC
Q psy14880 169 FELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGA 246 (497)
Q Consensus 169 PEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGg 246 (497)
||+=.|+. .+.+.+|+. ....++.+-.. ....-.||+++|++.+++.+.+..+..-.. -.||=+-.
T Consensus 126 Pe~v~~~S--~l~~~hPdw-----~l~~~~~~~~~---~r~~~vLD~~~pev~~~l~~~i~~ll~~~g--idYiK~D~ 191 (394)
T PF02065_consen 126 PEMVSPDS--DLYREHPDW-----VLRDPGRPPTL---GRNQYVLDLSNPEVRDYLFEVIDRLLREWG--IDYIKWDF 191 (394)
T ss_dssp TTEEESSS--CHCCSSBGG-----BTCCTTSE-EC---BTTBEEB-TTSHHHHHHHHHHHHHHHHHTT---SEEEEE-
T ss_pred cccccchh--HHHHhCccc-----eeecCCCCCcC---cccceEEcCCCHHHHHHHHHHHHHHHHhcC--CCEEEecc
Confidence 66544443 233334432 11111100000 011136999999999999999998876555 56665543
No 78
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=33.10 E-value=1.6e+02 Score=30.52 Aligned_cols=23 Identities=13% Similarity=0.046 Sum_probs=20.5
Q ss_pred CcCCCCChhHHHHHHHHHHHHHh
Q psy14880 211 GQLNPINPHVYTVLKDIYGELMG 233 (497)
Q Consensus 211 ~~L~~~~p~t~~fl~~vl~E~~~ 233 (497)
..+|.+||++.+...+.++++..
T Consensus 125 ~~~Dftnp~a~~ww~~~~~~~~~ 147 (339)
T cd06603 125 SWPDFLNPEVRDWWASLFSYDKY 147 (339)
T ss_pred CCccCCChhHHHHHHHHHHHHhh
Confidence 57999999999999999998865
No 79
>KOG0256|consensus
Probab=32.89 E-value=1.8e+02 Score=31.43 Aligned_cols=117 Identities=12% Similarity=0.162 Sum_probs=65.8
Q ss_pred CCCCHHHHH-Hhhhhhhcccccccc-----hhhHH--hhhhceeEEeccC-CcccccccccCCCCCC-------------
Q psy14880 39 HLTDPEIIL-NYLDPQTCGTQQVVR-----EPLLL--VTLMALCRWQMCT-RQQVPCLLPRFGAYSP------------- 96 (497)
Q Consensus 39 ~~~~~~~lk-~~id~ma~~K~~~~~-----~~~~~--~~~~n~lhlHl~D-~q~~~~e~~~~g~~~~------------- 96 (497)
.|..+..++ .+++.|..++-|-.+ +.+.= ....+.|-+=|.| +..|-+-.|-||.++.
T Consensus 119 dy~Gl~~frqa~A~Fm~~~r~~~v~fdP~~~Vv~~G~T~ane~l~fcLadpgdafLvPtPyY~gfdrdl~~rTgveivpv 198 (471)
T KOG0256|consen 119 DYHGLPSFRQAVAEFMERARGNRVKFDPERVVVTNGATSANETLMFCLADPGDAFLVPTPYYPGFDRDLRWRTGVEIVPV 198 (471)
T ss_pred cccCchHHHHHHHHHHHHHhCCCCccCccceEEecccchhhHHHHHHhcCCCceeeecCCCCCcccccceeccCceEEEE
Confidence 344444444 467777776643221 11110 0112445555666 6677766777776531
Q ss_pred ----CccC--ChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 97 ----AKVY--TPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 97 ----~~~y--~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
...| |.+.+++...-|...|.+|- .-+..|.. +- +-.||++++..|+.+|..++|.||-
T Consensus 199 ~c~Ss~~f~itv~alE~A~~~A~~~~~kVkGvlitNPsN---------PL-----G~~~~~e~L~~ll~Fa~~kniHvI~ 264 (471)
T KOG0256|consen 199 HCSSSNGFQITVEALEAALNQARKLGLKVKGVLITNPSN---------PL-----GTTLSPEELISLLNFASRKNIHVIS 264 (471)
T ss_pred EeecCCCccccHHHHHHHHHHHHHhCCceeEEEEeCCCC---------CC-----CCccCHHHHHHHHHHHhhcceEEEe
Confidence 1112 34556666667777776642 11111211 11 2349999999999999999999875
No 80
>PF05913 DUF871: Bacterial protein of unknown function (DUF871); InterPro: IPR008589 This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown though a number are annotated as outer surface proteins.; PDB: 2P0O_A 1X7F_A.
Probab=32.57 E-value=80 Score=33.33 Aligned_cols=55 Identities=25% Similarity=0.294 Sum_probs=35.2
Q ss_pred hhHHHHHHHHHHHcCceeee-ecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecC
Q psy14880 102 PLEVQELVHYAKLRGVRVIF-ELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELD 172 (497)
Q Consensus 102 ~~~i~~~~~y~~~rG~ri~~-e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID 172 (497)
.++.++-++-++..|++.++ -+.+|... .--..+++++|+++|+++|++||..|.
T Consensus 13 ~~~~~~yi~~a~~~Gf~~iFTSL~ipe~~----------------~~~~~~~~~~l~~~a~~~~~~v~~Dis 68 (357)
T PF05913_consen 13 FEENKAYIEKAAKYGFKRIFTSLHIPEDD----------------PEDYLERLKELLKLAKELGMEVIADIS 68 (357)
T ss_dssp HHHHHHHHHHHHCTTEEEEEEEE-------------------------HHHHHHHHHHHHHHCT-EEEEEE-
T ss_pred HHHHHHHHHHHHHCCCCEEECCCCcCCCC----------------HHHHHHHHHHHHHHHHHCCCEEEEECC
Confidence 45667777778888986543 33344321 114578999999999999999999875
No 81
>PRK14465 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=32.52 E-value=1.7e+02 Score=30.71 Aligned_cols=74 Identities=15% Similarity=0.103 Sum_probs=46.3
Q ss_pred eeEEeccCCcccccccccCCCCCCCccCChhHHHHHH-HHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCH
Q psy14880 73 LCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV-HYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTP 150 (497)
Q Consensus 73 ~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~-~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~ 150 (497)
.+.||-.|++.-.--.|. ++.|.-+++.+.+ +|.+..|-++.+++. +|+.. =|.
T Consensus 215 aiSLhA~~~e~R~~l~Pi------~~~~~le~ll~al~~~~~~~~r~v~ieyvLI~GvN------------------Ds~ 270 (342)
T PRK14465 215 AISLNHPDPNGRLQIMDI------EEKFPLEELLQAAKDFTRELKRRITFEYVMIPGVN------------------MGR 270 (342)
T ss_pred EEEecCCChhhcceEeec------cccCCHHHHHHHHHHHHHHcCCEEEEEEEEECCcc------------------CCH
Confidence 367888887654333331 2345445555544 577777777776665 24321 678
Q ss_pred HHHHHHHHHHHHc--CCEEEEe
Q psy14880 151 LEVQELVHYAKLR--GIRVIFE 170 (497)
Q Consensus 151 ~ei~eiv~yA~~r--gI~VIPE 170 (497)
+++++|.++++.+ .|++||-
T Consensus 271 eda~~L~~ll~~l~~kVnLIPy 292 (342)
T PRK14465 271 ENANKLVKIARSLDCKINVIPL 292 (342)
T ss_pred HHHHHHHHHHhhCCCcEEEEcc
Confidence 9999999999985 4566663
No 82
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=32.44 E-value=66 Score=24.67 Aligned_cols=46 Identities=33% Similarity=0.385 Sum_probs=29.8
Q ss_pred HHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecC
Q psy14880 105 VQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELD 172 (497)
Q Consensus 105 i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID 172 (497)
++++++.++.+|+....= -.|. |-....++.+++++.||.|||-++
T Consensus 17 ~~~~~~~a~~~g~~~v~i---TDh~-------------------~~~~~~~~~~~~~~~gi~~i~G~E 62 (67)
T smart00481 17 PEELVKRAKELGLKAIAI---TDHG-------------------NLFGAVEFYKAAKKAGIKPIIGLE 62 (67)
T ss_pred HHHHHHHHHHcCCCEEEE---eeCC-------------------cccCHHHHHHHHHHcCCeEEEEEE
Confidence 567888899998865311 1111 122245677778899999999554
No 83
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=32.31 E-value=3.3e+02 Score=28.33 Aligned_cols=127 Identities=17% Similarity=0.121 Sum_probs=64.0
Q ss_pred ChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880 101 TPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG 178 (497)
Q Consensus 101 ~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~ 178 (497)
+.+++.++++--+.+++-+. ..+|. .+ ....+.|.-+ ..|-.-..+++|+.-+++||+|++-|+ |+-..
T Consensus 22 ~~~~v~~~~~~~r~~~iP~d~i~lD~-~~-------~~~~~~f~~d~~~FPdp~~~~mi~~L~~~G~k~~~~i~-P~v~~ 92 (339)
T cd06602 22 NVDEVKEVVENMRAAGIPLDVQWNDI-DY-------MDRRRDFTLDPVRFPGLKMPEFVDELHANGQHYVPILD-PAISA 92 (339)
T ss_pred CHHHHHHHHHHHHHhCCCcceEEECc-cc-------ccCccceecccccCCCccHHHHHHHHHHCCCEEEEEEe-Ccccc
Confidence 56788888866666776543 22221 11 1122333211 123322339999999999999999875 44322
Q ss_pred hh-hhhcCCC----CCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880 179 NG-WQWGPRY----GLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK 236 (497)
Q Consensus 179 a~-l~~~p~~----~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~ 236 (497)
.. -..++.. ..+-+..-.++.++....-......+|.+||++.+..++.++++.+-..
T Consensus 93 ~~~~~~~~~~~e~~~~g~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~~~G 155 (339)
T cd06602 93 NEPTGSYPPYDRGLEMDVFIKNDDGSPYIGKVWPGYTVFPDFLNPNTQEWWTDEIKDFHDQVP 155 (339)
T ss_pred CcCCCCCHHHHHHHHCCeEEECCCCCEEEEEeCCCCCcCcCCCCHHHHHHHHHHHHHHHhcCC
Confidence 10 0000000 0000000001111100000112246899999999999999988776444
No 84
>COG1809 (2R)-phospho-3-sulfolactate synthase (PSL synthase, CoM biosynthesis) [Coenzyme transport and metabolism]
Probab=32.26 E-value=2.6e+02 Score=27.80 Aligned_cols=101 Identities=18% Similarity=0.196 Sum_probs=62.2
Q ss_pred ccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCC
Q psy14880 17 KAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSP 96 (497)
Q Consensus 17 ~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~ 96 (497)
++++....||--=-.+|||- =+++..++++++..+.|= ... +|| |++
T Consensus 7 ~~~~r~~kPr~~G~T~vldk---g~~p~f~~D~~~vagdyV--------------Dfv---------------Kfg-wGT 53 (258)
T COG1809 7 LPPARPEKPRTFGMTVVLDK---GLGPRFVEDVLKVAGDYV--------------DFV---------------KFG-WGT 53 (258)
T ss_pred cCCCCCCCCccCCeEEEEeC---CCChHHHHHHHHhhhhhe--------------eee---------------eec-ccc
Confidence 33344555665555677874 489999999999988872 111 123 234
Q ss_pred CccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
+..-+.+.+++-+++.+.+++.+. |+.+ -..-+|+++-+.+...+|++.|-++|
T Consensus 54 ~~Li~kd~V~ekid~y~e~~i~v~-----pGGt-------------lfe~a~~~~kvdeyl~e~~~lGfe~i 107 (258)
T COG1809 54 SSLIDKDQVKEKIDMYKENDIYVF-----PGGT-------------LFEIAYSQDKVDEYLNEAKELGFEAI 107 (258)
T ss_pred cccccHHHHHHHHHHHHHcCceec-----CCce-------------EEEeehhcccHHHHHHHHHHcCccEE
Confidence 444455667777777777665432 2211 11234777778888888888887766
No 85
>PRK10933 trehalose-6-phosphate hydrolase; Provisional
Probab=31.78 E-value=1.3e+02 Score=33.74 Aligned_cols=30 Identities=40% Similarity=0.556 Sum_probs=26.6
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|+++||+-|.++||.||-.+ ++-|+..
T Consensus 79 t~~d~~~lv~~~h~~gi~vilD~-V~NH~s~ 108 (551)
T PRK10933 79 TLDDFDELVAQAKSRGIRIILDM-VFNHTST 108 (551)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-CCCCccC
Confidence 89999999999999999999886 5677765
No 86
>PF14542 Acetyltransf_CG: GCN5-related N-acetyl-transferase; PDB: 2H5M_A 2Q44_A 1XMT_A 2Q4Y_A 2IL4_A 2EVN_A 1R57_A.
Probab=31.23 E-value=53 Score=26.55 Aligned_cols=23 Identities=22% Similarity=0.372 Sum_probs=20.6
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEe
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPE 170 (497)
+-+.=++++++||++.|..|+|.
T Consensus 39 ia~~L~~~~l~~a~~~~~kv~p~ 61 (78)
T PF14542_consen 39 IAKKLVEAALDYARENGLKVVPT 61 (78)
T ss_dssp HHHHHHHHHHHHHHHTT-EEEET
T ss_pred HHHHHHHHHHHHHHHCCCEEEEE
Confidence 78888999999999999999995
No 87
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=31.08 E-value=4e+02 Score=27.34 Aligned_cols=20 Identities=10% Similarity=-0.021 Sum_probs=17.1
Q ss_pred CCCHHHHHHhhhhhhccccc
Q psy14880 40 LTDPEIILNYLDPQTCGTQQ 59 (497)
Q Consensus 40 ~~~~~~lk~~id~ma~~K~~ 59 (497)
|.+.++|+++++.+...++.
T Consensus 20 y~~~~ev~~~~~~~~~~~iP 39 (319)
T cd06591 20 YKTQEELLDVAKEYRKRGIP 39 (319)
T ss_pred CCCHHHHHHHHHHHHHhCCC
Confidence 46899999999999998864
No 88
>TIGR02401 trehalose_TreY malto-oligosyltrehalose synthase. This enzyme, formally named (1-4)-alpha-D-glucan 1-alpha-D-glucosylmutase, is the TreY enzyme of the TreYZ pathway of trehalose biosynthesis, an alternative to the OtsAB pathway. Trehalose may be incorporated into more complex compounds but is best known as compatible solute. It is one of the most effective osmoprotectants, and unlike the various betaines does not require nitrogen for its synthesis.
Probab=31.05 E-value=2.1e+02 Score=33.77 Aligned_cols=71 Identities=24% Similarity=0.321 Sum_probs=46.7
Q ss_pred CChhHHHHHHHHHHHcCceee---eecCCCCCCCCCCCCCCCCCCCCCCCCC-------CHHHHHHHHHHHHHcCCEEEE
Q psy14880 100 YTPLEVQELVHYAKLRGVRVI---FELDAPAHAGNGWQWGPRFGAYSPAKVY-------TPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 100 y~~~~i~~~~~y~~~rG~ri~---~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-------T~~ei~eiv~yA~~rgI~VIP 169 (497)
++-.++.+.+.|-+.-|+.-. |-+.++ .++. -+|....++ |.++++++++-|+++||.||-
T Consensus 13 ~tf~~~~~~L~YL~~LGv~~V~lsPi~~a~--~gs~-------hGYdv~D~~~idp~lGt~edf~~Lv~aah~~Gm~vIl 83 (825)
T TIGR02401 13 FTFDDAAALLPYLKSLGVSHLYLSPILTAV--PGST-------HGYDVVDHSEINPELGGEEGLRRLSEAARARGLGLIV 83 (825)
T ss_pred CCHHHHHHhhHHHHHcCCCEEEeCcCccCC--CCCC-------CCCCCCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEE
Confidence 344677888888888887432 222111 1111 123333333 899999999999999999998
Q ss_pred ecCCcccchhh
Q psy14880 170 ELDAPAHAGNG 180 (497)
Q Consensus 170 EID~PGH~~a~ 180 (497)
.| +|-|+...
T Consensus 84 Di-VpNH~a~~ 93 (825)
T TIGR02401 84 DI-VPNHMAVH 93 (825)
T ss_pred Ee-cccccccc
Confidence 76 58888753
No 89
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=29.94 E-value=1.9e+02 Score=29.92 Aligned_cols=121 Identities=13% Similarity=0.180 Sum_probs=61.6
Q ss_pred ChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880 101 TPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG 178 (497)
Q Consensus 101 ~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~ 178 (497)
+.+++.++++--+.+|+-+. +.+|.. +. ..++.|+-+ ..|. +.+++++..+++||+|++-++--=...
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~-~~-------~~~~~f~~d~~~fP--dp~~m~~~l~~~g~~~~~~~~P~v~~~ 91 (339)
T cd06604 22 PEEEVREIADEFRERDIPCDAIYLDID-YM-------DGYRVFTWDKERFP--DPKELIKELHEQGFKVVTIIDPGVKVD 91 (339)
T ss_pred CHHHHHHHHHHHHHhCCCcceEEECch-hh-------CCCCceeeccccCC--CHHHHHHHHHHCCCEEEEEEeCceeCC
Confidence 56678888877777777543 223311 11 112223211 1232 578999999999999997664211110
Q ss_pred ---hhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHh
Q psy14880 179 ---NGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMG 233 (497)
Q Consensus 179 ---a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~ 233 (497)
.......+.++ +..-.++.++...+.......+|.+||++.++.+++++++.+
T Consensus 92 ~~~~~~~e~~~~g~--~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~ 147 (339)
T cd06604 92 PGYDVYEEGLENDY--FVKDPDGELYIGRVWPGLSAFPDFTNPKVREWWGSLYKKFVD 147 (339)
T ss_pred CCChHHHHHHHCCe--EEECCCCCEEEEEecCCCccccCCCChHHHHHHHHHHHHHhh
Confidence 00000000010 100011111110000112356899999999999999988764
No 90
>KOG0462|consensus
Probab=29.71 E-value=43 Score=37.27 Aligned_cols=32 Identities=22% Similarity=0.320 Sum_probs=25.7
Q ss_pred CCHHHHHHHHHHHHHcCCEEE----------------EecCCcccchh
Q psy14880 148 YTPLEVQELVHYAKLRGIRVI----------------FELDAPAHAGN 179 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VI----------------PEID~PGH~~a 179 (497)
--++++-+=++.-++|||+|- =-||||||...
T Consensus 91 ~~q~q~LDkl~vERERGITIkaQtasify~~~~~ylLNLIDTPGHvDF 138 (650)
T KOG0462|consen 91 IGQEQVLDKLQVERERGITIKAQTASIFYKDGQSYLLNLIDTPGHVDF 138 (650)
T ss_pred CchhhhhhhhhhhhhcCcEEEeeeeEEEEEcCCceEEEeecCCCcccc
Confidence 346777777788899999974 46999999975
No 91
>PF03979 Sigma70_r1_1: Sigma-70 factor, region 1.1; InterPro: IPR007127 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. This entry represents Region 1.1 which modulates DNA binding by region 2 and 4 when sigma is unbound by the core RNA polymerase [, ]. Region 1.1 is also involved in promoter binding.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2K6X_A.
Probab=29.40 E-value=51 Score=26.96 Aligned_cols=55 Identities=22% Similarity=0.329 Sum_probs=29.6
Q ss_pred hHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCC
Q psy14880 103 LEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDA 173 (497)
Q Consensus 103 ~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~ 173 (497)
..|+++++.+|.+|+=-.-++. .++ +....+.++|.+|..+-...||+|+.+.+.
T Consensus 7 ~~i~~Li~~gK~~G~lT~~eI~---------------~~L-~~~~~~~e~id~i~~~L~~~gI~Vvd~~~~ 61 (82)
T PF03979_consen 7 EAIKKLIEKGKKKGYLTYDEIN---------------DAL-PEDDLDPEQIDEIYDTLEDEGIEVVDEEED 61 (82)
T ss_dssp HHHHHHHHHHHHHSS-BHHHHH---------------HH--S-S---HHHHHHHHHHHHTT----B--S--
T ss_pred HHHHHHHHHHhhcCcCCHHHHH---------------HHc-CccCCCHHHHHHHHHHHHHCCCEEecCCCc
Confidence 4688999999999872111111 000 122389999999999999999999995433
No 92
>PRK14457 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=29.21 E-value=1e+02 Score=32.29 Aligned_cols=73 Identities=22% Similarity=0.246 Sum_probs=49.0
Q ss_pred eeEEeccCCcccccccccCCCCCCCccCChhHHH-HHHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCH
Q psy14880 73 LCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ-ELVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTP 150 (497)
Q Consensus 73 ~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~-~~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~ 150 (497)
.+-+|-.|++-+.--.|- ++.|..+++. .+.+|.+..|-+|.+++. +|+.. -|.
T Consensus 215 aiSLha~~~e~r~~i~p~------~~~~~l~~l~~~~~~y~~~~gr~I~iey~LIpGvN------------------Ds~ 270 (345)
T PRK14457 215 AVSLHAPNQKLRETLIPS------AKNYPIENLLEDCRHYVAITGRRVSFEYILLGGVN------------------DLP 270 (345)
T ss_pred EEEeCCCCHHHHHHhcCC------ccCCCHHHHHHHHHHHHHHhCCEEEEEEEEECCcC------------------CCH
Confidence 377899998766543332 3446555554 455788888888877775 34431 688
Q ss_pred HHHHHHHHHHHHc--CCEEEE
Q psy14880 151 LEVQELVHYAKLR--GIRVIF 169 (497)
Q Consensus 151 ~ei~eiv~yA~~r--gI~VIP 169 (497)
+++++|+++++.. +|++||
T Consensus 271 e~a~~La~~l~~l~~~VnLIP 291 (345)
T PRK14457 271 EHAEELANLLRGFQSHVNLIP 291 (345)
T ss_pred HHHHHHHHHHhcCCCeEEEec
Confidence 9999999999876 344444
No 93
>TIGR02668 moaA_archaeal probable molybdenum cofactor biosynthesis protein A, archaeal. This model describes an archaeal family related, and predicted to be functionally equivalent, to molybdenum cofactor biosynthesis protein A (MoaA) of bacteria (see TIGR02666).
Probab=27.90 E-value=1.6e+02 Score=29.73 Aligned_cols=86 Identities=17% Similarity=0.278 Sum_probs=53.5
Q ss_pred hceeEEeccCCcccccccccCCCC--CCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy14880 71 MALCRWQMCTRQQVPCLLPRFGAY--SPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVY 148 (497)
Q Consensus 71 ~n~lhlHl~D~q~~~~e~~~~g~~--~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y 148 (497)
++.+.+++|+.=+++|..-..... ...+..+.+++.++++..+..|++.. -+ ++++-.
T Consensus 9 ~~~l~i~vT~~CNl~C~yC~~~~~~~~~~~~ls~eei~~~i~~~~~~gi~~I-~~-------------------tGGEPl 68 (302)
T TIGR02668 9 VTSLRISVTDRCNLSCFYCHMEGEDRSGGNELSPEEIERIVRVASEFGVRKV-KI-------------------TGGEPL 68 (302)
T ss_pred cCeEEEEEcccccCCCCCCCccccCCCccCcCCHHHHHHHHHHHHHcCCCEE-EE-------------------ECcccc
Confidence 578899999987777753221111 11345677888888887776665321 00 112335
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHA 177 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~ 177 (497)
...++.+|+++++++|+.-| .|.|-|..
T Consensus 69 l~~~l~~iv~~l~~~g~~~v-~i~TNG~l 96 (302)
T TIGR02668 69 LRKDLIEIIRRIKDYGIKDV-SMTTNGIL 96 (302)
T ss_pred cccCHHHHHHHHHhCCCceE-EEEcCchH
Confidence 56678999999999988322 35566643
No 94
>PF00682 HMGL-like: HMGL-like of this family is not conserved in other members. are a sub-families of this Pfam.; InterPro: IPR000891 Pyruvate carboxylase (6.4.1.1 from EC) (PC), a member of the biotin-dependent enzyme family, is involved in the gluconeogenesis by mediating the carboxylation of pyruvate to oxaloacetate. Biotin-dependent carboxylase enzymes perform a two step reaction. Enzyme-bound biotin is first carboxylated by bicarbonate and ATP and the carboxyl group temporarily bound to biotin is subsequently transferred to an acceptor substrate such as pyruvate []. PC has three functional domains: a biotin carboxylase (BC) domain, a carboxyltransferase (CT) domain which perform the second part of the reaction and a biotinyl domain [, ]. The mechanism by which the carboxyl group is transferred from the carboxybiotin to the pyruvate is not well understood. The pyruvate carboxyltransferase domain is also found in other pyruvate binding enzymes and acetyl-CoA dependent enzymes suggesting that this domain can be associated with different enzymatic activities. This domain is found towards the N-terminal region of various aldolase enzymes. This N-terminal TIM barrel domain [] interacts with the C-terminal domain. The C-terminal DmpG_comm domain (IPR012425 from INTERPRO) is thought to promote heterodimerisation with members of IPR003361 from INTERPRO to form a bifunctional aldolase-dehydrogenase []. ; GO: 0003824 catalytic activity; PDB: 3MP5_E 3MP3_E 2CW6_E 3MP4_D 3HBL_A 3HB9_C 3HO8_A 3BG5_C 1YDN_B 3RMJ_A ....
Probab=27.73 E-value=1.5e+02 Score=28.58 Aligned_cols=57 Identities=28% Similarity=0.318 Sum_probs=40.0
Q ss_pred hHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880 103 LEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG 178 (497)
Q Consensus 103 ~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~ 178 (497)
+.+.+++++++..|..+.+... ....+|.+++.++++.+.+.|+.+|=--||-|.+.
T Consensus 108 ~~~~~~v~~ak~~g~~v~~~~~-------------------~~~~~~~~~~~~~~~~~~~~g~~~i~l~Dt~G~~~ 164 (237)
T PF00682_consen 108 ERIEEAVKYAKELGYEVAFGCE-------------------DASRTDPEELLELAEALAEAGADIIYLADTVGIMT 164 (237)
T ss_dssp HHHHHHHHHHHHTTSEEEEEET-------------------TTGGSSHHHHHHHHHHHHHHT-SEEEEEETTS-S-
T ss_pred HHHHHHHHHHHhcCCceEeCcc-------------------ccccccHHHHHHHHHHHHHcCCeEEEeeCccCCcC
Confidence 4667788899988887743321 11348999999999999999998887667666653
No 95
>cd03319 L-Ala-DL-Glu_epimerase L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides. The genomic context and the substrate specificity of characterized members of this family from E.coli and B.subtilis indicates a possible role in the metabolism of the murein peptide of peptidoglycan, of which L-Ala-D-Glu is a component. L-Ala-D/L-Glu epimerase is a member of the enolase-superfamily, which is characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=27.63 E-value=1.2e+02 Score=31.00 Aligned_cols=22 Identities=18% Similarity=0.330 Sum_probs=19.5
Q ss_pred CHHHHHHHHHHHHHcCCEEEEe
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPE 170 (497)
.-.+.+++.++|+++||.++|.
T Consensus 264 Gi~~~~~~~~~a~~~gi~~~~~ 285 (316)
T cd03319 264 GLTEALRIADLARAAGLKVMVG 285 (316)
T ss_pred CHHHHHHHHHHHHHcCCCEEEE
Confidence 3467899999999999999996
No 96
>PRK14340 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=27.60 E-value=87 Score=33.96 Aligned_cols=52 Identities=13% Similarity=0.050 Sum_probs=35.1
Q ss_pred CccCChhHHHHHHHHHHHc--CceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880 97 AKVYTPLEVQELVHYAKLR--GVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR 166 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~r--G~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~ 166 (497)
.+.|+.+++.++++..+.. |+.+...+ +-+++ + =|.+++++.++++++.++.
T Consensus 277 ~R~~t~~~~~~~v~~lr~~~pgi~i~td~-IvGfP---------------g--ET~edf~~tl~~~~~~~~~ 330 (445)
T PRK14340 277 NRGHTIEEYLEKIALIRSAIPGVTLSTDL-IAGFC---------------G--ETEEDHRATLSLMEEVRFD 330 (445)
T ss_pred CCCCCHHHHHHHHHHHHHhCCCCEEeccE-EEECC---------------C--CCHHHHHHHHHHHHhcCCC
Confidence 3568888888888887766 76543211 11211 1 3899999999999998743
No 97
>PF07071 DUF1341: Protein of unknown function (DUF1341); InterPro: IPR010763 Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.; PDB: 3NZR_D 3LM7_A 3M0Z_B 3M6Y_A 3N73_A 3MUX_A.
Probab=27.36 E-value=76 Score=30.88 Aligned_cols=23 Identities=22% Similarity=0.228 Sum_probs=11.0
Q ss_pred CCHHHHHHHHHHHHHcCC-EEEEe
Q psy14880 148 YTPLEVQELVHYAKLRGI-RVIFE 170 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI-~VIPE 170 (497)
.+.+.+.+|++-|.+-|| .|||-
T Consensus 186 Idl~N~~~I~~i~l~aGv~~viPH 209 (218)
T PF07071_consen 186 IDLDNFEEIVKICLDAGVEKVIPH 209 (218)
T ss_dssp --TTTHHHHHHHHHHTT-S-B--E
T ss_pred cCHHHHHHHHHHHHHcCCCeeccc
Confidence 555566666666666666 35553
No 98
>KOG0464|consensus
Probab=27.17 E-value=53 Score=35.35 Aligned_cols=30 Identities=27% Similarity=0.432 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHcCCEE-------------EEecCCcccchhh
Q psy14880 151 LEVQELVHYAKLRGIRV-------------IFELDAPAHAGNG 180 (497)
Q Consensus 151 ~ei~eiv~yA~~rgI~V-------------IPEID~PGH~~a~ 180 (497)
+-+.++.+.-++|||+| |--||+|||...-
T Consensus 74 dtvtdfla~erergitiqsaav~fdwkg~rinlidtpghvdf~ 116 (753)
T KOG0464|consen 74 DTVTDFLAIERERGITIQSAAVNFDWKGHRINLIDTPGHVDFR 116 (753)
T ss_pred chHHHHHHHHHhcCceeeeeeeecccccceEeeecCCCcceEE
Confidence 34677888899999998 5679999998753
No 99
>PLN02231 alanine transaminase
Probab=27.12 E-value=1.7e+02 Score=32.43 Aligned_cols=24 Identities=17% Similarity=0.414 Sum_probs=21.5
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
..||++++++|++.|+++||-||=
T Consensus 286 ~vls~e~l~~Iv~~a~~~~l~lI~ 309 (534)
T PLN02231 286 QVLAEENQRDIVEFCKQEGLVLLA 309 (534)
T ss_pred cCCCHHHHHHHHHHHHHcCCEEEE
Confidence 459999999999999999997773
No 100
>cd07937 DRE_TIM_PC_TC_5S Pyruvate carboxylase and Transcarboxylase 5S, carboxyltransferase domain. This family includes the carboxyltransferase domains of pyruvate carboxylase (PC) and the transcarboxylase (TC) 5S subunit. Transcarboxylase 5S is a cobalt-dependent metalloenzyme subunit of the biotin-dependent transcarboxylase multienzyme complex. Transcarboxylase 5S transfers carbon dioxide from the 1.3S biotin to pyruvate in the second of two carboxylation reactions catalyzed by TC. The first reaction involves the transfer of carbon dioxide from methylmalonyl-CoA to the 1.3S biotin, and is catalyzed by the 12S subunit. These two steps allow a carboxylate group to be transferred from oxaloacetate to propionyl-CoA to yield pyruvate and methylmalonyl-CoA. The catalytic domain of transcarboxylase 5S has a canonical TIM-barrel fold with a large C-terminal extension that forms a funnel leading to the active site. Transcarboxylase 5S forms a homodimer and there are six dimers per complex
Probab=27.01 E-value=1.9e+02 Score=29.11 Aligned_cols=22 Identities=14% Similarity=0.024 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHcCCEEEEecC
Q psy14880 151 LEVQELVHYAKLRGIRVIFELD 172 (497)
Q Consensus 151 ~ei~eiv~yA~~rgI~VIPEID 172 (497)
+.+++.+++|+++|++|++-+.
T Consensus 118 ~~~~~~i~~ak~~G~~v~~~i~ 139 (275)
T cd07937 118 RNLEVAIKAVKKAGKHVEGAIC 139 (275)
T ss_pred HHHHHHHHHHHHCCCeEEEEEE
Confidence 4566666666666666665443
No 101
>PF13199 Glyco_hydro_66: Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=26.93 E-value=3.8e+02 Score=30.15 Aligned_cols=93 Identities=13% Similarity=0.112 Sum_probs=50.4
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCC-CCcccccCCC--CCCccccCC--C--CCcCCCCCh
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGL-GDLIVCLNDP--SWRANCIQP--P--CGQLNPINP 218 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~-~~l~~~~~~~--~~~~~~~~~--~--~~~L~~~~p 218 (497)
...+.+-|++.|+-|+++|+..++=.=+-|=..-.-. .+. |+-....+.. .+....... . --.+||+|+
T Consensus 165 r~i~~~~Vk~yI~~ah~~Gmkam~Ynmiyaa~~~~~~----~gv~~eW~ly~d~~~~~~~~~~l~~~w~s~lyl~dP~N~ 240 (559)
T PF13199_consen 165 RQISTSTVKDYINAAHKYGMKAMAYNMIYAANNNYEE----DGVSPEWGLYKDDSHSNQDTYDLPDGWPSDLYLMDPGNP 240 (559)
T ss_dssp -EEEHHHHHHHHHHHHHTT-EEEEEEESSEEETT--S------SS-GGBEEESSSBTSB-EEEETT-E--EEEEB-TT-H
T ss_pred CEehHHHHHHHHHHHHHcCcceehhHhhhccccCccc----ccCCchhhhhhccCCCccceeecCcccccceEEecCCCH
Confidence 3477888999999999999999987555544333111 110 1100000000 000000000 0 125899999
Q ss_pred hHHHHHHHHHHHHHhccCCCCCeEEE
Q psy14880 219 HVYTVLKDIYGELMGMAKYGDEMFHM 244 (497)
Q Consensus 219 ~t~~fl~~vl~E~~~lF~~~~~~iHi 244 (497)
+=.+.|-+=+.++.+-|. ..=|||
T Consensus 241 ~WQ~yI~~q~~~~~~~~g--FDG~hl 264 (559)
T PF13199_consen 241 EWQNYIINQMNKAIQNFG--FDGWHL 264 (559)
T ss_dssp HHHHHHHHHHHHHHHHHT----EEEE
T ss_pred HHHHHHHHHHHHHHHccC--CceEee
Confidence 988888888888888888 888997
No 102
>TIGR00048 radical SAM enzyme, Cfr family. A Staphylococcus sciuri plasmid-borne member of this family, Cfr, has been identified as essential to transferrable resistance to chloramphenicol and florfenicol by an unknown mechanism. A 14-15 residue cluster with four perfectly conserved Cys residues suggests this protein may be an enzyme with an iron-sulfur cluster. The Cys cluster is part of the radical SAM domain, suggested to provide a general mechanism by which the Fe-S center cleaves S-adenosylmethionine to initiate radical-based catalysis. Members of this family lack apparent transmembrane domains.
Probab=26.85 E-value=2.2e+02 Score=29.89 Aligned_cols=75 Identities=19% Similarity=0.120 Sum_probs=47.1
Q ss_pred eEEeccCCcccccccccCCCCCCCccCChhHHHH-HHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880 74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQE-LVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPL 151 (497)
Q Consensus 74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~-~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ 151 (497)
+-+|-.|++-..--.| .++.|..+++.+ +.+|.+..|.++.+++. +|+. -=|.+
T Consensus 219 iSL~a~~~e~r~~l~p------~~~~~~l~~ll~~l~~~~~~~g~~VtieyvLI~Gv------------------NDs~e 274 (355)
T TIGR00048 219 ISLHAPNDELRSSLMP------INKKYNIETLLAAVRRYLNKTGRRVTFEYVLLDGV------------------NDQVE 274 (355)
T ss_pred EEeCCCCHHHHHHhcC------cccCCCHHHHHHHHHHHHHHhCCEEEEEEEEECCC------------------CCCHH
Confidence 6677777655432222 123455566664 34577777878776665 2432 15789
Q ss_pred HHHHHHHHHHHc--CCEEEEecC
Q psy14880 152 EVQELVHYAKLR--GIRVIFELD 172 (497)
Q Consensus 152 ei~eiv~yA~~r--gI~VIPEID 172 (497)
++++|.++++.. +|.+||.-.
T Consensus 275 ~a~~La~llk~l~~~VnLIPynp 297 (355)
T TIGR00048 275 HAEELAELLKGTKCKVNLIPWNP 297 (355)
T ss_pred HHHHHHHHHhcCCCceEEEeccc
Confidence 999999999986 556776543
No 103
>PRK10785 maltodextrin glucosidase; Provisional
Probab=26.73 E-value=55 Score=36.87 Aligned_cols=30 Identities=23% Similarity=0.434 Sum_probs=26.2
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|+++||+-|.+|||+||-.+ ++.|++.
T Consensus 224 t~~df~~Lv~~aH~rGikVilD~-V~NH~~~ 253 (598)
T PRK10785 224 GDAALLRLRHATQQRGMRLVLDG-VFNHTGD 253 (598)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-CCCcCCC
Confidence 88999999999999999999875 4577765
No 104
>TIGR02102 pullulan_Gpos pullulanase, extracellular, Gram-positive. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. In contrast, a glycogen debranching enzyme such GlgX, homologous to this family, can release glucose at alpha,1-6 linkages from glycogen first subjected to limit degradation by phosphorylase. Characterized members of this family include a surface-located pullulanase from Streptococcus pneumoniae (PubMed:11083842) and an extracellular bifunctional amylase/pullulanase with C-terminal pullulanase activity (PubMed:8798645).
Probab=26.72 E-value=3.3e+02 Score=33.31 Aligned_cols=80 Identities=16% Similarity=0.133 Sum_probs=49.0
Q ss_pred HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcc-cc-cCCCCCCccccCCCCCcCCCCChhHHHHHHHH
Q psy14880 150 PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLI-VC-LNDPSWRANCIQPPCGQLNPINPHVYTVLKDI 227 (497)
Q Consensus 150 ~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~-~~-~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~v 227 (497)
-+|+|++|+-|.++||+||-.+ ++-|+... ..+... .|..- .+ .++.+...+ ..+.++..+|.+.+|+.+.
T Consensus 554 i~EfK~LV~alH~~GI~VILDV-VyNHt~~~-~~f~~~-~p~Yy~~~~~~G~~~~~~----~g~~l~~e~~~vrk~iiDs 626 (1111)
T TIGR02102 554 IAEFKNLINEIHKRGMGVILDV-VYNHTAKV-YIFEDL-EPNYYHFMDADGTPRTSF----GGGRLGTTHEMSRRILVDS 626 (1111)
T ss_pred HHHHHHHHHHHHHCCCEEEEec-cccccccc-cccccc-CCCceEeeCCCCCccccc----CCCCCCcCCHHHHHHHHHH
Confidence 4899999999999999999875 56887542 111100 01100 00 011111111 1245778889999999988
Q ss_pred HHHHHhccC
Q psy14880 228 YGELMGMAK 236 (497)
Q Consensus 228 l~E~~~lF~ 236 (497)
+.-.++-|.
T Consensus 627 l~yWv~ey~ 635 (1111)
T TIGR02102 627 IKYLVDEFK 635 (1111)
T ss_pred HHHHHHhcC
Confidence 888877555
No 105
>PRK14456 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=26.69 E-value=1.6e+02 Score=31.17 Aligned_cols=74 Identities=20% Similarity=0.242 Sum_probs=46.1
Q ss_pred eEEeccCCcccccccccCCCCCCCccCChhHHHHHH-HHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880 74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV-HYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPL 151 (497)
Q Consensus 74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~-~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ 151 (497)
+.+|-.|++-+.--.|.- .+.|.-+++.+.+ +|++..|.++.+++. +|+. -=|.+
T Consensus 238 iSL~a~~~e~r~~i~P~~-----~~~~~l~~l~~~i~~~~~~~g~~V~ieyvLI~Gv------------------NDs~e 294 (368)
T PRK14456 238 VSLHSADQEKRERLMPQA-----ARDYPLDELREALIGYASKTGEPVTLVYMLLEGI------------------NDSPE 294 (368)
T ss_pred EEecCCCHHHHHHhcccc-----CCCCCHHHHHHHHHHHHHhcCCeEEEEEEEEcCC------------------CCCHH
Confidence 556776765544333311 1245666777755 477777877766554 2322 15788
Q ss_pred HHHHHHHHHHHc--CCEEEEe
Q psy14880 152 EVQELVHYAKLR--GIRVIFE 170 (497)
Q Consensus 152 ei~eiv~yA~~r--gI~VIPE 170 (497)
|+++|+++++.+ .|++||-
T Consensus 295 da~~L~~~l~~~~~~VnlIpy 315 (368)
T PRK14456 295 DARKLIRFASRFFCKINLIDY 315 (368)
T ss_pred HHHHHHHHHhcCCCeeEEeee
Confidence 999999999987 5667764
No 106
>PRK14507 putative bifunctional 4-alpha-glucanotransferase/malto-oligosyltrehalose synthase; Provisional
Probab=26.62 E-value=2.6e+02 Score=35.66 Aligned_cols=71 Identities=18% Similarity=0.306 Sum_probs=48.7
Q ss_pred ccCChhHHHHHHHHHHHcCceee---eecCCCCCCCCCCCCCCCCCCCCCCCCC-------CHHHHHHHHHHHHHcCCEE
Q psy14880 98 KVYTPLEVQELVHYAKLRGVRVI---FELDAPAHAGNGWQWGPRFGAYSPAKVY-------TPLEVQELVHYAKLRGIRV 167 (497)
Q Consensus 98 ~~y~~~~i~~~~~y~~~rG~ri~---~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-------T~~ei~eiv~yA~~rgI~V 167 (497)
..++-+++.+.+.|.+.-|+.-+ |-+..+ .++ ..+|....++ |.++++.|++-|+++||.|
T Consensus 753 ~~~tf~~~~~~l~Yl~~LGv~~i~lsPi~~a~--~gs-------~hGYdv~D~~~idp~lG~~edf~~Lv~~ah~~Gi~v 823 (1693)
T PRK14507 753 KDFTFADAEAILPYLAALGISHVYASPILKAR--PGS-------THGYDIVDHSQINPEIGGEEGFERFCAALKAHGLGQ 823 (1693)
T ss_pred CCCCHHHHHHHhHHHHHcCCCEEEECCCcCCC--CCC-------CCCCCCCCCCccCcccCCHHHHHHHHHHHHHCCCEE
Confidence 35666789999999998888532 222111 111 1233333333 8999999999999999999
Q ss_pred EEecCCcccch
Q psy14880 168 IFELDAPAHAG 178 (497)
Q Consensus 168 IPEID~PGH~~ 178 (497)
|-.| +|.|++
T Consensus 824 ilDi-V~NH~~ 833 (1693)
T PRK14507 824 LLDI-VPNHMG 833 (1693)
T ss_pred EEEe-cccccC
Confidence 9886 678887
No 107
>PRK01060 endonuclease IV; Provisional
Probab=26.35 E-value=3.4e+02 Score=26.74 Aligned_cols=96 Identities=11% Similarity=0.049 Sum_probs=56.8
Q ss_pred eeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHHH
Q psy14880 31 TKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELVH 110 (497)
Q Consensus 31 G~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~~ 110 (497)
|+|..+.+. +.+.|+.++..++ +.+.+-+..-..|. ...++.+.++++.+
T Consensus 5 g~~~~~~~~------~~~~l~~~~~~G~-------------d~vEl~~~~p~~~~-----------~~~~~~~~~~~lk~ 54 (281)
T PRK01060 5 GAHVSAAGG------LEGAVAEAAEIGA-------------NAFMIFTGNPQQWK-----------RKPLEELNIEAFKA 54 (281)
T ss_pred EEeeecCCC------HHHHHHHHHHcCC-------------CEEEEECCCCCCCc-----------CCCCCHHHHHHHHH
Confidence 555555554 7788889998885 56665544322221 12356777888877
Q ss_pred HHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCC---CCCHHHHHHHHHHHHHcCCEEE
Q psy14880 111 YAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAK---VYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 111 y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~---~YT~~ei~eiv~yA~~rgI~VI 168 (497)
..+.+|+++ .... |-. .+ .+ +++. --+.+.++..++.|++.|+.+|
T Consensus 55 ~~~~~gl~~~~~~~h~--~~~-~n-------l~--~~d~~~r~~s~~~~~~~i~~A~~lga~~v 106 (281)
T PRK01060 55 ACEKYGISPEDILVHA--PYL-IN-------LG--NPNKEILEKSRDFLIQEIERCAALGAKLL 106 (281)
T ss_pred HHHHcCCCCCceEEec--ceE-ec-------CC--CCCHHHHHHHHHHHHHHHHHHHHcCCCEE
Confidence 788888873 2111 100 00 00 1111 2456789999999999999643
No 108
>PRK00278 trpC indole-3-glycerol-phosphate synthase; Reviewed
Probab=26.29 E-value=1.1e+02 Score=30.51 Aligned_cols=34 Identities=21% Similarity=0.361 Sum_probs=28.4
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW 181 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l 181 (497)
++.+++++++++|+.+|.+++-|+...-=...+.
T Consensus 144 l~~~~l~~li~~a~~lGl~~lvevh~~~E~~~A~ 177 (260)
T PRK00278 144 LDDEQLKELLDYAHSLGLDVLVEVHDEEELERAL 177 (260)
T ss_pred CCHHHHHHHHHHHHHcCCeEEEEeCCHHHHHHHH
Confidence 6789999999999999999999988776444443
No 109
>PTZ00372 endonuclease 4-like protein; Provisional
Probab=26.00 E-value=1.4e+02 Score=32.14 Aligned_cols=25 Identities=12% Similarity=0.143 Sum_probs=18.8
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPE 170 (497)
|..-.+-++.|+...+-.+|.+|=|
T Consensus 366 G~Ig~~~f~~l~~~~~~~~iP~ILE 390 (413)
T PTZ00372 366 GKLGMETFKFIMNSKYFKNIPIILE 390 (413)
T ss_pred CCcChHHHHHHHhChhhCCCeEEEe
Confidence 4566777888888777778888766
No 110
>PF03102 NeuB: NeuB family; InterPro: IPR013132 NeuB is the prokaryotic N-acetylneuraminic acid synthase (Neu5Ac). It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only been observed in prokaryotes; eukaryotes synthesise the 9-phosphate form, Neu5Ac-9-P, and utilise ManNAc-6-P instead of ManNAc. Such eukaryotic enzymes are not present in this family []. This family also contains SpsE spore coat polysaccharide biosynthesis proteins.; GO: 0016051 carbohydrate biosynthetic process; PDB: 3G8R_B 1XUU_A 1XUZ_A 3CM4_A 2ZDR_A 1VLI_A 2WQP_A.
Probab=25.84 E-value=65 Score=32.08 Aligned_cols=25 Identities=24% Similarity=0.246 Sum_probs=20.0
Q ss_pred CCCHHHHHHHHHHHHHcCCEEEEec
Q psy14880 147 VYTPLEVQELVHYAKLRGIRVIFEL 171 (497)
Q Consensus 147 ~YT~~ei~eiv~yA~~rgI~VIPEI 171 (497)
.+|.+++++|.+||+++||..+-..
T Consensus 52 el~~e~~~~L~~~~~~~gi~f~stp 76 (241)
T PF03102_consen 52 ELSEEQHKELFEYCKELGIDFFSTP 76 (241)
T ss_dssp SS-HHHHHHHHHHHHHTT-EEEEEE
T ss_pred cCCHHHHHHHHHHHHHcCCEEEECC
Confidence 3899999999999999999887543
No 111
>PF09508 Lact_bio_phlase: Lacto-N-biose phosphorylase; InterPro: IPR012711 The gene which codes for this protein in gut-bacteria is located in a novel putative operon for galactose metabolism. The protein appears to be a carbohydrate-processing phosphorolytic enzyme (2.4.1.211 from EC), unlike either glycoside hydrolases or glycoside lyase. Intestinal colonisation by Bifidobacteria is important for human health, especially in paediatrics, because colonisation seems to prevent infection by some pathogenic bacteria that cause diarrhoea or other illnesses. The operon seems to be involved in intestinal colonisation by Bifidobacteria mediated by metabolism of mucin sugars. In addition, it may also resolve the question of the nature of the bifidus factor in human milk as the lacto-N-biose structure found in milk oligosaccharides. ; GO: 0016758 transferase activity, transferring hexosyl groups; PDB: 2ZUW_A 2ZUU_C 2ZUT_D 2ZUV_A 2ZUS_B.
Probab=24.95 E-value=5.2e+02 Score=29.65 Aligned_cols=100 Identities=13% Similarity=0.179 Sum_probs=55.2
Q ss_pred CcCCCCChhHHHHHHHHHHHHHhccCCC-----CCeE-----EEcCCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCC
Q psy14880 211 GQLNPINPHVYTVLKDIYGELMGMAKYG-----DEMF-----HMGADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSP 280 (497)
Q Consensus 211 ~~L~~~~p~t~~fl~~vl~E~~~lF~~~-----~~~i-----HiGgDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~ 280 (497)
-.+||..|+|.+++.+-+++.++-.|.. ..+| -.|-++ .
T Consensus 175 ipfDvr~p~t~~~~~~~L~~wl~~hP~~dVVRFTTFFy~Ftl~f~~~~-------------------------------k 223 (716)
T PF09508_consen 175 IPFDVRQPKTREYVLEWLRKWLEEHPDTDVVRFTTFFYQFTLIFNEDA-------------------------------K 223 (716)
T ss_dssp ----TTSHHHHHHHHHHHHHHHHT-TT--EEEES--SB--EEEEBTTS-------------------------------S
T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCcEEEeeeeeeeeEEEecccc-------------------------------c
Confidence 4799999999999999999999988852 1111 123332 1
Q ss_pred CcceeecccccC--CHHHH-HHHHHcCCCCC--------------------cccHHHHHHHHHHHHHHHHHHHhhCCCce
Q psy14880 281 HLWTVFIKCWNN--TPAIV-DFMLAHGKNTT--------------------FDDYIQLWAHFQNKAAASLDEAVGHNRTK 337 (497)
Q Consensus 281 ~~~~v~~~~w~~--~p~~~-~~~~~~g~~~~--------------------~~~~~~l~~~F~~~~~~~l~~~~~~~g~~ 337 (497)
+.+ |+...+.. ||... ++-++.|..-+ ..||.+.++.|+-+.++.+-+.+.+.||+
T Consensus 224 ekf-vDWfGY~asVSp~aLe~FEke~GY~lrpEdfid~GyyNstfrvPsk~~~D~~~F~~~fV~~~akelVDivH~~GKe 302 (716)
T PF09508_consen 224 EKF-VDWFGYSASVSPYALEQFEKEYGYRLRPEDFIDQGYYNSTFRVPSKEFRDWMDFQQRFVAEFAKELVDIVHEYGKE 302 (716)
T ss_dssp EEE-EETT--GGGB-HHHHHHHHHHHSS---THHHHGGGTTT-TTS---HHHHHHHHHHHHHHHHHHHHHHHHHHHTT-E
T ss_pred ccc-ccccCcCcccCHHHHHHHHHHhCCCCCHHHhhccccccCcccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCe
Confidence 111 33333333 35444 44444466321 13466677777777777777777789998
Q ss_pred EEEec
Q psy14880 338 LIVWS 342 (497)
Q Consensus 338 ~~~W~ 342 (497)
.|+.-
T Consensus 303 AmMFl 307 (716)
T PF09508_consen 303 AMMFL 307 (716)
T ss_dssp EEEES
T ss_pred EEEec
Confidence 88763
No 112
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=24.88 E-value=2.6e+02 Score=29.86 Aligned_cols=135 Identities=14% Similarity=0.203 Sum_probs=68.8
Q ss_pred CChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880 100 YTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAK-VYTPLEVQELVHYAKLRGIRVIFELDAPAHA 177 (497)
Q Consensus 100 y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~-~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~ 177 (497)
++.+++.++++--+.+++.+. ..+|.. + ...++.|+-+. .+ .+.+++++..+++||+|++-++--=+.
T Consensus 40 ~~~~~v~~~i~~~~~~~iP~d~~~iD~~-~-------~~~~~~f~~d~~~F--Pd~~~~~~~l~~~G~~~~~~~~P~v~~ 109 (441)
T PF01055_consen 40 YNQDEVREVIDRYRSNGIPLDVIWIDDD-Y-------QDGYGDFTWDPERF--PDPKQMIDELHDQGIKVVLWVHPFVSN 109 (441)
T ss_dssp TSHHHHHHHHHHHHHTT--EEEEEE-GG-G-------SBTTBTT-B-TTTT--TTHHHHHHHHHHTT-EEEEEEESEEET
T ss_pred CCHHHHHHHHHHHHHcCCCccceecccc-c-------cccccccccccccc--cchHHHHHhHhhCCcEEEEEeecccCC
Confidence 467788888877777887654 233322 1 11122322111 22 279999999999999999776542222
Q ss_pred hhh----hhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880 178 GNG----WQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADE 248 (497)
Q Consensus 178 ~a~----l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE 248 (497)
..- .....+.+ -+..-.+++++....-......+|.++|++.++.++.++++.+.++ -..+=+=.-|
T Consensus 110 ~~~~~~~~~~~~~~~--~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~G--vdg~w~D~~E 180 (441)
T PF01055_consen 110 DSPDYENYDEAKEKG--YLVKNPDGSPYIGRVWPGKGGFIDFTNPEARDWWKEQLKELLDDYG--VDGWWLDFGE 180 (441)
T ss_dssp TTTB-HHHHHHHHTT---BEBCTTSSB-EEEETTEEEEEB-TTSHHHHHHHHHHHHHHHTTST---SEEEEESTT
T ss_pred CCCcchhhhhHhhcC--ceeecccCCcccccccCCcccccCCCChhHHHHHHHHHHHHHhccC--CceEEeecCC
Confidence 210 00000111 0110011111111000001357999999999999999999988866 5555555555
No 113
>cd09013 BphC-JF8_N_like N-terminal, non-catalytic, domain of BphC_JF8, (2,3-dihydroxybiphenyl 1,2-dioxygenase) from Bacillus sp. JF8 and similar proteins. 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC) catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, a key step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). BphC belongs to the type I extradiol dioxygenase family, which requires a metal ion in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of BphCs. This subfamily of BphC is represented by the enzyme purified from the thermophilic biphenyl and naphthalene degrader, Bacillus sp. JF8. The members in this family of BphC enzymes may use either Mn(II) or Fe(II) as cofactors. The enzyme purified from Bacillus sp. JF8 is Mn(II)-dependent, however, the enzyme from Rhodococcus jostii RHAI has Fe(II) bound to it. BphC_JF8 is thermostable and its optimum activity is at 85 degrees C
Probab=24.71 E-value=68 Score=27.08 Aligned_cols=32 Identities=16% Similarity=0.243 Sum_probs=26.4
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
-|.+++.++.+..+++||++.++...|||..+
T Consensus 71 ~~~~~v~~~~~~l~~~G~~~~~~~~~~~~~~~ 102 (121)
T cd09013 71 SSPEALERRVAALEASGLGIGWIEGDPGHGKA 102 (121)
T ss_pred CCHHHHHHHHHHHHHcCCccccccCCCCCcce
Confidence 57899999999999999998665566777654
No 114
>cd07939 DRE_TIM_NifV Streptomyces rubellomurinus FrbC and related proteins, catalytic TIM barrel domain. FrbC (NifV) of Streptomyces rubellomurinus catalyzes the condensation of acetyl-CoA and alpha-ketoglutarate to form homocitrate and CoA, a reaction similar to one catalyzed by homocitrate synthase. The gene encoding FrbC is one of several genes required for the biosynthesis of FR900098, a potent antimalarial antibiotic. This protein is also required for assembly of the nitrogenase MoFe complex but its exact role is unknown. This family also includes the NifV proteins of Heliobacterium chlorum and Gluconacetobacter diazotrophicus, which appear to be orthologous to FrbC. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarbox
Probab=24.27 E-value=2.4e+02 Score=27.91 Aligned_cols=20 Identities=10% Similarity=-0.195 Sum_probs=13.1
Q ss_pred CCCCHHHHHHhhhhhhcccc
Q psy14880 39 HLTDPEIILNYLDPQTCGTQ 58 (497)
Q Consensus 39 ~~~~~~~lk~~id~ma~~K~ 58 (497)
.+++.++.+++++.|...++
T Consensus 15 ~~~~~~~k~~i~~~L~~~Gv 34 (259)
T cd07939 15 VAFSREEKLAIARALDEAGV 34 (259)
T ss_pred CCCCHHHHHHHHHHHHHcCC
Confidence 35667777777777666654
No 115
>COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only]
Probab=24.09 E-value=90 Score=32.45 Aligned_cols=29 Identities=28% Similarity=0.406 Sum_probs=25.7
Q ss_pred CCCCCCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880 142 YSPAKVYTPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 142 ~s~~~~YT~~ei~eiv~yA~~rgI~VIPE 170 (497)
|++++--+...++||.++|+..||+||+-
T Consensus 166 Y~p~E~ns~~l~eelk~~A~~~Gl~vve~ 194 (322)
T COG2984 166 YNPGEANSVSLVEELKKEARKAGLEVVEA 194 (322)
T ss_pred eCCCCcccHHHHHHHHHHHHHCCCEEEEE
Confidence 46677789999999999999999999984
No 116
>PRK04296 thymidine kinase; Provisional
Probab=24.06 E-value=80 Score=29.77 Aligned_cols=25 Identities=24% Similarity=0.183 Sum_probs=22.3
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPE 170 (497)
++++.+++.+|++.++..|++||=-
T Consensus 88 q~l~~~~v~~l~~~l~~~g~~vi~t 112 (190)
T PRK04296 88 QFLDKEQVVQLAEVLDDLGIPVICY 112 (190)
T ss_pred ccCCHHHHHHHHHHHHHcCCeEEEE
Confidence 5689999999999999999998864
No 117
>cd00308 enolase_like Enolase-superfamily, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Enolase superfamily contains different enzymes, like enolases, glutarate-, fucanate- and galactonate dehydratases, o-succinylbenzoate synthase, N-acylamino acid racemase, L-alanine-DL-glutamate epimerase, mandelate racemase, muconate lactonizing enzyme and 3-methylaspartase.
Probab=23.91 E-value=1.8e+02 Score=28.02 Aligned_cols=22 Identities=27% Similarity=0.305 Sum_probs=19.4
Q ss_pred CHHHHHHHHHHHHHcCCEEEEe
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPE 170 (497)
.-.+.+++.++|+++||.++|-
T Consensus 181 Gi~~~~~i~~~a~~~gi~~~~~ 202 (229)
T cd00308 181 GLTESRRAADLAEAFGIRVMVH 202 (229)
T ss_pred CHHHHHHHHHHHHHcCCEEeec
Confidence 3467899999999999999996
No 118
>PRK14335 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=23.87 E-value=1.3e+02 Score=32.73 Aligned_cols=52 Identities=15% Similarity=0.137 Sum_probs=35.2
Q ss_pred CccCChhHHHHHHHHHHHc--CceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880 97 AKVYTPLEVQELVHYAKLR--GVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR 166 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~r--G~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~ 166 (497)
++.|+.+++.++++..+.. |+.+...+ +-++++ =|.+++++.++++++.+..
T Consensus 287 ~R~~t~e~~~~~v~~ir~~~pgi~i~~d~-IvGfPg-----------------ET~edf~~Tl~~i~~l~~~ 340 (455)
T PRK14335 287 NRSYTREHYLSLVGKLKASIPNVALSTDI-LIGFPG-----------------ETEEDFEQTLDLMREVEFD 340 (455)
T ss_pred CCCCCHHHHHHHHHHHHHhCCCCEEEEEE-EEeCCC-----------------CCHHHHHHHHHHHHhcCCC
Confidence 3568888888888887765 66543222 111111 3899999999999988753
No 119
>TIGR00190 thiC thiamine biosynthesis protein ThiC. The thiC ortholog is designated thiA in Bacillus subtilis.
Probab=23.76 E-value=64 Score=34.51 Aligned_cols=23 Identities=26% Similarity=0.485 Sum_probs=19.0
Q ss_pred HHHHHHHHHcCCEEEEecCCcccch
Q psy14880 154 QELVHYAKLRGIRVIFELDAPAHAG 178 (497)
Q Consensus 154 ~eiv~yA~~rgI~VIPEID~PGH~~ 178 (497)
-||++.|++.||.||- +-|||.-
T Consensus 246 geL~~rA~e~gVQvMV--EGPGHvP 268 (423)
T TIGR00190 246 GELVERAREADVQCMV--EGPGHVP 268 (423)
T ss_pred HHHHHHHHHcCCeEEE--ECCCCCc
Confidence 3456899999999995 6999983
No 120
>TIGR03581 EF_0839 conserved hypothetical protein EF_0839/AHA_3917. Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.
Probab=23.64 E-value=1.6e+02 Score=29.01 Aligned_cols=22 Identities=27% Similarity=0.270 Sum_probs=16.0
Q ss_pred CCHHHHHHHHHHHHHcCC-EEEE
Q psy14880 148 YTPLEVQELVHYAKLRGI-RVIF 169 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI-~VIP 169 (497)
.+.+.+.+|++.|.+-|| .|||
T Consensus 186 Idl~Nf~~I~~i~ldaGv~kviP 208 (236)
T TIGR03581 186 IDLDNFEEIVQIALDAGVEKVIP 208 (236)
T ss_pred ccHHhHHHHHHHHHHcCCCeecc
Confidence 677777777777777777 4666
No 121
>PF00218 IGPS: Indole-3-glycerol phosphate synthase; InterPro: IPR013798 Indole-3-glycerol phosphate synthase (4.1.1.48 from EC) (IGPS) catalyses the fourth step in the biosynthesis of tryptophan, the ring closure of 1-(2-carboxy-phenylamino)-1-deoxyribulose into indol-3-glycerol-phosphate. In some bacteria, IGPS is a single chain enzyme. In others, such as Escherichia coli, it is the N-terminal domain of a bifunctional enzyme that also catalyses N-(5'-phosphoribosyl)anthranilate isomerase (5.3.1.24 from EC) (PRAI) activity (see IPR001240 from INTERPRO), the third step of tryptophan biosynthesis. In fungi, IGPS is the central domain of a trifunctional enzyme that contains a PRAI C-terminal domain and a glutamine amidotransferase (2.4.2 from EC) (GATase) N-terminal domain (see IPR000991 from INTERPRO). A structure of the IGPS domain of the bifunctional enzyme from the mesophilic bacterium E. coli (eIGPS) has been compared with the monomeric indole-3-glycerol phosphate synthase from the hyperthermophilic archaeon Sulfolobus solfataricus (sIGPS). Both are single-domain (beta/alpha)8 barrel proteins, with one (eIGPS) or two (sIGPS) additional helices inserted before the first beta strand []. ; GO: 0004425 indole-3-glycerol-phosphate synthase activity; PDB: 1VC4_A 1PII_A 1JCM_P 1I4N_B 1J5T_A 3TSM_B 4FB7_A 3QJA_A 1JUL_A 2C3Z_A ....
Probab=23.49 E-value=85 Score=31.55 Aligned_cols=39 Identities=15% Similarity=0.294 Sum_probs=33.8
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG 184 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~ 184 (497)
..++++++++++++|..+|++++-|+-+.--...++...
T Consensus 140 ~~L~~~~l~~l~~~a~~lGle~lVEVh~~~El~~al~~~ 178 (254)
T PF00218_consen 140 AILSDDQLEELLELAHSLGLEALVEVHNEEELERALEAG 178 (254)
T ss_dssp GGSGHHHHHHHHHHHHHTT-EEEEEESSHHHHHHHHHTT
T ss_pred HhCCHHHHHHHHHHHHHcCCCeEEEECCHHHHHHHHHcC
Confidence 358999999999999999999999999999988877543
No 122
>cd03318 MLE Muconate Lactonizing Enzyme (MLE), an homooctameric enzyme, catalyses the conversion of cis,cis-muconate (CCM) to muconolactone (ML) in the catechol branch of the beta-ketoadipate pathway. This pathway is used in soil microbes to breakdown lignin-derived aromatics, catechol and protocatechuate, to citric acid cycle intermediates. Some bacterial species are also capable of dehalogenating chloroaromatic compounds by the action of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs are members of the enolase superfamily characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and that is stabilized by coordination to the essential Mg2+ ion.
Probab=23.07 E-value=1.6e+02 Score=30.68 Aligned_cols=28 Identities=7% Similarity=-0.064 Sum_probs=22.2
Q ss_pred ceeeeccCCCCCCHHHHHHhhhhhhcccc
Q psy14880 30 RTKLIVWSSHLTDPEIILNYLDPQTCGTQ 58 (497)
Q Consensus 30 RG~mlDvaR~~~~~~~lk~~id~ma~~K~ 58 (497)
-.+|||.-.. ++++...++++.+..+++
T Consensus 189 ~~l~iDaN~~-~~~~~A~~~~~~l~~~~~ 216 (365)
T cd03318 189 ASVRVDVNQA-WDESTAIRALPRLEAAGV 216 (365)
T ss_pred cEEEEECCCC-CCHHHHHHHHHHHHhcCc
Confidence 3699999644 688888899999888864
No 123
>PLN02368 alanine transaminase
Probab=23.03 E-value=83 Score=33.56 Aligned_cols=24 Identities=4% Similarity=0.221 Sum_probs=21.6
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
..||++++++|++.|++++|-||-
T Consensus 225 ~v~s~e~l~~l~~~a~~~~~~II~ 248 (407)
T PLN02368 225 QCLSEANLREILKFCYQERLVLLG 248 (407)
T ss_pred ccCCHHHHHHHHHHHHHcCCEEEE
Confidence 459999999999999999998874
No 124
>COG3618 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=22.96 E-value=8e+02 Score=25.08 Aligned_cols=164 Identities=18% Similarity=0.219 Sum_probs=88.9
Q ss_pred ceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHH
Q psy14880 30 RTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV 109 (497)
Q Consensus 30 RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~ 109 (497)
||+++.++=.--..+...+.+..|+..+ +- + +-..-++ .|.+ ...+++..
T Consensus 53 ~~V~vq~~~~~~D~~~e~~~v~~~~~~~-g~--~---------vg~id~~--------~~e~----------~a~L~~~~ 102 (279)
T COG3618 53 GGVLVQVNVDPRDNEKELAFVAELAERH-GG--I---------VGVIDEC--------RPEF----------AAKLERAR 102 (279)
T ss_pred eeEEEecccCccchHHHHHHHHhhHHhh-Cc--e---------EEEEecC--------CchH----------HHHHHHhc
Confidence 7889999888888999999999999876 10 0 0000000 0111 11222222
Q ss_pred HHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhh----hhhcC
Q psy14880 110 HYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNG----WQWGP 185 (497)
Q Consensus 110 ~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~----l~~~p 185 (497)
|.+++|+|.... ..|. +.+-..++++.++-.+.+|+.+ ++.+..|-... .+..|
T Consensus 103 -~~~~~GvR~~l~-~~p~------------------~~~~a~~~r~~~~rL~~~gl~f--dl~~~~~ql~~~i~l~~~~P 160 (279)
T COG3618 103 -YPFFRGVRRNLH-VVPD------------------GLFEAPAWRANVERLAKLGLHF--DLQVDPHQLPDLIPLALKAP 160 (279)
T ss_pred -ccccceeeehhh-cCCc------------------cchhhHHHHHHHHHHHhcCCeE--EEEeChhhhHHHHHHHhhCC
Confidence 344788886531 1121 1133378999999999998876 44444443222 12222
Q ss_pred CC-------CCCCcccccCCCCCCcc----ccCC-----CCCcCCCCChh-HHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880 186 RY-------GLGDLIVCLNDPSWRAN----CIQP-----PCGQLNPINPH-VYTVLKDIYGELMGMAKYGDEMFHMGADE 248 (497)
Q Consensus 186 ~~-------~~~~l~~~~~~~~~~~~----~~~~-----~~~~L~~~~p~-t~~fl~~vl~E~~~lF~~~~~~iHiGgDE 248 (497)
+. |.|+... ....+|... ...| -.+.+-.+.+. +++++..+..++++.|+ ...+-.|+|=
T Consensus 161 d~~~VldH~G~p~~~~-~~~~~w~~~m~~la~~pNv~~KlSG~~~~~~~~w~~~~v~p~~e~~i~~fg--~dR~vfGSdw 237 (279)
T COG3618 161 DVNFVLDHCGRPDIKI-NLEDPWKAALARLARRPNVWAKLSGVYAYSDESWTVEDVRPYVEELIELFG--WDRFVFGSDW 237 (279)
T ss_pred CCCEEeccCCCCCccc-cccCHHHHHHHHHHhCCCeEEEEeeecccccCCCCHHHHHHHHHHHHHhcC--ccceEecCCC
Confidence 11 1110000 000122100 0000 11334445555 79999999999999999 8888899987
No 125
>PRK14333 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=22.78 E-value=1.3e+02 Score=32.44 Aligned_cols=52 Identities=12% Similarity=0.155 Sum_probs=33.4
Q ss_pred CccCChhHHHHHHHHHHHc--CceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880 97 AKVYTPLEVQELVHYAKLR--GVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR 166 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~r--G~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~ 166 (497)
.+.|+.+++.++++..+.. |+.+...+ +-++++ =|.+++++.++++++.+++
T Consensus 284 ~R~~t~e~~~~~i~~lr~~~p~i~i~~d~-IvGfPg-----------------ET~edf~~tl~~l~~~~~~ 337 (448)
T PRK14333 284 ARGYTHEKYRRIIDKIREYMPDASISADA-IVGFPG-----------------ETEAQFENTLKLVEEIGFD 337 (448)
T ss_pred CCCCCHHHHHHHHHHHHHhCCCcEEEeeE-EEECCC-----------------CCHHHHHHHHHHHHHcCCC
Confidence 3567888888888777665 33222111 112211 5899999999999988764
No 126
>PRK15452 putative protease; Provisional
Probab=22.57 E-value=3e+02 Score=29.92 Aligned_cols=105 Identities=10% Similarity=0.006 Sum_probs=0.0
Q ss_pred cccccCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCC
Q psy14880 86 CLLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGI 165 (497)
Q Consensus 86 ~e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI 165 (497)
+..++|+.+..+..|+.+++++.++++..+|.++...+..-.+. --.+++.+.++...+.||
T Consensus 29 ~G~~~~~~R~~~~~f~~edl~eav~~ah~~g~kvyvt~n~i~~e------------------~el~~~~~~l~~l~~~gv 90 (443)
T PRK15452 29 AGQPRYSLRVRNNEFNHENLALGINEAHALGKKFYVVVNIAPHN------------------AKLKTFIRDLEPVIAMKP 90 (443)
T ss_pred ECCCccchhhhccCCCHHHHHHHHHHHHHcCCEEEEEecCcCCH------------------HHHHHHHHHHHHHHhCCC
Q ss_pred EEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHH
Q psy14880 166 RVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDI 227 (497)
Q Consensus 166 ~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~v 227 (497)
.=|-=-| ||-...+-+..|.+. ...+. ++|+.|..+.+|++++
T Consensus 91 DgvIV~d-~G~l~~~ke~~p~l~------ih~st------------qlni~N~~a~~f~~~l 133 (443)
T PRK15452 91 DALIMSD-PGLIMMVREHFPEMP------IHLSV------------QANAVNWATVKFWQQM 133 (443)
T ss_pred CEEEEcC-HHHHHHHHHhCCCCe------EEEEe------------cccCCCHHHHHHHHHC
No 127
>TIGR00387 glcD glycolate oxidase, subunit GlcD. This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Probab=22.48 E-value=78 Score=33.82 Aligned_cols=28 Identities=7% Similarity=0.139 Sum_probs=23.5
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFELDAPAHA 177 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~ 177 (497)
=|.+|+.+++++|++++|.|+|- .-||.
T Consensus 5 ~s~eev~~iv~~a~~~~i~v~~~--G~Gt~ 32 (413)
T TIGR00387 5 KNTEQVARILKLCHEHRIPIVPR--GAGTG 32 (413)
T ss_pred CCHHHHHHHHHHHHHcCCcEEEE--CCCCC
Confidence 47899999999999999999993 44554
No 128
>KOG3257|consensus
Probab=22.37 E-value=46 Score=30.96 Aligned_cols=23 Identities=22% Similarity=0.203 Sum_probs=18.6
Q ss_pred CCHH-HHHHHHHHHHHcCCEEEEe
Q psy14880 148 YTPL-EVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 148 YT~~-ei~eiv~yA~~rgI~VIPE 170 (497)
.|-+ -+|-|+--|+..||.|+||
T Consensus 135 ~~lesi~rsiigtA~smGIkVvp~ 158 (168)
T KOG3257|consen 135 TTLESICRSIIGTARSMGIKVVPP 158 (168)
T ss_pred ccHHHHHHHHHHHHHhCccccchh
Confidence 4444 3688889999999999996
No 129
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=22.32 E-value=84 Score=30.63 Aligned_cols=57 Identities=25% Similarity=0.330 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHH
Q psy14880 150 PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYG 229 (497)
Q Consensus 150 ~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~ 229 (497)
.+.++++|+.|+++||.||..+-.. ..+ +.+ +......+...+.+++++.
T Consensus 61 ~~~ld~~v~~a~~~gi~vild~h~~----~~w-------------~~~-------------~~~~~~~~~~~~~~~~~~~ 110 (281)
T PF00150_consen 61 LARLDRIVDAAQAYGIYVILDLHNA----PGW-------------ANG-------------GDGYGNNDTAQAWFKSFWR 110 (281)
T ss_dssp HHHHHHHHHHHHHTT-EEEEEEEES----TTC-------------SSS-------------TSTTTTHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhCCCeEEEEeccC----ccc-------------ccc-------------ccccccchhhHHHHHhhhh
Confidence 4579999999999999999854333 110 000 1122345557777888888
Q ss_pred HHHhccC
Q psy14880 230 ELMGMAK 236 (497)
Q Consensus 230 E~~~lF~ 236 (497)
.++.-|.
T Consensus 111 ~la~~y~ 117 (281)
T PF00150_consen 111 ALAKRYK 117 (281)
T ss_dssp HHHHHHT
T ss_pred hhccccC
Confidence 8888884
No 130
>PF06080 DUF938: Protein of unknown function (DUF938); InterPro: IPR010342 This family consists of several hypothetical proteins from both prokaryotes and eukaryotes. The function of this family is unknown.
Probab=22.25 E-value=50 Score=32.10 Aligned_cols=28 Identities=18% Similarity=0.343 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 152 EVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 152 ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|+.+|.+.|++.|.+.+--||||+--..
T Consensus 172 D~e~v~~lA~~~GL~l~~~~~MPANN~~ 199 (204)
T PF06080_consen 172 DIEDVEALAAAHGLELEEDIDMPANNLL 199 (204)
T ss_pred CHHHHHHHHHHCCCccCcccccCCCCeE
Confidence 6889999999999999999999986543
No 131
>cd07940 DRE_TIM_IPMS 2-isopropylmalate synthase (IPMS), N-terminal catalytic TIM barrel domain. 2-isopropylmalate synthase (IPMS) catalyzes an aldol-type condensation of acetyl-CoA and 2-oxoisovalerate yielding 2-isopropylmalate and CoA, the first committed step in leucine biosynthesis. This family includes the Arabidopsis thaliana IPMS1 and IPMS2 proteins, the Glycine max GmN56 protein, and the Brassica insularis BatIMS protein. This family also includes a group of archeal IPMS-like proteins represented by the Methanocaldococcus jannaschii AksA protein. AksA catalyzes the condensation of alpha-ketoglutarate and acetyl-CoA to form trans-homoaconitate, one of 13 steps in the conversion of alpha-ketoglutarate and acetylCoA to alpha-ketosuberate, a precursor to coenzyme B and biotin. AksA also catalyzes the condensation of alpha-ketoadipate or alpha-ketopimelate with acetylCoA to form, respectively, the (R)-homocitrate homologs (R)-2-hydroxy-1,2,5-pentanetricarboxylic acid and (R)-2-h
Probab=22.15 E-value=2.8e+02 Score=27.65 Aligned_cols=56 Identities=27% Similarity=0.147 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880 104 EVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG 178 (497)
Q Consensus 104 ~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~ 178 (497)
.+++++++++..|.++.+.. |. ....|.+.+.++++-+.+.|+..|=--||-|.+.
T Consensus 115 ~~~~~i~~a~~~G~~v~~~~--~~-----------------~~~~~~~~~~~~~~~~~~~G~~~i~l~DT~G~~~ 170 (268)
T cd07940 115 RAVEAVEYAKSHGLDVEFSA--ED-----------------ATRTDLDFLIEVVEAAIEAGATTINIPDTVGYLT 170 (268)
T ss_pred HHHHHHHHHHHcCCeEEEee--ec-----------------CCCCCHHHHHHHHHHHHHcCCCEEEECCCCCCCC
Confidence 45567777777776654221 11 1226777778888877777777666666666554
No 132
>smart00642 Aamy Alpha-amylase domain.
Probab=22.01 E-value=2.4e+02 Score=26.20 Aligned_cols=29 Identities=34% Similarity=0.516 Sum_probs=23.9
Q ss_pred ChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880 101 TPLEVQELVHYAKLRGVRVIFELDAPAHAG 130 (497)
Q Consensus 101 ~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~ 130 (497)
+.++++++++-+..+|++|+..+. ++|++
T Consensus 68 t~~d~~~lv~~~h~~Gi~vilD~V-~NH~~ 96 (166)
T smart00642 68 TMEDFKELVDAAHARGIKVILDVV-INHTS 96 (166)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEEC-CCCCC
Confidence 457899999999999999988775 66753
No 133
>cd06840 PLPDE_III_Bif_AspK_DapDC Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Bifunctional Aspartate Kinase/Diaminopimelate Decarboxylase. Bifunctional aspartate kinase/diaminopimelate decarboxylase (AspK/DapDC, EC 4.1.1.20/EC 2.7.2.4) typically exists in bacteria. These proteins contain an N-terminal AspK region and a C-terminal DapDC region, which contains a PLP-binding TIM-barrel domain followed by beta-sandwich domain, characteristic of fold type III PLP-dependent enzymes. Members of this subfamily have not been fully characterized. Based on their sequence, these proteins may catalyze both reactions catalyzed by AspK and DapDC. AspK catalyzes the phosphorylation of L-aspartate to produce 4-phospho-L-aspartate while DapDC participates in the last step of lysine biosynthesis, the conversion of meso-2,6-diaminoheptanedioate to L-lysine.
Probab=21.88 E-value=5.8e+02 Score=26.66 Aligned_cols=21 Identities=38% Similarity=0.316 Sum_probs=18.5
Q ss_pred CCHHHHHHHHHHHHHcCCEEE
Q psy14880 148 YTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VI 168 (497)
.+.+|+.++++.+++.|++|+
T Consensus 153 ~~~~~~~~~l~~~~~~~l~l~ 173 (368)
T cd06840 153 LDVDELDEARDLAKKAGIIVI 173 (368)
T ss_pred CCHHHHHHHHHHHHhCCCcEE
Confidence 688999999999999888765
No 134
>cd07265 2_3_CTD_N N-terminal domain of catechol 2,3-dioxygenase. This subfamily contains the N-terminal, non-catalytic, domain of catechol 2,3-dioxygenase. Catechol 2,3-dioxygenase (2,3-CTD, catechol:oxygen 2,3-oxidoreductase) catalyzes an extradiol cleavage of catechol to form 2-hydroxymuconate semialdehyde with the insertion of two atoms of oxygen. The enzyme is a homotetramer and contains catalytically essential Fe(II) . The reaction proceeds by an ordered bi-unit mechanism. First, catechol binds to the enzyme, this is then followed by the binding of dioxygen to form a tertiary complex, and then the aromatic ring is cleaved to produce 2-hydroxymuconate semialdehyde. Catechol 2,3-dioxygenase belongs to the type I extradiol dioxygenase family. The subunit comprises the N- and C-terminal domains of similar structure fold, resulting from an ancient gene duplication. The active site is located in a funnel-shaped space of the C-terminal domain. This subfamily represents the N-terminal do
Probab=21.79 E-value=91 Score=26.24 Aligned_cols=32 Identities=16% Similarity=0.152 Sum_probs=24.7
Q ss_pred CCCHHHHHHHHHHHHHcCCEEE--EecCCcccch
Q psy14880 147 VYTPLEVQELVHYAKLRGIRVI--FELDAPAHAG 178 (497)
Q Consensus 147 ~YT~~ei~eiv~yA~~rgI~VI--PEID~PGH~~ 178 (497)
+-+.+++.++.+..+++|++|+ |+-..||+..
T Consensus 69 v~~~~dv~~~~~~l~~~G~~~~~~~~~~~~~~~~ 102 (122)
T cd07265 69 VLDDADLEKLEARLQAYGVAVERIPAGELPGVGR 102 (122)
T ss_pred eCCHHHHHHHHHHHHHCCCcEEEcccCCCCCCce
Confidence 4578899999999999999987 4444565433
No 135
>COG2008 GLY1 Threonine aldolase [Amino acid transport and metabolism]
Probab=21.73 E-value=77 Score=33.25 Aligned_cols=23 Identities=22% Similarity=0.355 Sum_probs=20.6
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEE
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRV 167 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~V 167 (497)
+..||.+|+++|.++|+++||-+
T Consensus 144 GtVy~l~el~~i~~~~k~~~l~L 166 (342)
T COG2008 144 GTVYPLDELEAISAVCKEHGLPL 166 (342)
T ss_pred ceecCHHHHHHHHHHHHHhCCce
Confidence 45799999999999999999765
No 136
>COG3977 Alanine-alpha-ketoisovalerate (or valine-pyruvate) aminotransferase [Amino acid transport and metabolism]
Probab=21.70 E-value=79 Score=32.79 Aligned_cols=25 Identities=32% Similarity=0.414 Sum_probs=22.9
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
+.++|.+|++.|.+.|+++||-+|-
T Consensus 193 GNVlTdeE~~kldalA~~~giPliI 217 (417)
T COG3977 193 GNVLTDEELAKLDALARQHGIPLII 217 (417)
T ss_pred CCcccHHHHHHHHHHhhhcCCcEEE
Confidence 5679999999999999999999885
No 137
>cd03327 MR_like_2 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 2. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=21.59 E-value=1.5e+02 Score=30.69 Aligned_cols=27 Identities=0% Similarity=-0.196 Sum_probs=21.0
Q ss_pred eeeeccCCCCCCHHHHHHhhhhhhcccc
Q psy14880 31 TKLIVWSSHLTDPEIILNYLDPQTCGTQ 58 (497)
Q Consensus 31 G~mlDvaR~~~~~~~lk~~id~ma~~K~ 58 (497)
.+|+|.-.. ++++.-.++++.|+.|.+
T Consensus 173 ~l~vDan~~-~~~~~A~~~~~~l~~~~~ 199 (341)
T cd03327 173 DLMLDCYMS-WNLNYAIKMARALEKYEL 199 (341)
T ss_pred cEEEECCCC-CCHHHHHHHHHHhhhcCC
Confidence 589998755 577888888888888764
No 138
>COG0277 GlcD FAD/FMN-containing dehydrogenases [Energy production and conversion]
Probab=21.58 E-value=86 Score=33.37 Aligned_cols=30 Identities=20% Similarity=0.205 Sum_probs=25.5
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
-|.+|+.+++++|.++||.|+|- .-||+..
T Consensus 39 ~s~~eV~~iv~~a~~~~~~v~pr--G~gts~~ 68 (459)
T COG0277 39 KSEEEVAAILRLANENGIPVVPR--GGGTSLS 68 (459)
T ss_pred CCHHHHHHHHHHHHHcCCeEEEE--CCCCCcc
Confidence 58999999999999999999994 5556554
No 139
>KOG0256|consensus
Probab=21.38 E-value=2.4e+02 Score=30.57 Aligned_cols=114 Identities=17% Similarity=0.173 Sum_probs=68.9
Q ss_pred eeeeccCCCC-CCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHH
Q psy14880 31 TKLIVWSSHL-TDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV 109 (497)
Q Consensus 31 G~mlDvaR~~-~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~ 109 (497)
-+.++.+-+| ++++.+..-+.+.-...+++..| =++. |.-| -+..|+++++.+++
T Consensus 197 pv~c~Ss~~f~itv~alE~A~~~A~~~~~kVkGv-------------litN--------PsNP---LG~~~~~e~L~~ll 252 (471)
T KOG0256|consen 197 PVHCSSSNGFQITVEALEAALNQARKLGLKVKGV-------------LITN--------PSNP---LGTTLSPEELISLL 252 (471)
T ss_pred EEEeecCCCccccHHHHHHHHHHHHHhCCceeEE-------------EEeC--------CCCC---CCCccCHHHHHHHH
Confidence 3677777787 78888887776655554333322 1222 3333 35679999999999
Q ss_pred HHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCC
Q psy14880 110 HYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGL 189 (497)
Q Consensus 110 ~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~ 189 (497)
.|++...+|++...- +.++ ++.+.+++-+.++++...+ .|.-..-++.--+++|+
T Consensus 253 ~Fa~~kniHvI~DEI---ya~s---------------VF~~~~F~Sv~ev~~~~~~-------~~~rvHivyslSKD~Gl 307 (471)
T KOG0256|consen 253 NFASRKNIHVISDEI---YAGS---------------VFDKSEFRSVLEVRKDPHL-------DPDRVHIVYSLSKDFGL 307 (471)
T ss_pred HHHhhcceEEEeehh---hccc---------------ccCccCceEHHHHhhcccc-------CCCcEEEEEEeccccCC
Confidence 999999999875332 2221 2444455555555555443 33333344444567777
Q ss_pred CCcc
Q psy14880 190 GDLI 193 (497)
Q Consensus 190 ~~l~ 193 (497)
|.++
T Consensus 308 pGfR 311 (471)
T KOG0256|consen 308 PGFR 311 (471)
T ss_pred CceE
Confidence 7765
No 140
>COG0621 MiaB 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]
Probab=21.27 E-value=1.4e+02 Score=32.53 Aligned_cols=92 Identities=13% Similarity=0.093 Sum_probs=51.6
Q ss_pred CCHHHHHHhhhhhhcc-cccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHHHHHHHcCcee
Q psy14880 41 TDPEIILNYLDPQTCG-TQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRV 119 (497)
Q Consensus 41 ~~~~~lk~~id~ma~~-K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri 119 (497)
+|.+..-++|+.|+.. |+ +..|||+|| |-+| .- +.++ .+.||.++..++++--+..--.+
T Consensus 236 ~P~~~~d~lI~~~~~~~kv-~~~lHlPvQ--------sGsd-~I----Lk~M-----~R~yt~e~~~~~i~k~R~~~Pd~ 296 (437)
T COG0621 236 HPLEFTDDLIEAIAETPKV-CPHLHLPVQ--------SGSD-RI----LKRM-----KRGYTVEEYLEIIEKLRAARPDI 296 (437)
T ss_pred CchhcCHHHHHHHhcCCcc-cccccCccc--------cCCH-HH----HHHh-----CCCcCHHHHHHHHHHHHHhCCCc
Confidence 4477778899999996 65 333333333 2222 10 1222 46789888888765433210000
Q ss_pred eeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 120 IFELD-APAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 120 ~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
.+-.| +-+++| =|.+|+.+.++..++-+..-+
T Consensus 297 ~i~tDiIVGFPg-----------------ETeedFe~tl~lv~e~~fd~~ 329 (437)
T COG0621 297 AISTDIIVGFPG-----------------ETEEDFEETLDLVEEVRFDRL 329 (437)
T ss_pred eEeccEEEECCC-----------------CCHHHHHHHHHHHHHhCCCEE
Confidence 01111 122332 489999999999999876543
No 141
>COG1217 TypA Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]
Probab=21.26 E-value=56 Score=35.81 Aligned_cols=22 Identities=36% Similarity=0.555 Sum_probs=17.6
Q ss_pred HHHHHcCCEEE-------------EecCCcccchh
Q psy14880 158 HYAKLRGIRVI-------------FELDAPAHAGN 179 (497)
Q Consensus 158 ~yA~~rgI~VI-------------PEID~PGH~~a 179 (497)
..-++|||+|. ==+|+|||+..
T Consensus 47 DlEkERGITILaKnTav~~~~~~INIvDTPGHADF 81 (603)
T COG1217 47 DLEKERGITILAKNTAVNYNGTRINIVDTPGHADF 81 (603)
T ss_pred chhhhcCcEEEeccceeecCCeEEEEecCCCcCCc
Confidence 35689999975 45899999975
No 142
>PRK02308 uvsE putative UV damage endonuclease; Provisional
Probab=21.22 E-value=4.7e+02 Score=26.85 Aligned_cols=56 Identities=21% Similarity=0.443 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHcCceee-eecC-CCC--CCC-CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 104 EVQELVHYAKLRGVRVI-FELD-APA--HAG-NGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 104 ~i~~~~~y~~~rG~ri~-~e~~-~p~--~~~-~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
++.++++|...+||+.. ..++ .|- |+. ..|. ..-..+++++++-+++++.||+++-
T Consensus 49 ~l~~~l~~~~~~~I~~~R~sS~l~P~~~h~~~~~w~----------~~~~~~~~~~~~g~~~~~~~irls~ 109 (303)
T PRK02308 49 NLLRILKYNIAHGIGLFRLSSSLIPLATHPELEGWD----------YIEPFKEELREIGEFIKEHNIRLSF 109 (303)
T ss_pred HHHHHHHHHHHCCCCEEEcccCcCCCCCChhhcccC----------CCCCCHHHHHHHHHHHHHcCCCeec
Confidence 45677788888888642 1111 121 210 0121 2337899999999999999987653
No 143
>PRK14334 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=21.21 E-value=1.3e+02 Score=32.42 Aligned_cols=52 Identities=17% Similarity=0.118 Sum_probs=28.6
Q ss_pred ccCChhHHHHHHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880 98 KVYTPLEVQELVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR 166 (497)
Q Consensus 98 ~~y~~~~i~~~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~ 166 (497)
+.|+.+++.+.++..+..|..+.+..+ +-+.++ =|.+++.+.++++++.++.
T Consensus 267 R~~~~~~~~~~v~~lr~~~~~i~i~~d~IvG~Pg-----------------Et~ed~~~tl~~i~~l~~~ 319 (440)
T PRK14334 267 REYRREKYLERIAEIREALPDVVLSTDIIVGFPG-----------------ETEEDFQETLSLYDEVGYD 319 (440)
T ss_pred CCCCHHHHHHHHHHHHHhCCCcEEEEeEEEECCC-----------------CCHHHHHHHHHHHHhcCCC
Confidence 456667777776666655433211111 111111 4777888888887776643
No 144
>PRK05942 aspartate aminotransferase; Provisional
Probab=21.17 E-value=94 Score=32.54 Aligned_cols=23 Identities=17% Similarity=0.196 Sum_probs=21.1
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
..+|++++++|++.|+++||.||
T Consensus 185 ~~~s~~~~~~i~~~a~~~~~~iI 207 (394)
T PRK05942 185 ATAPREFFEEIVAFARKYEIMLV 207 (394)
T ss_pred CcCCHHHHHHHHHHHHHcCeEEE
Confidence 45999999999999999999877
No 145
>PRK11194 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=21.16 E-value=2.8e+02 Score=29.46 Aligned_cols=72 Identities=15% Similarity=0.047 Sum_probs=42.7
Q ss_pred EEeccCCcccccccccCCCCCCCccCChhHHH-HHHHHHHHcC---ceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCC
Q psy14880 75 RWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ-ELVHYAKLRG---VRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYT 149 (497)
Q Consensus 75 hlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~-~~~~y~~~rG---~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT 149 (497)
-+|..|++--.--.|. ++.|..+++. .+.+|.+..| -+|.+++. +|+.. =|
T Consensus 221 SLha~d~e~R~~lmPi------n~~~~l~~ll~a~~~y~~~~~~~~rrI~irypLIpGvN------------------Ds 276 (372)
T PRK11194 221 SLHAPNDELRDEIVPI------NKKYNIETFLAAVRRYLEKSNANQGRVTVEYVMLDHVN------------------DG 276 (372)
T ss_pred eccCCCHHHHHHhcCC------cccccHHHHHHHHHHHHHhcccCCCeEEEEEEeECCCC------------------CC
Confidence 5787776543322331 2334434333 3346766554 46666654 34431 58
Q ss_pred HHHHHHHHHHHHHc--CCEEEEe
Q psy14880 150 PLEVQELVHYAKLR--GIRVIFE 170 (497)
Q Consensus 150 ~~ei~eiv~yA~~r--gI~VIPE 170 (497)
.+++++|+++++.. +|++||-
T Consensus 277 ~e~a~~La~ll~~l~~~VnLIPY 299 (372)
T PRK11194 277 TEHAHQLAELLKDTPCKINLIPW 299 (372)
T ss_pred HHHHHHHHHHHhcCCceEEEecC
Confidence 89999999999987 6667763
No 146
>PRK09505 malS alpha-amylase; Reviewed
Probab=20.97 E-value=93 Score=35.79 Aligned_cols=30 Identities=37% Similarity=0.665 Sum_probs=26.3
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|+++||+-|.+|||+||-.+ ++-|+..
T Consensus 290 t~~dfk~Lv~~aH~~Gi~VilD~-V~NH~~~ 319 (683)
T PRK09505 290 TEADLRTLVDEAHQRGIRILFDV-VMNHTGY 319 (683)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEE-CcCCCcc
Confidence 78999999999999999999876 4678773
No 147
>PF00155 Aminotran_1_2: Aminotransferase class I and II 1-aminocyclopropane-1-carboxylate synthase signature aspartate aminotransferase signature; InterPro: IPR004839 Aminotransferases share certain mechanistic features with other pyridoxal-phosphate dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine residue. On the basis of sequence similarity, these various enzymes can be grouped [] into class I and class II. This entry includes proteins from both subfamilies.; GO: 0016769 transferase activity, transferring nitrogenous groups, 0030170 pyridoxal phosphate binding, 0009058 biosynthetic process; PDB: 3NRA_B 3P6K_B 3OP7_A 3ASB_A 3ASA_A 1W7M_A 3FVX_A 1W7N_A 3FVU_B 3FVS_A ....
Probab=20.94 E-value=95 Score=31.67 Aligned_cols=24 Identities=29% Similarity=0.343 Sum_probs=19.8
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
..+|.+++++|++.|+++|+-||-
T Consensus 163 ~~~~~~~l~~l~~~~~~~~~~ii~ 186 (363)
T PF00155_consen 163 SVLSLEELRELAELAREYNIIIIV 186 (363)
T ss_dssp BB--HHHHHHHHHHHHHTTSEEEE
T ss_pred cccccccccchhhhhcccccceee
Confidence 359999999999999999998884
No 148
>PRK14454 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=20.94 E-value=2.4e+02 Score=29.49 Aligned_cols=72 Identities=18% Similarity=0.149 Sum_probs=41.7
Q ss_pred eEEeccCCcccccccccCCCCCCCccCChhHHHHH-HHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880 74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQEL-VHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPL 151 (497)
Q Consensus 74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~-~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ 151 (497)
+-+|..|++-.+.-.| .+..|..+++.+. .+|.+..|-|+.+++. +|+.. =|.+
T Consensus 212 isLka~d~e~r~~l~p------v~~~~~L~~l~~~~~~~~~~~~~rv~iey~LI~gvN------------------Ds~e 267 (342)
T PRK14454 212 ISLHAPNDELRKKMMP------IANKYSIEELIEACKYYINKTNRRITFEYALVKGVN------------------DSKE 267 (342)
T ss_pred EecCCCCHHHHHHhcC------CcccCCHHHHHHHHHHHHHHhCCEEEEEEEeECCCC------------------CCHH
Confidence 5567777654332222 1234554555443 3556677777776665 24321 4788
Q ss_pred HHHHHHHHHHHc--CCEEEE
Q psy14880 152 EVQELVHYAKLR--GIRVIF 169 (497)
Q Consensus 152 ei~eiv~yA~~r--gI~VIP 169 (497)
++++|.++.+.+ .|++||
T Consensus 268 da~~La~llk~l~~~VnLiP 287 (342)
T PRK14454 268 DAKELGKLLKGMLCHVNLIP 287 (342)
T ss_pred HHHHHHHHHhcCCceEEEEe
Confidence 999999998865 344444
No 149
>PF09711 Cas_Csn2: CRISPR-associated protein (Cas_Csn2); InterPro: IPR010146 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents the Csn2 family of Cas proteins, which are found only in CRISPR-containing species, near other CRISPR-associated proteins (cas), as part of the NMENI subtype of CRISPR/Cas loci. The species range so far for this subtype is animal pathogens and commensals only. This protein is present in some but not all NMENI CRISPR/Cas loci.; PDB: 3TOC_A 3QHQ_A 3S5U_C.
Probab=20.87 E-value=1e+02 Score=29.63 Aligned_cols=25 Identities=40% Similarity=0.559 Sum_probs=21.4
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
..|+|++|++++++|+.-.+|.|+-
T Consensus 140 ~~YLT~eEl~el~e~i~~~~i~VL~ 164 (188)
T PF09711_consen 140 RSYLTEEELQELYEYIKYNKIKVLF 164 (188)
T ss_dssp GGGS-HHHHHHHHHHHHHTTSEEEE
T ss_pred HHhcCHHHHHHHHHHHHHhCCeEEE
Confidence 3569999999999999999999874
No 150
>COG2100 Predicted Fe-S oxidoreductase [General function prediction only]
Probab=20.76 E-value=1.7e+02 Score=30.62 Aligned_cols=48 Identities=23% Similarity=0.369 Sum_probs=37.4
Q ss_pred CccCChhHHHHHHHHHHHcCceeee-ecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcC
Q psy14880 97 AKVYTPLEVQELVHYAKLRGVRVIF-ELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRG 164 (497)
Q Consensus 97 ~~~y~~~~i~~~~~y~~~rG~ri~~-e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rg 164 (497)
.+-|.-+.+.++.+|....|+.+.. +.-.|+ |..+|+..||++|++-|
T Consensus 236 ~~dYdv~kvle~aE~i~~a~idvlIaPv~lPG--------------------~ND~E~~~iIe~A~~iG 284 (414)
T COG2100 236 RKDYDVKKVLEVAEYIANAGIDVLIAPVWLPG--------------------VNDDEMPKIIEWAREIG 284 (414)
T ss_pred ccccCHHHHHHHHHHHHhCCCCEEEeeeecCC--------------------cChHHHHHHHHHHHHhC
Confidence 4567778889999998888887642 333332 88899999999999998
No 151
>cd07943 DRE_TIM_HOA 4-hydroxy-2-oxovalerate aldolase, N-terminal catalytic TIM barrel domain. 4-hydroxy 2-ketovalerate aldolase (Also known as 4-hydroxy-2-ketovalerate aldolase and 4-hydroxy-2-oxopentanoate aldolase (HOA)) converts 4-hydroxy-2-oxopentanoate to acetaldehyde and pyruvate, the penultimate step in the meta-cleavage pathway for the degradation of phenols, cresols and catechol. This family includes the Escherichia coli MhpE aldolase, the Pseudomonas DmpG aldolase, and the Burkholderia xenovorans BphI pyruvate aldolase. In Pseudomonas, the DmpG aldolase tightly associates with a dehydrogenase (DmpF ) and is inactive without it. HOA has a canonical TIM-barrel fold with a C-terminal extension that forms a funnel leading to the active site. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate
Probab=20.61 E-value=2.4e+02 Score=27.96 Aligned_cols=20 Identities=20% Similarity=0.154 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHcCCEEEEec
Q psy14880 152 EVQELVHYAKLRGIRVIFEL 171 (497)
Q Consensus 152 ei~eiv~yA~~rgI~VIPEI 171 (497)
.+++.+++|+++|++|..-+
T Consensus 113 ~~~~~i~~ak~~G~~v~~~~ 132 (263)
T cd07943 113 VSEQHIGAARKLGMDVVGFL 132 (263)
T ss_pred HHHHHHHHHHHCCCeEEEEE
Confidence 45566666666666555554
No 152
>PRK09427 bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase; Provisional
Probab=20.60 E-value=1.2e+02 Score=33.00 Aligned_cols=39 Identities=15% Similarity=0.163 Sum_probs=34.5
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG 184 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~ 184 (497)
..++++++++|+++|.++|++++-|+...-....++.+.
T Consensus 141 ~~L~~~~l~~l~~~a~~lGl~~lvEvh~~~El~~al~~~ 179 (454)
T PRK09427 141 SVLDDEQYRQLAAVAHSLNMGVLTEVSNEEELERAIALG 179 (454)
T ss_pred HhCCHHHHHHHHHHHHHcCCcEEEEECCHHHHHHHHhCC
Confidence 348999999999999999999999999988888877654
No 153
>PF01408 GFO_IDH_MocA: Oxidoreductase family, NAD-binding Rossmann fold; InterPro: IPR000683 This group of enzymes utilise NADP or NAD, and is known as the GFO/IDH/MOCA family in UniProtKB/Swiss-Prot. GFO is a glucose--fructose oxidoreductase, which converts D-glucose and D-fructose into D-gluconolactone and D-glucitol in the sorbitol-gluconate pathway. MOCA is a rhizopine catabolism protein which may catalyse the NADH-dependent dehydrogenase reaction involved in rhizopine catabolism. Other proteins belonging to this family include Gal80, a negative regulator for the expression of lactose and galactose metabolic genes; and several hypothetical proteins from yeast, Escherichia coli and Bacillus subtilis. The oxidoreductase, N-terminal domain is almost always associated with the oxidoreductase, C-terminal domain (see IPR004104 from INTERPRO).; GO: 0016491 oxidoreductase activity; PDB: 1LC0_A 1LC3_A 1GCU_A 3IP3_E 3CEA_C 3EVN_A 3NTQ_A 3NTR_B 3NT5_A 3MZ0_A ....
Probab=20.60 E-value=2.6e+02 Score=23.43 Aligned_cols=21 Identities=43% Similarity=0.569 Sum_probs=19.5
Q ss_pred CCHHHHHHHHHHHHHcCCEEE
Q psy14880 148 YTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VI 168 (497)
.|.+|.++|++.|++.|+.|.
T Consensus 97 ~~~~~~~~l~~~a~~~~~~~~ 117 (120)
T PF01408_consen 97 LTLEEAEELVEAAKEKGVKVM 117 (120)
T ss_dssp SSHHHHHHHHHHHHHHTSCEE
T ss_pred CCHHHHHHHHHHHHHhCCEEE
Confidence 799999999999999998875
No 154
>PLN00145 tyrosine/nicotianamine aminotransferase; Provisional
Probab=20.55 E-value=1e+02 Score=33.02 Aligned_cols=24 Identities=29% Similarity=0.360 Sum_probs=21.5
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIF 169 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIP 169 (497)
-.|+++++++|++.|+++||-||=
T Consensus 205 ~v~~~~~l~~i~~~a~~~~i~ii~ 228 (430)
T PLN00145 205 SVYSYEHLAKIAETARKLGILVIA 228 (430)
T ss_pred CCCCHHHHHHHHHHHHHcCCEEEE
Confidence 459999999999999999998773
No 155
>PRK09147 succinyldiaminopimelate transaminase; Provisional
Probab=20.47 E-value=97 Score=32.40 Aligned_cols=23 Identities=13% Similarity=0.124 Sum_probs=21.0
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
..||++++++|++.|++++|.||
T Consensus 181 ~~~s~~~~~~l~~~a~~~~~~ii 203 (396)
T PRK09147 181 AVLPLDDWKKLFALSDRYGFVIA 203 (396)
T ss_pred ccCCHHHHHHHHHHHHHcCeEEE
Confidence 45999999999999999999877
No 156
>KOG4175|consensus
Probab=20.44 E-value=1.7e+02 Score=28.68 Aligned_cols=24 Identities=25% Similarity=0.346 Sum_probs=20.9
Q ss_pred CCHHHHHHHHHHHHHcCCEEEEec
Q psy14880 148 YTPLEVQELVHYAKLRGIRVIFEL 171 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI~VIPEI 171 (497)
+-.+|-..+-.+|+++||.+||-+
T Consensus 132 lPpEEa~~~Rne~~k~gislvpLv 155 (268)
T KOG4175|consen 132 LPPEEAETLRNEARKHGISLVPLV 155 (268)
T ss_pred CChHHHHHHHHHHHhcCceEEEee
Confidence 556888889999999999999964
No 157
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=20.42 E-value=2.5e+02 Score=18.78 Aligned_cols=13 Identities=15% Similarity=0.485 Sum_probs=9.6
Q ss_pred CCHHHHHHHHHHH
Q psy14880 148 YTPLEVQELVHYA 160 (497)
Q Consensus 148 YT~~ei~eiv~yA 160 (497)
.|++|||+.+++-
T Consensus 17 ls~eeir~FL~~~ 29 (30)
T PF08671_consen 17 LSKEEIREFLEFN 29 (30)
T ss_dssp --HHHHHHHHHHH
T ss_pred CCHHHHHHHHHhC
Confidence 8999999998764
No 158
>TIGR01290 nifB nitrogenase cofactor biosynthesis protein NifB. This model describes NifB, a protein required for the biosynthesis of the iron-molybdenum (or iron-vanadium) cofactor used by the nitrogen-fixing enzyme nitrogenase. Archaeal homologs lack the most C-terminal region and score between the trusted and noise cutoffs of this model.
Probab=20.38 E-value=1.9e+02 Score=31.35 Aligned_cols=29 Identities=24% Similarity=0.269 Sum_probs=24.3
Q ss_pred CCHHHHHHHHHHHHHcCC---EEEEecCCccc
Q psy14880 148 YTPLEVQELVHYAKLRGI---RVIFELDAPAH 176 (497)
Q Consensus 148 YT~~ei~eiv~yA~~rgI---~VIPEID~PGH 176 (497)
++.+++.+|+++++++|+ .|+|-+..|-|
T Consensus 200 iND~~i~~l~~~~~~lg~~~~nl~p~~~~p~~ 231 (442)
T TIGR01290 200 INDEHLVEVSKQVKELGAFLHNVMPLISAPEH 231 (442)
T ss_pred cCHHHHHHHHHHHHhCCCcEEEeecCCCcccc
Confidence 556899999999999985 57888888766
No 159
>PRK09082 methionine aminotransferase; Validated
Probab=20.36 E-value=95 Score=32.40 Aligned_cols=23 Identities=17% Similarity=0.281 Sum_probs=21.0
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
..||++++++|++.|++++|-||
T Consensus 178 ~~~~~~~~~~i~~~a~~~~i~li 200 (386)
T PRK09082 178 TVWSAADMRALWQLIAGTDIYVL 200 (386)
T ss_pred cCCCHHHHHHHHHHHHHCCEEEE
Confidence 45899999999999999999877
No 160
>COG0826 Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]
Probab=20.34 E-value=1.4e+02 Score=31.33 Aligned_cols=35 Identities=17% Similarity=0.210 Sum_probs=32.0
Q ss_pred CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhh
Q psy14880 146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNG 180 (497)
Q Consensus 146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~ 180 (497)
..+|.+|++|+|+||.++|.++.=-+++.=|....
T Consensus 44 ~nfs~~~l~e~i~~ah~~gkk~~V~~N~~~~~~~~ 78 (347)
T COG0826 44 LNFSVEDLAEAVELAHSAGKKVYVAVNTLLHNDEL 78 (347)
T ss_pred ccCCHHHHHHHHHHHHHcCCeEEEEeccccccchh
Confidence 46999999999999999999999999999888764
No 161
>TIGR02456 treS_nterm trehalose synthase. Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PubMed:15378530).
Probab=20.21 E-value=99 Score=34.31 Aligned_cols=30 Identities=33% Similarity=0.535 Sum_probs=26.0
Q ss_pred CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN 179 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a 179 (497)
|.+|+++||+-|.++||+||=.+ ++.|+..
T Consensus 74 t~~df~~Lv~~ah~~Gi~vilD~-V~NH~s~ 103 (539)
T TIGR02456 74 TIDDFKDFVDEAHARGMRVIIDL-VLNHTSD 103 (539)
T ss_pred CHHHHHHHHHHHHHCCCEEEEEe-ccCcCCC
Confidence 78999999999999999999875 5777654
No 162
>TIGR03542 DAPAT_plant LL-diaminopimelate aminotransferase. This clade of the pfam00155 superfamily of aminotransferases includes several which are adjacent to elements of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). This clade includes characterized species in plants and Chlamydia. Every member of this clade is from a genome which possesses most of the lysine biosynthesis pathway but lacks any of the known succinylases, desuccinylases, acetylases or deacetylases typical of the acylated versions of this pathway nor do they have the direct, NADPH-dependent enzyme (ddh).
Probab=20.16 E-value=1.2e+02 Score=32.00 Aligned_cols=24 Identities=33% Similarity=0.512 Sum_probs=21.5
Q ss_pred CCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880 145 AKVYTPLEVQELVHYAKLRGIRVI 168 (497)
Q Consensus 145 ~~~YT~~ei~eiv~yA~~rgI~VI 168 (497)
+..||++++++|++.|+++++-||
T Consensus 187 G~~~s~~~~~~l~~~a~~~~~~iI 210 (402)
T TIGR03542 187 GTVLTKEQLKELVDYANEHGSLIL 210 (402)
T ss_pred CccCCHHHHHHHHHHHHHcCeEEE
Confidence 456999999999999999999776
No 163
>cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=20.04 E-value=1.8e+02 Score=28.79 Aligned_cols=22 Identities=32% Similarity=0.481 Sum_probs=19.5
Q ss_pred CHHHHHHHHHHHHHcCCEEEEe
Q psy14880 149 TPLEVQELVHYAKLRGIRVIFE 170 (497)
Q Consensus 149 T~~ei~eiv~yA~~rgI~VIPE 170 (497)
.-.+.+.+.++|+.+||.++|.
T Consensus 212 Git~~~~i~~~a~~~gi~~~~~ 233 (263)
T cd03320 212 GPRALLELAEEARARGIPAVVS 233 (263)
T ss_pred CHHHHHHHHHHHHHcCCCEEEE
Confidence 3467999999999999999996
Done!