Query         psy14880
Match_columns 497
No_of_seqs    213 out of 1574
Neff          6.6 
Searched_HMMs 46136
Date          Fri Aug 16 23:11:10 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14880.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14880hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd06562 GH20_HexA_HexB-like Be 100.0   1E-84 2.2E-89  673.4  33.7  339   27-478     1-348 (348)
  2 KOG2499|consensus              100.0   1E-83 2.2E-88  660.1  30.3  377    3-497   151-542 (542)
  3 cd06563 GH20_chitobiase-like T 100.0 2.9E-80 6.3E-85  642.6  31.0  341   27-467     1-357 (357)
  4 cd06570 GH20_chitobiase-like_1 100.0   9E-79   2E-83  618.6  29.9  309   27-467     1-311 (311)
  5 cd06569 GH20_Sm-chitobiase-lik 100.0 7.1E-78 1.5E-82  638.4  32.8  360   23-463     1-424 (445)
  6 cd06568 GH20_SpHex_like A subg 100.0 3.7E-76 8.1E-81  604.6  29.2  313   27-467     1-329 (329)
  7 PF00728 Glyco_hydro_20:  Glyco 100.0 1.8E-71 3.9E-76  574.0  17.3  323   27-455     1-350 (351)
  8 cd02742 GH20_hexosaminidase Be 100.0 8.1E-69 1.8E-73  545.5  26.5  293   29-454     1-303 (303)
  9 cd06564 GH20_DspB_LnbB-like Gl 100.0 1.5E-66 3.2E-71  534.0  28.0  310   28-455     1-326 (326)
 10 COG3525 Chb N-acetyl-beta-hexo 100.0 9.6E-65 2.1E-69  536.5  19.8  377   16-485   250-646 (732)
 11 cd06565 GH20_GcnA-like Glycosy 100.0 1.4E-46 3.1E-51  381.9  23.8  287   29-454     1-301 (301)
 12 KOG2499|consensus               97.5 9.5E-05 2.1E-09   78.3   4.6   45   87-131   234-278 (542)
 13 PF02638 DUF187:  Glycosyl hydr  95.7    0.19 4.1E-06   51.8  13.5   87  150-244    69-160 (311)
 14 cd06562 GH20_HexA_HexB-like Be  95.6  0.0095 2.1E-07   62.3   3.4   42   90-131    57-98  (348)
 15 PF10566 Glyco_hydro_97:  Glyco  94.8   0.069 1.5E-06   53.9   6.7   71  147-233    69-148 (273)
 16 cd06568 GH20_SpHex_like A subg  94.6   0.028   6E-07   58.4   3.5   35   96-130    68-102 (329)
 17 cd06570 GH20_chitobiase-like_1  94.6    0.03 6.5E-07   57.7   3.7   34   97-130    62-95  (311)
 18 cd06564 GH20_DspB_LnbB-like Gl  94.4    0.03 6.4E-07   58.0   3.2   36   95-130    74-109 (326)
 19 PF00728 Glyco_hydro_20:  Glyco  94.2   0.025 5.5E-07   58.5   2.2   41   90-130    57-100 (351)
 20 cd06563 GH20_chitobiase-like T  94.0   0.042   9E-07   57.7   3.4   34   97-130    80-113 (357)
 21 cd02742 GH20_hexosaminidase Be  94.0   0.041 8.9E-07   56.4   3.2   35   96-130    65-99  (303)
 22 cd06569 GH20_Sm-chitobiase-lik  93.5   0.054 1.2E-06   58.5   3.2   34   97-130    91-124 (445)
 23 cd06565 GH20_GcnA-like Glycosy  89.8    0.21 4.6E-06   51.2   2.5   32   99-130    56-87  (301)
 24 COG1649 Uncharacterized protei  87.9     6.9 0.00015   42.0  12.3  156   26-243    44-204 (418)
 25 PF13200 DUF4015:  Putative gly  84.4      10 0.00022   39.3  11.2   77  150-234    60-136 (316)
 26 PRK12568 glycogen branching en  81.2      26 0.00055   40.5  13.7  118  106-236   273-401 (730)
 27 PLN02361 alpha-amylase          80.4     4.9 0.00011   43.0   7.3   84  149-233    74-169 (401)
 28 PRK14705 glycogen branching en  78.4      29 0.00062   42.4  13.5  120  105-236   768-897 (1224)
 29 PRK14706 glycogen branching en  77.1      31 0.00066   39.3  12.7  121  105-236   170-299 (639)
 30 PF14871 GHL6:  Hypothetical gl  76.7      12 0.00027   33.6   7.7   86  145-236    38-124 (132)
 31 PRK05402 glycogen branching en  76.1      77  0.0017   36.7  15.8  123  101-236   263-397 (726)
 32 PF02679 ComA:  (2R)-phospho-3-  76.0      19 0.00041   36.0   9.4   31   23-56      7-37  (244)
 33 TIGR01515 branching_enzym alph  75.7      43 0.00093   37.9  13.4  122  107-236   161-288 (613)
 34 PRK12313 glycogen branching en  75.1      78  0.0017   35.9  15.4  123  103-236   170-302 (633)
 35 PF02449 Glyco_hydro_42:  Beta-  75.0      15 0.00032   38.6   9.0   83  146-237    42-125 (374)
 36 COG0296 GlgB 1,4-alpha-glucan   74.2      36 0.00078   38.6  12.0  121  103-236   165-296 (628)
 37 PF00128 Alpha-amylase:  Alpha   69.1      14 0.00031   36.4   7.0   86  149-235    50-161 (316)
 38 TIGR02100 glgX_debranch glycog  68.1      28  0.0006   40.0   9.7   91  143-236   233-334 (688)
 39 PLN02960 alpha-amylase          67.0      83  0.0018   37.1  13.1  125  101-236   414-549 (897)
 40 PLN02784 alpha-amylase          64.7      21 0.00046   41.7   7.9   30  149-179   566-595 (894)
 41 COG3525 Chb N-acetyl-beta-hexo  62.5     5.8 0.00013   44.6   2.8   36   96-131   339-374 (732)
 42 smart00642 Aamy Alpha-amylase   62.5     8.5 0.00018   35.9   3.6   30  149-179    68-97  (166)
 43 cd06592 GH31_glucosidase_KIAA1  61.5      63  0.0014   33.0  10.1  136   99-248    26-165 (303)
 44 TIGR03849 arch_ComA phosphosul  61.1      15 0.00031   36.6   5.0   22   32-56      3-24  (237)
 45 PRK09441 cytoplasmic alpha-amy  60.7      27 0.00059   38.1   7.7   30  149-179    79-108 (479)
 46 PLN00196 alpha-amylase; Provis  59.7      33 0.00072   37.0   8.0   32  147-179    88-119 (428)
 47 cd06594 GH31_glucosidase_YihQ   59.7      29 0.00063   35.8   7.3   22  211-232   131-152 (317)
 48 TIGR02402 trehalose_TreZ malto  58.1 1.5E+02  0.0033   32.9  13.1  119  104-236   112-237 (542)
 49 PLN03244 alpha-amylase; Provis  56.6      47   0.001   38.7   8.6   77  148-236   438-524 (872)
 50 PLN02447 1,4-alpha-glucan-bran  56.6 1.9E+02   0.004   33.8  13.5  116  106-236   254-383 (758)
 51 PRK14510 putative bifunctional  53.6      67  0.0015   39.5   9.9   89  143-236   236-335 (1221)
 52 cd06593 GH31_xylosidase_YicI Y  53.5      55  0.0012   33.3   8.1  122  100-233    21-147 (308)
 53 PRK03705 glycogen debranching   52.7      60  0.0013   37.1   8.9   88  143-236   228-329 (658)
 54 TIGR02403 trehalose_treC alpha  51.0      45 0.00098   37.1   7.5   30  149-179    73-102 (543)
 55 PRK14459 ribosomal RNA large s  50.9      45 0.00098   35.4   7.1   78   74-175   242-326 (373)
 56 TIGR03849 arch_ComA phosphosul  50.3      53  0.0012   32.7   7.0  103   17-173    20-122 (237)
 57 cd07201 cPLA2_Grp-IVB-IVD-IVE-  47.4      49  0.0011   36.7   6.8   37   13-50    283-319 (541)
 58 COG0134 TrpC Indole-3-glycerol  44.5      39 0.00084   34.0   5.0   73  147-244   139-211 (254)
 59 COG0436 Aspartate/tyrosine/aro  44.5      24 0.00052   37.5   3.9   24  146-169   178-201 (393)
 60 cd06600 GH31_MGAM-like This fa  43.9 1.5E+02  0.0031   30.6   9.5   22  211-232   125-146 (317)
 61 cd00452 KDPG_aldolase KDPG and  42.4      37 0.00079   32.1   4.4   67  154-248    87-153 (190)
 62 PLN02607 1-aminocyclopropane-1  40.6      71  0.0015   34.5   6.8   23  146-168   215-237 (447)
 63 PRK14465 ribosomal RNA large s  40.3 2.7E+02  0.0058   29.3  10.8  113   14-170   210-324 (342)
 64 PF01212 Beta_elim_lyase:  Beta  38.9      27  0.0006   35.6   3.1   24  145-168   139-162 (290)
 65 PF07555 NAGidase:  beta-N-acet  38.5      85  0.0018   32.4   6.6   22  148-169    50-74  (306)
 66 COG1168 MalY Bifunctional PLP-  38.3      35 0.00076   36.1   3.8   25  145-169   172-196 (388)
 67 KOG0258|consensus               37.8      69  0.0015   34.3   5.8   54  102-170   200-254 (475)
 68 PRK13957 indole-3-glycerol-pho  37.8      46 0.00099   33.3   4.4   38  147-184   134-171 (247)
 69 TIGR02455 TreS_stutzeri trehal  36.8      38 0.00082   38.5   4.0   31  148-179   127-157 (688)
 70 PF01565 FAD_binding_4:  FAD bi  36.8      40 0.00087   29.6   3.5   28  149-178     9-36  (139)
 71 cd06598 GH31_transferase_CtsZ   36.5   4E+02  0.0088   27.3  11.4   24  210-233   129-152 (317)
 72 PRK09357 pyrC dihydroorotase;   35.0 1.2E+02  0.0026   32.1   7.5   93   41-171   156-255 (423)
 73 KOG0259|consensus               35.0      37  0.0008   36.1   3.3   34  145-179   213-246 (447)
 74 PRK14511 maltooligosyl trehalo  34.9 1.6E+02  0.0036   34.8   8.8   70  100-179    17-96  (879)
 75 PRK14582 pgaB outer membrane N  34.6 1.7E+02  0.0036   33.7   8.7   36  211-248   433-468 (671)
 76 TIGR02336 1,3-beta-galactosyl-  34.6 4.2E+02  0.0091   30.5  11.5  103  209-341   176-309 (719)
 77 PF02065 Melibiase:  Melibiase;  34.3 2.1E+02  0.0046   30.6   9.1  128   99-246    54-191 (394)
 78 cd06603 GH31_GANC_GANAB_alpha   33.1 1.6E+02  0.0035   30.5   7.8   23  211-233   125-147 (339)
 79 KOG0256|consensus               32.9 1.8E+02  0.0039   31.4   7.9  117   39-169   119-264 (471)
 80 PF05913 DUF871:  Bacterial pro  32.6      80  0.0017   33.3   5.5   55  102-172    13-68  (357)
 81 PRK14465 ribosomal RNA large s  32.5 1.7E+02  0.0037   30.7   7.8   74   73-170   215-292 (342)
 82 smart00481 POLIIIAc DNA polyme  32.4      66  0.0014   24.7   3.7   46  105-172    17-62  (67)
 83 cd06602 GH31_MGAM_SI_GAA This   32.3 3.3E+02  0.0071   28.3  10.0  127  101-236    22-155 (339)
 84 COG1809 (2R)-phospho-3-sulfola  32.3 2.6E+02  0.0056   27.8   8.3  101   17-168     7-107 (258)
 85 PRK10933 trehalose-6-phosphate  31.8 1.3E+02  0.0027   33.7   7.1   30  149-179    79-108 (551)
 86 PF14542 Acetyltransf_CG:  GCN5  31.2      53  0.0012   26.5   3.1   23  148-170    39-61  (78)
 87 cd06591 GH31_xylosidase_XylS X  31.1   4E+02  0.0086   27.3  10.3   20   40-59     20-39  (319)
 88 TIGR02401 trehalose_TreY malto  31.0 2.1E+02  0.0045   33.8   8.8   71  100-180    13-93  (825)
 89 cd06604 GH31_glucosidase_II_Ma  29.9 1.9E+02  0.0041   29.9   7.8  121  101-233    22-147 (339)
 90 KOG0462|consensus               29.7      43 0.00093   37.3   2.9   32  148-179    91-138 (650)
 91 PF03979 Sigma70_r1_1:  Sigma-7  29.4      51  0.0011   27.0   2.7   55  103-173     7-61  (82)
 92 PRK14457 ribosomal RNA large s  29.2   1E+02  0.0023   32.3   5.6   73   73-169   215-291 (345)
 93 TIGR02668 moaA_archaeal probab  27.9 1.6E+02  0.0034   29.7   6.6   86   71-177     9-96  (302)
 94 PF00682 HMGL-like:  HMGL-like   27.7 1.5E+02  0.0033   28.6   6.3   57  103-178   108-164 (237)
 95 cd03319 L-Ala-DL-Glu_epimerase  27.6 1.2E+02  0.0025   31.0   5.6   22  149-170   264-285 (316)
 96 PRK14340 (dimethylallyl)adenos  27.6      87  0.0019   34.0   4.8   52   97-166   277-330 (445)
 97 PF07071 DUF1341:  Protein of u  27.4      76  0.0016   30.9   3.8   23  148-170   186-209 (218)
 98 KOG0464|consensus               27.2      53  0.0011   35.4   2.9   30  151-180    74-116 (753)
 99 PLN02231 alanine transaminase   27.1 1.7E+02  0.0038   32.4   7.2   24  146-169   286-309 (534)
100 cd07937 DRE_TIM_PC_TC_5S Pyruv  27.0 1.9E+02  0.0041   29.1   6.9   22  151-172   118-139 (275)
101 PF13199 Glyco_hydro_66:  Glyco  26.9 3.8E+02  0.0082   30.2   9.7   93  146-244   165-264 (559)
102 TIGR00048 radical SAM enzyme,   26.8 2.2E+02  0.0049   29.9   7.6   75   74-172   219-297 (355)
103 PRK10785 maltodextrin glucosid  26.7      55  0.0012   36.9   3.3   30  149-179   224-253 (598)
104 TIGR02102 pullulan_Gpos pullul  26.7 3.3E+02  0.0071   33.3   9.7   80  150-236   554-635 (1111)
105 PRK14456 ribosomal RNA large s  26.7 1.6E+02  0.0035   31.2   6.6   74   74-170   238-315 (368)
106 PRK14507 putative bifunctional  26.6 2.6E+02  0.0056   35.7   9.1   71   98-178   753-833 (1693)
107 PRK01060 endonuclease IV; Prov  26.3 3.4E+02  0.0074   26.7   8.6   96   31-168     5-106 (281)
108 PRK00278 trpC indole-3-glycero  26.3 1.1E+02  0.0025   30.5   5.1   34  148-181   144-177 (260)
109 PTZ00372 endonuclease 4-like p  26.0 1.4E+02  0.0031   32.1   6.0   25  146-170   366-390 (413)
110 PF03102 NeuB:  NeuB family;  I  25.8      65  0.0014   32.1   3.2   25  147-171    52-76  (241)
111 PF09508 Lact_bio_phlase:  Lact  25.0 5.2E+02   0.011   29.6  10.1  100  211-342   175-307 (716)
112 PF01055 Glyco_hydro_31:  Glyco  24.9 2.6E+02  0.0056   29.9   7.9  135  100-248    40-180 (441)
113 cd09013 BphC-JF8_N_like N-term  24.7      68  0.0015   27.1   2.8   32  148-179    71-102 (121)
114 cd07939 DRE_TIM_NifV Streptomy  24.3 2.4E+02  0.0052   27.9   7.0   20   39-58     15-34  (259)
115 COG2984 ABC-type uncharacteriz  24.1      90  0.0019   32.4   3.9   29  142-170   166-194 (322)
116 PRK04296 thymidine kinase; Pro  24.1      80  0.0017   29.8   3.4   25  146-170    88-112 (190)
117 cd00308 enolase_like Enolase-s  23.9 1.8E+02   0.004   28.0   6.0   22  149-170   181-202 (229)
118 PRK14335 (dimethylallyl)adenos  23.9 1.3E+02  0.0028   32.7   5.3   52   97-166   287-340 (455)
119 TIGR00190 thiC thiamine biosyn  23.8      64  0.0014   34.5   2.8   23  154-178   246-268 (423)
120 TIGR03581 EF_0839 conserved hy  23.6 1.6E+02  0.0035   29.0   5.3   22  148-169   186-208 (236)
121 PF00218 IGPS:  Indole-3-glycer  23.5      85  0.0018   31.5   3.5   39  146-184   140-178 (254)
122 cd03318 MLE Muconate Lactonizi  23.1 1.6E+02  0.0035   30.7   5.7   28   30-58    189-216 (365)
123 PLN02368 alanine transaminase   23.0      83  0.0018   33.6   3.6   24  146-169   225-248 (407)
124 COG3618 Predicted metal-depend  23.0   8E+02   0.017   25.1  10.3  164   30-248    53-237 (279)
125 PRK14333 (dimethylallyl)adenos  22.8 1.3E+02  0.0029   32.4   5.2   52   97-166   284-337 (448)
126 PRK15452 putative protease; Pr  22.6   3E+02  0.0066   29.9   7.8  105   86-227    29-133 (443)
127 TIGR00387 glcD glycolate oxida  22.5      78  0.0017   33.8   3.3   28  148-177     5-32  (413)
128 KOG3257|consensus               22.4      46 0.00099   31.0   1.3   23  148-170   135-158 (168)
129 PF00150 Cellulase:  Cellulase   22.3      84  0.0018   30.6   3.3   57  150-236    61-117 (281)
130 PF06080 DUF938:  Protein of un  22.3      50  0.0011   32.1   1.6   28  152-179   172-199 (204)
131 cd07940 DRE_TIM_IPMS 2-isoprop  22.2 2.8E+02   0.006   27.6   7.0   56  104-178   115-170 (268)
132 smart00642 Aamy Alpha-amylase   22.0 2.4E+02  0.0051   26.2   6.0   29  101-130    68-96  (166)
133 cd06840 PLPDE_III_Bif_AspK_Dap  21.9 5.8E+02   0.013   26.7   9.7   21  148-168   153-173 (368)
134 cd07265 2_3_CTD_N N-terminal d  21.8      91   0.002   26.2   3.0   32  147-178    69-102 (122)
135 COG2008 GLY1 Threonine aldolas  21.7      77  0.0017   33.2   2.9   23  145-167   144-166 (342)
136 COG3977 Alanine-alpha-ketoisov  21.7      79  0.0017   32.8   2.9   25  145-169   193-217 (417)
137 cd03327 MR_like_2 Mandelate ra  21.6 1.5E+02  0.0033   30.7   5.1   27   31-58    173-199 (341)
138 COG0277 GlcD FAD/FMN-containin  21.6      86  0.0019   33.4   3.4   30  148-179    39-68  (459)
139 KOG0256|consensus               21.4 2.4E+02  0.0051   30.6   6.4  114   31-193   197-311 (471)
140 COG0621 MiaB 2-methylthioadeni  21.3 1.4E+02   0.003   32.5   4.8   92   41-168   236-329 (437)
141 COG1217 TypA Predicted membran  21.3      56  0.0012   35.8   1.8   22  158-179    47-81  (603)
142 PRK02308 uvsE putative UV dama  21.2 4.7E+02    0.01   26.9   8.6   56  104-169    49-109 (303)
143 PRK14334 (dimethylallyl)adenos  21.2 1.3E+02  0.0029   32.4   4.7   52   98-166   267-319 (440)
144 PRK05942 aspartate aminotransf  21.2      94   0.002   32.5   3.6   23  146-168   185-207 (394)
145 PRK11194 ribosomal RNA large s  21.2 2.8E+02  0.0061   29.5   7.1   72   75-170   221-299 (372)
146 PRK09505 malS alpha-amylase; R  21.0      93   0.002   35.8   3.6   30  149-179   290-319 (683)
147 PF00155 Aminotran_1_2:  Aminot  20.9      95  0.0021   31.7   3.5   24  146-169   163-186 (363)
148 PRK14454 ribosomal RNA large s  20.9 2.4E+02  0.0053   29.5   6.5   72   74-169   212-287 (342)
149 PF09711 Cas_Csn2:  CRISPR-asso  20.9   1E+02  0.0022   29.6   3.3   25  145-169   140-164 (188)
150 COG2100 Predicted Fe-S oxidore  20.8 1.7E+02  0.0038   30.6   5.1   48   97-164   236-284 (414)
151 cd07943 DRE_TIM_HOA 4-hydroxy-  20.6 2.4E+02  0.0053   28.0   6.2   20  152-171   113-132 (263)
152 PRK09427 bifunctional indole-3  20.6 1.2E+02  0.0027   33.0   4.4   39  146-184   141-179 (454)
153 PF01408 GFO_IDH_MocA:  Oxidore  20.6 2.6E+02  0.0056   23.4   5.7   21  148-168    97-117 (120)
154 PLN00145 tyrosine/nicotianamin  20.5   1E+02  0.0022   33.0   3.7   24  146-169   205-228 (430)
155 PRK09147 succinyldiaminopimela  20.5      97  0.0021   32.4   3.5   23  146-168   181-203 (396)
156 KOG4175|consensus               20.4 1.7E+02  0.0036   28.7   4.6   24  148-171   132-155 (268)
157 PF08671 SinI:  Anti-repressor   20.4 2.5E+02  0.0055   18.8   4.2   13  148-160    17-29  (30)
158 TIGR01290 nifB nitrogenase cof  20.4 1.9E+02  0.0042   31.3   5.8   29  148-176   200-231 (442)
159 PRK09082 methionine aminotrans  20.4      95  0.0021   32.4   3.4   23  146-168   178-200 (386)
160 COG0826 Collagenase and relate  20.3 1.4E+02  0.0031   31.3   4.6   35  146-180    44-78  (347)
161 TIGR02456 treS_nterm trehalose  20.2      99  0.0022   34.3   3.6   30  149-179    74-103 (539)
162 TIGR03542 DAPAT_plant LL-diami  20.2 1.2E+02  0.0025   32.0   4.0   24  145-168   187-210 (402)
163 cd03320 OSBS o-Succinylbenzoat  20.0 1.8E+02   0.004   28.8   5.2   22  149-170   212-233 (263)

No 1  
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B  (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20).  The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits.  Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff.  Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in 
Probab=100.00  E-value=1e-84  Score=673.38  Aligned_cols=339  Identities=40%  Similarity=0.716  Sum_probs=301.2

Q ss_pred             CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880         27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ  106 (497)
Q Consensus        27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~  106 (497)
                      |++||+||||||||+|+++||++||.||.||             ||+|||||+||||||+|++.||..            
T Consensus         1 f~~RG~mlDvaR~f~~~~~ik~~Id~ma~~K-------------lN~lh~HltDd~~~rle~~~~P~L------------   55 (348)
T cd06562           1 FPHRGLLLDTSRHFLSVDSIKRTIDAMAYNK-------------LNVLHWHITDSQSFPLESPSYPEL------------   55 (348)
T ss_pred             CCccceeeeccccCCCHHHHHHHHHHHHHhC-------------CcEEEEeEEcCCCceEeeCCCchh------------
Confidence            7899999999999999999999999999999             699999999999999999988732            


Q ss_pred             HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880        107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR  186 (497)
Q Consensus       107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~  186 (497)
                                                    +..|+|+++++||++||+|||+||++|||+||||||+|||+.++++++|+
T Consensus        56 ------------------------------t~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~a~~~~~p~  105 (348)
T cd06562          56 ------------------------------SKKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPEIDTPGHTGSWGQGYPE  105 (348)
T ss_pred             ------------------------------hhccCcCCCceECHHHHHHHHHHHHHcCCEEEEeccCchhhHHHHHhChh
Confidence                                          12456778899999999999999999999999999999999999999887


Q ss_pred             CCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCC
Q psy14880        187 YGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQV  266 (497)
Q Consensus       187 ~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~  266 (497)
                      +...    |.  ..|..++.++..++|||++|+|++|+++|++|++++||  +++|||||||                  
T Consensus       106 l~~~----~~--~~~~~~~~~~~~~~L~~~~~~t~~fl~~vl~E~~~lF~--~~~iHiGgDE------------------  159 (348)
T cd06562         106 LLTG----CY--AVWRKYCPEPPCGQLNPTNPKTYDFLKTLFKEVSELFP--DKYFHLGGDE------------------  159 (348)
T ss_pred             hhCC----CC--ccccccccCCCCccccCCChhHHHHHHHHHHHHHHhcC--CcceEeecCC------------------
Confidence            6522    21  22444555666789999999999999999999999999  9999999999                  


Q ss_pred             CCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCC
Q psy14880        267 PCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLT  346 (497)
Q Consensus       267 ~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~  346 (497)
                                        +..+||+++|+|+++|+++|++    ++.+|+.+|++++.++|++    +||++++|+|++.
T Consensus       160 ------------------~~~~~w~~~p~~~~~m~~~g~~----~~~~l~~~f~~~~~~~l~~----~Gk~~i~W~d~~~  213 (348)
T cd06562         160 ------------------VNFNCWNSNPEIQKFMKKNNGT----DYSDLESYFIQRALDIVRS----LGKTPIVWEEVFD  213 (348)
T ss_pred             ------------------CCCCcccCCHHHHHHHHHcCCC----CHHHHHHHHHHHHHHHHHH----cCCeEEEeeeccc
Confidence                              7789999999999999999973    6889999999999999984    8999999999987


Q ss_pred             CchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccccCC-CCCCChhhhhccCCCC-------CC
Q psy14880        347 DPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFWGV-TSYYRWQRVYDNLLPS-------SP  418 (497)
Q Consensus       347 ~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~-~~~~~~~~iY~~~p~~-------~~  418 (497)
                      ..+   ..+ +++++|++|++..  .+.+++++||+||+|+++++|||+++.+. .++.+|+++|+++|..       .+
T Consensus       214 ~~~---~~~-~~~~iv~~W~~~~--~~~~~~~~G~~vI~s~~~~~Yld~~~~~~~~~~~~~~~~y~~~p~~~~~~~~~~~  287 (348)
T cd06562         214 NGV---YLL-PKDTIVQVWGGSD--ELKNVLAAGYKVILSSYDFWYLDCGFGGWVGPGNDWCDPYKNWPRIYSGTPEQKK  287 (348)
T ss_pred             CCC---ccC-CCCeEEEECCCcH--HHHHHHHCCCCEEEeCCCcEEEeecCCCCCCCCCcHhhhhcCCCCCCCCChhhcc
Confidence            643   134 5799999999874  67899999999999999999999986321 3568999999988864       38


Q ss_pred             CccceeeeeeccccCCCcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHHHHHHHHHHHCC
Q psy14880        419 LVLGGEVAMWTEYVDDQSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRFLEQRERLVEMG  478 (497)
Q Consensus       419 ~ilG~e~~lWsE~~~~~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl~~~~~~l~~~G  478 (497)
                      +|+|+|+|||+|++++.++++++|||++|+||++|+++. ++|++|..||..+++||.+||
T Consensus       288 ~ilG~~~~lW~E~~~~~~~~~~~~PR~~a~AE~~W~~~~~~~~~~f~~Rl~~~~~~l~~~g  348 (348)
T cd06562         288 LVLGGEACMWGEQVDDTNLDQRLWPRASALAERLWSGPSDTNLTDAEPRLVEFRCRLVRRG  348 (348)
T ss_pred             cEEEEEEEeeeeeeCccceeehhhhhHHHHHHHhhCCCcCCCHHHHHHHHHHHHHHHHhCc
Confidence            999999999999999989999999999999999999876 689999999999999999998


No 2  
>KOG2499|consensus
Probab=100.00  E-value=1e-83  Score=660.07  Aligned_cols=377  Identities=40%  Similarity=0.697  Sum_probs=336.4

Q ss_pred             chHHHHHHHH------hhCCccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEE
Q psy14880          3 SFDDYIQLWA------HFQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRW   76 (497)
Q Consensus         3 ~~~~~~~~~~------~~~~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhl   76 (497)
                      .++++-||--      .+.+..+.|.|.|+|.+||+|||+||||+|++.||+.||.||.-|             +|+|||
T Consensus       151 glETfSqLv~~d~~~~~~~~~~~~I~D~PrF~hRGlLlDTSRhylpv~~IkrtLeaMa~nK-------------LNVlHW  217 (542)
T KOG2499|consen  151 GLETFSQLVWGDSIGGLFMIATAYIQDKPRFGHRGLLLDTSRHYLPVKVIKRTLEAMAANK-------------LNVLHW  217 (542)
T ss_pred             HHHHHHHHheeccCCceEEeeeeeEeccCCCcccceEEecccceecHHHHHHHHHHHHhhh-------------hceeEE
Confidence            4678888832      455667889999999999999999999999999999999999999             599999


Q ss_pred             eccCCcccccccccCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
Q psy14880         77 QMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQEL  156 (497)
Q Consensus        77 Hl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~ei  156 (497)
                      |++|+|+||+|+|+||.+                                          .++|+||+.+.||++|+++|
T Consensus       218 HivDs~SFPle~~~~PeL------------------------------------------~~kGaYs~~~vYT~eDv~ev  255 (542)
T KOG2499|consen  218 HIVDSQSFPLESPTFPEL------------------------------------------HRKGAYSPRHVYTREDVSEV  255 (542)
T ss_pred             EeecCCCCccccCCchhh------------------------------------------hhcCCCCcceeecHHHHHHH
Confidence            999999999999998732                                          24799999999999999999


Q ss_pred             HHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        157 VHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       157 v~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      |+||+.|||+|+||||+|||+++   |+|  ++|++    ..++|+.++++++++.|||+++.+|+|+++++.|+.+.||
T Consensus       256 V~yarlRGIRVlpEfD~PgHt~s---Wg~--g~~~f----l~p~~~~~~~~~~~gplnP~~n~tydvls~i~~dv~evFp  326 (542)
T KOG2499|consen  256 VEYARLRGIRVLPEFDTPGHTGS---WGP--GYPDF----LTPCWSSFEVQPPFGPLNPTNNHTYDVLSEIFEDVSEVFP  326 (542)
T ss_pred             HHHHHhccceeeecccCCccccc---ccC--CCCcc----cCCcccccccCCCCcCCCCCchhHHHHHHHHHHHHHHhCc
Confidence            99999999999999999999998   665  45663    2346788889999999999999999999999999999999


Q ss_pred             CCCCeEEEcCCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHH
Q psy14880        237 YGDEMFHMGADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLW  316 (497)
Q Consensus       237 ~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~  316 (497)
                        ..+||+||||                                    |+..||+++|++|++|+++|++.   +..++.
T Consensus       327 --~~~~HlGGDE------------------------------------V~~~CW~s~~~Iq~fM~~kGfg~---~~~~~~  365 (542)
T KOG2499|consen  327 --DEFFHLGGDE------------------------------------VSTPCWKSNPEIQDFMRKKGFGL---DTKSLE  365 (542)
T ss_pred             --HHHeecCCce------------------------------------eecccccCChHHHHHHHhCCCCc---hHHHHH
Confidence              9999999999                                    88999999999999999999963   556677


Q ss_pred             HHHHHHHHHHHHHHhhCCCceEEEecCCCCCchhhccCCCCCcEEEEecCCCCc-hhHHHHHHcCCeEEEeCCCcccccc
Q psy14880        317 AHFQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPKRYIIQTWVPRID-PLADLLISKGYQVIISTKDAWYLDH  395 (497)
Q Consensus       317 ~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~~~~~~~~l~~~~~iv~~W~~~~~-~~~~~~~~~Gy~vI~s~~~~~Yld~  395 (497)
                      .+|++.+.+++.    +.+++++.|+|.+.+...    + +.++|+|.|+.+.+ ....+.+.+||++|+|+..+||||+
T Consensus       366 ~~~~~~~~~i~~----s~~~~~I~Wqevf~n~~~----i-~p~tiiq~W~~~~~~~~~~k~v~~~~~~ivs~s~~wYLd~  436 (542)
T KOG2499|consen  366 RLYIQFLLDILN----SWNKTVIVWQEVFDNKRK----I-DPRTIIQIWKIGTWYPKELKIVTKGYRFIVSNSAAWYLDH  436 (542)
T ss_pred             HHHHHHHHHHHH----hcCceEEEeehhcccccc----C-CCCceeeeeccCCccHHHHHHHhccCceEEEeccceEeec
Confidence            888888878777    467899999999987643    2 35899999998752 4567888999999999999999998


Q ss_pred             cccCCCCCCChhhhhccCCCCC-------CCccceeeeeeccccCCCcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880        396 GFWGVTSYYRWQRVYDNLLPSS-------PLVLGGEVAMWTEYVDDQSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF  467 (497)
Q Consensus       396 ~~~~~~~~~~~~~iY~~~p~~~-------~~ilG~e~~lWsE~~~~~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl  467 (497)
                      +.|+    .+|+++|+.+|.++       +.|+|||+|||+|++|+..++.++|||+.|+|||+||.++ ..+.+...||
T Consensus       437 ~~~g----~dw~~~Y~~~p~~~~g~~~~k~lVlGGE~cmWgE~vD~t~L~~RlWPRAsA~AERLWS~~~~~~~~~A~~Rl  512 (542)
T KOG2499|consen  437 IGYG----SDWRKVYNTEPLSGMGTPEQKKLVLGGEVCMWGEYVDNTNLESRLWPRASAAAERLWSNKKVSRLLDAYPRL  512 (542)
T ss_pred             cccC----CChhheeeccccccCCCHHHhhheecceeeeehhhccccccccccccchhHHHHHhhcccccchHHHHHHHH
Confidence            7763    68999999999873       5799999999999999999999999999999999999655 5789999999


Q ss_pred             HHHHHHHHHCCCCcccCCCCeeecCCCCCC
Q psy14880        468 LEQRERLVEMGIRAEVTTPEWCYLNDGQCR  497 (497)
Q Consensus       468 ~~~~~~l~~~Gi~~~p~~p~~c~~~~~~c~  497 (497)
                      ..+||||++|||.+.|+.|+||.+++++|.
T Consensus       513 ~~~RcrLv~RGi~A~p~~p~~C~~~~~~c~  542 (542)
T KOG2499|consen  513 HLFRCRLVARGIGAQPVQPGWCLQEEGECP  542 (542)
T ss_pred             HHHHHHHHhcCCCcCCCCCcccccCCCCCC
Confidence            999999999999999999999999999994


No 3  
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin.  Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00  E-value=2.9e-80  Score=642.62  Aligned_cols=341  Identities=24%  Similarity=0.357  Sum_probs=284.8

Q ss_pred             CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880         27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ  106 (497)
Q Consensus        27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~  106 (497)
                      |++||+||||||||+|+++||++||.||+||             ||+|||||+||||||+|++.||.+            
T Consensus         1 f~~RG~mLD~aR~f~~~~~ik~~Id~ma~~K-------------~N~lhlHltDdq~~rle~~~~P~L------------   55 (357)
T cd06563           1 FSWRGLMLDVSRHFFPVDEVKRFIDLMALYK-------------LNVFHWHLTDDQGWRIEIKKYPKL------------   55 (357)
T ss_pred             CCccceeeeccccCcCHHHHHHHHHHHHHhc-------------cceEEEeeecCCCceecccCcchh------------
Confidence            7899999999999999999999999999999             699999999999999999999843            


Q ss_pred             HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880        107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR  186 (497)
Q Consensus       107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~  186 (497)
                        .+.++.|+-.   ....|...         .+.-..+++||++|+++||+||++|||+||||||+|||+.++++++|+
T Consensus        56 --t~~ga~~~~~---~~~~~~~~---------~~~~~~~~~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~pe  121 (357)
T cd06563          56 --TEVGAWRGPT---EIGLPQGG---------GDGTPYGGFYTQEEIREIVAYAAERGITVIPEIDMPGHALAALAAYPE  121 (357)
T ss_pred             --hhcccccCcc---cccccccc---------cCCCccCceECHHHHHHHHHHHHHcCCEEEEecCCchhHHHHHHhCcc
Confidence              2223333211   00011110         111134678999999999999999999999999999999999998876


Q ss_pred             CCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCC
Q psy14880        187 YGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQV  266 (497)
Q Consensus       187 ~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~  266 (497)
                      +....    ....++..  ...++++|||++|+|++|+++|++|++++|+  +++|||||||                  
T Consensus       122 l~~~~----~~~~~~~~--~~~~~~~L~~~~~~t~~f~~~ll~E~~~lF~--~~~iHiGgDE------------------  175 (357)
T cd06563         122 LGCTG----GPGSVVSV--QGVVSNVLCPGKPETYTFLEDVLDEVAELFP--SPYIHIGGDE------------------  175 (357)
T ss_pred             ccCCC----CCCccccc--cCcCCCccCCCChhHHHHHHHHHHHHHHhCC--CCeEEEeccc------------------
Confidence            54211    11111111  1234578999999999999999999999999  9999999999                  


Q ss_pred             CCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCC
Q psy14880        267 PCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLT  346 (497)
Q Consensus       267 ~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~  346 (497)
                                        +...||+++|.|+++|+++|+.    +..+|+.+|++++.++|++    +|+++++|+|++.
T Consensus       176 ------------------~~~~~w~~~~~~~~~~~~~g~~----~~~~l~~~f~~~~~~~v~~----~G~~~i~W~d~~~  229 (357)
T cd06563         176 ------------------VPKGQWEKSPACQARMKEEGLK----DEHELQSYFIKRVEKILAS----KGKKMIGWDEILE  229 (357)
T ss_pred             ------------------cCCcccccCHHHHHHHHHcCCC----CHHHHHHHHHHHHHHHHHH----cCCEEEEeecccc
Confidence                              7778999999999999999984    5778999999999999984    8999999999875


Q ss_pred             CchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCccccccccc-------CCCCCCChhhhhccCCCC---
Q psy14880        347 DPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFW-------GVTSYYRWQRVYDNLLPS---  416 (497)
Q Consensus       347 ~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~-------~~~~~~~~~~iY~~~p~~---  416 (497)
                      ..      + +++++|++|++.  ..+.+++++||+||+|+++++|||+++.       +..++.+|+++|+++|..   
T Consensus       230 ~~------l-~~~~iv~~W~~~--~~~~~~~~~G~~vI~s~~~~~Yld~~~~~~~~~~~~~~~~~~~~~~y~~~P~~~~~  300 (357)
T cd06563         230 GG------L-PPNATVMSWRGE--DGGIKAAKQGYDVIMSPGQYLYLDYAQSKGPDEPASWAGFNTLEKVYSFEPVPGGL  300 (357)
T ss_pred             cC------C-CCCcEEEECCCc--hHHHHHHHCCCCEEEeCCCceEEecCCCCCCCCCccccCCCCHHHHhcCCCCCCCC
Confidence            42      4 589999999988  4678999999999999999999998751       135678999999999976   


Q ss_pred             ----CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880        417 ----SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF  467 (497)
Q Consensus       417 ----~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl  467 (497)
                          .++|+|+|+|||+|.+.+ +++++++|||++|+||++|++++ ++|++|..||
T Consensus       301 ~~~~~~~ilG~~~~lW~E~~~~~~~~~~~~~PR~~a~AE~~W~~~~~~~~~~f~~rl  357 (357)
T cd06563         301 TPEQAKRILGVQANLWTEYIPTPERVEYMAFPRLLALAEVAWTPPEKKDWEDFRKRL  357 (357)
T ss_pred             ChhHhcCEEEEEEEecccccCCHHHHHHHHHHHHHHHHHHHhCCccCCCHHHHHhhC
Confidence                279999999999999976 78999999999999999999976 7999999886


No 4  
>cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of  the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin.  Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00  E-value=9e-79  Score=618.61  Aligned_cols=309  Identities=33%  Similarity=0.495  Sum_probs=268.1

Q ss_pred             CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880         27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ  106 (497)
Q Consensus        27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~  106 (497)
                      |++||+|||+||||+|+++||++||.||.+|             ||+|||||+||||||+|+++||..            
T Consensus         1 f~~RG~mlD~aR~f~~~~~ik~~Id~ma~~K-------------lN~lh~HltDd~~~rle~~~~P~l------------   55 (311)
T cd06570           1 FPWRGLLIDVSRHFIPVAVIKRQLDAMASVK-------------LNVFHWHLTDDQGFRIESKKYPKL------------   55 (311)
T ss_pred             CCccCeEEecCCCCcCHHHHHHHHHHHHHhC-------------CeEEEEEEecCCCceeecCCCccc------------
Confidence            7899999999999999999999999999999             699999999999999999988742            


Q ss_pred             HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880        107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR  186 (497)
Q Consensus       107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~  186 (497)
                                            +        ..|  +.+++||++||+|||+||++|||+||||||+|||+.++++++|+
T Consensus        56 ----------------------t--------~~g--~~~~~yT~~di~elv~yA~~rgI~vIPEId~PGH~~a~~~~ype  103 (311)
T cd06570          56 ----------------------Q--------QKA--SDGLYYTQEQIREVVAYARDRGIRVVPEIDVPGHASAIAVAYPE  103 (311)
T ss_pred             ----------------------c--------ccC--CCCCccCHHHHHHHHHHHHHcCCEEEEeecCccchHHHHHhCHH
Confidence                                  0        112  23578999999999999999999999999999999999987766


Q ss_pred             CCCCCcccccCC-CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCC
Q psy14880        187 YGLGDLIVCLND-PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQ  265 (497)
Q Consensus       187 ~~~~~l~~~~~~-~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~  265 (497)
                      +.     .|... .....++.  ..++|||++|+|++|+++|++|++++||  +++|||||||                 
T Consensus       104 l~-----~~~~~~~~~~~~~~--~~~~l~~~~p~t~~f~~~l~~E~~~lF~--~~~iHiGgDE-----------------  157 (311)
T cd06570         104 LA-----SGPGPYVIERGWGV--FEPLLDPTNEETYTFLDNLFGEMAELFP--DEYFHIGGDE-----------------  157 (311)
T ss_pred             hc-----cCCCcccccccccc--CCCccCCCChhHHHHHHHHHHHHHHhCC--CCceEeeccC-----------------
Confidence            43     22221 11122222  2357999999999999999999999999  9999999999                 


Q ss_pred             CCCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCC
Q psy14880        266 VPCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHL  345 (497)
Q Consensus       266 ~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~  345 (497)
                                         +...||+++|+|+++|+++|++    +..+|+.+|++++.++|++    +||++|+|+|++
T Consensus       158 -------------------~~~~~W~~~p~~~~~~~~~g~~----~~~~l~~~f~~~~~~~v~~----~Gk~~~~W~d~~  210 (311)
T cd06570         158 -------------------VDPKQWNENPRIQAFMKEHGLK----DAAALQAYFNQRVEKILSK----HGKKMIGWDEVL  210 (311)
T ss_pred             -------------------CCCCcccCCHHHHHHHHHcCCC----CHHHHHHHHHHHHHHHHHH----cCCeEEEecccc
Confidence                               7788999999999999999984    5778999999999999884    899999999987


Q ss_pred             CCchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccCCCCCCCccceee
Q psy14880        346 TDPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNLLPSSPLVLGGEV  425 (497)
Q Consensus       346 ~~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~p~~~~~ilG~e~  425 (497)
                      ..      .+ +++++|+.|++.  ..+.+++++||++|+|++  +|||+++       +++++|+++|.    |+|||+
T Consensus       211 ~~------~l-~~~~iv~~W~~~--~~~~~~~~~G~~vI~s~~--~YlD~~~-------~~~~~y~~~p~----ilG~~~  268 (311)
T cd06570         211 HP------DL-PKNVVIQSWRGH--DSLGEAAKAGYQGILSTG--YYIDQPQ-------PAAYHYRVDPM----ILGGEA  268 (311)
T ss_pred             cc------CC-CCCeEEEEeCCc--hHHHHHHHCCCCEEEech--hheeCCC-------chhheeCCCCc----EEEEEE
Confidence            42      35 579999999975  457889999999999984  8999864       35689999987    999999


Q ss_pred             eeeccccCCCcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880        426 AMWTEYVDDQSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF  467 (497)
Q Consensus       426 ~lWsE~~~~~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl  467 (497)
                      |||||+++++++++++|||++|+||++|++++ ++|++|.+||
T Consensus       269 ~lW~E~v~~~~~~~~~~PR~~A~AE~~Ws~~~~~~~~~~~~Rl  311 (311)
T cd06570         269 TMWAELVSEETIDSRLWPRTAAIAERLWSAQDVRDEDDMYRRL  311 (311)
T ss_pred             EEeecCcCHHHHHHHHhHHHHHHHHHhhCCCcCCCHHHHHhhC
Confidence            99999998789999999999999999999977 7999999886


No 5  
>cd06569 GH20_Sm-chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00  E-value=7.1e-78  Score=638.43  Aligned_cols=360  Identities=21%  Similarity=0.291  Sum_probs=283.0

Q ss_pred             cCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCCh
Q psy14880         23 DEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTP  102 (497)
Q Consensus        23 D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~  102 (497)
                      |+|+|++||+||||||||+|+++||++||.||++|             ||+||||||||||||+|+++||.         
T Consensus         1 D~Prf~~RGlmLDvaR~f~~~~~ik~~Id~ma~~K-------------~N~lHlHLtDdqgwriei~~~P~---------   58 (445)
T cd06569           1 DAPRFEYRGMHLDVARNFHSKETVLKLLDQMAAYK-------------LNKLHLHLTDDEGWRLEIPGLPE---------   58 (445)
T ss_pred             CCCCCcccceeeeccCCCCCHHHHHHHHHHHHHhC-------------CceEEEEeecCCCcceeccCCch---------
Confidence            89999999999999999999999999999999999             69999999999999999999984         


Q ss_pred             hHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880        103 LEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW  181 (497)
Q Consensus       103 ~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l  181 (497)
                           +++.++.|+.... .+.-.|.+. .++     ....+.+++||++||+|||+||++|||+||||||+|||+.+++
T Consensus        59 -----Lt~~ga~r~~~~~~~~~~~~~~~-~~~-----~~~~~~~g~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~a~l  127 (445)
T cd06569          59 -----LTEVGAKRCHDLSETTCLLPQLG-SGP-----DTNNSGSGYYSRADYIEILKYAKARHIEVIPEIDMPGHARAAI  127 (445)
T ss_pred             -----hhhcccccccccccccccccccc-cCc-----ccCcccCCccCHHHHHHHHHHHHHcCCEEEEccCCchhHHHHH
Confidence                 3444444443211 011112211 100     0012457889999999999999999999999999999999999


Q ss_pred             hhcCCCCCCCcccccCCC---CC--------Ccc-ccC-CCCCcCCCCChhHHHHHHHHHHHHHhccCC---CCCeEEEc
Q psy14880        182 QWGPRYGLGDLIVCLNDP---SW--------RAN-CIQ-PPCGQLNPINPHVYTVLKDIYGELMGMAKY---GDEMFHMG  245 (497)
Q Consensus       182 ~~~p~~~~~~l~~~~~~~---~~--------~~~-~~~-~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~---~~~~iHiG  245 (497)
                      ++++. +||+|..+....   .+        ..+ +++ ..+++|||++|+|++|+++|++|++++|+.   .+++||||
T Consensus       128 ~a~~~-~yp~l~~~g~~~~~~~~~~~d~~~~~~~~~~~~~~~~~L~p~~~~ty~fl~~vl~Ev~~lF~~~g~P~~yiHiG  206 (445)
T cd06569         128 KAMEA-RYRKLMAAGKPAEAEEYRLSDPADTSQYLSVQFYTDNVINPCMPSTYRFVDKVIDEIARMHQEAGQPLTTIHFG  206 (445)
T ss_pred             Hhhhc-cchhhhccCCccccccccccCcccccccccccccccccccCCchhHHHHHHHHHHHHHHHHhccCCCcceEEEe
Confidence            87532 345554322100   00        000 111 134789999999999999999999999942   17999999


Q ss_pred             CCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCCCcceeecccccCCHHHH--HHHHHcCCCCCcccHHHHHHHHHHHH
Q psy14880        246 ADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSPHLWTVFIKCWNNTPAIV--DFMLAHGKNTTFDDYIQLWAHFQNKA  323 (497)
Q Consensus       246 gDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~--~~~~~~g~~~~~~~~~~l~~~F~~~~  323 (497)
                      |||                                    +...||+++|.|+  ++|+++|+.    ++.+|+.+|++++
T Consensus       207 GDE------------------------------------v~~~~W~~~~~~~~~~~~~~~~~~----~~~~l~~~f~~~v  246 (445)
T cd06569         207 GDE------------------------------------VPEGAWGGSPACKAQLFAKEGSVK----DVEDLKDYFFERV  246 (445)
T ss_pred             ccc------------------------------------cCCCcccCCHHHHHHHHHHhcCCC----CHHHHHHHHHHHH
Confidence            999                                    7788999999999  999998874    6789999999999


Q ss_pred             HHHHHHHhhCCCceEEEecCCCCCchhh--ccCCCCCcEEEEecCCCC---chhHHHHHHcCCeEEEeCCCcccccccc-
Q psy14880        324 AASLDEAVGHNRTKLIVWSSHLTDPEII--LNYLDPKRYIIQTWVPRI---DPLADLLISKGYQVIISTKDAWYLDHGF-  397 (497)
Q Consensus       324 ~~~l~~~~~~~g~~~~~W~d~~~~~~~~--~~~l~~~~~iv~~W~~~~---~~~~~~~~~~Gy~vI~s~~~~~Yld~~~-  397 (497)
                      .++|++    +|+++++|+|++......  +..+ +++++|++|+...   ...+.+++++||+||+||++++|||+++ 
T Consensus       247 ~~~l~~----~Gk~~i~W~e~~~~~~~~~~~~~~-~~~~~v~~W~~~~~~~~~~~~~~~~~G~~vI~s~~~~~YlD~~~~  321 (445)
T cd06569         247 SKILKA----HGITLAGWEDGLLGKDTTNVDGFA-TPYVWNNVWGWGYWGGEDRAYKLANKGYDVVLSNATNLYFDFPYE  321 (445)
T ss_pred             HHHHHH----cCCeEEEecccccCCCcccccccC-CCCeEEEEccCCcccccHHHHHHHHCCCCEEEeCCCcEEEecCCC
Confidence            999984    799999999998764321  1124 4689999998642   3567889999999999999999999864 


Q ss_pred             ---------cCCCCCCChhhhhccCCCC-----------------------------CCCccceeeeeeccccCC-Ccch
Q psy14880        398 ---------WGVTSYYRWQRVYDNLLPS-----------------------------SPLVLGGEVAMWTEYVDD-QSLD  438 (497)
Q Consensus       398 ---------~~~~~~~~~~~iY~~~p~~-----------------------------~~~ilG~e~~lWsE~~~~-~~l~  438 (497)
                               | ++.|.+++++|+++|.+                             .++|+|+|+|||||++++ +.++
T Consensus       322 ~~~~~~g~~w-~~~~~~~~~~y~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ilG~e~~lWsE~v~~~~~l~  400 (445)
T cd06569         322 KHPEERGYYW-AGRFVDTKKVFSFMPDNLYANAEVTRDGDPIDDTALNGKVRLTLEGPKNILGLQGQLWSETIRTDEQLE  400 (445)
T ss_pred             CCCCcCCccc-ccCCCCHHHhhccCCchhhccchhhcccCccccccccccccCChhHhcceeEEEEeeeccccCCHHHhH
Confidence                     3 23458999999998741                             269999999999999987 7899


Q ss_pred             hhhhhhHHHHHHHhcCCCCCChHHH
Q psy14880        439 GRLWPRTAAAAERLWSNPKSSSSEA  463 (497)
Q Consensus       439 ~~l~PRl~A~AEr~W~~~~~~~~~f  463 (497)
                      +++|||++|+||++|++.. +|.++
T Consensus       401 ~~~~PR~~A~AE~~Ws~~~-~~~~~  424 (445)
T cd06569         401 YMVFPRLLALAERAWHKAP-WEADY  424 (445)
T ss_pred             HHhhhHHHHHHHHHhcCCc-ccccc
Confidence            9999999999999999854 34444


No 6  
>cd06568 GH20_SpHex_like A subgroup of  the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the N-acetylhexosaminidase from Streptomyces plicatus (SpHex).  SpHex catalyzes the hydrolysis of N-acetyl-beta-hexosaminides. An Asp residue within the active site plays a critical role in substrate-assisted catalysis by orienting the 2-acetamido group and stabilizing the transition state. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Proteins belonging to this subgroup lack the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases.
Probab=100.00  E-value=3.7e-76  Score=604.64  Aligned_cols=313  Identities=22%  Similarity=0.312  Sum_probs=259.6

Q ss_pred             CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880         27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ  106 (497)
Q Consensus        27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~  106 (497)
                      |++||+|||+||||+|+++||++||.||++|             ||+|||||+||||||+|++.||...           
T Consensus         1 f~~RG~mlD~aR~f~~~~~lk~~id~ma~~K-------------lN~lhlHLtD~~~~rle~~~~P~lt-----------   56 (329)
T cd06568           1 FAYRGLMLDVARHFFTVAEVKRYIDLLALYK-------------LNVLHLHLTDDQGWRIEIKSWPKLT-----------   56 (329)
T ss_pred             CCccceeeeccCCCcCHHHHHHHHHHHHHhC-------------CcEEEEEeecCCcceeeecCccccc-----------
Confidence            7899999999999999999999999999999             6999999999999999999987431           


Q ss_pred             HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCC
Q psy14880        107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPR  186 (497)
Q Consensus       107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~  186 (497)
                         +.++.+                       .+..+++++||++||+|||+||++|||+||||||+|||+.++++++|+
T Consensus        57 ---~~ga~~-----------------------~~~~~~~~~YT~~di~elv~yA~~rgI~vIPEiD~PGH~~a~~~~~p~  110 (329)
T cd06568          57 ---EIGGST-----------------------EVGGGPGGYYTQEDYKDIVAYAAERHITVVPEIDMPGHTNAALAAYPE  110 (329)
T ss_pred             ---cccccc-----------------------ccCCCCCCcCCHHHHHHHHHHHHHcCCEEEEecCCcHHHHHHHHhChh
Confidence               000000                       011245788999999999999999999999999999999999988776


Q ss_pred             CCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCC
Q psy14880        187 YGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQV  266 (497)
Q Consensus       187 ~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~  266 (497)
                      +.      |.+...-...+.++.+++|||++|+|++|+++|++|++++||  +++|||||||+                 
T Consensus       111 l~------~~~~~~~~~~~~~~~~~~l~~~~~~t~~fl~~v~~E~~~~f~--~~~iHiGgDE~-----------------  165 (329)
T cd06568         111 LN------CDGKAKPLYTGIEVGFSSLDVDKPTTYEFVDDVFRELAALTP--GPYIHIGGDEA-----------------  165 (329)
T ss_pred             hc------cCCCCCccccccCCCCcccCCCCHHHHHHHHHHHHHHHHhCC--CCeEEEecccC-----------------
Confidence            54      322111001123445678999999999999999999999999  99999999993                 


Q ss_pred             CCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCC
Q psy14880        267 PCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLT  346 (497)
Q Consensus       267 ~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~  346 (497)
                                         ...                       ..+++.+|++++.++|++    +||++++|+|++.
T Consensus       166 -------------------~~~-----------------------~~~~~~~f~~~~~~~v~~----~Gk~~~~W~d~~~  199 (329)
T cd06568         166 -------------------HST-----------------------PHDDYAYFVNRVRAIVAK----YGKTPVGWQEIAR  199 (329)
T ss_pred             -------------------CCC-----------------------chHHHHHHHHHHHHHHHH----CCCeEEEECcccc
Confidence                               211                       124567899999999984    8999999999864


Q ss_pred             CchhhccCCCCCcEEEEecCCCC-chhHHHHHHcCCeEEEeCCCccccccccc-----C--CCCCCChhhhhccCCCC--
Q psy14880        347 DPEIILNYLDPKRYIIQTWVPRI-DPLADLLISKGYQVIISTKDAWYLDHGFW-----G--VTSYYRWQRVYDNLLPS--  416 (497)
Q Consensus       347 ~~~~~~~~l~~~~~iv~~W~~~~-~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~-----~--~~~~~~~~~iY~~~p~~--  416 (497)
                      .      .+ +++++|++|++.. ...+.+++++||++|+||++++|||+++.     +  ..++.+|+++|+++|..  
T Consensus       200 ~------~l-~~~~iv~~W~~~~~~~~~~~~~~~G~~vI~s~~~~~YlD~~~~~~~~~~~~~~~~~~~~~~y~~~P~~~~  272 (329)
T cd06568         200 A------DL-PAGTVAQYWSDRAPDADAAAALDKGAKVILSPADKAYLDMKYDADSPLGLTWAGPVEVREAYDWDPAAYG  272 (329)
T ss_pred             c------CC-CCCeEEEECCCCCCchHHHHHHHCCCCEEEeCCCcEEEecCCCCCCCCCcccCCCCCHHHHeeeCCCCCC
Confidence            3      35 5799999999863 24678999999999999999999998641     0  13567999999999976  


Q ss_pred             ----CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHH
Q psy14880        417 ----SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRF  467 (497)
Q Consensus       417 ----~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl  467 (497)
                          .++|+|+|+|||+|++++ +++++++|||++|+|||+|++++ ++|++|..||
T Consensus       273 ~~~~~~~ilG~~~~lW~E~~~~~~~~~~~~~PR~~a~AE~~Ws~~~~~~~~~f~~rl  329 (329)
T cd06568         273 PGVPDEAILGVEAPLWTETIRNLDDLEYMAFPRLAGVAEIGWSPQEARDWDDYKVRL  329 (329)
T ss_pred             CcchhhCEEEEEEeecccccCCHHHHHHHHHhHHHHHHHHHhCCCcCCCHHHHHhhC
Confidence                279999999999999987 68999999999999999999976 7999999886


No 7  
>PF00728 Glyco_hydro_20:  Glycosyl hydrolase family 20, catalytic domain;  InterPro: IPR015883 Glycoside hydrolase family 20 GH20 from CAZY comprises enzymes with several known activities; beta-hexosaminidase (3.2.1.52 from EC); lacto-N-biosidase (3.2.1.140 from EC). Carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base in this family of enzymes. In the brain and other tissues, beta-hexosaminidase A degrades GM2 gangliosides; specifically, the enzyme hydrolyses terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides. There are 3 forms of beta-hexosaminidase: hexosaminidase A is a trimer, with one alpha, one beta-A and one beta-B chain; hexosaminidase B is a tetramer of two beta-A and two beta-B chains; and hexosaminidase S is a homodimer of alpha chains. The two beta chains are derived from the cleavage of a precursor. Mutations in the beta-chain lead to Sandhoff disease, a lysosomal storage disorder characterised by accumulation of GM2 ganglioside [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3RPM_A 1C7T_A 1QBA_A 1QBB_A 1C7S_A 3RCN_A 2YL8_A 2YL6_A 2YLL_A 2YL5_C ....
Probab=100.00  E-value=1.8e-71  Score=574.04  Aligned_cols=323  Identities=35%  Similarity=0.573  Sum_probs=261.7

Q ss_pred             CCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHH
Q psy14880         27 GHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ  106 (497)
Q Consensus        27 ~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~  106 (497)
                      |++||+|||+||||+|+++||++||.||.+|             ||+|||||+|+||||++++++|..            
T Consensus         1 f~~RG~~lD~aR~~~~~~~ik~~id~ma~~k-------------~N~lhlhl~D~~~~~~~~~~~p~l------------   55 (351)
T PF00728_consen    1 FPWRGLMLDVARHFFSVDTIKRLIDQMAYYK-------------LNVLHLHLSDDQGFRLESKSYPEL------------   55 (351)
T ss_dssp             SSEEEEEEETSSS-B-HHHHHHHHHHHHHTT--------------SEEEEEEESSTCB-BEBSTSTHH------------
T ss_pred             CCccceeeccCCCCCCHHHHHHHHHHHHHcC-------------CcEEEEEEecCCCCccccCCCccc------------
Confidence            7899999999999999999999999999999             699999999999999999988731            


Q ss_pred             HHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCC---CCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhh
Q psy14880        107 ELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSP---AKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQW  183 (497)
Q Consensus       107 ~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~---~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~  183 (497)
                        .+                            .|+++.   +++||++||++||+||++|||+||||||+|||+.+++++
T Consensus        56 --~~----------------------------~ga~~~~~~~~~yT~~di~~lv~yA~~~gI~VIPeid~PGH~~~~l~~  105 (351)
T PF00728_consen   56 --TE----------------------------KGAYRPSDAGGYYTKEDIRELVAYAKERGIEVIPEIDTPGHAEAWLKA  105 (351)
T ss_dssp             --HH----------------------------TTTESTTCTESEBEHHHHHHHHHHHHHTT-EEEEEEEESSS-HHHHHH
T ss_pred             --cc----------------------------cCccccccccccCCHHHHHHHHHHHHHcCCceeeeccCchHHHHHHHh
Confidence              11                            122222   348999999999999999999999999999999999999


Q ss_pred             cCCCCCCCcccccCCCCC-CccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcc
Q psy14880        184 GPRYGLGDLIVCLNDPSW-RANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCT  262 (497)
Q Consensus       184 ~p~~~~~~l~~~~~~~~~-~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~  262 (497)
                      +|++....   |+....+ ...+..+.+++|||++|+|++|+++|++|++++|+  +++|||||||              
T Consensus       106 ~p~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~~~~t~~~~~~l~~e~~~~f~--~~~iHiGgDE--------------  166 (351)
T PF00728_consen  106 YPELGCSA---WPEDKSWPNSTCWYPDNGVLDPSNPETYEFLKDLLDEVADLFP--SKYIHIGGDE--------------  166 (351)
T ss_dssp             HHHHCCCH---TTCSSSCEEEETTSEEEEEE-TTSHHHHHHHHHHHHHHHHHHT--SSEEEEE-TS--------------
T ss_pred             Cchhhccc---cccccccccccccCCCcccCCCCcHHHHHHHHHHHHHHHhhCC--CCeEEeCCcc--------------
Confidence            88765321   1111111 11122234468999999999999999999999999  9999999999              


Q ss_pred             cCCCCCcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEec
Q psy14880        263 RQQVPCVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWS  342 (497)
Q Consensus       263 ~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~  342 (497)
                                            |..+||+++|+|+++|+++|+.    +..+|+.+|++++.+++++    +|+++++|+
T Consensus       167 ----------------------v~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~f~~~~~~~v~~----~g~~~~~W~  216 (351)
T PF00728_consen  167 ----------------------VNYNCWNNSPECQAWMKQNGLT----DPNDLFQYFVNRLADIVKK----HGKKPIIWN  216 (351)
T ss_dssp             ----------------------TTTHHHHCHHHHHHHHHHTTTT----CHHHHHHHHHHHHHHHHHH----TTSEEEEES
T ss_pred             ----------------------cccccccCCHHHhhHHhhcCCc----hHHHHHHHHHHHHHHHHHh----cCCcEEEEc
Confidence                                  6778999999999999999974    5789999999999999985    899999999


Q ss_pred             CCCCCchhhccCCCCCcEEEEecCCCC-chhHHHHHHcCCeEEEeCCCcccccccc--cC------CCCCCChhhhhccC
Q psy14880        343 SHLTDPEIILNYLDPKRYIIQTWVPRI-DPLADLLISKGYQVIISTKDAWYLDHGF--WG------VTSYYRWQRVYDNL  413 (497)
Q Consensus       343 d~~~~~~~~~~~l~~~~~iv~~W~~~~-~~~~~~~~~~Gy~vI~s~~~~~Yld~~~--~~------~~~~~~~~~iY~~~  413 (497)
                      |++...+.. ..+ +++++|++|++.. ...+..++++||++|+++.+++|+|+++  |.      ..++.+++++|+++
T Consensus       217 D~~~~~~~~-~~~-~~~~~i~~W~~~~~~~~~~~~~~~g~~vI~s~~~~~Yld~~~~~~~~~~~~~~~~~~~~~~~~~~~  294 (351)
T PF00728_consen  217 DMLDDFPDA-SLL-PKDVIIQVWNYDWGPASAQEFAKKGYKVINSPSDYLYLDPGQSPWPEEDGNYWAPFNTWSDFYNWD  294 (351)
T ss_dssp             TTTTTTCCG-HCS-CTTEEEEEESSTTHHHHHHHHHHTTHEEEESTHGTGBTTTTSSTTTSSSSBHTTSTHHHHHHHHHH
T ss_pred             cccccCCcc-ccc-cCCceEEEeeccccchhhhHHHHhcCcEEEcCCCceEEccCcCCCCccCccccccccccccccccc
Confidence            999876421 124 6799999999965 4568899999999999999999999876  21      11244677888877


Q ss_pred             CCC-------------CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCC
Q psy14880        414 LPS-------------SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSN  455 (497)
Q Consensus       414 p~~-------------~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~  455 (497)
                      |..             +++|+|+++++|+|.+.+ +.+++++|||++|+|||+|++
T Consensus       295 ~~~~~~~~~~~~~~~~~~~i~G~~~~~W~E~~~~~~~l~~~~~Pr~~A~AE~~Ws~  350 (351)
T PF00728_consen  295 PAYFNTNSTQIDPAQKSERILGGEAALWSENIRDEEDLDYRLWPRLAALAERLWSP  350 (351)
T ss_dssp             HCHCTTTCSTTTTHHHCCCEEEEEEEEETTTTTSHHHHHHHHTTHHHHHHHHHHSS
T ss_pred             ccccccccchhhhhcccCCCceeEEEeecCCCCCHHHHHHHHHHHHHHHHHHHhCC
Confidence            542             148999999999999965 899999999999999999986


No 8  
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides.  These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase.  The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=100.00  E-value=8.1e-69  Score=545.50  Aligned_cols=293  Identities=27%  Similarity=0.440  Sum_probs=245.0

Q ss_pred             CceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHH
Q psy14880         29 NRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQEL  108 (497)
Q Consensus        29 ~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~  108 (497)
                      +||+||||||||+|+++||++||.||++|             ||+|||||+||||||+|++.||..              
T Consensus         1 ~RG~mlD~aR~~~~~~~lk~~id~ma~~K-------------~N~lhlHl~D~~~~~le~~~~p~l--------------   53 (303)
T cd02742           1 IRGIMLDVSRHFLSVESIKRTIDVLARYK-------------INTFHWHLTDDQAWRIESKKFPEL--------------   53 (303)
T ss_pred             CcceeeeccccCcCHHHHHHHHHHHHHhC-------------CcEEEEeeecCCCceEeeCccchh--------------
Confidence            69999999999999999999999999999             699999999999999999988743              


Q ss_pred             HHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCC
Q psy14880        109 VHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYG  188 (497)
Q Consensus       109 ~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~  188 (497)
                      .+    .|              .      ...+++++++||++||+|||+||++|||+||||||+|||+.++++++|++.
T Consensus        54 ~~----~g--------------~------~~~~~~~~~~yT~~di~elv~yA~~rgI~viPEiD~PGH~~a~~~~~p~l~  109 (303)
T cd02742          54 AE----KG--------------G------QINPRSPGGFYTYAQLKDIIEYAAARGIEVIPEIDMPGHSTAFVKSFPKLL  109 (303)
T ss_pred             hh----hc--------------c------cccCCCCCCeECHHHHHHHHHHHHHcCCEEEEeccchHHHHHHHHhCHHhc
Confidence            10    00              0      012345678899999999999999999999999999999999999888754


Q ss_pred             CCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCCCC
Q psy14880        189 LGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQVPC  268 (497)
Q Consensus       189 ~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~~  268 (497)
                      .+    |.....+     ....+.|||++|+|++|+++|++|++++|+  +++|||||||+                   
T Consensus       110 ~~----~~~~~~~-----~~~~~~l~~~~~~t~~fl~~l~~e~~~lf~--~~~iHiGgDE~-------------------  159 (303)
T cd02742         110 TE----CYAGLKL-----RDVFDPLDPTLPKGYDFLDDLFGEIAELFP--DRYLHIGGDEA-------------------  159 (303)
T ss_pred             cC----ccccCCC-----CCCCCccCCCCccHHHHHHHHHHHHHHhCC--CCeEEecceec-------------------
Confidence            22    2221111     113468999999999999999999999999  99999999993                   


Q ss_pred             cCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCCCc
Q psy14880        269 VQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLTDP  348 (497)
Q Consensus       269 ~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~~  348 (497)
                                       ...                     .+..+++.+|++++.++|++    +|+++++|+|++...
T Consensus       160 -----------------~~~---------------------~~~~~l~~~f~~~~~~~v~~----~g~~~~~W~d~~~~~  197 (303)
T cd02742         160 -----------------HFK---------------------QDRKHLMSQFIQRVLDIVKK----KGKKVIVWQDGFDKK  197 (303)
T ss_pred             -----------------CCC---------------------CCHHHHHHHHHHHHHHHHHH----cCCeEEEecccccCC
Confidence                             211                     12457889999999999984    899999999998653


Q ss_pred             hhhccCCCCCcEEEEecCCCC---chhHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccCCCC------CCC
Q psy14880        349 EIILNYLDPKRYIIQTWVPRI---DPLADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNLLPS------SPL  419 (497)
Q Consensus       349 ~~~~~~l~~~~~iv~~W~~~~---~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~p~~------~~~  419 (497)
                      .    .+ +++++|++|++..   ...+.+++++||+||+|++.++|+   .  ..++.+++++|+++|..      .++
T Consensus       198 ~----~l-~~~~ii~~W~~~~~~~~~~~~~~~~~G~~vi~s~~~yly~---~--~~~~~~~~~~y~~~p~~~~~~~~~~~  267 (303)
T cd02742         198 M----KL-KEDVIVQYWDYDGDKYNVELPEAAAKGFPVILSNGYYLDI---F--IDGALDARKVYKNDPLAVPTPQQKDL  267 (303)
T ss_pred             C----CC-CCCeEEEEccCCCCcchHHHHHHHHCCCCEEEeCCceeee---e--CCCCCCHHHHhCCCCCCCCCcccccc
Confidence            2    35 5799999999974   246788999999999999877777   1  24568999999999875      379


Q ss_pred             ccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcC
Q psy14880        420 VLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWS  454 (497)
Q Consensus       420 ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~  454 (497)
                      |+|+++|||+|.+++ +.+++++|||++|+||++|+
T Consensus       268 vlG~~~~lW~E~~~~~~~~~~~~~pr~~a~AE~~Ws  303 (303)
T cd02742         268 VLGVIACLWGETVKDTKTLQYRFWPRALAVAERSWS  303 (303)
T ss_pred             eEEEEEeeeccccCCHHHHHHHHHHHHHHHHHHhhC
Confidence            999999999999998 59999999999999999996


No 9  
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway.  The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00  E-value=1.5e-66  Score=534.01  Aligned_cols=310  Identities=22%  Similarity=0.323  Sum_probs=248.9

Q ss_pred             CCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHH
Q psy14880         28 HNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQE  107 (497)
Q Consensus        28 ~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~  107 (497)
                      ++||+|||+||||+|+++||++||.||++|             ||+|||||+|  +||++++.++..             
T Consensus         1 ~~RG~mLD~aR~~~~~~~ik~~id~ma~~K-------------~N~lhlHltD--~~~~~~~~~p~l-------------   52 (326)
T cd06564           1 EVRGFMLDVGRKYYSMDFLKDIIKTMSWYK-------------MNDLQLHLND--NLIFNLDDMSTT-------------   52 (326)
T ss_pred             CcceeEEEccCCCCCHHHHHHHHHHHHHcC-------------CceEEEeecC--CcccccCCCchh-------------
Confidence            479999999999999999999999999999             6999999999  788888877743             


Q ss_pred             HHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCC
Q psy14880        108 LVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRY  187 (497)
Q Consensus       108 ~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~  187 (497)
                       .+.++.++-++.    .|.         ...++.+.+++||++||+|||+||++|||+||||||+|||+.++++++|++
T Consensus        53 -~~~~~~~~~~~~----~~~---------~~~~~~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~a~~~~~pel  118 (326)
T cd06564          53 -VNNATYASDDVK----SGN---------NYYNLTANDGYYTKEEFKELIAYAKDRGVNIIPEIDSPGHSLAFTKAMPEL  118 (326)
T ss_pred             -hhhhhhhccccc----ccc---------ccCCCCCCCCcccHHHHHHHHHHHHHcCCeEeccCCCcHHHHHHHHhhHHh
Confidence             112222221111    111         113445678999999999999999999999999999999999999988876


Q ss_pred             CCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCCC
Q psy14880        188 GLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQVP  267 (497)
Q Consensus       188 ~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~  267 (497)
                      +.+..      .      ..+..++|||++|+|++|+++|++|++++|+..+++|||||||+                  
T Consensus       119 ~~~~~------~------~~~~~~~l~~~~~~t~~f~~~l~~E~~~~f~~~~~~~HiGgDE~------------------  168 (326)
T cd06564         119 GLKNP------F------SKYDKDTLDISNPEAVKFVKALFDEYLDGFNPKSDTVHIGADEY------------------  168 (326)
T ss_pred             cCCCc------c------cCCCcccccCCCHHHHHHHHHHHHHHHHhcCCCCCEEEeccccc------------------
Confidence            53211      1      11244789999999999999999999999984469999999993                  


Q ss_pred             CcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCCC
Q psy14880        268 CVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLTD  347 (497)
Q Consensus       268 ~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~  347 (497)
                                        ...                     .+..+++..|++++.++|++    +||++++|+|++..
T Consensus       169 ------------------~~~---------------------~~~~~~~~~f~~~~~~~v~~----~gk~~~~W~d~~~~  205 (326)
T cd06564         169 ------------------AGD---------------------AGYAEAFRAYVNDLAKYVKD----KGKTPRVWGDGIYY  205 (326)
T ss_pred             ------------------ccc---------------------CccHHHHHHHHHHHHHHHHH----cCCeEEEeCCcccC
Confidence                              211                     02356788999999999985    79999999999875


Q ss_pred             chhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccCCC------------
Q psy14880        348 PEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNLLP------------  415 (497)
Q Consensus       348 ~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~p~------------  415 (497)
                      .+. ...+ +++++|++|++.. ..+.+++++||+||||++.++|+|+++.+..++.+++++|++++.            
T Consensus       206 ~~~-~~~l-~~~~iv~~W~~~~-~~~~~~~~~G~~vI~s~~~~~Y~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~  282 (326)
T cd06564         206 KGD-TTVL-SKDVIINYWSYGW-ADPKELLNKGYKIINTNDGYLYIVPGAGYYGDYLNTEDIYNNWTPNKFGGTNATLPE  282 (326)
T ss_pred             CCC-cccC-CCCeEEEeCCCcc-cCHHHHHHCCCcEEEeCCCcEEEeCCCccCCCccCHHHHHhcCCccccCCCCccCCC
Confidence            431 1246 5799999999875 457899999999999999999999875211355779999986542            


Q ss_pred             CCCCccceeeeeeccccC---C-CcchhhhhhhHHHHHHHhcCC
Q psy14880        416 SSPLVLGGEVAMWTEYVD---D-QSLDGRLWPRTAAAAERLWSN  455 (497)
Q Consensus       416 ~~~~ilG~e~~lWsE~~~---~-~~l~~~l~PRl~A~AEr~W~~  455 (497)
                      ..++|+|+++|||+|.++   + .++++++|||++|+||++|++
T Consensus       283 ~~~~ilG~~~~lW~E~~~~~~t~~~~~~~~~pr~~a~Ae~~W~~  326 (326)
T cd06564         283 GDPQILGGMFAIWNDDSDAGISEVDIYDRIFPALPAFAEKTWGG  326 (326)
T ss_pred             CCCCcchhheeeecCCCCcCcCHHHHHHHHHHHHHHHHHHhcCC
Confidence            137999999999999997   3 789999999999999999985


No 10 
>COG3525 Chb N-acetyl-beta-hexosaminidase [Carbohydrate transport and metabolism]
Probab=100.00  E-value=9.6e-65  Score=536.47  Aligned_cols=377  Identities=22%  Similarity=0.268  Sum_probs=301.1

Q ss_pred             CccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCC
Q psy14880         16 NKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYS   95 (497)
Q Consensus        16 ~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~   95 (497)
                      .+...|.|.|||.|||+|+||||||+|++.+|++||.||.+|             ||.|||||+||||||+||+++|-. 
T Consensus       250 ~p~~~I~DaPRf~~rGllvDvaRqf~s~~~vk~~Id~laa~K-------------ln~~hlHLtddegwrleIk~~PkL-  315 (732)
T COG3525         250 FPAVTIVDAPRFAWRGLLVDVARQFHSTDDVKRLIDQLAAHK-------------LNVLHLHLTDDEGWRLEIKRYPKL-  315 (732)
T ss_pred             eeeeecccCcccchhhhhHhhhhhcCCHHHHHHHHHHHHHhh-------------cceEEEeeccCcceeeccccCCcc-
Confidence            566668899999999999999999999999999999999999             699999999999999999999843 


Q ss_pred             CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcc
Q psy14880         96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPA  175 (497)
Q Consensus        96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PG  175 (497)
                                   +..++.||   .-+.+.|.+.         +|+=..+||||+++++||++||++|+|+||||||+||
T Consensus       316 -------------T~iga~R~---~de~~~Pq~g---------~~pe~~ggfytqd~~relv~yAsar~ItviPeiD~Pg  370 (732)
T COG3525         316 -------------TTIGAWRI---PDEPDLPQLG---------YGPERMGGFYTQDDIRELVAYASARQITVIPEIDMPG  370 (732)
T ss_pred             -------------cccccccc---CCCcCCcccc---------cCcccccCcccHHHHHHHHHHHhhcCceecCCcCCcc
Confidence                         11222232   1133334331         1111346789999999999999999999999999999


Q ss_pred             cchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhh
Q psy14880        176 HAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMAL  255 (497)
Q Consensus       176 H~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~  255 (497)
                      |+.+++.++|++.+..- .|..... -.+   ..+..|||+-+-+++|+++|++|+.++||  +.+|||||||       
T Consensus       371 Ha~aav~A~p~~~l~~a-~~ds~~s-v~~---~~n~~ln~tl~~t~~fldkv~dEv~~lfP--s~~iHiGgDE-------  436 (732)
T COG3525         371 HARAAVVAYPDLNLGRA-DPDSYDS-VQA---YLNPVLNPTLDPTYQFLDKVLDEVADLFP--STTIHIGGDE-------  436 (732)
T ss_pred             hhhhhhhhCcccccccc-CCCcchh-hhh---cccccccccccHHHHHHHHHHHHHHHhCC--cceEEeccch-------
Confidence            99999999885432111 1111100 001   12237999999999999999999999999  9999999999       


Q ss_pred             hhhhhcccCCCCCcCCCCCCCCCCCCcceeecccccC-CHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCC
Q psy14880        256 CRWQMCTRQQVPCVQPISGVDHLSPHLWTVFIKCWNN-TPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHN  334 (497)
Q Consensus       256 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~w~~-~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~  334 (497)
                                                   +..+.|+. +|.|++.|++.|++    +...|+.+|++++.+.+++    +
T Consensus       437 -----------------------------~~~~qwk~~sp~~q~l~~~~G~~----d~~~lq~~fi~q~~k~l~~----~  479 (732)
T COG3525         437 -----------------------------FIDGQWKASSPLVQALMEKLGNK----DTFELQSYFITQVGKTLAS----K  479 (732)
T ss_pred             -----------------------------hccCeeeccCHHHHHHHHHhccC----CcchhhHHHHHHHHHHHHh----c
Confidence                                         55678988 99999999999985    4578999999999999984    8


Q ss_pred             CceEEEecCCCCCchhhccCCCCCcEEEEecCCCCchhHHHHHHcCCeEEEeCCCcccccccc---cCC-----CCCCCh
Q psy14880        335 RTKLIVWSSHLTDPEIILNYLDPKRYIIQTWVPRIDPLADLLISKGYQVIISTKDAWYLDHGF---WGV-----TSYYRW  406 (497)
Q Consensus       335 g~~~~~W~d~~~~~~~~~~~l~~~~~iv~~W~~~~~~~~~~~~~~Gy~vI~s~~~~~Yld~~~---~~~-----~~~~~~  406 (497)
                      |++.++|+|++..... .+....+++.|++|.+..  ..-+++++||+||++|..++|||+.+   |..     ......
T Consensus       480 Gr~~igW~e~~~~~~~-~~~~~t~~~~vm~W~~~~--~ai~~akqg~dvv~tp~~~~ylD~~q~~~peepg~~~a~t~~l  556 (732)
T COG3525         480 GRRLIGWDEGAHGGDV-NGTALTANVTVMSWYGKD--KAIELAKQGYDVVLTPAQFVYLDMLQIAAPEEPGYSWATTTPL  556 (732)
T ss_pred             CceEEeecchhccCCC-ccccccCceEEEEEecch--hhHHHHhhcccccccchhhhhhhhhcccccccCCCcccccccc
Confidence            9999999999876432 111112699999999874  46789999999999999999999976   210     111233


Q ss_pred             hh-hhccCCCC--------CCCccceeeeeeccccCC-CcchhhhhhhHHHHHHHhcCCCC-CChHHHHHHHHHHHHHHH
Q psy14880        407 QR-VYDNLLPS--------SPLVLGGEVAMWTEYVDD-QSLDGRLWPRTAAAAERLWSNPK-SSSSEAETRFLEQRERLV  475 (497)
Q Consensus       407 ~~-iY~~~p~~--------~~~ilG~e~~lWsE~~~~-~~l~~~l~PRl~A~AEr~W~~~~-~~~~~f~~Rl~~~~~~l~  475 (497)
                      .+ .|..++..        .+.++|.++++|+|++.+ ..+.+++|||++|+|||+|+... +||..|..|+..+..+++
T Consensus       557 ~r~~y~~~~~g~~~~~de~~k~~~G~q~~lWse~~~~~~~f~~~vfprl~a~aEraw~p~a~~Dw~~~~~r~~~~~~l~~  636 (732)
T COG3525         557 ERNKYAYDFAGKQPINDELAKRILGVQAALWSEHIQTRGRFEYMVFPRLAAAAERAWTPMAFNDWLYYLDRLSAQLPLLV  636 (732)
T ss_pred             chhhhhhcccCcccCChHHhhhhhhhHHHHHHHHhhhhhHHHHHhcchHHHHHHhhCCchhhcchhhhhhhcchhcchhh
Confidence            44 67666543        379999999999999998 88999999999999999999865 799999999999999999


Q ss_pred             HCCCCcccCC
Q psy14880        476 EMGIRAEVTT  485 (497)
Q Consensus       476 ~~Gi~~~p~~  485 (497)
                      ..+|.+.+.+
T Consensus       637 ~~~~~~~~~w  646 (732)
T COG3525         637 LISIPYNAWW  646 (732)
T ss_pred             hhcccccccc
Confidence            8888776543


No 11 
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA  is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers).  In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury.  GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain.  The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=100.00  E-value=1.4e-46  Score=381.90  Aligned_cols=287  Identities=17%  Similarity=0.174  Sum_probs=210.0

Q ss_pred             CceeeeccCC-CCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHH
Q psy14880         29 NRTKLIVWSS-HLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQE  107 (497)
Q Consensus        29 ~RG~mlDvaR-~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~  107 (497)
                      .||+|||+|| +|+++++||++||.||.+|             +|.||||++|.  |++  +.++               
T Consensus         1 ~RG~mlD~~R~~~~~~~~lk~~id~ma~~k-------------~N~l~lhl~D~--f~~--~~~p---------------   48 (301)
T cd06565           1 FRGVHLDLKRNAVPKVSYLKKLLRLLALLG-------------ANGLLLYYEDT--FPY--EGEP---------------   48 (301)
T ss_pred             CceEEEecCCCCCCCHHHHHHHHHHHHHcC-------------CCEEEEEEecc--eec--CCCc---------------
Confidence            5999999999 9999999999999999999             69999999994  322  1111               


Q ss_pred             HHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCC
Q psy14880        108 LVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRY  187 (497)
Q Consensus       108 ~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~  187 (497)
                                    ++                +  ..+++||++||+||++||++|||+||||||+|||+.+++++ |+ 
T Consensus        49 --------------~~----------------~--~~~~~yT~~ei~ei~~yA~~~gI~vIPeid~pGH~~~~l~~-~~-   94 (301)
T cd06565          49 --------------EV----------------G--RMRGAYTKEEIREIDDYAAELGIEVIPLIQTLGHLEFILKH-PE-   94 (301)
T ss_pred             --------------cc----------------c--cCCCCcCHHHHHHHHHHHHHcCCEEEecCCCHHHHHHHHhC-cc-
Confidence                          11                0  11456999999999999999999999999999999998864 33 


Q ss_pred             CCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCccchhhhhhhhhhcccCCCC
Q psy14880        188 GLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADELVTLMALCRWQMCTRQQVP  267 (497)
Q Consensus       188 ~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE~~~~~~~~~w~~~~~~~~~  267 (497)
                       +..+.  ..         +.++++|||++|+|++|+++|++|++++|+  +++|||||||+++++.             
T Consensus        95 -~~~l~--~~---------~~~~~~l~~~~~~t~~fi~~li~ev~~~f~--s~~~HIG~DE~~~~g~-------------  147 (301)
T cd06565          95 -FRHLR--EV---------DDPPQTLCPGEPKTYDFIEEMIRQVLELHP--SKYIHIGMDEAYDLGR-------------  147 (301)
T ss_pred             -ccccc--cc---------CCCCCccCCCChhHHHHHHHHHHHHHHhCC--CCeEEECCCcccccCC-------------
Confidence             33332  11         113478999999999999999999999999  9999999999654321             


Q ss_pred             CcCCCCCCCCCCCCcceeecccccCCHHHHHHHHHcCCCCCcccHHHHHHHHHHHHHHHHHHHhhCCCceEEEecCCCCC
Q psy14880        268 CVQPISGVDHLSPHLWTVFIKCWNNTPAIVDFMLAHGKNTTFDDYIQLWAHFQNKAAASLDEAVGHNRTKLIVWSSHLTD  347 (497)
Q Consensus       268 ~~~~~~~~~~~~~~~~~v~~~~w~~~p~~~~~~~~~g~~~~~~~~~~l~~~F~~~~~~~l~~~~~~~g~~~~~W~d~~~~  347 (497)
                                               .+    ++++.+.    .+..+|+..|++++.+++++    +|+++++|+|++..
T Consensus       148 -------------------------~~----~~~~~~~----~~~~~l~~~~~~~v~~~v~~----~g~~~~~W~D~~~~  190 (301)
T cd06565         148 -------------------------GR----SLRKHGN----LGRGELYLEHLKKVLKIIKK----RGPKPMMWDDMLRK  190 (301)
T ss_pred             -------------------------CH----HHHHhcC----CCHHHHHHHHHHHHHHHHHH----cCCEEEEEhHHhcC
Confidence                                     11    2222232    24568999999999999984    89999999999876


Q ss_pred             ---chhhccCCCCCcEEEEecCCCCch-----hHHHHHHcCCeEEEeCCCcccccccccCCCCCCChhhhhccC----C-
Q psy14880        348 ---PEIILNYLDPKRYIIQTWVPRIDP-----LADLLISKGYQVIISTKDAWYLDHGFWGVTSYYRWQRVYDNL----L-  414 (497)
Q Consensus       348 ---~~~~~~~l~~~~~iv~~W~~~~~~-----~~~~~~~~Gy~vI~s~~~~~Yld~~~~~~~~~~~~~~iY~~~----p-  414 (497)
                         .+.....+ |+++++++|.+....     ......+.|..++++++...+--.       +...+.-+.+-    . 
T Consensus       191 ~~~~~~~~~~l-~~~v~~~~W~y~~~~~~~~~~~~~~~~~~~~~~~~~g~~~w~~~-------~~~~~~~~~n~~~~~~~  262 (301)
T cd06565         191 LSIEPEALSGL-PKLVTPVVWDYYADLDEHDRPIGLWKKYGSVFAVAWGASAWKGA-------TPPNDKHLENIKSWLKA  262 (301)
T ss_pred             CCCChHHHhCC-CCCeEEEEecCcCCcchhhHhHHHHHHhCCCceEeeeechhccC-------CCCHHHHHHHHHHHHHH
Confidence               33333457 589999999987632     344556677777777653222110       11111111110    0 


Q ss_pred             CCCCCccceeeeeeccccCCCcchhhhhhhHHHHHHHhcC
Q psy14880        415 PSSPLVLGGEVAMWTEYVDDQSLDGRLWPRTAAAAERLWS  454 (497)
Q Consensus       415 ~~~~~ilG~e~~lWsE~~~~~~l~~~l~PRl~A~AEr~W~  454 (497)
                      .....+.|.....|+|.... .....++|.+++.|+..|+
T Consensus       263 ~~~~~~~G~~~T~W~d~g~~-~~~~~~~p~~~~~~~~~~~  301 (301)
T cd06565         263 AKKNGVQGILLTGWGDYGHE-AVLCELLPGLIPSLALALG  301 (301)
T ss_pred             HHHCCCCEEEEEecCCCCCc-ccHHHHHHHHHHHHHHhcC
Confidence            11467899999999987654 3346899999999999985


No 12 
>KOG2499|consensus
Probab=97.50  E-value=9.5e-05  Score=78.26  Aligned_cols=45  Identities=56%  Similarity=0.926  Sum_probs=41.9

Q ss_pred             ccccCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCC
Q psy14880         87 LLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGN  131 (497)
Q Consensus        87 e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~  131 (497)
                      |+.++|+++..++||..++.++++||++|||||.+|+|.|+|+++
T Consensus       234 eL~~kGaYs~~~vYT~eDv~evV~yarlRGIRVlpEfD~PgHt~s  278 (542)
T KOG2499|consen  234 ELHRKGAYSPRHVYTREDVSEVVEYARLRGIRVLPEFDTPGHTGS  278 (542)
T ss_pred             hhhhcCCCCcceeecHHHHHHHHHHHHhccceeeecccCCccccc
Confidence            455689999999999999999999999999999999999999986


No 13 
>PF02638 DUF187:  Glycosyl hydrolase like GH101;  InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=95.72  E-value=0.19  Score=51.81  Aligned_cols=87  Identities=21%  Similarity=0.233  Sum_probs=54.5

Q ss_pred             HHHHHHHHHHHHHcCCEEEEecC--Ccccch-hhhhhcCCCCCCCcccccCCCCCCccc--cCCCCCcCCCCChhHHHHH
Q psy14880        150 PLEVQELVHYAKLRGIRVIFELD--APAHAG-NGWQWGPRYGLGDLIVCLNDPSWRANC--IQPPCGQLNPINPHVYTVL  224 (497)
Q Consensus       150 ~~ei~eiv~yA~~rgI~VIPEID--~PGH~~-a~l~~~p~~~~~~l~~~~~~~~~~~~~--~~~~~~~L~~~~p~t~~fl  224 (497)
                      .+=+..+|+-|.+|||+|.+=+.  +.++.. .+.+.+|+-    +  +...+.|....  .....--|||++||+.+|+
T Consensus        69 ~DpL~~~I~eaHkrGlevHAW~~~~~~~~~~~~~~~~~p~~----~--~~~~~~~~~~~~~~~~~~~~lnP~~PeVr~~i  142 (311)
T PF02638_consen   69 FDPLEFMIEEAHKRGLEVHAWFRVGFNAPDVSHILKKHPEW----F--AVNHPGWVRTYEDANGGYYWLNPGHPEVRDYI  142 (311)
T ss_pred             ccHHHHHHHHHHHcCCEEEEEEEeecCCCchhhhhhcCchh----h--eecCCCceeecccCCCCceEECCCCHHHHHHH
Confidence            55699999999999999999872  222221 122223321    0  00111111000  0011236999999999999


Q ss_pred             HHHHHHHHhccCCCCCeEEE
Q psy14880        225 KDIYGELMGMAKYGDEMFHM  244 (497)
Q Consensus       225 ~~vl~E~~~lF~~~~~~iHi  244 (497)
                      .+++.|+..-++  -.=||+
T Consensus       143 ~~~v~Eiv~~Yd--vDGIhl  160 (311)
T PF02638_consen  143 IDIVKEIVKNYD--VDGIHL  160 (311)
T ss_pred             HHHHHHHHhcCC--CCeEEe
Confidence            999999999888  777885


No 14 
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B  (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20).  The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits.  Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff.  Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in 
Probab=95.58  E-value=0.0095  Score=62.27  Aligned_cols=42  Identities=60%  Similarity=1.095  Sum_probs=37.2

Q ss_pred             cCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCC
Q psy14880         90 RFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGN  131 (497)
Q Consensus        90 ~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~  131 (497)
                      ..|++++...||.++++++++||+.|||+|+||+|.|+|+..
T Consensus        57 ~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~a   98 (348)
T cd06562          57 KKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPEIDTPGHTGS   98 (348)
T ss_pred             hccCcCCCceECHHHHHHHHHHHHHcCCEEEEeccCchhhHH
Confidence            346666778899999999999999999999999999999753


No 15 
>PF10566 Glyco_hydro_97:  Glycoside hydrolase 97  ;  InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site.  This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=94.78  E-value=0.069  Score=53.95  Aligned_cols=71  Identities=14%  Similarity=0.286  Sum_probs=42.0

Q ss_pred             CCCHHHHHHHHHHHHHcCCEEEEecCCcc---------cchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCC
Q psy14880        147 VYTPLEVQELVHYAKLRGIRVIFELDAPA---------HAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPIN  217 (497)
Q Consensus       147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PG---------H~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~  217 (497)
                      .+...+|+|||+||+++||.|+==...-+         ++..+++.+.+.|..              ++  ..+-++-.+
T Consensus        69 ~~~~~dl~elv~Ya~~KgVgi~lw~~~~~~~~~~~~~~~~~~~f~~~~~~Gv~--------------Gv--KidF~~~d~  132 (273)
T PF10566_consen   69 PIPDFDLPELVDYAKEKGVGIWLWYHSETGGNVANLEKQLDEAFKLYAKWGVK--------------GV--KIDFMDRDD  132 (273)
T ss_dssp             B-TT--HHHHHHHHHHTT-EEEEEEECCHTTBHHHHHCCHHHHHHHHHHCTEE--------------EE--EEE--SSTS
T ss_pred             cCCccCHHHHHHHHHHcCCCEEEEEeCCcchhhHhHHHHHHHHHHHHHHcCCC--------------EE--eeCcCCCCC
Confidence            36668999999999999988765433322         123333332222211              11  124577789


Q ss_pred             hhHHHHHHHHHHHHHh
Q psy14880        218 PHVYTVLKDIYGELMG  233 (497)
Q Consensus       218 p~t~~fl~~vl~E~~~  233 (497)
                      .++.++.++|+++.++
T Consensus       133 Q~~v~~y~~i~~~AA~  148 (273)
T PF10566_consen  133 QEMVNWYEDILEDAAE  148 (273)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            9999999999999987


No 16 
>cd06568 GH20_SpHex_like A subgroup of  the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the N-acetylhexosaminidase from Streptomyces plicatus (SpHex).  SpHex catalyzes the hydrolysis of N-acetyl-beta-hexosaminides. An Asp residue within the active site plays a critical role in substrate-assisted catalysis by orienting the 2-acetamido group and stabilizing the transition state. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Proteins belonging to this subgroup lack the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases.
Probab=94.63  E-value=0.028  Score=58.41  Aligned_cols=35  Identities=34%  Similarity=0.627  Sum_probs=32.3

Q ss_pred             CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      ....||.++++++++||+.|||+|+||+|.|+|..
T Consensus        68 ~~~~YT~~di~elv~yA~~rgI~vIPEiD~PGH~~  102 (329)
T cd06568          68 PGGYYTQEDYKDIVAYAAERHITVVPEIDMPGHTN  102 (329)
T ss_pred             CCCcCCHHHHHHHHHHHHHcCCEEEEecCCcHHHH
Confidence            35689999999999999999999999999999964


No 17 
>cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of  the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin.  Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=94.61  E-value=0.03  Score=57.68  Aligned_cols=34  Identities=44%  Similarity=0.762  Sum_probs=31.7

Q ss_pred             CccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      +..||.++++++++||+.|||+|+||+|.|+|.+
T Consensus        62 ~~~yT~~di~elv~yA~~rgI~vIPEId~PGH~~   95 (311)
T cd06570          62 GLYYTQEQIREVVAYARDRGIRVVPEIDVPGHAS   95 (311)
T ss_pred             CCccCHHHHHHHHHHHHHcCCEEEEeecCccchH
Confidence            4579999999999999999999999999999964


No 18 
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway.  The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=94.44  E-value=0.03  Score=57.99  Aligned_cols=36  Identities=44%  Similarity=0.652  Sum_probs=33.0

Q ss_pred             CCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         95 SPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        95 ~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      .....||.++++++++||+.|||+|+||+|.|+|..
T Consensus        74 ~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~  109 (326)
T cd06564          74 ANDGYYTKEEFKELIAYAKDRGVNIIPEIDSPGHSL  109 (326)
T ss_pred             CCCCcccHHHHHHHHHHHHHcCCeEeccCCCcHHHH
Confidence            456789999999999999999999999999999964


No 19 
>PF00728 Glyco_hydro_20:  Glycosyl hydrolase family 20, catalytic domain;  InterPro: IPR015883 Glycoside hydrolase family 20 GH20 from CAZY comprises enzymes with several known activities; beta-hexosaminidase (3.2.1.52 from EC); lacto-N-biosidase (3.2.1.140 from EC). Carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base in this family of enzymes. In the brain and other tissues, beta-hexosaminidase A degrades GM2 gangliosides; specifically, the enzyme hydrolyses terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides. There are 3 forms of beta-hexosaminidase: hexosaminidase A is a trimer, with one alpha, one beta-A and one beta-B chain; hexosaminidase B is a tetramer of two beta-A and two beta-B chains; and hexosaminidase S is a homodimer of alpha chains. The two beta chains are derived from the cleavage of a precursor. Mutations in the beta-chain lead to Sandhoff disease, a lysosomal storage disorder characterised by accumulation of GM2 ganglioside [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 3RPM_A 1C7T_A 1QBA_A 1QBB_A 1C7S_A 3RCN_A 2YL8_A 2YL6_A 2YLL_A 2YL5_C ....
Probab=94.24  E-value=0.025  Score=58.52  Aligned_cols=41  Identities=51%  Similarity=0.834  Sum_probs=33.6

Q ss_pred             cCCCCCCCc---cCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         90 RFGAYSPAK---VYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        90 ~~g~~~~~~---~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      ..|+.+...   .||.++|+++++||+.|||+|+||+|.|+|.+
T Consensus        57 ~~ga~~~~~~~~~yT~~di~~lv~yA~~~gI~VIPeid~PGH~~  100 (351)
T PF00728_consen   57 EKGAYRPSDAGGYYTKEDIRELVAYAKERGIEVIPEIDTPGHAE  100 (351)
T ss_dssp             HTTTESTTCTESEBEHHHHHHHHHHHHHTT-EEEEEEEESSS-H
T ss_pred             ccCccccccccccCCHHHHHHHHHHHHHcCCceeeeccCchHHH
Confidence            345555544   89999999999999999999999999999964


No 20 
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin.  Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=94.05  E-value=0.042  Score=57.67  Aligned_cols=34  Identities=47%  Similarity=0.699  Sum_probs=31.8

Q ss_pred             CccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      ...||.++++++++||+.|||+|+||+|.|+|+.
T Consensus        80 ~~~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~  113 (357)
T cd06563          80 GGFYTQEEIREIVAYAAERGITVIPEIDMPGHAL  113 (357)
T ss_pred             CceECHHHHHHHHHHHHHcCCEEEEecCCchhHH
Confidence            4679999999999999999999999999999964


No 21 
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides.  These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase.  The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=94.04  E-value=0.041  Score=56.39  Aligned_cols=35  Identities=37%  Similarity=0.749  Sum_probs=32.3

Q ss_pred             CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      ....||+++++++++||+.||++|+||+|.|+|..
T Consensus        65 ~~~~yT~~di~elv~yA~~rgI~viPEiD~PGH~~   99 (303)
T cd02742          65 PGGFYTYAQLKDIIEYAAARGIEVIPEIDMPGHST   99 (303)
T ss_pred             CCCeECHHHHHHHHHHHHHcCCEEEEeccchHHHH
Confidence            45689999999999999999999999999999964


No 22 
>cd06569 GH20_Sm-chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=93.52  E-value=0.054  Score=58.54  Aligned_cols=34  Identities=38%  Similarity=0.619  Sum_probs=31.8

Q ss_pred             CccCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      ...||+++|+++++||+.|||+|+||+|.|+|+.
T Consensus        91 ~g~YT~~di~eiv~yA~~rgI~VIPEID~PGH~~  124 (445)
T cd06569          91 SGYYSRADYIEILKYAKARHIEVIPEIDMPGHAR  124 (445)
T ss_pred             CCccCHHHHHHHHHHHHHcCCEEEEccCCchhHH
Confidence            4579999999999999999999999999999975


No 23 
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA  is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers).  In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury.  GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain.  The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=89.84  E-value=0.21  Score=51.17  Aligned_cols=32  Identities=31%  Similarity=0.431  Sum_probs=30.2

Q ss_pred             cCChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880         99 VYTPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus        99 ~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      .||.++++++++||+.||++|+|+++.|+|..
T Consensus        56 ~yT~~ei~ei~~yA~~~gI~vIPeid~pGH~~   87 (301)
T cd06565          56 AYTKEEIREIDDYAAELGIEVIPLIQTLGHLE   87 (301)
T ss_pred             CcCHHHHHHHHHHHHHcCCEEEecCCCHHHHH
Confidence            49999999999999999999999999999964


No 24 
>COG1649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=87.94  E-value=6.9  Score=41.97  Aligned_cols=156  Identities=16%  Similarity=0.041  Sum_probs=85.3

Q ss_pred             CCCCceeeec--cCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChh
Q psy14880         26 VGHNRTKLIV--WSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPL  103 (497)
Q Consensus        26 ~~~~RG~mlD--vaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~  103 (497)
                      +-+.||+=||  -+|..++.+.+++.+|.+..+.+             |++-.-..-+        .+.      .|-+.
T Consensus        44 ~~eiRGvWltn~~~~v~~~~~el~~~ld~l~~ln~-------------NTv~~qV~~~--------G~~------lypS~   96 (418)
T COG1649          44 PQEIRGVWLTNADSRVLFQRQELKDILDDLQKLNF-------------NTVYPQVWND--------GDA------LYPSA   96 (418)
T ss_pred             cccceeEEEecCCCcccccHHHHHHHHHHHHHcCC-------------ceeEEEEecC--------ccc------ccccc
Confidence            3577999999  56777899999999999998863             5553333221        110      01000


Q ss_pred             HHHHHHHHHHHcCceeeeecCC-CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhh
Q psy14880        104 EVQELVHYAKLRGVRVIFELDA-PAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQ  182 (497)
Q Consensus       104 ~i~~~~~y~~~rG~ri~~e~~~-p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~  182 (497)
                                     +.+.++. |+..+       ..++|        +=+..+|+-|.+|||+|+|=++.-.=+-. .+
T Consensus        97 ---------------~~p~s~~~~~~~~-------~~~g~--------DpLa~~I~~AHkr~l~v~aWf~~~~~a~~-~s  145 (418)
T COG1649          97 ---------------VLPWSDGLPGVLG-------VDPGY--------DPLAFVIAEAHKRGLEVHAWFNPYRMAPP-TS  145 (418)
T ss_pred             ---------------ccccccCcCcccC-------CCCCC--------ChHHHHHHHHHhcCCeeeechhhcccCCC-CC
Confidence                           0011111 11000       01122        34899999999999999998764322111 11


Q ss_pred             hcCCCCCCCcccccCCCCCCccccCC--CCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEE
Q psy14880        183 WGPRYGLGDLIVCLNDPSWRANCIQP--PCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFH  243 (497)
Q Consensus       183 ~~p~~~~~~l~~~~~~~~~~~~~~~~--~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iH  243 (497)
                      -.+.. .|+-. ....+.|...-...  ..-.|||..|++.+|+.+++.|+..=++  ..-||
T Consensus       146 ~~~~~-~p~~~-~~~~~~~~~~~~~~~~~~~~ldPg~Pevq~~i~~lv~evV~~Yd--vDGIQ  204 (418)
T COG1649         146 PLTKR-HPHWL-TTKRPGWVYVRHQGWGKRVWLDPGIPEVQDFITSLVVEVVRNYD--VDGIQ  204 (418)
T ss_pred             hhHhh-CCCCc-ccCCCCeEEEecCCceeeeEeCCCChHHHHHHHHHHHHHHhCCC--CCcee
Confidence            00000 11110 01111121110000  1346999999999999999999998777  44344


No 25 
>PF13200 DUF4015:  Putative glycosyl hydrolase domain
Probab=84.43  E-value=10  Score=39.27  Aligned_cols=77  Identities=21%  Similarity=0.200  Sum_probs=54.2

Q ss_pred             HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHH
Q psy14880        150 PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYG  229 (497)
Q Consensus       150 ~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~  229 (497)
                      -.|+++|++.++++||.+|--|=+.-=...+. ..|+.+..    -.++..|....   ...-+||.++++.+.+-+|-.
T Consensus        60 i~D~~~l~~~l~e~gIY~IARIv~FkD~~la~-~~pe~av~----~~~G~~w~d~~---~~~WvnP~~~evw~Y~i~IA~  131 (316)
T PF13200_consen   60 IKDLKALVKKLKEHGIYPIARIVVFKDPVLAE-AHPEWAVK----TKDGSVWRDNE---GEAWVNPYSKEVWDYNIDIAK  131 (316)
T ss_pred             ccCHHHHHHHHHHCCCEEEEEEEEecChHHhh-hChhhEEE----CCCCCcccCCC---CCccCCCCCHHHHHHHHHHHH
Confidence            36899999999999999998887766443322 24443321    12334564321   234699999999999999999


Q ss_pred             HHHhc
Q psy14880        230 ELMGM  234 (497)
Q Consensus       230 E~~~l  234 (497)
                      |++.+
T Consensus       132 Eaa~~  136 (316)
T PF13200_consen  132 EAAKL  136 (316)
T ss_pred             HHHHc
Confidence            99875


No 26 
>PRK12568 glycogen branching enzyme; Provisional
Probab=81.19  E-value=26  Score=40.49  Aligned_cols=118  Identities=19%  Similarity=0.286  Sum_probs=67.9

Q ss_pred             HHHHHHHHHcCce-ee--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880        106 QELVHYAKLRGVR-VI--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW  181 (497)
Q Consensus       106 ~~~~~y~~~rG~r-i~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l  181 (497)
                      ++++.|.+.-|+. |+  |-...|.  +..|.. ...|-|+++..| |.+|++++|+.|.++||.||-.+- |.|+..--
T Consensus       273 ~~ll~ylk~LGvt~I~LmPi~e~~~--~~~wGY-~~~~~~a~~~~~G~~~dfk~lV~~~H~~Gi~VIlD~V-~nH~~~d~  348 (730)
T PRK12568        273 EQLIPYVQQLGFTHIELLPITEHPF--GGSWGY-QPLGLYAPTARHGSPDGFAQFVDACHRAGIGVILDWV-SAHFPDDA  348 (730)
T ss_pred             HHHHHHHHHcCCCEEEECccccCCC--CCCCCC-CCCcCCccCcccCCHHHHHHHHHHHHHCCCEEEEEec-cccCCccc
Confidence            3467888888884 22  2221221  122321 112334444433 899999999999999999999874 88986521


Q ss_pred             hhcCCCCCCCcccc--cC-----CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        182 QWGPRYGLGDLIVC--LN-----DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       182 ~~~p~~~~~~l~~~--~~-----~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      .  ....+..-...  .+     ...|..       -.+|-.+|++.+|+-+.+.-.++-|.
T Consensus       349 ~--~l~~fdg~~~Ye~~d~~~g~~~~W~~-------~~~N~~~peVr~~li~~a~~Wl~eyh  401 (730)
T PRK12568        349 H--GLAQFDGAALYEHADPREGMHRDWNT-------LIYNYGRPEVTAYLLGSALEWIEHYH  401 (730)
T ss_pred             c--ccccCCCccccccCCCcCCccCCCCC-------eecccCCHHHHHHHHHHHHHHHHHhC
Confidence            1  00000000000  00     011211       14699999999999998888877665


No 27 
>PLN02361 alpha-amylase
Probab=80.41  E-value=4.9  Score=42.98  Aligned_cols=84  Identities=13%  Similarity=0.060  Sum_probs=47.1

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc----CCC-CCC------CcccccCCCCC-CccccCCCCCcCCCC
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG----PRY-GLG------DLIVCLNDPSW-RANCIQPPCGQLNPI  216 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~----p~~-~~~------~l~~~~~~~~~-~~~~~~~~~~~L~~~  216 (497)
                      |++|++++|+-|.++||+||-.+= +-|+.......    ... +.+      ....|..+..+ +....-...--||-.
T Consensus        74 t~~el~~li~~~h~~gi~vi~D~V-~NH~~g~~~~~~~~y~~~~g~~~~wd~~~~~~~~~g~~~~~~~~~~~~lpDLd~~  152 (401)
T PLN02361         74 SEHLLKSLLRKMKQYNVRAMADIV-INHRVGTTQGHGGMYNRYDGIPLPWDEHAVTSCTGGLGNRSTGDNFNGVPNIDHT  152 (401)
T ss_pred             CHHHHHHHHHHHHHcCCEEEEEEc-cccccCCCCCCCCCcccCCCCcCCCCccccccccCCCCCccCCCCCccCCccCCC
Confidence            899999999999999999999863 46763211100    000 000      00001111000 000000111258889


Q ss_pred             ChhHHHHHHHHHHHHHh
Q psy14880        217 NPHVYTVLKDIYGELMG  233 (497)
Q Consensus       217 ~p~t~~fl~~vl~E~~~  233 (497)
                      ||++.+.+++.+.-+.+
T Consensus       153 np~Vr~~l~~~~~wl~~  169 (401)
T PLN02361        153 QHFVRKDIIGWLIWLRN  169 (401)
T ss_pred             CHHHHHHHHHHHHHHHh
Confidence            99999999988876655


No 28 
>PRK14705 glycogen branching enzyme; Provisional
Probab=78.38  E-value=29  Score=42.38  Aligned_cols=120  Identities=22%  Similarity=0.320  Sum_probs=68.5

Q ss_pred             HHHHHHHHHHcCce-ee--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhh
Q psy14880        105 VQELVHYAKLRGVR-VI--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNG  180 (497)
Q Consensus       105 i~~~~~y~~~rG~r-i~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~  180 (497)
                      +++++.|.+.-|+. |+  |=...|..  ..|.+ ...+-|.++..| |.+|++++|+.|.++||.||-.+ +|+|+..-
T Consensus       768 ~~~lldYlk~LGvt~IeLmPv~e~p~~--~swGY-~~~~y~ap~~ryGt~~dfk~lVd~~H~~GI~VILD~-V~nH~~~d  843 (1224)
T PRK14705        768 AKELVDYVKWLGFTHVEFMPVAEHPFG--GSWGY-QVTSYFAPTSRFGHPDEFRFLVDSLHQAGIGVLLDW-VPAHFPKD  843 (1224)
T ss_pred             HHHHHHHHHHhCCCEEEECccccCCCC--CCCCC-CccccCCcCcccCCHHHHHHHHHHHHHCCCEEEEEe-ccccCCcc
Confidence            34567888888884 32  22222221  22311 122333444444 89999999999999999999864 78887432


Q ss_pred             hhhc-CCC-CCCCc--cccc--CCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        181 WQWG-PRY-GLGDL--IVCL--NDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       181 l~~~-p~~-~~~~l--~~~~--~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                       .+. ... +.+..  ..+.  ....|..       ..+|..++++.+|+.+.+.-.++-|.
T Consensus       844 -~~~l~~fdg~~~y~~~d~~~g~~~~Wg~-------~~fn~~~~eVr~fli~~a~~Wl~eyh  897 (1224)
T PRK14705        844 -SWALAQFDGQPLYEHADPALGEHPDWGT-------LIFDFGRTEVRNFLVANALYWLDEFH  897 (1224)
T ss_pred             -hhhhhhcCCCcccccCCcccCCCCCCCC-------ceecCCCHHHHHHHHHHHHHHHHHhC
Confidence             010 000 00000  0000  0012221       24788999999999999888887665


No 29 
>PRK14706 glycogen branching enzyme; Provisional
Probab=77.11  E-value=31  Score=39.31  Aligned_cols=121  Identities=19%  Similarity=0.193  Sum_probs=68.1

Q ss_pred             HHHHHHHHHHcCce-ee--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhh
Q psy14880        105 VQELVHYAKLRGVR-VI--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNG  180 (497)
Q Consensus       105 i~~~~~y~~~rG~r-i~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~  180 (497)
                      +++++.|-+.-|+. |+  |-...|..  ..|.. ...+-|+++..| |.+|++.+|+-|.++||.||-.+ +|.|+..-
T Consensus       170 ~~~l~~ylk~lG~t~velmPv~e~~~~--~~wGY-~~~~~~~~~~~~g~~~~~~~lv~~~H~~gi~VilD~-v~nH~~~~  245 (639)
T PRK14706        170 AHRLGEYVTYMGYTHVELLGVMEHPFD--GSWGY-QVTGYYAPTSRLGTPEDFKYLVNHLHGLGIGVILDW-VPGHFPTD  245 (639)
T ss_pred             HHHHHHHHHHcCCCEEEccchhcCCCC--CCCCc-CcccccccccccCCHHHHHHHHHHHHHCCCEEEEEe-cccccCcc
Confidence            34466888888884 32  22222221  12311 112334555555 89999999999999999999876 56887532


Q ss_pred             hhhcCCC-CCCCcccccCC----CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        181 WQWGPRY-GLGDLIVCLND----PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       181 l~~~p~~-~~~~l~~~~~~----~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      -...... +.+........    ..|..       -.+|..+|++.+|+.+.+.-.++-|.
T Consensus       246 ~~~l~~~dg~~~y~~~~~~~g~~~~w~~-------~~~~~~~~eVr~~l~~~~~~W~~e~~  299 (639)
T PRK14706        246 ESGLAHFDGGPLYEYADPRKGYHYDWNT-------YIFDYGRNEVVMFLIGSALKWLQDFH  299 (639)
T ss_pred             hhhhhccCCCcceeccCCcCCcCCCCCC-------cccCCCCHHHHHHHHHHHHHHHHHhC
Confidence            1000000 00000000000    11211       14788999999999999888876555


No 30 
>PF14871 GHL6:  Hypothetical glycosyl hydrolase 6
Probab=76.68  E-value=12  Score=33.61  Aligned_cols=86  Identities=15%  Similarity=0.032  Sum_probs=55.8

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCC-CCccccCCCCCcCCCCChhHHHH
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPS-WRANCIQPPCGQLNPINPHVYTV  223 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~-~~~~~~~~~~~~L~~~~p~t~~f  223 (497)
                      ...++++=++|+|+-|+++||+|+-=+|.--+..++ ..+|+-...    ..++.+ .......+....+|++.| -.++
T Consensus        38 hp~L~~Dllge~v~a~h~~Girv~ay~~~~~d~~~~-~~HPeW~~~----~~~G~~~~~~~~~~~~~~~~c~ns~-Y~e~  111 (132)
T PF14871_consen   38 HPGLKRDLLGEQVEACHERGIRVPAYFDFSWDEDAA-ERHPEWFVR----DADGRPMRGERFGYPGWYTCCLNSP-YREF  111 (132)
T ss_pred             CCCCCcCHHHHHHHHHHHCCCEEEEEEeeecChHHH-HhCCceeeE----CCCCCCcCCCCcCCCCceecCCCcc-HHHH
Confidence            345777788999999999999999999998777665 355553211    112221 000000111234666554 5599


Q ss_pred             HHHHHHHHHhccC
Q psy14880        224 LKDIYGELMGMAK  236 (497)
Q Consensus       224 l~~vl~E~~~lF~  236 (497)
                      +...++|+++.++
T Consensus       112 ~~~~i~Ei~~~y~  124 (132)
T PF14871_consen  112 LLEQIREILDRYD  124 (132)
T ss_pred             HHHHHHHHHHcCC
Confidence            9999999999887


No 31 
>PRK05402 glycogen branching enzyme; Provisional
Probab=76.12  E-value=77  Score=36.66  Aligned_cols=123  Identities=20%  Similarity=0.243  Sum_probs=67.4

Q ss_pred             ChhHHH-HHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHcCCEEEEecCCcc
Q psy14880        101 TPLEVQ-ELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAK-VYTPLEVQELVHYAKLRGIRVIFELDAPA  175 (497)
Q Consensus       101 ~~~~i~-~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~-~YT~~ei~eiv~yA~~rgI~VIPEID~PG  175 (497)
                      +-..+. +++.|-+.-|+..   .|-...|..  ..|.. ...+-|+.+. +=|.+|+++||+-|.++||.||=.+- |.
T Consensus       263 ~~~~i~~~l~~ylk~LGv~~i~L~Pi~e~~~~--~~~GY-~~~~y~ai~~~~Gt~~dfk~lV~~~H~~Gi~VilD~V-~N  338 (726)
T PRK05402        263 SYRELADQLIPYVKEMGFTHVELLPIAEHPFD--GSWGY-QPTGYYAPTSRFGTPDDFRYFVDACHQAGIGVILDWV-PA  338 (726)
T ss_pred             CHHHHHHHHHHHHHHcCCCEEEECCcccCCCC--CCCCC-CcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEEC-CC
Confidence            334444 4568888888753   232222221  12211 1111223333 33899999999999999999998764 78


Q ss_pred             cchhhhhhcCCCCCCCccccc--C-----CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        176 HAGNGWQWGPRYGLGDLIVCL--N-----DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       176 H~~a~l~~~p~~~~~~l~~~~--~-----~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      |+..--  .....+.......  +     ...|..       ..+|..+|++.+++.+++.-.++-|.
T Consensus       339 H~~~~~--~~~~~~~~~~~y~~~~~~~~~~~~w~~-------~~~n~~~~~v~~~l~~~~~~W~~e~~  397 (726)
T PRK05402        339 HFPKDA--HGLARFDGTALYEHADPREGEHPDWGT-------LIFNYGRNEVRNFLVANALYWLEEFH  397 (726)
T ss_pred             CCCCCc--cchhccCCCcceeccCCcCCccCCCCC-------ccccCCCHHHHHHHHHHHHHHHHHhC
Confidence            885310  0000000000000  0     001211       14788999999999998888876555


No 32 
>PF02679 ComA:  (2R)-phospho-3-sulfolactate synthase (ComA);  InterPro: IPR003830 Methanogenic archaea produce methane via the anaerobic reduction of acetate or single carbon compounds []. Coenzyme M (CoM; 2-mercaptoethanesulphonic acid) serves as the terminal methyl carrier for this process. Previously thought to be unique to methanogenic archaea, CoM has also been found in methylotrophic bacteria. Biosynthesis of CoM begins with the Michael addition of sulphite to phosphoenolpyruvate, forming 2-phospho-3-sulpholactate (PSL). This reaction is catalyzed by members of this family, PSL synthase (ComA) []. Subsequently, PSL is dephosphorylated by phosphosulpholactate phosphatase (ComB) to form 3-sulpholactate [], which is then converted to 3-sulphopyruvate by L-sulpholactate dehydrogenase (ComC; 1.1.1.272 from EC) []. Sulphopyruvate decarboxylase (ComDE; 4.1.1.79 from EC) converts 3-sulphopyruvate to sulphoacetaldehyde []. Reductive thiolation of sulphoacetaldehyde is the final step.; GO: 0019295 coenzyme M biosynthetic process; PDB: 1U83_A 1QWG_A.
Probab=76.03  E-value=19  Score=35.99  Aligned_cols=31  Identities=10%  Similarity=-0.005  Sum_probs=22.9

Q ss_pred             cCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcc
Q psy14880         23 DEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCG   56 (497)
Q Consensus        23 D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~   56 (497)
                      -.||-.--..|+|-+   +++..++++++.++.|
T Consensus         7 ~KPR~~GlT~v~Dkg---lg~~~~~dlLe~ag~y   37 (244)
T PF02679_consen    7 EKPRSRGLTMVIDKG---LGLRYLEDLLESAGDY   37 (244)
T ss_dssp             -SS-SSS-EEEEESS-----HHHHHHHHHHHGGG
T ss_pred             CCCCCCCcEEEecCC---CCHHHHHHHHHHhhhh
Confidence            457776678899988   9999999999999987


No 33 
>TIGR01515 branching_enzym alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase. A sequence from Arabidopsis thaliana, GP|9294564, scores just above trusted, but appears either to contain corrupt sequence or, more likely, to be a pseudogene as some of the conserved catalytic residues common to the alpha amylase family are not conserved here.
Probab=75.70  E-value=43  Score=37.88  Aligned_cols=122  Identities=19%  Similarity=0.182  Sum_probs=66.4

Q ss_pred             HHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhh
Q psy14880        107 ELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQ  182 (497)
Q Consensus       107 ~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~  182 (497)
                      +++.|.+.-|+.-   .|-...|.  ...|... ..+-|+.+..| |.+|+++||+-|.++||.||-.+ ++.|+..--.
T Consensus       161 ~l~dyl~~LGvt~i~L~Pi~e~~~--~~~wGY~-~~~y~~~~~~~Gt~~dlk~lV~~~H~~Gi~VilD~-V~NH~~~~~~  236 (613)
T TIGR01515       161 QLIPYVKELGFTHIELLPVAEHPF--DGSWGYQ-VTGYYAPTSRFGTPDDFMYFVDACHQAGIGVILDW-VPGHFPKDDH  236 (613)
T ss_pred             HHHHHHHHcCCCEEEECCcccCCC--CCCCCCC-cccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe-cccCcCCccc
Confidence            3558888878753   23222221  1122111 11223344444 78999999999999999999876 4688863211


Q ss_pred             hcCCC-CCCCc-ccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        183 WGPRY-GLGDL-IVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       183 ~~p~~-~~~~l-~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      ..... +.+.. ..+........++    ...+|..+|++.+|+.+++.-.++-|.
T Consensus       237 ~~~~~~~~~~y~~~~~~~~~~~~w~----~~~~~~~~~~Vr~~l~~~~~~W~~ey~  288 (613)
T TIGR01515       237 GLAEFDGTPLYEHKDPRDGEHWDWG----TLIFDYGRPEVRNFLVANALYWAEFYH  288 (613)
T ss_pred             hhhccCCCcceeccCCccCcCCCCC----CceecCCCHHHHHHHHHHHHHHHHHhC
Confidence            00000 00000 0000000000111    124788999999999999998887655


No 34 
>PRK12313 glycogen branching enzyme; Provisional
Probab=75.11  E-value=78  Score=35.89  Aligned_cols=123  Identities=20%  Similarity=0.244  Sum_probs=67.2

Q ss_pred             hHHH-HHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880        103 LEVQ-ELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHA  177 (497)
Q Consensus       103 ~~i~-~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~  177 (497)
                      ..+. +++.|-+.-|+..   .|=...|..  ..|.. ...+-|+.+..| |.+|+++||+-|.++||.||-.+- |.|+
T Consensus       170 ~~~~~~ll~yl~~LGv~~i~L~Pi~~~~~~--~~~GY-~~~~y~~i~~~~Gt~~d~k~lv~~~H~~Gi~VilD~V-~nH~  245 (633)
T PRK12313        170 RELADELIPYVKEMGYTHVEFMPLMEHPLD--GSWGY-QLTGYFAPTSRYGTPEDFMYLVDALHQNGIGVILDWV-PGHF  245 (633)
T ss_pred             HHHHHHHHHHHHHcCCCEEEeCchhcCCCC--CCCCC-CCcCcCcCCCCCCCHHHHHHHHHHHHHCCCEEEEEEC-CCCC
Confidence            3434 4568888878753   232222321  11211 111222333333 899999999999999999999864 7888


Q ss_pred             hhhhhhcCCCC-CCCcc-cccCC---CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        178 GNGWQWGPRYG-LGDLI-VCLND---PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       178 ~a~l~~~p~~~-~~~l~-~~~~~---~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      ..--...+... .+... ..+..   ..|.       ...+|-.+|++.+|+.+++.-.++-|.
T Consensus       246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~-------~~~~n~~~~~vr~~l~~~~~~W~~~~~  302 (633)
T PRK12313        246 PKDDDGLAYFDGTPLYEYQDPRRAENPDWG-------ALNFDLGKNEVRSFLISSALFWLDEYH  302 (633)
T ss_pred             CCCcccccccCCCcceeecCCCCCcCCCCC-------CcccCCCCHHHHHHHHHHHHHHHHHhC
Confidence            64210000000 00000 00000   0121       125888999999999999888887655


No 35 
>PF02449 Glyco_hydro_42:  Beta-galactosidase;  InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=74.99  E-value=15  Score=38.59  Aligned_cols=83  Identities=16%  Similarity=0.196  Sum_probs=47.8

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccc-ccCCCCCCccccCCCCCcCCCCChhHHHHH
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIV-CLNDPSWRANCIQPPCGQLNPINPHVYTVL  224 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~-~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl  224 (497)
                      |.|.-+.+..+++.|+++||.||--+  |.++...+-.  . .+|+... ..++.. ...+   .....|+.+|...+.+
T Consensus        42 G~ydF~~lD~~l~~a~~~Gi~viL~~--~~~~~P~Wl~--~-~~Pe~~~~~~~g~~-~~~g---~~~~~~~~~p~yr~~~  112 (374)
T PF02449_consen   42 GQYDFSWLDRVLDLAAKHGIKVILGT--PTAAPPAWLY--D-KYPEILPVDADGRR-RGFG---SRQHYCPNSPAYREYA  112 (374)
T ss_dssp             TB---HHHHHHHHHHHCTT-EEEEEE--CTTTS-HHHH--C-CSGCCC-B-TTTSB-EECC---CSTT-HCCHHHHHHHH
T ss_pred             CeeecHHHHHHHHHHHhccCeEEEEe--cccccccchh--h-hcccccccCCCCCc-CccC---CccccchhHHHHHHHH
Confidence            34777889999999999999999644  4444443321  1 1333321 111110 0000   1134789999999999


Q ss_pred             HHHHHHHHhccCC
Q psy14880        225 KDIYGELMGMAKY  237 (497)
Q Consensus       225 ~~vl~E~~~lF~~  237 (497)
                      .++++++++-+.+
T Consensus       113 ~~~~~~l~~~y~~  125 (374)
T PF02449_consen  113 RRFIRALAERYGD  125 (374)
T ss_dssp             HHHHHHHHHHHTT
T ss_pred             HHHHHHHHhhccc
Confidence            9999999988774


No 36 
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=74.21  E-value=36  Score=38.56  Aligned_cols=121  Identities=24%  Similarity=0.382  Sum_probs=70.2

Q ss_pred             hHHHHHHHHHHHcCce-eeeecCCCCCCC-CCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        103 LEVQELVHYAKLRGVR-VIFELDAPAHAG-NGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       103 ~~i~~~~~y~~~rG~r-i~~e~~~p~~~~-~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      +..++++.|.+.-|+. |++ +.+..|++ .+|.. ..-|-|++...| |.+|++.+|..|-++||.||=.. +|||...
T Consensus       165 e~a~~llpYl~elG~T~IEL-MPv~e~p~~~sWGY-q~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD~-V~~HF~~  241 (628)
T COG0296         165 ELAIELLPYLKELGITHIEL-MPVAEHPGDRSWGY-QGTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILDW-VPNHFPP  241 (628)
T ss_pred             HHHHHHhHHHHHhCCCEEEE-cccccCCCCCCCCC-CcceeccccccCCCHHHHHHHHHHHHHcCCEEEEEe-cCCcCCC
Confidence            4567788899988884 431 11122222 22311 122445554444 99999999999999999999763 7999876


Q ss_pred             hhhhcCCCCCCCcccccC--------CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        180 GWQWGPRYGLGDLIVCLN--------DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       180 ~l~~~p~~~~~~l~~~~~--------~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      --.....+. +..  +..        .++|..       ...|-+.+|+..|+-+-..--++-|.
T Consensus       242 d~~~L~~fd-g~~--~~e~~~~~~~~~~~Wg~-------~i~~~gr~EVR~Fll~nal~Wl~~yH  296 (628)
T COG0296         242 DGNYLARFD-GTF--LYEHEDPRRGEHTDWGT-------AIFNYGRNEVRNFLLANALYWLEEYH  296 (628)
T ss_pred             CcchhhhcC-Ccc--ccccCCcccccCCCccc-------chhccCcHHHHHHHHHHHHHHHHHhC
Confidence            211100000 000  011        123321       23455699999998877766666655


No 37 
>PF00128 Alpha-amylase:  Alpha amylase, catalytic domain;  InterPro: IPR006047 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site.  Enzymes containing this domain, such as alpha-amylase, belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. The maltogenic alpha-amylase is an enzyme which catalyses hydrolysis of (1-4)-alpha-D-glucosidic linkages in polysaccharides so as to remove successive alpha-maltose residues from the non-reducing ends of the chains in the conversion of starch to maltose. Other enzymes include neopullulanase, which hydrolyses pullulan to panose, and cyclomaltodextrinase, which hydrolyses cyclodextrins. This entry represents the catalytic domain found in several protein members of this family. It has a structure consisting of an 8 stranded alpha/beta barrel that contains the active site, interrupted by a ~70 amino acid calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 3FAX_A 3FAW_A 2DH3_B 2DH2_A 1CIU_A 1A47_A 3BMW_A 3BMV_A 2FH8_A 2FH6_A ....
Probab=69.14  E-value=14  Score=36.42  Aligned_cols=86  Identities=24%  Similarity=0.359  Sum_probs=51.2

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc----C--CCCCCCcccc--------------cCCCCCCcc----
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG----P--RYGLGDLIVC--------------LNDPSWRAN----  204 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~----p--~~~~~~l~~~--------------~~~~~~~~~----  204 (497)
                      |.+|+++||+-|.+|||+||-.+- +.|+...-.+.    .  ...+++....              .+...|...    
T Consensus        50 t~~d~~~Lv~~~h~~gi~VilD~V-~NH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  128 (316)
T PF00128_consen   50 TMEDFKELVDAAHKRGIKVILDVV-PNHTSDDHPWFQDSLNYFDNPYSDYYYWRDGEGSPPGNWYSYFGGSNWEYDDWGD  128 (316)
T ss_dssp             HHHHHHHHHHHHHHTTCEEEEEEE-TSEEETTSHHHHHHHTHTTSTTGTTBEEESBTSTTSSTBBCSTTTSSEESCHHTH
T ss_pred             hhhhhhhhhhccccccceEEEeee-ccccccccccccccccccccccccceeeccccccccccccccccccccccccccc
Confidence            789999999999999999997653 44776543320    0  0001111000              001111110    


Q ss_pred             --ccCCCCCcCCCCChhHHHHHHHHHHHHHhcc
Q psy14880        205 --CIQPPCGQLNPINPHVYTVLKDIYGELMGMA  235 (497)
Q Consensus       205 --~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF  235 (497)
                        ...+....||..+|++.+++.+++...++.+
T Consensus       129 ~~~~~~~~~dln~~n~~v~~~i~~~~~~w~~~g  161 (316)
T PF00128_consen  129 EYQFWSDLPDLNYENPEVREYIIDVLKFWIEEG  161 (316)
T ss_dssp             CHSSSTTSEEBETTSHHHHHHHHHHHHHHHHTT
T ss_pred             ccccccccchhhhhhhhhhhhhcccccchhhce
Confidence              0011123588999999999999998888643


No 38 
>TIGR02100 glgX_debranch glycogen debranching enzyme GlgX. This family consists of the GlgX protein from the E. coli glycogen operon and probable equivalogs from other prokaryotic species. GlgX is not required for glycogen biosynthesis, but instead acts as a debranching enzyme for glycogen catabolism. This model distinguishes GlgX from pullanases and other related proteins that also operate on alpha-1,6-glycosidic linkages. In the wide band between the trusted and noise cutoffs are functionally similar enzymes, mostly from plants, that act similarly but usually are termed isoamylase.
Probab=68.11  E-value=28  Score=40.00  Aligned_cols=91  Identities=20%  Similarity=0.282  Sum_probs=54.7

Q ss_pred             CCCCCC----CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccc------ccCC-CCCCccccCCCCC
Q psy14880        143 SPAKVY----TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIV------CLND-PSWRANCIQPPCG  211 (497)
Q Consensus       143 s~~~~Y----T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~------~~~~-~~~~~~~~~~~~~  211 (497)
                      .+++.|    |.+|+|++|+-|.++||+||=.+ ++.|+...-...|...+..+..      -.+. ..+..+  ..-..
T Consensus       233 a~d~~y~~~g~~~efk~LV~~~H~~GI~VIlDv-V~NHt~~~~~~~~~~~~~~~d~~~yy~~~~~~~~~~~~~--~g~gn  309 (688)
T TIGR02100       233 APEPRYLASGQVAEFKTMVRALHDAGIEVILDV-VYNHTAEGNELGPTLSFRGIDNASYYRLQPDDKRYYIND--TGTGN  309 (688)
T ss_pred             ccChhhcCCCCHHHHHHHHHHHHHCCCEEEEEE-CcCCccCcCCCCCcccccCCCCCcceEecCCCCceecCC--CCccc
Confidence            455566    78999999999999999999876 4578764311112111111100      0000 000000  00013


Q ss_pred             cCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        212 QLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       212 ~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      .||..+|++.+++.+.+.-.+.-|.
T Consensus       310 ~ln~~~p~vr~~i~d~l~~W~~e~g  334 (688)
T TIGR02100       310 TLNLSHPRVLQMVMDSLRYWVTEMH  334 (688)
T ss_pred             cccCCCHHHHHHHHHHHHHHHHHcC
Confidence            6899999999999999888876555


No 39 
>PLN02960 alpha-amylase
Probab=66.97  E-value=83  Score=37.12  Aligned_cols=125  Identities=14%  Similarity=0.114  Sum_probs=69.0

Q ss_pred             ChhHHH-HHHHHHHHcCcee-e--eecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcc
Q psy14880        101 TPLEVQ-ELVHYAKLRGVRV-I--FELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPA  175 (497)
Q Consensus       101 ~~~~i~-~~~~y~~~rG~ri-~--~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PG  175 (497)
                      +-..+. +.+.|.+.-|+.- +  |-...|..  .+|.. ...+-|+++..| |.+|+++||+-|.++||.||-.+ +|.
T Consensus       414 tf~~~~e~~LdYLk~LGvt~IeLmPv~e~~~~--~swGY-~~~~yfa~~~~yGtp~dfk~LVd~aH~~GI~VILDv-V~N  489 (897)
T PLN02960        414 SFKEFTQKVLPHVKKAGYNAIQLIGVQEHKDY--SSVGY-KVTNFFAVSSRFGTPDDFKRLVDEAHGLGLLVFLDI-VHS  489 (897)
T ss_pred             CHHHHHHHHHHHHHHcCCCEEEECCcccCCCC--CCCCC-CcccCCCcccccCCHHHHHHHHHHHHHCCCEEEEEe-ccc
Confidence            334554 5678888888853 2  21111221  11211 112223444434 89999999999999999999875 678


Q ss_pred             cchhhhhhc-CCC-CCCCcccccC----CCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        176 HAGNGWQWG-PRY-GLGDLIVCLN----DPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       176 H~~a~l~~~-p~~-~~~~l~~~~~----~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      |+..--..+ ... +.+++-.-.+    ...|.       +..+|..++++..|+-+.+.-.++-|.
T Consensus       490 H~~~d~~~~L~~FDG~~~~Yf~~~~~g~~~~WG-------~~~fNy~~~eVr~fLlsna~yWl~Eyh  549 (897)
T PLN02960        490 YAAADEMVGLSLFDGSNDCYFHSGKRGHHKRWG-------TRMFKYGDHEVLHFLLSNLNWWVTEYR  549 (897)
T ss_pred             ccCCccccchhhcCCCccceeecCCCCccCCCC-------CcccCCCCHHHHHHHHHHHHHHHHHHC
Confidence            876520000 000 0000000000    01221       135789999999999999888876554


No 40 
>PLN02784 alpha-amylase
Probab=64.75  E-value=21  Score=41.70  Aligned_cols=30  Identities=27%  Similarity=0.241  Sum_probs=26.7

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |++|++++|+-|.++||.||-.| ++-|+..
T Consensus       566 T~~ELk~LI~a~H~~GIkVIlDi-ViNH~ag  595 (894)
T PLN02784        566 TIDELKDLVKSFHEVGIKVLGDA-VLNHRCA  595 (894)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-Ccccccc
Confidence            99999999999999999999876 6777754


No 41 
>COG3525 Chb N-acetyl-beta-hexosaminidase [Carbohydrate transport and metabolism]
Probab=62.54  E-value=5.8  Score=44.58  Aligned_cols=36  Identities=42%  Similarity=0.608  Sum_probs=32.8

Q ss_pred             CCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCC
Q psy14880         96 PAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGN  131 (497)
Q Consensus        96 ~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~  131 (497)
                      -+..||.++++++++|++-|.+-++||+|+|+|++.
T Consensus       339 ~ggfytqd~~relv~yAsar~ItviPeiD~PgHa~a  374 (732)
T COG3525         339 MGGFYTQDDIRELVAYASARQITVIPEIDMPGHARA  374 (732)
T ss_pred             ccCcccHHHHHHHHHHHhhcCceecCCcCCcchhhh
Confidence            367899999999999999999999999999999754


No 42 
>smart00642 Aamy Alpha-amylase domain.
Probab=62.53  E-value=8.5  Score=35.90  Aligned_cols=30  Identities=37%  Similarity=0.566  Sum_probs=26.5

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|++++|+-|+++||+||-.+ +|-|+..
T Consensus        68 t~~d~~~lv~~~h~~Gi~vilD~-V~NH~~~   97 (166)
T smart00642       68 TMEDFKELVDAAHARGIKVILDV-VINHTSD   97 (166)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-CCCCCCC
Confidence            78999999999999999999986 5677655


No 43 
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=61.48  E-value=63  Score=32.99  Aligned_cols=136  Identities=14%  Similarity=0.099  Sum_probs=70.3

Q ss_pred             cCChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCccc
Q psy14880         99 VYTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAH  176 (497)
Q Consensus        99 ~y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH  176 (497)
                      ..+.++|.++++-.+.+|+.+. .-+|.      .|  ....|.+.-+ ..+-  +.+++++.-+++|+++++-|+ |+-
T Consensus        26 ~~s~~~v~~~~~~~~~~~iP~d~i~iD~------~w--~~~~g~f~~d~~~FP--dp~~mi~~l~~~G~k~~l~i~-P~i   94 (303)
T cd06592          26 DINQETVLNYAQEIIDNGFPNGQIEIDD------NW--ETCYGDFDFDPTKFP--DPKGMIDQLHDLGFRVTLWVH-PFI   94 (303)
T ss_pred             CcCHHHHHHHHHHHHHcCCCCCeEEeCC------Cc--cccCCccccChhhCC--CHHHHHHHHHHCCCeEEEEEC-Cee
Confidence            4567788888877777776422 12221      11  1223333222 1232  599999999999999998776 322


Q ss_pred             chhhhhhcCCCC-CCCcccccCC-CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880        177 AGNGWQWGPRYG-LGDLIVCLND-PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADE  248 (497)
Q Consensus       177 ~~a~l~~~p~~~-~~~l~~~~~~-~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE  248 (497)
                      ... -..+.+.. .+-+..-.++ .+.........+..+|.+||++.++..+.++++..--.  -..|=+=.-|
T Consensus        95 ~~~-s~~~~e~~~~g~~vk~~~g~~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~G--vdg~w~D~~E  165 (303)
T cd06592          95 NTD-SENFREAVEKGYLVSEPSGDIPALTRWWNGTAAVLDFTNPEAVDWFLSRLKSLQEKYG--IDSFKFDAGE  165 (303)
T ss_pred             CCC-CHHHHhhhhCCeEEECCCCCCCcccceecCCcceEeCCCHHHHHHHHHHHHHHHHHhC--CcEEEeCCCC
Confidence            110 00111100 0111100111 11100000112457999999999999999998883223  3444444444


No 44 
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=61.07  E-value=15  Score=36.59  Aligned_cols=22  Identities=14%  Similarity=0.113  Sum_probs=17.7

Q ss_pred             eeeccCCCCCCHHHHHHhhhhhhcc
Q psy14880         32 KLIVWSSHLTDPEIILNYLDPQTCG   56 (497)
Q Consensus        32 ~mlDvaR~~~~~~~lk~~id~ma~~   56 (497)
                      .|+|-+   +++..++++++.|+.|
T Consensus         3 ~v~dkg---l~~~~~~d~Le~~g~y   24 (237)
T TIGR03849         3 MVLDKG---LPPKFVEDYLKVCGDY   24 (237)
T ss_pred             eEecCC---CCHHHHHHHHHHhhhh
Confidence            355543   6999999999999987


No 45 
>PRK09441 cytoplasmic alpha-amylase; Reviewed
Probab=60.70  E-value=27  Score=38.05  Aligned_cols=30  Identities=20%  Similarity=0.257  Sum_probs=26.6

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|+++||+-|.+|||.||-.+ ++-|+..
T Consensus        79 t~~dl~~Li~~~H~~Gi~vi~D~-V~NH~~~  108 (479)
T PRK09441         79 TKEELLNAIDALHENGIKVYADV-VLNHKAG  108 (479)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-CcccccC
Confidence            89999999999999999999876 5778763


No 46 
>PLN00196 alpha-amylase; Provisional
Probab=59.73  E-value=33  Score=37.04  Aligned_cols=32  Identities=16%  Similarity=0.250  Sum_probs=27.5

Q ss_pred             CCCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        147 VYTPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      +=|++|+++||+-|.++||.||-.+ ++-|+.+
T Consensus        88 fGt~~elk~Lv~~aH~~GIkVilDv-V~NH~~~  119 (428)
T PLN00196         88 YGNEAQLKSLIEAFHGKGVQVIADI-VINHRTA  119 (428)
T ss_pred             CCCHHHHHHHHHHHHHCCCEEEEEE-CccCccc
Confidence            4589999999999999999999875 5777764


No 47 
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=59.67  E-value=29  Score=35.80  Aligned_cols=22  Identities=27%  Similarity=0.475  Sum_probs=19.0

Q ss_pred             CcCCCCChhHHHHHHHHHHHHH
Q psy14880        211 GQLNPINPHVYTVLKDIYGELM  232 (497)
Q Consensus       211 ~~L~~~~p~t~~fl~~vl~E~~  232 (497)
                      ..+|.+||++.+...+.++++.
T Consensus       131 ~~~Dftnp~a~~ww~~~~~~~~  152 (317)
T cd06594         131 GVLDLTNPAARDWFKQVIKEML  152 (317)
T ss_pred             eeeecCCHHHHHHHHHHHHHHh
Confidence            5799999999999988888763


No 48 
>TIGR02402 trehalose_TreZ malto-oligosyltrehalose trehalohydrolase. Members of this family are the trehalose biosynthetic enzyme malto-oligosyltrehalose trehalohydrolase, formally known as 4-alpha-D-{(1-4)-alpha-D-glucano}trehalose trehalohydrolase (EC 3.2.1.141). It is the TreZ protein of the TreYZ pathway for trehalose biosynthesis, and alternative to the OtsAB system.
Probab=58.06  E-value=1.5e+02  Score=32.95  Aligned_cols=119  Identities=22%  Similarity=0.332  Sum_probs=63.0

Q ss_pred             HHHHHHHHHHHcCce---eeeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        104 EVQELVHYAKLRGVR---VIFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       104 ~i~~~~~y~~~rG~r---i~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      .|.+-+.|.+.-|+.   +.|-.+.|+.  ..|... ...-|..+..| |.+|+++||+-|.++||+||=.+ ++.|+..
T Consensus       112 gi~~~l~yl~~LGv~~i~L~Pi~~~~~~--~~~GY~-~~~~~~~~~~~G~~~e~k~lV~~aH~~Gi~VilD~-V~NH~~~  187 (542)
T TIGR02402       112 AAIEKLPYLADLGITAIELMPVAQFPGT--RGWGYD-GVLPYAPHNAYGGPDDLKALVDAAHGLGLGVILDV-VYNHFGP  187 (542)
T ss_pred             HHHHhhHHHHHcCCCEEEeCccccCCCC--CCCCCC-ccCccccccccCCHHHHHHHHHHHHHCCCEEEEEE-ccCCCCC
Confidence            445556777777774   3333323321  122111 11112333333 88999999999999999999876 4678753


Q ss_pred             hhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCCh---hHHHHHHHHHHHHHhccC
Q psy14880        180 GWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINP---HVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       180 ~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p---~t~~fl~~vl~E~~~lF~  236 (497)
                      --...+... | .-.-.....|.        ..+|..+|   ++.+++.+.+.-.++-|.
T Consensus       188 ~~~~~~~~~-~-y~~~~~~~~wg--------~~~n~~~~~~~~vr~~i~~~~~~W~~e~~  237 (542)
T TIGR02402       188 EGNYLPRYA-P-YFTDRYSTPWG--------AAINFDGPGSDEVRRYILDNALYWLREYH  237 (542)
T ss_pred             ccccccccC-c-cccCCCCCCCC--------CccccCCCcHHHHHHHHHHHHHHHHHHhC
Confidence            211000000 1 00000011121        24777777   888888877776665444


No 49 
>PLN03244 alpha-amylase; Provisional
Probab=56.63  E-value=47  Score=38.69  Aligned_cols=77  Identities=17%  Similarity=0.105  Sum_probs=51.9

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccccc------C----CCCCCccccCCCCCcCCCCC
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCL------N----DPSWRANCIQPPCGQLNPIN  217 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~------~----~~~~~~~~~~~~~~~L~~~~  217 (497)
                      =|.+|++.+|.-|.++||.||-.+ +|.|+..--.    .++....-.+      +    ...|+       +..+|..+
T Consensus       438 GTPeDLK~LVD~aH~~GI~VILDv-V~NH~~~d~~----~GL~~fDGt~~~Yf~~~~~g~~~~WG-------s~~fnyg~  505 (872)
T PLN03244        438 GTPDDFKRLVDEAHGLGLLVFLDI-VHSYAAADEM----VGLSLFDGSNDCYFHTGKRGHHKHWG-------TRMFKYGD  505 (872)
T ss_pred             CCHHHHHHHHHHHHHCCCEEEEEe-cCccCCCccc----cchhhcCCCccceeccCCCCccCCCC-------CceecCCC
Confidence            389999999999999999999876 4788864210    0111100000      0    01121       23688999


Q ss_pred             hhHHHHHHHHHHHHHhccC
Q psy14880        218 PHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       218 p~t~~fl~~vl~E~~~lF~  236 (497)
                      +++..||-+.+.-.++-|.
T Consensus       506 ~EVr~FLLsna~yWleEyh  524 (872)
T PLN03244        506 LDVLHFLISNLNWWITEYQ  524 (872)
T ss_pred             HHHHHHHHHHHHHHHHHhC
Confidence            9999999999988887655


No 50 
>PLN02447 1,4-alpha-glucan-branching enzyme
Probab=56.62  E-value=1.9e+02  Score=33.82  Aligned_cols=116  Identities=16%  Similarity=0.134  Sum_probs=66.3

Q ss_pred             HHHHHHHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880        106 QELVHYAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAKVY-TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW  181 (497)
Q Consensus       106 ~~~~~y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l  181 (497)
                      ++.+.|.+.-|+.-   .|-...|..  ..|... ..+-|.++..| |.+|+++||+-|.++||.||-.+ +|.|+..--
T Consensus       254 ~~~L~ylk~LG~t~I~LmPi~e~~~~--~~wGY~-~~~~fa~~~~~Gtp~dlk~LVd~aH~~GI~VilDv-V~nH~~~~~  329 (758)
T PLN02447        254 DDVLPRIKALGYNAVQLMAIQEHAYY--GSFGYH-VTNFFAVSSRSGTPEDLKYLIDKAHSLGLRVLMDV-VHSHASKNT  329 (758)
T ss_pred             HHHHHHHHHcCCCEEEECCccccCCC--CCCCcC-cccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe-ccccccccc
Confidence            45677888778742   222222211  112111 11222333334 78999999999999999999875 688886421


Q ss_pred             hhcCCCCCCCcc---------cccCC-CCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        182 QWGPRYGLGDLI---------VCLND-PSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       182 ~~~p~~~~~~l~---------~~~~~-~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      .    .++..+.         .+.+. ..|.       ...+|-.++++.+||.+.+.-.++-|.
T Consensus       330 ~----~gl~~fDg~~~~Yf~~~~~g~~~~w~-------~~~~N~~~~eVr~fLl~~~~~Wl~ey~  383 (758)
T PLN02447        330 L----DGLNGFDGTDGSYFHSGPRGYHWLWD-------SRLFNYGNWEVLRFLLSNLRWWLEEYK  383 (758)
T ss_pred             c----ccccccCCCCccccccCCCCCcCcCC-------CceecCCCHHHHHHHHHHHHHHHHHhC
Confidence            0    0111110         00000 0111       125888999999999999988887544


No 51 
>PRK14510 putative bifunctional 4-alpha-glucanotransferase/glycogen debranching enzyme; Provisional
Probab=53.62  E-value=67  Score=39.45  Aligned_cols=89  Identities=17%  Similarity=0.220  Sum_probs=55.0

Q ss_pred             CCCCCC---CHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcc------cccCC-CCCCccccCCCC-C
Q psy14880        143 SPAKVY---TPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLI------VCLND-PSWRANCIQPPC-G  211 (497)
Q Consensus       143 s~~~~Y---T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~------~~~~~-~~~~~~~~~~~~-~  211 (497)
                      .++..|   |.+|+|++|+-|.++||+||-.+ ++-|+...-...|.++...+.      ...+. ..+..+   ..+ +
T Consensus       236 a~dp~yg~~~~~efk~lV~~~H~~GI~VILDv-V~NHt~~~~~~~p~~~~~~~d~~~yy~~~~~~~~~y~~~---~G~gn  311 (1221)
T PRK14510        236 APDPRLAPGGEEEFAQAIKEAQSAGIAVILDV-VFNHTGESNHYGPTLSAYGSDNSPYYRLEPGNPKEYENW---WGCGN  311 (1221)
T ss_pred             CcChhhccCcHHHHHHHHHHHHHCCCEEEEEE-ccccccCCCCCCCcccccCCCCCCceEecCCCCCcccCC---CCCCC
Confidence            455666   79999999999999999999875 567876532222222111110      00000 001000   011 3


Q ss_pred             cCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        212 QLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       212 ~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      .+|..+|.+.+++.+.+.-.+. |.
T Consensus       312 ~~n~~~p~v~~~i~d~lr~Wv~-~g  335 (1221)
T PRK14510        312 LPNLERPFILRLPMDVLRSWAK-RG  335 (1221)
T ss_pred             ccccCCHHHHHHHHHHHHHHHH-hC
Confidence            5777899999999999998888 65


No 52 
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=53.49  E-value=55  Score=33.34  Aligned_cols=122  Identities=19%  Similarity=0.302  Sum_probs=63.7

Q ss_pred             CChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880        100 YTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAHA  177 (497)
Q Consensus       100 y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~  177 (497)
                      ++.+++.++++-.+..|+-+. ..+|+ +     |......+.|.-+ ..|  -+.+++++..+++||+|+.-++ |. .
T Consensus        21 ~~~~~v~~~~~~~~~~~iP~d~~~lD~-~-----w~~~~~~~~f~~d~~~F--Pd~~~~i~~l~~~G~~~~~~~~-P~-i   90 (308)
T cd06593          21 YDEEEVNEFADGMRERNLPCDVIHLDC-F-----WMKEFQWCDFEFDPDRF--PDPEGMLSRLKEKGFKVCLWIN-PY-I   90 (308)
T ss_pred             CCHHHHHHHHHHHHHcCCCeeEEEEec-c-----cccCCcceeeEECcccC--CCHHHHHHHHHHCCCeEEEEec-CC-C
Confidence            467788888877777887543 22321 1     1100001122111 112  1588999999999999999776 42 1


Q ss_pred             hh---hhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHh
Q psy14880        178 GN---GWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMG  233 (497)
Q Consensus       178 ~a---~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~  233 (497)
                      ..   ..+...+.++  +..-.++.++....-.+....+|.+||++.++.++.++++.+
T Consensus        91 ~~~~~~~~e~~~~g~--~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~  147 (308)
T cd06593          91 AQKSPLFKEAAEKGY--LVKKPDGSVWQWDLWQPGMGIIDFTNPDACKWYKDKLKPLLD  147 (308)
T ss_pred             CCCchhHHHHHHCCe--EEECCCCCeeeecccCCCcccccCCCHHHHHHHHHHHHHHHH
Confidence            10   0000000110  100001111100000123357999999999999999998876


No 53 
>PRK03705 glycogen debranching enzyme; Provisional
Probab=52.66  E-value=60  Score=37.13  Aligned_cols=88  Identities=19%  Similarity=0.196  Sum_probs=53.8

Q ss_pred             CCCCCCC------HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCccc--c----cCC--CCCCccccCC
Q psy14880        143 SPAKVYT------PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIV--C----LND--PSWRANCIQP  208 (497)
Q Consensus       143 s~~~~YT------~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~--~----~~~--~~~~~~~~~~  208 (497)
                      .+++.|.      .+|+|++|+-|.++||+||-.+ ++.|+...-...|...+..+..  .    .+.  ..|...    
T Consensus       228 a~d~~ygt~~~~~~~efk~LV~~~H~~GI~VIlDv-V~NHt~~~~~~~~~~~~~~~d~~~yy~~~~~g~~~~~~g~----  302 (658)
T PRK03705        228 ALDPAYASGPETALDEFRDAVKALHKAGIEVILDV-VFNHSAELDLDGPTLSLRGIDNRSYYWIREDGDYHNWTGC----  302 (658)
T ss_pred             ccccccCCCCcchHHHHHHHHHHHHHCCCEEEEEE-cccCccCcCCCCcchhcccCCCccceEECCCCCcCCCCCc----
Confidence            4555563      3699999999999999999876 4567764211112111100000  0    000  011110    


Q ss_pred             CCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        209 PCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       209 ~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                       ...||..+|++.+++.+++.-.++-|.
T Consensus       303 -g~~ln~~~p~Vr~~iid~l~~W~~e~g  329 (658)
T PRK03705        303 -GNTLNLSHPAVVDWAIDCLRYWVETCH  329 (658)
T ss_pred             -cCcccCCCHHHHHHHHHHHHHHHHHhC
Confidence             136889999999999999998887666


No 54 
>TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase. Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
Probab=51.00  E-value=45  Score=37.09  Aligned_cols=30  Identities=30%  Similarity=0.483  Sum_probs=26.4

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|+++||+-|.++||+||-.+ +|-|+..
T Consensus        73 t~~~~~~lv~~ah~~gi~vilD~-v~NH~~~  102 (543)
T TIGR02403        73 TMADFEELVSEAKKRNIKIMLDM-VFNHTST  102 (543)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-Ccccccc
Confidence            89999999999999999999876 5777753


No 55 
>PRK14459 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=50.87  E-value=45  Score=35.39  Aligned_cols=78  Identities=19%  Similarity=0.116  Sum_probs=51.5

Q ss_pred             eEEeccCCcccccccccCCCCCCCccCChhHHHHH-HHHHHHcCceeeeecCC-CCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880         74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQEL-VHYAKLRGVRVIFELDA-PAHAGNGWQWGPRFGAYSPAKVYTPL  151 (497)
Q Consensus        74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~-~~y~~~rG~ri~~e~~~-p~~~~~~~~~~~~~g~~s~~~~YT~~  151 (497)
                      +-||..|++--.--.|      .++.|..+++.+. .+|++..|.++.+++.. ++.                  -=|.+
T Consensus       242 vSLha~d~e~R~~l~p------~n~~~~l~~ll~a~~~~~~~~grrv~ieyvLi~Gv------------------NDs~e  297 (373)
T PRK14459        242 VSLHAPDDELRDELVP------VNTRWKVDEVLDAARYYADATGRRVSIEYALIRDI------------------NDQPW  297 (373)
T ss_pred             EEeCCCCHHHHHHhcC------cccCCCHHHHHHHHHHHHHHhCCEEEEEEEEeCCC------------------CCCHH
Confidence            5677777654433333      1345666666664 67777788888877752 332                  16889


Q ss_pred             HHHHHHHHHHHc-----CCEEEEecCCcc
Q psy14880        152 EVQELVHYAKLR-----GIRVIFELDAPA  175 (497)
Q Consensus       152 ei~eiv~yA~~r-----gI~VIPEID~PG  175 (497)
                      ++++|+++.+..     .|.+||--.++|
T Consensus       298 ~a~~L~~llk~~~~~~~~VNLIpyNp~~~  326 (373)
T PRK14459        298 RADLLGKKLHGRGGGWVHVNLIPLNPTPG  326 (373)
T ss_pred             HHHHHHHHHhhccCCCeEEEEEccCCCCC
Confidence            999999999987     567777554443


No 56 
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=50.30  E-value=53  Score=32.66  Aligned_cols=103  Identities=14%  Similarity=0.072  Sum_probs=66.0

Q ss_pred             ccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCC
Q psy14880         17 KAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSP   96 (497)
Q Consensus        17 ~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~   96 (497)
                      +++.-+|.=.|.+-...|      +|.+.|++.|+..-.+.+                  ..+-+ ||-+|+        
T Consensus        20 ~~g~yID~lKfg~Gt~~l------~~~~~l~eki~la~~~~V------------------~v~~G-Gtl~E~--------   66 (237)
T TIGR03849        20 VCGDYITFVKFGWGTSAL------IDRDIVKEKIEMYKDYGI------------------KVYPG-GTLFEI--------   66 (237)
T ss_pred             HhhhheeeEEecCceEee------ccHHHHHHHHHHHHHcCC------------------eEeCC-ccHHHH--------
Confidence            344455655566544433      677889999987666653                  12233 444332        


Q ss_pred             CccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCC
Q psy14880         97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDA  173 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~  173 (497)
                        .+.+..+++-+++++.-|+..+ |+.  +  |              .-..+.++..++|+.++++|..|+||+-.
T Consensus        67 --~~~q~~~~~Yl~~~k~lGf~~I-EiS--~--G--------------~~~i~~~~~~rlI~~~~~~g~~v~~EvG~  122 (237)
T TIGR03849        67 --AHSKGKFDEYLNECDELGFEAV-EIS--D--G--------------SMEISLEERCNLIERAKDNGFMVLSEVGK  122 (237)
T ss_pred             --HHHhhhHHHHHHHHHHcCCCEE-EEc--C--C--------------ccCCCHHHHHHHHHHHHhCCCeEeccccc
Confidence              2334567777788888888642 221  0  1              12389999999999999999999999643


No 57 
>cd07201 cPLA2_Grp-IVB-IVD-IVE-IVF Group IVB, IVD, IVE, and IVF cytosolic phospholipase A2; catalytic domain; Ca-dependent. Group IVB, IVD, IVE, and IVF cPLA2 consists of two domains: the regulatory C2 domain and alpha/beta hydrolase PLA2 domain. Group IVB, IVD, IVE, and IVF cPLA2 are also referred to as cPLA2-beta, -delta, -epsilon, and -zeta respectively. cPLA2-beta is approximately 30% identical to cPLA2-alpha and it shows low enzymatic activity compared to cPLA2alpha. cPLA2-beta hydrolyzes palmitic acid from 1-[14C]palmitoyl-2-arachidonoyl-PC and arachidonic acid from 1-palmitoyl-2[14C]arachidonoyl-PC, but not from 1-O-alkyl-2[3H]arachidonoyl-PC. cPLA2-delta, -epsilon, and -zeta are approximately 45-50% identical to cPLA2-beta and 31-37% identical to cPLA2-alpha. It's possible that cPLA2-beta, -delta, -epsilon, and -zeta may have arisen by gene duplication from an ancestral gene. The catalytic domain of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bon
Probab=47.39  E-value=49  Score=36.69  Aligned_cols=37  Identities=16%  Similarity=-0.016  Sum_probs=24.4

Q ss_pred             hhCCccceeccCCCCCCceeeeccCCCCCCHHHHHHhh
Q psy14880         13 HFQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYL   50 (497)
Q Consensus        13 ~~~~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~i   50 (497)
                      +.++....|.|.|....++---|..| ..|.+.|.+++
T Consensus       283 ~~r~~~~~~~~~~~~~~~~~~~~~~~-~~p~~~~~~~~  319 (541)
T cd07201         283 WTRDKVNDIEDEPPLPPRPPERLTTL-LTPGGPLSQAF  319 (541)
T ss_pred             hccccccccccCcCcCCCCCcccccc-cCCCCHHHHHH
Confidence            44777777888887766666666555 66666655543


No 58 
>COG0134 TrpC Indole-3-glycerol phosphate synthase [Amino acid transport and metabolism]
Probab=44.49  E-value=39  Score=33.99  Aligned_cols=73  Identities=16%  Similarity=0.253  Sum_probs=52.5

Q ss_pred             CCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHH
Q psy14880        147 VYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKD  226 (497)
Q Consensus       147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~  226 (497)
                      .++.+++++|+++|.++|++++-|+..--++.-+++..+.     +.                  -+|-.|=++++.-.+
T Consensus       139 ~L~~~~l~el~~~A~~LGm~~LVEVh~~eEl~rAl~~ga~-----iI------------------GINnRdL~tf~vdl~  195 (254)
T COG0134         139 ALDDEQLEELVDRAHELGMEVLVEVHNEEELERALKLGAK-----II------------------GINNRDLTTLEVDLE  195 (254)
T ss_pred             hcCHHHHHHHHHHHHHcCCeeEEEECCHHHHHHHHhCCCC-----EE------------------EEeCCCcchheecHH
Confidence            4899999999999999999999999999888887753321     11                  023334445555555


Q ss_pred             HHHHHHhccCCCCCeEEE
Q psy14880        227 IYGELMGMAKYGDEMFHM  244 (497)
Q Consensus       227 vl~E~~~lF~~~~~~iHi  244 (497)
                      ...++++..|  ...+=|
T Consensus       196 ~t~~la~~~p--~~~~~I  211 (254)
T COG0134         196 TTEKLAPLIP--KDVILI  211 (254)
T ss_pred             HHHHHHhhCC--CCcEEE
Confidence            6788889999  554444


No 59 
>COG0436 Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism]
Probab=44.48  E-value=24  Score=37.51  Aligned_cols=24  Identities=29%  Similarity=0.436  Sum_probs=22.1

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      -.||++++++|++.|+++||-||=
T Consensus       178 av~~~~~l~~i~~~a~~~~i~ii~  201 (393)
T COG0436         178 AVYSKEELKAIVELAREHDIIIIS  201 (393)
T ss_pred             cCCCHHHHHHHHHHHHHcCeEEEE
Confidence            359999999999999999999886


No 60 
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=43.92  E-value=1.5e+02  Score=30.59  Aligned_cols=22  Identities=14%  Similarity=0.166  Sum_probs=19.5

Q ss_pred             CcCCCCChhHHHHHHHHHHHHH
Q psy14880        211 GQLNPINPHVYTVLKDIYGELM  232 (497)
Q Consensus       211 ~~L~~~~p~t~~fl~~vl~E~~  232 (497)
                      ..+|.+||++.+...+.++++.
T Consensus       125 ~~~Dftnp~a~~ww~~~~~~~~  146 (317)
T cd06600         125 VYPDFTNPDTREWWAGLFSEWL  146 (317)
T ss_pred             cccCCCChHHHHHHHHHHHHHh
Confidence            3689999999999999998876


No 61 
>cd00452 KDPG_aldolase KDPG and KHG aldolase. This family belongs to the class I adolases whose reaction mechanism involves Schiff base formation between a substrate carbonyl and lysine residue in the active site. 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase,  is best known for its role in the Entner-Doudoroff pathway of bacteria, where it catalyzes the reversible cleavage of KDPG to pyruvate and glyceraldehyde-3-phosphate. 2-keto-4-hydroxyglutarate (KHG) aldolase, which has enzymatic specificity toward glyoxylate, forming KHG in the presence of pyruvate, and is capable of regulating glyoxylate levels in the glyoxylate bypass, an alternate pathway when bacteria are grown on acetate carbon sources.
Probab=42.36  E-value=37  Score=32.12  Aligned_cols=67  Identities=19%  Similarity=0.241  Sum_probs=45.1

Q ss_pred             HHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHh
Q psy14880        154 QELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMG  233 (497)
Q Consensus       154 ~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~  233 (497)
                      .+++++++.+|+.+||++.+|.++..+++...     +..                  .+.|..+...+    .++++..
T Consensus        87 ~~~~~~~~~~~~~~i~gv~t~~e~~~A~~~Ga-----d~i------------------~~~p~~~~g~~----~~~~l~~  139 (190)
T cd00452          87 PEVVKAANRAGIPLLPGVATPTEIMQALELGA-----DIV------------------KLFPAEAVGPA----YIKALKG  139 (190)
T ss_pred             HHHHHHHHHcCCcEECCcCCHHHHHHHHHCCC-----CEE------------------EEcCCcccCHH----HHHHHHh
Confidence            57999999999999999999998877654321     111                  12344443344    4455566


Q ss_pred             ccCCCCCeEEEcCCc
Q psy14880        234 MAKYGDEMFHMGADE  248 (497)
Q Consensus       234 lF~~~~~~iHiGgDE  248 (497)
                      .||. -+.+=+||=.
T Consensus       140 ~~~~-~p~~a~GGI~  153 (190)
T cd00452         140 PFPQ-VRFMPTGGVS  153 (190)
T ss_pred             hCCC-CeEEEeCCCC
Confidence            6764 5788888865


No 62 
>PLN02607 1-aminocyclopropane-1-carboxylate synthase
Probab=40.63  E-value=71  Score=34.54  Aligned_cols=23  Identities=4%  Similarity=0.273  Sum_probs=21.3

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      ..||++++++|++.|++++|.||
T Consensus       215 ~~~s~e~l~~l~~~~~~~~i~lI  237 (447)
T PLN02607        215 ATVQRSVLEDILDFVVRKNIHLV  237 (447)
T ss_pred             cccCHHHHHHHHHHHHHCCCEEE
Confidence            45999999999999999999887


No 63 
>PRK14465 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=40.28  E-value=2.7e+02  Score=29.25  Aligned_cols=113  Identities=15%  Similarity=0.119  Sum_probs=73.7

Q ss_pred             hCCccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhc-ccccccchhhHHhhhhceeEEeccCCcccccccccCC
Q psy14880         14 FQNKAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTC-GTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFG   92 (497)
Q Consensus        14 ~~~~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~-~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g   92 (497)
                      .+...+++.+.|.-+.|.-+.-++++| |++.+.+.++..+. .+              ..+-+..       +.+|.+.
T Consensus       210 ~~~~LaiSLhA~~~e~R~~l~Pi~~~~-~le~ll~al~~~~~~~~--------------r~v~iey-------vLI~GvN  267 (342)
T PRK14465        210 EPYNFAISLNHPDPNGRLQIMDIEEKF-PLEELLQAAKDFTRELK--------------RRITFEY-------VMIPGVN  267 (342)
T ss_pred             cCceEEEEecCCChhhcceEeeccccC-CHHHHHHHHHHHHHHcC--------------CEEEEEE-------EEECCcc
Confidence            346778888999999999999999888 88888887775543 22              1111110       0122221


Q ss_pred             CCCCCccCChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880         93 AYSPAKVYTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus        93 ~~~~~~~y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPE  170 (497)
                             =+.++++++.++.+..+.++. .++..+   +            ++-..-|++++.++.+.-+++||.|..-
T Consensus       268 -------Ds~eda~~L~~ll~~l~~kVnLIPyN~~---~------------~~~~~ps~e~i~~F~~~L~~~Gi~v~~R  324 (342)
T PRK14465        268 -------MGRENANKLVKIARSLDCKINVIPLNTE---F------------FGWRRPTDDEVAEFIMLLEPAGVPILNR  324 (342)
T ss_pred             -------CCHHHHHHHHHHHhhCCCcEEEEccCCC---C------------CCCCCCCHHHHHHHHHHHHHCCCeEEEe
Confidence                   235688888888876566553 233211   0            1123378999999999999999998763


No 64 
>PF01212 Beta_elim_lyase:  Beta-eliminating lyase;  InterPro: IPR001597 This domain is found in many tryptophanases (tryptophan indole-lyase, TNase), tyrosine phenol-lyases (TPL) and threonine aldolases. It is involved in the degradation of amino acids. The glycine cleavage system is composed of four proteins: P, T, L and H. In Bacillus subtilis, the P 'protein' is an heterodimer of two subunits. The glycine cleavage system catalyses the degradation of glycine. The P protein binds the alpha-amino group of glycine through its pyridoxal phosphate cofactor; CO(2) is released and the remaining methylamine moiety is then transferred to the lipoamide cofactor of the H protein; GO: 0016829 lyase activity, 0006520 cellular amino acid metabolic process; PDB: 3PJ0_C 2C44_C 2V0Y_A 2OQX_A 2V1P_A 1AX4_B 3LWS_A 1C7G_A 1V72_A 2YHK_B ....
Probab=38.86  E-value=27  Score=35.56  Aligned_cols=24  Identities=21%  Similarity=0.394  Sum_probs=17.8

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      +..||.+|+++|.++|+++||.|.
T Consensus       139 G~~~s~~el~ai~~~a~~~gl~lh  162 (290)
T PF01212_consen  139 GTVYSLEELRAISELAREHGLPLH  162 (290)
T ss_dssp             SB---HHHHHHHHHHHHHHT-EEE
T ss_pred             CeeCCHHHHHHHHHHHHhCceEEE
Confidence            456999999999999999998774


No 65 
>PF07555 NAGidase:  beta-N-acetylglucosaminidase ;  InterPro: IPR011496 This family consists of both eukaryotic and prokaryotic hyaluronidases. Human Q9HAR0 from SWISSPROT is expressed during meningioma []. Clostridium perfringens, P26831 from SWISSPROT, is involved in pathogenesis and is likely to act on connectivity tissue during gas gangrene []. It catalyses the random hydrolysis of 1->4-linkages between N-acetyl-beta-D-glucosamine and D-glucuronate residues in hyaluronate.; PDB: 2WB5_B 2V5C_B 2VUR_A 2V5D_A 2YDS_A 2CBI_A 2XPK_A 2CBJ_B 2J62_A 2X0Y_A ....
Probab=38.52  E-value=85  Score=32.42  Aligned_cols=22  Identities=32%  Similarity=0.603  Sum_probs=15.9

Q ss_pred             CCHH---HHHHHHHHHHHcCCEEEE
Q psy14880        148 YTPL---EVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       148 YT~~---ei~eiv~yA~~rgI~VIP  169 (497)
                      |+.+   ++++|++.|++.||+.|=
T Consensus        50 Yp~~el~~l~~L~~~a~~~~V~Fv~   74 (306)
T PF07555_consen   50 YPEEELAELKELADAAKANGVDFVY   74 (306)
T ss_dssp             --HHHHHHHHHHHHHHHHTT-EEEE
T ss_pred             CCHHHHHHHHHHHHHHHHcCCEEEE
Confidence            6655   678889999999999885


No 66 
>COG1168 MalY Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism]
Probab=38.32  E-value=35  Score=36.14  Aligned_cols=25  Identities=28%  Similarity=0.494  Sum_probs=22.9

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      +.+.|++|++.|.+.|+++||+||-
T Consensus       172 Grvwt~eeL~~i~elc~kh~v~VIS  196 (388)
T COG1168         172 GRVWTKEELRKIAELCLRHGVRVIS  196 (388)
T ss_pred             CccccHHHHHHHHHHHHHcCCEEEe
Confidence            3569999999999999999999996


No 67 
>KOG0258|consensus
Probab=37.81  E-value=69  Score=34.25  Aligned_cols=54  Identities=15%  Similarity=0.313  Sum_probs=37.6

Q ss_pred             hhHHHHHHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880        102 PLEVQELVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       102 ~~~i~~~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      .+++++.+.-|+ .|+++..-+. .|+.+              -+..+|.+-|++|+.+|++.|+-++..
T Consensus       200 ~~el~~~~~eA~-k~i~~r~lvvINPGNP--------------TGqvls~e~ie~i~~fa~~~~l~llaD  254 (475)
T KOG0258|consen  200 VAELERSVDEAR-KGINPRALVVINPGNP--------------TGQVLSEENIEGIICFAAEEGLVLLAD  254 (475)
T ss_pred             HHHHHHHHHHHh-ccCCceEEEEECCCCc--------------cchhhcHHHHHHHHHHHHHcCeEEech
Confidence            457777877777 7775321111 13321              135699999999999999999999864


No 68 
>PRK13957 indole-3-glycerol-phosphate synthase; Provisional
Probab=37.78  E-value=46  Score=33.33  Aligned_cols=38  Identities=21%  Similarity=0.385  Sum_probs=33.6

Q ss_pred             CCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc
Q psy14880        147 VYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG  184 (497)
Q Consensus       147 ~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~  184 (497)
                      .++++++++++++|..+|++++-|+-+..-...++...
T Consensus       134 ~L~~~~l~~l~~~a~~lGle~LVEVh~~~El~~a~~~g  171 (247)
T PRK13957        134 ILTPSQIKSFLKHASSLGMDVLVEVHTEDEAKLALDCG  171 (247)
T ss_pred             hCCHHHHHHHHHHHHHcCCceEEEECCHHHHHHHHhCC
Confidence            48999999999999999999999999988887776543


No 69 
>TIGR02455 TreS_stutzeri trehalose synthase, Pseudomonas stutzeri type. Trehalose synthase catalyzes a one-step conversion of maltose to trehalose. This is an alternative to the OtsAB and TreYZ pathways. This family includes a characterized example from Pseudomonas stutzeri plus very closely related sequences from other Pseudomonads. Cutoff scores are set to find a more distantly related sequence from Desulfovibrio vulgaris, likely to be functionally equivalent, between trusted and noise limits.
Probab=36.83  E-value=38  Score=38.52  Aligned_cols=31  Identities=26%  Similarity=0.320  Sum_probs=27.1

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      =|.+|+++|++-|.+|||.||-.+ +|.|++.
T Consensus       127 GT~eDf~~L~~~Ah~~G~~vi~Dl-VpnHTs~  157 (688)
T TIGR02455       127 GSEEELIQLSRMAAAHNAITIDDI-IPAHTGK  157 (688)
T ss_pred             CCHHHHHHHHHHHHHCCCEEEEEe-CCCCCCC
Confidence            499999999999999999999775 6788764


No 70 
>PF01565 FAD_binding_4:  FAD binding domain  This is only a subset of the Pfam family;  InterPro: IPR006094  Various enzymes use FAD as a co-factor, most of these enzymes are oxygen-dependent oxidoreductases, containing a covalently bound FAD group which is attached to a histidine via an 8-alpha-(N3-histidyl)-riboflavin linkage. One of the enzymes Vanillyl-alcohol oxidase (VAO, 1.1.3.38 from EC) has a solved structure, the alignment includes the FAD binding site, called the PP-loop, between residues 99-110 []. The FAD molecule is covalently bound in the known structure, however the residue that links to the FAD is not in the alignment. VAO catalyses the oxidation of a wide variety of substrates, ranging from aromatic amines to 4-alkylphenols.  ; GO: 0008762 UDP-N-acetylmuramate dehydrogenase activity, 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZR6_A 3HSU_A 2AXR_A 3D2J_A 3D2H_A 3FW9_A 3FW8_A 3FW7_A 3GSY_A 3FWA_A ....
Probab=36.78  E-value=40  Score=29.63  Aligned_cols=28  Identities=29%  Similarity=0.382  Sum_probs=23.3

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAG  178 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~  178 (497)
                      |.+|+++++++|+++++.|.|-  .-||..
T Consensus         9 s~~ev~~~v~~a~~~~~~v~~~--g~G~~~   36 (139)
T PF01565_consen    9 SVEEVQAIVKFANENGVPVRVR--GGGHSW   36 (139)
T ss_dssp             SHHHHHHHHHHHHHTTSEEEEE--SSSTTS
T ss_pred             CHHHHHHHHHHHHHcCCcEEEE--cCCCCc
Confidence            7899999999999999999984  345543


No 71 
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY.  CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain.  All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=36.54  E-value=4e+02  Score=27.27  Aligned_cols=24  Identities=21%  Similarity=0.416  Sum_probs=20.2

Q ss_pred             CCcCCCCChhHHHHHHHHHHHHHh
Q psy14880        210 CGQLNPINPHVYTVLKDIYGELMG  233 (497)
Q Consensus       210 ~~~L~~~~p~t~~fl~~vl~E~~~  233 (497)
                      ...+|.+||++.+...+.++++.+
T Consensus       129 ~~~~Dftnp~a~~w~~~~~~~~~~  152 (317)
T cd06598         129 TGLIDWFDPAAQAWFHDNYKKLID  152 (317)
T ss_pred             ccccCCCCHHHHHHHHHHHHHhhh
Confidence            357899999999999999988743


No 72 
>PRK09357 pyrC dihydroorotase; Validated
Probab=35.03  E-value=1.2e+02  Score=32.12  Aligned_cols=93  Identities=20%  Similarity=0.166  Sum_probs=54.7

Q ss_pred             CCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCccccccc-------ccCCCCCCCccCChhHHHHHHHHHH
Q psy14880         41 TDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLL-------PRFGAYSPAKVYTPLEVQELVHYAK  113 (497)
Q Consensus        41 ~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~-------~~~g~~~~~~~y~~~~i~~~~~y~~  113 (497)
                      .+.+.|++.++.+..+++              .+|.|..+.+--.-+.       .+++.......=....+.+.+..++
T Consensus       156 ~~~~~l~~~~~~a~~~g~--------------~v~iH~ee~~~~~~~~~~~g~~~~~~~~~~~p~~~E~~~i~~~~~la~  221 (423)
T PRK09357        156 QDARLMRRALEYAKALDL--------------LIAQHCEDPSLTEGGVMNEGEVSARLGLPGIPAVAEEVMIARDVLLAE  221 (423)
T ss_pred             CCHHHHHHHHHHHHhcCC--------------EEEEeCCCHHHhhcccccCChhhHHhCCCCCCHHHHHHHHHHHHHHHH
Confidence            356788888877777653              5789998854322111       1111110000001224666677777


Q ss_pred             HcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEec
Q psy14880        114 LRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFEL  171 (497)
Q Consensus       114 ~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEI  171 (497)
                      ..|.|+...                        ..|-.+-.++++.|+++|+.|..|+
T Consensus       222 ~~g~~~hi~------------------------H~s~~~~~~~i~~a~~~g~~v~~e~  255 (423)
T PRK09357        222 ATGARVHIC------------------------HVSTAGSVELIRWAKALGIKVTAEV  255 (423)
T ss_pred             HHCCcEEEE------------------------eCCCHHHHHHHHHHHHcCCCEEEEe
Confidence            777664321                        1455567788889999999988875


No 73 
>KOG0259|consensus
Probab=35.00  E-value=37  Score=36.13  Aligned_cols=34  Identities=26%  Similarity=0.337  Sum_probs=30.2

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      +.+||++-+++|.+.|+++||.||-. +.-||+-.
T Consensus       213 GnVys~~HL~kiae~A~klgi~vIaD-EVY~~~vf  246 (447)
T KOG0259|consen  213 GNVYSEDHLKKIAETAKKLGIMVIAD-EVYGHTVF  246 (447)
T ss_pred             cccccHHHHHHHHHHHHHhCCeEEeh-hhcceeec
Confidence            56799999999999999999999976 78888854


No 74 
>PRK14511 maltooligosyl trehalose synthase; Provisional
Probab=34.86  E-value=1.6e+02  Score=34.80  Aligned_cols=70  Identities=23%  Similarity=0.299  Sum_probs=45.7

Q ss_pred             CChhHHHHHHHHHHHcCceee---eecCCCCCCCCCCCCCCCCCCCCCCCCC-------CHHHHHHHHHHHHHcCCEEEE
Q psy14880        100 YTPLEVQELVHYAKLRGVRVI---FELDAPAHAGNGWQWGPRFGAYSPAKVY-------TPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       100 y~~~~i~~~~~y~~~rG~ri~---~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-------T~~ei~eiv~yA~~rgI~VIP  169 (497)
                      |+-.++.+.+.|-+.-|+..+   |-+..+  .++.       -+|....++       |.++++++++-|+++||.||-
T Consensus        17 ~tf~~~~~~l~YL~~LGis~IyLsPi~~a~--~gs~-------hGYdv~D~~~idp~lGt~e~f~~Lv~aah~~Gi~VIl   87 (879)
T PRK14511         17 FTFDDAAELVPYFADLGVSHLYLSPILAAR--PGST-------HGYDVVDHTRINPELGGEEGLRRLAAALRAHGMGLIL   87 (879)
T ss_pred             CCHHHHHHHhHHHHHcCCCEEEECcCccCC--CCCC-------CCCCcCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEE
Confidence            344678888888888888532   211111  1111       123222223       899999999999999999998


Q ss_pred             ecCCcccchh
Q psy14880        170 ELDAPAHAGN  179 (497)
Q Consensus       170 EID~PGH~~a  179 (497)
                      .| +|-|+..
T Consensus        88 Di-V~NH~~~   96 (879)
T PRK14511         88 DI-VPNHMAV   96 (879)
T ss_pred             Ee-ccccccC
Confidence            86 6778765


No 75 
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=34.56  E-value=1.7e+02  Score=33.70  Aligned_cols=36  Identities=22%  Similarity=0.335  Sum_probs=29.9

Q ss_pred             CcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880        211 GQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADE  248 (497)
Q Consensus       211 ~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE  248 (497)
                      ..|+|.+|++.++|++|+.|++.-.+  -.-||+=-|=
T Consensus       433 ~rl~P~~pe~r~~i~~i~~dla~~~~--~dGilf~Dd~  468 (671)
T PRK14582        433 RRLSPFDDRVRAQVGMLYEDLAGHAA--FDGILFHDDA  468 (671)
T ss_pred             cCCCCCCHHHHHHHHHHHHHHHHhCC--CceEEecccc
Confidence            35999999999999999999999766  5667766555


No 76 
>TIGR02336 1,3-beta-galactosyl-N-acetylhexosamine phosphorylase. Members of this family are found in phylogenetically diverse bacteria, including Clostridium perfringens (in the Firmicutes), Bifidobacterium longum and Propionibacterium acnes (in the Actinobacteria), and Vibrio vulnificus (in the Proteobacteria), most of which occur as mammalian pathogens or commensals. The nominal activity, 1,3-beta-galactosyl-N-acetylhexosamine phosphorylase (EC 2.4.1.211), varies somewhat from instance to instance in relative rates for closely related substrates.
Probab=34.55  E-value=4.2e+02  Score=30.49  Aligned_cols=103  Identities=16%  Similarity=0.172  Sum_probs=64.9

Q ss_pred             CCCcCCCCChhHHHHHHHHHHHHHhccCCC-----CCeEE---EcCCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCC
Q psy14880        209 PCGQLNPINPHVYTVLKDIYGELMGMAKYG-----DEMFH---MGADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSP  280 (497)
Q Consensus       209 ~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~-----~~~iH---iGgDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~  280 (497)
                      ..-.+|+..|+|.+++.+-+++.++-.|+.     ..+||   +=-||                             ...
T Consensus       176 h~ip~Dvr~p~t~~~~~~~l~~wl~~~p~~dVvRfTTfFy~Ftl~f~~-----------------------------~~k  226 (719)
T TIGR02336       176 HEIPFDIYHPATRKHVFDTFEQWLKDSPQTDVVRFTTFFYQFTLLFDE-----------------------------KRR  226 (719)
T ss_pred             ccCCcCCcChHHHHHHHHHHHHHHHhCCCCcEEEEeeeeeeEeEEecc-----------------------------ccc
Confidence            334799999999999999999999988863     12222   22233                             111


Q ss_pred             CcceeecccccCC--HHHH-HHHHHcCCCCC--------------------cccHHHHHHHHHHHHHHHHHHHhhCCCce
Q psy14880        281 HLWTVFIKCWNNT--PAIV-DFMLAHGKNTT--------------------FDDYIQLWAHFQNKAAASLDEAVGHNRTK  337 (497)
Q Consensus       281 ~~~~v~~~~w~~~--p~~~-~~~~~~g~~~~--------------------~~~~~~l~~~F~~~~~~~l~~~~~~~g~~  337 (497)
                      +.+ |+.-.+..|  |... ++-++.|..-+                    .-||.+.+..|+.+.++.+-+.+.+.||.
T Consensus       227 ek~-vDWfgY~~sVSp~ale~Fe~e~GY~l~pEd~vd~GyyN~~~rvPsk~~~D~~~F~~~fV~~~ak~lvd~~H~~Gke  305 (719)
T TIGR02336       227 EKV-VDWFGYACTVSPRALEDFEAKYGYKLRPEDFVDAGYYNSTWRVPRKQYRDWIDFLSGFVRENAKELVDMSHAAGKE  305 (719)
T ss_pred             cce-eeccCcccccCHHHHHHHHHHhCCCCCHHHhhccccccCcccCCcHHHhHHHHHHHHHHHHHHHHHHHHHHhcCce
Confidence            222 333334333  5444 44445466321                    13566777778888887777788889998


Q ss_pred             EEEe
Q psy14880        338 LIVW  341 (497)
Q Consensus       338 ~~~W  341 (497)
                      .|+.
T Consensus       306 AmmF  309 (719)
T TIGR02336       306 AMMF  309 (719)
T ss_pred             EEEe
Confidence            8876


No 77 
>PF02065 Melibiase:  Melibiase;  InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=34.32  E-value=2.1e+02  Score=30.60  Aligned_cols=128  Identities=20%  Similarity=0.272  Sum_probs=66.0

Q ss_pred             cCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCC-----CCCCCCCC-CCCCHHHHHHHHHHHHHcCCE----EE
Q psy14880         99 VYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGP-----RFGAYSPA-KVYTPLEVQELVHYAKLRGIR----VI  168 (497)
Q Consensus        99 ~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~-----~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~----VI  168 (497)
                      .++.+.++++++.++.-|+.. +-+|      .+|+...     ..|...++ ..+ ..=++.|+++.+++|+.    +-
T Consensus        54 d~~e~~i~~~a~~~~~~G~e~-fviD------DGW~~~r~~d~~~~GdW~~~~~kF-P~Gl~~l~~~i~~~Gmk~GlW~e  125 (394)
T PF02065_consen   54 DITEEKILELADAAAELGYEY-FVID------DGWFGGRDDDNAGLGDWEPDPKKF-PNGLKPLADYIHSLGMKFGLWFE  125 (394)
T ss_dssp             G--HHHHHHHHHHHHHHT-SE-EEE-------SSSBCTESTTTSTTSBECBBTTTS-TTHHHHHHHHHHHTT-EEEEEEE
T ss_pred             CCCHHHHHHHHHHHHHhCCEE-EEEc------CccccccCCCcccCCceeEChhhh-CCcHHHHHHHHHHCCCeEEEEec
Confidence            356778888888888888764 2232      2332110     11222111 111 12499999999999987    55


Q ss_pred             EecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcC
Q psy14880        169 FELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGA  246 (497)
Q Consensus       169 PEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGg  246 (497)
                      ||+=.|+.  .+.+.+|+.     ....++.+-..   ....-.||+++|++.+++.+.+..+..-..  -.||=+-.
T Consensus       126 Pe~v~~~S--~l~~~hPdw-----~l~~~~~~~~~---~r~~~vLD~~~pev~~~l~~~i~~ll~~~g--idYiK~D~  191 (394)
T PF02065_consen  126 PEMVSPDS--DLYREHPDW-----VLRDPGRPPTL---GRNQYVLDLSNPEVRDYLFEVIDRLLREWG--IDYIKWDF  191 (394)
T ss_dssp             TTEEESSS--CHCCSSBGG-----BTCCTTSE-EC---BTTBEEB-TTSHHHHHHHHHHHHHHHHHTT---SEEEEE-
T ss_pred             cccccchh--HHHHhCccc-----eeecCCCCCcC---cccceEEcCCCHHHHHHHHHHHHHHHHhcC--CCEEEecc
Confidence            66544443  233334432     11111100000   011136999999999999999998876555  56665543


No 78 
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=33.10  E-value=1.6e+02  Score=30.52  Aligned_cols=23  Identities=13%  Similarity=0.046  Sum_probs=20.5

Q ss_pred             CcCCCCChhHHHHHHHHHHHHHh
Q psy14880        211 GQLNPINPHVYTVLKDIYGELMG  233 (497)
Q Consensus       211 ~~L~~~~p~t~~fl~~vl~E~~~  233 (497)
                      ..+|.+||++.+...+.++++..
T Consensus       125 ~~~Dftnp~a~~ww~~~~~~~~~  147 (339)
T cd06603         125 SWPDFLNPEVRDWWASLFSYDKY  147 (339)
T ss_pred             CCccCCChhHHHHHHHHHHHHhh
Confidence            57999999999999999998865


No 79 
>KOG0256|consensus
Probab=32.89  E-value=1.8e+02  Score=31.43  Aligned_cols=117  Identities=12%  Similarity=0.162  Sum_probs=65.8

Q ss_pred             CCCCHHHHH-Hhhhhhhcccccccc-----hhhHH--hhhhceeEEeccC-CcccccccccCCCCCC-------------
Q psy14880         39 HLTDPEIIL-NYLDPQTCGTQQVVR-----EPLLL--VTLMALCRWQMCT-RQQVPCLLPRFGAYSP-------------   96 (497)
Q Consensus        39 ~~~~~~~lk-~~id~ma~~K~~~~~-----~~~~~--~~~~n~lhlHl~D-~q~~~~e~~~~g~~~~-------------   96 (497)
                      .|..+..++ .+++.|..++-|-.+     +.+.=  ....+.|-+=|.| +..|-+-.|-||.++.             
T Consensus       119 dy~Gl~~frqa~A~Fm~~~r~~~v~fdP~~~Vv~~G~T~ane~l~fcLadpgdafLvPtPyY~gfdrdl~~rTgveivpv  198 (471)
T KOG0256|consen  119 DYHGLPSFRQAVAEFMERARGNRVKFDPERVVVTNGATSANETLMFCLADPGDAFLVPTPYYPGFDRDLRWRTGVEIVPV  198 (471)
T ss_pred             cccCchHHHHHHHHHHHHHhCCCCccCccceEEecccchhhHHHHHHhcCCCceeeecCCCCCcccccceeccCceEEEE
Confidence            344444444 467777776643221     11110  0112445555666 6677766777776531             


Q ss_pred             ----CccC--ChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880         97 ----AKVY--TPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus        97 ----~~~y--~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                          ...|  |.+.+++...-|...|.+|- .-+..|..         +-     +-.||++++..|+.+|..++|.||-
T Consensus       199 ~c~Ss~~f~itv~alE~A~~~A~~~~~kVkGvlitNPsN---------PL-----G~~~~~e~L~~ll~Fa~~kniHvI~  264 (471)
T KOG0256|consen  199 HCSSSNGFQITVEALEAALNQARKLGLKVKGVLITNPSN---------PL-----GTTLSPEELISLLNFASRKNIHVIS  264 (471)
T ss_pred             EeecCCCccccHHHHHHHHHHHHHhCCceeEEEEeCCCC---------CC-----CCccCHHHHHHHHHHHhhcceEEEe
Confidence                1112  34556666667777776642 11111211         11     2349999999999999999999875


No 80 
>PF05913 DUF871:  Bacterial protein of unknown function (DUF871);  InterPro: IPR008589 This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown though a number are annotated as outer surface proteins.; PDB: 2P0O_A 1X7F_A.
Probab=32.57  E-value=80  Score=33.33  Aligned_cols=55  Identities=25%  Similarity=0.294  Sum_probs=35.2

Q ss_pred             hhHHHHHHHHHHHcCceeee-ecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecC
Q psy14880        102 PLEVQELVHYAKLRGVRVIF-ELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELD  172 (497)
Q Consensus       102 ~~~i~~~~~y~~~rG~ri~~-e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID  172 (497)
                      .++.++-++-++..|++.++ -+.+|...                .--..+++++|+++|+++|++||..|.
T Consensus        13 ~~~~~~yi~~a~~~Gf~~iFTSL~ipe~~----------------~~~~~~~~~~l~~~a~~~~~~v~~Dis   68 (357)
T PF05913_consen   13 FEENKAYIEKAAKYGFKRIFTSLHIPEDD----------------PEDYLERLKELLKLAKELGMEVIADIS   68 (357)
T ss_dssp             HHHHHHHHHHHHCTTEEEEEEEE-------------------------HHHHHHHHHHHHHHCT-EEEEEE-
T ss_pred             HHHHHHHHHHHHHCCCCEEECCCCcCCCC----------------HHHHHHHHHHHHHHHHHCCCEEEEECC
Confidence            45667777778888986543 33344321                114578999999999999999999875


No 81 
>PRK14465 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=32.52  E-value=1.7e+02  Score=30.71  Aligned_cols=74  Identities=15%  Similarity=0.103  Sum_probs=46.3

Q ss_pred             eeEEeccCCcccccccccCCCCCCCccCChhHHHHHH-HHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCH
Q psy14880         73 LCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV-HYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTP  150 (497)
Q Consensus        73 ~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~-~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~  150 (497)
                      .+.||-.|++.-.--.|.      ++.|.-+++.+.+ +|.+..|-++.+++. +|+..                  =|.
T Consensus       215 aiSLhA~~~e~R~~l~Pi------~~~~~le~ll~al~~~~~~~~r~v~ieyvLI~GvN------------------Ds~  270 (342)
T PRK14465        215 AISLNHPDPNGRLQIMDI------EEKFPLEELLQAAKDFTRELKRRITFEYVMIPGVN------------------MGR  270 (342)
T ss_pred             EEEecCCChhhcceEeec------cccCCHHHHHHHHHHHHHHcCCEEEEEEEEECCcc------------------CCH
Confidence            367888887654333331      2345445555544 577777777776665 24321                  678


Q ss_pred             HHHHHHHHHHHHc--CCEEEEe
Q psy14880        151 LEVQELVHYAKLR--GIRVIFE  170 (497)
Q Consensus       151 ~ei~eiv~yA~~r--gI~VIPE  170 (497)
                      +++++|.++++.+  .|++||-
T Consensus       271 eda~~L~~ll~~l~~kVnLIPy  292 (342)
T PRK14465        271 ENANKLVKIARSLDCKINVIPL  292 (342)
T ss_pred             HHHHHHHHHHhhCCCcEEEEcc
Confidence            9999999999985  4566663


No 82 
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=32.44  E-value=66  Score=24.67  Aligned_cols=46  Identities=33%  Similarity=0.385  Sum_probs=29.8

Q ss_pred             HHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecC
Q psy14880        105 VQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELD  172 (497)
Q Consensus       105 i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID  172 (497)
                      ++++++.++.+|+....=   -.|.                   |-....++.+++++.||.|||-++
T Consensus        17 ~~~~~~~a~~~g~~~v~i---TDh~-------------------~~~~~~~~~~~~~~~gi~~i~G~E   62 (67)
T smart00481       17 PEELVKRAKELGLKAIAI---TDHG-------------------NLFGAVEFYKAAKKAGIKPIIGLE   62 (67)
T ss_pred             HHHHHHHHHHcCCCEEEE---eeCC-------------------cccCHHHHHHHHHHcCCeEEEEEE
Confidence            567888899998865311   1111                   122245677778899999999554


No 83 
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain.  Both of
Probab=32.31  E-value=3.3e+02  Score=28.33  Aligned_cols=127  Identities=17%  Similarity=0.121  Sum_probs=64.0

Q ss_pred             ChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880        101 TPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG  178 (497)
Q Consensus       101 ~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~  178 (497)
                      +.+++.++++--+.+++-+. ..+|. .+       ....+.|.-+ ..|-.-..+++|+.-+++||+|++-|+ |+-..
T Consensus        22 ~~~~v~~~~~~~r~~~iP~d~i~lD~-~~-------~~~~~~f~~d~~~FPdp~~~~mi~~L~~~G~k~~~~i~-P~v~~   92 (339)
T cd06602          22 NVDEVKEVVENMRAAGIPLDVQWNDI-DY-------MDRRRDFTLDPVRFPGLKMPEFVDELHANGQHYVPILD-PAISA   92 (339)
T ss_pred             CHHHHHHHHHHHHHhCCCcceEEECc-cc-------ccCccceecccccCCCccHHHHHHHHHHCCCEEEEEEe-Ccccc
Confidence            56788888866666776543 22221 11       1122333211 123322339999999999999999875 44322


Q ss_pred             hh-hhhcCCC----CCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccC
Q psy14880        179 NG-WQWGPRY----GLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAK  236 (497)
Q Consensus       179 a~-l~~~p~~----~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~  236 (497)
                      .. -..++..    ..+-+..-.++.++....-......+|.+||++.+..++.++++.+-..
T Consensus        93 ~~~~~~~~~~~e~~~~g~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~~~G  155 (339)
T cd06602          93 NEPTGSYPPYDRGLEMDVFIKNDDGSPYIGKVWPGYTVFPDFLNPNTQEWWTDEIKDFHDQVP  155 (339)
T ss_pred             CcCCCCCHHHHHHHHCCeEEECCCCCEEEEEeCCCCCcCcCCCCHHHHHHHHHHHHHHHhcCC
Confidence            10 0000000    0000000001111100000112246899999999999999988776444


No 84 
>COG1809 (2R)-phospho-3-sulfolactate synthase (PSL synthase, CoM    biosynthesis) [Coenzyme transport and metabolism]
Probab=32.26  E-value=2.6e+02  Score=27.80  Aligned_cols=101  Identities=18%  Similarity=0.196  Sum_probs=62.2

Q ss_pred             ccceeccCCCCCCceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCC
Q psy14880         17 KAAASLDEAVGHNRTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSP   96 (497)
Q Consensus        17 ~~~~~~D~p~~~~RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~   96 (497)
                      ++++....||--=-.+|||-   =+++..++++++..+.|=              ...               +|| |++
T Consensus         7 ~~~~r~~kPr~~G~T~vldk---g~~p~f~~D~~~vagdyV--------------Dfv---------------Kfg-wGT   53 (258)
T COG1809           7 LPPARPEKPRTFGMTVVLDK---GLGPRFVEDVLKVAGDYV--------------DFV---------------KFG-WGT   53 (258)
T ss_pred             cCCCCCCCCccCCeEEEEeC---CCChHHHHHHHHhhhhhe--------------eee---------------eec-ccc
Confidence            33344555665555677874   489999999999988872              111               123 234


Q ss_pred             CccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880         97 AKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      +..-+.+.+++-+++.+.+++.+.     |+.+             -..-+|+++-+.+...+|++.|-++|
T Consensus        54 ~~Li~kd~V~ekid~y~e~~i~v~-----pGGt-------------lfe~a~~~~kvdeyl~e~~~lGfe~i  107 (258)
T COG1809          54 SSLIDKDQVKEKIDMYKENDIYVF-----PGGT-------------LFEIAYSQDKVDEYLNEAKELGFEAI  107 (258)
T ss_pred             cccccHHHHHHHHHHHHHcCceec-----CCce-------------EEEeehhcccHHHHHHHHHHcCccEE
Confidence            444455667777777777665432     2211             11234777778888888888887766


No 85 
>PRK10933 trehalose-6-phosphate hydrolase; Provisional
Probab=31.78  E-value=1.3e+02  Score=33.74  Aligned_cols=30  Identities=40%  Similarity=0.556  Sum_probs=26.6

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|+++||+-|.++||.||-.+ ++-|+..
T Consensus        79 t~~d~~~lv~~~h~~gi~vilD~-V~NH~s~  108 (551)
T PRK10933         79 TLDDFDELVAQAKSRGIRIILDM-VFNHTST  108 (551)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-CCCCccC
Confidence            89999999999999999999886 5677765


No 86 
>PF14542 Acetyltransf_CG:  GCN5-related N-acetyl-transferase; PDB: 2H5M_A 2Q44_A 1XMT_A 2Q4Y_A 2IL4_A 2EVN_A 1R57_A.
Probab=31.23  E-value=53  Score=26.55  Aligned_cols=23  Identities=22%  Similarity=0.372  Sum_probs=20.6

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEe
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      +-+.=++++++||++.|..|+|.
T Consensus        39 ia~~L~~~~l~~a~~~~~kv~p~   61 (78)
T PF14542_consen   39 IAKKLVEAALDYARENGLKVVPT   61 (78)
T ss_dssp             HHHHHHHHHHHHHHHTT-EEEET
T ss_pred             HHHHHHHHHHHHHHHCCCEEEEE
Confidence            78888999999999999999995


No 87 
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.  The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=31.08  E-value=4e+02  Score=27.34  Aligned_cols=20  Identities=10%  Similarity=-0.021  Sum_probs=17.1

Q ss_pred             CCCHHHHHHhhhhhhccccc
Q psy14880         40 LTDPEIILNYLDPQTCGTQQ   59 (497)
Q Consensus        40 ~~~~~~lk~~id~ma~~K~~   59 (497)
                      |.+.++|+++++.+...++.
T Consensus        20 y~~~~ev~~~~~~~~~~~iP   39 (319)
T cd06591          20 YKTQEELLDVAKEYRKRGIP   39 (319)
T ss_pred             CCCHHHHHHHHHHHHHhCCC
Confidence            46899999999999998864


No 88 
>TIGR02401 trehalose_TreY malto-oligosyltrehalose synthase. This enzyme, formally named (1-4)-alpha-D-glucan 1-alpha-D-glucosylmutase, is the TreY enzyme of the TreYZ pathway of trehalose biosynthesis, an alternative to the OtsAB pathway. Trehalose may be incorporated into more complex compounds but is best known as compatible solute. It is one of the most effective osmoprotectants, and unlike the various betaines does not require nitrogen for its synthesis.
Probab=31.05  E-value=2.1e+02  Score=33.77  Aligned_cols=71  Identities=24%  Similarity=0.321  Sum_probs=46.7

Q ss_pred             CChhHHHHHHHHHHHcCceee---eecCCCCCCCCCCCCCCCCCCCCCCCCC-------CHHHHHHHHHHHHHcCCEEEE
Q psy14880        100 YTPLEVQELVHYAKLRGVRVI---FELDAPAHAGNGWQWGPRFGAYSPAKVY-------TPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       100 y~~~~i~~~~~y~~~rG~ri~---~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-------T~~ei~eiv~yA~~rgI~VIP  169 (497)
                      ++-.++.+.+.|-+.-|+.-.   |-+.++  .++.       -+|....++       |.++++++++-|+++||.||-
T Consensus        13 ~tf~~~~~~L~YL~~LGv~~V~lsPi~~a~--~gs~-------hGYdv~D~~~idp~lGt~edf~~Lv~aah~~Gm~vIl   83 (825)
T TIGR02401        13 FTFDDAAALLPYLKSLGVSHLYLSPILTAV--PGST-------HGYDVVDHSEINPELGGEEGLRRLSEAARARGLGLIV   83 (825)
T ss_pred             CCHHHHHHhhHHHHHcCCCEEEeCcCccCC--CCCC-------CCCCCCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEE
Confidence            344677888888888887432   222111  1111       123333333       899999999999999999998


Q ss_pred             ecCCcccchhh
Q psy14880        170 ELDAPAHAGNG  180 (497)
Q Consensus       170 EID~PGH~~a~  180 (497)
                      .| +|-|+...
T Consensus        84 Di-VpNH~a~~   93 (825)
T TIGR02401        84 DI-VPNHMAVH   93 (825)
T ss_pred             Ee-cccccccc
Confidence            76 58888753


No 89 
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=29.94  E-value=1.9e+02  Score=29.92  Aligned_cols=121  Identities=13%  Similarity=0.180  Sum_probs=61.6

Q ss_pred             ChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880        101 TPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPA-KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG  178 (497)
Q Consensus       101 ~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~-~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~  178 (497)
                      +.+++.++++--+.+|+-+. +.+|.. +.       ..++.|+-+ ..|.  +.+++++..+++||+|++-++--=...
T Consensus        22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~-~~-------~~~~~f~~d~~~fP--dp~~m~~~l~~~g~~~~~~~~P~v~~~   91 (339)
T cd06604          22 PEEEVREIADEFRERDIPCDAIYLDID-YM-------DGYRVFTWDKERFP--DPKELIKELHEQGFKVVTIIDPGVKVD   91 (339)
T ss_pred             CHHHHHHHHHHHHHhCCCcceEEECch-hh-------CCCCceeeccccCC--CHHHHHHHHHHCCCEEEEEEeCceeCC
Confidence            56678888877777777543 223311 11       112223211 1232  578999999999999997664211110


Q ss_pred             ---hhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHh
Q psy14880        179 ---NGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMG  233 (497)
Q Consensus       179 ---a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~  233 (497)
                         .......+.++  +..-.++.++...+.......+|.+||++.++.+++++++.+
T Consensus        92 ~~~~~~~e~~~~g~--~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~  147 (339)
T cd06604          92 PGYDVYEEGLENDY--FVKDPDGELYIGRVWPGLSAFPDFTNPKVREWWGSLYKKFVD  147 (339)
T ss_pred             CCChHHHHHHHCCe--EEECCCCCEEEEEecCCCccccCCCChHHHHHHHHHHHHHhh
Confidence               00000000010  100011111110000112356899999999999999988764


No 90 
>KOG0462|consensus
Probab=29.71  E-value=43  Score=37.27  Aligned_cols=32  Identities=22%  Similarity=0.320  Sum_probs=25.7

Q ss_pred             CCHHHHHHHHHHHHHcCCEEE----------------EecCCcccchh
Q psy14880        148 YTPLEVQELVHYAKLRGIRVI----------------FELDAPAHAGN  179 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VI----------------PEID~PGH~~a  179 (497)
                      --++++-+=++.-++|||+|-                =-||||||...
T Consensus        91 ~~q~q~LDkl~vERERGITIkaQtasify~~~~~ylLNLIDTPGHvDF  138 (650)
T KOG0462|consen   91 IGQEQVLDKLQVERERGITIKAQTASIFYKDGQSYLLNLIDTPGHVDF  138 (650)
T ss_pred             CchhhhhhhhhhhhhcCcEEEeeeeEEEEEcCCceEEEeecCCCcccc
Confidence            346777777788899999974                46999999975


No 91 
>PF03979 Sigma70_r1_1:  Sigma-70 factor, region 1.1;  InterPro: IPR007127 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes.  With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ].  This entry represents Region 1.1 which modulates DNA binding by region 2 and 4 when sigma is unbound by the core RNA polymerase [, ]. Region 1.1 is also involved in promoter binding.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2K6X_A.
Probab=29.40  E-value=51  Score=26.96  Aligned_cols=55  Identities=22%  Similarity=0.329  Sum_probs=29.6

Q ss_pred             hHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCC
Q psy14880        103 LEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDA  173 (497)
Q Consensus       103 ~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~  173 (497)
                      ..|+++++.+|.+|+=-.-++.               .++ +....+.++|.+|..+-...||+|+.+.+.
T Consensus         7 ~~i~~Li~~gK~~G~lT~~eI~---------------~~L-~~~~~~~e~id~i~~~L~~~gI~Vvd~~~~   61 (82)
T PF03979_consen    7 EAIKKLIEKGKKKGYLTYDEIN---------------DAL-PEDDLDPEQIDEIYDTLEDEGIEVVDEEED   61 (82)
T ss_dssp             HHHHHHHHHHHHHSS-BHHHHH---------------HH--S-S---HHHHHHHHHHHHTT----B--S--
T ss_pred             HHHHHHHHHHhhcCcCCHHHHH---------------HHc-CccCCCHHHHHHHHHHHHHCCCEEecCCCc
Confidence            4688999999999872111111               000 122389999999999999999999995433


No 92 
>PRK14457 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=29.21  E-value=1e+02  Score=32.29  Aligned_cols=73  Identities=22%  Similarity=0.246  Sum_probs=49.0

Q ss_pred             eeEEeccCCcccccccccCCCCCCCccCChhHHH-HHHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCH
Q psy14880         73 LCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ-ELVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTP  150 (497)
Q Consensus        73 ~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~-~~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~  150 (497)
                      .+-+|-.|++-+.--.|-      ++.|..+++. .+.+|.+..|-+|.+++. +|+..                  -|.
T Consensus       215 aiSLha~~~e~r~~i~p~------~~~~~l~~l~~~~~~y~~~~gr~I~iey~LIpGvN------------------Ds~  270 (345)
T PRK14457        215 AVSLHAPNQKLRETLIPS------AKNYPIENLLEDCRHYVAITGRRVSFEYILLGGVN------------------DLP  270 (345)
T ss_pred             EEEeCCCCHHHHHHhcCC------ccCCCHHHHHHHHHHHHHHhCCEEEEEEEEECCcC------------------CCH
Confidence            377899998766543332      3446555554 455788888888877775 34431                  688


Q ss_pred             HHHHHHHHHHHHc--CCEEEE
Q psy14880        151 LEVQELVHYAKLR--GIRVIF  169 (497)
Q Consensus       151 ~ei~eiv~yA~~r--gI~VIP  169 (497)
                      +++++|+++++..  +|++||
T Consensus       271 e~a~~La~~l~~l~~~VnLIP  291 (345)
T PRK14457        271 EHAEELANLLRGFQSHVNLIP  291 (345)
T ss_pred             HHHHHHHHHHhcCCCeEEEec
Confidence            9999999999876  344444


No 93 
>TIGR02668 moaA_archaeal probable molybdenum cofactor biosynthesis protein A, archaeal. This model describes an archaeal family related, and predicted to be functionally equivalent, to molybdenum cofactor biosynthesis protein A (MoaA) of bacteria (see TIGR02666).
Probab=27.90  E-value=1.6e+02  Score=29.73  Aligned_cols=86  Identities=17%  Similarity=0.278  Sum_probs=53.5

Q ss_pred             hceeEEeccCCcccccccccCCCC--CCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy14880         71 MALCRWQMCTRQQVPCLLPRFGAY--SPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVY  148 (497)
Q Consensus        71 ~n~lhlHl~D~q~~~~e~~~~g~~--~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~Y  148 (497)
                      ++.+.+++|+.=+++|..-.....  ...+..+.+++.++++..+..|++.. -+                   ++++-.
T Consensus         9 ~~~l~i~vT~~CNl~C~yC~~~~~~~~~~~~ls~eei~~~i~~~~~~gi~~I-~~-------------------tGGEPl   68 (302)
T TIGR02668         9 VTSLRISVTDRCNLSCFYCHMEGEDRSGGNELSPEEIERIVRVASEFGVRKV-KI-------------------TGGEPL   68 (302)
T ss_pred             cCeEEEEEcccccCCCCCCCccccCCCccCcCCHHHHHHHHHHHHHcCCCEE-EE-------------------ECcccc
Confidence            578899999987777753221111  11345677888888887776665321 00                   112335


Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHA  177 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~  177 (497)
                      ...++.+|+++++++|+.-| .|.|-|..
T Consensus        69 l~~~l~~iv~~l~~~g~~~v-~i~TNG~l   96 (302)
T TIGR02668        69 LRKDLIEIIRRIKDYGIKDV-SMTTNGIL   96 (302)
T ss_pred             cccCHHHHHHHHHhCCCceE-EEEcCchH
Confidence            56678999999999988322 35566643


No 94 
>PF00682 HMGL-like:  HMGL-like of this family is not conserved in other members. are a sub-families of this Pfam.;  InterPro: IPR000891  Pyruvate carboxylase (6.4.1.1 from EC) (PC), a member of the biotin-dependent enzyme family, is involved in the gluconeogenesis by mediating the carboxylation of pyruvate to oxaloacetate. Biotin-dependent carboxylase enzymes perform a two step reaction. Enzyme-bound biotin is first carboxylated by bicarbonate and ATP and the carboxyl group temporarily bound to biotin is subsequently transferred to an acceptor substrate such as pyruvate []. PC has three functional domains: a biotin carboxylase (BC) domain, a carboxyltransferase (CT) domain which perform the second part of the reaction and a biotinyl domain [, ]. The mechanism by which the carboxyl group is transferred from the carboxybiotin to the pyruvate is not well understood.   The pyruvate carboxyltransferase domain is also found in other pyruvate binding enzymes and acetyl-CoA dependent enzymes suggesting that this domain can be associated with different enzymatic activities. This domain is found towards the N-terminal region of various aldolase enzymes. This N-terminal TIM barrel domain [] interacts with the C-terminal domain. The C-terminal DmpG_comm domain (IPR012425 from INTERPRO) is thought to promote heterodimerisation with members of IPR003361 from INTERPRO to form a bifunctional aldolase-dehydrogenase []. ; GO: 0003824 catalytic activity; PDB: 3MP5_E 3MP3_E 2CW6_E 3MP4_D 3HBL_A 3HB9_C 3HO8_A 3BG5_C 1YDN_B 3RMJ_A ....
Probab=27.73  E-value=1.5e+02  Score=28.58  Aligned_cols=57  Identities=28%  Similarity=0.318  Sum_probs=40.0

Q ss_pred             hHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880        103 LEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG  178 (497)
Q Consensus       103 ~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~  178 (497)
                      +.+.+++++++..|..+.+...                   ....+|.+++.++++.+.+.|+.+|=--||-|.+.
T Consensus       108 ~~~~~~v~~ak~~g~~v~~~~~-------------------~~~~~~~~~~~~~~~~~~~~g~~~i~l~Dt~G~~~  164 (237)
T PF00682_consen  108 ERIEEAVKYAKELGYEVAFGCE-------------------DASRTDPEELLELAEALAEAGADIIYLADTVGIMT  164 (237)
T ss_dssp             HHHHHHHHHHHHTTSEEEEEET-------------------TTGGSSHHHHHHHHHHHHHHT-SEEEEEETTS-S-
T ss_pred             HHHHHHHHHHHhcCCceEeCcc-------------------ccccccHHHHHHHHHHHHHcCCeEEEeeCccCCcC
Confidence            4667788899988887743321                   11348999999999999999998887667666653


No 95 
>cd03319 L-Ala-DL-Glu_epimerase L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides. The genomic context and the substrate specificity of characterized members of this family from E.coli and B.subtilis indicates a possible role in the metabolism of the murein peptide of peptidoglycan, of which L-Ala-D-Glu is a component. L-Ala-D/L-Glu epimerase is a member of the enolase-superfamily, which is characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=27.63  E-value=1.2e+02  Score=31.00  Aligned_cols=22  Identities=18%  Similarity=0.330  Sum_probs=19.5

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEe
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      .-.+.+++.++|+++||.++|.
T Consensus       264 Gi~~~~~~~~~a~~~gi~~~~~  285 (316)
T cd03319         264 GLTEALRIADLARAAGLKVMVG  285 (316)
T ss_pred             CHHHHHHHHHHHHHcCCCEEEE
Confidence            3467899999999999999996


No 96 
>PRK14340 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=27.60  E-value=87  Score=33.96  Aligned_cols=52  Identities=13%  Similarity=0.050  Sum_probs=35.1

Q ss_pred             CccCChhHHHHHHHHHHHc--CceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880         97 AKVYTPLEVQELVHYAKLR--GVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR  166 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~r--G~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~  166 (497)
                      .+.|+.+++.++++..+..  |+.+...+ +-+++               +  =|.+++++.++++++.++.
T Consensus       277 ~R~~t~~~~~~~v~~lr~~~pgi~i~td~-IvGfP---------------g--ET~edf~~tl~~~~~~~~~  330 (445)
T PRK14340        277 NRGHTIEEYLEKIALIRSAIPGVTLSTDL-IAGFC---------------G--ETEEDHRATLSLMEEVRFD  330 (445)
T ss_pred             CCCCCHHHHHHHHHHHHHhCCCCEEeccE-EEECC---------------C--CCHHHHHHHHHHHHhcCCC
Confidence            3568888888888887766  76543211 11211               1  3899999999999998743


No 97 
>PF07071 DUF1341:  Protein of unknown function (DUF1341);  InterPro: IPR010763 Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.; PDB: 3NZR_D 3LM7_A 3M0Z_B 3M6Y_A 3N73_A 3MUX_A.
Probab=27.36  E-value=76  Score=30.88  Aligned_cols=23  Identities=22%  Similarity=0.228  Sum_probs=11.0

Q ss_pred             CCHHHHHHHHHHHHHcCC-EEEEe
Q psy14880        148 YTPLEVQELVHYAKLRGI-RVIFE  170 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI-~VIPE  170 (497)
                      .+.+.+.+|++-|.+-|| .|||-
T Consensus       186 Idl~N~~~I~~i~l~aGv~~viPH  209 (218)
T PF07071_consen  186 IDLDNFEEIVKICLDAGVEKVIPH  209 (218)
T ss_dssp             --TTTHHHHHHHHHHTT-S-B--E
T ss_pred             cCHHHHHHHHHHHHHcCCCeeccc
Confidence            555566666666666666 35553


No 98 
>KOG0464|consensus
Probab=27.17  E-value=53  Score=35.35  Aligned_cols=30  Identities=27%  Similarity=0.432  Sum_probs=24.4

Q ss_pred             HHHHHHHHHHHHcCCEE-------------EEecCCcccchhh
Q psy14880        151 LEVQELVHYAKLRGIRV-------------IFELDAPAHAGNG  180 (497)
Q Consensus       151 ~ei~eiv~yA~~rgI~V-------------IPEID~PGH~~a~  180 (497)
                      +-+.++.+.-++|||+|             |--||+|||...-
T Consensus        74 dtvtdfla~erergitiqsaav~fdwkg~rinlidtpghvdf~  116 (753)
T KOG0464|consen   74 DTVTDFLAIERERGITIQSAAVNFDWKGHRINLIDTPGHVDFR  116 (753)
T ss_pred             chHHHHHHHHHhcCceeeeeeeecccccceEeeecCCCcceEE
Confidence            34677888899999998             5679999998753


No 99 
>PLN02231 alanine transaminase
Probab=27.12  E-value=1.7e+02  Score=32.43  Aligned_cols=24  Identities=17%  Similarity=0.414  Sum_probs=21.5

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      ..||++++++|++.|+++||-||=
T Consensus       286 ~vls~e~l~~Iv~~a~~~~l~lI~  309 (534)
T PLN02231        286 QVLAEENQRDIVEFCKQEGLVLLA  309 (534)
T ss_pred             cCCCHHHHHHHHHHHHHcCCEEEE
Confidence            459999999999999999997773


No 100
>cd07937 DRE_TIM_PC_TC_5S Pyruvate carboxylase and Transcarboxylase 5S, carboxyltransferase domain. This family includes the carboxyltransferase domains of pyruvate carboxylase (PC) and the transcarboxylase (TC) 5S subunit.  Transcarboxylase 5S is a cobalt-dependent metalloenzyme subunit of the biotin-dependent transcarboxylase multienzyme complex. Transcarboxylase 5S transfers carbon dioxide from the 1.3S biotin to pyruvate in the second of two carboxylation reactions catalyzed by TC. The first reaction involves the transfer of carbon dioxide from methylmalonyl-CoA to the 1.3S biotin, and is catalyzed by the 12S subunit.  These two steps allow a carboxylate group to be transferred from oxaloacetate to propionyl-CoA to yield pyruvate and methylmalonyl-CoA.  The catalytic domain of transcarboxylase 5S has a canonical TIM-barrel fold with a large C-terminal extension that forms a funnel leading to the active site.  Transcarboxylase 5S forms a homodimer and there are six dimers per complex
Probab=27.01  E-value=1.9e+02  Score=29.11  Aligned_cols=22  Identities=14%  Similarity=0.024  Sum_probs=13.8

Q ss_pred             HHHHHHHHHHHHcCCEEEEecC
Q psy14880        151 LEVQELVHYAKLRGIRVIFELD  172 (497)
Q Consensus       151 ~ei~eiv~yA~~rgI~VIPEID  172 (497)
                      +.+++.+++|+++|++|++-+.
T Consensus       118 ~~~~~~i~~ak~~G~~v~~~i~  139 (275)
T cd07937         118 RNLEVAIKAVKKAGKHVEGAIC  139 (275)
T ss_pred             HHHHHHHHHHHHCCCeEEEEEE
Confidence            4566666666666666665443


No 101
>PF13199 Glyco_hydro_66:  Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=26.93  E-value=3.8e+02  Score=30.15  Aligned_cols=93  Identities=13%  Similarity=0.112  Sum_probs=50.4

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCC-CCcccccCCC--CCCccccCC--C--CCcCCCCCh
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGL-GDLIVCLNDP--SWRANCIQP--P--CGQLNPINP  218 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~-~~l~~~~~~~--~~~~~~~~~--~--~~~L~~~~p  218 (497)
                      ...+.+-|++.|+-|+++|+..++=.=+-|=..-.-.    .+. |+-....+..  .+.......  .  --.+||+|+
T Consensus       165 r~i~~~~Vk~yI~~ah~~Gmkam~Ynmiyaa~~~~~~----~gv~~eW~ly~d~~~~~~~~~~l~~~w~s~lyl~dP~N~  240 (559)
T PF13199_consen  165 RQISTSTVKDYINAAHKYGMKAMAYNMIYAANNNYEE----DGVSPEWGLYKDDSHSNQDTYDLPDGWPSDLYLMDPGNP  240 (559)
T ss_dssp             -EEEHHHHHHHHHHHHHTT-EEEEEEESSEEETT--S------SS-GGBEEESSSBTSB-EEEETT-E--EEEEB-TT-H
T ss_pred             CEehHHHHHHHHHHHHHcCcceehhHhhhccccCccc----ccCCchhhhhhccCCCccceeecCcccccceEEecCCCH
Confidence            3477888999999999999999987555544333111    110 1100000000  000000000  0  125899999


Q ss_pred             hHHHHHHHHHHHHHhccCCCCCeEEE
Q psy14880        219 HVYTVLKDIYGELMGMAKYGDEMFHM  244 (497)
Q Consensus       219 ~t~~fl~~vl~E~~~lF~~~~~~iHi  244 (497)
                      +=.+.|-+=+.++.+-|.  ..=|||
T Consensus       241 ~WQ~yI~~q~~~~~~~~g--FDG~hl  264 (559)
T PF13199_consen  241 EWQNYIINQMNKAIQNFG--FDGWHL  264 (559)
T ss_dssp             HHHHHHHHHHHHHHHHHT----EEEE
T ss_pred             HHHHHHHHHHHHHHHccC--CceEee
Confidence            988888888888888888  888997


No 102
>TIGR00048 radical SAM enzyme, Cfr family. A Staphylococcus sciuri plasmid-borne member of this family, Cfr, has been identified as essential to transferrable resistance to chloramphenicol and florfenicol by an unknown mechanism. A 14-15 residue cluster with four perfectly conserved Cys residues suggests this protein may be an enzyme with an iron-sulfur cluster. The Cys cluster is part of the radical SAM domain, suggested to provide a general mechanism by which the Fe-S center cleaves S-adenosylmethionine to initiate radical-based catalysis. Members of this family lack apparent transmembrane domains.
Probab=26.85  E-value=2.2e+02  Score=29.89  Aligned_cols=75  Identities=19%  Similarity=0.120  Sum_probs=47.1

Q ss_pred             eEEeccCCcccccccccCCCCCCCccCChhHHHH-HHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880         74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQE-LVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPL  151 (497)
Q Consensus        74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~-~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~  151 (497)
                      +-+|-.|++-..--.|      .++.|..+++.+ +.+|.+..|.++.+++. +|+.                  -=|.+
T Consensus       219 iSL~a~~~e~r~~l~p------~~~~~~l~~ll~~l~~~~~~~g~~VtieyvLI~Gv------------------NDs~e  274 (355)
T TIGR00048       219 ISLHAPNDELRSSLMP------INKKYNIETLLAAVRRYLNKTGRRVTFEYVLLDGV------------------NDQVE  274 (355)
T ss_pred             EEeCCCCHHHHHHhcC------cccCCCHHHHHHHHHHHHHHhCCEEEEEEEEECCC------------------CCCHH
Confidence            6677777655432222      123455566664 34577777878776665 2432                  15789


Q ss_pred             HHHHHHHHHHHc--CCEEEEecC
Q psy14880        152 EVQELVHYAKLR--GIRVIFELD  172 (497)
Q Consensus       152 ei~eiv~yA~~r--gI~VIPEID  172 (497)
                      ++++|.++++..  +|.+||.-.
T Consensus       275 ~a~~La~llk~l~~~VnLIPynp  297 (355)
T TIGR00048       275 HAEELAELLKGTKCKVNLIPWNP  297 (355)
T ss_pred             HHHHHHHHHhcCCCceEEEeccc
Confidence            999999999986  556776543


No 103
>PRK10785 maltodextrin glucosidase; Provisional
Probab=26.73  E-value=55  Score=36.87  Aligned_cols=30  Identities=23%  Similarity=0.434  Sum_probs=26.2

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|+++||+-|.+|||+||-.+ ++.|++.
T Consensus       224 t~~df~~Lv~~aH~rGikVilD~-V~NH~~~  253 (598)
T PRK10785        224 GDAALLRLRHATQQRGMRLVLDG-VFNHTGD  253 (598)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-CCCcCCC
Confidence            88999999999999999999875 4577765


No 104
>TIGR02102 pullulan_Gpos pullulanase, extracellular, Gram-positive. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. In contrast, a glycogen debranching enzyme such GlgX, homologous to this family, can release glucose at alpha,1-6 linkages from glycogen first subjected to limit degradation by phosphorylase. Characterized members of this family include a surface-located pullulanase from Streptococcus pneumoniae (PubMed:11083842) and an extracellular bifunctional amylase/pullulanase with C-terminal pullulanase activity (PubMed:8798645).
Probab=26.72  E-value=3.3e+02  Score=33.31  Aligned_cols=80  Identities=16%  Similarity=0.133  Sum_probs=49.0

Q ss_pred             HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcc-cc-cCCCCCCccccCCCCCcCCCCChhHHHHHHHH
Q psy14880        150 PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLI-VC-LNDPSWRANCIQPPCGQLNPINPHVYTVLKDI  227 (497)
Q Consensus       150 ~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~-~~-~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~v  227 (497)
                      -+|+|++|+-|.++||+||-.+ ++-|+... ..+... .|..- .+ .++.+...+    ..+.++..+|.+.+|+.+.
T Consensus       554 i~EfK~LV~alH~~GI~VILDV-VyNHt~~~-~~f~~~-~p~Yy~~~~~~G~~~~~~----~g~~l~~e~~~vrk~iiDs  626 (1111)
T TIGR02102       554 IAEFKNLINEIHKRGMGVILDV-VYNHTAKV-YIFEDL-EPNYYHFMDADGTPRTSF----GGGRLGTTHEMSRRILVDS  626 (1111)
T ss_pred             HHHHHHHHHHHHHCCCEEEEec-cccccccc-cccccc-CCCceEeeCCCCCccccc----CCCCCCcCCHHHHHHHHHH
Confidence            4899999999999999999875 56887542 111100 01100 00 011111111    1245778889999999988


Q ss_pred             HHHHHhccC
Q psy14880        228 YGELMGMAK  236 (497)
Q Consensus       228 l~E~~~lF~  236 (497)
                      +.-.++-|.
T Consensus       627 l~yWv~ey~  635 (1111)
T TIGR02102       627 IKYLVDEFK  635 (1111)
T ss_pred             HHHHHHhcC
Confidence            888877555


No 105
>PRK14456 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=26.69  E-value=1.6e+02  Score=31.17  Aligned_cols=74  Identities=20%  Similarity=0.242  Sum_probs=46.1

Q ss_pred             eEEeccCCcccccccccCCCCCCCccCChhHHHHHH-HHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880         74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV-HYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPL  151 (497)
Q Consensus        74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~-~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~  151 (497)
                      +.+|-.|++-+.--.|.-     .+.|.-+++.+.+ +|++..|.++.+++. +|+.                  -=|.+
T Consensus       238 iSL~a~~~e~r~~i~P~~-----~~~~~l~~l~~~i~~~~~~~g~~V~ieyvLI~Gv------------------NDs~e  294 (368)
T PRK14456        238 VSLHSADQEKRERLMPQA-----ARDYPLDELREALIGYASKTGEPVTLVYMLLEGI------------------NDSPE  294 (368)
T ss_pred             EEecCCCHHHHHHhcccc-----CCCCCHHHHHHHHHHHHHhcCCeEEEEEEEEcCC------------------CCCHH
Confidence            556776765544333311     1245666777755 477777877766554 2322                  15788


Q ss_pred             HHHHHHHHHHHc--CCEEEEe
Q psy14880        152 EVQELVHYAKLR--GIRVIFE  170 (497)
Q Consensus       152 ei~eiv~yA~~r--gI~VIPE  170 (497)
                      |+++|+++++.+  .|++||-
T Consensus       295 da~~L~~~l~~~~~~VnlIpy  315 (368)
T PRK14456        295 DARKLIRFASRFFCKINLIDY  315 (368)
T ss_pred             HHHHHHHHHhcCCCeeEEeee
Confidence            999999999987  5667764


No 106
>PRK14507 putative bifunctional 4-alpha-glucanotransferase/malto-oligosyltrehalose synthase; Provisional
Probab=26.62  E-value=2.6e+02  Score=35.66  Aligned_cols=71  Identities=18%  Similarity=0.306  Sum_probs=48.7

Q ss_pred             ccCChhHHHHHHHHHHHcCceee---eecCCCCCCCCCCCCCCCCCCCCCCCCC-------CHHHHHHHHHHHHHcCCEE
Q psy14880         98 KVYTPLEVQELVHYAKLRGVRVI---FELDAPAHAGNGWQWGPRFGAYSPAKVY-------TPLEVQELVHYAKLRGIRV  167 (497)
Q Consensus        98 ~~y~~~~i~~~~~y~~~rG~ri~---~e~~~p~~~~~~~~~~~~~g~~s~~~~Y-------T~~ei~eiv~yA~~rgI~V  167 (497)
                      ..++-+++.+.+.|.+.-|+.-+   |-+..+  .++       ..+|....++       |.++++.|++-|+++||.|
T Consensus       753 ~~~tf~~~~~~l~Yl~~LGv~~i~lsPi~~a~--~gs-------~hGYdv~D~~~idp~lG~~edf~~Lv~~ah~~Gi~v  823 (1693)
T PRK14507        753 KDFTFADAEAILPYLAALGISHVYASPILKAR--PGS-------THGYDIVDHSQINPEIGGEEGFERFCAALKAHGLGQ  823 (1693)
T ss_pred             CCCCHHHHHHHhHHHHHcCCCEEEECCCcCCC--CCC-------CCCCCCCCCCccCcccCCHHHHHHHHHHHHHCCCEE
Confidence            35666789999999998888532   222111  111       1233333333       8999999999999999999


Q ss_pred             EEecCCcccch
Q psy14880        168 IFELDAPAHAG  178 (497)
Q Consensus       168 IPEID~PGH~~  178 (497)
                      |-.| +|.|++
T Consensus       824 ilDi-V~NH~~  833 (1693)
T PRK14507        824 LLDI-VPNHMG  833 (1693)
T ss_pred             EEEe-cccccC
Confidence            9886 678887


No 107
>PRK01060 endonuclease IV; Provisional
Probab=26.35  E-value=3.4e+02  Score=26.74  Aligned_cols=96  Identities=11%  Similarity=0.049  Sum_probs=56.8

Q ss_pred             eeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHHH
Q psy14880         31 TKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELVH  110 (497)
Q Consensus        31 G~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~~  110 (497)
                      |+|..+.+.      +.+.|+.++..++             +.+.+-+..-..|.           ...++.+.++++.+
T Consensus         5 g~~~~~~~~------~~~~l~~~~~~G~-------------d~vEl~~~~p~~~~-----------~~~~~~~~~~~lk~   54 (281)
T PRK01060          5 GAHVSAAGG------LEGAVAEAAEIGA-------------NAFMIFTGNPQQWK-----------RKPLEELNIEAFKA   54 (281)
T ss_pred             EEeeecCCC------HHHHHHHHHHcCC-------------CEEEEECCCCCCCc-----------CCCCCHHHHHHHHH
Confidence            555555554      7788889998885             56665544322221           12356777888877


Q ss_pred             HHHHcCcee---eeecCCCCCCCCCCCCCCCCCCCCCCC---CCCHHHHHHHHHHHHHcCCEEE
Q psy14880        111 YAKLRGVRV---IFELDAPAHAGNGWQWGPRFGAYSPAK---VYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       111 y~~~rG~ri---~~e~~~p~~~~~~~~~~~~~g~~s~~~---~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      ..+.+|+++   ....  |-. .+       .+  +++.   --+.+.++..++.|++.|+.+|
T Consensus        55 ~~~~~gl~~~~~~~h~--~~~-~n-------l~--~~d~~~r~~s~~~~~~~i~~A~~lga~~v  106 (281)
T PRK01060         55 ACEKYGISPEDILVHA--PYL-IN-------LG--NPNKEILEKSRDFLIQEIERCAALGAKLL  106 (281)
T ss_pred             HHHHcCCCCCceEEec--ceE-ec-------CC--CCCHHHHHHHHHHHHHHHHHHHHcCCCEE
Confidence            788888873   2111  100 00       00  1111   2456789999999999999643


No 108
>PRK00278 trpC indole-3-glycerol-phosphate synthase; Reviewed
Probab=26.29  E-value=1.1e+02  Score=30.51  Aligned_cols=34  Identities=21%  Similarity=0.361  Sum_probs=28.4

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEecCCcccchhhh
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGW  181 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l  181 (497)
                      ++.+++++++++|+.+|.+++-|+...-=...+.
T Consensus       144 l~~~~l~~li~~a~~lGl~~lvevh~~~E~~~A~  177 (260)
T PRK00278        144 LDDEQLKELLDYAHSLGLDVLVEVHDEEELERAL  177 (260)
T ss_pred             CCHHHHHHHHHHHHHcCCeEEEEeCCHHHHHHHH
Confidence            6789999999999999999999988776444443


No 109
>PTZ00372 endonuclease 4-like protein; Provisional
Probab=26.00  E-value=1.4e+02  Score=32.14  Aligned_cols=25  Identities=12%  Similarity=0.143  Sum_probs=18.8

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      |..-.+-++.|+...+-.+|.+|=|
T Consensus       366 G~Ig~~~f~~l~~~~~~~~iP~ILE  390 (413)
T PTZ00372        366 GKLGMETFKFIMNSKYFKNIPIILE  390 (413)
T ss_pred             CCcChHHHHHHHhChhhCCCeEEEe
Confidence            4566777888888777778888766


No 110
>PF03102 NeuB:  NeuB family;  InterPro: IPR013132 NeuB is the prokaryotic N-acetylneuraminic acid synthase (Neu5Ac). It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only been observed in prokaryotes; eukaryotes synthesise the 9-phosphate form, Neu5Ac-9-P, and utilise ManNAc-6-P instead of ManNAc. Such eukaryotic enzymes are not present in this family []. This family also contains SpsE spore coat polysaccharide biosynthesis proteins.; GO: 0016051 carbohydrate biosynthetic process; PDB: 3G8R_B 1XUU_A 1XUZ_A 3CM4_A 2ZDR_A 1VLI_A 2WQP_A.
Probab=25.84  E-value=65  Score=32.08  Aligned_cols=25  Identities=24%  Similarity=0.246  Sum_probs=20.0

Q ss_pred             CCCHHHHHHHHHHHHHcCCEEEEec
Q psy14880        147 VYTPLEVQELVHYAKLRGIRVIFEL  171 (497)
Q Consensus       147 ~YT~~ei~eiv~yA~~rgI~VIPEI  171 (497)
                      .+|.+++++|.+||+++||..+-..
T Consensus        52 el~~e~~~~L~~~~~~~gi~f~stp   76 (241)
T PF03102_consen   52 ELSEEQHKELFEYCKELGIDFFSTP   76 (241)
T ss_dssp             SS-HHHHHHHHHHHHHTT-EEEEEE
T ss_pred             cCCHHHHHHHHHHHHHcCCEEEECC
Confidence            3899999999999999999887543


No 111
>PF09508 Lact_bio_phlase:  Lacto-N-biose phosphorylase;  InterPro: IPR012711  The gene which codes for this protein in gut-bacteria is located in a novel putative operon for galactose metabolism. The protein appears to be a carbohydrate-processing phosphorolytic enzyme (2.4.1.211 from EC), unlike either glycoside hydrolases or glycoside lyase. Intestinal colonisation by Bifidobacteria is important for human health, especially in paediatrics, because colonisation seems to prevent infection by some pathogenic bacteria that cause diarrhoea or other illnesses. The operon seems to be involved in intestinal colonisation by Bifidobacteria mediated by metabolism of mucin sugars. In addition, it may also resolve the question of the nature of the bifidus factor in human milk as the lacto-N-biose structure found in milk oligosaccharides. ; GO: 0016758 transferase activity, transferring hexosyl groups; PDB: 2ZUW_A 2ZUU_C 2ZUT_D 2ZUV_A 2ZUS_B.
Probab=24.95  E-value=5.2e+02  Score=29.65  Aligned_cols=100  Identities=13%  Similarity=0.179  Sum_probs=55.2

Q ss_pred             CcCCCCChhHHHHHHHHHHHHHhccCCC-----CCeE-----EEcCCccchhhhhhhhhhcccCCCCCcCCCCCCCCCCC
Q psy14880        211 GQLNPINPHVYTVLKDIYGELMGMAKYG-----DEMF-----HMGADELVTLMALCRWQMCTRQQVPCVQPISGVDHLSP  280 (497)
Q Consensus       211 ~~L~~~~p~t~~fl~~vl~E~~~lF~~~-----~~~i-----HiGgDE~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~  280 (497)
                      -.+||..|+|.+++.+-+++.++-.|..     ..+|     -.|-++                               .
T Consensus       175 ipfDvr~p~t~~~~~~~L~~wl~~hP~~dVVRFTTFFy~Ftl~f~~~~-------------------------------k  223 (716)
T PF09508_consen  175 IPFDVRQPKTREYVLEWLRKWLEEHPDTDVVRFTTFFYQFTLIFNEDA-------------------------------K  223 (716)
T ss_dssp             ----TTSHHHHHHHHHHHHHHHHT-TT--EEEES--SB--EEEEBTTS-------------------------------S
T ss_pred             CCCCCCCHHHHHHHHHHHHHHHHHCCCCcEEEeeeeeeeeEEEecccc-------------------------------c
Confidence            4799999999999999999999988852     1111     123332                               1


Q ss_pred             CcceeecccccC--CHHHH-HHHHHcCCCCC--------------------cccHHHHHHHHHHHHHHHHHHHhhCCCce
Q psy14880        281 HLWTVFIKCWNN--TPAIV-DFMLAHGKNTT--------------------FDDYIQLWAHFQNKAAASLDEAVGHNRTK  337 (497)
Q Consensus       281 ~~~~v~~~~w~~--~p~~~-~~~~~~g~~~~--------------------~~~~~~l~~~F~~~~~~~l~~~~~~~g~~  337 (497)
                      +.+ |+...+..  ||... ++-++.|..-+                    ..||.+.++.|+-+.++.+-+.+.+.||+
T Consensus       224 ekf-vDWfGY~asVSp~aLe~FEke~GY~lrpEdfid~GyyNstfrvPsk~~~D~~~F~~~fV~~~akelVDivH~~GKe  302 (716)
T PF09508_consen  224 EKF-VDWFGYSASVSPYALEQFEKEYGYRLRPEDFIDQGYYNSTFRVPSKEFRDWMDFQQRFVAEFAKELVDIVHEYGKE  302 (716)
T ss_dssp             EEE-EETT--GGGB-HHHHHHHHHHHSS---THHHHGGGTTT-TTS---HHHHHHHHHHHHHHHHHHHHHHHHHHHTT-E
T ss_pred             ccc-ccccCcCcccCHHHHHHHHHHhCCCCCHHHhhccccccCcccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCe
Confidence            111 33333333  35444 44444466321                    13466677777777777777777789998


Q ss_pred             EEEec
Q psy14880        338 LIVWS  342 (497)
Q Consensus       338 ~~~W~  342 (497)
                      .|+.-
T Consensus       303 AmMFl  307 (716)
T PF09508_consen  303 AMMFL  307 (716)
T ss_dssp             EEEES
T ss_pred             EEEec
Confidence            88763


No 112
>PF01055 Glyco_hydro_31:  Glycosyl hydrolases family 31 ;  InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC).  Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=24.88  E-value=2.6e+02  Score=29.86  Aligned_cols=135  Identities=14%  Similarity=0.203  Sum_probs=68.8

Q ss_pred             CChhHHHHHHHHHHHcCceee-eecCCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880        100 YTPLEVQELVHYAKLRGVRVI-FELDAPAHAGNGWQWGPRFGAYSPAK-VYTPLEVQELVHYAKLRGIRVIFELDAPAHA  177 (497)
Q Consensus       100 y~~~~i~~~~~y~~~rG~ri~-~e~~~p~~~~~~~~~~~~~g~~s~~~-~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~  177 (497)
                      ++.+++.++++--+.+++.+. ..+|.. +       ...++.|+-+. .+  .+.+++++..+++||+|++-++--=+.
T Consensus        40 ~~~~~v~~~i~~~~~~~iP~d~~~iD~~-~-------~~~~~~f~~d~~~F--Pd~~~~~~~l~~~G~~~~~~~~P~v~~  109 (441)
T PF01055_consen   40 YNQDEVREVIDRYRSNGIPLDVIWIDDD-Y-------QDGYGDFTWDPERF--PDPKQMIDELHDQGIKVVLWVHPFVSN  109 (441)
T ss_dssp             TSHHHHHHHHHHHHHTT--EEEEEE-GG-G-------SBTTBTT-B-TTTT--TTHHHHHHHHHHTT-EEEEEEESEEET
T ss_pred             CCHHHHHHHHHHHHHcCCCccceecccc-c-------cccccccccccccc--cchHHHHHhHhhCCcEEEEEeecccCC
Confidence            467788888877777887654 233322 1       11122322111 22  279999999999999999776542222


Q ss_pred             hhh----hhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880        178 GNG----WQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYGELMGMAKYGDEMFHMGADE  248 (497)
Q Consensus       178 ~a~----l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~E~~~lF~~~~~~iHiGgDE  248 (497)
                      ..-    .....+.+  -+..-.+++++....-......+|.++|++.++.++.++++.+.++  -..+=+=.-|
T Consensus       110 ~~~~~~~~~~~~~~~--~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~G--vdg~w~D~~E  180 (441)
T PF01055_consen  110 DSPDYENYDEAKEKG--YLVKNPDGSPYIGRVWPGKGGFIDFTNPEARDWWKEQLKELLDDYG--VDGWWLDFGE  180 (441)
T ss_dssp             TTTB-HHHHHHHHTT---BEBCTTSSB-EEEETTEEEEEB-TTSHHHHHHHHHHHHHHHTTST---SEEEEESTT
T ss_pred             CCCcchhhhhHhhcC--ceeecccCCcccccccCCcccccCCCChhHHHHHHHHHHHHHhccC--CceEEeecCC
Confidence            210    00000111  0110011111111000001357999999999999999999988866  5555555555


No 113
>cd09013 BphC-JF8_N_like N-terminal, non-catalytic, domain of BphC_JF8, (2,3-dihydroxybiphenyl 1,2-dioxygenase) from Bacillus sp. JF8 and similar proteins. 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC) catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, a key step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). BphC belongs to the type I extradiol dioxygenase family, which requires a metal ion in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of BphCs. This subfamily of BphC is represented by the enzyme purified from the thermophilic biphenyl and naphthalene degrader, Bacillus sp. JF8. The members in this family of BphC enzymes may use either Mn(II) or Fe(II) as cofactors. The enzyme purified from Bacillus sp. JF8 is Mn(II)-dependent, however, the enzyme from Rhodococcus jostii RHAI has Fe(II) bound to it. BphC_JF8 is thermostable and its optimum activity is at 85 degrees C
Probab=24.71  E-value=68  Score=27.08  Aligned_cols=32  Identities=16%  Similarity=0.243  Sum_probs=26.4

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      -|.+++.++.+..+++||++.++...|||..+
T Consensus        71 ~~~~~v~~~~~~l~~~G~~~~~~~~~~~~~~~  102 (121)
T cd09013          71 SSPEALERRVAALEASGLGIGWIEGDPGHGKA  102 (121)
T ss_pred             CCHHHHHHHHHHHHHcCCccccccCCCCCcce
Confidence            57899999999999999998665566777654


No 114
>cd07939 DRE_TIM_NifV Streptomyces rubellomurinus FrbC and related proteins, catalytic TIM barrel domain. FrbC (NifV) of Streptomyces rubellomurinus catalyzes the condensation of acetyl-CoA and alpha-ketoglutarate to form homocitrate and CoA, a reaction similar to one catalyzed by homocitrate synthase.  The gene encoding FrbC is one of several genes required for the biosynthesis of FR900098, a potent antimalarial antibiotic.  This protein is also required for assembly of the nitrogenase MoFe complex but its exact role is unknown.   This family also includes the NifV proteins of Heliobacterium chlorum and Gluconacetobacter diazotrophicus, which appear to be orthologous to FrbC.  This family belongs to the DRE-TIM metallolyase superfamily.  DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarbox
Probab=24.27  E-value=2.4e+02  Score=27.91  Aligned_cols=20  Identities=10%  Similarity=-0.195  Sum_probs=13.1

Q ss_pred             CCCCHHHHHHhhhhhhcccc
Q psy14880         39 HLTDPEIILNYLDPQTCGTQ   58 (497)
Q Consensus        39 ~~~~~~~lk~~id~ma~~K~   58 (497)
                      .+++.++.+++++.|...++
T Consensus        15 ~~~~~~~k~~i~~~L~~~Gv   34 (259)
T cd07939          15 VAFSREEKLAIARALDEAGV   34 (259)
T ss_pred             CCCCHHHHHHHHHHHHHcCC
Confidence            35667777777777666654


No 115
>COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only]
Probab=24.09  E-value=90  Score=32.45  Aligned_cols=29  Identities=28%  Similarity=0.406  Sum_probs=25.7

Q ss_pred             CCCCCCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880        142 YSPAKVYTPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       142 ~s~~~~YT~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      |++++--+...++||.++|+..||+||+-
T Consensus       166 Y~p~E~ns~~l~eelk~~A~~~Gl~vve~  194 (322)
T COG2984         166 YNPGEANSVSLVEELKKEARKAGLEVVEA  194 (322)
T ss_pred             eCCCCcccHHHHHHHHHHHHHCCCEEEEE
Confidence            46677789999999999999999999984


No 116
>PRK04296 thymidine kinase; Provisional
Probab=24.06  E-value=80  Score=29.77  Aligned_cols=25  Identities=24%  Similarity=0.183  Sum_probs=22.3

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEe
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      ++++.+++.+|++.++..|++||=-
T Consensus        88 q~l~~~~v~~l~~~l~~~g~~vi~t  112 (190)
T PRK04296         88 QFLDKEQVVQLAEVLDDLGIPVICY  112 (190)
T ss_pred             ccCCHHHHHHHHHHHHHcCCeEEEE
Confidence            5689999999999999999998864


No 117
>cd00308 enolase_like Enolase-superfamily, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Enolase superfamily contains different enzymes, like enolases, glutarate-, fucanate- and galactonate dehydratases, o-succinylbenzoate synthase, N-acylamino acid racemase, L-alanine-DL-glutamate epimerase, mandelate racemase, muconate lactonizing enzyme and 3-methylaspartase.
Probab=23.91  E-value=1.8e+02  Score=28.02  Aligned_cols=22  Identities=27%  Similarity=0.305  Sum_probs=19.4

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEe
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      .-.+.+++.++|+++||.++|-
T Consensus       181 Gi~~~~~i~~~a~~~gi~~~~~  202 (229)
T cd00308         181 GLTESRRAADLAEAFGIRVMVH  202 (229)
T ss_pred             CHHHHHHHHHHHHHcCCEEeec
Confidence            3467899999999999999996


No 118
>PRK14335 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=23.87  E-value=1.3e+02  Score=32.73  Aligned_cols=52  Identities=15%  Similarity=0.137  Sum_probs=35.2

Q ss_pred             CccCChhHHHHHHHHHHHc--CceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880         97 AKVYTPLEVQELVHYAKLR--GVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR  166 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~r--G~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~  166 (497)
                      ++.|+.+++.++++..+..  |+.+...+ +-++++                 =|.+++++.++++++.+..
T Consensus       287 ~R~~t~e~~~~~v~~ir~~~pgi~i~~d~-IvGfPg-----------------ET~edf~~Tl~~i~~l~~~  340 (455)
T PRK14335        287 NRSYTREHYLSLVGKLKASIPNVALSTDI-LIGFPG-----------------ETEEDFEQTLDLMREVEFD  340 (455)
T ss_pred             CCCCCHHHHHHHHHHHHHhCCCCEEEEEE-EEeCCC-----------------CCHHHHHHHHHHHHhcCCC
Confidence            3568888888888887765  66543222 111111                 3899999999999988753


No 119
>TIGR00190 thiC thiamine biosynthesis protein ThiC. The thiC ortholog is designated thiA in Bacillus subtilis.
Probab=23.76  E-value=64  Score=34.51  Aligned_cols=23  Identities=26%  Similarity=0.485  Sum_probs=19.0

Q ss_pred             HHHHHHHHHcCCEEEEecCCcccch
Q psy14880        154 QELVHYAKLRGIRVIFELDAPAHAG  178 (497)
Q Consensus       154 ~eiv~yA~~rgI~VIPEID~PGH~~  178 (497)
                      -||++.|++.||.||-  +-|||.-
T Consensus       246 geL~~rA~e~gVQvMV--EGPGHvP  268 (423)
T TIGR00190       246 GELVERAREADVQCMV--EGPGHVP  268 (423)
T ss_pred             HHHHHHHHHcCCeEEE--ECCCCCc
Confidence            3456899999999995  6999983


No 120
>TIGR03581 EF_0839 conserved hypothetical protein EF_0839/AHA_3917. Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.
Probab=23.64  E-value=1.6e+02  Score=29.01  Aligned_cols=22  Identities=27%  Similarity=0.270  Sum_probs=16.0

Q ss_pred             CCHHHHHHHHHHHHHcCC-EEEE
Q psy14880        148 YTPLEVQELVHYAKLRGI-RVIF  169 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI-~VIP  169 (497)
                      .+.+.+.+|++.|.+-|| .|||
T Consensus       186 Idl~Nf~~I~~i~ldaGv~kviP  208 (236)
T TIGR03581       186 IDLDNFEEIVQIALDAGVEKVIP  208 (236)
T ss_pred             ccHHhHHHHHHHHHHcCCCeecc
Confidence            677777777777777777 4666


No 121
>PF00218 IGPS:  Indole-3-glycerol phosphate synthase;  InterPro: IPR013798 Indole-3-glycerol phosphate synthase (4.1.1.48 from EC) (IGPS) catalyses the fourth step in the biosynthesis of tryptophan, the ring closure of 1-(2-carboxy-phenylamino)-1-deoxyribulose into indol-3-glycerol-phosphate. In some bacteria, IGPS is a single chain enzyme. In others, such as Escherichia coli, it is the N-terminal domain of a bifunctional enzyme that also catalyses N-(5'-phosphoribosyl)anthranilate isomerase (5.3.1.24 from EC) (PRAI) activity (see IPR001240 from INTERPRO), the third step of tryptophan biosynthesis. In fungi, IGPS is the central domain of a trifunctional enzyme that contains a PRAI C-terminal domain and a glutamine amidotransferase (2.4.2 from EC) (GATase) N-terminal domain (see IPR000991 from INTERPRO).  A structure of the IGPS domain of the bifunctional enzyme from the mesophilic bacterium E. coli (eIGPS) has been compared with the monomeric indole-3-glycerol phosphate synthase from the hyperthermophilic archaeon Sulfolobus solfataricus (sIGPS). Both are single-domain (beta/alpha)8 barrel proteins, with one (eIGPS) or two (sIGPS) additional helices inserted before the first beta strand []. ; GO: 0004425 indole-3-glycerol-phosphate synthase activity; PDB: 1VC4_A 1PII_A 1JCM_P 1I4N_B 1J5T_A 3TSM_B 4FB7_A 3QJA_A 1JUL_A 2C3Z_A ....
Probab=23.49  E-value=85  Score=31.55  Aligned_cols=39  Identities=15%  Similarity=0.294  Sum_probs=33.8

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG  184 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~  184 (497)
                      ..++++++++++++|..+|++++-|+-+.--...++...
T Consensus       140 ~~L~~~~l~~l~~~a~~lGle~lVEVh~~~El~~al~~~  178 (254)
T PF00218_consen  140 AILSDDQLEELLELAHSLGLEALVEVHNEEELERALEAG  178 (254)
T ss_dssp             GGSGHHHHHHHHHHHHHTT-EEEEEESSHHHHHHHHHTT
T ss_pred             HhCCHHHHHHHHHHHHHcCCCeEEEECCHHHHHHHHHcC
Confidence            358999999999999999999999999999988877543


No 122
>cd03318 MLE Muconate Lactonizing Enzyme (MLE), an homooctameric enzyme, catalyses the conversion of cis,cis-muconate (CCM) to muconolactone (ML) in the catechol branch of the beta-ketoadipate pathway. This pathway is used in soil microbes to breakdown lignin-derived aromatics, catechol and protocatechuate, to citric acid cycle intermediates. Some bacterial species are also capable of dehalogenating chloroaromatic compounds by the action of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs are members of the enolase superfamily characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and that is stabilized by coordination to the essential Mg2+ ion.
Probab=23.07  E-value=1.6e+02  Score=30.68  Aligned_cols=28  Identities=7%  Similarity=-0.064  Sum_probs=22.2

Q ss_pred             ceeeeccCCCCCCHHHHHHhhhhhhcccc
Q psy14880         30 RTKLIVWSSHLTDPEIILNYLDPQTCGTQ   58 (497)
Q Consensus        30 RG~mlDvaR~~~~~~~lk~~id~ma~~K~   58 (497)
                      -.+|||.-.. ++++...++++.+..+++
T Consensus       189 ~~l~iDaN~~-~~~~~A~~~~~~l~~~~~  216 (365)
T cd03318         189 ASVRVDVNQA-WDESTAIRALPRLEAAGV  216 (365)
T ss_pred             cEEEEECCCC-CCHHHHHHHHHHHHhcCc
Confidence            3699999644 688888899999888864


No 123
>PLN02368 alanine transaminase
Probab=23.03  E-value=83  Score=33.56  Aligned_cols=24  Identities=4%  Similarity=0.221  Sum_probs=21.6

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      ..||++++++|++.|++++|-||-
T Consensus       225 ~v~s~e~l~~l~~~a~~~~~~II~  248 (407)
T PLN02368        225 QCLSEANLREILKFCYQERLVLLG  248 (407)
T ss_pred             ccCCHHHHHHHHHHHHHcCCEEEE
Confidence            459999999999999999998874


No 124
>COG3618 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=22.96  E-value=8e+02  Score=25.08  Aligned_cols=164  Identities=18%  Similarity=0.219  Sum_probs=88.9

Q ss_pred             ceeeeccCCCCCCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHH
Q psy14880         30 RTKLIVWSSHLTDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV  109 (497)
Q Consensus        30 RG~mlDvaR~~~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~  109 (497)
                      ||+++.++=.--..+...+.+..|+..+ +-  +         +-..-++        .|.+          ...+++..
T Consensus        53 ~~V~vq~~~~~~D~~~e~~~v~~~~~~~-g~--~---------vg~id~~--------~~e~----------~a~L~~~~  102 (279)
T COG3618          53 GGVLVQVNVDPRDNEKELAFVAELAERH-GG--I---------VGVIDEC--------RPEF----------AAKLERAR  102 (279)
T ss_pred             eeEEEecccCccchHHHHHHHHhhHHhh-Cc--e---------EEEEecC--------CchH----------HHHHHHhc
Confidence            7889999888888999999999999876 10  0         0000000        0111          11222222


Q ss_pred             HHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhh----hhhcC
Q psy14880        110 HYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNG----WQWGP  185 (497)
Q Consensus       110 ~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~----l~~~p  185 (497)
                       |.+++|+|.... ..|.                  +.+-..++++.++-.+.+|+.+  ++.+..|-...    .+..|
T Consensus       103 -~~~~~GvR~~l~-~~p~------------------~~~~a~~~r~~~~rL~~~gl~f--dl~~~~~ql~~~i~l~~~~P  160 (279)
T COG3618         103 -YPFFRGVRRNLH-VVPD------------------GLFEAPAWRANVERLAKLGLHF--DLQVDPHQLPDLIPLALKAP  160 (279)
T ss_pred             -ccccceeeehhh-cCCc------------------cchhhHHHHHHHHHHHhcCCeE--EEEeChhhhHHHHHHHhhCC
Confidence             344788886531 1121                  1133378999999999998876  44444443222    12222


Q ss_pred             CC-------CCCCcccccCCCCCCcc----ccCC-----CCCcCCCCChh-HHHHHHHHHHHHHhccCCCCCeEEEcCCc
Q psy14880        186 RY-------GLGDLIVCLNDPSWRAN----CIQP-----PCGQLNPINPH-VYTVLKDIYGELMGMAKYGDEMFHMGADE  248 (497)
Q Consensus       186 ~~-------~~~~l~~~~~~~~~~~~----~~~~-----~~~~L~~~~p~-t~~fl~~vl~E~~~lF~~~~~~iHiGgDE  248 (497)
                      +.       |.|+... ....+|...    ...|     -.+.+-.+.+. +++++..+..++++.|+  ...+-.|+|=
T Consensus       161 d~~~VldH~G~p~~~~-~~~~~w~~~m~~la~~pNv~~KlSG~~~~~~~~w~~~~v~p~~e~~i~~fg--~dR~vfGSdw  237 (279)
T COG3618         161 DVNFVLDHCGRPDIKI-NLEDPWKAALARLARRPNVWAKLSGVYAYSDESWTVEDVRPYVEELIELFG--WDRFVFGSDW  237 (279)
T ss_pred             CCCEEeccCCCCCccc-cccCHHHHHHHHHHhCCCeEEEEeeecccccCCCCHHHHHHHHHHHHHhcC--ccceEecCCC
Confidence            11       1110000 000122100    0000     11334445555 79999999999999999  8888899987


No 125
>PRK14333 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=22.78  E-value=1.3e+02  Score=32.44  Aligned_cols=52  Identities=12%  Similarity=0.155  Sum_probs=33.4

Q ss_pred             CccCChhHHHHHHHHHHHc--CceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880         97 AKVYTPLEVQELVHYAKLR--GVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR  166 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~r--G~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~  166 (497)
                      .+.|+.+++.++++..+..  |+.+...+ +-++++                 =|.+++++.++++++.+++
T Consensus       284 ~R~~t~e~~~~~i~~lr~~~p~i~i~~d~-IvGfPg-----------------ET~edf~~tl~~l~~~~~~  337 (448)
T PRK14333        284 ARGYTHEKYRRIIDKIREYMPDASISADA-IVGFPG-----------------ETEAQFENTLKLVEEIGFD  337 (448)
T ss_pred             CCCCCHHHHHHHHHHHHHhCCCcEEEeeE-EEECCC-----------------CCHHHHHHHHHHHHHcCCC
Confidence            3567888888888777665  33222111 112211                 5899999999999988764


No 126
>PRK15452 putative protease; Provisional
Probab=22.57  E-value=3e+02  Score=29.92  Aligned_cols=105  Identities=10%  Similarity=0.006  Sum_probs=0.0

Q ss_pred             cccccCCCCCCCccCChhHHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCC
Q psy14880         86 CLLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGI  165 (497)
Q Consensus        86 ~e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI  165 (497)
                      +..++|+.+..+..|+.+++++.++++..+|.++...+..-.+.                  --.+++.+.++...+.||
T Consensus        29 ~G~~~~~~R~~~~~f~~edl~eav~~ah~~g~kvyvt~n~i~~e------------------~el~~~~~~l~~l~~~gv   90 (443)
T PRK15452         29 AGQPRYSLRVRNNEFNHENLALGINEAHALGKKFYVVVNIAPHN------------------AKLKTFIRDLEPVIAMKP   90 (443)
T ss_pred             ECCCccchhhhccCCCHHHHHHHHHHHHHcCCEEEEEecCcCCH------------------HHHHHHHHHHHHHHhCCC


Q ss_pred             EEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHH
Q psy14880        166 RVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDI  227 (497)
Q Consensus       166 ~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~v  227 (497)
                      .=|-=-| ||-...+-+..|.+.      ...+.            ++|+.|..+.+|++++
T Consensus        91 DgvIV~d-~G~l~~~ke~~p~l~------ih~st------------qlni~N~~a~~f~~~l  133 (443)
T PRK15452         91 DALIMSD-PGLIMMVREHFPEMP------IHLSV------------QANAVNWATVKFWQQM  133 (443)
T ss_pred             CEEEEcC-HHHHHHHHHhCCCCe------EEEEe------------cccCCCHHHHHHHHHC


No 127
>TIGR00387 glcD glycolate oxidase, subunit GlcD. This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Probab=22.48  E-value=78  Score=33.82  Aligned_cols=28  Identities=7%  Similarity=0.139  Sum_probs=23.5

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEecCCcccc
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFELDAPAHA  177 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~  177 (497)
                      =|.+|+.+++++|++++|.|+|-  .-||.
T Consensus         5 ~s~eev~~iv~~a~~~~i~v~~~--G~Gt~   32 (413)
T TIGR00387         5 KNTEQVARILKLCHEHRIPIVPR--GAGTG   32 (413)
T ss_pred             CCHHHHHHHHHHHHHcCCcEEEE--CCCCC
Confidence            47899999999999999999993  44554


No 128
>KOG3257|consensus
Probab=22.37  E-value=46  Score=30.96  Aligned_cols=23  Identities=22%  Similarity=0.203  Sum_probs=18.6

Q ss_pred             CCHH-HHHHHHHHHHHcCCEEEEe
Q psy14880        148 YTPL-EVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       148 YT~~-ei~eiv~yA~~rgI~VIPE  170 (497)
                      .|-+ -+|-|+--|+..||.|+||
T Consensus       135 ~~lesi~rsiigtA~smGIkVvp~  158 (168)
T KOG3257|consen  135 TTLESICRSIIGTARSMGIKVVPP  158 (168)
T ss_pred             ccHHHHHHHHHHHHHhCccccchh
Confidence            4444 3688889999999999996


No 129
>PF00150 Cellulase:  Cellulase (glycosyl hydrolase family 5);  InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=22.32  E-value=84  Score=30.63  Aligned_cols=57  Identities=25%  Similarity=0.330  Sum_probs=37.2

Q ss_pred             HHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCCCCcccccCCCCCCccccCCCCCcCCCCChhHHHHHHHHHH
Q psy14880        150 PLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGLGDLIVCLNDPSWRANCIQPPCGQLNPINPHVYTVLKDIYG  229 (497)
Q Consensus       150 ~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~~~l~~~~~~~~~~~~~~~~~~~~L~~~~p~t~~fl~~vl~  229 (497)
                      .+.++++|+.|+++||.||..+-..    ..+             +.+             +......+...+.+++++.
T Consensus        61 ~~~ld~~v~~a~~~gi~vild~h~~----~~w-------------~~~-------------~~~~~~~~~~~~~~~~~~~  110 (281)
T PF00150_consen   61 LARLDRIVDAAQAYGIYVILDLHNA----PGW-------------ANG-------------GDGYGNNDTAQAWFKSFWR  110 (281)
T ss_dssp             HHHHHHHHHHHHHTT-EEEEEEEES----TTC-------------SSS-------------TSTTTTHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhCCCeEEEEeccC----ccc-------------ccc-------------ccccccchhhHHHHHhhhh
Confidence            4579999999999999999854333    110             000             1122345557777888888


Q ss_pred             HHHhccC
Q psy14880        230 ELMGMAK  236 (497)
Q Consensus       230 E~~~lF~  236 (497)
                      .++.-|.
T Consensus       111 ~la~~y~  117 (281)
T PF00150_consen  111 ALAKRYK  117 (281)
T ss_dssp             HHHHHHT
T ss_pred             hhccccC
Confidence            8888884


No 130
>PF06080 DUF938:  Protein of unknown function (DUF938);  InterPro: IPR010342 This family consists of several hypothetical proteins from both prokaryotes and eukaryotes. The function of this family is unknown.
Probab=22.25  E-value=50  Score=32.10  Aligned_cols=28  Identities=18%  Similarity=0.343  Sum_probs=24.9

Q ss_pred             HHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        152 EVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       152 ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |+.+|.+.|++.|.+.+--||||+--..
T Consensus       172 D~e~v~~lA~~~GL~l~~~~~MPANN~~  199 (204)
T PF06080_consen  172 DIEDVEALAAAHGLELEEDIDMPANNLL  199 (204)
T ss_pred             CHHHHHHHHHHCCCccCcccccCCCCeE
Confidence            6889999999999999999999986543


No 131
>cd07940 DRE_TIM_IPMS 2-isopropylmalate synthase (IPMS), N-terminal catalytic TIM barrel domain. 2-isopropylmalate synthase (IPMS) catalyzes an aldol-type condensation of acetyl-CoA and 2-oxoisovalerate yielding 2-isopropylmalate and CoA, the first committed step in leucine biosynthesis.  This family includes the Arabidopsis thaliana IPMS1 and IPMS2 proteins, the Glycine max GmN56 protein, and the Brassica insularis BatIMS protein.  This family also includes a group of archeal IPMS-like proteins represented by the Methanocaldococcus jannaschii AksA protein.  AksA catalyzes the condensation of alpha-ketoglutarate and acetyl-CoA to form trans-homoaconitate, one of 13 steps in the conversion of alpha-ketoglutarate and acetylCoA to alpha-ketosuberate, a precursor to coenzyme B and biotin.  AksA also catalyzes the condensation of alpha-ketoadipate or alpha-ketopimelate with acetylCoA to form, respectively, the (R)-homocitrate homologs (R)-2-hydroxy-1,2,5-pentanetricarboxylic acid and (R)-2-h
Probab=22.15  E-value=2.8e+02  Score=27.65  Aligned_cols=56  Identities=27%  Similarity=0.147  Sum_probs=35.1

Q ss_pred             HHHHHHHHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccch
Q psy14880        104 EVQELVHYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAG  178 (497)
Q Consensus       104 ~i~~~~~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~  178 (497)
                      .+++++++++..|.++.+..  |.                 ....|.+.+.++++-+.+.|+..|=--||-|.+.
T Consensus       115 ~~~~~i~~a~~~G~~v~~~~--~~-----------------~~~~~~~~~~~~~~~~~~~G~~~i~l~DT~G~~~  170 (268)
T cd07940         115 RAVEAVEYAKSHGLDVEFSA--ED-----------------ATRTDLDFLIEVVEAAIEAGATTINIPDTVGYLT  170 (268)
T ss_pred             HHHHHHHHHHHcCCeEEEee--ec-----------------CCCCCHHHHHHHHHHHHHcCCCEEEECCCCCCCC
Confidence            45567777777776654221  11                 1226777778888877777777666666666554


No 132
>smart00642 Aamy Alpha-amylase domain.
Probab=22.01  E-value=2.4e+02  Score=26.20  Aligned_cols=29  Identities=34%  Similarity=0.516  Sum_probs=23.9

Q ss_pred             ChhHHHHHHHHHHHcCceeeeecCCCCCCC
Q psy14880        101 TPLEVQELVHYAKLRGVRVIFELDAPAHAG  130 (497)
Q Consensus       101 ~~~~i~~~~~y~~~rG~ri~~e~~~p~~~~  130 (497)
                      +.++++++++-+..+|++|+..+. ++|++
T Consensus        68 t~~d~~~lv~~~h~~Gi~vilD~V-~NH~~   96 (166)
T smart00642       68 TMEDFKELVDAAHARGIKVILDVV-INHTS   96 (166)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEEC-CCCCC
Confidence            457899999999999999988775 66753


No 133
>cd06840 PLPDE_III_Bif_AspK_DapDC Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Bifunctional Aspartate Kinase/Diaminopimelate Decarboxylase. Bifunctional aspartate kinase/diaminopimelate decarboxylase (AspK/DapDC, EC 4.1.1.20/EC 2.7.2.4) typically exists in bacteria. These proteins contain an N-terminal AspK region and a C-terminal DapDC region, which contains a PLP-binding TIM-barrel domain followed by beta-sandwich domain, characteristic of fold type III PLP-dependent enzymes. Members of this subfamily have not been fully characterized. Based on their sequence, these proteins may catalyze both reactions catalyzed by AspK and DapDC. AspK catalyzes the phosphorylation of L-aspartate to produce 4-phospho-L-aspartate while DapDC participates in the last step of lysine biosynthesis, the conversion of meso-2,6-diaminoheptanedioate to L-lysine.
Probab=21.88  E-value=5.8e+02  Score=26.66  Aligned_cols=21  Identities=38%  Similarity=0.316  Sum_probs=18.5

Q ss_pred             CCHHHHHHHHHHHHHcCCEEE
Q psy14880        148 YTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      .+.+|+.++++.+++.|++|+
T Consensus       153 ~~~~~~~~~l~~~~~~~l~l~  173 (368)
T cd06840         153 LDVDELDEARDLAKKAGIIVI  173 (368)
T ss_pred             CCHHHHHHHHHHHHhCCCcEE
Confidence            688999999999999888765


No 134
>cd07265 2_3_CTD_N N-terminal domain of catechol 2,3-dioxygenase. This subfamily contains the N-terminal, non-catalytic, domain of catechol 2,3-dioxygenase. Catechol 2,3-dioxygenase  (2,3-CTD, catechol:oxygen 2,3-oxidoreductase) catalyzes an extradiol cleavage of catechol to form 2-hydroxymuconate semialdehyde with the insertion of two atoms of oxygen. The enzyme is a homotetramer and contains catalytically essential Fe(II) . The reaction proceeds by an ordered bi-unit mechanism. First, catechol binds to the enzyme, this is then followed by the binding of dioxygen to form a tertiary complex, and then the aromatic ring is cleaved to produce 2-hydroxymuconate semialdehyde. Catechol 2,3-dioxygenase belongs to the type I extradiol dioxygenase family. The subunit comprises the N- and C-terminal domains of similar structure fold, resulting from an ancient gene duplication. The active site is located in a funnel-shaped space of the C-terminal domain. This subfamily represents the N-terminal do
Probab=21.79  E-value=91  Score=26.24  Aligned_cols=32  Identities=16%  Similarity=0.152  Sum_probs=24.7

Q ss_pred             CCCHHHHHHHHHHHHHcCCEEE--EecCCcccch
Q psy14880        147 VYTPLEVQELVHYAKLRGIRVI--FELDAPAHAG  178 (497)
Q Consensus       147 ~YT~~ei~eiv~yA~~rgI~VI--PEID~PGH~~  178 (497)
                      +-+.+++.++.+..+++|++|+  |+-..||+..
T Consensus        69 v~~~~dv~~~~~~l~~~G~~~~~~~~~~~~~~~~  102 (122)
T cd07265          69 VLDDADLEKLEARLQAYGVAVERIPAGELPGVGR  102 (122)
T ss_pred             eCCHHHHHHHHHHHHHCCCcEEEcccCCCCCCce
Confidence            4578899999999999999987  4444565433


No 135
>COG2008 GLY1 Threonine aldolase [Amino acid transport and metabolism]
Probab=21.73  E-value=77  Score=33.25  Aligned_cols=23  Identities=22%  Similarity=0.355  Sum_probs=20.6

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEE
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRV  167 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~V  167 (497)
                      +..||.+|+++|.++|+++||-+
T Consensus       144 GtVy~l~el~~i~~~~k~~~l~L  166 (342)
T COG2008         144 GTVYPLDELEAISAVCKEHGLPL  166 (342)
T ss_pred             ceecCHHHHHHHHHHHHHhCCce
Confidence            45799999999999999999765


No 136
>COG3977 Alanine-alpha-ketoisovalerate (or valine-pyruvate) aminotransferase [Amino acid transport and metabolism]
Probab=21.70  E-value=79  Score=32.79  Aligned_cols=25  Identities=32%  Similarity=0.414  Sum_probs=22.9

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      +.++|.+|++.|.+.|+++||-+|-
T Consensus       193 GNVlTdeE~~kldalA~~~giPliI  217 (417)
T COG3977         193 GNVLTDEELAKLDALARQHGIPLII  217 (417)
T ss_pred             CCcccHHHHHHHHHHhhhcCCcEEE
Confidence            5679999999999999999999885


No 137
>cd03327 MR_like_2 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 2. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues,  a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown.
Probab=21.59  E-value=1.5e+02  Score=30.69  Aligned_cols=27  Identities=0%  Similarity=-0.196  Sum_probs=21.0

Q ss_pred             eeeeccCCCCCCHHHHHHhhhhhhcccc
Q psy14880         31 TKLIVWSSHLTDPEIILNYLDPQTCGTQ   58 (497)
Q Consensus        31 G~mlDvaR~~~~~~~lk~~id~ma~~K~   58 (497)
                      .+|+|.-.. ++++.-.++++.|+.|.+
T Consensus       173 ~l~vDan~~-~~~~~A~~~~~~l~~~~~  199 (341)
T cd03327         173 DLMLDCYMS-WNLNYAIKMARALEKYEL  199 (341)
T ss_pred             cEEEECCCC-CCHHHHHHHHHHhhhcCC
Confidence            589998755 577888888888888764


No 138
>COG0277 GlcD FAD/FMN-containing dehydrogenases [Energy production and conversion]
Probab=21.58  E-value=86  Score=33.37  Aligned_cols=30  Identities=20%  Similarity=0.205  Sum_probs=25.5

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      -|.+|+.+++++|.++||.|+|-  .-||+..
T Consensus        39 ~s~~eV~~iv~~a~~~~~~v~pr--G~gts~~   68 (459)
T COG0277          39 KSEEEVAAILRLANENGIPVVPR--GGGTSLS   68 (459)
T ss_pred             CCHHHHHHHHHHHHHcCCeEEEE--CCCCCcc
Confidence            58999999999999999999994  5556554


No 139
>KOG0256|consensus
Probab=21.38  E-value=2.4e+02  Score=30.57  Aligned_cols=114  Identities=17%  Similarity=0.173  Sum_probs=68.9

Q ss_pred             eeeeccCCCC-CCHHHHHHhhhhhhcccccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHH
Q psy14880         31 TKLIVWSSHL-TDPEIILNYLDPQTCGTQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELV  109 (497)
Q Consensus        31 G~mlDvaR~~-~~~~~lk~~id~ma~~K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~  109 (497)
                      -+.++.+-+| ++++.+..-+.+.-...+++..|             =++.        |.-|   -+..|+++++.+++
T Consensus       197 pv~c~Ss~~f~itv~alE~A~~~A~~~~~kVkGv-------------litN--------PsNP---LG~~~~~e~L~~ll  252 (471)
T KOG0256|consen  197 PVHCSSSNGFQITVEALEAALNQARKLGLKVKGV-------------LITN--------PSNP---LGTTLSPEELISLL  252 (471)
T ss_pred             EEEeecCCCccccHHHHHHHHHHHHHhCCceeEE-------------EEeC--------CCCC---CCCccCHHHHHHHH
Confidence            3677777787 78888887776655554333322             1222        3333   35679999999999


Q ss_pred             HHHHHcCceeeeecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhcCCCCC
Q psy14880        110 HYAKLRGVRVIFELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWGPRYGL  189 (497)
Q Consensus       110 ~y~~~rG~ri~~e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~p~~~~  189 (497)
                      .|++...+|++...-   +.++               ++.+.+++-+.++++...+       .|.-..-++.--+++|+
T Consensus       253 ~Fa~~kniHvI~DEI---ya~s---------------VF~~~~F~Sv~ev~~~~~~-------~~~rvHivyslSKD~Gl  307 (471)
T KOG0256|consen  253 NFASRKNIHVISDEI---YAGS---------------VFDKSEFRSVLEVRKDPHL-------DPDRVHIVYSLSKDFGL  307 (471)
T ss_pred             HHHhhcceEEEeehh---hccc---------------ccCccCceEHHHHhhcccc-------CCCcEEEEEEeccccCC
Confidence            999999999875332   2221               2444455555555555443       33333344444567777


Q ss_pred             CCcc
Q psy14880        190 GDLI  193 (497)
Q Consensus       190 ~~l~  193 (497)
                      |.++
T Consensus       308 pGfR  311 (471)
T KOG0256|consen  308 PGFR  311 (471)
T ss_pred             CceE
Confidence            7765


No 140
>COG0621 MiaB 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]
Probab=21.27  E-value=1.4e+02  Score=32.53  Aligned_cols=92  Identities=13%  Similarity=0.093  Sum_probs=51.6

Q ss_pred             CCHHHHHHhhhhhhcc-cccccchhhHHhhhhceeEEeccCCcccccccccCCCCCCCccCChhHHHHHHHHHHHcCcee
Q psy14880         41 TDPEIILNYLDPQTCG-TQQVVREPLLLVTLMALCRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQELVHYAKLRGVRV  119 (497)
Q Consensus        41 ~~~~~lk~~id~ma~~-K~~~~~~~~~~~~~~n~lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~~~y~~~rG~ri  119 (497)
                      +|.+..-++|+.|+.. |+ +..|||+||        |-+| .-    +.++     .+.||.++..++++--+..--.+
T Consensus       236 ~P~~~~d~lI~~~~~~~kv-~~~lHlPvQ--------sGsd-~I----Lk~M-----~R~yt~e~~~~~i~k~R~~~Pd~  296 (437)
T COG0621         236 HPLEFTDDLIEAIAETPKV-CPHLHLPVQ--------SGSD-RI----LKRM-----KRGYTVEEYLEIIEKLRAARPDI  296 (437)
T ss_pred             CchhcCHHHHHHHhcCCcc-cccccCccc--------cCCH-HH----HHHh-----CCCcCHHHHHHHHHHHHHhCCCc
Confidence            4477778899999996 65 333333333        2222 10    1222     46789888888765433210000


Q ss_pred             eeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        120 IFELD-APAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       120 ~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      .+-.| +-+++|                 =|.+|+.+.++..++-+..-+
T Consensus       297 ~i~tDiIVGFPg-----------------ETeedFe~tl~lv~e~~fd~~  329 (437)
T COG0621         297 AISTDIIVGFPG-----------------ETEEDFEETLDLVEEVRFDRL  329 (437)
T ss_pred             eEeccEEEECCC-----------------CCHHHHHHHHHHHHHhCCCEE
Confidence            01111 122332                 489999999999999876543


No 141
>COG1217 TypA Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]
Probab=21.26  E-value=56  Score=35.81  Aligned_cols=22  Identities=36%  Similarity=0.555  Sum_probs=17.6

Q ss_pred             HHHHHcCCEEE-------------EecCCcccchh
Q psy14880        158 HYAKLRGIRVI-------------FELDAPAHAGN  179 (497)
Q Consensus       158 ~yA~~rgI~VI-------------PEID~PGH~~a  179 (497)
                      ..-++|||+|.             ==+|+|||+..
T Consensus        47 DlEkERGITILaKnTav~~~~~~INIvDTPGHADF   81 (603)
T COG1217          47 DLEKERGITILAKNTAVNYNGTRINIVDTPGHADF   81 (603)
T ss_pred             chhhhcCcEEEeccceeecCCeEEEEecCCCcCCc
Confidence            35689999975             45899999975


No 142
>PRK02308 uvsE putative UV damage endonuclease; Provisional
Probab=21.22  E-value=4.7e+02  Score=26.85  Aligned_cols=56  Identities=21%  Similarity=0.443  Sum_probs=34.8

Q ss_pred             HHHHHHHHHHHcCceee-eecC-CCC--CCC-CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        104 EVQELVHYAKLRGVRVI-FELD-APA--HAG-NGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       104 ~i~~~~~y~~~rG~ri~-~e~~-~p~--~~~-~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      ++.++++|...+||+.. ..++ .|-  |+. ..|.          ..-..+++++++-+++++.||+++-
T Consensus        49 ~l~~~l~~~~~~~I~~~R~sS~l~P~~~h~~~~~w~----------~~~~~~~~~~~~g~~~~~~~irls~  109 (303)
T PRK02308         49 NLLRILKYNIAHGIGLFRLSSSLIPLATHPELEGWD----------YIEPFKEELREIGEFIKEHNIRLSF  109 (303)
T ss_pred             HHHHHHHHHHHCCCCEEEcccCcCCCCCChhhcccC----------CCCCCHHHHHHHHHHHHHcCCCeec
Confidence            45677788888888642 1111 121  210 0121          2337899999999999999987653


No 143
>PRK14334 (dimethylallyl)adenosine tRNA methylthiotransferase; Provisional
Probab=21.21  E-value=1.3e+02  Score=32.42  Aligned_cols=52  Identities=17%  Similarity=0.118  Sum_probs=28.6

Q ss_pred             ccCChhHHHHHHHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcCCE
Q psy14880         98 KVYTPLEVQELVHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRGIR  166 (497)
Q Consensus        98 ~~y~~~~i~~~~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rgI~  166 (497)
                      +.|+.+++.+.++..+..|..+.+..+ +-+.++                 =|.+++.+.++++++.++.
T Consensus       267 R~~~~~~~~~~v~~lr~~~~~i~i~~d~IvG~Pg-----------------Et~ed~~~tl~~i~~l~~~  319 (440)
T PRK14334        267 REYRREKYLERIAEIREALPDVVLSTDIIVGFPG-----------------ETEEDFQETLSLYDEVGYD  319 (440)
T ss_pred             CCCCHHHHHHHHHHHHHhCCCcEEEEeEEEECCC-----------------CCHHHHHHHHHHHHhcCCC
Confidence            456667777776666655433211111 111111                 4777888888887776643


No 144
>PRK05942 aspartate aminotransferase; Provisional
Probab=21.17  E-value=94  Score=32.54  Aligned_cols=23  Identities=17%  Similarity=0.196  Sum_probs=21.1

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      ..+|++++++|++.|+++||.||
T Consensus       185 ~~~s~~~~~~i~~~a~~~~~~iI  207 (394)
T PRK05942        185 ATAPREFFEEIVAFARKYEIMLV  207 (394)
T ss_pred             CcCCHHHHHHHHHHHHHcCeEEE
Confidence            45999999999999999999877


No 145
>PRK11194 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=21.16  E-value=2.8e+02  Score=29.46  Aligned_cols=72  Identities=15%  Similarity=0.047  Sum_probs=42.7

Q ss_pred             EEeccCCcccccccccCCCCCCCccCChhHHH-HHHHHHHHcC---ceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCC
Q psy14880         75 RWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQ-ELVHYAKLRG---VRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYT  149 (497)
Q Consensus        75 hlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~-~~~~y~~~rG---~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT  149 (497)
                      -+|..|++--.--.|.      ++.|..+++. .+.+|.+..|   -+|.+++. +|+..                  =|
T Consensus       221 SLha~d~e~R~~lmPi------n~~~~l~~ll~a~~~y~~~~~~~~rrI~irypLIpGvN------------------Ds  276 (372)
T PRK11194        221 SLHAPNDELRDEIVPI------NKKYNIETFLAAVRRYLEKSNANQGRVTVEYVMLDHVN------------------DG  276 (372)
T ss_pred             eccCCCHHHHHHhcCC------cccccHHHHHHHHHHHHHhcccCCCeEEEEEEeECCCC------------------CC
Confidence            5787776543322331      2334434333 3346766554   46666654 34431                  58


Q ss_pred             HHHHHHHHHHHHHc--CCEEEEe
Q psy14880        150 PLEVQELVHYAKLR--GIRVIFE  170 (497)
Q Consensus       150 ~~ei~eiv~yA~~r--gI~VIPE  170 (497)
                      .+++++|+++++..  +|++||-
T Consensus       277 ~e~a~~La~ll~~l~~~VnLIPY  299 (372)
T PRK11194        277 TEHAHQLAELLKDTPCKINLIPW  299 (372)
T ss_pred             HHHHHHHHHHHhcCCceEEEecC
Confidence            89999999999987  6667763


No 146
>PRK09505 malS alpha-amylase; Reviewed
Probab=20.97  E-value=93  Score=35.79  Aligned_cols=30  Identities=37%  Similarity=0.665  Sum_probs=26.3

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|+++||+-|.+|||+||-.+ ++-|+..
T Consensus       290 t~~dfk~Lv~~aH~~Gi~VilD~-V~NH~~~  319 (683)
T PRK09505        290 TEADLRTLVDEAHQRGIRILFDV-VMNHTGY  319 (683)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEE-CcCCCcc
Confidence            78999999999999999999876 4678773


No 147
>PF00155 Aminotran_1_2:  Aminotransferase class I and II 1-aminocyclopropane-1-carboxylate synthase signature aspartate aminotransferase signature;  InterPro: IPR004839 Aminotransferases share certain mechanistic features with other pyridoxal-phosphate dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine residue. On the basis of sequence similarity, these various enzymes can be grouped [] into class I and class II. This entry includes proteins from both subfamilies.; GO: 0016769 transferase activity, transferring nitrogenous groups, 0030170 pyridoxal phosphate binding, 0009058 biosynthetic process; PDB: 3NRA_B 3P6K_B 3OP7_A 3ASB_A 3ASA_A 1W7M_A 3FVX_A 1W7N_A 3FVU_B 3FVS_A ....
Probab=20.94  E-value=95  Score=31.67  Aligned_cols=24  Identities=29%  Similarity=0.343  Sum_probs=19.8

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      ..+|.+++++|++.|+++|+-||-
T Consensus       163 ~~~~~~~l~~l~~~~~~~~~~ii~  186 (363)
T PF00155_consen  163 SVLSLEELRELAELAREYNIIIIV  186 (363)
T ss_dssp             BB--HHHHHHHHHHHHHTTSEEEE
T ss_pred             cccccccccchhhhhcccccceee
Confidence            359999999999999999998884


No 148
>PRK14454 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=20.94  E-value=2.4e+02  Score=29.49  Aligned_cols=72  Identities=18%  Similarity=0.149  Sum_probs=41.7

Q ss_pred             eEEeccCCcccccccccCCCCCCCccCChhHHHHH-HHHHHHcCceeeeecC-CCCCCCCCCCCCCCCCCCCCCCCCCHH
Q psy14880         74 CRWQMCTRQQVPCLLPRFGAYSPAKVYTPLEVQEL-VHYAKLRGVRVIFELD-APAHAGNGWQWGPRFGAYSPAKVYTPL  151 (497)
Q Consensus        74 lhlHl~D~q~~~~e~~~~g~~~~~~~y~~~~i~~~-~~y~~~rG~ri~~e~~-~p~~~~~~~~~~~~~g~~s~~~~YT~~  151 (497)
                      +-+|..|++-.+.-.|      .+..|..+++.+. .+|.+..|-|+.+++. +|+..                  =|.+
T Consensus       212 isLka~d~e~r~~l~p------v~~~~~L~~l~~~~~~~~~~~~~rv~iey~LI~gvN------------------Ds~e  267 (342)
T PRK14454        212 ISLHAPNDELRKKMMP------IANKYSIEELIEACKYYINKTNRRITFEYALVKGVN------------------DSKE  267 (342)
T ss_pred             EecCCCCHHHHHHhcC------CcccCCHHHHHHHHHHHHHHhCCEEEEEEEeECCCC------------------CCHH
Confidence            5567777654332222      1234554555443 3556677777776665 24321                  4788


Q ss_pred             HHHHHHHHHHHc--CCEEEE
Q psy14880        152 EVQELVHYAKLR--GIRVIF  169 (497)
Q Consensus       152 ei~eiv~yA~~r--gI~VIP  169 (497)
                      ++++|.++.+.+  .|++||
T Consensus       268 da~~La~llk~l~~~VnLiP  287 (342)
T PRK14454        268 DAKELGKLLKGMLCHVNLIP  287 (342)
T ss_pred             HHHHHHHHHhcCCceEEEEe
Confidence            999999998865  344444


No 149
>PF09711 Cas_Csn2:  CRISPR-associated protein (Cas_Csn2);  InterPro: IPR010146 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny.  This entry represents the Csn2 family of Cas proteins, which are found only in CRISPR-containing species, near other CRISPR-associated proteins (cas), as part of the NMENI subtype of CRISPR/Cas loci. The species range so far for this subtype is animal pathogens and commensals only. This protein is present in some but not all NMENI CRISPR/Cas loci.; PDB: 3TOC_A 3QHQ_A 3S5U_C.
Probab=20.87  E-value=1e+02  Score=29.63  Aligned_cols=25  Identities=40%  Similarity=0.559  Sum_probs=21.4

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      ..|+|++|++++++|+.-.+|.|+-
T Consensus       140 ~~YLT~eEl~el~e~i~~~~i~VL~  164 (188)
T PF09711_consen  140 RSYLTEEELQELYEYIKYNKIKVLF  164 (188)
T ss_dssp             GGGS-HHHHHHHHHHHHHTTSEEEE
T ss_pred             HHhcCHHHHHHHHHHHHHhCCeEEE
Confidence            3569999999999999999999874


No 150
>COG2100 Predicted Fe-S oxidoreductase [General function prediction only]
Probab=20.76  E-value=1.7e+02  Score=30.62  Aligned_cols=48  Identities=23%  Similarity=0.369  Sum_probs=37.4

Q ss_pred             CccCChhHHHHHHHHHHHcCceeee-ecCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHcC
Q psy14880         97 AKVYTPLEVQELVHYAKLRGVRVIF-ELDAPAHAGNGWQWGPRFGAYSPAKVYTPLEVQELVHYAKLRG  164 (497)
Q Consensus        97 ~~~y~~~~i~~~~~y~~~rG~ri~~-e~~~p~~~~~~~~~~~~~g~~s~~~~YT~~ei~eiv~yA~~rg  164 (497)
                      .+-|.-+.+.++.+|....|+.+.. +.-.|+                    |..+|+..||++|++-|
T Consensus       236 ~~dYdv~kvle~aE~i~~a~idvlIaPv~lPG--------------------~ND~E~~~iIe~A~~iG  284 (414)
T COG2100         236 RKDYDVKKVLEVAEYIANAGIDVLIAPVWLPG--------------------VNDDEMPKIIEWAREIG  284 (414)
T ss_pred             ccccCHHHHHHHHHHHHhCCCCEEEeeeecCC--------------------cChHHHHHHHHHHHHhC
Confidence            4567778889999998888887642 333332                    88899999999999998


No 151
>cd07943 DRE_TIM_HOA 4-hydroxy-2-oxovalerate aldolase, N-terminal catalytic TIM barrel domain. 4-hydroxy 2-ketovalerate aldolase  (Also known as 4-hydroxy-2-ketovalerate aldolase and 4-hydroxy-2-oxopentanoate aldolase (HOA)) converts 4-hydroxy-2-oxopentanoate to acetaldehyde and pyruvate, the penultimate step in the meta-cleavage pathway for the degradation of phenols, cresols and catechol.  This family includes the Escherichia coli MhpE aldolase, the Pseudomonas DmpG aldolase, and the Burkholderia xenovorans BphI pyruvate aldolase.  In Pseudomonas, the DmpG aldolase tightly associates with a dehydrogenase (DmpF ) and is inactive without it.  HOA has a canonical TIM-barrel fold with a C-terminal extension that forms a funnel leading to the active site.  This family belongs to the DRE-TIM metallolyase superfamily.  DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate
Probab=20.61  E-value=2.4e+02  Score=27.96  Aligned_cols=20  Identities=20%  Similarity=0.154  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHcCCEEEEec
Q psy14880        152 EVQELVHYAKLRGIRVIFEL  171 (497)
Q Consensus       152 ei~eiv~yA~~rgI~VIPEI  171 (497)
                      .+++.+++|+++|++|..-+
T Consensus       113 ~~~~~i~~ak~~G~~v~~~~  132 (263)
T cd07943         113 VSEQHIGAARKLGMDVVGFL  132 (263)
T ss_pred             HHHHHHHHHHHCCCeEEEEE
Confidence            45566666666666555554


No 152
>PRK09427 bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase; Provisional
Probab=20.60  E-value=1.2e+02  Score=33.00  Aligned_cols=39  Identities=15%  Similarity=0.163  Sum_probs=34.5

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhhhhhc
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNGWQWG  184 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~l~~~  184 (497)
                      ..++++++++|+++|.++|++++-|+...-....++.+.
T Consensus       141 ~~L~~~~l~~l~~~a~~lGl~~lvEvh~~~El~~al~~~  179 (454)
T PRK09427        141 SVLDDEQYRQLAAVAHSLNMGVLTEVSNEEELERAIALG  179 (454)
T ss_pred             HhCCHHHHHHHHHHHHHcCCcEEEEECCHHHHHHHHhCC
Confidence            348999999999999999999999999988888877654


No 153
>PF01408 GFO_IDH_MocA:  Oxidoreductase family, NAD-binding Rossmann fold;  InterPro: IPR000683 This group of enzymes utilise NADP or NAD, and is known as the GFO/IDH/MOCA family in UniProtKB/Swiss-Prot. GFO is a glucose--fructose oxidoreductase, which converts D-glucose and D-fructose into D-gluconolactone and D-glucitol in the sorbitol-gluconate pathway. MOCA is a rhizopine catabolism protein which may catalyse the NADH-dependent dehydrogenase reaction involved in rhizopine catabolism. Other proteins belonging to this family include Gal80, a negative regulator for the expression of lactose and galactose metabolic genes; and several hypothetical proteins from yeast, Escherichia coli and Bacillus subtilis.  The oxidoreductase, N-terminal domain is almost always associated with the oxidoreductase, C-terminal domain (see IPR004104 from INTERPRO).; GO: 0016491 oxidoreductase activity; PDB: 1LC0_A 1LC3_A 1GCU_A 3IP3_E 3CEA_C 3EVN_A 3NTQ_A 3NTR_B 3NT5_A 3MZ0_A ....
Probab=20.60  E-value=2.6e+02  Score=23.43  Aligned_cols=21  Identities=43%  Similarity=0.569  Sum_probs=19.5

Q ss_pred             CCHHHHHHHHHHHHHcCCEEE
Q psy14880        148 YTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      .|.+|.++|++.|++.|+.|.
T Consensus        97 ~~~~~~~~l~~~a~~~~~~~~  117 (120)
T PF01408_consen   97 LTLEEAEELVEAAKEKGVKVM  117 (120)
T ss_dssp             SSHHHHHHHHHHHHHHTSCEE
T ss_pred             CCHHHHHHHHHHHHHhCCEEE
Confidence            799999999999999998875


No 154
>PLN00145 tyrosine/nicotianamine aminotransferase; Provisional
Probab=20.55  E-value=1e+02  Score=33.02  Aligned_cols=24  Identities=29%  Similarity=0.360  Sum_probs=21.5

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIF  169 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIP  169 (497)
                      -.|+++++++|++.|+++||-||=
T Consensus       205 ~v~~~~~l~~i~~~a~~~~i~ii~  228 (430)
T PLN00145        205 SVYSYEHLAKIAETARKLGILVIA  228 (430)
T ss_pred             CCCCHHHHHHHHHHHHHcCCEEEE
Confidence            459999999999999999998773


No 155
>PRK09147 succinyldiaminopimelate transaminase; Provisional
Probab=20.47  E-value=97  Score=32.40  Aligned_cols=23  Identities=13%  Similarity=0.124  Sum_probs=21.0

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      ..||++++++|++.|++++|.||
T Consensus       181 ~~~s~~~~~~l~~~a~~~~~~ii  203 (396)
T PRK09147        181 AVLPLDDWKKLFALSDRYGFVIA  203 (396)
T ss_pred             ccCCHHHHHHHHHHHHHcCeEEE
Confidence            45999999999999999999877


No 156
>KOG4175|consensus
Probab=20.44  E-value=1.7e+02  Score=28.68  Aligned_cols=24  Identities=25%  Similarity=0.346  Sum_probs=20.9

Q ss_pred             CCHHHHHHHHHHHHHcCCEEEEec
Q psy14880        148 YTPLEVQELVHYAKLRGIRVIFEL  171 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI~VIPEI  171 (497)
                      +-.+|-..+-.+|+++||.+||-+
T Consensus       132 lPpEEa~~~Rne~~k~gislvpLv  155 (268)
T KOG4175|consen  132 LPPEEAETLRNEARKHGISLVPLV  155 (268)
T ss_pred             CChHHHHHHHHHHHhcCceEEEee
Confidence            556888889999999999999964


No 157
>PF08671 SinI:  Anti-repressor SinI;  InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=20.42  E-value=2.5e+02  Score=18.78  Aligned_cols=13  Identities=15%  Similarity=0.485  Sum_probs=9.6

Q ss_pred             CCHHHHHHHHHHH
Q psy14880        148 YTPLEVQELVHYA  160 (497)
Q Consensus       148 YT~~ei~eiv~yA  160 (497)
                      .|++|||+.+++-
T Consensus        17 ls~eeir~FL~~~   29 (30)
T PF08671_consen   17 LSKEEIREFLEFN   29 (30)
T ss_dssp             --HHHHHHHHHHH
T ss_pred             CCHHHHHHHHHhC
Confidence            8999999998764


No 158
>TIGR01290 nifB nitrogenase cofactor biosynthesis protein NifB. This model describes NifB, a protein required for the biosynthesis of the iron-molybdenum (or iron-vanadium) cofactor used by the nitrogen-fixing enzyme nitrogenase. Archaeal homologs lack the most C-terminal region and score between the trusted and noise cutoffs of this model.
Probab=20.38  E-value=1.9e+02  Score=31.35  Aligned_cols=29  Identities=24%  Similarity=0.269  Sum_probs=24.3

Q ss_pred             CCHHHHHHHHHHHHHcCC---EEEEecCCccc
Q psy14880        148 YTPLEVQELVHYAKLRGI---RVIFELDAPAH  176 (497)
Q Consensus       148 YT~~ei~eiv~yA~~rgI---~VIPEID~PGH  176 (497)
                      ++.+++.+|+++++++|+   .|+|-+..|-|
T Consensus       200 iND~~i~~l~~~~~~lg~~~~nl~p~~~~p~~  231 (442)
T TIGR01290       200 INDEHLVEVSKQVKELGAFLHNVMPLISAPEH  231 (442)
T ss_pred             cCHHHHHHHHHHHHhCCCcEEEeecCCCcccc
Confidence            556899999999999985   57888888766


No 159
>PRK09082 methionine aminotransferase; Validated
Probab=20.36  E-value=95  Score=32.40  Aligned_cols=23  Identities=17%  Similarity=0.281  Sum_probs=21.0

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      ..||++++++|++.|++++|-||
T Consensus       178 ~~~~~~~~~~i~~~a~~~~i~li  200 (386)
T PRK09082        178 TVWSAADMRALWQLIAGTDIYVL  200 (386)
T ss_pred             cCCCHHHHHHHHHHHHHCCEEEE
Confidence            45899999999999999999877


No 160
>COG0826 Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]
Probab=20.34  E-value=1.4e+02  Score=31.33  Aligned_cols=35  Identities=17%  Similarity=0.210  Sum_probs=32.0

Q ss_pred             CCCCHHHHHHHHHHHHHcCCEEEEecCCcccchhh
Q psy14880        146 KVYTPLEVQELVHYAKLRGIRVIFELDAPAHAGNG  180 (497)
Q Consensus       146 ~~YT~~ei~eiv~yA~~rgI~VIPEID~PGH~~a~  180 (497)
                      ..+|.+|++|+|+||.++|.++.=-+++.=|....
T Consensus        44 ~nfs~~~l~e~i~~ah~~gkk~~V~~N~~~~~~~~   78 (347)
T COG0826          44 LNFSVEDLAEAVELAHSAGKKVYVAVNTLLHNDEL   78 (347)
T ss_pred             ccCCHHHHHHHHHHHHHcCCeEEEEeccccccchh
Confidence            46999999999999999999999999999888764


No 161
>TIGR02456 treS_nterm trehalose synthase. Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PubMed:15378530).
Probab=20.21  E-value=99  Score=34.31  Aligned_cols=30  Identities=33%  Similarity=0.535  Sum_probs=26.0

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEecCCcccchh
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFELDAPAHAGN  179 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPEID~PGH~~a  179 (497)
                      |.+|+++||+-|.++||+||=.+ ++.|+..
T Consensus        74 t~~df~~Lv~~ah~~Gi~vilD~-V~NH~s~  103 (539)
T TIGR02456        74 TIDDFKDFVDEAHARGMRVIIDL-VLNHTSD  103 (539)
T ss_pred             CHHHHHHHHHHHHHCCCEEEEEe-ccCcCCC
Confidence            78999999999999999999875 5777654


No 162
>TIGR03542 DAPAT_plant LL-diaminopimelate aminotransferase. This clade of the pfam00155 superfamily of aminotransferases includes several which are adjacent to elements of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). This clade includes characterized species in plants and Chlamydia. Every member of this clade is from a genome which possesses most of the lysine biosynthesis pathway but lacks any of the known succinylases, desuccinylases, acetylases or deacetylases typical of the acylated versions of this pathway nor do they have the direct, NADPH-dependent enzyme (ddh).
Probab=20.16  E-value=1.2e+02  Score=32.00  Aligned_cols=24  Identities=33%  Similarity=0.512  Sum_probs=21.5

Q ss_pred             CCCCCHHHHHHHHHHHHHcCCEEE
Q psy14880        145 AKVYTPLEVQELVHYAKLRGIRVI  168 (497)
Q Consensus       145 ~~~YT~~ei~eiv~yA~~rgI~VI  168 (497)
                      +..||++++++|++.|+++++-||
T Consensus       187 G~~~s~~~~~~l~~~a~~~~~~iI  210 (402)
T TIGR03542       187 GTVLTKEQLKELVDYANEHGSLIL  210 (402)
T ss_pred             CccCCHHHHHHHHHHHHHcCeEEE
Confidence            456999999999999999999776


No 163
>cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion.
Probab=20.04  E-value=1.8e+02  Score=28.79  Aligned_cols=22  Identities=32%  Similarity=0.481  Sum_probs=19.5

Q ss_pred             CHHHHHHHHHHHHHcCCEEEEe
Q psy14880        149 TPLEVQELVHYAKLRGIRVIFE  170 (497)
Q Consensus       149 T~~ei~eiv~yA~~rgI~VIPE  170 (497)
                      .-.+.+.+.++|+.+||.++|.
T Consensus       212 Git~~~~i~~~a~~~gi~~~~~  233 (263)
T cd03320         212 GPRALLELAEEARARGIPAVVS  233 (263)
T ss_pred             CHHHHHHHHHHHHHcCCCEEEE
Confidence            3467999999999999999996


Done!