Query psy11534
Match_columns 81
No_of_seqs 101 out of 1036
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 20:52:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11534.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11534hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd06601 GH31_lyase_GLase GLase 100.0 5.6E-31 1.2E-35 193.4 7.4 77 1-80 175-253 (332)
2 cd06596 GH31_CPE1046 CPE1046 i 100.0 1.1E-28 2.3E-33 176.8 7.2 75 2-80 130-204 (261)
3 cd06599 GH31_glycosidase_Aec37 99.9 1E-27 2.2E-32 174.3 7.5 77 1-80 196-272 (317)
4 PLN02763 hydrolase, hydrolyzin 99.9 1.1E-27 2.3E-32 193.9 7.6 77 1-80 374-450 (978)
5 cd06589 GH31 The enzymes of gl 99.9 1.5E-27 3.2E-32 169.4 7.4 76 1-79 144-219 (265)
6 cd06600 GH31_MGAM-like This fa 99.9 1.6E-27 3.5E-32 173.3 7.4 76 2-80 174-249 (317)
7 cd06598 GH31_transferase_CtsZ 99.9 2.3E-27 4.9E-32 172.4 7.3 77 1-80 188-265 (317)
8 cd06604 GH31_glucosidase_II_Ma 99.9 2.4E-27 5.2E-32 173.3 7.2 77 1-80 195-271 (339)
9 cd06591 GH31_xylosidase_XylS X 99.9 3E-27 6.6E-32 171.9 7.3 77 1-80 188-265 (319)
10 cd06593 GH31_xylosidase_YicI Y 99.9 4.9E-27 1.1E-31 169.2 7.4 76 1-79 182-257 (308)
11 cd06603 GH31_GANC_GANAB_alpha 99.9 5.4E-27 1.2E-31 171.6 7.2 77 1-79 194-270 (339)
12 COG1501 Alpha-glucosidases, fa 99.9 3.2E-27 7E-32 188.1 6.2 76 1-79 439-514 (772)
13 PRK10658 putative alpha-glucos 99.9 6.5E-27 1.4E-31 184.0 7.4 77 1-80 441-517 (665)
14 cd06602 GH31_MGAM_SI_GAA This 99.9 1.9E-26 4.1E-31 169.2 7.5 75 2-80 179-253 (339)
15 cd06597 GH31_transferase_CtsY 99.9 1.7E-26 3.7E-31 169.6 7.3 75 1-80 211-285 (340)
16 cd06594 GH31_glucosidase_YihQ 99.9 1.2E-26 2.5E-31 169.1 6.3 76 1-79 189-268 (317)
17 cd06595 GH31_xylosidase_XylS-l 99.9 5.8E-26 1.3E-30 163.5 7.1 72 3-79 174-245 (292)
18 KOG1066|consensus 99.9 2.5E-26 5.3E-31 181.0 1.9 78 1-80 542-619 (915)
19 PRK10426 alpha-glucosidase; Pr 99.9 3.7E-25 7.9E-30 173.4 6.5 76 1-79 386-465 (635)
20 PF01055 Glyco_hydro_31: Glyco 99.9 5.8E-24 1.3E-28 158.6 6.4 76 2-80 212-287 (441)
21 cd06592 GH31_glucosidase_KIAA1 99.7 1.8E-18 3.9E-23 125.2 5.2 58 20-79 197-258 (303)
22 KOG1065|consensus 99.7 6.8E-18 1.5E-22 134.7 1.9 75 2-80 480-554 (805)
23 PF12692 Methyltransf_17: S-ad 85.4 0.84 1.8E-05 31.1 2.5 56 7-75 42-104 (160)
24 PHA02516 W baseplate wedge sub 50.4 17 0.00038 22.4 2.4 23 39-61 4-26 (103)
25 PF11288 DUF3089: Protein of u 48.8 22 0.00049 25.0 3.0 21 4-25 80-100 (207)
26 PF07421 Pro-NT_NN: Neurotensi 37.7 13 0.00029 25.4 0.4 11 19-29 157-167 (169)
27 COG0262 FolA Dihydrofolate red 36.2 17 0.00037 24.2 0.8 46 2-50 41-90 (167)
28 KOG0155|consensus 31.9 20 0.00044 29.0 0.7 23 19-44 103-125 (617)
29 PRK10769 folA dihydrofolate re 30.2 33 0.00072 22.8 1.4 26 2-28 41-66 (159)
30 PF06309 Torsin: Torsin; Inte 29.8 75 0.0016 20.7 3.0 16 19-34 51-66 (127)
31 PRK00311 panB 3-methyl-2-oxobu 27.8 76 0.0016 23.0 3.0 59 19-79 14-90 (264)
32 PLN03231 putative alpha-galact 27.4 2.8E+02 0.0061 21.1 6.5 47 7-57 199-255 (357)
33 PLN02899 alpha-galactosidase 26.9 2.1E+02 0.0045 23.7 5.5 46 6-57 229-286 (633)
34 PF03545 YopE: Yersinia virule 25.5 64 0.0014 19.1 1.9 23 48-70 34-58 (70)
35 PRK10125 putative glycosyl tra 24.8 30 0.00065 25.9 0.5 27 44-73 313-339 (405)
36 PF01630 Glyco_hydro_56: Hyalu 24.4 51 0.0011 24.9 1.6 42 21-71 263-305 (337)
37 PLN02229 alpha-galactosidase 23.7 1.4E+02 0.0031 23.3 4.0 62 7-68 195-265 (427)
38 cd00209 DHFR Dihydrofolate red 22.8 36 0.00078 22.2 0.5 27 2-29 40-67 (158)
39 cd00952 CHBPH_aldolase Trans-o 22.6 87 0.0019 22.8 2.5 32 39-70 20-51 (309)
40 PF07655 Secretin_N_2: Secreti 21.3 79 0.0017 19.2 1.8 19 41-59 66-84 (98)
41 TIGR02965 xanthine_xdhB xanthi 21.3 83 0.0018 26.0 2.4 42 37-79 181-229 (758)
42 PRK00924 5-keto-4-deoxyuronate 20.5 57 0.0012 24.0 1.2 17 28-44 244-262 (276)
43 PHA03371 circ protein; Provisi 20.3 1.1E+02 0.0024 22.2 2.6 36 1-37 104-139 (240)
No 1
>cd06601 GH31_lyase_GLase GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.
Probab=99.97 E-value=5.6e-31 Score=193.36 Aligned_cols=77 Identities=34% Similarity=0.574 Sum_probs=71.5
Q ss_pred ChHhHHHHHHHHhhc--CCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCccc
Q psy11534 1 MLQVMATFEGLLKRS--NYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAA 78 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~--~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~ 78 (81)
++++||+||++++.. ++ +||||||||+|+|+|||+++|+|||.++|++|+.||+++|+++|||+||||+|| |||.
T Consensus 175 ~~~~~a~~e~~~~~~~~~~-~RpfiltRS~~aGsqrY~~~WsGDn~stW~~L~~si~~~L~~~lsGip~~g~DI--GGF~ 251 (332)
T cd06601 175 YNLHKATWHGLNNLNARKN-KRNFIIGRGSYAGMQRFAGLWTGDNSSSWDFLQINIAQVLNLGMSGLPIAGSDI--GGFT 251 (332)
T ss_pred HHHHHHHHHHHHHhhcCCC-CCcEEEEecCcCccCCcCceeCCCcccCHHHHHHHHHHHHHhhhcCCCccCCcc--CCcC
Confidence 478999999998875 55 599999999999999999999999999999999999999999999999999999 5777
Q ss_pred CC
Q psy11534 79 GS 80 (81)
Q Consensus 79 g~ 80 (81)
|.
T Consensus 252 g~ 253 (332)
T cd06601 252 SY 253 (332)
T ss_pred CC
Confidence 63
No 2
>cd06596 GH31_CPE1046 CPE1046 is an uncharacterized Clostridium perfringens protein with a glycosyl hydrolase family 31 (GH31) domain. The domain architecture of CPE1046 and its orthologs includes a C-terminal fibronectin type 3 (FN3) domain and a coagulation factor 5/8 type C domain in addition to the GH31 domain. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.95 E-value=1.1e-28 Score=176.84 Aligned_cols=75 Identities=31% Similarity=0.612 Sum_probs=70.0
Q ss_pred hHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 2 LQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 2 ~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
.++|++||++++. ++ +||||++||+|+|+|||+++|+||+.++|++|+.+|+++|+++|||+||||+|| |||.|.
T Consensus 130 ~~~ka~yeg~~~~-~~-~RpfiltRsg~aGsQRy~~~WsGD~~stWe~Lr~sI~~~L~~gLsG~p~~G~DI--GGF~g~ 204 (261)
T cd06596 130 NGVKAAADGIESN-SN-ARPFIVTVDGWAGTQRYAGIWTGDQSGSWEYIRFHIPTYIGSGLSGQPNTTSDV--DGIFGG 204 (261)
T ss_pred HHHHHHHHHHHhC-CC-CCCEEEEecCccccCCCCCccCCCCcCcHHHHHHHHHHHHHHHhcCCCcCcccc--CcCCCC
Confidence 6789999999997 65 699999999999999999999999999999999999999999999999999999 577653
No 3
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.95 E-value=1e-27 Score=174.29 Aligned_cols=77 Identities=34% Similarity=0.640 Sum_probs=71.9
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
++++||+||++++..++ +|||+++||+|+|+|||+++|+||+.++|++|+.+|+++|++++||+||||+|| |||.+.
T Consensus 196 ~l~~~a~~~~~~~~~~~-~r~f~ltRs~~~G~qry~~~WsGD~~s~W~~L~~~i~~~L~~glsG~~~~g~DI--GGF~~~ 272 (317)
T cd06599 196 NLMARASHEAQAEHYPN-RRPYIVSRSGFAGIQRYAQTWSGDNRTSWKTLRYNIAMGLGMGLSGVANIGHDI--GGFAGP 272 (317)
T ss_pred HHHHHHHHHHHHHhCCC-CCcEEEEcCCcccccCCcCeeCCCcccCHHHHHHHHHHHHhhhccCCccccccc--CccCCC
Confidence 37899999999998776 599999999999999998899999999999999999999999999999999999 688763
No 4
>PLN02763 hydrolase, hydrolyzing O-glycosyl compounds
Probab=99.94 E-value=1.1e-27 Score=193.89 Aligned_cols=77 Identities=47% Similarity=0.889 Sum_probs=72.5
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
++++||+||++++..++ +|||++|||+|+|+|||+++|+|||.++|++|+.+|+++|++++||+||||+|| |||.|.
T Consensus 374 ll~akatyEgl~~~~~~-kRPFilTRSgfaGsQRYaa~WtGDn~SsWe~L~~sI~~~LnlgLSGipf~G~DI--GGF~G~ 450 (978)
T PLN02763 374 MLMARSTYEGMLLANKN-KRPFVLTRAGFIGSQRYAATWTGDNLSNWEHLHMSIPMVLQLGLSGQPLSGPDI--GGFAGD 450 (978)
T ss_pred HHHHHHHHHHHHHhCCC-CCcEEEEccccCcCCCCceEECCCccCCHHHHHHHHHHHHHHHhcCCccccccc--CCCCCC
Confidence 47899999999998876 599999999999999999999999999999999999999999999999999999 688764
No 5
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=99.94 E-value=1.5e-27 Score=169.44 Aligned_cols=76 Identities=41% Similarity=0.761 Sum_probs=71.3
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
++++|++||++++..++ +|||+++||+|+|+|||+.+|+||+.++|+.||.+|+++|+++++|+|+||+|| |||.|
T Consensus 144 ~~~~~~~~~~~~~~~~~-~r~~~~sRs~~~Gsqry~~~W~GD~~stW~~l~~~i~~~l~~~l~G~~~~g~Di--gGf~~ 219 (265)
T cd06589 144 LLYAEATYEALRKNSKN-KRPFILSRSGYAGSQRYAGMWSGDNTSTWGYLRSQIPAGLTMSMSGIPFVGSDI--GGFTG 219 (265)
T ss_pred HHHHHHHHHHHHHhcCC-CCeEEEEcCCcccccCcCceeCCcccCCHHHHHHHHHHHHhhhccCCcccCCCc--CCCCC
Confidence 36899999999998866 599999999999999999899999999999999999999999999999999999 57766
No 6
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=99.94 E-value=1.6e-27 Score=173.31 Aligned_cols=76 Identities=47% Similarity=0.787 Sum_probs=71.7
Q ss_pred hHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 2 LQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 2 ~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
++++|++|++++..++ +|||+++||+|+|+|||+.+|+||+.++|+.|+.+|+++|+++++|+|+||+|| |||.|.
T Consensus 174 ~~~~a~~~~~~~~~~~-~r~~~~sRs~~~G~qry~~~W~GD~~s~W~~L~~~i~~~l~~gl~G~p~~g~Di--GGf~~~ 249 (317)
T cd06600 174 YEAMATAEGFRTSHPR-NRIFILTRSGFAGSQKYAAIWTGDNTASWDDLKLSIPLVLGLSISGIPFVGCDI--GGFQGD 249 (317)
T ss_pred HHHHHHHHHHHHhcCC-CCceEEEeccccccCCccceECCcccccHHHHHHHHHHHHHHHhcCCCccCccc--CCCCCC
Confidence 6899999999998876 599999999999999999999999999999999999999999999999999999 687763
No 7
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.94 E-value=2.3e-27 Score=172.41 Aligned_cols=77 Identities=36% Similarity=0.608 Sum_probs=71.8
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccce-eecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGA-IWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~-~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
++++|++||++++..++ +|||+++||+|+|+|||++ +|+||+.++|++|+.+|+++|+++++|+|+||+|| |||.|
T Consensus 188 ~~~~~~~~e~~~~~~~~-~r~~~~~Rs~~~Gsqry~~~~WsGD~~s~W~~L~~~i~~~l~~~l~G~~~~g~DI--GGf~~ 264 (317)
T cd06598 188 HLWAKSIYEGYQQNYPN-ERPFILMRAGFAGSQRYGVIPWSGDVGRTWDGLKSQPNAALQMSMSGIDYYHSDI--GGFAG 264 (317)
T ss_pred HHHHHHHHHHHHHhcCC-CCeEEEEecCcCccccCcCCccCCCCcCCHHHHHHHHHHHHhhhccCCcccCCCc--CCcCC
Confidence 37899999999998776 5999999999999999985 89999999999999999999999999999999999 68877
Q ss_pred C
Q psy11534 80 S 80 (81)
Q Consensus 80 ~ 80 (81)
.
T Consensus 265 ~ 265 (317)
T cd06598 265 G 265 (317)
T ss_pred C
Confidence 4
No 8
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=99.94 E-value=2.4e-27 Score=173.27 Aligned_cols=77 Identities=47% Similarity=0.849 Sum_probs=72.4
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
++++||+||++++..++ +||||++||+|+|+|||+.+|+||+.++|+.|+.+|+++|+++++|+|+||+|| |||.+.
T Consensus 195 ~~~~~a~~~~~~~~~~~-~r~~~~sRs~~~G~qry~~~W~GD~~ssW~~L~~~i~~~l~~~l~G~~~~g~DI--GGf~~~ 271 (339)
T cd06604 195 LLMARATYEGLKKARPN-ERPFILTRAGYAGIQRYAAVWTGDNRSSWEHLRLSIPMLLNLGLSGVPFVGADI--GGFGGD 271 (339)
T ss_pred HHHHHHHHHHHHHhCCC-CCcEEEEecccccccccccccCCcccCCHHHHHHHHHHHHHHHhcCCccccccc--CCCCCC
Confidence 47899999999998876 599999999999999999999999999999999999999999999999999999 688763
No 9
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=99.94 E-value=3e-27 Score=171.88 Aligned_cols=77 Identities=30% Similarity=0.516 Sum_probs=71.2
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccce-eecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGA-IWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~-~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
++++|++||++++..++ +|||+++||+|+|+|||++ +|+||+.++|++||.+|+.+|++++||+|+||+|| |||.|
T Consensus 188 ~~~~~~~~e~~~~~~~~-~r~f~~sRs~~~Gsqry~~~~W~GD~~s~w~~L~~~i~~~l~~glsG~~~~g~Di--GGF~~ 264 (319)
T cd06591 188 LMHAKGIYEGQRAAGDE-KRVVILTRSAWAGSQRYGALVWSGDIDSSWETLRRQIAAGLNMGLSGIPWWTTDI--GGFFV 264 (319)
T ss_pred HHHHHHHHHHHHHhCCC-CCceEEEeccccccccccCceeCCCccccHHHHHHHHHHHHHHhhcCCccccCCc--CCcCC
Confidence 37899999999998765 5999999999999999995 89999999999999999999999999999999999 57765
Q ss_pred C
Q psy11534 80 S 80 (81)
Q Consensus 80 ~ 80 (81)
.
T Consensus 265 ~ 265 (319)
T cd06591 265 G 265 (319)
T ss_pred C
Confidence 3
No 10
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=99.94 E-value=4.9e-27 Score=169.17 Aligned_cols=76 Identities=26% Similarity=0.341 Sum_probs=71.4
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
++++|++||++++..+. +|||+++||+|+|+|||+.+|+||+.++|+.|+.+|+.+|+++++|+|+||+|| |||.+
T Consensus 182 ~~~~~~~~~~~~~~~~~-~r~~~~~Rs~~~Gsqry~~~w~GD~~s~w~~L~~~i~~~l~~~l~G~~~~g~Di--gGf~~ 257 (308)
T cd06593 182 LLYNKAVYEATKEVKGE-GEAVVWARSAWAGSQKYPVHWGGDCESTFEGMAESLRGGLSLGLSGFGFWSHDI--GGFEG 257 (308)
T ss_pred HHHHHHHHHHHHHhcCC-CCeEEEEcCCccccccCCCEECCCcccCHHHHHHHHHHHHhccccCCceecCcc--CCcCC
Confidence 36899999999998765 599999999999999999999999999999999999999999999999999999 57766
No 11
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=99.94 E-value=5.4e-27 Score=171.57 Aligned_cols=77 Identities=58% Similarity=0.892 Sum_probs=71.6
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
++++||+||++++..+.++|||+++||+|+|+|||+++|+||+.++|+.|+.+|+++|+++++|+|+||+|| |||.|
T Consensus 194 ~~~~~a~~e~~~~~~~~~~r~~~~sRs~~~G~qry~~~W~GD~~s~W~~L~~~i~~~L~~~l~G~~~~g~Di--GGf~~ 270 (339)
T cd06603 194 LYMHMATFDGLLKRSEGNKRPFVLTRSFFAGSQRYAAIWTGDNTATWEHLKISIPMLLSLNICGIPFCGADV--GGFFG 270 (339)
T ss_pred HHHHHHHHHHHHHhhccCCceEEEEecccccccceeeeeCCCccCCHHHHHHHHHHHHHHhhcCccccCCcc--CCcCC
Confidence 368999999999987534699999999999999999999999999999999999999999999999999999 57765
No 12
>COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=99.94 E-value=3.2e-27 Score=188.08 Aligned_cols=76 Identities=42% Similarity=0.728 Sum_probs=72.3
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
++++||+||++++..++ +|||+|+||+|+|+|||+++|+||+.++|++||.||+++|+++|||+|+||+|| |||.|
T Consensus 439 ~~~~~a~~~~~~~~~~~-~r~~~lsRsg~aG~Q~~~~~WsGD~~s~wd~l~~si~~~Ls~~~sGi~~wg~Di--GGF~g 514 (772)
T COG1501 439 LLYAKAVYEALKELGGN-ERPFILSRSGYAGSQRYAAHWSGDNRSSWDSLRESIPAGLSLSLSGIPFWGHDI--GGFTG 514 (772)
T ss_pred HHHHHHHHHHHHhhcCC-CceEEEEecccccceeccceeCCccccchHHHHhhHHhhhchhccCCccccccc--cccCC
Confidence 47999999999998776 699999999999999999999999999999999999999999999999999999 68877
No 13
>PRK10658 putative alpha-glucosidase; Provisional
Probab=99.94 E-value=6.5e-27 Score=184.02 Aligned_cols=77 Identities=23% Similarity=0.303 Sum_probs=72.1
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
++|+||+||++++..+. +|||+++||+|+|+|||+++|+|||.++|++|+.+|+++|++++||+||||+|| |||.|.
T Consensus 441 ~l~~ka~~e~l~~~~~~-~r~~i~tRs~~aGsQry~~~WsGD~~stw~~l~~si~~~Ls~glsG~~~~g~DI--GGF~g~ 517 (665)
T PRK10658 441 YLYNKTVFDVLKETRGE-GEAVLFARSATVGGQQFPVHWGGDCYSNYESMAESLRGGLSLGLSGFGFWSHDI--GGFENT 517 (665)
T ss_pred HHHHHHHHHHHHHhcCC-CceEEEEecccCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHHhcCCccccCcc--CCCCCC
Confidence 47999999999998764 699999999999999999999999999999999999999999999999999999 688763
No 14
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=99.93 E-value=1.9e-26 Score=169.22 Aligned_cols=75 Identities=28% Similarity=0.513 Sum_probs=70.4
Q ss_pred hHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 2 LQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 2 ~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
+++|++||++++. ++ +|||+++||+|+|+|||+++|+||+.++|++|+.+|+++|++++||+||||+|| |||.|.
T Consensus 179 ~~~~~~~~~~~~~-~~-~r~~~~sRs~~~G~qry~~~w~GD~~s~W~~L~~~i~~~l~~~~sG~~~~~~Di--gGf~g~ 253 (339)
T cd06602 179 SEAIATYKALQSI-PG-KRPFVISRSTFPGSGRYAGHWLGDNASTWEDLRYSIIGMLEFNLFGIPMVGADI--CGFNGD 253 (339)
T ss_pred HHHHHHHHHHHhc-CC-CCCEEEEecCcccccccceeECCCccCCHHHHHHHHHHHHHHHhcCCCcccCCC--CCCCCC
Confidence 6899999999998 55 599999999999999999999999999999999999999999999999999999 577663
No 15
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.93 E-value=1.7e-26 Score=169.62 Aligned_cols=75 Identities=21% Similarity=0.303 Sum_probs=69.7
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
++++|++||++++. + +||||++||+|+|+|||+++|+||+.++|++||.+|+++|+++++|+|+||+|| |||.|+
T Consensus 211 ~~~~~~~~e~~~~~--~-~r~filtRs~~~Gsqry~~~WsGD~~s~W~~L~~~i~~~L~~glsG~~~~g~DI--gGF~g~ 285 (340)
T cd06597 211 NHYVRAYNDFLRRA--K-KDGVTFSRAGYTGAQAHGIFWAGDENSTFGAFRWSVFAGLSASASGIPYWGWDL--AGFTGD 285 (340)
T ss_pred HHHHHHHHHHHHhc--c-CCcEEEEecccCccCCCcceecCCCCCCHHHHHHHHHHHHHHhhcCCCcCCCcc--CCcCCC
Confidence 37899999999885 5 499999999999999999999999999999999999999999999999999999 578764
No 16
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=99.93 E-value=1.2e-26 Score=169.10 Aligned_cols=76 Identities=28% Similarity=0.450 Sum_probs=70.2
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccce-eecCCCcCChH---HHHHHHHHHHhccccCCccccccCCCCc
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGA-IWTGDNMAEWS---HLKISLPMCLSLAVSEKKSANFVVGIGA 76 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~-~W~GD~~s~W~---~L~~~i~~~l~~~lsG~~~~gsDiG~~g 76 (81)
++++|++||++++..++ +||||++||+|+|+|||++ +|+||+.++|+ +|+.+|+++|+++++|+||||+|| ||
T Consensus 189 ~~~~~~~~~~~~~~~~~-~r~fvltRs~~~Gsqry~~~~WsGD~~s~W~~~~~L~~~i~~~L~~~lsG~~~~g~DI--GG 265 (317)
T cd06594 189 ELWAKLNREAVEEAGKT-GDILFFMRSGFTGSQKYSTLFWAGDQMVSWDAHDGLKSVVPGALSSGLSGYALHHSDI--GG 265 (317)
T ss_pred HHHHHHHHHHHHHhccC-CCeEEEEcccccccccccccccCCCCCCCCcCcccHHHHHHHHhhccccCCCcccCcc--CC
Confidence 37899999999998775 5999999999999999985 89999999998 799999999999999999999999 57
Q ss_pred ccC
Q psy11534 77 AAG 79 (81)
Q Consensus 77 f~g 79 (81)
|.+
T Consensus 266 F~~ 268 (317)
T cd06594 266 YTS 268 (317)
T ss_pred CcC
Confidence 764
No 17
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.93 E-value=5.8e-26 Score=163.51 Aligned_cols=72 Identities=18% Similarity=0.239 Sum_probs=65.6
Q ss_pred HhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 3 QVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 3 ~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
+.+..|+..++ ++ +|||||+||+|+|+|||+++|+||+.++|++|+.||+++|+++++|+||||+|| |||.|
T Consensus 174 ~~~~~y~~~~~--~~-~r~f~lsRs~~~G~qry~~~WsGD~~s~W~~l~~~i~~~l~~~~sG~p~~g~Di--GGF~~ 245 (292)
T cd06595 174 LNHVHYLDSAR--NG-RRPLIFSRWAGLGSHRYPIGFSGDTIISWASLAFQPYFTATASNIGYGYWSHDI--GGHML 245 (292)
T ss_pred HHHHHHHHhhc--cC-CCcEEEEeecccCCCcCCCccCCCcccCHHHHHHHHHHHHHHHhcCCCcCCCcc--CCCCC
Confidence 35677887765 55 599999999999999999999999999999999999999999999999999999 68876
No 18
>KOG1066|consensus
Probab=99.92 E-value=2.5e-26 Score=181.02 Aligned_cols=78 Identities=53% Similarity=0.865 Sum_probs=73.0
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
|++-.||||+|+++.++++|||||||+.|||||||+++|+|||.++|++||-+|+++|+++++|+||+|+|| |||-|.
T Consensus 542 ~~~h~aT~dGl~~R~~g~~RPFvLsRafFaGsQR~aAiWTGDN~A~W~HLkiSiPM~Lsl~iaG~~F~GADV--gGFFgN 619 (915)
T KOG1066|consen 542 LMVHMATFDGLIARSGGKERPFVLSRAFFAGSQRTAAIWTGDNTADWDHLKISIPMVLSLGIAGMPFVGADV--GGFFGN 619 (915)
T ss_pred eeeeehhhhhhhhhcCCCcCceEEEeccccCccceeeeeccCCccchhhheeecceeEecccccceeccccc--ccccCC
Confidence 345679999999999878999999999999999999999999999999999999999999999999999999 688764
No 19
>PRK10426 alpha-glucosidase; Provisional
Probab=99.91 E-value=3.7e-25 Score=173.37 Aligned_cols=76 Identities=26% Similarity=0.524 Sum_probs=69.7
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccce-eecCCCcCChH---HHHHHHHHHHhccccCCccccccCCCCc
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGA-IWTGDNMAEWS---HLKISLPMCLSLAVSEKKSANFVVGIGA 76 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~-~W~GD~~s~W~---~L~~~i~~~l~~~lsG~~~~gsDiG~~g 76 (81)
++|+|++||++++..+. +|||++|||+|+|+|||++ +|+|||.++|+ +|+.+|+++|++++||+|+||+|| ||
T Consensus 386 ~l~~~~~~e~~~~~~~~-~r~f~ltRsg~aGsQry~~~~WsGD~~ssW~~~d~L~~~I~~~Ls~glsG~~~~g~DI--GG 462 (635)
T PRK10426 386 ALWAKCNYEALEETGKL-GEILFFMRAGYTGSQKYSTLFWAGDQNVDWSLDDGLASVVPAALSLGMSGHGLHHSDI--GG 462 (635)
T ss_pred HHHHHHHHHHHHHhcCC-CCcEEEEccccCCcCCccccccCCCCCCcCcChhHHHHHHHHHHHHHhcCcCcccccc--CC
Confidence 47999999999997654 5999999999999999996 89999999995 899999999999999999999999 57
Q ss_pred ccC
Q psy11534 77 AAG 79 (81)
Q Consensus 77 f~g 79 (81)
|.+
T Consensus 463 F~~ 465 (635)
T PRK10426 463 YTT 465 (635)
T ss_pred CcC
Confidence 753
No 20
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=99.90 E-value=5.8e-24 Score=158.61 Aligned_cols=76 Identities=39% Similarity=0.706 Sum_probs=68.3
Q ss_pred hHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 2 LQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 2 ~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
+++|+++|++++..++ +|||+++||+++|+|||+.+|+||+.++|++|+.+|+++|++++||+|+||+|| |||.+.
T Consensus 212 ~~~~~~~~~~~~~~~~-~r~~~~sRs~~~G~qr~~~~w~GD~~s~w~~L~~~i~~~l~~~~~G~~~~g~Di--gG~~~~ 287 (441)
T PF01055_consen 212 LYAKATYEALREIDPN-KRPFIFSRSGWAGSQRYGGHWSGDNSSSWDGLRSSIPAMLNMGLSGYPFWGSDI--GGFSGD 287 (441)
T ss_dssp HHHHHHHHHHHHHSTT-SC-EEEESSEETTGGGTCEEEECSSBSSHHHHHHHHHHHHHHHCTT-SSEEEEE--T-SBST
T ss_pred cchhhhhhhhhhccCC-CCcceeecccCCCCCccceeecccccccHHHHHHHHHHHHHHhhhhcceecCcc--cccCCC
Confidence 6899999999997665 599999999999999999999999999999999999999999999999999999 577763
No 21
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.74 E-value=1.8e-18 Score=125.20 Aligned_cols=58 Identities=17% Similarity=0.144 Sum_probs=53.2
Q ss_pred CceEEeccCCcCccccceeecCCCcCChHHH---HHHHHHHHhccccCCcccccc-CCCCcccC
Q psy11534 20 RPFILTRSGFAGSQRFGAIWTGDNMAEWSHL---KISLPMCLSLAVSEKKSANFV-VGIGAAAG 79 (81)
Q Consensus 20 R~filsRs~~aGsqry~~~W~GD~~s~W~~L---~~~i~~~l~~~lsG~~~~gsD-iG~~gf~g 79 (81)
+||+++||+|+|+|+++.+|+||+.++|+++ +.+|+++|++++||+|||++| | |||.+
T Consensus 197 ~~~~~~Rsg~~g~~~~~~~w~GD~~s~W~~~~gl~~~i~~~L~~~lsG~~~w~~D~i--GGf~~ 258 (303)
T cd06592 197 GDLIEVRAGWRSQGLPLFVRMMDKDSSWGGDNGLKSLIPTALTMGLLGYPFVLPDMI--GGNAY 258 (303)
T ss_pred ccceEEEeeeecCCCCeeEEcCCCCCCCCCCcCHHHHHHHHHHhhccCCcccCCCcc--CCccc
Confidence 5899999999998888889999999999887 999999999999999999999 8 46653
No 22
>KOG1065|consensus
Probab=99.69 E-value=6.8e-18 Score=134.70 Aligned_cols=75 Identities=31% Similarity=0.504 Sum_probs=69.8
Q ss_pred hHhHHHHHHHHhhcCCCCCceEEeccCCcCccccceeecCCCcCChHHHHHHHHHHHhccccCCccccccCCCCcccCC
Q psy11534 2 LQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAGS 80 (81)
Q Consensus 2 ~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g~ 80 (81)
..++|++.++.+.. + +|+++++||+|+|+.||+++|.|||.++|+.|+.+|..+|.++++|+|++|+|| +||.|.
T Consensus 480 ~~aiat~~a~~~v~-~-kr~~i~srsTf~g~g~y~~hwlgdn~~~w~~L~~sI~gml~fnl~Gip~Vg~Di--cgf~~~ 554 (805)
T KOG1065|consen 480 SEAIATHQALVDVP-G-KRSFILSRSTFVGSGRYAGHWLGDNTARWEDLQTSISGMLEFNLFGIPMVGSDI--CGFLGP 554 (805)
T ss_pred HHhhhhhccceecc-c-cccccccccceecccccceeecccccceehhccccchhhhcccccCCCccchhh--hcCCCC
Confidence 56889999999986 7 599999999999999999999999999999999999999999999999999999 777663
No 23
>PF12692 Methyltransf_17: S-adenosyl-L-methionine methyltransferase; PDB: 3IHT_B.
Probab=85.39 E-value=0.84 Score=31.09 Aligned_cols=56 Identities=21% Similarity=0.314 Sum_probs=29.5
Q ss_pred HHHHHHhhcCCCCCceEEeccCCcCccccce-------eecCCCcCChHHHHHHHHHHHhccccCCccccccCCCC
Q psy11534 7 TFEGLLKRSNYQQRPFILTRSGFAGSQRFGA-------IWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFVVGIG 75 (81)
Q Consensus 7 ~~e~l~~~~~~~~R~filsRs~~aGsqry~~-------~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~ 75 (81)
||+-|+|..|+ +|.++|-|. -|.++. .-.|| .+...+. +..-=.+..+.+.|+|++
T Consensus 42 TydHLRe~~p~-R~I~vfDR~----l~~hp~~~P~~~~~ilGd-------i~~tl~~-~~~~g~~a~laHaD~G~g 104 (160)
T PF12692_consen 42 TYDHLREIFPD-RRIYVFDRA----LACHPSSTPPEEDLILGD-------IRETLPA-LARFGAGAALAHADIGTG 104 (160)
T ss_dssp HHHHHHHH--S-S-EEEEESS------S-GGG---GGGEEES--------HHHHHHH-HHHH-S-EEEEEE----S
T ss_pred cHHHHHHhCCC-CeEEEEeee----cccCCCCCCchHheeecc-------HHHHhHH-HHhcCCceEEEEeecCCC
Confidence 89999999998 499999998 444442 34454 4445555 444445667889999743
No 24
>PHA02516 W baseplate wedge subunit; Provisional
Probab=50.45 E-value=17 Score=22.37 Aligned_cols=23 Identities=22% Similarity=0.293 Sum_probs=20.7
Q ss_pred ecCCCcCChHHHHHHHHHHHhcc
Q psy11534 39 WTGDNMAEWSHLKISLPMCLSLA 61 (81)
Q Consensus 39 W~GD~~s~W~~L~~~i~~~l~~~ 61 (81)
.+|...+.|++++++|..+|+--
T Consensus 4 ~tG~~~~~~~~I~qsI~~iL~T~ 26 (103)
T PHA02516 4 ETGRALSDLEHIRQSIGDILLTP 26 (103)
T ss_pred CCCCCccCHHHHHHHHHHHHcCC
Confidence 68999999999999999999863
No 25
>PF11288 DUF3089: Protein of unknown function (DUF3089); InterPro: IPR021440 This family of proteins has no known function.
Probab=48.81 E-value=22 Score=24.97 Aligned_cols=21 Identities=48% Similarity=0.662 Sum_probs=15.5
Q ss_pred hHHHHHHHHhhcCCCCCceEEe
Q psy11534 4 VMATFEGLLKRSNYQQRPFILT 25 (81)
Q Consensus 4 ~ka~~e~l~~~~~~~~R~fils 25 (81)
+++.+|...+..++ .|||||.
T Consensus 80 V~~AF~~yL~~~n~-GRPfILa 100 (207)
T PF11288_consen 80 VRAAFDYYLANYNN-GRPFILA 100 (207)
T ss_pred HHHHHHHHHHhcCC-CCCEEEE
Confidence 45677777777644 6999985
No 26
>PF07421 Pro-NT_NN: Neurotensin/neuromedin N precursor; InterPro: IPR008055 Neurotensin is a 13-residue peptide transmitter, sharing significant similarity in its 6 C-terminal amino acid residues with several other neuropeptides, including neuromedin N (which is derived from the same precursor). This C-terminal region is responsible for the full biological activity, the N-terminal portion having a modulatory role. Neurotensin is distributed throughout the central nervous system, with highest levels in the hypothalamus, amygdala and nucleus accumbens. It induces a variety of effects, including: analgesia, hypothermia and increased locomotor activity. It is also involved in regulation of dopamine pathways. In the periphery, neurotensin is found in endocrine cells of the small intestine, where it leads to secretion and smooth muscle contraction []. The neurotensin/neuromedin N precursor can also be processed to produce large 125-138 amino acid peptides with the neurotensin or neuromedin N sequence at their C terminus. These large peptides appear to be less potent than their smaller counterparts, but are also less sensitive to degradation and may represent endogenous, long-lasting activators in a number of pathophysiological situations.; GO: 0005184 neuropeptide hormone activity, 0005576 extracellular region; PDB: 3F6K_N.
Probab=37.67 E-value=13 Score=25.36 Aligned_cols=11 Identities=55% Similarity=1.192 Sum_probs=1.8
Q ss_pred CCceEEeccCC
Q psy11534 19 QRPFILTRSGF 29 (81)
Q Consensus 19 ~R~filsRs~~ 29 (81)
+||+|+-||.|
T Consensus 157 RRPYILKRssy 167 (169)
T PF07421_consen 157 RRPYILKRSSY 167 (169)
T ss_dssp ---EE------
T ss_pred CCCeeeccccc
Confidence 58999999876
No 27
>COG0262 FolA Dihydrofolate reductase [Coenzyme metabolism]
Probab=36.24 E-value=17 Score=24.21 Aligned_cols=46 Identities=15% Similarity=0.085 Sum_probs=29.0
Q ss_pred hHhHHHHHHHHh---hcCCCCCceEEeccCCcCccccc-eeecCCCcCChHHH
Q psy11534 2 LQVMATFEGLLK---RSNYQQRPFILTRSGFAGSQRFG-AIWTGDNMAEWSHL 50 (81)
Q Consensus 2 ~~~ka~~e~l~~---~~~~~~R~filsRs~~aGsqry~-~~W~GD~~s~W~~L 50 (81)
+|-+-|||.+.. -.|+ ++.+++||. +-.+.-. ..+.+|.......+
T Consensus 41 iMGRkT~esl~~~~~pl~~-r~~iV~sr~--~~~~~e~~~~v~~s~~~al~~~ 90 (167)
T COG0262 41 IMGRKTYESLPGEWRPLPG-RKNIVLSRN--PDLKTEGGVEVVDSIEEALLLL 90 (167)
T ss_pred EEccchhhhccccCCCCCC-CeEEEEecC--cccccCCCEEEeCCHHHHHHHH
Confidence 567889999994 4455 477889998 4444444 45555554444333
No 28
>KOG0155|consensus
Probab=31.90 E-value=20 Score=28.97 Aligned_cols=23 Identities=35% Similarity=0.833 Sum_probs=19.3
Q ss_pred CCceEEeccCCcCccccceeecCCCc
Q psy11534 19 QRPFILTRSGFAGSQRFGAIWTGDNM 44 (81)
Q Consensus 19 ~R~filsRs~~aGsqry~~~W~GD~~ 44 (81)
.||...++ .+|+- ++.+|+||+.
T Consensus 103 ~rPi~~~~--ipgtd-WcVVwTgD~R 125 (617)
T KOG0155|consen 103 DRPIFKKP--IPGTD-WCVVWTGDNR 125 (617)
T ss_pred CCCCCCCC--CCCCC-eEEEEeCCCc
Confidence 47877777 89988 9999999983
No 29
>PRK10769 folA dihydrofolate reductase; Provisional
Probab=30.18 E-value=33 Score=22.83 Aligned_cols=26 Identities=8% Similarity=0.123 Sum_probs=19.7
Q ss_pred hHhHHHHHHHHhhcCCCCCceEEeccC
Q psy11534 2 LQVMATFEGLLKRSNYQQRPFILTRSG 28 (81)
Q Consensus 2 ~~~ka~~e~l~~~~~~~~R~filsRs~ 28 (81)
+|-+-|||.+.+..|+ +..+|+||..
T Consensus 41 IMGRkTyes~~~plp~-r~~iVlSr~~ 66 (159)
T PRK10769 41 IMGRHTWESIGRPLPG-RKNIVISSQP 66 (159)
T ss_pred EEeHHHHHhhhhhcCC-CcEEEECCCC
Confidence 5678899999765676 4779999963
No 30
>PF06309 Torsin: Torsin; InterPro: IPR010448 This family consists of several eukaryotic torsin proteins. Torsion dystonia is an autosomal dominant movement disorder characterised by involuntary, repetitive muscle contractions and twisted postures. The most severe early-onset form of dystonia has been linked to mutations in the human DYT1 (TOR1A) gene encoding a protein termed torsinA. While causative genetic alterations have been identified, the function of torsin proteins and the molecular mechanism underlying dystonia remain unknown. Phylogenetic analysis of the torsin protein family indicates these proteins share distant sequence similarity with the large and diverse family of AAA ATPase, central region containing proteins () proteins. It has been suggested that torsins play a role in effectively managing protein folding and that possible breakdown in a neuroprotective mechanism that is, in part, mediated by torsins may be responsible for the neuronal dysfunction associated with dystonia [].; GO: 0005524 ATP binding, 0051085 chaperone mediated protein folding requiring cofactor
Probab=29.76 E-value=75 Score=20.74 Aligned_cols=16 Identities=25% Similarity=0.825 Sum_probs=14.7
Q ss_pred CCceEEeccCCcCccc
Q psy11534 19 QRPFILTRSGFAGSQR 34 (81)
Q Consensus 19 ~R~filsRs~~aGsqr 34 (81)
++|++++=-+++|++|
T Consensus 51 ~KpLVlSfHG~tGtGK 66 (127)
T PF06309_consen 51 RKPLVLSFHGWTGTGK 66 (127)
T ss_pred CCCEEEEeecCCCCcH
Confidence 5999999999999987
No 31
>PRK00311 panB 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed
Probab=27.75 E-value=76 Score=23.02 Aligned_cols=59 Identities=7% Similarity=0.021 Sum_probs=40.7
Q ss_pred CCceEEeccCCcCccc------cceeecCCCc------------CChHHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 19 QRPFILTRSGFAGSQR------FGAIWTGDNM------------AEWSHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 19 ~R~filsRs~~aGsqr------y~~~W~GD~~------------s~W~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
.+|+++.-..=+.+-| +-.+-+||.. .+.+.+-..+..+... ...|++-.|++|+++..
T Consensus 14 g~~i~~~tayD~~sArl~e~aG~d~i~vGds~~~~~lG~~Dt~~vtl~em~~h~~~V~r~--~~~p~vvaD~pfg~y~~ 90 (264)
T PRK00311 14 GEKIVMLTAYDYPFAKLFDEAGVDVILVGDSLGMVVLGYDSTLPVTLDDMIYHTKAVARG--APRALVVADMPFGSYQA 90 (264)
T ss_pred CCCEEEEeCCCHHHHHHHHHcCCCEEEECHHHHHHHcCCCCCCCcCHHHHHHHHHHHHhc--CCCCcEEEeCCCCCccC
Confidence 4788887777666666 3345567664 3556777777776654 45788889999999853
No 32
>PLN03231 putative alpha-galactosidase; Provisional
Probab=27.41 E-value=2.8e+02 Score=21.12 Aligned_cols=47 Identities=21% Similarity=0.400 Sum_probs=30.1
Q ss_pred HHHHHHhhcCCCCCceEEeccCCcC--------ccccceee--cCCCcCChHHHHHHHHHH
Q psy11534 7 TFEGLLKRSNYQQRPFILTRSGFAG--------SQRFGAIW--TGDNMAEWSHLKISLPMC 57 (81)
Q Consensus 7 ~~e~l~~~~~~~~R~filsRs~~aG--------sqry~~~W--~GD~~s~W~~L~~~i~~~ 57 (81)
..++|++. .||++++=+--.. .++++..| ++|...+|+.+...+...
T Consensus 199 m~~AL~~t----GRpIv~Slc~g~~~~~~~~~~i~~~an~WR~s~DI~d~W~~v~~~~~~~ 255 (357)
T PLN03231 199 VSKAIRNS----GRPMIYSLSPGDGATPGLAARVAQLVNMYRVTGDDWDDWKYLVKHFDVA 255 (357)
T ss_pred HHHHHHHh----CCCeEEEecCCCCCCchhhhhhhhhcCcccccCCcccchhhHHHHHHHH
Confidence 34455442 4999988774111 12334455 699999999988877655
No 33
>PLN02899 alpha-galactosidase
Probab=26.87 E-value=2.1e+02 Score=23.69 Aligned_cols=46 Identities=22% Similarity=0.426 Sum_probs=30.6
Q ss_pred HHHHHHHhhcCCCCCceEEeccCCcCcc----------ccceee--cCCCcCChHHHHHHHHHH
Q psy11534 6 ATFEGLLKRSNYQQRPFILTRSGFAGSQ----------RFGAIW--TGDNMAEWSHLKISLPMC 57 (81)
Q Consensus 6 a~~e~l~~~~~~~~R~filsRs~~aGsq----------ry~~~W--~GD~~s~W~~L~~~i~~~ 57 (81)
+..++|++. .||++++=+- |.+ .++..| +||...+|+.+...+...
T Consensus 229 ~ms~AL~aT----GRPIvySLsp--G~~~~p~wa~~v~~~aNmWRitgDI~D~W~sV~~~~d~~ 286 (633)
T PLN02899 229 YVSEVLKEL----DRPIVYSLSP--GTSATPTMAKEVSGLVNMYRITGDDWDTWGDVAAHFDVS 286 (633)
T ss_pred HHHHHHHHh----CCCeEEEecC--CcccchhhhhhhhccCccceecCCcccchHHHHHHHHHH
Confidence 445555552 4999998873 333 334444 599999999998876544
No 34
>PF03545 YopE: Yersinia virulence determinant (YopE); InterPro: IPR014773 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens, such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia []. The type III secretion system is of great interest, as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis. However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself [], type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. Exotoxins secreted by the type III system do not possess a secretion signal, and are considered unique for this reason []. Yersinia secrete a Rho GTPase-activating protein, YopE [, ], that disrupts the host cell actin cytoskeleton. YopE is regulated by another bacterial gene, SycE [], that enables the exotoxin to remain soluble in the bacterial cytoplasm. A similar protein, exoenzyme S from Pseudomonas aeruginosa, has both ADP-ribosylation and GTPase activity [, ].; PDB: 1HY5_B 1G4W_R 1G4U_S 1R4T_A 1HE1_A 1HE9_A.
Probab=25.54 E-value=64 Score=19.12 Aligned_cols=23 Identities=13% Similarity=0.002 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHhccccCCcc--ccc
Q psy11534 48 SHLKISLPMCLSLAVSEKKS--ANF 70 (81)
Q Consensus 48 ~~L~~~i~~~l~~~lsG~~~--~gs 70 (81)
+.++.....+|+..++|||| ||+
T Consensus 34 ~~~~~~A~~ll~~~VggipfsQwGT 58 (70)
T PF03545_consen 34 EQLQDQASTLLNIQVGGIPFSQWGT 58 (70)
T ss_dssp HHHHHHHHHHHTSEETTEEGGGCTS
T ss_pred HHHHHHHHHHhhCcccceeHHHhhc
Confidence 57888899999999999998 454
No 35
>PRK10125 putative glycosyl transferase; Provisional
Probab=24.82 E-value=30 Score=25.92 Aligned_cols=27 Identities=7% Similarity=-0.122 Sum_probs=21.4
Q ss_pred cCChHHHHHHHHHHHhccccCCccccccCC
Q psy11534 44 MAEWSHLKISLPMCLSLAVSEKKSANFVVG 73 (81)
Q Consensus 44 ~s~W~~L~~~i~~~l~~~lsG~~~~gsDiG 73 (81)
.|.||.+...+...+ .||.|.+++|+|
T Consensus 313 pS~~Egfp~vilEAm---A~G~PVVat~~g 339 (405)
T PRK10125 313 SSRVDNYPLILCEAL---SIGVPVIATHSD 339 (405)
T ss_pred CCccccCcCHHHHHH---HcCCCEEEeCCC
Confidence 466777777777666 799999999994
No 36
>PF01630 Glyco_hydro_56: Hyaluronidase; InterPro: IPR018155 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 56 GH56 from CAZY comprises enzymes with only one known activity; hyaluronidase 3.2.1.35 from EC. The venom of Apis mellifera (Honeybee) contains several biologically-active peptides and two enzymes, one of which is a hyaluronidase []. The amino acid sequence of bee venom hyaluronidase contains 349 amino acids, and includes four cysteines and a number of potential glycosylation sites []. The sequence shows a high degree of similarity to PH-20, a membrane protein of mammalian sperm involved in sperm-egg adhesion, supporting the view that hyaluronidases play a role in fertilisation []. PH-20 is required for sperm adhesion to the egg zona pellucida; it is located on both the sperm plasma membrane and acrosomal membrane []. The amino acid sequence of the mature protein contains 468 amino acids, and includes six potential N-linked glycosylation sites and twelve cysteines, eight of which are tightly clustered near the C terminus [].; GO: 0004415 hyalurononglucosaminidase activity, 0005975 carbohydrate metabolic process; PDB: 1FCQ_A 1FCV_A 1FCU_A 2J88_A 2PE4_A 2ATM_A.
Probab=24.38 E-value=51 Score=24.94 Aligned_cols=42 Identities=14% Similarity=0.077 Sum_probs=28.0
Q ss_pred ceEEeccCC-cCccccceeecCCCcCChHHHHHHHHHHHhccccCCcccccc
Q psy11534 21 PFILTRSGF-AGSQRFGAIWTGDNMAEWSHLKISLPMCLSLAVSEKKSANFV 71 (81)
Q Consensus 21 ~filsRs~~-aGsqry~~~W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gsD 71 (81)
.|..+|--| .-+..| =+=++|...|.....++++|+-+||+.
T Consensus 263 V~~Y~r~~~~~~~~~f---------Ls~~DL~~TigesaalGa~GvViWG~s 305 (337)
T PF01630_consen 263 VFPYTRYVYYDSTDEF---------LSQEDLVNTIGESAALGAAGVVIWGSS 305 (337)
T ss_dssp EEEEEESEETTEEEEE-----------HHHHHHHHHHHHHTT-SEEEEE--G
T ss_pred eeeeeecccccCcccc---------chhhHHHHHHHHHHHcCCCeEEEeecc
Confidence 366788444 233333 234899999999999999999999974
No 37
>PLN02229 alpha-galactosidase
Probab=23.72 E-value=1.4e+02 Score=23.33 Aligned_cols=62 Identities=13% Similarity=0.212 Sum_probs=35.7
Q ss_pred HHHHHHhhcCCCCCceEEeccCCcCc------cccceee--cCCCcCChHHHHHHHHHHHhc-cccCCccc
Q psy11534 7 TFEGLLKRSNYQQRPFILTRSGFAGS------QRFGAIW--TGDNMAEWSHLKISLPMCLSL-AVSEKKSA 68 (81)
Q Consensus 7 ~~e~l~~~~~~~~R~filsRs~~aGs------qry~~~W--~GD~~s~W~~L~~~i~~~l~~-~lsG~~~~ 68 (81)
.|..+.+.-....||++++=+.+.-. ++++..| ++|...+|+.+...+...-.. ...|-..|
T Consensus 195 ~y~~m~~AL~~tGRpI~~SlC~WG~~~p~~w~~~~~n~WR~s~DI~d~W~sv~~i~~~~~~~~~~agPG~w 265 (427)
T PLN02229 195 RYPPMRDALNATGRSIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGW 265 (427)
T ss_pred HHHHHHHHHHhhCCCcEEEecCCCCCCHHHHHHhhcCeeeccCCcccccccHHHHHHHHHHHHhhcCCCCC
Confidence 34444443322259998886553221 3445555 599999999988877644333 33444444
No 38
>cd00209 DHFR Dihydrofolate reductase (DHFR). Reduces 7,8-dihydrofolate to 5,6,7,8-tetrahydrofolate with NADPH as a cofactor. This is an essential step in the biosynthesis of deoxythymidine phosphate since 5,6,7,8-tetrahydrofolate is required to regenerate 5,10-methylenetetrahydrofolate which is then utilized by thymidylate synthase. Inhibition of DHFR interrupts thymidilate synthesis and DNA replication, inhibitors of DHFR (such as Methotrexate) are used in cancer chemotherapy. 5,6,7,8-tetrahydrofolate also is involved in glycine, serine, and threonine metabolism and aminoacyl-tRNA biosynthesis.
Probab=22.78 E-value=36 Score=22.17 Aligned_cols=27 Identities=19% Similarity=0.140 Sum_probs=19.5
Q ss_pred hHhHHHHHHHHh-hcCCCCCceEEeccCC
Q psy11534 2 LQVMATFEGLLK-RSNYQQRPFILTRSGF 29 (81)
Q Consensus 2 ~~~ka~~e~l~~-~~~~~~R~filsRs~~ 29 (81)
++-+.|||.+.. ..|+ +..+++||+--
T Consensus 40 vmGR~Tye~~~~~~~~~-~~~~V~S~~~~ 67 (158)
T cd00209 40 IMGRKTFESIPRRPLPG-RTNIVLSRQLD 67 (158)
T ss_pred EEChhhHhhCCCcCCCC-CCEEEEccCCC
Confidence 567889999876 4565 36788998743
No 39
>cd00952 CHBPH_aldolase Trans-o-hydroxybenzylidenepyruvate hydratase-aldolase (HBPHA) and trans-2'-carboxybenzalpyruvate hydratase-aldolase (CBPHA). HBPHA catalyzes HBP to salicyaldehyde and pyruvate. This reaction is part of the degradative pathways for naphthalene and naphthalenesulfonates by bacteria. CBPHA is homologous to HBPHA and catalyzes the cleavage of CBP to 2-carboxylbenzaldehyde and pyruvate during the degradation of phenanthrene. They are member of the DHDPS family of Schiff-base-dependent class I aldolases.
Probab=22.62 E-value=87 Score=22.78 Aligned_cols=32 Identities=6% Similarity=0.128 Sum_probs=27.8
Q ss_pred ecCCCcCChHHHHHHHHHHHhccccCCccccc
Q psy11534 39 WTGDNMAEWSHLKISLPMCLSLAVSEKKSANF 70 (81)
Q Consensus 39 W~GD~~s~W~~L~~~i~~~l~~~lsG~~~~gs 70 (81)
|..|..-+++.|+..+..++..++.|+-..|+
T Consensus 20 ~~~~g~iD~~~l~~lv~~li~~Gv~Gi~v~Gs 51 (309)
T cd00952 20 WRATDTVDLDETARLVERLIAAGVDGILTMGT 51 (309)
T ss_pred cccCCCcCHHHHHHHHHHHHHcCCCEEEECcc
Confidence 55677899999999999999999999877665
No 40
>PF07655 Secretin_N_2: Secretin N-terminal domain; InterPro: IPR011514 This is a short domain found in bacterial type II/III secretory system proteins. The architecture of these proteins suggests that this family may be functionally analogous to IPR005644 from INTERPRO.; GO: 0009297 pilus assembly, 0019867 outer membrane
Probab=21.29 E-value=79 Score=19.22 Aligned_cols=19 Identities=21% Similarity=0.370 Sum_probs=15.5
Q ss_pred CCCcCChHHHHHHHHHHHh
Q psy11534 41 GDNMAEWSHLKISLPMCLS 59 (81)
Q Consensus 41 GD~~s~W~~L~~~i~~~l~ 59 (81)
=+...=|+.|+..|..+|.
T Consensus 66 ~s~~dfW~~L~~~l~~ilg 84 (98)
T PF07655_consen 66 ESKSDFWEDLQKTLQAILG 84 (98)
T ss_pred EECCchHHHHHHHHHHHhC
Confidence 3446679999999999995
No 41
>TIGR02965 xanthine_xdhB xanthine dehydrogenase, molybdopterin binding subunit. Members of the protein family are the molybdopterin-containing large subunit (or, in, eukaryotes, the molybdopterin-binding domain) of xanthine dehydrogenase, and enzyme that reduces the purine pool by catabolizing xanthine to urate. This model is based primarily on bacterial sequences; it does not manage to include all eukaryotic xanthine dehydrogenases and thereby discriminate them from the closely related enzyme aldehyde dehydrogenase.
Probab=21.26 E-value=83 Score=25.96 Aligned_cols=42 Identities=19% Similarity=0.261 Sum_probs=32.0
Q ss_pred eeecC-CCcCCh------HHHHHHHHHHHhccccCCccccccCCCCcccC
Q psy11534 37 AIWTG-DNMAEW------SHLKISLPMCLSLAVSEKKSANFVVGIGAAAG 79 (81)
Q Consensus 37 ~~W~G-D~~s~W------~~L~~~i~~~l~~~lsG~~~~gsDiG~~gf~g 79 (81)
+.|.. |...-| ..+|..|+..|.+-..-+-....++| |||.+
T Consensus 181 A~~~~~g~l~v~~stQ~p~~~r~~lA~~Lglp~~~VrV~~~~vG-GgFG~ 229 (758)
T TIGR02965 181 AVPGEDGGMHVWSSTQHPSEVQHLVAHVLGVPSHAVTVEVRRMG-GGFGG 229 (758)
T ss_pred EEEcCCCCEEEEECCCChHHHHHHHHHHhCCChHHEEEEeCCCc-cCccc
Confidence 35653 344444 46899999999999999999999997 78865
No 42
>PRK00924 5-keto-4-deoxyuronate isomerase; Provisional
Probab=20.50 E-value=57 Score=24.03 Aligned_cols=17 Identities=29% Similarity=0.683 Sum_probs=13.9
Q ss_pred CCcCccccceeec--CCCc
Q psy11534 28 GFAGSQRFGAIWT--GDNM 44 (81)
Q Consensus 28 ~~aGsqry~~~W~--GD~~ 44 (81)
.-+|++.|..+|. |+|.
T Consensus 244 ~g~gt~~y~fiw~m~gen~ 262 (276)
T PRK00924 244 SGVGTSNYTFIWGMAGENQ 262 (276)
T ss_pred cCcCccccEEEEEecccCc
Confidence 3579999999997 8874
No 43
>PHA03371 circ protein; Provisional
Probab=20.27 E-value=1.1e+02 Score=22.20 Aligned_cols=36 Identities=17% Similarity=0.105 Sum_probs=30.9
Q ss_pred ChHhHHHHHHHHhhcCCCCCceEEeccCCcCccccce
Q psy11534 1 MLQVMATFEGLLKRSNYQQRPFILTRSGFAGSQRFGA 37 (81)
Q Consensus 1 ~~~~ka~~e~l~~~~~~~~R~filsRs~~aGsqry~~ 37 (81)
|+.+|..+|-|....-. +|-|.+.+..+.++|--+-
T Consensus 104 H~LA~sLhdFL~~~GLS-qRDl~v~d~~~~~sqm~pl 139 (240)
T PHA03371 104 HFLAKSLHDFLTTKGLS-QRDLMVVDGTFLDSQMRPL 139 (240)
T ss_pred hhhHHHHHHHHHhcCcc-ccceEEEeeeeecccccce
Confidence 67899999999987554 6999999999999998774
Done!