Query psy438
Match_columns 669
No_of_seqs 383 out of 3290
Neff 5.7
Searched_HMMs 46136
Date Sat Aug 17 00:02:38 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy438.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/438hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2230|consensus 100.0 2.6E-73 5.7E-78 613.3 37.6 398 64-560 258-663 (867)
2 COG3250 LacZ Beta-galactosidas 100.0 8.3E-49 1.8E-53 455.3 28.6 402 10-554 171-615 (808)
3 PRK10150 beta-D-glucuronidase; 100.0 1.8E-43 4E-48 405.3 32.2 313 10-451 171-521 (604)
4 PRK10340 ebgA cryptic beta-D-g 100.0 6.2E-42 1.4E-46 410.8 33.0 250 10-389 198-476 (1021)
5 PRK09525 lacZ beta-D-galactosi 100.0 1.1E-40 2.4E-45 399.7 32.8 192 75-390 290-490 (1027)
6 PF02836 Glyco_hydro_2_C: Glyc 100.0 2.1E-33 4.6E-38 295.6 13.9 169 255-451 36-218 (298)
7 KOG2230|consensus 99.9 3.9E-23 8.4E-28 225.2 11.4 147 519-669 369-517 (867)
8 PF02836 Glyco_hydro_2_C: Glyc 99.8 2.4E-19 5.1E-24 189.1 4.4 139 474-645 10-159 (298)
9 PRK10150 beta-D-glucuronidase; 99.7 1E-18 2.2E-23 201.2 7.2 166 474-668 287-477 (604)
10 PRK10340 ebgA cryptic beta-D-g 99.7 1.6E-17 3.4E-22 200.8 6.5 139 474-645 329-477 (1021)
11 PRK09525 lacZ beta-D-galactosi 99.6 9.3E-17 2E-21 194.0 6.0 139 474-645 345-490 (1027)
12 COG3250 LacZ Beta-galactosidas 99.6 1.5E-16 3.2E-21 186.6 4.7 113 474-599 295-408 (808)
13 PF03198 Glyco_hydro_72: Gluca 99.1 8E-10 1.7E-14 116.8 11.4 175 255-442 53-249 (314)
14 PF00703 Glyco_hydro_2: Glycos 99.0 5.9E-09 1.3E-13 92.3 13.6 106 8-123 1-110 (110)
15 PF00150 Cellulase: Cellulase 98.6 6.5E-07 1.4E-11 92.2 12.9 127 257-389 23-172 (281)
16 PF02449 Glyco_hydro_42: Beta- 98.2 3.3E-06 7.3E-11 92.4 9.4 134 255-388 10-234 (374)
17 PF14488 DUF4434: Domain of un 97.8 0.0004 8.7E-09 68.2 13.1 123 254-391 19-163 (166)
18 PLN03059 beta-galactosidase; P 97.8 0.00063 1.4E-08 80.9 17.0 123 255-389 59-222 (840)
19 KOG2024|consensus 97.5 8.6E-05 1.9E-09 76.8 4.7 55 84-151 242-297 (297)
20 PF01301 Glyco_hydro_35: Glyco 97.5 0.0009 1.9E-08 72.3 12.2 89 256-344 25-151 (319)
21 COG1874 LacA Beta-galactosidas 97.5 0.00071 1.5E-08 79.1 11.8 89 256-345 31-163 (673)
22 COG3934 Endo-beta-mannanase [C 97.2 0.00078 1.7E-08 74.9 6.9 180 254-452 25-258 (587)
23 PF12876 Cellulase-like: Sugar 96.8 0.0024 5.2E-08 56.0 5.3 64 324-389 1-64 (88)
24 PF12876 Cellulase-like: Sugar 96.4 0.0055 1.2E-07 53.7 5.4 64 579-644 1-64 (88)
25 KOG0496|consensus 93.8 1.4 3E-05 51.6 15.2 88 256-344 50-176 (649)
26 PF07745 Glyco_hydro_53: Glyco 93.1 0.57 1.2E-05 51.2 10.2 125 258-387 27-174 (332)
27 PF13204 DUF4038: Protein of u 92.9 0.57 1.2E-05 50.0 9.8 115 255-387 30-184 (289)
28 PF00150 Cellulase: Cellulase 91.3 0.3 6.6E-06 50.2 5.2 73 566-644 99-172 (281)
29 COG3534 AbfA Alpha-L-arabinofu 91.1 2.2 4.8E-05 48.1 11.8 108 257-384 51-207 (501)
30 PF02449 Glyco_hydro_42: Beta- 90.4 0.89 1.9E-05 50.0 8.2 81 563-643 103-234 (374)
31 COG5309 Exo-beta-1,3-glucanase 90.4 1.7 3.7E-05 46.0 9.6 118 255-392 63-181 (305)
32 TIGR03356 BGL beta-galactosida 89.4 1.5 3.2E-05 49.5 9.0 88 256-345 55-164 (427)
33 COG3867 Arabinogalactan endo-1 83.3 11 0.00024 40.7 10.9 125 257-386 65-219 (403)
34 PRK13511 6-phospho-beta-galact 83.0 4.6 9.9E-05 46.2 8.7 88 256-346 55-164 (469)
35 PRK15014 6-phospho-beta-glucos 79.8 6.3 0.00014 45.2 8.3 87 256-344 70-180 (477)
36 PF14488 DUF4434: Domain of un 77.4 4.8 0.0001 39.7 5.7 94 536-644 66-161 (166)
37 COG2730 BglC Endoglucanase [Ca 76.6 12 0.00026 41.9 9.3 89 257-345 75-193 (407)
38 TIGR01233 lacG 6-phospho-beta- 75.0 11 0.00024 43.1 8.6 88 256-346 54-163 (467)
39 smart00633 Glyco_10 Glycosyl h 72.4 20 0.00042 37.4 9.1 69 312-386 54-122 (254)
40 COG3934 Endo-beta-mannanase [C 70.8 4 8.6E-05 46.5 3.6 97 540-648 87-190 (587)
41 PF02638 DUF187: Glycosyl hydr 68.1 85 0.0018 34.0 13.0 132 254-387 18-227 (311)
42 PLN02849 beta-glucosidase 65.8 21 0.00046 41.3 8.3 88 256-345 80-190 (503)
43 PRK09593 arb 6-phospho-beta-gl 65.3 26 0.00056 40.3 8.8 88 256-345 74-185 (478)
44 PRK09852 cryptic 6-phospho-bet 65.2 24 0.00052 40.6 8.5 88 256-345 72-183 (474)
45 PLN02814 beta-glucosidase 65.0 27 0.00058 40.5 8.9 89 256-346 78-189 (504)
46 PRK09589 celA 6-phospho-beta-g 64.0 27 0.00058 40.2 8.6 88 256-345 68-179 (476)
47 PLN02998 beta-glucosidase 62.5 28 0.0006 40.3 8.4 88 256-345 83-193 (497)
48 PRK09936 hypothetical protein; 61.9 64 0.0014 34.9 10.3 130 254-394 37-185 (296)
49 PF00331 Glyco_hydro_10: Glyco 57.7 30 0.00065 37.5 7.3 98 283-382 62-171 (320)
50 COG2723 BglB Beta-glucosidase/ 57.1 56 0.0012 37.5 9.4 90 256-347 60-173 (460)
51 PF03659 Glyco_hydro_71: Glyco 56.8 52 0.0011 36.9 9.1 75 255-337 17-96 (386)
52 PF00232 Glyco_hydro_1: Glycos 56.8 28 0.0006 39.6 7.1 91 255-347 58-171 (455)
53 PF14871 GHL6: Hypothetical gl 55.0 42 0.0009 31.9 6.9 44 259-302 4-66 (132)
54 TIGR00542 hxl6Piso_put hexulos 54.3 55 0.0012 34.2 8.4 77 255-341 94-185 (279)
55 PF01229 Glyco_hydro_39: Glyco 52.9 63 0.0014 37.1 9.2 123 256-388 40-202 (486)
56 cd00019 AP2Ec AP endonuclease 49.9 39 0.00085 35.2 6.5 80 255-342 85-180 (279)
57 cd06598 GH31_transferase_CtsZ 48.5 3.8E+02 0.0083 28.9 14.0 134 257-394 72-218 (317)
58 cd00019 AP2Ec AP endonuclease 46.8 3.1E+02 0.0068 28.5 12.6 85 257-342 12-110 (279)
59 TIGR03234 OH-pyruv-isom hydrox 46.6 84 0.0018 32.2 8.2 84 255-343 84-182 (254)
60 cd02742 GH20_hexosaminidase Be 45.7 1.4E+02 0.003 32.1 9.9 112 255-385 16-188 (303)
61 COG0296 GlgB 1,4-alpha-glucan 44.8 36 0.00078 40.5 5.6 47 255-301 165-235 (628)
62 cd06565 GH20_GcnA-like Glycosy 44.4 2.6E+02 0.0057 30.0 11.8 125 255-387 17-184 (301)
63 COG1501 Alpha-glucosidases, fa 42.0 3E+02 0.0065 33.8 12.9 185 257-454 323-527 (772)
64 PF01261 AP_endonuc_2: Xylose 40.1 1.2E+02 0.0027 29.2 7.9 82 255-343 71-169 (213)
65 COG3693 XynA Beta-1,4-xylanase 37.2 1.8E+02 0.0039 32.1 8.9 92 282-384 84-188 (345)
66 PRK09856 fructoselysine 3-epim 37.0 1.5E+02 0.0033 30.6 8.4 76 255-337 90-179 (275)
67 PRK13209 L-xylulose 5-phosphat 36.9 2.1E+02 0.0046 29.7 9.5 78 255-342 99-191 (283)
68 PRK01060 endonuclease IV; Prov 36.0 2.2E+02 0.0047 29.6 9.4 84 257-340 14-112 (281)
69 PF13204 DUF4038: Protein of u 35.9 78 0.0017 33.9 6.1 55 569-641 128-183 (289)
70 COG5014 Predicted Fe-S oxidore 35.1 1.2E+02 0.0027 30.6 6.7 71 252-337 75-149 (228)
71 PF07905 PucR: Purine cataboli 32.4 1.1E+02 0.0025 28.2 5.9 44 255-298 59-104 (123)
72 COG2876 AroA 3-deoxy-D-arabino 30.2 1.8E+02 0.0038 31.3 7.4 46 254-302 58-118 (286)
73 PF05089 NAGLU: Alpha-N-acetyl 30.2 1.2E+02 0.0026 33.5 6.4 117 254-386 94-215 (333)
74 PF08220 HTH_DeoR: DeoR-like h 28.4 63 0.0014 26.0 3.0 26 254-280 28-53 (57)
75 TIGR02631 xylA_Arthro xylose i 28.3 8.1E+02 0.018 27.4 12.7 83 257-340 34-138 (382)
76 PF04339 DUF482: Protein of un 28.2 2E+02 0.0043 32.2 7.8 87 255-345 120-212 (370)
77 KOG2233|consensus 27.0 3.8E+02 0.0083 31.3 9.6 141 230-386 132-279 (666)
78 KOG2836|consensus 26.8 1.5E+02 0.0032 28.9 5.5 81 245-340 23-106 (173)
79 PRK10426 alpha-glucosidase; Pr 26.5 1E+03 0.022 28.6 13.8 133 257-394 271-416 (635)
80 cd06564 GH20_DspB_LnbB-like Gl 25.5 8.5E+02 0.019 26.3 12.1 115 255-386 17-198 (326)
81 cd06563 GH20_chitobiase-like T 25.4 7.6E+02 0.017 27.1 11.8 124 255-386 18-223 (357)
82 cd06568 GH20_SpHex_like A subg 25.1 5.3E+02 0.012 28.1 10.4 106 255-380 18-188 (329)
83 smart00642 Aamy Alpha-amylase 23.3 1.4E+02 0.0031 29.2 5.0 48 256-303 20-93 (166)
84 cd06562 GH20_HexA_HexB-like Be 23.3 8.7E+02 0.019 26.6 11.7 117 255-380 18-202 (348)
85 PF14386 DUF4417: Domain of un 23.2 3.3E+02 0.0072 27.7 7.8 76 257-339 99-175 (200)
86 COG5309 Exo-beta-1,3-glucanase 23.0 2E+02 0.0044 31.0 6.3 121 508-649 63-183 (305)
87 PF05706 CDKN3: Cyclin-depende 22.9 3.6E+02 0.0077 27.0 7.6 78 256-342 59-144 (168)
88 smart00518 AP2Ec AP endonuclea 22.2 2.7E+02 0.0059 28.7 7.2 81 250-336 79-170 (273)
89 PF06230 DUF1009: Protein of u 21.9 1.1E+02 0.0024 31.6 4.1 38 260-297 174-213 (214)
90 cd06570 GH20_chitobiase-like_1 21.7 7.7E+02 0.017 26.7 10.7 51 255-305 18-93 (311)
91 smart00481 POLIIIAc DNA polyme 21.4 2.6E+02 0.0057 22.6 5.5 45 257-301 17-62 (67)
92 PRK13397 3-deoxy-7-phosphohept 21.3 2E+02 0.0044 30.4 6.0 45 254-301 28-87 (250)
93 PF03162 Y_phosphatase2: Tyros 21.2 3.7E+02 0.008 26.4 7.5 81 259-342 23-104 (164)
94 TIGR03234 OH-pyruv-isom hydrox 21.1 8.8E+02 0.019 24.6 13.5 42 257-298 16-57 (254)
95 COG4750 LicC CTP:phosphocholin 21.1 1.2E+02 0.0026 31.3 3.9 42 254-297 31-72 (231)
96 PRK09997 hydroxypyruvate isome 20.5 3.6E+02 0.0079 27.7 7.7 42 257-298 17-58 (258)
No 1
>KOG2230|consensus
Probab=100.00 E-value=2.6e-73 Score=613.28 Aligned_cols=398 Identities=44% Similarity=0.763 Sum_probs=356.4
Q ss_pred EEEEecCCcceEEEEEEecCCCCccccCCCCCCCCCcEEEEEEEEeCcEEEEEEEEEEEEEEEEeeecCCCccccCCceE
Q psy438 64 RDLLVSTKEANMNMLRVWGGGSEVELWWPNGYGEQPLYNLQITLASGVEMSTKSIKIGFRTVELIQDHVDPNHLEKGRYF 143 (669)
Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~p~lW~p~g~g~P~LY~l~~~l~~g~~~d~~~~~~G~R~i~~~~~~~~~~~~~~G~~~ 143 (669)
-.+.+.+++..+++...+ ..+|+.|||+|+|.+.||.+++.. |+... +.++|||++++.+++..+ ++|+.|
T Consensus 258 Y~i~Lq~~k~i~~l~i~~--~~~~e~wwp~g~g~q~~y~~~v~~--gg~~~--ekki~frtvelv~~p~kp---~~g~nf 328 (867)
T KOG2230|consen 258 YRLSLQTSKNIMSLSIPM--EHEPERWWPNGMGEQKLYDVVVSM--GGQVK--EKKIGFKTVELVQDPKKP---EKGRNF 328 (867)
T ss_pred EEEEeccccceeeecccc--cccccccCCCCCCcceeEEEEEec--Cceee--eeeeeeEEEEEeecCCCC---CCCcee
Confidence 445677888777776655 378999999999999999999887 55443 339999999999987665 689999
Q ss_pred EEEEcCEEeEeccccccCCCCCCCCCCcchhhhhhhhhhhhhhccceeecCCccccccccCCCCcCCcccCCCcchhhhh
Q psy438 144 YFEVNKVPIYSKGSNLIPVDILPERSNNESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYEPVDILPERSNNESTIRDL 223 (669)
Q Consensus 144 ~~~lNG~~v~~rG~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (669)
||.+||.|+|+||+||+|.+.|..|.+.
T Consensus 329 yfkin~~pvflkg~nwip~s~f~dr~t~---------------------------------------------------- 356 (867)
T KOG2230|consen 329 YFKINDEPVFLKGTNWIPVSMFRDRENI---------------------------------------------------- 356 (867)
T ss_pred EEEEcCcEEEeecCCccChHHHHhhHHH----------------------------------------------------
Confidence 9999999999999999998887655433
Q ss_pred hhhhhhhhhhhhhhcCCccccccccccccchHHHHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEeecccc
Q psy438 224 LVSTKEANMNMLRVWGGGVYMTRYTDMARHESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQDMMFA 303 (669)
Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~e~~~~ 303 (669)
+.++.+|++++++||||||+||||.|++++||++||++|||||||+||+
T Consensus 357 -------------------------------~~~~~LL~Sv~e~~MN~lRVWGGGvYEsd~FY~lad~lGilVWQD~MFA 405 (867)
T KOG2230|consen 357 -------------------------------AKTEFLLDSVAEVGMNMLRVWGGGVYESDYFYQLADSLGILVWQDMMFA 405 (867)
T ss_pred -------------------------------HHHHHHHHHHHHhCcceEEEecCccccchhHHHHhhhccceehhhhHHH
Confidence 3688899999999999999999999999999999999999999999999
Q ss_pred cCCCCCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccc--cChhhHHHHHHHHHHHHHHHHHHhhCC
Q psy438 304 CNNYPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIR--ENPELYYKEYAELYVNTLKPIVLQYDP 381 (669)
Q Consensus 304 ~~~~p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~v~~~Dp 381 (669)
|+.||.+++|++++++|++..+.|+++||||++|+++||.+.+..++|+.. +....+.++|..+|++.|.++++.-|+
T Consensus 406 CAlYPt~~eFl~sv~eEV~yn~~Rls~HpSviIfsgNNENEaAl~~nWy~~sf~~~~~~~kdyvlly~~~i~el~l~~~~ 485 (867)
T KOG2230|consen 406 CALYPTNDEFLSSVREEVRYNAMRLSHHPSVIIFSGNNENEAALVQNWYGTSFERDRFESKDYVLLYANVIHELKLVSHS 485 (867)
T ss_pred hhcccCcHHHHHHHHHHHHHHHHhhccCCeEEEEeCCCccHHHHHhhhhcccccccchhhhhhhHHHHHHHHHHHhhcCC
Confidence 999999999999999999999999999999999999999999999999932 344456678889999999999999999
Q ss_pred CCCEEecCCCCCcccccccccccCCCCCCCCCcccccccccccCCCCCCCCCccccccccCCCCCcchhcccCCcccccC
Q psy438 382 TRPYLTSSPTNGIESEKAKYALADNPYSNIYGDTHNYDYYQNLWDPSTAPKSRFCSEFGIQSLPQLSTFQKVATEADLAS 461 (669)
Q Consensus 382 tRp~~~~S~~~~~~~~~~~~~~~~~py~~~~gd~h~y~~~~~~~~~~~~~~~~f~sE~G~~s~p~~~sl~~~~~~~~~~~ 461 (669)
+||++.|||.+|..++...| ++.||++..+||+|+|+|..++|+...||+|||+||||+||+|+.+++++..+++|| .
T Consensus 486 srPfi~SSPsNG~ete~e~~-VS~NP~dn~~GDVHfYdy~~d~W~~~ifp~pRfaSEyG~QS~P~~~t~~~~~~e~Dw-~ 563 (867)
T KOG2230|consen 486 SRPFIVSSPSNGKETEPENY-VSSNPQDNQNGDVHFYDYTKDGWDPGIFPRPRFASEYGFQSFPGAYTWQRSKGEDDW-L 563 (867)
T ss_pred CCCceecCCCCCcccCcccc-ccCCCccccCCceEeeehhhccCCCCcccCchhhhhcCcccCccHHHHHhccCccch-h
Confidence 99999999999999998899 999999999999999999999999999999999999999999999999999999999 8
Q ss_pred CCCcccccccccCCchHHHHHHHhhhcCCCCcc-----HHHHHHHhhHhHHHHHHHHHHHHHhccccc-ccCCCCCCCCC
Q psy438 462 WRTPFFDSRQHLAGGTGILESSVGHQFEIGNLT-----LEYFAYLSQIYQAGAIKTITEQMRRDKGVL-REDGSGHNMGA 535 (669)
Q Consensus 462 ~~~~~~~h~~~~~~G~~~~~~~i~~~yg~P~d~-----ledFvy~SQvmQae~~R~~iE~~Rr~~n~~-R~~g~~~~sG~ 535 (669)
.+|.++.|||||++|+..++.++.+|+++|.+. .+..+|.||++|++|+|...|+|||.++.. |+ .+++||+
T Consensus 564 ~~sk~~~HRQHHpgGn~~~l~~v~~HlplP~s~~~~~~~k~i~YfsQV~Qa~a~KteTe~YRr~R~t~~~~--~g~tMgA 641 (867)
T KOG2230|consen 564 LGSKLITHRQHHPGGNVPVLALVERHLPLPFSENENYATKLIAYFSQVAQAMATKTETELYRRLRDTPHRT--LGNTMGA 641 (867)
T ss_pred hccceeeeeccCCCCcchHHHHHHhcCCCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHhccCchhh--hccchhe
Confidence 999999999999999999999999999999764 666789999999999999999999976654 33 5678999
Q ss_pred eeEeeccccCccceEehhhhhhhhc
Q psy438 536 LYWQLNDTCDELGILIWQDMMFACN 560 (669)
Q Consensus 536 LyWq~nd~~D~~GilvW~E~~~~~~ 560 (669)
+||||||.|-. -.|+.+.+...
T Consensus 642 lYWQLNDvW~A---PsWssidf~gn 663 (867)
T KOG2230|consen 642 LYWQLNDVWVA---PSWSSIDFYGN 663 (867)
T ss_pred eeeeeccEEec---CcceeeeccCc
Confidence 99999999887 45776665443
No 2
>COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]
Probab=100.00 E-value=8.3e-49 Score=455.28 Aligned_cols=402 Identities=24% Similarity=0.323 Sum_probs=298.1
Q ss_pred eEeEEEEeecCCCCceeecceEEEEEeeccccCCccce------------EeEEEecCCCccceeEEEEEecCCcceEEE
Q psy438 10 FRTVELVQDHVDPNHLEKGRYFYFEVNKVPIYSKGSNL------------IPVDVLPERSNNESTIRDLLVSTKEANMNM 77 (669)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (669)
||||.|.- ++..|+.+ +.+.+.+...+.+++.. +.+++.-.+..+..... . +.+++..+..
T Consensus 171 ~RdV~l~i--~p~~~~~d---i~V~t~~~~~~~~~~~~~~~~~~~~~~~~ls~~L~~~~~~v~~~~~-~-~~~~~~~~~~ 243 (808)
T COG3250 171 HRDVMLYI--TPNTHVDD---ITVVTHLAEDCNHASLDVKIQQVVANGLDLSVELRDAEQQVVATGQ-E-RGTGQGAAAG 243 (808)
T ss_pred cceeEEEE--ccceeEee---eEEEEecchhhhhhheeeheeeecccCceEEEEEEcCCCceeeeee-c-cccCCcceeE
Confidence 56888885 44778876 67777776666555443 22333322111111111 1 3455566666
Q ss_pred EEEecCCCCccccCCCCCCCCCcEEEEEEEEe-CcEEEEEEEEEEEEEEEEeeecCCCccccCCceEEEEEcCEEeEecc
Q psy438 78 LRVWGGGSEVELWWPNGYGEQPLYNLQITLAS-GVEMSTKSIKIGFRTVELIQDHVDPNHLEKGRYFYFEVNKVPIYSKG 156 (669)
Q Consensus 78 ~~~~~~~~~p~lW~p~g~g~P~LY~l~~~l~~-g~~~d~~~~~~G~R~i~~~~~~~~~~~~~~G~~~~~~lNG~~v~~rG 156 (669)
.+.+ ++|+||||+ +|+||+|.++|.+ |.++|+...+||||+|++.. |. |+|||+|||+||
T Consensus 244 ~~~V---~~p~lWsp~---~P~LY~l~~~L~~~~~~~d~~~~~iGfR~iei~~----------~~---~~iNGkpvf~kG 304 (808)
T COG3250 244 ELKV---ENPKLWSPE---DPYLYRLVVTLKDANTLIDAEALRIGFRTVEIKD----------GL---LLINGKPVFIRG 304 (808)
T ss_pred EEec---cCcccCCCC---CCceEEEEEEEEeCCceeeEEEeeeccEEEEEEC----------Ce---EEECCeEEEEee
Confidence 6667 899999999 9999999999999 68899999999999999975 32 999999999999
Q ss_pred ccccCCCCCCCCCCcchhhhhhhhhhhhhhccceeecCCccccccccCCCCcCCcccCCCcchhhhhhhhhhhhhhhhhh
Q psy438 157 SNLIPVDILPERSNNESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYEPVDILPERSNNESTIRDLLVSTKEANMNMLR 236 (669)
Q Consensus 157 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (669)
+|||+.++..++++.+
T Consensus 305 vnrHe~~~~~G~~~~~---------------------------------------------------------------- 320 (808)
T COG3250 305 VNRHEDDPILGRVTDE---------------------------------------------------------------- 320 (808)
T ss_pred eecccCCCccccccCH----------------------------------------------------------------
Confidence 9999999998887653
Q ss_pred hcCCccccccccccccchHHHHHHHHHHHHcCCCEEEccCCCCCC-cHHHHHHHHHcCCEEEeecccccCCCCCChHHHH
Q psy438 237 VWGGGVYMTRYTDMARHESTIRDLLVSTKEANMNMLRVWGGGVYM-SDYFYETCDELGILIWQDMMFACNNYPATPTFLQ 315 (669)
Q Consensus 237 ~~~~~~~~~~~~~~~~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~-~~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~ 315 (669)
+.+.++|++||++|+||||+| ||| ++.||+||||+|||||+|+|++|+.+|.++.|.+
T Consensus 321 ------------------~~~~~dl~lmk~~n~N~vRts---HyP~~~~~ydLcDelGllV~~Ea~~~~~~~~~~~~~~k 379 (808)
T COG3250 321 ------------------DAMERDLKLMKEANMNSVRTS---HYPNSEEFYDLCDELGLLVIDEAMIETHGMPDDPEWRK 379 (808)
T ss_pred ------------------HHHHHHHHHHHHcCCCEEEec---CCCCCHHHHHHHHHhCcEEEEecchhhcCCCCCcchhH
Confidence 258889999999999999995 677 7899999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEecCCC-C--
Q psy438 316 SVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPT-N-- 392 (669)
Q Consensus 316 ~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~~S~~-~-- 392 (669)
.+.+++++||+|+||||||||||+|||...+.. ...+...+++.||+|++..-+.. .
T Consensus 380 ~~~~~i~~mver~knHPSIiiWs~gNE~~~g~~--------------------~~~~~~~~k~~d~~r~~~~~~~~~~~~ 439 (808)
T COG3250 380 EVSEEVRRMVERDRNHPSIIIWSLGNESGHGSN--------------------HWALYRWFKASDPTRPVQYEGRGTEAT 439 (808)
T ss_pred HHHHHHHHHHHhccCCCcEEEEeccccccCccc--------------------cHHHHHHHhhcCCccceeccCccceee
Confidence 999999999999999999999999999865431 13455788888999888765432 0
Q ss_pred ---C----cccccccccccCCC-------C-----CCCCCccccccccccc--CCC---CCCCCCcccc--ccccCCCCC
Q psy438 393 ---G----IESEKAKYALADNP-------Y-----SNIYGDTHNYDYYQNL--WDP---STAPKSRFCS--EFGIQSLPQ 446 (669)
Q Consensus 393 ---~----~~~~~~~~~~~~~p-------y-----~~~~gd~h~y~~~~~~--~~~---~~~~~~~f~s--E~G~~s~p~ 446 (669)
. ...+...| ...| | +..+|+.|+|..+... .+. -.+-.++|++ |+|.++.+.
T Consensus 440 ~~~~~~~~~~~~~~~~--~~~p~p~~l~~~~~~~~n~~~G~~~yw~~~~~~~~~~~~~~~~~~~~~~~~~~e~g~~~~~~ 517 (808)
T COG3250 440 DILSPMYERVDEILYF--PGSPRPLILCEYAHAMGNSYGGDYHYWGVFGEYPRLQGGFIWDWVDQRLIPIDETGNQAYAY 517 (808)
T ss_pred ecccchhhcCcccccc--CCCCCCceeecchHhhccCcCCceeeeehhccccccccceehhhcccccCcccccCcccccc
Confidence 0 00000111 1111 1 1224555666544321 000 1123577888 999999999
Q ss_pred cchhcccCCcccccCCCCcccccccccCCchHHHHHHHhhhcCCCCccHHHHHHHhhHhHHHHHHHHHHHHHhccccccc
Q psy438 447 LSTFQKVATEADLASWRTPFFDSRQHLAGGTGILESSVGHQFEIGNLTLEYFAYLSQIYQAGAIKTITEQMRRDKGVLRE 526 (669)
Q Consensus 447 ~~sl~~~~~~~~~~~~~~~~~~h~~~~~~G~~~~~~~i~~~yg~P~d~ledFvy~SQvmQae~~R~~iE~~Rr~~n~~R~ 526 (669)
...+..+...... ..++.++.-|+.+.+++..+.......|+.+++ .+-|.+.+|.+|+++.+.....+++...
T Consensus 518 ~g~~~~~~~~~~~-~~~~~v~~~~~~~~~~~e~~~~~~~~~f~~~k~-~~~~~~~s~~~~~~~~~~~~~~~~~~~~---- 591 (808)
T COG3250 518 GGDFGDYPNDRSF-ELNGLVFPDRQPNPGLKEAKVGTQFWAFGDPKT-FQGFLVTSENLFAEADRERLPKLRALLG---- 591 (808)
T ss_pred ccccCCCccccch-hcccccccccccCccchheeEEEeeeeeecccc-CCceEEechhheeeeehhhhhhhhhccc----
Confidence 9888887766555 566677666666556555544445556677765 6669999999999999999999888643
Q ss_pred CCCCCCCCCeeEeeccccCccceEehhh
Q psy438 527 DGSGHNMGALYWQLNDTCDELGILIWQD 554 (669)
Q Consensus 527 ~g~~~~sG~LyWq~nd~~D~~GilvW~E 554 (669)
.|.+..+|++++.++......+++
T Consensus 592 ----~~~~~~~~~l~~~~pg~~~~~~~~ 615 (808)
T COG3250 592 ----ETLGSLEWQLNDVPPGASWSSLDE 615 (808)
T ss_pred ----eEeeeEEEEeecCCCccccccccc
Confidence 488999999999999988777776
No 3
>PRK10150 beta-D-glucuronidase; Provisional
Probab=100.00 E-value=1.8e-43 Score=405.28 Aligned_cols=313 Identities=21% Similarity=0.255 Sum_probs=212.5
Q ss_pred eEeEEEEeecCCCCceeecceEEEEEeeccccCCccceEeEEEecCCCccceeEEEEEecCCcce-----EEEEEEecCC
Q psy438 10 FRTVELVQDHVDPNHLEKGRYFYFEVNKVPIYSKGSNLIPVDVLPERSNNESTIRDLLVSTKEAN-----MNMLRVWGGG 84 (669)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 84 (669)
+|+|+|...+ +.||++ +++.+.+...+.++...+.+++ .+.. ......+....++.. ....+.+
T Consensus 171 ~r~V~L~~~~--~~~i~d---v~v~~~~~~~~~~a~v~v~v~~--~~~~-~~~~~~l~~~~g~~v~~~~~~~~~~~i--- 239 (604)
T PRK10150 171 HRPVMLYTTP--KTHIDD---ITVVTELAQDLNHASVDWSVET--NGDV-DSVSVTLRDADGQVVATGQGTSGTLQV--- 239 (604)
T ss_pred CceEEEEEcC--CccCce---EEEEeecCCcCceEEEEEEEEE--cCCc-eEEEEEEECCCCCEEEeccccceEEEc---
Confidence 4889998754 788885 5666554333333333333333 2211 111111111111111 1233445
Q ss_pred CCccccCCCCCCCCCcEEEEEEEEe-CcEEEEEEEEEEEEEEEEeeecCCCccccCCceEEEEEcCEEeEeccccccCCC
Q psy438 85 SEVELWWPNGYGEQPLYNLQITLAS-GVEMSTKSIKIGFRTVELIQDHVDPNHLEKGRYFYFEVNKVPIYSKGSNLIPVD 163 (669)
Q Consensus 85 ~~p~lW~p~g~g~P~LY~l~~~l~~-g~~~d~~~~~~G~R~i~~~~~~~~~~~~~~G~~~~~~lNG~~v~~rG~n~~~~~ 163 (669)
++|+||||+ +|+||+|+++|.. |+++|+.+.+||||+|+++. + . |+|||+||+|||+|+|+..
T Consensus 240 ~~p~lW~p~---~P~LY~l~v~l~~~g~~~d~~~~~~GfR~i~~~~----------~-~--f~lNG~pv~lrG~~~h~~~ 303 (604)
T PRK10150 240 VNPHLWQPG---EGYLYTLCVELAKSGTECDTYPLRFGIRSVAVKG----------G-Q--FLINGKPFYFKGFGKHEDA 303 (604)
T ss_pred CCCcccCCC---CCceEEEEEEEeeCCeeEEEEEeeeEEEEEEEeC----------C-E--EEECCEEEEEEeeeccCCC
Confidence 899999996 9999999999987 88999999999999999963 2 2 8999999999999999876
Q ss_pred CCCCCCCcchhhhhhhhhhhhhhccceeecCCccccccccCCCCcCCcccCCCcchhhhhhhhhhhhhhhhhhhcCCccc
Q psy438 164 ILPERSNNESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYEPVDILPERSNNESTIRDLLVSTKEANMNMLRVWGGGVY 243 (669)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (669)
...|++.+
T Consensus 304 ~~~G~a~~------------------------------------------------------------------------ 311 (604)
T PRK10150 304 DIRGKGLD------------------------------------------------------------------------ 311 (604)
T ss_pred CccCCcCC------------------------------------------------------------------------
Confidence 65444322
Q ss_pred cccccccccchHHHHHHHHHHHHcCCCEEEccCCCCCC-cHHHHHHHHHcCCEEEeecccccC----C---------CC-
Q psy438 244 MTRYTDMARHESTIRDLLVSTKEANMNMLRVWGGGVYM-SDYFYETCDELGILIWQDMMFACN----N---------YP- 308 (669)
Q Consensus 244 ~~~~~~~~~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~-~~~~~~~cDe~Gilv~~e~~~~~~----~---------~p- 308 (669)
++.+++++++||++|+|+||+ +|+| +++||++|||+|||||+|+|.... . .+
T Consensus 312 ----------~~~~~~d~~l~K~~G~N~vR~---sh~p~~~~~~~~cD~~GllV~~E~p~~~~~~~~~~~~~~~~~~~~~ 378 (604)
T PRK10150 312 ----------EVLNVHDHNLMKWIGANSFRT---SHYPYSEEMLDLADRHGIVVIDETPAVGLNLSFGAGLEAGNKPKET 378 (604)
T ss_pred ----------HHHHHHHHHHHHHCCCCEEEe---ccCCCCHHHHHHHHhcCcEEEEeccccccccccccccccccccccc
Confidence 135778899999999999999 6898 689999999999999999985311 0 00
Q ss_pred -----CChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCC
Q psy438 309 -----ATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTR 383 (669)
Q Consensus 309 -----~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptR 383 (669)
.+|++.+.+.+++++||+|+||||||++||++||+.... +....+++.|.+++|++||||
T Consensus 379 ~~~~~~~~~~~~~~~~~~~~mv~r~~NHPSIi~Ws~gNE~~~~~---------------~~~~~~~~~l~~~~k~~DptR 443 (604)
T PRK10150 379 YSEEAVNGETQQAHLQAIRELIARDKNHPSVVMWSIANEPASRE---------------QGAREYFAPLAELTRKLDPTR 443 (604)
T ss_pred ccccccchhHHHHHHHHHHHHHHhccCCceEEEEeeccCCCccc---------------hhHHHHHHHHHHHHHhhCCCC
Confidence 125688889999999999999999999999999975321 011235678899999999999
Q ss_pred CEEecCCCC-----CcccccccccccCCCCCCCCCccccccc-------ccccCCCCCCCCCccccccccCCCCCcchhc
Q psy438 384 PYLTSSPTN-----GIESEKAKYALADNPYSNIYGDTHNYDY-------YQNLWDPSTAPKSRFCSEFGIQSLPQLSTFQ 451 (669)
Q Consensus 384 p~~~~S~~~-----~~~~~~~~~~~~~~py~~~~gd~h~y~~-------~~~~~~~~~~~~~~f~sE~G~~s~p~~~sl~ 451 (669)
|++..+... .......+. ++.|.|.+||.+..-... ....+. ..+.+|.++||||+.+.+.+++..
T Consensus 444 ~vt~~~~~~~~~~~~~~~~~~Dv-~~~N~Y~~wy~~~~~~~~~~~~~~~~~~~~~-~~~~kP~~isEyg~~~~~~~h~~~ 521 (604)
T PRK10150 444 PVTCVNVMFATPDTDTVSDLVDV-LCLNRYYGWYVDSGDLETAEKVLEKELLAWQ-EKLHKPIIITEYGADTLAGLHSMY 521 (604)
T ss_pred ceEEEecccCCcccccccCcccE-EEEcccceecCCCCCHHHHHHHHHHHHHHHH-HhcCCCEEEEccCCccccccccCC
Confidence 999875311 111122333 455666666532110000 000011 123689999999998887765543
No 4
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=100.00 E-value=6.2e-42 Score=410.82 Aligned_cols=250 Identities=20% Similarity=0.306 Sum_probs=189.8
Q ss_pred eEeEEEEeecCCCCceeecceEEEEEeeccccCCccceEeEEEecCCCcc-ceeE-EE---------------EEecCCc
Q psy438 10 FRTVELVQDHVDPNHLEKGRYFYFEVNKVPIYSKGSNLIPVDVLPERSNN-ESTI-RD---------------LLVSTKE 72 (669)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~---------------~~~~~~~ 72 (669)
+|+|+|...+ +.|+++ +++.+.+...+.++...+.+++....... ..++ .. +.+. +.
T Consensus 198 ~R~V~L~~~p--~~~I~d---~~v~t~l~~~~~~a~l~v~v~~~n~~~~~~~~~v~~~l~d~~~~v~~~~~~~~~~~-~~ 271 (1021)
T PRK10340 198 FRDVYLVGKP--LTHIND---FTVRTDFDEDYCDATLSCEVVLENLAASPVVTTLEYTLFDGERVVHSSAIDHLAIE-KL 271 (1021)
T ss_pred cceEEEEEeC--CceEEe---eEEEeeccCccCceEEEEEEEEecCCCCccceEEEEEEECCCceEEEEeeceeecC-CC
Confidence 5788888755 778875 56666665555555444444442111100 0000 00 1111 12
Q ss_pred ceEEEEEEecCCCCccccCCCCCCCCCcEEEEEEEEe--CcEEEEEEEEEEEEEEEEeeecCCCccccCCceEEEEEcCE
Q psy438 73 ANMNMLRVWGGGSEVELWWPNGYGEQPLYNLQITLAS--GVEMSTKSIKIGFRTVELIQDHVDPNHLEKGRYFYFEVNKV 150 (669)
Q Consensus 73 ~~~~~~~~~~~~~~p~lW~p~g~g~P~LY~l~~~l~~--g~~~d~~~~~~G~R~i~~~~~~~~~~~~~~G~~~~~~lNG~ 150 (669)
..+.+.+.+ ++|+||||+ +|+||+|+++|.+ |+++|+.+.+||||+|+++. |. |+|||+
T Consensus 272 ~~~~~~~~i---~~p~lWs~e---~P~LY~l~v~l~~~~g~~~d~~~~~~GfR~iei~~----------~~---f~lNGk 332 (1021)
T PRK10340 272 TSASFAFTV---EQPQQWSAE---SPYLYHLVMTLKDANGNVLEVVPQRVGFRDIKVRD----------GL---FWINNR 332 (1021)
T ss_pred ceEEEEEEc---CCCCcCCCC---CCeeEEEEEEEEcCCCCEEEEEEeeeEEEEEEEEC----------CE---EEECCE
Confidence 233445555 899999998 9999999999985 78999999999999999963 32 999999
Q ss_pred EeEeccccccCCCCCCCCCCcchhhhhhhhhhhhhhccceeecCCccccccccCCCCcCCcccCCCcchhhhhhhhhhhh
Q psy438 151 PIYSKGSNLIPVDILPERSNNESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYEPVDILPERSNNESTIRDLLVSTKEA 230 (669)
Q Consensus 151 ~v~~rG~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (669)
||+|||+|+|+.++..|++.+
T Consensus 333 pi~lrGvnrh~~~p~~G~a~~----------------------------------------------------------- 353 (1021)
T PRK10340 333 YVKLHGVNRHDNDHRKGRAVG----------------------------------------------------------- 353 (1021)
T ss_pred EEEEEEeecCCCCcccCccCC-----------------------------------------------------------
Confidence 999999999998877666432
Q ss_pred hhhhhhhcCCccccccccccccchHHHHHHHHHHHHcCCCEEEccCCCCCC-cHHHHHHHHHcCCEEEeecccccCCCC-
Q psy438 231 NMNMLRVWGGGVYMTRYTDMARHESTIRDLLVSTKEANMNMLRVWGGGVYM-SDYFYETCDELGILIWQDMMFACNNYP- 308 (669)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~-~~~~~~~cDe~Gilv~~e~~~~~~~~p- 308 (669)
++.++++|++||++|+|+||+ +||| +++|||+|||+|||||+|+++.|+.++
T Consensus 354 -----------------------~e~~~~dl~lmK~~g~NavR~---sHyP~~~~fydlcDe~GllV~dE~~~e~~g~~~ 407 (1021)
T PRK10340 354 -----------------------MDRVEKDIQLMKQHNINSVRT---AHYPNDPRFYELCDIYGLFVMAETDVESHGFAN 407 (1021)
T ss_pred -----------------------HHHHHHHHHHHHHCCCCEEEe---cCCCCCHHHHHHHHHCCCEEEECCcccccCccc
Confidence 136888999999999999999 6898 479999999999999999998776532
Q ss_pred --------CChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhC
Q psy438 309 --------ATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYD 380 (669)
Q Consensus 309 --------~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~D 380 (669)
++|.|.+.+.+++++||+|+||||||++||+|||+..+. .+..|.+++|++|
T Consensus 408 ~~~~~~~~~~p~~~~~~~~~~~~mV~RdrNHPSIi~WslGNE~~~g~--------------------~~~~~~~~~k~~D 467 (1021)
T PRK10340 408 VGDISRITDDPQWEKVYVDRIVRHIHAQKNHPSIIIWSLGNESGYGC--------------------NIRAMYHAAKALD 467 (1021)
T ss_pred ccccccccCCHHHHHHHHHHHHHHHHhCCCCCEEEEEECccCccccH--------------------HHHHHHHHHHHhC
Confidence 467899999999999999999999999999999985322 1257889999999
Q ss_pred CCCCEEecC
Q psy438 381 PTRPYLTSS 389 (669)
Q Consensus 381 ptRp~~~~S 389 (669)
||||++...
T Consensus 468 ptR~v~~~~ 476 (1021)
T PRK10340 468 DTRLVHYEE 476 (1021)
T ss_pred CCceEEeCC
Confidence 999998764
No 5
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=100.00 E-value=1.1e-40 Score=399.66 Aligned_cols=192 Identities=22% Similarity=0.424 Sum_probs=163.5
Q ss_pred EEEEEEecCCCCccccCCCCCCCCCcEEEEEEEEe--CcEEEEEEEEEEEEEEEEeeecCCCccccCCceEEEEEcCEEe
Q psy438 75 MNMLRVWGGGSEVELWWPNGYGEQPLYNLQITLAS--GVEMSTKSIKIGFRTVELIQDHVDPNHLEKGRYFYFEVNKVPI 152 (669)
Q Consensus 75 ~~~~~~~~~~~~p~lW~p~g~g~P~LY~l~~~l~~--g~~~d~~~~~~G~R~i~~~~~~~~~~~~~~G~~~~~~lNG~~v 152 (669)
+.+.+.+ ++|+||||+ +|+||+|+++|.+ |+++|+.+.+||||+|+++. |. |+|||+||
T Consensus 290 ~~~~~~v---~~p~lWs~e---~P~LY~l~v~l~~~~g~v~d~~~~~~GfR~iei~~----------~~---f~LNGkpi 350 (1027)
T PRK09525 290 VTLRLNV---ENPKLWSAE---TPNLYRAVVSLLDADGTLIEAEAYDVGFRKVEIEN----------GL---LKLNGKPL 350 (1027)
T ss_pred EEEEEEc---CCCCcCCCC---CCeeEEEEEEEEeCCCcEEEEEEeeEEEEEEEEEC----------CE---EEECCEEE
Confidence 3455555 899999997 9999999999975 78889999999999999963 32 99999999
Q ss_pred EeccccccCCCCCCCCCCcchhhhhhhhhhhhhhccceeecCCccccccccCCCCcCCcccCCCcchhhhhhhhhhhhhh
Q psy438 153 YSKGSNLIPVDILPERSNNESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYEPVDILPERSNNESTIRDLLVSTKEANM 232 (669)
Q Consensus 153 ~~rG~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (669)
+|||+|+|+.++..++..+
T Consensus 351 ~lrGvn~h~~~p~~G~a~t------------------------------------------------------------- 369 (1027)
T PRK09525 351 LIRGVNRHEHHPEHGQVMD------------------------------------------------------------- 369 (1027)
T ss_pred EEEEeEccccCcccCccCC-------------------------------------------------------------
Confidence 9999999988876665432
Q ss_pred hhhhhcCCccccccccccccchHHHHHHHHHHHHcCCCEEEccCCCCCC-cHHHHHHHHHcCCEEEeecccccCC-----
Q psy438 233 NMLRVWGGGVYMTRYTDMARHESTIRDLLVSTKEANMNMLRVWGGGVYM-SDYFYETCDELGILIWQDMMFACNN----- 306 (669)
Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~-~~~~~~~cDe~Gilv~~e~~~~~~~----- 306 (669)
++.++++|++||++|+|+||+ +||| +++||++|||+|||||+|+++.++.
T Consensus 370 ---------------------~e~~~~di~lmK~~g~NaVR~---sHyP~~p~fydlcDe~GilV~dE~~~e~hg~~~~~ 425 (1027)
T PRK09525 370 ---------------------EETMVQDILLMKQHNFNAVRC---SHYPNHPLWYELCDRYGLYVVDEANIETHGMVPMN 425 (1027)
T ss_pred ---------------------HHHHHHHHHHHHHCCCCEEEe---cCCCCCHHHHHHHHHcCCEEEEecCccccCCcccc
Confidence 246888999999999999999 6898 5799999999999999999985433
Q ss_pred -CCCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCE
Q psy438 307 -YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPY 385 (669)
Q Consensus 307 -~p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~ 385 (669)
...+++|.+.+.+++++||+|+|||||||+||++||...+. ....|.+++|++|||||+
T Consensus 426 ~~~~dp~~~~~~~~~~~~mV~RdrNHPSIi~WSlgNE~~~g~--------------------~~~~l~~~~k~~DptRpV 485 (1027)
T PRK09525 426 RLSDDPRWLPAMSERVTRMVQRDRNHPSIIIWSLGNESGHGA--------------------NHDALYRWIKSNDPSRPV 485 (1027)
T ss_pred CCCCCHHHHHHHHHHHHHHHHhCCCCCEEEEEeCccCCCcCh--------------------hHHHHHHHHHhhCCCCcE
Confidence 23578999999999999999999999999999999985321 135678899999999999
Q ss_pred EecCC
Q psy438 386 LTSSP 390 (669)
Q Consensus 386 ~~~S~ 390 (669)
+..+.
T Consensus 486 ~y~~~ 490 (1027)
T PRK09525 486 QYEGG 490 (1027)
T ss_pred EECCC
Confidence 99763
No 6
>PF02836 Glyco_hydro_2_C: Glycosyl hydrolases family 2, TIM barrel domain; InterPro: IPR006103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. Beta-galactosidase from E. coli has a TIM-barrel-like core surrounded by four other largely beta domains [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3CMG_A 3FN9_C 1YQ2_A 3K4D_B 3LPG_B 3LPF_A 3K4A_B 3K46_B 3GM8_A 3DEC_A ....
Probab=100.00 E-value=2.1e-33 Score=295.55 Aligned_cols=169 Identities=28% Similarity=0.452 Sum_probs=113.4
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCC-cHHHHHHHHHcCCEEEeecccccCCC----------CCChHHHHHHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYM-SDYFYETCDELGILIWQDMMFACNNY----------PATPTFLQSVRSEISQ 323 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~-~~~~~~~cDe~Gilv~~e~~~~~~~~----------p~~~~~~~~~~~~~~~ 323 (669)
+.++++|.++|++|+|+||+ +|+| +++||++|||+|||||+|+|..+... ..+++|.+.+.+++++
T Consensus 36 ~~~~~d~~l~k~~G~N~iR~---~h~p~~~~~~~~cD~~GilV~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 112 (298)
T PF02836_consen 36 EAMERDLELMKEMGFNAIRT---HHYPPSPRFYDLCDELGILVWQEIPLEGHGSWQDFGNCNYDADDPEFRENAEQELRE 112 (298)
T ss_dssp HHHHHHHHHHHHTT-SEEEE---TTS--SHHHHHHHHHHT-EEEEE-S-BSCTSSSSTSCTSCTTTSGGHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhcCcceEEc---ccccCcHHHHHHHhhcCCEEEEeccccccCccccCCccccCCCCHHHHHHHHHHHHH
Confidence 46888899999999999999 5777 67999999999999999999843221 2478999999999999
Q ss_pred HHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEecCCCCCcccccccccc
Q psy438 324 TVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPTNGIESEKAKYAL 403 (669)
Q Consensus 324 ~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~~S~~~~~~~~~~~~~~ 403 (669)
||+|+||||||++|+++||+... .+++.|.+++|++|||||++.+|......... .
T Consensus 113 ~v~~~~NHPSIi~W~~gNE~~~~--------------------~~~~~l~~~~k~~DptRpv~~~~~~~~~~~~~----~ 168 (298)
T PF02836_consen 113 MVRRDRNHPSIIMWSLGNESDYR--------------------EFLKELYDLVKKLDPTRPVTYASNGWDPYVDD----I 168 (298)
T ss_dssp HHHHHTT-TTEEEEEEEESSHHH--------------------HHHHHHHHHHHHH-TTSEEEEETGTSGGSTSS----C
T ss_pred HHHcCcCcCchheeecCccCccc--------------------cchhHHHHHHHhcCCCCceeeccccccccccc----c
Confidence 99999999999999999999321 24578889999999999999887721110000 1
Q ss_pred cCCCCCCCCCccccccc---ccccCCCCCCCCCccccccccCCCCCcchhc
Q psy438 404 ADNPYSNIYGDTHNYDY---YQNLWDPSTAPKSRFCSEFGIQSLPQLSTFQ 451 (669)
Q Consensus 404 ~~~py~~~~gd~h~y~~---~~~~~~~~~~~~~~f~sE~G~~s~p~~~sl~ 451 (669)
..+-+..+|........ +...+. ....+|.|+||||..+.++.....
T Consensus 169 ~~~~~~~~y~~~~~~~~~~~~~~~~~-~~~~kP~i~sEyg~~~~~~~g~~~ 218 (298)
T PF02836_consen 169 IFDIYSGWYNGYGDPEDFEKYLEDWY-KYPDKPIIISEYGADAYNSKGGDS 218 (298)
T ss_dssp EECSETTTSSSCCHHHHHHHHHHHHH-HHCTS-EEEEEESEBBSST-TTHH
T ss_pred cccccccccCCcccHHHHHHHHHhcc-ccCCCCeEehhccccccccCCCcc
Confidence 11223333322111111 111110 134689999999999988655443
No 7
>KOG2230|consensus
Probab=99.89 E-value=3.9e-23 Score=225.23 Aligned_cols=147 Identities=40% Similarity=0.740 Sum_probs=132.2
Q ss_pred hcccccccCCCCCCCCCeeEeeccccCccceEehhhhhhhhcCCCCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCC
Q psy438 519 RDKGVLREDGSGHNMGALYWQLNDTCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEM 598 (669)
Q Consensus 519 r~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW~E~~~~~~~yp~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~ 598 (669)
.+||++|+||++.+....++ ++||++||+|||+|+++|+.||.+++|++++++|++.++.|+..||||++|+++||.
T Consensus 369 ~~MN~lRVWGGGvYEsd~FY---~lad~lGilVWQD~MFACAlYPt~~eFl~sv~eEV~yn~~Rls~HpSviIfsgNNEN 445 (867)
T KOG2230|consen 369 VGMNMLRVWGGGVYESDYFY---QLADSLGILVWQDMMFACALYPTNDEFLSSVREEVRYNAMRLSHHPSVIIFSGNNEN 445 (867)
T ss_pred hCcceEEEecCccccchhHH---HHhhhccceehhhhHHHhhcccCcHHHHHHHHHHHHHHHHhhccCCeEEEEeCCCcc
Confidence 38999999999988776664 889999999999999999999999999999999999999999999999999999999
Q ss_pred chhhhhhhhhccC--chhhhhhHHHHHHHHHHHHHHHhCCCCCeecCCCCCCcCCccccccccCCCCCCCCCC
Q psy438 599 EGATIQKWYIREN--PELYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPTNGIESEKAKYALADNPYSNIYGD 669 (669)
Q Consensus 599 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~L~~~vk~~DPTRp~~~sS~~~g~~~~~~~~~~~~n~y~~~y~~ 669 (669)
+.....+|...-. ...+.+++.-++.+.++++++..||+||.+-+||+||+.+....| ++.|||++.|||
T Consensus 446 EaAl~~nWy~~sf~~~~~~~kdyvlly~~~i~el~l~~~~srPfi~SSPsNG~ete~e~~-VS~NP~dn~~GD 517 (867)
T KOG2230|consen 446 EAALVQNWYGTSFERDRFESKDYVLLYANVIHELKLVSHSSRPFIVSSPSNGKETEPENY-VSSNPQDNQNGD 517 (867)
T ss_pred HHHHHhhhhcccccccchhhhhhhHHHHHHHHHHHhhcCCCCCceecCCCCCcccCcccc-ccCCCccccCCc
Confidence 8776667764322 234577777888888999999999999999999999999999998 888999999997
No 8
>PF02836 Glyco_hydro_2_C: Glycosyl hydrolases family 2, TIM barrel domain; InterPro: IPR006103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. Beta-galactosidase from E. coli has a TIM-barrel-like core surrounded by four other largely beta domains [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3CMG_A 3FN9_C 1YQ2_A 3K4D_B 3LPG_B 3LPF_A 3K4A_B 3K46_B 3GM8_A 3DEC_A ....
Probab=99.76 E-value=2.4e-19 Score=189.10 Aligned_cols=139 Identities=28% Similarity=0.479 Sum_probs=96.2
Q ss_pred CCchHHHHHHHhhhcCCCCccHHHHHHHhhHhHHHHHHHHHHHHHh-cccccccCCCCCCCCCeeEeeccccCccceEeh
Q psy438 474 AGGTGILESSVGHQFEIGNLTLEYFAYLSQIYQAGAIKTITEQMRR-DKGVLREDGSGHNMGALYWQLNDTCDELGILIW 552 (669)
Q Consensus 474 ~~G~~~~~~~i~~~yg~P~d~ledFvy~SQvmQae~~R~~iE~~Rr-~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW 552 (669)
.||.+.++++++.+...|.. +..+..+.++..++.++. +.|++|+. |++-... |+++||++||+||
T Consensus 10 lNGk~~~l~Gv~~h~~~~~~--------g~a~~~~~~~~d~~l~k~~G~N~iR~~---h~p~~~~--~~~~cD~~GilV~ 76 (298)
T PF02836_consen 10 LNGKPIFLRGVNRHQDYPGL--------GRAMPDEAMERDLELMKEMGFNAIRTH---HYPPSPR--FYDLCDELGILVW 76 (298)
T ss_dssp ETTEEE-EEEEEE-S-BTTT--------BT---HHHHHHHHHHHHHTT-SEEEET---TS--SHH--HHHHHHHHT-EEE
T ss_pred ECCEEEEEEEEeeCcCcccc--------cccCCHHHHHHHHHHHHhcCcceEEcc---cccCcHH--HHHHHhhcCCEEE
Confidence 46777777777766544432 445566777788888887 88999873 2333333 4699999999999
Q ss_pred hhhhhhhcC----------CCCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhhhhhhccCchhhhhhHHHH
Q psy438 553 QDMMFACNN----------YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAEL 622 (669)
Q Consensus 553 ~E~~~~~~~----------yp~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~~~~~~~~~~ 622 (669)
+|++..+.. ...++.+.+.+++++++||+|+||||||++||++||+.. ..
T Consensus 77 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~NHPSIi~W~~gNE~~~--------------------~~ 136 (298)
T PF02836_consen 77 QEIPLEGHGSWQDFGNCNYDADDPEFRENAEQELREMVRRDRNHPSIIMWSLGNESDY--------------------RE 136 (298)
T ss_dssp EE-S-BSCTSSSSTSCTSCTTTSGGHHHHHHHHHHHHHHHHTT-TTEEEEEEEESSHH--------------------HH
T ss_pred EeccccccCccccCCccccCCCCHHHHHHHHHHHHHHHHcCcCcCchheeecCccCcc--------------------cc
Confidence 999863221 124689999999999999999999999999999999832 23
Q ss_pred HHHHHHHHHHHhCCCCCeecCCC
Q psy438 623 YVNTLKPIVLQYDPTRPYLTSSP 645 (669)
Q Consensus 623 ~~~~L~~~vk~~DPTRp~~~sS~ 645 (669)
+.+.|.+++|++|||||++.+|.
T Consensus 137 ~~~~l~~~~k~~DptRpv~~~~~ 159 (298)
T PF02836_consen 137 FLKELYDLVKKLDPTRPVTYASN 159 (298)
T ss_dssp HHHHHHHHHHHH-TTSEEEEETG
T ss_pred chhHHHHHHHhcCCCCceeeccc
Confidence 34568999999999999998887
No 9
>PRK10150 beta-D-glucuronidase; Provisional
Probab=99.75 E-value=1e-18 Score=201.18 Aligned_cols=166 Identities=18% Similarity=0.175 Sum_probs=114.9
Q ss_pred CCchHHHHHHHhhhcCCCCccHHHHHHHhhHhHHHHHHHHHHHHHh-cccccccCCCCCCCCCeeEeeccccCccceEeh
Q psy438 474 AGGTGILESSVGHQFEIGNLTLEYFAYLSQIYQAGAIKTITEQMRR-DKGVLREDGSGHNMGALYWQLNDTCDELGILIW 552 (669)
Q Consensus 474 ~~G~~~~~~~i~~~yg~P~d~ledFvy~SQvmQae~~R~~iE~~Rr-~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW 552 (669)
.||.+.+++++++|-..+. .+..+..+.++..++.++. +.|++|+. |++-... |+++||++||+||
T Consensus 287 lNG~pv~lrG~~~h~~~~~--------~G~a~~~~~~~~d~~l~K~~G~N~vR~s---h~p~~~~--~~~~cD~~GllV~ 353 (604)
T PRK10150 287 INGKPFYFKGFGKHEDADI--------RGKGLDEVLNVHDHNLMKWIGANSFRTS---HYPYSEE--MLDLADRHGIVVI 353 (604)
T ss_pred ECCEEEEEEeeeccCCCCc--------cCCcCCHHHHHHHHHHHHHCCCCEEEec---cCCCCHH--HHHHHHhcCcEEE
Confidence 4666666666665433221 1334455566777888888 88998872 3433332 4699999999999
Q ss_pred hhhhhhhc----C---------CC------CChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhhhhhhccCch
Q psy438 553 QDMMFACN----N---------YP------ATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPE 613 (669)
Q Consensus 553 ~E~~~~~~----~---------yp------~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~ 613 (669)
+|++.... . .+ .++++.+++++++++||+|+||||||++||++||.....
T Consensus 354 ~E~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~mv~r~~NHPSIi~Ws~gNE~~~~~----------- 422 (604)
T PRK10150 354 DETPAVGLNLSFGAGLEAGNKPKETYSEEAVNGETQQAHLQAIRELIARDKNHPSVVMWSIANEPASRE----------- 422 (604)
T ss_pred EecccccccccccccccccccccccccccccchhHHHHHHHHHHHHHHhccCCceEEEEeeccCCCccc-----------
Confidence 99974211 0 00 125788999999999999999999999999999985321
Q ss_pred hhhhhHHHHHHHHHHHHHHHhCCCCCeecCCCC-----CCcCCccccccccCCCCCCCCC
Q psy438 614 LYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPT-----NGIESEKAKYALADNPYSNIYG 668 (669)
Q Consensus 614 ~~~~~~~~~~~~~L~~~vk~~DPTRp~~~sS~~-----~g~~~~~~~~~~~~n~y~~~y~ 668 (669)
.....+.+.|.+++|++||||||+.++-. ....... .+.++.|.|++||.
T Consensus 423 ----~~~~~~~~~l~~~~k~~DptR~vt~~~~~~~~~~~~~~~~~-~Dv~~~N~Y~~wy~ 477 (604)
T PRK10150 423 ----QGAREYFAPLAELTRKLDPTRPVTCVNVMFATPDTDTVSDL-VDVLCLNRYYGWYV 477 (604)
T ss_pred ----hhHHHHHHHHHHHHHhhCCCCceEEEecccCCcccccccCc-ccEEEEcccceecC
Confidence 11234456699999999999999977521 1222233 34689999999994
No 10
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=99.69 E-value=1.6e-17 Score=200.81 Aligned_cols=139 Identities=13% Similarity=0.186 Sum_probs=107.9
Q ss_pred CCchHHHHHHHhhhcCCCCccHHHHHHHhhHhHHHHHHHHHHHHHh-cccccccCCCCCCCCCeeEeeccccCccceEeh
Q psy438 474 AGGTGILESSVGHQFEIGNLTLEYFAYLSQIYQAGAIKTITEQMRR-DKGVLREDGSGHNMGALYWQLNDTCDELGILIW 552 (669)
Q Consensus 474 ~~G~~~~~~~i~~~yg~P~d~ledFvy~SQvmQae~~R~~iE~~Rr-~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW 552 (669)
.||..++.+++++|-..|.. +..+..+.++..++.+++ ++|++|++ |++.... |+++||++||+||
T Consensus 329 lNGkpi~lrGvnrh~~~p~~--------G~a~~~e~~~~dl~lmK~~g~NavR~s---HyP~~~~--fydlcDe~GllV~ 395 (1021)
T PRK10340 329 INNRYVKLHGVNRHDNDHRK--------GRAVGMDRVEKDIQLMKQHNINSVRTA---HYPNDPR--FYELCDIYGLFVM 395 (1021)
T ss_pred ECCEEEEEEEeecCCCCccc--------CccCCHHHHHHHHHHHHHCCCCEEEec---CCCCCHH--HHHHHHHCCCEEE
Confidence 47777777788766444432 555667778888999988 89999974 4444443 3699999999999
Q ss_pred hhhhhhhcCC---------CCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhhhhhhccCchhhhhhHHHHH
Q psy438 553 QDMMFACNNY---------PATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELY 623 (669)
Q Consensus 553 ~E~~~~~~~y---------p~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~~~~~~~~~~~ 623 (669)
+|++..|..+ ..++.+.+.+.+++++||+|+||||||++||+|||+..+ ..+
T Consensus 396 dE~~~e~~g~~~~~~~~~~~~~p~~~~~~~~~~~~mV~RdrNHPSIi~WslGNE~~~g-------------------~~~ 456 (1021)
T PRK10340 396 AETDVESHGFANVGDISRITDDPQWEKVYVDRIVRHIHAQKNHPSIIIWSLGNESGYG-------------------CNI 456 (1021)
T ss_pred ECCcccccCcccccccccccCCHHHHHHHHHHHHHHHHhCCCCCEEEEEECccCcccc-------------------HHH
Confidence 9998665432 135788999999999999999999999999999998533 122
Q ss_pred HHHHHHHHHHhCCCCCeecCCC
Q psy438 624 VNTLKPIVLQYDPTRPYLTSSP 645 (669)
Q Consensus 624 ~~~L~~~vk~~DPTRp~~~sS~ 645 (669)
..|.+++|++|||||++....
T Consensus 457 -~~~~~~~k~~DptR~v~~~~~ 477 (1021)
T PRK10340 457 -RAMYHAAKALDDTRLVHYEED 477 (1021)
T ss_pred -HHHHHHHHHhCCCceEEeCCC
Confidence 458899999999999997653
No 11
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=99.64 E-value=9.3e-17 Score=194.03 Aligned_cols=139 Identities=18% Similarity=0.316 Sum_probs=108.6
Q ss_pred CCchHHHHHHHhhhcCCCCccHHHHHHHhhHhHHHHHHHHHHHHHh-cccccccCCCCCCCCCeeEeeccccCccceEeh
Q psy438 474 AGGTGILESSVGHQFEIGNLTLEYFAYLSQIYQAGAIKTITEQMRR-DKGVLREDGSGHNMGALYWQLNDTCDELGILIW 552 (669)
Q Consensus 474 ~~G~~~~~~~i~~~yg~P~d~ledFvy~SQvmQae~~R~~iE~~Rr-~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW 552 (669)
.||.+++++++++|-..|.. +..+..+.++..++.+++ ++|++|++ |+.-... |+++||++||+||
T Consensus 345 LNGkpi~lrGvn~h~~~p~~--------G~a~t~e~~~~di~lmK~~g~NaVR~s---HyP~~p~--fydlcDe~GilV~ 411 (1027)
T PRK09525 345 LNGKPLLIRGVNRHEHHPEH--------GQVMDEETMVQDILLMKQHNFNAVRCS---HYPNHPL--WYELCDRYGLYVV 411 (1027)
T ss_pred ECCEEEEEEEeEccccCccc--------CccCCHHHHHHHHHHHHHCCCCEEEec---CCCCCHH--HHHHHHHcCCEEE
Confidence 47777777888776544432 556777888888999998 89999973 4444433 3699999999999
Q ss_pred hhhhhhhcC------CCCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhhhhhhccCchhhhhhHHHHHHHH
Q psy438 553 QDMMFACNN------YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNT 626 (669)
Q Consensus 553 ~E~~~~~~~------yp~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 626 (669)
+|+++.+.. ...++.+.+++.+++++||+|+||||||++||++||..... . .+.
T Consensus 412 dE~~~e~hg~~~~~~~~~dp~~~~~~~~~~~~mV~RdrNHPSIi~WSlgNE~~~g~-------------------~-~~~ 471 (1027)
T PRK09525 412 DEANIETHGMVPMNRLSDDPRWLPAMSERVTRMVQRDRNHPSIIIWSLGNESGHGA-------------------N-HDA 471 (1027)
T ss_pred EecCccccCCccccCCCCCHHHHHHHHHHHHHHHHhCCCCCEEEEEeCccCCCcCh-------------------h-HHH
Confidence 998864322 22468899999999999999999999999999999986321 1 244
Q ss_pred HHHHHHHhCCCCCeecCCC
Q psy438 627 LKPIVLQYDPTRPYLTSSP 645 (669)
Q Consensus 627 L~~~vk~~DPTRp~~~sS~ 645 (669)
|.+++|++|||||++..+.
T Consensus 472 l~~~~k~~DptRpV~y~~~ 490 (1027)
T PRK09525 472 LYRWIKSNDPSRPVQYEGG 490 (1027)
T ss_pred HHHHHHhhCCCCcEEECCC
Confidence 8899999999999998864
No 12
>COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]
Probab=99.62 E-value=1.5e-16 Score=186.58 Aligned_cols=113 Identities=23% Similarity=0.422 Sum_probs=94.8
Q ss_pred CCchHHHHHHHhhhcCCCCccHHHHHHHhhHhHHHHHHHHHHHHHh-cccccccCCCCCCCCCeeEeeccccCccceEeh
Q psy438 474 AGGTGILESSVGHQFEIGNLTLEYFAYLSQIYQAGAIKTITEQMRR-DKGVLREDGSGHNMGALYWQLNDTCDELGILIW 552 (669)
Q Consensus 474 ~~G~~~~~~~i~~~yg~P~d~ledFvy~SQvmQae~~R~~iE~~Rr-~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW 552 (669)
.||..++.+|+++|-..|. .++.+-.+.|.-.+-.++. ++|++|++ +++.... |+++||++||+||
T Consensus 295 iNGkpvf~kGvnrHe~~~~--------~G~~~~~~~~~~dl~lmk~~n~N~vRts--HyP~~~~---~ydLcDelGllV~ 361 (808)
T COG3250 295 INGKPVFIRGVNRHEDDPI--------LGRVTDEDAMERDLKLMKEANMNSVRTS--HYPNSEE---FYDLCDELGLLVI 361 (808)
T ss_pred ECCeEEEEeeeecccCCCc--------cccccCHHHHHHHHHHHHHcCCCEEEec--CCCCCHH---HHHHHHHhCcEEE
Confidence 4777888888887655554 3777777777777778887 89999987 3333222 4699999999999
Q ss_pred hhhhhhhcCCCCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCc
Q psy438 553 QDMMFACNNYPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEME 599 (669)
Q Consensus 553 ~E~~~~~~~yp~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~ 599 (669)
+|++..|..++.++.+.+++.+|+++||+|++|||||||||++||..
T Consensus 362 ~Ea~~~~~~~~~~~~~~k~~~~~i~~mver~knHPSIiiWs~gNE~~ 408 (808)
T COG3250 362 DEAMIETHGMPDDPEWRKEVSEEVRRMVERDRNHPSIIIWSLGNESG 408 (808)
T ss_pred EecchhhcCCCCCcchhHHHHHHHHHHHHhccCCCcEEEEecccccc
Confidence 99999999999999999999999999999999999999999999975
No 13
>PF03198 Glyco_hydro_72: Glucanosyltransferase; InterPro: IPR004886 This family is a group of yeast glycolipid proteins anchored to the membrane. It includes Candida albicans (Yeast) pH-regulated protein, which is required for apical growth and plays a role in morphogenesis and Saccharomyces cerevisiae glycolipid anchored surface protein.; PDB: 2W61_A 2W62_A 2W63_A.
Probab=99.06 E-value=8e-10 Score=116.76 Aligned_cols=175 Identities=18% Similarity=0.232 Sum_probs=98.9
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCC---cHHHHHHHHHcCCEEEeecccccCCC-CCCh--HHHHHHHHHHHHHHHHh
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYM---SDYFYETCDELGILIWQDMMFACNNY-PATP--TFLQSVRSEISQTVRRV 328 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~---~~~~~~~cDe~Gilv~~e~~~~~~~~-p~~~--~~~~~~~~~~~~~v~r~ 328 (669)
+..++++.++|++|+|+||+.. ..| ++.++.++.+.||+|+.|+....... ..+| .|-....++...+|..+
T Consensus 53 ~~C~rDi~~l~~LgiNtIRVY~--vdp~~nHd~CM~~~~~aGIYvi~Dl~~p~~sI~r~~P~~sw~~~l~~~~~~vid~f 130 (314)
T PF03198_consen 53 EACKRDIPLLKELGINTIRVYS--VDPSKNHDECMSAFADAGIYVILDLNTPNGSINRSDPAPSWNTDLLDRYFAVIDAF 130 (314)
T ss_dssp HHHHHHHHHHHHHT-SEEEES-----TTS--HHHHHHHHHTT-EEEEES-BTTBS--TTS------HHHHHHHHHHHHHH
T ss_pred HHHHHhHHHHHHcCCCEEEEEE--eCCCCCHHHHHHHHHhCCCEEEEecCCCCccccCCCCcCCCCHHHHHHHHHHHHHh
Confidence 4788999999999999999964 344 57999999999999999986542211 2345 78888888889999999
Q ss_pred CCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCC-CCCEEecCCCC-------------Cc
Q psy438 329 QHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDP-TRPYLTSSPTN-------------GI 394 (669)
Q Consensus 329 rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~Dp-tRp~~~~S~~~-------------~~ 394 (669)
..+|-++....|||+...... .... -|....++.|+..+++... +.|+..++... |.
T Consensus 131 a~Y~N~LgFf~GNEVin~~~~-----t~aa----p~vKAavRD~K~Yi~~~~~R~IPVGYsaaD~~~~r~~~a~Yl~Cg~ 201 (314)
T PF03198_consen 131 AKYDNTLGFFAGNEVINDASN-----TNAA----PYVKAAVRDMKAYIKSKGYRSIPVGYSAADDAEIRQDLANYLNCGD 201 (314)
T ss_dssp TT-TTEEEEEEEESSS-STT------GGGH----HHHHHHHHHHHHHHHHSSS----EEEEE---TTTHHHHHHHTTBTT
T ss_pred ccCCceEEEEecceeecCCCC-----cccH----HHHHHHHHHHHHHHHhcCCCCCceeEEccCChhHHHHHHHHhcCCC
Confidence 999999999999998643210 1112 3455677888888888765 44666654321 11
Q ss_pred ccccccccccCCCCCCCCCcccccc-ccccc-CCCCCCCCCccccccccC
Q psy438 395 ESEKAKYALADNPYSNIYGDTHNYD-YYQNL-WDPSTAPKSRFCSEFGIQ 442 (669)
Q Consensus 395 ~~~~~~~~~~~~py~~~~gd~h~y~-~~~~~-~~~~~~~~~~f~sE~G~~ 442 (669)
.....+| .+.|.|. |.|+..+-. .|... -..++|+.|.|.||||.-
T Consensus 202 ~~~~iDf-~g~N~Y~-WCg~Stf~~SGy~~l~~~f~~y~vPvffSEyGCn 249 (314)
T PF03198_consen 202 DDERIDF-FGLNSYE-WCGDSTFETSGYDRLTKEFSNYSVPVFFSEYGCN 249 (314)
T ss_dssp -----S--EEEEE-----SS--HHHHSHHHHHHHHTT-SS-EEEEEE---
T ss_pred cccccce-eeeccce-ecCCCccccccHHHHHHHhhCCCCCeEEcccCCC
Confidence 1122344 4555544 566644211 01111 123578999999999984
No 14
>PF00703 Glyco_hydro_2: Glycosyl hydrolases family 2; InterPro: IPR006102 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities: beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This entry describes the immunoglobulin-like beta-sandwich domain [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3FN9_C 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3GM8_A 3HN3_E 1BHG_A 2VZU_A 2X09_A ....
Probab=99.02 E-value=5.9e-09 Score=92.33 Aligned_cols=106 Identities=18% Similarity=0.259 Sum_probs=62.1
Q ss_pred eeeEeEEEEeecCCCCceeecceEEEEEeeccccC-CccceEeEEEecCC-CccceeEEEEEecCCc-ceEEEEEEecCC
Q psy438 8 IGFRTVELVQDHVDPNHLEKGRYFYFEVNKVPIYS-KGSNLIPVDVLPER-SNNESTIRDLLVSTKE-ANMNMLRVWGGG 84 (669)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 84 (669)
++++||++..+-.+ + .+ ..+.+++....... .....+.+.+.... ..............+. ..+.+.+.+
T Consensus 1 v~I~dv~v~~~~~~-~--~~-~~v~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i--- 73 (110)
T PF00703_consen 1 VHIEDVFVTPDLDD-D--DS-AKVSVEVEVRNESNKPLDVTVRVRLFDPEGKKVVTQSPVVSLSAPGQARITLTIEI--- 73 (110)
T ss_dssp CEEEEEEEEEEEET-T--SE-EEEEEEEEEEEESSSSCEEEEEEEEEETTSEEEEEEEEEEEECCCCEEEEEEEEEE---
T ss_pred CEEEEEEEEEEEcC-C--CE-EEEEEEEEEEeCCCCcEEEEEEEEEECCCCCEEEEeeeEEEecCCceeEEEEEEEc---
Confidence 57889998875422 2 22 23445553222221 22234444555332 2222222222333332 233346677
Q ss_pred CCccccCCCCCCCCCcEEEEEEEEe-CcEEEEEEEEEEEE
Q psy438 85 SEVELWWPNGYGEQPLYNLQITLAS-GVEMSTKSIKIGFR 123 (669)
Q Consensus 85 ~~p~lW~p~g~g~P~LY~l~~~l~~-g~~~d~~~~~~G~R 123 (669)
++|+||||+ +|+||+|+++|.. |+.+|..+.+||||
T Consensus 74 ~~~~lW~p~---~P~LY~l~v~l~~~g~~~d~~~~~~GfR 110 (110)
T PF00703_consen 74 PNPKLWSPE---DPYLYTLEVELDDDGEVLDSIETRFGFR 110 (110)
T ss_dssp ESS-BBESS---SBSEEEEEEEEEETTEEEEEEEEEEEB-
T ss_pred CCCCCcCCC---CceEEEEEEEEEeCCEEEEEEEeEeeEC
Confidence 799999995 9999999999976 88999999999999
No 15
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=98.56 E-value=6.5e-07 Score=92.17 Aligned_cols=127 Identities=19% Similarity=0.236 Sum_probs=86.4
Q ss_pred HHHHHHHHHHcCCCEEEccCC--CCCC--------c------HHHHHHHHHcCCEEEeecccc--c-C---CCCCChHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGG--GVYM--------S------DYFYETCDELGILIWQDMMFA--C-N---NYPATPTFL 314 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g--~~~~--------~------~~~~~~cDe~Gilv~~e~~~~--~-~---~~p~~~~~~ 314 (669)
.+++++.+|++|+|+||++.+ ...+ . +.++++|.++||.|+-++--. + . .++......
T Consensus 23 ~~~~~~~~~~~G~n~VRi~v~~~~~~~~~~~~~~~~~~~~~ld~~v~~a~~~gi~vild~h~~~~w~~~~~~~~~~~~~~ 102 (281)
T PF00150_consen 23 TEADFDQLKALGFNTVRIPVGWEAYQEPNPGYNYDETYLARLDRIVDAAQAYGIYVILDLHNAPGWANGGDGYGNNDTAQ 102 (281)
T ss_dssp HHHHHHHHHHTTESEEEEEEESTSTSTTSTTTSBTHHHHHHHHHHHHHHHHTT-EEEEEEEESTTCSSSTSTTTTHHHHH
T ss_pred HHHHHHHHHHCCCCEEEeCCCHHHhcCCCCCccccHHHHHHHHHHHHHHHhCCCeEEEEeccCccccccccccccchhhH
Confidence 445678899999999998663 1110 0 468899999999998887642 2 1 122223334
Q ss_pred HHHHHHHHHHHHHhCCCcceEeecccCCcchhhh-hhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEecC
Q psy438 315 QSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATI-QKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSS 389 (669)
Q Consensus 315 ~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~~S 389 (669)
+.+.+-++.+.+|+++||.|+.|.+.||+..... ..| ....++.+. .+.+.+.+.+++.||.++++..+
T Consensus 103 ~~~~~~~~~la~~y~~~~~v~~~el~NEP~~~~~~~~w-~~~~~~~~~-----~~~~~~~~~Ir~~~~~~~i~~~~ 172 (281)
T PF00150_consen 103 AWFKSFWRALAKRYKDNPPVVGWELWNEPNGGNDDANW-NAQNPADWQ-----DWYQRAIDAIRAADPNHLIIVGG 172 (281)
T ss_dssp HHHHHHHHHHHHHHTTTTTTEEEESSSSGCSTTSTTTT-SHHHTHHHH-----HHHHHHHHHHHHTTSSSEEEEEE
T ss_pred HHHHhhhhhhccccCCCCcEEEEEecCCccccCCcccc-ccccchhhh-----hHHHHHHHHHHhcCCcceeecCC
Confidence 4455557889999999999999999999976532 123 112223322 34578889999999999987764
No 16
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=98.24 E-value=3.3e-06 Score=92.37 Aligned_cols=134 Identities=23% Similarity=0.336 Sum_probs=90.2
Q ss_pred HHHHHHHHHHHHcCCCEEEccC--CCCCCc----------HHHHHHHHHcCCEEEeeccccc------CCC---------
Q psy438 255 STIRDLLVSTKEANMNMLRVWG--GGVYMS----------DYFYETCDELGILIWQDMMFAC------NNY--------- 307 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~--g~~~~~----------~~~~~~cDe~Gilv~~e~~~~~------~~~--------- 307 (669)
+.+++++++||++|+|+||+.- .+..+| |.++++|.++||.|.--++... ..+
T Consensus 10 e~~~~d~~~m~~~G~n~vri~~~~W~~lEP~eG~ydF~~lD~~l~~a~~~Gi~viL~~~~~~~P~Wl~~~~Pe~~~~~~~ 89 (374)
T PF02449_consen 10 EEWEEDLRLMKEAGFNTVRIGEFSWSWLEPEEGQYDFSWLDRVLDLAAKHGIKVILGTPTAAPPAWLYDKYPEILPVDAD 89 (374)
T ss_dssp CHHHHHHHHHHHHT-SEEEE-CCEHHHH-SBTTB---HHHHHHHHHHHCTT-EEEEEECTTTS-HHHHCCSGCCC-B-TT
T ss_pred HHHHHHHHHHHHcCCCEEEEEEechhhccCCCCeeecHHHHHHHHHHHhccCeEEEEecccccccchhhhcccccccCCC
Confidence 4799999999999999999621 112222 6899999999999876554211 011
Q ss_pred -------------CCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchh------------------------hhhh
Q psy438 308 -------------PATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGA------------------------TIQK 350 (669)
Q Consensus 308 -------------p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~------------------------~~~~ 350 (669)
+.+|.+.+.+.+-++++++|+++||.|+.|.+.||.... ....
T Consensus 90 g~~~~~g~~~~~~~~~p~yr~~~~~~~~~l~~~y~~~p~vi~~~i~NE~~~~~~~~~~~~~~f~~wLk~kY~ti~~LN~a 169 (374)
T PF02449_consen 90 GRRRGFGSRQHYCPNSPAYREYARRFIRALAERYGDHPAVIGWQIDNEPGYHRCYSPACQAAFRQWLKEKYGTIEALNRA 169 (374)
T ss_dssp TSBEECCCSTT-HCCHHHHHHHHHHHHHHHHHHHTTTTTEEEEEECCSTTCTS--SHHHHHHHHHHHHHHHSSHHHHHHH
T ss_pred CCcCccCCccccchhHHHHHHHHHHHHHHHHhhccccceEEEEEeccccCcCcCCChHHHHHHHHHHHHHhCCHHHHHHH
Confidence 245678888888889999999999999999999996431 1112
Q ss_pred hcc---------------c------cChh------hHHHHHHHHHHHHHHHHHHhhCCCCCEEec
Q psy438 351 WYI---------------R------ENPE------LYYKEYAELYVNTLKPIVLQYDPTRPYLTS 388 (669)
Q Consensus 351 ~~~---------------~------~~~~------~~~~~~~~~~~~~l~~~v~~~DptRp~~~~ 388 (669)
|.. + .++. +|..+....|...+.+++|+.||.+|++.+
T Consensus 170 Wgt~~ws~~~~~f~~v~~P~~~~~~~~~~~~~D~~rF~~~~~~~~~~~~~~~ir~~~p~~~vt~n 234 (374)
T PF02449_consen 170 WGTAFWSQRYSSFDEVPPPRPTSSPENPAQWLDWYRFQSDRVAEFFRWQADIIREYDPDHPVTTN 234 (374)
T ss_dssp HTTTGGG---SSGGG---S-S-SS---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHSTT-EEE-E
T ss_pred HcCCcccCccCcHHhcCCCCCCCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeC
Confidence 221 0 0222 233344456778899999999999999865
No 17
>PF14488 DUF4434: Domain of unknown function (DUF4434)
Probab=97.77 E-value=0.0004 Score=68.22 Aligned_cols=123 Identities=18% Similarity=0.300 Sum_probs=84.9
Q ss_pred hHHHHHHHHHHHHcCCCEE---Ec--cCCCCCCc---------------HHHHHHHHHcCCEEEeecccccCCCC-CChH
Q psy438 254 ESTIRDLLVSTKEANMNML---RV--WGGGVYMS---------------DYFYETCDELGILIWQDMMFACNNYP-ATPT 312 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~i---R~--~~g~~~~~---------------~~~~~~cDe~Gilv~~e~~~~~~~~p-~~~~ 312 (669)
++.++++++.||++||+.| |+ .+..-+|+ +.++++||++||-|+-.+.+...-+. .+.+
T Consensus 19 ~~~W~~~~~~m~~~GidtlIlq~~~~~~~~~yps~~~~~~~~~~~~d~l~~~L~~A~~~Gmkv~~Gl~~~~~~w~~~~~~ 98 (166)
T PF14488_consen 19 PAQWREEFRAMKAIGIDTLILQWTGYGGFAFYPSKLSPGGFYMPPVDLLEMILDAADKYGMKVFVGLYFDPDYWDQGDLD 98 (166)
T ss_pred HHHHHHHHHHHHHcCCcEEEEEEeecCCcccCCccccCccccCCcccHHHHHHHHHHHcCCEEEEeCCCCchhhhccCHH
Confidence 3579999999999999998 21 11111222 46999999999999998876432111 2333
Q ss_pred HHHH-HHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEecCCC
Q psy438 313 FLQS-VRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPT 391 (669)
Q Consensus 313 ~~~~-~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~~S~~ 391 (669)
+... .+.-+.++..++.+|||+.-|=+..|..... |. ...+.+.|...++++.|..|+..|.-.
T Consensus 99 ~~~~~~~~v~~el~~~yg~h~sf~GWYip~E~~~~~---~~------------~~~~~~~l~~~lk~~s~~~Pv~ISpf~ 163 (166)
T PF14488_consen 99 WEAERNKQVADELWQRYGHHPSFYGWYIPYEIDDYN---WN------------APERFALLGKYLKQISPGKPVMISPFI 163 (166)
T ss_pred HHHHHHHHHHHHHHHHHcCCCCCceEEEecccCCcc---cc------------hHHHHHHHHHHHHHhCCCCCeEEecCc
Confidence 3221 1234456667899999999999999997643 11 123457888999999999999876443
No 18
>PLN03059 beta-galactosidase; Provisional
Probab=97.77 E-value=0.00063 Score=80.93 Aligned_cols=123 Identities=15% Similarity=0.342 Sum_probs=85.2
Q ss_pred HHHHHHHHHHHHcCCCEEEc---cCCCCCCc------------HHHHHHHHHcCCEEEeec-ccccCC-----C------
Q psy438 255 STIRDLLVSTKEANMNMLRV---WGGGVYMS------------DYFYETCDELGILIWQDM-MFACNN-----Y------ 307 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~---~~g~~~~~------------~~~~~~cDe~Gilv~~e~-~~~~~~-----~------ 307 (669)
+++++.|+.+|++|+|+|=+ |. .|-|. ..|+++|.|.||+|+.-. |+.|++ +
T Consensus 59 ~~W~d~L~k~Ka~GlNtV~tYV~Wn-~HEp~~G~~dF~G~~DL~~Fl~la~e~GLyvilRpGPYIcAEw~~GGlP~WL~~ 137 (840)
T PLN03059 59 EMWPDLIQKAKDGGLDVIQTYVFWN-GHEPSPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKY 137 (840)
T ss_pred HHHHHHHHHHHHcCCCeEEEEeccc-ccCCCCCeeeccchHHHHHHHHHHHHcCCEEEecCCcceeeeecCCCCchhhhc
Confidence 37889999999999999974 55 24332 489999999999997643 343332 1
Q ss_pred -C------CChHHHHHHHHHHHHHHHHhCCCc-------ceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHH
Q psy438 308 -P------ATPTFLQSVRSEISQTVRRVQHHP-------CIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLK 373 (669)
Q Consensus 308 -p------~~~~~~~~~~~~~~~~v~r~rnHP-------Si~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 373 (669)
| .++.|++.+.+-+.+++.+++.+| .|||--+-||-.... + ..... ...|++.|.
T Consensus 138 ~~~i~~Rs~d~~fl~~v~~~~~~l~~~l~~~~l~~~~GGPIImvQIENEYGs~~---~-~~~~~-------d~~Yl~~l~ 206 (840)
T PLN03059 138 VPGIEFRTDNGPFKAAMQKFTEKIVDMMKSEKLFEPQGGPIILSQIENEYGPVE---W-EIGAP-------GKAYTKWAA 206 (840)
T ss_pred CCCcccccCCHHHHHHHHHHHHHHHHHHhhcceeecCCCcEEEEEeccccccee---c-ccCcc-------hHHHHHHHH
Confidence 2 377899888777777777776444 589999999953210 0 00000 134678888
Q ss_pred HHHHhhCCCCCEEecC
Q psy438 374 PIVLQYDPTRPYLTSS 389 (669)
Q Consensus 374 ~~v~~~DptRp~~~~S 389 (669)
+++++..-+-|...+.
T Consensus 207 ~~~~~~Gi~VPl~t~d 222 (840)
T PLN03059 207 DMAVKLGTGVPWVMCK 222 (840)
T ss_pred HHHHHcCCCcceEECC
Confidence 8888888888887764
No 19
>KOG2024|consensus
Probab=97.54 E-value=8.6e-05 Score=76.83 Aligned_cols=55 Identities=31% Similarity=0.526 Sum_probs=49.7
Q ss_pred CCCccccCCCCCCCCCcEEEEEEEEe-CcEEEEEEEEEEEEEEEEeeecCCCccccCCceEEEEEcCEE
Q psy438 84 GSEVELWWPNGYGEQPLYNLQITLAS-GVEMSTKSIKIGFRTVELIQDHVDPNHLEKGRYFYFEVNKVP 151 (669)
Q Consensus 84 ~~~p~lW~p~g~g~P~LY~l~~~l~~-g~~~d~~~~~~G~R~i~~~~~~~~~~~~~~G~~~~~~lNG~~ 151 (669)
.+++++|||.+|+.+.||.+++.+.. |...|.....+|||++.+..+ + ++|||+|
T Consensus 242 ~~~~~~wwP~~mh~~~ly~lev~l~l~g~lqd~y~~~~gfrt~~~~n~---------~----~~ln~kp 297 (297)
T KOG2024|consen 242 VPNYQLWWPYLMHLGPLYSLEVVLELIGTLQDCYRLPVGFRTVNWGNS---------Q----GLLNGKP 297 (297)
T ss_pred cccccccchhhhcCCcccceEEEEEEeeehhhhhhccCceEEEEecCc---------c----ceecCCC
Confidence 48999999999999999999999987 988899999999999999653 3 8899986
No 20
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=97.49 E-value=0.0009 Score=72.26 Aligned_cols=89 Identities=21% Similarity=0.263 Sum_probs=57.7
Q ss_pred HHHHHHHHHHHcCCCEEEccCC--CCCCc------------HHHHHHHHHcCCEEEeec-ccccCC-----CC-------
Q psy438 256 TIRDLLVSTKEANMNMLRVWGG--GVYMS------------DYFYETCDELGILIWQDM-MFACNN-----YP------- 308 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~g--~~~~~------------~~~~~~cDe~Gilv~~e~-~~~~~~-----~p------- 308 (669)
.+++.|+.+|++|+|+|-+..- -|-+. ..|+++|.|.||.|+.-. |+.|.. +|
T Consensus 25 ~W~~~l~k~ka~G~n~v~~yv~W~~he~~~g~~df~g~~dl~~f~~~a~~~gl~vilrpGpyi~aE~~~gG~P~Wl~~~~ 104 (319)
T PF01301_consen 25 YWRDRLQKMKAAGLNTVSTYVPWNLHEPEEGQFDFTGNRDLDRFLDLAQENGLYVILRPGPYICAEWDNGGLPAWLLRKP 104 (319)
T ss_dssp GHHHHHHHHHHTT-SEEEEE--HHHHSSBTTB---SGGG-HHHHHHHHHHTT-EEEEEEES---TTBGGGG--GGGGGST
T ss_pred HHHHHHHHHHhCCcceEEEeccccccCCCCCcccccchhhHHHHHHHHHHcCcEEEecccceecccccchhhhhhhhccc
Confidence 5888899999999999986330 12221 489999999999997654 333332 22
Q ss_pred ------CChHHHHHHHHHHHHHHHHhCCC-----cceEeecccCCcc
Q psy438 309 ------ATPTFLQSVRSEISQTVRRVQHH-----PCIAVWAGNNEME 344 (669)
Q Consensus 309 ------~~~~~~~~~~~~~~~~v~r~rnH-----PSi~~W~~~NE~~ 344 (669)
.++.|++.+.+-++++++.++.| =.||+--+-||..
T Consensus 105 ~~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~GGpII~vQvENEyg 151 (319)
T PF01301_consen 105 DIRLRTNDPPFLEAVERWYRALAKIIKPLQYTNGGPIIMVQVENEYG 151 (319)
T ss_dssp TS-SSSS-HHHHHHHHHHHHHHHHHHGGGBGGGTSSEEEEEESSSGG
T ss_pred cccccccchhHHHHHHHHHHHHHHHHHhhhhcCCCceehhhhhhhhC
Confidence 46788888776666666655543 2588889999976
No 21
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=97.48 E-value=0.00071 Score=79.13 Aligned_cols=89 Identities=24% Similarity=0.391 Sum_probs=64.8
Q ss_pred HHHHHHHHHHHcCCCEEEc----cCCCCCCc----------HHHHHHHHHcCCEEEeec-c-cccC-----CC-------
Q psy438 256 TIRDLLVSTKEANMNMLRV----WGGGVYMS----------DYFYETCDELGILIWQDM-M-FACN-----NY------- 307 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~----~~g~~~~~----------~~~~~~cDe~Gilv~~e~-~-~~~~-----~~------- 307 (669)
.+.++|+.||++|+|.+|+ |+ -|-|. +.|+++|...||+|+.-- | .+|. .|
T Consensus 31 ~w~ddl~~mk~~G~N~V~ig~faW~-~~eP~eG~fdf~~~D~~~l~~a~~~Gl~vil~t~P~g~~P~Wl~~~~PeiL~~~ 109 (673)
T COG1874 31 TWMDDLRKMKALGLNTVRIGYFAWN-LHEPEEGKFDFTWLDEIFLERAYKAGLYVILRTGPTGAPPAWLAKKYPEILAVD 109 (673)
T ss_pred HHHHHHHHHHHhCCCeeEeeeEEee-ccCccccccCcccchHHHHHHHHhcCceEEEecCCCCCCchHHhcCChhheEec
Confidence 6888899999999999999 55 23332 247999999999998776 5 2221 12
Q ss_pred ---------------CCChHHHHHHHHHHHHHHHH-hCCCcceEeecccCCcch
Q psy438 308 ---------------PATPTFLQSVRSEISQTVRR-VQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 308 ---------------p~~~~~~~~~~~~~~~~v~r-~rnHPSi~~W~~~NE~~~ 345 (669)
+.++.+++.+..-++++.+| +.|||+|++|-..||=.+
T Consensus 110 ~~~~~~~~g~r~~~~~~~~~Yr~~~~~i~~~irer~~~~~~~v~~w~~dneY~~ 163 (673)
T COG1874 110 ENGRVRSDGARENICPVSPVYREYLDRILQQIRERLYGNGPAVITWQNDNEYGG 163 (673)
T ss_pred CCCcccCCCcccccccccHHHHHHHHHHHHHHHHHHhccCCceeEEEccCccCC
Confidence 22444556665555667777 899999999999998654
No 22
>COG3934 Endo-beta-mannanase [Carbohydrate transport and metabolism]
Probab=97.15 E-value=0.00078 Score=74.90 Aligned_cols=180 Identities=18% Similarity=0.246 Sum_probs=113.0
Q ss_pred hHHHHHHHHHHHHcCCCEEEccC-CCCC----------Cc-----HHHHHHHHHcCCEEEeecccc--------------
Q psy438 254 ESTIRDLLVSTKEANMNMLRVWG-GGVY----------MS-----DYFYETCDELGILIWQDMMFA-------------- 303 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~~~-g~~~----------~~-----~~~~~~cDe~Gilv~~e~~~~-------------- 303 (669)
++.++++++.++.+|++.+|+|- -|.. .+ +.|++.|-+++|-|.--+-..
T Consensus 25 ~~ei~~dle~a~~vg~k~lR~fiLDgEdc~d~~G~~na~s~~~y~~~fla~a~~l~lkvlitlivg~~hmgg~Nw~Ipwa 104 (587)
T COG3934 25 NREIKADLEPAGFVGVKDLRLFILDGEDCRDKEGYRNAGSNVWYAAWFLAPAGYLDLKVLITLIVGLKHMGGTNWRIPWA 104 (587)
T ss_pred hhhhhcccccccCccceeEEEEEecCcchhhhhceecccccHHHHHHHhhhcccCcceEEEEEeecccccCcceeEeecC
Confidence 35788999999999999999981 1011 11 579999999999774433211
Q ss_pred cCCCCC----ChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhh
Q psy438 304 CNNYPA----TPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQY 379 (669)
Q Consensus 304 ~~~~p~----~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~ 379 (669)
....|+ ++.+..+.++-++.+|+-++-||.|.-|.+.||+---. +..+..+. .+...|...+|.+
T Consensus 105 g~~~pdn~iyD~k~~~~~kkyvedlVk~yk~~ptI~gw~l~Ne~lv~~------p~s~N~f~-----~w~~emy~yiK~l 173 (587)
T COG3934 105 GEQSPDNVIYDPKFRGPGKKYVEDLVKPYKLDPTIAGWALRNEPLVEA------PISVNNFW-----DWSGEMYAYIKWL 173 (587)
T ss_pred CCCCccccccchhhcccHHHHHHHHhhhhccChHHHHHHhcCCccccc------cCChhHHH-----HHHHHHHHHhhcc
Confidence 111232 56778888888999999999999999999999932100 01121111 2457889999999
Q ss_pred CCCCCEEecCCCCCcccccccccccCCCCCCC----CCcccccc-ccccc---CC----------CCCCC-CCccccccc
Q psy438 380 DPTRPYLTSSPTNGIESEKAKYALADNPYSNI----YGDTHNYD-YYQNL---WD----------PSTAP-KSRFCSEFG 440 (669)
Q Consensus 380 DptRp~~~~S~~~~~~~~~~~~~~~~~py~~~----~gd~h~y~-~~~~~---~~----------~~~~~-~~~f~sE~G 440 (669)
||...+...-+.... ....||+.+ |+..|-|. |-.++ |. .+... +|.+.-|||
T Consensus 174 dd~hlvsvGD~~sp~--------~~~~pyN~r~~vDya~~hLY~hyd~sl~~r~s~~yg~~~l~i~~~~g~~pV~leefG 245 (587)
T COG3934 174 DDGHLVSVGDPASPW--------PQYAPYNARFYVDYAANHLYRHYDTSLVSRVSTVYGKPYLDIPTIMGWQPVNLEEFG 245 (587)
T ss_pred CCCCeeecCCcCCcc--------cccCCcccceeeccccchhhhhccCChhheeeeeecchhhccchhcccceeeccccC
Confidence 999998765332211 112344433 44556552 21222 10 01112 678888999
Q ss_pred c-CCCCCcchhcc
Q psy438 441 I-QSLPQLSTFQK 452 (669)
Q Consensus 441 ~-~s~p~~~sl~~ 452 (669)
+ .++++-.|--.
T Consensus 246 fsta~g~e~s~ay 258 (587)
T COG3934 246 FSTAFGQENSPAY 258 (587)
T ss_pred Ccccccccccchh
Confidence 7 57777655444
No 23
>PF12876 Cellulase-like: Sugar-binding cellulase-like; InterPro: IPR024778 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This entry represents a family of putative cellulase enzymes.; PDB: 3GYC_B.
Probab=96.76 E-value=0.0024 Score=56.00 Aligned_cols=64 Identities=20% Similarity=0.168 Sum_probs=37.1
Q ss_pred HHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEecC
Q psy438 324 TVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSS 389 (669)
Q Consensus 324 ~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~~S 389 (669)
.|.++++||.|++|-+.||....... -+.....+. ..+....+++.+.+.+|++||+.|++.+.
T Consensus 1 iv~~~~~~~~Il~Wdl~NE~p~~~~~-~~~~~~~~~-~~~~~~~~l~~~~~~iR~~dP~~pvt~g~ 64 (88)
T PF12876_consen 1 IVTRFGYDPRILAWDLWNEPPNNWAD-GYPAEWGDP-KAEAYAEWLKEAFRWIRAVDPSQPVTSGF 64 (88)
T ss_dssp -HHHTT-GGGEEEEESSTTTT-TT-T-T-TT-TT-T-TSHHHHHHHHHHHHHHHTT-TTS-EE--B
T ss_pred CchhhcCCCCEEEEEeecCCCCcccc-cccccccch-hHHHHHHHHHHHHHHHHHhCCCCcEEeec
Confidence 47899999999999999995442210 000000100 11233457789999999999999997653
No 24
>PF12876 Cellulase-like: Sugar-binding cellulase-like; InterPro: IPR024778 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This entry represents a family of putative cellulase enzymes.; PDB: 3GYC_B.
Probab=96.44 E-value=0.0055 Score=53.69 Aligned_cols=64 Identities=19% Similarity=0.164 Sum_probs=36.2
Q ss_pred HHHHhcCCceEEEEcCCCCCchhhhhhhhhccCchhhhhhHHHHHHHHHHHHHHHhCCCCCeecCC
Q psy438 579 TVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSS 644 (669)
Q Consensus 579 ~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~vk~~DPTRp~~~sS 644 (669)
.|.|.++||.|.+|-+.||......... ....... ..+....+.+++...+|++||+.|++.+.
T Consensus 1 iv~~~~~~~~Il~Wdl~NE~p~~~~~~~-~~~~~~~-~~~~~~~~l~~~~~~iR~~dP~~pvt~g~ 64 (88)
T PF12876_consen 1 IVTRFGYDPRILAWDLWNEPPNNWADGY-PAEWGDP-KAEAYAEWLKEAFRWIRAVDPSQPVTSGF 64 (88)
T ss_dssp -HHHTT-GGGEEEEESSTTTT-TT-TT--TT-TT-T-TSHHHHHHHHHHHHHHHTT-TTS-EE--B
T ss_pred CchhhcCCCCEEEEEeecCCCCcccccc-cccccch-hHHHHHHHHHHHHHHHHHhCCCCcEEeec
Confidence 3789999999999999999554210000 0000011 12223455666889999999999998665
No 25
>KOG0496|consensus
Probab=93.82 E-value=1.4 Score=51.63 Aligned_cols=88 Identities=17% Similarity=0.284 Sum_probs=55.3
Q ss_pred HHHHHHHHHHHcCCCEEEc---cCCCCCCc------------HHHHHHHHHcCCEEEeec-ccccCCC-----C------
Q psy438 256 TIRDLLVSTKEANMNMLRV---WGGGVYMS------------DYFYETCDELGILIWQDM-MFACNNY-----P------ 308 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~---~~g~~~~~------------~~~~~~cDe~Gilv~~e~-~~~~~~~-----p------ 308 (669)
++.+.|+.+|++|+|+|-+ |.+ |-|+ ..|..+|.+.|++|.--. |+.|+++ |
T Consensus 50 ~W~~~i~k~k~~Gln~IqtYVfWn~-Hep~~g~y~FsG~~DlvkFikl~~~~GLyv~LRiGPyIcaEw~~GG~P~wL~~~ 128 (649)
T KOG0496|consen 50 MWPDLIKKAKAGGLNVIQTYVFWNL-HEPSPGKYDFSGRYDLVKFIKLIHKAGLYVILRIGPYICAEWNFGGLPWWLRNV 128 (649)
T ss_pred hhHHHHHHHHhcCCceeeeeeeccc-ccCCCCcccccchhHHHHHHHHHHHCCeEEEecCCCeEEecccCCCcchhhhhC
Confidence 5778889999999999974 664 5543 379999999999996544 3333321 1
Q ss_pred -------CChHHHHHHHHHHHHHHHHhC-----CCcceEeecccCCcc
Q psy438 309 -------ATPTFLQSVRSEISQTVRRVQ-----HHPCIAVWAGNNEME 344 (669)
Q Consensus 309 -------~~~~~~~~~~~~~~~~v~r~r-----nHPSi~~W~~~NE~~ 344 (669)
+++.|.+.+.+=.+.+|.+.+ |===||+=-+-||-.
T Consensus 129 pg~~~Rt~nepfk~~~~~~~~~iv~~mk~L~~~qGGPIIl~QIENEYG 176 (649)
T KOG0496|consen 129 PGIVFRTDNEPFKAEMERWTTKIVPMMKKLFASQGGPIILVQIENEYG 176 (649)
T ss_pred CceEEecCChHHHHHHHHHHHHHHHHHHHHHhhcCCCEEEEEeechhh
Confidence 456666655544444444222 222355555667643
No 26
>PF07745 Glyco_hydro_53: Glycosyl hydrolase family 53; InterPro: IPR011683 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This domain is found in family 53 of the glycosyl hydrolase classification []. These enzymes are endo-1,4- beta-galactanases (3.2.1.89 from EC). The structure of this domain is known [] and has a TIM barrel fold.; GO: 0015926 glucosidase activity; PDB: 1HJQ_A 1HJS_A 1HJU_B 1FHL_A 1FOB_A 2GFT_A 1UR4_B 1UR0_A 1R8L_B 2CCR_A ....
Probab=93.13 E-value=0.57 Score=51.20 Aligned_cols=125 Identities=16% Similarity=0.192 Sum_probs=70.4
Q ss_pred HHHHHHHHHcCCCEEEc--cCC----CCCCcH---HHHHHHHHcCCEEEeecccccC-------CCCC-----C-hHHHH
Q psy438 258 RDLLVSTKEANMNMLRV--WGG----GVYMSD---YFYETCDELGILIWQDMMFACN-------NYPA-----T-PTFLQ 315 (669)
Q Consensus 258 ~~~l~~~k~~g~N~iR~--~~g----~~~~~~---~~~~~cDe~Gilv~~e~~~~~~-------~~p~-----~-~~~~~ 315 (669)
++.++++|+.|+|+||+ |.- |.+.-+ .....|.++||-|+-+|-++-. .-|. + .+..+
T Consensus 27 ~d~~~ilk~~G~N~vRlRvwv~P~~~g~~~~~~~~~~akrak~~Gm~vlldfHYSD~WaDPg~Q~~P~aW~~~~~~~l~~ 106 (332)
T PF07745_consen 27 KDLFQILKDHGVNAVRLRVWVNPYDGGYNDLEDVIALAKRAKAAGMKVLLDFHYSDFWADPGKQNKPAAWANLSFDQLAK 106 (332)
T ss_dssp --HHHHHHHTT--EEEEEE-SS-TTTTTTSHHHHHHHHHHHHHTT-EEEEEE-SSSS--BTTB-B--TTCTSSSHHHHHH
T ss_pred CCHHHHHHhcCCCeEEEEeccCCcccccCCHHHHHHHHHHHHHCCCeEEEeecccCCCCCCCCCCCCccCCCCCHHHHHH
Confidence 35688899999999975 431 233323 5667788899999999876321 1121 1 23455
Q ss_pred HHHHHHHHHHHHhCCC-cceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEe
Q psy438 316 SVRSEISQTVRRVQHH-PCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLT 387 (669)
Q Consensus 316 ~~~~~~~~~v~r~rnH-PSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~ 387 (669)
.+.+-.++.+..+++. -.+=++.+|||+..+....-....+.+ .+ ..+++.-.+.||+.+|+-.++.
T Consensus 107 ~v~~yT~~vl~~l~~~G~~pd~VQVGNEin~Gmlwp~g~~~~~~----~~-a~ll~ag~~AVr~~~p~~kV~l 174 (332)
T PF07745_consen 107 AVYDYTKDVLQALKAAGVTPDMVQVGNEINNGMLWPDGKPSNWD----NL-AKLLNAGIKAVREVDPNIKVML 174 (332)
T ss_dssp HHHHHHHHHHHHHHHTT--ESEEEESSSGGGESTBTTTCTT-HH----HH-HHHHHHHHHHHHTHSSTSEEEE
T ss_pred HHHHHHHHHHHHHHHCCCCccEEEeCccccccccCcCCCccCHH----HH-HHHHHHHHHHHHhcCCCCcEEE
Confidence 5666666677766654 345677899999776532101112222 22 2345666789999999766654
No 27
>PF13204 DUF4038: Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=92.95 E-value=0.57 Score=50.03 Aligned_cols=115 Identities=16% Similarity=0.134 Sum_probs=65.7
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCC--------------------------Cc-------HHHHHHHHHcCCEEEeecc
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVY--------------------------MS-------DYFYETCDELGILIWQDMM 301 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~--------------------------~~-------~~~~~~cDe~Gilv~~e~~ 301 (669)
+..+..|+..|+.|||.||+..-.+. +. |.-.+.+.++||.+-- ++
T Consensus 30 ~e~~~yL~~r~~qgFN~iq~~~l~~~~~~~~~n~~~~~~~~~~~~~~~d~~~~N~~YF~~~d~~i~~a~~~Gi~~~l-v~ 108 (289)
T PF13204_consen 30 EEWEQYLDTRKEQGFNVIQMNVLPQWDGYNTPNRYGFAPFPDEDPGQFDFTRPNPAYFDHLDRRIEKANELGIEAAL-VP 108 (289)
T ss_dssp HHHHHHHHHHHHTT--EEEEES-SSSS-B----TTS-BS-SSTT------TT----HHHHHHHHHHHHHHTT-EEEE-ES
T ss_pred HHHHHHHHHHHHCCCCEEEEEeCCCcccccccccCCCcCCCCCCccccCCCCCCHHHHHHHHHHHHHHHHCCCeEEE-EE
Confidence 35666788899999999997542110 11 2345678899998842 22
Q ss_pred cc-cC----CCCC--ChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHH
Q psy438 302 FA-CN----NYPA--TPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKP 374 (669)
Q Consensus 302 ~~-~~----~~p~--~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 374 (669)
+. +. .+.. ...-.++.+.-++-+|+|++.-|-|+ |+++||..... ... .+-+.+.+
T Consensus 109 ~wg~~~~~~~Wg~~~~~m~~e~~~~Y~~yv~~Ry~~~~Nvi-W~l~gd~~~~~-------~~~---------~~w~~~~~ 171 (289)
T PF13204_consen 109 FWGCPYVPGTWGFGPNIMPPENAERYGRYVVARYGAYPNVI-WILGGDYFDTE-------KTR---------ADWDAMAR 171 (289)
T ss_dssp S-HHHHH-------TTSS-HHHHHHHHHHHHHHHTT-SSEE-EEEESSS--TT-------SSH---------HHHHHHHH
T ss_pred EECCccccccccccccCCCHHHHHHHHHHHHHHHhcCCCCE-EEecCccCCCC-------cCH---------HHHHHHHH
Confidence 22 11 1111 11225667777889999999999887 99999981100 111 12367888
Q ss_pred HHHhhCCCCCEEe
Q psy438 375 IVLQYDPTRPYLT 387 (669)
Q Consensus 375 ~v~~~DptRp~~~ 387 (669)
.|++.||..+.+.
T Consensus 172 ~i~~~dp~~L~T~ 184 (289)
T PF13204_consen 172 GIKENDPYQLITI 184 (289)
T ss_dssp HHHHH--SS-EEE
T ss_pred HHHhhCCCCcEEE
Confidence 9999999775544
No 28
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=91.28 E-value=0.3 Score=50.17 Aligned_cols=73 Identities=15% Similarity=0.167 Sum_probs=50.1
Q ss_pred hHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhh-hhhhccCchhhhhhHHHHHHHHHHHHHHHhCCCCCeecCC
Q psy438 566 PTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQ-KWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSS 644 (669)
Q Consensus 566 ~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~L~~~vk~~DPTRp~~~sS 644 (669)
....+.+.+-++.+..|.++||.|+.|.+.||....... .| .......-..+...+.+.+|+.||+++++..+
T Consensus 99 ~~~~~~~~~~~~~la~~y~~~~~v~~~el~NEP~~~~~~~~w------~~~~~~~~~~~~~~~~~~Ir~~~~~~~i~~~~ 172 (281)
T PF00150_consen 99 DTAQAWFKSFWRALAKRYKDNPPVVGWELWNEPNGGNDDANW------NAQNPADWQDWYQRAIDAIRAADPNHLIIVGG 172 (281)
T ss_dssp HHHHHHHHHHHHHHHHHHTTTTTTEEEESSSSGCSTTSTTTT------SHHHTHHHHHHHHHHHHHHHHTTSSSEEEEEE
T ss_pred hhhHHHHHhhhhhhccccCCCCcEEEEEecCCccccCCcccc------ccccchhhhhHHHHHHHHHHhcCCcceeecCC
Confidence 444555666678889999999999999999998754210 11 01111222445666899999999999977555
No 29
>COG3534 AbfA Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]
Probab=91.08 E-value=2.2 Score=48.07 Aligned_cols=108 Identities=24% Similarity=0.390 Sum_probs=65.2
Q ss_pred HHHHHHHHHHcCCCEEEccCCCC--------------------------CC-----cHHHHHHHHHcCCEEEeecccccC
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGV--------------------------YM-----SDYFYETCDELGILIWQDMMFACN 305 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~--------------------------~~-----~~~~~~~cDe~Gilv~~e~~~~~~ 305 (669)
-.+.|+.+|++.+-.||.-||.- .+ -.+|+++|...|.-+..-...+..
T Consensus 51 RkDVle~lk~Lk~P~lR~PGGnFvs~Y~WeDGIGP~e~Rp~rldlaW~t~EtN~~Gt~EF~~~~e~iGaep~~avN~Gsr 130 (501)
T COG3534 51 RKDVLEALKDLKIPVLRWPGGNFVSGYHWEDGIGPREERPRRLDLAWGTTETNEFGTHEFMDWCELIGAEPYIAVNLGSR 130 (501)
T ss_pred HHHHHHHHHhcCCceeecCCcccccccccccCcCchhhCchhhcccccccccccccHHHHHHHHHHhCCceEEEEecCCc
Confidence 34567888999999999887622 11 158999999999866544433221
Q ss_pred CCCCChHHHHHHHHHHHHHHHHhCCCcc------------------eEeecccCCcchhhhhhhccccChhhHHHHHHHH
Q psy438 306 NYPATPTFLQSVRSEISQTVRRVQHHPC------------------IAVWAGNNEMEGATIQKWYIRENPELYYKEYAEL 367 (669)
Q Consensus 306 ~~p~~~~~~~~~~~~~~~~v~r~rnHPS------------------i~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (669)
.. ++++++| .+-|||. |=.||+|||+.+++. |.-...+ +|..
T Consensus 131 ----gv-------d~ar~~v-EY~n~pggtywsdlR~~~G~~~P~nvK~w~lGNEm~GpWq--~G~~~a~-----EY~~- 190 (501)
T COG3534 131 ----GV-------DEARNWV-EYCNHPGGTYWSDLRRENGREEPWNVKYWGLGNEMDGPWQ--CGHKTAP-----EYGR- 190 (501)
T ss_pred ----cH-------HHHHHHH-HHccCCCCChhHHHHHhcCCCCCcccceEEeccccCCCcc--cccccCH-----HHHH-
Confidence 11 2233333 2445553 779999999987642 2222222 2322
Q ss_pred HHHHHHHHHHhhCCCCC
Q psy438 368 YVNTLKPIVLQYDPTRP 384 (669)
Q Consensus 368 ~~~~l~~~v~~~DptRp 384 (669)
|++.-.+..+-.||+--
T Consensus 191 ~A~e~~k~~k~~d~t~e 207 (501)
T COG3534 191 LANEYRKYMKYFDPTIE 207 (501)
T ss_pred HHHHHHHHHhhcCcccc
Confidence 44555667788888754
No 30
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=90.44 E-value=0.89 Score=49.98 Aligned_cols=81 Identities=23% Similarity=0.418 Sum_probs=56.6
Q ss_pred CCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchh-----------------------h-hhhhhhc---------
Q psy438 563 PATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGA-----------------------T-IQKWYIR--------- 609 (669)
Q Consensus 563 p~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~-----------------------~-~~~~~~~--------- 609 (669)
+.+|.+++.+++-++.+++|.++||.|+.|.+.||.... . ...|+..
T Consensus 103 ~~~p~yr~~~~~~~~~l~~~y~~~p~vi~~~i~NE~~~~~~~~~~~~~~f~~wLk~kY~ti~~LN~aWgt~~ws~~~~~f 182 (374)
T PF02449_consen 103 PNSPAYREYARRFIRALAERYGDHPAVIGWQIDNEPGYHRCYSPACQAAFRQWLKEKYGTIEALNRAWGTAFWSQRYSSF 182 (374)
T ss_dssp CCHHHHHHHHHHHHHHHHHHHTTTTTEEEEEECCSTTCTS--SHHHHHHHHHHHHHHHSSHHHHHHHHTTTGGG---SSG
T ss_pred hhHHHHHHHHHHHHHHHHhhccccceEEEEEeccccCcCcCCChHHHHHHHHHHHHHhCCHHHHHHHHcCCcccCccCcH
Confidence 346889999999999999999999999999999996431 0 0122110
Q ss_pred -----c-------Cc------hhhhhhHHHHHHHHHHHHHHHhCCCCCeecC
Q psy438 610 -----E-------NP------ELYYKEYAELYVNTLKPIVLQYDPTRPYLTS 643 (669)
Q Consensus 610 -----~-------~~------~~~~~~~~~~~~~~L~~~vk~~DPTRp~~~s 643 (669)
+ ++ ..|..+....+.+.+++++|+.||.+|++.+
T Consensus 183 ~~v~~P~~~~~~~~~~~~~D~~rF~~~~~~~~~~~~~~~ir~~~p~~~vt~n 234 (374)
T PF02449_consen 183 DEVPPPRPTSSPENPAQWLDWYRFQSDRVAEFFRWQADIIREYDPDHPVTTN 234 (374)
T ss_dssp GG---S-S-SS---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHSTT-EEE-E
T ss_pred HhcCCCCCCCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeC
Confidence 0 10 1223334467778899999999999999955
No 31
>COG5309 Exo-beta-1,3-glucanase [Carbohydrate transport and metabolism]
Probab=90.37 E-value=1.7 Score=46.00 Aligned_cols=118 Identities=12% Similarity=0.089 Sum_probs=88.7
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCCc-HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHHHHHHhCCCcc
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYMS-DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRRVQHHPC 333 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~~-~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v~r~rnHPS 333 (669)
+.++.+|++++.... +||+.| .-+.. +..+-++...||-|...... .++....+.+++...++...--++
T Consensus 63 ~~~~sDLe~l~~~t~-~IR~Y~-sDCn~le~v~pAa~~~g~kv~lGiw~-------tdd~~~~~~~til~ay~~~~~~d~ 133 (305)
T COG5309 63 DQVASDLELLASYTH-SIRTYG-SDCNTLENVLPAAEASGFKVFLGIWP-------TDDIHDAVEKTILSAYLPYNGWDD 133 (305)
T ss_pred HHHHhHHHHhccCCc-eEEEee-ccchhhhhhHHHHHhcCceEEEEEee-------ccchhhhHHHHHHHHHhccCCCCc
Confidence 468889999999888 999988 44443 78999999999999877653 223455666688889999999999
Q ss_pred eEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEEecCCCC
Q psy438 334 IAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPTN 392 (669)
Q Consensus 334 i~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~~~S~~~ 392 (669)
|..-..|||.... +|...+. ...++...+..|++.+-+-|++..-+++
T Consensus 134 v~~v~VGnEal~r---~~~tasq--------l~~~I~~vrsav~~agy~gpV~T~dsw~ 181 (305)
T COG5309 134 VTTVTVGNEALNR---NDLTASQ--------LIEYIDDVRSAVKEAGYDGPVTTVDSWN 181 (305)
T ss_pred eEEEEechhhhhc---CCCCHHH--------HHHHHHHHHHHHHhcCCCCceeecccce
Confidence 9999999997542 2222111 1235678888999999899988876554
No 32
>TIGR03356 BGL beta-galactosidase.
Probab=89.43 E-value=1.5 Score=49.50 Aligned_cols=88 Identities=16% Similarity=0.070 Sum_probs=63.4
Q ss_pred HHHHHHHHHHHcCCCEEEccC--------C-CCCCc------HHHHHHHHHcCCEEEeecc-----cccC--CCCCChHH
Q psy438 256 TIRDLLVSTKEANMNMLRVWG--------G-GVYMS------DYFYETCDELGILIWQDMM-----FACN--NYPATPTF 313 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~--------g-~~~~~------~~~~~~cDe~Gilv~~e~~-----~~~~--~~p~~~~~ 313 (669)
.++.+|+++|++|+|++|+-. | +.+.. +.+++.|-+.||-++..+- .... +--.++++
T Consensus 55 ~y~eDi~l~~~~G~~~~R~si~Wsri~p~g~~~~n~~~~~~y~~~i~~l~~~gi~pivtL~Hfd~P~~l~~~gGw~~~~~ 134 (427)
T TIGR03356 55 RYEEDVALMKELGVDAYRFSIAWPRIFPEGTGPVNPKGLDFYDRLVDELLEAGIEPFVTLYHWDLPQALEDRGGWLNRDT 134 (427)
T ss_pred hHHHHHHHHHHcCCCeEEcccchhhcccCCCCCcCHHHHHHHHHHHHHHHHcCCeeEEeeccCCccHHHHhcCCCCChHH
Confidence 688999999999999999832 0 11111 3578899999999887663 1100 11134667
Q ss_pred HHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 314 LQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 314 ~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
.+.+++-++.+++++.. .|-.|..-||+..
T Consensus 135 ~~~f~~ya~~~~~~~~d--~v~~w~t~NEp~~ 164 (427)
T TIGR03356 135 AEWFAEYAAVVAERLGD--RVKHWITLNEPWC 164 (427)
T ss_pred HHHHHHHHHHHHHHhCC--cCCEEEEecCcce
Confidence 77788888899999988 5778999999863
No 33
>COG3867 Arabinogalactan endo-1,4-beta-galactosidase [Carbohydrate transport and metabolism]
Probab=83.31 E-value=11 Score=40.73 Aligned_cols=125 Identities=21% Similarity=0.234 Sum_probs=70.1
Q ss_pred HHHHHHHHHHcCCCEEEc--cC-----------CCCCCcH---HHHHHHHHcCCEEEeecccccC-CCCC---Ch-HH--
Q psy438 257 IRDLLVSTKEANMNMLRV--WG-----------GGVYMSD---YFYETCDELGILIWQDMMFACN-NYPA---TP-TF-- 313 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~--~~-----------g~~~~~~---~~~~~cDe~Gilv~~e~~~~~~-~~p~---~~-~~-- 313 (669)
-.+.++.+|++|+|.||+ |. ||-..-. +.-..|..+||-|..||-.+.. .-|. .| +|
T Consensus 65 ~qD~~~iLK~~GvNyvRlRvwndP~dsngn~yggGnnD~~k~ieiakRAk~~GmKVl~dFHYSDfwaDPakQ~kPkaW~~ 144 (403)
T COG3867 65 RQDALQILKNHGVNYVRLRVWNDPYDSNGNGYGGGNNDLKKAIEIAKRAKNLGMKVLLDFHYSDFWADPAKQKKPKAWEN 144 (403)
T ss_pred HHHHHHHHHHcCcCeEEEEEecCCccCCCCccCCCcchHHHHHHHHHHHHhcCcEEEeeccchhhccChhhcCCcHHhhh
Confidence 455688999999999875 43 3222212 4555677799999999865321 0011 11 12
Q ss_pred ------HHHHHHHHHHHHHHhCCCcce-EeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEE
Q psy438 314 ------LQSVRSEISQTVRRVQHHPCI-AVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYL 386 (669)
Q Consensus 314 ------~~~~~~~~~~~v~r~rnHPSi-~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~ 386 (669)
.+.+-+-.++.+...++---. =|=-+|||..++....|.+..+.. ...+++ +.-...|++.||+..+.
T Consensus 145 l~fe~lk~avy~yTk~~l~~m~~eGi~pdmVQVGNEtn~gflwp~Ge~~~f~----k~a~L~-n~g~~avrev~p~ikv~ 219 (403)
T COG3867 145 LNFEQLKKAVYSYTKYVLTTMKKEGILPDMVQVGNETNGGFLWPDGEGRNFD----KMAALL-NAGIRAVREVSPTIKVA 219 (403)
T ss_pred cCHHHHHHHHHHHHHHHHHHHHHcCCCccceEeccccCCceeccCCCCcChH----HHHHHH-HHHhhhhhhcCCCceEE
Confidence 122223333444444333222 233579999887754444322332 233455 45556799999988765
No 34
>PRK13511 6-phospho-beta-galactosidase; Provisional
Probab=82.99 E-value=4.6 Score=46.20 Aligned_cols=88 Identities=13% Similarity=0.118 Sum_probs=62.8
Q ss_pred HHHHHHHHHHHcCCCEEEcc---------CCCCCCc------HHHHHHHHHcCCEEEeec-----ccccC--CCCCChHH
Q psy438 256 TIRDLLVSTKEANMNMLRVW---------GGGVYMS------DYFYETCDELGILIWQDM-----MFACN--NYPATPTF 313 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~---------~g~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~--~~p~~~~~ 313 (669)
.++.|+++||++|+|+.|+- |-|.... +.+.+.|-++||-.+.-+ |.+-. +--.+++.
T Consensus 55 ry~eDi~L~~~lG~~~yRfSIsWsRI~P~G~g~vN~~gl~~Y~~lid~l~~~GI~P~VTL~H~dlP~~L~~~GGW~n~~~ 134 (469)
T PRK13511 55 RYPEDLKLAEEFGVNGIRISIAWSRIFPDGYGEVNPKGVEYYHRLFAECHKRHVEPFVTLHHFDTPEALHSNGDWLNREN 134 (469)
T ss_pred hhHHHHHHHHHhCCCEEEeeccHhhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEecCCCCcHHHHHcCCCCCHHH
Confidence 68899999999999999972 2112211 357789999999876644 32211 11235666
Q ss_pred HHHHHHHHHHHHHHhCCCcceEeecccCCcchh
Q psy438 314 LQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGA 346 (669)
Q Consensus 314 ~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~ 346 (669)
.+.+.+-++-+++++.. |=.|...||+...
T Consensus 135 v~~F~~YA~~~~~~fgd---Vk~W~T~NEP~~~ 164 (469)
T PRK13511 135 IDHFVRYAEFCFEEFPE---VKYWTTFNEIGPI 164 (469)
T ss_pred HHHHHHHHHHHHHHhCC---CCEEEEccchhhh
Confidence 77777777888899887 8999999998643
No 35
>PRK15014 6-phospho-beta-glucosidase BglA; Provisional
Probab=79.81 E-value=6.3 Score=45.24 Aligned_cols=87 Identities=10% Similarity=0.050 Sum_probs=63.3
Q ss_pred HHHHHHHHHHHcCCCEEEccC----------CCCCCc------HHHHHHHHHcCCEEEeec-----ccccC---CCCCCh
Q psy438 256 TIRDLLVSTKEANMNMLRVWG----------GGVYMS------DYFYETCDELGILIWQDM-----MFACN---NYPATP 311 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~----------g~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~---~~p~~~ 311 (669)
.++.||++||++|+|+.|+-. ++.... +.+.+.|-+.||-++..+ |.+-. +-..++
T Consensus 70 ry~EDI~Lm~elG~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~l~~~GI~P~vTL~H~dlP~~L~~~yGGW~n~ 149 (477)
T PRK15014 70 HYKEDIKLFAEMGFKCFRTSIAWTRIFPKGDEAQPNEEGLKFYDDMFDELLKYNIEPVITLSHFEMPLHLVQQYGSWTNR 149 (477)
T ss_pred ccHHHHHHHHHcCCCEEEecccceeeccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEeeCCCCCHHHHHhcCCCCCh
Confidence 688999999999999999721 112111 357789999999887644 43211 112456
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcc
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEME 344 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~ 344 (669)
+..+.+.+-++-+++++... |-.|...||+.
T Consensus 150 ~~~~~F~~Ya~~~f~~fgdr--Vk~WiT~NEp~ 180 (477)
T PRK15014 150 KVVDFFVRFAEVVFERYKHK--VKYWMTFNEIN 180 (477)
T ss_pred HHHHHHHHHHHHHHHHhcCc--CCEEEEecCcc
Confidence 77778888888899999887 77999999985
No 36
>PF14488 DUF4434: Domain of unknown function (DUF4434)
Probab=77.41 E-value=4.8 Score=39.71 Aligned_cols=94 Identities=19% Similarity=0.336 Sum_probs=59.5
Q ss_pred eeEeeccccCccceEehhhhhhhhcCCC-CChHHH-HHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhhhhhhccCch
Q psy438 536 LYWQLNDTCDELGILIWQDMMFACNNYP-ATPTFL-QSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPE 613 (669)
Q Consensus 536 LyWq~nd~~D~~GilvW~E~~~~~~~yp-~~~~f~-~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~ 613 (669)
++-.+.++||+.||-|+-.+.+...... .+.... +..+.-+.|...+..+|||+.-|=+..|+.... |
T Consensus 66 ~l~~~L~~A~~~Gmkv~~Gl~~~~~~w~~~~~~~~~~~~~~v~~el~~~yg~h~sf~GWYip~E~~~~~---~------- 135 (166)
T PF14488_consen 66 LLEMILDAADKYGMKVFVGLYFDPDYWDQGDLDWEAERNKQVADELWQRYGHHPSFYGWYIPYEIDDYN---W------- 135 (166)
T ss_pred HHHHHHHHHHHcCCEEEEeCCCCchhhhccCHHHHHHHHHHHHHHHHHHHcCCCCCceEEEecccCCcc---c-------
Confidence 3445668888888888765553321111 222222 112223355567899999999999999997541 0
Q ss_pred hhhhhHHHHHHHHHHHHHHHhCCCCCeecCC
Q psy438 614 LYYKEYAELYVNTLKPIVLQYDPTRPYLTSS 644 (669)
Q Consensus 614 ~~~~~~~~~~~~~L~~~vk~~DPTRp~~~sS 644 (669)
......+.|...++++.|..|+.-|.
T Consensus 136 -----~~~~~~~~l~~~lk~~s~~~Pv~ISp 161 (166)
T PF14488_consen 136 -----NAPERFALLGKYLKQISPGKPVMISP 161 (166)
T ss_pred -----chHHHHHHHHHHHHHhCCCCCeEEec
Confidence 01233455889999999999988554
No 37
>COG2730 BglC Endoglucanase [Carbohydrate transport and metabolism]
Probab=76.60 E-value=12 Score=41.90 Aligned_cols=89 Identities=17% Similarity=0.172 Sum_probs=60.9
Q ss_pred HHHHHHHHHHcCCCEEEccCC----CCCC---c-----------HHHHHHHHHcCCEEEeecc-cc----cCC-------
Q psy438 257 IRDLLVSTKEANMNMLRVWGG----GVYM---S-----------DYFYETCDELGILIWQDMM-FA----CNN------- 306 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g----~~~~---~-----------~~~~~~cDe~Gilv~~e~~-~~----~~~------- 306 (669)
.+.++..+|++|||+||+--| .+.. + ++..+.|.++||.|+.++- .. |.+
T Consensus 75 ~~~~~~~ik~~G~n~VRiPi~~~~~~~~~~~~p~~~~~~~~~~ld~~I~~a~~~gi~V~iD~H~~~~~~~~~~~s~~~~~ 154 (407)
T COG2730 75 TEEDFDQIKSAGFNAVRIPIGYWALQATDGDNPYLIGLTQLKILDEAINWAKKLGIYVLIDLHGYPGGNNGHEHSGYTSD 154 (407)
T ss_pred hhhHHHHHHHcCCcEEEcccchhhhhccCCCCCCeecchHHHHHHHHHHHHHhcCeeEEEEecccCCCCCCcCccccccc
Confidence 466788899999999997443 2221 1 2346778899999999952 22 111
Q ss_pred CCCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 307 YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 307 ~p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
|....+-.+...+-.+.+..|+++-+-|+...+-||+.+
T Consensus 155 ~~~~~~~~~~~~~~w~~ia~~f~~~~~VIg~~~~NEP~~ 193 (407)
T COG2730 155 YKEENENVEATIDIWKFIANRFKNYDTVIGFELINEPNG 193 (407)
T ss_pred ccccchhHHHHHHHHHHHHHhccCCCceeeeeeecCCcc
Confidence 111112234455666788999999888998999999975
No 38
>TIGR01233 lacG 6-phospho-beta-galactosidase. This enzyme is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation.
Probab=74.97 E-value=11 Score=43.09 Aligned_cols=88 Identities=11% Similarity=0.066 Sum_probs=61.4
Q ss_pred HHHHHHHHHHHcCCCEEEccC---------CCCCCc------HHHHHHHHHcCCEEEeec-----ccccC--CCCCChHH
Q psy438 256 TIRDLLVSTKEANMNMLRVWG---------GGVYMS------DYFYETCDELGILIWQDM-----MFACN--NYPATPTF 313 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~---------g~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~--~~p~~~~~ 313 (669)
.++.|+++||++|+|+.|+-. -+.... +.+.+.|-+.||-.+.-+ |.+-. +.-.+++.
T Consensus 54 ry~eDi~L~~~lG~~~yRfSIsWsRI~P~g~~~~N~~gl~~Y~~lid~l~~~GI~P~VTL~H~dlP~~L~~~GGW~n~~~ 133 (467)
T TIGR01233 54 KYPVDLELAEEYGVNGIRISIAWSRIFPTGYGEVNEKGVEFYHKLFAECHKRHVEPFVTLHHFDTPEALHSNGDFLNREN 133 (467)
T ss_pred hHHHHHHHHHHcCCCEEEEecchhhccCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEeccCCCCcHHHHHcCCCCCHHH
Confidence 688999999999999999721 112211 345678899999876643 32111 11235666
Q ss_pred HHHHHHHHHHHHHHhCCCcceEeecccCCcchh
Q psy438 314 LQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGA 346 (669)
Q Consensus 314 ~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~ 346 (669)
.+.+.+-++-+++++. . |=.|...||+...
T Consensus 134 v~~F~~YA~~~f~~fg-d--Vk~WiT~NEP~~~ 163 (467)
T TIGR01233 134 IEHFIDYAAFCFEEFP-E--VNYWTTFNEIGPI 163 (467)
T ss_pred HHHHHHHHHHHHHHhC-C--CCEEEEecchhhh
Confidence 6777777788899997 3 8899999998643
No 39
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=72.43 E-value=20 Score=37.39 Aligned_cols=69 Identities=20% Similarity=0.263 Sum_probs=41.8
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEE
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYL 386 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~ 386 (669)
+..+.+.+-+++.+.|++.- |..|.+.||+......++.. ..|.......|+....+++++.||.-...
T Consensus 54 ~~~~~~~~~i~~v~~ry~g~--i~~wdV~NE~~~~~~~~~~~----~~w~~~~G~~~i~~af~~ar~~~P~a~l~ 122 (254)
T smart00633 54 TLLARLENHIKTVVGRYKGK--IYAWDVVNEALHDNGSGLRR----SVWYQILGEDYIEKAFRYAREADPDAKLF 122 (254)
T ss_pred HHHHHHHHHHHHHHHHhCCc--ceEEEEeeecccCCCccccc----chHHHhcChHHHHHHHHHHHHhCCCCEEE
Confidence 55667778888889988744 88899999975421000000 01111111235567778999999964443
No 40
>COG3934 Endo-beta-mannanase [Carbohydrate transport and metabolism]
Probab=70.78 E-value=4 Score=46.48 Aligned_cols=97 Identities=18% Similarity=0.166 Sum_probs=66.4
Q ss_pred eccccCccceEehhhhhhhhcC----CCCChHHHHHHHHHHHHHHHHhcCCceEEEEcCCCCCchhhhhhhhhccCchhh
Q psy438 540 LNDTCDELGILIWQDMMFACNN----YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELY 615 (669)
Q Consensus 540 ~nd~~D~~GilvW~E~~~~~~~----yp~~~~f~~~~~~El~e~IrR~rNHPSIvlWs~gNE~~~~~~~~~~~~~~~~~~ 615 (669)
+..-|+.+|.+-|. ++++... .-+++.++.+.++=+..+|.-.+-||.|.-|.+.||.. .. .....+.+
T Consensus 87 livg~~hmgg~Nw~-Ipwag~~~pdn~iyD~k~~~~~kkyvedlVk~yk~~ptI~gw~l~Ne~l-v~-----~p~s~N~f 159 (587)
T COG3934 87 LIVGLKHMGGTNWR-IPWAGEQSPDNVIYDPKFRGPGKKYVEDLVKPYKLDPTIAGWALRNEPL-VE-----APISVNNF 159 (587)
T ss_pred EeecccccCcceeE-eecCCCCCccccccchhhcccHHHHHHHHhhhhccChHHHHHHhcCCcc-cc-----ccCChhHH
Confidence 44557777777773 4434222 23478899999999999999999999999999999932 10 00111111
Q ss_pred hhhHHHHHHHHHHHHHHHhCCCCCeecC---CCCCC
Q psy438 616 YKEYAELYVNTLKPIVLQYDPTRPYLTS---SPTNG 648 (669)
Q Consensus 616 ~~~~~~~~~~~L~~~vk~~DPTRp~~~s---S~~~g 648 (669)
-.....|...+|.+||...+..+ |||.+
T Consensus 160 -----~~w~~emy~yiK~ldd~hlvsvGD~~sp~~~ 190 (587)
T COG3934 160 -----WDWSGEMYAYIKWLDDGHLVSVGDPASPWPQ 190 (587)
T ss_pred -----HHHHHHHHHHhhccCCCCeeecCCcCCcccc
Confidence 33456799999999999998844 55443
No 41
>PF02638 DUF187: Glycosyl hydrolase like GH101; InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=68.13 E-value=85 Score=34.00 Aligned_cols=132 Identities=17% Similarity=0.277 Sum_probs=74.5
Q ss_pred hHHHHHHHHHHHHcCCCEE----EccCCCCCCc----------------------HHHHHHHHHcCCEEEeeccc--ccC
Q psy438 254 ESTIRDLLVSTKEANMNML----RVWGGGVYMS----------------------DYFYETCDELGILIWQDMMF--ACN 305 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~i----R~~~g~~~~~----------------------~~~~~~cDe~Gilv~~e~~~--~~~ 305 (669)
++.++..|+.++++|||+| |..|-+.|++ +.+.+.|.+.||-|-.=+.+ ...
T Consensus 18 ~~~~~~~l~~l~~~~~N~V~~qVr~~gda~Y~S~~~p~s~~~~g~~~~~pg~DpL~~~I~eaHkrGlevHAW~~~~~~~~ 97 (311)
T PF02638_consen 18 KEQIDEMLDDLKSAGFNAVFVQVRPRGDALYPSDIEPWSGYLTGKQGKDPGFDPLEFMIEEAHKRGLEVHAWFRVGFNAP 97 (311)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEEEeCcEEEecccccccccccCCCCCCCCCccHHHHHHHHHHHcCCEEEEEEEeecCCC
Confidence 3578889999999999998 4443222221 47899999999988544321 100
Q ss_pred C-------C--------------------------CCChHHHHHHHHHHHHHHHHhC---------CCcceEeecccCCc
Q psy438 306 N-------Y--------------------------PATPTFLQSVRSEISQTVRRVQ---------HHPCIAVWAGNNEM 343 (669)
Q Consensus 306 ~-------~--------------------------p~~~~~~~~~~~~~~~~v~r~r---------nHPSi~~W~~~NE~ 343 (669)
. . |..|+-.+-+.+-++++|+++- --|.. -+ +-+..
T Consensus 98 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~lnP~~PeVr~~i~~~v~Eiv~~YdvDGIhlDdy~yp~~-~~-g~~~~ 175 (311)
T PF02638_consen 98 DVSHILKKHPEWFAVNHPGWVRTYEDANGGYYWLNPGHPEVRDYIIDIVKEIVKNYDVDGIHLDDYFYPPP-SF-GYDFP 175 (311)
T ss_pred chhhhhhcCchhheecCCCceeecccCCCCceEECCCCHHHHHHHHHHHHHHHhcCCCCeEEecccccccc-cC-CCCCc
Confidence 0 0 3455555555555566665542 11111 01 11111
Q ss_pred chhhhhhhcc---ccChh-----hHHHHHHHHHHHHHHHHHHhhCCCCCEEe
Q psy438 344 EGATIQKWYI---RENPE-----LYYKEYAELYVNTLKPIVLQYDPTRPYLT 387 (669)
Q Consensus 344 ~~~~~~~~~~---~~~~~-----~~~~~~~~~~~~~l~~~v~~~DptRp~~~ 387 (669)
.......... ...+. .|+++-...|++.+.+.+|+.+|.-.+..
T Consensus 176 ~~~~y~~~~g~~~~~~~~d~~W~~WRr~~I~~~V~~i~~~ik~~kP~v~~si 227 (311)
T PF02638_consen 176 DVAAYEKYTGKDPFSSPEDDAWTQWRRDNINNFVKRIYDAIKAIKPWVKFSI 227 (311)
T ss_pred cHHHHHHhcCcCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHhCCCCeEEE
Confidence 1111111110 12233 37777778899999999999999766544
No 42
>PLN02849 beta-glucosidase
Probab=65.84 E-value=21 Score=41.29 Aligned_cols=88 Identities=15% Similarity=0.194 Sum_probs=60.5
Q ss_pred HHHHHHHHHHHcCCCEEEcc---------CCCCCCc------HHHHHHHHHcCCEEEeec-----ccccCC-C--CCChH
Q psy438 256 TIRDLLVSTKEANMNMLRVW---------GGGVYMS------DYFYETCDELGILIWQDM-----MFACNN-Y--PATPT 312 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~---------~g~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~~-~--p~~~~ 312 (669)
.++.||++||++|+|+.|+- |.|.... +.+.+.|-+.||-.+.-+ |.+-.. | -.+++
T Consensus 80 rY~eDI~Lm~~lG~~aYRfSIsWsRI~P~G~g~vN~~gl~fY~~lid~l~~~GI~P~VTL~H~dlP~~L~~~yGGW~nr~ 159 (503)
T PLN02849 80 KYKEDVKLMVETGLDAFRFSISWSRLIPNGRGSVNPKGLQFYKNFIQELVKHGIEPHVTLFHYDHPQYLEDDYGGWINRR 159 (503)
T ss_pred hHHHHHHHHHHcCCCeEEEeccHHhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCeEEEeecCCCCcHHHHHhcCCcCCch
Confidence 78999999999999999973 1122222 357789999999876643 332111 1 13455
Q ss_pred HHHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 313 FLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 313 ~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
..+.+.+-++-.++++... |=.|...||+..
T Consensus 160 ~v~~F~~YA~~~f~~fgDr--Vk~WiT~NEP~~ 190 (503)
T PLN02849 160 IIKDFTAYADVCFREFGNH--VKFWTTINEANI 190 (503)
T ss_pred HHHHHHHHHHHHHHHhcCc--CCEEEEecchhh
Confidence 6666677777788888764 678999999864
No 43
>PRK09593 arb 6-phospho-beta-glucosidase; Reviewed
Probab=65.29 E-value=26 Score=40.29 Aligned_cols=88 Identities=10% Similarity=0.052 Sum_probs=59.4
Q ss_pred HHHHHHHHHHHcCCCEEEccC----------CCCCCc------HHHHHHHHHcCCEEEeec-----ccccC---CCCCCh
Q psy438 256 TIRDLLVSTKEANMNMLRVWG----------GGVYMS------DYFYETCDELGILIWQDM-----MFACN---NYPATP 311 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~----------g~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~---~~p~~~ 311 (669)
.++.||++||++|+|+.|+-. ++...+ +.+.+.|-+.||-.+.-+ |.+-. +--.++
T Consensus 74 ry~eDi~Lm~~lG~~aYRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lId~L~~~GI~P~VTL~H~dlP~~L~~~~GGW~n~ 153 (478)
T PRK09593 74 HYKEDIALFAEMGFKTYRMSIAWTRIFPKGDELEPNEAGLQFYEDIFKECHKYGIEPLVTITHFDCPMHLIEEYGGWRNR 153 (478)
T ss_pred hhHHHHHHHHHcCCCEEEEecchhhcccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEecccCCCHHHHhhcCCCCCh
Confidence 688999999999999999721 111111 357788999999776533 43211 112345
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
+..+.+.+-++-.++++... |=.|...||+..
T Consensus 154 ~~v~~F~~YA~~~~~~fgdr--Vk~WiT~NEP~~ 185 (478)
T PRK09593 154 KMVGFYERLCRTLFTRYKGL--VKYWLTFNEINM 185 (478)
T ss_pred HHHHHHHHHHHHHHHHhcCc--CCEEEeecchhh
Confidence 55666666667778887764 678999999764
No 44
>PRK09852 cryptic 6-phospho-beta-glucosidase; Provisional
Probab=65.19 E-value=24 Score=40.57 Aligned_cols=88 Identities=10% Similarity=0.082 Sum_probs=61.3
Q ss_pred HHHHHHHHHHHcCCCEEEccC----------CCCCCc------HHHHHHHHHcCCEEEeec-----ccccC---CCCCCh
Q psy438 256 TIRDLLVSTKEANMNMLRVWG----------GGVYMS------DYFYETCDELGILIWQDM-----MFACN---NYPATP 311 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~----------g~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~---~~p~~~ 311 (669)
.++.++++||++|+|+.|+-. ++.... +.+.+.|-++||-++..+ |.+-. +--.++
T Consensus 72 ry~eDi~l~~~lG~~~yR~si~WsRi~P~g~~~~~n~~~~~~Y~~~i~~l~~~gi~p~VtL~H~~~P~~l~~~~GGW~~~ 151 (474)
T PRK09852 72 RYKEDIALMAEMGFKVFRTSIAWSRLFPQGDELTPNQQGIAFYRSVFEECKKYGIEPLVTLCHFDVPMHLVTEYGSWRNR 151 (474)
T ss_pred hhHHHHHHHHHcCCCeEEeeceeeeeeeCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEeeCCCCCHHHHHhcCCCCCH
Confidence 688999999999999999721 111111 356779999999876644 32211 112356
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
+..+.+.+-++-.++++..+ |=.|...||+..
T Consensus 152 ~~~~~F~~ya~~~~~~fgd~--Vk~WiTfNEPn~ 183 (474)
T PRK09852 152 KMVEFFSRYARTCFEAFDGL--VKYWLTFNEINI 183 (474)
T ss_pred HHHHHHHHHHHHHHHHhcCc--CCeEEeecchhh
Confidence 67777777788888998876 568999999864
No 45
>PLN02814 beta-glucosidase
Probab=65.02 E-value=27 Score=40.49 Aligned_cols=89 Identities=13% Similarity=0.179 Sum_probs=60.5
Q ss_pred HHHHHHHHHHHcCCCEEEcc---------CCCCCCc------HHHHHHHHHcCCEEEe-----ecccccCC-C--CCChH
Q psy438 256 TIRDLLVSTKEANMNMLRVW---------GGGVYMS------DYFYETCDELGILIWQ-----DMMFACNN-Y--PATPT 312 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~---------~g~~~~~------~~~~~~cDe~Gilv~~-----e~~~~~~~-~--p~~~~ 312 (669)
.++.||++||++|+|+.|+- |-|.... +.+.+.|-++||-.+. |+|.+-.. | -.+++
T Consensus 78 ry~EDI~L~k~lG~~ayRfSIsWsRI~P~G~g~~N~~Gl~fY~~lId~l~~~GI~P~VTL~H~dlP~~L~~~yGGW~n~~ 157 (504)
T PLN02814 78 KYKEDVKLMAEMGLESFRFSISWSRLIPNGRGLINPKGLLFYKNLIKELRSHGIEPHVTLYHYDLPQSLEDEYGGWINRK 157 (504)
T ss_pred hhHHHHHHHHHcCCCEEEEeccHhhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCceEEEecCCCCCHHHHHhcCCcCChh
Confidence 68899999999999999972 2112211 3577899999997654 33432111 1 13455
Q ss_pred HHHHHHHHHHHHHHHhCCCcceEeecccCCcchh
Q psy438 313 FLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGA 346 (669)
Q Consensus 313 ~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~ 346 (669)
..+.+.+-++-+++++... |=.|...||+...
T Consensus 158 ~i~~F~~YA~~~f~~fgdr--Vk~WiT~NEP~~~ 189 (504)
T PLN02814 158 IIEDFTAFADVCFREFGED--VKLWTTINEATIF 189 (504)
T ss_pred HHHHHHHHHHHHHHHhCCc--CCEEEeccccchh
Confidence 6666666777788888764 6789999998643
No 46
>PRK09589 celA 6-phospho-beta-glucosidase; Reviewed
Probab=64.01 E-value=27 Score=40.17 Aligned_cols=88 Identities=13% Similarity=0.100 Sum_probs=59.6
Q ss_pred HHHHHHHHHHHcCCCEEEccC----------CCCCCc------HHHHHHHHHcCCEEEee-----cccccC---CCCCCh
Q psy438 256 TIRDLLVSTKEANMNMLRVWG----------GGVYMS------DYFYETCDELGILIWQD-----MMFACN---NYPATP 311 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~----------g~~~~~------~~~~~~cDe~Gilv~~e-----~~~~~~---~~p~~~ 311 (669)
.++.||++||++|+|+.|+-. ++...+ +.+.+.|-+.||-.+.- +|.+-. +--.++
T Consensus 68 ry~eDi~Lm~~lG~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~L~~~GI~P~VTL~H~dlP~~L~~~yGGW~n~ 147 (476)
T PRK09589 68 RYKEDIALFAEMGFKCFRTSIAWTRIFPQGDELEPNEEGLQFYDDLFDECLKQGIEPVVTLSHFEMPYHLVTEYGGWRNR 147 (476)
T ss_pred hhHHHHHHHHHcCCCEEEeccchhhcCcCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEecCCCCCHHHHHhcCCcCCh
Confidence 689999999999999999721 111111 35778999999977653 343211 112345
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
+..+.+.+-++-+++++... |=.|...||+..
T Consensus 148 ~~i~~F~~YA~~~f~~fgdr--Vk~WiT~NEp~~ 179 (476)
T PRK09589 148 KLIDFFVRFAEVVFTRYKDK--VKYWMTFNEINN 179 (476)
T ss_pred HHHHHHHHHHHHHHHHhcCC--CCEEEEecchhh
Confidence 56666666777788888654 568999999753
No 47
>PLN02998 beta-glucosidase
Probab=62.53 E-value=28 Score=40.31 Aligned_cols=88 Identities=11% Similarity=0.145 Sum_probs=59.8
Q ss_pred HHHHHHHHHHHcCCCEEEccC---------CCCCCc------HHHHHHHHHcCCEEEe-----ecccccCC-C--CCChH
Q psy438 256 TIRDLLVSTKEANMNMLRVWG---------GGVYMS------DYFYETCDELGILIWQ-----DMMFACNN-Y--PATPT 312 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~---------g~~~~~------~~~~~~cDe~Gilv~~-----e~~~~~~~-~--p~~~~ 312 (669)
.++.||++||++|+|+.|+-. -|.... +.+.+.|-+.||-.+. |+|.+-.. | -.+++
T Consensus 83 ry~EDi~lmk~lG~~~YRfSIsWsRI~P~G~g~vN~~gl~~Y~~lid~L~~~GIeP~VTL~H~dlP~~L~~~yGGW~n~~ 162 (497)
T PLN02998 83 KYKEDVKLMADMGLEAYRFSISWSRLLPSGRGPINPKGLQYYNNLIDELITHGIQPHVTLHHFDLPQALEDEYGGWLSQE 162 (497)
T ss_pred hhHHHHHHHHHcCCCeEEeeccHHhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCceEEEecCCCCCHHHHHhhCCcCCch
Confidence 688999999999999999731 122222 3577899999997654 33432111 1 13445
Q ss_pred HHHHHHHHHHHHHHHhCCCcceEeecccCCcch
Q psy438 313 FLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEG 345 (669)
Q Consensus 313 ~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~ 345 (669)
..+.+.+-++-.++++... |=.|...||+..
T Consensus 163 ~v~~F~~YA~~~~~~fgdr--Vk~WiT~NEP~~ 193 (497)
T PLN02998 163 IVRDFTAYADTCFKEFGDR--VSHWTTINEVNV 193 (497)
T ss_pred HHHHHHHHHHHHHHHhcCc--CCEEEEccCcch
Confidence 5666666667778888764 668999999864
No 48
>PRK09936 hypothetical protein; Provisional
Probab=61.87 E-value=64 Score=34.90 Aligned_cols=130 Identities=13% Similarity=0.167 Sum_probs=86.1
Q ss_pred hHHHHHHHHHHHHcCCCEEEc-c---CCCCCC-c----HHHHHHHHHcCCEEEeecccccC---CCCCC----hHHHHHH
Q psy438 254 ESTIRDLLVSTKEANMNMLRV-W---GGGVYM-S----DYFYETCDELGILIWQDMMFACN---NYPAT----PTFLQSV 317 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~-~---~g~~~~-~----~~~~~~cDe~Gilv~~e~~~~~~---~~p~~----~~~~~~~ 317 (669)
++.++..++.+++.||++|=+ | |++-+. + ..-++.|.+.||-|+-..++--. .-..| +.|++..
T Consensus 37 ~~qWq~~~~~~~~~G~~tLivQWt~yG~~~fg~~~g~La~~l~~A~~~Gl~v~vGL~~Dp~y~q~~~~d~~~~~~yl~~~ 116 (296)
T PRK09936 37 DTQWQGLWSQLRLQGFDTLVVQWTRYGDADFGGQRGWLAKRLAAAQQAGLKLVVGLYADPEFFMHQKQDGAALESYLNRQ 116 (296)
T ss_pred HHHHHHHHHHHHHcCCcEEEEEeeeccCCCcccchHHHHHHHHHHHHcCCEEEEcccCChHHHHHHhcCchhHHHHHHHH
Confidence 467999999999999999843 2 322222 2 36789999999999988875210 01223 3466665
Q ss_pred HHHHHHHHHHhCCCcce--EeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhC-CCCCEEecCCCCCc
Q psy438 318 RSEISQTVRRVQHHPCI--AVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYD-PTRPYLTSSPTNGI 394 (669)
Q Consensus 318 ~~~~~~~v~r~rnHPSi--~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~D-ptRp~~~~S~~~~~ 394 (669)
..+..++.+..++||.+ .-|-+-=|.... .|...+. ....+..|.+.++++| +.+|+..|.-.++.
T Consensus 117 l~~~~~qa~~~~~~~~~~v~GWYiP~ElDd~---~W~~~~r--------R~~L~~~L~~~~~~l~~~~kPv~ISay~~g~ 185 (296)
T PRK09936 117 LGASLQQARLWSAAWGVPVDGWYLPAELDDL---NWRDEAR--------RQPLLTWLNAAQRLIDVSAKPVHISAFFAGN 185 (296)
T ss_pred HHHHHHHHHHHHhccCCCCCeEEeeeccchh---cccCHHH--------HHHHHHHHHHHHHhCCCCCCCeEEEeecccC
Confidence 66666788888999888 558887787642 2332211 2345577888888887 56788777555443
No 49
>PF00331 Glyco_hydro_10: Glycosyl hydrolase family 10; InterPro: IPR001000 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 10 GH10 from CAZY comprises enzymes with a number of known activities; xylanase (3.2.1.8 from EC); endo-1,3-beta-xylanase (3.2.1.32 from EC); cellobiohydrolase (3.2.1.91 from EC). These enzymes were formerly known as cellulase family F. The microbial degradation of cellulose and xylans requires several types of enzymes such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) [, ]. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family F [] or as the glycosyl hydrolases family 10 []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1UQZ_A 1UQY_A 1UR2_A 1UR1_A 2CNC_A 1OD8_A 1E0W_A 1E0V_A 1V0M_A 1E0X_B ....
Probab=57.73 E-value=30 Score=37.48 Aligned_cols=98 Identities=18% Similarity=0.208 Sum_probs=58.4
Q ss_pred HHHHHHHHHcCCEE------Eeec-ccccCCC-CCCh----HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhh
Q psy438 283 DYFYETCDELGILI------WQDM-MFACNNY-PATP----TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQK 350 (669)
Q Consensus 283 ~~~~~~cDe~Gilv------~~e~-~~~~~~~-p~~~----~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~ 350 (669)
+.+++.|.+.||.| |... |-+...- ..++ .+.+.+.+.+..++.|++....|..|-+-||+.......
T Consensus 62 D~~~~~a~~~g~~vrGH~LvW~~~~P~w~~~~~~~~~~~~~~~~~~l~~~I~~v~~~y~~~g~i~~WDVvNE~i~~~~~~ 141 (320)
T PF00331_consen 62 DAILDWARENGIKVRGHTLVWHSQTPDWVFNLANGSPDEKEELRARLENHIKTVVTRYKDKGRIYAWDVVNEAIDDDGNP 141 (320)
T ss_dssp HHHHHHHHHTT-EEEEEEEEESSSS-HHHHTSTTSSBHHHHHHHHHHHHHHHHHHHHTTTTTTESEEEEEES-B-TTSSS
T ss_pred hHHHHHHHhcCcceeeeeEEEcccccceeeeccCCCcccHHHHHHHHHHHHHHHHhHhccccceEEEEEeeecccCCCcc
Confidence 68999999999966 3321 1111011 0122 367788889999999999999999999999986543100
Q ss_pred hccccChhhHHHHHHHHHHHHHHHHHHhhCCC
Q psy438 351 WYIRENPELYYKEYAELYVNTLKPIVLQYDPT 382 (669)
Q Consensus 351 ~~~~~~~~~~~~~~~~~~~~~l~~~v~~~Dpt 382 (669)
.. -....+..-....|+....+.+++.||.
T Consensus 142 ~~--~r~~~~~~~lG~~yi~~aF~~A~~~~P~ 171 (320)
T PF00331_consen 142 GG--LRDSPWYDALGPDYIADAFRAAREADPN 171 (320)
T ss_dssp SS--BCTSHHHHHHTTCHHHHHHHHHHHHHTT
T ss_pred cc--ccCChhhhcccHhHHHHHHHHHHHhCCC
Confidence 00 0001122111123667778899999993
No 50
>COG2723 BglB Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]
Probab=57.08 E-value=56 Score=37.50 Aligned_cols=90 Identities=16% Similarity=0.215 Sum_probs=58.9
Q ss_pred HHHHHHHHHHHcCCCEEEcc---------CCCCCCc-------HHHHHHHHHcCCEEEeec-----ccccCC-C--CCCh
Q psy438 256 TIRDLLVSTKEANMNMLRVW---------GGGVYMS-------DYFYETCDELGILIWQDM-----MFACNN-Y--PATP 311 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~---------~g~~~~~-------~~~~~~cDe~Gilv~~e~-----~~~~~~-~--p~~~ 311 (669)
.++.|++++|++|+|+.|+- |++..++ +..+|.|-++||-.+.-+ |.+-.. | -.+.
T Consensus 60 rYkeDi~L~~emG~~~~R~SI~WsRIfP~g~~~e~N~~gl~fY~~l~del~~~gIep~vTL~Hfd~P~~L~~~ygGW~nR 139 (460)
T COG2723 60 RYKEDIALAKEMGLNAFRTSIEWSRIFPNGDGGEVNEKGLRFYDRLFDELKARGIEPFVTLYHFDLPLWLQKPYGGWENR 139 (460)
T ss_pred hhHHHHHHHHHcCCCEEEeeeeEEEeecCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEecccCCcHHHhhccCCccCH
Confidence 68889999999999999973 2221222 356789999999887643 432211 1 1233
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhh
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGAT 347 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~ 347 (669)
.-.+.+..-++-..+|+.. -|=-|...||+....
T Consensus 140 ~~i~~F~~ya~~vf~~f~d--kVk~W~TFNE~n~~~ 173 (460)
T COG2723 140 ETVDAFARYAATVFERFGD--KVKYWFTFNEPNVVV 173 (460)
T ss_pred HHHHHHHHHHHHHHHHhcC--cceEEEEecchhhhh
Confidence 4444555555666777764 367899999986544
No 51
>PF03659 Glyco_hydro_71: Glycosyl hydrolase family 71 ; InterPro: IPR005197 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha-1,3-glucanases belonging to glycoside hydrolase family 71 (GH71 from CAZY).
Probab=56.81 E-value=52 Score=36.85 Aligned_cols=75 Identities=9% Similarity=0.141 Sum_probs=52.7
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCC--c---HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHHHHHHhC
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYM--S---DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRRVQ 329 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~--~---~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v~r~r 329 (669)
+.++.+|++++++|+...=+=-|..-+ . ...|+.|++.|+-+.--|..... ...+. +++..+|+++.
T Consensus 17 ~dw~~di~~A~~~GIDgFaLNig~~d~~~~~~l~~a~~AA~~~gFKlf~SfD~~~~--~~~~~------~~~~~~i~~y~ 88 (386)
T PF03659_consen 17 EDWEADIRLAQAAGIDGFALNIGSSDSWQPDQLADAYQAAEAVGFKLFFSFDMNSL--GPWSQ------DELIALIKKYA 88 (386)
T ss_pred HHHHHHHHHHHHcCCCEEEEecccCCcccHHHHHHHHHHHHhcCCEEEEEecccCC--CCCCH------HHHHHHHHHHc
Confidence 468899999999999876543321112 1 46899999999888665554321 11111 67789999999
Q ss_pred CCcceEee
Q psy438 330 HHPCIAVW 337 (669)
Q Consensus 330 nHPSi~~W 337 (669)
+||+.+..
T Consensus 89 ~~pa~~~~ 96 (386)
T PF03659_consen 89 GHPAYFRY 96 (386)
T ss_pred CChhHEeE
Confidence 99998874
No 52
>PF00232 Glyco_hydro_1: Glycosyl hydrolase family 1; InterPro: IPR001360 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 1 GH1 from CAZY comprises enzymes with a number of known activities; beta-glucosidase (3.2.1.21 from EC); beta-galactosidase (3.2.1.23 from EC); 6-phospho-beta-galactosidase (3.2.1.85 from EC); 6-phospho-beta-glucosidase (3.2.1.86 from EC); lactase-phlorizin hydrolase (3.2.1.62 from EC), (3.2.1.108 from EC); beta-mannosidase (3.2.1.25 from EC); myrosinase (3.2.1.147 from EC). ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1QVB_A 3AHY_D 2E9L_A 2ZOX_A 2JFE_X 2E9M_A 3FIZ_A 3FIY_A 3CMJ_A 3FJ0_A ....
Probab=56.76 E-value=28 Score=39.62 Aligned_cols=91 Identities=15% Similarity=0.088 Sum_probs=61.7
Q ss_pred HHHHHHHHHHHHcCCCEEEccC--------C--CCCCc------HHHHHHHHHcCCEEEeec-----ccccCCC--CCCh
Q psy438 255 STIRDLLVSTKEANMNMLRVWG--------G--GVYMS------DYFYETCDELGILIWQDM-----MFACNNY--PATP 311 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~--------g--~~~~~------~~~~~~cDe~Gilv~~e~-----~~~~~~~--p~~~ 311 (669)
..++.||++||++|+|+.|+-. | |.... +.+.+.|-+.||-.+.-+ |.+...+ -.++
T Consensus 58 ~~y~eDi~l~~~lg~~~yRfsi~W~Ri~P~g~~g~~n~~~~~~Y~~~i~~l~~~gi~P~vtL~H~~~P~~l~~~ggw~~~ 137 (455)
T PF00232_consen 58 HRYKEDIALMKELGVNAYRFSISWSRIFPDGFEGKVNEEGLDFYRDLIDELLENGIEPIVTLYHFDLPLWLEDYGGWLNR 137 (455)
T ss_dssp HHHHHHHHHHHHHT-SEEEEE--HHHHSTTSSSSSS-HHHHHHHHHHHHHHHHTT-EEEEEEESS--BHHHHHHTGGGST
T ss_pred hhhhHHHHHHHhhccceeeeecchhheeecccccccCHhHhhhhHHHHHHHHhhccceeeeeeecccccceeecccccCH
Confidence 4789999999999999999721 1 12221 357788999999887644 3221111 1345
Q ss_pred HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhh
Q psy438 312 TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGAT 347 (669)
Q Consensus 312 ~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~ 347 (669)
++.+.+.+-++..++++... |-.|...||+....
T Consensus 138 ~~~~~F~~Ya~~~~~~~gd~--V~~w~T~NEp~~~~ 171 (455)
T PF00232_consen 138 ETVDWFARYAEFVFERFGDR--VKYWITFNEPNVFA 171 (455)
T ss_dssp HHHHHHHHHHHHHHHHHTTT--BSEEEEEETHHHHH
T ss_pred HHHHHHHHHHHHHHHHhCCC--cceEEeccccceee
Confidence 66777777788888998865 88899999986544
No 53
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=55.05 E-value=42 Score=31.94 Aligned_cols=44 Identities=25% Similarity=0.393 Sum_probs=32.1
Q ss_pred HHHHHHHHcCCCEEEccCC-----CCCCc--------------HHHHHHHHHcCCEEEeeccc
Q psy438 259 DLLVSTKEANMNMLRVWGG-----GVYMS--------------DYFYETCDELGILIWQDMMF 302 (669)
Q Consensus 259 ~~l~~~k~~g~N~iR~~~g-----~~~~~--------------~~~~~~cDe~Gilv~~e~~~ 302 (669)
+.++.+|++|+|+|-+..+ +-||+ .++.++|.+.||-|..=+.+
T Consensus 4 ~~~~~lk~~~v~si~i~a~~h~g~ayYPt~~~~~hp~L~~Dllge~v~a~h~~Girv~ay~~~ 66 (132)
T PF14871_consen 4 QFVDTLKEAHVNSITIFAKCHGGYAYYPTKVGPRHPGLKRDLLGEQVEACHERGIRVPAYFDF 66 (132)
T ss_pred HHHHHHHHhCCCEEEEEcccccEEEEccCCCCcCCCCCCcCHHHHHHHHHHHCCCEEEEEEee
Confidence 3567789999999987443 22332 47899999999999765544
No 54
>TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative. This family is conserved at better than 40 % identity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.
Probab=54.27 E-value=55 Score=34.17 Aligned_cols=77 Identities=13% Similarity=0.165 Sum_probs=50.8
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCC-C---c----------HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVY-M---S----------DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSE 320 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~-~---~----------~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~ 320 (669)
+.+++.++.++++|++.|+++++... . . .+..++|.++||.+--|.. .. .|.... .+
T Consensus 94 ~~~~~~i~~a~~lG~~~v~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~A~~~Gv~l~lE~~-~~-------~~~~t~-~~ 164 (279)
T TIGR00542 94 EIMEKAIQLARDLGIRTIQLAGYDVYYEEHDEETRRRFREGLKEAVELAARAQVTLAVEIM-DT-------PFMSSI-SK 164 (279)
T ss_pred HHHHHHHHHHHHhCCCEEEecCcccccCcCCHHHHHHHHHHHHHHHHHHHHcCCEEEEeeC-CC-------chhcCH-HH
Confidence 46889999999999999999874211 1 1 1456788899999988853 11 122222 24
Q ss_pred HHHHHHHhCCCcce-EeecccC
Q psy438 321 ISQTVRRVQHHPCI-AVWAGNN 341 (669)
Q Consensus 321 ~~~~v~r~rnHPSi-~~W~~~N 341 (669)
+.++++.+ +||.| ++|..+|
T Consensus 165 ~~~li~~v-~~~~v~~~~D~~h 185 (279)
T TIGR00542 165 WLKWDHYL-NSPWFTLYPDIGN 185 (279)
T ss_pred HHHHHHHc-CCCceEEEeCcCh
Confidence 45566555 67776 5677777
No 55
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 39 GH39 from CAZY comprises enzymes with several known activities; alpha-L-iduronidase (3.2.1.76 from EC); beta-xylosidase (3.2.1.37 from EC). The most highly conserved regions in these enzymes are located in their N-terminal sections. These contain a glutamic acid residue which, on the basis of similarities with other families of glycosyl hydrolases [], probably acts as the proton donor in their catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BS9_D 2BFG_E 1W91_B 1UHV_D 1PX8_A.
Probab=52.88 E-value=63 Score=37.06 Aligned_cols=123 Identities=20% Similarity=0.258 Sum_probs=67.5
Q ss_pred HHHHHHHHH-HHcCCCEEEccCCCCC------------------C---cHHHHHHHHHcCCEEEeeccccc-----C---
Q psy438 256 TIRDLLVST-KEANMNMLRVWGGGVY------------------M---SDYFYETCDELGILIWQDMMFAC-----N--- 305 (669)
Q Consensus 256 ~~~~~l~~~-k~~g~N~iR~~~g~~~------------------~---~~~~~~~cDe~Gilv~~e~~~~~-----~--- 305 (669)
.++..|..+ ++.||..||+|| -+ . -|..+|..-+.||..+.|+-+.- .
T Consensus 40 ~~q~~l~~~~~~~gf~yvR~h~--l~~ddm~~~~~~~~~~~~~Ynf~~lD~i~D~l~~~g~~P~vel~f~p~~~~~~~~~ 117 (486)
T PF01229_consen 40 DWQEQLRELQEELGFRYVRFHG--LFSDDMMVYSESDEDGIPPYNFTYLDQILDFLLENGLKPFVELGFMPMALASGYQT 117 (486)
T ss_dssp HHHHHHHHHHCCS--SEEEES---TTSTTTT-EEEEETTEEEEE--HHHHHHHHHHHHCT-EEEEEE-SB-GGGBSS--E
T ss_pred HHHHHHHHHHhccCceEEEEEe--eccCchhhccccccCCCCcCChHHHHHHHHHHHHcCCEEEEEEEechhhhcCCCCc
Confidence 355555555 589999999996 22 1 16789999999999998885411 0
Q ss_pred --------CCCCC-hHHHHHHHHHHHHHHHHhCC-CcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHH
Q psy438 306 --------NYPAT-PTFLQSVRSEISQTVRRVQH-HPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPI 375 (669)
Q Consensus 306 --------~~p~~-~~~~~~~~~~~~~~v~r~rn-HPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 375 (669)
..|.+ ..|.+-+.+-++..+.|+.. .=+--.|.+=||+..... |. ....+ +|.++| +...+.
T Consensus 118 ~~~~~~~~~pp~~~~~W~~lv~~~~~h~~~RYG~~ev~~W~fEiWNEPd~~~f--~~-~~~~~----ey~~ly-~~~~~~ 189 (486)
T PF01229_consen 118 VFWYKGNISPPKDYEKWRDLVRAFARHYIDRYGIEEVSTWYFEIWNEPDLKDF--WW-DGTPE----EYFELY-DATARA 189 (486)
T ss_dssp ETTTTEE-S-BS-HHHHHHHHHHHHHHHHHHHHHHHHTTSEEEESS-TTSTTT--SG-GG-HH----HHHHHH-HHHHHH
T ss_pred cccccCCcCCcccHHHHHHHHHHHHHHHHhhcCCccccceeEEeCcCCCcccc--cC-CCCHH----HHHHHH-HHHHHH
Confidence 01223 24555555555666666532 223335677799765321 11 11222 354555 667789
Q ss_pred HHhhCCCCCEEec
Q psy438 376 VLQYDPTRPYLTS 388 (669)
Q Consensus 376 v~~~DptRp~~~~ 388 (669)
+|+.||+-.+...
T Consensus 190 iK~~~p~~~vGGp 202 (486)
T PF01229_consen 190 IKAVDPELKVGGP 202 (486)
T ss_dssp HHHH-TTSEEEEE
T ss_pred HHHhCCCCcccCc
Confidence 9999998777654
No 56
>cd00019 AP2Ec AP endonuclease family 2; These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites; the alignment also contains hexulose-6-phosphate isomerases, enzymes that catalyze the epimerization of D-arabino-6-hexulose 3-phosphate to D-fructose 6-phosphate, via cleaving the phosphoesterbond with the sugar.
Probab=49.92 E-value=39 Score=35.24 Aligned_cols=80 Identities=13% Similarity=0.271 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCC---c--------H---HHHHHHHHcCCEEEeecccccC-CCCCChHHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYM---S--------D---YFYETCDELGILIWQDMMFACN-NYPATPTFLQSVRS 319 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~---~--------~---~~~~~cDe~Gilv~~e~~~~~~-~~p~~~~~~~~~~~ 319 (669)
+.+++.++.++++|+..|++|.| ..+ . + ...++|.++||.+-.|...... .+..+. .
T Consensus 85 ~~~~~~i~~A~~lG~~~v~~~~g-~~~~~~~~~~~~~~~~~l~~l~~~a~~~gi~l~lEn~~~~~~~~~~t~-------~ 156 (279)
T cd00019 85 ERLKDEIERCEELGIRLLVFHPG-SYLGQSKEEGLKRVIEALNELIDKAETKGVVIALETMAGQGNEIGSSF-------E 156 (279)
T ss_pred HHHHHHHHHHHHcCCCEEEECCC-CCCCCCHHHHHHHHHHHHHHHHHhccCCCCEEEEeCCCCCCCCCCCCH-------H
Confidence 47888999999999999999996 443 1 2 2345556799999888653221 012222 3
Q ss_pred HHHHHHHHhCCCcce-EeecccCC
Q psy438 320 EISQTVRRVQHHPCI-AVWAGNNE 342 (669)
Q Consensus 320 ~~~~~v~r~rnHPSi-~~W~~~NE 342 (669)
++.++++.+..||.+ +++-.+|=
T Consensus 157 ~~~~li~~v~~~~~~g~~lD~~h~ 180 (279)
T cd00019 157 ELKEIIDLIKEKPRVGVCIDTCHI 180 (279)
T ss_pred HHHHHHHhcCCCCCeEEEEEhhhH
Confidence 345667776668888 88888874
No 57
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=48.51 E-value=3.8e+02 Score=28.91 Aligned_cols=134 Identities=11% Similarity=0.098 Sum_probs=71.3
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEeec---cc------ccCCCC--CChHHHHHHHHHHHHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQDM---MF------ACNNYP--ATPTFLQSVRSEISQTV 325 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~e~---~~------~~~~~p--~~~~~~~~~~~~~~~~v 325 (669)
.+++++.++++|+.++-.---........|+.+.+.|.++...- +. +...++ .+|+..+-+.+.+++
T Consensus 72 p~~mi~~L~~~G~k~~~~v~P~v~~~~~~y~e~~~~g~l~~~~~~~~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~-- 149 (317)
T cd06598 72 PAGMIADLAKKGVKTIVITEPFVLKNSKNWGEAVKAGALLKKDQGGVPTLFDFWFGNTGLIDWFDPAAQAWFHDNYKK-- 149 (317)
T ss_pred HHHHHHHHHHcCCcEEEEEcCcccCCchhHHHHHhCCCEEEECCCCCEeeeeccCCCccccCCCCHHHHHHHHHHHHH--
Confidence 45678889999999887522111123467899999999764421 11 111122 245544444444433
Q ss_pred HHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCC-CCCEEec-CCCCCc
Q psy438 326 RRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDP-TRPYLTS-SPTNGI 394 (669)
Q Consensus 326 ~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~Dp-tRp~~~~-S~~~~~ 394 (669)
+.....-.+|.=.||.................+..-|..++.+.+.+.+++.-+ .||++.+ |.+.|.
T Consensus 150 --~~~~Gvdg~w~D~~Ep~~~~~~~~~~~g~~~~~hN~y~~~~~~~~~e~~~~~~~~~r~~~~~Rs~~~Gs 218 (317)
T cd06598 150 --LIDQGVTGWWGDLGEPEVHPPDMCHHKGKAAEVHNIYGHLWAKSIYEGYQQNYPNERPFILMRAGFAGS 218 (317)
T ss_pred --hhhCCccEEEecCCCccccCCccccCCCcHhHHhhHHHHHHHHHHHHHHHHhcCCCCeEEEEecCcCcc
Confidence 344567778888899632111000000011112223555666777776766544 6888765 444443
No 58
>cd00019 AP2Ec AP endonuclease family 2; These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites; the alignment also contains hexulose-6-phosphate isomerases, enzymes that catalyze the epimerization of D-arabino-6-hexulose 3-phosphate to D-fructose 6-phosphate, via cleaving the phosphoesterbond with the sugar.
Probab=46.83 E-value=3.1e+02 Score=28.47 Aligned_cols=85 Identities=11% Similarity=0.164 Sum_probs=50.4
Q ss_pred HHHHHHHHHHcCCCEEEccCCCC--CC--------cHHHHHHHHHc-CCEEEeecccccCCCCCChHHHHHHHHHHHHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGV--YM--------SDYFYETCDEL-GILIWQDMMFACNNYPATPTFLQSVRSEISQTV 325 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~--~~--------~~~~~~~cDe~-Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v 325 (669)
+...++.++++|+..|=+..+.. +. -+.+-++|+++ |+.+....++.......+++-.+...+.++..+
T Consensus 12 l~~~l~~a~~~G~d~vEl~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~~~~~~~~i 91 (279)
T cd00019 12 LENALKRAKEIGFDTVAMFLGNPRSWLSRPLKKERAEKFKAIAEEGPSICLSVHAPYLINLASPDKEKREKSIERLKDEI 91 (279)
T ss_pred HHHHHHHHHHcCCCEEEEEcCCCCccCCCCCCHHHHHHHHHHHHHcCCCcEEEEcCceeccCCCCHHHHHHHHHHHHHHH
Confidence 67779999999999884432111 11 13789999999 777765544321111123334444445555544
Q ss_pred H---HhCCCcceEeecccCC
Q psy438 326 R---RVQHHPCIAVWAGNNE 342 (669)
Q Consensus 326 ~---r~rnHPSi~~W~~~NE 342 (669)
+ .+ ..+.|++|.+..+
T Consensus 92 ~~A~~l-G~~~v~~~~g~~~ 110 (279)
T cd00019 92 ERCEEL-GIRLLVFHPGSYL 110 (279)
T ss_pred HHHHHc-CCCEEEECCCCCC
Confidence 4 44 4668888877654
No 59
>TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA.
Probab=46.62 E-value=84 Score=32.20 Aligned_cols=84 Identities=10% Similarity=0.155 Sum_probs=52.7
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCCc--------------HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYMS--------------DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSE 320 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~~--------------~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~ 320 (669)
+.+++.++.++++|+..|+++.|..... ...-++|.++||.+.-|.-... .+|.. ++ .-.++
T Consensus 84 ~~~~~~i~~a~~lg~~~i~~~~g~~~~~~~~~~~~~~~~~~l~~l~~~A~~~gi~l~lE~~~~~-~~~~~--~l-~t~~~ 159 (254)
T TIGR03234 84 EGVALAIAYARALGCPQVNCLAGKRPAGVSPEEARATLVENLRYAADALDRIGLTLLIEPINSF-DMPGF--FL-TTTEQ 159 (254)
T ss_pred HHHHHHHHHHHHhCCCEEEECcCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCCEEEEEECCcc-cCCCC--hh-cCHHH
Confidence 4677889999999999999987643211 1235678899999988842110 11111 22 12244
Q ss_pred HHHHHHHhCCCcce-EeecccCCc
Q psy438 321 ISQTVRRVQHHPCI-AVWAGNNEM 343 (669)
Q Consensus 321 ~~~~v~r~rnHPSi-~~W~~~NE~ 343 (669)
+.++|++.. ||.+ ++|-.+|=.
T Consensus 160 ~~~li~~v~-~~~~~i~~D~~h~~ 182 (254)
T TIGR03234 160 ALAVIDDVG-RENLKLQYDLYHMQ 182 (254)
T ss_pred HHHHHHHhC-CCCEeEeeehhhhh
Confidence 556776664 6776 777777643
No 60
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=45.69 E-value=1.4e+02 Score=32.11 Aligned_cols=112 Identities=17% Similarity=0.277 Sum_probs=72.6
Q ss_pred HHHHHHHHHHHHcCCCEEEccC----------------------------CCCCCcH---HHHHHHHHcCCEEEeecccc
Q psy438 255 STIRDLLVSTKEANMNMLRVWG----------------------------GGVYMSD---YFYETCDELGILIWQDMMFA 303 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~----------------------------g~~~~~~---~~~~~cDe~Gilv~~e~~~~ 303 (669)
+.+++.|+.|...++|.+-.|- ++.|..+ ++.+.|.++||-|+.|+...
T Consensus 16 ~~lk~~id~ma~~K~N~lhlHl~D~~~~~le~~~~p~l~~~g~~~~~~~~~~~yT~~di~elv~yA~~rgI~viPEiD~P 95 (303)
T cd02742 16 ESIKRTIDVLARYKINTFHWHLTDDQAWRIESKKFPELAEKGGQINPRSPGGFYTYAQLKDIIEYAAARGIEVIPEIDMP 95 (303)
T ss_pred HHHHHHHHHHHHhCCcEEEEeeecCCCceEeeCccchhhhhcccccCCCCCCeECHHHHHHHHHHHHHcCCEEEEeccch
Confidence 4788999999999999996551 2345433 68899999999999988542
Q ss_pred cCC------C---------------------CCCh---HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhcc
Q psy438 304 CNN------Y---------------------PATP---TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYI 353 (669)
Q Consensus 304 ~~~------~---------------------p~~~---~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~ 353 (669)
.+. + |.++ +|++.+.+|+.++ +||=++=.||-|.....
T Consensus 96 GH~~a~~~~~p~l~~~~~~~~~~~~~~~~l~~~~~~t~~fl~~l~~e~~~l------f~~~~iHiGgDE~~~~~------ 163 (303)
T cd02742 96 GHSTAFVKSFPKLLTECYAGLKLRDVFDPLDPTLPKGYDFLDDLFGEIAEL------FPDRYLHIGGDEAHFKQ------ 163 (303)
T ss_pred HHHHHHHHhCHHhccCccccCCCCCCCCccCCCCccHHHHHHHHHHHHHHh------CCCCeEEecceecCCCC------
Confidence 110 0 2233 4666665555543 37777788888985421
Q ss_pred ccChhhHHHHHHHHHHHHHHHHHHhhCCCCCE
Q psy438 354 RENPELYYKEYAELYVNTLKPIVLQYDPTRPY 385 (669)
Q Consensus 354 ~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~ 385 (669)
++. +....|++.+.+.+++.. .+++
T Consensus 164 --~~~----~l~~~f~~~~~~~v~~~g-~~~~ 188 (303)
T cd02742 164 --DRK----HLMSQFIQRVLDIVKKKG-KKVI 188 (303)
T ss_pred --CHH----HHHHHHHHHHHHHHHHcC-CeEE
Confidence 111 122457788888888874 3344
No 61
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=44.83 E-value=36 Score=40.48 Aligned_cols=47 Identities=26% Similarity=0.435 Sum_probs=36.0
Q ss_pred HHHHHHHHHHHHcCCCEE-----------EccC----C-----CCCC-c---HHHHHHHHHcCCEEEeecc
Q psy438 255 STIRDLLVSTKEANMNML-----------RVWG----G-----GVYM-S---DYFYETCDELGILIWQDMM 301 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~i-----------R~~~----g-----~~~~-~---~~~~~~cDe~Gilv~~e~~ 301 (669)
+...+.|..+|++|+++| |-|| | +.|- + .+|.|.|-++||.|+-|.-
T Consensus 165 e~a~~llpYl~elG~T~IELMPv~e~p~~~sWGYq~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD~V 235 (628)
T COG0296 165 ELAIELLPYLKELGITHIELMPVAEHPGDRSWGYQGTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILDWV 235 (628)
T ss_pred HHHHHHhHHHHHhCCCEEEEcccccCCCCCCCCCCcceeccccccCCCHHHHHHHHHHHHHcCCEEEEEec
Confidence 466777999999999999 6676 0 0111 1 3799999999999999874
No 62
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers). In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury. GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=44.35 E-value=2.6e+02 Score=30.00 Aligned_cols=125 Identities=18% Similarity=0.196 Sum_probs=73.0
Q ss_pred HHHHHHHHHHHHcCCCEEEccC---------------CCCCCcH---HHHHHHHHcCCEEEeecccccC--------C--
Q psy438 255 STIRDLLVSTKEANMNMLRVWG---------------GGVYMSD---YFYETCDELGILIWQDMMFACN--------N-- 306 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~---------------g~~~~~~---~~~~~cDe~Gilv~~e~~~~~~--------~-- 306 (669)
+.+++.|+.|...++|.+=+|- ++.|..+ ++.+.|.++||-|+.|+....+ .
T Consensus 17 ~~lk~~id~ma~~k~N~l~lhl~D~f~~~~~p~~~~~~~~yT~~ei~ei~~yA~~~gI~vIPeid~pGH~~~~l~~~~~~ 96 (301)
T cd06565 17 SYLKKLLRLLALLGANGLLLYYEDTFPYEGEPEVGRMRGAYTKEEIREIDDYAAELGIEVIPLIQTLGHLEFILKHPEFR 96 (301)
T ss_pred HHHHHHHHHHHHcCCCEEEEEEecceecCCCcccccCCCCcCHHHHHHHHHHHHHcCCEEEecCCCHHHHHHHHhCcccc
Confidence 5789999999999999984431 2455543 6889999999999988753111 0
Q ss_pred ------------CCCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhcc---ccChhhHHHHHHHHHHHH
Q psy438 307 ------------YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYI---RENPELYYKEYAELYVNT 371 (669)
Q Consensus 307 ------------~p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~ 371 (669)
-|.+++-.+-+.+-+++++.- |||=.+=.|+-|........... ..+.. +-...|++.
T Consensus 97 ~l~~~~~~~~~l~~~~~~t~~fi~~li~ev~~~---f~s~~~HIG~DE~~~~g~~~~~~~~~~~~~~----~l~~~~~~~ 169 (301)
T cd06565 97 HLREVDDPPQTLCPGEPKTYDFIEEMIRQVLEL---HPSKYIHIGMDEAYDLGRGRSLRKHGNLGRG----ELYLEHLKK 169 (301)
T ss_pred cccccCCCCCccCCCChhHHHHHHHHHHHHHHh---CCCCeEEECCCcccccCCCHHHHHhcCCCHH----HHHHHHHHH
Confidence 023343333332223333322 77878878888875421111100 01111 112346788
Q ss_pred HHHHHHhhCCCCCEEe
Q psy438 372 LKPIVLQYDPTRPYLT 387 (669)
Q Consensus 372 l~~~v~~~DptRp~~~ 387 (669)
+.+.+++..+ +|+.-
T Consensus 170 v~~~v~~~g~-~~~~W 184 (301)
T cd06565 170 VLKIIKKRGP-KPMMW 184 (301)
T ss_pred HHHHHHHcCC-EEEEE
Confidence 8889998876 45443
No 63
>COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=41.96 E-value=3e+02 Score=33.85 Aligned_cols=185 Identities=14% Similarity=0.168 Sum_probs=100.0
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEeecc--------cccCCCC--CChHHHHHHHHHHHHHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQDMM--------FACNNYP--ATPTFLQSVRSEISQTVR 326 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~e~~--------~~~~~~p--~~~~~~~~~~~~~~~~v~ 326 (669)
.+.+++.+++.|++++=.---........++.+.+.|.+|-.+-. .....++ .+|+..+-+.+ ...+
T Consensus 323 pk~mi~~l~~~Gikl~~~i~P~i~~d~~~~~e~~~~Gy~~k~~~g~~~~~~~w~~~~a~~DFtnp~~r~Ww~~---~~~~ 399 (772)
T COG1501 323 PKQMIAELHEKGIKLIVIINPYIKQDSPLFKEAIEKGYFVKDPDGEIYQADFWPGNSAFPDFTNPDAREWWAS---DKKK 399 (772)
T ss_pred HHHHHHHHHhcCceEEEEeccccccCCchHHHHHHCCeEEECCCCCEeeecccCCcccccCCCCHHHHHHHHH---HHHh
Confidence 457788899999999854110011123788999999999965521 1111122 24544444442 2334
Q ss_pred HhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCC-CCCEEec-CCCCCccccccccccc
Q psy438 327 RVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDP-TRPYLTS-SPTNGIESEKAKYALA 404 (669)
Q Consensus 327 r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~Dp-tRp~~~~-S~~~~~~~~~~~~~~~ 404 (669)
.+..-..-.+|.=.||........+......+....-|-.++.+...+.+++..| .||++.+ |.+.|.+.-
T Consensus 400 ~l~d~Gv~g~W~D~nEp~~~~~~~~~~g~~~~~~~N~yp~~~~~a~~~~~~~~~~~~r~~~lsRsg~aG~Q~~------- 472 (772)
T COG1501 400 NLLDLGVDGFWNDMNEPEPFDGDGFGNGIDHEEMHNLYPLLYAKAVYEALKELGGNERPFILSRSGYAGSQRY------- 472 (772)
T ss_pred HHHhcCccEEEccCCCCccccccccccccCHHHHhcchhHHHHHHHHHHHHhhcCCCceEEEEecccccceec-------
Confidence 4555678889999999865421101111222222223445677888888999976 6888776 566665521
Q ss_pred CCCCCCCCCccc-cccccccc----CCCCCCCCCccccccc-cCC--CCCcchhcccC
Q psy438 405 DNPYSNIYGDTH-NYDYYQNL----WDPSTAPKSRFCSEFG-IQS--LPQLSTFQKVA 454 (669)
Q Consensus 405 ~~py~~~~gd~h-~y~~~~~~----~~~~~~~~~~f~sE~G-~~s--~p~~~sl~~~~ 454 (669)
-..|.||.+ .|+..... ...+.-+.+...+..| +.+ .|+.+-+.+..
T Consensus 473 ---~~~WsGD~~s~wd~l~~si~~~Ls~~~sGi~~wg~DiGGF~g~~~~~~EL~~RW~ 527 (772)
T COG1501 473 ---AAHWSGDNRSSWDSLRESIPAGLSLSLSGIPFWGHDIGGFTGGDDPTAELYIRWY 527 (772)
T ss_pred ---cceeCCccccchHHHHhhHHhhhchhccCCccccccccccCCCCCCCHHHHHHHH
Confidence 134677764 23221100 0001112344445554 444 56666666643
No 64
>PF01261 AP_endonuc_2: Xylose isomerase-like TIM barrel; InterPro: IPR012307 This TIM alpha/beta barrel structure is found in xylose isomerase (P19148 from SWISSPROT) and in endonuclease IV (P12638 from SWISSPROT, 3.1.21.2 from EC). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae []. ; PDB: 3KWS_B 3DX5_A 3CQH_B 3CQI_A 3CQK_A 3CQJ_B 2G0W_B 1DXI_A 2ZDS_D 3TVA_B ....
Probab=40.12 E-value=1.2e+02 Score=29.17 Aligned_cols=82 Identities=13% Similarity=0.094 Sum_probs=51.2
Q ss_pred HHHHHHHHHHHHcCCCEEEccCC---CCCCc-------------HHHHHHHHHcCCEEEeecccccCC-CCCChHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGG---GVYMS-------------DYFYETCDELGILIWQDMMFACNN-YPATPTFLQSV 317 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g---~~~~~-------------~~~~~~cDe~Gilv~~e~~~~~~~-~p~~~~~~~~~ 317 (669)
+.+++.++.++.+|+..|++|.| ..... ....+.|.++|+.+..|....... .+.+
T Consensus 71 ~~~~~~i~~a~~lg~~~i~~~~g~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~gv~i~lE~~~~~~~~~~~~------- 143 (213)
T PF01261_consen 71 EYLKKAIDLAKRLGAKYIVVHSGRYPSGPEDDTEENWERLAENLRELAEIAEEYGVRIALENHPGPFSETPFS------- 143 (213)
T ss_dssp HHHHHHHHHHHHHTBSEEEEECTTESSSTTSSHHHHHHHHHHHHHHHHHHHHHHTSEEEEE-SSSSSSSEESS-------
T ss_pred HHHHHHHHHHHHhCCCceeecCcccccccCCCHHHHHHHHHHHHHHHHhhhhhhcceEEEecccCccccchhh-------
Confidence 57888999999999999999986 22221 356778889999998886533221 1111
Q ss_pred HHHHHHHHHHhCCCcceEeecccCCc
Q psy438 318 RSEISQTVRRVQHHPCIAVWAGNNEM 343 (669)
Q Consensus 318 ~~~~~~~v~r~rnHPSi~~W~~~NE~ 343 (669)
.+++.++++.+..--.=++|-.+|-.
T Consensus 144 ~~~~~~~l~~~~~~~~~i~~D~~h~~ 169 (213)
T PF01261_consen 144 VEEIYRLLEEVDSPNVGICFDTGHLI 169 (213)
T ss_dssp HHHHHHHHHHHTTTTEEEEEEHHHHH
T ss_pred HHHHHHHHhhcCCCcceEEEehHHHH
Confidence 23444566655432244666665554
No 65
>COG3693 XynA Beta-1,4-xylanase [Carbohydrate transport and metabolism]
Probab=37.20 E-value=1.8e+02 Score=32.12 Aligned_cols=92 Identities=18% Similarity=0.267 Sum_probs=58.1
Q ss_pred cHHHHHHHHHcCCE------EEeecccccCCC--CCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhh----hh
Q psy438 282 SDYFYETCDELGIL------IWQDMMFACNNY--PATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGAT----IQ 349 (669)
Q Consensus 282 ~~~~~~~cDe~Gil------v~~e~~~~~~~~--p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~----~~ 349 (669)
.|..++.|.+.|+- ||.-.--.|-.. -..+.+.+.+.+.+...+.|+++- ++.|-+-||..... ..
T Consensus 84 AD~ia~FAr~h~m~lhGHtLvW~~q~P~W~~~~e~~~~~~~~~~e~hI~tV~~rYkg~--~~sWDVVNE~vdd~g~~R~s 161 (345)
T COG3693 84 ADAIANFARKHNMPLHGHTLVWHSQVPDWLFGDELSKEALAKMVEEHIKTVVGRYKGS--VASWDVVNEAVDDQGSLRRS 161 (345)
T ss_pred hHHHHHHHHHcCCeeccceeeecccCCchhhccccChHHHHHHHHHHHHHHHHhccCc--eeEEEecccccCCCchhhhh
Confidence 47889999999984 444211111101 123467788888889999999986 99999999985421 11
Q ss_pred hhccc-cChhhHHHHHHHHHHHHHHHHHHhhCCCCC
Q psy438 350 KWYIR-ENPELYYKEYAELYVNTLKPIVLQYDPTRP 384 (669)
Q Consensus 350 ~~~~~-~~~~~~~~~~~~~~~~~l~~~v~~~DptRp 384 (669)
.|+.. ..+ .+++.....+++.||.--
T Consensus 162 ~w~~~~~gp---------d~I~~aF~~AreadP~Ak 188 (345)
T COG3693 162 AWYDGGTGP---------DYIKLAFHIAREADPDAK 188 (345)
T ss_pred hhhccCCcc---------HHHHHHHHHHHhhCCCce
Confidence 24332 222 234555678899999533
No 66
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=36.95 E-value=1.5e+02 Score=30.62 Aligned_cols=76 Identities=12% Similarity=0.135 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCCc--------------HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYMS--------------DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSE 320 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~~--------------~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~ 320 (669)
+.+++.++.++.+|...|++|.|..... .++.++|.++||.+-.|... |....+... .++
T Consensus 90 ~~~~~~i~~a~~lGa~~i~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~gv~l~iE~~~-----~~~~~~~~t-~~~ 163 (275)
T PRK09856 90 DMIKLAMDMAKEMNAGYTLISAAHAGYLTPPNVIWGRLAENLSELCEYAENIGMDLILEPLT-----PYESNVVCN-AND 163 (275)
T ss_pred HHHHHHHHHHHHhCCCEEEEcCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHcCCEEEEecCC-----CCcccccCC-HHH
Q ss_pred HHHHHHHhCCCcceEee
Q psy438 321 ISQTVRRVQHHPCIAVW 337 (669)
Q Consensus 321 ~~~~v~r~rnHPSi~~W 337 (669)
+.++++.. +||.|-+.
T Consensus 164 ~~~l~~~~-~~~~v~~~ 179 (275)
T PRK09856 164 VLHALALV-PSPRLFSM 179 (275)
T ss_pred HHHHHHHc-CCCcceeE
No 67
>PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=36.86 E-value=2.1e+02 Score=29.71 Aligned_cols=78 Identities=12% Similarity=0.110 Sum_probs=49.1
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCC-c-------------HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYM-S-------------DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSE 320 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~-~-------------~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~ 320 (669)
+.+++.++.++++|+..|+++|+.... . .+..++|.++|+.+.-|..... +-.+. .+
T Consensus 99 ~~~~~~i~~a~~lG~~~i~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~A~~~GV~i~iE~~~~~--~~~~~-------~~ 169 (283)
T PRK13209 99 EIMRKAIQLAQDLGIRVIQLAGYDVYYEQANNETRRRFIDGLKESVELASRASVTLAFEIMDTP--FMNSI-------SK 169 (283)
T ss_pred HHHHHHHHHHHHcCCCEEEECCccccccccHHHHHHHHHHHHHHHHHHHHHhCCEEEEeecCCc--ccCCH-------HH
Confidence 468889999999999999997642110 1 2356778889999988864211 11222 12
Q ss_pred HHHHHHHhCCCcce-EeecccCC
Q psy438 321 ISQTVRRVQHHPCI-AVWAGNNE 342 (669)
Q Consensus 321 ~~~~v~r~rnHPSi-~~W~~~NE 342 (669)
+.++++.. +||.+ +++-.+|=
T Consensus 170 ~~~ll~~v-~~~~lgl~~D~~h~ 191 (283)
T PRK13209 170 ALGYAHYL-NSPWFQLYPDIGNL 191 (283)
T ss_pred HHHHHHHh-CCCccceEeccchH
Confidence 33455544 67877 55666553
No 68
>PRK01060 endonuclease IV; Provisional
Probab=35.98 E-value=2.2e+02 Score=29.58 Aligned_cols=84 Identities=17% Similarity=0.297 Sum_probs=51.2
Q ss_pred HHHHHHHHHHcCCCEEEccCCCC--CCc--------HHHHHHHHHcCCE---EEeecccccCCCCCChHHHHHHHHHHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGV--YMS--------DYFYETCDELGIL---IWQDMMFACNNYPATPTFLQSVRSEISQ 323 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~--~~~--------~~~~~~cDe~Gil---v~~e~~~~~~~~p~~~~~~~~~~~~~~~ 323 (669)
+++.++.++++|+..|=++.++. +.. ..+-++|+++||- +....|+....-..+++..+...+.+++
T Consensus 14 ~~~~l~~~~~~G~d~vEl~~~~p~~~~~~~~~~~~~~~lk~~~~~~gl~~~~~~~h~~~~~nl~~~d~~~r~~s~~~~~~ 93 (281)
T PRK01060 14 LEGAVAEAAEIGANAFMIFTGNPQQWKRKPLEELNIEAFKAACEKYGISPEDILVHAPYLINLGNPNKEILEKSRDFLIQ 93 (281)
T ss_pred HHHHHHHHHHcCCCEEEEECCCCCCCcCCCCCHHHHHHHHHHHHHcCCCCCceEEecceEecCCCCCHHHHHHHHHHHHH
Confidence 56678899999999997765311 111 3488999999995 4444443211112345555555556665
Q ss_pred HHHH--hCCCcceEeeccc
Q psy438 324 TVRR--VQHHPCIAVWAGN 340 (669)
Q Consensus 324 ~v~r--~rnHPSi~~W~~~ 340 (669)
.++. .-.-|.|+++.+.
T Consensus 94 ~i~~A~~lga~~vv~h~G~ 112 (281)
T PRK01060 94 EIERCAALGAKLLVFHPGS 112 (281)
T ss_pred HHHHHHHcCCCEEEEcCCc
Confidence 5543 3345678887664
No 69
>PF13204 DUF4038: Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=35.87 E-value=78 Score=33.87 Aligned_cols=55 Identities=11% Similarity=0.122 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHHhcCCceEEEEcCCCCC-chhhhhhhhhccCchhhhhhHHHHHHHHHHHHHHHhCCCCCee
Q psy438 569 LQSVRSEISQTVRRVQHHPCIAVWAGNNEM-EGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYL 641 (669)
Q Consensus 569 ~~~~~~El~e~IrR~rNHPSIvlWs~gNE~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~vk~~DPTRp~~ 641 (669)
.++.++=++.+|+|....|-|+ |+++||. ... . . ...-+++.+.+++.||..|++
T Consensus 128 ~e~~~~Y~~yv~~Ry~~~~Nvi-W~l~gd~~~~~--------~-~--------~~~w~~~~~~i~~~dp~~L~T 183 (289)
T PF13204_consen 128 PENAERYGRYVVARYGAYPNVI-WILGGDYFDTE--------K-T--------RADWDAMARGIKENDPYQLIT 183 (289)
T ss_dssp HHHHHHHHHHHHHHHTT-SSEE-EEEESSS--TT--------S-S--------HHHHHHHHHHHHHH--SS-EE
T ss_pred HHHHHHHHHHHHHHHhcCCCCE-EEecCccCCCC--------c-C--------HHHHHHHHHHHHhhCCCCcEE
Confidence 5677788889999999999887 9999998 111 0 0 122344889999999977754
No 70
>COG5014 Predicted Fe-S oxidoreductase [General function prediction only]
Probab=35.10 E-value=1.2e+02 Score=30.62 Aligned_cols=71 Identities=13% Similarity=0.135 Sum_probs=43.1
Q ss_pred cchHHHHHHHHHHHHcCCCEEEccCCCCCC-cH---HHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHHHHHH
Q psy438 252 RHESTIRDLLVSTKEANMNMLRVWGGGVYM-SD---YFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRR 327 (669)
Q Consensus 252 ~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~-~~---~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v~r 327 (669)
+.++..+++++..|..|+..+|+-|+-..- ++ +..|+|-++-+.+-..-- .+..| +.+|+.
T Consensus 75 ~P~eVaeRL~ei~K~~g~d~vRiSG~EP~l~~EHvlevIeLl~~~tFvlETNG~----~~g~d-----------rslv~e 139 (228)
T COG5014 75 SPEEVAERLLEISKKRGCDLVRISGAEPILGREHVLEVIELLVNNTFVLETNGL----MFGFD-----------RSLVDE 139 (228)
T ss_pred CHHHHHHHHHHHHHhcCCcEEEeeCCCccccHHHHHHHHHhccCceEEEEeCCe----EEecC-----------HHHHHH
Confidence 345688899999999999999998752221 34 455666444443321110 11223 246777
Q ss_pred hCCCcceEee
Q psy438 328 VQHHPCIAVW 337 (669)
Q Consensus 328 ~rnHPSi~~W 337 (669)
+-|||-+.+=
T Consensus 140 l~nr~nv~vR 149 (228)
T COG5014 140 LVNRLNVLVR 149 (228)
T ss_pred HhcCCceEEE
Confidence 8888877653
No 71
>PF07905 PucR: Purine catabolism regulatory protein-like family; InterPro: IPR012914 This domain is found in the purine catabolism regulatory protein expressed by Bacillus subtilis (PucR, O32138 from SWISSPROT). PucR is thought to be a transcriptional regulator of genes involved in the purine degradation pathway, and may contain a LysR-like DNA-binding domain. It is similar to LysR-type regulators in that it represses its own expression []. The other members of this family are also putative regulatory proteins.
Probab=32.43 E-value=1.1e+02 Score=28.19 Aligned_cols=44 Identities=20% Similarity=0.299 Sum_probs=34.3
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCC--cHHHHHHHHHcCCEEEe
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYM--SDYFYETCDELGILIWQ 298 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~--~~~~~~~cDe~Gilv~~ 298 (669)
+.+.+.++.+.+.|+=.+=+-.|...+ |++..++||++|+-++.
T Consensus 59 ~~~~~~i~~L~~~~~agL~i~~~~~~~~iP~~~i~~A~~~~lPli~ 104 (123)
T PF07905_consen 59 EELREFIRELAEKGAAGLGIKTGRYLDEIPEEIIELADELGLPLIE 104 (123)
T ss_pred HHHHHHHHHHHHCCCeEEEEeccCccccCCHHHHHHHHHcCCCEEE
Confidence 357888999999999988665542333 68999999999998743
No 72
>COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]
Probab=30.18 E-value=1.8e+02 Score=31.30 Aligned_cols=46 Identities=26% Similarity=0.395 Sum_probs=37.0
Q ss_pred hHHHHHHHHHHHHcCCCEEEccCCCCCCc---------------HHHHHHHHHcCCEEEeeccc
Q psy438 254 ESTIRDLLVSTKEANMNMLRVWGGGVYMS---------------DYFYETCDELGILIWQDMMF 302 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~~~g~~~~~---------------~~~~~~cDe~Gilv~~e~~~ 302 (669)
+++++..-+.+|++|.+++|- |.|-| ...-+.||++|+.|..|.+-
T Consensus 58 ~E~i~~~A~~vk~~Ga~~lRG---gafKPRTSPYsFQGlge~gL~~l~~a~~~~Gl~vvtEvm~ 118 (286)
T COG2876 58 EEQVRETAESVKAAGAKALRG---GAFKPRTSPYSFQGLGEEGLKLLKRAADETGLPVVTEVMD 118 (286)
T ss_pred HHHHHHHHHHHHHcchhhccC---CcCCCCCCcccccccCHHHHHHHHHHHHHcCCeeEEEecC
Confidence 467888888899999999995 45432 35678899999999999983
No 73
>PF05089 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain; InterPro: IPR024733 Alpha-N-acetylglucosaminidase is a lysosomal enzyme that is required for the stepwise degradation of heparan sulphate []. Mutations on the alpha-N-acetylglucosaminidase gene can lead to mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B), characterised by neurological dysfunction but relatively mild somatic manifestations []. Alpha-N-acetylglucosaminidase is composed of three domains. This entry represents the central domain, which has a tim barrel fold [].; PDB: 4A4A_A 2VC9_A 2VCC_A 2VCB_A 2VCA_A.
Probab=30.15 E-value=1.2e+02 Score=33.48 Aligned_cols=117 Identities=16% Similarity=0.149 Sum_probs=61.7
Q ss_pred hHHHHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCC---EEEeecccccC--CCCCChHHHHHHHHHHHHHHHHh
Q psy438 254 ESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGI---LIWQDMMFACN--NYPATPTFLQSVRSEISQTVRRV 328 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gi---lv~~e~~~~~~--~~p~~~~~~~~~~~~~~~~v~r~ 328 (669)
.+.-++.|+.|+++||.-|=.-=.||+|. .|-+.--..-| --|.. +.|. .-|.+|.|.+-.+.=++++++.+
T Consensus 94 ~~Lq~kIl~RmreLGm~PVLPaF~G~VP~-~~~~~~P~a~i~~~~~W~~--f~~~~~L~P~dplF~~i~~~F~~~q~~~y 170 (333)
T PF05089_consen 94 AELQKKILDRMRELGMTPVLPAFAGHVPR-AFKRKYPNANITRQGNWNG--FCRPYFLDPTDPLFAEIAKLFYEEQIKLY 170 (333)
T ss_dssp HHHHHHHHHHHHHHT-EEEEE--S-EE-T-THHHHSTT--EE---EETT--EE--EEE-SS--HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCcccCCCcCCCCCh-HHHhcCCCCEEeeCCCcCC--CCCCceeCCCCchHHHHHHHHHHHHHHhc
Confidence 35677888899999999886422247764 33332222222 22332 2221 13789999998888888888888
Q ss_pred CCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEE
Q psy438 329 QHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYL 386 (669)
Q Consensus 329 rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~ 386 (669)
. --.+..=-..||+..... .+ +|.....+.+.+.+++.||.-..+
T Consensus 171 G-~~~~Y~~D~FnE~~p~~~-------~~-----~~l~~~s~~v~~am~~~dp~AvWv 215 (333)
T PF05089_consen 171 G-TDHIYAADPFNEGGPPSG-------DP-----EYLANVSKAVYKAMQAADPDAVWV 215 (333)
T ss_dssp ----SEEE--TTTTS---TT-------S--------HHHHHHHHHHHHHHH-TT-EEE
T ss_pred C-CCceeCCCccCCCCCCCC-------ch-----HHHHHHHHHHHHHHHhhCCCcEEE
Confidence 7 555666677799876531 11 334455678888999999975544
No 74
>PF08220 HTH_DeoR: DeoR-like helix-turn-helix domain; InterPro: IPR001034 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after the Escherichia coli protein DeoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerisation domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular
Probab=28.40 E-value=63 Score=25.97 Aligned_cols=26 Identities=35% Similarity=0.469 Sum_probs=22.4
Q ss_pred hHHHHHHHHHHHHcCCCEEEccCCCCC
Q psy438 254 ESTIRDLLVSTKEANMNMLRVWGGGVY 280 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~~~g~~~ 280 (669)
+..+++||..+.+.|+ ..|+|||...
T Consensus 28 ~~TiRRDl~~L~~~g~-i~r~~GG~~~ 53 (57)
T PF08220_consen 28 EMTIRRDLNKLEKQGL-IKRTHGGAVL 53 (57)
T ss_pred HHHHHHHHHHHHHCCC-EEEEcCEEEe
Confidence 4589999999999999 9999998543
No 75
>TIGR02631 xylA_Arthro xylose isomerase, Arthrobacter type. This model describes a D-xylose isomerase that is also active as a D-glucose isomerase. It is tetrameric and dependent on a divalent cation Mg2+, Co2+ or Mn2+ as characterized in Arthrobacter. Members of this family differ substantially from the D-xylose isomerases of family TIGR02630.
Probab=28.33 E-value=8.1e+02 Score=27.42 Aligned_cols=83 Identities=10% Similarity=0.109 Sum_probs=48.8
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCC----c-------HHHHHHHHHcCCEEEeeccc-------ccCCC-CCChHHHHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYM----S-------DYFYETCDELGILIWQDMMF-------ACNNY-PATPTFLQSV 317 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~----~-------~~~~~~cDe~Gilv~~e~~~-------~~~~~-p~~~~~~~~~ 317 (669)
....++.++++|+..|=+|....+| . +++-+++++.||-|-.-.+- ....+ ..++...+..
T Consensus 34 ~~e~i~~la~~GfdgVE~~~~dl~P~~~~~~e~~~~~~~lk~~L~~~GL~v~~v~~nl~~~~~~~~g~las~d~~vR~~a 113 (382)
T TIGR02631 34 PVEAVHKLAELGAYGVTFHDDDLIPFGAPPQERDQIVRRFKKALDETGLKVPMVTTNLFSHPVFKDGGFTSNDRSVRRYA 113 (382)
T ss_pred HHHHHHHHHHhCCCEEEecccccCCCCCChhHHHHHHHHHHHHHHHhCCeEEEeeccccCCccccCCCCCCCCHHHHHHH
Confidence 3445778999999999987531122 1 36889999999986542221 00011 1244444444
Q ss_pred HHHHHHHH---HHhCCCcceEeeccc
Q psy438 318 RSEISQTV---RRVQHHPCIAVWAGN 340 (669)
Q Consensus 318 ~~~~~~~v---~r~rnHPSi~~W~~~ 340 (669)
.+.+++.| +.+ .-|.|++|++.
T Consensus 114 i~~~kraId~A~eL-Ga~~v~v~~G~ 138 (382)
T TIGR02631 114 LRKVLRNMDLGAEL-GAETYVVWGGR 138 (382)
T ss_pred HHHHHHHHHHHHHh-CCCEEEEccCC
Confidence 44444443 444 45789999874
No 76
>PF04339 DUF482: Protein of unknown function, DUF482; InterPro: IPR007434 This family contains several proteins of uncharacterised function.
Probab=28.21 E-value=2e+02 Score=32.24 Aligned_cols=87 Identities=17% Similarity=0.253 Sum_probs=63.8
Q ss_pred HHHHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEeecccccCC--CCCChHHHHHHHHHHHHHHHHhCCCc
Q psy438 255 STIRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQDMMFACNN--YPATPTFLQSVRSEISQTVRRVQHHP 332 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~e~~~~~~~--~p~~~~~~~~~~~~~~~~v~r~rnHP 332 (669)
..+....+++++.|+..+-+ .++++...+++.+.|.+......+.|.+ |.+-++|+..+.+.-|..|||-|...
T Consensus 120 ~L~~~~~~~a~~~~~Ss~h~----lF~~~~~~~~l~~~G~~~r~~~qf~W~N~gy~~FDdfLa~Lss~kRk~IRrERr~v 195 (370)
T PF04339_consen 120 ALLQALEQLAEENGLSSWHI----LFPDEEDAAALEEAGFLSRQGVQFHWHNRGYRSFDDFLAALSSRKRKNIRRERRKV 195 (370)
T ss_pred HHHHHHHHHHHHcCCCccee----ecCCHHHHHHHHhCCCceecCCceEEecCCCCCHHHHHHHhchhhHHHHHHHHHHH
Confidence 45666667788899988876 6777878888999999999999887754 44556888887777777776655443
Q ss_pred ----ceEeecccCCcch
Q psy438 333 ----CIAVWAGNNEMEG 345 (669)
Q Consensus 333 ----Si~~W~~~NE~~~ 345 (669)
--+-+.-|.|...
T Consensus 196 ~~~Gi~~~~l~G~~i~~ 212 (370)
T PF04339_consen 196 AEQGIRIRTLTGDEITD 212 (370)
T ss_pred HHcCCEEEEEeCCCCCH
Confidence 3456667777644
No 77
>KOG2233|consensus
Probab=27.00 E-value=3.8e+02 Score=31.27 Aligned_cols=141 Identities=14% Similarity=0.202 Sum_probs=82.7
Q ss_pred hhhhhhhhcCCccccccccccccchHHHHHHHHHHHHcCCCEEEc-cCCCCCCc--HHHHHHHHHcCCEEEeeccc--cc
Q psy438 230 ANMNMLRVWGGGVYMTRYTDMARHESTIRDLLVSTKEANMNMLRV-WGGGVYMS--DYFYETCDELGILIWQDMMF--AC 304 (669)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~l~~~k~~g~N~iR~-~~g~~~~~--~~~~~~cDe~Gilv~~e~~~--~~ 304 (669)
++|-+++.|||.-=. ...+ ..--.-.+.|..+++.||.-|=. .. ||.|. +..+-.+.=.-+--|..|+. +|
T Consensus 132 ~RMGNl~awgGpLs~-aw~~--~ql~LqkrIidrm~~lGmTpvLPaFa-G~VP~al~rlfPesnf~rl~rWn~f~s~~~C 207 (666)
T KOG2233|consen 132 HRMGNLHAWGGPLSP-AWML--NQLLLQKRIIDRMLELGMTPVLPAFA-GHVPDALERLFPESNFTRLPRWNNFTSRYSC 207 (666)
T ss_pred HHhcCccccCCCCCH-HHHH--HHHHHHHHHHHHHHHcCCCccchhhc-cccHHHHHHhCchhceeeccccCCCCcceee
Confidence 577788889876410 0000 00002234566778889887753 33 47753 22333333233333444442 23
Q ss_pred CC--CCCChHHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCC
Q psy438 305 NN--YPATPTFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPT 382 (669)
Q Consensus 305 ~~--~p~~~~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~Dpt 382 (669)
.. -|.+|.|.+-...=++++++.+.----|..=-..||..++.. . .+|...-...+++.+++.|+.
T Consensus 208 ~l~v~P~dplF~eIgs~Flr~~~kefG~~tniy~~DpFNE~~Pp~s-------e-----pey~~staaAiyesm~kvdkn 275 (666)
T KOG2233|consen 208 MLLVSPFDPLFQEIGSTFLRHQIKEFGGVTNIYSADPFNEILPPES-------E-----PEYVKSTAAAIYESMKKVDKN 275 (666)
T ss_pred eEEccCCcchHHHHHHHHHHHHHHHhCCcccccccCcccccCCCCC-------C-----hHHHHHHHHHHHHHHhccCcc
Confidence 21 278899988877788899999885444555455699876541 2 245566667788899999987
Q ss_pred CCEE
Q psy438 383 RPYL 386 (669)
Q Consensus 383 Rp~~ 386 (669)
-...
T Consensus 276 aVWl 279 (666)
T KOG2233|consen 276 AVWL 279 (666)
T ss_pred eEEe
Confidence 5554
No 78
>KOG2836|consensus
Probab=26.76 E-value=1.5e+02 Score=28.86 Aligned_cols=81 Identities=20% Similarity=0.294 Sum_probs=50.3
Q ss_pred ccccccccchHHHHHHHHHHHHcCCCEE-EccCCCCCCcHHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHH
Q psy438 245 TRYTDMARHESTIRDLLVSTKEANMNML-RVWGGGVYMSDYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQ 323 (669)
Q Consensus 245 ~~~~~~~~~e~~~~~~l~~~k~~g~N~i-R~~~g~~~~~~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~ 323 (669)
+||++ +.+...++.+|..|.++| |++- +-+=-..-.+-||-|. |++|.....|. +++.+.+.+ -
T Consensus 23 hnPtn-----aTln~fieELkKygvttvVRVCe-----~TYdt~~lek~GI~Vl-dw~f~dg~ppp-~qvv~~w~~---l 87 (173)
T KOG2836|consen 23 HNPTN-----ATLNKFIEELKKYGVTTVVRVCE-----PTYDTTPLEKEGITVL-DWPFDDGAPPP-NQVVDDWLS---L 87 (173)
T ss_pred cCCCc-----hhHHHHHHHHHhcCCeEEEEecc-----cccCCchhhhcCceEe-ecccccCCCCc-hHHHHHHHH---H
Confidence 46776 468889999999999986 8843 2222234567899885 56665543333 333333321 2
Q ss_pred HHHHhCCCc--ceEeeccc
Q psy438 324 TVRRVQHHP--CIAVWAGN 340 (669)
Q Consensus 324 ~v~r~rnHP--Si~~W~~~ 340 (669)
..-+++-|| ||++-|..
T Consensus 88 ~~~~f~e~p~~cvavhcva 106 (173)
T KOG2836|consen 88 VKTKFREEPGCCVAVHCVA 106 (173)
T ss_pred HHHHHhhCCCCeEEEEeec
Confidence 234677787 67766654
No 79
>PRK10426 alpha-glucosidase; Provisional
Probab=26.50 E-value=1e+03 Score=28.55 Aligned_cols=133 Identities=11% Similarity=0.057 Sum_probs=71.9
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEeec---------ccccCCCC--CChHHHHHHHHHHHHHH
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQDM---------MFACNNYP--ATPTFLQSVRSEISQTV 325 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~e~---------~~~~~~~p--~~~~~~~~~~~~~~~~v 325 (669)
.+..++.+++.|+.++-.---.....-..|+.|.+.|.+|-..- +..+ .+| .+|+..+-+.+.+++.+
T Consensus 271 p~~mi~~L~~~G~k~v~~i~P~v~~~~~~y~e~~~~gy~vk~~~g~~~~~~~~~~~~-~~~Dftnp~ar~Ww~~~~~~~~ 349 (635)
T PRK10426 271 LDSRIKQLNEEGIQFLGYINPYLASDGDLCEEAAEKGYLAKDADGGDYLVEFGEFYA-GVVDLTNPEAYEWFKEVIKKNM 349 (635)
T ss_pred HHHHHHHHHHCCCEEEEEEcCccCCCCHHHHHHHHCCcEEECCCCCEEEeEecCCCc-eeecCCCHHHHHHHHHHHHHHH
Confidence 45678889999999886521111112367899999999985421 1111 122 25555555555444333
Q ss_pred HHhCCCcceEeecccCCcchhhhhhhccccChhhHHHHHHHHHHHHHHHHHHhhCC-CCCEEec-CCCCCc
Q psy438 326 RRVQHHPCIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDP-TRPYLTS-SPTNGI 394 (669)
Q Consensus 326 ~r~rnHPSi~~W~~~NE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~Dp-tRp~~~~-S~~~~~ 394 (669)
.. ...=..|.=.||..+... .............-|..++.+...+.+++..+ .||++.+ |.+.|.
T Consensus 350 ~~---~Gvdg~w~D~~E~~p~d~-~~~~g~~~~~~hN~Y~~l~~~~~~e~~~~~~~~~r~f~ltRsg~aGs 416 (635)
T PRK10426 350 IG---LGCSGWMADFGEYLPTDA-YLHNGVSAEIMHNAWPALWAKCNYEALEETGKLGEILFFMRAGYTGS 416 (635)
T ss_pred hh---cCCCEEeeeCCCCCCCcc-eeeCCCCHHHhccHHHHHHHHHHHHHHHHhcCCCCcEEEEccccCCc
Confidence 33 334456777888532210 00011111222334556677777777777766 5888765 444443
No 80
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway. The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=25.55 E-value=8.5e+02 Score=26.30 Aligned_cols=115 Identities=19% Similarity=0.277 Sum_probs=71.7
Q ss_pred HHHHHHHHHHHHcCCCEEEcc-------------------------------------CCCCCCcH---HHHHHHHHcCC
Q psy438 255 STIRDLLVSTKEANMNMLRVW-------------------------------------GGGVYMSD---YFYETCDELGI 294 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~-------------------------------------~g~~~~~~---~~~~~cDe~Gi 294 (669)
+.+++.|+.|...++|.+=.| .++.|..+ ++.+.|.++||
T Consensus 17 ~~ik~~id~ma~~K~N~lhlHltD~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~~~~~~~~YT~~di~eiv~yA~~rgI 96 (326)
T cd06564 17 DFLKDIIKTMSWYKMNDLQLHLNDNLIFNLDDMSTTVNNATYASDDVKSGNNYYNLTANDGYYTKEEFKELIAYAKDRGV 96 (326)
T ss_pred HHHHHHHHHHHHcCCceEEEeecCCcccccCCCchhhhhhhhhccccccccccCCCCCCCCcccHHHHHHHHHHHHHcCC
Confidence 478899999999999988542 23355542 68899999999
Q ss_pred EEEeecccccCC------------------------CCCCh---HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhh
Q psy438 295 LIWQDMMFACNN------------------------YPATP---TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGAT 347 (669)
Q Consensus 295 lv~~e~~~~~~~------------------------~p~~~---~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~ 347 (669)
-|+.|+....+. -|.++ +|++.+.+|+. ..... +|=.+=.|+-|.....
T Consensus 97 ~vIPEID~PGH~~a~~~~~pel~~~~~~~~~~~~~l~~~~~~t~~f~~~l~~E~~---~~f~~-~~~~~HiGgDE~~~~~ 172 (326)
T cd06564 97 NIIPEIDSPGHSLAFTKAMPELGLKNPFSKYDKDTLDISNPEAVKFVKALFDEYL---DGFNP-KSDTVHIGADEYAGDA 172 (326)
T ss_pred eEeccCCCcHHHHHHHHhhHHhcCCCcccCCCcccccCCCHHHHHHHHHHHHHHH---HhcCC-CCCEEEeccccccccC
Confidence 999988653220 12334 34444444433 32211 2778888889986521
Q ss_pred hhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEE
Q psy438 348 IQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYL 386 (669)
Q Consensus 348 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~ 386 (669)
...+. ...|++.+.+.+++.. .++..
T Consensus 173 -------~~~~~-----~~~f~~~~~~~v~~~g-k~~~~ 198 (326)
T cd06564 173 -------GYAEA-----FRAYVNDLAKYVKDKG-KTPRV 198 (326)
T ss_pred -------ccHHH-----HHHHHHHHHHHHHHcC-CeEEE
Confidence 11122 2357888899998873 34443
No 81
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=25.43 E-value=7.6e+02 Score=27.12 Aligned_cols=124 Identities=18% Similarity=0.301 Sum_probs=75.2
Q ss_pred HHHHHHHHHHHHcCCCEEEccC----------------------------------------CCCCCcH---HHHHHHHH
Q psy438 255 STIRDLLVSTKEANMNMLRVWG----------------------------------------GGVYMSD---YFYETCDE 291 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~----------------------------------------g~~~~~~---~~~~~cDe 291 (669)
+.+++.|+.|....+|.+=.|- |+.|..+ ++.+.|.+
T Consensus 18 ~~ik~~Id~ma~~K~N~lhlHltDdq~~rle~~~~P~Lt~~ga~~~~~~~~~~~~~~~~~~~~~~YT~~di~eiv~yA~~ 97 (357)
T cd06563 18 DEVKRFIDLMALYKLNVFHWHLTDDQGWRIEIKKYPKLTEVGAWRGPTEIGLPQGGGDGTPYGGFYTQEEIREIVAYAAE 97 (357)
T ss_pred HHHHHHHHHHHHhccceEEEeeecCCCceecccCcchhhhcccccCcccccccccccCCCccCceECHHHHHHHHHHHHH
Confidence 4788999999999999987651 2455542 68899999
Q ss_pred cCCEEEeecccccCC------C------------------------CCCh---HHHHHHHHHHHHHHHHhCCCcceEeec
Q psy438 292 LGILIWQDMMFACNN------Y------------------------PATP---TFLQSVRSEISQTVRRVQHHPCIAVWA 338 (669)
Q Consensus 292 ~Gilv~~e~~~~~~~------~------------------------p~~~---~~~~~~~~~~~~~v~r~rnHPSi~~W~ 338 (669)
+||-|+.|+....+. | |.++ +|++.+.+|+.++ +||=.+=.
T Consensus 98 rgI~VIPEID~PGH~~a~l~~~pel~~~~~~~~~~~~~~~~~~~L~~~~~~t~~f~~~ll~E~~~l------F~~~~iHi 171 (357)
T cd06563 98 RGITVIPEIDMPGHALAALAAYPELGCTGGPGSVVSVQGVVSNVLCPGKPETYTFLEDVLDEVAEL------FPSPYIHI 171 (357)
T ss_pred cCCEEEEecCCchhHHHHHHhCccccCCCCCCccccccCcCCCccCCCChhHHHHHHHHHHHHHHh------CCCCeEEE
Confidence 999999988542221 1 1233 3555555554443 57888888
Q ss_pred ccCCcchhh------hhhhccccChhhHHHHHHHHHHHHHHHHHHhhCCCCCEE
Q psy438 339 GNNEMEGAT------IQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYL 386 (669)
Q Consensus 339 ~~NE~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~DptRp~~ 386 (669)
|+-|..... ...|+...+.... .+....|++.+.+.+++.. -+++.
T Consensus 172 GgDE~~~~~w~~~~~~~~~~~~~g~~~~-~~l~~~f~~~~~~~v~~~G-~~~i~ 223 (357)
T cd06563 172 GGDEVPKGQWEKSPACQARMKEEGLKDE-HELQSYFIKRVEKILASKG-KKMIG 223 (357)
T ss_pred eccccCCcccccCHHHHHHHHHcCCCCH-HHHHHHHHHHHHHHHHHcC-CEEEE
Confidence 889985421 1112211111111 1223457888888888764 34443
No 82
>cd06568 GH20_SpHex_like A subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the N-acetylhexosaminidase from Streptomyces plicatus (SpHex). SpHex catalyzes the hydrolysis of N-acetyl-beta-hexosaminides. An Asp residue within the active site plays a critical role in substrate-assisted catalysis by orienting the 2-acetamido group and stabilizing the transition state. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Proteins belonging to this subgroup lack the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases.
Probab=25.06 E-value=5.3e+02 Score=28.14 Aligned_cols=106 Identities=20% Similarity=0.364 Sum_probs=70.5
Q ss_pred HHHHHHHHHHHHcCCCEEEccC-----------------------------CCCCCcH---HHHHHHHHcCCEEEeeccc
Q psy438 255 STIRDLLVSTKEANMNMLRVWG-----------------------------GGVYMSD---YFYETCDELGILIWQDMMF 302 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~-----------------------------g~~~~~~---~~~~~cDe~Gilv~~e~~~ 302 (669)
+.+++.|+.|....+|.+=.|- |+.|..+ ++.+.|.+.||.|+.|+..
T Consensus 18 ~~lk~~id~ma~~KlN~lhlHLtD~~~~rle~~~~P~lt~~ga~~~~~~~~~~~YT~~di~elv~yA~~rgI~vIPEiD~ 97 (329)
T cd06568 18 AEVKRYIDLLALYKLNVLHLHLTDDQGWRIEIKSWPKLTEIGGSTEVGGGPGGYYTQEDYKDIVAYAAERHITVVPEIDM 97 (329)
T ss_pred HHHHHHHHHHHHhCCcEEEEEeecCCcceeeecCcccccccccccccCCCCCCcCCHHHHHHHHHHHHHcCCEEEEecCC
Confidence 4788999999999999884332 2356543 6888899999999998864
Q ss_pred ccCC------C------------------------CCCh---HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhh
Q psy438 303 ACNN------Y------------------------PATP---TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQ 349 (669)
Q Consensus 303 ~~~~------~------------------------p~~~---~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~ 349 (669)
..+. | |.++ +|++.+.+|+.++ .||=++=.||-|.....
T Consensus 98 PGH~~a~~~~~p~l~~~~~~~~~~~~~~~~~~~l~~~~~~t~~fl~~v~~E~~~~------f~~~~iHiGgDE~~~~~-- 169 (329)
T cd06568 98 PGHTNAALAAYPELNCDGKAKPLYTGIEVGFSSLDVDKPTTYEFVDDVFRELAAL------TPGPYIHIGGDEAHSTP-- 169 (329)
T ss_pred cHHHHHHHHhChhhccCCCCCccccccCCCCcccCCCCHHHHHHHHHHHHHHHHh------CCCCeEEEecccCCCCc--
Confidence 2210 1 1233 4566666555543 47878888889986422
Q ss_pred hhccccChhhHHHHHHHHHHHHHHHHHHhhC
Q psy438 350 KWYIRENPELYYKEYAELYVNTLKPIVLQYD 380 (669)
Q Consensus 350 ~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~D 380 (669)
.+. | ..|++.+.+.+++..
T Consensus 170 -------~~~----~-~~f~~~~~~~v~~~G 188 (329)
T cd06568 170 -------HDD----Y-AYFVNRVRAIVAKYG 188 (329)
T ss_pred -------hHH----H-HHHHHHHHHHHHHCC
Confidence 111 2 357888888888774
No 83
>smart00642 Aamy Alpha-amylase domain.
Probab=23.30 E-value=1.4e+02 Score=29.22 Aligned_cols=48 Identities=15% Similarity=0.123 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHcCCCEEEccC----------CCCCC--------------c--HHHHHHHHHcCCEEEeecccc
Q psy438 256 TIRDLLVSTKEANMNMLRVWG----------GGVYM--------------S--DYFYETCDELGILIWQDMMFA 303 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~----------g~~~~--------------~--~~~~~~cDe~Gilv~~e~~~~ 303 (669)
.+...|..+|++|+|+|-+-. +.-|. . ..+.+.|.+.||-|+.|+.+.
T Consensus 20 gi~~~l~yl~~lG~~~I~l~Pi~~~~~~~~~~~gY~~~d~~~i~~~~Gt~~d~~~lv~~~h~~Gi~vilD~V~N 93 (166)
T smart00642 20 GIIEKLDYLKDLGVTAIWLSPIFESPQGYPSYHGYDISDYKQIDPRFGTMEDFKELVDAAHARGIKVILDVVIN 93 (166)
T ss_pred HHHHHHHHHHHCCCCEEEECcceeCCCCCCCCCCcCccccCCCCcccCCHHHHHHHHHHHHHCCCEEEEEECCC
Confidence 455557789999999994311 00010 0 357778888999999998764
No 84
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20). The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits. Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff. Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in
Probab=23.29 E-value=8.7e+02 Score=26.63 Aligned_cols=117 Identities=20% Similarity=0.354 Sum_probs=72.4
Q ss_pred HHHHHHHHHHHHcCCCEEEcc--------------------C----CCCCCcH---HHHHHHHHcCCEEEeecccccCC-
Q psy438 255 STIRDLLVSTKEANMNMLRVW--------------------G----GGVYMSD---YFYETCDELGILIWQDMMFACNN- 306 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~--------------------~----g~~~~~~---~~~~~cDe~Gilv~~e~~~~~~~- 306 (669)
+.+++.|+.|....+|.+=.| | ++.|..+ +..+.|.++||.|+.|+....+.
T Consensus 18 ~~ik~~Id~ma~~KlN~lh~HltDd~~~rle~~~~P~Lt~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPEID~PGH~~ 97 (348)
T cd06562 18 DSIKRTIDAMAYNKLNVLHWHITDSQSFPLESPSYPELSKKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPEIDTPGHTG 97 (348)
T ss_pred HHHHHHHHHHHHhCCcEEEEeEEcCCCceEeeCCCchhhhccCcCCCceECHHHHHHHHHHHHHcCCEEEEeccCchhhH
Confidence 478999999999999998665 1 2345533 67889999999999988542211
Q ss_pred -----C------------------------CCCh---HHHHHHHHHHHHHHHHhCCCcceEeecccCCcchhhhhhhccc
Q psy438 307 -----Y------------------------PATP---TFLQSVRSEISQTVRRVQHHPCIAVWAGNNEMEGATIQKWYIR 354 (669)
Q Consensus 307 -----~------------------------p~~~---~~~~~~~~~~~~~v~r~rnHPSi~~W~~~NE~~~~~~~~~~~~ 354 (669)
+ |.++ +|++.+.+|+.++ +||=++=.||-|.... .|...
T Consensus 98 a~~~~~p~l~~~~~~~~~~~~~~~~~~~L~~~~~~t~~fl~~vl~E~~~l------F~~~~iHiGgDE~~~~---~w~~~ 168 (348)
T cd06562 98 SWGQGYPELLTGCYAVWRKYCPEPPCGQLNPTNPKTYDFLKTLFKEVSEL------FPDKYFHLGGDEVNFN---CWNSN 168 (348)
T ss_pred HHHHhChhhhCCCCccccccccCCCCccccCCChhHHHHHHHHHHHHHHh------cCCcceEeecCCCCCC---cccCC
Confidence 1 1233 4555555554443 4688888888997542 13211
Q ss_pred cChhhH--------HHHHHHHHHHHHHHHHHhhC
Q psy438 355 ENPELY--------YKEYAELYVNTLKPIVLQYD 380 (669)
Q Consensus 355 ~~~~~~--------~~~~~~~~~~~l~~~v~~~D 380 (669)
...+.+ ..+....|++.+.+.+++.-
T Consensus 169 p~~~~~m~~~g~~~~~~l~~~f~~~~~~~l~~~G 202 (348)
T cd06562 169 PEIQKFMKKNNGTDYSDLESYFIQRALDIVRSLG 202 (348)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHcC
Confidence 000000 11233458888889998874
No 85
>PF14386 DUF4417: Domain of unknown function (DUF4417)
Probab=23.22 E-value=3.3e+02 Score=27.74 Aligned_cols=76 Identities=11% Similarity=0.039 Sum_probs=47.8
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCCc-HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHHHHHHhCCCcceE
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYMS-DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRRVQHHPCIA 335 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~~-~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v~r~rnHPSi~ 335 (669)
-+.....+.+.|++.|=+-.++.-.+ +..++--.+.|+..+.-. +.-.+.+..+.+.+-+.+|++|+ +|..+
T Consensus 99 ~r~~g~~~q~~Gi~VIP~v~W~~~~s~~~~~~gi~~~~ivaist~-----g~~~~~~~~~~f~~Gl~em~~rl--~P~~i 171 (200)
T PF14386_consen 99 SRWLGAYWQSNGIKVIPNVSWSDKRSFDFCFDGIPKGSIVAISTN-----GCINNKEDKKLFLDGLREMLKRL--RPKHI 171 (200)
T ss_pred HHHHHHHHHHCCCeEcceEEecCcchHHHHHhhcccCCEEEEEEe-----cccCCHHHHHHHHHHHHHHHhcc--CCCeE
Confidence 34455667899999985532223333 455555566677654322 22346677778888889999999 67666
Q ss_pred eecc
Q psy438 336 VWAG 339 (669)
Q Consensus 336 ~W~~ 339 (669)
+-.|
T Consensus 172 lvyG 175 (200)
T PF14386_consen 172 LVYG 175 (200)
T ss_pred EEEC
Confidence 5544
No 86
>COG5309 Exo-beta-1,3-glucanase [Carbohydrate transport and metabolism]
Probab=22.95 E-value=2e+02 Score=30.96 Aligned_cols=121 Identities=12% Similarity=0.079 Sum_probs=79.1
Q ss_pred HHHHHHHHHHHhcccccccCCCCCCCCCeeEeeccccCccceEehhhhhhhhcCCCCChHHHHHHHHHHHHHHHHhcCCc
Q psy438 508 GAIKTITEQMRRDKGVLREDGSGHNMGALYWQLNDTCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRRVQHHP 587 (669)
Q Consensus 508 e~~R~~iE~~Rr~~n~~R~~g~~~~sG~LyWq~nd~~D~~GilvW~E~~~~~~~yp~~~~f~~~~~~El~e~IrR~rNHP 587 (669)
+.++..+|...+.-+++|+.| ..|... --+..++.+.|+.+...+. .++....++.+++...++...--+
T Consensus 63 ~~~~sDLe~l~~~t~~IR~Y~-sDCn~l--e~v~pAa~~~g~kv~lGiw-------~tdd~~~~~~~til~ay~~~~~~d 132 (305)
T COG5309 63 DQVASDLELLASYTHSIRTYG-SDCNTL--ENVLPAAEASGFKVFLGIW-------PTDDIHDAVEKTILSAYLPYNGWD 132 (305)
T ss_pred HHHHhHHHHhccCCceEEEee-ccchhh--hhhHHHHHhcCceEEEEEe-------eccchhhhHHHHHHHHHhccCCCC
Confidence 345677788888766888876 234321 1123566666755543322 245566677778888999999999
Q ss_pred eEEEEcCCCCCchhhhhhhhhccCchhhhhhHHHHHHHHHHHHHHHhCCCCCeecCCCCCCc
Q psy438 588 CIAVWAGNNEMEGATIQKWYIRENPELYYKEYAELYVNTLKPIVLQYDPTRPYLTSSPTNGI 649 (669)
Q Consensus 588 SIvlWs~gNE~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~vk~~DPTRp~~~sS~~~g~ 649 (669)
+|..=.+|||.-... + ......-.+....+.++++.+-+-||...-+|+-+
T Consensus 133 ~v~~v~VGnEal~r~----------~-~tasql~~~I~~vrsav~~agy~gpV~T~dsw~~~ 183 (305)
T COG5309 133 DVTTVTVGNEALNRN----------D-LTASQLIEYIDDVRSAVKEAGYDGPVTTVDSWNVV 183 (305)
T ss_pred ceEEEEechhhhhcC----------C-CCHHHHHHHHHHHHHHHHhcCCCCceeecccceee
Confidence 999999999985321 1 11122223334477888899999999888877743
No 87
>PF05706 CDKN3: Cyclin-dependent kinase inhibitor 3 (CDKN3); InterPro: IPR022778 This entry represents a domain found in cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner [], []. This domain is also found in MAP kinase phosphatase and esterases. This entry contains both eukaryotic and bacterial proteins.; GO: 0004721 phosphoprotein phosphatase activity, 0004725 protein tyrosine phosphatase activity; PDB: 1FQ1_A 1FPZ_F.
Probab=22.87 E-value=3.6e+02 Score=27.02 Aligned_cols=78 Identities=12% Similarity=0.117 Sum_probs=42.2
Q ss_pred HHHHHHHHHHHcCCCEEEccCCCCCCc--------HHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHHHHHH
Q psy438 256 TIRDLLVSTKEANMNMLRVWGGGVYMS--------DYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRR 327 (669)
Q Consensus 256 ~~~~~l~~~k~~g~N~iR~~~g~~~~~--------~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v~r 327 (669)
.+..+|+.+|+.|+..|=+ -.+. +.+.+++.+.||.+ .-+|+.-..-|+ + +.+.+-+.++...
T Consensus 59 dL~~DL~~Lk~~G~~~Vvt----l~~~~EL~~l~Vp~L~~~~~~~Gi~~-~h~PI~D~~aPd---~-~~~~~i~~eL~~~ 129 (168)
T PF05706_consen 59 DLQADLERLKDWGAQDVVT----LLTDHELARLGVPDLGEAAQARGIAW-HHLPIPDGSAPD---F-AAAWQILEELAAR 129 (168)
T ss_dssp -HHHHHHHHHHTT--EEEE-----S-HHHHHHTT-TTHHHHHHHTT-EE-EE----TTS------H-HHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHCCCCEEEE----eCcHHHHHHcCCccHHHHHHHcCCEE-EecCccCCCCCC---H-HHHHHHHHHHHHH
Confidence 4778899999999999955 2332 35789999999855 578876544453 2 2233344556667
Q ss_pred hCCCcceEeecccCC
Q psy438 328 VQHHPCIAVWAGNNE 342 (669)
Q Consensus 328 ~rnHPSi~~W~~~NE 342 (669)
+++.--|++=|-|==
T Consensus 130 L~~g~~V~vHC~GGl 144 (168)
T PF05706_consen 130 LENGRKVLVHCRGGL 144 (168)
T ss_dssp HHTT--EEEE-SSSS
T ss_pred HHcCCEEEEECCCCC
Confidence 777777887776543
No 88
>smart00518 AP2Ec AP endonuclease family 2. These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites
Probab=22.16 E-value=2.7e+02 Score=28.74 Aligned_cols=81 Identities=14% Similarity=0.211 Sum_probs=0.0
Q ss_pred cccchHHHHHHHHHHHHcCCCEEEccCCCCCCc----------HHHHHHHH-HcCCEEEeecccccCCCCCChHHHHHHH
Q psy438 250 MARHESTIRDLLVSTKEANMNMLRVWGGGVYMS----------DYFYETCD-ELGILIWQDMMFACNNYPATPTFLQSVR 318 (669)
Q Consensus 250 ~~~~e~~~~~~l~~~k~~g~N~iR~~~g~~~~~----------~~~~~~cD-e~Gilv~~e~~~~~~~~p~~~~~~~~~~ 318 (669)
++..-+.+++.++.++++|+..|.+|.|..... +.+-++|+ +.|+.+.-|.. +......-.-.
T Consensus 79 r~~~~~~l~~~i~~A~~lGa~~vv~h~g~~~~~~~e~~~~~~~~~l~~l~~~~~gv~l~lEn~------~~~~~~~~~~~ 152 (273)
T smart00518 79 VEKSIERLIDEIKRCEELGIKALVFHPGSYLKQSKEEALNRIIESLNEVIDETKGVVILLETT------AGKGSQIGSTF 152 (273)
T ss_pred HHHHHHHHHHHHHHHHHcCCCEEEEccccccCCCHHHHHHHHHHHHHHHHhccCCcEEEEecc------CCCCCccCCCH
Q ss_pred HHHHHHHHHhCCCcceEe
Q psy438 319 SEISQTVRRVQHHPCIAV 336 (669)
Q Consensus 319 ~~~~~~v~r~rnHPSi~~ 336 (669)
+++.++++.....|.|-+
T Consensus 153 ~~~~~ll~~v~~~~~~g~ 170 (273)
T smart00518 153 EDLKEIIDLIKELDRIGV 170 (273)
T ss_pred HHHHHHHHhcCCCCCeEE
No 89
>PF06230 DUF1009: Protein of unknown function (DUF1009); InterPro: IPR010415 This is a family of uncharacterised bacterial proteins.
Probab=21.92 E-value=1.1e+02 Score=31.60 Aligned_cols=38 Identities=24% Similarity=0.293 Sum_probs=31.6
Q ss_pred HHHHHHHcCCCEEEccCCCCCC--cHHHHHHHHHcCCEEE
Q psy438 260 LLVSTKEANMNMLRVWGGGVYM--SDYFYETCDELGILIW 297 (669)
Q Consensus 260 ~l~~~k~~g~N~iR~~~g~~~~--~~~~~~~cDe~Gilv~ 297 (669)
-++.++++|+..|=+-.|...- .+++.++||++||.||
T Consensus 174 Tv~~~~~ag~~~laveAg~tl~ld~~~~i~~Ad~~gi~i~ 213 (214)
T PF06230_consen 174 TVENAAEAGLAGLAVEAGKTLILDREEVIALADKAGIFIV 213 (214)
T ss_pred HHHHHHHcCCeEEEEecCcEEEecHHHHHHHHHHcCCEEe
Confidence 4778999999999887764432 5799999999999986
No 90
>cd06570 GH20_chitobiase-like_1 A functionally uncharacterized subgroup of the Glycosyl hydrolase family 20 (GH20) catalytic domain found in proteins similar to the chitobiase of Serratia marcescens, a beta-N-1,4-acetylhexosaminidase that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This subgroup lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=21.68 E-value=7.7e+02 Score=26.74 Aligned_cols=51 Identities=14% Similarity=0.185 Sum_probs=40.2
Q ss_pred HHHHHHHHHHHHcCCCEEEccC----------------------CCCCCcH---HHHHHHHHcCCEEEeecccccC
Q psy438 255 STIRDLLVSTKEANMNMLRVWG----------------------GGVYMSD---YFYETCDELGILIWQDMMFACN 305 (669)
Q Consensus 255 ~~~~~~l~~~k~~g~N~iR~~~----------------------g~~~~~~---~~~~~cDe~Gilv~~e~~~~~~ 305 (669)
+.+++.|+.|....+|.+=.|- |+.|..+ +..+.|.++||-|+.|+....+
T Consensus 18 ~~ik~~Id~ma~~KlN~lh~HltDd~~~rle~~~~P~lt~~g~~~~~yT~~di~elv~yA~~rgI~vIPEId~PGH 93 (311)
T cd06570 18 AVIKRQLDAMASVKLNVFHWHLTDDQGFRIESKKYPKLQQKASDGLYYTQEQIREVVAYARDRGIRVVPEIDVPGH 93 (311)
T ss_pred HHHHHHHHHHHHhCCeEEEEEEecCCCceeecCCCccccccCCCCCccCHHHHHHHHHHHHHcCCEEEEeecCccc
Confidence 4789999999999999886651 2456643 6888999999999999876443
No 91
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=21.37 E-value=2.6e+02 Score=22.55 Aligned_cols=45 Identities=18% Similarity=0.235 Sum_probs=33.0
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCC-CcHHHHHHHHHcCCEEEeecc
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVY-MSDYFYETCDELGILIWQDMM 301 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~-~~~~~~~~cDe~Gilv~~e~~ 301 (669)
.++.++.+++.|++.+=+--.+.. ....++++|.+.||-++..+.
T Consensus 17 ~~~~~~~a~~~g~~~v~iTDh~~~~~~~~~~~~~~~~gi~~i~G~E 62 (67)
T smart00481 17 PEELVKRAKELGLKAIAITDHGNLFGAVEFYKAAKKAGIKPIIGLE 62 (67)
T ss_pred HHHHHHHHHHcCCCEEEEeeCCcccCHHHHHHHHHHcCCeEEEEEE
Confidence 666788899999999966321111 136899999999999876544
No 92
>PRK13397 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=21.33 E-value=2e+02 Score=30.45 Aligned_cols=45 Identities=33% Similarity=0.542 Sum_probs=34.5
Q ss_pred hHHHHHHHHHHHHcCCCEEEccCCCCCCc---------------HHHHHHHHHcCCEEEeecc
Q psy438 254 ESTIRDLLVSTKEANMNMLRVWGGGVYMS---------------DYFYETCDELGILIWQDMM 301 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~~~g~~~~~---------------~~~~~~cDe~Gilv~~e~~ 301 (669)
|+.+...-+.+|++|++++|- |.+-+ +.+.+.|+++||.+..|..
T Consensus 28 ~e~~~~~a~~~~~~g~~~~r~---g~~kpRts~~sf~G~G~~gl~~L~~~~~~~Gl~~~Tev~ 87 (250)
T PRK13397 28 YDHIRLAASSAKKLGYNYFRG---GAYKPRTSAASFQGLGLQGIRYLHEVCQEFGLLSVSEIM 87 (250)
T ss_pred HHHHHHHHHHHHHcCCCEEEe---cccCCCCCCcccCCCCHHHHHHHHHHHHHcCCCEEEeeC
Confidence 346666677799999999985 33321 4789999999999988765
No 93
>PF03162 Y_phosphatase2: Tyrosine phosphatase family; InterPro: IPR004861 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry represents protein-tyrosine phosphatases predominantly from fungi, plants and bacteria, several of which are putative enzymes. These proteins are closely related to the Y-phosphatase and DSPc families. This entry includes the PTPase SIW14 from Saccharomyces cerevisiae (Baker's yeast), which plays a role in actin filament organisation and endocytosis.; PDB: 2Q47_A 1XRI_A.
Probab=21.19 E-value=3.7e+02 Score=26.36 Aligned_cols=81 Identities=12% Similarity=0.076 Sum_probs=41.3
Q ss_pred HHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEeecccccCCCCCChHHHHHHHHHHHHHHHHhCCCcceEe-e
Q psy438 259 DLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQDMMFACNNYPATPTFLQSVRSEISQTVRRVQHHPCIAV-W 337 (669)
Q Consensus 259 ~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~e~~~~~~~~p~~~~~~~~~~~~~~~~v~r~rnHPSi~~-W 337 (669)
..+..++.+|+.+|=.-.+ .-+++.+++.|++.||-+..= +.....-| .....+....++-+.|..-+|||-++. +
T Consensus 23 ~n~~fL~~L~LKTII~L~~-e~~~~~~~~f~~~~~I~l~~~-~~~~~~~~-~~~~~~~~v~~aL~~ild~~n~PvLiHC~ 99 (164)
T PF03162_consen 23 ANFPFLERLGLKTIINLRP-EPPSQDFLEFAEENGIKLIHI-PMSSSKDP-WVPISEEQVAEALEIILDPRNYPVLIHCN 99 (164)
T ss_dssp HHHHHHHHHT-SEEEE--S-S---HHHHHHHHHTT-EEEE--------GG-G----HHHHHHHHHHHH-GGG-SEEEE-S
T ss_pred hhHHHHHHCCCceEEEecC-CCCCHHHHHHHhhcCceEEEe-ccccccCc-cccCCHHHHHHHHHHHhCCCCCCEEEEeC
Confidence 3466788999999965442 334678999999999977552 21111000 112234444555567777899998764 3
Q ss_pred cccCC
Q psy438 338 AGNNE 342 (669)
Q Consensus 338 ~~~NE 342 (669)
.|.+-
T Consensus 100 ~G~~r 104 (164)
T PF03162_consen 100 HGKDR 104 (164)
T ss_dssp SSSSH
T ss_pred CCCcc
Confidence 44444
No 94
>TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA.
Probab=21.13 E-value=8.8e+02 Score=24.63 Aligned_cols=42 Identities=5% Similarity=-0.058 Sum_probs=32.5
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEe
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQ 298 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~ 298 (669)
+...++.++++|++.|=++......-+++.++|+++||-|..
T Consensus 16 l~e~~~~~~e~G~~~vEl~~~~~~~~~~l~~~l~~~gl~v~~ 57 (254)
T TIGR03234 16 FLERFAAAAQAGFTGVEYLFPYDWDAEALKARLAAAGLEQVL 57 (254)
T ss_pred HHHHHHHHHHcCCCEEEecCCccCCHHHHHHHHHHcCCeEEE
Confidence 666788899999999988652112236799999999998764
No 95
>COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane]
Probab=21.08 E-value=1.2e+02 Score=31.27 Aligned_cols=42 Identities=19% Similarity=0.314 Sum_probs=37.1
Q ss_pred hHHHHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEE
Q psy438 254 ESTIRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIW 297 (669)
Q Consensus 254 e~~~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~ 297 (669)
++.+++.++.++++|++-|=+-.| |..+.|.-|-|++++-+.
T Consensus 31 ~plIErqI~~L~e~gI~dI~IVvG--YlkE~FeYLkdKy~vtLv 72 (231)
T COG4750 31 EPLIERQIEQLREAGIDDITIVVG--YLKEQFEYLKDKYDVTLV 72 (231)
T ss_pred cccHHHHHHHHHHCCCceEEEEee--ehHHHHHHHHHhcCeEEE
Confidence 347999999999999999988776 889999999999998763
No 96
>PRK09997 hydroxypyruvate isomerase; Provisional
Probab=20.46 E-value=3.6e+02 Score=27.72 Aligned_cols=42 Identities=2% Similarity=-0.180 Sum_probs=32.7
Q ss_pred HHHHHHHHHHcCCCEEEccCCCCCCcHHHHHHHHHcCCEEEe
Q psy438 257 IRDLLVSTKEANMNMLRVWGGGVYMSDYFYETCDELGILIWQ 298 (669)
Q Consensus 257 ~~~~l~~~k~~g~N~iR~~~g~~~~~~~~~~~cDe~Gilv~~ 298 (669)
+...++.++++|+..|=+++-.....+.+-++++++||-+..
T Consensus 17 l~~~l~~~a~~Gf~~VEl~~~~~~~~~~~~~~l~~~gl~~~~ 58 (258)
T PRK09997 17 FLARFEKAAQCGFRGVEFMFPYDYDIEELKQVLASNKLEHTL 58 (258)
T ss_pred HHHHHHHHHHhCCCEEEEcCCCCCCHHHHHHHHHHcCCcEEE
Confidence 556688899999999998762112346899999999998853
Done!