Query         005404
Match_columns 698
No_of_seqs    308 out of 1475
Neff          6.7 
Searched_HMMs 46136
Date          Thu Mar 28 22:54:52 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/005404.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/005404hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN03059 beta-galactosidase; P 100.0  2E-168  5E-173 1437.6  62.6  686    1-690   154-840 (840)
  2 KOG0496 Beta-galactosidase [Ca 100.0  3E-107  7E-112  899.6  28.5  502    1-586   144-648 (649)
  3 PF01301 Glyco_hydro_35:  Glyco 100.0 3.7E-45 8.1E-50  391.5   9.7  179    1-188   119-318 (319)
  4 KOG4729 Galactoside-binding le  99.8   1E-21 2.2E-26  197.4   8.2   92  604-696    40-136 (265)
  5 PF02140 Gal_Lectin:  Galactose  99.8 2.1E-20 4.7E-25  160.7   5.5   76  612-689     1-80  (80)
  6 PF13364 BetaGal_dom4_5:  Beta-  99.0 1.3E-09 2.9E-14   99.6   7.5   68  467-560    33-104 (111)
  7 COG1874 LacA Beta-galactosidas  98.8 4.8E-09   1E-13  121.3   5.8  164    3-175   132-333 (673)
  8 PF13364 BetaGal_dom4_5:  Beta-  98.5 4.1E-07   9E-12   83.2   9.8   84  308-398    24-110 (111)
  9 PF02837 Glyco_hydro_2_N:  Glyc  98.3 6.7E-06 1.5E-10   80.0  11.0   99  315-419    64-164 (167)
 10 PRK10150 beta-D-glucuronidase;  96.7  0.0079 1.7E-07   70.8  11.1  100  316-421    62-179 (604)
 11 PF02837 Glyco_hydro_2_N:  Glyc  96.7  0.0031 6.6E-08   61.3   6.4   65  468-559    67-135 (167)
 12 PRK10340 ebgA cryptic beta-D-g  96.3   0.015 3.4E-07   72.2  10.5   97  319-421   109-206 (1021)
 13 PRK09525 lacZ beta-D-galactosi  95.9   0.032   7E-07   69.4  10.4   95  318-421   119-218 (1027)
 14 PF02449 Glyco_hydro_42:  Beta-  95.6   0.073 1.6E-06   58.9  10.9   87   93-191   286-373 (374)
 15 PRK10340 ebgA cryptic beta-D-g  93.9    0.13 2.8E-06   64.3   7.8   64  469-559   109-175 (1021)
 16 PRK10150 beta-D-glucuronidase;  92.9    0.24 5.1E-06   58.5   7.5   41  469-509    65-108 (604)
 17 PRK09525 lacZ beta-D-galactosi  92.5     0.2 4.3E-06   62.6   6.5   41  468-508   119-163 (1027)
 18 PF06832 BiPBP_C:  Penicillin-B  73.3     4.8  0.0001   35.0   4.1   50  342-399    34-84  (89)
 19 PF14683 CBM-like:  Polysacchar  70.9     4.4 9.4E-05   39.9   3.6   61  493-564    92-153 (167)
 20 KOG4729 Galactoside-binding le  64.4     8.3 0.00018   40.3   4.2   87  605-693   140-234 (265)
 21 PF08308 PEGA:  PEGA domain;  I  61.2     7.4 0.00016   32.1   2.6   46  343-400     3-48  (71)
 22 PF11875 DUF3395:  Domain of un  61.0     6.8 0.00015   38.0   2.7   67  623-691    55-135 (151)
 23 PF11324 DUF3126:  Protein of u  59.7      18 0.00039   29.8   4.5   24  348-371    25-48  (63)
 24 PF08531 Bac_rhamnosid_N:  Alph  56.2      53  0.0012   32.2   8.2   56  343-399     7-69  (172)
 25 KOG2024 Beta-Glucuronidase GUS  54.1      22 0.00048   37.4   5.1   58  305-363    71-131 (297)
 26 PF08531 Bac_rhamnosid_N:  Alph  49.4      23 0.00049   34.9   4.3   22  487-508     7-28  (172)
 27 PF07691 PA14:  PA14 domain;  I  49.2      90  0.0019   28.8   8.3   70  320-397    47-122 (145)
 28 PLN03059 beta-galactosidase; P  42.2 1.2E+02  0.0027   37.2   9.7   42  468-509   469-517 (840)
 29 TIGR01676 GLDHase galactonolac  40.4      31 0.00067   40.3   4.2   41   24-64    375-418 (541)
 30 KOG0496 Beta-galactosidase [Ca  39.5     7.9 0.00017   45.4  -0.7   58  614-671   332-389 (649)
 31 COG2884 FtsE Predicted ATPase   37.1      29 0.00063   35.2   2.8   16  494-509    55-70  (223)
 32 PF07009 DUF1312:  Protein of u  26.3 3.5E+02  0.0075   24.6   7.9   81  493-624    26-107 (113)
 33 PF02055 Glyco_hydro_30:  O-Gly  23.7   2E+02  0.0043   33.4   7.1  220    4-240   206-472 (496)
 34 PF14307 Glyco_tran_WbsX:  Glyc  21.8 1.9E+02  0.0041   31.8   6.2   61    2-70    138-198 (345)
 35 smart00758 PA14 domain in bact  21.7 1.6E+02  0.0035   27.0   4.9   38  471-509    46-83  (136)
 36 PF07691 PA14:  PA14 domain;  I  21.4 1.1E+02  0.0025   28.1   3.9   38  471-509    48-85  (145)
 37 smart00758 PA14 domain in bact  20.6 3.1E+02  0.0067   25.1   6.6   65  320-393    45-110 (136)
 38 COG2360 Aat Leu/Phe-tRNA-prote  20.3   1E+02  0.0022   31.6   3.3   35   16-54    167-205 (221)

No 1  
>PLN03059 beta-galactosidase; Provisional
Probab=100.00  E-value=2.2e-168  Score=1437.61  Aligned_cols=686  Identities=63%  Similarity=1.144  Sum_probs=623.8

Q ss_pred             ChhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecCCCCCCccccC
Q 005404            1 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKEDDAPDPVINS   80 (698)
Q Consensus         1 ~~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~~~~~~~v~~~   80 (698)
                      |++|+++|+++|++++||+++||||||+|||||||++.++|+.+|++||+||++||+++|++||||||++.++++++++|
T Consensus       154 v~~~~~~l~~~l~~~~l~~~~GGPIImvQIENEYGs~~~~~~~~d~~Yl~~l~~~~~~~Gi~VPl~t~dg~~~~~~v~~t  233 (840)
T PLN03059        154 MQKFTEKIVDMMKSEKLFEPQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAADMAVKLGTGVPWVMCKQEDAPDPVIDT  233 (840)
T ss_pred             HHHHHHHHHHHHhhcceeecCCCcEEEEEecccccceecccCcchHHHHHHHHHHHHHcCCCcceEECCCCCCCccceec
Confidence            68999999999988899999999999999999999998778889999999999999999999999999998888889999


Q ss_pred             CCCcccccCCCCCCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEeeccCCCCCCCCCCcccc
Q 005404           81 CNGFYCDAFSPNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPFITT  160 (698)
Q Consensus        81 ~ng~~~~~f~~~~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMfhGGTNfG~~~G~~~~~T  160 (698)
                      +||.+++.|...++.+|+|+||||+|||++||++++.|+++|++.+++++|++|+|++||||||||||||||||+++++|
T Consensus       234 ~Ng~~~~~f~~~~~~~P~m~tE~w~GWf~~wG~~~~~r~~~d~a~~~~~~l~~g~S~~N~YMfhGGTNFG~~~Ga~~~~T  313 (840)
T PLN03059        234 CNGFYCENFKPNKDYKPKMWTEAWTGWYTEFGGAVPNRPAEDLAFSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIAT  313 (840)
T ss_pred             CCCchhhhcccCCCCCCcEEeccCchhHhhcCCCCCcCCHHHHHHHHHHHHHcCCeeEEeeeccCcCCcccccCCCcccc
Confidence            99988888888777789999999999999999999999999999999999999999889999999999999999999999


Q ss_pred             cccCCCCCCCCCCCCchhHHHHHHHHHHHHhhhccccCCCCcccCCCCcceeeeeecCCcceeeEeeccCCccceEEEeC
Q 005404          161 SYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALVSSDPTVTSLGTYQQAHVFSAGQQKCAAFLSNYNTKSAARVTFN  240 (698)
Q Consensus       161 SYDY~APl~E~G~~~~~ky~~lr~l~~~~~~~~~~l~~~~p~~~~~~~~~~~~~y~~~~~~~~~Fl~n~~~~~~~~v~~~  240 (698)
                      ||||||||+|+|++|.|||.+||++|++++.+++.|+..+|....+++.+++++|.... .|++|+.|++.+...+|+|+
T Consensus       314 SYDYdAPL~E~G~~t~pKy~~lr~l~~~~~~~~~~l~~~~p~~~~lg~~~ea~~y~~~~-~caaFl~n~~~~~~~~v~f~  392 (840)
T PLN03059        314 SYDYDAPLDEYGLPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVFKSKS-ACAAFLANYDTKYSVKVTFG  392 (840)
T ss_pred             ccccCCccccccCcchhHHHHHHHHHHHHHhcCccccCCCCceeccCCceeEEEccCcc-chhhheeccCCCCceeEEEC
Confidence            99999999999999546999999999999988888887777777889999999999666 79999999998888999999


Q ss_pred             CeeeecCCCceeecCCCCceeeccceeeeeeeeeeccccccccccccccccccccccCCCccccccchhhcCCCCCCccE
Q 005404          241 GRQYNLPPWSISILPDCKNVVFNTAKVAVQHTKMQMLPTGSKLLSWETYDEDISSLGESSTLTAIGLLEQINITRDTSDY  320 (698)
Q Consensus       241 ~~~~~~p~~sv~il~~~~~~~~~t~~v~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~p~~~Eql~~t~d~~Gy  320 (698)
                      |++|.||+|||+|||||+.++|+|++++.|++.++..+. ...+.|+++.|++.+...+.+++...++||+++|+|.+||
T Consensus       393 g~~y~lp~~Svsilpd~~~~lfnta~v~~q~~~~~~~~~-~~~~~w~~~~e~~~~~~~~~~~~~e~l~e~~n~t~d~~dY  471 (840)
T PLN03059        393 NGQYDLPPWSVSILPDCKTAVFNTARLGAQSSQMKMNPV-GSTFSWQSYNEETASAYTDDTTTMDGLWEQINVTRDATDY  471 (840)
T ss_pred             CcccccCccceeecccccceeeeccccccccceeecccc-cccccceeecccccccccCCCcchhhHHHhhcccCCCCce
Confidence            999999999999999999999999999888766644322 3456999999995543233567777889999999999999


Q ss_pred             EEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecCCcc
Q 005404          321 LWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVGLPN  400 (698)
Q Consensus       321 ~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~Gr~N  400 (698)
                      +||+|+|....++..++.+.+.+|+|.+++|++||||||+++|+.++......++++.+++++.|.|+|+||||||||+|
T Consensus       472 lwY~t~i~~~~~~~~~~~~~~~~L~v~~~~d~~~vFVNg~~~Gt~~~~~~~~~~~~~~~v~l~~g~n~L~iLse~vG~~N  551 (840)
T PLN03059        472 LWYMTEVHIDPDEGFLKTGQYPVLTIFSAGHALHVFINGQLAGTVYGELSNPKLTFSQNVKLTVGINKISLLSVAVGLPN  551 (840)
T ss_pred             EEEEEEEeecCCccccccCCCceEEEcccCcEEEEEECCEEEEEEEeecCCcceEEecccccCCCceEEEEEEEeCCCCc
Confidence            99999998876654456777889999999999999999999999998766667888878889999999999999999999


Q ss_pred             cccCCCccccceeecEEEccccCCCccCccCCcEEeccCccccccccCCCCCCCCcccCCCccccccCCceEEEEEEeCC
Q 005404          401 VGLHYETWETGVRGAVVLHGLDHGNKDLTWQKWSYQVGLKGEAMNLVSPSEATSVDWTRGSLAAQGQQSLKWYKAYFDAP  480 (698)
Q Consensus       401 ~g~~~~~~~kGI~g~V~l~g~~~~~~~l~~~~W~~~~~l~~e~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~Yr~~F~~p  480 (698)
                      ||++|+++.|||+|+|+|+|.+.++++|+++.|.|+++|.||.++|+.+++...+.|...+..+. .++|+|||++|++|
T Consensus       552 yG~~le~~~kGI~g~V~i~g~~~g~~dls~~~W~y~lgL~GE~~~i~~~~~~~~~~W~~~~~~~~-~~p~twYK~~Fd~p  630 (840)
T PLN03059        552 VGLHFETWNAGVLGPVTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTITGSSSVEWVEGSLLAQ-KQPLTWYKTTFDAP  630 (840)
T ss_pred             cCcccccccccccccEEEecccCCceecccCccccccCccceeccccccCCCCCccccccccccC-CCCceEEEEEEeCC
Confidence            99999999999999999999878888999889999999999999988876556788976544333 45799999999999


Q ss_pred             CCCCCeEEEeCCCceEEEEECCeeeeeeeeccc-cCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceEEE
Q 005404          481 TGNEPLALDLRSMGKGQVWINGQSIGRYWMAYA-KGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLLVV  559 (698)
Q Consensus       481 ~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW~~~~-~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~ivv  559 (698)
                      ++.|||||||++||||+|||||+||||||+.++ .+.|..|+|+|.|++.+|+|+||+|||+|||||++|||+|+|+|||
T Consensus       631 ~g~Dpv~LDm~gmGKG~aWVNG~nIGRYW~~~a~~~gC~~c~y~g~~~~~kc~~~cggP~q~lYHVPr~~Lk~g~N~lVi  710 (840)
T PLN03059        631 GGNDPLALDMSSMGKGQIWINGQSIGRHWPAYTAHGSCNGCNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLIV  710 (840)
T ss_pred             CCCCCEEEecccCCCeeEEECCcccccccccccccCCCccccccccccchhhhccCCCceeEEEeCcHHHhccCCceEEE
Confidence            999999999999999999999999999997642 2334889999999999999999999999999999999999999999


Q ss_pred             EEeeCCCCcceEEEeecccccccccccCCCCCCCcccccCCCCCcCCCCceeeecCCCCeEeeeeeeccCCCCCCCCCCC
Q 005404          560 FEELGGDASRISLVKRSVARVCADAHEHHPTTDNNDIENKGNSNSTGNAKVLLQCAPGQSITSIEFASFGTPSGTCGSFQ  639 (698)
Q Consensus       560 fE~~g~~~~~i~~~~~~~~~~~~~~~e~hp~~~~~~~~~~~~~~~ce~~~~~L~C~~g~~Is~I~~A~YGr~~~~C~~~~  639 (698)
                      |||+|++|..|+|.++++++||++++|+|| ++.+|++.....-.-....++|+|+.|++|+.|.+|+|||+.++|+++.
T Consensus       711 FEe~gg~p~~I~~~~~~~~~~c~~~~e~~p-~~~~w~~~~~~~~~~~~~~~~L~C~~G~~Is~I~fAsYGrp~gtC~~~~  789 (840)
T PLN03059        711 FEEWGGNPAGISLVKRTTDSVCADIFEGQP-ALKNWQIIASGKVNSLQPKAHLWCPPGQKISKIKFASFGVPQGTCGSFR  789 (840)
T ss_pred             EEecCCCCCceEEEEeecCcccccccccCC-ccccccccccccccccCCcEEEECCCCceEEEEEEecCCCCCCCCCCCC
Confidence            999999999999999999999999999995 6999999555432355678999999999997799999999999999999


Q ss_pred             CCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCCCCCCceEEEEEEee
Q 005404          640 KGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPCPNVLKRLSVQAVCS  690 (698)
Q Consensus       640 ~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC~g~~KYL~V~Y~C~  690 (698)
                      +++|++++++++|+++|+||++|+|.+++.+||+|||+||+|||+|+|+|.
T Consensus       790 ~g~C~a~~S~~vV~kaC~Gk~~CsV~asn~~FggDPC~gt~KyL~V~~~Cs  840 (840)
T PLN03059        790 EGSCHAHKSYDAFERNCIGKQSCSVTVAPEVFGGDPCPDSMKKLSVEAVCS  840 (840)
T ss_pred             CCCEeCCcHHHHHHHHCCCCCceEEEeccceecCCCCCCceeEEEEEEEeC
Confidence            999999999999999999999999999999996699999999999999994


No 2  
>KOG0496 consensus Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=100.00  E-value=3.1e-107  Score=899.63  Aligned_cols=502  Identities=58%  Similarity=1.049  Sum_probs=451.5

Q ss_pred             ChhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecCCCCCCccccC
Q 005404            1 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKEDDAPDPVINS   80 (698)
Q Consensus         1 ~~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~~~~~~~v~~~   80 (698)
                      |++|+++|+|++|  +|+++|||||||+|||||||.+...|++..+.|++|-+.|+...+.+|||++|.+.++|+.++++
T Consensus       144 ~~~~~~~iv~~mk--~L~~~qGGPIIl~QIENEYG~~~~~~~~~~k~y~~w~a~m~~~l~~gvpw~mCk~~dapd~~in~  221 (649)
T KOG0496|consen  144 MERWTTKIVPMMK--KLFASQGGPIILVQIENEYGNYLRALGAEGKSYLKWAAVLATSLGTGVPWVMCKQDDAPDPGINT  221 (649)
T ss_pred             HHHHHHHHHHHHH--HHHhhcCCCEEEEEeechhhHHHHHHHHHHHHhhccceEEEEecCCCCceeEecCCCCCCccccc
Confidence            8999999999999  99999999999999999999998888888999999999999999999999999999999999999


Q ss_pred             CCCccc-ccCC-CCCCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEeeccCCCCCCCCCCcc
Q 005404           81 CNGFYC-DAFS-PNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPFI  158 (698)
Q Consensus        81 ~ng~~~-~~f~-~~~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMfhGGTNfG~~~G~~~~  158 (698)
                      |||++| +.|. +++|++|+||||||+|||++||++++.|++++++..+++++++|+|++||||||||||||++|| ++.
T Consensus       222 cng~~c~~~f~~pn~~~kP~~wtE~wtgwf~~wGg~~~~R~~e~ia~~va~fls~ggs~vNyYM~hGGTNFGrt~G-~~~  300 (649)
T KOG0496|consen  222 CNGFYCGDTFKRPNSPNKPLVWTENWTGWFTHWGGPHPCRPVEDIALSVARFLSKGGSSVNYYMYHGGTNFGRTNG-PFI  300 (649)
T ss_pred             cCCccchhhhccCCCCCCCceecccccchhhhhCCCCCCCCHHHHHHHHHHHHhcCccceEEEEeecccCCCcccC-ccc
Confidence            999999 8888 8899999999999999999999999999999999999999999989999999999999999998 899


Q ss_pred             cccccCCCCCCCCCCCCchhHHHHHHHHHHHHhhhccccCCCCcccCCCCcceeeeeecCCcceeeEeeccCCccceEEE
Q 005404          159 TTSYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALVSSDPTVTSLGTYQQAHVFSAGQQKCAAFLSNYNTKSAARVT  238 (698)
Q Consensus       159 ~TSYDY~APl~E~G~~~~~ky~~lr~l~~~~~~~~~~l~~~~p~~~~~~~~~~~~~y~~~~~~~~~Fl~n~~~~~~~~v~  238 (698)
                      +|||||||||+  |..++|||.++|.+|..+..+++.+..+++..         ..|....++|..||.|++......+.
T Consensus       301 atsy~~dap~d--gl~~~pk~ghlk~~hts~d~~ep~lv~gd~~~---------~kyg~~~~~C~~Fl~n~~~~~~~~v~  369 (649)
T KOG0496|consen  301 ATSYDYDAPLD--GLLRQPKYGHLKPLHTSYDYCEPALVAGDITT---------AKYGNLREACAAFLSNNNGAPAAPVP  369 (649)
T ss_pred             ccccccccccc--hhhcCCCccccccchhhhhhcCccccccCccc---------ccccchhhHHHHHHhcCCCCCCCccc
Confidence            99999999999  99999999999999999999888876665433         34555556799999999988888999


Q ss_pred             eCCeeeecCCCceeecCCCCceeeccceeeeeeeeeeccccccccccccccccccccccCCCccccccchhhcCCCCCCc
Q 005404          239 FNGRQYNLPPWSISILPDCKNVVFNTAKVAVQHTKMQMLPTGSKLLSWETYDEDISSLGESSTLTAIGLLEQINITRDTS  318 (698)
Q Consensus       239 ~~~~~~~~p~~sv~il~~~~~~~~~t~~v~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~p~~~Eql~~t~d~~  318 (698)
                      +++..+.+|+|+++|+++|+.++|+|+++.+               .|....||++             +|..++   .+
T Consensus       370 f~~~~y~~~~~slsilpdck~~~~nta~~~~---------------~~~~~~e~~~-------------~~~~~~---~~  418 (649)
T KOG0496|consen  370 FNKPKYRLPPWSLSILPDCKTVVYNTAKVMA---------------QWISFTEPIP-------------SEAVGQ---SF  418 (649)
T ss_pred             cCCCccccCceeEEechhhcchhhhcccccc---------------ccccccCCCc-------------cccccC---cc
Confidence            9999999999999999999999999988743               1544445544             566655   78


Q ss_pred             cEEEEEEEeecCCCcccccCCCCceeeeC-CcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecC
Q 005404          319 DYLWYMTSVEISSSESFLRGGQKPTLTVE-SAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVG  397 (698)
Q Consensus       319 Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~-~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~G  397 (698)
                      ||++|++.++...+++       ..|+|. +++|++||||||+++|++++......+.+..++.|..|.|+|+|||||+|
T Consensus       419 ~~ll~~~~~t~d~sd~-------t~~~i~ls~g~~~hVfvNg~~~G~~~g~~~~~~~~~~~~~~l~~g~n~l~iL~~~~G  491 (649)
T KOG0496|consen  419 GGLLEQTNLTKDKSDT-------TSLKIPLSLGHALHVFVNGEFAGSLHGNNEKIKLNLSQPVGLKAGENKLALLSENVG  491 (649)
T ss_pred             eEEEEEEeeccccCCC-------ceEeecccccceEEEEECCEEeeeEeccccceeEEeecccccccCcceEEEEEEecC
Confidence            8999999998765542       468888 99999999999999999998765666777778889999999999999999


Q ss_pred             CcccccCCCccccceeecEEEccccCCCccCccCCcEEeccCccccccccCCCCCCCCcccCCCccccccCCceEEEEEE
Q 005404          398 LPNVGLHYETWETGVRGAVVLHGLDHGNKDLTWQKWSYQVGLKGEAMNLVSPSEATSVDWTRGSLAAQGQQSLKWYKAYF  477 (698)
Q Consensus       398 r~N~g~~~~~~~kGI~g~V~l~g~~~~~~~l~~~~W~~~~~l~~e~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~Yr~~F  477 (698)
                      |+||| +++++.|||+|+|+|+|.    ++++++.|.|+++|.+|.+.++..++..+++|......+. .+|.+||+ +|
T Consensus       492 ~~n~G-~~e~~~~Gi~g~v~l~g~----~~l~~~~w~~~~gl~ge~~~~~~~~~~~~v~w~~~~~~~~-k~P~~w~k-~f  564 (649)
T KOG0496|consen  492 LPNYG-HFENDFKGILGPVYLNGL----IDLTWTKWPYKVGLKGEKLGLHTEEGSSKVKWKKLSNTAT-KQPLTWYK-TF  564 (649)
T ss_pred             CCCcC-cccccccccccceEEeee----eccceeecceecccccchhhccccccccccceeeccCccc-CCCeEEEE-Ee
Confidence            99999 789999999999999997    4677678889999999999999888778899987755444 36889999 99


Q ss_pred             eCCCCCCCeEEEeCCCceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceE
Q 005404          478 DAPTGNEPLALDLRSMGKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLL  557 (698)
Q Consensus       478 ~~p~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~i  557 (698)
                      ++|++.+||||||.|||||+|||||+|||||||++           |             ||++|| ||++|||++.|.|
T Consensus       565 ~~p~g~~~t~Ldm~g~GKG~vwVNG~niGRYW~~~-----------G-------------~Q~~yh-vPr~~Lk~~~N~l  619 (649)
T KOG0496|consen  565 DIPSGSEPTALDMNGWGKGQVWVNGQNIGRYWPSF-----------G-------------PQRTYH-VPRSWLKPSGNLL  619 (649)
T ss_pred             cCCCCCCCeEEecCCCcceEEEECCcccccccCCC-----------C-------------CceEEE-CcHHHhCcCCceE
Confidence            99999999999999999999999999999999875           5             876665 9999999999999


Q ss_pred             EEEEeeCCCCcceEEEeeccccccccccc
Q 005404          558 VVFEELGGDASRISLVKRSVARVCADAHE  586 (698)
Q Consensus       558 vvfE~~g~~~~~i~~~~~~~~~~~~~~~e  586 (698)
                      |||||++++|..|+|+++.+..+|..+.|
T Consensus       620 vvfEee~~~p~~i~~~~~~~~~~~~~v~~  648 (649)
T KOG0496|consen  620 VVFEEEGGDPNGISFVTRPVLSTCAYVRE  648 (649)
T ss_pred             EEEEeccCCCccceEEEeEeeeEeeeccc
Confidence            99999999999999999998899988765


No 3  
>PF01301 Glyco_hydro_35:  Glycosyl hydrolases family 35;  InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=100.00  E-value=3.7e-45  Score=391.54  Aligned_cols=179  Identities=39%  Similarity=0.704  Sum_probs=130.9

Q ss_pred             ChhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCC-cceEeecCCC-------
Q 005404            1 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTG-VPWVMCKEDD-------   72 (698)
Q Consensus         1 ~~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~-vp~~~~~~~~-------   72 (698)
                      |++|+++|+++|+  ++|+++||||||||||||||++     ..|++||+.|++++++.|++ +++++++...       
T Consensus       119 ~~~~~~~~~~~~~--~~~~~~GGpII~vQvENEyg~~-----~~~~~Y~~~l~~~~~~~g~~~~~~~t~d~~~~~~~~~~  191 (319)
T PF01301_consen  119 VERWYRALAKIIK--PLQYTNGGPIIMVQVENEYGSY-----GTDRAYMEALKDAYRDWGIDPVLLYTTDGPWGSWLPDG  191 (319)
T ss_dssp             HHHHHHHHHHHHG--GGBGGGTSSEEEEEESSSGGCT-----SS-HHHHHHHHHHHHHTT-SSSBEEEEESSSHCCHCCC
T ss_pred             HHHHHHHHHHHHH--hhhhcCCCceehhhhhhhhCCC-----cccHhHHHHHHHHHHHhhCccceeeccCCCcccccccC
Confidence            5799999999999  9999999999999999999963     38999999999999999998 6677776531       


Q ss_pred             -CCCccccCCCCccccc--------CCCCCCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEe
Q 005404           73 -APDPVINSCNGFYCDA--------FSPNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMY  143 (698)
Q Consensus        73 -~~~~v~~~~ng~~~~~--------f~~~~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMf  143 (698)
                       .++..+.+++++.+..        +...+|++|+|++|||+|||++||++++.+++++++..++++|+.|.+ +|||||
T Consensus       192 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~p~~P~~~~E~~~Gwf~~WG~~~~~~~~~~~~~~l~~~l~~g~~-~nyYM~  270 (319)
T PF01301_consen  192 GLPGADIYATDNFPPGDNPDEYFGDQRSFQPNQPLMCTEFWGGWFDHWGGPHYTRPAEDVAADLARMLSKGNS-LNYYMF  270 (319)
T ss_dssp             -TTTGSCEEEEEETTTSSHHHHHHHHHHHHTTS--EEEEEESS---BTTS--HHHHHHHHHHHHHHHHHHCSE-EEEEEC
T ss_pred             CCCcceEEeccccCCCchHHHHHhhhhhcCCCCCeEEEEeccccccccCCCCccCCHHHHHHHHHHHHHhhcc-cceeec
Confidence             2222233333333321        124578899999999999999999999989999999999999999965 799999


Q ss_pred             eccCCCCCCCCCCcc----cccccCCCCCCCCCCCCchhHHHHHHHHHH
Q 005404          144 HGGTNFGRTAGGPFI----TTSYDYDAPLDEYGLMRQPKYGHLKQLHEA  188 (698)
Q Consensus       144 hGGTNfG~~~G~~~~----~TSYDY~APl~E~G~~~~~ky~~lr~l~~~  188 (698)
                      |||||||+++|++..    +|||||+|||+|+|++ ||||++||+||++
T Consensus       271 hGGTNfG~~~ga~~~~~p~~TSYDY~ApI~E~G~~-~~Ky~~lr~l~~~  318 (319)
T PF01301_consen  271 HGGTNFGFWAGANYYGQPDITSYDYDAPIDEYGQL-TPKYYELRRLHQK  318 (319)
T ss_dssp             E--B--TT-B-EETTTEEB-SB--TT-SB-TTS-B--HHHHHHHHHHHT
T ss_pred             cccCCccccccCCCCCCCCcccCCcCCccCcCCCc-CHHHHHHHHHHhc
Confidence            999999999887654    5999999999999999 6999999999864


No 4  
>KOG4729 consensus Galactoside-binding lectin [General function prediction only]
Probab=99.85  E-value=1e-21  Score=197.44  Aligned_cols=92  Identities=30%  Similarity=0.622  Sum_probs=85.3

Q ss_pred             cCCCCceeeecCCCCeEeeeeeeccCCCC-CCCCCC----CCCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCCCC
Q 005404          604 STGNAKVLLQCAPGQSITSIEFASFGTPS-GTCGSF----QKGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPCPN  678 (698)
Q Consensus       604 ~ce~~~~~L~C~~g~~Is~I~~A~YGr~~-~~C~~~----~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC~g  678 (698)
                      +|||+.++|+||.|.+| +|++|+|||.. ..|.+.    .+.+|..+.+++++.++|+++++|.|.|+..+|+.|||||
T Consensus        40 aCdG~~i~L~CP~~dvI-sv~sanYGR~~~~iC~pd~~~~~Si~C~~p~s~~i~~~rCnnr~~C~vvv~s~~F~~DPCPg  118 (265)
T KOG4729|consen   40 ACDGERITLSCPRGDVI-SVQSANYGRFSDKICDPDPGREESINCYLPKSFSILSSRCNNRRQCTVVVDSDVFGDDPCPG  118 (265)
T ss_pred             eecCceEEEEcCCCCEE-EEEecccCcccccccCCccccccchhccChHHHHHHHHhcCCCceEEEEecCCccCCCCCCC
Confidence            69999999999999999 59999999994 799752    4689999999999999999999999999999999999999


Q ss_pred             CCceEEEEEEeecCCCCC
Q 005404          679 VLKRLSVQAVCSTADANT  696 (698)
Q Consensus       679 ~~KYL~V~Y~C~p~~~~~  696 (698)
                      |+|||+|+|.|+|.++.+
T Consensus       119 T~KYLev~Y~Cvp~~~~~  136 (265)
T KOG4729|consen  119 TSKYLEVQYGCVPYAFTE  136 (265)
T ss_pred             chhheEEEeccCcccccc
Confidence            999999999999996543


No 5  
>PF02140 Gal_Lectin:  Galactose binding lectin domain;  InterPro: IPR000922 The D-galactoside binding lectin purified from sea urchin (Anthocidaris crassispina) eggs exists as a disulphide-linked homodimer of two subunits; the dimeric form is essential for hemagglutination activity []. The sea urchin egg lectin (SUEL) forms a new class of lectins. Although SUEL was first isolated as a D-galactoside binding lectin, it was latter shown that it bind to L-rhamnose preferentially [, ]. L-rhamnose and D-galactose share the same hydroxyl group orientation at C2 and C4 of the pyranose ring structure. A cysteine-rich domain homologous to the SUEL protein has been identified in the following proteins [, , ]:  Plant beta-galactosidases (3.2.1.23 from EC) (lactases). Mammalian latrophilin, the calcium independent receptor of alpha-latrotoxin (CIRL). The galactose-binding lectin domain is not required for alpha-latratoxin binding []. Human lectomedin-1. Rhamnose-binding lectin (SAL) from catfish (Silurus asotus, Namazu) eggs. This protein is composed of three tandem repeat domains homologous to the SUEL lectin domain. All cysteine positions of each domain are completely conserved []. The hypothetical B0457.1, F32A7.3A and F32A7.3B proteins from Caenorhabditis elegans. The human KIAA0821 protein. ; GO: 0005529 sugar binding; PDB: 2JXA_A 2JX9_A 2ZX2_A 2ZX3_B 2ZX0_B 2ZX1_B 2ZX4_B.
Probab=99.81  E-value=2.1e-20  Score=160.72  Aligned_cols=76  Identities=41%  Similarity=0.852  Sum_probs=62.1

Q ss_pred             eecCCCCeEeeeeeeccCCCC-CCCCCC---CCCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCCCCCCceEEEEE
Q 005404          612 LQCAPGQSITSIEFASFGTPS-GTCGSF---QKGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPCPNVLKRLSVQA  687 (698)
Q Consensus       612 L~C~~g~~Is~I~~A~YGr~~-~~C~~~---~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC~g~~KYL~V~Y  687 (698)
                      |+|+.|++| .|.+|+|||+. .+|+..   .+.+|.+++++++|+++|+||++|+|.+++.+|| ||||++.|||+|+|
T Consensus         1 L~C~~g~~I-~I~~A~YGR~~~~~C~~~~~~~~~~C~~~~~~~~v~~~C~g~~~C~v~~~~~~f~-dpC~~~~KyL~V~Y   78 (80)
T PF02140_consen    1 LSCPPGKVI-SIDSAFYGRTSSSICPSSSSGSNTNCSAPDALSIVKERCNGKQSCSVPADNSVFG-DPCPGTSKYLEVTY   78 (80)
T ss_dssp             EE-STTEEE-EEEEEEEEBSSSSTT--GGGCS-TTB--TTHHHHHHHHHTTBSEEEEESSHHHH---SSTTS--EEEEEE
T ss_pred             CCCcCCCEE-EEEEeecCCCCCCCCcCCCcCCCCccccccccchhHHhCCCCCccEEEeccCccC-CCCCCCCeEEEEEE
Confidence            799999999 69999999995 699743   4678999999999999999999999999999998 99999999999999


Q ss_pred             Ee
Q 005404          688 VC  689 (698)
Q Consensus       688 ~C  689 (698)
                      +|
T Consensus        79 ~C   80 (80)
T PF02140_consen   79 TC   80 (80)
T ss_dssp             EE
T ss_pred             EC
Confidence            99


No 6  
>PF13364 BetaGal_dom4_5:  Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=98.96  E-value=1.3e-09  Score=99.62  Aligned_cols=68  Identities=32%  Similarity=0.711  Sum_probs=49.2

Q ss_pred             cCCceEEEEEEeCCCCCC-CeE-EEe--CCCceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceee
Q 005404          467 QQSLKWYKAYFDAPTGNE-PLA-LDL--RSMGKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRW  542 (698)
Q Consensus       467 ~~~~~~Yr~~F~~p~~~d-~~~-Ld~--~g~gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtl  542 (698)
                      ..+..|||++|.... .| .+. |+.  ....+++|||||++|||||+.+           |             ||+++
T Consensus        33 ~~g~~~Yrg~F~~~~-~~~~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~-----------g-------------~q~tf   87 (111)
T PF13364_consen   33 HAGYLWYRGTFTGTG-QDTSLTPLNIQGGNAFRASVWVNGWFLGSYWPGI-----------G-------------PQTTF   87 (111)
T ss_dssp             SSCEEEEEEEEETTT-EEEEEE-EEECSSTTEEEEEEETTEEEEEEETTT-----------E-------------CCEEE
T ss_pred             CCCCEEEEEEEeCCC-cceeEEEEeccCCCceEEEEEECCEEeeeecCCC-----------C-------------ccEEE
Confidence            347899999996421 22 233 333  3467999999999999999643           5             99988


Q ss_pred             EecCcccccCCcceEEEE
Q 005404          543 YHVPRSWLKPTKNLLVVF  560 (698)
Q Consensus       543 Y~vP~~~Lk~g~N~ivvf  560 (698)
                      + ||+++|+.+.|.|+|+
T Consensus        88 ~-~p~~il~~~n~v~~vl  104 (111)
T PF13364_consen   88 S-VPAGILKYGNNVLVVL  104 (111)
T ss_dssp             E-E-BTTBTTCEEEEEEE
T ss_pred             E-eCceeecCCCEEEEEE
Confidence            8 9999999875555554


No 7  
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=98.79  E-value=4.8e-09  Score=121.33  Aligned_cols=164  Identities=19%  Similarity=0.180  Sum_probs=98.1

Q ss_pred             hHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecCC------C--CC
Q 005404            3 GFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKED------D--AP   74 (698)
Q Consensus         3 ~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~~------~--~~   74 (698)
                      ++.+.|+++|+  ++++++|++|||+|++||||.+     .|..+|.+.....+.+.+-+ .+-..+..      +  ..
T Consensus       132 ~~~~~i~~~ir--er~~~~~~~v~~w~~dneY~~~-----~~~~~~~~~~f~~wLk~~yg-~l~~ln~~w~t~~ws~t~~  203 (673)
T COG1874         132 EYLDRILQQIR--ERLYGNGPAVITWQNDNEYGGH-----PCYCDYCQAAFRLWLKKGYG-SLDNLNEAWGTSFWSHTYK  203 (673)
T ss_pred             HHHHHHHHHHH--HHHhccCCceeEEEccCccCCc-----cccccccHHHHHHHHHhCcc-hHHhhhhhhhhhhcccccc
Confidence            45667788898  7779999999999999999984     24455555555543333332 11111110      0  00


Q ss_pred             C-ccccCCC-----Cccc--ccCCCCCCCC----CceeeeccCccc-cccCCCcCCCC-HHHHHHHHHHHHHcCCceeee
Q 005404           75 D-PVINSCN-----GFYC--DAFSPNKPYK----PTLWTEAWSGWF-TEFGGAVHRRP-VQDLAFAVARFIQKGGSFFNY  140 (698)
Q Consensus        75 ~-~v~~~~n-----g~~~--~~f~~~~p~~----P~~~~E~~~Gwf-~~wG~~~~~~~-~~~~~~~~~~~l~~g~s~~n~  140 (698)
                      + ..|.+.+     +...  -++......+    +....|.+-+|| +.|..+.-... .+.-...++..|..... -||
T Consensus       204 ~~~~i~~p~~~~e~~~~~~~ld~~~f~~e~~~~~~~~~~~~~~~~~P~~pvt~nl~~~~~~~~~~~~~~~ld~~sw-dny  282 (673)
T COG1874         204 DFDEIMSPNPFGELPLPGLYLDYRRFESEQILEFVREEGEAIKAYFPNRPVTPNLLAAFKKFDAYKWEKVLDFASW-DNY  282 (673)
T ss_pred             cHHhhcCCCCccccCCccchhhHhhhhhhhhHHHHHHHHHHHHHhCCCCCCChhHhhhhhhcchHHHHHhcChhhh-hhh
Confidence            0 0011111     0000  0122222233    556677788888 77766543333 23334455666666655 699


Q ss_pred             eEeeccCCCC------CCCCCC---c-------ccccccCCCCCCCCCCCC
Q 005404          141 YMYHGGTNFG------RTAGGP---F-------ITTSYDYDAPLDEYGLMR  175 (698)
Q Consensus       141 YMfhGGTNfG------~~~G~~---~-------~~TSYDY~APl~E~G~~~  175 (698)
                      ||||+|++|+      +.+|+.   +       ..+++++++.+.+.|.++
T Consensus       283 ~~~~~~~~~~~~~h~l~r~~~~~~~~~~me~~P~~vn~~~~n~~~~~G~~~  333 (673)
T COG1874         283 PAWHRGRDFTKFIHDLFRNGKQGQPFWLMEQLPSVVNWALYNKLKRPGALR  333 (673)
T ss_pred             hhhccccchhhhhHHHHHhhccCCceeeccCCcchhhhhhccCCCCCcccc
Confidence            9999999999      777764   2       579999999999999853


No 8  
>PF13364 BetaGal_dom4_5:  Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=98.55  E-value=4.1e-07  Score=83.24  Aligned_cols=84  Identities=27%  Similarity=0.377  Sum_probs=58.7

Q ss_pred             hhhcCCCCCCccEEEEEEEeecCCCcccccCCCCce-eee-CCcceEEEEEECCEEEEEEEccccccceEEeecc-cccC
Q 005404          308 LEQINITRDTSDYLWYMTSVEISSSESFLRGGQKPT-LTV-ESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPA-NLRA  384 (698)
Q Consensus       308 ~Eql~~t~d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~-L~v-~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~-~l~~  384 (698)
                      .+..+..++..|++|||++|.....+.      ... |.+ .+.+.+++|||||+++|+..... ..+.+|++|. .|+.
T Consensus        24 ~l~~~~~g~~~g~~~Yrg~F~~~~~~~------~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~-g~q~tf~~p~~il~~   96 (111)
T PF13364_consen   24 VLYASDYGFHAGYLWYRGTFTGTGQDT------SLTPLNIQGGNAFRASVWVNGWFLGSYWPGI-GPQTTFSVPAGILKY   96 (111)
T ss_dssp             STCCGCGTSSSCEEEEEEEEETTTEEE------EEE-EEECSSTTEEEEEEETTEEEEEEETTT-ECCEEEEE-BTTBTT
T ss_pred             eeccCccccCCCCEEEEEEEeCCCcce------eEEEEeccCCCceEEEEEECCEEeeeecCCC-CccEEEEeCceeecC
Confidence            455555567999999999997543331      124 444 36899999999999999987322 2335666665 3556


Q ss_pred             CcCeEEEEEeecCC
Q 005404          385 GINKIALLSIAVGL  398 (698)
Q Consensus       385 g~~~L~ILven~Gr  398 (698)
                      ++++|.||+++||+
T Consensus        97 ~n~v~~vl~~~~g~  110 (111)
T PF13364_consen   97 GNNVLVVLWDNMGH  110 (111)
T ss_dssp             CEEEEEEEEE-STT
T ss_pred             CCEEEEEEEeCCCC
Confidence            67789999999996


No 9  
>PF02837 Glyco_hydro_2_N:  Glycosyl hydrolases family 2, sugar binding domain;  InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme.  This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=98.25  E-value=6.7e-06  Score=79.99  Aligned_cols=99  Identities=24%  Similarity=0.334  Sum_probs=71.2

Q ss_pred             CCCccEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCc-CeEEEEE
Q 005404          315 RDTSDYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGI-NKIALLS  393 (698)
Q Consensus       315 ~d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~-~~L~ILv  393 (698)
                      ....|+.|||++|.++...    .+....|.+.++.+.+.|||||+++|......  ..+.++++-.|+.|. |+|.|.|
T Consensus        64 ~~~~~~~wYr~~f~lp~~~----~~~~~~L~f~gv~~~a~v~vNG~~vg~~~~~~--~~~~~dIt~~l~~g~~N~l~V~v  137 (167)
T PF02837_consen   64 WDYSGYAWYRRTFTLPADW----KGKRVFLRFEGVDYAAEVYVNGKLVGSHEGGY--TPFEFDITDYLKPGEENTLAVRV  137 (167)
T ss_dssp             STCCSEEEEEEEEEESGGG----TTSEEEEEESEEESEEEEEETTEEEEEEESTT--S-EEEECGGGSSSEEEEEEEEEE
T ss_pred             cccCceEEEEEEEEeCchh----cCceEEEEeccceEeeEEEeCCeEEeeeCCCc--CCeEEeChhhccCCCCEEEEEEE
Confidence            3478999999999886432    24456789999999999999999999987543  346666665688887 9999999


Q ss_pred             eecCCcccccCCC-ccccceeecEEEc
Q 005404          394 IAVGLPNVGLHYE-TWETGVRGAVVLH  419 (698)
Q Consensus       394 en~Gr~N~g~~~~-~~~kGI~g~V~l~  419 (698)
                      .+...-.+-+.+. ....||.++|.|.
T Consensus       138 ~~~~~~~~~~~~~~~~~~GI~r~V~L~  164 (167)
T PF02837_consen  138 DNWPDGSTIPGFDYFNYAGIWRPVWLE  164 (167)
T ss_dssp             ESSSGGGCGBSSSEEE--EEESEEEEE
T ss_pred             eecCCCceeecCcCCccCccccEEEEE
Confidence            9655443311111 3568999999884


No 10 
>PRK10150 beta-D-glucuronidase; Provisional
Probab=96.74  E-value=0.0079  Score=70.81  Aligned_cols=100  Identities=22%  Similarity=0.191  Sum_probs=69.2

Q ss_pred             CCccEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCc-CeEEEEEe
Q 005404          316 DTSDYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGI-NKIALLSI  394 (698)
Q Consensus       316 d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~-~~L~ILve  394 (698)
                      +..|..|||++|.++...    .|....|.+.++.-.+.|||||++||...+..  ..+.+++.-.|+.|. |+|.|.|.
T Consensus        62 ~~~G~~WYrr~f~lp~~~----~gk~v~L~Fegv~~~a~V~lNG~~vg~~~~~~--~~f~~DIT~~l~~G~~n~L~V~v~  135 (604)
T PRK10150         62 NYVGDVWYQREVFIPKGW----AGQRIVLRFGSVTHYAKVWVNGQEVMEHKGGY--TPFEADITPYVYAGKSVRITVCVN  135 (604)
T ss_pred             CCcccEEEEEEEECCccc----CCCEEEEEECcccceEEEEECCEEeeeEcCCc--cceEEeCchhccCCCceEEEEEEe
Confidence            367999999999886431    24457899999999999999999999876532  346666554567775 49999998


Q ss_pred             ecCCcc---cccCC-------------C-ccccceeecEEEccc
Q 005404          395 AVGLPN---VGLHY-------------E-TWETGVRGAVVLHGL  421 (698)
Q Consensus       395 n~Gr~N---~g~~~-------------~-~~~kGI~g~V~l~g~  421 (698)
                      |.-+..   .|...             + ...-||.++|.|...
T Consensus       136 n~~~~~~~p~g~~~~~~~~~~k~~~~~d~~~~~GI~r~V~L~~~  179 (604)
T PRK10150        136 NELNWQTLPPGNVIEDGNGKKKQKYNFDFFNYAGIHRPVMLYTT  179 (604)
T ss_pred             cCCCcccCCCCccccCCccccccccccccccccCCCceEEEEEc
Confidence            742210   11000             0 135799999999543


No 11 
>PF02837 Glyco_hydro_2_N:  Glycosyl hydrolases family 2, sugar binding domain;  InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme.  This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=96.73  E-value=0.0031  Score=61.27  Aligned_cols=65  Identities=23%  Similarity=0.505  Sum_probs=50.0

Q ss_pred             CCceEEEEEEeCCCCC--CCeEEEeCCCc-eEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEe
Q 005404          468 QSLKWYKAYFDAPTGN--EPLALDLRSMG-KGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYH  544 (698)
Q Consensus       468 ~~~~~Yr~~F~~p~~~--d~~~Ld~~g~g-KG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~  544 (698)
                      .+..|||.+|++|...  ..++|.+.|.. ...|||||+-+|+-...              |           .. .-|-
T Consensus        67 ~~~~wYr~~f~lp~~~~~~~~~L~f~gv~~~a~v~vNG~~vg~~~~~--------------~-----------~~-~~~d  120 (167)
T PF02837_consen   67 SGYAWYRRTFTLPADWKGKRVFLRFEGVDYAAEVYVNGKLVGSHEGG--------------Y-----------TP-FEFD  120 (167)
T ss_dssp             CSEEEEEEEEEESGGGTTSEEEEEESEEESEEEEEETTEEEEEEEST--------------T-----------S--EEEE
T ss_pred             CceEEEEEEEEeCchhcCceEEEEeccceEeeEEEeCCeEEeeeCCC--------------c-----------CC-eEEe
Confidence            4679999999999753  35899999875 89999999999997621              1           22 2355


Q ss_pred             cCcccccCCc-ceEEE
Q 005404          545 VPRSWLKPTK-NLLVV  559 (698)
Q Consensus       545 vP~~~Lk~g~-N~ivv  559 (698)
                      |+. .|++|+ |+|.|
T Consensus       121 It~-~l~~g~~N~l~V  135 (167)
T PF02837_consen  121 ITD-YLKPGEENTLAV  135 (167)
T ss_dssp             CGG-GSSSEEEEEEEE
T ss_pred             Chh-hccCCCCEEEEE
Confidence            875 789888 98887


No 12 
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=96.33  E-value=0.015  Score=72.21  Aligned_cols=97  Identities=19%  Similarity=0.260  Sum_probs=68.3

Q ss_pred             cEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecCC
Q 005404          319 DYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVGL  398 (698)
Q Consensus       319 Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~Gr  398 (698)
                      |-.|||++|.++..-    .|....|.+.++.-.+.|||||++||...+.  ...+.|++.-.|+.|.|+|.|.|.+...
T Consensus       109 ~~g~Yrr~F~lp~~~----~gkrv~L~FeGV~s~a~VwvNG~~VG~~~g~--~~pfefDIT~~l~~G~N~LaV~V~~~~d  182 (1021)
T PRK10340        109 PTGAYQRTFTLSDGW----QGKQTIIKFDGVETYFEVYVNGQYVGFSKGS--RLTAEFDISAMVKTGDNLLCVRVMQWAD  182 (1021)
T ss_pred             CeEEEEEEEEeCccc----ccCcEEEEECccceEEEEEECCEEeccccCC--CccEEEEcchhhCCCccEEEEEEEecCC
Confidence            667999999886431    2455789999999999999999999986543  2346666554577888999999986532


Q ss_pred             cccccCCC-ccccceeecEEEccc
Q 005404          399 PNVGLHYE-TWETGVRGAVVLHGL  421 (698)
Q Consensus       399 ~N~g~~~~-~~~kGI~g~V~l~g~  421 (698)
                      -.|-...+ ....||..+|.|--.
T Consensus       183 ~s~le~qd~w~~sGI~R~V~L~~~  206 (1021)
T PRK10340        183 STYLEDQDMWWLAGIFRDVYLVGK  206 (1021)
T ss_pred             CCccccCCccccccccceEEEEEe
Confidence            22211100 134799999999544


No 13 
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=95.91  E-value=0.032  Score=69.38  Aligned_cols=95  Identities=20%  Similarity=0.234  Sum_probs=66.2

Q ss_pred             ccEEEEEEEeecCCCcccccCCC-CceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeec
Q 005404          318 SDYLWYMTSVEISSSESFLRGGQ-KPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAV  396 (698)
Q Consensus       318 ~Gy~~Y~t~i~~~~~~~~l~~g~-~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~  396 (698)
                      .+-.|||++|.++..-    .+. ...|.+.++.-.+.|||||+++|...+.  ...+.|++.-.|+.|.|+|.|.|...
T Consensus       119 n~~gwYrr~F~vp~~w----~~~~rv~L~FeGV~~~a~VwvNG~~VG~~~g~--~~pfefDIT~~l~~G~N~L~V~V~~~  192 (1027)
T PRK09525        119 NPTGCYSLTFTVDESW----LQSGQTRIIFDGVNSAFHLWCNGRWVGYSQDS--RLPAEFDLSPFLRAGENRLAVMVLRW  192 (1027)
T ss_pred             CCeEEEEEEEEeChhh----cCCCeEEEEECeeccEEEEEECCEEEEeecCC--CceEEEEChhhhcCCccEEEEEEEec
Confidence            3778999999886431    122 4689999999999999999999986542  23466666545778899999988533


Q ss_pred             CCcccccCCC----ccccceeecEEEccc
Q 005404          397 GLPNVGLHYE----TWETGVRGAVVLHGL  421 (698)
Q Consensus       397 Gr~N~g~~~~----~~~kGI~g~V~l~g~  421 (698)
                      -.   |..++    ....||..+|.|--.
T Consensus       193 sd---gs~~e~qd~w~~sGI~R~V~L~~~  218 (1027)
T PRK09525        193 SD---GSYLEDQDMWRMSGIFRDVSLLHK  218 (1027)
T ss_pred             CC---CCccccCCceeeccccceEEEEEc
Confidence            21   22122    134699999998543


No 14 
>PF02449 Glyco_hydro_42:  Beta-galactosidase;  InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=95.64  E-value=0.073  Score=58.90  Aligned_cols=87  Identities=17%  Similarity=0.200  Sum_probs=47.5

Q ss_pred             CCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEeeccCCCCCCCCCCcccccccCCCCCCCCC
Q 005404           93 KPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG  172 (698)
Q Consensus        93 ~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMfhGGTNfG~~~G~~~~~TSYDY~APl~E~G  172 (698)
                      ..++|.+++|.++| ...|+.......+..+....-..++.|+..+.|+-+ ..--+|.-.        | ..+.|+-+|
T Consensus       286 ~~~kpf~v~E~~~g-~~~~~~~~~~~~pg~~~~~~~~~~A~Ga~~i~~~~w-r~~~~g~E~--------~-~~g~~~~dg  354 (374)
T PF02449_consen  286 AKGKPFWVMEQQPG-PVNWRPYNRPPRPGELRLWSWQAIAHGADGILFWQW-RQSRFGAEQ--------F-HGGLVDHDG  354 (374)
T ss_dssp             TTT--EEEEEE--S---SSSSS-----TTHHHHHHHHHHHTT-S-EEEC-S-B--SSSTTT--------T-S--SB-TTS
T ss_pred             cCCCceEeecCCCC-CCCCccCCCCCCCCHHHHHHHHHHHHhCCeeEeeec-cCCCCCchh--------h-hcccCCccC
Confidence            47889999999999 566765544445566666566678999998777766 333334221        1 146788899


Q ss_pred             -CCCchhHHHHHHHHHHHHh
Q 005404          173 -LMRQPKYGHLKQLHEAIKL  191 (698)
Q Consensus       173 -~~~~~ky~~lr~l~~~~~~  191 (698)
                       .+ +++|.+++++.+.|+.
T Consensus       355 ~~~-~~~~~e~~~~~~~l~~  373 (374)
T PF02449_consen  355 REP-TRRYREVAQLGRELKK  373 (374)
T ss_dssp             --B--HHHHHHHHHHHHHHT
T ss_pred             CCC-CcHHHHHHHHHHHHhc
Confidence             66 7999999999887653


No 15 
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=93.93  E-value=0.13  Score=64.28  Aligned_cols=64  Identities=22%  Similarity=0.310  Sum_probs=46.8

Q ss_pred             CceEEEEEEeCCCCCC--CeEEEeCCC-ceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEec
Q 005404          469 SLKWYKAYFDAPTGNE--PLALDLRSM-GKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHV  545 (698)
Q Consensus       469 ~~~~Yr~~F~~p~~~d--~~~Ld~~g~-gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~v  545 (698)
                      +..|||.+|.+|+..+  -++|.+.|. ....|||||+-+|+-=              |.|+          |-  -|-|
T Consensus       109 ~~g~Yrr~F~lp~~~~gkrv~L~FeGV~s~a~VwvNG~~VG~~~--------------g~~~----------pf--efDI  162 (1021)
T PRK10340        109 PTGAYQRTFTLSDGWQGKQTIIKFDGVETYFEVYVNGQYVGFSK--------------GSRL----------TA--EFDI  162 (1021)
T ss_pred             CeEEEEEEEEeCcccccCcEEEEECccceEEEEEECCEEecccc--------------CCCc----------cE--EEEc
Confidence            5689999999997532  489999986 4689999999999642              2221          22  2435


Q ss_pred             CcccccCCcceEEE
Q 005404          546 PRSWLKPTKNLLVV  559 (698)
Q Consensus       546 P~~~Lk~g~N~ivv  559 (698)
                      .. .|++|+|+|+|
T Consensus       163 T~-~l~~G~N~LaV  175 (1021)
T PRK10340        163 SA-MVKTGDNLLCV  175 (1021)
T ss_pred             ch-hhCCCccEEEE
Confidence            54 67888898877


No 16 
>PRK10150 beta-D-glucuronidase; Provisional
Probab=92.87  E-value=0.24  Score=58.50  Aligned_cols=41  Identities=27%  Similarity=0.484  Sum_probs=34.6

Q ss_pred             CceEEEEEEeCCCCC--CCeEEEeCCC-ceEEEEECCeeeeeee
Q 005404          469 SLKWYKAYFDAPTGN--EPLALDLRSM-GKGQVWINGQSIGRYW  509 (698)
Q Consensus       469 ~~~~Yr~~F~~p~~~--d~~~Ld~~g~-gKG~vwVNG~nlGRYW  509 (698)
                      +..|||.+|++|+..  .-++|.+.|. ....|||||+.||+--
T Consensus        65 G~~WYrr~f~lp~~~~gk~v~L~Fegv~~~a~V~lNG~~vg~~~  108 (604)
T PRK10150         65 GDVWYQREVFIPKGWAGQRIVLRFGSVTHYAKVWVNGQEVMEHK  108 (604)
T ss_pred             ccEEEEEEEECCcccCCCEEEEEECcccceEEEEECCEEeeeEc
Confidence            678999999998643  2499999987 4799999999999764


No 17 
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=92.53  E-value=0.2  Score=62.55  Aligned_cols=41  Identities=15%  Similarity=0.261  Sum_probs=34.0

Q ss_pred             CCceEEEEEEeCCCCC--C-CeEEEeCCC-ceEEEEECCeeeeee
Q 005404          468 QSLKWYKAYFDAPTGN--E-PLALDLRSM-GKGQVWINGQSIGRY  508 (698)
Q Consensus       468 ~~~~~Yr~~F~~p~~~--d-~~~Ld~~g~-gKG~vwVNG~nlGRY  508 (698)
                      .+..|||.+|++|+..  . .++|.+.|. ....|||||+-+|.-
T Consensus       119 n~~gwYrr~F~vp~~w~~~~rv~L~FeGV~~~a~VwvNG~~VG~~  163 (1027)
T PRK09525        119 NPTGCYSLTFTVDESWLQSGQTRIIFDGVNSAFHLWCNGRWVGYS  163 (1027)
T ss_pred             CCeEEEEEEEEeChhhcCCCeEEEEECeeccEEEEEECCEEEEee
Confidence            3679999999999642  2 489999986 589999999999953


No 18 
>PF06832 BiPBP_C:  Penicillin-Binding Protein C-terminus Family;  InterPro: IPR009647 This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (IPR001460 from INTERPRO). It is predicted to be a beta fold.
Probab=73.32  E-value=4.8  Score=34.96  Aligned_cols=50  Identities=20%  Similarity=0.293  Sum_probs=34.3

Q ss_pred             ceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccc-cCCcCeEEEEEeecCCc
Q 005404          342 PTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANL-RAGINKIALLSIAVGLP  399 (698)
Q Consensus       342 ~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l-~~g~~~L~ILven~Gr~  399 (698)
                      ..|++.+-...++-||||+++|.....   ..+.+    .+ ..|.++|.+ ++..|+.
T Consensus        34 l~l~a~~~~~~~~W~vdg~~~g~~~~~---~~~~~----~~~~~G~h~l~v-vD~~G~~   84 (89)
T PF06832_consen   34 LVLKAAGGRGPVYWFVDGEPLGTTQPG---HQLFW----QPDRPGEHTLTV-VDAQGRS   84 (89)
T ss_pred             EEEEEeCCCCcEEEEECCEEcccCCCC---CeEEe----CCCCCeeEEEEE-EcCCCCE
Confidence            355555446699999999999876542   12332    34 678999987 7777764


No 19 
>PF14683 CBM-like:  Polysaccharide lyase family 4, domain III; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=70.94  E-value=4.4  Score=39.94  Aligned_cols=61  Identities=23%  Similarity=0.299  Sum_probs=28.4

Q ss_pred             CceEEEEECCeeeeeeee-ccccCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceEEEEEeeC
Q 005404          493 MGKGQVWINGQSIGRYWM-AYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLLVVFEELG  564 (698)
Q Consensus       493 ~gKG~vwVNG~nlGRYW~-~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~ivvfE~~g  564 (698)
                      -++=+|.||| ..+..+. ..++   ++|.+++       -+-.|..+.--|.||+.+|++|.|+|.|=-..|
T Consensus        92 ~~~~~V~vNg-~~~~~~~~~~~~---d~~~~r~-------g~~~G~~~~~~~~ipa~~L~~G~Nti~lt~~~g  153 (167)
T PF14683_consen   92 GGRLQVSVNG-WSGPFPSAPFGN---DNAIYRS-------GIHRGNYRLYEFDIPASLLKAGENTITLTVPSG  153 (167)
T ss_dssp             T-EEEEEETT-EE--------------S--GGG-------T---S---EEEEEE-TTSS-SEEEEEEEEEE-S
T ss_pred             CCCEEEEEcC-ccCCccccccCC---CCceeeC-------ceecccEEEEEEEEcHHHEEeccEEEEEEEccC
Confidence            4567899999 6777663 2211   2344433       122234566667799999999999997744444


No 20 
>KOG4729 consensus Galactoside-binding lectin [General function prediction only]
Probab=64.43  E-value=8.3  Score=40.30  Aligned_cols=87  Identities=9%  Similarity=-0.088  Sum_probs=63.8

Q ss_pred             CCCCceeeecCCCCeEeeeeeeccCCCC---CCCCC----CCCCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCC-
Q 005404          605 TGNAKVLLQCAPGQSITSIEFASFGTPS---GTCGS----FQKGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPC-  676 (698)
Q Consensus       605 ce~~~~~L~C~~g~~Is~I~~A~YGr~~---~~C~~----~~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC-  676 (698)
                      |..+++..-|+....+ .++.+.+++.+   ..|..    .....|.....+..+...|.+++.|.+..++.-++ -+| 
T Consensus       140 ~~p~~~~~~~~~~~~~-~~e~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ct~~~~~~~~~-~~~~  217 (265)
T KOG4729|consen  140 CTPSPTDPPRSEIRLE-CREGRRLAVYSAVMKTSPQKDPETEIRHECVSSVLPQLLRQCHAKEGCTLKSDGIKGH-CRHG  217 (265)
T ss_pred             ecCCCCCCccCcccch-hhhcccccccccccccCCCCcccCCCCceeecccchhhhhcccccCCceeecCCcccc-cccc
Confidence            6666777777777666 67778888863   34543    12333444467888999999999999999999988 566 


Q ss_pred             CCCCceEEEEEEeecCC
Q 005404          677 PNVLKRLSVQAVCSTAD  693 (698)
Q Consensus       677 ~g~~KYL~V~Y~C~p~~  693 (698)
                      ++..+|+-|.+.|.+..
T Consensus       218 ~~~~~~~~~n~e~~~~~  234 (265)
T KOG4729|consen  218 HLHKVYVTVTEEIFSEE  234 (265)
T ss_pred             ceeEEEEEecccccchh
Confidence            44578999999987654


No 21 
>PF08308 PEGA:  PEGA domain;  InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=61.24  E-value=7.4  Score=32.13  Aligned_cols=46  Identities=24%  Similarity=0.498  Sum_probs=28.2

Q ss_pred             eeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecCCcc
Q 005404          343 TLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVGLPN  400 (698)
Q Consensus       343 ~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~Gr~N  400 (698)
                      .|.|...-.-|.|||||+++|...       ..+.   .+..|.++|.|  +.-|...
T Consensus         3 ~l~V~s~p~gA~V~vdg~~~G~tp-------~~~~---~l~~G~~~v~v--~~~Gy~~   48 (71)
T PF08308_consen    3 TLRVTSNPSGAEVYVDGKYIGTTP-------LTLK---DLPPGEHTVTV--EKPGYEP   48 (71)
T ss_pred             EEEEEEECCCCEEEECCEEeccCc-------ceee---ecCCccEEEEE--EECCCee
Confidence            466666566789999999999432       2221   25567665544  5555443


No 22 
>PF11875 DUF3395:  Domain of unknown function (DUF3395);  InterPro: IPR024586 Chaperone DnaJ was originally characterised from Escherichia coli as a 41 kDa heat shock protein. DnaJ has a modular structure consisting of a J-domain, a proximal G/F-domain, and a distal zinc finger domain, followed by less conserved C-terminal sequences. Since then, a large number of DnaJ-related proteins containing a J-domain have been characterised from a variety of different organisms. In the genome of Arabidopsis thaliana a total of 89 J-domain proteins have been identified []. This entry represents a C-terminal domain found in some eukaryotic DnaJ-like proteins, including member 11 from the subfamily C1 and protein DnaJ 13 from Arabidopsis. This domain is typically between 147 to 176 amino acids in length. 
Probab=60.99  E-value=6.8  Score=37.96  Aligned_cols=67  Identities=24%  Similarity=0.236  Sum_probs=37.3

Q ss_pred             eeeeccCCCCC--CCCCC-----CCCCccCCChHHHHHHHhcCCCceEEEec----CCCccC-CCCC--CCCceEEEEEE
Q 005404          623 IEFASFGTPSG--TCGSF-----QKGTCHAPNSHAMLEKECIGQESCSIFIS----SGVFGT-DPCP--NVLKRLSVQAV  688 (698)
Q Consensus       623 I~~A~YGr~~~--~C~~~-----~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~----~~~Fg~-DPC~--g~~KYL~V~Y~  688 (698)
                      |..|.||....  .+...     ....+..-+.+-.+.  |.=+.++-....    ..+.|. ||||  |..|.|.|.|+
T Consensus        55 I~~A~YG~~~~~~~~~~~~~~~~~~~~~~~iDVTipLq--~lV~dS~L~l~~~~sKs~L~GF~DP~p~~ge~K~L~V~Y~  132 (151)
T PF11875_consen   55 ILKAWYGNLPAKSDESNNDEPEDPDLDPPVIDVTIPLQ--ALVKDSQLILPEGVSKSGLPGFYDPCPFLGEPKQLRVRYR  132 (151)
T ss_pred             EEEEEcCCcccccccccccccccccccCcEEEEhhhhh--hEeecCEEEEcCCCchhhCCCCCCCccccCCccEEEEEEE
Confidence            88899999742  33221     112233334443333  333344433222    123333 9999  88999999998


Q ss_pred             eec
Q 005404          689 CST  691 (698)
Q Consensus       689 C~p  691 (698)
                      ...
T Consensus       133 f~g  135 (151)
T PF11875_consen  133 FRG  135 (151)
T ss_pred             ECC
Confidence            753


No 23 
>PF11324 DUF3126:  Protein of unknown function (DUF3126);  InterPro: IPR021473  This family of proteins with unknown function appear to be restricted to Alphaproteobacteria. 
Probab=59.69  E-value=18  Score=29.79  Aligned_cols=24  Identities=17%  Similarity=0.545  Sum_probs=20.7

Q ss_pred             CcceEEEEEECCEEEEEEEccccc
Q 005404          348 SAGHAVHVFINGQFLGSAFGTREN  371 (698)
Q Consensus       348 ~~~D~~~Vfvng~~~G~~~~~~~~  371 (698)
                      +..|.|.||++++|+|++++....
T Consensus        25 k~~dsaEV~~g~EfiGvi~~Dede   48 (63)
T PF11324_consen   25 KKDDSAEVYIGDEFIGVIYRDEDE   48 (63)
T ss_pred             CCCCceEEEeCCEEEEEEEeecCC
Confidence            578999999999999999986533


No 24 
>PF08531 Bac_rhamnosid_N:  Alpha-L-rhamnosidase N-terminal domain;  InterPro: IPR013737 This domain is found in bacterial rhamnosidase A and B enzymes and is probably involved in substrate recognition. ; PDB: 2OKX_B.
Probab=56.16  E-value=53  Score=32.24  Aligned_cols=56  Identities=21%  Similarity=0.219  Sum_probs=30.8

Q ss_pred             eeeeCCcceEEEEEECCEEEEEEE--c--ccccc---ceEEeecccccCCcCeEEEEEeecCCc
Q 005404          343 TLTVESAGHAVHVFINGQFLGSAF--G--TRENR---RFTFSGPANLRAGINKIALLSIAVGLP  399 (698)
Q Consensus       343 ~L~v~~~~D~~~Vfvng~~~G~~~--~--~~~~~---~~~~~~~~~l~~g~~~L~ILven~Gr~  399 (698)
                      .|.|... .+..+||||+.||.-.  .  ....+   -.++++.--|+.|.|+|.|++-+....
T Consensus         7 ~l~isa~-g~Y~l~vNG~~V~~~~l~P~~t~y~~~~~Y~tyDVt~~L~~G~N~iav~lg~gw~~   69 (172)
T PF08531_consen    7 RLYISAL-GRYELYVNGERVGDGPLAPGWTDYDKRVYYQTYDVTPYLRPGENVIAVWLGNGWYN   69 (172)
T ss_dssp             EEEEEEE-SEEEEEETTEEEEEE--------BTTEEEEEEEE-TTT--TTEEEEEEEEEE--S-
T ss_pred             EEEEEeC-eeEEEEECCEEeeCCccccccccCCCceEEEEEeChHHhCCCCCEEEEEEeCCccc
Confidence            4555443 3668999999999643  1  11111   124444434788999999999775433


No 25 
>KOG2024 consensus Beta-Glucuronidase GUSB (glycosylhydrolase superfamily 2) [Carbohydrate transport and metabolism]
Probab=54.05  E-value=22  Score=37.44  Aligned_cols=58  Identities=24%  Similarity=0.313  Sum_probs=42.1

Q ss_pred             ccchhhcCCC---CCCccEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEE
Q 005404          305 IGLLEQINIT---RDTSDYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLG  363 (698)
Q Consensus       305 p~~~Eql~~t---~d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G  363 (698)
                      |.++-.+++.   +|-+|.+||+-++.++.+.. ...++...|++.+++-+|.|||||.-+=
T Consensus        71 pss~nDi~~d~~lrdfv~~~wyer~v~vpe~w~-~~~~~r~vlr~~s~H~~Aivwvng~~~~  131 (297)
T KOG2024|consen   71 PSSFNDIGQDWRLRDFVGLVWYERTVTVPESWT-QDLGKRVVLRIGSAHSYAIVWVNGVDAL  131 (297)
T ss_pred             ccchhccccCCccccceeeeEEEEEEEcchhhh-hhcCCeEEEEeecccceeEEEEcceeec
Confidence            3445566553   46899999999998875432 1224456889999999999999987643


No 26 
>PF08531 Bac_rhamnosid_N:  Alpha-L-rhamnosidase N-terminal domain;  InterPro: IPR013737 This domain is found in bacterial rhamnosidase A and B enzymes and is probably involved in substrate recognition. ; PDB: 2OKX_B.
Probab=49.41  E-value=23  Score=34.87  Aligned_cols=22  Identities=23%  Similarity=0.613  Sum_probs=19.4

Q ss_pred             EEEeCCCceEEEEECCeeeeee
Q 005404          487 ALDLRSMGKGQVWINGQSIGRY  508 (698)
Q Consensus       487 ~Ld~~g~gKG~vwVNG~nlGRY  508 (698)
                      .|.+++.|+=.+||||+.+|+-
T Consensus         7 ~l~isa~g~Y~l~vNG~~V~~~   28 (172)
T PF08531_consen    7 RLYISALGRYELYVNGERVGDG   28 (172)
T ss_dssp             EEEEEEESEEEEEETTEEEEEE
T ss_pred             EEEEEeCeeEEEEECCEEeeCC
Confidence            4778888999999999999975


No 27 
>PF07691 PA14:  PA14 domain;  InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=49.20  E-value=90  Score=28.83  Aligned_cols=70  Identities=16%  Similarity=0.323  Sum_probs=42.0

Q ss_pred             EEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEcccc-----ccceEEeecccccCC-cCeEEEEE
Q 005404          320 YLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRE-----NRRFTFSGPANLRAG-INKIALLS  393 (698)
Q Consensus       320 y~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~-----~~~~~~~~~~~l~~g-~~~L~ILv  393 (698)
                      .+.+++.|..+.+..+       ++.+. ..|.+.+||||+.+-...+...     .........+.|.+| .+.|.|..
T Consensus        47 ~~~~~G~~~~~~~G~y-------~f~~~-~~d~~~l~idg~~vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~y  118 (145)
T PF07691_consen   47 SVRWTGYFKPPETGTY-------TFSLT-SDDGARLWIDGKLVIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIEY  118 (145)
T ss_dssp             EEEEEEEEEESSSEEE-------EEEEE-ESSEEEEEETTEEEEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEEE
T ss_pred             EEEEEEEEecccCceE-------EEEEE-ecccEEEEECCEEEEcCCccccccccccccceEEEEEEeeCCeeEEEEEEE
Confidence            4568888887655432       34443 7789999999999977654321     001122223445554 67888876


Q ss_pred             eecC
Q 005404          394 IAVG  397 (698)
Q Consensus       394 en~G  397 (698)
                      .+.+
T Consensus       119 ~~~~  122 (145)
T PF07691_consen  119 FNRG  122 (145)
T ss_dssp             EECS
T ss_pred             EECC
Confidence            6655


No 28 
>PLN03059 beta-galactosidase; Provisional
Probab=42.19  E-value=1.2e+02  Score=37.20  Aligned_cols=42  Identities=26%  Similarity=0.395  Sum_probs=32.5

Q ss_pred             CCceEEEEEEeCCCCCC------CeEEEeCCCc-eEEEEECCeeeeeee
Q 005404          468 QSLKWYKAYFDAPTGNE------PLALDLRSMG-KGQVWINGQSIGRYW  509 (698)
Q Consensus       468 ~~~~~Yr~~F~~p~~~d------~~~Ld~~g~g-KG~vwVNG~nlGRYW  509 (698)
                      .+..||+++|+++....      .-.|.+.+.+ ..+|||||.-+|.-+
T Consensus       469 ~dYlwY~t~i~~~~~~~~~~~~~~~~L~v~~~~d~~~vFVNg~~~Gt~~  517 (840)
T PLN03059        469 TDYLWYMTEVHIDPDEGFLKTGQYPVLTIFSAGHALHVFINGQLAGTVY  517 (840)
T ss_pred             CceEEEEEEEeecCCccccccCCCceEEEcccCcEEEEEECCEEEEEEE
Confidence            46899999999865421      1237888776 699999999999875


No 29 
>TIGR01676 GLDHase galactonolactone dehydrogenase. This model represents L-Galactono-gamma-lactone dehydrogenase (EC 1.3.2.3). This enzyme catalyzes the final step in ascorbic acid biosynthesis in higher plants. This protein is homologous to ascorbic acid biosynthesis enzymes of other species: L-gulono-gamma-lactone oxidase in rat and L-galactono-gamma-lactone oxidase in yeast. All three covalently bind the cofactor FAD.
Probab=40.42  E-value=31  Score=40.32  Aligned_cols=41  Identities=12%  Similarity=-0.010  Sum_probs=29.5

Q ss_pred             cEEEEEeccccCCccccC---CCCcHHHHHHHHHHHHhCCCCcc
Q 005404           24 PIILSQIENEYGPESKSL---GAAGHAYVNWAAKMAVGLDTGVP   64 (698)
Q Consensus        24 PII~vQvENEYG~~~~~~---g~~d~~Ym~~L~~~a~~~Gi~vp   64 (698)
                      ++-.-|.|+||.--....   .-.+.++|+.|+++.++.++.++
T Consensus       375 ~~~f~q~~~EyaiP~e~~~~~~~~~~~~l~el~~~i~~~~~~~~  418 (541)
T TIGR01676       375 DCGGHQWVSETCFPAGTLAKPNMKDIEYIEELKQLIEKENIPAP  418 (541)
T ss_pred             ccccceEEEEEEEecccccccccchHHHHHHHHHHHHhcCCCCC
Confidence            555779999997521000   01469999999999999888875


No 30 
>KOG0496 consensus Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=39.53  E-value=7.9  Score=45.40  Aligned_cols=58  Identities=21%  Similarity=0.304  Sum_probs=50.1

Q ss_pred             cCCCCeEeeeeeeccCCCCCCCCCCCCCCccCCChHHHHHHHhcCCCceEEEecCCCc
Q 005404          614 CAPGQSITSIEFASFGTPSGTCGSFQKGTCHAPNSHAMLEKECIGQESCSIFISSGVF  671 (698)
Q Consensus       614 C~~g~~Is~I~~A~YGr~~~~C~~~~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~F  671 (698)
                      |..+.++..|..|.||...+.|+.+...+|.++.+...+.+.|-.+..|+|....+.+
T Consensus       332 ~ep~lv~gd~~~~kyg~~~~~C~~Fl~n~~~~~~~~v~f~~~~y~~~~~slsilpdck  389 (649)
T KOG0496|consen  332 CEPALVAGDITTAKYGNLREACAAFLSNNNGAPAAPVPFNKPKYRLPPWSLSILPDCK  389 (649)
T ss_pred             cCccccccCcccccccchhhHHHHHHhcCCCCCCCccccCCCccccCceeEEechhhc
Confidence            4567788888999999998889999888999999998999999999999998875443


No 31 
>COG2884 FtsE Predicted ATPase involved in cell division [Cell division and chromosome partitioning]
Probab=37.05  E-value=29  Score=35.23  Aligned_cols=16  Identities=25%  Similarity=0.661  Sum_probs=14.2

Q ss_pred             ceEEEEECCeeeeeee
Q 005404          494 GKGQVWINGQSIGRYW  509 (698)
Q Consensus       494 gKG~vwVNG~nlGRYW  509 (698)
                      .+|.|||||++|.|.=
T Consensus        55 t~G~i~~~~~dl~~l~   70 (223)
T COG2884          55 TRGKILVNGHDLSRLK   70 (223)
T ss_pred             CCceEEECCeeccccc
Confidence            5799999999999973


No 32 
>PF07009 DUF1312:  Protein of unknown function (DUF1312);  InterPro: IPR010739 This family consists of several bacterial proteins of around 120 residues in length. The function of this family is unknown.; PDB: 4ESN_B 1NPP_B 1M1G_D 1NPR_A 1M1H_A 2KPP_A 3LD7_C.
Probab=26.29  E-value=3.5e+02  Score=24.64  Aligned_cols=81  Identities=12%  Similarity=0.307  Sum_probs=47.2

Q ss_pred             CceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceEEEEEeeCCCCcceEE
Q 005404          493 MGKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLLVVFEELGGDASRISL  572 (698)
Q Consensus       493 ~gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~ivvfE~~g~~~~~i~~  572 (698)
                      -.+..|++||.-+.||=  + +.       ..             ..+++. |+.+   .|.|+|.|   ++   ..|++
T Consensus        26 ~~~~~I~~~g~~~~~i~--L-~~-------~~-------------~~~~i~-i~~~---~g~~~i~i---~~---g~vrv   72 (113)
T PF07009_consen   26 GKYAVIYVDGKEVKRIP--L-DK-------VN-------------EDKTIE-IDGD---GGYNTIEI---KD---GKVRV   72 (113)
T ss_dssp             EEEEEEEETTEEEEEEE--T-TS--------B-------------SEEEEE-EETT---TCEEEEEE---ET---TEEEE
T ss_pred             CeEEEEEECCEEEEEEE--C-CC-------CC-------------CCEEEE-EecC---CcEEEEEE---EC---CEEEE
Confidence            35789999999999993  2 10       01             344444 7553   34565544   22   34888


Q ss_pred             Eeec-ccccccccccCCCCCCCcccccCCCCCcCCCCceeeecCCCCeEeeee
Q 005404          573 VKRS-VARVCADAHEHHPTTDNNDIENKGNSNSTGNAKVLLQCAPGQSITSIE  624 (698)
Q Consensus       573 ~~~~-~~~~~~~~~e~hp~~~~~~~~~~~~~~~ce~~~~~L~C~~g~~Is~I~  624 (698)
                      .+.. .+++|.+.         .|..+.++         .+-|-+.+++..|.
T Consensus        73 ~~s~CpdkiCv~~---------G~I~~~G~---------~IVCLPn~lvI~I~  107 (113)
T PF07009_consen   73 IESDCPDKICVKT---------GWISRPGQ---------SIVCLPNRLVIEIE  107 (113)
T ss_dssp             EEESTSS-HHHHS----------SB-STT----------EEEETTTTEEEEEE
T ss_pred             EECCCCCcchhhC---------CCcCCCCC---------EEEEcCCEEEEEEE
Confidence            7775 67888876         45444444         47787766554555


No 33 
>PF02055 Glyco_hydro_30:  O-Glycosyl hydrolase family 30;  InterPro: IPR001139 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 30 GH30 from CAZY comprises enzymes with only one known activity; glucosylceramidase (3.2.1.45 from EC). Family 30 encompasses the mammalian glucosylceramidases. Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase), cleaves the glucosidic bonds of glucosylceramide and synthetic beta-glucosides []. Any one of over 50 different mutations in the gene of glucocerebrosidase have been found to affect activity of this hydrolase, producing variants of Gaucher disease, the most prevalent lysosomal storage disease [, ].; GO: 0004348 glucosylceramidase activity, 0006665 sphingolipid metabolic process, 0007040 lysosome organization, 0005764 lysosome; PDB: 2VT0_B 1NOF_A 2Y24_A 2WCG_B 2J25_A 3GXM_D 1Y7V_B 2NT0_C 3GXF_C 3GXD_A ....
Probab=23.73  E-value=2e+02  Score=33.44  Aligned_cols=220  Identities=18%  Similarity=0.261  Sum_probs=99.0

Q ss_pred             HHHHHHHHHhccCceeccCCcEEEEEeccccCCc---cccCCC------CcHHHHH-HHHHHHHhCCC--CcceEeecCC
Q 005404            4 FTQKIVQMMKNEKLFASQGGPIILSQIENEYGPE---SKSLGA------AGHAYVN-WAAKMAVGLDT--GVPWVMCKED   71 (698)
Q Consensus         4 ~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~---~~~~g~------~d~~Ym~-~L~~~a~~~Gi--~vp~~~~~~~   71 (698)
                      |-+.+++.|.  -+ .++|=||=++=+-||=...   ...|..      .-++.++ +|.-.+++.|+  ++-++..+..
T Consensus       206 yA~Y~vkfi~--aY-~~~GI~i~aiT~QNEP~~~~~~~~~~~s~~~t~~~~~~Fi~~~LgP~l~~~~~g~d~kI~~~D~n  282 (496)
T PF02055_consen  206 YADYFVKFIQ--AY-KKEGIPIWAITPQNEPDNGSDPNYPWPSMGWTPEEQADFIKNYLGPALRKAGLGKDVKILIYDHN  282 (496)
T ss_dssp             HHHHHHHHHH--HH-HCTT--ESEEESSSSCCGGGSTT-SSC--B--HHHHHHHHHHTHHHHHHTSTT-TTSEEEEEEEE
T ss_pred             HHHHHHHHHH--HH-HHCCCCeEEEeccCCCCCCCCCCCCCCcCCCCHHHHHHHHHHHHHHHHHhcCCCCceEEEEEecC
Confidence            3444445554  22 2677799999999994320   001111      1255665 47778888877  7777776652


Q ss_pred             --CCCC---cccc------CCCC--ccc--c--------cCCCCCCCCCceeeeccCccccccCCCcC---CCCHHHHHH
Q 005404           72 --DAPD---PVIN------SCNG--FYC--D--------AFSPNKPYKPTLWTEAWSGWFTEFGGAVH---RRPVQDLAF  125 (698)
Q Consensus        72 --~~~~---~v~~------~~ng--~~~--~--------~f~~~~p~~P~~~~E~~~Gwf~~wG~~~~---~~~~~~~~~  125 (698)
                        ..|+   .++.      ...|  +++  .        .+....|+..++.||-..|.. .|+....   ...++.++.
T Consensus       283 ~~~~~~~~~~il~d~~A~~yv~GiA~HwY~g~~~~~~l~~~h~~~P~k~l~~TE~~~g~~-~~~~~~~~g~w~~~~~y~~  361 (496)
T PF02055_consen  283 RDNLPDYADTILNDPEAAKYVDGIAFHWYGGDPSPQALDQVHNKFPDKFLLFTEACCGSW-NWDTSVDLGSWDRAERYAH  361 (496)
T ss_dssp             GGGTTHHHHHHHTSHHHHTTEEEEEEEETTCS-HCHHHHHHHHHSTTSEEEEEEEESS-S-TTS-SS-TTHHHHHHHHHH
T ss_pred             CcccchhhhhhhcChhhHhheeEEEEECCCCCchhhHHHHHHHHCCCcEEEeeccccCCC-CcccccccccHHHHHHHHH
Confidence              2221   1221      1111  111  1        111346888999999976542 1221111   111233444


Q ss_pred             HHHHHHHcCCceeeeeEe------eccCCCCCC-CCCCcccccccCCCCCCCCCC-CCchhHHHHHHHHHHHHhhhcccc
Q 005404          126 AVARFIQKGGSFFNYYMY------HGGTNFGRT-AGGPFITTSYDYDAPLDEYGL-MRQPKYGHLKQLHEAIKLCEYALV  197 (698)
Q Consensus       126 ~~~~~l~~g~s~~n~YMf------hGGTNfG~~-~G~~~~~TSYDY~APl~E~G~-~~~~ky~~lr~l~~~~~~~~~~l~  197 (698)
                      .+..-|..+++  ++-++      .||-|++.- ..++..+..        +.+. .++|.|+.|..+.+|++.-...+.
T Consensus       362 ~ii~~lnn~~~--gw~~WNl~LD~~GGP~~~~n~~d~~iivd~--------~~~~~~~~p~yY~~gHfSKFV~PGa~RI~  431 (496)
T PF02055_consen  362 DIIGDLNNWVS--GWIDWNLALDENGGPNWVGNFCDAPIIVDS--------DTGEFYKQPEYYAMGHFSKFVRPGAVRIG  431 (496)
T ss_dssp             HHHHHHHTTEE--EEEEEESEBETTS---TT---B--SEEEEG--------GGTEEEE-HHHHHHHHHHTTS-TT-EEEE
T ss_pred             HHHHHHHhhce--eeeeeeeecCCCCCCcccCCCCCceeEEEc--------CCCeEEEcHHHHHHHHHhcccCCCCEEEE
Confidence            44444566644  22222      388887532 112221111        1121 126899999998888764322221


Q ss_pred             CCCCcccCCCCcceeeeeecCCcceeeEeeccCCccc-eEEEeC
Q 005404          198 SSDPTVTSLGTYQQAHVFSAGQQKCAAFLSNYNTKSA-ARVTFN  240 (698)
Q Consensus       198 ~~~p~~~~~~~~~~~~~y~~~~~~~~~Fl~n~~~~~~-~~v~~~  240 (698)
                      ..   ........+..+|...++..++-|.|...... .+|++.
T Consensus       432 st---~~~~~~~l~~vAF~nPDGs~vvVv~N~~~~~~~~~v~v~  472 (496)
T PF02055_consen  432 ST---SSSSDSGLEAVAFLNPDGSIVVVVLNRGDSDQNFSVTVK  472 (496)
T ss_dssp             EE---ESSSTTTEEEEEEEETTSEEEEEEEE-SSS-EEEEEEEE
T ss_pred             ee---ccCCCCceeEEEEECCCCCEEEEEEcCCCCccceEEEEe
Confidence            10   00001134555666666777766666544332 245553


No 34 
>PF14307 Glyco_tran_WbsX:  Glycosyltransferase WbsX
Probab=21.82  E-value=1.9e+02  Score=31.76  Aligned_cols=61  Identities=8%  Similarity=0.125  Sum_probs=46.4

Q ss_pred             hhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecC
Q 005404            2 QGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKE   70 (698)
Q Consensus         2 ~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~   70 (698)
                      ++.++.|++..++..++--+|=||++|=--.+.        ++-++.|+.+++.|+++|+..+.+....
T Consensus       138 ~~~~~~l~~~F~D~rYikVdGKPv~~Iy~p~~~--------pd~~~~~~~wr~~a~~~G~~giyii~~~  198 (345)
T PF14307_consen  138 KEHFRYLLPYFKDPRYIKVDGKPVFLIYRPGDI--------PDIKEMIERWREEAKEAGLPGIYIIAVQ  198 (345)
T ss_pred             HHHHHHHHHHhCCCCceeECCEEEEEEECcccc--------cCHHHHHHHHHHHHHHcCCCceEEEEEe
Confidence            456888999999877776788899998433222        2457899999999999999877665443


No 35 
>smart00758 PA14 domain in bacterial beta-glucosidases other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins, and bacterial toxins.
Probab=21.73  E-value=1.6e+02  Score=27.04  Aligned_cols=38  Identities=21%  Similarity=0.604  Sum_probs=30.4

Q ss_pred             eEEEEEEeCCCCCCCeEEEeCCCceEEEEECCeeeeeee
Q 005404          471 KWYKAYFDAPTGNEPLALDLRSMGKGQVWINGQSIGRYW  509 (698)
Q Consensus       471 ~~Yr~~F~~p~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW  509 (698)
                      .-+.++|.+|+... .-+.+.+-+...+||||.-+-..|
T Consensus        46 ~~~~g~i~~~~~G~-y~f~~~~~~~~~l~Idg~~vid~~   83 (136)
T smart00758       46 VRWTGYLKPPEDGE-YTFSITSDDGARLWIDGKLVIDNW   83 (136)
T ss_pred             EEEEEEEECCCCcc-EEEEEEcCCcEEEEECCcEEEcCC
Confidence            45678898887655 556777888899999999888777


No 36 
>PF07691 PA14:  PA14 domain;  InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=21.37  E-value=1.1e+02  Score=28.11  Aligned_cols=38  Identities=24%  Similarity=0.615  Sum_probs=29.7

Q ss_pred             eEEEEEEeCCCCCCCeEEEeCCCceEEEEECCeeeeeee
Q 005404          471 KWYKAYFDAPTGNEPLALDLRSMGKGQVWINGQSIGRYW  509 (698)
Q Consensus       471 ~~Yr~~F~~p~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW  509 (698)
                      .=++++|++|.... .-|.+..-+...+||||.-+-..|
T Consensus        48 ~~~~G~~~~~~~G~-y~f~~~~~d~~~l~idg~~vid~~   85 (145)
T PF07691_consen   48 VRWTGYFKPPETGT-YTFSLTSDDGARLWIDGKLVIDNW   85 (145)
T ss_dssp             EEEEEEEEESSSEE-EEEEEEESSEEEEEETTEEEEECS
T ss_pred             EEEEEEEecccCce-EEEEEEecccEEEEECCEEEEcCC
Confidence            34789999987654 445555667789999999999998


No 37 
>smart00758 PA14 domain in bacterial beta-glucosidases other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins, and bacterial toxins.
Probab=20.65  E-value=3.1e+02  Score=25.13  Aligned_cols=65  Identities=17%  Similarity=0.274  Sum_probs=35.9

Q ss_pred             EEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCC-cCeEEEEE
Q 005404          320 YLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAG-INKIALLS  393 (698)
Q Consensus       320 y~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g-~~~L~ILv  393 (698)
                      .+.+++.|..+.+..       -++.+ ...|.+.+||||+.+-...+... ........+.|.+| .+.|.|..
T Consensus        45 ~~~~~g~i~~~~~G~-------y~f~~-~~~~~~~l~Idg~~vid~~~~~~-~~~~~~~~v~l~~g~~~~i~v~y  110 (136)
T smart00758       45 SVRWTGYLKPPEDGE-------YTFSI-TSDDGARLWIDGKLVIDNWGKHE-ARPSTSSTLYLLAGGTYPIRIEY  110 (136)
T ss_pred             EEEEEEEEECCCCcc-------EEEEE-EcCCcEEEEECCcEEEcCCccCC-CccccceeEEEeCCcEEEEEEEE
Confidence            356788887664432       24555 45788999999998765322111 00111123445555 45665544


No 38 
>COG2360 Aat Leu/Phe-tRNA-protein transferase [Posttranslational modification, protein turnover, chaperones]
Probab=20.32  E-value=1e+02  Score=31.60  Aligned_cols=35  Identities=23%  Similarity=0.344  Sum_probs=28.2

Q ss_pred             CceeccCCcEEEEEecccc----CCccccCCCCcHHHHHHHHH
Q 005404           16 KLFASQGGPIILSQIENEY----GPESKSLGAAGHAYVNWAAK   54 (698)
Q Consensus        16 ~l~~~~GGPII~vQvENEY----G~~~~~~g~~d~~Ym~~L~~   54 (698)
                      .++...|+.+|=.|++||.    |.    +.-..++|++.|++
T Consensus       167 ~~L~~~g~~LiD~Q~~n~HL~~~GA----~~ipr~~y~~~L~~  205 (221)
T COG2360         167 EHLRRHGFVLIDCQVLNEHLASLGA----YEIPRKEYLNYLRR  205 (221)
T ss_pred             HHHHhcCceEEeeecCCHHHHhcCC----eecCHHHHHHHHHH
Confidence            3455889999999999994    44    33478999999998


Done!