Query 005404
Match_columns 698
No_of_seqs 308 out of 1475
Neff 6.7
Searched_HMMs 46136
Date Thu Mar 28 22:54:52 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/005404.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/005404hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03059 beta-galactosidase; P 100.0 2E-168 5E-173 1437.6 62.6 686 1-690 154-840 (840)
2 KOG0496 Beta-galactosidase [Ca 100.0 3E-107 7E-112 899.6 28.5 502 1-586 144-648 (649)
3 PF01301 Glyco_hydro_35: Glyco 100.0 3.7E-45 8.1E-50 391.5 9.7 179 1-188 119-318 (319)
4 KOG4729 Galactoside-binding le 99.8 1E-21 2.2E-26 197.4 8.2 92 604-696 40-136 (265)
5 PF02140 Gal_Lectin: Galactose 99.8 2.1E-20 4.7E-25 160.7 5.5 76 612-689 1-80 (80)
6 PF13364 BetaGal_dom4_5: Beta- 99.0 1.3E-09 2.9E-14 99.6 7.5 68 467-560 33-104 (111)
7 COG1874 LacA Beta-galactosidas 98.8 4.8E-09 1E-13 121.3 5.8 164 3-175 132-333 (673)
8 PF13364 BetaGal_dom4_5: Beta- 98.5 4.1E-07 9E-12 83.2 9.8 84 308-398 24-110 (111)
9 PF02837 Glyco_hydro_2_N: Glyc 98.3 6.7E-06 1.5E-10 80.0 11.0 99 315-419 64-164 (167)
10 PRK10150 beta-D-glucuronidase; 96.7 0.0079 1.7E-07 70.8 11.1 100 316-421 62-179 (604)
11 PF02837 Glyco_hydro_2_N: Glyc 96.7 0.0031 6.6E-08 61.3 6.4 65 468-559 67-135 (167)
12 PRK10340 ebgA cryptic beta-D-g 96.3 0.015 3.4E-07 72.2 10.5 97 319-421 109-206 (1021)
13 PRK09525 lacZ beta-D-galactosi 95.9 0.032 7E-07 69.4 10.4 95 318-421 119-218 (1027)
14 PF02449 Glyco_hydro_42: Beta- 95.6 0.073 1.6E-06 58.9 10.9 87 93-191 286-373 (374)
15 PRK10340 ebgA cryptic beta-D-g 93.9 0.13 2.8E-06 64.3 7.8 64 469-559 109-175 (1021)
16 PRK10150 beta-D-glucuronidase; 92.9 0.24 5.1E-06 58.5 7.5 41 469-509 65-108 (604)
17 PRK09525 lacZ beta-D-galactosi 92.5 0.2 4.3E-06 62.6 6.5 41 468-508 119-163 (1027)
18 PF06832 BiPBP_C: Penicillin-B 73.3 4.8 0.0001 35.0 4.1 50 342-399 34-84 (89)
19 PF14683 CBM-like: Polysacchar 70.9 4.4 9.4E-05 39.9 3.6 61 493-564 92-153 (167)
20 KOG4729 Galactoside-binding le 64.4 8.3 0.00018 40.3 4.2 87 605-693 140-234 (265)
21 PF08308 PEGA: PEGA domain; I 61.2 7.4 0.00016 32.1 2.6 46 343-400 3-48 (71)
22 PF11875 DUF3395: Domain of un 61.0 6.8 0.00015 38.0 2.7 67 623-691 55-135 (151)
23 PF11324 DUF3126: Protein of u 59.7 18 0.00039 29.8 4.5 24 348-371 25-48 (63)
24 PF08531 Bac_rhamnosid_N: Alph 56.2 53 0.0012 32.2 8.2 56 343-399 7-69 (172)
25 KOG2024 Beta-Glucuronidase GUS 54.1 22 0.00048 37.4 5.1 58 305-363 71-131 (297)
26 PF08531 Bac_rhamnosid_N: Alph 49.4 23 0.00049 34.9 4.3 22 487-508 7-28 (172)
27 PF07691 PA14: PA14 domain; I 49.2 90 0.0019 28.8 8.3 70 320-397 47-122 (145)
28 PLN03059 beta-galactosidase; P 42.2 1.2E+02 0.0027 37.2 9.7 42 468-509 469-517 (840)
29 TIGR01676 GLDHase galactonolac 40.4 31 0.00067 40.3 4.2 41 24-64 375-418 (541)
30 KOG0496 Beta-galactosidase [Ca 39.5 7.9 0.00017 45.4 -0.7 58 614-671 332-389 (649)
31 COG2884 FtsE Predicted ATPase 37.1 29 0.00063 35.2 2.8 16 494-509 55-70 (223)
32 PF07009 DUF1312: Protein of u 26.3 3.5E+02 0.0075 24.6 7.9 81 493-624 26-107 (113)
33 PF02055 Glyco_hydro_30: O-Gly 23.7 2E+02 0.0043 33.4 7.1 220 4-240 206-472 (496)
34 PF14307 Glyco_tran_WbsX: Glyc 21.8 1.9E+02 0.0041 31.8 6.2 61 2-70 138-198 (345)
35 smart00758 PA14 domain in bact 21.7 1.6E+02 0.0035 27.0 4.9 38 471-509 46-83 (136)
36 PF07691 PA14: PA14 domain; I 21.4 1.1E+02 0.0025 28.1 3.9 38 471-509 48-85 (145)
37 smart00758 PA14 domain in bact 20.6 3.1E+02 0.0067 25.1 6.6 65 320-393 45-110 (136)
38 COG2360 Aat Leu/Phe-tRNA-prote 20.3 1E+02 0.0022 31.6 3.3 35 16-54 167-205 (221)
No 1
>PLN03059 beta-galactosidase; Provisional
Probab=100.00 E-value=2.2e-168 Score=1437.61 Aligned_cols=686 Identities=63% Similarity=1.144 Sum_probs=623.8
Q ss_pred ChhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecCCCCCCccccC
Q 005404 1 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKEDDAPDPVINS 80 (698)
Q Consensus 1 ~~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~~~~~~~v~~~ 80 (698)
|++|+++|+++|++++||+++||||||+|||||||++.++|+.+|++||+||++||+++|++||||||++.++++++++|
T Consensus 154 v~~~~~~l~~~l~~~~l~~~~GGPIImvQIENEYGs~~~~~~~~d~~Yl~~l~~~~~~~Gi~VPl~t~dg~~~~~~v~~t 233 (840)
T PLN03059 154 MQKFTEKIVDMMKSEKLFEPQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAADMAVKLGTGVPWVMCKQEDAPDPVIDT 233 (840)
T ss_pred HHHHHHHHHHHHhhcceeecCCCcEEEEEecccccceecccCcchHHHHHHHHHHHHHcCCCcceEECCCCCCCccceec
Confidence 68999999999988899999999999999999999998778889999999999999999999999999998888889999
Q ss_pred CCCcccccCCCCCCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEeeccCCCCCCCCCCcccc
Q 005404 81 CNGFYCDAFSPNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPFITT 160 (698)
Q Consensus 81 ~ng~~~~~f~~~~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMfhGGTNfG~~~G~~~~~T 160 (698)
+||.+++.|...++.+|+|+||||+|||++||++++.|+++|++.+++++|++|+|++||||||||||||||||+++++|
T Consensus 234 ~Ng~~~~~f~~~~~~~P~m~tE~w~GWf~~wG~~~~~r~~~d~a~~~~~~l~~g~S~~N~YMfhGGTNFG~~~Ga~~~~T 313 (840)
T PLN03059 234 CNGFYCENFKPNKDYKPKMWTEAWTGWYTEFGGAVPNRPAEDLAFSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIAT 313 (840)
T ss_pred CCCchhhhcccCCCCCCcEEeccCchhHhhcCCCCCcCCHHHHHHHHHHHHHcCCeeEEeeeccCcCCcccccCCCcccc
Confidence 99988888888777789999999999999999999999999999999999999999889999999999999999999999
Q ss_pred cccCCCCCCCCCCCCchhHHHHHHHHHHHHhhhccccCCCCcccCCCCcceeeeeecCCcceeeEeeccCCccceEEEeC
Q 005404 161 SYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALVSSDPTVTSLGTYQQAHVFSAGQQKCAAFLSNYNTKSAARVTFN 240 (698)
Q Consensus 161 SYDY~APl~E~G~~~~~ky~~lr~l~~~~~~~~~~l~~~~p~~~~~~~~~~~~~y~~~~~~~~~Fl~n~~~~~~~~v~~~ 240 (698)
||||||||+|+|++|.|||.+||++|++++.+++.|+..+|....+++.+++++|.... .|++|+.|++.+...+|+|+
T Consensus 314 SYDYdAPL~E~G~~t~pKy~~lr~l~~~~~~~~~~l~~~~p~~~~lg~~~ea~~y~~~~-~caaFl~n~~~~~~~~v~f~ 392 (840)
T PLN03059 314 SYDYDAPLDEYGLPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVFKSKS-ACAAFLANYDTKYSVKVTFG 392 (840)
T ss_pred ccccCCccccccCcchhHHHHHHHHHHHHHhcCccccCCCCceeccCCceeEEEccCcc-chhhheeccCCCCceeEEEC
Confidence 99999999999999546999999999999988888887777777889999999999666 79999999998888999999
Q ss_pred CeeeecCCCceeecCCCCceeeccceeeeeeeeeeccccccccccccccccccccccCCCccccccchhhcCCCCCCccE
Q 005404 241 GRQYNLPPWSISILPDCKNVVFNTAKVAVQHTKMQMLPTGSKLLSWETYDEDISSLGESSTLTAIGLLEQINITRDTSDY 320 (698)
Q Consensus 241 ~~~~~~p~~sv~il~~~~~~~~~t~~v~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~p~~~Eql~~t~d~~Gy 320 (698)
|++|.||+|||+|||||+.++|+|++++.|++.++..+. ...+.|+++.|++.+...+.+++...++||+++|+|.+||
T Consensus 393 g~~y~lp~~Svsilpd~~~~lfnta~v~~q~~~~~~~~~-~~~~~w~~~~e~~~~~~~~~~~~~e~l~e~~n~t~d~~dY 471 (840)
T PLN03059 393 NGQYDLPPWSVSILPDCKTAVFNTARLGAQSSQMKMNPV-GSTFSWQSYNEETASAYTDDTTTMDGLWEQINVTRDATDY 471 (840)
T ss_pred CcccccCccceeecccccceeeeccccccccceeecccc-cccccceeecccccccccCCCcchhhHHHhhcccCCCCce
Confidence 999999999999999999999999999888766644322 3456999999995543233567777889999999999999
Q ss_pred EEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecCCcc
Q 005404 321 LWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVGLPN 400 (698)
Q Consensus 321 ~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~Gr~N 400 (698)
+||+|+|....++..++.+.+.+|+|.+++|++||||||+++|+.++......++++.+++++.|.|+|+||||||||+|
T Consensus 472 lwY~t~i~~~~~~~~~~~~~~~~L~v~~~~d~~~vFVNg~~~Gt~~~~~~~~~~~~~~~v~l~~g~n~L~iLse~vG~~N 551 (840)
T PLN03059 472 LWYMTEVHIDPDEGFLKTGQYPVLTIFSAGHALHVFINGQLAGTVYGELSNPKLTFSQNVKLTVGINKISLLSVAVGLPN 551 (840)
T ss_pred EEEEEEEeecCCccccccCCCceEEEcccCcEEEEEECCEEEEEEEeecCCcceEEecccccCCCceEEEEEEEeCCCCc
Confidence 99999998876654456777889999999999999999999999998766667888878889999999999999999999
Q ss_pred cccCCCccccceeecEEEccccCCCccCccCCcEEeccCccccccccCCCCCCCCcccCCCccccccCCceEEEEEEeCC
Q 005404 401 VGLHYETWETGVRGAVVLHGLDHGNKDLTWQKWSYQVGLKGEAMNLVSPSEATSVDWTRGSLAAQGQQSLKWYKAYFDAP 480 (698)
Q Consensus 401 ~g~~~~~~~kGI~g~V~l~g~~~~~~~l~~~~W~~~~~l~~e~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~Yr~~F~~p 480 (698)
||++|+++.|||+|+|+|+|.+.++++|+++.|.|+++|.||.++|+.+++...+.|...+..+. .++|+|||++|++|
T Consensus 552 yG~~le~~~kGI~g~V~i~g~~~g~~dls~~~W~y~lgL~GE~~~i~~~~~~~~~~W~~~~~~~~-~~p~twYK~~Fd~p 630 (840)
T PLN03059 552 VGLHFETWNAGVLGPVTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTITGSSSVEWVEGSLLAQ-KQPLTWYKTTFDAP 630 (840)
T ss_pred cCcccccccccccccEEEecccCCceecccCccccccCccceeccccccCCCCCccccccccccC-CCCceEEEEEEeCC
Confidence 99999999999999999999878888999889999999999999988876556788976544333 45799999999999
Q ss_pred CCCCCeEEEeCCCceEEEEECCeeeeeeeeccc-cCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceEEE
Q 005404 481 TGNEPLALDLRSMGKGQVWINGQSIGRYWMAYA-KGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLLVV 559 (698)
Q Consensus 481 ~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW~~~~-~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~ivv 559 (698)
++.|||||||++||||+|||||+||||||+.++ .+.|..|+|+|.|++.+|+|+||+|||+|||||++|||+|+|+|||
T Consensus 631 ~g~Dpv~LDm~gmGKG~aWVNG~nIGRYW~~~a~~~gC~~c~y~g~~~~~kc~~~cggP~q~lYHVPr~~Lk~g~N~lVi 710 (840)
T PLN03059 631 GGNDPLALDMSSMGKGQIWINGQSIGRHWPAYTAHGSCNGCNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLIV 710 (840)
T ss_pred CCCCCEEEecccCCCeeEEECCcccccccccccccCCCccccccccccchhhhccCCCceeEEEeCcHHHhccCCceEEE
Confidence 999999999999999999999999999997642 2334889999999999999999999999999999999999999999
Q ss_pred EEeeCCCCcceEEEeecccccccccccCCCCCCCcccccCCCCCcCCCCceeeecCCCCeEeeeeeeccCCCCCCCCCCC
Q 005404 560 FEELGGDASRISLVKRSVARVCADAHEHHPTTDNNDIENKGNSNSTGNAKVLLQCAPGQSITSIEFASFGTPSGTCGSFQ 639 (698)
Q Consensus 560 fE~~g~~~~~i~~~~~~~~~~~~~~~e~hp~~~~~~~~~~~~~~~ce~~~~~L~C~~g~~Is~I~~A~YGr~~~~C~~~~ 639 (698)
|||+|++|..|+|.++++++||++++|+|| ++.+|++.....-.-....++|+|+.|++|+.|.+|+|||+.++|+++.
T Consensus 711 FEe~gg~p~~I~~~~~~~~~~c~~~~e~~p-~~~~w~~~~~~~~~~~~~~~~L~C~~G~~Is~I~fAsYGrp~gtC~~~~ 789 (840)
T PLN03059 711 FEEWGGNPAGISLVKRTTDSVCADIFEGQP-ALKNWQIIASGKVNSLQPKAHLWCPPGQKISKIKFASFGVPQGTCGSFR 789 (840)
T ss_pred EEecCCCCCceEEEEeecCcccccccccCC-ccccccccccccccccCCcEEEECCCCceEEEEEEecCCCCCCCCCCCC
Confidence 999999999999999999999999999995 6999999555432355678999999999997799999999999999999
Q ss_pred CCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCCCCCCceEEEEEEee
Q 005404 640 KGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPCPNVLKRLSVQAVCS 690 (698)
Q Consensus 640 ~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC~g~~KYL~V~Y~C~ 690 (698)
+++|++++++++|+++|+||++|+|.+++.+||+|||+||+|||+|+|+|.
T Consensus 790 ~g~C~a~~S~~vV~kaC~Gk~~CsV~asn~~FggDPC~gt~KyL~V~~~Cs 840 (840)
T PLN03059 790 EGSCHAHKSYDAFERNCIGKQSCSVTVAPEVFGGDPCPDSMKKLSVEAVCS 840 (840)
T ss_pred CCCEeCCcHHHHHHHHCCCCCceEEEeccceecCCCCCCceeEEEEEEEeC
Confidence 999999999999999999999999999999996699999999999999994
No 2
>KOG0496 consensus Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=100.00 E-value=3.1e-107 Score=899.63 Aligned_cols=502 Identities=58% Similarity=1.049 Sum_probs=451.5
Q ss_pred ChhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecCCCCCCccccC
Q 005404 1 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKEDDAPDPVINS 80 (698)
Q Consensus 1 ~~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~~~~~~~v~~~ 80 (698)
|++|+++|+|++| +|+++|||||||+|||||||.+...|++..+.|++|-+.|+...+.+|||++|.+.++|+.++++
T Consensus 144 ~~~~~~~iv~~mk--~L~~~qGGPIIl~QIENEYG~~~~~~~~~~k~y~~w~a~m~~~l~~gvpw~mCk~~dapd~~in~ 221 (649)
T KOG0496|consen 144 MERWTTKIVPMMK--KLFASQGGPIILVQIENEYGNYLRALGAEGKSYLKWAAVLATSLGTGVPWVMCKQDDAPDPGINT 221 (649)
T ss_pred HHHHHHHHHHHHH--HHHhhcCCCEEEEEeechhhHHHHHHHHHHHHhhccceEEEEecCCCCceeEecCCCCCCccccc
Confidence 8999999999999 99999999999999999999998888888999999999999999999999999999999999999
Q ss_pred CCCccc-ccCC-CCCCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEeeccCCCCCCCCCCcc
Q 005404 81 CNGFYC-DAFS-PNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPFI 158 (698)
Q Consensus 81 ~ng~~~-~~f~-~~~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMfhGGTNfG~~~G~~~~ 158 (698)
|||++| +.|. +++|++|+||||||+|||++||++++.|++++++..+++++++|+|++||||||||||||++|| ++.
T Consensus 222 cng~~c~~~f~~pn~~~kP~~wtE~wtgwf~~wGg~~~~R~~e~ia~~va~fls~ggs~vNyYM~hGGTNFGrt~G-~~~ 300 (649)
T KOG0496|consen 222 CNGFYCGDTFKRPNSPNKPLVWTENWTGWFTHWGGPHPCRPVEDIALSVARFLSKGGSSVNYYMYHGGTNFGRTNG-PFI 300 (649)
T ss_pred cCCccchhhhccCCCCCCCceecccccchhhhhCCCCCCCCHHHHHHHHHHHHhcCccceEEEEeecccCCCcccC-ccc
Confidence 999999 8888 8899999999999999999999999999999999999999999989999999999999999998 899
Q ss_pred cccccCCCCCCCCCCCCchhHHHHHHHHHHHHhhhccccCCCCcccCCCCcceeeeeecCCcceeeEeeccCCccceEEE
Q 005404 159 TTSYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALVSSDPTVTSLGTYQQAHVFSAGQQKCAAFLSNYNTKSAARVT 238 (698)
Q Consensus 159 ~TSYDY~APl~E~G~~~~~ky~~lr~l~~~~~~~~~~l~~~~p~~~~~~~~~~~~~y~~~~~~~~~Fl~n~~~~~~~~v~ 238 (698)
+|||||||||+ |..++|||.++|.+|..+..+++.+..+++.. ..|....++|..||.|++......+.
T Consensus 301 atsy~~dap~d--gl~~~pk~ghlk~~hts~d~~ep~lv~gd~~~---------~kyg~~~~~C~~Fl~n~~~~~~~~v~ 369 (649)
T KOG0496|consen 301 ATSYDYDAPLD--GLLRQPKYGHLKPLHTSYDYCEPALVAGDITT---------AKYGNLREACAAFLSNNNGAPAAPVP 369 (649)
T ss_pred ccccccccccc--hhhcCCCccccccchhhhhhcCccccccCccc---------ccccchhhHHHHHHhcCCCCCCCccc
Confidence 99999999999 99999999999999999999888876665433 34555556799999999988888999
Q ss_pred eCCeeeecCCCceeecCCCCceeeccceeeeeeeeeeccccccccccccccccccccccCCCccccccchhhcCCCCCCc
Q 005404 239 FNGRQYNLPPWSISILPDCKNVVFNTAKVAVQHTKMQMLPTGSKLLSWETYDEDISSLGESSTLTAIGLLEQINITRDTS 318 (698)
Q Consensus 239 ~~~~~~~~p~~sv~il~~~~~~~~~t~~v~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~p~~~Eql~~t~d~~ 318 (698)
+++..+.+|+|+++|+++|+.++|+|+++.+ .|....||++ +|..++ .+
T Consensus 370 f~~~~y~~~~~slsilpdck~~~~nta~~~~---------------~~~~~~e~~~-------------~~~~~~---~~ 418 (649)
T KOG0496|consen 370 FNKPKYRLPPWSLSILPDCKTVVYNTAKVMA---------------QWISFTEPIP-------------SEAVGQ---SF 418 (649)
T ss_pred cCCCccccCceeEEechhhcchhhhcccccc---------------ccccccCCCc-------------cccccC---cc
Confidence 9999999999999999999999999988743 1544445544 566655 78
Q ss_pred cEEEEEEEeecCCCcccccCCCCceeeeC-CcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecC
Q 005404 319 DYLWYMTSVEISSSESFLRGGQKPTLTVE-SAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVG 397 (698)
Q Consensus 319 Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~-~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~G 397 (698)
||++|++.++...+++ ..|+|. +++|++||||||+++|++++......+.+..++.|..|.|+|+|||||+|
T Consensus 419 ~~ll~~~~~t~d~sd~-------t~~~i~ls~g~~~hVfvNg~~~G~~~g~~~~~~~~~~~~~~l~~g~n~l~iL~~~~G 491 (649)
T KOG0496|consen 419 GGLLEQTNLTKDKSDT-------TSLKIPLSLGHALHVFVNGEFAGSLHGNNEKIKLNLSQPVGLKAGENKLALLSENVG 491 (649)
T ss_pred eEEEEEEeeccccCCC-------ceEeecccccceEEEEECCEEeeeEeccccceeEEeecccccccCcceEEEEEEecC
Confidence 8999999998765542 468888 99999999999999999998765666777778889999999999999999
Q ss_pred CcccccCCCccccceeecEEEccccCCCccCccCCcEEeccCccccccccCCCCCCCCcccCCCccccccCCceEEEEEE
Q 005404 398 LPNVGLHYETWETGVRGAVVLHGLDHGNKDLTWQKWSYQVGLKGEAMNLVSPSEATSVDWTRGSLAAQGQQSLKWYKAYF 477 (698)
Q Consensus 398 r~N~g~~~~~~~kGI~g~V~l~g~~~~~~~l~~~~W~~~~~l~~e~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~Yr~~F 477 (698)
|+||| +++++.|||+|+|+|+|. ++++++.|.|+++|.+|.+.++..++..+++|......+. .+|.+||+ +|
T Consensus 492 ~~n~G-~~e~~~~Gi~g~v~l~g~----~~l~~~~w~~~~gl~ge~~~~~~~~~~~~v~w~~~~~~~~-k~P~~w~k-~f 564 (649)
T KOG0496|consen 492 LPNYG-HFENDFKGILGPVYLNGL----IDLTWTKWPYKVGLKGEKLGLHTEEGSSKVKWKKLSNTAT-KQPLTWYK-TF 564 (649)
T ss_pred CCCcC-cccccccccccceEEeee----eccceeecceecccccchhhccccccccccceeeccCccc-CCCeEEEE-Ee
Confidence 99999 789999999999999997 4677678889999999999999888778899987755444 36889999 99
Q ss_pred eCCCCCCCeEEEeCCCceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceE
Q 005404 478 DAPTGNEPLALDLRSMGKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLL 557 (698)
Q Consensus 478 ~~p~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~i 557 (698)
++|++.+||||||.|||||+|||||+|||||||++ | ||++|| ||++|||++.|.|
T Consensus 565 ~~p~g~~~t~Ldm~g~GKG~vwVNG~niGRYW~~~-----------G-------------~Q~~yh-vPr~~Lk~~~N~l 619 (649)
T KOG0496|consen 565 DIPSGSEPTALDMNGWGKGQVWVNGQNIGRYWPSF-----------G-------------PQRTYH-VPRSWLKPSGNLL 619 (649)
T ss_pred cCCCCCCCeEEecCCCcceEEEECCcccccccCCC-----------C-------------CceEEE-CcHHHhCcCCceE
Confidence 99999999999999999999999999999999875 5 876665 9999999999999
Q ss_pred EEEEeeCCCCcceEEEeeccccccccccc
Q 005404 558 VVFEELGGDASRISLVKRSVARVCADAHE 586 (698)
Q Consensus 558 vvfE~~g~~~~~i~~~~~~~~~~~~~~~e 586 (698)
|||||++++|..|+|+++.+..+|..+.|
T Consensus 620 vvfEee~~~p~~i~~~~~~~~~~~~~v~~ 648 (649)
T KOG0496|consen 620 VVFEEEGGDPNGISFVTRPVLSTCAYVRE 648 (649)
T ss_pred EEEEeccCCCccceEEEeEeeeEeeeccc
Confidence 99999999999999999998899988765
No 3
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=100.00 E-value=3.7e-45 Score=391.54 Aligned_cols=179 Identities=39% Similarity=0.704 Sum_probs=130.9
Q ss_pred ChhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCC-cceEeecCCC-------
Q 005404 1 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTG-VPWVMCKEDD------- 72 (698)
Q Consensus 1 ~~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~-vp~~~~~~~~------- 72 (698)
|++|+++|+++|+ ++|+++||||||||||||||++ ..|++||+.|++++++.|++ +++++++...
T Consensus 119 ~~~~~~~~~~~~~--~~~~~~GGpII~vQvENEyg~~-----~~~~~Y~~~l~~~~~~~g~~~~~~~t~d~~~~~~~~~~ 191 (319)
T PF01301_consen 119 VERWYRALAKIIK--PLQYTNGGPIIMVQVENEYGSY-----GTDRAYMEALKDAYRDWGIDPVLLYTTDGPWGSWLPDG 191 (319)
T ss_dssp HHHHHHHHHHHHG--GGBGGGTSSEEEEEESSSGGCT-----SS-HHHHHHHHHHHHHTT-SSSBEEEEESSSHCCHCCC
T ss_pred HHHHHHHHHHHHH--hhhhcCCCceehhhhhhhhCCC-----cccHhHHHHHHHHHHHhhCccceeeccCCCcccccccC
Confidence 5799999999999 9999999999999999999963 38999999999999999998 6677776531
Q ss_pred -CCCccccCCCCccccc--------CCCCCCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEe
Q 005404 73 -APDPVINSCNGFYCDA--------FSPNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMY 143 (698)
Q Consensus 73 -~~~~v~~~~ng~~~~~--------f~~~~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMf 143 (698)
.++..+.+++++.+.. +...+|++|+|++|||+|||++||++++.+++++++..++++|+.|.+ +|||||
T Consensus 192 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~p~~P~~~~E~~~Gwf~~WG~~~~~~~~~~~~~~l~~~l~~g~~-~nyYM~ 270 (319)
T PF01301_consen 192 GLPGADIYATDNFPPGDNPDEYFGDQRSFQPNQPLMCTEFWGGWFDHWGGPHYTRPAEDVAADLARMLSKGNS-LNYYMF 270 (319)
T ss_dssp -TTTGSCEEEEEETTTSSHHHHHHHHHHHHTTS--EEEEEESS---BTTS--HHHHHHHHHHHHHHHHHHCSE-EEEEEC
T ss_pred CCCcceEEeccccCCCchHHHHHhhhhhcCCCCCeEEEEeccccccccCCCCccCCHHHHHHHHHHHHHhhcc-cceeec
Confidence 2222233333333321 124578899999999999999999999989999999999999999965 799999
Q ss_pred eccCCCCCCCCCCcc----cccccCCCCCCCCCCCCchhHHHHHHHHHH
Q 005404 144 HGGTNFGRTAGGPFI----TTSYDYDAPLDEYGLMRQPKYGHLKQLHEA 188 (698)
Q Consensus 144 hGGTNfG~~~G~~~~----~TSYDY~APl~E~G~~~~~ky~~lr~l~~~ 188 (698)
|||||||+++|++.. +|||||+|||+|+|++ ||||++||+||++
T Consensus 271 hGGTNfG~~~ga~~~~~p~~TSYDY~ApI~E~G~~-~~Ky~~lr~l~~~ 318 (319)
T PF01301_consen 271 HGGTNFGFWAGANYYGQPDITSYDYDAPIDEYGQL-TPKYYELRRLHQK 318 (319)
T ss_dssp E--B--TT-B-EETTTEEB-SB--TT-SB-TTS-B--HHHHHHHHHHHT
T ss_pred cccCCccccccCCCCCCCCcccCCcCCccCcCCCc-CHHHHHHHHHHhc
Confidence 999999999887654 5999999999999999 6999999999864
No 4
>KOG4729 consensus Galactoside-binding lectin [General function prediction only]
Probab=99.85 E-value=1e-21 Score=197.44 Aligned_cols=92 Identities=30% Similarity=0.622 Sum_probs=85.3
Q ss_pred cCCCCceeeecCCCCeEeeeeeeccCCCC-CCCCCC----CCCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCCCC
Q 005404 604 STGNAKVLLQCAPGQSITSIEFASFGTPS-GTCGSF----QKGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPCPN 678 (698)
Q Consensus 604 ~ce~~~~~L~C~~g~~Is~I~~A~YGr~~-~~C~~~----~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC~g 678 (698)
+|||+.++|+||.|.+| +|++|+|||.. ..|.+. .+.+|..+.+++++.++|+++++|.|.|+..+|+.|||||
T Consensus 40 aCdG~~i~L~CP~~dvI-sv~sanYGR~~~~iC~pd~~~~~Si~C~~p~s~~i~~~rCnnr~~C~vvv~s~~F~~DPCPg 118 (265)
T KOG4729|consen 40 ACDGERITLSCPRGDVI-SVQSANYGRFSDKICDPDPGREESINCYLPKSFSILSSRCNNRRQCTVVVDSDVFGDDPCPG 118 (265)
T ss_pred eecCceEEEEcCCCCEE-EEEecccCcccccccCCccccccchhccChHHHHHHHHhcCCCceEEEEecCCccCCCCCCC
Confidence 69999999999999999 59999999994 799752 4689999999999999999999999999999999999999
Q ss_pred CCceEEEEEEeecCCCCC
Q 005404 679 VLKRLSVQAVCSTADANT 696 (698)
Q Consensus 679 ~~KYL~V~Y~C~p~~~~~ 696 (698)
|+|||+|+|.|+|.++.+
T Consensus 119 T~KYLev~Y~Cvp~~~~~ 136 (265)
T KOG4729|consen 119 TSKYLEVQYGCVPYAFTE 136 (265)
T ss_pred chhheEEEeccCcccccc
Confidence 999999999999996543
No 5
>PF02140 Gal_Lectin: Galactose binding lectin domain; InterPro: IPR000922 The D-galactoside binding lectin purified from sea urchin (Anthocidaris crassispina) eggs exists as a disulphide-linked homodimer of two subunits; the dimeric form is essential for hemagglutination activity []. The sea urchin egg lectin (SUEL) forms a new class of lectins. Although SUEL was first isolated as a D-galactoside binding lectin, it was latter shown that it bind to L-rhamnose preferentially [, ]. L-rhamnose and D-galactose share the same hydroxyl group orientation at C2 and C4 of the pyranose ring structure. A cysteine-rich domain homologous to the SUEL protein has been identified in the following proteins [, , ]: Plant beta-galactosidases (3.2.1.23 from EC) (lactases). Mammalian latrophilin, the calcium independent receptor of alpha-latrotoxin (CIRL). The galactose-binding lectin domain is not required for alpha-latratoxin binding []. Human lectomedin-1. Rhamnose-binding lectin (SAL) from catfish (Silurus asotus, Namazu) eggs. This protein is composed of three tandem repeat domains homologous to the SUEL lectin domain. All cysteine positions of each domain are completely conserved []. The hypothetical B0457.1, F32A7.3A and F32A7.3B proteins from Caenorhabditis elegans. The human KIAA0821 protein. ; GO: 0005529 sugar binding; PDB: 2JXA_A 2JX9_A 2ZX2_A 2ZX3_B 2ZX0_B 2ZX1_B 2ZX4_B.
Probab=99.81 E-value=2.1e-20 Score=160.72 Aligned_cols=76 Identities=41% Similarity=0.852 Sum_probs=62.1
Q ss_pred eecCCCCeEeeeeeeccCCCC-CCCCCC---CCCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCCCCCCceEEEEE
Q 005404 612 LQCAPGQSITSIEFASFGTPS-GTCGSF---QKGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPCPNVLKRLSVQA 687 (698)
Q Consensus 612 L~C~~g~~Is~I~~A~YGr~~-~~C~~~---~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC~g~~KYL~V~Y 687 (698)
|+|+.|++| .|.+|+|||+. .+|+.. .+.+|.+++++++|+++|+||++|+|.+++.+|| ||||++.|||+|+|
T Consensus 1 L~C~~g~~I-~I~~A~YGR~~~~~C~~~~~~~~~~C~~~~~~~~v~~~C~g~~~C~v~~~~~~f~-dpC~~~~KyL~V~Y 78 (80)
T PF02140_consen 1 LSCPPGKVI-SIDSAFYGRTSSSICPSSSSGSNTNCSAPDALSIVKERCNGKQSCSVPADNSVFG-DPCPGTSKYLEVTY 78 (80)
T ss_dssp EE-STTEEE-EEEEEEEEBSSSSTT--GGGCS-TTB--TTHHHHHHHHHTTBSEEEEESSHHHH---SSTTS--EEEEEE
T ss_pred CCCcCCCEE-EEEEeecCCCCCCCCcCCCcCCCCccccccccchhHHhCCCCCccEEEeccCccC-CCCCCCCeEEEEEE
Confidence 799999999 69999999995 699743 4678999999999999999999999999999998 99999999999999
Q ss_pred Ee
Q 005404 688 VC 689 (698)
Q Consensus 688 ~C 689 (698)
+|
T Consensus 79 ~C 80 (80)
T PF02140_consen 79 TC 80 (80)
T ss_dssp EE
T ss_pred EC
Confidence 99
No 6
>PF13364 BetaGal_dom4_5: Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=98.96 E-value=1.3e-09 Score=99.62 Aligned_cols=68 Identities=32% Similarity=0.711 Sum_probs=49.2
Q ss_pred cCCceEEEEEEeCCCCCC-CeE-EEe--CCCceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceee
Q 005404 467 QQSLKWYKAYFDAPTGNE-PLA-LDL--RSMGKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRW 542 (698)
Q Consensus 467 ~~~~~~Yr~~F~~p~~~d-~~~-Ld~--~g~gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtl 542 (698)
..+..|||++|.... .| .+. |+. ....+++|||||++|||||+.+ | ||+++
T Consensus 33 ~~g~~~Yrg~F~~~~-~~~~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~-----------g-------------~q~tf 87 (111)
T PF13364_consen 33 HAGYLWYRGTFTGTG-QDTSLTPLNIQGGNAFRASVWVNGWFLGSYWPGI-----------G-------------PQTTF 87 (111)
T ss_dssp SSCEEEEEEEEETTT-EEEEEE-EEECSSTTEEEEEEETTEEEEEEETTT-----------E-------------CCEEE
T ss_pred CCCCEEEEEEEeCCC-cceeEEEEeccCCCceEEEEEECCEEeeeecCCC-----------C-------------ccEEE
Confidence 347899999996421 22 233 333 3467999999999999999643 5 99988
Q ss_pred EecCcccccCCcceEEEE
Q 005404 543 YHVPRSWLKPTKNLLVVF 560 (698)
Q Consensus 543 Y~vP~~~Lk~g~N~ivvf 560 (698)
+ ||+++|+.+.|.|+|+
T Consensus 88 ~-~p~~il~~~n~v~~vl 104 (111)
T PF13364_consen 88 S-VPAGILKYGNNVLVVL 104 (111)
T ss_dssp E-E-BTTBTTCEEEEEEE
T ss_pred E-eCceeecCCCEEEEEE
Confidence 8 9999999875555554
No 7
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=98.79 E-value=4.8e-09 Score=121.33 Aligned_cols=164 Identities=19% Similarity=0.180 Sum_probs=98.1
Q ss_pred hHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecCC------C--CC
Q 005404 3 GFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKED------D--AP 74 (698)
Q Consensus 3 ~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~~------~--~~ 74 (698)
++.+.|+++|+ ++++++|++|||+|++||||.+ .|..+|.+.....+.+.+-+ .+-..+.. + ..
T Consensus 132 ~~~~~i~~~ir--er~~~~~~~v~~w~~dneY~~~-----~~~~~~~~~~f~~wLk~~yg-~l~~ln~~w~t~~ws~t~~ 203 (673)
T COG1874 132 EYLDRILQQIR--ERLYGNGPAVITWQNDNEYGGH-----PCYCDYCQAAFRLWLKKGYG-SLDNLNEAWGTSFWSHTYK 203 (673)
T ss_pred HHHHHHHHHHH--HHHhccCCceeEEEccCccCCc-----cccccccHHHHHHHHHhCcc-hHHhhhhhhhhhhcccccc
Confidence 45667788898 7779999999999999999984 24455555555543333332 11111110 0 00
Q ss_pred C-ccccCCC-----Cccc--ccCCCCCCCC----CceeeeccCccc-cccCCCcCCCC-HHHHHHHHHHHHHcCCceeee
Q 005404 75 D-PVINSCN-----GFYC--DAFSPNKPYK----PTLWTEAWSGWF-TEFGGAVHRRP-VQDLAFAVARFIQKGGSFFNY 140 (698)
Q Consensus 75 ~-~v~~~~n-----g~~~--~~f~~~~p~~----P~~~~E~~~Gwf-~~wG~~~~~~~-~~~~~~~~~~~l~~g~s~~n~ 140 (698)
+ ..|.+.+ +... -++......+ +....|.+-+|| +.|..+.-... .+.-...++..|..... -||
T Consensus 204 ~~~~i~~p~~~~e~~~~~~~ld~~~f~~e~~~~~~~~~~~~~~~~~P~~pvt~nl~~~~~~~~~~~~~~~ld~~sw-dny 282 (673)
T COG1874 204 DFDEIMSPNPFGELPLPGLYLDYRRFESEQILEFVREEGEAIKAYFPNRPVTPNLLAAFKKFDAYKWEKVLDFASW-DNY 282 (673)
T ss_pred cHHhhcCCCCccccCCccchhhHhhhhhhhhHHHHHHHHHHHHHhCCCCCCChhHhhhhhhcchHHHHHhcChhhh-hhh
Confidence 0 0011111 0000 0122222233 556677788888 77766543333 23334455666666655 699
Q ss_pred eEeeccCCCC------CCCCCC---c-------ccccccCCCCCCCCCCCC
Q 005404 141 YMYHGGTNFG------RTAGGP---F-------ITTSYDYDAPLDEYGLMR 175 (698)
Q Consensus 141 YMfhGGTNfG------~~~G~~---~-------~~TSYDY~APl~E~G~~~ 175 (698)
||||+|++|+ +.+|+. + ..+++++++.+.+.|.++
T Consensus 283 ~~~~~~~~~~~~~h~l~r~~~~~~~~~~me~~P~~vn~~~~n~~~~~G~~~ 333 (673)
T COG1874 283 PAWHRGRDFTKFIHDLFRNGKQGQPFWLMEQLPSVVNWALYNKLKRPGALR 333 (673)
T ss_pred hhhccccchhhhhHHHHHhhccCCceeeccCCcchhhhhhccCCCCCcccc
Confidence 9999999999 777764 2 579999999999999853
No 8
>PF13364 BetaGal_dom4_5: Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=98.55 E-value=4.1e-07 Score=83.24 Aligned_cols=84 Identities=27% Similarity=0.377 Sum_probs=58.7
Q ss_pred hhhcCCCCCCccEEEEEEEeecCCCcccccCCCCce-eee-CCcceEEEEEECCEEEEEEEccccccceEEeecc-cccC
Q 005404 308 LEQINITRDTSDYLWYMTSVEISSSESFLRGGQKPT-LTV-ESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPA-NLRA 384 (698)
Q Consensus 308 ~Eql~~t~d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~-L~v-~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~-~l~~ 384 (698)
.+..+..++..|++|||++|.....+. ... |.+ .+.+.+++|||||+++|+..... ..+.+|++|. .|+.
T Consensus 24 ~l~~~~~g~~~g~~~Yrg~F~~~~~~~------~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~-g~q~tf~~p~~il~~ 96 (111)
T PF13364_consen 24 VLYASDYGFHAGYLWYRGTFTGTGQDT------SLTPLNIQGGNAFRASVWVNGWFLGSYWPGI-GPQTTFSVPAGILKY 96 (111)
T ss_dssp STCCGCGTSSSCEEEEEEEEETTTEEE------EEE-EEECSSTTEEEEEEETTEEEEEEETTT-ECCEEEEE-BTTBTT
T ss_pred eeccCccccCCCCEEEEEEEeCCCcce------eEEEEeccCCCceEEEEEECCEEeeeecCCC-CccEEEEeCceeecC
Confidence 455555567999999999997543331 124 444 36899999999999999987322 2335666665 3556
Q ss_pred CcCeEEEEEeecCC
Q 005404 385 GINKIALLSIAVGL 398 (698)
Q Consensus 385 g~~~L~ILven~Gr 398 (698)
++++|.||+++||+
T Consensus 97 ~n~v~~vl~~~~g~ 110 (111)
T PF13364_consen 97 GNNVLVVLWDNMGH 110 (111)
T ss_dssp CEEEEEEEEE-STT
T ss_pred CCEEEEEEEeCCCC
Confidence 67789999999996
No 9
>PF02837 Glyco_hydro_2_N: Glycosyl hydrolases family 2, sugar binding domain; InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=98.25 E-value=6.7e-06 Score=79.99 Aligned_cols=99 Identities=24% Similarity=0.334 Sum_probs=71.2
Q ss_pred CCCccEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCc-CeEEEEE
Q 005404 315 RDTSDYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGI-NKIALLS 393 (698)
Q Consensus 315 ~d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~-~~L~ILv 393 (698)
....|+.|||++|.++... .+....|.+.++.+.+.|||||+++|...... ..+.++++-.|+.|. |+|.|.|
T Consensus 64 ~~~~~~~wYr~~f~lp~~~----~~~~~~L~f~gv~~~a~v~vNG~~vg~~~~~~--~~~~~dIt~~l~~g~~N~l~V~v 137 (167)
T PF02837_consen 64 WDYSGYAWYRRTFTLPADW----KGKRVFLRFEGVDYAAEVYVNGKLVGSHEGGY--TPFEFDITDYLKPGEENTLAVRV 137 (167)
T ss_dssp STCCSEEEEEEEEEESGGG----TTSEEEEEESEEESEEEEEETTEEEEEEESTT--S-EEEECGGGSSSEEEEEEEEEE
T ss_pred cccCceEEEEEEEEeCchh----cCceEEEEeccceEeeEEEeCCeEEeeeCCCc--CCeEEeChhhccCCCCEEEEEEE
Confidence 3478999999999886432 24456789999999999999999999987543 346666665688887 9999999
Q ss_pred eecCCcccccCCC-ccccceeecEEEc
Q 005404 394 IAVGLPNVGLHYE-TWETGVRGAVVLH 419 (698)
Q Consensus 394 en~Gr~N~g~~~~-~~~kGI~g~V~l~ 419 (698)
.+...-.+-+.+. ....||.++|.|.
T Consensus 138 ~~~~~~~~~~~~~~~~~~GI~r~V~L~ 164 (167)
T PF02837_consen 138 DNWPDGSTIPGFDYFNYAGIWRPVWLE 164 (167)
T ss_dssp ESSSGGGCGBSSSEEE--EEESEEEEE
T ss_pred eecCCCceeecCcCCccCccccEEEEE
Confidence 9655443311111 3568999999884
No 10
>PRK10150 beta-D-glucuronidase; Provisional
Probab=96.74 E-value=0.0079 Score=70.81 Aligned_cols=100 Identities=22% Similarity=0.191 Sum_probs=69.2
Q ss_pred CCccEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCc-CeEEEEEe
Q 005404 316 DTSDYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGI-NKIALLSI 394 (698)
Q Consensus 316 d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~-~~L~ILve 394 (698)
+..|..|||++|.++... .|....|.+.++.-.+.|||||++||...+.. ..+.+++.-.|+.|. |+|.|.|.
T Consensus 62 ~~~G~~WYrr~f~lp~~~----~gk~v~L~Fegv~~~a~V~lNG~~vg~~~~~~--~~f~~DIT~~l~~G~~n~L~V~v~ 135 (604)
T PRK10150 62 NYVGDVWYQREVFIPKGW----AGQRIVLRFGSVTHYAKVWVNGQEVMEHKGGY--TPFEADITPYVYAGKSVRITVCVN 135 (604)
T ss_pred CCcccEEEEEEEECCccc----CCCEEEEEECcccceEEEEECCEEeeeEcCCc--cceEEeCchhccCCCceEEEEEEe
Confidence 367999999999886431 24457899999999999999999999876532 346666554567775 49999998
Q ss_pred ecCCcc---cccCC-------------C-ccccceeecEEEccc
Q 005404 395 AVGLPN---VGLHY-------------E-TWETGVRGAVVLHGL 421 (698)
Q Consensus 395 n~Gr~N---~g~~~-------------~-~~~kGI~g~V~l~g~ 421 (698)
|.-+.. .|... + ...-||.++|.|...
T Consensus 136 n~~~~~~~p~g~~~~~~~~~~k~~~~~d~~~~~GI~r~V~L~~~ 179 (604)
T PRK10150 136 NELNWQTLPPGNVIEDGNGKKKQKYNFDFFNYAGIHRPVMLYTT 179 (604)
T ss_pred cCCCcccCCCCccccCCccccccccccccccccCCCceEEEEEc
Confidence 742210 11000 0 135799999999543
No 11
>PF02837 Glyco_hydro_2_N: Glycosyl hydrolases family 2, sugar binding domain; InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=96.73 E-value=0.0031 Score=61.27 Aligned_cols=65 Identities=23% Similarity=0.505 Sum_probs=50.0
Q ss_pred CCceEEEEEEeCCCCC--CCeEEEeCCCc-eEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEe
Q 005404 468 QSLKWYKAYFDAPTGN--EPLALDLRSMG-KGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYH 544 (698)
Q Consensus 468 ~~~~~Yr~~F~~p~~~--d~~~Ld~~g~g-KG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~ 544 (698)
.+..|||.+|++|... ..++|.+.|.. ...|||||+-+|+-... | .. .-|-
T Consensus 67 ~~~~wYr~~f~lp~~~~~~~~~L~f~gv~~~a~v~vNG~~vg~~~~~--------------~-----------~~-~~~d 120 (167)
T PF02837_consen 67 SGYAWYRRTFTLPADWKGKRVFLRFEGVDYAAEVYVNGKLVGSHEGG--------------Y-----------TP-FEFD 120 (167)
T ss_dssp CSEEEEEEEEEESGGGTTSEEEEEESEEESEEEEEETTEEEEEEEST--------------T-----------S--EEEE
T ss_pred CceEEEEEEEEeCchhcCceEEEEeccceEeeEEEeCCeEEeeeCCC--------------c-----------CC-eEEe
Confidence 4679999999999753 35899999875 89999999999997621 1 22 2355
Q ss_pred cCcccccCCc-ceEEE
Q 005404 545 VPRSWLKPTK-NLLVV 559 (698)
Q Consensus 545 vP~~~Lk~g~-N~ivv 559 (698)
|+. .|++|+ |+|.|
T Consensus 121 It~-~l~~g~~N~l~V 135 (167)
T PF02837_consen 121 ITD-YLKPGEENTLAV 135 (167)
T ss_dssp CGG-GSSSEEEEEEEE
T ss_pred Chh-hccCCCCEEEEE
Confidence 875 789888 98887
No 12
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=96.33 E-value=0.015 Score=72.21 Aligned_cols=97 Identities=19% Similarity=0.260 Sum_probs=68.3
Q ss_pred cEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecCC
Q 005404 319 DYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVGL 398 (698)
Q Consensus 319 Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~Gr 398 (698)
|-.|||++|.++..- .|....|.+.++.-.+.|||||++||...+. ...+.|++.-.|+.|.|+|.|.|.+...
T Consensus 109 ~~g~Yrr~F~lp~~~----~gkrv~L~FeGV~s~a~VwvNG~~VG~~~g~--~~pfefDIT~~l~~G~N~LaV~V~~~~d 182 (1021)
T PRK10340 109 PTGAYQRTFTLSDGW----QGKQTIIKFDGVETYFEVYVNGQYVGFSKGS--RLTAEFDISAMVKTGDNLLCVRVMQWAD 182 (1021)
T ss_pred CeEEEEEEEEeCccc----ccCcEEEEECccceEEEEEECCEEeccccCC--CccEEEEcchhhCCCccEEEEEEEecCC
Confidence 667999999886431 2455789999999999999999999986543 2346666554577888999999986532
Q ss_pred cccccCCC-ccccceeecEEEccc
Q 005404 399 PNVGLHYE-TWETGVRGAVVLHGL 421 (698)
Q Consensus 399 ~N~g~~~~-~~~kGI~g~V~l~g~ 421 (698)
-.|-...+ ....||..+|.|--.
T Consensus 183 ~s~le~qd~w~~sGI~R~V~L~~~ 206 (1021)
T PRK10340 183 STYLEDQDMWWLAGIFRDVYLVGK 206 (1021)
T ss_pred CCccccCCccccccccceEEEEEe
Confidence 22211100 134799999999544
No 13
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=95.91 E-value=0.032 Score=69.38 Aligned_cols=95 Identities=20% Similarity=0.234 Sum_probs=66.2
Q ss_pred ccEEEEEEEeecCCCcccccCCC-CceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeec
Q 005404 318 SDYLWYMTSVEISSSESFLRGGQ-KPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAV 396 (698)
Q Consensus 318 ~Gy~~Y~t~i~~~~~~~~l~~g~-~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~ 396 (698)
.+-.|||++|.++..- .+. ...|.+.++.-.+.|||||+++|...+. ...+.|++.-.|+.|.|+|.|.|...
T Consensus 119 n~~gwYrr~F~vp~~w----~~~~rv~L~FeGV~~~a~VwvNG~~VG~~~g~--~~pfefDIT~~l~~G~N~L~V~V~~~ 192 (1027)
T PRK09525 119 NPTGCYSLTFTVDESW----LQSGQTRIIFDGVNSAFHLWCNGRWVGYSQDS--RLPAEFDLSPFLRAGENRLAVMVLRW 192 (1027)
T ss_pred CCeEEEEEEEEeChhh----cCCCeEEEEECeeccEEEEEECCEEEEeecCC--CceEEEEChhhhcCCccEEEEEEEec
Confidence 3778999999886431 122 4689999999999999999999986542 23466666545778899999988533
Q ss_pred CCcccccCCC----ccccceeecEEEccc
Q 005404 397 GLPNVGLHYE----TWETGVRGAVVLHGL 421 (698)
Q Consensus 397 Gr~N~g~~~~----~~~kGI~g~V~l~g~ 421 (698)
-. |..++ ....||..+|.|--.
T Consensus 193 sd---gs~~e~qd~w~~sGI~R~V~L~~~ 218 (1027)
T PRK09525 193 SD---GSYLEDQDMWRMSGIFRDVSLLHK 218 (1027)
T ss_pred CC---CCccccCCceeeccccceEEEEEc
Confidence 21 22122 134699999998543
No 14
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=95.64 E-value=0.073 Score=58.90 Aligned_cols=87 Identities=17% Similarity=0.200 Sum_probs=47.5
Q ss_pred CCCCCceeeeccCccccccCCCcCCCCHHHHHHHHHHHHHcCCceeeeeEeeccCCCCCCCCCCcccccccCCCCCCCCC
Q 005404 93 KPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 172 (698)
Q Consensus 93 ~p~~P~~~~E~~~Gwf~~wG~~~~~~~~~~~~~~~~~~l~~g~s~~n~YMfhGGTNfG~~~G~~~~~TSYDY~APl~E~G 172 (698)
..++|.+++|.++| ...|+.......+..+....-..++.|+..+.|+-+ ..--+|.-. | ..+.|+-+|
T Consensus 286 ~~~kpf~v~E~~~g-~~~~~~~~~~~~pg~~~~~~~~~~A~Ga~~i~~~~w-r~~~~g~E~--------~-~~g~~~~dg 354 (374)
T PF02449_consen 286 AKGKPFWVMEQQPG-PVNWRPYNRPPRPGELRLWSWQAIAHGADGILFWQW-RQSRFGAEQ--------F-HGGLVDHDG 354 (374)
T ss_dssp TTT--EEEEEE--S---SSSSS-----TTHHHHHHHHHHHTT-S-EEEC-S-B--SSSTTT--------T-S--SB-TTS
T ss_pred cCCCceEeecCCCC-CCCCccCCCCCCCCHHHHHHHHHHHHhCCeeEeeec-cCCCCCchh--------h-hcccCCccC
Confidence 47889999999999 566765544445566666566678999998777766 333334221 1 146788899
Q ss_pred -CCCchhHHHHHHHHHHHHh
Q 005404 173 -LMRQPKYGHLKQLHEAIKL 191 (698)
Q Consensus 173 -~~~~~ky~~lr~l~~~~~~ 191 (698)
.+ +++|.+++++.+.|+.
T Consensus 355 ~~~-~~~~~e~~~~~~~l~~ 373 (374)
T PF02449_consen 355 REP-TRRYREVAQLGRELKK 373 (374)
T ss_dssp --B--HHHHHHHHHHHHHHT
T ss_pred CCC-CcHHHHHHHHHHHHhc
Confidence 66 7999999999887653
No 15
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=93.93 E-value=0.13 Score=64.28 Aligned_cols=64 Identities=22% Similarity=0.310 Sum_probs=46.8
Q ss_pred CceEEEEEEeCCCCCC--CeEEEeCCC-ceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEec
Q 005404 469 SLKWYKAYFDAPTGNE--PLALDLRSM-GKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHV 545 (698)
Q Consensus 469 ~~~~Yr~~F~~p~~~d--~~~Ld~~g~-gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~v 545 (698)
+..|||.+|.+|+..+ -++|.+.|. ....|||||+-+|+-= |.|+ |- -|-|
T Consensus 109 ~~g~Yrr~F~lp~~~~gkrv~L~FeGV~s~a~VwvNG~~VG~~~--------------g~~~----------pf--efDI 162 (1021)
T PRK10340 109 PTGAYQRTFTLSDGWQGKQTIIKFDGVETYFEVYVNGQYVGFSK--------------GSRL----------TA--EFDI 162 (1021)
T ss_pred CeEEEEEEEEeCcccccCcEEEEECccceEEEEEECCEEecccc--------------CCCc----------cE--EEEc
Confidence 5689999999997532 489999986 4689999999999642 2221 22 2435
Q ss_pred CcccccCCcceEEE
Q 005404 546 PRSWLKPTKNLLVV 559 (698)
Q Consensus 546 P~~~Lk~g~N~ivv 559 (698)
.. .|++|+|+|+|
T Consensus 163 T~-~l~~G~N~LaV 175 (1021)
T PRK10340 163 SA-MVKTGDNLLCV 175 (1021)
T ss_pred ch-hhCCCccEEEE
Confidence 54 67888898877
No 16
>PRK10150 beta-D-glucuronidase; Provisional
Probab=92.87 E-value=0.24 Score=58.50 Aligned_cols=41 Identities=27% Similarity=0.484 Sum_probs=34.6
Q ss_pred CceEEEEEEeCCCCC--CCeEEEeCCC-ceEEEEECCeeeeeee
Q 005404 469 SLKWYKAYFDAPTGN--EPLALDLRSM-GKGQVWINGQSIGRYW 509 (698)
Q Consensus 469 ~~~~Yr~~F~~p~~~--d~~~Ld~~g~-gKG~vwVNG~nlGRYW 509 (698)
+..|||.+|++|+.. .-++|.+.|. ....|||||+.||+--
T Consensus 65 G~~WYrr~f~lp~~~~gk~v~L~Fegv~~~a~V~lNG~~vg~~~ 108 (604)
T PRK10150 65 GDVWYQREVFIPKGWAGQRIVLRFGSVTHYAKVWVNGQEVMEHK 108 (604)
T ss_pred ccEEEEEEEECCcccCCCEEEEEECcccceEEEEECCEEeeeEc
Confidence 678999999998643 2499999987 4799999999999764
No 17
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=92.53 E-value=0.2 Score=62.55 Aligned_cols=41 Identities=15% Similarity=0.261 Sum_probs=34.0
Q ss_pred CCceEEEEEEeCCCCC--C-CeEEEeCCC-ceEEEEECCeeeeee
Q 005404 468 QSLKWYKAYFDAPTGN--E-PLALDLRSM-GKGQVWINGQSIGRY 508 (698)
Q Consensus 468 ~~~~~Yr~~F~~p~~~--d-~~~Ld~~g~-gKG~vwVNG~nlGRY 508 (698)
.+..|||.+|++|+.. . .++|.+.|. ....|||||+-+|.-
T Consensus 119 n~~gwYrr~F~vp~~w~~~~rv~L~FeGV~~~a~VwvNG~~VG~~ 163 (1027)
T PRK09525 119 NPTGCYSLTFTVDESWLQSGQTRIIFDGVNSAFHLWCNGRWVGYS 163 (1027)
T ss_pred CCeEEEEEEEEeChhhcCCCeEEEEECeeccEEEEEECCEEEEee
Confidence 3679999999999642 2 489999986 589999999999953
No 18
>PF06832 BiPBP_C: Penicillin-Binding Protein C-terminus Family; InterPro: IPR009647 This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (IPR001460 from INTERPRO). It is predicted to be a beta fold.
Probab=73.32 E-value=4.8 Score=34.96 Aligned_cols=50 Identities=20% Similarity=0.293 Sum_probs=34.3
Q ss_pred ceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccc-cCCcCeEEEEEeecCCc
Q 005404 342 PTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANL-RAGINKIALLSIAVGLP 399 (698)
Q Consensus 342 ~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l-~~g~~~L~ILven~Gr~ 399 (698)
..|++.+-...++-||||+++|..... ..+.+ .+ ..|.++|.+ ++..|+.
T Consensus 34 l~l~a~~~~~~~~W~vdg~~~g~~~~~---~~~~~----~~~~~G~h~l~v-vD~~G~~ 84 (89)
T PF06832_consen 34 LVLKAAGGRGPVYWFVDGEPLGTTQPG---HQLFW----QPDRPGEHTLTV-VDAQGRS 84 (89)
T ss_pred EEEEEeCCCCcEEEEECCEEcccCCCC---CeEEe----CCCCCeeEEEEE-EcCCCCE
Confidence 355555446699999999999876542 12332 34 678999987 7777764
No 19
>PF14683 CBM-like: Polysaccharide lyase family 4, domain III; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=70.94 E-value=4.4 Score=39.94 Aligned_cols=61 Identities=23% Similarity=0.299 Sum_probs=28.4
Q ss_pred CceEEEEECCeeeeeeee-ccccCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceEEEEEeeC
Q 005404 493 MGKGQVWINGQSIGRYWM-AYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLLVVFEELG 564 (698)
Q Consensus 493 ~gKG~vwVNG~nlGRYW~-~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~ivvfE~~g 564 (698)
-++=+|.||| ..+..+. ..++ ++|.+++ -+-.|..+.--|.||+.+|++|.|+|.|=-..|
T Consensus 92 ~~~~~V~vNg-~~~~~~~~~~~~---d~~~~r~-------g~~~G~~~~~~~~ipa~~L~~G~Nti~lt~~~g 153 (167)
T PF14683_consen 92 GGRLQVSVNG-WSGPFPSAPFGN---DNAIYRS-------GIHRGNYRLYEFDIPASLLKAGENTITLTVPSG 153 (167)
T ss_dssp T-EEEEEETT-EE--------------S--GGG-------T---S---EEEEEE-TTSS-SEEEEEEEEEE-S
T ss_pred CCCEEEEEcC-ccCCccccccCC---CCceeeC-------ceecccEEEEEEEEcHHHEEeccEEEEEEEccC
Confidence 4567899999 6777663 2211 2344433 122234566667799999999999997744444
No 20
>KOG4729 consensus Galactoside-binding lectin [General function prediction only]
Probab=64.43 E-value=8.3 Score=40.30 Aligned_cols=87 Identities=9% Similarity=-0.088 Sum_probs=63.8
Q ss_pred CCCCceeeecCCCCeEeeeeeeccCCCC---CCCCC----CCCCCccCCChHHHHHHHhcCCCceEEEecCCCccCCCC-
Q 005404 605 TGNAKVLLQCAPGQSITSIEFASFGTPS---GTCGS----FQKGTCHAPNSHAMLEKECIGQESCSIFISSGVFGTDPC- 676 (698)
Q Consensus 605 ce~~~~~L~C~~g~~Is~I~~A~YGr~~---~~C~~----~~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~Fg~DPC- 676 (698)
|..+++..-|+....+ .++.+.+++.+ ..|.. .....|.....+..+...|.+++.|.+..++.-++ -+|
T Consensus 140 ~~p~~~~~~~~~~~~~-~~e~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ct~~~~~~~~~-~~~~ 217 (265)
T KOG4729|consen 140 CTPSPTDPPRSEIRLE-CREGRRLAVYSAVMKTSPQKDPETEIRHECVSSVLPQLLRQCHAKEGCTLKSDGIKGH-CRHG 217 (265)
T ss_pred ecCCCCCCccCcccch-hhhcccccccccccccCCCCcccCCCCceeecccchhhhhcccccCCceeecCCcccc-cccc
Confidence 6666777777777666 67778888863 34543 12333444467888999999999999999999988 566
Q ss_pred CCCCceEEEEEEeecCC
Q 005404 677 PNVLKRLSVQAVCSTAD 693 (698)
Q Consensus 677 ~g~~KYL~V~Y~C~p~~ 693 (698)
++..+|+-|.+.|.+..
T Consensus 218 ~~~~~~~~~n~e~~~~~ 234 (265)
T KOG4729|consen 218 HLHKVYVTVTEEIFSEE 234 (265)
T ss_pred ceeEEEEEecccccchh
Confidence 44578999999987654
No 21
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=61.24 E-value=7.4 Score=32.13 Aligned_cols=46 Identities=24% Similarity=0.498 Sum_probs=28.2
Q ss_pred eeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCCcCeEEEEEeecCCcc
Q 005404 343 TLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAGINKIALLSIAVGLPN 400 (698)
Q Consensus 343 ~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g~~~L~ILven~Gr~N 400 (698)
.|.|...-.-|.|||||+++|... ..+. .+..|.++|.| +.-|...
T Consensus 3 ~l~V~s~p~gA~V~vdg~~~G~tp-------~~~~---~l~~G~~~v~v--~~~Gy~~ 48 (71)
T PF08308_consen 3 TLRVTSNPSGAEVYVDGKYIGTTP-------LTLK---DLPPGEHTVTV--EKPGYEP 48 (71)
T ss_pred EEEEEEECCCCEEEECCEEeccCc-------ceee---ecCCccEEEEE--EECCCee
Confidence 466666566789999999999432 2221 25567665544 5555443
No 22
>PF11875 DUF3395: Domain of unknown function (DUF3395); InterPro: IPR024586 Chaperone DnaJ was originally characterised from Escherichia coli as a 41 kDa heat shock protein. DnaJ has a modular structure consisting of a J-domain, a proximal G/F-domain, and a distal zinc finger domain, followed by less conserved C-terminal sequences. Since then, a large number of DnaJ-related proteins containing a J-domain have been characterised from a variety of different organisms. In the genome of Arabidopsis thaliana a total of 89 J-domain proteins have been identified []. This entry represents a C-terminal domain found in some eukaryotic DnaJ-like proteins, including member 11 from the subfamily C1 and protein DnaJ 13 from Arabidopsis. This domain is typically between 147 to 176 amino acids in length.
Probab=60.99 E-value=6.8 Score=37.96 Aligned_cols=67 Identities=24% Similarity=0.236 Sum_probs=37.3
Q ss_pred eeeeccCCCCC--CCCCC-----CCCCccCCChHHHHHHHhcCCCceEEEec----CCCccC-CCCC--CCCceEEEEEE
Q 005404 623 IEFASFGTPSG--TCGSF-----QKGTCHAPNSHAMLEKECIGQESCSIFIS----SGVFGT-DPCP--NVLKRLSVQAV 688 (698)
Q Consensus 623 I~~A~YGr~~~--~C~~~-----~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~----~~~Fg~-DPC~--g~~KYL~V~Y~ 688 (698)
|..|.||.... .+... ....+..-+.+-.+. |.=+.++-.... ..+.|. |||| |..|.|.|.|+
T Consensus 55 I~~A~YG~~~~~~~~~~~~~~~~~~~~~~~iDVTipLq--~lV~dS~L~l~~~~sKs~L~GF~DP~p~~ge~K~L~V~Y~ 132 (151)
T PF11875_consen 55 ILKAWYGNLPAKSDESNNDEPEDPDLDPPVIDVTIPLQ--ALVKDSQLILPEGVSKSGLPGFYDPCPFLGEPKQLRVRYR 132 (151)
T ss_pred EEEEEcCCcccccccccccccccccccCcEEEEhhhhh--hEeecCEEEEcCCCchhhCCCCCCCccccCCccEEEEEEE
Confidence 88899999742 33221 112233334443333 333344433222 123333 9999 88999999998
Q ss_pred eec
Q 005404 689 CST 691 (698)
Q Consensus 689 C~p 691 (698)
...
T Consensus 133 f~g 135 (151)
T PF11875_consen 133 FRG 135 (151)
T ss_pred ECC
Confidence 753
No 23
>PF11324 DUF3126: Protein of unknown function (DUF3126); InterPro: IPR021473 This family of proteins with unknown function appear to be restricted to Alphaproteobacteria.
Probab=59.69 E-value=18 Score=29.79 Aligned_cols=24 Identities=17% Similarity=0.545 Sum_probs=20.7
Q ss_pred CcceEEEEEECCEEEEEEEccccc
Q 005404 348 SAGHAVHVFINGQFLGSAFGTREN 371 (698)
Q Consensus 348 ~~~D~~~Vfvng~~~G~~~~~~~~ 371 (698)
+..|.|.||++++|+|++++....
T Consensus 25 k~~dsaEV~~g~EfiGvi~~Dede 48 (63)
T PF11324_consen 25 KKDDSAEVYIGDEFIGVIYRDEDE 48 (63)
T ss_pred CCCCceEEEeCCEEEEEEEeecCC
Confidence 578999999999999999986533
No 24
>PF08531 Bac_rhamnosid_N: Alpha-L-rhamnosidase N-terminal domain; InterPro: IPR013737 This domain is found in bacterial rhamnosidase A and B enzymes and is probably involved in substrate recognition. ; PDB: 2OKX_B.
Probab=56.16 E-value=53 Score=32.24 Aligned_cols=56 Identities=21% Similarity=0.219 Sum_probs=30.8
Q ss_pred eeeeCCcceEEEEEECCEEEEEEE--c--ccccc---ceEEeecccccCCcCeEEEEEeecCCc
Q 005404 343 TLTVESAGHAVHVFINGQFLGSAF--G--TRENR---RFTFSGPANLRAGINKIALLSIAVGLP 399 (698)
Q Consensus 343 ~L~v~~~~D~~~Vfvng~~~G~~~--~--~~~~~---~~~~~~~~~l~~g~~~L~ILven~Gr~ 399 (698)
.|.|... .+..+||||+.||.-. . ....+ -.++++.--|+.|.|+|.|++-+....
T Consensus 7 ~l~isa~-g~Y~l~vNG~~V~~~~l~P~~t~y~~~~~Y~tyDVt~~L~~G~N~iav~lg~gw~~ 69 (172)
T PF08531_consen 7 RLYISAL-GRYELYVNGERVGDGPLAPGWTDYDKRVYYQTYDVTPYLRPGENVIAVWLGNGWYN 69 (172)
T ss_dssp EEEEEEE-SEEEEEETTEEEEEE--------BTTEEEEEEEE-TTT--TTEEEEEEEEEE--S-
T ss_pred EEEEEeC-eeEEEEECCEEeeCCccccccccCCCceEEEEEeChHHhCCCCCEEEEEEeCCccc
Confidence 4555443 3668999999999643 1 11111 124444434788999999999775433
No 25
>KOG2024 consensus Beta-Glucuronidase GUSB (glycosylhydrolase superfamily 2) [Carbohydrate transport and metabolism]
Probab=54.05 E-value=22 Score=37.44 Aligned_cols=58 Identities=24% Similarity=0.313 Sum_probs=42.1
Q ss_pred ccchhhcCCC---CCCccEEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEE
Q 005404 305 IGLLEQINIT---RDTSDYLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLG 363 (698)
Q Consensus 305 p~~~Eql~~t---~d~~Gy~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G 363 (698)
|.++-.+++. +|-+|.+||+-++.++.+.. ...++...|++.+++-+|.|||||.-+=
T Consensus 71 pss~nDi~~d~~lrdfv~~~wyer~v~vpe~w~-~~~~~r~vlr~~s~H~~Aivwvng~~~~ 131 (297)
T KOG2024|consen 71 PSSFNDIGQDWRLRDFVGLVWYERTVTVPESWT-QDLGKRVVLRIGSAHSYAIVWVNGVDAL 131 (297)
T ss_pred ccchhccccCCccccceeeeEEEEEEEcchhhh-hhcCCeEEEEeecccceeEEEEcceeec
Confidence 3445566553 46899999999998875432 1224456889999999999999987643
No 26
>PF08531 Bac_rhamnosid_N: Alpha-L-rhamnosidase N-terminal domain; InterPro: IPR013737 This domain is found in bacterial rhamnosidase A and B enzymes and is probably involved in substrate recognition. ; PDB: 2OKX_B.
Probab=49.41 E-value=23 Score=34.87 Aligned_cols=22 Identities=23% Similarity=0.613 Sum_probs=19.4
Q ss_pred EEEeCCCceEEEEECCeeeeee
Q 005404 487 ALDLRSMGKGQVWINGQSIGRY 508 (698)
Q Consensus 487 ~Ld~~g~gKG~vwVNG~nlGRY 508 (698)
.|.+++.|+=.+||||+.+|+-
T Consensus 7 ~l~isa~g~Y~l~vNG~~V~~~ 28 (172)
T PF08531_consen 7 RLYISALGRYELYVNGERVGDG 28 (172)
T ss_dssp EEEEEEESEEEEEETTEEEEEE
T ss_pred EEEEEeCeeEEEEECCEEeeCC
Confidence 4778888999999999999975
No 27
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=49.20 E-value=90 Score=28.83 Aligned_cols=70 Identities=16% Similarity=0.323 Sum_probs=42.0
Q ss_pred EEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEcccc-----ccceEEeecccccCC-cCeEEEEE
Q 005404 320 YLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRE-----NRRFTFSGPANLRAG-INKIALLS 393 (698)
Q Consensus 320 y~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~-----~~~~~~~~~~~l~~g-~~~L~ILv 393 (698)
.+.+++.|..+.+..+ ++.+. ..|.+.+||||+.+-...+... .........+.|.+| .+.|.|..
T Consensus 47 ~~~~~G~~~~~~~G~y-------~f~~~-~~d~~~l~idg~~vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~y 118 (145)
T PF07691_consen 47 SVRWTGYFKPPETGTY-------TFSLT-SDDGARLWIDGKLVIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIEY 118 (145)
T ss_dssp EEEEEEEEEESSSEEE-------EEEEE-ESSEEEEEETTEEEEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEEE
T ss_pred EEEEEEEEecccCceE-------EEEEE-ecccEEEEECCEEEEcCCccccccccccccceEEEEEEeeCCeeEEEEEEE
Confidence 4568888887655432 34443 7789999999999977654321 001122223445554 67888876
Q ss_pred eecC
Q 005404 394 IAVG 397 (698)
Q Consensus 394 en~G 397 (698)
.+.+
T Consensus 119 ~~~~ 122 (145)
T PF07691_consen 119 FNRG 122 (145)
T ss_dssp EECS
T ss_pred EECC
Confidence 6655
No 28
>PLN03059 beta-galactosidase; Provisional
Probab=42.19 E-value=1.2e+02 Score=37.20 Aligned_cols=42 Identities=26% Similarity=0.395 Sum_probs=32.5
Q ss_pred CCceEEEEEEeCCCCCC------CeEEEeCCCc-eEEEEECCeeeeeee
Q 005404 468 QSLKWYKAYFDAPTGNE------PLALDLRSMG-KGQVWINGQSIGRYW 509 (698)
Q Consensus 468 ~~~~~Yr~~F~~p~~~d------~~~Ld~~g~g-KG~vwVNG~nlGRYW 509 (698)
.+..||+++|+++.... .-.|.+.+.+ ..+|||||.-+|.-+
T Consensus 469 ~dYlwY~t~i~~~~~~~~~~~~~~~~L~v~~~~d~~~vFVNg~~~Gt~~ 517 (840)
T PLN03059 469 TDYLWYMTEVHIDPDEGFLKTGQYPVLTIFSAGHALHVFINGQLAGTVY 517 (840)
T ss_pred CceEEEEEEEeecCCccccccCCCceEEEcccCcEEEEEECCEEEEEEE
Confidence 46899999999865421 1237888776 699999999999875
No 29
>TIGR01676 GLDHase galactonolactone dehydrogenase. This model represents L-Galactono-gamma-lactone dehydrogenase (EC 1.3.2.3). This enzyme catalyzes the final step in ascorbic acid biosynthesis in higher plants. This protein is homologous to ascorbic acid biosynthesis enzymes of other species: L-gulono-gamma-lactone oxidase in rat and L-galactono-gamma-lactone oxidase in yeast. All three covalently bind the cofactor FAD.
Probab=40.42 E-value=31 Score=40.32 Aligned_cols=41 Identities=12% Similarity=-0.010 Sum_probs=29.5
Q ss_pred cEEEEEeccccCCccccC---CCCcHHHHHHHHHHHHhCCCCcc
Q 005404 24 PIILSQIENEYGPESKSL---GAAGHAYVNWAAKMAVGLDTGVP 64 (698)
Q Consensus 24 PII~vQvENEYG~~~~~~---g~~d~~Ym~~L~~~a~~~Gi~vp 64 (698)
++-.-|.|+||.--.... .-.+.++|+.|+++.++.++.++
T Consensus 375 ~~~f~q~~~EyaiP~e~~~~~~~~~~~~l~el~~~i~~~~~~~~ 418 (541)
T TIGR01676 375 DCGGHQWVSETCFPAGTLAKPNMKDIEYIEELKQLIEKENIPAP 418 (541)
T ss_pred ccccceEEEEEEEecccccccccchHHHHHHHHHHHHhcCCCCC
Confidence 555779999997521000 01469999999999999888875
No 30
>KOG0496 consensus Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=39.53 E-value=7.9 Score=45.40 Aligned_cols=58 Identities=21% Similarity=0.304 Sum_probs=50.1
Q ss_pred cCCCCeEeeeeeeccCCCCCCCCCCCCCCccCCChHHHHHHHhcCCCceEEEecCCCc
Q 005404 614 CAPGQSITSIEFASFGTPSGTCGSFQKGTCHAPNSHAMLEKECIGQESCSIFISSGVF 671 (698)
Q Consensus 614 C~~g~~Is~I~~A~YGr~~~~C~~~~~~~C~~~~s~~~V~~~C~Gk~~C~v~a~~~~F 671 (698)
|..+.++..|..|.||...+.|+.+...+|.++.+...+.+.|-.+..|+|....+.+
T Consensus 332 ~ep~lv~gd~~~~kyg~~~~~C~~Fl~n~~~~~~~~v~f~~~~y~~~~~slsilpdck 389 (649)
T KOG0496|consen 332 CEPALVAGDITTAKYGNLREACAAFLSNNNGAPAAPVPFNKPKYRLPPWSLSILPDCK 389 (649)
T ss_pred cCccccccCcccccccchhhHHHHHHhcCCCCCCCccccCCCccccCceeEEechhhc
Confidence 4567788888999999998889999888999999998999999999999998875443
No 31
>COG2884 FtsE Predicted ATPase involved in cell division [Cell division and chromosome partitioning]
Probab=37.05 E-value=29 Score=35.23 Aligned_cols=16 Identities=25% Similarity=0.661 Sum_probs=14.2
Q ss_pred ceEEEEECCeeeeeee
Q 005404 494 GKGQVWINGQSIGRYW 509 (698)
Q Consensus 494 gKG~vwVNG~nlGRYW 509 (698)
.+|.|||||++|.|.=
T Consensus 55 t~G~i~~~~~dl~~l~ 70 (223)
T COG2884 55 TRGKILVNGHDLSRLK 70 (223)
T ss_pred CCceEEECCeeccccc
Confidence 5799999999999973
No 32
>PF07009 DUF1312: Protein of unknown function (DUF1312); InterPro: IPR010739 This family consists of several bacterial proteins of around 120 residues in length. The function of this family is unknown.; PDB: 4ESN_B 1NPP_B 1M1G_D 1NPR_A 1M1H_A 2KPP_A 3LD7_C.
Probab=26.29 E-value=3.5e+02 Score=24.64 Aligned_cols=81 Identities=12% Similarity=0.307 Sum_probs=47.2
Q ss_pred CceEEEEECCeeeeeeeeccccCCCCccccCCCcCCccccCCCCCCceeeEecCcccccCCcceEEEEEeeCCCCcceEE
Q 005404 493 MGKGQVWINGQSIGRYWMAYAKGDCKTCSYAGTFRPINCQRRCGHPTQRWYHVPRSWLKPTKNLLVVFEELGGDASRISL 572 (698)
Q Consensus 493 ~gKG~vwVNG~nlGRYW~~~~~g~~~~~~~~G~~~~~~~~~~~~~PQqtlY~vP~~~Lk~g~N~ivvfE~~g~~~~~i~~ 572 (698)
-.+..|++||.-+.||= + +. .. ..+++. |+.+ .|.|+|.| ++ ..|++
T Consensus 26 ~~~~~I~~~g~~~~~i~--L-~~-------~~-------------~~~~i~-i~~~---~g~~~i~i---~~---g~vrv 72 (113)
T PF07009_consen 26 GKYAVIYVDGKEVKRIP--L-DK-------VN-------------EDKTIE-IDGD---GGYNTIEI---KD---GKVRV 72 (113)
T ss_dssp EEEEEEEETTEEEEEEE--T-TS--------B-------------SEEEEE-EETT---TCEEEEEE---ET---TEEEE
T ss_pred CeEEEEEECCEEEEEEE--C-CC-------CC-------------CCEEEE-EecC---CcEEEEEE---EC---CEEEE
Confidence 35789999999999993 2 10 01 344444 7553 34565544 22 34888
Q ss_pred Eeec-ccccccccccCCCCCCCcccccCCCCCcCCCCceeeecCCCCeEeeee
Q 005404 573 VKRS-VARVCADAHEHHPTTDNNDIENKGNSNSTGNAKVLLQCAPGQSITSIE 624 (698)
Q Consensus 573 ~~~~-~~~~~~~~~e~hp~~~~~~~~~~~~~~~ce~~~~~L~C~~g~~Is~I~ 624 (698)
.+.. .+++|.+. .|..+.++ .+-|-+.+++..|.
T Consensus 73 ~~s~CpdkiCv~~---------G~I~~~G~---------~IVCLPn~lvI~I~ 107 (113)
T PF07009_consen 73 IESDCPDKICVKT---------GWISRPGQ---------SIVCLPNRLVIEIE 107 (113)
T ss_dssp EEESTSS-HHHHS----------SB-STT----------EEEETTTTEEEEEE
T ss_pred EECCCCCcchhhC---------CCcCCCCC---------EEEEcCCEEEEEEE
Confidence 7775 67888876 45444444 47787766554555
No 33
>PF02055 Glyco_hydro_30: O-Glycosyl hydrolase family 30; InterPro: IPR001139 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 30 GH30 from CAZY comprises enzymes with only one known activity; glucosylceramidase (3.2.1.45 from EC). Family 30 encompasses the mammalian glucosylceramidases. Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase), cleaves the glucosidic bonds of glucosylceramide and synthetic beta-glucosides []. Any one of over 50 different mutations in the gene of glucocerebrosidase have been found to affect activity of this hydrolase, producing variants of Gaucher disease, the most prevalent lysosomal storage disease [, ].; GO: 0004348 glucosylceramidase activity, 0006665 sphingolipid metabolic process, 0007040 lysosome organization, 0005764 lysosome; PDB: 2VT0_B 1NOF_A 2Y24_A 2WCG_B 2J25_A 3GXM_D 1Y7V_B 2NT0_C 3GXF_C 3GXD_A ....
Probab=23.73 E-value=2e+02 Score=33.44 Aligned_cols=220 Identities=18% Similarity=0.261 Sum_probs=99.0
Q ss_pred HHHHHHHHHhccCceeccCCcEEEEEeccccCCc---cccCCC------CcHHHHH-HHHHHHHhCCC--CcceEeecCC
Q 005404 4 FTQKIVQMMKNEKLFASQGGPIILSQIENEYGPE---SKSLGA------AGHAYVN-WAAKMAVGLDT--GVPWVMCKED 71 (698)
Q Consensus 4 ~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~---~~~~g~------~d~~Ym~-~L~~~a~~~Gi--~vp~~~~~~~ 71 (698)
|-+.+++.|. -+ .++|=||=++=+-||=... ...|.. .-++.++ +|.-.+++.|+ ++-++..+..
T Consensus 206 yA~Y~vkfi~--aY-~~~GI~i~aiT~QNEP~~~~~~~~~~~s~~~t~~~~~~Fi~~~LgP~l~~~~~g~d~kI~~~D~n 282 (496)
T PF02055_consen 206 YADYFVKFIQ--AY-KKEGIPIWAITPQNEPDNGSDPNYPWPSMGWTPEEQADFIKNYLGPALRKAGLGKDVKILIYDHN 282 (496)
T ss_dssp HHHHHHHHHH--HH-HCTT--ESEEESSSSCCGGGSTT-SSC--B--HHHHHHHHHHTHHHHHHTSTT-TTSEEEEEEEE
T ss_pred HHHHHHHHHH--HH-HHCCCCeEEEeccCCCCCCCCCCCCCCcCCCCHHHHHHHHHHHHHHHHHhcCCCCceEEEEEecC
Confidence 3444445554 22 2677799999999994320 001111 1255665 47778888877 7777776652
Q ss_pred --CCCC---cccc------CCCC--ccc--c--------cCCCCCCCCCceeeeccCccccccCCCcC---CCCHHHHHH
Q 005404 72 --DAPD---PVIN------SCNG--FYC--D--------AFSPNKPYKPTLWTEAWSGWFTEFGGAVH---RRPVQDLAF 125 (698)
Q Consensus 72 --~~~~---~v~~------~~ng--~~~--~--------~f~~~~p~~P~~~~E~~~Gwf~~wG~~~~---~~~~~~~~~ 125 (698)
..|+ .++. ...| +++ . .+....|+..++.||-..|.. .|+.... ...++.++.
T Consensus 283 ~~~~~~~~~~il~d~~A~~yv~GiA~HwY~g~~~~~~l~~~h~~~P~k~l~~TE~~~g~~-~~~~~~~~g~w~~~~~y~~ 361 (496)
T PF02055_consen 283 RDNLPDYADTILNDPEAAKYVDGIAFHWYGGDPSPQALDQVHNKFPDKFLLFTEACCGSW-NWDTSVDLGSWDRAERYAH 361 (496)
T ss_dssp GGGTTHHHHHHHTSHHHHTTEEEEEEEETTCS-HCHHHHHHHHHSTTSEEEEEEEESS-S-TTS-SS-TTHHHHHHHHHH
T ss_pred CcccchhhhhhhcChhhHhheeEEEEECCCCCchhhHHHHHHHHCCCcEEEeeccccCCC-CcccccccccHHHHHHHHH
Confidence 2221 1221 1111 111 1 111346888999999976542 1221111 111233444
Q ss_pred HHHHHHHcCCceeeeeEe------eccCCCCCC-CCCCcccccccCCCCCCCCCC-CCchhHHHHHHHHHHHHhhhcccc
Q 005404 126 AVARFIQKGGSFFNYYMY------HGGTNFGRT-AGGPFITTSYDYDAPLDEYGL-MRQPKYGHLKQLHEAIKLCEYALV 197 (698)
Q Consensus 126 ~~~~~l~~g~s~~n~YMf------hGGTNfG~~-~G~~~~~TSYDY~APl~E~G~-~~~~ky~~lr~l~~~~~~~~~~l~ 197 (698)
.+..-|..+++ ++-++ .||-|++.- ..++..+.. +.+. .++|.|+.|..+.+|++.-...+.
T Consensus 362 ~ii~~lnn~~~--gw~~WNl~LD~~GGP~~~~n~~d~~iivd~--------~~~~~~~~p~yY~~gHfSKFV~PGa~RI~ 431 (496)
T PF02055_consen 362 DIIGDLNNWVS--GWIDWNLALDENGGPNWVGNFCDAPIIVDS--------DTGEFYKQPEYYAMGHFSKFVRPGAVRIG 431 (496)
T ss_dssp HHHHHHHTTEE--EEEEEESEBETTS---TT---B--SEEEEG--------GGTEEEE-HHHHHHHHHHTTS-TT-EEEE
T ss_pred HHHHHHHhhce--eeeeeeeecCCCCCCcccCCCCCceeEEEc--------CCCeEEEcHHHHHHHHHhcccCCCCEEEE
Confidence 44444566644 22222 388887532 112221111 1121 126899999998888764322221
Q ss_pred CCCCcccCCCCcceeeeeecCCcceeeEeeccCCccc-eEEEeC
Q 005404 198 SSDPTVTSLGTYQQAHVFSAGQQKCAAFLSNYNTKSA-ARVTFN 240 (698)
Q Consensus 198 ~~~p~~~~~~~~~~~~~y~~~~~~~~~Fl~n~~~~~~-~~v~~~ 240 (698)
.. ........+..+|...++..++-|.|...... .+|++.
T Consensus 432 st---~~~~~~~l~~vAF~nPDGs~vvVv~N~~~~~~~~~v~v~ 472 (496)
T PF02055_consen 432 ST---SSSSDSGLEAVAFLNPDGSIVVVVLNRGDSDQNFSVTVK 472 (496)
T ss_dssp EE---ESSSTTTEEEEEEEETTSEEEEEEEE-SSS-EEEEEEEE
T ss_pred ee---ccCCCCceeEEEEECCCCCEEEEEEcCCCCccceEEEEe
Confidence 10 00001134555666666777766666544332 245553
No 34
>PF14307 Glyco_tran_WbsX: Glycosyltransferase WbsX
Probab=21.82 E-value=1.9e+02 Score=31.76 Aligned_cols=61 Identities=8% Similarity=0.125 Sum_probs=46.4
Q ss_pred hhHHHHHHHHHhccCceeccCCcEEEEEeccccCCccccCCCCcHHHHHHHHHHHHhCCCCcceEeecC
Q 005404 2 QGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKMAVGLDTGVPWVMCKE 70 (698)
Q Consensus 2 ~~~~~~i~~~l~~~~l~~~~GGPII~vQvENEYG~~~~~~g~~d~~Ym~~L~~~a~~~Gi~vp~~~~~~ 70 (698)
++.++.|++..++..++--+|=||++|=--.+. ++-++.|+.+++.|+++|+..+.+....
T Consensus 138 ~~~~~~l~~~F~D~rYikVdGKPv~~Iy~p~~~--------pd~~~~~~~wr~~a~~~G~~giyii~~~ 198 (345)
T PF14307_consen 138 KEHFRYLLPYFKDPRYIKVDGKPVFLIYRPGDI--------PDIKEMIERWREEAKEAGLPGIYIIAVQ 198 (345)
T ss_pred HHHHHHHHHHhCCCCceeECCEEEEEEECcccc--------cCHHHHHHHHHHHHHHcCCCceEEEEEe
Confidence 456888999999877776788899998433222 2457899999999999999877665443
No 35
>smart00758 PA14 domain in bacterial beta-glucosidases other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins, and bacterial toxins.
Probab=21.73 E-value=1.6e+02 Score=27.04 Aligned_cols=38 Identities=21% Similarity=0.604 Sum_probs=30.4
Q ss_pred eEEEEEEeCCCCCCCeEEEeCCCceEEEEECCeeeeeee
Q 005404 471 KWYKAYFDAPTGNEPLALDLRSMGKGQVWINGQSIGRYW 509 (698)
Q Consensus 471 ~~Yr~~F~~p~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW 509 (698)
.-+.++|.+|+... .-+.+.+-+...+||||.-+-..|
T Consensus 46 ~~~~g~i~~~~~G~-y~f~~~~~~~~~l~Idg~~vid~~ 83 (136)
T smart00758 46 VRWTGYLKPPEDGE-YTFSITSDDGARLWIDGKLVIDNW 83 (136)
T ss_pred EEEEEEEECCCCcc-EEEEEEcCCcEEEEECCcEEEcCC
Confidence 45678898887655 556777888899999999888777
No 36
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=21.37 E-value=1.1e+02 Score=28.11 Aligned_cols=38 Identities=24% Similarity=0.615 Sum_probs=29.7
Q ss_pred eEEEEEEeCCCCCCCeEEEeCCCceEEEEECCeeeeeee
Q 005404 471 KWYKAYFDAPTGNEPLALDLRSMGKGQVWINGQSIGRYW 509 (698)
Q Consensus 471 ~~Yr~~F~~p~~~d~~~Ld~~g~gKG~vwVNG~nlGRYW 509 (698)
.=++++|++|.... .-|.+..-+...+||||.-+-..|
T Consensus 48 ~~~~G~~~~~~~G~-y~f~~~~~d~~~l~idg~~vid~~ 85 (145)
T PF07691_consen 48 VRWTGYFKPPETGT-YTFSLTSDDGARLWIDGKLVIDNW 85 (145)
T ss_dssp EEEEEEEEESSSEE-EEEEEEESSEEEEEETTEEEEECS
T ss_pred EEEEEEEecccCce-EEEEEEecccEEEEECCEEEEcCC
Confidence 34789999987654 445555667789999999999998
No 37
>smart00758 PA14 domain in bacterial beta-glucosidases other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins, and bacterial toxins.
Probab=20.65 E-value=3.1e+02 Score=25.13 Aligned_cols=65 Identities=17% Similarity=0.274 Sum_probs=35.9
Q ss_pred EEEEEEEeecCCCcccccCCCCceeeeCCcceEEEEEECCEEEEEEEccccccceEEeecccccCC-cCeEEEEE
Q 005404 320 YLWYMTSVEISSSESFLRGGQKPTLTVESAGHAVHVFINGQFLGSAFGTRENRRFTFSGPANLRAG-INKIALLS 393 (698)
Q Consensus 320 y~~Y~t~i~~~~~~~~l~~g~~~~L~v~~~~D~~~Vfvng~~~G~~~~~~~~~~~~~~~~~~l~~g-~~~L~ILv 393 (698)
.+.+++.|..+.+.. -++.+ ...|.+.+||||+.+-...+... ........+.|.+| .+.|.|..
T Consensus 45 ~~~~~g~i~~~~~G~-------y~f~~-~~~~~~~l~Idg~~vid~~~~~~-~~~~~~~~v~l~~g~~~~i~v~y 110 (136)
T smart00758 45 SVRWTGYLKPPEDGE-------YTFSI-TSDDGARLWIDGKLVIDNWGKHE-ARPSTSSTLYLLAGGTYPIRIEY 110 (136)
T ss_pred EEEEEEEEECCCCcc-------EEEEE-EcCCcEEEEECCcEEEcCCccCC-CccccceeEEEeCCcEEEEEEEE
Confidence 356788887664432 24555 45788999999998765322111 00111123445555 45665544
No 38
>COG2360 Aat Leu/Phe-tRNA-protein transferase [Posttranslational modification, protein turnover, chaperones]
Probab=20.32 E-value=1e+02 Score=31.60 Aligned_cols=35 Identities=23% Similarity=0.344 Sum_probs=28.2
Q ss_pred CceeccCCcEEEEEecccc----CCccccCCCCcHHHHHHHHH
Q 005404 16 KLFASQGGPIILSQIENEY----GPESKSLGAAGHAYVNWAAK 54 (698)
Q Consensus 16 ~l~~~~GGPII~vQvENEY----G~~~~~~g~~d~~Ym~~L~~ 54 (698)
.++...|+.+|=.|++||. |. +.-..++|++.|++
T Consensus 167 ~~L~~~g~~LiD~Q~~n~HL~~~GA----~~ipr~~y~~~L~~ 205 (221)
T COG2360 167 EHLRRHGFVLIDCQVLNEHLASLGA----YEIPRKEYLNYLRR 205 (221)
T ss_pred HHHHhcCceEEeeecCCHHHHhcCC----eecCHHHHHHHHHH
Confidence 3455889999999999994 44 33478999999998
Done!