Query psy14029
Match_columns 277
No_of_seqs 156 out of 385
Neff 8.2
Searched_HMMs 46136
Date Fri Aug 16 17:48:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14029.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14029hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13385 Laminin_G_3: Concanav 99.8 5.9E-18 1.3E-22 135.8 14.5 155 79-250 2-157 (157)
2 cd00152 PTX Pentraxins are pla 99.6 4.5E-14 9.8E-19 120.0 20.4 169 77-262 6-183 (201)
3 smart00159 PTX Pentraxin / C-r 99.6 3.7E-14 8.1E-19 121.0 19.3 166 79-261 8-182 (206)
4 PF00354 Pentaxin: Pentaxin fa 99.4 4.6E-11 1E-15 101.0 16.9 159 80-255 3-170 (195)
5 smart00560 LamGL LamG-like jel 99.3 1.7E-10 3.6E-15 91.7 14.1 70 170-246 61-132 (133)
6 smart00210 TSPN Thrombospondin 99.2 2.4E-09 5.2E-14 89.8 16.6 150 76-242 26-183 (184)
7 cd00110 LamG Laminin G domain; 99.1 1.3E-08 2.9E-13 81.5 16.2 141 80-239 2-150 (151)
8 KOG1788|consensus 98.9 3.7E-09 8.1E-14 104.7 7.2 187 26-221 974-1206(2799)
9 smart00282 LamG Laminin G doma 98.8 4.8E-07 1E-11 71.5 14.7 123 104-241 3-134 (135)
10 PF07953 Toxin_R_bind_N: Clost 98.5 1.3E-06 2.9E-11 71.4 10.7 98 101-204 54-158 (195)
11 PF02973 Sialidase: Sialidase, 98.5 9.6E-06 2.1E-10 67.8 15.8 144 102-254 33-187 (190)
12 PF02210 Laminin_G_2: Laminin 98.2 4.7E-05 1E-09 58.8 13.1 108 126-239 9-125 (128)
13 PTZ00334 trans-sialidase; Prov 97.7 0.0013 2.9E-08 65.7 15.9 82 166-254 639-724 (780)
14 KOG1834|consensus 97.4 0.0043 9.3E-08 60.0 14.0 167 79-254 344-530 (952)
15 PF00054 Laminin_G_1: Laminin 97.4 0.003 6.6E-08 49.7 10.9 100 138-244 22-130 (131)
16 KOG3516|consensus 96.8 0.029 6.2E-07 57.9 14.1 130 77-223 364-493 (1306)
17 PF06439 DUF1080: Domain of Un 95.0 0.052 1.1E-06 44.7 5.5 37 163-204 120-156 (185)
18 KOG1788|consensus 94.9 0.0058 1.3E-07 62.2 -0.5 100 167-267 1274-1385(2799)
19 PF14099 Polysacc_lyase: Polys 94.4 0.95 2.1E-05 38.6 12.1 103 98-202 73-184 (224)
20 KOG3516|consensus 93.3 1.4 3E-05 46.1 12.5 77 162-246 862-940 (1306)
21 KOG3514|consensus 89.0 4.1 8.9E-05 42.4 10.5 150 79-250 25-194 (1591)
22 KOG4289|consensus 85.0 5.7 0.00012 42.8 9.2 105 126-242 1578-1682(2531)
23 PF09264 Sial-lect-inser: Vibr 75.9 29 0.00063 29.0 8.8 28 170-200 92-119 (198)
24 KOG3514|consensus 73.4 44 0.00095 35.3 11.0 82 162-250 326-413 (1591)
25 PHA02902 putative IMV membrane 69.3 2.7 5.8E-05 28.6 1.1 23 233-255 43-65 (70)
26 PHA02681 ORF089 virion membran 60.6 4.8 0.0001 28.8 1.1 20 236-255 44-63 (92)
27 PF02057 Glyco_hydro_59: Glyco 59.7 2E+02 0.0044 29.0 14.8 61 164-239 605-665 (669)
28 PF08671 SinI: Anti-repressor 57.3 22 0.00047 20.5 3.3 25 25-49 5-29 (30)
29 KOG1219|consensus 56.1 1.2E+02 0.0027 35.4 10.9 89 165-258 3758-3859(4289)
30 PF08787 Alginate_lyase2: Algi 52.1 1.6E+02 0.0034 25.5 13.8 101 97-202 77-191 (236)
31 cd00070 GLECT Galectin/galacto 49.0 1.2E+02 0.0026 23.2 7.6 36 164-204 71-106 (127)
32 KOG3546|consensus 44.0 1E+02 0.0022 30.9 7.5 76 164-248 151-231 (1167)
33 cd00413 Glyco_hydrolase_16 gly 43.1 66 0.0014 26.6 5.7 35 168-208 140-174 (210)
34 PF08308 PEGA: PEGA domain; I 41.7 70 0.0015 21.6 4.7 18 187-204 11-28 (71)
35 PF14193 DUF4315: Domain of un 40.8 41 0.00089 24.3 3.4 26 26-51 37-62 (83)
36 cd02175 GH16_lichenase lichena 40.8 68 0.0015 27.0 5.4 29 170-203 138-166 (212)
37 PF03701 UPF0181: Uncharacteri 39.2 1.1E+02 0.0023 19.9 4.7 40 11-50 2-42 (51)
38 PF00722 Glyco_hydro_16: Glyco 37.7 91 0.002 25.2 5.6 31 169-204 117-147 (185)
39 smart00276 GLECT Galectin. Gal 33.5 96 0.0021 23.8 4.8 36 164-204 70-105 (128)
40 cd08783 Death_MALT1 Death doma 32.7 2.1E+02 0.0045 21.3 6.2 38 34-71 37-85 (97)
41 PF12993 DUF3877: Domain of un 32.1 58 0.0013 26.8 3.3 27 24-50 96-122 (175)
42 cd08023 GH16_laminarinase_like 31.7 1E+02 0.0022 26.2 5.2 32 168-204 156-187 (235)
43 PF12926 MOZART2: Mitotic-spin 30.6 1.6E+02 0.0034 21.5 5.0 43 24-72 29-71 (88)
44 PF03750 DUF310: Protein of un 29.3 1.5E+02 0.0032 22.7 5.1 33 19-51 11-43 (119)
45 PF02018 CBM_4_9: Carbohydrate 29.1 2.4E+02 0.0051 20.8 10.1 29 165-194 97-125 (131)
46 PF03633 Glyco_hydro_65C: Glyc 29.0 40 0.00086 21.7 1.6 29 170-198 18-50 (54)
47 PRK05114 hypothetical protein; 28.9 1.8E+02 0.0039 19.4 4.8 40 11-50 2-42 (59)
48 PF03170 BcsB: Bacterial cellu 27.9 84 0.0018 31.1 4.4 33 172-204 343-378 (605)
49 PRK06789 flagellar motor switc 27.8 1.2E+02 0.0026 21.4 3.9 21 184-204 39-59 (74)
50 PF09224 DUF1961: Domain of un 27.8 2E+02 0.0043 24.7 5.9 31 168-203 157-187 (218)
51 COG3140 Uncharacterized protei 26.5 1.9E+02 0.004 19.1 4.3 40 11-50 2-42 (60)
52 cd02183 GH16_fungal_CRH1_trans 24.3 1.7E+02 0.0037 24.7 5.1 28 170-202 115-142 (203)
53 PF09313 DUF1971: Domain of un 24.1 2.7E+02 0.006 19.9 5.6 24 164-194 59-82 (82)
54 TIGR02543 List_Bact_rpt Lister 23.7 52 0.0011 20.5 1.3 13 99-111 1-13 (44)
55 PF04167 DUF402: Protein of un 22.8 87 0.0019 21.5 2.5 26 166-194 10-35 (72)
56 PF12554 MOZART1: Mitotic-spin 22.3 2E+02 0.0042 18.5 3.8 14 36-49 18-31 (48)
57 KOG2014|consensus 21.1 1.4E+02 0.003 27.1 4.0 56 32-90 263-324 (331)
58 PF02828 L27: L27 domain; Int 20.9 2.2E+02 0.0047 18.4 4.0 42 24-71 4-46 (56)
59 PF09479 Flg_new: Listeria-Bac 20.8 56 0.0012 19.6 1.1 15 99-113 1-15 (41)
60 PHA03170 UL37 tegument protein 20.8 1.1E+02 0.0024 26.9 3.2 19 186-204 41-59 (293)
61 PF00337 Gal-bind_lectin: Gala 20.6 3.9E+02 0.0084 20.3 9.5 37 163-204 75-111 (133)
No 1
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=99.78 E-value=5.9e-18 Score=135.76 Aligned_cols=155 Identities=20% Similarity=0.372 Sum_probs=97.7
Q ss_pred eEeecCCCcccEECCCCCCCCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCCCeEEEEEeCC-eEEEEEeeecCc
Q psy14029 79 FFSFPGKKGSAMVLPPLAKWPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKGVGYTAHFVGN-CLVLTSMKIKGK 157 (277)
Q Consensus 79 ff~F~g~~~s~i~lp~~~~~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g~g~e~~~~~~-~lvv~~~~~~~~ 157 (277)
.+.|+|.. +++.+|. ..+| .++||+++||+++..... ...+.....+..++.+.+..+ .+.+......+.
T Consensus 2 a~~f~g~~-~~i~~~~-~~~~-~~~fTi~~w~~~~~~~~~------~~~~~~~~~~~~~~~l~~~~~~~l~~~~~~~~~~ 72 (157)
T PF13385_consen 2 ALYFDGSN-DYISIPN-SDFP-SGSFTISFWVKPDSPSSS------QSFVFMDSSGSGGFGLFINNNGRLRFYIGNGGGG 72 (157)
T ss_dssp EEEE-STT--EEEEES-GGGG-GTEEEEEEEEEESS--SS------EEEEEESSSSSEEEEEEEETTSEEEEEETTSEEE
T ss_pred EEEECCCC-CEEEECC-cCCC-CCCEEEEEEEEeCCCCCC------ceEEEEecCCCCEEEEEEECCCEEEEEEeCCCce
Confidence 36788874 8999987 5678 699999999999998642 222221122334777777754 455543322111
Q ss_pred ceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCCCCCCccceEeccce
Q psy14029 158 GFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPELDEERVFCGQMAAI 237 (277)
Q Consensus 158 ~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~~~~~~~f~g~lg~~ 237 (277)
.........+.+++||||++|++ .+.+++||||+++.+.... ..........++||... .....|.|.|+++
T Consensus 73 ~~~~~~~~~~~~~~W~~l~~~~~-----~~~~~lyvnG~~~~~~~~~-~~~~~~~~~~~~iG~~~--~~~~~~~g~i~~~ 144 (157)
T PF13385_consen 73 NYSFSSDSNLPDNKWHHLALTYD-----GSTVTLYVNGELVGSSTIP-SNISLNSNGPLFIGGSG--GGSSPFNGYIDDL 144 (157)
T ss_dssp SS-EE-BS---TT-EEEEEEEEE-----TTEEEEEETTEEETTCTEE-SSSSTTSCCEEEESS-S--TT--B-EEEEEEE
T ss_pred eEEEecCcccCCCCEEEEEEEEE-----CCeEEEEECCEEEEeEecc-CCcCCCCcceEEEeecC--CCCCceEEEEEEE
Confidence 22334567889999999999998 5679999999999998765 33222344578999976 2356899999999
Q ss_pred eEecCCCCHHHHH
Q psy14029 238 YLFNESLSTHQIC 250 (277)
Q Consensus 238 ~lf~~aLs~~~I~ 250 (277)
.||++||+++||+
T Consensus 145 ~i~~~aLt~~eI~ 157 (157)
T PF13385_consen 145 RIYNRALTAEEIQ 157 (157)
T ss_dssp EEESS---HHHHH
T ss_pred EEECccCCHHHcC
Confidence 9999999999996
No 2
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=99.64 E-value=4.5e-14 Score=120.03 Aligned_cols=169 Identities=20% Similarity=0.281 Sum_probs=105.5
Q ss_pred cceEeecCCC-cccEECCCCCCCC-CCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCC-CeEEEEEeC-CeEEEEEe
Q psy14029 77 DVFFSFPGKK-GSAMVLPPLAKWP-HENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKG-VGYTAHFVG-NCLVLTSM 152 (277)
Q Consensus 77 ~~ff~F~g~~-~s~i~lp~~~~~p-~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g-~g~e~~~~~-~~lvv~~~ 152 (277)
..-+.|.... ..++.+++ ..| ..+.||+|+|++++..+. ...+||..+... ..+..+... +.+.+.+.
T Consensus 6 ~~~l~f~~~s~~~yv~l~~--~~~~~l~~fTv~~Wv~~~~~~~------~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i~ 77 (201)
T cd00152 6 GKVFVFPKESDTSYVKLKP--ELPKPLQAFTLCLWVYTDLSTR------EYSLFSYATKGQDNELLLYKEKDGGYSLYIG 77 (201)
T ss_pred CCEEECCCCCCCceEEEcc--CCCCChhhEEEEEEEEecCCCC------CeEEEEEeCCCCCCeEEEEEcCCCeEEEEEc
Confidence 3456776542 24566543 223 578999999999987532 356999876522 233333333 24444322
Q ss_pred eecCcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCC-----CCCC
Q psy14029 153 KIKGKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPE-----LDEE 227 (277)
Q Consensus 153 ~~~~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~-----~~~~ 227 (277)
++... .......++|||||++.+.. ..++++||||+++.+..+..-..+. +-....||.... .+..
T Consensus 78 ---~~~~~--~~~~~~~g~W~hv~~t~d~~---~g~~~lyvnG~~~~~~~~~~~~~~~-~~g~l~lG~~q~~~gg~~~~~ 148 (201)
T cd00152 78 ---GKEVT--FKVPESDGAWHHICVTWEST---SGIAELWVNGKLSVRKSLKKGYTVG-PGGSIILGQEQDSYGGGFDAT 148 (201)
T ss_pred ---CEEEE--EeccCCCCCEEEEEEEEECC---CCcEEEEECCEEeccccccCCCEEC-CCCeEEEeecccCCCCCCCCC
Confidence 22211 12344889999999999842 4579999999999766432001111 122345665421 2335
Q ss_pred ccceEeccceeEecCCCCHHHHHHHHhCCCCCccC
Q psy14029 228 RVFCGQMAAIYLFNESLSTHQICAMHRLGPGYKVK 262 (277)
Q Consensus 228 ~~f~g~lg~~~lf~~aLs~~~I~~iy~lGp~y~~~ 262 (277)
..|.|+|+.++||+.+|+++||+.+|.-+..+.|+
T Consensus 149 ~~f~G~I~~v~iw~~~Ls~~eI~~l~~~~~~~~Gn 183 (201)
T cd00152 149 QSFVGEISDVNMWDSVLSPEEIKNVYSEGGTLSGN 183 (201)
T ss_pred cceEEEEceeEEEcccCCHHHHHHHHhcCCCCCCC
Confidence 68999999999999999999999999844444443
No 3
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=99.63 E-value=3.7e-14 Score=121.00 Aligned_cols=166 Identities=20% Similarity=0.241 Sum_probs=103.9
Q ss_pred eEeecCCC-cccEECCCCCCCC-CCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCCC-eEEEEEeCC-eEEEEEeee
Q psy14029 79 FFSFPGKK-GSAMVLPPLAKWP-HENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKGV-GYTAHFVGN-CLVLTSMKI 154 (277)
Q Consensus 79 ff~F~g~~-~s~i~lp~~~~~p-~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g~-g~e~~~~~~-~lvv~~~~~ 154 (277)
-+.|+... ..++.+++ ..| .-+.||+|+|++.+.... ...|||..+.+.. .+..+..++ .+.+.+.
T Consensus 8 ~~~fp~~s~~~yv~l~~--~~~~~l~~fTvc~W~k~~~~~~------~~~ifSy~~~~~~ne~~~~~~~~~~~~l~i~-- 77 (206)
T smart00159 8 VFVFPKESDTSYVKLKP--ELPKPLQAFTVCLWFYSDLSPR------GYSLFSYATKGQDNELLLYKEKQGEYSLYIG-- 77 (206)
T ss_pred EEECCCCCCCCeEEEcc--CCCCChhHEEEEEEEEecCCCC------ceEEEEEeCCCCCCeEEEEEcCCcEEEEEEc--
Confidence 46665442 34566643 224 568999999999997532 3579998775322 222233333 3433321
Q ss_pred cCcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCC-----CCCCcc
Q psy14029 155 KGKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPE-----LDEERV 229 (277)
Q Consensus 155 ~~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~-----~~~~~~ 229 (277)
++ ....+..+.+++|||||++.+.. ..++++|+||+++....+. -...-.+.....||...+ .+....
T Consensus 78 -g~--~~~~~~~~~~g~W~hvc~tw~~~---~g~~~lyvnG~~~~~~~~~-~g~~i~~~G~lvlGq~qd~~gg~f~~~~~ 150 (206)
T smart00159 78 -GK--KVQFPVPESDGKWHHICTTWESS---SGIAELWVDGKPGVRKGLA-KGYTVKPGGSIILGQEQDSYGGGFDATQS 150 (206)
T ss_pred -Ce--EEEecccccCCceEEEEEEEECC---CCcEEEEECCEEccccccc-CCcEECCCCEEEEEecccCCCCCCCCCcc
Confidence 22 22234567899999999999842 4579999999998333222 000001122345665422 234568
Q ss_pred ceEeccceeEecCCCCHHHHHHHHhCCCCCcc
Q psy14029 230 FCGQMAAIYLFNESLSTHQICAMHRLGPGYKV 261 (277)
Q Consensus 230 f~g~lg~~~lf~~aLs~~~I~~iy~lGp~y~~ 261 (277)
|.|+|+.++||+++|+++||+.+|.-.+.+.|
T Consensus 151 f~G~i~~v~iw~~~Ls~~eI~~l~~~~~~~~G 182 (206)
T smart00159 151 FVGEIGDLNMWDSVLSPEEIKSVYKGSTFSIG 182 (206)
T ss_pred eeEEEeeeEEecccCCHHHHHHHHcCCCCCCC
Confidence 99999999999999999999999986554433
No 4
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=99.37 E-value=4.6e-11 Score=100.97 Aligned_cols=159 Identities=21% Similarity=0.332 Sum_probs=92.2
Q ss_pred EeecCCC-cccEECCCCCCCCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCCCeEEE--EEeC-CeEEEEEeeec
Q psy14029 80 FSFPGKK-GSAMVLPPLAKWPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKGVGYTA--HFVG-NCLVLTSMKIK 155 (277)
Q Consensus 80 f~F~g~~-~s~i~lp~~~~~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g~g~e~--~~~~-~~lvv~~~~~~ 155 (277)
|.|+... ..+..+.+....| -+-||+|+|++.+.... ...|||.-+.+ ..-|+ +... +.+.+.+.
T Consensus 3 ~~FP~~s~~~yv~l~~~~~~p-L~~fTvC~w~k~~~~~~------~~tifSYat~~-~~nell~~~~~~~~~~l~i~--- 71 (195)
T PF00354_consen 3 FHFPTRSTTDYVRLKPSVPLP-LSAFTVCFWVKTDDSSN------DGTIFSYATSS-QDNELLLFGSSSGSLRLYIN--- 71 (195)
T ss_dssp EEE-S-BSSBEEEEEESS-S--BSEEEEEEEEEESGSGS-------EEEEEEEETT-EEEEEEEEEETTTEEEEEET---
T ss_pred EECCCCCCcceEEEecCCCCC-cccEEEEEEEEeccCCC------ceEEEEEccCC-CCccEEEEEeCCceEEEEEC---
Confidence 5565442 2344544333557 58999999999977531 35799987643 33333 3333 34444322
Q ss_pred CcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCC-----CCCCccc
Q psy14029 156 GKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPE-----LDEERVF 230 (277)
Q Consensus 156 ~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~-----~~~~~~f 230 (277)
++ .......+.+++|||+|++-+. ....+.+|+||+...+..+. -...-.+-...-||...+ .+...+|
T Consensus 72 ~~--~~~~~~~~~~~~Whh~C~tW~s---~~G~~~ly~dG~~~~~~~~~-~g~~i~~gG~~vlGQeQd~~gG~fd~~q~F 145 (195)
T PF00354_consen 72 GS--SVSFSGPIRDGQWHHICVTWDS---STGRWQLYVDGVRLSSTGLA-TGHSIPGGGTLVLGQEQDSYGGGFDESQAF 145 (195)
T ss_dssp TE--EEEEEECS-TSS-EEEEEEEET---TTTEEEEEETTEEEEEEESS-TT--B-SSEEEEESS-BSBTTBTCSGGGB-
T ss_pred Ce--EeEeccccCCCCcEEEEEEEec---CCcEEEEEECCEeccccccc-CCceECCCCEEEECccccccCCCcCCccEe
Confidence 22 1233456788999999999873 24589999999966554433 111112223455776532 2345689
Q ss_pred eEeccceeEecCCCCHHHHHHHHhC
Q psy14029 231 CGQMAAIYLFNESLSTHQICAMHRL 255 (277)
Q Consensus 231 ~g~lg~~~lf~~aLs~~~I~~iy~l 255 (277)
.|.|..+.|++++|++++|+.|+.-
T Consensus 146 ~G~i~~~~iWd~vLs~~eI~~l~~~ 170 (195)
T PF00354_consen 146 VGEISDFNIWDRVLSPEEIRALASC 170 (195)
T ss_dssp -EEEEEEEEESS---HHHHHHHHHT
T ss_pred eEEEeceEEEeeeCCHHHHHHHHhC
Confidence 9999999999999999999999985
No 5
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=99.27 E-value=1.7e-10 Score=91.66 Aligned_cols=70 Identities=20% Similarity=0.298 Sum_probs=49.9
Q ss_pred CcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCC-CC-CCCCCccceEeccceeEecCCCCH
Q psy14029 170 RKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGA-TP-ELDEERVFCGQMAAIYLFNESLST 246 (277)
Q Consensus 170 ~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~-~~-~~~~~~~f~g~lg~~~lf~~aLs~ 246 (277)
++||||++|++.. ..++++||||+++.+.... +. .......||. .. .......|.|.|++++||+.||++
T Consensus 61 ~~W~hva~v~d~~---~g~~~lYvnG~~~~~~~~~--~~--~~~~~~~iG~~~~~~~~~~~~f~G~Idevriy~~aLs~ 132 (133)
T smart00560 61 GVWVHLAGVYDGG---AGKLSLYVNGVEVATSETQ--PS--PSSGNLPQGGRILLGGAGGENFSGRLDEVRVYNRALTA 132 (133)
T ss_pred CCEEEEEEEEECC---CCeEEEEECCEEccccccC--Cc--ccCCceEEeeeccCCCCCCCCceEEeeEEEEeccccCC
Confidence 9999999999942 3589999999999765432 21 1223446773 21 112345799999999999999986
No 6
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=99.16 E-value=2.4e-09 Score=89.75 Aligned_cols=150 Identities=15% Similarity=0.242 Sum_probs=94.3
Q ss_pred CcceEeecCCCcccEECCCCCCCC--CCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCC-CeEEEEEeCCe--EEEE
Q psy14029 76 PDVFFSFPGKKGSAMVLPPLAKWP--HENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKG-VGYTAHFVGNC--LVLT 150 (277)
Q Consensus 76 p~~ff~F~g~~~s~i~lp~~~~~p--~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g-~g~e~~~~~~~--lvv~ 150 (277)
+...+.|... .-+..|.-..+| ....|||++|+|..+-. ...||++.+.++ .-+++.+.+.. +.+.
T Consensus 26 ~~~Ay~~~~~--a~~~~~t~~~~p~~~~~~fsi~~~~r~~~~~-------~g~L~si~~~~~~~~l~v~l~g~~~~~~~~ 96 (184)
T smart00210 26 GSPAYRLGDP--ALVPQPTRDLFPSGLPEDFSLLTTFRQTPKS-------RGVLFAIYDAQNVRQFGLEVDGRANTLLLR 96 (184)
T ss_pred CceeEEecCC--cccCcchHHhCcCCCCCCeEEEEEEEeCCCC-------CeEEEEEEcCCCcEEEEEEEeCCccEEEEE
Confidence 4467788664 344444322333 24689999999998522 357999987533 35666665543 4443
Q ss_pred EeeecCcceee-eeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccc--cCCCcceeEeCCCCCCCCC
Q psy14029 151 SMKIKGKGFQH-CVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVS--TNEPFDKCYIGATPELDEE 227 (277)
Q Consensus 151 ~~~~~~~~~~~-~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~--~~~~~~~~~IG~~~~~~~~ 227 (277)
.....|+.... +....+.+|+||||+|+.. +..|++|||++.+.+..+. -+. ...+-....+|.... + .
T Consensus 97 ~~~~~g~~~~~~f~~~~l~dg~WH~lal~V~-----~~~v~LyvDC~~~~~~~l~-~~~~~~~~~~g~~~~g~~~~-~-~ 168 (184)
T smart00210 97 YQGVDGKQHTVSFRNLPLADGQWHKLALSVS-----GSSATLYVDCNEIDSRPLD-RPGQPPIDTDGIEVRGAQAA-D-R 168 (184)
T ss_pred ECCCCCcEEEEeecCCccccCCceEEEEEEe-----CCEEEEEECCccccceecC-CcccccccccceEEEeeccC-C-C
Confidence 32233433322 2236788999999999998 5689999999999998876 232 111112233444322 1 3
Q ss_pred ccceEeccceeEecC
Q psy14029 228 RVFCGQMAAIYLFNE 242 (277)
Q Consensus 228 ~~f~g~lg~~~lf~~ 242 (277)
.+|.|.|-.+.|+-+
T Consensus 169 ~~f~G~lq~l~i~~d 183 (184)
T smart00210 169 KPFQGDLQQLKIVCD 183 (184)
T ss_pred CcceEEeEEEEEecC
Confidence 479999999998764
No 7
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.06 E-value=1.3e-08 Score=81.50 Aligned_cols=141 Identities=18% Similarity=0.184 Sum_probs=91.7
Q ss_pred EeecCCCcccEECCCCCCCCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeC-CCCeEEEEEeCCeEEEEEeeecCcc
Q psy14029 80 FSFPGKKGSAMVLPPLAKWPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTS-KGVGYTAHFVGNCLVLTSMKIKGKG 158 (277)
Q Consensus 80 f~F~g~~~s~i~lp~~~~~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~-~g~g~e~~~~~~~lvv~~~~~~~~~ 158 (277)
+.|+|. +.+.+|.... + ..-+++++|||....+ ..||...+. .+.-+.+.+.++.+.+..... ++.
T Consensus 2 ~~F~g~--~~i~~~~~~~-~-~~~~~i~~~frt~~~~--------g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g-~~~ 68 (151)
T cd00110 2 VSFSGS--SYVRLPTLPA-P-RTRLSISFSFRTTSPN--------GLLLYAGSQNGGDFLALELEDGRLVLRYDLG-SGS 68 (151)
T ss_pred eEeCCC--ceEEecCCCC-C-cceeEEEEEEEeCCCC--------eEEEEecCCCCCCEEEEEEECCEEEEEEcCC-ccc
Confidence 468887 7888885211 1 5789999999987742 347776554 244567777777777764332 222
Q ss_pred eeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc-cccccCCCcceeEeCCCCCCC------CCccce
Q psy14029 159 FQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA-WFVSTNEPFDKCYIGATPELD------EERVFC 231 (277)
Q Consensus 159 ~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~-~~p~~~~~~~~~~IG~~~~~~------~~~~f~ 231 (277)
........+.+|+||||+|... ...+.+||||....+.... ..... ......+||+.+... ....|.
T Consensus 69 ~~~~~~~~v~dg~Wh~v~i~~~-----~~~~~l~VD~~~~~~~~~~~~~~~~-~~~~~~~iGg~~~~~~~~~~~~~~~F~ 142 (151)
T cd00110 69 LVLSSKTPLNDGQWHSVSVERN-----GRSVTLSVDGERVVESGSPGGSALL-NLDGPLYLGGLPEDLKSPGLPVSPGFV 142 (151)
T ss_pred EEEEccCccCCCCEEEEEEEEC-----CCEEEEEECCccEEeeeCCCCceee-cCCCCeEEcCCCCchhcccccccCCCc
Confidence 2222222589999999999987 5789999999744333221 00011 233456899987532 346899
Q ss_pred EeccceeE
Q psy14029 232 GQMAAIYL 239 (277)
Q Consensus 232 g~lg~~~l 239 (277)
|.|..+++
T Consensus 143 Gci~~v~i 150 (151)
T cd00110 143 GCIRDLKV 150 (151)
T ss_pred eEeeEeEe
Confidence 99998876
No 8
>KOG1788|consensus
Probab=98.87 E-value=3.7e-09 Score=104.69 Aligned_cols=187 Identities=14% Similarity=0.241 Sum_probs=109.2
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHHhhcCCCCCccHHHHHHHHHhccC-----C-------CCCcceEeecCCC---cccE
Q psy14029 26 IDMMGVLASYSITVKELKLFFAAMKAVNNKWPRHSTKLLNVLRQMPQ-----R-------NGPDVFFSFPGKK---GSAM 90 (277)
Q Consensus 26 ~~li~~L~s~sis~~elr~ll~ll~~~~~~~~~~~~~ll~~L~~m~~-----~-------~~p~~ff~F~g~~---~s~i 90 (277)
+++++.|+.|..++.|++.+++.... .+...+++++..|.+ . -.-..|.+++.+. .+..
T Consensus 974 lkIvemLgayrlspselsMlwryvaQ------mrVmnsgkAIlgmmeklilMeqVtplelLslApfveLdmsktghasvq 1047 (2799)
T KOG1788|consen 974 LKIVEMLGAYRLSPSELSMLWRYVAQ------MRVMNSGKAILGMMEKLILMEQVTPLELLSLAPFVELDMSKTGHASVQ 1047 (2799)
T ss_pred HHHHHHHhhccCCcHHHHHHHHHHHH------HHHHhhhHHHHHHHHhhHHHhhcchHHHHhhcchheecccccccccee
Confidence 36889999999999999999998752 111222333333221 1 1123556665542 0111
Q ss_pred ECCCCCCCCCCCceeEEEEEEecCCCCCC--------CCC-----------CCceEEEEEeCCCC----e--EEEEEeCC
Q psy14029 91 VLPPLAKWPHENGFTFTTWFRLDPINSVN--------IER-----------EKPYLYSFKTSKGV----G--YTAHFVGN 145 (277)
Q Consensus 91 ~lp~~~~~p~~~GfTfs~Wi~le~~~~~~--------~~~-----------~~~~Lfs~~~~~g~----g--~e~~~~~~ 145 (277)
..-..+.||+..||+|-+|++++++-... .+. +...+|..++.+.. . .+.|++.+
T Consensus 1048 vsLgerSwppaagysfvcwlqfrnfLttqslesEiakagGssktlqrskqqveqnifRdfsv~aIsnaspFyaelyfqEd 1127 (2799)
T KOG1788|consen 1048 VSLGERSWPPAAGYSFVCWLQFRNFLTTQSLESEIAKAGGSSKTLQRSKQQVEQNIFRDFSVNAISNASPFYAELYFQED 1127 (2799)
T ss_pred eecccccCCCCCCcceEEEEEecCchhhchhhHHHHhcCCCCccHHHHHHHHhhhhhhhhhhhhccCCCchHHHHHhhcc
Confidence 22245799999999999999999873210 000 11222222221111 1 13466666
Q ss_pred eEE-EEEeeecCcceeeeeeeEEeCCcEEEEEEEEEecc-----CCCCeEEEEECCEEeeeeecccccccCCCcceeEeC
Q psy14029 146 CLV-LTSMKIKGKGFQHCVKYEFQPRKWYMIAIVYIYNR-----WTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIG 219 (277)
Q Consensus 146 ~lv-v~~~~~~~~~~~~~~~~~~~~~~W~hlavv~~~~~-----~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG 219 (277)
++. +.. .+.....+.....++|+|||+||||+++. |..+...+|.||++.++.++.|-|+.-+.......|
T Consensus 1128 giltlat---sns~slSfsgleseeg~whhlavvhskpnalaglFqasvayvycdgklrhagklgyspspvgksLqvvtg 1204 (2799)
T KOG1788|consen 1128 GILTLAT---SNSNSLSFSGLESEEGKWHHLAVVHSKPNALAGLFQASVAYVYCDGKLRHAGKLGYSPSPVGKSLQVVTG 1204 (2799)
T ss_pred ceeEecc---cCCCccccccccccccccceeeeeccCchHhhhhccccceeeeehhhhhhhhccCCCCCCCCceEEEEec
Confidence 643 221 11111224456678899999999999854 677788999999999999987445433222334455
Q ss_pred CC
Q psy14029 220 AT 221 (277)
Q Consensus 220 ~~ 221 (277)
+.
T Consensus 1205 tv 1206 (2799)
T KOG1788|consen 1205 TV 1206 (2799)
T ss_pred cc
Confidence 54
No 9
>smart00282 LamG Laminin G domain.
Probab=98.76 E-value=4.8e-07 Score=71.47 Aligned_cols=123 Identities=18% Similarity=0.129 Sum_probs=79.3
Q ss_pred eeEEEEEEecCCCCCCCCCCCceEEEEEeC-CCCeEEEEEeCCeEEEEEeeecCcceeeeee-eEEeCCcEEEEEEEEEe
Q psy14029 104 FTFTTWFRLDPINSVNIEREKPYLYSFKTS-KGVGYTAHFVGNCLVLTSMKIKGKGFQHCVK-YEFQPRKWYMIAIVYIY 181 (277)
Q Consensus 104 fTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~-~g~g~e~~~~~~~lvv~~~~~~~~~~~~~~~-~~~~~~~W~hlavv~~~ 181 (277)
+++++|||....+ ..||...+. .+.-+.+.+.++.+.+......+ ....... ..+.+|+||||+|...
T Consensus 3 ~~i~~~frt~~~~--------g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~-~~~~~~~~~~~~dg~WH~v~i~~~- 72 (135)
T smart00282 3 LSISFSFRTTSPN--------GLLLYAGSKNGGDYLALELRDGRLVLRYDLGSG-PARLTSDPTPLNDGQWHRVAVERN- 72 (135)
T ss_pred eEEEEEEEeCCCC--------EEEEEeCCCCCCCEEEEEEECCEEEEEEECCCC-CEEEEECCeEeCCCCEEEEEEEEe-
Confidence 7899999987643 347665442 33456777777777766443222 2222233 7799999999999987
Q ss_pred ccCCCCeEEEEECCEEeeeeeccc-ccccCCCcceeEeCCCCCC------CCCccceEeccceeEec
Q psy14029 182 NRWTKSEIKCFVNGQLASSTEMAW-FVSTNEPFDKCYIGATPEL------DEERVFCGQMAAIYLFN 241 (277)
Q Consensus 182 ~~~~~s~v~lYvDG~l~~s~~l~~-~p~~~~~~~~~~IG~~~~~------~~~~~f~g~lg~~~lf~ 241 (277)
...+.++|||....+..... ...++ .....+||+.+.. .....|.|.|..+.+.+
T Consensus 73 ----~~~~~l~VD~~~~~~~~~~~~~~~l~-~~~~l~iGG~p~~~~~~~~~~~~~F~GCi~~v~in~ 134 (135)
T smart00282 73 ----GRRVTLSVDGENPVSGESPGGLTILN-LDGPLYLGGLPEDLKLPPLLVTPGFRGCIRNLKVNG 134 (135)
T ss_pred ----CCEEEEEECCCccccEECCCCceEEe-cCCCcEEccCCchhcccccccCCCCeeEeeEEEECC
Confidence 57899999997654333220 11121 2234689998753 12468999999998854
No 10
>PF07953 Toxin_R_bind_N: Clostridium neurotoxin, N-terminal receptor binding; InterPro: IPR012928 The Clostridium neurotoxin family is composed of tetanus neurotoxin and seven serotypes of botulinum neurotoxin. The structure of the botulinum neurotoxin reveals a four domain protein. The N-terminal catalytic domain (IPR000395 from INTERPRO), the central translocation domain and two receptor binding domains []. This domain is the N-terminal receptor binding domain, which is comprised of two seven-stranded beta-sheets sandwiched together to form a jelly role motif []. The role of this domain in receptor binding appears to be indirect. ; GO: 0004222 metalloendopeptidase activity, 0050827 toxin receptor binding, 0009405 pathogenesis, 0051609 inhibition of neurotransmitter uptake, 0005576 extracellular region; PDB: 3RSJ_B 3FUQ_A 3FFZ_B 1G9B_A 1S0F_A 1Z0H_B 1F31_A 1G9D_A 1S0C_A 1S0D_A ....
Probab=98.51 E-value=1.3e-06 Score=71.41 Aligned_cols=98 Identities=18% Similarity=0.365 Sum_probs=71.8
Q ss_pred CCceeEEEEEEecCCCCCCC-CCCCceEEEEEeCCCCeEEEEEeCCeEEEEEeeecCcceeeeeeeE------EeCCcEE
Q psy14029 101 ENGFTFTTWFRLDPINSVNI-EREKPYLYSFKTSKGVGYTAHFVGNCLVLTSMKIKGKGFQHCVKYE------FQPRKWY 173 (277)
Q Consensus 101 ~~GfTfs~Wi~le~~~~~~~-~~~~~~Lfs~~~~~g~g~e~~~~~~~lvv~~~~~~~~~~~~~~~~~------~~~~~W~ 173 (277)
.+.||+++|+|+...... + .....+|+... .+.+|.++++.+++++.+..+..|+.....+.+. =..++|+
T Consensus 54 ~~nFSIsFWlRipk~~~~-~~~~neytII~~~-~nNsGWkI~l~~n~iiwtl~D~ng~~k~i~f~y~~~~~~SdyiNkW~ 131 (195)
T PF07953_consen 54 YNNFSISFWLRIPKYDNN-INLNNEYTIINCM-KNNSGWKISLNNNGIIWTLIDSNGNEKSIYFNYSIMDNISDYINKWF 131 (195)
T ss_dssp SSEEEEEEEEEEECHHCC-HHTTSEEEEEEEE-ETTEEEEEEEETTEEEEEEEETTSEEEEEEEESSSTSSTTSSTTSEE
T ss_pred ccceEEEEEEEccCcccc-cccCcceEEEEee-cCCCceEEEEeCCcEEEEEEeCCCCEEEEEEEcccccchhhhcccEE
Confidence 478999999999876542 1 11235677766 4779999999999999998887776654433211 1238999
Q ss_pred EEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 174 MIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 174 hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
.|+|+.++ + ....+||||+++.+.++.
T Consensus 132 fITITnnr--L--~~~~IyINg~Li~~~~I~ 158 (195)
T PF07953_consen 132 FITITNNR--L--GNSKIYINGNLIDNESIK 158 (195)
T ss_dssp EEEEEEET--T--SEEEEEETTEEEEEEE-T
T ss_pred EEEEEccc--C--ccceEEECCEEEcccchh
Confidence 99999985 2 344999999999998865
No 11
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=98.50 E-value=9.6e-06 Score=67.80 Aligned_cols=144 Identities=22% Similarity=0.247 Sum_probs=86.1
Q ss_pred CceeEEEEEEecCCCCCCCCCCCceEEEEEeC-CCCe-EEEEEeCCeEEEEEeeecCccee-eeeeeEEeCC-----cEE
Q psy14029 102 NGFTFTTWFRLDPINSVNIEREKPYLYSFKTS-KGVG-YTAHFVGNCLVLTSMKIKGKGFQ-HCVKYEFQPR-----KWY 173 (277)
Q Consensus 102 ~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~-~g~g-~e~~~~~~~lvv~~~~~~~~~~~-~~~~~~~~~~-----~W~ 173 (277)
.-.|+.+=++.+.-.. ...||++.+. .+.. +.+|+.++.+.++....++.... ......+.++ .||
T Consensus 33 ~~gTI~i~Fk~~~~~~------~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~ 106 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSG------IQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFN 106 (190)
T ss_dssp SSEEEEEEEEESS-SS------EEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-E
T ss_pred cccEEEEEEecCCCcc------eeEEEEecCCCCccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEE
Confidence 3458888666655432 3568888664 2233 56777788888887666653322 1222444444 588
Q ss_pred EEEEEEEeccCCCCeEEEEECCEEeeeeecc--cccccCCCcceeEeCCCC-CCCCCccceEeccceeEecCCCCHHHHH
Q psy14029 174 MIAIVYIYNRWTKSEIKCFVNGQLASSTEMA--WFVSTNEPFDKCYIGATP-ELDEERVFCGQMAAIYLFNESLSTHQIC 250 (277)
Q Consensus 174 hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~--~~p~~~~~~~~~~IG~~~-~~~~~~~f~g~lg~~~lf~~aLs~~~I~ 250 (277)
-++++.+. ....+++|+||+++...... .|.+--+....+.||... ......+|.|.|+.+.||++||++++|.
T Consensus 107 tva~~ad~---~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g~~~y~f~G~I~~l~iYn~aLsdeel~ 183 (190)
T PF02973_consen 107 TVAFVADS---KNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAGSNAYPFNGTIDNLKIYNRALSDEELK 183 (190)
T ss_dssp EEEEEEET---TTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETTEEES--EEEEEEEEEESS---HHHHH
T ss_pred EEEEEEec---CCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCCCceecccceEEEEEEEcCcCCHHHHH
Confidence 88998882 26789999999877654321 133322345678999852 1223468999999999999999999999
Q ss_pred HHHh
Q psy14029 251 AMHR 254 (277)
Q Consensus 251 ~iy~ 254 (277)
....
T Consensus 184 ~~t~ 187 (190)
T PF02973_consen 184 ARTG 187 (190)
T ss_dssp HHHC
T ss_pred Hhhc
Confidence 8754
No 12
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=98.23 E-value=4.7e-05 Score=58.75 Aligned_cols=108 Identities=19% Similarity=0.222 Sum_probs=73.2
Q ss_pred eEEEEEeCCC-CeEEEEEeCCeEEEEEeeecCcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 126 YLYSFKTSKG-VGYTAHFVGNCLVLTSMKIKGKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 126 ~Lfs~~~~~g-~g~e~~~~~~~lvv~~~~~~~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
.|+...+.+. .-+.+.+.++.+.+......+..........+..++||+|.+... ...+.++||+.........
T Consensus 9 ~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~-----~~~~~l~Vd~~~~~~~~~~ 83 (128)
T PF02210_consen 9 LLLYIGSEDNGDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRD-----GNRVTLTVDGQSVSSESLP 83 (128)
T ss_dssp EEEEEEESTTSEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEE-----TTEEEEEETTSEEEEEESS
T ss_pred eEEEEcCCCCCEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEe-----eeeEEEEecCccceEEecc
Confidence 4666555443 467778888888776543322222234456789999999999998 5679999999999887654
Q ss_pred cccc--cCCCcceeEeCCCCCCCC------CccceEeccceeE
Q psy14029 205 WFVS--TNEPFDKCYIGATPELDE------ERVFCGQMAAIYL 239 (277)
Q Consensus 205 ~~p~--~~~~~~~~~IG~~~~~~~------~~~f~g~lg~~~l 239 (277)
-+. ........+||+.+.... ...|.|.|..+++
T Consensus 84 -~~~~~~~~~~~~l~iGg~~~~~~~~~~~~~~~f~Gci~~l~v 125 (128)
T PF02210_consen 84 -SSSSDSLDPDGSLYIGGLPESNQPSGSVDTPGFVGCIRDLRV 125 (128)
T ss_dssp -STTHHCBESEEEEEESSTTTTCTCTTSSTTSB-EEEEEEEEE
T ss_pred -ccceecccCCCCEEEecccCccccccccCCCCcEEEcCeEEE
Confidence 222 222345679999865321 5689999998876
No 13
>PTZ00334 trans-sialidase; Provisional
Probab=97.73 E-value=0.0013 Score=65.65 Aligned_cols=82 Identities=18% Similarity=0.334 Sum_probs=60.6
Q ss_pred EEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccC-CCcceeEeCCCCCC-CC--CccceEeccceeEec
Q psy14029 166 EFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTN-EPFDKCYIGATPEL-DE--ERVFCGQMAAIYLFN 241 (277)
Q Consensus 166 ~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~-~~~~~~~IG~~~~~-~~--~~~f~g~lg~~~lf~ 241 (277)
.-+|++=|||||+... .++.++||||+.+.+..+. ..... ....++|||+.... +. ..+. -+-.|.|||
T Consensus 639 tWe~~k~yqVal~L~~----G~~gsvYVDG~~vg~~~~~-l~~~~~~~IshFyiGgdg~~~~~~~~~~V--TV~NVlLYN 711 (780)
T PTZ00334 639 NWEPETTHQVAIVLRN----GKQGSAYVDGQRVGDASCE-LKNTDSKGISHFYIGGDGGSAGSKEDVPV--TATNVLLYN 711 (780)
T ss_pred cccCCCeEEEEEEEeC----CCeEEEEECCEEecCcccc-cCCCCCcccceEEECCCccccccCCCCCE--EEeEeEEeC
Confidence 3567888999999863 4689999999999776654 32221 23568999997532 11 2233 788999999
Q ss_pred CCCCHHHHHHHHh
Q psy14029 242 ESLSTHQICAMHR 254 (277)
Q Consensus 242 ~aLs~~~I~~iy~ 254 (277)
++|+++||++|..
T Consensus 712 RpL~~~Ei~~l~~ 724 (780)
T PTZ00334 712 RPLDDNEIRVLNA 724 (780)
T ss_pred CCCCHHHHHhhhc
Confidence 9999999999986
No 14
>KOG1834|consensus
Probab=97.41 E-value=0.0043 Score=60.04 Aligned_cols=167 Identities=17% Similarity=0.239 Sum_probs=98.2
Q ss_pred eEeecCCCcccEECCCC--CCCCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeC---CCCeEEEEEeCCeEEEEEee
Q psy14029 79 FFSFPGKKGSAMVLPPL--AKWPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTS---KGVGYTAHFVGNCLVLTSMK 153 (277)
Q Consensus 79 ff~F~g~~~s~i~lp~~--~~~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~---~g~g~e~~~~~~~lvv~~~~ 153 (277)
.|.|+|. .+..+|.- +..+ +.-||+++|.+-..-+... ...+.+|++-.+. +..-+.+|+.+=+|.+-...
T Consensus 344 i~eFdG~--qgv~vpdg~~~g~l-~dhFTlSfwMkHg~~p~~~-~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~ 419 (952)
T KOG1834|consen 344 IFEFDGT--QGVTVPDGNVSGSL-PDHFTLSFWMKHGPGPKDE-QSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRR 419 (952)
T ss_pred EEEEcCc--eeeEccCCCCCCCC-CCceEEEEeeecCCCCccc-cccceeEEecccccccccceeEEEEeccEEEEEEcc
Confidence 7999999 68888852 2333 4899999999988754311 1234456554322 23346777766555443222
Q ss_pred ecC--ccee----eeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc-ccccc-CCCcceeEeCCCCCCC
Q psy14029 154 IKG--KGFQ----HCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA-WFVST-NEPFDKCYIGATPELD 225 (277)
Q Consensus 154 ~~~--~~~~----~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~-~~p~~-~~~~~~~~IG~~~~~~ 225 (277)
..| +.+. .+.-.++-.++|||-++-.. -..|++|+||.--.-..+. -+|-= ...-.+..||.+=...
T Consensus 420 d~~~~~~fRpaef~Wkl~qVCD~EWH~Y~ln~e-----fp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW~g~ 494 (952)
T KOG1834|consen 420 DAGATSDFRPAEFHWKLPQVCDNEWHHYVLNVE-----FPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACWQGR 494 (952)
T ss_pred CccccccccchheeccchhhhhhhhheeEEeec-----CceEEEEEcCcccCCceeccCCccCcccccceeEEeeeccCc
Confidence 111 1111 11123356789999999887 4469999999765432111 01210 0112345788763222
Q ss_pred C------CccceEeccceeEecCCCCHH-HHHHHHh
Q psy14029 226 E------ERVFCGQMAAIYLFNESLSTH-QICAMHR 254 (277)
Q Consensus 226 ~------~~~f~g~lg~~~lf~~aLs~~-~I~~iy~ 254 (277)
. ..-|.|+|+++++-...|-.+ .|.-+|.
T Consensus 495 ~~~~l~~aqfFrG~LasltlrsGkles~kvI~Cl~t 530 (952)
T KOG1834|consen 495 QQKPLKLAQFFRGQLASLTLRSGKLESEKVIKCLYT 530 (952)
T ss_pred cccchhHHHHhhcccceeEEeccccchHHHHHHHHh
Confidence 1 235899999999999987554 4566665
No 15
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=97.38 E-value=0.003 Score=49.67 Aligned_cols=100 Identities=14% Similarity=0.090 Sum_probs=65.4
Q ss_pred EEEEEeCCeEEEEEeeecCcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccC-C-Ccce
Q psy14029 138 YTAHFVGNCLVLTSMKIKGKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTN-E-PFDK 215 (277)
Q Consensus 138 ~e~~~~~~~lvv~~~~~~~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~-~-~~~~ 215 (277)
+.+.+.++.+.++.....+ .........+.+|+||+|.+... ...+.+.|||....+...+ ..... . ....
T Consensus 22 ial~L~~G~l~~~~~~G~~-~~~~~~~~~i~dg~wh~v~~~r~-----~~~~~L~Vd~~~~~~~~s~-~~~~~~l~~~~~ 94 (131)
T PF00054_consen 22 IALELRDGRLEFRYNLGSG-PASLRSPQKINDGKWHTVSVSRN-----GRNGSLSVDGEEVVTGESP-SGATQSLDVDGP 94 (131)
T ss_dssp EEEEEETTEEEEEEESSSE-EEEEEESSETTSSSEEEEEEEEE-----TTEEEEEETTSEEEEEEEC-SSSSSSCEECSE
T ss_pred EEEEEECCEEEEEEeCCCc-cceecCCCccCCCcceEEEEEEc-----CcEEEEEECCccceeeecC-CccccccccccC
Confidence 5666778887776543332 22223344589999999999887 6789999999998444433 22221 1 1134
Q ss_pred eEeCCCCCC-------CCCccceEeccceeEecCCC
Q psy14029 216 CYIGATPEL-------DEERVFCGQMAAIYLFNESL 244 (277)
Q Consensus 216 ~~IG~~~~~-------~~~~~f~g~lg~~~lf~~aL 244 (277)
.+||+.|.. ....+|.|.|..+.+-++.+
T Consensus 95 lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in~~~l 130 (131)
T PF00054_consen 95 LYVGGLPSSSSRPRPLPISPGFKGCIRNLSINGKPL 130 (131)
T ss_dssp EEESSSSTTTGCGSSCSCCSB-EEEEEEEEETTEEC
T ss_pred EEEccCCchhhcccccccCCCeeEEEEEeEECCEEC
Confidence 799998721 12468999999999977654
No 16
>KOG3516|consensus
Probab=96.84 E-value=0.029 Score=57.86 Aligned_cols=130 Identities=13% Similarity=0.103 Sum_probs=88.2
Q ss_pred cceEeecCCCcccEECCCCCCCCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCCCeEEEEEeCCeEEEEEeeecC
Q psy14029 77 DVFFSFPGKKGSAMVLPPLAKWPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKGVGYTAHFVGNCLVLTSMKIKG 156 (277)
Q Consensus 77 ~~ff~F~g~~~s~i~lp~~~~~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g~g~e~~~~~~~lvv~~~~~~~ 156 (277)
..=+.|.++. +.+.+|+. |....+++|+-+|.-.... .-|++....++..+++++.++.+.+.+.....
T Consensus 364 ~~pvtF~~ss-s~~~lpg~---~~~~~l~vSF~FRtw~~~G-------~ll~~~~~e~~g~v~~fl~eg~~~~~i~~~~r 432 (1306)
T KOG3516|consen 364 IIPVTFGNSS-SYLRLPGN---PNPDRLSVSFQFRTWNKTG-------LLLFSELKEGSGEVLLFLKEGKKFLQITQIGR 432 (1306)
T ss_pred CCCeEecccc-eeEEcCCC---CCCCceeeEEEEEeccccC-------ceeeeeeccCCceEEEEEeCCeEEEEEecccc
Confidence 3447887774 78999975 5568888888888766543 45777776666778999998875544332221
Q ss_pred cceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCC
Q psy14029 157 KGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPE 223 (277)
Q Consensus 157 ~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~ 223 (277)
+.........+..|+||.+.+... .+.+.+-|||....-.+.. .|-.-..-...++|+.+.
T Consensus 433 ~~~~~~~g~~lnDG~WHsv~~~ak-----~n~~~~~iDd~~~~~~~~~-~p~~V~tg~tY~fgg~~~ 493 (1306)
T KOG3516|consen 433 SKADAYAGLKLNDGAWHSVSFNAK-----KNRLVLMIDDGEAEIAPDS-KPLQVYTGTTYYFGGCPD 493 (1306)
T ss_pred chhhhcccccCCCCceEEEEEEee-----cceeEEEEcCcccccccCC-ccEEEEeCCeeEeccccc
Confidence 112223457799999999999887 5678999999998666554 343222334568888654
No 17
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=94.99 E-value=0.052 Score=44.73 Aligned_cols=37 Identities=24% Similarity=0.435 Sum_probs=28.7
Q ss_pred eeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 163 VKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 163 ~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
......+++|||+.|+.. .+.+++||||+++.+..-.
T Consensus 120 ~~~~~~~~~W~~~~I~~~-----g~~i~v~vnG~~v~~~~d~ 156 (185)
T PF06439_consen 120 VNVAIPPGEWNTVRIVVK-----GNRITVWVNGKPVADFTDP 156 (185)
T ss_dssp S--S--TTSEEEEEEEEE-----TTEEEEEETTEEEEEEETT
T ss_pred ccccCCCCceEEEEEEEE-----CCEEEEEECCEEEEEEEcC
Confidence 356789999999999998 7789999999999887643
No 18
>KOG1788|consensus
Probab=94.93 E-value=0.0058 Score=62.23 Aligned_cols=100 Identities=16% Similarity=0.055 Sum_probs=71.8
Q ss_pred EeCCcEEEEEEEEEecc-CCCCeEEEEECCEEeeeeecccccccCCCc-------cee---EeCCC-CCCCCCccceEec
Q psy14029 167 FQPRKWYMIAIVYIYNR-WTKSEIKCFVNGQLASSTEMAWFVSTNEPF-------DKC---YIGAT-PELDEERVFCGQM 234 (277)
Q Consensus 167 ~~~~~W~hlavv~~~~~-~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~-------~~~---~IG~~-~~~~~~~~f~g~l 234 (277)
..||+|.|+|++.++.. .+.+.+..|+||....++++. |..-++.+ ..| .||+. +.....+....++
T Consensus 1274 qlpGkklifAfdgtcseflraSgvfslLNlvdplsaaas-yiggiprfGrlvgnvstcrqnviggTirpvggmsvvlalv 1352 (2799)
T KOG1788|consen 1274 QLPGKKLIFAFDGTCSEFLRASGVFSLLNLVDPLSAAAS-YIGGIPRFGRLVGNVSTCRQNVIGGTIRPVGGMSVVLALV 1352 (2799)
T ss_pred hcCCceEEEEEeccHHHHHhhccceeeecccChhhHHHh-hhcCCcchhhhhhhhhheehhhhcccccccCchHHHHHHH
Confidence 56899999999999855 478899999999999998887 54433211 122 24432 3333334444566
Q ss_pred cceeEecCCCCHHHHHHHHhCCCCCccCCCCCc
Q psy14029 235 AAIYLFNESLSTHQICAMHRLGPGYKVKTGWVE 267 (277)
Q Consensus 235 g~~~lf~~aLs~~~I~~iy~lGp~y~~~fq~~~ 267 (277)
.+...-++...+..+.+.-.+-|.|.+..+..+
T Consensus 1353 eaaesremlhmAlsllAcalhQnhqnvkdmtts 1385 (2799)
T KOG1788|consen 1353 EAAESREMLHMALSLLACALHQNHQNVKDMTTS 1385 (2799)
T ss_pred HHHHhhhhhHHHHHHHHHHhhcCcccccchhhc
Confidence 777777788888899999999999999887643
No 19
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=94.40 E-value=0.95 Score=38.61 Aligned_cols=103 Identities=14% Similarity=0.181 Sum_probs=59.4
Q ss_pred CCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCC-----CeEEEEEeCCeEEEEEeeec-Cc---ceeeeeeeEEe
Q psy14029 98 WPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKG-----VGYTAHFVGNCLVLTSMKIK-GK---GFQHCVKYEFQ 168 (277)
Q Consensus 98 ~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g-----~g~e~~~~~~~lvv~~~~~~-~~---~~~~~~~~~~~ 168 (277)
.+....+-++..++++.--.. ......|+.+....+ -.+.+.+.++.+.+...... .. .........+.
T Consensus 73 ~~~g~~~~y~~s~~lp~~~~~--~~~~~~i~Q~~~~~~~~~~~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (224)
T PF14099_consen 73 FKNGDEYWYSFSFYLPPDFPF--SNQWTIIAQWHSSGGGSTGSPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVE 150 (224)
T ss_dssp EETTEEEEEEEEEEE-TTS----TTSEEEEEEEEEE---TTEEECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-
T ss_pred ccCCCeEEEEEEEEcCCCCcc--CCCcEEEEEEEECCCCCCCCCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcC
Confidence 344566888888888653221 122345555554431 13566777888776543322 10 11112235577
Q ss_pred CCcEEEEEEEEEeccCCCCeEEEEECCEEeeeee
Q psy14029 169 PRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTE 202 (277)
Q Consensus 169 ~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~ 202 (277)
.|+||+++|-...+.-....+.+++||+++...+
T Consensus 151 ~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~~~ 184 (224)
T PF14099_consen 151 RGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVDYK 184 (224)
T ss_dssp TTSEEEEEEEEEE-CCCTEEEEEEECCEECCEEE
T ss_pred CCcEEEEEEEEEECCCCCEEEEEEECCEEEEEEe
Confidence 8999999987775544556899999999997654
No 20
>KOG3516|consensus
Probab=93.33 E-value=1.4 Score=46.05 Aligned_cols=77 Identities=18% Similarity=0.238 Sum_probs=51.9
Q ss_pred eeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccC--CCcceeEeCCCCCCCCCccceEeccceeE
Q psy14029 162 CVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTN--EPFDKCYIGATPELDEERVFCGQMAAIYL 239 (277)
Q Consensus 162 ~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~--~~~~~~~IG~~~~~~~~~~f~g~lg~~~l 239 (277)
....++.+++||||++.++ ..+.+|-|||......+.. .+... .-..+-+||+..+ ..+.|.|.|-+++|
T Consensus 862 ~s~t~~nD~qWH~V~~Ern-----~K~a~LqVD~~~~~~r~sp-~~~~~~L~l~s~l~vGgt~~--~~~gF~GCIRsl~L 933 (1306)
T KOG3516|consen 862 RSPTELNDNQWHQVRAERN-----SKEASLQVDGLPKSIRTSP-IPGTRLLQLYSSLFVGGTVS--RQRGFLGCIRSLQL 933 (1306)
T ss_pred cCCcccCCCceEEEEEEec-----cccceEEEcCcccceecCC-CCCEEEEEeccceecccccc--CcCcceeeeeeeee
Confidence 3456899999999999998 5689999999877665543 11100 1123567998653 34478888888887
Q ss_pred ecCCCCH
Q psy14029 240 FNESLST 246 (277)
Q Consensus 240 f~~aLs~ 246 (277)
=-+.|.-
T Consensus 934 NGv~ldL 940 (1306)
T KOG3516|consen 934 NGVMLDL 940 (1306)
T ss_pred cceeeee
Confidence 5444443
No 21
>KOG3514|consensus
Probab=88.95 E-value=4.1 Score=42.42 Aligned_cols=150 Identities=17% Similarity=0.164 Sum_probs=80.2
Q ss_pred eEeecCCCcccEECCCCCCCCCCCceeEEEEEEecCCCCCCCCCCCceEEEEEeCCC-C--eEEEEEeCCeEEEEEeeec
Q psy14029 79 FFSFPGKKGSAMVLPPLAKWPHENGFTFTTWFRLDPINSVNIEREKPYLYSFKTSKG-V--GYTAHFVGNCLVLTSMKIK 155 (277)
Q Consensus 79 ff~F~g~~~s~i~lp~~~~~p~~~GfTfs~Wi~le~~~~~~~~~~~~~Lfs~~~~~g-~--g~e~~~~~~~lvv~~~~~~ 155 (277)
-|+|+|...+.-..| +|.....=++++=++-+..+ ..-+.+++| . -+|+-++++.|-++..-.
T Consensus 25 g~~l~Ga~~s~ary~---kW~~~~~g~ls~e~kt~q~~----------glllytDdGgt~df~eL~lveG~lrLrf~Lg- 90 (1591)
T KOG3514|consen 25 GIILTGAPDSYARYP---KWAHSFEGSLSMELKTRQSD----------GLLLYTDDGGTHDFYELTLVEGHLRLRFRLG- 90 (1591)
T ss_pred ceEecCCCcchhhch---hhhcccCceeeeeeeccCCC----------cEEEEecCCCceeeeEEEEecceEEEEEEec-
Confidence 377877743444433 67753222333333322211 222233333 2 356666777766554333
Q ss_pred CcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeeccc---ccccCCCcceeEeCCCCCC--------
Q psy14029 156 GKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAW---FVSTNEPFDKCYIGATPEL-------- 224 (277)
Q Consensus 156 ~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~---~p~~~~~~~~~~IG~~~~~-------- 224 (277)
+..........+..++||.|.|..+ --+.++.|||.++....... +-++ ....+||+.|.-
T Consensus 91 ~~~~~~q~~~~i~D~~WH~v~i~r~-----~e~t~L~vDgv~~~~~~~~~~f~fg~i---asdvfVGGlP~~~~la~l~l 162 (1591)
T KOG3514|consen 91 NSNEFGQRRVRIDDDKWHTVTIFRS-----WENTKLEVDGVLVFKILNQRSFVFGNI---ASDVFVGGLPNMHMLAVLSL 162 (1591)
T ss_pred CCCceeeecceecCCceeEEEEEec-----cccceEEechhhhhhhhhcceeeeeee---ehheeecCCChHHhhhhhcC
Confidence 2233344568899999999999876 34689999998775432210 2222 224589997631
Q ss_pred ---CCCccceEeccceeEecC---CCCHHHHH
Q psy14029 225 ---DEERVFCGQMAAIYLFNE---SLSTHQIC 250 (277)
Q Consensus 225 ---~~~~~f~g~lg~~~lf~~---aLs~~~I~ 250 (277)
.-..+|.|.+-....=+. .++++.++
T Consensus 163 p~v~yep~frg~~rnl~y~~~p~g~t~~q~l~ 194 (1591)
T KOG3514|consen 163 PLVRYEPRFRGNVRNLMYRQYPQGVTSPQLLE 194 (1591)
T ss_pred cccccccccCccceeeeeecCCCCcCChhhhh
Confidence 123466666655544333 24455554
No 22
>KOG4289|consensus
Probab=85.04 E-value=5.7 Score=42.81 Aligned_cols=105 Identities=11% Similarity=0.149 Sum_probs=66.5
Q ss_pred eEEEEEeCCCCeEEEEEeCCeEEEEEeeecCcceeeeeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeeccc
Q psy14029 126 YLYSFKTSKGVGYTAHFVGNCLVLTSMKIKGKGFQHCVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAW 205 (277)
Q Consensus 126 ~Lfs~~~~~g~g~e~~~~~~~lvv~~~~~~~~~~~~~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~ 205 (277)
.|+.....+...+.+.+.++.|-+++.. .........+..|.|||+.|-.. ...+.-..+|.....+++.
T Consensus 1578 vl~~~~~~~rst~~lqld~g~l~~~v~~----s~v~L~~~~vtdg~Wh~~~i~l~-----~d~~~t~d~g~~~aea~~g- 1647 (2531)
T KOG4289|consen 1578 VLMQAEFGGRSTYNLQLDDGTLKYNVGD----SSVELPAPRVTDGHWHHLVIELE-----ADSVATLDYGIYQAEAKAG- 1647 (2531)
T ss_pred EEEEEEeCCCceEEEEEcCCEEEEEecC----ceEEccCccccCCchhheeeeec-----cCeEEEEechhhhhhhhcC-
Confidence 4444433333334555555556555321 11123456688999999999876 3466777888888888776
Q ss_pred ccccCCCcceeEeCCCCCCCCCccceEeccceeEecC
Q psy14029 206 FVSTNEPFDKCYIGATPELDEERVFCGQMAAIYLFNE 242 (277)
Q Consensus 206 ~p~~~~~~~~~~IG~~~~~~~~~~f~g~lg~~~lf~~ 242 (277)
.+.+ ...+.++|+.|.....+.|.|.|-.|.+=..
T Consensus 1648 l~gl--~l~sl~vGgap~~g~p~gf~GCiqgV~v~g~ 1682 (2531)
T KOG4289|consen 1648 LSGL--NLESLYVGGAPATGVPRGFRGCIQGVRVGGV 1682 (2531)
T ss_pred CCCc--eeeEEEEccccCCCccccchhhhhceEECCE
Confidence 5544 3456789998755556789988887777543
No 23
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=75.87 E-value=29 Score=28.99 Aligned_cols=28 Identities=14% Similarity=0.276 Sum_probs=24.0
Q ss_pred CcEEEEEEEEEeccCCCCeEEEEECCEEeee
Q psy14029 170 RKWYMIAIVYIYNRWTKSEIKCFVNGQLASS 200 (277)
Q Consensus 170 ~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s 200 (277)
...|-..|++.+. .+..++||||+++.+
T Consensus 92 ~gyH~Y~i~~~p~---~~tASfy~DG~lI~t 119 (198)
T PF09264_consen 92 HGYHKYEIVFSPL---TNTASFYFDGTLIAT 119 (198)
T ss_dssp CSEEEEEEEEETT---TTEEEEEETTEEEEE
T ss_pred cceeEEEEEecCC---CCceEEEECCEEEee
Confidence 4688889999873 569999999999997
No 24
>KOG3514|consensus
Probab=73.36 E-value=44 Score=35.34 Aligned_cols=82 Identities=15% Similarity=0.250 Sum_probs=54.0
Q ss_pred eeeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccC--CCcceeEeCCCCCCC----CCccceEecc
Q psy14029 162 CVKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTN--EPFDKCYIGATPELD----EERVFCGQMA 235 (277)
Q Consensus 162 ~~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~--~~~~~~~IG~~~~~~----~~~~f~g~lg 235 (277)
..++.|..+.||-|.|..+ ...+++-|||..-...... |... .....+++|+.+... .+-+|.|.|-
T Consensus 326 p~~~rfdD~~WH~V~v~R~-----~~m~t~~VDg~~t~~~~~a--~~~tmlsss~~fyvgg~~~~~~l~gsrVsF~GClk 398 (1591)
T KOG3514|consen 326 PNSFRFDDDSWHTVIVERS-----LQMMTLIVDGRRTEIRQYA--PELTMLSSSDFFYVGGSPNTADLPGSRVSFMGCLK 398 (1591)
T ss_pred cccccccCCcceEEEEEee-----eEEEEEEEccEEecccccc--cceeEeeccceEEecCCCCccccCCCceeeeeeee
Confidence 4467899999998877655 5689999999998765432 2221 122246788875321 1235899998
Q ss_pred ceeEecCCCCHHHHH
Q psy14029 236 AIYLFNESLSTHQIC 250 (277)
Q Consensus 236 ~~~lf~~aLs~~~I~ 250 (277)
.|..=++++--+-+.
T Consensus 399 kV~y~~d~~rl~L~~ 413 (1591)
T KOG3514|consen 399 KVVYKNDDTRLELSR 413 (1591)
T ss_pred eeEeccCceeehhhH
Confidence 887766665544443
No 25
>PHA02902 putative IMV membrane protein; Provisional
Probab=69.31 E-value=2.7 Score=28.56 Aligned_cols=23 Identities=43% Similarity=0.611 Sum_probs=19.1
Q ss_pred eccceeEecCCCCHHHHHHHHhC
Q psy14029 233 QMAAIYLFNESLSTHQICAMHRL 255 (277)
Q Consensus 233 ~lg~~~lf~~aLs~~~I~~iy~l 255 (277)
.+-.--+|++.|+++||+++|.|
T Consensus 43 ~l~~d~~F~D~lTpDQirAlHrl 65 (70)
T PHA02902 43 TLEDDPLFKDSLTPDQIKALHRL 65 (70)
T ss_pred cCCCCchhhccCCHHHHHHHHHH
Confidence 34455789999999999999975
No 26
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=60.63 E-value=4.8 Score=28.82 Aligned_cols=20 Identities=25% Similarity=0.469 Sum_probs=17.6
Q ss_pred ceeEecCCCCHHHHHHHHhC
Q psy14029 236 AIYLFNESLSTHQICAMHRL 255 (277)
Q Consensus 236 ~~~lf~~aLs~~~I~~iy~l 255 (277)
.--+|++.|+++||+++|.|
T Consensus 44 ~ds~F~D~lTpDQVrAlHRl 63 (92)
T PHA02681 44 GASSFEDKMTDDQVRAFHAL 63 (92)
T ss_pred CCchhhccCCHHHHHHHHHH
Confidence 44689999999999999986
No 27
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=59.69 E-value=2e+02 Score=29.04 Aligned_cols=61 Identities=13% Similarity=0.186 Sum_probs=38.1
Q ss_pred eeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCCCCCCccceEeccceeE
Q psy14029 164 KYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPELDEERVFCGQMAAIYL 239 (277)
Q Consensus 164 ~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~~~~~~~f~g~lg~~~l 239 (277)
...+..++||-|+++.+ ++.++-++||..+.+.....+|+ -..+.||+.. --.+|.+.+.|
T Consensus 605 ~~~~~~~~WhtltL~~~-----g~~~ta~lng~~l~~~~~~~~p~----~G~aaIGT~~------~~~~QFDNf~V 665 (669)
T PF02057_consen 605 TADIGAGKWHTLTLTIS-----GSTATAMLNGTVLWTDVDSSYPK----NGWAAIGTSS------FETAQFDNFSV 665 (669)
T ss_dssp E-S--TT-EEEEEEEEE-----TTEEEEEETTEEEEEEEE--SS-------EEEEEESS------S--EEEEEEEE
T ss_pred eecccCCeEEEEEEEEE-----CCEEEEEECCEEeEEecccCCCC----CceEEEEcCC------CceeEeeeeEE
Confidence 34578899999999987 77899999999998754322332 2467899863 12456666655
No 28
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=57.32 E-value=22 Score=20.45 Aligned_cols=25 Identities=8% Similarity=0.256 Sum_probs=18.9
Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHH
Q psy14029 25 LIDMMGVLASYSITVKELKLFFAAM 49 (277)
Q Consensus 25 l~~li~~L~s~sis~~elr~ll~ll 49 (277)
-++|+..--..++|..|+|.||..-
T Consensus 5 W~~Li~eA~~~Gls~eeir~FL~~~ 29 (30)
T PF08671_consen 5 WVELIKEAKESGLSKEEIREFLEFN 29 (30)
T ss_dssp HHHHHHHHHHTT--HHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHhC
Confidence 4577888888899999999999754
No 29
>KOG1219|consensus
Probab=56.12 E-value=1.2e+02 Score=35.38 Aligned_cols=89 Identities=22% Similarity=0.276 Sum_probs=57.9
Q ss_pred eEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCCCcceeEeCCCCC-----C-CCCccceEecccee
Q psy14029 165 YEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNEPFDKCYIGATPE-----L-DEERVFCGQMAAIY 238 (277)
Q Consensus 165 ~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~~~~~~~IG~~~~-----~-~~~~~f~g~lg~~~ 238 (277)
.....|+||.+++-.. +.-+++-+|..-..++....+-+....-.+.+.|+... . +....|.|.|+.+|
T Consensus 3758 ~~VnDgqWHsialerr-----r~~irlsvDd~~~~~atvPg~~~tln~d~hiy~Ga~vrlr~~~~tqvs~Gf~GCldsiy 3832 (4289)
T KOG1219|consen 3758 RTVNDGQWHSIALERR-----RNHIRLSVDDDTYDSATVPGMKSTLNLDTHIYLGALVRLRHQRSTQVSYGFDGCLDSIY 3832 (4289)
T ss_pred eEeecCceeEEEeecc-----CCceEEEEcccCceeeecccceeeccccceEEEeeEeeeccCCCccccccccceeeeEE
Confidence 6688999999999876 44577888887776665432222211224667787542 1 22357999999999
Q ss_pred EecCCCC-------HHHHHHHHhCCCC
Q psy14029 239 LFNESLS-------THQICAMHRLGPG 258 (277)
Q Consensus 239 lf~~aLs-------~~~I~~iy~lGp~ 258 (277)
+=+..|+ -+.+..++.|-|.
T Consensus 3833 Lng~el~l~~k~~s~a~~~el~~l~pg 3859 (4289)
T KOG1219|consen 3833 LNGMELPLTRKGKSVAGLMELFGLQPG 3859 (4289)
T ss_pred EccccccccCCCchhhhhhhhhccccc
Confidence 9776654 3446666666664
No 30
>PF08787 Alginate_lyase2: Alginate lyase; InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alignate. They cleave the glycosidic linkage of alignate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=52.09 E-value=1.6e+02 Score=25.46 Aligned_cols=101 Identities=11% Similarity=0.178 Sum_probs=59.3
Q ss_pred CCCCCCce-eEEEEEEecCCCCCCCC-CCCceEEEEEeCCCC---e-EEEEE------eCCeEEEEEeee--cCcceeee
Q psy14029 97 KWPHENGF-TFTTWFRLDPINSVNIE-REKPYLYSFKTSKGV---G-YTAHF------VGNCLVLTSMKI--KGKGFQHC 162 (277)
Q Consensus 97 ~~p~~~Gf-Tfs~Wi~le~~~~~~~~-~~~~~Lfs~~~~~g~---g-~e~~~------~~~~lvv~~~~~--~~~~~~~~ 162 (277)
.|....+= ++.+=+++++.++.... .++..+--+...++. - +.+++ ..+.+.+..... .+......
T Consensus 77 ~W~~~~~~~~l~a~l~V~~v~~~~~~~~~~vvigQIH~~~~~~~~pllkl~~~~~~~~~~G~l~~~~~~~~~~~~~~~~~ 156 (236)
T PF08787_consen 77 NWSFSSAHHTLEATLAVDQVPSGGKSSSGRVVIGQIHGKDGNSNEPLLKLYYRKEPGNEKGSLYVYVRQSNPDGGDQEYT 156 (236)
T ss_dssp -BGCCSCEEEEEEEEEEEE-TTTSE----EEEEEEEEESSSSSEEEEEEEEEEEC--TETTEEEEEEESSTTTTSEEEEE
T ss_pred eeecCCCCcEEEEEEEEeccCCCCCceeeEEEEEEEecCCCCCCCCEEEEEEEEeeeccCCeEEEEEeccCCCCCcEEee
Confidence 67766666 88888999998764210 112344445554422 2 35566 345566554311 11122222
Q ss_pred eeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeee
Q psy14029 163 VKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTE 202 (277)
Q Consensus 163 ~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~ 202 (277)
.-..+..|+|..+.|... ...+.|+|||+......
T Consensus 157 ~~~~i~LG~~F~y~I~v~-----~~~l~V~ing~~~~~~~ 191 (236)
T PF08787_consen 157 IYGGIPLGEWFSYEIEVS-----GGTLTVTINGEGKTTVT 191 (236)
T ss_dssp EEEEEETT-EEEEEEEEE-----TTEEEEEETTEEEEEEH
T ss_pred eEcceeCCCEEEEEEEEE-----CCEEEEEEECCcceEEE
Confidence 334688999999999987 56999999999887753
No 31
>cd00070 GLECT Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
Probab=49.03 E-value=1.2e+02 Score=23.20 Aligned_cols=36 Identities=28% Similarity=0.401 Sum_probs=30.2
Q ss_pred eeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 164 KYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 164 ~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
.+.|.+|+-+-|.|+.. .+...+||||+.+.+.+-.
T Consensus 71 ~~pf~~g~~F~l~i~~~-----~~~f~i~vng~~~~~F~~R 106 (127)
T cd00070 71 GFPFQPGQPFELTILVE-----EDKFQIFVNGQHFFSFPHR 106 (127)
T ss_pred CCCCCCCCeEEEEEEEc-----CCEEEEEECCEeEEEecCc
Confidence 47788999999999887 6789999999988776544
No 32
>KOG3546|consensus
Probab=43.96 E-value=1e+02 Score=30.89 Aligned_cols=76 Identities=11% Similarity=0.181 Sum_probs=46.5
Q ss_pred eeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccccCC-----CcceeEeCCCCCCCCCccceEecccee
Q psy14029 164 KYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVSTNE-----PFDKCYIGATPELDEERVFCGQMAAIY 238 (277)
Q Consensus 164 ~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~~~~-----~~~~~~IG~~~~~~~~~~f~g~lg~~~ 238 (277)
.+..-.++|.|+|+... .-.|++|||=+.-....+. ..+. +..-.++|.... .....|.|-|..+.
T Consensus 151 ~~p~~~~~w~~~a~~v~-----g~~v~l~v~cee~~r~p~~---rss~~l~~e~~ag~f~~~ag~-~~~~~f~g~~~~l~ 221 (1167)
T KOG3546|consen 151 RLPAFVGQWTHLALSVA-----GGFVALYVDCEEFQRMPLA---RSSRGLELEPGAGLFVAQAGG-ADPDKFQGVIAELK 221 (1167)
T ss_pred ccchhhchhhheeeeec-----CceEEEEechHHhcccchh---ccccceeecCCcceEEeccCC-CChHhhhhhhhhee
Confidence 34455799999999886 6799999997666555443 1211 111224444321 12336788888887
Q ss_pred EecCCCCHHH
Q psy14029 239 LFNESLSTHQ 248 (277)
Q Consensus 239 lf~~aLs~~~ 248 (277)
+-.++.+.++
T Consensus 222 v~~dp~~~~~ 231 (1167)
T KOG3546|consen 222 VRRDPQVSPM 231 (1167)
T ss_pred ecCCCCCCcc
Confidence 7777665554
No 33
>cd00413 Glyco_hydrolase_16 glycosyl hydrolase family 16. The O-Glycosyl hydrolases are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A glycosyl hydrolase classification system based on sequence similarity has led to the definition of more than 95 different families inlcuding glycosyl hydrolase family 16. Family 16 includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=43.08 E-value=66 Score=26.64 Aligned_cols=35 Identities=14% Similarity=0.272 Sum_probs=28.0
Q ss_pred eCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecccccc
Q psy14029 168 QPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMAWFVS 208 (277)
Q Consensus 168 ~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~~~p~ 208 (277)
....||=.++.-. ...+..||||+++.+.... .|.
T Consensus 140 ~~~~~H~Y~~~W~-----~~~i~~yvDG~~~~~~~~~-~p~ 174 (210)
T cd00413 140 PADDFHTYRVDWT-----PGEITFYVDGVLVATITNQ-VPD 174 (210)
T ss_pred CccCeEEEEEEEe-----CCEEEEEECCEEEEEECCC-CCC
Confidence 3679988888776 4689999999999998765 454
No 34
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=41.66 E-value=70 Score=21.57 Aligned_cols=18 Identities=22% Similarity=0.503 Sum_probs=14.4
Q ss_pred CeEEEEECCEEeeeeecc
Q psy14029 187 SEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 187 s~v~lYvDG~l~~s~~l~ 204 (277)
..+.+||||+.+...++.
T Consensus 11 ~gA~V~vdg~~~G~tp~~ 28 (71)
T PF08308_consen 11 SGAEVYVDGKYIGTTPLT 28 (71)
T ss_pred CCCEEEECCEEeccCcce
Confidence 358899999999876654
No 35
>PF14193 DUF4315: Domain of unknown function (DUF4315)
Probab=40.84 E-value=41 Score=24.26 Aligned_cols=26 Identities=31% Similarity=0.556 Sum_probs=23.4
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHHhh
Q psy14029 26 IDMMGVLASYSITVKELKLFFAAMKA 51 (277)
Q Consensus 26 ~~li~~L~s~sis~~elr~ll~ll~~ 51 (277)
+++++.+-++++||.+|-.||+..+.
T Consensus 37 ~EIv~~VR~~~mtp~eL~~~L~~~~~ 62 (83)
T PF14193_consen 37 LEIVQMVRSMKMTPEELAAFLRAMKS 62 (83)
T ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHh
Confidence 47999999999999999999999863
No 36
>cd02175 GH16_lichenase lichenase, member of glycosyl hydrolase family 16. Lichenase, also known as 1,3-1,4-beta-glucanase, is a member of glycosyl hydrolase family 16, that specifically cleaves 1,4-beta-D-glucosidic bonds in mixed-linked beta glucans that also contain 1,3-beta-D-glucosidic linkages. Natural substrates of beta-glucanase are beta-glucans from grain endosperm cell walls or lichenan from the Islandic moss, Cetraria islandica. This protein is found not only in bacteria but also in anaerobic fungi. This domain includes two seven-stranded antiparallel beta-sheets that are adjacent to one another forming a compact, jellyroll beta-sandwich structure.
Probab=40.78 E-value=68 Score=27.01 Aligned_cols=29 Identities=17% Similarity=0.358 Sum_probs=24.3
Q ss_pred CcEEEEEEEEEeccCCCCeEEEEECCEEeeeeec
Q psy14029 170 RKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEM 203 (277)
Q Consensus 170 ~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l 203 (277)
..||..+|--+ ...+..||||+++.+..-
T Consensus 138 ~~~H~Y~v~W~-----~~~i~~yvDg~~v~~~~~ 166 (212)
T cd02175 138 EGFHTYAFEWE-----PDSIRWYVDGELVHEATA 166 (212)
T ss_pred cccEEEEEEEe-----CCEEEEEECCEEEEEEcC
Confidence 57888888776 568999999999998865
No 37
>PF03701 UPF0181: Uncharacterised protein family (UPF0181); InterPro: IPR005371 This family contains small proteins of about 50 amino acids of unknown function. The family includes YoaH P76260 from SWISSPROT.
Probab=39.24 E-value=1.1e+02 Score=19.91 Aligned_cols=40 Identities=13% Similarity=0.215 Sum_probs=32.4
Q ss_pred ccCCCCCchHHHHHHHHHHHHHhcCCCCHHHHHHHHHH-Hh
Q psy14029 11 YSGLPHADPVVADLLIDMMGVLASYSITVKELKLFFAA-MK 50 (277)
Q Consensus 11 l~~l~~~~~~~~~~l~~li~~L~s~sis~~elr~ll~l-l~ 50 (277)
++.++.+.+.-.+..++=|+.|.+.++|.-|.-++... +|
T Consensus 2 ~~~lp~LtHeeQQ~AvE~Iq~LMaqGmSsgEAI~~VA~~iR 42 (51)
T PF03701_consen 2 FNDLPSLTHEEQQQAVERIQELMAQGMSSGEAIAIVAQEIR 42 (51)
T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHH
Confidence 46788888888889999999999999999886665543 44
No 38
>PF00722 Glyco_hydro_16: Glycosyl hydrolases family 16; InterPro: IPR000757 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 16 GH16 from CAZY comprises enzymes with a number of known activities; lichenase (3.2.1.73 from EC); xyloglucan xyloglucosyltransferase (2.4.1.207 from EC); agarase (3.2.1.81 from EC); kappa-carrageenase (3.2.1.83 from EC); endo-beta-1,3-glucanase (3.2.1.39 from EC); endo-beta-1,3-1,4-glucanase (3.2.1.6 from EC); endo-beta-galactosidase (3.2.1.103 from EC).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DGT_A 2CL2_A 2WLQ_A 2WNE_A 2W39_A 2W52_A 3ILN_A 4DFS_A 1UMZ_A 1UN1_B ....
Probab=37.68 E-value=91 Score=25.24 Aligned_cols=31 Identities=10% Similarity=0.354 Sum_probs=26.2
Q ss_pred CCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 169 PRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 169 ~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
...||-.++.-. ...|..||||+++.+....
T Consensus 117 ~~~~h~y~~~W~-----~~~i~fyiDg~~~~~~~~~ 147 (185)
T PF00722_consen 117 STDFHTYGFEWT-----PDRIRFYIDGKLVRTVTNS 147 (185)
T ss_dssp TTSEEEEEEEEE-----TTEEEEEETTEEEEEEESS
T ss_pred CCCcEEEEEEEe-----cCeEEEEECCEEEEEEecc
Confidence 378999888887 5689999999999998765
No 39
>smart00276 GLECT Galectin. Galectin - galactose-binding lectin
Probab=33.54 E-value=96 Score=23.84 Aligned_cols=36 Identities=22% Similarity=0.395 Sum_probs=30.5
Q ss_pred eeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 164 KYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 164 ~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
...|.+|+-.-|.|+.. .....+||||+...+.+-.
T Consensus 70 ~~Pf~~g~~F~l~i~~~-----~~~f~i~vng~~~~~f~~R 105 (128)
T smart00276 70 GFPFQPGQPFDLTIIVQ-----PDHFQIFVNGVHITTFPHR 105 (128)
T ss_pred CCCCCCCCEEEEEEEEc-----CCEEEEEECCEeEEEecCC
Confidence 57788999999999887 5789999999998877654
No 40
>cd08783 Death_MALT1 Death domain similar to that found in Mucosa-associated lymphoid tissue-lymphoma-translocation gene 1. Death domain (DD) similar to that found in Malt1 (mucosa-associated lymphoid tissue-lymphoma-translocation gene 1). Malt1, together with Bcl10 (B-cell lymphoma 10), are the integral components of the CBM signalosome. They associate with CARD9 to form M-CBM (CBM complex in myeloid immune cells) and with CARMA1 to form L-CBM (CBM complex in lymphoid immune cells), to mediate activation of NF-kB and MAPK by ITAM-coupled receptors expressed on immune cells. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins
Probab=32.74 E-value=2.1e+02 Score=21.26 Aligned_cols=38 Identities=13% Similarity=0.233 Sum_probs=21.2
Q ss_pred cCCCCHHHH-HHHHHHHhhcCC----------CCCccHHHHHHHHHhcc
Q psy14029 34 SYSITVKEL-KLFFAAMKAVNN----------KWPRHSTKLLNVLRQMP 71 (277)
Q Consensus 34 s~sis~~el-r~ll~ll~~~~~----------~~~~~~~~ll~~L~~m~ 71 (277)
.+.+|+.|+ +..++.+.++.. ..-+-...|+++|..|-
T Consensus 37 ~fr~S~~el~~cslkvl~p~gSPsk~LL~~~~~rg~Tv~~Ll~~L~~Mg 85 (97)
T cd08783 37 RFRLSCLDLEQCSLKVLEPEGSPSRSLLKLLGERGCTVTELSEFLQAME 85 (97)
T ss_pred ccccCHHHHHHHHHHHhcCCCCchHHHHHHHHHcCCcHHHHHHHHHHhh
Confidence 478888888 333333332110 01233567888888884
No 41
>PF12993 DUF3877: Domain of unknown function, E. rectale Gene description (DUF3877); InterPro: IPR024539 This entry represents proteins of unknown function found primarily in Firmicutes. The Eubacterium rectale gene appears to be upregulated in the presence of Bacteroides thetaiotaomicron compared to growth in pure culture [].
Probab=32.07 E-value=58 Score=26.82 Aligned_cols=27 Identities=7% Similarity=0.310 Sum_probs=23.9
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHh
Q psy14029 24 LLIDMMGVLASYSITVKELKLFFAAMK 50 (277)
Q Consensus 24 ~l~~li~~L~s~sis~~elr~ll~ll~ 50 (277)
-|-+||+.++.|++|-.|++.+|+...
T Consensus 96 Fik~lIe~v~~hgcT~e~I~~~F~~ys 122 (175)
T PF12993_consen 96 FIKELIELVGKHGCTLEDILELFHKYS 122 (175)
T ss_pred HHHHHHHHHhcCCcCHHHHHHHHHHhc
Confidence 467899999999999999999998764
No 42
>cd08023 GH16_laminarinase_like Laminarinase, member of the glycosyl hydrolase family 16. Laminarinase, also known as glucan endo-1,3-beta-D-glucosidase, is a glycosyl hydrolase family 16 member that hydrolyzes 1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans such as laminarins, curdlans, paramylons, and pachymans, with very limited action on mixed-link (1,3-1,4-)-beta-D-glucans.
Probab=31.74 E-value=1e+02 Score=26.20 Aligned_cols=32 Identities=16% Similarity=0.272 Sum_probs=25.2
Q ss_pred eCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 168 QPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 168 ~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
....||=-++.-+ ...|+.||||+++.+.+-.
T Consensus 156 ~~~~fHtY~~~W~-----p~~i~~yvDG~~v~~~~~~ 187 (235)
T cd08023 156 LSDDFHTYAVEWT-----PDKITFYVDGKLYFTYTNP 187 (235)
T ss_pred cCCCcEEEEEEEE-----CCEEEEEECCEEEEEEccc
Confidence 4578877777776 4689999999999988643
No 43
>PF12926 MOZART2: Mitotic-spindle organizing gamma-tubulin ring associated; InterPro: IPR024332 The MOZART2 family of proteins (also known as FAM128 and Mitotic-spindle organizing protein 2) operate as part of the gamma-tubulin ring complex, gamma-TuRC, one of the complexes necessary for chromosome segregation. This complex is located at centrosomes and mediates the formation of bipolar spindles in mitosis; it consists of six subunits. However, unlike the other four known subunits, the MOZART proteins, both 1 and 2, do not carry the conserved 'Spc97-Spc98' GCP domain, so the TUBGCP nomenclature cannot be used for it. The exact function of MOZART2 is not clear [].
Probab=30.60 E-value=1.6e+02 Score=21.51 Aligned_cols=43 Identities=14% Similarity=0.329 Sum_probs=30.5
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHhhcCCCCCccHHHHHHHHHhccC
Q psy14029 24 LLIDMMGVLASYSITVKELKLFFAAMKAVNNKWPRHSTKLLNVLRQMPQ 72 (277)
Q Consensus 24 ~l~~li~~L~s~sis~~elr~ll~ll~~~~~~~~~~~~~ll~~L~~m~~ 72 (277)
.|.+|. .++...+.|.-.|.++.+++-... | ..+...|++|..
T Consensus 29 EL~ELa-~~AGv~~dp~VFriildLL~~nVs--P---~AI~qmLK~m~s 71 (88)
T PF12926_consen 29 ELYELA-QLAGVPMDPEVFRIILDLLRLNVS--P---DAIFQMLKSMCS 71 (88)
T ss_pred HHHHHH-HHhCCCcChHHHHHHHHHHHcCCC--H---HHHHHHHHHHHc
Confidence 344544 468899999999999999985332 3 456667777753
No 44
>PF03750 DUF310: Protein of unknown function (DUF310); InterPro: IPR010149 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents the C-terminal domain of a minor family of CRISPR-associated proteins. These proteins are found adjacent to a characteristic short, palindromic repeat cluster termed CRISPR, a probable mobile DNA element.
Probab=29.33 E-value=1.5e+02 Score=22.71 Aligned_cols=33 Identities=18% Similarity=0.339 Sum_probs=24.7
Q ss_pred hHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhh
Q psy14029 19 PVVADLLIDMMGVLASYSITVKELKLFFAAMKA 51 (277)
Q Consensus 19 ~~~~~~l~~li~~L~s~sis~~elr~ll~ll~~ 51 (277)
....+..-++.+.|....+++.+||++|..++.
T Consensus 11 ~~~~~~Ae~~~k~l~~~~lttsQlRkf~~~v~~ 43 (119)
T PF03750_consen 11 ELLVDYAEKIAKELKKNKLTTSQLRKFYDEVKR 43 (119)
T ss_pred HHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Confidence 334455555666666678999999999999975
No 45
>PF02018 CBM_4_9: Carbohydrate binding domain; InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=29.08 E-value=2.4e+02 Score=20.78 Aligned_cols=29 Identities=3% Similarity=0.101 Sum_probs=19.3
Q ss_pred eEEeCCcEEEEEEEEEeccCCCCeEEEEEC
Q psy14029 165 YEFQPRKWYMIAIVYIYNRWTKSEIKCFVN 194 (277)
Q Consensus 165 ~~~~~~~W~hlavv~~~~~~~~s~v~lYvD 194 (277)
.....++|.++...++.. -....+.+|+-
T Consensus 97 ~~~~~~~W~~~s~~ft~~-~~~~~~~l~~~ 125 (131)
T PF02018_consen 97 TVTITGEWTKYSGTFTAP-SDDDTVRLYFE 125 (131)
T ss_dssp EEEETSSEEEEEEEEEEE-SSCEEEEEEEE
T ss_pred EEECCCCcEEEEEEEEEC-CCCceEEEEEE
Confidence 334469999999888865 33445666653
No 46
>PF03633 Glyco_hydro_65C: Glycosyl hydrolase family 65, C-terminal domain ; InterPro: IPR005194 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This family of glycosyl hydrolases (GH65 from CAZY) contains this domain and includes vacuolar acid trehalase and maltose phosphorylases. Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phosphate into beta-D-glucose-1-phosphate and glucose. The C-terminal domain forms a two layered jelly roll motif. This domain is situated at the base of the catalytic domain, however its function remains unknown [].; PDB: 1H54_A.
Probab=29.03 E-value=40 Score=21.72 Aligned_cols=29 Identities=14% Similarity=0.178 Sum_probs=12.7
Q ss_pred CcEEEEEEEEEecc---CCCC-eEEEEECCEEe
Q psy14029 170 RKWYMIAIVYIYNR---WTKS-EIKCFVNGQLA 198 (277)
Q Consensus 170 ~~W~hlavv~~~~~---~~~s-~v~lYvDG~l~ 198 (277)
|+|.+|.|.++.-. .++. .+.++++|+.+
T Consensus 18 g~~l~v~i~~~~v~v~~~~g~~~l~i~v~g~~~ 50 (54)
T PF03633_consen 18 GHWLEVEITHEKVTVTLLSGDAPLTIKVYGEEV 50 (54)
T ss_dssp TEEEEEEEETTEEEEEEEESS--EEEEETT---
T ss_pred CEEEEEEEECCEEEEEEccCCccEEEEECCCcc
Confidence 55555555443322 1233 67777777654
No 47
>PRK05114 hypothetical protein; Provisional
Probab=28.94 E-value=1.8e+02 Score=19.41 Aligned_cols=40 Identities=15% Similarity=0.282 Sum_probs=32.6
Q ss_pred ccCCCCCchHHHHHHHHHHHHHhcCCCCHHHHHHHHHH-Hh
Q psy14029 11 YSGLPHADPVVADLLIDMMGVLASYSITVKELKLFFAA-MK 50 (277)
Q Consensus 11 l~~l~~~~~~~~~~l~~li~~L~s~sis~~elr~ll~l-l~ 50 (277)
++.++.+.+.-.+..++=|+.|.+.++|.-|.-.+... +|
T Consensus 2 ~~~lp~LtHeeQQ~AVErIq~LMaqGmSsgEAI~~VA~eiR 42 (59)
T PRK05114 2 FAGLPSLTHEQQQKAVERIQELMAQGMSSGEAIALVAEELR 42 (59)
T ss_pred CCCcccCCHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHH
Confidence 56788888888889999999999999999886665544 55
No 48
>PF03170 BcsB: Bacterial cellulose synthase subunit; InterPro: IPR018513 An operon encoding 4 proteins required for bacterial cellulose biosynthesis (bcs) in Acetobacter xylinus (Gluconacetobacter xylinus) has been isolated via genetic complementation with strains lacking cellulose synthase activity []. Nucleotide sequence analysis showed the cellulose synthase operon to consist of 4 genes, designated bcsA, bcsB, bcsC and bcsD, all of which are required for maximal bacterial cellulose synthesis in A. xylinum. The calculated molecular mass of the protein encoded by bcsB is 85.3kDa []. BcsB encodes the catalytic subunit of cellulose synthase. The protein polymerises uridine 5'-diphosphate glucose to cellulose: UDP-glucose + (1,4-beta-D-glucosyl)(N) = UDP + (1,4-beta-D-glucosyl)(N+1). The enzyme is specifically activated by the nucleotide cyclic diguanylic acid. Sequence analysis suggests that BcsB contains several transmembrane (TM) domains, and shares a high degree of similarity with Escherichia coli YhjN.; GO: 0006011 UDP-glucose metabolic process, 0016020 membrane
Probab=27.88 E-value=84 Score=31.15 Aligned_cols=33 Identities=24% Similarity=0.427 Sum_probs=24.1
Q ss_pred EEEEEEEEEec--c-CCCCeEEEEECCEEeeeeecc
Q psy14029 172 WYMIAIVYIYN--R-WTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 172 W~hlavv~~~~--~-~~~s~v~lYvDG~l~~s~~l~ 204 (277)
---|-+-+.+. . ..++.+.+||||+.+.+.++.
T Consensus 343 ~i~l~L~y~y~~~~~~~~S~l~V~vNg~~i~s~~L~ 378 (605)
T PF03170_consen 343 GIPLHLRYRYTPGLDFDGSRLTVYVNGQFIGSLPLT 378 (605)
T ss_pred ceEEEEEEecCCCCCCCCcEEEEEECCEEEEeEECC
Confidence 33344444443 4 678899999999999999886
No 49
>PRK06789 flagellar motor switch protein; Validated
Probab=27.81 E-value=1.2e+02 Score=21.38 Aligned_cols=21 Identities=10% Similarity=0.218 Sum_probs=17.5
Q ss_pred CCCCeEEEEECCEEeeeeecc
Q psy14029 184 WTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 184 ~~~s~v~lYvDG~l~~s~~l~ 204 (277)
.....+.+|+||++++...+.
T Consensus 39 ~~~epvdI~vNg~lia~GEvV 59 (74)
T PRK06789 39 STKNTVRLMLENEEIGTGKIL 59 (74)
T ss_pred cCCCCEEEEECCEEEeEEeEE
Confidence 346689999999999998863
No 50
>PF09224 DUF1961: Domain of unknown function (DUF1961); InterPro: IPR015305 Members of this family are found in a set of hypothetical bacterial proteins. Their exact function has not, as yet, been determined. ; PDB: 1OQ1_C.
Probab=27.75 E-value=2e+02 Score=24.68 Aligned_cols=31 Identities=23% Similarity=0.144 Sum_probs=23.3
Q ss_pred eCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeec
Q psy14029 168 QPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEM 203 (277)
Q Consensus 168 ~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l 203 (277)
.+..|||+-|+.. +.+|+.-|||..+.+-.-
T Consensus 157 d~~~~Yr~~i~K~-----~~~v~f~In~L~vf~w~D 187 (218)
T PF09224_consen 157 DARGPYRMEIVKD-----GRTVRFSINGLPVFSWTD 187 (218)
T ss_dssp G--S-EEEEEEEE-----TTEEEEEETTEEEEEEE-
T ss_pred ccCCCEEEEEEEc-----CCEEEEEECCEEEEEEEc
Confidence 3568999999988 678999999999987643
No 51
>COG3140 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.49 E-value=1.9e+02 Score=19.14 Aligned_cols=40 Identities=15% Similarity=0.272 Sum_probs=31.8
Q ss_pred ccCCCCCchHHHHHHHHHHHHHhcCCCCHHHHHHHHHH-Hh
Q psy14029 11 YSGLPHADPVVADLLIDMMGVLASYSITVKELKLFFAA-MK 50 (277)
Q Consensus 11 l~~l~~~~~~~~~~l~~li~~L~s~sis~~elr~ll~l-l~ 50 (277)
++.++++.+.-.+..++=|+.|.+.++|.-|.-.+... +|
T Consensus 2 ~~~lp~LtHeqQQ~AVE~Iq~lMaeGmSsGEAIa~VA~elR 42 (60)
T COG3140 2 FAGLPSLTHEQQQKAVERIQELMAEGMSSGEAIALVAQELR 42 (60)
T ss_pred CCccccccHHHHHHHHHHHHHHHHccccchhHHHHHHHHHH
Confidence 45677888888888999999999999999876666544 44
No 52
>cd02183 GH16_fungal_CRH1_transglycosylase glycosylphosphatidylinositol-glucanosyltransferase. Group of fungal GH16 members related to Saccharomyces cerevisiae Crh1p. Chr1p and Crh2p are transglycosylases that are required for the linkage of chitin to beta(1-3)glucose branches of beta(1-6)glucan, an important step in the assembly of new cell wall. Both have been shown to be glycosylphosphatidylinositol (GPI)-anchored. A third homologous protein, Crr1p, functions in the formation of the spore wall. They belongs to the family 16 of glycosyl hydrolases that includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=24.25 E-value=1.7e+02 Score=24.68 Aligned_cols=28 Identities=14% Similarity=0.411 Sum_probs=22.3
Q ss_pred CcEEEEEEEEEeccCCCCeEEEEECCEEeeeee
Q psy14029 170 RKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTE 202 (277)
Q Consensus 170 ~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~ 202 (277)
..||--+|-=+ ...|..||||+++.+.+
T Consensus 115 ~dFHtY~veWt-----pd~I~~yVDG~~v~~~~ 142 (203)
T cd02183 115 EEFHTYTIDWT-----KDRITWYIDGKVVRTLT 142 (203)
T ss_pred cCcEEEEEEEe-----cCEEEEEECCEEEEEEe
Confidence 67777677654 67899999999998875
No 53
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=24.06 E-value=2.7e+02 Score=19.86 Aligned_cols=24 Identities=8% Similarity=0.396 Sum_probs=13.9
Q ss_pred eeEEeCCcEEEEEEEEEeccCCCCeEEEEEC
Q psy14029 164 KYEFQPRKWYMIAIVYIYNRWTKSEIKCFVN 194 (277)
Q Consensus 164 ~~~~~~~~W~hlavv~~~~~~~~s~v~lYvD 194 (277)
..-+.|..||+|... +..+.+||+
T Consensus 59 ~~~i~Pq~wH~V~p~-------s~D~~f~le 82 (82)
T PF09313_consen 59 PPVIEPQQWHRVEPL-------SDDLRFQLE 82 (82)
T ss_dssp EEEE-TT-EEEEEES-------STT-EEEEE
T ss_pred CceeCCCceEEEEEC-------CCCEEEEeC
Confidence 355889999998753 234666664
No 54
>TIGR02543 List_Bact_rpt Listeria/Bacterioides repeat. This model describes a conserved core region, about 43 residues in length, of at least two families of tandem repeats. These include 78-residue repeats from 2 to 15 in number, in some proteins of Bacteroides forsythus ATCC 43037, and 70-residue repeats in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few bacteria.
Probab=23.74 E-value=52 Score=20.54 Aligned_cols=13 Identities=31% Similarity=1.114 Sum_probs=10.9
Q ss_pred CCCCceeEEEEEE
Q psy14029 99 PHENGFTFTTWFR 111 (277)
Q Consensus 99 p~~~GfTfs~Wi~ 111 (277)
|...||+|..|..
T Consensus 1 P~k~GY~F~GWY~ 13 (44)
T TIGR02543 1 PTKPGYVFDGWYG 13 (44)
T ss_pred CCcCCcEeeeeEE
Confidence 4578999999984
No 55
>PF04167 DUF402: Protein of unknown function (DUF402); InterPro: IPR007295 This beta barrel domain is found in FomD, which is a predicted protein from a fosfomycin biosynthesis gene cluster in Streptomyces wedmorensis []. Its function is unknown.; PDB: 3EXM_A 3CBT_A 2P12_A.
Probab=22.84 E-value=87 Score=21.47 Aligned_cols=26 Identities=19% Similarity=0.506 Sum_probs=18.4
Q ss_pred EEeCCcEEEEEEEEEeccCCCCeEEEEEC
Q psy14029 166 EFQPRKWYMIAIVYIYNRWTKSEIKCFVN 194 (277)
Q Consensus 166 ~~~~~~W~hlavv~~~~~~~~s~v~lYvD 194 (277)
-+.++.||.+.+.+++. ......|+|
T Consensus 10 ~~~~~~~~~v~~~~~~~---~~~~~~Yvd 35 (72)
T PF04167_consen 10 FFPPGRWYNVTVYFDPD---GRFKGWYVD 35 (72)
T ss_dssp EEECCCTEEEEEEEETT---TECECEEEE
T ss_pred EEECCCCEEEEEEECCC---CcEEEEEEE
Confidence 46689999998888752 334557777
No 56
>PF12554 MOZART1: Mitotic-spindle organizing gamma-tubulin ring associated; InterPro: IPR022214 This family of proteins is found in eukaryotes. Proteins in this family are typically between 71 and 105 amino acids in length. There is a single completely conserved residue L that may be functionally important.
Probab=22.26 E-value=2e+02 Score=18.48 Aligned_cols=14 Identities=7% Similarity=0.176 Sum_probs=6.6
Q ss_pred CCCHHHHHHHHHHH
Q psy14029 36 SITVKELKLFFAAM 49 (277)
Q Consensus 36 sis~~elr~ll~ll 49 (277)
+++...|+-.++++
T Consensus 18 gLd~etL~ici~L~ 31 (48)
T PF12554_consen 18 GLDRETLSICIELC 31 (48)
T ss_pred CCCHHHHHHHHHHH
Confidence 44444444444444
No 57
>KOG2014|consensus
Probab=21.15 E-value=1.4e+02 Score=27.13 Aligned_cols=56 Identities=25% Similarity=0.435 Sum_probs=32.6
Q ss_pred HhcCCCCHHHHHHHHHHHhhcCCCCCccHH----HH-HHHHHhccCCCCC-cceEeecCCCcccE
Q psy14029 32 LASYSITVKELKLFFAAMKAVNNKWPRHST----KL-LNVLRQMPQRNGP-DVFFSFPGKKGSAM 90 (277)
Q Consensus 32 L~s~sis~~elr~ll~ll~~~~~~~~~~~~----~l-l~~L~~m~~~~~p-~~ff~F~g~~~s~i 90 (277)
+.+-++-|.++-+++.+.-.+ +++-+. .+ .++++.++....| .+||-|||..+.|+
T Consensus 263 l~s~~i~pd~~~~f~~~~~~e---f~Pv~AvVGGivaQevIk~isk~~~Pl~Nff~fdg~~g~g~ 324 (331)
T KOG2014|consen 263 LESETIIPDELLEFLSLIFTE---FAPVCAVVGGILAQEVIKAISKKGPPLNNFFIFDGETGKGP 324 (331)
T ss_pred ccccccCCchHHHHHHhcccc---cCchhhhhhhHhHHHHHHHhhcCCCcccceEEeecccCcee
Confidence 335777777776666666432 222211 12 5566666654444 68999999854443
No 58
>PF02828 L27: L27 domain; InterPro: IPR014775 The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7, as well as some protein kinases and human MPP2 protein.; PDB: 1ZL8_B 1VA8_A 3LRA_A 3UIT_A 1Y74_D 1RSO_B.
Probab=20.93 E-value=2.2e+02 Score=18.42 Aligned_cols=42 Identities=17% Similarity=0.279 Sum_probs=26.7
Q ss_pred HHHHHHHHHhcC-CCCHHHHHHHHHHHhhcCCCCCccHHHHHHHHHhcc
Q psy14029 24 LLIDMMGVLASY-SITVKELKLFFAAMKAVNNKWPRHSTKLLNVLRQMP 71 (277)
Q Consensus 24 ~l~~li~~L~s~-sis~~elr~ll~ll~~~~~~~~~~~~~ll~~L~~m~ 71 (277)
.++++++.|.+. +.+..+++.|-++|+ .++...|+++-..++
T Consensus 4 ~~~e~L~~L~~~~~~~~~~~~eL~~lL~------~p~~~aLl~~hD~va 46 (56)
T PF02828_consen 4 RVLELLEELQSLSSASQEDAQELQQLLQ------SPHFQALLEVHDKVA 46 (56)
T ss_dssp HHHHHHHHHHHHTSSTHHHHHHHHHHHH------SHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhccCCChHHHHHHHHHHc------CHHHHHHHHHHHHHH
Confidence 345555555432 233578888888887 346677777777665
No 59
>PF09479 Flg_new: Listeria-Bacteroides repeat domain (List_Bact_rpt); InterPro: IPR013378 This model describes a conserved core region of about 43 residues, which occurs in at least two families of tandem repeats. These include 78-residue repeats which occur from 2 to 15 times in some proteins of Tannerella forsythensis ATCC 43037, and 70-residue repeats found in families of internalins of Listeria species. Single copies are found in proteins of Fibrobacter succinogenes, Geobacter sulfurreducens, and a few other bacteria.; PDB: 2Y5P_B.
Probab=20.84 E-value=56 Score=19.55 Aligned_cols=15 Identities=33% Similarity=0.984 Sum_probs=11.3
Q ss_pred CCCCceeEEEEEEec
Q psy14029 99 PHENGFTFTTWFRLD 113 (277)
Q Consensus 99 p~~~GfTfs~Wi~le 113 (277)
|...||+|..|..-+
T Consensus 1 Ptr~Gy~F~GW~~~~ 15 (41)
T PF09479_consen 1 PTREGYTFDGWYTDS 15 (41)
T ss_dssp -EETTEEEEEEESST
T ss_pred CeeCCEEEEeEECCC
Confidence 346899999998654
No 60
>PHA03170 UL37 tegument protein; Provisional
Probab=20.77 E-value=1.1e+02 Score=26.93 Aligned_cols=19 Identities=32% Similarity=0.546 Sum_probs=15.4
Q ss_pred CCeEEEEECCEEeeeeecc
Q psy14029 186 KSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 186 ~s~v~lYvDG~l~~s~~l~ 204 (277)
...++||+|++++.+..+.
T Consensus 41 ~Ci~tCyvn~~Lv~~gsC~ 59 (293)
T PHA03170 41 ICISTCYVNGVLAGNSSCV 59 (293)
T ss_pred EEEEEEEECCEEEeecchh
Confidence 3457899999999988764
No 61
>PF00337 Gal-bind_lectin: Galactoside-binding lectin; InterPro: IPR001079 Galectins (also known as galaptins or S-lectin) are a family of proteins defined by having at least one characteristic carbohydrate recognition domain (CRD) with an affinity for beta-galactosides and sharing certain sequence elements. Members of the galectins family are found in mammals, birds, amphibians, fish, nematodes, sponges, and some fungi. Galectins are known to carry out intra- and extracellular functions through glycoconjugate-mediated recogntion. From the cytosol they may be secreted by non-classical pathways, but they may also be targeted to the nucleus or specific sub-cytosolic sites. Within the same peptide chain some galectins have a CRD with only a few additional amino acids, whereas others have two CRDs joined by a link peptide, and one (galectin-3) has one CRD joined to a different type of domain [, ]. The galectin carbohydrate recognition domain (CRD) is a beta-sandwich of about 135 amino acid. The two sheets are slightly bent with 6 strands forming the concave side and 5 strands forming the convex side. The concave side forms a groove in which carbohydrate is bound, and which is long enough to hold about a linear tetrasaccharide [, ].; GO: 0005529 sugar binding; PDB: 2WSU_B 2WT0_A 2WT1_A 2WT2_B 2WSV_A 1HLC_A 2ZGQ_A 3M3Q_B 1WW5_C 3M3E_A ....
Probab=20.57 E-value=3.9e+02 Score=20.28 Aligned_cols=37 Identities=27% Similarity=0.347 Sum_probs=29.9
Q ss_pred eeeEEeCCcEEEEEEEEEeccCCCCeEEEEECCEEeeeeecc
Q psy14029 163 VKYEFQPRKWYMIAIVYIYNRWTKSEIKCFVNGQLASSTEMA 204 (277)
Q Consensus 163 ~~~~~~~~~W~hlavv~~~~~~~~s~v~lYvDG~l~~s~~l~ 204 (277)
..+.|.+|+=.-|.|... .+...+||||+...+.+-.
T Consensus 75 ~~~pf~~g~~F~i~I~~~-----~~~f~I~vng~~~~~F~~R 111 (133)
T PF00337_consen 75 SPFPFQPGQPFEIRIRVE-----EDGFKIYVNGKHFCSFPHR 111 (133)
T ss_dssp SSTSSTTTSEEEEEEEEE-----SSEEEEEETTEEEEEEE-S
T ss_pred eeeeecCCceEEEEEEEe-----cCeeEEEECCeEEEEeeCc
Confidence 456788898888888887 6789999999999887654
Done!