Query psy17302
Match_columns 244
No_of_seqs 168 out of 1428
Neff 8.6
Searched_HMMs 46136
Date Fri Aug 16 18:30:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17302.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17302hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00054 Laminin_G_1: Laminin 99.9 3.2E-25 6.9E-30 169.7 16.6 123 55-179 1-131 (131)
2 KOG3514|consensus 99.9 8.4E-26 1.8E-30 211.3 15.4 191 26-220 439-660 (1591)
3 cd00110 LamG Laminin G domain; 99.9 5.5E-24 1.2E-28 165.7 18.5 144 27-174 1-151 (151)
4 smart00282 LamG Laminin G doma 99.9 8.9E-24 1.9E-28 162.1 16.7 126 48-176 2-135 (135)
5 KOG1219|consensus 99.9 5.5E-23 1.2E-27 202.1 15.4 213 9-231 3669-3911(4289)
6 KOG3516|consensus 99.9 7.7E-22 1.7E-26 187.5 16.9 189 27-221 790-993 (1306)
7 KOG3514|consensus 99.9 5E-22 1.1E-26 186.3 13.0 183 14-236 12-194 (1591)
8 PF02210 Laminin_G_2: Laminin 99.9 2.2E-21 4.7E-26 146.6 14.2 121 55-176 1-128 (128)
9 KOG4289|consensus 99.9 5.9E-21 1.3E-25 182.8 13.4 184 17-204 1310-1519(2531)
10 KOG3516|consensus 99.8 2.6E-18 5.6E-23 163.8 16.4 150 27-180 179-340 (1306)
11 KOG4289|consensus 99.6 1.1E-15 2.3E-20 147.4 12.4 211 10-232 1523-1765(2531)
12 smart00210 TSPN Thrombospondin 99.1 1.8E-08 3.8E-13 81.4 17.8 110 22-133 24-143 (184)
13 PF13385 Laminin_G_3: Concanav 98.6 8.5E-07 1.8E-11 68.1 12.6 145 27-178 2-151 (157)
14 cd00152 PTX Pentraxins are pla 98.3 6.9E-05 1.5E-09 61.3 17.1 151 24-181 5-168 (201)
15 smart00159 PTX Pentraxin / C-r 98.3 7.6E-05 1.7E-09 61.3 17.1 152 24-182 5-169 (206)
16 PF00354 Pentaxin: Pentaxin fa 98.0 0.00039 8.5E-09 56.6 15.2 124 28-158 3-134 (195)
17 PF02973 Sialidase: Sialidase, 97.9 0.00087 1.9E-08 54.0 13.9 111 47-157 33-155 (190)
18 smart00560 LamGL LamG-like jel 97.0 0.05 1.1E-06 41.2 13.8 66 107-178 61-130 (133)
19 KOG3509|consensus 95.7 0.0098 2.1E-07 58.5 3.9 156 60-219 267-441 (964)
20 KOG1834|consensus 95.3 0.36 7.9E-06 45.6 12.1 102 27-130 344-465 (952)
21 PF06439 DUF1080: Domain of Un 95.0 0.041 9E-07 43.7 4.8 97 36-134 44-154 (185)
22 PF14099 Polysacc_lyase: Polys 91.3 1.2 2.6E-05 36.6 7.9 61 72-132 112-182 (224)
23 PF00008 EGF: EGF-like domain 89.4 0.24 5.1E-06 27.8 1.4 27 189-215 1-30 (32)
24 smart00282 LamG Laminin G doma 81.1 1.3 2.9E-05 33.0 2.4 16 209-224 119-134 (135)
25 PF02057 Glyco_hydro_59: Glyco 80.2 43 0.00093 32.4 12.5 86 47-133 542-637 (669)
26 PF02210 Laminin_G_2: Laminin 79.5 1.5 3.3E-05 31.9 2.3 15 210-224 113-127 (128)
27 KOG3546|consensus 79.2 15 0.00033 35.2 8.9 125 48-173 88-222 (1167)
28 PF00054 Laminin_G_1: Laminin 77.6 1.9 4.2E-05 32.3 2.4 19 208-226 111-129 (131)
29 KOG1836|consensus 75.8 2.5 5.4E-05 45.0 3.3 57 98-160 1608-1664(1705)
30 cd00110 LamG Laminin G domain; 74.9 2.2 4.8E-05 32.2 2.1 15 209-223 137-151 (151)
31 PF07622 DUF1583: Protein of u 74.3 7.7 0.00017 34.8 5.4 33 101-133 84-116 (399)
32 KOG1219|consensus 62.9 39 0.00084 37.8 8.3 42 188-229 3905-3948(4289)
33 cd01951 lectin_L-type legume l 55.2 1.1E+02 0.0025 24.8 13.8 24 107-130 154-179 (223)
34 PF09224 DUF1961: Domain of un 53.7 1.2E+02 0.0027 24.9 8.3 30 103-132 155-184 (218)
35 PF09313 DUF1971: Domain of un 52.6 75 0.0016 21.9 6.1 47 70-116 21-74 (82)
36 PF08787 Alginate_lyase2: Algi 51.1 1.4E+02 0.0031 24.8 12.8 61 73-133 121-190 (236)
37 PF00139 Lectin_legB: Legume l 42.3 1.8E+02 0.0039 24.0 8.0 29 100-128 160-190 (236)
38 PF09264 Sial-lect-inser: Vibr 41.2 1.9E+02 0.0042 23.4 8.5 84 47-132 32-119 (198)
39 COG3615 TehB Uncharacterized p 39.3 1.4E+02 0.0031 21.3 6.2 55 70-124 32-93 (99)
40 cd06899 lectin_legume_LecRK_Ar 34.9 2.6E+02 0.0057 23.1 9.2 26 104-129 160-187 (236)
41 cd06903 lectin_EMP46_EMP47 EMP 30.7 3.1E+02 0.0066 22.6 13.6 29 103-131 147-177 (215)
42 PTZ00334 trans-sialidase; Prov 29.7 2.4E+02 0.0053 28.1 7.6 74 104-179 640-715 (780)
43 smart00179 EGF_CA Calcium-bind 29.6 34 0.00073 18.9 1.2 15 189-203 5-20 (39)
44 cd08023 GH16_laminarinase_like 29.0 1.8E+02 0.004 23.7 6.0 31 104-134 155-185 (235)
45 PF11025 GP40: Glycoprotein GP 28.9 2.7E+02 0.0058 21.3 6.2 57 76-132 12-75 (165)
46 PF11250 DUF3049: Protein of u 27.6 1.7E+02 0.0037 18.7 5.0 37 53-89 18-55 (56)
47 cd02178 GH16_beta_agarase Beta 25.2 2.1E+02 0.0044 24.1 5.8 28 106-133 178-206 (258)
48 PF05910 DUF868: Plant protein 24.8 1.8E+02 0.0038 25.1 5.1 28 105-132 153-187 (274)
49 PF07974 EGF_2: EGF-like domai 22.8 68 0.0015 17.8 1.6 25 192-216 6-30 (32)
50 KOG3838|consensus 21.4 4.3E+02 0.0093 24.2 6.9 58 74-131 152-211 (497)
51 PF08530 PepX_C: X-Pro dipepti 21.1 1.5E+02 0.0033 23.9 4.1 31 102-132 146-184 (218)
52 cd00070 GLECT Galectin/galacto 20.6 3.5E+02 0.0076 19.8 10.7 32 101-132 71-102 (127)
No 1
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.94 E-value=3.2e-25 Score=169.72 Aligned_cols=123 Identities=32% Similarity=0.553 Sum_probs=102.4
Q ss_pred EEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCeEEEEEcCccCCCCeEEEEEEEECcEEEEEEcCceee-ee
Q psy17302 55 FKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGAQILTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQT-RT 133 (244)
Q Consensus 55 frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~~~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~-~~ 133 (244)
|||.+++|+|||.++..+.||++|+|.+|+++++|+.|++...+.++..++||+||+|++.|..+.+.|+||+.... ..
T Consensus 1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~~~~~ 80 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGEEVVTGE 80 (131)
T ss_dssp EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTSEEEEEE
T ss_pred CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCccceeee
Confidence 89999999999998777779999999999999999999998888888889999999999999999999999998873 34
Q ss_pred cCCCceeeeecccCCceEEeccCCCCCCC-------CCccccceeEECCEEec
Q psy17302 134 SRGKEFRFGHVTSNSWVYIGGMPPWYNAK-------LTLLALPSVIFAGERVP 179 (244)
Q Consensus 134 ~~~~~~~~~~l~~~~~lyiGG~p~~~~~~-------~~~GCl~~v~ing~~l~ 179 (244)
.+... . ..++....|||||+|...... .+.|||+++.+|++++|
T Consensus 81 s~~~~-~-~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in~~~ld 131 (131)
T PF00054_consen 81 SPSGA-T-QSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSINGKPLD 131 (131)
T ss_dssp ECSSS-S-SSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred cCCcc-c-cccccccCEEEccCCchhhcccccccCCCeeEEEEEeEECCEECc
Confidence 43321 1 136778889999999322211 23999999999999875
No 2
>KOG3514|consensus
Probab=99.94 E-value=8.4e-26 Score=211.33 Aligned_cols=191 Identities=25% Similarity=0.417 Sum_probs=156.4
Q ss_pred eeeEecCCCceeEEecccCcccceEEEEEEEeCCCCeEEEEEcC--CCCccEEEEEEeCCcEEEEEEcCCCeEEEE-EcC
Q psy17302 26 HSFMLDGSQDSYAQLHKWNAALNGSLELEFKTEQPNGLILYTDD--GGTYDFFEIKLVEGALRLRYNLGSGAQILT-VGH 102 (244)
Q Consensus 26 ~~~~F~gs~~syl~~~~~~~~~~~~is~~frT~~~~GlLl~~~~--~~~~~~l~l~L~~G~l~~~~~~g~~~~~~~-s~~ 102 (244)
.+..|.- ++||+.+|+|......+|+|.|||+.|+|+|+|.+. ....||+++||.||+|.+.+++|++...++ +..
T Consensus 439 DpvtFtt-~es~l~LP~Wnt~~~gSiSf~FRTtepnGlil~~~g~~~~~~d~~A~ELldghlyl~ldlGSG~iklras~r 517 (1591)
T KOG3514|consen 439 DPVTFTT-PESYLTLPRWNTKKSGSISFDFRTTEPNGLILFHGGPQANATDYFAIELLDGHLYLLLDLGSGVIKLRASSR 517 (1591)
T ss_pred Cceeeec-ccceeeccccccCCcceeEEEEeecCCCceEEEccCcccccccEEEEEEeCCeEEEEEecCCceEEeeeecc
Confidence 4566775 689999999999889999999999999999999974 346799999999999999999999986666 678
Q ss_pred ccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCC----------CccccceeE
Q psy17302 103 DLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKL----------TLLALPSVI 172 (244)
Q Consensus 103 ~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~----------~~GCl~~v~ 172 (244)
++|||+||+|.+.|+++..+++||........++.+ ..+++++++|||-.+....-+. ..|||++++
T Consensus 518 kv~DGeWhhv~l~R~gR~gsvsVd~~~~df~tpG~s---~iL~ld~~mylG~~~n~l~~P~~vWta~L~~GyvGCirdl~ 594 (1591)
T KOG3514|consen 518 KVNDGEWHHVDLQRDGRTGSVSVDAIKTDFSTPGDS---EILDLDDPMYLGEVPNNLVYPSEVWTAALRKGYVGCIRDLF 594 (1591)
T ss_pred cccCCceEEEEeeccCccceEEEeeeecCccCCCcc---eeEeecCceeeccCCCCccCcHHHHHHHHhccchheehhhe
Confidence 999999999999999999999999998888777643 3589999999996655432221 289999999
Q ss_pred ECCEEecCcccc---------------cccccCCCCccCccceeec---ccCcccCCCeeeeEeee
Q psy17302 173 FAGERVPVWSRH---------------FQLVGGMPPWYNAKLTLLA---LPSVIFEPRFVGFIRNV 220 (244)
Q Consensus 173 ing~~l~l~~~~---------------~~~~~~~~pc~n~~~~~~~---~~~~~~~~~f~Gci~~l 220 (244)
|||+..++-..+ ++.+|..+||+|+|.|.-. +-+.....+|.|-.|+.
T Consensus 595 i~G~s~di~q~ae~q~sagvkpsCs~~~~~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 595 IDGVSTDIRQEAEAQNSAGVKPSCSLSNEKICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ecceehhhHHHhhhccccccCcccchhhccccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 999999864211 1239999999999999843 33444568888866654
No 3
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.93 E-value=5.5e-24 Score=165.71 Aligned_cols=144 Identities=31% Similarity=0.530 Sum_probs=117.5
Q ss_pred eeEecCCCceeEEecccCc-ccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCeEEEEEcCccC
Q psy17302 27 SFMLDGSQDSYAQLHKWNA-ALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGAQILTVGHDLN 105 (244)
Q Consensus 27 ~~~F~gs~~syl~~~~~~~-~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~~~~s~~~vn 105 (244)
++.|+|+ +|+.++.+.. ...++|+|.|||.+++|+|||.++....+|+.|+|.+|++.+.++.+.....+....+++
T Consensus 1 ~~~F~g~--~~i~~~~~~~~~~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~ 78 (151)
T cd00110 1 GVSFSGS--SYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLN 78 (151)
T ss_pred CeEeCCC--ceEEecCCCCCcceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEEEcCCcccEEEEccCccC
Confidence 3678885 9999998765 678999999999999999999986546899999999999999999985556676666899
Q ss_pred CCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCC------CCCCccccceeEEC
Q psy17302 106 DGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYN------AKLTLLALPSVIFA 174 (244)
Q Consensus 106 dg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~------~~~~~GCl~~v~in 174 (244)
||+||+|.+.+.++.++|.||+........... . ..+.....+||||.|.... ...+.|||++|+||
T Consensus 79 dg~Wh~v~i~~~~~~~~l~VD~~~~~~~~~~~~-~-~~~~~~~~~~iGg~~~~~~~~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGG-S-ALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred CCCEEEEEEEECCCEEEEEECCccEEeeeCCCC-c-eeecCCCCeEEcCCCCchhcccccccCCCceEeeEeEeC
Confidence 999999999999999999999974433222211 1 1356789999999998542 23469999999987
No 4
>smart00282 LamG Laminin G domain.
Probab=99.92 E-value=8.9e-24 Score=162.13 Aligned_cols=126 Identities=33% Similarity=0.566 Sum_probs=105.1
Q ss_pred ceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCeEEEEEc-CccCCCCeEEEEEEEECcEEEEEEc
Q psy17302 48 NGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGAQILTVG-HDLNDGHWHSVRVKRRVEKTSLTVD 126 (244)
Q Consensus 48 ~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~~~~s~-~~vndg~WH~V~v~~~~~~~~L~VD 126 (244)
+++|+|.|||.+++|+|||.++....+|+.++|.+|++.+.++.+++...++.. ..++||+||+|.+.+.++.+.|.||
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~~~l~VD 81 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRRVTLSVD 81 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCEEEEEEC
Confidence 468999999999999999997655789999999999999999999877777766 8999999999999999999999999
Q ss_pred Cceeee-ecCCCceeeeecccCCceEEeccCCCCC------CCCCccccceeEECCE
Q psy17302 127 NITQTR-TSRGKEFRFGHVTSNSWVYIGGMPPWYN------AKLTLLALPSVIFAGE 176 (244)
Q Consensus 127 ~~~~~~-~~~~~~~~~~~l~~~~~lyiGG~p~~~~------~~~~~GCl~~v~ing~ 176 (244)
+..... ..++. ...++....+||||.|+... ...+.|||++|+||++
T Consensus 82 ~~~~~~~~~~~~---~~~l~~~~~l~iGG~p~~~~~~~~~~~~~F~GCi~~v~in~~ 135 (135)
T smart00282 82 GENPVSGESPGG---LTILNLDGPLYLGGLPEDLKLPPLLVTPGFRGCIRNLKVNGK 135 (135)
T ss_pred CCccccEECCCC---ceEEecCCCcEEccCCchhcccccccCCCCeeEeeEEEECCC
Confidence 976443 33332 23466778999999997532 2335999999999984
No 5
>KOG1219|consensus
Probab=99.90 E-value=5.5e-23 Score=202.08 Aligned_cols=213 Identities=23% Similarity=0.316 Sum_probs=170.9
Q ss_pred hhHHHHHHHHhcccccceeeEecCCCceeEEecccCc-ccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEE
Q psy17302 9 TPCVLTMLWVLLLLPLGHSFMLDGSQDSYAQLHKWNA-ALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGALRL 87 (244)
Q Consensus 9 ~~~~~~~~~~~~~c~~~~~~~F~gs~~syl~~~~~~~-~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~ 87 (244)
+.|+||.+ -..+|+...++.+.| +||.++..... ..++++.|+.||.+++|++||.. ..++..|+|.+|.+.+
T Consensus 3669 ~~cVcP~g-r~g~C~g~~elS~tG--nSYveyrlse~~n~~~kl~frLkT~~sngIiM~tr---~~d~~iLkLv~G~~~l 3742 (4289)
T KOG1219|consen 3669 YTCVCPIG-RFGFCQGDFELSSTG--NSYVEYRLSENQNTRMKLGFRLKTLQSNGIIMYTR---KTDLAILKLVGGSPQL 3742 (4289)
T ss_pred eeEeccCc-ccccCCCcceEeecC--ceeEEEEcccccccceEEEEEEEecccCcEEEEEc---CCceEEEEecCCcEEE
Confidence 47888887 456788777888777 59999986543 34599999999999999999995 5689999999999999
Q ss_pred EEEcCCCeEEEEEc-CccCCCCeEEEEEEEECcEEEEEEcCceeeee-cCCCceeeeecccCCceEEeccCC-CCCC---
Q psy17302 88 RYNLGSGAQILTVG-HDLNDGHWHSVRVKRRVEKTSLTVDNITQTRT-SRGKEFRFGHVTSNSWVYIGGMPP-WYNA--- 161 (244)
Q Consensus 88 ~~~~g~~~~~~~s~-~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~-~~~~~~~~~~l~~~~~lyiGG~p~-~~~~--- 161 (244)
.++.|++..++.++ ..+|||+||.|.++|+.+.+.|+||+...... +++ ....++.+..||+||.-. ....
T Consensus 3743 ~~~cgsG~Givg~q~~~VnDgqWHsialerrr~~irlsvDd~~~~~atvPg---~~~tln~d~hiy~Ga~vrlr~~~~tq 3819 (4289)
T KOG1219|consen 3743 LADCGSGPGIVGSQKRTVNDGQWHSIALERRRNHIRLSVDDDTYDSATVPG---MKSTLNLDTHIYLGALVRLRHQRSTQ 3819 (4289)
T ss_pred EEecCCCCCcccccceEeecCceeEEEeeccCCceEEEEcccCceeeeccc---ceeeccccceEEEeeEeeeccCCCcc
Confidence 99999998877765 89999999999999999999999999887764 444 345678889999999875 2111
Q ss_pred --CCCccccceeEECCEEecCccccccc------------------ccCCCCccCccceeeccc---CcccCCCeeeeEe
Q psy17302 162 --KLTLLALPSVIFAGERVPVWSRHFQL------------------VGGMPPWYNAKLTLLALP---SVIFEPRFVGFIR 218 (244)
Q Consensus 162 --~~~~GCl~~v~ing~~l~l~~~~~~~------------------~~~~~pc~n~~~~~~~~~---~~~~~~~f~Gci~ 218 (244)
..+.|||+.+.+||..+++..+...+ .|..+||||+|.|...-. .|.|.+.|.|-.|
T Consensus 3820 vs~Gf~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~C 3899 (4289)
T KOG1219|consen 3820 VSYGFDGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHC 3899 (4289)
T ss_pred ccccccceeeeEEEccccccccCCCchhhhhhhhhcccccccccccccccCcccCCCEecCCCCCceEEeCcccccCccc
Confidence 22499999999999999998765322 788899999999984322 2346799999999
Q ss_pred eeeecCCCCCCCc
Q psy17302 219 NVIYADPHASGPR 231 (244)
Q Consensus 219 ~l~~~~~~~~~~~ 231 (244)
++ =.+.|.++|=
T Consensus 3900 Ei-~~epC~snPC 3911 (4289)
T KOG1219|consen 3900 EI-DLEPCASNPC 3911 (4289)
T ss_pred cc-ccccccCCCC
Confidence 88 4445665543
No 6
>KOG3516|consensus
Probab=99.88 E-value=7.7e-22 Score=187.55 Aligned_cols=189 Identities=19% Similarity=0.299 Sum_probs=150.3
Q ss_pred eeEecCCCceeEEecccCcccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCC-cEEEEEEcCCCeE--EEEEcCc
Q psy17302 27 SFMLDGSQDSYAQLHKWNAALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEG-ALRLRYNLGSGAQ--ILTVGHD 103 (244)
Q Consensus 27 ~~~F~gs~~syl~~~~~~~~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G-~l~~~~~~g~~~~--~~~s~~~ 103 (244)
+..|.. +.+|+.|++........|+|.|||+.++|++|.+.+ ..||+.++|..+ .+.|.++.|++.. ++++...
T Consensus 790 svSF~~-~~syL~fp~f~~~~saDIsf~FrTt~~~gvflen~g--~~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s~t~ 866 (1306)
T KOG3516|consen 790 SVSFHT-GASYLHFPPFHNELSADISFFFRTTASSGVFLENHG--INDFIRLELSSPVEVTFAFDVGNGPSQLTVRSPTE 866 (1306)
T ss_pred ceEeec-CcceeecCcccCcccccEEEEEEecCCceEeeeccC--CCceEEEEEcCCCceEEEEEcCCCceeEEEcCCcc
Confidence 345665 568999999988888999999999999999999964 679999999965 6899999999874 3556789
Q ss_pred cCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCCCccccceeEECCEEecCccc
Q psy17302 104 LNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKLTLLALPSVIFAGERVPVWSR 183 (244)
Q Consensus 104 vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~GCl~~v~ing~~l~l~~~ 183 (244)
+||++||.|.++|+.+...|+||+.+......+.. ....+.+.+++|+||... ..+.+-||||.+++||+.++|...
T Consensus 867 ~nD~qWH~V~~Ern~K~a~LqVD~~~~~~r~sp~~-~~~~L~l~s~l~vGgt~~--~~~gF~GCIRsl~LNGv~ldLe~r 943 (1306)
T KOG3516|consen 867 LNDNQWHQVRAERNSKEASLQVDGLPKSIRTSPIP-GTRLLQLYSSLFVGGTVS--RQRGFLGCIRSLQLNGVMLDLEYR 943 (1306)
T ss_pred cCCCceEEEEEEeccccceEEEcCcccceecCCCC-CEEEEEeccceecccccc--CcCcceeeeeeeeecceeeeehhh
Confidence 99999999999999999999999998876432222 234578889999999543 234568999999999999999654
Q ss_pred ccc---------cccCCCCccCccceeecccCcc---cCCCeeeeEeeee
Q psy17302 184 HFQ---------LVGGMPPWYNAKLTLLALPSVI---FEPRFVGFIRNVI 221 (244)
Q Consensus 184 ~~~---------~~~~~~pc~n~~~~~~~~~~~~---~~~~f~Gci~~l~ 221 (244)
+.. .-|.+.||+|+|.|......+. ..+.|.|-.|+..
T Consensus 944 a~~~~gv~~GC~GhCss~~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~e 993 (1306)
T KOG3516|consen 944 AYGTAGVSPGCEGHCSSYPCLNGGHCVERYDGYTCDCSRTAYDGPFCSKE 993 (1306)
T ss_pred hccCCcccCCCccccccccccCCCEEEEecCceeeccccCcCCCCccccc
Confidence 332 1677889999999986554333 3577888666553
No 7
>KOG3514|consensus
Probab=99.87 E-value=5e-22 Score=186.29 Aligned_cols=183 Identities=39% Similarity=0.670 Sum_probs=155.8
Q ss_pred HHHHHhcccccceeeEecCCCceeEEecccCcccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCC
Q psy17302 14 TMLWVLLLLPLGHSFMLDGSQDSYAQLHKWNAALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGS 93 (244)
Q Consensus 14 ~~~~~~~~c~~~~~~~F~gs~~syl~~~~~~~~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~ 93 (244)
+|+++.++.+-...+.|+|++++|..|+.|....+..+++.|||++++|+|+|..+.+..+|+.|.|++|+|+++|++|+
T Consensus 12 ll~l~~s~~~l~dg~~l~Ga~~s~ary~kW~~~~~g~ls~e~kt~q~~glllytDdGgt~df~eL~lveG~lrLrf~Lg~ 91 (1591)
T KOG3514|consen 12 LLVLTVSLFSLVDGIILTGAPDSYARYPKWAHSFEGSLSMELKTRQSDGLLLYTDDGGTHDFYELTLVEGHLRLRFRLGN 91 (1591)
T ss_pred HHHHHHHHHhhccceEecCCCcchhhchhhhcccCceeeeeeeccCCCcEEEEecCCCceeeeEEEEecceEEEEEEecC
Confidence 44566677778889999999999999999988888999999999999999999988778899999999999999999998
Q ss_pred CeEEEEEcCccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCCCccccceeEE
Q psy17302 94 GAQILTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKLTLLALPSVIF 173 (244)
Q Consensus 94 ~~~~~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~GCl~~v~i 173 (244)
+...+....+++|++||+|.|.|+-++..|.||+.......... .+...+....+||||+|....
T Consensus 92 ~~~~~q~~~~i~D~~WH~v~i~r~~e~t~L~vDgv~~~~~~~~~--~f~fg~iasdvfVGGlP~~~~------------- 156 (1591)
T KOG3514|consen 92 SNEFGQRRVRIDDDKWHTVTIFRSWENTKLEVDGVLVFKILNQR--SFVFGNIASDVFVGGLPNMHM------------- 156 (1591)
T ss_pred CCceeeecceecCCceeEEEEEeccccceEEechhhhhhhhhcc--eeeeeeeehheeecCCChHHh-------------
Confidence 77888889999999999999999999999999996554332223 233344556899999996543
Q ss_pred CCEEecCcccccccccCCCCccCccceeecccCcccCCCeeeeEeeeeecCCCCCCCcccccc
Q psy17302 174 AGERVPVWSRHFQLVGGMPPWYNAKLTLLALPSVIFEPRFVGFIRNVIYADPHASGPRRQLAH 236 (244)
Q Consensus 174 ng~~l~l~~~~~~~~~~~~pc~n~~~~~~~~~~~~~~~~f~Gci~~l~~~~~~~~~~~~~~~~ 236 (244)
+..+++|...+.+.|.|-++|+.+...+....-+|..+
T Consensus 157 -------------------------la~l~lp~v~yep~frg~~rnl~y~~~p~g~t~~q~l~ 194 (1591)
T KOG3514|consen 157 -------------------------LAVLSLPLVRYEPRFRGNVRNLMYRQYPQGVTSPQLLE 194 (1591)
T ss_pred -------------------------hhhhcCcccccccccCccceeeeeecCCCCcCChhhhh
Confidence 33357888899999999999999999998888888777
No 8
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.87 E-value=2.2e-21 Score=146.57 Aligned_cols=121 Identities=31% Similarity=0.565 Sum_probs=99.4
Q ss_pred EEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCe-EEEEEcCccCCCCeEEEEEEEECcEEEEEEcCceeeee
Q psy17302 55 FKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGA-QILTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRT 133 (244)
Q Consensus 55 frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~-~~~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~ 133 (244)
|||++++|+|||.++....+|+.|+|.+|++.+.++.|+.. ........++||+||+|.+.+.++.++|.||+......
T Consensus 1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~~~~~~ 80 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDGQSVSSE 80 (128)
T ss_dssp EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETTSEEEEE
T ss_pred CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecCccceEE
Confidence 89999999999998655568999999999999999999544 45567889999999999999999999999999988875
Q ss_pred cCCCceeeeecccCCceEEeccCCCCCC------CCCccccceeEECCE
Q psy17302 134 SRGKEFRFGHVTSNSWVYIGGMPPWYNA------KLTLLALPSVIFAGE 176 (244)
Q Consensus 134 ~~~~~~~~~~l~~~~~lyiGG~p~~~~~------~~~~GCl~~v~ing~ 176 (244)
....... ..++....+||||.|..... ..+.|||+++++||+
T Consensus 81 ~~~~~~~-~~~~~~~~l~iGg~~~~~~~~~~~~~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 81 SLPSSSS-DSLDPDGSLYIGGLPESNQPSGSVDTPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp ESSSTTH-HCBESEEEEEESSTTTTCTCTTSSTTSB-EEEEEEEEETTE
T ss_pred eccccce-ecccCCCCEEEecccCccccccccCCCCcEEEcCeEEECCC
Confidence 4432111 13566778999999987543 335999999999996
No 9
>KOG4289|consensus
Probab=99.85 E-value=5.9e-21 Score=182.81 Aligned_cols=184 Identities=17% Similarity=0.259 Sum_probs=143.2
Q ss_pred HHhcccccceeeEecCCCceeEEecccCcccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCeE
Q psy17302 17 WVLLLLPLGHSFMLDGSQDSYAQLHKWNAALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGAQ 96 (244)
Q Consensus 17 ~~~~~c~~~~~~~F~gs~~syl~~~~~~~~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~ 96 (244)
+.+.+|+- .+-.|.+ .||+.|+....+..++++|.|.|.+.+|+|+|.+ ....||++|++.++++++.|..|....
T Consensus 1310 ~e~prC~v-~trSFp~--~sfv~frglrqRfh~TlslsfaT~~~nGlL~ynG-nekhDFvalevVd~qvqltfS~Ges~t 1385 (2531)
T KOG4289|consen 1310 FEDPRCEV-TTRSFPP--ESFVTFRGLRQRFHFTLSLSFATIERNGLLLYNG-NEKHDFVALEVVDEQVQLTFSAGESTT 1385 (2531)
T ss_pred cCCCceEE-EeeccCc--hheEEEeccccceEEEEEEEEEEeeecceEEecC-CcccceEeeeeeeeeEEEEEecccccc
Confidence 34445532 2344664 5999999888888999999999999999999997 467899999999999999999996555
Q ss_pred EEE--EcCccCCCCeEEEEEEEECcEEEEEEcCceeeeec----CC-------CceeeeecccCCceEEeccCCCCCC--
Q psy17302 97 ILT--VGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTS----RG-------KEFRFGHVTSNSWVYIGGMPPWYNA-- 161 (244)
Q Consensus 97 ~~~--s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~----~~-------~~~~~~~l~~~~~lyiGG~p~~~~~-- 161 (244)
.+. .+..++||+||+|.++..++.+.++||+....... .+ ....-+.+++.++|++||.|.....
T Consensus 1386 ~v~p~Vp~gvsDGqWHtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~fpv~~ 1465 (2531)
T KOG4289|consen 1386 TVSPDVPGGVSDGQWHTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETFPVIE 1465 (2531)
T ss_pred eecCCCCCCcccCceeEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcchhhH
Confidence 554 46899999999999999999999999998654211 00 0012234889999999999965432
Q ss_pred CCCccccceeEECCEEecCcccc-cc----------cccCCCCccCccceeecc
Q psy17302 162 KLTLLALPSVIFAGERVPVWSRH-FQ----------LVGGMPPWYNAKLTLLAL 204 (244)
Q Consensus 162 ~~~~GCl~~v~ing~~l~l~~~~-~~----------~~~~~~pc~n~~~~~~~~ 204 (244)
+.|.|||+++.++++.+|+...- .+ =.|...+|+|++.|....
T Consensus 1466 k~FvGCmrdLsvD~~~VDma~fianngt~eGC~ark~fCdsg~C~n~g~CvnrW 1519 (2531)
T KOG4289|consen 1466 KQFVGCMRDLSVDGRDVDMATFIANNGTHEGCKARKNFCDSGQCSNGGTCVNRW 1519 (2531)
T ss_pred hHhhhhhhhcccccccccHHHHHhhcCcccCchhhhcccCCCccCCCCeeeccc
Confidence 23599999999999999986421 11 188899999999997433
No 10
>KOG3516|consensus
Probab=99.79 E-value=2.6e-18 Score=163.84 Aligned_cols=150 Identities=27% Similarity=0.475 Sum_probs=124.5
Q ss_pred eeEecCCCceeEEecccC-cccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCe-------EEE
Q psy17302 27 SFMLDGSQDSYAQLHKWN-AALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGA-------QIL 98 (244)
Q Consensus 27 ~~~F~gs~~syl~~~~~~-~~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~-------~~~ 98 (244)
...|+|....+.+|+++. ...+..|+|.|||.+++|+|||.+ ..++||+.|+|.++++++.+++|+.. ..+
T Consensus 179 vi~fdg~s~~~yr~~~~~m~s~~d~is~~Fkt~~sdGvllh~e-g~QGd~itlql~~~kl~l~ld~G~~~~~~s~~~~si 257 (1306)
T KOG3516|consen 179 VIYFDGSSSLLYRFHRKLMSSLKDVISLKFKTMQSDGVLLHGE-GQQGDYITLQLIGGKLVLILDLGNSKLPSSRTPTSI 257 (1306)
T ss_pred eeEECCccceeeeccccccccccceeEEEEEeeccceeEEEcc-cCCCCEEEEEEeCCEEEEEEecCCccCccccCccee
Confidence 367999644455555543 345789999999999999999996 46789999999999999999999532 346
Q ss_pred EEcCccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCC----CccccceeEEC
Q psy17302 99 TVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKL----TLLALPSVIFA 174 (244)
Q Consensus 99 ~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~----~~GCl~~v~in 174 (244)
+.+..++|++||.|.|+|.++.+.++||+........+ .+..++++..+++||+|....... ++|||++|++|
T Consensus 258 s~GslLdD~hWHsV~i~r~~~~vnftvD~~~~~fr~~G---e~~~Ldld~e~~~GGiP~~~~~~~~~~nF~GCienly~N 334 (1306)
T KOG3516|consen 258 SAGSLLDDQHWHSVRIERQGRQVNFTVDGVVHHFRATG---EFDALDLDTEISFGGIPNDGKSVGFEKNFTGCLENLYYN 334 (1306)
T ss_pred ecccccCCCcceEEEEEecCcEEEEEEccceEeecccC---ccceeecceEEEECCccCCCcccceeeeeeeeeeeeeec
Confidence 67889999999999999999999999999887755544 456799999999999998766544 69999999999
Q ss_pred CEEecC
Q psy17302 175 GERVPV 180 (244)
Q Consensus 175 g~~l~l 180 (244)
+..+--
T Consensus 335 ~vdiid 340 (1306)
T KOG3516|consen 335 GVDIID 340 (1306)
T ss_pred CceeEe
Confidence 998843
No 11
>KOG4289|consensus
Probab=99.65 E-value=1.1e-15 Score=147.42 Aligned_cols=211 Identities=17% Similarity=0.232 Sum_probs=150.2
Q ss_pred hHHHHHHHHhcccccc--eeeEecCCCceeEEec--ccCcccceEEEEEEEeCCCCeEEEEEcCCCCccEEEEEEeCCcE
Q psy17302 10 PCVLTMLWVLLLLPLG--HSFMLDGSQDSYAQLH--KWNAALNGSLELEFKTEQPNGLILYTDDGGTYDFFEIKLVEGAL 85 (244)
Q Consensus 10 ~~~~~~~~~~~~c~~~--~~~~F~gs~~syl~~~--~~~~~~~~~is~~frT~~~~GlLl~~~~~~~~~~l~l~L~~G~l 85 (244)
.|=||+.|.|--|... ++-+|.|+ |-+... +...+..+.++|.|||++.+|+||-....+. .-+.++|.+|.+
T Consensus 1523 ~C~CP~~fggk~c~~~m~~pq~frG~--sl~sw~~~~~~vSvPwylsl~FRTr~ad~vl~~~~~~~r-st~~lqld~g~l 1599 (2531)
T KOG4289|consen 1523 SCECPLGFGGKGCCQGMAHPQHFRGH--SLVSWEGLPSQVSVPWYLSLMFRTRRADGVLMQAEFGGR-STYNLQLDDGTL 1599 (2531)
T ss_pred eecCccccCCcchhhccCCchhcccc--ceeeecCCCcceecceEEEEEEEeeccccEEEEEEeCCC-ceEEEEEcCCEE
Confidence 5789999999888664 56778885 544444 3334567999999999999999999975443 448999999999
Q ss_pred EEEEEcCCCeEEEEEcCccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCCCc
Q psy17302 86 RLRYNLGSGAQILTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKLTL 165 (244)
Q Consensus 86 ~~~~~~g~~~~~~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~ 165 (244)
.+.+.- ..+......++||+||.+.++.... ..++.|......+... .+..+++. .||+||+|.....+.+.
T Consensus 1600 ~~~v~~---s~v~L~~~~vtdg~Wh~~~i~l~~d-~~~t~d~g~~~aea~~---gl~gl~l~-sl~vGgap~~g~p~gf~ 1671 (2531)
T KOG4289|consen 1600 KYNVGD---SSVELPAPRVTDGHWHHLVIELEAD-SVATLDYGIYQAEAKA---GLSGLNLE-SLYVGGAPATGVPRGFR 1671 (2531)
T ss_pred EEEecC---ceEEccCccccCCchhheeeeeccC-eEEEEechhhhhhhhc---CCCCceee-EEEEccccCCCccccch
Confidence 887753 2333457899999999999998864 5566665543322211 12335554 69999999777777789
Q ss_pred cccceeEECCEEecCccccc-----cc-------------------ccCCCCccCccceeecccC----cccCCCeeeeE
Q psy17302 166 LALPSVIFAGERVPVWSRHF-----QL-------------------VGGMPPWYNAKLTLLALPS----VIFEPRFVGFI 217 (244)
Q Consensus 166 GCl~~v~ing~~l~l~~~~~-----~~-------------------~~~~~pc~n~~~~~~~~~~----~~~~~~f~Gci 217 (244)
|||+.|+++|..+-...+-. .+ .|..+||+|.+.|.-+-.+ +.+.+++.|-.
T Consensus 1672 GCiqgV~v~g~~~l~~~kv~~~~GCvvpn~C~~d~sC~c~~~~C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~ 1751 (2531)
T KOG4289|consen 1672 GCIQGVRVGGVSILVPKKVNVEAGCVVPNPCSVDSSCPCDPYNCVDVCSLNPCENQGTCVRSPGAHGYTCECPPGYTGPY 1751 (2531)
T ss_pred hhhhceEECCEeeccccccccccCcccCCccccCCcccCCCCCccchhcccccccCceeecCCCCCceeEECCCcccCcc
Confidence 99999999999986653211 01 6677899999999743222 24568999988
Q ss_pred eeeeecCCCCCCCcc
Q psy17302 218 RNVIYADPHASGPRR 232 (244)
Q Consensus 218 ~~l~~~~~~~~~~~~ 232 (244)
|+.+. +.+...-+|
T Consensus 1752 Ce~~~-dq~CPrGWW 1765 (2531)
T KOG4289|consen 1752 CELRA-DQPCPRGWW 1765 (2531)
T ss_pred hhhhc-cCCCCCccc
Confidence 88866 444333333
No 12
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=99.09 E-value=1.8e-08 Score=81.42 Aligned_cols=110 Identities=17% Similarity=0.215 Sum_probs=77.2
Q ss_pred cccceeeEecCCCceeEEec-----ccCcccceEEEEEEEeC-CCCeEEEEEcCCCCccEEEEEEeCCcEEEEEEc--CC
Q psy17302 22 LPLGHSFMLDGSQDSYAQLH-----KWNAALNGSLELEFKTE-QPNGLILYTDDGGTYDFFEIKLVEGALRLRYNL--GS 93 (244)
Q Consensus 22 c~~~~~~~F~gs~~syl~~~-----~~~~~~~~~is~~frT~-~~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~--g~ 93 (244)
|....+|.|... ..+..+ +.....+++|.+.||+. ...|.||-..+.+...++.|.+..++..+.+.. ..
T Consensus 24 ~~~~~Ay~~~~~--a~~~~~t~~~~p~~~~~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~~~~~ 101 (184)
T smart00210 24 EPGSPAYRLGDP--ALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRYQGVD 101 (184)
T ss_pred CCCceeEEecCC--cccCcchHHhCcCCCCCCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEECCCC
Confidence 455567777642 222211 11234579999999997 678888888665566799999987765555543 22
Q ss_pred Ce-EEEE-EcCccCCCCeEEEEEEEECcEEEEEEcCceeeee
Q psy17302 94 GA-QILT-VGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRT 133 (244)
Q Consensus 94 ~~-~~~~-s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~ 133 (244)
+. ..+. .+..+.||+||+|.+...+..++|.||.......
T Consensus 102 g~~~~~~f~~~~l~dg~WH~lal~V~~~~v~LyvDC~~~~~~ 143 (184)
T smart00210 102 GKQHTVSFRNLPLADGQWHKLALSVSGSSATLYVDCNEIDSR 143 (184)
T ss_pred CcEEEEeecCCccccCCceEEEEEEeCCEEEEEECCccccce
Confidence 22 3333 3478999999999999999999999999887654
No 13
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.64 E-value=8.5e-07 Score=68.11 Aligned_cols=145 Identities=17% Similarity=0.254 Sum_probs=87.3
Q ss_pred eeEecCCCceeEEecccC-cccceEEEEEEEeCCCCe--EEEEEcCCCCccEEEEEEe-CCcEEEEEEcCCC-eEEEEEc
Q psy17302 27 SFMLDGSQDSYAQLHKWN-AALNGSLELEFKTEQPNG--LILYTDDGGTYDFFEIKLV-EGALRLRYNLGSG-AQILTVG 101 (244)
Q Consensus 27 ~~~F~gs~~syl~~~~~~-~~~~~~is~~frT~~~~G--lLl~~~~~~~~~~l~l~L~-~G~l~~~~~~g~~-~~~~~s~ 101 (244)
++.|+|+ ++|+.++... ....++|++.||...... ..+.. .....+.+.+.+. +|.+.+.+..+.+ ...+...
T Consensus 2 a~~f~g~-~~~i~~~~~~~~~~~fTi~~w~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~ 79 (157)
T PF13385_consen 2 ALYFDGS-NDYISIPNSDFPSGSFTISFWVKPDSPSSSQSFVFM-DSSGSGGFGLFINNNGRLRFYIGNGGGGNYSFSSD 79 (157)
T ss_dssp EEEE-ST-T-EEEEESGGGGGTEEEEEEEEEESS--SSEEEEEE-SSSSSEEEEEEEETTSEEEEEETTSEEESS-EE-B
T ss_pred EEEECCC-CCEEEECCcCCCCCCEEEEEEEEeCCCCCCceEEEE-ecCCCCEEEEEEECCCEEEEEEeCCCceeEEEecC
Confidence 5678874 7899998532 246799999999976433 23332 2223346777776 4666666554432 2345567
Q ss_pred CccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCCCccccceeEECCEEe
Q psy17302 102 HDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKLTLLALPSVIFAGERV 178 (244)
Q Consensus 102 ~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~GCl~~v~ing~~l 178 (244)
..+.+++||+|.+..++..+.|.||+........... ........++||+.... ...+.|-|.+++|=.+.+
T Consensus 80 ~~~~~~~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~---~~~~~~~~~~iG~~~~~--~~~~~g~i~~~~i~~~aL 151 (157)
T PF13385_consen 80 SNLPDNKWHHLALTYDGSTVTLYVNGELVGSSTIPSN---ISLNSNGPLFIGGSGGG--SSPFNGYIDDLRIYNRAL 151 (157)
T ss_dssp S---TT-EEEEEEEEETTEEEEEETTEEETTCTEESS---SSTTSCCEEEESS-STT----B-EEEEEEEEEESS--
T ss_pred cccCCCCEEEEEEEEECCeEEEEECCEEEEeEeccCC---cCCCCcceEEEeecCCC--CCceEEEEEEEEEECccC
Confidence 8899999999999999999999999988765322111 11345678999998722 334579999988766554
No 14
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.34 E-value=6.9e-05 Score=61.32 Aligned_cols=151 Identities=18% Similarity=0.170 Sum_probs=94.3
Q ss_pred cceeeEecCCC-ceeEEecccC--cccceEEEEEEEeCC--CCeEEE-EEcCCCCccEEEEEEeCCcEEEEEEcCCCeEE
Q psy17302 24 LGHSFMLDGSQ-DSYAQLHKWN--AALNGSLELEFKTEQ--PNGLIL-YTDDGGTYDFFEIKLVEGALRLRYNLGSGAQI 97 (244)
Q Consensus 24 ~~~~~~F~gs~-~syl~~~~~~--~~~~~~is~~frT~~--~~GlLl-~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~~ 97 (244)
.+..+.|.... ..|+.++... ....+++.+.+|+.. ..+.|| |........++...-.+|++.+.++ +....
T Consensus 5 ~~~~l~f~~~s~~~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i~--~~~~~ 82 (201)
T cd00152 5 SGKVFVFPKESDTSYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYIG--GKEVT 82 (201)
T ss_pred CCCEEECCCCCCCceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEEc--CEEEE
Confidence 35556666633 4899996543 345799999999875 444455 4433233344433334567666553 22222
Q ss_pred EEEcCccCCCCeEEEEEEEE--CcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCC-----CCCCCccccce
Q psy17302 98 LTVGHDLNDGHWHSVRVKRR--VEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWY-----NAKLTLLALPS 170 (244)
Q Consensus 98 ~~s~~~vndg~WH~V~v~~~--~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~-----~~~~~~GCl~~ 170 (244)
+ ....++|+||+|.+.++ ...+.|.||+........... ..+...+.+.||...+.. ....+.|.|.+
T Consensus 83 ~--~~~~~~g~W~hv~~t~d~~~g~~~lyvnG~~~~~~~~~~~---~~~~~~g~l~lG~~q~~~gg~~~~~~~f~G~I~~ 157 (201)
T cd00152 83 F--KVPESDGAWHHICVTWESTSGIAELWVNGKLSVRKSLKKG---YTVGPGGSIILGQEQDSYGGGFDATQSFVGEISD 157 (201)
T ss_pred E--eccCCCCCEEEEEEEEECCCCcEEEEECCEEeccccccCC---CEECCCCeEEEeecccCCCCCCCCCcceEEEEce
Confidence 2 33459999999999998 456899999987764432211 124556778888654331 12345899999
Q ss_pred eEECCEEecCc
Q psy17302 171 VIFAGERVPVW 181 (244)
Q Consensus 171 v~ing~~l~l~ 181 (244)
++|-++.+.-.
T Consensus 158 v~iw~~~Ls~~ 168 (201)
T cd00152 158 VNMWDSVLSPE 168 (201)
T ss_pred eEEEcccCCHH
Confidence 99988877654
No 15
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.33 E-value=7.6e-05 Score=61.30 Aligned_cols=152 Identities=16% Similarity=0.186 Sum_probs=93.3
Q ss_pred cceeeEecCC-CceeEEecccC--cccceEEEEEEEeCC--CCeEEE-EEcCCCCccEEEEEEeCCcEEEEEEcCCCeEE
Q psy17302 24 LGHSFMLDGS-QDSYAQLHKWN--AALNGSLELEFKTEQ--PNGLIL-YTDDGGTYDFFEIKLVEGALRLRYNLGSGAQI 97 (244)
Q Consensus 24 ~~~~~~F~gs-~~syl~~~~~~--~~~~~~is~~frT~~--~~GlLl-~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~~ 97 (244)
....+.|... ...|+.+.... .-..+++.+.+|+.. .++.|| |..+.....++...-.++.+.+.+ ++. .
T Consensus 5 ~~~~~~fp~~s~~~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~~ne~~~~~~~~~~~~l~i--~g~--~ 80 (206)
T smart00159 5 TGKVFVFPKESDTSYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQDNELLLYKEKQGEYSLYI--GGK--K 80 (206)
T ss_pred CCcEEECCCCCCCCeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCCCCCeEEEEEcCCcEEEEEE--cCe--E
Confidence 3445666653 34899986543 335799999999975 344444 443333334433222345555544 322 3
Q ss_pred EEEcCccCCCCeEEEEEEEEC--cEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCC-----CCCCccccce
Q psy17302 98 LTVGHDLNDGHWHSVRVKRRV--EKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYN-----AKLTLLALPS 170 (244)
Q Consensus 98 ~~s~~~vndg~WH~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~-----~~~~~GCl~~ 170 (244)
+..+..+.||+||+|.+.++. ..+.|.||+........... ..+...+.+.||...+... .+.+.|.|.+
T Consensus 81 ~~~~~~~~~g~W~hvc~tw~~~~g~~~lyvnG~~~~~~~~~~g---~~i~~~G~lvlGq~qd~~gg~f~~~~~f~G~i~~ 157 (206)
T smart00159 81 VQFPVPESDGKWHHICTTWESSSGIAELWVDGKPGVRKGLAKG---YTVKPGGSIILGQEQDSYGGGFDATQSFVGEIGD 157 (206)
T ss_pred EEecccccCCceEEEEEEEECCCCcEEEEECCEEcccccccCC---cEECCCCEEEEEecccCCCCCCCCCcceeEEEee
Confidence 334567899999999999984 56899999987632221111 1245567788887544321 2335899999
Q ss_pred eEECCEEecCcc
Q psy17302 171 VIFAGERVPVWS 182 (244)
Q Consensus 171 v~ing~~l~l~~ 182 (244)
++|-++.+.-..
T Consensus 158 v~iw~~~Ls~~e 169 (206)
T smart00159 158 LNMWDSVLSPEE 169 (206)
T ss_pred eEEecccCCHHH
Confidence 999998886543
No 16
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=98.03 E-value=0.00039 Score=56.61 Aligned_cols=124 Identities=17% Similarity=0.250 Sum_probs=70.0
Q ss_pred eEecCC-CceeEEecccC--cccceEEEEEEEeCCC--CeEEEEEc-CCCCccEEEEEEeCCcEEEEEEcCCCeEEEEEc
Q psy17302 28 FMLDGS-QDSYAQLHKWN--AALNGSLELEFKTEQP--NGLILYTD-DGGTYDFFEIKLVEGALRLRYNLGSGAQILTVG 101 (244)
Q Consensus 28 ~~F~gs-~~syl~~~~~~--~~~~~~is~~frT~~~--~GlLl~~~-~~~~~~~l~l~L~~G~l~~~~~~g~~~~~~~s~ 101 (244)
+.|... ...|+++.+.. +-..+++.+.+||... .+.||.-. .....+++.+.-..+.+.+.+ ++... ...
T Consensus 3 ~~FP~~s~~~yv~l~~~~~~pL~~fTvC~w~k~~~~~~~~tifSYat~~~~nell~~~~~~~~~~l~i--~~~~~--~~~ 78 (195)
T PF00354_consen 3 FHFPTRSTTDYVRLKPSVPLPLSAFTVCFWVKTDDSSNDGTIFSYATSSQDNELLLFGSSSGSLRLYI--NGSSV--SFS 78 (195)
T ss_dssp EEE-S-BSSBEEEEEESS-S-BSEEEEEEEEEESGSGS-EEEEEEEETTEEEEEEEEEETTTEEEEEE--TTEEE--EEE
T ss_pred EECCCCCCcceEEEecCCCCCcccEEEEEEEEeccCCCceEEEEEccCCCCccEEEEEeCCceEEEEE--CCeEe--Eec
Confidence 445442 24788887532 3457999999999875 66666554 322334444333345554444 33322 234
Q ss_pred CccCCCCeEEEEEEEEC--cEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCC
Q psy17302 102 HDLNDGHWHSVRVKRRV--EKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPW 158 (244)
Q Consensus 102 ~~vndg~WH~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~ 158 (244)
..+.|++||++.+.++. ..+.|.+||........... ..+..++.+.||--.+.
T Consensus 79 ~~~~~~~Whh~C~tW~s~~G~~~ly~dG~~~~~~~~~~g---~~i~~gG~~vlGQeQd~ 134 (195)
T PF00354_consen 79 GPIRDGQWHHICVTWDSSTGRWQLYVDGVRLSSTGLATG---HSIPGGGTLVLGQEQDS 134 (195)
T ss_dssp ECS-TSS-EEEEEEEETTTTEEEEEETTEEEEEEESSTT-----B-SSEEEEESS-BSB
T ss_pred cccCCCCcEEEEEEEecCCcEEEEEECCEecccccccCC---ceECCCCEEEECccccc
Confidence 56799999999999985 67899999995443322211 22455667777765543
No 17
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.85 E-value=0.00087 Score=54.03 Aligned_cols=111 Identities=15% Similarity=0.214 Sum_probs=72.6
Q ss_pred cceEEEEEEEeCCCCeE--EEEEcC-CCCccEEEEEEeCCcEEEEEEcCCCeE--EEEEc-----CccCCCCeEEEEEEE
Q psy17302 47 LNGSLELEFKTEQPNGL--ILYTDD-GGTYDFFEIKLVEGALRLRYNLGSGAQ--ILTVG-----HDLNDGHWHSVRVKR 116 (244)
Q Consensus 47 ~~~~is~~frT~~~~Gl--Ll~~~~-~~~~~~l~l~L~~G~l~~~~~~g~~~~--~~~s~-----~~vndg~WH~V~v~~ 116 (244)
...+|.++|++...+++ ||-.++ .....|+.+.+.++.+-+.++-..+.. ..... ...++-.||.|.+.-
T Consensus 33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~a 112 (190)
T PF02973_consen 33 EEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFVA 112 (190)
T ss_dssp SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEEE
T ss_pred cccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEEE
Confidence 46899999999877775 677764 345689999999998888888766532 22233 234566799999999
Q ss_pred E--CcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCC
Q psy17302 117 R--VEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPP 157 (244)
Q Consensus 117 ~--~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~ 157 (244)
+ .....|.+||............-+..+.--..++|||...
T Consensus 113 d~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R 155 (190)
T PF02973_consen 113 DSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNR 155 (190)
T ss_dssp ETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEE
T ss_pred ecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEe
Confidence 8 8889999999665543211111122233335789998743
No 18
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=97.01 E-value=0.05 Score=41.21 Aligned_cols=66 Identities=17% Similarity=0.078 Sum_probs=45.9
Q ss_pred CCeEEEEEEEEC--cEEEEEEcCceeeeecCCCceeeeecccCCceEEec-cCCC-CCCCCCccccceeEECCEEe
Q psy17302 107 GHWHSVRVKRRV--EKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGG-MPPW-YNAKLTLLALPSVIFAGERV 178 (244)
Q Consensus 107 g~WH~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG-~p~~-~~~~~~~GCl~~v~ing~~l 178 (244)
|+||+|.+..++ ..++|.||+......... ......++.||. .... .....+.|.|.+++|-++.+
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~------~~~~~~~~~iG~~~~~~~~~~~~f~G~Idevriy~~aL 130 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQ------PSPSSGNLPQGGRILLGGAGGENFSGRLDEVRVYNRAL 130 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEccccccC------CcccCCceEEeeeccCCCCCCCCceEEeeEEEEecccc
Confidence 999999999998 789999999876533211 123456788883 2221 22234589999999877755
No 19
>KOG3509|consensus
Probab=95.75 E-value=0.0098 Score=58.52 Aligned_cols=156 Identities=15% Similarity=0.116 Sum_probs=98.7
Q ss_pred CCeEEEEEcCCCCccEEEEEEeCCcEEEEEEcCCCeEEEEEcCccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCce
Q psy17302 60 PNGLILYTDDGGTYDFFEIKLVEGALRLRYNLGSGAQILTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEF 139 (244)
Q Consensus 60 ~~GlLl~~~~~~~~~~l~l~L~~G~l~~~~~~g~~~~~~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~ 139 (244)
.+.+..+.......+|+++.+..|.+.+++..+.+...+......-.++||.+.+.| .-.+.+++.......... .
T Consensus 267 ~~~~~~~~~~~~~~~f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r---~s~~~~~g~~~~l~g~~~-~ 342 (964)
T KOG3509|consen 267 RDILGNFLFSSFKDGFRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFR---GSGLSVSGHKGVLQGNSN-I 342 (964)
T ss_pred ccccccccccccccceeeeccCCCCccccccccCcchhhhhccccccceeeeeEeee---cccccccCcceeeccccc-c
Confidence 344555555556778999999999899999888877888888999999999999999 333455553333221111 0
Q ss_pred eeeecccCCceEEeccCCCCCCC-------CCccccceeEECCEEecCccccc-cc---------ccCCCCccCccceee
Q psy17302 140 RFGHVTSNSWVYIGGMPPWYNAK-------LTLLALPSVIFAGERVPVWSRHF-QL---------VGGMPPWYNAKLTLL 202 (244)
Q Consensus 140 ~~~~l~~~~~lyiGG~p~~~~~~-------~~~GCl~~v~ing~~l~l~~~~~-~~---------~~~~~pc~n~~~~~~ 202 (244)
....+.....+|+||+-+-...+ .+.||++++.+|++.+....... .+ .|..-||++.+.|..
T Consensus 343 ~~~~i~~ee~v~lg~i~ni~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~~~c~g~~c~~~p~~~~g~c~p 422 (964)
T KOG3509|consen 343 LVSRITNEESVFLGGIINIETLQHNLPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQGTCLGDVCWRIPCQHDGPCLQ 422 (964)
T ss_pred cccceeecccccCCceeeeccccccCCCccCccceehhhhhhccccccccccccccccccccCCCccccccCCCCccccc
Confidence 12234555667777743322111 13899999999999998764221 11 333457888877764
Q ss_pred cccCc--ccCCCeeeeEee
Q psy17302 203 ALPSV--IFEPRFVGFIRN 219 (244)
Q Consensus 203 ~~~~~--~~~~~f~Gci~~ 219 (244)
..-.. .+.++|.|--|.
T Consensus 423 ~~~~~~c~c~~g~~G~~c~ 441 (964)
T KOG3509|consen 423 TLEGKQCLCPPGYTGDSCE 441 (964)
T ss_pred cccccceeccccccCchhh
Confidence 43322 235667664443
No 20
>KOG1834|consensus
Probab=95.31 E-value=0.36 Score=45.61 Aligned_cols=102 Identities=17% Similarity=0.172 Sum_probs=72.2
Q ss_pred eeEecCCCceeEEeccc----CcccceEEEEEEEeCCC-------CeEEEEEcCC--CCccEEEEEEeCCcEEEEEEcCC
Q psy17302 27 SFMLDGSQDSYAQLHKW----NAALNGSLELEFKTEQP-------NGLILYTDDG--GTYDFFEIKLVEGALRLRYNLGS 93 (244)
Q Consensus 27 ~~~F~gs~~syl~~~~~----~~~~~~~is~~frT~~~-------~GlLl~~~~~--~~~~~l~l~L~~G~l~~~~~~g~ 93 (244)
-|.|+|. --+..|.- .....++|+|.+|--.. .-.|+...+. ...++++|.+.+=+|.|.+.-..
T Consensus 344 i~eFdG~--qgv~vpdg~~~g~l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~ 421 (952)
T KOG1834|consen 344 IFEFDGT--QGVTVPDGNVSGSLPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDA 421 (952)
T ss_pred EEEEcCc--eeeEccCCCCCCCCCCceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCc
Confidence 4889985 33445422 12346889998886432 2366776532 23568899999999999987754
Q ss_pred CeE------EE-EEcCccCCCCeEEEEEEEECcEEEEEEcCcee
Q psy17302 94 GAQ------IL-TVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQ 130 (244)
Q Consensus 94 ~~~------~~-~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~ 130 (244)
+.. .. .+-..++|++||.-.+..+.-.++|.|||..-
T Consensus 422 ~~~~~fRpaef~Wkl~qVCD~EWH~Y~ln~efp~VtlyvDG~Sf 465 (952)
T KOG1834|consen 422 GATSDFRPAEFHWKLPQVCDNEWHHYVLNVEFPDVTLYVDGKSF 465 (952)
T ss_pred cccccccchheeccchhhhhhhhheeEEeecCceEEEEEcCccc
Confidence 332 22 35679999999999999999999999999764
No 21
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=95.02 E-value=0.041 Score=43.73 Aligned_cols=97 Identities=15% Similarity=0.243 Sum_probs=53.4
Q ss_pred eeEEecccCcccceEEEEEEEe-CC-CCeEEEEEcC----CCCccEEEEEEeCCcEEEEEEcCCCeEE--------EEEc
Q psy17302 36 SYAQLHKWNAALNGSLELEFKT-EQ-PNGLILYTDD----GGTYDFFEIKLVEGALRLRYNLGSGAQI--------LTVG 101 (244)
Q Consensus 36 syl~~~~~~~~~~~~is~~frT-~~-~~GlLl~~~~----~~~~~~l~l~L~~G~l~~~~~~g~~~~~--------~~s~ 101 (244)
.++.... ...+++++++||. .. ..|++|...+ ......+.++|.++.-........+... ....
T Consensus 44 ~~l~~~~--~~~df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~ 121 (185)
T PF06439_consen 44 GYLYTDK--KFSDFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVN 121 (185)
T ss_dssp --EEESS--EBSSEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS-
T ss_pred ceEEECC--ccccEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEecccccccccc
Confidence 4555542 3356788888884 33 3455555540 1223456666665432200011111111 1235
Q ss_pred CccCCCCeEEEEEEEECcEEEEEEcCceeeeec
Q psy17302 102 HDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTS 134 (244)
Q Consensus 102 ~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~ 134 (244)
.....++||+++|...++.+++.||+..+....
T Consensus 122 ~~~~~~~W~~~~I~~~g~~i~v~vnG~~v~~~~ 154 (185)
T PF06439_consen 122 VAIPPGEWNTVRIVVKGNRITVWVNGKPVADFT 154 (185)
T ss_dssp -S--TTSEEEEEEEEETTEEEEEETTEEEEEEE
T ss_pred ccCCCCceEEEEEEEECCEEEEEECCEEEEEEE
Confidence 678899999999999999999999999887644
No 22
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=91.31 E-value=1.2 Score=36.60 Aligned_cols=61 Identities=13% Similarity=0.150 Sum_probs=44.5
Q ss_pred CccEEEEEEeCCcEEEEEEcCC-C----eEEEEEcCccCCCCeEEEEEEEE-----CcEEEEEEcCceeee
Q psy17302 72 TYDFFEIKLVEGALRLRYNLGS-G----AQILTVGHDLNDGHWHSVRVKRR-----VEKTSLTVDNITQTR 132 (244)
Q Consensus 72 ~~~~l~l~L~~G~l~~~~~~g~-~----~~~~~s~~~vndg~WH~V~v~~~-----~~~~~L~VD~~~~~~ 132 (244)
....++|.+.+|++.+.+.... . .........+.-|+||++.+... ...+.+.+|++....
T Consensus 112 ~~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~ 182 (224)
T PF14099_consen 112 GSPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVD 182 (224)
T ss_dssp EEECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECCE
T ss_pred CCCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEEE
Confidence 4668999999999999988766 1 12233456788899999887774 456889999977654
No 23
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=89.38 E-value=0.24 Score=27.79 Aligned_cols=27 Identities=15% Similarity=0.135 Sum_probs=18.3
Q ss_pred cCCCCccCccceeecc-cC--cccCCCeee
Q psy17302 189 GGMPPWYNAKLTLLAL-PS--VIFEPRFVG 215 (244)
Q Consensus 189 ~~~~pc~n~~~~~~~~-~~--~~~~~~f~G 215 (244)
|..+||+|++.|.... .. ..+.++|.|
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 5678999999998544 22 233566766
No 24
>smart00282 LamG Laminin G domain.
Probab=81.10 E-value=1.3 Score=33.02 Aligned_cols=16 Identities=38% Similarity=0.590 Sum_probs=13.8
Q ss_pred cCCCeeeeEeeeeecC
Q psy17302 209 FEPRFVGFIRNVIYAD 224 (244)
Q Consensus 209 ~~~~f~Gci~~l~~~~ 224 (244)
...+|.|||+++++++
T Consensus 119 ~~~~F~GCi~~v~in~ 134 (135)
T smart00282 119 VTPGFRGCIRNLKVNG 134 (135)
T ss_pred cCCCCeeEeeEEEECC
Confidence 4689999999998875
No 25
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=80.20 E-value=43 Score=32.44 Aligned_cols=86 Identities=14% Similarity=0.279 Sum_probs=50.8
Q ss_pred cceEEEEEEEeCCC-CeEEEEEc--CC-------CCccEEEEEEeCCcEEEEEEcCCCeEEEEEcCccCCCCeEEEEEEE
Q psy17302 47 LNGSLELEFKTEQP-NGLILYTD--DG-------GTYDFFEIKLVEGALRLRYNLGSGAQILTVGHDLNDGHWHSVRVKR 116 (244)
Q Consensus 47 ~~~~is~~frT~~~-~GlLl~~~--~~-------~~~~~l~l~L~~G~l~~~~~~g~~~~~~~s~~~vndg~WH~V~v~~ 116 (244)
.+..++.++....+ .|-++.++ +. ..+-|+.| ..+|.-.+.=++.+..........+..++||++.+..
T Consensus 542 ~NytVs~DV~ie~~~~ggv~lagRv~~~g~~~~~~~G~~f~v-~~~G~w~vt~d~~~~~~l~~G~~~~~~~~WhtltL~~ 620 (669)
T PF02057_consen 542 SNYTVSCDVYIETPDTGGVFLAGRVNKGGCDVRSARGYFFWV-YANGTWSVTSDLAGTTTLASGTADIGAGKWHTLTLTI 620 (669)
T ss_dssp -EEEEEEEEEE-STTT-EEEEEEEE---GGGGGG-EEEEEEE-ETTTEEEEEEETTS-SEEEEEE-S--TT-EEEEEEEE
T ss_pred eEEEEEEEEEeccCCcCcEEEEEeecccccccCCCCeEEEEE-EcCCcEEEeccCCCcEEEeeeeecccCCeEEEEEEEE
Confidence 35667766666542 33333332 11 12335655 6788888876666533333334578889999999999
Q ss_pred ECcEEEEEEcCceeeee
Q psy17302 117 RVEKTSLTVDNITQTRT 133 (244)
Q Consensus 117 ~~~~~~L~VD~~~~~~~ 133 (244)
.+..++-.+|+......
T Consensus 621 ~g~~~ta~lng~~l~~~ 637 (669)
T PF02057_consen 621 SGSTATAMLNGTVLWTD 637 (669)
T ss_dssp ETTEEEEEETTEEEEEE
T ss_pred ECCEEEEEECCEEeEEe
Confidence 99999999999887653
No 26
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=79.49 E-value=1.5 Score=31.87 Aligned_cols=15 Identities=40% Similarity=0.705 Sum_probs=12.4
Q ss_pred CCCeeeeEeeeeecC
Q psy17302 210 EPRFVGFIRNVIYAD 224 (244)
Q Consensus 210 ~~~f~Gci~~l~~~~ 224 (244)
..+|.|||+++++++
T Consensus 113 ~~~f~Gci~~l~vng 127 (128)
T PF02210_consen 113 TPGFVGCIRDLRVNG 127 (128)
T ss_dssp TSB-EEEEEEEEETT
T ss_pred CCCcEEEcCeEEECC
Confidence 789999999998875
No 27
>KOG3546|consensus
Probab=79.22 E-value=15 Score=35.16 Aligned_cols=125 Identities=13% Similarity=0.135 Sum_probs=71.6
Q ss_pred ceEEEEEEEeCCC-CeEEEEEcCCC-CccEEEEEE---eCCcEEEEEEc---CCCe-EEEE-EcCccCCCCeEEEEEEEE
Q psy17302 48 NGSLELEFKTEQP-NGLILYTDDGG-TYDFFEIKL---VEGALRLRYNL---GSGA-QILT-VGHDLNDGHWHSVRVKRR 117 (244)
Q Consensus 48 ~~~is~~frT~~~-~GlLl~~~~~~-~~~~l~l~L---~~G~l~~~~~~---g~~~-~~~~-s~~~vndg~WH~V~v~~~ 117 (244)
.+.|.|.+|.... -|+||-+.+.. .--|+-|.| +||+-.+.+.+ |... ++.. -..++-.++|.++.+...
T Consensus 88 df~~~~~i~p~s~~~gvlfaitd~~q~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a~~v~ 167 (1167)
T KOG3546|consen 88 DFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVA 167 (1167)
T ss_pred cceEEEEeeccCCCCceEEEechhhhhhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhheeeeec
Confidence 4778888888665 45666654322 223555544 36765544332 2222 1222 256778899999999999
Q ss_pred CcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCCCCccccceeEE
Q psy17302 118 VEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAKLTLLALPSVIF 173 (244)
Q Consensus 118 ~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~GCl~~v~i 173 (244)
+..++|.||=+.....-...+..--.+....-||++-+-... ..+++|.|+.+++
T Consensus 168 g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~~~-~~~f~g~~~~l~v 222 (1167)
T KOG3546|consen 168 GGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGAD-PDKFQGVIAELKV 222 (1167)
T ss_pred CceEEEEechHHhcccchhccccceeecCCcceEEeccCCCC-hHhhhhhhhheee
Confidence 999999999765432211110011124556778887755432 2334566665543
No 28
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=77.59 E-value=1.9 Score=32.32 Aligned_cols=19 Identities=32% Similarity=0.461 Sum_probs=14.4
Q ss_pred ccCCCeeeeEeeeeecCCC
Q psy17302 208 IFEPRFVGFIRNVIYADPH 226 (244)
Q Consensus 208 ~~~~~f~Gci~~l~~~~~~ 226 (244)
...++|.|||+++.+++..
T Consensus 111 ~~~~~f~GCi~~~~in~~~ 129 (131)
T PF00054_consen 111 PISPGFKGCIRNLSINGKP 129 (131)
T ss_dssp SCCSB-EEEEEEEEETTEE
T ss_pred ccCCCeeEEEEEeEECCEE
Confidence 3468999999999888753
No 29
>KOG1836|consensus
Probab=75.84 E-value=2.5 Score=44.96 Aligned_cols=57 Identities=19% Similarity=0.099 Sum_probs=43.1
Q ss_pred EEEcCccCCCCeEEEEEEEECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCC
Q psy17302 98 LTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYN 160 (244)
Q Consensus 98 ~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~ 160 (244)
+......-++.||.+...+....+.+..|. ......+ ......+.++++||.|....
T Consensus 1608 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~-----~~~~~~~~p~~~~~~~~s~~ 1664 (1705)
T KOG1836|consen 1608 STPIVSLLPGGCHSVTSSTDPGVVQLEDDT-YTVGEIP-----PPPADTQEPIKLGGYPSSLT 1664 (1705)
T ss_pred chhhhhhcCCcceeeeeecCCccccccccc-eecccCC-----CCchhccCCcccCCcccccc
Confidence 444678889999999999999999999888 2222222 23367789999999998654
No 30
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=74.85 E-value=2.2 Score=32.16 Aligned_cols=15 Identities=40% Similarity=0.718 Sum_probs=12.8
Q ss_pred cCCCeeeeEeeeeec
Q psy17302 209 FEPRFVGFIRNVIYA 223 (244)
Q Consensus 209 ~~~~f~Gci~~l~~~ 223 (244)
...+|.|||+++++|
T Consensus 137 ~~~~F~Gci~~v~in 151 (151)
T cd00110 137 VSPGFVGCIRDLKVN 151 (151)
T ss_pred ccCCCceEeeEeEeC
Confidence 468999999999775
No 31
>PF07622 DUF1583: Protein of unknown function (DUF1583); InterPro: IPR011475 Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM.
Probab=74.30 E-value=7.7 Score=34.77 Aligned_cols=33 Identities=24% Similarity=0.384 Sum_probs=29.9
Q ss_pred cCccCCCCeEEEEEEEECcEEEEEEcCceeeee
Q psy17302 101 GHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRT 133 (244)
Q Consensus 101 ~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~ 133 (244)
..++++..|.+|++++.+.++.|.+|++.....
T Consensus 84 ~~~l~~~~wN~v~l~~~g~~v~l~LN~~~i~~~ 116 (399)
T PF07622_consen 84 TLPLKVNAWNRVRLQRRGDKVQLHLNGQLIYER 116 (399)
T ss_pred CCCCCccccceEEEEEeCCEEEEEeCCceeEec
Confidence 568899999999999999999999999987753
No 32
>KOG1219|consensus
Probab=62.94 E-value=39 Score=37.76 Aligned_cols=42 Identities=5% Similarity=-0.107 Sum_probs=32.5
Q ss_pred ccCCCCccCccceeeccc--CcccCCCeeeeEeeeeecCCCCCC
Q psy17302 188 VGGMPPWYNAKLTLLALP--SVIFEPRFVGFIRNVIYADPHASG 229 (244)
Q Consensus 188 ~~~~~pc~n~~~~~~~~~--~~~~~~~f~Gci~~l~~~~~~~~~ 229 (244)
.|..+||.++|.|-.... .|.|+.+|+|-.|+++=.+.|+++
T Consensus 3905 pC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n 3948 (4289)
T KOG1219|consen 3905 PCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKN 3948 (4289)
T ss_pred cccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccc
Confidence 689999999999986433 346789999999999645556654
No 33
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=55.25 E-value=1.1e+02 Score=24.83 Aligned_cols=24 Identities=25% Similarity=0.399 Sum_probs=20.7
Q ss_pred CCeEEEEEEEE--CcEEEEEEcCcee
Q psy17302 107 GHWHSVRVKRR--VEKTSLTVDNITQ 130 (244)
Q Consensus 107 g~WH~V~v~~~--~~~~~L~VD~~~~ 130 (244)
|+||+|.|..+ .+.+.+.+++...
T Consensus 154 g~~~~v~I~Y~~~~~~L~v~l~~~~~ 179 (223)
T cd01951 154 GNEHTVRITYDPTTNTLTVYLDNGST 179 (223)
T ss_pred CCEEEEEEEEeCCCCEEEEEECCCCc
Confidence 99999999999 5888999987653
No 34
>PF09224 DUF1961: Domain of unknown function (DUF1961); InterPro: IPR015305 Members of this family are found in a set of hypothetical bacterial proteins. Their exact function has not, as yet, been determined. ; PDB: 1OQ1_C.
Probab=53.75 E-value=1.2e+02 Score=24.95 Aligned_cols=30 Identities=0% Similarity=-0.026 Sum_probs=23.1
Q ss_pred ccCCCCeEEEEEEEECcEEEEEEcCceeee
Q psy17302 103 DLNDGHWHSVRVKRRVEKTSLTVDNITQTR 132 (244)
Q Consensus 103 ~vndg~WH~V~v~~~~~~~~L~VD~~~~~~ 132 (244)
......|+++++.+.+..+...||+.....
T Consensus 155 ~~d~~~~Yr~~i~K~~~~v~f~In~L~vf~ 184 (218)
T PF09224_consen 155 VEDARGPYRMEIVKDGRTVRFSINGLPVFS 184 (218)
T ss_dssp GGG--S-EEEEEEEETTEEEEEETTEEEEE
T ss_pred ccccCCCEEEEEEEcCCEEEEEECCEEEEE
Confidence 445557999999999999999999877664
No 35
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=52.58 E-value=75 Score=21.93 Aligned_cols=47 Identities=15% Similarity=0.263 Sum_probs=32.0
Q ss_pred CCCccEEEEEEeCCcEEEEEEcCCCe----EEEEE---cCccCCCCeEEEEEEE
Q psy17302 70 GGTYDFFEIKLVEGALRLRYNLGSGA----QILTV---GHDLNDGHWHSVRVKR 116 (244)
Q Consensus 70 ~~~~~~l~l~L~~G~l~~~~~~g~~~----~~~~s---~~~vndg~WH~V~v~~ 116 (244)
.+.+-|-.|.+..|.|.|..-.+.+. ..+.. ...+.-.+||+|...-
T Consensus 21 TK~GtWg~l~Vl~G~L~f~~~~~~~~~~~~~~~~~~~~~~~i~Pq~wH~V~p~s 74 (82)
T PF09313_consen 21 TKAGTWGKLRVLEGELKFYGLDEEGEEPEEEVFIPAGQPPVIEPQQWHRVEPLS 74 (82)
T ss_dssp CSTTEEEEEEEEESEEEEEEESSTT-SESEEEEEETTEEEEE-TT-EEEEEESS
T ss_pred CCCCeEEEEEEEeeEEEEEEECCCCCceeEEEEeCCCCCceeCCCceEEEEECC
Confidence 45567999999999999998877532 22322 3577889999998543
No 36
>PF08787 Alginate_lyase2: Alginate lyase; InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alignate. They cleave the glycosidic linkage of alignate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=51.11 E-value=1.4e+02 Score=24.76 Aligned_cols=61 Identities=16% Similarity=0.249 Sum_probs=43.2
Q ss_pred ccEEEEEE------eCCcEEEEEEc---CCCeEEEEEcCccCCCCeEEEEEEEECcEEEEEEcCceeeee
Q psy17302 73 YDFFEIKL------VEGALRLRYNL---GSGAQILTVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTRT 133 (244)
Q Consensus 73 ~~~l~l~L------~~G~l~~~~~~---g~~~~~~~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~~ 133 (244)
...+.|.. .+|.|.+.++- .++......-..+.-|+|-++.|...+..+.+.+|+......
T Consensus 121 ~pllkl~~~~~~~~~~G~l~~~~~~~~~~~~~~~~~~~~~i~LG~~F~y~I~v~~~~l~V~ing~~~~~~ 190 (236)
T PF08787_consen 121 EPLLKLYYRKEPGNEKGSLYVYVRQSNPDGGDQEYTIYGGIPLGEWFSYEIEVSGGTLTVTINGEGKTTV 190 (236)
T ss_dssp EEEEEEEEEEC--TETTEEEEEEESSTTTTSEEEEEEEEEEETT-EEEEEEEEETTEEEEEETTEEEEEE
T ss_pred CCEEEEEEEEeeeccCCeEEEEEeccCCCCCcEEeeeEcceeCCCEEEEEEEEECCEEEEEEECCcceEE
Confidence 34456666 78999999982 222233333346777899999999999999999999877653
No 37
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=42.27 E-value=1.8e+02 Score=24.04 Aligned_cols=29 Identities=28% Similarity=0.593 Sum_probs=24.4
Q ss_pred EcCccCCCCeEEEEEEEEC--cEEEEEEcCc
Q psy17302 100 VGHDLNDGHWHSVRVKRRV--EKTSLTVDNI 128 (244)
Q Consensus 100 s~~~vndg~WH~V~v~~~~--~~~~L~VD~~ 128 (244)
....+.+|+||.|.|..+. +.+.+.++..
T Consensus 160 ~~~~l~~g~~~~v~I~Yd~~~~~L~V~l~~~ 190 (236)
T PF00139_consen 160 PSFSLSDGKWHTVWIDYDASTKRLSVYLDDN 190 (236)
T ss_dssp EEHHHGTTSEEEEEEEEETTTTEEEEEEEET
T ss_pred ccccccCCcEEEEEEEEcCCccEEEEEEecc
Confidence 4667899999999999997 6787777776
No 38
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=41.23 E-value=1.9e+02 Score=23.38 Aligned_cols=84 Identities=14% Similarity=0.129 Sum_probs=51.1
Q ss_pred cceEEEEEEEeCCCCeEEEEEcCCCCccEE-EEEEe-CCcEEEEEEcCCCeEEEEEcCccCCCCeEEEEEEEEC--cEEE
Q psy17302 47 LNGSLELEFKTEQPNGLILYTDDGGTYDFF-EIKLV-EGALRLRYNLGSGAQILTVGHDLNDGHWHSVRVKRRV--EKTS 122 (244)
Q Consensus 47 ~~~~is~~frT~~~~GlLl~~~~~~~~~~l-~l~L~-~G~l~~~~~~g~~~~~~~s~~~vndg~WH~V~v~~~~--~~~~ 122 (244)
.-++++-..|-..-...-.|.++ +...|+ .|.+. +|.|+..++..++..++... ..+-...|...+.... ...+
T Consensus 32 ~gW~ls~~~RV~~G~~n~~yyAn-G~~r~l~~lsvn~sG~LvA~L~g~ss~~~~~~~-~~di~gyH~Y~i~~~p~~~tAS 109 (198)
T PF09264_consen 32 QGWSLSWESRVVSGGCNTNYYAN-GSKRYLPILSVNESGSLVAELEGQSSNTLLATT-GADIHGYHKYEIVFSPLTNTAS 109 (198)
T ss_dssp C-EEEEEEEEEEEES-EEEEEEE-SSEEEEEEEEE-TTS-EEEEETTS-S-EEEE-C-HHHHCSEEEEEEEEETTTTEEE
T ss_pred cCcceeeeEEEecCcceeEEEcC-CceEEEEEEEEcCCCCEEEEEecCCCcEEEecc-cccccceeEEEEEecCCCCceE
Confidence 34667767776654444444443 234555 45665 67788888766655555433 1222568999999965 8999
Q ss_pred EEEcCceeee
Q psy17302 123 LTVDNITQTR 132 (244)
Q Consensus 123 L~VD~~~~~~ 132 (244)
+.|||.....
T Consensus 110 fy~DG~lI~t 119 (198)
T PF09264_consen 110 FYFDGTLIAT 119 (198)
T ss_dssp EEETTEEEEE
T ss_pred EEECCEEEee
Confidence 9999998875
No 39
>COG3615 TehB Uncharacterized protein/domain, possibly involved in tellurite resistance [Inorganic ion transport and metabolism]
Probab=39.26 E-value=1.4e+02 Score=21.32 Aligned_cols=55 Identities=13% Similarity=0.252 Sum_probs=37.7
Q ss_pred CCCccEEEEEEeCCcEEEEEEcCCCe----EEEEE--c-CccCCCCeEEEEEEEECcEEEEE
Q psy17302 70 GGTYDFFEIKLVEGALRLRYNLGSGA----QILTV--G-HDLNDGHWHSVRVKRRVEKTSLT 124 (244)
Q Consensus 70 ~~~~~~l~l~L~~G~l~~~~~~g~~~----~~~~s--~-~~vndg~WH~V~v~~~~~~~~L~ 124 (244)
...+.|-.|.+-.|.+.+.--.+.+. .++.. . ..+--.+||+|+..-+.-...|.
T Consensus 32 Tk~G~w~kLsVl~G~vk~~~~~ee~~~~~e~~~~~ea~~~~~~PQ~WHrVea~tDD~e~~l~ 93 (99)
T COG3615 32 TKPGTWGKLSVLKGAVKFLGLAEEGETEPEHVFSIEAQFPVFPPQAWHRVEAMTDDAEFNLS 93 (99)
T ss_pred CCCCceeEEEEEeceeEEEEEcCCCCccceEEEeecCCCCccChhHeeeeeecccccEEEEE
Confidence 45567999999999999987666543 22322 2 34456789999987776666553
No 40
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=34.85 E-value=2.6e+02 Score=23.11 Aligned_cols=26 Identities=15% Similarity=0.152 Sum_probs=20.9
Q ss_pred cCCCCeEEEEEEEEC--cEEEEEEcCce
Q psy17302 104 LNDGHWHSVRVKRRV--EKTSLTVDNIT 129 (244)
Q Consensus 104 vndg~WH~V~v~~~~--~~~~L~VD~~~ 129 (244)
+.+|++|+|.|..++ +.+.+.++...
T Consensus 160 l~~g~~~~v~I~Y~~~~~~L~V~l~~~~ 187 (236)
T cd06899 160 LKSGKPMQAWIDYDSSSKRLSVTLAYSG 187 (236)
T ss_pred ccCCCeEEEEEEEcCCCCEEEEEEEeCC
Confidence 579999999999984 67777777543
No 41
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=30.66 E-value=3.1e+02 Score=22.57 Aligned_cols=29 Identities=17% Similarity=0.098 Sum_probs=22.4
Q ss_pred ccCCCCeEEEEEEEEC--cEEEEEEcCceee
Q psy17302 103 DLNDGHWHSVRVKRRV--EKTSLTVDNITQT 131 (244)
Q Consensus 103 ~vndg~WH~V~v~~~~--~~~~L~VD~~~~~ 131 (244)
..|.+.-.++.|+... ..+.+.||++...
T Consensus 147 ~rn~~~p~~iri~Y~~~~~~l~v~vd~~~Cf 177 (215)
T cd06903 147 YQDSGVPSTIRLSYDALNSLFKVQVDNRLCF 177 (215)
T ss_pred ccCCCCCEEEEEEEECCCCEEEEEECCCEEE
Confidence 3466677889999988 8899999986433
No 42
>PTZ00334 trans-sialidase; Provisional
Probab=29.75 E-value=2.4e+02 Score=28.09 Aligned_cols=74 Identities=18% Similarity=0.096 Sum_probs=42.3
Q ss_pred cCCCCeEEEEEEE-ECcEEEEEEcCceeeeecCCCceeeeecccCCceEEeccCCCCCCC-CCccccceeEECCEEec
Q psy17302 104 LNDGHWHSVRVKR-RVEKTSLTVDNITQTRTSRGKEFRFGHVTSNSWVYIGGMPPWYNAK-LTLLALPSVIFAGERVP 179 (244)
Q Consensus 104 vndg~WH~V~v~~-~~~~~~L~VD~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~-~~~GCl~~v~ing~~l~ 179 (244)
-..++-|+|.+.. ++++..+.||++.......... .. ...--...||||-....... ...=-+.+|.+=+++|.
T Consensus 640 We~~k~yqVal~L~~G~~gsvYVDG~~vg~~~~~l~-~~-~~~~IshFyiGgdg~~~~~~~~~~VTV~NVlLYNRpL~ 715 (780)
T PTZ00334 640 WEPETTHQVAIVLRNGKQGSAYVDGQRVGDASCELK-NT-DSKGISHFYIGGDGGSAGSKEDVPVTATNVLLYNRPLD 715 (780)
T ss_pred ccCCCeEEEEEEEeCCCeEEEEECCEEecCcccccC-CC-CCcccceEEECCCccccccCCCCCEEEeEeEEeCCCCC
Confidence 3456779999887 5779999999998743221111 00 11222679999965321110 11224566666666664
No 43
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=29.64 E-value=34 Score=18.93 Aligned_cols=15 Identities=13% Similarity=0.045 Sum_probs=10.6
Q ss_pred cCC-CCccCccceeec
Q psy17302 189 GGM-PPWYNAKLTLLA 203 (244)
Q Consensus 189 ~~~-~pc~n~~~~~~~ 203 (244)
|.. .||.+.+.|...
T Consensus 5 C~~~~~C~~~~~C~~~ 20 (39)
T smart00179 5 CASGNPCQNGGTCVNT 20 (39)
T ss_pred CcCCCCcCCCCEeECC
Confidence 545 689988888643
No 44
>cd08023 GH16_laminarinase_like Laminarinase, member of the glycosyl hydrolase family 16. Laminarinase, also known as glucan endo-1,3-beta-D-glucosidase, is a glycosyl hydrolase family 16 member that hydrolyzes 1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans such as laminarins, curdlans, paramylons, and pachymans, with very limited action on mixed-link (1,3-1,4-)-beta-D-glucans.
Probab=28.99 E-value=1.8e+02 Score=23.74 Aligned_cols=31 Identities=16% Similarity=0.130 Sum_probs=27.0
Q ss_pred cCCCCeEEEEEEEECcEEEEEEcCceeeeec
Q psy17302 104 LNDGHWHSVRVKRRVEKTSLTVDNITQTRTS 134 (244)
Q Consensus 104 vndg~WH~V~v~~~~~~~~L~VD~~~~~~~~ 134 (244)
-..+.||+-.+++....++..||++......
T Consensus 155 ~~~~~fHtY~~~W~p~~i~~yvDG~~v~~~~ 185 (235)
T cd08023 155 DLSDDFHTYAVEWTPDKITFYVDGKLYFTYT 185 (235)
T ss_pred CcCCCcEEEEEEEECCEEEEEECCEEEEEEc
Confidence 4578899999999999999999999887643
No 45
>PF11025 GP40: Glycoprotein GP40 of Cryptosporidium; InterPro: IPR021035 This entry represents proteins that are highly conserved in Cryptosporidium spp. Many members are annotated as being a 60 kDa glycoprotein.
Probab=28.86 E-value=2.7e+02 Score=21.34 Aligned_cols=57 Identities=16% Similarity=0.158 Sum_probs=39.3
Q ss_pred EEEEEeCCcEEEEEEcCCCeEEE-------EEcCccCCCCeEEEEEEEECcEEEEEEcCceeee
Q psy17302 76 FEIKLVEGALRLRYNLGSGAQIL-------TVGHDLNDGHWHSVRVKRRVEKTSLTVDNITQTR 132 (244)
Q Consensus 76 l~l~L~~G~l~~~~~~g~~~~~~-------~s~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~ 132 (244)
+.+....|.....+..|+-..+. ....++-.|.-..|.++.....+.+.||++.-..
T Consensus 12 FvmWf~~GtpvaTlkcg~YTiVyAP~k~~t~PaPrYISGev~~VtFeksd~TvkIkvd~kefst 75 (165)
T PF11025_consen 12 FVMWFGEGTPVATLKCGDYTIVYAPEKDQTDPAPRYISGEVKSVTFEKSDSTVKIKVDGKEFST 75 (165)
T ss_pred EEEEecCCcceEEEecCCEEEEEccccCCCCCCCceeecceEEEEEeccCCeEEEEECCeEccc
Confidence 33444556666666666533221 1235667788999999999999999999987654
No 46
>PF11250 DUF3049: Protein of unknown function (DUF3049); InterPro: IPR021410 This eukaryotic family of proteins has no known function.
Probab=27.62 E-value=1.7e+02 Score=18.65 Aligned_cols=37 Identities=16% Similarity=0.421 Sum_probs=20.3
Q ss_pred EEEEeCCCCeEEEEE-cCCCCccEEEEEEeCCcEEEEE
Q psy17302 53 LEFKTEQPNGLILYT-DDGGTYDFFEIKLVEGALRLRY 89 (244)
Q Consensus 53 ~~frT~~~~GlLl~~-~~~~~~~~l~l~L~~G~l~~~~ 89 (244)
+.+|+...||=|+.. -......++..+=.+|++.+.+
T Consensus 18 ~~~r~~r~dGRLvl~~v~v~~~~~~~A~R~~GRL~L~~ 55 (56)
T PF11250_consen 18 VLMRPHREDGRLVLEEVRVPSHEYFHAEREDGRLRLQF 55 (56)
T ss_pred EEEEEEccCCEEEEEEEEcCCcceEEEEccCCEEEEEe
Confidence 455666566643332 2233345666666777777654
No 47
>cd02178 GH16_beta_agarase Beta-agarase, member of glycosyl hydrolase family 16. Beta-agarase is a glycosyl hydrolase family 16 (GH16) member that hydrolyzes the internal beta-1,4-linkage of agarose, a hydrophilic polysaccharide found in the cell wall of Rhodophyceaea, marine red algae. Agarose is a linear chain of galactose units linked by alternating L-alpha-1,3- and D-beta-1,4-linkages that are additionally modified by a 3,6-anhydro-bridge. Agarose forms thermo-reversible gels that are widely used in the food industry or as a laboratory medium. While beta-agarases are also found in two other families derived from the sequence-based classification of glycosyl hydrolases (GH50, and GH86) the GH16 members are most abundant. This domain adopts a curved beta-sandwich conformation, with a tunnel-shaped active site cavity, referred to as a jellyroll fold.
Probab=25.24 E-value=2.1e+02 Score=24.06 Aligned_cols=28 Identities=11% Similarity=0.210 Sum_probs=24.6
Q ss_pred CCCeEEEEEEEE-CcEEEEEEcCceeeee
Q psy17302 106 DGHWHSVRVKRR-VEKTSLTVDNITQTRT 133 (244)
Q Consensus 106 dg~WH~V~v~~~-~~~~~L~VD~~~~~~~ 133 (244)
.+.||+-.+.+. ...++..||+......
T Consensus 178 ~~~fHtY~veW~~p~~i~fyvDG~~~~~~ 206 (258)
T cd02178 178 ADDFHVYGVYWKDPDTIRFYIDGVLVRTV 206 (258)
T ss_pred ccCeEEEEEEEcCCCeEEEEECCEEEEEE
Confidence 468999999999 9999999999877654
No 48
>PF05910 DUF868: Plant protein of unknown function (DUF868); InterPro: IPR008586 This family consists of several hypothetical proteins from plants. The function of this family is unknown.
Probab=24.76 E-value=1.8e+02 Score=25.07 Aligned_cols=28 Identities=14% Similarity=0.206 Sum_probs=22.2
Q ss_pred CCCCeEEEEEEEE-------CcEEEEEEcCceeee
Q psy17302 105 NDGHWHSVRVKRR-------VEKTSLTVDNITQTR 132 (244)
Q Consensus 105 ndg~WH~V~v~~~-------~~~~~L~VD~~~~~~ 132 (244)
..|+.|.|.|+-. ...+.++||++....
T Consensus 153 e~G~~HeI~Iec~~~~~g~~dp~l~V~VDgk~v~~ 187 (274)
T PF05910_consen 153 EGGKEHEISIECGGETGGPKDPELWVSVDGKKVVQ 187 (274)
T ss_pred CCCcEEEEEEEEeccCCCCCCceEEEEECCEEEEE
Confidence 4678999999982 237899999988764
No 49
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=22.76 E-value=68 Score=17.77 Aligned_cols=25 Identities=4% Similarity=-0.007 Sum_probs=17.2
Q ss_pred CCccCccceeecccCcccCCCeeee
Q psy17302 192 PPWYNAKLTLLALPSVIFEPRFVGF 216 (244)
Q Consensus 192 ~pc~n~~~~~~~~~~~~~~~~f~Gc 216 (244)
..|.+.|.|...-......++|.|-
T Consensus 6 ~~C~~~G~C~~~~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSPCGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCCCCEEECCCCCcCC
Confidence 3588888887443445667888874
No 50
>KOG3838|consensus
Probab=21.37 E-value=4.3e+02 Score=24.15 Aligned_cols=58 Identities=12% Similarity=0.035 Sum_probs=31.7
Q ss_pred cEEEEEEeCCcEEEEEEcCCCeEEEEEcCc--cCCCCeEEEEEEEECcEEEEEEcCceee
Q psy17302 74 DFFEIKLVEGALRLRYNLGSGAQILTVGHD--LNDGHWHSVRVKRRVEKTSLTVDNITQT 131 (244)
Q Consensus 74 ~~l~l~L~~G~l~~~~~~g~~~~~~~s~~~--vndg~WH~V~v~~~~~~~~L~VD~~~~~ 131 (244)
.++++-+.||++.+..-..+....+.+=.+ -|.=.--+..|+..++.+++.+|+....
T Consensus 152 P~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~innGmtp 211 (497)
T KOG3838|consen 152 PAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVMINNGMTP 211 (497)
T ss_pred ccEEEEecCCcccccCCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEEEcCCCCC
Confidence 456666666665554332221122222111 1222334677888999999999987655
No 51
>PF08530 PepX_C: X-Pro dipeptidyl-peptidase C-terminal non-catalytic domain; InterPro: IPR013736 This domain is found at the C terminus of cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). The domain, which is a beta sandwich, is also found in serine peptidases belonging to MEROPS peptidase family S15: Xaa-Pro dipeptidyl-peptidases. Members of this entry, that are not characterised as peptidases, show extensive low-level similarity to the Xaa-Pro dipeptidyl-peptidases. ; GO: 0008239 dipeptidyl-peptidase activity; PDB: 2B4K_D 1RYY_F 2B9V_O 1NX9_B 3PUH_B 3I2I_A 3I2G_A 1JU4_A 3I2K_A 1L7R_A ....
Probab=21.11 E-value=1.5e+02 Score=23.90 Aligned_cols=31 Identities=13% Similarity=0.144 Sum_probs=19.6
Q ss_pred CccCCCCeEEEEEEEE--------CcEEEEEEcCceeee
Q psy17302 102 HDLNDGHWHSVRVKRR--------VEKTSLTVDNITQTR 132 (244)
Q Consensus 102 ~~vndg~WH~V~v~~~--------~~~~~L~VD~~~~~~ 132 (244)
..+..|+++.+.|... |.++.|.|.+.....
T Consensus 146 ~~~~pg~~~~~~i~L~p~~~~~~~GhrLrl~I~~~d~~~ 184 (218)
T PF08530_consen 146 EPLEPGEPYDVTIELQPTAYVFPAGHRLRLSISSSDFPY 184 (218)
T ss_dssp ----TT-EEEEEEEEEEEEEEE-TT-EEEEEEESSBBTT
T ss_pred ccCCCCcEEEEEEEEchhccEECCCCEEEEEEEecCCCc
Confidence 5667899999888774 778899998876543
No 52
>cd00070 GLECT Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
Probab=20.60 E-value=3.5e+02 Score=19.77 Aligned_cols=32 Identities=13% Similarity=0.138 Sum_probs=28.7
Q ss_pred cCccCCCCeEEEEEEEECcEEEEEEcCceeee
Q psy17302 101 GHDLNDGHWHSVRVKRRVEKTSLTVDNITQTR 132 (244)
Q Consensus 101 ~~~vndg~WH~V~v~~~~~~~~L~VD~~~~~~ 132 (244)
..++..|+...+.+........+.||+.....
T Consensus 71 ~~pf~~g~~F~l~i~~~~~~f~i~vng~~~~~ 102 (127)
T cd00070 71 GFPFQPGQPFELTILVEEDKFQIFVNGQHFFS 102 (127)
T ss_pred CCCCCCCCeEEEEEEEcCCEEEEEECCEeEEE
Confidence 57899999999999999999999999987653
Done!