Query psy1323
Match_columns 186
No_of_seqs 155 out of 1051
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 23:25:55 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1323.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1323hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00054 Laminin_G_1: Laminin 100.0 8.9E-28 1.9E-32 170.4 16.8 130 13-155 1-130 (131)
2 smart00282 LamG Laminin G doma 99.9 4.3E-26 9.2E-31 162.3 16.9 132 6-150 2-134 (135)
3 cd00110 LamG Laminin G domain; 99.9 7.2E-24 1.6E-28 153.0 16.6 133 4-149 19-151 (151)
4 KOG3514|consensus 99.9 1E-24 2.2E-29 189.4 11.8 169 1-184 455-633 (1591)
5 PF02210 Laminin_G_2: Laminin 99.9 2.5E-23 5.4E-28 145.9 15.2 126 13-150 1-127 (128)
6 KOG4289|consensus 99.9 1.1E-21 2.3E-26 174.4 11.4 159 3-181 1335-1506(2531)
7 KOG3516|consensus 99.8 7.5E-20 1.6E-24 161.3 15.3 150 4-181 808-960 (1306)
8 KOG3516|consensus 99.8 5.2E-19 1.1E-23 156.1 14.8 152 8-177 203-361 (1306)
9 KOG3514|consensus 99.7 2.8E-17 6E-22 143.7 11.9 151 6-175 860-1014(1591)
10 KOG1219|consensus 99.7 5.1E-17 1.1E-21 149.4 13.9 160 7-184 3707-3874(4289)
11 KOG4289|consensus 99.6 1.4E-15 3.1E-20 136.1 11.9 143 2-180 1559-1703(2531)
12 smart00210 TSPN Thrombospondin 99.2 1.5E-09 3.2E-14 81.2 15.4 127 3-149 50-181 (184)
13 smart00159 PTX Pentraxin / C-r 98.3 8.2E-05 1.8E-09 56.6 15.3 127 6-149 32-161 (206)
14 cd00152 PTX Pentraxins are pla 98.1 0.00046 1E-08 52.2 15.7 127 6-149 32-161 (201)
15 PF13385 Laminin_G_3: Concanav 98.1 0.00012 2.6E-09 51.9 11.4 122 5-149 22-147 (157)
16 PF02973 Sialidase: Sialidase, 98.0 0.0014 3E-08 49.1 15.4 129 7-148 35-172 (190)
17 PF00354 Pentaxin: Pentaxin fa 97.9 0.00039 8.4E-09 52.5 11.8 128 6-150 26-156 (195)
18 smart00560 LamGL LamG-like jel 96.5 0.065 1.4E-06 37.6 10.1 65 70-149 61-126 (133)
19 KOG1834|consensus 92.5 2.1 4.4E-05 38.0 10.4 136 5-150 366-516 (952)
20 KOG3509|consensus 92.0 0.22 4.7E-06 45.9 4.3 104 38-150 282-385 (964)
21 PF06439 DUF1080: Domain of Un 90.7 0.19 4.1E-06 37.0 2.2 33 63-97 120-152 (185)
22 PF14099 Polysacc_lyase: Polys 89.0 3.3 7.1E-05 31.5 7.9 60 38-97 115-182 (224)
23 KOG3546|consensus 81.2 8.8 0.00019 34.2 7.5 128 6-150 88-224 (1167)
24 cd01951 lectin_L-type legume l 81.0 22 0.00048 27.0 12.9 25 70-94 154-178 (223)
25 KOG1836|consensus 74.6 0.99 2.1E-05 44.4 0.0 97 39-149 1584-1682(1705)
26 PF02057 Glyco_hydro_59: Glyco 67.5 97 0.0021 28.1 10.8 57 37-96 579-635 (669)
27 PF09264 Sial-lect-inser: Vibr 59.9 72 0.0016 24.0 7.8 84 6-96 33-118 (198)
28 PF07081 DUF1349: Protein of u 58.1 74 0.0016 23.5 12.2 120 5-149 52-176 (183)
29 PF14259 RRM_6: RNA recognitio 56.4 6.7 0.00014 23.5 1.2 31 118-149 1-31 (70)
30 cd06899 lectin_legume_LecRK_Ar 49.6 1.2E+02 0.0026 23.4 11.6 27 67-93 160-186 (236)
31 PF00139 Lectin_legB: Legume l 47.1 27 0.00058 26.9 3.5 30 64-93 161-190 (236)
32 KOG3838|consensus 42.8 39 0.00084 28.5 3.8 61 35-96 150-211 (497)
33 PF03633 Glyco_hydro_65C: Glyc 33.6 96 0.0021 17.7 4.1 27 70-96 5-31 (54)
34 PF09064 Tme5_EGF_like: Thromb 33.5 28 0.00061 18.3 1.2 14 173-186 6-19 (34)
35 PF09313 DUF1971: Domain of un 30.9 1.4E+02 0.0031 19.0 4.4 41 37-77 25-72 (82)
36 PF05910 DUF868: Plant protein 28.8 1.9E+02 0.0042 23.1 5.6 52 68-126 153-209 (274)
37 PF09061 Stirrup: Stirrup; In 28.1 24 0.00053 21.5 0.4 20 55-74 27-46 (79)
38 TIGR03436 acidobact_VWFA VWFA- 26.8 1.2E+02 0.0025 24.1 4.2 21 63-83 264-284 (296)
39 PF15407 Spo7_2_N: Sporulation 20.5 52 0.0011 20.3 0.8 13 115-127 27-39 (67)
No 1
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.96 E-value=8.9e-28 Score=170.40 Aligned_cols=130 Identities=27% Similarity=0.400 Sum_probs=98.2
Q ss_pred EEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEccccEEEEEEcC
Q psy1323 13 FRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVHGAKLIARVDN 92 (186)
Q Consensus 13 frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~ 92 (186)
|||.+++|+|||.+..... | |++|+|.+|+|+++|+.|.+...+.++..++||+||+|++.+... .+.|+||+
T Consensus 1 frT~~~~Gllly~g~~~~~---d--fial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~--~~~L~Vd~ 73 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGSKDGK---D--FIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGR--NGSLSVDG 73 (131)
T ss_dssp EEESSSSEEEEEEESSTTS---S--EEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETT--EEEEEETT
T ss_pred CccCCCCceEEECCcCCCC---C--EEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCc--EEEEEECC
Confidence 8999999999999887654 3 999999999999999999998888888889999999999977654 58888998
Q ss_pred CcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECCCCccc
Q psy1323 93 RTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSAGKAAS 155 (186)
Q Consensus 93 ~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~~~~~~ 155 (186)
....... .+.... ..++....+||||+|............++|.|||+++.+| ++++
T Consensus 74 ~~~~~~~---s~~~~~-~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in--~~~l 130 (131)
T PF00054_consen 74 EEVVTGE---SPSGAT-QSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSIN--GKPL 130 (131)
T ss_dssp SEEEEEE---ECSSSS-SSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEET--TEEC
T ss_pred ccceeee---cCCccc-cccccccCEEEccCCchhhcccccccCCCeeEEEEEeEEC--CEEC
Confidence 7752222 121111 1355667899999993333222222688899999999996 4444
No 2
>smart00282 LamG Laminin G domain.
Probab=99.95 E-value=4.3e-26 Score=162.28 Aligned_cols=132 Identities=29% Similarity=0.468 Sum_probs=101.4
Q ss_pred eeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEecC-CCccCCCceEEEEEEEcccc
Q psy1323 6 CREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSITVG-EGLNRDKWHSVMVRINVHGA 84 (186)
Q Consensus 6 ~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~~~-~~v~Dg~WH~V~i~~~~~~~ 84 (186)
.++|+|.|||.+++|+|||..+.... + |+.|+|.+|++++.++.+++...+... ..++||+||+|.+.+.. .
T Consensus 2 ~~~i~~~frt~~~~g~l~~~~~~~~~---~--~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~--~ 74 (135)
T smart00282 2 RLSISFSFRTTSPNGLLLYAGSKNGG---D--YLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNG--R 74 (135)
T ss_pred ceEEEEEEEeCCCCEEEEEeCCCCCC---C--EEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeC--C
Confidence 47899999999999999999774322 2 999999999999999999877776655 89999999999997764 4
Q ss_pred EEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECC
Q psy1323 85 KLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSA 150 (186)
Q Consensus 85 ~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~ 150 (186)
.+.|.||+....... .+... ..++....+||||+|......... ...+|+|||++|+||+
T Consensus 75 ~~~l~VD~~~~~~~~---~~~~~--~~l~~~~~l~iGG~p~~~~~~~~~-~~~~F~GCi~~v~in~ 134 (135)
T smart00282 75 RVTLSVDGENPVSGE---SPGGL--TILNLDGPLYLGGLPEDLKLPPLL-VTPGFRGCIRNLKVNG 134 (135)
T ss_pred EEEEEECCCccccEE---CCCCc--eEEecCCCcEEccCCchhcccccc-cCCCCeeEeeEEEECC
Confidence 688899986543222 12111 234566899999999865432222 5789999999999973
No 3
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.92 E-value=7.2e-24 Score=152.97 Aligned_cols=133 Identities=30% Similarity=0.471 Sum_probs=101.1
Q ss_pred eeeeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEccc
Q psy1323 4 RLCREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVHG 83 (186)
Q Consensus 4 r~~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~~ 83 (186)
+..++|+|+|||.+++|+|||.+..... .|+.|+|.+|++++.++.|.....+.....++||+||+|.+.+..
T Consensus 19 ~~~~~i~~~frt~~~~g~l~~~~~~~~~-----~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~dg~Wh~v~i~~~~-- 91 (151)
T cd00110 19 RTRLSISFSFRTTSPNGLLLYAGSQNGG-----DFLALELEDGRLVLRYDLGSGSLVLSSKTPLNDGQWHSVSVERNG-- 91 (151)
T ss_pred cceeEEEEEEEeCCCCeEEEEecCCCCC-----CEEEEEEECCEEEEEEcCCcccEEEEccCccCCCCEEEEEEEECC--
Confidence 4678999999999999999999886322 299999999999999999865666766668999999999997764
Q ss_pred cEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 84 AKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 84 ~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
..+.|.||+....... .+... ..+.....+||||.|......... ...+|+|||++|+||
T Consensus 92 ~~~~l~VD~~~~~~~~---~~~~~--~~~~~~~~~~iGg~~~~~~~~~~~-~~~~F~Gci~~v~in 151 (151)
T cd00110 92 RSVTLSVDGERVVESG---SPGGS--ALLNLDGPLYLGGLPEDLKSPGLP-VSPGFVGCIRDLKVN 151 (151)
T ss_pred CEEEEEECCccEEeee---CCCCc--eeecCCCCeEEcCCCCchhccccc-ccCCCceEeeEeEeC
Confidence 4688999997322211 11111 135567899999999864322222 478899999999995
No 4
>KOG3514|consensus
Probab=99.92 E-value=1e-24 Score=189.43 Aligned_cols=169 Identities=20% Similarity=0.387 Sum_probs=126.9
Q ss_pred CceeeeeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEe-cCCCccCCCceEEEEEE
Q psy1323 1 MSFRLCREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSIT-VGEGLNRDKWHSVMVRI 79 (186)
Q Consensus 1 ~~~r~~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~-~~~~v~Dg~WH~V~i~~ 79 (186)
|+--..-+|||.|||.+|||||+|.++.... ..| |+++||.+|+|.+.+++|++...++ +..++|||+||+|.+.+
T Consensus 455 Wnt~~~gSiSf~FRTtepnGlil~~~g~~~~-~~d--~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv~l~R 531 (1591)
T KOG3514|consen 455 WNTKKSGSISFDFRTTEPNGLILFHGGPQAN-ATD--YFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHVDLQR 531 (1591)
T ss_pred cccCCcceeEEEEeecCCCceEEEccCcccc-ccc--EEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEEEeec
Confidence 3344456899999999999999999875332 123 9999999999999999999988877 67799999999999988
Q ss_pred EccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCcc-cccee--cCCCcEEEEeeEEECCCCcccc
Q psy1323 80 NVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKL-HGVKY--IIESFIGCIKDMVLSAGKAASD 156 (186)
Q Consensus 80 ~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~-~~~~~--~~~~F~GCi~~v~in~~~~~~~ 156 (186)
+++...+ +||...... ..|+. +..+++++.+|+|-.|..... +++|. ...+|+|||||+.|+ ++..|
T Consensus 532 ~gR~gsv--sVd~~~~df----~tpG~--s~iL~ld~~mylG~~~n~l~~P~~vWta~L~~GyvGCirdl~i~--G~s~d 601 (1591)
T KOG3514|consen 532 DGRTGSV--SVDAIKTDF----STPGD--SEILDLDDPMYLGEVPNNLVYPSEVWTAALRKGYVGCIRDLFID--GVSTD 601 (1591)
T ss_pred cCccceE--EEeeeecCc----cCCCc--ceeEeecCceeeccCCCCccCcHHHHHHHHhccchheehhheec--ceehh
Confidence 7776444 588765432 23333 245778899999977665332 22332 688999999999995 67778
Q ss_pred cccCcccccCCccccCccc----cccCC--CCCC
Q psy1323 157 LLPIKPLIATKHEHVEEGC----IDKKS--PCNR 184 (186)
Q Consensus 157 l~~~~~~~~~~~~~v~~gC----~~~C~--~~~~ 184 (186)
+...... ++..+|+++| ...|. ||+|
T Consensus 602 i~q~ae~--q~sagvkpsCs~~~~~~C~~nPC~N 633 (1591)
T KOG3514|consen 602 IRQEAEA--QNSAGVKPSCSLSNEKICESNPCQN 633 (1591)
T ss_pred hHHHhhh--ccccccCcccchhhccccCCCcccC
Confidence 7766543 4568999999 34786 6765
No 5
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.91 E-value=2.5e-23 Score=145.86 Aligned_cols=126 Identities=26% Similarity=0.424 Sum_probs=96.5
Q ss_pred EEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCc-eEEEecCCCccCCCceEEEEEEEccccEEEEEEc
Q psy1323 13 FRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKH-STSITVGEGLNRDKWHSVMVRINVHGAKLIARVD 91 (186)
Q Consensus 13 frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~-~~~~~~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd 91 (186)
|||.+++|+|||.+..... .|+.|+|.+|+|++.++.|.. .........++||+||+|.+.+... .+.|.||
T Consensus 1 Frt~~~~g~Ll~~~~~~~~-----~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~--~~~l~Vd 73 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSEDNG-----DFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGN--RVTLTVD 73 (128)
T ss_dssp EEESSSSEEEEEEEESTTS-----EEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETT--EEEEEET
T ss_pred CccCCCCEeEEEEcCCCCC-----EEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeee--eEEEEec
Confidence 8999999999999987742 399999999999999999944 4556688899999999999877654 5888999
Q ss_pred CCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECC
Q psy1323 92 NRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSA 150 (186)
Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~ 150 (186)
+......... .. ....++....+||||.|......... ...+|+|||++|++|+
T Consensus 74 ~~~~~~~~~~-~~---~~~~~~~~~~l~iGg~~~~~~~~~~~-~~~~f~Gci~~l~vng 127 (128)
T PF02210_consen 74 GQSVSSESLP-SS---SSDSLDPDGSLYIGGLPESNQPSGSV-DTPGFVGCIRDLRVNG 127 (128)
T ss_dssp TSEEEEEESS-ST---THHCBESEEEEEESSTTTTCTCTTSS-TTSB-EEEEEEEEETT
T ss_pred CccceEEecc-cc---ceecccCCCCEEEecccCcccccccc-CCCCcEEEcCeEEECC
Confidence 9876543321 11 10134456689999999886654432 2789999999999973
No 6
>KOG4289|consensus
Probab=99.86 E-value=1.1e-21 Score=174.35 Aligned_cols=159 Identities=23% Similarity=0.410 Sum_probs=121.6
Q ss_pred eeeeeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEe--cCCCccCCCceEEEEEEE
Q psy1323 3 FRLCREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSIT--VGEGLNRDKWHSVMVRIN 80 (186)
Q Consensus 3 ~r~~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~--~~~~v~Dg~WH~V~i~~~ 80 (186)
-|+.|.++|+|-|.+.+|+|||.+++++ | |++|++.++++++.|..|....++. .+..++||+||+|.++..
T Consensus 1335 qRfh~TlslsfaT~~~nGlL~ynGnekh----D--FvalevVd~qvqltfS~Ges~t~v~p~Vp~gvsDGqWHtV~l~Yy 1408 (2531)
T KOG4289|consen 1335 QRFHFTLSLSFATIERNGLLLYNGNEKH----D--FVALEVVDEQVQLTFSAGESTTTVSPDVPGGVSDGQWHTVQLEYY 1408 (2531)
T ss_pred cceEEEEEEEEEEeeecceEEecCCccc----c--eEeeeeeeeeEEEEEecccccceecCCCCCCcccCceeEEEEEEe
Confidence 4788999999999999999999997653 4 9999999999999999997655665 466999999999998765
Q ss_pred ccccEEEEEEcCCcceeEE-eecCC-------CCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECCCC
Q psy1323 81 VHGAKLIARVDNRTDETII-KGLDP-------NTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSAGK 152 (186)
Q Consensus 81 ~~~~~l~l~vd~~~~~~~~-~~~~~-------~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~~~ 152 (186)
. +.++++||++...... .+..+ .......++..++|++||+|..... ....|.|||+++.+ ++
T Consensus 1409 N--K~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~fpv-----~~k~FvGCmrdLsv--D~ 1479 (2531)
T KOG4289|consen 1409 N--KVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETFPV-----IEKQFVGCMRDLSV--DG 1479 (2531)
T ss_pred c--eEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcchh-----hHhHhhhhhhhccc--cc
Confidence 3 5688899988754322 11111 1112245778889999999966432 45679999999999 67
Q ss_pred cccccccCcccccCCccccCccc---cccCCC
Q psy1323 153 AASDLLPIKPLIATKHEHVEEGC---IDKKSP 181 (186)
Q Consensus 153 ~~~~l~~~~~~~~~~~~~v~~gC---~~~C~~ 181 (186)
+.+|+.... ...++.+|| ++-|+.
T Consensus 1480 ~~VDma~fi-----anngt~eGC~ark~fCds 1506 (2531)
T KOG4289|consen 1480 RDVDMATFI-----ANNGTHEGCKARKNFCDS 1506 (2531)
T ss_pred ccccHHHHH-----hhcCcccCchhhhcccCC
Confidence 888887764 236789999 567764
No 7
>KOG3516|consensus
Probab=99.84 E-value=7.5e-20 Score=161.29 Aligned_cols=150 Identities=23% Similarity=0.341 Sum_probs=116.6
Q ss_pred eeeeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCC-EEEEEEEcCCceEEE--ecCCCccCCCceEEEEEEE
Q psy1323 4 RLCREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKG-QLKVVHVFGKHSTSI--TVGEGLNRDKWHSVMVRIN 80 (186)
Q Consensus 4 r~~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G-~l~~~~~~g~~~~~~--~~~~~v~Dg~WH~V~i~~~ 80 (186)
....+|+|.|||..++|++|.+.+.. | |+.++|..+ .+.|.++.|+++.++ .+...+||++||.|.++++
T Consensus 808 ~~saDIsf~FrTt~~~gvflen~g~~-----d--fir~eL~~~~~vtf~~dvgnGp~~~~V~s~t~~nD~qWH~V~~Ern 880 (1306)
T KOG3516|consen 808 ELSADISFFFRTTASSGVFLENHGIN-----D--FIRLELSSPVEVTFAFDVGNGPSQLTVRSPTELNDNQWHQVRAERN 880 (1306)
T ss_pred cccccEEEEEEecCCceEeeeccCCC-----c--eEEEEEcCCCceEEEEEcCCCceeEEEcCCcccCCCceEEEEEEec
Confidence 35678999999999999999998854 4 999999965 466789999987654 4667999999999999888
Q ss_pred ccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECCCCcccccccC
Q psy1323 81 VHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSAGKAASDLLPI 160 (186)
Q Consensus 81 ~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~~~~~~~l~~~ 160 (186)
..+ ..|+||+........ |... ...+...+++|+||... ..++|.||||.+.+| +..+||...
T Consensus 881 ~K~--a~LqVD~~~~~~r~s---p~~~-~~~L~l~s~l~vGgt~~---------~~~gF~GCIRsl~LN--Gv~ldLe~r 943 (1306)
T KOG3516|consen 881 SKE--ASLQVDGLPKSIRTS---PIPG-TRLLQLYSSLFVGGTVS---------RQRGFLGCIRSLQLN--GVMLDLEYR 943 (1306)
T ss_pred ccc--ceEEEcCcccceecC---CCCC-EEEEEeccceecccccc---------CcCcceeeeeeeeec--ceeeeehhh
Confidence 766 566799988765432 2111 25567788999998442 478899999999996 688998433
Q ss_pred cccccCCccccCccccccCCC
Q psy1323 161 KPLIATKHEHVEEGCIDKKSP 181 (186)
Q Consensus 161 ~~~~~~~~~~v~~gC~~~C~~ 181 (186)
+ ....+|.+||.-.|++
T Consensus 944 a----~~~~gv~~GC~GhCss 960 (1306)
T KOG3516|consen 944 A----YGTAGVSPGCEGHCSS 960 (1306)
T ss_pred h----ccCCcccCCCcccccc
Confidence 2 2337999999888874
No 8
>KOG3516|consensus
Probab=99.81 E-value=5.2e-19 Score=156.06 Aligned_cols=152 Identities=23% Similarity=0.373 Sum_probs=116.2
Q ss_pred EEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCc-------eEEEecCCCccCCCceEEEEEEE
Q psy1323 8 EISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKH-------STSITVGEGLNRDKWHSVMVRIN 80 (186)
Q Consensus 8 ~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~-------~~~~~~~~~v~Dg~WH~V~i~~~ 80 (186)
.|+|+|||.++||+|||..+.++ | |+.|+|.+|++++.+++|+. ...++.+..++|+.||.|.|.+.
T Consensus 203 ~is~~Fkt~~sdGvllh~eg~QG----d--~itlql~~~kl~l~ld~G~~~~~~s~~~~sis~GslLdD~hWHsV~i~r~ 276 (1306)
T KOG3516|consen 203 VISLKFKTMQSDGVLLHGEGQQG----D--YITLQLIGGKLVLILDLGNSKLPSSRTPTSISAGSLLDDQHWHSVRIERQ 276 (1306)
T ss_pred eeEEEEEeeccceeEEEcccCCC----C--EEEEEEeCCEEEEEEecCCccCccccCcceeecccccCCCcceEEEEEec
Confidence 48999999999999999988764 4 89999999999999999943 35577889999999999998775
Q ss_pred ccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECCCCcccccccC
Q psy1323 81 VHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSAGKAASDLLPI 160 (186)
Q Consensus 81 ~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~~~~~~~l~~~ 160 (186)
. ..+.++||+....... .++ + ..|+++..+|+||+|.......+ ...|.|||.+|.+|+ -.-++|...
T Consensus 277 ~--~~vnftvD~~~~~fr~----~Ge-~-~~Ldld~e~~~GGiP~~~~~~~~---~~nF~GCienly~N~-vdiidLa~~ 344 (1306)
T KOG3516|consen 277 G--RQVNFTVDGVVHHFRA----TGE-F-DALDLDTEISFGGIPNDGKSVGF---EKNFTGCLENLYYNG-VDIIDLAKR 344 (1306)
T ss_pred C--cEEEEEEccceEeecc----cCc-c-ceeecceEEEECCccCCCcccce---eeeeeeeeeeeeecC-ceeEeeecc
Confidence 4 4588999987654222 222 2 34667889999999998765543 478999999999984 334566554
Q ss_pred cccccCCccccCccccc
Q psy1323 161 KPLIATKHEHVEEGCID 177 (186)
Q Consensus 161 ~~~~~~~~~~v~~gC~~ 177 (186)
.+.+.....+|...|++
T Consensus 345 ~~~~~~~~gnv~f~C~~ 361 (1306)
T KOG3516|consen 345 RKSQISAMGNVSFSCSD 361 (1306)
T ss_pred cccceecccceeEeccC
Confidence 44444455688888943
No 9
>KOG3514|consensus
Probab=99.73 E-value=2.8e-17 Score=143.74 Aligned_cols=151 Identities=28% Similarity=0.467 Sum_probs=111.3
Q ss_pred eeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEe--cCCCccCCCceEEEEEEEccc
Q psy1323 6 CREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSIT--VGEGLNRDKWHSVMVRINVHG 83 (186)
Q Consensus 6 ~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~--~~~~v~Dg~WH~V~i~~~~~~ 83 (186)
...|-|+|+|.+|+|+|+|.++... | |+++||.+|+|++.|++|+++..+. +...+||++||.|.|.+.. .
T Consensus 860 s~~l~Fqfkt~sp~gll~fn~gd~n----d--fi~velvnG~ihYtfdlg~gp~~~k~~sr~hlnDnrWHnV~I~rd~-~ 932 (1591)
T KOG3514|consen 860 SMHLFFQFKTTSPDGLLLFNSGDGN----D--FIAVELVNGYIHYTFDLGNGPTSMKGPSRQHLNDNRWHNVLIYRDK-T 932 (1591)
T ss_pred EEEEEEEEeecCCCeEEEecCCCCC----c--eEEEEEeCcEEEEEEEcCCCcccccCcccCcCccccceeEEEEcCC-C
Confidence 3456678899999999999877542 3 9999999999999999999987766 6789999999999998774 4
Q ss_pred cEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCcc--ccceecCCCcEEEEeeEEECCCCcccccccCc
Q psy1323 84 AKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKL--HGVKYIIESFIGCIKDMVLSAGKAASDLLPIK 161 (186)
Q Consensus 84 ~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~--~~~~~~~~~F~GCi~~v~in~~~~~~~l~~~~ 161 (186)
+.-+|.||+........+ ..++++.+.|||||+...... ...-....+|.||+-.+-++ +...+|....
T Consensus 933 ~~HtL~vD~s~~t~~~~g-------~~~l~l~g~LyiGGv~k~m~~~~p~~~asR~g~~g~~~s~dl~--~r~p~L~~~a 1003 (1591)
T KOG3514|consen 933 NTHTLKVDNSSTTQIIDG-------AVNLDLKGKLYIGGVSKPMYSFLPKLVASRSGFQGCLASLDLG--GRLPDLISDA 1003 (1591)
T ss_pred CceEEEecCceEEEEecC-------ccccccccceecccccccccccccceeeccCCCCCCcCccCcc--ccchhHHHHh
Confidence 567788998876544431 256778889999998766443 22223578899999999995 3433444432
Q ss_pred ccccCCccccCccc
Q psy1323 162 PLIATKHEHVEEGC 175 (186)
Q Consensus 162 ~~~~~~~~~v~~gC 175 (186)
.. ....|..||
T Consensus 1004 ~~---~s~lv~~~~ 1014 (1591)
T KOG3514|consen 1004 LF---ESGLVEVGC 1014 (1591)
T ss_pred hh---hccceeeec
Confidence 21 224566666
No 10
>KOG1219|consensus
Probab=99.73 E-value=5.1e-17 Score=149.42 Aligned_cols=160 Identities=18% Similarity=0.335 Sum_probs=115.7
Q ss_pred eEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEec-CCCccCCCceEEEEEEEccccE
Q psy1323 7 REISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSITV-GEGLNRDKWHSVMVRINVHGAK 85 (186)
Q Consensus 7 ~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~~-~~~v~Dg~WH~V~i~~~~~~~~ 85 (186)
+.+.|+.||.+++|++||..... +..|.|.+|.+.+.++.|+++..+.+ ...+|||+||.|.+++.+ +.
T Consensus 3707 ~kl~frLkT~~sngIiM~tr~~d--------~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsialerrr--~~ 3776 (4289)
T KOG1219|consen 3707 MKLGFRLKTLQSNGIIMYTRKTD--------LAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIALERRR--NH 3776 (4289)
T ss_pred eEEEEEEEecccCcEEEEEcCCc--------eEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEeeccC--Cc
Confidence 78899999999999999998432 78899999999999999999866654 479999999999986654 45
Q ss_pred EEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECCCCcccccccCcc--c
Q psy1323 86 LIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSAGKAASDLLPIKP--L 163 (186)
Q Consensus 86 l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~~~~~~~l~~~~~--~ 163 (186)
+.|.+|+....+. ..|+. ...+.....||+||.-.....+.-. ...+|.|||+.+.+|+.. +++...-. .
T Consensus 3777 irlsvDd~~~~~a---tvPg~--~~tln~d~hiy~Ga~vrlr~~~~tq-vs~Gf~GCldsiyLng~e--l~l~~k~~s~a 3848 (4289)
T KOG1219|consen 3777 IRLSVDDDTYDSA---TVPGM--KSTLNLDTHIYLGALVRLRHQRSTQ-VSYGFDGCLDSIYLNGME--LPLTRKGKSVA 3848 (4289)
T ss_pred eEEEEcccCceee---ecccc--eeeccccceEEEeeEeeeccCCCcc-ccccccceeeeEEEcccc--ccccCCCchhh
Confidence 7778888765442 23432 2335566899999976522222111 578899999999997533 44443211 1
Q ss_pred ccCCccccCccc---cccCC--CCCC
Q psy1323 164 IATKHEHVEEGC---IDKKS--PCNR 184 (186)
Q Consensus 164 ~~~~~~~v~~gC---~~~C~--~~~~ 184 (186)
.+....+++.|| ++.|. |||+
T Consensus 3849 ~~~el~~l~pgC~l~~d~C~~npCqh 3874 (4289)
T KOG1219|consen 3849 GLMELFGLQPGCSLLTDPCNDNPCQH 3874 (4289)
T ss_pred hhhhhhcccccccccccccccCcccC
Confidence 122446889999 68896 7775
No 11
>KOG4289|consensus
Probab=99.65 E-value=1.4e-15 Score=136.09 Aligned_cols=143 Identities=21% Similarity=0.339 Sum_probs=99.9
Q ss_pred ceeeeeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEc
Q psy1323 2 SFRLCREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINV 81 (186)
Q Consensus 2 ~~r~~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~ 81 (186)
++-.+|.++|+|||++.+|+||.+.....+ -+.++|.+|.|.+.+.- . .......+++||+||++.|.+..
T Consensus 1559 ~vSvPwylsl~FRTr~ad~vl~~~~~~~rs------t~~lqld~g~l~~~v~~--s-~v~L~~~~vtdg~Wh~~~i~l~~ 1629 (2531)
T KOG4289|consen 1559 QVSVPWYLSLMFRTRRADGVLMQAEFGGRS------TYNLQLDDGTLKYNVGD--S-SVELPAPRVTDGHWHHLVIELEA 1629 (2531)
T ss_pred ceecceEEEEEEEeeccccEEEEEEeCCCc------eEEEEEcCCEEEEEecC--c-eEEccCccccCCchhheeeeecc
Confidence 456789999999999999999988765532 47899999999986543 2 23336789999999999987754
Q ss_pred cccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECCCCcccccccCc
Q psy1323 82 HGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSAGKAASDLLPIK 161 (186)
Q Consensus 82 ~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~~~~~~~l~~~~ 161 (186)
. +++++|......... .. ..+++ -..+|+||.|... ...+|+|||++|.+. +.+. +..
T Consensus 1630 -d--~~~t~d~g~~~aea~-~g---l~gl~---l~sl~vGgap~~g-------~p~gf~GCiqgV~v~--g~~~--l~~- 1687 (2531)
T KOG4289|consen 1630 -D--SVATLDYGIYQAEAK-AG---LSGLN---LESLYVGGAPATG-------VPRGFRGCIQGVRVG--GVSI--LVP- 1687 (2531)
T ss_pred -C--eEEEEechhhhhhhh-cC---CCCce---eeEEEEccccCCC-------ccccchhhhhceEEC--CEee--ccc-
Confidence 2 556677665432211 00 11222 2589999999542 467899999999994 4432 111
Q ss_pred ccccCCccccCccc--cccCC
Q psy1323 162 PLIATKHEHVEEGC--IDKKS 180 (186)
Q Consensus 162 ~~~~~~~~~v~~gC--~~~C~ 180 (186)
...+++.|| ++.|+
T Consensus 1688 -----~kv~~~~GCvvpn~C~ 1703 (2531)
T KOG4289|consen 1688 -----KKVNVEAGCVVPNPCS 1703 (2531)
T ss_pred -----cccccccCcccCCccc
Confidence 125788899 88886
No 12
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=99.21 E-value=1.5e-09 Score=81.19 Aligned_cols=127 Identities=12% Similarity=0.104 Sum_probs=81.0
Q ss_pred eeeeeEEEEEEEcC-CCCeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEc--CCc-eEEEe-cCCCccCCCceEEEE
Q psy1323 3 FRLCREISFQFRTR-LPHGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVF--GKH-STSIT-VGEGLNRDKWHSVMV 77 (186)
Q Consensus 3 ~r~~~~is~~frT~-~~~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~--g~~-~~~~~-~~~~v~Dg~WH~V~i 77 (186)
+-..|.|.+.||+. ...|.||...+.+.. .++.|.+..+...+.+.. ..+ ...+. .+..+.||+||+|.+
T Consensus 50 ~~~~fsi~~~~r~~~~~~g~L~si~~~~~~-----~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal 124 (184)
T smart00210 50 LPEDFSLLTTFRQTPKSRGVLFAIYDAQNV-----RQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLAL 124 (184)
T ss_pred CCCCeEEEEEEEeCCCCCeEEEEEEcCCCc-----EEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEE
Confidence 34568899999987 788888888764322 388999887765554432 222 23333 347899999999999
Q ss_pred EEEccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 78 RINVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 78 ~~~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
.+... .++|.||......... ... ....++. ..++++|.... ....|.|||++++|-
T Consensus 125 ~V~~~--~v~LyvDC~~~~~~~l--~~~--~~~~~~~-~g~~~~g~~~~--------~~~~f~G~lq~l~i~ 181 (184)
T smart00210 125 SVSGS--SATLYVDCNEIDSRPL--DRP--GQPPIDT-DGIEVRGAQAA--------DRKPFQGDLQQLKIV 181 (184)
T ss_pred EEeCC--EEEEEECCccccceec--CCc--ccccccc-cceEEEeeccC--------CCCcceEEeEEEEEe
Confidence 87654 5888899886554322 110 0012333 34455554432 234799999999983
No 13
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.30 E-value=8.2e-05 Score=56.61 Aligned_cols=127 Identities=18% Similarity=0.233 Sum_probs=76.0
Q ss_pred eeEEEEEEEcCC--CCeEEEEeeccCCCCCCCCceEEEEE-eCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEcc
Q psy1323 6 CREISFQFRTRL--PHGLLVYHSVKDRPEGLDPYALYVIV-EKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVH 82 (186)
Q Consensus 6 ~~~is~~frT~~--~~GlLl~~~~~~~~~~~d~~~~~l~l-~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~ 82 (186)
.+++.+.+||.. .++.||....... +. -+.+.. ..|.+.+.+ ++. .+..+..+.||+||+|.+..+..
T Consensus 32 ~fTvc~W~k~~~~~~~~~ifSy~~~~~----~n-e~~~~~~~~~~~~l~i--~g~--~~~~~~~~~~g~W~hvc~tw~~~ 102 (206)
T smart00159 32 AFTVCLWFYSDLSPRGYSLFSYATKGQ----DN-ELLLYKEKQGEYSLYI--GGK--KVQFPVPESDGKWHHICTTWESS 102 (206)
T ss_pred HEEEEEEEEecCCCCceEEEEEeCCCC----CC-eEEEEEcCCcEEEEEE--cCe--EEEecccccCCceEEEEEEEECC
Confidence 467888999864 4555554443332 11 232322 345555443 332 33344568999999999998887
Q ss_pred ccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 83 GAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 83 ~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
...+.|.+|+....... ..+ ...+...+.+.||.-..... .+.. ....|.|-|.+|.|-
T Consensus 103 ~g~~~lyvnG~~~~~~~--~~~----g~~i~~~G~lvlGq~qd~~g-g~f~-~~~~f~G~i~~v~iw 161 (206)
T smart00159 103 SGIAELWVDGKPGVRKG--LAK----GYTVKPGGSIILGQEQDSYG-GGFD-ATQSFVGEIGDLNMW 161 (206)
T ss_pred CCcEEEEECCEEccccc--ccC----CcEECCCCEEEEEecccCCC-CCCC-CCcceeEEEeeeEEe
Confidence 78899999987642111 111 12345667888886433211 1121 356799999999996
No 14
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.10 E-value=0.00046 Score=52.25 Aligned_cols=127 Identities=16% Similarity=0.164 Sum_probs=75.2
Q ss_pred eeEEEEEEEcCC--CCeEEEEeeccCCCCCCCCceEEEEE-eCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEcc
Q psy1323 6 CREISFQFRTRL--PHGLLVYHSVKDRPEGLDPYALYVIV-EKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVH 82 (186)
Q Consensus 6 ~~~is~~frT~~--~~GlLl~~~~~~~~~~~d~~~~~l~l-~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~ 82 (186)
.|++.+.+|+.. ..+.||....... +. .+.+.. ..|.+.+.+. +..... .....||+||+|.+..+..
T Consensus 32 ~fTv~~Wv~~~~~~~~~~ifSy~~~~~----~~-~~~l~~~~~g~~~~~i~--~~~~~~--~~~~~~g~W~hv~~t~d~~ 102 (201)
T cd00152 32 AFTLCLWVYTDLSTREYSLFSYATKGQ----DN-ELLLYKEKDGGYSLYIG--GKEVTF--KVPESDGAWHHICVTWEST 102 (201)
T ss_pred hEEEEEEEEecCCCCCeEEEEEeCCCC----CC-eEEEEEcCCCeEEEEEc--CEEEEE--eccCCCCCEEEEEEEEECC
Confidence 467888999764 4455554333322 22 233333 3467766542 222222 2345899999999988877
Q ss_pred ccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 83 GAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 83 ~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
...+.|.+|+........ . ....+...+.+.||....... .... ....|.|-|.+|+|-
T Consensus 103 ~g~~~lyvnG~~~~~~~~--~----~~~~~~~~g~l~lG~~q~~~g-g~~~-~~~~f~G~I~~v~iw 161 (201)
T cd00152 103 SGIAELWVNGKLSVRKSL--K----KGYTVGPGGSIILGQEQDSYG-GGFD-ATQSFVGEISDVNMW 161 (201)
T ss_pred CCcEEEEECCEEeccccc--c----CCCEECCCCeEEEeecccCCC-CCCC-CCcceEEEEceeEEE
Confidence 778899999876432211 1 112345567888886443211 0111 356799999999996
No 15
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.06 E-value=0.00012 Score=51.91 Aligned_cols=122 Identities=18% Similarity=0.266 Sum_probs=68.4
Q ss_pred eeeEEEEEEEcCCCCe--EEEEeeccCCCCCCCCceEEEEEe-CCEEEEEEEcCCc-eEEEecCCCccCCCceEEEEEEE
Q psy1323 5 LCREISFQFRTRLPHG--LLVYHSVKDRPEGLDPYALYVIVE-KGQLKVVHVFGKH-STSITVGEGLNRDKWHSVMVRIN 80 (186)
Q Consensus 5 ~~~~is~~frT~~~~G--lLl~~~~~~~~~~~d~~~~~l~l~-~G~l~~~~~~g~~-~~~~~~~~~v~Dg~WH~V~i~~~ 80 (186)
..++|++.||.....+ ..+.. .... + ..+.+.+. +|.+.+.+..+.+ ...+.....+.+++||+|.+.+.
T Consensus 22 ~~fTi~~w~~~~~~~~~~~~~~~-~~~~----~-~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~W~~l~~~~~ 95 (157)
T PF13385_consen 22 GSFTISFWVKPDSPSSSQSFVFM-DSSG----S-GGFGLFINNNGRLRFYIGNGGGGNYSFSSDSNLPDNKWHHLALTYD 95 (157)
T ss_dssp TEEEEEEEEEESS--SSEEEEEE-SSSS----S-EEEEEEEETTSEEEEEETTSEEESS-EE-BS---TT-EEEEEEEEE
T ss_pred CCEEEEEEEEeCCCCCCceEEEE-ecCC----C-CEEEEEEECCCEEEEEEeCCCceeEEEecCcccCCCCEEEEEEEEE
Confidence 3678999998765333 23322 1111 1 14566676 4777776555432 23455677888999999999776
Q ss_pred ccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 81 VHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 81 ~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
+..+.+.||+.......... . ........++||+... ....|.|-|.+|+|-
T Consensus 96 --~~~~~lyvnG~~~~~~~~~~-----~-~~~~~~~~~~iG~~~~---------~~~~~~g~i~~~~i~ 147 (157)
T PF13385_consen 96 --GSTVTLYVNGELVGSSTIPS-----N-ISLNSNGPLFIGGSGG---------GSSPFNGYIDDLRIY 147 (157)
T ss_dssp --TTEEEEEETTEEETTCTEES-----S-SSTTSCCEEEESS-ST---------T--B-EEEEEEEEEE
T ss_pred --CCeEEEEECCEEEEeEeccC-----C-cCCCCcceEEEeecCC---------CCCceEEEEEEEEEE
Confidence 44588889987654322111 0 1123456899998662 256699999999994
No 16
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.95 E-value=0.0014 Score=49.06 Aligned_cols=129 Identities=16% Similarity=0.123 Sum_probs=82.3
Q ss_pred eEEEEEEEcCCCCeE--EEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCc--eEEEecCC-----CccCCCceEEEE
Q psy1323 7 REISFQFRTRLPHGL--LVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKH--STSITVGE-----GLNRDKWHSVMV 77 (186)
Q Consensus 7 ~~is~~frT~~~~Gl--Ll~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~--~~~~~~~~-----~v~Dg~WH~V~i 77 (186)
.+|.++|++..++++ ||..++.... . .|+.|.+.++.+-+.+.-..+ ......+. ..++-.||.|.+
T Consensus 35 gTI~i~Fk~~~~~~~~sLfsiSn~~~~---n-~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~ 110 (190)
T PF02973_consen 35 GTIVIRFKSDSNSGIQSLFSISNSTKG---N-EYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAF 110 (190)
T ss_dssp EEEEEEEEESS-SSEEEEEEEE-TSTT---S-EEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEE
T ss_pred cEEEEEEecCCCcceeEEEEecCCCCc---c-ceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEE
Confidence 478999999877787 7777776543 2 499999999988887766544 23333333 345667999998
Q ss_pred EEEccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEE
Q psy1323 78 RINVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVL 148 (186)
Q Consensus 78 ~~~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~i 148 (186)
..+..+....|.+|+......... ...+-..++.-..++|||+.... ...-+|.|=|++|.|
T Consensus 111 ~ad~~~~~ykly~NG~~v~~~~~~---~~~Fis~i~~~n~~~iG~t~R~g------~~~y~f~G~I~~l~i 172 (190)
T PF02973_consen 111 VADSKNKGYKLYVNGELVSTLSSK---SGNFISDIPGLNSVQIGGTNRAG------SNAYPFNGTIDNLKI 172 (190)
T ss_dssp EEETTTTEEEEEETTCEEEEEEEC---TSS-GGGSTT--EEEESSEEETT------EEES--EEEEEEEEE
T ss_pred EEecCCCeEEEEeCCeeEEEeccc---cccHhhcCcCCceEEEcceEeCC------CceecccceEEEEEE
Confidence 777667789999999654433221 11221234444589999974432 146679999999999
No 17
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=97.89 E-value=0.00039 Score=52.48 Aligned_cols=128 Identities=18% Similarity=0.267 Sum_probs=72.9
Q ss_pred eeEEEEEEEcCCC--CeEEEEeeccCCCCCCCCceEEEEEe-CCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEcc
Q psy1323 6 CREISFQFRTRLP--HGLLVYHSVKDRPEGLDPYALYVIVE-KGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVH 82 (186)
Q Consensus 6 ~~~is~~frT~~~--~GlLl~~~~~~~~~~~d~~~~~l~l~-~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~ 82 (186)
.+++-+.+||... .+.||..+.... +. -+.+... .+.+.+.+ ++....+ ...+.||+||++.+..+..
T Consensus 26 ~fTvC~w~k~~~~~~~~tifSYat~~~----~n-ell~~~~~~~~~~l~i--~~~~~~~--~~~~~~~~Whh~C~tW~s~ 96 (195)
T PF00354_consen 26 AFTVCFWVKTDDSSNDGTIFSYATSSQ----DN-ELLLFGSSSGSLRLYI--NGSSVSF--SGPIRDGQWHHICVTWDSS 96 (195)
T ss_dssp EEEEEEEEEESGSGS-EEEEEEEETTE----EE-EEEEEEETTTEEEEEE--TTEEEEE--EECS-TSS-EEEEEEEETT
T ss_pred cEEEEEEEEeccCCCceEEEEEccCCC----Cc-cEEEEEeCCceEEEEE--CCeEeEe--ccccCCCCcEEEEEEEecC
Confidence 5788899999765 777776654432 11 2323233 46655543 3332333 3357899999999999888
Q ss_pred ccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECC
Q psy1323 83 GAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSA 150 (186)
Q Consensus 83 ~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~ 150 (186)
...+.+-+|+..... ..... +..+...+.+.||--.+. ....+. ...+|.|=|.+|.|-+
T Consensus 97 ~G~~~ly~dG~~~~~--~~~~~----g~~i~~gG~~vlGQeQd~-~gG~fd-~~q~F~G~i~~~~iWd 156 (195)
T PF00354_consen 97 TGRWQLYVDGVRLSS--TGLAT----GHSIPGGGTLVLGQEQDS-YGGGFD-ESQAFVGEISDFNIWD 156 (195)
T ss_dssp TTEEEEEETTEEEEE--EESST----T--B-SSEEEEESS-BSB-TTBTCS-GGGB--EEEEEEEEES
T ss_pred CcEEEEEECCEeccc--ccccC----CceECCCCEEEECccccc-cCCCcC-CccEeeEEEeceEEEe
Confidence 889999999984322 11222 234556677888853321 111222 4678999999999973
No 18
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=96.47 E-value=0.065 Score=37.63 Aligned_cols=65 Identities=17% Similarity=0.107 Sum_probs=42.9
Q ss_pred CCceEEEEEEEccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcc-cCCCCccccceecCCCcEEEEeeEEE
Q psy1323 70 DKWHSVMVRINVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGG-LSPEEKLHGVKYIIESFIGCIKDMVL 148 (186)
Q Consensus 70 g~WH~V~i~~~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG-~p~~~~~~~~~~~~~~F~GCi~~v~i 148 (186)
|+||+|.+..+.....+.|.||+........ + ......++.||. ..... .....|.|.|.+|+|
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~---~------~~~~~~~~~iG~~~~~~~------~~~~~f~G~Idevri 125 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSET---Q------PSPSSGNLPQGGRILLGG------AGGENFSGRLDEVRV 125 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEcccccc---C------CcccCCceEEeeeccCCC------CCCCCceEEeeEEEE
Confidence 9999999988876667999999865432111 0 112345788884 21111 135679999999999
Q ss_pred C
Q psy1323 149 S 149 (186)
Q Consensus 149 n 149 (186)
-
T Consensus 126 y 126 (133)
T smart00560 126 Y 126 (133)
T ss_pred e
Confidence 4
No 19
>KOG1834|consensus
Probab=92.50 E-value=2.1 Score=38.03 Aligned_cols=136 Identities=14% Similarity=0.178 Sum_probs=80.4
Q ss_pred eeeEEEEEEEcCCC-------CeEEEEeeccCCCCCCCCceEEEEEeCCEEEEEEEcCCc------eEEEe-cCCCccCC
Q psy1323 5 LCREISFQFRTRLP-------HGLLVYHSVKDRPEGLDPYALYVIVEKGQLKVVHVFGKH------STSIT-VGEGLNRD 70 (186)
Q Consensus 5 ~~~~is~~frT~~~-------~GlLl~~~~~~~~~~~d~~~~~l~l~~G~l~~~~~~g~~------~~~~~-~~~~v~Dg 70 (186)
-.|.|+|.+|--.. ---||..+++..- ..++.+|++.+=+|.|.+...-+ +.+.. .-..++|.
T Consensus 366 dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~em---nrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~ 442 (952)
T KOG1834|consen 366 DHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEM---NRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDN 442 (952)
T ss_pred CceEEEEeeecCCCCccccccceeEEeccccccc---ccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhh
Confidence 46889999884331 2246666665322 23477899999999987655322 22333 33579999
Q ss_pred CceEEEEEEEccccEEEEEEcCCcceeE-EeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 71 KWHSVMVRINVHGAKLIARVDNRTDETI-IKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 71 ~WH~V~i~~~~~~~~l~l~vd~~~~~~~-~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
+||.-.+.++.- .++|.+|+..-.-. ....-| . ........|-+|..=.......+. ....|+|-+..+.+-
T Consensus 443 EWH~Y~ln~efp--~VtlyvDG~Sfep~~i~ddwp---l-Hpsk~~tqLvVGACW~g~~~~~l~-~aqfFrG~Lasltlr 515 (952)
T KOG1834|consen 443 EWHHYVLNVEFP--DVTLYVDGKSFEPPLITDDWP---L-HPSKIETQLVVGACWQGRQQKPLK-LAQFFRGQLASLTLR 515 (952)
T ss_pred hhheeEEeecCc--eEEEEEcCcccCCceeccCCc---c-CcccccceeEEeeeccCccccchh-HHHHhhcccceeEEe
Confidence 999988766554 48888998653211 100000 0 011234577888643332222222 466799999999996
Q ss_pred C
Q psy1323 150 A 150 (186)
Q Consensus 150 ~ 150 (186)
+
T Consensus 516 s 516 (952)
T KOG1834|consen 516 S 516 (952)
T ss_pred c
Confidence 4
No 20
>KOG3509|consensus
Probab=92.04 E-value=0.22 Score=45.88 Aligned_cols=104 Identities=19% Similarity=0.235 Sum_probs=60.4
Q ss_pred eEEEEEeCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCc
Q psy1323 38 ALYVIVEKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSV 117 (186)
Q Consensus 38 ~~~l~l~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~ 117 (186)
|+.+.+..|.+-+.+..+............-+|+|+.+.+.+ +..+ .+++.... ..+..+... .-+.....
T Consensus 282 f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r---~s~~--~~~g~~~~--l~g~~~~~~--~~i~~ee~ 352 (964)
T KOG3509|consen 282 FRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFR---GSGL--SVSGHKGV--LQGNSNILV--SRITNEES 352 (964)
T ss_pred eeeeccCCCCccccccccCcchhhhhccccccceeeeeEeee---cccc--cccCccee--ecccccccc--cceeeccc
Confidence 777777777777777776665556666777789999998754 1112 23333221 111122111 22344556
Q ss_pred EEEcccCCCCccccceecCCCcEEEEeeEEECC
Q psy1323 118 VLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSA 150 (186)
Q Consensus 118 lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~ 150 (186)
+|.||.-.-...........+|.||++++.+|.
T Consensus 353 v~lg~i~ni~~l~~~~~~~eGf~gci~~~~~~~ 385 (964)
T KOG3509|consen 353 VFLGGIINIETLQHNLPLPEGFAGCIRDLVMNL 385 (964)
T ss_pred ccCCceeeeccccccCCCccCccceehhhhhhc
Confidence 777773222112111125789999999999974
No 21
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=90.70 E-value=0.19 Score=36.96 Aligned_cols=33 Identities=12% Similarity=0.345 Sum_probs=22.2
Q ss_pred cCCCccCCCceEEEEEEEccccEEEEEEcCCccee
Q psy1323 63 VGEGLNRDKWHSVMVRINVHGAKLIARVDNRTDET 97 (186)
Q Consensus 63 ~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~~~~~~ 97 (186)
.......|+||+++|+... ..+++.||+.....
T Consensus 120 ~~~~~~~~~W~~~~I~~~g--~~i~v~vnG~~v~~ 152 (185)
T PF06439_consen 120 VNVAIPPGEWNTVRIVVKG--NRITVWVNGKPVAD 152 (185)
T ss_dssp S--S--TTSEEEEEEEEET--TEEEEEETTEEEEE
T ss_pred ccccCCCCceEEEEEEEEC--CEEEEEECCEEEEE
Confidence 4456778999999987754 45888899876543
No 22
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=88.95 E-value=3.3 Score=31.52 Aligned_cols=60 Identities=12% Similarity=0.164 Sum_probs=40.8
Q ss_pred eEEEEEeCCEEEEEEEcCC-----ceEEEecCCCccCCCceEEEEEEEcc---ccEEEEEEcCCccee
Q psy1323 38 ALYVIVEKGQLKVVHVFGK-----HSTSITVGEGLNRDKWHSVMVRINVH---GAKLIARVDNRTDET 97 (186)
Q Consensus 38 ~~~l~l~~G~l~~~~~~g~-----~~~~~~~~~~v~Dg~WH~V~i~~~~~---~~~l~l~vd~~~~~~ 97 (186)
.++|.+.+|.+.+.+..+. ..........+.-|+||.+.+.+... ...+.+.+|+.....
T Consensus 115 ~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~ 182 (224)
T PF14099_consen 115 PFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVD 182 (224)
T ss_dssp CEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECCE
T ss_pred cEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEEE
Confidence 6889999999999876655 11223345567779999999865443 467888888866543
No 23
>KOG3546|consensus
Probab=81.24 E-value=8.8 Score=34.23 Aligned_cols=128 Identities=13% Similarity=0.123 Sum_probs=71.2
Q ss_pred eeEEEEEEEcCC-CCeEEEEeeccCCCCCCCCceEEEEEe---CCEEEEEE-Ec--CCceEE--EecCCCccCCCceEEE
Q psy1323 6 CREISFQFRTRL-PHGLLVYHSVKDRPEGLDPYALYVIVE---KGQLKVVH-VF--GKHSTS--ITVGEGLNRDKWHSVM 76 (186)
Q Consensus 6 ~~~is~~frT~~-~~GlLl~~~~~~~~~~~d~~~~~l~l~---~G~l~~~~-~~--g~~~~~--~~~~~~v~Dg~WH~V~ 76 (186)
.|.|.|..|... .-|+||-..+..+. -.|+-|.|. +|+-.+.+ .. |++... ......+-.++|.++.
T Consensus 88 df~~~~~i~p~s~~~gvlfaitd~~q~----~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a 163 (1167)
T KOG3546|consen 88 DFSLLFHIRPATEGPGVLFAITDSAQA----MVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLA 163 (1167)
T ss_pred cceEEEEeeccCCCCceEEEechhhhh----hheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhhee
Confidence 467777778754 45666655554322 124555443 67666542 22 333222 1123445579999988
Q ss_pred EEEEccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEECC
Q psy1323 77 VRINVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLSA 150 (186)
Q Consensus 77 i~~~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in~ 150 (186)
+.+.. ..++|.||-+.-.......+. ..+.+....-||+|..-.. ....|+|.|+.+++-.
T Consensus 164 ~~v~g--~~v~l~v~cee~~r~p~~rss---~~l~~e~~ag~f~~~ag~~--------~~~~f~g~~~~l~v~~ 224 (1167)
T KOG3546|consen 164 LSVAG--GFVALYVDCEEFQRMPLARSS---RGLELEPGAGLFVAQAGGA--------DPDKFQGVIAELKVRR 224 (1167)
T ss_pred eeecC--ceEEEEechHHhcccchhccc---cceeecCCcceEEeccCCC--------ChHhhhhhhhheeecC
Confidence 76654 457777875543221111111 1244556678998854332 2445999999999953
No 24
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=81.03 E-value=22 Score=27.01 Aligned_cols=25 Identities=20% Similarity=0.259 Sum_probs=21.1
Q ss_pred CCceEEEEEEEccccEEEEEEcCCc
Q psy1323 70 DKWHSVMVRINVHGAKLIARVDNRT 94 (186)
Q Consensus 70 g~WH~V~i~~~~~~~~l~l~vd~~~ 94 (186)
|+||++.|..+.....|.+.++...
T Consensus 154 g~~~~v~I~Y~~~~~~L~v~l~~~~ 178 (223)
T cd01951 154 GNEHTVRITYDPTTNTLTVYLDNGS 178 (223)
T ss_pred CCEEEEEEEEeCCCCEEEEEECCCC
Confidence 9999999988876678888888664
No 25
>KOG1836|consensus
Probab=74.64 E-value=0.99 Score=44.40 Aligned_cols=97 Identities=24% Similarity=0.205 Sum_probs=54.8
Q ss_pred EEEEEeCCEEEEEEEcCC--ceEEEecCCCccCCCceEEEEEEEccccEEEEEEcCCcceeEEeecCCCCCceeecCCCC
Q psy1323 39 LYVIVEKGQLKVVHVFGK--HSTSITVGEGLNRDKWHSVMVRINVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTS 116 (186)
Q Consensus 39 ~~l~l~~G~l~~~~~~g~--~~~~~~~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~ 116 (186)
..+....+.+...+..|- ....+......-++.||++.+... ...+.++.|. . +. +..+ ....+...
T Consensus 1584 ~~~~~~~~~~~~~~~~gi~~~~~s~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~-~---~~-~~~~----~~~~~~~~ 1652 (1705)
T KOG1836|consen 1584 LLVQENPIGVTEKFESGITDLSTSSTPIVSLLPGGCHSVTSSTD--PGVVQLEDDT-Y---TV-GEIP----PPPADTQE 1652 (1705)
T ss_pred HhhhhcccccchhhhhhhhhhhhcchhhhhhcCCcceeeeeecC--Cccccccccc-e---ec-ccCC----CCchhccC
Confidence 334444444444333332 123444566777999999986443 3345555554 1 11 1111 12234567
Q ss_pred cEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 117 VVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 117 ~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
++++||+|...... ......+|.||+ +.++
T Consensus 1653 p~~~~~~~~s~~~~-~~~~~~~~~~~~--~~~~ 1682 (1705)
T KOG1836|consen 1653 PIKLGGYPSSLTTL-RIAVLKSFTGCI--FVVM 1682 (1705)
T ss_pred CcccCCccccccce-eeecccccccce--EEec
Confidence 99999999875432 222578899999 6674
No 26
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=67.47 E-value=97 Score=28.05 Aligned_cols=57 Identities=18% Similarity=0.277 Sum_probs=34.2
Q ss_pred ceEEEEEeCCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEccccEEEEEEcCCcce
Q psy1323 37 YALYVIVEKGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVHGAKLIARVDNRTDE 96 (186)
Q Consensus 37 ~~~~l~l~~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~~~~~ 96 (186)
.|+.| ..+|.-.+.-++....+.......+.-++||++.+.+... .++-.+|+....
T Consensus 579 ~~f~v-~~~G~w~vt~d~~~~~~l~~G~~~~~~~~WhtltL~~~g~--~~ta~lng~~l~ 635 (669)
T PF02057_consen 579 YFFWV-YANGTWSVTSDLAGTTTLASGTADIGAGKWHTLTLTISGS--TATAMLNGTVLW 635 (669)
T ss_dssp EEEEE-ETTTEEEEEEETTS-SEEEEEE-S--TT-EEEEEEEEETT--EEEEEETTEEEE
T ss_pred EEEEE-EcCCcEEEeccCCCcEEEeeeeecccCCeEEEEEEEEECC--EEEEEECCEEeE
Confidence 35555 7789988876766533222223457789999999987654 466667776543
No 27
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=59.85 E-value=72 Score=23.96 Aligned_cols=84 Identities=13% Similarity=0.025 Sum_probs=51.2
Q ss_pred eeEEEEEEEcCCCCeEEEEeeccCCCCCCCCceEEE-EEe-CCEEEEEEEcCCceEEEecCCCccCCCceEEEEEEEccc
Q psy1323 6 CREISFQFRTRLPHGLLVYHSVKDRPEGLDPYALYV-IVE-KGQLKVVHVFGKHSTSITVGEGLNRDKWHSVMVRINVHG 83 (186)
Q Consensus 6 ~~~is~~frT~~~~GlLl~~~~~~~~~~~d~~~~~l-~l~-~G~l~~~~~~g~~~~~~~~~~~v~Dg~WH~V~i~~~~~~ 83 (186)
-|.++=+.|-..-...-.|.++... +|+.+ .+. +|.|++.+.-.++...+..+ ..+-...|..+|......
T Consensus 33 gW~ls~~~RV~~G~~n~~yyAnG~~------r~l~~lsvn~sG~LvA~L~g~ss~~~~~~~-~~di~gyH~Y~i~~~p~~ 105 (198)
T PF09264_consen 33 GWSLSWESRVVSGGCNTNYYANGSK------RYLPILSVNESGSLVAELEGQSSNTLLATT-GADIHGYHKYEIVFSPLT 105 (198)
T ss_dssp -EEEEEEEEEEEES-EEEEEEESSE------EEEEEEEE-TTS-EEEEETTS-S-EEEE-C-HHHHCSEEEEEEEEETTT
T ss_pred CcceeeeEEEecCcceeEEEcCCce------EEEEEEEEcCCCCEEEEEecCCCcEEEecc-cccccceeEEEEEecCCC
Confidence 3566666777666666666666543 25544 344 68888887766665555432 111257999999887777
Q ss_pred cEEEEEEcCCcce
Q psy1323 84 AKLIARVDNRTDE 96 (186)
Q Consensus 84 ~~l~l~vd~~~~~ 96 (186)
...++.+|+....
T Consensus 106 ~tASfy~DG~lI~ 118 (198)
T PF09264_consen 106 NTASFYFDGTLIA 118 (198)
T ss_dssp TEEEEEETTEEEE
T ss_pred CceEEEECCEEEe
Confidence 7888999987654
No 28
>PF07081 DUF1349: Protein of unknown function (DUF1349); InterPro: IPR009784 This family consists of several hypothetical bacterial proteins but contains one sequence (P40893 from SWISSPROT) from Saccharomyces cerevisiae. Members of this family are typically around 200 residues in length. The function of this family is unknown.; PDB: 3MEP_B 3O12_A.
Probab=58.12 E-value=74 Score=23.54 Aligned_cols=120 Identities=13% Similarity=0.042 Sum_probs=53.6
Q ss_pred eeeEEEEEEEc-CCCCeEEEEeeccCCCCCCCCceEEE--EE-eCCEEEEE-EEcCCceEEEecCCCccCCCceEEEEEE
Q psy1323 5 LCREISFQFRT-RLPHGLLVYHSVKDRPEGLDPYALYV--IV-EKGQLKVV-HVFGKHSTSITVGEGLNRDKWHSVMVRI 79 (186)
Q Consensus 5 ~~~~is~~frT-~~~~GlLl~~~~~~~~~~~d~~~~~l--~l-~~G~l~~~-~~~g~~~~~~~~~~~v~Dg~WH~V~i~~ 79 (186)
....++..++. .+.-|||+|..... ++.. |. .+|..++. +... +.........-.++..-.++|.+
T Consensus 52 ~~~~v~~~~~~~YDQaGL~v~~~~~~--------WiK~giE~~~~g~~~l~sV~t~-~~SDws~~~~~~~~~~~~lrv~R 122 (183)
T PF07081_consen 52 AEVKVSGDFKEQYDQAGLMVYQDEDN--------WIKAGIEYSNDGTPRLSSVVTN-GYSDWSLSPLPSDGQSVWLRVER 122 (183)
T ss_dssp EEEEEEE---STT-EEEEEEEEETTE--------EEEEEEEE-ETTCEEEEEEEEE-SSEEEEEEE--SBTTSEEEEEEE
T ss_pred EEEEEEeCCccceeeEEEEEEECCcc--------cEEEEEEEecCCCceEEEEecc-CcccccccccCCCCCEEEEEEEE
Confidence 34445555554 45667888776543 6655 44 57887763 3222 21111111112344555566656
Q ss_pred EccccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 80 NVHGAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 80 ~~~~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
.+..-.+..+.|+..-.... . ..+.....+.||=+-.. ....+|.....++.|.
T Consensus 123 ~g~~~~~~ys~DG~~w~~~R--~-------~~~~~~~~~~VG~~A~s-------P~~~g~~~~F~~f~i~ 176 (183)
T PF07081_consen 123 RGDDLWIYYSADGKTWTLLR--I-------FHFPEDWEVQVGVYACS-------PQGEGFEVEFDDFSIT 176 (183)
T ss_dssp ETTEEEEEEESSSS---EEE--E-------EE--S-S-EEEEEEEE--------SSSS--EEEEEEEEEE
T ss_pred eCCEEEEEEEcCCCEEEEEE--E-------EECCCCCcEEEEEEEeC-------CCCCcEEEEEeEEEEE
Confidence 55443333344654332211 1 11334567888843221 1366799999999993
No 29
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=56.44 E-value=6.7 Score=23.53 Aligned_cols=31 Identities=23% Similarity=0.434 Sum_probs=20.5
Q ss_pred EEEcccCCCCccccceecCCCcEEEEeeEEEC
Q psy1323 118 VLIGGLSPEEKLHGVKYIIESFIGCIKDMVLS 149 (186)
Q Consensus 118 lyiGG~p~~~~~~~~~~~~~~F~GCi~~v~in 149 (186)
|||+|+|.......+...-..| |.+.++.+.
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~-g~v~~v~~~ 31 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRF-GPVEKVRLI 31 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTS-SBEEEEEEE
T ss_pred CEEeCCCCCCCHHHHHHHHHhc-CCcceEEEE
Confidence 6899999986654443334445 788888885
No 30
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=49.57 E-value=1.2e+02 Score=23.38 Aligned_cols=27 Identities=11% Similarity=0.110 Sum_probs=22.1
Q ss_pred ccCCCceEEEEEEEccccEEEEEEcCC
Q psy1323 67 LNRDKWHSVMVRINVHGAKLIARVDNR 93 (186)
Q Consensus 67 v~Dg~WH~V~i~~~~~~~~l~l~vd~~ 93 (186)
+.+|++|.+.|..+...+.|.+.++..
T Consensus 160 l~~g~~~~v~I~Y~~~~~~L~V~l~~~ 186 (236)
T cd06899 160 LKSGKPMQAWIDYDSSSKRLSVTLAYS 186 (236)
T ss_pred ccCCCeEEEEEEEcCCCCEEEEEEEeC
Confidence 579999999998887677788888654
No 31
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=47.08 E-value=27 Score=26.92 Aligned_cols=30 Identities=23% Similarity=0.488 Sum_probs=25.5
Q ss_pred CCCccCCCceEEEEEEEccccEEEEEEcCC
Q psy1323 64 GEGLNRDKWHSVMVRINVHGAKLIARVDNR 93 (186)
Q Consensus 64 ~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~~ 93 (186)
...+.||.||.|.|..+.....|.+.++..
T Consensus 161 ~~~l~~g~~~~v~I~Yd~~~~~L~V~l~~~ 190 (236)
T PF00139_consen 161 SFSLSDGKWHTVWIDYDASTKRLSVYLDDN 190 (236)
T ss_dssp EHHHGTTSEEEEEEEEETTTTEEEEEEEET
T ss_pred cccccCCcEEEEEEEEcCCccEEEEEEecc
Confidence 456789999999999998888888888766
No 32
>KOG3838|consensus
Probab=42.82 E-value=39 Score=28.45 Aligned_cols=61 Identities=20% Similarity=0.153 Sum_probs=35.8
Q ss_pred CCceEEEEEeCCEEEEEEEcCCceEEEe-cCCCccCCCceEEEEEEEccccEEEEEEcCCcce
Q psy1323 35 DPYALYVIVEKGQLKVVHVFGKHSTSIT-VGEGLNRDKWHSVMVRINVHGAKLIARVDNRTDE 96 (186)
Q Consensus 35 d~~~~~l~l~~G~l~~~~~~g~~~~~~~-~~~~v~Dg~WH~V~i~~~~~~~~l~l~vd~~~~~ 96 (186)
++-+|++.+.+|.+.+. ..+++...+. +=.+=--++-|-|.+++.-.++.|++.||+....
T Consensus 150 nnP~Is~~lndGt~~yd-h~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~innGmtp 211 (497)
T KOG3838|consen 150 NNPAISVLLNDGTIPYD-HPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVMINNGMTP 211 (497)
T ss_pred CCccEEEEecCCccccc-CCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEEEcCCCCC
Confidence 33467777777777663 2233322221 1111112566777777777788899999988653
No 33
>PF03633 Glyco_hydro_65C: Glycosyl hydrolase family 65, C-terminal domain ; InterPro: IPR005194 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This family of glycosyl hydrolases (GH65 from CAZY) contains this domain and includes vacuolar acid trehalase and maltose phosphorylases. Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phosphate into beta-D-glucose-1-phosphate and glucose. The C-terminal domain forms a two layered jelly roll motif. This domain is situated at the base of the catalytic domain, however its function remains unknown [].; PDB: 1H54_A.
Probab=33.65 E-value=96 Score=17.68 Aligned_cols=27 Identities=19% Similarity=0.395 Sum_probs=18.9
Q ss_pred CCceEEEEEEEccccEEEEEEcCCcce
Q psy1323 70 DKWHSVMVRINVHGAKLIARVDNRTDE 96 (186)
Q Consensus 70 g~WH~V~i~~~~~~~~l~l~vd~~~~~ 96 (186)
.+|..+++.+..++..+.+.|+.....
T Consensus 5 ~~w~~l~F~~~~rg~~l~v~i~~~~v~ 31 (54)
T PF03633_consen 5 KQWSSLSFRLRYRGHWLEVEITHEKVT 31 (54)
T ss_dssp TT-SEEEEEEEETTEEEEEEEETTEEE
T ss_pred CccCEeEEEEEECCEEEEEEEECCEEE
Confidence 368888887777777788778776544
No 34
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=33.49 E-value=28 Score=18.34 Aligned_cols=14 Identities=21% Similarity=0.240 Sum_probs=10.2
Q ss_pred ccccccCCCCCCCC
Q psy1323 173 EGCIDKKSPCNRLQ 186 (186)
Q Consensus 173 ~gC~~~C~~~~~~~ 186 (186)
.-|+..|.|++..|
T Consensus 6 t~CpA~CDpn~~~~ 19 (34)
T PF09064_consen 6 TECPADCDPNSPGQ 19 (34)
T ss_pred ccCCCccCCCCCCc
Confidence 45888999887653
No 35
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=30.92 E-value=1.4e+02 Score=19.04 Aligned_cols=41 Identities=17% Similarity=0.145 Sum_probs=26.1
Q ss_pred ceEEEEEeCCEEEEEEEcCCc----eEEEe-c--CCCccCCCceEEEE
Q psy1323 37 YALYVIVEKGQLKVVHVFGKH----STSIT-V--GEGLNRDKWHSVMV 77 (186)
Q Consensus 37 ~~~~l~l~~G~l~~~~~~g~~----~~~~~-~--~~~v~Dg~WH~V~i 77 (186)
.|-.|.+..|.|.|..-.+.+ ...+. . ...+.=..||+|+.
T Consensus 25 tWg~l~Vl~G~L~f~~~~~~~~~~~~~~~~~~~~~~~i~Pq~wH~V~p 72 (82)
T PF09313_consen 25 TWGKLRVLEGELKFYGLDEEGEEPEEEVFIPAGQPPVIEPQQWHRVEP 72 (82)
T ss_dssp EEEEEEEEESEEEEEEESSTT-SESEEEEEETTEEEEE-TT-EEEEEE
T ss_pred eEEEEEEEeeEEEEEEECCCCCceeEEEEeCCCCCceeCCCceEEEEE
Confidence 488999999999997655432 12222 1 23556789999974
No 36
>PF05910 DUF868: Plant protein of unknown function (DUF868); InterPro: IPR008586 This family consists of several hypothetical proteins from plants. The function of this family is unknown.
Probab=28.79 E-value=1.9e+02 Score=23.12 Aligned_cols=52 Identities=15% Similarity=0.226 Sum_probs=33.9
Q ss_pred cCCCceEEEEEEE--cc---ccEEEEEEcCCcceeEEeecCCCCCceeecCCCCcEEEcccCCC
Q psy1323 68 NRDKWHSVMVRIN--VH---GAKLIARVDNRTDETIIKGLDPNTNYGVTSDLTSVVLIGGLSPE 126 (186)
Q Consensus 68 ~Dg~WH~V~i~~~--~~---~~~l~l~vd~~~~~~~~~~~~~~~~~~~~l~~~~~lyiGG~p~~ 126 (186)
..|+.|.|.|+-. .. ...+.+.||+........ . .=.+.-+..++++|+|-.
T Consensus 153 e~G~~HeI~Iec~~~~~g~~dp~l~V~VDgk~v~~Vkr-L------~WkFRGNqti~vdg~~V~ 209 (274)
T PF05910_consen 153 EGGKEHEISIECGGETGGPKDPELWVSVDGKKVVQVKR-L------RWKFRGNQTIFVDGLPVQ 209 (274)
T ss_pred CCCcEEEEEEEEeccCCCCCCceEEEEECCEEEEEEEE-e------eecccCceEEEECCeEEE
Confidence 3799999998762 11 236888999887543321 1 123445678999998865
No 37
>PF09061 Stirrup: Stirrup; InterPro: IPR015146 The Stirrup domain, found in the prokaryotic protein ribonucleotide reductase, has a molecular mass of 9 kDa and is folded into an alpha/beta structure. It allows for binding of the reductase to DNA via electrostatic interactions, since it has a predominance of positive charges distributed on its surface []. ; GO: 0016788 hydrolase activity, acting on ester bonds; PDB: 1DQ3_A.
Probab=28.08 E-value=24 Score=21.52 Aligned_cols=20 Identities=25% Similarity=0.644 Sum_probs=13.7
Q ss_pred CCceEEEecCCCccCCCceE
Q psy1323 55 GKHSTSITVGEGLNRDKWHS 74 (186)
Q Consensus 55 g~~~~~~~~~~~v~Dg~WH~ 74 (186)
|+....+..+.+++-|+||.
T Consensus 27 gsqtlaii~~ekislgqwh~ 46 (79)
T PF09061_consen 27 GSQTLAIIKNEKISLGQWHT 46 (79)
T ss_dssp TTEEEEEETTEEEE-TTHHH
T ss_pred CceEEEeecCceeehhhhhh
Confidence 44445566778889999996
No 38
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=26.82 E-value=1.2e+02 Score=24.08 Aligned_cols=21 Identities=19% Similarity=0.352 Sum_probs=15.0
Q ss_pred cCCCccCCCceEEEEEEEccc
Q psy1323 63 VGEGLNRDKWHSVMVRINVHG 83 (186)
Q Consensus 63 ~~~~v~Dg~WH~V~i~~~~~~ 83 (186)
+...-.||.||+|+|+....+
T Consensus 264 ~~~~~~dg~~~~i~V~~~~~~ 284 (296)
T TIGR03436 264 PPNPAHDGKFRKIKVRVKRKG 284 (296)
T ss_pred CCCCCCCCceeEEEEEEeCCC
Confidence 344456999999999776543
No 39
>PF15407 Spo7_2_N: Sporulation protein family 7
Probab=20.47 E-value=52 Score=20.26 Aligned_cols=13 Identities=15% Similarity=0.212 Sum_probs=10.4
Q ss_pred CCcEEEcccCCCC
Q psy1323 115 TSVVLIGGLSPEE 127 (186)
Q Consensus 115 ~~~lyiGG~p~~~ 127 (186)
...++|||+|..-
T Consensus 27 Sr~vflG~IP~~W 39 (67)
T PF15407_consen 27 SRRVFLGPIPEIW 39 (67)
T ss_pred CceEEECCCChHH
Confidence 4589999999754
Done!