Query psy6121
Match_columns 126
No_of_seqs 103 out of 484
Neff 5.1
Searched_HMMs 46136
Date Fri Aug 16 18:10:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6121.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6121hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00054 Laminin_G_1: Laminin 99.6 2.6E-15 5.7E-20 108.3 9.1 70 5-77 5-76 (131)
2 smart00282 LamG Laminin G doma 99.5 8.4E-14 1.8E-18 98.9 10.8 75 2-78 5-86 (135)
3 KOG3514|consensus 99.5 3.1E-13 6.6E-18 126.0 9.8 75 2-76 277-356 (1591)
4 cd00110 LamG Laminin G domain; 99.4 1.6E-12 3.4E-17 92.5 10.2 71 2-75 24-101 (151)
5 PF02210 Laminin_G_2: Laminin 99.4 6.1E-12 1.3E-16 86.5 9.5 76 7-84 7-84 (128)
6 KOG3516|consensus 99.3 8.1E-12 1.7E-16 117.2 9.5 80 2-82 813-898 (1306)
7 KOG3516|consensus 99.2 4.9E-11 1.1E-15 112.1 9.2 77 1-77 203-289 (1306)
8 KOG3514|consensus 99.2 6.2E-11 1.3E-15 110.9 9.0 72 2-75 463-543 (1591)
9 KOG4289|consensus 98.9 1E-08 2.2E-13 98.6 9.2 73 2-76 1341-1420(2531)
10 KOG1219|consensus 98.3 1.9E-06 4.1E-11 86.2 7.7 71 8-81 3720-3790(4289)
11 smart00210 TSPN Thrombospondin 98.1 3.5E-05 7.6E-10 58.8 9.5 71 10-81 70-144 (184)
12 PF13385 Laminin_G_3: Concanav 96.4 0.041 8.9E-07 37.8 8.4 63 17-81 49-112 (157)
13 KOG4289|consensus 94.3 0.13 2.7E-06 51.6 6.7 58 2-65 1566-1629(2531)
14 smart00159 PTX Pentraxin / C-r 93.5 0.6 1.3E-05 36.1 8.2 29 49-77 85-115 (206)
15 cd00152 PTX Pentraxins are pla 92.1 1.2 2.6E-05 34.1 8.1 62 12-80 53-118 (201)
16 KOG1834|consensus 92.1 0.73 1.6E-05 43.1 7.7 60 16-75 400-463 (952)
17 PF02973 Sialidase: Sialidase, 89.2 2.6 5.6E-05 33.3 7.6 76 2-77 37-128 (190)
18 smart00560 LamGL LamG-like jel 89.0 6.3 0.00014 28.0 9.0 25 54-78 61-87 (133)
19 PF14099 Polysacc_lyase: Polys 85.6 4.1 8.9E-05 31.1 6.8 61 17-77 113-180 (224)
20 PF00354 Pentaxin: Pentaxin fa 81.7 2.8 6.1E-05 32.5 4.5 27 49-75 79-107 (195)
21 PF06439 DUF1080: Domain of Un 75.1 4.6 0.0001 29.4 3.7 29 48-76 121-149 (185)
22 PRK06228 F0F1 ATP synthase sub 73.8 33 0.00072 25.2 9.7 67 10-78 13-86 (131)
23 TIGR03166 alt_F1F0_F1_eps alte 68.5 41 0.00089 24.4 7.4 66 10-77 10-82 (122)
24 KOG1653|consensus 68.1 6 0.00013 31.0 3.0 48 16-63 65-113 (175)
25 PF02057 Glyco_hydro_59: Glyco 64.9 28 0.00061 32.5 7.1 55 19-77 580-634 (669)
26 PF09313 DUF1971: Domain of un 62.8 36 0.00078 23.2 5.8 47 16-62 23-73 (82)
27 TIGR01216 ATP_synt_epsi ATP sy 60.5 43 0.00094 24.1 6.3 65 10-77 12-83 (130)
28 PRK14736 atpC F0F1 ATP synthas 58.0 49 0.0011 24.4 6.2 65 10-77 14-85 (133)
29 PF00139 Lectin_legB: Legume l 56.1 14 0.0003 29.0 3.2 28 48-75 161-190 (236)
30 PRK13447 F0F1 ATP synthase sub 55.1 84 0.0018 23.3 7.3 66 10-77 11-84 (136)
31 PF15016 DUF4520: Domain of un 50.1 84 0.0018 21.8 6.0 50 22-75 19-68 (85)
32 KOG4003|consensus 49.8 8.3 0.00018 31.0 1.0 15 50-64 38-52 (223)
33 COG3615 TehB Uncharacterized p 45.2 81 0.0017 22.7 5.4 56 16-71 34-93 (99)
34 PF02823 ATP-synt_DE_N: ATP sy 43.0 72 0.0016 20.8 4.7 61 10-73 11-78 (80)
35 PF07622 DUF1583: Protein of u 40.8 38 0.00082 29.8 3.8 34 48-81 84-117 (399)
36 cd01951 lectin_L-type legume l 40.3 1.4E+02 0.003 22.8 6.6 22 54-75 154-177 (223)
37 TIGR02148 Fibro_Slime fibro-sl 37.4 1.5E+02 0.0032 20.9 6.8 45 16-63 26-72 (90)
38 cd04508 TUDOR Tudor domains ar 35.9 40 0.00088 19.4 2.3 14 52-65 10-23 (48)
39 COG0266 Nei Formamidopyrimidin 35.3 1.9E+02 0.0041 24.1 6.9 51 12-66 52-103 (273)
40 PF00868 Transglut_N: Transglu 34.7 1.7E+02 0.0037 20.8 6.9 51 28-79 51-106 (118)
41 PRK00539 atpC F0F1 ATP synthas 34.7 1.8E+02 0.004 21.2 7.5 66 10-78 14-86 (133)
42 PRK13451 atpC F0F1 ATP synthas 33.9 1.6E+02 0.0035 20.4 7.1 49 10-61 11-60 (101)
43 PRK13450 atpC F0F1 ATP synthas 27.4 2.4E+02 0.0053 20.4 6.7 65 10-77 14-85 (132)
44 KOG0286|consensus 26.7 1.5E+02 0.0033 25.6 5.0 55 3-58 280-337 (343)
45 PF11859 DUF3379: Protein of u 26.7 1.8E+02 0.0038 23.8 5.3 49 29-77 166-220 (232)
46 PF11995 DUF3490: Domain of un 26.5 36 0.00078 26.5 1.2 30 2-31 18-49 (161)
47 PRK13444 atpC F0F1 ATP synthas 26.4 2.5E+02 0.0054 20.2 7.4 65 10-77 16-86 (127)
48 PRK09391 fixK transcriptional 25.2 1.8E+02 0.0038 22.3 4.9 31 8-39 47-77 (230)
49 cd00413 Glyco_hydrolase_16 gly 23.5 1.2E+02 0.0026 22.5 3.6 28 52-79 140-167 (210)
50 COG4499 Predicted membrane pro 23.1 73 0.0016 28.3 2.6 39 87-125 396-434 (434)
51 PRK13442 atpC F0F1 ATP synthas 22.0 2.6E+02 0.0056 18.9 6.9 65 10-77 16-87 (89)
52 cd08023 GH16_laminarinase_like 21.3 1.5E+02 0.0032 22.8 3.7 28 51-78 155-182 (235)
53 PF09264 Sial-lect-inser: Vibr 21.1 2.9E+02 0.0063 22.2 5.4 64 10-77 50-117 (198)
54 TIGR03436 acidobact_VWFA VWFA- 21.1 94 0.002 24.7 2.7 22 49-70 266-287 (296)
55 smart00333 TUDOR Tudor domain. 20.4 1.4E+02 0.0029 17.7 2.8 15 52-66 14-28 (57)
56 smart00564 PQQ beta-propeller 20.4 1.5E+02 0.0032 15.4 3.3 24 8-31 7-30 (33)
57 PF05523 FdtA: WxcM-like, C-te 20.2 2.7E+02 0.0059 20.0 4.8 43 18-60 54-98 (131)
No 1
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.62 E-value=2.6e-15 Score=108.33 Aligned_cols=70 Identities=21% Similarity=0.414 Sum_probs=60.4
Q ss_pred eeeceeeecCC--CCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEeccccc
Q psy6121 5 STDCWLGDGSD--GSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEK 77 (126)
Q Consensus 5 sfrtlLLyaG~--~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~ 77 (126)
..+++|||.|. ..|||+|+|.+|+|.+.|++|+++..+.. ...++||+||+|.+.|+.+.++|.||+...
T Consensus 5 ~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~~~~~~~---~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~ 76 (131)
T PF00054_consen 5 EPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSGPASLRS---PQKINDGKWHTVSVSRNGRNGSLSVDGEEV 76 (131)
T ss_dssp SSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSEEEEEEE---SSETTSSSEEEEEEEEETTEEEEEETTSEE
T ss_pred CCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCccceecC---CCccCCCcceEEEEEEcCcEEEEEECCccc
Confidence 34567999876 55999999999999999999999876543 245999999999999999999999999876
No 2
>smart00282 LamG Laminin G domain.
Probab=99.54 E-value=8.4e-14 Score=98.90 Aligned_cols=75 Identities=23% Similarity=0.390 Sum_probs=64.3
Q ss_pred eEEeeece-----eeecCC--CCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecc
Q psy6121 2 LTLSTDCW-----LGDGSD--GSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKE 74 (126)
Q Consensus 2 IS~sfrtl-----LLyaG~--~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~ 74 (126)
|+|.|+|. |||.++ ..+|++|+|.+|++.+.++.|+++..+.. +..+++||+||+|.+.++.+.++|.||+
T Consensus 5 i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~--~~~~~~dg~WH~v~i~~~~~~~~l~VD~ 82 (135)
T smart00282 5 ISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSGPARLTS--DPTPLNDGQWHRVAVERNGRRVTLSVDG 82 (135)
T ss_pred EEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEE--CCeEeCCCCEEEEEEEEeCCEEEEEECC
Confidence 67888875 888866 77999999999999999999988765543 2378999999999999999999999998
Q ss_pred cccc
Q psy6121 75 EEKK 78 (126)
Q Consensus 75 ~~~~ 78 (126)
....
T Consensus 83 ~~~~ 86 (135)
T smart00282 83 ENPV 86 (135)
T ss_pred Cccc
Confidence 6544
No 3
>KOG3514|consensus
Probab=99.45 E-value=3.1e-13 Score=125.97 Aligned_cols=75 Identities=35% Similarity=0.533 Sum_probs=71.6
Q ss_pred eEEeeece-----eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecccc
Q psy6121 2 LTLSTDCW-----LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEE 76 (126)
Q Consensus 2 IS~sfrtl-----LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~ 76 (126)
|||+|+|+ |||+|+..||+.|.|++|-+.+.+++++|.++..+.|+..++||++||.|.++|+...++|+||+.-
T Consensus 277 itl~FrT~q~ngllfytG~~~dYlnlaL~dGaV~l~~~l~~g~~e~~~~p~~~rfdD~~WH~V~v~R~~~m~t~~VDg~~ 356 (1591)
T KOG3514|consen 277 ITLTFRTVQGNGLLFYTGDEKDYLNLALQDGAVSLSSKLDGGDAEIIRMPNSFRFDDDSWHTVIVERSLQMMTLIVDGRR 356 (1591)
T ss_pred eEEEEEEecCceeEEEccCCcceeeEeecCCcEEEEEecCCccceeEEccccccccCCcceEEEEEeeeEEEEEEEccEE
Confidence 89999986 9999999999999999999999999999999999989889999999999999999999999999854
No 4
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.43 E-value=1.6e-12 Score=92.48 Aligned_cols=71 Identities=24% Similarity=0.396 Sum_probs=60.1
Q ss_pred eEEeeece-----eeecCCC--CCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecc
Q psy6121 2 LTLSTDCW-----LGDGSDG--SDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKE 74 (126)
Q Consensus 2 IS~sfrtl-----LLyaG~~--~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~ 74 (126)
|+|.|+|. |||.+.. .+|++|+|.+|++.+.++.|+.+..+. . ..+++||+||+|.+.++...++|.||+
T Consensus 24 i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g~~~~~~~--~-~~~v~dg~Wh~v~i~~~~~~~~l~VD~ 100 (151)
T cd00110 24 ISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLS--S-KTPLNDGQWHSVSVERNGRSVTLSVDG 100 (151)
T ss_pred EEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEEEcCCcccEEEE--c-cCccCCCCEEEEEEEECCCEEEEEECC
Confidence 67888764 8888765 899999999999999999996554332 2 237999999999999999999999999
Q ss_pred c
Q psy6121 75 E 75 (126)
Q Consensus 75 ~ 75 (126)
.
T Consensus 101 ~ 101 (151)
T cd00110 101 E 101 (151)
T ss_pred c
Confidence 8
No 5
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.37 E-value=6.1e-12 Score=86.53 Aligned_cols=76 Identities=18% Similarity=0.249 Sum_probs=61.6
Q ss_pred eceeeecCCCC--CeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecccccchhcccc
Q psy6121 7 DCWLGDGSDGS--DYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDK 84 (126)
Q Consensus 7 rtlLLyaG~~~--DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~~~~~~~~ 84 (126)
+++|||.++.. +|+.|+|.+|+|.+.+++|+....... +..+++|++||+|.+.|....++|.||+..........
T Consensus 7 ~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~--~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~~~~~~~~~~ 84 (128)
T PF02210_consen 7 NGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGSEIVTTF--SNSNLNDGQWHKVSISRDGNRVTLTVDGQSVSSESLPS 84 (128)
T ss_dssp SEEEEEEEESTTSEEEEEEEETTEEEEEEESSSSEEEEEE--CSSSSTSSSEEEEEEEEETTEEEEEETTSEEEEEESSS
T ss_pred CEeEEEEcCCCCCEEEEEEEECCEEEEEEEccccceeeec--cCccccccceeEEEEEEeeeeEEEEecCccceEEeccc
Confidence 45688886533 699999999999999999954443332 34689999999999999999999999999887765543
No 6
>KOG3516|consensus
Probab=99.31 E-value=8.1e-12 Score=117.21 Aligned_cols=80 Identities=16% Similarity=0.281 Sum_probs=72.0
Q ss_pred eEEeeece-----eeecCCCCCeEEEEEeCCE-EEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEeccc
Q psy6121 2 LTLSTDCW-----LGDGSDGSDYLNVAIKDGG-VAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEE 75 (126)
Q Consensus 2 IS~sfrtl-----LLyaG~~~DFIaLeL~~G~-L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~ 75 (126)
|+|.|||. +|.+-+..|||.|+|..+. ++|.++.|+||..+++..+ ..+||++||.|.++||.++.+|+||+.
T Consensus 813 Isf~FrTt~~~gvflen~g~~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s~-t~~nD~qWH~V~~Ern~K~a~LqVD~~ 891 (1306)
T KOG3516|consen 813 ISFFFRTTASSGVFLENHGINDFIRLELSSPVEVTFAFDVGNGPSQLTVRSP-TELNDNQWHQVRAERNSKEASLQVDGL 891 (1306)
T ss_pred EEEEEEecCCceEeeeccCCCceEEEEEcCCCceEEEEEcCCCceeEEEcCC-cccCCCceEEEEEEeccccceEEEcCc
Confidence 78999985 5555558899999999986 9999999999999988654 789999999999999999999999999
Q ss_pred ccchhcc
Q psy6121 76 EKKKRRE 82 (126)
Q Consensus 76 ~~~~~~~ 82 (126)
.+..|++
T Consensus 892 ~~~~r~s 898 (1306)
T KOG3516|consen 892 PKSIRTS 898 (1306)
T ss_pred ccceecC
Confidence 9998876
No 7
>KOG3516|consensus
Probab=99.21 E-value=4.9e-11 Score=112.05 Aligned_cols=77 Identities=21% Similarity=0.310 Sum_probs=64.0
Q ss_pred CeEEeeece-----eeec-CCCCCeEEEEEeCCEEEEEEEcCCceEEEEEc----CCCcccCCCCceEEEEEEeceeEEE
Q psy6121 1 MLTLSTDCW-----LGDG-SDGSDYLNVAIKDGGVAINLKLAMGKLDMYIR----PNRLRFDDNQWHHLSIHRRIQEGTG 70 (126)
Q Consensus 1 ~IS~sfrtl-----LLya-G~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~----~s~~~lnDgqWH~V~v~R~~r~~sL 70 (126)
+|+|.|||. |||+ |.++|||.|+|++|++.|.+|+|+.+...+.. +..+.|+|++||.|.+.|.++.+.+
T Consensus 203 ~is~~Fkt~~sdGvllh~eg~QGd~itlql~~~kl~l~ld~G~~~~~~s~~~~sis~GslLdD~hWHsV~i~r~~~~vnf 282 (1306)
T KOG3516|consen 203 VISLKFKTMQSDGVLLHGEGQQGDYITLQLIGGKLVLILDLGNSKLPSSRTPTSISAGSLLDDQHWHSVRIERQGRQVNF 282 (1306)
T ss_pred eeEEEEEeeccceeEEEcccCCCCEEEEEEeCCEEEEEEecCCccCccccCcceeecccccCCCcceEEEEEecCcEEEE
Confidence 589999975 8887 77999999999999999999999654322221 2256799999999999999999999
Q ss_pred Eeccccc
Q psy6121 71 CGKEEEK 77 (126)
Q Consensus 71 ~vd~~~~ 77 (126)
+||+-.-
T Consensus 283 tvD~~~~ 289 (1306)
T KOG3516|consen 283 TVDGVVH 289 (1306)
T ss_pred EEccceE
Confidence 9998654
No 8
>KOG3514|consensus
Probab=99.19 E-value=6.2e-11 Score=110.92 Aligned_cols=72 Identities=18% Similarity=0.378 Sum_probs=61.8
Q ss_pred eEEeeece-----eeec-CC---CCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEe
Q psy6121 2 LTLSTDCW-----LGDG-SD---GSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCG 72 (126)
Q Consensus 2 IS~sfrtl-----LLya-G~---~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~v 72 (126)
|||.|||. |+|+ |. ..|||+++|.||+|.+.+++|+|.+.+.. +...+|||.||+|.+.|++|.++++|
T Consensus 463 iSf~FRTtepnGlil~~~g~~~~~~d~~A~ELldghlyl~ldlGSG~iklra--s~rkv~DGeWhhv~l~R~gR~gsvsV 540 (1591)
T KOG3514|consen 463 ISFDFRTTEPNGLILFHGGPQANATDYFAIELLDGHLYLLLDLGSGVIKLRA--SSRKVNDGEWHHVDLQRDGRTGSVSV 540 (1591)
T ss_pred eEEEEeecCCCceEEEccCcccccccEEEEEEeCCeEEEEEecCCceEEeee--ecccccCCceEEEEeeccCccceEEE
Confidence 78999974 4444 44 56999999999999999999999876654 34689999999999999999999999
Q ss_pred ccc
Q psy6121 73 KEE 75 (126)
Q Consensus 73 d~~ 75 (126)
|+.
T Consensus 541 d~~ 543 (1591)
T KOG3514|consen 541 DAI 543 (1591)
T ss_pred eee
Confidence 975
No 9
>KOG4289|consensus
Probab=98.86 E-value=1e-08 Score=98.63 Aligned_cols=73 Identities=16% Similarity=0.271 Sum_probs=59.8
Q ss_pred eEEeeec-----eeeecCC-CCCeEEEEEeCCEEEEEEEcCCceEEEEEcCC-CcccCCCCceEEEEEEeceeEEEEecc
Q psy6121 2 LTLSTDC-----WLGDGSD-GSDYLNVAIKDGGVAINLKLAMGKLDMYIRPN-RLRFDDNQWHHLSIHRRIQEGTGCGKE 74 (126)
Q Consensus 2 IS~sfrt-----lLLyaG~-~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s-~~~lnDgqWH~V~v~R~~r~~sL~vd~ 74 (126)
++|+|.| +|||.|+ ..||++|++.++.+++.|.+|..... +.|. ...++||+||+|.+...++.++++||+
T Consensus 1341 lslsfaT~~~nGlL~ynGnekhDFvalevVd~qvqltfS~Ges~t~--v~p~Vp~gvsDGqWHtV~l~YyNK~av~svDd 1418 (2531)
T KOG4289|consen 1341 LSLSFATIERNGLLLYNGNEKHDFVALEVVDEQVQLTFSAGESTTT--VSPDVPGGVSDGQWHTVQLEYYNKVAVVSVDD 1418 (2531)
T ss_pred EEEEEEEeeecceEEecCCcccceEeeeeeeeeEEEEEecccccce--ecCCCCCCcccCceeEEEEEEeceEEEEEecc
Confidence 5677765 4999997 56999999999999999999964432 2222 357999999999999999999999997
Q ss_pred cc
Q psy6121 75 EE 76 (126)
Q Consensus 75 ~~ 76 (126)
-.
T Consensus 1419 Cd 1420 (2531)
T KOG4289|consen 1419 CD 1420 (2531)
T ss_pred cc
Confidence 54
No 10
>KOG1219|consensus
Probab=98.29 E-value=1.9e-06 Score=86.18 Aligned_cols=71 Identities=13% Similarity=0.165 Sum_probs=58.8
Q ss_pred ceeeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecccccchhc
Q psy6121 8 CWLGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR 81 (126)
Q Consensus 8 tlLLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~~~~~ 81 (126)
|.++|+- ..||..|.|.+|.+.+.++.|+|+-. +......+|||+||.|.+.|+++.++|.||++...+..
T Consensus 3720 gIiM~tr-~~d~~iLkLv~G~~~l~~~cgsG~Gi--vg~q~~~VnDgqWHsialerrr~~irlsvDd~~~~~at 3790 (4289)
T KOG1219|consen 3720 GIIMYTR-KTDLAILKLVGGSPQLLADCGSGPGI--VGSQKRTVNDGQWHSIALERRRNHIRLSVDDDTYDSAT 3790 (4289)
T ss_pred cEEEEEc-CCceEEEEecCCcEEEEEecCCCCCc--ccccceEeecCceeEEEeeccCCceEEEEcccCceeee
Confidence 3477754 78999999999999999999999832 22223579999999999999999999999998766543
No 11
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.09 E-value=3.5e-05 Score=58.76 Aligned_cols=71 Identities=18% Similarity=0.166 Sum_probs=50.7
Q ss_pred eeecC--CCCCeEEEEEeCCEEEEEEEc--CCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecccccchhc
Q psy6121 10 LGDGS--DGSDYLNVAIKDGGVAINLKL--AMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR 81 (126)
Q Consensus 10 LLyaG--~~~DFIaLeL~~G~L~l~~nl--GsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~~~~~ 81 (126)
||... ++..++.|.|..+...+.|.. ..|.......+ ..++.||+||+|.+...+..++|-||......+.
T Consensus 70 L~si~~~~~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~-~~~l~dg~WH~lal~V~~~~v~LyvDC~~~~~~~ 144 (184)
T smart00210 70 LFAIYDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFR-NLPLADGQWHKLALSVSGSSATLYVDCNEIDSRP 144 (184)
T ss_pred EEEEEcCCCcEEEEEEEeCCccEEEEEECCCCCcEEEEeec-CCccccCCceEEEEEEeCCEEEEEECCcccccee
Confidence 55543 355689999987765555553 33443332222 3679999999999999999999999998776653
No 12
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=96.35 E-value=0.041 Score=37.79 Aligned_cols=63 Identities=19% Similarity=0.209 Sum_probs=40.2
Q ss_pred CCeEEEEEeC-CEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEecccccchhc
Q psy6121 17 SDYLNVAIKD-GGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR 81 (126)
Q Consensus 17 ~DFIaLeL~~-G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~~~~~ 81 (126)
.+.+.|.+.+ |.+.+.+..+++..... .+...+.++.||+|.+..+...++|-||+.......
T Consensus 49 ~~~~~l~~~~~~~l~~~~~~~~~~~~~~--~~~~~~~~~~W~~l~~~~~~~~~~lyvnG~~~~~~~ 112 (157)
T PF13385_consen 49 SGGFGLFINNNGRLRFYIGNGGGGNYSF--SSDSNLPDNKWHHLALTYDGSTVTLYVNGELVGSST 112 (157)
T ss_dssp SEEEEEEEETTSEEEEEETTSEEESS-E--E-BS---TT-EEEEEEEEETTEEEEEETTEEETTCT
T ss_pred CCEEEEEEECCCEEEEEEeCCCceeEEE--ecCcccCCCCEEEEEEEEECCeEEEEECCEEEEeEe
Confidence 3467777764 66666666554221111 124678999999999999999999999998766543
No 13
>KOG4289|consensus
Probab=94.27 E-value=0.13 Score=51.58 Aligned_cols=58 Identities=28% Similarity=0.475 Sum_probs=41.7
Q ss_pred eEEeeece-----eeec-CCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEec
Q psy6121 2 LTLSTDCW-----LGDG-SDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRI 65 (126)
Q Consensus 2 IS~sfrtl-----LLya-G~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~ 65 (126)
++|-|+|. ||.+ ..+..-+.++|.+|.|.+.+.- ..+.+ + ...++||+||.+.|.-..
T Consensus 1566 lsl~FRTr~ad~vl~~~~~~~rst~~lqld~g~l~~~v~~--s~v~L---~-~~~vtdg~Wh~~~i~l~~ 1629 (2531)
T KOG4289|consen 1566 LSLMFRTRRADGVLMQAEFGGRSTYNLQLDDGTLKYNVGD--SSVEL---P-APRVTDGHWHHLVIELEA 1629 (2531)
T ss_pred EEEEEEeeccccEEEEEEeCCCceEEEEEcCCEEEEEecC--ceEEc---c-CccccCCchhheeeeecc
Confidence 67888886 6665 2234459999999999887753 23222 3 368999999999987753
No 14
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=93.54 E-value=0.6 Score=36.11 Aligned_cols=29 Identities=14% Similarity=0.240 Sum_probs=24.3
Q ss_pred cccCCCCceEEEEEEecee--EEEEeccccc
Q psy6121 49 LRFDDNQWHHLSIHRRIQE--GTGCGKEEEK 77 (126)
Q Consensus 49 ~~lnDgqWH~V~v~R~~r~--~sL~vd~~~~ 77 (126)
..+.|++||+|.++.+... .+|-||+...
T Consensus 85 ~~~~~g~W~hvc~tw~~~~g~~~lyvnG~~~ 115 (206)
T smart00159 85 VPESDGKWHHICTTWESSSGIAELWVDGKPG 115 (206)
T ss_pred ccccCCceEEEEEEEECCCCcEEEEECCEEc
Confidence 4689999999999998654 6788999875
No 15
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=92.11 E-value=1.2 Score=34.13 Aligned_cols=62 Identities=15% Similarity=0.231 Sum_probs=37.7
Q ss_pred ecCC-CCCeEEEEE-eCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEece--eEEEEecccccchh
Q psy6121 12 DGSD-GSDYLNVAI-KDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQ--EGTGCGKEEEKKKR 80 (126)
Q Consensus 12 yaG~-~~DFIaLeL-~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r--~~sL~vd~~~~~~~ 80 (126)
|++. ..+.+.+.. ..|.+.+.++ +.... . + .+.+|++||+|.++.+.. .++|-||+......
T Consensus 53 y~~~~~~~~~~l~~~~~g~~~~~i~--~~~~~--~-~--~~~~~g~W~hv~~t~d~~~g~~~lyvnG~~~~~~ 118 (201)
T cd00152 53 YATKGQDNELLLYKEKDGGYSLYIG--GKEVT--F-K--VPESDGAWHHICVTWESTSGIAELWVNGKLSVRK 118 (201)
T ss_pred EeCCCCCCeEEEEEcCCCeEEEEEc--CEEEE--E-e--ccCCCCCEEEEEEEEECCCCcEEEEECCEEeccc
Confidence 4433 333444433 3456666663 22221 1 1 345999999999999855 46899999876543
No 16
>KOG1834|consensus
Probab=92.06 E-value=0.73 Score=43.13 Aligned_cols=60 Identities=13% Similarity=0.196 Sum_probs=45.8
Q ss_pred CCCeEEEEEeCCEEEEEEEcCCceEEEE----EcCCCcccCCCCceEEEEEEeceeEEEEeccc
Q psy6121 16 GSDYLNVAIKDGGVAINLKLAMGKLDMY----IRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEE 75 (126)
Q Consensus 16 ~~DFIaLeL~~G~L~l~~nlGsG~~~~~----i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~ 75 (126)
+..+.+|++.+=+|.|.+.-..|..+-. .++....++|+.||+-.+.-+...++|.||+.
T Consensus 400 nrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~EWH~Y~ln~efp~VtlyvDG~ 463 (952)
T KOG1834|consen 400 NRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDNEWHHYVLNVEFPDVTLYVDGK 463 (952)
T ss_pred ccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhhhhheeEEeecCceEEEEEcCc
Confidence 4468999999999999998644332111 01223469999999999999999999999985
No 17
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=89.17 E-value=2.6 Score=33.28 Aligned_cols=76 Identities=12% Similarity=0.119 Sum_probs=51.0
Q ss_pred eEEeeece-------eeecCC---CCCeEEEEEeCCEEEEEEEcCCc-eEEEEEcCC---CcccCCCCceEEEEEEe--c
Q psy6121 2 LTLSTDCW-------LGDGSD---GSDYLNVAIKDGGVAINLKLAMG-KLDMYIRPN---RLRFDDNQWHHLSIHRR--I 65 (126)
Q Consensus 2 IS~sfrtl-------LLyaG~---~~DFIaLeL~~G~L~l~~nlGsG-~~~~~i~~s---~~~lnDgqWH~V~v~R~--~ 65 (126)
|.++|+.. ||-.++ ...|+.|++.++.+-+-++-..+ .......|. ....++..||+|.+.-+ .
T Consensus 37 I~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~ad~~~ 116 (190)
T PF02973_consen 37 IVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFVADSKN 116 (190)
T ss_dssp EEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEEEETTT
T ss_pred EEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEEEecCC
Confidence 56666651 555433 34799999999988888876554 222222221 23478889999999997 8
Q ss_pred eeEEEEeccccc
Q psy6121 66 QEGTGCGKEEEK 77 (126)
Q Consensus 66 r~~sL~vd~~~~ 77 (126)
....|=|||..-
T Consensus 117 ~~ykly~NG~~v 128 (190)
T PF02973_consen 117 KGYKLYVNGELV 128 (190)
T ss_dssp TEEEEEETTCEE
T ss_pred CeEEEEeCCeeE
Confidence 889999999543
No 18
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=88.95 E-value=6.3 Score=28.01 Aligned_cols=25 Identities=16% Similarity=-0.027 Sum_probs=22.0
Q ss_pred CCceEEEEEEec--eeEEEEecccccc
Q psy6121 54 NQWHHLSIHRRI--QEGTGCGKEEEKK 78 (126)
Q Consensus 54 gqWH~V~v~R~~--r~~sL~vd~~~~~ 78 (126)
++||+|.+..+. ..++|-||+....
T Consensus 61 ~~W~hva~v~d~~~g~~~lYvnG~~~~ 87 (133)
T smart00560 61 GVWVHLAGVYDGGAGKLSLYVNGVEVA 87 (133)
T ss_pred CCEEEEEEEEECCCCeEEEEECCEEcc
Confidence 999999999998 7899999997543
No 19
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=85.61 E-value=4.1 Score=31.14 Aligned_cols=61 Identities=13% Similarity=0.089 Sum_probs=38.7
Q ss_pred CCeEEEEEeCCEEEEEEEcCC--ceEEEEEcCCCcccCCCCceEEEEEEece-----eEEEEeccccc
Q psy6121 17 SDYLNVAIKDGGVAINLKLAM--GKLDMYIRPNRLRFDDNQWHHLSIHRRIQ-----EGTGCGKEEEK 77 (126)
Q Consensus 17 ~DFIaLeL~~G~L~l~~nlGs--G~~~~~i~~s~~~lnDgqWH~V~v~R~~r-----~~sL~vd~~~~ 77 (126)
...++|.+.+|.+.+.+..+. ............++.-|+||+|.+.-.-. .+.+-+|+...
T Consensus 113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v 180 (224)
T PF14099_consen 113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLV 180 (224)
T ss_dssp EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEEC
T ss_pred CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEE
Confidence 457999999999999998766 11111111223568889999999876433 46666677543
No 20
>PF00354 Pentaxin: Pentaxin family; InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=81.73 E-value=2.8 Score=32.45 Aligned_cols=27 Identities=19% Similarity=0.219 Sum_probs=20.1
Q ss_pred cccCCCCceEEEEEEece--eEEEEeccc
Q psy6121 49 LRFDDNQWHHLSIHRRIQ--EGTGCGKEE 75 (126)
Q Consensus 49 ~~lnDgqWH~V~v~R~~r--~~sL~vd~~ 75 (126)
..+.|++||+|-++.+.. ...|-+|+.
T Consensus 79 ~~~~~~~Whh~C~tW~s~~G~~~ly~dG~ 107 (195)
T PF00354_consen 79 GPIRDGQWHHICVTWDSSTGRWQLYVDGV 107 (195)
T ss_dssp ECS-TSS-EEEEEEEETTTTEEEEEETTE
T ss_pred cccCCCCcEEEEEEEecCCcEEEEEECCE
Confidence 468999999999999875 466668887
No 21
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=75.05 E-value=4.6 Score=29.42 Aligned_cols=29 Identities=10% Similarity=0.121 Sum_probs=23.4
Q ss_pred CcccCCCCceEEEEEEeceeEEEEecccc
Q psy6121 48 RLRFDDNQWHHLSIHRRIQEGTGCGKEEE 76 (126)
Q Consensus 48 ~~~lnDgqWH~V~v~R~~r~~sL~vd~~~ 76 (126)
...+..++||++.|.-.+..++..||+..
T Consensus 121 ~~~~~~~~W~~~~I~~~g~~i~v~vnG~~ 149 (185)
T PF06439_consen 121 NVAIPPGEWNTVRIVVKGNRITVWVNGKP 149 (185)
T ss_dssp --S--TTSEEEEEEEEETTEEEEEETTEE
T ss_pred cccCCCCceEEEEEEEECCEEEEEECCEE
Confidence 34689999999999999999999999974
No 22
>PRK06228 F0F1 ATP synthase subunit epsilon; Validated
Probab=73.75 E-value=33 Score=25.25 Aligned_cols=67 Identities=6% Similarity=0.000 Sum_probs=41.8
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEEE-----EeceeEEEEecccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSIH-----RRIQEGTGCGKEEEKK 78 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v~-----R~~r~~sL~vd~~~~~ 78 (126)
++|.+.+.+++.+.=.+|.+-+.-+. .|+-..+.|....+ .|+.|+.+.+. -....+++.+|..++.
T Consensus 13 ~~~~g~~v~~V~lpt~~G~~GILp~H--~p~it~L~~G~l~i~~~~~~~~~~av~gGf~ev~~n~V~Ilad~ae~~ 86 (131)
T PRK06228 13 VFAEKKGVTRIVAETREGSFGLLPHR--LDCVAALVPGILVYETEAEGEVYVAVDEGILVKTGPDVLVSVRNAIGG 86 (131)
T ss_pred eEECCCcEEEEEEEcCccCeEECCCC--cceEeeecceEEEEEECCCcEEEEEEcceEEEEECCEEEEEEceeEch
Confidence 88888778899998888886665543 33333333333344 47889987643 3455566666665443
No 23
>TIGR03166 alt_F1F0_F1_eps alternate F1F0 ATPase, F1 subunit epsilon. A small number of taxonomically diverse prokaryotic species have what appears to be a second ATP synthase, in addition to the normal F1F0 ATPase in bacteria and A1A0 ATPase in archaea. These enzymes use ion gradients to synthesize ATP, and in principle may run in either direction. This model represents the F1 epsilon subunit of this apparent second ATP synthase.
Probab=68.52 E-value=41 Score=24.40 Aligned_cols=66 Identities=9% Similarity=-0.024 Sum_probs=39.2
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEE-----EEeceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSI-----HRRIQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v-----~R~~r~~sL~vd~~~~ 77 (126)
+||.....+++.+.-.+|.+-+.-+. .|+-..+.|....+ .|+.|+.+.+ .-.+..+++++|..++
T Consensus 10 ~~~~~~~v~~v~~~~~~G~~GILp~H--~p~it~L~~G~l~i~~~~~~~~~~av~gGf~~v~~n~v~Il~~~ae~ 82 (122)
T TIGR03166 10 VFLDKLPVTRIVAETESGSFGLLPGH--VDCVAALVPGILIYETADGGEHYVAVDQGILVKRGADVEVSVRNAVG 82 (122)
T ss_pred eEEecCcEEEEEEEcCcCCeEEcCCC--cCeEeEeccEEEEEEECCCcEEEEEEeeeEEEEECCEEEEEeceeEc
Confidence 67764457788888888876665543 33333333333334 4889988643 3355557777666553
No 24
>KOG1653|consensus
Probab=68.08 E-value=6 Score=31.02 Aligned_cols=48 Identities=19% Similarity=0.276 Sum_probs=32.1
Q ss_pred CCCeEEEEEeCCEEEEEEEcCCceEEEEEcC-CCcccCCCCceEEEEEE
Q psy6121 16 GSDYLNVAIKDGGVAINLKLAMGKLDMYIRP-NRLRFDDNQWHHLSIHR 63 (126)
Q Consensus 16 ~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~-s~~~lnDgqWH~V~v~R 63 (126)
+.|=+.-.+++|+....|++|.+.+.-.-.+ .......-+||+|.+..
T Consensus 65 GqdPl~k~~rngrpVtiFsv~T~~~~k~r~~q~g~~~~~tqWHRVsVf~ 113 (175)
T KOG1653|consen 65 GQDPLQKILRNGRPVTIFSVGTGGMFKQRLYQAGDQPQPTQWHRVSVFN 113 (175)
T ss_pred ccchHHHhhcCCCeEEEEEeecCccccccccccCCcCCcceeEEEEeeC
Confidence 4455556688999888898887654321111 12346777999999987
No 25
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=64.85 E-value=28 Score=32.51 Aligned_cols=55 Identities=15% Similarity=0.164 Sum_probs=35.9
Q ss_pred eEEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEeccccc
Q psy6121 19 YLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEK 77 (126)
Q Consensus 19 FIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~ 77 (126)
|+.| ..+|.-.+.=++++-.+.. .....+.-++||++.+.-++..++-.+|+..-
T Consensus 580 ~f~v-~~~G~w~vt~d~~~~~~l~---~G~~~~~~~~WhtltL~~~g~~~ta~lng~~l 634 (669)
T PF02057_consen 580 FFWV-YANGTWSVTSDLAGTTTLA---SGTADIGAGKWHTLTLTISGSTATAMLNGTVL 634 (669)
T ss_dssp EEEE-ETTTEEEEEEETTS-SEEE---EEE-S--TT-EEEEEEEEETTEEEEEETTEEE
T ss_pred EEEE-EcCCcEEEeccCCCcEEEe---eeeecccCCeEEEEEEEEECCEEEEEECCEEe
Confidence 5665 6778777766665522211 11245888999999999999999999998754
No 26
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=62.81 E-value=36 Score=23.25 Aligned_cols=47 Identities=13% Similarity=0.273 Sum_probs=30.5
Q ss_pred CCCeEEEEEeCCEEEEEEEcCCce---EEEEEcCC-CcccCCCCceEEEEE
Q psy6121 16 GSDYLNVAIKDGGVAINLKLAMGK---LDMYIRPN-RLRFDDNQWHHLSIH 62 (126)
Q Consensus 16 ~~DFIaLeL~~G~L~l~~nlGsG~---~~~~i~~s-~~~lnDgqWH~V~v~ 62 (126)
.+=|-.|.+..|.|.+.+-.+.|. ....+.++ ...+..+.||+|...
T Consensus 23 ~GtWg~l~Vl~G~L~f~~~~~~~~~~~~~~~~~~~~~~~i~Pq~wH~V~p~ 73 (82)
T PF09313_consen 23 AGTWGKLRVLEGELKFYGLDEEGEEPEEEVFIPAGQPPVIEPQQWHRVEPL 73 (82)
T ss_dssp TTEEEEEEEEESEEEEEEESSTT-SESEEEEEETTEEEEE-TT-EEEEEES
T ss_pred CCeEEEEEEEeeEEEEEEECCCCCceeEEEEeCCCCCceeCCCceEEEEEC
Confidence 455888999999999998877532 22222232 235999999999853
No 27
>TIGR01216 ATP_synt_epsi ATP synthase, F1 epsilon subunit (delta in mitochondria). This model describes one of the five types of subunits in the F1 part of F1/F0 ATP synthases. Members of this family are designated epsilon in bacterial and chloroplast systems but designated delta in mitochondria, where the counterpart of the bacterial delta subunit is designated OSCP. In a few cases (Propionigenium modestum, Acetobacterium woodii) scoring above the trusted cutoff and designated here as exceptions, Na+ replaces H+ for translocation.
Probab=60.45 E-value=43 Score=24.14 Aligned_cols=65 Identities=14% Similarity=0.113 Sum_probs=36.1
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCcc--cCCCCceEEEEEE-----eceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLR--FDDNQWHHLSIHR-----RIQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~--lnDgqWH~V~v~R-----~~r~~sL~vd~~~~ 77 (126)
.+|.+ ..+++.+.-.+|.+-+.-+.- |+-..+.|+... ..|+.|+.+.+.. +...++++.+...+
T Consensus 12 ~~~~~-~v~~v~~~t~~G~~giL~~H~--p~i~~L~~G~v~i~~~~~~~~~~~v~gG~~~v~~~~v~Il~~~a~~ 83 (130)
T TIGR01216 12 EIYSG-EVESVILPGSEGELGILPGHA--PLITALKPGVVRIRKLGDDWEHIAVSGGFAEVQPDKVTILADGAVF 83 (130)
T ss_pred eEEee-eEEEEEEECCcCCeeecCCCh--hhEeEecceEEEEEECCCCEEEEEEcCcEEEEECCEEEEEEeEEEE
Confidence 56654 567888887788766655433 332233333223 3578899876443 33345555554443
No 28
>PRK14736 atpC F0F1 ATP synthase subunit epsilon; Provisional
Probab=58.01 E-value=49 Score=24.36 Aligned_cols=65 Identities=3% Similarity=-0.050 Sum_probs=38.0
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEEE-----EeceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSIH-----RRIQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v~-----R~~r~~sL~vd~~~~ 77 (126)
++|.+ ..+++.+.-.+|.+-+.-+. .|+-..+.|....+ +|+.|+.+.+. .+...+++++|..++
T Consensus 14 ~~~~~-~v~~V~vp~~~G~~gILp~H--~P~it~L~~G~l~i~~~~~~~~~~~v~gGf~eV~~n~v~Ila~~ae~ 85 (133)
T PRK14736 14 TLYSG-EVEAVQLPGSEGEMTVLPGH--APVLTTLKVGVITVTETTGNGKRIYVRGGFAEIGPTSVTVLAERAAP 85 (133)
T ss_pred cEEee-eeEEEEEecCccCeeEcCCC--hhhEeEecceEEEEEECCCcEEEEEEeceEEEEECCEEEEEeeeeEE
Confidence 67765 57888888888876665443 23322233332333 58899887643 344456666665544
No 29
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=56.10 E-value=14 Score=29.00 Aligned_cols=28 Identities=14% Similarity=0.290 Sum_probs=21.3
Q ss_pred CcccCCCCceEEEEEEece--eEEEEeccc
Q psy6121 48 RLRFDDNQWHHLSIHRRIQ--EGTGCGKEE 75 (126)
Q Consensus 48 ~~~lnDgqWH~V~v~R~~r--~~sL~vd~~ 75 (126)
...+.|+.||.|.|..+.. .+++.++..
T Consensus 161 ~~~l~~g~~~~v~I~Yd~~~~~L~V~l~~~ 190 (236)
T PF00139_consen 161 SFSLSDGKWHTVWIDYDASTKRLSVYLDDN 190 (236)
T ss_dssp EHHHGTTSEEEEEEEEETTTTEEEEEEEET
T ss_pred cccccCCcEEEEEEEEcCCccEEEEEEecc
Confidence 3569999999999999995 455554554
No 30
>PRK13447 F0F1 ATP synthase subunit epsilon; Provisional
Probab=55.13 E-value=84 Score=23.28 Aligned_cols=66 Identities=14% Similarity=0.032 Sum_probs=32.8
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEEEE-----e-ceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSIHR-----R-IQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v~R-----~-~r~~sL~vd~~~~ 77 (126)
+||.+....++.+.-.+|.+-+.-+ -.|+-..+.|....+ .|+.|+.+.+.- + ...+++++|..++
T Consensus 11 ~~~~g~~v~~V~~~t~~G~~GILp~--HaPlit~L~~G~l~i~~~~g~~~~~aVsGGfleV~~~n~V~Ilad~ae~ 84 (136)
T PRK13447 11 VVVDELDIVSLRAEDASGGFGILPG--HADFLTVLRASVVRWRRADGATHYCAVRGGVLRVTGGARVEIACREAVL 84 (136)
T ss_pred eEECCCcEEEEEecCCcCceEEcCC--CcceEeEecceEEEEEECCCcEEEEEEeCcEEEEecCCEEEEEeceeEc
Confidence 6776542333333335676555444 333323333332333 588999876543 3 4455555554443
No 31
>PF15016 DUF4520: Domain of unknown function (DUF4520)
Probab=50.09 E-value=84 Score=21.79 Aligned_cols=50 Identities=18% Similarity=0.089 Sum_probs=34.7
Q ss_pred EEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEeceeEEEEeccc
Q psy6121 22 VAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEE 75 (126)
Q Consensus 22 LeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~ 75 (126)
+++.||++++.|+=|. .+. +.+.-....+..|.++. .+++....+.++..
T Consensus 19 ~AysDgrVr~~F~Drt-~L~--l~~~~~~~~~~~~c~l~-~pDG~~~~i~i~~p 68 (85)
T PF15016_consen 19 TAYSDGRVRVHFDDRT-ILT--LIWNFSSREELGWCKLT-FPDGQQVLIQIEHP 68 (85)
T ss_pred EEEcCCeEEEEEcCCC-EEE--EEeCCCCcccCCEEEEE-ccCCCEEEEEccCC
Confidence 4678999999998665 332 22333455688899876 67888888877654
No 32
>KOG4003|consensus
Probab=49.84 E-value=8.3 Score=31.04 Aligned_cols=15 Identities=20% Similarity=0.654 Sum_probs=13.0
Q ss_pred ccCCCCceEEEEEEe
Q psy6121 50 RFDDNQWHHLSIHRR 64 (126)
Q Consensus 50 ~lnDgqWH~V~v~R~ 64 (126)
..+|..||+|.++.+
T Consensus 38 q~~d~dw~~Vv~TKD 52 (223)
T KOG4003|consen 38 QDADRDWHRVVVTKD 52 (223)
T ss_pred HhcccccceEEEecc
Confidence 589999999998874
No 33
>COG3615 TehB Uncharacterized protein/domain, possibly involved in tellurite resistance [Inorganic ion transport and metabolism]
Probab=45.24 E-value=81 Score=22.70 Aligned_cols=56 Identities=13% Similarity=0.122 Sum_probs=37.0
Q ss_pred CCCeEEEEEeCCEEEEEEEcCCceE---EEEEcCC-CcccCCCCceEEEEEEeceeEEEE
Q psy6121 16 GSDYLNVAIKDGGVAINLKLAMGKL---DMYIRPN-RLRFDDNQWHHLSIHRRIQEGTGC 71 (126)
Q Consensus 16 ~~DFIaLeL~~G~L~l~~nlGsG~~---~~~i~~s-~~~lnDgqWH~V~v~R~~r~~sL~ 71 (126)
.+-|-.|.+-.|.+.+.-=.++|.. .....-. ...+....||+|...-+.-+.+|.
T Consensus 34 ~G~w~kLsVl~G~vk~~~~~ee~~~~~e~~~~~ea~~~~~~PQ~WHrVea~tDD~e~~l~ 93 (99)
T COG3615 34 PGTWGKLSVLKGAVKFLGLAEEGETEPEHVFSIEAQFPVFPPQAWHRVEAMTDDAEFNLS 93 (99)
T ss_pred CCceeEEEEEeceeEEEEEcCCCCccceEEEeecCCCCccChhHeeeeeecccccEEEEE
Confidence 5568999999999888776665431 1111111 125889999999977666666553
No 34
>PF02823 ATP-synt_DE_N: ATP synthase, Delta/Epsilon chain, beta-sandwich domain; InterPro: IPR020546 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This family represents subunits called delta (in mitochondrial ATPase) or epsilon (in bacteria or chloroplast ATPase). The interaction site of subunit C of the F0 complex with the delta or epsilon subunit of the F1 complex may be important for connecting the rotor of F1 (gamma subunit) to the rotor of F0 (C subunit) []. In bacterial species, the delta subunit is the equivalent of the Oligomycin sensitive subunit (OSCP, IPR000711 from INTERPRO) in metazoans. The C-terminal domain of the epsilon subunit appears to act as an inhibitor of ATPase activity []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046933 hydrogen ion transporting ATP synthase activity, rotational mechanism, 0046961 proton-transporting ATPase activity, rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0045261 proton-transporting ATP synthase complex, catalytic core F(1); PDB: 2WSS_H 1E79_H 2JDI_H 2W6I_H 2W6H_H 2W6J_H 2CK3_H 2V7Q_H 1H8E_H 2XND_H ....
Probab=42.95 E-value=72 Score=20.82 Aligned_cols=61 Identities=13% Similarity=0.113 Sum_probs=34.1
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCC--cccCCCCceEEEEE-----EeceeEEEEec
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNR--LRFDDNQWHHLSIH-----RRIQEGTGCGK 73 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~--~~lnDgqWH~V~v~-----R~~r~~sL~vd 73 (126)
.+|.++ .+++.+.-.+|.+-+.-+.-. +-..+.+.. ...+|+.|+.+.+. .....+++++|
T Consensus 11 ~~~~~~-v~~v~~~t~~G~~gIl~~H~p--~i~~l~~G~~~i~~~~~~~~~~~v~~G~~~v~~n~v~Il~~ 78 (80)
T PF02823_consen 11 IFFEGE-VESVVLPTTDGEFGILPNHAP--FITALKPGELRIKDADGEEKKFAVSGGFLEVKDNEVTILAD 78 (80)
T ss_dssp EEEEEE-ESEEEEEBSSSEEEEETTS-S--EEEEEESEEEEEEESSSEEEEEEESSEEEEEETSEEEEEES
T ss_pred eEEeeE-EEEEEEECCCcChhhccCCch--hheeccceEEEEEEcCCCEEEEEEcCEEEEEECCEEEEEEc
Confidence 555544 788888888887666554333 222222221 13478999887743 34444555544
No 35
>PF07622 DUF1583: Protein of unknown function (DUF1583); InterPro: IPR011475 Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM.
Probab=40.81 E-value=38 Score=29.83 Aligned_cols=34 Identities=18% Similarity=0.208 Sum_probs=29.4
Q ss_pred CcccCCCCceEEEEEEeceeEEEEecccccchhc
Q psy6121 48 RLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR 81 (126)
Q Consensus 48 ~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~~~~~ 81 (126)
..++++++|-+|.+.+.+..+.|..+++.-.++.
T Consensus 84 ~~~l~~~~wN~v~l~~~g~~v~l~LN~~~i~~~~ 117 (399)
T PF07622_consen 84 TLPLKVNAWNRVRLQRRGDKVQLHLNGQLIYERP 117 (399)
T ss_pred CCCCCccccceEEEEEeCCEEEEEeCCceeEecc
Confidence 4579999999999999999999999988765554
No 36
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=40.28 E-value=1.4e+02 Score=22.82 Aligned_cols=22 Identities=14% Similarity=-0.055 Sum_probs=18.5
Q ss_pred CCceEEEEEEe--ceeEEEEeccc
Q psy6121 54 NQWHHLSIHRR--IQEGTGCGKEE 75 (126)
Q Consensus 54 gqWH~V~v~R~--~r~~sL~vd~~ 75 (126)
|.||+|.|..+ .+.+++.++..
T Consensus 154 g~~~~v~I~Y~~~~~~L~v~l~~~ 177 (223)
T cd01951 154 GNEHTVRITYDPTTNTLTVYLDNG 177 (223)
T ss_pred CCEEEEEEEEeCCCCEEEEEECCC
Confidence 99999999999 47777777765
No 37
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=37.39 E-value=1.5e+02 Score=20.93 Aligned_cols=45 Identities=13% Similarity=0.145 Sum_probs=27.5
Q ss_pred CCCeEEEEEeCCEEEEEEEcCC--ceEEEEEcCCCcccCCCCceEEEEEE
Q psy6121 16 GSDYLNVAIKDGGVAINLKLAM--GKLDMYIRPNRLRFDDNQWHHLSIHR 63 (126)
Q Consensus 16 ~~DFIaLeL~~G~L~l~~nlGs--G~~~~~i~~s~~~lnDgqWH~V~v~R 63 (126)
+.|=++|.| ||+| .+++|+ ++...++......+..|+||.+.+-.
T Consensus 26 GDDDvWVFI-n~kL--v~DlGG~H~~~~~sV~l~~lgl~~g~~Y~~d~F~ 72 (90)
T TIGR02148 26 GDDDVWVFI-NNKL--VVDIGGQHPAVPGAVDLDTLGLKEGKTYPFDIFY 72 (90)
T ss_pred cCCeEEEEE-CCEE--EEEccCcCCCcccEEEhhhcCCccCcEeeEEEEE
Confidence 567788888 5555 567774 23222222223468899999988643
No 38
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=35.86 E-value=40 Score=19.39 Aligned_cols=14 Identities=29% Similarity=0.636 Sum_probs=11.2
Q ss_pred CCCCceEEEEEEec
Q psy6121 52 DDNQWHHLSIHRRI 65 (126)
Q Consensus 52 nDgqWH~V~v~R~~ 65 (126)
+|++|++-.|..-.
T Consensus 10 ~d~~wyra~V~~~~ 23 (48)
T cd04508 10 DDGKWYRAKITSIL 23 (48)
T ss_pred CCCeEEEEEEEEEC
Confidence 47999998887754
No 39
>COG0266 Nei Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
Probab=35.29 E-value=1.9e+02 Score=24.15 Aligned_cols=51 Identities=16% Similarity=0.180 Sum_probs=33.4
Q ss_pred ecCCCCCeEEEEEeCCEEEEEEEcC-CceEEEEEcCCCcccCCCCceEEEEEEece
Q psy6121 12 DGSDGSDYLNVAIKDGGVAINLKLA-MGKLDMYIRPNRLRFDDNQWHHLSIHRRIQ 66 (126)
Q Consensus 12 yaG~~~DFIaLeL~~G~L~l~~nlG-sG~~~~~i~~s~~~lnDgqWH~V~v~R~~r 66 (126)
..+..+.||.+.+ .|.+.+...+| +|.+.+.. ...+ ++.++|-+-...++.
T Consensus 52 ~v~rRgKyLl~~l-~g~~~Li~HLgM~G~~~~~~--~~~~-~~kH~hv~~~~~~g~ 103 (273)
T COG0266 52 SVERRGKYLLIEL-SGDLTLISHLGMSGSLRLEA--RGEP-DEKHDHVDFVLSDGK 103 (273)
T ss_pred eEecccceEEEEc-CCCcEEEEecCcceEEEEec--CCCC-CCcceEEEEEEcCCc
Confidence 3355789999999 88777777887 56665432 2233 666677666555553
No 40
>PF00868 Transglut_N: Transglutaminase family; InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=34.68 E-value=1.7e+02 Score=20.82 Aligned_cols=51 Identities=8% Similarity=0.090 Sum_probs=31.6
Q ss_pred EEEEEEEcCCceE-----EEEEcCCCcccCCCCceEEEEEEeceeEEEEecccccch
Q psy6121 28 GVAINLKLAMGKL-----DMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKK 79 (126)
Q Consensus 28 ~L~l~~nlGsG~~-----~~~i~~s~~~lnDgqWH~V~v~R~~r~~sL~vd~~~~~~ 79 (126)
.+.|.+.+|..|. .+.+ |.....+++.|.-.....++..++|.|..-..+.
T Consensus 51 ~l~l~~~~G~~P~~~~gT~~~~-~~~~~~~~~~W~a~v~~~~~~~~tv~V~spa~A~ 106 (118)
T PF00868_consen 51 QLSLEFETGPNPSESKGTKVVF-PVSSSLDSSSWSARVESQDGNSVTVSVTSPANAP 106 (118)
T ss_dssp EEEEEEEESSS--TTTTSEEEE-EECSSS-TSSSEEEEEEEETTEEEEEEE--TTS-
T ss_pred EEEEEEEEecccccCCCcEEEE-EEccCCCCCCEEEEEEecCCCEEEEEEECCCCCc
Confidence 3666666664221 1111 2245789999999999999999999987766554
No 41
>PRK00539 atpC F0F1 ATP synthase subunit epsilon; Validated
Probab=34.66 E-value=1.8e+02 Score=21.24 Aligned_cols=66 Identities=12% Similarity=-0.014 Sum_probs=40.2
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEE-----EEeceeEEEEecccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSI-----HRRIQEGTGCGKEEEKK 78 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v-----~R~~r~~sL~vd~~~~~ 78 (126)
++|.+. .+++.+.-.+|.+-+.-+ -.|+-..+.+....+ .++.|+.+.+ ..+...+++++|..+..
T Consensus 14 ~~~~~~-v~~V~l~t~~G~~gIL~~--Haplit~L~~G~~~i~~~~~~~~~~~v~gGf~ev~~n~v~Ilad~ae~~ 86 (133)
T PRK00539 14 IKLDEK-IISAQVKTTEGYAGLNRN--RAPLIAAIQSHVCKITFADKTKRSAIIGAGLLLIKKTEAKIFTENFVFA 86 (133)
T ss_pred eEEece-EEEEEEecCccCceecCC--CcceEeEecceEEEEEECCCcEEEEEEeeeEEEEECCEEEEEECeEEch
Confidence 777654 788888888887666544 334333333332233 4788887654 34566688887776653
No 42
>PRK13451 atpC F0F1 ATP synthase subunit epsilon; Provisional
Probab=33.91 E-value=1.6e+02 Score=20.41 Aligned_cols=49 Identities=8% Similarity=0.129 Sum_probs=28.0
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc-CCCCceEEEE
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF-DDNQWHHLSI 61 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l-nDgqWH~V~v 61 (126)
.||.+ ..+++.+.-.+|.+-+.-+. .|+-..+.|....+ .++.|+.+.+
T Consensus 11 ~~~~~-~v~~V~~pt~~G~~GILp~H--~p~it~L~~G~l~i~~~~~~~~~~v 60 (101)
T PRK13451 11 IVYDR-ESDFVSFRTVEGEMGILPRR--APIIAKLSVCDVKIKSGDDEYEYKV 60 (101)
T ss_pred eEEec-eeEEEEEEcCccCeEEcCCC--cceEEEecceEEEEEECCcEEEEEE
Confidence 67766 47888888777776554443 33333333332333 3677887653
No 43
>PRK13450 atpC F0F1 ATP synthase subunit epsilon; Provisional
Probab=27.40 E-value=2.4e+02 Score=20.39 Aligned_cols=65 Identities=14% Similarity=-0.013 Sum_probs=36.8
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEEE-----EeceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSIH-----RRIQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v~-----R~~r~~sL~vd~~~~ 77 (126)
+||.+ ..+++.+.-.+|.+-+.=+ -.|+-..+.|....+ .|+.|+.+.+. -+...+++..|..++
T Consensus 14 ~~~~~-~v~~V~~p~~~G~~GILp~--H~p~it~L~~G~l~i~~~~~~~~~~~v~gGf~~v~~~~v~Il~~~a~~ 85 (132)
T PRK13450 14 NFYIG-EVKEVITEGLDGDIAILPN--HVPLITYLKPTITKIIDENGEKKKIFTSSGVLKVENNEVYILCDASEW 85 (132)
T ss_pred eEEec-eEEEEEEECCccCceecCC--CcccEeEEccEEEEEEECCCcEEEEEEcCeEEEEECCEEEEEehhhcc
Confidence 66754 5678888777777554433 334333333332233 48889887643 344456666666543
No 44
>KOG0286|consensus
Probab=26.72 E-value=1.5e+02 Score=25.59 Aligned_cols=55 Identities=13% Similarity=0.132 Sum_probs=34.8
Q ss_pred EEeeeceeeecCCCCCe---EEEEEeCCEEEEEEEcCCceEEEEEcCCCcccCCCCceE
Q psy6121 3 TLSTDCWLGDGSDGSDY---LNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHH 58 (126)
Q Consensus 3 S~sfrtlLLyaG~~~DF---IaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~ 58 (126)
.|+..|.||++| ..|| ++=.|+.-++-......+-..-+.+.|....++.|.|-.
T Consensus 280 ~FS~SGRlLfag-y~d~~c~vWDtlk~e~vg~L~GHeNRvScl~~s~DG~av~TgSWDs 337 (343)
T KOG0286|consen 280 AFSKSGRLLFAG-YDDFTCNVWDTLKGERVGVLAGHENRVSCLGVSPDGMAVATGSWDS 337 (343)
T ss_pred EEcccccEEEee-ecCCceeEeeccccceEEEeeccCCeeEEEEECCCCcEEEecchhH
Confidence 455567788886 3343 444677777777777776332233444456799999964
No 45
>PF11859 DUF3379: Protein of unknown function (DUF3379); InterPro: IPR021806 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 234 to 251 amino acids in length.
Probab=26.70 E-value=1.8e+02 Score=23.80 Aligned_cols=49 Identities=16% Similarity=0.271 Sum_probs=37.4
Q ss_pred EEEEEEcCCceEEEEEcCCCc------ccCCCCceEEEEEEeceeEEEEeccccc
Q psy6121 29 VAINLKLAMGKLDMYIRPNRL------RFDDNQWHHLSIHRRIQEGTGCGKEEEK 77 (126)
Q Consensus 29 L~l~~nlGsG~~~~~i~~s~~------~lnDgqWH~V~v~R~~r~~sL~vd~~~~ 77 (126)
|++.+....|++++.+.|... .++|++.+-+.+......+-|..+..+.
T Consensus 166 LHlV~qge~GkVTlFivP~~~~~~~~~~F~d~~~~G~~~~~~~a~lilVge~~~d 220 (232)
T PF11859_consen 166 LHLVFQGEQGKVTLFIVPIESRMVLQEDFADDGYQGIVFPMGNASLILVGEKGED 220 (232)
T ss_pred eEEEEecCCCcEEEEEeeccccchhhhhhcccCceEEEEEcCCeEEEEEecCccc
Confidence 788888777888777766522 5889999999988888777777665544
No 46
>PF11995 DUF3490: Domain of unknown function (DUF3490); InterPro: IPR021881 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 160 amino acids in length. This domain is found associated with PF00225 from PFAM. This domain is found associated with PF00225 from PFAM. This domain has two conserved sequence motifs: EVE and ESA.
Probab=26.50 E-value=36 Score=26.47 Aligned_cols=30 Identities=13% Similarity=0.179 Sum_probs=22.1
Q ss_pred eEEeeece--eeecCCCCCeEEEEEeCCEEEE
Q psy6121 2 LTLSTDCW--LGDGSDGSDYLNVAIKDGGVAI 31 (126)
Q Consensus 2 IS~sfrtl--LLyaG~~~DFIaLeL~~G~L~l 31 (126)
+++..+|. |++-|++.|+|.++..--++.+
T Consensus 18 VsLvHRTyFfLLFkGdpaD~iYmEVElRRLs~ 49 (161)
T PF11995_consen 18 VSLVHRTYFFLLFKGDPADSIYMEVELRRLSF 49 (161)
T ss_pred cchhhhhhhhheecCCcccceEEEeehHHHHH
Confidence 45666774 7778999999999886655544
No 47
>PRK13444 atpC F0F1 ATP synthase subunit epsilon; Provisional
Probab=26.44 E-value=2.5e+02 Score=20.20 Aligned_cols=65 Identities=12% Similarity=0.006 Sum_probs=36.3
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc-CCCCceEEEEEE-----eceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF-DDNQWHHLSIHR-----RIQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l-nDgqWH~V~v~R-----~~r~~sL~vd~~~~ 77 (126)
+||.+ ..+++.+.-.+|.+-+.-+ -.|+-..+.|....+ .|+.|+.+.+.- ....+++++|..++
T Consensus 16 ~~~~~-~v~~V~~p~~~G~~gILp~--H~p~it~L~~G~l~i~~~~~~~~~~v~gG~~~v~~~~v~Il~~~a~~ 86 (127)
T PRK13444 16 ILYKG-EVDSLIVPGSEGFFGILPN--HAPLVATLGIGLLEIRKGEKLKRISVEGGFCEVKDNQISILTDHGAL 86 (127)
T ss_pred eEEec-eEEEEEEECCccCeEecCC--CcCeEeEeccEEEEEEECCeEEEEEEeceEEEEECCEEEEEEeEEEe
Confidence 66654 5678888888887655544 334333333333333 388888765432 34445555554443
No 48
>PRK09391 fixK transcriptional regulator FixK; Provisional
Probab=25.18 E-value=1.8e+02 Score=22.30 Aligned_cols=31 Identities=19% Similarity=0.266 Sum_probs=23.3
Q ss_pred ceeeecCCCCCeEEEEEeCCEEEEEEEcCCce
Q psy6121 8 CWLGDGSDGSDYLNVAIKDGGVAINLKLAMGK 39 (126)
Q Consensus 8 tlLLyaG~~~DFIaLeL~~G~L~l~~nlGsG~ 39 (126)
..||+.|+..+++.+ |..|.+.+.....+|.
T Consensus 47 e~l~~~Gd~~~~ly~-I~~G~vkl~~~~~~G~ 77 (230)
T PRK09391 47 EEIYGEGEPADYVYQ-VESGAVRTYRLLSDGR 77 (230)
T ss_pred CEEECCCCCCCeEEE-EEeCEEEEEEECCCCc
Confidence 348888888888777 6789988877665554
No 49
>cd00413 Glyco_hydrolase_16 glycosyl hydrolase family 16. The O-Glycosyl hydrolases are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A glycosyl hydrolase classification system based on sequence similarity has led to the definition of more than 95 different families inlcuding glycosyl hydrolase family 16. Family 16 includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=23.51 E-value=1.2e+02 Score=22.50 Aligned_cols=28 Identities=11% Similarity=-0.058 Sum_probs=24.5
Q ss_pred CCCCceEEEEEEeceeEEEEecccccch
Q psy6121 52 DDNQWHHLSIHRRIQEGTGCGKEEEKKK 79 (126)
Q Consensus 52 nDgqWH~V~v~R~~r~~sL~vd~~~~~~ 79 (126)
..+.||+..+......++.-||+.+-..
T Consensus 140 ~~~~~H~Y~~~W~~~~i~~yvDG~~~~~ 167 (210)
T cd00413 140 PADDFHTYRVDWTPGEITFYVDGVLVAT 167 (210)
T ss_pred CccCeEEEEEEEeCCEEEEEECCEEEEE
Confidence 5789999999999999999999876543
No 50
>COG4499 Predicted membrane protein [Function unknown]
Probab=23.05 E-value=73 Score=28.29 Aligned_cols=39 Identities=36% Similarity=0.567 Sum_probs=19.9
Q ss_pred hhhhhhhhHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCC
Q psy6121 87 EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125 (126)
Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 125 (126)
++...|+++++...+..+++.|..+.+-|+.|.+.-|+|
T Consensus 396 ~~aeaka~eek~~~ee~~~~qke~~dedkek~qkde~k~ 434 (434)
T COG4499 396 EEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434 (434)
T ss_pred hhhHhhhhhhhhhhhhhhhhHHhhhhhhhhhhhhhhccC
Confidence 334445555554444444445555556666666555543
No 51
>PRK13442 atpC F0F1 ATP synthase subunit epsilon; Provisional
Probab=22.03 E-value=2.6e+02 Score=18.85 Aligned_cols=65 Identities=8% Similarity=-0.039 Sum_probs=37.2
Q ss_pred eeecCCCCCeEEEEEeCCEEEEEEEcCCceEEEEEcCCCccc--CCCCceEEEEE-----EeceeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRF--DDNQWHHLSIH-----RRIQEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s~~~l--nDgqWH~V~v~-----R~~r~~sL~vd~~~~ 77 (126)
++|.+ ..+++.+.-.+|.+-+.=+ -.|+-..+.|....+ .|+.|+.+.+. -....+++++|..++
T Consensus 16 ~~~~~-~v~~V~~p~~~G~~gILp~--H~p~it~L~~G~l~i~~~~~~~~~~~v~gG~~~v~~n~v~Il~~~ae~ 87 (89)
T PRK13442 16 PVWSG-EATMVVARTTEGDIGILPG--HEPLLGVLESGTVTVVTPGGERISAAVDGGFISFDSNKLTVLAERAEL 87 (89)
T ss_pred cEEec-eeEEEEEECCcCCcEecCC--CcCeEEEEcceEEEEEECCCcEEEEEEccEEEEEECCEEEEEEeEEEE
Confidence 66654 4678888777777555443 233323333332333 47888887643 345567777766543
No 52
>cd08023 GH16_laminarinase_like Laminarinase, member of the glycosyl hydrolase family 16. Laminarinase, also known as glucan endo-1,3-beta-D-glucosidase, is a glycosyl hydrolase family 16 member that hydrolyzes 1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans such as laminarins, curdlans, paramylons, and pachymans, with very limited action on mixed-link (1,3-1,4-)-beta-D-glucans.
Probab=21.32 E-value=1.5e+02 Score=22.80 Aligned_cols=28 Identities=7% Similarity=-0.135 Sum_probs=24.6
Q ss_pred cCCCCceEEEEEEeceeEEEEecccccc
Q psy6121 51 FDDNQWHHLSIHRRIQEGTGCGKEEEKK 78 (126)
Q Consensus 51 lnDgqWH~V~v~R~~r~~sL~vd~~~~~ 78 (126)
-..+.||+..+......++.-||+.+=.
T Consensus 155 ~~~~~fHtY~~~W~p~~i~~yvDG~~v~ 182 (235)
T cd08023 155 DLSDDFHTYAVEWTPDKITFYVDGKLYF 182 (235)
T ss_pred CcCCCcEEEEEEEECCEEEEEECCEEEE
Confidence 3678999999999999999999987643
No 53
>PF09264 Sial-lect-inser: Vibrio cholerae sialidase, lectin insertion; InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=21.05 E-value=2.9e+02 Score=22.17 Aligned_cols=64 Identities=11% Similarity=0.049 Sum_probs=38.8
Q ss_pred eeecCCCCCeEEE-EEe-CCEEEEEEEcCCceEEEEEcCCCcccCCCCceEEEEEEec--eeEEEEeccccc
Q psy6121 10 LGDGSDGSDYLNV-AIK-DGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRI--QEGTGCGKEEEK 77 (126)
Q Consensus 10 LLyaG~~~DFIaL-eL~-~G~L~l~~nlGsG~~~~~i~~s~~~lnDgqWH~V~v~R~~--r~~sL~vd~~~~ 77 (126)
++|+....-|+.+ .|. +|.|.+.++-++++..+ +. ...+=.-.|...+.-.. ...++.||+..-
T Consensus 50 ~yyAnG~~r~l~~lsvn~sG~LvA~L~g~ss~~~~---~~-~~~di~gyH~Y~i~~~p~~~tASfy~DG~lI 117 (198)
T PF09264_consen 50 NYYANGSKRYLPILSVNESGSLVAELEGQSSNTLL---AT-TGADIHGYHKYEIVFSPLTNTASFYFDGTLI 117 (198)
T ss_dssp EEEEESSEEEEEEEEE-TTS-EEEEETTS-S-EEE---E--CHHHHCSEEEEEEEEETTTTEEEEEETTEEE
T ss_pred EEEcCCceEEEEEEEEcCCCCEEEEEecCCCcEEE---ec-ccccccceeEEEEEecCCCCceEEEECCEEE
Confidence 5665445567655 555 46688888766666433 21 11222568999998844 899999999753
No 54
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=21.05 E-value=94 Score=24.70 Aligned_cols=22 Identities=5% Similarity=0.057 Sum_probs=16.0
Q ss_pred cccCCCCceEEEEEEeceeEEE
Q psy6121 49 LRFDDNQWHHLSIHRRIQEGTG 70 (126)
Q Consensus 49 ~~lnDgqWH~V~v~R~~r~~sL 70 (126)
..-.||.||+|.|.-+.+.+++
T Consensus 266 ~~~~dg~~~~i~V~~~~~~~~~ 287 (296)
T TIGR03436 266 NPAHDGKFRKIKVRVKRKGLKV 287 (296)
T ss_pred CCCCCCceeEEEEEEeCCCceE
Confidence 3456999999998877655433
No 55
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=20.40 E-value=1.4e+02 Score=17.68 Aligned_cols=15 Identities=20% Similarity=0.410 Sum_probs=12.2
Q ss_pred CCCCceEEEEEEece
Q psy6121 52 DDNQWHHLSIHRRIQ 66 (126)
Q Consensus 52 nDgqWH~V~v~R~~r 66 (126)
+|+.|++..|..-..
T Consensus 14 ~d~~wyra~I~~~~~ 28 (57)
T smart00333 14 EDGEWYRARIIKVDG 28 (57)
T ss_pred CCCCEEEEEEEEECC
Confidence 699999998877554
No 56
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=20.37 E-value=1.5e+02 Score=15.35 Aligned_cols=24 Identities=17% Similarity=0.116 Sum_probs=12.5
Q ss_pred ceeeecCCCCCeEEEEEeCCEEEE
Q psy6121 8 CWLGDGSDGSDYLNVAIKDGGVAI 31 (126)
Q Consensus 8 tlLLyaG~~~DFIaLeL~~G~L~l 31 (126)
+.+|.....+-+.++...+|.+.-
T Consensus 7 ~~v~~~~~~g~l~a~d~~~G~~~W 30 (33)
T smart00564 7 GTVYVGSTDGTLYALDAKTGEILW 30 (33)
T ss_pred CEEEEEcCCCEEEEEEcccCcEEE
Confidence 334444445556666666665543
No 57
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=20.21 E-value=2.7e+02 Score=20.03 Aligned_cols=43 Identities=12% Similarity=0.190 Sum_probs=24.1
Q ss_pred CeEEEEEeCCEEEEEEEcCCceEEEEEcCC--CcccCCCCceEEE
Q psy6121 18 DYLNVAIKDGGVAINLKLAMGKLDMYIRPN--RLRFDDNQWHHLS 60 (126)
Q Consensus 18 DFIaLeL~~G~L~l~~nlGsG~~~~~i~~s--~~~lnDgqWH~V~ 60 (126)
..-.+....|.+.+.++.|.+.-...+..+ ...+..+-||.+.
T Consensus 54 ~~~~~~~l~Gs~~v~~~d~~~~~~~~L~~~~~~L~Ippg~w~~~~ 98 (131)
T PF05523_consen 54 TTQWFIVLSGSFKVVLDDGREEEEFILDEPNKGLYIPPGVWHGIK 98 (131)
T ss_dssp --EEEEEEES-EEEEEE-SS-EEEEEE--TTEEEEE-TT-EEEEE
T ss_pred ccEEEEEEeCEEEEEEecCCCcEEEEECCCCeEEEECCchhhHhh
Confidence 345566668888888888877654444333 2358999999986
Done!