Query psy7854
Match_columns 153
No_of_seqs 171 out of 1527
Neff 9.6
Searched_HMMs 46136
Date Fri Aug 16 20:31:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7854.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7854hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.3 5.1E-12 1.1E-16 102.6 6.8 122 25-151 723-859 (1289)
2 KOG1219|consensus 99.2 6.6E-11 1.4E-15 103.7 6.8 108 31-149 3858-3972(4289)
3 PF07645 EGF_CA: Calcium-bindi 98.8 2.7E-09 5.9E-14 56.8 2.8 33 117-149 1-34 (42)
4 KOG1214|consensus 98.8 2.4E-08 5.2E-13 81.8 7.8 93 56-151 717-820 (1289)
5 PF07645 EGF_CA: Calcium-bindi 98.8 7E-09 1.5E-13 55.2 2.7 38 75-112 1-39 (42)
6 KOG4289|consensus 98.7 2.2E-08 4.7E-13 85.8 5.6 77 63-147 1225-1308(2531)
7 KOG1219|consensus 98.6 8.2E-08 1.8E-12 85.3 5.9 73 72-152 3859-3936(4289)
8 PF12662 cEGF: Complement Clr- 98.5 8.6E-08 1.9E-12 44.3 2.3 24 97-120 1-24 (24)
9 KOG4260|consensus 98.4 1.8E-07 3.8E-12 67.9 2.0 72 70-149 230-304 (350)
10 KOG4289|consensus 98.1 3.4E-06 7.3E-11 73.0 4.2 53 92-150 1216-1270(2531)
11 PF14670 FXa_inhibition: Coagu 98.0 3.3E-06 7.1E-11 43.1 2.0 31 84-115 6-36 (36)
12 smart00179 EGF_CA Calcium-bind 97.9 1.6E-05 3.5E-10 40.9 3.7 32 117-149 1-33 (39)
13 KOG4260|consensus 97.9 5.2E-06 1.1E-10 60.5 2.3 52 96-149 215-268 (350)
14 PF00008 EGF: EGF-like domain 97.8 1E-05 2.2E-10 40.2 1.5 29 124-152 2-32 (32)
15 PF12947 EGF_3: EGF domain; I 97.8 1.1E-05 2.4E-10 41.2 1.6 32 82-115 4-36 (36)
16 PF12947 EGF_3: EGF domain; I 97.7 1.9E-05 4E-10 40.3 1.0 30 122-151 2-32 (36)
17 KOG1217|consensus 97.6 0.00031 6.7E-09 55.3 7.7 61 89-151 243-304 (487)
18 smart00179 EGF_CA Calcium-bind 97.6 0.00015 3.2E-09 37.2 3.6 31 76-107 2-33 (39)
19 cd00054 EGF_CA Calcium-binding 97.5 0.00017 3.8E-09 36.5 3.6 33 118-151 2-35 (38)
20 PF14670 FXa_inhibition: Coagu 97.5 6.6E-05 1.4E-09 38.3 1.4 25 125-150 5-29 (36)
21 cd01475 vWA_Matrilin VWA_Matri 97.3 0.00025 5.4E-09 51.1 3.8 40 111-151 180-219 (224)
22 cd01475 vWA_Matrilin VWA_Matri 97.3 0.00031 6.8E-09 50.6 3.8 42 70-112 181-222 (224)
23 KOG1217|consensus 97.1 0.0036 7.7E-08 49.3 8.1 111 32-150 267-388 (487)
24 smart00181 EGF Epidermal growt 97.0 0.0012 2.6E-08 33.0 3.4 25 126-150 6-30 (35)
25 cd00053 EGF Epidermal growth f 97.0 0.0015 3.3E-08 32.4 3.5 27 125-151 5-32 (36)
26 cd00054 EGF_CA Calcium-binding 96.9 0.002 4.3E-08 32.4 3.6 32 76-108 2-34 (38)
27 PF00008 EGF: EGF-like domain 96.8 0.00098 2.1E-08 33.0 1.9 25 84-108 4-30 (32)
28 smart00181 EGF Epidermal growt 96.4 0.0069 1.5E-07 30.1 3.5 25 84-108 6-30 (35)
29 cd00053 EGF Epidermal growth f 96.0 0.015 3.2E-07 28.6 3.5 25 84-108 6-31 (36)
30 PF06247 Plasmod_Pvs28: Plasmo 95.1 0.007 1.5E-07 42.1 0.4 66 84-150 6-80 (197)
31 KOG1225|consensus 92.5 0.44 9.5E-06 38.7 6.1 69 61-149 266-336 (525)
32 PF12946 EGF_MSP1_1: MSP1 EGF 92.3 0.097 2.1E-06 26.6 1.4 25 125-149 4-30 (37)
33 PF12661 hEGF: Human growth fa 90.9 0.12 2.5E-06 20.1 0.7 10 99-108 1-10 (13)
34 PF07974 EGF_2: EGF-like domai 88.5 1.1 2.4E-05 22.0 3.3 24 126-151 6-30 (32)
35 PHA03099 epidermal growth fact 86.7 0.77 1.7E-05 30.1 2.6 35 118-152 42-79 (139)
36 PHA03099 epidermal growth fact 84.6 1.1 2.3E-05 29.5 2.5 32 84-119 51-84 (139)
37 PF06247 Plasmod_Pvs28: Plasmo 83.1 0.47 1E-05 33.2 0.5 88 53-150 68-161 (197)
38 KOG1225|consensus 82.3 3.2 6.8E-05 34.0 4.9 61 64-149 300-362 (525)
39 PHA02887 EGF-like protein; Pro 80.8 1.6 3.4E-05 28.2 2.2 27 126-152 92-120 (126)
40 PHA02887 EGF-like protein; Pro 80.3 1.9 4.1E-05 27.9 2.4 31 84-118 92-124 (126)
41 PF09064 Tme5_EGF_like: Thromb 79.5 1.5 3.2E-05 21.8 1.4 25 125-151 5-29 (34)
42 PF00954 S_locus_glycop: S-loc 78.2 3 6.5E-05 26.4 3.0 33 117-151 76-109 (110)
43 KOG1215|consensus 71.5 7 0.00015 34.0 4.4 71 76-150 325-396 (877)
44 KOG0994|consensus 67.1 12 0.00027 33.7 4.8 59 90-150 831-896 (1758)
45 KOG1226|consensus 64.4 50 0.0011 28.5 7.7 48 99-150 526-576 (783)
46 KOG0994|consensus 58.9 40 0.00086 30.8 6.4 62 90-152 877-946 (1758)
47 KOG3516|consensus 56.7 10 0.00022 34.0 2.6 39 72-116 541-581 (1306)
48 PF01683 EB: EB module; Inter 56.1 23 0.0005 18.9 3.2 29 112-150 19-47 (52)
49 KOG1226|consensus 51.6 53 0.0011 28.3 5.8 21 99-123 567-590 (783)
50 KOG1836|consensus 50.5 25 0.00053 33.3 4.1 58 91-153 749-811 (1705)
51 KOG3516|consensus 46.0 18 0.00039 32.6 2.5 36 113-150 540-577 (1306)
52 KOG0196|consensus 45.0 23 0.0005 30.8 2.9 51 98-148 259-316 (996)
53 smart00051 DSL delta serrate l 41.1 69 0.0015 18.2 3.7 45 97-151 16-61 (63)
54 PF07359 LEAP-2: Liver-express 35.8 26 0.00056 20.8 1.4 25 107-133 46-70 (77)
55 PF12955 DUF3844: Domain of un 29.6 34 0.00073 21.7 1.2 30 119-148 6-41 (103)
56 KOG3512|consensus 28.8 1.6E+02 0.0035 24.2 5.0 15 95-109 369-383 (592)
57 PF01826 TIL: Trypsin Inhibito 23.0 28 0.00061 18.8 0.0 19 100-119 35-53 (55)
58 KOG3509|consensus 21.8 2.7E+02 0.0058 25.1 5.5 60 84-147 412-473 (964)
No 1
>KOG1214|consensus
Probab=99.30 E-value=5.1e-12 Score=102.59 Aligned_cols=122 Identities=25% Similarity=0.363 Sum_probs=89.1
Q ss_pred eecccCCCCcccCcCCCCCCCeeeCCCcee----cccCCCCccCCCCCCCCcCCc------ccCCCCCCCCCC-CC--ce
Q psy7854 25 RLAVALRRSISIDEFTVHSSTLIALSQESQ----DLERLKFGLEPPTSRSEVHGA------NHHTTENPGICT-NG--AC 91 (153)
Q Consensus 25 ~~~~~~~~c~~i~~c~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~C~~------~~~C~~~~~~C~-~~--~C 91 (153)
.+..+.+.|.|++||+.-...|..+..|.+ +.|.|..+|..... +.+|+. ++.|..+.+.|. ++ .|
T Consensus 723 g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd-~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c 801 (1289)
T KOG1214|consen 723 GYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADD-RHTCVLITPPAPANPCEDGSHTCAIAGQARC 801 (1289)
T ss_pred ccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccC-CcceEEecCCCCCCccccCccccCcCCceEE
Confidence 344578999999999887766666666643 33444445543332 345643 455666556673 33 66
Q ss_pred eecC-CCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854 92 ENLM-GTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 92 ~~~~-g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~ 151 (153)
.... ++|.|.|.+||.+ +|..|.++||| .++.|+. ++|.|++|+|.|.|.+||-++
T Consensus 802 ~~hGgs~y~C~CLPGfsG--DG~~c~dvDeC--~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 802 VHHGGSTYSCACLPGFSG--DGHQCTDVDEC--SPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred EecCCceEEEeecCCccC--Ccccccccccc--CccccCCCceEecCCCcceeecccCccCC
Confidence 6554 5799999999988 89999999999 6999985 999999999999999999754
No 2
>KOG1219|consensus
Probab=99.16 E-value=6.6e-11 Score=103.72 Aligned_cols=108 Identities=22% Similarity=0.365 Sum_probs=82.9
Q ss_pred CCCccc-CcCCCCCCCeeeCCCceec-ccCCCCccCCCCCCCCcC-CcccCCCCCCCCCCCC-ceeecCCCceeeCCCCc
Q psy7854 31 RRSISI-DEFTVHSSTLIALSQESQD-LERLKFGLEPPTSRSEVH-GANHHTTENPGICTNG-ACENLMGTYRCICNPGY 106 (153)
Q Consensus 31 ~~c~~i-~~c~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~C-~~~~~C~~~~~~C~~~-~C~~~~g~~~C~C~~g~ 106 (153)
.-|.-. +.|..+ .|++++.+.. ...-..+.|++.|.++.| .+++.|.. +||..| +|....+.|.|.|+.||
T Consensus 3858 pgC~l~~d~C~~n---pCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~s--nPC~~GgtCip~~n~f~CnC~~gy 3932 (4289)
T KOG1219|consen 3858 PGCSLLTDPCNDN---PCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCAS--NPCLTGGTCIPFYNGFLCNCPNGY 3932 (4289)
T ss_pred ccccccccccccC---cccCCCEecCCCCCceEEeCcccccCcccccccccccC--CCCCCCCEEEecCCCeeEeCCCCc
Confidence 334433 455444 4555554332 222334457778889999 57899988 899754 99999999999999999
Q ss_pred ccCCCCCCCc--cCccccCCCCCCCC-CeeeecCCCcEEecCCCCC
Q psy7854 107 EGDPTGKLCS--DQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSP 149 (153)
Q Consensus 107 ~~~~~g~~C~--~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~ 149 (153)
.|.+|+ .|+|| ..+.|.+ +.|+|.+|+|.|.|.+||.
T Consensus 3933 ----TG~~Ce~~Gi~eC--s~n~C~~gg~C~n~~gsf~CncT~g~~ 3972 (4289)
T KOG1219|consen 3933 ----TGKRCEARGISEC--SKNVCGTGGQCINIPGSFHCNCTPGIL 3972 (4289)
T ss_pred ----cCceeeccccccc--ccccccCCceeeccCCceEeccChhHh
Confidence 888995 48999 7899997 9999999999999999985
No 3
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.84 E-value=2.7e-09 Score=56.78 Aligned_cols=33 Identities=33% Similarity=0.601 Sum_probs=30.5
Q ss_pred cCccccCCCCCCC-CCeeeecCCCcEEecCCCCC
Q psy7854 117 DQNECLMDVMICS-GGTCKNTPGSFQSNFQDSSP 149 (153)
Q Consensus 117 ~i~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~ 149 (153)
|||||....+.|. ++.|+|+.|+|.|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 6899998889998 49999999999999999996
No 4
>KOG1214|consensus
Probab=98.78 E-value=2.4e-08 Score=81.80 Aligned_cols=93 Identities=20% Similarity=0.361 Sum_probs=75.4
Q ss_pred ccCCCCccCCCCCCCCcCCcccCCCCCCCCC-CCCceeecCCCceeeCCCCcccCCCCCCCc------cCccccCCCCCC
Q psy7854 56 LERLKFGLEPPTSRSEVHGANHHTTENPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLCS------DQNECLMDVMIC 128 (153)
Q Consensus 56 ~~~~~~~~~~~~~~~~~C~~~~~C~~~~~~C-~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~------~i~~C~~~~~~C 128 (153)
.|.|..++... ++.|.|+++|+...+.| ++..|++.+++|+|.|..||....++.+|. .++.|..+.+.|
T Consensus 717 tcecs~g~~gd---gr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C 793 (1289)
T KOG1214|consen 717 TCECSSGYQGD---GRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTC 793 (1289)
T ss_pred EEEEeeccCCC---CCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCcccc
Confidence 44555555433 67899999999988889 588999999999999999999988888885 468899888999
Q ss_pred C-C--Ceeeec-CCCcEEecCCCCCcc
Q psy7854 129 S-G--GTCKNT-PGSFQSNFQDSSPIL 151 (153)
Q Consensus 129 ~-~--~~C~~~-~g~~~C~C~~g~~~~ 151 (153)
. + ++|+.. .+.|.|.|-|||.++
T Consensus 794 ~i~g~a~c~~hGgs~y~C~CLPGfsGD 820 (1289)
T KOG1214|consen 794 AIAGQARCVHHGGSTYSCACLPGFSGD 820 (1289)
T ss_pred CcCCceEEEecCCceEEEeecCCccCC
Confidence 7 4 567665 456999999999865
No 5
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.75 E-value=7e-09 Score=55.17 Aligned_cols=38 Identities=42% Similarity=0.905 Sum_probs=32.3
Q ss_pred cccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCC
Q psy7854 75 ANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTG 112 (153)
Q Consensus 75 ~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g 112 (153)
|||||....+.|. ++.|+|+.|+|.|.|++||.....+
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~ 39 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDG 39 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTS
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCC
Confidence 6899998888895 6799999999999999999754443
No 6
>KOG4289|consensus
Probab=98.71 E-value=2.2e-08 Score=85.80 Aligned_cols=77 Identities=31% Similarity=0.669 Sum_probs=65.8
Q ss_pred cCCCCCCCCcC-CcccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCCCCCc-c--CccccCCCCCCCC-Ceeeec
Q psy7854 63 LEPPTSRSEVH-GANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTGKLCS-D--QNECLMDVMICSG-GTCKNT 136 (153)
Q Consensus 63 ~~~~~~~~~~C-~~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~-~--i~~C~~~~~~C~~-~~C~~~ 136 (153)
-||++|.+..| ..+|.|-. ++|. ++.|...+|+|+|.|.+|| .|..|+ + .-.| .++.|.| ++|++.
T Consensus 1225 rCPpGFTgd~CeTeiDlCYs--~pC~nng~C~srEggYtCeCrpg~----tGehCEvs~~agrC--vpGvC~nggtC~~~ 1296 (2531)
T KOG4289|consen 1225 RCPPGFTGDYCETEIDLCYS--GPCGNNGRCRSREGGYTCECRPGF----TGEHCEVSARAGRC--VPGVCKNGGTCVNL 1296 (2531)
T ss_pred eCCCCCCcccccchhHhhhc--CCCCCCCceEEecCceeEEecCCc----cccceeeecccCcc--ccceecCCCEEeec
Confidence 47888999999 57999987 8995 7799999999999999999 778886 2 3457 7899997 899985
Q ss_pred -CCCcEEecCCC
Q psy7854 137 -PGSFQSNFQDS 147 (153)
Q Consensus 137 -~g~~~C~C~~g 147 (153)
.|+|.|.|+.|
T Consensus 1297 ~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYG 1308 (2531)
T ss_pred CCCceeccCCCc
Confidence 68899999987
No 7
>KOG1219|consensus
Probab=98.59 E-value=8.2e-08 Score=85.26 Aligned_cols=73 Identities=25% Similarity=0.523 Sum_probs=62.6
Q ss_pred cCCcc-cCCCCCCCCCCCC-ceeecC-CCceeeCCCCcccCCCCCCCc-cCccccCCCCCCCC-CeeeecCCCcEEecCC
Q psy7854 72 VHGAN-HHTTENPGICTNG-ACENLM-GTYRCICNPGYEGDPTGKLCS-DQNECLMDVMICSG-GTCKNTPGSFQSNFQD 146 (153)
Q Consensus 72 ~C~~~-~~C~~~~~~C~~~-~C~~~~-g~~~C~C~~g~~~~~~g~~C~-~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~ 146 (153)
.|... +.|.. .||+|+ .|...+ ++|.|.|++.| .|+.|+ +++.| ..++|.. ++|+...++|.|.|+.
T Consensus 3859 gC~l~~d~C~~--npCqhgG~C~~~~~ggy~CkCpsqy----sG~~CEi~~epC--~snPC~~GgtCip~~n~f~CnC~~ 3930 (4289)
T KOG1219|consen 3859 GCSLLTDPCND--NPCQHGGTCISQPKGGYKCKCPSQY----SGNHCEIDLEPC--ASNPCLTGGTCIPFYNGFLCNCPN 3930 (4289)
T ss_pred ccccccccccc--CcccCCCEecCCCCCceEEeCcccc----cCcccccccccc--cCCCCCCCCEEEecCCCeeEeCCC
Confidence 45433 78877 899876 998775 78999999999 889995 89999 7899986 8999999999999999
Q ss_pred CCCccc
Q psy7854 147 SSPILK 152 (153)
Q Consensus 147 g~~~~~ 152 (153)
||.++|
T Consensus 3931 gyTG~~ 3936 (4289)
T KOG1219|consen 3931 GYTGKR 3936 (4289)
T ss_pred CccCce
Confidence 999875
No 8
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=98.52 E-value=8.6e-08 Score=44.27 Aligned_cols=24 Identities=46% Similarity=1.169 Sum_probs=22.5
Q ss_pred CceeeCCCCcccCCCCCCCccCcc
Q psy7854 97 TYRCICNPGYEGDPTGKLCSDQNE 120 (153)
Q Consensus 97 ~~~C~C~~g~~~~~~g~~C~~i~~ 120 (153)
+|.|.|++||.+.++++.|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 689999999999999999999987
No 9
>KOG4260|consensus
Probab=98.37 E-value=1.8e-07 Score=67.94 Aligned_cols=72 Identities=21% Similarity=0.461 Sum_probs=56.4
Q ss_pred CCcCCcccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCC--CCeeeecCCCcEEecCC
Q psy7854 70 SEVHGANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICS--GGTCKNTPGSFQSNFQD 146 (153)
Q Consensus 70 ~~~C~~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~--~~~C~~~~g~~~C~C~~ 146 (153)
...|.|||||...+.+|. +..|+|+.|+|.|.+++||.. ++|+|..-...|. +..|.++.+.|+|+|..
T Consensus 230 e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~--------g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~ 301 (350)
T KOG4260|consen 230 EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK--------GVDECQFCADVCASKNRPCMNIDGQYRCVCFS 301 (350)
T ss_pred ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC--------ChHHhhhhhhhcccCCCCcccCCccEEEEecc
Confidence 447899999998888894 559999999999999999953 2556643224453 47899999999999988
Q ss_pred CCC
Q psy7854 147 SSP 149 (153)
Q Consensus 147 g~~ 149 (153)
|+.
T Consensus 302 ~~~ 304 (350)
T KOG4260|consen 302 GLI 304 (350)
T ss_pred cce
Confidence 864
No 10
>KOG4289|consensus
Probab=98.09 E-value=3.4e-06 Score=72.99 Aligned_cols=53 Identities=25% Similarity=0.658 Sum_probs=46.5
Q ss_pred eecCCCceeeCCCCcccCCCCCCC-ccCccccCCCCCCCC-CeeeecCCCcEEecCCCCCc
Q psy7854 92 ENLMGTYRCICNPGYEGDPTGKLC-SDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPI 150 (153)
Q Consensus 92 ~~~~g~~~C~C~~g~~~~~~g~~C-~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~ 150 (153)
.+..++++|.|++|| .|..| ++||+| ..++|.+ +.|....|+|.|.|++||.+
T Consensus 1216 i~pvnglrCrCPpGF----Tgd~CeTeiDlC--Ys~pC~nng~C~srEggYtCeCrpg~tG 1270 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGF----TGDYCETEIDLC--YSGPCGNNGRCRSREGGYTCECRPGFTG 1270 (2531)
T ss_pred ccccCceeEeCCCCC----CcccccchhHhh--hcCCCCCCCceEEecCceeEEecCCccc
Confidence 344577999999999 66789 489999 8999995 99999999999999999975
No 11
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.04 E-value=3.3e-06 Score=43.10 Aligned_cols=31 Identities=45% Similarity=1.113 Sum_probs=25.2
Q ss_pred CCCCCCceeecCCCceeeCCCCcccCCCCCCC
Q psy7854 84 GICTNGACENLMGTYRCICNPGYEGDPTGKLC 115 (153)
Q Consensus 84 ~~C~~~~C~~~~g~~~C~C~~g~~~~~~g~~C 115 (153)
+.|.+ .|++.+++|.|.|++||.+.++++.|
T Consensus 6 GgC~h-~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 6 GGCSH-ICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGSSS-EEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCcCC-CCccCCCceEeECCCCCEECcCCCCC
Confidence 56776 99999999999999999998887764
No 12
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.95 E-value=1.6e-05 Score=40.95 Aligned_cols=32 Identities=34% Similarity=0.642 Sum_probs=26.6
Q ss_pred cCccccCCCCCCCC-CeeeecCCCcEEecCCCCC
Q psy7854 117 DQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSP 149 (153)
Q Consensus 117 ~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~ 149 (153)
++++|... .+|.+ +.|+++.|+|.|.|++||.
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence 46788532 78885 7999999999999999997
No 13
>KOG4260|consensus
Probab=97.94 E-value=5.2e-06 Score=60.46 Aligned_cols=52 Identities=29% Similarity=0.579 Sum_probs=45.1
Q ss_pred CCcee-eCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcEEecCCCCC
Q psy7854 96 GTYRC-ICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSP 149 (153)
Q Consensus 96 g~~~C-~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~ 149 (153)
++..| .|+.||.+ +...|.|||||...+.+|.. ..|+|+.|+|.|.+++||.
T Consensus 215 ~~k~C~kCkkGW~l--de~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~ 268 (350)
T KOG4260|consen 215 SSKGCSKCKKGWKL--DEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYK 268 (350)
T ss_pred CCCChhhhccccee--cccccccHHHHhcCCCCCChhheeecCCCceEeccccccc
Confidence 34455 79999988 56789999999999999984 8999999999999999985
No 14
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.82 E-value=1e-05 Score=40.24 Aligned_cols=29 Identities=21% Similarity=0.460 Sum_probs=24.5
Q ss_pred CCCCCCC-CeeeecC-CCcEEecCCCCCccc
Q psy7854 124 DVMICSG-GTCKNTP-GSFQSNFQDSSPILK 152 (153)
Q Consensus 124 ~~~~C~~-~~C~~~~-g~~~C~C~~g~~~~~ 152 (153)
.+++|.| ++|+... ++|.|.|++||.+.+
T Consensus 2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 3568886 9999988 999999999998753
No 15
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.81 E-value=1.1e-05 Score=41.17 Aligned_cols=32 Identities=53% Similarity=1.246 Sum_probs=24.0
Q ss_pred CCCCC-CCCceeecCCCceeeCCCCcccCCCCCCC
Q psy7854 82 NPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLC 115 (153)
Q Consensus 82 ~~~~C-~~~~C~~~~g~~~C~C~~g~~~~~~g~~C 115 (153)
..+.| .++.|.++.++|.|+|++||.+ +|..|
T Consensus 4 ~~~~C~~nA~C~~~~~~~~C~C~~Gy~G--dG~~C 36 (36)
T PF12947_consen 4 NNGGCHPNATCTNTGGSYTCTCKPGYEG--DGFFC 36 (36)
T ss_dssp GGGGS-TTCEEEE-TTSEEEEE-CEEEC--CSTCE
T ss_pred CCCCCCCCcEeecCCCCEEeECCCCCcc--CCcCC
Confidence 34677 5789999999999999999987 55543
No 16
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.66 E-value=1.9e-05 Score=40.35 Aligned_cols=30 Identities=27% Similarity=0.324 Sum_probs=22.1
Q ss_pred cCCCCCCC-CCeeeecCCCcEEecCCCCCcc
Q psy7854 122 LMDVMICS-GGTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 122 ~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~ 151 (153)
....+.|+ ++.|++++++|.|.|++||.+.
T Consensus 2 ~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 2 LENNGGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp TTGGGGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCCCCCcEeecCCCCEEeECCCCCccC
Confidence 33557787 5999999999999999999754
No 17
>KOG1217|consensus
Probab=97.61 E-value=0.00031 Score=55.27 Aligned_cols=61 Identities=36% Similarity=0.821 Sum_probs=49.3
Q ss_pred CceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854 89 GACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 89 ~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~ 151 (153)
+.|.+..++|+|.|++||.+... ..+.++++|..... |.+ ++|++..+.|.|.|++||.+.
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~ 304 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGR 304 (487)
T ss_pred CcccccCCceeeeCCCCcccccc-ceeeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCC
Confidence 57888889999999999966211 35789999975544 876 899999999999999999753
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.56 E-value=0.00015 Score=37.20 Aligned_cols=31 Identities=39% Similarity=0.980 Sum_probs=25.2
Q ss_pred ccCCCCCCCCCC-CCceeecCCCceeeCCCCcc
Q psy7854 76 NHHTTENPGICT-NGACENLMGTYRCICNPGYE 107 (153)
Q Consensus 76 ~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~ 107 (153)
+++|... .+|. ++.|.+..++|.|.|++||.
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 2 IDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred cccCcCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence 5777653 5785 45999999999999999995
No 19
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.52 E-value=0.00017 Score=36.53 Aligned_cols=33 Identities=30% Similarity=0.508 Sum_probs=26.3
Q ss_pred CccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854 118 QNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 118 i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~ 151 (153)
+++|... .+|.+ +.|.+..++|.|.|++||.+.
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred cccCCCC-CCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 5677422 67875 899999999999999999763
No 20
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.45 E-value=6.6e-05 Score=38.26 Aligned_cols=25 Identities=32% Similarity=0.542 Sum_probs=20.5
Q ss_pred CCCCCCCeeeecCCCcEEecCCCCCc
Q psy7854 125 VMICSGGTCKNTPGSFQSNFQDSSPI 150 (153)
Q Consensus 125 ~~~C~~~~C~~~~g~~~C~C~~g~~~ 150 (153)
.+.|. ..|++++++|+|.|++||.+
T Consensus 5 NGgC~-h~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 5 NGGCS-HICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp GGGSS-SEEEEETTSEEEE-STTEEE
T ss_pred CCCcC-CCCccCCCceEeECCCCCEE
Confidence 45566 89999999999999999965
No 21
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.34 E-value=0.00025 Score=51.14 Aligned_cols=40 Identities=28% Similarity=0.639 Sum_probs=34.7
Q ss_pred CCCCCccCccccCCCCCCCCCeeeecCCCcEEecCCCCCcc
Q psy7854 111 TGKLCSDQNECLMDVMICSGGTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 111 ~g~~C~~i~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~~ 151 (153)
.+..|.++++|....+.|. ..|.++.|+|.|.|++||.+.
T Consensus 180 ~~~~C~~~~~C~~~~~~c~-~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQ-QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred ccccCcCchhhcCCCCCcc-ceEEcCCCCEEeECCCCccCC
Confidence 3567999999987778887 799999999999999999764
No 22
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.28 E-value=0.00031 Score=50.60 Aligned_cols=42 Identities=19% Similarity=0.453 Sum_probs=36.0
Q ss_pred CCcCCcccCCCCCCCCCCCCceeecCCCceeeCCCCcccCCCC
Q psy7854 70 SEVHGANHHTTENPGICTNGACENLMGTYRCICNPGYEGDPTG 112 (153)
Q Consensus 70 ~~~C~~~~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~~~~g 112 (153)
+..|.++++|...++.|.+ .|.+..|+|.|.|..||.+.+++
T Consensus 181 ~~~C~~~~~C~~~~~~c~~-~C~~~~g~~~c~c~~g~~~~~~~ 222 (224)
T cd01475 181 GKICVVPDLCATLSHVCQQ-VCISTPGSYLCACTEGYALLEDN 222 (224)
T ss_pred cccCcCchhhcCCCCCccc-eEEcCCCCEEeECCCCccCCCCC
Confidence 6789999999877678876 89999999999999999876554
No 23
>KOG1217|consensus
Probab=97.08 E-value=0.0036 Score=49.28 Aligned_cols=111 Identities=28% Similarity=0.519 Sum_probs=68.4
Q ss_pred CCcccCcCCCCCCCeeeCCCceecccCCCCccCCCCCCCCc---CCcccCCCC--CCCCCCCC-ce--eecCCCceeeCC
Q psy7854 32 RSISIDEFTVHSSTLIALSQESQDLERLKFGLEPPTSRSEV---HGANHHTTE--NPGICTNG-AC--ENLMGTYRCICN 103 (153)
Q Consensus 32 ~c~~i~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---C~~~~~C~~--~~~~C~~~-~C--~~~~g~~~C~C~ 103 (153)
.+.++++|..... +.+.+.|...... ....+++++.+.. |.+.++|.. ...+|.++ .| ....+.+.|.|.
T Consensus 267 ~~~~~~~C~~~~~-c~~~~~C~~~~~~-~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~ 344 (487)
T KOG1217|consen 267 TCVDVDSCALIAS-CPNGGTCVNVPGS-YRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACG 344 (487)
T ss_pred eeeeccccCCCCc-cCCCCeeecCCCc-ceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCC
Confidence 4567777766542 3333444322211 1122444444444 445567753 23557544 67 334457889999
Q ss_pred CCcccCCCCCCCccCc-cccCCCCCCCC-Ceeee-cCCCcEEecCCCCCc
Q psy7854 104 PGYEGDPTGKLCSDQN-ECLMDVMICSG-GTCKN-TPGSFQSNFQDSSPI 150 (153)
Q Consensus 104 ~g~~~~~~g~~C~~i~-~C~~~~~~C~~-~~C~~-~~g~~~C~C~~g~~~ 150 (153)
.+| .+..|+..+ +|. ...+.+ +.|++ ..+.|.|.|+.+|.+
T Consensus 345 ~~~----~g~~C~~~~~~C~--~~~~~~~~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 345 PGF----TGRRCEDSNDECA--SSPCCPGGTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred CCC----CCCccccCCcccc--CCccccCCEeccCCCCCeEecCCCcccc
Confidence 886 788999884 895 343554 89998 799999999999864
No 24
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.03 E-value=0.0012 Score=32.98 Aligned_cols=25 Identities=40% Similarity=0.636 Sum_probs=21.3
Q ss_pred CCCCCCeeeecCCCcEEecCCCCCc
Q psy7854 126 MICSGGTCKNTPGSFQSNFQDSSPI 150 (153)
Q Consensus 126 ~~C~~~~C~~~~g~~~C~C~~g~~~ 150 (153)
++|.+++|+++.++|.|.|++||.+
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCcc
Confidence 5676338999999999999999976
No 25
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.97 E-value=0.0015 Score=32.38 Aligned_cols=27 Identities=37% Similarity=0.529 Sum_probs=22.9
Q ss_pred CCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854 125 VMICSG-GTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 125 ~~~C~~-~~C~~~~g~~~C~C~~g~~~~ 151 (153)
..+|.+ +.|+++.+.|.|.|+.||.+.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 566765 899999999999999999764
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.89 E-value=0.002 Score=32.44 Aligned_cols=32 Identities=41% Similarity=1.027 Sum_probs=24.4
Q ss_pred ccCCCCCCCCCC-CCceeecCCCceeeCCCCccc
Q psy7854 76 NHHTTENPGICT-NGACENLMGTYRCICNPGYEG 108 (153)
Q Consensus 76 ~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~ 108 (153)
+++|... .+|. ++.|.+..+.|.|.|+.||.+
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred cccCCCC-CCcCCCCEeECCCCCeEeECCCCCcC
Confidence 4667542 4685 458999999999999999943
No 27
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.82 E-value=0.00098 Score=32.98 Aligned_cols=25 Identities=52% Similarity=1.250 Sum_probs=21.5
Q ss_pred CCCCC-CceeecC-CCceeeCCCCccc
Q psy7854 84 GICTN-GACENLM-GTYRCICNPGYEG 108 (153)
Q Consensus 84 ~~C~~-~~C~~~~-g~~~C~C~~g~~~ 108 (153)
.+|.+ ++|+... +.|.|.|++||.+
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 68864 5999998 9999999999954
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.41 E-value=0.0069 Score=30.12 Aligned_cols=25 Identities=56% Similarity=1.350 Sum_probs=20.4
Q ss_pred CCCCCCceeecCCCceeeCCCCccc
Q psy7854 84 GICTNGACENLMGTYRCICNPGYEG 108 (153)
Q Consensus 84 ~~C~~~~C~~~~g~~~C~C~~g~~~ 108 (153)
.+|.++.|.+..++|.|.|++||.+
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCcc
Confidence 4675338999999999999999955
No 29
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.01 E-value=0.015 Score=28.59 Aligned_cols=25 Identities=52% Similarity=1.363 Sum_probs=21.1
Q ss_pred CCCC-CCceeecCCCceeeCCCCccc
Q psy7854 84 GICT-NGACENLMGTYRCICNPGYEG 108 (153)
Q Consensus 84 ~~C~-~~~C~~~~g~~~C~C~~g~~~ 108 (153)
.+|. ++.|.+..+.|.|.|+.||.+
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcc
Confidence 6674 468999999999999999955
No 30
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.12 E-value=0.007 Score=42.12 Aligned_cols=66 Identities=21% Similarity=0.521 Sum_probs=45.3
Q ss_pred CCCCCCceeecCCCceeeCCCCcccCCCCCCCccCccccC---CCCCCCC-CeeeecC-----CCcEEecCCCCCc
Q psy7854 84 GICTNGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLM---DVMICSG-GTCKNTP-----GSFQSNFQDSSPI 150 (153)
Q Consensus 84 ~~C~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~---~~~~C~~-~~C~~~~-----g~~~C~C~~g~~~ 150 (153)
..|.++........|.|.|..||.+. +...|+...+|.. ...+|.. +.|++.+ ..|.|.|.+||.+
T Consensus 6 T~CKNG~LiQMSNHfEC~Cnegfvl~-~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~ 80 (197)
T PF06247_consen 6 TICKNGYLIQMSNHFECKCNEGFVLK-NENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL 80 (197)
T ss_dssp ---BTEEEEEESSEEEEEESTTEEEE-ETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE
T ss_pred ccccCCEEEEccCceEEEcCCCcEEc-cccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCcee
Confidence 56777888888899999999999774 4568887777753 2366876 8998865 5699999999964
No 31
>KOG1225|consensus
Probab=92.55 E-value=0.44 Score=38.75 Aligned_cols=69 Identities=35% Similarity=0.709 Sum_probs=33.9
Q ss_pred CccCCCCCCCCcCCcccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCC
Q psy7854 61 FGLEPPTSRSEVHGANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPG 138 (153)
Q Consensus 61 ~~~~~~~~~~~~C~~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g 138 (153)
...++++|.+..|... .|.. .|. ++.+. .+ .|.|.+|| .|..|+... | +..|.+ +.|+ .|
T Consensus 266 ~CIC~~Gf~G~dC~e~-~Cp~---~cs~~g~~~--~g--~CiC~~g~----~G~dCs~~~-c---padC~g~G~Ci--~G 327 (525)
T KOG1225|consen 266 RCICPPGFTGDDCDEL-VCPV---DCSGGGVCV--DG--ECICNPGY----SGKDCSIRR-C---PADCSGHGKCI--DG 327 (525)
T ss_pred eEeCCCCCcCCCCCcc-cCCc---ccCCCceec--CC--EeecCCCc----ccccccccc-C---CccCCCCCccc--CC
Confidence 3346666666666431 2322 242 22222 22 67888888 555554221 3 344543 5664 22
Q ss_pred CcEEecCCCCC
Q psy7854 139 SFQSNFQDSSP 149 (153)
Q Consensus 139 ~~~C~C~~g~~ 149 (153)
+|.|.+||.
T Consensus 328 --~C~C~~Gy~ 336 (525)
T KOG1225|consen 328 --ECLCDEGYT 336 (525)
T ss_pred --ceEeCCCCc
Confidence 566666664
No 32
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.26 E-value=0.097 Score=26.62 Aligned_cols=25 Identities=16% Similarity=0.142 Sum_probs=17.5
Q ss_pred CCCCC-CCeeeecC-CCcEEecCCCCC
Q psy7854 125 VMICS-GGTCKNTP-GSFQSNFQDSSP 149 (153)
Q Consensus 125 ~~~C~-~~~C~~~~-g~~~C~C~~g~~ 149 (153)
...|. |+.|++.. |.+.|+|-+||.
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 35666 58999876 999999999995
No 33
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=90.87 E-value=0.12 Score=20.08 Aligned_cols=10 Identities=50% Similarity=1.491 Sum_probs=6.8
Q ss_pred eeeCCCCccc
Q psy7854 99 RCICNPGYEG 108 (153)
Q Consensus 99 ~C~C~~g~~~ 108 (153)
.|.|++||.+
T Consensus 1 ~C~C~~G~~G 10 (13)
T PF12661_consen 1 TCQCPPGWTG 10 (13)
T ss_dssp EEEE-TTEET
T ss_pred CccCcCCCcC
Confidence 4889999943
No 34
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=88.46 E-value=1.1 Score=21.97 Aligned_cols=24 Identities=29% Similarity=0.400 Sum_probs=18.0
Q ss_pred CCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854 126 MICSG-GTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 126 ~~C~~-~~C~~~~g~~~C~C~~g~~~~ 151 (153)
..|.+ ++|+.. ..+|.|.+||.+.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCC
Confidence 35764 899765 3589999999764
No 35
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=86.66 E-value=0.77 Score=30.12 Aligned_cols=35 Identities=11% Similarity=0.186 Sum_probs=24.0
Q ss_pred CccccCC-CCCCCCCeeeec--CCCcEEecCCCCCccc
Q psy7854 118 QNECLMD-VMICSGGTCKNT--PGSFQSNFQDSSPILK 152 (153)
Q Consensus 118 i~~C~~~-~~~C~~~~C~~~--~g~~~C~C~~g~~~~~ 152 (153)
+.+|... .+.|-|+.|... ...+.|.|..||.+.|
T Consensus 42 i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYtGeR 79 (139)
T PHA03099 42 IRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYTGIR 79 (139)
T ss_pred cccCChhhCCEeECCEEEeeccCCCceeECCCCccccc
Confidence 4455322 356766788654 4778999999998876
No 36
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=84.58 E-value=1.1 Score=29.47 Aligned_cols=32 Identities=34% Similarity=0.788 Sum_probs=24.7
Q ss_pred CCCCCCceeecC--CCceeeCCCCcccCCCCCCCccCc
Q psy7854 84 GICTNGACENLM--GTYRCICNPGYEGDPTGKLCSDQN 119 (153)
Q Consensus 84 ~~C~~~~C~~~~--g~~~C~C~~g~~~~~~g~~C~~i~ 119 (153)
+.|-||.|.-.. ..+.|.|..|| .|.+|+-.+
T Consensus 51 ~YClHG~C~yI~dl~~~~CrC~~GY----tGeRCEh~d 84 (139)
T PHA03099 51 GYCLHGDCIHARDIDGMYCRCSHGY----TGIRCQHVV 84 (139)
T ss_pred CEeECCEEEeeccCCCceeECCCCc----cccccccee
Confidence 567777887654 78999999999 778887543
No 37
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.11 E-value=0.47 Score=33.23 Aligned_cols=88 Identities=19% Similarity=0.314 Sum_probs=48.2
Q ss_pred eecccCCCCccCCCCCCCCcCCcccCCCCCCCCCCCCceeecC---CCceeeCCCCcccCCCCCCCc--cCccccCCCCC
Q psy7854 53 SQDLERLKFGLEPPTSRSEVHGANHHTTENPGICTNGACENLM---GTYRCICNPGYEGDPTGKLCS--DQNECLMDVMI 127 (153)
Q Consensus 53 ~~~~~~~~~~~~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~~---g~~~C~C~~g~~~~~~g~~C~--~i~~C~~~~~~ 127 (153)
....|.|..+|..... .|. .+.|.. -.|..+.|+-.+ ....|+|.-|+.. .+...|+ ...+|. -.
T Consensus 68 ~~~~C~C~~gY~~~~~---vCv-p~~C~~--~~Cg~GKCI~d~~~~~~~~CSC~IGkV~-~dn~kCtk~G~T~C~---LK 137 (197)
T PF06247_consen 68 RAYKCDCINGYILKQG---VCV-PNKCNN--KDCGSGKCILDPDNPNNPTCSCNIGKVP-DDNKKCTKTGETKCS---LK 137 (197)
T ss_dssp TSEEEEE-TTEEESSS---SEE-EGGGSS-----TTEEEEEEEGGGSEEEEEE-TEEET-TTTTESEEEE----------
T ss_pred eeEEEecccCceeeCC---eEc-hhhcCc--eecCCCeEEecCCCCCCceeEeeeceEe-ccCCcccCCCcccee---ee
Confidence 3455666667655432 453 234544 457667786543 3458999999983 2566774 445673 44
Q ss_pred CC-CCeeeecCCCcEEecCCCCCc
Q psy7854 128 CS-GGTCKNTPGSFQSNFQDSSPI 150 (153)
Q Consensus 128 C~-~~~C~~~~g~~~C~C~~g~~~ 150 (153)
|. +..|....+-|+|.|.+||.+
T Consensus 138 Ck~nE~CK~~~~~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 138 CKENEECKLVDGYYKCVCKEGFPG 161 (197)
T ss_dssp -TTTEEEEEETTEEEEEE-TT-EE
T ss_pred cCCCcceeeeCcEEEeecCCCCCC
Confidence 65 479999999999999999864
No 38
>KOG1225|consensus
Probab=82.29 E-value=3.2 Score=33.98 Aligned_cols=61 Identities=30% Similarity=0.675 Sum_probs=34.9
Q ss_pred CCCCCCCCcCCcccCCCCCCCCC-CCCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcE
Q psy7854 64 EPPTSRSEVHGANHHTTENPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQ 141 (153)
Q Consensus 64 ~~~~~~~~~C~~~~~C~~~~~~C-~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~ 141 (153)
+.++|.+..|.... |. ..| .++.|+ .-+|.|.+|| .|..|.... |.+ +.|++.
T Consensus 300 C~~g~~G~dCs~~~-cp---adC~g~G~Ci----~G~C~C~~Gy----~G~~C~~~~--------C~~~g~cv~g----- 354 (525)
T KOG1225|consen 300 CNPGYSGKDCSIRR-CP---ADCSGHGKCI----DGECLCDEGY----TGELCIQRA--------CSGGGQCVNG----- 354 (525)
T ss_pred cCCCcccccccccc-CC---ccCCCCCccc----CCceEeCCCC----cCCcccccc--------cCCCceeccC-----
Confidence 44455555554222 32 456 366776 3478999999 566664432 444 566442
Q ss_pred EecCCCCC
Q psy7854 142 SNFQDSSP 149 (153)
Q Consensus 142 C~C~~g~~ 149 (153)
|.|..||.
T Consensus 355 C~C~~Gw~ 362 (525)
T KOG1225|consen 355 CKCKKGWR 362 (525)
T ss_pred ceeccCcc
Confidence 66777775
No 39
>PHA02887 EGF-like protein; Provisional
Probab=80.81 E-value=1.6 Score=28.19 Aligned_cols=27 Identities=15% Similarity=0.215 Sum_probs=20.0
Q ss_pred CCCCCCeeeec--CCCcEEecCCCCCccc
Q psy7854 126 MICSGGTCKNT--PGSFQSNFQDSSPILK 152 (153)
Q Consensus 126 ~~C~~~~C~~~--~g~~~C~C~~g~~~~~ 152 (153)
+.|-++.|... .....|.|+.||.+.|
T Consensus 92 ~YCiHG~C~yI~dL~epsCrC~~GYtG~R 120 (126)
T PHA02887 92 DFCINGECMNIIDLDEKFCICNKGYTGIR 120 (126)
T ss_pred CEeeCCEEEccccCCCceeECCCCcccCC
Confidence 55656788653 4558899999998765
No 40
>PHA02887 EGF-like protein; Provisional
Probab=80.28 E-value=1.9 Score=27.88 Aligned_cols=31 Identities=45% Similarity=1.059 Sum_probs=23.6
Q ss_pred CCCCCCceeec--CCCceeeCCCCcccCCCCCCCccC
Q psy7854 84 GICTNGACENL--MGTYRCICNPGYEGDPTGKLCSDQ 118 (153)
Q Consensus 84 ~~C~~~~C~~~--~g~~~C~C~~g~~~~~~g~~C~~i 118 (153)
+.|-||.|.-. .....|.|..|| .|.+|+..
T Consensus 92 ~YCiHG~C~yI~dL~epsCrC~~GY----tG~RCE~v 124 (126)
T PHA02887 92 DFCINGECMNIIDLDEKFCICNKGY----TGIRCDEV 124 (126)
T ss_pred CEeeCCEEEccccCCCceeECCCCc----ccCCCCcc
Confidence 56777888765 466899999999 77788654
No 41
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=79.53 E-value=1.5 Score=21.81 Aligned_cols=25 Identities=12% Similarity=0.121 Sum_probs=15.5
Q ss_pred CCCCCCCeeeecCCCcEEecCCCCCcc
Q psy7854 125 VMICSGGTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 125 ~~~C~~~~C~~~~g~~~C~C~~g~~~~ 151 (153)
...|. +.|..... +.|.|+.||.+.
T Consensus 5 ~t~Cp-A~CDpn~~-~~C~CPeGyIld 29 (34)
T PF09064_consen 5 QTECP-ADCDPNSP-GQCFCPEGYILD 29 (34)
T ss_pred cccCC-CccCCCCC-CceeCCCceEec
Confidence 34454 66643222 489999999753
No 42
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=78.20 E-value=3 Score=26.40 Aligned_cols=33 Identities=15% Similarity=0.259 Sum_probs=23.1
Q ss_pred cCccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854 117 DQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 117 ~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~ 151 (153)
..+.|. ....|.. +.|. ......|.|.+||...
T Consensus 76 p~d~Cd-~y~~CG~~g~C~-~~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCD-VYGFCGPNGICN-SNNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCC-CccccCCccEeC-CCCCCceECCCCcCCC
Confidence 345675 3578884 9994 3345679999999754
No 43
>KOG1215|consensus
Probab=71.55 E-value=7 Score=34.01 Aligned_cols=71 Identities=24% Similarity=0.396 Sum_probs=45.3
Q ss_pred ccCCCCCCCCCCCCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCCCeee-ecCCCcEEecCCCCCc
Q psy7854 76 NHHTTENPGICTNGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSGGTCK-NTPGSFQSNFQDSSPI 150 (153)
Q Consensus 76 ~~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~~~C~-~~~g~~~C~C~~g~~~ 150 (153)
++++......+.+ .+.+......|.|..++.....+.. +...|....+.|. ..|+ +.++.|.|.|..||.+
T Consensus 325 ~~~~~~~~~~~~~-~~~~~~v~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~Cs-q~C~~~~p~~~~c~c~~g~~~ 396 (877)
T KOG1215|consen 325 LNECAERVLKCSH-KCPDVSVGPRCDCMGAKVLPLGART--DSNPCESDNGGCS-QLCVPNSPGTFKCACSPGYEL 396 (877)
T ss_pred cccchhhcccccC-CCCccccCCcccCCccceecccccc--cCCcccccCCccc-eeccCCCCCceeEecCCCcEe
Confidence 3444433333433 5666677778888887765333333 3344554667776 7887 5689999999999964
No 44
>KOG0994|consensus
Probab=67.11 E-value=12 Score=33.71 Aligned_cols=59 Identities=27% Similarity=0.593 Sum_probs=37.6
Q ss_pred ceeecCCCc--ee-eCCCCcccCCCCCCCc---cCccccCCCCCCCCCeeeecCCCcEEe-cCCCCCc
Q psy7854 90 ACENLMGTY--RC-ICNPGYEGDPTGKLCS---DQNECLMDVMICSGGTCKNTPGSFQSN-FQDSSPI 150 (153)
Q Consensus 90 ~C~~~~g~~--~C-~C~~g~~~~~~g~~C~---~i~~C~~~~~~C~~~~C~~~~g~~~C~-C~~g~~~ 150 (153)
.|.=..|.| .| .|.+||++.|.-+.|+ -.++|...++.|. .|....+++.|. |-.||-+
T Consensus 831 QC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi--~CqD~T~G~~CdrCl~GyyG 896 (1758)
T KOG0994|consen 831 QCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCDPITGACI--DCQDSTTGHSCDRCLDGYYG 896 (1758)
T ss_pred ceeeccccchhhccccCCCccCCCcCccccccCcccccCccccccc--cccccccccchhhhhccccC
Confidence 343334444 56 6899999888766664 3466765556664 455666667775 8888743
No 45
>KOG1226|consensus
Probab=64.37 E-value=50 Score=28.48 Aligned_cols=48 Identities=25% Similarity=0.594 Sum_probs=25.2
Q ss_pred eeeCCCCcccCCCCCCCccCc-cccC-CCCCCC-CCeeeecCCCcEEecCCCCCc
Q psy7854 99 RCICNPGYEGDPTGKLCSDQN-ECLM-DVMICS-GGTCKNTPGSFQSNFQDSSPI 150 (153)
Q Consensus 99 ~C~C~~g~~~~~~g~~C~~i~-~C~~-~~~~C~-~~~C~~~~g~~~C~C~~g~~~ 150 (153)
.|.|.+...+..+|..|+.-| .|.. ....|. ++.|. -| +|+|.+||.+
T Consensus 526 qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~--CG--~CvC~~GwtG 576 (783)
T KOG1226|consen 526 QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCE--CG--RCVCNPGWTG 576 (783)
T ss_pred ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEe--CC--cEEcCCCCcc
Confidence 466666554433577776322 2321 123455 35552 22 6888888865
No 46
>KOG0994|consensus
Probab=58.94 E-value=40 Score=30.75 Aligned_cols=62 Identities=21% Similarity=0.372 Sum_probs=32.7
Q ss_pred ceeecCCCcee-eCCCCcccCCC---CCCCccCccccCCCCC-CCC-Ceeee--cCCCcEEecCCCCCccc
Q psy7854 90 ACENLMGTYRC-ICNPGYEGDPT---GKLCSDQNECLMDVMI-CSG-GTCKN--TPGSFQSNFQDSSPILK 152 (153)
Q Consensus 90 ~C~~~~g~~~C-~C~~g~~~~~~---g~~C~~i~~C~~~~~~-C~~-~~C~~--~~g~~~C~C~~g~~~~~ 152 (153)
.|...-.++.| .|..||.++|. |..|..- +|-..+.. -++ ..|.. ......|.|.+||.+.|
T Consensus 877 ~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPC-pCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~R 946 (1758)
T KOG0994|consen 877 DCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPC-PCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSR 946 (1758)
T ss_pred cccccccccchhhhhccccCCcccCCCCCCCCC-CCCCCCccchhccccccccccccceeeecccCccccc
Confidence 45666677888 89999987653 2333221 11111100 011 13321 12235799999998765
No 47
>KOG3516|consensus
Probab=56.67 E-value=10 Score=34.02 Aligned_cols=39 Identities=26% Similarity=0.762 Sum_probs=30.5
Q ss_pred cCCcccCCCCCCCCCCCC-ceeecCCCceeeCC-CCcccCCCCCCCc
Q psy7854 72 VHGANHHTTENPGICTNG-ACENLMGTYRCICN-PGYEGDPTGKLCS 116 (153)
Q Consensus 72 ~C~~~~~C~~~~~~C~~~-~C~~~~g~~~C~C~-~g~~~~~~g~~C~ 116 (153)
.|..+|.|.. ++|+|+ .|......|.|.|. .|| .|..|.
T Consensus 541 ~C~i~drClP--N~CehgG~C~Qs~~~f~C~C~~TGY----~GatCH 581 (1306)
T KOG3516|consen 541 MCGISDRCLP--NPCEHGGKCSQSWDDFECNCELTGY----KGATCH 581 (1306)
T ss_pred ccccccccCC--ccccCCCcccccccceeEecccccc----cccccc
Confidence 5666777766 899765 89888889999998 799 566664
No 48
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=56.10 E-value=23 Score=18.93 Aligned_cols=29 Identities=24% Similarity=0.666 Sum_probs=17.8
Q ss_pred CCCCccCccccCCCCCCCCCeeeecCCCcEEecCCCCCc
Q psy7854 112 GKLCSDQNECLMDVMICSGGTCKNTPGSFQSNFQDSSPI 150 (153)
Q Consensus 112 g~~C~~i~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~ 150 (153)
|..|..-.+|. .++.|++ | .|.|++||..
T Consensus 19 g~~C~~~~qC~------~~s~C~~--g--~C~C~~g~~~ 47 (52)
T PF01683_consen 19 GESCESDEQCI------GGSVCVN--G--RCQCPPGYVE 47 (52)
T ss_pred CCCCCCcCCCC------CcCEEcC--C--EeECCCCCEe
Confidence 44565555553 2367744 2 7999999854
No 49
>KOG1226|consensus
Probab=51.55 E-value=53 Score=28.33 Aligned_cols=21 Identities=48% Similarity=1.319 Sum_probs=13.9
Q ss_pred eeeCCCCcccCCCCCCCc---cCccccC
Q psy7854 99 RCICNPGYEGDPTGKLCS---DQNECLM 123 (153)
Q Consensus 99 ~C~C~~g~~~~~~g~~C~---~i~~C~~ 123 (153)
+|+|.+|| .|..|. +.+.|..
T Consensus 567 ~CvC~~Gw----tG~~C~C~~std~C~~ 590 (783)
T KOG1226|consen 567 RCVCNPGW----TGSACNCPLSTDTCES 590 (783)
T ss_pred cEEcCCCC----ccCCCCCCCCCccccC
Confidence 67899999 555553 4555653
No 50
>KOG1836|consensus
Probab=50.53 E-value=25 Score=33.26 Aligned_cols=58 Identities=22% Similarity=0.393 Sum_probs=32.1
Q ss_pred eeecCCCcee-eCCCCcccCCCCCCCccCccccCCCCCCCC-Ceeeec--CCCcEEe-cCCCCCcccC
Q psy7854 91 CENLMGTYRC-ICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNT--PGSFQSN-FQDSSPILKI 153 (153)
Q Consensus 91 C~~~~g~~~C-~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~--~g~~~C~-C~~g~~~~~~ 153 (153)
|.....+-.| .|..||.+.+....-.| | .+=+|.+ +-|..+ .....|. |++||.++|+
T Consensus 749 C~~~t~G~~C~~C~~GfYg~~~~~~~~d---C--~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC 811 (1705)
T KOG1836|consen 749 CKHNTFGGQCAQCVDGFYGLPDLGTSGD---C--QPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRC 811 (1705)
T ss_pred cccCCCCCchhhhcCCCCCccccCCCCC---C--ccCCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence 4333333344 78888855444322222 4 3344443 445443 4556888 9999998874
No 51
>KOG3516|consensus
Probab=46.00 E-value=18 Score=32.58 Aligned_cols=36 Identities=19% Similarity=0.449 Sum_probs=29.2
Q ss_pred CCCccCccccCCCCCCCC-CeeeecCCCcEEecC-CCCCc
Q psy7854 113 KLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQ-DSSPI 150 (153)
Q Consensus 113 ~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~-~g~~~ 150 (153)
..|.-++.| .+++|++ +.|..+...|.|.|. .||.+
T Consensus 540 d~C~i~drC--lPN~CehgG~C~Qs~~~f~C~C~~TGY~G 577 (1306)
T KOG3516|consen 540 DMCGISDRC--LPNPCEHGGKCSQSWDDFECNCELTGYKG 577 (1306)
T ss_pred ccccccccc--CCccccCCCcccccccceeEecccccccc
Confidence 356667788 7899997 899888889999998 68753
No 52
>KOG0196|consensus
Probab=45.04 E-value=23 Score=30.83 Aligned_cols=51 Identities=20% Similarity=0.472 Sum_probs=29.1
Q ss_pred ceeeCCCCcccCCCCCCCccC----ccccCCC---CCCCCCeeeecCCCcEEecCCCC
Q psy7854 98 YRCICNPGYEGDPTGKLCSDQ----NECLMDV---MICSGGTCKNTPGSFQSNFQDSS 148 (153)
Q Consensus 98 ~~C~C~~g~~~~~~g~~C~~i----~~C~~~~---~~C~~~~C~~~~g~~~C~C~~g~ 148 (153)
-.|.|.+||+....+..|+.- -.-.... .+|....-...+|.-.|.|..||
T Consensus 259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gy 316 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGY 316 (996)
T ss_pred CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCc
Confidence 368899999775556666521 1111112 23431111235788899999998
No 53
>smart00051 DSL delta serrate ligand.
Probab=41.08 E-value=69 Score=18.19 Aligned_cols=45 Identities=20% Similarity=0.270 Sum_probs=24.3
Q ss_pred CceeeCCCCcccCCCCCCCccCccccCCCCCCC-CCeeeecCCCcEEecCCCCCcc
Q psy7854 97 TYRCICNPGYEGDPTGKLCSDQNECLMDVMICS-GGTCKNTPGSFQSNFQDSSPIL 151 (153)
Q Consensus 97 ~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~ 151 (153)
.+.-.|.++| .|..|... |.. ...+. +..|.. .| .++|.+||.+.
T Consensus 16 ~~rv~C~~~~----yG~~C~~~--C~~-~~d~~~~~~Cd~-~G--~~~C~~Gw~G~ 61 (63)
T smart00051 16 QIRVTCDENY----YGEGCNKF--CRP-RDDFFGHYTCDE-NG--NKGCLEGWMGP 61 (63)
T ss_pred EEEeeCCCCC----cCCccCCE--eCc-CccccCCccCCc-CC--CEecCCCCcCC
Confidence 3455788888 55566422 321 11122 246632 34 47899999764
No 54
>PF07359 LEAP-2: Liver-expressed antimicrobial peptide 2 precursor (LEAP-2); InterPro: IPR009955 This family consists of several mammalian liver-expressed antimicrobial peptide 2 (LEAP-2) sequences. LEAP-2 is a cysteine-rich, and cationic protein. LEAP-2 contains a core structure with two disulphide bonds formed by cysteine residues in relative 1-3 and 2-4 positions. LEAP-2 is synthesised as a 77-residue precursor, which is predominantly expressed in the liver and highly conserved among mammals. The largest native LEAP-2 form of 40 amino acid residues is generated from the precursor at a putative cleavage site for a furin-like endoprotease. In contrast to smaller LEAP-2 variants, this peptide exhibits dose-dependent antimicrobial activity against selected microbial model organisms []. The exact function of this family is unclear.; GO: 0042742 defense response to bacterium; PDB: 2L1Q_A.
Probab=35.84 E-value=26 Score=20.79 Aligned_cols=25 Identities=36% Similarity=0.905 Sum_probs=16.7
Q ss_pred ccCCCCCCCccCccccCCCCCCCCCee
Q psy7854 107 EGDPTGKLCSDQNECLMDVMICSGGTC 133 (153)
Q Consensus 107 ~~~~~g~~C~~i~~C~~~~~~C~~~~C 133 (153)
...|.|..|.|-.|| .+..|.++.|
T Consensus 46 s~kP~GAsCrd~~EC--~T~lCR~~~C 70 (77)
T PF07359_consen 46 SLKPIGASCRDNSEC--STGLCRKGHC 70 (77)
T ss_dssp SSB-TTSB-SSGGGB--TTS-EETTEE
T ss_pred cCCcCcchhccchhh--hHHHhcCCcc
Confidence 334578899999999 6788876666
No 55
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=29.60 E-value=34 Score=21.71 Aligned_cols=30 Identities=20% Similarity=0.443 Sum_probs=20.6
Q ss_pred ccccCCCCCCC-CCeeeecC-----CCcEEecCCCC
Q psy7854 119 NECLMDVMICS-GGTCKNTP-----GSFQSNFQDSS 148 (153)
Q Consensus 119 ~~C~~~~~~C~-~~~C~~~~-----g~~~C~C~~g~ 148 (153)
++|....+.|. ++.|++.. .=|.|.|.+.+
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~ 41 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTV 41 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeeccc
Confidence 45666677786 48888762 34889998743
No 56
>KOG3512|consensus
Probab=28.77 E-value=1.6e+02 Score=24.21 Aligned_cols=15 Identities=33% Similarity=0.578 Sum_probs=8.6
Q ss_pred CCCceeeCCCCcccC
Q psy7854 95 MGTYRCICNPGYEGD 109 (153)
Q Consensus 95 ~g~~~C~C~~g~~~~ 109 (153)
.|.+--.|..||..+
T Consensus 369 aGrhChyCreGyyRd 383 (592)
T KOG3512|consen 369 AGRHCHYCREGYYRD 383 (592)
T ss_pred CCcccccccCccccC
Confidence 343333588888653
No 57
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=22.98 E-value=28 Score=18.79 Aligned_cols=19 Identities=32% Similarity=0.901 Sum_probs=12.0
Q ss_pred eeCCCCcccCCCCCCCccCc
Q psy7854 100 CICNPGYEGDPTGKLCSDQN 119 (153)
Q Consensus 100 C~C~~g~~~~~~g~~C~~i~ 119 (153)
|.|++||.+... ..|....
T Consensus 35 C~C~~G~v~~~~-~~CV~~~ 53 (55)
T PF01826_consen 35 CFCPPGYVRNDN-GRCVPPS 53 (55)
T ss_dssp EEETTTEEEETT-SEEEEGG
T ss_pred CCCCCCeeEcCC-CCEEcHH
Confidence 889999976433 3454443
No 58
>KOG3509|consensus
Probab=21.77 E-value=2.7e+02 Score=25.13 Aligned_cols=60 Identities=30% Similarity=0.563 Sum_probs=36.9
Q ss_pred CCCC-CCceeecCCCceeeCCCCcccCCCCCCCcc-CccccCCCCCCCCCeeeecCCCcEEecCCC
Q psy7854 84 GICT-NGACENLMGTYRCICNPGYEGDPTGKLCSD-QNECLMDVMICSGGTCKNTPGSFQSNFQDS 147 (153)
Q Consensus 84 ~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~-i~~C~~~~~~C~~~~C~~~~g~~~C~C~~g 147 (153)
.+|+ .+.|........|.|++|| .|..|.+ .+.|...++.+.-++|....+...+.|.+|
T Consensus 412 ~p~~~~g~c~p~~~~~~c~c~~g~----~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg 473 (964)
T KOG3509|consen 412 IPCQHDGPCLQTLEGKQCLCPPGY----TGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG 473 (964)
T ss_pred ccCCCCccccccccccceeccccc----cCchhhccCccccccCCccccceEeccCCCcceeecCC
Confidence 4564 3367777888899999999 4555543 223322333332267766666666678777
Done!