Query psy6349
Match_columns 145
No_of_seqs 118 out of 1951
Neff 11.0
Searched_HMMs 46136
Date Sat Aug 17 00:08:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6349.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6349hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.6 1.4E-14 3E-19 114.5 9.5 108 10-117 3865-3975(4289)
2 KOG1214|consensus 99.3 2.6E-11 5.7E-16 89.6 10.7 137 6-143 691-860 (1289)
3 KOG1219|consensus 99.2 4.2E-11 9.1E-16 95.8 8.6 96 46-141 3864-3973(4289)
4 KOG4289|consensus 99.1 7E-10 1.5E-14 86.2 7.5 85 28-112 1221-1308(2531)
5 KOG1214|consensus 99.0 3.6E-09 7.9E-14 78.7 7.9 114 2-116 729-859 (1289)
6 KOG4289|consensus 98.8 8.3E-09 1.8E-13 80.5 4.9 76 63-138 1218-1308(2531)
7 KOG1225|consensus 98.7 2.1E-07 4.5E-12 67.3 9.7 95 30-143 266-365 (525)
8 KOG4260|consensus 98.5 4E-07 8.6E-12 60.0 6.2 124 15-140 150-304 (350)
9 KOG1225|consensus 98.0 6.7E-05 1.5E-09 54.7 8.9 95 30-142 235-338 (525)
10 PF07645 EGF_CA: Calcium-bindi 98.0 9.9E-06 2.1E-10 38.8 3.3 33 6-38 1-34 (42)
11 KOG1217|consensus 98.0 0.00011 2.5E-09 53.1 10.2 124 20-143 140-305 (487)
12 PF06247 Plasmod_Pvs28: Plasmo 97.8 2.5E-05 5.5E-10 48.9 2.9 125 15-143 6-163 (197)
13 KOG1217|consensus 97.7 0.00054 1.2E-08 49.6 9.6 132 4-137 268-421 (487)
14 PF07645 EGF_CA: Calcium-bindi 97.6 4.2E-05 9.1E-10 36.6 2.1 19 122-140 16-34 (42)
15 PF12947 EGF_3: EGF domain; I 97.4 0.00013 2.9E-09 33.6 1.9 26 118-143 7-33 (36)
16 PF00008 EGF: EGF-like domain 97.3 0.00017 3.6E-09 32.3 1.8 23 92-114 6-29 (32)
17 PF00008 EGF: EGF-like domain 97.3 0.00027 5.9E-09 31.6 2.2 27 14-40 3-31 (32)
18 smart00179 EGF_CA Calcium-bind 97.2 0.001 2.2E-08 30.9 4.0 18 21-38 16-33 (39)
19 PF14670 FXa_inhibition: Coagu 97.2 0.00024 5.2E-09 32.7 1.5 23 122-144 10-32 (36)
20 smart00179 EGF_CA Calcium-bind 97.2 0.0012 2.5E-08 30.7 3.9 35 46-80 2-38 (39)
21 PF12947 EGF_3: EGF domain; I 97.1 0.00049 1.1E-08 31.7 2.0 27 13-39 4-31 (36)
22 PF12662 cEGF: Complement Clr- 97.0 0.00074 1.6E-08 28.0 1.7 15 130-144 1-15 (24)
23 cd00054 EGF_CA Calcium-binding 96.8 0.004 8.6E-08 28.4 4.0 19 21-39 16-34 (38)
24 smart00181 EGF Epidermal growt 96.5 0.0098 2.1E-07 26.8 3.9 25 15-39 6-30 (35)
25 cd00054 EGF_CA Calcium-binding 96.2 0.014 3.1E-07 26.5 3.7 24 92-115 11-34 (38)
26 KOG1226|consensus 96.1 0.068 1.5E-06 41.0 8.5 79 30-116 479-577 (783)
27 KOG4260|consensus 96.0 0.006 1.3E-07 40.9 2.4 34 5-38 234-268 (350)
28 cd00053 EGF Epidermal growth f 96.0 0.023 4.9E-07 25.3 3.8 20 122-141 12-31 (36)
29 smart00181 EGF Epidermal growt 95.9 0.024 5.2E-07 25.4 3.7 19 59-77 12-30 (35)
30 PF14670 FXa_inhibition: Coagu 95.8 0.0089 1.9E-07 27.4 1.9 23 94-116 8-30 (36)
31 PF12661 hEGF: Human growth fa 95.8 0.0061 1.3E-07 21.4 1.0 12 31-42 2-13 (13)
32 cd00053 EGF Epidermal growth f 95.6 0.04 8.6E-07 24.5 3.9 20 20-39 12-31 (36)
33 PF12946 EGF_MSP1_1: MSP1 EGF 95.2 0.0086 1.9E-07 27.5 0.8 24 121-144 10-34 (37)
34 PF07974 EGF_2: EGF-like domai 94.8 0.054 1.2E-06 24.1 2.7 26 15-42 6-32 (32)
35 KOG0994|consensus 94.0 0.38 8.3E-06 39.1 7.3 13 30-42 783-795 (1758)
36 PHA02887 EGF-like protein; Pro 92.0 0.2 4.4E-06 29.3 2.6 27 17-43 94-122 (126)
37 KOG1226|consensus 92.0 0.58 1.3E-05 36.2 5.6 67 68-142 479-577 (783)
38 cd01475 vWA_Matrilin VWA_Matri 90.0 0.31 6.8E-06 32.1 2.5 23 122-144 199-221 (224)
39 PHA03099 epidermal growth fact 89.9 0.35 7.7E-06 28.8 2.3 27 17-43 53-81 (139)
40 cd01475 vWA_Matrilin VWA_Matri 88.9 0.69 1.5E-05 30.5 3.6 33 83-115 186-218 (224)
41 KOG0994|consensus 88.0 1.2 2.6E-05 36.5 4.6 18 97-116 1078-1095(1758)
42 PF09064 Tme5_EGF_like: Thromb 83.1 0.92 2E-05 20.4 1.3 13 132-144 19-31 (34)
43 KOG3512|consensus 83.0 14 0.00031 27.5 7.7 28 16-43 279-309 (592)
44 smart00051 DSL delta serrate l 71.0 13 0.00028 19.3 4.1 45 30-80 18-63 (63)
45 PF00954 S_locus_glycop: S-loc 67.0 5.8 0.00012 23.0 2.1 19 121-140 89-107 (110)
46 KOG1836|consensus 61.3 22 0.00048 31.1 5.0 30 87-116 777-809 (1705)
47 KOG3516|consensus 60.6 9.9 0.00021 31.6 2.8 35 12-46 548-584 (1306)
48 PF00053 Laminin_EGF: Laminin 55.6 14 0.0003 17.8 2.0 22 20-43 11-32 (49)
49 cd00055 EGF_Lam Laminin-type e 54.6 25 0.00054 17.0 3.1 13 30-42 20-32 (50)
50 PF01683 EB: EB module; Inter 43.8 25 0.00054 17.1 1.9 9 106-114 38-46 (52)
51 KOG3516|consensus 41.7 32 0.0007 28.9 3.0 37 46-82 545-582 (1306)
52 KOG3514|consensus 34.4 35 0.00077 28.5 2.2 35 48-82 625-660 (1591)
53 smart00180 EGF_Lam Laminin-typ 30.7 70 0.0015 15.2 3.1 13 30-42 19-31 (46)
54 PF12955 DUF3844: Domain of un 26.7 97 0.0021 18.0 2.6 26 90-115 13-43 (103)
55 PF04863 EGF_alliinase: Alliin 25.2 50 0.0011 16.7 1.1 17 65-81 34-50 (56)
No 1
>KOG1219|consensus
Probab=99.57 E-value=1.4e-14 Score=114.49 Aligned_cols=108 Identities=31% Similarity=0.912 Sum_probs=97.6
Q ss_pred CCCCCCCCCC-CEeeeCC-CceeeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCcccc-CCC
Q psy6349 10 GLKREALCHN-GTCEDFG-NSHRCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELN-VDD 86 (145)
Q Consensus 10 ~~~~~~~c~~-~~c~~~~-~~~~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~-~~~ 86 (145)
-.|...+|+| |.|.... +.|.|.|.+.|+|..|+.++..|.+.||..++.|....+.+.|.|+.+|.|..|+.. +++
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~e 3944 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISE 3944 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccc
Confidence 3446678887 7999775 679999999999999998899999999999999999999999999999999999876 789
Q ss_pred CcCCCCCCCCEEeeCCCCeEeeCCCCCccCC
Q psy6349 87 CKPNPCQNGGICHDLINHFSCSCPSGLKREA 117 (145)
Q Consensus 87 c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~~ 117 (145)
|+...|.+++.|+++.|+|+|.|.++|.+..
T Consensus 3945 Cs~n~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3945 CSKNVCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred cccccccCCceeeccCCceEeccChhHhccc
Confidence 9999999999999999999999999998763
No 2
>KOG1214|consensus
Probab=99.32 E-value=2.6e-11 Score=89.61 Aligned_cols=137 Identities=27% Similarity=0.707 Sum_probs=99.9
Q ss_pred ecCCCCCCCCCCC-CCEeeeCCC-ceeeecCCCcc--cCCCCccCCCCCCC--CCCCCCeeeeCCCceEEeCCCCcc--c
Q psy6349 6 VSSPGLKREALCH-NGTCEDFGN-SHRCHCMEGYT--GSYCSIEINECESD--PCQNGATCQDLIGGYSCQCARGFQ--G 77 (145)
Q Consensus 6 ~~~~~~~~~~~c~-~~~c~~~~~-~~~C~c~~g~~--~~~c~~~~~~c~~~--~c~~~~~c~~~~~~~~c~c~~g~~--g 77 (145)
..++|.++.+.|. ++.|.+.++ .|.|.|..+|. +..|. ++++|... .|+.+..|++..+.+.|.|..+|. +
T Consensus 691 ~~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~d 769 (1289)
T KOG1214|consen 691 PVNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFAD 769 (1289)
T ss_pred ccccceecCcccCCCccccCCCCcceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceecc
Confidence 4467788888885 568887765 58999999997 66774 67777653 488899999999999998877764 2
Q ss_pred --CCccc-----cCCCCc--CCCCCCCC--EEeeCC-CCeEeeCCCCCccC------------CCCC-CCeeeeCCCcee
Q psy6349 78 --QNCEL-----NVDDCK--PNPCQNGG--ICHDLI-NHFSCSCPSGLKRE------------ALCH-NGTCEDFGNSHR 132 (145)
Q Consensus 78 --~~c~~-----~~~~c~--~~~~~~~~--~c~~~~-~~~~C~c~~g~~~~------------~~C~-~~~c~~~~g~~~ 132 (145)
-.|-. ..+.|. .+.|...+ .|+... +.|.|.|+|||.++ ..|+ ++.|+++.|++.
T Consensus 770 d~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~Cyntpgsfs 849 (1289)
T KOG1214|consen 770 DRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFS 849 (1289)
T ss_pred CCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccCccccCCCceEecCCCcce
Confidence 23421 223443 23454444 344443 56999999999876 2465 789999999999
Q ss_pred EecCCCCCcCC
Q psy6349 133 CHCICWDDLRN 143 (145)
Q Consensus 133 c~c~~~~~~~~ 143 (145)
|+|++||.+++
T Consensus 850 C~C~pGy~GDG 860 (1289)
T KOG1214|consen 850 CRCQPGYYGDG 860 (1289)
T ss_pred eecccCccCCC
Confidence 99999998865
No 3
>KOG1219|consensus
Probab=99.24 E-value=4.2e-11 Score=95.76 Aligned_cols=96 Identities=35% Similarity=0.957 Sum_probs=85.9
Q ss_pred CCCCCCCCCCCCCeeeeCC-CceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCccC--------
Q psy6349 46 INECESDPCQNGATCQDLI-GGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSCPSGLKRE-------- 116 (145)
Q Consensus 46 ~~~c~~~~c~~~~~c~~~~-~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~-------- 116 (145)
.+.|.+.||.+++.|.... +.|.|.|+..|.|+.|..+...|.+.+|..++.|+...+.|.|.|+.+|++.
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~ 3943 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGIS 3943 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeeccccc
Confidence 3778999999999998764 6799999999999999988999999999999999999999999999999875
Q ss_pred ----CCCC-CCeeeeCCCceeEecCCCCCc
Q psy6349 117 ----ALCH-NGTCEDFGNSHRCHCICWDDL 141 (145)
Q Consensus 117 ----~~C~-~~~c~~~~g~~~c~c~~~~~~ 141 (145)
+.|. +|+|.+..|+|+|-|-++|.+
T Consensus 3944 eCs~n~C~~gg~C~n~~gsf~CncT~g~~g 3973 (4289)
T KOG1219|consen 3944 ECSKNVCGTGGQCINIPGSFHCNCTPGILG 3973 (4289)
T ss_pred ccccccccCCceeeccCCceEeccChhHhc
Confidence 3455 679999999999999988864
No 4
>KOG4289|consensus
Probab=99.05 E-value=7e-10 Score=86.18 Aligned_cols=85 Identities=46% Similarity=1.194 Sum_probs=75.6
Q ss_pred ceeeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCccccC--CCCcCCCCCCCCEEeeCC-CC
Q psy6349 28 SHRCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELNV--DDCKPNPCQNGGICHDLI-NH 104 (145)
Q Consensus 28 ~~~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~~--~~c~~~~~~~~~~c~~~~-~~ 104 (145)
..+|.|++||+|..|+++++.|-+.||.+++.|....+.|.|.|.++|.|..|..+. ..|.+..|.+++.|++.. +.
T Consensus 1221 glrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ngg 1300 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGG 1300 (2531)
T ss_pred ceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCc
Confidence 468999999999999999999999999999999999999999999999999997654 357788899999998754 67
Q ss_pred eEeeCCCC
Q psy6349 105 FSCSCPSG 112 (145)
Q Consensus 105 ~~C~c~~g 112 (145)
+.|.|+.|
T Consensus 1301 f~c~Cp~g 1308 (2531)
T KOG4289|consen 1301 FCCHCPYG 1308 (2531)
T ss_pred eeccCCCc
Confidence 88999987
No 5
>KOG1214|consensus
Probab=98.96 E-value=3.6e-09 Score=78.70 Aligned_cols=114 Identities=32% Similarity=0.752 Sum_probs=83.0
Q ss_pred ceeeecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcc----cCCCCcc-----CCCCCCC--CCCCCC--eeeeC-CCc
Q psy6349 2 RNLIVSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYT----GSYCSIE-----INECESD--PCQNGA--TCQDL-IGG 66 (145)
Q Consensus 2 ~~~~~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~----~~~c~~~-----~~~c~~~--~c~~~~--~c~~~-~~~ 66 (145)
++|..++.++...+-|- +..|++-+++|+|.|..+|. +..|..- ++.|... .|...+ .|+.. .+.
T Consensus 729 r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~ 808 (1289)
T KOG1214|consen 729 RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGST 808 (1289)
T ss_pred CCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCce
Confidence 45667777888888885 68999999999888877764 3455321 1223221 232223 33333 357
Q ss_pred eEEeCCCCccc--CCccccCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCccC
Q psy6349 67 YSCQCARGFQG--QNCELNVDDCKPNPCQNGGICHDLINHFSCSCPSGLKRE 116 (145)
Q Consensus 67 ~~c~c~~g~~g--~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~ 116 (145)
|.|.|.+||.| ..|. ++++|.++.|.....|.++.+++.|+|.+||.++
T Consensus 809 y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 809 YSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred EEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCccCC
Confidence 99999999984 4565 6799999999999999999999999999999876
No 6
>KOG4289|consensus
Probab=98.79 E-value=8.3e-09 Score=80.54 Aligned_cols=76 Identities=34% Similarity=0.895 Sum_probs=63.4
Q ss_pred CCCceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCccC--------CCCCC------CeeeeC-
Q psy6349 63 LIGGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSCPSGLKRE--------ALCHN------GTCEDF- 127 (145)
Q Consensus 63 ~~~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~--------~~C~~------~~c~~~- 127 (145)
..+.+.|.|++||.|..|...++.|-..+|.+++.|....+.|+|.|.++|+++ ..|.+ +.|.+.
T Consensus 1218 pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred ccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence 346789999999999999988899988899999999999999999999999987 23433 345443
Q ss_pred CCceeEecCCC
Q psy6349 128 GNSHRCHCICW 138 (145)
Q Consensus 128 ~g~~~c~c~~~ 138 (145)
.|+++|+|+.|
T Consensus 1298 nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1298 NGGFCCHCPYG 1308 (2531)
T ss_pred CCceeccCCCc
Confidence 48889999988
No 7
>KOG1225|consensus
Probab=98.71 E-value=2.1e-07 Score=67.29 Aligned_cols=95 Identities=34% Similarity=0.877 Sum_probs=69.2
Q ss_pred eeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeC
Q psy6349 30 RCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSC 109 (145)
Q Consensus 30 ~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c 109 (145)
+|+|++||+|..| ++..|... |+.++.++ .+ .|+|+++|.|..|+. ..|. ..|..++.|+.. .|.|
T Consensus 266 ~CIC~~Gf~G~dC--~e~~Cp~~-cs~~g~~~--~g--~CiC~~g~~G~dCs~--~~cp-adC~g~G~Ci~G----~C~C 331 (525)
T KOG1225|consen 266 RCICPPGFTGDDC--DELVCPVD-CSGGGVCV--DG--ECICNPGYSGKDCSI--RRCP-ADCSGHGKCIDG----ECLC 331 (525)
T ss_pred eEeCCCCCcCCCC--CcccCCcc-cCCCceec--CC--EeecCCCcccccccc--ccCC-ccCCCCCcccCC----ceEe
Confidence 7999999999999 34456544 66666664 22 699999999998852 3343 568888999733 7999
Q ss_pred CCCCccC----CCCC-CCeeeeCCCceeEecCCCCCcCC
Q psy6349 110 PSGLKRE----ALCH-NGTCEDFGNSHRCHCICWDDLRN 143 (145)
Q Consensus 110 ~~g~~~~----~~C~-~~~c~~~~g~~~c~c~~~~~~~~ 143 (145)
.+||.++ ..|+ +++|++ | |.|++||++++
T Consensus 332 ~~Gy~G~~C~~~~C~~~g~cv~--g---C~C~~Gw~G~d 365 (525)
T KOG1225|consen 332 DEGYTGELCIQRACSGGGQCVN--G---CKCKKGWRGPD 365 (525)
T ss_pred CCCCcCCcccccccCCCceecc--C---ceeccCccCCC
Confidence 9999986 2255 456653 2 99999998765
No 8
>KOG4260|consensus
Probab=98.52 E-value=4e-07 Score=59.97 Aligned_cols=124 Identities=31% Similarity=0.779 Sum_probs=77.6
Q ss_pred CCCC-CCEee-eC--CCceeeecCCCcccCCCCc-cCCCCCC-----C----CCC--CCCeeeeCCCceEE-eCCCCcc-
Q psy6349 15 ALCH-NGTCE-DF--GNSHRCHCMEGYTGSYCSI-EINECES-----D----PCQ--NGATCQDLIGGYSC-QCARGFQ- 76 (145)
Q Consensus 15 ~~c~-~~~c~-~~--~~~~~C~c~~g~~~~~c~~-~~~~c~~-----~----~c~--~~~~c~~~~~~~~c-~c~~g~~- 76 (145)
.+|. +|.|. .. .|+..|.|..||.|+.|.. ++....+ . .|. ..+.|.... .-.| .|..||.
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~-~k~C~kCkkGW~l 228 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGES-SKGCSKCKKGWKL 228 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCC-CCChhhhccccee
Confidence 4575 57886 22 3456899999999999841 1100000 0 011 111232111 1122 4666765
Q ss_pred -cCCccccCCCCc--CCCCCCCCEEeeCCCCeEeeCCCCCccC--------CCCC--CCeeeeCCCceeEecCCCCC
Q psy6349 77 -GQNCELNVDDCK--PNPCQNGGICHDLINHFSCSCPSGLKRE--------ALCH--NGTCEDFGNSHRCHCICWDD 140 (145)
Q Consensus 77 -g~~c~~~~~~c~--~~~~~~~~~c~~~~~~~~C~c~~g~~~~--------~~C~--~~~c~~~~g~~~c~c~~~~~ 140 (145)
...| .++++|. +.+|..+..|+|+.|+|.|...+||... ..|. +..|.++.++|+|+|+.++.
T Consensus 229 de~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 229 DEEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred ccccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence 2234 3677775 4567788899999999999999999753 2343 67899999999999988764
No 9
>KOG1225|consensus
Probab=98.02 E-value=6.7e-05 Score=54.69 Aligned_cols=95 Identities=35% Similarity=0.828 Sum_probs=63.0
Q ss_pred eeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeC
Q psy6349 30 RCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSC 109 (145)
Q Consensus 30 ~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c 109 (145)
.|.|..+|.++.|. ...|.. .|...+.|+ .-+|+|.++|.|.+|.. ..|... |..++.+++. +|.|
T Consensus 235 ic~c~~~~~g~~c~--~~~C~~-~c~~~g~c~----~G~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g----~CiC 300 (525)
T KOG1225|consen 235 ICECPEGYFGPLCS--TIYCPG-GCTGRGQCV----EGRCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDG----ECIC 300 (525)
T ss_pred eeecCCceeCCccc--cccCCC-CCcccceEe----CCeEeCCCCCcCCCCCc--ccCCcc-cCCCceecCC----Eeec
Confidence 46777777777663 233322 233334554 23699999999999952 445444 6666676544 8999
Q ss_pred CCCCccC--------CCCC-CCeeeeCCCceeEecCCCCCcC
Q psy6349 110 PSGLKRE--------ALCH-NGTCEDFGNSHRCHCICWDDLR 142 (145)
Q Consensus 110 ~~g~~~~--------~~C~-~~~c~~~~g~~~c~c~~~~~~~ 142 (145)
.++|.++ ..|. ++.|.+ | +|.|.+||++.
T Consensus 301 ~~g~~G~dCs~~~cpadC~g~G~Ci~--G--~C~C~~Gy~G~ 338 (525)
T KOG1225|consen 301 NPGYSGKDCSIRRCPADCSGHGKCID--G--ECLCDEGYTGE 338 (525)
T ss_pred CCCccccccccccCCccCCCCCcccC--C--ceEeCCCCcCC
Confidence 9999986 2355 678873 3 69999999864
No 10
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.02 E-value=9.9e-06 Score=38.83 Aligned_cols=33 Identities=30% Similarity=0.551 Sum_probs=27.6
Q ss_pred ecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcc
Q psy6349 6 VSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYT 38 (145)
Q Consensus 6 ~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~ 38 (145)
++++|....+.|. ++.|++..|+|.|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4677888888886 57899999999999999887
No 11
>KOG1217|consensus
Probab=98.01 E-value=0.00011 Score=53.07 Aligned_cols=124 Identities=37% Similarity=0.906 Sum_probs=83.0
Q ss_pred CEeeeC---CCceeeecCCCcccCCCCccCCCCCC--CCCCCCCeeeeCCCceEEeCCCCcccCCcccc-----------
Q psy6349 20 GTCEDF---GNSHRCHCMEGYTGSYCSIEINECES--DPCQNGATCQDLIGGYSCQCARGFQGQNCELN----------- 83 (145)
Q Consensus 20 ~~c~~~---~~~~~C~c~~g~~~~~c~~~~~~c~~--~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~----------- 83 (145)
+.|... ...+.|.|..+|.+..+....+.|.. .+|..+..|.+..+.|.|.|..+|.+..+...
T Consensus 140 ~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~ 219 (487)
T KOG1217|consen 140 GSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVA 219 (487)
T ss_pred hhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEeccee
Confidence 455543 24678999999998877533356763 34777888888888899999999986655421
Q ss_pred --------CCCCcC--CCCCCC-CEEeeCCCCeEeeCCCCCccCC--------------CCC-CCeeeeCCCceeEecCC
Q psy6349 84 --------VDDCKP--NPCQNG-GICHDLINHFSCSCPSGLKREA--------------LCH-NGTCEDFGNSHRCHCIC 137 (145)
Q Consensus 84 --------~~~c~~--~~~~~~-~~c~~~~~~~~C~c~~g~~~~~--------------~C~-~~~c~~~~g~~~c~c~~ 137 (145)
...+.. ..+... +.|.+..+.+.|.++++|.+.. .|. ++.|.+..+.|.|.|++
T Consensus 220 ~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~ 299 (487)
T KOG1217|consen 220 CSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPP 299 (487)
T ss_pred ccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCC
Confidence 011110 112211 6677777778898888876431 255 56888888889999999
Q ss_pred CCCcCC
Q psy6349 138 WDDLRN 143 (145)
Q Consensus 138 ~~~~~~ 143 (145)
+|.+..
T Consensus 300 g~~g~~ 305 (487)
T KOG1217|consen 300 GFTGRL 305 (487)
T ss_pred CCCCCC
Confidence 998764
No 12
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.77 E-value=2.5e-05 Score=48.92 Aligned_cols=125 Identities=28% Similarity=0.654 Sum_probs=75.0
Q ss_pred CCCCCCEeeeCCCceeeecCCCcc---cCCCCccCCCCC-----CCCCCCCCeeeeCC-----CceEEeCCCCcccC--C
Q psy6349 15 ALCHNGTCEDFGNSHRCHCMEGYT---GSYCSIEINECE-----SDPCQNGATCQDLI-----GGYSCQCARGFQGQ--N 79 (145)
Q Consensus 15 ~~c~~~~c~~~~~~~~C~c~~g~~---~~~c~~~~~~c~-----~~~c~~~~~c~~~~-----~~~~c~c~~g~~g~--~ 79 (145)
..|.+|..++++..|.|.|..||+ ...|+ ....|. ..+|...+.|.... ..|.|.|..+|.-. .
T Consensus 6 T~CKNG~LiQMSNHfEC~Cnegfvl~~EntCE-~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~v 84 (197)
T PF06247_consen 6 TICKNGYLIQMSNHFECKCNEGFVLKNENTCE-EKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGV 84 (197)
T ss_dssp ---BTEEEEEESSEEEEEESTTEEEEETTEEE-E----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSS
T ss_pred ccccCCEEEEccCceEEEcCCCcEEccccccc-cceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCe
Confidence 467889999999999999999997 34454 333443 24677777787654 57999999999622 2
Q ss_pred ccccCCCCcCCCCCCCCEEeeCC---CCeEeeCCCCCccC----------CC----CC-CCeeeeCCCceeEecCCCCCc
Q psy6349 80 CELNVDDCKPNPCQNGGICHDLI---NHFSCSCPSGLKRE----------AL----CH-NGTCEDFGNSHRCHCICWDDL 141 (145)
Q Consensus 80 c~~~~~~c~~~~~~~~~~c~~~~---~~~~C~c~~g~~~~----------~~----C~-~~~c~~~~g~~~c~c~~~~~~ 141 (145)
|. ...|....|. .+.|+... ....|+|.-|+..+ .. |. +..|....+-|+|.+..++.+
T Consensus 85 Cv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 85 CV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCKEGFPG 161 (197)
T ss_dssp EE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-TT-EE
T ss_pred Ec--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecCCCCCC
Confidence 32 2344445565 56775332 23488888887622 23 33 568888999999999999876
Q ss_pred CC
Q psy6349 142 RN 143 (145)
Q Consensus 142 ~~ 143 (145)
..
T Consensus 162 ~~ 163 (197)
T PF06247_consen 162 DG 163 (197)
T ss_dssp ET
T ss_pred CC
Confidence 54
No 13
>KOG1217|consensus
Probab=97.71 E-value=0.00054 Score=49.57 Aligned_cols=132 Identities=33% Similarity=0.872 Sum_probs=87.5
Q ss_pred eeecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcccCCC--CccCCCCC----CCCCCCCCeee--eCCCceEEeCCCC
Q psy6349 4 LIVSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYTGSYC--SIEINECE----SDPCQNGATCQ--DLIGGYSCQCARG 74 (145)
Q Consensus 4 ~~~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~~~~c--~~~~~~c~----~~~c~~~~~c~--~~~~~~~c~c~~g 74 (145)
+..++.+....+ |. ++.|++..+.|.|.|.++|.+..+ ..+...|. ..+|..+..|. .....+.|.+..+
T Consensus 268 ~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~ 346 (487)
T KOG1217|consen 268 CVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPG 346 (487)
T ss_pred eeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCC
Confidence 344556655544 65 479999888899999999998886 22334553 33455555662 2234567889888
Q ss_pred cccCCccccCCCCcCCCCCCCCEEee-CCCCeEeeCCCCCcc-C----------CCCC-CCeeeeCCCceeEecCC
Q psy6349 75 FQGQNCELNVDDCKPNPCQNGGICHD-LINHFSCSCPSGLKR-E----------ALCH-NGTCEDFGNSHRCHCIC 137 (145)
Q Consensus 75 ~~g~~c~~~~~~c~~~~~~~~~~c~~-~~~~~~C~c~~g~~~-~----------~~C~-~~~c~~~~g~~~c~c~~ 137 (145)
+.|..|....+.|....+...+.|.+ ..+.+.|.++.+|.+ . ..|. .+.|.+..+++.|. +.
T Consensus 347 ~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 347 FTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSGCGDCVNGPGGGACT-PP 421 (487)
T ss_pred CCCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccCCcceeccCCCCccc-cC
Confidence 88888863223676666777888988 688899999998765 2 1222 23455666667776 55
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.65 E-value=4.2e-05 Score=36.62 Aligned_cols=19 Identities=32% Similarity=0.546 Sum_probs=9.4
Q ss_pred CeeeeCCCceeEecCCCCC
Q psy6349 122 GTCEDFGNSHRCHCICWDD 140 (145)
Q Consensus 122 ~~c~~~~g~~~c~c~~~~~ 140 (145)
+.|.|+.|+|+|.|++||.
T Consensus 16 ~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 16 GTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SEEEEETTEEEEEESTTEE
T ss_pred CEEEcCCCCEEeeCCCCcE
Confidence 4455555555555555444
No 15
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.40 E-value=0.00013 Score=33.59 Aligned_cols=26 Identities=35% Similarity=0.605 Sum_probs=16.4
Q ss_pred CCC-CCeeeeCCCceeEecCCCCCcCC
Q psy6349 118 LCH-NGTCEDFGNSHRCHCICWDDLRN 143 (145)
Q Consensus 118 ~C~-~~~c~~~~g~~~c~c~~~~~~~~ 143 (145)
.|+ ++.|.++.++|.|+|++||.+++
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCC
Confidence 455 67777777777777777777665
No 16
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.34 E-value=0.00017 Score=32.33 Aligned_cols=23 Identities=52% Similarity=1.295 Sum_probs=10.0
Q ss_pred CCCCCEEeeCC-CCeEeeCCCCCc
Q psy6349 92 CQNGGICHDLI-NHFSCSCPSGLK 114 (145)
Q Consensus 92 ~~~~~~c~~~~-~~~~C~c~~g~~ 114 (145)
|.+++.|+... ..|.|.|++||.
T Consensus 6 C~n~g~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 6 CQNGGTCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp STTTEEEEEESTSEEEEEEBTTEE
T ss_pred CCCCeEEEeCCCCCEEeECCCCCc
Confidence 44444444433 344444444443
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.28 E-value=0.00027 Score=31.63 Aligned_cols=27 Identities=44% Similarity=0.982 Sum_probs=17.6
Q ss_pred CCCCCC-CEeeeCC-CceeeecCCCcccC
Q psy6349 14 EALCHN-GTCEDFG-NSHRCHCMEGYTGS 40 (145)
Q Consensus 14 ~~~c~~-~~c~~~~-~~~~C~c~~g~~~~ 40 (145)
..+|.| +.|++.. +.|.|.|.+||.|.
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 346654 5777666 66777777777654
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.23 E-value=0.001 Score=30.88 Aligned_cols=18 Identities=50% Similarity=0.949 Sum_probs=9.0
Q ss_pred EeeeCCCceeeecCCCcc
Q psy6349 21 TCEDFGNSHRCHCMEGYT 38 (145)
Q Consensus 21 ~c~~~~~~~~C~c~~g~~ 38 (145)
.|++..+.|.|.|..+|.
T Consensus 16 ~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 16 TCVNTVGSYRCECPPGYT 33 (39)
T ss_pred EeECCCCCeEeECCCCCc
Confidence 455444445555555544
No 19
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.19 E-value=0.00024 Score=32.68 Aligned_cols=23 Identities=26% Similarity=0.388 Sum_probs=16.2
Q ss_pred CeeeeCCCceeEecCCCCCcCCC
Q psy6349 122 GTCEDFGNSHRCHCICWDDLRNS 144 (145)
Q Consensus 122 ~~c~~~~g~~~c~c~~~~~~~~~ 144 (145)
..|.+..++|+|.|++||.|..|
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~D 32 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAED 32 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-TT
T ss_pred CCCccCCCceEeECCCCCEECcC
Confidence 36777888888888888887765
No 20
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.18 E-value=0.0012 Score=30.66 Aligned_cols=35 Identities=54% Similarity=1.414 Sum_probs=28.5
Q ss_pred CCCCCC-CCCCCCCeeeeCCCceEEeCCCCcc-cCCc
Q psy6349 46 INECES-DPCQNGATCQDLIGGYSCQCARGFQ-GQNC 80 (145)
Q Consensus 46 ~~~c~~-~~c~~~~~c~~~~~~~~c~c~~g~~-g~~c 80 (145)
+++|.. .+|..++.|.+..++|.|.|..+|. |..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 355655 6788778999999999999999998 7766
No 21
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.08 E-value=0.00049 Score=31.68 Aligned_cols=27 Identities=44% Similarity=1.005 Sum_probs=18.1
Q ss_pred CCCCCC-CCEeeeCCCceeeecCCCccc
Q psy6349 13 REALCH-NGTCEDFGNSHRCHCMEGYTG 39 (145)
Q Consensus 13 ~~~~c~-~~~c~~~~~~~~C~c~~g~~~ 39 (145)
..+.|+ ++.|.+..++|.|.|++||.|
T Consensus 4 ~~~~C~~nA~C~~~~~~~~C~C~~Gy~G 31 (36)
T PF12947_consen 4 NNGGCHPNATCTNTGGSYTCTCKPGYEG 31 (36)
T ss_dssp GGGGS-TTCEEEE-TTSEEEEE-CEEEC
T ss_pred CCCCCCCCcEeecCCCCEEeECCCCCcc
Confidence 445665 678888888888888888875
No 22
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.95 E-value=0.00074 Score=28.02 Aligned_cols=15 Identities=27% Similarity=0.321 Sum_probs=11.7
Q ss_pred ceeEecCCCCCcCCC
Q psy6349 130 SHRCHCICWDDLRNS 144 (145)
Q Consensus 130 ~~~c~c~~~~~~~~~ 144 (145)
+|+|.|++||+|.++
T Consensus 1 sy~C~C~~Gy~l~~d 15 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPD 15 (24)
T ss_pred CEEeeCCCCCcCCCC
Confidence 578899999988654
No 23
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.81 E-value=0.004 Score=28.44 Aligned_cols=19 Identities=53% Similarity=1.004 Sum_probs=8.4
Q ss_pred EeeeCCCceeeecCCCccc
Q psy6349 21 TCEDFGNSHRCHCMEGYTG 39 (145)
Q Consensus 21 ~c~~~~~~~~C~c~~g~~~ 39 (145)
.|.+..+.|.|.|..+|.|
T Consensus 16 ~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 16 TCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred EeECCCCCeEeECCCCCcC
Confidence 4444444444444444443
No 24
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.46 E-value=0.0098 Score=26.79 Aligned_cols=25 Identities=48% Similarity=0.975 Sum_probs=16.2
Q ss_pred CCCCCCEeeeCCCceeeecCCCccc
Q psy6349 15 ALCHNGTCEDFGNSHRCHCMEGYTG 39 (145)
Q Consensus 15 ~~c~~~~c~~~~~~~~C~c~~g~~~ 39 (145)
.+|.++.|.+..+.|.|.|..||.+
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCcc
Confidence 3454336776666777777777766
No 25
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.19 E-value=0.014 Score=26.45 Aligned_cols=24 Identities=46% Similarity=1.222 Sum_probs=11.8
Q ss_pred CCCCCEEeeCCCCeEeeCCCCCcc
Q psy6349 92 CQNGGICHDLINHFSCSCPSGLKR 115 (145)
Q Consensus 92 ~~~~~~c~~~~~~~~C~c~~g~~~ 115 (145)
|..++.|.+..+.|.|.|+++|.+
T Consensus 11 C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 11 CQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred cCCCCEeECCCCCeEeECCCCCcC
Confidence 333444554445555555555543
No 26
>KOG1226|consensus
Probab=96.12 E-value=0.068 Score=40.96 Aligned_cols=79 Identities=27% Similarity=0.683 Sum_probs=46.7
Q ss_pred eeecCCCcccCCCCccCC---------CCCC----CCCCCCCeeeeCCCceEEeCCCCcc----cCCccccCCCCcC---
Q psy6349 30 RCHCMEGYTGSYCSIEIN---------ECES----DPCQNGATCQDLIGGYSCQCARGFQ----GQNCELNVDDCKP--- 89 (145)
Q Consensus 30 ~C~c~~g~~~~~c~~~~~---------~c~~----~~c~~~~~c~~~~~~~~c~c~~g~~----g~~c~~~~~~c~~--- 89 (145)
.|.|.+||.|..|+...+ .|+. .+|+..+.|.= ..|.|.+... |+.|.-+.-.|..
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~~i~G~fCECDnfsC~r~~g 554 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNGKIYGKFCECDNFSCERHKG 554 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCCceeeeeeeccCcccccccC
Confidence 479999999999874322 2321 24555555531 1366655443 6666533333322
Q ss_pred CCCCCCCEEeeCCCCeEeeCCCCCccC
Q psy6349 90 NPCQNGGICHDLINHFSCSCPSGLKRE 116 (145)
Q Consensus 90 ~~~~~~~~c~~~~~~~~C~c~~g~~~~ 116 (145)
..|..++.|.-. +|.|.+||++.
T Consensus 555 ~lC~g~G~C~CG----~CvC~~GwtG~ 577 (783)
T KOG1226|consen 555 VLCGGHGRCECG----RCVCNPGWTGS 577 (783)
T ss_pred cccCCCCeEeCC----cEEcCCCCccC
Confidence 246777777443 78999999886
No 27
>KOG4260|consensus
Probab=96.00 E-value=0.006 Score=40.87 Aligned_cols=34 Identities=24% Similarity=0.455 Sum_probs=17.6
Q ss_pred eecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcc
Q psy6349 5 IVSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYT 38 (145)
Q Consensus 5 ~~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~ 38 (145)
.+++.|...+.+|. +..|+|+.|+|.|....||.
T Consensus 234 vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~ 268 (350)
T KOG4260|consen 234 VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYK 268 (350)
T ss_pred ccHHHHhcCCCCCChhheeecCCCceEeccccccc
Confidence 34445554555553 34555555555555555554
No 28
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.97 E-value=0.023 Score=25.33 Aligned_cols=20 Identities=35% Similarity=0.439 Sum_probs=9.6
Q ss_pred CeeeeCCCceeEecCCCCCc
Q psy6349 122 GTCEDFGNSHRCHCICWDDL 141 (145)
Q Consensus 122 ~~c~~~~g~~~c~c~~~~~~ 141 (145)
+.|.+..+.|.|.|+.||.+
T Consensus 12 ~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 12 GTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CEEecCCCCeEeECCCCCcc
Confidence 44444444455555555443
No 29
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.93 E-value=0.024 Score=25.44 Aligned_cols=19 Identities=42% Similarity=1.184 Sum_probs=8.9
Q ss_pred eeeeCCCceEEeCCCCccc
Q psy6349 59 TCQDLIGGYSCQCARGFQG 77 (145)
Q Consensus 59 ~c~~~~~~~~c~c~~g~~g 77 (145)
.|.+..+.+.|.|..+|.+
T Consensus 12 ~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 12 TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred EEECCCCCeEeECCCCCcc
Confidence 4444444445555554443
No 30
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.80 E-value=0.0089 Score=27.45 Aligned_cols=23 Identities=35% Similarity=0.795 Sum_probs=17.8
Q ss_pred CCCEEeeCCCCeEeeCCCCCccC
Q psy6349 94 NGGICHDLINHFSCSCPSGLKRE 116 (145)
Q Consensus 94 ~~~~c~~~~~~~~C~c~~g~~~~ 116 (145)
..+.|++..++|+|.|++||...
T Consensus 8 C~h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 8 CSHICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSSEEEEETTSEEEE-STTEEE-
T ss_pred cCCCCccCCCceEeECCCCCEEC
Confidence 45688888999999999999864
No 31
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.77 E-value=0.0061 Score=21.38 Aligned_cols=12 Identities=50% Similarity=1.431 Sum_probs=6.7
Q ss_pred eecCCCcccCCC
Q psy6349 31 CHCMEGYTGSYC 42 (145)
Q Consensus 31 C~c~~g~~~~~c 42 (145)
|.|++||+|..|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 566666666544
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.63 E-value=0.04 Score=24.48 Aligned_cols=20 Identities=55% Similarity=1.062 Sum_probs=9.5
Q ss_pred CEeeeCCCceeeecCCCccc
Q psy6349 20 GTCEDFGNSHRCHCMEGYTG 39 (145)
Q Consensus 20 ~~c~~~~~~~~C~c~~g~~~ 39 (145)
+.|.+..+.|.|.|..+|.+
T Consensus 12 ~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 12 GTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CEEecCCCCeEeECCCCCcc
Confidence 44544444444555544443
No 33
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=95.25 E-value=0.0086 Score=27.49 Aligned_cols=24 Identities=21% Similarity=0.479 Sum_probs=15.1
Q ss_pred CCeeeeCC-CceeEecCCCCCcCCC
Q psy6349 121 NGTCEDFG-NSHRCHCICWDDLRNS 144 (145)
Q Consensus 121 ~~~c~~~~-g~~~c~c~~~~~~~~~ 144 (145)
++.|.+.. |.+.|+|.+||+..++
T Consensus 10 NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 10 NAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp TEEEEEETTSEEEEEE-TTEEEETT
T ss_pred CcccEEcCCCCEEEEeeCCccccCC
Confidence 66676655 6666777777766543
No 34
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.80 E-value=0.054 Score=24.12 Aligned_cols=26 Identities=46% Similarity=1.057 Sum_probs=19.0
Q ss_pred CCCC-CCEeeeCCCceeeecCCCcccCCC
Q psy6349 15 ALCH-NGTCEDFGNSHRCHCMEGYTGSYC 42 (145)
Q Consensus 15 ~~c~-~~~c~~~~~~~~C~c~~g~~~~~c 42 (145)
..|. +|+|+... ..|.|.++|.|..|
T Consensus 6 ~~C~~~G~C~~~~--g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPC--GRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCC--CEEECCCCCcCCCC
Confidence 3575 58898652 37999999988765
No 35
>KOG0994|consensus
Probab=93.99 E-value=0.38 Score=39.06 Aligned_cols=13 Identities=31% Similarity=1.004 Sum_probs=10.9
Q ss_pred eeecCCCcccCCC
Q psy6349 30 RCHCMEGYTGSYC 42 (145)
Q Consensus 30 ~C~c~~g~~~~~c 42 (145)
.|.|++..+|..|
T Consensus 783 qCqCkPnVVGR~C 795 (1758)
T KOG0994|consen 783 QCQCKPNVVGRRC 795 (1758)
T ss_pred eecccCccccccc
Confidence 6888888888877
No 36
>PHA02887 EGF-like protein; Provisional
Probab=91.97 E-value=0.2 Score=29.28 Aligned_cols=27 Identities=41% Similarity=0.979 Sum_probs=15.2
Q ss_pred CCCCEeeeC--CCceeeecCCCcccCCCC
Q psy6349 17 CHNGTCEDF--GNSHRCHCMEGYTGSYCS 43 (145)
Q Consensus 17 c~~~~c~~~--~~~~~C~c~~g~~~~~c~ 43 (145)
|.||.|.-. .....|.|..||.|.+|+
T Consensus 94 CiHG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 94 CINGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred eeCCEEEccccCCCceeECCCCcccCCCC
Confidence 445555422 234567777777776664
No 37
>KOG1226|consensus
Probab=91.96 E-value=0.58 Score=36.21 Aligned_cols=67 Identities=31% Similarity=0.728 Sum_probs=38.0
Q ss_pred EEeCCCCcccCCccccC---------CCCc----CCCCCCCCEEeeCCCCeEeeCCCCCc----cC--------------
Q psy6349 68 SCQCARGFQGQNCELNV---------DDCK----PNPCQNGGICHDLINHFSCSCPSGLK----RE-------------- 116 (145)
Q Consensus 68 ~c~c~~g~~g~~c~~~~---------~~c~----~~~~~~~~~c~~~~~~~~C~c~~g~~----~~-------------- 116 (145)
.|.|.+||.|+.|.-.. +.|. ...|...+.|.=. +|.|.+... +.
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~fCECDnfsC~r~~g 554 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGKFCECDNFSCERHKG 554 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeeeeeeccCcccccccC
Confidence 47899999888764211 1221 1246666666432 566655433 11
Q ss_pred CCCC-CCeeeeCCCceeEecCCCCCcC
Q psy6349 117 ALCH-NGTCEDFGNSHRCHCICWDDLR 142 (145)
Q Consensus 117 ~~C~-~~~c~~~~g~~~c~c~~~~~~~ 142 (145)
..|. +++|.= | +|+|.+||++.
T Consensus 555 ~lC~g~G~C~C--G--~CvC~~GwtG~ 577 (783)
T KOG1226|consen 555 VLCGGHGRCEC--G--RCVCNPGWTGS 577 (783)
T ss_pred cccCCCCeEeC--C--cEEcCCCCccC
Confidence 2455 555532 2 59999999875
No 38
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=90.04 E-value=0.31 Score=32.10 Aligned_cols=23 Identities=22% Similarity=0.254 Sum_probs=20.0
Q ss_pred CeeeeCCCceeEecCCCCCcCCC
Q psy6349 122 GTCEDFGNSHRCHCICWDDLRNS 144 (145)
Q Consensus 122 ~~c~~~~g~~~c~c~~~~~~~~~ 144 (145)
..|.+..|+|.|.|..||++..+
T Consensus 199 ~~C~~~~g~~~c~c~~g~~~~~~ 221 (224)
T cd01475 199 QVCISTPGSYLCACTEGYALLED 221 (224)
T ss_pred ceEEcCCCCEEeECCCCccCCCC
Confidence 47999999999999999998654
No 39
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=89.88 E-value=0.35 Score=28.80 Aligned_cols=27 Identities=37% Similarity=0.907 Sum_probs=18.0
Q ss_pred CCCCEeeeC--CCceeeecCCCcccCCCC
Q psy6349 17 CHNGTCEDF--GNSHRCHCMEGYTGSYCS 43 (145)
Q Consensus 17 c~~~~c~~~--~~~~~C~c~~g~~~~~c~ 43 (145)
|.||.|.-. ...+.|.|..||+|.+|+
T Consensus 53 ClHG~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 53 CLHGDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred eECCEEEeeccCCCceeECCCCccccccc
Confidence 455666533 245678888888888885
No 40
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.93 E-value=0.69 Score=30.47 Aligned_cols=33 Identities=18% Similarity=0.398 Sum_probs=24.4
Q ss_pred cCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCcc
Q psy6349 83 NVDDCKPNPCQNGGICHDLINHFSCSCPSGLKR 115 (145)
Q Consensus 83 ~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~ 115 (145)
+.++|..........|.++.|+|.|.|.+||..
T Consensus 186 ~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 186 VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 455665433334468999999999999999975
No 41
>KOG0994|consensus
Probab=87.99 E-value=1.2 Score=36.51 Aligned_cols=18 Identities=22% Similarity=0.840 Sum_probs=11.2
Q ss_pred EEeeCCCCeEeeCCCCCccC
Q psy6349 97 ICHDLINHFSCSCPSGLKRE 116 (145)
Q Consensus 97 ~c~~~~~~~~C~c~~g~~~~ 116 (145)
.|....| +|.|.+||.+.
T Consensus 1078 qCN~ftG--QCqCkpGfGGR 1095 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGR 1095 (1758)
T ss_pred ccccccc--ceeccCCCCCc
Confidence 4433334 68888888653
No 42
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=83.11 E-value=0.92 Score=20.38 Aligned_cols=13 Identities=23% Similarity=0.340 Sum_probs=8.3
Q ss_pred eEecCCCCCcCCC
Q psy6349 132 RCHCICWDDLRNS 144 (145)
Q Consensus 132 ~c~c~~~~~~~~~ 144 (145)
.|.|++||-|..+
T Consensus 19 ~C~CPeGyIlde~ 31 (34)
T PF09064_consen 19 QCFCPEGYILDEG 31 (34)
T ss_pred ceeCCCceEecCC
Confidence 5777777766544
No 43
>KOG3512|consensus
Probab=83.03 E-value=14 Score=27.50 Aligned_cols=28 Identities=21% Similarity=0.597 Sum_probs=19.2
Q ss_pred CCC-C-CEeeeC-CCceeeecCCCcccCCCC
Q psy6349 16 LCH-N-GTCEDF-GNSHRCHCMEGYTGSYCS 43 (145)
Q Consensus 16 ~c~-~-~~c~~~-~~~~~C~c~~g~~~~~c~ 43 (145)
+|+ | ..|+-- .+...|.|...-.|+.|.
T Consensus 279 KCNgHAs~Cv~d~~~~ltCdC~HNTaGPdCg 309 (592)
T KOG3512|consen 279 KCNGHASRCVMDESSHLTCDCEHNTAGPDCG 309 (592)
T ss_pred eecCccceeeeccCCceEEecccCCCCCCcc
Confidence 354 4 367744 445889999888888874
No 44
>smart00051 DSL delta serrate ligand.
Probab=71.02 E-value=13 Score=19.34 Aligned_cols=45 Identities=31% Similarity=0.700 Sum_probs=22.8
Q ss_pred eeecCCCcccCCCCccCCCCCCC-CCCCCCeeeeCCCceEEeCCCCcccCCc
Q psy6349 30 RCHCMEGYTGSYCSIEINECESD-PCQNGATCQDLIGGYSCQCARGFQGQNC 80 (145)
Q Consensus 30 ~C~c~~g~~~~~c~~~~~~c~~~-~c~~~~~c~~~~~~~~c~c~~g~~g~~c 80 (145)
.-.|..+|.|..|. ..|... ....+..|.. .+ .++|.+||.|+.|
T Consensus 18 rv~C~~~~yG~~C~---~~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCN---KFCRPRDDFFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccC---CEeCcCccccCCccCCc-CC--CEecCCCCcCCCC
Confidence 44677777777773 233221 1223344532 12 3567777766543
No 45
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=66.98 E-value=5.8 Score=22.97 Aligned_cols=19 Identities=37% Similarity=0.773 Sum_probs=9.0
Q ss_pred CCeeeeCCCceeEecCCCCC
Q psy6349 121 NGTCEDFGNSHRCHCICWDD 140 (145)
Q Consensus 121 ~~~c~~~~g~~~c~c~~~~~ 140 (145)
++.|. ......|.|.+||+
T Consensus 89 ~g~C~-~~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 89 NGICN-SNNSPKCSCLPGFE 107 (110)
T ss_pred ccEeC-CCCCCceECCCCcC
Confidence 44552 22333466666654
No 46
>KOG1836|consensus
Probab=61.33 E-value=22 Score=31.13 Aligned_cols=30 Identities=40% Similarity=1.016 Sum_probs=19.7
Q ss_pred CcCCCCCCCCEEeeCC--CCeEee-CCCCCccC
Q psy6349 87 CKPNPCQNGGICHDLI--NHFSCS-CPSGLKRE 116 (145)
Q Consensus 87 c~~~~~~~~~~c~~~~--~~~~C~-c~~g~~~~ 116 (145)
|.+-.|...+.|..+. ....|. |+++|++.
T Consensus 777 C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~ 809 (1705)
T KOG1836|consen 777 CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL 809 (1705)
T ss_pred CccCCCCCChhhcCcCcccceecCCCCCCCccc
Confidence 6666666666665433 345787 88888765
No 47
>KOG3516|consensus
Probab=60.61 E-value=9.9 Score=31.57 Aligned_cols=35 Identities=29% Similarity=0.741 Sum_probs=28.1
Q ss_pred CCCCCCCC-CEeeeCCCceeeecC-CCcccCCCCccC
Q psy6349 12 KREALCHN-GTCEDFGNSHRCHCM-EGYTGSYCSIEI 46 (145)
Q Consensus 12 ~~~~~c~~-~~c~~~~~~~~C~c~-~g~~~~~c~~~~ 46 (145)
|.+++|.| |.|.+.-..|.|.|. .||.|..|.+.+
T Consensus 548 ClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi 584 (1306)
T KOG3516|consen 548 CLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSI 584 (1306)
T ss_pred cCCccccCCCcccccccceeEeccccccccccccCCC
Confidence 47788986 689987778999997 889998886544
No 48
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=55.59 E-value=14 Score=17.81 Aligned_cols=22 Identities=36% Similarity=0.828 Sum_probs=14.6
Q ss_pred CEeeeCCCceeeecCCCcccCCCC
Q psy6349 20 GTCEDFGNSHRCHCMEGYTGSYCS 43 (145)
Q Consensus 20 ~~c~~~~~~~~C~c~~g~~~~~c~ 43 (145)
..|.... ..|.|.++|.|..|.
T Consensus 11 ~~C~~~~--G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPST--GQCVCKPGTTGPRCD 32 (49)
T ss_dssp SSEEETC--EEESBSTTEESTTS-
T ss_pred CcccCCC--CEEeccccccCCcCc
Confidence 3565543 368888888888874
No 49
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=54.56 E-value=25 Score=17.04 Aligned_cols=13 Identities=38% Similarity=1.027 Sum_probs=10.4
Q ss_pred eeecCCCcccCCC
Q psy6349 30 RCHCMEGYTGSYC 42 (145)
Q Consensus 30 ~C~c~~g~~~~~c 42 (145)
.|.|.+++.|..|
T Consensus 20 ~C~C~~~~~G~~C 32 (50)
T cd00055 20 QCECKPNTTGRRC 32 (50)
T ss_pred EEeCCCcCCCCCC
Confidence 5788888888877
No 50
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=43.83 E-value=25 Score=17.10 Aligned_cols=9 Identities=44% Similarity=1.327 Sum_probs=4.9
Q ss_pred EeeCCCCCc
Q psy6349 106 SCSCPSGLK 114 (145)
Q Consensus 106 ~C~c~~g~~ 114 (145)
.|.|+++|.
T Consensus 38 ~C~C~~g~~ 46 (52)
T PF01683_consen 38 RCQCPPGYV 46 (52)
T ss_pred EeECCCCCE
Confidence 455555554
No 51
>KOG3516|consensus
Probab=41.71 E-value=32 Score=28.88 Aligned_cols=37 Identities=27% Similarity=0.987 Sum_probs=30.5
Q ss_pred CCCCCCCCCCCCCeeeeCCCceEEeCC-CCcccCCccc
Q psy6349 46 INECESDPCQNGATCQDLIGGYSCQCA-RGFQGQNCEL 82 (145)
Q Consensus 46 ~~~c~~~~c~~~~~c~~~~~~~~c~c~-~g~~g~~c~~ 82 (145)
.+.|.+++|..++.|.-+-..+.|.|. .||.|..|..
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccccccC
Confidence 466888999999999887788999996 8898877753
No 52
>KOG3514|consensus
Probab=34.36 E-value=35 Score=28.53 Aligned_cols=35 Identities=46% Similarity=1.161 Sum_probs=29.8
Q ss_pred CCCCCCCCCCCeeeeCCCceEEeCC-CCcccCCccc
Q psy6349 48 ECESDPCQNGATCQDLIGGYSCQCA-RGFQGQNCEL 82 (145)
Q Consensus 48 ~c~~~~c~~~~~c~~~~~~~~c~c~-~g~~g~~c~~ 82 (145)
.|.++||.+++.|...-+.+.|-|. .+|.|+.|.+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 6888999999999888888999985 6888988875
No 53
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=30.67 E-value=70 Score=15.18 Aligned_cols=13 Identities=38% Similarity=1.048 Sum_probs=10.2
Q ss_pred eeecCCCcccCCC
Q psy6349 30 RCHCMEGYTGSYC 42 (145)
Q Consensus 30 ~C~c~~g~~~~~c 42 (145)
.|.|.+++.|..|
T Consensus 19 ~C~C~~~~~G~~C 31 (46)
T smart00180 19 QCECKPNVTGRRC 31 (46)
T ss_pred EEECCCCCCCCCC
Confidence 5788888888777
No 54
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=26.72 E-value=97 Score=18.03 Aligned_cols=26 Identities=27% Similarity=0.759 Sum_probs=16.6
Q ss_pred CCCCCCCEEeeCC-----CCeEeeCCCCCcc
Q psy6349 90 NPCQNGGICHDLI-----NHFSCSCPSGLKR 115 (145)
Q Consensus 90 ~~~~~~~~c~~~~-----~~~~C~c~~g~~~ 115 (145)
..|..++.|++.. .-|.|.|.+.+..
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 4577788887653 2367777765543
No 55
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=25.21 E-value=50 Score=16.71 Aligned_cols=17 Identities=29% Similarity=0.970 Sum_probs=7.8
Q ss_pred CceEEeCCCCcccCCcc
Q psy6349 65 GGYSCQCARGFQGQNCE 81 (145)
Q Consensus 65 ~~~~c~c~~g~~g~~c~ 81 (145)
+...|.|..-|.|+.|+
T Consensus 34 G~p~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 34 GSPVCECNSCYGGPDCS 50 (56)
T ss_dssp TEE--EE-TTEESTTS-
T ss_pred CCccccccCCcCCCCcc
Confidence 33456666666666665
Done!