Query psy9688
Match_columns 181
No_of_seqs 112 out of 1221
Neff 10.3
Searched_HMMs 46136
Date Fri Aug 16 17:03:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9688.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9688hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.6 6.6E-16 1.4E-20 133.7 9.1 106 43-181 3865-3971(4289)
2 KOG1214|consensus 99.4 8.1E-12 1.8E-16 101.3 10.8 139 16-181 692-856 (1289)
3 KOG4289|consensus 99.3 9.8E-12 2.1E-16 105.4 6.2 91 57-180 1216-1308(2531)
4 KOG1219|consensus 99.1 4.9E-10 1.1E-14 98.7 7.8 98 17-145 3865-3977(4289)
5 KOG4289|consensus 99.0 1.5E-09 3.2E-14 92.6 8.2 74 2-77 1225-1316(2531)
6 KOG1214|consensus 98.9 1.3E-08 2.7E-13 83.3 9.8 105 9-145 728-862 (1289)
7 KOG1217|consensus 98.6 1.3E-06 2.8E-11 69.6 13.0 175 2-181 155-386 (487)
8 KOG1225|consensus 98.6 4.5E-07 9.7E-12 72.2 10.1 80 50-145 256-341 (525)
9 KOG1217|consensus 98.5 4.1E-06 8.9E-11 66.7 12.4 122 43-181 170-301 (487)
10 PF07645 EGF_CA: Calcium-bindi 98.3 7.3E-07 1.6E-11 46.9 2.6 32 148-181 1-33 (42)
11 smart00179 EGF_CA Calcium-bind 98.0 1.7E-05 3.6E-10 40.7 4.4 36 41-76 1-38 (39)
12 PF00008 EGF: EGF-like domain 98.0 4.2E-06 9.1E-11 41.2 1.4 30 45-74 1-31 (32)
13 PF00008 EGF: EGF-like domain 97.9 9E-06 2E-10 40.0 2.0 31 112-143 1-32 (32)
14 KOG4260|consensus 97.9 1.4E-05 3E-10 58.0 3.5 124 47-181 149-303 (350)
15 KOG1225|consensus 97.8 0.00015 3.3E-09 58.1 8.9 96 63-181 234-335 (525)
16 PF07645 EGF_CA: Calcium-bindi 97.8 1.9E-05 4.1E-10 41.4 2.0 32 41-72 1-34 (42)
17 smart00179 EGF_CA Calcium-bind 97.7 7.2E-05 1.6E-09 38.3 4.1 35 109-144 2-38 (39)
18 cd00054 EGF_CA Calcium-binding 97.7 0.00011 2.3E-09 37.2 4.3 35 42-76 2-37 (38)
19 cd00054 EGF_CA Calcium-binding 97.4 0.00033 7.2E-09 35.3 4.0 34 110-144 3-37 (38)
20 KOG4260|consensus 97.3 0.00014 3.1E-09 52.9 2.1 69 3-72 222-304 (350)
21 cd00053 EGF Epidermal growth f 97.2 0.00097 2.1E-08 33.0 4.2 30 47-76 5-35 (36)
22 smart00181 EGF Epidermal growt 97.2 0.001 2.2E-08 33.1 4.1 31 45-76 2-34 (35)
23 KOG1226|consensus 97.1 0.0097 2.1E-07 49.5 10.9 107 49-165 468-603 (783)
24 PF12947 EGF_3: EGF domain; I 96.9 0.00058 1.3E-08 34.4 1.6 24 156-181 6-29 (36)
25 cd00053 EGF Epidermal growth f 96.9 0.0025 5.3E-08 31.5 3.9 30 114-144 5-35 (36)
26 PF12662 cEGF: Complement Clr- 96.8 0.0014 3.1E-08 29.7 2.3 20 62-82 1-24 (24)
27 smart00181 EGF Epidermal growt 96.8 0.003 6.4E-08 31.3 3.8 28 115-144 6-34 (35)
28 PF12947 EGF_3: EGF domain; I 96.8 0.0013 2.7E-08 33.1 2.0 26 116-142 7-32 (36)
29 KOG1226|consensus 96.7 0.026 5.6E-07 47.1 9.9 89 49-146 515-621 (783)
30 PF12662 cEGF: Complement Clr- 96.5 0.0027 5.7E-08 28.8 2.1 20 130-151 1-24 (24)
31 PF12661 hEGF: Human growth fa 96.3 0.0022 4.7E-08 24.6 0.9 13 132-144 1-13 (13)
32 PF07974 EGF_2: EGF-like domai 95.7 0.02 4.4E-07 27.9 3.1 27 48-76 6-32 (32)
33 PF07974 EGF_2: EGF-like domai 95.2 0.031 6.7E-07 27.2 2.8 26 116-144 7-32 (32)
34 PF14670 FXa_inhibition: Coagu 95.2 0.016 3.6E-07 29.0 1.8 21 157-181 7-27 (36)
35 PF14670 FXa_inhibition: Coagu 93.1 0.071 1.5E-06 26.7 1.5 19 54-72 10-28 (36)
36 cd01475 vWA_Matrilin VWA_Matri 92.5 0.16 3.6E-06 36.7 3.5 35 141-181 181-216 (224)
37 PF12946 EGF_MSP1_1: MSP1 EGF 92.3 0.099 2.2E-06 26.2 1.4 28 113-140 3-30 (37)
38 KOG0994|consensus 88.9 1.7 3.7E-05 38.8 6.5 17 62-78 933-949 (1758)
39 cd01475 vWA_Matrilin VWA_Matri 88.7 0.56 1.2E-05 33.9 3.3 35 39-73 184-218 (224)
40 PF06247 Plasmod_Pvs28: Plasmo 88.0 0.17 3.7E-06 35.2 0.3 48 130-181 109-159 (197)
41 KOG3516|consensus 87.7 0.59 1.3E-05 41.4 3.2 44 39-82 542-586 (1306)
42 PHA03099 epidermal growth fact 87.4 0.81 1.8E-05 29.8 3.0 28 49-77 52-81 (139)
43 PHA02887 EGF-like protein; Pro 87.2 0.86 1.9E-05 29.2 3.0 27 50-77 94-122 (126)
44 smart00051 DSL delta serrate l 85.5 1.7 3.7E-05 24.7 3.4 42 131-181 17-58 (63)
45 PF06247 Plasmod_Pvs28: Plasmo 83.0 0.61 1.3E-05 32.6 1.0 88 48-140 50-160 (197)
46 PHA03099 epidermal growth fact 80.2 2.1 4.6E-05 27.9 2.7 33 148-181 41-75 (139)
47 PHA02887 EGF-like protein; Pro 79.1 2.6 5.6E-05 27.1 2.7 31 150-181 84-116 (126)
48 KOG3514|consensus 68.2 4.3 9.2E-05 36.2 2.3 37 44-80 625-662 (1591)
49 KOG0994|consensus 64.2 27 0.00059 31.9 6.3 58 123-181 878-942 (1758)
50 KOG3516|consensus 62.4 7 0.00015 35.2 2.6 38 108-146 544-582 (1306)
51 PF01683 EB: EB module; Inter 61.3 21 0.00045 19.0 3.6 20 49-72 27-46 (52)
52 PF12955 DUF3844: Domain of un 60.8 12 0.00026 23.6 2.8 22 116-137 14-39 (103)
53 PF00954 S_locus_glycop: S-loc 57.7 15 0.00033 23.2 3.0 31 148-181 76-106 (110)
54 KOG1836|consensus 55.7 31 0.00066 33.0 5.5 45 134-181 760-806 (1705)
55 cd00055 EGF_Lam Laminin-type e 53.8 19 0.00042 19.1 2.6 14 64-77 20-33 (50)
56 PF00053 Laminin_EGF: Laminin 51.5 9.7 0.00021 20.1 1.2 16 130-145 17-32 (49)
57 KOG3514|consensus 47.6 16 0.00034 32.9 2.3 36 110-146 624-660 (1591)
58 PF09064 Tme5_EGF_like: Thromb 47.2 12 0.00026 18.4 1.0 8 174-181 19-26 (34)
59 smart00180 EGF_Lam Laminin-typ 36.2 31 0.00067 18.0 1.6 15 131-145 18-32 (46)
60 KOG1218|consensus 32.7 83 0.0018 23.7 4.1 39 131-170 162-200 (316)
61 KOG3509|consensus 22.3 2.4E+02 0.0053 25.6 5.4 36 110-146 407-442 (964)
62 KOG3512|consensus 21.7 4.6E+02 0.01 21.8 6.3 25 54-78 285-310 (592)
63 PF04863 EGF_alliinase: Alliin 21.1 49 0.0011 18.3 0.7 18 129-146 34-51 (56)
No 1
>KOG1219|consensus
Probab=99.64 E-value=6.6e-16 Score=133.73 Aligned_cols=106 Identities=38% Similarity=0.940 Sum_probs=92.1
Q ss_pred cCCCCCCCCCCCEEecCC-CCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCC
Q psy9688 43 DHCLTDPCLGHGTCINRS-DRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDC 121 (181)
Q Consensus 43 ~~c~~~~c~~~~~C~~~~-~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~ 121 (181)
++|..+||+++|+|.... +.|.|.|++.|+|+.|+.++.+| ..+||..++
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC-----------------------------~snPC~~Gg 3915 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPC-----------------------------ASNPCLTGG 3915 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccc-----------------------------cCCCCCCCC
Confidence 789999999999998754 57999999999999998665444 455667777
Q ss_pred EEeeCCCCceEeecCCCCccCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 122 VCVDDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 122 ~C~~~~~~~~~C~C~~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
+|.. ..+.|.|.|+.||+|+.|+...+++|.. ++|.++|.|+|++|+ |+|.|-+||
T Consensus 3916 tCip-~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~-n~C~~gg~C~n~~gs--f~CncT~g~ 3971 (4289)
T KOG1219|consen 3916 TCIP-FYNGFLCNCPNGYTGKRCEARGISECSK-NVCGTGGQCINIPGS--FHCNCTPGI 3971 (4289)
T ss_pred EEEe-cCCCeeEeCCCCccCceeeccccccccc-ccccCCceeeccCCc--eEeccChhH
Confidence 8877 7888999999999999999855899988 999999999999999 999998764
No 2
>KOG1214|consensus
Probab=99.36 E-value=8.1e-12 Score=101.27 Aligned_cols=139 Identities=27% Similarity=0.693 Sum_probs=86.6
Q ss_pred cccccCCCCCC-CCceeecCCC-Cc--------------cccCcCCCC--CCCCCCCEEecCCCCeeEeCCCCCc----C
Q psy9688 16 INECETVKDVC-NYGICVNTNG-NG--------------NSKVDHCLT--DPCLGHGTCINRSDRYQCLCTPRYS----G 73 (181)
Q Consensus 16 ~~~c~~~~~~C-~~~~C~~~~~-~~--------------~~~~~~c~~--~~c~~~~~C~~~~~~~~C~c~~g~~----g 73 (181)
+++|..+.+.| ....|....+ .| +.|.++|.. ..|..+..|++..++|.|.|..||. +
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~ 771 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDR 771 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCC
Confidence 45555555556 5556654432 11 455555543 3477777777777777777777764 2
Q ss_pred CCCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCC--CCEEeeCCCCceEeecCCCCcc--CCccccCC
Q psy9688 74 NNCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLN--DCVCVDDGNLSFYCNCTEDFTG--EYCQFENS 149 (181)
Q Consensus 74 ~~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~--~~~C~~~~~~~~~C~C~~g~~g--~~c~~~~~ 149 (181)
..|..-.+.-..++|..+ ...|.- .+.|+....+.|+|+|.+||+| ..|. ++
T Consensus 772 ~tCV~i~~pap~n~Ce~g----------------------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~--dv 827 (1289)
T KOG1214|consen 772 HTCVLITPPAPANPCEDG----------------------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT--DV 827 (1289)
T ss_pred cceEEecCCCCCCccccC----------------------ccccCcCCceEEEecCCceEEEeecCCccCCccccc--cc
Confidence 333311111112233221 112332 3455553456799999999985 5565 78
Q ss_pred CCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 150 AACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 150 ~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
|+|.. +-|+..++|++++++ |.|.|++||
T Consensus 828 DeC~p-srChp~A~Cyntpgs--fsC~C~pGy 856 (1289)
T KOG1214|consen 828 DECSP-SRCHPAATCYNTPGS--FSCRCQPGY 856 (1289)
T ss_pred cccCc-cccCCCceEecCCCc--ceeecccCc
Confidence 99987 889999999999998 889999997
No 3
>KOG4289|consensus
Probab=99.25 E-value=9.8e-12 Score=105.36 Aligned_cols=91 Identities=32% Similarity=0.775 Sum_probs=72.8
Q ss_pred ecCCCCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecC
Q psy9688 57 INRSDRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCT 136 (181)
Q Consensus 57 ~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~ 136 (181)
++..+...|.|++||+|..|+.+|+.| -+.||.+++.|.. ..+.|+|.|.
T Consensus 1216 i~pvnglrCrCPpGFTgd~CeTeiDlC-----------------------------Ys~pC~nng~C~s-rEggYtCeCr 1265 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYCETEIDLC-----------------------------YSGPCGNNGRCRS-REGGYTCECR 1265 (2531)
T ss_pred ccccCceeEeCCCCCCcccccchhHhh-----------------------------hcCCCCCCCceEE-ecCceeEEec
Confidence 345677899999999999999776555 3456677778887 8999999999
Q ss_pred CCCccCCccccCC--CCCCCCCCCCCCCEEeeCCCCcceEEecCCC
Q psy9688 137 EDFTGEYCQFENS--AACVTLNPCQNNATCVASPGEKQITCLCLKE 180 (181)
Q Consensus 137 ~g~~g~~c~~~~~--~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g 180 (181)
+||+|.+|+. +. -.|.. ..|.++++|.+..-+ -|.|+|+.|
T Consensus 1266 pg~tGehCEv-s~~agrCvp-GvC~nggtC~~~~ng-gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1266 PGFTGEHCEV-SARAGRCVP-GVCKNGGTCVNLLNG-GFCCHCPYG 1308 (2531)
T ss_pred CCccccceee-ecccCcccc-ceecCCCEEeecCCC-ceeccCCCc
Confidence 9999999985 33 24766 889999999876543 388999876
No 4
>KOG1219|consensus
Probab=99.06 E-value=4.9e-10 Score=98.75 Aligned_cols=98 Identities=32% Similarity=0.789 Sum_probs=82.6
Q ss_pred ccccCCCCCC-CCceeecCCC-Cc-------------cccCcCCCCCCCCCCCEEecCCCCeeEeCCCCCcCCCCCCCCC
Q psy9688 17 NECETVKDVC-NYGICVNTNG-NG-------------NSKVDHCLTDPCLGHGTCINRSDRYQCLCTPRYSGNNCEKDNG 81 (181)
Q Consensus 17 ~~c~~~~~~C-~~~~C~~~~~-~~-------------~~~~~~c~~~~c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~~ 81 (181)
++|. .++| .++.|...++ +| ..++.+|.++||..+++|+...+.|.|.|+.||+|..|+.+
T Consensus 3865 d~C~--~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~-- 3940 (4289)
T KOG1219|consen 3865 DPCN--DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR-- 3940 (4289)
T ss_pred cccc--cCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc--
Confidence 6676 7889 9999998874 44 45778899999999999999999999999999999999854
Q ss_pred CCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecCCCCccCCcc
Q psy9688 82 DCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCTEDFTGEYCQ 145 (181)
Q Consensus 82 ~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c~ 145 (181)
.+++|+.++|..++.|.+ ..|+|+|.|.+|+.|..|.
T Consensus 3941 --------------------------Gi~eCs~n~C~~gg~C~n-~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3941 --------------------------GISECSKNVCGTGGQCIN-IPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred --------------------------cccccccccccCCceeec-cCCceEeccChhHhcccCc
Confidence 134556667788888988 8999999999999998875
No 5
>KOG4289|consensus
Probab=99.00 E-value=1.5e-09 Score=92.64 Aligned_cols=74 Identities=27% Similarity=0.645 Sum_probs=60.3
Q ss_pred CCCCCcccCCCCCCcccccCCCCCC-CCceeecCCCCc---c------------ccCcCCCCCCCCCCCEEecCC-CCee
Q psy9688 2 YFKPISLSTLCESDINECETVKDVC-NYGICVNTNGNG---N------------SKVDHCLTDPCLGHGTCINRS-DRYQ 64 (181)
Q Consensus 2 ~C~~g~~g~~C~~~~~~c~~~~~~C-~~~~C~~~~~~~---~------------~~~~~c~~~~c~~~~~C~~~~-~~~~ 64 (181)
.|++||+|..|++.++.|. ..+| +++.|+...|+| | .....|.+.-|.++++|++.. +.+.
T Consensus 1225 rCPpGFTgd~CeTeiDlCY--s~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~ 1302 (2531)
T KOG4289|consen 1225 RCPPGFTGDYCETEIDLCY--SGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFC 1302 (2531)
T ss_pred eCCCCCCcccccchhHhhh--cCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCcee
Confidence 5899999999999999999 8999 999999999998 1 123457777799999999854 5789
Q ss_pred EeCCCC-CcCCCCC
Q psy9688 65 CLCTPR-YSGNNCE 77 (181)
Q Consensus 65 C~c~~g-~~g~~c~ 77 (181)
|.|+.| |++..|+
T Consensus 1303 c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1303 CHCPYGEFEDPRCE 1316 (2531)
T ss_pred ccCCCcccCCCceE
Confidence 999988 3355554
No 6
>KOG1214|consensus
Probab=98.90 E-value=1.3e-08 Score=83.28 Aligned_cols=105 Identities=29% Similarity=0.733 Sum_probs=75.7
Q ss_pred cCCCCCCcccccCCCCCC-CCceeecCCCCc----------------c------ccCcCCCC--CCCCCCC--EEec-CC
Q psy9688 9 STLCESDINECETVKDVC-NYGICVNTNGNG----------------N------SKVDHCLT--DPCLGHG--TCIN-RS 60 (181)
Q Consensus 9 g~~C~~~~~~c~~~~~~C-~~~~C~~~~~~~----------------~------~~~~~c~~--~~c~~~~--~C~~-~~ 60 (181)
|..|. |+++|+...+.| .++.|++.+++| | ...+.|.. +.|...+ .|+. +.
T Consensus 728 gr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGg 806 (1289)
T KOG1214|consen 728 GRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGG 806 (1289)
T ss_pred CCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCC
Confidence 57895 999999888899 999999999987 1 11233322 2244433 4554 44
Q ss_pred CCeeEeCCCCCcCC--CCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecCCC
Q psy9688 61 DRYQCLCTPRYSGN--NCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCTED 138 (181)
Q Consensus 61 ~~~~C~c~~g~~g~--~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~~g 138 (181)
+.|+|.|.+||.|. .|. ++++|.+.-|+..+.|.+ +.+++.|.|.+|
T Consensus 807 s~y~C~CLPGfsGDG~~c~------------------------------dvDeC~psrChp~A~Cyn-tpgsfsC~C~pG 855 (1289)
T KOG1214|consen 807 STYSCACLPGFSGDGHQCT------------------------------DVDECSPSRCHPAATCYN-TPGSFSCRCQPG 855 (1289)
T ss_pred ceEEEeecCCccCCccccc------------------------------cccccCccccCCCceEec-CCCcceeecccC
Confidence 57999999999853 333 446666666788889998 899999999999
Q ss_pred CccCCcc
Q psy9688 139 FTGEYCQ 145 (181)
Q Consensus 139 ~~g~~c~ 145 (181)
|.|+.-+
T Consensus 856 y~GDGf~ 862 (1289)
T KOG1214|consen 856 YYGDGFQ 862 (1289)
T ss_pred ccCCCce
Confidence 9865443
No 7
>KOG1217|consensus
Probab=98.61 E-value=1.3e-06 Score=69.65 Aligned_cols=175 Identities=33% Similarity=0.754 Sum_probs=117.3
Q ss_pred CCCCCcccCCCCCCcccccCCCCCC-CCceeecCCCCc------------ccc--------------------CcCCCC-
Q psy9688 2 YFKPISLSTLCESDINECETVKDVC-NYGICVNTNGNG------------NSK--------------------VDHCLT- 47 (181)
Q Consensus 2 ~C~~g~~g~~C~~~~~~c~~~~~~C-~~~~C~~~~~~~------------~~~--------------------~~~c~~- 47 (181)
.|..||.+..+....++|......| ..+.|.+..++| +.. .+.|..
T Consensus 155 ~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~ 234 (487)
T KOG1217|consen 155 SCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVS 234 (487)
T ss_pred eeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccc
Confidence 4788888888875547887556678 888898887765 110 011111
Q ss_pred -CCCCCC-CEEecCCCCeeEeCCCCCcCCCC--CCCCCCCCCCC-CCCCcEEeeCC----------eecccc-ccCCCCC
Q psy9688 48 -DPCLGH-GTCINRSDRYQCLCTPRYSGNNC--EKDNGDCSSNP-CLNDGVCVDEV----------YKGRYW-ELPEIRD 111 (181)
Q Consensus 48 -~~c~~~-~~C~~~~~~~~C~c~~g~~g~~c--~~~~~~c~~~~-c~~~~~C~~~~----------~~g~~c-~~~~~~~ 111 (181)
..+... +.|++..+.+.|.+.+||.+..+ ..+++.|.... |.+.+.|++.. |.+..+ ...+...
T Consensus 235 ~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~ 314 (487)
T KOG1217|consen 235 IVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDE 314 (487)
T ss_pred cccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCcccccccc
Confidence 112222 67888888899999999997762 23677787653 77788888764 555555 2223345
Q ss_pred CC----CCCCCCCCEEee-CCCCceEeecCCCCccCCccccCC-CCCCCCCCCCCCCEEee-CCCCcceEEecCCCC
Q psy9688 112 CT----SNPCLNDCVCVD-DGNLSFYCNCTEDFTGEYCQFENS-AACVTLNPCQNNATCVA-SPGEKQITCLCLKEV 181 (181)
Q Consensus 112 c~----~~~c~~~~~C~~-~~~~~~~C~C~~g~~g~~c~~~~~-~~c~~~~~C~~~~~C~~-~~g~~~~~C~C~~g~ 181 (181)
|. ..+|..++.|.. .....+.|.|..+|.|..|+ .. ++|.. ..+..++.|++ ..+. +.|.|..+|
T Consensus 315 C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~--~~~~~C~~-~~~~~~~~c~~~~~~~--~~c~~~~~~ 386 (487)
T KOG1217|consen 315 CSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCE--DSNDECAS-SPCCPGGTCVNETPGS--YRCACPAGF 386 (487)
T ss_pred ccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccc--cCCccccC-CccccCCEeccCCCCC--eEecCCCcc
Confidence 53 345777777721 13345779999999999998 44 47877 56888999998 5776 889988765
No 8
>KOG1225|consensus
Probab=98.61 E-value=4.5e-07 Score=72.19 Aligned_cols=80 Identities=38% Similarity=1.036 Sum_probs=42.0
Q ss_pred CCCCCEEecCCCCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCC------eeccccccCCCCCCCCCCCCCCCEE
Q psy9688 50 CLGHGTCINRSDRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEV------YKGRYWELPEIRDCTSNPCLNDCVC 123 (181)
Q Consensus 50 c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~------~~g~~c~~~~~~~c~~~~c~~~~~C 123 (181)
|..++.|++. +|+|++||+|..|.+ ..|... |+.++.+++.. |.|..|.. ..| +.+|..++.|
T Consensus 256 c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g~CiC~~g~~G~dCs~---~~c-padC~g~G~C 324 (525)
T KOG1225|consen 256 CTGRGQCVEG----RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDGECICNPGYSGKDCSI---RRC-PADCSGHGKC 324 (525)
T ss_pred CcccceEeCC----eEeCCCCCcCCCCCc--ccCCcc-cCCCceecCCEeecCCCcccccccc---ccC-CccCCCCCcc
Confidence 4445566655 677777777776653 223222 33333333222 55555542 122 2456666666
Q ss_pred eeCCCCceEeecCCCCccCCcc
Q psy9688 124 VDDGNLSFYCNCTEDFTGEYCQ 145 (181)
Q Consensus 124 ~~~~~~~~~C~C~~g~~g~~c~ 145 (181)
+. -+|.|.+||+|..|.
T Consensus 325 i~-----G~C~C~~Gy~G~~C~ 341 (525)
T KOG1225|consen 325 ID-----GECLCDEGYTGELCI 341 (525)
T ss_pred cC-----CceEeCCCCcCCccc
Confidence 53 247777777777665
No 9
>KOG1217|consensus
Probab=98.47 E-value=4.1e-06 Score=66.75 Aligned_cols=122 Identities=37% Similarity=0.895 Sum_probs=82.0
Q ss_pred cCCC--CCCCCCCCEEecCCCCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeC------C-eeccccccCCCCCCC
Q psy9688 43 DHCL--TDPCLGHGTCINRSDRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDE------V-YKGRYWELPEIRDCT 113 (181)
Q Consensus 43 ~~c~--~~~c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~------~-~~g~~c~~~~~~~c~ 113 (181)
++|. ..+|.+.+.|.+..+.|.|.|+++|.+..++.. .....|... . +.+..+.. .+..+.
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------~~~~~c~~~~~~~~~~g~~~~~c~~-~~~~~~ 239 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------GNGGTCVDSVACSCPPGARGPECEV-SIVECA 239 (487)
T ss_pred cccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC---------CCCceEecceeccCCCCCCCCCccc-cccccc
Confidence 5675 345888889998888899999999998887633 111222211 0 11222221 122222
Q ss_pred CCCCCCCCEEeeCCCCceEeecCCCCccCCc-cccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 114 SNPCLNDCVCVDDGNLSFYCNCTEDFTGEYC-QFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 114 ~~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c-~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
.. . ++|++ ..+.+.|.|++||.+..+ ...+++.|.....|.++++|++..+. |.|.|++||
T Consensus 240 ~~---~-~~c~~-~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~--~~C~C~~g~ 301 (487)
T KOG1217|consen 240 SG---D-GTCVN-TVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGS--YRCTCPPGF 301 (487)
T ss_pred CC---C-Ccccc-cCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCc--ceeeCCCCC
Confidence 11 2 78888 788899999999998762 21267888772239999999999987 889999987
No 10
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.28 E-value=7.3e-07 Score=46.88 Aligned_cols=32 Identities=28% Similarity=0.707 Sum_probs=28.1
Q ss_pred CCCCCCC-CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 148 NSAACVT-LNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 148 ~~~~c~~-~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
||++|.. .+.|..++.|+|+.|+ |.|.|++||
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gs--y~C~C~~Gy 33 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGS--YSCSCPPGY 33 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTE--EEEEESTTE
T ss_pred CccccCCCCCcCCCCCEEEcCCCC--EEeeCCCCc
Confidence 5788876 4679889999999999 899999997
No 11
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.00 E-value=1.7e-05 Score=40.70 Aligned_cols=36 Identities=44% Similarity=1.231 Sum_probs=30.6
Q ss_pred cCcCCCC-CCCCCCCEEecCCCCeeEeCCCCCc-CCCC
Q psy9688 41 KVDHCLT-DPCLGHGTCINRSDRYQCLCTPRYS-GNNC 76 (181)
Q Consensus 41 ~~~~c~~-~~c~~~~~C~~~~~~~~C~c~~g~~-g~~c 76 (181)
++++|.. .+|.++++|++..+.|.|.|++||. |..|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 3567866 7899888999999999999999998 7765
No 12
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.95 E-value=4.2e-06 Score=41.16 Aligned_cols=30 Identities=43% Similarity=1.166 Sum_probs=26.5
Q ss_pred CCCCCCCCCCEEecCC-CCeeEeCCCCCcCC
Q psy9688 45 CLTDPCLGHGTCINRS-DRYQCLCTPRYSGN 74 (181)
Q Consensus 45 c~~~~c~~~~~C~~~~-~~~~C~c~~g~~g~ 74 (181)
|.+.+|.++++|+... +.|.|.|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4567999999999887 88999999999885
No 13
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.90 E-value=9e-06 Score=39.97 Aligned_cols=31 Identities=39% Similarity=0.954 Sum_probs=25.8
Q ss_pred CCCCCCCCCCEEeeCCC-CceEeecCCCCccCC
Q psy9688 112 CTSNPCLNDCVCVDDGN-LSFYCNCTEDFTGEY 143 (181)
Q Consensus 112 c~~~~c~~~~~C~~~~~-~~~~C~C~~g~~g~~ 143 (181)
|.+.+|.++++|++ .. +.|+|.|++||+|+.
T Consensus 1 C~~~~C~n~g~C~~-~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCID-LPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEE-ESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEe-CCCCCEEeECCCCCccCC
Confidence 34568899999998 66 899999999999863
No 14
>KOG4260|consensus
Probab=97.88 E-value=1.4e-05 Score=57.95 Aligned_cols=124 Identities=27% Similarity=0.696 Sum_probs=74.9
Q ss_pred CCCCCCCCEEecC---CCCeeEeCCCCCcCCCCCCCC--------CCCCC--CCCCC--CcEEeeCC----------e--
Q psy9688 47 TDPCLGHGTCINR---SDRYQCLCTPRYSGNNCEKDN--------GDCSS--NPCLN--DGVCVDEV----------Y-- 99 (181)
Q Consensus 47 ~~~c~~~~~C~~~---~~~~~C~c~~g~~g~~c~~~~--------~~c~~--~~c~~--~~~C~~~~----------~-- 99 (181)
..+|+.++.|... .++-.|.|.+||.|..|..-. ++-.. ..|.. .+.|.... |
T Consensus 149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~l 228 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKL 228 (350)
T ss_pred cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhccccee
Confidence 3568888888632 245689999999998885210 00000 00110 01221111 2
Q ss_pred eccccccCCCCCCC--CCCCCCCCEEeeCCCCceEeecCCCCccC--CccccCCCCCCCCCCCCCCCEEeeCCCCcceEE
Q psy9688 100 KGRYWELPEIRDCT--SNPCLNDCVCVDDGNLSFYCNCTEDFTGE--YCQFENSAACVTLNPCQNNATCVASPGEKQITC 175 (181)
Q Consensus 100 ~g~~c~~~~~~~c~--~~~c~~~~~C~~~~~~~~~C~C~~g~~g~--~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C 175 (181)
+-..|. ++++|. +.||.....|+| +.|+|.|.+.+||.+. .|+. -.+.|. ..+..|++++++ |.|
T Consensus 229 de~gCv--DvnEC~~ep~~c~~~qfCvN-teGSf~C~dk~Gy~~g~d~C~~-~~d~~~-----~kn~~c~ni~~~--~r~ 297 (350)
T KOG4260|consen 229 DEEGCV--DVNECQNEPAPCKAHQFCVN-TEGSFKCEDKEGYKKGVDECQF-CADVCA-----SKNRPCMNIDGQ--YRC 297 (350)
T ss_pred cccccc--cHHHHhcCCCCCChhheeec-CCCceEecccccccCChHHhhh-hhhhcc-----cCCCCcccCCcc--EEE
Confidence 122233 788886 357999999999 9999999999998742 2221 112222 244578899998 779
Q ss_pred ecCCCC
Q psy9688 176 LCLKEV 181 (181)
Q Consensus 176 ~C~~g~ 181 (181)
+|..|+
T Consensus 298 v~f~~~ 303 (350)
T KOG4260|consen 298 VCFSGL 303 (350)
T ss_pred Eecccc
Confidence 987764
No 15
>KOG1225|consensus
Probab=97.83 E-value=0.00015 Score=58.08 Aligned_cols=96 Identities=30% Similarity=0.761 Sum_probs=65.7
Q ss_pred eeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCC------eeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecC
Q psy9688 63 YQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEV------YKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCT 136 (181)
Q Consensus 63 ~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~------~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~ 136 (181)
..|.|..+|.|..|+. + .| +..|...+.|+.-. |.|..|.. ..|... |..++.+++ + +|.|+
T Consensus 234 ~ic~c~~~~~g~~c~~-~-~C-~~~c~~~g~c~~G~CIC~~Gf~G~dC~e---~~Cp~~-cs~~g~~~~---g--~CiC~ 301 (525)
T KOG1225|consen 234 GICECPEGYFGPLCST-I-YC-PGGCTGRGQCVEGRCICPPGFTGDDCDE---LVCPVD-CSGGGVCVD---G--ECICN 301 (525)
T ss_pred ceeecCCceeCCcccc-c-cC-CCCCcccceEeCCeEeCCCCCcCCCCCc---ccCCcc-cCCCceecC---C--EeecC
Confidence 4789999999988872 2 24 34444445554432 77777752 344333 767677754 2 79999
Q ss_pred CCCccCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 137 EDFTGEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 137 ~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
+||+|+.|+. ..|. .+|..++.|+ .|. |.|.+||
T Consensus 302 ~g~~G~dCs~---~~cp--adC~g~G~Ci--~G~----C~C~~Gy 335 (525)
T KOG1225|consen 302 PGYSGKDCSI---RRCP--ADCSGHGKCI--DGE----CLCDEGY 335 (525)
T ss_pred CCcccccccc---ccCC--ccCCCCCccc--CCc----eEeCCCC
Confidence 9999999963 4463 5899999998 544 9998887
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.75 E-value=1.9e-05 Score=41.41 Aligned_cols=32 Identities=38% Similarity=1.030 Sum_probs=27.4
Q ss_pred cCcCCCC--CCCCCCCEEecCCCCeeEeCCCCCc
Q psy9688 41 KVDHCLT--DPCLGHGTCINRSDRYQCLCTPRYS 72 (181)
Q Consensus 41 ~~~~c~~--~~c~~~~~C~~~~~~~~C~c~~g~~ 72 (181)
|+++|.. ..|...+.|+|..++|.|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 5678855 3588889999999999999999997
No 17
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.74 E-value=7.2e-05 Score=38.26 Aligned_cols=35 Identities=43% Similarity=1.019 Sum_probs=28.4
Q ss_pred CCCCCC-CCCCCCCEEeeCCCCceEeecCCCCc-cCCc
Q psy9688 109 IRDCTS-NPCLNDCVCVDDGNLSFYCNCTEDFT-GEYC 144 (181)
Q Consensus 109 ~~~c~~-~~c~~~~~C~~~~~~~~~C~C~~g~~-g~~c 144 (181)
+++|.. .+|..+++|++ ..+.|.|.|++||. |..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~-~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVN-TVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred cccCcCCCCcCCCCEeEC-CCCCeEeECCCCCccCCcC
Confidence 355655 57888889998 88899999999998 7766
No 18
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.70 E-value=0.00011 Score=37.22 Aligned_cols=35 Identities=46% Similarity=1.268 Sum_probs=29.6
Q ss_pred CcCCCC-CCCCCCCEEecCCCCeeEeCCCCCcCCCC
Q psy9688 42 VDHCLT-DPCLGHGTCINRSDRYQCLCTPRYSGNNC 76 (181)
Q Consensus 42 ~~~c~~-~~c~~~~~C~~~~~~~~C~c~~g~~g~~c 76 (181)
+++|.. .+|.+++.|++..+.|.|.|+.||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 466765 68888889999999999999999998766
No 19
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.44 E-value=0.00033 Score=35.32 Aligned_cols=34 Identities=41% Similarity=1.010 Sum_probs=27.4
Q ss_pred CCCCC-CCCCCCCEEeeCCCCceEeecCCCCccCCc
Q psy9688 110 RDCTS-NPCLNDCVCVDDGNLSFYCNCTEDFTGEYC 144 (181)
Q Consensus 110 ~~c~~-~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c 144 (181)
++|.. .+|..++.|++ ..+.|.|.|+.||.|..|
T Consensus 3 ~~C~~~~~C~~~~~C~~-~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 3 DECASGNPCQNGGTCVN-TVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred ccCCCCCCcCCCCEeEC-CCCCeEeECCCCCcCCcC
Confidence 45555 56777789988 888899999999998766
No 20
>KOG4260|consensus
Probab=97.30 E-value=0.00014 Score=52.88 Aligned_cols=69 Identities=32% Similarity=0.611 Sum_probs=44.4
Q ss_pred CCCCcccC--CCCCCcccccCCCCCC-CCceeecCCCCc-ccc-------CcCCCC--CCC-CCCCEEecCCCCeeEeCC
Q psy9688 3 FKPISLST--LCESDINECETVKDVC-NYGICVNTNGNG-NSK-------VDHCLT--DPC-LGHGTCINRSDRYQCLCT 68 (181)
Q Consensus 3 C~~g~~g~--~C~~~~~~c~~~~~~C-~~~~C~~~~~~~-~~~-------~~~c~~--~~c-~~~~~C~~~~~~~~C~c~ 68 (181)
|..||... .|. |||+|...+.+| .+.+|+|+.|+| +.+ +|+|+. .-| ..+..|+++.+.|.|+|.
T Consensus 222 CkkGW~lde~gCv-DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f 300 (350)
T KOG4260|consen 222 CKKGWKLDEEGCV-DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDGQYRCVCF 300 (350)
T ss_pred hcccceecccccc-cHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCccEEEEec
Confidence 55666542 463 788887667777 777888888887 333 444532 112 344567777777888877
Q ss_pred CCCc
Q psy9688 69 PRYS 72 (181)
Q Consensus 69 ~g~~ 72 (181)
.++.
T Consensus 301 ~~~~ 304 (350)
T KOG4260|consen 301 SGLI 304 (350)
T ss_pred ccce
Confidence 7764
No 21
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.22 E-value=0.00097 Score=32.99 Aligned_cols=30 Identities=43% Similarity=1.316 Sum_probs=25.6
Q ss_pred CCCCCCCCEEecCCCCeeEeCCCCCcCC-CC
Q psy9688 47 TDPCLGHGTCINRSDRYQCLCTPRYSGN-NC 76 (181)
Q Consensus 47 ~~~c~~~~~C~~~~~~~~C~c~~g~~g~-~c 76 (181)
..+|.+++.|++..+.|.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 5678888999998889999999999887 44
No 22
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.20 E-value=0.001 Score=33.05 Aligned_cols=31 Identities=45% Similarity=1.221 Sum_probs=25.4
Q ss_pred CCC-CCCCCCCEEecCCCCeeEeCCCCCcC-CCC
Q psy9688 45 CLT-DPCLGHGTCINRSDRYQCLCTPRYSG-NNC 76 (181)
Q Consensus 45 c~~-~~c~~~~~C~~~~~~~~C~c~~g~~g-~~c 76 (181)
|.. .+|.++ .|++..+.|.|.|++||.| ..|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 444 578877 9999888999999999988 554
No 23
>KOG1226|consensus
Probab=97.12 E-value=0.0097 Score=49.52 Aligned_cols=107 Identities=28% Similarity=0.628 Sum_probs=66.7
Q ss_pred CCCCCCEEecCCCCeeEeCCCCCcCCCCCCCC---------CCCC----CCCCCCCcEEeeCC----------eeccccc
Q psy9688 49 PCLGHGTCINRSDRYQCLCTPRYSGNNCEKDN---------GDCS----SNPCLNDGVCVDEV----------YKGRYWE 105 (181)
Q Consensus 49 ~c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~---------~~c~----~~~c~~~~~C~~~~----------~~g~~c~ 105 (181)
.|..+|+.+=+ .|.|.+||.|+.|+-.. +.|. ..+|+..+.|.-.. +.|..|+
T Consensus 468 ~C~g~G~~~CG----~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCE 543 (783)
T KOG1226|consen 468 LCHGNGTFVCG----QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCE 543 (783)
T ss_pred ccCCCCcEEec----ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeee
Confidence 35555555444 78999999999986322 1121 12577777665332 4577777
Q ss_pred cCCCCCCCCC---CCCCCCEEeeCCCCceEeecCCCCccCCcccc-CCCCCCC--CCCCCCCCEEe
Q psy9688 106 LPEIRDCTSN---PCLNDCVCVDDGNLSFYCNCTEDFTGEYCQFE-NSAACVT--LNPCQNNATCV 165 (181)
Q Consensus 106 ~~~~~~c~~~---~c~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~-~~~~c~~--~~~C~~~~~C~ 165 (181)
. +.-.|... .|..++.|.= -.|+|.+||+|..|+.. +.+.|.. ...|...|.|.
T Consensus 544 C-DnfsC~r~~g~lC~g~G~C~C-----G~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~ 603 (783)
T KOG1226|consen 544 C-DNFSCERHKGVLCGGHGRCEC-----GRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCE 603 (783)
T ss_pred c-cCcccccccCcccCCCCeEeC-----CcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceee
Confidence 6 33344432 5888888843 26999999999998751 3455654 12366666664
No 24
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.93 E-value=0.00058 Score=34.38 Aligned_cols=24 Identities=38% Similarity=0.779 Sum_probs=18.9
Q ss_pred CCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 156 NPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 156 ~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
+.|+.+++|++++++ |.|.|++||
T Consensus 6 ~~C~~nA~C~~~~~~--~~C~C~~Gy 29 (36)
T PF12947_consen 6 GGCHPNATCTNTGGS--YTCTCKPGY 29 (36)
T ss_dssp GGS-TTCEEEE-TTS--EEEEE-CEE
T ss_pred CCCCCCcEeecCCCC--EEeECCCCC
Confidence 579999999999997 999999987
No 25
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.90 E-value=0.0025 Score=31.46 Aligned_cols=30 Identities=43% Similarity=0.978 Sum_probs=24.2
Q ss_pred CCCCCCCCEEeeCCCCceEeecCCCCccC-Cc
Q psy9688 114 SNPCLNDCVCVDDGNLSFYCNCTEDFTGE-YC 144 (181)
Q Consensus 114 ~~~c~~~~~C~~~~~~~~~C~C~~g~~g~-~c 144 (181)
..+|..++.|++ ..+.|.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~-~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVN-TPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEec-CCCCeEeECCCCCcccCCc
Confidence 346777789988 7788999999999877 44
No 26
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.83 E-value=0.0014 Score=29.65 Aligned_cols=20 Identities=45% Similarity=1.178 Sum_probs=15.0
Q ss_pred CeeEeCCCCCc----CCCCCCCCCC
Q psy9688 62 RYQCLCTPRYS----GNNCEKDNGD 82 (181)
Q Consensus 62 ~~~C~c~~g~~----g~~c~~~~~~ 82 (181)
+|.|.|++||. |..|. +|+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~-DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE-DIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc-cCCC
Confidence 58999999998 55665 5553
No 27
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.82 E-value=0.003 Score=31.31 Aligned_cols=28 Identities=36% Similarity=1.020 Sum_probs=22.9
Q ss_pred CCCCCCCEEeeCCCCceEeecCCCCcc-CCc
Q psy9688 115 NPCLNDCVCVDDGNLSFYCNCTEDFTG-EYC 144 (181)
Q Consensus 115 ~~c~~~~~C~~~~~~~~~C~C~~g~~g-~~c 144 (181)
.+|..+ .|++ ..+.|.|.|++||.+ ..|
T Consensus 6 ~~C~~~-~C~~-~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCIN-TPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEEC-CCCCeEeECCCCCccCCcc
Confidence 467776 8988 788999999999988 554
No 28
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.75 E-value=0.0013 Score=33.14 Aligned_cols=26 Identities=23% Similarity=0.786 Sum_probs=20.5
Q ss_pred CCCCCCEEeeCCCCceEeecCCCCccC
Q psy9688 116 PCLNDCVCVDDGNLSFYCNCTEDFTGE 142 (181)
Q Consensus 116 ~c~~~~~C~~~~~~~~~C~C~~g~~g~ 142 (181)
.|+.+++|++ ..++|.|.|.+||.|+
T Consensus 7 ~C~~nA~C~~-~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTN-TGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE--TTSEEEEE-CEEECC
T ss_pred CCCCCcEeec-CCCCEEeECCCCCccC
Confidence 5788899999 8889999999999865
No 29
>KOG1226|consensus
Probab=96.65 E-value=0.026 Score=47.10 Aligned_cols=89 Identities=31% Similarity=0.723 Sum_probs=43.6
Q ss_pred CCCCCCEEecCCCCeeEeCCCCCc----CCCCCCCCCCCCC---CCCCCCcEEeeCC------eeccccccC-CCCCCCC
Q psy9688 49 PCLGHGTCINRSDRYQCLCTPRYS----GNNCEKDNGDCSS---NPCLNDGVCVDEV------YKGRYWELP-EIRDCTS 114 (181)
Q Consensus 49 ~c~~~~~C~~~~~~~~C~c~~g~~----g~~c~~~~~~c~~---~~c~~~~~C~~~~------~~g~~c~~~-~~~~c~~ 114 (181)
+|..+|.|+=+ .|.|.+... |..|+-+.-.|.. ..|...+.|.-.. |+|..|.++ +.+.|..
T Consensus 515 vCSgrG~C~CG----qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~ 590 (783)
T KOG1226|consen 515 VCSGRGDCVCG----QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCES 590 (783)
T ss_pred CcCCCCcEeCC----ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccC
Confidence 45555555433 455544433 5555433222321 1244444443322 555555542 2344443
Q ss_pred C---CCCCCCEEeeCCCCceEeecCCC-CccCCccc
Q psy9688 115 N---PCLNDCVCVDDGNLSFYCNCTED-FTGEYCQF 146 (181)
Q Consensus 115 ~---~c~~~~~C~~~~~~~~~C~C~~g-~~g~~c~~ 146 (181)
. -|..+++|.= -+|.|... |.|..|+.
T Consensus 591 ~~G~iCSGrG~C~C-----g~C~C~~~~~sG~~CE~ 621 (783)
T KOG1226|consen 591 SDGQICSGRGTCEC-----GRCKCTDPPYSGEFCEK 621 (783)
T ss_pred CCCceeCCCceeeC-----CceEcCCCCcCcchhhc
Confidence 2 3666666643 14777665 88888863
No 30
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.53 E-value=0.0027 Score=28.78 Aligned_cols=20 Identities=25% Similarity=0.874 Sum_probs=14.6
Q ss_pred ceEeecCCCCc----cCCccccCCCC
Q psy9688 130 SFYCNCTEDFT----GEYCQFENSAA 151 (181)
Q Consensus 130 ~~~C~C~~g~~----g~~c~~~~~~~ 151 (181)
+|+|.|++||. |..|. +|++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~--DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE--DIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc--cCCC
Confidence 48999999997 45565 5553
No 31
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.28 E-value=0.0022 Score=24.62 Aligned_cols=13 Identities=38% Similarity=1.122 Sum_probs=10.2
Q ss_pred EeecCCCCccCCc
Q psy9688 132 YCNCTEDFTGEYC 144 (181)
Q Consensus 132 ~C~C~~g~~g~~c 144 (181)
+|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4899999999876
No 32
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.70 E-value=0.02 Score=27.90 Aligned_cols=27 Identities=41% Similarity=1.153 Sum_probs=21.8
Q ss_pred CCCCCCCEEecCCCCeeEeCCCCCcCCCC
Q psy9688 48 DPCLGHGTCINRSDRYQCLCTPRYSGNNC 76 (181)
Q Consensus 48 ~~c~~~~~C~~~~~~~~C~c~~g~~g~~c 76 (181)
..|..+++|+.. ...|+|.+||.|..|
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 458899999854 349999999998765
No 33
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.24 E-value=0.031 Score=27.23 Aligned_cols=26 Identities=31% Similarity=0.718 Sum_probs=20.9
Q ss_pred CCCCCCEEeeCCCCceEeecCCCCccCCc
Q psy9688 116 PCLNDCVCVDDGNLSFYCNCTEDFTGEYC 144 (181)
Q Consensus 116 ~c~~~~~C~~~~~~~~~C~C~~g~~g~~c 144 (181)
.|.++++|+. . ..+|.|.+||.|+.|
T Consensus 7 ~C~~~G~C~~-~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVS-P--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeC-C--CCEEECCCCCcCCCC
Confidence 5788899975 3 358999999999865
No 34
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.19 E-value=0.016 Score=29.05 Aligned_cols=21 Identities=33% Similarity=0.895 Sum_probs=17.2
Q ss_pred CCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 157 PCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 157 ~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
.|.+ +|++++++ |.|.|++||
T Consensus 7 gC~h--~C~~~~g~--~~C~C~~Gy 27 (36)
T PF14670_consen 7 GCSH--ICVNTPGS--YRCSCPPGY 27 (36)
T ss_dssp GSSS--EEEEETTS--EEEE-STTE
T ss_pred CcCC--CCccCCCc--eEeECCCCC
Confidence 4666 99999998 999999997
No 35
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.08 E-value=0.071 Score=26.72 Aligned_cols=19 Identities=37% Similarity=1.074 Sum_probs=16.2
Q ss_pred CEEecCCCCeeEeCCCCCc
Q psy9688 54 GTCINRSDRYQCLCTPRYS 72 (181)
Q Consensus 54 ~~C~~~~~~~~C~c~~g~~ 72 (181)
..|++..++|.|.|++||.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp SEEEEETTSEEEE-STTEE
T ss_pred CCCccCCCceEeECCCCCE
Confidence 4799989999999999997
No 36
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=92.52 E-value=0.16 Score=36.67 Aligned_cols=35 Identities=31% Similarity=0.852 Sum_probs=27.6
Q ss_pred cCCccccCCCCCCC-CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 141 GEYCQFENSAACVT-LNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 141 g~~c~~~~~~~c~~-~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
+..|. ++++|.. .+.|.. .|+++.|+ |.|.|++||
T Consensus 181 ~~~C~--~~~~C~~~~~~c~~--~C~~~~g~--~~c~c~~g~ 216 (224)
T cd01475 181 GKICV--VPDLCATLSHVCQQ--VCISTPGS--YLCACTEGY 216 (224)
T ss_pred cccCc--CchhhcCCCCCccc--eEEcCCCC--EEeECCCCc
Confidence 56676 6778865 356765 89999999 899999997
No 37
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.31 E-value=0.099 Score=26.23 Aligned_cols=28 Identities=18% Similarity=0.413 Sum_probs=18.9
Q ss_pred CCCCCCCCCEEeeCCCCceEeecCCCCc
Q psy9688 113 TSNPCLNDCVCVDDGNLSFYCNCTEDFT 140 (181)
Q Consensus 113 ~~~~c~~~~~C~~~~~~~~~C~C~~g~~ 140 (181)
....|+.++.|++...|.+.|.|..||+
T Consensus 3 ~~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 3 IDTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred cCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 3446788889987344999999999997
No 38
>KOG0994|consensus
Probab=88.87 E-value=1.7 Score=38.79 Aligned_cols=17 Identities=41% Similarity=0.974 Sum_probs=14.4
Q ss_pred CeeEeCCCCCcCCCCCC
Q psy9688 62 RYQCLCTPRYSGNNCEK 78 (181)
Q Consensus 62 ~~~C~c~~g~~g~~c~~ 78 (181)
...|.|.+||+|..|++
T Consensus 933 ~ivC~C~~GY~G~RCe~ 949 (1758)
T KOG0994|consen 933 QIVCHCQEGYSGSRCEI 949 (1758)
T ss_pred ceeeecccCccccchhh
Confidence 35899999999999874
No 39
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.72 E-value=0.56 Score=33.91 Aligned_cols=35 Identities=29% Similarity=0.519 Sum_probs=25.8
Q ss_pred cccCcCCCCCCCCCCCEEecCCCCeeEeCCCCCcC
Q psy9688 39 NSKVDHCLTDPCLGHGTCINRSDRYQCLCTPRYSG 73 (181)
Q Consensus 39 ~~~~~~c~~~~c~~~~~C~~~~~~~~C~c~~g~~g 73 (181)
|.+.++|...+......|.+..+.|.|.|++||+.
T Consensus 184 C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred CcCchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 56677785433222357999999999999999974
No 40
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=88.05 E-value=0.17 Score=35.21 Aligned_cols=48 Identities=21% Similarity=0.568 Sum_probs=25.9
Q ss_pred ceEeecCCCCc---cCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 130 SFYCNCTEDFT---GEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 130 ~~~C~C~~g~~---g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
...|+|..|+. ...|.......|.. .|..+-.|-...+- |.|.+.+||
T Consensus 109 ~~~CSC~IGkV~~dn~kCtk~G~T~C~L--KCk~nE~CK~~~~~--Y~C~~~~~~ 159 (197)
T PF06247_consen 109 NPTCSCNIGKVPDDNKKCTKTGETKCSL--KCKENEECKLVDGY--YKCVCKEGF 159 (197)
T ss_dssp EEEEEE-TEEETTTTTESEEEE----------TTTEEEEEETTE--EEEEE-TT-
T ss_pred CceeEeeeceEeccCCcccCCCccceee--ecCCCcceeeeCcE--EEeecCCCC
Confidence 34899998875 34454323334543 56777789888775 889988775
No 41
>KOG3516|consensus
Probab=87.70 E-value=0.59 Score=41.39 Aligned_cols=44 Identities=30% Similarity=0.750 Sum_probs=37.9
Q ss_pred cccCcCCCCCCCCCCCEEecCCCCeeEeCC-CCCcCCCCCCCCCC
Q psy9688 39 NSKVDHCLTDPCLGHGTCINRSDRYQCLCT-PRYSGNNCEKDNGD 82 (181)
Q Consensus 39 ~~~~~~c~~~~c~~~~~C~~~~~~~~C~c~-~g~~g~~c~~~~~~ 82 (181)
|.-++.|.|++|.+++.|......+.|.|. .||.|..|...+.+
T Consensus 542 C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e 586 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE 586 (1306)
T ss_pred cccccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence 556788999999999999998888999998 99999999876644
No 42
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=87.37 E-value=0.81 Score=29.80 Aligned_cols=28 Identities=36% Similarity=0.863 Sum_probs=22.1
Q ss_pred CCCCCCEEecC--CCCeeEeCCCCCcCCCCC
Q psy9688 49 PCLGHGTCINR--SDRYQCLCTPRYSGNNCE 77 (181)
Q Consensus 49 ~c~~~~~C~~~--~~~~~C~c~~g~~g~~c~ 77 (181)
-|.++ .|.-. ...+.|.|..||+|.+|+
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCccccccc
Confidence 36664 78643 457899999999999997
No 43
>PHA02887 EGF-like protein; Provisional
Probab=87.20 E-value=0.86 Score=29.18 Aligned_cols=27 Identities=37% Similarity=1.085 Sum_probs=21.6
Q ss_pred CCCCCEEecC--CCCeeEeCCCCCcCCCCC
Q psy9688 50 CLGHGTCINR--SDRYQCLCTPRYSGNNCE 77 (181)
Q Consensus 50 c~~~~~C~~~--~~~~~C~c~~g~~g~~c~ 77 (181)
|. +|+|.-. .....|.|..||+|.+|+
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 66 4688643 446899999999999997
No 44
>smart00051 DSL delta serrate ligand.
Probab=85.50 E-value=1.7 Score=24.74 Aligned_cols=42 Identities=26% Similarity=0.491 Sum_probs=28.6
Q ss_pred eEeecCCCCccCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 131 FYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 131 ~~C~C~~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
+.-.|.++|.|..|.. .|...+.+..+.+|-. .|. ++|.+||
T Consensus 17 ~rv~C~~~~yG~~C~~----~C~~~~d~~~~~~Cd~-~G~----~~C~~Gw 58 (63)
T smart00051 17 IRVTCDENYYGEGCNK----FCRPRDDFFGHYTCDE-NGN----KGCLEGW 58 (63)
T ss_pred EEeeCCCCCcCCccCC----EeCcCccccCCccCCc-CCC----EecCCCC
Confidence 4566888999998862 3433244566778853 565 8899987
No 45
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=82.95 E-value=0.61 Score=32.59 Aligned_cols=88 Identities=24% Similarity=0.573 Sum_probs=46.0
Q ss_pred CCCCCCCEEecCC-----CCeeEeCCCCCcCC--CCCCCCCCCCCCCCCCCcEEeeCC----------------eecccc
Q psy9688 48 DPCLGHGTCINRS-----DRYQCLCTPRYSGN--NCEKDNGDCSSNPCLNDGVCVDEV----------------YKGRYW 104 (181)
Q Consensus 48 ~~c~~~~~C~~~~-----~~~~C~c~~g~~g~--~c~~~~~~c~~~~c~~~~~C~~~~----------------~~g~~c 104 (181)
.+|...+.|++.. ..|.|.|.+||... .|. .+.|....|. .+.|+... .+...|
T Consensus 50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kC 126 (197)
T PF06247_consen 50 KPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKC 126 (197)
T ss_dssp SEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTES
T ss_pred ccccchhhhhcCCCcccceeEEEecccCceeeCCeEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcc
Confidence 4588888898654 35899999999732 221 1334444555 45665433 011122
Q ss_pred ccCCCCCCCCCCCCCCCEEeeCCCCceEeecCCCCc
Q psy9688 105 ELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCTEDFT 140 (181)
Q Consensus 105 ~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~~g~~ 140 (181)
..+....|+. .|..+..|.. ..+-|.|.+..+|.
T Consensus 127 tk~G~T~C~L-KCk~nE~CK~-~~~~Y~C~~~~~~~ 160 (197)
T PF06247_consen 127 TKTGETKCSL-KCKENEECKL-VDGYYKCVCKEGFP 160 (197)
T ss_dssp EEEE---------TTTEEEEE-ETTEEEEEE-TT-E
T ss_pred cCCCccceee-ecCCCcceee-eCcEEEeecCCCCC
Confidence 2211233332 4778889988 78899999999986
No 46
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.15 E-value=2.1 Score=27.92 Aligned_cols=33 Identities=15% Similarity=0.386 Sum_probs=23.5
Q ss_pred CCCCCCC--CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 148 NSAACVT--LNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 148 ~~~~c~~--~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
++..|.. .+.|.+| .|+..+.-..+.|.|..||
T Consensus 41 ~i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GY 75 (139)
T PHA03099 41 AIRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGY 75 (139)
T ss_pred ccccCChhhCCEeECC-EEEeeccCCCceeECCCCc
Confidence 3445543 3568775 8987776666899999987
No 47
>PHA02887 EGF-like protein; Provisional
Probab=79.06 E-value=2.6 Score=27.06 Aligned_cols=31 Identities=23% Similarity=0.504 Sum_probs=23.1
Q ss_pred CCCCC--CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 150 AACVT--LNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 150 ~~c~~--~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
.+|.+ .+.|. +|+|...+.-...+|.|++||
T Consensus 84 ~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GY 116 (126)
T PHA02887 84 EKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGY 116 (126)
T ss_pred cccChHhhCEee-CCEEEccccCCCceeECCCCc
Confidence 45654 35687 579987777667899999997
No 48
>KOG3514|consensus
Probab=68.18 E-value=4.3 Score=36.15 Aligned_cols=37 Identities=30% Similarity=0.898 Sum_probs=32.2
Q ss_pred CCCCCCCCCCCEEecCCCCeeEeC-CCCCcCCCCCCCC
Q psy9688 44 HCLTDPCLGHGTCINRSDRYQCLC-TPRYSGNNCEKDN 80 (181)
Q Consensus 44 ~c~~~~c~~~~~C~~~~~~~~C~c-~~g~~g~~c~~~~ 80 (181)
.|.++||.+++.|...-+.+.|-| -.+|.|+.|+.+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE~ 662 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCEREA 662 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcccccee
Confidence 688999999999999889999986 5789999998654
No 49
>KOG0994|consensus
Probab=64.24 E-value=27 Score=31.85 Aligned_cols=58 Identities=21% Similarity=0.487 Sum_probs=30.9
Q ss_pred EeeCCCCceEe-ecCCCCccCCcccc----CCCCCCC-C-CCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 123 CVDDGNLSFYC-NCTEDFTGEYCQFE----NSAACVT-L-NPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 123 C~~~~~~~~~C-~C~~g~~g~~c~~~----~~~~c~~-~-~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
|.+ ....+.| .|..||.|+----. ..-.|.+ + +.-.+...|...+-+.+..|+|++||
T Consensus 878 CqD-~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY 942 (1758)
T KOG0994|consen 878 CQD-STTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGY 942 (1758)
T ss_pred ccc-cccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccccccccceeeecccCc
Confidence 444 4455666 58888865321100 1122332 1 11223346765666667899999997
No 50
>KOG3516|consensus
Probab=62.38 E-value=7 Score=35.16 Aligned_cols=38 Identities=29% Similarity=0.696 Sum_probs=33.5
Q ss_pred CCCCCCCCCCCCCCEEeeCCCCceEeecC-CCCccCCccc
Q psy9688 108 EIRDCTSNPCLNDCVCVDDGNLSFYCNCT-EDFTGEYCQF 146 (181)
Q Consensus 108 ~~~~c~~~~c~~~~~C~~~~~~~~~C~C~-~g~~g~~c~~ 146 (181)
-++.|.+++|..++.|.. ....|.|.|. .||+|..|..
T Consensus 544 i~drClPN~CehgG~C~Q-s~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 544 ISDRCLPNPCEHGGKCSQ-SWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred cccccCCccccCCCcccc-cccceeEeccccccccccccC
Confidence 457888999999999998 8899999997 8999999875
No 51
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=61.28 E-value=21 Score=19.05 Aligned_cols=20 Identities=40% Similarity=1.223 Sum_probs=15.7
Q ss_pred CCCCCCEEecCCCCeeEeCCCCCc
Q psy9688 49 PCLGHGTCINRSDRYQCLCTPRYS 72 (181)
Q Consensus 49 ~c~~~~~C~~~~~~~~C~c~~g~~ 72 (181)
.|..++.|++. +|.|++||.
T Consensus 27 qC~~~s~C~~g----~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNG----RCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcCC----EeECCCCCE
Confidence 35577888776 999999986
No 52
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=60.75 E-value=12 Score=23.61 Aligned_cols=22 Identities=27% Similarity=0.650 Sum_probs=13.8
Q ss_pred CCCCCCEEeeCC----CCceEeecCC
Q psy9688 116 PCLNDCVCVDDG----NLSFYCNCTE 137 (181)
Q Consensus 116 ~c~~~~~C~~~~----~~~~~C~C~~ 137 (181)
.|..++.|+... ..-|.|.|.+
T Consensus 14 ~CsgHG~C~~~~~~~~~~C~~C~C~~ 39 (103)
T PF12955_consen 14 NCSGHGSCVKKYGSGGGDCFACKCKP 39 (103)
T ss_pred CCCCCceEeeccCCCccceEEEEeec
Confidence 466677776521 2447788876
No 53
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=57.65 E-value=15 Score=23.17 Aligned_cols=31 Identities=23% Similarity=0.540 Sum_probs=22.1
Q ss_pred CCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688 148 NSAACVTLNPCQNNATCVASPGEKQITCLCLKEV 181 (181)
Q Consensus 148 ~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~ 181 (181)
..+.|.....|...+.|... .. ..|.|.+||
T Consensus 76 p~d~Cd~y~~CG~~g~C~~~-~~--~~C~Cl~GF 106 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNSN-NS--PKCSCLPGF 106 (110)
T ss_pred cccCCCCccccCCccEeCCC-CC--CceECCCCc
Confidence 44567655789999999533 23 459999997
No 54
>KOG1836|consensus
Probab=55.66 E-value=31 Score=32.97 Aligned_cols=45 Identities=24% Similarity=0.626 Sum_probs=24.6
Q ss_pred ecCCCCccCCccccCCCC-CCCCCCCCCCCEEeeCCCCcceEEe-cCCCC
Q psy9688 134 NCTEDFTGEYCQFENSAA-CVTLNPCQNNATCVASPGEKQITCL-CLKEV 181 (181)
Q Consensus 134 ~C~~g~~g~~c~~~~~~~-c~~~~~C~~~~~C~~~~g~~~~~C~-C~~g~ 181 (181)
+|..||.|..-. +... |.. -+|..++.|..+......-|. |++||
T Consensus 760 ~C~~GfYg~~~~--~~~~dC~~-C~Cp~~~~~~~~~~~~~~iCk~Cp~gy 806 (1705)
T KOG1836|consen 760 QCVDGFYGLPDL--GTSGDCQP-CPCPNGGACGQTPEILEVVCKNCPPGY 806 (1705)
T ss_pred hhcCCCCCcccc--CCCCCCcc-CCCCCChhhcCcCcccceecCCCCCCC
Confidence 466666543221 1111 544 466677777655544445677 88876
No 55
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=53.81 E-value=19 Score=19.08 Aligned_cols=14 Identities=43% Similarity=1.234 Sum_probs=12.6
Q ss_pred eEeCCCCCcCCCCC
Q psy9688 64 QCLCTPRYSGNNCE 77 (181)
Q Consensus 64 ~C~c~~g~~g~~c~ 77 (181)
.|.|++++.|..|+
T Consensus 20 ~C~C~~~~~G~~C~ 33 (50)
T cd00055 20 QCECKPNTTGRRCD 33 (50)
T ss_pred EEeCCCcCCCCCCC
Confidence 78899999999886
No 56
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=51.51 E-value=9.7 Score=20.10 Aligned_cols=16 Identities=31% Similarity=0.669 Sum_probs=13.2
Q ss_pred ceEeecCCCCccCCcc
Q psy9688 130 SFYCNCTEDFTGEYCQ 145 (181)
Q Consensus 130 ~~~C~C~~g~~g~~c~ 145 (181)
..+|.|.++|.|..|+
T Consensus 17 ~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 17 TGQCVCKPGTTGPRCD 32 (49)
T ss_dssp CEEESBSTTEESTTS-
T ss_pred CCEEeccccccCCcCc
Confidence 4589999999999996
No 57
>KOG3514|consensus
Probab=47.58 E-value=16 Score=32.89 Aligned_cols=36 Identities=39% Similarity=0.944 Sum_probs=31.2
Q ss_pred CCCCCCCCCCCCEEeeCCCCceEeecC-CCCccCCccc
Q psy9688 110 RDCTSNPCLNDCVCVDDGNLSFYCNCT-EDFTGEYCQF 146 (181)
Q Consensus 110 ~~c~~~~c~~~~~C~~~~~~~~~C~C~-~g~~g~~c~~ 146 (181)
..|..+||.+.+.|.. ....+.|.|. .+|.|+.|..
T Consensus 624 ~~C~~nPC~N~g~C~e-gwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 624 KICESNPCQNGGKCSE-GWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred cccCCCcccCCCCccc-cccccccccccCcccCccccc
Confidence 3677889999999998 8899999995 5889999975
No 58
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=47.18 E-value=12 Score=18.37 Aligned_cols=8 Identities=25% Similarity=0.916 Sum_probs=6.7
Q ss_pred EEecCCCC
Q psy9688 174 TCLCLKEV 181 (181)
Q Consensus 174 ~C~C~~g~ 181 (181)
.|.|++||
T Consensus 19 ~C~CPeGy 26 (34)
T PF09064_consen 19 QCFCPEGY 26 (34)
T ss_pred ceeCCCce
Confidence 49999987
No 59
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=36.17 E-value=31 Score=17.97 Aligned_cols=15 Identities=33% Similarity=0.866 Sum_probs=12.6
Q ss_pred eEeecCCCCccCCcc
Q psy9688 131 FYCNCTEDFTGEYCQ 145 (181)
Q Consensus 131 ~~C~C~~g~~g~~c~ 145 (181)
-+|.|++++.|..|+
T Consensus 18 G~C~C~~~~~G~~C~ 32 (46)
T smart00180 18 GQCECKPNVTGRRCD 32 (46)
T ss_pred CEEECCCCCCCCCCC
Confidence 479999999998885
No 60
>KOG1218|consensus
Probab=32.70 E-value=83 Score=23.69 Aligned_cols=39 Identities=33% Similarity=0.818 Sum_probs=25.7
Q ss_pred eEeecCCCCccCCccccCCCCCCCCCCCCCCCEEeeCCCC
Q psy9688 131 FYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGE 170 (181)
Q Consensus 131 ~~C~C~~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~ 170 (181)
-.|.|.+||.+..+.. ....|.....|.+++.|....+.
T Consensus 162 ~~c~c~~g~~g~~~~~-~~~~c~~~~~~~~g~~C~~~~~~ 200 (316)
T KOG1218|consen 162 GICTCQPGFVGVFCVE-SCSGCSPLTACENGAKCNRSTGS 200 (316)
T ss_pred CceeccCCcccccccc-cCCCcCCCcccCCCCeeeccccc
Confidence 3588999999988874 22225443557677777766554
No 61
>KOG3509|consensus
Probab=22.29 E-value=2.4e+02 Score=25.57 Aligned_cols=36 Identities=28% Similarity=0.625 Sum_probs=26.2
Q ss_pred CCCCCCCCCCCCEEeeCCCCceEeecCCCCccCCccc
Q psy9688 110 RDCTSNPCLNDCVCVDDGNLSFYCNCTEDFTGEYCQF 146 (181)
Q Consensus 110 ~~c~~~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c~~ 146 (181)
+.|+..++...+.|.. ......|.|+++|.|..|..
T Consensus 407 ~~c~~~p~~~~g~c~p-~~~~~~c~c~~g~~G~~c~d 442 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPCLQ-TLEGKQCLCPPGYTGDSCED 442 (964)
T ss_pred CccccccCCCCccccc-cccccceeccccccCchhhc
Confidence 3455556666666666 56667899999999998874
No 62
>KOG3512|consensus
Probab=21.73 E-value=4.6e+02 Score=21.79 Aligned_cols=25 Identities=24% Similarity=0.732 Sum_probs=17.9
Q ss_pred CEEec-CCCCeeEeCCCCCcCCCCCC
Q psy9688 54 GTCIN-RSDRYQCLCTPRYSGNNCEK 78 (181)
Q Consensus 54 ~~C~~-~~~~~~C~c~~g~~g~~c~~ 78 (181)
..|+- ..+.++|.|...-.|..|++
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgr 310 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGR 310 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCccc
Confidence 35764 34458999998888888763
No 63
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=21.05 E-value=49 Score=18.26 Aligned_cols=18 Identities=28% Similarity=0.576 Sum_probs=11.5
Q ss_pred CceEeecCCCCccCCccc
Q psy9688 129 LSFYCNCTEDFTGEYCQF 146 (181)
Q Consensus 129 ~~~~C~C~~g~~g~~c~~ 146 (181)
|.-.|.|..-|.|.+|+.
T Consensus 34 G~p~CECn~Cy~GpdCS~ 51 (56)
T PF04863_consen 34 GSPVCECNSCYGGPDCST 51 (56)
T ss_dssp TEE--EE-TTEESTTS-E
T ss_pred CCccccccCCcCCCCccc
Confidence 446799999999999975
Done!