Query psy3697
Match_columns 117
No_of_seqs 162 out of 1315
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 23:15:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3697.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3697hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1225|consensus 99.5 2.1E-14 4.6E-19 113.6 8.7 93 1-96 241-347 (525)
2 KOG1225|consensus 99.3 7.9E-12 1.7E-16 99.2 8.7 83 19-104 242-324 (525)
3 KOG1226|consensus 99.1 3.3E-10 7.3E-15 92.2 8.5 84 1-90 485-580 (783)
4 KOG1226|consensus 98.7 7.2E-08 1.6E-12 78.9 7.2 71 29-102 553-633 (783)
5 KOG4289|consensus 98.3 1.1E-06 2.4E-11 76.2 5.3 50 43-93 1223-1277(2531)
6 KOG1219|consensus 98.3 1.9E-06 4.1E-11 77.4 6.1 76 31-107 3870-3958(4289)
7 KOG4289|consensus 98.2 1.2E-06 2.5E-11 76.1 4.6 75 1-92 1229-1318(2531)
8 PF07974 EGF_2: EGF-like domai 98.1 2.6E-06 5.7E-11 43.4 2.9 24 32-55 7-32 (32)
9 PF07974 EGF_2: EGF-like domai 98.0 8.8E-06 1.9E-10 41.5 3.1 27 62-89 6-32 (32)
10 KOG1219|consensus 97.9 2.4E-05 5.3E-10 70.7 5.3 71 19-90 3894-3977(4289)
11 smart00051 DSL delta serrate l 97.7 5.3E-05 1.1E-09 44.5 3.0 44 43-89 18-63 (63)
12 PF12661 hEGF: Human growth fa 97.3 0.00014 3.1E-09 29.7 1.2 13 77-89 1-13 (13)
13 smart00051 DSL delta serrate l 97.3 0.00034 7.3E-09 41.0 3.2 24 32-55 39-63 (63)
14 PF00008 EGF: EGF-like domain 96.9 0.00099 2.1E-08 33.7 2.2 25 62-87 4-31 (32)
15 PF00008 EGF: EGF-like domain 96.3 0.0023 5E-08 32.3 1.2 23 32-54 5-32 (32)
16 smart00179 EGF_CA Calcium-bind 95.2 0.034 7.3E-07 28.3 3.2 21 69-89 15-38 (39)
17 PF01414 DSL: Delta serrate li 95.1 0.0047 1E-07 36.1 -0.4 46 42-89 17-63 (63)
18 cd00055 EGF_Lam Laminin-type e 95.0 0.026 5.6E-07 31.2 2.4 25 70-96 13-37 (50)
19 smart00179 EGF_CA Calcium-bind 94.9 0.058 1.3E-06 27.4 3.5 25 32-56 10-39 (39)
20 PF00053 Laminin_EGF: Laminin 94.8 0.022 4.7E-07 31.3 1.8 26 69-96 11-36 (49)
21 cd00054 EGF_CA Calcium-binding 94.7 0.05 1.1E-06 27.2 3.0 13 77-89 25-37 (38)
22 cd00053 EGF Epidermal growth f 94.3 0.089 1.9E-06 25.7 3.3 11 77-87 22-32 (36)
23 PF01414 DSL: Delta serrate li 94.3 0.0067 1.5E-07 35.5 -1.1 21 35-55 42-63 (63)
24 cd00053 EGF Epidermal growth f 93.8 0.095 2.1E-06 25.6 2.9 24 32-55 7-35 (36)
25 KOG4260|consensus 93.8 0.064 1.4E-06 40.2 3.0 42 46-90 132-182 (350)
26 cd00054 EGF_CA Calcium-binding 93.8 0.081 1.7E-06 26.4 2.6 25 32-56 10-38 (38)
27 KOG1217|consensus 93.6 0.3 6.4E-06 37.6 6.6 73 33-106 280-371 (487)
28 KOG3607|consensus 93.5 0.097 2.1E-06 44.0 3.9 57 29-92 602-658 (716)
29 smart00180 EGF_Lam Laminin-typ 93.2 0.081 1.8E-06 28.7 2.1 26 69-96 11-36 (46)
30 KOG0994|consensus 93.2 0.23 4.9E-06 43.8 5.5 53 38-90 1078-1146(1758)
31 cd00055 EGF_Lam Laminin-type e 93.0 0.11 2.4E-06 28.6 2.4 17 40-56 17-33 (50)
32 KOG1218|consensus 91.9 0.82 1.8E-05 33.8 6.7 18 74-91 160-177 (316)
33 smart00181 EGF Epidermal growt 91.2 0.29 6.2E-06 24.3 2.6 23 32-55 7-34 (35)
34 KOG1218|consensus 91.0 0.92 2E-05 33.5 6.2 48 38-86 158-209 (316)
35 KOG1217|consensus 90.7 0.72 1.6E-05 35.4 5.5 50 42-92 152-208 (487)
36 PF01683 EB: EB module; Inter 90.7 0.37 8.1E-06 26.5 2.9 20 32-51 27-46 (52)
37 PF12947 EGF_3: EGF domain; I 90.7 0.23 4.9E-06 25.7 1.9 24 63-87 7-32 (36)
38 PHA02887 EGF-like protein; Pro 87.4 0.62 1.3E-05 30.6 2.5 27 63-91 93-123 (126)
39 PF01683 EB: EB module; Inter 87.0 3.4 7.3E-05 22.5 5.2 39 40-85 8-46 (52)
40 KOG4260|consensus 86.9 0.76 1.7E-05 34.6 3.1 27 31-57 150-183 (350)
41 PF07645 EGF_CA: Calcium-bindi 86.7 1.4 2.9E-05 23.2 3.3 23 62-85 10-34 (42)
42 KOG1836|consensus 85.4 1.2 2.5E-05 41.1 4.0 59 32-91 732-813 (1705)
43 KOG3607|consensus 85.3 0.87 1.9E-05 38.4 3.0 27 32-58 631-658 (716)
44 KOG0994|consensus 85.2 2.2 4.8E-05 38.1 5.4 69 4-78 792-875 (1758)
45 PHA02887 EGF-like protein; Pro 85.0 0.8 1.7E-05 30.1 2.1 26 32-58 93-124 (126)
46 PF04863 EGF_alliinase: Alliin 83.4 0.52 1.1E-05 26.7 0.7 14 77-90 37-50 (56)
47 KOG1214|consensus 83.2 2.4 5.2E-05 36.7 4.7 45 42-87 809-859 (1289)
48 PHA03099 epidermal growth fact 79.8 1.5 3.3E-05 29.2 2.0 17 42-58 67-83 (139)
49 PF12955 DUF3844: Domain of un 79.3 2.2 4.8E-05 27.3 2.6 10 83-92 53-62 (103)
50 PF09064 Tme5_EGF_like: Thromb 78.0 2.8 6E-05 21.4 2.2 27 55-87 3-29 (34)
51 PHA03099 epidermal growth fact 77.9 2.1 4.5E-05 28.6 2.2 23 69-91 56-82 (139)
52 KOG1836|consensus 59.5 7.6 0.00016 36.1 2.4 48 43-90 696-759 (1705)
53 KOG3516|consensus 54.7 12 0.00026 33.6 2.7 27 32-58 962-993 (1306)
54 PF12662 cEGF: Complement Clr- 47.6 12 0.00026 17.5 1.0 10 77-86 3-12 (24)
55 PF14670 FXa_inhibition: Coagu 41.9 21 0.00045 18.2 1.5 18 70-87 11-30 (36)
56 KOG3512|consensus 37.7 44 0.00095 27.3 3.3 22 69-90 407-428 (592)
57 PF01826 TIL: Trypsin Inhibito 28.0 57 0.0012 17.7 1.9 14 37-51 29-42 (55)
58 PF00954 S_locus_glycop: S-loc 25.7 96 0.0021 19.3 2.9 24 62-86 84-108 (110)
59 KOG1214|consensus 24.3 1.8E+02 0.0039 25.9 4.9 44 42-86 716-767 (1289)
60 PF05294 Toxin_5: Scorpion sho 22.0 1E+02 0.0022 15.4 1.9 15 32-46 18-32 (32)
No 1
>KOG1225|consensus
Probab=99.55 E-value=2.1e-14 Score=113.60 Aligned_cols=93 Identities=32% Similarity=0.781 Sum_probs=82.1
Q ss_pred CccCCCCCCCCCCCCCCC--------------CCCCCCCCCCCCCCCCCCcEEeCCcEEeCCCCCCCCCCCCCCCCCCCC
Q psy3697 1 MYTGEACDVPICPNNCSY--------------SSVNSCPSSKSDKPCSGQGVCIEGVCTCDAMYTGEACDVPICPNNCSY 66 (117)
Q Consensus 1 G~~G~~C~~~~C~~~c~~--------------~~~~~C~~~~~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~c~~~C~~ 66 (117)
+|+|+.|+...|+.+|++ |.+.+|....++.+|++++.+++++|+|+++|+|..|++..|+.+|++
T Consensus 241 ~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g 320 (525)
T KOG1225|consen 241 GYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSG 320 (525)
T ss_pred ceeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccCCccCCC
Confidence 688999998889999875 678888888888789999999999999999999999998889999999
Q ss_pred CCCeEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697 67 SNGVCKHEFHRCECMDKYKEMKQLFDACDV 96 (117)
Q Consensus 67 ~~G~C~~~~g~C~C~~g~~G~~C~~~~C~~ 96 (117)
+ |.|. .++|.|++||+|..|+.+.|..
T Consensus 321 ~-G~Ci--~G~C~C~~Gy~G~~C~~~~C~~ 347 (525)
T KOG1225|consen 321 H-GKCI--DGECLCDEGYTGELCIQRACSG 347 (525)
T ss_pred C-Cccc--CCceEeCCCCcCCcccccccCC
Confidence 8 9998 8999999999999999864443
No 2
>KOG1225|consensus
Probab=99.32 E-value=7.9e-12 Score=99.17 Aligned_cols=83 Identities=30% Similarity=0.688 Sum_probs=74.8
Q ss_pred CCCCCCCCCCCCCCCCCCcEEeCCcEEeCCCCCCCCCCCCCCCCCCCCCCCeEecCCCeEeeCCCCcccCCCcCccCCCC
Q psy3697 19 SSVNSCPSSKSDKPCSGQGVCIEGVCTCDAMYTGEACDVPICPNNCSYSNGVCKHEFHRCECMDKYKEMKQLFDACDVSK 98 (117)
Q Consensus 19 ~~~~~C~~~~~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~G~~C~~~~C~~~~ 98 (117)
|.++.|....++..|.++|.|+.|.|+|++||+|.+|+...|+.+|+.+ +.+. .+.|+|+++|.|.+|+++.|+.+|
T Consensus 242 ~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~-g~~~--~g~CiC~~g~~G~dCs~~~cpadC 318 (525)
T KOG1225|consen 242 YFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGG-GVCV--DGECICNPGYSGKDCSIRRCPADC 318 (525)
T ss_pred eeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCC-ceec--CCEeecCCCccccccccccCCccC
Confidence 5677787777788999999999999999999999999998899889987 8887 679999999999999998899999
Q ss_pred CCCCCC
Q psy3697 99 NAHSVA 104 (117)
Q Consensus 99 ~~~g~~ 104 (117)
.+||..
T Consensus 319 ~g~G~C 324 (525)
T KOG1225|consen 319 SGHGKC 324 (525)
T ss_pred CCCCcc
Confidence 888853
No 3
>KOG1226|consensus
Probab=99.11 E-value=3.3e-10 Score=92.22 Aligned_cols=84 Identities=26% Similarity=0.655 Sum_probs=66.9
Q ss_pred CccCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCcEEeCCcEEeCCCCC----CCCCCCCC--CC----CCCCCCC
Q psy3697 1 MYTGEACDVPICPNNCSYS--SVNSCPSSKSDKPCSGQGVCIEGVCTCDAMYT----GEACDVPI--CP----NNCSYSN 68 (117)
Q Consensus 1 G~~G~~C~~~~C~~~c~~~--~~~~C~~~~~~~~C~~~G~C~~g~C~C~~g~~----G~~C~~~~--c~----~~C~~~~ 68 (117)
||.|..|+ |+....+. --+.|......++|+++|.|+.|+|+|++... |.+|+.+. |+ ..|+++
T Consensus 485 G~~G~~CE---C~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~- 560 (783)
T KOG1226|consen 485 GWLGKKCE---CSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGH- 560 (783)
T ss_pred CCCCCccc---CCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeeeccCcccccccCcccCCC-
Confidence 78999995 44433222 13567776777899999999999999998877 99998653 54 349998
Q ss_pred CeEecCCCeEeeCCCCcccCCC
Q psy3697 69 GVCKHEFHRCECMDKYKEMKQL 90 (117)
Q Consensus 69 G~C~~~~g~C~C~~g~~G~~C~ 90 (117)
|+|. .|+|+|.+||+|..|+
T Consensus 561 G~C~--CG~CvC~~GwtG~~C~ 580 (783)
T KOG1226|consen 561 GRCE--CGRCVCNPGWTGSACN 580 (783)
T ss_pred CeEe--CCcEEcCCCCccCCCC
Confidence 9998 9999999999999997
No 4
>KOG1226|consensus
Probab=98.66 E-value=7.2e-08 Score=78.89 Aligned_cols=71 Identities=32% Similarity=0.753 Sum_probs=55.9
Q ss_pred CCCCCCCCcEEeCCcEEeCCCCCCCCCCCCC----CC----CCCCCCCCeEecCCCeEeeCCC-CcccCCCc-CccCCCC
Q psy3697 29 SDKPCSGQGVCIEGVCTCDAMYTGEACDVPI----CP----NNCSYSNGVCKHEFHRCECMDK-YKEMKQLF-DACDVSK 98 (117)
Q Consensus 29 ~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~----c~----~~C~~~~G~C~~~~g~C~C~~g-~~G~~C~~-~~C~~~~ 98 (117)
....|++||+|..|+|+|.+||+|..|+.+. |. .-|+++ |+|. .|+|+|... |.|+.|+. +.|+..|
T Consensus 553 ~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGr-G~C~--Cg~C~C~~~~~sG~~CE~cptc~~~C 629 (783)
T KOG1226|consen 553 KGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGR-GTCE--CGRCKCTDPPYSGEFCEKCPTCPDPC 629 (783)
T ss_pred cCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCCC-ceee--CCceEcCCCCcCcchhhcCCCCCCcc
Confidence 4568999999999999999999999999753 32 247886 9998 899999776 99999985 3455544
Q ss_pred CCCC
Q psy3697 99 NAHS 102 (117)
Q Consensus 99 ~~~g 102 (117)
..+.
T Consensus 630 ~~~~ 633 (783)
T KOG1226|consen 630 AENK 633 (783)
T ss_pred cccc
Confidence 4433
No 5
>KOG4289|consensus
Probab=98.29 E-value=1.1e-06 Score=76.19 Aligned_cols=50 Identities=28% Similarity=0.730 Sum_probs=42.8
Q ss_pred cEEeCCCCCCCCCCCCC--C-CCCCCCCCCeEecCCC--eEeeCCCCcccCCCcCc
Q psy3697 43 VCTCDAMYTGEACDVPI--C-PNNCSYSNGVCKHEFH--RCECMDKYKEMKQLFDA 93 (117)
Q Consensus 43 ~C~C~~g~~G~~C~~~~--c-~~~C~~~~G~C~~~~g--~C~C~~g~~G~~C~~~~ 93 (117)
.|.|++||+|++|+..+ | ..+|.++ |+|...+| +|+|.++|+|.+|++..
T Consensus 1223 rCrCPpGFTgd~CeTeiDlCYs~pC~nn-g~C~srEggYtCeCrpg~tGehCEvs~ 1277 (2531)
T KOG4289|consen 1223 RCRCPPGFTGDYCETEIDLCYSGPCGNN-GRCRSREGGYTCECRPGFTGEHCEVSA 1277 (2531)
T ss_pred eEeCCCCCCcccccchhHhhhcCCCCCC-CceEEecCceeEEecCCccccceeeec
Confidence 59999999999999754 5 3689997 99987666 99999999999998643
No 6
>KOG1219|consensus
Probab=98.26 E-value=1.9e-06 Score=77.36 Aligned_cols=76 Identities=26% Similarity=0.738 Sum_probs=62.5
Q ss_pred CCCCCCcEEeC-----CcEEeCCCCCCCCCCCCC--C-CCCCCCCCCeEecCCC--eEeeCCCCcccCCCcCc---cCCC
Q psy3697 31 KPCSGQGVCIE-----GVCTCDAMYTGEACDVPI--C-PNNCSYSNGVCKHEFH--RCECMDKYKEMKQLFDA---CDVS 97 (117)
Q Consensus 31 ~~C~~~G~C~~-----g~C~C~~g~~G~~C~~~~--c-~~~C~~~~G~C~~~~g--~C~C~~g~~G~~C~~~~---C~~~ 97 (117)
.+|++.|+|.. ++|.|++-|.|..|++.. | +++|..+ |+|....+ .|.|+.||+|..|+... |...
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~G-gtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n 3948 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTG-GTCIPFYNGFLCNCPNGYTGKRCEARGISECSKN 3948 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCC-CEEEecCCCeeEeCCCCccCceeecccccccccc
Confidence 38999999953 579999999999999753 5 3789886 99987654 89999999999998653 8887
Q ss_pred CCCCCCCCCC
Q psy3697 98 KNAHSVACPS 107 (117)
Q Consensus 98 ~~~~g~~c~~ 107 (117)
.+.+|..|.+
T Consensus 3949 ~C~~gg~C~n 3958 (4289)
T KOG1219|consen 3949 VCGTGGQCIN 3958 (4289)
T ss_pred cccCCceeec
Confidence 7777777754
No 7
>KOG4289|consensus
Probab=98.25 E-value=1.2e-06 Score=76.09 Aligned_cols=75 Identities=29% Similarity=0.829 Sum_probs=56.8
Q ss_pred CccCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCcEEeC----CcEEeCCCCCCCCCCCCC----C-CCCCCCCCC
Q psy3697 1 MYTGEACDVP--ICPNNCSYSSVNSCPSSKSDKPCSGQGVCIE----GVCTCDAMYTGEACDVPI----C-PNNCSYSNG 69 (117)
Q Consensus 1 G~~G~~C~~~--~C~~~c~~~~~~~C~~~~~~~~C~~~G~C~~----g~C~C~~g~~G~~C~~~~----c-~~~C~~~~G 69 (117)
||+|..|+.. .|..+ +|.++|+|+. ++|.|.++|+|..|+... | +..|.+. |
T Consensus 1229 GFTgd~CeTeiDlCYs~----------------pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~ng-g 1291 (2531)
T KOG4289|consen 1229 GFTGDYCETEIDLCYSG----------------PCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNG-G 1291 (2531)
T ss_pred CCCcccccchhHhhhcC----------------CCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCC-C
Confidence 6888888654 35544 8999999954 579999999999999753 4 4568886 9
Q ss_pred eEecCCC---eEeeCCC-CcccCCCcC
Q psy3697 70 VCKHEFH---RCECMDK-YKEMKQLFD 92 (117)
Q Consensus 70 ~C~~~~g---~C~C~~g-~~G~~C~~~ 92 (117)
+|..... .|.|++| |.++.|+..
T Consensus 1292 tC~~~~nggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1292 TCVNLLNGGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred EEeecCCCceeccCCCcccCCCceEEE
Confidence 9975432 7899875 677888753
No 8
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=98.14 E-value=2.6e-06 Score=43.45 Aligned_cols=24 Identities=54% Similarity=1.312 Sum_probs=20.7
Q ss_pred CCCCCcEEe--CCcEEeCCCCCCCCC
Q psy3697 32 PCSGQGVCI--EGVCTCDAMYTGEAC 55 (117)
Q Consensus 32 ~C~~~G~C~--~g~C~C~~g~~G~~C 55 (117)
.|+++|+|+ .++|+|+++|+|++|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 688999997 688999999998876
No 9
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.99 E-value=8.8e-06 Score=41.48 Aligned_cols=27 Identities=33% Similarity=0.605 Sum_probs=23.5
Q ss_pred CCCCCCCCeEecCCCeEeeCCCCcccCC
Q psy3697 62 NNCSYSNGVCKHEFHRCECMDKYKEMKQ 89 (117)
Q Consensus 62 ~~C~~~~G~C~~~~g~C~C~~g~~G~~C 89 (117)
..|+++ |+|+...++|+|++||+|++|
T Consensus 6 ~~C~~~-G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGH-GTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCC-CEEeCCCCEEECCCCCcCCCC
Confidence 358897 999876689999999999987
No 10
>KOG1219|consensus
Probab=97.85 E-value=2.4e-05 Score=70.66 Aligned_cols=71 Identities=30% Similarity=0.624 Sum_probs=55.4
Q ss_pred CCCCCCCCCCC---CCCCCCCcEEeC----CcEEeCCCCCCCCCCCCC---CC-CCCCCCCCeEecCCC--eEeeCCCCc
Q psy3697 19 SSVNSCPSSKS---DKPCSGQGVCIE----GVCTCDAMYTGEACDVPI---CP-NNCSYSNGVCKHEFH--RCECMDKYK 85 (117)
Q Consensus 19 ~~~~~C~~~~~---~~~C~~~G~C~~----g~C~C~~g~~G~~C~~~~---c~-~~C~~~~G~C~~~~g--~C~C~~g~~ 85 (117)
|.|..|..... ..+|..+|+|+. ..|.|+.+|+|..|+..- |. +.|..+ |.|....| .|.|.+||.
T Consensus 3894 ysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~g-g~C~n~~gsf~CncT~g~~ 3972 (4289)
T KOG1219|consen 3894 YSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTG-GQCINIPGSFHCNCTPGIL 3972 (4289)
T ss_pred ccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCC-ceeeccCCceEeccChhHh
Confidence 44555543322 348999999954 469999999999999762 65 678886 99998888 999999999
Q ss_pred ccCCC
Q psy3697 86 EMKQL 90 (117)
Q Consensus 86 G~~C~ 90 (117)
|..|.
T Consensus 3973 gr~c~ 3977 (4289)
T KOG1219|consen 3973 GRTCC 3977 (4289)
T ss_pred cccCc
Confidence 99984
No 11
>smart00051 DSL delta serrate ligand.
Probab=97.65 E-value=5.3e-05 Score=44.45 Aligned_cols=44 Identities=23% Similarity=0.440 Sum_probs=34.6
Q ss_pred cEEeCCCCCCCCCCCCCCC--CCCCCCCCeEecCCCeEeeCCCCcccCC
Q psy3697 43 VCTCDAMYTGEACDVPICP--NNCSYSNGVCKHEFHRCECMDKYKEMKQ 89 (117)
Q Consensus 43 ~C~C~~g~~G~~C~~~~c~--~~C~~~~G~C~~~~g~C~C~~g~~G~~C 89 (117)
.=.|+++|.|..|+.. |. ++..++ .+|+. .|.++|.+||+|++|
T Consensus 18 rv~C~~~~yG~~C~~~-C~~~~d~~~~-~~Cd~-~G~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCNKF-CRPRDDFFGH-YTCDE-NGNKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccCCE-eCcCccccCC-ccCCc-CCCEecCCCCcCCCC
Confidence 3468899999999853 42 345665 88975 789999999999987
No 12
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.30 E-value=0.00014 Score=29.66 Aligned_cols=13 Identities=15% Similarity=0.506 Sum_probs=7.9
Q ss_pred eEeeCCCCcccCC
Q psy3697 77 RCECMDKYKEMKQ 89 (117)
Q Consensus 77 ~C~C~~g~~G~~C 89 (117)
+|+|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 3667777777665
No 13
>smart00051 DSL delta serrate ligand.
Probab=97.28 E-value=0.00034 Score=41.02 Aligned_cols=24 Identities=25% Similarity=0.454 Sum_probs=19.7
Q ss_pred CCCCCcEE-eCCcEEeCCCCCCCCC
Q psy3697 32 PCSGQGVC-IEGVCTCDAMYTGEAC 55 (117)
Q Consensus 32 ~C~~~G~C-~~g~C~C~~g~~G~~C 55 (117)
+..++.+| ..|.++|.+||+|..|
T Consensus 39 d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 39 DFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred cccCCccCCcCCCEecCCCCcCCCC
Confidence 45677888 5689999999999876
No 14
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.87 E-value=0.00099 Score=33.69 Aligned_cols=25 Identities=32% Similarity=0.665 Sum_probs=15.3
Q ss_pred CCCCCCCCeEecCC-C--eEeeCCCCccc
Q psy3697 62 NNCSYSNGVCKHEF-H--RCECMDKYKEM 87 (117)
Q Consensus 62 ~~C~~~~G~C~~~~-g--~C~C~~g~~G~ 87 (117)
++|.++ |+|.... + +|+|++||+|+
T Consensus 4 ~~C~n~-g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNG-GTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTT-EEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCC-eEEEeCCCCCEEeECCCCCccC
Confidence 356665 6775544 2 67777777765
No 15
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.28 E-value=0.0023 Score=32.31 Aligned_cols=23 Identities=43% Similarity=1.100 Sum_probs=19.4
Q ss_pred CCCCCcEEeC-----CcEEeCCCCCCCC
Q psy3697 32 PCSGQGVCIE-----GVCTCDAMYTGEA 54 (117)
Q Consensus 32 ~C~~~G~C~~-----g~C~C~~g~~G~~ 54 (117)
+|.++|+|+. ..|+|++||+|..
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 8999999954 3699999999963
No 16
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.22 E-value=0.034 Score=28.28 Aligned_cols=21 Identities=33% Similarity=0.748 Sum_probs=10.4
Q ss_pred CeEecCCC--eEeeCCCCc-ccCC
Q psy3697 69 GVCKHEFH--RCECMDKYK-EMKQ 89 (117)
Q Consensus 69 G~C~~~~g--~C~C~~g~~-G~~C 89 (117)
|+|....+ .|.|.+||. |..|
T Consensus 15 ~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 15 GTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CEeECCCCCeEeECCCCCccCCcC
Confidence 45543332 455555555 5444
No 17
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=95.10 E-value=0.0047 Score=36.13 Aligned_cols=46 Identities=17% Similarity=0.264 Sum_probs=24.0
Q ss_pred CcEEeCCCCCCCCCCCCCCCC-CCCCCCCeEecCCCeEeeCCCCcccCC
Q psy3697 42 GVCTCDAMYTGEACDVPICPN-NCSYSNGVCKHEFHRCECMDKYKEMKQ 89 (117)
Q Consensus 42 g~C~C~~g~~G~~C~~~~c~~-~C~~~~G~C~~~~g~C~C~~g~~G~~C 89 (117)
.+-+|.+.|.|..|+...-+. .-..+ -+|+ ..|.=+|.+||+|++|
T Consensus 17 ~rv~C~~nyyG~~C~~~C~~~~d~~gh-y~Cd-~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSKFCKPRDDSFGH-YTCD-SNGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-EE---EEETTEE-EEE--SS--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccCCcCCCcCCcCC-cccC-CCCCCCCCCCCcCCCC
Confidence 456789999999998642121 12233 5787 4788899999999987
No 18
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=94.97 E-value=0.026 Score=31.21 Aligned_cols=25 Identities=20% Similarity=0.436 Sum_probs=20.1
Q ss_pred eEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697 70 VCKHEFHRCECMDKYKEMKQLFDACDV 96 (117)
Q Consensus 70 ~C~~~~g~C~C~~g~~G~~C~~~~C~~ 96 (117)
.|+...|+|.|.++|+|..|+ .|..
T Consensus 13 ~C~~~~G~C~C~~~~~G~~C~--~C~~ 37 (50)
T cd00055 13 QCDPGTGQCECKPNTTGRRCD--RCAP 37 (50)
T ss_pred cccCCCCEEeCCCcCCCCCCC--CCCC
Confidence 577678899999999999997 3543
No 19
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=94.86 E-value=0.058 Score=27.37 Aligned_cols=25 Identities=40% Similarity=1.132 Sum_probs=20.2
Q ss_pred CCCCCcEEeC----CcEEeCCCCC-CCCCC
Q psy3697 32 PCSGQGVCIE----GVCTCDAMYT-GEACD 56 (117)
Q Consensus 32 ~C~~~G~C~~----g~C~C~~g~~-G~~C~ 56 (117)
+|.++++|+. ..|.|+++|. |..|+
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 6888889964 3599999999 88774
No 20
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=94.83 E-value=0.022 Score=31.25 Aligned_cols=26 Identities=15% Similarity=0.319 Sum_probs=20.6
Q ss_pred CeEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697 69 GVCKHEFHRCECMDKYKEMKQLFDACDV 96 (117)
Q Consensus 69 G~C~~~~g~C~C~~g~~G~~C~~~~C~~ 96 (117)
.+|+...++|.|.++|+|+.|+. |..
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~~--C~~ 36 (49)
T PF00053_consen 11 QTCDPSTGQCVCKPGTTGPRCDQ--CKP 36 (49)
T ss_dssp SSEEETCEEESBSTTEESTTS-E--E-T
T ss_pred CcccCCCCEEeccccccCCcCcC--CCC
Confidence 58988889999999999999983 543
No 21
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=94.74 E-value=0.05 Score=27.17 Aligned_cols=13 Identities=31% Similarity=0.705 Sum_probs=6.6
Q ss_pred eEeeCCCCcccCC
Q psy3697 77 RCECMDKYKEMKQ 89 (117)
Q Consensus 77 ~C~C~~g~~G~~C 89 (117)
.|.|.++|.|..|
T Consensus 25 ~C~C~~g~~g~~C 37 (38)
T cd00054 25 RCSCPPGYTGRNC 37 (38)
T ss_pred EeECCCCCcCCcC
Confidence 4555555555443
No 22
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.30 E-value=0.089 Score=25.75 Aligned_cols=11 Identities=36% Similarity=0.778 Sum_probs=6.1
Q ss_pred eEeeCCCCccc
Q psy3697 77 RCECMDKYKEM 87 (117)
Q Consensus 77 ~C~C~~g~~G~ 87 (117)
+|.|+.||.|.
T Consensus 22 ~C~C~~g~~g~ 32 (36)
T cd00053 22 RCVCPPGYTGD 32 (36)
T ss_pred EeECCCCCccc
Confidence 55555555554
No 23
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=94.29 E-value=0.0067 Score=35.49 Aligned_cols=21 Identities=33% Similarity=0.702 Sum_probs=14.7
Q ss_pred CCcEE-eCCcEEeCCCCCCCCC
Q psy3697 35 GQGVC-IEGVCTCDAMYTGEAC 55 (117)
Q Consensus 35 ~~G~C-~~g~C~C~~g~~G~~C 55 (117)
+|-+| .+|+-+|.+||+|+.|
T Consensus 42 ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 42 GHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp EEEEE-SS--EEE-TTEESTTS
T ss_pred CCcccCCCCCCCCCCCCcCCCC
Confidence 45678 5689999999999986
No 24
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=93.84 E-value=0.095 Score=25.64 Aligned_cols=24 Identities=46% Similarity=1.222 Sum_probs=19.5
Q ss_pred CCCCCcEEeC----CcEEeCCCCCCC-CC
Q psy3697 32 PCSGQGVCIE----GVCTCDAMYTGE-AC 55 (117)
Q Consensus 32 ~C~~~G~C~~----g~C~C~~g~~G~-~C 55 (117)
.|.+++.|+. ..|.|+.+|.|. .|
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 6888888854 479999999998 55
No 25
>KOG4260|consensus
Probab=93.82 E-value=0.064 Score=40.16 Aligned_cols=42 Identities=26% Similarity=0.574 Sum_probs=31.8
Q ss_pred eCCCCCCCCCCCCCCC----CCCCCCCCeEec-----CCCeEeeCCCCcccCCC
Q psy3697 46 CDAMYTGEACDVPICP----NNCSYSNGVCKH-----EFHRCECMDKYKEMKQL 90 (117)
Q Consensus 46 C~~g~~G~~C~~~~c~----~~C~~~~G~C~~-----~~g~C~C~~g~~G~~C~ 90 (117)
|++|-.|++|.. |+ .+|.+. |.|+. ..|.|.|.+||+|+.|.
T Consensus 132 Cp~gtyGpdCl~--Cpggser~C~Gn-G~C~GdGsR~GsGkCkC~~GY~Gp~C~ 182 (350)
T KOG4260|consen 132 CPDGTYGPDCLQ--CPGGSERPCFGN-GSCHGDGSREGSGKCKCETGYTGPLCR 182 (350)
T ss_pred cCCCCcCCcccc--CCCCCcCCcCCC-CcccCCCCCCCCCcccccCCCCCcccc
Confidence 678899999874 43 457776 78853 23589999999999885
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=93.80 E-value=0.081 Score=26.38 Aligned_cols=25 Identities=40% Similarity=1.153 Sum_probs=19.9
Q ss_pred CCCCCcEEeC----CcEEeCCCCCCCCCC
Q psy3697 32 PCSGQGVCIE----GVCTCDAMYTGEACD 56 (117)
Q Consensus 32 ~C~~~G~C~~----g~C~C~~g~~G~~C~ 56 (117)
.|.+++.|+. ..|.|.++|.|..|+
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 6777888854 369999999998774
No 27
>KOG1217|consensus
Probab=93.64 E-value=0.3 Score=37.56 Aligned_cols=73 Identities=26% Similarity=0.649 Sum_probs=48.5
Q ss_pred CCCCcEEeC----CcEEeCCCCCCCCC-CC---CCCC-----CCCCCCCCeEecC----CCeEeeCCCCcccCCCcC--c
Q psy3697 33 CSGQGVCIE----GVCTCDAMYTGEAC-DV---PICP-----NNCSYSNGVCKHE----FHRCECMDKYKEMKQLFD--A 93 (117)
Q Consensus 33 C~~~G~C~~----g~C~C~~g~~G~~C-~~---~~c~-----~~C~~~~G~C~~~----~g~C~C~~g~~G~~C~~~--~ 93 (117)
|.++++|+. ..|.|+++|+|..| .. ..|. ..|.+. +.|... ...|.|.++|.|..|+.. .
T Consensus 280 c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g-~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~ 358 (487)
T KOG1217|consen 280 CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANG-GTCNTLGSFGGFRCACGPGFTGRRCEDSNDE 358 (487)
T ss_pred cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCC-cccccCCCCCCCCcCCCCCCCCCccccCCcc
Confidence 788899954 57999999999998 21 2232 235554 677321 126999999999999866 5
Q ss_pred cCCCCCCCCCCCC
Q psy3697 94 CDVSKNAHSVACP 106 (117)
Q Consensus 94 C~~~~~~~g~~c~ 106 (117)
|......++..|.
T Consensus 359 C~~~~~~~~~~c~ 371 (487)
T KOG1217|consen 359 CASSPCCPGGTCV 371 (487)
T ss_pred ccCCccccCCEec
Confidence 7665334444343
No 28
>KOG3607|consensus
Probab=93.52 E-value=0.097 Score=43.96 Aligned_cols=57 Identities=23% Similarity=0.611 Sum_probs=42.7
Q ss_pred CCCCCCCCcEEeCCcEEeCCCCCCCCCCCCCCCCCCCCCCCeEecCCCeEeeCCCCcccCCCcC
Q psy3697 29 SDKPCSGQGVCIEGVCTCDAMYTGEACDVPICPNNCSYSNGVCKHEFHRCECMDKYKEMKQLFD 92 (117)
Q Consensus 29 ~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~G~~C~~~ 92 (117)
.+..|...-.|++.+|+=.. ..+..+ |+..|+++ |.|+. ...|.|.+||.+++|++.
T Consensus 602 dGt~Cg~~~vC~~~~C~~~~-v~~~~~----~~~~C~g~-GVCnn-~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 602 DGTSCGPGMICINHRCLSAS-VLNSSC----CPTTCNGH-GVCNN-ELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred CCCccCCCceecCCcchhhh-hhcccc----cccccCCC-cccCC-CcceeeCCCCCCCccccc
Confidence 34567777778888886555 555443 44569998 99974 669999999999999853
No 29
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=93.20 E-value=0.081 Score=28.73 Aligned_cols=26 Identities=23% Similarity=0.577 Sum_probs=20.6
Q ss_pred CeEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697 69 GVCKHEFHRCECMDKYKEMKQLFDACDV 96 (117)
Q Consensus 69 G~C~~~~g~C~C~~g~~G~~C~~~~C~~ 96 (117)
..|+...|+|.|.++|+|.+|+ .|..
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~--~C~~ 36 (46)
T smart00180 11 GTCDPDTGQCECKPNVTGRRCD--RCAP 36 (46)
T ss_pred CcccCCCCEEECCCCCCCCCCC--cCCC
Confidence 4677667899999999999997 3543
No 30
>KOG0994|consensus
Probab=93.16 E-value=0.23 Score=43.84 Aligned_cols=53 Identities=23% Similarity=0.584 Sum_probs=39.2
Q ss_pred EE--eCCcEEeCCCCCCCCCCCCC----------CC-CCCCCC---CCeEecCCCeEeeCCCCcccCCC
Q psy3697 38 VC--IEGVCTCDAMYTGEACDVPI----------CP-NNCSYS---NGVCKHEFHRCECMDKYKEMKQL 90 (117)
Q Consensus 38 ~C--~~g~C~C~~g~~G~~C~~~~----------c~-~~C~~~---~G~C~~~~g~C~C~~g~~G~~C~ 90 (117)
.| -+|+|+|.+||.|..|++-. |- -.|... .-.|+..+|+|+|.+|-.|+.|.
T Consensus 1078 qCN~ftGQCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1078 QCNEFTGQCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred cccccccceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchh
Confidence 56 46899999999999997521 10 123332 13688888999999999999997
No 31
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=92.97 E-value=0.11 Score=28.60 Aligned_cols=17 Identities=41% Similarity=0.886 Sum_probs=15.5
Q ss_pred eCCcEEeCCCCCCCCCC
Q psy3697 40 IEGVCTCDAMYTGEACD 56 (117)
Q Consensus 40 ~~g~C~C~~g~~G~~C~ 56 (117)
.+|+|.|.++|+|..|+
T Consensus 17 ~~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 17 GTGQCECKPNTTGRRCD 33 (50)
T ss_pred CCCEEeCCCcCCCCCCC
Confidence 46899999999999998
No 32
>KOG1218|consensus
Probab=91.91 E-value=0.82 Score=33.78 Aligned_cols=18 Identities=11% Similarity=0.202 Sum_probs=11.4
Q ss_pred CCCeEeeCCCCcccCCCc
Q psy3697 74 EFHRCECMDKYKEMKQLF 91 (117)
Q Consensus 74 ~~g~C~C~~g~~G~~C~~ 91 (117)
..+.|.|.+||.|..+..
T Consensus 160 ~~~~c~c~~g~~g~~~~~ 177 (316)
T KOG1218|consen 160 KNGICTCQPGFVGVFCVE 177 (316)
T ss_pred CCCceeccCCcccccccc
Confidence 355666777777766653
No 33
>smart00181 EGF Epidermal growth factor-like domain.
Probab=91.16 E-value=0.29 Score=24.27 Aligned_cols=23 Identities=48% Similarity=1.197 Sum_probs=14.6
Q ss_pred CCCCCcEEeC----CcEEeCCCCCC-CCC
Q psy3697 32 PCSGQGVCIE----GVCTCDAMYTG-EAC 55 (117)
Q Consensus 32 ~C~~~G~C~~----g~C~C~~g~~G-~~C 55 (117)
.|.++ +|+. ..|.|++||.| ..|
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 56655 6643 35777777777 554
No 34
>KOG1218|consensus
Probab=91.04 E-value=0.92 Score=33.51 Aligned_cols=48 Identities=25% Similarity=0.688 Sum_probs=35.3
Q ss_pred EEeCCcEEeCCCCCCCCCCCCC--CC--CCCCCCCCeEecCCCeEeeCCCCcc
Q psy3697 38 VCIEGVCTCDAMYTGEACDVPI--CP--NNCSYSNGVCKHEFHRCECMDKYKE 86 (117)
Q Consensus 38 ~C~~g~C~C~~g~~G~~C~~~~--c~--~~C~~~~G~C~~~~g~C~C~~g~~G 86 (117)
.+..+.|.|.+||.|..+.... |. ..+.+. +.|+...+.+.+.+.|.+
T Consensus 158 ~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g-~~C~~~~~~~~~~~~~~~ 209 (316)
T KOG1218|consen 158 DCKNGICTCQPGFVGVFCVESCSGCSPLTACENG-AKCNRSTGSCLCYPGPSG 209 (316)
T ss_pred CCCCCceeccCCcccccccccCCCcCCCcccCCC-CeeeccccccccCCCCcc
Confidence 3466889999999999998653 33 234553 689877778888777754
No 35
>KOG1217|consensus
Probab=90.71 E-value=0.72 Score=35.44 Aligned_cols=50 Identities=26% Similarity=0.590 Sum_probs=38.7
Q ss_pred CcEEeCCCCCCCCCCCC--CCC---CCCCCCCCeEecCCC--eEeeCCCCcccCCCcC
Q psy3697 42 GVCTCDAMYTGEACDVP--ICP---NNCSYSNGVCKHEFH--RCECMDKYKEMKQLFD 92 (117)
Q Consensus 42 g~C~C~~g~~G~~C~~~--~c~---~~C~~~~G~C~~~~g--~C~C~~g~~G~~C~~~ 92 (117)
..|.|..+|.+..++.. .|. ..|.+. +.|....+ .|.|.++|.+..++..
T Consensus 152 ~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~-~~C~~~~~~~~C~c~~~~~~~~~~~~ 208 (487)
T KOG1217|consen 152 FRCSCTEGYEGEPCETDLDECIQYSSPCQNG-GTCVNTGGSYLCSCPPGYTGSTCETT 208 (487)
T ss_pred eeeeeCCCcccccccccccccccCCCCcCCC-cccccCCCCeeEeCCCCccCCcCcCC
Confidence 45899999999999864 454 347775 88876544 7999999999998754
No 36
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=90.71 E-value=0.37 Score=26.49 Aligned_cols=20 Identities=40% Similarity=1.097 Sum_probs=17.1
Q ss_pred CCCCCcEEeCCcEEeCCCCC
Q psy3697 32 PCSGQGVCIEGVCTCDAMYT 51 (117)
Q Consensus 32 ~C~~~G~C~~g~C~C~~g~~ 51 (117)
.|..+..|+.+.|+|+++|.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcCCEeECCCCCE
Confidence 67788889999999999875
No 37
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=90.69 E-value=0.23 Score=25.68 Aligned_cols=24 Identities=21% Similarity=0.567 Sum_probs=14.1
Q ss_pred CCCCCCCeEecCCC--eEeeCCCCccc
Q psy3697 63 NCSYSNGVCKHEFH--RCECMDKYKEM 87 (117)
Q Consensus 63 ~C~~~~G~C~~~~g--~C~C~~g~~G~ 87 (117)
.|..+ ++|....+ +|+|.+||.|+
T Consensus 7 ~C~~n-A~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPN-ATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TT-CEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCC-cEeecCCCCEEeECCCCCccC
Confidence 35554 67765444 78888888775
No 38
>PHA02887 EGF-like protein; Provisional
Probab=87.40 E-value=0.62 Score=30.56 Aligned_cols=27 Identities=22% Similarity=0.479 Sum_probs=20.0
Q ss_pred CCCCCCCeEecCC----CeEeeCCCCcccCCCc
Q psy3697 63 NCSYSNGVCKHEF----HRCECMDKYKEMKQLF 91 (117)
Q Consensus 63 ~C~~~~G~C~~~~----g~C~C~~g~~G~~C~~ 91 (117)
.|- + |+|.-.. -.|.|+.||+|..|+.
T Consensus 93 YCi-H-G~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCI-N-GECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred Eee-C-CEEEccccCCCceeECCCCcccCCCCc
Confidence 455 3 7885433 3899999999999974
No 39
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=86.96 E-value=3.4 Score=22.55 Aligned_cols=39 Identities=33% Similarity=0.769 Sum_probs=25.8
Q ss_pred eCCcEEeCCCCCCCCCCCCCCCCCCCCCCCeEecCCCeEeeCCCCc
Q psy3697 40 IEGVCTCDAMYTGEACDVPICPNNCSYSNGVCKHEFHRCECMDKYK 85 (117)
Q Consensus 40 ~~g~C~C~~g~~G~~C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~ 85 (117)
+.++|+=. ...|..|+.. ..|..+ ..|. .++|.|++||+
T Consensus 8 ~~~~C~~~-~~~g~~C~~~---~qC~~~-s~C~--~g~C~C~~g~~ 46 (52)
T PF01683_consen 8 INGQCVPR-VQPGESCESD---EQCIGG-SVCV--NGRCQCPPGYV 46 (52)
T ss_pred ECCEECcc-CCCCCCCCCc---CCCCCc-CEEc--CCEeECCCCCE
Confidence 34445433 4556678753 356665 8996 78999999985
No 40
>KOG4260|consensus
Probab=86.91 E-value=0.76 Score=34.57 Aligned_cols=27 Identities=44% Similarity=1.165 Sum_probs=22.2
Q ss_pred CCCCCCcEEe-------CCcEEeCCCCCCCCCCC
Q psy3697 31 KPCSGQGVCI-------EGVCTCDAMYTGEACDV 57 (117)
Q Consensus 31 ~~C~~~G~C~-------~g~C~C~~g~~G~~C~~ 57 (117)
.+|.++|.|. +|.|.|.+||+|+.|..
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~ 183 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY 183 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccc
Confidence 4688889883 26899999999999864
No 41
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=86.66 E-value=1.4 Score=23.19 Aligned_cols=23 Identities=30% Similarity=0.798 Sum_probs=16.7
Q ss_pred CCCCCCCCeEecCCC--eEeeCCCCc
Q psy3697 62 NNCSYSNGVCKHEFH--RCECMDKYK 85 (117)
Q Consensus 62 ~~C~~~~G~C~~~~g--~C~C~~g~~ 85 (117)
..|... ++|....| .|.|++||.
T Consensus 10 ~~C~~~-~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 10 HNCPEN-GTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSSTT-SEEEEETTEEEEEESTTEE
T ss_pred CcCCCC-CEEEcCCCCEEeeCCCCcE
Confidence 456655 78877666 788888887
No 42
>KOG1836|consensus
Probab=85.40 E-value=1.2 Score=41.08 Aligned_cols=59 Identities=25% Similarity=0.656 Sum_probs=37.3
Q ss_pred CCCCC-cEE--eCCcEEeCCCCCCCCCCCCC--------------CC-CCCCCCCCeEe----cCCCeEe-eCCCCcccC
Q psy3697 32 PCSGQ-GVC--IEGVCTCDAMYTGEACDVPI--------------CP-NNCSYSNGVCK----HEFHRCE-CMDKYKEMK 88 (117)
Q Consensus 32 ~C~~~-G~C--~~g~C~C~~g~~G~~C~~~~--------------c~-~~C~~~~G~C~----~~~g~C~-C~~g~~G~~ 88 (117)
+|.+| ..| .+|.|.|...-.|..|+.-. |. -+|.+. +.|. .....|. |+++|+|..
T Consensus 732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~-~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 732 DCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNG-GACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred ccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCCCccCCCCCC-hhhcCcCcccceecCCCCCCCcccc
Confidence 55554 557 45788888777777776410 11 124443 4443 2345898 999999999
Q ss_pred CCc
Q psy3697 89 QLF 91 (117)
Q Consensus 89 C~~ 91 (117)
|+.
T Consensus 811 Ce~ 813 (1705)
T KOG1836|consen 811 CEE 813 (1705)
T ss_pred ccc
Confidence 974
No 43
>KOG3607|consensus
Probab=85.29 E-value=0.87 Score=38.44 Aligned_cols=27 Identities=33% Similarity=0.800 Sum_probs=24.2
Q ss_pred CCCCCcEEe-CCcEEeCCCCCCCCCCCC
Q psy3697 32 PCSGQGVCI-EGVCTCDAMYTGEACDVP 58 (117)
Q Consensus 32 ~C~~~G~C~-~g~C~C~~g~~G~~C~~~ 58 (117)
.|.+||+|. ...|+|.++|.+++|+..
T Consensus 631 ~C~g~GVCnn~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 631 TCNGHGVCNNELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred ccCCCcccCCCcceeeCCCCCCCccccc
Confidence 799999995 478999999999999975
No 44
>KOG0994|consensus
Probab=85.20 E-value=2.2 Score=38.07 Aligned_cols=69 Identities=28% Similarity=0.683 Sum_probs=37.9
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCc----EE--eCCcEEeCCCCCCCCCCCC--------CC-CCCCCCCC
Q psy3697 4 GEACDVPICPNNCSYSSVNSCPSSKSDKPCSGQG----VC--IEGVCTCDAMYTGEACDVP--------IC-PNNCSYSN 68 (117)
Q Consensus 4 G~~C~~~~C~~~c~~~~~~~C~~~~~~~~C~~~G----~C--~~g~C~C~~g~~G~~C~~~--------~c-~~~C~~~~ 68 (117)
|..|+ .|...-.+|....|..- +|...| .| ++|+|+|.+|-.|..|+.- .| +-.|.+|.
T Consensus 792 GR~Cd--qCApGtyGFGPsGCk~C----dC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA 865 (1758)
T KOG0994|consen 792 GRRCD--QCAPGTYGFGPSGCKAC----DCNSIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHA 865 (1758)
T ss_pred ccccc--ccCCcccCcCCccCccc----cccccccccccccccccceeeccccchhhccccCCCccCCCcCccccccCcc
Confidence 44443 35555444444445432 444333 35 6689999998888887741 12 11344443
Q ss_pred CeEecCCCeE
Q psy3697 69 GVCKHEFHRC 78 (117)
Q Consensus 69 G~C~~~~g~C 78 (117)
.+|++..|.|
T Consensus 866 ~~Cd~~tGaC 875 (1758)
T KOG0994|consen 866 DTCDPITGAC 875 (1758)
T ss_pred cccCcccccc
Confidence 4676666665
No 45
>PHA02887 EGF-like protein; Provisional
Probab=84.95 E-value=0.8 Score=30.06 Aligned_cols=26 Identities=42% Similarity=0.957 Sum_probs=20.0
Q ss_pred CCCCCcEEe------CCcEEeCCCCCCCCCCCC
Q psy3697 32 PCSGQGVCI------EGVCTCDAMYTGEACDVP 58 (117)
Q Consensus 32 ~C~~~G~C~------~g~C~C~~g~~G~~C~~~ 58 (117)
-|. ||+|. ...|.|+.||+|..|+..
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCCcc
Confidence 355 68882 246999999999999863
No 46
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=83.41 E-value=0.52 Score=26.75 Aligned_cols=14 Identities=29% Similarity=0.468 Sum_probs=4.9
Q ss_pred eEeeCCCCcccCCC
Q psy3697 77 RCECMDKYKEMKQL 90 (117)
Q Consensus 77 ~C~C~~g~~G~~C~ 90 (117)
.|+|+.-|.|++|+
T Consensus 37 ~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 37 VCECNSCYGGPDCS 50 (56)
T ss_dssp --EE-TTEESTTS-
T ss_pred cccccCCcCCCCcc
Confidence 34444444444444
No 47
>KOG1214|consensus
Probab=83.22 E-value=2.4 Score=36.65 Aligned_cols=45 Identities=27% Similarity=0.718 Sum_probs=32.5
Q ss_pred CcEEeCCCCCCCC--CC-CCCC-CCCCCCCCCeEecCCC--eEeeCCCCccc
Q psy3697 42 GVCTCDAMYTGEA--CD-VPIC-PNNCSYSNGVCKHEFH--RCECMDKYKEM 87 (117)
Q Consensus 42 g~C~C~~g~~G~~--C~-~~~c-~~~C~~~~G~C~~~~g--~C~C~~g~~G~ 87 (117)
+.|.|-+||.|+- |. ...| +..|... ++|....+ .|.|.+||.|+
T Consensus 809 y~C~CLPGfsGDG~~c~dvDeC~psrChp~-A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 809 YSCACLPGFSGDGHQCTDVDECSPSRCHPA-ATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred EEEeecCCccCCccccccccccCccccCCC-ceEecCCCcceeecccCccCC
Confidence 4699999999863 33 2334 3456665 88876555 89999999987
No 48
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=79.78 E-value=1.5 Score=29.22 Aligned_cols=17 Identities=35% Similarity=0.718 Sum_probs=13.6
Q ss_pred CcEEeCCCCCCCCCCCC
Q psy3697 42 GVCTCDAMYTGEACDVP 58 (117)
Q Consensus 42 g~C~C~~g~~G~~C~~~ 58 (117)
..|.|+.||+|..||..
T Consensus 67 ~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 67 MYCRCSHGYTGIRCQHV 83 (139)
T ss_pred ceeECCCCcccccccce
Confidence 45888888888888864
No 49
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=79.30 E-value=2.2 Score=27.34 Aligned_cols=10 Identities=0% Similarity=-0.223 Sum_probs=7.2
Q ss_pred CCcccCCCcC
Q psy3697 83 KYKEMKQLFD 92 (117)
Q Consensus 83 g~~G~~C~~~ 92 (117)
.|.|+.|+..
T Consensus 53 ~W~G~aCqKk 62 (103)
T PF12955_consen 53 HWGGPACQKK 62 (103)
T ss_pred eecccccccc
Confidence 5888888753
No 50
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=78.01 E-value=2.8 Score=21.44 Aligned_cols=27 Identities=26% Similarity=0.680 Sum_probs=15.2
Q ss_pred CCCCCCCCCCCCCCCeEecCCCeEeeCCCCccc
Q psy3697 55 CDVPICPNNCSYSNGVCKHEFHRCECMDKYKEM 87 (117)
Q Consensus 55 C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~G~ 87 (117)
|+...|+..|..+ ..+.|.|++||.-+
T Consensus 3 Cn~t~CpA~CDpn------~~~~C~CPeGyIld 29 (34)
T PF09064_consen 3 CNQTECPADCDPN------SPGQCFCPEGYILD 29 (34)
T ss_pred cccccCCCccCCC------CCCceeCCCceEec
Confidence 4544555444432 24478888888543
No 51
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=77.94 E-value=2.1 Score=28.60 Aligned_cols=23 Identities=26% Similarity=0.613 Sum_probs=17.9
Q ss_pred CeEecCC----CeEeeCCCCcccCCCc
Q psy3697 69 GVCKHEF----HRCECMDKYKEMKQLF 91 (117)
Q Consensus 69 G~C~~~~----g~C~C~~g~~G~~C~~ 91 (117)
|.|.-.. -.|.|..||+|..|+.
T Consensus 56 G~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 56 GDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CEEEeeccCCCceeECCCCcccccccc
Confidence 6774322 3799999999999984
No 52
>KOG1836|consensus
Probab=59.52 E-value=7.6 Score=36.09 Aligned_cols=48 Identities=27% Similarity=0.701 Sum_probs=34.0
Q ss_pred cEEeCCCCCCCCCCCC-------------CC---CCCCCCCCCeEecCCCeEeeCCCCcccCCC
Q psy3697 43 VCTCDAMYTGEACDVP-------------IC---PNNCSYSNGVCKHEFHRCECMDKYKEMKQL 90 (117)
Q Consensus 43 ~C~C~~g~~G~~C~~~-------------~c---~~~C~~~~G~C~~~~g~C~C~~g~~G~~C~ 90 (117)
.|.|+.+|+|.+|+.- .+ +-+|.++..+|+..+|.|.|.+.-.|..|+
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~ 759 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCA 759 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchh
Confidence 4999999999999851 01 113444225788888999888777777775
No 53
>KOG3516|consensus
Probab=54.74 E-value=12 Score=33.58 Aligned_cols=27 Identities=37% Similarity=0.987 Sum_probs=22.3
Q ss_pred CCCCCcEEeC----CcEEeC-CCCCCCCCCCC
Q psy3697 32 PCSGQGVCIE----GVCTCD-AMYTGEACDVP 58 (117)
Q Consensus 32 ~C~~~G~C~~----g~C~C~-~g~~G~~C~~~ 58 (117)
.|.|+|.|+. +.|.|. ..|.|++|...
T Consensus 962 ~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~e 993 (1306)
T KOG3516|consen 962 PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKE 993 (1306)
T ss_pred cccCCCEEEEecCceeeccccCcCCCCccccc
Confidence 8999999954 579986 68999999864
No 54
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=47.60 E-value=12 Score=17.47 Aligned_cols=10 Identities=30% Similarity=0.843 Sum_probs=8.0
Q ss_pred eEeeCCCCcc
Q psy3697 77 RCECMDKYKE 86 (117)
Q Consensus 77 ~C~C~~g~~G 86 (117)
+|.|++||+-
T Consensus 3 ~C~C~~Gy~l 12 (24)
T PF12662_consen 3 TCSCPPGYQL 12 (24)
T ss_pred EeeCCCCCcC
Confidence 6889999963
No 55
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=41.94 E-value=21 Score=18.23 Aligned_cols=18 Identities=33% Similarity=0.741 Sum_probs=10.3
Q ss_pred eEecCCC--eEeeCCCCccc
Q psy3697 70 VCKHEFH--RCECMDKYKEM 87 (117)
Q Consensus 70 ~C~~~~g--~C~C~~g~~G~ 87 (117)
.|....+ +|.|++||.-.
T Consensus 11 ~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp EEEEETTSEEEE-STTEEE-
T ss_pred CCccCCCceEeECCCCCEEC
Confidence 4544333 89999998644
No 56
>KOG3512|consensus
Probab=37.68 E-value=44 Score=27.35 Aligned_cols=22 Identities=14% Similarity=0.434 Sum_probs=20.0
Q ss_pred CeEecCCCeEeeCCCCcccCCC
Q psy3697 69 GVCKHEFHRCECMDKYKEMKQL 90 (117)
Q Consensus 69 G~C~~~~g~C~C~~g~~G~~C~ 90 (117)
-+|.+.+|+|.|.+|-+|..|.
T Consensus 407 ktCNq~tGqCpCkeGvtG~tCn 428 (592)
T KOG3512|consen 407 KTCNQTTGQCPCKEGVTGLTCN 428 (592)
T ss_pred ccccccCCcccCCCCCcccccc
Confidence 5898889999999999999985
No 57
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=27.98 E-value=57 Score=17.66 Aligned_cols=14 Identities=43% Similarity=1.115 Sum_probs=8.5
Q ss_pred cEEeCCcEEeCCCCC
Q psy3697 37 GVCIEGVCTCDAMYT 51 (117)
Q Consensus 37 G~C~~g~C~C~~g~~ 51 (117)
..|+.| |.|++||.
T Consensus 29 ~~C~~g-C~C~~G~v 42 (55)
T PF01826_consen 29 EPCVEG-CFCPPGYV 42 (55)
T ss_dssp SS-ESE-EEETTTEE
T ss_pred CCCCcc-CCCCCCee
Confidence 345556 77777764
No 58
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=25.67 E-value=96 Score=19.34 Aligned_cols=24 Identities=21% Similarity=0.808 Sum_probs=14.3
Q ss_pred CCCCCCCCeEecCC-CeEeeCCCCcc
Q psy3697 62 NNCSYSNGVCKHEF-HRCECMDKYKE 86 (117)
Q Consensus 62 ~~C~~~~G~C~~~~-g~C~C~~g~~G 86 (117)
..|... |.|.... ..|.|.+||.-
T Consensus 84 ~~CG~~-g~C~~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 84 GFCGPN-GICNSNNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCc-cEeCCCCCCceECCCCcCC
Confidence 356665 7775322 26888887753
No 59
>KOG1214|consensus
Probab=24.33 E-value=1.8e+02 Score=25.92 Aligned_cols=44 Identities=30% Similarity=0.639 Sum_probs=30.2
Q ss_pred CcEEeCCCCCCC--CCCCC----CCCCCCCCCCCeEecCCC--eEeeCCCCcc
Q psy3697 42 GVCTCDAMYTGE--ACDVP----ICPNNCSYSNGVCKHEFH--RCECMDKYKE 86 (117)
Q Consensus 42 g~C~C~~g~~G~--~C~~~----~c~~~C~~~~G~C~~~~g--~C~C~~g~~G 86 (117)
.+|.|..+|.|. .|..+ .+...|..+ ..|....+ +|+|..+|.-
T Consensus 716 ~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~-s~Cin~pg~~rceC~~gy~F 767 (1289)
T KOG1214|consen 716 YTCECSSGYQGDGRNCVDENECATGFHRCGPN-SVCINLPGSYRCECRSGYEF 767 (1289)
T ss_pred eEEEEeeccCCCCCCCCChhhhccCCCCCCCC-ceeecCCCceeEEEeeccee
Confidence 479999999875 44322 244578776 88976665 7888777643
No 60
>PF05294 Toxin_5: Scorpion short toxin; InterPro: IPR007958 This family contains various secreted scorpion short toxins which seem to be unrelated to those described in IPR001947 from INTERPRO.; GO: 0009405 pathogenesis, 0005576 extracellular region; PDB: 1SIS_A 1CHL_A.
Probab=22.01 E-value=1e+02 Score=15.37 Aligned_cols=15 Identities=40% Similarity=1.156 Sum_probs=10.9
Q ss_pred CCCCCcEEeCCcEEe
Q psy3697 32 PCSGQGVCIEGVCTC 46 (117)
Q Consensus 32 ~C~~~G~C~~g~C~C 46 (117)
-|.++|.|.-.+|+|
T Consensus 18 CCgg~GkC~GpqClC 32 (32)
T PF05294_consen 18 CCGGRGKCFGPQCLC 32 (32)
T ss_dssp HCTTSEEEETTEEEE
T ss_pred HhCCCCeEcCCcccC
Confidence 366778887777775
Done!