Query psy11801
Match_columns 73
No_of_seqs 101 out of 1748
Neff 10.6
Searched_HMMs 46136
Date Fri Aug 16 19:22:24 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11801.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11801hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.3 1.1E-11 2.3E-16 83.7 6.1 72 1-72 3869-3954(4289)
2 KOG4289|consensus 98.9 2.4E-09 5.1E-14 70.8 4.4 57 13-69 1221-1283(2531)
3 KOG1225|consensus 98.9 2E-08 4.4E-13 61.2 7.1 67 4-72 256-322 (525)
4 KOG1219|consensus 98.6 3.9E-08 8.5E-13 67.6 4.2 60 1-60 3908-3977(4289)
5 KOG1225|consensus 98.6 1.4E-07 3.1E-12 57.7 6.2 58 3-62 286-343 (525)
6 PF00008 EGF: EGF-like domain 97.9 2.9E-06 6.4E-11 33.5 0.4 24 2-25 4-31 (32)
7 PF07974 EGF_2: EGF-like domai 97.8 4.1E-05 8.9E-10 30.3 3.1 25 3-27 7-32 (32)
8 KOG1226|consensus 97.8 8.2E-05 1.8E-09 47.5 5.8 64 3-68 515-588 (783)
9 KOG1226|consensus 97.8 0.00012 2.6E-09 46.7 6.0 66 3-70 556-631 (783)
10 KOG4289|consensus 97.7 7.1E-05 1.5E-09 50.9 3.8 61 1-61 1244-1317(2531)
11 PF12661 hEGF: Human growth fa 97.4 8.5E-05 1.8E-09 23.6 0.8 11 16-26 2-12 (13)
12 PF07974 EGF_2: EGF-like domai 97.4 0.00042 9.1E-09 27.3 3.0 26 34-59 6-32 (32)
13 smart00179 EGF_CA Calcium-bind 97.1 0.00096 2.1E-08 26.7 3.0 25 3-27 10-38 (39)
14 cd00054 EGF_CA Calcium-binding 97.0 0.0017 3.6E-08 25.7 2.9 25 3-27 10-37 (38)
15 cd00053 EGF Epidermal growth f 96.9 0.0021 4.5E-08 25.0 3.1 23 3-25 7-32 (36)
16 smart00051 DSL delta serrate l 96.9 0.0028 6E-08 28.8 3.4 45 14-59 17-63 (63)
17 KOG1214|consensus 96.6 0.0077 1.7E-07 39.8 5.1 56 14-69 809-874 (1289)
18 smart00181 EGF Epidermal growt 96.4 0.0069 1.5E-07 23.7 2.8 21 3-24 7-30 (35)
19 PF07645 EGF_CA: Calcium-bindi 94.2 0.022 4.7E-07 23.6 0.7 20 4-23 12-34 (42)
20 PHA02887 EGF-like protein; Pro 92.8 0.16 3.5E-06 25.9 2.5 25 36-61 94-123 (126)
21 PF12947 EGF_3: EGF domain; I 92.4 0.12 2.5E-06 20.8 1.5 22 36-57 8-32 (36)
22 KOG1217|consensus 91.3 0.85 1.8E-05 27.5 4.9 25 4-28 179-206 (487)
23 PHA03099 epidermal growth fact 91.1 0.28 6.1E-06 25.5 2.3 26 35-61 52-82 (139)
24 KOG4260|consensus 90.9 0.41 8.9E-06 28.1 3.1 45 16-60 130-182 (350)
25 PF12662 cEGF: Complement Clr- 90.7 0.18 3.9E-06 18.5 1.0 10 14-23 2-11 (24)
26 KOG3514|consensus 88.4 0.34 7.4E-06 33.5 1.7 29 1-29 628-660 (1591)
27 PF01414 DSL: Delta serrate li 87.5 0.32 7E-06 22.0 0.9 46 14-59 17-63 (63)
28 KOG1217|consensus 86.0 1.9 4E-05 26.1 3.8 49 14-62 152-208 (487)
29 PF12955 DUF3844: Domain of un 85.7 1.1 2.3E-05 22.5 2.2 19 2-20 13-39 (103)
30 KOG3516|consensus 85.0 0.91 2E-05 31.6 2.3 29 2-30 961-993 (1306)
31 PF00053 Laminin_EGF: Laminin 82.8 1.1 2.4E-05 18.9 1.4 17 45-61 17-33 (49)
32 KOG3516|consensus 80.4 1.5 3.2E-05 30.7 1.9 30 1-30 550-583 (1306)
33 smart00180 EGF_Lam Laminin-typ 79.1 2.4 5.2E-05 17.7 1.8 15 46-60 18-32 (46)
34 cd00055 EGF_Lam Laminin-type e 79.1 2.2 4.9E-05 18.1 1.8 15 46-60 19-33 (50)
35 KOG3607|consensus 73.4 3.6 7.8E-05 27.4 2.2 26 4-29 632-657 (716)
36 PF01683 EB: EB module; Inter 68.6 8.5 0.00018 16.3 2.5 8 47-54 38-45 (52)
37 PF14670 FXa_inhibition: Coagu 60.9 7.4 0.00016 15.5 1.3 11 45-55 18-28 (36)
38 PF04863 EGF_alliinase: Alliin 60.6 4.5 9.7E-05 17.9 0.6 29 34-62 17-52 (56)
39 KOG0994|consensus 59.5 14 0.0003 26.7 2.9 48 14-61 1084-1147(1758)
40 KOG3607|consensus 58.8 9.8 0.00021 25.5 2.1 28 34-61 630-657 (716)
41 PF00954 S_locus_glycop: S-loc 44.6 26 0.00056 17.3 2.0 23 34-56 84-108 (110)
42 PF09064 Tme5_EGF_like: Thromb 42.6 21 0.00047 14.2 1.2 9 14-22 18-26 (34)
43 PF09402 MSC: Man1-Src1p-C-ter 28.9 19 0.0004 21.5 0.0 21 3-23 54-74 (334)
44 PF12946 EGF_MSP1_1: MSP1 EGF 23.5 62 0.0013 13.1 1.1 20 36-55 7-30 (37)
No 1
>KOG1219|consensus
Probab=99.28 E-value=1.1e-11 Score=83.71 Aligned_cols=72 Identities=39% Similarity=0.947 Sum_probs=62.0
Q ss_pred CCCCCCCCEecC----CCeEEcCCCCcCCCCCccc--c-CCCCCCCcEEe---CCCeeeeCCCCccCCCCcC----CCCC
Q psy11801 1 MSEFRNGGSCLG----GTRCVCRAGYKGDNCGEAE--C-KEPCLNGGRCI---GPDKCACLYGYAGRRCEAD----YRTG 66 (73)
Q Consensus 1 ~~~c~~~~~c~~----~~~c~c~~~~~g~~c~~~~--c-~~~c~~~g~c~---~~~~c~c~~g~~g~~c~~~----~~~~ 66 (73)
.+||+++|+|.. +|.|.|++.|+|+.|+... | ++||..+|+|. ..+.|.|+.+|+|.+|+.+ ...+
T Consensus 3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n 3948 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKN 3948 (4289)
T ss_pred cCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccc
Confidence 378999999986 3899999999999998643 6 58999999998 3478999999999999987 3678
Q ss_pred CCCCCC
Q psy11801 67 PCYTKM 72 (73)
Q Consensus 67 ~C~~g~ 72 (73)
+|.+|+
T Consensus 3949 ~C~~gg 3954 (4289)
T KOG1219|consen 3949 VCGTGG 3954 (4289)
T ss_pred cccCCc
Confidence 998886
No 2
>KOG4289|consensus
Probab=98.90 E-value=2.4e-09 Score=70.83 Aligned_cols=57 Identities=44% Similarity=1.080 Sum_probs=48.0
Q ss_pred CCeEEcCCCCcCCCCCccc--c-CCCCCCCcEEe---CCCeeeeCCCCccCCCCcCCCCCCCC
Q psy11801 13 GTRCVCRAGYKGDNCGEAE--C-KEPCLNGGRCI---GPDKCACLYGYAGRRCEADYRTGPCY 69 (73)
Q Consensus 13 ~~~c~c~~~~~g~~c~~~~--c-~~~c~~~g~c~---~~~~c~c~~g~~g~~c~~~~~~~~C~ 69 (73)
++.|.||+||+|..|+..+ | +.||.++++|. +.|.|.|.++|+|..|+.+....-|.
T Consensus 1221 glrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCv 1283 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCV 1283 (2531)
T ss_pred ceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccc
Confidence 3789999999999998755 6 47999999997 46999999999999999888554443
No 3
>KOG1225|consensus
Probab=98.86 E-value=2e-08 Score=61.22 Aligned_cols=67 Identities=34% Similarity=0.944 Sum_probs=46.9
Q ss_pred CCCCCEecCCCeEEcCCCCcCCCCCccccCCCCCCCcEEeCCCeeeeCCCCccCCCCcCCCCCCCCCCC
Q psy11801 4 FRNGGSCLGGTRCVCRAGYKGDNCGEAECKEPCLNGGRCIGPDKCACLYGYAGRRCEADYRTGPCYTKM 72 (73)
Q Consensus 4 c~~~~~c~~~~~c~c~~~~~g~~c~~~~c~~~c~~~g~c~~~~~c~c~~g~~g~~c~~~~~~~~C~~g~ 72 (73)
|.+.+.|++. .|.|+++|+|..|....|...|+.++.+.. ..|.|.++|.|..|++...+.+|.+++
T Consensus 256 c~~~g~c~~G-~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~-g~CiC~~g~~G~dCs~~~cpadC~g~G 322 (525)
T KOG1225|consen 256 CTGRGQCVEG-RCICPPGFTGDDCDELVCPVDCSGGGVCVD-GECICNPGYSGKDCSIRRCPADCSGHG 322 (525)
T ss_pred CcccceEeCC-eEeCCCCCcCCCCCcccCCcccCCCceecC-CEeecCCCccccccccccCCccCCCCC
Confidence 5555667766 888888888888877667656766666654 377788888888777666666665543
No 4
>KOG1219|consensus
Probab=98.65 E-value=3.9e-08 Score=67.60 Aligned_cols=60 Identities=38% Similarity=0.950 Sum_probs=52.7
Q ss_pred CCCCCCCCEecC---CCeEEcCCCCcCCCCCccc---cC-CCCCCCcEEe---CCCeeeeCCCCccCCCC
Q psy11801 1 MSEFRNGGSCLG---GTRCVCRAGYKGDNCGEAE---CK-EPCLNGGRCI---GPDKCACLYGYAGRRCE 60 (73)
Q Consensus 1 ~~~c~~~~~c~~---~~~c~c~~~~~g~~c~~~~---c~-~~c~~~g~c~---~~~~c~c~~g~~g~~c~ 60 (73)
++||.++|+|.. +|.|.|+.+|+|..|+... |+ ++|..+|.|. +++.|.|.+++.|+.|.
T Consensus 3908 snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3908 SNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred CCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence 589999999986 4899999999999999873 54 8899999997 46899999999999884
No 5
>KOG1225|consensus
Probab=98.65 E-value=1.4e-07 Score=57.66 Aligned_cols=58 Identities=36% Similarity=1.042 Sum_probs=50.8
Q ss_pred CCCCCCEecCCCeEEcCCCCcCCCCCccccCCCCCCCcEEeCCCeeeeCCCCccCCCCcC
Q psy11801 3 EFRNGGSCLGGTRCVCRAGYKGDNCGEAECKEPCLNGGRCIGPDKCACLYGYAGRRCEAD 62 (73)
Q Consensus 3 ~c~~~~~c~~~~~c~c~~~~~g~~c~~~~c~~~c~~~g~c~~~~~c~c~~g~~g~~c~~~ 62 (73)
.|..++.+++. .|.|+++|+|..|+.+.|...|+.+|.|. ..+|.|.+||+|..|...
T Consensus 286 ~cs~~g~~~~g-~CiC~~g~~G~dCs~~~cpadC~g~G~Ci-~G~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 286 DCSGGGVCVDG-ECICNPGYSGKDCSIRRCPADCSGHGKCI-DGECLCDEGYTGELCIQR 343 (525)
T ss_pred ccCCCceecCC-EeecCCCccccccccccCCccCCCCCccc-CCceEeCCCCcCCccccc
Confidence 46677777777 99999999999998877888999999998 578999999999999864
No 6
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.92 E-value=2.9e-06 Score=33.53 Aligned_cols=24 Identities=42% Similarity=1.118 Sum_probs=15.7
Q ss_pred CCCCCCCEecC----CCeEEcCCCCcCC
Q psy11801 2 SEFRNGGSCLG----GTRCVCRAGYKGD 25 (73)
Q Consensus 2 ~~c~~~~~c~~----~~~c~c~~~~~g~ 25 (73)
+||.++++|+. .|.|.|+++|+|.
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 56777777754 2667777777664
No 7
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.84 E-value=4.1e-05 Score=30.25 Aligned_cols=25 Identities=40% Similarity=1.071 Sum_probs=18.3
Q ss_pred CCCCCCEecCC-CeEEcCCCCcCCCC
Q psy11801 3 EFRNGGSCLGG-TRCVCRAGYKGDNC 27 (73)
Q Consensus 3 ~c~~~~~c~~~-~~c~c~~~~~g~~c 27 (73)
.|.++|+|+.. ..|.|.++|+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 47778888753 57888888887764
No 8
>KOG1226|consensus
Probab=97.83 E-value=8.2e-05 Score=47.46 Aligned_cols=64 Identities=31% Similarity=0.802 Sum_probs=47.7
Q ss_pred CCCCCCEecCCCeEEcCCCCc----CCCCCccc--cC----CCCCCCcEEeCCCeeeeCCCCccCCCCcCCCCCCC
Q psy11801 3 EFRNGGSCLGGTRCVCRAGYK----GDNCGEAE--CK----EPCLNGGRCIGPDKCACLYGYAGRRCEADYRTGPC 68 (73)
Q Consensus 3 ~c~~~~~c~~~~~c~c~~~~~----g~~c~~~~--c~----~~c~~~g~c~~~~~c~c~~g~~g~~c~~~~~~~~C 68 (73)
+|++.|.|.-+ .|+|.+... |.+|+... |. ..|..+|.|.- ++|+|.+||.|..|+-+....-|
T Consensus 515 vCSgrG~C~CG-qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~C~~std~C 588 (783)
T KOG1226|consen 515 VCSGRGDCVCG-QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACNCPLSTDTC 588 (783)
T ss_pred CcCCCCcEeCC-ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeC-CcEEcCCCCccCCCCCCCCCccc
Confidence 68888888877 899987776 78877543 42 45888888863 58999999999998866544443
No 9
>KOG1226|consensus
Probab=97.79 E-value=0.00012 Score=46.74 Aligned_cols=66 Identities=33% Similarity=0.810 Sum_probs=48.3
Q ss_pred CCCCCCEecCCCeEEcCCCCcCCCCCccc----cC----CCCCCCcEEeCCCeeeeCCC-CccCCCCcC-CCCCCCCC
Q psy11801 3 EFRNGGSCLGGTRCVCRAGYKGDNCGEAE----CK----EPCLNGGRCIGPDKCACLYG-YAGRRCEAD-YRTGPCYT 70 (73)
Q Consensus 3 ~c~~~~~c~~~~~c~c~~~~~g~~c~~~~----c~----~~c~~~g~c~~~~~c~c~~g-~~g~~c~~~-~~~~~C~~ 70 (73)
.|.++|.|.-+ .|+|.+||+|..|.... |. ..|+.+|+|.- .+|.|... |.|..|+.. ..+++|..
T Consensus 556 lC~g~G~C~CG-~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~C-g~C~C~~~~~sG~~CE~cptc~~~C~~ 631 (783)
T KOG1226|consen 556 LCGGHGRCECG-RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCEC-GRCKCTDPPYSGEFCEKCPTCPDPCAE 631 (783)
T ss_pred ccCCCCeEeCC-cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeC-CceEcCCCCcCcchhhcCCCCCCcccc
Confidence 37778888776 99999999999997532 32 35777788763 57888866 899999853 35666644
No 10
>KOG4289|consensus
Probab=97.66 E-value=7.1e-05 Score=50.95 Aligned_cols=61 Identities=39% Similarity=1.011 Sum_probs=46.7
Q ss_pred CCCCCCCCEecC---CCeEEcCCCCcCCCCCccc----c-CCCCCCCcEEe----CCCeeeeCCC-CccCCCCc
Q psy11801 1 MSEFRNGGSCLG---GTRCVCRAGYKGDNCGEAE----C-KEPCLNGGRCI----GPDKCACLYG-YAGRRCEA 61 (73)
Q Consensus 1 ~~~c~~~~~c~~---~~~c~c~~~~~g~~c~~~~----c-~~~c~~~g~c~----~~~~c~c~~g-~~g~~c~~ 61 (73)
+.||.++++|.. +|+|.|-++|+|..|+... | ..-|.++++|. +.+.|.|+.| |.+..|+.
T Consensus 1244 s~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1244 SGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred cCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCcccCCCceEE
Confidence 468999999985 4999999999999998633 3 24588889997 3467889875 45666764
No 11
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.38 E-value=8.5e-05 Score=23.58 Aligned_cols=11 Identities=45% Similarity=1.253 Sum_probs=4.8
Q ss_pred EEcCCCCcCCC
Q psy11801 16 CVCRAGYKGDN 26 (73)
Q Consensus 16 c~c~~~~~g~~ 26 (73)
|+|++||+|..
T Consensus 2 C~C~~G~~G~~ 12 (13)
T PF12661_consen 2 CQCPPGWTGPN 12 (13)
T ss_dssp EEE-TTEETTT
T ss_pred ccCcCCCcCCC
Confidence 44555555443
No 12
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.38 E-value=0.00042 Score=27.33 Aligned_cols=26 Identities=38% Similarity=1.122 Sum_probs=21.7
Q ss_pred CCCCCCcEEeCC-CeeeeCCCCccCCC
Q psy11801 34 EPCLNGGRCIGP-DKCACLYGYAGRRC 59 (73)
Q Consensus 34 ~~c~~~g~c~~~-~~c~c~~g~~g~~c 59 (73)
..|+.+|+|... ..|.|.++|.|..|
T Consensus 6 ~~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 358889999854 79999999999876
No 13
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.14 E-value=0.00096 Score=26.75 Aligned_cols=25 Identities=48% Similarity=1.228 Sum_probs=13.8
Q ss_pred CCCCCCEecC---CCeEEcCCCCc-CCCC
Q psy11801 3 EFRNGGSCLG---GTRCVCRAGYK-GDNC 27 (73)
Q Consensus 3 ~c~~~~~c~~---~~~c~c~~~~~-g~~c 27 (73)
+|.++++|+. +|.|.|+++|. |..|
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 4555556654 25566666665 5444
No 14
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.96 E-value=0.0017 Score=25.69 Aligned_cols=25 Identities=48% Similarity=1.240 Sum_probs=13.1
Q ss_pred CCCCCCEecC---CCeEEcCCCCcCCCC
Q psy11801 3 EFRNGGSCLG---GTRCVCRAGYKGDNC 27 (73)
Q Consensus 3 ~c~~~~~c~~---~~~c~c~~~~~g~~c 27 (73)
+|.+++.|.. .|.|.|+++|.|..|
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 4544555543 255666666655444
No 15
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.94 E-value=0.0021 Score=24.99 Aligned_cols=23 Identities=52% Similarity=1.237 Sum_probs=12.5
Q ss_pred CCCCCCEecC---CCeEEcCCCCcCC
Q psy11801 3 EFRNGGSCLG---GTRCVCRAGYKGD 25 (73)
Q Consensus 3 ~c~~~~~c~~---~~~c~c~~~~~g~ 25 (73)
+|.++++|+. .+.|.|+.+|.|.
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCccc
Confidence 4555555554 2556666666554
No 16
>smart00051 DSL delta serrate ligand.
Probab=96.86 E-value=0.0028 Score=28.80 Aligned_cols=45 Identities=27% Similarity=0.718 Sum_probs=31.4
Q ss_pred CeEEcCCCCcCCCCCccccC--CCCCCCcEEeCCCeeeeCCCCccCCC
Q psy11801 14 TRCVCRAGYKGDNCGEAECK--EPCLNGGRCIGPDKCACLYGYAGRRC 59 (73)
Q Consensus 14 ~~c~c~~~~~g~~c~~~~c~--~~c~~~g~c~~~~~c~c~~g~~g~~c 59 (73)
+.-.|++.|.|..|... |. .-...+..|...+.+.|.+||.|..|
T Consensus 17 ~rv~C~~~~yG~~C~~~-C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNKF-CRPRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCCE-eCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence 35668899999998543 32 22345556665567899999999876
No 17
>KOG1214|consensus
Probab=96.62 E-value=0.0077 Score=39.77 Aligned_cols=56 Identities=36% Similarity=0.907 Sum_probs=39.9
Q ss_pred CeEEcCCCCcCCC--CC-ccccC-CCCCCCcEEe---CCCeeeeCCCCccC--CCCcC-CCCCCCC
Q psy11801 14 TRCVCRAGYKGDN--CG-EAECK-EPCLNGGRCI---GPDKCACLYGYAGR--RCEAD-YRTGPCY 69 (73)
Q Consensus 14 ~~c~c~~~~~g~~--c~-~~~c~-~~c~~~g~c~---~~~~c~c~~g~~g~--~c~~~-~~~~~C~ 69 (73)
|.|+|.|||+|+. |. ...|. +.|.....|. +++.|.|.+||.|+ +|-.+ ....+|+
T Consensus 809 y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~ 874 (1289)
T KOG1214|consen 809 YSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCE 874 (1289)
T ss_pred EEEeecCCccCCccccccccccCccccCCCceEecCCCcceeecccCccCCCceecCCCccCCccc
Confidence 8999999999864 32 23353 6688888887 46789999999986 45555 3444554
No 18
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.40 E-value=0.0069 Score=23.70 Aligned_cols=21 Identities=38% Similarity=1.045 Sum_probs=11.3
Q ss_pred CCCCCCEecC---CCeEEcCCCCcC
Q psy11801 3 EFRNGGSCLG---GTRCVCRAGYKG 24 (73)
Q Consensus 3 ~c~~~~~c~~---~~~c~c~~~~~g 24 (73)
+|.++ +|+. .|.|.|+++|.|
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCcc
Confidence 44444 5543 255666666655
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=94.21 E-value=0.022 Score=23.57 Aligned_cols=20 Identities=30% Similarity=0.944 Sum_probs=9.7
Q ss_pred CCCCCEecC---CCeEEcCCCCc
Q psy11801 4 FRNGGSCLG---GTRCVCRAGYK 23 (73)
Q Consensus 4 c~~~~~c~~---~~~c~c~~~~~ 23 (73)
|...+.|++ +|.|.|++||.
T Consensus 12 C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 12 CPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTSEEEEETTEEEEEESTTEE
T ss_pred CCCCCEEEcCCCCEEeeCCCCcE
Confidence 444445543 25555555554
No 20
>PHA02887 EGF-like protein; Provisional
Probab=92.76 E-value=0.16 Score=25.90 Aligned_cols=25 Identities=40% Similarity=1.168 Sum_probs=19.8
Q ss_pred CCCCcEEe-----CCCeeeeCCCCccCCCCc
Q psy11801 36 CLNGGRCI-----GPDKCACLYGYAGRRCEA 61 (73)
Q Consensus 36 c~~~g~c~-----~~~~c~c~~g~~g~~c~~ 61 (73)
|. +|.|. ....|.|..||.|.+|+.
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 55 46886 246899999999999974
No 21
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=92.44 E-value=0.12 Score=20.81 Aligned_cols=22 Identities=36% Similarity=0.964 Sum_probs=12.7
Q ss_pred CCCCcEEe---CCCeeeeCCCCccC
Q psy11801 36 CLNGGRCI---GPDKCACLYGYAGR 57 (73)
Q Consensus 36 c~~~g~c~---~~~~c~c~~g~~g~ 57 (73)
|+...+|. +.+.|.|.+||.|+
T Consensus 8 C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 8 CHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp S-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCcEeecCCCCEEeECCCCCccC
Confidence 44455564 34678888888765
No 22
>KOG1217|consensus
Probab=91.34 E-value=0.85 Score=27.52 Aligned_cols=25 Identities=40% Similarity=1.111 Sum_probs=14.6
Q ss_pred CCCCCEecC---CCeEEcCCCCcCCCCC
Q psy11801 4 FRNGGSCLG---GTRCVCRAGYKGDNCG 28 (73)
Q Consensus 4 c~~~~~c~~---~~~c~c~~~~~g~~c~ 28 (73)
|.+.+.|.. .|.|.|.++|.+..++
T Consensus 179 c~~~~~C~~~~~~~~C~c~~~~~~~~~~ 206 (487)
T KOG1217|consen 179 CQNGGTCVNTGGSYLCSCPPGYTGSTCE 206 (487)
T ss_pred cCCCcccccCCCCeeEeCCCCccCCcCc
Confidence 555555544 2566777777666554
No 23
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=91.13 E-value=0.28 Score=25.48 Aligned_cols=26 Identities=46% Similarity=1.065 Sum_probs=20.2
Q ss_pred CCCCCcEEe-----CCCeeeeCCCCccCCCCc
Q psy11801 35 PCLNGGRCI-----GPDKCACLYGYAGRRCEA 61 (73)
Q Consensus 35 ~c~~~g~c~-----~~~~c~c~~g~~g~~c~~ 61 (73)
-|.++ .|. ....|.|+.||.|.+|+.
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCcccccccc
Confidence 36554 786 247899999999999985
No 24
>KOG4260|consensus
Probab=90.90 E-value=0.41 Score=28.11 Aligned_cols=45 Identities=36% Similarity=0.945 Sum_probs=31.7
Q ss_pred EEcCCCCcCCCCCccc--cCCCCCCCcEEe------CCCeeeeCCCCccCCCC
Q psy11801 16 CVCRAGYKGDNCGEAE--CKEPCLNGGRCI------GPDKCACLYGYAGRRCE 60 (73)
Q Consensus 16 c~c~~~~~g~~c~~~~--c~~~c~~~g~c~------~~~~c~c~~g~~g~~c~ 60 (73)
--|+.+-.|+.|..-. ...+|...|.|+ +++.|.|.+||.|..|.
T Consensus 130 vCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~ 182 (350)
T KOG4260|consen 130 VCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR 182 (350)
T ss_pred eccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc
Confidence 3467777788875321 235677777776 35789999999998875
No 25
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=90.68 E-value=0.18 Score=18.49 Aligned_cols=10 Identities=40% Similarity=1.172 Sum_probs=5.5
Q ss_pred CeEEcCCCCc
Q psy11801 14 TRCVCRAGYK 23 (73)
Q Consensus 14 ~~c~c~~~~~ 23 (73)
|.|.|++||.
T Consensus 2 y~C~C~~Gy~ 11 (24)
T PF12662_consen 2 YTCSCPPGYQ 11 (24)
T ss_pred EEeeCCCCCc
Confidence 4555555554
No 26
>KOG3514|consensus
Probab=88.41 E-value=0.34 Score=33.45 Aligned_cols=29 Identities=34% Similarity=0.904 Sum_probs=24.4
Q ss_pred CCCCCCCCEecCC---CeEEcC-CCCcCCCCCc
Q psy11801 1 MSEFRNGGSCLGG---TRCVCR-AGYKGDNCGE 29 (73)
Q Consensus 1 ~~~c~~~~~c~~~---~~c~c~-~~~~g~~c~~ 29 (73)
++||+|.|.|.++ |.|.|. .+|.|+.|+.
T Consensus 628 ~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 628 SNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred CCcccCCCCccccccccccccccCcccCccccc
Confidence 5799999999874 888886 6899999974
No 27
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=87.50 E-value=0.32 Score=22.04 Aligned_cols=46 Identities=28% Similarity=0.595 Sum_probs=21.0
Q ss_pred CeEEcCCCCcCCCCCccccCC-CCCCCcEEeCCCeeeeCCCCccCCC
Q psy11801 14 TRCVCRAGYKGDNCGEAECKE-PCLNGGRCIGPDKCACLYGYAGRRC 59 (73)
Q Consensus 14 ~~c~c~~~~~g~~c~~~~c~~-~c~~~g~c~~~~~c~c~~g~~g~~c 59 (73)
+.-.|...|.|..|....-.. --..+-+|...+.-+|.+||.|..|
T Consensus 17 ~rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSKFCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-EE---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccCCcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 456788889999886432110 0122344554456779999999876
No 28
>KOG1217|consensus
Probab=86.00 E-value=1.9 Score=26.10 Aligned_cols=49 Identities=47% Similarity=1.212 Sum_probs=37.2
Q ss_pred CeEEcCCCCcCCCCCcc--ccC---CCCCCCcEEe---CCCeeeeCCCCccCCCCcC
Q psy11801 14 TRCVCRAGYKGDNCGEA--ECK---EPCLNGGRCI---GPDKCACLYGYAGRRCEAD 62 (73)
Q Consensus 14 ~~c~c~~~~~g~~c~~~--~c~---~~c~~~g~c~---~~~~c~c~~g~~g~~c~~~ 62 (73)
+.|.|..+|.+..+... .|. .+|.+++.|. ..+.|.|+++|.+..++..
T Consensus 152 ~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~ 208 (487)
T KOG1217|consen 152 FRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT 208 (487)
T ss_pred eeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC
Confidence 67999999999888754 343 3477777886 2467999999999887654
No 29
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=85.74 E-value=1.1 Score=22.48 Aligned_cols=19 Identities=26% Similarity=0.723 Sum_probs=12.7
Q ss_pred CCCCCCCEecCC--------CeEEcCC
Q psy11801 2 SEFRNGGSCLGG--------TRCVCRA 20 (73)
Q Consensus 2 ~~c~~~~~c~~~--------~~c~c~~ 20 (73)
+.|.++|.|+.. |.|.|.+
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~ 39 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKP 39 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeec
Confidence 458888888752 5566655
No 30
>KOG3516|consensus
Probab=85.01 E-value=0.91 Score=31.62 Aligned_cols=29 Identities=34% Similarity=0.846 Sum_probs=22.9
Q ss_pred CCCCCCCEecC---CCeEEcC-CCCcCCCCCcc
Q psy11801 2 SEFRNGGSCLG---GTRCVCR-AGYKGDNCGEA 30 (73)
Q Consensus 2 ~~c~~~~~c~~---~~~c~c~-~~~~g~~c~~~ 30 (73)
.+|.|+|.|++ +|+|.|. ..|.|+.|...
T Consensus 961 ~~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~e 993 (1306)
T KOG3516|consen 961 YPCLNGGHCVERYDGYTCDCSRTAYDGPFCSKE 993 (1306)
T ss_pred ccccCCCEEEEecCceeeccccCcCCCCccccc
Confidence 47899999986 4788886 77889988753
No 31
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=82.78 E-value=1.1 Score=18.88 Aligned_cols=17 Identities=35% Similarity=0.966 Sum_probs=11.3
Q ss_pred CCeeeeCCCCccCCCCc
Q psy11801 45 PDKCACLYGYAGRRCEA 61 (73)
Q Consensus 45 ~~~c~c~~g~~g~~c~~ 61 (73)
.+.|.|.+++.|..|+.
T Consensus 17 ~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 17 TGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp CEEESBSTTEESTTS-E
T ss_pred CCEEeccccccCCcCcC
Confidence 35677777788777763
No 32
>KOG3516|consensus
Probab=80.38 E-value=1.5 Score=30.71 Aligned_cols=30 Identities=33% Similarity=0.848 Sum_probs=24.6
Q ss_pred CCCCCCCCEecCC---CeEEcC-CCCcCCCCCcc
Q psy11801 1 MSEFRNGGSCLGG---TRCVCR-AGYKGDNCGEA 30 (73)
Q Consensus 1 ~~~c~~~~~c~~~---~~c~c~-~~~~g~~c~~~ 30 (73)
|++|+++|.|..+ |.|.|. .||.|.+|...
T Consensus 550 PN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 550 PNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred CccccCCCcccccccceeEeccccccccccccCC
Confidence 5789999999753 889998 89999998643
No 33
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=79.15 E-value=2.4 Score=17.74 Aligned_cols=15 Identities=40% Similarity=1.099 Sum_probs=11.2
Q ss_pred CeeeeCCCCccCCCC
Q psy11801 46 DKCACLYGYAGRRCE 60 (73)
Q Consensus 46 ~~c~c~~g~~g~~c~ 60 (73)
+.|.|.+++.|..|+
T Consensus 18 G~C~C~~~~~G~~C~ 32 (46)
T smart00180 18 GQCECKPNVTGRRCD 32 (46)
T ss_pred CEEECCCCCCCCCCC
Confidence 567788888887775
No 34
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=79.15 E-value=2.2 Score=18.08 Aligned_cols=15 Identities=40% Similarity=1.081 Sum_probs=11.9
Q ss_pred CeeeeCCCCccCCCC
Q psy11801 46 DKCACLYGYAGRRCE 60 (73)
Q Consensus 46 ~~c~c~~g~~g~~c~ 60 (73)
..|.|.+++.|..|+
T Consensus 19 G~C~C~~~~~G~~C~ 33 (50)
T cd00055 19 GQCECKPNTTGRRCD 33 (50)
T ss_pred CEEeCCCcCCCCCCC
Confidence 568888888888876
No 35
>KOG3607|consensus
Probab=73.36 E-value=3.6 Score=27.41 Aligned_cols=26 Identities=23% Similarity=0.531 Sum_probs=20.0
Q ss_pred CCCCCEecCCCeEEcCCCCcCCCCCc
Q psy11801 4 FRNGGSCLGGTRCVCRAGYKGDNCGE 29 (73)
Q Consensus 4 c~~~~~c~~~~~c~c~~~~~g~~c~~ 29 (73)
|.++|+|...+.|.|.++|.++.|..
T Consensus 632 C~g~GVCnn~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 632 CNGHGVCNNELNCHCEPGWAPPFCFI 657 (716)
T ss_pred cCCCcccCCCcceeeCCCCCCCcccc
Confidence 66777787767888888888887764
No 36
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=68.59 E-value=8.5 Score=16.26 Aligned_cols=8 Identities=50% Similarity=1.435 Sum_probs=4.1
Q ss_pred eeeeCCCC
Q psy11801 47 KCACLYGY 54 (73)
Q Consensus 47 ~c~c~~g~ 54 (73)
.|.|+.+|
T Consensus 38 ~C~C~~g~ 45 (52)
T PF01683_consen 38 RCQCPPGY 45 (52)
T ss_pred EeECCCCC
Confidence 45555554
No 37
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=60.92 E-value=7.4 Score=15.51 Aligned_cols=11 Identities=36% Similarity=0.999 Sum_probs=7.6
Q ss_pred CCeeeeCCCCc
Q psy11801 45 PDKCACLYGYA 55 (73)
Q Consensus 45 ~~~c~c~~g~~ 55 (73)
.++|.|++||.
T Consensus 18 ~~~C~C~~Gy~ 28 (36)
T PF14670_consen 18 SYRCSCPPGYK 28 (36)
T ss_dssp SEEEE-STTEE
T ss_pred ceEeECCCCCE
Confidence 46788888876
No 38
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=60.63 E-value=4.5 Score=17.94 Aligned_cols=29 Identities=31% Similarity=0.708 Sum_probs=13.2
Q ss_pred CCCCCCcEEe-------CCCeeeeCCCCccCCCCcC
Q psy11801 34 EPCLNGGRCI-------GPDKCACLYGYAGRRCEAD 62 (73)
Q Consensus 34 ~~c~~~g~c~-------~~~~c~c~~g~~g~~c~~~ 62 (73)
..|+.+|+.. +...|.|..-|.|..|+..
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~ 52 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTL 52 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcccC
Confidence 3455555554 2245778888888877653
No 39
>KOG0994|consensus
Probab=59.46 E-value=14 Score=26.67 Aligned_cols=48 Identities=29% Similarity=0.840 Sum_probs=30.5
Q ss_pred CeEEcCCCCcCCCCCc---cccCC--------CCCCCcE----Ee-CCCeeeeCCCCccCCCCc
Q psy11801 14 TRCVCRAGYKGDNCGE---AECKE--------PCLNGGR----CI-GPDKCACLYGYAGRRCEA 61 (73)
Q Consensus 14 ~~c~c~~~~~g~~c~~---~~c~~--------~c~~~g~----c~-~~~~c~c~~g~~g~~c~~ 61 (73)
..|+|-+||.|.+|.. ..+-. -|...|+ |+ ..+.|+|.+|..|..|..
T Consensus 1084 GQCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCdq 1147 (1758)
T KOG0994|consen 1084 GQCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCDQ 1147 (1758)
T ss_pred cceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchhh
Confidence 5799999999998853 11111 1222222 32 246899999999988753
No 40
>KOG3607|consensus
Probab=58.81 E-value=9.8 Score=25.51 Aligned_cols=28 Identities=29% Similarity=0.572 Sum_probs=23.5
Q ss_pred CCCCCCcEEeCCCeeeeCCCCccCCCCc
Q psy11801 34 EPCLNGGRCIGPDKCACLYGYAGRRCEA 61 (73)
Q Consensus 34 ~~c~~~g~c~~~~~c~c~~g~~g~~c~~ 61 (73)
..|+.+|+|.....|.|.++|.+..|+.
T Consensus 630 ~~C~g~GVCnn~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 630 TTCNGHGVCNNELNCHCEPGWAPPFCFI 657 (716)
T ss_pred cccCCCcccCCCcceeeCCCCCCCcccc
Confidence 3488888887667899999999999975
No 41
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=44.58 E-value=26 Score=17.27 Aligned_cols=23 Identities=35% Similarity=0.772 Sum_probs=16.5
Q ss_pred CCCCCCcEEeC--CCeeeeCCCCcc
Q psy11801 34 EPCLNGGRCIG--PDKCACLYGYAG 56 (73)
Q Consensus 34 ~~c~~~g~c~~--~~~c~c~~g~~g 56 (73)
..|..-+.|.. ...|.|++||..
T Consensus 84 ~~CG~~g~C~~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 84 GFCGPNGICNSNNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCccEeCCCCCCceECCCCcCC
Confidence 45777778862 356999999864
No 42
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=42.56 E-value=21 Score=14.16 Aligned_cols=9 Identities=44% Similarity=1.368 Sum_probs=4.9
Q ss_pred CeEEcCCCC
Q psy11801 14 TRCVCRAGY 22 (73)
Q Consensus 14 ~~c~c~~~~ 22 (73)
..|.|+.||
T Consensus 18 ~~C~CPeGy 26 (34)
T PF09064_consen 18 GQCFCPEGY 26 (34)
T ss_pred CceeCCCce
Confidence 355555555
No 43
>PF09402 MSC: Man1-Src1p-C-terminal domain; InterPro: IPR018996 This entry represents the Inner nuclear membrane proteins MAN1 (also known as LEM domain-containing protein 3) and LEM domain-containing protein 2 (or LEM protein 2). Emerin and MAN1 are LEM domain-containing integral membrane proteins of the vertebrate nuclear envelope []. MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C-terminal nulceoplasmic region forms a DNA binding winged helix and binds to Smad []. LEM protein 2 is an essential protein involved in chromosome segregation and cell division, probably via its interaction with lmn-1, the main component of nuclear lamina. Has some overlapping function with emr-1.; GO: 0005639 integral to nuclear inner membrane; PDB: 2CH0_A.
Probab=28.88 E-value=19 Score=21.47 Aligned_cols=21 Identities=24% Similarity=0.562 Sum_probs=0.0
Q ss_pred CCCCCCEecCCCeEEcCCCCc
Q psy11801 3 EFRNGGSCLGGTRCVCRAGYK 23 (73)
Q Consensus 3 ~c~~~~~c~~~~~c~c~~~~~ 23 (73)
||+-++.|..+....|.++|.
T Consensus 54 pCP~~a~C~~~~~~~C~~~y~ 74 (334)
T PF09402_consen 54 PCPEHAICYPGLKLECEPGYV 74 (334)
T ss_dssp ---------------------
T ss_pred ccccccccccccccccccccc
Confidence 688889998766777878875
No 44
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=23.50 E-value=62 Score=13.12 Aligned_cols=20 Identities=40% Similarity=1.208 Sum_probs=10.8
Q ss_pred CCCCcEEe----CCCeeeeCCCCc
Q psy11801 36 CLNGGRCI----GPDKCACLYGYA 55 (73)
Q Consensus 36 c~~~g~c~----~~~~c~c~~g~~ 55 (73)
|.....|. +...|.|..||.
T Consensus 7 cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 7 CPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp --TTEEEEEETTSEEEEEE-TTEE
T ss_pred CCCCcccEEcCCCCEEEEeeCCcc
Confidence 34445565 235788888886
Done!