Query psy18122
Match_columns 188
No_of_seqs 219 out of 1499
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 23:04:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18122.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18122hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0994|consensus 99.3 1E-11 2.2E-16 113.4 8.3 121 17-144 776-904 (1758)
2 KOG1219|consensus 99.2 2.4E-11 5.3E-16 116.3 7.6 106 5-132 3865-3982(4289)
3 KOG1225|consensus 99.2 1.3E-10 2.8E-15 101.0 8.9 93 8-125 252-365 (525)
4 KOG1226|consensus 98.8 1.1E-08 2.4E-13 91.0 8.4 31 6-36 463-493 (783)
5 KOG0994|consensus 98.8 1.4E-08 3E-13 93.5 8.4 107 21-132 428-549 (1758)
6 KOG4289|consensus 98.8 6E-09 1.3E-13 97.4 6.0 83 21-129 1222-1318(2531)
7 KOG1225|consensus 98.7 3.2E-08 6.9E-13 86.3 7.0 64 9-80 284-365 (525)
8 KOG4289|consensus 98.6 9.5E-08 2.1E-12 89.7 7.8 107 7-128 1719-1838(2531)
9 KOG1226|consensus 98.4 1.1E-06 2.4E-11 78.7 9.3 100 9-128 513-621 (783)
10 KOG1836|consensus 98.4 7.2E-07 1.6E-11 86.9 8.4 125 11-140 732-881 (1705)
11 KOG1836|consensus 98.3 1.1E-06 2.4E-11 85.6 6.9 114 21-141 695-825 (1705)
12 KOG1219|consensus 98.2 2.1E-06 4.5E-11 83.9 5.0 69 4-83 3903-3978(4289)
13 smart00051 DSL delta serrate l 98.0 3.3E-06 7.2E-11 53.5 2.5 51 30-81 13-63 (63)
14 KOG3512|consensus 98.0 3.4E-05 7.3E-10 65.9 7.9 107 21-134 295-434 (592)
15 PF00008 EGF: EGF-like domain 97.8 1.2E-05 2.5E-10 44.0 1.9 26 8-33 2-32 (32)
16 PF07974 EGF_2: EGF-like domai 97.7 5.7E-05 1.2E-09 41.2 3.1 25 10-34 6-32 (32)
17 cd00055 EGF_Lam Laminin-type e 97.5 0.00018 3.9E-09 43.3 3.7 28 18-46 14-43 (50)
18 PF00008 EGF: EGF-like domain 97.3 7.4E-05 1.6E-09 40.7 0.7 26 55-80 3-32 (32)
19 PF00053 Laminin_EGF: Laminin 97.3 0.0001 2.2E-09 44.1 1.1 30 17-47 12-43 (49)
20 smart00051 DSL delta serrate l 97.3 0.00013 2.8E-09 46.2 1.5 45 78-126 14-63 (63)
21 PF07974 EGF_2: EGF-like domai 97.2 0.00047 1E-08 37.5 2.8 25 57-81 7-32 (32)
22 PF12661 hEGF: Human growth fa 97.1 0.00021 4.6E-09 30.8 0.7 13 22-34 1-13 (13)
23 smart00179 EGF_CA Calcium-bind 97.1 0.0011 2.3E-08 37.1 3.9 26 10-35 9-39 (39)
24 cd00055 EGF_Lam Laminin-type e 97.0 0.00092 2E-08 40.2 3.3 30 63-93 13-43 (50)
25 smart00180 EGF_Lam Laminin-typ 97.0 0.00094 2E-08 39.5 3.1 27 17-44 12-40 (46)
26 PF00053 Laminin_EGF: Laminin 96.9 0.00039 8.4E-09 41.6 0.9 32 62-94 11-43 (49)
27 KOG1218|consensus 96.8 0.0079 1.7E-07 49.4 8.6 63 62-131 117-180 (316)
28 KOG4260|consensus 96.8 0.0014 3E-08 52.7 3.4 53 25-91 132-190 (350)
29 cd00054 EGF_CA Calcium-binding 96.7 0.0028 6.2E-08 34.8 3.8 26 10-35 9-38 (38)
30 smart00180 EGF_Lam Laminin-typ 96.7 0.0016 3.4E-08 38.5 2.8 28 63-91 12-40 (46)
31 KOG1217|consensus 96.6 0.021 4.4E-07 49.0 9.8 114 8-129 175-355 (487)
32 PF01414 DSL: Delta serrate li 96.4 0.00059 1.3E-08 43.1 -0.5 43 38-81 21-63 (63)
33 cd00053 EGF Epidermal growth f 96.3 0.0075 1.6E-07 32.5 3.6 26 9-34 5-35 (36)
34 KOG4260|consensus 96.2 0.0039 8.5E-08 50.2 2.6 33 10-43 150-189 (350)
35 smart00181 EGF Epidermal growt 96.0 0.013 2.8E-07 31.9 3.6 25 10-35 6-35 (35)
36 smart00179 EGF_CA Calcium-bind 95.9 0.01 2.2E-07 32.9 3.1 27 56-82 9-39 (39)
37 KOG1217|consensus 95.7 0.087 1.9E-06 45.1 9.3 102 21-125 110-264 (487)
38 KOG3512|consensus 95.6 0.019 4.2E-07 49.5 4.7 69 21-92 359-437 (592)
39 cd00054 EGF_CA Calcium-binding 95.4 0.021 4.6E-07 31.2 3.0 27 56-82 9-38 (38)
40 PF01414 DSL: Delta serrate li 95.4 0.007 1.5E-07 38.2 1.0 39 84-126 20-63 (63)
41 KOG1218|consensus 95.3 0.13 2.9E-06 42.0 8.6 26 13-38 154-179 (316)
42 cd00053 EGF Epidermal growth f 95.2 0.027 5.8E-07 30.2 3.0 26 56-81 6-35 (36)
43 KOG1214|consensus 94.8 0.062 1.4E-06 49.6 5.5 22 15-36 705-733 (1289)
44 smart00181 EGF Epidermal growt 94.8 0.043 9.4E-07 29.7 3.0 25 56-81 6-34 (35)
45 KOG0196|consensus 94.5 0.24 5.2E-06 45.9 8.5 92 58-153 248-351 (996)
46 KOG1214|consensus 94.0 0.12 2.6E-06 47.9 5.5 47 21-80 809-860 (1289)
47 PF07645 EGF_CA: Calcium-bindi 91.2 0.18 4E-06 28.8 2.1 22 9-30 9-34 (42)
48 KOG1388|consensus 91.0 0.44 9.6E-06 37.2 4.5 71 58-132 54-131 (217)
49 PF07645 EGF_CA: Calcium-bindi 89.7 0.21 4.5E-06 28.6 1.4 23 55-77 9-34 (42)
50 PF12947 EGF_3: EGF domain; I 88.7 0.13 2.7E-06 28.7 -0.0 24 56-79 6-32 (36)
51 PF12947 EGF_3: EGF domain; I 88.5 0.24 5.2E-06 27.5 1.0 23 10-32 6-32 (36)
52 KOG3514|consensus 87.7 0.35 7.7E-06 46.0 2.1 36 2-37 621-661 (1591)
53 PHA03099 epidermal growth fact 84.7 0.77 1.7E-05 33.0 2.1 25 12-37 53-83 (139)
54 PF01683 EB: EB module; Inter 84.6 3.8 8.2E-05 24.2 5.0 20 57-77 27-46 (52)
55 PF01683 EB: EB module; Inter 80.0 2.7 5.8E-05 24.9 3.0 25 6-30 22-46 (52)
56 PF12955 DUF3844: Domain of un 78.6 1.8 3.8E-05 30.1 2.1 25 8-32 11-44 (103)
57 KOG0196|consensus 78.4 3.4 7.4E-05 38.7 4.5 52 21-77 259-317 (996)
58 PHA02887 EGF-like protein; Pro 77.9 2 4.3E-05 30.4 2.2 26 57-83 93-123 (126)
59 PF12662 cEGF: Complement Clr- 75.0 3 6.4E-05 21.0 1.8 11 68-78 2-12 (24)
60 KOG3516|consensus 73.7 2.9 6.2E-05 40.5 2.7 33 5-37 546-583 (1306)
61 KOG3607|consensus 72.5 2.8 6.1E-05 38.8 2.4 29 9-37 629-658 (716)
62 KOG3516|consensus 66.4 4.4 9.5E-05 39.3 2.3 38 46-83 541-582 (1306)
63 PHA03099 epidermal growth fact 60.0 15 0.00033 26.5 3.6 17 67-83 66-82 (139)
64 PHA02887 EGF-like protein; Pro 59.7 12 0.00027 26.5 3.0 8 12-20 94-101 (126)
65 KOG3607|consensus 58.2 14 0.00029 34.5 3.9 28 56-83 630-657 (716)
66 KOG3514|consensus 54.1 8.6 0.00019 37.2 2.0 32 52-83 625-660 (1591)
67 PF04863 EGF_alliinase: Alliin 51.4 6.5 0.00014 23.9 0.5 28 9-36 16-51 (56)
68 KOG1388|consensus 47.7 19 0.00041 28.3 2.7 69 12-83 54-127 (217)
69 PF00954 S_locus_glycop: S-loc 47.4 21 0.00046 24.5 2.7 23 55-77 83-107 (110)
70 PF14670 FXa_inhibition: Coagu 42.5 8.4 0.00018 21.2 0.0 13 67-79 18-30 (36)
71 KOG3509|consensus 37.1 44 0.00096 32.2 3.8 67 21-92 718-792 (964)
72 PF09064 Tme5_EGF_like: Thromb 35.2 20 0.00043 19.6 0.8 10 21-30 18-27 (34)
73 KOG3509|consensus 27.3 81 0.0018 30.5 3.8 23 67-90 717-739 (964)
74 PF11403 Yeast_MT: Yeast metal 24.3 39 0.00085 18.3 0.7 18 56-75 12-29 (40)
75 cd01475 vWA_Matrilin VWA_Matri 23.1 92 0.002 24.1 3.0 17 63-79 200-219 (224)
76 PF01826 TIL: Trypsin Inhibito 21.6 1E+02 0.0022 18.1 2.4 9 69-77 34-42 (55)
No 1
>KOG0994|consensus
Probab=99.28 E-value=1e-11 Score=113.38 Aligned_cols=121 Identities=26% Similarity=0.556 Sum_probs=97.0
Q ss_pred eEec--ceEeCCCCCcCCCCCCCCCCCCCCCCCC--CCccCCCCCCCCCcEecC-CCceecCCCCCCCCCCCCCCCCccC
Q psy18122 17 VEMD--SMCLCTPGFAGKKCEIPCDRNNFGHKCS--MLCSKSPNQCKGMVLCTS-NFGCSCATGYQGEQCLEHCPEDYYG 91 (188)
Q Consensus 17 ~C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~C~--~~C~c~~~~C~~~g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G 91 (188)
.|.+ |+|+|+|+..|+.|+. |.+|+||..-+ ..|.|.... .-+..|.. ++.|.|.+|..|..|. .|.+||||
T Consensus 776 vCn~~GGqCqCkPnVVGR~Cdq-CApGtyGFGPsGCk~CdC~~~G-s~~~~Cd~~tGQC~C~~g~ygrqCn-qCqpG~Wg 852 (1758)
T KOG0994|consen 776 VCNPNGGQCQCKPNVVGRRCDQ-CAPGTYGFGPSGCKACDCNSIG-SLDKYCDKITGQCQCRPGTYGRQCN-QCQPGYWG 852 (1758)
T ss_pred cccCCCceecccCccccccccc-cCCcccCcCCccCccccccccc-cccccccccccceeeccccchhhcc-ccCCCccC
Confidence 4444 9999999999999998 99999998866 567665321 12345766 7999999999999998 79999999
Q ss_pred CCccCCCcccc---cceeccccccccccccCCccCCCCCCCCCCCCCCCeeecCcc
Q psy18122 92 ADCKQKCGECI---DGCDQYTGVCRGGCTTPYLIRPYCKQPHSYLEESPRVLDSSY 144 (188)
Q Consensus 92 ~~C~~~C~~C~---~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~~~~~~~~~~~ 144 (188)
..--.+|. |+ ..|++.+|.|+. |..-. +|.+|++ |..++++.+++.+.+
T Consensus 853 FPeCr~Cq-CNgHA~~Cd~~tGaCi~-CqD~T-~G~~Cdr-Cl~GyyGdP~lg~g~ 904 (1758)
T KOG0994|consen 853 FPECRPCQ-CNGHADTCDPITGACID-CQDST-TGHSCDR-CLDGYYGDPRLGSGI 904 (1758)
T ss_pred CCcCcccc-ccCcccccCcccccccc-ccccc-cccchhh-hhccccCCcccCCCC
Confidence 64334677 88 899999999986 88877 8999987 888888887776554
No 2
>KOG1219|consensus
Probab=99.22 E-value=2.4e-11 Score=116.33 Aligned_cols=106 Identities=26% Similarity=0.652 Sum_probs=84.5
Q ss_pred CCCCCCCCCCCCeEec-----ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCC
Q psy18122 5 TNENSIECLNGGVEMD-----SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGY 76 (188)
Q Consensus 5 ~~~~~~~C~~~g~C~~-----~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~ 76 (188)
+.++.+||+|+|+|.. |.|.|++.|.|..||..-. .|.++||..||+|.. .+.|.|+.||
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~e------------pC~snPC~~GgtCip~~n~f~CnC~~gy 3932 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLE------------PCASNPCLTGGTCIPFYNGFLCNCPNGY 3932 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccc------------cccCCCCCCCCEEEecCCCeeEeCCCCc
Confidence 5678899999999987 8999999999999997422 456889999999987 5669999999
Q ss_pred CCCCCCCCCCCCccCCCccCCCcccc--cceecccc--ccccccccCCccCCCCCCCCCC
Q psy18122 77 QGEQCLEHCPEDYYGADCKQKCGECI--DGCDQYTG--VCRGGCTTPYLIRPYCKQPHSY 132 (188)
Q Consensus 77 ~G~~C~~~C~~g~~G~~C~~~C~~C~--~~C~~~~g--~C~~~C~~g~~~G~~C~~~c~~ 132 (188)
+|.+|+.. | =..|+.. .|. +.|.+..| .|. |.+|| .|..|...-|.
T Consensus 3933 TG~~Ce~~---G--i~eCs~n--~C~~gg~C~n~~gsf~Cn--cT~g~-~gr~c~~~~pn 3982 (4289)
T KOG1219|consen 3933 TGKRCEAR---G--ISECSKN--VCGTGGQCINIPGSFHCN--CTPGI-LGRTCCAEKPN 3982 (4289)
T ss_pred cCceeecc---c--ccccccc--cccCCceeeccCCceEec--cChhH-hcccCccccCc
Confidence 99999853 1 1223321 255 88888888 899 99999 99998764444
No 3
>KOG1225|consensus
Probab=99.16 E-value=1.3e-10 Score=101.00 Aligned_cols=93 Identities=33% Similarity=0.919 Sum_probs=65.4
Q ss_pred CCCCCCCCCeEecceEeCCCCCcCCCCCCC------------------CCCCCCCCCCCC-CccCCCCCCCCCcEecCCC
Q psy18122 8 NSIECLNGGVEMDSMCLCTPGFAGKKCEIP------------------CDRNNFGHKCSM-LCSKSPNQCKGMVLCTSNF 68 (188)
Q Consensus 8 ~~~~C~~~g~C~~~~C~C~~G~~G~~C~~~------------------C~~g~~G~~C~~-~C~c~~~~C~~~g~C~~~~ 68 (188)
.+.-|.++|.|.++.|+|++||+|.+|++. |.++|.|.+|+. +| +..|.++|.|++ +
T Consensus 252 C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~c---padC~g~G~Ci~-G 327 (525)
T KOG1225|consen 252 CPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRC---PADCSGHGKCID-G 327 (525)
T ss_pred CCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccC---CccCCCCCcccC-C
Confidence 344577778999999999999999999873 666777777763 33 245777777774 7
Q ss_pred ceecCCCCCCCCCCCCCCCCccCCCccCCCcccc--cceeccccccccccccCCccCCC
Q psy18122 69 GCSCATGYQGEQCLEHCPEDYYGADCKQKCGECI--DGCDQYTGVCRGGCTTPYLIRPY 125 (188)
Q Consensus 69 ~C~C~~g~~G~~C~~~C~~g~~G~~C~~~C~~C~--~~C~~~~g~C~~~C~~g~~~G~~ 125 (188)
+|.|.+||+|..|.+. . |. +.| +.| |+ |..|| .|.+
T Consensus 328 ~C~C~~Gy~G~~C~~~--------------~-C~~~g~c--v~g-C~--C~~Gw-~G~d 365 (525)
T KOG1225|consen 328 ECLCDEGYTGELCIQR--------------A-CSGGGQC--VNG-CK--CKKGW-RGPD 365 (525)
T ss_pred ceEeCCCCcCCccccc--------------c-cCCCcee--ccC-ce--eccCc-cCCC
Confidence 7777777777776642 1 32 333 256 88 99999 8887
No 4
>KOG1226|consensus
Probab=98.84 E-value=1.1e-08 Score=91.05 Aligned_cols=31 Identities=39% Similarity=0.753 Sum_probs=26.6
Q ss_pred CCCCCCCCCCCeEecceEeCCCCCcCCCCCC
Q psy18122 6 NENSIECLNGGVEMDSMCLCTPGFAGKKCEI 36 (188)
Q Consensus 6 ~~~~~~C~~~g~C~~~~C~C~~G~~G~~C~~ 36 (188)
+.++.-|+.+|+...|.|.|.+||.|..|+-
T Consensus 463 e~~s~~C~g~G~~~CG~C~C~~G~~G~~CEC 493 (783)
T KOG1226|consen 463 EPNSALCHGNGTFVCGQCRCDEGWLGKKCEC 493 (783)
T ss_pred CCCccccCCCCcEEecceecCCCCCCCcccC
Confidence 3456678889999999999999999999983
No 5
>KOG0994|consensus
Probab=98.82 E-value=1.4e-08 Score=93.46 Aligned_cols=107 Identities=23% Similarity=0.583 Sum_probs=85.0
Q ss_pred ceEeCCCCCcCCCCCCCCCCCCCCCCCCC-----CccCCCCCCCCCcEecC-CCceecCCCCCCCCCCCCCCCCccCCCc
Q psy18122 21 SMCLCTPGFAGKKCEIPCDRNNFGHKCSM-----LCSKSPNQCKGMVLCTS-NFGCSCATGYQGEQCLEHCPEDYYGADC 94 (188)
Q Consensus 21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~~-----~C~c~~~~C~~~g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~C 94 (188)
|+|.|+++..|.+|+. |.+||||..-.+ .|.|.+-.-.+++.|+. ++.|.|+.-.+|..|. .|.|.+||..-
T Consensus 428 GqC~CK~~V~G~RCd~-Ck~Gywgl~~~dp~GC~~C~CN~lGT~~~s~CD~~TG~C~ckrlvTg~~cd-qclPeh~gLs~ 505 (1758)
T KOG0994|consen 428 GQCRCKEHVAGRRCDR-CKDGYWGLTSADPYGCRPCDCNPLGTRNGSGCDPETGDCYCKRLVTGIDCD-QCLPEHWGLSN 505 (1758)
T ss_pred cccccccCcCccccch-hccCcccCccCCCCCccccccccccccCCCCCCCCCCceEeeccccCCCcc-ccCccccccCC
Confidence 8999999999999998 999999988553 45554222234444877 7889999999999998 79999999654
Q ss_pred cC-CCcc--cc------cceeccccccccccccCCccCCCCCCCCCC
Q psy18122 95 KQ-KCGE--CI------DGCDQYTGVCRGGCTTPYLIRPYCKQPHSY 132 (188)
Q Consensus 95 ~~-~C~~--C~------~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~ 132 (188)
.. .|.+ |. .+|...+|+|. |.+++ .|.+|++.++.
T Consensus 506 ~~~gc~~cdcd~GGs~d~sc~~~sGqC~--CRe~~-~GR~c~~~~~~ 549 (1758)
T KOG0994|consen 506 DLEGCRPCDCDQGGSYDNSCDLHSGQCE--CREHM-LGRRCEQVCPG 549 (1758)
T ss_pred CCCCCcccccCCCCCCCcccccccCccc--ccccc-ccccccccCCc
Confidence 42 2443 44 57888999999 99999 99999998765
No 6
>KOG4289|consensus
Probab=98.82 E-value=6e-09 Score=97.41 Aligned_cols=83 Identities=31% Similarity=0.803 Sum_probs=62.7
Q ss_pred ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCCCCCCCCC-----CCCCCccCC
Q psy18122 21 SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGYQGEQCLE-----HCPEDYYGA 92 (188)
Q Consensus 21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~~G~~C~~-----~C~~g~~G~ 92 (188)
..|+|||||+|..|++.-+ .|-.+||.++|.|.. .|+|.|.+||+|.+|+. .|.+|+
T Consensus 1222 lrCrCPpGFTgd~CeTeiD------------lCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGv--- 1286 (2531)
T KOG4289|consen 1222 LRCRCPPGFTGDYCETEID------------LCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGV--- 1286 (2531)
T ss_pred eeEeCCCCCCcccccchhH------------hhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccce---
Confidence 5799999999999987532 344789999999986 68899999999999985 344442
Q ss_pred CccCCCcccc--cceeccc--c-ccccccccC-CccCCCCCCC
Q psy18122 93 DCKQKCGECI--DGCDQYT--G-VCRGGCTTP-YLIRPYCKQP 129 (188)
Q Consensus 93 ~C~~~C~~C~--~~C~~~~--g-~C~~~C~~g-~~~G~~C~~~ 129 (188)
|. ++|.+.. | .|. |+.| | .++.|+..
T Consensus 1287 --------C~nggtC~~~~nggf~c~--Cp~ge~-e~prC~v~ 1318 (2531)
T KOG4289|consen 1287 --------CKNGGTCVNLLNGGFCCH--CPYGEF-EDPRCEVT 1318 (2531)
T ss_pred --------ecCCCEEeecCCCceecc--CCCccc-CCCceEEE
Confidence 54 6665421 2 567 8887 6 88999874
No 7
>KOG1225|consensus
Probab=98.71 E-value=3.2e-08 Score=86.27 Aligned_cols=64 Identities=38% Similarity=0.987 Sum_probs=56.0
Q ss_pred CCCCCCCCeEecceEeCCCCCcCCCCCCC------------------CCCCCCCCCCCCCccCCCCCCCCCcEecCCCce
Q psy18122 9 SIECLNGGVEMDSMCLCTPGFAGKKCEIP------------------CDRNNFGHKCSMLCSKSPNQCKGMVLCTSNFGC 70 (188)
Q Consensus 9 ~~~C~~~g~C~~~~C~C~~G~~G~~C~~~------------------C~~g~~G~~C~~~C~c~~~~C~~~g~C~~~~~C 70 (188)
+.+|++|+.+.++.|+|++||+|..|++. |.+||+|..|..+ .|.+++.|.+ + |
T Consensus 284 p~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~Ci~G~C~C~~Gy~G~~C~~~------~C~~~g~cv~-g-C 355 (525)
T KOG1225|consen 284 PVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKCIDGECLCDEGYTGELCIQR------ACSGGGQCVN-G-C 355 (525)
T ss_pred CcccCCCceecCCEeecCCCccccccccccCCccCCCCCcccCCceEeCCCCcCCccccc------ccCCCceecc-C-c
Confidence 44588999999999999999999999742 7899999999866 2889999998 7 9
Q ss_pred ecCCCCCCCC
Q psy18122 71 SCATGYQGEQ 80 (188)
Q Consensus 71 ~C~~g~~G~~ 80 (188)
.|..||.|.+
T Consensus 356 ~C~~Gw~G~d 365 (525)
T KOG1225|consen 356 KCKKGWRGPD 365 (525)
T ss_pred eeccCccCCC
Confidence 9999999988
No 8
>KOG4289|consensus
Probab=98.63 E-value=9.5e-08 Score=89.69 Aligned_cols=107 Identities=30% Similarity=0.658 Sum_probs=73.2
Q ss_pred CCCCCCCCCCeEec------ceEeCCCCCcCCCCCC----CCCCCCCCCCCCCCccCCCCCCCCCcEecC-CCceecCCC
Q psy18122 7 ENSIECLNGGVEMD------SMCLCTPGFAGKKCEI----PCDRNNFGHKCSMLCSKSPNQCKGMVLCTS-NFGCSCATG 75 (188)
Q Consensus 7 ~~~~~C~~~g~C~~------~~C~C~~G~~G~~C~~----~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~-~~~C~C~~g 75 (188)
+.-+||.|.|+|.. |+|.|++||.|++|+. .|+.||||..--.+|.|+... .-...|.+ ++.|.|+..
T Consensus 1719 C~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~CgpC~CavsK-gfdp~CnKt~G~CqCKe~ 1797 (2531)
T KOG4289|consen 1719 CSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCELRADQPCPRGWWGFPTCGPCNCAVSK-GFDPDCNKTNGQCQCKEN 1797 (2531)
T ss_pred hcccccccCceeecCCCCCceeEECCCcccCcchhhhccCCCCCcccCCCCccCccccccC-CCCCCccccCcceeeccc
Confidence 45688999999986 8999999999999985 499999997655777776210 01123444 588988876
Q ss_pred CCCCC--CCCCCCCCccCCCccCCCcccccceeccccccccccccCCccCCCCCC
Q psy18122 76 YQGEQ--CLEHCPEDYYGADCKQKCGECIDGCDQYTGVCRGGCTTPYLIRPYCKQ 128 (188)
Q Consensus 76 ~~G~~--C~~~C~~g~~G~~C~~~C~~C~~~C~~~~g~C~~~C~~g~~~G~~C~~ 128 (188)
+.-.. |- .|.- |+|..- ..|+ .+|+|. |.+|- .|.+|++
T Consensus 1798 hy~~~~~Cl-~CdC-~~Gs~S--------r~C~-adGqC~--C~pga-iGRqCdr 1838 (2531)
T KOG4289|consen 1798 HYRPIGSCL-PCDC-YFGSDS--------RECD-ADGQCP--CKPGA-IGRQCDR 1838 (2531)
T ss_pred cccCCCcce-eecc-ccCCCc--------cccc-CCCcCC--CCCcc-ccccccc
Confidence 54221 21 1211 122221 3555 689999 99999 9999987
No 9
>KOG1226|consensus
Probab=98.45 E-value=1.1e-06 Score=78.66 Aligned_cols=100 Identities=32% Similarity=0.783 Sum_probs=64.7
Q ss_pred CCCCCCCCeEecceEeCCCCCc----CCCCCCCCCCCCCCCCCCCCccCC-CCCCCCCcEecCCCceecCCCCCCCCCCC
Q psy18122 9 SIECLNGGVEMDSMCLCTPGFA----GKKCEIPCDRNNFGHKCSMLCSKS-PNQCKGMVLCTSNFGCSCATGYQGEQCLE 83 (188)
Q Consensus 9 ~~~C~~~g~C~~~~C~C~~G~~----G~~C~~~C~~g~~G~~C~~~C~c~-~~~C~~~g~C~~~~~C~C~~g~~G~~C~~ 83 (188)
+.+|++.|.|..|+|.|.+... |+.|+- +. ..|.-. ..-|.++|.|.- ++|.|.+||+|..|+
T Consensus 513 ~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCEC--Dn--------fsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~- 580 (783)
T KOG1226|consen 513 SPVCSGRGDCVCGQCVCHKPDNGKIYGKFCEC--DN--------FSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACN- 580 (783)
T ss_pred CCCcCCCCcEeCCceEecCCCCCceeeeeeec--cC--------cccccccCcccCCCCeEeC-CcEEcCCCCccCCCC-
Confidence 3489999999999999988877 777763 21 222211 345889999988 999999999999996
Q ss_pred CCCCCccCCCccCCCc-ccc--cceeccccccccccccC-CccCCCCCC
Q psy18122 84 HCPEDYYGADCKQKCG-ECI--DGCDQYTGVCRGGCTTP-YLIRPYCKQ 128 (188)
Q Consensus 84 ~C~~g~~G~~C~~~C~-~C~--~~C~~~~g~C~~~C~~g-~~~G~~C~~ 128 (188)
|+.. -..|...=. .|. +.|. -|+|+ |... | .|..|+.
T Consensus 581 -C~~s--td~C~~~~G~iCSGrG~C~--Cg~C~--C~~~~~-sG~~CE~ 621 (783)
T KOG1226|consen 581 -CPLS--TDTCESSDGQICSGRGTCE--CGRCK--CTDPPY-SGEFCEK 621 (783)
T ss_pred -CCCC--CccccCCCCceeCCCceee--CCceE--cCCCCc-Ccchhhc
Confidence 4421 011111100 033 3332 25666 7555 7 8888776
No 10
>KOG1836|consensus
Probab=98.43 E-value=7.2e-07 Score=86.85 Aligned_cols=125 Identities=22% Similarity=0.536 Sum_probs=90.3
Q ss_pred CCCCC-CeEec--ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCcc-CCCCCCCCCcEecC-----CCcee-cCCCCCCCC
Q psy18122 11 ECLNG-GVEMD--SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCS-KSPNQCKGMVLCTS-----NFGCS-CATGYQGEQ 80 (188)
Q Consensus 11 ~C~~~-g~C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~-c~~~~C~~~g~C~~-----~~~C~-C~~g~~G~~ 80 (188)
+|+++ .+|.. |.|.|.+...|..|+. |.+||||..=...=. |.+=+|.+++.|.. .+.|. |+++|+|.+
T Consensus 732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~~-C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 732 DCNGHSNICDPRTGQCKCKHNTFGGQCAQ-CVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred ccCCccccccCCCCceecccCCCCCchhh-hcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccc
Confidence 34443 57776 9999999999999998 999999987542111 22335666666654 46697 999999999
Q ss_pred CCCCCCCCccCCCccC-----CCcc--cc--------cceeccccccccccccCCccCCCCCCCCCCCCCCCeee
Q psy18122 81 CLEHCPEDYYGADCKQ-----KCGE--CI--------DGCDQYTGVCRGGCTTPYLIRPYCKQPHSYLEESPRVL 140 (188)
Q Consensus 81 C~~~C~~g~~G~~C~~-----~C~~--C~--------~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~~~~~~~ 140 (188)
|+ .|..||+|..-.. .|.. |+ +.|+..+|.|.. |..+- .|.+|+. |.+++++..+.
T Consensus 811 Ce-~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~-ci~nT-~g~~cd~-c~~g~~gd~l~ 881 (1705)
T KOG1836|consen 811 CE-ECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLK-CIHNT-AGEYCDL-CKEGYFGDPLA 881 (1705)
T ss_pred cc-cCCCccccCCCCCCCCcccCccceeccccCccccccccccccceee-ccCCc-ccccccc-cccCccccccC
Confidence 98 7999999853221 1322 43 788889999843 87877 8888887 76666666655
No 11
>KOG1836|consensus
Probab=98.32 E-value=1.1e-06 Score=85.57 Aligned_cols=114 Identities=25% Similarity=0.603 Sum_probs=82.6
Q ss_pred ceEeCCCCCcCCCCCCCCCCCCCCCCCC--CCccCCCCCCCCC-cEecC-CCceecCCCCCCCCCCCCCCCCccCCC---
Q psy18122 21 SMCLCTPGFAGKKCEIPCDRNNFGHKCS--MLCSKSPNQCKGM-VLCTS-NFGCSCATGYQGEQCLEHCPEDYYGAD--- 93 (188)
Q Consensus 21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~--~~C~c~~~~C~~~-g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~--- 93 (188)
-.|.|++||+|..|+. |.++|+...=. ..+.|.+-.|.++ ..|+. ++.|.|.+...|..|+ .|.+||+|..
T Consensus 695 e~c~C~~g~tG~~Ce~-C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~-~C~~GfYg~~~~~ 772 (1705)
T KOG1836|consen 695 EQCTCPVGYTGQFCES-CAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCA-QCVDGFYGLPDLG 772 (1705)
T ss_pred hhccCCCCcccchhhh-cchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchh-hhcCCCCCccccC
Confidence 5699999999999998 99999866533 1233332234443 67876 7889999999999998 7999999863
Q ss_pred ----ccCCCcccc--c----ceeccccccccccccCCccCCCCCCCCCCCCCCCeeec
Q psy18122 94 ----CKQKCGECI--D----GCDQYTGVCRGGCTTPYLIRPYCKQPHSYLEESPRVLD 141 (188)
Q Consensus 94 ----C~~~C~~C~--~----~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~~~~~~~~ 141 (188)
|. +|. |. + .++...+.|.. |+++| +|..|+. |..+++++..-.
T Consensus 773 ~~~dC~-~C~-Cp~~~~~~~~~~~~~~iCk~-Cp~gy-tG~rCe~-c~dgyfg~p~~~ 825 (1705)
T KOG1836|consen 773 TSGDCQ-PCP-CPNGGACGQTPEILEVVCKN-CPPGY-TGLRCEE-CADGYFGNPLGH 825 (1705)
T ss_pred CCCCCc-cCC-CCCChhhcCcCcccceecCC-CCCCC-ccccccc-CCCccccCCCCC
Confidence 32 344 54 3 33344567775 99999 9999998 777776555443
No 12
>KOG1219|consensus
Probab=98.17 E-value=2.1e-06 Score=83.93 Aligned_cols=69 Identities=33% Similarity=0.739 Sum_probs=57.8
Q ss_pred CCCCCCCCCCCCCeEec----ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCC
Q psy18122 4 NTNENSIECLNGGVEMD----SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGY 76 (188)
Q Consensus 4 ~~~~~~~~C~~~g~C~~----~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~ 76 (188)
.+.+.++||.+||+|.+ +.|.|+.||+|.+|+.. | .-.|..++|.++|.|.+ ++.|.|.++|
T Consensus 3903 ~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~------G-----i~eCs~n~C~~gg~C~n~~gsf~CncT~g~ 3971 (4289)
T KOG1219|consen 3903 LEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR------G-----ISECSKNVCGTGGQCINIPGSFHCNCTPGI 3971 (4289)
T ss_pred cccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc------c-----ccccccccccCCceeeccCCceEeccChhH
Confidence 35678999999999998 88999999999999875 1 11344678999999987 4679999999
Q ss_pred CCCCCCC
Q psy18122 77 QGEQCLE 83 (188)
Q Consensus 77 ~G~~C~~ 83 (188)
.|+.|..
T Consensus 3972 ~gr~c~~ 3978 (4289)
T KOG1219|consen 3972 LGRTCCA 3978 (4289)
T ss_pred hcccCcc
Confidence 9999864
No 13
>smart00051 DSL delta serrate ligand.
Probab=98.04 E-value=3.3e-06 Score=53.46 Aligned_cols=51 Identities=25% Similarity=0.631 Sum_probs=36.6
Q ss_pred cCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecCCCceecCCCCCCCCC
Q psy18122 30 AGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTSNFGCSCATGYQGEQC 81 (188)
Q Consensus 30 ~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~~~~C~C~~g~~G~~C 81 (188)
.|..++..|+++|||..|...|.+.. .+.++.+|...+.+.|.+||+|..|
T Consensus 13 l~~~~rv~C~~~~yG~~C~~~C~~~~-d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 13 LEYQIRVTCDENYYGEGCNKFCRPRD-DFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EEEEEEeeCCCCCcCCccCCEeCcCc-cccCCccCCcCCCEecCCCCcCCCC
Confidence 34445556777777777777775322 2567888887788999999999876
No 14
>KOG3512|consensus
Probab=97.96 E-value=3.4e-05 Score=65.93 Aligned_cols=107 Identities=23% Similarity=0.549 Sum_probs=73.6
Q ss_pred ceEeCCCCCcCCCCCCCCCCCCCCCCCC----------CCccCC--CCCCCCCcEecC------CCce-ecCCCCCCCCC
Q psy18122 21 SMCLCTPGFAGKKCEIPCDRNNFGHKCS----------MLCSKS--PNQCKGMVLCTS------NFGC-SCATGYQGEQC 81 (188)
Q Consensus 21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~----------~~C~c~--~~~C~~~g~C~~------~~~C-~C~~g~~G~~C 81 (188)
.+|.|.-+.+|+.|+. |.+.|++..-. ..|.|. ...|..+..+-. .+.| .|...-.|.+|
T Consensus 295 ltCdC~HNTaGPdCgr-CKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhC 373 (592)
T KOG3512|consen 295 LTCDCEHNTAGPDCGR-CKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHC 373 (592)
T ss_pred eEEecccCCCCCCccc-ccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecccCCCCccc
Confidence 6899999999999998 99988766643 112222 112222222211 2456 69999999999
Q ss_pred CCCCCCCccCCC-----ccC---CCcccc------cceeccccccccccccCCccCCCCCCCCCCCC
Q psy18122 82 LEHCPEDYYGAD-----CKQ---KCGECI------DGCDQYTGVCRGGCTTPYLIRPYCKQPHSYLE 134 (188)
Q Consensus 82 ~~~C~~g~~G~~-----C~~---~C~~C~------~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~ 134 (188)
. .|.+||+=.. -.. .|+ |. .+|+..+|+|. |.+|. +|..|.+ |.+++
T Consensus 374 h-yCreGyyRd~s~pl~hrkaCk~Cd-ChpVGs~gktCNq~tGqCp--CkeGv-tG~tCnr-Ca~gy 434 (592)
T KOG3512|consen 374 H-YCREGYYRDGSKPLTHRKACKACD-CHPVGSAGKTCNQTTGQCP--CKEGV-TGLTCNR-CAPGY 434 (592)
T ss_pred c-cccCccccCCCCCCchhhhhhhcC-CcccccccccccccCCccc--CCCCC-ccccccc-ccchh
Confidence 8 8999997421 111 144 54 68998999999 99999 9999987 55443
No 15
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.82 E-value=1.2e-05 Score=43.95 Aligned_cols=26 Identities=46% Similarity=1.039 Sum_probs=22.9
Q ss_pred CCCCCCCCCeEec-----ceEeCCCCCcCCC
Q psy18122 8 NSIECLNGGVEMD-----SMCLCTPGFAGKK 33 (188)
Q Consensus 8 ~~~~C~~~g~C~~-----~~C~C~~G~~G~~ 33 (188)
.++||.|+|+|+. +.|+|++||+|++
T Consensus 2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 4569999999987 8899999999974
No 16
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.67 E-value=5.7e-05 Score=41.16 Aligned_cols=25 Identities=28% Similarity=0.788 Sum_probs=22.9
Q ss_pred CCCCCCCeEe--cceEeCCCCCcCCCC
Q psy18122 10 IECLNGGVEM--DSMCLCTPGFAGKKC 34 (188)
Q Consensus 10 ~~C~~~g~C~--~~~C~C~~G~~G~~C 34 (188)
..|+++|+|. .+.|.|++||+|+.|
T Consensus 6 ~~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 5799999999 599999999999986
No 17
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.47 E-value=0.00018 Score=43.32 Aligned_cols=28 Identities=25% Similarity=0.735 Sum_probs=25.0
Q ss_pred Eec--ceEeCCCCCcCCCCCCCCCCCCCCCC
Q psy18122 18 EMD--SMCLCTPGFAGKKCEIPCDRNNFGHK 46 (188)
Q Consensus 18 C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~ 46 (188)
|.. ++|.|+++|.|..|+. |.++|||..
T Consensus 14 C~~~~G~C~C~~~~~G~~C~~-C~~g~~~~~ 43 (50)
T cd00055 14 CDPGTGQCECKPNTTGRRCDR-CAPGYYGLP 43 (50)
T ss_pred ccCCCCEEeCCCcCCCCCCCC-CCCCCccCC
Confidence 765 9999999999999996 999999764
No 18
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.33 E-value=7.4e-05 Score=40.72 Aligned_cols=26 Identities=31% Similarity=0.928 Sum_probs=21.7
Q ss_pred CCCCCCCcEecC----CCceecCCCCCCCC
Q psy18122 55 PNQCKGMVLCTS----NFGCSCATGYQGEQ 80 (188)
Q Consensus 55 ~~~C~~~g~C~~----~~~C~C~~g~~G~~ 80 (188)
+.+|.++|+|.+ .+.|.|++||+|++
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 558999999986 46799999999964
No 19
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.30 E-value=0.0001 Score=44.14 Aligned_cols=30 Identities=30% Similarity=0.805 Sum_probs=25.1
Q ss_pred eEec--ceEeCCCCCcCCCCCCCCCCCCCCCCC
Q psy18122 17 VEMD--SMCLCTPGFAGKKCEIPCDRNNFGHKC 47 (188)
Q Consensus 17 ~C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~C 47 (188)
.|.. ++|.|+++|+|+.|++ |.++||+...
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~~-C~~g~~~~~~ 43 (49)
T PF00053_consen 12 TCDPSTGQCVCKPGTTGPRCDQ-CKPGYFGLPS 43 (49)
T ss_dssp SEEETCEEESBSTTEESTTS-E-E-TTEECSTT
T ss_pred cccCCCCEEeccccccCCcCcC-CCCccccccC
Confidence 7776 9999999999999997 9999998765
No 20
>smart00051 DSL delta serrate ligand.
Probab=97.27 E-value=0.00013 Score=46.17 Aligned_cols=45 Identities=33% Similarity=0.810 Sum_probs=31.0
Q ss_pred CCCCCCCCCCCccCCCccCCCcc---cc--cceeccccccccccccCCccCCCC
Q psy18122 78 GEQCLEHCPEDYYGADCKQKCGE---CI--DGCDQYTGVCRGGCTTPYLIRPYC 126 (188)
Q Consensus 78 G~~C~~~C~~g~~G~~C~~~C~~---C~--~~C~~~~g~C~~~C~~g~~~G~~C 126 (188)
+..+...|+++|+|..|...|.+ +. ..|+. .|.+. |.+|| +|..|
T Consensus 14 ~~~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~~--C~~Gw-~G~~C 63 (63)
T smart00051 14 EYQIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNKG--CLEGW-MGPYC 63 (63)
T ss_pred EEEEEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCEe--cCCCC-cCCCC
Confidence 33444566677777777666652 22 57776 68999 99999 99876
No 21
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.17 E-value=0.00047 Score=37.52 Aligned_cols=25 Identities=40% Similarity=0.974 Sum_probs=22.0
Q ss_pred CCCCCcEecCC-CceecCCCCCCCCC
Q psy18122 57 QCKGMVLCTSN-FGCSCATGYQGEQC 81 (188)
Q Consensus 57 ~C~~~g~C~~~-~~C~C~~g~~G~~C 81 (188)
.|.++|+|+.. +.|.|.+||+|..|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 58899999964 89999999999876
No 22
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.08 E-value=0.00021 Score=30.85 Aligned_cols=13 Identities=46% Similarity=1.474 Sum_probs=10.7
Q ss_pred eEeCCCCCcCCCC
Q psy18122 22 MCLCTPGFAGKKC 34 (188)
Q Consensus 22 ~C~C~~G~~G~~C 34 (188)
.|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999987
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.07 E-value=0.0011 Score=37.06 Aligned_cols=26 Identities=42% Similarity=1.104 Sum_probs=22.9
Q ss_pred CCCCCCCeEec----ceEeCCCCCc-CCCCC
Q psy18122 10 IECLNGGVEMD----SMCLCTPGFA-GKKCE 35 (188)
Q Consensus 10 ~~C~~~g~C~~----~~C~C~~G~~-G~~C~ 35 (188)
.+|.+++.|.+ +.|.|++||. |..|+
T Consensus 9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 78999999987 7899999999 88774
No 24
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.01 E-value=0.00092 Score=40.17 Aligned_cols=30 Identities=30% Similarity=0.825 Sum_probs=25.5
Q ss_pred EecC-CCceecCCCCCCCCCCCCCCCCccCCC
Q psy18122 63 LCTS-NFGCSCATGYQGEQCLEHCPEDYYGAD 93 (188)
Q Consensus 63 ~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~ 93 (188)
.|.. ++.|.|+++|+|..|+ .|.++|+|..
T Consensus 13 ~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~ 43 (50)
T cd00055 13 QCDPGTGQCECKPNTTGRRCD-RCAPGYYGLP 43 (50)
T ss_pred cccCCCCEEeCCCcCCCCCCC-CCCCCCccCC
Confidence 3554 6889999999999998 7999999854
No 25
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=96.97 E-value=0.00094 Score=39.45 Aligned_cols=27 Identities=26% Similarity=0.821 Sum_probs=24.2
Q ss_pred eEec--ceEeCCCCCcCCCCCCCCCCCCCC
Q psy18122 17 VEMD--SMCLCTPGFAGKKCEIPCDRNNFG 44 (188)
Q Consensus 17 ~C~~--~~C~C~~G~~G~~C~~~C~~g~~G 44 (188)
.|.. ++|.|+++|+|.+|+. |.++|||
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~~-C~~g~~g 40 (46)
T smart00180 12 TCDPDTGQCECKPNVTGRRCDR-CAPGYYG 40 (46)
T ss_pred cccCCCCEEECCCCCCCCCCCc-CCCCcCC
Confidence 5655 8999999999999996 9999998
No 26
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=96.88 E-value=0.00039 Score=41.58 Aligned_cols=32 Identities=28% Similarity=0.774 Sum_probs=25.2
Q ss_pred cEecC-CCceecCCCCCCCCCCCCCCCCccCCCc
Q psy18122 62 VLCTS-NFGCSCATGYQGEQCLEHCPEDYYGADC 94 (188)
Q Consensus 62 g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~C 94 (188)
..|.. ++.|.|+++|+|+.|+ .|.++|++...
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~~ 43 (49)
T PF00053_consen 11 QTCDPSTGQCVCKPGTTGPRCD-QCKPGYFGLPS 43 (49)
T ss_dssp SSEEETCEEESBSTTEESTTS--EE-TTEECSTT
T ss_pred CcccCCCCEEeccccccCCcCc-CCCCccccccC
Confidence 35655 6889999999999999 69999999754
No 27
>KOG1218|consensus
Probab=96.83 E-value=0.0079 Score=49.36 Aligned_cols=63 Identities=37% Similarity=0.971 Sum_probs=46.2
Q ss_pred cEecCCC-ceecCCCCCCCCCCCCCCCCccCCCccCCCcccccceeccccccccccccCCccCCCCCCCCC
Q psy18122 62 VLCTSNF-GCSCATGYQGEQCLEHCPEDYYGADCKQKCGECIDGCDQYTGVCRGGCTTPYLIRPYCKQPHS 131 (188)
Q Consensus 62 g~C~~~~-~C~C~~g~~G~~C~~~C~~g~~G~~C~~~C~~C~~~C~~~~g~C~~~C~~g~~~G~~C~~~c~ 131 (188)
-+|.... .|.+..+|.+..|.. ++|+|..|...|. +...++...+.|. |.+|| .|.+|...++
T Consensus 117 ~~C~~~~~~c~~~~~~~~~~C~~---~~~~g~~C~~~c~-~~~~~~~~~~~c~--c~~g~-~g~~~~~~~~ 180 (316)
T KOG1218|consen 117 KTCANPRRECRCGGGYIGEQCGE---ENLVGLKCQRDCQ-CTGGCDCKNGICT--CQPGF-VGVFCVESCS 180 (316)
T ss_pred cccCCCccceecCCcCccccccc---cCCCCCCccCCCC-CccccCCCCCcee--ccCCc-ccccccccCC
Confidence 3455433 478888888888863 6888888887763 4456666678899 99999 9998887643
No 28
>KOG4260|consensus
Probab=96.77 E-value=0.0014 Score=52.72 Aligned_cols=53 Identities=32% Similarity=0.780 Sum_probs=35.9
Q ss_pred CCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC------CCceecCCCCCCCCCCCCCCCCccC
Q psy18122 25 CTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS------NFGCSCATGYQGEQCLEHCPEDYYG 91 (188)
Q Consensus 25 C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~------~~~C~C~~g~~G~~C~~~C~~g~~G 91 (188)
|++|..|++|.. |+-+. ..+|.++|.|.- ++.|.|.+||+|+.|. .|.++|+-
T Consensus 132 Cp~gtyGpdCl~-Cpggs------------er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-~Cg~eyfe 190 (350)
T KOG4260|consen 132 CPDGTYGPDCLQ-CPGGS------------ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-YCGIEYFE 190 (350)
T ss_pred cCCCCcCCcccc-CCCCC------------cCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-ccchHHHH
Confidence 566666666654 54221 235666666753 5789999999999997 67777764
No 29
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.75 E-value=0.0028 Score=34.81 Aligned_cols=26 Identities=42% Similarity=1.113 Sum_probs=22.6
Q ss_pred CCCCCCCeEec----ceEeCCCCCcCCCCC
Q psy18122 10 IECLNGGVEMD----SMCLCTPGFAGKKCE 35 (188)
Q Consensus 10 ~~C~~~g~C~~----~~C~C~~G~~G~~C~ 35 (188)
.+|.+++.|.. +.|.|++||.|..|+
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 68999999987 789999999998774
No 30
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=96.75 E-value=0.0016 Score=38.49 Aligned_cols=28 Identities=32% Similarity=0.920 Sum_probs=24.5
Q ss_pred EecC-CCceecCCCCCCCCCCCCCCCCccC
Q psy18122 63 LCTS-NFGCSCATGYQGEQCLEHCPEDYYG 91 (188)
Q Consensus 63 ~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G 91 (188)
.|.. ++.|.|+++|+|..|+ .|++||+|
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~-~C~~g~~g 40 (46)
T smart00180 12 TCDPDTGQCECKPNVTGRRCD-RCAPGYYG 40 (46)
T ss_pred cccCCCCEEECCCCCCCCCCC-cCCCCcCC
Confidence 4544 5889999999999998 79999998
No 31
>KOG1217|consensus
Probab=96.58 E-value=0.021 Score=48.99 Aligned_cols=114 Identities=26% Similarity=0.706 Sum_probs=67.9
Q ss_pred CCCCCCCCCeEec----ceEeCCCCCcCCCCCCC-------------CCCCCCCCCCC----------------------
Q psy18122 8 NSIECLNGGVEMD----SMCLCTPGFAGKKCEIP-------------CDRNNFGHKCS---------------------- 48 (188)
Q Consensus 8 ~~~~C~~~g~C~~----~~C~C~~G~~G~~C~~~-------------C~~g~~G~~C~---------------------- 48 (188)
...+|.+++.|.. +.|.|+++|.|..|+.. +..++.+..|.
T Consensus 175 ~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C 254 (487)
T KOG1217|consen 175 YSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTC 254 (487)
T ss_pred CCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceee
Confidence 3456899999987 67999999999987642 23333333332
Q ss_pred ---------------CCccCCCC-CCCCCcEecC---CCceecCCCCCCCCCC-----CCCCCCccCCCccCCCcccccc
Q psy18122 49 ---------------MLCSKSPN-QCKGMVLCTS---NFGCSCATGYQGEQCL-----EHCPEDYYGADCKQKCGECIDG 104 (188)
Q Consensus 49 ---------------~~C~c~~~-~C~~~g~C~~---~~~C~C~~g~~G~~C~-----~~C~~g~~G~~C~~~C~~C~~~ 104 (188)
..-.|... +|.++++|.. .+.|.|+++|+|..|. ..|...+.+..|... ..
T Consensus 255 ~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g-----~~ 329 (487)
T KOG1217|consen 255 RCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANG-----GT 329 (487)
T ss_pred eCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCC-----cc
Confidence 11122222 3777888876 2678888888888871 133332222222211 23
Q ss_pred eec----cccccccccccCCccCCCCCCC
Q psy18122 105 CDQ----YTGVCRGGCTTPYLIRPYCKQP 129 (188)
Q Consensus 105 C~~----~~g~C~~~C~~g~~~G~~C~~~ 129 (188)
|.. ....|. |..+| .|..|+..
T Consensus 330 C~~~~~~~~~~C~--c~~~~-~g~~C~~~ 355 (487)
T KOG1217|consen 330 CNTLGSFGGFRCA--CGPGF-TGRRCEDS 355 (487)
T ss_pred cccCCCCCCCCcC--CCCCC-CCCccccC
Confidence 311 112588 88999 99999854
No 32
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=96.44 E-value=0.00059 Score=43.13 Aligned_cols=43 Identities=35% Similarity=0.804 Sum_probs=20.9
Q ss_pred CCCCCCCCCCCCCccCCCCCCCCCcEecCCCceecCCCCCCCCC
Q psy18122 38 CDRNNFGHKCSMLCSKSPNQCKGMVLCTSNFGCSCATGYQGEQC 81 (188)
Q Consensus 38 C~~g~~G~~C~~~C~c~~~~C~~~g~C~~~~~C~C~~g~~G~~C 81 (188)
|.++|||..|...|.-.. .-..+.+|...+.=+|.+||+|+.|
T Consensus 21 C~~nyyG~~C~~~C~~~~-d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 21 CDENYYGPNCSKFCKPRD-DSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp --TTEETTTT-EE---EE-ETTEEEEE-SS--EEE-TTEESTTS
T ss_pred CCCCCCCccccCCcCCCc-CCcCCcccCCCCCCCCCCCCcCCCC
Confidence 666666666665554211 1135667777677788888888765
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.31 E-value=0.0075 Score=32.51 Aligned_cols=26 Identities=42% Similarity=1.101 Sum_probs=22.5
Q ss_pred CCCCCCCCeEec----ceEeCCCCCcCC-CC
Q psy18122 9 SIECLNGGVEMD----SMCLCTPGFAGK-KC 34 (188)
Q Consensus 9 ~~~C~~~g~C~~----~~C~C~~G~~G~-~C 34 (188)
..+|.+++.|.. +.|.|++||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 578999999987 889999999998 55
No 34
>KOG4260|consensus
Probab=96.15 E-value=0.0039 Score=50.18 Aligned_cols=33 Identities=27% Similarity=0.765 Sum_probs=27.4
Q ss_pred CCCCCCCeEec-------ceEeCCCCCcCCCCCCCCCCCCC
Q psy18122 10 IECLNGGVEMD-------SMCLCTPGFAGKKCEIPCDRNNF 43 (188)
Q Consensus 10 ~~C~~~g~C~~-------~~C~C~~G~~G~~C~~~C~~g~~ 43 (188)
-||.++|.|+- |.|.|.+||.|+.|.. |.++||
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~-Cg~eyf 189 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY-CGIEYF 189 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccc-cchHHH
Confidence 56888888875 8999999999999986 666654
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.00 E-value=0.013 Score=31.91 Aligned_cols=25 Identities=44% Similarity=1.099 Sum_probs=21.0
Q ss_pred CCCCCCCeEec----ceEeCCCCCcC-CCCC
Q psy18122 10 IECLNGGVEMD----SMCLCTPGFAG-KKCE 35 (188)
Q Consensus 10 ~~C~~~g~C~~----~~C~C~~G~~G-~~C~ 35 (188)
.+|.++ .|.. +.|.|++||.| ..|+
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence 689888 8986 89999999999 6653
No 36
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.94 E-value=0.01 Score=32.91 Aligned_cols=27 Identities=33% Similarity=0.995 Sum_probs=21.2
Q ss_pred CCCCCCcEecC---CCceecCCCCC-CCCCC
Q psy18122 56 NQCKGMVLCTS---NFGCSCATGYQ-GEQCL 82 (188)
Q Consensus 56 ~~C~~~g~C~~---~~~C~C~~g~~-G~~C~ 82 (188)
.+|.+++.|.+ .+.|.|++||. |..|+
T Consensus 9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 46778888876 46799999998 87764
No 37
>KOG1217|consensus
Probab=95.71 E-value=0.087 Score=45.11 Aligned_cols=102 Identities=30% Similarity=0.774 Sum_probs=60.5
Q ss_pred ceEeCCCCCcCCCCCCC-----------------------------CCCCCCCCCCCC---CccCCCCCCCCCcEecC--
Q psy18122 21 SMCLCTPGFAGKKCEIP-----------------------------CDRNNFGHKCSM---LCSKSPNQCKGMVLCTS-- 66 (188)
Q Consensus 21 ~~C~C~~G~~G~~C~~~-----------------------------C~~g~~G~~C~~---~C~c~~~~C~~~g~C~~-- 66 (188)
+.|.|++||.|..|+.. |..+|.+..+.. .|.-...+|.+++.|.+
T Consensus 110 ~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~ 189 (487)
T KOG1217|consen 110 YECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTG 189 (487)
T ss_pred ceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCC
Confidence 66788888877655541 334444444442 23222446888888876
Q ss_pred -CCceecCCCCCCCCCCCC-------------CCCCccCCCccCCCcccc---cceeccc--cccccccccCCccCCC
Q psy18122 67 -NFGCSCATGYQGEQCLEH-------------CPEDYYGADCKQKCGECI---DGCDQYT--GVCRGGCTTPYLIRPY 125 (188)
Q Consensus 67 -~~~C~C~~g~~G~~C~~~-------------C~~g~~G~~C~~~C~~C~---~~C~~~~--g~C~~~C~~g~~~G~~ 125 (188)
.+.|.|+++|.|..|+.. +..++.+..|......|. +.|.... .+|. |++|| .+..
T Consensus 190 ~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~--~~~g~-~~~~ 264 (487)
T KOG1217|consen 190 GSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCR--CPEGY-TGDA 264 (487)
T ss_pred CCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeee--CCCCc-cccc
Confidence 466999999999988742 455666666554321132 2332221 2566 77787 7665
No 38
>KOG3512|consensus
Probab=95.63 E-value=0.019 Score=49.53 Aligned_cols=69 Identities=25% Similarity=0.642 Sum_probs=51.5
Q ss_pred ceEe-CCCCCcCCCCCCCCCCCCCCCCCC--------CCccCCCCCCCCCcEecC-CCceecCCCCCCCCCCCCCCCCcc
Q psy18122 21 SMCL-CTPGFAGKKCEIPCDRNNFGHKCS--------MLCSKSPNQCKGMVLCTS-NFGCSCATGYQGEQCLEHCPEDYY 90 (188)
Q Consensus 21 ~~C~-C~~G~~G~~C~~~C~~g~~G~~C~--------~~C~c~~~~C~~~g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~ 90 (188)
+.|+ |.-...|++|.- |.+|||...=. ..|.|.+- =.-+-+|.. ++.|.|++|.+|..|. .|.+||.
T Consensus 359 gvClnCrHnTaGrhChy-CreGyyRd~s~pl~hrkaCk~CdChpV-Gs~gktCNq~tGqCpCkeGvtG~tCn-rCa~gyq 435 (592)
T KOG3512|consen 359 GVCLNCRHNTAGRHCHY-CREGYYRDGSKPLTHRKACKACDCHPV-GSAGKTCNQTTGQCPCKEGVTGLTCN-RCAPGYQ 435 (592)
T ss_pred ceEeecccCCCCccccc-ccCccccCCCCCCchhhhhhhcCCccc-ccccccccccCCcccCCCCCcccccc-cccchhh
Confidence 6776 899999999987 99999966532 33544421 123446764 7899999999999998 6999987
Q ss_pred CC
Q psy18122 91 GA 92 (188)
Q Consensus 91 G~ 92 (188)
-.
T Consensus 436 qs 437 (592)
T KOG3512|consen 436 QS 437 (592)
T ss_pred cc
Confidence 54
No 39
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.41 E-value=0.021 Score=31.16 Aligned_cols=27 Identities=37% Similarity=1.020 Sum_probs=20.4
Q ss_pred CCCCCCcEecC---CCceecCCCCCCCCCC
Q psy18122 56 NQCKGMVLCTS---NFGCSCATGYQGEQCL 82 (188)
Q Consensus 56 ~~C~~~g~C~~---~~~C~C~~g~~G~~C~ 82 (188)
.+|.+++.|.+ .+.|.|+++|.|..|+
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 35777778876 4669999999987763
No 40
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=95.40 E-value=0.007 Score=38.23 Aligned_cols=39 Identities=33% Similarity=0.830 Sum_probs=19.8
Q ss_pred CCCCCccCCCccCCCccc---c--cceeccccccccccccCCccCCCC
Q psy18122 84 HCPEDYYGADCKQKCGEC---I--DGCDQYTGVCRGGCTTPYLIRPYC 126 (188)
Q Consensus 84 ~C~~g~~G~~C~~~C~~C---~--~~C~~~~g~C~~~C~~g~~~G~~C 126 (188)
.|.+.|+|..|...|.+= . -+|+ .+|.-+ |.+|| +|+.|
T Consensus 20 ~C~~nyyG~~C~~~C~~~~d~~ghy~Cd-~~G~~~--C~~Gw-~G~~C 63 (63)
T PF01414_consen 20 VCDENYYGPNCSKFCKPRDDSFGHYTCD-SNGNKV--CLPGW-TGPNC 63 (63)
T ss_dssp ---TTEETTTT-EE---EEETTEEEEE--SS--EE--E-TTE-ESTTS
T ss_pred ECCCCCCCccccCCcCCCcCCcCCcccC-CCCCCC--CCCCC-cCCCC
Confidence 566666666666555431 1 4666 478888 99999 99987
No 41
>KOG1218|consensus
Probab=95.27 E-value=0.13 Score=42.03 Aligned_cols=26 Identities=35% Similarity=0.832 Sum_probs=18.2
Q ss_pred CCCCeEecceEeCCCCCcCCCCCCCC
Q psy18122 13 LNGGVEMDSMCLCTPGFAGKKCEIPC 38 (188)
Q Consensus 13 ~~~g~C~~~~C~C~~G~~G~~C~~~C 38 (188)
..+-.+..+.|.|.+||.|.+|...+
T Consensus 154 ~~~~~~~~~~c~c~~g~~g~~~~~~~ 179 (316)
T KOG1218|consen 154 TGGCDCKNGICTCQPGFVGVFCVESC 179 (316)
T ss_pred ccccCCCCCceeccCCcccccccccC
Confidence 33334445888899999999887653
No 42
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.24 E-value=0.027 Score=30.21 Aligned_cols=26 Identities=35% Similarity=1.059 Sum_probs=20.0
Q ss_pred CCCCCCcEecC---CCceecCCCCCCC-CC
Q psy18122 56 NQCKGMVLCTS---NFGCSCATGYQGE-QC 81 (188)
Q Consensus 56 ~~C~~~g~C~~---~~~C~C~~g~~G~-~C 81 (188)
.+|.+++.|.. .+.|.|++||.|. .|
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 46777788876 5779999999887 54
No 43
>KOG1214|consensus
Probab=94.76 E-value=0.062 Score=49.60 Aligned_cols=22 Identities=23% Similarity=0.564 Sum_probs=14.5
Q ss_pred CCeEec-----ceEeCCCCCcCC--CCCC
Q psy18122 15 GGVEMD-----SMCLCTPGFAGK--KCEI 36 (188)
Q Consensus 15 ~g~C~~-----~~C~C~~G~~G~--~C~~ 36 (188)
+..|.+ ++|.|..||.|. +|..
T Consensus 705 ~a~C~pg~~~~~tcecs~g~~gdgr~c~d 733 (1289)
T KOG1214|consen 705 TARCHPGTGVDYTCECSSGYQGDGRNCVD 733 (1289)
T ss_pred CccccCCCCcceEEEEeeccCCCCCCCCC
Confidence 345655 589999988644 5543
No 44
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.75 E-value=0.043 Score=29.71 Aligned_cols=25 Identities=36% Similarity=1.131 Sum_probs=18.6
Q ss_pred CCCCCCcEecC---CCceecCCCCCC-CCC
Q psy18122 56 NQCKGMVLCTS---NFGCSCATGYQG-EQC 81 (188)
Q Consensus 56 ~~C~~~g~C~~---~~~C~C~~g~~G-~~C 81 (188)
.+|.++ +|.+ .+.|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 356676 7765 577999999998 655
No 45
>KOG0196|consensus
Probab=94.49 E-value=0.24 Score=45.90 Aligned_cols=92 Identities=28% Similarity=0.549 Sum_probs=53.3
Q ss_pred CCCCcEecC-CCceecCCCCC----CCCCCCCCCCCccCCCc-cCCCcccc-ccee--ccccccccccccCCccCCC---
Q psy18122 58 CKGMVLCTS-NFGCSCATGYQ----GEQCLEHCPEDYYGADC-KQKCGECI-DGCD--QYTGVCRGGCTTPYLIRPY--- 125 (188)
Q Consensus 58 C~~~g~C~~-~~~C~C~~g~~----G~~C~~~C~~g~~G~~C-~~~C~~C~-~~C~--~~~g~C~~~C~~g~~~G~~--- 125 (188)
|..-|.... .+.|.|.+||. |..|+ .|+.|++=..= ...|.+|. ..-. .....|. |..|| .-..
T Consensus 248 C~~dGeWlvpiG~C~C~aGye~~~~~~~C~-aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~--C~~gy-yRA~~Dp 323 (996)
T KOG0196|consen 248 CSGDGEWLVPIGGCVCKAGYEEAENGKACQ-ACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCT--CENGY-YRADSDP 323 (996)
T ss_pred EcCCCcEEEEcCceeecCCCCcccCCCcce-eCCCCcccCCCCCCCCCCCCCCCCCCCCCCCccc--ccCCc-ccCCCCC
Confidence 444444443 58899999994 67887 79999875433 33365565 2221 2233698 99998 4322
Q ss_pred CCCCCCCCCCCCeeecCccceEEEEEee
Q psy18122 126 CKQPHSYLEESPRVLDSSYNAVNLQIDL 153 (188)
Q Consensus 126 C~~~c~~~~~~~~~~~~~~~~~~~~~~~ 153 (188)
=+.+|....+.|.-+-...+..++.++.
T Consensus 324 ~~mpCT~PPSaP~nlis~vn~Ts~~L~W 351 (996)
T KOG0196|consen 324 PSMPCTRPPSAPRNLISNVNGTSLILEW 351 (996)
T ss_pred CCCCCCCCCCccceeeeecccceEEEEe
Confidence 2234654456666554444444444443
No 46
>KOG1214|consensus
Probab=93.98 E-value=0.12 Score=47.87 Aligned_cols=47 Identities=36% Similarity=0.957 Sum_probs=33.5
Q ss_pred ceEeCCCCCcCC--CCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCCCCCC
Q psy18122 21 SMCLCTPGFAGK--KCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGYQGEQ 80 (188)
Q Consensus 21 ~~C~C~~G~~G~--~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~~G~~ 80 (188)
++|.|.|||.|. .|.. .-.|.+..|....+|.. ++.|.|.+||.|.-
T Consensus 809 y~C~CLPGfsGDG~~c~d-------------vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 809 YSCACLPGFSGDGHQCTD-------------VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred EEEeecCCccCCcccccc-------------ccccCccccCCCceEecCCCcceeecccCccCCC
Confidence 789999999876 2322 11334566778888876 57799999998764
No 47
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=91.24 E-value=0.18 Score=28.79 Aligned_cols=22 Identities=27% Similarity=0.818 Sum_probs=19.4
Q ss_pred CCCCCCCCeEec----ceEeCCCCCc
Q psy18122 9 SIECLNGGVEMD----SMCLCTPGFA 30 (188)
Q Consensus 9 ~~~C~~~g~C~~----~~C~C~~G~~ 30 (188)
+.+|..++.|++ |.|.|++||.
T Consensus 9 ~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 9 PHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 357988999998 8999999997
No 48
>KOG1388|consensus
Probab=91.01 E-value=0.44 Score=37.22 Aligned_cols=71 Identities=30% Similarity=0.735 Sum_probs=48.6
Q ss_pred CCCCcEecCCCce-ecCCCCCCCCCCCCCCCCccCCCccCCCcc--cc---cceecccccccccc-ccCCccCCCCCCCC
Q psy18122 58 CKGMVLCTSNFGC-SCATGYQGEQCLEHCPEDYYGADCKQKCGE--CI---DGCDQYTGVCRGGC-TTPYLIRPYCKQPH 130 (188)
Q Consensus 58 C~~~g~C~~~~~C-~C~~g~~G~~C~~~C~~g~~G~~C~~~C~~--C~---~~C~~~~g~C~~~C-~~g~~~G~~C~~~c 130 (188)
|.+.+.|.....| .|..+-+|..|+ .|..+|+|..-...|.+ |. ..|...+++|. | ..++ .|..|.+-.
T Consensus 54 cNGh~~c~t~~v~~~~~N~~~g~~c~-kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~--c~~kgv-vgd~c~~~e 129 (217)
T KOG1388|consen 54 CNGHSDCNTQHVCWRCENGTTGAHCE-KCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCF--CTTKGV-VGDLCPKCE 129 (217)
T ss_pred hcCCCCcccceeeeeccCccccccCC-ceEEEEEecCCCCccCHhhhcCCeeeeeccCCccc--cccceE-ecccCcccc
Confidence 4455556543445 678888999998 69999999432223433 43 67778889999 8 5677 888888744
Q ss_pred CC
Q psy18122 131 SY 132 (188)
Q Consensus 131 ~~ 132 (188)
.+
T Consensus 130 ~~ 131 (217)
T KOG1388|consen 130 VP 131 (217)
T ss_pred cc
Confidence 33
No 49
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=89.67 E-value=0.21 Score=28.56 Aligned_cols=23 Identities=35% Similarity=1.061 Sum_probs=18.9
Q ss_pred CCCCCCCcEecC---CCceecCCCCC
Q psy18122 55 PNQCKGMVLCTS---NFGCSCATGYQ 77 (188)
Q Consensus 55 ~~~C~~~g~C~~---~~~C~C~~g~~ 77 (188)
...|..++.|++ +|.|.|++||.
T Consensus 9 ~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 9 PHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 446878889987 57899999997
No 50
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.73 E-value=0.13 Score=28.67 Aligned_cols=24 Identities=33% Similarity=1.053 Sum_probs=17.0
Q ss_pred CCCCCCcEecC---CCceecCCCCCCC
Q psy18122 56 NQCKGMVLCTS---NFGCSCATGYQGE 79 (188)
Q Consensus 56 ~~C~~~g~C~~---~~~C~C~~g~~G~ 79 (188)
..|..+..|.. .+.|.|++||.|.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 35677788876 5779999999875
No 51
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.46 E-value=0.24 Score=27.52 Aligned_cols=23 Identities=26% Similarity=0.704 Sum_probs=17.6
Q ss_pred CCCCCCCeEec----ceEeCCCCCcCC
Q psy18122 10 IECLNGGVEMD----SMCLCTPGFAGK 32 (188)
Q Consensus 10 ~~C~~~g~C~~----~~C~C~~G~~G~ 32 (188)
..|..+..|.. +.|.|++||+|.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 46778888887 899999999886
No 52
>KOG3514|consensus
Probab=87.67 E-value=0.35 Score=45.97 Aligned_cols=36 Identities=33% Similarity=0.684 Sum_probs=29.6
Q ss_pred CCCCCCCCCCCCCCCeEec----ceEeC-CCCCcCCCCCCC
Q psy18122 2 TSNTNENSIECLNGGVEMD----SMCLC-TPGFAGKKCEIP 37 (188)
Q Consensus 2 ~~~~~~~~~~C~~~g~C~~----~~C~C-~~G~~G~~C~~~ 37 (188)
+++.-++++||.|+|.|.. +.|.| ..||.|+.|+..
T Consensus 621 ~~~~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE 661 (1591)
T KOG3514|consen 621 SNEKICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE 661 (1591)
T ss_pred hhccccCCCcccCCCCccccccccccccccCcccCccccce
Confidence 3455689999999999987 78988 568999999863
No 53
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=84.67 E-value=0.77 Score=32.96 Aligned_cols=25 Identities=32% Similarity=0.914 Sum_probs=18.5
Q ss_pred CCCCCeEec------ceEeCCCCCcCCCCCCC
Q psy18122 12 CLNGGVEMD------SMCLCTPGFAGKKCEIP 37 (188)
Q Consensus 12 C~~~g~C~~------~~C~C~~G~~G~~C~~~ 37 (188)
|.|| .|.- ..|.|+.||+|.+|+..
T Consensus 53 ClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 53 CLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred eECC-EEEeeccCCCceeECCCCcccccccce
Confidence 5554 6643 77999999999999864
No 54
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=84.63 E-value=3.8 Score=24.25 Aligned_cols=20 Identities=40% Similarity=0.977 Sum_probs=16.1
Q ss_pred CCCCCcEecCCCceecCCCCC
Q psy18122 57 QCKGMVLCTSNFGCSCATGYQ 77 (188)
Q Consensus 57 ~C~~~g~C~~~~~C~C~~g~~ 77 (188)
.|..+..|.+ +.|.|++||.
T Consensus 27 qC~~~s~C~~-g~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVN-GRCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcC-CEeECCCCCE
Confidence 4567788876 8999999975
No 55
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=80.01 E-value=2.7 Score=24.94 Aligned_cols=25 Identities=32% Similarity=0.902 Sum_probs=19.8
Q ss_pred CCCCCCCCCCCeEecceEeCCCCCc
Q psy18122 6 NENSIECLNGGVEMDSMCLCTPGFA 30 (188)
Q Consensus 6 ~~~~~~C~~~g~C~~~~C~C~~G~~ 30 (188)
+.....|..+..|..+.|.|++||.
T Consensus 22 C~~~~qC~~~s~C~~g~C~C~~g~~ 46 (52)
T PF01683_consen 22 CESDEQCIGGSVCVNGRCQCPPGYV 46 (52)
T ss_pred CCCcCCCCCcCEEcCCEeECCCCCE
Confidence 3455668888899889999999974
No 56
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=78.57 E-value=1.8 Score=30.05 Aligned_cols=25 Identities=20% Similarity=0.476 Sum_probs=18.7
Q ss_pred CCCCCCCCCeEec---------ceEeCCCCCcCC
Q psy18122 8 NSIECLNGGVEMD---------SMCLCTPGFAGK 32 (188)
Q Consensus 8 ~~~~C~~~g~C~~---------~~C~C~~G~~G~ 32 (188)
..+.|++||.|.. +.|.|.+.+...
T Consensus 11 ~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~ 44 (103)
T PF12955_consen 11 ATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT 44 (103)
T ss_pred hccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence 4577999999976 679998854433
No 57
>KOG0196|consensus
Probab=78.44 E-value=3.4 Score=38.66 Aligned_cols=52 Identities=29% Similarity=0.790 Sum_probs=34.2
Q ss_pred ceEeCCCCCc----CCCCCCCCCCCCCCCCC-CCCccCCCCCCCCCcEecC--CCceecCCCCC
Q psy18122 21 SMCLCTPGFA----GKKCEIPCDRNNFGHKC-SMLCSKSPNQCKGMVLCTS--NFGCSCATGYQ 77 (188)
Q Consensus 21 ~~C~C~~G~~----G~~C~~~C~~g~~G~~C-~~~C~c~~~~C~~~g~C~~--~~~C~C~~g~~ 77 (188)
|.|.|.+||. |..|+. |++|+|-..- ...|. +|..+..=.. ...|.|..||.
T Consensus 259 G~C~C~aGye~~~~~~~C~a-Cp~G~yK~~~~~~~C~----~CP~~S~s~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQA-CPPGTYKASQGDSLCL----PCPPNSHSSSEGATSCTCENGYY 317 (996)
T ss_pred CceeecCCCCcccCCCccee-CCCCcccCCCCCCCCC----CCCCCCCCCCCCCCcccccCCcc
Confidence 9999999995 567876 9999885544 23443 3444332211 34588888875
No 58
>PHA02887 EGF-like protein; Provisional
Probab=77.88 E-value=2 Score=30.41 Aligned_cols=26 Identities=35% Similarity=0.700 Sum_probs=17.5
Q ss_pred CCCCCcEecC-----CCceecCCCCCCCCCCC
Q psy18122 57 QCKGMVLCTS-----NFGCSCATGYQGEQCLE 83 (188)
Q Consensus 57 ~C~~~g~C~~-----~~~C~C~~g~~G~~C~~ 83 (188)
-|- +|.|.- ...|.|+.||+|.+|+.
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 344 356753 35588888888888863
No 59
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=74.99 E-value=3 Score=20.95 Aligned_cols=11 Identities=55% Similarity=1.280 Sum_probs=8.9
Q ss_pred CceecCCCCCC
Q psy18122 68 FGCSCATGYQG 78 (188)
Q Consensus 68 ~~C~C~~g~~G 78 (188)
+.|.|++||.-
T Consensus 2 y~C~C~~Gy~l 12 (24)
T PF12662_consen 2 YTCSCPPGYQL 12 (24)
T ss_pred EEeeCCCCCcC
Confidence 67899999863
No 60
>KOG3516|consensus
Probab=73.67 E-value=2.9 Score=40.49 Aligned_cols=33 Identities=24% Similarity=0.544 Sum_probs=28.8
Q ss_pred CCCCCCCCCCCCeEec----ceEeCC-CCCcCCCCCCC
Q psy18122 5 TNENSIECLNGGVEMD----SMCLCT-PGFAGKKCEIP 37 (188)
Q Consensus 5 ~~~~~~~C~~~g~C~~----~~C~C~-~G~~G~~C~~~ 37 (188)
+-+.+++|.++|.|.. +.|.|. .||+|..|...
T Consensus 546 drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 546 DRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred cccCCccccCCCcccccccceeEeccccccccccccCC
Confidence 3468899999999987 999998 89999999864
No 61
>KOG3607|consensus
Probab=72.45 E-value=2.8 Score=38.85 Aligned_cols=29 Identities=34% Similarity=0.614 Sum_probs=25.7
Q ss_pred CCCCCCCCeEec-ceEeCCCCCcCCCCCCC
Q psy18122 9 SIECLNGGVEMD-SMCLCTPGFAGKKCEIP 37 (188)
Q Consensus 9 ~~~C~~~g~C~~-~~C~C~~G~~G~~C~~~ 37 (188)
+..|+.+|.|.. ..|.|.+||.+++|+..
T Consensus 629 ~~~C~g~GVCnn~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 629 PTTCNGHGVCNNELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred ccccCCCcccCCCcceeeCCCCCCCccccc
Confidence 344999999999 99999999999999874
No 62
>KOG3516|consensus
Probab=66.42 E-value=4.4 Score=39.28 Aligned_cols=38 Identities=34% Similarity=0.843 Sum_probs=32.4
Q ss_pred CCCCCccCCCCCCCCCcEecC---CCceecC-CCCCCCCCCC
Q psy18122 46 KCSMLCSKSPNQCKGMVLCTS---NFGCSCA-TGYQGEQCLE 83 (188)
Q Consensus 46 ~C~~~C~c~~~~C~~~g~C~~---~~~C~C~-~g~~G~~C~~ 83 (188)
.|...+.|.+++|+++|.|.. .+.|.|. .||.|..|..
T Consensus 541 ~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 541 MCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred ccccccccCCccccCCCcccccccceeEeccccccccccccC
Confidence 356677788999999999987 5789998 8999999985
No 63
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=59.97 E-value=15 Score=26.47 Aligned_cols=17 Identities=35% Similarity=1.013 Sum_probs=14.1
Q ss_pred CCceecCCCCCCCCCCC
Q psy18122 67 NFGCSCATGYQGEQCLE 83 (188)
Q Consensus 67 ~~~C~C~~g~~G~~C~~ 83 (188)
...|.|..||+|.+|+.
T Consensus 66 ~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 66 GMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CceeECCCCcccccccc
Confidence 36789999999999985
No 64
>PHA02887 EGF-like protein; Provisional
Probab=59.65 E-value=12 Score=26.48 Aligned_cols=8 Identities=38% Similarity=0.953 Sum_probs=4.6
Q ss_pred CCCCCeEec
Q psy18122 12 CLNGGVEMD 20 (188)
Q Consensus 12 C~~~g~C~~ 20 (188)
|. ||.|.-
T Consensus 94 Ci-HG~C~y 101 (126)
T PHA02887 94 CI-NGECMN 101 (126)
T ss_pred ee-CCEEEc
Confidence 55 467753
No 65
>KOG3607|consensus
Probab=58.24 E-value=14 Score=34.51 Aligned_cols=28 Identities=25% Similarity=0.860 Sum_probs=24.7
Q ss_pred CCCCCCcEecCCCceecCCCCCCCCCCC
Q psy18122 56 NQCKGMVLCTSNFGCSCATGYQGEQCLE 83 (188)
Q Consensus 56 ~~C~~~g~C~~~~~C~C~~g~~G~~C~~ 83 (188)
..|.++|+|.....|.|.+||.+++|+.
T Consensus 630 ~~C~g~GVCnn~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 630 TTCNGHGVCNNELNCHCEPGWAPPFCFI 657 (716)
T ss_pred cccCCCcccCCCcceeeCCCCCCCcccc
Confidence 3588999999888999999999999984
No 66
>KOG3514|consensus
Probab=54.10 E-value=8.6 Score=37.19 Aligned_cols=32 Identities=31% Similarity=0.799 Sum_probs=23.9
Q ss_pred cCCCCCCCCCcEecC---CCceecC-CCCCCCCCCC
Q psy18122 52 SKSPNQCKGMVLCTS---NFGCSCA-TGYQGEQCLE 83 (188)
Q Consensus 52 ~c~~~~C~~~g~C~~---~~~C~C~-~g~~G~~C~~ 83 (188)
.|.++||.|+|.|.+ .+.|-|. .+|.|+.|++
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 456889999999987 4567775 4677777765
No 67
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=51.36 E-value=6.5 Score=23.94 Aligned_cols=28 Identities=25% Similarity=0.717 Sum_probs=15.5
Q ss_pred CCCCCCCCeEec--------ceEeCCCCCcCCCCCC
Q psy18122 9 SIECLNGGVEMD--------SMCLCTPGFAGKKCEI 36 (188)
Q Consensus 9 ~~~C~~~g~C~~--------~~C~C~~G~~G~~C~~ 36 (188)
.-+|+.||..-. -.|.|..-|.|++|++
T Consensus 16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~ 51 (56)
T PF04863_consen 16 AISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCST 51 (56)
T ss_dssp TS--TTSEE--TTS-EETTEE--EE-TTEESTTS-E
T ss_pred cCCcCCCCeeeeccccccCCccccccCCcCCCCccc
Confidence 347888886642 4699999999999876
No 68
>KOG1388|consensus
Probab=47.67 E-value=19 Score=28.29 Aligned_cols=69 Identities=28% Similarity=0.639 Sum_probs=42.7
Q ss_pred CCCCCeEec-ceE-eCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCC-cEec-CCCceec-CCCCCCCCCCC
Q psy18122 12 CLNGGVEMD-SMC-LCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGM-VLCT-SNFGCSC-ATGYQGEQCLE 83 (188)
Q Consensus 12 C~~~g~C~~-~~C-~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~-g~C~-~~~~C~C-~~g~~G~~C~~ 83 (188)
|++++.|.. -.| .|..+-+|..|+. |..+|+|..=...|. +..|.++ ..|. ..+.|.| .-++.|..|.+
T Consensus 54 cNGh~~c~t~~v~~~~~N~~~g~~c~k-c~~g~~GdtN~g~c~--~~~~~g~~~~~~~~~~~c~c~~kgvvgd~c~~ 127 (217)
T KOG1388|consen 54 CNGHSDCNTQHVCWRCENGTTGAHCEK-CIVGFYGDTNGGKCQ--PCDCNGGASACVTLTGKCFCTTKGVVGDLCPK 127 (217)
T ss_pred hcCCCCcccceeeeeccCccccccCCc-eEEEEEecCCCCccC--HhhhcCCeeeeeccCCccccccceEecccCcc
Confidence 455555554 334 3788999999987 899999963322332 1112222 1233 3678888 66888888875
No 69
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=47.35 E-value=21 Score=24.51 Aligned_cols=23 Identities=30% Similarity=0.703 Sum_probs=17.5
Q ss_pred CCCCCCCcEecC--CCceecCCCCC
Q psy18122 55 PNQCKGMVLCTS--NFGCSCATGYQ 77 (188)
Q Consensus 55 ~~~C~~~g~C~~--~~~C~C~~g~~ 77 (188)
.+.|...+.|.. ...|.|.+||.
T Consensus 83 y~~CG~~g~C~~~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 83 YGFCGPNGICNSNNSPKCSCLPGFE 107 (110)
T ss_pred ccccCCccEeCCCCCCceECCCCcC
Confidence 357888899975 34599998885
No 70
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=42.54 E-value=8.4 Score=21.25 Aligned_cols=13 Identities=38% Similarity=1.078 Sum_probs=8.3
Q ss_pred CCceecCCCCCCC
Q psy18122 67 NFGCSCATGYQGE 79 (188)
Q Consensus 67 ~~~C~C~~g~~G~ 79 (188)
.+.|.|++||.-.
T Consensus 18 ~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 18 SYRCSCPPGYKLA 30 (36)
T ss_dssp SEEEE-STTEEE-
T ss_pred ceEeECCCCCEEC
Confidence 4679999988643
No 71
>KOG3509|consensus
Probab=37.06 E-value=44 Score=32.23 Aligned_cols=67 Identities=22% Similarity=0.520 Sum_probs=38.5
Q ss_pred ceEeCCCCCcCCCCCCCCCCCCCCC---CCC--CCccCCCCCCCCC-cEecC-CCce-ecCCCCCCCCCCCCCCCCccCC
Q psy18122 21 SMCLCTPGFAGKKCEIPCDRNNFGH---KCS--MLCSKSPNQCKGM-VLCTS-NFGC-SCATGYQGEQCLEHCPEDYYGA 92 (188)
Q Consensus 21 ~~C~C~~G~~G~~C~~~C~~g~~G~---~C~--~~C~c~~~~C~~~-g~C~~-~~~C-~C~~g~~G~~C~~~C~~g~~G~ 92 (188)
-.|+|++++.|..|+. |.+++.-. .+. ....| .|..+ ..|.. .+.| .|.....|.+|+ .|.+|++|.
T Consensus 718 ~~C~c~~g~~G~~ce~-c~e~~~ls~t~~~~~~~~~~c---~~~~h~~~c~~~~~~nt~~q~~~~~~~~~-~~~~g~~~d 792 (964)
T KOG3509|consen 718 EQCQCPKGLVGTSCED-CAEGYTLSTTGGLYPGLCEDC---ECNSHISQCEDDLGYNTDCQNNTEGDRCE-LCSPGTYGD 792 (964)
T ss_pred cccccCccccCccccc-ccccccccccCCcCcccCccc---ccCCCcccccccccccccccccCccceee-ecCCCcccc
Confidence 6899999999999987 77765311 111 11111 12221 23333 2233 456666777776 677777764
No 72
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=35.23 E-value=20 Score=19.58 Aligned_cols=10 Identities=30% Similarity=1.162 Sum_probs=7.9
Q ss_pred ceEeCCCCCc
Q psy18122 21 SMCLCTPGFA 30 (188)
Q Consensus 21 ~~C~C~~G~~ 30 (188)
++|.|+.||.
T Consensus 18 ~~C~CPeGyI 27 (34)
T PF09064_consen 18 GQCFCPEGYI 27 (34)
T ss_pred CceeCCCceE
Confidence 6888888884
No 73
>KOG3509|consensus
Probab=27.31 E-value=81 Score=30.54 Aligned_cols=23 Identities=35% Similarity=0.852 Sum_probs=19.2
Q ss_pred CCceecCCCCCCCCCCCCCCCCcc
Q psy18122 67 NFGCSCATGYQGEQCLEHCPEDYY 90 (188)
Q Consensus 67 ~~~C~C~~g~~G~~C~~~C~~g~~ 90 (188)
...|.|+.++.|..|+ .|.+++.
T Consensus 717 ~~~C~c~~g~~G~~ce-~c~e~~~ 739 (964)
T KOG3509|consen 717 VEQCQCPKGLVGTSCE-DCAEGYT 739 (964)
T ss_pred ccccccCccccCcccc-ccccccc
Confidence 4579999999999998 6877764
No 74
>PF11403 Yeast_MT: Yeast metallothionein; InterPro: IPR022710 Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=24.31 E-value=39 Score=18.32 Aligned_cols=18 Identities=44% Similarity=1.047 Sum_probs=7.5
Q ss_pred CCCCCCcEecCCCceecCCC
Q psy18122 56 NQCKGMVLCTSNFGCSCATG 75 (188)
Q Consensus 56 ~~C~~~g~C~~~~~C~C~~g 75 (188)
+.|.++-.|.. .|.|+.|
T Consensus 12 gscknneqcqk--scscptg 29 (40)
T PF11403_consen 12 GSCKNNEQCQK--SCSCPTG 29 (40)
T ss_dssp STTTT-TTSTT--S-SS-TT
T ss_pred CCccChHHHhh--cCCCCCC
Confidence 35666555543 4555544
No 75
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=23.09 E-value=92 Score=24.13 Aligned_cols=17 Identities=35% Similarity=0.942 Sum_probs=12.4
Q ss_pred EecC---CCceecCCCCCCC
Q psy18122 63 LCTS---NFGCSCATGYQGE 79 (188)
Q Consensus 63 ~C~~---~~~C~C~~g~~G~ 79 (188)
.|.. +|.|.|++||+..
T Consensus 200 ~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 200 VCISTPGSYLCACTEGYALL 219 (224)
T ss_pred eEEcCCCCEEeECCCCccCC
Confidence 4655 5779999998753
No 76
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=21.62 E-value=1e+02 Score=18.05 Aligned_cols=9 Identities=56% Similarity=1.483 Sum_probs=6.3
Q ss_pred ceecCCCCC
Q psy18122 69 GCSCATGYQ 77 (188)
Q Consensus 69 ~C~C~~g~~ 77 (188)
.|.|++||.
T Consensus 34 gC~C~~G~v 42 (55)
T PF01826_consen 34 GCFCPPGYV 42 (55)
T ss_dssp EEEETTTEE
T ss_pred cCCCCCCee
Confidence 388887764
Done!