Query psy6687
Match_columns 116
No_of_seqs 173 out of 1964
Neff 11.2
Searched_HMMs 46136
Date Sat Aug 17 00:29:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6687.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6687hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1225|consensus 99.8 7.4E-18 1.6E-22 111.1 9.4 108 1-114 235-342 (525)
2 KOG1225|consensus 99.7 1.1E-16 2.3E-21 105.7 8.6 100 1-111 266-365 (525)
3 KOG1219|consensus 99.5 3.9E-14 8.5E-19 104.3 6.3 97 17-113 3866-3977(4289)
4 KOG1226|consensus 99.5 5.3E-13 1.1E-17 90.6 9.1 112 1-115 479-622 (783)
5 KOG4289|consensus 99.3 2.8E-12 6.1E-17 92.0 3.5 84 31-114 1221-1317(2531)
6 KOG4289|consensus 99.0 2.3E-10 4.9E-15 82.7 4.1 82 1-82 1223-1317(2531)
7 KOG1219|consensus 99.0 3.7E-10 7.9E-15 84.5 4.5 81 1-81 3887-3977(4289)
8 KOG1226|consensus 98.9 8.9E-09 1.9E-13 70.7 7.5 89 22-113 469-580 (783)
9 KOG1214|consensus 98.4 1.5E-06 3.3E-11 61.0 6.3 109 2-110 718-859 (1289)
10 KOG1217|consensus 98.1 3.4E-05 7.3E-10 51.5 8.8 112 1-112 153-306 (487)
11 PF07974 EGF_2: EGF-like domai 98.0 1.2E-05 2.5E-10 33.9 3.0 25 88-112 7-32 (32)
12 KOG1217|consensus 98.0 0.00013 2.8E-09 48.8 9.3 108 2-109 254-388 (487)
13 PF07974 EGF_2: EGF-like domai 97.7 6.1E-05 1.3E-09 31.7 2.7 25 21-45 7-32 (32)
14 PF12661 hEGF: Human growth fa 97.6 2.5E-05 5.4E-10 26.1 0.4 12 101-112 2-13 (13)
15 smart00051 DSL delta serrate l 97.6 0.00019 4.1E-09 35.2 3.6 47 32-80 17-63 (63)
16 smart00051 DSL delta serrate l 97.5 0.00031 6.6E-09 34.4 3.8 44 68-112 18-63 (63)
17 PF00008 EGF: EGF-like domain 97.5 4.7E-05 1E-09 32.1 0.7 23 88-110 5-31 (32)
18 PF00008 EGF: EGF-like domain 97.1 0.00015 3.3E-09 30.5 0.3 22 21-42 5-30 (32)
19 KOG4260|consensus 96.8 0.003 6.5E-08 39.6 4.3 106 4-109 132-269 (350)
20 smart00179 EGF_CA Calcium-bind 96.8 0.0025 5.5E-08 27.6 3.1 14 99-112 24-38 (39)
21 smart00179 EGF_CA Calcium-bind 96.7 0.0026 5.6E-08 27.5 2.8 31 51-81 4-39 (39)
22 KOG0994|consensus 96.7 0.0076 1.6E-07 44.8 5.9 83 31-113 1036-1146(1758)
23 cd00054 EGF_CA Calcium-binding 96.4 0.0076 1.6E-07 25.7 3.1 14 99-112 24-37 (38)
24 cd00054 EGF_CA Calcium-binding 96.3 0.0064 1.4E-07 25.9 2.7 27 55-81 9-38 (38)
25 PF01414 DSL: Delta serrate li 96.3 0.0011 2.3E-08 32.5 0.0 46 32-80 17-63 (63)
26 KOG4260|consensus 96.1 0.011 2.4E-07 37.2 3.6 46 68-113 129-182 (350)
27 cd00053 EGF Epidermal growth f 96.0 0.012 2.6E-07 24.6 2.8 14 99-112 21-35 (36)
28 KOG1214|consensus 95.7 0.012 2.6E-07 42.5 3.0 47 31-78 808-859 (1289)
29 smart00181 EGF Epidermal growt 95.6 0.021 4.5E-07 24.0 2.6 14 99-112 20-34 (35)
30 KOG1836|consensus 94.5 0.12 2.6E-06 40.6 5.4 81 2-82 697-813 (1705)
31 PHA02887 EGF-like protein; Pro 94.4 0.05 1.1E-06 29.8 2.5 27 88-115 93-124 (126)
32 KOG1836|consensus 94.4 0.13 2.7E-06 40.5 5.3 82 33-114 696-813 (1705)
33 PF07645 EGF_CA: Calcium-bindi 94.2 0.021 4.6E-07 25.4 0.7 21 88-108 11-34 (42)
34 PF12662 cEGF: Complement Clr- 94.0 0.059 1.3E-06 21.0 1.7 10 32-41 2-11 (24)
35 KOG0994|consensus 93.7 0.3 6.6E-06 37.1 5.8 56 26-81 1077-1146(1758)
36 cd00055 EGF_Lam Laminin-type e 93.3 0.14 2.9E-06 23.7 2.6 15 99-113 19-33 (50)
37 PHA03099 epidermal growth fact 93.0 0.11 2.4E-06 29.0 2.2 17 98-114 66-82 (139)
38 PF12947 EGF_3: EGF domain; I 92.3 0.034 7.4E-07 24.0 -0.2 23 88-110 7-32 (36)
39 smart00180 EGF_Lam Laminin-typ 91.1 0.24 5.2E-06 22.5 1.9 15 99-113 18-32 (46)
40 PF01683 EB: EB module; Inter 88.7 1.4 2.9E-05 20.4 3.5 20 88-108 27-46 (52)
41 KOG3607|consensus 88.6 0.47 1E-05 34.4 2.6 28 87-114 630-657 (716)
42 PF00053 Laminin_EGF: Laminin 88.0 0.18 4E-06 23.1 0.3 16 98-113 17-32 (49)
43 PF09064 Tme5_EGF_like: Thromb 87.9 0.75 1.6E-05 19.5 2.0 10 99-108 18-27 (34)
44 KOG3607|consensus 83.4 1.3 2.8E-05 32.3 2.7 30 54-83 629-658 (716)
45 KOG1218|consensus 79.3 14 0.0003 23.7 8.8 15 99-113 162-176 (316)
46 KOG1218|consensus 78.1 15 0.00033 23.6 9.2 64 33-96 125-195 (316)
47 KOG3514|consensus 74.5 2.5 5.5E-05 32.3 2.0 31 84-114 625-660 (1591)
48 PF12955 DUF3844: Domain of un 66.5 5.2 0.00011 21.7 1.7 10 56-65 14-23 (103)
49 PF00954 S_locus_glycop: S-loc 63.3 16 0.00035 19.7 3.3 24 87-110 84-109 (110)
50 KOG3516|consensus 58.2 12 0.00025 29.2 2.7 28 87-114 551-582 (1306)
51 PF06247 Plasmod_Pvs28: Plasmo 51.9 0.34 7.3E-06 29.1 -4.7 106 2-108 22-160 (197)
52 PF14670 FXa_inhibition: Coagu 51.9 4.8 0.0001 17.3 -0.0 11 98-108 18-28 (36)
53 PF04863 EGF_alliinase: Alliin 49.0 4 8.6E-05 19.4 -0.5 14 100-113 37-50 (56)
54 KOG3516|consensus 48.7 19 0.0004 28.3 2.5 35 49-83 545-583 (1306)
55 KOG3512|consensus 48.3 17 0.00038 25.4 2.1 24 90-113 404-428 (592)
56 KOG3514|consensus 31.0 52 0.0011 26.0 2.4 32 51-82 625-660 (1591)
57 PF05294 Toxin_5: Scorpion sho 21.3 76 0.0017 13.2 1.1 7 90-96 20-26 (32)
58 KOG3509|consensus 20.2 3.6E+02 0.0078 21.4 5.0 29 55-83 412-443 (964)
No 1
>KOG1225|consensus
Probab=99.76 E-value=7.4e-18 Score=111.09 Aligned_cols=108 Identities=40% Similarity=1.039 Sum_probs=85.4
Q ss_pred CccCCCCCcCCCCCCCCCCCCCCCCcEecCCCceeCCCCCccCCCCCCCCCCCCCCCCCCceeCCCCceecCCCCcCCCC
Q psy6687 1 MCQCPKGYQGTYCGTAMCFPQCLNNGTCTAPGVCSCPPGFQGLHCEGGPGPGCDQKCSNGGWCDSQQMCQCPKGYQGTYC 80 (116)
Q Consensus 1 ~C~C~~g~~g~~c~~~~c~~~c~~~g~C~~~~~C~c~~g~~g~~c~~~~~~~~~~~c~~~g~c~~~~~c~C~~g~~g~~c 80 (116)
+|.|..+|+|.+|+...|...|.+.+.|+. ++|+|++||+|.+|+... ++..|+.++.+.. +.|+|+++|.|..|
T Consensus 235 ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~-G~CIC~~Gf~G~dC~e~~---Cp~~cs~~g~~~~-g~CiC~~g~~G~dC 309 (525)
T KOG1225|consen 235 ICECPEGYFGPLCSTIYCPGGCTGRGQCVE-GRCICPPGFTGDDCDELV---CPVDCSGGGVCVD-GECICNPGYSGKDC 309 (525)
T ss_pred eeecCCceeCCccccccCCCCCcccceEeC-CeEeCCCCCcCCCCCccc---CCcccCCCceecC-CEeecCCCcccccc
Confidence 367788888888887778777887788875 788888888888887533 4445777777776 58888888888888
Q ss_pred CCCCCCCCCCCCcEecCCCceeCCCCCcCCCCCC
Q psy6687 81 GTAMCFPQCLNNGTCTAPGVCSCPPGFQGLHCEG 114 (116)
Q Consensus 81 ~~~~c~~~c~~~g~c~~~~~C~C~~g~~g~~C~~ 114 (116)
++..|+..|+++|.|+ .++|.|.+||+|..|+.
T Consensus 310 s~~~cpadC~g~G~Ci-~G~C~C~~Gy~G~~C~~ 342 (525)
T KOG1225|consen 310 SIRRCPADCSGHGKCI-DGECLCDEGYTGELCIQ 342 (525)
T ss_pred ccccCCccCCCCCccc-CCceEeCCCCcCCcccc
Confidence 8777778888888888 57888888888888865
No 2
>KOG1225|consensus
Probab=99.70 E-value=1.1e-16 Score=105.72 Aligned_cols=100 Identities=36% Similarity=0.993 Sum_probs=89.2
Q ss_pred CccCCCCCcCCCCCCCCCCCCCCCCcEecCCCceeCCCCCccCCCCCCCCCCCCCCCCCCceeCCCCceecCCCCcCCCC
Q psy6687 1 MCQCPKGYQGTYCGTAMCFPQCLNNGTCTAPGVCSCPPGFQGLHCEGGPGPGCDQKCSNGGWCDSQQMCQCPKGYQGTYC 80 (116)
Q Consensus 1 ~C~C~~g~~g~~c~~~~c~~~c~~~g~C~~~~~C~c~~g~~g~~c~~~~~~~~~~~c~~~g~c~~~~~c~C~~g~~g~~c 80 (116)
+|.|++||+|.+|+...|...|+.++.+++ +.|.|+++|.|..|+... ++..|.++|.|+ .+.|.|.+||+|..|
T Consensus 266 ~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~-g~CiC~~g~~G~dCs~~~---cpadC~g~G~Ci-~G~C~C~~Gy~G~~C 340 (525)
T KOG1225|consen 266 RCICPPGFTGDDCDELVCPVDCSGGGVCVD-GECICNPGYSGKDCSIRR---CPADCSGHGKCI-DGECLCDEGYTGELC 340 (525)
T ss_pred eEeCCCCCcCCCCCcccCCcccCCCceecC-CEeecCCCcccccccccc---CCccCCCCCccc-CCceEeCCCCcCCcc
Confidence 489999999999999889888999898886 699999999999998554 567899999999 489999999999999
Q ss_pred CCCCCCCCCCCCcEecCCCceeCCCCCcCCC
Q psy6687 81 GTAMCFPQCLNNGTCTAPGVCSCPPGFQGLH 111 (116)
Q Consensus 81 ~~~~c~~~c~~~g~c~~~~~C~C~~g~~g~~ 111 (116)
... . |.+++.|++ + |.|..||.|++
T Consensus 341 ~~~---~-C~~~g~cv~-g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 341 IQR---A-CSGGGQCVN-G-CKCKKGWRGPD 365 (525)
T ss_pred ccc---c-cCCCceecc-C-ceeccCccCCC
Confidence 977 3 889999997 5 99999999987
No 3
>KOG1219|consensus
Probab=99.51 E-value=3.9e-14 Score=104.26 Aligned_cols=97 Identities=35% Similarity=0.864 Sum_probs=84.9
Q ss_pred CCCC-CCCCCcEecC----CCceeCCCCCccCCCCCCCCCCCCCCCCCCceeCC---CCceecCCCCcCCCCCCC---CC
Q psy6687 17 MCFP-QCLNNGTCTA----PGVCSCPPGFQGLHCEGGPGPGCDQKCSNGGWCDS---QQMCQCPKGYQGTYCGTA---MC 85 (116)
Q Consensus 17 ~c~~-~c~~~g~C~~----~~~C~c~~g~~g~~c~~~~~~~~~~~c~~~g~c~~---~~~c~C~~g~~g~~c~~~---~c 85 (116)
.|.. +|+++|.|.. .|.|+|++.|.|..|+.+..+|...||..+|+|.. .+.|.|+.+|+|.+|+.. +|
T Consensus 3866 ~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eC 3945 (4289)
T KOG1219|consen 3866 PCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISEC 3945 (4289)
T ss_pred ccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeeccccccc
Confidence 4543 6999999975 58999999999999999999999999999999975 368999999999999854 57
Q ss_pred C-CCCCCCcEecC---CCceeCCCCCcCCCCC
Q psy6687 86 F-PQCLNNGTCTA---PGVCSCPPGFQGLHCE 113 (116)
Q Consensus 86 ~-~~c~~~g~c~~---~~~C~C~~g~~g~~C~ 113 (116)
. ++|.++|.|.+ ++.|.|.+||.|..|.
T Consensus 3946 s~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3946 SKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred ccccccCCceeeccCCceEeccChhHhcccCc
Confidence 7 78999999986 3789999999999874
No 4
>KOG1226|consensus
Probab=99.47 E-value=5.3e-13 Score=90.62 Aligned_cols=112 Identities=30% Similarity=0.792 Sum_probs=86.7
Q ss_pred CccCCCCCcCCCCCCC-----------CCCC-----CCCCCcEecCCCceeCCCCCc----cCCCCCCCCCCCCC---CC
Q psy6687 1 MCQCPKGYQGTYCGTA-----------MCFP-----QCLNNGTCTAPGVCSCPPGFQ----GLHCEGGPGPGCDQ---KC 57 (116)
Q Consensus 1 ~C~C~~g~~g~~c~~~-----------~c~~-----~c~~~g~C~~~~~C~c~~g~~----g~~c~~~~~~~~~~---~c 57 (116)
+|.|.+||.|..|+.+ .|.. .|++.|.|.- ++|.|.+... |++|+-+.-.|... .|
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC 557 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDNFSCERHKGVLC 557 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccCcccccccCccc
Confidence 5899999999988642 1322 4899999986 8999988776 89998655444332 48
Q ss_pred CCCceeCCCCceecCCCCcCCCCCC----CCCC----CCCCCCcEecCCCceeCCCC-CcCCCCCCC
Q psy6687 58 SNGGWCDSQQMCQCPKGYQGTYCGT----AMCF----PQCLNNGTCTAPGVCSCPPG-FQGLHCEGG 115 (116)
Q Consensus 58 ~~~g~c~~~~~c~C~~g~~g~~c~~----~~c~----~~c~~~g~c~~~~~C~C~~g-~~g~~C~~~ 115 (116)
..+|.|.. ++|+|.+||+|..|+- +.|. ..|+.+|+|.- ++|.|... |.|+.||+.
T Consensus 558 ~g~G~C~C-G~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~C-g~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 558 GGHGRCEC-GRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCEC-GRCKCTDPPYSGEFCEKC 622 (783)
T ss_pred CCCCeEeC-CcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeC-CceEcCCCCcCcchhhcC
Confidence 89999987 8999999999999983 2344 35888888886 78999765 999999864
No 5
>KOG4289|consensus
Probab=99.28 E-value=2.8e-12 Score=92.04 Aligned_cols=84 Identities=38% Similarity=0.863 Sum_probs=71.9
Q ss_pred CCceeCCCCCccCCCCCCCCCCCCCCCCCCceeCC---CCceecCCCCcCCCCCCC----CCC-CCCCCCcEecC----C
Q psy6687 31 PGVCSCPPGFQGLHCEGGPGPGCDQKCSNGGWCDS---QQMCQCPKGYQGTYCGTA----MCF-PQCLNNGTCTA----P 98 (116)
Q Consensus 31 ~~~C~c~~g~~g~~c~~~~~~~~~~~c~~~g~c~~---~~~c~C~~g~~g~~c~~~----~c~-~~c~~~g~c~~----~ 98 (116)
.++|.|++||+|++|+..++.|-..+|.++|.|.. .++|.|.++|+|.+|+++ -|. ..|.++|+|++ .
T Consensus 1221 glrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ngg 1300 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGG 1300 (2531)
T ss_pred ceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCc
Confidence 46899999999999999999999999999999975 379999999999999964 244 67999999984 2
Q ss_pred CceeCCCC-CcCCCCCC
Q psy6687 99 GVCSCPPG-FQGLHCEG 114 (116)
Q Consensus 99 ~~C~C~~g-~~g~~C~~ 114 (116)
+.|.|++| |.+++|+.
T Consensus 1301 f~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1301 FCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred eeccCCCcccCCCceEE
Confidence 68999986 67888874
No 6
>KOG4289|consensus
Probab=99.04 E-value=2.3e-10 Score=82.72 Aligned_cols=82 Identities=41% Similarity=1.032 Sum_probs=66.6
Q ss_pred CccCCCCCcCCCCCC--CCC-CCCCCCCcEecC---CCceeCCCCCccCCCCCC--CCCCCCCCCCCCceeCCC----Cc
Q psy6687 1 MCQCPKGYQGTYCGT--AMC-FPQCLNNGTCTA---PGVCSCPPGFQGLHCEGG--PGPGCDQKCSNGGWCDSQ----QM 68 (116)
Q Consensus 1 ~C~C~~g~~g~~c~~--~~c-~~~c~~~g~C~~---~~~C~c~~g~~g~~c~~~--~~~~~~~~c~~~g~c~~~----~~ 68 (116)
.|.|++||+|+.|+. +.| ..+|.++|+|.. .|.|.|.++|+|.+|+.+ ...|.+..|.++|.|.+. +.
T Consensus 1223 rCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~ 1302 (2531)
T KOG4289|consen 1223 RCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFC 1302 (2531)
T ss_pred eEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCcee
Confidence 489999999999975 457 447999999975 689999999999999843 345666779999999763 57
Q ss_pred eecCCC-CcCCCCCC
Q psy6687 69 CQCPKG-YQGTYCGT 82 (116)
Q Consensus 69 c~C~~g-~~g~~c~~ 82 (116)
|.|+.| |.++.|+.
T Consensus 1303 c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1303 CHCPYGEFEDPRCEV 1317 (2531)
T ss_pred ccCCCcccCCCceEE
Confidence 899976 67888874
No 7
>KOG1219|consensus
Probab=99.02 E-value=3.7e-10 Score=84.45 Aligned_cols=81 Identities=35% Similarity=0.885 Sum_probs=71.0
Q ss_pred CccCCCCCcCCCCCCC--CCCC-CCCCCcEecC---CCceeCCCCCccCCCCCC-CCCCCCCCCCCCceeCCC---Ccee
Q psy6687 1 MCQCPKGYQGTYCGTA--MCFP-QCLNNGTCTA---PGVCSCPPGFQGLHCEGG-PGPGCDQKCSNGGWCDSQ---QMCQ 70 (116)
Q Consensus 1 ~C~C~~g~~g~~c~~~--~c~~-~c~~~g~C~~---~~~C~c~~g~~g~~c~~~-~~~~~~~~c~~~g~c~~~---~~c~ 70 (116)
+|.|+..|.|.+||.. .|.. ||..+|+|.. .+.|.|+.+|+|..|++. +.+|...+|.++|.|.+. +.|.
T Consensus 3887 ~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~Cn 3966 (4289)
T KOG1219|consen 3887 KCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCN 3966 (4289)
T ss_pred EEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCCceeeccCCceEec
Confidence 5899999999999864 4754 7999999974 688999999999999987 788988999999999874 6899
Q ss_pred cCCCCcCCCCC
Q psy6687 71 CPKGYQGTYCG 81 (116)
Q Consensus 71 C~~g~~g~~c~ 81 (116)
|.++|.|..|.
T Consensus 3967 cT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3967 CTPGILGRTCC 3977 (4289)
T ss_pred cChhHhcccCc
Confidence 99999998875
No 8
>KOG1226|consensus
Probab=98.90 E-value=8.9e-09 Score=70.69 Aligned_cols=89 Identities=35% Similarity=0.834 Sum_probs=67.7
Q ss_pred CCCCcEecCCCceeCCCCCccCCCCCCCC---------CCC----CCCCCCCceeCCCCceecCCCCc----CCCCCCC-
Q psy6687 22 CLNNGTCTAPGVCSCPPGFQGLHCEGGPG---------PGC----DQKCSNGGWCDSQQMCQCPKGYQ----GTYCGTA- 83 (116)
Q Consensus 22 c~~~g~C~~~~~C~c~~g~~g~~c~~~~~---------~~~----~~~c~~~g~c~~~~~c~C~~g~~----g~~c~~~- 83 (116)
|+.+|+.+. +.|.|.+||.|+.|+-..+ .|. ..+|++.|.|.- ++|+|.+... |.+|+-+
T Consensus 469 C~g~G~~~C-G~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDn 546 (783)
T KOG1226|consen 469 CHGNGTFVC-GQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDN 546 (783)
T ss_pred cCCCCcEEe-cceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccC
Confidence 766676664 7999999999999973321 111 125888888877 7999987665 8888743
Q ss_pred -CCC----CCCCCCcEecCCCceeCCCCCcCCCCC
Q psy6687 84 -MCF----PQCLNNGTCTAPGVCSCPPGFQGLHCE 113 (116)
Q Consensus 84 -~c~----~~c~~~g~c~~~~~C~C~~g~~g~~C~ 113 (116)
.|. ..|.++|.|.. ++|.|.+||+|..|+
T Consensus 547 fsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~ 580 (783)
T KOG1226|consen 547 FSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACN 580 (783)
T ss_pred cccccccCcccCCCCeEeC-CcEEcCCCCccCCCC
Confidence 344 46999999986 899999999999986
No 9
>KOG1214|consensus
Probab=98.37 E-value=1.5e-06 Score=60.99 Aligned_cols=109 Identities=30% Similarity=0.741 Sum_probs=67.4
Q ss_pred ccCCCCCcCC--CCCC-C---CCCCCCCCCcEecC---CCceeCCCCCc--c--CCCCCCC-----CCCCC--CCCCCCc
Q psy6687 2 CQCPKGYQGT--YCGT-A---MCFPQCLNNGTCTA---PGVCSCPPGFQ--G--LHCEGGP-----GPGCD--QKCSNGG 61 (116)
Q Consensus 2 C~C~~g~~g~--~c~~-~---~c~~~c~~~g~C~~---~~~C~c~~g~~--g--~~c~~~~-----~~~~~--~~c~~~g 61 (116)
|.|..+|.|+ .|.. . .+...|..+.+|++ +++|.|..+|. + -.|.... ..|.. ..|...|
T Consensus 718 cecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g 797 (1289)
T KOG1214|consen 718 CECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAG 797 (1289)
T ss_pred EEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCC
Confidence 5566667654 4532 2 24445888889976 56777776653 3 3443221 11211 1244334
Q ss_pred ee--CC----CCceecCCCCcCC--CC-CCCCCC-CCCCCCcEecC---CCceeCCCCCcCC
Q psy6687 62 WC--DS----QQMCQCPKGYQGT--YC-GTAMCF-PQCLNNGTCTA---PGVCSCPPGFQGL 110 (116)
Q Consensus 62 ~c--~~----~~~c~C~~g~~g~--~c-~~~~c~-~~c~~~g~c~~---~~~C~C~~g~~g~ 110 (116)
.+ +. .+.|.|.|||.|+ .| +.++|. +.|...++|++ ++.|.|.+||.|.
T Consensus 798 ~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 798 QARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred ceEEEecCCceEEEeecCCccCCccccccccccCccccCCCceEecCCCcceeecccCccCC
Confidence 33 21 2789999999875 33 356776 67999999985 4899999999865
No 10
>KOG1217|consensus
Probab=98.13 E-value=3.4e-05 Score=51.55 Aligned_cols=112 Identities=42% Similarity=1.039 Sum_probs=76.3
Q ss_pred CccCCCCCcCCCCCCC--CCCC---CCCCCcEecC---CCceeCCCCCccCCCCCC-------------------CCCCC
Q psy6687 1 MCQCPKGYQGTYCGTA--MCFP---QCLNNGTCTA---PGVCSCPPGFQGLHCEGG-------------------PGPGC 53 (116)
Q Consensus 1 ~C~C~~g~~g~~c~~~--~c~~---~c~~~g~C~~---~~~C~c~~g~~g~~c~~~-------------------~~~~~ 53 (116)
.|.|..+|.+..+... .|.. .|.+.+.|.+ .+.|.|.++|.+..++.. ...+.
T Consensus 153 ~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~ 232 (487)
T KOG1217|consen 153 RCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECE 232 (487)
T ss_pred eeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcc
Confidence 3688999999988753 5652 3778888875 367999999998887643 10111
Q ss_pred CC--CCCCC-ceeCCC---CceecCCCCcCCCC----CCCCCC-C-CCCCCcEecCC---CceeCCCCCcCCCC
Q psy6687 54 DQ--KCSNG-GWCDSQ---QMCQCPKGYQGTYC----GTAMCF-P-QCLNNGTCTAP---GVCSCPPGFQGLHC 112 (116)
Q Consensus 54 ~~--~c~~~-g~c~~~---~~c~C~~g~~g~~c----~~~~c~-~-~c~~~g~c~~~---~~C~C~~g~~g~~C 112 (116)
.. .+... +.|... +.|.++++|.+..+ .++.|. . .|.++++|... +.|.|+++|+|..+
T Consensus 233 ~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 233 VSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred cccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence 10 12211 555442 57888999998873 345565 2 37888899853 78999999999887
No 11
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=98.00 E-value=1.2e-05 Score=33.94 Aligned_cols=25 Identities=44% Similarity=1.233 Sum_probs=16.5
Q ss_pred CCCCCcEecCC-CceeCCCCCcCCCC
Q psy6687 88 QCLNNGTCTAP-GVCSCPPGFQGLHC 112 (116)
Q Consensus 88 ~c~~~g~c~~~-~~C~C~~g~~g~~C 112 (116)
.|+++|+|+.. ++|.|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 46666777643 67777777777665
No 12
>KOG1217|consensus
Probab=97.98 E-value=0.00013 Score=48.80 Aligned_cols=108 Identities=40% Similarity=1.011 Sum_probs=74.0
Q ss_pred ccCCCCCcCCCC----CCCCCCC--CCCCCcEecC---CCceeCCCCCccCCC-C-CCCCCC----CCCCCCCCceeCC-
Q psy6687 2 CQCPKGYQGTYC----GTAMCFP--QCLNNGTCTA---PGVCSCPPGFQGLHC-E-GGPGPG----CDQKCSNGGWCDS- 65 (116)
Q Consensus 2 C~C~~g~~g~~c----~~~~c~~--~c~~~g~C~~---~~~C~c~~g~~g~~c-~-~~~~~~----~~~~c~~~g~c~~- 65 (116)
|.|.+||.+..+ ....|.. .|.++++|.. .+.|.|+++|.|..+ . .....+ ...+|.+++.|..
T Consensus 254 C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~ 333 (487)
T KOG1217|consen 254 CRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTL 333 (487)
T ss_pred eeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccC
Confidence 667788887763 2344543 2777888876 378999999999988 1 122333 2334666666622
Q ss_pred ----CCceecCCCCcCCCCCCC--CCC-CCCCCCcEecC----CCceeCCCCCcC
Q psy6687 66 ----QQMCQCPKGYQGTYCGTA--MCF-PQCLNNGTCTA----PGVCSCPPGFQG 109 (116)
Q Consensus 66 ----~~~c~C~~g~~g~~c~~~--~c~-~~c~~~g~c~~----~~~C~C~~g~~g 109 (116)
.+.|.+..+|.|..|+.. .|. .++..++.|.+ .+.|.++.+|.+
T Consensus 334 ~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 334 GSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred CCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence 146899999999999855 576 44777788875 368899988776
No 13
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.70 E-value=6.1e-05 Score=31.75 Aligned_cols=25 Identities=44% Similarity=1.233 Sum_probs=20.2
Q ss_pred CCCCCcEecCC-CceeCCCCCccCCC
Q psy6687 21 QCLNNGTCTAP-GVCSCPPGFQGLHC 45 (116)
Q Consensus 21 ~c~~~g~C~~~-~~C~c~~g~~g~~c 45 (116)
.|+++|+|+.. ++|.|.++|+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 47888999854 78999999988765
No 14
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.56 E-value=2.5e-05 Score=26.13 Aligned_cols=12 Identities=58% Similarity=1.818 Sum_probs=6.7
Q ss_pred eeCCCCCcCCCC
Q psy6687 101 CSCPPGFQGLHC 112 (116)
Q Consensus 101 C~C~~g~~g~~C 112 (116)
|.|++||+|.+|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 556666666654
No 15
>smart00051 DSL delta serrate ligand.
Probab=97.55 E-value=0.00019 Score=35.16 Aligned_cols=47 Identities=21% Similarity=0.395 Sum_probs=27.8
Q ss_pred CceeCCCCCccCCCCCCCCCCCCCCCCCCceeCCCCceecCCCCcCCCC
Q psy6687 32 GVCSCPPGFQGLHCEGGPGPGCDQKCSNGGWCDSQQMCQCPKGYQGTYC 80 (116)
Q Consensus 32 ~~C~c~~g~~g~~c~~~~~~~~~~~c~~~g~c~~~~~c~C~~g~~g~~c 80 (116)
+.-.|.++|.|..|+..-. .......+..|...+.++|.+||.|+.|
T Consensus 17 ~rv~C~~~~yG~~C~~~C~--~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNKFCR--PRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCCEeC--cCccccCCccCCcCCCEecCCCCcCCCC
Confidence 3446777888887763211 0011334556655567888888888765
No 16
>smart00051 DSL delta serrate ligand.
Probab=97.48 E-value=0.00031 Score=34.42 Aligned_cols=44 Identities=27% Similarity=0.628 Sum_probs=33.2
Q ss_pred ceecCCCCcCCCCCCCCCC--CCCCCCcEecCCCceeCCCCCcCCCC
Q psy6687 68 MCQCPKGYQGTYCGTAMCF--PQCLNNGTCTAPGVCSCPPGFQGLHC 112 (116)
Q Consensus 68 ~c~C~~g~~g~~c~~~~c~--~~c~~~g~c~~~~~C~C~~g~~g~~C 112 (116)
.-.|.++|.|..|+.. |. .....+.+|...+.+.|.+||+|+.|
T Consensus 18 rv~C~~~~yG~~C~~~-C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCNKF-CRPRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccCCE-eCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence 4578899999999742 32 23456677776688999999999986
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.46 E-value=4.7e-05 Score=32.12 Aligned_cols=23 Identities=48% Similarity=1.196 Sum_probs=14.3
Q ss_pred CCCCCcEecC----CCceeCCCCCcCC
Q psy6687 88 QCLNNGTCTA----PGVCSCPPGFQGL 110 (116)
Q Consensus 88 ~c~~~g~c~~----~~~C~C~~g~~g~ 110 (116)
+|.++|+|+. .+.|.|++||+|+
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5666666652 2567777777665
No 18
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.09 E-value=0.00015 Score=30.52 Aligned_cols=22 Identities=50% Similarity=1.281 Sum_probs=10.5
Q ss_pred CCCCCcEecC----CCceeCCCCCcc
Q psy6687 21 QCLNNGTCTA----PGVCSCPPGFQG 42 (116)
Q Consensus 21 ~c~~~g~C~~----~~~C~c~~g~~g 42 (116)
+|+++|+|+. .+.|.|+++|+|
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 3455555542 234555555544
No 19
>KOG4260|consensus
Probab=96.83 E-value=0.003 Score=39.60 Aligned_cols=106 Identities=26% Similarity=0.649 Sum_probs=59.8
Q ss_pred CCCCCcCCCCCCCC--CCCCCCCCcEecC------CCceeCCCCCccCCCCCCCC-------CCCCC---CCCC--Ccee
Q psy6687 4 CPKGYQGTYCGTAM--CFPQCLNNGTCTA------PGVCSCPPGFQGLHCEGGPG-------PGCDQ---KCSN--GGWC 63 (116)
Q Consensus 4 C~~g~~g~~c~~~~--c~~~c~~~g~C~~------~~~C~c~~g~~g~~c~~~~~-------~~~~~---~c~~--~g~c 63 (116)
|+.|-+|.+|..-+ -..+|.++|.|.. ++.|.|.+||+|+.|..-.. .-... .|+. .+.|
T Consensus 132 Cp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~C 211 (350)
T KOG4260|consen 132 CPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVC 211 (350)
T ss_pred cCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhccc
Confidence 67788888875311 1124777777753 57999999999999862100 00000 1111 1233
Q ss_pred CCC--Cce-ecCCCCcCCC--C-CCCCCC---CCCCCCcEecC---CCceeCCCCCcC
Q psy6687 64 DSQ--QMC-QCPKGYQGTY--C-GTAMCF---PQCLNNGTCTA---PGVCSCPPGFQG 109 (116)
Q Consensus 64 ~~~--~~c-~C~~g~~g~~--c-~~~~c~---~~c~~~g~c~~---~~~C~C~~g~~g 109 (116)
... ..| .|..||.-+. | ++++|. .+|..+-.|++ +|.|...+||.+
T Consensus 212 sg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 212 SGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred CCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 321 234 4778886432 2 244453 45666666764 478888888865
No 20
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.82 E-value=0.0025 Score=27.57 Aligned_cols=14 Identities=50% Similarity=1.455 Sum_probs=7.1
Q ss_pred CceeCCCCCc-CCCC
Q psy6687 99 GVCSCPPGFQ-GLHC 112 (116)
Q Consensus 99 ~~C~C~~g~~-g~~C 112 (116)
+.|.|++||. |..|
T Consensus 24 ~~C~C~~g~~~g~~C 38 (39)
T smart00179 24 YRCECPPGYTDGRNC 38 (39)
T ss_pred eEeECCCCCccCCcC
Confidence 4455555555 4444
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.74 E-value=0.0026 Score=27.54 Aligned_cols=31 Identities=39% Similarity=0.985 Sum_probs=22.4
Q ss_pred CCCC-CCCCCCceeCCC---CceecCCCCc-CCCCC
Q psy6687 51 PGCD-QKCSNGGWCDSQ---QMCQCPKGYQ-GTYCG 81 (116)
Q Consensus 51 ~~~~-~~c~~~g~c~~~---~~c~C~~g~~-g~~c~ 81 (116)
+|.. .+|.+++.|.+. +.|.|+++|. |..|+
T Consensus 4 ~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 4 ECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 3444 578777888753 6799999998 77663
No 22
>KOG0994|consensus
Probab=96.66 E-value=0.0076 Score=44.83 Aligned_cols=83 Identities=34% Similarity=0.865 Sum_probs=48.4
Q ss_pred CCceeCCCCCccCCCCCCC---------CCCCCCCCCC--CceeCC-CCceecCCCCcCCCCCCC----------CCC-C
Q psy6687 31 PGVCSCPPGFQGLHCEGGP---------GPGCDQKCSN--GGWCDS-QQMCQCPKGYQGTYCGTA----------MCF-P 87 (116)
Q Consensus 31 ~~~C~c~~g~~g~~c~~~~---------~~~~~~~c~~--~g~c~~-~~~c~C~~g~~g~~c~~~----------~c~-~ 87 (116)
+++|.|.+...|..|+.-. ..|.+-.|.. ...|.. +++|.|.|||.|..|+.- .|. -
T Consensus 1036 tGQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftGQCqCkpGfGGR~C~qCqel~WGdP~~~C~aC 1115 (1758)
T KOG0994|consen 1036 TGQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTGQCQCKPGFGGRTCSQCQELYWGDPNEKCRAC 1115 (1758)
T ss_pred cCcCCCCcccccccccccccchhccccCCCCCccCCCccCCccccccccceeccCCCCCcchhHHHHhhcCCCCCCceec
Confidence 6788898888888876211 0111111222 123433 368999999999887631 111 1
Q ss_pred CCCCCc----Eec-CCCceeCCCCCcCCCCC
Q psy6687 88 QCLNNG----TCT-APGVCSCPPGFQGLHCE 113 (116)
Q Consensus 88 ~c~~~g----~c~-~~~~C~C~~g~~g~~C~ 113 (116)
.|...| .|. .+++|.|.+|..|++|.
T Consensus 1116 dCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1116 DCDPRGIETPQCDRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred CCCCCCCCCCCccccCCceeecCCCCCcchh
Confidence 122222 232 24799999999999885
No 23
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.36 E-value=0.0076 Score=25.67 Aligned_cols=14 Identities=57% Similarity=1.502 Sum_probs=7.0
Q ss_pred CceeCCCCCcCCCC
Q psy6687 99 GVCSCPPGFQGLHC 112 (116)
Q Consensus 99 ~~C~C~~g~~g~~C 112 (116)
+.|.|+++|.|..|
T Consensus 24 ~~C~C~~g~~g~~C 37 (38)
T cd00054 24 YRCSCPPGYTGRNC 37 (38)
T ss_pred eEeECCCCCcCCcC
Confidence 34555555555444
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.33 E-value=0.0064 Score=25.91 Aligned_cols=27 Identities=44% Similarity=1.131 Sum_probs=20.3
Q ss_pred CCCCCCceeCCC---CceecCCCCcCCCCC
Q psy6687 55 QKCSNGGWCDSQ---QMCQCPKGYQGTYCG 81 (116)
Q Consensus 55 ~~c~~~g~c~~~---~~c~C~~g~~g~~c~ 81 (116)
.+|.+++.|.+. +.|.|+++|.|..|+
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 467777778653 679999999987663
No 25
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=96.28 E-value=0.0011 Score=32.50 Aligned_cols=46 Identities=22% Similarity=0.367 Sum_probs=18.8
Q ss_pred CceeCCCCCccCCCCCCCCCCCCC-CCCCCceeCCCCceecCCCCcCCCC
Q psy6687 32 GVCSCPPGFQGLHCEGGPGPGCDQ-KCSNGGWCDSQQMCQCPKGYQGTYC 80 (116)
Q Consensus 32 ~~C~c~~g~~g~~c~~~~~~~~~~-~c~~~g~c~~~~~c~C~~g~~g~~c 80 (116)
++..|.+.|.|+.|+.. |.+. .-..+-.|...+.-+|.+||.|+.|
T Consensus 17 ~rv~C~~nyyG~~C~~~---C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSKF---CKPRDDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-EE------EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccCC---cCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 45667777888777621 1111 0122335655566778888887765
No 26
>KOG4260|consensus
Probab=96.06 E-value=0.011 Score=37.16 Aligned_cols=46 Identities=33% Similarity=0.821 Sum_probs=31.2
Q ss_pred ceecCCCCcCCCCCCC--CCCCCCCCCcEecC------CCceeCCCCCcCCCCC
Q psy6687 68 MCQCPKGYQGTYCGTA--MCFPQCLNNGTCTA------PGVCSCPPGFQGLHCE 113 (116)
Q Consensus 68 ~c~C~~g~~g~~c~~~--~c~~~c~~~g~c~~------~~~C~C~~g~~g~~C~ 113 (116)
.--|+++..|+.|..- ....+|..+|.|.. ++.|.|.+||+|+.|.
T Consensus 129 kvCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~ 182 (350)
T KOG4260|consen 129 KVCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR 182 (350)
T ss_pred eeccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc
Confidence 3347788888887631 11245777777763 4789999999998874
No 27
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.04 E-value=0.012 Score=24.61 Aligned_cols=14 Identities=50% Similarity=1.350 Sum_probs=8.7
Q ss_pred CceeCCCCCcCC-CC
Q psy6687 99 GVCSCPPGFQGL-HC 112 (116)
Q Consensus 99 ~~C~C~~g~~g~-~C 112 (116)
+.|.|+.||.|. .|
T Consensus 21 ~~C~C~~g~~g~~~C 35 (36)
T cd00053 21 YRCVCPPGYTGDRSC 35 (36)
T ss_pred eEeECCCCCcccCCc
Confidence 566666666665 44
No 28
>KOG1214|consensus
Probab=95.72 E-value=0.012 Score=42.53 Aligned_cols=47 Identities=30% Similarity=0.782 Sum_probs=38.7
Q ss_pred CCceeCCCCCcc--CCCCCCCCCCCCCCCCCCceeCCC---CceecCCCCcCC
Q psy6687 31 PGVCSCPPGFQG--LHCEGGPGPGCDQKCSNGGWCDSQ---QMCQCPKGYQGT 78 (116)
Q Consensus 31 ~~~C~c~~g~~g--~~c~~~~~~~~~~~c~~~g~c~~~---~~c~C~~g~~g~ 78 (116)
+|.|.|-+||.| ..|. +.++|....|.....|.++ +.|.|.+||.|+
T Consensus 808 ~y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 808 TYSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred eEEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCccCC
Confidence 589999999985 5665 4588888889999999864 789999999865
No 29
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.62 E-value=0.021 Score=24.04 Aligned_cols=14 Identities=57% Similarity=1.538 Sum_probs=9.1
Q ss_pred CceeCCCCCcC-CCC
Q psy6687 99 GVCSCPPGFQG-LHC 112 (116)
Q Consensus 99 ~~C~C~~g~~g-~~C 112 (116)
+.|.|++||.| ..|
T Consensus 20 ~~C~C~~g~~g~~~C 34 (35)
T smart00181 20 YTCSCPPGYTGDKRC 34 (35)
T ss_pred eEeECCCCCccCCcc
Confidence 56777777766 554
No 30
>KOG1836|consensus
Probab=94.51 E-value=0.12 Score=40.63 Aligned_cols=81 Identities=36% Similarity=0.991 Sum_probs=48.3
Q ss_pred ccCCCCCcCCCCCCC-------------CC--C-CCCCCC-cEecC-CCceeCCCCCccCCCC------------CCCCC
Q psy6687 2 CQCPKGYQGTYCGTA-------------MC--F-PQCLNN-GTCTA-PGVCSCPPGFQGLHCE------------GGPGP 51 (116)
Q Consensus 2 C~C~~g~~g~~c~~~-------------~c--~-~~c~~~-g~C~~-~~~C~c~~g~~g~~c~------------~~~~~ 51 (116)
|.|+.+|+|..|+.- .+ . -.|.++ .+|.. ++.|.|...-.|..|+ .....
T Consensus 697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~d 776 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGD 776 (1705)
T ss_pred ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCC
Confidence 789999999988630 01 0 023333 34542 5666666555555553 12222
Q ss_pred CCCCCCCCCceeCC-----CCcee-cCCCCcCCCCCC
Q psy6687 52 GCDQKCSNGGWCDS-----QQMCQ-CPKGYQGTYCGT 82 (116)
Q Consensus 52 ~~~~~c~~~g~c~~-----~~~c~-C~~g~~g~~c~~ 82 (116)
|..-+|.+.+.|.. ...|. |+++|+|.+|+.
T Consensus 777 C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 777 CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred CccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence 55556766666543 25787 999999999874
No 31
>PHA02887 EGF-like protein; Provisional
Probab=94.45 E-value=0.05 Score=29.80 Aligned_cols=27 Identities=33% Similarity=1.019 Sum_probs=19.4
Q ss_pred CCCCCcEec-----CCCceeCCCCCcCCCCCCC
Q psy6687 88 QCLNNGTCT-----APGVCSCPPGFQGLHCEGG 115 (116)
Q Consensus 88 ~c~~~g~c~-----~~~~C~C~~g~~g~~C~~~ 115 (116)
.|. +|+|. ....|.|++||+|.+|+..
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCCcc
Confidence 355 35675 2368999999999999753
No 32
>KOG1836|consensus
Probab=94.42 E-value=0.13 Score=40.49 Aligned_cols=82 Identities=35% Similarity=0.857 Sum_probs=44.7
Q ss_pred ceeCCCCCccCCCCCCC-----------CCCCCCCCCCC---ceeCC-CCceecCCCCcCCCCCC--------------C
Q psy6687 33 VCSCPPGFQGLHCEGGP-----------GPGCDQKCSNG---GWCDS-QQMCQCPKGYQGTYCGT--------------A 83 (116)
Q Consensus 33 ~C~c~~g~~g~~c~~~~-----------~~~~~~~c~~~---g~c~~-~~~c~C~~g~~g~~c~~--------------~ 83 (116)
.|.|+++|+|..|+.-. ..+...+|.-+ .+|.. ++.|.|.+.-.|..|.+ .
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~ 775 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSG 775 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCC
Confidence 48999999999997310 00111123222 23432 24566555544544432 1
Q ss_pred CCC-CCCCCCcEec-----CCCcee-CCCCCcCCCCCC
Q psy6687 84 MCF-PQCLNNGTCT-----APGVCS-CPPGFQGLHCEG 114 (116)
Q Consensus 84 ~c~-~~c~~~g~c~-----~~~~C~-C~~g~~g~~C~~ 114 (116)
+|. -+|.+++.|. ....|. |+++|+|.+|+.
T Consensus 776 dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 776 DCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred CCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence 132 2344444443 235787 999999999974
No 33
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=94.24 E-value=0.021 Score=25.42 Aligned_cols=21 Identities=52% Similarity=1.333 Sum_probs=15.0
Q ss_pred CCCCCcEecC---CCceeCCCCCc
Q psy6687 88 QCLNNGTCTA---PGVCSCPPGFQ 108 (116)
Q Consensus 88 ~c~~~g~c~~---~~~C~C~~g~~ 108 (116)
.|..++.|++ +|+|.|++||.
T Consensus 11 ~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 11 NCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSTTSEEEEETTEEEEEESTTEE
T ss_pred cCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4666677764 37888888886
No 34
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=93.97 E-value=0.059 Score=21.02 Aligned_cols=10 Identities=70% Similarity=1.690 Sum_probs=5.3
Q ss_pred CceeCCCCCc
Q psy6687 32 GVCSCPPGFQ 41 (116)
Q Consensus 32 ~~C~c~~g~~ 41 (116)
|.|.|++||.
T Consensus 2 y~C~C~~Gy~ 11 (24)
T PF12662_consen 2 YTCSCPPGYQ 11 (24)
T ss_pred EEeeCCCCCc
Confidence 4555555553
No 35
>KOG0994|consensus
Probab=93.67 E-value=0.3 Score=37.09 Aligned_cols=56 Identities=32% Similarity=0.748 Sum_probs=35.6
Q ss_pred cEecC-CCceeCCCCCccCCCCC----CC----CCCCCCCCCCCce----eCC-CCceecCCCCcCCCCC
Q psy6687 26 GTCTA-PGVCSCPPGFQGLHCEG----GP----GPGCDQKCSNGGW----CDS-QQMCQCPKGYQGTYCG 81 (116)
Q Consensus 26 g~C~~-~~~C~c~~g~~g~~c~~----~~----~~~~~~~c~~~g~----c~~-~~~c~C~~g~~g~~c~ 81 (116)
..|.. +++|.|.+||-|..|+. .+ ..|-.-.|...|+ |.. +++|+|.+|..|..|+
T Consensus 1077 pqCN~ftGQCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1077 PQCNEFTGQCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred ccccccccceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchh
Confidence 34544 67999999999998862 11 1122223444432 322 3789999999998887
No 36
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=93.27 E-value=0.14 Score=23.71 Aligned_cols=15 Identities=40% Similarity=1.170 Sum_probs=8.0
Q ss_pred CceeCCCCCcCCCCC
Q psy6687 99 GVCSCPPGFQGLHCE 113 (116)
Q Consensus 99 ~~C~C~~g~~g~~C~ 113 (116)
++|.|.++++|.+|+
T Consensus 19 G~C~C~~~~~G~~C~ 33 (50)
T cd00055 19 GQCECKPNTTGRRCD 33 (50)
T ss_pred CEEeCCCcCCCCCCC
Confidence 455555555555554
No 37
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=92.95 E-value=0.11 Score=29.01 Aligned_cols=17 Identities=29% Similarity=0.968 Sum_probs=14.6
Q ss_pred CCceeCCCCCcCCCCCC
Q psy6687 98 PGVCSCPPGFQGLHCEG 114 (116)
Q Consensus 98 ~~~C~C~~g~~g~~C~~ 114 (116)
...|.|..||+|.+||.
T Consensus 66 ~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 66 GMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CceeECCCCcccccccc
Confidence 46899999999999974
No 38
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=92.32 E-value=0.034 Score=23.97 Aligned_cols=23 Identities=43% Similarity=1.044 Sum_probs=13.0
Q ss_pred CCCCCcEecC---CCceeCCCCCcCC
Q psy6687 88 QCLNNGTCTA---PGVCSCPPGFQGL 110 (116)
Q Consensus 88 ~c~~~g~c~~---~~~C~C~~g~~g~ 110 (116)
.|+.+++|.+ ++.|.|++||.|.
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC
Confidence 3555566653 3677788777664
No 39
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=91.10 E-value=0.24 Score=22.48 Aligned_cols=15 Identities=40% Similarity=1.221 Sum_probs=7.0
Q ss_pred CceeCCCCCcCCCCC
Q psy6687 99 GVCSCPPGFQGLHCE 113 (116)
Q Consensus 99 ~~C~C~~g~~g~~C~ 113 (116)
++|.|+++++|++|+
T Consensus 18 G~C~C~~~~~G~~C~ 32 (46)
T smart00180 18 GQCECKPNVTGRRCD 32 (46)
T ss_pred CEEECCCCCCCCCCC
Confidence 344444444444443
No 40
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=88.73 E-value=1.4 Score=20.39 Aligned_cols=20 Identities=45% Similarity=1.374 Sum_probs=11.9
Q ss_pred CCCCCcEecCCCceeCCCCCc
Q psy6687 88 QCLNNGTCTAPGVCSCPPGFQ 108 (116)
Q Consensus 88 ~c~~~g~c~~~~~C~C~~g~~ 108 (116)
.|..+..|.+ +.|.|++||+
T Consensus 27 qC~~~s~C~~-g~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVN-GRCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcC-CEeECCCCCE
Confidence 4445566653 6777777664
No 41
>KOG3607|consensus
Probab=88.57 E-value=0.47 Score=34.39 Aligned_cols=28 Identities=29% Similarity=0.719 Sum_probs=25.0
Q ss_pred CCCCCCcEecCCCceeCCCCCcCCCCCC
Q psy6687 87 PQCLNNGTCTAPGVCSCPPGFQGLHCEG 114 (116)
Q Consensus 87 ~~c~~~g~c~~~~~C~C~~g~~g~~C~~ 114 (116)
..|..+|+|.+..+|.|.+||.+++|+.
T Consensus 630 ~~C~g~GVCnn~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 630 TTCNGHGVCNNELNCHCEPGWAPPFCFI 657 (716)
T ss_pred cccCCCcccCCCcceeeCCCCCCCcccc
Confidence 4588899999889999999999999974
No 42
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=88.00 E-value=0.18 Score=23.07 Aligned_cols=16 Identities=44% Similarity=1.200 Sum_probs=10.4
Q ss_pred CCceeCCCCCcCCCCC
Q psy6687 98 PGVCSCPPGFQGLHCE 113 (116)
Q Consensus 98 ~~~C~C~~g~~g~~C~ 113 (116)
+++|.|.++|+|++|+
T Consensus 17 ~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 17 TGQCVCKPGTTGPRCD 32 (49)
T ss_dssp CEEESBSTTEESTTS-
T ss_pred CCEEeccccccCCcCc
Confidence 3567777777777765
No 43
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=87.92 E-value=0.75 Score=19.47 Aligned_cols=10 Identities=50% Similarity=1.404 Sum_probs=7.7
Q ss_pred CceeCCCCCc
Q psy6687 99 GVCSCPPGFQ 108 (116)
Q Consensus 99 ~~C~C~~g~~ 108 (116)
++|.|++||.
T Consensus 18 ~~C~CPeGyI 27 (34)
T PF09064_consen 18 GQCFCPEGYI 27 (34)
T ss_pred CceeCCCceE
Confidence 5788888875
No 44
>KOG3607|consensus
Probab=83.36 E-value=1.3 Score=32.26 Aligned_cols=30 Identities=23% Similarity=0.754 Sum_probs=25.7
Q ss_pred CCCCCCCceeCCCCceecCCCCcCCCCCCC
Q psy6687 54 DQKCSNGGWCDSQQMCQCPKGYQGTYCGTA 83 (116)
Q Consensus 54 ~~~c~~~g~c~~~~~c~C~~g~~g~~c~~~ 83 (116)
...|+.+|+|.+...|.|.++|.++.|+..
T Consensus 629 ~~~C~g~GVCnn~~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 629 PTTCNGHGVCNNELNCHCEPGWAPPFCFIF 658 (716)
T ss_pred ccccCCCcccCCCcceeeCCCCCCCccccc
Confidence 345888999998889999999999999853
No 45
>KOG1218|consensus
Probab=79.34 E-value=14 Score=23.74 Aligned_cols=15 Identities=53% Similarity=1.588 Sum_probs=10.0
Q ss_pred CceeCCCCCcCCCCC
Q psy6687 99 GVCSCPPGFQGLHCE 113 (116)
Q Consensus 99 ~~C~C~~g~~g~~C~ 113 (116)
..|.|.+||.+.++.
T Consensus 162 ~~c~c~~g~~g~~~~ 176 (316)
T KOG1218|consen 162 GICTCQPGFVGVFCV 176 (316)
T ss_pred CceeccCCccccccc
Confidence 566677777776654
No 46
>KOG1218|consensus
Probab=78.12 E-value=15 Score=23.55 Aligned_cols=64 Identities=30% Similarity=0.870 Sum_probs=33.3
Q ss_pred ceeCCCCCccCCCCC-C-CCCCCCCCCCCCcee-CCCCceecCCCCcCCCCCCCC--CC--CCCCCCcEec
Q psy6687 33 VCSCPPGFQGLHCEG-G-PGPGCDQKCSNGGWC-DSQQMCQCPKGYQGTYCGTAM--CF--PQCLNNGTCT 96 (116)
Q Consensus 33 ~C~c~~g~~g~~c~~-~-~~~~~~~~c~~~g~c-~~~~~c~C~~g~~g~~c~~~~--c~--~~c~~~g~c~ 96 (116)
.|.+..+|.+..|.. . ...-+...|.....+ .....|.|.+||.+.++.... |. ..+.+++.|+
T Consensus 125 ~c~~~~~~~~~~C~~~~~~g~~C~~~c~~~~~~~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~ 195 (316)
T KOG1218|consen 125 ECRCGGGYIGEQCGEENLVGLKCQRDCQCTGGCDCKNGICTCQPGFVGVFCVESCSGCSPLTACENGAKCN 195 (316)
T ss_pred ceecCCcCccccccccCCCCCCccCCCCCccccCCCCCceeccCCcccccccccCCCcCCCcccCCCCeee
Confidence 366666666666654 1 111111122111111 123688899999999887542 33 3455555665
No 47
>KOG3514|consensus
Probab=74.53 E-value=2.5 Score=32.35 Aligned_cols=31 Identities=39% Similarity=1.042 Sum_probs=25.0
Q ss_pred CCC-CCCCCCcEec---CCCceeCC-CCCcCCCCCC
Q psy6687 84 MCF-PQCLNNGTCT---APGVCSCP-PGFQGLHCEG 114 (116)
Q Consensus 84 ~c~-~~c~~~g~c~---~~~~C~C~-~g~~g~~C~~ 114 (116)
.|. +||.++|+|. +.+.|.|. .+|.|+.|++
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 455 6899999997 35789986 5899999985
No 48
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=66.49 E-value=5.2 Score=21.70 Aligned_cols=10 Identities=40% Similarity=0.849 Sum_probs=5.4
Q ss_pred CCCCCceeCC
Q psy6687 56 KCSNGGWCDS 65 (116)
Q Consensus 56 ~c~~~g~c~~ 65 (116)
.|+.+|.|..
T Consensus 14 ~CsgHG~C~~ 23 (103)
T PF12955_consen 14 NCSGHGSCVK 23 (103)
T ss_pred CCCCCceEee
Confidence 3555565543
No 49
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=63.29 E-value=16 Score=19.71 Aligned_cols=24 Identities=42% Similarity=0.872 Sum_probs=15.7
Q ss_pred CCCCCCcEecCC--CceeCCCCCcCC
Q psy6687 87 PQCLNNGTCTAP--GVCSCPPGFQGL 110 (116)
Q Consensus 87 ~~c~~~g~c~~~--~~C~C~~g~~g~ 110 (116)
..|...+.|... ..|.|++||...
T Consensus 84 ~~CG~~g~C~~~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICNSNNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeCCCCCCceECCCCcCCC
Confidence 456666777532 468888888653
No 50
>KOG3516|consensus
Probab=58.17 E-value=12 Score=29.25 Aligned_cols=28 Identities=29% Similarity=0.874 Sum_probs=21.7
Q ss_pred CCCCCCcEecC---CCceeCC-CCCcCCCCCC
Q psy6687 87 PQCLNNGTCTA---PGVCSCP-PGFQGLHCEG 114 (116)
Q Consensus 87 ~~c~~~g~c~~---~~~C~C~-~g~~g~~C~~ 114 (116)
++|..+|.|.. .+.|.|. .||.|..|..
T Consensus 551 N~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 551 NPCEHGGKCSQSWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred ccccCCCcccccccceeEeccccccccccccC
Confidence 67888888874 3788887 7898888864
No 51
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=51.94 E-value=0.34 Score=29.05 Aligned_cols=106 Identities=25% Similarity=0.598 Sum_probs=49.6
Q ss_pred ccCCCCCcC---CCCCC-CCCCC------CCCCCcEecC--------CCceeCCCCCccCCCCCCCCCCCCCCCCCCcee
Q psy6687 2 CQCPKGYQG---TYCGT-AMCFP------QCLNNGTCTA--------PGVCSCPPGFQGLHCEGGPGPGCDQKCSNGGWC 63 (116)
Q Consensus 2 C~C~~g~~g---~~c~~-~~c~~------~c~~~g~C~~--------~~~C~c~~g~~g~~c~~~~~~~~~~~c~~~g~c 63 (116)
|.|.+||.- +.|+. ..|.. +|...+.|.. .+.|.|.++|.-..-.-.+..|....|. .|.|
T Consensus 22 C~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~~~Cg-~GKC 100 (197)
T PF06247_consen 22 CKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNNKDCG-SGKC 100 (197)
T ss_dssp EEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS---T-TEEE
T ss_pred EEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCceecC-CCeE
Confidence 677888852 23443 23432 3666777752 4789999999743221112222233344 4666
Q ss_pred CC------CCceecCCCCc---CCCCC---CCCCCCCCCCCcEecC---CCceeCCCCCc
Q psy6687 64 DS------QQMCQCPKGYQ---GTYCG---TAMCFPQCLNNGTCTA---PGVCSCPPGFQ 108 (116)
Q Consensus 64 ~~------~~~c~C~~g~~---g~~c~---~~~c~~~c~~~g~c~~---~~~C~C~~g~~ 108 (116)
+. ...|+|.-|+. ...|. ...|..-|..+..|.. .|+|.+..++.
T Consensus 101 I~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~ 160 (197)
T PF06247_consen 101 ILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCKEGFP 160 (197)
T ss_dssp EEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-TT-E
T ss_pred EecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecCCCCC
Confidence 42 13788888776 12232 1234445655666652 37888888774
No 52
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=51.90 E-value=4.8 Score=17.25 Aligned_cols=11 Identities=55% Similarity=1.452 Sum_probs=7.3
Q ss_pred CCceeCCCCCc
Q psy6687 98 PGVCSCPPGFQ 108 (116)
Q Consensus 98 ~~~C~C~~g~~ 108 (116)
.++|.|++||.
T Consensus 18 ~~~C~C~~Gy~ 28 (36)
T PF14670_consen 18 SYRCSCPPGYK 28 (36)
T ss_dssp SEEEE-STTEE
T ss_pred ceEeECCCCCE
Confidence 36788888875
No 53
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=49.04 E-value=4 Score=19.37 Aligned_cols=14 Identities=36% Similarity=1.061 Sum_probs=5.9
Q ss_pred ceeCCCCCcCCCCC
Q psy6687 100 VCSCPPGFQGLHCE 113 (116)
Q Consensus 100 ~C~C~~g~~g~~C~ 113 (116)
.|.|..-|.|++|+
T Consensus 37 ~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 37 VCECNSCYGGPDCS 50 (56)
T ss_dssp --EE-TTEESTTS-
T ss_pred cccccCCcCCCCcc
Confidence 45555556666554
No 54
>KOG3516|consensus
Probab=48.73 E-value=19 Score=28.28 Aligned_cols=35 Identities=31% Similarity=0.801 Sum_probs=27.8
Q ss_pred CCCCCCCCCCCCceeCCC---CceecC-CCCcCCCCCCC
Q psy6687 49 PGPGCDQKCSNGGWCDSQ---QMCQCP-KGYQGTYCGTA 83 (116)
Q Consensus 49 ~~~~~~~~c~~~g~c~~~---~~c~C~-~g~~g~~c~~~ 83 (116)
.+.|.+.+|.++|.|..+ +.|.|. .||.|..|...
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccccccCC
Confidence 466778889999998764 688888 79999988744
No 55
>KOG3512|consensus
Probab=48.28 E-value=17 Score=25.44 Aligned_cols=24 Identities=38% Similarity=0.956 Sum_probs=18.2
Q ss_pred CCCcEec-CCCceeCCCCCcCCCCC
Q psy6687 90 LNNGTCT-APGVCSCPPGFQGLHCE 113 (116)
Q Consensus 90 ~~~g~c~-~~~~C~C~~g~~g~~C~ 113 (116)
+-+.+|+ .+++|.|.+|.+|..|.
T Consensus 404 s~gktCNq~tGqCpCkeGvtG~tCn 428 (592)
T KOG3512|consen 404 SAGKTCNQTTGQCPCKEGVTGLTCN 428 (592)
T ss_pred cccccccccCCcccCCCCCcccccc
Confidence 3445666 45899999999998875
No 56
>KOG3514|consensus
Probab=31.01 E-value=52 Score=26.05 Aligned_cols=32 Identities=31% Similarity=0.881 Sum_probs=24.2
Q ss_pred CCCCCCCCCCceeCC---CCceecC-CCCcCCCCCC
Q psy6687 51 PGCDQKCSNGGWCDS---QQMCQCP-KGYQGTYCGT 82 (116)
Q Consensus 51 ~~~~~~c~~~g~c~~---~~~c~C~-~g~~g~~c~~ 82 (116)
.|...||.++|.|.. .+.|.|. .+|.|..|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 455678999998875 3678886 5788988874
No 57
>PF05294 Toxin_5: Scorpion short toxin; InterPro: IPR007958 This family contains various secreted scorpion short toxins which seem to be unrelated to those described in IPR001947 from INTERPRO.; GO: 0009405 pathogenesis, 0005576 extracellular region; PDB: 1SIS_A 1CHL_A.
Probab=21.32 E-value=76 Score=13.16 Aligned_cols=7 Identities=29% Similarity=0.586 Sum_probs=3.0
Q ss_pred CCCcEec
Q psy6687 90 LNNGTCT 96 (116)
Q Consensus 90 ~~~g~c~ 96 (116)
.+.|.|.
T Consensus 20 gg~GkC~ 26 (32)
T PF05294_consen 20 GGRGKCF 26 (32)
T ss_dssp TTSEEEE
T ss_pred CCCCeEc
Confidence 3334444
No 58
>KOG3509|consensus
Probab=20.16 E-value=3.6e+02 Score=21.39 Aligned_cols=29 Identities=34% Similarity=0.907 Sum_probs=19.3
Q ss_pred CCCCCCceeCCC---CceecCCCCcCCCCCCC
Q psy6687 55 QKCSNGGWCDSQ---QMCQCPKGYQGTYCGTA 83 (116)
Q Consensus 55 ~~c~~~g~c~~~---~~c~C~~g~~g~~c~~~ 83 (116)
.++...+.|... ..|.|+++|+|+.|...
T Consensus 412 ~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~ 443 (964)
T KOG3509|consen 412 IPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDC 443 (964)
T ss_pred ccCCCCccccccccccceeccccccCchhhcc
Confidence 344444444432 57899999999988753
Done!