Query psy590
Match_columns 257
No_of_seqs 188 out of 2076
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 21:10:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy590.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/590hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.6 2.9E-15 6.2E-20 138.8 10.9 147 39-209 693-865 (1289)
2 KOG1219|consensus 99.5 3.9E-14 8.5E-19 141.2 5.4 110 39-203 3865-3975(4289)
3 KOG1214|consensus 99.3 4.3E-11 9.2E-16 111.6 12.0 107 7-138 729-862 (1289)
4 KOG1219|consensus 99.2 2.1E-11 4.5E-16 122.5 8.1 98 13-136 3865-3975(4289)
5 KOG4289|consensus 98.9 1.1E-09 2.4E-14 106.9 6.0 128 13-198 1180-1308(2531)
6 KOG1217|consensus 98.8 2.3E-07 5E-12 85.0 14.5 140 51-215 243-405 (487)
7 KOG4289|consensus 98.6 3.7E-07 7.9E-12 90.0 10.7 79 28-131 1217-1308(2531)
8 KOG4260|consensus 98.5 8.1E-08 1.8E-12 79.7 4.0 82 63-201 221-305 (350)
9 KOG1217|consensus 98.5 1.5E-06 3.2E-11 79.6 11.2 166 15-213 129-315 (487)
10 PF07645 EGF_CA: Calcium-bindi 98.2 6.7E-07 1.4E-11 54.2 2.0 33 169-201 1-35 (42)
11 KOG4260|consensus 98.1 5.9E-06 1.3E-10 68.8 4.8 93 109-201 150-269 (350)
12 PF00008 EGF: EGF-like domain 98.0 3.3E-06 7.2E-11 47.9 1.3 30 173-202 1-31 (32)
13 PF07645 EGF_CA: Calcium-bindi 97.9 8.1E-06 1.7E-10 49.4 2.7 35 86-134 1-35 (42)
14 smart00179 EGF_CA Calcium-bind 97.9 1.7E-05 3.7E-10 46.7 3.9 33 170-202 2-36 (39)
15 PF12947 EGF_3: EGF domain; I 97.9 9E-06 1.9E-10 47.3 1.8 30 176-205 6-35 (36)
16 KOG1225|consensus 97.7 0.00029 6.3E-09 65.1 9.4 77 110-203 286-365 (525)
17 cd00054 EGF_CA Calcium-binding 97.5 0.00017 3.7E-09 41.9 3.9 34 170-203 2-36 (38)
18 PF12947 EGF_3: EGF domain; I 97.5 6.5E-05 1.4E-09 43.7 2.0 30 44-73 5-34 (36)
19 PF00008 EGF: EGF-like domain 97.4 7.4E-05 1.6E-09 42.3 1.5 28 44-71 3-31 (32)
20 PF12662 cEGF: Complement Clr- 97.3 0.00025 5.4E-09 37.1 2.4 23 190-212 1-24 (24)
21 smart00179 EGF_CA Calcium-bind 97.2 0.00055 1.2E-08 40.1 3.8 33 39-71 3-36 (39)
22 PF12662 cEGF: Complement Clr- 97.1 0.00059 1.3E-08 35.7 2.5 24 59-89 1-24 (24)
23 cd00053 EGF Epidermal growth f 97.0 0.0011 2.4E-08 37.7 3.6 28 175-202 5-32 (36)
24 smart00181 EGF Epidermal growt 96.9 0.0016 3.5E-08 37.2 3.7 29 173-202 2-31 (35)
25 KOG1225|consensus 96.7 0.012 2.5E-07 54.7 9.7 83 109-203 254-339 (525)
26 cd00054 EGF_CA Calcium-binding 96.7 0.0029 6.3E-08 36.4 3.8 33 39-71 3-35 (38)
27 cd00053 EGF Epidermal growth f 96.2 0.0091 2E-07 33.7 3.7 28 44-71 5-32 (36)
28 smart00181 EGF Epidermal growt 96.2 0.0094 2E-07 33.9 3.7 25 45-70 6-30 (35)
29 PF06247 Plasmod_Pvs28: Plasmo 96.0 0.0031 6.7E-08 50.1 1.2 131 19-203 7-163 (197)
30 PF14670 FXa_inhibition: Coagu 95.0 0.013 2.8E-07 34.0 1.2 22 182-203 10-31 (36)
31 PF06247 Plasmod_Pvs28: Plasmo 94.7 0.0053 1.2E-07 48.8 -1.2 105 46-200 7-119 (197)
32 PF07974 EGF_2: EGF-like domai 93.8 0.089 1.9E-06 29.6 2.9 25 177-203 7-31 (32)
33 PF14670 FXa_inhibition: Coagu 93.7 0.053 1.1E-06 31.4 1.9 25 45-71 6-30 (36)
34 cd01475 vWA_Matrilin VWA_Matri 93.5 0.087 1.9E-06 43.7 3.7 36 166-203 183-220 (224)
35 PF12661 hEGF: Human growth fa 93.4 0.036 7.9E-07 24.5 0.6 12 192-203 1-12 (13)
36 PF12946 EGF_MSP1_1: MSP1 EGF 93.1 0.027 5.9E-07 32.6 0.1 32 173-204 2-34 (37)
37 PF07974 EGF_2: EGF-like domai 91.6 0.28 6.1E-06 27.5 3.0 25 45-71 6-30 (32)
38 KOG1226|consensus 91.3 1.8 3.9E-05 41.9 9.7 64 45-136 514-578 (783)
39 KOG0994|consensus 89.0 1.7 3.6E-05 44.0 7.6 15 122-136 932-946 (1758)
40 KOG1226|consensus 88.8 2.5 5.4E-05 41.0 8.4 89 19-136 515-618 (783)
41 cd01475 vWA_Matrilin VWA_Matri 88.4 0.58 1.3E-05 38.8 3.8 39 81-135 181-219 (224)
42 PF12946 EGF_MSP1_1: MSP1 EGF 83.9 0.58 1.3E-05 27.1 1.0 29 44-72 4-33 (37)
43 PHA02887 EGF-like protein; Pro 83.5 1.3 2.9E-05 32.5 3.0 37 170-210 83-124 (126)
44 PF00954 S_locus_glycop: S-loc 82.8 1.4 3.1E-05 32.1 3.1 32 103-135 78-109 (110)
45 PHA03099 epidermal growth fact 81.3 1.3 2.8E-05 33.1 2.3 38 170-211 42-84 (139)
46 PF09064 Tme5_EGF_like: Thromb 77.6 2.8 6E-05 23.7 2.3 21 182-203 10-30 (34)
47 PF00954 S_locus_glycop: S-loc 74.8 3.3 7.1E-05 30.2 2.9 31 39-70 78-108 (110)
48 KOG1836|consensus 72.4 11 0.00023 40.6 6.8 48 155-203 760-810 (1705)
49 KOG3516|consensus 69.6 3.8 8.1E-05 41.7 2.7 37 167-203 542-579 (1306)
50 PF01683 EB: EB module; Inter 69.4 11 0.00023 23.3 3.9 24 176-203 26-49 (52)
51 smart00051 DSL delta serrate l 65.6 11 0.00023 24.7 3.5 22 179-203 41-62 (63)
52 PHA02887 EGF-like protein; Pro 61.4 9.4 0.0002 28.2 2.8 35 39-77 84-122 (126)
53 KOG3514|consensus 59.8 9.8 0.00021 38.6 3.4 25 14-38 625-649 (1591)
54 PHA03099 epidermal growth fact 56.7 11 0.00023 28.4 2.5 31 44-78 50-82 (139)
55 KOG0994|consensus 55.3 42 0.0009 34.7 6.8 22 183-204 877-899 (1758)
56 KOG3516|consensus 43.5 17 0.00038 37.2 2.3 34 5-38 538-571 (1306)
57 KOG1836|consensus 43.4 84 0.0018 34.2 7.3 27 110-136 781-810 (1705)
58 cd00055 EGF_Lam Laminin-type e 38.6 43 0.00093 20.4 2.8 12 192-203 20-31 (50)
59 KOG3514|consensus 38.6 18 0.00039 36.8 1.6 36 172-210 625-661 (1591)
60 PF00053 Laminin_EGF: Laminin 32.0 27 0.00058 21.1 1.1 19 183-203 12-30 (49)
61 PF12955 DUF3844: Domain of un 28.5 53 0.0011 23.8 2.3 26 176-201 13-43 (103)
62 cd01328 FSL_SPARC Follistatin- 26.9 1.4E+02 0.0031 20.8 4.2 29 14-42 1-30 (86)
63 KOG3512|consensus 23.9 1.9E+02 0.0041 27.0 5.4 88 112-215 282-396 (592)
64 PF01826 TIL: Trypsin Inhibito 23.3 26 0.00057 21.7 -0.1 21 193-213 35-55 (55)
No 1
>KOG1214|consensus
Probab=99.61 E-value=2.9e-15 Score=138.83 Aligned_cols=147 Identities=28% Similarity=0.669 Sum_probs=118.5
Q ss_pred ccCC-cCCCCCCCCEeeeCCC-CcEeeCCCCCccCCCCCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCe
Q psy590 39 DLCA-IASLCAPSAKCHVVKH-RPVCTCPQGYEGNPATKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAV 116 (257)
Q Consensus 39 ~~C~-~~~~C~~~~~C~~~~g-~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~ 116 (257)
++|- ..+.|..++.|....+ .|+|.|..||.|++. .|.+.++|.... ..|.+++.
T Consensus 693 npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr---------~c~d~~eca~~~--------------~~CGp~s~ 749 (1289)
T KOG1214|consen 693 NPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGR---------NCVDENECATGF--------------HRCGPNSV 749 (1289)
T ss_pred ccceecCcccCCCccccCCCCcceEEEEeeccCCCCC---------CCCChhhhccCC--------------CCCCCCce
Confidence 5555 3678888889987765 699999999999854 488899998754 78999999
Q ss_pred eeeCCCceeeeCCCCCccCCC-CCcccCCC---------CCCCCC--------------CCCCCCCCCccCCCCCccCCC
Q psy590 117 CINKNHATDCSCLEGFQGNGF-VGCLPVRN---------YQPICQ--------------YNEDCPPDKLCDRLNRICINP 172 (257)
Q Consensus 117 C~~~~g~~~C~C~~g~~g~~~-~~C~~~~~---------~~~~C~--------------~~~~C~~g~~~~~~~~~~id~ 172 (257)
|++.+++|.|.|..||...+. -.|..+.. ..+.|. +.+.|.+||.|++..|.++|+
T Consensus 750 Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDe 829 (1289)
T KOG1214|consen 750 CINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDE 829 (1289)
T ss_pred eecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccc
Confidence 999999999999999864432 22443321 012332 345689999999999999999
Q ss_pred CCCCCCCCCCEeccCCCCeeeecCCCCccCCCCCccc
Q psy590 173 CQEDSCGEHAICIPKNHGIDCQCQSGHQGNPYIACLS 209 (257)
Q Consensus 173 C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~ 209 (257)
|.++.|++.++|+|+.++|.|+|.+||.|++.. |.+
T Consensus 830 C~psrChp~A~CyntpgsfsC~C~pGy~GDGf~-CVP 865 (1289)
T KOG1214|consen 830 CSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQ-CVP 865 (1289)
T ss_pred cCccccCCCceEecCCCcceeecccCccCCCce-ecC
Confidence 999999999999999999999999999999977 754
No 2
>KOG1219|consensus
Probab=99.47 E-value=3.9e-14 Score=141.24 Aligned_cols=110 Identities=31% Similarity=0.769 Sum_probs=97.1
Q ss_pred ccCCcCCCCCCCCEeeeCC-CCcEeeCCCCCccCCCCCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCee
Q psy590 39 DLCAIASLCAPSAKCHVVK-HRPVCTCPQGYEGNPATKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVC 117 (257)
Q Consensus 39 ~~C~~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C 117 (257)
+.|. .++|.+++.|..+. |+|.|.|++-|.|. .|+ .+++.|.. +||..+++|
T Consensus 3865 d~C~-~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CE-------i~~epC~s----------------nPC~~GgtC 3917 (4289)
T KOG1219|consen 3865 DPCN-DNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCE-------IDLEPCAS----------------NPCLTGGTC 3917 (4289)
T ss_pred cccc-cCcccCCCEecCCCCCceEEeCcccccCc---ccc-------cccccccC----------------CCCCCCCEE
Confidence 7787 78999999999887 68999999999999 787 56777774 899999999
Q ss_pred eeCCCceeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCccCCCCCccCCCCCCCCCCCCCEeccCCCCeeeecCC
Q psy590 118 INKNHATDCSCLEGFQGNGFVGCLPVRNYQPICQYNEDCPPDKLCDRLNRICINPCQEDSCGEHAICIPKNHGIDCQCQS 197 (257)
Q Consensus 118 ~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~C~~~~~C~~g~~~~~~~~~~id~C~~~~C~~~~~C~~~~g~~~C~C~~ 197 (257)
+...++|.|.|+.||+|. .|+.. + +++|..++|.++|+|+|..|+|.|.|.+
T Consensus 3918 ip~~n~f~CnC~~gyTG~-------------~Ce~~----------G-----i~eCs~n~C~~gg~C~n~~gsf~CncT~ 3969 (4289)
T KOG1219|consen 3918 IPFYNGFLCNCPNGYTGK-------------RCEAR----------G-----ISECSKNVCGTGGQCINIPGSFHCNCTP 3969 (4289)
T ss_pred EecCCCeeEeCCCCccCc-------------eeecc----------c-----ccccccccccCCceeeccCCceEeccCh
Confidence 999999999999999999 33321 1 8999999999999999999999999999
Q ss_pred CCccCC
Q psy590 198 GHQGNP 203 (257)
Q Consensus 198 G~~g~~ 203 (257)
||.|..
T Consensus 3970 g~~gr~ 3975 (4289)
T KOG1219|consen 3970 GILGRT 3975 (4289)
T ss_pred hHhccc
Confidence 999987
No 3
>KOG1214|consensus
Probab=99.27 E-value=4.3e-11 Score=111.64 Aligned_cols=107 Identities=25% Similarity=0.658 Sum_probs=85.5
Q ss_pred ccCCCCCCCCCCC--CCCCCeeccCCCceeeec---------------------cccCCc-CCCCCCCC--EeeeCC-CC
Q psy590 7 QCALPGTAKVPRP--CQSDLDCLDAEACYSGQC---------------------EDLCAI-ASLCAPSA--KCHVVK-HR 59 (257)
Q Consensus 7 ~~~~~~~~c~~~~--C~~~~~C~~~~g~~~C~C---------------------~~~C~~-~~~C~~~~--~C~~~~-g~ 59 (257)
+-|...++|++.+ |+.++.|+|.+|+|+|+| .+.|.. .+.|...+ .|+... +.
T Consensus 729 r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~ 808 (1289)
T KOG1214|consen 729 RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGST 808 (1289)
T ss_pred CCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCce
Confidence 4455555677654 999999999999999988 266773 47787654 455554 46
Q ss_pred cEeeCCCCCccCCCCCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCC
Q psy590 60 PVCTCPQGYEGNPATKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEGFQGNGFV 138 (257)
Q Consensus 60 ~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~ 138 (257)
|.|+|.+||.|++. .|.++|||.. +-|..+++|.++++++.|+|.+||.|++..
T Consensus 809 y~C~CLPGfsGDG~---------~c~dvDeC~p----------------srChp~A~CyntpgsfsC~C~pGy~GDGf~ 862 (1289)
T KOG1214|consen 809 YSCACLPGFSGDGH---------QCTDVDECSP----------------SRCHPAATCYNTPGSFSCRCQPGYYGDGFQ 862 (1289)
T ss_pred EEEeecCCccCCcc---------ccccccccCc----------------cccCCCceEecCCCcceeecccCccCCCce
Confidence 99999999999964 3778899983 779999999999999999999999999754
No 4
>KOG1219|consensus
Probab=99.23 E-value=2.1e-11 Score=122.47 Aligned_cols=98 Identities=28% Similarity=0.700 Sum_probs=87.4
Q ss_pred CCCCCCCCCCCCeeccCC-Cceeeec------------cccCCcCCCCCCCCEeeeCCCCcEeeCCCCCccCCCCCccCC
Q psy590 13 TAKVPRPCQSDLDCLDAE-ACYSGQC------------EDLCAIASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLP 79 (257)
Q Consensus 13 ~~c~~~~C~~~~~C~~~~-g~~~C~C------------~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~ 79 (257)
++|..+||+++|+|...+ |+|+|.| ..+|. .+||..+++|+...++|.|.|+.||+|. +|+..
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~-snPC~~GgtCip~~n~f~CnC~~gyTG~---~Ce~~ 3940 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCA-SNPCLTGGTCIPFYNGFLCNCPNGYTGK---RCEAR 3940 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCccccccccccc-CCCCCCCCEEEecCCCeeEeCCCCccCc---eeecc
Confidence 799999999999999985 6799999 26677 8999999999999999999999999999 88742
Q ss_pred CCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCC
Q psy590 80 NPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEGFQGNG 136 (257)
Q Consensus 80 ~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 136 (257)
.++||.. ++|.+++.|+|.+|+|.|.|.+||.|..
T Consensus 3941 ------Gi~eCs~----------------n~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3941 ------GISECSK----------------NVCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred ------ccccccc----------------ccccCCceeeccCCceEeccChhHhccc
Confidence 3778874 7899999999999999999999998775
No 5
>KOG4289|consensus
Probab=98.94 E-value=1.1e-09 Score=106.95 Aligned_cols=128 Identities=23% Similarity=0.573 Sum_probs=91.5
Q ss_pred CCCCCCCCCCCCeeccCCCceeeeccccCCcCCCCCCCCEeeeCCCCcEeeCCCCCccCCCCCccCCCCCCCCcCCCCCC
Q psy590 13 TAKVPRPCQSDLDCLDAEACYSGQCEDLCAIASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGCVSNEECAS 92 (257)
Q Consensus 13 ~~c~~~~C~~~~~C~~~~g~~~C~C~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~ 92 (257)
+.|...||.+.+.|+.... |.-.- +-..... .-..=++..+++.|+|++||+|+ .|+ .++|+|-+
T Consensus 1180 niClrEPCenymkCvsvlr-Fdssa--pf~~s~s--~lfRpi~pvnglrCrCPpGFTgd---~Ce-------TeiDlCYs 1244 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLR-FDSSA--PFLASDS--VLFRPIHPVNGLRCRCPPGFTGD---YCE-------TEIDLCYS 1244 (2531)
T ss_pred chhhcchhHHHHhhhhhee-ecccC--ccccccc--eeeeeccccCceeEeCCCCCCcc---ccc-------chhHhhhc
Confidence 4899999999888886531 00000 0000000 00123345578999999999999 777 57888875
Q ss_pred CCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCccCCCCCccCCC
Q psy590 93 TEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEGFQGNGFVGCLPVRNYQPICQYNEDCPPDKLCDRLNRICINP 172 (257)
Q Consensus 93 ~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~C~~~~~C~~g~~~~~~~~~~id~ 172 (257)
.+|.+++.|....|+|.|.|.+||+|.+ |+-.- . .-.
T Consensus 1245 ----------------~pC~nng~C~srEggYtCeCrpg~tGeh---CEvs~-------~-----------------agr 1281 (2531)
T KOG4289|consen 1245 ----------------GPCGNNGRCRSREGGYTCECRPGFTGEH---CEVSA-------R-----------------AGR 1281 (2531)
T ss_pred ----------------CCCCCCCceEEecCceeEEecCCccccc---eeeec-------c-----------------cCc
Confidence 7899999999999999999999999995 22110 0 234
Q ss_pred CCCCCCCCCCEeccC-CCCeeeecCCC
Q psy590 173 CQEDSCGEHAICIPK-NHGIDCQCQSG 198 (257)
Q Consensus 173 C~~~~C~~~~~C~~~-~g~~~C~C~~G 198 (257)
|.+..|.++++|++. .|+|.|.|+.|
T Consensus 1282 CvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1282 CVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred cccceecCCCEEeecCCCceeccCCCc
Confidence 666779999999987 68899999998
No 6
>KOG1217|consensus
Probab=98.75 E-value=2.3e-07 Score=85.00 Aligned_cols=140 Identities=24% Similarity=0.621 Sum_probs=88.1
Q ss_pred CEeeeCCCCcEeeCCCCCccCCCCCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCC
Q psy590 51 AKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLE 130 (257)
Q Consensus 51 ~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~ 130 (257)
+.|++..++|.|.|++||.+.. + ..+.++++|... ..|.++++|++..+.|.|.|.+
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~---~-----~~~~~~~~C~~~---------------~~c~~~~~C~~~~~~~~C~C~~ 299 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDA---C-----VTCVDVDSCALI---------------ASCPNGGTCVNVPGSYRCTCPP 299 (487)
T ss_pred CcccccCCceeeeCCCCccccc---c-----ceeeeccccCCC---------------CccCCCCeeecCCCcceeeCCC
Confidence 4555555666666666666652 0 124455555542 2388889999999889999999
Q ss_pred CCccCCCCCcccCCC-----------CCCCCC-------CCCCCCCCCccCCCCCccC-CCCCCCCCCCCCEecc-CCCC
Q psy590 131 GFQGNGFVGCLPVRN-----------YQPICQ-------YNEDCPPDKLCDRLNRICI-NPCQEDSCGEHAICIP-KNHG 190 (257)
Q Consensus 131 g~~g~~~~~C~~~~~-----------~~~~C~-------~~~~C~~g~~~~~~~~~~i-d~C~~~~C~~~~~C~~-~~g~ 190 (257)
||.+.....+.+... ....|. +.+.|..++.+ ..|... ++|...++..++.|++ ..++
T Consensus 300 g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g--~~C~~~~~~C~~~~~~~~~~c~~~~~~~ 377 (487)
T KOG1217|consen 300 GFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTG--RRCEDSNDECASSPCCPGGTCVNETPGS 377 (487)
T ss_pred CCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCC--CccccCCccccCCccccCCEeccCCCCC
Confidence 999886411222111 011221 11334444333 334445 4888777888899999 7899
Q ss_pred eeeecCCCCccC---CCCCcccCccccC
Q psy590 191 IDCQCQSGHQGN---PYIACLSELICLC 215 (257)
Q Consensus 191 ~~C~C~~G~~g~---~~~~C~~~~~C~~ 215 (257)
|.|.|+.+|.+. ....+.++++|..
T Consensus 378 ~~c~~~~~~~~~~~~~~~~~~~~~~c~~ 405 (487)
T KOG1217|consen 378 YRCACPAGFAGKANGDGVGCEDIDECSG 405 (487)
T ss_pred eEecCCCccccCCccccccccccccccC
Confidence 999999999873 2234777887754
No 7
>KOG4289|consensus
Probab=98.58 E-value=3.7e-07 Score=89.99 Aligned_cols=79 Identities=23% Similarity=0.613 Sum_probs=60.3
Q ss_pred cCCCceeeec------------cccCCcCCCCCCCCEeeeCCCCcEeeCCCCCccCCCCCccCCCCCCCCcCCCCCCCCc
Q psy590 28 DAEACYSGQC------------EDLCAIASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGCVSNEECASTEA 95 (257)
Q Consensus 28 ~~~g~~~C~C------------~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~ 95 (257)
+..++++|+| +|+|. ..+|.+++.|....|+|+|.|++||+|. +|+-. ..-..|.
T Consensus 1217 ~pvnglrCrCPpGFTgd~CeTeiDlCY-s~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs-----~~agrCv---- 1283 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCY-SGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVS-----ARAGRCV---- 1283 (2531)
T ss_pred cccCceeEeCCCCCCcccccchhHhhh-cCCCCCCCceEEecCceeEEecCCcccc---ceeee-----cccCccc----
Confidence 3445566666 47777 8999999999999999999999999998 77631 1112333
Q ss_pred ccCCCccCCCCCCCCCCCCCeeeeCC-CceeeeCCCC
Q psy590 96 CINSQCQKPCAVHSPCAPNAVCINKN-HATDCSCLEG 131 (257)
Q Consensus 96 c~g~~c~~~C~~~~~C~~~~~C~~~~-g~~~C~C~~g 131 (257)
+..|+++++|++.. +++.|.|+.|
T Consensus 1284 ------------pGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1284 ------------PGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred ------------cceecCCCEEeecCCCceeccCCCc
Confidence 36789899999754 7888999876
No 8
>KOG4260|consensus
Probab=98.53 E-value=8.1e-08 Score=79.65 Aligned_cols=82 Identities=28% Similarity=0.711 Sum_probs=61.2
Q ss_pred eCCCCCccCCCCCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCCccc
Q psy590 63 TCPQGYEGNPATKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEGFQGNGFVGCLP 142 (257)
Q Consensus 63 ~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~C~~ 142 (257)
.|+.||..+ ...|+|+|||...+ .+|.....|+|+.|+|.|...+||.+.
T Consensus 221 kCkkGW~ld---------e~gCvDvnEC~~ep--------------~~c~~~qfCvNteGSf~C~dk~Gy~~g------- 270 (350)
T KOG4260|consen 221 KCKKGWKLD---------EEGCVDVNECQNEP--------------APCKAHQFCVNTEGSFKCEDKEGYKKG------- 270 (350)
T ss_pred hhcccceec---------ccccccHHHHhcCC--------------CCCChhheeecCCCceEecccccccCC-------
Confidence 466666655 45688999998764 889988999999999999999888763
Q ss_pred CCCCCCCCCCCCCCCCCCccCCCCCccCCCCCC--CCC-CCCCEeccCCCCeeeecCCCCcc
Q psy590 143 VRNYQPICQYNEDCPPDKLCDRLNRICINPCQE--DSC-GEHAICIPKNHGIDCQCQSGHQG 201 (257)
Q Consensus 143 ~~~~~~~C~~~~~C~~g~~~~~~~~~~id~C~~--~~C-~~~~~C~~~~g~~~C~C~~G~~g 201 (257)
+|+|.. ..| ..+..|.|..+.|+|+|..|+.-
T Consensus 271 ---------------------------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 271 ---------------------------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred ---------------------------hHHhhhhhhhcccCCCCcccCCccEEEEeccccee
Confidence 233321 122 24567889999999999988753
No 9
>KOG1217|consensus
Probab=98.46 E-value=1.5e-06 Score=79.64 Aligned_cols=166 Identities=25% Similarity=0.631 Sum_probs=103.5
Q ss_pred CCCCC--CCCCCeeccCC---Cceeeecc------------ccCC-cCCCCCCCCEeeeCCCCcEeeCCCCCccCCCCCc
Q psy590 15 KVPRP--CQSDLDCLDAE---ACYSGQCE------------DLCA-IASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKC 76 (257)
Q Consensus 15 c~~~~--C~~~~~C~~~~---g~~~C~C~------------~~C~-~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C 76 (257)
|...+ +...+.|.+.. ..|.|.|. ++|. ..++|.+++.|.+..++|.|.|++||.+. .|
T Consensus 129 C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~---~~ 205 (487)
T KOG1217|consen 129 CVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS---TC 205 (487)
T ss_pred ecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC---cC
Confidence 55554 24446677643 47778772 4676 35669989999999999999999999988 33
Q ss_pred cCC-CCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCC-CeeeeCCCceeeeCCCCCccCCCCCcccCCCCCCCCCCCC
Q psy590 77 YLP-NPLGCVSNEECASTEACINSQCQKPCAVHSPCAPN-AVCINKNHATDCSCLEGFQGNGFVGCLPVRNYQPICQYNE 154 (257)
Q Consensus 77 ~~~-~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~-~~C~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~C~~~~ 154 (257)
+.. ....|.+...|.......+..|... ...|..+ +.|++..++|.|.|.+||.+...
T Consensus 206 ~~~~~~~~c~~~~~~~~~~g~~~~~c~~~---~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~----------------- 265 (487)
T KOG1217|consen 206 ETTGNGGTCVDSVACSCPPGARGPECEVS---IVECASGDGTCVNTVGSYTCRCPEGYTGDAC----------------- 265 (487)
T ss_pred cCCCCCceEecceeccCCCCCCCCCcccc---cccccCCCCcccccCCceeeeCCCCcccccc-----------------
Confidence 321 0112222111111111112222210 0122222 78888888889999888887731
Q ss_pred CCCCCCccCCCCCccCCCCCCCC-CCCCCEeccCCCCeeeecCCCCccCCCCCcccCccc
Q psy590 155 DCPPDKLCDRLNRICINPCQEDS-CGEHAICIPKNHGIDCQCQSGHQGNPYIACLSELIC 213 (257)
Q Consensus 155 ~C~~g~~~~~~~~~~id~C~~~~-C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~C 213 (257)
..+.++++|.... |.++++|++..+.|.|.|++||++.....+.+..+|
T Consensus 266 ----------~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C 315 (487)
T KOG1217|consen 266 ----------VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDEC 315 (487)
T ss_pred ----------ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccc
Confidence 1133478888753 888999999999999999999999874113344455
No 10
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.22 E-value=6.7e-07 Score=54.21 Aligned_cols=33 Identities=27% Similarity=0.726 Sum_probs=29.5
Q ss_pred cCCCCCC--CCCCCCCEeccCCCCeeeecCCCCcc
Q psy590 169 CINPCQE--DSCGEHAICIPKNHGIDCQCQSGHQG 201 (257)
Q Consensus 169 ~id~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~g 201 (257)
|||||.. +.|..++.|+|+.|+|+|.|++||+.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 4899975 57988899999999999999999983
No 11
>KOG4260|consensus
Probab=98.05 E-value=5.9e-06 Score=68.80 Aligned_cols=93 Identities=22% Similarity=0.518 Sum_probs=66.0
Q ss_pred CCCCCCCeee---eCCCceeeeCCCCCccCCCCCcccC------CC-----------CCCCCCCC-----CCCCCCCccC
Q psy590 109 SPCAPNAVCI---NKNHATDCSCLEGFQGNGFVGCLPV------RN-----------YQPICQYN-----EDCPPDKLCD 163 (257)
Q Consensus 109 ~~C~~~~~C~---~~~g~~~C~C~~g~~g~~~~~C~~~------~~-----------~~~~C~~~-----~~C~~g~~~~ 163 (257)
.+|..++.|. ...|+-.|.|..||.|+....|... ++ ....|... .+|..|+.-+
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~ld 229 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLD 229 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceec
Confidence 5677777775 2346678999999999864333321 11 11122210 2256677777
Q ss_pred CCCCccCCCCCC--CCCCCCCEeccCCCCeeeecCCCCcc
Q psy590 164 RLNRICINPCQE--DSCGEHAICIPKNHGIDCQCQSGHQG 201 (257)
Q Consensus 164 ~~~~~~id~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~g 201 (257)
...|.|||+|.. .+|..+..|+|+.|+|+|.+++||.+
T Consensus 230 e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 230 EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 778999999974 68999899999999999999999987
No 12
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.96 E-value=3.3e-06 Score=47.91 Aligned_cols=30 Identities=30% Similarity=0.860 Sum_probs=26.9
Q ss_pred CCCCCCCCCCEeccCC-CCeeeecCCCCccC
Q psy590 173 CQEDSCGEHAICIPKN-HGIDCQCQSGHQGN 202 (257)
Q Consensus 173 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~ 202 (257)
|.+++|.++|+|++.. ++|+|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4567899999999998 99999999999986
No 13
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.93 E-value=8.1e-06 Score=49.40 Aligned_cols=35 Identities=31% Similarity=0.763 Sum_probs=30.0
Q ss_pred cCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCcc
Q psy590 86 SNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEGFQG 134 (257)
Q Consensus 86 ~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g 134 (257)
|++||.... +.|..++.|+|+.|+|.|.|++||..
T Consensus 1 DidEC~~~~--------------~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGP--------------HNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTS--------------SSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCC--------------CcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 467787653 78988899999999999999999983
No 14
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.91 E-value=1.7e-05 Score=46.75 Aligned_cols=33 Identities=24% Similarity=0.763 Sum_probs=29.5
Q ss_pred CCCCCC-CCCCCCCEeccCCCCeeeecCCCCc-cC
Q psy590 170 INPCQE-DSCGEHAICIPKNHGIDCQCQSGHQ-GN 202 (257)
Q Consensus 170 id~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~-g~ 202 (257)
+|+|.. ++|.+++.|++..++|.|.|+.||. |.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 788877 7899889999999999999999998 54
No 15
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.85 E-value=9e-06 Score=47.32 Aligned_cols=30 Identities=23% Similarity=0.736 Sum_probs=24.3
Q ss_pred CCCCCCCEeccCCCCeeeecCCCCccCCCC
Q psy590 176 DSCGEHAICIPKNHGIDCQCQSGHQGNPYI 205 (257)
Q Consensus 176 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~ 205 (257)
+.|+.+|+|+++.++|.|+|++||.|++..
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 358999999999999999999999999854
No 16
>KOG1225|consensus
Probab=97.66 E-value=0.00029 Score=65.09 Aligned_cols=77 Identities=22% Similarity=0.613 Sum_probs=47.9
Q ss_pred CCCCCCeeeeCCCceeeeCCCCCccCCC--CCcccCCCCCCCCC-CCCCCCCCCccCCCCCccCCCCCCCCCCCCCEecc
Q psy590 110 PCAPNAVCINKNHATDCSCLEGFQGNGF--VGCLPVRNYQPICQ-YNEDCPPDKLCDRLNRICINPCQEDSCGEHAICIP 186 (257)
Q Consensus 110 ~C~~~~~C~~~~g~~~C~C~~g~~g~~~--~~C~~~~~~~~~C~-~~~~C~~g~~~~~~~~~~id~C~~~~C~~~~~C~~ 186 (257)
.|..++.+++. .|.|.+||.|... ..|...-...+.|. ..+.|.+||.|.. |....|.+++.|++
T Consensus 286 ~cs~~g~~~~g----~CiC~~g~~G~dCs~~~cpadC~g~G~Ci~G~C~C~~Gy~G~~--------C~~~~C~~~g~cv~ 353 (525)
T KOG1225|consen 286 DCSGGGVCVDG----ECICNPGYSGKDCSIRRCPADCSGHGKCIDGECLCDEGYTGEL--------CIQRACSGGGQCVN 353 (525)
T ss_pred ccCCCceecCC----EeecCCCccccccccccCCccCCCCCcccCCceEeCCCCcCCc--------ccccccCCCceecc
Confidence 35555555433 6889999988862 12332222344555 2455677887653 33224888888876
Q ss_pred CCCCeeeecCCCCccCC
Q psy590 187 KNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 187 ~~g~~~C~C~~G~~g~~ 203 (257)
. |.|..||+|..
T Consensus 354 g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 354 G-----CKCKKGWRGPD 365 (525)
T ss_pred C-----ceeccCccCCC
Confidence 3 89999999876
No 17
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.50 E-value=0.00017 Score=41.87 Aligned_cols=34 Identities=24% Similarity=0.717 Sum_probs=29.6
Q ss_pred CCCCCC-CCCCCCCEeccCCCCeeeecCCCCccCC
Q psy590 170 INPCQE-DSCGEHAICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 170 id~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
+++|.. .+|.+++.|++..++|+|.|+.||.|..
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCc
Confidence 677876 6898889999999999999999999854
No 18
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.50 E-value=6.5e-05 Score=43.69 Aligned_cols=30 Identities=37% Similarity=0.758 Sum_probs=24.6
Q ss_pred CCCCCCCCEeeeCCCCcEeeCCCCCccCCC
Q psy590 44 ASLCAPSAKCHVVKHRPVCTCPQGYEGNPA 73 (257)
Q Consensus 44 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 73 (257)
.+.|+.+|.|+++.++|.|.|++||.|++.
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 568999999999999999999999999863
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.43 E-value=7.4e-05 Score=42.30 Aligned_cols=28 Identities=29% Similarity=0.776 Sum_probs=24.7
Q ss_pred CCCCCCCCEeeeCC-CCcEeeCCCCCccC
Q psy590 44 ASLCAPSAKCHVVK-HRPVCTCPQGYEGN 71 (257)
Q Consensus 44 ~~~C~~~~~C~~~~-g~~~C~C~~G~~g~ 71 (257)
.++|.++++|++.. +.|.|.|++||+|.
T Consensus 3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 56899999999988 88999999999885
No 20
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.28 E-value=0.00025 Score=37.14 Aligned_cols=23 Identities=26% Similarity=0.436 Sum_probs=18.2
Q ss_pred CeeeecCCCCccCCC-CCcccCcc
Q psy590 190 GIDCQCQSGHQGNPY-IACLSELI 212 (257)
Q Consensus 190 ~~~C~C~~G~~g~~~-~~C~~~~~ 212 (257)
+|+|.|++||+.... .+|.||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 689999999995432 35999986
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.22 E-value=0.00055 Score=40.08 Aligned_cols=33 Identities=33% Similarity=0.756 Sum_probs=26.7
Q ss_pred ccCCcCCCCCCCCEeeeCCCCcEeeCCCCCc-cC
Q psy590 39 DLCAIASLCAPSAKCHVVKHRPVCTCPQGYE-GN 71 (257)
Q Consensus 39 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~-g~ 71 (257)
++|....+|.+++.|+++.++|.|.|++||. |.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred ccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 5676226888888999999999999999998 54
No 22
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.08 E-value=0.00059 Score=35.72 Aligned_cols=24 Identities=33% Similarity=0.850 Sum_probs=18.9
Q ss_pred CcEeeCCCCCccCCCCCccCCCCCCCCcCCC
Q psy590 59 RPVCTCPQGYEGNPATKCYLPNPLGCVSNEE 89 (257)
Q Consensus 59 ~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~e 89 (257)
+|+|.|++||+..+ +...|+|+||
T Consensus 1 sy~C~C~~Gy~l~~-------d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP-------DGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCC-------CCCccccCCC
Confidence 58999999998764 3567888875
No 23
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.01 E-value=0.0011 Score=37.71 Aligned_cols=28 Identities=21% Similarity=0.713 Sum_probs=25.1
Q ss_pred CCCCCCCCEeccCCCCeeeecCCCCccC
Q psy590 175 EDSCGEHAICIPKNHGIDCQCQSGHQGN 202 (257)
Q Consensus 175 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 202 (257)
..+|.+++.|++..++|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4568888999999999999999999987
No 24
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.91 E-value=0.0016 Score=37.20 Aligned_cols=29 Identities=28% Similarity=0.822 Sum_probs=24.4
Q ss_pred CCC-CCCCCCCEeccCCCCeeeecCCCCccC
Q psy590 173 CQE-DSCGEHAICIPKNHGIDCQCQSGHQGN 202 (257)
Q Consensus 173 C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g~ 202 (257)
|.. ++|.++ .|++..++|.|.|+.||.|+
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence 444 568877 99999999999999999984
No 25
>KOG1225|consensus
Probab=96.74 E-value=0.012 Score=54.72 Aligned_cols=83 Identities=25% Similarity=0.601 Sum_probs=51.9
Q ss_pred CCCCCCCeeeeCCCceeeeCCCCCccCCCC--CcccCCCCCCCCC-CCCCCCCCCccCCCCCccCCCCCCCCCCCCCEec
Q psy590 109 SPCAPNAVCINKNHATDCSCLEGFQGNGFV--GCLPVRNYQPICQ-YNEDCPPDKLCDRLNRICINPCQEDSCGEHAICI 185 (257)
Q Consensus 109 ~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~--~C~~~~~~~~~C~-~~~~C~~g~~~~~~~~~~id~C~~~~C~~~~~C~ 185 (257)
..|..++.|++. .|.|++||.|..-. .|...--....+. ..+.|.+||.|.... +-.|. .+|.+++.|+
T Consensus 254 ~~c~~~g~c~~G----~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs---~~~cp-adC~g~G~Ci 325 (525)
T KOG1225|consen 254 GGCTGRGQCVEG----RCICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCS---IRRCP-ADCSGHGKCI 325 (525)
T ss_pred CCCcccceEeCC----eEeCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccc---cccCC-ccCCCCCccc
Confidence 445555667644 69999999998622 1222100111222 235688999885422 33343 6799999999
Q ss_pred cCCCCeeeecCCCCccCC
Q psy590 186 PKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 186 ~~~g~~~C~C~~G~~g~~ 203 (257)
.+ +|.|.+||+|..
T Consensus 326 --~G--~C~C~~Gy~G~~ 339 (525)
T KOG1225|consen 326 --DG--ECLCDEGYTGEL 339 (525)
T ss_pred --CC--ceEeCCCCcCCc
Confidence 22 699999999987
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.71 E-value=0.0029 Score=36.44 Aligned_cols=33 Identities=33% Similarity=0.773 Sum_probs=26.2
Q ss_pred ccCCcCCCCCCCCEeeeCCCCcEeeCCCCCccC
Q psy590 39 DLCAIASLCAPSAKCHVVKHRPVCTCPQGYEGN 71 (257)
Q Consensus 39 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 71 (257)
++|....+|.+++.|++..+.|.|.|+.||.|.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 456521678878899998889999999999885
No 27
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.17 E-value=0.0091 Score=33.71 Aligned_cols=28 Identities=29% Similarity=0.784 Sum_probs=23.4
Q ss_pred CCCCCCCCEeeeCCCCcEeeCCCCCccC
Q psy590 44 ASLCAPSAKCHVVKHRPVCTCPQGYEGN 71 (257)
Q Consensus 44 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 71 (257)
..+|.+++.|++..+.|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 3577778889988888999999999875
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.17 E-value=0.0094 Score=33.92 Aligned_cols=25 Identities=32% Similarity=0.865 Sum_probs=22.4
Q ss_pred CCCCCCCEeeeCCCCcEeeCCCCCcc
Q psy590 45 SLCAPSAKCHVVKHRPVCTCPQGYEG 70 (257)
Q Consensus 45 ~~C~~~~~C~~~~g~~~C~C~~G~~g 70 (257)
.+|.++ .|++..++|.|.|++||.|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCcc
Confidence 578877 8999989999999999988
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.95 E-value=0.0031 Score=50.13 Aligned_cols=131 Identities=23% Similarity=0.510 Sum_probs=73.7
Q ss_pred CCCCCCeeccCCCceeeecc--------------ccCCc----CCCCCCCCEeeeCC-----CCcEeeCCCCCccCCCCC
Q psy590 19 PCQSDLDCLDAEACYSGQCE--------------DLCAI----ASLCAPSAKCHVVK-----HRPVCTCPQGYEGNPATK 75 (257)
Q Consensus 19 ~C~~~~~C~~~~g~~~C~C~--------------~~C~~----~~~C~~~~~C~~~~-----g~~~C~C~~G~~g~~~~~ 75 (257)
.|.+ |..+..-..|.|.|. .+|.. ..+|+..+.|.+.. ..|.|.|.+||+.... .
T Consensus 7 ~CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-v 84 (197)
T PF06247_consen 7 ICKN-GYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG-V 84 (197)
T ss_dssp --BT-EEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-S
T ss_pred cccC-CEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC-e
Confidence 4554 455545566788882 33431 46798899999876 4689999999987732 2
Q ss_pred ccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCC---CceeeeCCCCCccCCCCCcccCCCCCCCCCC
Q psy590 76 CYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKN---HATDCSCLEGFQGNGFVGCLPVRNYQPICQY 152 (257)
Q Consensus 76 C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~---g~~~C~C~~g~~g~~~~~C~~~~~~~~~C~~ 152 (257)
|. .++|.. ..|. .+.|+..+ ....|+|.-|+.......|....
T Consensus 85 --------Cv-p~~C~~----------------~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G-------- 130 (197)
T PF06247_consen 85 --------CV-PNKCNN----------------KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTG-------- 130 (197)
T ss_dssp --------EE-EGGGSS-------------------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------
T ss_pred --------Ec-hhhcCc----------------eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCC--------
Confidence 32 133432 4577 57887433 34589999998843333233211
Q ss_pred CCCCCCCCccCCCCCccCCCCCCCCCCCCCEeccCCCCeeeecCCCCccCC
Q psy590 153 NEDCPPDKLCDRLNRICINPCQEDSCGEHAICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 153 ~~~C~~g~~~~~~~~~~id~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
-.+|.. .|..+..|..+.+-|+|.|..||.++.
T Consensus 131 -----------------~T~C~L-KCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 131 -----------------ETKCSL-KCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp --------------------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred -----------------ccceee-ecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 223432 266778999999999999999998765
No 30
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.97 E-value=0.013 Score=33.98 Aligned_cols=22 Identities=23% Similarity=0.579 Sum_probs=18.2
Q ss_pred CEeccCCCCeeeecCCCCccCC
Q psy590 182 AICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 182 ~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
..|++++++|+|.|++||+...
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-T
T ss_pred CCCccCCCceEeECCCCCEECc
Confidence 5899999999999999998754
No 31
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=94.71 E-value=0.0053 Score=48.81 Aligned_cols=105 Identities=30% Similarity=0.711 Sum_probs=61.5
Q ss_pred CCCCCCEeeeCCCCcEeeCCCCCccCCCCCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCC----
Q psy590 46 LCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKN---- 121 (257)
Q Consensus 46 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~---- 121 (257)
.|.+ +..+.....|.|.|.+||.... .+.|....+|.... ....+|...+.|++..
T Consensus 7 ~CKN-G~LiQMSNHfEC~Cnegfvl~~--------EntCE~kv~C~~~e-----------~~~K~Cgdya~C~~~~~~~~ 66 (197)
T PF06247_consen 7 ICKN-GYLIQMSNHFECKCNEGFVLKN--------ENTCEEKVECDKLE-----------NVNKPCGDYAKCINQANKGE 66 (197)
T ss_dssp --BT-EEEEEESSEEEEEESTTEEEEE--------TTEEEE----SG-G-----------GTTSEEETTEEEEE-SSTTS
T ss_pred cccC-CEEEEccCceEEEcCCCcEEcc--------ccccccceecCccc-----------ccCccccchhhhhcCCCccc
Confidence 4544 4666667789999999998763 33455555554310 0135788889998765
Q ss_pred -CceeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCccCCCCCccCCCCCCCCCCCCCEeccCC---CCeeeecCC
Q psy590 122 -HATDCSCLEGFQGNGFVGCLPVRNYQPICQYNEDCPPDKLCDRLNRICINPCQEDSCGEHAICIPKN---HGIDCQCQS 197 (257)
Q Consensus 122 -g~~~C~C~~g~~g~~~~~C~~~~~~~~~C~~~~~C~~g~~~~~~~~~~id~C~~~~C~~~~~C~~~~---g~~~C~C~~ 197 (257)
..|.|.|.+||...... |.+ +.|....|+ .|.|+-.+ ....|+|..
T Consensus 67 ~~~~~C~C~~gY~~~~~v-Cvp----------------------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~I 116 (197)
T PF06247_consen 67 ERAYKCDCINGYILKQGV-CVP----------------------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNI 116 (197)
T ss_dssp STSEEEEE-TTEEESSSS-EEE----------------------------GGGSS---T-TEEEEEEEGGGSEEEEEE-T
T ss_pred ceeEEEecccCceeeCCe-Ech----------------------------hhcCceecC-CCeEEecCCCCCCceeEeee
Confidence 47999999999876432 221 344445577 57885432 245999999
Q ss_pred CCc
Q psy590 198 GHQ 200 (257)
Q Consensus 198 G~~ 200 (257)
|+.
T Consensus 117 GkV 119 (197)
T PF06247_consen 117 GKV 119 (197)
T ss_dssp EEE
T ss_pred ceE
Confidence 998
No 32
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.77 E-value=0.089 Score=29.57 Aligned_cols=25 Identities=32% Similarity=0.757 Sum_probs=21.1
Q ss_pred CCCCCCEeccCCCCeeeecCCCCccCC
Q psy590 177 SCGEHAICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 177 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
.|.++++|+.. ..+|+|.+||+|+.
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPD 31 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCC
Confidence 58889999876 45899999999975
No 33
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.72 E-value=0.053 Score=31.40 Aligned_cols=25 Identities=28% Similarity=0.643 Sum_probs=19.3
Q ss_pred CCCCCCCEeeeCCCCcEeeCCCCCccC
Q psy590 45 SLCAPSAKCHVVKHRPVCTCPQGYEGN 71 (257)
Q Consensus 45 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 71 (257)
..|.. .|++++++|.|.|++||.+.
T Consensus 6 GgC~h--~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 6 GGCSH--ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp GGSSS--EEEEETTSEEEE-STTEEE-
T ss_pred CCcCC--CCccCCCceEeECCCCCEEC
Confidence 34553 79999999999999999886
No 34
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.47 E-value=0.087 Score=43.75 Aligned_cols=36 Identities=19% Similarity=0.446 Sum_probs=28.3
Q ss_pred CCccCCCCCC--CCCCCCCEeccCCCCeeeecCCCCccCC
Q psy590 166 NRICINPCQE--DSCGEHAICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 166 ~~~~id~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
.|.++++|.. +.|. ..|.++.|+|.|.|+.||+...
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence 3555788864 4565 5899999999999999998743
No 35
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.36 E-value=0.036 Score=24.51 Aligned_cols=12 Identities=42% Similarity=1.013 Sum_probs=9.2
Q ss_pred eeecCCCCccCC
Q psy590 192 DCQCQSGHQGNP 203 (257)
Q Consensus 192 ~C~C~~G~~g~~ 203 (257)
.|+|++||+|..
T Consensus 1 ~C~C~~G~~G~~ 12 (13)
T PF12661_consen 1 TCQCPPGWTGPN 12 (13)
T ss_dssp EEEE-TTEETTT
T ss_pred CccCcCCCcCCC
Confidence 489999999975
No 36
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.15 E-value=0.027 Score=32.59 Aligned_cols=32 Identities=22% Similarity=0.521 Sum_probs=22.1
Q ss_pred CCCCCCCCCCEeccCC-CCeeeecCCCCccCCC
Q psy590 173 CQEDSCGEHAICIPKN-HGIDCQCQSGHQGNPY 204 (257)
Q Consensus 173 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~~ 204 (257)
|....|..++.|++.. |.+.|+|..||..++.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence 3345677889999886 8999999999987664
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=91.64 E-value=0.28 Score=27.53 Aligned_cols=25 Identities=28% Similarity=0.780 Sum_probs=20.4
Q ss_pred CCCCCCCEeeeCCCCcEeeCCCCCccC
Q psy590 45 SLCAPSAKCHVVKHRPVCTCPQGYEGN 71 (257)
Q Consensus 45 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 71 (257)
..|..+++|+.. ...|.|.+||.|.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCC
Confidence 468889999865 3579999999987
No 38
>KOG1226|consensus
Probab=91.27 E-value=1.8 Score=41.88 Aligned_cols=64 Identities=22% Similarity=0.599 Sum_probs=37.5
Q ss_pred CCCCCCCEeeeCCCCcEeeCCCCCccCCC-CCccCCCCCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCc
Q psy590 45 SLCAPSAKCHVVKHRPVCTCPQGYEGNPA-TKCYLPNPLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHA 123 (257)
Q Consensus 45 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~-~~C~~~~~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~ 123 (257)
.+|..++.|+= .+|.|.+...+.-+ .+|+ |.+. .|.... ...|..++.|.-.
T Consensus 514 ~vCSgrG~C~C----GqC~C~~~~~~~i~G~fCE------CDnf-sC~r~~-------------g~lC~g~G~C~CG--- 566 (783)
T KOG1226|consen 514 PVCSGRGDCVC----GQCVCHKPDNGKIYGKFCE------CDNF-SCERHK-------------GVLCGGHGRCECG--- 566 (783)
T ss_pred CCcCCCCcEeC----CceEecCCCCCceeeeeee------ccCc-cccccc-------------CcccCCCCeEeCC---
Confidence 36777777752 24788777662211 1454 2221 232211 2568888887533
Q ss_pred eeeeCCCCCccCC
Q psy590 124 TDCSCLEGFQGNG 136 (257)
Q Consensus 124 ~~C~C~~g~~g~~ 136 (257)
.|+|.+||+|..
T Consensus 567 -~CvC~~GwtG~~ 578 (783)
T KOG1226|consen 567 -RCVCNPGWTGSA 578 (783)
T ss_pred -cEEcCCCCccCC
Confidence 599999999985
No 39
>KOG0994|consensus
Probab=89.01 E-value=1.7 Score=44.04 Aligned_cols=15 Identities=33% Similarity=0.904 Sum_probs=12.4
Q ss_pred CceeeeCCCCCccCC
Q psy590 122 HATDCSCLEGFQGNG 136 (257)
Q Consensus 122 g~~~C~C~~g~~g~~ 136 (257)
....|.|.+||.|..
T Consensus 932 ~~ivC~C~~GY~G~R 946 (1758)
T KOG0994|consen 932 QQIVCHCQEGYSGSR 946 (1758)
T ss_pred cceeeecccCccccc
Confidence 456899999999984
No 40
>KOG1226|consensus
Probab=88.78 E-value=2.5 Score=40.97 Aligned_cols=89 Identities=25% Similarity=0.740 Sum_probs=54.2
Q ss_pred CCCCCCeec-------cCCC----ceeeecccc-CC--cCCCCCCCCEeeeCCCCcEeeCCCCCccCCCCCccCCCCCCC
Q psy590 19 PCQSDLDCL-------DAEA----CYSGQCEDL-CA--IASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGC 84 (257)
Q Consensus 19 ~C~~~~~C~-------~~~g----~~~C~C~~~-C~--~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~C 84 (257)
+|-..|.|. .... +.+|+|.|- |. ....|..++.|.= ..|.|.+||+|. .|.-.
T Consensus 515 vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~C----G~CvC~~GwtG~---~C~C~----- 582 (783)
T KOG1226|consen 515 VCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCEC----GRCVCNPGWTGS---ACNCP----- 582 (783)
T ss_pred CcCCCCcEeCCceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeC----CcEEcCCCCccC---CCCCC-----
Confidence 677666654 2222 457888654 65 2567888888853 359999999998 66521
Q ss_pred CcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCC-CccCC
Q psy590 85 VSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEG-FQGNG 136 (257)
Q Consensus 85 ~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g-~~g~~ 136 (257)
.+.+.|.... ...|...++|.-. +|+|... |.|..
T Consensus 583 ~std~C~~~~-------------G~iCSGrG~C~Cg----~C~C~~~~~sG~~ 618 (783)
T KOG1226|consen 583 LSTDTCESSD-------------GQICSGRGTCECG----RCKCTDPPYSGEF 618 (783)
T ss_pred CCCccccCCC-------------CceeCCCceeeCC----ceEcCCCCcCcch
Confidence 2334444322 2456666666533 4777755 77774
No 41
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.40 E-value=0.58 Score=38.76 Aligned_cols=39 Identities=31% Similarity=0.611 Sum_probs=29.6
Q ss_pred CCCCCcCCCCCCCCcccCCCccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCccC
Q psy590 81 PLGCVSNEECASTEACINSQCQKPCAVHSPCAPNAVCINKNHATDCSCLEGFQGN 135 (257)
Q Consensus 81 ~~~C~~~~eC~~~~~c~g~~c~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~ 135 (257)
...|.+.++|.... ..|. ..|.++.|+|.|.|..||...
T Consensus 181 ~~~C~~~~~C~~~~--------------~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICVVPDLCATLS--------------HVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCcCchhhcCCC--------------CCcc--ceEEcCCCCEEeECCCCccCC
Confidence 34576777776532 5565 479999999999999999865
No 42
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=83.90 E-value=0.58 Score=27.11 Aligned_cols=29 Identities=24% Similarity=0.550 Sum_probs=20.3
Q ss_pred CCCCCCCCEeeeCC-CCcEeeCCCCCccCC
Q psy590 44 ASLCAPSAKCHVVK-HRPVCTCPQGYEGNP 72 (257)
Q Consensus 44 ~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~ 72 (257)
...|..++.|.+.. |.+.|+|..||...+
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 45677888998877 889999999997653
No 43
>PHA02887 EGF-like protein; Provisional
Probab=83.55 E-value=1.3 Score=32.51 Aligned_cols=37 Identities=24% Similarity=0.533 Sum_probs=27.0
Q ss_pred CCCCCC---CCCCCCCEeccC--CCCeeeecCCCCccCCCCCcccC
Q psy590 170 INPCQE---DSCGEHAICIPK--NHGIDCQCQSGHQGNPYIACLSE 210 (257)
Q Consensus 170 id~C~~---~~C~~~~~C~~~--~g~~~C~C~~G~~g~~~~~C~~~ 210 (257)
.++|.+ +-|. +|+|... ...+.|.|+.||+|.+ |+.+
T Consensus 83 f~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~R---CE~v 124 (126)
T PHA02887 83 FEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIR---CDEV 124 (126)
T ss_pred ccccChHhhCEee-CCEEEccccCCCceeECCCCcccCC---CCcc
Confidence 566754 4577 4799654 4568999999999998 7643
No 44
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=82.76 E-value=1.4 Score=32.09 Aligned_cols=32 Identities=41% Similarity=0.997 Sum_probs=24.1
Q ss_pred CCCCCCCCCCCCCeeeeCCCceeeeCCCCCccC
Q psy590 103 KPCAVHSPCAPNAVCINKNHATDCSCLEGFQGN 135 (257)
Q Consensus 103 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~ 135 (257)
++|.....|+.++.|.. .....|.|.+||...
T Consensus 78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 35656789999999953 345679999999753
No 45
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=81.29 E-value=1.3 Score=33.14 Aligned_cols=38 Identities=29% Similarity=0.630 Sum_probs=27.6
Q ss_pred CCCCCC---CCCCCCCEeccC--CCCeeeecCCCCccCCCCCcccCc
Q psy590 170 INPCQE---DSCGEHAICIPK--NHGIDCQCQSGHQGNPYIACLSEL 211 (257)
Q Consensus 170 id~C~~---~~C~~~~~C~~~--~g~~~C~C~~G~~g~~~~~C~~~~ 211 (257)
+.+|.+ +-|.++ +|... ...+.|.|..||+|.+ |+-.+
T Consensus 42 i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeR---CEh~d 84 (139)
T PHA03099 42 IRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIR---CQHVV 84 (139)
T ss_pred cccCChhhCCEeECC-EEEeeccCCCceeECCCCccccc---cccee
Confidence 445653 457764 89655 4678999999999998 87544
No 46
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=77.64 E-value=2.8 Score=23.73 Aligned_cols=21 Identities=29% Similarity=0.487 Sum_probs=14.5
Q ss_pred CEeccCCCCeeeecCCCCccCC
Q psy590 182 AICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 182 ~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
+.|.... .+.|.|++||..+.
T Consensus 10 A~CDpn~-~~~C~CPeGyIlde 30 (34)
T PF09064_consen 10 ADCDPNS-PGQCFCPEGYILDE 30 (34)
T ss_pred CccCCCC-CCceeCCCceEecC
Confidence 3565433 34899999998765
No 47
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=74.84 E-value=3.3 Score=30.17 Aligned_cols=31 Identities=32% Similarity=0.984 Sum_probs=24.7
Q ss_pred ccCCcCCCCCCCCEeeeCCCCcEeeCCCCCcc
Q psy590 39 DLCAIASLCAPSAKCHVVKHRPVCTCPQGYEG 70 (257)
Q Consensus 39 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g 70 (257)
+.|.....|+..+.|.. .....|.|.+||..
T Consensus 78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence 77876789999999954 34567999999964
No 48
>KOG1836|consensus
Probab=72.37 E-value=11 Score=40.60 Aligned_cols=48 Identities=19% Similarity=0.309 Sum_probs=28.0
Q ss_pred CCCCCCccCCCCCccCCCCCCCCCCCCCEeccCC--CCeeee-cCCCCccCC
Q psy590 155 DCPPDKLCDRLNRICINPCQEDSCGEHAICIPKN--HGIDCQ-CQSGHQGNP 203 (257)
Q Consensus 155 ~C~~g~~~~~~~~~~id~C~~~~C~~~~~C~~~~--g~~~C~-C~~G~~g~~ 203 (257)
+|..||++....-...| |..=+|..++.|..+. ....|. |++||+|..
T Consensus 760 ~C~~GfYg~~~~~~~~d-C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGD-CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred hhcCCCCCccccCCCCC-CccCCCCCChhhcCcCcccceecCCCCCCCcccc
Confidence 35556655432111122 5555566666665553 456788 999999887
No 49
>KOG3516|consensus
Probab=69.65 E-value=3.8 Score=41.68 Aligned_cols=37 Identities=19% Similarity=0.552 Sum_probs=32.6
Q ss_pred CccCCCCCCCCCCCCCEeccCCCCeeeecC-CCCccCC
Q psy590 167 RICINPCQEDSCGEHAICIPKNHGIDCQCQ-SGHQGNP 203 (257)
Q Consensus 167 ~~~id~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~g~~ 203 (257)
|.-+|.|.+++|.+++.|..+...|.|.|. .||.|..
T Consensus 542 C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Gat 579 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGAT 579 (1306)
T ss_pred cccccccCCccccCCCcccccccceeEecccccccccc
Confidence 333688999999999999999999999999 9999876
No 50
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=69.41 E-value=11 Score=23.33 Aligned_cols=24 Identities=25% Similarity=0.672 Sum_probs=17.5
Q ss_pred CCCCCCCEeccCCCCeeeecCCCCccCC
Q psy590 176 DSCGEHAICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 176 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
..|..++.|++. +|+|++||.-.+
T Consensus 26 ~qC~~~s~C~~g----~C~C~~g~~~~~ 49 (52)
T PF01683_consen 26 EQCIGGSVCVNG----RCQCPPGYVEVG 49 (52)
T ss_pred CCCCCcCEEcCC----EeECCCCCEecC
Confidence 356677888553 799999987654
No 51
>smart00051 DSL delta serrate ligand.
Probab=65.64 E-value=11 Score=24.70 Aligned_cols=22 Identities=27% Similarity=0.345 Sum_probs=14.2
Q ss_pred CCCCEeccCCCCeeeecCCCCccCC
Q psy590 179 GEHAICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 179 ~~~~~C~~~~g~~~C~C~~G~~g~~ 203 (257)
..+..|.. .| .++|.+||+|..
T Consensus 41 ~~~~~Cd~-~G--~~~C~~Gw~G~~ 62 (63)
T smart00051 41 FGHYTCDE-NG--NKGCLEGWMGPY 62 (63)
T ss_pred cCCccCCc-CC--CEecCCCCcCCC
Confidence 34455643 23 478999999875
No 52
>PHA02887 EGF-like protein; Provisional
Probab=61.43 E-value=9.4 Score=28.19 Aligned_cols=35 Identities=26% Similarity=0.821 Sum_probs=25.3
Q ss_pred ccCCc--CCCCCCCCEeeeCC--CCcEeeCCCCCccCCCCCcc
Q psy590 39 DLCAI--ASLCAPSAKCHVVK--HRPVCTCPQGYEGNPATKCY 77 (257)
Q Consensus 39 ~~C~~--~~~C~~~~~C~~~~--g~~~C~C~~G~~g~~~~~C~ 77 (257)
.+|.. .+-|. +|.|.-.. ....|.|..||.|. +|+
T Consensus 84 ~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE 122 (126)
T PHA02887 84 EKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGI---RCD 122 (126)
T ss_pred cccChHhhCEee-CCEEEccccCCCceeECCCCcccC---CCC
Confidence 45552 45676 46897655 46889999999998 665
No 53
>KOG3514|consensus
Probab=59.76 E-value=9.8 Score=38.57 Aligned_cols=25 Identities=20% Similarity=0.452 Sum_probs=24.0
Q ss_pred CCCCCCCCCCCeeccCCCceeeecc
Q psy590 14 AKVPRPCQSDLDCLDAEACYSGQCE 38 (257)
Q Consensus 14 ~c~~~~C~~~~~C~~~~g~~~C~C~ 38 (257)
.|.++||+|+|.|.+.+..|.|.|.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs 649 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCS 649 (1591)
T ss_pred ccCCCcccCCCCccccccccccccc
Confidence 7999999999999999999999995
No 54
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=56.72 E-value=11 Score=28.38 Aligned_cols=31 Identities=26% Similarity=0.745 Sum_probs=23.4
Q ss_pred CCCCCCCCEeeeCC--CCcEeeCCCCCccCCCCCccC
Q psy590 44 ASLCAPSAKCHVVK--HRPVCTCPQGYEGNPATKCYL 78 (257)
Q Consensus 44 ~~~C~~~~~C~~~~--g~~~C~C~~G~~g~~~~~C~~ 78 (257)
.+-|.++ .|.... ..+.|.|..||+|. +|+.
T Consensus 50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGe---RCEh 82 (139)
T PHA03099 50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGI---RCQH 82 (139)
T ss_pred CCEeECC-EEEeeccCCCceeECCCCcccc---cccc
Confidence 3556654 887655 67899999999998 6663
No 55
>KOG0994|consensus
Probab=55.26 E-value=42 Score=34.70 Aligned_cols=22 Identities=32% Similarity=0.773 Sum_probs=16.0
Q ss_pred EeccCCCCeee-ecCCCCccCCC
Q psy590 183 ICIPKNHGIDC-QCQSGHQGNPY 204 (257)
Q Consensus 183 ~C~~~~g~~~C-~C~~G~~g~~~ 204 (257)
.|.....++.| +|..||.|+..
T Consensus 877 ~CqD~T~G~~CdrCl~GyyGdP~ 899 (1758)
T KOG0994|consen 877 DCQDSTTGHSCDRCLDGYYGDPR 899 (1758)
T ss_pred cccccccccchhhhhccccCCcc
Confidence 35666777888 58899987653
No 56
>KOG3516|consensus
Probab=43.53 E-value=17 Score=37.20 Aligned_cols=34 Identities=18% Similarity=0.421 Sum_probs=29.6
Q ss_pred ccccCCCCCCCCCCCCCCCCeeccCCCceeeecc
Q psy590 5 TVQCALPGTAKVPRPCQSDLDCLDAEACYSGQCE 38 (257)
Q Consensus 5 ~~~~~~~~~~c~~~~C~~~~~C~~~~g~~~C~C~ 38 (257)
..-.|+-.|+|.+++|.++|.|.-.+..|.|.|.
T Consensus 538 ~id~C~i~drClPN~CehgG~C~Qs~~~f~C~C~ 571 (1306)
T KOG3516|consen 538 QIDMCGISDRCLPNPCEHGGKCSQSWDDFECNCE 571 (1306)
T ss_pred eecccccccccCCccccCCCcccccccceeEecc
Confidence 3456778899999999999999999999999984
No 57
>KOG1836|consensus
Probab=43.40 E-value=84 Score=34.22 Aligned_cols=27 Identities=26% Similarity=0.671 Sum_probs=17.5
Q ss_pred CCCCCCeeeeC--CCceeee-CCCCCccCC
Q psy590 110 PCAPNAVCINK--NHATDCS-CLEGFQGNG 136 (257)
Q Consensus 110 ~C~~~~~C~~~--~g~~~C~-C~~g~~g~~ 136 (257)
+|.+.+.|... .....|. |++||+|..
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCcccc
Confidence 35444455433 3456788 999999984
No 58
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=38.57 E-value=43 Score=20.43 Aligned_cols=12 Identities=25% Similarity=0.933 Sum_probs=10.8
Q ss_pred eeecCCCCccCC
Q psy590 192 DCQCQSGHQGNP 203 (257)
Q Consensus 192 ~C~C~~G~~g~~ 203 (257)
+|.|++||+|..
T Consensus 20 ~C~C~~~~~G~~ 31 (50)
T cd00055 20 QCECKPNTTGRR 31 (50)
T ss_pred EEeCCCcCCCCC
Confidence 689999999987
No 59
>KOG3514|consensus
Probab=38.57 E-value=18 Score=36.82 Aligned_cols=36 Identities=25% Similarity=0.709 Sum_probs=30.2
Q ss_pred CCCCCCCCCCCEeccCCCCeeeecC-CCCccCCCCCcccC
Q psy590 172 PCQEDSCGEHAICIPKNHGIDCQCQ-SGHQGNPYIACLSE 210 (257)
Q Consensus 172 ~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~g~~~~~C~~~ 210 (257)
.|.++||.+++.|......|.|.|. .||.|.. |+.+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~---CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRT---CERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcc---ccce
Confidence 6888999999999999999999996 5777765 7643
No 60
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=31.97 E-value=27 Score=21.15 Aligned_cols=19 Identities=32% Similarity=0.782 Sum_probs=14.8
Q ss_pred EeccCCCCeeeecCCCCccCC
Q psy590 183 ICIPKNHGIDCQCQSGHQGNP 203 (257)
Q Consensus 183 ~C~~~~g~~~C~C~~G~~g~~ 203 (257)
.|... ..+|.|+++|+|..
T Consensus 12 ~C~~~--~G~C~C~~~~~G~~ 30 (49)
T PF00053_consen 12 TCDPS--TGQCVCKPGTTGPR 30 (49)
T ss_dssp SEEET--CEEESBSTTEESTT
T ss_pred cccCC--CCEEeccccccCCc
Confidence 56553 45899999999998
No 61
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=28.52 E-value=53 Score=23.82 Aligned_cols=26 Identities=23% Similarity=0.699 Sum_probs=19.5
Q ss_pred CCCCCCCEeccCC-----CCeeeecCCCCcc
Q psy590 176 DSCGEHAICIPKN-----HGIDCQCQSGHQG 201 (257)
Q Consensus 176 ~~C~~~~~C~~~~-----g~~~C~C~~G~~g 201 (257)
+.|..++.|++.. .=|.|.|.+.+..
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 6788899998873 3388999886543
No 62
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=26.92 E-value=1.4e+02 Score=20.78 Aligned_cols=29 Identities=21% Similarity=0.297 Sum_probs=22.4
Q ss_pred CCCCCCCCCCCeecc-CCCceeeeccccCC
Q psy590 14 AKVPRPCQSDLDCLD-AEACYSGQCEDLCA 42 (257)
Q Consensus 14 ~c~~~~C~~~~~C~~-~~g~~~C~C~~~C~ 42 (257)
||....|..+..|+- ..+.-.|.|++.|.
T Consensus 1 pC~~v~C~~G~~C~~d~~~~p~CvC~~~Cp 30 (86)
T cd01328 1 PCENHHCGAGKVCEVDDENTPKCVCIDPCP 30 (86)
T ss_pred CCCCcCCCCCCEeeECCCCCeEEecCCcCC
Confidence 577888888888874 35667888888887
No 63
>KOG3512|consensus
Probab=23.89 E-value=1.9e+02 Score=26.98 Aligned_cols=88 Identities=20% Similarity=0.448 Sum_probs=0.0
Q ss_pred CCCCeeeeCCCc-eeeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCccCC---CCCccCCCCCCCCCCCCCE----
Q psy590 112 APNAVCINKNHA-TDCSCLEGFQGNGFVGCLPVRNYQPICQYNEDCPPDKLCDR---LNRICINPCQEDSCGEHAI---- 183 (257)
Q Consensus 112 ~~~~~C~~~~g~-~~C~C~~g~~g~~~~~C~~~~~~~~~C~~~~~C~~g~~~~~---~~~~~id~C~~~~C~~~~~---- 183 (257)
.+...|+-...+ +.|.|.+.-.|. .|. .|.+.|.... ..-.++++|....|..++.
T Consensus 282 gHAs~Cv~d~~~~ltCdC~HNTaGP-------------dCg---rCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrf 345 (592)
T KOG3512|consen 282 GHASRCVMDESSHLTCDCEHNTAGP-------------DCG---RCKPFYYDRPWGRATALPANECVACNCNGHARRCRF 345 (592)
T ss_pred CccceeeeccCCceEEecccCCCCC-------------Ccc---cccccccCCCccccccCCCccccccccchhhhhccc
Q ss_pred ------------------eccCCCCeeee-cCCCCccCCCCCcccCccccC
Q psy590 184 ------------------CIPKNHGIDCQ-CQSGHQGNPYIACLSELICLC 215 (257)
Q Consensus 184 ------------------C~~~~g~~~C~-C~~G~~g~~~~~C~~~~~C~~ 215 (257)
|.....+.+|. |++||..++...-.+...|..
T Consensus 346 n~Ely~lSgr~SggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~ 396 (592)
T KOG3512|consen 346 NMELYRLSGRRSGGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKA 396 (592)
T ss_pred chhhhcccCccccceEeecccCCCCcccccccCccccCCCCCCchhhhhhh
No 64
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=23.33 E-value=26 Score=21.73 Aligned_cols=21 Identities=29% Similarity=0.619 Sum_probs=14.2
Q ss_pred eecCCCCccCCCCCcccCccc
Q psy590 193 CQCQSGHQGNPYIACLSELIC 213 (257)
Q Consensus 193 C~C~~G~~g~~~~~C~~~~~C 213 (257)
|.|++||..+....|.+..+|
T Consensus 35 C~C~~G~v~~~~~~CV~~~~C 55 (55)
T PF01826_consen 35 CFCPPGYVRNDNGRCVPPSEC 55 (55)
T ss_dssp EEETTTEEEETTSEEEEGGGC
T ss_pred CCCCCCeeEcCCCCEEcHHHC
Confidence 889999986654346655443
Done!