Query psy4097
Match_columns 129
No_of_seqs 181 out of 1834
Neff 10.4
Searched_HMMs 46136
Date Fri Aug 16 16:52:38 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4097.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4097hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.6 7.9E-15 1.7E-19 108.4 7.9 111 3-124 746-865 (1289)
2 KOG1219|consensus 99.6 7.7E-15 1.7E-19 116.5 7.7 103 3-121 3874-3977(4289)
3 KOG4289|consensus 99.3 2.6E-12 5.7E-17 99.6 6.5 100 9-122 1217-1317(2531)
4 KOG1219|consensus 99.1 1.1E-10 2.3E-15 94.0 6.0 82 29-123 3859-3940(4289)
5 KOG1214|consensus 99.0 8.2E-10 1.8E-14 82.6 6.2 106 4-119 705-821 (1289)
6 KOG4289|consensus 98.6 1.9E-08 4.1E-13 79.1 2.7 56 60-123 1221-1276(2531)
7 KOG4260|consensus 98.4 1.7E-07 3.6E-12 62.4 3.0 89 13-116 216-304 (350)
8 PF00008 EGF: EGF-like domain 98.4 9.5E-08 2.1E-12 43.7 1.1 31 88-119 1-32 (32)
9 KOG1217|consensus 98.4 1.9E-06 4.2E-11 62.0 8.2 104 3-120 181-306 (487)
10 smart00179 EGF_CA Calcium-bind 98.4 1E-06 2.3E-11 41.8 4.0 35 86-121 3-39 (39)
11 KOG1217|consensus 98.3 4.6E-06 1E-10 60.1 7.9 107 5-123 243-355 (487)
12 PF07645 EGF_CA: Calcium-bindi 98.2 8.8E-07 1.9E-11 43.1 2.2 32 35-70 3-34 (42)
13 PF12947 EGF_3: EGF domain; I 98.2 6.3E-07 1.4E-11 42.0 1.5 28 2-30 9-36 (36)
14 PF07645 EGF_CA: Calcium-bindi 98.1 1.1E-06 2.3E-11 42.8 1.4 30 86-116 3-34 (42)
15 cd00054 EGF_CA Calcium-binding 98.1 6.1E-06 1.3E-10 38.7 3.8 35 86-121 3-38 (38)
16 PF12662 cEGF: Complement Clr- 98.0 6E-06 1.3E-10 35.0 1.9 22 13-36 1-24 (24)
17 PF12947 EGF_3: EGF domain; I 98.0 3.7E-06 8E-11 39.4 1.2 32 39-74 3-34 (36)
18 KOG1225|consensus 97.9 2.9E-05 6.3E-10 56.8 5.8 78 15-123 266-343 (525)
19 smart00181 EGF Epidermal growt 97.9 2.8E-05 6.1E-10 36.0 3.6 29 91-121 6-35 (35)
20 cd00053 EGF Epidermal growth f 97.9 3.4E-05 7.5E-10 35.5 3.7 30 90-120 5-35 (36)
21 KOG4260|consensus 97.9 1.2E-05 2.5E-10 53.8 2.6 92 11-117 165-269 (350)
22 smart00179 EGF_CA Calcium-bind 97.5 0.00023 5E-09 33.5 3.6 33 35-72 3-36 (39)
23 KOG1225|consensus 97.5 0.00065 1.4E-08 50.0 7.1 79 15-124 235-313 (525)
24 PF14670 FXa_inhibition: Coagu 97.3 0.00016 3.5E-09 33.8 1.8 25 5-30 10-36 (36)
25 PF00008 EGF: EGF-like domain 97.2 0.00023 5.1E-09 32.3 1.3 27 43-72 5-31 (32)
26 PF12661 hEGF: Human growth fa 97.1 0.00019 4E-09 25.9 0.7 13 108-120 1-13 (13)
27 cd00054 EGF_CA Calcium-binding 96.9 0.002 4.3E-08 29.8 3.4 33 35-72 3-35 (38)
28 PHA03099 epidermal growth fact 96.8 0.0013 2.9E-08 39.1 2.8 36 91-127 51-87 (139)
29 PHA02887 EGF-like protein; Pro 96.5 0.0036 7.8E-08 36.8 3.0 38 86-124 84-125 (126)
30 PF06247 Plasmod_Pvs28: Plasmo 96.5 0.00072 1.6E-08 43.0 0.1 100 7-116 13-119 (197)
31 smart00181 EGF Epidermal growt 96.4 0.0074 1.6E-07 27.5 3.4 25 42-71 6-30 (35)
32 cd00053 EGF Epidermal growth f 96.3 0.0059 1.3E-07 27.6 2.7 21 3-24 10-30 (36)
33 PF14670 FXa_inhibition: Coagu 96.1 0.0029 6.4E-08 29.5 1.0 25 92-119 7-31 (36)
34 PF07974 EGF_2: EGF-like domai 96.0 0.013 2.8E-07 26.6 2.9 26 92-120 7-32 (32)
35 KOG1226|consensus 95.7 0.075 1.6E-06 40.9 7.3 90 11-122 472-581 (783)
36 PF06247 Plasmod_Pvs28: Plasmo 95.4 0.0063 1.4E-07 38.9 0.8 104 4-119 55-163 (197)
37 PF12946 EGF_MSP1_1: MSP1 EGF 95.3 0.0074 1.6E-07 28.1 0.6 28 2-30 8-36 (37)
38 KOG0994|consensus 94.5 0.15 3.3E-06 41.3 6.1 55 61-122 885-949 (1758)
39 cd01475 vWA_Matrilin VWA_Matri 94.2 0.077 1.7E-06 35.0 3.6 36 78-119 183-220 (224)
40 cd01475 vWA_Matrilin VWA_Matri 93.8 0.079 1.7E-06 35.0 3.1 40 26-72 180-219 (224)
41 smart00051 DSL delta serrate l 93.4 0.21 4.7E-06 26.3 3.8 46 61-120 17-63 (63)
42 KOG1836|consensus 92.4 0.41 9E-06 40.6 5.8 56 62-124 757-815 (1705)
43 KOG3516|consensus 91.1 0.22 4.8E-06 40.3 2.8 36 86-122 546-582 (1306)
44 KOG3514|consensus 86.2 0.61 1.3E-05 37.8 2.3 35 87-122 625-660 (1591)
45 KOG1226|consensus 85.8 4.1 8.8E-05 32.0 6.3 63 43-124 556-623 (783)
46 KOG0994|consensus 83.6 6.3 0.00014 32.8 6.7 17 107-123 530-546 (1758)
47 PF00954 S_locus_glycop: S-loc 82.9 2.2 4.7E-05 24.9 3.2 31 86-118 78-109 (110)
48 PF00053 Laminin_EGF: Laminin 76.7 1.8 3.9E-05 21.2 1.3 18 105-122 16-33 (49)
49 PF09064 Tme5_EGF_like: Thromb 76.4 3.5 7.6E-05 18.8 2.0 13 107-119 18-30 (34)
50 KOG3516|consensus 73.5 3.9 8.6E-05 33.7 3.0 37 86-123 956-993 (1306)
51 KOG3512|consensus 71.9 16 0.00034 27.4 5.4 18 104-121 461-478 (592)
52 PF01414 DSL: Delta serrate li 70.0 1.2 2.7E-05 23.4 -0.3 46 61-120 17-63 (63)
53 cd00055 EGF_Lam Laminin-type e 67.0 4.7 0.0001 19.9 1.5 14 108-121 20-33 (50)
54 smart00180 EGF_Lam Laminin-typ 66.4 5.1 0.00011 19.4 1.6 14 108-121 19-32 (46)
55 PF12955 DUF3844: Domain of un 66.0 6.2 0.00014 23.0 2.0 33 91-124 13-63 (103)
56 PF01683 EB: EB module; Inter 60.2 8.7 0.00019 19.0 1.8 9 15-24 38-46 (52)
57 cd01328 FSL_SPARC Follistatin- 51.7 25 0.00054 19.8 2.8 8 101-108 41-48 (86)
58 PF01826 TIL: Trypsin Inhibito 44.4 5.4 0.00012 20.0 -0.5 17 16-33 35-52 (55)
59 KOG1836|consensus 42.2 43 0.00093 29.5 3.9 60 7-72 749-809 (1705)
60 KOG3509|consensus 38.0 34 0.00073 28.1 2.6 38 86-124 407-444 (964)
61 PF02822 Antistasin: Antistasi 33.4 23 0.0005 15.0 0.7 14 18-31 9-22 (26)
62 PF04863 EGF_alliinase: Alliin 33.0 22 0.00049 18.1 0.7 21 103-123 32-52 (56)
63 PF09289 FOLN: Follistatin/Ost 29.3 43 0.00092 13.7 1.1 8 92-99 5-12 (22)
64 smart00274 FOLN Follistatin-N- 28.7 59 0.0013 13.7 2.0 19 91-109 5-23 (26)
65 KOG3514|consensus 26.0 49 0.0011 27.8 1.8 37 86-123 1019-1056(1591)
66 KOG0196|consensus 24.4 1.5E+02 0.0033 24.3 4.1 16 101-116 302-317 (996)
67 KOG1218|consensus 21.8 2.8E+02 0.006 19.0 6.9 15 108-122 163-177 (316)
No 1
>KOG1214|consensus
Probab=99.58 E-value=7.9e-15 Score=108.43 Aligned_cols=111 Identities=30% Similarity=0.715 Sum_probs=90.8
Q ss_pred CCceeeeCCCceecccCCCCcc--cCCCCcccCC-----CCccCCCCCCCCCCeee--eCCCCCcceeeecCCCCCcCCC
Q psy4097 3 PYVECINTQGYRKCGQCPHGWV--GDGTTCRQGT-----TGCSVQNGGCHPLATCR--ETSDTVRSVISCTCPPGMGGSG 73 (129)
Q Consensus 3 ~~~~C~~~~g~~~C~~C~~g~~--~~~~~C~~~~-----~~C~~~~~~C~~~~~C~--~~~~g~~~~~~c~c~~g~~g~~ 73 (129)
+++.|+|.+++|.| .|..||. .++.+|..+. +.|....+.|...++|. ... ++ .|.|.|.+||.|+|
T Consensus 746 p~s~Cin~pg~~rc-eC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG-gs--~y~C~CLPGfsGDG 821 (1289)
T KOG1214|consen 746 PNSVCINLPGSYRC-ECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG-GS--TYSCACLPGFSGDG 821 (1289)
T ss_pred CCceeecCCCceeE-EEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC-Cc--eEEEeecCCccCCc
Confidence 67899999999999 9999998 5667887552 55776666787655544 444 32 49999999999998
Q ss_pred CCcCCcccCCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCcccc
Q psy4097 74 VGLMGCAYGMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESELLVP 124 (129)
Q Consensus 74 ~~~~~c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~~ 124 (129)
. .|. ++ |+|.+..|+..+.| .+.++++.|+|.+||.|++-+.++
T Consensus 822 ~---~c~-dv--DeC~psrChp~A~C-yntpgsfsC~C~pGy~GDGf~CVP 865 (1289)
T KOG1214|consen 822 H---QCT-DV--DECSPSRCHPAATC-YNTPGSFSCRCQPGYYGDGFQCVP 865 (1289)
T ss_pred c---ccc-cc--cccCccccCCCceE-ecCCCcceeecccCccCCCceecC
Confidence 8 787 77 99999999999999 999999999999999998865444
No 2
>KOG1219|consensus
Probab=99.57 E-value=7.7e-15 Score=116.47 Aligned_cols=103 Identities=28% Similarity=0.663 Sum_probs=91.7
Q ss_pred CCceeeeCC-CceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCccc
Q psy4097 3 PYVECINTQ-GYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAY 81 (129)
Q Consensus 3 ~~~~C~~~~-g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~ 81 (129)
++++|...+ |+|.| .|++.|. |+.|+...++|.++ ||..+++|.... +. |.|.|+.+|+|. .|+
T Consensus 3874 hgG~C~~~~~ggy~C-kCpsqys--G~~CEi~~epC~sn--PC~~GgtCip~~-n~---f~CnC~~gyTG~-----~Ce- 3938 (4289)
T KOG1219|consen 3874 HGGTCISQPKGGYKC-KCPSQYS--GNHCEIDLEPCASN--PCLTGGTCIPFY-NG---FLCNCPNGYTGK-----RCE- 3938 (4289)
T ss_pred CCCEecCCCCCceEE-eCccccc--CcccccccccccCC--CCCCCCEEEecC-CC---eeEeCCCCccCc-----eee-
Confidence 567888887 67999 9999999 99998766999988 999999999877 54 999999999999 998
Q ss_pred CCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCc
Q psy4097 82 GMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESEL 121 (129)
Q Consensus 82 ~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~ 121 (129)
....++|..++|.+++.| ++..|+|+|.|.+||.|..|.
T Consensus 3939 ~~Gi~eCs~n~C~~gg~C-~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3939 ARGISECSKNVCGTGGQC-INIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred cccccccccccccCCcee-eccCCceEeccChhHhcccCc
Confidence 432289999999999999 999999999999999999874
No 3
>KOG4289|consensus
Probab=99.34 E-value=2.6e-12 Score=99.55 Aligned_cols=100 Identities=26% Similarity=0.684 Sum_probs=81.9
Q ss_pred eCCCceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCC
Q psy4097 9 NTQGYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPC 88 (129)
Q Consensus 9 ~~~g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C 88 (129)
+..+.+.| .|++||+ +..|+..+|.|-+. +|.+++.|...+ +. |+|.|.++|+|. +|+.+.....|
T Consensus 1217 ~pvnglrC-rCPpGFT--gd~CeTeiDlCYs~--pC~nng~C~srE-gg---YtCeCrpg~tGe-----hCEvs~~agrC 1282 (2531)
T KOG4289|consen 1217 HPVNGLRC-RCPPGFT--GDYCETEIDLCYSG--PCGNNGRCRSRE-GG---YTCECRPGFTGE-----HCEVSARAGRC 1282 (2531)
T ss_pred cccCceeE-eCCCCCC--cccccchhHhhhcC--CCCCCCceEEec-Cc---eeEEecCCcccc-----ceeeecccCcc
Confidence 44567999 9999999 77998766999987 999999999888 76 999999999999 89732222467
Q ss_pred CCCCCCCCCeeccCCCCceeeeCCCC-CccCCCcc
Q psy4097 89 GGVTCEHGGICAPIGDRGYRCQCEPG-FTGESELL 122 (129)
Q Consensus 89 ~~~~C~~~~~C~~~~~~~~~C~C~~g-~~g~~c~~ 122 (129)
.+..|.+++.|+....+++.|.|+.| |.+..|+.
T Consensus 1283 vpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1283 VPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred ccceecCCCEEeecCCCceeccCCCcccCCCceEE
Confidence 78889999999434568899999987 56777764
No 4
>KOG1219|consensus
Probab=99.13 E-value=1.1e-10 Score=93.98 Aligned_cols=82 Identities=28% Similarity=0.649 Sum_probs=70.4
Q ss_pred CcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCCCCCeeccCCCCcee
Q psy4097 29 TCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCEHGGICAPIGDRGYR 108 (129)
Q Consensus 29 ~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~ 108 (129)
.|....+.|..+ ||++++.|...+ ++ +|.|.|++.|.|. +|+.+. ++|..+||..++.| +...++|.
T Consensus 3859 gC~l~~d~C~~n--pCqhgG~C~~~~-~g--gy~CkCpsqysG~-----~CEi~~--epC~snPC~~GgtC-ip~~n~f~ 3925 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDN--PCQHGGTCISQP-KG--GYKCKCPSQYSGN-----HCEIDL--EPCASNPCLTGGTC-IPFYNGFL 3925 (4289)
T ss_pred cccccccccccC--cccCCCEecCCC-CC--ceEEeCcccccCc-----cccccc--ccccCCCCCCCCEE-EecCCCee
Confidence 455444788877 999999999877 42 4999999999999 998444 89999999999999 88999999
Q ss_pred eeCCCCCccCCCccc
Q psy4097 109 CQCEPGFTGESELLV 123 (129)
Q Consensus 109 C~C~~g~~g~~c~~~ 123 (129)
|.|+.||+|++|+..
T Consensus 3926 CnC~~gyTG~~Ce~~ 3940 (4289)
T KOG1219|consen 3926 CNCPNGYTGKRCEAR 3940 (4289)
T ss_pred EeCCCCccCceeecc
Confidence 999999999999854
No 5
>KOG1214|consensus
Probab=99.01 E-value=8.2e-10 Score=82.59 Aligned_cols=106 Identities=30% Similarity=0.770 Sum_probs=79.9
Q ss_pred CceeeeCCC-ceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCc--CCCCCcCCcc
Q psy4097 4 YVECINTQG-YRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMG--GSGVGLMGCA 80 (129)
Q Consensus 4 ~~~C~~~~g-~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~--g~~~~~~~c~ 80 (129)
++.|....+ .|.| .|..||.+++++|.++ ++|+.....|..+++|++.+ ++ |.|.|..+|. +.+. .|.
T Consensus 705 ~a~C~pg~~~~~tc-ecs~g~~gdgr~c~d~-~eca~~~~~CGp~s~Cin~p-g~---~rceC~~gy~F~dd~~---tCV 775 (1289)
T KOG1214|consen 705 TARCHPGTGVDYTC-ECSSGYQGDGRNCVDE-NECATGFHRCGPNSVCINLP-GS---YRCECRSGYEFADDRH---TCV 775 (1289)
T ss_pred CccccCCCCcceEE-EEeeccCCCCCCCCCh-hhhccCCCCCCCCceeecCC-Cc---eeEEEeecceeccCCc---ceE
Confidence 456666554 6999 9999999999999999 99999888999999999999 88 9999988875 3333 465
Q ss_pred cCCC----CCCCC--CCCCCCCCe--eccCCCCceeeeCCCCCccCC
Q psy4097 81 YGMS----GNPCG--GVTCEHGGI--CAPIGDRGYRCQCEPGFTGES 119 (129)
Q Consensus 81 ~~~~----~~~C~--~~~C~~~~~--C~~~~~~~~~C~C~~g~~g~~ 119 (129)
.+. .+.|. .+.|...+. |+....+.|.|.|.+||.|++
T Consensus 776 -~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG 821 (1289)
T KOG1214|consen 776 -LITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG 821 (1289)
T ss_pred -EecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCc
Confidence 332 14454 255654334 423456789999999999887
No 6
>KOG4289|consensus
Probab=98.63 E-value=1.9e-08 Score=79.07 Aligned_cols=56 Identities=38% Similarity=0.908 Sum_probs=50.4
Q ss_pred eeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCccc
Q psy4097 60 VISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESELLV 123 (129)
Q Consensus 60 ~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~ 123 (129)
++.|+|++||+++ .|+ ... |.|-..||.+++.| ....++|.|.|.+||+|+.|+.+
T Consensus 1221 glrCrCPpGFTgd-----~Ce-Tei-DlCYs~pC~nng~C-~srEggYtCeCrpg~tGehCEvs 1276 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGD-----YCE-TEI-DLCYSGPCGNNGRC-RSREGGYTCECRPGFTGEHCEVS 1276 (2531)
T ss_pred ceeEeCCCCCCcc-----ccc-chh-HhhhcCCCCCCCce-EEecCceeEEecCCccccceeee
Confidence 3889999999999 898 544 89999999999999 88999999999999999999864
No 7
>KOG4260|consensus
Probab=98.44 E-value=1.7e-07 Score=62.36 Aligned_cols=89 Identities=26% Similarity=0.620 Sum_probs=67.8
Q ss_pred ceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCC
Q psy4097 13 YRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVT 92 (129)
Q Consensus 13 ~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~ 92 (129)
+..|+.|..||..+...|.++ |+|...+.+|.....|+|+. || |.|...+||.. +.. .|++.. +.|.
T Consensus 216 ~k~C~kCkkGW~lde~gCvDv-nEC~~ep~~c~~~qfCvNte-GS---f~C~dk~Gy~~-g~d--~C~~~~--d~~~--- 282 (350)
T KOG4260|consen 216 SKGCSKCKKGWKLDEEGCVDV-NECQNEPAPCKAHQFCVNTE-GS---FKCEDKEGYKK-GVD--ECQFCA--DVCA--- 282 (350)
T ss_pred CCChhhhcccceecccccccH-HHHhcCCCCCChhheeecCC-Cc---eEecccccccC-ChH--Hhhhhh--hhcc---
Confidence 455668999999888899999 99998888999999999999 99 99999888876 222 444111 2232
Q ss_pred CCCCCeeccCCCCceeeeCCCCCc
Q psy4097 93 CEHGGICAPIGDRGYRCQCEPGFT 116 (129)
Q Consensus 93 C~~~~~C~~~~~~~~~C~C~~g~~ 116 (129)
.....| .+..+.|+|.|..++.
T Consensus 283 -~kn~~c-~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 283 -SKNRPC-MNIDGQYRCVCFSGLI 304 (350)
T ss_pred -cCCCCc-ccCCccEEEEecccce
Confidence 123357 8899999999988763
No 8
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.42 E-value=9.5e-08 Score=43.72 Aligned_cols=31 Identities=42% Similarity=1.117 Sum_probs=26.0
Q ss_pred CCCCCCCCCCeeccCCC-CceeeeCCCCCccCC
Q psy4097 88 CGGVTCEHGGICAPIGD-RGYRCQCEPGFTGES 119 (129)
Q Consensus 88 C~~~~C~~~~~C~~~~~-~~~~C~C~~g~~g~~ 119 (129)
|.+++|.+++.| +... +.|.|.|++||.|+.
T Consensus 1 C~~~~C~n~g~C-~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTC-IDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEE-EEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEE-EeCCCCCEEeECCCCCccCC
Confidence 445689999999 6666 999999999999873
No 9
>KOG1217|consensus
Probab=98.41 E-value=1.9e-06 Score=62.05 Aligned_cols=104 Identities=37% Similarity=0.839 Sum_probs=73.1
Q ss_pred CCceeeeCCCceecccCCCCcccCCCCcccC-------------------CCCccCCCCCCCCC-CeeeeCCCCCcceee
Q psy4097 3 PYVECINTQGYRKCGQCPHGWVGDGTTCRQG-------------------TTGCSVQNGGCHPL-ATCRETSDTVRSVIS 62 (129)
Q Consensus 3 ~~~~C~~~~g~~~C~~C~~g~~~~~~~C~~~-------------------~~~C~~~~~~C~~~-~~C~~~~~g~~~~~~ 62 (129)
.+..|.+..++|.| .|.++|. +..|... ...|......|... ++|.+.. ++ +.
T Consensus 181 ~~~~C~~~~~~~~C-~c~~~~~--~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~-~~---~~ 253 (487)
T KOG1217|consen 181 NGGTCVNTGGSYLC-SCPPGYT--GSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTV-GS---YT 253 (487)
T ss_pred CCcccccCCCCeeE-eCCCCcc--CCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccC-Cc---ee
Confidence 35678888888999 9999998 3333211 01222221223322 6788777 66 89
Q ss_pred ecCCCCCcCCC-CCcCCcccCCCCCCCCCC-CCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097 63 CTCPPGMGGSG-VGLMGCAYGMSGNPCGGV-TCEHGGICAPIGDRGYRCQCEPGFTGESE 120 (129)
Q Consensus 63 c~c~~g~~g~~-~~~~~c~~~~~~~~C~~~-~C~~~~~C~~~~~~~~~C~C~~g~~g~~c 120 (129)
|.++++|.+.. . .+. ++ +.|... +|.+++.| ++..+.|.|.|++||.|..+
T Consensus 254 C~~~~g~~~~~~~---~~~-~~--~~C~~~~~c~~~~~C-~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 254 CRCPEGYTGDACV---TCV-DV--DSCALIASCPNGGTC-VNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred eeCCCCccccccc---eee-ec--cccCCCCccCCCCee-ecCCCcceeeCCCCCCCCCC
Confidence 99999998773 1 244 55 788754 38888899 88888899999999999987
No 10
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.35 E-value=1e-06 Score=41.84 Aligned_cols=35 Identities=37% Similarity=0.970 Sum_probs=30.2
Q ss_pred CCCCC-CCCCCCCeeccCCCCceeeeCCCCCc-cCCCc
Q psy4097 86 NPCGG-VTCEHGGICAPIGDRGYRCQCEPGFT-GESEL 121 (129)
Q Consensus 86 ~~C~~-~~C~~~~~C~~~~~~~~~C~C~~g~~-g~~c~ 121 (129)
++|.. .+|.+++.| ++..++|.|.|++||. |..|+
T Consensus 3 ~~C~~~~~C~~~~~C-~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 3 DECASGNPCQNGGTC-VNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred ccCcCCCCcCCCCEe-ECCCCCeEeECCCCCccCCcCC
Confidence 67776 689888899 8999999999999999 87774
No 11
>KOG1217|consensus
Probab=98.29 E-value=4.6e-06 Score=60.06 Aligned_cols=107 Identities=36% Similarity=0.853 Sum_probs=79.4
Q ss_pred ceeeeCCCceecccCCCCcccCC-CCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCC
Q psy4097 5 VECINTQGYRKCGQCPHGWVGDG-TTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGM 83 (129)
Q Consensus 5 ~~C~~~~g~~~C~~C~~g~~~~~-~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~ 83 (129)
..|+++.+++.| .|.+||..+. ..|.++ ++|..... |.++++|.+.. +. |.|.|+++|.+..- ..+. +.
T Consensus 243 ~~c~~~~~~~~C-~~~~g~~~~~~~~~~~~-~~C~~~~~-c~~~~~C~~~~-~~---~~C~C~~g~~g~~~--~~~~-~~ 312 (487)
T KOG1217|consen 243 GTCVNTVGSYTC-RCPEGYTGDACVTCVDV-DSCALIAS-CPNGGTCVNVP-GS---YRCTCPPGFTGRLC--TECV-DV 312 (487)
T ss_pred CcccccCCceee-eCCCCccccccceeeec-cccCCCCc-cCCCCeeecCC-Cc---ceeeCCCCCCCCCC--cccc-cc
Confidence 678888888999 9999999554 456677 88988643 88889999988 76 89999999998832 0122 33
Q ss_pred CCCCCC----CCCCCCCCeec-cCCCCceeeeCCCCCccCCCccc
Q psy4097 84 SGNPCG----GVTCEHGGICA-PIGDRGYRCQCEPGFTGESELLV 123 (129)
Q Consensus 84 ~~~~C~----~~~C~~~~~C~-~~~~~~~~C~C~~g~~g~~c~~~ 123 (129)
..|. ..+|..++.|. ......+.|.|..+|.|..|+..
T Consensus 313 --~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~ 355 (487)
T KOG1217|consen 313 --DECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDS 355 (487)
T ss_pred --ccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccC
Confidence 5663 35577766671 34445788999999999988754
No 12
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.24 E-value=8.8e-07 Score=43.08 Aligned_cols=32 Identities=34% Similarity=0.778 Sum_probs=29.7
Q ss_pred CCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCc
Q psy4097 35 TGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMG 70 (129)
Q Consensus 35 ~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~ 70 (129)
+||....+.|..++.|+|+. |+ |.|.|++||.
T Consensus 3 dEC~~~~~~C~~~~~C~N~~-Gs---y~C~C~~Gy~ 34 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVNTE-GS---YSCSCPPGYE 34 (42)
T ss_dssp STTTTTSSSSSTTSEEEEET-TE---EEEEESTTEE
T ss_pred cccCCCCCcCCCCCEEEcCC-CC---EEeeCCCCcE
Confidence 89998888898899999999 98 9999999997
No 13
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.23 E-value=6.3e-07 Score=42.00 Aligned_cols=28 Identities=43% Similarity=0.954 Sum_probs=22.9
Q ss_pred CCCceeeeCCCceecccCCCCcccCCCCc
Q psy4097 2 TPYVECINTQGYRKCGQCPHGWVGDGTTC 30 (129)
Q Consensus 2 ~~~~~C~~~~g~~~C~~C~~g~~~~~~~C 30 (129)
+++|.|+++.++|.| .|.+||.++|..|
T Consensus 9 ~~nA~C~~~~~~~~C-~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 9 HPNATCTNTGGSYTC-TCKPGYEGDGFFC 36 (36)
T ss_dssp -TTCEEEE-TTSEEE-EE-CEEECCSTCE
T ss_pred CCCcEeecCCCCEEe-ECCCCCccCCcCC
Confidence 578999999999999 9999999888765
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.15 E-value=1.1e-06 Score=42.79 Aligned_cols=30 Identities=30% Similarity=0.961 Sum_probs=26.6
Q ss_pred CCCCC--CCCCCCCeeccCCCCceeeeCCCCCc
Q psy4097 86 NPCGG--VTCEHGGICAPIGDRGYRCQCEPGFT 116 (129)
Q Consensus 86 ~~C~~--~~C~~~~~C~~~~~~~~~C~C~~g~~ 116 (129)
++|.. +.|...+.| +|+.|+|.|.|++||.
T Consensus 3 dEC~~~~~~C~~~~~C-~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 3 DECAEGPHNCPENGTC-VNTEGSYSCSCPPGYE 34 (42)
T ss_dssp STTTTTSSSSSTTSEE-EEETTEEEEEESTTEE
T ss_pred cccCCCCCcCCCCCEE-EcCCCCEEeeCCCCcE
Confidence 78873 569878899 9999999999999998
No 15
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.12 E-value=6.1e-06 Score=38.72 Aligned_cols=35 Identities=37% Similarity=0.955 Sum_probs=29.7
Q ss_pred CCCCC-CCCCCCCeeccCCCCceeeeCCCCCccCCCc
Q psy4097 86 NPCGG-VTCEHGGICAPIGDRGYRCQCEPGFTGESEL 121 (129)
Q Consensus 86 ~~C~~-~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~ 121 (129)
++|.. .+|.+++.| .+..+.|.|.|+.||.|..|+
T Consensus 3 ~~C~~~~~C~~~~~C-~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASGNPCQNGGTC-VNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCCCCcCCCCEe-ECCCCCeEeECCCCCcCCcCC
Confidence 66765 688877899 888999999999999998774
No 16
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.98 E-value=6e-06 Score=34.95 Aligned_cols=22 Identities=32% Similarity=0.815 Sum_probs=18.9
Q ss_pred ceecccCCCCcc--cCCCCcccCCCC
Q psy4097 13 YRKCGQCPHGWV--GDGTTCRQGTTG 36 (129)
Q Consensus 13 ~~~C~~C~~g~~--~~~~~C~~~~~~ 36 (129)
+|.| .|++||. .++.+|+++ ||
T Consensus 1 sy~C-~C~~Gy~l~~d~~~C~DI-dE 24 (24)
T PF12662_consen 1 SYTC-SCPPGYQLSPDGRSCEDI-DE 24 (24)
T ss_pred CEEe-eCCCCCcCCCCCCccccC-CC
Confidence 5899 9999999 577899998 65
No 17
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.96 E-value=3.7e-06 Score=39.38 Aligned_cols=32 Identities=53% Similarity=1.129 Sum_probs=24.8
Q ss_pred CCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCC
Q psy4097 39 VQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGV 74 (129)
Q Consensus 39 ~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~ 74 (129)
...+.|+.++.|.++. ++ +.|.|.+||.|+|+
T Consensus 3 ~~~~~C~~nA~C~~~~-~~---~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 3 ENNGGCHPNATCTNTG-GS---YTCTCKPGYEGDGF 34 (36)
T ss_dssp TGGGGS-TTCEEEE-T-TS---EEEEE-CEEECCST
T ss_pred CCCCCCCCCcEeecCC-CC---EEeECCCCCccCCc
Confidence 3355799999999999 87 99999999999976
No 18
>KOG1225|consensus
Probab=97.93 E-value=2.9e-05 Score=56.81 Aligned_cols=78 Identities=35% Similarity=0.912 Sum_probs=47.0
Q ss_pred ecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCC
Q psy4097 15 KCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCE 94 (129)
Q Consensus 15 ~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~ 94 (129)
+| .|++||. +..|... .|.. .|..++.+.+ + .|.|.++|.|. .|+ . ..| +.+|.
T Consensus 266 ~C-IC~~Gf~--G~dC~e~--~Cp~---~cs~~g~~~~---g-----~CiC~~g~~G~-----dCs-~---~~c-padC~ 319 (525)
T KOG1225|consen 266 RC-ICPPGFT--GDDCDEL--VCPV---DCSGGGVCVD---G-----ECICNPGYSGK-----DCS-I---RRC-PADCS 319 (525)
T ss_pred eE-eCCCCCc--CCCCCcc--cCCc---ccCCCceecC---C-----EeecCCCcccc-----ccc-c---ccC-CccCC
Confidence 56 7777777 6666542 2432 2444433432 2 57788888777 665 3 234 35677
Q ss_pred CCCeeccCCCCceeeeCCCCCccCCCccc
Q psy4097 95 HGGICAPIGDRGYRCQCEPGFTGESELLV 123 (129)
Q Consensus 95 ~~~~C~~~~~~~~~C~C~~g~~g~~c~~~ 123 (129)
.++.| + .+ +|.|.+||+|+.|...
T Consensus 320 g~G~C-i--~G--~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 320 GHGKC-I--DG--ECLCDEGYTGELCIQR 343 (525)
T ss_pred CCCcc-c--CC--ceEeCCCCcCCccccc
Confidence 77777 4 22 6778888888777653
No 19
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.89 E-value=2.8e-05 Score=35.96 Aligned_cols=29 Identities=34% Similarity=0.842 Sum_probs=24.6
Q ss_pred CCCCCCCeeccCCCCceeeeCCCCCcc-CCCc
Q psy4097 91 VTCEHGGICAPIGDRGYRCQCEPGFTG-ESEL 121 (129)
Q Consensus 91 ~~C~~~~~C~~~~~~~~~C~C~~g~~g-~~c~ 121 (129)
.+|.++ .| ++..+.|.|.|++||.| ..|+
T Consensus 6 ~~C~~~-~C-~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 6 GPCSNG-TC-INTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCC-EE-ECCCCCeEeECCCCCccCCccC
Confidence 578877 89 88899999999999999 6553
No 20
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.87 E-value=3.4e-05 Score=35.50 Aligned_cols=30 Identities=43% Similarity=1.022 Sum_probs=25.3
Q ss_pred CCCCCCCCeeccCCCCceeeeCCCCCccC-CC
Q psy4097 90 GVTCEHGGICAPIGDRGYRCQCEPGFTGE-SE 120 (129)
Q Consensus 90 ~~~C~~~~~C~~~~~~~~~C~C~~g~~g~-~c 120 (129)
..+|.+++.| ++..+.|.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C-~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTC-VNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEE-ecCCCCeEeECCCCCcccCCc
Confidence 4567777799 888899999999999998 55
No 21
>KOG4260|consensus
Probab=97.86 E-value=1.2e-05 Score=53.83 Aligned_cols=92 Identities=26% Similarity=0.674 Sum_probs=60.0
Q ss_pred CCceecccCCCCcccCCCCcccCC--------CCccCCCCCCCC--CCeeeeCCCCCcceeee-cCCCCCcCCCCCcCCc
Q psy4097 11 QGYRKCGQCPHGWVGDGTTCRQGT--------TGCSVQNGGCHP--LATCRETSDTVRSVISC-TCPPGMGGSGVGLMGC 79 (129)
Q Consensus 11 ~g~~~C~~C~~g~~~~~~~C~~~~--------~~C~~~~~~C~~--~~~C~~~~~g~~~~~~c-~c~~g~~g~~~~~~~c 79 (129)
.|+-+| .|.+||. +..|..-. ++-...-..|+. .++|.... + ..| .|..||..... .|
T Consensus 165 ~GsGkC-kC~~GY~--Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~--~---k~C~kCkkGW~lde~---gC 233 (350)
T KOG4260|consen 165 EGSGKC-KCETGYT--GPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGES--S---KGCSKCKKGWKLDEE---GC 233 (350)
T ss_pred CCCCcc-cccCCCC--CccccccchHHHHhhcccccchhhhhhhhhhcccCCCC--C---CChhhhcccceeccc---cc
Confidence 367789 9999999 77764310 110000001221 12343222 1 345 68889875533 68
Q ss_pred ccCCCCCCCC--CCCCCCCCeeccCCCCceeeeCCCCCcc
Q psy4097 80 AYGMSGNPCG--GVTCEHGGICAPIGDRGYRCQCEPGFTG 117 (129)
Q Consensus 80 ~~~~~~~~C~--~~~C~~~~~C~~~~~~~~~C~C~~g~~g 117 (129)
. ++ |+|. +.+|.....| +|+.++|.|.+.+||.+
T Consensus 234 v-Dv--nEC~~ep~~c~~~qfC-vNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 234 V-DV--NECQNEPAPCKAHQFC-VNTEGSFKCEDKEGYKK 269 (350)
T ss_pred c-cH--HHHhcCCCCCChhhee-ecCCCceEecccccccC
Confidence 8 88 9997 5789888899 99999999999999975
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.49 E-value=0.00023 Score=33.46 Aligned_cols=33 Identities=33% Similarity=0.785 Sum_probs=26.9
Q ss_pred CCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCc-CC
Q psy4097 35 TGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMG-GS 72 (129)
Q Consensus 35 ~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~-g~ 72 (129)
++|... .+|.+++.|.++. ++ |.|.|+.+|. +.
T Consensus 3 ~~C~~~-~~C~~~~~C~~~~-g~---~~C~C~~g~~~g~ 36 (39)
T smart00179 3 DECASG-NPCQNGGTCVNTV-GS---YRCECPPGYTDGR 36 (39)
T ss_pred ccCcCC-CCcCCCCEeECCC-CC---eEeECCCCCccCC
Confidence 677762 4898888999988 77 9999999998 55
No 23
>KOG1225|consensus
Probab=97.48 E-value=0.00065 Score=49.98 Aligned_cols=79 Identities=37% Similarity=0.988 Sum_probs=55.0
Q ss_pred ecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCC
Q psy4097 15 KCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCE 94 (129)
Q Consensus 15 ~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~ 94 (129)
.| .|..+|. +..|.. -.|.. .|...+.|.. | .|.|++||.|. .|. . -.|... |.
T Consensus 235 ic-~c~~~~~--g~~c~~--~~C~~---~c~~~g~c~~---G-----~CIC~~Gf~G~-----dC~-e---~~Cp~~-cs 288 (525)
T KOG1225|consen 235 IC-ECPEGYF--GPLCST--IYCPG---GCTGRGQCVE---G-----RCICPPGFTGD-----DCD-E---LVCPVD-CS 288 (525)
T ss_pred ee-ecCCcee--CCcccc--ccCCC---CCcccceEeC---C-----eEeCCCCCcCC-----CCC-c---ccCCcc-cC
Confidence 57 7888887 666763 34543 4555556653 3 59999999999 886 4 356443 77
Q ss_pred CCCeeccCCCCceeeeCCCCCccCCCcccc
Q psy4097 95 HGGICAPIGDRGYRCQCEPGFTGESELLVP 124 (129)
Q Consensus 95 ~~~~C~~~~~~~~~C~C~~g~~g~~c~~~~ 124 (129)
.++.+ ..+ .|.|++||.|+.|+...
T Consensus 289 ~~g~~---~~g--~CiC~~g~~G~dCs~~~ 313 (525)
T KOG1225|consen 289 GGGVC---VDG--ECICNPGYSGKDCSIRR 313 (525)
T ss_pred CCcee---cCC--EeecCCCcccccccccc
Confidence 76566 333 89999999999997544
No 24
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.35 E-value=0.00016 Score=33.78 Aligned_cols=25 Identities=48% Similarity=1.127 Sum_probs=19.3
Q ss_pred ceeeeCCCceecccCCCCcc--cCCCCc
Q psy4097 5 VECINTQGYRKCGQCPHGWV--GDGTTC 30 (129)
Q Consensus 5 ~~C~~~~g~~~C~~C~~g~~--~~~~~C 30 (129)
..|++++++|.| .|++||. .|+.+|
T Consensus 10 h~C~~~~g~~~C-~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 10 HICVNTPGSYRC-SCPPGYKLAEDGRTC 36 (36)
T ss_dssp SEEEEETTSEEE-E-STTEEE-TTSSSE
T ss_pred CCCccCCCceEe-ECCCCCEECcCCCCC
Confidence 468999999999 9999998 444444
No 25
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.15 E-value=0.00023 Score=32.32 Aligned_cols=27 Identities=33% Similarity=0.776 Sum_probs=22.3
Q ss_pred CCCCCCeeeeCCCCCcceeeecCCCCCcCC
Q psy4097 43 GCHPLATCRETSDTVRSVISCTCPPGMGGS 72 (129)
Q Consensus 43 ~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~ 72 (129)
+|.++++|+... .. .|.|.|++||.|.
T Consensus 5 ~C~n~g~C~~~~-~~--~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLP-GG--GYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEES-TS--EEEEEEBTTEEST
T ss_pred cCCCCeEEEeCC-CC--CEEeECCCCCccC
Confidence 899999999765 22 3999999999875
No 26
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.14 E-value=0.00019 Score=25.86 Aligned_cols=13 Identities=54% Similarity=1.262 Sum_probs=10.4
Q ss_pred eeeCCCCCccCCC
Q psy4097 108 RCQCEPGFTGESE 120 (129)
Q Consensus 108 ~C~C~~g~~g~~c 120 (129)
.|+|++||.|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4899999999876
No 27
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.92 E-value=0.002 Score=29.77 Aligned_cols=33 Identities=33% Similarity=0.801 Sum_probs=26.2
Q ss_pred CCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCC
Q psy4097 35 TGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGS 72 (129)
Q Consensus 35 ~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~ 72 (129)
++|... .+|.+++.|.+.. +. |.|.|+.+|.+.
T Consensus 3 ~~C~~~-~~C~~~~~C~~~~-~~---~~C~C~~g~~g~ 35 (38)
T cd00054 3 DECASG-NPCQNGGTCVNTV-GS---YRCSCPPGYTGR 35 (38)
T ss_pred ccCCCC-CCcCCCCEeECCC-CC---eEeECCCCCcCC
Confidence 567652 3788888999888 76 999999999876
No 28
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=96.82 E-value=0.0013 Score=39.15 Aligned_cols=36 Identities=33% Similarity=0.597 Sum_probs=28.4
Q ss_pred CCCCCCCeec-cCCCCceeeeCCCCCccCCCccccccc
Q psy4097 91 VTCEHGGICA-PIGDRGYRCQCEPGFTGESELLVPLLS 127 (129)
Q Consensus 91 ~~C~~~~~C~-~~~~~~~~C~C~~g~~g~~c~~~~~~~ 127 (129)
+.|.++ .|+ ......+.|.|..||.|++||...++.
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~dLl~ 87 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVVLVD 87 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccccccceeeee
Confidence 568776 785 455678899999999999999876543
No 29
>PHA02887 EGF-like protein; Provisional
Probab=96.49 E-value=0.0036 Score=36.77 Aligned_cols=38 Identities=29% Similarity=0.484 Sum_probs=28.3
Q ss_pred CCCC---CCCCCCCCeec-cCCCCceeeeCCCCCccCCCcccc
Q psy4097 86 NPCG---GVTCEHGGICA-PIGDRGYRCQCEPGFTGESELLVP 124 (129)
Q Consensus 86 ~~C~---~~~C~~~~~C~-~~~~~~~~C~C~~g~~g~~c~~~~ 124 (129)
.+|. .+.|.++ .|. ........|.|+.||.|.+|+.+.
T Consensus 84 ~pC~~eyk~YCiHG-~C~yI~dL~epsCrC~~GYtG~RCE~vs 125 (126)
T PHA02887 84 EKCKNDFNDFCING-ECMNIIDLDEKFCICNKGYTGIRCDEVS 125 (126)
T ss_pred cccChHhhCEeeCC-EEEccccCCCceeECCCCcccCCCCccc
Confidence 5665 2567765 884 345677899999999999998653
No 30
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.48 E-value=0.00072 Score=43.04 Aligned_cols=100 Identities=24% Similarity=0.528 Sum_probs=60.4
Q ss_pred eeeCCCceecccCCCCcc-cCCCCcccCCCCccC---CCCCCCCCCeeeeCCC-CCcceeeecCCCCCcCCCCCcCCccc
Q psy4097 7 CINTQGYRKCGQCPHGWV-GDGTTCRQGTTGCSV---QNGGCHPLATCRETSD-TVRSVISCTCPPGMGGSGVGLMGCAY 81 (129)
Q Consensus 7 C~~~~g~~~C~~C~~g~~-~~~~~C~~~~~~C~~---~~~~C~~~~~C~~~~~-g~~~~~~c~c~~g~~g~~~~~~~c~~ 81 (129)
.+.....|.| .|.+||. .+..+|+.. .+|.. ...+|...+.|.+... +....|.|.|..+|....- .|.
T Consensus 13 LiQMSNHfEC-~Cnegfvl~~EntCE~k-v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~---vCv- 86 (197)
T PF06247_consen 13 LIQMSNHFEC-KCNEGFVLKNENTCEEK-VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG---VCV- 86 (197)
T ss_dssp EEEESSEEEE-EESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS---SEE-
T ss_pred EEEccCceEE-EcCCCcEEccccccccc-eecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC---eEc-
Confidence 3444567999 9999998 667899887 66654 2346888889987651 1112489999999975422 465
Q ss_pred CCCCCCCCCCCCCCCCeecc--CCCCceeeeCCCCCc
Q psy4097 82 GMSGNPCGGVTCEHGGICAP--IGDRGYRCQCEPGFT 116 (129)
Q Consensus 82 ~~~~~~C~~~~C~~~~~C~~--~~~~~~~C~C~~g~~ 116 (129)
. +.|....|..+ .|++ .......|.|.-|+.
T Consensus 87 p---~~C~~~~Cg~G-KCI~d~~~~~~~~CSC~IGkV 119 (197)
T PF06247_consen 87 P---NKCNNKDCGSG-KCILDPDNPNNPTCSCNIGKV 119 (197)
T ss_dssp E---GGGSS---TTE-EEEEEEGGGSEEEEEE-TEEE
T ss_pred h---hhcCceecCCC-eEEecCCCCCCceeEeeeceE
Confidence 3 46766678744 8842 223345899998887
No 31
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.41 E-value=0.0074 Score=27.52 Aligned_cols=25 Identities=44% Similarity=1.022 Sum_probs=21.2
Q ss_pred CCCCCCCeeeeCCCCCcceeeecCCCCCcC
Q psy4097 42 GGCHPLATCRETSDTVRSVISCTCPPGMGG 71 (129)
Q Consensus 42 ~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g 71 (129)
.+|.++ +|.+.. +. +.|.|+.||.+
T Consensus 6 ~~C~~~-~C~~~~-~~---~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNG-TCINTP-GS---YTCSCPPGYTG 30 (35)
T ss_pred CCCCCC-EEECCC-CC---eEeECCCCCcc
Confidence 378877 899887 76 99999999987
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.31 E-value=0.0059 Score=27.64 Aligned_cols=21 Identities=38% Similarity=0.899 Sum_probs=18.9
Q ss_pred CCceeeeCCCceecccCCCCcc
Q psy4097 3 PYVECINTQGYRKCGQCPHGWV 24 (129)
Q Consensus 3 ~~~~C~~~~g~~~C~~C~~g~~ 24 (129)
+++.|+++.+.|.| .|+.||.
T Consensus 10 ~~~~C~~~~~~~~C-~C~~g~~ 30 (36)
T cd00053 10 NGGTCVNTPGSYRC-VCPPGYT 30 (36)
T ss_pred CCCEEecCCCCeEe-ECCCCCc
Confidence 45889999999999 9999998
No 33
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.09 E-value=0.0029 Score=29.49 Aligned_cols=25 Identities=40% Similarity=1.006 Sum_probs=18.8
Q ss_pred CCCCCCeeccCCCCceeeeCCCCCccCC
Q psy4097 92 TCEHGGICAPIGDRGYRCQCEPGFTGES 119 (129)
Q Consensus 92 ~C~~~~~C~~~~~~~~~C~C~~g~~g~~ 119 (129)
.|.+ .| ++.+++|+|.|++||....
T Consensus 7 gC~h--~C-~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 7 GCSH--IC-VNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp GSSS--EE-EEETTSEEEE-STTEEE-T
T ss_pred CcCC--CC-ccCCCceEeECCCCCEECc
Confidence 4655 79 8889999999999997544
No 34
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.00 E-value=0.013 Score=26.55 Aligned_cols=26 Identities=35% Similarity=0.791 Sum_probs=20.5
Q ss_pred CCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097 92 TCEHGGICAPIGDRGYRCQCEPGFTGESE 120 (129)
Q Consensus 92 ~C~~~~~C~~~~~~~~~C~C~~g~~g~~c 120 (129)
.|..+++| +.. ..+|.|.+||.|..|
T Consensus 7 ~C~~~G~C-~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTC-VSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEE-eCC--CCEEECCCCCcCCCC
Confidence 57777899 544 458999999999876
No 35
>KOG1226|consensus
Probab=95.69 E-value=0.075 Score=40.88 Aligned_cols=90 Identities=30% Similarity=0.706 Sum_probs=50.2
Q ss_pred CCceecc--cCCCCcccCCCCcccC---------CCCccCC--CCCCCCCCeeeeCCCCCcceeeecCCCCCc----CCC
Q psy4097 11 QGYRKCG--QCPHGWVGDGTTCRQG---------TTGCSVQ--NGGCHPLATCRETSDTVRSVISCTCPPGMG----GSG 73 (129)
Q Consensus 11 ~g~~~C~--~C~~g~~~~~~~C~~~---------~~~C~~~--~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~----g~~ 73 (129)
.|.+.|+ .|.+||. |..|+=. .+.|... ..+|...+.|.- | +|.|.+... |.
T Consensus 472 ~G~~~CG~C~C~~G~~--G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C---G-----qC~C~~~~~~~i~G~- 540 (783)
T KOG1226|consen 472 NGTFVCGQCRCDEGWL--GKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC---G-----QCVCHKPDNGKIYGK- 540 (783)
T ss_pred CCcEEecceecCCCCC--CCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC---C-----ceEecCCCCCceeee-
Confidence 4555443 8999999 7766522 0223221 115666666642 2 466666544 44
Q ss_pred CCcCCcccCCCCCCCCC---CCCCCCCeeccCCCCceeeeCCCCCccCCCcc
Q psy4097 74 VGLMGCAYGMSGNPCGG---VTCEHGGICAPIGDRGYRCQCEPGFTGESELL 122 (129)
Q Consensus 74 ~~~~~c~~~~~~~~C~~---~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~ 122 (129)
.|+ -.- -.|.. ..|..++.| .-| +|.|.+||+|..|+-
T Consensus 541 ----fCE-CDn-fsC~r~~g~lC~g~G~C---~CG--~CvC~~GwtG~~C~C 581 (783)
T KOG1226|consen 541 ----FCE-CDN-FSCERHKGVLCGGHGRC---ECG--RCVCNPGWTGSACNC 581 (783)
T ss_pred ----eee-ccC-cccccccCcccCCCCeE---eCC--cEEcCCCCccCCCCC
Confidence 565 320 23332 346655567 222 689999999999873
No 36
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.40 E-value=0.0063 Score=38.85 Aligned_cols=104 Identities=25% Similarity=0.652 Sum_probs=58.7
Q ss_pred CceeeeCC-----CceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCC
Q psy4097 4 YVECINTQ-----GYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMG 78 (129)
Q Consensus 4 ~~~C~~~~-----g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~ 78 (129)
.+.|++.. ..|.| .|.+||......|.. +.|... .|. .+.|+..+ .......|+|.-|+... ....
T Consensus 55 ya~C~~~~~~~~~~~~~C-~C~~gY~~~~~vCvp--~~C~~~--~Cg-~GKCI~d~-~~~~~~~CSC~IGkV~~--dn~k 125 (197)
T PF06247_consen 55 YAKCINQANKGEERAYKC-DCINGYILKQGVCVP--NKCNNK--DCG-SGKCILDP-DNPNNPTCSCNIGKVPD--DNKK 125 (197)
T ss_dssp TEEEEE-SSTTSSTSEEE-EE-TTEEESSSSEEE--GGGSS-----T-TEEEEEEE-GGGSEEEEEE-TEEETT--TTTE
T ss_pred hhhhhcCCCcccceeEEE-ecccCceeeCCeEch--hhcCce--ecC-CCeEEecC-CCCCCceeEeeeceEec--cCCc
Confidence 46777665 36999 999999976677865 467655 777 46787544 22123689998887711 1113
Q ss_pred cccCCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCC
Q psy4097 79 CAYGMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGES 119 (129)
Q Consensus 79 c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~ 119 (129)
|. .....+|. --|.....| -...+-|.|.|..++.+..
T Consensus 126 Ct-k~G~T~C~-LKCk~nE~C-K~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 126 CT-KTGETKCS-LKCKENEEC-KLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp SE-EEE---------TTTEEE-EEETTEEEEEE-TT-EEET
T ss_pred cc-CCCcccee-eecCCCcce-eeeCcEEEeecCCCCCCCC
Confidence 44 22113453 246677789 8888999999999986444
No 37
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=95.27 E-value=0.0074 Score=28.15 Aligned_cols=28 Identities=25% Similarity=0.700 Sum_probs=21.5
Q ss_pred CCCceeeeCC-CceecccCCCCcccCCCCc
Q psy4097 2 TPYVECINTQ-GYRKCGQCPHGWVGDGTTC 30 (129)
Q Consensus 2 ~~~~~C~~~~-g~~~C~~C~~g~~~~~~~C 30 (129)
..+|.|++.. |...| .|..||..++..|
T Consensus 8 P~NA~C~~~~dG~eec-rCllgyk~~~~~C 36 (37)
T PF12946_consen 8 PANAGCFRYDDGSEEC-RCLLGYKKVGGKC 36 (37)
T ss_dssp -TTEEEEEETTSEEEE-EE-TTEEEETTEE
T ss_pred CCCcccEEcCCCCEEE-EeeCCccccCCCc
Confidence 3578898887 89999 9999999666555
No 38
>KOG0994|consensus
Probab=94.50 E-value=0.15 Score=41.30 Aligned_cols=55 Identities=25% Similarity=0.573 Sum_probs=33.3
Q ss_pred eee-cCCCCCcCCCCCcCCcccCCCCCCCCCCCCC--------CCCeec-cCCCCceeeeCCCCCccCCCcc
Q psy4097 61 ISC-TCPPGMGGSGVGLMGCAYGMSGNPCGGVTCE--------HGGICA-PIGDRGYRCQCEPGFTGESELL 122 (129)
Q Consensus 61 ~~c-~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~--------~~~~C~-~~~~~~~~C~C~~g~~g~~c~~ 122 (129)
+.| +|..||.|++.. -. . ..|.+-||. +...|. -+......|.|.+||.|.+|+.
T Consensus 885 ~~CdrCl~GyyGdP~l---g~-g---~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~ 949 (1758)
T KOG0994|consen 885 HSCDRCLDGYYGDPRL---GS-G---IGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEI 949 (1758)
T ss_pred cchhhhhccccCCccc---CC-C---CCCCCCCCCCCCccchhccccccccccccceeeecccCccccchhh
Confidence 778 899999988652 11 1 233322332 222341 1234556799999999999874
No 39
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=94.18 E-value=0.077 Score=35.05 Aligned_cols=36 Identities=22% Similarity=0.621 Sum_probs=27.9
Q ss_pred CcccCCCCCCCC--CCCCCCCCeeccCCCCceeeeCCCCCccCC
Q psy4097 78 GCAYGMSGNPCG--GVTCEHGGICAPIGDRGYRCQCEPGFTGES 119 (129)
Q Consensus 78 ~c~~~~~~~~C~--~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~ 119 (129)
.|. +. ++|. .++|.+ .| .+..++|.|.|+.||....
T Consensus 183 ~C~-~~--~~C~~~~~~c~~--~C-~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 183 ICV-VP--DLCATLSHVCQQ--VC-ISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cCc-Cc--hhhcCCCCCccc--eE-EcCCCCEEeECCCCccCCC
Confidence 566 55 7786 355765 69 9999999999999997643
No 40
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.77 E-value=0.079 Score=34.98 Aligned_cols=40 Identities=23% Similarity=0.497 Sum_probs=32.0
Q ss_pred CCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCC
Q psy4097 26 DGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGS 72 (129)
Q Consensus 26 ~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~ 72 (129)
.+..|.+. ++|...++.|. ..|.++. |+ |.|.|..||...
T Consensus 180 ~~~~C~~~-~~C~~~~~~c~--~~C~~~~-g~---~~c~c~~g~~~~ 219 (224)
T cd01475 180 QGKICVVP-DLCATLSHVCQ--QVCISTP-GS---YLCACTEGYALL 219 (224)
T ss_pred ccccCcCc-hhhcCCCCCcc--ceEEcCC-CC---EEeECCCCccCC
Confidence 35678877 89987766776 4699998 88 999999999754
No 41
>smart00051 DSL delta serrate ligand.
Probab=93.44 E-value=0.21 Score=26.29 Aligned_cols=46 Identities=24% Similarity=0.386 Sum_probs=30.2
Q ss_pred eeecCCCCCcCCCCCcCCcccCCCCCCCCC-CCCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097 61 ISCTCPPGMGGSGVGLMGCAYGMSGNPCGG-VTCEHGGICAPIGDRGYRCQCEPGFTGESE 120 (129)
Q Consensus 61 ~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~-~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c 120 (129)
+.-.|..+|.|. .|. ..|.. +.+..+..| .. .| .+.|.+||.|..|
T Consensus 17 ~rv~C~~~~yG~-----~C~-----~~C~~~~d~~~~~~C-d~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGE-----GCN-----KFCRPRDDFFGHYTC-DE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCC-----ccC-----CEeCcCccccCCccC-Cc-CC--CEecCCCCcCCCC
Confidence 445788888888 776 44542 234445567 33 33 4689999999876
No 42
>KOG1836|consensus
Probab=92.41 E-value=0.41 Score=40.61 Aligned_cols=56 Identities=30% Similarity=0.607 Sum_probs=35.0
Q ss_pred ee-cCCCCCcCCCCCcCCcccCCCCCCCCCCCCCCCCeec-cCCCCceeee-CCCCCccCCCcccc
Q psy4097 62 SC-TCPPGMGGSGVGLMGCAYGMSGNPCGGVTCEHGGICA-PIGDRGYRCQ-CEPGFTGESELLVP 124 (129)
Q Consensus 62 ~c-~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~~~~~C~-~~~~~~~~C~-C~~g~~g~~c~~~~ 124 (129)
.| +|..||.+.... ... +.|..=+|..+..|. +.....+.|. |++||+|.+|+...
T Consensus 757 ~C~~C~~GfYg~~~~------~~~-~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~ 815 (1705)
T KOG1836|consen 757 QCAQCVDGFYGLPDL------GTS-GDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECA 815 (1705)
T ss_pred chhhhcCCCCCcccc------CCC-CCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCC
Confidence 45 688888776331 110 226555666665662 2225667898 99999999998543
No 43
>KOG3516|consensus
Probab=91.05 E-value=0.22 Score=40.28 Aligned_cols=36 Identities=33% Similarity=0.819 Sum_probs=32.4
Q ss_pred CCCCCCCCCCCCeeccCCCCceeeeCC-CCCccCCCcc
Q psy4097 86 NPCGGVTCEHGGICAPIGDRGYRCQCE-PGFTGESELL 122 (129)
Q Consensus 86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~-~g~~g~~c~~ 122 (129)
+.|.+++|++++.| ..+...|.|.|. .||.|..|..
T Consensus 546 drClPN~CehgG~C-~Qs~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 546 DRCLPNPCEHGGKC-SQSWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred cccCCccccCCCcc-cccccceeEeccccccccccccC
Confidence 78889999999999 778889999998 7999999874
No 44
>KOG3514|consensus
Probab=86.23 E-value=0.61 Score=37.77 Aligned_cols=35 Identities=31% Similarity=0.771 Sum_probs=30.8
Q ss_pred CCCCCCCCCCCeeccCCCCceeeeCCC-CCccCCCcc
Q psy4097 87 PCGGVTCEHGGICAPIGDRGYRCQCEP-GFTGESELL 122 (129)
Q Consensus 87 ~C~~~~C~~~~~C~~~~~~~~~C~C~~-g~~g~~c~~ 122 (129)
.|.++||.+++.| ....+.|.|.|.. +|.|..|+.
T Consensus 625 ~C~~nPC~N~g~C-~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKC-SEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCc-cccccccccccccCcccCccccc
Confidence 6888999999999 8888999999964 899999974
No 45
>KOG1226|consensus
Probab=85.76 E-value=4.1 Score=31.98 Aligned_cols=63 Identities=30% Similarity=0.720 Sum_probs=36.3
Q ss_pred CCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCccc-CCCCCCCCC---CCCCCCCeeccCCCCceeeeCCCC-Ccc
Q psy4097 43 GCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAY-GMSGNPCGG---VTCEHGGICAPIGDRGYRCQCEPG-FTG 117 (129)
Q Consensus 43 ~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~-~~~~~~C~~---~~C~~~~~C~~~~~~~~~C~C~~g-~~g 117 (129)
.|..+++|.- | .|+|.+||.|. .|.- ... +.|.+ .-|...+.| .= + +|+|... |.|
T Consensus 556 lC~g~G~C~C---G-----~CvC~~GwtG~-----~C~C~~st-d~C~~~~G~iCSGrG~C-~C--g--~C~C~~~~~sG 616 (783)
T KOG1226|consen 556 LCGGHGRCEC---G-----RCVCNPGWTGS-----ACNCPLST-DTCESSDGQICSGRGTC-EC--G--RCKCTDPPYSG 616 (783)
T ss_pred ccCCCCeEeC---C-----cEEcCCCCccC-----CCCCCCCC-ccccCCCCceeCCCcee-eC--C--ceEcCCCCcCc
Confidence 4666666653 2 58999999998 5531 111 45542 224444445 11 1 4666655 888
Q ss_pred CCCcccc
Q psy4097 118 ESELLVP 124 (129)
Q Consensus 118 ~~c~~~~ 124 (129)
..|+..+
T Consensus 617 ~~CE~cp 623 (783)
T KOG1226|consen 617 EFCEKCP 623 (783)
T ss_pred chhhcCC
Confidence 8887543
No 46
>KOG0994|consensus
Probab=83.65 E-value=6.3 Score=32.79 Aligned_cols=17 Identities=24% Similarity=0.468 Sum_probs=14.1
Q ss_pred eeeeCCCCCccCCCccc
Q psy4097 107 YRCQCEPGFTGESELLV 123 (129)
Q Consensus 107 ~~C~C~~g~~g~~c~~~ 123 (129)
-.|.|++++.|.+|+.+
T Consensus 530 GqC~CRe~~~GR~c~~~ 546 (1758)
T KOG0994|consen 530 GQCECREHMLGRRCEQV 546 (1758)
T ss_pred Ccccccccccccccccc
Confidence 37999999999998743
No 47
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=82.90 E-value=2.2 Score=24.88 Aligned_cols=31 Identities=32% Similarity=0.829 Sum_probs=22.9
Q ss_pred CCCC-CCCCCCCCeeccCCCCceeeeCCCCCccC
Q psy4097 86 NPCG-GVTCEHGGICAPIGDRGYRCQCEPGFTGE 118 (129)
Q Consensus 86 ~~C~-~~~C~~~~~C~~~~~~~~~C~C~~g~~g~ 118 (129)
+.|. -..|...+.| .. .....|.|.+||..+
T Consensus 78 d~Cd~y~~CG~~g~C-~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 78 DQCDVYGFCGPNGIC-NS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cCCCCccccCCccEe-CC-CCCCceECCCCcCCC
Confidence 6776 3679888899 43 345579999999754
No 48
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=76.72 E-value=1.8 Score=21.21 Aligned_cols=18 Identities=33% Similarity=0.569 Sum_probs=13.8
Q ss_pred CceeeeCCCCCccCCCcc
Q psy4097 105 RGYRCQCEPGFTGESELL 122 (129)
Q Consensus 105 ~~~~C~C~~g~~g~~c~~ 122 (129)
....|.|+++|.|..|+.
T Consensus 16 ~~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 16 STGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp TCEEESBSTTEESTTS-E
T ss_pred CCCEEeccccccCCcCcC
Confidence 344899999999999874
No 49
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=76.40 E-value=3.5 Score=18.81 Aligned_cols=13 Identities=23% Similarity=0.680 Sum_probs=10.4
Q ss_pred eeeeCCCCCccCC
Q psy4097 107 YRCQCEPGFTGES 119 (129)
Q Consensus 107 ~~C~C~~g~~g~~ 119 (129)
+.|.|+.||+.+.
T Consensus 18 ~~C~CPeGyIlde 30 (34)
T PF09064_consen 18 GQCFCPEGYILDE 30 (34)
T ss_pred CceeCCCceEecC
Confidence 4899999997554
No 50
>KOG3516|consensus
Probab=73.47 E-value=3.9 Score=33.69 Aligned_cols=37 Identities=27% Similarity=0.645 Sum_probs=30.4
Q ss_pred CCCCCCCCCCCCeeccCCCCceeeeCCC-CCccCCCccc
Q psy4097 86 NPCGGVTCEHGGICAPIGDRGYRCQCEP-GFTGESELLV 123 (129)
Q Consensus 86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~~-g~~g~~c~~~ 123 (129)
-.|.+.+|.+++.| +..-.+|.|.|.. .|.|..|...
T Consensus 956 GhCss~~C~NGG~C-very~gytCDCs~Tay~Gp~Cs~e 993 (1306)
T KOG3516|consen 956 GHCSSYPCLNGGHC-VERYDGYTCDCSRTAYDGPFCSKE 993 (1306)
T ss_pred cccccccccCCCEE-EEecCceeeccccCcCCCCccccc
Confidence 45777899999999 7777899999964 7899998643
No 51
>KOG3512|consensus
Probab=71.94 E-value=16 Score=27.43 Aligned_cols=18 Identities=11% Similarity=0.167 Sum_probs=12.4
Q ss_pred CCceeeeCCCCCccCCCc
Q psy4097 104 DRGYRCQCEPGFTGESEL 121 (129)
Q Consensus 104 ~~~~~C~C~~g~~g~~c~ 121 (129)
+..+.+.|+.++.+.+++
T Consensus 461 ~qd~~s~Ck~~~~~~r~n 478 (592)
T KOG3512|consen 461 PQDQCSKCKASPGGKRLN 478 (592)
T ss_pred chhccccCCCCCcceecc
Confidence 455667788888776654
No 52
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=70.01 E-value=1.2 Score=23.42 Aligned_cols=46 Identities=24% Similarity=0.473 Sum_probs=18.8
Q ss_pred eeecCCCCCcCCCCCcCCcccCCCCCCCCCC-CCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097 61 ISCTCPPGMGGSGVGLMGCAYGMSGNPCGGV-TCEHGGICAPIGDRGYRCQCEPGFTGESE 120 (129)
Q Consensus 61 ~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~-~C~~~~~C~~~~~~~~~C~C~~g~~g~~c 120 (129)
+...|...|.|. .|. ..|.+. .-..+-.| . ..|.. .|.+||.|..|
T Consensus 17 ~rv~C~~nyyG~-----~C~-----~~C~~~~d~~ghy~C-d-~~G~~--~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGP-----NCS-----KFCKPRDDSFGHYTC-D-SNGNK--VCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETT-----TT------EE---EEETTEEEEE---SS--E--EE-TTEESTTS
T ss_pred EEEECCCCCCCc-----ccc-----CCcCCCcCCcCCccc-C-CCCCC--CCCCCCcCCCC
Confidence 556788888888 776 345421 01111245 3 33433 58899999876
No 53
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=66.99 E-value=4.7 Score=19.88 Aligned_cols=14 Identities=36% Similarity=0.845 Sum_probs=10.8
Q ss_pred eeeCCCCCccCCCc
Q psy4097 108 RCQCEPGFTGESEL 121 (129)
Q Consensus 108 ~C~C~~g~~g~~c~ 121 (129)
+|.|++++.|..|+
T Consensus 20 ~C~C~~~~~G~~C~ 33 (50)
T cd00055 20 QCECKPNTTGRRCD 33 (50)
T ss_pred EEeCCCcCCCCCCC
Confidence 57788888888776
No 54
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=66.39 E-value=5.1 Score=19.42 Aligned_cols=14 Identities=36% Similarity=0.899 Sum_probs=10.4
Q ss_pred eeeCCCCCccCCCc
Q psy4097 108 RCQCEPGFTGESEL 121 (129)
Q Consensus 108 ~C~C~~g~~g~~c~ 121 (129)
.|.|++++.|..|+
T Consensus 19 ~C~C~~~~~G~~C~ 32 (46)
T smart00180 19 QCECKPNVTGRRCD 32 (46)
T ss_pred EEECCCCCCCCCCC
Confidence 67777887777765
No 55
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=65.98 E-value=6.2 Score=23.00 Aligned_cols=33 Identities=21% Similarity=0.539 Sum_probs=21.7
Q ss_pred CCCCCCCeeccCC-----CCceeeeCCC-------------CCccCCCcccc
Q psy4097 91 VTCEHGGICAPIG-----DRGYRCQCEP-------------GFTGESELLVP 124 (129)
Q Consensus 91 ~~C~~~~~C~~~~-----~~~~~C~C~~-------------g~~g~~c~~~~ 124 (129)
+.|..++.| ++. ..=|.|.|.+ .|.|..|++.+
T Consensus 13 n~CsgHG~C-~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkD 63 (103)
T PF12955_consen 13 NNCSGHGSC-VKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKD 63 (103)
T ss_pred cCCCCCceE-eeccCCCccceEEEEeeccccccccccCceeeeccccccccc
Confidence 557666678 443 2448899977 57777787543
No 56
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=60.19 E-value=8.7 Score=18.95 Aligned_cols=9 Identities=67% Similarity=1.811 Sum_probs=6.0
Q ss_pred ecccCCCCcc
Q psy4097 15 KCGQCPHGWV 24 (129)
Q Consensus 15 ~C~~C~~g~~ 24 (129)
.| .|++||.
T Consensus 38 ~C-~C~~g~~ 46 (52)
T PF01683_consen 38 RC-QCPPGYV 46 (52)
T ss_pred Ee-ECCCCCE
Confidence 56 6777766
No 57
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=51.67 E-value=25 Score=19.80 Aligned_cols=8 Identities=0% Similarity=0.156 Sum_probs=3.2
Q ss_pred cCCCCcee
Q psy4097 101 PIGDRGYR 108 (129)
Q Consensus 101 ~~~~~~~~ 108 (129)
-+....|.
T Consensus 41 GsDg~TY~ 48 (86)
T cd01328 41 TNDNETFD 48 (86)
T ss_pred CCCCCCcc
Confidence 33334443
No 58
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=44.44 E-value=5.4 Score=19.95 Aligned_cols=17 Identities=35% Similarity=0.986 Sum_probs=10.9
Q ss_pred cccCCCCcccCCC-CcccC
Q psy4097 16 CGQCPHGWVGDGT-TCRQG 33 (129)
Q Consensus 16 C~~C~~g~~~~~~-~C~~~ 33 (129)
| .|++||..+.. .|...
T Consensus 35 C-~C~~G~v~~~~~~CV~~ 52 (55)
T PF01826_consen 35 C-FCPPGYVRNDNGRCVPP 52 (55)
T ss_dssp E-EETTTEEEETTSEEEEG
T ss_pred C-CCCCCeeEcCCCCEEcH
Confidence 7 88888884444 55543
No 59
>KOG1836|consensus
Probab=42.18 E-value=43 Score=29.49 Aligned_cols=60 Identities=27% Similarity=0.616 Sum_probs=31.1
Q ss_pred eeeCCCceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeec-CCCCCcCC
Q psy4097 7 CINTQGYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCT-CPPGMGGS 72 (129)
Q Consensus 7 C~~~~g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~-c~~g~~g~ 72 (129)
|+....+-.|+.|..||.++... +..+.|..= +|.+.+.|..+. .. ....|. |+++|+|.
T Consensus 749 C~~~t~G~~C~~C~~GfYg~~~~--~~~~dC~~C--~Cp~~~~~~~~~-~~-~~~iCk~Cp~gytG~ 809 (1705)
T KOG1836|consen 749 CKHNTFGGQCAQCVDGFYGLPDL--GTSGDCQPC--PCPNGGACGQTP-EI-LEVVCKNCPPGYTGL 809 (1705)
T ss_pred cccCCCCCchhhhcCCCCCcccc--CCCCCCccC--CCCCChhhcCcC-cc-cceecCCCCCCCccc
Confidence 33333345676899999832211 100114432 555555555433 11 125687 89998877
No 60
>KOG3509|consensus
Probab=38.04 E-value=34 Score=28.15 Aligned_cols=38 Identities=34% Similarity=0.737 Sum_probs=29.4
Q ss_pred CCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCcccc
Q psy4097 86 NPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESELLVP 124 (129)
Q Consensus 86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~~ 124 (129)
+.|+..+|.+.+.| ........|.|++||.|..|+.-+
T Consensus 407 ~~c~~~p~~~~g~c-~p~~~~~~c~c~~g~~G~~c~d~~ 444 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPC-LQTLEGKQCLCPPGYTGDSCEDCM 444 (964)
T ss_pred CccccccCCCCccc-cccccccceeccccccCchhhccC
Confidence 45666778777777 677777889999999999987543
No 61
>PF02822 Antistasin: Antistasin family; InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=33.37 E-value=23 Score=14.98 Aligned_cols=14 Identities=36% Similarity=0.869 Sum_probs=7.2
Q ss_pred cCCCCcccCCCCcc
Q psy4097 18 QCPHGWVGDGTTCR 31 (129)
Q Consensus 18 ~C~~g~~~~~~~C~ 31 (129)
.|+.||..|...|+
T Consensus 9 ~C~~Gf~~D~~GC~ 22 (26)
T PF02822_consen 9 YCPYGFKTDENGCP 22 (26)
T ss_dssp --TT-EEE-TTSSE
T ss_pred cCCCcccCCCCCCC
Confidence 68899987666664
No 62
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=33.00 E-value=22 Score=18.15 Aligned_cols=21 Identities=19% Similarity=0.469 Sum_probs=13.6
Q ss_pred CCCceeeeCCCCCccCCCccc
Q psy4097 103 GDRGYRCQCEPGFTGESELLV 123 (129)
Q Consensus 103 ~~~~~~C~C~~g~~g~~c~~~ 123 (129)
..|...|.|..-|.|..|...
T Consensus 32 ~dG~p~CECn~Cy~GpdCS~~ 52 (56)
T PF04863_consen 32 ADGSPVCECNSCYGGPDCSTL 52 (56)
T ss_dssp ETTEE--EE-TTEESTTS-EE
T ss_pred ccCCccccccCCcCCCCcccC
Confidence 456678999999999999754
No 63
>PF09289 FOLN: Follistatin/Osteonectin-like EGF domain; InterPro: IPR015369 This domain is predominantly found in osteonectin and follistatin. They adopt an EGF-like structure [, ]. Follistatin is involved in diverse activities from embryonic development to cell secretion. ; GO: 0005515 protein binding; PDB: 1LR7_A 1LR8_A 1LR9_A 2ARP_F 3B4V_H 2KCX_A 3SEK_C 2P6A_D 3HH2_C 2B0U_D ....
Probab=29.27 E-value=43 Score=13.66 Aligned_cols=8 Identities=38% Similarity=1.377 Sum_probs=3.0
Q ss_pred CCCCCCee
Q psy4097 92 TCEHGGIC 99 (129)
Q Consensus 92 ~C~~~~~C 99 (129)
.|..+..|
T Consensus 5 ~Ck~GKvC 12 (22)
T PF09289_consen 5 HCKRGKVC 12 (22)
T ss_dssp --BTTEEE
T ss_pred ccCCCCEe
Confidence 44444455
No 64
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=28.73 E-value=59 Score=13.72 Aligned_cols=19 Identities=32% Similarity=0.725 Sum_probs=9.0
Q ss_pred CCCCCCCeeccCCCCceee
Q psy4097 91 VTCEHGGICAPIGDRGYRC 109 (129)
Q Consensus 91 ~~C~~~~~C~~~~~~~~~C 109 (129)
..|..+..|++...+.-+|
T Consensus 5 v~C~~G~~C~~d~~g~p~C 23 (26)
T smart00274 5 VQCPFGKVCVVDKGGNARC 23 (26)
T ss_pred EECCCCCEEEeCCCCCEEE
Confidence 3455566663333444444
No 65
>KOG3514|consensus
Probab=25.98 E-value=49 Score=27.81 Aligned_cols=37 Identities=22% Similarity=0.538 Sum_probs=28.2
Q ss_pred CCCCCCCCCCCCeeccCCCCceeeeCCC-CCccCCCccc
Q psy4097 86 NPCGGVTCEHGGICAPIGDRGYRCQCEP-GFTGESELLV 123 (129)
Q Consensus 86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~~-g~~g~~c~~~ 123 (129)
..|..+-|.+++.| +....+|.|.|.. .|.|..|...
T Consensus 1019 t~c~~~acanhG~c-~q~w~~~~c~csmtS~~Gp~C~d~ 1056 (1591)
T KOG3514|consen 1019 TTCSEDACANHGVC-IQQWNGIACDCSMTSYSGPRCNDP 1056 (1591)
T ss_pred cccchhhhhcccee-eeeecceeeeccccccCCCccCCC
Confidence 34555668788889 8888899999975 6889888643
No 66
>KOG0196|consensus
Probab=24.35 E-value=1.5e+02 Score=24.32 Aligned_cols=16 Identities=25% Similarity=0.721 Sum_probs=12.5
Q ss_pred cCCCCceeeeCCCCCc
Q psy4097 101 PIGDRGYRCQCEPGFT 116 (129)
Q Consensus 101 ~~~~~~~~C~C~~g~~ 116 (129)
....++..|.|..||-
T Consensus 302 s~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 302 SSSEGATSCTCENGYY 317 (996)
T ss_pred CCCCCCCcccccCCcc
Confidence 3567888899999973
No 67
>KOG1218|consensus
Probab=21.76 E-value=2.8e+02 Score=19.04 Aligned_cols=15 Identities=40% Similarity=0.919 Sum_probs=7.7
Q ss_pred eeeCCCCCccCCCcc
Q psy4097 108 RCQCEPGFTGESELL 122 (129)
Q Consensus 108 ~C~C~~g~~g~~c~~ 122 (129)
.|.|.+||.+..++.
T Consensus 163 ~c~c~~g~~g~~~~~ 177 (316)
T KOG1218|consen 163 ICTCQPGFVGVFCVE 177 (316)
T ss_pred ceeccCCcccccccc
Confidence 344555555555543
Done!