Query psy5752
Match_columns 222
No_of_seqs 214 out of 2167
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 17:08:53 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5752.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5752hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.8 6.5E-20 1.4E-24 163.1 15.3 144 10-165 692-864 (1289)
2 KOG1219|consensus 99.5 1.9E-14 4.1E-19 138.3 9.4 118 5-165 3858-3977(4289)
3 KOG1214|consensus 99.4 3.3E-12 7.2E-17 114.7 9.8 130 62-192 699-858 (1289)
4 KOG4289|consensus 99.3 7.1E-12 1.5E-16 117.5 6.4 122 8-165 1177-1316(2531)
5 KOG1219|consensus 98.9 4.5E-09 9.7E-14 102.6 9.2 107 51-192 3859-3973(4289)
6 KOG4260|consensus 98.9 6.7E-09 1.4E-13 83.0 7.6 145 38-188 130-301 (350)
7 KOG4289|consensus 98.7 2.6E-08 5.7E-13 94.3 7.2 69 8-87 1237-1308(2531)
8 PF07645 EGF_CA: Calcium-bindi 98.6 4.6E-08 1E-12 57.3 3.8 34 124-158 1-34 (42)
9 KOG1217|consensus 98.6 1.6E-06 3.5E-11 76.3 13.4 167 11-192 170-388 (487)
10 KOG1217|consensus 98.5 3.5E-06 7.6E-11 74.1 13.0 155 21-188 243-421 (487)
11 PF12947 EGF_3: EGF domain; I 98.4 1.4E-07 3E-12 53.1 2.2 36 128-164 1-36 (36)
12 KOG1225|consensus 98.2 2.5E-05 5.5E-10 69.3 11.4 44 37-93 266-309 (525)
13 KOG4260|consensus 98.1 2.3E-06 4.9E-11 68.7 3.9 130 15-159 150-305 (350)
14 PF12947 EGF_3: EGF domain; I 97.9 5E-06 1.1E-10 46.7 1.7 31 63-95 6-36 (36)
15 PF07645 EGF_CA: Calcium-bindi 97.9 3.9E-06 8.5E-11 49.0 1.3 31 9-45 1-34 (42)
16 smart00179 EGF_CA Calcium-bind 97.9 2.3E-05 4.9E-10 44.6 4.2 38 124-164 1-38 (39)
17 PF00008 EGF: EGF-like domain 97.8 4.9E-06 1.1E-10 45.6 0.6 29 14-47 3-31 (32)
18 smart00179 EGF_CA Calcium-bind 97.8 2.6E-05 5.5E-10 44.3 3.7 33 9-47 1-36 (39)
19 PF00008 EGF: EGF-like domain 97.8 2.3E-05 5E-10 42.9 2.7 27 134-160 5-31 (32)
20 PF12662 cEGF: Complement Clr- 97.7 2.6E-05 5.7E-10 39.4 2.2 22 148-169 1-24 (24)
21 PF12662 cEGF: Complement Clr- 97.6 4.5E-05 9.7E-10 38.6 2.1 22 79-100 1-24 (24)
22 PF14670 FXa_inhibition: Coagu 97.5 6.5E-05 1.4E-09 42.1 2.1 32 130-164 3-36 (36)
23 PF06247 Plasmod_Pvs28: Plasmo 97.5 5.4E-05 1.2E-09 57.8 1.8 130 16-160 7-162 (197)
24 cd00054 EGF_CA Calcium-binding 97.4 0.00021 4.6E-09 40.0 3.6 33 9-47 1-35 (38)
25 KOG1225|consensus 97.4 0.00047 1E-08 61.4 7.0 58 13-93 283-340 (525)
26 cd00054 EGF_CA Calcium-binding 97.3 0.00049 1.1E-08 38.4 3.9 33 125-159 2-34 (38)
27 cd00053 EGF Epidermal growth f 97.1 0.0013 2.7E-08 36.1 4.1 27 133-160 6-32 (36)
28 smart00181 EGF Epidermal growt 96.9 0.0018 3.8E-08 35.7 4.0 26 133-160 6-31 (35)
29 cd00053 EGF Epidermal growth f 96.7 0.0039 8.4E-08 34.0 3.9 27 63-91 6-32 (36)
30 smart00181 EGF Epidermal growt 96.6 0.0031 6.8E-08 34.7 3.3 26 15-47 6-31 (35)
31 PF12946 EGF_MSP1_1: MSP1 EGF 96.1 0.0046 1E-07 34.5 2.1 31 134-164 6-36 (37)
32 cd01475 vWA_Matrilin VWA_Matri 96.0 0.008 1.7E-07 48.1 3.7 39 118-159 180-218 (224)
33 KOG1226|consensus 95.5 0.091 2E-06 48.5 8.7 69 16-103 515-590 (783)
34 PF07974 EGF_2: EGF-like domai 95.5 0.023 4.9E-07 30.9 3.1 25 15-47 6-30 (32)
35 PF12661 hEGF: Human growth fa 95.4 0.0062 1.4E-07 26.1 0.6 11 37-47 1-11 (13)
36 PF14670 FXa_inhibition: Coagu 95.4 0.011 2.4E-07 33.0 1.7 21 69-91 10-30 (36)
37 PF12946 EGF_MSP1_1: MSP1 EGF 95.2 0.015 3.2E-07 32.5 1.8 35 59-95 2-36 (37)
38 KOG1226|consensus 95.2 0.21 4.6E-06 46.3 10.1 120 37-173 479-628 (783)
39 KOG0994|consensus 94.5 0.1 2.2E-06 50.3 6.4 53 37-93 886-947 (1758)
40 PF07974 EGF_2: EGF-like domai 94.0 0.1 2.2E-06 28.3 3.3 23 134-159 7-29 (32)
41 KOG0994|consensus 93.0 0.42 9.2E-06 46.4 7.6 64 81-163 831-900 (1758)
42 PHA02887 EGF-like protein; Pro 92.6 0.13 2.9E-06 36.4 2.9 40 9-56 82-125 (126)
43 PHA03099 epidermal growth fact 90.1 0.24 5.2E-06 35.7 2.2 40 9-56 41-84 (139)
44 PF06247 Plasmod_Pvs28: Plasmo 90.0 0.15 3.3E-06 39.3 1.2 75 63-164 6-85 (197)
45 smart00051 DSL delta serrate l 89.2 0.82 1.8E-05 28.9 3.9 44 37-91 18-61 (63)
46 cd01475 vWA_Matrilin VWA_Matri 87.8 0.63 1.4E-05 37.1 3.5 36 51-91 183-219 (224)
47 PF00954 S_locus_glycop: S-loc 85.9 0.88 1.9E-05 32.0 3.0 32 9-47 76-109 (110)
48 PF00954 S_locus_glycop: S-loc 82.1 1.8 3.9E-05 30.4 3.3 30 58-90 79-108 (110)
49 PF01683 EB: EB module; Inter 77.8 3.9 8.4E-05 24.4 3.3 27 11-47 20-48 (52)
50 PHA03099 epidermal growth fact 75.9 3.2 7E-05 30.0 2.9 32 62-94 50-81 (139)
51 PHA02887 EGF-like protein; Pro 75.2 4 8.6E-05 29.1 3.1 35 58-93 85-121 (126)
52 KOG3516|consensus 75.1 2.8 6E-05 41.1 3.1 41 7-56 542-584 (1306)
53 PF09064 Tme5_EGF_like: Thromb 63.7 7.5 0.00016 21.2 2.0 19 70-91 11-29 (34)
54 KOG3514|consensus 62.8 5.5 0.00012 38.9 2.2 35 12-55 625-661 (1591)
55 KOG0196|consensus 62.7 24 0.00052 33.8 6.2 72 81-172 260-336 (996)
56 KOG1836|consensus 60.5 22 0.00047 37.1 6.0 49 108-159 758-808 (1705)
57 KOG3516|consensus 60.3 7.6 0.00016 38.3 2.7 41 121-166 541-582 (1306)
58 PF12955 DUF3844: Domain of un 49.3 12 0.00026 26.1 1.6 33 126-158 6-42 (103)
59 KOG3512|consensus 40.8 34 0.00074 30.6 3.4 14 174-187 527-540 (592)
60 PF00053 Laminin_EGF: Laminin 40.1 16 0.00034 21.4 1.0 14 37-53 19-32 (49)
61 cd00055 EGF_Lam Laminin-type e 39.4 39 0.00084 19.9 2.6 14 37-53 20-33 (50)
62 KOG3514|consensus 33.8 69 0.0015 31.9 4.5 35 127-166 625-660 (1591)
63 KOG1836|consensus 33.4 56 0.0012 34.3 4.1 53 37-93 757-811 (1705)
64 KOG3509|consensus 29.8 1.3E+02 0.0029 29.6 5.7 66 10-87 406-473 (964)
65 PF02740 Colipase_C: Colipase, 26.7 50 0.0011 19.1 1.5 22 21-45 3-24 (45)
66 KOG1215|consensus 24.6 3.6E+02 0.0078 26.3 7.9 62 126-190 365-426 (877)
67 smart00180 EGF_Lam Laminin-typ 23.8 61 0.0013 18.7 1.6 14 37-53 19-32 (46)
68 KOG0196|consensus 23.3 93 0.002 30.1 3.4 46 37-89 260-317 (996)
69 PF01826 TIL: Trypsin Inhibito 22.0 47 0.001 19.8 0.9 19 82-100 35-54 (55)
70 PF11403 Yeast_MT: Yeast metal 20.6 94 0.002 16.8 1.7 9 37-45 23-31 (40)
No 1
>KOG1214|consensus
Probab=99.84 E-value=6.5e-20 Score=163.11 Aligned_cols=144 Identities=30% Similarity=0.805 Sum_probs=122.7
Q ss_pred CCCC---CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCC-CCCCCCCCEeeecCCCCCeEEecC
Q psy5752 10 LSEC---DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNV-VKNCHANAECVYNATSAGYRCQCA 85 (222)
Q Consensus 10 ~~~C---~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~-~~~C~~~~~C~~~n~~g~~~C~C~ 85 (222)
+++| +.-|.-++.|... .+..|.|.|..||.|+++ .|.+.++ |+. .+.|+.+++|+ |.+++|+|+|.
T Consensus 692 ~npCy~gsh~cdt~a~C~pg-----~~~~~tcecs~g~~gdgr-~c~d~~e-ca~~~~~CGp~s~Ci--n~pg~~rceC~ 762 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPG-----TGVDYTCECSSGYQGDGR-NCVDENE-CATGFHRCGPNSVCI--NLPGSYRCECR 762 (1289)
T ss_pred cccceecCcccCCCccccCC-----CCcceEEEEeeccCCCCC-CCCChhh-hccCCCCCCCCceee--cCCCceeEEEe
Confidence 5667 6778888999984 555669999999999998 9998884 987 78999999997 99999999999
Q ss_pred CCCc--cCCCeeec------CCCcCCCCCCCCC-----------------CCCCcccCCCccccCCCCCCCCCCCCCCCe
Q psy5752 86 QGYV--GNGVECHP------LKSCLEDRSLCGK-----------------DSQGYVGNGVECHPLKSCLEDRSLCGKDAS 140 (222)
Q Consensus 86 ~G~~--~~~~~C~~------~~~C~~~~~~C~~-----------------c~~g~~~~g~~C~~id~C~~~~~~C~~~~~ 140 (222)
.||. +++.+|.. ++.|.++.+.|.. |.+||.+||..|.++|+|.. ..|++.+.
T Consensus 763 ~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~p--srChp~A~ 840 (1289)
T KOG1214|consen 763 SGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSP--SRCHPAAT 840 (1289)
T ss_pred ecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccCc--cccCCCce
Confidence 9986 77788874 4668877666643 78999999999999999984 45999999
Q ss_pred eeeCCCCeeEeecCCCCccCCCccc
Q psy5752 141 CVVASQGHFHCECNEGFTGNGITCK 165 (222)
Q Consensus 141 C~n~~~g~~~C~C~~G~~gd~~~C~ 165 (222)
|.|++ |+|.|+|.+||.|||..|.
T Consensus 841 Cyntp-gsfsC~C~pGy~GDGf~CV 864 (1289)
T KOG1214|consen 841 CYNTP-GSFSCRCQPGYYGDGFQCV 864 (1289)
T ss_pred EecCC-CcceeecccCccCCCceec
Confidence 99998 6699999999999987663
No 2
>KOG1219|consensus
Probab=99.54 E-value=1.9e-14 Score=138.34 Aligned_cols=118 Identities=31% Similarity=0.816 Sum_probs=98.8
Q ss_pred CCccCC-CCC-CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEE
Q psy5752 5 PIRFSL-SEC-DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRC 82 (222)
Q Consensus 5 ~~~~~~-~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C 82 (222)
|.|.-. ++| .+||+|+|+|...+ .++| .|.|++.|.|. .|+...++|.+ .+|..+++|+ ...++|.|
T Consensus 3858 pgC~l~~d~C~~npCqhgG~C~~~~----~ggy-~CkCpsqysG~---~CEi~~epC~s-nPC~~GgtCi--p~~n~f~C 3926 (4289)
T KOG1219|consen 3858 PGCSLLTDPCNDNPCQHGGTCISQP----KGGY-KCKCPSQYSGN---HCEIDLEPCAS-NPCLTGGTCI--PFYNGFLC 3926 (4289)
T ss_pred ccccccccccccCcccCCCEecCCC----CCce-EEeCcccccCc---ccccccccccC-CCCCCCCEEE--ecCCCeeE
Confidence 345433 899 89999999999975 5666 99999999999 99988888998 9999999997 67889999
Q ss_pred ecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCC
Q psy5752 83 QCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGI 162 (222)
Q Consensus 83 ~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~ 162 (222)
.|+.||+|..++ . ..+++|+.+. |.++|.|+|.. |+|.|.|.+||.| .
T Consensus 3927 nC~~gyTG~~Ce--~-------------------------~Gi~eCs~n~--C~~gg~C~n~~-gsf~CncT~g~~g--r 3974 (4289)
T KOG1219|consen 3927 NCPNGYTGKRCE--A-------------------------RGISECSKNV--CGTGGQCINIP-GSFHCNCTPGILG--R 3974 (4289)
T ss_pred eCCCCccCceee--c-------------------------cccccccccc--ccCCceeeccC-CceEeccChhHhc--c
Confidence 999999988543 1 0278887655 99999999997 6699999999998 4
Q ss_pred ccc
Q psy5752 163 TCK 165 (222)
Q Consensus 163 ~C~ 165 (222)
.|.
T Consensus 3975 ~c~ 3977 (4289)
T KOG1219|consen 3975 TCC 3977 (4289)
T ss_pred cCc
Confidence 543
No 3
>KOG1214|consensus
Probab=99.36 E-value=3.3e-12 Score=114.66 Aligned_cols=130 Identities=26% Similarity=0.595 Sum_probs=106.9
Q ss_pred CCCCCCCCEeeecCCCCCeEEecCCCCccCCCeeecCCCcCCCCCCCCC--------------CCCCc--ccCCCcccc-
Q psy5752 62 VKNCHANAECVYNATSAGYRCQCAQGYVGNGVECHPLKSCLEDRSLCGK--------------DSQGY--VGNGVECHP- 124 (222)
Q Consensus 62 ~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~--------------c~~g~--~~~g~~C~~- 124 (222)
.+.|..++.|+ ..+.-.|+|.|..||.|+++.|.++++|.+.++.|+. |..|| ..++.+|..
T Consensus 699 sh~cdt~a~C~-pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i 777 (1289)
T KOG1214|consen 699 SHMCDTTARCH-PGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLI 777 (1289)
T ss_pred CcccCCCcccc-CCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEe
Confidence 36788888897 2555689999999999999999999999999888865 44555 467778874
Q ss_pred -----CCCCCCCCCCCCC--CCeeeeCCCCeeEeecCCCCccCCCccccCCCCC------CCeeeecCCcEEEEcCCCCC
Q psy5752 125 -----LKSCLEDRSLCGK--DASCVVASQGHFHCECNEGFTGNGITCKPVRKKE------SDFLLVNQGMFMLRVPYQPT 191 (222)
Q Consensus 125 -----id~C~~~~~~C~~--~~~C~n~~~g~~~C~C~~G~~gd~~~C~~~~~~~------~~~~~~~~g~~~~~~~~~~~ 191 (222)
++.|.++.+.|.- .+.|+...+++|.|.|.+||.|||..|.++++|. .++|.++.|++.|+|..++.
T Consensus 778 ~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~ 857 (1289)
T KOG1214|consen 778 TPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYY 857 (1289)
T ss_pred cCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccCccccCCCceEecCCCcceeecccCcc
Confidence 4678888788964 4567788788899999999999999999998875 45799999999999987666
Q ss_pred C
Q psy5752 192 R 192 (222)
Q Consensus 192 ~ 192 (222)
+
T Consensus 858 G 858 (1289)
T KOG1214|consen 858 G 858 (1289)
T ss_pred C
Confidence 4
No 4
>KOG4289|consensus
Probab=99.26 E-value=7.1e-12 Score=117.46 Aligned_cols=122 Identities=29% Similarity=0.695 Sum_probs=92.0
Q ss_pred cCCCCC-CCCCCCCCeecCCCC-----------------CCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCC
Q psy5752 8 FSLSEC-DTPCHPNAQCTSPDE-----------------FNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANA 69 (222)
Q Consensus 8 ~~~~~C-~~~C~~~~~C~~~~~-----------------~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~ 69 (222)
+|-+.| ..||.|..+|+.+-. -++.+++ .|+|++||+|+ .|+...++|.+ .+|.+++
T Consensus 1177 fdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvngl-rCrCPpGFTgd---~CeTeiDlCYs-~pC~nng 1251 (2531)
T KOG4289|consen 1177 FDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGL-RCRCPPGFTGD---YCETEIDLCYS-GPCGNNG 1251 (2531)
T ss_pred ccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCce-eEeCCCCCCcc---cccchhHhhhc-CCCCCCC
Confidence 455678 889999998875300 0113445 89999999999 99977778998 9999999
Q ss_pred EeeecCCCCCeEEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCee
Q psy5752 70 ECVYNATSAGYRCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHF 149 (222)
Q Consensus 70 ~C~~~n~~g~~~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~ 149 (222)
.|. ...|+|+|.|.+||+|..++- + ...-.|. |+.|.++++|+|...|.|
T Consensus 1252 ~C~--srEggYtCeCrpg~tGehCEv-s-------------------------~~agrCv--pGvC~nggtC~~~~nggf 1301 (2531)
T KOG4289|consen 1252 RCR--SREGGYTCECRPGFTGEHCEV-S-------------------------ARAGRCV--PGVCKNGGTCVNLLNGGF 1301 (2531)
T ss_pred ceE--EecCceeEEecCCccccceee-e-------------------------cccCccc--cceecCCCEEeecCCCce
Confidence 996 788999999999999885430 0 0122344 567999999999988889
Q ss_pred EeecCCCCccCCCccc
Q psy5752 150 HCECNEGFTGNGITCK 165 (222)
Q Consensus 150 ~C~C~~G~~gd~~~C~ 165 (222)
.|.|+.| .-.+..|+
T Consensus 1302 ~c~Cp~g-e~e~prC~ 1316 (2531)
T KOG4289|consen 1302 CCHCPYG-EFEDPRCE 1316 (2531)
T ss_pred eccCCCc-ccCCCceE
Confidence 9999998 32224555
No 5
>KOG1219|consensus
Probab=98.92 E-value=4.5e-09 Score=102.64 Aligned_cols=107 Identities=25% Similarity=0.624 Sum_probs=84.0
Q ss_pred CcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCC
Q psy5752 51 ECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLE 130 (222)
Q Consensus 51 ~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~ 130 (222)
.|....+.|.. .+|++++.|. ....++|.|+|++-|+|+.++ .++.+|..
T Consensus 3859 gC~l~~d~C~~-npCqhgG~C~-~~~~ggy~CkCpsqysG~~CE----------------------------i~~epC~s 3908 (4289)
T KOG1219|consen 3859 GCSLLTDPCND-NPCQHGGTCI-SQPKGGYKCKCPSQYSGNHCE----------------------------IDLEPCAS 3908 (4289)
T ss_pred ccccccccccc-CcccCCCEec-CCCCCceEEeCcccccCcccc----------------------------cccccccC
Confidence 45544346888 9999999997 355678999999999888432 25788987
Q ss_pred CCCCCCCCCeeeeCCCCeeEeecCCCCccCCCcccc--CCCC------CCCeeeecCCcEEEEcCCCCCC
Q psy5752 131 DRSLCGKDASCVVASQGHFHCECNEGFTGNGITCKP--VRKK------ESDFLLVNQGMFMLRVPYQPTR 192 (222)
Q Consensus 131 ~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C~~--~~~~------~~~~~~~~~g~~~~~~~~~~~~ 192 (222)
+| |..+++|+...+| |.|.|+.||+| .+|+. +++| .++.|.+..|+|.|.|..+..+
T Consensus 3909 nP--C~~GgtCip~~n~-f~CnC~~gyTG--~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~g 3973 (4289)
T KOG1219|consen 3909 NP--CLTGGTCIPFYNG-FLCNCPNGYTG--KRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILG 3973 (4289)
T ss_pred CC--CCCCCEEEecCCC-eeEeCCCCccC--ceeecccccccccccccCCceeeccCCceEeccChhHhc
Confidence 76 9999999998766 99999999998 56653 3444 4667999999999999865553
No 6
>KOG4260|consensus
Probab=98.88 E-value=6.7e-09 Score=82.98 Aligned_cols=145 Identities=23% Similarity=0.475 Sum_probs=100.2
Q ss_pred eecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeee-cCCCCCeEEecCCCCccCCCe-e----------------ecC-
Q psy5752 38 CICNFGYTGDGVTECNPESLGCNVVKNCHANAECVY-NATSAGYRCQCAQGYVGNGVE-C----------------HPL- 98 (222)
Q Consensus 38 C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~-~n~~g~~~C~C~~G~~~~~~~-C----------------~~~- 98 (222)
--|++|-.|.+--.|.... ..+|.-++.|.- -...|+-.|.|..||.|+.+. | ..-
T Consensus 130 vCCp~gtyGpdCl~Cpggs-----er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch 204 (350)
T KOG4260|consen 130 VCCPDGTYGPDCLQCPGGS-----ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACH 204 (350)
T ss_pred eccCCCCcCCccccCCCCC-----cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhh
Confidence 3478888887222333322 267877888841 012456789999999987532 1 110
Q ss_pred ----CCcC-CCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCc---cccCCCC
Q psy5752 99 ----KSCL-EDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGIT---CKPVRKK 170 (222)
Q Consensus 99 ----~~C~-~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~---C~~~~~~ 170 (222)
.-|. ..+..|..|..||..+-..|.|||||...+..|.....|+|+. |+|+|.+.+||.++... |.++-..
T Consensus 205 ~~C~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNte-GSf~C~dk~Gy~~g~d~C~~~~d~~~~ 283 (350)
T KOG4260|consen 205 EGCLGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTE-GSFKCEDKEGYKKGVDECQFCADVCAS 283 (350)
T ss_pred hhhhcccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCC-CceEecccccccCChHHhhhhhhhccc
Confidence 0122 2345677789999988889999999998888898888999996 77999999999973222 3344444
Q ss_pred CCCeeeecCCcEEEEcCC
Q psy5752 171 ESDFLLVNQGMFMLRVPY 188 (222)
Q Consensus 171 ~~~~~~~~~g~~~~~~~~ 188 (222)
....|.++.+.+.+.+..
T Consensus 284 kn~~c~ni~~~~r~v~f~ 301 (350)
T KOG4260|consen 284 KNRPCMNIDGQYRCVCFS 301 (350)
T ss_pred CCCCcccCCccEEEEecc
Confidence 556789999999888764
No 7
>KOG4289|consensus
Probab=98.73 E-value=2.6e-08 Score=94.30 Aligned_cols=69 Identities=30% Similarity=0.788 Sum_probs=58.6
Q ss_pred cCCCCC-CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCC--CCCCCCCCCCCCCEeeecCCCCCeEEec
Q psy5752 8 FSLSEC-DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPES--LGCNVVKNCHANAECVYNATSAGYRCQC 84 (222)
Q Consensus 8 ~~~~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~--~~C~~~~~C~~~~~C~~~n~~g~~~C~C 84 (222)
++||.| +.||.+|+.|.. . .|+| .|.|.+||+|. .|+... ..|.+ ..|.++++|+ ....|+|.|.|
T Consensus 1237 TeiDlCYs~pC~nng~C~s--r---EggY-tCeCrpg~tGe---hCEvs~~agrCvp-GvC~nggtC~-~~~nggf~c~C 1305 (2531)
T KOG4289|consen 1237 TEIDLCYSGPCGNNGRCRS--R---EGGY-TCECRPGFTGE---HCEVSARAGRCVP-GVCKNGGTCV-NLLNGGFCCHC 1305 (2531)
T ss_pred chhHhhhcCCCCCCCceEE--e---cCce-eEEecCCcccc---ceeeecccCcccc-ceecCCCEEe-ecCCCceeccC
Confidence 368999 999999999999 4 7888 99999999999 998443 23776 9999999998 45678899999
Q ss_pred CCC
Q psy5752 85 AQG 87 (222)
Q Consensus 85 ~~G 87 (222)
+.|
T Consensus 1306 p~g 1308 (2531)
T KOG4289|consen 1306 PYG 1308 (2531)
T ss_pred CCc
Confidence 988
No 8
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.63 E-value=4.6e-08 Score=57.27 Aligned_cols=34 Identities=26% Similarity=0.845 Sum_probs=31.0
Q ss_pred cCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCc
Q psy5752 124 PLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFT 158 (222)
Q Consensus 124 ~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~ 158 (222)
|||||+..++.|..++.|+|+. |+|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~-Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTE-GSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEET-TEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCC-CCEEeeCCCCcE
Confidence 6899998888998889999997 669999999998
No 9
>KOG1217|consensus
Probab=98.55 E-value=1.6e-06 Score=76.30 Aligned_cols=167 Identities=29% Similarity=0.706 Sum_probs=112.2
Q ss_pred CCC---CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccC-------------------CCCCCC-CCCCCC
Q psy5752 11 SEC---DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPE-------------------SLGCNV-VKNCHA 67 (222)
Q Consensus 11 ~~C---~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~-------------------~~~C~~-~~~C~~ 67 (222)
++| ..+|.+++.|.+. .+.| .|.|++||.+. .|+.. ...|.. ...|..
T Consensus 170 ~~C~~~~~~c~~~~~C~~~-----~~~~-~C~c~~~~~~~---~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~ 240 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNT-----GGSY-LCSCPPGYTGS---TCETTGNGGTCVDSVACSCPPGARGPECEVSIVECAS 240 (487)
T ss_pred cccccCCCCcCCCcccccC-----CCCe-eEeCCCCccCC---cCcCCCCCceEecceeccCCCCCCCCCcccccccccC
Confidence 577 3459999999994 4555 99999999987 33322 111222 233433
Q ss_pred C-CEeeecCCCCCeEEecCCCCccCC-CeeecCCCcCCCCCCCCC--------------CCCCcccCCC-ccccCCCCCC
Q psy5752 68 N-AECVYNATSAGYRCQCAQGYVGNG-VECHPLKSCLEDRSLCGK--------------DSQGYVGNGV-ECHPLKSCLE 130 (222)
Q Consensus 68 ~-~~C~~~n~~g~~~C~C~~G~~~~~-~~C~~~~~C~~~~~~C~~--------------c~~g~~~~g~-~C~~id~C~~ 130 (222)
. +.|+ +..++|+|.|++||.+.. ..+.++++|..... |.+ |..||.+..- .+.+..+|..
T Consensus 241 ~~~~c~--~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~ 317 (487)
T KOG1217|consen 241 GDGTCV--NTVGSYTCRCPEGYTGDACVTCVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSP 317 (487)
T ss_pred CCCccc--ccCCceeeeCCCCccccccceeeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccc
Confidence 3 7885 888899999999999887 57788888876543 322 6788876642 4556678852
Q ss_pred --CCCCCCCCCeee--eCCCCeeEeecCCCCccCCCccccCC-CC------CCCeeee-cCCcEEEEcCCCCCC
Q psy5752 131 --DRSLCGKDASCV--VASQGHFHCECNEGFTGNGITCKPVR-KK------ESDFLLV-NQGMFMLRVPYQPTR 192 (222)
Q Consensus 131 --~~~~C~~~~~C~--n~~~g~~~C~C~~G~~gd~~~C~~~~-~~------~~~~~~~-~~g~~~~~~~~~~~~ 192 (222)
....|.++++|. +.. +.+.|.|..||.| ..|+... ++ ....+.+ +.+.+.|.++....+
T Consensus 318 ~~~~~~c~~g~~C~~~~~~-~~~~C~c~~~~~g--~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 318 RNAGGPCANGGTCNTLGSF-GGFRCACGPGFTG--RRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred cccCCcCCCCcccccCCCC-CCCCcCCCCCCCC--CccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence 344588777882 222 3478999999776 6666552 33 3445777 789999998876443
No 10
>KOG1217|consensus
Probab=98.46 E-value=3.5e-06 Score=74.13 Aligned_cols=155 Identities=26% Similarity=0.638 Sum_probs=108.3
Q ss_pred CeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCC-eeecCC
Q psy5752 21 AQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGV-ECHPLK 99 (222)
Q Consensus 21 ~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~-~C~~~~ 99 (222)
+.|++. .+.+ .|.|++||++.....|.+++. |.....|.+++.|+ +..+.|.|.|++||.+... .+.+..
T Consensus 243 ~~c~~~-----~~~~-~C~~~~g~~~~~~~~~~~~~~-C~~~~~c~~~~~C~--~~~~~~~C~C~~g~~g~~~~~~~~~~ 313 (487)
T KOG1217|consen 243 GTCVNT-----VGSY-TCRCPEGYTGDACVTCVDVDS-CALIASCPNGGTCV--NVPGSYRCTCPPGFTGRLCTECVDVD 313 (487)
T ss_pred Cccccc-----CCce-eeeCCCCccccccceeeeccc-cCCCCccCCCCeee--cCCCcceeeCCCCCCCCCCccccccc
Confidence 789884 5566 999999999984114666664 88722388899996 7888899999999998875 556666
Q ss_pred CcCC--CCC------CC--------CCCCCCcccCCCccccC-CCCCCCCCCCCCCCeeee-CCCCeeEeecCCCCcc--
Q psy5752 100 SCLE--DRS------LC--------GKDSQGYVGNGVECHPL-KSCLEDRSLCGKDASCVV-ASQGHFHCECNEGFTG-- 159 (222)
Q Consensus 100 ~C~~--~~~------~C--------~~c~~g~~~~g~~C~~i-d~C~~~~~~C~~~~~C~n-~~~g~~~C~C~~G~~g-- 159 (222)
+|.. ... .| ..|..++...+..|+.. ++|...+ +..++.|++ .. ++|.|.|+.+|.+
T Consensus 314 ~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~--~~~~~~c~~~~~-~~~~c~~~~~~~~~~ 390 (487)
T KOG1217|consen 314 ECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSP--CCPGGTCVNETP-GSYRCACPAGFAGKA 390 (487)
T ss_pred cccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCc--cccCCEeccCCC-CCeEecCCCccccCC
Confidence 7752 111 22 11222222466689888 4998655 777889998 54 5599999999885
Q ss_pred --CCCccccCCCCC-CCeeeecCCcEEEEcCC
Q psy5752 160 --NGITCKPVRKKE-SDFLLVNQGMFMLRVPY 188 (222)
Q Consensus 160 --d~~~C~~~~~~~-~~~~~~~~g~~~~~~~~ 188 (222)
++..+.+++++. ...+++..+.+.|. +.
T Consensus 391 ~~~~~~~~~~~~c~~~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 391 NGDGVGCEDIDECSGCGDCVNGPGGGACT-PP 421 (487)
T ss_pred ccccccccccccccCCcceeccCCCCccc-cC
Confidence 345677777775 34466667777766 55
No 11
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.43 E-value=1.4e-07 Score=53.07 Aligned_cols=36 Identities=33% Similarity=0.995 Sum_probs=27.8
Q ss_pred CCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCcc
Q psy5752 128 CLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITC 164 (222)
Q Consensus 128 C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C 164 (222)
|+.+++.|+.+|+|+++++ +|.|+|.+||.|||..|
T Consensus 1 C~~~~~~C~~nA~C~~~~~-~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGG-SYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TT-SEEEEE-CEEECCSTCE
T ss_pred CCCCCCCCCCCcEeecCCC-CEEeECCCCCccCCcCC
Confidence 4455677999999999975 69999999999999876
No 12
>KOG1225|consensus
Probab=98.17 E-value=2.5e-05 Score=69.29 Aligned_cols=44 Identities=41% Similarity=1.115 Sum_probs=29.8
Q ss_pred eeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCC
Q psy5752 37 KCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGV 93 (222)
Q Consensus 37 ~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~ 93 (222)
.|+|++||+|. .|.... |.. .|+.+..++ .+ .|.|++||.|..+
T Consensus 266 ~CIC~~Gf~G~---dC~e~~--Cp~--~cs~~g~~~----~g--~CiC~~g~~G~dC 309 (525)
T KOG1225|consen 266 RCICPPGFTGD---DCDELV--CPV--DCSGGGVCV----DG--ECICNPGYSGKDC 309 (525)
T ss_pred eEeCCCCCcCC---CCCccc--CCc--ccCCCceec----CC--EeecCCCcccccc
Confidence 68888888888 776643 654 376666563 22 7888888877644
No 13
>KOG4260|consensus
Probab=98.14 E-value=2.3e-06 Score=68.73 Aligned_cols=130 Identities=25% Similarity=0.648 Sum_probs=82.7
Q ss_pred CCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccC-----CCC----CCC-CCCCCCCCEeeecCCCCCeEE-e
Q psy5752 15 TPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPE-----SLG----CNV-VKNCHANAECVYNATSAGYRC-Q 83 (222)
Q Consensus 15 ~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~-----~~~----C~~-~~~C~~~~~C~~~n~~g~~~C-~ 83 (222)
.||.-+|.|....++ .|+. +|.|.+||+|.--+.|... .++ |.. ...| .+.| ....+-.| +
T Consensus 150 r~C~GnG~C~GdGsR--~GsG-kCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C--~~~C---sg~~~k~C~k 221 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSR--EGSG-KCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC--LGVC---SGESSKGCSK 221 (350)
T ss_pred CCcCCCCcccCCCCC--CCCC-cccccCCCCCccccccchHHHHhhcccccchhhhhhhhh--hccc---CCCCCCChhh
Confidence 578888899875441 2455 8999999999822222211 000 111 1233 2256 34444455 6
Q ss_pred cCCCCccCCCeeecCCCcCCCCCCCCC--------------CCCCcccCCCccccCCCCCCCCCCCC-CCCeeeeCCCCe
Q psy5752 84 CAQGYVGNGVECHPLKSCLEDRSLCGK--------------DSQGYVGNGVECHPLKSCLEDRSLCG-KDASCVVASQGH 148 (222)
Q Consensus 84 C~~G~~~~~~~C~~~~~C~~~~~~C~~--------------c~~g~~~~g~~C~~id~C~~~~~~C~-~~~~C~n~~~g~ 148 (222)
|+.||..+...|.|||+|+..+..|.. +.+||.. ++|+|..-...|. .+..|+|+. +.
T Consensus 222 CkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~------g~d~C~~~~d~~~~kn~~c~ni~-~~ 294 (350)
T KOG4260|consen 222 CKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK------GVDECQFCADVCASKNRPCMNID-GQ 294 (350)
T ss_pred hcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccC------ChHHhhhhhhhcccCCCCcccCC-cc
Confidence 999999888889999999988877754 1233322 2677765333453 356899997 55
Q ss_pred eEeecCCCCcc
Q psy5752 149 FHCECNEGFTG 159 (222)
Q Consensus 149 ~~C~C~~G~~g 159 (222)
|+|.|..|+.-
T Consensus 295 ~r~v~f~~~~~ 305 (350)
T KOG4260|consen 295 YRCVCFSGLII 305 (350)
T ss_pred EEEEeccccee
Confidence 99999999873
No 14
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.94 E-value=5e-06 Score=46.73 Aligned_cols=31 Identities=45% Similarity=1.115 Sum_probs=24.8
Q ss_pred CCCCCCCEeeecCCCCCeEEecCCCCccCCCee
Q psy5752 63 KNCHANAECVYNATSAGYRCQCAQGYVGNGVEC 95 (222)
Q Consensus 63 ~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C 95 (222)
+.|+.+|+|+ ++.++|.|+|++||.|++..|
T Consensus 6 ~~C~~nA~C~--~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCT--NTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGS-TTCEEE--E-TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCCcEee--cCCCCEEeECCCCCccCCcCC
Confidence 6788899997 888899999999999998754
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.93 E-value=3.9e-06 Score=49.00 Aligned_cols=31 Identities=29% Similarity=0.775 Sum_probs=27.7
Q ss_pred CCCCC---CCCCCCCCeecCCCCCCCCCCceeeecCCCCc
Q psy5752 9 SLSEC---DTPCHPNAQCTSPDEFNESREQAKCICNFGYT 45 (222)
Q Consensus 9 ~~~~C---~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~ 45 (222)
||||| .+.|..++.|+| + .|+| .|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N--~---~Gsy-~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVN--T---EGSY-SCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEE--E---TTEE-EEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEc--C---CCCE-EeeCCCCcE
Confidence 78999 468998999999 4 7888 999999998
No 16
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.90 E-value=2.3e-05 Score=44.56 Aligned_cols=38 Identities=29% Similarity=0.882 Sum_probs=29.0
Q ss_pred cCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCcc
Q psy5752 124 PLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITC 164 (222)
Q Consensus 124 ~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C 164 (222)
++++|... ..|.++++|+++.++ |.|.|++||. ++..|
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~-~~C~C~~g~~-~g~~C 38 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGS-YRCECPPGYT-DGRNC 38 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCC-eEeECCCCCc-cCCcC
Confidence 36778753 459888899999754 9999999998 33555
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.85 E-value=4.9e-06 Score=45.57 Aligned_cols=29 Identities=31% Similarity=0.762 Sum_probs=24.3
Q ss_pred CCCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752 14 DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGD 47 (222)
Q Consensus 14 ~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~ 47 (222)
++||.|+|+|+... .+.| +|.|++||+|+
T Consensus 3 ~~~C~n~g~C~~~~----~~~y-~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTCIDLP----GGGY-TCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEEEEES----TSEE-EEEEBTTEEST
T ss_pred CCcCCCCeEEEeCC----CCCE-EeECCCCCccC
Confidence 67999999999942 2566 99999999986
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.84 E-value=2.6e-05 Score=44.34 Aligned_cols=33 Identities=33% Similarity=0.916 Sum_probs=27.8
Q ss_pred CCCCC-C-CCCCCCCeecCCCCCCCCCCceeeecCCCCc-cC
Q psy5752 9 SLSEC-D-TPCHPNAQCTSPDEFNESREQAKCICNFGYT-GD 47 (222)
Q Consensus 9 ~~~~C-~-~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~-g~ 47 (222)
++++| . .+|.++++|++. .++| .|.|++||. |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~-----~g~~-~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNT-----VGSY-RCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECC-----CCCe-EeECCCCCccCC
Confidence 57899 5 799999999994 6666 999999999 65
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.78 E-value=2.3e-05 Score=42.87 Aligned_cols=27 Identities=33% Similarity=0.960 Sum_probs=23.6
Q ss_pred CCCCCCeeeeCCCCeeEeecCCCCccC
Q psy5752 134 LCGKDASCVVASQGHFHCECNEGFTGN 160 (222)
Q Consensus 134 ~C~~~~~C~n~~~g~~~C~C~~G~~gd 160 (222)
+|.++++|++...+.|.|.|++||+|+
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 599999999987466999999999984
No 20
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.74 E-value=2.6e-05 Score=39.44 Aligned_cols=22 Identities=23% Similarity=0.915 Sum_probs=19.4
Q ss_pred eeEeecCCCCc--cCCCccccCCC
Q psy5752 148 HFHCECNEGFT--GNGITCKPVRK 169 (222)
Q Consensus 148 ~~~C~C~~G~~--gd~~~C~~~~~ 169 (222)
+|+|+|++||. .|+.+|.+|++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 49999999999 57889999975
No 21
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.63 E-value=4.5e-05 Score=38.59 Aligned_cols=22 Identities=32% Similarity=0.973 Sum_probs=16.8
Q ss_pred CeEEecCCCCc--cCCCeeecCCC
Q psy5752 79 GYRCQCAQGYV--GNGVECHPLKS 100 (222)
Q Consensus 79 ~~~C~C~~G~~--~~~~~C~~~~~ 100 (222)
||+|+|++||. .++..|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 68999999998 45666666654
No 22
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.53 E-value=6.5e-05 Score=42.11 Aligned_cols=32 Identities=28% Similarity=0.782 Sum_probs=23.1
Q ss_pred CCCCCCCCCCeeeeCCCCeeEeecCCCCcc--CCCcc
Q psy5752 130 EDRSLCGKDASCVVASQGHFHCECNEGFTG--NGITC 164 (222)
Q Consensus 130 ~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g--d~~~C 164 (222)
...+.|.+ .|+++++ +|+|.|++||.. |+++|
T Consensus 3 ~~NGgC~h--~C~~~~g-~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 3 VNNGGCSH--ICVNTPG-SYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp TGGGGSSS--EEEEETT-SEEEE-STTEEE-TTSSSE
T ss_pred CCCCCcCC--CCccCCC-ceEeECCCCCEECcCCCCC
Confidence 34566865 8999975 599999999994 55554
No 23
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.48 E-value=5.4e-05 Score=57.81 Aligned_cols=130 Identities=26% Similarity=0.674 Sum_probs=72.1
Q ss_pred CCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCC----CCCCCCCCEeeec---CCCCCeEEecCCCC
Q psy5752 16 PCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNV----VKNCHANAECVYN---ATSAGYRCQCAQGY 88 (222)
Q Consensus 16 ~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~----~~~C~~~~~C~~~---n~~g~~~C~C~~G~ 88 (222)
.|.| |.-+. +...|.|.|.+||.-...++|+...+ |.. ..+|+..+.|+.. .....|.|.|.+||
T Consensus 7 ~CKN-G~LiQ------MSNHfEC~Cnegfvl~~EntCE~kv~-C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY 78 (197)
T PF06247_consen 7 ICKN-GYLIQ------MSNHFECKCNEGFVLKNENTCEEKVE-CDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGY 78 (197)
T ss_dssp --BT-EEEEE------ESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTE
T ss_pred cccC-CEEEE------ccCceEEEcCCCcEEcccccccccee-cCcccccCccccchhhhhcCCCcccceeEEEecccCc
Confidence 4553 45554 45556999999998544458988774 764 4789999999731 12367999999999
Q ss_pred ccCCCeeecCCCcCCCCCCCCC----------------CCCCcc-cCCCcccc--CCCCCCCCCCCCCCCeeeeCCCCee
Q psy5752 89 VGNGVECHPLKSCLEDRSLCGK----------------DSQGYV-GNGVECHP--LKSCLEDRSLCGKDASCVVASQGHF 149 (222)
Q Consensus 89 ~~~~~~C~~~~~C~~~~~~C~~----------------c~~g~~-~~g~~C~~--id~C~~~~~~C~~~~~C~n~~~g~~ 149 (222)
+.....|.+. .|.. -.|.. |.-|+. .+...|.. ..+|++ .|..+..|.... +.|
T Consensus 79 ~~~~~vCvp~-~C~~--~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~L---KCk~nE~CK~~~-~~Y 151 (197)
T PF06247_consen 79 ILKQGVCVPN-KCNN--KDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSL---KCKENEECKLVD-GYY 151 (197)
T ss_dssp EESSSSEEEG-GGSS-----TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE-----------TTTEEEEEET-TEE
T ss_pred eeeCCeEchh-hcCc--eecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceee---ecCCCcceeeeC-cEE
Confidence 9665555431 1111 11111 333444 34445542 355654 277778899884 779
Q ss_pred EeecCCCCccC
Q psy5752 150 HCECNEGFTGN 160 (222)
Q Consensus 150 ~C~C~~G~~gd 160 (222)
+|.|.+||.++
T Consensus 152 ~C~~~~~~~~~ 162 (197)
T PF06247_consen 152 KCVCKEGFPGD 162 (197)
T ss_dssp EEEE-TT-EEE
T ss_pred EeecCCCCCCC
Confidence 99999999864
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.44 E-value=0.00021 Score=39.98 Aligned_cols=33 Identities=33% Similarity=0.925 Sum_probs=27.4
Q ss_pred CCCCC-C-CCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752 9 SLSEC-D-TPCHPNAQCTSPDEFNESREQAKCICNFGYTGD 47 (222)
Q Consensus 9 ~~~~C-~-~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~ 47 (222)
++++| . .+|.+++.|++. .+.| .|.|++||.|.
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~-----~~~~-~C~C~~g~~g~ 35 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNT-----VGSY-RCSCPPGYTGR 35 (38)
T ss_pred CcccCCCCCCcCCCCEeECC-----CCCe-EeECCCCCcCC
Confidence 46888 5 789999999984 5565 99999999986
No 25
>KOG1225|consensus
Probab=97.41 E-value=0.00047 Score=61.41 Aligned_cols=58 Identities=33% Similarity=0.995 Sum_probs=45.0
Q ss_pred CCCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCC
Q psy5752 13 CDTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNG 92 (222)
Q Consensus 13 C~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~ 92 (222)
|+..|..++.+++ . .|.|++||.|. .|+... |. ..|..++.|+ . -+|.|.+||+|..
T Consensus 283 Cp~~cs~~g~~~~--g--------~CiC~~g~~G~---dCs~~~--cp--adC~g~G~Ci----~--G~C~C~~Gy~G~~ 339 (525)
T KOG1225|consen 283 CPVDCSGGGVCVD--G--------ECICNPGYSGK---DCSIRR--CP--ADCSGHGKCI----D--GECLCDEGYTGEL 339 (525)
T ss_pred CCcccCCCceecC--C--------EeecCCCcccc---cccccc--CC--ccCCCCCccc----C--CceEeCCCCcCCc
Confidence 5455877788876 2 79999999999 887654 43 6898899995 2 2799999999874
Q ss_pred C
Q psy5752 93 V 93 (222)
Q Consensus 93 ~ 93 (222)
+
T Consensus 340 C 340 (525)
T KOG1225|consen 340 C 340 (525)
T ss_pred c
Confidence 3
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.28 E-value=0.00049 Score=38.43 Aligned_cols=33 Identities=27% Similarity=0.869 Sum_probs=26.2
Q ss_pred CCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCcc
Q psy5752 125 LKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTG 159 (222)
Q Consensus 125 id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g 159 (222)
+++|... ..|.+++.|++..++ |+|.|++||.|
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~~~-~~C~C~~g~~g 34 (38)
T cd00054 2 IDECASG-NPCQNGGTCVNTVGS-YRCSCPPGYTG 34 (38)
T ss_pred cccCCCC-CCcCCCCEeECCCCC-eEeECCCCCcC
Confidence 5677652 358878899999755 99999999988
No 27
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.06 E-value=0.0013 Score=36.10 Aligned_cols=27 Identities=30% Similarity=0.924 Sum_probs=22.8
Q ss_pred CCCCCCCeeeeCCCCeeEeecCCCCccC
Q psy5752 133 SLCGKDASCVVASQGHFHCECNEGFTGN 160 (222)
Q Consensus 133 ~~C~~~~~C~n~~~g~~~C~C~~G~~gd 160 (222)
..|.+++.|++..++ |+|.|++||.++
T Consensus 6 ~~C~~~~~C~~~~~~-~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGS-YRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCC-eEeECCCCCccc
Confidence 458878899999755 999999999985
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.95 E-value=0.0018 Score=35.73 Aligned_cols=26 Identities=31% Similarity=1.048 Sum_probs=21.6
Q ss_pred CCCCCCCeeeeCCCCeeEeecCCCCccC
Q psy5752 133 SLCGKDASCVVASQGHFHCECNEGFTGN 160 (222)
Q Consensus 133 ~~C~~~~~C~n~~~g~~~C~C~~G~~gd 160 (222)
+.|.++ .|+++. ++|.|.|++||.++
T Consensus 6 ~~C~~~-~C~~~~-~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTP-GSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCC-CCeEeECCCCCccC
Confidence 458877 899996 55999999999984
No 29
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.65 E-value=0.0039 Score=34.04 Aligned_cols=27 Identities=41% Similarity=1.078 Sum_probs=23.9
Q ss_pred CCCCCCCEeeecCCCCCeEEecCCCCccC
Q psy5752 63 KNCHANAECVYNATSAGYRCQCAQGYVGN 91 (222)
Q Consensus 63 ~~C~~~~~C~~~n~~g~~~C~C~~G~~~~ 91 (222)
.+|.++++|+ +..++|.|.|+.||.+.
T Consensus 6 ~~C~~~~~C~--~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCV--NTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEe--cCCCCeEeECCCCCccc
Confidence 6788889997 88899999999999876
No 30
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.62 E-value=0.0031 Score=34.69 Aligned_cols=26 Identities=38% Similarity=0.921 Sum_probs=21.5
Q ss_pred CCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752 15 TPCHPNAQCTSPDEFNESREQAKCICNFGYTGD 47 (222)
Q Consensus 15 ~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~ 47 (222)
.+|.++ .|++. .+.| .|.|++||.|.
T Consensus 6 ~~C~~~-~C~~~-----~~~~-~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINT-----PGSY-TCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECC-----CCCe-EeECCCCCccC
Confidence 589888 99983 5666 99999999983
No 31
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=96.14 E-value=0.0046 Score=34.51 Aligned_cols=31 Identities=29% Similarity=0.687 Sum_probs=23.3
Q ss_pred CCCCCCeeeeCCCCeeEeecCCCCccCCCcc
Q psy5752 134 LCGKDASCVVASQGHFHCECNEGFTGNGITC 164 (222)
Q Consensus 134 ~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C 164 (222)
.|..++.|++..+|++.|.|..||..++..|
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred cCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 4888899999987889999999999765555
No 32
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=95.98 E-value=0.008 Score=48.12 Aligned_cols=39 Identities=26% Similarity=0.722 Sum_probs=32.9
Q ss_pred CCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCcc
Q psy5752 118 NGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTG 159 (222)
Q Consensus 118 ~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g 159 (222)
.+..|.++++|...++.|.+ .|.++. |+|.|.|++||+.
T Consensus 180 ~~~~C~~~~~C~~~~~~c~~--~C~~~~-g~~~c~c~~g~~~ 218 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQQ--VCISTP-GSYLCACTEGYAL 218 (224)
T ss_pred ccccCcCchhhcCCCCCccc--eEEcCC-CCEEeECCCCccC
Confidence 45689999999887888975 799997 6699999999984
No 33
>KOG1226|consensus
Probab=95.48 E-value=0.091 Score=48.55 Aligned_cols=69 Identities=23% Similarity=0.710 Sum_probs=49.1
Q ss_pred CCCCCCeecCCCCCCCCCCceeeecCCCCc----cCCCCCcccCCCCCCC--CCCCCCCCEeeecCCCCCeEEecCCCCc
Q psy5752 16 PCHPNAQCTSPDEFNESREQAKCICNFGYT----GDGVTECNPESLGCNV--VKNCHANAECVYNATSAGYRCQCAQGYV 89 (222)
Q Consensus 16 ~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~----g~~~~~C~~~~~~C~~--~~~C~~~~~C~~~n~~g~~~C~C~~G~~ 89 (222)
+|...|.|+= . .|.|.+... |. .|+-++-.|.. ...|..++.|. = -+|.|.+||+
T Consensus 515 vCSgrG~C~C--G--------qC~C~~~~~~~i~G~---fCECDnfsC~r~~g~lC~g~G~C~----C--G~CvC~~Gwt 575 (783)
T KOG1226|consen 515 VCSGRGDCVC--G--------QCVCHKPDNGKIYGK---FCECDNFSCERHKGVLCGGHGRCE----C--GRCVCNPGWT 575 (783)
T ss_pred CcCCCCcEeC--C--------ceEecCCCCCceeee---eeeccCcccccccCcccCCCCeEe----C--CcEEcCCCCc
Confidence 7888888876 2 799987776 55 88866545665 34688888884 1 2799999999
Q ss_pred cCCCeee-cCCCcCC
Q psy5752 90 GNGVECH-PLKSCLE 103 (222)
Q Consensus 90 ~~~~~C~-~~~~C~~ 103 (222)
|..+.|. +.+.|..
T Consensus 576 G~~C~C~~std~C~~ 590 (783)
T KOG1226|consen 576 GSACNCPLSTDTCES 590 (783)
T ss_pred cCCCCCCCCCccccC
Confidence 9988775 3344543
No 34
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.47 E-value=0.023 Score=30.89 Aligned_cols=25 Identities=40% Similarity=1.034 Sum_probs=20.5
Q ss_pred CCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752 15 TPCHPNAQCTSPDEFNESREQAKCICNFGYTGD 47 (222)
Q Consensus 15 ~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~ 47 (222)
..|.++|+|+.. .. .|.|.+||+|+
T Consensus 6 ~~C~~~G~C~~~-------~g-~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP-------CG-RCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC-------CC-EEECCCCCcCC
Confidence 369999999972 12 89999999998
No 35
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.44 E-value=0.0062 Score=26.06 Aligned_cols=11 Identities=45% Similarity=1.078 Sum_probs=9.0
Q ss_pred eeecCCCCccC
Q psy5752 37 KCICNFGYTGD 47 (222)
Q Consensus 37 ~C~C~~G~~g~ 47 (222)
.|+|++||+|.
T Consensus 1 ~C~C~~G~~G~ 11 (13)
T PF12661_consen 1 TCQCPPGWTGP 11 (13)
T ss_dssp EEEE-TTEETT
T ss_pred CccCcCCCcCC
Confidence 49999999998
No 36
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.40 E-value=0.011 Score=33.03 Aligned_cols=21 Identities=43% Similarity=0.931 Sum_probs=16.5
Q ss_pred CEeeecCCCCCeEEecCCCCccC
Q psy5752 69 AECVYNATSAGYRCQCAQGYVGN 91 (222)
Q Consensus 69 ~~C~~~n~~g~~~C~C~~G~~~~ 91 (222)
.+|+ +++++|+|.|++||...
T Consensus 10 h~C~--~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 10 HICV--NTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SEEE--EETTSEEEE-STTEEE-
T ss_pred CCCc--cCCCceEeECCCCCEEC
Confidence 4897 88999999999999843
No 37
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=95.21 E-value=0.015 Score=32.53 Aligned_cols=35 Identities=34% Similarity=0.769 Sum_probs=23.3
Q ss_pred CCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCCee
Q psy5752 59 CNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGVEC 95 (222)
Q Consensus 59 C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C 95 (222)
|.. ..|..++.|+ +...|++.|+|..||..++..|
T Consensus 2 C~~-~~cP~NA~C~-~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 2 CID-TKCPANAGCF-RYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp -SS-S---TTEEEE-EETTSEEEEEE-TTEEEETTEE
T ss_pred ccC-ccCCCCcccE-EcCCCCEEEEeeCCccccCCCc
Confidence 444 6777899998 4666999999999998665544
No 38
>KOG1226|consensus
Probab=95.20 E-value=0.21 Score=46.26 Aligned_cols=120 Identities=24% Similarity=0.710 Sum_probs=63.0
Q ss_pred eeecCCCCccCCCCCcccCC---------CCCCC--C-CCCCCCCEeeecCCCCCeEEecCCCCc----cCCCeeecCCC
Q psy5752 37 KCICNFGYTGDGVTECNPES---------LGCNV--V-KNCHANAECVYNATSAGYRCQCAQGYV----GNGVECHPLKS 100 (222)
Q Consensus 37 ~C~C~~G~~g~~~~~C~~~~---------~~C~~--~-~~C~~~~~C~~~n~~g~~~C~C~~G~~----~~~~~C~~~~~ 100 (222)
.|.|.+||.|+ .|+=.. +.|.. . ..|...+.|+ =| +|.|.+... |..++|.+. .
T Consensus 479 ~C~C~~G~~G~---~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~----CG--qC~C~~~~~~~i~G~fCECDnf-s 548 (783)
T KOG1226|consen 479 QCRCDEGWLGK---KCECSTDELSSSEEEDKCRENSDSPVCSGRGDCV----CG--QCVCHKPDNGKIYGKFCECDNF-S 548 (783)
T ss_pred ceecCCCCCCC---cccCCccccCcHhHHhhccCCCCCCCcCCCCcEe----CC--ceEecCCCCCceeeeeeeccCc-c
Confidence 57999999998 665211 12432 1 2566666664 11 577766554 333333222 1
Q ss_pred cCCC-------CCC--CCC--CCCCcccCCCccc-cCCCCCCCC-CCCCCCCeeeeCCCCeeEeecCCC-CccCCCcccc
Q psy5752 101 CLED-------RSL--CGK--DSQGYVGNGVECH-PLKSCLEDR-SLCGKDASCVVASQGHFHCECNEG-FTGNGITCKP 166 (222)
Q Consensus 101 C~~~-------~~~--C~~--c~~g~~~~g~~C~-~id~C~~~~-~~C~~~~~C~n~~~g~~~C~C~~G-~~gd~~~C~~ 166 (222)
|... ++. |+. |..||+|..-.|. +.+.|.... ..|...++|. =| +|.|... |.| .+|+.
T Consensus 549 C~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~---Cg--~C~C~~~~~sG--~~CE~ 621 (783)
T KOG1226|consen 549 CERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCE---CG--RCKCTDPPYSG--EFCEK 621 (783)
T ss_pred cccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceee---CC--ceEcCCCCcCc--chhhc
Confidence 2211 111 222 5778877633332 346664422 2355555554 13 4777665 777 77877
Q ss_pred CCCCCCC
Q psy5752 167 VRKKESD 173 (222)
Q Consensus 167 ~~~~~~~ 173 (222)
-+.+...
T Consensus 622 cptc~~~ 628 (783)
T KOG1226|consen 622 CPTCPDP 628 (783)
T ss_pred CCCCCCc
Confidence 6666554
No 39
>KOG0994|consensus
Probab=94.51 E-value=0.1 Score=50.31 Aligned_cols=53 Identities=28% Similarity=0.651 Sum_probs=29.8
Q ss_pred ee-ecCCCCccCCCCCcccCCCCCCCCCCCC--------CCCEeeecCCCCCeEEecCCCCccCCC
Q psy5752 37 KC-ICNFGYTGDGVTECNPESLGCNVVKNCH--------ANAECVYNATSAGYRCQCAQGYVGNGV 93 (222)
Q Consensus 37 ~C-~C~~G~~g~~~~~C~~~~~~C~~~~~C~--------~~~~C~~~n~~g~~~C~C~~G~~~~~~ 93 (222)
+| +|..||.|+++ .=..+ .|.+ -+|. ....|.-.+......|.|.+||+|..+
T Consensus 886 ~CdrCl~GyyGdP~-lg~g~--~CrP-CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RC 947 (1758)
T KOG0994|consen 886 SCDRCLDGYYGDPR-LGSGI--GCRP-CPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRC 947 (1758)
T ss_pred chhhhhccccCCcc-cCCCC--CCCC-CCCCCCCccchhccccccccccccceeeecccCccccch
Confidence 67 89999998754 11111 1333 2332 122453112334567999999998754
No 40
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.04 E-value=0.1 Score=28.33 Aligned_cols=23 Identities=35% Similarity=1.014 Sum_probs=19.5
Q ss_pred CCCCCCeeeeCCCCeeEeecCCCCcc
Q psy5752 134 LCGKDASCVVASQGHFHCECNEGFTG 159 (222)
Q Consensus 134 ~C~~~~~C~n~~~g~~~C~C~~G~~g 159 (222)
.|..+++|++.. .+|.|.+||+|
T Consensus 7 ~C~~~G~C~~~~---g~C~C~~g~~G 29 (32)
T PF07974_consen 7 ICSGHGTCVSPC---GRCVCDSGYTG 29 (32)
T ss_pred ccCCCCEEeCCC---CEEECCCCCcC
Confidence 488899999762 57999999998
No 41
>KOG0994|consensus
Probab=93.02 E-value=0.42 Score=46.36 Aligned_cols=64 Identities=28% Similarity=0.687 Sum_probs=37.7
Q ss_pred EEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCC--Cccc---cCCCCCCCCCCCCCCCeeeeCCCCeeEe-ecC
Q psy5752 81 RCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNG--VECH---PLKSCLEDRSLCGKDASCVVASQGHFHC-ECN 154 (222)
Q Consensus 81 ~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g--~~C~---~id~C~~~~~~C~~~~~C~n~~~g~~~C-~C~ 154 (222)
+|.|.+|.-|.. |..|.+||++-. +.|. ..+.|....+.|.. |.+...| +.| .|.
T Consensus 831 QC~C~~g~ygrq---------------CnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi~---CqD~T~G-~~CdrCl 891 (1758)
T KOG0994|consen 831 QCQCRPGTYGRQ---------------CNQCQPGYWGFPECRPCQCNGHADTCDPITGACID---CQDSTTG-HSCDRCL 891 (1758)
T ss_pred ceeeccccchhh---------------ccccCCCccCCCcCccccccCcccccCcccccccc---ccccccc-cchhhhh
Confidence 677776654442 344566665321 2222 13556555555643 7777667 888 799
Q ss_pred CCCccCCCc
Q psy5752 155 EGFTGNGIT 163 (222)
Q Consensus 155 ~G~~gd~~~ 163 (222)
.||+||...
T Consensus 892 ~GyyGdP~l 900 (1758)
T KOG0994|consen 892 DGYYGDPRL 900 (1758)
T ss_pred ccccCCccc
Confidence 999987643
No 42
>PHA02887 EGF-like protein; Provisional
Probab=92.64 E-value=0.13 Score=36.35 Aligned_cols=40 Identities=38% Similarity=0.787 Sum_probs=30.3
Q ss_pred CCCCC----CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCC
Q psy5752 9 SLSEC----DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPES 56 (222)
Q Consensus 9 ~~~~C----~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~ 56 (222)
-+.+| .+-|- ||.|..... .... .|.|+.||+|. .|+.++
T Consensus 82 hf~pC~~eyk~YCi-HG~C~yI~d---L~ep-sCrC~~GYtG~---RCE~vs 125 (126)
T PHA02887 82 FFEKCKNDFNDFCI-NGECMNIID---LDEK-FCICNKGYTGI---RCDEVS 125 (126)
T ss_pred CccccChHhhCEee-CCEEEcccc---CCCc-eeECCCCcccC---CCCccc
Confidence 46778 36787 579998644 5555 89999999999 887653
No 43
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=90.09 E-value=0.24 Score=35.69 Aligned_cols=40 Identities=28% Similarity=0.665 Sum_probs=29.9
Q ss_pred CCCCC----CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCC
Q psy5752 9 SLSEC----DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPES 56 (222)
Q Consensus 9 ~~~~C----~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~ 56 (222)
++.+| .+-|-|+ .|..... ...+ .|.|..||+|. .|+..+
T Consensus 41 ~i~~Cp~ey~~YClHG-~C~yI~d---l~~~-~CrC~~GYtGe---RCEh~d 84 (139)
T PHA03099 41 AIRLCGPEGDGYCLHG-DCIHARD---IDGM-YCRCSHGYTGI---RCQHVV 84 (139)
T ss_pred ccccCChhhCCEeECC-EEEeecc---CCCc-eeECCCCcccc---ccccee
Confidence 35667 3668764 8987654 5666 89999999999 888654
No 44
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=90.04 E-value=0.15 Score=39.27 Aligned_cols=75 Identities=23% Similarity=0.594 Sum_probs=42.6
Q ss_pred CCCCCCCEeeecCCCCCeEEecCCCCc-cCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCee
Q psy5752 63 KNCHANAECVYNATSAGYRCQCAQGYV-GNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASC 141 (222)
Q Consensus 63 ~~C~~~~~C~~~n~~g~~~C~C~~G~~-~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C 141 (222)
..|.+ +..+ ...+.|.|.|.+||. -+..+|+...+|... ......|...++|
T Consensus 6 T~CKN-G~Li--QMSNHfEC~Cnegfvl~~EntCE~kv~C~~~------------------------e~~~K~Cgdya~C 58 (197)
T PF06247_consen 6 TICKN-GYLI--QMSNHFECKCNEGFVLKNENTCEEKVECDKL------------------------ENVNKPCGDYAKC 58 (197)
T ss_dssp ---BT-EEEE--EESSEEEEEESTTEEEEETTEEEE----SG-------------------------GGTTSEEETTEEE
T ss_pred ccccC-CEEE--EccCceEEEcCCCcEEccccccccceecCcc------------------------cccCccccchhhh
Confidence 45543 4454 456789999999997 344555555444321 0112348778899
Q ss_pred eeCCC----CeeEeecCCCCccCCCcc
Q psy5752 142 VVASQ----GHFHCECNEGFTGNGITC 164 (222)
Q Consensus 142 ~n~~~----g~~~C~C~~G~~gd~~~C 164 (222)
++.+. ..|.|.|.+||......|
T Consensus 59 ~~~~~~~~~~~~~C~C~~gY~~~~~vC 85 (197)
T PF06247_consen 59 INQANKGEERAYKCDCINGYILKQGVC 85 (197)
T ss_dssp EE-SSTTSSTSEEEEE-TTEEESSSSE
T ss_pred hcCCCcccceeEEEecccCceeeCCeE
Confidence 97763 459999999999543343
No 45
>smart00051 DSL delta serrate ligand.
Probab=89.17 E-value=0.82 Score=28.93 Aligned_cols=44 Identities=23% Similarity=0.650 Sum_probs=30.2
Q ss_pred eeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccC
Q psy5752 37 KCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGN 91 (222)
Q Consensus 37 ~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~ 91 (222)
.-.|+++|.|. .|... |.+.+.+..+..| +..| .+.|.+||.|.
T Consensus 18 rv~C~~~~yG~---~C~~~---C~~~~d~~~~~~C---d~~G--~~~C~~Gw~G~ 61 (63)
T smart00051 18 RVTCDENYYGE---GCNKF---CRPRDDFFGHYTC---DENG--NKGCLEGWMGP 61 (63)
T ss_pred EeeCCCCCcCC---ccCCE---eCcCccccCCccC---CcCC--CEecCCCCcCC
Confidence 56899999998 77532 5543445556678 4334 57899999876
No 46
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=87.79 E-value=0.63 Score=37.11 Aligned_cols=36 Identities=28% Similarity=0.712 Sum_probs=28.0
Q ss_pred CcccCCCCCCC-CCCCCCCCEeeecCCCCCeEEecCCCCccC
Q psy5752 51 ECNPESLGCNV-VKNCHANAECVYNATSAGYRCQCAQGYVGN 91 (222)
Q Consensus 51 ~C~~~~~~C~~-~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~ 91 (222)
.|++.+ +|.. .+.|. ..|. ++.|+|.|.|.+||+..
T Consensus 183 ~C~~~~-~C~~~~~~c~--~~C~--~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 183 ICVVPD-LCATLSHVCQ--QVCI--STPGSYLCACTEGYALL 219 (224)
T ss_pred cCcCch-hhcCCCCCcc--ceEE--cCCCCEEeECCCCccCC
Confidence 787666 4875 45674 4796 99999999999999843
No 47
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=85.91 E-value=0.88 Score=32.03 Aligned_cols=32 Identities=31% Similarity=0.707 Sum_probs=24.9
Q ss_pred CCCCC--CCCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752 9 SLSEC--DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGD 47 (222)
Q Consensus 9 ~~~~C--~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~ 47 (222)
..|+| ...|+.++.|.. .... .|.|.+||+-+
T Consensus 76 p~d~Cd~y~~CG~~g~C~~------~~~~-~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS------NNSP-KCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC------CCCC-ceECCCCcCCC
Confidence 46789 689999999965 3344 79999999753
No 48
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=82.14 E-value=1.8 Score=30.41 Aligned_cols=30 Identities=27% Similarity=0.734 Sum_probs=23.9
Q ss_pred CCCCCCCCCCCCEeeecCCCCCeEEecCCCCcc
Q psy5752 58 GCNVVKNCHANAECVYNATSAGYRCQCAQGYVG 90 (222)
Q Consensus 58 ~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~ 90 (222)
.|.....|+..+.| +...+..|.|.+||.-
T Consensus 79 ~Cd~y~~CG~~g~C---~~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 79 QCDVYGFCGPNGIC---NSNNSPKCSCLPGFEP 108 (110)
T ss_pred CCCCccccCCccEe---CCCCCCceECCCCcCC
Confidence 47766899999999 4555668999999963
No 49
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=77.81 E-value=3.9 Score=24.42 Aligned_cols=27 Identities=30% Similarity=0.807 Sum_probs=20.6
Q ss_pred CCC--CCCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752 11 SEC--DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGD 47 (222)
Q Consensus 11 ~~C--~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~ 47 (222)
+.| ...|..++.|++ . .|.|++||.-.
T Consensus 20 ~~C~~~~qC~~~s~C~~--g--------~C~C~~g~~~~ 48 (52)
T PF01683_consen 20 ESCESDEQCIGGSVCVN--G--------RCQCPPGYVEV 48 (52)
T ss_pred CCCCCcCCCCCcCEEcC--C--------EeECCCCCEec
Confidence 446 567888889976 2 89999999744
No 50
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.95 E-value=3.2 Score=30.02 Aligned_cols=32 Identities=25% Similarity=0.690 Sum_probs=24.8
Q ss_pred CCCCCCCCEeeecCCCCCeEEecCCCCccCCCe
Q psy5752 62 VKNCHANAECVYNATSAGYRCQCAQGYVGNGVE 94 (222)
Q Consensus 62 ~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~ 94 (222)
.+.|-++ .|.+......+.|.|..||+|..++
T Consensus 50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred CCEeECC-EEEeeccCCCceeECCCCccccccc
Confidence 4677765 8975556678999999999998654
No 51
>PHA02887 EGF-like protein; Provisional
Probab=75.17 E-value=4 Score=29.06 Aligned_cols=35 Identities=29% Similarity=0.577 Sum_probs=25.6
Q ss_pred CCCC--CCCCCCCCEeeecCCCCCeEEecCCCCccCCC
Q psy5752 58 GCNV--VKNCHANAECVYNATSAGYRCQCAQGYVGNGV 93 (222)
Q Consensus 58 ~C~~--~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~ 93 (222)
+|.. .+.|- ++.|.+......+.|.|.+||+|..+
T Consensus 85 pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~RC 121 (126)
T PHA02887 85 KCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIRC 121 (126)
T ss_pred ccChHhhCEee-CCEEEccccCCCceeECCCCcccCCC
Confidence 3554 46776 56997545567789999999998855
No 52
>KOG3516|consensus
Probab=75.14 E-value=2.8 Score=41.11 Aligned_cols=41 Identities=24% Similarity=0.681 Sum_probs=32.7
Q ss_pred ccCCCCC-CCCCCCCCeecCCCCCCCCCCceeeecC-CCCccCCCCCcccCC
Q psy5752 7 RFSLSEC-DTPCHPNAQCTSPDEFNESREQAKCICN-FGYTGDGVTECNPES 56 (222)
Q Consensus 7 ~~~~~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~-~G~~g~~~~~C~~~~ 56 (222)
|.-++.| +++|++++.|... ...+ .|.|. .||+|. +|...+
T Consensus 542 C~i~drClPN~CehgG~C~Qs-----~~~f-~C~C~~TGY~Ga---tCHtsi 584 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQS-----WDDF-ECNCELTGYKGA---TCHTSI 584 (1306)
T ss_pred cccccccCCccccCCCccccc-----ccce-eEeccccccccc---cccCCC
Confidence 4457788 9999999999983 4455 99998 999998 887543
No 53
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=63.70 E-value=7.5 Score=21.23 Aligned_cols=19 Identities=26% Similarity=0.824 Sum_probs=13.3
Q ss_pred EeeecCCCCCeEEecCCCCccC
Q psy5752 70 ECVYNATSAGYRCQCAQGYVGN 91 (222)
Q Consensus 70 ~C~~~n~~g~~~C~C~~G~~~~ 91 (222)
.|. ......|.|+.||+.+
T Consensus 11 ~CD---pn~~~~C~CPeGyIld 29 (34)
T PF09064_consen 11 DCD---PNSPGQCFCPEGYILD 29 (34)
T ss_pred ccC---CCCCCceeCCCceEec
Confidence 673 3334589999999855
No 54
>KOG3514|consensus
Probab=62.79 E-value=5.5 Score=38.89 Aligned_cols=35 Identities=29% Similarity=0.825 Sum_probs=28.9
Q ss_pred CC-CCCCCCCCeecCCCCCCCCCCceeeecC-CCCccCCCCCcccC
Q psy5752 12 EC-DTPCHPNAQCTSPDEFNESREQAKCICN-FGYTGDGVTECNPE 55 (222)
Q Consensus 12 ~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~-~G~~g~~~~~C~~~ 55 (222)
.| ++||+|+|.|... ...+ .|-|. .||.|. +|+..
T Consensus 625 ~C~~nPC~N~g~C~eg-----wNrf-iCDCs~T~~~G~---~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEG-----WNRF-ICDCSGTGFEGR---TCERE 661 (1591)
T ss_pred ccCCCcccCCCCcccc-----cccc-ccccccCcccCc---cccce
Confidence 68 9999999999984 5566 99995 788888 88754
No 55
>KOG0196|consensus
Probab=62.66 E-value=24 Score=33.81 Aligned_cols=72 Identities=19% Similarity=0.488 Sum_probs=39.7
Q ss_pred EEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCcc-
Q psy5752 81 RCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTG- 159 (222)
Q Consensus 81 ~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g- 159 (222)
.|.|++||.... ....|..|+.|+.... .....|.. |..+..-. . .|+-.|.|..||..
T Consensus 260 ~C~C~aGye~~~-----------~~~~C~aCp~G~yK~~---~~~~~C~~----CP~~S~s~-~-ega~~C~C~~gyyRA 319 (996)
T KOG0196|consen 260 GCVCKAGYEEAE-----------NGKACQACPPGTYKAS---QGDSLCLP----CPPNSHSS-S-EGATSCTCENGYYRA 319 (996)
T ss_pred ceeecCCCCccc-----------CCCcceeCCCCcccCC---CCCCCCCC----CCCCCCCC-C-CCCCcccccCCcccC
Confidence 688999997421 1234666777764210 11234442 55443322 2 35578999999983
Q ss_pred --CCC--ccccCCCCCC
Q psy5752 160 --NGI--TCKPVRKKES 172 (222)
Q Consensus 160 --d~~--~C~~~~~~~~ 172 (222)
|+. -|+..+..+.
T Consensus 320 ~~Dp~~mpCT~PPSaP~ 336 (996)
T KOG0196|consen 320 DSDPPSMPCTRPPSAPR 336 (996)
T ss_pred CCCCCCCCCCCCCCccc
Confidence 332 3776554433
No 56
>KOG1836|consensus
Probab=60.47 E-value=22 Score=37.07 Aligned_cols=49 Identities=22% Similarity=0.494 Sum_probs=25.3
Q ss_pred CCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCC-CCeeEee-cCCCCcc
Q psy5752 108 CGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVAS-QGHFHCE-CNEGFTG 159 (222)
Q Consensus 108 C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~-~g~~~C~-C~~G~~g 159 (222)
|..|..||.++...=..-| |..-+ |..++.|..+. ..+..|. |++||+|
T Consensus 758 C~~C~~GfYg~~~~~~~~d-C~~C~--Cp~~~~~~~~~~~~~~iCk~Cp~gytG 808 (1705)
T KOG1836|consen 758 CAQCVDGFYGLPDLGTSGD-CQPCP--CPNGGACGQTPEILEVVCKNCPPGYTG 808 (1705)
T ss_pred hhhhcCCCCCccccCCCCC-CccCC--CCCChhhcCcCcccceecCCCCCCCcc
Confidence 4456677754311101111 44322 65555665544 2336787 8888887
No 57
>KOG3516|consensus
Probab=60.26 E-value=7.6 Score=38.27 Aligned_cols=41 Identities=29% Similarity=0.821 Sum_probs=32.2
Q ss_pred ccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecC-CCCccCCCcccc
Q psy5752 121 ECHPLKSCLEDRSLCGKDASCVVASQGHFHCECN-EGFTGNGITCKP 166 (222)
Q Consensus 121 ~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~-~G~~gd~~~C~~ 166 (222)
.|.-+|.|.. +.|.+++.|..+. ..|.|.|. .||+| .+|..
T Consensus 541 ~C~i~drClP--N~CehgG~C~Qs~-~~f~C~C~~TGY~G--atCHt 582 (1306)
T KOG3516|consen 541 MCGISDRCLP--NPCEHGGKCSQSW-DDFECNCELTGYKG--ATCHT 582 (1306)
T ss_pred ccccccccCC--ccccCCCcccccc-cceeEecccccccc--ccccC
Confidence 4556678875 4599999999875 55999998 89998 77764
No 58
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=49.32 E-value=12 Score=26.14 Aligned_cols=33 Identities=24% Similarity=0.616 Sum_probs=23.9
Q ss_pred CCCCCCCCCCCCCCeeeeCC----CCeeEeecCCCCc
Q psy5752 126 KSCLEDRSLCGKDASCVVAS----QGHFHCECNEGFT 158 (222)
Q Consensus 126 d~C~~~~~~C~~~~~C~n~~----~g~~~C~C~~G~~ 158 (222)
++|....+.|..++.|++.. +.-|.|.|.+-+.
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~ 42 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVV 42 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeecccc
Confidence 45656667798899999773 2458999987554
No 59
>KOG3512|consensus
Probab=40.82 E-value=34 Score=30.55 Aligned_cols=14 Identities=29% Similarity=0.370 Sum_probs=11.6
Q ss_pred eeeecCCcEEEEcC
Q psy5752 174 FLLVNQGMFMLRVP 187 (222)
Q Consensus 174 ~~~~~~g~~~~~~~ 187 (222)
.++...+...|+||
T Consensus 527 ~lwv~~~~i~c~cp 540 (592)
T KOG3512|consen 527 SLWVPAGDIACKCP 540 (592)
T ss_pred eeeecCCCceecCC
Confidence 46888888889988
No 60
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=40.10 E-value=16 Score=21.37 Aligned_cols=14 Identities=43% Similarity=1.232 Sum_probs=11.9
Q ss_pred eeecCCCCccCCCCCcc
Q psy5752 37 KCICNFGYTGDGVTECN 53 (222)
Q Consensus 37 ~C~C~~G~~g~~~~~C~ 53 (222)
.|.|+++|+|. .|+
T Consensus 19 ~C~C~~~~~G~---~C~ 32 (49)
T PF00053_consen 19 QCVCKPGTTGP---RCD 32 (49)
T ss_dssp EESBSTTEEST---TS-
T ss_pred EEeccccccCC---cCc
Confidence 89999999999 776
No 61
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=39.37 E-value=39 Score=19.86 Aligned_cols=14 Identities=36% Similarity=0.956 Sum_probs=12.0
Q ss_pred eeecCCCCccCCCCCcc
Q psy5752 37 KCICNFGYTGDGVTECN 53 (222)
Q Consensus 37 ~C~C~~G~~g~~~~~C~ 53 (222)
.|.|+++|+|. .|+
T Consensus 20 ~C~C~~~~~G~---~C~ 33 (50)
T cd00055 20 QCECKPNTTGR---RCD 33 (50)
T ss_pred EEeCCCcCCCC---CCC
Confidence 79999999998 664
No 62
>KOG3514|consensus
Probab=33.81 E-value=69 Score=31.86 Aligned_cols=35 Identities=31% Similarity=0.849 Sum_probs=27.5
Q ss_pred CCCCCCCCCCCCCeeeeCCCCeeEeecC-CCCccCCCcccc
Q psy5752 127 SCLEDRSLCGKDASCVVASQGHFHCECN-EGFTGNGITCKP 166 (222)
Q Consensus 127 ~C~~~~~~C~~~~~C~n~~~g~~~C~C~-~G~~gd~~~C~~ 166 (222)
.|..+| |.++++|....+. |.|.|. .||.| .+|+.
T Consensus 625 ~C~~nP--C~N~g~C~egwNr-fiCDCs~T~~~G--~~Cer 660 (1591)
T KOG3514|consen 625 ICESNP--CQNGGKCSEGWNR-FICDCSGTGFEG--RTCER 660 (1591)
T ss_pred ccCCCc--ccCCCCccccccc-cccccccCcccC--ccccc
Confidence 476655 9999999977644 999996 58888 78875
No 63
>KOG1836|consensus
Probab=33.41 E-value=56 Score=34.26 Aligned_cols=53 Identities=25% Similarity=0.483 Sum_probs=32.8
Q ss_pred ee-ecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEe-cCCCCccCCC
Q psy5752 37 KC-ICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQ-CAQGYVGNGV 93 (222)
Q Consensus 37 ~C-~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~-C~~G~~~~~~ 93 (222)
.| +|..||.|... .+....|.. -+|.+++.|.-........|. |++||+|..+
T Consensus 757 ~C~~C~~GfYg~~~---~~~~~dC~~-C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC 811 (1705)
T KOG1836|consen 757 QCAQCVDGFYGLPD---LGTSGDCQP-CPCPNGGACGQTPEILEVVCKNCPPGYTGLRC 811 (1705)
T ss_pred chhhhcCCCCCccc---cCCCCCCcc-CCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence 35 78888887632 111112665 677777777411124567898 9999998854
No 64
>KOG3509|consensus
Probab=29.83 E-value=1.3e+02 Score=29.61 Aligned_cols=66 Identities=27% Similarity=0.638 Sum_probs=41.5
Q ss_pred CCCC-CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCC-CCCCCCCCEeeecCCCCCeEEecCCC
Q psy5752 10 LSEC-DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNV-VKNCHANAECVYNATSAGYRCQCAQG 87 (222)
Q Consensus 10 ~~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~-~~~C~~~~~C~~~n~~g~~~C~C~~G 87 (222)
.+.| ..+|.+.+.|...+ -.. .|.|++||+|+ .|++-.+.|.. ++.. -.++|. ...+.....|.+|
T Consensus 406 g~~c~~~p~~~~g~c~p~~-----~~~-~c~c~~g~~G~---~c~d~~~~~~~~~~g~-y~~t~~--~~~~~~~~~c~pg 473 (964)
T KOG3509|consen 406 GDVCWRIPCQHDGPCLQTL-----EGK-QCLCPPGYTGD---SCEDCMNGCDRSPNGS-YLGTCV--PIQGKRCEYCGPG 473 (964)
T ss_pred CCccccccCCCCccccccc-----ccc-ceeccccccCc---hhhccCccccccCCcc-ccceEe--ccCCCcceeecCC
Confidence 3566 67898888888853 333 89999999999 77665444443 1222 234553 3333455677777
No 65
>PF02740 Colipase_C: Colipase, C-terminal domain; InterPro: IPR017914 This entry represents the C-terminal domain of colipase proteins. Colipase [, ] is a small protein cofactor needed by pancreatic lipase for efficient dietary lipid hydrolyisis. It also binds to the bile-salt covered triacylglycerol interface, thus allowing the enzyme to anchor itself to the water-lipid interface. Efficient absorption of dietary fats is dependent on the action of pancreatic triglyceride lipase. Colipase binds to the C-terminal, non-catalytic domain of lipase, thereby stabilising as active conformation and considerably increasing the overall hydrophobic binding site. Structural studies of the complex and of colipase alone have revealed the functionality of its architecture [, ]. Colipase is a small protein with five conserved disulphide bonds. Structural analogies have been recognised between a developmental protein (Dickkopf), the pancreatic lipase C-terminal domain, the N-terminal domains of lipoxygenases and the C-terminal domain of alpha-toxin. These non-catalytic domains in the latter enzymes are important for interaction with membrane. It has not been established if these domains are also involved in eventual protein cofactor binding as is the case for pancreatic lipase [].; GO: 0008047 enzyme activator activity, 0007586 digestion, 0016042 lipid catabolic process, 0005576 extracellular region; PDB: 1PCO_A 1PCN_A 1LPA_A 1N8S_C 1LPB_A 1ETH_D.
Probab=26.75 E-value=50 Score=19.12 Aligned_cols=22 Identities=27% Similarity=0.523 Sum_probs=12.7
Q ss_pred CeecCCCCCCCCCCceeeecCCCCc
Q psy5752 21 AQCTSPDEFNESREQAKCICNFGYT 45 (222)
Q Consensus 21 ~~C~~~~~~~~~g~~~~C~C~~G~~ 45 (222)
+.|..... -|.||.|.|.+|.+
T Consensus 3 ~~cs~~tl---yGvYy~CPCE~GLt 24 (45)
T PF02740_consen 3 SECSPKTL---YGVYYKCPCERGLT 24 (45)
T ss_dssp SEEESS-S---SSCBSS--BSTTEE
T ss_pred ccccccEe---eeEEEeccCCCCcE
Confidence 44554434 67888999988764
No 66
>KOG1215|consensus
Probab=24.63 E-value=3.6e+02 Score=26.25 Aligned_cols=62 Identities=26% Similarity=0.553 Sum_probs=36.6
Q ss_pred CCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCccccCCCCCCCeeeecCCcEEEEcCCCC
Q psy5752 126 KSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITCKPVRKKESDFLLVNQGMFMLRVPYQP 190 (222)
Q Consensus 126 d~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C~~~~~~~~~~~~~~~g~~~~~~~~~~ 190 (222)
..|...++.|.+ .|.....+.|.|.|..||......|..... ...+.+.+...-+-..++..
T Consensus 365 ~~~~~~~g~Csq--~C~~~~p~~~~c~c~~g~~~~~~~c~~~~~-~~~~l~~s~~~~ir~~~~~~ 426 (877)
T KOG1215|consen 365 NPCESDNGGCSQ--LCVPNSPGTFKCACSPGYELRLDKCEASDQ-PEAFLLFSNRHDIRRISLDC 426 (877)
T ss_pred CcccccCCccce--eccCCCCCceeEecCCCcEeccCCceecCC-CCcEEEEecCccceecccCC
Confidence 455556677865 788433466999999999953333544332 23455555555544444433
No 67
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=23.82 E-value=61 Score=18.69 Aligned_cols=14 Identities=36% Similarity=0.975 Sum_probs=12.2
Q ss_pred eeecCCCCccCCCCCcc
Q psy5752 37 KCICNFGYTGDGVTECN 53 (222)
Q Consensus 37 ~C~C~~G~~g~~~~~C~ 53 (222)
.|.|+++++|. .|+
T Consensus 19 ~C~C~~~~~G~---~C~ 32 (46)
T smart00180 19 QCECKPNVTGR---RCD 32 (46)
T ss_pred EEECCCCCCCC---CCC
Confidence 79999999998 665
No 68
>KOG0196|consensus
Probab=23.33 E-value=93 Score=30.07 Aligned_cols=46 Identities=28% Similarity=0.803 Sum_probs=28.4
Q ss_pred eeecCCCCcc--CCCCCcccC----------CCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCc
Q psy5752 37 KCICNFGYTG--DGVTECNPE----------SLGCNVVKNCHANAECVYNATSAGYRCQCAQGYV 89 (222)
Q Consensus 37 ~C~C~~G~~g--~~~~~C~~~----------~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~ 89 (222)
.|.|.+||.- .+. .|+.= +..| .+|..+..= ...++..|.|..||-
T Consensus 260 ~C~C~aGye~~~~~~-~C~aCp~G~yK~~~~~~~C---~~CP~~S~s---~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 260 GCVCKAGYEEAENGK-ACQACPPGTYKASQGDSLC---LPCPPNSHS---SSEGATSCTCENGYY 317 (996)
T ss_pred ceeecCCCCcccCCC-cceeCCCCcccCCCCCCCC---CCCCCCCCC---CCCCCCcccccCCcc
Confidence 6999999984 333 56531 1112 234444332 457888999999985
No 69
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=22.02 E-value=47 Score=19.78 Aligned_cols=19 Identities=42% Similarity=0.955 Sum_probs=12.9
Q ss_pred EecCCCCccCCC-eeecCCC
Q psy5752 82 CQCAQGYVGNGV-ECHPLKS 100 (222)
Q Consensus 82 C~C~~G~~~~~~-~C~~~~~ 100 (222)
|.|++||..+.. .|+...+
T Consensus 35 C~C~~G~v~~~~~~CV~~~~ 54 (55)
T PF01826_consen 35 CFCPPGYVRNDNGRCVPPSE 54 (55)
T ss_dssp EEETTTEEEETTSEEEEGGG
T ss_pred CCCCCCeeEcCCCCEEcHHH
Confidence 899999984433 6665543
No 70
>PF11403 Yeast_MT: Yeast metallothionein; InterPro: IPR022710 Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=20.58 E-value=94 Score=16.83 Aligned_cols=9 Identities=33% Similarity=0.918 Sum_probs=4.0
Q ss_pred eeecCCCCc
Q psy5752 37 KCICNFGYT 45 (222)
Q Consensus 37 ~C~C~~G~~ 45 (222)
+|.|+.|-.
T Consensus 23 scscptgcn 31 (40)
T PF11403_consen 23 SCSCPTGCN 31 (40)
T ss_dssp S-SS-TTTT
T ss_pred cCCCCCCCC
Confidence 567766543
Done!