Query psy11797
Match_columns 249
No_of_seqs 245 out of 1876
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 19:14:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11797.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11797hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.3 6.5E-12 1.4E-16 115.1 8.6 158 31-224 692-860 (1289)
2 KOG1214|consensus 99.3 4.5E-11 9.7E-16 109.8 12.3 107 1-131 724-865 (1289)
3 KOG1219|consensus 99.1 1.6E-10 3.5E-15 114.6 6.6 108 32-180 3865-3974(4289)
4 KOG1219|consensus 98.9 1.2E-09 2.7E-14 108.6 6.0 94 12-131 3865-3978(4289)
5 KOG4289|consensus 98.9 2.1E-09 4.6E-14 103.5 5.3 127 10-176 1178-1308(2531)
6 PF07645 EGF_CA: Calcium-bindi 98.7 2.3E-08 5E-13 59.7 3.6 40 89-128 1-41 (42)
7 PF07645 EGF_CA: Calcium-bindi 98.6 6E-08 1.3E-12 57.9 3.3 34 30-63 1-36 (42)
8 KOG1217|consensus 98.5 1.8E-06 4E-11 77.9 12.2 192 32-249 170-378 (487)
9 KOG1217|consensus 98.4 4.7E-06 1E-10 75.3 11.6 170 43-242 140-330 (487)
10 PF14670 FXa_inhibition: Coagu 98.2 9E-07 2E-11 50.6 2.5 32 97-129 5-36 (36)
11 KOG4260|consensus 98.1 1.5E-06 3.3E-11 71.0 3.0 67 29-121 234-304 (350)
12 PF12662 cEGF: Complement Clr- 98.1 2.9E-06 6.2E-11 43.7 2.1 24 111-134 1-24 (24)
13 KOG4289|consensus 98.0 5.7E-06 1.2E-10 80.7 3.6 91 106-219 1216-1308(2531)
14 PF12662 cEGF: Complement Clr- 97.9 7.2E-06 1.6E-10 42.2 2.2 24 51-92 1-24 (24)
15 PF14670 FXa_inhibition: Coagu 97.8 1.5E-05 3.3E-10 45.5 2.7 33 34-66 1-33 (36)
16 smart00179 EGF_CA Calcium-bind 97.8 2.7E-05 5.9E-10 45.1 3.4 35 30-64 1-37 (39)
17 KOG4260|consensus 97.7 1.9E-05 4.2E-10 64.7 2.0 73 85-178 231-304 (350)
18 PF12947 EGF_3: EGF domain; I 97.5 4.7E-05 1E-09 43.5 1.5 31 34-64 1-33 (36)
19 PF00008 EGF: EGF-like domain 97.5 6.2E-05 1.3E-09 41.9 1.9 30 34-63 1-31 (32)
20 cd00054 EGF_CA Calcium-binding 97.4 0.00018 3.8E-09 41.2 3.4 36 30-65 1-37 (38)
21 smart00179 EGF_CA Calcium-bind 97.4 0.00021 4.6E-09 41.2 3.5 37 89-129 1-38 (39)
22 KOG1225|consensus 97.3 0.0011 2.3E-08 60.5 8.5 121 53-224 235-365 (525)
23 PF12947 EGF_3: EGF domain; I 97.2 0.00021 4.5E-09 40.8 2.0 31 97-129 5-36 (36)
24 PF00683 TB: TB domain; Inter 96.9 0.00017 3.7E-09 42.7 -0.7 30 183-212 11-40 (42)
25 KOG1225|consensus 96.8 0.0091 2E-07 54.5 9.4 74 113-224 266-339 (525)
26 cd00054 EGF_CA Calcium-binding 96.8 0.0019 4.2E-08 36.6 3.4 35 90-129 2-37 (38)
27 PF00008 EGF: EGF-like domain 96.6 0.0018 3.9E-08 35.9 2.3 24 98-121 4-29 (32)
28 cd00053 EGF Epidermal growth f 96.3 0.0049 1.1E-07 34.3 3.1 21 43-63 12-32 (36)
29 cd01475 vWA_Matrilin VWA_Matri 96.3 0.0038 8.2E-08 51.2 3.5 44 83-127 180-223 (224)
30 smart00181 EGF Epidermal growt 96.3 0.0052 1.1E-07 34.5 3.1 19 44-62 12-30 (35)
31 smart00181 EGF Epidermal growt 96.2 0.007 1.5E-07 33.9 3.2 24 98-121 6-29 (35)
32 PF06247 Plasmod_Pvs28: Plasmo 96.2 0.0014 3.1E-08 51.2 0.3 135 43-225 11-164 (197)
33 cd00053 EGF Epidermal growth f 96.1 0.0089 1.9E-07 33.2 3.2 24 98-121 6-30 (36)
34 cd01475 vWA_Matrilin VWA_Matri 95.8 0.0073 1.6E-07 49.5 3.0 37 29-65 185-221 (224)
35 PF12661 hEGF: Human growth fa 94.3 0.028 6.1E-07 24.4 1.2 12 53-64 1-12 (13)
36 KOG0994|consensus 92.8 1.5 3.2E-05 43.8 11.0 60 162-224 878-946 (1758)
37 KOG0994|consensus 92.6 0.34 7.4E-06 47.9 6.5 32 90-123 865-897 (1758)
38 KOG1226|consensus 90.3 3.5 7.6E-05 39.4 10.4 23 43-66 467-492 (783)
39 PF07974 EGF_2: EGF-like domai 89.9 0.52 1.1E-05 26.0 3.0 25 99-129 7-32 (32)
40 PF06247 Plasmod_Pvs28: Plasmo 88.3 0.39 8.4E-06 37.8 2.4 99 43-180 56-162 (197)
41 PF12946 EGF_MSP1_1: MSP1 EGF 81.0 1.1 2.4E-05 25.5 1.4 25 40-64 8-33 (37)
42 PHA03099 epidermal growth fact 76.8 2.3 5E-05 31.3 2.4 39 89-131 41-82 (139)
43 KOG1226|consensus 75.5 7.1 0.00015 37.4 5.8 15 113-131 479-493 (783)
44 KOG1836|consensus 74.3 10 0.00022 40.2 7.0 14 54-67 697-710 (1705)
45 smart00051 DSL delta serrate l 70.3 9.1 0.0002 24.6 3.8 16 51-66 16-31 (63)
46 KOG1836|consensus 65.9 19 0.00042 38.3 6.9 71 32-131 738-813 (1705)
47 PHA02887 EGF-like protein; Pro 57.5 9.9 0.00021 27.6 2.3 38 90-131 83-123 (126)
48 PHA03099 epidermal growth fact 56.6 9.4 0.0002 28.2 2.1 24 43-66 56-81 (139)
49 PF09064 Tme5_EGF_like: Thromb 56.3 13 0.00028 20.7 2.2 24 40-64 7-30 (34)
50 PF00954 S_locus_glycop: S-loc 56.2 12 0.00026 26.7 2.7 30 90-121 77-107 (110)
51 KOG3512|consensus 43.0 90 0.002 28.6 6.4 25 43-67 285-310 (592)
52 KOG1215|consensus 42.8 43 0.00094 33.3 5.1 66 40-126 334-400 (877)
53 cd00055 EGF_Lam Laminin-type e 37.1 93 0.002 18.6 4.5 16 187-203 20-35 (50)
54 KOG0196|consensus 33.4 51 0.0011 32.4 3.6 51 168-221 258-317 (996)
55 KOG3516|consensus 32.5 33 0.00071 34.9 2.3 39 86-130 541-581 (1306)
56 smart00180 EGF_Lam Laminin-typ 27.4 1.4E+02 0.003 17.5 3.7 16 187-203 19-34 (46)
57 PF12955 DUF3844: Domain of un 27.2 40 0.00087 24.0 1.4 31 91-121 6-42 (103)
58 PF01826 TIL: Trypsin Inhibito 24.0 40 0.00086 20.6 0.9 18 113-131 34-51 (55)
59 KOG3516|consensus 21.7 77 0.0017 32.4 2.7 37 31-67 545-582 (1306)
No 1
>KOG1214|consensus
Probab=99.31 E-value=6.5e-12 Score=115.15 Aligned_cols=158 Identities=27% Similarity=0.556 Sum_probs=114.8
Q ss_pred CCccCCCCCCCC--CCceecCC-CeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeee
Q psy11797 31 VDECRTPANTCK--FSCKNLIG-SYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCV 106 (249)
Q Consensus 31 id~C~~~~~~c~--~~C~n~~g-sy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~ 106 (249)
++.|..+.+.|. +.|....+ .|+|.|..||.|++ ..|.|+++|+.....|. +.+|+
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdg--------------------r~c~d~~eca~~~~~CGp~s~Ci 751 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDG--------------------RNCVDENECATGFHRCGPNSVCI 751 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCC--------------------CCCCChhhhccCCCCCCCCceee
Confidence 567777777776 77876654 59999999999876 67899999998788899 88999
Q ss_pred eCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccC--CCCCcc--cccccc-CCCCcccCCCCCCccC
Q psy11797 107 NLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNN--NNNQRL--GFCYRS-LTNGRCVLPTGPALLM 181 (249)
Q Consensus 107 ~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~C~~~-~~~~~C~c~~g~~~~~ 181 (249)
+.+++|+|.|..||...+++.+|..+..-+. .+.|.. ..|.-. ..|... .+.|.|.|.+||.++.
T Consensus 752 n~pg~~rceC~~gy~F~dd~~tCV~i~~pap----------~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG 821 (1289)
T KOG1214|consen 752 NLPGSYRCECRSGYEFADDRHTCVLITPPAP----------ANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG 821 (1289)
T ss_pred cCCCceeEEEeecceeccCCcceEEecCCCC----------CCccccCccccCcCCceEEEecCCceEEEeecCCccCCc
Confidence 9999999999999999999999987432111 111211 223333 344444 3459999999998875
Q ss_pred --CCCcceeecCCCCccCCCCccCCCCCCchhhccCCCCCccCCC
Q psy11797 182 --EVTRMDCCCTMGMAWGPQCQLCPTRGSQEYTDLCLESGLTVDG 224 (249)
Q Consensus 182 --~~~~~~C~C~~g~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~ 224 (249)
+.+.++|.=+..+- .++| ....++|.|.| .+||.||+
T Consensus 822 ~~c~dvDeC~psrChp-~A~C----yntpgsfsC~C-~pGy~GDG 860 (1289)
T KOG1214|consen 822 HQCTDVDECSPSRCHP-AATC----YNTPGSFSCRC-QPGYYGDG 860 (1289)
T ss_pred cccccccccCccccCC-CceE----ecCCCcceeec-ccCccCCC
Confidence 45667775222222 4455 45557799999 89999988
No 2
>KOG1214|consensus
Probab=99.28 E-value=4.5e-11 Score=109.79 Aligned_cols=107 Identities=34% Similarity=0.768 Sum_probs=88.0
Q ss_pred CCCCCCCCccCcccCCCCCCCC--cceeeC---------------------------CCCCccCCCCCCCC----CCcee
Q psy11797 1 MSQVTFICSDVDECRTPANTCK--FSCKNL---------------------------IDVDECRTPANTCK----FSCKN 47 (249)
Q Consensus 1 ~~~~g~~C~di~eC~~~~~~c~--~~C~~~---------------------------~did~C~~~~~~c~----~~C~n 47 (249)
+.+++++|.|++||+..+..|+ .+|++. ..++.|..+.+.|. +.|+.
T Consensus 724 ~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~ 803 (1289)
T KOG1214|consen 724 YQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVH 803 (1289)
T ss_pred cCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEe
Confidence 3578999999999998877665 588887 23688888877775 45665
Q ss_pred cC-CCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCC
Q psy11797 48 LI-GSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLD 125 (249)
Q Consensus 48 ~~-gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~ 125 (249)
.. ++|.|.|.+||.|++ +.|.++|+|+ ++.|. ++.|.+++++|.|.|.+||. .|
T Consensus 804 hGgs~y~C~CLPGfsGDG--------------------~~c~dvDeC~--psrChp~A~CyntpgsfsC~C~pGy~--GD 859 (1289)
T KOG1214|consen 804 HGGSTYSCACLPGFSGDG--------------------HQCTDVDECS--PSRCHPAATCYNTPGSFSCRCQPGYY--GD 859 (1289)
T ss_pred cCCceEEEeecCCccCCc--------------------cccccccccC--ccccCCCceEecCCCcceeecccCcc--CC
Confidence 54 459999999999976 6688999999 58898 77999999999999999998 56
Q ss_pred CCCccc
Q psy11797 126 GKQCLG 131 (249)
Q Consensus 126 g~~C~~ 131 (249)
|..|.+
T Consensus 860 Gf~CVP 865 (1289)
T KOG1214|consen 860 GFQCVP 865 (1289)
T ss_pred CceecC
Confidence 778876
No 3
>KOG1219|consensus
Probab=99.08 E-value=1.6e-10 Score=114.60 Aligned_cols=108 Identities=23% Similarity=0.557 Sum_probs=94.6
Q ss_pred CccCCCCCCCCCCceecC-CCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCC
Q psy11797 32 DECRTPANTCKFSCKNLI-GSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLE 109 (249)
Q Consensus 32 d~C~~~~~~c~~~C~n~~-gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~ 109 (249)
+.|..+|++.+++|...+ |+|.|.|++.|.|..|+ .++..|.. +||. .|.|+...
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE---------------------i~~epC~s--nPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE---------------------IDLEPCAS--NPCLTGGTCIPFY 3921 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccc---------------------cccccccC--CCCCCCCEEEecC
Confidence 788888888889998776 67999999999998887 68899995 8999 55999999
Q ss_pred CcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCcc
Q psy11797 110 GSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALL 180 (249)
Q Consensus 110 g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~ 180 (249)
++|.|.|+.||+ |++|+.. +.+.|..++|..++.|.+..++|.|.|..|+.+.
T Consensus 3922 n~f~CnC~~gyT----G~~Ce~~--------------Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3922 NGFLCNCPNGYT----GKRCEAR--------------GISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred CCeeEeCCCCcc----Cceeecc--------------cccccccccccCCceeeccCCceEeccChhHhcc
Confidence 999999999999 9999872 2455778889999999999999999999998765
No 4
>KOG1219|consensus
Probab=98.93 E-value=1.2e-09 Score=108.60 Aligned_cols=94 Identities=27% Similarity=0.753 Sum_probs=83.3
Q ss_pred cccCCCCCCCCcceeeC-------------------CCCCccCCCCCCCCCCceecCCCeeeecCCCccccCCCcccccc
Q psy11797 12 DECRTPANTCKFSCKNL-------------------IDVDECRTPANTCKFSCKNLIGSYMCTCPPGYQQVTHSTVAIAT 72 (249)
Q Consensus 12 ~eC~~~~~~c~~~C~~~-------------------~did~C~~~~~~c~~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~ 72 (249)
+.|..+||.++++|... +++.+|.++|+.-+++|....++|.|.|+.||+|..|+.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~----- 3939 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA----- 3939 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec-----
Confidence 88999999999999776 688999999999999999999999999999999988872
Q ss_pred cCCccccCCCCCCceecCcccccCCCCCCC-CeeeeCCCcceeecCCCceeCCCCCCccc
Q psy11797 73 TDTRTAESGGKSHECVDVNECELNLDSCAN-GRCVNLEGSYRCECERGFKLSLDGKQCLG 131 (249)
Q Consensus 73 ~~~~~~~~~~~~~~C~~i~~C~~~~~~C~~-g~C~~~~g~~~C~C~~G~~~~~~g~~C~~ 131 (249)
..|++|+ .++|.+ |.|+|.+|+|.|.|.+||. |++|.+
T Consensus 3940 ---------------~Gi~eCs--~n~C~~gg~C~n~~gsf~CncT~g~~----gr~c~~ 3978 (4289)
T KOG1219|consen 3940 ---------------RGISECS--KNVCGTGGQCINIPGSFHCNCTPGIL----GRTCCA 3978 (4289)
T ss_pred ---------------ccccccc--cccccCCceeeccCCceEeccChhHh----cccCcc
Confidence 2489998 489995 5999999999999999998 777743
No 5
>KOG4289|consensus
Probab=98.87 E-value=2.1e-09 Score=103.53 Aligned_cols=127 Identities=24% Similarity=0.525 Sum_probs=91.3
Q ss_pred cCcccCCCCCCCCcceeeCCCCCccCCCCCCCC--CCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCce
Q psy11797 10 DVDECRTPANTCKFSCKNLIDVDECRTPANTCK--FSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHEC 87 (249)
Q Consensus 10 di~eC~~~~~~c~~~C~~~~did~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C 87 (249)
|-|-|...||.+-+.|+.....|.=+.....-. ..=++..+++.|+||+||+|+.|+
T Consensus 1178 dDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~Ce--------------------- 1236 (2531)
T KOG4289|consen 1178 DDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCE--------------------- 1236 (2531)
T ss_pred cCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCccccc---------------------
Confidence 346788888877788877632221111000000 122356788999999999998887
Q ss_pred ecCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCcccccccc-
Q psy11797 88 VDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRS- 165 (249)
Q Consensus 88 ~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~- 165 (249)
+.||+|-. ++|. +|.|...+|+|+|.|.+||+ |.+|+.... .-.|-..-|++.++|.+.
T Consensus 1237 TeiDlCYs--~pC~nng~C~srEggYtCeCrpg~t----GehCEvs~~-------------agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1237 TEIDLCYS--GPCGNNGRCRSREGGYTCECRPGFT----GEHCEVSAR-------------AGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred chhHhhhc--CCCCCCCceEEecCceeEEecCCcc----ccceeeecc-------------cCccccceecCCCEEeecC
Confidence 68999985 8999 78999999999999999999 999976211 112333447899999987
Q ss_pred CCCCcccCCCC
Q psy11797 166 LTNGRCVLPTG 176 (249)
Q Consensus 166 ~~~~~C~c~~g 176 (249)
.+++.|+|++|
T Consensus 1298 nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1298 NGGFCCHCPYG 1308 (2531)
T ss_pred CCceeccCCCc
Confidence 45688999988
No 6
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.68 E-value=2.3e-08 Score=59.68 Aligned_cols=40 Identities=48% Similarity=1.066 Sum_probs=34.3
Q ss_pred cCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCC
Q psy11797 89 DVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQ 128 (249)
Q Consensus 89 ~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~ 128 (249)
|||||+...+.|. ++.|+|+.|+|+|.|++||.+...+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~ 41 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTT 41 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCc
Confidence 6899998778898 679999999999999999996655544
No 7
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.57 E-value=6e-08 Score=57.87 Aligned_cols=34 Identities=47% Similarity=1.090 Sum_probs=29.8
Q ss_pred CCCccCCCCCCCC--CCceecCCCeeeecCCCcccc
Q psy11797 30 DVDECRTPANTCK--FSCKNLIGSYMCTCPPGYQQV 63 (249)
Q Consensus 30 did~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~ 63 (249)
|||||+..++.|. +.|+|+.|+|+|.|++||+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 6888888777776 899999999999999999843
No 8
>KOG1217|consensus
Probab=98.48 E-value=1.8e-06 Score=77.94 Aligned_cols=192 Identities=28% Similarity=0.498 Sum_probs=115.5
Q ss_pred CccCCCCCCCC--CCceecCCCeeeecCCCccccCCCccc-cccc---CCccccCCCCCCce-ecCcccccCCCCCCCCe
Q psy11797 32 DECRTPANTCK--FSCKNLIGSYMCTCPPGYQQVTHSTVA-IATT---DTRTAESGGKSHEC-VDVNECELNLDSCANGR 104 (249)
Q Consensus 32 d~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~~~~~~-~~~~---~~~~~~~~~~~~~C-~~i~~C~~~~~~C~~g~ 104 (249)
++|......|. +.|.+..++|.|.|++||.+..++... ...+ ....+..++.+..| .++.++.. . . +.
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~--~--~-~~ 244 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECAS--G--D-GT 244 (487)
T ss_pred cccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccC--C--C-Cc
Confidence 67775554444 789999999999999999998776320 0000 11222233333333 12333332 1 2 68
Q ss_pred eeeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccCC-C
Q psy11797 105 CVNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLME-V 183 (249)
Q Consensus 105 C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~~-~ 183 (249)
|+++.++|.|.|++||.+... ..+.++++|.... .+.++++|.+..+.|.|.|..+|.+... .
T Consensus 245 c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~---------------~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~ 308 (487)
T KOG1217|consen 245 CVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALIA---------------SCPNGGTCVNVPGSYRCTCPPGFTGRLCTE 308 (487)
T ss_pred ccccCCceeeeCCCCcccccc-ceeeeccccCCCC---------------ccCCCCeeecCCCcceeeCCCCCCCCCCcc
Confidence 999999999999999984321 2344444433211 1567899999988899999999988753 1
Q ss_pred CcceeecCCC-----CccCCCCccCCCCCCchhhccCCCCCccCCC-CCC-cccccCCCCC-CCCcccc-cccCC
Q psy11797 184 TRMDCCCTMG-----MAWGPQCQLCPTRGSQEYTDLCLESGLTVDG-RDI-DECVTIPAVE-SSKLAKM-FLRAY 249 (249)
Q Consensus 184 ~~~~C~C~~g-----~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~-~di-deC~~~~~~C-~ng~c~~-~~~~y 249 (249)
......|... ..++.+|. .......|.|.| ..||.+.. ++. ++|...+ + ..+.|++ ..++|
T Consensus 309 ~~~~~~C~~~~~~~~c~~g~~C~--~~~~~~~~~C~c-~~~~~g~~C~~~~~~C~~~~--~~~~~~c~~~~~~~~ 378 (487)
T KOG1217|consen 309 CVDVDECSPRNAGGPCANGGTCN--TLGSFGGFRCAC-GPGFTGRRCEDSNDECASSP--CCPGGTCVNETPGSY 378 (487)
T ss_pred ccccccccccccCCcCCCCcccc--cCCCCCCCCcCC-CCCCCCCccccCCccccCCc--cccCCEeccCCCCCe
Confidence 1111223221 12233551 133445678999 67877755 455 5998877 5 5577887 44443
No 9
>KOG1217|consensus
Probab=98.35 E-value=4.7e-06 Score=75.26 Aligned_cols=170 Identities=26% Similarity=0.476 Sum_probs=104.7
Q ss_pred CCceec---CCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCCCcceeecCC
Q psy11797 43 FSCKNL---IGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLEGSYRCECER 118 (249)
Q Consensus 43 ~~C~n~---~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~ 118 (249)
+.|.+. ...|.|.|..||.+..+. ...++|......|. .+.|.+..++|.|.|++
T Consensus 140 ~~c~~~~~~~~~~~c~C~~g~~~~~~~---------------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~ 198 (487)
T KOG1217|consen 140 GSCSNGPGSVGPFRCSCTEGYEGEPCE---------------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPP 198 (487)
T ss_pred hhhcCCCCCCCceeeeeCCCccccccc---------------------ccccccccCCCCcCCCcccccCCCCeeEeCCC
Confidence 566654 357999999999987665 23367875556788 45899999999999999
Q ss_pred CceeCCCCCCcccC---CCccceeeeecCCc-ccccccC--CCCCcc-ccccccCCCCcccCCCCCCccC---CCCccee
Q psy11797 119 GFKLSLDGKQCLGK---GQFVEFRIILSMPK-AENSVNN--NNNQRL-GFCYRSLTNGRCVLPTGPALLM---EVTRMDC 188 (249)
Q Consensus 119 G~~~~~~g~~C~~~---~~~~~~~~~~~~~~-~~~~~~~--~~~~~~-~~C~~~~~~~~C~c~~g~~~~~---~~~~~~C 188 (249)
+|. +..++.. ..+.........+. ....+.. ..+... +.|.+..+++.|.+..+|++.. .....+|
T Consensus 199 ~~~----~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C 274 (487)
T KOG1217|consen 199 GYT----GSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSC 274 (487)
T ss_pred Ccc----CCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeecccc
Confidence 998 4444331 11111000000000 0001111 112222 8899999999999999998773 1245556
Q ss_pred ecCCCCccCCCCccCCCCCCchhhccCCCCCccCCCC----CCccccc--CCCCCCCC-cc
Q psy11797 189 CCTMGMAWGPQCQLCPTRGSQEYTDLCLESGLTVDGR----DIDECVT--IPAVESSK-LA 242 (249)
Q Consensus 189 ~C~~g~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~~----dideC~~--~~~~C~ng-~c 242 (249)
.-.....++.+| +...+.|.|.| .+||++... ++++|.. ....|.++ .|
T Consensus 275 ~~~~~c~~~~~C----~~~~~~~~C~C-~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C 330 (487)
T KOG1217|consen 275 ALIASCPNGGTC----VNVPGSYRCTC-PPGFTGRLCTECVDVDECSPRNAGGPCANGGTC 330 (487)
T ss_pred CCCCccCCCCee----ecCCCcceeeC-CCCCCCCCCccccccccccccccCCcCCCCccc
Confidence 533212346677 45555599999 699999653 5578864 34458665 66
No 10
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.23 E-value=9e-07 Score=50.55 Aligned_cols=32 Identities=50% Similarity=1.161 Sum_probs=26.8
Q ss_pred CCCCCCCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797 97 LDSCANGRCVNLEGSYRCECERGFKLSLDGKQC 129 (249)
Q Consensus 97 ~~~C~~g~C~~~~g~~~C~C~~G~~~~~~g~~C 129 (249)
.+.|.| .|++++++|+|.|++||.|..|+++|
T Consensus 5 NGgC~h-~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 5 NGGCSH-ICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp GGGSSS-EEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCcCC-CCccCCCceEeECCCCCEECcCCCCC
Confidence 466777 89999999999999999999999876
No 11
>KOG4260|consensus
Probab=98.14 E-value=1.5e-06 Score=71.03 Aligned_cols=67 Identities=42% Similarity=0.955 Sum_probs=54.7
Q ss_pred CCCCccCCCCCCCC--CCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC--CCe
Q psy11797 29 IDVDECRTPANTCK--FSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA--NGR 104 (249)
Q Consensus 29 ~did~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~--~g~ 104 (249)
+|||||...+..|. +.|+|+.|||.|...+||.+ ++|+|+.-...|. +..
T Consensus 234 vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~--------------------------g~d~C~~~~d~~~~kn~~ 287 (350)
T KOG4260|consen 234 VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK--------------------------GVDECQFCADVCASKNRP 287 (350)
T ss_pred ccHHHHhcCCCCCChhheeecCCCceEecccccccC--------------------------ChHHhhhhhhhcccCCCC
Confidence 79999999888887 79999999999999999975 3455553223444 568
Q ss_pred eeeCCCcceeecCCCce
Q psy11797 105 CVNLEGSYRCECERGFK 121 (249)
Q Consensus 105 C~~~~g~~~C~C~~G~~ 121 (249)
|.|+.+.|+|.|..|+.
T Consensus 288 c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 288 CMNIDGQYRCVCFSGLI 304 (350)
T ss_pred cccCCccEEEEecccce
Confidence 99999999999999876
No 12
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=98.06 E-value=2.9e-06 Score=43.67 Aligned_cols=24 Identities=42% Similarity=0.991 Sum_probs=22.1
Q ss_pred cceeecCCCceeCCCCCCcccCCC
Q psy11797 111 SYRCECERGFKLSLDGKQCLGKGQ 134 (249)
Q Consensus 111 ~~~C~C~~G~~~~~~g~~C~~~~~ 134 (249)
+|+|.|++||.+..++++|++|+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 689999999999999999999875
No 13
>KOG4289|consensus
Probab=97.96 E-value=5.7e-06 Score=80.74 Aligned_cols=91 Identities=21% Similarity=0.327 Sum_probs=69.1
Q ss_pred eeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccC-CCC
Q psy11797 106 VNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLM-EVT 184 (249)
Q Consensus 106 ~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~-~~~ 184 (249)
++..++++|.||+||+ +..|+. ..+.|...+|.+++.|....++|+|.|.++|++.. +++
T Consensus 1216 i~pvnglrCrCPpGFT----gd~CeT---------------eiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs 1276 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFT----GDYCET---------------EIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVS 1276 (2531)
T ss_pred ccccCceeEeCCCCCC----cccccc---------------hhHhhhcCCCCCCCceEEecCceeEEecCCccccceeee
Confidence 4556789999999999 778865 44567888999999999999999999999999876 333
Q ss_pred cceeecCCCC-ccCCCCccCCCCCCchhhccCCCCC
Q psy11797 185 RMDCCCTMGM-AWGPQCQLCPTRGSQEYTDLCLESG 219 (249)
Q Consensus 185 ~~~C~C~~g~-~~g~~C~~C~~~~~~~~~c~Cp~~G 219 (249)
...=.|.+|+ .+|.+|+ ....++|.|.|| .|
T Consensus 1277 ~~agrCvpGvC~nggtC~---~~~nggf~c~Cp-~g 1308 (2531)
T KOG4289|consen 1277 ARAGRCVPGVCKNGGTCV---NLLNGGFCCHCP-YG 1308 (2531)
T ss_pred cccCccccceecCCCEEe---ecCCCceeccCC-Cc
Confidence 2222344555 3477774 667788999995 44
No 14
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.95 E-value=7.2e-06 Score=42.17 Aligned_cols=24 Identities=50% Similarity=1.227 Sum_probs=21.0
Q ss_pred CeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcc
Q psy11797 51 SYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNE 92 (249)
Q Consensus 51 sy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~ 92 (249)
||+|+|++||+..... +.|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~------------------~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDG------------------RSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCC------------------CccccCCC
Confidence 6999999999987766 78999986
No 15
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.85 E-value=1.5e-05 Score=45.47 Aligned_cols=33 Identities=39% Similarity=0.994 Sum_probs=25.9
Q ss_pred cCCCCCCCCCCceecCCCeeeecCCCccccCCC
Q psy11797 34 CRTPANTCKFSCKNLIGSYMCTCPPGYQQVTHS 66 (249)
Q Consensus 34 C~~~~~~c~~~C~n~~gsy~C~C~~G~~g~~~~ 66 (249)
|......|..+|++++++|+|.|++||++....
T Consensus 1 C~~~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~ 33 (36)
T PF14670_consen 1 CSVNNGGCSHICVNTPGSYRCSCPPGYKLAEDG 33 (36)
T ss_dssp CTTGGGGSSSEEEEETTSEEEE-STTEEE-TTS
T ss_pred CCCCCCCcCCCCccCCCceEeECCCCCEECcCC
Confidence 344556788999999999999999999987755
No 16
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.80 E-value=2.7e-05 Score=45.12 Aligned_cols=35 Identities=43% Similarity=0.950 Sum_probs=26.4
Q ss_pred CCCccCC-CCCCCCCCceecCCCeeeecCCCcc-ccC
Q psy11797 30 DVDECRT-PANTCKFSCKNLIGSYMCTCPPGYQ-QVT 64 (249)
Q Consensus 30 did~C~~-~~~~c~~~C~n~~gsy~C~C~~G~~-g~~ 64 (249)
++|+|.. .++...++|+++.++|.|.|++||. |..
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~ 37 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN 37 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCc
Confidence 4677776 3444446899999999999999998 543
No 17
>KOG4260|consensus
Probab=97.69 E-value=1.9e-05 Score=64.74 Aligned_cols=73 Identities=33% Similarity=0.585 Sum_probs=54.4
Q ss_pred CceecCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCcccccc
Q psy11797 85 HECVDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCY 163 (249)
Q Consensus 85 ~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~ 163 (249)
..|+|||||...+.+|. +..|+|+.|+|.|...+||..+ ...|+. ++ ..-...+..|.
T Consensus 231 ~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g--~d~C~~---~~----------------d~~~~kn~~c~ 289 (350)
T KOG4260|consen 231 EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG--VDECQF---CA----------------DVCASKNRPCM 289 (350)
T ss_pred cccccHHHHhcCCCCCChhheeecCCCceEecccccccCC--hHHhhh---hh----------------hhcccCCCCcc
Confidence 56899999998889998 7799999999999999999732 222221 00 00013467889
Q ss_pred ccCCCCcccCCCCCC
Q psy11797 164 RSLTNGRCVLPTGPA 178 (249)
Q Consensus 164 ~~~~~~~C~c~~g~~ 178 (249)
++.++|+|.+..++.
T Consensus 290 ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 290 NIDGQYRCVCFSGLI 304 (350)
T ss_pred cCCccEEEEecccce
Confidence 999999999988865
No 18
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.53 E-value=4.7e-05 Score=43.52 Aligned_cols=31 Identities=42% Similarity=0.943 Sum_probs=22.8
Q ss_pred cCCCCCCCC--CCceecCCCeeeecCCCccccC
Q psy11797 34 CRTPANTCK--FSCKNLIGSYMCTCPPGYQQVT 64 (249)
Q Consensus 34 C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~ 64 (249)
|+.+++.|. ++|+++.++|.|.|++||+|++
T Consensus 1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 344455565 8999999999999999999976
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.51 E-value=6.2e-05 Score=41.89 Aligned_cols=30 Identities=37% Similarity=0.897 Sum_probs=23.3
Q ss_pred cCCCCCCCCCCceecC-CCeeeecCCCcccc
Q psy11797 34 CRTPANTCKFSCKNLI-GSYMCTCPPGYQQV 63 (249)
Q Consensus 34 C~~~~~~c~~~C~n~~-gsy~C~C~~G~~g~ 63 (249)
|.+.++..+++|++.. ++|.|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 3444555567888887 88999999999985
No 20
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.45 E-value=0.00018 Score=41.15 Aligned_cols=36 Identities=42% Similarity=0.926 Sum_probs=26.8
Q ss_pred CCCccCC-CCCCCCCCceecCCCeeeecCCCccccCC
Q psy11797 30 DVDECRT-PANTCKFSCKNLIGSYMCTCPPGYQQVTH 65 (249)
Q Consensus 30 did~C~~-~~~~c~~~C~n~~gsy~C~C~~G~~g~~~ 65 (249)
++++|.. .++...+.|++..++|.|.|++||.|..+
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 3566765 34443578999999999999999988543
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.41 E-value=0.00021 Score=41.24 Aligned_cols=37 Identities=51% Similarity=1.296 Sum_probs=28.3
Q ss_pred cCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797 89 DVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQC 129 (249)
Q Consensus 89 ~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C 129 (249)
++++|... .+|. ++.|+++.++|.|.|++||. +++.|
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C 38 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT---DGRNC 38 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCc---cCCcC
Confidence 35677642 6787 45999999999999999997 35554
No 22
>KOG1225|consensus
Probab=97.33 E-value=0.0011 Score=60.51 Aligned_cols=121 Identities=27% Similarity=0.601 Sum_probs=67.9
Q ss_pred eeecCCCccccCCCccccc---------ccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCCCcceeecCCCcee
Q psy11797 53 MCTCPPGYQQVTHSTVAIA---------TTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKL 122 (249)
Q Consensus 53 ~C~C~~G~~g~~~~~~~~~---------~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~ 122 (249)
.|.|+.+|.+..+...... .-..++|..||++..|.. -.|. ..|. ++.+++ + .|.|++||.
T Consensus 235 ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e-~~Cp---~~cs~~g~~~~--g--~CiC~~g~~- 305 (525)
T KOG1225|consen 235 ICECPEGYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDE-LVCP---VDCSGGGVCVD--G--ECICNPGYS- 305 (525)
T ss_pred eeecCCceeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCc-ccCC---cccCCCceecC--C--EeecCCCcc-
Confidence 6777777777666522110 112245666666666643 2344 2355 445543 3 788999988
Q ss_pred CCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccCCCCcceeecCCCCccCCCCcc
Q psy11797 123 SLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLMEVTRMDCCCTMGMAWGPQCQL 202 (249)
Q Consensus 123 ~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~~~~~~~C~C~~g~~~g~~C~~ 202 (249)
|+.|+... | ...|...+.|. ...|.|.+||++....... |. .+..|+
T Consensus 306 ---G~dCs~~~-----------------c-padC~g~G~Ci----~G~C~C~~Gy~G~~C~~~~-C~------~~g~cv- 352 (525)
T KOG1225|consen 306 ---GKDCSIRR-----------------C-PADCSGHGKCI----DGECLCDEGYTGELCIQRA-CS------GGGQCV- 352 (525)
T ss_pred ---cccccccc-----------------C-CccCCCCCccc----CCceEeCCCCcCCcccccc-cC------CCceec-
Confidence 77776511 1 12356777777 3458888888877533331 21 233441
Q ss_pred CCCCCCchhhccCCCCCccCCC
Q psy11797 203 CPTRGSQEYTDLCLESGLTVDG 224 (249)
Q Consensus 203 C~~~~~~~~~c~Cp~~G~~~~~ 224 (249)
. - |.| ..||+|..
T Consensus 353 ------~-g-C~C-~~Gw~G~d 365 (525)
T KOG1225|consen 353 ------N-G-CKC-KKGWRGPD 365 (525)
T ss_pred ------c-C-cee-ccCccCCC
Confidence 1 2 777 78888753
No 23
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.25 E-value=0.00021 Score=40.85 Aligned_cols=31 Identities=42% Similarity=0.972 Sum_probs=22.2
Q ss_pred CCCCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797 97 LDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQC 129 (249)
Q Consensus 97 ~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C 129 (249)
.+.|. ++.|++++++|.|.|++||. .+|..|
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~--GdG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYE--GDGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEE--CCSTCE
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCc--cCCcCC
Confidence 45677 78999999999999999998 335443
No 24
>PF00683 TB: TB domain; InterPro: IPR002212 The transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs), which are localized to fibrillar structures in the extracellular matrix.; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=96.87 E-value=0.00017 Score=42.66 Aligned_cols=30 Identities=47% Similarity=1.367 Sum_probs=23.2
Q ss_pred CCcceeecCCCCccCCCCccCCCCCCchhh
Q psy11797 183 VTRMDCCCTMGMAWGPQCQLCPTRGSQEYT 212 (249)
Q Consensus 183 ~~~~~C~C~~g~~~g~~C~~C~~~~~~~~~ 212 (249)
+++.+|.|+.|.+||..|++||.+++..|.
T Consensus 11 ~tk~~CCCs~G~aWG~~Ce~CP~~~t~ef~ 40 (42)
T PF00683_consen 11 VTKSECCCSVGRAWGSPCEPCPPPGTDEFN 40 (42)
T ss_dssp EEHHHHHTTT-SEETTTTEE---TTSHHHH
T ss_pred eeccccCCCCCCcCCCccccCCCCCChHHh
Confidence 577899999999999999999999998775
No 25
>KOG1225|consensus
Probab=96.80 E-value=0.0091 Score=54.54 Aligned_cols=74 Identities=23% Similarity=0.402 Sum_probs=42.3
Q ss_pred eeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccCCCCcceeecCC
Q psy11797 113 RCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLMEVTRMDCCCTM 192 (249)
Q Consensus 113 ~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~~~~~~~C~C~~ 192 (249)
+|.|++||+ |..|... .|... |..++.+++. .|.|.++|.+.... ..+| ..
T Consensus 266 ~CIC~~Gf~----G~dC~e~-----------------~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs-~~~c--pa 316 (525)
T KOG1225|consen 266 RCICPPGFT----GDDCDEL-----------------VCPVD-CSGGGVCVDG----ECICNPGYSGKDCS-IRRC--PA 316 (525)
T ss_pred eEeCCCCCc----CCCCCcc-----------------cCCcc-cCCCceecCC----EeecCCCccccccc-cccC--Cc
Confidence 689999998 8777651 12222 3444555432 58888888766421 1112 22
Q ss_pred CCccCCCCccCCCCCCchhhccCCCCCccCCC
Q psy11797 193 GMAWGPQCQLCPTRGSQEYTDLCLESGLTVDG 224 (249)
Q Consensus 193 g~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~ 224 (249)
.....+.| ++ -+|.| .+||+|+-
T Consensus 317 dC~g~G~C----i~----G~C~C-~~Gy~G~~ 339 (525)
T KOG1225|consen 317 DCSGHGKC----ID----GECLC-DEGYTGEL 339 (525)
T ss_pred cCCCCCcc----cC----CceEe-CCCCcCCc
Confidence 22123445 33 45999 79999964
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.79 E-value=0.0019 Score=36.65 Aligned_cols=35 Identities=46% Similarity=1.249 Sum_probs=26.8
Q ss_pred CcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797 90 VNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQC 129 (249)
Q Consensus 90 i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C 129 (249)
+++|... .+|. ++.|++..+.|.|.|+.||. ++.|
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~----g~~C 37 (38)
T cd00054 2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYT----GRNC 37 (38)
T ss_pred cccCCCC-CCcCCCCEeECCCCCeEeECCCCCc----CCcC
Confidence 4566532 5677 56999999999999999998 5555
No 27
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long, found in the sequence of epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.63 E-value=0.0018 Score=35.90 Aligned_cols=24 Identities=42% Similarity=1.284 Sum_probs=21.6
Q ss_pred CCCC-CCeeeeCC-CcceeecCCCce
Q psy11797 98 DSCA-NGRCVNLE-GSYRCECERGFK 121 (249)
Q Consensus 98 ~~C~-~g~C~~~~-g~~~C~C~~G~~ 121 (249)
++|. +|+|++.. +.|.|.|++||.
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEE
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCc
Confidence 5888 46999998 999999999998
No 28
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.35 E-value=0.0049 Score=34.31 Aligned_cols=21 Identities=52% Similarity=1.103 Sum_probs=19.1
Q ss_pred CCceecCCCeeeecCCCcccc
Q psy11797 43 FSCKNLIGSYMCTCPPGYQQV 63 (249)
Q Consensus 43 ~~C~n~~gsy~C~C~~G~~g~ 63 (249)
+.|++..++|.|.|+.||.+.
T Consensus 12 ~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 12 GTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CEEecCCCCeEeECCCCCccc
Confidence 788888899999999999885
No 29
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=96.32 E-value=0.0038 Score=51.21 Aligned_cols=44 Identities=32% Similarity=0.712 Sum_probs=36.7
Q ss_pred CCCceecCcccccCCCCCCCCeeeeCCCcceeecCCCceeCCCCC
Q psy11797 83 KSHECVDVNECELNLDSCANGRCVNLEGSYRCECERGFKLSLDGK 127 (249)
Q Consensus 83 ~~~~C~~i~~C~~~~~~C~~g~C~~~~g~~~C~C~~G~~~~~~g~ 127 (249)
.+..|.++++|....+.|.+ .|.++.|+|.|.|++||.+..+++
T Consensus 180 ~~~~C~~~~~C~~~~~~c~~-~C~~~~g~~~c~c~~g~~~~~~~~ 223 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQQ-VCISTPGSYLCACTEGYALLEDNK 223 (224)
T ss_pred ccccCcCchhhcCCCCCccc-eEEcCCCCEEeECCCCccCCCCCC
Confidence 44568889999876677886 899999999999999999876654
No 30
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.32 E-value=0.0052 Score=34.46 Aligned_cols=19 Identities=58% Similarity=1.399 Sum_probs=17.8
Q ss_pred CceecCCCeeeecCCCccc
Q psy11797 44 SCKNLIGSYMCTCPPGYQQ 62 (249)
Q Consensus 44 ~C~n~~gsy~C~C~~G~~g 62 (249)
+|++..++|.|+|++||.|
T Consensus 12 ~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 12 TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred EEECCCCCeEeECCCCCcc
Confidence 7888899999999999988
No 31
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.21 E-value=0.007 Score=33.91 Aligned_cols=24 Identities=46% Similarity=1.247 Sum_probs=20.7
Q ss_pred CCCCCCeeeeCCCcceeecCCCce
Q psy11797 98 DSCANGRCVNLEGSYRCECERGFK 121 (249)
Q Consensus 98 ~~C~~g~C~~~~g~~~C~C~~G~~ 121 (249)
.+|.+++|+++.++|.|.|++||.
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~ 29 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYT 29 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCc
Confidence 467743899999999999999998
No 32
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.19 E-value=0.0014 Score=51.16 Aligned_cols=135 Identities=24% Similarity=0.558 Sum_probs=72.6
Q ss_pred CCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccC---CCCCC-CCeeeeCC-----Ccce
Q psy11797 43 FSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELN---LDSCA-NGRCVNLE-----GSYR 113 (249)
Q Consensus 43 ~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~---~~~C~-~g~C~~~~-----g~~~ 113 (249)
+..+.+...|.|.|.+||.... + .+|+...+|... ..+|. .+.|++.+ ..|.
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~~-E------------------ntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~ 71 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLKN-E------------------NTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYK 71 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEEE-T------------------TEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEE
T ss_pred CEEEEccCceEEEcCCCcEEcc-c------------------cccccceecCcccccCccccchhhhhcCCCcccceeEE
Confidence 4455567789999999998763 2 578888788752 34687 57898865 5799
Q ss_pred eecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCcccccccc---CCCCcccCCCCCCccCCCCcceeec
Q psy11797 114 CECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRS---LTNGRCVLPTGPALLMEVTRMDCCC 190 (249)
Q Consensus 114 C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~---~~~~~C~c~~g~~~~~~~~~~~C~C 190 (249)
|.|.+||.+..+ .|.+ ..|+...|. .|.|+-. .....|+|.-|++.. +...|.
T Consensus 72 C~C~~gY~~~~~--vCvp-----------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~---dn~kCt- 127 (197)
T PF06247_consen 72 CDCINGYILKQG--VCVP-----------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPD---DNKKCT- 127 (197)
T ss_dssp EEE-TTEEESSS--SEEE-----------------GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETT---TTTESE-
T ss_pred EecccCceeeCC--eEch-----------------hhcCceecC-CCeEEecCCCCCCceeEeeeceEec---cCCccc-
Confidence 999999997643 5654 122222233 4555432 223478888888722 233332
Q ss_pred CCCCccCCCCc-------cCCCCCCchhhccCCCCCccCCCC
Q psy11797 191 TMGMAWGPQCQ-------LCPTRGSQEYTDLCLESGLTVDGR 225 (249)
Q Consensus 191 ~~g~~~g~~C~-------~C~~~~~~~~~c~Cp~~G~~~~~~ 225 (249)
..| --.|. .| .....-|+|.| ..||.++++
T Consensus 128 k~G---~T~C~LKCk~nE~C-K~~~~~Y~C~~-~~~~~~~~~ 164 (197)
T PF06247_consen 128 KTG---ETKCSLKCKENEEC-KLVDGYYKCVC-KEGFPGDGE 164 (197)
T ss_dssp EEE-----------TTTEEE-EEETTEEEEEE--TT-EEETT
T ss_pred CCC---ccceeeecCCCcce-eeeCcEEEeec-CCCCCCCCC
Confidence 111 11122 12 22234588999 888877543
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.07 E-value=0.0089 Score=33.21 Aligned_cols=24 Identities=50% Similarity=1.272 Sum_probs=21.0
Q ss_pred CCCC-CCeeeeCCCcceeecCCCce
Q psy11797 98 DSCA-NGRCVNLEGSYRCECERGFK 121 (249)
Q Consensus 98 ~~C~-~g~C~~~~g~~~C~C~~G~~ 121 (249)
.+|. ++.|+++.+.|.|.|+.||.
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~ 30 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYT 30 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCc
Confidence 5676 47999999999999999997
No 34
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=95.84 E-value=0.0073 Score=49.49 Aligned_cols=37 Identities=32% Similarity=0.782 Sum_probs=32.5
Q ss_pred CCCCccCCCCCCCCCCceecCCCeeeecCCCccccCC
Q psy11797 29 IDVDECRTPANTCKFSCKNLIGSYMCTCPPGYQQVTH 65 (249)
Q Consensus 29 ~did~C~~~~~~c~~~C~n~~gsy~C~C~~G~~g~~~ 65 (249)
.++++|...++.|...|.++.|+|.|.|+.||++...
T Consensus 185 ~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~ 221 (224)
T cd01475 185 VVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLED 221 (224)
T ss_pred cCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCC
Confidence 4678999888889999999999999999999987543
No 35
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.29 E-value=0.028 Score=24.40 Aligned_cols=12 Identities=42% Similarity=1.276 Sum_probs=9.2
Q ss_pred eeecCCCccccC
Q psy11797 53 MCTCPPGYQQVT 64 (249)
Q Consensus 53 ~C~C~~G~~g~~ 64 (249)
.|.|++||+|..
T Consensus 1 ~C~C~~G~~G~~ 12 (13)
T PF12661_consen 1 TCQCPPGWTGPN 12 (13)
T ss_dssp EEEE-TTEETTT
T ss_pred CccCcCCCcCCC
Confidence 489999999864
No 36
>KOG0994|consensus
Probab=92.80 E-value=1.5 Score=43.78 Aligned_cols=60 Identities=25% Similarity=0.348 Sum_probs=31.5
Q ss_pred ccccCCCCcc-cCCCCCCccCC----CCcceeecCCCCccC----CCCccCCCCCCchhhccCCCCCccCCC
Q psy11797 162 CYRSLTNGRC-VLPTGPALLME----VTRMDCCCTMGMAWG----PQCQLCPTRGSQEYTDLCLESGLTVDG 224 (249)
Q Consensus 162 C~~~~~~~~C-~c~~g~~~~~~----~~~~~C~C~~g~~~g----~~C~~C~~~~~~~~~c~Cp~~G~~~~~ 224 (249)
|.+..+++.| .|..||.++.. ..-..|.|..|.+.| ..|.+ -+.+....|.| .+||+|..
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~--d~~t~~ivC~C-~~GY~G~R 946 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYL--DTRTQQIVCHC-QEGYSGSR 946 (1758)
T ss_pred ccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccc--cccccceeeec-ccCccccc
Confidence 3445566666 67788877642 122344455443222 22311 11223446899 89999853
No 37
>KOG0994|consensus
Probab=92.56 E-value=0.34 Score=47.91 Aligned_cols=32 Identities=19% Similarity=0.568 Sum_probs=21.5
Q ss_pred CcccccCCCCCCCCeeeeCCCccee-ecCCCceeC
Q psy11797 90 VNECELNLDSCANGRCVNLEGSYRC-ECERGFKLS 123 (249)
Q Consensus 90 i~~C~~~~~~C~~g~C~~~~g~~~C-~C~~G~~~~ 123 (249)
.+.|....+.|. .|.+...++.| .|..||+..
T Consensus 865 A~~Cd~~tGaCi--~CqD~T~G~~CdrCl~GyyGd 897 (1758)
T KOG0994|consen 865 ADTCDPITGACI--DCQDSTTGHSCDRCLDGYYGD 897 (1758)
T ss_pred ccccCccccccc--cccccccccchhhhhccccCC
Confidence 345554445555 36677788888 699999843
No 38
>KOG1226|consensus
Probab=90.28 E-value=3.5 Score=39.39 Aligned_cols=23 Identities=22% Similarity=0.613 Sum_probs=16.9
Q ss_pred CCceecCCCee---eecCCCccccCCC
Q psy11797 43 FSCKNLIGSYM---CTCPPGYQQVTHS 66 (249)
Q Consensus 43 ~~C~n~~gsy~---C~C~~G~~g~~~~ 66 (249)
+.|. ..|.|. |.|.+||.|..|+
T Consensus 467 ~~C~-g~G~~~CG~C~C~~G~~G~~CE 492 (783)
T KOG1226|consen 467 ALCH-GNGTFVCGQCRCDEGWLGKKCE 492 (783)
T ss_pred cccC-CCCcEEecceecCCCCCCCccc
Confidence 4554 456666 4899999998887
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long, found in the sequence of epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=89.90 E-value=0.52 Score=25.96 Aligned_cols=25 Identities=40% Similarity=1.152 Sum_probs=19.3
Q ss_pred CCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797 99 SCA-NGRCVNLEGSYRCECERGFKLSLDGKQC 129 (249)
Q Consensus 99 ~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C 129 (249)
.|. +|+|+.. ..+|.|.+||. |..|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~----G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYT----GPDC 32 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCc----CCCC
Confidence 466 7899876 46899999998 6554
No 40
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=88.26 E-value=0.39 Score=37.83 Aligned_cols=99 Identities=25% Similarity=0.475 Sum_probs=53.0
Q ss_pred CCceecC-----CCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCCCCeeeeCC---Cccee
Q psy11797 43 FSCKNLI-----GSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCANGRCVNLE---GSYRC 114 (249)
Q Consensus 43 ~~C~n~~-----gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~~g~C~~~~---g~~~C 114 (249)
+.|++.+ ..|.|.|.+||..... .|. .+.|.. -.|..|.|+-.+ ....|
T Consensus 56 a~C~~~~~~~~~~~~~C~C~~gY~~~~~--------------------vCv-p~~C~~--~~Cg~GKCI~d~~~~~~~~C 112 (197)
T PF06247_consen 56 AKCINQANKGEERAYKCDCINGYILKQG--------------------VCV-PNKCNN--KDCGSGKCILDPDNPNNPTC 112 (197)
T ss_dssp EEEEE-SSTTSSTSEEEEE-TTEEESSS--------------------SEE-EGGGSS-----TTEEEEEEEGGGSEEEE
T ss_pred hhhhcCCCcccceeEEEecccCceeeCC--------------------eEc-hhhcCc--eecCCCeEEecCCCCCCcee
Confidence 7787665 4699999999987653 243 234552 467778897543 34589
Q ss_pred ecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCcc
Q psy11797 115 ECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALL 180 (249)
Q Consensus 115 ~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~ 180 (249)
+|.-|+. -.+...|...++ ..|.+. |..+..|....+-|+|.+..++.+.
T Consensus 113 SC~IGkV-~~dn~kCtk~G~--------------T~C~LK-Ck~nE~CK~~~~~Y~C~~~~~~~~~ 162 (197)
T PF06247_consen 113 SCNIGKV-PDDNKKCTKTGE--------------TKCSLK-CKENEECKLVDGYYKCVCKEGFPGD 162 (197)
T ss_dssp EE-TEEE-TTTTTESEEEE-----------------------TTTEEEEEETTEEEEEE-TT-EEE
T ss_pred EeeeceE-eccCCcccCCCc--------------cceeee-cCCCcceeeeCcEEEeecCCCCCCC
Confidence 9999998 344556655221 112111 2445566655555666666665444
No 41
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.00 E-value=1.1 Score=25.52 Aligned_cols=25 Identities=32% Similarity=0.616 Sum_probs=18.4
Q ss_pred CCCCCceecC-CCeeeecCCCccccC
Q psy11797 40 TCKFSCKNLI-GSYMCTCPPGYQQVT 64 (249)
Q Consensus 40 ~c~~~C~n~~-gsy~C~C~~G~~g~~ 64 (249)
+-.+.|.+.. |++.|+|.+||+...
T Consensus 8 P~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 8 PANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred CCCcccEEcCCCCEEEEeeCCccccC
Confidence 3347888776 899999999998644
No 42
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=76.82 E-value=2.3 Score=31.34 Aligned_cols=39 Identities=23% Similarity=0.732 Sum_probs=29.3
Q ss_pred cCcccccC-CCCCCCCeeeeCC--CcceeecCCCceeCCCCCCccc
Q psy11797 89 DVNECELN-LDSCANGRCVNLE--GSYRCECERGFKLSLDGKQCLG 131 (249)
Q Consensus 89 ~i~~C~~~-~~~C~~g~C~~~~--g~~~C~C~~G~~~~~~g~~C~~ 131 (249)
++.+|... .+-|.||.|.-.+ ..+.|.|+.||. |..|+.
T Consensus 41 ~i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYt----GeRCEh 82 (139)
T PHA03099 41 AIRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYT----GIRCQH 82 (139)
T ss_pred ccccCChhhCCEeECCEEEeeccCCCceeECCCCcc----cccccc
Confidence 45566533 3567788887654 679999999999 999976
No 43
>KOG1226|consensus
Probab=75.46 E-value=7.1 Score=37.41 Aligned_cols=15 Identities=40% Similarity=1.201 Sum_probs=11.5
Q ss_pred eeecCCCceeCCCCCCccc
Q psy11797 113 RCECERGFKLSLDGKQCLG 131 (249)
Q Consensus 113 ~C~C~~G~~~~~~g~~C~~ 131 (249)
.|.|.+||. |+.|+=
T Consensus 479 ~C~C~~G~~----G~~CEC 493 (783)
T KOG1226|consen 479 QCRCDEGWL----GKKCEC 493 (783)
T ss_pred ceecCCCCC----CCcccC
Confidence 368999998 888753
No 44
>KOG1836|consensus
Probab=74.25 E-value=10 Score=40.17 Aligned_cols=14 Identities=43% Similarity=0.985 Sum_probs=12.2
Q ss_pred eecCCCccccCCCc
Q psy11797 54 CTCPPGYQQVTHST 67 (249)
Q Consensus 54 C~C~~G~~g~~~~~ 67 (249)
|.|+.||+|+.++.
T Consensus 697 c~C~~g~tG~~Ce~ 710 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCES 710 (1705)
T ss_pred ccCCCCcccchhhh
Confidence 89999999998873
No 45
>smart00051 DSL delta serrate ligand.
Probab=70.29 E-value=9.1 Score=24.61 Aligned_cols=16 Identities=19% Similarity=0.266 Sum_probs=12.5
Q ss_pred CeeeecCCCccccCCC
Q psy11797 51 SYMCTCPPGYQQVTHS 66 (249)
Q Consensus 51 sy~C~C~~G~~g~~~~ 66 (249)
.+.-.|.++|.|..|.
T Consensus 16 ~~rv~C~~~~yG~~C~ 31 (63)
T smart00051 16 QIRVTCDENYYGEGCN 31 (63)
T ss_pred EEEeeCCCCCcCCccC
Confidence 3556799999998877
No 46
>KOG1836|consensus
Probab=65.94 E-value=19 Score=38.26 Aligned_cols=71 Identities=24% Similarity=0.604 Sum_probs=39.8
Q ss_pred CccCCCCCCCCCCceecCCCeee-ecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeC-
Q psy11797 32 DECRTPANTCKFSCKNLIGSYMC-TCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNL- 108 (249)
Q Consensus 32 d~C~~~~~~c~~~C~n~~gsy~C-~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~- 108 (249)
+.|......|. |+....+-.| .|.+||+|...... .-| |. +=+|. .+.|..+
T Consensus 738 ~~Cd~~tG~C~--C~~~t~G~~C~~C~~GfYg~~~~~~--------------------~~d-C~--~C~Cp~~~~~~~~~ 792 (1705)
T KOG1836|consen 738 NICDPRTGQCK--CKHNTFGGQCAQCVDGFYGLPDLGT--------------------SGD-CQ--PCPCPNGGACGQTP 792 (1705)
T ss_pred ccccCCCCcee--cccCCCCCchhhhcCCCCCccccCC--------------------CCC-Cc--cCCCCCChhhcCcC
Confidence 34444443443 5544444566 68999988765421 111 43 13343 2244443
Q ss_pred -CCcceee-cCCCceeCCCCCCccc
Q psy11797 109 -EGSYRCE-CERGFKLSLDGKQCLG 131 (249)
Q Consensus 109 -~g~~~C~-C~~G~~~~~~g~~C~~ 131 (249)
.....|. |++||+ |..|+.
T Consensus 793 ~~~~~iCk~Cp~gyt----G~rCe~ 813 (1705)
T KOG1836|consen 793 EILEVVCKNCPPGYT----GLRCEE 813 (1705)
T ss_pred cccceecCCCCCCCc----cccccc
Confidence 3456787 999999 888876
No 47
>PHA02887 EGF-like protein; Provisional
Probab=57.51 E-value=9.9 Score=27.62 Aligned_cols=38 Identities=32% Similarity=0.832 Sum_probs=26.8
Q ss_pred CcccccC-CCCCCCCeeeeCC--CcceeecCCCceeCCCCCCccc
Q psy11797 90 VNECELN-LDSCANGRCVNLE--GSYRCECERGFKLSLDGKQCLG 131 (249)
Q Consensus 90 i~~C~~~-~~~C~~g~C~~~~--g~~~C~C~~G~~~~~~g~~C~~ 131 (249)
+.+|... .+-|.||.|.-.. ....|.|+.||. |..|+.
T Consensus 83 f~pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GYt----G~RCE~ 123 (126)
T PHA02887 83 FEKCKNDFNDFCINGECMNIIDLDEKFCICNKGYT----GIRCDE 123 (126)
T ss_pred ccccChHhhCEeeCCEEEccccCCCceeECCCCcc----cCCCCc
Confidence 3455432 3457778887644 568899999999 888875
No 48
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=56.61 E-value=9.4 Score=28.21 Aligned_cols=24 Identities=25% Similarity=0.641 Sum_probs=19.1
Q ss_pred CCceec--CCCeeeecCCCccccCCC
Q psy11797 43 FSCKNL--IGSYMCTCPPGYQQVTHS 66 (249)
Q Consensus 43 ~~C~n~--~gsy~C~C~~G~~g~~~~ 66 (249)
+.|.-. ...+.|.|..||+|..|+
T Consensus 56 G~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 56 GDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred CEEEeeccCCCceeECCCCccccccc
Confidence 567544 367999999999998876
No 49
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=56.34 E-value=13 Score=20.67 Aligned_cols=24 Identities=29% Similarity=0.676 Sum_probs=15.8
Q ss_pred CCCCCceecCCCeeeecCCCccccC
Q psy11797 40 TCKFSCKNLIGSYMCTCPPGYQQVT 64 (249)
Q Consensus 40 ~c~~~C~n~~gsy~C~C~~G~~g~~ 64 (249)
.|.+.|... ..+.|.||.||..+.
T Consensus 7 ~CpA~CDpn-~~~~C~CPeGyIlde 30 (34)
T PF09064_consen 7 ECPADCDPN-SPGQCFCPEGYILDE 30 (34)
T ss_pred cCCCccCCC-CCCceeCCCceEecC
Confidence 345666442 234899999998754
No 50
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=56.19 E-value=12 Score=26.75 Aligned_cols=30 Identities=33% Similarity=0.923 Sum_probs=22.4
Q ss_pred CcccccCCCCCC-CCeeeeCCCcceeecCCCce
Q psy11797 90 VNECELNLDSCA-NGRCVNLEGSYRCECERGFK 121 (249)
Q Consensus 90 i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~ 121 (249)
.+.|.. .+.|. +|.|.. .....|.|.+||.
T Consensus 77 ~d~Cd~-y~~CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 77 KDQCDV-YGFCGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred ccCCCC-ccccCCccEeCC-CCCCceECCCCcC
Confidence 456775 47888 789943 4566799999997
No 51
>KOG3512|consensus
Probab=42.96 E-value=90 Score=28.60 Aligned_cols=25 Identities=16% Similarity=0.211 Sum_probs=19.5
Q ss_pred CCceecCCC-eeeecCCCccccCCCc
Q psy11797 43 FSCKNLIGS-YMCTCPPGYQQVTHST 67 (249)
Q Consensus 43 ~~C~n~~gs-y~C~C~~G~~g~~~~~ 67 (249)
..|+-..++ ++|.|...-.|..|+.
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgr 310 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGR 310 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCccc
Confidence 467665555 9999999999988874
No 52
>KOG1215|consensus
Probab=42.77 E-value=43 Score=33.26 Aligned_cols=66 Identities=29% Similarity=0.697 Sum_probs=43.5
Q ss_pred CCCCCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCCCCeee-eCCCcceeecCC
Q psy11797 40 TCKFSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCANGRCV-NLEGSYRCECER 118 (249)
Q Consensus 40 ~c~~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~~g~C~-~~~g~~~C~C~~ 118 (249)
.+.+.+.+......+.|..+++..... . .+...|....+.|.+ .|+ +.++.|.|.|..
T Consensus 334 ~~~~~~~~~~v~~~~~~~~~~~~~~~~------------------~--~~~~~~~~~~g~Csq-~C~~~~p~~~~c~c~~ 392 (877)
T KOG1215|consen 334 KCSHKCPDVSVGPRCDCMGAKVLPLGA------------------R--TDSNPCESDNGGCSQ-LCVPNSPGTFKCACSP 392 (877)
T ss_pred cccCCCCccccCCcccCCccceecccc------------------c--ccCCcccccCCccce-eccCCCCCceeEecCC
Confidence 333455566666777777777664433 1 122344444567776 788 568999999999
Q ss_pred CceeCCCC
Q psy11797 119 GFKLSLDG 126 (249)
Q Consensus 119 G~~~~~~g 126 (249)
||.+..++
T Consensus 393 g~~~~~~~ 400 (877)
T KOG1215|consen 393 GYELRLDK 400 (877)
T ss_pred CcEeccCC
Confidence 99987765
No 53
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=37.10 E-value=93 Score=18.56 Aligned_cols=16 Identities=31% Similarity=0.997 Sum_probs=8.5
Q ss_pred eeecCCCCccCCCCccC
Q psy11797 187 DCCCTMGMAWGPQCQLC 203 (249)
Q Consensus 187 ~C~C~~g~~~g~~C~~C 203 (249)
+|.|+.++. |..|+.|
T Consensus 20 ~C~C~~~~~-G~~C~~C 35 (50)
T cd00055 20 QCECKPNTT-GRRCDRC 35 (50)
T ss_pred EEeCCCcCC-CCCCCCC
Confidence 455555544 5556443
No 54
>KOG0196|consensus
Probab=33.36 E-value=51 Score=32.37 Aligned_cols=51 Identities=25% Similarity=0.484 Sum_probs=32.9
Q ss_pred CCcccCCCCCCccCCCCccee-ecCCCCc----cCCCCccCCCCC----CchhhccCCCCCcc
Q psy11797 168 NGRCVLPTGPALLMEVTRMDC-CCTMGMA----WGPQCQLCPTRG----SQEYTDLCLESGLT 221 (249)
Q Consensus 168 ~~~C~c~~g~~~~~~~~~~~C-~C~~g~~----~g~~C~~C~~~~----~~~~~c~Cp~~G~~ 221 (249)
...|.|.+||... .....| .|..|+. .-..|..||... .+.-.|.| ..||.
T Consensus 258 iG~C~C~aGye~~--~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C-~~gyy 317 (996)
T KOG0196|consen 258 IGGCVCKAGYEEA--ENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTC-ENGYY 317 (996)
T ss_pred cCceeecCCCCcc--cCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccc-cCCcc
Confidence 3568899998764 345566 3666651 135788887543 34567999 78864
No 55
>KOG3516|consensus
Probab=32.45 E-value=33 Score=34.87 Aligned_cols=39 Identities=31% Similarity=0.888 Sum_probs=30.5
Q ss_pred ceecCcccccCCCCCCCC-eeeeCCCcceeecC-CCceeCCCCCCcc
Q psy11797 86 ECVDVNECELNLDSCANG-RCVNLEGSYRCECE-RGFKLSLDGKQCL 130 (249)
Q Consensus 86 ~C~~i~~C~~~~~~C~~g-~C~~~~g~~~C~C~-~G~~~~~~g~~C~ 130 (249)
.|.-++.|. +++|+|| .|......|.|.|. .||+ |.+|.
T Consensus 541 ~C~i~drCl--PN~CehgG~C~Qs~~~f~C~C~~TGY~----GatCH 581 (1306)
T KOG3516|consen 541 MCGISDRCL--PNPCEHGGKCSQSWDDFECNCELTGYK----GATCH 581 (1306)
T ss_pred ccccccccC--CccccCCCcccccccceeEeccccccc----ccccc
Confidence 345566676 5899965 99988889999999 7898 77775
No 56
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=27.43 E-value=1.4e+02 Score=17.53 Aligned_cols=16 Identities=31% Similarity=1.043 Sum_probs=8.3
Q ss_pred eeecCCCCccCCCCccC
Q psy11797 187 DCCCTMGMAWGPQCQLC 203 (249)
Q Consensus 187 ~C~C~~g~~~g~~C~~C 203 (249)
+|.|+.++. |..|+.|
T Consensus 19 ~C~C~~~~~-G~~C~~C 34 (46)
T smart00180 19 QCECKPNVT-GRRCDRC 34 (46)
T ss_pred EEECCCCCC-CCCCCcC
Confidence 455555544 5555443
No 57
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=27.20 E-value=40 Score=24.04 Aligned_cols=31 Identities=26% Similarity=0.789 Sum_probs=21.6
Q ss_pred cccccCCCCCC-CCeeeeCC-----CcceeecCCCce
Q psy11797 91 NECELNLDSCA-NGRCVNLE-----GSYRCECERGFK 121 (249)
Q Consensus 91 ~~C~~~~~~C~-~g~C~~~~-----g~~~C~C~~G~~ 121 (249)
+.|....+.|. ||.|++.. .=|.|.|.+.+.
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~ 42 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVV 42 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeecccc
Confidence 44555557787 89999863 338899988544
No 58
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm), Acp62F protein from Drosophila melanogaster, Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad), Bombyx subtilisin inhibitor from Bombyx mori (silk moth), von Willebrand factor; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=24.00 E-value=40 Score=20.56 Aligned_cols=18 Identities=22% Similarity=0.661 Sum_probs=13.2
Q ss_pred eeecCCCceeCCCCCCccc
Q psy11797 113 RCECERGFKLSLDGKQCLG 131 (249)
Q Consensus 113 ~C~C~~G~~~~~~g~~C~~ 131 (249)
.|.|++||.+... ..|..
T Consensus 34 gC~C~~G~v~~~~-~~CV~ 51 (55)
T PF01826_consen 34 GCFCPPGYVRNDN-GRCVP 51 (55)
T ss_dssp EEEETTTEEEETT-SEEEE
T ss_pred cCCCCCCeeEcCC-CCEEc
Confidence 3899999987654 46655
No 59
>KOG3516|consensus
Probab=21.67 E-value=77 Score=32.39 Aligned_cols=37 Identities=24% Similarity=0.516 Sum_probs=29.4
Q ss_pred CCccCCCCCCCCCCceecCCCeeeecC-CCccccCCCc
Q psy11797 31 VDECRTPANTCKFSCKNLIGSYMCTCP-PGYQQVTHST 67 (249)
Q Consensus 31 id~C~~~~~~c~~~C~n~~gsy~C~C~-~G~~g~~~~~ 67 (249)
+|.|.++++..++.|......|.|.|. .||+|..|..
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccccccC
Confidence 366777777777889887788999998 8999988763
Done!