Query psy13158
Match_columns 289
No_of_seqs 190 out of 1207
Neff 9.1
Searched_HMMs 46136
Date Fri Aug 16 21:39:07 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13158.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13158hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.6 6E-15 1.3E-19 138.2 12.0 166 11-263 691-865 (1289)
2 KOG1219|consensus 99.4 6.6E-13 1.4E-17 134.4 8.8 117 113-267 3865-3983(4289)
3 KOG1214|consensus 99.3 1.5E-11 3.3E-16 115.8 12.9 181 6-263 729-916 (1289)
4 KOG1219|consensus 99.2 6.7E-11 1.4E-15 120.4 9.0 107 13-199 3865-3974(4289)
5 KOG4289|consensus 99.1 4.9E-09 1.1E-13 103.8 15.6 59 5-66 1232-1295(2531)
6 KOG1217|consensus 99.0 4.7E-08 1E-12 91.4 17.9 213 7-272 164-405 (487)
7 KOG4289|consensus 98.7 2E-08 4.3E-13 99.7 6.3 91 127-252 1217-1308(2531)
8 KOG1217|consensus 98.7 7.4E-07 1.6E-11 83.3 15.8 203 14-258 91-306 (487)
9 PF07645 EGF_CA: Calcium-bindi 98.5 8.9E-08 1.9E-12 58.5 2.1 33 11-43 1-35 (42)
10 KOG4260|consensus 98.4 1.5E-07 3.3E-12 78.8 3.3 82 135-254 221-304 (350)
11 PF07645 EGF_CA: Calcium-bindi 98.2 9.1E-07 2E-11 54.0 2.3 35 209-255 1-35 (42)
12 smart00179 EGF_CA Calcium-bind 98.1 3.3E-06 7.1E-11 50.4 3.9 34 11-44 1-36 (39)
13 KOG1225|consensus 98.1 2.4E-05 5.3E-10 73.0 11.1 82 116-257 284-365 (525)
14 KOG4260|consensus 98.1 4.4E-06 9.5E-11 70.1 5.1 115 117-256 150-270 (350)
15 PF12947 EGF_3: EGF domain; I 97.9 4.7E-06 1E-10 48.8 1.6 32 17-48 5-36 (36)
16 PF12947 EGF_3: EGF domain; I 97.9 4E-06 8.6E-11 49.1 1.2 31 229-259 5-35 (36)
17 PF00008 EGF: EGF-like domain 97.9 5.3E-06 1.2E-10 47.4 1.5 30 15-44 1-31 (32)
18 cd00054 EGF_CA Calcium-binding 97.8 3.5E-05 7.5E-10 45.4 3.8 34 11-44 1-35 (38)
19 KOG1225|consensus 97.8 0.0003 6.6E-09 65.8 11.4 90 116-263 253-342 (525)
20 PF00008 EGF: EGF-like domain 97.5 3.6E-05 7.9E-10 43.9 1.3 27 230-256 4-31 (32)
21 smart00179 EGF_CA Calcium-bind 97.5 0.0002 4.3E-09 42.5 4.0 26 231-256 10-36 (39)
22 PF12662 cEGF: Complement Clr- 97.2 0.00027 5.9E-09 37.1 2.3 23 244-269 1-24 (24)
23 smart00181 EGF Epidermal growt 97.1 0.00068 1.5E-08 39.2 3.5 28 15-43 2-30 (35)
24 PF06247 Plasmod_Pvs28: Plasmo 97.1 0.00012 2.6E-09 58.5 0.2 148 19-258 7-164 (197)
25 PF12662 cEGF: Complement Clr- 97.0 0.00065 1.4E-08 35.7 2.4 13 32-44 1-13 (24)
26 cd00053 EGF Epidermal growth f 97.0 0.00093 2E-08 38.4 3.4 28 17-44 5-32 (36)
27 cd00054 EGF_CA Calcium-binding 97.0 0.0015 3.3E-08 38.1 4.0 26 231-256 10-35 (38)
28 cd00053 EGF Epidermal growth f 96.6 0.0036 7.8E-08 35.8 3.5 27 230-256 6-32 (36)
29 smart00181 EGF Epidermal growt 96.3 0.0054 1.2E-07 35.3 3.2 25 231-256 7-31 (35)
30 PF14670 FXa_inhibition: Coagu 95.8 0.0059 1.3E-07 35.6 1.5 25 231-257 7-31 (36)
31 KOG1226|consensus 95.5 0.35 7.5E-06 47.1 13.1 24 174-203 556-579 (783)
32 PF14670 FXa_inhibition: Coagu 95.3 0.016 3.4E-07 33.8 2.2 20 25-44 11-30 (36)
33 PF07974 EGF_2: EGF-like domai 95.1 0.029 6.3E-07 31.8 3.0 25 18-44 6-30 (32)
34 PF12661 hEGF: Human growth fa 94.1 0.027 5.9E-07 25.0 1.0 11 34-44 1-11 (13)
35 PF06247 Plasmod_Pvs28: Plasmo 93.6 0.019 4.2E-07 46.1 -0.0 108 118-262 7-127 (197)
36 PF07974 EGF_2: EGF-like domai 93.1 0.14 3.1E-06 28.9 3.0 24 231-256 7-30 (32)
37 KOG0994|consensus 92.8 0.76 1.7E-05 46.8 9.4 21 125-145 878-899 (1758)
38 KOG0994|consensus 92.8 0.42 9.1E-06 48.6 7.7 112 131-258 781-899 (1758)
39 PF12946 EGF_MSP1_1: MSP1 EGF 92.7 0.034 7.4E-07 32.3 0.1 30 15-44 2-32 (37)
40 KOG1226|consensus 92.6 0.9 1.9E-05 44.4 9.4 103 133-271 479-589 (783)
41 cd01475 vWA_Matrilin VWA_Matri 91.4 0.22 4.8E-06 42.0 3.6 38 206-257 183-220 (224)
42 PF12946 EGF_MSP1_1: MSP1 EGF 90.3 0.091 2E-06 30.6 0.2 28 231-258 6-34 (37)
43 PF01683 EB: EB module; Inter 86.3 1.9 4E-05 27.1 4.4 23 231-257 27-49 (52)
44 smart00051 DSL delta serrate l 83.8 2.1 4.4E-05 28.4 3.8 45 188-256 17-61 (63)
45 cd01475 vWA_Matrilin VWA_Matri 81.9 1.7 3.6E-05 36.6 3.5 33 165-199 185-219 (224)
46 PF00954 S_locus_glycop: S-loc 80.0 1.8 3.9E-05 32.0 2.8 33 11-44 76-109 (110)
47 PHA03099 epidermal growth fact 76.1 3.1 6.8E-05 31.4 3.0 30 230-263 51-82 (139)
48 PHA03099 epidermal growth fact 72.7 4.2 9.1E-05 30.8 2.9 36 11-49 41-81 (139)
49 PF09064 Tme5_EGF_like: Thromb 72.1 4.3 9.2E-05 23.1 2.2 14 245-258 18-31 (34)
50 PHA02887 EGF-like protein; Pro 70.0 4.8 0.0001 29.9 2.7 27 20-49 94-122 (126)
51 PF00954 S_locus_glycop: S-loc 65.3 8 0.00017 28.4 3.2 26 230-256 84-109 (110)
52 KOG3516|consensus 65.3 4.7 0.0001 41.6 2.4 37 11-49 544-581 (1306)
53 KOG1836|consensus 65.0 45 0.00098 36.7 9.6 53 132-204 756-812 (1705)
54 PHA02887 EGF-like protein; Pro 61.4 8.7 0.00019 28.6 2.6 30 230-263 92-123 (126)
55 PF00053 Laminin_EGF: Laminin 44.5 22 0.00047 21.8 2.2 22 24-49 11-32 (49)
56 cd00055 EGF_Lam Laminin-type e 43.7 32 0.00069 21.2 2.9 17 246-262 20-36 (50)
57 KOG3512|consensus 43.2 1.7E+02 0.0037 27.6 8.3 27 123-149 285-312 (592)
58 KOG3516|consensus 38.6 27 0.00059 36.3 2.9 42 210-262 539-581 (1306)
59 KOG3607|consensus 33.1 48 0.001 33.1 3.6 46 209-257 602-654 (716)
60 PF12955 DUF3844: Domain of un 29.7 28 0.0006 25.5 1.0 31 13-43 6-43 (103)
61 KOG1836|consensus 24.6 2.3E+02 0.0049 31.6 6.9 28 231-258 781-811 (1705)
62 smart00180 EGF_Lam Laminin-typ 22.4 80 0.0017 19.1 2.0 16 246-261 19-34 (46)
No 1
>KOG1214|consensus
Probab=99.60 E-value=6e-15 Score=138.18 Aligned_cols=166 Identities=30% Similarity=0.716 Sum_probs=120.3
Q ss_pred CcCCCC--CCCCCCCCeeeeCCC-CceeeCCCCccCCCCCCcccCCCCCCCCCCCccccCCCcccCCCCCCCCccccccc
Q psy13158 11 ATLECY--PSPCGPYSECRNING-GPSCSCRPGYIGSPPNCRPECVMNSECPSHEACIKIPECIQNSECPYDKACIREKC 87 (289)
Q Consensus 11 did~C~--~~~C~~~~~C~n~~g-s~~C~C~~Gf~g~~~~C~~~C~~~~~c~~~~~C~~~~~C~~~~~c~~~~~c~~~~c 87 (289)
.++.|. ++-|..++.|...++ .|.|.|..||.|.|.+ |.+.++|...
T Consensus 691 ~~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~----------------c~d~~eca~~-------------- 740 (1289)
T KOG1214|consen 691 PVNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRN----------------CVDENECATG-------------- 740 (1289)
T ss_pred ccccceecCcccCCCccccCCCCcceEEEEeeccCCCCCC----------------CCChhhhccC--------------
Confidence 456666 566777788887764 6888888888887333 4444443322
Q ss_pred cCCCCCCCCCCCeeeeccCCCCcccCCCCCCCCCCCeeeecCCCceeecCCCCccCCC-CCCCCCCCCCCCCCCCCCccc
Q psy13158 88 VDPCPGSCGYGAVCTVINHSNEKCQDPCPGSCGYNAECKVINHNPICSCSQGYIGDGF-SGCYPKPPEVPPPPQQDVQEY 166 (289)
Q Consensus 88 ~~~C~g~~~~~~~C~~~~~~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~-~~C~~~~~~~~~~~~~~~~~~ 166 (289)
...|..+++|++..++|+|.|..||..... ..|.++..+ ..
T Consensus 741 ----------------------------~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~p----------ap 782 (1289)
T KOG1214|consen 741 ----------------------------FHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPP----------AP 782 (1289)
T ss_pred ----------------------------CCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCC----------CC
Confidence 356888999999999999999999875432 236444332 22
Q ss_pred cCCCC--CCCCCCCC--eeeecC-CCCeeeCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeC
Q psy13158 167 VNPCV--PSPCGLYS--QCRDIG-GSPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVI 241 (289)
Q Consensus 167 ~~~C~--~~~C~~~~--~C~~~~-g~~~C~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~ 241 (289)
++.|. .+.|...+ .|+... ++|.|.|.|||.|++.. |.++|+|..+ .|++.++|+|+
T Consensus 783 ~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~----c~dvDeC~ps--------------rChp~A~Cynt 844 (1289)
T KOG1214|consen 783 ANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ----CTDVDECSPS--------------RCHPAATCYNT 844 (1289)
T ss_pred CCccccCccccCcCCceEEEecCCceEEEeecCCccCCccc----cccccccCcc--------------ccCCCceEecC
Confidence 34454 23455444 455554 45999999999999764 4678999865 79999999999
Q ss_pred CCCceeeCCCCCccCCCCCCCC
Q psy13158 242 NHTPTCTCPDGFIGDAFLSCHP 263 (289)
Q Consensus 242 ~g~~~C~C~~G~~g~~~~~C~~ 263 (289)
.|+|.|.|.+||.|++.+ |.+
T Consensus 845 pgsfsC~C~pGy~GDGf~-CVP 865 (1289)
T KOG1214|consen 845 PGSFSCRCQPGYYGDGFQ-CVP 865 (1289)
T ss_pred CCcceeecccCccCCCce-ecC
Confidence 999999999999999986 875
No 2
>KOG1219|consensus
Probab=99.40 E-value=6.6e-13 Score=134.40 Aligned_cols=117 Identities=29% Similarity=0.688 Sum_probs=100.9
Q ss_pred CCC-CCCCCCCCeeeecC-CCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCeeeecCCCCee
Q psy13158 113 DPC-PGSCGYNAECKVIN-HNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPSPCGLYSQCRDIGGSPSC 190 (289)
Q Consensus 113 ~~c-~~~C~~~~~C~~~~-g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C 190 (289)
++| .++|+++|+|.... ++|.|.|++.|+|.. |+.++..|.++||..+++|+...+.|.|
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~------------------CEi~~epC~snPC~~GgtCip~~n~f~C 3926 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNH------------------CEIDLEPCASNPCLTGGTCIPFYNGFLC 3926 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcc------------------cccccccccCCCCCCCCEEEecCCCeeE
Confidence 677 89999999999887 779999999999986 5688999999999999999999999999
Q ss_pred eCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCCCCCC
Q psy13158 191 SCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGDAFLSCHPKPPE 267 (289)
Q Consensus 191 ~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~ 267 (289)
.|+.||+|. .|+. ..+++|..+ .|+++|.|+|+.|+|+|.|-+||.|..+ |.++++|
T Consensus 3927 nC~~gyTG~--~Ce~--~Gi~eCs~n--------------~C~~gg~C~n~~gsf~CncT~g~~gr~c--~~~~pni 3983 (4289)
T KOG1219|consen 3927 NCPNGYTGK--RCEA--RGISECSKN--------------VCGTGGQCINIPGSFHCNCTPGILGRTC--CAEKPNI 3983 (4289)
T ss_pred eCCCCccCc--eeec--ccccccccc--------------cccCCceeeccCCceEeccChhHhcccC--ccccCcc
Confidence 999999999 7752 126788776 8999999999999999999999999853 3444443
No 3
>KOG1214|consensus
Probab=99.33 E-value=1.5e-11 Score=115.78 Aligned_cols=181 Identities=27% Similarity=0.542 Sum_probs=119.1
Q ss_pred cccCCCcCCCC--CCCCCCCCeeeeCCCCceeeCCCCccCCCCCCcccCCCCCCCCCCCccccCCCcccCCCCCCCCccc
Q psy13158 6 VHVDLATLECY--PSPCGPYSECRNINGGPSCSCRPGYIGSPPNCRPECVMNSECPSHEACIKIPECIQNSECPYDKACI 83 (289)
Q Consensus 6 ~~~~~did~C~--~~~C~~~~~C~n~~gs~~C~C~~Gf~g~~~~C~~~C~~~~~c~~~~~C~~~~~C~~~~~c~~~~~c~ 83 (289)
-+| .|++||. .+.|+.+++|+|.+++|.|.|..||... .++.+|+.+..=.
T Consensus 729 r~c-~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~--------------dd~~tCV~i~~pa------------ 781 (1289)
T KOG1214|consen 729 RNC-VDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFA--------------DDRHTCVLITPPA------------ 781 (1289)
T ss_pred CCC-CChhhhccCCCCCCCCceeecCCCceeEEEeecceec--------------cCCcceEEecCCC------------
Confidence 345 4889999 5679999999999999999999998644 1123343221100
Q ss_pred cccccCCCCCCCCCCCeeeeccCCCCcccCCCCCCCCCCCe--eeecC-CCceeecCCCCccCCCCCCCCCCCCCCCCCC
Q psy13158 84 REKCVDPCPGSCGYGAVCTVINHSNEKCQDPCPGSCGYNAE--CKVIN-HNPICSCSQGYIGDGFSGCYPKPPEVPPPPQ 160 (289)
Q Consensus 84 ~~~c~~~C~g~~~~~~~C~~~~~~~~~c~~~c~~~C~~~~~--C~~~~-g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~ 160 (289)
....|++. .+.|..++. |+... +.|.|.|.+||+|+... |
T Consensus 782 -------------p~n~Ce~g-----------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c------------ 824 (1289)
T KOG1214|consen 782 -------------PANPCEDG-----------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-C------------ 824 (1289)
T ss_pred -------------CCCccccC-----------ccccCcCCceEEEecCCceEEEeecCCccCCccc-c------------
Confidence 00111110 133433333 44333 46999999999998754 3
Q ss_pred CCCccccCCCCCCCCCCCCeeeecCCCCeeeCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeee
Q psy13158 161 QDVQEYVNPCVPSPCGLYSQCRDIGGSPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKV 240 (289)
Q Consensus 161 ~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~ 240 (289)
.++|+|.++.|...+.|.++.+++.|+|.+||.|++..|.+.=.....|..... =+..|+.++.|..
T Consensus 825 ----~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er~---------hpl~chg~t~~~~ 891 (1289)
T KOG1214|consen 825 ----TDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQERF---------HPLQCHGSTGFCW 891 (1289)
T ss_pred ----ccccccCccccCCCceEecCCCcceeecccCccCCCceecCCCccCCccccccc---------cceeeccccceeE
Confidence 677999999999999999999999999999999998776533122345543210 0234665554432
Q ss_pred --CCCCceeeCCCCCccCCCCCCCC
Q psy13158 241 --INHTPTCTCPDGFIGDAFLSCHP 263 (289)
Q Consensus 241 --~~g~~~C~C~~G~~g~~~~~C~~ 263 (289)
.+.+|.+.|.++-.|++...|.+
T Consensus 892 ~~Dp~~~e~p~~~~ppG~~~~~c~~ 916 (1289)
T KOG1214|consen 892 CVDPDGHEVPGTQTPPGSTPPHCGP 916 (1289)
T ss_pred eeCCCcccCCCCCCCCCCCCCCCCC
Confidence 34567888888777776666753
No 4
>KOG1219|consensus
Probab=99.18 E-value=6.7e-11 Score=120.40 Aligned_cols=107 Identities=31% Similarity=0.809 Sum_probs=93.7
Q ss_pred CCCCCCCCCCCCeeeeCC-CCceeeCCCCccCCCCCCcccCCCCCCCCCCCccccCCCcccCCCCCCCCccccccccCCC
Q psy13158 13 LECYPSPCGPYSECRNIN-GGPSCSCRPGYIGSPPNCRPECVMNSECPSHEACIKIPECIQNSECPYDKACIREKCVDPC 91 (289)
Q Consensus 13 d~C~~~~C~~~~~C~n~~-gs~~C~C~~Gf~g~~~~C~~~C~~~~~c~~~~~C~~~~~C~~~~~c~~~~~c~~~~c~~~C 91 (289)
+.|...||.++|+|...+ |+|.|.|++.|+|. +|+. +
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~--~CEi---------------~------------------------- 3902 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN--HCEI---------------D------------------------- 3902 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCc--cccc---------------c-------------------------
Confidence 789999999999999987 68999999999999 7751 0
Q ss_pred CCCCCCCCeeeeccCCCCcccCCC-CCCCCCCCeeeecCCCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccc-cCC
Q psy13158 92 PGSCGYGAVCTVINHSNEKCQDPC-PGSCGYNAECKVINHNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEY-VNP 169 (289)
Q Consensus 92 ~g~~~~~~~C~~~~~~~~~c~~~c-~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~-~~~ 169 (289)
+.+| +++|..+++|+...++|.|.|+.||+|.. | +.. +++
T Consensus 3903 --------------------~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~---C---------------e~~Gi~e 3944 (4289)
T KOG1219|consen 3903 --------------------LEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKR---C---------------EARGISE 3944 (4289)
T ss_pred --------------------cccccCCCCCCCCEEEecCCCeeEeCCCCccCce---e---------------ecccccc
Confidence 3334 67899999999999999999999999985 3 344 899
Q ss_pred CCCCCCCCCCeeeecCCCCeeeCCCCcccC
Q psy13158 170 CVPSPCGLYSQCRDIGGSPSCSCLPNYIGA 199 (289)
Q Consensus 170 C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 199 (289)
|+.++|.++++|++..|+|.|.|.+||.|.
T Consensus 3945 Cs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3945 CSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred cccccccCCceeeccCCceEeccChhHhcc
Confidence 999999999999999999999999999987
No 5
>KOG4289|consensus
Probab=99.06 E-value=4.9e-09 Score=103.76 Aligned_cols=59 Identities=34% Similarity=0.900 Sum_probs=50.4
Q ss_pred ccccCCCcCCCCCCCCCCCCeeeeCCCCceeeCCCCccCCCCCCc-----ccCCCCCCCCCCCcccc
Q psy13158 5 LVHVDLATLECYPSPCGPYSECRNINGGPSCSCRPGYIGSPPNCR-----PECVMNSECPSHEACIK 66 (289)
Q Consensus 5 ~~~~~~did~C~~~~C~~~~~C~n~~gs~~C~C~~Gf~g~~~~C~-----~~C~~~~~c~~~~~C~~ 66 (289)
+.-|++.||+|.+.||+++|.|....|.|.|.|++||+|. +|+ ..|. ++.|.++++|++
T Consensus 1232 gd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe--hCEvs~~agrCv-pGvC~nggtC~~ 1295 (2531)
T KOG4289|consen 1232 GDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE--HCEVSARAGRCV-PGVCKNGGTCVN 1295 (2531)
T ss_pred cccccchhHhhhcCCCCCCCceEEecCceeEEecCCcccc--ceeeecccCccc-cceecCCCEEee
Confidence 3468999999999999999999999999999999999999 998 3343 456777888875
No 6
>KOG1217|consensus
Probab=98.96 E-value=4.7e-08 Score=91.39 Aligned_cols=213 Identities=29% Similarity=0.629 Sum_probs=130.7
Q ss_pred ccCCCcCCCC--CCCCCCCCeeeeCCCCceeeCCCCccCCCCCCcccCCCCCCCCCCCccccC-----------CCccc-
Q psy13158 7 HVDLATLECY--PSPCGPYSECRNINGGPSCSCRPGYIGSPPNCRPECVMNSECPSHEACIKI-----------PECIQ- 72 (289)
Q Consensus 7 ~~~~did~C~--~~~C~~~~~C~n~~gs~~C~C~~Gf~g~~~~C~~~C~~~~~c~~~~~C~~~-----------~~C~~- 72 (289)
.+..+.++|. ..+|.+++.|.+..++|.|.|++||.+. .++.. ..+..|.+. ..|..
T Consensus 164 ~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~--~~~~~-------~~~~~c~~~~~~~~~~g~~~~~c~~~ 234 (487)
T KOG1217|consen 164 PCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS--TCETT-------GNGGTCVDSVACSCPPGARGPECEVS 234 (487)
T ss_pred cccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC--cCcCC-------CCCceEecceeccCCCCCCCCCcccc
Confidence 3444447887 4569999999999999999999999988 55421 111122211 11111
Q ss_pred CCCCCCC-CccccccccCCC---CCCCCCC-CeeeeccCCCCcccCCC-CC-CCCCCCeeeecCCCceeecCCCCccCCC
Q psy13158 73 NSECPYD-KACIREKCVDPC---PGSCGYG-AVCTVINHSNEKCQDPC-PG-SCGYNAECKVINHNPICSCSQGYIGDGF 145 (289)
Q Consensus 73 ~~~c~~~-~~c~~~~c~~~C---~g~~~~~-~~C~~~~~~~~~c~~~c-~~-~C~~~~~C~~~~g~~~C~C~~G~~~~~~ 145 (289)
...+..+ ..|.+....+.| +||.... ..+.+ ++.| .. .|.++++|++..+.|.|.|++||.+...
T Consensus 235 ~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~--------~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 235 IVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVD--------VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred cccccCCCCcccccCCceeeeCCCCccccccceeee--------ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence 0111111 233332222222 5665554 34555 4555 22 3888899999998899999999998863
Q ss_pred CCCCCCCCCCCCCCCCCCccccCCCC----CCCCCCCCee--eecCCCCeeeCCCCcccCCCCCccCCCCC-CCCCCCCc
Q psy13158 146 SGCYPKPPEVPPPPQQDVQEYVNPCV----PSPCGLYSQC--RDIGGSPSCSCLPNYIGAPPNCRPECLQN-SECPNDKA 218 (289)
Q Consensus 146 ~~C~~~~~~~~~~~~~~~~~~~~~C~----~~~C~~~~~C--~~~~g~~~C~C~~G~~g~~~~c~~~C~~~-~~C~~~~~ 218 (289)
..+ .+..+|. ..+|..+..| ....+.+.|.|..+|.|. .| ... ++|...
T Consensus 307 ~~~----------------~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~--~C----~~~~~~C~~~-- 362 (487)
T KOG1217|consen 307 TEC----------------VDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR--RC----EDSNDECASS-- 362 (487)
T ss_pred ccc----------------cccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC--cc----ccCCccccCC--
Confidence 111 2334553 3457776677 234456789999998777 44 444 366654
Q ss_pred cccCcccCCCCCCCCCCCeeee-CCCCceeeCCCCCccCCCCCCCCCCCCCccCC
Q psy13158 219 CIREKCADPCPGSCGYNAQCKV-INHTPTCTCPDGFIGDAFLSCHPKPPERECLY 272 (289)
Q Consensus 219 c~~~~c~~~c~~~C~~~~~C~~-~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~ 272 (289)
.+..++.|++ ..++|.|.|+.+|.+..........++++|..
T Consensus 363 ------------~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~ 405 (487)
T KOG1217|consen 363 ------------PCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG 405 (487)
T ss_pred ------------ccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC
Confidence 4667889999 78999999999998741111122227888864
No 7
>KOG4289|consensus
Probab=98.71 E-value=2e-08 Score=99.67 Aligned_cols=91 Identities=30% Similarity=0.791 Sum_probs=74.0
Q ss_pred ecCCCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCeeeecCCCCeeeCCCCcccCCCCCccC
Q psy13158 127 VINHNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPSPCGLYSQCRDIGGSPSCSCLPNYIGAPPNCRPE 206 (289)
Q Consensus 127 ~~~g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~c~~~ 206 (289)
+..+++.|.|++||+|+. |+..+|.|-+.||.+++.|....|.|.|.|.+||+|. .|+-
T Consensus 1217 ~pvnglrCrCPpGFTgd~------------------CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe--hCEv- 1275 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDY------------------CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE--HCEV- 1275 (2531)
T ss_pred cccCceeEeCCCCCCccc------------------ccchhHhhhcCCCCCCCceEEecCceeEEecCCcccc--ceee-
Confidence 344678999999999996 5678999999999999999999999999999999999 6641
Q ss_pred CCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeC-CCCceeeCCCC
Q psy13158 207 CLQNSECPNDKACIREKCADPCPGSCGYNAQCKVI-NHTPTCTCPDG 252 (289)
Q Consensus 207 C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~-~g~~~C~C~~G 252 (289)
+ ...+.|+ ++.|.++++|++. .|+|.|+|+.|
T Consensus 1276 --s----~~agrCv--------pGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1276 --S----ARAGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred --e----cccCccc--------cceecCCCEEeecCCCceeccCCCc
Confidence 1 1222333 2389999999985 68899999988
No 8
>KOG1217|consensus
Probab=98.68 E-value=7.4e-07 Score=83.28 Aligned_cols=203 Identities=26% Similarity=0.583 Sum_probs=115.8
Q ss_pred CCCCCCCCCCCeeeeCCCCceeeCCCCccCCCCCCcc--cCCCCCCCCCCCccccCCCcccCCCCCCCCccccccccCCC
Q psy13158 14 ECYPSPCGPYSECRNINGGPSCSCRPGYIGSPPNCRP--ECVMNSECPSHEACIKIPECIQNSECPYDKACIREKCVDPC 91 (289)
Q Consensus 14 ~C~~~~C~~~~~C~n~~gs~~C~C~~Gf~g~~~~C~~--~C~~~~~c~~~~~C~~~~~C~~~~~c~~~~~c~~~~c~~~C 91 (289)
.+...+....+.+....++|.|.|++||.+. .+.. .|..... .+.....|..... .-..+.+.+.
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~--~~~~~~~C~~~~~-----~~~~~~~c~~~~~-----~~~~~~c~C~- 157 (487)
T KOG1217|consen 91 PCRSPCLLLCGECVDCVGSYECTCPPGYQGT--PCEGECECVTGPG-----VCCIDGSCSNGPG-----SVGPFRCSCT- 157 (487)
T ss_pred cccCCcccCCccccCCCCCceeeCCCccccC--cCCcceeecCCCC-----CeeCchhhcCCCC-----CCCceeeeeC-
Confidence 3334444555667778889999999999998 6653 2422211 0111112211100 0000111111
Q ss_pred CCCCCCCCeeeeccCCCCcccCCC---CCCCCCCCeeeecCCCceeecCCCCccCCCCC------CCCC--CCCCCCCCC
Q psy13158 92 PGSCGYGAVCTVINHSNEKCQDPC---PGSCGYNAECKVINHNPICSCSQGYIGDGFSG------CYPK--PPEVPPPPQ 160 (289)
Q Consensus 92 ~g~~~~~~~C~~~~~~~~~c~~~c---~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~~------C~~~--~~~~~~~~~ 160 (289)
.+|... .+... .++| ...|.+.+.|.+..++|.|.|.++|.+..... |... .........
T Consensus 158 ~g~~~~--~~~~~-------~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~ 228 (487)
T KOG1217|consen 158 EGYEGE--PCETD-------LDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARG 228 (487)
T ss_pred CCcccc--ccccc-------ccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCC
Confidence 333221 11110 1233 34588889999999999999999998875220 1000 000001111
Q ss_pred CCCccccCCCCCCCCCCCCeeeecCCCCeeeCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeee
Q psy13158 161 QDVQEYVNPCVPSPCGLYSQCRDIGGSPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKV 240 (289)
Q Consensus 161 ~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~ 240 (289)
..+...+.++... . +.|+++.+++.|.+++||.+.. + ..+.++++|... .+ |.++++|++
T Consensus 229 ~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~--~-~~~~~~~~C~~~------------~~-c~~~~~C~~ 288 (487)
T KOG1217|consen 229 PECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDA--C-VTCVDVDSCALI------------AS-CPNGGTCVN 288 (487)
T ss_pred CCcccccccccCC---C-CcccccCCceeeeCCCCccccc--c-ceeeeccccCCC------------Cc-cCCCCeeec
Confidence 1222222333221 3 7888888899999999999883 1 133566788776 33 888899999
Q ss_pred CCCCceeeCCCCCccCCC
Q psy13158 241 INHTPTCTCPDGFIGDAF 258 (289)
Q Consensus 241 ~~g~~~C~C~~G~~g~~~ 258 (289)
..+.|.|.|++||+|...
T Consensus 289 ~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 289 VPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred CCCcceeeCCCCCCCCCC
Confidence 999999999999999864
No 9
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.46 E-value=8.9e-08 Score=58.54 Aligned_cols=33 Identities=42% Similarity=0.825 Sum_probs=30.1
Q ss_pred CcCCCC--CCCCCCCCeeeeCCCCceeeCCCCccC
Q psy13158 11 ATLECY--PSPCGPYSECRNINGGPSCSCRPGYIG 43 (289)
Q Consensus 11 did~C~--~~~C~~~~~C~n~~gs~~C~C~~Gf~g 43 (289)
|||||. .+.|..++.|+|+.|+|.|.|++||..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 799999 457998999999999999999999983
No 10
>KOG4260|consensus
Probab=98.43 E-value=1.5e-07 Score=78.75 Aligned_cols=82 Identities=30% Similarity=0.653 Sum_probs=60.6
Q ss_pred ecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCC--CCCCCCCCeeeecCCCCeeeCCCCcccCCCCCccCCCCCCC
Q psy13158 135 SCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCV--PSPCGLYSQCRDIGGSPSCSCLPNYIGAPPNCRPECLQNSE 212 (289)
Q Consensus 135 ~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~c~~~C~~~~~ 212 (289)
.|..||...... | .|+++|. +.+|.....|+|+.|+|.|...+||.+....|+ .|. +.
T Consensus 221 kCkkGW~lde~g-C----------------vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~-~~~--d~ 280 (350)
T KOG4260|consen 221 KCKKGWKLDEEG-C----------------VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQ-FCA--DV 280 (350)
T ss_pred hhcccceecccc-c----------------ccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhh-hhh--hh
Confidence 688888876432 4 7899997 578988899999999999999999977533332 111 22
Q ss_pred CCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCc
Q psy13158 213 CPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFI 254 (289)
Q Consensus 213 C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~ 254 (289)
|.. .+..|.|+.++|+|+|..|+.
T Consensus 281 ~~~------------------kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 281 CAS------------------KNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred ccc------------------CCCCcccCCccEEEEecccce
Confidence 322 256788999999999998864
No 11
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.20 E-value=9.1e-07 Score=54.03 Aligned_cols=35 Identities=29% Similarity=0.724 Sum_probs=30.3
Q ss_pred CCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCcc
Q psy13158 209 QNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIG 255 (289)
Q Consensus 209 ~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g 255 (289)
||+||... ++.|..++.|+|+.|+|+|.|++||+.
T Consensus 1 DidEC~~~------------~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEG------------PHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTT------------SSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCC------------CCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 46778776 678988899999999999999999984
No 12
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.14 E-value=3.3e-06 Score=50.44 Aligned_cols=34 Identities=38% Similarity=0.854 Sum_probs=30.8
Q ss_pred CcCCCCC-CCCCCCCeeeeCCCCceeeCCCCcc-CC
Q psy13158 11 ATLECYP-SPCGPYSECRNINGGPSCSCRPGYI-GS 44 (289)
Q Consensus 11 did~C~~-~~C~~~~~C~n~~gs~~C~C~~Gf~-g~ 44 (289)
|+|+|.. .+|.++++|+++.++|.|.|++||+ |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 5789987 7999999999999999999999999 65
No 13
>KOG1225|consensus
Probab=98.13 E-value=2.4e-05 Score=72.96 Aligned_cols=82 Identities=32% Similarity=0.759 Sum_probs=54.7
Q ss_pred CCCCCCCCeeeecCCCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCeeeecCCCCeeeCCCC
Q psy13158 116 PGSCGYNAECKVINHNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPSPCGLYSQCRDIGGSPSCSCLPN 195 (289)
Q Consensus 116 ~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G 195 (289)
+..|..++.+++ + .|.|.+||.|...+ +..|. ..|..++.|+. -+|.|.+|
T Consensus 284 p~~cs~~g~~~~--g--~CiC~~g~~G~dCs--------------------~~~cp-adC~g~G~Ci~----G~C~C~~G 334 (525)
T KOG1225|consen 284 PVDCSGGGVCVD--G--ECICNPGYSGKDCS--------------------IRRCP-ADCSGHGKCID----GECLCDEG 334 (525)
T ss_pred CcccCCCceecC--C--EeecCCCccccccc--------------------cccCC-ccCCCCCcccC----CceEeCCC
Confidence 444655555543 2 79999999998622 22343 47888888872 36999999
Q ss_pred cccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCC
Q psy13158 196 YIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGDA 257 (289)
Q Consensus 196 ~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 257 (289)
|+|. .| ... .|.+++.|++. |.|.+||+|..
T Consensus 335 y~G~--~C----~~~--------------------~C~~~g~cv~g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 335 YTGE--LC----IQR--------------------ACSGGGQCVNG-----CKCKKGWRGPD 365 (525)
T ss_pred CcCC--cc----ccc--------------------ccCCCceeccC-----ceeccCccCCC
Confidence 9998 43 322 25556677653 88889998876
No 14
>KOG4260|consensus
Probab=98.10 E-value=4.4e-06 Score=70.15 Aligned_cols=115 Identities=23% Similarity=0.444 Sum_probs=71.5
Q ss_pred CCCCCCCeeeec---CCCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCC--CCCCCCCCeeeecCCCCee-
Q psy13158 117 GSCGYNAECKVI---NHNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCV--PSPCGLYSQCRDIGGSPSC- 190 (289)
Q Consensus 117 ~~C~~~~~C~~~---~g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~--~~~C~~~~~C~~~~g~~~C- 190 (289)
.+|+.++.|... .|+-.|.|..||.|.....|.....+..+.... -.|. ...|. +.|. ..++..|
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~------lvCt~Ch~~C~--~~Cs-g~~~k~C~ 220 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQH------LVCTACHEGCL--GVCS-GESSKGCS 220 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhccccc------chhhhhhhhhh--cccC-CCCCCChh
Confidence 345555555432 366789999999998754342221111110000 0111 01122 1332 2233445
Q ss_pred eCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 191 SCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 191 ~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
.|..||.++. ..|.|+++|... +.+|.....|+|+.|||.|.+.+||.+.
T Consensus 221 kCkkGW~lde----~gCvDvnEC~~e------------p~~c~~~qfCvNteGSf~C~dk~Gy~~g 270 (350)
T KOG4260|consen 221 KCKKGWKLDE----EGCVDVNECQNE------------PAPCKAHQFCVNTEGSFKCEDKEGYKKG 270 (350)
T ss_pred hhcccceecc----cccccHHHHhcC------------CCCCChhheeecCCCceEecccccccCC
Confidence 5899998872 267899999887 6789888899999999999999999873
No 15
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.93 E-value=4.7e-06 Score=48.80 Aligned_cols=32 Identities=38% Similarity=0.959 Sum_probs=25.0
Q ss_pred CCCCCCCCeeeeCCCCceeeCCCCccCCCCCC
Q psy13158 17 PSPCGPYSECRNINGGPSCSCRPGYIGSPPNC 48 (289)
Q Consensus 17 ~~~C~~~~~C~n~~gs~~C~C~~Gf~g~~~~C 48 (289)
.+.|..+++|+++.++|.|.|++||.|+|..|
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCcCC
Confidence 35688999999999999999999999986543
No 16
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.92 E-value=4e-06 Score=49.11 Aligned_cols=31 Identities=42% Similarity=0.881 Sum_probs=25.2
Q ss_pred CCCCCCCCeeeeCCCCceeeCCCCCccCCCC
Q psy13158 229 PGSCGYNAQCKVINHTPTCTCPDGFIGDAFL 259 (289)
Q Consensus 229 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~ 259 (289)
.+.|+.+|.|+++.++|.|.|++||.|++..
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 4579999999999999999999999999864
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.91 E-value=5.3e-06 Score=47.36 Aligned_cols=30 Identities=40% Similarity=1.105 Sum_probs=27.3
Q ss_pred CCCCCCCCCCeeeeCC-CCceeeCCCCccCC
Q psy13158 15 CYPSPCGPYSECRNIN-GGPSCSCRPGYIGS 44 (289)
Q Consensus 15 C~~~~C~~~~~C~n~~-gs~~C~C~~Gf~g~ 44 (289)
|.+.+|.++|+|++.. ++|.|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5567999999999998 89999999999986
No 18
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.79 E-value=3.5e-05 Score=45.39 Aligned_cols=34 Identities=41% Similarity=0.873 Sum_probs=30.5
Q ss_pred CcCCCCC-CCCCCCCeeeeCCCCceeeCCCCccCC
Q psy13158 11 ATLECYP-SPCGPYSECRNINGGPSCSCRPGYIGS 44 (289)
Q Consensus 11 did~C~~-~~C~~~~~C~n~~gs~~C~C~~Gf~g~ 44 (289)
++|+|.. .+|.+++.|+++.++|.|.|++||.|.
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 4678886 789888999999999999999999987
No 19
>KOG1225|consensus
Probab=97.76 E-value=0.0003 Score=65.78 Aligned_cols=90 Identities=32% Similarity=0.841 Sum_probs=60.5
Q ss_pred CCCCCCCCeeeecCCCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCeeeecCCCCeeeCCCC
Q psy13158 116 PGSCGYNAECKVINHNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPSPCGLYSQCRDIGGSPSCSCLPN 195 (289)
Q Consensus 116 ~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G 195 (289)
+..|...+.|+. -+|.|++||.|.. |. .-.|.. .|+.++.+++. .|.|++|
T Consensus 253 ~~~c~~~g~c~~----G~CIC~~Gf~G~d---C~-----------------e~~Cp~-~cs~~g~~~~g----~CiC~~g 303 (525)
T KOG1225|consen 253 PGGCTGRGQCVE----GRCICPPGFTGDD---CD-----------------ELVCPV-DCSGGGVCVDG----ECICNPG 303 (525)
T ss_pred CCCCcccceEeC----CeEeCCCCCcCCC---CC-----------------cccCCc-ccCCCceecCC----EeecCCC
Confidence 344444455653 2799999999987 31 123543 36666666542 7999999
Q ss_pred cccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCC
Q psy13158 196 YIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGDAFLSCHP 263 (289)
Q Consensus 196 ~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~ 263 (289)
|.|+ .|+ . ..| +..|..++.|++. +|.|.+||+|. .|++
T Consensus 304 ~~G~--dCs----~-~~c---------------padC~g~G~Ci~G----~C~C~~Gy~G~---~C~~ 342 (525)
T KOG1225|consen 304 YSGK--DCS----I-RRC---------------PADCSGHGKCIDG----ECLCDEGYTGE---LCIQ 342 (525)
T ss_pred cccc--ccc----c-ccC---------------CccCCCCCcccCC----ceEeCCCCcCC---cccc
Confidence 9998 442 1 123 4468888999933 79999999999 5654
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.55 E-value=3.6e-05 Score=43.87 Aligned_cols=27 Identities=30% Similarity=0.791 Sum_probs=24.9
Q ss_pred CCCCCCCeeeeCC-CCceeeCCCCCccC
Q psy13158 230 GSCGYNAQCKVIN-HTPTCTCPDGFIGD 256 (289)
Q Consensus 230 ~~C~~~~~C~~~~-g~~~C~C~~G~~g~ 256 (289)
++|.++++|++.. ++|.|.|++||+|+
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4799999999988 99999999999986
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.48 E-value=0.0002 Score=42.50 Aligned_cols=26 Identities=27% Similarity=0.759 Sum_probs=23.6
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCc-cC
Q psy13158 231 SCGYNAQCKVINHTPTCTCPDGFI-GD 256 (289)
Q Consensus 231 ~C~~~~~C~~~~g~~~C~C~~G~~-g~ 256 (289)
+|.+++.|+++.++|.|.|++||. |.
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCC
Confidence 688888999999999999999999 65
No 22
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.25 E-value=0.00027 Score=37.12 Aligned_cols=23 Identities=35% Similarity=0.664 Sum_probs=18.3
Q ss_pred CceeeCCCCCccCCC-CCCCCCCCCCc
Q psy13158 244 TPTCTCPDGFIGDAF-LSCHPKPPERE 269 (289)
Q Consensus 244 ~~~C~C~~G~~g~~~-~~C~~~~~~~~ 269 (289)
||+|.|++||+.... ++|. ||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~---DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE---DIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc---cCCC
Confidence 689999999996433 4598 9986
No 23
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.13 E-value=0.00068 Score=39.20 Aligned_cols=28 Identities=46% Similarity=1.114 Sum_probs=24.6
Q ss_pred CCC-CCCCCCCeeeeCCCCceeeCCCCccC
Q psy13158 15 CYP-SPCGPYSECRNINGGPSCSCRPGYIG 43 (289)
Q Consensus 15 C~~-~~C~~~~~C~n~~gs~~C~C~~Gf~g 43 (289)
|.. .+|.++ +|+++.++|.|.|++||.|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence 455 578887 9999999999999999998
No 24
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.12 E-value=0.00012 Score=58.48 Aligned_cols=148 Identities=26% Similarity=0.589 Sum_probs=82.7
Q ss_pred CCCCCCeeeeCCCCceeeCCCCccCCCCCCcccCCCCCCCCCCCccccCCCcccCCCCCCCCccccccccCCCCCCCCCC
Q psy13158 19 PCGPYSECRNINGGPSCSCRPGYIGSPPNCRPECVMNSECPSHEACIKIPECIQNSECPYDKACIREKCVDPCPGSCGYG 98 (289)
Q Consensus 19 ~C~~~~~C~n~~gs~~C~C~~Gf~g~~~~C~~~C~~~~~c~~~~~C~~~~~C~~~~~c~~~~~c~~~~c~~~C~g~~~~~ 98 (289)
.|. +|..+.....|.|.|.+||... +..+|....+|.....
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~---------------~EntCE~kv~C~~~e~----------------------- 47 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLK---------------NENTCEEKVECDKLEN----------------------- 47 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEE---------------ETTEEEE----SG-GG-----------------------
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEc---------------cccccccceecCcccc-----------------------
Confidence 454 5778888889999999999754 1224544444443200
Q ss_pred CeeeeccCCCCcccCCCCCCCCCCCeeeecC-----CCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCC
Q psy13158 99 AVCTVINHSNEKCQDPCPGSCGYNAECKVIN-----HNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPS 173 (289)
Q Consensus 99 ~~C~~~~~~~~~c~~~c~~~C~~~~~C~~~~-----g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~ 173 (289)
+ ..+|...+.|++.. ..|.|.|.+||...... |. .+.|...
T Consensus 48 -------------~---~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~v-Cv-----------------p~~C~~~ 93 (197)
T PF06247_consen 48 -------------V---NKPCGDYAKCINQANKGEERAYKCDCINGYILKQGV-CV-----------------PNKCNNK 93 (197)
T ss_dssp -------------T---TSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSS-EE-----------------EGGGSS-
T ss_pred -------------c---CccccchhhhhcCCCcccceeEEEecccCceeeCCe-Ec-----------------hhhcCce
Confidence 0 13455566776654 46899999999987543 31 2356556
Q ss_pred CCCCCCeeeecC---CCCeeeCCCCcccCCCCCccCCCC--CCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceee
Q psy13158 174 PCGLYSQCRDIG---GSPSCSCLPNYIGAPPNCRPECLQ--NSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCT 248 (289)
Q Consensus 174 ~C~~~~~C~~~~---g~~~C~C~~G~~g~~~~c~~~C~~--~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~ 248 (289)
.|. .+.|+..+ ....|+|.-|+..+... .|.. ..+|+. .|..+..|..+.+-|+|.
T Consensus 94 ~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~---kCtk~G~T~C~L---------------KCk~nE~CK~~~~~Y~C~ 154 (197)
T PF06247_consen 94 DCG-SGKCILDPDNPNNPTCSCNIGKVPDDNK---KCTKTGETKCSL---------------KCKENEECKLVDGYYKCV 154 (197)
T ss_dssp --T-TEEEEEEEGGGSEEEEEE-TEEETTTTT---ESEEEE-----------------------TTTEEEEEETTEEEEE
T ss_pred ecC-CCeEEecCCCCCCceeEeeeceEeccCC---cccCCCccceee---------------ecCCCcceeeeCcEEEee
Confidence 677 57886432 23489999998732111 2221 134544 377788999999999999
Q ss_pred CCCCCccCCC
Q psy13158 249 CPDGFIGDAF 258 (289)
Q Consensus 249 C~~G~~g~~~ 258 (289)
+.+||.+++.
T Consensus 155 ~~~~~~~~~~ 164 (197)
T PF06247_consen 155 CKEGFPGDGE 164 (197)
T ss_dssp E-TT-EEETT
T ss_pred cCCCCCCCCC
Confidence 9999987643
No 25
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.02 E-value=0.00065 Score=35.71 Aligned_cols=13 Identities=54% Similarity=1.050 Sum_probs=11.5
Q ss_pred CceeeCCCCccCC
Q psy13158 32 GPSCSCRPGYIGS 44 (289)
Q Consensus 32 s~~C~C~~Gf~g~ 44 (289)
||.|.|++||+..
T Consensus 1 sy~C~C~~Gy~l~ 13 (24)
T PF12662_consen 1 SYTCSCPPGYQLS 13 (24)
T ss_pred CEEeeCCCCCcCC
Confidence 6899999999865
No 26
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.02 E-value=0.00093 Score=38.44 Aligned_cols=28 Identities=39% Similarity=0.967 Sum_probs=25.2
Q ss_pred CCCCCCCCeeeeCCCCceeeCCCCccCC
Q psy13158 17 PSPCGPYSECRNINGGPSCSCRPGYIGS 44 (289)
Q Consensus 17 ~~~C~~~~~C~n~~gs~~C~C~~Gf~g~ 44 (289)
..+|.++++|+++.++|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5678888999999999999999999875
No 27
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.95 E-value=0.0015 Score=38.11 Aligned_cols=26 Identities=27% Similarity=0.779 Sum_probs=23.6
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 231 SCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 231 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
.|.+++.|++..++|.|.|++||.|.
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCC
Confidence 68777899999999999999999986
No 28
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.57 E-value=0.0036 Score=35.85 Aligned_cols=27 Identities=30% Similarity=0.832 Sum_probs=24.2
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 230 GSCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 230 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
..|.+++.|++..++|.|.|+.||.|.
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCccc
Confidence 367778899999999999999999987
No 29
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.31 E-value=0.0054 Score=35.30 Aligned_cols=25 Identities=36% Similarity=0.937 Sum_probs=22.5
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 231 SCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 231 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
.|.++ .|+++.++|.|.|++||.|.
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccC
Confidence 68777 99999999999999999994
No 30
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.76 E-value=0.0059 Score=35.61 Aligned_cols=25 Identities=24% Similarity=0.632 Sum_probs=19.6
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCccCC
Q psy13158 231 SCGYNAQCKVINHTPTCTCPDGFIGDA 257 (289)
Q Consensus 231 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 257 (289)
.|.+ .|++++++|+|.|++||+...
T Consensus 7 gC~h--~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 7 GCSH--ICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp GSSS--EEEEETTSEEEE-STTEEE-T
T ss_pred CcCC--CCccCCCceEeECCCCCEECc
Confidence 4654 799999999999999998764
No 31
>KOG1226|consensus
Probab=95.52 E-value=0.35 Score=47.10 Aligned_cols=24 Identities=33% Similarity=0.935 Sum_probs=18.0
Q ss_pred CCCCCCeeeecCCCCeeeCCCCcccCCCCC
Q psy13158 174 PCGLYSQCRDIGGSPSCSCLPNYIGAPPNC 203 (289)
Q Consensus 174 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~c 203 (289)
.|..++.|.- -+|.|.+||+|. .|
T Consensus 556 lC~g~G~C~C----G~CvC~~GwtG~--~C 579 (783)
T KOG1226|consen 556 LCGGHGRCEC----GRCVCNPGWTGS--AC 579 (783)
T ss_pred ccCCCCeEeC----CcEEcCCCCccC--CC
Confidence 4776777742 369999999999 66
No 32
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.28 E-value=0.016 Score=33.85 Aligned_cols=20 Identities=45% Similarity=0.894 Sum_probs=17.1
Q ss_pred eeeeCCCCceeeCCCCccCC
Q psy13158 25 ECRNINGGPSCSCRPGYIGS 44 (289)
Q Consensus 25 ~C~n~~gs~~C~C~~Gf~g~ 44 (289)
.|++++++|.|.|++||+..
T Consensus 11 ~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp EEEEETTSEEEE-STTEEE-
T ss_pred CCccCCCceEeECCCCCEEC
Confidence 89999999999999999865
No 33
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.15 E-value=0.029 Score=31.77 Aligned_cols=25 Identities=28% Similarity=0.715 Sum_probs=21.2
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCccCC
Q psy13158 18 SPCGPYSECRNINGGPSCSCRPGYIGS 44 (289)
Q Consensus 18 ~~C~~~~~C~n~~gs~~C~C~~Gf~g~ 44 (289)
..|.++|+|++. ...|.|.+||+|.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCC
Confidence 358899999987 3479999999997
No 34
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.14 E-value=0.027 Score=24.99 Aligned_cols=11 Identities=45% Similarity=1.467 Sum_probs=8.9
Q ss_pred eeeCCCCccCC
Q psy13158 34 SCSCRPGYIGS 44 (289)
Q Consensus 34 ~C~C~~Gf~g~ 44 (289)
.|.|++||+|.
T Consensus 1 ~C~C~~G~~G~ 11 (13)
T PF12661_consen 1 TCQCPPGWTGP 11 (13)
T ss_dssp EEEE-TTEETT
T ss_pred CccCcCCCcCC
Confidence 48999999998
No 35
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=93.61 E-value=0.019 Score=46.11 Aligned_cols=108 Identities=28% Similarity=0.686 Sum_probs=62.9
Q ss_pred CCCCCCeeeecCCCceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCC-----CCCCCCCCeeeecC-----CC
Q psy13158 118 SCGYNAECKVINHNPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCV-----PSPCGLYSQCRDIG-----GS 187 (289)
Q Consensus 118 ~C~~~~~C~~~~g~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~-----~~~C~~~~~C~~~~-----g~ 187 (289)
.|. +|..+.....|.|.|..||.......| +...+|. ..+|...+.|++.. ..
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~EntC----------------E~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~ 69 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLKNENTC----------------EEKVECDKLENVNKPCGDYAKCINQANKGEERA 69 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEEETTEE----------------EE----SG-GGTTSEEETTEEEEE-SSTTSSTS
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEcccccc----------------ccceecCcccccCccccchhhhhcCCCccccee
Confidence 455 466777778899999999987543322 3334554 24688888998754 56
Q ss_pred CeeeCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCC---CCceeeCCCCCccCCCCCCC
Q psy13158 188 PSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVIN---HTPTCTCPDGFIGDAFLSCH 262 (289)
Q Consensus 188 ~~C~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~---g~~~C~C~~G~~g~~~~~C~ 262 (289)
|.|.|.+||......|. . +.|... .|+ .|.|+..+ ....|.|..|+..+....|.
T Consensus 70 ~~C~C~~gY~~~~~vCv----p-~~C~~~--------------~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCt 127 (197)
T PF06247_consen 70 YKCDCINGYILKQGVCV----P-NKCNNK--------------DCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCT 127 (197)
T ss_dssp EEEEE-TTEEESSSSEE----E-GGGSS-----------------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESE
T ss_pred EEEecccCceeeCCeEc----h-hhcCce--------------ecC-CCeEEecCCCCCCceeEeeeceEeccCCccc
Confidence 99999999988733222 1 234333 577 57887432 23489999999843333344
No 36
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.12 E-value=0.14 Score=28.94 Aligned_cols=24 Identities=25% Similarity=0.715 Sum_probs=20.2
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 231 SCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 231 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
.|..+++|+.. ..+|.|.+||+|.
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCC
Confidence 58888999866 3489999999998
No 37
>KOG0994|consensus
Probab=92.84 E-value=0.76 Score=46.82 Aligned_cols=21 Identities=33% Similarity=0.759 Sum_probs=15.5
Q ss_pred eeecCCCcee-ecCCCCccCCC
Q psy13158 125 CKVINHNPIC-SCSQGYIGDGF 145 (289)
Q Consensus 125 C~~~~g~~~C-~C~~G~~~~~~ 145 (289)
|.+...+++| .|..||.|++.
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~ 899 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPR 899 (1758)
T ss_pred ccccccccchhhhhccccCCcc
Confidence 4455566777 79999998864
No 38
>KOG0994|consensus
Probab=92.82 E-value=0.42 Score=48.56 Aligned_cols=112 Identities=27% Similarity=0.636 Sum_probs=60.3
Q ss_pred CceeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCC-CCCeeeecCCCC--ee-eCCCCcccCCCCCcc-
Q psy13158 131 NPICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPSPCG-LYSQCRDIGGSP--SC-SCLPNYIGAPPNCRP- 205 (289)
Q Consensus 131 ~~~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~~C~-~~~~C~~~~g~~--~C-~C~~G~~g~~~~c~~- 205 (289)
+.+|.|.++..|....+|.++.+..+..+.+.|.-...--..+.|. ..++|.-.+|.| .| +|.|||+|-| .|++
T Consensus 781 GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~WgFP-eCr~C 859 (1758)
T KOG0994|consen 781 GGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWGFP-ECRPC 859 (1758)
T ss_pred CceecccCccccccccccCCcccCcCCccCccccccccccccccccccccceeeccccchhhccccCCCccCCC-cCccc
Confidence 3578899988888766677666665555544332211100112232 234444334443 34 5778887753 3331
Q ss_pred CCCC-CCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCcee-eCCCCCccCCC
Q psy13158 206 ECLQ-NSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTC-TCPDGFIGDAF 258 (289)
Q Consensus 206 ~C~~-~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C-~C~~G~~g~~~ 258 (289)
.|.. .++|... ++.|. .|.+..+++.| .|.+||.|+..
T Consensus 860 qCNgHA~~Cd~~------------tGaCi---~CqD~T~G~~CdrCl~GyyGdP~ 899 (1758)
T KOG0994|consen 860 QCNGHADTCDPI------------TGACI---DCQDSTTGHSCDRCLDGYYGDPR 899 (1758)
T ss_pred cccCcccccCcc------------ccccc---cccccccccchhhhhccccCCcc
Confidence 1111 1344333 33444 36667778888 69999988764
No 39
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.69 E-value=0.034 Score=32.34 Aligned_cols=30 Identities=27% Similarity=0.727 Sum_probs=21.7
Q ss_pred CCCCCCCCCCeeeeCC-CCceeeCCCCccCC
Q psy13158 15 CYPSPCGPYSECRNIN-GGPSCSCRPGYIGS 44 (289)
Q Consensus 15 C~~~~C~~~~~C~n~~-gs~~C~C~~Gf~g~ 44 (289)
|....|..|+.|++.. |++.|.|..||...
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 4556788999999987 99999999999765
No 40
>KOG1226|consensus
Probab=92.60 E-value=0.9 Score=44.38 Aligned_cols=103 Identities=25% Similarity=0.615 Sum_probs=55.5
Q ss_pred eeecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCC----CCCCCCCCeeeecCCCCeeeCCCCcc----cCCCCCc
Q psy13158 133 ICSCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCV----PSPCGLYSQCRDIGGSPSCSCLPNYI----GAPPNCR 204 (289)
Q Consensus 133 ~C~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~----~~~C~~~~~C~~~~g~~~C~C~~G~~----g~~~~c~ 204 (289)
.|.|.+||.|.... |...... .....+.|. +.+|...+.|.= -.|.|.+... |. +|+
T Consensus 479 ~C~C~~G~~G~~CE-C~~~~~s--------s~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~~i~G~--fCE 543 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCE-CSTDELS--------SSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNGKIYGK--FCE 543 (783)
T ss_pred ceecCCCCCCCccc-CCccccC--------cHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCCceeee--eee
Confidence 57999999998633 2111000 000123343 126777776643 2578876655 44 553
Q ss_pred cCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCCCCCCCccC
Q psy13158 205 PECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGDAFLSCHPKPPERECL 271 (289)
Q Consensus 205 ~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~ 271 (289)
|.+ -.|.... -..|..++.|.=. +|+|.+||+|..+. |.. +.+-|.
T Consensus 544 --CDn-fsC~r~~-----------g~lC~g~G~C~CG----~CvC~~GwtG~~C~-C~~--std~C~ 589 (783)
T KOG1226|consen 544 --CDN-FSCERHK-----------GVLCGGHGRCECG----RCVCNPGWTGSACN-CPL--STDTCE 589 (783)
T ss_pred --ccC-ccccccc-----------CcccCCCCeEeCC----cEEcCCCCccCCCC-CCC--CCcccc
Confidence 211 1233321 2357767777643 69999999999654 443 444443
No 41
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.39 E-value=0.22 Score=41.96 Aligned_cols=38 Identities=18% Similarity=0.448 Sum_probs=29.0
Q ss_pred CCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCC
Q psy13158 206 ECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGDA 257 (289)
Q Consensus 206 ~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 257 (289)
.|.++++|... ++.|.+ .|.++.|+|.|.|+.||+...
T Consensus 183 ~C~~~~~C~~~------------~~~c~~--~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 183 ICVVPDLCATL------------SHVCQQ--VCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cCcCchhhcCC------------CCCccc--eEEcCCCCEEeECCCCccCCC
Confidence 35556667654 456764 799999999999999998753
No 42
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.34 E-value=0.091 Score=30.60 Aligned_cols=28 Identities=25% Similarity=0.449 Sum_probs=20.6
Q ss_pred CCCCCCeeeeCC-CCceeeCCCCCccCCC
Q psy13158 231 SCGYNAQCKVIN-HTPTCTCPDGFIGDAF 258 (289)
Q Consensus 231 ~C~~~~~C~~~~-g~~~C~C~~G~~g~~~ 258 (289)
.|..++.|++.. |++.|.|..||..++.
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred cCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence 577789999876 9999999999987654
No 43
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=86.32 E-value=1.9 Score=27.12 Aligned_cols=23 Identities=26% Similarity=0.690 Sum_probs=16.8
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCccCC
Q psy13158 231 SCGYNAQCKVINHTPTCTCPDGFIGDA 257 (289)
Q Consensus 231 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 257 (289)
.|..++.|++. +|.|++||+-.+
T Consensus 27 qC~~~s~C~~g----~C~C~~g~~~~~ 49 (52)
T PF01683_consen 27 QCIGGSVCVNG----RCQCPPGYVEVG 49 (52)
T ss_pred CCCCcCEEcCC----EeECCCCCEecC
Confidence 45567788654 799999987654
No 44
>smart00051 DSL delta serrate ligand.
Probab=83.79 E-value=2.1 Score=28.37 Aligned_cols=45 Identities=24% Similarity=0.535 Sum_probs=28.3
Q ss_pred CeeeCCCCcccCCCCCccCCCCCCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 188 PSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 188 ~~C~C~~G~~g~~~~c~~~C~~~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
++-.|.++|.|. .|...|...+. ...+..|.. .| .+.|.+||+|.
T Consensus 17 ~rv~C~~~~yG~--~C~~~C~~~~d-------------------~~~~~~Cd~-~G--~~~C~~Gw~G~ 61 (63)
T smart00051 17 IRVTCDENYYGE--GCNKFCRPRDD-------------------FFGHYTCDE-NG--NKGCLEGWMGP 61 (63)
T ss_pred EEeeCCCCCcCC--ccCCEeCcCcc-------------------ccCCccCCc-CC--CEecCCCCcCC
Confidence 455788999988 66544443222 233445633 33 47899999997
No 45
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=81.92 E-value=1.7 Score=36.59 Aligned_cols=33 Identities=24% Similarity=0.497 Sum_probs=25.7
Q ss_pred cccCCCCC--CCCCCCCeeeecCCCCeeeCCCCcccC
Q psy13158 165 EYVNPCVP--SPCGLYSQCRDIGGSPSCSCLPNYIGA 199 (289)
Q Consensus 165 ~~~~~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~g~ 199 (289)
.++++|.. ..|. ..|.++.|+|.|.|.+||.+.
T Consensus 185 ~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 185 VVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 45677763 3454 589999999999999999875
No 46
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=79.95 E-value=1.8 Score=31.96 Aligned_cols=33 Identities=36% Similarity=0.862 Sum_probs=25.6
Q ss_pred CcCCCC-CCCCCCCCeeeeCCCCceeeCCCCccCC
Q psy13158 11 ATLECY-PSPCGPYSECRNINGGPSCSCRPGYIGS 44 (289)
Q Consensus 11 did~C~-~~~C~~~~~C~n~~gs~~C~C~~Gf~g~ 44 (289)
..|+|. ...|+.+|.|.. ..+..|.|.+||...
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 456888 678999999954 455679999999753
No 47
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=76.12 E-value=3.1 Score=31.41 Aligned_cols=30 Identities=27% Similarity=0.618 Sum_probs=23.0
Q ss_pred CCCCCCCeeee--CCCCceeeCCCCCccCCCCCCCC
Q psy13158 230 GSCGYNAQCKV--INHTPTCTCPDGFIGDAFLSCHP 263 (289)
Q Consensus 230 ~~C~~~~~C~~--~~g~~~C~C~~G~~g~~~~~C~~ 263 (289)
+-|.++ +|.. ....+.|.|..||+|. +|+.
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGe---RCEh 82 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGI---RCQH 82 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccc---cccc
Confidence 457764 8865 3477899999999999 6763
No 48
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=72.66 E-value=4.2 Score=30.76 Aligned_cols=36 Identities=33% Similarity=0.777 Sum_probs=26.1
Q ss_pred CcCCCCC---CCCCCCCeeeeCC--CCceeeCCCCccCCCCCCc
Q psy13158 11 ATLECYP---SPCGPYSECRNIN--GGPSCSCRPGYIGSPPNCR 49 (289)
Q Consensus 11 did~C~~---~~C~~~~~C~n~~--gs~~C~C~~Gf~g~~~~C~ 49 (289)
++.+|.+ +=|.+ |+|..-+ ..+.|.|..||+|. .|+
T Consensus 41 ~i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe--RCE 81 (139)
T PHA03099 41 AIRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI--RCQ 81 (139)
T ss_pred ccccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc--ccc
Confidence 3456652 33765 4888765 68999999999999 665
No 49
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=72.12 E-value=4.3 Score=23.11 Aligned_cols=14 Identities=43% Similarity=0.959 Sum_probs=11.5
Q ss_pred ceeeCCCCCccCCC
Q psy13158 245 PTCTCPDGFIGDAF 258 (289)
Q Consensus 245 ~~C~C~~G~~g~~~ 258 (289)
+.|.|++||..+..
T Consensus 18 ~~C~CPeGyIlde~ 31 (34)
T PF09064_consen 18 GQCFCPEGYILDEG 31 (34)
T ss_pred CceeCCCceEecCC
Confidence 37999999998754
No 50
>PHA02887 EGF-like protein; Provisional
Probab=70.00 E-value=4.8 Score=29.93 Aligned_cols=27 Identities=41% Similarity=0.880 Sum_probs=21.7
Q ss_pred CCCCCeeeeCC--CCceeeCCCCccCCCCCCc
Q psy13158 20 CGPYSECRNIN--GGPSCSCRPGYIGSPPNCR 49 (289)
Q Consensus 20 C~~~~~C~n~~--gs~~C~C~~Gf~g~~~~C~ 49 (289)
|. ||+|.... ..+.|.|.+||+|. .|+
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~--RCE 122 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGI--RCD 122 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccC--CCC
Confidence 66 57898765 56899999999998 665
No 51
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=65.35 E-value=8 Score=28.45 Aligned_cols=26 Identities=42% Similarity=0.999 Sum_probs=20.2
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13158 230 GSCGYNAQCKVINHTPTCTCPDGFIGD 256 (289)
Q Consensus 230 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 256 (289)
..|+.++.|.. .....|.|.+||...
T Consensus 84 ~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeCC-CCCCceECCCCcCCC
Confidence 47888999953 456679999999754
No 52
>KOG3516|consensus
Probab=65.32 E-value=4.7 Score=41.55 Aligned_cols=37 Identities=30% Similarity=0.829 Sum_probs=33.1
Q ss_pred CcCCCCCCCCCCCCeeeeCCCCceeeCC-CCccCCCCCCc
Q psy13158 11 ATLECYPSPCGPYSECRNINGGPSCSCR-PGYIGSPPNCR 49 (289)
Q Consensus 11 did~C~~~~C~~~~~C~n~~gs~~C~C~-~Gf~g~~~~C~ 49 (289)
-+|.|.+++|.++|.|..+-..|.|.|. .||+|. .|.
T Consensus 544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH 581 (1306)
T KOG3516|consen 544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH 581 (1306)
T ss_pred cccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence 4688889999999999998889999999 999998 766
No 53
>KOG1836|consensus
Probab=65.03 E-value=45 Score=36.68 Aligned_cols=53 Identities=30% Similarity=0.673 Sum_probs=31.7
Q ss_pred cee-ecCCCCccCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCCCCeeeec--CCCCeee-CCCCcccCCCCCc
Q psy13158 132 PIC-SCSQGYIGDGFSGCYPKPPEVPPPPQQDVQEYVNPCVPSPCGLYSQCRDI--GGSPSCS-CLPNYIGAPPNCR 204 (289)
Q Consensus 132 ~~C-~C~~G~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~--~g~~~C~-C~~G~~g~~~~c~ 204 (289)
-+| +|..||.+.+... ... .|.+-+|..+..|..+ ...+.|. |++||+|. .|+
T Consensus 756 ~~C~~C~~GfYg~~~~~-----------------~~~-dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~--rCe 812 (1705)
T KOG1836|consen 756 GQCAQCVDGFYGLPDLG-----------------TSG-DCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL--RCE 812 (1705)
T ss_pred CchhhhcCCCCCccccC-----------------CCC-CCccCCCCCChhhcCcCcccceecCCCCCCCccc--ccc
Confidence 344 6888888765321 111 2554456665555443 3456787 99999988 554
No 54
>PHA02887 EGF-like protein; Provisional
Probab=61.40 E-value=8.7 Score=28.59 Aligned_cols=30 Identities=27% Similarity=0.623 Sum_probs=22.4
Q ss_pred CCCCCCCeeee--CCCCceeeCCCCCccCCCCCCCC
Q psy13158 230 GSCGYNAQCKV--INHTPTCTCPDGFIGDAFLSCHP 263 (289)
Q Consensus 230 ~~C~~~~~C~~--~~g~~~C~C~~G~~g~~~~~C~~ 263 (289)
+-|. +|+|.. ....+.|.|+.||+|. +|+.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGI---RCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccC---CCCc
Confidence 4576 468864 3456899999999999 6763
No 55
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=44.52 E-value=22 Score=21.78 Aligned_cols=22 Identities=36% Similarity=1.072 Sum_probs=16.5
Q ss_pred CeeeeCCCCceeeCCCCccCCCCCCc
Q psy13158 24 SECRNINGGPSCSCRPGYIGSPPNCR 49 (289)
Q Consensus 24 ~~C~n~~gs~~C~C~~Gf~g~~~~C~ 49 (289)
.+|....| .|.|+++|+|. .|+
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~--~C~ 32 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGP--RCD 32 (49)
T ss_dssp SSEEETCE--EESBSTTEEST--TS-
T ss_pred CcccCCCC--EEeccccccCC--cCc
Confidence 36776444 79999999999 775
No 56
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=43.74 E-value=32 Score=21.23 Aligned_cols=17 Identities=24% Similarity=0.600 Sum_probs=12.7
Q ss_pred eeeCCCCCccCCCCCCC
Q psy13158 246 TCTCPDGFIGDAFLSCH 262 (289)
Q Consensus 246 ~C~C~~G~~g~~~~~C~ 262 (289)
+|.|+++|+|.....|.
T Consensus 20 ~C~C~~~~~G~~C~~C~ 36 (50)
T cd00055 20 QCECKPNTTGRRCDRCA 36 (50)
T ss_pred EEeCCCcCCCCCCCCCC
Confidence 68999999999544443
No 57
>KOG3512|consensus
Probab=43.16 E-value=1.7e+02 Score=27.61 Aligned_cols=27 Identities=19% Similarity=0.553 Sum_probs=18.0
Q ss_pred CeeeecCC-CceeecCCCCccCCCCCCC
Q psy13158 123 AECKVINH-NPICSCSQGYIGDGFSGCY 149 (289)
Q Consensus 123 ~~C~~~~g-~~~C~C~~G~~~~~~~~C~ 149 (289)
..|+-... +++|.|...-.|....+|.
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCK 312 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCK 312 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCccccc
Confidence 45665554 4899998887777654443
No 58
>KOG3516|consensus
Probab=38.56 E-value=27 Score=36.34 Aligned_cols=42 Identities=26% Similarity=0.761 Sum_probs=32.7
Q ss_pred CCCCCCCCccccCcccCCCCCCCCCCCeeeeCCCCceeeCC-CCCccCCCCCCC
Q psy13158 210 NSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPTCTCP-DGFIGDAFLSCH 262 (289)
Q Consensus 210 ~~~C~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~-~G~~g~~~~~C~ 262 (289)
++.|...+.|. ++.|.+++.|......|.|.|. .||.|. +|.
T Consensus 539 id~C~i~drCl--------PN~CehgG~C~Qs~~~f~C~C~~TGY~Ga---tCH 581 (1306)
T KOG3516|consen 539 IDMCGISDRCL--------PNPCEHGGKCSQSWDDFECNCELTGYKGA---TCH 581 (1306)
T ss_pred ecccccccccC--------CccccCCCcccccccceeEeccccccccc---ccc
Confidence 35555554443 4689999999998889999999 899998 555
No 59
>KOG3607|consensus
Probab=33.10 E-value=48 Score=33.14 Aligned_cols=46 Identities=24% Similarity=0.696 Sum_probs=36.2
Q ss_pred CCCCCCCCCccccCccc-------CCCCCCCCCCCeeeeCCCCceeeCCCCCccCC
Q psy13158 209 QNSECPNDKACIREKCA-------DPCPGSCGYNAQCKVINHTPTCTCPDGFIGDA 257 (289)
Q Consensus 209 ~~~~C~~~~~c~~~~c~-------~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 257 (289)
+.+.|..+..|++.+|. ..|+..|+.+++|.|. +.|+|.+||.+..
T Consensus 602 dGt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~---~~ChC~~gwapp~ 654 (716)
T KOG3607|consen 602 DGTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNE---LNCHCEPGWAPPF 654 (716)
T ss_pred CCCccCCCceecCCcchhhhhhcccccccccCCCcccCCC---cceeeCCCCCCCc
Confidence 44678888888888883 3456679988999876 5799999999884
No 60
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=29.69 E-value=28 Score=25.50 Aligned_cols=31 Identities=19% Similarity=0.528 Sum_probs=22.8
Q ss_pred CCCC--CCCCCCCCeeeeCC-----CCceeeCCCCccC
Q psy13158 13 LECY--PSPCGPYSECRNIN-----GGPSCSCRPGYIG 43 (289)
Q Consensus 13 d~C~--~~~C~~~~~C~n~~-----gs~~C~C~~Gf~g 43 (289)
+.|. .+.|..||.|++.. .=|.|.|.+.+..
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 4455 46688999999873 3388999997653
No 61
>KOG1836|consensus
Probab=24.63 E-value=2.3e+02 Score=31.64 Aligned_cols=28 Identities=25% Similarity=0.638 Sum_probs=17.0
Q ss_pred CCCCCCeeeeC--CCCceee-CCCCCccCCC
Q psy13158 231 SCGYNAQCKVI--NHTPTCT-CPDGFIGDAF 258 (289)
Q Consensus 231 ~C~~~~~C~~~--~g~~~C~-C~~G~~g~~~ 258 (289)
.|..++.|..+ .....|. |++||+|..+
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC 811 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRC 811 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCccccc
Confidence 34444555433 3445787 8899988843
No 62
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=22.43 E-value=80 Score=19.08 Aligned_cols=16 Identities=25% Similarity=0.702 Sum_probs=12.0
Q ss_pred eeeCCCCCccCCCCCC
Q psy13158 246 TCTCPDGFIGDAFLSC 261 (289)
Q Consensus 246 ~C~C~~G~~g~~~~~C 261 (289)
.|.|+++|+|.....|
T Consensus 19 ~C~C~~~~~G~~C~~C 34 (46)
T smart00180 19 QCECKPNVTGRRCDRC 34 (46)
T ss_pred EEECCCCCCCCCCCcC
Confidence 7899999999844333
Done!