Query psy13147
Match_columns 180
No_of_seqs 166 out of 1357
Neff 8.8
Searched_HMMs 46136
Date Fri Aug 16 20:46:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13147.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13147hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.7 4.3E-16 9.4E-21 134.7 9.4 148 2-159 676-865 (1289)
2 KOG1219|consensus 99.5 5.1E-14 1.1E-18 131.0 8.2 109 15-153 3865-3975(4289)
3 KOG1214|consensus 99.1 5.5E-10 1.2E-14 97.6 10.3 142 10-157 730-914 (1289)
4 KOG1219|consensus 99.1 1.9E-10 4.1E-15 108.1 7.3 92 75-176 3865-3959(4289)
5 KOG4289|consensus 98.9 1E-09 2.3E-14 100.1 5.3 108 12-148 1177-1308(2531)
6 KOG4289|consensus 98.6 3.6E-08 7.7E-13 90.4 4.6 78 88-175 1216-1295(2531)
7 KOG1217|consensus 98.5 9.9E-07 2.2E-11 74.2 10.9 150 11-173 166-330 (487)
8 KOG4260|consensus 98.4 2.3E-07 4.9E-12 72.2 3.4 114 19-151 149-269 (350)
9 KOG1217|consensus 98.4 2.4E-06 5.3E-11 71.9 9.0 118 11-152 268-389 (487)
10 PF07645 EGF_CA: Calcium-bindi 98.3 2.2E-07 4.7E-12 52.9 1.3 33 13-47 1-35 (42)
11 PF07645 EGF_CA: Calcium-bindi 98.3 7.1E-07 1.5E-11 50.7 2.5 32 120-151 2-35 (42)
12 KOG1225|consensus 98.1 1.8E-05 3.9E-10 67.8 8.7 11 37-48 266-276 (525)
13 PF12947 EGF_3: EGF domain; I 98.1 2.1E-06 4.6E-11 47.0 1.8 29 126-154 6-34 (36)
14 PF12947 EGF_3: EGF domain; I 98.1 1.3E-06 2.8E-11 47.9 0.8 32 19-52 5-36 (36)
15 PF00008 EGF: EGF-like domain 98.0 1.7E-06 3.7E-11 46.2 1.0 30 17-48 1-31 (32)
16 PF00008 EGF: EGF-like domain 98.0 3.6E-06 7.9E-11 44.9 2.1 30 123-152 1-31 (32)
17 smart00179 EGF_CA Calcium-bind 98.0 9.7E-06 2.1E-10 44.7 3.7 32 13-46 1-33 (39)
18 smart00179 EGF_CA Calcium-bind 97.8 3.2E-05 6.9E-10 42.6 3.7 31 120-150 2-33 (39)
19 KOG4260|consensus 97.6 3.3E-05 7.1E-10 60.4 1.9 71 10-104 232-305 (350)
20 cd00054 EGF_CA Calcium-binding 97.6 9.1E-05 2E-09 40.2 3.2 34 13-48 1-35 (38)
21 cd00054 EGF_CA Calcium-binding 97.4 0.00027 5.8E-09 38.3 3.5 32 121-152 3-35 (38)
22 cd00053 EGF Epidermal growth f 97.4 0.00033 7.2E-09 37.3 3.6 28 125-152 5-32 (36)
23 KOG1225|consensus 97.2 0.002 4.3E-08 55.5 8.6 105 37-176 235-354 (525)
24 cd00053 EGF Epidermal growth f 97.2 0.00049 1.1E-08 36.6 3.1 30 17-48 2-32 (36)
25 PF06247 Plasmod_Pvs28: Plasmo 97.2 5.4E-05 1.2E-09 56.1 -1.2 127 20-176 6-147 (197)
26 smart00181 EGF Epidermal growt 97.1 0.00086 1.9E-08 35.9 3.3 26 126-152 6-31 (35)
27 smart00181 EGF Epidermal growt 96.9 0.0011 2.4E-08 35.5 2.9 30 16-48 1-31 (35)
28 PF12662 cEGF: Complement Clr- 96.6 0.0025 5.4E-08 31.5 2.3 20 93-112 1-21 (24)
29 PF06247 Plasmod_Pvs28: Plasmo 96.5 0.00097 2.1E-08 49.6 0.7 106 20-153 50-163 (197)
30 PF12662 cEGF: Complement Clr- 96.0 0.0071 1.5E-07 29.8 2.0 13 140-152 1-13 (24)
31 PF14670 FXa_inhibition: Coagu 95.7 0.0097 2.1E-07 32.4 2.0 22 132-153 10-31 (36)
32 PF12946 EGF_MSP1_1: MSP1 EGF 94.8 0.021 4.5E-07 31.2 1.6 30 125-154 4-34 (37)
33 KOG1226|consensus 94.5 0.52 1.1E-05 42.4 10.3 116 37-178 479-607 (783)
34 PF14670 FXa_inhibition: Coagu 94.5 0.032 7E-07 30.3 1.9 22 85-106 10-31 (36)
35 PF12946 EGF_MSP1_1: MSP1 EGF 94.1 0.014 3.1E-07 31.8 0.1 34 17-51 2-35 (37)
36 KOG0994|consensus 93.8 0.37 8E-06 45.3 8.3 60 86-154 877-947 (1758)
37 PF07974 EGF_2: EGF-like domai 93.2 0.11 2.5E-06 27.4 2.6 25 126-152 6-30 (32)
38 PF07974 EGF_2: EGF-like domai 93.2 0.13 2.8E-06 27.2 2.8 24 80-105 7-30 (32)
39 PF12661 hEGF: Human growth fa 93.0 0.047 1E-06 22.8 0.7 11 95-105 1-11 (13)
40 cd01475 vWA_Matrilin VWA_Matri 92.7 0.14 2.9E-06 39.5 3.4 32 120-153 187-220 (224)
41 PF00954 S_locus_glycop: S-loc 88.4 0.46 1E-05 32.3 2.6 32 13-47 76-108 (110)
42 PF00954 S_locus_glycop: S-loc 84.7 1.1 2.4E-05 30.5 2.9 32 120-152 77-109 (110)
43 cd01475 vWA_Matrilin VWA_Matri 79.1 2.1 4.5E-05 32.9 2.9 21 85-105 199-219 (224)
44 PHA02887 EGF-like protein; Pro 76.5 3 6.5E-05 28.8 2.7 29 80-112 93-123 (126)
45 KOG1226|consensus 76.4 17 0.00037 33.1 8.0 60 80-154 556-619 (783)
46 smart00051 DSL delta serrate l 74.2 5.4 0.00012 24.4 3.2 45 93-152 16-61 (63)
47 PHA03099 epidermal growth fact 74.2 3.5 7.6E-05 29.0 2.6 29 80-112 52-82 (139)
48 PHA02887 EGF-like protein; Pro 73.8 3.5 7.6E-05 28.5 2.5 32 121-153 84-120 (126)
49 PF01683 EB: EB module; Inter 71.0 4.9 0.00011 23.2 2.4 27 17-49 22-49 (52)
50 KOG0994|consensus 69.7 14 0.00031 35.4 6.1 61 85-154 830-899 (1758)
51 KOG3516|consensus 63.5 5.3 0.00012 38.0 2.2 35 12-48 543-578 (1306)
52 PHA03099 epidermal growth fact 63.4 8.5 0.00018 27.1 2.7 27 126-153 51-79 (139)
53 PF09064 Tme5_EGF_like: Thromb 51.9 13 0.00028 19.8 1.6 15 92-106 16-30 (34)
54 KOG3516|consensus 46.9 16 0.00035 34.9 2.5 34 120-153 545-579 (1306)
55 KOG3514|consensus 38.3 32 0.00068 33.0 3.0 31 16-48 625-656 (1591)
56 PF04706 Dickkopf_N: Dickkopf 33.9 83 0.0018 18.4 3.3 18 56-73 33-50 (52)
57 PF00053 Laminin_EGF: Laminin 28.3 60 0.0013 18.2 2.1 20 26-49 11-30 (49)
58 cd00185 TNFR Tumor necrosis fa 26.7 1.2E+02 0.0026 19.9 3.7 14 92-105 73-86 (98)
59 cd00055 EGF_Lam Laminin-type e 25.7 1E+02 0.0022 17.4 2.8 13 142-154 20-32 (50)
60 KOG3607|consensus 23.1 92 0.002 28.6 3.3 49 58-112 602-657 (716)
61 PF01414 DSL: Delta serrate li 22.2 43 0.00092 20.4 0.7 45 93-152 16-61 (63)
No 1
>KOG1214|consensus
Probab=99.65 E-value=4.3e-16 Score=134.73 Aligned_cols=148 Identities=27% Similarity=0.624 Sum_probs=116.1
Q ss_pred CccCCCCCCC----CCCCCCC--CCCCCCCCeeeecCCCeeecCCCCCCCcCCCCCCC--CCCCCC-CCCCCCCcccC--
Q psy13147 2 FTAYLPPYPS----NDSLACK--PNPCDPYSSCSVYSEHVAMCDPCSGPQAPWLPHCR--PECLCN-SDCPFNMACLG-- 70 (180)
Q Consensus 2 ~~~~~~~~~~----~d~~~C~--~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~C~--~~C~~~-~~C~~~~~C~~-- 70 (180)
|+..++|+.. ..+++|. ++-|+.++.|....+..|+|. |..||.|++..|. +||... ..|..+..|++
T Consensus 676 ~Sn~igpV~E~S~~~~~npCy~gsh~cdt~a~C~pg~~~~~tce-cs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~p 754 (1289)
T KOG1214|consen 676 VSNQIGPVKEDSDPTPVNPCYDGSHMCDTTARCHPGTGVDYTCE-CSSGYQGDGRNCVDENECATGFHRCGPNSVCINLP 754 (1289)
T ss_pred hhhcccceecCCCCcccccceecCcccCCCccccCCCCcceEEE-EeeccCCCCCCCCChhhhccCCCCCCCCceeecCC
Confidence 3445566532 4577786 788999999998877669999 9999999999886 577654 45888888865
Q ss_pred --C---------------ccc--------CCCc---cCCCCCCeEe--ecC-CCceeeCCCCCccCCCCCccCCCCCCCC
Q psy13147 71 --Q---------------KCR--------DPCQ---GTCGVNALCT--VVH-HTPACYCPQGTIGNPYEHCATPLAPVPP 119 (180)
Q Consensus 71 --~---------------~C~--------~~C~---~~C~~~~~C~--~~~-g~~~C~C~~g~~g~~~~~C~~~~~~~~~ 119 (180)
+ +|+ +.|. ..|..++.++ ... +.|+|.|.+||.|++. .|.+
T Consensus 755 g~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~-~c~d------- 826 (1289)
T KOG1214|consen 755 GSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH-QCTD------- 826 (1289)
T ss_pred CceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc-cccc-------
Confidence 2 232 4563 3577666554 333 6799999999999998 8887
Q ss_pred CCCCCCCCCCCCCeeeecCCceeeecCCCCccCCcCCCcc
Q psy13147 120 PNPCDHVYCGSNAVCKHTNGIVTCECLPTYYGNGALGCRP 159 (180)
Q Consensus 120 ~d~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~d 159 (180)
+|+|.++.|+..|+|.+++|+|.|+|.+||.|+++ .|++
T Consensus 827 vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf-~CVP 865 (1289)
T KOG1214|consen 827 VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF-QCVP 865 (1289)
T ss_pred ccccCccccCCCceEecCCCcceeecccCccCCCc-eecC
Confidence 49999999999999999999999999999999998 5754
No 2
>KOG1219|consensus
Probab=99.50 E-value=5.1e-14 Score=131.00 Aligned_cols=109 Identities=25% Similarity=0.660 Sum_probs=95.0
Q ss_pred CCCCCCCCCCCCeeeecCCCeeecCCCCCCCcCCCCCCCCCCCCCCCCCCCCcccCCcccCCC-ccCCCCCCeEeecCCC
Q psy13147 15 LACKPNPCDPYSSCSVYSEHVAMCDPCSGPQAPWLPHCRPECLCNSDCPFNMACLGQKCRDPC-QGTCGVNALCTVVHHT 93 (180)
Q Consensus 15 ~~C~~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C-~~~C~~~~~C~~~~g~ 93 (180)
++|..+||+++|+|...+++.|+|. |++.|+|.. | ... +++| .++|..+++|+...++
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~Ck-CpsqysG~~--C----------Ei~--------~epC~snPC~~GgtCip~~n~ 3923 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCK-CPSQYSGNH--C----------EID--------LEPCASNPCLTGGTCIPFYNG 3923 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEe-CcccccCcc--c----------ccc--------cccccCCCCCCCCEEEecCCC
Confidence 8999999999999999987769999 999999852 2 111 2556 3589999999999999
Q ss_pred ceeeCCCCCccCCCCCccCCCCCCCC-CCCCCCCCCCCCCeeeecCCceeeecCCCCccCC
Q psy13147 94 PACYCPQGTIGNPYEHCATPLAPVPP-PNPCDHVYCGSNAVCKHTNGIVTCECLPTYYGNG 153 (180)
Q Consensus 94 ~~C~C~~g~~g~~~~~C~~~~~~~~~-~d~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 153 (180)
|.|.|+.||+|. .|+. . +++|..++|+.++.|+|.+|+|.|.|-+||.|..
T Consensus 3924 f~CnC~~gyTG~---~Ce~------~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3924 FLCNCPNGYTGK---RCEA------RGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred eeEeCCCCccCc---eeec------ccccccccccccCCceeeccCCceEeccChhHhccc
Confidence 999999999999 8975 3 5899989999999999999999999999999864
No 3
>KOG1214|consensus
Probab=99.11 E-value=5.5e-10 Score=97.59 Aligned_cols=142 Identities=20% Similarity=0.397 Sum_probs=99.4
Q ss_pred CCCCCCCCC--CCCCCCCCeeeecCCCeeecCCCCCCCc--CCCCCCC--------CCCCCC-CCCCCCCccc--C----
Q psy13147 10 PSNDSLACK--PNPCDPYSSCSVYSEHVAMCDPCSGPQA--PWLPHCR--------PECLCN-SDCPFNMACL--G---- 70 (180)
Q Consensus 10 ~~~d~~~C~--~~~C~~~~~C~~~~~~~~~C~~C~~G~~--g~~~~C~--------~~C~~~-~~C~~~~~C~--~---- 70 (180)
.+.|+++|+ ...|+++++|++.+++ |+|. |..||. +++.+|+ +.|.+. .+|.....++ .
T Consensus 730 ~c~d~~eca~~~~~CGp~s~Cin~pg~-~rce-C~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs 807 (1289)
T KOG1214|consen 730 NCVDENECATGFHRCGPNSVCINLPGS-YRCE-CRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGS 807 (1289)
T ss_pred CCCChhhhccCCCCCCCCceeecCCCc-eeEE-EeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCc
Confidence 456888998 5789999999999999 9998 888875 5556664 245444 3454433331 0
Q ss_pred -Cc-------------c--cCCC-ccCCCCCCeEeecCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCC-----CCCC
Q psy13147 71 -QK-------------C--RDPC-QGTCGVNALCTVVHHTPACYCPQGTIGNPYEHCATPLAPVPPPNPCD-----HVYC 128 (180)
Q Consensus 71 -~~-------------C--~~~C-~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~-----~~~C 128 (180)
++ | +|+| .+.|.+++.|.++++++.|+|.+||.|++. .|+...++ ...|. +..|
T Consensus 808 ~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf-~CVP~~~~---~T~C~~er~hpl~c 883 (1289)
T KOG1214|consen 808 TYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF-QCVPDTSS---LTPCEQERFHPLQC 883 (1289)
T ss_pred eEEEeecCCccCCccccccccccCccccCCCceEecCCCcceeecccCccCCCc-eecCCCcc---CCccccccccceee
Confidence 12 2 3889 468999999999999999999999999988 78652111 13453 2336
Q ss_pred CCCCeee--ecCCceeeecCCCCccCCcCCC
Q psy13147 129 GSNAVCK--HTNGIVTCECLPTYYGNGALGC 157 (180)
Q Consensus 129 ~~~~~C~--~~~g~~~C~C~~G~~g~~~~~C 157 (180)
+.++.|. ..+.+|.+.|.++-.|++...|
T Consensus 884 hg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c 914 (1289)
T KOG1214|consen 884 HGSTGFCWCVDPDGHEVPGTQTPPGSTPPHC 914 (1289)
T ss_pred ccccceeEeeCCCcccCCCCCCCCCCCCCCC
Confidence 6555444 4577789988888777765445
No 4
>KOG1219|consensus
Probab=99.10 E-value=1.9e-10 Score=108.11 Aligned_cols=92 Identities=28% Similarity=0.612 Sum_probs=81.0
Q ss_pred CCCc-cCCCCCCeEeecC-CCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 75 DPCQ-GTCGVNALCTVVH-HTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 75 ~~C~-~~C~~~~~C~~~~-g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
+.|. .+|+.++.|..++ ++|.|.|++-|.|. +|+. .+.+|.++||..++.|+...++|.|.|+.||+|.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CEi------~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCEI------DLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCc---cccc------ccccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence 6674 5899999999887 78999999999999 9986 4689999999999999999999999999999998
Q ss_pred CcCCC-ccCCcCCCCCCCCCcccCC
Q psy13147 153 GALGC-RPECVLNTDCPNSAACINN 176 (180)
Q Consensus 153 ~~~~C-~deC~~~~~C~~~~~C~n~ 176 (180)
....- ++||+. ++|..++.|+|+
T Consensus 3936 ~Ce~~Gi~eCs~-n~C~~gg~C~n~ 3959 (4289)
T KOG1219|consen 3936 RCEARGISECSK-NVCGTGGQCINI 3959 (4289)
T ss_pred eeeccccccccc-ccccCCceeecc
Confidence 54333 679998 699999999996
No 5
>KOG4289|consensus
Probab=98.93 E-value=1e-09 Score=100.05 Aligned_cols=108 Identities=27% Similarity=0.595 Sum_probs=84.4
Q ss_pred CCCCCCCCCCCCCCCeee----------------------ecCCCeeecCCCCCCCcCCCCCCCCCCCCCCCCCCCCccc
Q psy13147 12 NDSLACKPNPCDPYSSCS----------------------VYSEHVAMCDPCSGPQAPWLPHCRPECLCNSDCPFNMACL 69 (180)
Q Consensus 12 ~d~~~C~~~~C~~~~~C~----------------------~~~~~~~~C~~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~ 69 (180)
.|.+.|...||.+..+|+ +..++ +.|. |++||+|+- |+ ..
T Consensus 1177 fdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvng-lrCr-CPpGFTgd~--Ce----------Te---- 1238 (2531)
T KOG4289|consen 1177 FDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNG-LRCR-CPPGFTGDY--CE----------TE---- 1238 (2531)
T ss_pred ccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCc-eeEe-CCCCCCccc--cc----------ch----
Confidence 466778888898888886 22344 8999 999999963 21 00
Q ss_pred CCcccCCC-ccCCCCCCeEeecCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeeec-CCceeeecCC
Q psy13147 70 GQKCRDPC-QGTCGVNALCTVVHHTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKHT-NGIVTCECLP 147 (180)
Q Consensus 70 ~~~C~~~C-~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~~-~g~~~C~C~~ 147 (180)
+|+| .++|+++++|...+|+|.|.|.+||+|. +|+-. . ....|.+..|.+++.|++. .|+|.|.|+.
T Consensus 1239 ----iDlCYs~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs---~-~agrCvpGvC~nggtC~~~~nggf~c~Cp~ 1307 (2531)
T KOG4289|consen 1239 ----IDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVS---A-RAGRCVPGVCKNGGTCVNLLNGGFCCHCPY 1307 (2531)
T ss_pred ----hHhhhcCCCCCCCceEEecCceeEEecCCcccc---ceeee---c-ccCccccceecCCCEEeecCCCceeccCCC
Confidence 2556 5689999999999999999999999999 88630 0 1245778899999999976 6899999999
Q ss_pred C
Q psy13147 148 T 148 (180)
Q Consensus 148 G 148 (180)
|
T Consensus 1308 g 1308 (2531)
T KOG4289|consen 1308 G 1308 (2531)
T ss_pred c
Confidence 8
No 6
>KOG4289|consensus
Probab=98.62 E-value=3.6e-08 Score=90.44 Aligned_cols=78 Identities=32% Similarity=0.677 Sum_probs=61.7
Q ss_pred eecCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeeecCCceeeecCCCCccCCcCCCc--cCCcCCC
Q psy13147 88 TVVHHTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKHTNGIVTCECLPTYYGNGALGCR--PECVLNT 165 (180)
Q Consensus 88 ~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~--deC~~~~ 165 (180)
++..++++|.|++||+|+ .|+. .+|.|-+.||+++++|....|+|+|.|.+||+|+..+... --|.- +
T Consensus 1216 i~pvnglrCrCPpGFTgd---~CeT------eiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvp-G 1285 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGD---YCET------EIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVP-G 1285 (2531)
T ss_pred ccccCceeEeCCCCCCcc---cccc------hhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcccc-c
Confidence 345588999999999999 8987 5699999999999999999999999999999998542111 13444 4
Q ss_pred CCCCCCcccC
Q psy13147 166 DCPNSAACIN 175 (180)
Q Consensus 166 ~C~~~~~C~n 175 (180)
.|.++++|+|
T Consensus 1286 vC~nggtC~~ 1295 (2531)
T KOG4289|consen 1286 VCKNGGTCVN 1295 (2531)
T ss_pred eecCCCEEee
Confidence 6666666665
No 7
>KOG1217|consensus
Probab=98.53 E-value=9.9e-07 Score=74.23 Aligned_cols=150 Identities=21% Similarity=0.564 Sum_probs=97.1
Q ss_pred CCCCCCCC--CCCCCCCCeeeecCCCeeecCCCCCCCcCCCCCCC---CCCCCCCCCCCCCcccCCcccCCCc---cCCC
Q psy13147 11 SNDSLACK--PNPCDPYSSCSVYSEHVAMCDPCSGPQAPWLPHCR---PECLCNSDCPFNMACLGQKCRDPCQ---GTCG 82 (180)
Q Consensus 11 ~~d~~~C~--~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~C~---~~C~~~~~C~~~~~C~~~~C~~~C~---~~C~ 82 (180)
..+.++|. ..+|.+.+.|.+..++ |.|. |++||.+...... ..|.....|..... +. .+.|. ..+.
T Consensus 166 ~~~~~~C~~~~~~c~~~~~C~~~~~~-~~C~-c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g---~~-~~~c~~~~~~~~ 239 (487)
T KOG1217|consen 166 ETDLDECIQYSSPCQNGGTCVNTGGS-YLCS-CPPGYTGSTCETTGNGGTCVDSVACSCPPG---AR-GPECEVSIVECA 239 (487)
T ss_pred cccccccccCCCCcCCCcccccCCCC-eeEe-CCCCccCCcCcCCCCCceEecceeccCCCC---CC-CCCccccccccc
Confidence 33446887 5569999999999988 9999 9999998642111 11111100000000 00 11221 1233
Q ss_pred CC-CeEeecCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCC-CCCCCCeeeecCCceeeecCCCCccCCcCCCc--
Q psy13147 83 VN-ALCTVVHHTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHV-YCGSNAVCKHTNGIVTCECLPTYYGNGALGCR-- 158 (180)
Q Consensus 83 ~~-~~C~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~-~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~-- 158 (180)
.+ ++|++..+.|.|.|++||.+.....+.+ +++|... +|..+++|++..+.|.|.|++||.|.....+.
T Consensus 240 ~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~-------~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~ 312 (487)
T KOG1217|consen 240 SGDGTCVNTVGSYTCRCPEGYTGDACVTCVD-------VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDV 312 (487)
T ss_pred CCCCcccccCCceeeeCCCCccccccceeee-------ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCcccccc
Confidence 33 8899999999999999999885114555 4888654 38889999999999999999999987641232
Q ss_pred cCCcC---CCCCCCCCcc
Q psy13147 159 PECVL---NTDCPNSAAC 173 (180)
Q Consensus 159 deC~~---~~~C~~~~~C 173 (180)
++|.. ...|..+.+|
T Consensus 313 ~~C~~~~~~~~c~~g~~C 330 (487)
T KOG1217|consen 313 DECSPRNAGGPCANGGTC 330 (487)
T ss_pred ccccccccCCcCCCCccc
Confidence 46652 2456665555
No 8
>KOG4260|consensus
Probab=98.40 E-value=2.3e-07 Score=72.22 Aligned_cols=114 Identities=24% Similarity=0.489 Sum_probs=70.8
Q ss_pred CCCCCCCCeeeec---CCCeeecCCCCCCCcCCC-CCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeEeecCCCc
Q psy13147 19 PNPCDPYSSCSVY---SEHVAMCDPCSGPQAPWL-PHCRPECLCNSDCPFNMACLGQKCRDPCQGTCGVNALCTVVHHTP 94 (180)
Q Consensus 19 ~~~C~~~~~C~~~---~~~~~~C~~C~~G~~g~~-~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~ 94 (180)
..||..+++|.-- .|+ -+|. |.+||+|.. ..|-++=-+...=..+.+| ..|...|. +.|... +..
T Consensus 149 er~C~GnG~C~GdGsR~Gs-GkCk-C~~GY~Gp~C~~Cg~eyfes~Rne~~lvC------t~Ch~~C~--~~Csg~-~~k 217 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGS-GKCK-CETGYTGPLCRYCGIEYFESSRNEQHLVC------TACHEGCL--GVCSGE-SSK 217 (350)
T ss_pred cCCcCCCCcccCCCCCCCC-Cccc-ccCCCCCccccccchHHHHhhcccccchh------hhhhhhhh--cccCCC-CCC
Confidence 3689889999732 234 6999 999999865 2332100000000001111 01111221 123222 333
Q ss_pred ee-eCCCCCccCCCCCccCCCCCCCCCCCCC--CCCCCCCCeeeecCCceeeecCCCCcc
Q psy13147 95 AC-YCPQGTIGNPYEHCATPLAPVPPPNPCD--HVYCGSNAVCKHTNGIVTCECLPTYYG 151 (180)
Q Consensus 95 ~C-~C~~g~~g~~~~~C~~~~~~~~~~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~g 151 (180)
.| .|+.||..+.. .|.| | |||. +.+|..+..|+|+.|+|.|...+||.+
T Consensus 218 ~C~kCkkGW~lde~-gCvD------v-nEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 218 GCSKCKKGWKLDEE-GCVD------V-NECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred Chhhhcccceeccc-cccc------H-HHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 44 58999988755 7998 4 9994 688999999999999999999999987
No 9
>KOG1217|consensus
Probab=98.36 E-value=2.4e-06 Score=71.88 Aligned_cols=118 Identities=22% Similarity=0.492 Sum_probs=84.3
Q ss_pred CCCCCCCCCC-CCCCCCeeeecCCCeeecCCCCCCCcCCCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeE--
Q psy13147 11 SNDSLACKPN-PCDPYSSCSVYSEHVAMCDPCSGPQAPWLPHCRPECLCNSDCPFNMACLGQKCRDPCQGTCGVNALC-- 87 (180)
Q Consensus 11 ~~d~~~C~~~-~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C-- 87 (180)
.+++++|... +|.++++|++..+. |.|. |++||.|... .++.....|.... ....|..++.|
T Consensus 268 ~~~~~~C~~~~~c~~~~~C~~~~~~-~~C~-C~~g~~g~~~---~~~~~~~~C~~~~----------~~~~c~~g~~C~~ 332 (487)
T KOG1217|consen 268 CVDVDSCALIASCPNGGTCVNVPGS-YRCT-CPPGFTGRLC---TECVDVDECSPRN----------AGGPCANGGTCNT 332 (487)
T ss_pred eeeccccCCCCccCCCCeeecCCCc-ceee-CCCCCCCCCC---ccccccccccccc----------cCCcCCCCccccc
Confidence 4578999854 39989999999988 9999 9999998753 1122112221100 01246666666
Q ss_pred eecCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeee-cCCceeeecCCCCccC
Q psy13147 88 TVVHHTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKH-TNGIVTCECLPTYYGN 152 (180)
Q Consensus 88 ~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~-~~g~~~C~C~~G~~g~ 152 (180)
....+.+.|.|..+|.|. .|+. ..++|...++..++.|++ ..+.|.|.|+.+|.+.
T Consensus 333 ~~~~~~~~C~c~~~~~g~---~C~~------~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 333 LGSFGGFRCACGPGFTGR---RCED------SNDECASSPCCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred CCCCCCCCcCCCCCCCCC---cccc------CCccccCCccccCCEeccCCCCCeEecCCCccccC
Confidence 233357889999998777 8886 324887777888999999 7999999999999874
No 10
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.32 E-value=2.2e-07 Score=52.90 Aligned_cols=33 Identities=21% Similarity=0.480 Sum_probs=30.0
Q ss_pred CCCCCC--CCCCCCCCeeeecCCCeeecCCCCCCCcC
Q psy13147 13 DSLACK--PNPCDPYSSCSVYSEHVAMCDPCSGPQAP 47 (180)
Q Consensus 13 d~~~C~--~~~C~~~~~C~~~~~~~~~C~~C~~G~~g 47 (180)
|||||. .+.|..++.|+|+.|+ |.|. |++||..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gs-y~C~-C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGS-YSCS-CPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTE-EEEE-ESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCC-EEee-CCCCcEE
Confidence 689998 5689989999999999 9999 9999983
No 11
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.26 E-value=7.1e-07 Score=50.74 Aligned_cols=32 Identities=31% Similarity=0.707 Sum_probs=28.3
Q ss_pred CCCCC--CCCCCCCCeeeecCCceeeecCCCCcc
Q psy13147 120 PNPCD--HVYCGSNAVCKHTNGIVTCECLPTYYG 151 (180)
Q Consensus 120 ~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~g 151 (180)
+|||. .+.|..++.|+|+.|+|.|.|++||..
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 38995 367988999999999999999999984
No 12
>KOG1225|consensus
Probab=98.09 E-value=1.8e-05 Score=67.78 Aligned_cols=11 Identities=18% Similarity=0.181 Sum_probs=7.7
Q ss_pred ecCCCCCCCcCC
Q psy13147 37 MCDPCSGPQAPW 48 (180)
Q Consensus 37 ~C~~C~~G~~g~ 48 (180)
+|. |++||+|+
T Consensus 266 ~CI-C~~Gf~G~ 276 (525)
T KOG1225|consen 266 RCI-CPPGFTGD 276 (525)
T ss_pred eEe-CCCCCcCC
Confidence 677 77777765
No 13
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.07 E-value=2.1e-06 Score=47.01 Aligned_cols=29 Identities=45% Similarity=0.882 Sum_probs=24.3
Q ss_pred CCCCCCCeeeecCCceeeecCCCCccCCc
Q psy13147 126 VYCGSNAVCKHTNGIVTCECLPTYYGNGA 154 (180)
Q Consensus 126 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 154 (180)
..|+.+|.|+++.++|+|+|++||.|++.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 57899999999999999999999999986
No 14
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.06 E-value=1.3e-06 Score=47.88 Aligned_cols=32 Identities=19% Similarity=0.404 Sum_probs=25.3
Q ss_pred CCCCCCCCeeeecCCCeeecCCCCCCCcCCCCCC
Q psy13147 19 PNPCDPYSSCSVYSEHVAMCDPCSGPQAPWLPHC 52 (180)
Q Consensus 19 ~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~C 52 (180)
...|+.+|+|+++.++ |.|. |++||.|++..|
T Consensus 5 ~~~C~~nA~C~~~~~~-~~C~-C~~Gy~GdG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGS-YTCT-CKPGYEGDGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTS-EEEE-E-CEEECCSTCE
T ss_pred CCCCCCCcEeecCCCC-EEeE-CCCCCccCCcCC
Confidence 3579999999999998 9999 999999998543
No 15
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.04 E-value=1.7e-06 Score=46.20 Aligned_cols=30 Identities=23% Similarity=0.575 Sum_probs=26.5
Q ss_pred CCCCCCCCCCeeeecC-CCeeecCCCCCCCcCC
Q psy13147 17 CKPNPCDPYSSCSVYS-EHVAMCDPCSGPQAPW 48 (180)
Q Consensus 17 C~~~~C~~~~~C~~~~-~~~~~C~~C~~G~~g~ 48 (180)
|.++||.++|+|++.. +. |+|. |++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~-y~C~-C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGG-YTCE-CPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSE-EEEE-EBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCC-EEeE-CCCCCccC
Confidence 5677999999999998 55 9999 99999985
No 16
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.02 E-value=3.6e-06 Score=44.88 Aligned_cols=30 Identities=37% Similarity=0.822 Sum_probs=26.5
Q ss_pred CCCCCCCCCCeeeecC-CceeeecCCCCccC
Q psy13147 123 CDHVYCGSNAVCKHTN-GIVTCECLPTYYGN 152 (180)
Q Consensus 123 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~g~ 152 (180)
|.+++|.++++|++.. ++|+|.|++||.|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 3456899999999998 99999999999885
No 17
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.00 E-value=9.7e-06 Score=44.72 Aligned_cols=32 Identities=25% Similarity=0.558 Sum_probs=29.1
Q ss_pred CCCCCCC-CCCCCCCeeeecCCCeeecCCCCCCCc
Q psy13147 13 DSLACKP-NPCDPYSSCSVYSEHVAMCDPCSGPQA 46 (180)
Q Consensus 13 d~~~C~~-~~C~~~~~C~~~~~~~~~C~~C~~G~~ 46 (180)
|+++|.. .+|.++++|+++.++ |.|. |++||+
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~-~~C~-C~~g~~ 33 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGS-YRCE-CPPGYT 33 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCC-eEeE-CCCCCc
Confidence 5789986 799988999999998 9999 999998
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.82 E-value=3.2e-05 Score=42.58 Aligned_cols=31 Identities=32% Similarity=0.738 Sum_probs=27.9
Q ss_pred CCCCCC-CCCCCCCeeeecCCceeeecCCCCc
Q psy13147 120 PNPCDH-VYCGSNAVCKHTNGIVTCECLPTYY 150 (180)
Q Consensus 120 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~ 150 (180)
+++|.. .+|..++.|+++.++|.|.|+.||.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence 378876 7898889999999999999999998
No 19
>KOG4260|consensus
Probab=97.58 E-value=3.3e-05 Score=60.41 Aligned_cols=71 Identities=24% Similarity=0.466 Sum_probs=55.0
Q ss_pred CCCCCCCCC--CCCCCCCCeeeecCCCeeecCCCCCCCcCCCCCCCCCCCCCCCCCCCCcccCCcccCCCccCC-CCCCe
Q psy13147 10 PSNDSLACK--PNPCDPYSSCSVYSEHVAMCDPCSGPQAPWLPHCRPECLCNSDCPFNMACLGQKCRDPCQGTC-GVNAL 86 (180)
Q Consensus 10 ~~~d~~~C~--~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C-~~~~~ 86 (180)
.+.|||||. +.||..+..|+|+.|+ |.|. +.+||.+. .|+|+. |...| ..+..
T Consensus 232 gCvDvnEC~~ep~~c~~~qfCvNteGS-f~C~-dk~Gy~~g----~d~C~~------------------~~d~~~~kn~~ 287 (350)
T KOG4260|consen 232 GCVDVNECQNEPAPCKAHQFCVNTEGS-FKCE-DKEGYKKG----VDECQF------------------CADVCASKNRP 287 (350)
T ss_pred ccccHHHHhcCCCCCChhheeecCCCc-eEec-ccccccCC----hHHhhh------------------hhhhcccCCCC
Confidence 468999998 7899999999999999 9999 99999874 333321 01112 34677
Q ss_pred EeecCCCceeeCCCCCcc
Q psy13147 87 CTVVHHTPACYCPQGTIG 104 (180)
Q Consensus 87 C~~~~g~~~C~C~~g~~g 104 (180)
|.++.++|+|.|..|+.-
T Consensus 288 c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 288 CMNIDGQYRCVCFSGLII 305 (350)
T ss_pred cccCCccEEEEeccccee
Confidence 899999999999988743
No 20
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.58 E-value=9.1e-05 Score=40.22 Aligned_cols=34 Identities=24% Similarity=0.473 Sum_probs=29.6
Q ss_pred CCCCCCC-CCCCCCCeeeecCCCeeecCCCCCCCcCC
Q psy13147 13 DSLACKP-NPCDPYSSCSVYSEHVAMCDPCSGPQAPW 48 (180)
Q Consensus 13 d~~~C~~-~~C~~~~~C~~~~~~~~~C~~C~~G~~g~ 48 (180)
++++|.. .+|.+++.|++..+. |.|. |++||.|.
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~-~~C~-C~~g~~g~ 35 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGS-YRCS-CPPGYTGR 35 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCC-eEeE-CCCCCcCC
Confidence 4678886 799988999999988 9999 99999873
No 21
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.38 E-value=0.00027 Score=38.30 Aligned_cols=32 Identities=31% Similarity=0.778 Sum_probs=28.3
Q ss_pred CCCCC-CCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 121 NPCDH-VYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 121 d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
++|.. .+|..++.|++..++|.|.|+.||.|.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 77866 789888999999999999999999884
No 22
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.35 E-value=0.00033 Score=37.30 Aligned_cols=28 Identities=32% Similarity=0.765 Sum_probs=25.1
Q ss_pred CCCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 125 HVYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 125 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
..+|..++.|+++.++|.|.|+.||.|+
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4678778999999999999999999887
No 23
>KOG1225|consensus
Probab=97.24 E-value=0.002 Score=55.54 Aligned_cols=105 Identities=27% Similarity=0.732 Sum_probs=67.3
Q ss_pred ecCCCCCCCcCCC---CCCCCCCCCCCCCCCCCcccCCccc-------CC-----CccCCCCCCeEeecCCCceeeCCCC
Q psy13147 37 MCDPCSGPQAPWL---PHCRPECLCNSDCPFNMACLGQKCR-------DP-----CQGTCGVNALCTVVHHTPACYCPQG 101 (180)
Q Consensus 37 ~C~~C~~G~~g~~---~~C~~~C~~~~~C~~~~~C~~~~C~-------~~-----C~~~C~~~~~C~~~~g~~~C~C~~g 101 (180)
.|. |+.+|++.. ..|..- |.....|+.++|+ +. |...|..++.+++ + .|.|.+|
T Consensus 235 ic~-c~~~~~g~~c~~~~C~~~------c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~--g--~CiC~~g 303 (525)
T KOG1225|consen 235 ICE-CPEGYFGPLCSTIYCPGG------CTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVCVD--G--ECICNPG 303 (525)
T ss_pred eee-cCCceeCCccccccCCCC------CcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCceecC--C--EeecCCC
Confidence 688 888888753 123221 2222345555554 22 3333554454443 2 8999999
Q ss_pred CccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeeecCCceeeecCCCCccCCcCCCccCCcCCCCCCCCCcccCC
Q psy13147 102 TIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKHTNGIVTCECLPTYYGNGALGCRPECVLNTDCPNSAACINN 176 (180)
Q Consensus 102 ~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~deC~~~~~C~~~~~C~n~ 176 (180)
|.|. .|+. -.| +..|..+++|+ .| +|.|.+||+|.. |.... |+..+.|+|.
T Consensus 304 ~~G~---dCs~--------~~c-padC~g~G~Ci--~G--~C~C~~Gy~G~~-------C~~~~-C~~~g~cv~g 354 (525)
T KOG1225|consen 304 YSGK---DCSI--------RRC-PADCSGHGKCI--DG--ECLCDEGYTGEL-------CIQRA-CSGGGQCVNG 354 (525)
T ss_pred cccc---cccc--------ccC-CccCCCCCccc--CC--ceEeCCCCcCCc-------ccccc-cCCCceeccC
Confidence 9998 7863 334 46788999998 44 799999999873 44433 8777888887
No 24
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.19 E-value=0.00049 Score=36.61 Aligned_cols=30 Identities=23% Similarity=0.481 Sum_probs=26.0
Q ss_pred CC-CCCCCCCCeeeecCCCeeecCCCCCCCcCC
Q psy13147 17 CK-PNPCDPYSSCSVYSEHVAMCDPCSGPQAPW 48 (180)
Q Consensus 17 C~-~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~ 48 (180)
|. ..+|.++++|++..+. |.|. |+.||.|.
T Consensus 2 C~~~~~C~~~~~C~~~~~~-~~C~-C~~g~~g~ 32 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGS-YRCV-CPPGYTGD 32 (36)
T ss_pred CCCCCCCCCCCEEecCCCC-eEeE-CCCCCccc
Confidence 44 5789888999999888 9999 99999885
No 25
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.18 E-value=5.4e-05 Score=56.15 Aligned_cols=127 Identities=24% Similarity=0.432 Sum_probs=73.3
Q ss_pred CCCCCCCeeeecCCCeeecCCCCCCCcCCC-CCCC--CCCCCCCCCCCCCcccCCcccCCCccCCCCCCeEeecC-----
Q psy13147 20 NPCDPYSSCSVYSEHVAMCDPCSGPQAPWL-PHCR--PECLCNSDCPFNMACLGQKCRDPCQGTCGVNALCTVVH----- 91 (180)
Q Consensus 20 ~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~-~~C~--~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~----- 91 (180)
..|. +|.-+....+ |.|. |.+||.... .+|+ .+|... . -+ ..+|+.-+.|++..
T Consensus 6 T~CK-NG~LiQMSNH-fEC~-Cnegfvl~~EntCE~kv~C~~~------e------~~---~K~Cgdya~C~~~~~~~~~ 67 (197)
T PF06247_consen 6 TICK-NGYLIQMSNH-FECK-CNEGFVLKNENTCEEKVECDKL------E------NV---NKPCGDYAKCINQANKGEE 67 (197)
T ss_dssp ---B-TEEEEEESSE-EEEE-ESTTEEEEETTEEEE----SG-------G------GT---TSEEETTEEEEE-SSTTSS
T ss_pred cccc-CCEEEEccCc-eEEE-cCCCcEEccccccccceecCcc------c------cc---CccccchhhhhcCCCcccc
Confidence 4564 6788888888 9999 999998753 2332 223210 0 00 12577788998765
Q ss_pred CCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeeec---CCceeeecCCCCccCCcCCCc----cCCcCC
Q psy13147 92 HTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKHT---NGIVTCECLPTYYGNGALGCR----PECVLN 164 (180)
Q Consensus 92 g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~~---~g~~~C~C~~G~~g~~~~~C~----deC~~~ 164 (180)
..|.|.|.+||+.... .|.. ++|....|+ .+.|+-. +....|+|.-|++.+....|. .+|.+
T Consensus 68 ~~~~C~C~~gY~~~~~-vCvp--------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~L- 136 (197)
T PF06247_consen 68 RAYKCDCINGYILKQG-VCVP--------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSL- 136 (197)
T ss_dssp TSEEEEE-TTEEESSS-SEEE--------GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE-------
T ss_pred eeEEEecccCceeeCC-eEch--------hhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceee-
Confidence 5799999999988755 6753 677777887 6899933 445699999999944333452 26776
Q ss_pred CCCCCCCcccCC
Q psy13147 165 TDCPNSAACINN 176 (180)
Q Consensus 165 ~~C~~~~~C~n~ 176 (180)
.|..+..|..+
T Consensus 137 -KCk~nE~CK~~ 147 (197)
T PF06247_consen 137 -KCKENEECKLV 147 (197)
T ss_dssp ---TTTEEEEEE
T ss_pred -ecCCCcceeee
Confidence 46666666543
No 26
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.08 E-value=0.00086 Score=35.92 Aligned_cols=26 Identities=38% Similarity=0.862 Sum_probs=23.3
Q ss_pred CCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 126 VYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 126 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
.+|..+ +|+++.++|+|.|++||.|+
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccC
Confidence 578777 99999999999999999884
No 27
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.92 E-value=0.0011 Score=35.50 Aligned_cols=30 Identities=20% Similarity=0.480 Sum_probs=24.8
Q ss_pred CCCC-CCCCCCCeeeecCCCeeecCCCCCCCcCC
Q psy13147 16 ACKP-NPCDPYSSCSVYSEHVAMCDPCSGPQAPW 48 (180)
Q Consensus 16 ~C~~-~~C~~~~~C~~~~~~~~~C~~C~~G~~g~ 48 (180)
+|.. .+|.++ .|++..++ |.|. |++||.|.
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~-~~C~-C~~g~~g~ 31 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGS-YTCS-CPPGYTGD 31 (35)
T ss_pred CCCCcCCCCCC-EEECCCCC-eEeE-CCCCCccC
Confidence 3555 689877 99999888 9999 99999884
No 28
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.57 E-value=0.0025 Score=31.46 Aligned_cols=20 Identities=30% Similarity=0.702 Sum_probs=15.7
Q ss_pred CceeeCCCCCccCC-CCCccC
Q psy13147 93 TPACYCPQGTIGNP-YEHCAT 112 (180)
Q Consensus 93 ~~~C~C~~g~~g~~-~~~C~~ 112 (180)
+|+|+|++||...+ ...|++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCCCCCcccc
Confidence 58999999998643 238988
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.46 E-value=0.00097 Score=49.58 Aligned_cols=106 Identities=25% Similarity=0.671 Sum_probs=64.8
Q ss_pred CCCCCCCeeeecCC----CeeecCCCCCCCcCCCCCCC-CCCCCCCCCCCCCcccCCcccCCCccCCCCCCeEeecC---
Q psy13147 20 NPCDPYSSCSVYSE----HVAMCDPCSGPQAPWLPHCR-PECLCNSDCPFNMACLGQKCRDPCQGTCGVNALCTVVH--- 91 (180)
Q Consensus 20 ~~C~~~~~C~~~~~----~~~~C~~C~~G~~g~~~~C~-~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~--- 91 (180)
.+|+..+.|++.+. ..|.|. |.+||......|. ++|.. -.|+ .+.|+..+
T Consensus 50 K~Cgdya~C~~~~~~~~~~~~~C~-C~~gY~~~~~vCvp~~C~~--------------------~~Cg-~GKCI~d~~~~ 107 (197)
T PF06247_consen 50 KPCGDYAKCINQANKGEERAYKCD-CINGYILKQGVCVPNKCNN--------------------KDCG-SGKCILDPDNP 107 (197)
T ss_dssp SEEETTEEEEE-SSTTSSTSEEEE-E-TTEEESSSSEEEGGGSS-----------------------T-TEEEEEEEGGG
T ss_pred ccccchhhhhcCCCcccceeEEEe-cccCceeeCCeEchhhcCc--------------------eecC-CCeEEecCCCC
Confidence 68999999998874 349999 9999998765554 33432 1344 46776543
Q ss_pred CCceeeCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCCCCeeeecCCceeeecCCCCccCC
Q psy13147 92 HTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGSNAVCKHTNGIVTCECLPTYYGNG 153 (180)
Q Consensus 92 g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 153 (180)
....|+|.-|+..+....|... -..+|. .-|-.+..|..+.+-|+|.+..||.+++
T Consensus 108 ~~~~CSC~IGkV~~dn~kCtk~-----G~T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 108 NNPTCSCNIGKVPDDNKKCTKT-----GETKCS-LKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp SEEEEEE-TEEETTTTTESEEE-----E---------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred CCceeEeeeceEeccCCcccCC-----Ccccee-eecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 3459999999984333356420 013453 3467788999999999999999998654
No 30
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.95 E-value=0.0071 Score=29.84 Aligned_cols=13 Identities=38% Similarity=0.759 Sum_probs=11.3
Q ss_pred ceeeecCCCCccC
Q psy13147 140 IVTCECLPTYYGN 152 (180)
Q Consensus 140 ~~~C~C~~G~~g~ 152 (180)
+|+|+|++||+..
T Consensus 1 sy~C~C~~Gy~l~ 13 (24)
T PF12662_consen 1 SYTCSCPPGYQLS 13 (24)
T ss_pred CEEeeCCCCCcCC
Confidence 6999999999854
No 31
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.65 E-value=0.0097 Score=32.41 Aligned_cols=22 Identities=32% Similarity=0.697 Sum_probs=17.9
Q ss_pred CeeeecCCceeeecCCCCccCC
Q psy13147 132 AVCKHTNGIVTCECLPTYYGNG 153 (180)
Q Consensus 132 ~~C~~~~g~~~C~C~~G~~g~~ 153 (180)
..|++++++|+|.|++||....
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-T
T ss_pred CCCccCCCceEeECCCCCEECc
Confidence 4899999999999999998653
No 32
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=94.76 E-value=0.021 Score=31.17 Aligned_cols=30 Identities=33% Similarity=0.640 Sum_probs=21.5
Q ss_pred CCCCCCCCeeeecC-CceeeecCCCCccCCc
Q psy13147 125 HVYCGSNAVCKHTN-GIVTCECLPTYYGNGA 154 (180)
Q Consensus 125 ~~~C~~~~~C~~~~-g~~~C~C~~G~~g~~~ 154 (180)
...|..+|.|++.. |++.|+|..||..++.
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence 34677899999775 9999999999987654
No 33
>KOG1226|consensus
Probab=94.49 E-value=0.52 Score=42.38 Aligned_cols=116 Identities=22% Similarity=0.559 Sum_probs=63.0
Q ss_pred ecCCCCCCCcCCCCCCCCCCCCC----CCCCCCCcccCCcccCCCccCCCCCCeEeecCCCceeeCCCCCccCC-CCCcc
Q psy13147 37 MCDPCSGPQAPWLPHCRPECLCN----SDCPFNMACLGQKCRDPCQGTCGVNALCTVVHHTPACYCPQGTIGNP-YEHCA 111 (180)
Q Consensus 37 ~C~~C~~G~~g~~~~C~~~C~~~----~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~g~~g~~-~~~C~ 111 (180)
.|. |.+||.|..-.|....... ..|+.... ...|...+.|+= -.|.|.+...+.- ...|+
T Consensus 479 ~C~-C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~----------~~vCSgrG~C~C----GqC~C~~~~~~~i~G~fCE 543 (783)
T KOG1226|consen 479 QCR-CDEGWLGKKCECSTDELSSSEEEDKCRENSD----------SPVCSGRGDCVC----GQCVCHKPDNGKIYGKFCE 543 (783)
T ss_pred cee-cCCCCCCCcccCCccccCcHhHHhhccCCCC----------CCCcCCCCcEeC----CceEecCCCCCceeeeeee
Confidence 789 9999999754442111110 11111110 014555555542 2467766655211 01665
Q ss_pred CCCCCCCCCCCCC---CCCCCCCCeeeecCCceeeecCCCCccCCcCCC---ccCCcCC--CCCCCCCcccCCcc
Q psy13147 112 TPLAPVPPPNPCD---HVYCGSNAVCKHTNGIVTCECLPTYYGNGALGC---RPECVLN--TDCPNSAACINNSI 178 (180)
Q Consensus 112 ~~~~~~~~~d~C~---~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C---~deC~~~--~~C~~~~~C~n~~~ 178 (180)
- +--.|. ...|..+++|.= | +|+|.+||+|+.. .| .|-|+.. ..|...+.|.=.+|
T Consensus 544 C------DnfsC~r~~g~lC~g~G~C~C--G--~CvC~~GwtG~~C-~C~~std~C~~~~G~iCSGrG~C~Cg~C 607 (783)
T KOG1226|consen 544 C------DNFSCERHKGVLCGGHGRCEC--G--RCVCNPGWTGSAC-NCPLSTDTCESSDGQICSGRGTCECGRC 607 (783)
T ss_pred c------cCcccccccCcccCCCCeEeC--C--cEEcCCCCccCCC-CCCCCCccccCCCCceeCCCceeeCCce
Confidence 4 212343 246888888843 2 6999999999865 23 3566653 56766666665544
No 34
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.46 E-value=0.032 Score=30.34 Aligned_cols=22 Identities=23% Similarity=0.493 Sum_probs=17.9
Q ss_pred CeEeecCCCceeeCCCCCccCC
Q psy13147 85 ALCTVVHHTPACYCPQGTIGNP 106 (180)
Q Consensus 85 ~~C~~~~g~~~C~C~~g~~g~~ 106 (180)
..|++++++|+|.|++||....
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-T
T ss_pred CCCccCCCceEeECCCCCEECc
Confidence 5899999999999999998753
No 35
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=94.12 E-value=0.014 Score=31.77 Aligned_cols=34 Identities=18% Similarity=0.364 Sum_probs=22.9
Q ss_pred CCCCCCCCCCeeeecCCCeeecCCCCCCCcCCCCC
Q psy13147 17 CKPNPCDPYSSCSVYSEHVAMCDPCSGPQAPWLPH 51 (180)
Q Consensus 17 C~~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~~~ 51 (180)
|....|+.|+.|.+...+.+.|. |..||..++..
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecr-Cllgyk~~~~~ 35 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECR-CLLGYKKVGGK 35 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEE-E-TTEEEETTE
T ss_pred ccCccCCCCcccEEcCCCCEEEE-eeCCccccCCC
Confidence 44567889999999884449999 99999876543
No 36
>KOG0994|consensus
Probab=93.81 E-value=0.37 Score=45.26 Aligned_cols=60 Identities=22% Similarity=0.512 Sum_probs=35.1
Q ss_pred eEeecCCCcee-eCCCCCccCCCCCccCCCCCCCCCCCCCCCCCCC-------C-Ceee--ecCCceeeecCCCCccCCc
Q psy13147 86 LCTVVHHTPAC-YCPQGTIGNPYEHCATPLAPVPPPNPCDHVYCGS-------N-AVCK--HTNGIVTCECLPTYYGNGA 154 (180)
Q Consensus 86 ~C~~~~g~~~C-~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~~C~~-------~-~~C~--~~~g~~~C~C~~G~~g~~~ 154 (180)
.|....+++.| +|..||.|++. .=.. ..|.+-||.. + -.|. +......|.|.+||+|+-.
T Consensus 877 ~CqD~T~G~~CdrCl~GyyGdP~-lg~g--------~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RC 947 (1758)
T KOG0994|consen 877 DCQDSTTGHSCDRCLDGYYGDPR-LGSG--------IGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRC 947 (1758)
T ss_pred cccccccccchhhhhccccCCcc-cCCC--------CCCCCCCCCCCCccchhccccccccccccceeeecccCccccch
Confidence 35566677888 59999999874 1111 2233223311 1 1233 2345568999999998754
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.23 E-value=0.11 Score=27.39 Aligned_cols=25 Identities=24% Similarity=0.590 Sum_probs=20.8
Q ss_pred CCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 126 VYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 126 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
..|..+++|+.. ..+|+|.+||.|.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCC
Confidence 368889999877 4589999999986
No 38
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.22 E-value=0.13 Score=27.17 Aligned_cols=24 Identities=25% Similarity=0.634 Sum_probs=19.8
Q ss_pred CCCCCCeEeecCCCceeeCCCCCccC
Q psy13147 80 TCGVNALCTVVHHTPACYCPQGTIGN 105 (180)
Q Consensus 80 ~C~~~~~C~~~~g~~~C~C~~g~~g~ 105 (180)
.|..+++|+.. ..+|.|.+||.|.
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCC
Confidence 47778888865 5789999999987
No 39
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=92.96 E-value=0.047 Score=22.82 Aligned_cols=11 Identities=45% Similarity=1.135 Sum_probs=8.7
Q ss_pred eeeCCCCCccC
Q psy13147 95 ACYCPQGTIGN 105 (180)
Q Consensus 95 ~C~C~~g~~g~ 105 (180)
.|.|++||+|.
T Consensus 1 ~C~C~~G~~G~ 11 (13)
T PF12661_consen 1 TCQCPPGWTGP 11 (13)
T ss_dssp EEEE-TTEETT
T ss_pred CccCcCCCcCC
Confidence 48999999987
No 40
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=92.69 E-value=0.14 Score=39.47 Aligned_cols=32 Identities=31% Similarity=0.623 Sum_probs=25.7
Q ss_pred CCCCC--CCCCCCCCeeeecCCceeeecCCCCccCC
Q psy13147 120 PNPCD--HVYCGSNAVCKHTNGIVTCECLPTYYGNG 153 (180)
Q Consensus 120 ~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 153 (180)
.++|. ++.|. ..|.++.|+|.|.|++||....
T Consensus 187 ~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 187 PDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred chhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence 48895 35564 5899999999999999998653
No 41
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=88.43 E-value=0.46 Score=32.32 Aligned_cols=32 Identities=22% Similarity=0.544 Sum_probs=25.6
Q ss_pred CCCCCC-CCCCCCCCeeeecCCCeeecCCCCCCCcC
Q psy13147 13 DSLACK-PNPCDPYSSCSVYSEHVAMCDPCSGPQAP 47 (180)
Q Consensus 13 d~~~C~-~~~C~~~~~C~~~~~~~~~C~~C~~G~~g 47 (180)
+.|.|. ...|+.++.|.. ... ..|. |.+||..
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~-~~C~-Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNS-PKCS-CLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCC-CCC-CceE-CCCCcCC
Confidence 567898 689999999954 334 6899 9999975
No 42
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=84.73 E-value=1.1 Score=30.46 Aligned_cols=32 Identities=34% Similarity=0.949 Sum_probs=24.6
Q ss_pred CCCCC-CCCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 120 PNPCD-HVYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 120 ~d~C~-~~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
.|.|. ...|+..+.|.. .....|.|.+||..+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 37786 478999999954 455679999999753
No 43
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=79.12 E-value=2.1 Score=32.86 Aligned_cols=21 Identities=19% Similarity=0.403 Sum_probs=18.9
Q ss_pred CeEeecCCCceeeCCCCCccC
Q psy13147 85 ALCTVVHHTPACYCPQGTIGN 105 (180)
Q Consensus 85 ~~C~~~~g~~~C~C~~g~~g~ 105 (180)
..|.++.|+|.|.|.+||+..
T Consensus 199 ~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 199 QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred ceEEcCCCCEEeECCCCccCC
Confidence 579999999999999999764
No 44
>PHA02887 EGF-like protein; Provisional
Probab=76.47 E-value=3 Score=28.85 Aligned_cols=29 Identities=28% Similarity=0.642 Sum_probs=23.3
Q ss_pred CCCCCCeEeecC--CCceeeCCCCCccCCCCCccC
Q psy13147 80 TCGVNALCTVVH--HTPACYCPQGTIGNPYEHCAT 112 (180)
Q Consensus 80 ~C~~~~~C~~~~--g~~~C~C~~g~~g~~~~~C~~ 112 (180)
-|- +|+|.... ..+.|.|.+||+|. .|+.
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~ 123 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGI---RCDE 123 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccC---CCCc
Confidence 465 67997765 67999999999998 7864
No 45
>KOG1226|consensus
Probab=76.40 E-value=17 Score=33.10 Aligned_cols=60 Identities=25% Similarity=0.741 Sum_probs=36.6
Q ss_pred CCCCCCeEeecCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCCC---CCCCCCCeeeecCCceeeecCCC-CccCCc
Q psy13147 80 TCGVNALCTVVHHTPACYCPQGTIGNPYEHCATPLAPVPPPNPCDH---VYCGSNAVCKHTNGIVTCECLPT-YYGNGA 154 (180)
Q Consensus 80 ~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~---~~C~~~~~C~~~~g~~~C~C~~G-~~g~~~ 154 (180)
.|..+++|.= -+|.|.+||+|..= .|.. ..+.|.+ ..|...++|.=. +|.|... |+|...
T Consensus 556 lC~g~G~C~C----G~CvC~~GwtG~~C-~C~~------std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~C 619 (783)
T KOG1226|consen 556 LCGGHGRCEC----GRCVCNPGWTGSAC-NCPL------STDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFC 619 (783)
T ss_pred ccCCCCeEeC----CcEEcCCCCccCCC-CCCC------CCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchh
Confidence 4777777743 36999999999832 2322 2356642 346666666421 5677655 877654
No 46
>smart00051 DSL delta serrate ligand.
Probab=74.20 E-value=5.4 Score=24.42 Aligned_cols=45 Identities=22% Similarity=0.444 Sum_probs=28.8
Q ss_pred CceeeCCCCCccCCCCCccCCCCCCCCCCCCCC-CCCCCCCeeeecCCceeeecCCCCccC
Q psy13147 93 TPACYCPQGTIGNPYEHCATPLAPVPPPNPCDH-VYCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 93 ~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
.++-.|.++|.|. .|.. .|.+ .....+..|.. .| .++|.+||.|.
T Consensus 16 ~~rv~C~~~~yG~---~C~~---------~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~ 61 (63)
T smart00051 16 QIRVTCDENYYGE---GCNK---------FCRPRDDFFGHYTCDE-NG--NKGCLEGWMGP 61 (63)
T ss_pred EEEeeCCCCCcCC---ccCC---------EeCcCccccCCccCCc-CC--CEecCCCCcCC
Confidence 4566799999888 6632 2422 22345667743 34 47899999885
No 47
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=74.19 E-value=3.5 Score=29.01 Aligned_cols=29 Identities=24% Similarity=0.533 Sum_probs=23.1
Q ss_pred CCCCCCeEeecC--CCceeeCCCCCccCCCCCccC
Q psy13147 80 TCGVNALCTVVH--HTPACYCPQGTIGNPYEHCAT 112 (180)
Q Consensus 80 ~C~~~~~C~~~~--g~~~C~C~~g~~g~~~~~C~~ 112 (180)
-|.. +.|.... ..+.|.|..||+|. .|+-
T Consensus 52 YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh 82 (139)
T PHA03099 52 YCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQH 82 (139)
T ss_pred EeEC-CEEEeeccCCCceeECCCCcccc---cccc
Confidence 4654 5897765 78999999999999 7864
No 48
>PHA02887 EGF-like protein; Provisional
Probab=73.81 E-value=3.5 Score=28.50 Aligned_cols=32 Identities=25% Similarity=0.534 Sum_probs=23.9
Q ss_pred CCCC---CCCCCCCCeeee--cCCceeeecCCCCccCC
Q psy13147 121 NPCD---HVYCGSNAVCKH--TNGIVTCECLPTYYGNG 153 (180)
Q Consensus 121 d~C~---~~~C~~~~~C~~--~~g~~~C~C~~G~~g~~ 153 (180)
++|. .+.|. |++|.. ......|+|+.||.|.-
T Consensus 84 ~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~R 120 (126)
T PHA02887 84 EKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIR 120 (126)
T ss_pred cccChHhhCEee-CCEEEccccCCCceeECCCCcccCC
Confidence 5563 25675 689984 46778999999999973
No 49
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=70.96 E-value=4.9 Score=23.22 Aligned_cols=27 Identities=22% Similarity=0.434 Sum_probs=18.5
Q ss_pred CC-CCCCCCCCeeeecCCCeeecCCCCCCCcCCC
Q psy13147 17 CK-PNPCDPYSSCSVYSEHVAMCDPCSGPQAPWL 49 (180)
Q Consensus 17 C~-~~~C~~~~~C~~~~~~~~~C~~C~~G~~g~~ 49 (180)
|. ...|..++.|++ -+|. |++||...+
T Consensus 22 C~~~~qC~~~s~C~~-----g~C~-C~~g~~~~~ 49 (52)
T PF01683_consen 22 CESDEQCIGGSVCVN-----GRCQ-CPPGYVEVG 49 (52)
T ss_pred CCCcCCCCCcCEEcC-----CEeE-CCCCCEecC
Confidence 44 446667778853 4798 999987654
No 50
>KOG0994|consensus
Probab=69.68 E-value=14 Score=35.38 Aligned_cols=61 Identities=30% Similarity=0.663 Sum_probs=34.5
Q ss_pred CeEeecCCCc--ee-eCCCCCccCCC-C--CccCCCCCCCCCCCCCC--CCCCCCCeeeecCCceee-ecCCCCccCCc
Q psy13147 85 ALCTVVHHTP--AC-YCPQGTIGNPY-E--HCATPLAPVPPPNPCDH--VYCGSNAVCKHTNGIVTC-ECLPTYYGNGA 154 (180)
Q Consensus 85 ~~C~~~~g~~--~C-~C~~g~~g~~~-~--~C~~~~~~~~~~d~C~~--~~C~~~~~C~~~~g~~~C-~C~~G~~g~~~ 154 (180)
+.|+=.+|.| .| .|.+||.|-+. . .|.+ -.|+|.+ ..|. .|.+...++.| +|..||.|++.
T Consensus 830 GQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNg------HA~~Cd~~tGaCi---~CqD~T~G~~CdrCl~GyyGdP~ 899 (1758)
T KOG0994|consen 830 GQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNG------HADTCDPITGACI---DCQDSTTGHSCDRCLDGYYGDPR 899 (1758)
T ss_pred cceeeccccchhhccccCCCccCCCcCccccccC------cccccCccccccc---cccccccccchhhhhccccCCcc
Confidence 3344334444 34 48888887652 1 2222 2355543 2231 35566777888 69999998764
No 51
>KOG3516|consensus
Probab=63.46 E-value=5.3 Score=37.96 Aligned_cols=35 Identities=29% Similarity=0.509 Sum_probs=29.4
Q ss_pred CCCCCCCCCCCCCCCeeeecCCCeeecCCCC-CCCcCC
Q psy13147 12 NDSLACKPNPCDPYSSCSVYSEHVAMCDPCS-GPQAPW 48 (180)
Q Consensus 12 ~d~~~C~~~~C~~~~~C~~~~~~~~~C~~C~-~G~~g~ 48 (180)
.-+|.|.+++|.+++.|...-.. |.|. |. .||.|.
T Consensus 543 ~i~drClPN~CehgG~C~Qs~~~-f~C~-C~~TGY~Ga 578 (1306)
T KOG3516|consen 543 GISDRCLPNPCEHGGKCSQSWDD-FECN-CELTGYKGA 578 (1306)
T ss_pred ccccccCCccccCCCcccccccc-eeEe-ccccccccc
Confidence 34677889999999999986666 9999 99 899885
No 52
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=63.38 E-value=8.5 Score=27.12 Aligned_cols=27 Identities=30% Similarity=0.552 Sum_probs=21.2
Q ss_pred CCCCCCCeeee--cCCceeeecCCCCccCC
Q psy13147 126 VYCGSNAVCKH--TNGIVTCECLPTYYGNG 153 (180)
Q Consensus 126 ~~C~~~~~C~~--~~g~~~C~C~~G~~g~~ 153 (180)
+.|.+ +.|.. ....+.|+|..||.|.-
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGeR 79 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGIR 79 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCccccc
Confidence 56754 58984 45889999999999974
No 53
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=51.93 E-value=13 Score=19.83 Aligned_cols=15 Identities=33% Similarity=0.922 Sum_probs=11.9
Q ss_pred CCceeeCCCCCccCC
Q psy13147 92 HTPACYCPQGTIGNP 106 (180)
Q Consensus 92 g~~~C~C~~g~~g~~ 106 (180)
....|.|+.||+.+.
T Consensus 16 ~~~~C~CPeGyIlde 30 (34)
T PF09064_consen 16 SPGQCFCPEGYILDE 30 (34)
T ss_pred CCCceeCCCceEecC
Confidence 345899999998764
No 54
>KOG3516|consensus
Probab=46.86 E-value=16 Score=34.95 Aligned_cols=34 Identities=21% Similarity=0.496 Sum_probs=30.4
Q ss_pred CCCCCCCCCCCCCeeeecCCceeeecC-CCCccCC
Q psy13147 120 PNPCDHVYCGSNAVCKHTNGIVTCECL-PTYYGNG 153 (180)
Q Consensus 120 ~d~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~g~~ 153 (180)
+|.|.+++|..++.|.-.-..|.|.|. .||.|..
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Gat 579 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGAT 579 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccccc
Confidence 588999999999999998999999999 7998863
No 55
>KOG3514|consensus
Probab=38.28 E-value=32 Score=32.97 Aligned_cols=31 Identities=35% Similarity=0.721 Sum_probs=25.8
Q ss_pred CCCCCCCCCCCeeeecCCCeeecCCCCC-CCcCC
Q psy13147 16 ACKPNPCDPYSSCSVYSEHVAMCDPCSG-PQAPW 48 (180)
Q Consensus 16 ~C~~~~C~~~~~C~~~~~~~~~C~~C~~-G~~g~ 48 (180)
.|.++||.+++.|...-.. |.|. |.. ||.|.
T Consensus 625 ~C~~nPC~N~g~C~egwNr-fiCD-Cs~T~~~G~ 656 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNR-FICD-CSGTGFEGR 656 (1591)
T ss_pred ccCCCcccCCCCccccccc-cccc-cccCcccCc
Confidence 7899999999999987777 9998 864 67664
No 56
>PF04706 Dickkopf_N: Dickkopf N-terminal cysteine-rich region; InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=33.92 E-value=83 Score=18.44 Aligned_cols=18 Identities=22% Similarity=0.600 Sum_probs=8.2
Q ss_pred CCCCCCCCCCCcccCCcc
Q psy13147 56 CLCNSDCPFNMACLGQKC 73 (180)
Q Consensus 56 C~~~~~C~~~~~C~~~~C 73 (180)
|.-+..|-.+..|++++|
T Consensus 33 C~Rd~~CC~g~~CvnG~C 50 (52)
T PF04706_consen 33 CTRDAMCCPGNLCVNGVC 50 (52)
T ss_pred CCCCcccCCCCeeeCCEe
Confidence 333344444455555444
No 57
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=28.30 E-value=60 Score=18.21 Aligned_cols=20 Identities=15% Similarity=0.336 Sum_probs=14.7
Q ss_pred CeeeecCCCeeecCCCCCCCcCCC
Q psy13147 26 SSCSVYSEHVAMCDPCSGPQAPWL 49 (180)
Q Consensus 26 ~~C~~~~~~~~~C~~C~~G~~g~~ 49 (180)
..|.. .. .+|. |+++|+|..
T Consensus 11 ~~C~~--~~-G~C~-C~~~~~G~~ 30 (49)
T PF00053_consen 11 QTCDP--ST-GQCV-CKPGTTGPR 30 (49)
T ss_dssp SSEEE--TC-EEES-BSTTEESTT
T ss_pred CcccC--CC-CEEe-ccccccCCc
Confidence 46765 23 6899 999999853
No 58
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=26.73 E-value=1.2e+02 Score=19.87 Aligned_cols=14 Identities=21% Similarity=0.434 Sum_probs=10.2
Q ss_pred CCceeeCCCCCccC
Q psy13147 92 HTPACYCPQGTIGN 105 (180)
Q Consensus 92 g~~~C~C~~g~~g~ 105 (180)
....|.|.+||.-.
T Consensus 73 ~dt~C~C~~G~y~~ 86 (98)
T cd00185 73 RNTVCGCKPGFYCL 86 (98)
T ss_pred CCCeEeCCCCCEec
Confidence 45678899998544
No 59
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=25.71 E-value=1e+02 Score=17.44 Aligned_cols=13 Identities=38% Similarity=0.902 Sum_probs=8.8
Q ss_pred eeecCCCCccCCc
Q psy13147 142 TCECLPTYYGNGA 154 (180)
Q Consensus 142 ~C~C~~G~~g~~~ 154 (180)
.|.|++++.|...
T Consensus 20 ~C~C~~~~~G~~C 32 (50)
T cd00055 20 QCECKPNTTGRRC 32 (50)
T ss_pred EEeCCCcCCCCCC
Confidence 5777777777543
No 60
>KOG3607|consensus
Probab=23.08 E-value=92 Score=28.63 Aligned_cols=49 Identities=24% Similarity=0.645 Sum_probs=34.7
Q ss_pred CCCCCCCCCcccCCccc-------CCCccCCCCCCeEeecCCCceeeCCCCCccCCCCCccC
Q psy13147 58 CNSDCPFNMACLGQKCR-------DPCQGTCGVNALCTVVHHTPACYCPQGTIGNPYEHCAT 112 (180)
Q Consensus 58 ~~~~C~~~~~C~~~~C~-------~~C~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~C~~ 112 (180)
++..|..+.+|.+.+|+ ..|...|..++.|-+ .+.|.|.+||.+. .|..
T Consensus 602 dGt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn---~~~ChC~~gwapp---~C~~ 657 (716)
T KOG3607|consen 602 DGTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNN---ELNCHCEPGWAPP---FCFI 657 (716)
T ss_pred CCCccCCCceecCCcchhhhhhcccccccccCCCcccCC---CcceeeCCCCCCC---cccc
Confidence 34568888888888884 334445777777754 5689999999877 6653
No 61
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=22.20 E-value=43 Score=20.41 Aligned_cols=45 Identities=24% Similarity=0.515 Sum_probs=16.2
Q ss_pred CceeeCCCCCccCCCCCccCCCCCCCCCCCCCCC-CCCCCCeeeecCCceeeecCCCCccC
Q psy13147 93 TPACYCPQGTIGNPYEHCATPLAPVPPPNPCDHV-YCGSNAVCKHTNGIVTCECLPTYYGN 152 (180)
Q Consensus 93 ~~~C~C~~g~~g~~~~~C~~~~~~~~~~d~C~~~-~C~~~~~C~~~~g~~~C~C~~G~~g~ 152 (180)
.++-.|.+.|.|. .|.. .|.+. .-..+-+|. ..|.. +|.+||.|.
T Consensus 16 ~~rv~C~~nyyG~---~C~~---------~C~~~~d~~ghy~Cd-~~G~~--~C~~Gw~G~ 61 (63)
T PF01414_consen 16 RIRVVCDENYYGP---NCSK---------FCKPRDDSFGHYTCD-SNGNK--VCLPGWTGP 61 (63)
T ss_dssp -------TTEETT---TT-E---------E---EEETTEEEEE--SS--E--EE-TTEEST
T ss_pred EEEEECCCCCCCc---cccC---------CcCCCcCCcCCcccC-CCCCC--CCCCCCcCC
Confidence 4566788888887 5632 23221 011233444 23332 588898875
Done!