Query         psy15668
Match_columns 365
No_of_seqs    333 out of 2518
Neff          9.4
Searched_HMMs 46136
Date          Fri Aug 16 18:31:07 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15668.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15668hhsearch_cdd -cpu 12 -v 0
 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.7   5E-17 1.1E-21  163.4  17.2  108    5-137  1177-1308(2531)
  2 KOG1214|consensus               99.6 5.4E-15 1.2E-19  142.8  14.4  163   67-266   692-865 (1289)
  3 KOG1214|consensus               99.6   2E-13 4.3E-18  132.2  18.6  204   12-269   699-919 (1289)
  4 KOG4289|consensus               99.5 3.8E-13 8.2E-18  136.1  15.2   90   83-196  1218-1308(2531)
  5 KOG1217|consensus               99.5 2.9E-12 6.2E-17  125.4  20.5  274   10-346    92-389 (487)
  6 KOG1219|consensus               99.4 4.4E-13 9.5E-18  140.2   8.6  109    7-141  3864-3974(4289)
  7 KOG1219|consensus               99.4 6.8E-13 1.5E-17  138.8   8.5  108  110-259  3865-3974(4289)
  8 KOG1217|consensus               99.3 4.4E-11 9.6E-16  117.0  17.2  202    8-259   170-389 (487)
  9 KOG1225|consensus               98.9 2.8E-08   6E-13   95.5  15.0  131   88-347   235-365 (525)
 10 KOG0994|consensus               98.9 1.9E-08   4E-13  101.3  13.5  233   80-355   878-1152(1758)
 11 KOG1225|consensus               98.9 1.7E-08 3.6E-13   97.0  11.7  132   28-260   234-365 (525)
 12 KOG4260|consensus               98.8   6E-09 1.3E-13   89.4   4.7  150   12-198   149-304 (350)
 13 KOG0994|consensus               98.7 2.2E-07 4.8E-12   93.8  14.7  231   88-352   842-1101(1758)
 14 KOG4260|consensus               98.5 3.1E-07 6.7E-12   79.1   5.9  135   74-257   152-304 (350)
 15 PF07645 EGF_CA: Calcium-bindi   98.4 1.7E-07 3.8E-12   58.9   2.1   34    6-39      1-36  (42)
 16 PF00008 EGF: EGF-like domain    98.3 3.9E-07 8.5E-12   53.4   1.6   30   10-39      1-31  (32)
 17 smart00179 EGF_CA Calcium-bind  98.2 2.4E-06 5.2E-11   52.6   4.0   34    6-39      1-36  (39)
 18 PF06247 Plasmod_Pvs28: Plasmo   98.1 3.2E-07   7E-12   75.0  -1.3  141   77-260    10-163 (197)
 19 PF07645 EGF_CA: Calcium-bindi   97.9 3.5E-06 7.6E-11   52.9   0.9   32  308-344     1-34  (42)
 20 PF00008 EGF: EGF-like domain    97.9 3.6E-06 7.8E-11   49.3   0.6   32  312-347     1-32  (32)
 21 cd00054 EGF_CA Calcium-binding  97.9 2.5E-05 5.3E-10   47.6   4.0   34    6-39      1-35  (38)
 22 PF12947 EGF_3: EGF domain; I    97.8 1.3E-05 2.8E-10   48.1   2.0   29   13-41      6-34  (36)
 23 smart00179 EGF_CA Calcium-bind  97.7 6.1E-05 1.3E-09   46.2   3.8   34  309-347     2-37  (39)
 24 PF12947 EGF_3: EGF domain; I    97.6 3.2E-05   7E-10   46.4   1.7   29  233-261     6-34  (36)
 25 KOG1836|consensus               97.6   0.004 8.6E-08   68.5  18.5  242   80-355   749-1027(1705)
 26 cd00053 EGF Epidermal growth f  97.3 0.00028 6.1E-09   42.0   3.6   30   10-39      2-32  (36)
 27 KOG1226|consensus               97.3  0.0041 8.9E-08   61.9  12.9  143  131-362   479-635 (783)
 28 smart00181 EGF Epidermal growt  97.3 0.00037 7.9E-09   41.6   3.7   30    9-39      1-31  (35)
 29 PF06247 Plasmod_Pvs28: Plasmo   97.2   5E-05 1.1E-09   62.4  -1.0  148   14-201     7-163 (197)
 30 cd00054 EGF_CA Calcium-binding  97.2 0.00053 1.2E-08   41.4   3.7   34  309-347     2-36  (38)
 31 KOG1226|consensus               97.2  0.0052 1.1E-07   61.2  11.9  128   88-267   479-622 (783)
 32 PF12662 cEGF: Complement Clr-   97.1 0.00044 9.6E-09   37.2   2.5   24   27-54      1-24  (24)
 33 KOG1836|consensus               97.1   0.014   3E-07   64.4  15.1  174  171-353   777-977 (1705)
 34 cd00053 EGF Epidermal growth f  96.6  0.0027 5.9E-08   37.6   3.5   28  314-346     5-32  (36)
 35 PF12662 cEGF: Complement Clr-   96.4  0.0033 7.1E-08   33.8   2.2   23  247-269     1-24  (24)
 36 smart00181 EGF Epidermal growt  96.4  0.0047   1E-07   36.6   3.2   29  312-346     2-31  (35)
 37 PF07974 EGF_2: EGF-like domai   96.2  0.0082 1.8E-07   34.9   3.3   25   13-39      6-30  (32)
 38 PF14670 FXa_inhibition: Coagu   96.1  0.0039 8.4E-08   37.3   1.9   23   15-39      8-30  (36)
 39 PF07974 EGF_2: EGF-like domai   94.8   0.047   1E-06   31.7   3.3   24  234-259     7-30  (32)
 40 PF14670 FXa_inhibition: Coagu   94.5   0.036 7.8E-07   33.2   2.3   22  238-259     9-30  (36)
 41 PF12661 hEGF: Human growth fa   94.2    0.02 4.3E-07   26.0   0.6   11  249-259     1-11  (13)
 42 PF12946 EGF_MSP1_1: MSP1 EGF    91.5     0.1 2.2E-06   31.2   1.1   30   10-39      2-32  (37)
 43 PF12946 EGF_MSP1_1: MSP1 EGF    87.4    0.34 7.5E-06   28.9   1.2   28  173-200     4-32  (37)
 44 KOG3512|consensus               86.4     3.9 8.6E-05   39.0   8.2  158  180-347   285-476 (592)
 45 cd01475 vWA_Matrilin VWA_Matri  85.3    0.97 2.1E-05   39.5   3.6   38  303-347   181-220 (224)
 46 cd01475 vWA_Matrilin VWA_Matri  81.6     1.4 2.9E-05   38.6   3.0   21  239-259   199-219 (224)
 47 smart00051 DSL delta serrate l  80.5     2.6 5.6E-05   28.7   3.4   23  317-347    40-62  (63)
 48 KOG3516|consensus               79.6     1.3 2.9E-05   46.8   2.5   33    7-39    545-578 (1306)
 49 KOG1218|consensus               77.4      46   0.001   30.3  12.0   14   26-39     13-26  (316)
 50 PHA02887 EGF-like protein; Pro  75.9     2.9 6.2E-05   32.0   2.7   36    7-46     83-123 (126)
 51 PHA03099 epidermal growth fact  75.5     3.1 6.7E-05   32.4   2.9   37    6-46     41-82  (139)
 52 PF00053 Laminin_EGF: Laminin    75.4     1.9   4E-05   27.6   1.5   22  333-354    16-37  (49)
 53 PF00954 S_locus_glycop: S-loc   74.9     3.4 7.4E-05   31.6   3.1   32    6-38     76-108 (110)
 54 smart00051 DSL delta serrate l  74.1     4.9 0.00011   27.4   3.3   44   87-141    17-61  (63)
 55 cd00055 EGF_Lam Laminin-type e  72.7     5.7 0.00012   25.5   3.3   20  335-354    19-38  (50)
 56 PHA02887 EGF-like protein; Pro  71.7     3.5 7.6E-05   31.5   2.3   30  233-266    92-123 (126)
 57 PHA03099 epidermal growth fact  67.6     5.6 0.00012   31.0   2.7   31  233-267    51-83  (139)
 58 KOG3514|consensus               61.2     5.6 0.00012   41.9   2.2   31    9-39    625-656 (1591)
 59 PF09064 Tme5_EGF_like: Thromb   59.8     8.8 0.00019   22.4   1.9   13  335-347    18-30  (34)
 60 PF01683 EB: EB module; Inter    58.8      24 0.00053   22.6   4.3   28  310-346    20-48  (52)
 61 smart00180 EGF_Lam Laminin-typ  58.7      12 0.00026   23.6   2.7   20  335-354    18-37  (46)
 62 KOG3516|consensus               55.0     9.7 0.00021   40.7   2.7   37  306-347   542-579 (1306)
 63 PF00954 S_locus_glycop: S-loc   54.4      12 0.00025   28.6   2.5   24   74-98     86-109 (110)
 64 KOG3512|consensus               49.9      43 0.00094   32.3   5.8   62  285-353   364-432 (592)
 65 PF12955 DUF3844: Domain of un   48.5     7.8 0.00017   29.2   0.7   35    5-39      3-44  (103)
 66 KOG1218|consensus               37.2 3.2E+02  0.0069   24.7  12.7   40   92-141    96-135 (316)
 67 KOG3514|consensus               36.9      25 0.00054   37.5   2.3   32  311-347   625-657 (1591)
 68 PF01414 DSL: Delta serrate li   36.2      16 0.00035   24.8   0.7   14  188-201    16-29  (63)
 69 KOG3607|consensus               26.7      63  0.0014   33.5   3.3   49   52-106   603-658 (716)
No 1
>KOG4289|consensus
Probab=99.74 E-value=5e-17 Score=163.42 Aligned_cols=108 Identities=31% Similarity=0.749 Sum_probs=89.3
Q ss_pred cCCCCCCCCCCCCCCeeeec----------------------CCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCccc
Q psy15668 5 MGGDPCSPNPCGSNTQCNVA----------------------SNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACK 62 (365)
Q Consensus 5 ~did~C~~~~C~~~~~C~~~----------------------~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~ 62 (365)
.|-+.|...||.+..+|+.+ .+++.|.||+||+|+ .|+. .
T Consensus 1177 fdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd---~CeT----e----------- 1238 (2531)
T KOG4289|consen 1177 FDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGD---YCET----E----------- 1238 (2531)
T ss_pred ccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcc---cccc----h-----------
Confidence 45677999999998889743 468899999999999 6653 2
Q ss_pred CCccccccccC-CCCCCeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeC-CCCceeeCCCC
Q psy15668 63 EYRCVDVCAGQ-CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVI-NMVPTCSCLPG 137 (365)
Q Consensus 63 ~~~C~~~C~~~-C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~-~~~~~C~C~~G 137 (365)
+|.|... |+++++|...+|+|.|.|.+||+|. .|+.......|.+..|.++++|++. .+++.|.|+.|
T Consensus 1239 ----iDlCYs~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1239 ----IDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred ----hHhhhcCCCCCCCceEEecCceeEEecCCcccc---ceeeecccCccccceecCCCEEeecCCCceeccCCCc
Confidence 4444444 8999999999999999999999999 9986644466888999999999987 57888999987
No 2
>KOG1214|consensus
Probab=99.62 E-value=5.4e-15 Score=142.81 Aligned_cols=163 Identities=29% Similarity=0.647 Sum_probs=117.1
Q ss_pred ccccccC---CCCCCeeeecCC-CceeeCCCCCccCCCCCcccCCCCCCCCC--CCCCCCCeeeeCCCCceeeCCCCCCC
Q psy15668 67 VDVCAGQ---CGVNSECNVRNH-IPVCSCPPGYTGDPLTQCRRFDPQELCDR--SPCGVNTRCEVINMVPTCSCLPGYTG 140 (365)
Q Consensus 67 ~~~C~~~---C~~~~~C~~~~g-~~~C~C~~G~~g~~~~~C~~~~~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g 140 (365)
+++|... |..++.|....+ .|.|.|..||.|+++ .|.++ ++|+. ..|++++.|++.+++|+|.|..||..
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr-~c~d~---~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F 767 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGR-NCVDE---NECATGFHRCGPNSVCINLPGSYRCECRSGYEF 767 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCCC-CCCCh---hhhccCCCCCCCCceeecCCCceeEEEeeccee
Confidence 4566543 888899987654 599999999999988 79887 78875 66999999999999999999998864
Q ss_pred CCCCCCCCCCCCCCCCCCCCcccCCcccCCCCC--CCCCCCC--eeeeC-CCceeeeCCCCCccCCCCccCccccCCCCC
Q psy15668 141 SPLSGCRHECDSDYDCGPSQSCVNYKCANPCAS--GACAPTA--QCEVR-NHRAVCSCPVGYLGDPYTSCRAECLAHSDC 215 (365)
Q Consensus 141 ~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~~--~~C~~~~--~C~~~-~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C 215 (365)
. .+...|....+=.+ ++.|.. ..|...+ +|+.. .+.|.|.|.+||.|++.. |.++
T Consensus 768 ~---dd~~tCV~i~~pap---------~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-----c~dv--- 827 (1289)
T KOG1214|consen 768 A---DDRHTCVLITPPAP---------ANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-----CTDV--- 827 (1289)
T ss_pred c---cCCcceEEecCCCC---------CCccccCccccCcCCceEEEecCCceEEEeecCCccCCccc-----cccc---
Confidence 4 12223332211111 233332 3355444 45544 456999999999999753 3333
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCCCCcccc
Q psy15668 216 PTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDPFVRCRP 266 (365)
Q Consensus 216 ~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~ 266 (365)
++|+. ..|+.+++|++++++|.|+|.+||+|+++ .|.+
T Consensus 828 ----DeC~p--------srChp~A~CyntpgsfsC~C~pGy~GDGf-~CVP 865 (1289)
T KOG1214|consen 828 ----DECSP--------SRCHPAATCYNTPGSFSCRCQPGYYGDGF-QCVP 865 (1289)
T ss_pred ----cccCc--------cccCCCceEecCCCcceeecccCccCCCc-eecC
Confidence 44432 44788999999999999999999999987 5765
No 3
>KOG1214|consensus
Probab=99.56 E-value=2e-13 Score=132.22 Aligned_cols=204 Identities=25% Similarity=0.603 Sum_probs=141.3
Q ss_pred CCCCCCCCeeeec-CCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccCCCCCCeeeecCCCceee
Q psy15668 12 PNPCGSNTQCNVA-SNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNVRNHIPVCS 90 (365)
Q Consensus 12 ~~~C~~~~~C~~~-~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~ 90 (365)
+.-|..++.|... .-.|+|.|..||.|++ ..|.|.++|... ...|+.++.|++.+++|+|.
T Consensus 699 sh~cdt~a~C~pg~~~~~tcecs~g~~gdg------r~c~d~~eca~~------------~~~CGp~s~Cin~pg~~rce 760 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGVDYTCECSSGYQGDG------RNCVDENECATG------------FHRCGPNSVCINLPGSYRCE 760 (1289)
T ss_pred CcccCCCccccCCCCcceEEEEeeccCCCC------CCCCChhhhccC------------CCCCCCCceeecCCCceeEE
Confidence 4446677778854 3569999999999994 456677766643 23488999999999999999
Q ss_pred CCCCCc--cCCCCCcccCCC---CCCCCC--CCCCCCCe--eeeC-CCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668 91 CPPGYT--GDPLTQCRRFDP---QELCDR--SPCGVNTR--CEVI-NMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQ 160 (365)
Q Consensus 91 C~~G~~--g~~~~~C~~~~~---~~~C~~--~~C~~~~~--C~~~-~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~ 160 (365)
|..||. ++.. +|..+.+ .+.|.. +.|...+. |+.. .+.|.|.|.+||+|++. .|.+
T Consensus 761 C~~gy~F~dd~~-tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~-----~c~d-------- 826 (1289)
T KOG1214|consen 761 CRSGYEFADDRH-TCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH-----QCTD-------- 826 (1289)
T ss_pred EeecceeccCCc-ceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc-----cccc--------
Confidence 999885 3333 6876542 244542 44554444 4444 45799999999999865 4444
Q ss_pred cccCCcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCC----CCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668 161 SCVNYKCANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAH----SDCPTDRPSCLGNKCMNPCAGQCG 236 (365)
Q Consensus 161 ~C~~~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~----~~C~~~~~~C~~~~C~~~c~~~C~ 236 (365)
+|+|.++.|...+.|.+++++|.|.|.+||.|+++. |+.. ..|+... -.| ..|+
T Consensus 827 -------vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~-----CVP~~~~~T~C~~er----~hp------l~ch 884 (1289)
T KOG1214|consen 827 -------VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQ-----CVPDTSSLTPCEQER----FHP------LQCH 884 (1289)
T ss_pred -------ccccCccccCCCceEecCCCcceeecccCccCCCce-----ecCCCccCCcccccc----ccc------eeec
Confidence 688988889999999999999999999999999864 2222 1222210 001 2355
Q ss_pred CCceeee--cCCCceeeCCCCCccCCCCccccCCC
Q psy15668 237 INAKCEV--RGATPICSCPRDMTGDPFVRCRPFDK 269 (365)
Q Consensus 237 ~~~~C~~--~~~~~~C~C~~G~~g~~~~~C~~~~~ 269 (365)
.+..|.- .+..+.+.|.++-.|++..+|.++++
T Consensus 885 g~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~ 919 (1289)
T KOG1214|consen 885 GSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPE 919 (1289)
T ss_pred cccceeEeeCCCcccCCCCCCCCCCCCCCCCCccc
Confidence 4443321 25678899888888887667876543
No 4
>KOG4289|consensus
Probab=99.50 E-value=3.8e-13 Score=136.08 Aligned_cols=90 Identities=39% Similarity=0.868 Sum_probs=75.8
Q ss_pred cCCCceeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcc
Q psy15668 83 RNHIPVCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSC 162 (365)
Q Consensus 83 ~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C 162 (365)
..++++|.|++||+|+ .|+..+ |+|-+.||.+++.|....++|+|.|++||+|. .|+... .
T Consensus 1218 pvnglrCrCPpGFTgd---~CeTei--DlCYs~pC~nng~C~srEggYtCeCrpg~tGe-------hCEvs~---~---- 1278 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGD---YCETEI--DLCYSGPCGNNGRCRSREGGYTCECRPGFTGE-------HCEVSA---R---- 1278 (2531)
T ss_pred ccCceeEeCCCCCCcc---cccchh--HhhhcCCCCCCCceEEecCceeEEecCCcccc-------ceeeec---c----
Confidence 4577899999999999 898764 89999999999999999999999999999999 887541 1
Q ss_pred cCCcccCCCCCCCCCCCCeeeeC-CCceeeeCCCC
Q psy15668 163 VNYKCANPCASGACAPTAQCEVR-NHRAVCSCPVG 196 (365)
Q Consensus 163 ~~~~c~~~C~~~~C~~~~~C~~~-~g~~~C~C~~G 196 (365)
.-.|.++.|.++++|++. .++|.|.|+.|
T Consensus 1279 -----agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1279 -----AGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred -----cCccccceecCCCEEeecCCCceeccCCCc
Confidence 224556788889999864 46788999888
No 5
>KOG1217|consensus
Probab=99.49 E-value=2.9e-12 Score=125.44 Aligned_cols=274 Identities=26% Similarity=0.558 Sum_probs=168.3
Q ss_pred CCCCCCCCCCeeeecCCCceeeCCCCCccCCCCCccCC-CccCCCCCCCCCcccCCccccccccCCCCCCeeee---cCC
Q psy15668 10 CSPNPCGSNTQCNVASNRPVCSCLPGHWGNPLTYCQRG-ECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNV---RNH 85 (365)
Q Consensus 10 C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~-~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~---~~g 85 (365)
+...+....+.+......|.|.|++||.|. .++.. .|..... .+...+.|.. ...
T Consensus 92 ~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~---~~~~~~~C~~~~~------------------~~~~~~~c~~~~~~~~ 150 (487)
T KOG1217|consen 92 CRSPCLLLCGECVDCVGSYECTCPPGYQGT---PCEGECECVTGPG------------------VCCIDGSCSNGPGSVG 150 (487)
T ss_pred ccCCcccCCccccCCCCCceeeCCCccccC---cCCcceeecCCCC------------------CeeCchhhcCCCCCCC
Confidence 333344455667778899999999999998 33322 2322211 0111233443 345
Q ss_pred CceeeCCCCCccCCCCCcccCCCCCCCC--CCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCccc
Q psy15668 86 IPVCSCPPGYTGDPLTQCRRFDPQELCD--RSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCV 163 (365)
Q Consensus 86 ~~~C~C~~G~~g~~~~~C~~~~~~~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~ 163 (365)
.|.|.|..||.+. .+.... ++|. ..+|.+.+.|.+..++|.|.|+++|.+. .++.. .....|.
T Consensus 151 ~~~c~C~~g~~~~---~~~~~~--~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~-------~~~~~---~~~~~c~ 215 (487)
T KOG1217|consen 151 PFRCSCTEGYEGE---PCETDL--DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS-------TCETT---GNGGTCV 215 (487)
T ss_pred ceeeeeCCCcccc---cccccc--cccccCCCCcCCCcccccCCCCeeEeCCCCccCC-------cCcCC---CCCceEe
Confidence 7899999999998 665432 4676 3569989999999999999999999988 44332 1111111
Q ss_pred CC-cc-------cCCCCC--CCCCCC-CeeeeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy15668 164 NY-KC-------ANPCAS--GACAPT-AQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCA 232 (365)
Q Consensus 164 ~~-~c-------~~~C~~--~~C~~~-~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~ 232 (365)
.. .+ .+.|.. ..+... ++|++..++|.|.|++||.+... ..+.+++.|. ...
T Consensus 216 ~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~----~~~~~~~~C~-------~~~------ 278 (487)
T KOG1217|consen 216 DSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDAC----VTCVDVDSCA-------LIA------ 278 (487)
T ss_pred cceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCcccccc----ceeeeccccC-------CCC------
Confidence 00 00 111111 112222 78999999999999999998841 1122233333 221
Q ss_pred CCCCCCceeeecCCCceeeCCCCCccCCCCccccCCCC-----Ccccccccccceeee-cCCccceeeeeecCCCCCCCC
Q psy15668 233 GQCGINAKCEVRGATPICSCPRDMTGDPFVRCRPFDKY-----VAPLINDYLKIYWRY-QNNKTIFYVSLVSLNYPYVTP 306 (365)
Q Consensus 233 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~~-----~~~~~~~~~~~~~~c-~~~~~~~~~~~c~~~~~~~~~ 306 (365)
. |.++++|++..+.|.|.|++||+|.....+...... ...|.++. .| ..+....+.+.+..+|.+..|
T Consensus 279 ~-c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~-----~C~~~~~~~~~~C~c~~~~~g~~C 352 (487)
T KOG1217|consen 279 S-CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGG-----TCNTLGSFGGFRCACGPGFTGRRC 352 (487)
T ss_pred c-cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCc-----ccccCCCCCCCCcCCCCCCCCCcc
Confidence 1 556899999998899999999999832011111110 01111111 11 111112233456667778778
Q ss_pred CCC-CCCCCCCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668 307 LPD-DLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD 346 (365)
Q Consensus 307 ~~~-d~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~ 346 (365)
... ++|...++..++.|.+ ...++|.|.|+.+|.+.
T Consensus 353 ~~~~~~C~~~~~~~~~~c~~----~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 353 EDSNDECASSPCCPGGTCVN----ETPGSYRCACPAGFAGK 389 (487)
T ss_pred ccCCccccCCccccCCEecc----CCCCCeEecCCCccccC
Confidence 777 4998888889999997 24688999999999984
No 6
>KOG1219|consensus
Probab=99.41 E-value=4.4e-13 Score=140.15 Aligned_cols=109 Identities=32% Similarity=0.735 Sum_probs=96.1
Q ss_pred CCCCCCCCCCCCCeeeecC-CCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccC-CCCCCeeeecC
Q psy15668 7 GDPCSPNPCGSNTQCNVAS-NRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQ-CGVNSECNVRN 84 (365)
Q Consensus 7 id~C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~-C~~~~~C~~~~ 84 (365)
.+.|..+||+++|+|+.++ ++|.|.|++.|+|. +|+. + +.+|... |..+++|+...
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CEi----~---------------~epC~snPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCEI----D---------------LEPCASNPCLTGGTCIPFY 3921 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccCc---cccc----c---------------cccccCCCCCCCCEEEecC
Confidence 3889999999999999776 77999999999999 7764 2 3344444 88899999999
Q ss_pred CCceeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCC
Q psy15668 85 HIPVCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGS 141 (365)
Q Consensus 85 g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 141 (365)
+.|.|.|+.||+|. +|+.. .+++|+.++|..++.|++..|+|+|.|.+||.|.
T Consensus 3922 n~f~CnC~~gyTG~---~Ce~~-Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3922 NGFLCNCPNGYTGK---RCEAR-GISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred CCeeEeCCCCccCc---eeecc-cccccccccccCCceeeccCCceEeccChhHhcc
Confidence 99999999999999 99875 1389999999999999999999999999999988
No 7
>KOG1219|consensus
Probab=99.39 E-value=6.8e-13 Score=138.80 Aligned_cols=108 Identities=29% Similarity=0.727 Sum_probs=97.0
Q ss_pred CCCCCCCCCCCCeeeeC-CCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCCCCCCCCCCeeeeCCCc
Q psy15668 110 ELCDRSPCGVNTRCEVI-NMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNYKCANPCASGACAPTAQCEVRNHR 188 (365)
Q Consensus 110 ~~C~~~~C~~~~~C~~~-~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~~g~ 188 (365)
+.|..+||++++.|... .++|.|.|++.|+|. .|+.. +++|.++||..+++|+...++
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~-------~CEi~--------------~epC~snPC~~GgtCip~~n~ 3923 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN-------HCEID--------------LEPCASNPCLTGGTCIPFYNG 3923 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCc-------ccccc--------------cccccCCCCCCCCEEEecCCC
Confidence 67888999999999987 468999999999999 99987 789999999999999999999
Q ss_pred eeeeCCCCCccCCCCccCccccCCCCCCCC-CCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccC
Q psy15668 189 AVCSCPVGYLGDPYTSCRAECLAHSDCPTD-RPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGD 259 (365)
Q Consensus 189 ~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~-~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 259 (365)
|.|.|+.||+|. .|+.. +++|...+ |..+|.|+|..|+|+|.|.+||.|+
T Consensus 3924 f~CnC~~gyTG~-------------~Ce~~Gi~eCs~n~--------C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3924 FLCNCPNGYTGK-------------RCEARGISECSKNV--------CGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred eeEeCCCCccCc-------------eeeccccccccccc--------ccCCceeeccCCceEeccChhHhcc
Confidence 999999999999 56655 66776655 5569999999999999999999998
No 8
>KOG1217|consensus
Probab=99.33 E-value=4.4e-11 Score=117.03 Aligned_cols=202 Identities=29% Similarity=0.695 Sum_probs=135.5
Q ss_pred CCCC--CCCCCCCCeeeecCCCceeeCCCCCccCCCCCccCC----CccCCCCCCCCCcccCCccccccccC---CCCC-
Q psy15668 8 DPCS--PNPCGSNTQCNVASNRPVCSCLPGHWGNPLTYCQRG----ECQDHSDCSHSKACKEYRCVDVCAGQ---CGVN- 77 (365)
Q Consensus 8 d~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~----~C~~~~~C~~~~~C~~~~C~~~C~~~---C~~~- 77 (365)
++|. ..+|.++++|.+..++|.|.|++||.+. .++.. .|.+...|..... .. ...|... |...
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~---~~~~~~~~~~c~~~~~~~~~~g---~~-~~~c~~~~~~~~~~~ 242 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS---TCETTGNGGTCVDSVACSCPPG---AR-GPECEVSIVECASGD 242 (487)
T ss_pred cccccCCCCcCCCcccccCCCCeeEeCCCCccCC---cCcCCCCCceEecceeccCCCC---CC-CCCcccccccccCCC
Confidence 6786 4569999999999999999999999998 34322 2322211110000 00 1111111 3323
Q ss_pred CeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCCCC-CCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668 78 SECNVRNHIPVCSCPPGYTGDPLTQCRRFDPQELCDRSP-CGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDC 156 (365)
Q Consensus 78 ~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~-C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C 156 (365)
++|++..++|.|.|++||++.....+.++ ++|.... |.+++.|++..+.|.|.|++||.+. .+ ..+
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~~---~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~-------~~---~~~ 309 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDACVTCVDV---DSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR-------LC---TEC 309 (487)
T ss_pred CcccccCCceeeeCCCCccccccceeeec---cccCCCCccCCCCeeecCCCcceeeCCCCCCCC-------CC---ccc
Confidence 88999999999999999999831134555 7787753 8889999999998999999999998 43 011
Q ss_pred CCCCcccCCcccCCC----CCCCCCCCCee--eeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCC
Q psy15668 157 GPSQSCVNYKCANPC----ASGACAPTAQC--EVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNP 230 (365)
Q Consensus 157 ~~~~~C~~~~c~~~C----~~~~C~~~~~C--~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~ 230 (365)
.. ..+| ...+|..++.| ....+.+.|.|..||.|. .|+...++|...+
T Consensus 310 ~~---------~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~-------------~C~~~~~~C~~~~---- 363 (487)
T KOG1217|consen 310 VD---------VDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR-------------RCEDSNDECASSP---- 363 (487)
T ss_pred cc---------cccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC-------------ccccCCccccCCc----
Confidence 11 1233 23457777777 344557889999998888 4443223444333
Q ss_pred CCCCCCCCceeee-cCCCceeeCCCCCccC
Q psy15668 231 CAGQCGINAKCEV-RGATPICSCPRDMTGD 259 (365)
Q Consensus 231 c~~~C~~~~~C~~-~~~~~~C~C~~G~~g~ 259 (365)
+..++.|++ ..++|.|.|+.+|.+.
T Consensus 364 ----~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 364 ----CCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred ----cccCCEeccCCCCCeEecCCCccccC
Confidence 445889998 6899999999999874
No 9
>KOG1225|consensus
Probab=98.93 E-value=2.8e-08 Score=95.48 Aligned_cols=131 Identities=34% Similarity=0.887 Sum_probs=96.4
Q ss_pred eeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCcc
Q psy15668 88 VCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNYKC 167 (365)
Q Consensus 88 ~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c 167 (365)
.|.|..+|+|. .|... .|.. .|..++.|++. +|.|++||+|. .|...
T Consensus 235 ic~c~~~~~g~---~c~~~----~C~~-~c~~~g~c~~G----~CIC~~Gf~G~-------dC~e~-------------- 281 (525)
T KOG1225|consen 235 ICECPEGYFGP---LCSTI----YCPG-GCTGRGQCVEG----RCICPPGFTGD-------DCDEL-------------- 281 (525)
T ss_pred eeecCCceeCC---ccccc----cCCC-CCcccceEeCC----eEeCCCCCcCC-------CCCcc--------------
Confidence 79999999998 77743 3543 45556778766 89999999999 66642
Q ss_pred cCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeeecCCC
Q psy15668 168 ANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCAGQCGINAKCEVRGAT 247 (365)
Q Consensus 168 ~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~ 247 (365)
.|... |..++.+++. .|.|++||.|. .|.... | ...|+.++.|+ .
T Consensus 282 --~Cp~~-cs~~g~~~~g----~CiC~~g~~G~-------------dCs~~~--c---------padC~g~G~Ci----~ 326 (525)
T KOG1225|consen 282 --VCPVD-CSGGGVCVDG----ECICNPGYSGK-------------DCSIRR--C---------PADCSGHGKCI----D 326 (525)
T ss_pred --cCCcc-cCCCceecCC----EeecCCCcccc-------------cccccc--C---------CccCCCCCccc----C
Confidence 34333 6666666543 69999999999 555422 2 24577799998 2
Q ss_pred ceeeCCCCCccCCCCccccCCCCCcccccccccceeeecCCccceeeeeecCCCCCCCCCCCCCCCCCCCCCCCeecCCC
Q psy15668 248 PICSCPRDMTGDPFVRCRPFDKYVAPLINDYLKIYWRYQNNKTIFYVSLVSLNYPYVTPLPDDLCEPNPCGENAKCQPGY 327 (365)
Q Consensus 248 ~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~~~~~~~~~~d~C~~~~C~~~~~C~~~~ 327 (365)
-+|.|.+||+|. .|+. . .|.+++.|++
T Consensus 327 G~C~C~~Gy~G~---~C~~-------------------------------------------~-----~C~~~g~cv~-- 353 (525)
T KOG1225|consen 327 GECLCDEGYTGE---LCIQ-------------------------------------------R-----ACSGGGQCVN-- 353 (525)
T ss_pred CceEeCCCCcCC---cccc-------------------------------------------c-----ccCCCceecc--
Confidence 379999999998 6763 1 3788889987
Q ss_pred CCCCCCCceeeCCCCCccCC
Q psy15668 328 DKSGKDRPVCTCLPGYVGDA 347 (365)
Q Consensus 328 ~~~~~~~~~C~C~~G~~g~~ 347 (365)
. |.|..||.|..
T Consensus 354 -------g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 354 -------G-CKCKKGWRGPD 365 (525)
T ss_pred -------C-ceeccCccCCC
Confidence 3 99999999887
No 10
>KOG0994|consensus
Probab=98.91 E-value=1.9e-08 Score=101.25 Aligned_cols=233 Identities=23% Similarity=0.441 Sum_probs=124.7
Q ss_pred eeecCCCcee-eCCCCCccCCCCCcccCCCCCCCCCCCCCCCC--------eeee--CCCCceeeCCCCCCCCCCCCCCC
Q psy15668 80 CNVRNHIPVC-SCPPGYTGDPLTQCRRFDPQELCDRSPCGVNT--------RCEV--INMVPTCSCLPGYTGSPLSGCRH 148 (365)
Q Consensus 80 C~~~~g~~~C-~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~--------~C~~--~~~~~~C~C~~G~~g~~~~~~~~ 148 (365)
|.+...++.| .|..||.|+++..-. ..|.+-||..+. .|.. ......|.|.+||+|.
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg~g-----~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~------- 945 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLGSG-----IGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGS------- 945 (1758)
T ss_pred ccccccccchhhhhccccCCcccCCC-----CCCCCCCCCCCCccchhccccccccccccceeeecccCcccc-------
Confidence 5566777889 899999998752211 346655554421 2322 2334579999999998
Q ss_pred CCCCC-----CCCCCCCcccCCcc---cCCCCCCCCCC-CCee---eeCCCceee-eCCCCCccCCCCccCccccCCCCC
Q psy15668 149 ECDSD-----YDCGPSQSCVNYKC---ANPCASGACAP-TAQC---EVRNHRAVC-SCPVGYLGDPYTSCRAECLAHSDC 215 (365)
Q Consensus 149 ~C~~~-----~~C~~~~~C~~~~c---~~~C~~~~C~~-~~~C---~~~~g~~~C-~C~~G~~g~~~~~~~~~C~~~~~C 215 (365)
.|+.= ..-..+++|..-.| ||.-.+..|.. .|.| .....+-+| .|.+||.|+.....-..| .|
T Consensus 946 RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC----~C 1021 (1758)
T KOG0994|consen 946 RCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRC----VC 1021 (1758)
T ss_pred chhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhh----ec
Confidence 55421 00112556654322 44444444542 3334 333334456 799999998643211111 11
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCCCCccccCCC------CCccc--------ccccccc
Q psy15668 216 PTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDPFVRCRPFDK------YVAPL--------INDYLKI 281 (365)
Q Consensus 216 ~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~------~~~~~--------~~~~~~~ 281 (365)
..-. . .+..+|.. -+-+|.|.|...|.....|..... .+.+| ..+...+
T Consensus 1022 n~LG--------------T-n~~~~CDr--~tGQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG 1084 (1758)
T KOG0994|consen 1022 NFLG--------------T-NSTCHCDR--FTGQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG 1084 (1758)
T ss_pred cccc--------------c-CCcccccc--ccCcCCCCcccccccccccccchhccccCCCCCccCCCccCCcccccccc
Confidence 1000 0 00112221 222455555555542222222111 11111 2333455
Q ss_pred eeeecCCccceeeeeecCCCCCCCCCCCCCCCCCCCCCCC----eecCCCCCCCCCCceeeCCCCCccCCCCCCcCCC
Q psy15668 282 YWRYQNNKTIFYVSLVSLNYPYVTPLPDDLCEPNPCGENA----KCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGE 355 (365)
Q Consensus 282 ~~~c~~~~~~~~~~~c~~~~~~~~~~~~d~C~~~~C~~~~----~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~~~ 355 (365)
.+.|.+++++..|+.|..-|||..-+ .|..-.|...| .|+. ...+|+|.+|-.|..+++|...-
T Consensus 1085 QCqCkpGfGGR~C~qCqel~WGdP~~---~C~aCdCd~rG~~tpQCdr-------~tG~C~C~~Gv~G~rCdqCaRgy 1152 (1758)
T KOG0994|consen 1085 QCQCKPGFGGRTCSQCQELYWGDPNE---KCRACDCDPRGIETPQCDR-------ATGRCVCRPGVGGPRCDQCARGY 1152 (1758)
T ss_pred ceeccCCCCCcchhHHHHhhcCCCCC---CceecCCCCCCCCCCCccc-------cCCceeecCCCCCcchhhhhhhh
Confidence 78899999999999999999985422 34332344332 3554 22489999999999988888653
No 11
>KOG1225|consensus
Probab=98.88 E-value=1.7e-08 Score=96.97 Aligned_cols=132 Identities=32% Similarity=0.874 Sum_probs=92.4
Q ss_pred ceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccCCCCCCeeeecCCCceeeCCCCCccCCCCCcccCC
Q psy15668 28 PVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFD 107 (365)
Q Consensus 28 ~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~ 107 (365)
+.|.|+.||+|. .|....| +..|..++.|++. +|.|++||+|. .|...
T Consensus 234 ~ic~c~~~~~g~---~c~~~~C---------------------~~~c~~~g~c~~G----~CIC~~Gf~G~---dC~e~- 281 (525)
T KOG1225|consen 234 GICECPEGYFGP---LCSTIYC---------------------PGGCTGRGQCVEG----RCICPPGFTGD---DCDEL- 281 (525)
T ss_pred ceeecCCceeCC---ccccccC---------------------CCCCcccceEeCC----eEeCCCCCcCC---CCCcc-
Confidence 368888888887 4433222 2224445566543 79999999999 88764
Q ss_pred CCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCCCCCCCCCCeeeeCCC
Q psy15668 108 PQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNYKCANPCASGACAPTAQCEVRNH 187 (365)
Q Consensus 108 ~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~~g 187 (365)
.|... |..++.+++. .|.|.+||+|. .|+.. .|. ..|..++.|+. +
T Consensus 282 ---~Cp~~-cs~~g~~~~g----~CiC~~g~~G~-------dCs~~----------------~cp-adC~g~G~Ci~--G 327 (525)
T KOG1225|consen 282 ---VCPVD-CSGGGVCVDG----ECICNPGYSGK-------DCSIR----------------RCP-ADCSGHGKCID--G 327 (525)
T ss_pred ---cCCcc-cCCCceecCC----EeecCCCcccc-------ccccc----------------cCC-ccCCCCCcccC--C
Confidence 26544 7777777665 89999999999 77643 233 56888899982 2
Q ss_pred ceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCC
Q psy15668 188 RAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDP 260 (365)
Q Consensus 188 ~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~ 260 (365)
+|.|.+||+|. .|... .|..++.|++ + |.|..||+|..
T Consensus 328 --~C~C~~Gy~G~-------------~C~~~---------------~C~~~g~cv~--g---C~C~~Gw~G~d 365 (525)
T KOG1225|consen 328 --ECLCDEGYTGE-------------LCIQR---------------ACSGGGQCVN--G---CKCKKGWRGPD 365 (525)
T ss_pred --ceEeCCCCcCC-------------ccccc---------------ccCCCceecc--C---ceeccCccCCC
Confidence 49999999999 55542 1445777765 2 89999999874
No 12
>KOG4260|consensus
Probab=98.79 E-value=6e-09 Score=89.44 Aligned_cols=150 Identities=24% Similarity=0.532 Sum_probs=95.6
Q ss_pred CCCCCCCCeee---ecCCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccCCCCCCeeeecCCCce
Q psy15668 12 PNPCGSNTQCN---VASNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNVRNHIPV 88 (365)
Q Consensus 12 ~~~C~~~~~C~---~~~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~ 88 (365)
..||..+|.|. ...|+..|.|.+||+|..+..|....=+... =..+..|. .|...|.. .|. ..++-.
T Consensus 149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~R-ne~~lvCt--~Ch~~C~~------~Cs-g~~~k~ 218 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSR-NEQHLVCT--ACHEGCLG------VCS-GESSKG 218 (350)
T ss_pred cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhc-ccccchhh--hhhhhhhc------ccC-CCCCCC
Confidence 46799999999 4558899999999999954333210000000 00011111 11112221 232 223335
Q ss_pred e-eCCCCCccCCCCCcccCCCCCCCC--CCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCC
Q psy15668 89 C-SCPPGYTGDPLTQCRRFDPQELCD--RSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNY 165 (365)
Q Consensus 89 C-~C~~G~~g~~~~~C~~~~~~~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~ 165 (365)
| +|+.||..+.. .|.++ ++|. +.||..+..|+|+.|+|.|..++||.+. .+.|+.-
T Consensus 219 C~kCkkGW~lde~-gCvDv---nEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d~C~~~------------ 277 (350)
T KOG4260|consen 219 CSKCKKGWKLDEE-GCVDV---NECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VDECQFC------------ 277 (350)
T ss_pred hhhhcccceeccc-ccccH---HHHhcCCCCCChhheeecCCCceEecccccccCC-----hHHhhhh------------
Confidence 6 79999998755 79888 8896 4789999999999999999999999763 1133321
Q ss_pred cccCCCCCCCCCCCCeeeeCCCceeeeCCCCCc
Q psy15668 166 KCANPCASGACAPTAQCEVRNHRAVCSCPVGYL 198 (365)
Q Consensus 166 ~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~ 198 (365)
.+.|. ..+..|.++.++|+|.|..|+.
T Consensus 278 --~d~~~----~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 278 --ADVCA----SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred --hhhcc----cCCCCcccCCccEEEEecccce
Confidence 12222 2456788999999999988874
No 13
>KOG0994|consensus
Probab=98.74 E-value=2.2e-07 Score=93.77 Aligned_cols=231 Identities=25% Similarity=0.567 Sum_probs=121.8
Q ss_pred ee-eCCCCCccCCCCCcccCC---CCCCCCCCCCCCCC---eeeeCCCCcee-eCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668 88 VC-SCPPGYTGDPLTQCRRFD---PQELCDRSPCGVNT---RCEVINMVPTC-SCLPGYTGSPLSGCRHECDSDYDCGPS 159 (365)
Q Consensus 88 ~C-~C~~G~~g~~~~~C~~~~---~~~~C~~~~C~~~~---~C~~~~~~~~C-~C~~G~~g~~~~~~~~~C~~~~~C~~~ 159 (365)
.| .|.+||+|.+ .|.... -.+.|.+. .+ .|.+...+++| .|..||.|++.......|..
T Consensus 842 qCnqCqpG~WgFP--eCr~CqCNgHA~~Cd~~----tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrP------- 908 (1758)
T KOG0994|consen 842 QCNQCQPGYWGFP--ECRPCQCNGHADTCDPI----TGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRP------- 908 (1758)
T ss_pred hccccCCCccCCC--cCccccccCcccccCcc----ccccccccccccccchhhhhccccCCcccCCCCCCCC-------
Confidence 45 6888888875 343210 00222221 12 34556677889 89999999865332222221
Q ss_pred CcccCCcccCCCCCCCC---CCCCeee--eCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCC---
Q psy15668 160 QSCVNYKCANPCASGAC---APTAQCE--VRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPC--- 231 (365)
Q Consensus 160 ~~C~~~~c~~~C~~~~C---~~~~~C~--~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c--- 231 (365)
=+|...|- .....|. +......|.|.+||.|.+++.+.+.-...+. ....|+.-.|.+-=
T Consensus 909 ---------CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~---~GGtCq~CeC~~NiD~~ 976 (1758)
T KOG0994|consen 909 ---------CPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPS---EGGTCQKCECSNNIDLY 976 (1758)
T ss_pred ---------CCCCCCCccchhccccccccccccceeeecccCccccchhhhcccccCCcc---cCCccccccccCCcCcc
Confidence 01111110 0111242 2334568999999999987654432111100 01112111111100
Q ss_pred -CCCCCC-Cce---eeecCCCcee-eCCCCCccCC-CCccccCCC----CCcccccccccceeeecCCccceeeeeecCC
Q psy15668 232 -AGQCGI-NAK---CEVRGATPIC-SCPRDMTGDP-FVRCRPFDK----YVAPLINDYLKIYWRYQNNKTIFYVSLVSLN 300 (365)
Q Consensus 232 -~~~C~~-~~~---C~~~~~~~~C-~C~~G~~g~~-~~~C~~~~~----~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~ 300 (365)
.+.|.. .|. |+....+-+| .|++||.|+. ...|..-.. ....+..+...+.+.|.++.-+..|.+|..+
T Consensus 977 d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CDqCA~N 1056 (1758)
T KOG0994|consen 977 DPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCDQCAEN 1056 (1758)
T ss_pred CCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhheccccccCCccccccccCcCCCCcccccccccccccc
Confidence 011221 122 3333334466 7999999986 233443111 2334666777778999999999999999999
Q ss_pred CCCCCCCCCCCCCCCCCCC--CCeecCCCCCCCCCCceeeCCCCCccCCCCCCc
Q psy15668 301 YPYVTPLPDDLCEPNPCGE--NAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCR 352 (365)
Q Consensus 301 ~~~~~~~~~d~C~~~~C~~--~~~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~ 352 (365)
+|...- ...|++-.|.. +-+|.. -..+|+|+|||-|..+++|.
T Consensus 1057 ~w~laS--G~GCe~C~Cd~~~~pqCN~-------ftGQCqCkpGfGGR~C~qCq 1101 (1758)
T KOG0994|consen 1057 HWNLAS--GEGCEPCNCDPIGGPQCNE-------FTGQCQCKPGFGGRTCSQCQ 1101 (1758)
T ss_pred hhcccc--CCCCCccCCCccCCccccc-------cccceeccCCCCCcchhHHH
Confidence 985320 12333333333 224543 22489999999998877665
No 14
>KOG4260|consensus
Probab=98.46 E-value=3.1e-07 Score=79.12 Aligned_cols=135 Identities=24% Similarity=0.501 Sum_probs=84.8
Q ss_pred CCCCCeeee---cCCCceeeCCCCCccCCCCCcccCCC------CCC----CCC--CCCCCCCeeeeCCCCcee-eCCCC
Q psy15668 74 CGVNSECNV---RNHIPVCSCPPGYTGDPLTQCRRFDP------QEL----CDR--SPCGVNTRCEVINMVPTC-SCLPG 137 (365)
Q Consensus 74 C~~~~~C~~---~~g~~~C~C~~G~~g~~~~~C~~~~~------~~~----C~~--~~C~~~~~C~~~~~~~~C-~C~~G 137 (365)
|..++.|.- ..|+..|.|.+||+|. .|..... .++ |.. .+|. +.|.. .+.-.| .|..|
T Consensus 152 C~GnG~C~GdGsR~GsGkCkC~~GY~Gp---~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~Csg-~~~k~C~kCkkG 225 (350)
T KOG4260|consen 152 CFGNGSCHGDGSREGSGKCKCETGYTGP---LCRYCGIEYFESSRNEQHLVCTACHEGCL--GVCSG-ESSKGCSKCKKG 225 (350)
T ss_pred cCCCCcccCCCCCCCCCcccccCCCCCc---cccccchHHHHhhcccccchhhhhhhhhh--cccCC-CCCCChhhhccc
Confidence 444555542 4577899999999998 6653210 000 100 1121 23332 223345 78888
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCC--CCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCCCCC
Q psy15668 138 YTGSPLSGCRHECDSDYDCGPSQSCVNYKCANPCA--SGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDC 215 (365)
Q Consensus 138 ~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C 215 (365)
|..... .|.+ ||+|. +.||..+..|+|+.|+|.|..++||.+. . ++|
T Consensus 226 W~lde~-----gCvD---------------vnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-~----------d~C 274 (350)
T KOG4260|consen 226 WKLDEE-----GCVD---------------VNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-V----------DEC 274 (350)
T ss_pred ceeccc-----cccc---------------HHHHhcCCCCCChhheeecCCCceEecccccccCC-h----------HHh
Confidence 876521 3333 66775 4568888899999999999999999763 1 155
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCc
Q psy15668 216 PTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMT 257 (365)
Q Consensus 216 ~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~ 257 (365)
+.-.+.|. ..+..|.|+.+.|+|+|..|+.
T Consensus 275 ~~~~d~~~------------~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 275 QFCADVCA------------SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred hhhhhhcc------------cCCCCcccCCccEEEEecccce
Confidence 54222232 2367899999999999998874
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.38 E-value=1.7e-07 Score=58.86 Aligned_cols=34 Identities=32% Similarity=0.579 Sum_probs=30.3
Q ss_pred CCCCCCC--CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668 6 GGDPCSP--NPCGSNTQCNVASNRPVCSCLPGHWGN 39 (365)
Q Consensus 6 did~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g~ 39 (365)
|||||+. ++|..+++|+|+.|+|+|.|++||...
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 7999974 569889999999999999999999843
No 16
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.27 E-value=3.9e-07 Score=53.43 Aligned_cols=30 Identities=37% Similarity=0.825 Sum_probs=27.9
Q ss_pred CCCCCCCCCCeeeecC-CCceeeCCCCCccC
Q psy15668 10 CSPNPCGSNTQCNVAS-NRPVCSCLPGHWGN 39 (365)
Q Consensus 10 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~ 39 (365)
|.++||.++|+|++.. ++|+|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 6678999999999999 99999999999986
No 17
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.19 E-value=2.4e-06 Score=52.63 Aligned_cols=34 Identities=32% Similarity=0.716 Sum_probs=31.2
Q ss_pred CCCCCCC-CCCCCCCeeeecCCCceeeCCCCCc-cC
Q psy15668 6 GGDPCSP-NPCGSNTQCNVASNRPVCSCLPGHW-GN 39 (365)
Q Consensus 6 did~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~-g~ 39 (365)
|+|+|.. ++|.++++|+++.++|.|.|++||. |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 5899987 8999999999999999999999999 66
No 18
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=98.12 E-value=3.2e-07 Score=75.02 Aligned_cols=141 Identities=25% Similarity=0.613 Sum_probs=86.7
Q ss_pred CCeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCC-----CCCCCCCeeeeCC-----CCceeeCCCCCCCCCCCCC
Q psy15668 77 NSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPQELCDR-----SPCGVNTRCEVIN-----MVPTCSCLPGYTGSPLSGC 146 (365)
Q Consensus 77 ~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~-----~~C~~~~~C~~~~-----~~~~C~C~~G~~g~~~~~~ 146 (365)
+|..+...+.|.|.|.+||......+|+.. .+|.. .+|+..+.|++.. ..|.|.|.+||....
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~k---v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~---- 82 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKNENTCEEK---VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ---- 82 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEETTEEEE-------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS----
T ss_pred CCEEEEccCceEEEcCCCcEEccccccccc---eecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC----
Confidence 466677788899999999987555478877 45643 6799999998775 479999999998662
Q ss_pred CCCCCCCCCCCCCCcccCCcccCCCCCCCCCCCCeeeeC---CCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCC
Q psy15668 147 RHECDSDYDCGPSQSCVNYKCANPCASGACAPTAQCEVR---NHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCL 223 (365)
Q Consensus 147 ~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~---~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~ 223 (365)
..|+ .+.|....|+ .|.|+.. .....|+|.-|+..+. ...|....
T Consensus 83 -------------~vCv----p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d----------n~kCtk~G---- 130 (197)
T PF06247_consen 83 -------------GVCV----PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD----------NKKCTKTG---- 130 (197)
T ss_dssp -------------SSEE----EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTT----------TTESEEEE----
T ss_pred -------------CeEc----hhhcCceecC-CCeEEecCCCCCCceeEeeeceEecc----------CCcccCCC----
Confidence 1222 2345545676 5789732 3345999999987321 11232211
Q ss_pred CCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCC
Q psy15668 224 GNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDP 260 (365)
Q Consensus 224 ~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~ 260 (365)
..+|...|..+..|..+.+-|+|.+.+||.+++
T Consensus 131 ----~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 131 ----ETKCSLKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp ------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred ----ccceeeecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 123345677789999999999999999998764
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.92 E-value=3.5e-06 Score=52.87 Aligned_cols=32 Identities=34% Similarity=0.770 Sum_probs=28.4
Q ss_pred CCCCCCC--CCCCCCCeecCCCCCCCCCCceeeCCCCCc
Q psy15668 308 PDDLCEP--NPCGENAKCQPGYDKSGKDRPVCTCLPGYV 344 (365)
Q Consensus 308 ~~d~C~~--~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~ 344 (365)
|||||.. +.|..++.|+| +.|+|+|.|++||+
T Consensus 1 DidEC~~~~~~C~~~~~C~N-----~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVN-----TEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEE-----ETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEc-----CCCCEEeeCCCCcE
Confidence 4899974 57988999999 77999999999998
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.89 E-value=3.6e-06 Score=49.33 Aligned_cols=32 Identities=34% Similarity=0.886 Sum_probs=26.5
Q ss_pred CCCCCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668 312 CEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA 347 (365)
Q Consensus 312 C~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~ 347 (365)
|.+++|.++|+|++ ...++|+|+|++||+|..
T Consensus 1 C~~~~C~n~g~C~~----~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCID----LPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEE----ESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEe----CCCCCEEeECCCCCccCC
Confidence 45679999999998 223889999999999963
No 21
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.85 E-value=2.5e-05 Score=47.56 Aligned_cols=34 Identities=35% Similarity=0.735 Sum_probs=31.0
Q ss_pred CCCCCCC-CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668 6 GGDPCSP-NPCGSNTQCNVASNRPVCSCLPGHWGN 39 (365)
Q Consensus 6 did~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~ 39 (365)
++++|.. .+|.++++|++..++|.|.|++||.|.
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 4788987 799988999999999999999999986
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.80 E-value=1.3e-05 Score=48.15 Aligned_cols=29 Identities=28% Similarity=0.700 Sum_probs=23.6
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy15668 13 NPCGSNTQCNVASNRPVCSCLPGHWGNPL 41 (365)
Q Consensus 13 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~ 41 (365)
..|+.+++|+++.++|+|.|++||.|+++
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 45999999999999999999999999963
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.67 E-value=6.1e-05 Score=46.19 Aligned_cols=34 Identities=35% Similarity=0.818 Sum_probs=29.2
Q ss_pred CCCCCC-CCCCCCCeecCCCCCCCCCCceeeCCCCCc-cCC
Q psy15668 309 DDLCEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYV-GDA 347 (365)
Q Consensus 309 ~d~C~~-~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~-g~~ 347 (365)
+|+|.. ++|.++++|++ ..++|+|.|++||+ |..
T Consensus 2 ~~~C~~~~~C~~~~~C~~-----~~g~~~C~C~~g~~~g~~ 37 (39)
T smart00179 2 IDECASGNPCQNGGTCVN-----TVGSYRCECPPGYTDGRN 37 (39)
T ss_pred cccCcCCCCcCCCCEeEC-----CCCCeEeECCCCCccCCc
Confidence 688987 78999999998 66889999999998 543
No 24
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.61 E-value=3.2e-05 Score=46.42 Aligned_cols=29 Identities=38% Similarity=0.836 Sum_probs=23.7
Q ss_pred CCCCCCceeeecCCCceeeCCCCCccCCC
Q psy15668 233 GQCGINAKCEVRGATPICSCPRDMTGDPF 261 (365)
Q Consensus 233 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~ 261 (365)
+.|+.+++|+++.++|.|+|++||+|++.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 45888999999999999999999999975
No 25
>KOG1836|consensus
Probab=97.60 E-value=0.004 Score=68.49 Aligned_cols=242 Identities=23% Similarity=0.475 Sum_probs=120.8
Q ss_pred eeecCCCcee-eCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCC--CCceee-CCCCCCCCCCCCCCCCCCCCC-
Q psy15668 80 CNVRNHIPVC-SCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVIN--MVPTCS-CLPGYTGSPLSGCRHECDSDY- 154 (365)
Q Consensus 80 C~~~~g~~~C-~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~--~~~~C~-C~~G~~g~~~~~~~~~C~~~~- 154 (365)
|+....+-.| +|..||.|.... ... ..|.+-+|...+.|..+. ....|. |++||+|. .|+.-+
T Consensus 749 C~~~t~G~~C~~C~~GfYg~~~~--~~~---~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~-------rCe~c~d 816 (1705)
T KOG1836|consen 749 CKHNTFGGQCAQCVDGFYGLPDL--GTS---GDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL-------RCEECAD 816 (1705)
T ss_pred cccCCCCCchhhhcCCCCCcccc--CCC---CCCccCCCCCChhhcCcCcccceecCCCCCCCccc-------ccccCCC
Confidence 4433444466 799999987431 111 227777888877777654 456787 99999998 444211
Q ss_pred -----CC---CCCCcccCCcc---cCCCCCCCCCC-CCee---eeCCCceee-eCCCCCccCCCCccCccccCCCCCCCC
Q psy15668 155 -----DC---GPSQSCVNYKC---ANPCASGACAP-TAQC---EVRNHRAVC-SCPVGYLGDPYTSCRAECLAHSDCPTD 218 (365)
Q Consensus 155 -----~C---~~~~~C~~~~c---~~~C~~~~C~~-~~~C---~~~~g~~~C-~C~~G~~g~~~~~~~~~C~~~~~C~~~ 218 (365)
+= .+...|..-.| +|+=....|.. .+.| +....+..| .|.+||.|+...-..
T Consensus 817 gyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p------------ 884 (1705)
T KOG1836|consen 817 GYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNP------------ 884 (1705)
T ss_pred ccccCCCCCCCCcccCccceeccccCccccccccccccceeeccCCcccccccccccCccccccCCCc------------
Confidence 00 00122221111 22222223332 2233 322333445 799999988653110
Q ss_pred CCCCCCCCCCCCC----CCCCCC-Ccee--eecCCCcee-eCCCCCccCC-CCccccCCC---CCcccccccccceeeec
Q psy15668 219 RPSCLGNKCMNPC----AGQCGI-NAKC--EVRGATPIC-SCPRDMTGDP-FVRCRPFDK---YVAPLINDYLKIYWRYQ 286 (365)
Q Consensus 219 ~~~C~~~~C~~~c----~~~C~~-~~~C--~~~~~~~~C-~C~~G~~g~~-~~~C~~~~~---~~~~~~~~~~~~~~~c~ 286 (365)
.+.|..--|...= ...|.+ -|.| .....+-.| .|.+||.+.. ...|+.... ............++.|.
T Consensus 885 ~~~c~~c~c~p~gs~~~~~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~~tGqc~c~ 964 (1705)
T KOG1836|consen 885 EDKCFACGCVPAGSELPSLTCNPVTGQCECKPNVEGRDCLYCFKGFFNLNSGVGCEPCNCDPTGSESSDCDVGTGQCYCR 964 (1705)
T ss_pred CCccccccCccCCcccccccCCCcccceeccCCCCccccccccccccccCCCCCcccccccccccccccccccCCceeee
Confidence 1111111110000 000111 1111 111111222 4555555442 112332111 00111222234467789
Q ss_pred CCccceeeeeecCCCCCCCCCCCCCCCCCCCCCCC----eecCCCCCCCCCCceeeCCCCCccCCCCCCcCCC
Q psy15668 287 NNKTIFYVSLVSLNYPYVTPLPDDLCEPNPCGENA----KCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGE 355 (365)
Q Consensus 287 ~~~~~~~~~~c~~~~~~~~~~~~d~C~~~~C~~~~----~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~~~ 355 (365)
++.++..+.+|..+++++.- ..|..--|...+ .|.. ...+|.|+++|.|...++|.++.
T Consensus 965 ~gVtgqrc~qc~~~~~~~~~---~gc~~c~c~~~Gs~~~qc~~-------~~G~c~c~~~~~g~~c~~c~~~~ 1027 (1705)
T KOG1836|consen 965 PGVTGQRCDQCETYHFGFQT---EGCGLCECDPLGSRGFQCDP-------EDGQCPCRPGFEGRRCDQCEEGF 1027 (1705)
T ss_pred cCccccccCccccCcccccc---cCCcceecccCCcccceecc-------cCCeeeecCCCCCcccccccCCc
Confidence 99999999999999998763 333322244433 5765 23489999999999999998764
No 26
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF), present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.35 E-value=0.00028 Score=42.04 Aligned_cols=30 Identities=33% Similarity=0.852 Sum_probs=27.0
Q ss_pred CC-CCCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668 10 CS-PNPCGSNTQCNVASNRPVCSCLPGHWGN 39 (365)
Q Consensus 10 C~-~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 39 (365)
|. ..+|.++++|+++.++|+|.|++||.|.
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 55 6789989999999999999999999887
No 27
>KOG1226|consensus
Probab=97.31 E-value=0.0041 Score=61.87 Aligned_cols=143 Identities=29% Similarity=0.656 Sum_probs=82.4
Q ss_pred eeeCCCCCCCCCCCCCCCCCCCCCCCCCC----CcccCCcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCc----cCCC
Q psy15668 131 TCSCLPGYTGSPLSGCRHECDSDYDCGPS----QSCVNYKCANPCASGACAPTAQCEVRNHRAVCSCPVGYL----GDPY 202 (365)
Q Consensus 131 ~C~C~~G~~g~~~~~~~~~C~~~~~C~~~----~~C~~~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~----g~~~ 202 (365)
.|.|.+||.|. .|+-....... ..|.. .-.+.+|...|.|.=. .|+|.+... |.
T Consensus 479 ~C~C~~G~~G~-------~CEC~~~~~ss~~~~~~Cr~-----~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~-- 540 (783)
T KOG1226|consen 479 QCRCDEGWLGK-------KCECSTDELSSSEEEDKCRE-----NSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK-- 540 (783)
T ss_pred ceecCCCCCCC-------cccCCccccCcHhHHhhccC-----CCCCCCcCCCCcEeCC----ceEecCCCCCceeee--
Confidence 57999999999 55532211111 11211 1112368888877532 378877765 44
Q ss_pred CccCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCCCCccccCCCCCcccccccccce
Q psy15668 203 TSCRAECLAHSDCPTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDPFVRCRPFDKYVAPLINDYLKIY 282 (365)
Q Consensus 203 ~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~~~ 282 (365)
.|+-+.-.|.+.. ...|+.+++|.- -+|+|.+||+|+ .|+--
T Consensus 541 -----------fCECDnfsC~r~~-----g~lC~g~G~C~C----G~CvC~~GwtG~---~C~C~--------------- 582 (783)
T KOG1226|consen 541 -----------FCECDNFSCERHK-----GVLCGGHGRCEC----GRCVCNPGWTGS---ACNCP--------------- 582 (783)
T ss_pred -----------eeeccCccccccc-----CcccCCCCeEeC----CcEEcCCCCccC---CCCCC---------------
Confidence 2322222222111 134777888853 379999999999 66520
Q ss_pred eeecCCccceeeeeecCCCCCCCCCCCCCCCC---CCCCCCCeecCCCCCCCCCCceeeCCCC-CccCCCCCCcC--CCC
Q psy15668 283 WRYQNNKTIFYVSLVSLNYPYVTPLPDDLCEP---NPCGENAKCQPGYDKSGKDRPVCTCLPG-YVGDALTYCRR--GEC 356 (365)
Q Consensus 283 ~~c~~~~~~~~~~~c~~~~~~~~~~~~d~C~~---~~C~~~~~C~~~~~~~~~~~~~C~C~~G-~~g~~~~~C~~--~~C 356 (365)
.+.+.|.+ ..|...|+|.= .+|+|... |.|..++.|.. ++|
T Consensus 583 ------------------------~std~C~~~~G~iCSGrG~C~C---------g~C~C~~~~~sG~~CE~cptc~~~C 629 (783)
T KOG1226|consen 583 ------------------------LSTDTCESSDGQICSGRGTCEC---------GRCKCTDPPYSGEFCEKCPTCPDPC 629 (783)
T ss_pred ------------------------CCCccccCCCCceeCCCceeeC---------CceEcCCCCcCcchhhcCCCCCCcc
Confidence 11466653 24777777775 37888765 89888666663 356
Q ss_pred CCCCcc
Q psy15668 357 QSDAEC 362 (365)
Q Consensus 357 ~~~~~C 362 (365)
.....|
T Consensus 630 ~~~~~C 635 (783)
T KOG1226|consen 630 AENKSC 635 (783)
T ss_pred cccccc
Confidence 655554
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.30 E-value=0.00037 Score=41.60 Aligned_cols=30 Identities=33% Similarity=0.872 Sum_probs=26.2
Q ss_pred CCCC-CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668 9 PCSP-NPCGSNTQCNVASNRPVCSCLPGHWGN 39 (365)
Q Consensus 9 ~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~ 39 (365)
+|.. .+|.++ +|+++.++|+|.|++||.|.
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence 3666 689888 99999999999999999983
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.22 E-value=5e-05 Score=62.40 Aligned_cols=148 Identities=24% Similarity=0.561 Sum_probs=87.0
Q ss_pred CCCCCCeeeecCCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccC-CCCCCeeeecC-----CCc
Q psy15668 14 PCGSNTQCNVASNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQ-CGVNSECNVRN-----HIP 87 (365)
Q Consensus 14 ~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~-C~~~~~C~~~~-----g~~ 87 (365)
.|. +|.-+...+.|.|.|.+||.... +++|+...+|..... ... |...++|++.. ..|
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~-----EntCE~kv~C~~~e~----------~~K~Cgdya~C~~~~~~~~~~~~ 70 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLKN-----ENTCEEKVECDKLEN----------VNKPCGDYAKCINQANKGEERAY 70 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEEE-----TTEEEE----SG-GG----------TTSEEETTEEEEE-SSTTSSTSE
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEcc-----ccccccceecCcccc----------cCccccchhhhhcCCCcccceeE
Confidence 465 46888888999999999998762 344544555543100 112 77788898754 569
Q ss_pred eeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCC---CCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccC
Q psy15668 88 VCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVIN---MVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVN 164 (365)
Q Consensus 88 ~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~---~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~ 164 (365)
.|.|.+||+.... .|.. +.|....|+ .+.|+..+ ....|+|.-|+.... ...|....
T Consensus 71 ~C~C~~gY~~~~~-vCvp----~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d----n~kCtk~G---------- 130 (197)
T PF06247_consen 71 KCDCINGYILKQG-VCVP----NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD----NKKCTKTG---------- 130 (197)
T ss_dssp EEEE-TTEEESSS-SEEE----GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTT----TTESEEEE----------
T ss_pred EEecccCceeeCC-eEch----hhcCceecC-CCeEEecCCCCCCceeEeeeceEecc----CCcccCCC----------
Confidence 9999999987644 5765 457777788 58897543 345899999987221 11232210
Q ss_pred CcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCC
Q psy15668 165 YKCANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDP 201 (365)
Q Consensus 165 ~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 201 (365)
..+|+ -.|..+..|....+-|.|.+..||.+++
T Consensus 131 ---~T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 131 ---ETKCS-LKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp ------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred ---cccee-eecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 11222 2366678999999999999999998774
No 30
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.20 E-value=0.00053 Score=41.44 Aligned_cols=34 Identities=35% Similarity=0.834 Sum_probs=28.9
Q ss_pred CCCCCC-CCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668 309 DDLCEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA 347 (365)
Q Consensus 309 ~d~C~~-~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~ 347 (365)
+++|.. .+|.+++.|++ ..++|+|.|++||.|..
T Consensus 2 ~~~C~~~~~C~~~~~C~~-----~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 2 IDECASGNPCQNGGTCVN-----TVGSYRCSCPPGYTGRN 36 (38)
T ss_pred cccCCCCCCcCCCCEeEC-----CCCCeEeECCCCCcCCc
Confidence 678876 78998899998 56889999999999854
No 31
>KOG1226|consensus
Probab=97.16 E-value=0.0052 Score=61.17 Aligned_cols=128 Identities=24% Similarity=0.691 Sum_probs=77.3
Q ss_pred eeeCCCCCccCCCCCcccCCC-------CCCCCC----CCCCCCCeeeeCCCCceeeCCCCCC----CCCCCCCCCCCCC
Q psy15668 88 VCSCPPGYTGDPLTQCRRFDP-------QELCDR----SPCGVNTRCEVINMVPTCSCLPGYT----GSPLSGCRHECDS 152 (365)
Q Consensus 88 ~C~C~~G~~g~~~~~C~~~~~-------~~~C~~----~~C~~~~~C~~~~~~~~C~C~~G~~----g~~~~~~~~~C~~ 152 (365)
.|.|.+||.|+ .|+-..+ .+.|.. .+|...+.|.=. .|.|.+... |. .|+-
T Consensus 479 ~C~C~~G~~G~---~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~-------fCEC 544 (783)
T KOG1226|consen 479 QCRCDEGWLGK---KCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK-------FCEC 544 (783)
T ss_pred ceecCCCCCCC---cccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeee-------eeec
Confidence 57999999998 7763211 133432 267777777544 678877665 55 5553
Q ss_pred CCCCCCCCcccCCcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy15668 153 DYDCGPSQSCVNYKCANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCA 232 (365)
Q Consensus 153 ~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~ 232 (365)
++ |+|... ....|..++.|.=. +|.|.+||+|..+ .|....+.|.+.. .
T Consensus 545 Dn----------fsC~r~-~g~lC~g~G~C~CG----~CvC~~GwtG~~C-----------~C~~std~C~~~~-----G 593 (783)
T KOG1226|consen 545 DN----------FSCERH-KGVLCGGHGRCECG----RCVCNPGWTGSAC-----------NCPLSTDTCESSD-----G 593 (783)
T ss_pred cC----------cccccc-cCcccCCCCeEeCC----cEEcCCCCccCCC-----------CCCCCCccccCCC-----C
Confidence 31 111110 11247777887532 4999999999965 3555555554321 1
Q ss_pred CCCCCCceeeecCCCceeeCCCC-CccCCCCccccC
Q psy15668 233 GQCGINAKCEVRGATPICSCPRD-MTGDPFVRCRPF 267 (365)
Q Consensus 233 ~~C~~~~~C~~~~~~~~C~C~~G-~~g~~~~~C~~~ 267 (365)
..|+..|+|.= -+|+|... |.|. .|+.-
T Consensus 594 ~iCSGrG~C~C----g~C~C~~~~~sG~---~CE~c 622 (783)
T KOG1226|consen 594 QICSGRGTCEC----GRCKCTDPPYSGE---FCEKC 622 (783)
T ss_pred ceeCCCceeeC----CceEcCCCCcCcc---hhhcC
Confidence 34666777754 25888777 8898 77753
No 32
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.15 E-value=0.00044 Score=37.16 Aligned_cols=24 Identities=33% Similarity=0.708 Sum_probs=18.0
Q ss_pred CceeeCCCCCccCCCCCccCCCccCCCC
Q psy15668 27 RPVCSCLPGHWGNPLTYCQRGECQDHSD 54 (365)
Q Consensus 27 ~~~C~C~~G~~g~~~~~C~~~~C~~~~~ 54 (365)
+|+|+|++||...+ +...|+|++|
T Consensus 1 sy~C~C~~Gy~l~~----d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP----DGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCC----CCCccccCCC
Confidence 69999999999763 2345777764
No 33
>KOG1836|consensus
Probab=97.06 E-value=0.014 Score=64.36 Aligned_cols=174 Identities=22% Similarity=0.466 Sum_probs=94.2
Q ss_pred CCCCCCCCCCeeeeC--CCceeee-CCCCCccCCCCccCccccCCCCCC-CCCCCCCCCCCCC---CCC-CCCCC-Ccee
Q psy15668 171 CASGACAPTAQCEVR--NHRAVCS-CPVGYLGDPYTSCRAECLAHSDCP-TDRPSCLGNKCMN---PCA-GQCGI-NAKC 241 (365)
Q Consensus 171 C~~~~C~~~~~C~~~--~g~~~C~-C~~G~~g~~~~~~~~~C~~~~~C~-~~~~~C~~~~C~~---~c~-~~C~~-~~~C 241 (365)
|.+-+|...+.|... .....|. |++||+|..++.+.+.......=. .+...|..-+|.. +=. ..|.. .+.|
T Consensus 777 C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c 856 (1705)
T KOG1836|consen 777 CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGEC 856 (1705)
T ss_pred CccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCccccccccccccce
Confidence 555667777777533 4567887 999999998775544332211100 1122344433311 000 11221 2233
Q ss_pred ---eecCCCcee-eCCCCCccCCCC-----ccccC-----CCCCcccccccccceeeecCCccceeeeeecCCCCCCCCC
Q psy15668 242 ---EVRGATPIC-SCPRDMTGDPFV-----RCRPF-----DKYVAPLINDYLKIYWRYQNNKTIFYVSLVSLNYPYVTPL 307 (365)
Q Consensus 242 ---~~~~~~~~C-~C~~G~~g~~~~-----~C~~~-----~~~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~~~~~~~~ 307 (365)
+.....++| .|.+||.|+... .|... ........-....+.+.|.+...+..+..|..||++..
T Consensus 857 ~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~-- 934 (1705)
T KOG1836|consen 857 LKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTGQCECKPNVEGRDCLYCFKGFFNLN-- 934 (1705)
T ss_pred eeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCCcccceeccCCCCccccccccccccccC--
Confidence 333334555 799999888532 12211 10100111122234677788888888989999998843
Q ss_pred CCCCCCCCCCCCC----CeecCCCCCCCCCCceeeCCCCCccCCCCCCcC
Q psy15668 308 PDDLCEPNPCGEN----AKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRR 353 (365)
Q Consensus 308 ~~d~C~~~~C~~~----~~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~ 353 (365)
.-..|+.-+|..- ..|+. ++.+|.|.+|-+|...+.|..
T Consensus 935 s~~gC~~c~c~~~gs~~~~c~~-------~tGqc~c~~gVtgqrc~qc~~ 977 (1705)
T KOG1836|consen 935 SGVGCEPCNCDPTGSESSDCDV-------GTGQCYCRPGVTGQRCDQCET 977 (1705)
T ss_pred CCCCcccccccccccccccccc-------cCCceeeecCccccccCcccc
Confidence 1125555445432 25654 446899999999888777764
No 34
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.65 E-value=0.0027 Score=37.60 Aligned_cols=28 Identities=39% Similarity=0.973 Sum_probs=24.2
Q ss_pred CCCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668 314 PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD 346 (365)
Q Consensus 314 ~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~ 346 (365)
..+|.+++.|++ ..++|+|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~-----~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVN-----TPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEec-----CCCCeEeECCCCCccc
Confidence 567888899998 5588999999999987
No 35
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.36 E-value=0.0033 Score=33.80 Aligned_cols=23 Identities=30% Similarity=0.638 Sum_probs=17.7
Q ss_pred CceeeCCCCCccCC-CCccccCCC
Q psy15668 247 TPICSCPRDMTGDP-FVRCRPFDK 269 (365)
Q Consensus 247 ~~~C~C~~G~~g~~-~~~C~~~~~ 269 (365)
+|+|.|++||+... ...|++|++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 58999999998654 457888753
No 36
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.35 E-value=0.0047 Score=36.64 Aligned_cols=29 Identities=38% Similarity=1.062 Sum_probs=23.9
Q ss_pred CCC-CCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668 312 CEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD 346 (365)
Q Consensus 312 C~~-~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~ 346 (365)
|.. ++|.++ +|++ ..++|+|.|++||.|.
T Consensus 2 C~~~~~C~~~-~C~~-----~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 2 CASGGPCSNG-TCIN-----TPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCcCCCCCC-EEEC-----CCCCeEeECCCCCccC
Confidence 444 578888 9998 5689999999999984
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long, found in epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=96.16 E-value=0.0082 Score=34.91 Aligned_cols=25 Identities=28% Similarity=0.553 Sum_probs=21.8
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668 13 NPCGSNTQCNVASNRPVCSCLPGHWGN 39 (365)
Q Consensus 13 ~~C~~~~~C~~~~~~~~C~C~~G~~g~ 39 (365)
..|+++|+|+.. ..+|.|.+||+|.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCC
Confidence 469999999976 4599999999998
No 38
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.14 E-value=0.0039 Score=37.34 Aligned_cols=23 Identities=30% Similarity=0.576 Sum_probs=19.2
Q ss_pred CCCCCeeeecCCCceeeCCCCCccC
Q psy15668 15 CGSNTQCNVASNRPVCSCLPGHWGN 39 (365)
Q Consensus 15 C~~~~~C~~~~~~~~C~C~~G~~g~ 39 (365)
|++ +|++++++|+|.|++||.+.
T Consensus 8 C~h--~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 8 CSH--ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSS--EEEEETTSEEEE-STTEEE-
T ss_pred cCC--CCccCCCceEeECCCCCEEC
Confidence 655 89999999999999999987
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long, found in epidermal growth factor (EGF), has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=94.84 E-value=0.047 Score=31.74 Aligned_cols=24 Identities=25% Similarity=0.555 Sum_probs=20.1
Q ss_pred CCCCCceeeecCCCceeeCCCCCccC
Q psy15668 234 QCGINAKCEVRGATPICSCPRDMTGD 259 (365)
Q Consensus 234 ~C~~~~~C~~~~~~~~C~C~~G~~g~ 259 (365)
.|..+++|+.. ..+|+|.+||+|.
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCC
Confidence 47779999865 4689999999997
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.49 E-value=0.036 Score=33.17 Aligned_cols=22 Identities=23% Similarity=0.415 Sum_probs=18.1
Q ss_pred CceeeecCCCceeeCCCCCccC
Q psy15668 238 NAKCEVRGATPICSCPRDMTGD 259 (365)
Q Consensus 238 ~~~C~~~~~~~~C~C~~G~~g~ 259 (365)
...|++++++|+|.|++||+..
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE-
T ss_pred CCCCccCCCceEeECCCCCEEC
Confidence 3689999999999999999876
No 41
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.23 E-value=0.02 Score=25.99 Aligned_cols=11 Identities=45% Similarity=1.114 Sum_probs=8.9
Q ss_pred eeeCCCCCccC
Q psy15668 249 ICSCPRDMTGD 259 (365)
Q Consensus 249 ~C~C~~G~~g~ 259 (365)
.|+|++||+|.
T Consensus 1 ~C~C~~G~~G~ 11 (13)
T PF12661_consen 1 TCQCPPGWTGP 11 (13)
T ss_dssp EEEE-TTEETT
T ss_pred CccCcCCCcCC
Confidence 48999999998
No 42
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=91.47 E-value=0.1 Score=31.18 Aligned_cols=30 Identities=27% Similarity=0.533 Sum_probs=21.7
Q ss_pred CCCCCCCCCCeeeecC-CCceeeCCCCCccC
Q psy15668 10 CSPNPCGSNTQCNVAS-NRPVCSCLPGHWGN 39 (365)
Q Consensus 10 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~ 39 (365)
|...+|..|+.|++.. |++.|.|.+||..+
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 5567788999999777 99999999999865
No 43
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=87.35 E-value=0.34 Score=28.92 Aligned_cols=28 Identities=25% Similarity=0.549 Sum_probs=20.0
Q ss_pred CCCCCCCCeeeeCC-CceeeeCCCCCccC
Q psy15668 173 SGACAPTAQCEVRN-HRAVCSCPVGYLGD 200 (365)
Q Consensus 173 ~~~C~~~~~C~~~~-g~~~C~C~~G~~g~ 200 (365)
...|+.++.|++.. |++.|.|..||..+
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 35678899998766 99999999999765
No 44
>KOG3512|consensus
Probab=86.45 E-value=3.9 Score=39.05 Aligned_cols=158 Identities=14% Similarity=0.204 Sum_probs=81.3
Q ss_pred CeeeeCC-CceeeeCCCCCccCCCCccCccccCCCC---CCCCCCCCCCCCCCC---CCC---------CCCCCCceeee
Q psy15668 180 AQCEVRN-HRAVCSCPVGYLGDPYTSCRAECLAHSD---CPTDRPSCLGNKCMN---PCA---------GQCGINAKCEV 243 (365)
Q Consensus 180 ~~C~~~~-g~~~C~C~~G~~g~~~~~~~~~C~~~~~---C~~~~~~C~~~~C~~---~c~---------~~C~~~~~C~~ 243 (365)
..|+... +.+.|.|..+..|..+..+.+.-.+..- -....++|....|.. -|. +.+ .+++|+|
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~-SggvCln 363 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRR-SGGVCLN 363 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCcc-ccceEee
Confidence 3576444 4489999999888866655443322210 001222332222211 000 112 2567764
Q ss_pred c---CCCcee-eCCCCCccCCCC------ccccCCC---CCcccccccccceeeecCCccceeeeeecCCCCCCC-----
Q psy15668 244 R---GATPIC-SCPRDMTGDPFV------RCRPFDK---YVAPLINDYLKIYWRYQNNKTIFYVSLVSLNYPYVT----- 305 (365)
Q Consensus 244 ~---~~~~~C-~C~~G~~g~~~~------~C~~~~~---~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~~~~~~----- 305 (365)
- ..+-+| .|++||..++.. .|..-+. ..+.-..+-..+++.|.++.++..|-+|..||....
T Consensus 364 CrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tGqCpCkeGvtG~tCnrCa~gyqqsrs~vap 443 (592)
T KOG3512|consen 364 CRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTGQCPCKEGVTGLTCNRCAPGYQQSRSPVAP 443 (592)
T ss_pred cccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccccCCcccCCCCCcccccccccchhhcccCCCcC
Confidence 3 223445 699999876531 1221111 111111222345788999999999999999998532
Q ss_pred CCCCCCCCCCCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668 306 PLPDDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA 347 (365)
Q Consensus 306 ~~~~d~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~ 347 (365)
|+.++.=.+..++++.+ +..+.+.|+.++.|..
T Consensus 444 cik~p~~~~~~~~s~ve---------~qd~~s~Ck~~~~~~r 476 (592)
T KOG3512|consen 444 CIKIPTDAPTLGSSGVE---------PQDQCSKCKASPGGKR 476 (592)
T ss_pred ceecCCCCccccCCCCc---------chhccccCCCCCccee
Confidence 33222211222333333 2336677888887766
No 45
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.35 E-value=0.97 Score=39.51 Aligned_cols=38 Identities=24% Similarity=0.485 Sum_probs=28.9
Q ss_pred CCCCCCCCCCC--CCCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668 303 YVTPLPDDLCE--PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA 347 (365)
Q Consensus 303 ~~~~~~~d~C~--~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~ 347 (365)
+..|.++++|. .++|.. .|.+ ..|+|.|.|++||+...
T Consensus 181 ~~~C~~~~~C~~~~~~c~~--~C~~-----~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 181 GKICVVPDLCATLSHVCQQ--VCIS-----TPGSYLCACTEGYALLE 220 (224)
T ss_pred cccCcCchhhcCCCCCccc--eEEc-----CCCCEEeECCCCccCCC
Confidence 34466788895 356764 6997 77999999999998643
No 46
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=81.61 E-value=1.4 Score=38.60 Aligned_cols=21 Identities=14% Similarity=0.358 Sum_probs=18.9
Q ss_pred ceeeecCCCceeeCCCCCccC
Q psy15668 239 AKCEVRGATPICSCPRDMTGD 259 (365)
Q Consensus 239 ~~C~~~~~~~~C~C~~G~~g~ 259 (365)
..|.++.|+|.|.|++||+..
T Consensus 199 ~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 199 QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred ceEEcCCCCEEeECCCCccCC
Confidence 579999999999999999875
No 47
>smart00051 DSL delta serrate ligand.
Probab=80.51 E-value=2.6 Score=28.74 Aligned_cols=23 Identities=22% Similarity=0.413 Sum_probs=16.5
Q ss_pred CCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668 317 CGENAKCQPGYDKSGKDRPVCTCLPGYVGDA 347 (365)
Q Consensus 317 C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~ 347 (365)
...+..|.. . ..++|.+||.|..
T Consensus 40 ~~~~~~Cd~----~----G~~~C~~Gw~G~~ 62 (63)
T smart00051 40 FFGHYTCDE----N----GNKGCLEGWMGPY 62 (63)
T ss_pred ccCCccCCc----C----CCEecCCCCcCCC
Confidence 455667864 1 3689999999875
No 48
>KOG3516|consensus
Probab=79.58 E-value=1.3 Score=46.82 Aligned_cols=33 Identities=33% Similarity=0.780 Sum_probs=30.9
Q ss_pred CCCCCCCCCCCCCeeeecCCCceeeCC-CCCccC
Q psy15668 7 GDPCSPNPCGSNTQCNVASNRPVCSCL-PGHWGN 39 (365)
Q Consensus 7 id~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~ 39 (365)
+|.|.+|+|.++|.|......|.|.|. .||.|.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccc
Confidence 578889999999999999999999999 899998
No 49
>KOG1218|consensus
Probab=77.39 E-value=46 Score=30.33 Aligned_cols=14 Identities=36% Similarity=0.684 Sum_probs=11.5
Q ss_pred CCceeeCCCCCccC
Q psy15668 26 NRPVCSCLPGHWGN 39 (365)
Q Consensus 26 ~~~~C~C~~G~~g~ 39 (365)
....|.|.+||+|.
T Consensus 13 ~~~~c~c~~~~~g~ 26 (316)
T KOG1218|consen 13 GSGQCFCDPGYTGR 26 (316)
T ss_pred CCCceecCCCcccc
Confidence 45689999999985
No 50
>PHA02887 EGF-like protein; Provisional
Probab=75.90 E-value=2.9 Score=31.96 Aligned_cols=36 Identities=25% Similarity=0.528 Sum_probs=25.1
Q ss_pred CCCCCC---CCCCCCCeeeec--CCCceeeCCCCCccCCCCCccC
Q psy15668 7 GDPCSP---NPCGSNTQCNVA--SNRPVCSCLPGHWGNPLTYCQR 46 (365)
Q Consensus 7 id~C~~---~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~C~~ 46 (365)
..+|.. +=|- ||+|.-. ...+.|.|++||+|. +|+.
T Consensus 83 f~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~ 123 (126)
T PHA02887 83 FEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGI---RCDE 123 (126)
T ss_pred ccccChHhhCEee-CCEEEccccCCCceeECCCCcccC---CCCc
Confidence 355642 3376 4789843 356899999999999 6653
No 51
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.47 E-value=3.1 Score=32.37 Aligned_cols=37 Identities=30% Similarity=0.548 Sum_probs=26.4
Q ss_pred CCCCCCC---CCCCCCCeeeecC--CCceeeCCCCCccCCCCCccC
Q psy15668 6 GGDPCSP---NPCGSNTQCNVAS--NRPVCSCLPGHWGNPLTYCQR 46 (365)
Q Consensus 6 did~C~~---~~C~~~~~C~~~~--~~~~C~C~~G~~g~~~~~C~~ 46 (365)
+|-+|.+ +=|-+ |+|.-.. ..+.|.|..||+|. +|+.
T Consensus 41 ~i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh 82 (139)
T PHA03099 41 AIRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQH 82 (139)
T ss_pred ccccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc---cccc
Confidence 4556643 33766 4898444 77999999999999 6653
No 52
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. (Schematic of the topology of the four disulphide bonds in the LE domain not reproduced; consensus pattern: xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx, where 'C' is a conserved cysteine involved in a disulphide bond, 'a' a conserved aromatic residue, and 'G' a conserved glycine (lower case = less conserved); the schematic also marked, with 's', the region similar to the EGF-like domain.) In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=75.36 E-value=1.9 Score=27.62 Aligned_cols=22 Identities=32% Similarity=0.794 Sum_probs=16.9
Q ss_pred CCceeeCCCCCccCCCCCCcCC
Q psy15668 333 DRPVCTCLPGYVGDALTYCRRG 354 (365)
Q Consensus 333 ~~~~C~C~~G~~g~~~~~C~~~ 354 (365)
...+|.|+++|+|...++|.+.
T Consensus 16 ~~G~C~C~~~~~G~~C~~C~~g 37 (49)
T PF00053_consen 16 STGQCVCKPGTTGPRCDQCKPG 37 (49)
T ss_dssp TCEEESBSTTEESTTS-EE-TT
T ss_pred CCCEEeccccccCCcCcCCCCc
Confidence 3469999999999998888864
No 53
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=74.91 E-value=3.4 Score=31.56 Aligned_cols=32 Identities=47% Similarity=0.923 Sum_probs=25.4
Q ss_pred CCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcc
Q psy15668 6 GGDPCS-PNPCGSNTQCNVASNRPVCSCLPGHWG 38 (365)
Q Consensus 6 did~C~-~~~C~~~~~C~~~~~~~~C~C~~G~~g 38 (365)
..|+|. ...|+.+|.|.. .....|.|.+||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence 457897 578999999964 45678999999974
No 54
>smart00051 DSL delta serrate ligand.
Probab=74.07 E-value=4.9 Score=27.39 Aligned_cols=44 Identities=27% Similarity=0.652 Sum_probs=30.2
Q ss_pred ceeeCCCCCccCCCCCcccCCCCCCCCC-CCCCCCCeeeeCCCCceeeCCCCCCCC
Q psy15668 87 PVCSCPPGYTGDPLTQCRRFDPQELCDR-SPCGVNTRCEVINMVPTCSCLPGYTGS 141 (365)
Q Consensus 87 ~~C~C~~G~~g~~~~~C~~~~~~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~ 141 (365)
+.-.|.++|.|. .|... |.+ .....+..|.. .| .++|.+||+|.
T Consensus 17 ~rv~C~~~~yG~---~C~~~-----C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~ 61 (63)
T smart00051 17 IRVTCDENYYGE---GCNKF-----CRPRDDFFGHYTCDE-NG--NKGCLEGWMGP 61 (63)
T ss_pred EEeeCCCCCcCC---ccCCE-----eCcCccccCCccCCc-CC--CEecCCCCcCC
Confidence 455899999999 88643 643 22445667743 23 57899999987
No 55
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=72.71 E-value=5.7 Score=25.50 Aligned_cols=20 Identities=30% Similarity=0.713 Sum_probs=17.2
Q ss_pred ceeeCCCCCccCCCCCCcCC
Q psy15668 335 PVCTCLPGYVGDALTYCRRG 354 (365)
Q Consensus 335 ~~C~C~~G~~g~~~~~C~~~ 354 (365)
.+|.|+++|+|...+.|.+.
T Consensus 19 G~C~C~~~~~G~~C~~C~~g 38 (50)
T cd00055 19 GQCECKPNTTGRRCDRCAPG 38 (50)
T ss_pred CEEeCCCcCCCCCCCCCCCC
Confidence 58999999999998888754
No 56
>PHA02887 EGF-like protein; Provisional
Probab=71.67 E-value=3.5 Score=31.49 Aligned_cols=30 Identities=30% Similarity=0.602 Sum_probs=23.2
Q ss_pred CCCCCCceeeec--CCCceeeCCCCCccCCCCcccc
Q psy15668 233 GQCGINAKCEVR--GATPICSCPRDMTGDPFVRCRP 266 (365)
Q Consensus 233 ~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~C~~ 266 (365)
+.|. +|+|... .....|+|+.||+|. +|+.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGI---RCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccC---CCCc
Confidence 4455 5788765 456889999999999 7875
No 57
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=67.58 E-value=5.6 Score=30.96 Aligned_cols=31 Identities=29% Similarity=0.578 Sum_probs=24.0
Q ss_pred CCCCCCceeeec--CCCceeeCCCCCccCCCCccccC
Q psy15668 233 GQCGINAKCEVR--GATPICSCPRDMTGDPFVRCRPF 267 (365)
Q Consensus 233 ~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~C~~~ 267 (365)
+.|.+ |+|... ...+.|+|..||+|. +|+..
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh~ 83 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQHV 83 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCcccc---cccce
Confidence 45665 488765 477899999999999 88853
No 58
>KOG3514|consensus
Probab=61.19 E-value=5.6 Score=41.93 Aligned_cols=31 Identities=35% Similarity=0.833 Sum_probs=28.5
Q ss_pred CCCCCCCCCCCeeeecCCCceeeCCC-CCccC
Q psy15668 9 PCSPNPCGSNTQCNVASNRPVCSCLP-GHWGN 39 (365)
Q Consensus 9 ~C~~~~C~~~~~C~~~~~~~~C~C~~-G~~g~ 39 (365)
.|.++||.++|+|...+.+|.|.|.. ||.|.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~ 656 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR 656 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc
Confidence 69999999999999999999999975 78887
No 59
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=59.79 E-value=8.8 Score=22.43 Aligned_cols=13 Identities=38% Similarity=0.838 Sum_probs=11.1
Q ss_pred ceeeCCCCCccCC
Q psy15668 335 PVCTCLPGYVGDA 347 (365)
Q Consensus 335 ~~C~C~~G~~g~~ 347 (365)
+.|.||+||+.+.
T Consensus 18 ~~C~CPeGyIlde 30 (34)
T PF09064_consen 18 GQCFCPEGYILDE 30 (34)
T ss_pred CceeCCCceEecC
Confidence 5899999998765
No 60
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=58.83 E-value=24 Score=22.59 Aligned_cols=28 Identities=36% Similarity=0.828 Sum_probs=20.1
Q ss_pred CCCC-CCCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668 310 DLCE-PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD 346 (365)
Q Consensus 310 d~C~-~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~ 346 (365)
+.|. ...|..++.|++ .+|+|++||+-.
T Consensus 20 ~~C~~~~qC~~~s~C~~---------g~C~C~~g~~~~ 48 (52)
T PF01683_consen 20 ESCESDEQCIGGSVCVN---------GRCQCPPGYVEV 48 (52)
T ss_pred CCCCCcCCCCCcCEEcC---------CEeECCCCCEec
Confidence 3454 235668889987 389999999754
No 61
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=58.65 E-value=12 Score=23.55 Aligned_cols=20 Identities=30% Similarity=0.727 Sum_probs=16.6
Q ss_pred ceeeCCCCCccCCCCCCcCC
Q psy15668 335 PVCTCLPGYVGDALTYCRRG 354 (365)
Q Consensus 335 ~~C~C~~G~~g~~~~~C~~~ 354 (365)
.+|.|+++|+|...+.|.+.
T Consensus 18 G~C~C~~~~~G~~C~~C~~g 37 (46)
T smart00180 18 GQCECKPNVTGRRCDRCAPG 37 (46)
T ss_pred CEEECCCCCCCCCCCcCCCC
Confidence 58999999999888777653
No 62
>KOG3516|consensus
Probab=54.99 E-value=9.7 Score=40.75 Aligned_cols=37 Identities=38% Similarity=0.750 Sum_probs=31.8
Q ss_pred CCCCCCCCCCCCCCCCeecCCCCCCCCCCceeeCC-CCCccCC
Q psy15668 306 PLPDDLCEPNPCGENAKCQPGYDKSGKDRPVCTCL-PGYVGDA 347 (365)
Q Consensus 306 ~~~~d~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~-~G~~g~~ 347 (365)
|..+|.|.+++|.+++.|.. ....|.|.|. .||.|..
T Consensus 542 C~i~drClPN~CehgG~C~Q-----s~~~f~C~C~~TGY~Gat 579 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQ-----SWDDFECNCELTGYKGAT 579 (1306)
T ss_pred cccccccCCccccCCCcccc-----cccceeEecccccccccc
Confidence 55578899999999999997 4577999999 9999987
No 63
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=54.37 E-value=12 Score=28.56 Aligned_cols=24 Identities=50% Similarity=1.140 Sum_probs=18.6
Q ss_pred CCCCCeeeecCCCceeeCCCCCccC
Q psy15668 74 CGVNSECNVRNHIPVCSCPPGYTGD 98 (365)
Q Consensus 74 C~~~~~C~~~~g~~~C~C~~G~~g~ 98 (365)
|+.++.|.. .....|.|.+||.-+
T Consensus 86 CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 86 CGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cCCccEeCC-CCCCceECCCCcCCC
Confidence 888999954 345689999999743
No 64
>KOG3512|consensus
Probab=49.88 E-value=43 Score=32.34 Aligned_cols=62 Identities=16% Similarity=0.294 Sum_probs=39.8
Q ss_pred ecCCccceeeeeecCCCCCCCCC---CCCCCCCCCCCC----CCeecCCCCCCCCCCceeeCCCCCccCCCCCCcC
Q psy15668 285 YQNNKTIFYVSLVSLNYPYVTPL---PDDLCEPNPCGE----NAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRR 353 (365)
Q Consensus 285 c~~~~~~~~~~~c~~~~~~~~~~---~~d~C~~~~C~~----~~~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~ 353 (365)
|.-+.++..|..|-+||+-..-. +...|..-.|++ +-+|.. .+.+|.|++|-+|..++.|.+
T Consensus 364 CrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq-------~tGqCpCkeGvtG~tCnrCa~ 432 (592)
T KOG3512|consen 364 CRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQ-------TTGQCPCKEGVTGLTCNRCAP 432 (592)
T ss_pred cccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccc-------cCCcccCCCCCcccccccccc
Confidence 34445678888999999843322 223354434544 235654 235899999999999888875
No 65
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=48.46 E-value=7.8 Score=29.20 Aligned_cols=35 Identities=23% Similarity=0.530 Sum_probs=25.6
Q ss_pred cCCCCCC--CCCCCCCCeeeecC-----CCceeeCCCCCccC
Q psy15668 5 MGGDPCS--PNPCGSNTQCNVAS-----NRPVCSCLPGHWGN 39 (365)
Q Consensus 5 ~did~C~--~~~C~~~~~C~~~~-----~~~~C~C~~G~~g~ 39 (365)
...++|. .+.|..||.|+... .=|.|.|.+.+...
T Consensus 3 ~S~~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~ 44 (103)
T PF12955_consen 3 SSNDACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT 44 (103)
T ss_pred CCHHHHHHhccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence 4455664 67899999999773 44899999966543
No 66
>KOG1218|consensus
Probab=37.19 E-value=3.2e+02 Score=24.72 Aligned_cols=40 Identities=33% Similarity=0.722 Sum_probs=21.7
Q ss_pred CCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCC
Q psy15668 92 PPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGS 141 (365)
Q Consensus 92 ~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 141 (365)
..+|.+. .|..+ .++... |.. .+|.+... .|.+..+|.+.
T Consensus 96 ~~~~~g~---~C~~~---~~~~~~-c~~-~~C~~~~~--~c~~~~~~~~~ 135 (316)
T KOG1218|consen 96 LNGYEGP---QCESP---CPCGDG-CAE-KTCANPRR--ECRCGGGYIGE 135 (316)
T ss_pred CCCCCcc---cccCC---CCcCCc-ccc-cccCCCcc--ceecCCcCccc
Confidence 5777777 77755 233222 222 34544432 46666677666
No 67
>KOG3514|consensus
Probab=36.92 E-value=25 Score=37.48 Aligned_cols=32 Identities=41% Similarity=1.076 Sum_probs=28.2
Q ss_pred CCCCCCCCCCCeecCCCCCCCCCCceeeCC-CCCccCC
Q psy15668 311 LCEPNPCGENAKCQPGYDKSGKDRPVCTCL-PGYVGDA 347 (365)
Q Consensus 311 ~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~-~G~~g~~ 347 (365)
.|.++||.|+|+|.. +..+|.|.|. .||.|..
T Consensus 625 ~C~~nPC~N~g~C~e-----gwNrfiCDCs~T~~~G~~ 657 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSE-----GWNRFICDCSGTGFEGRT 657 (1591)
T ss_pred ccCCCcccCCCCccc-----cccccccccccCcccCcc
Confidence 688999999999997 7788999996 6898887
No 68
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=36.23 E-value=16 Score=24.84 Aligned_cols=14 Identities=29% Similarity=0.548 Sum_probs=5.9
Q ss_pred ceeeeCCCCCccCC
Q psy15668 188 RAVCSCPVGYLGDP 201 (365)
Q Consensus 188 ~~~C~C~~G~~g~~ 201 (365)
.++-.|.+.|.|..
T Consensus 16 ~~rv~C~~nyyG~~ 29 (63)
T PF01414_consen 16 RIRVVCDENYYGPN 29 (63)
T ss_dssp -------TTEETTT
T ss_pred EEEEECCCCCCCcc
Confidence 45678899999983
No 69
>KOG3607|consensus
Probab=26.68 E-value=63 Score=33.53 Aligned_cols=49 Identities=29% Similarity=0.766 Sum_probs=34.8
Q ss_pred CCCCCCCCcccCCccc-------cccccCCCCCCeeeecCCCceeeCCCCCccCCCCCcccC
Q psy15668 52 HSDCSHSKACKEYRCV-------DVCAGQCGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRF 106 (365)
Q Consensus 52 ~~~C~~~~~C~~~~C~-------~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~ 106 (365)
...|..+..|.+.+|+ ..|...|..++.|.+.. .|.|.+||.+. .|...
T Consensus 603 Gt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~~---~ChC~~gwapp---~C~~~ 658 (716)
T KOG3607|consen 603 GTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNEL---NCHCEPGWAPP---FCFIF 658 (716)
T ss_pred CCccCCCceecCCcchhhhhhcccccccccCCCcccCCCc---ceeeCCCCCCC---ccccc
Confidence 3446666677777772 33445588889886654 89999999988 78754
Done!