Query psy2457
Match_columns 189
No_of_seqs 126 out of 2135
Neff 9.8
Searched_HMMs 46136
Date Fri Aug 16 16:47:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2457.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2457hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.4 4.9E-13 1.1E-17 120.8 8.4 114 24-168 3859-3976(4289)
2 KOG1214|consensus 99.4 2.2E-12 4.7E-17 108.4 8.1 131 19-174 725-866 (1289)
3 KOG4289|consensus 98.8 5.1E-09 1.1E-13 92.7 6.2 98 42-168 1216-1315(2531)
4 KOG4289|consensus 98.8 3.6E-08 7.9E-13 87.6 9.0 84 5-103 1224-1308(2531)
5 KOG1219|consensus 98.5 3.4E-07 7.3E-12 84.6 6.3 89 10-108 3885-3975(4289)
6 KOG1214|consensus 98.1 1.6E-05 3.4E-10 68.2 8.4 122 35-178 702-831 (1289)
7 PF00008 EGF: EGF-like domain 98.0 3E-06 6.6E-11 43.3 1.4 27 141-167 5-32 (32)
8 PF07645 EGF_CA: Calcium-bindi 98.0 5.6E-06 1.2E-10 45.2 2.0 24 141-164 11-34 (42)
9 PF07645 EGF_CA: Calcium-bindi 97.7 2E-05 4.3E-10 42.9 1.9 32 27-58 1-35 (42)
10 PF12947 EGF_3: EGF domain; I 97.7 2.6E-05 5.5E-10 40.9 2.2 28 141-168 7-34 (36)
11 KOG1217|consensus 97.6 0.0006 1.3E-08 56.2 9.8 112 29-168 170-306 (487)
12 smart00179 EGF_CA Calcium-bind 97.6 0.00013 2.8E-09 38.7 3.6 27 141-167 10-37 (39)
13 KOG1225|consensus 97.4 0.00043 9.3E-09 57.6 6.4 92 49-167 266-365 (525)
14 cd00054 EGF_CA Calcium-binding 97.2 0.00074 1.6E-08 35.2 3.5 27 141-167 10-36 (38)
15 KOG1225|consensus 97.1 0.002 4.4E-08 53.8 7.4 45 49-110 297-341 (525)
16 KOG4260|consensus 97.1 0.0013 2.8E-08 49.8 5.6 107 52-166 132-270 (350)
17 smart00179 EGF_CA Calcium-bind 97.1 0.0008 1.7E-08 35.5 3.3 32 28-59 2-36 (39)
18 KOG1217|consensus 97.1 0.0044 9.5E-08 51.1 9.0 115 24-166 267-389 (487)
19 cd00053 EGF Epidermal growth f 97.0 0.0015 3.3E-08 33.5 3.6 26 141-166 7-32 (36)
20 KOG4260|consensus 96.8 0.00084 1.8E-08 50.8 2.1 82 5-105 220-304 (350)
21 PF00008 EGF: EGF-like domain 96.7 0.00074 1.6E-08 34.4 1.2 25 35-59 6-31 (32)
22 PF12662 cEGF: Complement Clr- 96.7 0.0019 4E-08 30.5 2.2 23 154-176 1-24 (24)
23 PF06247 Plasmod_Pvs28: Plasmo 96.5 0.00098 2.1E-08 47.7 1.0 126 35-167 8-163 (197)
24 cd00054 EGF_CA Calcium-binding 96.4 0.0054 1.2E-07 31.8 3.2 31 29-59 3-35 (38)
25 PF12947 EGF_3: EGF domain; I 96.3 0.0037 8.1E-08 32.6 2.2 26 35-60 8-33 (36)
26 smart00181 EGF Epidermal growt 96.2 0.0074 1.6E-07 31.0 3.1 25 141-166 7-31 (35)
27 PF12661 hEGF: Human growth fa 95.9 0.0032 6.8E-08 25.2 0.6 13 156-168 1-13 (13)
28 PF14670 FXa_inhibition: Coagu 95.5 0.012 2.5E-07 30.8 1.9 22 146-167 10-31 (36)
29 PF07974 EGF_2: EGF-like domai 95.2 0.033 7E-07 28.2 2.9 26 141-168 7-32 (32)
30 smart00181 EGF Epidermal growt 95.2 0.034 7.3E-07 28.4 3.1 23 35-58 8-30 (35)
31 KOG1226|consensus 95.0 0.24 5.3E-06 43.0 9.2 14 49-65 479-492 (783)
32 cd00053 EGF Epidermal growth f 94.8 0.043 9.3E-07 27.7 3.0 25 35-59 8-32 (36)
33 PF14670 FXa_inhibition: Coagu 94.7 0.033 7.1E-07 29.1 2.2 23 35-59 8-30 (36)
34 PF12662 cEGF: Complement Clr- 94.7 0.024 5.2E-07 26.7 1.5 22 4-30 3-24 (24)
35 KOG1226|consensus 92.2 1.2 2.6E-05 39.0 8.6 58 35-110 516-580 (783)
36 PF12946 EGF_MSP1_1: MSP1 EGF 89.9 0.12 2.6E-06 26.9 0.4 28 141-168 6-34 (37)
37 cd01475 vWA_Matrilin VWA_Matri 89.3 0.38 8.3E-06 36.0 2.9 36 22-59 181-219 (224)
38 PHA02887 EGF-like protein; Pro 87.8 0.62 1.3E-05 30.9 2.7 28 141-169 93-122 (126)
39 PHA03099 epidermal growth fact 87.3 0.93 2E-05 30.6 3.4 52 10-65 24-81 (139)
40 PHA03099 epidermal growth fact 86.5 0.76 1.6E-05 31.0 2.6 28 141-169 52-81 (139)
41 cd01475 vWA_Matrilin VWA_Matri 85.5 0.91 2E-05 33.9 3.0 25 141-167 196-220 (224)
42 cd00055 EGF_Lam Laminin-type e 83.3 1.4 3.1E-05 24.5 2.5 23 148-172 14-36 (50)
43 KOG0994|consensus 83.1 2 4.3E-05 39.6 4.4 25 147-173 1078-1102(1758)
44 PF00053 Laminin_EGF: Laminin 81.7 0.94 2E-05 25.1 1.3 25 147-173 12-36 (49)
45 PHA02887 EGF-like protein; Pro 76.4 4.8 0.0001 26.8 3.5 36 26-65 81-122 (126)
46 PF00954 S_locus_glycop: S-loc 76.4 2.7 5.8E-05 27.6 2.5 23 141-164 85-107 (110)
47 smart00180 EGF_Lam Laminin-typ 74.9 3.4 7.4E-05 22.5 2.3 17 156-172 19-35 (46)
48 PF01683 EB: EB module; Inter 73.4 9.7 0.00021 21.1 4.0 22 141-166 27-48 (52)
49 PF00954 S_locus_glycop: S-loc 67.8 5.4 0.00012 26.2 2.4 30 29-59 78-109 (110)
50 KOG0994|consensus 67.6 19 0.0004 33.8 6.2 21 148-168 878-899 (1758)
51 KOG1836|consensus 56.1 45 0.00097 32.9 6.8 33 141-173 781-816 (1705)
52 KOG3512|consensus 55.9 62 0.0013 27.3 6.8 22 38-59 284-306 (592)
53 KOG3516|consensus 50.3 12 0.00027 34.7 2.2 35 135-169 545-581 (1306)
54 PF09064 Tme5_EGF_like: Thromb 50.0 14 0.0003 18.9 1.5 12 156-167 19-30 (34)
55 KOG3607|consensus 45.4 27 0.00058 31.2 3.5 48 119-169 602-656 (716)
56 KOG3516|consensus 45.1 18 0.00039 33.8 2.4 39 24-65 541-581 (1306)
57 smart00051 DSL delta serrate l 44.0 52 0.0011 19.3 3.6 44 49-108 18-62 (63)
58 KOG3514|consensus 24.7 55 0.0012 30.7 2.1 34 136-169 624-659 (1591)
No 1
>KOG1219|consensus
Probab=99.42 E-value=4.9e-13 Score=120.79 Aligned_cols=114 Identities=25% Similarity=0.587 Sum_probs=96.7
Q ss_pred Cccc-cccchhh-CCCCCEEeeCC-CCeeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCCCeeecCCCCCCCCCCeeeC
Q psy2457 24 DYLG-IRLSDSQ-CGVNSECNVRN-HIPVCSCPPGYTGDPLTQCRRFDPHDLCEPNPCGENAKCQPGYDKSGKDRPVCTC 100 (189)
Q Consensus 24 ~c~~-~~~c~~~-C~~~~~C~~~~-~~~~C~c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~~~C~~~~~~~~~~~~~c~c 100 (189)
.|.. .++|.+. |.++|.|...+ +.|.|.|++.|.|. +|+ .+...|..+||..+++|.. ..+.+.|.|
T Consensus 3859 gC~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CE--i~~epC~snPC~~GgtCip-----~~n~f~CnC 3928 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCE--IDLEPCASNPCLTGGTCIP-----FYNGFLCNC 3928 (4289)
T ss_pred cccccccccccCcccCCCEecCCCCCceEEeCcccccCc---ccc--cccccccCCCCCCCCEEEe-----cCCCeeEeC
Confidence 4543 3889888 99999999986 78999999999999 998 7899999999999999986 556799999
Q ss_pred CCCcccCCCCcCccCCCCCCCCCCCCCccCCCeecCCccc-cCCCCCEeeeCCCCeeeeCCCCCccCCc
Q psy2457 101 LPGYVGDALTYCRRGECQSDAECNYDQVCNNYNCEKACTS-QCGINAQCTARNHVATCSCPAGYQGDAL 168 (189)
Q Consensus 101 ~~g~~~~~~~~c~~~~c~~~~~c~~~~~c~~~~c~~~c~~-~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 168 (189)
+.||.|..|+ .+ . +++|+. .|.++|.|.+..|+|+|.|.+||.|..|
T Consensus 3929 ~~gyTG~~Ce---~~---G---------------i~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3929 PNGYTGKRCE---AR---G---------------ISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred CCCccCceee---cc---c---------------ccccccccccCCceeeccCCceEeccChhHhcccC
Confidence 9999998633 11 1 344544 6999999999999999999999999986
No 2
>KOG1214|consensus
Probab=99.36 E-value=2.2e-12 Score=108.41 Aligned_cols=131 Identities=30% Similarity=0.666 Sum_probs=95.9
Q ss_pred ccCCCCccccccchhh---CCCCCEEeeCCCCeeeeCCCCCccCCC-CCCcccC---CCCCCCC--CCCCCC--CeeecC
Q psy2457 19 VASPRDYLGIRLSDSQ---CGVNSECNVRNHIPVCSCPPGYTGDPL-TQCRRFD---PHDLCEP--NPCGEN--AKCQPG 87 (189)
Q Consensus 19 ~~~~~~c~~~~~c~~~---C~~~~~C~~~~~~~~C~c~~g~~~~~~-~~c~~~~---~~~~c~~--~~c~~~--~~C~~~ 87 (189)
..++++|.|+++|+.. |.++.+|++.++++.|+|..+|.-.+. ..|..+. .++.|.. +.|.-. +.|+.
T Consensus 725 ~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~- 803 (1289)
T KOG1214|consen 725 QGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVH- 803 (1289)
T ss_pred CCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEe-
Confidence 4678899999999876 999999999999999999999874322 2453211 1233432 334322 34543
Q ss_pred CCCCCCCCCeeeCCCCcccCCCCcCccCCCCCCCCCCCCCccCCCeecCCccccCCCCCEeeeCCCCeeeeCCCCCccCC
Q psy2457 88 YDKSGKDRPVCTCLPGYVGDALTYCRRGECQSDAECNYDQVCNNYNCEKACTSQCGINAQCTARNHVATCSCPAGYQGDA 167 (189)
Q Consensus 88 ~~~~~~~~~~c~c~~g~~~~~~~~c~~~~c~~~~~c~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 167 (189)
-+.+.|.|.|.+||.|+.. .|.++++|..+ -|...+.|.+..+++.|.|.+||.|++
T Consensus 804 ---hGgs~y~C~CLPGfsGDG~------~c~dvDeC~ps--------------rChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 804 ---HGGSTYSCACLPGFSGDGH------QCTDVDECSPS--------------RCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred ---cCCceEEEeecCCccCCcc------ccccccccCcc--------------ccCCCceEecCCCcceeecccCccCCC
Confidence 1446799999999999873 46665555543 399999999999999999999999999
Q ss_pred cccCeec
Q psy2457 168 LSRCYPA 174 (189)
Q Consensus 168 ~~~c~~~ 174 (189)
.. |.+-
T Consensus 861 f~-CVP~ 866 (1289)
T KOG1214|consen 861 FQ-CVPD 866 (1289)
T ss_pred ce-ecCC
Confidence 76 6554
No 3
>KOG4289|consensus
Probab=98.84 E-value=5.1e-09 Score=92.72 Aligned_cols=98 Identities=30% Similarity=0.725 Sum_probs=75.5
Q ss_pred eeCCCCeeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCCCeeecCCCCCCCCCCeeeCCCCcccCCCCcCccCCCCCCC
Q psy2457 42 NVRNHIPVCSCPPGYTGDPLTQCRRFDPHDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGECQSDA 121 (189)
Q Consensus 42 ~~~~~~~~C~c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~~~~c~~~~c~~~~ 121 (189)
++..+.+.|.|++||++. .|+ +.+++|-..||.+++.|.. ..+.|+|.|.+||.|+-++ .+ ....
T Consensus 1216 i~pvnglrCrCPpGFTgd---~Ce--TeiDlCYs~pC~nng~C~s-----rEggYtCeCrpg~tGehCE---vs--~~ag 1280 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGD---YCE--TEIDLCYSGPCGNNGRCRS-----REGGYTCECRPGFTGEHCE---VS--ARAG 1280 (2531)
T ss_pred ccccCceeEeCCCCCCcc---ccc--chhHhhhcCCCCCCCceEE-----ecCceeEEecCCcccccee---ee--cccC
Confidence 344577899999999999 998 8899999999999999987 6678999999999998633 11 0112
Q ss_pred CCCCCCccCCCeecCCccccCCCCCEeeeCC-CCeeeeCCCC-CccCCc
Q psy2457 122 ECNYDQVCNNYNCEKACTSQCGINAQCTARN-HVATCSCPAG-YQGDAL 168 (189)
Q Consensus 122 ~c~~~~~c~~~~c~~~c~~~C~~~~~C~~~~-g~~~C~C~~G-~~g~~~ 168 (189)
.|..+ .|.++++|++.. |++.|+|+.| |++..|
T Consensus 1281 rCvpG--------------vC~nggtC~~~~nggf~c~Cp~ge~e~prC 1315 (2531)
T KOG4289|consen 1281 RCVPG--------------VCKNGGTCVNLLNGGFCCHCPYGEFEDPRC 1315 (2531)
T ss_pred ccccc--------------eecCCCEEeecCCCceeccCCCcccCCCce
Confidence 33322 488899998754 7999999988 225554
No 4
>KOG4289|consensus
Probab=98.77 E-value=3.6e-08 Score=87.55 Aligned_cols=84 Identities=30% Similarity=0.583 Sum_probs=65.2
Q ss_pred CCCCCCCccCCCCcccCCCCccccccchhh-CCCCCEEeeCCCCeeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCCCe
Q psy2457 5 FPPRPGFEPSPSRLVASPRDYLGIRLSDSQ-CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPHDLCEPNPCGENAK 83 (189)
Q Consensus 5 ~~~~~gf~~~~~~~~~~~~~c~~~~~c~~~-C~~~~~C~~~~~~~~C~c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~~~ 83 (189)
+.|-+||.+. .+.++++.|... |.+++.|....|.|+|.|.+||+|. +|+-......|.+.-|.++++
T Consensus 1224 CrCPpGFTgd--------~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs~~agrCvpGvC~nggt 1292 (2531)
T KOG4289|consen 1224 CRCPPGFTGD--------YCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVSARAGRCVPGVCKNGGT 1292 (2531)
T ss_pred EeCCCCCCcc--------cccchhHhhhcCCCCCCCceEEecCceeEEecCCcccc---ceeeecccCccccceecCCCE
Confidence 4566777644 445689999888 9999999999999999999999999 996212234666777999999
Q ss_pred eecCCCCCCCCCCeeeCCCC
Q psy2457 84 CQPGYDKSGKDRPVCTCLPG 103 (189)
Q Consensus 84 C~~~~~~~~~~~~~c~c~~g 103 (189)
|.+ ...+.+.|.|+.|
T Consensus 1293 C~~----~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1293 CVN----LLNGGFCCHCPYG 1308 (2531)
T ss_pred Eee----cCCCceeccCCCc
Confidence 987 2445688888876
No 5
>KOG1219|consensus
Probab=98.46 E-value=3.4e-07 Score=84.63 Aligned_cols=89 Identities=25% Similarity=0.558 Sum_probs=78.0
Q ss_pred CCccCCCCcccCCCCccccccchhh-CCCCCEEeeCCCCeeeeCCCCCccCCCCCCcccCC-CCCCCCCCCCCCCeeecC
Q psy2457 10 GFEPSPSRLVASPRDYLGIRLSDSQ-CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDP-HDLCEPNPCGENAKCQPG 87 (189)
Q Consensus 10 gf~~~~~~~~~~~~~c~~~~~c~~~-C~~~~~C~~~~~~~~C~c~~g~~~~~~~~c~~~~~-~~~c~~~~c~~~~~C~~~ 87 (189)
||...-..++.+..+..++.+|... |..+++|+...+.|.|.|+.||+|. .|+ .. +++|..++|..++.|.+
T Consensus 3885 gy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~---~Ce--~~Gi~eCs~n~C~~gg~C~n- 3958 (4289)
T KOG1219|consen 3885 GYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK---RCE--ARGISECSKNVCGTGGQCIN- 3958 (4289)
T ss_pred ceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc---eee--cccccccccccccCCceeec-
Confidence 5655556666777777899999988 9999999999999999999999999 997 45 89999999999999998
Q ss_pred CCCCCCCCCeeeCCCCcccCC
Q psy2457 88 YDKSGKDRPVCTCLPGYVGDA 108 (189)
Q Consensus 88 ~~~~~~~~~~c~c~~g~~~~~ 108 (189)
..|+|.|.|.+|+.|..
T Consensus 3959 ----~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3959 ----IPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred ----cCCceEeccChhHhccc
Confidence 77889999999998766
No 6
>KOG1214|consensus
Probab=98.11 E-value=1.6e-05 Score=68.19 Aligned_cols=122 Identities=25% Similarity=0.554 Sum_probs=79.1
Q ss_pred CCCCCEEeeCC-CCeeeeCCCCCccCCCCCCcccCCCCCCC--CCCCCCCCeeecCCCCCCCCCCeeeCCCCcccCCC-C
Q psy2457 35 CGVNSECNVRN-HIPVCSCPPGYTGDPLTQCRRFDPHDLCE--PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDAL-T 110 (189)
Q Consensus 35 C~~~~~C~~~~-~~~~C~c~~g~~~~~~~~c~~~~~~~~c~--~~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~~-~ 110 (189)
|.....|.... -.|+|.|..||.+.++ .| .++++|. .+.|+..++|++ .++.++|.|..+|.-..- .
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gdgr-~c---~d~~eca~~~~~CGp~s~Cin-----~pg~~rceC~~gy~F~dd~~ 772 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGDGR-NC---VDENECATGFHRCGPNSVCIN-----LPGSYRCECRSGYEFADDRH 772 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCCCC-CC---CChhhhccCCCCCCCCceeec-----CCCceeEEEeecceeccCCc
Confidence 55555676654 4689999999998776 67 3567776 456899999998 778899999988753221 1
Q ss_pred cCccC-CCCCCCCCCCCCccCCCeecCCccccCCCCCEe--eeC-CCCeeeeCCCCCccCCcccCeecCCcc
Q psy2457 111 YCRRG-ECQSDAECNYDQVCNNYNCEKACTSQCGINAQC--TAR-NHVATCSCPAGYQGDALSRCYPAETTS 178 (189)
Q Consensus 111 ~c~~~-~c~~~~~c~~~~~c~~~~c~~~c~~~C~~~~~C--~~~-~g~~~C~C~~G~~g~~~~~c~~~~~~~ 178 (189)
.|... +-...+.|.... +.|...+.+ ... .+.|.|.|.+||.|++.. |.+++.|.
T Consensus 773 tCV~i~~pap~n~Ce~g~------------h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c~dvDeC~ 831 (1289)
T KOG1214|consen 773 TCVLITPPAPANPCEDGS------------HTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-CTDVDECS 831 (1289)
T ss_pred ceEEecCCCCCCccccCc------------cccCcCCceEEEecCCceEEEeecCCccCCccc-cccccccC
Confidence 12110 001122232210 146555444 433 358999999999999986 77887764
No 7
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.99 E-value=3e-06 Score=43.32 Aligned_cols=27 Identities=33% Similarity=0.784 Sum_probs=24.3
Q ss_pred cCCCCCEeeeCC-CCeeeeCCCCCccCC
Q psy2457 141 QCGINAQCTARN-HVATCSCPAGYQGDA 167 (189)
Q Consensus 141 ~C~~~~~C~~~~-g~~~C~C~~G~~g~~ 167 (189)
+|.++|+|++.. +.|.|.|++||+|++
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 588899999888 999999999999874
No 8
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.95 E-value=5.6e-06 Score=45.18 Aligned_cols=24 Identities=38% Similarity=0.974 Sum_probs=22.4
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCc
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQ 164 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~ 164 (189)
.|...+.|+|..|+|.|.|++||+
T Consensus 11 ~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 11 NCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSTTSEEEEETTEEEEEESTTEE
T ss_pred cCCCCCEEEcCCCCEEeeCCCCcE
Confidence 587789999999999999999998
No 9
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.73 E-value=2e-05 Score=42.95 Aligned_cols=32 Identities=34% Similarity=0.655 Sum_probs=28.1
Q ss_pred ccccchhh---CCCCCEEeeCCCCeeeeCCCCCcc
Q psy2457 27 GIRLSDSQ---CGVNSECNVRNHIPVCSCPPGYTG 58 (189)
Q Consensus 27 ~~~~c~~~---C~~~~~C~~~~~~~~C~c~~g~~~ 58 (189)
|+++|... |..++.|+|+.|+|.|.|++||..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 57788776 887899999999999999999983
No 10
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.72 E-value=2.6e-05 Score=40.91 Aligned_cols=28 Identities=43% Similarity=0.935 Sum_probs=23.2
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccCCc
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGDAL 168 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 168 (189)
.|+.+|+|++..++|.|.|++||.|++.
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 5899999999999999999999999875
No 11
>KOG1217|consensus
Probab=97.60 E-value=0.0006 Score=56.20 Aligned_cols=112 Identities=29% Similarity=0.662 Sum_probs=70.0
Q ss_pred ccchhh---CCCCCEEeeCCCCeeeeCCCCCccCCCCCCcccCC-------------------CCCCCC--CCCCCC-Ce
Q psy2457 29 RLSDSQ---CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDP-------------------HDLCEP--NPCGEN-AK 83 (189)
Q Consensus 29 ~~c~~~---C~~~~~C~~~~~~~~C~c~~g~~~~~~~~c~~~~~-------------------~~~c~~--~~c~~~-~~ 83 (189)
++|... |.+.+.|.+..++|.|.|.++|.+. .++ .. ...|.. ..+... +.
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~---~~~--~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~ 244 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS---TCE--TTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGT 244 (487)
T ss_pred cccccCCCCcCCCcccccCCCCeeEeCCCCccCC---cCc--CCCCCceEecceeccCCCCCCCCCcccccccccCCCCc
Confidence 556522 7777899998888999999999877 443 11 001110 112111 45
Q ss_pred eecCCCCCCCCCCeeeCCCCcccCCCCcCccCCCCCCCCCCCCCccCCCeecCCccccCCCCCEeeeCCCCeeeeCCCCC
Q psy2457 84 CQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGECQSDAECNYDQVCNNYNCEKACTSQCGINAQCTARNHVATCSCPAGY 163 (189)
Q Consensus 84 C~~~~~~~~~~~~~c~c~~g~~~~~~~~c~~~~c~~~~~c~~~~~c~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~ 163 (189)
|.+ ..+.+.|.+.+||.+.... .+.+++.|.... .|.++++|.+..+.|.|.|++||
T Consensus 245 c~~-----~~~~~~C~~~~g~~~~~~~-----~~~~~~~C~~~~-------------~c~~~~~C~~~~~~~~C~C~~g~ 301 (487)
T KOG1217|consen 245 CVN-----TVGSYTCRCPEGYTGDACV-----TCVDVDSCALIA-------------SCPNGGTCVNVPGSYRCTCPPGF 301 (487)
T ss_pred ccc-----cCCceeeeCCCCccccccc-----eeeeccccCCCC-------------ccCCCCeeecCCCcceeeCCCCC
Confidence 554 4455788888888776410 122222222210 26777899988888999999999
Q ss_pred ccCCc
Q psy2457 164 QGDAL 168 (189)
Q Consensus 164 ~g~~~ 168 (189)
+|..+
T Consensus 302 ~g~~~ 306 (487)
T KOG1217|consen 302 TGRLC 306 (487)
T ss_pred CCCCC
Confidence 98886
No 12
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.57 E-value=0.00013 Score=38.67 Aligned_cols=27 Identities=30% Similarity=0.789 Sum_probs=23.6
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCc-cCC
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQ-GDA 167 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~-g~~ 167 (189)
+|.++++|++..++|.|.|++||. |..
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~ 37 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGRN 37 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCCc
Confidence 587778999999999999999998 654
No 13
>KOG1225|consensus
Probab=97.42 E-value=0.00043 Score=57.64 Aligned_cols=92 Identities=34% Similarity=0.923 Sum_probs=44.2
Q ss_pred eeeCCCCCccCCCCCCcccCCCCCCCCCCCCCCCeeecCCCCCCCCCCeeeCCCCcccCCCCcCccCCCCCCCCCCCCCc
Q psy2457 49 VCSCPPGYTGDPLTQCRRFDPHDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGECQSDAECNYDQV 128 (189)
Q Consensus 49 ~C~c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~~~~c~~~~c~~~~~c~~~~~ 128 (189)
+|.|++||.+. +|. ...|... |..++.+.+ ..|.|.++|.|..++ +..|. ..|..+..
T Consensus 266 ~CIC~~Gf~G~---dC~----e~~Cp~~-cs~~g~~~~---------g~CiC~~g~~G~dCs---~~~cp--adC~g~G~ 323 (525)
T KOG1225|consen 266 RCICPPGFTGD---DCD----ELVCPVD-CSGGGVCVD---------GECICNPGYSGKDCS---IRRCP--ADCSGHGK 323 (525)
T ss_pred eEeCCCCCcCC---CCC----cccCCcc-cCCCceecC---------CEeecCCCccccccc---cccCC--ccCCCCCc
Confidence 46677777666 562 1233222 544444443 367777777766532 22222 23333333
Q ss_pred cCCCee-------cCCccc-cCCCCCEeeeCCCCeeeeCCCCCccCC
Q psy2457 129 CNNYNC-------EKACTS-QCGINAQCTARNHVATCSCPAGYQGDA 167 (189)
Q Consensus 129 c~~~~c-------~~~c~~-~C~~~~~C~~~~g~~~C~C~~G~~g~~ 167 (189)
|...+| -+.|.. .|.+++.|++. |.|..||+|.+
T Consensus 324 Ci~G~C~C~~Gy~G~~C~~~~C~~~g~cv~g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 324 CIDGECLCDEGYTGELCIQRACSGGGQCVNG-----CKCKKGWRGPD 365 (525)
T ss_pred ccCCceEeCCCCcCCcccccccCCCceeccC-----ceeccCccCCC
Confidence 443333 122322 35555555431 66666666665
No 14
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.16 E-value=0.00074 Score=35.24 Aligned_cols=27 Identities=33% Similarity=0.813 Sum_probs=23.5
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccCC
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGDA 167 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 167 (189)
+|.+++.|.+..+.|.|.|++||.|..
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRN 36 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCc
Confidence 477778999999999999999999865
No 15
>KOG1225|consensus
Probab=97.13 E-value=0.002 Score=53.76 Aligned_cols=45 Identities=38% Similarity=0.989 Sum_probs=35.3
Q ss_pred eeeCCCCCccCCCCCCcccCCCCCCCCCCCCCCCeeecCCCCCCCCCCeeeCCCCcccCCCC
Q psy2457 49 VCSCPPGYTGDPLTQCRRFDPHDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDALT 110 (189)
Q Consensus 49 ~C~c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~~~ 110 (189)
.|.|.+||.|. .|. +..| +..|...+.|+. ..|.|.+||.|..+.
T Consensus 297 ~CiC~~g~~G~---dCs----~~~c-padC~g~G~Ci~---------G~C~C~~Gy~G~~C~ 341 (525)
T KOG1225|consen 297 ECICNPGYSGK---DCS----IRRC-PADCSGHGKCID---------GECLCDEGYTGELCI 341 (525)
T ss_pred EeecCCCcccc---ccc----cccC-CccCCCCCcccC---------CceEeCCCCcCCccc
Confidence 79999999999 884 2333 467878888874 689999999998744
No 16
>KOG4260|consensus
Probab=97.12 E-value=0.0013 Score=49.84 Aligned_cols=107 Identities=24% Similarity=0.592 Sum_probs=65.3
Q ss_pred CCCCCccCCCCCCcccCCCCCCCCCCCCCCCeeecCCCCCCCCCCeeeCCCCcccCCCCcCccC-----------CCC--
Q psy2457 52 CPPGYTGDPLTQCRRFDPHDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRG-----------ECQ-- 118 (189)
Q Consensus 52 c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~~~~c~~~-----------~c~-- 118 (189)
|+.|..|+ +|. .-.--...||...+.|.. +.+ ..|+..|.|..||.|..+..|.+. .|.
T Consensus 132 Cp~gtyGp---dCl---~Cpggser~C~GnG~C~G-dGs-R~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~C 203 (350)
T KOG4260|consen 132 CPDGTYGP---DCL---QCPGGSERPCFGNGSCHG-DGS-REGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTAC 203 (350)
T ss_pred cCCCCcCC---ccc---cCCCCCcCCcCCCCcccC-CCC-CCCCCcccccCCCCCccccccchHHHHhhcccccchhhhh
Confidence 77777777 663 111112356777777754 111 346788999999999887654320 010
Q ss_pred -C--CCCCCCC-----CccCC------Cee--cCCccc---cCCCCCEeeeCCCCeeeeCCCCCccC
Q psy2457 119 -S--DAECNYD-----QVCNN------YNC--EKACTS---QCGINAQCTARNHVATCSCPAGYQGD 166 (189)
Q Consensus 119 -~--~~~c~~~-----~~c~~------~~c--~~~c~~---~C~~~~~C~~~~g~~~C~C~~G~~g~ 166 (189)
. ...|... ..|.+ .-| ||+|.. +|..+-.|+|+.|+|.|.+.+||.+.
T Consensus 204 h~~C~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g 270 (350)
T KOG4260|consen 204 HEGCLGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG 270 (350)
T ss_pred hhhhhcccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC
Confidence 0 0011100 01111 112 688865 79888899999999999999999963
No 17
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.10 E-value=0.0008 Score=35.52 Aligned_cols=32 Identities=38% Similarity=0.735 Sum_probs=26.2
Q ss_pred cccchh-h-CCCCCEEeeCCCCeeeeCCCCCc-cC
Q psy2457 28 IRLSDS-Q-CGVNSECNVRNHIPVCSCPPGYT-GD 59 (189)
Q Consensus 28 ~~~c~~-~-C~~~~~C~~~~~~~~C~c~~g~~-~~ 59 (189)
+++|.. . |.+++.|++..++|.|.|+.||. +.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 455655 3 87778999999999999999998 65
No 18
>KOG1217|consensus
Probab=97.07 E-value=0.0044 Score=51.08 Aligned_cols=115 Identities=26% Similarity=0.653 Sum_probs=75.4
Q ss_pred Cccccccchhh--CCCCCEEeeCCCCeeeeCCCCCccCCCCCCcccCCCCCCC----CCCCCCCCeeecCCCCCCCCCCe
Q psy2457 24 DYLGIRLSDSQ--CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPHDLCE----PNPCGENAKCQPGYDKSGKDRPV 97 (189)
Q Consensus 24 ~c~~~~~c~~~--C~~~~~C~~~~~~~~C~c~~g~~~~~~~~c~~~~~~~~c~----~~~c~~~~~C~~~~~~~~~~~~~ 97 (189)
.+.+++.|... |.++++|++..+.|.|.|++||.+. .+........|. ..+|...+.|.. ......+.
T Consensus 267 ~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~---~~~~~~~~~~C~~~~~~~~c~~g~~C~~---~~~~~~~~ 340 (487)
T KOG1217|consen 267 TCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR---LCTECVDVDECSPRNAGGPCANGGTCNT---LGSFGGFR 340 (487)
T ss_pred eeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCC---CCccccccccccccccCCcCCCCccccc---CCCCCCCC
Confidence 45577777765 7778899999888999999999988 431002345663 345666666622 01223466
Q ss_pred eeCCCCcccCCCCcCccCCCCCC-CCCCCCCccCCCeecCCccccCCCCCEeee-CCCCeeeeCCCCCccC
Q psy2457 98 CTCLPGYVGDALTYCRRGECQSD-AECNYDQVCNNYNCEKACTSQCGINAQCTA-RNHVATCSCPAGYQGD 166 (189)
Q Consensus 98 c~c~~g~~~~~~~~c~~~~c~~~-~~c~~~~~c~~~~c~~~c~~~C~~~~~C~~-~~g~~~C~C~~G~~g~ 166 (189)
|.+..++.+..+ ... ++|... .+...+.|.+ ..+.|.|.|+.+|.+.
T Consensus 341 C~c~~~~~g~~C--------~~~~~~C~~~--------------~~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 341 CACGPGFTGRRC--------EDSNDECASS--------------PCCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred cCCCCCCCCCcc--------ccCCccccCC--------------ccccCCEeccCCCCCeEecCCCccccC
Confidence 888888766653 222 123221 3666788888 6889999999998873
No 19
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.97 E-value=0.0015 Score=33.48 Aligned_cols=26 Identities=35% Similarity=0.906 Sum_probs=23.2
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccC
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGD 166 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 166 (189)
+|.+++.|++..+.|.|.|+.||.|.
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCccc
Confidence 47777899999999999999999987
No 20
>KOG4260|consensus
Probab=96.76 E-value=0.00084 Score=50.78 Aligned_cols=82 Identities=17% Similarity=0.433 Sum_probs=56.7
Q ss_pred CCCCCCCccCCCCcccCCCCccccccchhh---CCCCCEEeeCCCCeeeeCCCCCccCCCCCCcccCCCCCCCCCCCCCC
Q psy2457 5 FPPRPGFEPSPSRLVASPRDYLGIRLSDSQ---CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPHDLCEPNPCGEN 81 (189)
Q Consensus 5 ~~~~~gf~~~~~~~~~~~~~c~~~~~c~~~---C~~~~~C~~~~~~~~C~c~~g~~~~~~~~c~~~~~~~~c~~~~c~~~ 81 (189)
..|++||.+. -..|.|+++|... |..+..|+|+.|+|.|...+||..- ...|+ ...+.|.. ..
T Consensus 220 ~kCkkGW~ld-------e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-~d~C~--~~~d~~~~----kn 285 (350)
T KOG4260|consen 220 SKCKKGWKLD-------EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-VDECQ--FCADVCAS----KN 285 (350)
T ss_pred hhhcccceec-------ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC-hHHhh--hhhhhccc----CC
Confidence 3567788744 4569999999876 8888899999999999999998752 21453 11122221 23
Q ss_pred CeeecCCCCCCCCCCeeeCCCCcc
Q psy2457 82 AKCQPGYDKSGKDRPVCTCLPGYV 105 (189)
Q Consensus 82 ~~C~~~~~~~~~~~~~c~c~~g~~ 105 (189)
..|++ +.+.|.|+|..++.
T Consensus 286 ~~c~n-----i~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 286 RPCMN-----IDGQYRCVCFSGLI 304 (350)
T ss_pred CCccc-----CCccEEEEecccce
Confidence 45666 56778899887764
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.75 E-value=0.00074 Score=34.35 Aligned_cols=25 Identities=40% Similarity=0.867 Sum_probs=22.9
Q ss_pred CCCCCEEeeCC-CCeeeeCCCCCccC
Q psy2457 35 CGVNSECNVRN-HIPVCSCPPGYTGD 59 (189)
Q Consensus 35 C~~~~~C~~~~-~~~~C~c~~g~~~~ 59 (189)
|.++++|+... +.|.|.|++||.|+
T Consensus 6 C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 6 CQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp STTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCeEEEeCCCCCEEeECCCCCccC
Confidence 88889999998 89999999999876
No 22
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.69 E-value=0.0019 Score=30.47 Aligned_cols=23 Identities=39% Similarity=0.730 Sum_probs=16.9
Q ss_pred CeeeeCCCCCccC-CcccCeecCC
Q psy2457 154 VATCSCPAGYQGD-ALSRCYPAET 176 (189)
Q Consensus 154 ~~~C~C~~G~~g~-~~~~c~~~~~ 176 (189)
+|.|.|++||+.. .-.+|.+|++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 5899999999943 2255888763
No 23
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.53 E-value=0.00098 Score=47.73 Aligned_cols=126 Identities=26% Similarity=0.656 Sum_probs=69.4
Q ss_pred CCCCCEEeeCCCCeeeeCCCCCccCCCCCCcccCCCCCCC-----CCCCCCCCeeecCCCCCCCCCCeeeCCCCcccCCC
Q psy2457 35 CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPHDLCE-----PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDAL 109 (189)
Q Consensus 35 C~~~~~C~~~~~~~~C~c~~g~~~~~~~~c~~~~~~~~c~-----~~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~~ 109 (189)
|++ |..+...+.|.|.|..||.......|+ ...+|. ..+|...+.|........+..+.|.|.+||....-
T Consensus 8 CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE---~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~ 83 (197)
T PF06247_consen 8 CKN-GYLIQMSNHFECKCNEGFVLKNENTCE---EKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG 83 (197)
T ss_dssp -BT-EEEEEESSEEEEEESTTEEEEETTEEE---E----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS
T ss_pred ccC-CEEEEccCceEEEcCCCcEEccccccc---cceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC
Confidence 553 555555678999999999865443575 234443 35688889998733223345689999999976542
Q ss_pred CcCccCCCCCCCCCCCCCccC-------CCee------------------cCCccccCCCCCEeeeCCCCeeeeCCCCCc
Q psy2457 110 TYCRRGECQSDAECNYDQVCN-------NYNC------------------EKACTSQCGINAQCTARNHVATCSCPAGYQ 164 (189)
Q Consensus 110 ~~c~~~~c~~~~~c~~~~~c~-------~~~c------------------~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~ 164 (189)
.|.+..|.+. .|. ...|+ ...| ..+|...|..+..|....+-|.|.+.+|+.
T Consensus 84 -vCvp~~C~~~-~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~ 160 (197)
T PF06247_consen 84 -VCVPNKCNNK-DCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCKEGFP 160 (197)
T ss_dssp -SEEEGGGSS----T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-TT-E
T ss_pred -eEchhhcCce-ecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecCCCCC
Confidence 2555555442 222 11120 0122 245655687778999999999999999987
Q ss_pred cCC
Q psy2457 165 GDA 167 (189)
Q Consensus 165 g~~ 167 (189)
++.
T Consensus 161 ~~~ 163 (197)
T PF06247_consen 161 GDG 163 (197)
T ss_dssp EET
T ss_pred CCC
Confidence 433
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.40 E-value=0.0054 Score=31.81 Aligned_cols=31 Identities=39% Similarity=0.738 Sum_probs=25.0
Q ss_pred ccchh-h-CCCCCEEeeCCCCeeeeCCCCCccC
Q psy2457 29 RLSDS-Q-CGVNSECNVRNHIPVCSCPPGYTGD 59 (189)
Q Consensus 29 ~~c~~-~-C~~~~~C~~~~~~~~C~c~~g~~~~ 59 (189)
++|.. . |.+++.|.+..+.|.|.|..||.+.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 44544 3 7767899999999999999999886
No 25
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.32 E-value=0.0037 Score=32.65 Aligned_cols=26 Identities=38% Similarity=0.926 Sum_probs=21.1
Q ss_pred CCCCCEEeeCCCCeeeeCCCCCccCC
Q psy2457 35 CGVNSECNVRNHIPVCSCPPGYTGDP 60 (189)
Q Consensus 35 C~~~~~C~~~~~~~~C~c~~g~~~~~ 60 (189)
|..+++|++..+++.|.|.+||.|.+
T Consensus 8 C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 8 CHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp S-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCcEeecCCCCEEeECCCCCccCC
Confidence 88889999999999999999999874
No 26
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.23 E-value=0.0074 Score=30.96 Aligned_cols=25 Identities=44% Similarity=1.006 Sum_probs=21.5
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccC
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGD 166 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 166 (189)
+|.++ .|++..++|.|.|++||.|.
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccC
Confidence 36666 89989999999999999984
No 27
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.92 E-value=0.0032 Score=25.16 Aligned_cols=13 Identities=46% Similarity=1.239 Sum_probs=10.0
Q ss_pred eeeCCCCCccCCc
Q psy2457 156 TCSCPAGYQGDAL 168 (189)
Q Consensus 156 ~C~C~~G~~g~~~ 168 (189)
.|.|++||+|.++
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4899999999874
No 28
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.49 E-value=0.012 Score=30.77 Aligned_cols=22 Identities=32% Similarity=0.711 Sum_probs=17.3
Q ss_pred CEeeeCCCCeeeeCCCCCccCC
Q psy2457 146 AQCTARNHVATCSCPAGYQGDA 167 (189)
Q Consensus 146 ~~C~~~~g~~~C~C~~G~~g~~ 167 (189)
..|++.+++|+|.|++||+...
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-T
T ss_pred CCCccCCCceEeECCCCCEECc
Confidence 3799999999999999998443
No 29
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.20 E-value=0.033 Score=28.24 Aligned_cols=26 Identities=27% Similarity=0.747 Sum_probs=20.4
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccCCc
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGDAL 168 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 168 (189)
.|..+++|.... .+|.|.+||.|..+
T Consensus 7 ~C~~~G~C~~~~--g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPC--GRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCC--CEEECCCCCcCCCC
Confidence 377889998652 38999999998763
No 30
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.15 E-value=0.034 Score=28.40 Aligned_cols=23 Identities=48% Similarity=1.057 Sum_probs=20.5
Q ss_pred CCCCCEEeeCCCCeeeeCCCCCcc
Q psy2457 35 CGVNSECNVRNHIPVCSCPPGYTG 58 (189)
Q Consensus 35 C~~~~~C~~~~~~~~C~c~~g~~~ 58 (189)
|.++ .|++..+.|.|.|+.||.+
T Consensus 8 C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 8 CSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCC-EEECCCCCeEeECCCCCcc
Confidence 6666 8999989999999999988
No 31
>KOG1226|consensus
Probab=95.00 E-value=0.24 Score=43.05 Aligned_cols=14 Identities=36% Similarity=1.189 Sum_probs=11.2
Q ss_pred eeeCCCCCccCCCCCCc
Q psy2457 49 VCSCPPGYTGDPLTQCR 65 (189)
Q Consensus 49 ~C~c~~g~~~~~~~~c~ 65 (189)
.|.|.+||.|+ .|+
T Consensus 479 ~C~C~~G~~G~---~CE 492 (783)
T KOG1226|consen 479 QCRCDEGWLGK---KCE 492 (783)
T ss_pred ceecCCCCCCC---ccc
Confidence 47899999988 664
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.85 E-value=0.043 Score=27.74 Aligned_cols=25 Identities=44% Similarity=0.979 Sum_probs=21.9
Q ss_pred CCCCCEEeeCCCCeeeeCCCCCccC
Q psy2457 35 CGVNSECNVRNHIPVCSCPPGYTGD 59 (189)
Q Consensus 35 C~~~~~C~~~~~~~~C~c~~g~~~~ 59 (189)
|.+++.|++..+.+.|.|+.||.+.
T Consensus 8 C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 8 CSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCEEecCCCCeEeECCCCCccc
Confidence 7667899999899999999999765
No 33
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.74 E-value=0.033 Score=29.06 Aligned_cols=23 Identities=39% Similarity=0.798 Sum_probs=19.1
Q ss_pred CCCCCEEeeCCCCeeeeCCCCCccC
Q psy2457 35 CGVNSECNVRNHIPVCSCPPGYTGD 59 (189)
Q Consensus 35 C~~~~~C~~~~~~~~C~c~~g~~~~ 59 (189)
|.+ +|++.+++++|.|++||...
T Consensus 8 C~h--~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 8 CSH--ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSS--EEEEETTSEEEE-STTEEE-
T ss_pred cCC--CCccCCCceEeECCCCCEEC
Confidence 765 99999999999999999865
No 34
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.69 E-value=0.024 Score=26.68 Aligned_cols=22 Identities=23% Similarity=0.285 Sum_probs=18.5
Q ss_pred cCCCCCCCccCCCCcccCCCCcccccc
Q psy2457 4 KFPPRPGFEPSPSRLVASPRDYLGIRL 30 (189)
Q Consensus 4 ~~~~~~gf~~~~~~~~~~~~~c~~~~~ 30 (189)
...|..||.+. .+.+.|.||++
T Consensus 3 ~C~C~~Gy~l~-----~d~~~C~DIdE 24 (24)
T PF12662_consen 3 TCSCPPGYQLS-----PDGRSCEDIDE 24 (24)
T ss_pred EeeCCCCCcCC-----CCCCccccCCC
Confidence 45789999988 78899999885
No 35
>KOG1226|consensus
Probab=92.24 E-value=1.2 Score=38.97 Aligned_cols=58 Identities=29% Similarity=0.730 Sum_probs=35.6
Q ss_pred CCCCCEEeeCCCCeeeeCCCCCc----cCCCCCCcccCCCCCCCC---CCCCCCCeeecCCCCCCCCCCeeeCCCCcccC
Q psy2457 35 CGVNSECNVRNHIPVCSCPPGYT----GDPLTQCRRFDPHDLCEP---NPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD 107 (189)
Q Consensus 35 C~~~~~C~~~~~~~~C~c~~g~~----~~~~~~c~~~~~~~~c~~---~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~ 107 (189)
|...|.|.=. .|.|..... |+ .|+ -+--.|.. ..|...+.|.- ..|+|.+||.|.
T Consensus 516 CSgrG~C~CG----qC~C~~~~~~~i~G~---fCE--CDnfsC~r~~g~lC~g~G~C~C---------G~CvC~~GwtG~ 577 (783)
T KOG1226|consen 516 CSGRGDCVCG----QCVCHKPDNGKIYGK---FCE--CDNFSCERHKGVLCGGHGRCEC---------GRCVCNPGWTGS 577 (783)
T ss_pred cCCCCcEeCC----ceEecCCCCCceeee---eee--ccCcccccccCcccCCCCeEeC---------CcEEcCCCCccC
Confidence 5555555321 366665554 55 664 23333432 34777777764 679999999999
Q ss_pred CCC
Q psy2457 108 ALT 110 (189)
Q Consensus 108 ~~~ 110 (189)
.+.
T Consensus 578 ~C~ 580 (783)
T KOG1226|consen 578 ACN 580 (783)
T ss_pred CCC
Confidence 865
No 36
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.89 E-value=0.12 Score=26.94 Aligned_cols=28 Identities=29% Similarity=0.592 Sum_probs=19.7
Q ss_pred cCCCCCEeeeCC-CCeeeeCCCCCccCCc
Q psy2457 141 QCGINAQCTARN-HVATCSCPAGYQGDAL 168 (189)
Q Consensus 141 ~C~~~~~C~~~~-g~~~C~C~~G~~g~~~ 168 (189)
.|..++.|++.. |.+.|.|..||+.++-
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred cCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence 466778898776 9999999999986553
No 37
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=89.27 E-value=0.38 Score=35.96 Aligned_cols=36 Identities=19% Similarity=0.331 Sum_probs=29.1
Q ss_pred CCCccccccchhh---CCCCCEEeeCCCCeeeeCCCCCccC
Q psy2457 22 PRDYLGIRLSDSQ---CGVNSECNVRNHIPVCSCPPGYTGD 59 (189)
Q Consensus 22 ~~~c~~~~~c~~~---C~~~~~C~~~~~~~~C~c~~g~~~~ 59 (189)
...|.+.++|... |.+ .|.++.|+|.|.|+.||...
T Consensus 181 ~~~C~~~~~C~~~~~~c~~--~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICVVPDLCATLSHVCQQ--VCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCcCchhhcCCCCCccc--eEEcCCCCEEeECCCCccCC
Confidence 4467788888654 654 89999999999999999864
No 38
>PHA02887 EGF-like protein; Provisional
Probab=87.76 E-value=0.62 Score=30.93 Aligned_cols=28 Identities=29% Similarity=0.560 Sum_probs=21.1
Q ss_pred cCCCCCEeeeC--CCCeeeeCCCCCccCCcc
Q psy2457 141 QCGINAQCTAR--NHVATCSCPAGYQGDALS 169 (189)
Q Consensus 141 ~C~~~~~C~~~--~g~~~C~C~~G~~g~~~~ 169 (189)
.|. +|+|.-. ...+.|.|+.||+|.+|.
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 466 4688543 456789999999999964
No 39
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=87.34 E-value=0.93 Score=30.64 Aligned_cols=52 Identities=27% Similarity=0.590 Sum_probs=35.9
Q ss_pred CCccCCCCcccCCCCccccccchhh----CCCCCEEeeCC--CCeeeeCCCCCccCCCCCCc
Q psy2457 10 GFEPSPSRLVASPRDYLGIRLSDSQ----CGVNSECNVRN--HIPVCSCPPGYTGDPLTQCR 65 (189)
Q Consensus 10 gf~~~~~~~~~~~~~c~~~~~c~~~----C~~~~~C~~~~--~~~~C~c~~g~~~~~~~~c~ 65 (189)
..+-.+.++......-.++.+|... |-+ |.|.-.. ..+.|.|..||.|. .|+
T Consensus 24 ~~~~~~~~~~~~~~~~~~i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCE 81 (139)
T PHA03099 24 AIETTSPEITNATTDIPAIRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQ 81 (139)
T ss_pred eeeecChhhccCccCCcccccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc---ccc
Confidence 3344445544444455567777665 776 5887654 67899999999999 996
No 40
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=86.49 E-value=0.76 Score=31.05 Aligned_cols=28 Identities=25% Similarity=0.522 Sum_probs=21.6
Q ss_pred cCCCCCEeee--CCCCeeeeCCCCCccCCcc
Q psy2457 141 QCGINAQCTA--RNHVATCSCPAGYQGDALS 169 (189)
Q Consensus 141 ~C~~~~~C~~--~~g~~~C~C~~G~~g~~~~ 169 (189)
.|.+ |+|.- ....+.|.|..||+|++|.
T Consensus 52 YClH-G~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 52 YCLH-GDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred EeEC-CEEEeeccCCCceeECCCCccccccc
Confidence 4666 47854 3467899999999999975
No 41
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.48 E-value=0.91 Score=33.91 Aligned_cols=25 Identities=24% Similarity=0.565 Sum_probs=20.5
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccCC
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGDA 167 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 167 (189)
.|. ..|.+..|+|.|.|+.||+...
T Consensus 196 ~c~--~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 196 VCQ--QVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred Ccc--ceEEcCCCCEEeECCCCccCCC
Confidence 455 4799999999999999998543
No 42
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Probab=83.29 E-value=1.4 Score=24.49 Aligned_cols=23 Identities=26% Similarity=0.725 Sum_probs=17.0
Q ss_pred eeeCCCCeeeeCCCCCccCCcccCe
Q psy2457 148 CTARNHVATCSCPAGYQGDALSRCY 172 (189)
Q Consensus 148 C~~~~g~~~C~C~~G~~g~~~~~c~ 172 (189)
|....| .|.|+++++|..+..|.
T Consensus 14 C~~~~G--~C~C~~~~~G~~C~~C~ 36 (50)
T cd00055 14 CDPGTG--QCECKPNTTGRRCDRCA 36 (50)
T ss_pred ccCCCC--EEeCCCcCCCCCCCCCC
Confidence 543334 79999999999987664
No 43
>KOG0994|consensus
Probab=83.10 E-value=2 Score=39.62 Aligned_cols=25 Identities=32% Similarity=0.834 Sum_probs=16.7
Q ss_pred EeeeCCCCeeeeCCCCCccCCcccCee
Q psy2457 147 QCTARNHVATCSCPAGYQGDALSRCYP 173 (189)
Q Consensus 147 ~C~~~~g~~~C~C~~G~~g~~~~~c~~ 173 (189)
+|-...| .|+|.+||-|..+..|.+
T Consensus 1078 qCN~ftG--QCqCkpGfGGR~C~qCqe 1102 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGRTCSQCQE 1102 (1758)
T ss_pred ccccccc--ceeccCCCCCcchhHHHH
Confidence 4544444 689999998887654443
No 44
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=81.74 E-value=0.94 Score=25.07 Aligned_cols=25 Identities=28% Similarity=0.798 Sum_probs=17.6
Q ss_pred EeeeCCCCeeeeCCCCCccCCcccCee
Q psy2457 147 QCTARNHVATCSCPAGYQGDALSRCYP 173 (189)
Q Consensus 147 ~C~~~~g~~~C~C~~G~~g~~~~~c~~ 173 (189)
.|....| .|.|+++|+|..+..|.+
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C~~ 36 (49)
T PF00053_consen 12 TCDPSTG--QCVCKPGTTGPRCDQCKP 36 (49)
T ss_dssp SEEETCE--EESBSTTEESTTS-EE-T
T ss_pred cccCCCC--EEeccccccCCcCcCCCC
Confidence 5655444 899999999999876643
No 45
>PHA02887 EGF-like protein; Provisional
Probab=76.39 E-value=4.8 Score=26.83 Aligned_cols=36 Identities=31% Similarity=0.605 Sum_probs=27.7
Q ss_pred cccccchhh----CCCCCEEeeCC--CCeeeeCCCCCccCCCCCCc
Q psy2457 26 LGIRLSDSQ----CGVNSECNVRN--HIPVCSCPPGYTGDPLTQCR 65 (189)
Q Consensus 26 ~~~~~c~~~----C~~~~~C~~~~--~~~~C~c~~g~~~~~~~~c~ 65 (189)
..+.+|... |- +|.|.-.. ..+.|.|..||.|. .|+
T Consensus 81 ~hf~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE 122 (126)
T PHA02887 81 MFFEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGI---RCD 122 (126)
T ss_pred cCccccChHhhCEee-CCEEEccccCCCceeECCCCcccC---CCC
Confidence 456777765 77 47997653 56789999999999 895
No 46
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=76.36 E-value=2.7 Score=27.63 Aligned_cols=23 Identities=39% Similarity=1.005 Sum_probs=17.3
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCc
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQ 164 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~ 164 (189)
.|+..+.|.. .....|.|.+||.
T Consensus 85 ~CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 85 FCGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred ccCCccEeCC-CCCCceECCCCcC
Confidence 4888888953 3455799999996
No 47
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=74.90 E-value=3.4 Score=22.55 Aligned_cols=17 Identities=29% Similarity=0.802 Sum_probs=13.9
Q ss_pred eeeCCCCCccCCcccCe
Q psy2457 156 TCSCPAGYQGDALSRCY 172 (189)
Q Consensus 156 ~C~C~~G~~g~~~~~c~ 172 (189)
.|.|+++++|..+..|.
T Consensus 19 ~C~C~~~~~G~~C~~C~ 35 (46)
T smart00180 19 QCECKPNVTGRRCDRCA 35 (46)
T ss_pred EEECCCCCCCCCCCcCC
Confidence 79999999998876553
No 48
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with Kunitz domains (IPR002223 from INTERPRO)
Probab=73.42 E-value=9.7 Score=21.15 Aligned_cols=22 Identities=36% Similarity=0.841 Sum_probs=16.1
Q ss_pred cCCCCCEeeeCCCCeeeeCCCCCccC
Q psy2457 141 QCGINAQCTARNHVATCSCPAGYQGD 166 (189)
Q Consensus 141 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 166 (189)
.|..++.|++. .|.|++||.-.
T Consensus 27 qC~~~s~C~~g----~C~C~~g~~~~ 48 (52)
T PF01683_consen 27 QCIGGSVCVNG----RCQCPPGYVEV 48 (52)
T ss_pred CCCCcCEEcCC----EeECCCCCEec
Confidence 46667788553 89999998743
No 49
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=67.77 E-value=5.4 Score=26.16 Aligned_cols=30 Identities=43% Similarity=0.915 Sum_probs=21.9
Q ss_pred ccchhh--CCCCCEEeeCCCCeeeeCCCCCccC
Q psy2457 29 RLSDSQ--CGVNSECNVRNHIPVCSCPPGYTGD 59 (189)
Q Consensus 29 ~~c~~~--C~~~~~C~~~~~~~~C~c~~g~~~~ 59 (189)
+.|... |+..+.|.. .....|.|.+||..+
T Consensus 78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 456544 888999943 455679999999754
No 50
>KOG0994|consensus
Probab=67.61 E-value=19 Score=33.79 Aligned_cols=21 Identities=33% Similarity=0.819 Sum_probs=14.5
Q ss_pred eeeCCCCeeee-CCCCCccCCc
Q psy2457 148 CTARNHVATCS-CPAGYQGDAL 168 (189)
Q Consensus 148 C~~~~g~~~C~-C~~G~~g~~~ 168 (189)
|.+..++++|. |..||.|+..
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~ 899 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPR 899 (1758)
T ss_pred ccccccccchhhhhccccCCcc
Confidence 56666777775 7888876653
No 51
>KOG1836|consensus
Probab=56.08 E-value=45 Score=32.92 Aligned_cols=33 Identities=27% Similarity=0.669 Sum_probs=21.7
Q ss_pred cCCCCCEeeeC--CCCeeee-CCCCCccCCcccCee
Q psy2457 141 QCGINAQCTAR--NHVATCS-CPAGYQGDALSRCYP 173 (189)
Q Consensus 141 ~C~~~~~C~~~--~g~~~C~-C~~G~~g~~~~~c~~ 173 (189)
+|..++.|... .....|. |++||+|.++..|.+
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence 45555555432 3455788 999999999865544
No 52
>KOG3512|consensus
Probab=55.89 E-value=62 Score=27.31 Aligned_cols=22 Identities=23% Similarity=0.546 Sum_probs=15.6
Q ss_pred CCEEeeCCC-CeeeeCCCCCccC
Q psy2457 38 NSECNVRNH-IPVCSCPPGYTGD 59 (189)
Q Consensus 38 ~~~C~~~~~-~~~C~c~~g~~~~ 59 (189)
...|+.... .++|.|...-.|+
T Consensus 284 As~Cv~d~~~~ltCdC~HNTaGP 306 (592)
T KOG3512|consen 284 ASRCVMDESSHLTCDCEHNTAGP 306 (592)
T ss_pred cceeeeccCCceEEecccCCCCC
Confidence 346776654 4889988887777
No 53
>KOG3516|consensus
Probab=50.29 E-value=12 Score=34.74 Aligned_cols=35 Identities=23% Similarity=0.673 Sum_probs=30.6
Q ss_pred cCCccc-cCCCCCEeeeCCCCeeeeCC-CCCccCCcc
Q psy2457 135 EKACTS-QCGINAQCTARNHVATCSCP-AGYQGDALS 169 (189)
Q Consensus 135 ~~~c~~-~C~~~~~C~~~~g~~~C~C~-~G~~g~~~~ 169 (189)
+|.|.+ +|.+++.|.-+...|.|.|. .||+|..|.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH 581 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCH 581 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccccccc
Confidence 567766 89999999888999999997 999999884
No 54
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=50.02 E-value=14 Score=18.86 Aligned_cols=12 Identities=50% Similarity=1.126 Sum_probs=10.0
Q ss_pred eeeCCCCCccCC
Q psy2457 156 TCSCPAGYQGDA 167 (189)
Q Consensus 156 ~C~C~~G~~g~~ 167 (189)
.|.||.||..+.
T Consensus 19 ~C~CPeGyIlde 30 (34)
T PF09064_consen 19 QCFCPEGYILDE 30 (34)
T ss_pred ceeCCCceEecC
Confidence 799999998554
No 55
>KOG3607|consensus
Probab=45.43 E-value=27 Score=31.19 Aligned_cols=48 Identities=21% Similarity=0.636 Sum_probs=35.1
Q ss_pred CCCCCCCCCccCCCee-------cCCccccCCCCCEeeeCCCCeeeeCCCCCccCCcc
Q psy2457 119 SDAECNYDQVCNNYNC-------EKACTSQCGINAQCTARNHVATCSCPAGYQGDALS 169 (189)
Q Consensus 119 ~~~~c~~~~~c~~~~c-------~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~ 169 (189)
++..|.....|.+.+| .+.|...|..+|+|-+. ++|+|.+||.+-.|.
T Consensus 602 dGt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~---~~ChC~~gwapp~C~ 656 (716)
T KOG3607|consen 602 DGTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNE---LNCHCEPGWAPPFCF 656 (716)
T ss_pred CCCccCCCceecCCcchhhhhhcccccccccCCCcccCCC---cceeeCCCCCCCccc
Confidence 4556777777888888 23344479889999665 489999999877653
No 56
>KOG3516|consensus
Probab=45.13 E-value=18 Score=33.78 Aligned_cols=39 Identities=21% Similarity=0.391 Sum_probs=32.5
Q ss_pred Cccccccchhh-CCCCCEEeeCCCCeeeeCC-CCCccCCCCCCc
Q psy2457 24 DYLGIRLSDSQ-CGVNSECNVRNHIPVCSCP-PGYTGDPLTQCR 65 (189)
Q Consensus 24 ~c~~~~~c~~~-C~~~~~C~~~~~~~~C~c~-~g~~~~~~~~c~ 65 (189)
.|.-.+.|.++ |.+++.|..+...|+|-|. +||.|. .|.
T Consensus 541 ~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga---tCH 581 (1306)
T KOG3516|consen 541 MCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA---TCH 581 (1306)
T ss_pred ccccccccCCccccCCCcccccccceeEeccccccccc---ccc
Confidence 45567778877 9999999998889999998 999887 664
No 57
>smart00051 DSL delta serrate ligand.
Probab=44.02 E-value=52 Score=19.32 Aligned_cols=44 Identities=25% Similarity=0.663 Sum_probs=25.6
Q ss_pred eeeCCCCCccCCCCCCcccCCCCCCCC-CCCCCCCeeecCCCCCCCCCCeeeCCCCcccCC
Q psy2457 49 VCSCPPGYTGDPLTQCRRFDPHDLCEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA 108 (189)
Q Consensus 49 ~C~c~~g~~~~~~~~c~~~~~~~~c~~-~~c~~~~~C~~~~~~~~~~~~~c~c~~g~~~~~ 108 (189)
.-.|.+.|.|. .|. ..|.. .-......|.. ...++|.+||.|..
T Consensus 18 rv~C~~~~yG~---~C~-----~~C~~~~d~~~~~~Cd~--------~G~~~C~~Gw~G~~ 62 (63)
T smart00051 18 RVTCDENYYGE---GCN-----KFCRPRDDFFGHYTCDE--------NGNKGCLEGWMGPY 62 (63)
T ss_pred EeeCCCCCcCC---ccC-----CEeCcCccccCCccCCc--------CCCEecCCCCcCCC
Confidence 44688888888 673 23321 11233445532 24578999998764
No 58
>KOG3514|consensus
Probab=24.74 E-value=55 Score=30.66 Aligned_cols=34 Identities=26% Similarity=0.727 Sum_probs=27.7
Q ss_pred CCccc-cCCCCCEeeeCCCCeeeeCC-CCCccCCcc
Q psy2457 136 KACTS-QCGINAQCTARNHVATCSCP-AGYQGDALS 169 (189)
Q Consensus 136 ~~c~~-~C~~~~~C~~~~g~~~C~C~-~G~~g~~~~ 169 (189)
..|.. +|.++|+|...+..|.|.|. .||.|..|.
T Consensus 624 ~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 624 KICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE 659 (1591)
T ss_pred cccCCCcccCCCCccccccccccccccCcccCcccc
Confidence 35777 89999999999999999994 567777763
Done!