Query psy2060
Match_columns 193
No_of_seqs 194 out of 2072
Neff 8.8
Searched_HMMs 46136
Date Fri Aug 16 23:12:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2060.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2060hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 100.0 5.9E-28 1.3E-32 218.3 14.5 160 20-189 1217-1389(2531)
2 KOG4289|consensus 99.5 1.5E-12 3.2E-17 119.9 14.5 97 73-179 1502-1604(2531)
3 KOG1219|consensus 99.4 1.3E-13 2.8E-18 130.5 5.6 88 16-113 3877-3978(4289)
4 KOG1219|consensus 99.2 2.4E-10 5.1E-15 109.2 11.2 122 51-190 3631-3756(4289)
5 PF00054 Laminin_G_1: Laminin 98.9 3.8E-09 8.2E-14 76.3 7.7 50 141-193 1-50 (131)
6 KOG3516|consensus 98.9 2.2E-08 4.7E-13 92.0 13.9 108 74-191 957-1088(1306)
7 KOG3514|consensus 98.9 1.6E-08 3.6E-13 91.8 10.7 109 64-181 616-729 (1591)
8 KOG3514|consensus 98.9 1.9E-08 4.1E-13 91.5 10.9 104 74-188 211-323 (1591)
9 cd00110 LamG Laminin G domain; 98.8 4.6E-08 1E-12 71.5 10.3 73 116-191 2-75 (151)
10 KOG3516|consensus 98.8 1.7E-07 3.8E-12 86.3 13.3 128 48-188 726-860 (1306)
11 smart00282 LamG Laminin G doma 98.4 2E-06 4.4E-11 61.9 8.5 53 135-190 3-55 (135)
12 KOG1225|consensus 98.2 1.6E-06 3.5E-11 75.3 4.9 70 26-113 266-342 (525)
13 PF00008 EGF: EGF-like domain 98.1 1.8E-06 3.9E-11 46.8 1.4 28 75-103 1-29 (32)
14 PF02210 Laminin_G_2: Laminin 98.0 3.5E-05 7.6E-10 54.3 8.0 48 141-191 1-49 (128)
15 PF07645 EGF_CA: Calcium-bindi 97.6 3.5E-05 7.5E-10 44.4 2.1 28 74-103 4-34 (42)
16 KOG1214|consensus 97.5 8.2E-05 1.8E-09 67.2 4.1 55 50-111 806-863 (1289)
17 PF12661 hEGF: Human growth fa 97.4 4.5E-05 9.7E-10 32.6 0.3 13 54-66 1-13 (13)
18 KOG1217|consensus 97.4 0.00037 8E-09 59.7 5.8 57 47-111 246-306 (487)
19 smart00179 EGF_CA Calcium-bind 97.3 0.00037 8E-09 38.8 3.4 33 74-111 4-38 (39)
20 KOG1225|consensus 97.3 0.00041 8.8E-09 60.7 5.1 72 26-113 235-311 (525)
21 KOG4260|consensus 97.3 0.0003 6.4E-09 56.0 3.7 75 26-104 130-269 (350)
22 cd00054 EGF_CA Calcium-binding 97.2 0.00047 1E-08 37.9 3.1 33 74-112 4-38 (38)
23 KOG1217|consensus 97.1 0.00097 2.1E-08 57.1 5.9 83 21-113 106-207 (487)
24 cd00053 EGF Epidermal growth f 97.0 0.0012 2.5E-08 35.7 3.3 25 77-103 6-30 (36)
25 smart00181 EGF Epidermal growt 96.8 0.0018 4E-08 35.1 3.2 29 77-112 6-35 (35)
26 KOG1214|consensus 96.7 0.0026 5.6E-08 57.9 5.1 77 18-105 686-768 (1289)
27 PF12947 EGF_3: EGF domain; I 96.3 0.0019 4.2E-08 35.7 1.1 31 77-111 6-36 (36)
28 PF00008 EGF: EGF-like domain 96.2 0.00094 2E-08 36.0 -0.6 23 43-65 7-32 (32)
29 PF14670 FXa_inhibition: Coagu 95.8 0.0054 1.2E-07 33.9 1.5 26 84-111 11-36 (36)
30 smart00179 EGF_CA Calcium-bind 95.8 0.0082 1.8E-07 33.1 2.1 21 47-67 18-39 (39)
31 KOG1836|consensus 95.7 0.016 3.5E-07 57.3 5.1 84 27-116 697-816 (1705)
32 smart00051 DSL delta serrate l 95.6 0.014 3.1E-07 36.5 3.0 37 28-66 21-63 (63)
33 PF12662 cEGF: Complement Clr- 95.6 0.009 2E-07 29.9 1.6 16 52-67 1-20 (24)
34 cd00054 EGF_CA Calcium-binding 95.1 0.018 3.8E-07 31.3 2.1 21 47-67 18-38 (38)
35 PF12662 cEGF: Complement Clr- 95.1 0.015 3.2E-07 29.1 1.5 21 92-113 1-21 (24)
36 smart00181 EGF Epidermal growt 95.1 0.019 4.2E-07 30.9 2.1 20 48-67 15-35 (35)
37 KOG1226|consensus 95.0 0.037 7.9E-07 50.2 4.6 51 53-113 566-621 (783)
38 PF14670 FXa_inhibition: Coagu 94.9 0.0096 2.1E-07 32.9 0.5 16 47-62 13-28 (36)
39 cd00053 EGF Epidermal growth f 94.1 0.044 9.5E-07 29.1 2.1 20 47-66 15-35 (36)
40 KOG0994|consensus 94.1 0.093 2E-06 49.8 5.2 83 28-118 846-954 (1758)
41 PHA03099 epidermal growth fact 94.0 0.043 9.2E-07 39.0 2.3 39 25-69 42-83 (139)
42 PF07645 EGF_CA: Calcium-bindi 93.8 0.015 3.3E-07 33.2 -0.2 16 47-62 19-34 (42)
43 PHA02887 EGF-like protein; Pro 93.5 0.053 1.2E-06 37.8 2.0 23 47-69 102-124 (126)
44 PF07974 EGF_2: EGF-like domai 93.4 0.085 1.8E-06 28.3 2.3 27 77-111 6-32 (32)
45 PHA03099 epidermal growth fact 92.9 0.083 1.8E-06 37.5 2.3 33 77-115 51-84 (139)
46 smart00051 DSL delta serrate l 92.9 0.13 2.8E-06 32.2 2.9 46 52-111 16-63 (63)
47 PHA02887 EGF-like protein; Pro 92.9 0.093 2E-06 36.7 2.4 32 77-114 92-124 (126)
48 KOG1226|consensus 92.7 0.18 3.9E-06 45.9 4.6 67 27-103 480-575 (783)
49 PF07974 EGF_2: EGF-like domai 91.6 0.13 2.8E-06 27.6 1.5 15 52-66 18-32 (32)
50 KOG4260|consensus 91.4 0.22 4.7E-06 40.1 3.3 46 57-113 132-183 (350)
51 KOG3509|consensus 90.6 0.41 9E-06 45.1 4.7 121 57-189 161-307 (964)
52 PF01414 DSL: Delta serrate li 88.6 0.15 3.4E-06 31.8 0.3 37 28-66 21-63 (63)
53 KOG1836|consensus 87.2 1.2 2.5E-05 44.9 5.4 50 22-71 754-816 (1705)
54 PF12946 EGF_MSP1_1: MSP1 EGF 87.0 0.34 7.3E-06 26.8 1.0 34 75-111 2-36 (37)
55 PF06247 Plasmod_Pvs28: Plasmo 84.0 0.38 8.3E-06 36.6 0.4 52 49-104 16-80 (197)
56 cd01475 vWA_Matrilin VWA_Matri 83.6 1.4 3.1E-05 34.3 3.5 35 63-104 181-218 (224)
57 PF06247 Plasmod_Pvs28: Plasmo 83.1 0.56 1.2E-05 35.7 0.9 76 22-103 17-119 (197)
58 KOG0994|consensus 83.1 0.81 1.8E-05 43.8 2.1 20 23-44 932-952 (1758)
59 PF00954 S_locus_glycop: S-loc 77.1 2.5 5.3E-05 29.2 2.6 28 73-103 78-107 (110)
60 PF00053 Laminin_EGF: Laminin 74.3 1.6 3.4E-05 25.4 0.9 18 51-68 16-33 (49)
61 cd00055 EGF_Lam Laminin-type e 73.4 2.5 5.5E-05 24.8 1.6 16 52-67 18-33 (50)
62 KOG1218|consensus 73.1 8.1 0.00018 31.4 5.1 38 50-89 159-198 (316)
63 smart00180 EGF_Lam Laminin-typ 73.0 2.7 5.9E-05 24.2 1.7 16 52-67 17-32 (46)
64 PF01683 EB: EB module; Inter 73.0 5.7 0.00012 23.3 3.1 21 77-103 26-46 (52)
65 KOG3509|consensus 69.6 26 0.00056 33.6 7.9 39 139-179 572-612 (964)
66 cd01475 vWA_Matrilin VWA_Matri 63.6 6.1 0.00013 30.8 2.4 17 47-63 202-218 (224)
67 KOG0196|consensus 63.0 11 0.00023 35.4 4.0 13 91-104 306-318 (996)
68 PF04863 EGF_alliinase: Alliin 62.2 2.6 5.6E-05 25.4 0.0 23 49-71 32-54 (56)
69 PF09064 Tme5_EGF_like: Thromb 46.7 13 0.00029 20.0 1.2 12 51-62 16-27 (34)
70 PF12955 DUF3844: Domain of un 38.3 15 0.00032 25.3 0.7 26 74-100 7-39 (103)
71 PF02973 Sialidase: Sialidase, 30.9 2.5E+02 0.0054 21.6 9.1 50 132-181 32-83 (190)
72 PF08300 HCV_NS5a_1a: Hepatiti 28.3 16 0.00036 22.5 -0.3 13 24-36 2-14 (62)
73 TIGR01492 CPW_WPC Plasmodium f 27.5 58 0.0013 20.0 2.1 16 52-68 20-35 (62)
74 PF12599 DUF3768: Protein of u 26.0 97 0.0021 20.5 3.0 34 157-192 34-67 (84)
75 smart00274 FOLN Follistatin-N- 25.3 86 0.0019 15.7 2.1 21 75-95 2-23 (26)
76 smart00564 PQQ beta-propeller 24.6 1.1E+02 0.0023 15.3 3.8 19 158-178 14-32 (33)
77 cd01328 FSL_SPARC Follistatin- 23.9 99 0.0021 20.5 2.8 25 74-99 1-26 (86)
78 PHA03399 pif3 per os infectivi 23.8 57 0.0012 25.3 1.8 10 53-62 159-168 (200)
79 COG4939 Major membrane immunog 23.4 1.2E+02 0.0025 21.8 3.2 34 157-191 44-77 (147)
80 PF07546 EMI: EMI domain; Int 22.3 83 0.0018 19.8 2.1 6 30-35 66-71 (72)
81 PHA02151 hypothetical protein 21.9 1.5E+02 0.0032 22.2 3.6 31 145-177 140-172 (217)
No 1
>KOG4289|consensus
Probab=99.95 E-value=5.9e-28 Score=218.31 Aligned_cols=160 Identities=36% Similarity=0.648 Sum_probs=142.7
Q ss_pred CCCCCcee-cCCCCccCCCCcceeec---------CC--ccCCCCceeeCCCCCCCCcccccCCCCCCC-CCCCCCcEEe
Q psy2060 20 GPFQPYLS-GLPDNEGLEKRVQTFVD---------GG--HRTLDLLHDMGARYHWGDTCETDITKSDCK-GVCSPGATCS 86 (193)
Q Consensus 20 ~~~~~~~c-C~~g~~g~~~~c~~~~~---------gg--~~~~~~~~C~C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~ 86 (193)
.|...+.| |||||+|.. ||+.+| +| ..-.++|.|.|.+||+|.+||.+.....|. ..|.|||+|+
T Consensus 1217 ~pvnglrCrCPpGFTgd~--CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~ 1294 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDY--CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCV 1294 (2531)
T ss_pred cccCceeEeCCCCCCccc--ccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEe
Confidence 45667789 999999998 999888 33 455789999999999999999887557899 9999999999
Q ss_pred eCCCCceecccCCCCCCccccccccceecccCCCcceEEecCCCCCceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEE
Q psy2060 87 LNPSGGYICDDCSLAGSYEHYDRLCRLRARSFPKNSFLTFPALKQRNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALE 166 (193)
Q Consensus 87 ~~~~~~~~C~~C~~g~~~~~~g~~C~~~~~sF~g~syl~~~~~~~~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~ 166 (193)
+...++|.|+ ||.+ ..+ +++|+..+.+|.+.+|+.+.....+.++.++|.|.|...+|+|+|+| ++++||++|+
T Consensus 1295 ~~~nggf~c~-Cp~g-e~e--~prC~v~trSFp~~sfv~frglrqRfh~TlslsfaT~~~nGlL~ynG--nekhDFvale 1368 (2531)
T KOG4289|consen 1295 NLLNGGFCCH-CPYG-EFE--DPRCEVTTRSFPPESFVTFRGLRQRFHFTLSLSFATIERNGLLLYNG--NEKHDFVALE 1368 (2531)
T ss_pred ecCCCceecc-CCCc-ccC--CCceEEEeeccCchheEEEeccccceEEEEEEEEEEeeecceEEecC--CcccceEeee
Confidence 9887999999 9987 333 99999999999999999999999999999999999999999999999 7899999999
Q ss_pred ecCCeeEEEEEEeCCCceEEEEc
Q psy2060 167 LTDSGRSVQFSWSLGSEVARLNR 189 (193)
Q Consensus 167 l~~G~~~l~~~~~~g~g~~~l~s 189 (193)
++++. |+++|+.|...+++..
T Consensus 1369 vVd~q--vqltfS~Ges~t~v~p 1389 (2531)
T KOG4289|consen 1369 VVDEQ--VQLTFSAGESTTTVSP 1389 (2531)
T ss_pred eeeee--EEEEEecccccceecC
Confidence 99999 9999999966555543
No 2
>KOG4289|consensus
Probab=99.46 E-value=1.5e-12 Score=119.87 Aligned_cols=97 Identities=26% Similarity=0.496 Sum_probs=76.9
Q ss_pred CCCC-CCCCCCcEEeeCCCCceecccCCCCCCccccccccceecc---cCCCcceEEecCCCC--CceeEEEEEEEeCCC
Q psy2060 73 SDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLRAR---SFPKNSFLTFPALKQ--RNRLHIKLQFSTLHD 146 (193)
Q Consensus 73 ~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~~~---sF~g~syl~~~~~~~--~~~~~i~~~f~T~~~ 146 (193)
+-|. .+|.|+|+|++.+ ++|.|+ ||.++. |..|+..+. -|.|.|-+.+..... .-.+.++|.|||+..
T Consensus 1502 ~fCdsg~C~n~g~CvnrW-g~~~C~-CP~~fg----gk~c~~~m~~pq~frG~sl~sw~~~~~~vSvPwylsl~FRTr~a 1575 (2531)
T KOG4289|consen 1502 NFCDSGQCSNGGTCVNRW-GGFSCE-CPLGFG----GKGCCQGMAHPQHFRGHSLVSWEGLPSQVSVPWYLSLMFRTRRA 1575 (2531)
T ss_pred cccCCCccCCCCeeeccc-CcEeec-CccccC----CcchhhccCCchhccccceeeecCCCcceecceEEEEEEEeecc
Confidence 4567 8999999999998 999999 999998 899988653 588888888765443 334678999999999
Q ss_pred CeEEEEeecCCCCCceEEEEecCCeeEEEEEEe
Q psy2060 147 SGLLLYNGRYNERHDFIALELTDSGRSVQFSWS 179 (193)
Q Consensus 147 ~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~ 179 (193)
+++||-... .+.+ -+.|+|.+|. |++...
T Consensus 1576 d~vl~~~~~-~~rs-t~~lqld~g~--l~~~v~ 1604 (2531)
T KOG4289|consen 1576 DGVLMQAEF-GGRS-TYNLQLDDGT--LKYNVG 1604 (2531)
T ss_pred ccEEEEEEe-CCCc-eEEEEEcCCE--EEEEec
Confidence 999997642 2333 4889999999 776543
No 3
>KOG1219|consensus
Probab=99.43 E-value=1.3e-13 Score=130.54 Aligned_cols=88 Identities=23% Similarity=0.425 Sum_probs=77.7
Q ss_pred eccCCCCCCcee-cCCCCccCCCCcceeec---------CC--ccCCCCceeeCCCCCCCCccccc-CCCCCCC-CCCCC
Q psy2060 16 DLTNGPFQPYLS-GLPDNEGLEKRVQTFVD---------GG--HRTLDLLHDMGARYHWGDTCETD-ITKSDCK-GVCSP 81 (193)
Q Consensus 16 ~~~~~~~~~~~c-C~~g~~g~~~~c~~~~~---------gg--~~~~~~~~C~C~~g~~G~~C~~~-~~~~~C~-~pC~~ 81 (193)
.+...|.+.|.| |++.|+|++ ||.+++ || .-..++|.|.||.||||.+||.. ++ +|. ++|.+
T Consensus 3877 ~C~~~~~ggy~CkCpsqysG~~--CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~--eCs~n~C~~ 3952 (4289)
T KOG1219|consen 3877 TCISQPKGGYKCKCPSQYSGNH--CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGIS--ECSKNVCGT 3952 (4289)
T ss_pred EecCCCCCceEEeCcccccCcc--cccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeeccccc--ccccccccC
Confidence 345667888999 999999998 998776 66 45568999999999999999988 66 999 99999
Q ss_pred CcEEeeCCCCceecccCCCCCCccccccccce
Q psy2060 82 GATCSLNPSGGYICDDCSLAGSYEHYDRLCRL 113 (193)
Q Consensus 82 gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~ 113 (193)
||.|++.+ ++|.|. |.+++. |+.|..
T Consensus 3953 gg~C~n~~-gsf~Cn-cT~g~~----gr~c~~ 3978 (4289)
T KOG1219|consen 3953 GGQCINIP-GSFHCN-CTPGIL----GRTCCA 3978 (4289)
T ss_pred CceeeccC-CceEec-cChhHh----cccCcc
Confidence 99999999 999999 999998 888865
No 4
>KOG1219|consensus
Probab=99.16 E-value=2.4e-10 Score=109.23 Aligned_cols=122 Identities=20% Similarity=0.425 Sum_probs=101.3
Q ss_pred CCceeeCCCCCCCCcccccCCCCCCC-CCCCCCcEEeeCC-CCceecccCCCCCCccccccccce-ecccCCCcceEEec
Q psy2060 51 DLLHDMGARYHWGDTCETDITKSDCK-GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRL-RARSFPKNSFLTFP 127 (193)
Q Consensus 51 ~~~~C~C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~-~~~sF~g~syl~~~ 127 (193)
....|.|+.|| |+..- +.|. .||..+..|++.. ...|+|. ||.|. -..|+- ...++.|.||..++
T Consensus 3631 r~a~ClC~~G~----Cp~~~--~~C~~~pcp~~~~Cvs~~~~~~~~cV-cP~gr-----~g~C~g~~elS~tGnSYveyr 3698 (4289)
T KOG1219|consen 3631 RTAACLCNRGF----CPVET--NQCAKSPCPAGNLCVSSVHNSTYTCV-CPIGR-----FGFCQGDFELSSTGNSYVEYR 3698 (4289)
T ss_pred ccceeeecCCc----CCccc--CccccCCCcccCcccccccccceeEe-ccCcc-----cccCCCcceEeecCceeEEEE
Confidence 45789999998 88774 4999 9999999999875 4679999 99874 345655 35788899999997
Q ss_pred CCCC-CceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcC
Q psy2060 128 ALKQ-RNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRA 190 (193)
Q Consensus 128 ~~~~-~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~ 190 (193)
.... ...+.+.|+++|..++|++||.. +.|++.|.|.+|. +++.|+.|+|+..+.+.
T Consensus 3699 lse~~n~~~kl~frLkT~~sngIiM~tr----~~d~~iLkLv~G~--~~l~~~cgsG~Givg~q 3756 (4289)
T KOG1219|consen 3699 LSENQNTRMKLGFRLKTLQSNGIIMYTR----KTDLAILKLVGGS--PQLLADCGSGPGIVGSQ 3756 (4289)
T ss_pred cccccccceEEEEEEEecccCcEEEEEc----CCceEEEEecCCc--EEEEEecCCCCCccccc
Confidence 5544 22378999999999999999973 6799999999999 99999999998877765
No 5
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=98.95 E-value=3.8e-09 Score=76.34 Aligned_cols=50 Identities=46% Similarity=0.743 Sum_probs=45.1
Q ss_pred EEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCCCC
Q psy2060 141 FSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRADSI 193 (193)
Q Consensus 141 f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~~i 193 (193)
|||..++|+|||.++ ....||++|+|.+|+ |+|+|++|+++..+.+++.|
T Consensus 1 frT~~~~Gllly~g~-~~~~dfial~L~~G~--l~~~~~~G~~~~~~~~~~~i 50 (131)
T PF00054_consen 1 FRTSEPNGLLLYLGS-KDGKDFIALELRDGR--LEFRYNLGSGPASLRSPQKI 50 (131)
T ss_dssp EEESSSSEEEEEEES-STTSSEEEEEEETTE--EEEEEESSSEEEEEEESSET
T ss_pred CccCCCCceEEECCc-CCCCCEEEEEEECCE--EEEEEeCCCccceecCCCcc
Confidence 899999999999986 445599999999999 99999999999999988754
No 6
>KOG3516|consensus
Probab=98.94 E-value=2.2e-08 Score=92.04 Aligned_cols=108 Identities=25% Similarity=0.432 Sum_probs=84.2
Q ss_pred CCC-CCCCCCcEEeeCCCCceecccCCC-CCCccccccccceec-ccCCCcceEEecCCCC-------------------
Q psy2060 74 DCK-GVCSPGATCSLNPSGGYICDDCSL-AGSYEHYDRLCRLRA-RSFPKNSFLTFPALKQ------------------- 131 (193)
Q Consensus 74 ~C~-~pC~~gg~C~~~~~~~~~C~~C~~-g~~~~~~g~~C~~~~-~sF~g~syl~~~~~~~------------------- 131 (193)
.|. .+|.|||+|+... .+|.|. |.. .|. |+.|.... .-|...+++.|....+
T Consensus 957 hCss~~C~NGG~Cvery-~gytCD-Cs~Tay~----Gp~Cs~eig~~fe~gs~i~y~fq~~~~~a~~~~~~~~~~~~~~~ 1030 (1306)
T KOG3516|consen 957 HCSSYPCLNGGHCVERY-DGYTCD-CSRTAYD----GPFCSKEIGVFFERGSSIRYNFQKPMRSAVFESSRVKQKLEIEI 1030 (1306)
T ss_pred ccccccccCCCEEEEec-Cceeec-cccCcCC----CCccccccceEecCCceEEEeccchHHHhhhhhhhhhhcccccc
Confidence 788 9999999999998 999999 975 344 99998864 4466677777653221
Q ss_pred CceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEec-CCeeEEEEEEeCCC-ceEEEEcCC
Q psy2060 132 RNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELT-DSGRSVQFSWSLGS-EVARLNRAD 191 (193)
Q Consensus 132 ~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~-~G~~~l~~~~~~g~-g~~~l~s~~ 191 (193)
.....|.|.|+|....++|||.+ .-..||+++.|. +|. |+++|.+|. .+.+++.++
T Consensus 1031 ~~~e~i~~sftTt~~ps~LLfvs--sF~~~y~~V~v~~nGs--Lq~ry~lg~~e~~~~~~~~ 1088 (1306)
T KOG3516|consen 1031 NPNEEINFSFTTTRAPSDLLFVS--SFTDDYLAVLVKDNGS--LQTRYMLGFREPFEYQFKD 1088 (1306)
T ss_pred CccceEEEEEEeccCceEEEEee--ccccceEEEEEeCCCc--eEEEEecCCcCceEEeccc
Confidence 11236889999999999999987 456789999888 799 999999998 566665543
No 7
>KOG3514|consensus
Probab=98.87 E-value=1.6e-08 Score=91.84 Aligned_cols=109 Identities=25% Similarity=0.418 Sum_probs=85.8
Q ss_pred CcccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccccee--cccCCCcceEEecCCCC--CceeEEE
Q psy2060 64 DTCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLR--ARSFPKNSFLTFPALKQ--RNRLHIK 138 (193)
Q Consensus 64 ~~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~--~~sF~g~syl~~~~~~~--~~~~~i~ 138 (193)
+.|...-. ..|+ +||+|+|+|...+ +.|.|. |...... |+.|+.. +.+|+|..|+..-.... ....++.
T Consensus 616 psCs~~~~-~~C~~nPC~N~g~C~egw-NrfiCD-Cs~T~~~---G~~CerE~t~ls~nGs~~m~i~L~~~~~tq~E~v~ 689 (1591)
T KOG3514|consen 616 PSCSLSNE-KICESNPCQNGGKCSEGW-NRFICD-CSGTGFE---GRTCEREATALSYNGSMSMKIVLPHTMHTQAEDVS 689 (1591)
T ss_pred cccchhhc-cccCCCcccCCCCccccc-cccccc-cccCccc---CccccceeeeEEEcCeeeEEEEecccceeecceEE
Confidence 34554432 3899 9999999999988 999999 9753332 9999884 67899988876532222 2234799
Q ss_pred EEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCC
Q psy2060 139 LQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLG 181 (193)
Q Consensus 139 ~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g 181 (193)
++|+|..+.||||-.+. .++.|-+.|+|.+|+ +++..++.
T Consensus 690 iRF~t~r~~Gll~~Tta-~~s~D~l~l~L~~g~--vkl~v~ls 729 (1591)
T KOG3514|consen 690 IRFRTQRAYGLLFATTA-RGSADTLRLELDAGQ--VKLFVNLS 729 (1591)
T ss_pred EEEEecccceeEEEecc-CCCCceEEEEEecce--EEEEEecC
Confidence 99999999999998765 578899999999999 99988886
No 8
>KOG3514|consensus
Probab=98.87 E-value=1.9e-08 Score=91.46 Aligned_cols=104 Identities=26% Similarity=0.536 Sum_probs=86.6
Q ss_pred CCC--CCCCCCcEEeeCCCCceecccCCCCCCcccccccccee----cccCCCcceEEecCCCC---CceeEEEEEEEeC
Q psy2060 74 DCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLR----ARSFPKNSFLTFPALKQ---RNRLHIKLQFSTL 144 (193)
Q Consensus 74 ~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~----~~sF~g~syl~~~~~~~---~~~~~i~~~f~T~ 144 (193)
.|. .+|.+++.|.... .+..|. |...+. |+.|++. ...|.|+-|+-|....+ ...-.|+|.|||.
T Consensus 211 ~~~~~~~c~~~g~c~s~d-~gp~c~-c~~~~d----gq~cekeK~~~eaTF~G~ef~~YDls~npI~s~~d~itl~FrT~ 284 (1591)
T KOG3514|consen 211 SSREQFVCLNDGECYSSD-DGPHCD-CQFDHD----GQNCEKEKNDGEATFGGDEFVGYDLSQNPIRSKKDNITLTFRTV 284 (1591)
T ss_pred cccccceeccCCeEecCC-CCCccc-cccccC----ccccccccCcceEEecCceEEEeeccCCcccccccceEEEEEEe
Confidence 555 8999999999887 888899 977444 9999984 56899999998864433 3335799999999
Q ss_pred CCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEE
Q psy2060 145 HDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLN 188 (193)
Q Consensus 145 ~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~ 188 (193)
+.+|+|||.| +..||+.|.|.+|. |.+..++++|..++.
T Consensus 285 q~ngllfytG---~~~dYlnlaL~dGa--V~l~~~l~~g~~e~~ 323 (1591)
T KOG3514|consen 285 QGNGLLFYTG---DEKDYLNLALQDGA--VSLSSKLDGGDAEII 323 (1591)
T ss_pred cCceeEEEcc---CCcceeeEeecCCc--EEEEEecCCccceeE
Confidence 9999999995 57899999999999 999999998877654
No 9
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=98.83 E-value=4.6e-08 Score=71.53 Aligned_cols=73 Identities=38% Similarity=0.643 Sum_probs=62.3
Q ss_pred ccCCCcceEEecCCCC-CceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCC
Q psy2060 116 RSFPKNSFLTFPALKQ-RNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRAD 191 (193)
Q Consensus 116 ~sF~g~syl~~~~~~~-~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~ 191 (193)
..|.+.+|+.++.... ....+|+++|+|..++|+||+.+. ....+|++|+|.+|+ |.+.++.|.....++++.
T Consensus 2 ~~F~g~~~i~~~~~~~~~~~~~i~~~frt~~~~g~l~~~~~-~~~~~~~~l~l~~g~--l~~~~~~g~~~~~~~~~~ 75 (151)
T cd00110 2 VSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGS-QNGGDFLALELEDGR--LVLRYDLGSGSLVLSSKT 75 (151)
T ss_pred eEeCCCceEEecCCCCCcceeEEEEEEEeCCCCeEEEEecC-CCCCCEEEEEEECCE--EEEEEcCCcccEEEEccC
Confidence 4688899999987665 667899999999999999999864 225899999999999 999999997788887764
No 10
>KOG3516|consensus
Probab=98.76 E-value=1.7e-07 Score=86.27 Aligned_cols=128 Identities=22% Similarity=0.191 Sum_probs=95.7
Q ss_pred cCCCCceeeCCCCCCCCcccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccccee-----cccCC-C
Q psy2060 48 RTLDLLHDMGARYHWGDTCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLR-----ARSFP-K 120 (193)
Q Consensus 48 ~~~~~~~C~C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~-----~~sF~-g 120 (193)
..+..+.|.|.++..-=.|. .-|. .-++.+.+|+... ..-++. |...+... +-+|+.. +++|+ +
T Consensus 726 C~d~~~~CNCDa~~~ewt~D-----tg~l~~k~hLPVt~vv~g-dTg~~~-sea~~~lg--PLrC~gDr~~wnsvSF~~~ 796 (1306)
T KOG3516|consen 726 CLDPQLYCNCDADEKEWTTD-----TGCLAYKDHLPVTQVVIG-DTGRSQ-SEAPYVLG--PLRCEGDRNFWNSVSFHTG 796 (1306)
T ss_pred ccCcceeeeccCCCcccccc-----ccccchhhcCCeeEEEEc-cCCCcc-cccceeec--ceEeecccccccceEeecC
Confidence 34567899998775432222 1444 6677888888665 333366 65544432 6678763 56787 4
Q ss_pred cceEEecCCCCCceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEE
Q psy2060 121 NSFLTFPALKQRNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLN 188 (193)
Q Consensus 121 ~syl~~~~~~~~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~ 188 (193)
.+|+.++.....++..|+|.|||..+.|++|.+. +..|||.|+|.... ++.|.|+.|+|+.+++
T Consensus 797 ~syL~fp~f~~~~saDIsf~FrTt~~~gvflen~---g~~dfir~eL~~~~-~vtf~~dvgnGp~~~~ 860 (1306)
T KOG3516|consen 797 ASYLHFPPFHNELSADISFFFRTTASSGVFLENH---GINDFIRLELSSPV-EVTFAFDVGNGPSQLT 860 (1306)
T ss_pred cceeecCcccCcccccEEEEEEecCCceEeeecc---CCCceEEEEEcCCC-ceEEEEEcCCCceeEE
Confidence 7899999999999999999999999999999983 47899999999765 5999999999977664
No 11
>smart00282 LamG Laminin G domain.
Probab=98.41 E-value=2e-06 Score=61.94 Aligned_cols=53 Identities=42% Similarity=0.682 Sum_probs=46.4
Q ss_pred eEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcC
Q psy2060 135 LHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRA 190 (193)
Q Consensus 135 ~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~ 190 (193)
.+|+|.|+|..++|+|||.+. ....+|++|+|.+|+ +.+.++.|+++..++++
T Consensus 3 ~~i~~~frt~~~~g~l~~~~~-~~~~~~l~l~l~~g~--l~~~~~~g~~~~~~~~~ 55 (135)
T smart00282 3 LSISFSFRTTSPNGLLLYAGS-KNGGDYLALELRDGR--LVLRYDLGSGPARLTSD 55 (135)
T ss_pred eEEEEEEEeCCCCEEEEEeCC-CCCCCEEEEEEECCE--EEEEEECCCCCEEEEEC
Confidence 579999999999999999853 246799999999999 99999999888887765
No 12
>KOG1225|consensus
Probab=98.21 E-value=1.6e-06 Score=75.35 Aligned_cols=70 Identities=20% Similarity=0.270 Sum_probs=54.3
Q ss_pred ee-cCCCCccCCCCcce--eec---CC-ccCCCCceeeCCCCCCCCcccccCCCCCCCCCCCCCcEEeeCCCCceecccC
Q psy2060 26 LS-GLPDNEGLEKRVQT--FVD---GG-HRTLDLLHDMGARYHWGDTCETDITKSDCKGVCSPGATCSLNPSGGYICDDC 98 (193)
Q Consensus 26 ~c-C~~g~~g~~~~c~~--~~~---gg-~~~~~~~~C~C~~g~~G~~C~~~~~~~~C~~pC~~gg~C~~~~~~~~~C~~C 98 (193)
.| |+|||+|.. |+. +.. ++ ....+ .|+|++||.|..|+.. .|...|.+.|.|++ . +|. |
T Consensus 266 ~CIC~~Gf~G~d--C~e~~Cp~~cs~~g~~~~g--~CiC~~g~~G~dCs~~----~cpadC~g~G~Ci~-G----~C~-C 331 (525)
T KOG1225|consen 266 RCICPPGFTGDD--CDELVCPVDCSGGGVCVDG--ECICNPGYSGKDCSIR----RCPADCSGHGKCID-G----ECL-C 331 (525)
T ss_pred eEeCCCCCcCCC--CCcccCCcccCCCceecCC--EeecCCCccccccccc----cCCccCCCCCcccC-C----ceE-e
Confidence 48 999999998 876 211 33 44444 9999999999999865 67788999999983 3 588 8
Q ss_pred CCCCCccccccccce
Q psy2060 99 SLAGSYEHYDRLCRL 113 (193)
Q Consensus 99 ~~g~~~~~~g~~C~~ 113 (193)
.+||+ |..|++
T Consensus 332 ~~Gy~----G~~C~~ 342 (525)
T KOG1225|consen 332 DEGYT----GELCIQ 342 (525)
T ss_pred CCCCc----CCcccc
Confidence 89998 666665
No 13
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.06 E-value=1.8e-06 Score=46.79 Aligned_cols=28 Identities=39% Similarity=0.993 Sum_probs=25.1
Q ss_pred CC-CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060 75 CK-GVCSPGATCSLNPSGGYICDDCSLAGS 103 (193)
Q Consensus 75 C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~ 103 (193)
|. +||.|+|+|++...++|.|. |+++|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~-C~~G~~ 29 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCE-CPPGYT 29 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEE-EBTTEE
T ss_pred CCCCcCCCCeEEEeCCCCCEEeE-CCCCCc
Confidence 55 89999999998766999999 999987
No 14
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=98.03 E-value=3.5e-05 Score=54.30 Aligned_cols=48 Identities=38% Similarity=0.605 Sum_probs=38.7
Q ss_pred EEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCC-CceEEEEcCC
Q psy2060 141 FSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLG-SEVARLNRAD 191 (193)
Q Consensus 141 f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g-~g~~~l~s~~ 191 (193)
|||..++|+|||.+... ..+|++|+|.+|+ |++.|++| .....+.++.
T Consensus 1 Frt~~~~g~Ll~~~~~~-~~~~l~l~l~~g~--l~~~~~~g~~~~~~~~~~~ 49 (128)
T PF02210_consen 1 FRTRSPNGLLLYIGSED-NGDFLSLELVDGR--LVVRYNLGGSEIVTTFSNS 49 (128)
T ss_dssp EEESSSSEEEEEEEEST-TSEEEEEEEETTE--EEEEEESSSSEEEEEECSS
T ss_pred CccCCCCEeEEEEcCCC-CCEEEEEEEECCE--EEEEEEccccceeeeccCc
Confidence 89999999999997522 2689999999999 99999999 4455555444
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.64 E-value=3.5e-05 Score=44.36 Aligned_cols=28 Identities=29% Similarity=0.759 Sum_probs=25.8
Q ss_pred CCC---CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060 74 DCK---GVCSPGATCSLNPSGGYICDDCSLAGS 103 (193)
Q Consensus 74 ~C~---~pC~~gg~C~~~~~~~~~C~~C~~g~~ 103 (193)
+|. +.|..++.|++.. ++|.|. |++||.
T Consensus 4 EC~~~~~~C~~~~~C~N~~-Gsy~C~-C~~Gy~ 34 (42)
T PF07645_consen 4 ECAEGPHNCPENGTCVNTE-GSYSCS-CPPGYE 34 (42)
T ss_dssp TTTTTSSSSSTTSEEEEET-TEEEEE-ESTTEE
T ss_pred ccCCCCCcCCCCCEEEcCC-CCEEee-CCCCcE
Confidence 888 6899899999998 999999 999986
No 16
>KOG1214|consensus
Probab=97.55 E-value=8.2e-05 Score=67.15 Aligned_cols=55 Identities=25% Similarity=0.496 Sum_probs=46.1
Q ss_pred CCCceeeCCCCCCCC--cccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 50 LDLLHDMGARYHWGD--TCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 50 ~~~~~C~C~~g~~G~--~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
.+.|.|.|.|||.|+ .|... + +|. +.|...++|.+.+ ++|.|. |.+||.++ |..|
T Consensus 806 gs~y~C~CLPGfsGDG~~c~dv-D--eC~psrChp~A~Cyntp-gsfsC~-C~pGy~GD--Gf~C 863 (1289)
T KOG1214|consen 806 GSTYSCACLPGFSGDGHQCTDV-D--ECSPSRCHPAATCYNTP-GSFSCR-CQPGYYGD--GFQC 863 (1289)
T ss_pred CceEEEeecCCccCCccccccc-c--ccCccccCCCceEecCC-Ccceee-cccCccCC--Ccee
Confidence 457999999999964 56543 3 999 9999999999999 999999 99999865 5555
No 17
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.41 E-value=4.5e-05 Score=32.62 Aligned_cols=13 Identities=15% Similarity=0.067 Sum_probs=8.9
Q ss_pred eeeCCCCCCCCcc
Q psy2060 54 HDMGARYHWGDTC 66 (193)
Q Consensus 54 ~C~C~~g~~G~~C 66 (193)
.|.|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4778888887776
No 18
>KOG1217|consensus
Probab=97.36 E-value=0.00037 Score=59.73 Aligned_cols=57 Identities=32% Similarity=0.605 Sum_probs=45.8
Q ss_pred ccCCCCceeeCCCCCCCCcc--cccCCCCCCC-C-CCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 47 HRTLDLLHDMGARYHWGDTC--ETDITKSDCK-G-VCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~G~~C--~~~~~~~~C~-~-pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
.+..++|.|.|++||++..+ ...+ +.|. . +|.++++|++.. +.|.|. |+++|. |..|
T Consensus 246 ~~~~~~~~C~~~~g~~~~~~~~~~~~--~~C~~~~~c~~~~~C~~~~-~~~~C~-C~~g~~----g~~~ 306 (487)
T KOG1217|consen 246 VNTVGSYTCRCPEGYTGDACVTCVDV--DSCALIASCPNGGTCVNVP-GSYRCT-CPPGFT----GRLC 306 (487)
T ss_pred cccCCceeeeCCCCccccccceeeec--cccCCCCccCCCCeeecCC-Ccceee-CCCCCC----CCCC
Confidence 45566799999999999874 3333 3898 5 499999999887 779999 999998 6665
No 19
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.29 E-value=0.00037 Score=38.79 Aligned_cols=33 Identities=33% Similarity=0.900 Sum_probs=27.4
Q ss_pred CCC--CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 74 DCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 74 ~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
+|. .+|.+++.|++.. ++|.|. |+.+|. . |..|
T Consensus 4 ~C~~~~~C~~~~~C~~~~-g~~~C~-C~~g~~-~--g~~C 38 (39)
T smart00179 4 ECASGNPCQNGGTCVNTV-GSYRCE-CPPGYT-D--GRNC 38 (39)
T ss_pred cCcCCCCcCCCCEeECCC-CCeEeE-CCCCCc-c--CCcC
Confidence 777 5899999999987 999999 999886 1 5555
No 20
>KOG1225|consensus
Probab=97.28 E-value=0.00041 Score=60.68 Aligned_cols=72 Identities=24% Similarity=0.307 Sum_probs=52.7
Q ss_pred ee-cCCCCccCCCCcceeec-CC-cc--CCCCceeeCCCCCCCCcccccCCCCCCCCCCCCCcEEeeCCCCceecccCCC
Q psy2060 26 LS-GLPDNEGLEKRVQTFVD-GG-HR--TLDLLHDMGARYHWGDTCETDITKSDCKGVCSPGATCSLNPSGGYICDDCSL 100 (193)
Q Consensus 26 ~c-C~~g~~g~~~~c~~~~~-gg-~~--~~~~~~C~C~~g~~G~~C~~~~~~~~C~~pC~~gg~C~~~~~~~~~C~~C~~ 100 (193)
.| |+-+|+|.. |..-.- ++ .+ .-..-+|+|++||+|..|... .|...|..++.|++ + .|. |++
T Consensus 235 ic~c~~~~~g~~--c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~----~Cp~~cs~~g~~~~----g-~Ci-C~~ 302 (525)
T KOG1225|consen 235 ICECPEGYFGPL--CSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDEL----VCPVDCSGGGVCVD----G-ECI-CNP 302 (525)
T ss_pred eeecCCceeCCc--cccccCCCCCcccceEeCCeEeCCCCCcCCCCCcc----cCCcccCCCceecC----C-Eee-cCC
Confidence 47 999999997 763221 11 11 112358999999999999875 78844998888873 3 799 999
Q ss_pred CCCccccccccce
Q psy2060 101 AGSYEHYDRLCRL 113 (193)
Q Consensus 101 g~~~~~~g~~C~~ 113 (193)
+|. |+.|+.
T Consensus 303 g~~----G~dCs~ 311 (525)
T KOG1225|consen 303 GYS----GKDCSI 311 (525)
T ss_pred Ccc----cccccc
Confidence 999 888864
No 21
>KOG4260|consensus
Probab=97.26 E-value=0.0003 Score=56.04 Aligned_cols=75 Identities=12% Similarity=0.130 Sum_probs=56.3
Q ss_pred eecCCCCccCCCCcceeec------CC-------ccCCCCceeeCCCCCCCCccccc-----------------------
Q psy2060 26 LSGLPDNEGLEKRVQTFVD------GG-------HRTLDLLHDMGARYHWGDTCETD----------------------- 69 (193)
Q Consensus 26 ~cC~~g~~g~~~~c~~~~~------gg-------~~~~~~~~C~C~~g~~G~~C~~~----------------------- 69 (193)
+|||+|-.|+. |..+.. +| ..-.++-.|.|.+||+|+.|..-
T Consensus 130 vCCp~gtyGpd--Cl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C 207 (350)
T KOG4260|consen 130 VCCPDGTYGPD--CLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC 207 (350)
T ss_pred eccCCCCcCCc--cccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhh
Confidence 47999999998 887743 11 22345678999999999888431
Q ss_pred --------------------------CCCCCCC---CCCCCCcEEeeCCCCceecccCCCCCCc
Q psy2060 70 --------------------------ITKSDCK---GVCSPGATCSLNPSGGYICDDCSLAGSY 104 (193)
Q Consensus 70 --------------------------~~~~~C~---~pC~~gg~C~~~~~~~~~C~~C~~g~~~ 104 (193)
+|.++|. .||...-.|++.. ++|+|. +.+||..
T Consensus 208 ~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNte-GSf~C~-dk~Gy~~ 269 (350)
T KOG4260|consen 208 LGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTE-GSFKCE-DKEGYKK 269 (350)
T ss_pred hcccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCC-CceEec-ccccccC
Confidence 1124665 7899989999997 999999 9888874
No 22
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.19 E-value=0.00047 Score=37.88 Aligned_cols=33 Identities=33% Similarity=0.855 Sum_probs=28.1
Q ss_pred CCC--CCCCCCcEEeeCCCCceecccCCCCCCccccccccc
Q psy2060 74 DCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCR 112 (193)
Q Consensus 74 ~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~ 112 (193)
+|. .+|.+++.|++.. ++|.|. |+.+|. |..|+
T Consensus 4 ~C~~~~~C~~~~~C~~~~-~~~~C~-C~~g~~----g~~C~ 38 (38)
T cd00054 4 ECASGNPCQNGGTCVNTV-GSYRCS-CPPGYT----GRNCE 38 (38)
T ss_pred cCCCCCCcCCCCEeECCC-CCeEeE-CCCCCc----CCcCC
Confidence 676 5899999999887 899999 999988 77774
No 23
>KOG1217|consensus
Probab=97.13 E-value=0.00097 Score=57.13 Aligned_cols=83 Identities=24% Similarity=0.507 Sum_probs=63.9
Q ss_pred CCCCcee-cCCCCccCCCCcce---eec-------CC--cc---CCCCceeeCCCCCCCCcccccCCCCCCC---CCCCC
Q psy2060 21 PFQPYLS-GLPDNEGLEKRVQT---FVD-------GG--HR---TLDLLHDMGARYHWGDTCETDITKSDCK---GVCSP 81 (193)
Q Consensus 21 ~~~~~~c-C~~g~~g~~~~c~~---~~~-------gg--~~---~~~~~~C~C~~g~~G~~C~~~~~~~~C~---~pC~~ 81 (193)
....+.| |++||.|.. |+. +.. .+ .+ ....+.|.|..||.+..|+...+ .|. .+|.+
T Consensus 106 ~~~~~~c~c~~g~~~~~--~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~--~C~~~~~~c~~ 181 (487)
T KOG1217|consen 106 CVGSYECTCPPGYQGTP--CEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLD--ECIQYSSPCQN 181 (487)
T ss_pred CCCCceeeCCCccccCc--CCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCccccccccccc--ccccCCCCcCC
Confidence 4456778 999999986 555 222 12 22 34589999999999999987644 887 57999
Q ss_pred CcEEeeCCCCceecccCCCCCCccccccccce
Q psy2060 82 GATCSLNPSGGYICDDCSLAGSYEHYDRLCRL 113 (193)
Q Consensus 82 gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~ 113 (193)
++.|.+.. ++|.|. |+.++. +..|+.
T Consensus 182 ~~~C~~~~-~~~~C~-c~~~~~----~~~~~~ 207 (487)
T KOG1217|consen 182 GGTCVNTG-GSYLCS-CPPGYT----GSTCET 207 (487)
T ss_pred CcccccCC-CCeeEe-CCCCcc----CCcCcC
Confidence 99999987 789999 999998 666654
No 24
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.96 E-value=0.0012 Score=35.68 Aligned_cols=25 Identities=40% Similarity=0.869 Sum_probs=22.7
Q ss_pred CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060 77 GVCSPGATCSLNPSGGYICDDCSLAGS 103 (193)
Q Consensus 77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~ 103 (193)
.+|.+++.|++.. ++|.|. |+.+|.
T Consensus 6 ~~C~~~~~C~~~~-~~~~C~-C~~g~~ 30 (36)
T cd00053 6 NPCSNGGTCVNTP-GSYRCV-CPPGYT 30 (36)
T ss_pred CCCCCCCEEecCC-CCeEeE-CCCCCc
Confidence 6788899999988 899999 999998
No 25
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.79 E-value=0.0018 Score=35.14 Aligned_cols=29 Identities=41% Similarity=1.008 Sum_probs=24.2
Q ss_pred CCCCCCcEEeeCCCCceecccCCCCCCccccc-cccc
Q psy2060 77 GVCSPGATCSLNPSGGYICDDCSLAGSYEHYD-RLCR 112 (193)
Q Consensus 77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g-~~C~ 112 (193)
.+|.++ +|++.. ++|.|. |+.+|. | ..|+
T Consensus 6 ~~C~~~-~C~~~~-~~~~C~-C~~g~~----g~~~C~ 35 (35)
T smart00181 6 GPCSNG-TCINTP-GSYTCS-CPPGYT----GDKRCE 35 (35)
T ss_pred CCCCCC-EEECCC-CCeEeE-CCCCCc----cCCccC
Confidence 378888 999886 999999 999998 6 6653
No 26
>KOG1214|consensus
Probab=96.72 E-value=0.0026 Score=57.90 Aligned_cols=77 Identities=18% Similarity=0.311 Sum_probs=59.8
Q ss_pred cCCCCCCcee-cCCCCccCCCCcceeecCCccCCCCceeeCCCCCC--CCcccccCCCCCCC---CCCCCCcEEeeCCCC
Q psy2060 18 TNGPFQPYLS-GLPDNEGLEKRVQTFVDGGHRTLDLLHDMGARYHW--GDTCETDITKSDCK---GVCSPGATCSLNPSG 91 (193)
Q Consensus 18 ~~~~~~~~~c-C~~g~~g~~~~c~~~~~gg~~~~~~~~C~C~~g~~--G~~C~~~~~~~~C~---~pC~~gg~C~~~~~~ 91 (193)
.+.|...+.| |..++++..++|.. .+...|.|.|..||. |.+|-.. ++|. ..|.....|++.+ +
T Consensus 686 ~S~~~~~npCy~gsh~cdt~a~C~p------g~~~~~tcecs~g~~gdgr~c~d~---~eca~~~~~CGp~s~Cin~p-g 755 (1289)
T KOG1214|consen 686 DSDPTPVNPCYDGSHMCDTTARCHP------GTGVDYTCECSSGYQGDGRNCVDE---NECATGFHRCGPNSVCINLP-G 755 (1289)
T ss_pred CCCCcccccceecCcccCCCccccC------CCCcceEEEEeeccCCCCCCCCCh---hhhccCCCCCCCCceeecCC-C
Confidence 4455666678 88888888777752 334579999999998 5678643 3887 8899999999999 9
Q ss_pred ceecccCCCCCCcc
Q psy2060 92 GYICDDCSLAGSYE 105 (193)
Q Consensus 92 ~~~C~~C~~g~~~~ 105 (193)
+|+|+ |..+|...
T Consensus 756 ~~rce-C~~gy~F~ 768 (1289)
T KOG1214|consen 756 SYRCE-CRSGYEFA 768 (1289)
T ss_pred ceeEE-Eeecceec
Confidence 99999 98776544
No 27
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.28 E-value=0.0019 Score=35.74 Aligned_cols=31 Identities=35% Similarity=0.830 Sum_probs=22.9
Q ss_pred CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 77 GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
..|...++|++.. ++|.|. |++||.++ |..|
T Consensus 6 ~~C~~nA~C~~~~-~~~~C~-C~~Gy~Gd--G~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTG-GSYTCT-CKPGYEGD--GFFC 36 (36)
T ss_dssp GGS-TTCEEEE-T-TSEEEE-E-CEEECC--STCE
T ss_pred CCCCCCcEeecCC-CCEEeE-CCCCCccC--CcCC
Confidence 5688889999998 899999 99998854 5544
No 28
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.17 E-value=0.00094 Score=35.97 Aligned_cols=23 Identities=13% Similarity=-0.104 Sum_probs=16.7
Q ss_pred ecCC--ccCC-CCceeeCCCCCCCCc
Q psy2060 43 VDGG--HRTL-DLLHDMGARYHWGDT 65 (193)
Q Consensus 43 ~~gg--~~~~-~~~~C~C~~g~~G~~ 65 (193)
.|+| .+.. ++|.|.|++||+|.+
T Consensus 7 ~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 7 QNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCeEEEeCCCCCEEeECCCCCccCC
Confidence 3455 3444 889999999999864
No 29
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.83 E-value=0.0054 Score=33.95 Aligned_cols=26 Identities=31% Similarity=0.746 Sum_probs=19.7
Q ss_pred EEeeCCCCceecccCCCCCCcccccccc
Q psy2060 84 TCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 84 ~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
.|++.+ ++|+|. ||.||....+++.|
T Consensus 11 ~C~~~~-g~~~C~-C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 11 ICVNTP-GSYRCS-CPPGYKLAEDGRTC 36 (36)
T ss_dssp EEEEET-TSEEEE--STTEEE-TTSSSE
T ss_pred CCccCC-CceEeE-CCCCCEECcCCCCC
Confidence 789998 999999 99999876555544
No 30
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.77 E-value=0.0082 Score=33.13 Aligned_cols=21 Identities=19% Similarity=0.091 Sum_probs=18.3
Q ss_pred ccCCCCceeeCCCCCC-CCccc
Q psy2060 47 HRTLDLLHDMGARYHW-GDTCE 67 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~-G~~C~ 67 (193)
.+..++|.|.|++||. |..|+
T Consensus 18 ~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 18 VNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred ECCCCCeEeECCCCCccCCcCC
Confidence 5778899999999999 88874
No 31
>KOG1836|consensus
Probab=95.71 E-value=0.016 Score=57.33 Aligned_cols=84 Identities=17% Similarity=0.309 Sum_probs=58.8
Q ss_pred e-cCCCCccCCCCcceeec--------------------CC---ccCCCCceeeCCCCCCCCcccccCC----------C
Q psy2060 27 S-GLPDNEGLEKRVQTFVD--------------------GG---HRTLDLLHDMGARYHWGDTCETDIT----------K 72 (193)
Q Consensus 27 c-C~~g~~g~~~~c~~~~~--------------------gg---~~~~~~~~C~C~~g~~G~~C~~~~~----------~ 72 (193)
| |++||+|.+ |+.+-+ .| .-...+-.|.|.+.--|+.|+...+ .
T Consensus 697 c~C~~g~tG~~--Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~ 774 (1705)
T KOG1836|consen 697 CTCPVGYTGQF--CESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTS 774 (1705)
T ss_pred ccCCCCcccch--hhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCC
Confidence 8 999999998 986654 11 1112345566666666666665322 1
Q ss_pred CCCC-CCCCCCcEEeeCC-CCceecccCCCCCCccccccccceecc
Q psy2060 73 SDCK-GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLRAR 116 (193)
Q Consensus 73 ~~C~-~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~~~ 116 (193)
..|. -+|.+++.|.... .....|..||++|+ |.+|+.-..
T Consensus 775 ~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gyt----G~rCe~c~d 816 (1705)
T KOG1836|consen 775 GDCQPCPCPNGGACGQTPEILEVVCKNCPPGYT----GLRCEECAD 816 (1705)
T ss_pred CCCccCCCCCChhhcCcCcccceecCCCCCCCc----ccccccCCC
Confidence 2388 8999999998765 47778989999999 999988543
No 32
>smart00051 DSL delta serrate ligand.
Probab=95.63 E-value=0.014 Score=36.52 Aligned_cols=37 Identities=11% Similarity=-0.013 Sum_probs=23.7
Q ss_pred cCCCCccCCCCcceeecCCccCCCC------ceeeCCCCCCCCcc
Q psy2060 28 GLPDNEGLEKRVQTFVDGGHRTLDL------LHDMGARYHWGDTC 66 (193)
Q Consensus 28 C~~g~~g~~~~c~~~~~gg~~~~~~------~~C~C~~g~~G~~C 66 (193)
|.++|.|.. |+..-....+..+. -.|.|++||+|+.|
T Consensus 21 C~~~~yG~~--C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 21 CDENYYGEG--CNKFCRPRDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred CCCCCcCCc--cCCEeCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence 899999997 76432211112223 34669999999887
No 33
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.57 E-value=0.009 Score=29.90 Aligned_cols=16 Identities=19% Similarity=0.094 Sum_probs=12.4
Q ss_pred CceeeCCCCCC----CCccc
Q psy2060 52 LLHDMGARYHW----GDTCE 67 (193)
Q Consensus 52 ~~~C~C~~g~~----G~~C~ 67 (193)
+|+|.|++||. |..|+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE 20 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc
Confidence 58999999997 45554
No 34
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.13 E-value=0.018 Score=31.30 Aligned_cols=21 Identities=19% Similarity=0.101 Sum_probs=17.9
Q ss_pred ccCCCCceeeCCCCCCCCccc
Q psy2060 47 HRTLDLLHDMGARYHWGDTCE 67 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~G~~C~ 67 (193)
.+..++|.|.|++||.|..|+
T Consensus 18 ~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 18 VNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ECCCCCeEeECCCCCcCCcCC
Confidence 567788999999999998774
No 35
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.11 E-value=0.015 Score=29.11 Aligned_cols=21 Identities=24% Similarity=0.521 Sum_probs=17.4
Q ss_pred ceecccCCCCCCccccccccce
Q psy2060 92 GYICDDCSLAGSYEHYDRLCRL 113 (193)
Q Consensus 92 ~~~C~~C~~g~~~~~~g~~C~~ 113 (193)
+|+|. |++||.....|..|+.
T Consensus 1 sy~C~-C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCS-CPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEee-CCCCCcCCCCCCcccc
Confidence 58999 9999987667888864
No 36
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.07 E-value=0.019 Score=30.95 Aligned_cols=20 Identities=20% Similarity=0.057 Sum_probs=16.3
Q ss_pred cCCCCceeeCCCCCCC-Cccc
Q psy2060 48 RTLDLLHDMGARYHWG-DTCE 67 (193)
Q Consensus 48 ~~~~~~~C~C~~g~~G-~~C~ 67 (193)
+..++|.|.|++||.| ..|+
T Consensus 15 ~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 15 NTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCeEeECCCCCccCCccC
Confidence 5578899999999998 6663
No 37
>KOG1226|consensus
Probab=94.98 E-value=0.037 Score=50.18 Aligned_cols=51 Identities=22% Similarity=0.496 Sum_probs=36.8
Q ss_pred ceeeCCCCCCCCcccccCCCCCCC----CCCCCCcEEeeCCCCceecccCCCC-CCccccccccce
Q psy2060 53 LHDMGARYHWGDTCETDITKSDCK----GVCSPGATCSLNPSGGYICDDCSLA-GSYEHYDRLCRL 113 (193)
Q Consensus 53 ~~C~C~~g~~G~~C~~~~~~~~C~----~pC~~gg~C~~~~~~~~~C~~C~~g-~~~~~~g~~C~~ 113 (193)
-+|+|.+||+|..|+.+.+.+.|. ..|...|+|+=. +|. |..+ |. |+.||.
T Consensus 566 G~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg-----~C~-C~~~~~s----G~~CE~ 621 (783)
T KOG1226|consen 566 GRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG-----RCK-CTDPPYS----GEFCEK 621 (783)
T ss_pred CcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC-----ceE-cCCCCcC----cchhhc
Confidence 478999999999998877667887 578887877632 355 5444 44 666665
No 38
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.90 E-value=0.0096 Score=32.93 Aligned_cols=16 Identities=6% Similarity=-0.408 Sum_probs=13.1
Q ss_pred ccCCCCceeeCCCCCC
Q psy2060 47 HRTLDLLHDMGARYHW 62 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~ 62 (193)
.+++++|+|.|++||.
T Consensus 13 ~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 13 VNTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEETTSEEEE-STTEE
T ss_pred ccCCCceEeECCCCCE
Confidence 5778899999999996
No 39
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.14 E-value=0.044 Score=29.14 Aligned_cols=20 Identities=20% Similarity=0.127 Sum_probs=16.5
Q ss_pred ccCCCCceeeCCCCCCCC-cc
Q psy2060 47 HRTLDLLHDMGARYHWGD-TC 66 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~G~-~C 66 (193)
.+..++|.|.|++||.|. .|
T Consensus 15 ~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 15 VNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred ecCCCCeEeECCCCCcccCCc
Confidence 466678999999999988 55
No 40
>KOG0994|consensus
Probab=94.10 E-value=0.093 Score=49.77 Aligned_cols=83 Identities=18% Similarity=0.297 Sum_probs=50.9
Q ss_pred cCCCCccCCCCcceeecCC---------------ccCCCCcee-eCCCCCCCCcccccCCCCCCC-CCCCCC--------
Q psy2060 28 GLPDNEGLEKRVQTFVDGG---------------HRTLDLLHD-MGARYHWGDTCETDITKSDCK-GVCSPG-------- 82 (193)
Q Consensus 28 C~~g~~g~~~~c~~~~~gg---------------~~~~~~~~C-~C~~g~~G~~C~~~~~~~~C~-~pC~~g-------- 82 (193)
|.|||+|-. .|.+++-.| .+...++.| .|..||.|+.=--.- ..|. -||..|
T Consensus 846 CqpG~WgFP-eCr~CqCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g--~~CrPCpCP~gp~Sg~~~A 922 (1758)
T KOG0994|consen 846 CQPGYWGFP-ECRPCQCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSG--IGCRPCPCPDGPASGRQHA 922 (1758)
T ss_pred cCCCccCCC-cCccccccCcccccCccccccccccccccccchhhhhccccCCcccCCC--CCCCCCCCCCCCccchhcc
Confidence 888887764 466665322 455567888 488998865322111 2454 455543
Q ss_pred cEEeeCC-CCceecccCCCCCCccccccccceecccC
Q psy2060 83 ATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLRARSF 118 (193)
Q Consensus 83 g~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~~~sF 118 (193)
..|..+. .....|. |.+||. |.+|+.-...|
T Consensus 923 ~sC~~d~~t~~ivC~-C~~GY~----G~RCe~CA~~~ 954 (1758)
T KOG0994|consen 923 DSCYLDTRTQQIVCH-CQEGYS----GSRCEICADNH 954 (1758)
T ss_pred ccccccccccceeee-cccCcc----ccchhhhcccc
Confidence 2455433 2556699 999999 89998754443
No 41
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=94.03 E-value=0.043 Score=38.96 Aligned_cols=39 Identities=18% Similarity=0.190 Sum_probs=28.5
Q ss_pred ceecCCCCccCCCCcceeecCC---ccCCCCceeeCCCCCCCCccccc
Q psy2060 25 YLSGLPDNEGLEKRVQTFVDGG---HRTLDLLHDMGARYHWGDTCETD 69 (193)
Q Consensus 25 ~~cC~~g~~g~~~~c~~~~~gg---~~~~~~~~C~C~~g~~G~~C~~~ 69 (193)
+.-|++.|.+- |++|. ......+.|.|+.||+|..||+.
T Consensus 42 i~~Cp~ey~~Y------ClHG~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 42 IRLCGPEGDGY------CLHGDCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred cccCChhhCCE------eECCEEEeeccCCCceeECCCCcccccccce
Confidence 33488877654 45554 34456899999999999999865
No 42
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=93.81 E-value=0.015 Score=33.17 Aligned_cols=16 Identities=6% Similarity=-0.452 Sum_probs=15.1
Q ss_pred ccCCCCceeeCCCCCC
Q psy2060 47 HRTLDLLHDMGARYHW 62 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~ 62 (193)
.|+.|+|.|.|++||.
T Consensus 19 ~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 19 VNTEGSYSCSCPPGYE 34 (42)
T ss_dssp EEETTEEEEEESTTEE
T ss_pred EcCCCCEEeeCCCCcE
Confidence 7999999999999997
No 43
>PHA02887 EGF-like protein; Provisional
Probab=93.52 E-value=0.053 Score=37.85 Aligned_cols=23 Identities=17% Similarity=0.044 Sum_probs=18.9
Q ss_pred ccCCCCceeeCCCCCCCCccccc
Q psy2060 47 HRTLDLLHDMGARYHWGDTCETD 69 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~G~~C~~~ 69 (193)
........|.|++||+|..|+..
T Consensus 102 I~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 102 IIDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred cccCCCceeECCCCcccCCCCcc
Confidence 34456799999999999999853
No 44
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.44 E-value=0.085 Score=28.29 Aligned_cols=27 Identities=26% Similarity=0.488 Sum_probs=21.4
Q ss_pred CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 77 GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
..|.+.|+|+.. ..+|. |..+|. |+.|
T Consensus 6 ~~C~~~G~C~~~---~g~C~-C~~g~~----G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP---CGRCV-CDSGYT----GPDC 32 (32)
T ss_pred CccCCCCEEeCC---CCEEE-CCCCCc----CCCC
Confidence 368899999843 35899 999998 7766
No 45
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=92.94 E-value=0.083 Score=37.52 Aligned_cols=33 Identities=27% Similarity=0.573 Sum_probs=27.7
Q ss_pred CCCCCCcEEeeCC-CCceecccCCCCCCccccccccceec
Q psy2060 77 GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLRA 115 (193)
Q Consensus 77 ~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~~ 115 (193)
+.|.|| +|.-.+ ...+.|. |+.||. |.+||...
T Consensus 51 ~YClHG-~C~yI~dl~~~~Cr-C~~GYt----GeRCEh~d 84 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCR-CSHGYT----GIRCQHVV 84 (139)
T ss_pred CEeECC-EEEeeccCCCceeE-CCCCcc----ccccccee
Confidence 789997 897654 3788999 999999 99999864
No 46
>smart00051 DSL delta serrate ligand.
Probab=92.90 E-value=0.13 Score=32.17 Aligned_cols=46 Identities=20% Similarity=0.325 Sum_probs=32.2
Q ss_pred CceeeCCCCCCCCcccccCCCCCCC--CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 52 LLHDMGARYHWGDTCETDITKSDCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 52 ~~~C~C~~g~~G~~C~~~~~~~~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
.+.-.|+++|.|..|+. .|. +-+.++.+|... + .|. |++|+. |..|
T Consensus 16 ~~rv~C~~~~yG~~C~~-----~C~~~~d~~~~~~Cd~~--G--~~~-C~~Gw~----G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNK-----FCRPRDDFFGHYTCDEN--G--NKG-CLEGWM----GPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCC-----EeCcCccccCCccCCcC--C--CEe-cCCCCc----CCCC
Confidence 34557999999999963 565 446677778432 2 356 879998 7766
No 47
>PHA02887 EGF-like protein; Provisional
Probab=92.88 E-value=0.093 Score=36.66 Aligned_cols=32 Identities=16% Similarity=0.263 Sum_probs=25.9
Q ss_pred CCCCCCcEEeeCC-CCceecccCCCCCCcccccccccee
Q psy2060 77 GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLR 114 (193)
Q Consensus 77 ~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~ 114 (193)
+.|.+ |+|.-.. .....|. |+.||. |.+|+..
T Consensus 92 ~YCiH-G~C~yI~dL~epsCr-C~~GYt----G~RCE~v 124 (126)
T PHA02887 92 DFCIN-GECMNIIDLDEKFCI-CNKGYT----GIRCDEV 124 (126)
T ss_pred CEeeC-CEEEccccCCCceeE-CCCCcc----cCCCCcc
Confidence 78886 5998654 3778899 999999 9999864
No 48
>KOG1226|consensus
Probab=92.69 E-value=0.18 Score=45.88 Aligned_cols=67 Identities=16% Similarity=0.232 Sum_probs=42.6
Q ss_pred e-cCCCCccCCCCcceeecC------------CccC---CCC-----ceeeCCCCCC----CCcccccCCCCCCC----C
Q psy2060 27 S-GLPDNEGLEKRVQTFVDG------------GHRT---LDL-----LHDMGARYHW----GDTCETDITKSDCK----G 77 (193)
Q Consensus 27 c-C~~g~~g~~~~c~~~~~g------------g~~~---~~~-----~~C~C~~g~~----G~~C~~~~~~~~C~----~ 77 (193)
| |.+||.|.. ||-..+. ..+. .+. -.|.|.+... |.+||-+.- .|+ .
T Consensus 480 C~C~~G~~G~~--CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnf--sC~r~~g~ 555 (783)
T KOG1226|consen 480 CRCDEGWLGKK--CECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNF--SCERHKGV 555 (783)
T ss_pred eecCCCCCCCc--ccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeeeccCc--ccccccCc
Confidence 6 999999997 8743320 0111 122 3566765555 999997744 777 6
Q ss_pred CCCCCcEEeeCCCCceecccCCCCCC
Q psy2060 78 VCSPGATCSLNPSGGYICDDCSLAGS 103 (193)
Q Consensus 78 pC~~gg~C~~~~~~~~~C~~C~~g~~ 103 (193)
.|.+.|.|.=. +|. |.+|++
T Consensus 556 lC~g~G~C~CG-----~Cv-C~~Gwt 575 (783)
T KOG1226|consen 556 LCGGHGRCECG-----RCV-CNPGWT 575 (783)
T ss_pred ccCCCCeEeCC-----cEE-cCCCCc
Confidence 89988888522 366 666665
No 49
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=91.56 E-value=0.13 Score=27.59 Aligned_cols=15 Identities=13% Similarity=-0.035 Sum_probs=13.3
Q ss_pred CceeeCCCCCCCCcc
Q psy2060 52 LLHDMGARYHWGDTC 66 (193)
Q Consensus 52 ~~~C~C~~g~~G~~C 66 (193)
..+|.|.+||+|+.|
T Consensus 18 ~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 18 CGRCVCDSGYTGPDC 32 (32)
T ss_pred CCEEECCCCCcCCCC
Confidence 579999999999887
No 50
>KOG4260|consensus
Probab=91.45 E-value=0.22 Score=40.10 Aligned_cols=46 Identities=24% Similarity=0.575 Sum_probs=37.1
Q ss_pred CCCCCCCCcccccCCCCCCC----CCCCCCcEEeeCC--CCceecccCCCCCCccccccccce
Q psy2060 57 GARYHWGDTCETDITKSDCK----GVCSPGATCSLNP--SGGYICDDCSLAGSYEHYDRLCRL 113 (193)
Q Consensus 57 C~~g~~G~~C~~~~~~~~C~----~pC~~gg~C~~~~--~~~~~C~~C~~g~~~~~~g~~C~~ 113 (193)
||+|-.|+.|. .|. .||...|.|...- .++-+|. |.+||+ |+.|..
T Consensus 132 Cp~gtyGpdCl------~Cpggser~C~GnG~C~GdGsR~GsGkCk-C~~GY~----Gp~C~~ 183 (350)
T KOG4260|consen 132 CPDGTYGPDCL------QCPGGSERPCFGNGSCHGDGSREGSGKCK-CETGYT----GPLCRY 183 (350)
T ss_pred cCCCCcCCccc------cCCCCCcCCcCCCCcccCCCCCCCCCccc-ccCCCC----Cccccc
Confidence 89999999997 564 7899999998542 3666899 999999 777754
No 51
>KOG3509|consensus
Probab=90.58 E-value=0.41 Score=45.07 Aligned_cols=121 Identities=17% Similarity=0.162 Sum_probs=64.3
Q ss_pred CCCCCCCCcccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCccccccccceec----ccCC------------
Q psy2060 57 GARYHWGDTCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLRA----RSFP------------ 119 (193)
Q Consensus 57 C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~~----~sF~------------ 119 (193)
|+..+.+..++.. .|. .+|..+++|......+|.+. |+.... ...|+... ..|.
T Consensus 161 ~~~~~~~~~~~~~----~~~~~~~q~g~tC~~~~~~~~~~~-~~~~~~----~~~c~~~~~r~~~~f~~~~~g~~~~~~~ 231 (964)
T KOG3509|consen 161 CPTSEGGPGTEPI----HCAQPVCQGGATCEVRNGKGYSLE-CPDCKV----RVVCEACKPRAFCPFEKSVEGCLKCFCF 231 (964)
T ss_pred CcccCCCCccccc----cccCcccccceeEEecCCcceeee-cccccc----ceehhhccCceecccccccccccceeec
Confidence 4444455556544 667 99999999998877888888 876544 44455421 1111
Q ss_pred -------CcceEEecCCCCCceeEEEEEEEeCCC--CeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEc
Q psy2060 120 -------KNSFLTFPALKQRNRLHIKLQFSTLHD--SGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNR 189 (193)
Q Consensus 120 -------g~syl~~~~~~~~~~~~i~~~f~T~~~--~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s 189 (193)
...++.+.......+......|.+... +.++.+... ....+|+++.+..|. +-++|..+.++..+..
T Consensus 232 ~vp~~~e~S~~~~~~~h~~~~~~~~~~~~~~~s~~~~~~~~~~~~-~~~~~f~~lt~~~g~--~g~~~~~~~~~~~~~~ 307 (964)
T KOG3509|consen 232 GVPRPSESSLHAFRAIHGATLHVDSLGVFFSDSEHRDILGNFLFS-SFKDGFRALTLDGGT--DGVRYDCGLPQREDRL 307 (964)
T ss_pred CCCccccchhhhHhhhccchhccchheeecccccccccccccccc-ccccceeeeccCCCC--ccccccccCcchhhhh
Confidence 011111111111111111122333322 233333322 456789999988888 7777777766555443
No 52
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=88.65 E-value=0.15 Score=31.82 Aligned_cols=37 Identities=11% Similarity=0.069 Sum_probs=19.1
Q ss_pred cCCCCccCCCCcceeecCCccCCCCcee------eCCCCCCCCcc
Q psy2060 28 GLPDNEGLEKRVQTFVDGGHRTLDLLHD------MGARYHWGDTC 66 (193)
Q Consensus 28 C~~g~~g~~~~c~~~~~gg~~~~~~~~C------~C~~g~~G~~C 66 (193)
|.+.|.|.. |.+--..-.+..+.|.| +|.+||+|+.|
T Consensus 21 C~~nyyG~~--C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 21 CDENYYGPN--CSKFCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp --TTEETTT--T-EE---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred CCCCCCCcc--ccCCcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 788899997 76532211122233443 57899999887
No 53
>KOG1836|consensus
Probab=87.18 E-value=1.2 Score=44.86 Aligned_cols=50 Identities=18% Similarity=0.076 Sum_probs=32.5
Q ss_pred CCCcee-cCCCCccCC-----CCcceee--cCC----ccCCCCceee-CCCCCCCCcccccCC
Q psy2060 22 FQPYLS-GLPDNEGLE-----KRVQTFV--DGG----HRTLDLLHDM-GARYHWGDTCETDIT 71 (193)
Q Consensus 22 ~~~~~c-C~~g~~g~~-----~~c~~~~--~gg----~~~~~~~~C~-C~~g~~G~~C~~~~~ 71 (193)
.++..- |.+||+|.. ..|..+- +++ ........|+ ||++|+|..|+...+
T Consensus 754 ~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 754 FGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred CCCchhhhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence 444543 888888774 1133322 123 2335678999 999999999997644
No 54
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=86.96 E-value=0.34 Score=26.83 Aligned_cols=34 Identities=26% Similarity=0.517 Sum_probs=22.3
Q ss_pred CC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060 75 CK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC 111 (193)
Q Consensus 75 C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C 111 (193)
|. ..|...+.|+...++.+.|. |-+||... |..|
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecr-Cllgyk~~--~~~C 36 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECR-CLLGYKKV--GGKC 36 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEE-E-TTEEEE--TTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEE-eeCCcccc--CCCc
Confidence 55 67888899998766999999 99988643 4444
No 55
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.98 E-value=0.38 Score=36.61 Aligned_cols=52 Identities=21% Similarity=0.395 Sum_probs=37.4
Q ss_pred CCCCceeeCCCCCC---CCcccccCCCCCCC------CCCCCCcEEeeCC----CCceecccCCCCCCc
Q psy2060 49 TLDLLHDMGARYHW---GDTCETDITKSDCK------GVCSPGATCSLNP----SGGYICDDCSLAGSY 104 (193)
Q Consensus 49 ~~~~~~C~C~~g~~---G~~C~~~~~~~~C~------~pC~~gg~C~~~~----~~~~~C~~C~~g~~~ 104 (193)
..+.|.|.|.+||. -..||... .|. .+|..-++|+..+ ...|+|. |..+|..
T Consensus 16 MSNHfEC~Cnegfvl~~EntCE~kv---~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~-C~~gY~~ 80 (197)
T PF06247_consen 16 MSNHFECKCNEGFVLKNENTCEEKV---ECDKLENVNKPCGDYAKCINQANKGEERAYKCD-CINGYIL 80 (197)
T ss_dssp ESSEEEEEESTTEEEEETTEEEE-------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEE-E-TTEEE
T ss_pred ccCceEEEcCCCcEEccccccccce---ecCcccccCccccchhhhhcCCCcccceeEEEe-cccCcee
Confidence 35679999999997 56787654 554 6999999999765 2789999 9998864
No 56
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=83.58 E-value=1.4 Score=34.30 Aligned_cols=35 Identities=31% Similarity=0.800 Sum_probs=26.2
Q ss_pred CCcccccCCCCCCC---CCCCCCcEEeeCCCCceecccCCCCCCc
Q psy2060 63 GDTCETDITKSDCK---GVCSPGATCSLNPSGGYICDDCSLAGSY 104 (193)
Q Consensus 63 G~~C~~~~~~~~C~---~pC~~gg~C~~~~~~~~~C~~C~~g~~~ 104 (193)
+..|+ +. ++|. .+|. ..|.+.. ++|.|. |+.||..
T Consensus 181 ~~~C~-~~--~~C~~~~~~c~--~~C~~~~-g~~~c~-c~~g~~~ 218 (224)
T cd01475 181 GKICV-VP--DLCATLSHVCQ--QVCISTP-GSYLCA-CTEGYAL 218 (224)
T ss_pred cccCc-Cc--hhhcCCCCCcc--ceEEcCC-CCEEeE-CCCCccC
Confidence 56675 33 3786 4566 4899887 999999 9999874
No 57
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.08 E-value=0.56 Score=35.72 Aligned_cols=76 Identities=12% Similarity=0.204 Sum_probs=45.1
Q ss_pred CCCcee-cCCCCccCC-CCcceeec-------------------CC-ccCCCCceeeCCCCCC--CCcccccCCCCCCC-
Q psy2060 22 FQPYLS-GLPDNEGLE-KRVQTFVD-------------------GG-HRTLDLLHDMGARYHW--GDTCETDITKSDCK- 76 (193)
Q Consensus 22 ~~~~~c-C~~g~~g~~-~~c~~~~~-------------------gg-~~~~~~~~C~C~~g~~--G~~C~~~~~~~~C~- 76 (193)
...|.| |.+||.-.. ..||.-++ .+ ......|.|.|.+||+ ...|-.. .|.
T Consensus 17 SNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~----~C~~ 92 (197)
T PF06247_consen 17 SNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPN----KCNN 92 (197)
T ss_dssp SSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEG----GGSS
T ss_pred cCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchh----hcCc
Confidence 356889 999995321 12553322 11 2445789999999997 3456543 788
Q ss_pred CCCCCCcEEeeCC--CCceecccCCCCCC
Q psy2060 77 GVCSPGATCSLNP--SGGYICDDCSLAGS 103 (193)
Q Consensus 77 ~pC~~gg~C~~~~--~~~~~C~~C~~g~~ 103 (193)
..|. .|.|+..+ .....|. |.-|..
T Consensus 93 ~~Cg-~GKCI~d~~~~~~~~CS-C~IGkV 119 (197)
T PF06247_consen 93 KDCG-SGKCILDPDNPNNPTCS-CNIGKV 119 (197)
T ss_dssp ---T-TEEEEEEEGGGSEEEEE-E-TEEE
T ss_pred eecC-CCeEEecCCCCCCceeE-eeeceE
Confidence 8888 58998664 2456899 977654
No 58
>KOG0994|consensus
Probab=83.06 E-value=0.81 Score=43.81 Aligned_cols=20 Identities=15% Similarity=0.144 Sum_probs=16.3
Q ss_pred CCcee-cCCCCccCCCCcceeec
Q psy2060 23 QPYLS-GLPDNEGLEKRVQTFVD 44 (193)
Q Consensus 23 ~~~~c-C~~g~~g~~~~c~~~~~ 44 (193)
+...| |.+||+|.- |+.+-+
T Consensus 932 ~~ivC~C~~GY~G~R--Ce~CA~ 952 (1758)
T KOG0994|consen 932 QQIVCHCQEGYSGSR--CEICAD 952 (1758)
T ss_pred cceeeecccCccccc--hhhhcc
Confidence 55568 999999995 998765
No 59
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=77.06 E-value=2.5 Score=29.16 Aligned_cols=28 Identities=29% Similarity=0.764 Sum_probs=22.6
Q ss_pred CCCC--CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060 73 SDCK--GVCSPGATCSLNPSGGYICDDCSLAGS 103 (193)
Q Consensus 73 ~~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~ 103 (193)
+.|+ ..|...|.|... ....|. |++||.
T Consensus 78 d~Cd~y~~CG~~g~C~~~--~~~~C~-Cl~GF~ 107 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNSN--NSPKCS-CLPGFE 107 (110)
T ss_pred cCCCCccccCCccEeCCC--CCCceE-CCCCcC
Confidence 4888 999999999643 455699 999986
No 60
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=74.32 E-value=1.6 Score=25.43 Aligned_cols=18 Identities=11% Similarity=0.062 Sum_probs=14.5
Q ss_pred CCceeeCCCCCCCCcccc
Q psy2060 51 DLLHDMGARYHWGDTCET 68 (193)
Q Consensus 51 ~~~~C~C~~g~~G~~C~~ 68 (193)
....|.|+++|+|+.|+.
T Consensus 16 ~~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 16 STGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp TCEEESBSTTEESTTS-E
T ss_pred CCCEEeccccccCCcCcC
Confidence 456899999999999974
No 61
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=73.41 E-value=2.5 Score=24.77 Aligned_cols=16 Identities=13% Similarity=0.046 Sum_probs=14.0
Q ss_pred CceeeCCCCCCCCccc
Q psy2060 52 LLHDMGARYHWGDTCE 67 (193)
Q Consensus 52 ~~~C~C~~g~~G~~C~ 67 (193)
.-.|.|+++++|..|+
T Consensus 18 ~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 18 TGQCECKPNTTGRRCD 33 (50)
T ss_pred CCEEeCCCcCCCCCCC
Confidence 4689999999999997
No 62
>KOG1218|consensus
Probab=73.12 E-value=8.1 Score=31.37 Aligned_cols=38 Identities=18% Similarity=0.260 Sum_probs=30.3
Q ss_pred CCCceeeCCCCCCCCcccccCCCCCCC--CCCCCCcEEeeCC
Q psy2060 50 LDLLHDMGARYHWGDTCETDITKSDCK--GVCSPGATCSLNP 89 (193)
Q Consensus 50 ~~~~~C~C~~g~~G~~C~~~~~~~~C~--~pC~~gg~C~~~~ 89 (193)
...-.|.|++||+|.+|..... .|. ..|.+++.|....
T Consensus 159 ~~~~~c~c~~g~~g~~~~~~~~--~c~~~~~~~~g~~C~~~~ 198 (316)
T KOG1218|consen 159 CKNGICTCQPGFVGVFCVESCS--GCSPLTACENGAKCNRST 198 (316)
T ss_pred CCCCceeccCCcccccccccCC--CcCCCcccCCCCeeeccc
Confidence 3456789999999999998754 477 8888988998653
No 63
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=73.01 E-value=2.7 Score=24.22 Aligned_cols=16 Identities=13% Similarity=0.011 Sum_probs=13.5
Q ss_pred CceeeCCCCCCCCccc
Q psy2060 52 LLHDMGARYHWGDTCE 67 (193)
Q Consensus 52 ~~~C~C~~g~~G~~C~ 67 (193)
.-.|.|+++++|..|+
T Consensus 17 ~G~C~C~~~~~G~~C~ 32 (46)
T smart00180 17 TGQCECKPNVTGRRCD 32 (46)
T ss_pred CCEEECCCCCCCCCCC
Confidence 4589999999999886
No 64
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=72.98 E-value=5.7 Score=23.30 Aligned_cols=21 Identities=24% Similarity=0.580 Sum_probs=15.4
Q ss_pred CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060 77 GVCSPGATCSLNPSGGYICDDCSLAGS 103 (193)
Q Consensus 77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~ 103 (193)
..|..++.|++. +|. |+.++.
T Consensus 26 ~qC~~~s~C~~g-----~C~-C~~g~~ 46 (52)
T PF01683_consen 26 EQCIGGSVCVNG-----RCQ-CPPGYV 46 (52)
T ss_pred CCCCCcCEEcCC-----EeE-CCCCCE
Confidence 567788888643 599 988865
No 65
>KOG3509|consensus
Probab=69.62 E-value=26 Score=33.59 Aligned_cols=39 Identities=18% Similarity=0.109 Sum_probs=22.8
Q ss_pred EEEEeCCCCeEEEEeec--CCCCCceEEEEecCCeeEEEEEEe
Q psy2060 139 LQFSTLHDSGLLLYNGR--YNERHDFIALELTDSGRSVQFSWS 179 (193)
Q Consensus 139 ~~f~T~~~~glLl~~g~--~~~~~dfi~l~l~~G~~~l~~~~~ 179 (193)
.+|+.....+.+++..+ .....|+++....... ..+.|.
T Consensus 572 ~s~~v~a~~~~~~~~~~~P~~~vad~vs~~g~~~~--~~~s~~ 612 (964)
T KOG3509|consen 572 ISFSVSAEAGGLLLRWSLPAKPVADKVSPLGGALS--QTHSYK 612 (964)
T ss_pred ceeeeeeeccceEEEecCCCCCCcceeccccceee--eeeccC
Confidence 45666666666665543 2334566666555555 666676
No 66
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=63.62 E-value=6.1 Score=30.76 Aligned_cols=17 Identities=6% Similarity=-0.490 Sum_probs=15.0
Q ss_pred ccCCCCceeeCCCCCCC
Q psy2060 47 HRTLDLLHDMGARYHWG 63 (193)
Q Consensus 47 ~~~~~~~~C~C~~g~~G 63 (193)
.++.++|.|.|++||+.
T Consensus 202 ~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 202 ISTPGSYLCACTEGYAL 218 (224)
T ss_pred EcCCCCEEeECCCCccC
Confidence 57889999999999974
No 67
>KOG0196|consensus
Probab=63.04 E-value=11 Score=35.42 Aligned_cols=13 Identities=23% Similarity=0.422 Sum_probs=8.7
Q ss_pred CceecccCCCCCCc
Q psy2060 91 GGYICDDCSLAGSY 104 (193)
Q Consensus 91 ~~~~C~~C~~g~~~ 104 (193)
+...|. |..||..
T Consensus 306 ga~~C~-C~~gyyR 318 (996)
T KOG0196|consen 306 GATSCT-CENGYYR 318 (996)
T ss_pred CCCccc-ccCCccc
Confidence 556688 8777653
No 68
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=62.19 E-value=2.6 Score=25.42 Aligned_cols=23 Identities=17% Similarity=-0.054 Sum_probs=14.6
Q ss_pred CCCCceeeCCCCCCCCcccccCC
Q psy2060 49 TLDLLHDMGARYHWGDTCETDIT 71 (193)
Q Consensus 49 ~~~~~~C~C~~g~~G~~C~~~~~ 71 (193)
..|...|.|..-|+|++|...+.
T Consensus 32 ~dG~p~CECn~Cy~GpdCS~~~~ 54 (56)
T PF04863_consen 32 ADGSPVCECNSCYGGPDCSTLIP 54 (56)
T ss_dssp ETTEE--EE-TTEESTTS-EE-T
T ss_pred ccCCccccccCCcCCCCcccCCC
Confidence 45668999999999999987653
No 69
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=46.71 E-value=13 Score=20.04 Aligned_cols=12 Identities=0% Similarity=-0.438 Sum_probs=9.8
Q ss_pred CCceeeCCCCCC
Q psy2060 51 DLLHDMGARYHW 62 (193)
Q Consensus 51 ~~~~C~C~~g~~ 62 (193)
....|.||.||.
T Consensus 16 ~~~~C~CPeGyI 27 (34)
T PF09064_consen 16 SPGQCFCPEGYI 27 (34)
T ss_pred CCCceeCCCceE
Confidence 456999999995
No 70
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=38.30 E-value=15 Score=25.30 Aligned_cols=26 Identities=23% Similarity=0.686 Sum_probs=19.6
Q ss_pred CCC---CCCCCCcEEeeCCC----CceecccCCC
Q psy2060 74 DCK---GVCSPGATCSLNPS----GGYICDDCSL 100 (193)
Q Consensus 74 ~C~---~pC~~gg~C~~~~~----~~~~C~~C~~ 100 (193)
.|. +-|.+.|.|+.... .=|.|. |.+
T Consensus 7 aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~-C~~ 39 (103)
T PF12955_consen 7 ACENATNNCSGHGSCVKKYGSGGGDCFACK-CKP 39 (103)
T ss_pred HHHHhccCCCCCceEeeccCCCccceEEEE-eec
Confidence 566 78999999998741 347899 966
No 71
>PF02973 Sialidase: Sialidase, N-terminal domain; InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections []. The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=30.90 E-value=2.5e+02 Score=21.60 Aligned_cols=50 Identities=14% Similarity=0.270 Sum_probs=32.2
Q ss_pred CceeEEEEEEEeCCCCeE--EEEeecCCCCCceEEEEecCCeeEEEEEEeCC
Q psy2060 132 RNRLHIKLQFSTLHDSGL--LLYNGRYNERHDFIALELTDSGRSVQFSWSLG 181 (193)
Q Consensus 132 ~~~~~i~~~f~T~~~~gl--Ll~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g 181 (193)
.....|.++|++....++ ||-.+.......|+.|-+.++.+.+.+|=.-|
T Consensus 32 L~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~ 83 (190)
T PF02973_consen 32 LEEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKG 83 (190)
T ss_dssp -SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTT
T ss_pred ccccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCC
Confidence 445688999999766665 44433324556899999999985555554444
No 72
>PF08300 HCV_NS5a_1a: Hepatitis C virus non-structural 5a zinc finger domain; InterPro: IPR013192 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in the non-structural 5a protein (NS5a) in Hepatitis C virus. The molecular function of NS5a is uncertain, but it is phosphorylated when expressed in mammalian cells. It is thought to interact with the dsRNA dependent (interferon inducible) kinase PKR, P19525 from SWISSPROT [, ]. This region corresponds to the N-terminal zinc binding domain (1a) []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003968 RNA-directed RNA polymerase activity, 0004252 serine-type endopeptidase activity, 0008270 zinc ion binding, 0017111 nucleoside-triphosphatase activity, 0006355 regulation of transcription, DNA-dependent, 0006915 apoptosis, 0030683 evasion by virus of host immune response, 0005789 endoplasmic reticulum membrane, 0016021 integral to membrane; PDB: 1ZH1_B 3FQM_A 3FQQ_B.
Probab=28.30 E-value=16 Score=22.51 Aligned_cols=13 Identities=31% Similarity=0.473 Sum_probs=6.1
Q ss_pred CceecCCCCccCC
Q psy2060 24 PYLSGLPDNEGLE 36 (193)
Q Consensus 24 ~~~cC~~g~~g~~ 36 (193)
||..|..||.|.+
T Consensus 2 Pf~sCqrGy~G~W 14 (62)
T PF08300_consen 2 PFLSCQRGYKGVW 14 (62)
T ss_dssp TTTS----B-SEE
T ss_pred CccccCCCCCcEE
Confidence 6777888888886
No 73
>TIGR01492 CPW_WPC Plasmodium falciparum CPW-WPC domain. This model represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats, and is known so far only in Plasmodium falciparum. It is named for motifs of CPxxW and (less well conserved) WPC.
Probab=27.54 E-value=58 Score=19.95 Aligned_cols=16 Identities=19% Similarity=0.210 Sum_probs=10.3
Q ss_pred CceeeCCCCCCCCcccc
Q psy2060 52 LLHDMGARYHWGDTCET 68 (193)
Q Consensus 52 ~~~C~C~~g~~G~~C~~ 68 (193)
++.|.-|..|.|+ |..
T Consensus 20 ~~~C~ap~~Y~Gp-C~~ 35 (62)
T TIGR01492 20 KYGCIAPDNYKGP-CER 35 (62)
T ss_pred CCeeCCCcccCCC-CCC
Confidence 4447777777776 653
No 74
>PF12599 DUF3768: Protein of unknown function (DUF3768); InterPro: IPR022243 This family of proteins is found in bacteria. Proteins in this family are typically between 108 and 129 amino acids in length. There are two conserved sequence motifs: NDP and RVLT.
Probab=26.02 E-value=97 Score=20.48 Aligned_cols=34 Identities=21% Similarity=0.232 Sum_probs=24.8
Q ss_pred CCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCCC
Q psy2060 157 NERHDFIALELTDSGRSVQFSWSLGSEVARLNRADS 192 (193)
Q Consensus 157 ~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~~ 192 (193)
.+.+||=++++.+-+ +.++++.-+...+.-|+++
T Consensus 34 ~gEHDFG~v~~~g~~--vfWKIDyYD~d~~~gS~dP 67 (84)
T PF12599_consen 34 YGEHDFGAVEFDGEK--VFWKIDYYDPDLEFGSPDP 67 (84)
T ss_pred cccccceEEEECCEE--EEEEEEEEcCccccCCCCC
Confidence 356899999988888 8888777666666666554
No 75
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=25.34 E-value=86 Score=15.71 Aligned_cols=21 Identities=29% Similarity=0.824 Sum_probs=12.2
Q ss_pred CC-CCCCCCcEEeeCCCCceec
Q psy2060 75 CK-GVCSPGATCSLNPSGGYIC 95 (193)
Q Consensus 75 C~-~pC~~gg~C~~~~~~~~~C 95 (193)
|. ..|..|..|+.+..+...|
T Consensus 2 C~~v~C~~G~~C~~d~~g~p~C 23 (26)
T smart00274 2 CRNVQCPFGKVCVVDKGGNARC 23 (26)
T ss_pred CCCEECCCCCEEEeCCCCCEEE
Confidence 45 5677777777643244444
No 76
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=24.57 E-value=1.1e+02 Score=15.30 Aligned_cols=19 Identities=11% Similarity=0.314 Sum_probs=13.8
Q ss_pred CCCceEEEEecCCeeEEEEEE
Q psy2060 158 ERHDFIALELTDSGRSVQFSW 178 (193)
Q Consensus 158 ~~~dfi~l~l~~G~~~l~~~~ 178 (193)
..+.+.++...+|. +..++
T Consensus 14 ~~g~l~a~d~~~G~--~~W~~ 32 (33)
T smart00564 14 TDGTLYALDAKTGE--ILWTY 32 (33)
T ss_pred CCCEEEEEEcccCc--EEEEc
Confidence 35678888888888 76654
No 77
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=23.91 E-value=99 Score=20.50 Aligned_cols=25 Identities=24% Similarity=0.728 Sum_probs=17.7
Q ss_pred CCC-CCCCCCcEEeeCCCCceecccCC
Q psy2060 74 DCK-GVCSPGATCSLNPSGGYICDDCS 99 (193)
Q Consensus 74 ~C~-~pC~~gg~C~~~~~~~~~C~~C~ 99 (193)
+|. ..|..|.+|+....+...|. |.
T Consensus 1 pC~~v~C~~G~~C~~d~~~~p~Cv-C~ 26 (86)
T cd01328 1 PCENHHCGAGKVCEVDDENTPKCV-CI 26 (86)
T ss_pred CCCCcCCCCCCEeeECCCCCeEEe-cC
Confidence 466 77888888886533667787 64
No 78
>PHA03399 pif3 per os infectivity factor 3; Provisional
Probab=23.82 E-value=57 Score=25.29 Aligned_cols=10 Identities=10% Similarity=-0.486 Sum_probs=7.1
Q ss_pred ceeeCCCCCC
Q psy2060 53 LHDMGARYHW 62 (193)
Q Consensus 53 ~~C~C~~g~~ 62 (193)
.-|+|.+||+
T Consensus 159 ~dC~C~~gyt 168 (200)
T PHA03399 159 DDCVCASGYT 168 (200)
T ss_pred hhcCcCCCCE
Confidence 4577777776
No 79
>COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown]
Probab=23.35 E-value=1.2e+02 Score=21.83 Aligned_cols=34 Identities=12% Similarity=0.325 Sum_probs=23.8
Q ss_pred CCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCC
Q psy2060 157 NERHDFIALELTDSGRSVQFSWSLGSEVARLNRAD 191 (193)
Q Consensus 157 ~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~ 191 (193)
.+..+|+.+.+.+|.+ ++..|+.-.+...+.|.|
T Consensus 44 ~gwk~f~~iti~dGKi-v~~~ydy~~k~G~~Ks~D 77 (147)
T COG4939 44 HGWKAFVTITIQDGKI-VACTYDYRDKKGNIKSDD 77 (147)
T ss_pred cCccceEEEEEeCCEE-EEEEeeeecCCCCccccc
Confidence 4567899999999982 345676666666666654
No 80
>PF07546 EMI: EMI domain; InterPro: IPR011489 The EMI domain, first named after its presence in proteins of the EMILIN family, is a small cysteine-rich module of around 75 amino acids. The EMI domain is most often found at the N terminus of metazoan extracellular proteins that are forming or are compatible with multimer formation []. It is found in association with other domains, such as C1q, laminin-type EGF-like, collagen-like, FN3, WAP, ZP or FAS1 []. It has been suggested that the EMI domain could be a protein-protein interaction module, as the EMI domain of EMILIN-1 was found to interact with the C1q domain of EMILIN-2 []. The EMI domain possesses six highly conserved cysteines residues, which likely form disulphide bonds. Other key features of the EMI domain are the C-C-x-G-[WYFH] pattern, a hydrophobic position just preceding the first cysteine (Cys1) of the domain and a cluster of hydrophobic residues between Cys3 and Cys4. The EMI domain could be made of two sub-domains, the fold of the second one sharing similarities with the C-terminal sub-module characteristic of EGF-like domains []. Proteins known to contain a EMI domain include: Vertebrate Emilins, extracellular matrix glycoproteins. Vertebrate Multimerins, extracellular matrix glycoproteins. Vetebrate Emu proteins, which could interact with several different extracellular matrix components and serve to connect and integrate the function of multiple partner molecules. Vertebrate beta-IG-H3. Vertebrate osteoblast-specific factor 2 (OSF-2). Mammalian NEU1/NG3 proteins. Drosophila midline fasciclin. Caenorhabditis elegans ced-1, a transmembrane receptor that mediates cell corpse engulfment. The Pfam alignment for this domain is truncated at the C terminus and does not include the final cysteine []. This is to stop the family overlapping with other domains.; GO: 0005515 protein binding
Probab=22.26 E-value=83 Score=19.79 Aligned_cols=6 Identities=33% Similarity=0.656 Sum_probs=3.5
Q ss_pred CCCccC
Q psy2060 30 PDNEGL 35 (193)
Q Consensus 30 ~g~~g~ 35 (193)
|||.|.
T Consensus 66 pGy~~~ 71 (72)
T PF07546_consen 66 PGYSGT 71 (72)
T ss_pred cCcccC
Confidence 366664
No 81
>PHA02151 hypothetical protein
Probab=21.92 E-value=1.5e+02 Score=22.18 Aligned_cols=31 Identities=29% Similarity=0.534 Sum_probs=22.7
Q ss_pred CCCeEEEEeecCCCCCceEEEEec--CCeeEEEEE
Q psy2060 145 HDSGLLLYNGRYNERHDFIALELT--DSGRSVQFS 177 (193)
Q Consensus 145 ~~~glLl~~g~~~~~~dfi~l~l~--~G~~~l~~~ 177 (193)
.++||.||.|-.+...-|+.|.+. +|+ +.++
T Consensus 140 ~~~glcly~g~~g~adv~lvl~~~~~~g~--~~~~ 172 (217)
T PHA02151 140 RADGLCLYAGYKGNADVFLVLHMENVDGQ--TYFN 172 (217)
T ss_pred ccCceEEecccCCCCcEEEEEeeecccce--EEec
Confidence 578999999753455668888776 677 7664
Done!