Query psy8281
Match_columns 178
No_of_seqs 184 out of 1541
Neff 11.1
Searched_HMMs 46136
Date Fri Aug 16 23:29:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8281.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8281hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.7 1.9E-16 4.2E-21 130.8 9.6 118 54-171 3860-3980(4289)
2 KOG1219|consensus 99.6 1.9E-15 4.1E-20 125.2 9.7 116 13-130 3860-3977(4289)
3 KOG4289|consensus 99.5 4.4E-13 9.5E-18 108.2 11.8 88 2-92 1224-1316(2531)
4 KOG1225|consensus 99.4 3.5E-12 7.7E-17 95.8 11.1 130 1-166 235-365 (525)
5 KOG4289|consensus 99.4 5.9E-13 1.3E-17 107.4 7.1 104 71-174 1215-1322(2531)
6 KOG1214|consensus 99.4 6.6E-12 1.4E-16 97.1 10.3 138 25-166 702-860 (1289)
7 KOG1217|consensus 99.3 8.1E-11 1.8E-15 89.7 13.2 166 2-170 154-355 (487)
8 KOG4260|consensus 99.3 5.9E-12 1.3E-16 85.7 6.0 153 3-163 131-304 (350)
9 KOG1217|consensus 99.2 1E-09 2.2E-14 83.7 13.1 165 2-167 112-306 (487)
10 KOG1214|consensus 99.0 4.2E-09 9.1E-14 82.1 9.5 120 4-128 720-860 (1289)
11 KOG1225|consensus 99.0 4.8E-09 1E-13 79.2 9.7 109 42-170 235-343 (525)
12 KOG1226|consensus 98.6 1E-06 2.2E-11 68.7 10.1 116 1-131 479-621 (783)
13 KOG1226|consensus 98.5 1.3E-06 2.8E-11 68.1 10.6 129 25-172 469-624 (783)
14 smart00179 EGF_CA Calcium-bind 98.4 6.4E-07 1.4E-11 44.0 4.2 36 133-168 2-39 (39)
15 PF00008 EGF: EGF-like domain 98.4 2.3E-07 5E-12 43.4 1.7 31 136-166 1-32 (32)
16 KOG0994|consensus 98.3 3.6E-06 7.8E-11 68.4 8.3 163 3-170 889-1100(1758)
17 PF07645 EGF_CA: Calcium-bindi 98.2 8.3E-07 1.8E-11 44.4 2.3 32 132-163 1-34 (42)
18 PF00008 EGF: EGF-like domain 98.2 4.9E-07 1.1E-11 42.3 1.1 31 20-52 1-31 (32)
19 cd00054 EGF_CA Calcium-binding 98.2 4.5E-06 9.7E-11 40.5 4.2 36 133-168 2-38 (38)
20 KOG4260|consensus 98.2 4E-06 8.7E-11 57.8 5.0 117 44-164 131-269 (350)
21 smart00179 EGF_CA Calcium-bind 98.0 1.3E-05 2.9E-10 39.2 4.1 34 18-54 3-38 (39)
22 KOG0994|consensus 98.0 2.3E-05 5.1E-10 63.9 7.2 101 70-176 1031-1154(1758)
23 smart00181 EGF Epidermal growt 97.8 4.3E-05 9.4E-10 36.4 3.9 29 139-168 6-35 (35)
24 cd00053 EGF Epidermal growth f 97.8 4.8E-05 1E-09 36.2 4.1 31 138-168 5-36 (36)
25 cd00054 EGF_CA Calcium-binding 97.8 7.2E-05 1.6E-09 36.1 4.0 34 18-54 3-37 (38)
26 cd00053 EGF Epidermal growth f 97.6 0.00019 4.1E-09 34.0 3.9 30 22-54 5-35 (36)
27 PF07974 EGF_2: EGF-like domai 97.5 0.00018 3.8E-09 33.5 3.2 26 140-167 7-32 (32)
28 smart00181 EGF Epidermal growt 97.5 0.00025 5.5E-09 33.6 3.8 31 20-54 2-34 (35)
29 PF12661 hEGF: Human growth fa 97.5 3.5E-05 7.7E-10 28.1 0.4 13 155-167 1-13 (13)
30 KOG1836|consensus 97.5 0.0025 5.4E-08 55.5 11.6 53 3-56 760-813 (1705)
31 PF07645 EGF_CA: Calcium-bindi 97.5 5.5E-05 1.2E-09 37.7 1.3 31 17-50 2-34 (42)
32 PF12947 EGF_3: EGF domain; I 97.5 8E-05 1.7E-09 35.6 1.7 27 140-166 7-33 (36)
33 PF12662 cEGF: Complement Clr- 97.2 0.00041 8.8E-09 29.8 2.2 19 40-59 1-23 (24)
34 KOG1836|consensus 97.2 0.0013 2.8E-08 57.2 6.5 57 119-175 760-819 (1705)
35 PF07974 EGF_2: EGF-like domai 97.1 0.00098 2.1E-08 30.9 3.0 27 23-54 6-32 (32)
36 PF12947 EGF_3: EGF domain; I 97.0 0.00034 7.3E-09 33.5 1.0 28 23-53 6-33 (36)
37 PF06247 Plasmod_Pvs28: Plasmo 96.6 0.0011 2.3E-08 43.7 1.3 124 37-165 16-162 (197)
38 smart00051 DSL delta serrate l 96.3 0.0081 1.8E-07 32.7 3.5 44 2-54 19-63 (63)
39 PF14670 FXa_inhibition: Coagu 96.3 0.0025 5.5E-08 30.4 1.2 20 144-163 9-28 (36)
40 smart00051 DSL delta serrate l 96.1 0.013 2.9E-07 31.8 3.8 45 117-167 18-63 (63)
41 PHA03099 epidermal growth fact 95.5 0.021 4.5E-07 35.2 3.3 34 140-174 52-87 (139)
42 cd00055 EGF_Lam Laminin-type e 95.5 0.02 4.4E-07 29.5 2.8 22 155-176 20-41 (50)
43 PF00053 Laminin_EGF: Laminin 95.4 0.0071 1.5E-07 31.1 1.0 29 146-176 12-40 (49)
44 PHA02887 EGF-like protein; Pro 94.9 0.035 7.5E-07 33.7 2.9 32 139-171 92-125 (126)
45 smart00180 EGF_Lam Laminin-typ 94.8 0.035 7.6E-07 28.1 2.5 22 155-176 19-40 (46)
46 cd01475 vWA_Matrilin VWA_Matri 93.9 0.093 2E-06 36.3 3.8 39 126-165 181-219 (224)
47 PF06247 Plasmod_Pvs28: Plasmo 93.7 0.03 6.4E-07 37.1 0.9 121 2-126 22-161 (197)
48 KOG3512|consensus 92.9 0.99 2.2E-05 34.6 7.8 28 146-175 408-435 (592)
49 PF12955 DUF3844: Domain of un 91.4 0.24 5.2E-06 29.7 2.6 36 138-173 12-65 (103)
50 KOG1218|consensus 91.3 4.3 9.3E-05 29.5 12.2 36 117-152 163-199 (316)
51 KOG3516|consensus 89.9 0.35 7.6E-06 41.0 3.2 37 17-56 545-582 (1306)
52 PF01414 DSL: Delta serrate li 89.6 0.063 1.4E-06 29.2 -0.8 46 116-167 17-63 (63)
53 PHA02887 EGF-like protein; Pro 89.1 0.54 1.2E-05 28.7 2.8 29 25-55 94-122 (126)
54 PF12946 EGF_MSP1_1: MSP1 EGF 87.3 0.45 9.7E-06 22.7 1.3 25 139-163 5-30 (37)
55 PHA03099 epidermal growth fact 86.8 0.69 1.5E-05 28.7 2.3 30 25-56 53-82 (139)
56 KOG3516|consensus 86.0 0.83 1.8E-05 38.9 3.0 39 132-170 544-583 (1306)
57 KOG3514|consensus 85.8 0.75 1.6E-05 38.9 2.7 34 19-55 625-659 (1591)
58 PF04863 EGF_alliinase: Alliin 83.3 0.48 1E-05 24.7 0.5 33 24-56 18-51 (56)
59 cd01475 vWA_Matrilin VWA_Matri 82.4 2.1 4.7E-05 29.6 3.6 37 89-126 182-218 (224)
60 KOG3514|consensus 82.3 1.2 2.6E-05 37.8 2.5 39 135-173 625-664 (1591)
61 PF00954 S_locus_glycop: S-loc 67.8 9.1 0.0002 23.2 3.2 32 133-165 77-109 (110)
62 KOG3607|consensus 67.2 4.8 0.0001 33.2 2.3 31 140-173 631-661 (716)
63 PF01683 EB: EB module; Inter 65.3 11 0.00024 19.2 2.8 20 140-163 27-46 (52)
64 PF09064 Tme5_EGF_like: Thromb 57.4 10 0.00022 17.7 1.5 10 155-164 19-28 (34)
65 KOG1218|consensus 53.5 84 0.0018 22.8 12.6 36 79-114 163-199 (316)
66 KOG3509|consensus 40.7 92 0.002 27.1 5.5 65 18-85 407-473 (964)
67 KOG3509|consensus 37.2 1.1E+02 0.0024 26.6 5.5 60 102-161 413-473 (964)
68 KOG3607|consensus 35.5 28 0.00061 29.0 2.0 25 25-55 632-656 (716)
69 KOG3512|consensus 28.1 75 0.0016 25.1 3.0 30 146-175 286-316 (592)
No 1
>KOG1219|consensus
Probab=99.68 E-value=1.9e-16 Score=130.81 Aligned_cols=118 Identities=37% Similarity=1.010 Sum_probs=106.0
Q ss_pred CCCCCCCCCCCCCCC-ceEeeCC-CCeeeeCCCCCcCCCccccCCCcCCCCCCCCCeEecCCCCeeeecCCCcccCCccc
Q psy8281 54 CQHNLDDCASSPCGH-GICVDQT-DGYRCYCQPGYSGEQCQYEYNECESSPCLNGGSCSDHVGRFSCTCGHGYTGQRCQI 131 (178)
Q Consensus 54 C~~~~~~c~~~~c~~-~~c~~~~-~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~c~~ 131 (178)
|....+.|...+|.| |.|.... ++|.|.|++-|.|++|+.++..|...||..++.|....+.+.|.|+.+|+|..|+.
T Consensus 3860 C~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~ 3939 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA 3939 (4289)
T ss_pred ccccccccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec
Confidence 443336788888987 6898654 78999999999999999999999999999999999999999999999999999997
Q ss_pred c-CCCCCCCCCCCCCeeeeCCCceeeeCCCCCCCCCCCCCC
Q psy8281 132 K-VDLCDPNPCSHHHYCVDKGNTFACECPKGYQGPNCDVPG 171 (178)
Q Consensus 132 ~-~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~C~~~~ 171 (178)
+ +++|..++|..++.|+|..|+|+|.|.+||.|..|....
T Consensus 3940 ~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~~ 3980 (4289)
T KOG1219|consen 3940 RGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCCAEK 3980 (4289)
T ss_pred ccccccccccccCCceeeccCCceEeccChhHhcccCcccc
Confidence 7 899999999999999999999999999999999986443
No 2
>KOG1219|consensus
Probab=99.63 E-value=1.9e-15 Score=125.17 Aligned_cols=116 Identities=39% Similarity=1.046 Sum_probs=105.7
Q ss_pred CCCCCCCCCCCCCCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCCCCCCCCCCCC-ceEeeCCCCeeeeCCCCCcCCCc
Q psy8281 13 CEDTSDPCESGPCQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNLDDCASSPCGH-GICVDQTDGYRCYCQPGYSGEQC 91 (178)
Q Consensus 13 C~~~~~~c~~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~~c~~~~c~~-~~c~~~~~~~~C~c~~g~~g~~c 91 (178)
|....++|..+||+++|+|... ..+.|.|.|++.|+|..|+.++..|.+.+|.. ++|+...+++.|.|+.||+|.+|
T Consensus 3860 C~l~~d~C~~npCqhgG~C~~~--~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~C 3937 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQ--PKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRC 3937 (4289)
T ss_pred ccccccccccCcccCCCEecCC--CCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCcee
Confidence 4444578999999999999874 56789999999999999998999999999975 79999999999999999999999
Q ss_pred ccc-CCCcCCCCCCCCCeEecCCCCeeeecCCCcccCCcc
Q psy8281 92 QYE-YNECESSPCLNGGSCSDHVGRFSCTCGHGYTGQRCQ 130 (178)
Q Consensus 92 ~~~-~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~c~ 130 (178)
+.. +++|...+|..++.|.+..|.++|.|.+++.|+.|.
T Consensus 3938 e~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3938 EARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred ecccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence 877 889999999999999999999999999999988875
No 3
>KOG4289|consensus
Probab=99.49 E-value=4.4e-13 Score=108.16 Aligned_cols=88 Identities=36% Similarity=0.937 Sum_probs=73.0
Q ss_pred CCCCCCcCCCCCCCCCCCCCCCCCCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCC--CCCCCCCCCC-ceEeeC-CCC
Q psy8281 2 FGVRCGFTGKTCEDTSDPCESGPCQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNL--DDCASSPCGH-GICVDQ-TDG 77 (178)
Q Consensus 2 ~~c~~g~~G~~C~~~~~~c~~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~--~~c~~~~c~~-~~c~~~-~~~ 77 (178)
|.|++||+|..|+..++.|-..||.++|+|.. .++.|+|.|.+||+|..|+-+. ..|.+..|.+ ++|++. .+.
T Consensus 1224 CrCPpGFTgd~CeTeiDlCYs~pC~nng~C~s---rEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ngg 1300 (2531)
T KOG4289|consen 1224 CRCPPGFTGDYCETEIDLCYSGPCGNNGRCRS---REGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGG 1300 (2531)
T ss_pred EeCCCCCCcccccchhHhhhcCCCCCCCceEE---ecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCc
Confidence 88999999999999999999999999999998 8899999999999999998432 3566666765 688765 466
Q ss_pred eeeeCCCC-CcCCCcc
Q psy8281 78 YRCYCQPG-YSGEQCQ 92 (178)
Q Consensus 78 ~~C~c~~g-~~g~~c~ 92 (178)
+.|.|+.| |.++.|+
T Consensus 1301 f~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1301 FCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred eeccCCCcccCCCceE
Confidence 78899887 3355555
No 4
>KOG1225|consensus
Probab=99.41 E-value=3.5e-12 Score=95.77 Aligned_cols=130 Identities=40% Similarity=1.085 Sum_probs=98.6
Q ss_pred CCCCCCCcCCCCCCCCCCCCCCCCCCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCCCCCCCCCCC-CceEeeCCCCee
Q psy8281 1 MFGVRCGFTGKTCEDTSDPCESGPCQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNLDDCASSPCG-HGICVDQTDGYR 79 (178)
Q Consensus 1 ~~~c~~g~~G~~C~~~~~~c~~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~~c~~~~c~-~~~c~~~~~~~~ 79 (178)
||.|..+|+|..|.. ..|... |..++.|+.. +|+|++||+|.+|.+ ..|... |. ++.+++ + .
T Consensus 235 ic~c~~~~~g~~c~~--~~C~~~-c~~~g~c~~G-------~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~--g--~ 297 (525)
T KOG1225|consen 235 ICECPEGYFGPLCST--IYCPGG-CTGRGQCVEG-------RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVD--G--E 297 (525)
T ss_pred eeecCCceeCCcccc--ccCCCC-CcccceEeCC-------eEeCCCCCcCCCCCc--ccCCcc-cCCCceecC--C--E
Confidence 578889999999983 446554 7777888752 799999999999964 445544 44 344432 2 7
Q ss_pred eeCCCCCcCCCccccCCCcCCCCCCCCCeEecCCCCeeeecCCCcccCCccccCCCCCCCCCCCCCeeeeCCCceeeeCC
Q psy8281 80 CYCQPGYSGEQCQYEYNECESSPCLNGGSCSDHVGRFSCTCGHGYTGQRCQIKVDLCDPNPCSHHHYCVDKGNTFACECP 159 (178)
Q Consensus 80 C~c~~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~ 159 (178)
|+|.++|.|+.|.+.. |. ..|..++.|+ .+ +|.|.+||+|..|... .|.+++.|++. |.|.
T Consensus 298 CiC~~g~~G~dCs~~~--cp-adC~g~G~Ci--~G--~C~C~~Gy~G~~C~~~-------~C~~~g~cv~g-----C~C~ 358 (525)
T KOG1225|consen 298 CICNPGYSGKDCSIRR--CP-ADCSGHGKCI--DG--ECLCDEGYTGELCIQR-------ACSGGGQCVNG-----CKCK 358 (525)
T ss_pred eecCCCcccccccccc--CC-ccCCCCCccc--CC--ceEeCCCCcCCccccc-------ccCCCceeccC-----ceec
Confidence 9999999999996332 33 5599999998 33 4999999999999732 28888888773 8999
Q ss_pred CCCCCCC
Q psy8281 160 KGYQGPN 166 (178)
Q Consensus 160 ~g~~g~~ 166 (178)
.||.|++
T Consensus 359 ~Gw~G~d 365 (525)
T KOG1225|consen 359 KGWRGPD 365 (525)
T ss_pred cCccCCC
Confidence 9999988
No 5
>KOG4289|consensus
Probab=99.41 E-value=5.9e-13 Score=107.45 Aligned_cols=104 Identities=36% Similarity=0.977 Sum_probs=90.4
Q ss_pred EeeCCCCeeeeCCCCCcCCCccccCCCcCCCCCCCCCeEecCCCCeeeecCCCcccCCccccC--CCCCCCCCCCCCeee
Q psy8281 71 CVDQTDGYRCYCQPGYSGEQCQYEYNECESSPCLNGGSCSDHVGRFSCTCGHGYTGQRCQIKV--DLCDPNPCSHHHYCV 148 (178)
Q Consensus 71 c~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~c~~~~--~~c~~~~c~~~~~c~ 148 (178)
-++..++++|.|++||+|..|+.+++.|-..+|.+++.|....++|+|.|+++|+|.+|+.+. -.|.+..|.++++|+
T Consensus 1215 pi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~ 1294 (2531)
T KOG4289|consen 1215 PIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCV 1294 (2531)
T ss_pred eccccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEe
Confidence 345667889999999999999999999999999999999999999999999999999998543 467888999999999
Q ss_pred eCC-CceeeeCCCC-CCCCCCCCCCeee
Q psy8281 149 DKG-NTFACECPKG-YQGPNCDVPGIVF 174 (178)
Q Consensus 149 ~~~-~~~~C~C~~g-~~g~~C~~~~~~~ 174 (178)
+.. |.+.|+|+.| |.+++|++.+..|
T Consensus 1295 ~~~nggf~c~Cp~ge~e~prC~v~trSF 1322 (2531)
T KOG4289|consen 1295 NLLNGGFCCHCPYGEFEDPRCEVTTRSF 1322 (2531)
T ss_pred ecCCCceeccCCCcccCCCceEEEeecc
Confidence 875 6788999987 7799998766544
No 6
>KOG1214|consensus
Probab=99.37 E-value=6.6e-12 Score=97.12 Aligned_cols=138 Identities=31% Similarity=0.905 Sum_probs=101.0
Q ss_pred CCCCCeeccCCCCCCceeeeCCCCCC--CCCCCCCCCCCCC--CCCC-CceEeeCCCCeeeeCCCCCc--C--CCccc--
Q psy8281 25 CQNGGSCAASNLTAQQFKCLCPPGFS--GSLCQHNLDDCAS--SPCG-HGICVDQTDGYRCYCQPGYS--G--EQCQY-- 93 (178)
Q Consensus 25 C~~~~~C~~~~~~~~~~~C~C~~g~~--g~~C~~~~~~c~~--~~c~-~~~c~~~~~~~~C~c~~g~~--g--~~c~~-- 93 (178)
|..+..|..+ +...|+|.|..||. |+.|. +.++|.. ..|. +..|++..+.++|+|..+|. + -+|..
T Consensus 702 cdt~a~C~pg--~~~~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~ 778 (1289)
T KOG1214|consen 702 CDTTARCHPG--TGVDYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLIT 778 (1289)
T ss_pred cCCCccccCC--CCcceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEec
Confidence 5666777765 45579999999998 55676 5566654 2354 67999999999999998875 2 24431
Q ss_pred ---cCCCcCC--CCCCCCCeE--e-cCCCCeeeecCCCccc--CCccccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCC
Q psy8281 94 ---EYNECES--SPCLNGGSC--S-DHVGRFSCTCGHGYTG--QRCQIKVDLCDPNPCSHHHYCVDKGNTFACECPKGYQ 163 (178)
Q Consensus 94 ---~~~~c~~--~~c~~~~~c--~-~~~~~~~C~C~~g~~g--~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~ 163 (178)
.++.|.. ..|.-.+++ + ...++|.|.|.+||.| ..|. ++++|.++.|+..++|.|+++++.|.|.+||.
T Consensus 779 ~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~ 857 (1289)
T KOG1214|consen 779 PPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYY 857 (1289)
T ss_pred CCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCcc
Confidence 2233432 234444433 3 2346789999999984 5676 78999999999999999999999999999998
Q ss_pred CCC
Q psy8281 164 GPN 166 (178)
Q Consensus 164 g~~ 166 (178)
|+.
T Consensus 858 GDG 860 (1289)
T KOG1214|consen 858 GDG 860 (1289)
T ss_pred CCC
Confidence 764
No 7
>KOG1217|consensus
Probab=99.30 E-value=8.1e-11 Score=89.69 Aligned_cols=166 Identities=37% Similarity=0.955 Sum_probs=121.9
Q ss_pred CCCCCCcCCCCCCCCCCCCC--CCCCCCCCeeccCCCCCCceeeeCCCCCCCCCCCCC-------------------CCC
Q psy8281 2 FGVRCGFTGKTCEDTSDPCE--SGPCQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHN-------------------LDD 60 (178)
Q Consensus 2 ~~c~~g~~G~~C~~~~~~c~--~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~-------------------~~~ 60 (178)
+.|..||.|..+....+.|. ..+|.+++.|.+ ..+.|.|.|.++|.+..++.. ...
T Consensus 154 c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~---~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~ 230 (487)
T KOG1217|consen 154 CSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVN---TGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPE 230 (487)
T ss_pred eeeCCCcccccccccccccccCCCCcCCCccccc---CCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCC
Confidence 67899999999986546786 445999999998 566799999999998877643 111
Q ss_pred CCCC--CCC-C-ceEeeCCCCeeeeCCCCCcCCC--ccccCCCcCCCC-CCCCCeEecCCCCeeeecCCCcccCCc--cc
Q psy8281 61 CASS--PCG-H-GICVDQTDGYRCYCQPGYSGEQ--CQYEYNECESSP-CLNGGSCSDHVGRFSCTCGHGYTGQRC--QI 131 (178)
Q Consensus 61 c~~~--~c~-~-~~c~~~~~~~~C~c~~g~~g~~--c~~~~~~c~~~~-c~~~~~c~~~~~~~~C~C~~g~~g~~c--~~ 131 (178)
+... .+. . +.|.+..+.+.|.+.+||.+.. ...+++.|.... |.+++.|.+..+.+.|.|+++|.+..+ ..
T Consensus 231 c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~ 310 (487)
T KOG1217|consen 231 CEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECV 310 (487)
T ss_pred cccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCcccc
Confidence 1111 111 1 5777778889999999998775 233667777653 888899999888899999999998887 22
Q ss_pred cCCCC----CCCCCCCCCee--eeCCCceeeeCCCCCCCCCCCCC
Q psy8281 132 KVDLC----DPNPCSHHHYC--VDKGNTFACECPKGYQGPNCDVP 170 (178)
Q Consensus 132 ~~~~c----~~~~c~~~~~c--~~~~~~~~C~C~~g~~g~~C~~~ 170 (178)
+...| ....|..+..| ....+.+.|.|.++|.|..|+..
T Consensus 311 ~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~ 355 (487)
T KOG1217|consen 311 DVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDS 355 (487)
T ss_pred ccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccC
Confidence 34566 34457777777 33445678999999999999865
No 8
>KOG4260|consensus
Probab=99.30 E-value=5.9e-12 Score=85.67 Aligned_cols=153 Identities=28% Similarity=0.654 Sum_probs=106.5
Q ss_pred CCCCCcCCCCCCCCCCCCCCCCCCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCCCCCCC-------CC---CCC---c
Q psy8281 3 GVRCGFTGKTCEDTSDPCESGPCQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNLDDCAS-------SP---CGH---G 69 (178)
Q Consensus 3 ~c~~g~~G~~C~~~~~~c~~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~~c~~-------~~---c~~---~ 69 (178)
=|++|.+|++|... .--...||..+|.|.-.....++..|.|..||+|..|..-..+-.. .. |+. +
T Consensus 131 CCp~gtyGpdCl~C-pggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~ 209 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQC-PGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLG 209 (350)
T ss_pred ccCCCCcCCccccC-CCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhc
Confidence 37899999999853 3345678999999998877888999999999999988631111100 01 221 2
Q ss_pred eEeeCCCCeee-eCCCCCcC--CCccccCCCcC--CCCCCCCCeEecCCCCeeeecCCCcccCCccccCCCCCC--CCC-
Q psy8281 70 ICVDQTDGYRC-YCQPGYSG--EQCQYEYNECE--SSPCLNGGSCSDHVGRFSCTCGHGYTGQRCQIKVDLCDP--NPC- 141 (178)
Q Consensus 70 ~c~~~~~~~~C-~c~~g~~g--~~c~~~~~~c~--~~~c~~~~~c~~~~~~~~C~C~~g~~g~~c~~~~~~c~~--~~c- 141 (178)
.|.... +..| .|..||.- ..|. ++++|. +.+|..+..|+|+.|+|.|..++||.+. +++|.. ..|
T Consensus 210 ~Csg~~-~k~C~kCkkGW~lde~gCv-DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d~C~~~~d~~~ 282 (350)
T KOG4260|consen 210 VCSGES-SKGCSKCKKGWKLDEEGCV-DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VDECQFCADVCA 282 (350)
T ss_pred ccCCCC-CCChhhhcccceecccccc-cHHHHhcCCCCCChhheeecCCCceEecccccccCC-----hHHhhhhhhhcc
Confidence 343222 2234 68999973 3454 788885 4568888899999999999999998752 333321 223
Q ss_pred CCCCeeeeCCCceeeeCCCCCC
Q psy8281 142 SHHHYCVDKGNTFACECPKGYQ 163 (178)
Q Consensus 142 ~~~~~c~~~~~~~~C~C~~g~~ 163 (178)
..+..|+|+.+.|+|+|..++.
T Consensus 283 ~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 283 SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred cCCCCcccCCccEEEEecccce
Confidence 3446699999999999998853
No 9
>KOG1217|consensus
Probab=99.17 E-value=1e-09 Score=83.73 Aligned_cols=165 Identities=37% Similarity=0.970 Sum_probs=117.7
Q ss_pred CCCCCCcCCCCCCCCCCCCCCCC--CCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCCCCCCC--CCCCC-ceEeeCCC
Q psy8281 2 FGVRCGFTGKTCEDTSDPCESGP--CQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNLDDCAS--SPCGH-GICVDQTD 76 (178)
Q Consensus 2 ~~c~~g~~G~~C~~~~~~c~~~~--C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~~c~~--~~c~~-~~c~~~~~ 76 (178)
|.|.+||.|..+.... .|...+ +...+.|.........+.|.|..||.+..+....++|.. ..|.+ +.|.+..+
T Consensus 112 c~c~~g~~~~~~~~~~-~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~ 190 (487)
T KOG1217|consen 112 CTCPPGYQGTPCEGEC-ECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGG 190 (487)
T ss_pred eeCCCccccCcCCcce-eecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCC
Confidence 5688889888887531 355444 245566654211124688999999999988755567763 33654 68988888
Q ss_pred CeeeeCCCCCcCCCcccc-------------------CCCcCC--CCCCCC-CeEecCCCCeeeecCCCcccCC--cccc
Q psy8281 77 GYRCYCQPGYSGEQCQYE-------------------YNECES--SPCLNG-GSCSDHVGRFSCTCGHGYTGQR--CQIK 132 (178)
Q Consensus 77 ~~~C~c~~g~~g~~c~~~-------------------~~~c~~--~~c~~~-~~c~~~~~~~~C~C~~g~~g~~--c~~~ 132 (178)
.+.|.|..+|.+..+... ...+.. ..+... +.|.+..+.+.|.++++|.+.. ...+
T Consensus 191 ~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~ 270 (487)
T KOG1217|consen 191 SYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVD 270 (487)
T ss_pred CeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeee
Confidence 899999999987766532 111111 112222 6788888889999999998776 2347
Q ss_pred CCCCCCCC-CCCCCeeeeCCCceeeeCCCCCCCCCC
Q psy8281 133 VDLCDPNP-CSHHHYCVDKGNTFACECPKGYQGPNC 167 (178)
Q Consensus 133 ~~~c~~~~-c~~~~~c~~~~~~~~C~C~~g~~g~~C 167 (178)
++.|.... |.++++|++..+.|.|.|++||+|..|
T Consensus 271 ~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 271 VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC 306 (487)
T ss_pred ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence 78887654 888899999998899999999999998
No 10
>KOG1214|consensus
Probab=98.99 E-value=4.2e-09 Score=82.11 Aligned_cols=120 Identities=31% Similarity=0.886 Sum_probs=86.2
Q ss_pred CCCCcC--CCCCCCCCCCCCC--CCCCCCCeeccCCCCCCceeeeCCCCCC--C--CCCCC-----CCCCCCCC--CCC-
Q psy8281 4 VRCGFT--GKTCEDTSDPCES--GPCQNGGSCAASNLTAQQFKCLCPPGFS--G--SLCQH-----NLDDCASS--PCG- 67 (178)
Q Consensus 4 c~~g~~--G~~C~~~~~~c~~--~~C~~~~~C~~~~~~~~~~~C~C~~g~~--g--~~C~~-----~~~~c~~~--~c~- 67 (178)
|..||. |..|.+ .++|+. ..|..+..|++ .+++|+|.|..||. + -.|.. .++.|... .|.
T Consensus 720 cs~g~~gdgr~c~d-~~eca~~~~~CGp~s~Cin---~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i 795 (1289)
T KOG1214|consen 720 CSSGYQGDGRNCVD-ENECATGFHRCGPNSVCIN---LPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAI 795 (1289)
T ss_pred EeeccCCCCCCCCC-hhhhccCCCCCCCCceeec---CCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCc
Confidence 445554 556664 356654 34999999998 89999999999875 2 24432 22334322 243
Q ss_pred --CceEeeC-CCCeeeeCCCCCcCC--CccccCCCcCCCCCCCCCeEecCCCCeeeecCCCcccCC
Q psy8281 68 --HGICVDQ-TDGYRCYCQPGYSGE--QCQYEYNECESSPCLNGGSCSDHVGRFSCTCGHGYTGQR 128 (178)
Q Consensus 68 --~~~c~~~-~~~~~C~c~~g~~g~--~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~ 128 (178)
+..|+.. .+.|.|.|.+||.|. .|. +.++|.++.|.....|.++.+++.|+|.+||.|..
T Consensus 796 ~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 796 AGQARCVHHGGSTYSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred CCceEEEecCCceEEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCccCCC
Confidence 2355544 356899999999954 666 67999999999999999999999999999998643
No 11
>KOG1225|consensus
Probab=98.99 E-value=4.8e-09 Score=79.24 Aligned_cols=109 Identities=41% Similarity=1.068 Sum_probs=79.5
Q ss_pred eeeCCCCCCCCCCCCCCCCCCCCCCCCceEeeCCCCeeeeCCCCCcCCCccccCCCcCCCCCCCCCeEecCCCCeeeecC
Q psy8281 42 KCLCPPGFSGSLCQHNLDDCASSPCGHGICVDQTDGYRCYCQPGYSGEQCQYEYNECESSPCLNGGSCSDHVGRFSCTCG 121 (178)
Q Consensus 42 ~C~C~~g~~g~~C~~~~~~c~~~~c~~~~c~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~ 121 (178)
.|.|..+|.|..|+. ..|...--.++.|+.. +|+|++||.|..|.. ..|... |..++.+++. .|+|.
T Consensus 235 ic~c~~~~~g~~c~~--~~C~~~c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g----~CiC~ 301 (525)
T KOG1225|consen 235 ICECPEGYFGPLCST--IYCPGGCTGRGQCVEG----RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDG----ECICN 301 (525)
T ss_pred eeecCCceeCCcccc--ccCCCCCcccceEeCC----eEeCCCCCcCCCCCc--ccCCcc-cCCCceecCC----EeecC
Confidence 689999999988862 3343321123455433 799999999999863 334433 6666666543 59999
Q ss_pred CCcccCCccccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCCCCCCCCC
Q psy8281 122 HGYTGQRCQIKVDLCDPNPCSHHHYCVDKGNTFACECPKGYQGPNCDVP 170 (178)
Q Consensus 122 ~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~~C~~~ 170 (178)
++|.|..|+. ..| +..|++++.|+.. +|.|.+||+|..|+..
T Consensus 302 ~g~~G~dCs~--~~c-padC~g~G~Ci~G----~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 302 PGYSGKDCSI--RRC-PADCSGHGKCIDG----ECLCDEGYTGELCIQR 343 (525)
T ss_pred CCcccccccc--ccC-CccCCCCCcccCC----ceEeCCCCcCCccccc
Confidence 9999999973 334 3679999999843 8999999999999875
No 12
>KOG1226|consensus
Probab=98.55 E-value=1e-06 Score=68.69 Aligned_cols=116 Identities=34% Similarity=0.864 Sum_probs=77.8
Q ss_pred CCCCCCCcCCCCCCCCCC---------CCCC----CCCCCCCeeccCCCCCCceeeeCCCCCC----CCCCCCCCCCCCC
Q psy8281 1 MFGVRCGFTGKTCEDTSD---------PCES----GPCQNGGSCAASNLTAQQFKCLCPPGFS----GSLCQHNLDDCAS 63 (178)
Q Consensus 1 ~~~c~~g~~G~~C~~~~~---------~c~~----~~C~~~~~C~~~~~~~~~~~C~C~~g~~----g~~C~~~~~~c~~ 63 (178)
+|.|.+||.|+.|+-..+ .|.. .+|+..|.|.-+ .|.|.+... |..|+.+--.|..
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG-------qC~C~~~~~~~i~G~fCECDnfsC~r 551 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG-------QCVCHKPDNGKIYGKFCECDNFSCER 551 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC-------ceEecCCCCCceeeeeeeccCccccc
Confidence 689999999999986422 2321 269999999873 599988776 8888865555554
Q ss_pred C---CCC-CceEeeCCCCeeeeCCCCCcCCCcccc--CCCcCCC---CCCCCCeEecCCCCeeeecCCC-cccCCccc
Q psy8281 64 S---PCG-HGICVDQTDGYRCYCQPGYSGEQCQYE--YNECESS---PCLNGGSCSDHVGRFSCTCGHG-YTGQRCQI 131 (178)
Q Consensus 64 ~---~c~-~~~c~~~~~~~~C~c~~g~~g~~c~~~--~~~c~~~---~c~~~~~c~~~~~~~~C~C~~g-~~g~~c~~ 131 (178)
. .|. ||+|.-. +|.|.+||+|..|+-+ .+.|... .|...+.|.-. +|.|... |.|..|+.
T Consensus 552 ~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~ 621 (783)
T KOG1226|consen 552 HKGVLCGGHGRCECG----RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK 621 (783)
T ss_pred ccCcccCCCCeEeCC----cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence 3 353 6777544 6899999999988643 2344322 25555555432 3677655 88988873
No 13
>KOG1226|consensus
Probab=98.55 E-value=1.3e-06 Score=68.13 Aligned_cols=129 Identities=29% Similarity=0.774 Sum_probs=81.6
Q ss_pred CCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCCC---------CCCC----CCCC-CceEeeCCCCeeeeCCCCCc---
Q psy8281 25 CQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNLD---------DCAS----SPCG-HGICVDQTDGYRCYCQPGYS--- 87 (178)
Q Consensus 25 C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~---------~c~~----~~c~-~~~c~~~~~~~~C~c~~g~~--- 87 (178)
|..+|.... ..|.|.+||.|+.|+...+ .|.. ..|. +|.|.-. .|.|.+...
T Consensus 469 C~g~G~~~C-------G~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i 537 (783)
T KOG1226|consen 469 CHGNGTFVC-------GQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKI 537 (783)
T ss_pred cCCCCcEEe-------cceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCce
Confidence 555555544 2689999999999874322 1211 1232 3445433 467776654
Q ss_pred -CCCccccCCCcC---CCCCCCCCeEecCCCCeeeecCCCcccCCcc--ccCCCCCC---CCCCCCCeeeeCCCceeeeC
Q psy8281 88 -GEQCQYEYNECE---SSPCLNGGSCSDHVGRFSCTCGHGYTGQRCQ--IKVDLCDP---NPCSHHHYCVDKGNTFACEC 158 (178)
Q Consensus 88 -g~~c~~~~~~c~---~~~c~~~~~c~~~~~~~~C~C~~g~~g~~c~--~~~~~c~~---~~c~~~~~c~~~~~~~~C~C 158 (178)
|+.|+-+.-.|. ...|..++.|.-. +|+|.+||+|..|. .+.+.|.. ..|..+++|.-. +|.|
T Consensus 538 ~G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C 609 (783)
T KOG1226|consen 538 YGKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKC 609 (783)
T ss_pred eeeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEc
Confidence 777763333332 2347777877643 49999999999876 34455542 247777877665 7888
Q ss_pred CCC-CCCCCCCCCCe
Q psy8281 159 PKG-YQGPNCDVPGI 172 (178)
Q Consensus 159 ~~g-~~g~~C~~~~~ 172 (178)
... |.|..||....
T Consensus 610 ~~~~~sG~~CE~cpt 624 (783)
T KOG1226|consen 610 TDPPYSGEFCEKCPT 624 (783)
T ss_pred CCCCcCcchhhcCCC
Confidence 865 99999987543
No 14
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.42 E-value=6.4e-07 Score=43.98 Aligned_cols=36 Identities=44% Similarity=1.185 Sum_probs=29.8
Q ss_pred CCCCCC-CCCCCCCeeeeCCCceeeeCCCCCC-CCCCC
Q psy8281 133 VDLCDP-NPCSHHHYCVDKGNTFACECPKGYQ-GPNCD 168 (178)
Q Consensus 133 ~~~c~~-~~c~~~~~c~~~~~~~~C~C~~g~~-g~~C~ 168 (178)
+++|.. .+|..++.|++..++|.|.|++||. |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 456665 6788888999999999999999999 88874
No 15
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.37 E-value=2.3e-07 Score=43.40 Aligned_cols=31 Identities=42% Similarity=1.148 Sum_probs=26.0
Q ss_pred CCCCCCCCCCeeeeCC-CceeeeCCCCCCCCC
Q psy8281 136 CDPNPCSHHHYCVDKG-NTFACECPKGYQGPN 166 (178)
Q Consensus 136 c~~~~c~~~~~c~~~~-~~~~C~C~~g~~g~~ 166 (178)
|..++|.++++|+... +.|.|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 3456899999999888 889999999999864
No 16
>KOG0994|consensus
Probab=98.32 E-value=3.6e-06 Score=68.37 Aligned_cols=163 Identities=28% Similarity=0.675 Sum_probs=84.5
Q ss_pred CCCCCcCCCCCCCCCCCCCCCCCCCC--------CeeccCCCCCCceeeeCCCCCCCCCCCCCCC----------CCC--
Q psy8281 3 GVRCGFTGKTCEDTSDPCESGPCQNG--------GSCAASNLTAQQFKCLCPPGFSGSLCQHNLD----------DCA-- 62 (178)
Q Consensus 3 ~c~~g~~G~~C~~~~~~c~~~~C~~~--------~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~----------~c~-- 62 (178)
+|..||+|..=-...+.|.+-||..+ -.|...+ ......|+|.+||+|..|+...+ .|+
T Consensus 889 rCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~-~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~C 967 (1758)
T KOG0994|consen 889 RCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDT-RTQQIVCHCQEGYSGSRCEICADNHFGNPSEGGTCQKC 967 (1758)
T ss_pred hhhccccCCcccCCCCCCCCCCCCCCCccchhccccccccc-cccceeeecccCccccchhhhcccccCCcccCCccccc
Confidence 46778887654444455666555432 2343332 33456899999999988863111 111
Q ss_pred ----------CCCCC--Cc---eEeeCCCCeee-eCCCCCcCCCccccCCCcC--CCCCCCCCeEecCCCCeeeecCCCc
Q psy8281 63 ----------SSPCG--HG---ICVDQTDGYRC-YCQPGYSGEQCQYEYNECE--SSPCLNGGSCSDHVGRFSCTCGHGY 124 (178)
Q Consensus 63 ----------~~~c~--~~---~c~~~~~~~~C-~c~~g~~g~~c~~~~~~c~--~~~c~~~~~c~~~~~~~~C~C~~g~ 124 (178)
+..|. .| .|...+.+.+| .|..||.|..-......|. ..-.....+|....|. |.|.+..
T Consensus 968 eC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQ--CpClpNv 1045 (1758)
T KOG0994|consen 968 ECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQ--CPCLPNV 1045 (1758)
T ss_pred cccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhheccccccCCccccccccCc--CCCCccc
Confidence 11121 12 12222223355 5888887653221222221 1111122456655554 8888888
Q ss_pred ccCCccc---------cCCCCCCCCCCCC--CeeeeCCCceeeeCCCCCCCCCCCCC
Q psy8281 125 TGQRCQI---------KVDLCDPNPCSHH--HYCVDKGNTFACECPKGYQGPNCDVP 170 (178)
Q Consensus 125 ~g~~c~~---------~~~~c~~~~c~~~--~~c~~~~~~~~C~C~~g~~g~~C~~~ 170 (178)
.|..|+. ....|.+-.|+.. -.|....| +|.|.|||-|..|+..
T Consensus 1046 ~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C~qC 1100 (1758)
T KOG0994|consen 1046 QGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGRTCSQC 1100 (1758)
T ss_pred ccccccccccchhccccCCCCCccCCCccCCcccccccc--ceeccCCCCCcchhHH
Confidence 8887752 1122332233321 13433334 8999999999988643
No 17
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, with disulphide bonds linking the first and third, the second and fourth, and the fifth and sixth cysteines; 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.25 E-value=8.3e-07 Score=44.37 Aligned_cols=32 Identities=31% Similarity=1.015 Sum_probs=27.2
Q ss_pred cCCCCCC--CCCCCCCeeeeCCCceeeeCCCCCC
Q psy8281 132 KVDLCDP--NPCSHHHYCVDKGNTFACECPKGYQ 163 (178)
Q Consensus 132 ~~~~c~~--~~c~~~~~c~~~~~~~~C~C~~g~~ 163 (178)
|+++|+. ..|...+.|+|+.|+|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4677764 4588889999999999999999997
No 18
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.22 E-value=4.9e-07 Score=42.29 Aligned_cols=31 Identities=48% Similarity=1.299 Sum_probs=25.6
Q ss_pred CCCCCCCCCCeeccCCCCCCceeeeCCCCCCCC
Q psy8281 20 CESGPCQNGGSCAASNLTAQQFKCLCPPGFSGS 52 (178)
Q Consensus 20 c~~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~ 52 (178)
|..++|.++|+|+... .+.|.|.|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~--~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLP--GGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEES--TSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCC--CCCEEeECCCCCccC
Confidence 4567899999999832 388999999999986
No 19
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.18 E-value=4.5e-06 Score=40.52 Aligned_cols=36 Identities=42% Similarity=1.154 Sum_probs=29.0
Q ss_pred CCCCCC-CCCCCCCeeeeCCCceeeeCCCCCCCCCCC
Q psy8281 133 VDLCDP-NPCSHHHYCVDKGNTFACECPKGYQGPNCD 168 (178)
Q Consensus 133 ~~~c~~-~~c~~~~~c~~~~~~~~C~C~~g~~g~~C~ 168 (178)
+++|.. .+|..++.|++..+.|.|.|++||.|..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 345554 578778889999999999999999998774
No 20
>KOG4260|consensus
Probab=98.17 E-value=4e-06 Score=57.81 Aligned_cols=117 Identities=31% Similarity=0.812 Sum_probs=74.1
Q ss_pred eCCCCCCCCCCCCCCCCCCCCCCC-CceEee---CCCCeeeeCCCCCcCCCccccCCC--c------CCCC---CCC--C
Q psy8281 44 LCPPGFSGSLCQHNLDDCASSPCG-HGICVD---QTDGYRCYCQPGYSGEQCQYEYNE--C------ESSP---CLN--G 106 (178)
Q Consensus 44 ~C~~g~~g~~C~~~~~~c~~~~c~-~~~c~~---~~~~~~C~c~~g~~g~~c~~~~~~--c------~~~~---c~~--~ 106 (178)
-|+.|-.|.+|.. ...-...+|. +|.|.. ..|+-.|.|..||.|+.|. .... - .... |+. .
T Consensus 131 CCp~gtyGpdCl~-Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-~Cg~eyfes~Rne~~lvCt~Ch~~C~ 208 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQ-CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-YCGIEYFESSRNEQHLVCTACHEGCL 208 (350)
T ss_pred ccCCCCcCCcccc-CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-ccchHHHHhhcccccchhhhhhhhhh
Confidence 4888888998873 2222234564 688863 2356689999999999996 1100 0 0000 111 1
Q ss_pred CeEecCCCCeee-ecCCCcc--cCCccccCCCCC--CCCCCCCCeeeeCCCceeeeCCCCCCC
Q psy8281 107 GSCSDHVGRFSC-TCGHGYT--GQRCQIKVDLCD--PNPCSHHHYCVDKGNTFACECPKGYQG 164 (178)
Q Consensus 107 ~~c~~~~~~~~C-~C~~g~~--g~~c~~~~~~c~--~~~c~~~~~c~~~~~~~~C~C~~g~~g 164 (178)
+.|.-. +...| .|..||. -..| .|+++|. +.+|..++.|+|+.|+|.|.+.+||.+
T Consensus 209 ~~Csg~-~~k~C~kCkkGW~lde~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 209 GVCSGE-SSKGCSKCKKGWKLDEEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred cccCCC-CCCChhhhcccceeccccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 123211 11223 4778886 2345 4888886 567999999999999999999999975
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.03 E-value=1.3e-05 Score=39.15 Aligned_cols=34 Identities=53% Similarity=1.400 Sum_probs=28.0
Q ss_pred CCCCC-CCCCCCCeeccCCCCCCceeeeCCCCCC-CCCC
Q psy8281 18 DPCES-GPCQNGGSCAASNLTAQQFKCLCPPGFS-GSLC 54 (178)
Q Consensus 18 ~~c~~-~~C~~~~~C~~~~~~~~~~~C~C~~g~~-g~~C 54 (178)
++|.. .+|.++++|.+ ..++|.|.|++||. |..|
T Consensus 3 ~~C~~~~~C~~~~~C~~---~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 3 DECASGNPCQNGGTCVN---TVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred ccCcCCCCcCCCCEeEC---CCCCeEeECCCCCccCCcC
Confidence 55665 67988889987 67789999999998 7766
No 22
>KOG0994|consensus
Probab=98.02 E-value=2.3e-05 Score=63.92 Aligned_cols=101 Identities=28% Similarity=0.686 Sum_probs=55.7
Q ss_pred eEeeCCCCeeeeCCCCCcCCCccccC---------CCcCCCCCCC--CCeEecCCCCeeeecCCCcccCCccc-------
Q psy8281 70 ICVDQTDGYRCYCQPGYSGEQCQYEY---------NECESSPCLN--GGSCSDHVGRFSCTCGHGYTGQRCQI------- 131 (178)
Q Consensus 70 ~c~~~~~~~~C~c~~g~~g~~c~~~~---------~~c~~~~c~~--~~~c~~~~~~~~C~C~~g~~g~~c~~------- 131 (178)
.|....| .|-|.+...|..|..-. ..|.+-.|.. ...|....| .|.|++||-|..|..
T Consensus 1031 ~CDr~tG--QCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C~qCqel~WG 1106 (1758)
T KOG0994|consen 1031 HCDRFTG--QCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGRTCSQCQELYWG 1106 (1758)
T ss_pred ccccccC--cCCCCcccccccccccccchhccccCCCCCccCCCccCCcccccccc--ceeccCCCCCcchhHHHHhhcC
Confidence 3444444 57777777776654100 1122212222 123444444 489999998887752
Q ss_pred -cCCCCCCCCCCCCC----eeeeCCCceeeeCCCCCCCCCCCCCCeeeee
Q psy8281 132 -KVDLCDPNPCSHHH----YCVDKGNTFACECPKGYQGPNCDVPGIVFYL 176 (178)
Q Consensus 132 -~~~~c~~~~c~~~~----~c~~~~~~~~C~C~~g~~g~~C~~~~~~~~~ 176 (178)
....|..-.|...+ .|....| +|+|.+|..|++|+....+|.-
T Consensus 1107 dP~~~C~aCdCd~rG~~tpQCdr~tG--~C~C~~Gv~G~rCdqCaRgy~G 1154 (1758)
T KOG0994|consen 1107 DPNEKCRACDCDPRGIETPQCDRATG--RCVCRPGVGGPRCDQCARGYSG 1154 (1758)
T ss_pred CCCCCceecCCCCCCCCCCCccccCC--ceeecCCCCCcchhhhhhhhcC
Confidence 11123222344333 2444444 8999999999999887776643
No 23
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.84 E-value=4.3e-05 Score=36.36 Aligned_cols=29 Identities=38% Similarity=1.235 Sum_probs=23.9
Q ss_pred CCCCCCCeeeeCCCceeeeCCCCCCC-CCCC
Q psy8281 139 NPCSHHHYCVDKGNTFACECPKGYQG-PNCD 168 (178)
Q Consensus 139 ~~c~~~~~c~~~~~~~~C~C~~g~~g-~~C~ 168 (178)
.+|.++ .|++..+.|.|.|++||.| ..|+
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence 467776 8998889999999999999 6663
No 24
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.84 E-value=4.8e-05 Score=36.20 Aligned_cols=31 Identities=42% Similarity=1.154 Sum_probs=25.6
Q ss_pred CCCCCCCCeeeeCCCceeeeCCCCCCCC-CCC
Q psy8281 138 PNPCSHHHYCVDKGNTFACECPKGYQGP-NCD 168 (178)
Q Consensus 138 ~~~c~~~~~c~~~~~~~~C~C~~g~~g~-~C~ 168 (178)
..+|..++.|++..+.|.|.|++||.|. .|+
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~ 36 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence 4567777889998889999999999998 653
No 25
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.76 E-value=7.2e-05 Score=36.10 Aligned_cols=34 Identities=53% Similarity=1.406 Sum_probs=27.1
Q ss_pred CCCCC-CCCCCCCeeccCCCCCCceeeeCCCCCCCCCC
Q psy8281 18 DPCES-GPCQNGGSCAASNLTAQQFKCLCPPGFSGSLC 54 (178)
Q Consensus 18 ~~c~~-~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C 54 (178)
++|.. .+|.+++.|.+ ..+.|.|.|..||.|..|
T Consensus 3 ~~C~~~~~C~~~~~C~~---~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 3 DECASGNPCQNGGTCVN---TVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred ccCCCCCCcCCCCEeEC---CCCCeEeECCCCCcCCcC
Confidence 44655 67888889987 667899999999998766
No 26
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.58 E-value=0.00019 Score=34.03 Aligned_cols=30 Identities=50% Similarity=1.455 Sum_probs=24.4
Q ss_pred CCCCCCCCeeccCCCCCCceeeeCCCCCCCC-CC
Q psy8281 22 SGPCQNGGSCAASNLTAQQFKCLCPPGFSGS-LC 54 (178)
Q Consensus 22 ~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~-~C 54 (178)
..+|.+++.|.+ ..+.|.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~---~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVN---TPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEec---CCCCeEeECCCCCcccCCc
Confidence 466888899987 5678999999999887 54
No 27
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.53 E-value=0.00018 Score=33.46 Aligned_cols=26 Identities=46% Similarity=1.124 Sum_probs=21.4
Q ss_pred CCCCCCeeeeCCCceeeeCCCCCCCCCC
Q psy8281 140 PCSHHHYCVDKGNTFACECPKGYQGPNC 167 (178)
Q Consensus 140 ~c~~~~~c~~~~~~~~C~C~~g~~g~~C 167 (178)
.|.++++|+...+ +|.|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~~g--~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCG--RCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence 5888899986633 8999999999876
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.50 E-value=0.00025 Score=33.62 Aligned_cols=31 Identities=52% Similarity=1.443 Sum_probs=24.1
Q ss_pred CCC-CCCCCCCeeccCCCCCCceeeeCCCCCCC-CCC
Q psy8281 20 CES-GPCQNGGSCAASNLTAQQFKCLCPPGFSG-SLC 54 (178)
Q Consensus 20 c~~-~~C~~~~~C~~~~~~~~~~~C~C~~g~~g-~~C 54 (178)
|.. .+|.++ .|.+ ..+.|.|.|++||.| ..|
T Consensus 2 C~~~~~C~~~-~C~~---~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 2 CASGGPCSNG-TCIN---TPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCcCCCCCC-EEEC---CCCCeEeECCCCCccCCcc
Confidence 444 578777 8887 577899999999998 555
No 29
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.49 E-value=3.5e-05 Score=28.11 Aligned_cols=13 Identities=62% Similarity=1.855 Sum_probs=9.1
Q ss_pred eeeCCCCCCCCCC
Q psy8281 155 ACECPKGYQGPNC 167 (178)
Q Consensus 155 ~C~C~~g~~g~~C 167 (178)
+|.|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4778888888776
No 30
>KOG1836|consensus
Probab=97.49 E-value=0.0025 Score=55.51 Aligned_cols=53 Identities=36% Similarity=0.831 Sum_probs=37.5
Q ss_pred CCCCCcCCCCCCCCCCCCCCCCCCCCCeeccCCCCCCceeee-CCCCCCCCCCCC
Q psy8281 3 GVRCGFTGKTCEDTSDPCESGPCQNGGSCAASNLTAQQFKCL-CPPGFSGSLCQH 56 (178)
Q Consensus 3 ~c~~g~~G~~C~~~~~~c~~~~C~~~~~C~~~~~~~~~~~C~-C~~g~~g~~C~~ 56 (178)
+|.+||+|..=......|.+-+|...+.|.... ....+.|. |++||+|..|+.
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~-~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTP-EILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCCCCCChhhcCcC-cccceecCCCCCCCccccccc
Confidence 477777776543333338888888888886632 24567897 999999999874
No 31
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, with disulphide bonds linking the first and third, the second and fourth, and the fifth and sixth cysteines; 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.48 E-value=5.5e-05 Score=37.69 Aligned_cols=31 Identities=35% Similarity=1.060 Sum_probs=24.0
Q ss_pred CCCCCC--CCCCCCCeeccCCCCCCceeeeCCCCCC
Q psy8281 17 SDPCES--GPCQNGGSCAASNLTAQQFKCLCPPGFS 50 (178)
Q Consensus 17 ~~~c~~--~~C~~~~~C~~~~~~~~~~~C~C~~g~~ 50 (178)
+++|.. ..|..++.|++ +.++|.|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C~N---~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVN---TEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEE---ETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEc---CCCCEEeeCCCCcE
Confidence 355654 35877889988 78899999999987
No 32
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.47 E-value=8e-05 Score=35.64 Aligned_cols=27 Identities=30% Similarity=0.977 Sum_probs=21.0
Q ss_pred CCCCCCeeeeCCCceeeeCCCCCCCCC
Q psy8281 140 PCSHHHYCVDKGNTFACECPKGYQGPN 166 (178)
Q Consensus 140 ~c~~~~~c~~~~~~~~C~C~~g~~g~~ 166 (178)
.|+.++.|+++.++|.|.|++||.|..
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCC
Confidence 478889999999999999999998864
No 33
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.21 E-value=0.00041 Score=29.84 Aligned_cols=19 Identities=42% Similarity=1.412 Sum_probs=12.7
Q ss_pred ceeeeCCCCCC----CCCCCCCCC
Q psy8281 40 QFKCLCPPGFS----GSLCQHNLD 59 (178)
Q Consensus 40 ~~~C~C~~g~~----g~~C~~~~~ 59 (178)
+|+|.|++||. |..|+ +|+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~-DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE-DID 23 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc-cCC
Confidence 47888888887 44555 444
No 34
>KOG1836|consensus
Probab=97.16 E-value=0.0013 Score=57.17 Aligned_cols=57 Identities=26% Similarity=0.644 Sum_probs=39.1
Q ss_pred ecCCCcccCCccccCCCCCCCCCCCCCeeeeCC--Cceeee-CCCCCCCCCCCCCCeeee
Q psy8281 119 TCGHGYTGQRCQIKVDLCDPNPCSHHHYCVDKG--NTFACE-CPKGYQGPNCDVPGIVFY 175 (178)
Q Consensus 119 ~C~~g~~g~~c~~~~~~c~~~~c~~~~~c~~~~--~~~~C~-C~~g~~g~~C~~~~~~~~ 175 (178)
+|..||.|..-......|.+-+|...+.|..+. ....|. |++||+|.+|+.++.++|
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyf 819 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYF 819 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccc
Confidence 466777655432111226666777777665443 456897 999999999999998876
No 35
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.08 E-value=0.00098 Score=30.94 Aligned_cols=27 Identities=30% Similarity=0.950 Sum_probs=21.7
Q ss_pred CCCCCCCeeccCCCCCCceeeeCCCCCCCCCC
Q psy8281 23 GPCQNGGSCAASNLTAQQFKCLCPPGFSGSLC 54 (178)
Q Consensus 23 ~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C 54 (178)
..|.++|+|+.. ..+|.|.+||.|..|
T Consensus 6 ~~C~~~G~C~~~-----~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP-----CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC-----CCEEECCCCCcCCCC
Confidence 458999999852 347999999999875
No 36
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.98 E-value=0.00034 Score=33.47 Aligned_cols=28 Identities=32% Similarity=0.972 Sum_probs=20.8
Q ss_pred CCCCCCCeeccCCCCCCceeeeCCCCCCCCC
Q psy8281 23 GPCQNGGSCAASNLTAQQFKCLCPPGFSGSL 53 (178)
Q Consensus 23 ~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~ 53 (178)
..|+.++.|.+ ++++|.|.|++||.|+.
T Consensus 6 ~~C~~nA~C~~---~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 6 GGCHPNATCTN---TGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GGS-TTCEEEE----TTSEEEEE-CEEECCS
T ss_pred CCCCCCcEeec---CCCCEEeECCCCCccCC
Confidence 35888899998 66799999999998763
No 37
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.57 E-value=0.0011 Score=43.67 Aligned_cols=124 Identities=27% Similarity=0.767 Sum_probs=69.5
Q ss_pred CCCceeeeCCCCCC---CCCCCCCCCCCCC-----CCCC-CceEeeCC-----CCeeeeCCCCCcCC--CccccCCCcCC
Q psy8281 37 TAQQFKCLCPPGFS---GSLCQHNLDDCAS-----SPCG-HGICVDQT-----DGYRCYCQPGYSGE--QCQYEYNECES 100 (178)
Q Consensus 37 ~~~~~~C~C~~g~~---g~~C~~~~~~c~~-----~~c~-~~~c~~~~-----~~~~C~c~~g~~g~--~c~~~~~~c~~ 100 (178)
....|.|.|.+||. .+.|+. ..+|.. .+|. .+.|.+.. ..+.|.|.+||.-. .|. ...|..
T Consensus 16 MSNHfEC~Cnegfvl~~EntCE~-kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCv--p~~C~~ 92 (197)
T PF06247_consen 16 MSNHFECKCNEGFVLKNENTCEE-KVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCV--PNKCNN 92 (197)
T ss_dssp ESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEE--EGGGSS
T ss_pred ccCceEEEcCCCcEEcccccccc-ceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEc--hhhcCc
Confidence 34568999999996 456663 334432 2454 37887654 46899999999733 332 244555
Q ss_pred CCCCCCCeEecC---CCCeeeecCCCcc---cCCccccC-CCCCCCCCCCCCeeeeCCCceeeeCCCCCCCC
Q psy8281 101 SPCLNGGSCSDH---VGRFSCTCGHGYT---GQRCQIKV-DLCDPNPCSHHHYCVDKGNTFACECPKGYQGP 165 (178)
Q Consensus 101 ~~c~~~~~c~~~---~~~~~C~C~~g~~---g~~c~~~~-~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~ 165 (178)
..|. .|.|+.. .....|.|.-|.. ...|..+. ..|+ -.|..+..|....+-|+|.+.+++.+.
T Consensus 93 ~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~ 162 (197)
T PF06247_consen 93 KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVDGYYKCVCKEGFPGD 162 (197)
T ss_dssp ---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEEE-TT-EEE
T ss_pred eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeCcEEEeecCCCCCCC
Confidence 5565 5667522 2234799988876 22343221 1232 247777889999999999999998643
No 38
>smart00051 DSL delta serrate ligand.
Probab=96.30 E-value=0.0081 Score=32.68 Aligned_cols=44 Identities=18% Similarity=0.437 Sum_probs=28.8
Q ss_pred CCCCCCcCCCCCCCCCCCCCC-CCCCCCCeeccCCCCCCceeeeCCCCCCCCCC
Q psy8281 2 FGVRCGFTGKTCEDTSDPCES-GPCQNGGSCAASNLTAQQFKCLCPPGFSGSLC 54 (178)
Q Consensus 2 ~~c~~g~~G~~C~~~~~~c~~-~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C 54 (178)
..|.++|+|..|... |.+ .....+.+|.. . ..+.|.+||+|..|
T Consensus 19 v~C~~~~yG~~C~~~---C~~~~d~~~~~~Cd~---~---G~~~C~~Gw~G~~C 63 (63)
T smart00051 19 VTCDENYYGEGCNKF---CRPRDDFFGHYTCDE---N---GNKGCLEGWMGPYC 63 (63)
T ss_pred eeCCCCCcCCccCCE---eCcCccccCCccCCc---C---CCEecCCCCcCCCC
Confidence 357888888888643 432 23556677743 1 24788888888765
No 39
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.26 E-value=0.0025 Score=30.42 Aligned_cols=20 Identities=40% Similarity=1.218 Sum_probs=16.8
Q ss_pred CCeeeeCCCceeeeCCCCCC
Q psy8281 144 HHYCVDKGNTFACECPKGYQ 163 (178)
Q Consensus 144 ~~~c~~~~~~~~C~C~~g~~ 163 (178)
.+.|++.+++|+|.|++||.
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp SSEEEEETTSEEEE-STTEE
T ss_pred CCCCccCCCceEeECCCCCE
Confidence 46799999999999999985
No 40
>smart00051 DSL delta serrate ligand.
Probab=96.14 E-value=0.013 Score=31.85 Aligned_cols=45 Identities=31% Similarity=0.832 Sum_probs=29.0
Q ss_pred eeecCCCcccCCccccCCCCCC-CCCCCCCeeeeCCCceeeeCCCCCCCCCC
Q psy8281 117 SCTCGHGYTGQRCQIKVDLCDP-NPCSHHHYCVDKGNTFACECPKGYQGPNC 167 (178)
Q Consensus 117 ~C~C~~g~~g~~c~~~~~~c~~-~~c~~~~~c~~~~~~~~C~C~~g~~g~~C 167 (178)
+-+|.++|.|..|. ..|.+ +....+..|.. .| .++|.+||+|+.|
T Consensus 18 rv~C~~~~yG~~C~---~~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCN---KFCRPRDDFFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccC---CEeCcCccccCCccCCc-CC--CEecCCCCcCCCC
Confidence 45688888888886 23332 12344555643 23 6889999999876
No 41
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=95.54 E-value=0.021 Score=35.24 Aligned_cols=34 Identities=29% Similarity=0.709 Sum_probs=25.3
Q ss_pred CCCCCCeeeeCC--CceeeeCCCCCCCCCCCCCCeee
Q psy8281 140 PCSHHHYCVDKG--NTFACECPKGYQGPNCDVPGIVF 174 (178)
Q Consensus 140 ~c~~~~~c~~~~--~~~~C~C~~g~~g~~C~~~~~~~ 174 (178)
.|.++ .|.... ..+.|+|..||+|.+||..++-.
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~dLl~ 87 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVVLVD 87 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCcccccccceeeee
Confidence 46654 776443 56789999999999999877533
No 42
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Probab=95.51 E-value=0.02 Score=29.52 Aligned_cols=22 Identities=32% Similarity=0.665 Sum_probs=18.7
Q ss_pred eeeCCCCCCCCCCCCCCeeeee
Q psy8281 155 ACECPKGYQGPNCDVPGIVFYL 176 (178)
Q Consensus 155 ~C~C~~g~~g~~C~~~~~~~~~ 176 (178)
+|.|+++|+|.+|+....++|.
T Consensus 20 ~C~C~~~~~G~~C~~C~~g~~~ 41 (50)
T cd00055 20 QCECKPNTTGRRCDRCAPGYYG 41 (50)
T ss_pred EEeCCCcCCCCCCCCCCCCCcc
Confidence 8999999999999988777663
No 43
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. The consensus pattern of the LE domain, whose eight conserved cysteines form four disulphide bonds, is xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx; 'C': conserved cysteine involved in a disulphide bond; 'a': conserved aromatic residue; 'G': conserved glycine (lower case = less conserved). The N-terminal part of the pattern is the region similar to the EGF-like domain. In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding-sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=95.45 E-value=0.0071 Score=31.08 Aligned_cols=29 Identities=28% Similarity=0.565 Sum_probs=20.2
Q ss_pred eeeeCCCceeeeCCCCCCCCCCCCCCeeeee
Q psy8281 146 YCVDKGNTFACECPKGYQGPNCDVPGIVFYL 176 (178)
Q Consensus 146 ~c~~~~~~~~C~C~~g~~g~~C~~~~~~~~~ 176 (178)
.|.... .+|.|+++|+|.+|+....++|.
T Consensus 12 ~C~~~~--G~C~C~~~~~G~~C~~C~~g~~~ 40 (49)
T PF00053_consen 12 TCDPST--GQCVCKPGTTGPRCDQCKPGYFG 40 (49)
T ss_dssp SEEETC--EEESBSTTEESTTS-EE-TTEEC
T ss_pred cccCCC--CEEeccccccCCcCcCCCCcccc
Confidence 455543 38999999999999987776663
No 44
>PHA02887 EGF-like protein; Provisional
Probab=94.93 E-value=0.035 Score=33.69 Aligned_cols=32 Identities=31% Similarity=0.673 Sum_probs=23.5
Q ss_pred CCCCCCCeeeeCC--CceeeeCCCCCCCCCCCCCC
Q psy8281 139 NPCSHHHYCVDKG--NTFACECPKGYQGPNCDVPG 171 (178)
Q Consensus 139 ~~c~~~~~c~~~~--~~~~C~C~~g~~g~~C~~~~ 171 (178)
+.|- +|.|.... ....|.|+.||+|.+|+..+
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~vs 125 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCDEVS 125 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccCCCCccc
Confidence 3466 46786443 45689999999999998654
No 45
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=94.84 E-value=0.035 Score=28.09 Aligned_cols=22 Identities=32% Similarity=0.677 Sum_probs=18.4
Q ss_pred eeeCCCCCCCCCCCCCCeeeee
Q psy8281 155 ACECPKGYQGPNCDVPGIVFYL 176 (178)
Q Consensus 155 ~C~C~~g~~g~~C~~~~~~~~~ 176 (178)
+|.|+++|+|.+|+....++|.
T Consensus 19 ~C~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 19 QCECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred EEECCCCCCCCCCCcCCCCcCC
Confidence 8899999999999987777663
No 46
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.95 E-value=0.093 Score=36.35 Aligned_cols=39 Identities=26% Similarity=0.671 Sum_probs=28.5
Q ss_pred cCCccccCCCCCCCCCCCCCeeeeCCCceeeeCCCCCCCC
Q psy8281 126 GQRCQIKVDLCDPNPCSHHHYCVDKGNTFACECPKGYQGP 165 (178)
Q Consensus 126 g~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g~ 165 (178)
+..|. +.++|...+....+.|.++.|+|.|.|++||+..
T Consensus 181 ~~~C~-~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICV-VPDLCATLSHVCQQVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCc-CchhhcCCCCCccceEEcCCCCEEeECCCCccCC
Confidence 45565 6677764433334679999999999999999753
No 47
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=93.66 E-value=0.03 Score=37.11 Aligned_cols=121 Identities=26% Similarity=0.708 Sum_probs=67.1
Q ss_pred CCCCCCcC---CCCCCCCCCCCC-----CCCCCCCCeeccCC--CCCCceeeeCCCCCCCC--CCCCCCCCCCCCCCCCc
Q psy8281 2 FGVRCGFT---GKTCEDTSDPCE-----SGPCQNGGSCAASN--LTAQQFKCLCPPGFSGS--LCQHNLDDCASSPCGHG 69 (178)
Q Consensus 2 ~~c~~g~~---G~~C~~~~~~c~-----~~~C~~~~~C~~~~--~~~~~~~C~C~~g~~g~--~C~~~~~~c~~~~c~~~ 69 (178)
|.|..||. -..|+. ...|. ..+|..-+.|.... ..+..|.|.|.+||... .|. ...|....|..|
T Consensus 22 C~Cnegfvl~~EntCE~-kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCv--p~~C~~~~Cg~G 98 (197)
T PF06247_consen 22 CKCNEGFVLKNENTCEE-KVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCV--PNKCNNKDCGSG 98 (197)
T ss_dssp EEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEE--EGGGSS---TTE
T ss_pred EEcCCCcEEcccccccc-ceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEc--hhhcCceecCCC
Confidence 45666664 233443 22343 35688888998743 24567999999999733 342 355666667788
Q ss_pred eEeeCC---CCeeeeCCCCCc---CCCccc-cCCCcCCCCCCCCCeEecCCCCeeeecCCCccc
Q psy8281 70 ICVDQT---DGYRCYCQPGYS---GEQCQY-EYNECESSPCLNGGSCSDHVGRFSCTCGHGYTG 126 (178)
Q Consensus 70 ~c~~~~---~~~~C~c~~g~~---g~~c~~-~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g 126 (178)
.|+..+ ....|+|..|+. ...|.. ..+.|... |..+-.|....+.|+|.+..++.+
T Consensus 99 KCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LK-Ck~nE~CK~~~~~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 99 KCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLK-CKENEECKLVDGYYKCVCKEGFPG 161 (197)
T ss_dssp EEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEEE-TT-EE
T ss_pred eEEecCCCCCCceeEeeeceEeccCCcccCCCccceeee-cCCCcceeeeCcEEEeecCCCCCC
Confidence 887432 334899998886 223321 12344433 677778988899999999998864
No 48
>KOG3512|consensus
Probab=92.92 E-value=0.99 Score=34.60 Aligned_cols=28 Identities=21% Similarity=0.566 Sum_probs=21.6
Q ss_pred eeeeCCCceeeeCCCCCCCCCCCCCCeeee
Q psy8281 146 YCVDKGNTFACECPKGYQGPNCDVPGIVFY 175 (178)
Q Consensus 146 ~c~~~~~~~~C~C~~g~~g~~C~~~~~~~~ 175 (178)
.|..+.| +|.|.+|-+|.+|.....+++
T Consensus 408 tCNq~tG--qCpCkeGvtG~tCnrCa~gyq 435 (592)
T KOG3512|consen 408 TCNQTTG--QCPCKEGVTGLTCNRCAPGYQ 435 (592)
T ss_pred cccccCC--cccCCCCCcccccccccchhh
Confidence 3543444 899999999999998877764
No 49
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=91.44 E-value=0.24 Score=29.69 Aligned_cols=36 Identities=39% Similarity=0.875 Sum_probs=24.5
Q ss_pred CCCCCCCCeeeeCC-----CceeeeCCC-------------CCCCCCCCCCCee
Q psy8281 138 PNPCSHHHYCVDKG-----NTFACECPK-------------GYQGPNCDVPGIV 173 (178)
Q Consensus 138 ~~~c~~~~~c~~~~-----~~~~C~C~~-------------g~~g~~C~~~~~~ 173 (178)
.+.|..|+.|++.. .-|.|.|.+ .|.|+.|+..|..
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkDvS 65 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKDVS 65 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccccccc
Confidence 35577777777662 346777766 5888889877654
No 50
>KOG1218|consensus
Probab=91.35 E-value=4.3 Score=29.52 Aligned_cols=36 Identities=25% Similarity=0.613 Sum_probs=21.9
Q ss_pred eeecCCCcccCCccccCCCCC-CCCCCCCCeeeeCCC
Q psy8281 117 SCTCGHGYTGQRCQIKVDLCD-PNPCSHHHYCVDKGN 152 (178)
Q Consensus 117 ~C~C~~g~~g~~c~~~~~~c~-~~~c~~~~~c~~~~~ 152 (178)
.|.|.+||.+..+......|. ...+.+++.|....+
T Consensus 163 ~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 163 ICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTG 199 (316)
T ss_pred ceeccCCcccccccccCCCcCCCcccCCCCeeecccc
Confidence 478999999888764333343 233555556665544
No 51
>KOG3516|consensus
Probab=89.92 E-value=0.35 Score=40.99 Aligned_cols=37 Identities=38% Similarity=1.092 Sum_probs=32.9
Q ss_pred CCCCCCCCCCCCCeeccCCCCCCceeeeCC-CCCCCCCCCC
Q psy8281 17 SDPCESGPCQNGGSCAASNLTAQQFKCLCP-PGFSGSLCQH 56 (178)
Q Consensus 17 ~~~c~~~~C~~~~~C~~~~~~~~~~~C~C~-~g~~g~~C~~ 56 (178)
.+.|.+++|.++|.|.- .+..|.|.|. .||.|..|..
T Consensus 545 ~drClPN~CehgG~C~Q---s~~~f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQ---SWDDFECNCELTGYKGATCHT 582 (1306)
T ss_pred ccccCCccccCCCcccc---cccceeEeccccccccccccC
Confidence 47799999999999987 6788999998 8999999974
No 52
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=89.64 E-value=0.063 Score=29.18 Aligned_cols=46 Identities=30% Similarity=0.780 Sum_probs=19.2
Q ss_pred eeeecCCCcccCCccccCCCCCCCC-CCCCCeeeeCCCceeeeCCCCCCCCCC
Q psy8281 116 FSCTCGHGYTGQRCQIKVDLCDPNP-CSHHHYCVDKGNTFACECPKGYQGPNC 167 (178)
Q Consensus 116 ~~C~C~~g~~g~~c~~~~~~c~~~~-c~~~~~c~~~~~~~~C~C~~g~~g~~C 167 (178)
++.+|.+.|.|..|.. .|.+.. -..+-.|. ..| .-+|.+||+|+.|
T Consensus 17 ~rv~C~~nyyG~~C~~---~C~~~~d~~ghy~Cd-~~G--~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSK---FCKPRDDSFGHYTCD-SNG--NKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-E---E---EEETTEEEEE--SS----EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccC---CcCCCcCCcCCcccC-CCC--CCCCCCCCcCCCC
Confidence 3457888888888862 232210 11122344 333 4468999999876
No 53
>PHA02887 EGF-like protein; Provisional
Probab=89.09 E-value=0.54 Score=28.71 Aligned_cols=29 Identities=28% Similarity=0.804 Sum_probs=19.8
Q ss_pred CCCCCeeccCCCCCCceeeeCCCCCCCCCCC
Q psy8281 25 CQNGGSCAASNLTAQQFKCLCPPGFSGSLCQ 55 (178)
Q Consensus 25 C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~ 55 (178)
|. ||.|..-. ......|.|..||+|..|+
T Consensus 94 Ci-HG~C~yI~-dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 94 CI-NGECMNII-DLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred ee-CCEEEccc-cCCCceeECCCCcccCCCC
Confidence 65 46776532 3345678888888888886
No 54
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=87.28 E-value=0.45 Score=22.73 Aligned_cols=25 Identities=24% Similarity=0.662 Sum_probs=17.1
Q ss_pred CCCCCCCeeeeCC-CceeeeCCCCCC
Q psy8281 139 NPCSHHHYCVDKG-NTFACECPKGYQ 163 (178)
Q Consensus 139 ~~c~~~~~c~~~~-~~~~C~C~~g~~ 163 (178)
..|..++.|.+.. |...|+|..||.
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 4567778888776 888999999986
No 55
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=86.78 E-value=0.69 Score=28.74 Aligned_cols=30 Identities=33% Similarity=0.935 Sum_probs=20.2
Q ss_pred CCCCCeeccCCCCCCceeeeCCCCCCCCCCCC
Q psy8281 25 CQNGGSCAASNLTAQQFKCLCPPGFSGSLCQH 56 (178)
Q Consensus 25 C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~ 56 (178)
|.++ .|..-. ....+.|.|..||+|..|+.
T Consensus 53 ClHG-~C~yI~-dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 53 CLHG-DCIHAR-DIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred eECC-EEEeec-cCCCceeECCCCcccccccc
Confidence 5553 675522 33567789999999988873
No 56
>KOG3516|consensus
Probab=85.96 E-value=0.83 Score=38.95 Aligned_cols=39 Identities=38% Similarity=0.978 Sum_probs=34.0
Q ss_pred cCCCCCCCCCCCCCeeeeCCCceeeeCC-CCCCCCCCCCC
Q psy8281 132 KVDLCDPNPCSHHHYCVDKGNTFACECP-KGYQGPNCDVP 170 (178)
Q Consensus 132 ~~~~c~~~~c~~~~~c~~~~~~~~C~C~-~g~~g~~C~~~ 170 (178)
-.+.|.+++|.+++.|..+...|.|.|. .||.|..|...
T Consensus 544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred cccccCCccccCCCcccccccceeEeccccccccccccCC
Confidence 4567889999999999988888999999 89999999754
No 57
>KOG3514|consensus
Probab=85.80 E-value=0.75 Score=38.92 Aligned_cols=34 Identities=50% Similarity=1.161 Sum_probs=30.5
Q ss_pred CCCCCCCCCCCeeccCCCCCCceeeeC-CCCCCCCCCC
Q psy8281 19 PCESGPCQNGGSCAASNLTAQQFKCLC-PPGFSGSLCQ 55 (178)
Q Consensus 19 ~c~~~~C~~~~~C~~~~~~~~~~~C~C-~~g~~g~~C~ 55 (178)
.|.++||.++|.|.. .+.++.|.| ..+|.|..|+
T Consensus 625 ~C~~nPC~N~g~C~e---gwNrfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSE---GWNRFICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCccc---cccccccccccCcccCcccc
Confidence 688999999999998 688999999 5689999997
No 58
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system.; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=83.35 E-value=0.48 Score=24.69 Aligned_cols=33 Identities=21% Similarity=0.516 Sum_probs=15.7
Q ss_pred CCCCCCeeccCCC-CCCceeeeCCCCCCCCCCCC
Q psy8281 24 PCQNGGSCAASNL-TAQQFKCLCPPGFSGSLCQH 56 (178)
Q Consensus 24 ~C~~~~~C~~~~~-~~~~~~C~C~~g~~g~~C~~ 56 (178)
+|+.||+...... ..+.-.|.|+.-|.|.+|+.
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~ 51 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCST 51 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-E
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCccc
Confidence 4677777653221 23445688888888888864
No 59
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=82.43 E-value=2.1 Score=29.59 Aligned_cols=37 Identities=22% Similarity=0.556 Sum_probs=25.1
Q ss_pred CCccccCCCcCCCCCCCCCeEecCCCCeeeecCCCccc
Q psy8281 89 EQCQYEYNECESSPCLNGGSCSDHVGRFSCTCGHGYTG 126 (178)
Q Consensus 89 ~~c~~~~~~c~~~~c~~~~~c~~~~~~~~C~C~~g~~g 126 (178)
..|. +.++|...+......|.++.|.|.|.|.+||+.
T Consensus 182 ~~C~-~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 182 KICV-VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred ccCc-CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 3454 455664333233357999999999999999863
No 60
>KOG3514|consensus
Probab=82.25 E-value=1.2 Score=37.83 Aligned_cols=39 Identities=31% Similarity=0.958 Sum_probs=34.0
Q ss_pred CCCCCCCCCCCeeeeCCCceeeeCC-CCCCCCCCCCCCee
Q psy8281 135 LCDPNPCSHHHYCVDKGNTFACECP-KGYQGPNCDVPGIV 173 (178)
Q Consensus 135 ~c~~~~c~~~~~c~~~~~~~~C~C~-~g~~g~~C~~~~~~ 173 (178)
.|..+||.+++.|......|.|.|. .+|.|+.||+..+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE~t~ 664 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCEREATA 664 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccceeee
Confidence 6788999999999999999999997 69999999976553
No 61
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=67.81 E-value=9.1 Score=23.18 Aligned_cols=32 Identities=28% Similarity=0.669 Sum_probs=21.2
Q ss_pred CCCCC-CCCCCCCCeeeeCCCceeeeCCCCCCCC
Q psy8281 133 VDLCD-PNPCSHHHYCVDKGNTFACECPKGYQGP 165 (178)
Q Consensus 133 ~~~c~-~~~c~~~~~c~~~~~~~~C~C~~g~~g~ 165 (178)
.+.|. ...|...+.|... ....|.|.+||..+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSN-NSPKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCCC-CCCceECCCCcCCC
Confidence 34554 4668888888433 34469999998754
No 62
>KOG3607|consensus
Probab=67.20 E-value=4.8 Score=33.18 Aligned_cols=31 Identities=29% Similarity=0.791 Sum_probs=25.2
Q ss_pred CCCCCCeeeeCCCceeeeCCCCCCCCCCCCCCee
Q psy8281 140 PCSHHHYCVDKGNTFACECPKGYQGPNCDVPGIV 173 (178)
Q Consensus 140 ~c~~~~~c~~~~~~~~C~C~~g~~g~~C~~~~~~ 173 (178)
.|+.+++|.|. +.|+|.+||.++.|++...+
T Consensus 631 ~C~g~GVCnn~---~~ChC~~gwapp~C~~~~~~ 661 (716)
T KOG3607|consen 631 TCNGHGVCNNE---LNCHCEPGWAPPFCFIFGYG 661 (716)
T ss_pred ccCCCcccCCC---cceeeCCCCCCCccccccCC
Confidence 47888888665 38999999999999986654
No 63
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=65.33 E-value=11 Score=19.21 Aligned_cols=20 Identities=40% Similarity=1.089 Sum_probs=14.7
Q ss_pred CCCCCCeeeeCCCceeeeCCCCCC
Q psy8281 140 PCSHHHYCVDKGNTFACECPKGYQ 163 (178)
Q Consensus 140 ~c~~~~~c~~~~~~~~C~C~~g~~ 163 (178)
.|..+..|++. +|.|++||+
T Consensus 27 qC~~~s~C~~g----~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNG----RCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcCC----EeECCCCCE
Confidence 35566778664 899999875
No 64
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C.; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=57.42 E-value=10 Score=17.73 Aligned_cols=10 Identities=50% Similarity=1.192 Sum_probs=8.3
Q ss_pred eeeCCCCCCC
Q psy8281 155 ACECPKGYQG 164 (178)
Q Consensus 155 ~C~C~~g~~g 164 (178)
+|.|++||.-
T Consensus 19 ~C~CPeGyIl 28 (34)
T PF09064_consen 19 QCFCPEGYIL 28 (34)
T ss_pred ceeCCCceEe
Confidence 8999999863
No 65
>KOG1218|consensus
Probab=53.45 E-value=84 Score=22.79 Aligned_cols=36 Identities=36% Similarity=0.911 Sum_probs=22.3
Q ss_pred eeeCCCCCcCCCccccCCCcC-CCCCCCCCeEecCCC
Q psy8281 79 RCYCQPGYSGEQCQYEYNECE-SSPCLNGGSCSDHVG 114 (178)
Q Consensus 79 ~C~c~~g~~g~~c~~~~~~c~-~~~c~~~~~c~~~~~ 114 (178)
.|.|.+||.+..+......|. ...+.+++.|....+
T Consensus 163 ~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 163 ICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTG 199 (316)
T ss_pred ceeccCCcccccccccCCCcCCCcccCCCCeeecccc
Confidence 678999999888764443343 223455556665544
No 66
>KOG3509|consensus
Probab=40.68 E-value=92 Score=27.06 Aligned_cols=65 Identities=40% Similarity=0.917 Sum_probs=39.4
Q ss_pred CCCCCCCCCCCCeeccCCCCCCceeeeCCCCCCCCCCCCCCCCCCCCCC--CCceEeeCCCCeeeeCCCC
Q psy8281 18 DPCESGPCQNGGSCAASNLTAQQFKCLCPPGFSGSLCQHNLDDCASSPC--GHGICVDQTDGYRCYCQPG 85 (178)
Q Consensus 18 ~~c~~~~C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~~~~~~c~~~~c--~~~~c~~~~~~~~C~c~~g 85 (178)
+.|...++...+.|.. .+....|.|++||.|..|....+.+...+- ..+++....+.....|.++
T Consensus 407 ~~c~~~p~~~~g~c~p---~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg 473 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPCLQ---TLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG 473 (964)
T ss_pred CccccccCCCCccccc---cccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC
Confidence 5567778888888876 555668999999999988743333332221 1234443333333455555
No 67
>KOG3509|consensus
Probab=37.17 E-value=1.1e+02 Score=26.62 Aligned_cols=60 Identities=32% Similarity=0.654 Sum_probs=30.3
Q ss_pred CCCCCCeEecCCCCeeeecCCCcccCCccccCCCCCCCC-CCCCCeeeeCCCceeeeCCCC
Q psy8281 102 PCLNGGSCSDHVGRFSCTCGHGYTGQRCQIKVDLCDPNP-CSHHHYCVDKGNTFACECPKG 161 (178)
Q Consensus 102 ~c~~~~~c~~~~~~~~C~C~~g~~g~~c~~~~~~c~~~~-c~~~~~c~~~~~~~~C~C~~g 161 (178)
++...+.|........|.|+++|+|..|....+.+...+ -...++|....+.....|.++
T Consensus 413 p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg 473 (964)
T KOG3509|consen 413 PCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG 473 (964)
T ss_pred cCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC
Confidence 344444555555555677777777777653333332211 122334444444445556666
No 68
>KOG3607|consensus
Probab=35.52 E-value=28 Score=28.97 Aligned_cols=25 Identities=32% Similarity=1.084 Sum_probs=18.7
Q ss_pred CCCCCeeccCCCCCCceeeeCCCCCCCCCCC
Q psy8281 25 CQNGGSCAASNLTAQQFKCLCPPGFSGSLCQ 55 (178)
Q Consensus 25 C~~~~~C~~~~~~~~~~~C~C~~g~~g~~C~ 55 (178)
|..+|.|.+ .+.|+|.+||.+.+|.
T Consensus 632 C~g~GVCnn------~~~ChC~~gwapp~C~ 656 (716)
T KOG3607|consen 632 CNGHGVCNN------ELNCHCEPGWAPPFCF 656 (716)
T ss_pred cCCCcccCC------CcceeeCCCCCCCccc
Confidence 777788865 2368888888888876
No 69
>KOG3512|consensus
Probab=28.06 E-value=75 Score=25.06 Aligned_cols=30 Identities=27% Similarity=0.790 Sum_probs=24.1
Q ss_pred eeeeCCCc-eeeeCCCCCCCCCCCCCCeeee
Q psy8281 146 YCVDKGNT-FACECPKGYQGPNCDVPGIVFY 175 (178)
Q Consensus 146 ~c~~~~~~-~~C~C~~g~~g~~C~~~~~~~~ 175 (178)
.|+....+ +.|.|..+-+|+.|++.-.+++
T Consensus 286 ~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~ 316 (592)
T KOG3512|consen 286 RCVMDESSHLTCDCEHNTAGPDCGRCKPFYY 316 (592)
T ss_pred eeeeccCCceEEecccCCCCCCccccccccc
Confidence 47765554 8999999999999998877665
Done!