Query psy7015
Match_columns 284
No_of_seqs 328 out of 1734
Neff 9.7
Searched_HMMs 46136
Date Sat Aug 17 00:37:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7015.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7015hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.7 8.5E-17 1.8E-21 152.9 15.9 90 42-131 1217-1309(2531)
2 KOG1217|consensus 99.7 1.3E-15 2.8E-20 139.5 17.4 235 42-282 105-362 (487)
3 KOG1217|consensus 99.6 1.2E-14 2.6E-19 133.1 17.8 214 44-270 149-389 (487)
4 KOG1219|consensus 99.6 9.5E-15 2.1E-19 143.8 9.8 117 154-273 3859-3977(4289)
5 KOG1219|consensus 99.5 2.8E-14 6.1E-19 140.6 9.6 117 98-235 3860-3977(4289)
6 KOG1214|consensus 99.4 1.6E-12 3.4E-17 119.2 14.0 185 72-269 702-908 (1289)
7 KOG1214|consensus 99.4 9.8E-12 2.1E-16 114.1 15.4 206 46-265 715-947 (1289)
8 KOG4289|consensus 99.4 1.6E-12 3.4E-17 124.5 10.5 88 81-187 1218-1308(2531)
9 KOG1225|consensus 99.3 2.9E-11 6.4E-16 108.8 12.8 131 86-271 235-365 (525)
10 KOG1225|consensus 99.3 5.6E-11 1.2E-15 107.0 14.2 131 48-233 235-365 (525)
11 KOG4260|consensus 99.0 4.2E-10 9E-15 91.2 5.0 165 50-230 131-304 (350)
12 KOG0994|consensus 99.0 2.5E-09 5.4E-14 101.4 9.4 224 40-274 775-1052(1758)
13 KOG4260|consensus 99.0 9.5E-10 2.1E-14 89.1 5.2 163 88-268 131-304 (350)
14 KOG0994|consensus 98.9 6.4E-09 1.4E-13 98.8 10.9 57 213-273 1078-1146(1758)
15 KOG1226|consensus 98.5 9.5E-07 2.1E-11 81.8 10.1 120 143-274 479-621 (783)
16 KOG1226|consensus 98.4 3.3E-06 7.2E-11 78.3 12.7 146 72-253 469-636 (783)
17 KOG1836|consensus 98.2 4.9E-05 1.1E-09 77.9 15.1 57 213-273 953-1021(1705)
18 PF00008 EGF: EGF-like domain 98.1 2.7E-06 5.8E-11 47.1 2.2 29 242-270 2-31 (32)
19 smart00179 EGF_CA Calcium-bind 98.1 8.4E-06 1.8E-10 47.3 4.4 36 64-99 2-39 (39)
20 PF00008 EGF: EGF-like domain 98.0 5.6E-06 1.2E-10 45.8 2.1 31 203-233 1-32 (32)
21 smart00179 EGF_CA Calcium-bind 97.9 1.8E-05 3.9E-10 45.9 4.2 29 206-234 9-38 (39)
22 PF07645 EGF_CA: Calcium-bindi 97.9 5.8E-06 1.3E-10 49.0 1.6 32 63-94 1-34 (42)
23 cd00054 EGF_CA Calcium-binding 97.8 5.3E-05 1.2E-09 43.4 4.3 35 64-98 2-37 (38)
24 PF07645 EGF_CA: Calcium-bindi 97.8 2.2E-05 4.8E-10 46.4 2.5 31 200-230 2-34 (42)
25 cd00054 EGF_CA Calcium-binding 97.6 0.00011 2.3E-09 42.1 4.0 29 244-272 9-37 (38)
26 cd00053 EGF Epidermal growth f 97.4 0.00036 7.8E-09 39.2 4.1 30 69-98 5-35 (36)
27 PF06247 Plasmod_Pvs28: Plasmo 97.4 7.2E-05 1.6E-09 58.1 1.5 135 76-232 11-162 (197)
28 smart00181 EGF Epidermal growt 97.4 0.0004 8.7E-09 39.1 4.0 28 70-98 6-34 (35)
29 cd00053 EGF Epidermal growth f 97.3 0.00038 8.2E-09 39.1 3.9 30 243-272 5-35 (36)
30 PF12947 EGF_3: EGF domain; I 97.3 0.00016 3.5E-09 41.0 2.2 27 244-270 6-32 (36)
31 PF06247 Plasmod_Pvs28: Plasmo 97.3 0.00011 2.4E-09 57.1 1.2 102 165-271 50-163 (197)
32 smart00181 EGF Epidermal growt 97.2 0.00057 1.2E-08 38.4 3.8 28 244-272 6-34 (35)
33 PF07974 EGF_2: EGF-like domai 97.2 0.00052 1.1E-08 37.7 3.2 26 245-272 7-32 (32)
34 KOG1836|consensus 97.0 0.0062 1.3E-07 63.1 11.4 176 87-274 697-925 (1705)
35 PF12947 EGF_3: EGF domain; I 97.0 0.00055 1.2E-08 38.8 2.0 28 206-233 6-33 (36)
36 PF12662 cEGF: Complement Clr- 96.9 0.00071 1.5E-08 34.4 2.0 20 258-278 1-24 (24)
37 PF12662 cEGF: Complement Clr- 96.9 0.00085 1.8E-08 34.1 2.2 11 46-56 1-11 (24)
38 PF12661 hEGF: Human growth fa 96.9 0.00046 1E-08 29.7 1.0 13 260-272 1-13 (13)
39 PF07974 EGF_2: EGF-like domai 96.8 0.002 4.3E-08 35.4 3.2 26 207-234 7-32 (32)
40 smart00051 DSL delta serrate l 95.6 0.02 4.3E-07 36.9 3.7 46 221-272 17-63 (63)
41 KOG3512|consensus 95.6 0.079 1.7E-06 47.3 8.5 87 181-274 372-479 (592)
42 KOG1218|consensus 95.5 1.1 2.3E-05 38.8 15.7 182 44-257 12-199 (316)
43 smart00051 DSL delta serrate l 94.3 0.094 2E-06 33.8 4.1 47 46-98 16-63 (63)
44 PF14670 FXa_inhibition: Coagu 93.9 0.037 8E-07 31.3 1.4 18 39-56 11-28 (36)
45 KOG3512|consensus 93.4 0.19 4.2E-06 44.9 5.8 105 167-274 280-429 (592)
46 KOG1218|consensus 92.9 5.5 0.00012 34.3 15.6 148 45-219 47-199 (316)
47 PF14670 FXa_inhibition: Coagu 92.8 0.096 2.1E-06 29.6 2.0 20 212-231 10-29 (36)
48 PF12946 EGF_MSP1_1: MSP1 EGF 92.5 0.082 1.8E-06 29.8 1.4 26 243-268 4-30 (37)
49 PHA02887 EGF-like protein; Pro 92.3 0.13 2.8E-06 36.9 2.6 28 246-274 94-123 (126)
50 PHA03099 epidermal growth fact 91.9 0.15 3.1E-06 37.3 2.5 28 246-274 53-82 (139)
51 PF12946 EGF_MSP1_1: MSP1 EGF 91.7 0.068 1.5E-06 30.1 0.5 27 68-94 3-30 (37)
52 cd01475 vWA_Matrilin VWA_Matri 89.3 0.46 1E-05 39.1 3.7 38 57-95 181-218 (224)
53 PF00053 Laminin_EGF: Laminin 89.3 0.25 5.5E-06 29.9 1.6 22 251-274 12-33 (49)
54 PHA02887 EGF-like protein; Pro 88.5 0.51 1.1E-05 34.0 2.9 28 72-100 94-123 (126)
55 cd00055 EGF_Lam Laminin-type e 87.9 0.54 1.2E-05 28.6 2.4 17 257-273 17-33 (50)
56 PF04863 EGF_alliinase: Alliin 87.8 0.32 7E-06 29.9 1.3 36 244-279 17-56 (56)
57 PF01414 DSL: Delta serrate li 87.7 0.16 3.5E-06 32.7 -0.0 47 46-98 16-63 (63)
58 PHA03099 epidermal growth fact 85.9 0.78 1.7E-05 33.6 2.6 28 72-100 53-82 (139)
59 cd01475 vWA_Matrilin VWA_Matri 84.0 1.6 3.5E-05 35.8 4.2 38 152-190 181-218 (224)
60 PF00053 Laminin_EGF: Laminin 82.3 1.1 2.3E-05 27.1 1.9 22 212-235 11-32 (49)
61 smart00180 EGF_Lam Laminin-typ 80.2 1.6 3.4E-05 26.1 2.0 16 258-273 17-32 (46)
62 cd00055 EGF_Lam Laminin-type e 79.9 1.9 4.1E-05 26.2 2.4 20 214-235 14-33 (50)
63 KOG3516|consensus 78.4 2 4.2E-05 43.2 3.1 39 63-101 544-583 (1306)
64 KOG3516|consensus 75.5 2.4 5.2E-05 42.6 2.8 41 238-278 545-586 (1306)
65 PF09064 Tme5_EGF_like: Thromb 67.9 6.4 0.00014 21.7 2.2 22 172-194 11-32 (34)
66 PF00954 S_locus_glycop: S-loc 67.8 7.5 0.00016 27.9 3.4 31 64-95 77-108 (110)
67 KOG3514|consensus 67.3 4 8.7E-05 40.7 2.3 35 66-100 625-660 (1591)
68 KOG3514|consensus 62.7 5.9 0.00013 39.6 2.5 36 240-275 625-661 (1591)
69 PF12955 DUF3844: Domain of un 59.7 9.6 0.00021 27.1 2.5 9 266-274 53-61 (103)
70 KOG3509|consensus 53.4 29 0.00064 34.8 5.5 71 201-272 407-478 (964)
71 PF00954 S_locus_glycop: S-loc 52.9 18 0.00039 25.9 3.2 24 244-268 84-107 (110)
72 PF01683 EB: EB module; Inter 46.4 48 0.001 19.9 3.9 30 151-190 18-47 (52)
73 KOG0196|consensus 45.5 43 0.00094 33.0 5.1 67 208-279 248-328 (996)
74 KOG0196|consensus 38.9 84 0.0018 31.1 5.9 60 47-109 259-332 (996)
75 KOG3509|consensus 23.9 72 0.0016 32.2 2.9 43 240-282 408-450 (964)
76 KOG3607|consensus 22.2 67 0.0015 31.5 2.4 28 245-275 631-658 (716)
77 KOG3607|consensus 20.3 76 0.0017 31.1 2.3 27 71-100 631-657 (716)
No 1
>KOG4289|consensus
Probab=99.73 E-value=8.5e-17 Score=152.88 Aligned_cols=90 Identities=41% Similarity=0.973 Sum_probs=79.9
Q ss_pred cCCCceEEeCCCCCccCCCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCC--CCCCCCCCCCCCEEeeC
Q psy7015 42 AVPSSYTCYCIDGYTGVHCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNV--DECGSNPCQNNGTCHDL 119 (284)
Q Consensus 42 ~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~--~~C~~~~C~~~~~C~~~ 119 (284)
...++++|.|++||+|+.|+..+|.|...||.++|+|....|.|+|.|.+||+|..||.+. ..|.+..|.++++|++.
T Consensus 1217 ~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~ 1296 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNL 1296 (2531)
T ss_pred cccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeec
Confidence 3457889999999999999999999999999999999999999999999999999998643 45788889999999976
Q ss_pred C-CCceEeCCCCC
Q psy7015 120 L-NGFVCSCHPGF 131 (284)
Q Consensus 120 ~-~~~~C~C~~g~ 131 (284)
. +++.|+|+.|-
T Consensus 1297 ~nggf~c~Cp~ge 1309 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYGE 1309 (2531)
T ss_pred CCCceeccCCCcc
Confidence 4 56888999873
No 2
>KOG1217|consensus
Probab=99.69 E-value=1.3e-15 Score=139.54 Aligned_cols=235 Identities=40% Similarity=0.979 Sum_probs=178.7
Q ss_pred cCCCceEEeCCCCCccCCCCCCCCCCCCCC--CCCCCeEecC---CCCeeeeCCCCCcCCCcccCCCCCC--CCCCCCCC
Q psy7015 42 AVPSSYTCYCIDGYTGVHCQTNWDECWSNP--CHNGGSCIDG---IAAYNCSCPPGYTGPSCESNVDECG--SNPCQNNG 114 (284)
Q Consensus 42 ~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~--C~~~g~C~~~---~g~~~C~C~~G~~G~~C~~~~~~C~--~~~C~~~~ 114 (284)
...+++.|.|.+||.|..++.. .+|...+ +...+.|... ...+.|.|..||.+..+....++|. ..+|.+.+
T Consensus 105 ~~~~~~~c~c~~g~~~~~~~~~-~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~ 183 (487)
T KOG1217|consen 105 DCVGSYECTCPPGYQGTPCEGE-CECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGG 183 (487)
T ss_pred CCCCCceeeCCCccccCcCCcc-eeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCc
Confidence 3567899999999999988742 1466555 3566777764 4588999999999999976557886 44599899
Q ss_pred EEeeCCCCceEeCCCCCeeeeecCC-------CCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCC
Q psy7015 115 TCHDLLNGFVCSCHPGFTGNCIDGI-------AAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPG 187 (284)
Q Consensus 115 ~C~~~~~~~~C~C~~g~~g~c~~~~-------~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g 187 (284)
.|.+..+.|.|.|+.+|.+.-.... ..+.|.+.+++.+..+...+.++... . ++|.+..++++|.|++|
T Consensus 184 ~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g 259 (487)
T KOG1217|consen 184 TCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASG---D-GTCVNTVGSYTCRCPEG 259 (487)
T ss_pred ccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCC---C-CcccccCCceeeeCCCC
Confidence 9999999999999999987632221 11457788999988888766665444 4 88999999999999999
Q ss_pred CccCCCCCccCCCCCCCCCC-CCCCCEEeeCCCCeeeecCCCCccCCC--cccCCcC----CCCCCCCCCEE--eecCCC
Q psy7015 188 FTGWTGSLCQSATNECESSP-CQNGGVCVDLHAAYTCACLFGFTGRNC--DIELKIC----ENSPCLNEALC--LEEEEE 258 (284)
Q Consensus 188 ~~g~~~~~c~~~~~~C~~~~-C~~~g~C~~~~g~~~C~C~~G~~g~~C--~~~~~~C----~~~~C~~~~~C--~~~~~~ 258 (284)
|.+... ....++++|.... |.++++|++..+.|.|.|++||+|..+ ......| ...+|.+++.| ......
T Consensus 260 ~~~~~~-~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~ 338 (487)
T KOG1217|consen 260 YTGDAC-VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGG 338 (487)
T ss_pred cccccc-ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCC
Confidence 988431 1234677887654 899999999999899999999999998 2233566 34568888888 334456
Q ss_pred eeeecCCCCcCCCccccccccccC
Q psy7015 259 QVCYCVPDYHGNRCQYQYDECQIT 282 (284)
Q Consensus 259 ~~C~C~~G~~G~~C~~~~~~C~~~ 282 (284)
+.|.|..+|.|..|+...++|...
T Consensus 339 ~~C~c~~~~~g~~C~~~~~~C~~~ 362 (487)
T KOG1217|consen 339 FRCACGPGFTGRRCEDSNDECASS 362 (487)
T ss_pred CCcCCCCCCCCCccccCCccccCC
Confidence 789999999999999655688664
No 3
>KOG1217|consensus
Probab=99.64 E-value=1.2e-14 Score=133.12 Aligned_cols=214 Identities=42% Similarity=1.029 Sum_probs=161.0
Q ss_pred CCceEEeCCCCCccCCCCCCCCCCC--CCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCC
Q psy7015 44 PSSYTCYCIDGYTGVHCQTNWDECW--SNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLN 121 (284)
Q Consensus 44 ~g~~~C~C~~G~~g~~C~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~ 121 (284)
...+.|.|..||.+..++...++|. ..+|.+.+.|.+..+.|.|.|++||.+..++.. ...+.|+..
T Consensus 149 ~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------~~~~~c~~~-- 217 (487)
T KOG1217|consen 149 VGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------GNGGTCVDS-- 217 (487)
T ss_pred CCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC---------CCCceEecc--
Confidence 3589999999999999987667887 445999999999999999999999999988643 112223221
Q ss_pred CceEeCCCCCe---------------eeeecCCCCeeeeCCCCccCCC--CccCCCCCCCCC-CCCCCEeccCCCCeeEe
Q psy7015 122 GFVCSCHPGFT---------------GNCIDGIAAYNCSCPPGYTGPS--CESNVDECGSNP-CQNNGTCHDLLNGFVCS 183 (284)
Q Consensus 122 ~~~C~C~~g~~---------------g~c~~~~~~~~C~C~~g~~g~~--C~~~~~~C~~~~-C~~~~~C~~~~g~~~C~ 183 (284)
+.|.+..++. +.|++..+.++|++++||.+.. ...+++.|.... |..+++|++..+.|.|.
T Consensus 218 -~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~ 296 (487)
T KOG1217|consen 218 -VACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCT 296 (487)
T ss_pred -eeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceee
Confidence 1122222211 5667777788999999999876 233678887754 88899999999889999
Q ss_pred CCCCCccCCCCCccCCCCCC----CCCCCCCCCEE--eeCCCCeeeecCCCCccCCCcccCCcCCCCCCCCCCEEee-cC
Q psy7015 184 CHPGFTGWTGSLCQSATNEC----ESSPCQNGGVC--VDLHAAYTCACLFGFTGRNCDIELKICENSPCLNEALCLE-EE 256 (284)
Q Consensus 184 C~~g~~g~~~~~c~~~~~~C----~~~~C~~~g~C--~~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~C~~~~~C~~-~~ 256 (284)
|++||.+..... ..+..+| ...+|.+++.| ....+.+.|.|..+|.|..|+...+.|...++..++.|++ ..
T Consensus 297 C~~g~~g~~~~~-~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~ 375 (487)
T KOG1217|consen 297 CPPGFTGRLCTE-CVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETP 375 (487)
T ss_pred CCCCCCCCCCcc-ccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCC
Confidence 999999843211 2234566 34558888888 3344467899999999999985445888888999999998 68
Q ss_pred CCeeeecCCCCcCC
Q psy7015 257 EEQVCYCVPDYHGN 270 (284)
Q Consensus 257 ~~~~C~C~~G~~G~ 270 (284)
+++.|.|..+|.+.
T Consensus 376 ~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 376 GSYRCACPAGFAGK 389 (487)
T ss_pred CCeEecCCCccccC
Confidence 89999999999874
No 4
>KOG1219|consensus
Probab=99.56 E-value=9.5e-15 Score=143.77 Aligned_cols=117 Identities=31% Similarity=0.869 Sum_probs=108.3
Q ss_pred CCccCCCCCCCCCCCCCCEeccCC-CCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCccC
Q psy7015 154 SCESNVDECGSNPCQNNGTCHDLL-NGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLHAAYTCACLFGFTGR 232 (284)
Q Consensus 154 ~C~~~~~~C~~~~C~~~~~C~~~~-g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~ 232 (284)
.|..-.+.|..+||.++|+|.... ++|.|.|++-|.| ..|+.++..|..+||..+|+|+...+.|.|.|+.||+|.
T Consensus 3859 gC~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG---~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSG---NHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred cccccccccccCcccCCCEecCCCCCceEEeCcccccC---cccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence 454444789999999999999765 6799999999998 999999999999999999999999999999999999999
Q ss_pred CCccc-CCcCCCCCCCCCCEEeecCCCeeeecCCCCcCCCcc
Q psy7015 233 NCDIE-LKICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 233 ~C~~~-~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~ 273 (284)
+|+.. +++|...+|..+|+|++..++|.|.|.+||.|..|.
T Consensus 3936 ~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3936 RCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred eeecccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence 99987 889999999999999999999999999999999985
No 5
>KOG1219|consensus
Probab=99.53 E-value=2.8e-14 Score=140.58 Aligned_cols=117 Identities=40% Similarity=1.071 Sum_probs=105.0
Q ss_pred cccCCCCCCCCCCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCC
Q psy7015 98 CESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLL 177 (284)
Q Consensus 98 C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~ 177 (284)
|..-.+.|..+||+++|.|+..++ ++|.|.|++-|.|..|+.++..|.++||..+++|+...
T Consensus 3860 C~l~~d~C~~npCqhgG~C~~~~~------------------ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQPK------------------GGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFY 3921 (4289)
T ss_pred ccccccccccCcccCCCEecCCCC------------------CceEEeCcccccCcccccccccccCCCCCCCCEEEecC
Confidence 433347888999999999987643 46678888999999999999999999999999999999
Q ss_pred CCeeEeCCCCCccCCCCCccCC-CCCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCc
Q psy7015 178 NGFVCSCHPGFTGWTGSLCQSA-TNECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCD 235 (284)
Q Consensus 178 g~~~C~C~~g~~g~~~~~c~~~-~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~ 235 (284)
++|.|.|+.||+| .+|+.+ +++|+.++|..+|.|++..|+|.|.|.+||.|+.|.
T Consensus 3922 n~f~CnC~~gyTG---~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3922 NGFLCNCPNGYTG---KRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred CCeeEeCCCCccC---ceeecccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence 9999999999998 888877 899999999999999999999999999999999985
No 6
>KOG1214|consensus
Probab=99.45 E-value=1.6e-12 Score=119.18 Aligned_cols=185 Identities=26% Similarity=0.678 Sum_probs=126.0
Q ss_pred CCCCCeEecCCC-CeeeeCCCCCcC--CCcccCCCCCCCC--CCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeC
Q psy7015 72 CHNGGSCIDGIA-AYNCSCPPGYTG--PSCESNVDECGSN--PCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSC 146 (284)
Q Consensus 72 C~~~g~C~~~~g-~~~C~C~~G~~G--~~C~~~~~~C~~~--~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C 146 (284)
|..+..|....+ .|+|.|..||.| .+|. ++++|... .|.++..|++.+++|+|.|..||.-. ...++|+-
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~----dd~~tCV~ 776 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFA----DDRHTCVL 776 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceec----cCCcceEE
Confidence 455667776644 689999999985 5675 66788543 49999999999999999999887521 11122321
Q ss_pred CCCccCCCCccCCCCCC--CCCCCCCCE--eccC-CCCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCCCCe
Q psy7015 147 PPGYTGPSCESNVDECG--SNPCQNNGT--CHDL-LNGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLHAAY 221 (284)
Q Consensus 147 ~~g~~g~~C~~~~~~C~--~~~C~~~~~--C~~~-~g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~g~~ 221 (284)
..-- ..++.|. .+.|.-.+. |+.. .++|.|.|.+||.| ++..|. ++++|.++.|...+.|.++++++
T Consensus 777 i~~p------ap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsG-DG~~c~-dvDeC~psrChp~A~Cyntpgsf 848 (1289)
T KOG1214|consen 777 ITPP------APANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSG-DGHQCT-DVDECSPSRCHPAATCYNTPGSF 848 (1289)
T ss_pred ecCC------CCCCccccCccccCcCCceEEEecCCceEEEeecCCccC-Cccccc-cccccCccccCCCceEecCCCcc
Confidence 0000 1233342 234555554 4433 35799999999999 566665 67999999999999999999999
Q ss_pred eeecCCCCccCC--Ccc---cCCcCCC-----CCCCCCCEEee--cCCCeeeecCCCCcC
Q psy7015 222 TCACLFGFTGRN--CDI---ELKICEN-----SPCLNEALCLE--EEEEQVCYCVPDYHG 269 (284)
Q Consensus 222 ~C~C~~G~~g~~--C~~---~~~~C~~-----~~C~~~~~C~~--~~~~~~C~C~~G~~G 269 (284)
.|+|.+||+|+. |.. ....|.. ..|+.+..|.+ ++..+.+.|.++--|
T Consensus 849 sC~C~pGy~GDGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG 908 (1289)
T KOG1214|consen 849 SCRCQPGYYGDGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPG 908 (1289)
T ss_pred eeecccCccCCCceecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCC
Confidence 999999999764 432 1233432 23666665543 455678888776666
No 7
>KOG1214|consensus
Probab=99.40 E-value=9.8e-12 Score=114.10 Aligned_cols=206 Identities=28% Similarity=0.673 Sum_probs=125.1
Q ss_pred ceEEeCCCCCcc--CCCCCCCCCCCCC--CCCCCCeEecCCCCeeeeCCCCCc----CCCcccCCCCCCCCCCCCC-CEE
Q psy7015 46 SYTCYCIDGYTG--VHCQTNWDECWSN--PCHNGGSCIDGIAAYNCSCPPGYT----GPSCESNVDECGSNPCQNN-GTC 116 (284)
Q Consensus 46 ~~~C~C~~G~~g--~~C~~~~~~C~~~--~C~~~g~C~~~~g~~~C~C~~G~~----G~~C~~~~~~C~~~~C~~~-~~C 116 (284)
.|+|.|..||.| ..|. ++++|... .|..+..|++.+++|+|.|..||. +-+|....+.-..++|... ..|
T Consensus 715 ~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C 793 (1289)
T KOG1214|consen 715 DYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTC 793 (1289)
T ss_pred ceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCcccc
Confidence 589999999984 5665 67788754 499999999999999999999885 3456433332233334322 122
Q ss_pred eeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCC--CCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCC
Q psy7015 117 HDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGP--SCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGS 194 (284)
Q Consensus 117 ~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~--~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~ 194 (284)
. ..+ +++|. ..+.+.|.|.|.+||.|+ .|. ++++|.++-|...++|.++.+++.|+|.+||.| ++.
T Consensus 794 ~-i~g--~a~c~-------~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~G-DGf 861 (1289)
T KOG1214|consen 794 A-IAG--QARCV-------HHGGSTYSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYG-DGF 861 (1289)
T ss_pred C-cCC--ceEEE-------ecCCceEEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCccC-CCc
Confidence 1 111 11111 123356777788888754 565 679999999999999999999999999999999 566
Q ss_pred CccCC---CCCCCC-----CCCCCCCEEe--eCCCCeeeecCCCCcc---CCCcccCCcCCCCCCCCCCEEeec---CCC
Q psy7015 195 LCQSA---TNECES-----SPCQNGGVCV--DLHAAYTCACLFGFTG---RNCDIELKICENSPCLNEALCLEE---EEE 258 (284)
Q Consensus 195 ~c~~~---~~~C~~-----~~C~~~g~C~--~~~g~~~C~C~~G~~g---~~C~~~~~~C~~~~C~~~~~C~~~---~~~ 258 (284)
.|..+ ...|.. ..|+.+..|. ..+..+.+.|.++=.| ..|.... +----.|..++.+... ..+
T Consensus 862 ~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~-~~~vp~Cd~hgh~ap~qchG~~ 940 (1289)
T KOG1214|consen 862 QCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSP-EQYVPQCDDHGHFAPLQCHGKS 940 (1289)
T ss_pred eecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCCCcc-cccCCCccccccccccccCCCc
Confidence 66543 223322 2255544332 2234466766555444 3443211 1001125555555432 234
Q ss_pred eeeecCC
Q psy7015 259 QVCYCVP 265 (284)
Q Consensus 259 ~~C~C~~ 265 (284)
++|.|..
T Consensus 941 ~~CwCvd 947 (1289)
T KOG1214|consen 941 DFCWCVD 947 (1289)
T ss_pred ceeEEec
Confidence 6677755
No 8
>KOG4289|consensus
Probab=99.39 E-value=1.6e-12 Score=124.51 Aligned_cols=88 Identities=45% Similarity=1.143 Sum_probs=66.8
Q ss_pred CCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCC-
Q psy7015 81 GIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNV- 159 (284)
Q Consensus 81 ~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~- 159 (284)
..+.+.|.|++||+|..|+.++|.|...||.+++.|....++|.|.|+ +||+|..|+.+.
T Consensus 1218 pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCr-------------------pg~tGehCEvs~~ 1278 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECR-------------------PGFTGEHCEVSAR 1278 (2531)
T ss_pred ccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEec-------------------CCccccceeeecc
Confidence 345689999999999999999999999999999999988777666655 555555665322
Q ss_pred -CCCCCCCCCCCCEeccCC-CCeeEeCCCC
Q psy7015 160 -DECGSNPCQNNGTCHDLL-NGFVCSCHPG 187 (284)
Q Consensus 160 -~~C~~~~C~~~~~C~~~~-g~~~C~C~~g 187 (284)
-.|.+..|.++++|++.. +.+.|.|+.|
T Consensus 1279 agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1279 AGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred cCccccceecCCCEEeecCCCceeccCCCc
Confidence 346666677777887653 5677777776
No 9
>KOG1225|consensus
Probab=99.31 E-value=2.9e-11 Score=108.80 Aligned_cols=131 Identities=35% Similarity=0.982 Sum_probs=93.8
Q ss_pred eeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCC
Q psy7015 86 NCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSN 165 (284)
Q Consensus 86 ~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~ 165 (284)
.|.|+.+|+|..|. ...|... |..++.|++ .+|.|++||+|..|.. -.|...
T Consensus 235 ic~c~~~~~g~~c~--~~~C~~~-c~~~g~c~~-----------------------G~CIC~~Gf~G~dC~e--~~Cp~~ 286 (525)
T KOG1225|consen 235 ICECPEGYFGPLCS--TIYCPGG-CTGRGQCVE-----------------------GRCICPPGFTGDDCDE--LVCPVD 286 (525)
T ss_pred eeecCCceeCCccc--cccCCCC-CcccceEeC-----------------------CeEeCCCCCcCCCCCc--ccCCcc
Confidence 68888888888875 2233222 444445543 2677888889999963 345544
Q ss_pred CCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCCcCCCCC
Q psy7015 166 PCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELKICENSP 245 (284)
Q Consensus 166 ~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~ 245 (284)
|+.++.+++ ..|.|++||+| ..|.. ..|. ..|.++|.|+ .| +|.|.+||+|..|+.. .
T Consensus 287 -cs~~g~~~~----g~CiC~~g~~G---~dCs~--~~cp-adC~g~G~Ci--~G--~C~C~~Gy~G~~C~~~------~- 344 (525)
T KOG1225|consen 287 -CSGGGVCVD----GECICNPGYSG---KDCSI--RRCP-ADCSGHGKCI--DG--ECLCDEGYTGELCIQR------A- 344 (525)
T ss_pred -cCCCceecC----CEeecCCCccc---ccccc--ccCC-ccCCCCCccc--CC--ceEeCCCCcCCccccc------c-
Confidence 777777765 38999999988 66642 2343 5699999998 33 7999999999999753 3
Q ss_pred CCCCCEEeecCCCeeeecCCCCcCCC
Q psy7015 246 CLNEALCLEEEEEQVCYCVPDYHGNR 271 (284)
Q Consensus 246 C~~~~~C~~~~~~~~C~C~~G~~G~~ 271 (284)
|++++.|++ + |+|..||.|.+
T Consensus 345 C~~~g~cv~--g---C~C~~Gw~G~d 365 (525)
T KOG1225|consen 345 CSGGGQCVN--G---CKCKKGWRGPD 365 (525)
T ss_pred cCCCceecc--C---ceeccCccCCC
Confidence 888899854 2 99999999998
No 10
>KOG1225|consensus
Probab=99.30 E-value=5.6e-11 Score=107.04 Aligned_cols=131 Identities=40% Similarity=1.079 Sum_probs=99.6
Q ss_pred EEeCCCCCccCCCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceEeC
Q psy7015 48 TCYCIDGYTGVHCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSC 127 (284)
Q Consensus 48 ~C~C~~G~~g~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C 127 (284)
.|.|..+|.|..|+. ..|. ..|..++.|++. +|+|++||+|.+|.. -.|... |+.++.+++.
T Consensus 235 ic~c~~~~~g~~c~~--~~C~-~~c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g-------- 296 (525)
T KOG1225|consen 235 ICECPEGYFGPLCST--IYCP-GGCTGRGQCVEG----RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDG-------- 296 (525)
T ss_pred eeecCCceeCCcccc--ccCC-CCCcccceEeCC----eEeCCCCCcCCCCCc--ccCCcc-cCCCceecCC--------
Confidence 799999999999973 2343 347777889887 599999999999963 345444 7666666542
Q ss_pred CCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCCCCCCCCC
Q psy7015 128 HPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSATNECESSP 207 (284)
Q Consensus 128 ~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~ 207 (284)
+|+|.+||.|..|+.. .|. ..|..+|.|++ .+|.|.+||+| ..|... .
T Consensus 297 ---------------~CiC~~g~~G~dCs~~--~cp-adC~g~G~Ci~----G~C~C~~Gy~G---~~C~~~-------~ 344 (525)
T KOG1225|consen 297 ---------------ECICNPGYSGKDCSIR--RCP-ADCSGHGKCID----GECLCDEGYTG---ELCIQR-------A 344 (525)
T ss_pred ---------------EeecCCCccccccccc--cCC-ccCCCCCcccC----CceEeCCCCcC---Cccccc-------c
Confidence 5667888888888643 343 45999999982 48999999998 666532 3
Q ss_pred CCCCCEEeeCCCCeeeecCCCCccCC
Q psy7015 208 CQNGGVCVDLHAAYTCACLFGFTGRN 233 (284)
Q Consensus 208 C~~~g~C~~~~g~~~C~C~~G~~g~~ 233 (284)
|.+++.|++ + |.|..||.|.+
T Consensus 345 C~~~g~cv~--g---C~C~~Gw~G~d 365 (525)
T KOG1225|consen 345 CSGGGQCVN--G---CKCKKGWRGPD 365 (525)
T ss_pred cCCCceecc--C---ceeccCccCCC
Confidence 888899985 2 99999999998
No 11
>KOG4260|consensus
Probab=99.00 E-value=4.2e-10 Score=91.18 Aligned_cols=165 Identities=29% Similarity=0.747 Sum_probs=95.8
Q ss_pred eCCCCCccCCCCCCCCCCCCCCCCCCCeEec---CCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceE-
Q psy7015 50 YCIDGYTGVHCQTNWDECWSNPCHNGGSCID---GIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVC- 125 (284)
Q Consensus 50 ~C~~G~~g~~C~~~~~~C~~~~C~~~g~C~~---~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C- 125 (284)
-|++|-.|++|.. ...-...||..+|.|.- ..|+..|.|.+||+|..|.. |...--. . ........|
T Consensus 131 CCp~gtyGpdCl~-Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~----Cg~eyfe---s-~Rne~~lvCt 201 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQ-CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY----CGIEYFE---S-SRNEQHLVCT 201 (350)
T ss_pred ccCCCCcCCcccc-CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc----cchHHHH---h-hcccccchhh
Confidence 3889999998863 22223467999999973 34677899999999998852 2100000 0 000000111
Q ss_pred eCCCCCeeeeecCCCCeee-eCCCCcc--CCCCccCCCCCCC--CCCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCC
Q psy7015 126 SCHPGFTGNCIDGIAAYNC-SCPPGYT--GPSCESNVDECGS--NPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSAT 200 (284)
Q Consensus 126 ~C~~g~~g~c~~~~~~~~C-~C~~g~~--g~~C~~~~~~C~~--~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~ 200 (284)
.|..+-.|.|... .+..| .|..||. -..|. ++++|.. .||.....|+|+.|+|.|.+++||.+ ....|+...
T Consensus 202 ~Ch~~C~~~Csg~-~~k~C~kCkkGW~lde~gCv-DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~-g~d~C~~~~ 278 (350)
T KOG4260|consen 202 ACHEGCLGVCSGE-SSKGCSKCKKGWKLDEEGCV-DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK-GVDECQFCA 278 (350)
T ss_pred hhhhhhhcccCCC-CCCChhhhcccceecccccc-cHHHHhcCCCCCChhheeecCCCceEecccccccC-ChHHhhhhh
Confidence 1222222222111 12223 3666665 23565 7888843 56877788999999999988888876 222232111
Q ss_pred CCCCCCCCCCCCEEeeCCCCeeeecCCCCc
Q psy7015 201 NECESSPCQNGGVCVDLHAAYTCACLFGFT 230 (284)
Q Consensus 201 ~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~ 230 (284)
+.| -..+..|.++++.|+|+|..|+.
T Consensus 279 d~~----~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 279 DVC----ASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred hhc----ccCCCCcccCCccEEEEecccce
Confidence 111 12356788888889998888764
No 12
>KOG0994|consensus
Probab=98.97 E-value=2.5e-09 Score=101.43 Aligned_cols=224 Identities=25% Similarity=0.602 Sum_probs=108.6
Q ss_pred eecCCCceEEeCCCCCccCCCCCCC--------CCCCCCCCCCCC----eEecCCCCeeeeCCCCCcCCCccc------C
Q psy7015 40 IFAVPSSYTCYCIDGYTGVHCQTNW--------DECWSNPCHNGG----SCIDGIAAYNCSCPPGYTGPSCES------N 101 (284)
Q Consensus 40 ~~~~~g~~~C~C~~G~~g~~C~~~~--------~~C~~~~C~~~g----~C~~~~g~~~C~C~~G~~G~~C~~------~ 101 (284)
....+.+.+|.|+|+-.|..|.... ..|....|...| .|....| +|.|.+|-+|..|.. .
T Consensus 775 ~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tG--QC~C~~g~ygrqCnqCqpG~Wg 852 (1758)
T KOG0994|consen 775 SVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITG--QCQCRPGTYGRQCNQCQPGYWG 852 (1758)
T ss_pred ccccCCCceecccCccccccccccCCcccCcCCccCcccccccccccccccccccc--ceeeccccchhhccccCCCccC
Confidence 3444667799999999998886311 112222233322 3444444 488888888877653 2
Q ss_pred CCCCCCCCCCCCC-EEeeCCCCceEeCCCCCeeeeecCCCCeee-eCCCCccCCCCccCCCCCCCCCCCCCC--------
Q psy7015 102 VDECGSNPCQNNG-TCHDLLNGFVCSCHPGFTGNCIDGIAAYNC-SCPPGYTGPSCESNVDECGSNPCQNNG-------- 171 (284)
Q Consensus 102 ~~~C~~~~C~~~~-~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C-~C~~g~~g~~C~~~~~~C~~~~C~~~~-------- 171 (284)
..+|.+..|+.|+ .|... .|.--.|.+...++.| .|..||+|+.-...-..|.+-||..+-
T Consensus 853 FPeCr~CqCNgHA~~Cd~~---------tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~ 923 (1758)
T KOG0994|consen 853 FPECRPCQCNGHADTCDPI---------TGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHAD 923 (1758)
T ss_pred CCcCccccccCcccccCcc---------ccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccc
Confidence 3344443344332 22221 1222234444445555 366666654322222233333333211
Q ss_pred Eec--cCCCCeeEeCCCCCccCCCCCccCC------------CCCCC-------CCCCCC-CCE---EeeCCCCeee-ec
Q psy7015 172 TCH--DLLNGFVCSCHPGFTGWTGSLCQSA------------TNECE-------SSPCQN-GGV---CVDLHAAYTC-AC 225 (284)
Q Consensus 172 ~C~--~~~g~~~C~C~~g~~g~~~~~c~~~------------~~~C~-------~~~C~~-~g~---C~~~~g~~~C-~C 225 (284)
.|. +......|.|.+||.|.....|... .-+|. +..|.. .|. |...+.+.+| .|
T Consensus 924 sC~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~C 1003 (1758)
T KOG0994|consen 924 SCYLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHC 1003 (1758)
T ss_pred cccccccccceeeecccCccccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhc
Confidence 232 2233456888888877322222100 00111 111211 122 2222333455 58
Q ss_pred CCCCccCCCcccCCcCCCCCCCCCCEEeecCCCeeeecCCCCcCCCccc
Q psy7015 226 LFGFTGRNCDIELKICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQY 274 (284)
Q Consensus 226 ~~G~~g~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~ 274 (284)
.+||+|+.-......|.-..-..+.+|.-+..+++|.|.+..+|.+|+.
T Consensus 1004 k~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CDq 1052 (1758)
T KOG0994|consen 1004 KDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCDQ 1052 (1758)
T ss_pred cccchhHHHHhhhhhheccccccCCccccccccCcCCCCcccccccccc
Confidence 9999987543333333222111223354477788999999999999953
No 13
>KOG4260|consensus
Probab=98.95 E-value=9.5e-10 Score=89.13 Aligned_cols=163 Identities=25% Similarity=0.596 Sum_probs=100.0
Q ss_pred eCCCCCcCCCcccCCCCCCCCCCCCCCEEee---CCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCC
Q psy7015 88 SCPPGYTGPSCESNVDECGSNPCQNNGTCHD---LLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGS 164 (284)
Q Consensus 88 ~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~---~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~ 164 (284)
-|++|-+|++|..- ..-+..+|..++.|.- ..|+.+|.|.+||.|.-.. .|..+|+-..=....-.|..
T Consensus 131 CCp~gtyGpdCl~C-pggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-------~Cg~eyfes~Rne~~lvCt~ 202 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQC-PGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-------YCGIEYFESSRNEQHLVCTA 202 (350)
T ss_pred ccCCCCcCCccccC-CCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-------ccchHHHHhhcccccchhhh
Confidence 38999999999642 2335667999999863 2344566666666654211 14444431100000001110
Q ss_pred --CCCCCCCEeccCCCCeeE-eCCCCCccCCCCCccCCCCCCC--CCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCC
Q psy7015 165 --NPCQNNGTCHDLLNGFVC-SCHPGFTGWTGSLCQSATNECE--SSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELK 239 (284)
Q Consensus 165 --~~C~~~~~C~~~~g~~~C-~C~~g~~g~~~~~c~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~ 239 (284)
..|. +.|..... -.| .|..||... ...| .|+++|. +.+|.....|+|+.|+|.|.+++||.+. ++
T Consensus 203 Ch~~C~--~~Csg~~~-k~C~kCkkGW~ld-e~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d 272 (350)
T KOG4260|consen 203 CHEGCL--GVCSGESS-KGCSKCKKGWKLD-EEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VD 272 (350)
T ss_pred hhhhhh--cccCCCCC-CChhhhcccceec-cccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccCC-----hH
Confidence 0121 23432222 234 689999874 3455 4899995 5679999999999999999999999873 22
Q ss_pred cC---CCCCCCCCCEEeecCCCeeeecCCCCc
Q psy7015 240 IC---ENSPCLNEALCLEEEEEQVCYCVPDYH 268 (284)
Q Consensus 240 ~C---~~~~C~~~~~C~~~~~~~~C~C~~G~~ 268 (284)
.| ...-=..+..|.+.++.|+|+|.+|+.
T Consensus 273 ~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 273 ECQFCADVCASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred HhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence 22 222223567788899999999998874
No 14
>KOG0994|consensus
Probab=98.94 E-value=6.4e-09 Score=98.75 Aligned_cols=57 Identities=30% Similarity=0.780 Sum_probs=36.6
Q ss_pred EEeeCCCCeeeecCCCCccCCCccc--------CCcCCCCCCCCCC----EEeecCCCeeeecCCCCcCCCcc
Q psy7015 213 VCVDLHAAYTCACLFGFTGRNCDIE--------LKICENSPCLNEA----LCLEEEEEQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 213 ~C~~~~g~~~C~C~~G~~g~~C~~~--------~~~C~~~~C~~~~----~C~~~~~~~~C~C~~G~~G~~C~ 273 (284)
+|...+| .|.|.+||-|+.|+.. ...|....|...| +| +..++.|+|.+|..|.+|+
T Consensus 1078 qCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQC--dr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQC--DRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred ccccccc--ceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCc--cccCCceeecCCCCCcchh
Confidence 5655555 8999999999998732 1123222233332 44 3345679999999998884
No 15
>KOG1226|consensus
Probab=98.48 E-value=9.5e-07 Score=81.80 Aligned_cols=120 Identities=32% Similarity=0.765 Sum_probs=80.5
Q ss_pred eeeCCCCccCCCCccCC---------CCCC----CCCCCCCCEeccCCCCeeEeCCCCCcc-CCCCCccCCCCCCCCC--
Q psy7015 143 NCSCPPGYTGPSCESNV---------DECG----SNPCQNNGTCHDLLNGFVCSCHPGFTG-WTGSLCQSATNECESS-- 206 (284)
Q Consensus 143 ~C~C~~g~~g~~C~~~~---------~~C~----~~~C~~~~~C~~~~g~~~C~C~~g~~g-~~~~~c~~~~~~C~~~-- 206 (284)
.|.|.+||.|+.|+-.. +.|. ..+|...|.|.= .+|+|.+...+ ..|..|.-+.-.|..+
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~~i~G~fCECDnfsC~r~~g 554 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNGKIYGKFCECDNFSCERHKG 554 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCCceeeeeeeccCcccccccC
Confidence 45677777777775322 2231 236888888764 47888876552 1236676554445433
Q ss_pred -CCCCCCEEeeCCCCeeeecCCCCccCCCcc--cCCcCCC---CCCCCCCEEeecCCCeeeecCCC-CcCCCccc
Q psy7015 207 -PCQNGGVCVDLHAAYTCACLFGFTGRNCDI--ELKICEN---SPCLNEALCLEEEEEQVCYCVPD-YHGNRCQY 274 (284)
Q Consensus 207 -~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~--~~~~C~~---~~C~~~~~C~~~~~~~~C~C~~G-~~G~~C~~ 274 (284)
.|.++|.|.-. +|+|.+||+|..|+- +.+.|.. ..|+..|.|.-. +|+|... |.|..||.
T Consensus 555 ~lC~g~G~C~CG----~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~ 621 (783)
T KOG1226|consen 555 VLCGGHGRCECG----RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK 621 (783)
T ss_pred cccCCCCeEeCC----cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence 49999998643 799999999998863 4455643 248888888543 4999776 99999985
No 16
>KOG1226|consensus
Probab=98.45 E-value=3.3e-06 Score=78.30 Aligned_cols=146 Identities=34% Similarity=0.819 Sum_probs=87.9
Q ss_pred CCCCCeEecCCCCeeeeCCCCCcCCCcccCC---------CCCCC----CCCCCCCEEeeCCCCceEeCCCCCeeeeecC
Q psy7015 72 CHNGGSCIDGIAAYNCSCPPGYTGPSCESNV---------DECGS----NPCQNNGTCHDLLNGFVCSCHPGFTGNCIDG 138 (284)
Q Consensus 72 C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~---------~~C~~----~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~ 138 (284)
|+.+|+.+-+ .|.|.+||.|+.|+-.. +.|+. .+|.++|.|+= .+|.|.+...+
T Consensus 469 C~g~G~~~CG----~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~----- 535 (783)
T KOG1226|consen 469 CHGNGTFVCG----QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNG----- 535 (783)
T ss_pred cCCCCcEEec----ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCC-----
Confidence 6666665544 48999999999997322 22321 14555555542 13333332221
Q ss_pred CCCeeeeCCCCccCCCCccCCCCCCC---CCCCCCCEeccCCCCeeEeCCCCCccCCCCCcc--CCCCCCCC---CCCCC
Q psy7015 139 IAAYNCSCPPGYTGPSCESNVDECGS---NPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQ--SATNECES---SPCQN 210 (284)
Q Consensus 139 ~~~~~C~C~~g~~g~~C~~~~~~C~~---~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~--~~~~~C~~---~~C~~ 210 (284)
-++|..|+-+.-.|.. ..|..+|.|.- .+|+|.+||+| ..|. .+.+.|.+ ..|.+
T Consensus 536 ----------~i~G~fCECDnfsC~r~~g~lC~g~G~C~C----G~CvC~~GwtG---~~C~C~~std~C~~~~G~iCSG 598 (783)
T KOG1226|consen 536 ----------KIYGKFCECDNFSCERHKGVLCGGHGRCEC----GRCVCNPGWTG---SACNCPLSTDTCESSDGQICSG 598 (783)
T ss_pred ----------ceeeeeeeccCcccccccCcccCCCCeEeC----CcEEcCCCCcc---CCCCCCCCCccccCCCCceeCC
Confidence 1236777644434432 35888888864 48999999998 5443 34555643 23888
Q ss_pred CCEEeeCCCCeeeecCCC-CccCCCcccCCcCCCCCCCCCCEEe
Q psy7015 211 GGVCVDLHAAYTCACLFG-FTGRNCDIELKICENSPCLNEALCL 253 (284)
Q Consensus 211 ~g~C~~~~g~~~C~C~~G-~~g~~C~~~~~~C~~~~C~~~~~C~ 253 (284)
.|+|.-. +|.|... |.|..|++... | ..+|..+..|+
T Consensus 599 rG~C~Cg----~C~C~~~~~sG~~CE~cpt-c-~~~C~~~~~Cv 636 (783)
T KOG1226|consen 599 RGTCECG----RCKCTDPPYSGEFCEKCPT-C-PDPCAENKSCV 636 (783)
T ss_pred CceeeCC----ceEcCCCCcCcchhhcCCC-C-CCcccccccch
Confidence 8888643 6888777 99999985432 2 23366665553
No 17
>KOG1836|consensus
Probab=98.17 E-value=4.9e-05 Score=77.88 Aligned_cols=57 Identities=32% Similarity=0.674 Sum_probs=37.5
Q ss_pred EEeeCCCCeeeecCCCCccCCCcccC--------CcCCCCCCCCCC----EEeecCCCeeeecCCCCcCCCcc
Q psy7015 213 VCVDLHAAYTCACLFGFTGRNCDIEL--------KICENSPCLNEA----LCLEEEEEQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 213 ~C~~~~g~~~C~C~~G~~g~~C~~~~--------~~C~~~~C~~~~----~C~~~~~~~~C~C~~G~~G~~C~ 273 (284)
.|... +..|.|.+|.+|..|+... ..|....|...| +| .+..++|.|++++.|.+|.
T Consensus 953 ~c~~~--tGqc~c~~gVtgqrc~qc~~~~~~~~~~gc~~c~c~~~Gs~~~qc--~~~~G~c~c~~~~~g~~c~ 1021 (1705)
T KOG1836|consen 953 DCDVG--TGQCYCRPGVTGQRCDQCETYHFGFQTEGCGLCECDPLGSRGFQC--DPEDGQCPCRPGFEGRRCD 1021 (1705)
T ss_pred ccccc--CCceeeecCccccccCccccCcccccccCCcceecccCCccccee--cccCCeeeecCCCCCcccc
Confidence 45433 3489999999998887321 223333355555 56 3445689999999998775
No 18
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.07 E-value=2.7e-06 Score=47.14 Aligned_cols=29 Identities=31% Similarity=0.841 Sum_probs=18.5
Q ss_pred CCCCCCCCCEEeecC-CCeeeecCCCCcCC
Q psy7015 242 ENSPCLNEALCLEEE-EEQVCYCVPDYHGN 270 (284)
Q Consensus 242 ~~~~C~~~~~C~~~~-~~~~C~C~~G~~G~ 270 (284)
.+.+|.++|+|+... .+|.|+|++||+|+
T Consensus 2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 344666666666665 66666666666665
No 19
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.06 E-value=8.4e-06 Score=47.30 Aligned_cols=36 Identities=61% Similarity=1.522 Sum_probs=30.8
Q ss_pred CCCCCC-CCCCCCCeEecCCCCeeeeCCCCCc-CCCcc
Q psy7015 64 WDECWS-NPCHNGGSCIDGIAAYNCSCPPGYT-GPSCE 99 (284)
Q Consensus 64 ~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~-G~~C~ 99 (284)
+++|.. .+|.++++|+++.++|.|.|++||. |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 567776 7899889999999999999999998 87763
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.96 E-value=5.6e-06 Score=45.82 Aligned_cols=31 Identities=58% Similarity=1.372 Sum_probs=27.1
Q ss_pred CCCCCCCCCCEEeeCC-CCeeeecCCCCccCC
Q psy7015 203 CESSPCQNGGVCVDLH-AAYTCACLFGFTGRN 233 (284)
Q Consensus 203 C~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~~ 233 (284)
|.+++|.++|+|+... ++|.|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 4456899999999988 899999999999964
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.93 E-value=1.8e-05 Score=45.86 Aligned_cols=29 Identities=59% Similarity=1.382 Sum_probs=14.9
Q ss_pred CCCCCCCEEeeCCCCeeeecCCCCc-cCCC
Q psy7015 206 SPCQNGGVCVDLHAAYTCACLFGFT-GRNC 234 (284)
Q Consensus 206 ~~C~~~g~C~~~~g~~~C~C~~G~~-g~~C 234 (284)
.+|.++++|+++.++|.|.|++||. |..|
T Consensus 9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 3455555555555555555555555 4443
No 22
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.89 E-value=5.8e-06 Score=48.97 Aligned_cols=32 Identities=44% Similarity=1.187 Sum_probs=24.9
Q ss_pred CCCCCCCC--CCCCCCeEecCCCCeeeeCCCCCc
Q psy7015 63 NWDECWSN--PCHNGGSCIDGIAAYNCSCPPGYT 94 (284)
Q Consensus 63 ~~~~C~~~--~C~~~g~C~~~~g~~~C~C~~G~~ 94 (284)
|++||... .|..++.|+|+.|+|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 46777654 477788888888888888888887
No 23
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.77 E-value=5.3e-05 Score=43.41 Aligned_cols=35 Identities=63% Similarity=1.551 Sum_probs=29.7
Q ss_pred CCCCCC-CCCCCCCeEecCCCCeeeeCCCCCcCCCc
Q psy7015 64 WDECWS-NPCHNGGSCIDGIAAYNCSCPPGYTGPSC 98 (284)
Q Consensus 64 ~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~G~~C 98 (284)
+++|.. .+|..++.|++..+.|.|.|++||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 456766 67888899999999999999999998776
No 24
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.75 E-value=2.2e-05 Score=46.45 Aligned_cols=31 Identities=32% Similarity=0.985 Sum_probs=21.7
Q ss_pred CCCCCC--CCCCCCCEEeeCCCCeeeecCCCCc
Q psy7015 200 TNECES--SPCQNGGVCVDLHAAYTCACLFGFT 230 (284)
Q Consensus 200 ~~~C~~--~~C~~~g~C~~~~g~~~C~C~~G~~ 230 (284)
++||.. +.|..++.|+|+.|+|.|.|++||+
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 566643 3476677777777777777777776
No 25
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.61 E-value=0.00011 Score=42.08 Aligned_cols=29 Identities=34% Similarity=0.938 Sum_probs=16.4
Q ss_pred CCCCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015 244 SPCLNEALCLEEEEEQVCYCVPDYHGNRC 272 (284)
Q Consensus 244 ~~C~~~~~C~~~~~~~~C~C~~G~~G~~C 272 (284)
.+|.+++.|++..++|.|.|+.||.|..|
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 34555555655555556666666665554
No 26
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.38 E-value=0.00036 Score=39.18 Aligned_cols=30 Identities=63% Similarity=1.552 Sum_probs=25.4
Q ss_pred CCCCCCCCeEecCCCCeeeeCCCCCcCC-Cc
Q psy7015 69 SNPCHNGGSCIDGIAAYNCSCPPGYTGP-SC 98 (284)
Q Consensus 69 ~~~C~~~g~C~~~~g~~~C~C~~G~~G~-~C 98 (284)
..+|..++.|++..+.|+|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 4668888999998889999999999887 44
No 27
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.38 E-value=7.2e-05 Score=58.07 Aligned_cols=135 Identities=27% Similarity=0.674 Sum_probs=72.7
Q ss_pred CeEecCCCCeeeeCCCCCc---CCCcccCCCCC-----CCCCCCCCCEEeeCCC-----CceEeCCCCCeeeeecCCCCe
Q psy7015 76 GSCIDGIAAYNCSCPPGYT---GPSCESNVDEC-----GSNPCQNNGTCHDLLN-----GFVCSCHPGFTGNCIDGIAAY 142 (284)
Q Consensus 76 g~C~~~~g~~~C~C~~G~~---G~~C~~~~~~C-----~~~~C~~~~~C~~~~~-----~~~C~C~~g~~g~c~~~~~~~ 142 (284)
|.-+...+.|.|.|++||. -..|+.. .+| ...+|...+.|++... .|.|.|..||...
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~~EntCE~k-v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~-------- 81 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLKNENTCEEK-VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILK-------- 81 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES--------
T ss_pred CEEEEccCceEEEcCCCcEEccccccccc-eecCcccccCccccchhhhhcCCCcccceeEEEecccCceee--------
Confidence 4444445667888888886 3445532 234 2356888888886642 3455555544421
Q ss_pred eeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCC---CCeeEeCCCCCccCCCCCccCCC-CCCCCCCCCCCCEEeeCC
Q psy7015 143 NCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLL---NGFVCSCHPGFTGWTGSLCQSAT-NECESSPCQNGGVCVDLH 218 (284)
Q Consensus 143 ~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~---g~~~C~C~~g~~g~~~~~c~~~~-~~C~~~~C~~~g~C~~~~ 218 (284)
...|. ...|....|. .|.|+-.+ ....|.|.-|+...+...|..+. .+|. -.|..+.+|..+.
T Consensus 82 ---------~~vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~ 148 (197)
T PF06247_consen 82 ---------QGVCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVD 148 (197)
T ss_dssp ---------SSSEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEET
T ss_pred ---------CCeEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeC
Confidence 12232 2345555576 57887332 34589999998854445554322 2332 2377788999999
Q ss_pred CCeeeecCCCCccC
Q psy7015 219 AAYTCACLFGFTGR 232 (284)
Q Consensus 219 g~~~C~C~~G~~g~ 232 (284)
+-|+|.+..||.+.
T Consensus 149 ~~Y~C~~~~~~~~~ 162 (197)
T PF06247_consen 149 GYYKCVCKEGFPGD 162 (197)
T ss_dssp TEEEEEE-TT-EEE
T ss_pred cEEEeecCCCCCCC
Confidence 99999999998754
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.35 E-value=0.0004 Score=39.06 Aligned_cols=28 Identities=61% Similarity=1.541 Sum_probs=23.4
Q ss_pred CCCCCCCeEecCCCCeeeeCCCCCcC-CCc
Q psy7015 70 NPCHNGGSCIDGIAAYNCSCPPGYTG-PSC 98 (284)
Q Consensus 70 ~~C~~~g~C~~~~g~~~C~C~~G~~G-~~C 98 (284)
.+|.++ +|++..+.|+|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 568777 8998888999999999988 555
No 29
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.35 E-value=0.00038 Score=39.11 Aligned_cols=30 Identities=33% Similarity=0.951 Sum_probs=19.7
Q ss_pred CCCCCCCCEEeecCCCeeeecCCCCcCC-Cc
Q psy7015 243 NSPCLNEALCLEEEEEQVCYCVPDYHGN-RC 272 (284)
Q Consensus 243 ~~~C~~~~~C~~~~~~~~C~C~~G~~G~-~C 272 (284)
..+|.+++.|++..+.+.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 3456666777666666777777777766 44
No 30
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.33 E-value=0.00016 Score=40.97 Aligned_cols=27 Identities=30% Similarity=0.759 Sum_probs=18.8
Q ss_pred CCCCCCCEEeecCCCeeeecCCCCcCC
Q psy7015 244 SPCLNEALCLEEEEEQVCYCVPDYHGN 270 (284)
Q Consensus 244 ~~C~~~~~C~~~~~~~~C~C~~G~~G~ 270 (284)
..|+.+++|++..+++.|+|++||.|+
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 347778888888888888888888876
No 31
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.26 E-value=0.00011 Score=57.11 Aligned_cols=102 Identities=24% Similarity=0.643 Sum_probs=62.0
Q ss_pred CCCCCCCEeccCC-----CCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCC---CCeeeecCCCCc---cCC
Q psy7015 165 NPCQNNGTCHDLL-----NGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLH---AAYTCACLFGFT---GRN 233 (284)
Q Consensus 165 ~~C~~~~~C~~~~-----g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~---g~~~C~C~~G~~---g~~ 233 (284)
.+|...++|++.. ..|.|.|.+||....+ .|. ...|....|. .|.|+-.+ ....|.|.-|+. ...
T Consensus 50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~k 125 (197)
T PF06247_consen 50 KPCGDYAKCINQANKGEERAYKCDCINGYILKQG-VCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKK 125 (197)
T ss_dssp SEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-SEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTE
T ss_pred ccccchhhhhcCCCcccceeEEEecccCceeeCC-eEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCc
Confidence 5688889998654 5799999999987433 442 3456666677 68997432 235899999987 223
Q ss_pred CcccCC-cCCCCCCCCCCEEeecCCCeeeecCCCCcCCC
Q psy7015 234 CDIELK-ICENSPCLNEALCLEEEEEQVCYCVPDYHGNR 271 (284)
Q Consensus 234 C~~~~~-~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~ 271 (284)
|..+-+ .| ...|..+-.|.....-|.|.|.+++.++.
T Consensus 126 Ctk~G~T~C-~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 126 CTKTGETKC-SLKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp SEEEE---------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred ccCCCccce-eeecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 432211 22 23377788999999999999999997543
No 32
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.23 E-value=0.00057 Score=38.41 Aligned_cols=28 Identities=39% Similarity=1.064 Sum_probs=17.2
Q ss_pred CCCCCCCEEeecCCCeeeecCCCCcC-CCc
Q psy7015 244 SPCLNEALCLEEEEEQVCYCVPDYHG-NRC 272 (284)
Q Consensus 244 ~~C~~~~~C~~~~~~~~C~C~~G~~G-~~C 272 (284)
.+|.++ +|++..+++.|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 345555 6666666666666666666 444
No 33
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins.
Probab=97.21 E-value=0.00052 Score=37.74 Aligned_cols=26 Identities=27% Similarity=0.740 Sum_probs=19.4
Q ss_pred CCCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015 245 PCLNEALCLEEEEEQVCYCVPDYHGNRC 272 (284)
Q Consensus 245 ~C~~~~~C~~~~~~~~C~C~~G~~G~~C 272 (284)
.|+++|+|+.. .++|+|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 47888888654 456888888888776
No 34
>KOG1836|consensus
Probab=97.04 E-value=0.0062 Score=63.13 Aligned_cols=176 Identities=32% Similarity=0.730 Sum_probs=91.7
Q ss_pred eeCCCCCcCCCcccC-----------CCCCCCCCCCCCC---EEeeCCCCceEeCCCCCeeeeecCCCCeee-eCCCCcc
Q psy7015 87 CSCPPGYTGPSCESN-----------VDECGSNPCQNNG---TCHDLLNGFVCSCHPGFTGNCIDGIAAYNC-SCPPGYT 151 (284)
Q Consensus 87 C~C~~G~~G~~C~~~-----------~~~C~~~~C~~~~---~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C-~C~~g~~ 151 (284)
|.|++||+|..|+.- .+.+...+|..++ .|... +..|.|.....| .+| +|..||+
T Consensus 697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~--tG~C~C~~~t~G--------~~C~~C~~GfY 766 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPR--TGQCKCKHNTFG--------GQCAQCVDGFY 766 (1705)
T ss_pred ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCC--CCceecccCCCC--------CchhhhcCCCC
Confidence 899999999988731 1112222333333 23322 234555443332 344 4889999
Q ss_pred CCCCccCCCCCCCCCCCCCCEeccCC--CCeeEe-CCCCCccCCCCCccC-----------CCCCCCCCCCCC-------
Q psy7015 152 GPSCESNVDECGSNPCQNNGTCHDLL--NGFVCS-CHPGFTGWTGSLCQS-----------ATNECESSPCQN------- 210 (284)
Q Consensus 152 g~~C~~~~~~C~~~~C~~~~~C~~~~--g~~~C~-C~~g~~g~~~~~c~~-----------~~~~C~~~~C~~------- 210 (284)
|..-......|..-+|...+.|.... ....|. |++||+|.....|.. ++..|.+.+|..
T Consensus 767 g~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~ 846 (1705)
T KOG1836|consen 767 GLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAF 846 (1705)
T ss_pred CccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCcccc
Confidence 86554333347777788777776443 456787 999998843333321 111233322222
Q ss_pred ------CCEE---eeCCCCeee-ecCCCCccCCCc-ccCCcCCCCCCCCC------CEEeecCCCeeeecCCCCcCCCcc
Q psy7015 211 ------GGVC---VDLHAAYTC-ACLFGFTGRNCD-IELKICENSPCLNE------ALCLEEEEEQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 211 ------~g~C---~~~~g~~~C-~C~~G~~g~~C~-~~~~~C~~~~C~~~------~~C~~~~~~~~C~C~~G~~G~~C~ 273 (284)
.+.| +.......| .|.+||.|+.-. .+.+.|...-|... .+| ...++.|.|.+...|..|.
T Consensus 847 g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c--~~~tGQcec~~~v~g~~c~ 924 (1705)
T KOG1836|consen 847 GNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTC--NPVTGQCECKPNVEGRDCL 924 (1705)
T ss_pred ccccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccC--CCcccceeccCCCCccccc
Confidence 1222 111222233 577777766443 11222322222221 234 4456678888888888775
Q ss_pred c
Q psy7015 274 Y 274 (284)
Q Consensus 274 ~ 274 (284)
.
T Consensus 925 ~ 925 (1705)
T KOG1836|consen 925 Y 925 (1705)
T ss_pred c
Confidence 3
No 35
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.99 E-value=0.00055 Score=38.79 Aligned_cols=28 Identities=29% Similarity=0.838 Sum_probs=22.2
Q ss_pred CCCCCCCEEeeCCCCeeeecCCCCccCC
Q psy7015 206 SPCQNGGVCVDLHAAYTCACLFGFTGRN 233 (284)
Q Consensus 206 ~~C~~~g~C~~~~g~~~C~C~~G~~g~~ 233 (284)
..|+.+++|+++.++++|.|++||+|+.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 3588899999999999999999999864
No 36
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.95 E-value=0.00071 Score=34.40 Aligned_cols=20 Identities=40% Similarity=1.047 Sum_probs=13.1
Q ss_pred CeeeecCCCCc----CCCccccccc
Q psy7015 258 EQVCYCVPDYH----GNRCQYQYDE 278 (284)
Q Consensus 258 ~~~C~C~~G~~----G~~C~~~~~~ 278 (284)
+|.|.|++||. |..|+ ||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~-DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCE-DIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccc-cCCC
Confidence 46777777775 45675 6665
No 37
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.93 E-value=0.00085 Score=34.11 Aligned_cols=11 Identities=64% Similarity=1.352 Sum_probs=8.3
Q ss_pred ceEEeCCCCCc
Q psy7015 46 SYTCYCIDGYT 56 (284)
Q Consensus 46 ~~~C~C~~G~~ 56 (284)
||+|+|++||.
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 57788888876
No 38
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.91 E-value=0.00046 Score=29.69 Aligned_cols=13 Identities=38% Similarity=1.242 Sum_probs=7.9
Q ss_pred eeecCCCCcCCCc
Q psy7015 260 VCYCVPDYHGNRC 272 (284)
Q Consensus 260 ~C~C~~G~~G~~C 272 (284)
+|+|++||+|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 3667777777665
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.79 E-value=0.002 Score=35.45 Aligned_cols=26 Identities=38% Similarity=0.925 Sum_probs=22.2
Q ss_pred CCCCCCEEeeCCCCeeeecCCCCccCCC
Q psy7015 207 PCQNGGVCVDLHAAYTCACLFGFTGRNC 234 (284)
Q Consensus 207 ~C~~~g~C~~~~g~~~C~C~~G~~g~~C 234 (284)
.|.++|+|+... .+|.|.+||+|..|
T Consensus 7 ~C~~~G~C~~~~--g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPC--GRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCC--CEEECCCCCcCCCC
Confidence 589999998763 38999999999876
No 40
>smart00051 DSL delta serrate ligand.
Probab=95.60 E-value=0.02 Score=36.94 Aligned_cols=46 Identities=20% Similarity=0.503 Sum_probs=33.4
Q ss_pred eeeecCCCCccCCCcccCCcCCC-CCCCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015 221 YTCACLFGFTGRNCDIELKICEN-SPCLNEALCLEEEEEQVCYCVPDYHGNRC 272 (284)
Q Consensus 221 ~~C~C~~G~~g~~C~~~~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~G~~C 272 (284)
+.-.|.++|.|..|+. .|.+ .-...+..|.. .+.++|.+||+|..|
T Consensus 17 ~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE---NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCC---EeCcCccccCCccCCc---CCCEecCCCCcCCCC
Confidence 4558999999999974 3322 12566788843 356999999999987
No 41
>KOG3512|consensus
Probab=95.59 E-value=0.079 Score=47.28 Aligned_cols=87 Identities=22% Similarity=0.487 Sum_probs=47.1
Q ss_pred eE-eCCCCCccCCCCCccCCCCCCCCCCCCC----CCEEeeCCCCeeeecCCCCccCCCccc----------CCcCCCCC
Q psy7015 181 VC-SCHPGFTGWTGSLCQSATNECESSPCQN----GGVCVDLHAAYTCACLFGFTGRNCDIE----------LKICENSP 245 (284)
Q Consensus 181 ~C-~C~~g~~g~~~~~c~~~~~~C~~~~C~~----~g~C~~~~g~~~C~C~~G~~g~~C~~~----------~~~C~~~~ 245 (284)
+| .|++||.-..+..- .+...|..-.|+. +-+|..+.| +|.|.+|.+|..|... +.+|...|
T Consensus 372 hChyCreGyyRd~s~pl-~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnrCa~gyqqsrs~vapcik~p 448 (592)
T KOG3512|consen 372 HCHYCREGYYRDGSKPL-THRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNRCAPGYQQSRSPVAPCIKIP 448 (592)
T ss_pred ccccccCccccCCCCCC-chhhhhhhcCCcccccccccccccCC--cccCCCCCcccccccccchhhcccCCCcCceecC
Confidence 45 48888765322111 1122333333443 345665555 8999999999888631 12222111
Q ss_pred ------CCCCCEEeecCCCeeeecCCCCcCCCccc
Q psy7015 246 ------CLNEALCLEEEEEQVCYCVPDYHGNRCQY 274 (284)
Q Consensus 246 ------C~~~~~C~~~~~~~~C~C~~G~~G~~C~~ 274 (284)
++++.+ .....+.|+.++.|.+++.
T Consensus 449 ~~~~~~~~s~ve----~qd~~s~Ck~~~~~~r~n~ 479 (592)
T KOG3512|consen 449 TDAPTLGSSGVE----PQDQCSKCKASPGGKRLNQ 479 (592)
T ss_pred CCCccccCCCCc----chhccccCCCCCcceeccc
Confidence 222222 2344578999998887764
No 42
>KOG1218|consensus
Probab=95.55 E-value=1.1 Score=38.76 Aligned_cols=182 Identities=30% Similarity=0.679 Sum_probs=82.4
Q ss_pred CCceEEeCCCCCccC-CCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCC
Q psy7015 44 PSSYTCYCIDGYTGV-HCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNG 122 (284)
Q Consensus 44 ~g~~~C~C~~G~~g~-~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~ 122 (284)
..+..|.|.+||.|. .+.. .... .++...-.+ .....+|.+..+|.+..|..... . .. .++.|..
T Consensus 12 ~~~~~c~c~~~~~g~~~~~~-~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~c~~~~~-~--~~--~~~~c~~---- 77 (316)
T KOG1218|consen 12 GGSGQCFCDPGYTGRLQCEH-QAVT--SACSGICPC--EVNSGECGLGYGFVGSVCRIECV-C--GN--AGGGCSQ---- 77 (316)
T ss_pred CCCCceecCCCccccccccC-CCCC--ccccccCCc--cCCceeEecccccCCCccccccc-c--CC--CCCcccC----
Confidence 356789999999995 2221 1111 111111111 22344688899999887653211 1 00 1222221
Q ss_pred ceEeCCCCCeeeeecCCCCeeeeC-CCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCCCccC---
Q psy7015 123 FVCSCHPGFTGNCIDGIAAYNCSC-PPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQS--- 198 (284)
Q Consensus 123 ~~C~C~~g~~g~c~~~~~~~~C~C-~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~--- 198 (284)
.+.|..++.-. .....+ ..+|.|..|.. ..++... |.. .+|.+... .|.+..+|.+ ..|..
T Consensus 78 -~~~c~~~~~~~------~~~~~~~~~~~~g~~C~~-~~~~~~~-c~~-~~C~~~~~--~c~~~~~~~~---~~C~~~~~ 142 (316)
T KOG1218|consen 78 -PCRCKNGGTCV------SSTGYCHLNGYEGPQCES-PCPCGDG-CAE-KTCANPRR--ECRCGGGYIG---EQCGEENL 142 (316)
T ss_pred -ccccCCCCccc------CCCCcccCCCCCcccccC-CCCcCCc-ccc-cccCCCcc--ceecCCcCcc---ccccccCC
Confidence 11122222111 111123 46777777763 3333222 222 34443322 4555555544 33332
Q ss_pred CCCCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCCcCCC-CCCCCCCEEeecCC
Q psy7015 199 ATNECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELKICEN-SPCLNEALCLEEEE 257 (284)
Q Consensus 199 ~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~~C~~-~~C~~~~~C~~~~~ 257 (284)
....|... |.....+... ...|.|.+||+|..+......|.. ..+.+++.|....+
T Consensus 143 ~g~~C~~~-c~~~~~~~~~--~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 143 VGLKCQRD-CQCTGGCDCK--NGICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTG 199 (316)
T ss_pred CCCCccCC-CCCccccCCC--CCceeccCCcccccccccCCCcCCCcccCCCCeeecccc
Confidence 11112111 2111222212 236889999999988754433442 34666667755443
No 43
>smart00051 DSL delta serrate ligand.
Probab=94.28 E-value=0.094 Score=33.80 Aligned_cols=47 Identities=26% Similarity=0.605 Sum_probs=32.5
Q ss_pred ceEEeCCCCCccCCCCCCCCCCCC-CCCCCCCeEecCCCCeeeeCCCCCcCCCc
Q psy7015 46 SYTCYCIDGYTGVHCQTNWDECWS-NPCHNGGSCIDGIAAYNCSCPPGYTGPSC 98 (284)
Q Consensus 46 ~~~C~C~~G~~g~~C~~~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~G~~C 98 (284)
.+.=.|.++|.|..|+. .|.+ .....+.+|.. .| .+.|.+||+|..|
T Consensus 16 ~~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCC---EeCcCccccCCccCCc-CC--CEecCCCCcCCCC
Confidence 34557999999999974 3332 12456677754 34 4899999999876
No 44
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.86 E-value=0.037 Score=31.27 Aligned_cols=18 Identities=39% Similarity=0.855 Sum_probs=8.3
Q ss_pred ceecCCCceEEeCCCCCc
Q psy7015 39 PIFAVPSSYTCYCIDGYT 56 (284)
Q Consensus 39 ~~~~~~g~~~C~C~~G~~ 56 (284)
.+.+++++|+|.|++||.
T Consensus 11 ~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEEEETTSEEEE-STTEE
T ss_pred CCccCCCceEeECCCCCE
Confidence 344445555555555554
No 45
>KOG3512|consensus
Probab=93.43 E-value=0.19 Score=44.93 Aligned_cols=105 Identities=24% Similarity=0.605 Sum_probs=56.1
Q ss_pred CCCCC-EeccCCC-CeeEeCCCCCccCCCCCccC-------------CCCCCCCCCCCC-------------------CC
Q psy7015 167 CQNNG-TCHDLLN-GFVCSCHPGFTGWTGSLCQS-------------ATNECESSPCQN-------------------GG 212 (284)
Q Consensus 167 C~~~~-~C~~~~g-~~~C~C~~g~~g~~~~~c~~-------------~~~~C~~~~C~~-------------------~g 212 (284)
|..++ .|+.... ..+|.|..+-.|.+...|.. +.++|....|.. +|
T Consensus 280 CNgHAs~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~Sgg 359 (592)
T KOG3512|consen 280 CNGHASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGG 359 (592)
T ss_pred ecCccceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccc
Confidence 44333 4665444 48888888877744333321 234444333322 34
Q ss_pred EEee----CCCCeee-ecCCCCccCCCcc--cCCcCCCCCCCC----CCEEeecCCCeeeecCCCCcCCCccc
Q psy7015 213 VCVD----LHAAYTC-ACLFGFTGRNCDI--ELKICENSPCLN----EALCLEEEEEQVCYCVPDYHGNRCQY 274 (284)
Q Consensus 213 ~C~~----~~g~~~C-~C~~G~~g~~C~~--~~~~C~~~~C~~----~~~C~~~~~~~~C~C~~G~~G~~C~~ 274 (284)
+|+| +.|. .| .|++||+-+.-.. .-..|....|+. +-+| +..+++|.|++|.+|..|..
T Consensus 360 vClnCrHnTaGr-hChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktC--Nq~tGqCpCkeGvtG~tCnr 429 (592)
T KOG3512|consen 360 VCLNCRHNTAGR-HCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTC--NQTTGQCPCKEGVTGLTCNR 429 (592)
T ss_pred eEeecccCCCCc-ccccccCccccCCCCCCchhhhhhhcCCcccccccccc--cccCCcccCCCCCccccccc
Confidence 5543 3332 34 5888887433221 112233333433 3456 44577899999999998853
No 46
>KOG1218|consensus
Probab=92.95 E-value=5.5 Score=34.30 Aligned_cols=148 Identities=30% Similarity=0.724 Sum_probs=68.4
Q ss_pred CceEEeCCCCCccCCCCCCCC-CCCCCCCCCCCeEecCC--CCeeeeC-CCCCcCCCcccCCCCCCCCCCCCCCEEeeCC
Q psy7015 45 SSYTCYCIDGYTGVHCQTNWD-ECWSNPCHNGGSCIDGI--AAYNCSC-PPGYTGPSCESNVDECGSNPCQNNGTCHDLL 120 (284)
Q Consensus 45 g~~~C~C~~G~~g~~C~~~~~-~C~~~~C~~~g~C~~~~--g~~~C~C-~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~ 120 (284)
.+..|.+..+|.+..|..... ......|.....|.... ..+...| ..+|.|..|+. ..++... |.. ..|.+..
T Consensus 47 ~~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~g~~C~~-~~~~~~~-c~~-~~C~~~~ 123 (316)
T KOG1218|consen 47 NSGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKNGGTCVSSTGYCHLNGYEGPQCES-PCPCGDG-CAE-KTCANPR 123 (316)
T ss_pred CceeEecccccCCCccccccccCCCCCcccCccccCCCCcccCCCCcccCCCCCcccccC-CCCcCCc-ccc-cccCCCc
Confidence 456788999999888764211 11222244444443221 1222344 68888888863 2333222 222 3444322
Q ss_pred CCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCC
Q psy7015 121 NGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSAT 200 (284)
Q Consensus 121 ~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~ 200 (284)
. .|.+..+|.+. .|.. +++.|..|..+ |.....+.. ....|.|.+||.+ ..+....
T Consensus 124 ~--~c~~~~~~~~~--------~C~~-~~~~g~~C~~~--------c~~~~~~~~--~~~~c~c~~g~~g---~~~~~~~ 179 (316)
T KOG1218|consen 124 R--ECRCGGGYIGE--------QCGE-ENLVGLKCQRD--------CQCTGGCDC--KNGICTCQPGFVG---VFCVESC 179 (316)
T ss_pred c--ceecCCcCccc--------cccc-cCCCCCCccCC--------CCCccccCC--CCCceeccCCccc---ccccccC
Confidence 1 34444444322 1111 25556666422 111111211 1236778888877 4443222
Q ss_pred CCCC-CCCCCCCCEEeeCCC
Q psy7015 201 NECE-SSPCQNGGVCVDLHA 219 (284)
Q Consensus 201 ~~C~-~~~C~~~g~C~~~~g 219 (284)
..|. ...+.+++.|....+
T Consensus 180 ~~c~~~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 180 SGCSPLTACENGAKCNRSTG 199 (316)
T ss_pred CCcCCCcccCCCCeeecccc
Confidence 2233 234556667776554
No 47
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.82 E-value=0.096 Score=29.57 Aligned_cols=20 Identities=30% Similarity=0.850 Sum_probs=16.3
Q ss_pred CEEeeCCCCeeeecCCCCcc
Q psy7015 212 GVCVDLHAAYTCACLFGFTG 231 (284)
Q Consensus 212 g~C~~~~g~~~C~C~~G~~g 231 (284)
..|++++++|+|.|++||+-
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE
T ss_pred CCCccCCCceEeECCCCCEE
Confidence 38899999999999999974
No 48
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.50 E-value=0.082 Score=29.80 Aligned_cols=26 Identities=23% Similarity=0.682 Sum_probs=13.4
Q ss_pred CCCCCCCCEEeecC-CCeeeecCCCCc
Q psy7015 243 NSPCLNEALCLEEE-EEQVCYCVPDYH 268 (284)
Q Consensus 243 ~~~C~~~~~C~~~~-~~~~C~C~~G~~ 268 (284)
...|..++.|++.. ++..|.|..||.
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 34456666665544 666666666664
No 49
>PHA02887 EGF-like protein; Provisional
Probab=92.30 E-value=0.13 Score=36.93 Aligned_cols=28 Identities=32% Similarity=0.966 Sum_probs=20.6
Q ss_pred CCCCCEEee--cCCCeeeecCCCCcCCCccc
Q psy7015 246 CLNEALCLE--EEEEQVCYCVPDYHGNRCQY 274 (284)
Q Consensus 246 C~~~~~C~~--~~~~~~C~C~~G~~G~~C~~ 274 (284)
|. +|+|.- ......|.|.+||+|.+|+.
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 55 467754 34557799999999999984
No 50
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=91.92 E-value=0.15 Score=37.31 Aligned_cols=28 Identities=36% Similarity=0.891 Sum_probs=20.8
Q ss_pred CCCCCEEee--cCCCeeeecCCCCcCCCccc
Q psy7015 246 CLNEALCLE--EEEEQVCYCVPDYHGNRCQY 274 (284)
Q Consensus 246 C~~~~~C~~--~~~~~~C~C~~G~~G~~C~~ 274 (284)
|.+ |+|.- +...+.|.|..||+|.+||.
T Consensus 53 ClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 53 CLH-GDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred eEC-CEEEeeccCCCceeECCCCcccccccc
Confidence 554 47754 34677899999999999985
No 51
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=91.70 E-value=0.068 Score=30.13 Aligned_cols=27 Identities=22% Similarity=0.580 Sum_probs=15.9
Q ss_pred CCCCCCCCCeEecCC-CCeeeeCCCCCc
Q psy7015 68 WSNPCHNGGSCIDGI-AAYNCSCPPGYT 94 (284)
Q Consensus 68 ~~~~C~~~g~C~~~~-g~~~C~C~~G~~ 94 (284)
....|..++.|++.. |+++|.|.+||.
T Consensus 3 ~~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 3 IDTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred cCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 334566777777654 777788888775
No 52
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=89.34 E-value=0.46 Score=39.09 Aligned_cols=38 Identities=29% Similarity=0.519 Sum_probs=25.6
Q ss_pred cCCCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcC
Q psy7015 57 GVHCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTG 95 (284)
Q Consensus 57 g~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G 95 (284)
+..|+ +.++|...+......|.++.|+|.|.|++||+.
T Consensus 181 ~~~C~-~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICV-VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cccCc-CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 34454 566775443333357888888888888888874
No 53
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=89.34 E-value=0.25 Score=29.90 Aligned_cols=22 Identities=32% Similarity=0.759 Sum_probs=15.3
Q ss_pred EEeecCCCeeeecCCCCcCCCccc
Q psy7015 251 LCLEEEEEQVCYCVPDYHGNRCQY 274 (284)
Q Consensus 251 ~C~~~~~~~~C~C~~G~~G~~C~~ 274 (284)
.|.. .+++|.|+++|+|.+|+.
T Consensus 12 ~C~~--~~G~C~C~~~~~G~~C~~ 33 (49)
T PF00053_consen 12 TCDP--STGQCVCKPGTTGPRCDQ 33 (49)
T ss_dssp SEEE--TCEEESBSTTEESTTS-E
T ss_pred cccC--CCCEEeccccccCCcCcC
Confidence 5533 556788888888888864
No 54
>PHA02887 EGF-like protein; Provisional
Probab=88.52 E-value=0.51 Score=33.96 Aligned_cols=28 Identities=36% Similarity=1.010 Sum_probs=21.9
Q ss_pred CCCCCeEec--CCCCeeeeCCCCCcCCCccc
Q psy7015 72 CHNGGSCID--GIAAYNCSCPPGYTGPSCES 100 (284)
Q Consensus 72 C~~~g~C~~--~~g~~~C~C~~G~~G~~C~~ 100 (284)
|- +|+|.- ....+.|.|+.||+|.+|+.
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 55 578873 34567899999999999973
No 55
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=87.88 E-value=0.54 Score=28.64 Aligned_cols=17 Identities=35% Similarity=0.917 Sum_probs=13.9
Q ss_pred CCeeeecCCCCcCCCcc
Q psy7015 257 EEQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 257 ~~~~C~C~~G~~G~~C~ 273 (284)
.+++|.|+++|+|.+|+
T Consensus 17 ~~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 17 GTGQCECKPNTTGRRCD 33 (50)
T ss_pred CCCEEeCCCcCCCCCCC
Confidence 45678899999999886
No 56
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=87.77 E-value=0.32 Score=29.88 Aligned_cols=36 Identities=22% Similarity=0.541 Sum_probs=18.5
Q ss_pred CCCCCCCEEee----cCCCeeeecCCCCcCCCcccccccc
Q psy7015 244 SPCLNEALCLE----EEEEQVCYCVPDYHGNRCQYQYDEC 279 (284)
Q Consensus 244 ~~C~~~~~C~~----~~~~~~C~C~~G~~G~~C~~~~~~C 279 (284)
.+|+.||.... ..+...|.|..-|.|.+|+..+..|
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~~C 56 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIPNC 56 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-TT-
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcccCCCCC
Confidence 35777776642 3455779999999999998766544
No 57
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=87.70 E-value=0.16 Score=32.68 Aligned_cols=47 Identities=28% Similarity=0.648 Sum_probs=19.7
Q ss_pred ceEEeCCCCCccCCCCCCCCCCCCC-CCCCCCeEecCCCCeeeeCCCCCcCCCc
Q psy7015 46 SYTCYCIDGYTGVHCQTNWDECWSN-PCHNGGSCIDGIAAYNCSCPPGYTGPSC 98 (284)
Q Consensus 46 ~~~C~C~~G~~g~~C~~~~~~C~~~-~C~~~g~C~~~~g~~~C~C~~G~~G~~C 98 (284)
.+.-.|.+.|.|..|.. -|.+. .-..+-+|.. .|. =.|.+||+|..|
T Consensus 16 ~~rv~C~~nyyG~~C~~---~C~~~~d~~ghy~Cd~-~G~--~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSK---FCKPRDDSFGHYTCDS-NGN--KVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETTTT-E---E---EEETTEEEEE-S-S----EEE-TTEESTTS
T ss_pred EEEEECCCCCCCccccC---CcCCCcCCcCCcccCC-CCC--CCCCCCCcCCCC
Confidence 45668999999998874 22221 0122334442 232 478899998876
No 58
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=85.91 E-value=0.78 Score=33.62 Aligned_cols=28 Identities=43% Similarity=1.098 Sum_probs=21.5
Q ss_pred CCCCCeEec--CCCCeeeeCCCCCcCCCccc
Q psy7015 72 CHNGGSCID--GIAAYNCSCPPGYTGPSCES 100 (284)
Q Consensus 72 C~~~g~C~~--~~g~~~C~C~~G~~G~~C~~ 100 (284)
|-+ |.|.- ....+.|.|..||+|.+|+.
T Consensus 53 ClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 53 CLH-GDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred eEC-CEEEeeccCCCceeECCCCcccccccc
Confidence 444 47863 34678899999999999973
No 59
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.04 E-value=1.6 Score=35.83 Aligned_cols=38 Identities=21% Similarity=0.552 Sum_probs=27.9
Q ss_pred CCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCcc
Q psy7015 152 GPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTG 190 (284)
Q Consensus 152 g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g 190 (284)
+..|. +.++|...+......|.++.|+|.|.|++||+.
T Consensus 181 ~~~C~-~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICV-VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cccCc-CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 45565 567775433222358999999999999999986
No 60
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=82.33 E-value=1.1 Score=27.07 Aligned_cols=22 Identities=36% Similarity=0.709 Sum_probs=16.9
Q ss_pred CEEeeCCCCeeeecCCCCccCCCc
Q psy7015 212 GVCVDLHAAYTCACLFGFTGRNCD 235 (284)
Q Consensus 212 g~C~~~~g~~~C~C~~G~~g~~C~ 235 (284)
..|.... .+|.|+++|+|..|+
T Consensus 11 ~~C~~~~--G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPST--GQCVCKPGTTGPRCD 32 (49)
T ss_dssp SSEEETC--EEESBSTTEESTTS-
T ss_pred CcccCCC--CEEeccccccCCcCc
Confidence 3676644 489999999999997
No 61
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=80.24 E-value=1.6 Score=26.08 Aligned_cols=16 Identities=38% Similarity=1.007 Sum_probs=11.5
Q ss_pred CeeeecCCCCcCCCcc
Q psy7015 258 EQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 258 ~~~C~C~~G~~G~~C~ 273 (284)
+++|.|+++|+|.+|+
T Consensus 17 ~G~C~C~~~~~G~~C~ 32 (46)
T smart00180 17 TGQCECKPNVTGRRCD 32 (46)
T ss_pred CCEEECCCCCCCCCCC
Confidence 4567777777777775
No 62
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=79.92 E-value=1.9 Score=26.19 Aligned_cols=20 Identities=40% Similarity=0.821 Sum_probs=15.5
Q ss_pred EeeCCCCeeeecCCCCccCCCc
Q psy7015 214 CVDLHAAYTCACLFGFTGRNCD 235 (284)
Q Consensus 214 C~~~~g~~~C~C~~G~~g~~C~ 235 (284)
|....| +|.|+++|+|..|+
T Consensus 14 C~~~~G--~C~C~~~~~G~~C~ 33 (50)
T cd00055 14 CDPGTG--QCECKPNTTGRRCD 33 (50)
T ss_pred ccCCCC--EEeCCCcCCCCCCC
Confidence 543333 89999999999996
No 63
>KOG3516|consensus
Probab=78.39 E-value=2 Score=43.18 Aligned_cols=39 Identities=36% Similarity=1.053 Sum_probs=34.6
Q ss_pred CCCCCCCCCCCCCCeEecCCCCeeeeCC-CCCcCCCcccC
Q psy7015 63 NWDECWSNPCHNGGSCIDGIAAYNCSCP-PGYTGPSCESN 101 (284)
Q Consensus 63 ~~~~C~~~~C~~~g~C~~~~g~~~C~C~-~G~~G~~C~~~ 101 (284)
.++.|.+++|.++|.|......|.|.|. .||+|..|...
T Consensus 544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts 583 (1306)
T KOG3516|consen 544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS 583 (1306)
T ss_pred cccccCCccccCCCcccccccceeEeccccccccccccCC
Confidence 4678999999999999998889999997 99999999753
No 64
>KOG3516|consensus
Probab=75.53 E-value=2.4 Score=42.57 Aligned_cols=41 Identities=24% Similarity=0.596 Sum_probs=31.7
Q ss_pred CCcCCCCCCCCCCEEeecCCCeeeecC-CCCcCCCccccccc
Q psy7015 238 LKICENSPCLNEALCLEEEEEQVCYCV-PDYHGNRCQYQYDE 278 (284)
Q Consensus 238 ~~~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~G~~C~~~~~~ 278 (284)
.+.|.+.+|..+|.|...-..++|.|. .||.|..|...|-|
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e 586 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE 586 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence 467778888888888777777888887 88888888766544
No 65
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=67.92 E-value=6.4 Score=21.70 Aligned_cols=22 Identities=27% Similarity=0.415 Sum_probs=14.5
Q ss_pred EeccCCCCeeEeCCCCCccCCCC
Q psy7015 172 TCHDLLNGFVCSCHPGFTGWTGS 194 (284)
Q Consensus 172 ~C~~~~g~~~C~C~~g~~g~~~~ 194 (284)
.|..... ..|.|++||....+.
T Consensus 11 ~CDpn~~-~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 11 DCDPNSP-GQCFCPEGYILDEGS 32 (34)
T ss_pred ccCCCCC-CceeCCCceEecCCc
Confidence 5554332 489999999875443
No 66
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=67.78 E-value=7.5 Score=27.89 Aligned_cols=31 Identities=32% Similarity=0.720 Sum_probs=22.5
Q ss_pred CCCCC-CCCCCCCCeEecCCCCeeeeCCCCCcC
Q psy7015 64 WDECW-SNPCHNGGSCIDGIAAYNCSCPPGYTG 95 (284)
Q Consensus 64 ~~~C~-~~~C~~~g~C~~~~g~~~C~C~~G~~G 95 (284)
.+.|. ...|+..|.|... ....|.|.+||.-
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSN-NSPKCSCLPGFEP 108 (110)
T ss_pred ccCCCCccccCCccEeCCC-CCCceECCCCcCC
Confidence 35565 4669999999643 4557999999964
No 67
>KOG3514|consensus
Probab=67.28 E-value=4 Score=40.70 Aligned_cols=35 Identities=46% Similarity=1.162 Sum_probs=31.8
Q ss_pred CCCCCCCCCCCeEecCCCCeeeeC-CCCCcCCCccc
Q psy7015 66 ECWSNPCHNGGSCIDGIAAYNCSC-PPGYTGPSCES 100 (284)
Q Consensus 66 ~C~~~~C~~~g~C~~~~g~~~C~C-~~G~~G~~C~~ 100 (284)
.|.++||.++|.|...-..|.|.| ..||.|+.|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 689999999999999999999999 46899999984
No 68
>KOG3514|consensus
Probab=62.74 E-value=5.9 Score=39.60 Aligned_cols=36 Identities=33% Similarity=0.851 Sum_probs=32.1
Q ss_pred cCCCCCCCCCCEEeecCCCeeeec-CCCCcCCCcccc
Q psy7015 240 ICENSPCLNEALCLEEEEEQVCYC-VPDYHGNRCQYQ 275 (284)
Q Consensus 240 ~C~~~~C~~~~~C~~~~~~~~C~C-~~G~~G~~C~~~ 275 (284)
.|+..||.|+|.|...-..+.|.| ..||.|..||..
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccce
Confidence 789999999999988888999999 579999999864
No 69
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=59.68 E-value=9.6 Score=27.11 Aligned_cols=9 Identities=33% Similarity=0.752 Sum_probs=6.4
Q ss_pred CCcCCCccc
Q psy7015 266 DYHGNRCQY 274 (284)
Q Consensus 266 G~~G~~C~~ 274 (284)
.|.|+.|+.
T Consensus 53 ~W~G~aCqK 61 (103)
T PF12955_consen 53 HWGGPACQK 61 (103)
T ss_pred eeccccccc
Confidence 577777874
No 70
>KOG3509|consensus
Probab=53.43 E-value=29 Score=34.84 Aligned_cols=71 Identities=28% Similarity=0.658 Sum_probs=51.0
Q ss_pred CCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCCcCCCCC-CCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015 201 NECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELKICENSP-CLNEALCLEEEEEQVCYCVPDYHGNRC 272 (284)
Q Consensus 201 ~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~-C~~~~~C~~~~~~~~C~C~~G~~G~~C 272 (284)
+.|...++...+.|..+.....|.|++||+|..|+...+.+...+ =.-.++|....+.....|.+| .|...
T Consensus 407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg-~g~~~ 478 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG-AGAPT 478 (964)
T ss_pred CccccccCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC-CCCcc
Confidence 355666777778888777778899999999999987666665432 223466766655667788888 66655
No 71
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=52.88 E-value=18 Score=25.91 Aligned_cols=24 Identities=21% Similarity=0.721 Sum_probs=11.5
Q ss_pred CCCCCCCEEeecCCCeeeecCCCCc
Q psy7015 244 SPCLNEALCLEEEEEQVCYCVPDYH 268 (284)
Q Consensus 244 ~~C~~~~~C~~~~~~~~C~C~~G~~ 268 (284)
..|...+.|.. .....|.|.+||.
T Consensus 84 ~~CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 84 GFCGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred cccCCccEeCC-CCCCceECCCCcC
Confidence 34555555532 2233455555554
No 72
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with Kunitz domains (IPR002223 from INTERPRO).
Probab=46.40 E-value=48 Score=19.93 Aligned_cols=30 Identities=37% Similarity=0.938 Sum_probs=20.2
Q ss_pred cCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCcc
Q psy7015 151 TGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTG 190 (284)
Q Consensus 151 ~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g 190 (284)
.|..|..+ ..|..++.|++ .+|+|++||..
T Consensus 18 ~g~~C~~~------~qC~~~s~C~~----g~C~C~~g~~~ 47 (52)
T PF01683_consen 18 PGESCESD------EQCIGGSVCVN----GRCQCPPGYVE 47 (52)
T ss_pred CCCCCCCc------CCCCCcCEEcC----CEeECCCCCEe
Confidence 35666532 23667788864 48999999865
No 73
>KOG0196|consensus
Probab=45.55 E-value=43 Score=32.98 Aligned_cols=67 Identities=22% Similarity=0.535 Sum_probs=36.7
Q ss_pred CCCCCEEeeCCCCeeeecCCCCc----cCCCccc----------CCcCCCCCCCCCCEEeecCCCeeeecCCCCcCCCcc
Q psy7015 208 CQNGGVCVDLHAAYTCACLFGFT----GRNCDIE----------LKICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQ 273 (284)
Q Consensus 208 C~~~g~C~~~~g~~~C~C~~G~~----g~~C~~~----------~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~ 273 (284)
|...|.-+...| .|.|.+||. +..|+.. ...|. +|..+.. ...+++..|.|..||.-..=+
T Consensus 248 C~~dGeWlvpiG--~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~--~CP~~S~-s~~ega~~C~C~~gyyRA~~D 322 (996)
T KOG0196|consen 248 CSGDGEWLVPIG--GCVCKAGYEEAENGKACQACPPGTYKASQGDSLCL--PCPPNSH-SSSEGATSCTCENGYYRADSD 322 (996)
T ss_pred EcCCCcEEEEcC--ceeecCCCCcccCCCcceeCCCCcccCCCCCCCCC--CCCCCCC-CCCCCCCcccccCCcccCCCC
Confidence 555555544444 688888886 3444411 11222 2332221 124567789999999866555
Q ss_pred cccccc
Q psy7015 274 YQYDEC 279 (284)
Q Consensus 274 ~~~~~C 279 (284)
.+--+|
T Consensus 323 p~~mpC 328 (996)
T KOG0196|consen 323 PPSMPC 328 (996)
T ss_pred CCCCCC
Confidence 444445
No 74
>KOG0196|consensus
Probab=38.91 E-value=84 Score=31.11 Aligned_cols=60 Identities=28% Similarity=0.651 Sum_probs=33.1
Q ss_pred eEEeCCCCCc----cCCCCCC----------CCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCC
Q psy7015 47 YTCYCIDGYT----GVHCQTN----------WDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNP 109 (284)
Q Consensus 47 ~~C~C~~G~~----g~~C~~~----------~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~ 109 (284)
..|.|.+||+ +..|+.. ...|. +|..+..= ..+++..|.|..||+-..-+.....|...|
T Consensus 259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~--~CP~~S~s-~~ega~~C~C~~gyyRA~~Dp~~mpCT~PP 332 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCL--PCPPNSHS-SSEGATSCTCENGYYRADSDPPSMPCTRPP 332 (996)
T ss_pred CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCC--CCCCCCCC-CCCCCCcccccCCcccCCCCCCCCCCCCCC
Confidence 3699999996 4556521 11222 23333321 235667899999998554433333454433
No 75
>KOG3509|consensus
Probab=23.93 E-value=72 Score=32.25 Aligned_cols=43 Identities=30% Similarity=0.862 Sum_probs=32.5
Q ss_pred cCCCCCCCCCCEEeecCCCeeeecCCCCcCCCccccccccccC
Q psy7015 240 ICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQYQYDECQIT 282 (284)
Q Consensus 240 ~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~ 282 (284)
.|...++...+.|-.......|.|++||+|+.|+...+.|...
T Consensus 408 ~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~ 450 (964)
T KOG3509|consen 408 VCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRS 450 (964)
T ss_pred ccccccCCCCccccccccccceeccccccCchhhccCcccccc
Confidence 4445566666777667777889999999999999777776543
No 76
>KOG3607|consensus
Probab=22.18 E-value=67 Score=31.47 Aligned_cols=28 Identities=25% Similarity=0.604 Sum_probs=23.2
Q ss_pred CCCCCCEEeecCCCeeeecCCCCcCCCcccc
Q psy7015 245 PCLNEALCLEEEEEQVCYCVPDYHGNRCQYQ 275 (284)
Q Consensus 245 ~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~ 275 (284)
.|+.+|+|.+. +.|+|.+||.+..|++.
T Consensus 631 ~C~g~GVCnn~---~~ChC~~gwapp~C~~~ 658 (716)
T KOG3607|consen 631 TCNGHGVCNNE---LNCHCEPGWAPPFCFIF 658 (716)
T ss_pred ccCCCcccCCC---cceeeCCCCCCCccccc
Confidence 38899999554 46999999999999864
No 77
>KOG3607|consensus
Probab=20.26 E-value=76 Score=31.11 Aligned_cols=27 Identities=37% Similarity=1.040 Sum_probs=20.1
Q ss_pred CCCCCCeEecCCCCeeeeCCCCCcCCCccc
Q psy7015 71 PCHNGGSCIDGIAAYNCSCPPGYTGPSCES 100 (284)
Q Consensus 71 ~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~ 100 (284)
.|..+|+|.+.. .|+|.+||.+..|+.
T Consensus 631 ~C~g~GVCnn~~---~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 631 TCNGHGVCNNEL---NCHCEPGWAPPFCFI 657 (716)
T ss_pred ccCCCcccCCCc---ceeeCCCCCCCcccc
Confidence 377788886553 588888888888863
Done!