Query psy11059
Match_columns 429
No_of_seqs 331 out of 2310
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 17:01:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11059.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11059hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.8 8.6E-18 1.9E-22 176.0 16.0 115 292-424 1717-1839(2531)
2 KOG4289|consensus 99.7 2E-16 4.3E-21 166.0 16.2 90 9-117 1179-1297(2531)
3 KOG1217|consensus 99.5 1.3E-12 2.7E-17 133.7 24.5 212 175-419 155-389 (487)
4 KOG1217|consensus 99.5 3.2E-12 7E-17 130.7 23.3 298 15-414 93-421 (487)
5 KOG1219|consensus 99.5 1E-13 2.2E-18 150.8 9.2 115 291-424 3864-3979(4289)
6 KOG1219|consensus 99.5 1E-13 2.2E-18 150.8 8.5 114 9-187 3864-3978(4289)
7 KOG1214|consensus 99.3 2.2E-10 4.8E-15 116.4 19.3 129 273-414 808-947 (1289)
8 KOG0994|consensus 99.3 8.6E-11 1.9E-15 122.5 15.1 99 314-423 1033-1147(1758)
9 KOG1214|consensus 99.1 2.9E-10 6.2E-15 115.5 11.0 143 8-184 691-860 (1289)
10 KOG1225|consensus 99.0 3.4E-09 7.3E-14 106.3 10.9 126 249-420 234-365 (525)
11 KOG0994|consensus 98.8 7.1E-08 1.5E-12 101.4 13.9 95 33-131 830-949 (1758)
12 KOG1225|consensus 98.8 6.3E-08 1.4E-12 97.3 12.0 118 3-184 243-365 (525)
13 KOG4260|consensus 98.6 1E-07 2.2E-12 85.7 6.6 131 277-417 131-304 (350)
14 KOG1836|consensus 98.4 1.2E-05 2.5E-10 91.4 16.9 135 272-423 864-1022(1705)
15 PF07645 EGF_CA: Calcium-bindi 98.3 3.5E-07 7.7E-12 60.2 2.0 34 8-42 1-34 (42)
16 KOG1226|consensus 98.2 1.1E-05 2.3E-10 83.1 10.6 144 233-424 467-622 (783)
17 PF00008 EGF: EGF-like domain 97.9 4.9E-06 1.1E-10 51.2 1.6 27 15-42 2-29 (32)
18 KOG4260|consensus 97.9 1.5E-05 3.3E-10 72.0 4.9 96 55-158 138-262 (350)
19 PF12947 EGF_3: EGF domain; I 97.8 6.3E-06 1.4E-10 51.9 0.9 32 12-44 1-32 (36)
20 smart00179 EGF_CA Calcium-bind 97.8 3.2E-05 7E-10 49.7 4.2 32 8-42 1-33 (39)
21 PF00008 EGF: EGF-like domain 97.8 1.1E-05 2.4E-10 49.6 1.4 29 391-419 2-31 (32)
22 smart00179 EGF_CA Calcium-bind 97.8 4E-05 8.6E-10 49.3 4.2 36 387-422 2-39 (39)
23 PF07645 EGF_CA: Calcium-bindi 97.7 1.8E-05 3.9E-10 52.0 1.7 33 61-97 1-34 (42)
24 cd00054 EGF_CA Calcium-binding 97.6 8.3E-05 1.8E-09 47.3 4.1 32 8-42 1-33 (38)
25 KOG1226|consensus 97.5 0.0011 2.4E-08 68.7 11.7 99 16-138 466-589 (783)
26 cd00054 EGF_CA Calcium-binding 97.4 0.00025 5.4E-09 45.0 4.1 36 387-422 2-38 (38)
27 KOG1836|consensus 97.4 0.00072 1.6E-08 77.3 10.1 112 251-385 697-813 (1705)
28 cd00053 EGF Epidermal growth f 97.1 0.00074 1.6E-08 42.1 3.9 26 16-42 5-30 (36)
29 smart00181 EGF Epidermal growt 97.0 0.00094 2E-08 41.7 3.8 28 11-42 1-29 (35)
30 cd00053 EGF Epidermal growth f 96.8 0.0021 4.6E-08 39.9 4.1 30 392-421 5-35 (36)
31 PF12662 cEGF: Complement Clr- 96.8 0.0013 2.8E-08 37.1 2.6 23 32-63 1-23 (24)
32 smart00181 EGF Epidermal growt 96.8 0.0021 4.5E-08 40.1 3.9 28 393-421 6-34 (35)
33 PF07974 EGF_2: EGF-like domai 96.5 0.0047 1E-07 37.7 3.6 23 17-42 6-28 (32)
34 PF12947 EGF_3: EGF domain; I 96.4 0.0018 3.9E-08 40.8 1.5 27 393-419 6-32 (36)
35 PF12662 cEGF: Complement Clr- 96.0 0.0066 1.4E-07 34.3 2.3 10 408-417 2-11 (24)
36 PF12661 hEGF: Human growth fa 95.6 0.0047 1E-07 29.5 0.6 13 409-421 1-13 (13)
37 PF07974 EGF_2: EGF-like domai 95.6 0.017 3.6E-07 35.3 3.1 26 394-421 7-32 (32)
38 KOG3512|consensus 95.4 0.15 3.2E-06 50.3 10.4 163 239-423 285-479 (592)
39 PF06247 Plasmod_Pvs28: Plasmo 95.1 0.0032 6.9E-08 54.4 -1.7 136 267-419 13-162 (197)
40 PF14670 FXa_inhibition: Coagu 94.9 0.012 2.6E-07 37.0 0.9 18 24-42 11-28 (36)
41 KOG1218|consensus 94.4 4.4 9.4E-05 38.9 18.3 65 255-327 140-208 (316)
42 PF06247 Plasmod_Pvs28: Plasmo 93.4 0.024 5.2E-07 49.1 0.2 121 23-158 11-154 (197)
43 smart00051 DSL delta serrate l 92.6 0.21 4.5E-06 35.8 4.1 45 276-331 19-63 (63)
44 KOG3512|consensus 92.4 0.37 8.1E-06 47.6 6.7 109 311-423 288-429 (592)
45 PF14670 FXa_inhibition: Coagu 92.4 0.059 1.3E-06 33.9 0.8 22 72-97 7-28 (36)
46 KOG1218|consensus 92.0 3.3 7.2E-05 39.7 13.1 42 247-288 13-63 (316)
47 PF00053 Laminin_EGF: Laminin 90.9 0.14 3.1E-06 34.5 1.6 29 315-345 15-43 (49)
48 cd00055 EGF_Lam Laminin-type e 90.6 0.39 8.5E-06 32.6 3.5 28 315-344 16-43 (50)
49 smart00051 DSL delta serrate l 90.1 0.44 9.5E-06 34.1 3.6 13 371-383 51-63 (63)
50 smart00180 EGF_Lam Laminin-typ 89.2 0.47 1E-05 31.6 3.0 25 316-342 16-40 (46)
51 PF12946 EGF_MSP1_1: MSP1 EGF 89.1 0.22 4.7E-06 31.3 1.2 30 15-44 3-32 (37)
52 cd00055 EGF_Lam Laminin-type e 88.2 0.71 1.5E-05 31.3 3.4 27 247-286 17-43 (50)
53 PF00053 Laminin_EGF: Laminin 85.7 0.34 7.4E-06 32.6 0.7 27 247-286 16-42 (49)
54 PHA02887 EGF-like protein; Pro 84.7 0.85 1.8E-05 36.4 2.6 31 233-264 92-123 (126)
55 cd01475 vWA_Matrilin VWA_Matri 84.1 1.1 2.4E-05 40.9 3.6 39 52-97 178-217 (224)
56 PF12946 EGF_MSP1_1: MSP1 EGF 83.8 0.47 1E-05 29.8 0.7 26 70-97 4-30 (37)
57 PHA02887 EGF-like protein; Pro 82.8 1.2 2.7E-05 35.5 2.8 29 394-423 93-123 (126)
58 PHA03099 epidermal growth fact 82.2 1.1 2.5E-05 36.3 2.4 29 394-423 52-82 (139)
59 smart00180 EGF_Lam Laminin-typ 82.2 1.7 3.8E-05 28.8 3.0 24 248-284 17-40 (46)
60 KOG3516|consensus 80.5 1.5 3.3E-05 48.4 3.4 42 386-427 544-586 (1306)
61 PHA03099 epidermal growth fact 79.6 1.8 3.9E-05 35.2 2.8 31 234-265 52-83 (139)
62 KOG3516|consensus 78.0 1.9 4.1E-05 47.7 3.2 47 7-68 543-590 (1306)
63 cd01475 vWA_Matrilin VWA_Matri 75.9 3.4 7.4E-05 37.7 4.0 36 380-418 181-218 (224)
64 KOG3514|consensus 74.5 2 4.3E-05 46.9 2.2 36 11-61 625-661 (1591)
65 PF00954 S_locus_glycop: S-loc 69.9 4.5 9.7E-05 32.4 2.9 29 66-97 79-107 (110)
66 PF12955 DUF3844: Domain of un 69.4 2.3 5E-05 33.5 1.0 51 9-60 5-61 (103)
67 PF01414 DSL: Delta serrate li 66.8 1.6 3.4E-05 31.3 -0.4 39 248-287 16-63 (63)
68 PF00954 S_locus_glycop: S-loc 63.0 8.2 0.00018 30.8 3.1 31 9-42 77-107 (110)
69 KOG3514|consensus 60.1 6.5 0.00014 43.2 2.5 36 389-424 625-661 (1591)
70 PF01683 EB: EB module; Inter 57.3 23 0.0005 23.8 4.2 27 10-42 20-46 (52)
71 PF04863 EGF_alliinase: Alliin 48.6 6.8 0.00015 26.9 0.3 35 16-62 16-53 (56)
72 PF09064 Tme5_EGF_like: Thromb 34.5 30 0.00065 21.3 1.5 13 32-44 17-29 (34)
73 KOG3509|consensus 31.0 93 0.002 34.7 5.6 71 350-421 407-478 (964)
74 KOG3509|consensus 21.2 1.8E+02 0.004 32.5 5.7 68 8-95 405-473 (964)
75 KOG0196|consensus 20.5 1.3E+02 0.0029 32.8 4.3 57 248-328 258-318 (996)
No 1
>KOG4289|consensus
Probab=99.76 E-value=8.6e-18 Score=175.96 Aligned_cols=115 Identities=30% Similarity=0.693 Sum_probs=72.5
Q ss_pred CCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCc---ccccCCCccCC-CCccCCCcCCCCCc-CCCCccccC
Q psy11059 292 TGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICE---KCFCRPGFAGD-HCDVDFDECLSNPC-FNGATCQNK 366 (429)
Q Consensus 292 ~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~---~C~C~~g~~g~-~C~~~i~~C~~~~C-~~~~~C~~~ 366 (429)
+.|.- ++|.+.++|... +...+|.|.|++||.|..|+ .=.|+.||+|. .| ..|.-..- .....|..+
T Consensus 1717 ~vC~l-npc~~~g~Cv~s---p~a~GY~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~C----gpC~CavsKgfdp~CnKt 1788 (2531)
T KOG4289|consen 1717 DVCSL-NPCENQGTCVRS---PGAHGYTCECPPGYTGPYCELRADQPCPRGWWGFPTC----GPCNCAVSKGFDPDCNKT 1788 (2531)
T ss_pred chhcc-cccccCceeecC---CCCCceeEECCCcccCcchhhhccCCCCCcccCCCCc----cCccccccCCCCCCcccc
Confidence 34544 889999999632 24578999999999999998 34689999985 22 11210000 123456666
Q ss_pred CCceEEecCCCCCCCCccCCCCCCCCCCCCCCC---EEccCCCCeeeeCCCCCCCCCCCCC
Q psy11059 367 INGYTCVCAPGYSGKECSININECESSPCLHGA---TCIDEVATFSCVCPKGLTGRLCETN 424 (429)
Q Consensus 367 ~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~---~C~~~~~~~~C~C~~g~~G~~C~~~ 424 (429)
.| .|.|+..+.-. +..|.+..|..++ +|. ...+|.|++|-.|+.|+.-
T Consensus 1789 ~G--~CqCKe~hy~~-----~~~Cl~CdC~~Gs~Sr~C~---adGqC~C~pgaiGRqCdrC 1839 (2531)
T KOG4289|consen 1789 NG--QCQCKENHYRP-----IGSCLPCDCYFGSDSRECD---ADGQCPCKPGAIGRQCDRC 1839 (2531)
T ss_pred Cc--ceeeccccccC-----CCcceeeccccCCCccccc---CCCcCCCCCcccccccccc
Confidence 55 78888765321 2223433344332 353 4558999999999988753
No 2
>KOG4289|consensus
Probab=99.71 E-value=2e-16 Score=165.99 Aligned_cols=90 Identities=36% Similarity=0.801 Sum_probs=79.3
Q ss_pred CCCCCCCCCCCCCCCEecc---------------------CCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCC
Q psy11059 9 SSPCDAQRNPCQNGGKCNE---------------------DETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQIC 67 (429)
Q Consensus 9 ~~~C~~~~~~C~~~g~C~~---------------------~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C 67 (429)
.+-| ...||.|..+|+. ...+.++|.|++||+ |.+||+.+| +|
T Consensus 1179 DniC--lrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFT------------gd~CeTeiD--lC 1242 (2531)
T KOG4289|consen 1179 DNIC--LREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFT------------GDYCETEID--LC 1242 (2531)
T ss_pred Cchh--hcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCC------------cccccchhH--hh
Confidence 5679 8999999999985 234578999999999 999999999 99
Q ss_pred CCCCCCCCCCeEeeCCCCCCeeeeCCCCCc--------ccCCCCCCCCCCCCeEecCC
Q psy11059 68 TTAPPCLNGATCRPQLTEQLYECVCPPGYK--------EIRDCTSNPCLNDGVCVWMF 117 (429)
Q Consensus 68 ~~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~--------~~~~C~~~~C~~~g~C~~~~ 117 (429)
. +.+|.++|+|. ...+.|+|.|.+||+ ....|.+..|.++|+|++..
T Consensus 1243 Y-s~pC~nng~C~--srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1243 Y-SGPCGNNGRCR--SREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred h-cCCCCCCCceE--EecCceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence 9 99999999999 999999999999998 34568888899999998765
No 3
>KOG1217|consensus
Probab=99.55 E-value=1.3e-12 Score=133.70 Aligned_cols=212 Identities=38% Similarity=0.991 Sum_probs=159.4
Q ss_pred eecCCcccCCcccCCCCCCC--CCCCCCCCEEeeCCCCe-eeccCCCC--CCCCCCCCCCCCCCCCCCCcEeccCCCCCe
Q psy11059 175 VCVDVYKGRYWELPEIRDCT--SNPCLNDGVCVDEVYKG-RYWELPEI--RDCTSNPCLNDCVNPCQNGGKCNEDETGNY 249 (429)
Q Consensus 175 ~C~~~~~G~~c~~~~~~~C~--~~~C~~~~~C~~~~~~~-C~C~~~g~--~~C~~~~C~~~~c~~C~~~g~C~~~~~~~~ 249 (429)
.|..+|.+..++. ..++|. ..+|.++++|.+..++| |.| +++. ..+... .++++| ... +
T Consensus 155 ~C~~g~~~~~~~~-~~~~C~~~~~~c~~~~~C~~~~~~~~C~c-~~~~~~~~~~~~----------~~~~~c-~~~---~ 218 (487)
T KOG1217|consen 155 SCTEGYEGEPCET-DLDECIQYSSPCQNGGTCVNTGGSYLCSC-PPGYTGSTCETT----------GNGGTC-VDS---V 218 (487)
T ss_pred eeCCCcccccccc-cccccccCCCCcCCCcccccCCCCeeEeC-CCCccCCcCcCC----------CCCceE-ecc---e
Confidence 3999999999886 557898 35699999999999999 999 8772 222211 234567 222 6
Q ss_pred eEeCCCCCcCCCCCC---------CccccCCCCeeeeCCCCCCCCC--CccCCCCCCCCCCCCCCCeeCCccccCCCCCe
Q psy11059 250 DCTCDALHTGDPCKH---------GSCVDKRAGYFCDCPPTYGGKN--CSVELTGCVGPDTCLNGGTCKPYLVDETQHRF 318 (429)
Q Consensus 250 ~C~C~~g~~G~~C~~---------~~C~~~~~~~~C~C~~G~~g~~--c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~ 318 (429)
.|.+..++.+..|+. +.|++..++|+|.|++||.+.. ...+++.|....+|.++++| .+ ..+.|
T Consensus 219 ~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C----~~-~~~~~ 293 (487)
T KOG1217|consen 219 ACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTC----VN-VPGSY 293 (487)
T ss_pred eccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCee----ec-CCCcc
Confidence 789999999887765 5688888899999999999886 23456788873348888999 44 55559
Q ss_pred eeeCCCCccCCCCcccccCCCccCCCCccCCCcC----CCCCcCCCCcc--ccCCCceEEecCCCCCCCCccCCCCCCCC
Q psy11059 319 NCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDEC----LSNPCFNGATC--QNKINGYTCVCAPGYSGKECSININECES 392 (429)
Q Consensus 319 ~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C----~~~~C~~~~~C--~~~~g~~~C~C~~G~~G~~C~~~~~~C~~ 392 (429)
.|.|++||.|..+. .+ .+..+| ...+|.++++| ....+.+.|.|..+|.|..|+...++|..
T Consensus 294 ~C~C~~g~~g~~~~-----------~~-~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~ 361 (487)
T KOG1217|consen 294 RCTCPPGFTGRLCT-----------EC-VDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECAS 361 (487)
T ss_pred eeeCCCCCCCCCCc-----------cc-cccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccC
Confidence 99999999988541 11 133445 33457777788 34445688999999999999744458998
Q ss_pred CCCCCCCEEcc-CCCCeeeeCCCCCCCC
Q psy11059 393 SPCLHGATCID-EVATFSCVCPKGLTGR 419 (429)
Q Consensus 393 ~~C~~~~~C~~-~~~~~~C~C~~g~~G~ 419 (429)
.++..++.|++ ..++|+|.++.+|.+.
T Consensus 362 ~~~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 362 SPCCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred CccccCCEeccCCCCCeEecCCCccccC
Confidence 88999999999 6889999999998874
No 4
>KOG1217|consensus
Probab=99.50 E-value=3.2e-12 Score=130.68 Aligned_cols=298 Identities=34% Similarity=0.776 Sum_probs=194.0
Q ss_pred CCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCCCCC--CCCCCeEeeCCC---CCCee
Q psy11059 15 QRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTTAPP--CLNGATCRPQLT---EQLYE 89 (429)
Q Consensus 15 ~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~~~~--C~~~g~C~~~~~---~~~~~ 89 (429)
...+....+.++ ....++.|.|++||. |..|+... +|. ..+ +...+.|. .. ...+.
T Consensus 93 ~~~~~~~~~~~~-~~~~~~~c~c~~g~~------------~~~~~~~~---~C~-~~~~~~~~~~~c~--~~~~~~~~~~ 153 (487)
T KOG1217|consen 93 RSPCLLLCGECV-DCVGSYECTCPPGYQ------------GTPCEGEC---ECV-TGPGVCCIDGSCS--NGPGSVGPFR 153 (487)
T ss_pred cCCcccCCcccc-CCCCCceeeCCCccc------------cCcCCcce---eec-CCCCCeeCchhhc--CCCCCCCcee
Confidence 344445566777 788899999999999 88776432 265 332 35557777 53 46899
Q ss_pred eeCCCCCc------ccCCCC--CCCCCCCCeEecCCC---ce-eCCeeccCCCCCCCCCCCCCCCCCCCeEeeCCCCccc
Q psy11059 90 CVCPPGYK------EIRDCT--SNPCLNDGVCVWMFD---VT-IQVYKGRYCELPEIGDCSSNPCLNDGVCVDVYKGRYC 157 (429)
Q Consensus 90 C~C~~Gy~------~~~~C~--~~~C~~~g~C~~~~~---C~-~~g~~G~~C~~~~i~~C~~~~C~~~g~C~~~~~g~~C 157 (429)
|.|..||. ..++|. ..+|.+++.|.+..+ |. +++|.|..|+. . .+++.|++. +.|
T Consensus 154 c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~-~---------~~~~~c~~~---~~~ 220 (487)
T KOG1217|consen 154 CSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET-T---------GNGGTCVDS---VAC 220 (487)
T ss_pred eeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC-C---------CCCceEecc---eec
Confidence 99999998 226887 556999999998876 89 99999999985 2 445566654 333
Q ss_pred cCCCCCCCCCCCCCCCceecCCcccCCcccCCCCCCCCCCCCCCCEEeeCCCCe-eeccCCCC--CCC----CCCCCCCC
Q psy11059 158 ELPEIGDCSSNPCLNDGVCVDVYKGRYWELPEIRDCTSNPCLNDGVCVDEVYKG-RYWELPEI--RDC----TSNPCLND 230 (429)
Q Consensus 158 ~~~~~~~C~~~~C~~~~~C~~~~~G~~c~~~~~~~C~~~~C~~~~~C~~~~~~~-C~C~~~g~--~~C----~~~~C~~~ 230 (429)
. +..+|.+..++. .+.++... + ++|++..++| |.+ ++|. ..+ ....|...
T Consensus 221 ~-----------------~~~g~~~~~c~~-~~~~~~~~---~-~~c~~~~~~~~C~~-~~g~~~~~~~~~~~~~~C~~~ 277 (487)
T KOG1217|consen 221 S-----------------CPPGARGPECEV-SIVECASG---D-GTCVNTVGSYTCRC-PEGYTGDACVTCVDVDSCALI 277 (487)
T ss_pred c-----------------CCCCCCCCCccc-ccccccCC---C-CcccccCCceeeeC-CCCccccccceeeeccccCCC
Confidence 3 667788888887 77777655 4 8899988888 998 7772 221 11222211
Q ss_pred CCCCCCCCcEeccCCCCCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCcc
Q psy11059 231 CVNPCQNGGKCNEDETGNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYL 310 (429)
Q Consensus 231 ~c~~C~~~g~C~~~~~~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~ 310 (429)
. +|.++++| .+..+.|.|.|++||+|..| ..+... ..|.+.+.+ ..|.+++.|.
T Consensus 278 ~--~c~~~~~C-~~~~~~~~C~C~~g~~g~~~--~~~~~~-----~~C~~~~~~-------------~~c~~g~~C~--- 331 (487)
T KOG1217|consen 278 A--SCPNGGTC-VNVPGSYRCTCPPGFTGRLC--TECVDV-----DECSPRNAG-------------GPCANGGTCN--- 331 (487)
T ss_pred C--ccCCCCee-ecCCCcceeeCCCCCCCCCC--cccccc-----ccccccccC-------------CcCCCCcccc---
Confidence 1 27777788 55555577777777777766 111110 122222111 3355555551
Q ss_pred ccCCCCCeeeeCCCCccCCCCcccccCCCccCCCCccCCCcCCCCCcCCCCcccc-CCCceEEecCCCCCCC------Cc
Q psy11059 311 VDETQHRFNCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQN-KINGYTCVCAPGYSGK------EC 383 (429)
Q Consensus 311 ~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~-~~g~~~C~C~~G~~G~------~C 383 (429)
.......+.|.|..+|. |..|+...++|...++..++.|++ ..++|.|.++.+|.+. .+
T Consensus 332 ~~~~~~~~~C~c~~~~~--------------g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~ 397 (487)
T KOG1217|consen 332 TLGSFGGFRCACGPGFT--------------GRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGC 397 (487)
T ss_pred cCCCCCCCCcCCCCCCC--------------CCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccc
Confidence 11133455666666655 445542225888888999999999 6899999999999874 12
Q ss_pred cCCCCCCCCCCCCCCCEEccCCCCeeeeCCC
Q psy11059 384 SININECESSPCLHGATCIDEVATFSCVCPK 414 (429)
Q Consensus 384 ~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~ 414 (429)
.++++|.. .+.|++..+++.|. .+
T Consensus 398 -~~~~~c~~-----~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 398 -EDIDECSG-----CGDCVNGPGGGACT-PP 421 (487)
T ss_pred -cccccccC-----CcceeccCCCCccc-cC
Confidence 24444443 55687788888888 66
No 5
>KOG1219|consensus
Probab=99.47 E-value=1e-13 Score=150.82 Aligned_cols=115 Identities=40% Similarity=1.055 Sum_probs=103.4
Q ss_pred CCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCcccccCCCccCCCCccCCCcCCCCCcCCCCccccCCCce
Q psy11059 291 LTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQNKINGY 370 (429)
Q Consensus 291 ~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~~~g~~ 370 (429)
.+.|.+ +||+++|+| .....++|.|.|++-|.|. +|+.++..|.++||..+++|+...++|
T Consensus 3864 ~d~C~~-npCqhgG~C----~~~~~ggy~CkCpsqysG~--------------~CEi~~epC~snPC~~GgtCip~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPCND-NPCQHGGTC----ISQPKGGYKCKCPSQYSGN--------------HCEIDLEPCASNPCLTGGTCIPFYNGF 3924 (4289)
T ss_pred cccccc-CcccCCCEe----cCCCCCceEEeCcccccCc--------------ccccccccccCCCCCCCCEEEecCCCe
Confidence 367888 999999999 4446678999988877655 555789999999999999999999999
Q ss_pred EEecCCCCCCCCccCC-CCCCCCCCCCCCCEEccCCCCeeeeCCCCCCCCCCCCC
Q psy11059 371 TCVCAPGYSGKECSIN-INECESSPCLHGATCIDEVATFSCVCPKGLTGRLCETN 424 (429)
Q Consensus 371 ~C~C~~G~~G~~C~~~-~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~G~~C~~~ 424 (429)
.|.|+.||+|.+|+.+ +++|..++|.++|.|++..|+|.|.|.+||.|..|...
T Consensus 3925 ~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~ 3979 (4289)
T KOG1219|consen 3925 LCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCCAE 3979 (4289)
T ss_pred eEeCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhcccCccc
Confidence 9999999999999988 99999999999999999999999999999999998643
No 6
>KOG1219|consensus
Probab=99.46 E-value=1e-13 Score=150.83 Aligned_cols=114 Identities=37% Similarity=0.885 Sum_probs=103.3
Q ss_pred CCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCCCCCCCCCCeEeeCCCCCCe
Q psy11059 9 SSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTTAPPCLNGATCRPQLTEQLY 88 (429)
Q Consensus 9 ~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~~~~C~~~g~C~~~~~~~~~ 88 (429)
.|+| ..+||+++|+|+....|.|+|.|++-|+ |..||+++. +|. ++||..+|+|+ ...+.|
T Consensus 3864 ~d~C--~~npCqhgG~C~~~~~ggy~CkCpsqys------------G~~CEi~~e--pC~-snPC~~GgtCi--p~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPC--NDNPCQHGGTCISQPKGGYKCKCPSQYS------------GNHCEIDLE--PCA-SNPCLTGGTCI--PFYNGF 3924 (4289)
T ss_pred cccc--ccCcccCCCEecCCCCCceEEeCccccc------------Ccccccccc--ccc-CCCCCCCCEEE--ecCCCe
Confidence 4889 7999999999995556889999999999 999999999 999 99999999999 888999
Q ss_pred eeeCCCCCcccCCCCCCCCCCCCeEecCCCceeCCeeccCCCCCC-CCCCCCCCCCCCCeEeeCCCCccccCCCCCCCCC
Q psy11059 89 ECVCPPGYKEIRDCTSNPCLNDGVCVWMFDVTIQVYKGRYCELPE-IGDCSSNPCLNDGVCVDVYKGRYCELPEIGDCSS 167 (429)
Q Consensus 89 ~C~C~~Gy~~~~~C~~~~C~~~g~C~~~~~C~~~g~~G~~C~~~~-i~~C~~~~C~~~g~C~~~~~g~~C~~~~~~~C~~ 167 (429)
.|.|+.| |+|.+||. + |++|..++|.++|.|++..+.+.|.
T Consensus 3925 ~CnC~~g----------------------------yTG~~Ce~-~Gi~eCs~n~C~~gg~C~n~~gsf~Cn--------- 3966 (4289)
T KOG1219|consen 3925 LCNCPNG----------------------------YTGKRCEA-RGISECSKNVCGTGGQCINIPGSFHCN--------- 3966 (4289)
T ss_pred eEeCCCC----------------------------ccCceeec-ccccccccccccCCceeeccCCceEec---------
Confidence 9999877 77888988 5 9999999999999999999999998
Q ss_pred CCCCCCceecCCcccCCccc
Q psy11059 168 NPCLNDGVCVDVYKGRYWEL 187 (429)
Q Consensus 168 ~~C~~~~~C~~~~~G~~c~~ 187 (429)
|..+|.|..|..
T Consensus 3967 --------cT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3967 --------CTPGILGRTCCA 3978 (4289)
T ss_pred --------cChhHhcccCcc
Confidence 888898888754
No 7
>KOG1214|consensus
Probab=99.29 E-value=2.2e-10 Score=116.35 Aligned_cols=129 Identities=22% Similarity=0.462 Sum_probs=71.4
Q ss_pred CeeeeCCCCCCCC--CCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCcccccCCCcc-CCCCccCC
Q psy11059 273 GYFCDCPPTYGGK--NCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPGFA-GDHCDVDF 349 (429)
Q Consensus 273 ~~~C~C~~G~~g~--~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~-g~~C~~~i 349 (429)
.|.|.|.|||.|+ .|. +.++|.+ +.|...++| ++ +.+++.|+|.+||.|+.- +|.|+-. -..|+...
T Consensus 808 ~y~C~CLPGfsGDG~~c~-dvDeC~p-srChp~A~C----yn-tpgsfsC~C~pGy~GDGf---~CVP~~~~~T~C~~er 877 (1289)
T KOG1214|consen 808 TYSCACLPGFSGDGHQCT-DVDECSP-SRCHPAATC----YN-TPGSFSCRCQPGYYGDGF---QCVPDTSSLTPCEQER 877 (1289)
T ss_pred eEEEeecCCccCCccccc-cccccCc-cccCCCceE----ec-CCCcceeecccCccCCCc---eecCCCccCCcccccc
Confidence 4666666666554 333 5588887 899999999 66 889999999999998721 2333311 11232110
Q ss_pred CcCCCCCcCCCCccc--cCCCceEEecCCCCCC---CCccCCCCCCCCCCCCCCCEEccC---CCCeeeeCCC
Q psy11059 350 DECLSNPCFNGATCQ--NKINGYTCVCAPGYSG---KECSININECESSPCLHGATCIDE---VATFSCVCPK 414 (429)
Q Consensus 350 ~~C~~~~C~~~~~C~--~~~g~~~C~C~~G~~G---~~C~~~~~~C~~~~C~~~~~C~~~---~~~~~C~C~~ 414 (429)
- -+..|...+.+. ..+.+|.+.+.++-.| ..|. .+.+=---.|..++.+..+ ..+++|+|..
T Consensus 878 ~--hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~-~~~~~~vp~Cd~hgh~ap~qchG~~~~CwCvd 947 (1289)
T KOG1214|consen 878 F--HPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCG-PSPEQYVPQCDDHGHFAPLQCHGKSDFCWCVD 947 (1289)
T ss_pred c--cceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCC-CcccccCCCccccccccccccCCCcceeEEec
Confidence 0 011255444332 1345678887766555 3453 1111011235555555433 2247788865
No 8
>KOG0994|consensus
Probab=99.26 E-value=8.6e-11 Score=122.53 Aligned_cols=99 Identities=30% Similarity=0.804 Sum_probs=62.2
Q ss_pred CCCCeeeeCCCCccCCCCcccccCCCcc----CCCCccCCCcCCCCCcCCCCccccCCCceEEecCCCCCCCCccC----
Q psy11059 314 TQHRFNCTCPSGYHGKICEKCFCRPGFA----GDHCDVDFDECLSNPCFNGATCQNKINGYTCVCAPGYSGKECSI---- 385 (429)
Q Consensus 314 ~~~~~~C~C~~G~~G~~C~~C~C~~g~~----g~~C~~~i~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~---- 385 (429)
...+++|.|.+...|.+|+ .|.+.++ |..|+ .|.-+| ..+.+|....| .|.|++||.|+.|..
T Consensus 1033 Dr~tGQCpClpNv~G~~CD--qCA~N~w~laSG~GCe----~C~Cd~-~~~pqCN~ftG--QCqCkpGfGGR~C~qCqel 1103 (1758)
T KOG0994|consen 1033 DRFTGQCPCLPNVQGVRCD--QCAENHWNLASGEGCE----PCNCDP-IGGPQCNEFTG--QCQCKPGFGGRTCSQCQEL 1103 (1758)
T ss_pred ccccCcCCCCccccccccc--ccccchhccccCCCCC----ccCCCc-cCCcccccccc--ceeccCCCCCcchhHHHHh
Confidence 4456788888888888888 5566654 55564 233222 23346766665 899999999998862
Q ss_pred ---CCC-CCCCCCCCCCC----EEccCCCCeeeeCCCCCCCCCCCC
Q psy11059 386 ---NIN-ECESSPCLHGA----TCIDEVATFSCVCPKGLTGRLCET 423 (429)
Q Consensus 386 ---~~~-~C~~~~C~~~~----~C~~~~~~~~C~C~~g~~G~~C~~ 423 (429)
+.+ .|..-.|...| .|.. .+.+|+|.+|..|.+|++
T Consensus 1104 ~WGdP~~~C~aCdCd~rG~~tpQCdr--~tG~C~C~~Gv~G~rCdq 1147 (1758)
T KOG0994|consen 1104 YWGDPNEKCRACDCDPRGIETPQCDR--ATGRCVCRPGVGGPRCDQ 1147 (1758)
T ss_pred hcCCCCCCceecCCCCCCCCCCCccc--cCCceeecCCCCCcchhh
Confidence 111 23333344433 2322 245789999999988863
No 9
>KOG1214|consensus
Probab=99.13 E-value=2.9e-10 Score=115.54 Aligned_cols=143 Identities=22% Similarity=0.544 Sum_probs=117.7
Q ss_pred CCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCC-CCCCCCCCCeEeeCCCCC
Q psy11059 8 LSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICT-TAPPCLNGATCRPQLTEQ 86 (429)
Q Consensus 8 ~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~-~~~~C~~~g~C~~~~~~~ 86 (429)
.+++|-.+++-|..++.|.....-.|+|.|..||.|+ |.+|. +++ +|+ ..+.|..+++|+ +.++
T Consensus 691 ~~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gd----------gr~c~-d~~--eca~~~~~CGp~s~Ci--n~pg 755 (1289)
T KOG1214|consen 691 PVNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGD----------GRNCV-DEN--ECATGFHRCGPNSVCI--NLPG 755 (1289)
T ss_pred ccccceecCcccCCCccccCCCCcceEEEEeeccCCC----------CCCCC-Chh--hhccCCCCCCCCceee--cCCC
Confidence 4678877888899999999444557999999999988 99995 777 898 577899999999 9999
Q ss_pred CeeeeCCCCCc---------------ccCCCCC--CCCCCCC--eEecCCC----ce-eCCeec--cCCCCCCCCCCCCC
Q psy11059 87 LYECVCPPGYK---------------EIRDCTS--NPCLNDG--VCVWMFD----VT-IQVYKG--RYCELPEIGDCSSN 140 (429)
Q Consensus 87 ~~~C~C~~Gy~---------------~~~~C~~--~~C~~~g--~C~~~~~----C~-~~g~~G--~~C~~~~i~~C~~~ 140 (429)
+|+|.|..||. .++.|.. ..|.-.| .|+...+ |. .+||.| ..|. ++|+|..+
T Consensus 756 ~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~--dvDeC~ps 833 (1289)
T KOG1214|consen 756 SYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT--DVDECSPS 833 (1289)
T ss_pred ceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccc--cccccCcc
Confidence 99999999997 3445552 3454443 4555544 99 999986 5677 88999999
Q ss_pred CCCCCCeEeeCCCCccccCCCCCCCCCCCCCCCceecCCcccCC
Q psy11059 141 PCLNDGVCVDVYKGRYCELPEIGDCSSNPCLNDGVCVDVYKGRY 184 (429)
Q Consensus 141 ~C~~~g~C~~~~~g~~C~~~~~~~C~~~~C~~~~~C~~~~~G~~ 184 (429)
.|...++|++..+.+.|+ |++||.|+.
T Consensus 834 rChp~A~CyntpgsfsC~-----------------C~pGy~GDG 860 (1289)
T KOG1214|consen 834 RCHPAATCYNTPGSFSCR-----------------CQPGYYGDG 860 (1289)
T ss_pred ccCCCceEecCCCcceee-----------------cccCccCCC
Confidence 999999999999999998 999999864
No 10
>KOG1225|consensus
Probab=98.96 E-value=3.4e-09 Score=106.35 Aligned_cols=126 Identities=33% Similarity=0.815 Sum_probs=83.9
Q ss_pred eeEeCCCCCcCCCCCCCccccCC------CCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeC
Q psy11059 249 YDCTCDALHTGDPCKHGSCVDKR------AGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTC 322 (429)
Q Consensus 249 ~~C~C~~g~~G~~C~~~~C~~~~------~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C 322 (429)
+.|.|+.+|.|..|+...|...- ..-+|.|++||+|..|+. ..|.. . |..++.+ + . ..|+|
T Consensus 234 ~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e--~~Cp~-~-cs~~g~~----~---~--g~CiC 300 (525)
T KOG1225|consen 234 GICECPEGYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDE--LVCPV-D-CSGGGVC----V---D--GECIC 300 (525)
T ss_pred ceeecCCceeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCc--ccCCc-c-cCCCcee----c---C--CEeec
Confidence 36777777777777655554431 113577777777777743 34543 2 6555555 2 1 26666
Q ss_pred CCCccCCCCcccccCCCccCCCCccCCCcCCCCCcCCCCccccCCCceEEecCCCCCCCCccCCCCCCCCCCCCCCCEEc
Q psy11059 323 PSGYHGKICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQNKINGYTCVCAPGYSGKECSININECESSPCLHGATCI 402 (429)
Q Consensus 323 ~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~ 402 (429)
++||+|..|+ +..|. ..|.+++.|++. +|.|.+||+|..|... +|.+++.|+
T Consensus 301 ~~g~~G~dCs----------------~~~cp-adC~g~G~Ci~G----~C~C~~Gy~G~~C~~~-------~C~~~g~cv 352 (525)
T KOG1225|consen 301 NPGYSGKDCS----------------IRRCP-ADCSGHGKCIDG----ECLCDEGYTGELCIQR-------ACSGGGQCV 352 (525)
T ss_pred CCCccccccc----------------cccCC-ccCCCCCcccCC----ceEeCCCCcCCccccc-------ccCCCceec
Confidence 6666655443 23343 459999999833 8999999999999643 388888885
Q ss_pred cCCCCeeeeCCCCCCCCC
Q psy11059 403 DEVATFSCVCPKGLTGRL 420 (429)
Q Consensus 403 ~~~~~~~C~C~~g~~G~~ 420 (429)
+. |+|..||.|++
T Consensus 353 ~g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 353 NG-----CKCKKGWRGPD 365 (525)
T ss_pred cC-----ceeccCccCCC
Confidence 43 99999999987
No 11
>KOG0994|consensus
Probab=98.80 E-value=7.1e-08 Score=101.35 Aligned_cols=95 Identities=21% Similarity=0.382 Sum_probs=59.0
Q ss_pred eEEecCCCCcccccccccCcCCC-CCCCCCCCCCCCC-CCCCCCC-CCeEee-CCCCCCeee-eCCCCCc------ccCC
Q psy11059 33 YDCTCDALHTVCCVGLANQTLGS-IHCETPISNQICT-TAPPCLN-GATCRP-QLTEQLYEC-VCPPGYK------EIRD 101 (429)
Q Consensus 33 ~~C~C~~g~~g~~~~~~~~~~~G-~~C~~~~~~~~C~-~~~~C~~-~g~C~~-~~~~~~~~C-~C~~Gy~------~~~~ 101 (429)
.+|+|.+|-.|.++..|..+|=| +.|..- .|. -.+.|.. -|.|+- .+....+.| .|..||+ ....
T Consensus 830 GQC~C~~g~ygrqCnqCqpG~WgFPeCr~C----qCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~ 905 (1758)
T KOG0994|consen 830 GQCQCRPGTYGRQCNQCQPGYWGFPECRPC----QCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIG 905 (1758)
T ss_pred cceeeccccchhhccccCCCccCCCcCccc----cccCcccccCccccccccccccccccchhhhhccccCCcccCCCCC
Confidence 46888888777777777776655 333311 121 0222322 244441 045566778 6999999 3467
Q ss_pred CCCCCCCCCC--------eEecCCC-----ce-eCCeeccCCCC
Q psy11059 102 CTSNPCLNDG--------VCVWMFD-----VT-IQVYKGRYCEL 131 (429)
Q Consensus 102 C~~~~C~~~g--------~C~~~~~-----C~-~~g~~G~~C~~ 131 (429)
|.+.||..+- .|.-... |. .+||+|.+|+.
T Consensus 906 CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~ 949 (1758)
T KOG0994|consen 906 CRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEI 949 (1758)
T ss_pred CCCCCCCCCCccchhccccccccccccceeeecccCccccchhh
Confidence 8888887652 3543332 89 99999999984
No 12
>KOG1225|consensus
Probab=98.76 E-value=6.3e-08 Score=97.35 Aligned_cols=118 Identities=31% Similarity=0.785 Sum_probs=85.6
Q ss_pred cccCCCCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCCCCCCCCCCeEeeC
Q psy11059 3 FKPISLSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTTAPPCLNGATCRPQ 82 (429)
Q Consensus 3 ~~~~~~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~~~~C~~~g~C~~~ 82 (429)
+.|....--| ++-|.++|.|+ ..+|+|++||+ |.+|... .|. .. |+.++.++
T Consensus 243 ~g~~c~~~~C---~~~c~~~g~c~-----~G~CIC~~Gf~------------G~dC~e~----~Cp-~~-cs~~g~~~-- 294 (525)
T KOG1225|consen 243 FGPLCSTIYC---PGGCTGRGQCV-----EGRCICPPGFT------------GDDCDEL----VCP-VD-CSGGGVCV-- 294 (525)
T ss_pred eCCccccccC---CCCCcccceEe-----CCeEeCCCCCc------------CCCCCcc----cCC-cc-cCCCceec--
Confidence 3444444445 66677778888 56899999999 9999753 366 33 88888888
Q ss_pred CCCCCeeeeCCCCCc----ccCCCCCCCCCCCCeEecCCCce-eCCeeccCCCCCCCCCCCCCCCCCCCeEeeCCCCccc
Q psy11059 83 LTEQLYECVCPPGYK----EIRDCTSNPCLNDGVCVWMFDVT-IQVYKGRYCELPEIGDCSSNPCLNDGVCVDVYKGRYC 157 (429)
Q Consensus 83 ~~~~~~~C~C~~Gy~----~~~~C~~~~C~~~g~C~~~~~C~-~~g~~G~~C~~~~i~~C~~~~C~~~g~C~~~~~g~~C 157 (429)
+. .|+|++||+ .+..|. .+|..+|.|+ ...|. .+||+|..|+. . +|.+++.|++. |
T Consensus 295 ~g----~CiC~~g~~G~dCs~~~cp-adC~g~G~Ci-~G~C~C~~Gy~G~~C~~-~-------~C~~~g~cv~g-----C 355 (525)
T KOG1225|consen 295 DG----ECICNPGYSGKDCSIRRCP-ADCSGHGKCI-DGECLCDEGYTGELCIQ-R-------ACSGGGQCVNG-----C 355 (525)
T ss_pred CC----EeecCCCccccccccccCC-ccCCCCCccc-CCceEeCCCCcCCcccc-c-------ccCCCceeccC-----c
Confidence 32 699999998 334444 6699999999 33399 99999999985 2 37777787752 4
Q ss_pred cCCCCCCCCCCCCCCCceecCCcccCC
Q psy11059 158 ELPEIGDCSSNPCLNDGVCVDVYKGRY 184 (429)
Q Consensus 158 ~~~~~~~C~~~~C~~~~~C~~~~~G~~ 184 (429)
. |..||.|..
T Consensus 356 ~-----------------C~~Gw~G~d 365 (525)
T KOG1225|consen 356 K-----------------CKKGWRGPD 365 (525)
T ss_pred e-----------------eccCccCCC
Confidence 4 889999987
No 13
>KOG4260|consensus
Probab=98.58 E-value=1e-07 Score=85.72 Aligned_cols=131 Identities=29% Similarity=0.749 Sum_probs=88.7
Q ss_pred eCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCc------------------------
Q psy11059 277 DCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICE------------------------ 332 (429)
Q Consensus 277 ~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~------------------------ 332 (429)
-|++|.+|..|..-... + ..+|..++.|.. -..+.++..|.|.+||.|..|.
T Consensus 131 CCp~gtyGpdCl~Cpgg-s-er~C~GnG~C~G--dGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~ 206 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQCPGG-S-ERPCFGNGSCHG--DGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEG 206 (350)
T ss_pred ccCCCCcCCccccCCCC-C-cCCcCCCCcccC--CCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhh
Confidence 36777777776421110 1 256777778742 1125688999999999999886
Q ss_pred -----------cc-ccCCCccC--CCCccCCCcCCC--CCcCCCCccccCCCceEEecCCCCCCCCccCCCCCCCC--CC
Q psy11059 333 -----------KC-FCRPGFAG--DHCDVDFDECLS--NPCFNGATCQNKINGYTCVCAPGYSGKECSININECES--SP 394 (429)
Q Consensus 333 -----------~C-~C~~g~~g--~~C~~~i~~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~--~~ 394 (429)
.| .|..||.- ..| +||++|.. .||.....|+|+.|+|.|..++||.+. +|+|.. ..
T Consensus 207 C~~~Csg~~~k~C~kCkkGW~lde~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d~C~~~~d~ 280 (350)
T KOG4260|consen 207 CLGVCSGESSKGCSKCKKGWKLDEEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VDECQFCADV 280 (350)
T ss_pred hhcccCCCCCCChhhhcccceeccccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccCC-----hHHhhhhhhh
Confidence 12 24555542 234 48888864 568888899999999999999998762 444442 23
Q ss_pred C-CCCCEEccCCCCeeeeCCCCCC
Q psy11059 395 C-LHGATCIDEVATFSCVCPKGLT 417 (429)
Q Consensus 395 C-~~~~~C~~~~~~~~C~C~~g~~ 417 (429)
| ..+..|.++.+.|+|+|..++.
T Consensus 281 ~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 281 CASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred cccCCCCcccCCccEEEEecccce
Confidence 3 2345688889999999988753
No 14
>KOG1836|consensus
Probab=98.35 E-value=1.2e-05 Score=91.41 Aligned_cols=135 Identities=27% Similarity=0.594 Sum_probs=78.1
Q ss_pred CCeee-eCCCCCCCCCCc-cCCCCCCCCCCCCCC------CeeCCccccCCCCCeeeeCCCCccCCCCcccccCCCccCC
Q psy11059 272 AGYFC-DCPPTYGGKNCS-VELTGCVGPDTCLNG------GTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPGFAGD 343 (429)
Q Consensus 272 ~~~~C-~C~~G~~g~~c~-~~~~~C~~~~~C~~~------~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~ 343 (429)
.+.+| .|.+||.|..-. ...+.|.. .-|... .+| ..-...|.|.+.-.|..|. .|.+||++.
T Consensus 864 ~g~~cd~c~~g~~gd~l~~~p~~~c~~-c~c~p~gs~~~~~~c-------~~~tGQcec~~~v~g~~c~--~c~~g~fnl 933 (1705)
T KOG1836|consen 864 AGEYCDLCKEGYFGDPLAPNPEDKCFA-CGCVPAGSELPSLTC-------NPVTGQCECKPNVEGRDCL--YCFKGFFNL 933 (1705)
T ss_pred ccccccccccCccccccCCCcCCcccc-ccCccCCcccccccC-------CCcccceeccCCCCccccc--ccccccccc
Confidence 33445 688888887543 12233433 223222 223 3456789999999999888 677888866
Q ss_pred CCccCCCcCCCCCcCCC----CccccCCCceEEecCCCCCCCCccCC--------CCCCCCCCCCCCC----EEccCCCC
Q psy11059 344 HCDVDFDECLSNPCFNG----ATCQNKINGYTCVCAPGYSGKECSIN--------INECESSPCLHGA----TCIDEVAT 407 (429)
Q Consensus 344 ~C~~~i~~C~~~~C~~~----~~C~~~~g~~~C~C~~G~~G~~C~~~--------~~~C~~~~C~~~~----~C~~~~~~ 407 (429)
.- -..|+.-.|... ..|.... ..|.|.+|-+|.+|..- +..|..--|...| .|... .
T Consensus 934 ~s---~~gC~~c~c~~~gs~~~~c~~~t--Gqc~c~~gVtgqrc~qc~~~~~~~~~~gc~~c~c~~~Gs~~~qc~~~--~ 1006 (1705)
T KOG1836|consen 934 NS---GVGCEPCNCDPTGSESSDCDVGT--GQCYCRPGVTGQRCDQCETYHFGFQTEGCGLCECDPLGSRGFQCDPE--D 1006 (1705)
T ss_pred CC---CCCcccccccccccccccccccC--CceeeecCccccccCccccCcccccccCCcceecccCCcccceeccc--C
Confidence 51 122333334322 2454444 48999999999887521 1112222243333 45432 3
Q ss_pred eeeeCCCCCCCCCCCC
Q psy11059 408 FSCVCPKGLTGRLCET 423 (429)
Q Consensus 408 ~~C~C~~g~~G~~C~~ 423 (429)
.+|.|+++|.|.+|..
T Consensus 1007 G~c~c~~~~~g~~c~~ 1022 (1705)
T KOG1836|consen 1007 GQCPCRPGFEGRRCDQ 1022 (1705)
T ss_pred CeeeecCCCCCccccc
Confidence 4899999999987763
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.29 E-value=3.5e-07 Score=60.22 Aligned_cols=34 Identities=24% Similarity=0.663 Sum_probs=32.2
Q ss_pred CCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 8 LSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 8 ~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
|||||+..+++|..+++|+ ++.|+|+|.|++||+
T Consensus 1 DidEC~~~~~~C~~~~~C~-N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCV-NTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEE-EETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEE-cCCCCEEeeCCCCcE
Confidence 5899998888999999999 999999999999998
No 16
>KOG1226|consensus
Probab=98.17 E-value=1.1e-05 Score=83.10 Aligned_cols=144 Identities=28% Similarity=0.650 Sum_probs=84.9
Q ss_pred CCCCCCcEeccCCCCCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCCccCCCCCCC---CCCCCCCCeeCCc
Q psy11059 233 NPCQNGGKCNEDETGNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNCSVELTGCVG---PDTCLNGGTCKPY 309 (429)
Q Consensus 233 ~~C~~~g~C~~~~~~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~~~~~C~~---~~~C~~~~~C~~~ 309 (429)
..|+.+|+. .-..|.|.+||.|..|+-. ..-.... + ..+.|.. ..+|...|.|.
T Consensus 467 ~~C~g~G~~-----~CG~C~C~~G~~G~~CEC~--------------~~~~ss~-~-~~~~Cr~~~~~~vCSgrG~C~-- 523 (783)
T KOG1226|consen 467 ALCHGNGTF-----VCGQCRCDEGWLGKKCECS--------------TDELSSS-E-EEDKCRENSDSPVCSGRGDCV-- 523 (783)
T ss_pred cccCCCCcE-----EecceecCCCCCCCcccCC--------------ccccCcH-h-HHhhccCCCCCCCcCCCCcEe--
Confidence 356655554 2246888888888887632 1111110 0 0122321 13577777772
Q ss_pred cccCCCCCeeeeCCCCccCCCCcccccCCCccCCCCccCCCcCCC---CCcCCCCccccCCCceEEecCCCCCCCCcc--
Q psy11059 310 LVDETQHRFNCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDECLS---NPCFNGATCQNKINGYTCVCAPGYSGKECS-- 384 (429)
Q Consensus 310 ~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~---~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~-- 384 (429)
=++|+|.+...|. ++|..|+-|--.|.. .-|..++.|.-. +|+|.+||+|..|+
T Consensus 524 -------CGqC~C~~~~~~~----------i~G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~ 582 (783)
T KOG1226|consen 524 -------CGQCVCHKPDNGK----------IYGKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCP 582 (783)
T ss_pred -------CCceEecCCCCCc----------eeeeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence 1356776655421 235566533333433 237788887654 79999999999875
Q ss_pred CCCCCCCCC---CCCCCCEEccCCCCeeeeCCCC-CCCCCCCCC
Q psy11059 385 ININECESS---PCLHGATCIDEVATFSCVCPKG-LTGRLCETN 424 (429)
Q Consensus 385 ~~~~~C~~~---~C~~~~~C~~~~~~~~C~C~~g-~~G~~C~~~ 424 (429)
.+.+.|.+. .|...|+|.-. +|+|... |.|..||..
T Consensus 583 ~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 583 LSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEKC 622 (783)
T ss_pred CCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhcC
Confidence 455666542 47777777544 6888866 999999864
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.92 E-value=4.9e-06 Score=51.16 Aligned_cols=27 Identities=52% Similarity=1.068 Sum_probs=25.0
Q ss_pred CCCCCCCCCEeccCCC-CceEEecCCCCc
Q psy11059 15 QRNPCQNGGKCNEDET-GNYDCTCDALHT 42 (429)
Q Consensus 15 ~~~~C~~~g~C~~~~~-~~~~C~C~~g~~ 42 (429)
.++||+|+|+|+ +.. ++|+|+|++||+
T Consensus 2 ~~~~C~n~g~C~-~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 2 SSNPCQNGGTCI-DLPGGGYTCECPPGYT 29 (32)
T ss_dssp TTTSSTTTEEEE-EESTSEEEEEEBTTEE
T ss_pred CCCcCCCCeEEE-eCCCCCEEeECCCCCc
Confidence 578999999999 777 999999999999
No 18
>KOG4260|consensus
Probab=97.89 E-value=1.5e-05 Score=72.00 Aligned_cols=96 Identities=26% Similarity=0.600 Sum_probs=64.4
Q ss_pred CCCCCCCCCCCCCC--CCCCCCCCCeEeeC-CCCCCeeeeCCCCCc--ccCCCCC-----------C---CCCC--CCeE
Q psy11059 55 SIHCETPISNQICT--TAPPCLNGATCRPQ-LTEQLYECVCPPGYK--EIRDCTS-----------N---PCLN--DGVC 113 (429)
Q Consensus 55 G~~C~~~~~~~~C~--~~~~C~~~g~C~~~-~~~~~~~C~C~~Gy~--~~~~C~~-----------~---~C~~--~g~C 113 (429)
|+.|. .|. +..+|..+|.|.-. ...|+..|.|.+||+ .-..|.. . .|.. .++|
T Consensus 138 GpdCl------~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~C 211 (350)
T KOG4260|consen 138 GPDCL------QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVC 211 (350)
T ss_pred CCccc------cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhccc
Confidence 99997 453 35678788888721 145678999999998 1122221 0 1211 1245
Q ss_pred ecCCC--ce--eCCeec--cCCCCCCCCCCC--CCCCCCCCeEeeCCCCcccc
Q psy11059 114 VWMFD--VT--IQVYKG--RYCELPEIGDCS--SNPCLNDGVCVDVYKGRYCE 158 (429)
Q Consensus 114 ~~~~~--C~--~~g~~G--~~C~~~~i~~C~--~~~C~~~g~C~~~~~g~~C~ 158 (429)
..... |. ..||.- ..|. |||||. +.||.....|+|+.++|.|+
T Consensus 212 sg~~~k~C~kCkkGW~lde~gCv--DvnEC~~ep~~c~~~qfCvNteGSf~C~ 262 (350)
T KOG4260|consen 212 SGESSKGCSKCKKGWKLDEEGCV--DVNECQNEPAPCKAHQFCVNTEGSFKCE 262 (350)
T ss_pred CCCCCCChhhhcccceecccccc--cHHHHhcCCCCCChhheeecCCCceEec
Confidence 43332 77 788874 4576 899998 57799999999999999998
No 19
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.82 E-value=6.3e-06 Score=51.94 Aligned_cols=32 Identities=25% Similarity=0.641 Sum_probs=25.4
Q ss_pred CCCCCCCCCCCCEeccCCCCceEEecCCCCccc
Q psy11059 12 CDAQRNPCQNGGKCNEDETGNYDCTCDALHTVC 44 (429)
Q Consensus 12 C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~ 44 (429)
|+...+.|+.+++|+ ++.++|+|+|++||.|+
T Consensus 1 C~~~~~~C~~nA~C~-~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 1 CLENNGGCHPNATCT-NTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp TTTGGGGS-TTCEEE-E-TTSEEEEE-CEEECC
T ss_pred CCCCCCCCCCCcEee-cCCCCEEeECCCCCccC
Confidence 344567899999999 99999999999999988
No 20
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.81 E-value=3.2e-05 Score=49.72 Aligned_cols=32 Identities=44% Similarity=0.966 Sum_probs=29.0
Q ss_pred CCCCCCCCC-CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 8 LSSPCDAQR-NPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 8 ~~~~C~~~~-~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
++|+| .. ++|.++|+|+ +..++|+|.|++||.
T Consensus 1 d~~~C--~~~~~C~~~~~C~-~~~g~~~C~C~~g~~ 33 (39)
T smart00179 1 DIDEC--ASGNPCQNGGTCV-NTVGSYRCECPPGYT 33 (39)
T ss_pred CcccC--cCCCCcCCCCEeE-CCCCCeEeECCCCCc
Confidence 47899 45 7999999999 999999999999998
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.78 E-value=1.1e-05 Score=49.59 Aligned_cols=29 Identities=48% Similarity=1.145 Sum_probs=19.1
Q ss_pred CCCCCCCCCEEccCC-CCeeeeCCCCCCCC
Q psy11059 391 ESSPCLHGATCIDEV-ATFSCVCPKGLTGR 419 (429)
Q Consensus 391 ~~~~C~~~~~C~~~~-~~~~C~C~~g~~G~ 419 (429)
.+++|.++|+|++.. ++|+|.|++||+|+
T Consensus 2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 445666667776666 66777777777665
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.77 E-value=4e-05 Score=49.28 Aligned_cols=36 Identities=53% Similarity=1.297 Sum_probs=24.0
Q ss_pred CCCCCC-CCCCCCCEEccCCCCeeeeCCCCCC-CCCCC
Q psy11059 387 INECES-SPCLHGATCIDEVATFSCVCPKGLT-GRLCE 422 (429)
Q Consensus 387 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~g~~-G~~C~ 422 (429)
+++|.. .+|.++++|+++.++|+|.|++||+ |.+|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 455655 5676666777777777777777777 66653
No 23
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.70 E-value=1.8e-05 Score=52.03 Aligned_cols=33 Identities=42% Similarity=0.951 Sum_probs=29.3
Q ss_pred CCCCCCCC-CCCCCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059 61 PISNQICT-TAPPCLNGATCRPQLTEQLYECVCPPGYK 97 (429)
Q Consensus 61 ~~~~~~C~-~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~ 97 (429)
||| ||+ ..+.|..+++|+ |+.|+|+|.|++||+
T Consensus 1 Did--EC~~~~~~C~~~~~C~--N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DID--ECAEGPHNCPENGTCV--NTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESS--TTTTTSSSSSTTSEEE--EETTEEEEEESTTEE
T ss_pred Ccc--ccCCCCCcCCCCCEEE--cCCCCEEeeCCCCcE
Confidence 466 898 466899899999 999999999999998
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.64 E-value=8.3e-05 Score=47.26 Aligned_cols=32 Identities=44% Similarity=0.982 Sum_probs=28.7
Q ss_pred CCCCCCCCC-CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 8 LSSPCDAQR-NPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 8 ~~~~C~~~~-~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
++|+| .. .+|.++++|+ +..+.|+|.|++||.
T Consensus 1 ~~~~C--~~~~~C~~~~~C~-~~~~~~~C~C~~g~~ 33 (38)
T cd00054 1 DIDEC--ASGNPCQNGGTCV-NTVGSYRCSCPPGYT 33 (38)
T ss_pred CcccC--CCCCCcCCCCEeE-CCCCCeEeECCCCCc
Confidence 36889 45 7999999999 999999999999999
No 25
>KOG1226|consensus
Probab=97.47 E-value=0.0011 Score=68.69 Aligned_cols=99 Identities=22% Similarity=0.513 Sum_probs=64.1
Q ss_pred CCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCC-------CCCC---CCCCCCCCCeEeeCCCC
Q psy11059 16 RNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISN-------QICT---TAPPCLNGATCRPQLTE 85 (429)
Q Consensus 16 ~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~-------~~C~---~~~~C~~~g~C~~~~~~ 85 (429)
...|+.+|+.+ =.+|.|.+||. |..||-+.+. +.|. +..+|.++|.|+ =.
T Consensus 466 s~~C~g~G~~~-----CG~C~C~~G~~------------G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~--CG- 525 (783)
T KOG1226|consen 466 SALCHGNGTFV-----CGQCRCDEGWL------------GKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCV--CG- 525 (783)
T ss_pred ccccCCCCcEE-----ecceecCCCCC------------CCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEe--CC-
Confidence 44577777665 45799999999 7777754321 1343 133788999888 22
Q ss_pred CCeeeeCCCCCc----------ccCCCC---CCCCCCCCeEecCCCce-eCCeeccCCCCC-CCCCCC
Q psy11059 86 QLYECVCPPGYK----------EIRDCT---SNPCLNDGVCVWMFDVT-IQVYKGRYCELP-EIGDCS 138 (429)
Q Consensus 86 ~~~~C~C~~Gy~----------~~~~C~---~~~C~~~g~C~~~~~C~-~~g~~G~~C~~~-~i~~C~ 138 (429)
.|+|.+... +.-.|. ...|..+|.|.-.. |. .+||+|..|+-+ +.+.|.
T Consensus 526 ---qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG~-CvC~~GwtG~~C~C~~std~C~ 589 (783)
T KOG1226|consen 526 ---QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECGR-CVCNPGWTGSACNCPLSTDTCE 589 (783)
T ss_pred ---ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCCc-EEcCCCCccCCCCCCCCCcccc
Confidence 378876554 222333 24588888886444 99 999999998741 334444
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.41 E-value=0.00025 Score=44.98 Aligned_cols=36 Identities=53% Similarity=1.305 Sum_probs=23.5
Q ss_pred CCCCCC-CCCCCCCEEccCCCCeeeeCCCCCCCCCCC
Q psy11059 387 INECES-SPCLHGATCIDEVATFSCVCPKGLTGRLCE 422 (429)
Q Consensus 387 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~g~~G~~C~ 422 (429)
+++|.. .+|.++++|++..++|+|.|++||+|.+|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 345554 566666677777777777777777776653
No 27
>KOG1836|consensus
Probab=97.39 E-value=0.00072 Score=77.31 Aligned_cols=112 Identities=29% Similarity=0.669 Sum_probs=78.4
Q ss_pred EeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCCcc-CCCCCCCCCCCCC-CCeeCCccccCCCCCeeeeCCCCccC
Q psy11059 251 CTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNCSV-ELTGCVGPDTCLN-GGTCKPYLVDETQHRFNCTCPSGYHG 328 (429)
Q Consensus 251 C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~-~~~~C~~~~~C~~-~~~C~~~~~~~~~~~~~C~C~~G~~G 328 (429)
|.|+.||+|..|+ .|.+||....-.. +...|.+ ..|.. ..+| ...+..|.|.+...|
T Consensus 697 c~C~~g~tG~~Ce-------------~C~~gfrr~~~~~~~~~~c~~-C~cngh~~~C-------d~~tG~C~C~~~t~G 755 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCE-------------SCAPGFRRLSPQLGPFCPCIP-CDCNGHSNIC-------DPRTGQCKCKHNTFG 755 (1705)
T ss_pred ccCCCCcccchhh-------------hcchhhhcccccCCCCCcccc-cccCCccccc-------cCCCCceecccCCCC
Confidence 9999999999998 5888885432111 1122322 23322 2345 556788999999999
Q ss_pred CCCcccccCCCccCCCCccCCCcCCCCCcCCCCccccCC--CceEEe-cCCCCCCCCccC
Q psy11059 329 KICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQNKI--NGYTCV-CAPGYSGKECSI 385 (429)
Q Consensus 329 ~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~~~--g~~~C~-C~~G~~G~~C~~ 385 (429)
..|+ +|..||+|..=.-....|.+-+|.+++.|.... ....|. |++||+|.+|+.
T Consensus 756 ~~C~--~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 756 GQCA--QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred Cchh--hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence 9998 789999987533122237777888888887554 567898 999999999974
No 28
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.13 E-value=0.00074 Score=42.05 Aligned_cols=26 Identities=46% Similarity=1.088 Sum_probs=24.6
Q ss_pred CCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 16 RNPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 16 ~~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
..+|.++++|+ +..+.|+|.|+.||.
T Consensus 5 ~~~C~~~~~C~-~~~~~~~C~C~~g~~ 30 (36)
T cd00053 5 SNPCSNGGTCV-NTPGSYRCVCPPGYT 30 (36)
T ss_pred CCCCCCCCEEe-cCCCCeEeECCCCCc
Confidence 67999999999 988999999999999
No 29
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.05 E-value=0.00094 Score=41.70 Aligned_cols=28 Identities=39% Similarity=1.035 Sum_probs=24.8
Q ss_pred CCCCCC-CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 11 PCDAQR-NPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 11 ~C~~~~-~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
+| .. ++|.++ +|+ +..++|+|.|++||.
T Consensus 1 ~C--~~~~~C~~~-~C~-~~~~~~~C~C~~g~~ 29 (35)
T smart00181 1 EC--ASGGPCSNG-TCI-NTPGSYTCSCPPGYT 29 (35)
T ss_pred CC--CCcCCCCCC-EEE-CCCCCeEeECCCCCc
Confidence 46 44 789998 999 999999999999999
No 30
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.82 E-value=0.0021 Score=39.91 Aligned_cols=30 Identities=47% Similarity=1.230 Sum_probs=21.6
Q ss_pred CCCCCCCCEEccCCCCeeeeCCCCCCCC-CC
Q psy11059 392 SSPCLHGATCIDEVATFSCVCPKGLTGR-LC 421 (429)
Q Consensus 392 ~~~C~~~~~C~~~~~~~~C~C~~g~~G~-~C 421 (429)
..+|.++++|++..++|+|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 4567667777777777778888777777 54
No 31
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.81 E-value=0.0013 Score=37.14 Aligned_cols=23 Identities=26% Similarity=0.589 Sum_probs=19.2
Q ss_pred ceEEecCCCCcccccccccCcCCCCCCCCCCC
Q psy11059 32 NYDCTCDALHTVCCVGLANQTLGSIHCETPIS 63 (429)
Q Consensus 32 ~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~ 63 (429)
+|+|+|++||+ ...+|..|+ |||
T Consensus 1 sy~C~C~~Gy~--------l~~d~~~C~-DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQ--------LSPDGRSCE-DID 23 (24)
T ss_pred CEEeeCCCCCc--------CCCCCCccc-cCC
Confidence 69999999999 445689996 877
No 32
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.80 E-value=0.0021 Score=40.07 Aligned_cols=28 Identities=46% Similarity=1.293 Sum_probs=20.4
Q ss_pred CCCCCCCEEccCCCCeeeeCCCCCCC-CCC
Q psy11059 393 SPCLHGATCIDEVATFSCVCPKGLTG-RLC 421 (429)
Q Consensus 393 ~~C~~~~~C~~~~~~~~C~C~~g~~G-~~C 421 (429)
.+|.++ +|+++.++|+|.|++||+| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 467666 7777777778888888877 555
No 33
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.47 E-value=0.0047 Score=37.71 Aligned_cols=23 Identities=35% Similarity=0.674 Sum_probs=20.0
Q ss_pred CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 17 NPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 17 ~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
..|+++|+|+ .. ..+|+|.+||+
T Consensus 6 ~~C~~~G~C~-~~--~g~C~C~~g~~ 28 (32)
T PF07974_consen 6 NICSGHGTCV-SP--CGRCVCDSGYT 28 (32)
T ss_pred CccCCCCEEe-CC--CCEEECCCCCc
Confidence 4699999999 54 57999999999
No 34
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.38 E-value=0.0018 Score=40.76 Aligned_cols=27 Identities=30% Similarity=0.808 Sum_probs=18.2
Q ss_pred CCCCCCCEEccCCCCeeeeCCCCCCCC
Q psy11059 393 SPCLHGATCIDEVATFSCVCPKGLTGR 419 (429)
Q Consensus 393 ~~C~~~~~C~~~~~~~~C~C~~g~~G~ 419 (429)
..|+.+|+|+++.++|+|+|++||+|+
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 357777888888778888888888765
No 35
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.98 E-value=0.0066 Score=34.30 Aligned_cols=10 Identities=40% Similarity=1.404 Sum_probs=4.8
Q ss_pred eeeeCCCCCC
Q psy11059 408 FSCVCPKGLT 417 (429)
Q Consensus 408 ~~C~C~~g~~ 417 (429)
|+|.|++||+
T Consensus 2 y~C~C~~Gy~ 11 (24)
T PF12662_consen 2 YTCSCPPGYQ 11 (24)
T ss_pred EEeeCCCCCc
Confidence 4445555543
No 36
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.64 E-value=0.0047 Score=29.51 Aligned_cols=13 Identities=54% Similarity=1.347 Sum_probs=7.4
Q ss_pred eeeCCCCCCCCCC
Q psy11059 409 SCVCPKGLTGRLC 421 (429)
Q Consensus 409 ~C~C~~g~~G~~C 421 (429)
+|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 3666666666655
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.56 E-value=0.017 Score=35.30 Aligned_cols=26 Identities=38% Similarity=0.818 Sum_probs=18.6
Q ss_pred CCCCCCEEccCCCCeeeeCCCCCCCCCC
Q psy11059 394 PCLHGATCIDEVATFSCVCPKGLTGRLC 421 (429)
Q Consensus 394 ~C~~~~~C~~~~~~~~C~C~~g~~G~~C 421 (429)
.|.++|+|+.. ..+|+|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 47777788654 347888888888765
No 38
>KOG3512|consensus
Probab=95.36 E-value=0.15 Score=50.35 Aligned_cols=163 Identities=23% Similarity=0.574 Sum_probs=91.1
Q ss_pred cEeccCCCCCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCC----ccCCCCCCCCCCCCCCCe-eCC----c
Q psy11059 239 GKCNEDETGNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNC----SVELTGCVGPDTCLNGGT-CKP----Y 309 (429)
Q Consensus 239 g~C~~~~~~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c----~~~~~~C~~~~~C~~~~~-C~~----~ 309 (429)
..|+.+..+..+|.|..+-+|..|+ .|.+-|..... ..++.+|.. ..|..++. |.- .
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCg-------------rCKpfy~dRPW~raT~~~a~~c~a-c~Cn~harrcrfn~Ely 350 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCG-------------RCKPFYYDRPWGRATALPANECVA-CNCNGHARRCRFNMELY 350 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcc-------------cccccccCCCccccccCCCccccc-cccchhhhhcccchhhh
Confidence 4676777777999999999999997 35565533322 124455554 44433322 210 0
Q ss_pred cccCCCCCeee-eCCCCccCCCCcccccCCCccCCCCc--cCCCcCCCCCcC----CCCccccCCCceEEecCCCCCCCC
Q psy11059 310 LVDETQHRFNC-TCPSGYHGKICEKCFCRPGFAGDHCD--VDFDECLSNPCF----NGATCQNKINGYTCVCAPGYSGKE 382 (429)
Q Consensus 310 ~~~~~~~~~~C-~C~~G~~G~~C~~C~C~~g~~g~~C~--~~i~~C~~~~C~----~~~~C~~~~g~~~C~C~~G~~G~~ 382 (429)
.......+..| .|.....|..|. -|..||+-+.-. .+...|..-.|+ .+-+|....| +|.|++|-+|.+
T Consensus 351 ~lSgr~SggvClnCrHnTaGrhCh--yCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~t 426 (592)
T KOG3512|consen 351 RLSGRRSGGVCLNCRHNTAGRHCH--YCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLT 426 (592)
T ss_pred cccCccccceEeecccCCCCcccc--cccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCcccc
Confidence 00112233466 788888888887 678888743211 122233322232 3446765655 899999999988
Q ss_pred ccC----------CCCCCCCC------CCCCCCEEccCCCCeeeeCCCCCCCCCCCC
Q psy11059 383 CSI----------NINECESS------PCLHGATCIDEVATFSCVCPKGLTGRLCET 423 (429)
Q Consensus 383 C~~----------~~~~C~~~------~C~~~~~C~~~~~~~~C~C~~g~~G~~C~~ 423 (429)
|.. .+-+|.-. .+.++.+ +..+.+.|+.++.|.+++.
T Consensus 427 CnrCa~gyqqsrs~vapcik~p~~~~~~~~s~ve----~qd~~s~Ck~~~~~~r~n~ 479 (592)
T KOG3512|consen 427 CNRCAPGYQQSRSPVAPCIKIPTDAPTLGSSGVE----PQDQCSKCKASPGGKRLNQ 479 (592)
T ss_pred cccccchhhcccCCCcCceecCCCCccccCCCCc----chhccccCCCCCcceeccc
Confidence 741 11122111 1222222 3345678999998887764
No 39
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.06 E-value=0.0032 Score=54.42 Aligned_cols=136 Identities=22% Similarity=0.536 Sum_probs=73.1
Q ss_pred cccCCCCeeeeCCCCCCC---CCCccCCCCCCC----CCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCcccccCCC
Q psy11059 267 CVDKRAGYFCDCPPTYGG---KNCSVELTGCVG----PDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPG 339 (429)
Q Consensus 267 C~~~~~~~~C~C~~G~~g---~~c~~~~~~C~~----~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g 339 (429)
.+...+.|.|.|.+||.. ..|+.. ..|.. ..+|...+.|...........|.|.|.+||.-.
T Consensus 13 LiQMSNHfEC~Cnegfvl~~EntCE~k-v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~---------- 81 (197)
T PF06247_consen 13 LIQMSNHFECKCNEGFVLKNENTCEEK-VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILK---------- 81 (197)
T ss_dssp EEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES----------
T ss_pred EEEccCceEEEcCCCcEEccccccccc-eecCcccccCccccchhhhhcCCCcccceeEEEecccCceee----------
Confidence 444556677888888742 344422 23432 145777788843211124578999999999743
Q ss_pred ccCCCCccCCCcCCCCCcCCCCccccCC---CceEEecCCCCC---CCCccCCCC-CCCCCCCCCCCEEccCCCCeeeeC
Q psy11059 340 FAGDHCDVDFDECLSNPCFNGATCQNKI---NGYTCVCAPGYS---GKECSININ-ECESSPCLHGATCIDEVATFSCVC 412 (429)
Q Consensus 340 ~~g~~C~~~i~~C~~~~C~~~~~C~~~~---g~~~C~C~~G~~---G~~C~~~~~-~C~~~~C~~~~~C~~~~~~~~C~C 412 (429)
...|. ...|....|. .|.|+..+ ....|+|.-|+. ...|..+-+ +|. --|..+.+|....+-|+|.+
T Consensus 82 --~~vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~~~Y~C~~ 155 (197)
T PF06247_consen 82 --QGVCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVDGYYKCVC 155 (197)
T ss_dssp --SSSEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEEE
T ss_pred --CCeEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeCcEEEeec
Confidence 11221 2445544565 56776433 345999999986 334432211 222 24777889999999999999
Q ss_pred CCCCCCC
Q psy11059 413 PKGLTGR 419 (429)
Q Consensus 413 ~~g~~G~ 419 (429)
.++|.+.
T Consensus 156 ~~~~~~~ 162 (197)
T PF06247_consen 156 KEGFPGD 162 (197)
T ss_dssp -TT-EEE
T ss_pred CCCCCCC
Confidence 9998643
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.87 E-value=0.012 Score=37.01 Aligned_cols=18 Identities=28% Similarity=0.778 Sum_probs=16.0
Q ss_pred EeccCCCCceEEecCCCCc
Q psy11059 24 KCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 24 ~C~~~~~~~~~C~C~~g~~ 42 (429)
.|+ +.+++|+|.|++||.
T Consensus 11 ~C~-~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 11 ICV-NTPGSYRCSCPPGYK 28 (36)
T ss_dssp EEE-EETTSEEEE-STTEE
T ss_pred CCc-cCCCceEeECCCCCE
Confidence 788 889999999999999
No 41
>KOG1218|consensus
Probab=94.45 E-value=4.4 Score=38.88 Aligned_cols=65 Identities=25% Similarity=0.553 Sum_probs=37.3
Q ss_pred CCCcCCCCCCCccccC----CCCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCcc
Q psy11059 255 ALHTGDPCKHGSCVDK----RAGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYH 327 (429)
Q Consensus 255 ~g~~G~~C~~~~C~~~----~~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~ 327 (429)
++|.|..|.... ... .....|.|.+||.|..+......|.....+.+++.|. .....+.+.+.+.
T Consensus 140 ~~~~g~~C~~~c-~~~~~~~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~-------~~~~~~~~~~~~~ 208 (316)
T KOG1218|consen 140 ENLVGLKCQRDC-QCTGGCDCKNGICTCQPGFVGVFCVESCSGCSPLTACENGAKCN-------RSTGSCLCYPGPS 208 (316)
T ss_pred cCCCCCCccCCC-CCccccCCCCCceeccCCcccccccccCCCcCCCcccCCCCeee-------ccccccccCCCCc
Confidence 356666665422 111 2234678899998888765544466556677777772 2334455555554
No 42
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=93.38 E-value=0.024 Score=49.13 Aligned_cols=121 Identities=24% Similarity=0.494 Sum_probs=70.4
Q ss_pred CEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCC----CCCCCCCCCeEeeCC---CCCCeeeeCCCC
Q psy11059 23 GKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICT----TAPPCLNGATCRPQL---TEQLYECVCPPG 95 (429)
Q Consensus 23 g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~----~~~~C~~~g~C~~~~---~~~~~~C~C~~G 95 (429)
|.-+ ...+.|.|.|.+||.-. .-.+||.-+ +|. ...+|.+-++|+... ....|+|.|.+|
T Consensus 11 G~Li-QMSNHfEC~Cnegfvl~---------~EntCE~kv---~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~g 77 (197)
T PF06247_consen 11 GYLI-QMSNHFECKCNEGFVLK---------NENTCEEKV---ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCING 77 (197)
T ss_dssp EEEE-EESSEEEEEESTTEEEE---------ETTEEEE-------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TT
T ss_pred CEEE-EccCceEEEcCCCcEEc---------cccccccce---ecCcccccCccccchhhhhcCCCcccceeEEEecccC
Confidence 6666 66778999999999822 235676433 565 256799999999432 246799999999
Q ss_pred Cc-c-----cCCCCCCCCCCCCeEecCCC------ce-eCCee---ccCCCCCCCCCCCCCCCCCCCeEeeCCCCcccc
Q psy11059 96 YK-E-----IRDCTSNPCLNDGVCVWMFD------VT-IQVYK---GRYCELPEIGDCSSNPCLNDGVCVDVYKGRYCE 158 (429)
Q Consensus 96 y~-~-----~~~C~~~~C~~~g~C~~~~~------C~-~~g~~---G~~C~~~~i~~C~~~~C~~~g~C~~~~~g~~C~ 158 (429)
|. . .+.|..-.|. .|.|+-.+. |. .-|+. ...|...--.+|. ..|..+..|....+-|.|.
T Consensus 78 Y~~~~~vCvp~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~~~Y~C~ 154 (197)
T PF06247_consen 78 YILKQGVCVPNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVDGYYKCV 154 (197)
T ss_dssp EEESSSSEEEGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEE
T ss_pred ceeeCCeEchhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeCcEEEee
Confidence 99 2 3455555666 688876543 77 77776 1223210011222 2366777888887777887
No 43
>smart00051 DSL delta serrate ligand.
Probab=92.62 E-value=0.21 Score=35.80 Aligned_cols=45 Identities=29% Similarity=0.625 Sum_probs=27.1
Q ss_pred eeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCC
Q psy11059 276 CDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKIC 331 (429)
Q Consensus 276 C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C 331 (429)
-.|+++|.|..|+ ..|.+.+....+.+| .. ...++|.+||+|..|
T Consensus 19 v~C~~~~yG~~C~---~~C~~~~d~~~~~~C-------d~-~G~~~C~~Gw~G~~C 63 (63)
T smart00051 19 VTCDENYYGEGCN---KFCRPRDDFFGHYTC-------DE-NGNKGCLEGWMGPYC 63 (63)
T ss_pred eeCCCCCcCCccC---CEeCcCccccCCccC-------Cc-CCCEecCCCCcCCCC
Confidence 3677777777774 234433344556666 22 356788888887654
No 44
>KOG3512|consensus
Probab=92.38 E-value=0.37 Score=47.65 Aligned_cols=109 Identities=26% Similarity=0.642 Sum_probs=61.1
Q ss_pred ccCCCCCeeeeCCCCccCCCCcccccCCCccCC----CCccCCCcCCCCCcCCC-------------------Ccccc--
Q psy11059 311 VDETQHRFNCTCPSGYHGKICEKCFCRPGFAGD----HCDVDFDECLSNPCFNG-------------------ATCQN-- 365 (429)
Q Consensus 311 ~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~----~C~~~i~~C~~~~C~~~-------------------~~C~~-- 365 (429)
+.+..+..+|.|..+..|..|+ .|.+-|.+. .-..++++|....|..+ ++|++
T Consensus 288 v~d~~~~ltCdC~HNTaGPdCg--rCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCr 365 (592)
T KOG3512|consen 288 VMDESSHLTCDCEHNTAGPDCG--RCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCR 365 (592)
T ss_pred eeccCCceEEecccCCCCCCcc--cccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecc
Confidence 3435566999999999999998 566666532 11124455554333222 23332
Q ss_pred -CCCceEE-ecCCCCCCCCcc--CCCCCCCCCCCCC----CCEEccCCCCeeeeCCCCCCCCCCCC
Q psy11059 366 -KINGYTC-VCAPGYSGKECS--ININECESSPCLH----GATCIDEVATFSCVCPKGLTGRLCET 423 (429)
Q Consensus 366 -~~g~~~C-~C~~G~~G~~C~--~~~~~C~~~~C~~----~~~C~~~~~~~~C~C~~g~~G~~C~~ 423 (429)
...+-+| .|++||.-+.=. .+...|..-.|+. +.+|..+. .+|.|++|.+|..|+.
T Consensus 366 HnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~t--GqCpCkeGvtG~tCnr 429 (592)
T KOG3512|consen 366 HNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTT--GQCPCKEGVTGLTCNR 429 (592)
T ss_pred cCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccccC--CcccCCCCCccccccc
Confidence 1122234 477777522111 1223344444543 34675553 4899999999998874
No 45
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.35 E-value=0.059 Score=33.86 Aligned_cols=22 Identities=50% Similarity=1.162 Sum_probs=18.1
Q ss_pred CCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059 72 PCLNGATCRPQLTEQLYECVCPPGYK 97 (429)
Q Consensus 72 ~C~~~g~C~~~~~~~~~~C~C~~Gy~ 97 (429)
.|.+ +|+ +++++|+|.|++||+
T Consensus 7 gC~h--~C~--~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 7 GCSH--ICV--NTPGSYRCSCPPGYK 28 (36)
T ss_dssp GSSS--EEE--EETTSEEEE-STTEE
T ss_pred CcCC--CCc--cCCCceEeECCCCCE
Confidence 4555 899 889999999999998
No 46
>KOG1218|consensus
Probab=92.02 E-value=3.3 Score=39.69 Aligned_cols=42 Identities=31% Similarity=0.600 Sum_probs=22.7
Q ss_pred CCeeEeCCCCCcCC-CCCC--------CccccCCCCeeeeCCCCCCCCCCc
Q psy11059 247 GNYDCTCDALHTGD-PCKH--------GSCVDKRAGYFCDCPPTYGGKNCS 288 (429)
Q Consensus 247 ~~~~C~C~~g~~G~-~C~~--------~~C~~~~~~~~C~C~~G~~g~~c~ 288 (429)
....|.|.++|+|. .+.. ..+........|.+..+|.+..|.
T Consensus 13 ~~~~c~c~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~ 63 (316)
T KOG1218|consen 13 GSGQCFCDPGYTGRLQCEHQAVTSACSGICPCEVNSGECGLGYGFVGSVCR 63 (316)
T ss_pred CCCceecCCCccccccccCCCCCccccccCCccCCceeEecccccCCCccc
Confidence 45567788888773 2222 012222234456777777776654
No 47
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=90.86 E-value=0.14 Score=34.53 Aligned_cols=29 Identities=31% Similarity=0.805 Sum_probs=22.5
Q ss_pred CCCeeeeCCCCccCCCCcccccCCCccCCCC
Q psy11059 315 QHRFNCTCPSGYHGKICEKCFCRPGFAGDHC 345 (429)
Q Consensus 315 ~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C 345 (429)
...+.|.|+++|.|..|+ +|.++|++..-
T Consensus 15 ~~~G~C~C~~~~~G~~C~--~C~~g~~~~~~ 43 (49)
T PF00053_consen 15 PSTGQCVCKPGTTGPRCD--QCKPGYFGLPS 43 (49)
T ss_dssp ETCEEESBSTTEESTTS---EE-TTEECSTT
T ss_pred CCCCEEeccccccCCcCc--CCCCccccccC
Confidence 356899999999999999 68899987643
No 48
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=90.56 E-value=0.39 Score=32.57 Aligned_cols=28 Identities=29% Similarity=0.738 Sum_probs=22.9
Q ss_pred CCCeeeeCCCCccCCCCcccccCCCccCCC
Q psy11059 315 QHRFNCTCPSGYHGKICEKCFCRPGFAGDH 344 (429)
Q Consensus 315 ~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~ 344 (429)
..+.+|.|+++|.|..|+ .|.++|++..
T Consensus 16 ~~~G~C~C~~~~~G~~C~--~C~~g~~~~~ 43 (50)
T cd00055 16 PGTGQCECKPNTTGRRCD--RCAPGYYGLP 43 (50)
T ss_pred CCCCEEeCCCcCCCCCCC--CCCCCCccCC
Confidence 345789999999999999 6788888753
No 49
>smart00051 DSL delta serrate ligand.
Probab=90.09 E-value=0.44 Score=34.12 Aligned_cols=13 Identities=31% Similarity=0.726 Sum_probs=6.5
Q ss_pred EEecCCCCCCCCc
Q psy11059 371 TCVCAPGYSGKEC 383 (429)
Q Consensus 371 ~C~C~~G~~G~~C 383 (429)
.++|.+||+|..|
T Consensus 51 ~~~C~~Gw~G~~C 63 (63)
T smart00051 51 NKGCLEGWMGPYC 63 (63)
T ss_pred CEecCCCCcCCCC
Confidence 3455555555443
No 50
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=89.19 E-value=0.47 Score=31.55 Aligned_cols=25 Identities=32% Similarity=0.931 Sum_probs=21.9
Q ss_pred CCeeeeCCCCccCCCCcccccCCCccC
Q psy11059 316 HRFNCTCPSGYHGKICEKCFCRPGFAG 342 (429)
Q Consensus 316 ~~~~C~C~~G~~G~~C~~C~C~~g~~g 342 (429)
.+.+|.|+++|+|..|+ .|++||+|
T Consensus 16 ~~G~C~C~~~~~G~~C~--~C~~g~~g 40 (46)
T smart00180 16 DTGQCECKPNVTGRRCD--RCAPGYYG 40 (46)
T ss_pred CCCEEECCCCCCCCCCC--cCCCCcCC
Confidence 35689999999999999 78999998
No 51
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.13 E-value=0.22 Score=31.25 Aligned_cols=30 Identities=17% Similarity=0.380 Sum_probs=21.3
Q ss_pred CCCCCCCCCEeccCCCCceEEecCCCCccc
Q psy11059 15 QRNPCQNGGKCNEDETGNYDCTCDALHTVC 44 (429)
Q Consensus 15 ~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~ 44 (429)
...+|..|+.|+....|++.|.|.+||..+
T Consensus 3 ~~~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 3 IDTKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred cCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 456899999999555599999999999944
No 52
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=88.22 E-value=0.71 Score=31.27 Aligned_cols=27 Identities=37% Similarity=0.860 Sum_probs=21.0
Q ss_pred CCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCC
Q psy11059 247 GNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKN 286 (429)
Q Consensus 247 ~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~ 286 (429)
.+.+|.|+++|+|..|+ .|.+||++..
T Consensus 17 ~~G~C~C~~~~~G~~C~-------------~C~~g~~~~~ 43 (50)
T cd00055 17 GTGQCECKPNTTGRRCD-------------RCAPGYYGLP 43 (50)
T ss_pred CCCEEeCCCcCCCCCCC-------------CCCCCCccCC
Confidence 45678999999999987 4778887653
No 53
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=85.70 E-value=0.34 Score=32.64 Aligned_cols=27 Identities=33% Similarity=0.752 Sum_probs=20.2
Q ss_pred CCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCC
Q psy11059 247 GNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKN 286 (429)
Q Consensus 247 ~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~ 286 (429)
.+.+|.|.++|+|..|+ .|.++|++..
T Consensus 16 ~~G~C~C~~~~~G~~C~-------------~C~~g~~~~~ 42 (49)
T PF00053_consen 16 STGQCVCKPGTTGPRCD-------------QCKPGYFGLP 42 (49)
T ss_dssp TCEEESBSTTEESTTS--------------EE-TTEECST
T ss_pred CCCEEeccccccCCcCc-------------CCCCcccccc
Confidence 56789999999999998 4777877653
No 54
>PHA02887 EGF-like protein; Provisional
Probab=84.72 E-value=0.85 Score=36.39 Aligned_cols=31 Identities=26% Similarity=0.530 Sum_probs=21.5
Q ss_pred CCCCCCcEec-cCCCCCeeEeCCCCCcCCCCCC
Q psy11059 233 NPCQNGGKCN-EDETGNYDCTCDALHTGDPCKH 264 (429)
Q Consensus 233 ~~C~~~g~C~-~~~~~~~~C~C~~g~~G~~C~~ 264 (429)
+.|. +|+|. ........|.|+.||+|..|++
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 4566 46783 2344567888888888888875
No 55
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.05 E-value=1.1 Score=40.90 Aligned_cols=39 Identities=26% Similarity=0.623 Sum_probs=31.6
Q ss_pred cCCCCCCCCCCCCCCCC-CCCCCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059 52 TLGSIHCETPISNQICT-TAPPCLNGATCRPQLTEQLYECVCPPGYK 97 (429)
Q Consensus 52 ~~~G~~C~~~~~~~~C~-~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~ 97 (429)
.+.+..|+ +++ +|. ..++|.+ .|. ++.++|.|.|.+||+
T Consensus 178 ~l~~~~C~-~~~--~C~~~~~~c~~--~C~--~~~g~~~c~c~~g~~ 217 (224)
T cd01475 178 KFQGKICV-VPD--LCATLSHVCQQ--VCI--STPGSYLCACTEGYA 217 (224)
T ss_pred hcccccCc-Cch--hhcCCCCCccc--eEE--cCCCCEEeECCCCcc
Confidence 34578897 677 897 4566765 799 999999999999997
No 56
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=83.84 E-value=0.47 Score=29.78 Aligned_cols=26 Identities=35% Similarity=0.622 Sum_probs=18.8
Q ss_pred CCCCCCCCeEeeCCCC-CCeeeeCCCCCc
Q psy11059 70 APPCLNGATCRPQLTE-QLYECVCPPGYK 97 (429)
Q Consensus 70 ~~~C~~~g~C~~~~~~-~~~~C~C~~Gy~ 97 (429)
...|..++.|+ +.. |+++|.|..||.
T Consensus 4 ~~~cP~NA~C~--~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 4 DTKCPANAGCF--RYDDGSEECRCLLGYK 30 (37)
T ss_dssp SS---TTEEEE--EETTSEEEEEE-TTEE
T ss_pred CccCCCCcccE--EcCCCCEEEEeeCCcc
Confidence 46778889999 655 899999999998
No 57
>PHA02887 EGF-like protein; Provisional
Probab=82.83 E-value=1.2 Score=35.47 Aligned_cols=29 Identities=31% Similarity=0.918 Sum_probs=21.4
Q ss_pred CCCCCCEEcc--CCCCeeeeCCCCCCCCCCCC
Q psy11059 394 PCLHGATCID--EVATFSCVCPKGLTGRLCET 423 (429)
Q Consensus 394 ~C~~~~~C~~--~~~~~~C~C~~g~~G~~C~~ 423 (429)
-|.+ |+|.- ......|+|.+||+|.+|+.
T Consensus 93 YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCIN-GECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred EeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence 4654 58854 34567899999999999984
No 58
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=82.25 E-value=1.1 Score=36.32 Aligned_cols=29 Identities=41% Similarity=1.045 Sum_probs=21.6
Q ss_pred CCCCCCEEcc--CCCCeeeeCCCCCCCCCCCC
Q psy11059 394 PCLHGATCID--EVATFSCVCPKGLTGRLCET 423 (429)
Q Consensus 394 ~C~~~~~C~~--~~~~~~C~C~~g~~G~~C~~ 423 (429)
-|.++ +|.- ....+.|+|..||+|.+||.
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCcccccccc
Confidence 46664 7854 34678899999999999985
No 59
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=82.23 E-value=1.7 Score=28.81 Aligned_cols=24 Identities=38% Similarity=0.894 Sum_probs=20.0
Q ss_pred CeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCC
Q psy11059 248 NYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGG 284 (429)
Q Consensus 248 ~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g 284 (429)
+.+|.|+++|+|..|+ .|.+||+|
T Consensus 17 ~G~C~C~~~~~G~~C~-------------~C~~g~~g 40 (46)
T smart00180 17 TGQCECKPNVTGRRCD-------------RCAPGYYG 40 (46)
T ss_pred CCEEECCCCCCCCCCC-------------cCCCCcCC
Confidence 5688999999999987 47888877
No 60
>KOG3516|consensus
Probab=80.46 E-value=1.5 Score=48.37 Aligned_cols=42 Identities=33% Similarity=0.822 Sum_probs=35.4
Q ss_pred CCCCCCCCCCCCCCEEccCCCCeeeeCC-CCCCCCCCCCCCCC
Q psy11059 386 NINECESSPCLHGATCIDEVATFSCVCP-KGLTGRLCETNIDD 427 (429)
Q Consensus 386 ~~~~C~~~~C~~~~~C~~~~~~~~C~C~-~g~~G~~C~~~i~~ 427 (429)
.++.|.+++|.+++.|......|.|.|. .||.|..|...|.|
T Consensus 544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e 586 (1306)
T KOG3516|consen 544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE 586 (1306)
T ss_pred cccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence 4677888999999999888888999998 89999999877654
No 61
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=79.64 E-value=1.8 Score=35.19 Aligned_cols=31 Identities=29% Similarity=0.636 Sum_probs=21.9
Q ss_pred CCCCCcEec-cCCCCCeeEeCCCCCcCCCCCCC
Q psy11059 234 PCQNGGKCN-EDETGNYDCTCDALHTGDPCKHG 265 (429)
Q Consensus 234 ~C~~~g~C~-~~~~~~~~C~C~~g~~G~~C~~~ 265 (429)
-|.++ +|. ......+.|.|..||+|..|++.
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCcccccccce
Confidence 46554 773 23446788999999999998863
No 62
>KOG3516|consensus
Probab=78.00 E-value=1.9 Score=47.68 Aligned_cols=47 Identities=32% Similarity=0.863 Sum_probs=39.4
Q ss_pred CCCCCCCCCCCCCCCCCEeccCCCCceEEecC-CCCcccccccccCcCCCCCCCCCCCCCCCC
Q psy11059 7 SLSSPCDAQRNPCQNGGKCNEDETGNYDCTCD-ALHTVCCVGLANQTLGSIHCETPISNQICT 68 (429)
Q Consensus 7 ~~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~-~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~ 68 (429)
-.+|.| .+|+|.++|.|. -....|.|.|. .||. |.+|+..+.+..|.
T Consensus 543 ~i~drC--lPN~CehgG~C~-Qs~~~f~C~C~~TGY~------------GatCHtsi~e~SCe 590 (1306)
T KOG3516|consen 543 GISDRC--LPNPCEHGGKCS-QSWDDFECNCELTGYK------------GATCHTSIYELSCE 590 (1306)
T ss_pred cccccc--CCccccCCCccc-ccccceeEeccccccc------------cccccCCCcchhhH
Confidence 346889 899999999999 58889999998 9999 99999877633454
No 63
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=75.90 E-value=3.4 Score=37.70 Aligned_cols=36 Identities=28% Similarity=0.733 Sum_probs=26.0
Q ss_pred CCCccCCCCCCCC--CCCCCCCEEccCCCCeeeeCCCCCCC
Q psy11059 380 GKECSININECES--SPCLHGATCIDEVATFSCVCPKGLTG 418 (429)
Q Consensus 380 G~~C~~~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~g~~G 418 (429)
+..|. ++++|.. ++|. ..|.++.|+|.|.|++||+.
T Consensus 181 ~~~C~-~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICV-VPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cccCc-CchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence 55664 6677753 3454 47888999999999999874
No 64
>KOG3514|consensus
Probab=74.52 E-value=2 Score=46.90 Aligned_cols=36 Identities=39% Similarity=0.927 Sum_probs=32.0
Q ss_pred CCCCCCCCCCCCCEeccCCCCceEEec-CCCCcccccccccCcCCCCCCCCC
Q psy11059 11 PCDAQRNPCQNGGKCNEDETGNYDCTC-DALHTVCCVGLANQTLGSIHCETP 61 (429)
Q Consensus 11 ~C~~~~~~C~~~g~C~~~~~~~~~C~C-~~g~~g~~~~~~~~~~~G~~C~~~ 61 (429)
.| .++||+|+|+|. ...+.|.|.| ..||. |+.||..
T Consensus 625 ~C--~~nPC~N~g~C~-egwNrfiCDCs~T~~~------------G~~CerE 661 (1591)
T KOG3514|consen 625 IC--ESNPCQNGGKCS-EGWNRFICDCSGTGFE------------GRTCERE 661 (1591)
T ss_pred cc--CCCcccCCCCcc-ccccccccccccCccc------------Cccccce
Confidence 68 899999999999 9999999999 57888 8888854
No 65
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=69.86 E-value=4.5 Score=32.37 Aligned_cols=29 Identities=24% Similarity=0.639 Sum_probs=23.7
Q ss_pred CCCCCCCCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059 66 ICTTAPPCLNGATCRPQLTEQLYECVCPPGYK 97 (429)
Q Consensus 66 ~C~~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~ 97 (429)
.|.....|...|.|. .. ....|.|++||+
T Consensus 79 ~Cd~y~~CG~~g~C~--~~-~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 79 QCDVYGFCGPNGICN--SN-NSPKCSCLPGFE 107 (110)
T ss_pred CCCCccccCCccEeC--CC-CCCceECCCCcC
Confidence 787678999999997 33 456799999996
No 66
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=69.44 E-value=2.3 Score=33.50 Aligned_cols=51 Identities=20% Similarity=0.537 Sum_probs=33.7
Q ss_pred CCCCCCCCCCCCCCCEeccCC-----CCceEEecCCCCcccccc-cccCcCCCCCCCC
Q psy11059 9 SSPCDAQRNPCQNGGKCNEDE-----TGNYDCTCDALHTVCCVG-LANQTLGSIHCET 60 (429)
Q Consensus 9 ~~~C~~~~~~C~~~g~C~~~~-----~~~~~C~C~~g~~g~~~~-~~~~~~~G~~C~~ 60 (429)
.+.|..+++.|++||.|+ .. ..=|.|.|.+.+.....+ .=...|.|..|+.
T Consensus 5 ~~aC~~~Tn~CsgHG~C~-~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqK 61 (103)
T PF12955_consen 5 NDACENATNNCSGHGSCV-KKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQK 61 (103)
T ss_pred HHHHHHhccCCCCCceEe-eccCCCccceEEEEeeccccccccccCceeeeccccccc
Confidence 456877899999999999 44 244899999866522100 0112455888873
No 67
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=66.83 E-value=1.6 Score=31.29 Aligned_cols=39 Identities=31% Similarity=0.724 Sum_probs=19.1
Q ss_pred CeeEeCCCCCcCCCCCCCccccC---CCCeee------eCCCCCCCCCC
Q psy11059 248 NYDCTCDALHTGDPCKHGSCVDK---RAGYFC------DCPPTYGGKNC 287 (429)
Q Consensus 248 ~~~C~C~~g~~G~~C~~~~C~~~---~~~~~C------~C~~G~~g~~c 287 (429)
.++-.|.+.|.|..|.. .|... .+.|+| +|.+||+|..|
T Consensus 16 ~~rv~C~~nyyG~~C~~-~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSK-FCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETTTT-E-E---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred EEEEECCCCCCCccccC-CcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence 45678899999998864 35443 344555 57888888765
No 68
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=63.04 E-value=8.2 Score=30.84 Aligned_cols=31 Identities=26% Similarity=0.598 Sum_probs=24.5
Q ss_pred CCCCCCCCCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 9 SSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 9 ~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
.|+|. ....|+.+|.|. . .....|.|.+||.
T Consensus 77 ~d~Cd-~y~~CG~~g~C~-~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 77 KDQCD-VYGFCGPNGICN-S-NNSPKCSCLPGFE 107 (110)
T ss_pred ccCCC-CccccCCccEeC-C-CCCCceECCCCcC
Confidence 46884 457999999997 3 3456799999997
No 69
>KOG3514|consensus
Probab=60.09 E-value=6.5 Score=43.19 Aligned_cols=36 Identities=42% Similarity=1.018 Sum_probs=31.6
Q ss_pred CCCCCCCCCCCEEccCCCCeeeeCC-CCCCCCCCCCC
Q psy11059 389 ECESSPCLHGATCIDEVATFSCVCP-KGLTGRLCETN 424 (429)
Q Consensus 389 ~C~~~~C~~~~~C~~~~~~~~C~C~-~g~~G~~C~~~ 424 (429)
.|.++||.++|+|......|.|.|. .+|.|+.|+..
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccce
Confidence 6888999999999998899999997 58999999864
No 70
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=57.35 E-value=23 Score=23.84 Aligned_cols=27 Identities=26% Similarity=0.685 Sum_probs=21.0
Q ss_pred CCCCCCCCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059 10 SPCDAQRNPCQNGGKCNEDETGNYDCTCDALHT 42 (429)
Q Consensus 10 ~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~ 42 (429)
+.|. ....|..++.|+ ..+|+|++||.
T Consensus 20 ~~C~-~~~qC~~~s~C~-----~g~C~C~~g~~ 46 (52)
T PF01683_consen 20 ESCE-SDEQCIGGSVCV-----NGRCQCPPGYV 46 (52)
T ss_pred CCCC-CcCCCCCcCEEc-----CCEeECCCCCE
Confidence 4564 455678889998 56999999998
No 71
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=48.60 E-value=6.8 Score=26.87 Aligned_cols=35 Identities=26% Similarity=0.501 Sum_probs=19.8
Q ss_pred CCCCCCCCEeccCC---CCceEEecCCCCcccccccccCcCCCCCCCCCC
Q psy11059 16 RNPCQNGGKCNEDE---TGNYDCTCDALHTVCCVGLANQTLGSIHCETPI 62 (429)
Q Consensus 16 ~~~C~~~g~C~~~~---~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~ 62 (429)
.-+|+.||+-..+. .|...|+|..-|. |+.|.+.+
T Consensus 16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~------------GpdCS~~~ 53 (56)
T PF04863_consen 16 AISCSGHGRAFLDGLIADGSPVCECNSCYG------------GPDCSTLI 53 (56)
T ss_dssp TS--TTSEE--TTS-EETTEE--EE-TTEE------------STTS-EE-
T ss_pred cCCcCCCCeeeeccccccCCccccccCCcC------------CCCcccCC
Confidence 44788999876432 5678899999999 99997543
No 72
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=34.53 E-value=30 Score=21.29 Aligned_cols=13 Identities=15% Similarity=0.194 Sum_probs=10.5
Q ss_pred ceEEecCCCCccc
Q psy11059 32 NYDCTCDALHTVC 44 (429)
Q Consensus 32 ~~~C~C~~g~~g~ 44 (429)
.++|.|++||..+
T Consensus 17 ~~~C~CPeGyIld 29 (34)
T PF09064_consen 17 PGQCFCPEGYILD 29 (34)
T ss_pred CCceeCCCceEec
Confidence 4589999999844
No 73
>KOG3509|consensus
Probab=31.00 E-value=93 Score=34.72 Aligned_cols=71 Identities=31% Similarity=0.724 Sum_probs=50.6
Q ss_pred CcCCCCCcCCCCccccCCCceEEecCCCCCCCCccCCCCCCCCCC-CCCCCEEccCCCCeeeeCCCCCCCCCC
Q psy11059 350 DECLSNPCFNGATCQNKINGYTCVCAPGYSGKECSININECESSP-CLHGATCIDEVATFSCVCPKGLTGRLC 421 (429)
Q Consensus 350 ~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~-C~~~~~C~~~~~~~~C~C~~g~~G~~C 421 (429)
+.|...++...+.|....-...|.|++||+|..|....+.+...+ =...++|....+.....|.++ .|...
T Consensus 407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg-~g~~~ 478 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG-AGAPT 478 (964)
T ss_pred CccccccCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC-CCCcc
Confidence 456667788888888888888999999999999975555554322 223456766655667788888 66554
No 74
>KOG3509|consensus
Probab=21.15 E-value=1.8e+02 Score=32.51 Aligned_cols=68 Identities=28% Similarity=0.501 Sum_probs=49.0
Q ss_pred CCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCC-CCCCCCCCeEeeCCCCC
Q psy11059 8 LSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTT-APPCLNGATCRPQLTEQ 86 (429)
Q Consensus 8 ~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~-~~~C~~~g~C~~~~~~~ 86 (429)
..++| ...||+..+.|. ...-..+|.|++||+ |..|+...+ .+.. .+-.. .++|. ...+
T Consensus 405 ~g~~c--~~~p~~~~g~c~-p~~~~~~c~c~~g~~------------G~~c~d~~~--~~~~~~~g~y-~~t~~--~~~~ 464 (964)
T KOG3509|consen 405 LGDVC--WRIPCQHDGPCL-QTLEGKQCLCPPGYT------------GDSCEDCMN--GCDRSPNGSY-LGTCV--PIQG 464 (964)
T ss_pred CCCcc--ccccCCCCcccc-ccccccceecccccc------------CchhhccCc--cccccCCccc-cceEe--ccCC
Confidence 45677 788999999998 888889999999999 888875544 4441 22222 26777 5555
Q ss_pred CeeeeCCCC
Q psy11059 87 LYECVCPPG 95 (429)
Q Consensus 87 ~~~C~C~~G 95 (429)
.....|.+|
T Consensus 465 ~~~~~c~pg 473 (964)
T KOG3509|consen 465 KRCEYCGPG 473 (964)
T ss_pred CcceeecCC
Confidence 566788888
No 75
>KOG0196|consensus
Probab=20.45 E-value=1.3e+02 Score=32.77 Aligned_cols=57 Identities=25% Similarity=0.541 Sum_probs=34.3
Q ss_pred CeeEeCCCCCcC----CCCCCCccccCCCCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCC
Q psy11059 248 NYDCTCDALHTG----DPCKHGSCVDKRAGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCP 323 (429)
Q Consensus 248 ~~~C~C~~g~~G----~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~ 323 (429)
...|.|.+||.- ..|+ .|++|+.-..- ....|. +|..+. .....++..|.|.
T Consensus 258 iG~C~C~aGye~~~~~~~C~-------------aCp~G~yK~~~--~~~~C~---~CP~~S------~s~~ega~~C~C~ 313 (996)
T KOG0196|consen 258 IGGCVCKAGYEEAENGKACQ-------------ACPPGTYKASQ--GDSLCL---PCPPNS------HSSSEGATSCTCE 313 (996)
T ss_pred cCceeecCCCCcccCCCcce-------------eCCCCcccCCC--CCCCCC---CCCCCC------CCCCCCCCccccc
Confidence 457999999963 3333 68888853321 123343 233332 2236778899999
Q ss_pred CCccC
Q psy11059 324 SGYHG 328 (429)
Q Consensus 324 ~G~~G 328 (429)
.||.-
T Consensus 314 ~gyyR 318 (996)
T KOG0196|consen 314 NGYYR 318 (996)
T ss_pred CCccc
Confidence 99863
Done!