Query psy11501
Match_columns 133
No_of_seqs 154 out of 1749
Neff 9.9
Searched_HMMs 46136
Date Fri Aug 16 20:01:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11501.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11501hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1225|consensus 99.5 5.5E-13 1.2E-17 99.8 9.5 107 8-133 237-365 (525)
2 KOG0994|consensus 99.4 4E-13 8.7E-18 106.2 5.7 128 2-133 997-1144(1758)
3 KOG0994|consensus 99.2 1.7E-11 3.7E-16 97.3 6.4 123 7-132 936-1095(1758)
4 KOG4289|consensus 99.1 1.7E-10 3.6E-15 93.5 7.1 85 2-93 1221-1317(2531)
5 KOG1219|consensus 99.1 3E-10 6.5E-15 95.0 6.9 97 23-132 3871-3974(4289)
6 KOG1225|consensus 99.0 1.2E-09 2.5E-14 82.2 7.0 71 8-93 268-342 (525)
7 KOG4289|consensus 98.8 4.8E-09 1E-13 85.3 4.5 83 34-128 1221-1308(2531)
8 KOG1219|consensus 98.6 4.4E-08 9.6E-13 82.7 5.1 84 2-93 3885-3978(4289)
9 KOG1836|consensus 98.6 2.5E-07 5.4E-12 77.8 8.4 79 23-103 732-822 (1705)
10 KOG1226|consensus 98.5 1E-06 2.2E-11 68.4 8.6 109 8-132 481-617 (783)
11 KOG1836|consensus 98.3 7.4E-06 1.6E-10 69.3 9.6 126 2-132 865-1018(1705)
12 smart00051 DSL delta serrate l 98.3 6.9E-07 1.5E-11 49.2 2.4 44 4-48 16-63 (63)
13 cd00055 EGF_Lam Laminin-type e 98.0 1E-05 2.2E-10 42.5 3.8 31 29-60 13-43 (50)
14 KOG1226|consensus 97.9 7.9E-05 1.7E-09 58.3 8.2 87 34-132 477-577 (783)
15 KOG3512|consensus 97.9 0.0001 2.2E-09 54.9 8.3 100 1-103 304-438 (592)
16 KOG1218|consensus 97.8 0.00028 6E-09 50.5 9.6 108 10-128 96-206 (316)
17 smart00180 EGF_Lam Laminin-typ 97.8 4.4E-05 9.6E-10 39.3 3.7 30 28-58 11-40 (46)
18 PF00053 Laminin_EGF: Laminin 97.8 8.6E-06 1.9E-10 42.5 0.8 33 28-61 11-43 (49)
19 smart00051 DSL delta serrate l 97.7 2.5E-05 5.5E-10 42.9 2.4 43 90-133 16-62 (63)
20 KOG4260|consensus 97.7 5.8E-05 1.3E-09 52.6 4.3 49 8-58 131-190 (350)
21 PF07974 EGF_2: EGF-like domai 97.7 6.8E-05 1.5E-09 35.5 3.1 26 23-48 7-32 (32)
22 KOG1217|consensus 97.5 0.0013 2.8E-08 49.1 9.7 121 8-132 155-304 (487)
23 cd00055 EGF_Lam Laminin-type e 97.5 0.00022 4.8E-09 37.3 3.8 31 72-103 13-43 (50)
24 PF07974 EGF_2: EGF-like domai 97.5 0.00014 3E-09 34.4 2.7 25 109-133 7-31 (32)
25 KOG1217|consensus 97.3 0.003 6.4E-08 47.2 9.2 119 8-132 113-263 (487)
26 PF00008 EGF: EGF-like domain 97.3 8.5E-05 1.8E-09 35.2 0.6 25 109-133 5-32 (32)
27 PF00053 Laminin_EGF: Laminin 97.3 0.00012 2.6E-09 38.1 1.0 33 71-104 11-43 (49)
28 KOG1214|consensus 97.2 0.0015 3.3E-08 52.1 7.0 116 8-132 719-859 (1289)
29 smart00180 EGF_Lam Laminin-typ 97.2 0.00058 1.3E-08 35.1 3.1 30 71-101 11-40 (46)
30 KOG1218|consensus 97.1 0.0074 1.6E-07 43.1 9.3 66 8-81 136-202 (316)
31 PF12661 hEGF: Human growth fa 97.1 0.00026 5.6E-09 26.6 0.9 13 36-48 1-13 (13)
32 KOG3512|consensus 97.0 0.0037 8.1E-08 46.9 7.1 109 22-132 278-425 (592)
33 PF01414 DSL: Delta serrate li 97.0 0.00016 3.4E-09 39.8 0.0 40 8-48 20-63 (63)
34 PF00008 EGF: EGF-like domain 96.7 0.00052 1.1E-08 32.4 0.6 25 66-90 5-32 (32)
35 smart00179 EGF_CA Calcium-bind 95.9 0.013 2.9E-07 28.2 3.0 27 23-49 10-39 (39)
36 KOG4260|consensus 95.9 0.011 2.3E-07 41.6 3.3 52 39-99 132-188 (350)
37 smart00179 EGF_CA Calcium-bind 95.7 0.026 5.7E-07 27.1 3.6 27 66-92 10-39 (39)
38 cd00054 EGF_CA Calcium-binding 95.5 0.024 5.3E-07 26.9 2.9 27 23-49 10-38 (38)
39 cd00053 EGF Epidermal growth f 95.2 0.036 7.7E-07 25.9 3.0 26 23-48 7-35 (36)
40 cd00054 EGF_CA Calcium-binding 95.2 0.05 1.1E-06 25.8 3.5 27 66-92 10-38 (38)
41 PF12947 EGF_3: EGF domain; I 94.9 0.012 2.6E-07 28.5 0.8 24 109-132 7-32 (36)
42 cd00053 EGF Epidermal growth f 94.8 0.072 1.6E-06 24.7 3.5 26 66-91 7-35 (36)
43 smart00181 EGF Epidermal growt 94.5 0.061 1.3E-06 25.3 2.9 24 24-48 8-34 (35)
44 PF01414 DSL: Delta serrate li 94.5 0.0084 1.8E-07 32.9 -0.4 21 112-133 42-62 (63)
45 KOG1214|consensus 94.1 0.066 1.4E-06 43.3 3.7 51 34-90 808-860 (1289)
46 PF07645 EGF_CA: Calcium-bindi 92.5 0.062 1.4E-06 26.8 0.9 22 109-130 11-34 (42)
47 PF01683 EB: EB module; Inter 86.8 1.4 3E-05 22.8 3.3 21 109-131 27-47 (52)
48 PF12662 cEGF: Complement Clr- 83.7 0.98 2.1E-05 19.7 1.5 11 77-87 1-11 (24)
49 KOG0196|consensus 83.6 2.1 4.6E-05 35.1 4.2 63 67-130 248-317 (996)
50 KOG0196|consensus 83.0 2.6 5.7E-05 34.6 4.5 63 24-87 248-317 (996)
51 PHA03099 epidermal growth fact 80.2 2.3 5E-05 26.7 2.7 17 34-50 66-82 (139)
52 PHA02887 EGF-like protein; Pro 79.6 2.4 5.2E-05 26.2 2.6 26 24-50 94-123 (126)
53 PHA02887 EGF-like protein; Pro 77.1 2.8 6.2E-05 25.9 2.4 26 67-93 94-123 (126)
54 PF09064 Tme5_EGF_like: Thromb 74.5 2.3 4.9E-05 20.2 1.2 17 115-131 11-28 (34)
55 PF00954 S_locus_glycop: S-loc 68.3 7.6 0.00017 23.4 3.0 29 60-88 79-108 (110)
56 KOG1388|consensus 66.4 4 8.7E-05 28.0 1.5 47 2-49 75-126 (217)
57 PF14670 FXa_inhibition: Coagu 66.1 2.6 5.7E-05 20.2 0.5 17 115-131 11-29 (36)
58 KOG3607|consensus 53.3 11 0.00023 30.8 2.1 27 23-50 631-657 (716)
59 KOG3509|consensus 33.0 64 0.0014 27.5 3.6 23 34-57 717-739 (964)
60 KOG3607|consensus 28.1 46 0.001 27.3 2.0 26 67-93 632-657 (716)
61 PF12955 DUF3844: Domain of un 25.7 1E+02 0.0022 18.7 2.7 41 23-63 14-61 (103)
62 KOG3514|consensus 22.2 71 0.0015 27.8 2.1 28 66-93 630-660 (1591)
63 PF04863 EGF_alliinase: Alliin 22.0 45 0.00097 17.7 0.6 28 23-50 18-51 (56)
No 1
>KOG1225|consensus
Probab=99.46 E-value=5.5e-13 Score=99.81 Aligned_cols=107 Identities=40% Similarity=1.120 Sum_probs=78.8
Q ss_pred ccccCCCCCCCCC-CC--CCCCCCEEecCCCeeecCCCCccCCCCc-cCCCCCCCCCCCCCCCCCCCCeecCCCCeeeCC
Q psy11501 8 VCGPGRFGQNCSQ-EC--QCRNGAECHPATGECSCQPGFTGSLCEE-RCPPGTHGPSCINRCRCQNGAICNPANGQCLCA 83 (133)
Q Consensus 8 ~C~~g~~g~~c~~-~C--~C~~~g~C~~~~~~C~C~~g~~G~~C~~-~C~~g~~g~~C~~~~~C~~~~~C~~~~~~C~C~ 83 (133)
.|..+|+|..|+. .| .|.+++.|. .+.|+|++||+|.+|++ .|+. .|+.++.+. .++|+|.
T Consensus 237 ~c~~~~~g~~c~~~~C~~~c~~~g~c~--~G~CIC~~Gf~G~dC~e~~Cp~-----------~cs~~g~~~--~g~CiC~ 301 (525)
T KOG1225|consen 237 ECPEGYFGPLCSTIYCPGGCTGRGQCV--EGRCICPPGFTGDDCDELVCPV-----------DCSGGGVCV--DGECICN 301 (525)
T ss_pred ecCCceeCCccccccCCCCCcccceEe--CCeEeCCCCCcCCCCCcccCCc-----------ccCCCceec--CCEeecC
Confidence 6888899988875 36 366677887 67999999999999985 2332 255555554 4588888
Q ss_pred CCcccCCCCC------------------CCCCCccCCCCCCCCCCCCCCeeccCCCceeCCCCCccCC
Q psy11501 84 PGWMGSVCNV------------------PCTPGMWGQGCTVPCECFNGASCHHVTGECQCEPGFKGQK 133 (133)
Q Consensus 84 ~g~~g~~c~~------------------~c~~g~~g~~c~~~c~C~~~g~C~~~~g~C~C~~g~~G~~ 133 (133)
++|.|..|++ .|.+||+|..|... .|.+++.|. ++ |+|..||.|.+
T Consensus 302 ~g~~G~dCs~~~cpadC~g~G~Ci~G~C~C~~Gy~G~~C~~~-~C~~~g~cv--~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 302 PGYSGKDCSIRRCPADCSGHGKCIDGECLCDEGYTGELCIQR-ACSGGGQCV--NG-CKCKKGWRGPD 365 (525)
T ss_pred CCccccccccccCCccCCCCCcccCCceEeCCCCcCCccccc-ccCCCceec--cC-ceeccCccCCC
Confidence 8888887753 46677777777766 488888884 56 99999998864
No 2
>KOG0994|consensus
Probab=99.40 E-value=4e-13 Score=106.15 Aligned_cols=128 Identities=37% Similarity=0.994 Sum_probs=99.0
Q ss_pred CCccccccccCCCCCCCCC---CCCCC-----CCCEEecCCCeeecCCCCccCCCCccCCCCCC----CCCCCCCCCCCC
Q psy11501 2 GTHCEEVCGPGRFGQNCSQ---ECQCR-----NGAECHPATGECSCQPGFTGSLCEERCPPGTH----GPSCINRCRCQN 69 (133)
Q Consensus 2 g~~c~~~C~~g~~g~~c~~---~C~C~-----~~g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~----g~~C~~~~~C~~ 69 (133)
|.+|+ .|.+||||+.-.. .|.|+ +...|+..+++|.|.++..|..|+ +|++.+| |..|+. +.|..
T Consensus 997 G~hCe-~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CD-qCA~N~w~laSG~GCe~-C~Cd~ 1073 (1758)
T KOG0994|consen 997 GDHCE-HCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCD-QCAENHWNLASGEGCEP-CNCDP 1073 (1758)
T ss_pred ccchh-hccccchhHHHHhhhhhheccccccCCccccccccCcCCCCccccccccc-ccccchhccccCCCCCc-cCCCc
Confidence 78898 9999999974332 23332 235677889999999999999998 8999887 555653 23443
Q ss_pred --CCeecCCCCeeeCCCCcccCCCCCCCCCCccCCCCC--CCCCCCCCC----eeccCCCceeCCCCCccCC
Q psy11501 70 --GAICNPANGQCLCAPGWMGSVCNVPCTPGMWGQGCT--VPCECFNGA----SCHHVTGECQCEPGFKGQK 133 (133)
Q Consensus 70 --~~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g~~c~--~~c~C~~~g----~C~~~~g~C~C~~g~~G~~ 133 (133)
+..|+..+++|+|.+||.|..|. +|.+-|||+.=. ..|+|+..| .|+..+|+|.|.+|..|.+
T Consensus 1074 ~~~pqCN~ftGQCqCkpGfGGR~C~-qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~r 1144 (1758)
T KOG0994|consen 1074 IGGPQCNEFTGQCQCKPGFGGRTCS-QCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPR 1144 (1758)
T ss_pred cCCccccccccceeccCCCCCcchh-HHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcc
Confidence 34688789999999999999999 899999988633 246776644 5888899999999988753
No 3
>KOG0994|consensus
Probab=99.24 E-value=1.7e-11 Score=97.26 Aligned_cols=123 Identities=39% Similarity=0.998 Sum_probs=89.2
Q ss_pred cccccCCCCCCCCC----------------CCCCCCC------CEEecCCCee-ecCCCCccCCCCccCCCCCCCCCCCC
Q psy11501 7 EVCGPGRFGQNCSQ----------------ECQCRNG------AECHPATGEC-SCQPGFTGSLCEERCPPGTHGPSCIN 63 (133)
Q Consensus 7 ~~C~~g~~g~~c~~----------------~C~C~~~------g~C~~~~~~C-~C~~g~~G~~C~~~C~~g~~g~~C~~ 63 (133)
++|.+||.|..|++ .|.|+++ +.|+..++.| .|..-..|.+|+ -|.+||+|..-.+
T Consensus 936 C~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe-~Ck~Gf~GdA~~q 1014 (1758)
T KOG0994|consen 936 CHCQEGYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCE-HCKDGFYGDALRQ 1014 (1758)
T ss_pred eecccCccccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchh-hccccchhHHHHh
Confidence 35666666665542 3456654 4566668888 588888999998 7999999864222
Q ss_pred C---CCCCC-----CCeecCCCCeeeCCCCcccCCCCCCCCCCcc----CCCCCCCCCCCC--CCeeccCCCceeCCCCC
Q psy11501 64 R---CRCQN-----GAICNPANGQCLCAPGWMGSVCNVPCTPGMW----GQGCTVPCECFN--GASCHHVTGECQCEPGF 129 (133)
Q Consensus 64 ~---~~C~~-----~~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~----g~~c~~~c~C~~--~g~C~~~~g~C~C~~g~ 129 (133)
. +.|.. ...|+..+++|.|.+...|.+|+ +|.+.+| |..|+ +|.|+. +.+|+..+|+|+|.+||
T Consensus 1015 ~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CD-qCA~N~w~laSG~GCe-~C~Cd~~~~pqCN~ftGQCqCkpGf 1092 (1758)
T KOG0994|consen 1015 NCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCD-QCAENHWNLASGEGCE-PCNCDPIGGPQCNEFTGQCQCKPGF 1092 (1758)
T ss_pred hhhhheccccccCCccccccccCcCCCCccccccccc-ccccchhccccCCCCC-ccCCCccCCccccccccceeccCCC
Confidence 1 22322 24566679999999999999999 8999887 55664 555643 45898889999999999
Q ss_pred ccC
Q psy11501 130 KGQ 132 (133)
Q Consensus 130 ~G~ 132 (133)
.|.
T Consensus 1093 GGR 1095 (1758)
T KOG0994|consen 1093 GGR 1095 (1758)
T ss_pred CCc
Confidence 885
No 4
>KOG4289|consensus
Probab=99.12 E-value=1.7e-10 Score=93.45 Aligned_cols=85 Identities=36% Similarity=0.906 Sum_probs=63.3
Q ss_pred CCccccccccCCCCCCCCC---CC---CCCCCCEEec--CCCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCee
Q psy11501 2 GTHCEEVCGPGRFGQNCSQ---EC---QCRNGAECHP--ATGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAIC 73 (133)
Q Consensus 2 g~~c~~~C~~g~~g~~c~~---~C---~C~~~g~C~~--~~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C 73 (133)
|..| +|++||+|+.|+. .| +|.+++.|.. .+|+|.|.++|+|.+|+..-..+ .|... -|.++++|
T Consensus 1221 glrC--rCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~ag----rCvpG-vC~nggtC 1293 (2531)
T KOG4289|consen 1221 GLRC--RCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAG----RCVPG-VCKNGGTC 1293 (2531)
T ss_pred ceeE--eCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccC----ccccc-eecCCCEE
Confidence 4555 7999999999985 46 7999999985 47899999999999998532221 13222 47888888
Q ss_pred cCC---CCeeeCCCC-cccCCCCC
Q psy11501 74 NPA---NGQCLCAPG-WMGSVCNV 93 (133)
Q Consensus 74 ~~~---~~~C~C~~g-~~g~~c~~ 93 (133)
.+. .+.|.|++| |.+++|+.
T Consensus 1294 ~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1294 VNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred eecCCCceeccCCCcccCCCceEE
Confidence 754 567899886 66777764
No 5
>KOG1219|consensus
Probab=99.08 E-value=3e-10 Score=95.01 Aligned_cols=97 Identities=31% Similarity=0.858 Sum_probs=78.6
Q ss_pred CCCCCCEEecC---CCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCeecCC--CCeeeCCCCcccCCCCCCCCC
Q psy11501 23 QCRNGAECHPA---TGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAICNPA--NGQCLCAPGWMGSVCNVPCTP 97 (133)
Q Consensus 23 ~C~~~g~C~~~---~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C~~~--~~~C~C~~g~~g~~c~~~c~~ 97 (133)
+|.++|+|... .|.|.|++-|.|.+|+....+ |... ||..+++|... .+.|.|+.+|+|.+|+..
T Consensus 3871 pCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~ep------C~sn-PC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~--- 3940 (4289)
T KOG1219|consen 3871 PCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEP------CASN-PCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR--- 3940 (4289)
T ss_pred cccCCCEecCCCCCceEEeCcccccCccccccccc------ccCC-CCCCCCEEEecCCCeeEeCCCCccCceeecc---
Confidence 79999999843 679999999999999965444 6554 89999999865 789999999999999863
Q ss_pred CccCCCCCCCCCCCCCCeeccCCC--ceeCCCCCccC
Q psy11501 98 GMWGQGCTVPCECFNGASCHHVTG--ECQCEPGFKGQ 132 (133)
Q Consensus 98 g~~g~~c~~~c~C~~~g~C~~~~g--~C~C~~g~~G~ 132 (133)
| -..|+.. +|.++|.|.+..| .|.|.+||.|.
T Consensus 3941 G--i~eCs~n-~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3941 G--ISECSKN-VCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred c--ccccccc-cccCCceeeccCCceEeccChhHhcc
Confidence 1 1234332 7999999999877 89999999875
No 6
>KOG1225|consensus
Probab=99.00 E-value=1.2e-09 Score=82.22 Aligned_cols=71 Identities=41% Similarity=1.171 Sum_probs=52.4
Q ss_pred ccccCCCCCCCCC-CCC--CCCCCEEecCCCeeecCCCCccCCCCcc-CCCCCCCCCCCCCCCCCCCCeecCCCCeeeCC
Q psy11501 8 VCGPGRFGQNCSQ-ECQ--CRNGAECHPATGECSCQPGFTGSLCEER-CPPGTHGPSCINRCRCQNGAICNPANGQCLCA 83 (133)
Q Consensus 8 ~C~~g~~g~~c~~-~C~--C~~~g~C~~~~~~C~C~~g~~G~~C~~~-C~~g~~g~~C~~~~~C~~~~~C~~~~~~C~C~ 83 (133)
+|++||+|.+|++ .|+ |+.++.+. .++|+|+++|+|..|+++ |+. .|.++|.|+ .++|.|+
T Consensus 268 IC~~Gf~G~dC~e~~Cp~~cs~~g~~~--~g~CiC~~g~~G~dCs~~~cpa-----------dC~g~G~Ci--~G~C~C~ 332 (525)
T KOG1225|consen 268 ICPPGFTGDDCDELVCPVDCSGGGVCV--DGECICNPGYSGKDCSIRRCPA-----------DCSGHGKCI--DGECLCD 332 (525)
T ss_pred eCCCCCcCCCCCcccCCcccCCCceec--CCEeecCCCccccccccccCCc-----------cCCCCCccc--CCceEeC
Confidence 7999999999988 574 66777776 559999999999999742 221 455666665 5667777
Q ss_pred CCcccCCCCC
Q psy11501 84 PGWMGSVCNV 93 (133)
Q Consensus 84 ~g~~g~~c~~ 93 (133)
+||+|..|..
T Consensus 333 ~Gy~G~~C~~ 342 (525)
T KOG1225|consen 333 EGYTGELCIQ 342 (525)
T ss_pred CCCcCCcccc
Confidence 7777776665
No 7
>KOG4289|consensus
Probab=98.81 E-value=4.8e-09 Score=85.34 Aligned_cols=83 Identities=39% Similarity=0.945 Sum_probs=62.9
Q ss_pred CCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCeecCC--CCeeeCCCCcccCCCCCCCCCCccCCCCCCCCCCC
Q psy11501 34 TGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAICNPA--NGQCLCAPGWMGSVCNVPCTPGMWGQGCTVPCECF 111 (133)
Q Consensus 34 ~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C~~~--~~~C~C~~g~~g~~c~~~c~~g~~g~~c~~~c~C~ 111 (133)
+..|.|++||+|++|+...+. |... +|.+++.|... .++|.|.++|+|.+|+..-..+ .| .+-.|.
T Consensus 1221 glrCrCPpGFTgd~CeTeiDl------CYs~-pC~nng~C~srEggYtCeCrpg~tGehCEvs~~ag----rC-vpGvC~ 1288 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGDYCETEIDL------CYSG-PCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAG----RC-VPGVCK 1288 (2531)
T ss_pred ceeEeCCCCCCcccccchhHh------hhcC-CCCCCCceEEecCceeEEecCCccccceeeecccC----cc-ccceec
Confidence 458999999999999977666 5544 89999999744 7899999999999998621111 22 123689
Q ss_pred CCCeeccCC-C--ceeCCCC
Q psy11501 112 NGASCHHVT-G--ECQCEPG 128 (133)
Q Consensus 112 ~~g~C~~~~-g--~C~C~~g 128 (133)
|+++|.+.. + .|+|+.|
T Consensus 1289 nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1289 NGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred CCCEEeecCCCceeccCCCc
Confidence 999998752 2 7999987
No 8
>KOG1219|consensus
Probab=98.63 E-value=4.4e-08 Score=82.73 Aligned_cols=84 Identities=35% Similarity=0.857 Sum_probs=68.3
Q ss_pred CCccccccccCCCCCCCCC---CC---CCCCCCEEecC--CCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCee
Q psy11501 2 GTHCEEVCGPGRFGQNCSQ---EC---QCRNGAECHPA--TGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAIC 73 (133)
Q Consensus 2 g~~c~~~C~~g~~g~~c~~---~C---~C~~~g~C~~~--~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C 73 (133)
|..| .|+..|.|..|+. +| ||..+|+|.+. .+.|.|+.+|+|..|+.+ | -..|+.. +|.+++.|
T Consensus 3885 gy~C--kCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~---G--i~eCs~n-~C~~gg~C 3956 (4289)
T KOG1219|consen 3885 GYKC--KCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR---G--ISECSKN-VCGTGGQC 3956 (4289)
T ss_pred ceEE--eCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc---c--ccccccc-cccCCcee
Confidence 4455 6999999999985 57 79999999865 678999999999999864 1 0236544 79999999
Q ss_pred cCC--CCeeeCCCCcccCCCCC
Q psy11501 74 NPA--NGQCLCAPGWMGSVCNV 93 (133)
Q Consensus 74 ~~~--~~~C~C~~g~~g~~c~~ 93 (133)
.+. .+.|.|.++|.|..|..
T Consensus 3957 ~n~~gsf~CncT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3957 INIPGSFHCNCTPGILGRTCCA 3978 (4289)
T ss_pred eccCCceEeccChhHhcccCcc
Confidence 877 56999999999998764
No 9
>KOG1836|consensus
Probab=98.59 E-value=2.5e-07 Score=77.83 Aligned_cols=79 Identities=33% Similarity=0.896 Sum_probs=62.5
Q ss_pred CCCCC-CEEecCCCeeecCCCCccCCCCccCCCCCCCCCCC------CCCCCCCCCeecCC----CCeee-CCCCcccCC
Q psy11501 23 QCRNG-AECHPATGECSCQPGFTGSLCEERCPPGTHGPSCI------NRCRCQNGAICNPA----NGQCL-CAPGWMGSV 90 (133)
Q Consensus 23 ~C~~~-g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g~~C~------~~~~C~~~~~C~~~----~~~C~-C~~g~~g~~ 90 (133)
.|..+ .+|+..++.|.|.+...|..|+ +|..||||..-. ++++|.+++.|... .+.|. |+++|+|.+
T Consensus 732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~-~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~r 810 (1705)
T KOG1836|consen 732 DCNGHSNICDPRTGQCKCKHNTFGGQCA-QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLR 810 (1705)
T ss_pred ccCCccccccCCCCceecccCCCCCchh-hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccc
Confidence 44443 5788889999999999999998 999999975321 24577777666543 57887 999999999
Q ss_pred CCCCCCCCccCCC
Q psy11501 91 CNVPCTPGMWGQG 103 (133)
Q Consensus 91 c~~~c~~g~~g~~ 103 (133)
|+ .|..+|++..
T Consensus 811 Ce-~c~dgyfg~p 822 (1705)
T KOG1836|consen 811 CE-ECADGYFGNP 822 (1705)
T ss_pred cc-cCCCccccCC
Confidence 99 7999998764
No 10
>KOG1226|consensus
Probab=98.48 E-value=1e-06 Score=68.43 Aligned_cols=109 Identities=32% Similarity=0.905 Sum_probs=69.4
Q ss_pred ccccCCCCCCCCC------------CC-------CCCCCCEEecCCCeeecCCCCc----cCCCCccCCCCCCCCCCCCC
Q psy11501 8 VCGPGRFGQNCSQ------------EC-------QCRNGAECHPATGECSCQPGFT----GSLCEERCPPGTHGPSCINR 64 (133)
Q Consensus 8 ~C~~g~~g~~c~~------------~C-------~C~~~g~C~~~~~~C~C~~g~~----G~~C~~~C~~g~~g~~C~~~ 64 (133)
.|.+||+|..|+- .| +|++.|.|. =|+|.|.+... |.+|+ |+. ..|...
T Consensus 481 ~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~--CGqC~C~~~~~~~i~G~fCE--CDn----fsC~r~ 552 (783)
T KOG1226|consen 481 RCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCV--CGQCVCHKPDNGKIYGKFCE--CDN----FSCERH 552 (783)
T ss_pred ecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEe--CCceEecCCCCCceeeeeee--ccC----cccccc
Confidence 6999999998851 13 477777776 67889987766 88886 332 113221
Q ss_pred --CCCCCCCeecCCCCeeeCCCCcccCCCCCCCCCCccCCCCCCCC--CCCCCCeeccCCCceeCCCC-CccC
Q psy11501 65 --CRCQNGAICNPANGQCLCAPGWMGSVCNVPCTPGMWGQGCTVPC--ECFNGASCHHVTGECQCEPG-FKGQ 132 (133)
Q Consensus 65 --~~C~~~~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g~~c~~~c--~C~~~g~C~~~~g~C~C~~g-~~G~ 132 (133)
..|..+|.|. -++|+|.+||+|+.|+. +. .-..|..+. .|...|+| .-++|+|... |.|.
T Consensus 553 ~g~lC~g~G~C~--CG~CvC~~GwtG~~C~C--~~--std~C~~~~G~iCSGrG~C--~Cg~C~C~~~~~sG~ 617 (783)
T KOG1226|consen 553 KGVLCGGHGRCE--CGRCVCNPGWTGSACNC--PL--STDTCESSDGQICSGRGTC--ECGRCKCTDPPYSGE 617 (783)
T ss_pred cCcccCCCCeEe--CCcEEcCCCCccCCCCC--CC--CCccccCCCCceeCCCcee--eCCceEcCCCCcCcc
Confidence 2477777776 78999999999999964 21 111222211 35556666 3567777654 6664
No 11
>KOG1836|consensus
Probab=98.27 E-value=7.4e-06 Score=69.32 Aligned_cols=126 Identities=37% Similarity=0.979 Sum_probs=93.7
Q ss_pred CCccccccccCCCCCCCC----CC---CCCCCC------CEEecCCCeeecCCCCccCCCCccCCCCCCCCC----CCCC
Q psy11501 2 GTHCEEVCGPGRFGQNCS----QE---CQCRNG------AECHPATGECSCQPGFTGSLCEERCPPGTHGPS----CINR 64 (133)
Q Consensus 2 g~~c~~~C~~g~~g~~c~----~~---C~C~~~------g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g~~----C~~~ 64 (133)
|.+|+ .|.++|+|+.-. .. |.|... .+|.+.+++|.|.+...|..|. .|.+++++.. |..
T Consensus 865 g~~cd-~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tGQcec~~~v~g~~c~-~c~~g~fnl~s~~gC~~- 941 (1705)
T KOG1836|consen 865 GEYCD-LCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTGQCECKPNVEGRDCL-YCFKGFFNLNSGVGCEP- 941 (1705)
T ss_pred ccccc-ccccCccccccCCCcCCccccccCccCCcccccccCCCcccceeccCCCCccccc-cccccccccCCCCCccc-
Confidence 67788 999999998654 22 333322 3577779999999999999997 8999998765 321
Q ss_pred CCCCCC----CeecCCCCeeeCCCCcccCCCCCCCCCCccCC---CCCCCCCCCCCC----eeccCCCceeCCCCCccC
Q psy11501 65 CRCQNG----AICNPANGQCLCAPGWMGSVCNVPCTPGMWGQ---GCTVPCECFNGA----SCHHVTGECQCEPGFKGQ 132 (133)
Q Consensus 65 ~~C~~~----~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g~---~c~~~c~C~~~g----~C~~~~g~C~C~~g~~G~ 132 (133)
+.|... ..|+..+++|.|.++.+|.+|. +|.+.+++. .|. .|.|...| .|+..+|+|.|.+++.|.
T Consensus 942 c~c~~~gs~~~~c~~~tGqc~c~~gVtgqrc~-qc~~~~~~~~~~gc~-~c~c~~~Gs~~~qc~~~~G~c~c~~~~~g~ 1018 (1705)
T KOG1836|consen 942 CNCDPTGSESSDCDVGTGQCYCRPGVTGQRCD-QCETYHFGFQTEGCG-LCECDPLGSRGFQCDPEDGQCPCRPGFEGR 1018 (1705)
T ss_pred ccccccccccccccccCCceeeecCccccccC-ccccCcccccccCCc-ceecccCCcccceecccCCeeeecCCCCCc
Confidence 234332 3677779999999999999999 788776654 332 35666554 688889999999999874
No 12
>smart00051 DSL delta serrate ligand.
Probab=98.26 E-value=6.9e-07 Score=49.16 Aligned_cols=44 Identities=23% Similarity=0.682 Sum_probs=36.0
Q ss_pred ccccccccCCCCCCCCCCCCC----CCCCEEecCCCeeecCCCCccCCC
Q psy11501 4 HCEEVCGPGRFGQNCSQECQC----RNGAECHPATGECSCQPGFTGSLC 48 (133)
Q Consensus 4 ~c~~~C~~g~~g~~c~~~C~C----~~~g~C~~~~~~C~C~~g~~G~~C 48 (133)
.+...|+++|+|..|+..|.. ..+.+|+. .+.+.|.+||+|+.|
T Consensus 16 ~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCEecCCCCcCCCC
Confidence 344579999999999988854 56678875 789999999999876
No 13
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=98.03 E-value=1e-05 Score=42.47 Aligned_cols=31 Identities=48% Similarity=1.237 Sum_probs=27.3
Q ss_pred EEecCCCeeecCCCCccCCCCccCCCCCCCCC
Q psy11501 29 ECHPATGECSCQPGFTGSLCEERCPPGTHGPS 60 (133)
Q Consensus 29 ~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g~~ 60 (133)
.|+..+++|.|.++|+|..|+ .|.++|++..
T Consensus 13 ~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~ 43 (50)
T cd00055 13 QCDPGTGQCECKPNTTGRRCD-RCAPGYYGLP 43 (50)
T ss_pred cccCCCCEEeCCCcCCCCCCC-CCCCCCccCC
Confidence 487778999999999999998 8999998753
No 14
>KOG1226|consensus
Probab=97.90 E-value=7.9e-05 Score=58.30 Aligned_cols=87 Identities=30% Similarity=0.786 Sum_probs=53.5
Q ss_pred CCeeecCCCCccCCCCccCCCCCCCC-----CCCCC---CCCCCCCeecCCCCeeeCCCCcc----cCCCCCCCCCCccC
Q psy11501 34 TGECSCQPGFTGSLCEERCPPGTHGP-----SCINR---CRCQNGAICNPANGQCLCAPGWM----GSVCNVPCTPGMWG 101 (133)
Q Consensus 34 ~~~C~C~~g~~G~~C~~~C~~g~~g~-----~C~~~---~~C~~~~~C~~~~~~C~C~~g~~----g~~c~~~c~~g~~g 101 (133)
=|.|.|.+||.|+.|+ |+.+-+.. .|... -.|+..|.|. =++|+|.+... |..|+. ..
T Consensus 477 CG~C~C~~G~~G~~CE--C~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~--CGqC~C~~~~~~~i~G~fCEC--Dn---- 546 (783)
T KOG1226|consen 477 CGQCRCDEGWLGKKCE--CSTDELSSSEEEDKCRENSDSPVCSGRGDCV--CGQCVCHKPDNGKIYGKFCEC--DN---- 546 (783)
T ss_pred ecceecCCCCCCCccc--CCccccCcHhHHhhccCCCCCCCcCCCCcEe--CCceEecCCCCCceeeeeeec--cC----
Confidence 3689999999999997 55432222 12211 1477777775 67888877665 666653 10
Q ss_pred CCCCCC--CCCCCCCeeccCCCceeCCCCCccC
Q psy11501 102 QGCTVP--CECFNGASCHHVTGECQCEPGFKGQ 132 (133)
Q Consensus 102 ~~c~~~--c~C~~~g~C~~~~g~C~C~~g~~G~ 132 (133)
..|... -.|.++|+| .-|+|+|.+||+|.
T Consensus 547 fsC~r~~g~lC~g~G~C--~CG~CvC~~GwtG~ 577 (783)
T KOG1226|consen 547 FSCERHKGVLCGGHGRC--ECGRCVCNPGWTGS 577 (783)
T ss_pred cccccccCcccCCCCeE--eCCcEEcCCCCccC
Confidence 112111 036667777 45788888888885
No 15
>KOG3512|consensus
Probab=97.90 E-value=0.0001 Score=54.93 Aligned_cols=100 Identities=31% Similarity=0.783 Sum_probs=68.3
Q ss_pred CCCccccccccCCCCCCCCC----------CCCCCCCCE-E------ec-----CCCee-ecCCCCccCCCCccCCCCCC
Q psy11501 1 MGTHCEEVCGPGRFGQNCSQ----------ECQCRNGAE-C------HP-----ATGEC-SCQPGFTGSLCEERCPPGTH 57 (133)
Q Consensus 1 ~g~~c~~~C~~g~~g~~c~~----------~C~C~~~g~-C------~~-----~~~~C-~C~~g~~G~~C~~~C~~g~~ 57 (133)
+|+-|+ .|.+.|+...-.. .|.|..++. | .. +.+.| .|..+..|.+|. -|.+||+
T Consensus 304 aGPdCg-rCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhCh-yCreGyy 381 (592)
T KOG3512|consen 304 AGPDCG-RCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHCH-YCREGYY 381 (592)
T ss_pred CCCCcc-cccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecccCCCCcccc-cccCccc
Confidence 467777 8888887653221 234544332 2 11 13567 599999999997 8999998
Q ss_pred CCCCCC--------CCCCCC----CCeecCCCCeeeCCCCcccCCCCCCCCCCccCCC
Q psy11501 58 GPSCIN--------RCRCQN----GAICNPANGQCLCAPGWMGSVCNVPCTPGMWGQG 103 (133)
Q Consensus 58 g~~C~~--------~~~C~~----~~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g~~ 103 (133)
.+.-.. .+.|.. +-+|++.+++|.|.+|-+|..|. .|.+||.-..
T Consensus 382 Rd~s~pl~hrkaCk~CdChpVGs~gktCNq~tGqCpCkeGvtG~tCn-rCa~gyqqsr 438 (592)
T KOG3512|consen 382 RDGSKPLTHRKACKACDCHPVGSAGKTCNQTTGQCPCKEGVTGLTCN-RCAPGYQQSR 438 (592)
T ss_pred cCCCCCCchhhhhhhcCCcccccccccccccCCcccCCCCCcccccc-cccchhhccc
Confidence 654321 123433 34687789999999999999999 7999997443
No 16
>KOG1218|consensus
Probab=97.83 E-value=0.00028 Score=50.46 Aligned_cols=108 Identities=37% Similarity=0.905 Sum_probs=67.2
Q ss_pred ccCCCCCCCCCCCCCCCC---CEEecCCCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCeecCCCCeeeCCCCc
Q psy11501 10 GPGRFGQNCSQECQCRNG---AECHPATGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAICNPANGQCLCAPGW 86 (133)
Q Consensus 10 ~~g~~g~~c~~~C~C~~~---g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C~~~~~~C~C~~g~ 86 (133)
..+|.|..|...+++... -+|.+....|.+..+|.+..|.. ++|+|..|...+ .....+......|.|.+||
T Consensus 96 ~~~~~g~~C~~~~~~~~~c~~~~C~~~~~~c~~~~~~~~~~C~~---~~~~g~~C~~~c--~~~~~~~~~~~~c~c~~g~ 170 (316)
T KOG1218|consen 96 LNGYEGPQCESPCPCGDGCAEKTCANPRRECRCGGGYIGEQCGE---ENLVGLKCQRDC--QCTGGCDCKNGICTCQPGF 170 (316)
T ss_pred CCCCCcccccCCCCcCCcccccccCCCccceecCCcCccccccc---cCCCCCCccCCC--CCccccCCCCCceeccCCc
Confidence 466777777776665543 45554333678888888888863 678888888765 2233444447788999999
Q ss_pred ccCCCCCCCCCCccCCCCCCCCCCCCCCeeccCCCceeCCCC
Q psy11501 87 MGSVCNVPCTPGMWGQGCTVPCECFNGASCHHVTGECQCEPG 128 (133)
Q Consensus 87 ~g~~c~~~c~~g~~g~~c~~~c~C~~~g~C~~~~g~C~C~~g 128 (133)
.+..+...+.. |...+.+.+++.|+...+.+.+.+.
T Consensus 171 ~g~~~~~~~~~------c~~~~~~~~g~~C~~~~~~~~~~~~ 206 (316)
T KOG1218|consen 171 VGVFCVESCSG------CSPLTACENGAKCNRSTGSCLCYPG 206 (316)
T ss_pred ccccccccCCC------cCCCcccCCCCeeeccccccccCCC
Confidence 99988754321 2233345555566555444444333
No 17
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=97.79 E-value=4.4e-05 Score=39.30 Aligned_cols=30 Identities=50% Similarity=1.301 Sum_probs=27.0
Q ss_pred CEEecCCCeeecCCCCccCCCCccCCCCCCC
Q psy11501 28 AECHPATGECSCQPGFTGSLCEERCPPGTHG 58 (133)
Q Consensus 28 g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g 58 (133)
..|+..+++|.|+++|+|..|+ +|+++|+|
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~-~C~~g~~g 40 (46)
T smart00180 11 GTCDPDTGQCECKPNVTGRRCD-RCAPGYYG 40 (46)
T ss_pred CcccCCCCEEECCCCCCCCCCC-cCCCCcCC
Confidence 4677778999999999999998 89999998
No 18
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.77 E-value=8.6e-06 Score=42.52 Aligned_cols=33 Identities=45% Similarity=1.185 Sum_probs=27.2
Q ss_pred CEEecCCCeeecCCCCccCCCCccCCCCCCCCCC
Q psy11501 28 AECHPATGECSCQPGFTGSLCEERCPPGTHGPSC 61 (133)
Q Consensus 28 g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g~~C 61 (133)
..|+..+++|.|.++|+|+.|+ +|.++|++...
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~~ 43 (49)
T PF00053_consen 11 QTCDPSTGQCVCKPGTTGPRCD-QCKPGYFGLPS 43 (49)
T ss_dssp SSEEETCEEESBSTTEESTTS--EE-TTEECSTT
T ss_pred CcccCCCCEEeccccccCCcCc-CCCCccccccC
Confidence 4788889999999999999998 79999988643
No 19
>smart00051 DSL delta serrate ligand.
Probab=97.74 E-value=2.5e-05 Score=42.94 Aligned_cols=43 Identities=28% Similarity=0.714 Sum_probs=29.0
Q ss_pred CCCCCCCCCccCCCCCCCCC----CCCCCeeccCCCceeCCCCCccCC
Q psy11501 90 VCNVPCTPGMWGQGCTVPCE----CFNGASCHHVTGECQCEPGFKGQK 133 (133)
Q Consensus 90 ~c~~~c~~g~~g~~c~~~c~----C~~~g~C~~~~g~C~C~~g~~G~~ 133 (133)
.+...|.++|+|..|...|. ...+.+|+. .|.++|++||+|.+
T Consensus 16 ~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~~C~~Gw~G~~ 62 (63)
T smart00051 16 QIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNKGCLEGWMGPY 62 (63)
T ss_pred EEEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCEecCCCCcCCC
Confidence 33445555555555554443 466788975 68999999999974
No 20
>KOG4260|consensus
Probab=97.72 E-value=5.8e-05 Score=52.57 Aligned_cols=49 Identities=37% Similarity=1.034 Sum_probs=40.1
Q ss_pred ccccCCCCCCCCCCC------CCCCCCEEec-----CCCeeecCCCCccCCCCccCCCCCCC
Q psy11501 8 VCGPGRFGQNCSQEC------QCRNGAECHP-----ATGECSCQPGFTGSLCEERCPPGTHG 58 (133)
Q Consensus 8 ~C~~g~~g~~c~~~C------~C~~~g~C~~-----~~~~C~C~~g~~G~~C~~~C~~g~~g 58 (133)
-|+.|-||.+|.. | +|+.+|.|.- +++.|.|.+||+|+.|. .|.++|+-
T Consensus 131 CCp~gtyGpdCl~-Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-~Cg~eyfe 190 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQ-CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-YCGIEYFE 190 (350)
T ss_pred ccCCCCcCCcccc-CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-ccchHHHH
Confidence 4899999999853 4 5888888862 36899999999999998 88888763
No 21
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.68 E-value=6.8e-05 Score=35.51 Aligned_cols=26 Identities=35% Similarity=0.993 Sum_probs=22.7
Q ss_pred CCCCCCEEecCCCeeecCCCCccCCC
Q psy11501 23 QCRNGAECHPATGECSCQPGFTGSLC 48 (133)
Q Consensus 23 ~C~~~g~C~~~~~~C~C~~g~~G~~C 48 (133)
.|+++|+|+...++|+|.++|+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 47889999866799999999999876
No 22
>KOG1217|consensus
Probab=97.53 E-value=0.0013 Score=49.12 Aligned_cols=121 Identities=40% Similarity=1.010 Sum_probs=77.0
Q ss_pred ccccCCCCCCCCC---CC-----CCCCCCEEecC--CCeeecCCCCccCCCCcc-------------CCCCCCCCCCCCC
Q psy11501 8 VCGPGRFGQNCSQ---EC-----QCRNGAECHPA--TGECSCQPGFTGSLCEER-------------CPPGTHGPSCINR 64 (133)
Q Consensus 8 ~C~~g~~g~~c~~---~C-----~C~~~g~C~~~--~~~C~C~~g~~G~~C~~~-------------C~~g~~g~~C~~~ 64 (133)
.|..+|.+..+.. .| .|.+.+.|.+. .+.|.|+++|.+..++.. +..++.+..|...
T Consensus 155 ~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~ 234 (487)
T KOG1217|consen 155 SCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVS 234 (487)
T ss_pred eeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccc
Confidence 6899999887763 34 37777788754 468999999999988743 3344555444432
Q ss_pred -CCCCCC-CeecCC--CCeeeCCCCcccCCCCCCCCCCccCCCCCCCCCCCCCCeeccCC--CceeCCCCCccC
Q psy11501 65 -CRCQNG-AICNPA--NGQCLCAPGWMGSVCNVPCTPGMWGQGCTVPCECFNGASCHHVT--GECQCEPGFKGQ 132 (133)
Q Consensus 65 -~~C~~~-~~C~~~--~~~C~C~~g~~g~~c~~~c~~g~~g~~c~~~c~C~~~g~C~~~~--g~C~C~~g~~G~ 132 (133)
..+... +.|.+. .++|.++++|.+..+. .+. ....|...-.|.++++|.+.. ..|.|+++|+|.
T Consensus 235 ~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~-~~~---~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~ 304 (487)
T KOG1217|consen 235 IVECASGDGTCVNTVGSYTCRCPEGYTGDACV-TCV---DVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR 304 (487)
T ss_pred cccccCCCCcccccCCceeeeCCCCccccccc-eee---eccccCCCCccCCCCeeecCCCcceeeCCCCCCCC
Confidence 123322 566544 4688888888887630 000 011222211377788998765 489999999986
No 23
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.50 E-value=0.00022 Score=37.30 Aligned_cols=31 Identities=42% Similarity=1.119 Sum_probs=27.1
Q ss_pred eecCCCCeeeCCCCcccCCCCCCCCCCccCCC
Q psy11501 72 ICNPANGQCLCAPGWMGSVCNVPCTPGMWGQG 103 (133)
Q Consensus 72 ~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g~~ 103 (133)
.|+..+++|.|.++|.|.+|+ +|.++|++..
T Consensus 13 ~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~ 43 (50)
T cd00055 13 QCDPGTGQCECKPNTTGRRCD-RCAPGYYGLP 43 (50)
T ss_pred cccCCCCEEeCCCcCCCCCCC-CCCCCCccCC
Confidence 476668999999999999999 8999998764
No 24
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.49 E-value=0.00014 Score=34.44 Aligned_cols=25 Identities=28% Similarity=0.857 Sum_probs=22.0
Q ss_pred CCCCCCeeccCCCceeCCCCCccCC
Q psy11501 109 ECFNGASCHHVTGECQCEPGFKGQK 133 (133)
Q Consensus 109 ~C~~~g~C~~~~g~C~C~~g~~G~~ 133 (133)
.|.++|+|+...++|+|.+||+|+.
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPD 31 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCC
Confidence 5889999987779999999999974
No 25
>KOG1217|consensus
Probab=97.30 E-value=0.003 Score=47.23 Aligned_cols=119 Identities=37% Similarity=0.937 Sum_probs=77.2
Q ss_pred ccccCCCCCCCCCC--CCC-----CCCCEEecC-----CCeeecCCCCccCCCCccCCCCCCCCCCCC-CCCCCCCCeec
Q psy11501 8 VCGPGRFGQNCSQE--CQC-----RNGAECHPA-----TGECSCQPGFTGSLCEERCPPGTHGPSCIN-RCRCQNGAICN 74 (133)
Q Consensus 8 ~C~~g~~g~~c~~~--C~C-----~~~g~C~~~-----~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~-~~~C~~~~~C~ 74 (133)
.|+.||.+..+... |.- ...+.|... .+.|.|..+|.+..++..... |.. ...|.+++.|.
T Consensus 113 ~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~------C~~~~~~c~~~~~C~ 186 (487)
T KOG1217|consen 113 TCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDE------CIQYSSPCQNGGTCV 186 (487)
T ss_pred eCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccc------cccCCCCcCCCcccc
Confidence 58889999888763 521 233445432 467899999999888743222 442 22477777886
Q ss_pred CC--CCeeeCCCCcccCCCCCC-------------CCCCccCCCCCCC-CCCCCC-CeeccCCC--ceeCCCCCccC
Q psy11501 75 PA--NGQCLCAPGWMGSVCNVP-------------CTPGMWGQGCTVP-CECFNG-ASCHHVTG--ECQCEPGFKGQ 132 (133)
Q Consensus 75 ~~--~~~C~C~~g~~g~~c~~~-------------c~~g~~g~~c~~~-c~C~~~-g~C~~~~g--~C~C~~g~~G~ 132 (133)
+. .+.|.|+++|.+..++.. +.+++.+..+... ..+... ++|.+..+ +|.|+++|.+.
T Consensus 187 ~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~ 263 (487)
T KOG1217|consen 187 NTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGD 263 (487)
T ss_pred cCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCcccc
Confidence 55 468999999999887642 3445555555432 234433 77876543 79999999875
No 26
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.30 E-value=8.5e-05 Score=35.19 Aligned_cols=25 Identities=40% Similarity=1.242 Sum_probs=21.5
Q ss_pred CCCCCCeeccCC-C--ceeCCCCCccCC
Q psy11501 109 ECFNGASCHHVT-G--ECQCEPGFKGQK 133 (133)
Q Consensus 109 ~C~~~g~C~~~~-g--~C~C~~g~~G~~ 133 (133)
+|.++|+|.... . +|+|++||+|++
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 789999998765 3 899999999975
No 27
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.26 E-value=0.00012 Score=38.10 Aligned_cols=33 Identities=42% Similarity=1.181 Sum_probs=26.6
Q ss_pred CeecCCCCeeeCCCCcccCCCCCCCCCCccCCCC
Q psy11501 71 AICNPANGQCLCAPGWMGSVCNVPCTPGMWGQGC 104 (133)
Q Consensus 71 ~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g~~c 104 (133)
..|+..+++|.|.++|.|++|+ +|.++|++...
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~~ 43 (49)
T PF00053_consen 11 QTCDPSTGQCVCKPGTTGPRCD-QCKPGYFGLPS 43 (49)
T ss_dssp SSEEETCEEESBSTTEESTTS--EE-TTEECSTT
T ss_pred CcccCCCCEEeccccccCCcCc-CCCCccccccC
Confidence 4677778999999999999999 79999997653
No 28
>KOG1214|consensus
Probab=97.22 E-value=0.0015 Score=52.08 Aligned_cols=116 Identities=30% Similarity=0.838 Sum_probs=65.1
Q ss_pred ccccCCCCC--CCCC--CC-----CCCCCCEEecC--CCeeecCCCCc--cC--CCCccCCCCCCCCCCCCC-CCCCCCC
Q psy11501 8 VCGPGRFGQ--NCSQ--EC-----QCRNGAECHPA--TGECSCQPGFT--GS--LCEERCPPGTHGPSCINR-CRCQNGA 71 (133)
Q Consensus 8 ~C~~g~~g~--~c~~--~C-----~C~~~g~C~~~--~~~C~C~~g~~--G~--~C~~~C~~g~~g~~C~~~-~~C~~~~ 71 (133)
.|..+|.|+ .|.+ +| .|..+..|.+. +++|.|..+|. ++ .|...-++. --+.|+.. ..|...+
T Consensus 719 ecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pa-p~n~Ce~g~h~C~i~g 797 (1289)
T KOG1214|consen 719 ECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPA-PANPCEDGSHTCAIAG 797 (1289)
T ss_pred EEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCC-CCCccccCccccCcCC
Confidence 366667654 4554 23 47778888753 66777776663 33 343211110 01123322 1233323
Q ss_pred e--ecCC---CCeeeCCCCcccCC--CCCCCCCCccCCCCCCCCCCCCCCeeccCCC--ceeCCCCCccC
Q psy11501 72 I--CNPA---NGQCLCAPGWMGSV--CNVPCTPGMWGQGCTVPCECFNGASCHHVTG--ECQCEPGFKGQ 132 (133)
Q Consensus 72 ~--C~~~---~~~C~C~~g~~g~~--c~~~c~~g~~g~~c~~~c~C~~~g~C~~~~g--~C~C~~g~~G~ 132 (133)
. |+.. ++.|.|.+||.|+- |.. ++.|. +-.|...++|.+..+ .|.|.+||.|+
T Consensus 798 ~a~c~~hGgs~y~C~CLPGfsGDG~~c~d-------vDeC~-psrChp~A~CyntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 798 QARCVHHGGSTYSCACLPGFSGDGHQCTD-------VDECS-PSRCHPAATCYNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred ceEEEecCCceEEEeecCCccCCcccccc-------ccccC-ccccCCCceEecCCCcceeecccCccCC
Confidence 2 3222 67888888888752 221 12232 226778899988766 89999999986
No 29
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=97.18 E-value=0.00058 Score=35.07 Aligned_cols=30 Identities=43% Similarity=1.213 Sum_probs=26.4
Q ss_pred CeecCCCCeeeCCCCcccCCCCCCCCCCccC
Q psy11501 71 AICNPANGQCLCAPGWMGSVCNVPCTPGMWG 101 (133)
Q Consensus 71 ~~C~~~~~~C~C~~g~~g~~c~~~c~~g~~g 101 (133)
..|+..+++|.|+++++|.+|+ +|.++|++
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~-~C~~g~~g 40 (46)
T smart00180 11 GTCDPDTGQCECKPNVTGRRCD-RCAPGYYG 40 (46)
T ss_pred CcccCCCCEEECCCCCCCCCCC-cCCCCcCC
Confidence 3566668999999999999999 89999998
No 30
>KOG1218|consensus
Probab=97.11 E-value=0.0074 Score=43.13 Aligned_cols=66 Identities=42% Similarity=1.044 Sum_probs=38.1
Q ss_pred cccc-CCCCCCCCCCCCCCCCCEEecCCCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCeecCCCCeee
Q psy11501 8 VCGP-GRFGQNCSQECQCRNGAECHPATGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAICNPANGQCL 81 (133)
Q Consensus 8 ~C~~-g~~g~~c~~~C~C~~~g~C~~~~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C~~~~~~C~ 81 (133)
+|.+ +|+|..|...|. ....+....+.|.|.+||.|..+...+.. |.....+.+++.|+...+.+.
T Consensus 136 ~C~~~~~~g~~C~~~c~--~~~~~~~~~~~c~c~~g~~g~~~~~~~~~------c~~~~~~~~g~~C~~~~~~~~ 202 (316)
T KOG1218|consen 136 QCGEENLVGLKCQRDCQ--CTGGCDCKNGICTCQPGFVGVFCVESCSG------CSPLTACENGAKCNRSTGSCL 202 (316)
T ss_pred cccccCCCCCCccCCCC--CccccCCCCCceeccCCcccccccccCCC------cCCCcccCCCCeeeccccccc
Confidence 4444 777777776662 22334445778889999999888754332 333334444445554433333
No 31
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.09 E-value=0.00026 Score=26.56 Aligned_cols=13 Identities=54% Similarity=1.502 Sum_probs=8.1
Q ss_pred eeecCCCCccCCC
Q psy11501 36 ECSCQPGFTGSLC 48 (133)
Q Consensus 36 ~C~C~~g~~G~~C 48 (133)
+|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 3677777777665
No 32
>KOG3512|consensus
Probab=97.03 E-value=0.0037 Score=46.91 Aligned_cols=109 Identities=28% Similarity=0.747 Sum_probs=70.5
Q ss_pred CCCCCCC-EEec---CCCeeecCCCCccCCCCccCCCCCCCCC----CCCC------CCCCC-------CCeecCC----
Q psy11501 22 CQCRNGA-ECHP---ATGECSCQPGFTGSLCEERCPPGTHGPS----CINR------CRCQN-------GAICNPA---- 76 (133)
Q Consensus 22 C~C~~~g-~C~~---~~~~C~C~~g~~G~~C~~~C~~g~~g~~----C~~~------~~C~~-------~~~C~~~---- 76 (133)
|.|+.++ .|.- ...+|.|..+.+|+.|. +|.+-|+... -... +.|.. +..+...
T Consensus 278 CKCNgHAs~Cv~d~~~~ltCdC~HNTaGPdCg-rCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~ 356 (592)
T KOG3512|consen 278 CKCNGHASRCVMDESSHLTCDCEHNTAGPDCG-RCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRR 356 (592)
T ss_pred eeecCccceeeeccCCceEEecccCCCCCCcc-cccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCcc
Confidence 5666554 4542 24689999999999998 8988665321 1110 12221 1112111
Q ss_pred -CCee-eCCCCcccCCCCCCCCCCccCCCCCC--------CCCCCC----CCeeccCCCceeCCCCCccC
Q psy11501 77 -NGQC-LCAPGWMGSVCNVPCTPGMWGQGCTV--------PCECFN----GASCHHVTGECQCEPGFKGQ 132 (133)
Q Consensus 77 -~~~C-~C~~g~~g~~c~~~c~~g~~g~~c~~--------~c~C~~----~g~C~~~~g~C~C~~g~~G~ 132 (133)
-++| .|.....|.+|. .|.+||+-+.-.. .|.|+. +-+|+..+|+|.|.+|.+|.
T Consensus 357 SggvClnCrHnTaGrhCh-yCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tGqCpCkeGvtG~ 425 (592)
T KOG3512|consen 357 SGGVCLNCRHNTAGRHCH-YCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTGQCPCKEGVTGL 425 (592)
T ss_pred ccceEeecccCCCCcccc-cccCccccCCCCCCchhhhhhhcCCcccccccccccccCCcccCCCCCccc
Confidence 2455 688888999999 7999998543221 245654 45898889999999999885
No 33
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=97.02 E-value=0.00016 Score=39.77 Aligned_cols=40 Identities=38% Similarity=0.883 Sum_probs=23.8
Q ss_pred ccccCCCCCCCCCCCCC----CCCCEEecCCCeeecCCCCccCCC
Q psy11501 8 VCGPGRFGQNCSQECQC----RNGAECHPATGECSCQPGFTGSLC 48 (133)
Q Consensus 8 ~C~~g~~g~~c~~~C~C----~~~g~C~~~~~~C~C~~g~~G~~C 48 (133)
.|.+.|||..|...|.= ..+-+|+ ..|.-+|.+||+|+.|
T Consensus 20 ~C~~nyyG~~C~~~C~~~~d~~ghy~Cd-~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 20 VCDENYYGPNCSKFCKPRDDSFGHYTCD-SNGNKVCLPGWTGPNC 63 (63)
T ss_dssp ---TTEETTTT-EE---EEETTEEEEE--SS--EEE-TTEESTTS
T ss_pred ECCCCCCCccccCCcCCCcCCcCCcccC-CCCCCCCCCCCcCCCC
Confidence 79999999999987732 2345776 4788999999999876
No 34
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.74 E-value=0.00052 Score=32.42 Aligned_cols=25 Identities=40% Similarity=0.929 Sum_probs=20.8
Q ss_pred CCCCCCeecCC---CCeeeCCCCcccCC
Q psy11501 66 RCQNGAICNPA---NGQCLCAPGWMGSV 90 (133)
Q Consensus 66 ~C~~~~~C~~~---~~~C~C~~g~~g~~ 90 (133)
+|.++++|.+. .++|.|++||.|++
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 78899999754 57999999999864
No 35
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.90 E-value=0.013 Score=28.23 Aligned_cols=27 Identities=44% Similarity=1.159 Sum_probs=20.6
Q ss_pred CCCCCCEEecC--CCeeecCCCCc-cCCCC
Q psy11501 23 QCRNGAECHPA--TGECSCQPGFT-GSLCE 49 (133)
Q Consensus 23 ~C~~~g~C~~~--~~~C~C~~g~~-G~~C~ 49 (133)
+|.++++|.+. .+.|.|+++|. |..|+
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 46677788754 56899999998 87764
No 36
>KOG4260|consensus
Probab=95.86 E-value=0.011 Score=41.60 Aligned_cols=52 Identities=33% Similarity=0.903 Sum_probs=33.9
Q ss_pred cCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCeecC-----CCCeeeCCCCcccCCCCCCCCCCc
Q psy11501 39 CQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAICNP-----ANGQCLCAPGWMGSVCNVPCTPGM 99 (133)
Q Consensus 39 C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C~~-----~~~~C~C~~g~~g~~c~~~c~~g~ 99 (133)
|++|..|++|. .|+-+- .. +|...|.|+. .+++|.|.+||+|+.|. .|.++|
T Consensus 132 Cp~gtyGpdCl-~Cpggs------er-~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-~Cg~ey 188 (350)
T KOG4260|consen 132 CPDGTYGPDCL-QCPGGS------ER-PCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-YCGIEY 188 (350)
T ss_pred cCCCCcCCccc-cCCCCC------cC-CcCCCCcccCCCCCCCCCcccccCCCCCcccc-ccchHH
Confidence 66666677765 555321 11 4666667753 27899999999999987 344433
No 37
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.68 E-value=0.026 Score=27.14 Aligned_cols=27 Identities=41% Similarity=1.058 Sum_probs=20.5
Q ss_pred CCCCCCeecCC--CCeeeCCCCcc-cCCCC
Q psy11501 66 RCQNGAICNPA--NGQCLCAPGWM-GSVCN 92 (133)
Q Consensus 66 ~C~~~~~C~~~--~~~C~C~~g~~-g~~c~ 92 (133)
+|.+++.|.+. .+.|.|+++|. |.+|+
T Consensus 10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 10 PCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 57777788755 67899999998 77663
No 38
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.45 E-value=0.024 Score=26.90 Aligned_cols=27 Identities=48% Similarity=1.184 Sum_probs=20.4
Q ss_pred CCCCCCEEecC--CCeeecCCCCccCCCC
Q psy11501 23 QCRNGAECHPA--TGECSCQPGFTGSLCE 49 (133)
Q Consensus 23 ~C~~~g~C~~~--~~~C~C~~g~~G~~C~ 49 (133)
+|.+++.|.+. .+.|.|+++|.|..|+
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 46667788754 5689999999997764
No 39
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.17 E-value=0.036 Score=25.85 Aligned_cols=26 Identities=42% Similarity=1.141 Sum_probs=20.1
Q ss_pred CCCCCCEEecC--CCeeecCCCCccC-CC
Q psy11501 23 QCRNGAECHPA--TGECSCQPGFTGS-LC 48 (133)
Q Consensus 23 ~C~~~g~C~~~--~~~C~C~~g~~G~-~C 48 (133)
+|.+++.|.+. .+.|.|+.+|.|. .|
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 46667888754 6789999999988 54
No 40
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.17 E-value=0.05 Score=25.76 Aligned_cols=27 Identities=41% Similarity=1.067 Sum_probs=20.0
Q ss_pred CCCCCCeecCC--CCeeeCCCCcccCCCC
Q psy11501 66 RCQNGAICNPA--NGQCLCAPGWMGSVCN 92 (133)
Q Consensus 66 ~C~~~~~C~~~--~~~C~C~~g~~g~~c~ 92 (133)
+|.+++.|.+. .+.|.|+++|.|..|+
T Consensus 10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 10 PCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 56666777654 5789999999887663
No 41
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.93 E-value=0.012 Score=28.54 Aligned_cols=24 Identities=38% Similarity=1.165 Sum_probs=17.4
Q ss_pred CCCCCCeeccCCC--ceeCCCCCccC
Q psy11501 109 ECFNGASCHHVTG--ECQCEPGFKGQ 132 (133)
Q Consensus 109 ~C~~~g~C~~~~g--~C~C~~g~~G~ 132 (133)
.|...++|.+..+ +|+|++||.|+
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC
Confidence 5777889988655 89999999986
No 42
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=94.81 E-value=0.072 Score=24.73 Aligned_cols=26 Identities=38% Similarity=1.138 Sum_probs=19.6
Q ss_pred CCCCCCeecCC--CCeeeCCCCcccC-CC
Q psy11501 66 RCQNGAICNPA--NGQCLCAPGWMGS-VC 91 (133)
Q Consensus 66 ~C~~~~~C~~~--~~~C~C~~g~~g~-~c 91 (133)
+|.+++.|.+. .++|.|+.+|.|. .|
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 56666777654 6789999999887 44
No 43
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.54 E-value=0.061 Score=25.29 Aligned_cols=24 Identities=50% Similarity=1.259 Sum_probs=17.9
Q ss_pred CCCCCEEecC--CCeeecCCCCcc-CCC
Q psy11501 24 CRNGAECHPA--TGECSCQPGFTG-SLC 48 (133)
Q Consensus 24 C~~~g~C~~~--~~~C~C~~g~~G-~~C 48 (133)
|.++ .|.+. .+.|.|++||.| ..|
T Consensus 8 C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 8 CSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 5555 77654 678999999999 655
No 44
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=94.49 E-value=0.0084 Score=32.91 Aligned_cols=21 Identities=29% Similarity=0.741 Sum_probs=13.2
Q ss_pred CCCeeccCCCceeCCCCCccCC
Q psy11501 112 NGASCHHVTGECQCEPGFKGQK 133 (133)
Q Consensus 112 ~~g~C~~~~g~C~C~~g~~G~~ 133 (133)
.+.+|+ ..|.-+|.+||+|++
T Consensus 42 ghy~Cd-~~G~~~C~~Gw~G~~ 62 (63)
T PF01414_consen 42 GHYTCD-SNGNKVCLPGWTGPN 62 (63)
T ss_dssp EEEEE--SS--EEE-TTEESTT
T ss_pred CCcccC-CCCCCCCCCCCcCCC
Confidence 456787 478899999999974
No 45
>KOG1214|consensus
Probab=94.15 E-value=0.066 Score=43.32 Aligned_cols=51 Identities=35% Similarity=0.846 Sum_probs=36.3
Q ss_pred CCeeecCCCCccCCCCccCCCCCCCCCCCCCCCCCCCCeecCC--CCeeeCCCCcccCC
Q psy11501 34 TGECSCQPGFTGSLCEERCPPGTHGPSCINRCRCQNGAICNPA--NGQCLCAPGWMGSV 90 (133)
Q Consensus 34 ~~~C~C~~g~~G~~C~~~C~~g~~g~~C~~~~~C~~~~~C~~~--~~~C~C~~g~~g~~ 90 (133)
+|.|.|.+||.|+-= .|.. +..|... .|....+|.+. ++.|.|.+||.|+-
T Consensus 808 ~y~C~CLPGfsGDG~--~c~d---vDeC~ps-rChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 808 TYSCACLPGFSGDGH--QCTD---VDECSPS-RCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred eEEEeecCCccCCcc--cccc---ccccCcc-ccCCCceEecCCCcceeecccCccCCC
Confidence 679999999998742 1111 1236543 68888888766 78999999999863
No 46
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=92.54 E-value=0.062 Score=26.76 Aligned_cols=22 Identities=32% Similarity=1.097 Sum_probs=18.9
Q ss_pred CCCCCCeeccCCC--ceeCCCCCc
Q psy11501 109 ECFNGASCHHVTG--ECQCEPGFK 130 (133)
Q Consensus 109 ~C~~~g~C~~~~g--~C~C~~g~~ 130 (133)
.|...++|+++.| +|.|++||.
T Consensus 11 ~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 11 NCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSSTTSEEEEETTEEEEEESTTEE
T ss_pred cCCCCCEEEcCCCCEEeeCCCCcE
Confidence 5777899999766 899999998
No 47
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=86.76 E-value=1.4 Score=22.81 Aligned_cols=21 Identities=48% Similarity=1.358 Sum_probs=16.9
Q ss_pred CCCCCCeeccCCCceeCCCCCcc
Q psy11501 109 ECFNGASCHHVTGECQCEPGFKG 131 (133)
Q Consensus 109 ~C~~~g~C~~~~g~C~C~~g~~G 131 (133)
.|..+..| ..++|.|++||+-
T Consensus 27 qC~~~s~C--~~g~C~C~~g~~~ 47 (52)
T PF01683_consen 27 QCIGGSVC--VNGRCQCPPGYVE 47 (52)
T ss_pred CCCCcCEE--cCCEeECCCCCEe
Confidence 67777888 5689999999864
No 48
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=83.74 E-value=0.98 Score=19.71 Aligned_cols=11 Identities=36% Similarity=1.071 Sum_probs=8.6
Q ss_pred CCeeeCCCCcc
Q psy11501 77 NGQCLCAPGWM 87 (133)
Q Consensus 77 ~~~C~C~~g~~ 87 (133)
+++|.|++||.
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 36788888887
No 49
>KOG0196|consensus
Probab=83.56 E-value=2.1 Score=35.10 Aligned_cols=63 Identities=25% Similarity=0.705 Sum_probs=37.2
Q ss_pred CCCCCeecCCCCeeeCCCCcc----cCCCCCCCCCCccCCCC-CCCC-CCCCCCeecc-CCCceeCCCCCc
Q psy11501 67 CQNGAICNPANGQCLCAPGWM----GSVCNVPCTPGMWGQGC-TVPC-ECFNGASCHH-VTGECQCEPGFK 130 (133)
Q Consensus 67 C~~~~~C~~~~~~C~C~~g~~----g~~c~~~c~~g~~g~~c-~~~c-~C~~~g~C~~-~~g~C~C~~g~~ 130 (133)
|...|......+.|.|.+||. +..|+ .|++|++-..- ...| +|..+..-.. ....|.|..||.
T Consensus 248 C~~dGeWlvpiG~C~C~aGye~~~~~~~C~-aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 248 CSGDGEWLVPIGGCVCKAGYEEAENGKACQ-ACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYY 317 (996)
T ss_pred EcCCCcEEEEcCceeecCCCCcccCCCcce-eCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcc
Confidence 444444444478999999996 56788 79999874432 1223 3433322111 122788888875
No 50
>KOG0196|consensus
Probab=82.98 E-value=2.6 Score=34.60 Aligned_cols=63 Identities=30% Similarity=0.789 Sum_probs=37.0
Q ss_pred CCCCCEEecCCCeeecCCCCc----cCCCCccCCCCCCCCCC-CCCC-CCCCCCeec-CCCCeeeCCCCcc
Q psy11501 24 CRNGAECHPATGECSCQPGFT----GSLCEERCPPGTHGPSC-INRC-RCQNGAICN-PANGQCLCAPGWM 87 (133)
Q Consensus 24 C~~~g~C~~~~~~C~C~~g~~----G~~C~~~C~~g~~g~~C-~~~~-~C~~~~~C~-~~~~~C~C~~g~~ 87 (133)
|+..|.-....|.|.|.+||. +..|+ .|+.|+|-..- ...| +|..+..-. .....|.|..||.
T Consensus 248 C~~dGeWlvpiG~C~C~aGye~~~~~~~C~-aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyy 317 (996)
T KOG0196|consen 248 CSGDGEWLVPIGGCVCKAGYEEAENGKACQ-ACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYY 317 (996)
T ss_pred EcCCCcEEEEcCceeecCCCCcccCCCcce-eCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcc
Confidence 333333333368999999994 67887 89999874322 1111 343332221 2255788888876
No 51
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.18 E-value=2.3 Score=26.67 Aligned_cols=17 Identities=35% Similarity=0.882 Sum_probs=14.7
Q ss_pred CCeeecCCCCccCCCCc
Q psy11501 34 TGECSCQPGFTGSLCEE 50 (133)
Q Consensus 34 ~~~C~C~~g~~G~~C~~ 50 (133)
...|.|..||+|..|+.
T Consensus 66 ~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 66 GMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CceeECCCCcccccccc
Confidence 56899999999999985
No 52
>PHA02887 EGF-like protein; Provisional
Probab=79.60 E-value=2.4 Score=26.18 Aligned_cols=26 Identities=38% Similarity=0.940 Sum_probs=18.7
Q ss_pred CCCCCEEec----CCCeeecCCCCccCCCCc
Q psy11501 24 CRNGAECHP----ATGECSCQPGFTGSLCEE 50 (133)
Q Consensus 24 C~~~g~C~~----~~~~C~C~~g~~G~~C~~ 50 (133)
|. +|.|.- ....|.|..||+|..|++
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 44 367752 146899999999999873
No 53
>PHA02887 EGF-like protein; Provisional
Probab=77.09 E-value=2.8 Score=25.85 Aligned_cols=26 Identities=31% Similarity=0.841 Sum_probs=18.6
Q ss_pred CCCCCeecCC----CCeeeCCCCcccCCCCC
Q psy11501 67 CQNGAICNPA----NGQCLCAPGWMGSVCNV 93 (133)
Q Consensus 67 C~~~~~C~~~----~~~C~C~~g~~g~~c~~ 93 (133)
|.+ |.|.-. ...|.|++||.|.+|+.
T Consensus 94 CiH-G~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 94 CIN-GECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred eeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence 543 577422 56899999999998874
No 54
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=74.49 E-value=2.3 Score=20.21 Aligned_cols=17 Identities=29% Similarity=0.849 Sum_probs=11.8
Q ss_pred eecc-CCCceeCCCCCcc
Q psy11501 115 SCHH-VTGECQCEPGFKG 131 (133)
Q Consensus 115 ~C~~-~~g~C~C~~g~~G 131 (133)
.|+. ..++|.||+||.-
T Consensus 11 ~CDpn~~~~C~CPeGyIl 28 (34)
T PF09064_consen 11 DCDPNSPGQCFCPEGYIL 28 (34)
T ss_pred ccCCCCCCceeCCCceEe
Confidence 4443 3458999999974
No 55
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=68.32 E-value=7.6 Score=23.36 Aligned_cols=29 Identities=31% Similarity=0.728 Sum_probs=20.9
Q ss_pred CCCCCCCCCCCCeecCC-CCeeeCCCCccc
Q psy11501 60 SCINRCRCQNGAICNPA-NGQCLCAPGWMG 88 (133)
Q Consensus 60 ~C~~~~~C~~~~~C~~~-~~~C~C~~g~~g 88 (133)
.|+....|...+.|... ...|.|.+||.-
T Consensus 79 ~Cd~y~~CG~~g~C~~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 79 QCDVYGFCGPNGICNSNNSPKCSCLPGFEP 108 (110)
T ss_pred CCCCccccCCccEeCCCCCCceECCCCcCC
Confidence 36555578888888644 457999999864
No 56
>KOG1388|consensus
Probab=66.38 E-value=4 Score=27.96 Aligned_cols=47 Identities=43% Similarity=1.047 Sum_probs=32.3
Q ss_pred CCccccccccCCCCCCCCCCC---CCCCCC-EEecCCCeeec-CCCCccCCCC
Q psy11501 2 GTHCEEVCGPGRFGQNCSQEC---QCRNGA-ECHPATGECSC-QPGFTGSLCE 49 (133)
Q Consensus 2 g~~c~~~C~~g~~g~~c~~~C---~C~~~g-~C~~~~~~C~C-~~g~~G~~C~ 49 (133)
|..|+ .|..+|+|++-...| .|.... -|...+++|.| ..++.|..|+
T Consensus 75 g~~c~-kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~c~~kgvvgd~c~ 126 (217)
T KOG1388|consen 75 GAHCE-KCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCFCTTKGVVGDLCP 126 (217)
T ss_pred cccCC-ceEEEEEecCCCCccCHhhhcCCeeeeeccCCccccccceEecccCc
Confidence 55666 788999997433333 444433 35556889998 6888999987
No 57
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=66.15 E-value=2.6 Score=20.25 Aligned_cols=17 Identities=35% Similarity=0.905 Sum_probs=10.5
Q ss_pred eeccCC--CceeCCCCCcc
Q psy11501 115 SCHHVT--GECQCEPGFKG 131 (133)
Q Consensus 115 ~C~~~~--g~C~C~~g~~G 131 (133)
+|.+.. .+|.|++||+-
T Consensus 11 ~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp EEEEETTSEEEE-STTEEE
T ss_pred CCccCCCceEeECCCCCEE
Confidence 344433 38999999874
No 58
>KOG3607|consensus
Probab=53.33 E-value=11 Score=30.82 Aligned_cols=27 Identities=26% Similarity=0.817 Sum_probs=23.0
Q ss_pred CCCCCCEEecCCCeeecCCCCccCCCCc
Q psy11501 23 QCRNGAECHPATGECSCQPGFTGSLCEE 50 (133)
Q Consensus 23 ~C~~~g~C~~~~~~C~C~~g~~G~~C~~ 50 (133)
.|..+|.|++ ...|+|.++|.+++|+.
T Consensus 631 ~C~g~GVCnn-~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 631 TCNGHGVCNN-ELNCHCEPGWAPPFCFI 657 (716)
T ss_pred ccCCCcccCC-CcceeeCCCCCCCcccc
Confidence 4788888864 77999999999999984
No 59
>KOG3509|consensus
Probab=33.04 E-value=64 Score=27.48 Aligned_cols=23 Identities=35% Similarity=0.990 Sum_probs=19.6
Q ss_pred CCeeecCCCCccCCCCccCCCCCC
Q psy11501 34 TGECSCQPGFTGSLCEERCPPGTH 57 (133)
Q Consensus 34 ~~~C~C~~g~~G~~C~~~C~~g~~ 57 (133)
.-+|.|++++.|..|+ .|.+++.
T Consensus 717 ~~~C~c~~g~~G~~ce-~c~e~~~ 739 (964)
T KOG3509|consen 717 VEQCQCPKGLVGTSCE-DCAEGYT 739 (964)
T ss_pred ccccccCccccCcccc-ccccccc
Confidence 4589999999999998 7888765
No 60
>KOG3607|consensus
Probab=28.13 E-value=46 Score=27.34 Aligned_cols=26 Identities=35% Similarity=1.104 Sum_probs=20.6
Q ss_pred CCCCCeecCCCCeeeCCCCcccCCCCC
Q psy11501 67 CQNGAICNPANGQCLCAPGWMGSVCNV 93 (133)
Q Consensus 67 C~~~~~C~~~~~~C~C~~g~~g~~c~~ 93 (133)
|...|.|+ ....|+|.++|.++.|+.
T Consensus 632 C~g~GVCn-n~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 632 CNGHGVCN-NELNCHCEPGWAPPFCFI 657 (716)
T ss_pred cCCCcccC-CCcceeeCCCCCCCcccc
Confidence 55566775 467899999999999985
No 61
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=25.65 E-value=1e+02 Score=18.72 Aligned_cols=41 Identities=20% Similarity=0.468 Sum_probs=21.9
Q ss_pred CCCCCCEEecC----C---CeeecCCCCccCCCCccCCCCCCCCCCCC
Q psy11501 23 QCRNGAECHPA----T---GECSCQPGFTGSLCEERCPPGTHGPSCIN 63 (133)
Q Consensus 23 ~C~~~g~C~~~----~---~~C~C~~g~~G~~C~~~C~~g~~g~~C~~ 63 (133)
.|+++|.|... . +.|.|.+.+.....+..=...|-|+.|+.
T Consensus 14 ~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqK 61 (103)
T PF12955_consen 14 NCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQK 61 (103)
T ss_pred CCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccc
Confidence 58889999854 1 36778775543332211122344555543
No 62
>KOG3514|consensus
Probab=22.23 E-value=71 Score=27.79 Aligned_cols=28 Identities=36% Similarity=0.847 Sum_probs=21.2
Q ss_pred CCCCCCeecCC--CCeeeCCC-CcccCCCCC
Q psy11501 66 RCQNGAICNPA--NGQCLCAP-GWMGSVCNV 93 (133)
Q Consensus 66 ~C~~~~~C~~~--~~~C~C~~-g~~g~~c~~ 93 (133)
||.+++.|... .+.|.|.. +|.|..|+.
T Consensus 630 PC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 630 PCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred cccCCCCccccccccccccccCcccCccccc
Confidence 78898888765 67787754 688888875
No 63
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=22.03 E-value=45 Score=17.73 Aligned_cols=28 Identities=18% Similarity=0.602 Sum_probs=14.1
Q ss_pred CCCCCCEEe------cCCCeeecCCCCccCCCCc
Q psy11501 23 QCRNGAECH------PATGECSCQPGFTGSLCEE 50 (133)
Q Consensus 23 ~C~~~g~C~------~~~~~C~C~~g~~G~~C~~ 50 (133)
+|+.+|... ++...|.|..-|.|++|+.
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~ 51 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCST 51 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-E
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCccc
Confidence 456665543 2234799999999999974
Done!