Query psy5614
Match_columns 98
No_of_seqs 144 out of 1033
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 22:04:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5614.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5614hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.5 4.6E-14 9.9E-19 115.1 5.4 71 6-97 798-873 (1289)
2 PF12947 EGF_3: EGF domain; I 99.0 2.1E-10 4.5E-15 60.7 1.7 31 57-88 6-36 (36)
3 KOG1214|consensus 98.9 2E-09 4.2E-14 88.6 6.6 88 3-91 702-827 (1289)
4 PF12947 EGF_3: EGF domain; I 98.7 1.1E-08 2.3E-13 54.1 2.0 29 2-30 7-36 (36)
5 KOG1219|consensus 98.5 1.2E-07 2.6E-12 84.2 5.6 79 3-84 3872-3975(4289)
6 PF07645 EGF_CA: Calcium-bindi 98.4 2.7E-07 5.7E-12 49.9 2.8 29 53-81 3-34 (42)
7 PF00008 EGF: EGF-like domain 98.3 3.8E-07 8.2E-12 46.7 2.0 27 57-83 4-31 (32)
8 KOG4289|consensus 98.2 1.1E-06 2.3E-11 76.0 3.9 54 11-84 1218-1272(2531)
9 smart00179 EGF_CA Calcium-bind 98.1 7.8E-06 1.7E-10 42.3 3.8 32 53-85 3-36 (39)
10 KOG4260|consensus 97.7 2.7E-05 5.9E-10 57.8 2.9 78 3-82 152-269 (350)
11 KOG4289|consensus 97.7 5E-05 1.1E-09 66.2 4.4 61 2-79 1246-1308(2531)
12 cd00053 EGF Epidermal growth f 97.6 7.1E-05 1.5E-09 37.4 3.0 27 57-83 6-32 (36)
13 cd00054 EGF_CA Calcium-binding 97.6 8.7E-05 1.9E-09 37.7 3.2 27 57-83 9-35 (38)
14 PF00008 EGF: EGF-like domain 97.5 5.3E-05 1.1E-09 38.6 1.6 25 2-26 5-31 (32)
15 smart00181 EGF Epidermal growt 97.5 0.00018 3.9E-09 36.5 3.2 26 57-83 6-31 (35)
16 KOG1225|consensus 97.4 0.00026 5.7E-09 56.5 5.4 62 14-83 264-338 (525)
17 PF06247 Plasmod_Pvs28: Plasmo 97.4 0.00041 8.9E-09 48.9 5.0 82 3-85 52-164 (197)
18 PF12662 cEGF: Complement Clr- 97.3 0.00022 4.9E-09 34.3 2.1 21 71-92 1-23 (24)
19 PF07645 EGF_CA: Calcium-bindi 97.3 0.00015 3.3E-09 38.9 1.6 24 2-25 11-35 (42)
20 smart00179 EGF_CA Calcium-bind 97.2 0.00051 1.1E-08 35.2 2.9 27 3-30 11-38 (39)
21 PF12946 EGF_MSP1_1: MSP1 EGF 96.9 0.00039 8.5E-09 36.8 1.1 29 2-30 6-36 (37)
22 cd00053 EGF Epidermal growth f 96.9 0.001 2.2E-08 33.0 2.7 25 2-26 7-32 (36)
23 KOG1217|consensus 96.9 0.0034 7.3E-08 47.2 6.5 81 3-83 280-389 (487)
24 KOG1217|consensus 96.9 0.0015 3.2E-08 49.2 4.4 60 7-85 243-306 (487)
25 smart00181 EGF Epidermal growt 96.7 0.0023 4.9E-08 32.3 2.7 24 2-26 7-31 (35)
26 cd00054 EGF_CA Calcium-binding 96.5 0.0029 6.3E-08 31.8 2.6 23 3-25 11-34 (38)
27 PF12946 EGF_MSP1_1: MSP1 EGF 96.5 0.0011 2.3E-08 35.1 0.8 31 57-88 5-36 (37)
28 PF12662 cEGF: Complement Clr- 96.4 0.0023 4.9E-08 30.8 1.5 19 14-32 1-21 (24)
29 PF14670 FXa_inhibition: Coagu 96.2 0.0045 9.8E-08 32.4 2.1 20 63-82 10-29 (36)
30 KOG1225|consensus 96.1 0.015 3.3E-07 46.6 5.8 60 16-84 297-365 (525)
31 PF07974 EGF_2: EGF-like domai 96.1 0.0071 1.5E-07 30.8 2.5 24 58-83 7-30 (32)
32 KOG1219|consensus 96.1 0.0088 1.9E-07 55.0 4.6 36 48-83 3896-3935(4289)
33 PF12661 hEGF: Human growth fa 95.5 0.0092 2E-07 24.6 1.3 10 16-25 1-10 (13)
34 PF06247 Plasmod_Pvs28: Plasmo 95.2 0.011 2.3E-07 41.8 1.5 69 6-91 10-88 (197)
35 KOG4260|consensus 95.0 0.016 3.5E-07 43.3 2.1 57 3-81 246-304 (350)
36 PF07974 EGF_2: EGF-like domai 94.7 0.043 9.2E-07 27.9 2.6 21 3-25 8-29 (32)
37 cd01475 vWA_Matrilin VWA_Matri 93.9 0.064 1.4E-06 37.8 3.1 32 50-83 183-219 (224)
38 PF00954 S_locus_glycop: S-loc 93.8 0.061 1.3E-06 34.1 2.6 29 53-82 78-108 (110)
39 PF14670 FXa_inhibition: Coagu 93.2 0.083 1.8E-06 27.6 2.0 18 8-25 11-29 (36)
40 KOG1226|consensus 93.1 0.26 5.7E-06 41.2 5.7 72 8-85 468-579 (783)
41 PF01683 EB: EB module; Inter 91.8 0.77 1.7E-05 25.1 4.9 25 57-85 26-50 (52)
42 PHA02887 EGF-like protein; Pro 88.4 0.51 1.1E-05 31.1 2.6 23 6-30 96-121 (126)
43 smart00051 DSL delta serrate l 87.2 0.82 1.8E-05 26.6 2.8 22 59-83 40-61 (63)
44 PHA02887 EGF-like protein; Pro 86.9 0.72 1.6E-05 30.4 2.6 26 57-83 92-119 (126)
45 KOG0994|consensus 82.8 1.8 3.8E-05 38.5 3.8 21 65-85 878-899 (1758)
46 PHA03099 epidermal growth fact 78.0 2.3 4.9E-05 28.5 2.4 24 6-31 55-81 (139)
47 PF00053 Laminin_EGF: Laminin 77.5 2.2 4.7E-05 23.0 1.9 26 63-90 11-36 (49)
48 PHA03099 epidermal growth fact 75.1 3.1 6.7E-05 27.9 2.4 30 57-90 51-82 (139)
49 cd00055 EGF_Lam Laminin-type e 74.1 4.4 9.6E-05 22.0 2.6 19 72-90 19-37 (50)
50 KOG0994|consensus 73.9 3.4 7.5E-05 36.8 3.0 19 11-29 881-900 (1758)
51 PF09064 Tme5_EGF_like: Thromb 73.5 3.2 6.8E-05 21.4 1.7 13 72-84 18-30 (34)
52 KOG1226|consensus 69.4 7.4 0.00016 33.0 3.9 21 3-26 557-577 (783)
53 KOG3607|consensus 62.0 9 0.0002 32.2 3.1 45 37-84 603-654 (716)
54 smart00180 EGF_Lam Laminin-typ 60.8 8.5 0.00018 20.6 2.0 19 72-90 18-36 (46)
55 KOG1836|consensus 59.4 9.9 0.00022 35.0 3.1 31 58-88 781-814 (1705)
56 PF04863 EGF_alliinase: Alliin 55.1 8.3 0.00018 22.1 1.3 28 1-30 17-49 (56)
57 KOG3516|consensus 52.7 12 0.00026 33.3 2.4 31 53-83 546-578 (1306)
58 cd00064 FU Furin-like repeats. 42.1 18 0.00039 19.2 1.4 23 14-36 14-37 (49)
59 PF01826 TIL: Trypsin Inhibito 34.1 13 0.00028 20.3 -0.1 19 73-91 34-52 (55)
60 KOG0196|consensus 31.9 70 0.0015 27.9 3.8 58 15-82 259-318 (996)
61 PF01414 DSL: Delta serrate li 31.0 22 0.00049 20.5 0.6 8 18-25 53-60 (63)
62 PF09402 MSC: Man1-Src1p-C-ter 22.7 28 0.00061 25.8 0.0 24 57-83 53-76 (334)
63 PTZ00214 high cysteine membran 21.3 3.2E+02 0.0069 23.6 5.9 23 69-92 748-770 (800)
No 1
>KOG1214|consensus
Probab=99.47 E-value=4.6e-14 Score=115.13 Aligned_cols=71 Identities=34% Similarity=0.762 Sum_probs=59.7
Q ss_pred CcEEe--cCCCCeEEcCCCcEeCCCCCcc--cCcCCCCCCCCceeeCCeecCCCCCCCCCCCeeEecCCccEEeCCCCCe
Q psy5614 6 WIQCS--MQSNPSCSCLPNYIGAPPNCRP--ECVMNSECASNLACINEKCRDPCPGSCGYNAQCKVINHTPICTCPEGYI 81 (98)
Q Consensus 6 ~a~C~--~~g~~~C~C~~Gy~Gdg~~C~~--eC~~~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~ 81 (98)
+++|+ ..+.|+|.|+|||.|||..|.+ ||.. +.|+++|.|+|+.++|.|+|++||.
T Consensus 798 ~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~p--------------------srChp~A~CyntpgsfsC~C~pGy~ 857 (1289)
T KOG1214|consen 798 QARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSP--------------------SRCHPAATCYNTPGSFSCRCQPGYY 857 (1289)
T ss_pred ceEEEecCCceEEEeecCCccCCccccccccccCc--------------------cccCCCceEecCCCcceeecccCcc
Confidence 34566 6789999999999999998863 4542 5799999999999999999999999
Q ss_pred eCCCCCCccC-CCCCCC
Q psy5614 82 GDPFSSCFPK-PPERKC 97 (98)
Q Consensus 82 G~g~~~C~~~-~~~~~c 97 (98)
|||+ +|+|. .+-.+|
T Consensus 858 GDGf-~CVP~~~~~T~C 873 (1289)
T KOG1214|consen 858 GDGF-QCVPDTSSLTPC 873 (1289)
T ss_pred CCCc-eecCCCccCCcc
Confidence 9999 89997 555555
No 2
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.98 E-value=2.1e-10 Score=60.67 Aligned_cols=31 Identities=45% Similarity=0.938 Sum_probs=25.9
Q ss_pred CCCCCCCeeEecCCccEEeCCCCCeeCCCCCC
Q psy5614 57 GSCGYNAQCKVINHTPICTCPEGYIGDPFSSC 88 (98)
Q Consensus 57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~~~C 88 (98)
+.|+++|+|+++.++|+|+|++||.|||+ .|
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~-~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF-FC 36 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCST-CE
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCc-CC
Confidence 57999999999999999999999999998 44
No 3
>KOG1214|consensus
Probab=98.94 E-value=2e-09 Score=88.62 Aligned_cols=88 Identities=30% Similarity=0.696 Sum_probs=68.0
Q ss_pred ccCCcEEe--cCCCCeEEcCCCcEeCCCCCcc--cCcC-CCCCCCCceeeC-------------------Ceec------
Q psy5614 3 WLMWIQCS--MQSNPSCSCLPNYIGAPPNCRP--ECVM-NSECASNLACIN-------------------EKCR------ 52 (98)
Q Consensus 3 ~~~~a~C~--~~g~~~C~C~~Gy~Gdg~~C~~--eC~~-~~~C~~~~~C~~-------------------~~C~------ 52 (98)
|..++.|. ..-.|+|.|..||.|+|+.|.+ ||.. ...|.++.+|++ .+|+
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pa 781 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPA 781 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCC
Confidence 67889999 6678999999999999999985 6664 356777777742 4666
Q ss_pred --CCCC---CCCCCCC--eeEecCC-ccEEeCCCCCeeCCCCCCccC
Q psy5614 53 --DPCP---GSCGYNA--QCKVINH-TPICTCPEGYIGDPFSSCFPK 91 (98)
Q Consensus 53 --d~C~---~~C~~~a--~C~~~~g-~~~C~C~~Gy~G~g~~~C~~~ 91 (98)
++|. +.|+.++ .|+..++ .|+|+|.|||.|||+ .|...
T Consensus 782 p~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~-~c~dv 827 (1289)
T KOG1214|consen 782 PANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH-QCTDV 827 (1289)
T ss_pred CCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc-ccccc
Confidence 5673 6888755 5666555 899999999999998 56554
No 4
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.68 E-value=1.1e-08 Score=54.06 Aligned_cols=29 Identities=24% Similarity=0.536 Sum_probs=24.0
Q ss_pred CccCCcEEe-cCCCCeEEcCCCcEeCCCCC
Q psy5614 2 SWLMWIQCS-MQSNPSCSCLPNYIGAPPNC 30 (98)
Q Consensus 2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C 30 (98)
.|+++|+|+ +.++|+|+|++||.|||+.|
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCCcCC
Confidence 489999999 88899999999999999875
No 5
>KOG1219|consensus
Probab=98.54 E-value=1.2e-07 Score=84.19 Aligned_cols=79 Identities=25% Similarity=0.665 Sum_probs=65.1
Q ss_pred ccCCcEEe--cCCCCeEEcCCCcEeCCCCCcc---cCcCCCCCCCCcee---------------eCCeec----CCCC-C
Q psy5614 3 WLMWIQCS--MQSNPSCSCLPNYIGAPPNCRP---ECVMNSECASNLAC---------------INEKCR----DPCP-G 57 (98)
Q Consensus 3 ~~~~a~C~--~~g~~~C~C~~Gy~Gdg~~C~~---eC~~~~~C~~~~~C---------------~~~~C~----d~C~-~ 57 (98)
|...++|. ..++|.|.|++-|+| .+|+. .|.. .+|..+++| ++.+|+ ++|+ .
T Consensus 3872 CqhgG~C~~~~~ggy~CkCpsqysG--~~CEi~~epC~s-nPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n 3948 (4289)
T KOG1219|consen 3872 CQHGGTCISQPKGGYKCKCPSQYSG--NHCEIDLEPCAS-NPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKN 3948 (4289)
T ss_pred ccCCCEecCCCCCceEEeCcccccC--cccccccccccC-CCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccc
Confidence 67889999 778999999999999 78873 3543 356655555 256888 7897 8
Q ss_pred CCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614 58 SCGYNAQCKVINHTPICTCPEGYIGDP 84 (98)
Q Consensus 58 ~C~~~a~C~~~~g~~~C~C~~Gy~G~g 84 (98)
+|..++.|+|..|+|.|.|-+||.|.+
T Consensus 3949 ~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3949 VCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred cccCCceeeccCCceEeccChhHhccc
Confidence 999999999999999999999999865
No 6
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.40 E-value=2.7e-07 Score=49.91 Aligned_cols=29 Identities=34% Similarity=0.843 Sum_probs=26.0
Q ss_pred CCCC---CCCCCCCeeEecCCccEEeCCCCCe
Q psy5614 53 DPCP---GSCGYNAQCKVINHTPICTCPEGYI 81 (98)
Q Consensus 53 d~C~---~~C~~~a~C~~~~g~~~C~C~~Gy~ 81 (98)
|||. +.|..++.|+|+.|+|+|.|++||.
T Consensus 3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp STTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 4563 6899999999999999999999998
No 7
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.31 E-value=3.8e-07 Score=46.74 Aligned_cols=27 Identities=30% Similarity=0.789 Sum_probs=25.2
Q ss_pred CCCCCCCeeEecC-CccEEeCCCCCeeC
Q psy5614 57 GSCGYNAQCKVIN-HTPICTCPEGYIGD 83 (98)
Q Consensus 57 ~~C~~~a~C~~~~-g~~~C~C~~Gy~G~ 83 (98)
.+|.++|+|+... +.|+|+|++||+|+
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5899999999998 89999999999996
No 8
>KOG4289|consensus
Probab=98.22 E-value=1.1e-06 Score=76.02 Aligned_cols=54 Identities=31% Similarity=0.843 Sum_probs=44.5
Q ss_pred cCCCCeEEcCCCcEeCCCCCcccCcCCCCCCCCceeeCCeecCCC-CCCCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614 11 MQSNPSCSCLPNYIGAPPNCRPECVMNSECASNLACINEKCRDPC-PGSCGYNAQCKVINHTPICTCPEGYIGDP 84 (98)
Q Consensus 11 ~~g~~~C~C~~Gy~Gdg~~C~~eC~~~~~C~~~~~C~~~~C~d~C-~~~C~~~a~C~~~~g~~~C~C~~Gy~G~g 84 (98)
..++++|.|++||+|| .|+.| . |+| +++|.+|+.|....|.|+|.|++||+|..
T Consensus 1218 pvnglrCrCPpGFTgd--~CeTe----i--------------DlCYs~pC~nng~C~srEggYtCeCrpg~tGeh 1272 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGD--YCETE----I--------------DLCYSGPCGNNGRCRSREGGYTCECRPGFTGEH 1272 (2531)
T ss_pred ccCceeEeCCCCCCcc--cccch----h--------------HhhhcCCCCCCCceEEecCceeEEecCCccccc
Confidence 4578999999999995 66532 0 344 37999999999999999999999999965
No 9
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.06 E-value=7.8e-06 Score=42.31 Aligned_cols=32 Identities=31% Similarity=0.728 Sum_probs=26.5
Q ss_pred CCCC--CCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614 53 DPCP--GSCGYNAQCKVINHTPICTCPEGYIGDPF 85 (98)
Q Consensus 53 d~C~--~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~ 85 (98)
|+|. .+|..++.|+++.++|+|.|++||+ +|.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~-~g~ 36 (39)
T smart00179 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYT-DGR 36 (39)
T ss_pred ccCcCCCCcCCCCEeECCCCCeEeECCCCCc-cCC
Confidence 4553 5799999999999999999999999 443
No 10
>KOG4260|consensus
Probab=97.71 E-value=2.7e-05 Score=57.81 Aligned_cols=78 Identities=27% Similarity=0.606 Sum_probs=54.3
Q ss_pred ccCCcEEe----cCCCCeEEcCCCcEeCCCCCcc-----------c-------CcCC--CCCCCC--c---eee------
Q psy5614 3 WLMWIQCS----MQSNPSCSCLPNYIGAPPNCRP-----------E-------CVMN--SECASN--L---ACI------ 47 (98)
Q Consensus 3 ~~~~a~C~----~~g~~~C~C~~Gy~Gdg~~C~~-----------e-------C~~~--~~C~~~--~---~C~------ 47 (98)
|.-+++|. ..|+..|.|.+||+| ..|.+ + |... ..|... . .|.
T Consensus 152 C~GnG~C~GdGsR~GsGkCkC~~GY~G--p~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~ld 229 (350)
T KOG4260|consen 152 CFGNGSCHGDGSREGSGKCKCETGYTG--PLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLD 229 (350)
T ss_pred cCCCCcccCCCCCCCCCcccccCCCCC--ccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceec
Confidence 56778887 567889999999999 45541 0 1110 122211 1 121
Q ss_pred CCeec--CCCC---CCCCCCCeeEecCCccEEeCCCCCee
Q psy5614 48 NEKCR--DPCP---GSCGYNAQCKVINHTPICTCPEGYIG 82 (98)
Q Consensus 48 ~~~C~--d~C~---~~C~~~a~C~~~~g~~~C~C~~Gy~G 82 (98)
-..|+ |+|. .+|.+...|+|+.|||.|.+++||.+
T Consensus 230 e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 230 EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 13566 7883 79999999999999999999999988
No 11
>KOG4289|consensus
Probab=97.68 E-value=5e-05 Score=66.21 Aligned_cols=61 Identities=25% Similarity=0.630 Sum_probs=47.4
Q ss_pred CccCCcEEe-cCCCCeEEcCCCcEeCCCCCcccCcCCCCCCCCceeeCCeecCCCCCCCCCCCeeEec-CCccEEeCCCC
Q psy5614 2 SWLMWIQCS-MQSNPSCSCLPNYIGAPPNCRPECVMNSECASNLACINEKCRDPCPGSCGYNAQCKVI-NHTPICTCPEG 79 (98)
Q Consensus 2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C~~eC~~~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~-~g~~~C~C~~G 79 (98)
-|.+|++|. ..|+|+|+|++||+| .+|+-. .....|.+ +.|.+.++|++. +|.|.|.|+.|
T Consensus 1246 pC~nng~C~srEggYtCeCrpg~tG--ehCEvs-~~agrCvp--------------GvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1246 PCGNNGRCRSREGGYTCECRPGFTG--EHCEVS-ARAGRCVP--------------GVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred CCCCCCceEEecCceeEEecCCccc--cceeee-cccCcccc--------------ceecCCCEEeecCCCceeccCCCc
Confidence 378999999 999999999999999 777521 01122222 578999999995 67999999998
No 12
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.63 E-value=7.1e-05 Score=37.43 Aligned_cols=27 Identities=33% Similarity=0.840 Sum_probs=25.0
Q ss_pred CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 57 GSCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
.+|.+++.|++..+.|+|.|++||.|+
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCccc
Confidence 578889999999999999999999997
No 13
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.61 E-value=8.7e-05 Score=37.68 Aligned_cols=27 Identities=30% Similarity=0.764 Sum_probs=24.4
Q ss_pred CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 57 GSCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
.+|.+++.|++..+.|+|.|++||.|.
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 478888999999999999999999883
No 14
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.53 E-value=5.3e-05 Score=38.63 Aligned_cols=25 Identities=24% Similarity=0.546 Sum_probs=22.4
Q ss_pred CccCCcEEe-cC-CCCeEEcCCCcEeC
Q psy5614 2 SWLMWIQCS-MQ-SNPSCSCLPNYIGA 26 (98)
Q Consensus 2 ~~~~~a~C~-~~-g~~~C~C~~Gy~Gd 26 (98)
.|..+|+|+ .. +.|+|+|++||+|.
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 488899999 66 99999999999994
No 15
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.47 E-value=0.00018 Score=36.50 Aligned_cols=26 Identities=38% Similarity=0.982 Sum_probs=23.4
Q ss_pred CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 57 GSCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
.+|..+ .|++..++|+|.|++||.|+
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccC
Confidence 378888 99999999999999999996
No 16
>KOG1225|consensus
Probab=97.45 E-value=0.00026 Score=56.53 Aligned_cols=62 Identities=31% Similarity=0.962 Sum_probs=37.8
Q ss_pred CCeEEcCCCcEeCCCCCcc-cCcCCCCCCCCceeeCCeec------------CCCCCCCCCCCeeEecCCccEEeCCCCC
Q psy5614 14 NPSCSCLPNYIGAPPNCRP-ECVMNSECASNLACINEKCR------------DPCPGSCGYNAQCKVINHTPICTCPEGY 80 (98)
Q Consensus 14 ~~~C~C~~Gy~Gdg~~C~~-eC~~~~~C~~~~~C~~~~C~------------d~C~~~C~~~a~C~~~~g~~~C~C~~Gy 80 (98)
..+|.|++||+| ..|.. .|... |+.+..+++..|+ ..|+..|..++.|+ .-+|.|.+||
T Consensus 264 ~G~CIC~~Gf~G--~dC~e~~Cp~~--cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~Ci----~G~C~C~~Gy 335 (525)
T KOG1225|consen 264 EGRCICPPGFTG--DDCDELVCPVD--CSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKCI----DGECLCDEGY 335 (525)
T ss_pred CCeEeCCCCCcC--CCCCcccCCcc--cCCCceecCCEeecCCCccccccccccCCccCCCCCccc----CCceEeCCCC
Confidence 446889999988 56653 34322 5555555555554 22446677777777 2257777777
Q ss_pred eeC
Q psy5614 81 IGD 83 (98)
Q Consensus 81 ~G~ 83 (98)
+|+
T Consensus 336 ~G~ 338 (525)
T KOG1225|consen 336 TGE 338 (525)
T ss_pred cCC
Confidence 775
No 17
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.37 E-value=0.00041 Score=48.86 Aligned_cols=82 Identities=30% Similarity=0.703 Sum_probs=51.0
Q ss_pred ccCCcEEe-cC-----CCCeEEcCCCcEeCCCCCcc-cCcCCCCCCCC-----------cee---------eCCeec---
Q psy5614 3 WLMWIQCS-MQ-----SNPSCSCLPNYIGAPPNCRP-ECVMNSECASN-----------LAC---------INEKCR--- 52 (98)
Q Consensus 3 ~~~~a~C~-~~-----g~~~C~C~~Gy~Gdg~~C~~-eC~~~~~C~~~-----------~~C---------~~~~C~--- 52 (98)
|...|+|+ .. ..|.|.|.+||+-....|.+ +|.. ..|... ..| .+.+|+
T Consensus 52 Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~-~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G 130 (197)
T PF06247_consen 52 CGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNN-KDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTG 130 (197)
T ss_dssp EETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS----TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE
T ss_pred ccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCc-eecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCC
Confidence 67889998 33 68999999999876567774 5643 345421 123 135666
Q ss_pred -CCCCCCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614 53 -DPCPGSCGYNAQCKVINHTPICTCPEGYIGDPF 85 (98)
Q Consensus 53 -d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~ 85 (98)
.+|+--|..+..|..+.+-|+|.|++||.+++-
T Consensus 131 ~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 131 ETKCSLKCKENEECKLVDGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp --------TTTEEEEEETTEEEEEE-TT-EEETT
T ss_pred ccceeeecCCCcceeeeCcEEEeecCCCCCCCCC
Confidence 567667899999999999999999999987764
No 18
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.29 E-value=0.00022 Score=34.29 Aligned_cols=21 Identities=38% Similarity=0.738 Sum_probs=17.7
Q ss_pred ccEEeCCCCCe--eCCCCCCccCC
Q psy5614 71 TPICTCPEGYI--GDPFSSCFPKP 92 (98)
Q Consensus 71 ~~~C~C~~Gy~--G~g~~~C~~~~ 92 (98)
+|+|+|++||+ .|+. .|+.++
T Consensus 1 sy~C~C~~Gy~l~~d~~-~C~DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGR-SCEDID 23 (24)
T ss_pred CEEeeCCCCCcCCCCCC-ccccCC
Confidence 69999999998 6776 798875
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.28 E-value=0.00015 Score=38.94 Aligned_cols=24 Identities=29% Similarity=0.433 Sum_probs=21.7
Q ss_pred CccCCcEEe-cCCCCeEEcCCCcEe
Q psy5614 2 SWLMWIQCS-MQSNPSCSCLPNYIG 25 (98)
Q Consensus 2 ~~~~~a~C~-~~g~~~C~C~~Gy~G 25 (98)
.|..++.|+ +.|+|+|.|++||..
T Consensus 11 ~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 11 NCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp SSSTTSEEEEETTEEEEEESTTEEE
T ss_pred cCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 377899999 999999999999984
No 20
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.17 E-value=0.00051 Score=35.25 Aligned_cols=27 Identities=26% Similarity=0.515 Sum_probs=22.6
Q ss_pred ccCCcEEe-cCCCCeEEcCCCcEeCCCCC
Q psy5614 3 WLMWIQCS-MQSNPSCSCLPNYIGAPPNC 30 (98)
Q Consensus 3 ~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C 30 (98)
|..++.|+ ..++|.|.|++||+ +|..|
T Consensus 11 C~~~~~C~~~~g~~~C~C~~g~~-~g~~C 38 (39)
T smart00179 11 CQNGGTCVNTVGSYRCECPPGYT-DGRNC 38 (39)
T ss_pred cCCCCEeECCCCCeEeECCCCCc-cCCcC
Confidence 67788999 99999999999999 33555
No 21
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=96.95 E-value=0.00039 Score=36.76 Aligned_cols=29 Identities=21% Similarity=0.426 Sum_probs=21.6
Q ss_pred CccCCcEEe--cCCCCeEEcCCCcEeCCCCC
Q psy5614 2 SWLMWIQCS--MQSNPSCSCLPNYIGAPPNC 30 (98)
Q Consensus 2 ~~~~~a~C~--~~g~~~C~C~~Gy~Gdg~~C 30 (98)
.|..||.|+ .+|++.|.|+.||..++..|
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred cCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 478999999 56999999999998876554
No 22
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.94 E-value=0.001 Score=33.01 Aligned_cols=25 Identities=24% Similarity=0.474 Sum_probs=21.9
Q ss_pred CccCCcEEe-cCCCCeEEcCCCcEeC
Q psy5614 2 SWLMWIQCS-MQSNPSCSCLPNYIGA 26 (98)
Q Consensus 2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gd 26 (98)
.|..++.|+ ..+.|+|.|+.||.|+
T Consensus 7 ~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 7 PCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCEEecCCCCeEeECCCCCccc
Confidence 366789999 8889999999999985
No 23
>KOG1217|consensus
Probab=96.93 E-value=0.0034 Score=47.24 Aligned_cols=81 Identities=26% Similarity=0.684 Sum_probs=57.3
Q ss_pred ccCCcEEe-cCCCCeEEcCCCcEeCCC-CCc--ccCc---CCCCCCCCcee-----------------eCCeec---CCC
Q psy5614 3 WLMWIQCS-MQSNPSCSCLPNYIGAPP-NCR--PECV---MNSECASNLAC-----------------INEKCR---DPC 55 (98)
Q Consensus 3 ~~~~a~C~-~~g~~~C~C~~Gy~Gdg~-~C~--~eC~---~~~~C~~~~~C-----------------~~~~C~---d~C 55 (98)
|..+++|+ ..+.|.|.|++||.|..- .+. .+|. ....|....+| .+..|+ ++|
T Consensus 280 c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C 359 (487)
T KOG1217|consen 280 CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDEC 359 (487)
T ss_pred cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccc
Confidence 67788999 777799999999999553 232 2443 12224443333 235666 268
Q ss_pred C-CCCCCCCeeEe-cCCccEEeCCCCCeeC
Q psy5614 56 P-GSCGYNAQCKV-INHTPICTCPEGYIGD 83 (98)
Q Consensus 56 ~-~~C~~~a~C~~-~~g~~~C~C~~Gy~G~ 83 (98)
. .++..++.|++ ..++|+|.|+.+|.+.
T Consensus 360 ~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~ 389 (487)
T KOG1217|consen 360 ASSPCCPGGTCVNETPGSYRCACPAGFAGK 389 (487)
T ss_pred cCCccccCCEeccCCCCCeEecCCCccccC
Confidence 5 56889999999 7889999999999873
No 24
>KOG1217|consensus
Probab=96.91 E-value=0.0015 Score=49.20 Aligned_cols=60 Identities=28% Similarity=0.658 Sum_probs=41.8
Q ss_pred cEEe-cCCCCeEEcCCCcEeCC-CCCcccCcCCCCCCCCceeeCCeecCCCC-C-CCCCCCeeEecCCccEEeCCCCCee
Q psy5614 7 IQCS-MQSNPSCSCLPNYIGAP-PNCRPECVMNSECASNLACINEKCRDPCP-G-SCGYNAQCKVINHTPICTCPEGYIG 82 (98)
Q Consensus 7 a~C~-~~g~~~C~C~~Gy~Gdg-~~C~~eC~~~~~C~~~~~C~~~~C~d~C~-~-~C~~~a~C~~~~g~~~C~C~~Gy~G 82 (98)
++|+ ..++|+|.|++||+++. ..+.+ . ++|. . .|.++++|++..+.|+|.|++||+|
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~----~---------------~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g 303 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDACVTCVD----V---------------DSCALIASCPNGGTCVNVPGSYRCTCPPGFTG 303 (487)
T ss_pred CcccccCCceeeeCCCCccccccceeee----c---------------cccCCCCccCCCCeeecCCCcceeeCCCCCCC
Confidence 5566 66667777777777743 12211 0 3442 2 3888999999999999999999999
Q ss_pred CCC
Q psy5614 83 DPF 85 (98)
Q Consensus 83 ~g~ 85 (98)
...
T Consensus 304 ~~~ 306 (487)
T KOG1217|consen 304 RLC 306 (487)
T ss_pred CCC
Confidence 764
No 25
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.66 E-value=0.0023 Score=32.27 Aligned_cols=24 Identities=29% Similarity=0.613 Sum_probs=20.9
Q ss_pred CccCCcEEe-cCCCCeEEcCCCcEeC
Q psy5614 2 SWLMWIQCS-MQSNPSCSCLPNYIGA 26 (98)
Q Consensus 2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gd 26 (98)
.|..+ +|+ ..++|+|.|++||.|+
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccC
Confidence 36677 899 8899999999999985
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.53 E-value=0.0029 Score=31.78 Aligned_cols=23 Identities=30% Similarity=0.548 Sum_probs=20.7
Q ss_pred ccCCcEEe-cCCCCeEEcCCCcEe
Q psy5614 3 WLMWIQCS-MQSNPSCSCLPNYIG 25 (98)
Q Consensus 3 ~~~~a~C~-~~g~~~C~C~~Gy~G 25 (98)
|..++.|+ ..+.|+|.|++||+|
T Consensus 11 C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 11 CQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred cCCCCEeECCCCCeEeECCCCCcC
Confidence 66788999 888999999999998
No 27
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=96.51 E-value=0.0011 Score=35.10 Aligned_cols=31 Identities=29% Similarity=0.509 Sum_probs=23.1
Q ss_pred CCCCCCCeeEecC-CccEEeCCCCCeeCCCCCC
Q psy5614 57 GSCGYNAQCKVIN-HTPICTCPEGYIGDPFSSC 88 (98)
Q Consensus 57 ~~C~~~a~C~~~~-g~~~C~C~~Gy~G~g~~~C 88 (98)
..|..||.|++.. |++.|.|..||..++. .|
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk~~~~-~C 36 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYKKVGG-KC 36 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEEEETT-EE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCccccCC-Cc
Confidence 4789999999976 8999999999998875 45
No 28
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.40 E-value=0.0023 Score=30.76 Aligned_cols=19 Identities=32% Similarity=0.768 Sum_probs=15.1
Q ss_pred CCeEEcCCCcE--eCCCCCcc
Q psy5614 14 NPSCSCLPNYI--GAPPNCRP 32 (98)
Q Consensus 14 ~~~C~C~~Gy~--Gdg~~C~~ 32 (98)
+|+|.|++||. .|+..|.+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCCCCCcccc
Confidence 58999999997 56667753
No 29
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.19 E-value=0.0045 Score=32.42 Aligned_cols=20 Identities=30% Similarity=0.625 Sum_probs=17.0
Q ss_pred CeeEecCCccEEeCCCCCee
Q psy5614 63 AQCKVINHTPICTCPEGYIG 82 (98)
Q Consensus 63 a~C~~~~g~~~C~C~~Gy~G 82 (98)
-.|++++++|+|.|++||+-
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE
T ss_pred CCCccCCCceEeECCCCCEE
Confidence 37999999999999999974
No 30
>KOG1225|consensus
Probab=96.13 E-value=0.015 Score=46.62 Aligned_cols=60 Identities=32% Similarity=0.895 Sum_probs=42.8
Q ss_pred eEEcCCCcEeCCCCCcc-cCcCCCCCCCCceeeCCeec-------CCCC-CCCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614 16 SCSCLPNYIGAPPNCRP-ECVMNSECASNLACINEKCR-------DPCP-GSCGYNAQCKVINHTPICTCPEGYIGDP 84 (98)
Q Consensus 16 ~C~C~~Gy~Gdg~~C~~-eC~~~~~C~~~~~C~~~~C~-------d~C~-~~C~~~a~C~~~~g~~~C~C~~Gy~G~g 84 (98)
.|.|++||.| ..|+. +|. ++|..++.|+..+|. ..|. ..|..+..|++. |+|+.||.|..
T Consensus 297 ~CiC~~g~~G--~dCs~~~cp--adC~g~G~Ci~G~C~C~~Gy~G~~C~~~~C~~~g~cv~g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 297 ECICNPGYSG--KDCSIRRCP--ADCSGHGKCIDGECLCDEGYTGELCIQRACSGGGQCVNG-----CKCKKGWRGPD 365 (525)
T ss_pred EeecCCCccc--cccccccCC--ccCCCCCcccCCceEeCCCCcCCcccccccCCCceeccC-----ceeccCccCCC
Confidence 7999999999 66652 343 567777888877776 3353 236666666532 99999999976
No 31
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.09 E-value=0.0071 Score=30.83 Aligned_cols=24 Identities=29% Similarity=0.718 Sum_probs=20.5
Q ss_pred CCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 58 SCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 58 ~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
.|..+++|+.. ..+|+|.+||+|+
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCC
Confidence 58889999876 4689999999996
No 32
>KOG1219|consensus
Probab=96.08 E-value=0.0088 Score=55.02 Aligned_cols=36 Identities=31% Similarity=0.944 Sum_probs=32.2
Q ss_pred CCeec---CCCC-CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 48 NEKCR---DPCP-GSCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 48 ~~~C~---d~C~-~~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
+..|+ .+|. +||..+++|+...+.|.|.|+.||+|+
T Consensus 3896 G~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3896 GNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred CcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence 45777 6785 899999999999999999999999995
No 33
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.53 E-value=0.0092 Score=24.57 Aligned_cols=10 Identities=40% Similarity=1.348 Sum_probs=8.2
Q ss_pred eEEcCCCcEe
Q psy5614 16 SCSCLPNYIG 25 (98)
Q Consensus 16 ~C~C~~Gy~G 25 (98)
+|+|++||+|
T Consensus 1 ~C~C~~G~~G 10 (13)
T PF12661_consen 1 TCQCPPGWTG 10 (13)
T ss_dssp EEEE-TTEET
T ss_pred CccCcCCCcC
Confidence 5899999999
No 34
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.17 E-value=0.011 Score=41.80 Aligned_cols=69 Identities=25% Similarity=0.507 Sum_probs=44.6
Q ss_pred CcEEe-cCCCCeEEcCCCcEe-CCCCCcc--cCcC-CCCCCCCceeeCCeecCCCCCCCCCCCeeEecC-----CccEEe
Q psy5614 6 WIQCS-MQSNPSCSCLPNYIG-APPNCRP--ECVM-NSECASNLACINEKCRDPCPGSCGYNAQCKVIN-----HTPICT 75 (98)
Q Consensus 6 ~a~C~-~~g~~~C~C~~Gy~G-dg~~C~~--eC~~-~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~~-----g~~~C~ 75 (98)
|+.-+ +.+.|.|.|.+||.= +-..|++ +|.. .. . ..+|++.|.|++.. ..|+|.
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~-----~-----------~K~Cgdya~C~~~~~~~~~~~~~C~ 73 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLEN-----V-----------NKPCGDYAKCINQANKGEERAYKCD 73 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GG-----T-----------TSEEETTEEEEE-SSTTSSTSEEEE
T ss_pred CCEEEEccCceEEEcCCCcEEccccccccceecCcccc-----c-----------CccccchhhhhcCCCcccceeEEEe
Confidence 45556 778999999999964 1123432 2322 00 0 15899999999965 489999
Q ss_pred CCCCCeeCCCCCCccC
Q psy5614 76 CPEGYIGDPFSSCFPK 91 (98)
Q Consensus 76 C~~Gy~G~g~~~C~~~ 91 (98)
|.+||+-..- .|+|.
T Consensus 74 C~~gY~~~~~-vCvp~ 88 (197)
T PF06247_consen 74 CINGYILKQG-VCVPN 88 (197)
T ss_dssp E-TTEEESSS-SEEEG
T ss_pred cccCceeeCC-eEchh
Confidence 9999987654 57653
No 35
>KOG4260|consensus
Probab=95.01 E-value=0.016 Score=43.32 Aligned_cols=57 Identities=23% Similarity=0.420 Sum_probs=42.7
Q ss_pred ccCCcEEe-cCCCCeEEcCCCcEeCCCCCcccCcCCCCCCCCceeeCCeecCCCCCCC-CCCCeeEecCCccEEeCCCCC
Q psy5614 3 WLMWIQCS-MQSNPSCSCLPNYIGAPPNCRPECVMNSECASNLACINEKCRDPCPGSC-GYNAQCKVINHTPICTCPEGY 80 (98)
Q Consensus 3 ~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C~~eC~~~~~C~~~~~C~~~~C~d~C~~~C-~~~a~C~~~~g~~~C~C~~Gy 80 (98)
|.++..|+ +.|+|+|..++||.++ .++|+.- ...| ..+..|.++.++|+|+|..|+
T Consensus 246 c~~~qfCvNteGSf~C~dk~Gy~~g----~d~C~~~------------------~d~~~~kn~~c~ni~~~~r~v~f~~~ 303 (350)
T KOG4260|consen 246 CKAHQFCVNTEGSFKCEDKEGYKKG----VDECQFC------------------ADVCASKNRPCMNIDGQYRCVCFSGL 303 (350)
T ss_pred CChhheeecCCCceEecccccccCC----hHHhhhh------------------hhhcccCCCCcccCCccEEEEecccc
Confidence 56778899 9999999999999883 3344320 0122 356788899999999999987
Q ss_pred e
Q psy5614 81 I 81 (98)
Q Consensus 81 ~ 81 (98)
.
T Consensus 304 ~ 304 (350)
T KOG4260|consen 304 I 304 (350)
T ss_pred e
Confidence 4
No 36
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.67 E-value=0.043 Score=27.91 Aligned_cols=21 Identities=24% Similarity=0.401 Sum_probs=18.3
Q ss_pred ccCCcEEe-cCCCCeEEcCCCcEe
Q psy5614 3 WLMWIQCS-MQSNPSCSCLPNYIG 25 (98)
Q Consensus 3 ~~~~a~C~-~~g~~~C~C~~Gy~G 25 (98)
|..+++|+ . ..+|+|.+||+|
T Consensus 8 C~~~G~C~~~--~g~C~C~~g~~G 29 (32)
T PF07974_consen 8 CSGHGTCVSP--CGRCVCDSGYTG 29 (32)
T ss_pred cCCCCEEeCC--CCEEECCCCCcC
Confidence 77888897 5 678999999999
No 37
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.91 E-value=0.064 Score=37.81 Aligned_cols=32 Identities=31% Similarity=0.700 Sum_probs=24.5
Q ss_pred eec--CCCC---CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 50 KCR--DPCP---GSCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 50 ~C~--d~C~---~~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
.|. ++|. +.|. ..|.++.|+|.|.|++||+..
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 454 5663 4564 579999999999999999753
No 38
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=93.83 E-value=0.061 Score=34.11 Aligned_cols=29 Identities=41% Similarity=1.012 Sum_probs=23.2
Q ss_pred CCCC--CCCCCCCeeEecCCccEEeCCCCCee
Q psy5614 53 DPCP--GSCGYNAQCKVINHTPICTCPEGYIG 82 (98)
Q Consensus 53 d~C~--~~C~~~a~C~~~~g~~~C~C~~Gy~G 82 (98)
|.|. +.|++++.|. ...+..|.|.+||.-
T Consensus 78 d~Cd~y~~CG~~g~C~-~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICN-SNNSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCccccCCccEeC-CCCCCceECCCCcCC
Confidence 5674 7899999995 445778999999964
No 39
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.16 E-value=0.083 Score=27.57 Aligned_cols=18 Identities=33% Similarity=0.704 Sum_probs=14.8
Q ss_pred EEe-cCCCCeEEcCCCcEe
Q psy5614 8 QCS-MQSNPSCSCLPNYIG 25 (98)
Q Consensus 8 ~C~-~~g~~~C~C~~Gy~G 25 (98)
.|+ ..++|+|.|++||+=
T Consensus 11 ~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp EEEEETTSEEEE-STTEEE
T ss_pred CCccCCCceEeECCCCCEE
Confidence 678 889999999999964
No 40
>KOG1226|consensus
Probab=93.12 E-value=0.26 Score=41.21 Aligned_cols=72 Identities=26% Similarity=0.766 Sum_probs=45.5
Q ss_pred EEecCCCCe---EEcCCCcEeCCCCCc--c----------cCcCC---CCCCCCceee---------------CCeec-C
Q psy5614 8 QCSMQSNPS---CSCLPNYIGAPPNCR--P----------ECVMN---SECASNLACI---------------NEKCR-D 53 (98)
Q Consensus 8 ~C~~~g~~~---C~C~~Gy~Gdg~~C~--~----------eC~~~---~~C~~~~~C~---------------~~~C~-d 53 (98)
.|..+|.+. |.|.+||.| ..|+ . .|+.. ..|+..+.|. +..|+ |
T Consensus 468 ~C~g~G~~~CG~C~C~~G~~G--~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCECD 545 (783)
T KOG1226|consen 468 LCHGNGTFVCGQCRCDEGWLG--KKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCECD 545 (783)
T ss_pred ccCCCCcEEecceecCCCCCC--CcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeeecc
Confidence 343345554 599999999 5554 1 25431 2577776552 34666 3
Q ss_pred C--CC----CCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614 54 P--CP----GSCGYNAQCKVINHTPICTCPEGYIGDPF 85 (98)
Q Consensus 54 ~--C~----~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~ 85 (98)
. |. ..|..+++|.= -+|+|.+||+|.-.
T Consensus 546 nfsC~r~~g~lC~g~G~C~C----G~CvC~~GwtG~~C 579 (783)
T KOG1226|consen 546 NFSCERHKGVLCGGHGRCEC----GRCVCNPGWTGSAC 579 (783)
T ss_pred CcccccccCcccCCCCeEeC----CcEEcCCCCccCCC
Confidence 3 43 35888888763 36999999999764
No 41
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=91.76 E-value=0.77 Score=25.13 Aligned_cols=25 Identities=28% Similarity=0.575 Sum_probs=18.1
Q ss_pred CCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614 57 GSCGYNAQCKVINHTPICTCPEGYIGDPF 85 (98)
Q Consensus 57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~ 85 (98)
..|..++.|++ -+|+|++||+-.+.
T Consensus 26 ~qC~~~s~C~~----g~C~C~~g~~~~~~ 50 (52)
T PF01683_consen 26 EQCIGGSVCVN----GRCQCPPGYVEVGG 50 (52)
T ss_pred CCCCCcCEEcC----CEeECCCCCEecCC
Confidence 45667888853 37999999976543
No 42
>PHA02887 EGF-like protein; Provisional
Probab=88.38 E-value=0.51 Score=31.09 Aligned_cols=23 Identities=26% Similarity=0.553 Sum_probs=19.2
Q ss_pred CcEEe---cCCCCeEEcCCCcEeCCCCC
Q psy5614 6 WIQCS---MQSNPSCSCLPNYIGAPPNC 30 (98)
Q Consensus 6 ~a~C~---~~g~~~C~C~~Gy~Gdg~~C 30 (98)
|++|. ......|.|..||+| ..|
T Consensus 96 HG~C~yI~dL~epsCrC~~GYtG--~RC 121 (126)
T PHA02887 96 NGECMNIIDLDEKFCICNKGYTG--IRC 121 (126)
T ss_pred CCEEEccccCCCceeECCCCccc--CCC
Confidence 78998 557899999999999 555
No 43
>smart00051 DSL delta serrate ligand.
Probab=87.17 E-value=0.82 Score=26.60 Aligned_cols=22 Identities=27% Similarity=0.406 Sum_probs=14.7
Q ss_pred CCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 59 CGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 59 C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
...+..|.. .| .++|.+||+|+
T Consensus 40 ~~~~~~Cd~-~G--~~~C~~Gw~G~ 61 (63)
T smart00051 40 FFGHYTCDE-NG--NKGCLEGWMGP 61 (63)
T ss_pred ccCCccCCc-CC--CEecCCCCcCC
Confidence 345566643 23 47899999985
No 44
>PHA02887 EGF-like protein; Provisional
Probab=86.89 E-value=0.72 Score=30.39 Aligned_cols=26 Identities=35% Similarity=0.761 Sum_probs=20.9
Q ss_pred CCCCCCCeeEecCC--ccEEeCCCCCeeC
Q psy5614 57 GSCGYNAQCKVINH--TPICTCPEGYIGD 83 (98)
Q Consensus 57 ~~C~~~a~C~~~~g--~~~C~C~~Gy~G~ 83 (98)
+-|- +++|..... .+.|.|.+||+|.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~ 119 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGI 119 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccC
Confidence 5666 679988654 7899999999994
No 45
>KOG0994|consensus
Probab=82.76 E-value=1.8 Score=38.48 Aligned_cols=21 Identities=38% Similarity=0.906 Sum_probs=16.1
Q ss_pred eEecCCccEE-eCCCCCeeCCC
Q psy5614 65 CKVINHTPIC-TCPEGYIGDPF 85 (98)
Q Consensus 65 C~~~~g~~~C-~C~~Gy~G~g~ 85 (98)
|.....++.| .|..||.||..
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~ 899 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPR 899 (1758)
T ss_pred ccccccccchhhhhccccCCcc
Confidence 5555567777 89999999975
No 46
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=78.04 E-value=2.3 Score=28.52 Aligned_cols=24 Identities=25% Similarity=0.577 Sum_probs=19.4
Q ss_pred CcEEe---cCCCCeEEcCCCcEeCCCCCc
Q psy5614 6 WIQCS---MQSNPSCSCLPNYIGAPPNCR 31 (98)
Q Consensus 6 ~a~C~---~~g~~~C~C~~Gy~Gdg~~C~ 31 (98)
|++|. ....+.|.|..||+| ..|+
T Consensus 55 HG~C~yI~dl~~~~CrC~~GYtG--eRCE 81 (139)
T PHA03099 55 HGDCIHARDIDGMYCRCSHGYTG--IRCQ 81 (139)
T ss_pred CCEEEeeccCCCceeECCCCccc--cccc
Confidence 45898 458899999999999 5664
No 47
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=77.46 E-value=2.2 Score=23.00 Aligned_cols=26 Identities=31% Similarity=0.691 Sum_probs=17.5
Q ss_pred CeeEecCCccEEeCCCCCeeCCCCCCcc
Q psy5614 63 AQCKVINHTPICTCPEGYIGDPFSSCFP 90 (98)
Q Consensus 63 a~C~~~~g~~~C~C~~Gy~G~g~~~C~~ 90 (98)
+.|... ..+|.|+++|+|..-..|.+
T Consensus 11 ~~C~~~--~G~C~C~~~~~G~~C~~C~~ 36 (49)
T PF00053_consen 11 QTCDPS--TGQCVCKPGTTGPRCDQCKP 36 (49)
T ss_dssp SSEEET--CEEESBSTTEESTTS-EE-T
T ss_pred CcccCC--CCEEeccccccCCcCcCCCC
Confidence 367663 44799999999987656654
No 48
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.07 E-value=3.1 Score=27.88 Aligned_cols=30 Identities=30% Similarity=0.687 Sum_probs=22.5
Q ss_pred CCCCCCCeeEecC--CccEEeCCCCCeeCCCCCCcc
Q psy5614 57 GSCGYNAQCKVIN--HTPICTCPEGYIGDPFSSCFP 90 (98)
Q Consensus 57 ~~C~~~a~C~~~~--g~~~C~C~~Gy~G~g~~~C~~ 90 (98)
+-|-++ +|.... ..+.|.|..||+|. +|.-
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGe---RCEh 82 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGI---RCQH 82 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccc---cccc
Confidence 456554 898865 58999999999995 5643
No 49
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=74.14 E-value=4.4 Score=21.99 Aligned_cols=19 Identities=26% Similarity=0.665 Sum_probs=13.6
Q ss_pred cEEeCCCCCeeCCCCCCcc
Q psy5614 72 PICTCPEGYIGDPFSSCFP 90 (98)
Q Consensus 72 ~~C~C~~Gy~G~g~~~C~~ 90 (98)
-+|.|+++|+|..-..|.+
T Consensus 19 G~C~C~~~~~G~~C~~C~~ 37 (50)
T cd00055 19 GQCECKPNTTGRRCDRCAP 37 (50)
T ss_pred CEEeCCCcCCCCCCCCCCC
Confidence 4688888888876656654
No 50
>KOG0994|consensus
Probab=73.94 E-value=3.4 Score=36.78 Aligned_cols=19 Identities=37% Similarity=0.793 Sum_probs=15.3
Q ss_pred cCCCCeE-EcCCCcEeCCCC
Q psy5614 11 MQSNPSC-SCLPNYIGAPPN 29 (98)
Q Consensus 11 ~~g~~~C-~C~~Gy~Gdg~~ 29 (98)
..+++.| .|..||.||++.
T Consensus 881 ~T~G~~CdrCl~GyyGdP~l 900 (1758)
T KOG0994|consen 881 STTGHSCDRCLDGYYGDPRL 900 (1758)
T ss_pred cccccchhhhhccccCCccc
Confidence 6677888 999999988653
No 51
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=73.48 E-value=3.2 Score=21.44 Aligned_cols=13 Identities=62% Similarity=1.196 Sum_probs=10.8
Q ss_pred cEEeCCCCCeeCC
Q psy5614 72 PICTCPEGYIGDP 84 (98)
Q Consensus 72 ~~C~C~~Gy~G~g 84 (98)
..|.|+.||.-|-
T Consensus 18 ~~C~CPeGyIlde 30 (34)
T PF09064_consen 18 GQCFCPEGYILDE 30 (34)
T ss_pred CceeCCCceEecC
Confidence 4899999998664
No 52
>KOG1226|consensus
Probab=69.44 E-value=7.4 Score=32.96 Aligned_cols=21 Identities=24% Similarity=0.534 Sum_probs=13.8
Q ss_pred ccCCcEEecCCCCeEEcCCCcEeC
Q psy5614 3 WLMWIQCSMQSNPSCSCLPNYIGA 26 (98)
Q Consensus 3 ~~~~a~C~~~g~~~C~C~~Gy~Gd 26 (98)
|..|++|. -.+|+|.+||+|+
T Consensus 557 C~g~G~C~---CG~CvC~~GwtG~ 577 (783)
T KOG1226|consen 557 CGGHGRCE---CGRCVCNPGWTGS 577 (783)
T ss_pred cCCCCeEe---CCcEEcCCCCccC
Confidence 34455553 2358999999994
No 53
>KOG3607|consensus
Probab=61.98 E-value=9 Score=32.23 Aligned_cols=45 Identities=27% Similarity=0.751 Sum_probs=34.0
Q ss_pred CCCCCCCceeeCCeec-------CCCCCCCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614 37 NSECASNLACINEKCR-------DPCPGSCGYNAQCKVINHTPICTCPEGYIGDP 84 (98)
Q Consensus 37 ~~~C~~~~~C~~~~C~-------d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~G~g 84 (98)
...|.+..+|++.+|. +.|+..|+.++.|-+. +.|.|.+||++..
T Consensus 603 Gt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~---~~ChC~~gwapp~ 654 (716)
T KOG3607|consen 603 GTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNE---LNCHCEPGWAPPF 654 (716)
T ss_pred CCccCCCceecCCcchhhhhhcccccccccCCCcccCCC---cceeeCCCCCCCc
Confidence 4557777788888885 4466668888888764 5699999998854
No 54
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=60.82 E-value=8.5 Score=20.57 Aligned_cols=19 Identities=26% Similarity=0.679 Sum_probs=12.3
Q ss_pred cEEeCCCCCeeCCCCCCcc
Q psy5614 72 PICTCPEGYIGDPFSSCFP 90 (98)
Q Consensus 72 ~~C~C~~Gy~G~g~~~C~~ 90 (98)
-+|.|+++|+|..-..|.+
T Consensus 18 G~C~C~~~~~G~~C~~C~~ 36 (46)
T smart00180 18 GQCECKPNVTGRRCDRCAP 36 (46)
T ss_pred CEEECCCCCCCCCCCcCCC
Confidence 3688888888865444543
No 55
>KOG1836|consensus
Probab=59.42 E-value=9.9 Score=35.01 Aligned_cols=31 Identities=29% Similarity=0.786 Sum_probs=18.2
Q ss_pred CCCCCCeeEecC--CccEEe-CCCCCeeCCCCCC
Q psy5614 58 SCGYNAQCKVIN--HTPICT-CPEGYIGDPFSSC 88 (98)
Q Consensus 58 ~C~~~a~C~~~~--g~~~C~-C~~Gy~G~g~~~C 88 (98)
+|.....|.... ....|+ |++||+|.-...|
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c 814 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRCEEC 814 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCcccccccC
Confidence 344444454433 466787 8888888544333
No 56
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=55.07 E-value=8.3 Score=22.05 Aligned_cols=28 Identities=29% Similarity=0.597 Sum_probs=15.5
Q ss_pred CCccCCcEEe-----cCCCCeEEcCCCcEeCCCCC
Q psy5614 1 MSWLMWIQCS-----MQSNPSCSCLPNYIGAPPNC 30 (98)
Q Consensus 1 ~~~~~~a~C~-----~~g~~~C~C~~Gy~Gdg~~C 30 (98)
++|.-|++-. ..|...|+|..-|.| ..|
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~G--pdC 49 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGG--PDC 49 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEES--TTS
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCC--CCc
Confidence 3667777775 467788999999998 454
No 57
>KOG3516|consensus
Probab=52.66 E-value=12 Score=33.32 Aligned_cols=31 Identities=29% Similarity=0.697 Sum_probs=27.2
Q ss_pred CCCC-CCCCCCCeeEecCCccEEeCC-CCCeeC
Q psy5614 53 DPCP-GSCGYNAQCKVINHTPICTCP-EGYIGD 83 (98)
Q Consensus 53 d~C~-~~C~~~a~C~~~~g~~~C~C~-~Gy~G~ 83 (98)
|.|+ ++|...+.|......|.|.|. .||+|-
T Consensus 546 drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 546 DRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred cccCCccccCCCcccccccceeEeccccccccc
Confidence 6674 899999999997789999999 999884
No 58
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=42.08 E-value=18 Score=19.16 Aligned_cols=23 Identities=26% Similarity=0.567 Sum_probs=15.1
Q ss_pred CCeE-EcCCCcEeCCCCCcccCcC
Q psy5614 14 NPSC-SCLPNYIGAPPNCRPECVM 36 (98)
Q Consensus 14 ~~~C-~C~~Gy~Gdg~~C~~eC~~ 36 (98)
...| .|+.||..+...|.++|..
T Consensus 14 ~~~C~~C~~~~~~~~~~Cv~~C~~ 37 (49)
T cd00064 14 PDQCTSCRHGFYLDGGTCVSECPE 37 (49)
T ss_pred cCcCccCcCccCCCCCcccccCCC
Confidence 4567 8888886655677655543
No 59
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=34.14 E-value=13 Score=20.28 Aligned_cols=19 Identities=37% Similarity=1.001 Sum_probs=13.9
Q ss_pred EEeCCCCCeeCCCCCCccC
Q psy5614 73 ICTCPEGYIGDPFSSCFPK 91 (98)
Q Consensus 73 ~C~C~~Gy~G~g~~~C~~~ 91 (98)
-|.|++||.-+....|++.
T Consensus 34 gC~C~~G~v~~~~~~CV~~ 52 (55)
T PF01826_consen 34 GCFCPPGYVRNDNGRCVPP 52 (55)
T ss_dssp EEEETTTEEEETTSEEEEG
T ss_pred cCCCCCCeeEcCCCCEEcH
Confidence 3999999986554467764
No 60
>KOG0196|consensus
Probab=31.94 E-value=70 Score=27.94 Aligned_cols=58 Identities=22% Similarity=0.499 Sum_probs=30.8
Q ss_pred CeEEcCCCcEe--CCCCCcccCcCCCCCCCCceeeCCeecCCCCCCCCCCCeeEecCCccEEeCCCCCee
Q psy5614 15 PSCSCLPNYIG--APPNCRPECVMNSECASNLACINEKCRDPCPGSCGYNAQCKVINHTPICTCPEGYIG 82 (98)
Q Consensus 15 ~~C~C~~Gy~G--dg~~C~~eC~~~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~G 82 (98)
..|.|++||.- .+..|+ .|..+..= ..=.... | .+|.+|..=. ..++..|.|..||.-
T Consensus 259 G~C~C~aGye~~~~~~~C~-aCp~G~yK---~~~~~~~----C-~~CP~~S~s~-~ega~~C~C~~gyyR 318 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQ-ACPPGTYK---ASQGDSL----C-LPCPPNSHSS-SEGATSCTCENGYYR 318 (996)
T ss_pred CceeecCCCCcccCCCcce-eCCCCccc---CCCCCCC----C-CCCCCCCCCC-CCCCCcccccCCccc
Confidence 45899999965 455664 23221100 0000011 2 2455544322 467889999999963
No 61
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=30.99 E-value=22 Score=20.48 Aligned_cols=8 Identities=50% Similarity=1.356 Sum_probs=5.4
Q ss_pred EcCCCcEe
Q psy5614 18 SCLPNYIG 25 (98)
Q Consensus 18 ~C~~Gy~G 25 (98)
.|++||.|
T Consensus 53 ~C~~Gw~G 60 (63)
T PF01414_consen 53 VCLPGWTG 60 (63)
T ss_dssp EE-TTEES
T ss_pred CCCCCCcC
Confidence 67788887
No 62
>PF09402 MSC: Man1-Src1p-C-terminal domain; InterPro: IPR018996 This entry represents the Inner nuclear membrane proteins MAN1 (also known as LEM domain-containing protein 3) and LEM domain-containing protein 2 (or LEM protein 2). Emerin and MAN1 are LEM domain-containing integral membrane proteins of the vertebrate nuclear envelope []. MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C-terminal nulceoplasmic region forms a DNA binding winged helix and binds to Smad []. LEM protein 2 is an essential protein involved in chromosome segregation and cell division, probably via its interaction with lmn-1, the main component of nuclear lamina. Has some overlapping function with emr-1.; GO: 0005639 integral to nuclear inner membrane; PDB: 2CH0_A.
Probab=22.70 E-value=28 Score=25.85 Aligned_cols=24 Identities=25% Similarity=0.414 Sum_probs=0.0
Q ss_pred CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614 57 GSCGYNAQCKVINHTPICTCPEGYIGD 83 (98)
Q Consensus 57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~ 83 (98)
-||.+||.|... ..-.|.+||.-.
T Consensus 53 ~pCP~~a~C~~~---~~~~C~~~y~~~ 76 (334)
T PF09402_consen 53 EPCPEHAICYPG---LKLECEPGYVLK 76 (334)
T ss_dssp ---------------------------
T ss_pred cccccccccccc---cccccccccccc
Confidence 389999999853 457899999865
No 63
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=21.33 E-value=3.2e+02 Score=23.57 Aligned_cols=23 Identities=17% Similarity=0.446 Sum_probs=16.7
Q ss_pred CCccEEeCCCCCeeCCCCCCccCC
Q psy5614 69 NHTPICTCPEGYIGDPFSSCFPKP 92 (98)
Q Consensus 69 ~g~~~C~C~~Gy~G~g~~~C~~~~ 92 (98)
.....|+|..||.-.+- .|++..
T Consensus 748 ~~~~vC~C~~g~~l~~~-~c~~~~ 770 (800)
T PTZ00214 748 SNQGVCMCELDAVLTKG-VCVPAK 770 (800)
T ss_pred ccCCeEEeCCcceecCC-eeEecc
Confidence 34568999999986664 687653
Done!