Query psy6902
Match_columns 86
No_of_seqs 195 out of 1261
Neff 8.3
Searched_HMMs 46136
Date Fri Aug 16 21:23:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6902.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6902hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1219|consensus 99.5 1.6E-14 3.6E-19 115.9 4.8 69 2-86 3865-3938(4289)
2 KOG1219|consensus 99.1 1.3E-10 2.9E-15 94.3 4.6 70 2-86 3904-3977(4289)
3 PF00008 EGF: EGF-like domain 98.9 9.7E-10 2.1E-14 52.4 2.2 32 50-84 1-32 (32)
4 KOG4289|consensus 98.8 4E-09 8.7E-14 83.6 3.2 53 18-86 1222-1274(2531)
5 smart00179 EGF_CA Calcium-bind 98.6 6.8E-08 1.5E-12 47.0 4.0 36 47-86 2-39 (39)
6 cd00054 EGF_CA Calcium-binding 98.5 3.6E-07 7.7E-12 43.8 4.0 36 47-86 2-38 (38)
7 KOG4289|consensus 98.2 1.5E-06 3.3E-11 69.6 3.9 72 3-86 1241-1316(2531)
8 cd00053 EGF Epidermal growth f 98.1 5.9E-06 1.3E-10 38.8 4.0 33 50-86 2-36 (36)
9 smart00181 EGF Epidermal growt 98.0 1.1E-05 2.4E-10 38.3 3.9 32 50-86 2-35 (35)
10 PF07645 EGF_CA: Calcium-bindi 97.9 1.5E-05 3.3E-10 39.8 2.5 31 47-81 2-34 (42)
11 KOG1214|consensus 97.6 8.7E-05 1.9E-09 57.5 4.4 51 18-83 809-859 (1289)
12 PF07974 EGF_2: EGF-like domai 97.3 0.0006 1.3E-08 32.3 3.4 27 53-85 6-32 (32)
13 KOG3514|consensus 97.3 0.00017 3.7E-09 57.0 2.2 35 49-86 625-659 (1591)
14 PF12661 hEGF: Human growth fa 97.1 0.00025 5.3E-09 27.1 1.0 13 72-85 1-13 (13)
15 KOG3516|consensus 97.1 0.00041 9E-09 55.1 2.7 37 47-86 545-581 (1306)
16 PHA03099 epidermal growth fact 96.6 0.0019 4.1E-08 39.8 2.6 37 47-86 42-81 (139)
17 KOG1217|consensus 96.6 0.01 2.2E-07 41.8 6.4 35 47-85 271-306 (487)
18 KOG3516|consensus 96.3 0.0034 7.4E-08 50.2 3.1 35 49-86 957-991 (1306)
19 PHA02887 EGF-like protein; Pro 96.3 0.0055 1.2E-07 37.2 3.0 37 47-86 83-122 (126)
20 PF12947 EGF_3: EGF domain; I 96.1 0.0047 1E-07 29.9 1.8 26 54-83 7-32 (36)
21 KOG1225|consensus 95.9 0.02 4.4E-07 42.6 5.0 26 53-86 316-341 (525)
22 KOG1217|consensus 95.7 0.016 3.6E-07 40.8 3.9 36 47-86 169-206 (487)
23 KOG1225|consensus 95.0 0.044 9.4E-07 40.9 4.3 61 3-86 246-310 (525)
24 PF12662 cEGF: Complement Clr- 94.1 0.05 1.1E-06 24.0 1.8 11 70-81 1-11 (24)
25 KOG1836|consensus 91.4 0.41 8.8E-06 40.4 4.8 68 13-86 740-812 (1705)
26 PF14670 FXa_inhibition: Coagu 89.6 0.29 6.3E-06 23.5 1.6 18 60-81 11-28 (36)
27 PF00053 Laminin_EGF: Laminin 89.4 0.42 9E-06 24.1 2.2 22 59-86 11-32 (49)
28 smart00051 DSL delta serrate l 89.4 0.56 1.2E-05 25.3 2.8 24 55-85 40-63 (63)
29 KOG1214|consensus 89.0 0.74 1.6E-05 36.7 4.2 60 20-83 744-820 (1289)
30 PF12946 EGF_MSP1_1: MSP1 EGF 85.4 0.81 1.8E-05 22.3 1.8 29 50-81 2-30 (37)
31 cd01475 vWA_Matrilin VWA_Matri 85.0 1.5 3.3E-05 28.7 3.6 30 47-82 187-218 (224)
32 KOG1226|consensus 82.2 2.3 4.9E-05 33.4 3.9 33 47-86 585-620 (783)
33 cd00055 EGF_Lam Laminin-type e 81.9 1.8 3.9E-05 21.9 2.4 15 71-86 19-33 (50)
34 KOG0994|consensus 77.2 5.9 0.00013 33.0 4.8 48 37-86 889-948 (1758)
35 smart00180 EGF_Lam Laminin-typ 75.6 2.6 5.7E-05 21.0 1.8 15 71-86 18-32 (46)
36 KOG4260|consensus 75.6 4.5 9.7E-05 28.4 3.4 33 52-85 149-181 (350)
37 KOG1226|consensus 73.1 5.8 0.00013 31.2 3.7 40 34-86 538-580 (783)
38 PF06247 Plasmod_Pvs28: Plasmo 71.4 1.1 2.4E-05 29.5 -0.3 64 7-81 6-79 (197)
39 KOG4260|consensus 68.2 3.8 8.3E-05 28.7 1.7 33 46-82 235-269 (350)
40 PF00954 S_locus_glycop: S-loc 65.0 10 0.00022 22.1 3.0 32 47-83 77-109 (110)
41 KOG3509|consensus 64.2 6.1 0.00013 32.0 2.3 36 47-86 406-441 (964)
42 PF12955 DUF3844: Domain of un 63.9 10 0.00023 22.5 2.8 26 52-78 12-39 (103)
43 PF01414 DSL: Delta serrate li 63.2 2.6 5.6E-05 22.7 0.1 11 74-85 53-63 (63)
44 KOG0994|consensus 59.4 16 0.00035 30.7 3.9 30 13-46 1079-1108(1758)
45 KOG3514|consensus 53.0 13 0.00029 30.9 2.5 37 47-86 1018-1054(1591)
46 PF09064 Tme5_EGF_like: Thromb 50.1 15 0.00033 17.4 1.6 24 4-27 3-27 (34)
47 PF12020 TAFA: TAFA family; I 41.2 21 0.00045 20.6 1.5 25 49-74 56-80 (90)
48 PF01683 EB: EB module; Inter 40.0 42 0.0009 16.7 2.5 20 54-81 27-46 (52)
49 KOG1836|consensus 38.3 82 0.0018 27.5 5.0 35 3-44 776-820 (1705)
50 KOG3512|consensus 28.6 1.2E+02 0.0026 23.2 4.1 68 3-85 396-477 (592)
No 1
>KOG1219|consensus
Probab=99.50 E-value=1.6e-14 Score=115.87 Aligned_cols=69 Identities=32% Similarity=0.925 Sum_probs=62.3
Q ss_pred CCCCCCCCCCC--CeeC---CccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCC
Q psy6902 2 SACDPAHVRPG--CRIQ---DNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCS 76 (86)
Q Consensus 2 ~~C~~~~c~~g--C~~~---~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~ 76 (86)
.+|..+||+|| |... .|.|.|++. |+|.+||.+ +.+|.++||++||+|+... ++|.|.|+
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsq-------ysG~~CEi~-----~epC~snPC~~GgtCip~~---n~f~CnC~ 3929 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQ-------YSGNHCEID-----LEPCASNPCLTGGTCIPFY---NGFLCNCP 3929 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCccc-------ccCcccccc-----cccccCCCCCCCCEEEecC---CCeeEeCC
Confidence 47999999999 8763 399999999 489999999 8999999999999999875 49999999
Q ss_pred CCCcCCCCCC
Q psy6902 77 GTGYYGKKCQ 86 (86)
Q Consensus 77 ~~g~~G~~Ce 86 (86)
.||+|++||
T Consensus 3930 -~gyTG~~Ce 3938 (4289)
T KOG1219|consen 3930 -NGYTGKRCE 3938 (4289)
T ss_pred -CCccCceee
Confidence 899999997
No 2
>KOG1219|consensus
Probab=99.07 E-value=1.3e-10 Score=94.32 Aligned_cols=70 Identities=27% Similarity=0.688 Sum_probs=60.8
Q ss_pred CCCCCCCCCCC--CeeC--CccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCC
Q psy6902 2 SACDPAHVRPG--CRIQ--DNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSG 77 (86)
Q Consensus 2 ~~C~~~~c~~g--C~~~--~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~ 77 (86)
.+|.++||++| |... .|.|.|+.+ |||.+|+..- +++|..++|.+||.|++.. +.|.|.|.
T Consensus 3904 epC~snPC~~GgtCip~~n~f~CnC~~g-------yTG~~Ce~~G----i~eCs~n~C~~gg~C~n~~---gsf~CncT- 3968 (4289)
T KOG1219|consen 3904 EPCASNPCLTGGTCIPFYNGFLCNCPNG-------YTGKRCEARG----ISECSKNVCGTGGQCINIP---GSFHCNCT- 3968 (4289)
T ss_pred ccccCCCCCCCCEEEecCCCeeEeCCCC-------ccCceeeccc----ccccccccccCCceeeccC---CceEeccC-
Confidence 36999999999 8763 499999999 4999998641 8999999999999999985 48999999
Q ss_pred CCcCCCCCC
Q psy6902 78 TGYYGKKCQ 86 (86)
Q Consensus 78 ~g~~G~~Ce 86 (86)
+||.|+.|+
T Consensus 3969 ~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3969 PGILGRTCC 3977 (4289)
T ss_pred hhHhcccCc
Confidence 999999873
No 3
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.90 E-value=9.7e-10 Score=52.41 Aligned_cols=32 Identities=47% Similarity=1.216 Sum_probs=27.9
Q ss_pred CCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCC
Q psy6902 50 CDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKK 84 (86)
Q Consensus 50 C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~ 84 (86)
|.++||+|+|+|++.. .+.|+|.|+ +||+|++
T Consensus 1 C~~~~C~n~g~C~~~~--~~~y~C~C~-~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLP--GGGYTCECP-PGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEES--TSEEEEEEB-TTEESTT
T ss_pred CCCCcCCCCeEEEeCC--CCCEEeECC-CCCccCC
Confidence 6788999999999885 258999999 8999975
No 4
>KOG4289|consensus
Probab=98.78 E-value=4e-09 Score=83.61 Aligned_cols=53 Identities=36% Similarity=0.950 Sum_probs=48.6
Q ss_pred ccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 18 NQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 18 ~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
+.|.||+| |||..|++. +|.|.++||.|+|+|.... ++|+|.|. +||+|.+||
T Consensus 1222 lrCrCPpG-------FTgd~CeTe-----iDlCYs~pC~nng~C~srE---ggYtCeCr-pg~tGehCE 1274 (2531)
T KOG4289|consen 1222 LRCRCPPG-------FTGDYCETE-----IDLCYSGPCGNNGRCRSRE---GGYTCECR-PGFTGEHCE 1274 (2531)
T ss_pred eeEeCCCC-------CCcccccch-----hHhhhcCCCCCCCceEEec---CceeEEec-CCcccccee
Confidence 78999999 388899998 9999999999999999874 59999999 999999997
No 5
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.64 E-value=6.8e-08 Score=46.97 Aligned_cols=36 Identities=42% Similarity=1.227 Sum_probs=31.7
Q ss_pred cccCCC-CCCcCCceEecCCCCCCCeEEeCCCCCcC-CCCCC
Q psy6902 47 QNACDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYY-GKKCQ 86 (86)
Q Consensus 47 ~~~C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~-G~~Ce 86 (86)
+++|.. +||.++++|++.. ++|.|.|+ +||+ |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~---g~~~C~C~-~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTV---GSYRCECP-PGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCC---CCeEeECC-CCCccCCcCC
Confidence 678888 8999999999874 48999999 8999 99886
No 6
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.47 E-value=3.6e-07 Score=43.80 Aligned_cols=36 Identities=42% Similarity=1.221 Sum_probs=31.2
Q ss_pred cccCCC-CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
+++|.. .||.+++.|++.. ++|.|.|+ .||.|+.|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~---~~~~C~C~-~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTV---GSYRCSCP-PGYTGRNCE 38 (38)
T ss_pred cccCCCCCCcCCCCEeECCC---CCeEeECC-CCCcCCcCC
Confidence 578887 7999999999875 48999999 899999885
No 7
>KOG4289|consensus
Probab=98.20 E-value=1.5e-06 Score=69.61 Aligned_cols=72 Identities=26% Similarity=0.721 Sum_probs=55.6
Q ss_pred CCCCCCCCCC--Cee--CCccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCC
Q psy6902 3 ACDPAHVRPG--CRI--QDNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGT 78 (86)
Q Consensus 3 ~C~~~~c~~g--C~~--~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~ 78 (86)
.|-+.||.++ |.. ..|.|.|.++ | +|.+||... +.-.|.+.-|+|||+|++... ++|.|+|+..
T Consensus 1241 lCYs~pC~nng~C~srEggYtCeCrpg----~---tGehCEvs~---~agrCvpGvC~nggtC~~~~n--ggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1241 LCYSGPCGNNGRCRSREGGYTCECRPG----F---TGEHCEVSA---RAGRCVPGVCKNGGTCVNLLN--GGFCCHCPYG 1308 (2531)
T ss_pred hhhcCCCCCCCceEEecCceeEEecCC----c---cccceeeec---ccCccccceecCCCEEeecCC--CceeccCCCc
Confidence 4667888876 865 4499999888 4 889998761 123599999999999998863 6899999944
Q ss_pred CcCCCCCC
Q psy6902 79 GYYGKKCQ 86 (86)
Q Consensus 79 g~~G~~Ce 86 (86)
.|.+++||
T Consensus 1309 e~e~prC~ 1316 (2531)
T KOG4289|consen 1309 EFEDPRCE 1316 (2531)
T ss_pred ccCCCceE
Confidence 57788775
No 8
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=98.14 E-value=5.9e-06 Score=38.81 Aligned_cols=33 Identities=45% Similarity=1.241 Sum_probs=27.9
Q ss_pred CC-CCCCcCCceEecCCCCCCCeEEeCCCCCcCCC-CCC
Q psy6902 50 CD-TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGK-KCQ 86 (86)
Q Consensus 50 C~-~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~-~Ce 86 (86)
|. .++|.+++.|++.. ++|.|.|+ .||.|. .|+
T Consensus 2 C~~~~~C~~~~~C~~~~---~~~~C~C~-~g~~g~~~C~ 36 (36)
T cd00053 2 CAASNPCSNGGTCVNTP---GSYRCVCP-PGYTGDRSCE 36 (36)
T ss_pred CCCCCCCCCCCEEecCC---CCeEeECC-CCCcccCCcC
Confidence 44 67899999999875 48999999 899998 775
No 9
>smart00181 EGF Epidermal growth factor-like domain.
Probab=98.05 E-value=1.1e-05 Score=38.26 Aligned_cols=32 Identities=44% Similarity=1.216 Sum_probs=27.0
Q ss_pred CCC-CCCcCCceEecCCCCCCCeEEeCCCCCcCC-CCCC
Q psy6902 50 CDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYYG-KKCQ 86 (86)
Q Consensus 50 C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G-~~Ce 86 (86)
|.. ++|.++ .|++.. ++|.|.|+ .||.| +.|+
T Consensus 2 C~~~~~C~~~-~C~~~~---~~~~C~C~-~g~~g~~~C~ 35 (35)
T smart00181 2 CASGGPCSNG-TCINTP---GSYTCSCP-PGYTGDKRCE 35 (35)
T ss_pred CCCcCCCCCC-EEECCC---CCeEeECC-CCCccCCccC
Confidence 556 689998 999874 48999999 89999 8875
No 10
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.85 E-value=1.5e-05 Score=39.81 Aligned_cols=31 Identities=32% Similarity=0.854 Sum_probs=26.7
Q ss_pred cccCCCC--CCcCCceEecCCCCCCCeEEeCCCCCcC
Q psy6902 47 QNACDTN--PCYHGGRCTQTFQDRTGYKCRCSGTGYY 81 (86)
Q Consensus 47 ~~~C~~~--PC~ngg~C~~~~~~~~~~~C~C~~~g~~ 81 (86)
||||... +|..++.|+++. ++|.|.|+ +||.
T Consensus 2 idEC~~~~~~C~~~~~C~N~~---Gsy~C~C~-~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTE---GSYSCSCP-PGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEET---TEEEEEES-TTEE
T ss_pred ccccCCCCCcCCCCCEEEcCC---CCEEeeCC-CCcE
Confidence 7888764 598899999985 59999999 8997
No 11
>KOG1214|consensus
Probab=97.61 E-value=8.7e-05 Score=57.46 Aligned_cols=51 Identities=29% Similarity=0.801 Sum_probs=40.9
Q ss_pred ccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCC
Q psy6902 18 NQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGK 83 (86)
Q Consensus 18 ~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~ 83 (86)
|.|.|.+|+.+ .|..|. ++|+|.++-|...++|.+.. ++|.|.|. +||.|.
T Consensus 809 y~C~CLPGfsG-----DG~~c~------dvDeC~psrChp~A~Cyntp---gsfsC~C~-pGy~GD 859 (1289)
T KOG1214|consen 809 YSCACLPGFSG-----DGHQCT------DVDECSPSRCHPAATCYNTP---GSFSCRCQ-PGYYGD 859 (1289)
T ss_pred EEEeecCCccC-----Cccccc------cccccCccccCCCceEecCC---Ccceeecc-cCccCC
Confidence 88888888544 223443 37999999999999999985 59999999 999974
No 12
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.27 E-value=0.0006 Score=32.28 Aligned_cols=27 Identities=37% Similarity=0.942 Sum_probs=23.1
Q ss_pred CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902 53 NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC 85 (86)
Q Consensus 53 ~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C 85 (86)
..|.+.|+|+.. ..+|.|. +||+|+.|
T Consensus 6 ~~C~~~G~C~~~-----~g~C~C~-~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP-----CGRCVCD-SGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC-----CCEEECC-CCCcCCCC
Confidence 469999999975 4689999 89999987
No 13
>KOG3514|consensus
Probab=97.26 E-value=0.00017 Score=56.98 Aligned_cols=35 Identities=43% Similarity=1.319 Sum_probs=32.5
Q ss_pred cCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 49 ACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 49 ~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.|.++||+|+|.|.+..+ .|.|.|.+.+|.|+.||
T Consensus 625 ~C~~nPC~N~g~C~egwN---rfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWN---RFICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCcccccc---ccccccccCcccCcccc
Confidence 799999999999999874 89999998999999996
No 14
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.12 E-value=0.00025 Score=27.11 Aligned_cols=13 Identities=38% Similarity=1.336 Sum_probs=10.6
Q ss_pred EEeCCCCCcCCCCC
Q psy6902 72 KCRCSGTGYYGKKC 85 (86)
Q Consensus 72 ~C~C~~~g~~G~~C 85 (86)
+|.|+ +||+|++|
T Consensus 1 ~C~C~-~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCP-PGWTGPNC 13 (13)
T ss_dssp EEEE--TTEETTTT
T ss_pred CccCc-CCCcCCCC
Confidence 58999 89999987
No 15
>KOG3516|consensus
Probab=97.07 E-value=0.00041 Score=55.12 Aligned_cols=37 Identities=43% Similarity=1.192 Sum_probs=33.1
Q ss_pred cccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
+|.|.+|||+|||.|.+... .|.|.|...||.|..|+
T Consensus 545 ~drClPN~CehgG~C~Qs~~---~f~C~C~~TGY~GatCH 581 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWD---DFECNCELTGYKGATCH 581 (1306)
T ss_pred ccccCCccccCCCccccccc---ceeEecccccccccccc
Confidence 68899999999999999754 89999987899999885
No 16
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=96.62 E-value=0.0019 Score=39.76 Aligned_cols=37 Identities=41% Similarity=0.952 Sum_probs=28.0
Q ss_pred cccCCC---CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDT---NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~---~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
+.+|.+ +-|.|| .|.-... ...+.|+|. .||+|.+||
T Consensus 42 i~~Cp~ey~~YClHG-~C~yI~d-l~~~~CrC~-~GYtGeRCE 81 (139)
T PHA03099 42 IRLCGPEGDGYCLHG-DCIHARD-IDGMYCRCS-HGYTGIRCQ 81 (139)
T ss_pred cccCChhhCCEeECC-EEEeecc-CCCceeECC-CCccccccc
Confidence 344543 469997 8986543 457999999 899999997
No 17
>KOG1217|consensus
Probab=96.58 E-value=0.01 Score=41.77 Aligned_cols=35 Identities=31% Similarity=0.970 Sum_probs=30.1
Q ss_pred cccCCCCC-CcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902 47 QNACDTNP-CYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC 85 (86)
Q Consensus 47 ~~~C~~~P-C~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C 85 (86)
++.|...+ |.++++|++... .|.|.|+ +||+|..|
T Consensus 271 ~~~C~~~~~c~~~~~C~~~~~---~~~C~C~-~g~~g~~~ 306 (487)
T KOG1217|consen 271 VDSCALIASCPNGGTCVNVPG---SYRCTCP-PGFTGRLC 306 (487)
T ss_pred ccccCCCCccCCCCeeecCCC---cceeeCC-CCCCCCCC
Confidence 78898875 999999999853 5999999 99999875
No 18
>KOG3516|consensus
Probab=96.35 E-value=0.0034 Score=50.20 Aligned_cols=35 Identities=43% Similarity=1.191 Sum_probs=31.1
Q ss_pred cCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 49 ACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 49 ~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
-|.+.||+|||.|++... +|+|.|....|.|..|.
T Consensus 957 hCss~~C~NGG~Cvery~---gytCDCs~Tay~Gp~Cs 991 (1306)
T KOG3516|consen 957 HCSSYPCLNGGHCVERYD---GYTCDCSRTAYDGPFCS 991 (1306)
T ss_pred ccccccccCCCEEEEecC---ceeeccccCcCCCCccc
Confidence 588999999999999864 99999987899999883
No 19
>PHA02887 EGF-like protein; Provisional
Probab=96.26 E-value=0.0055 Score=37.19 Aligned_cols=37 Identities=27% Similarity=0.775 Sum_probs=28.3
Q ss_pred cccCCC---CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDT---NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~---~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.++|.. +-|.| |+|.-... .....|.|. .||+|.+||
T Consensus 83 f~pC~~eyk~YCiH-G~C~yI~d-L~epsCrC~-~GYtG~RCE 122 (126)
T PHA02887 83 FEKCKNDFNDFCIN-GECMNIID-LDEKFCICN-KGYTGIRCD 122 (126)
T ss_pred ccccChHhhCEeeC-CEEEcccc-CCCceeECC-CCcccCCCC
Confidence 456754 46886 59986554 457899999 899999996
No 20
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.09 E-value=0.0047 Score=29.90 Aligned_cols=26 Identities=38% Similarity=0.876 Sum_probs=19.2
Q ss_pred CCcCCceEecCCCCCCCeEEeCCCCCcCCC
Q psy6902 54 PCYHGGRCTQTFQDRTGYKCRCSGTGYYGK 83 (86)
Q Consensus 54 PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~ 83 (86)
.|...++|++... +|+|.|+ +||.|.
T Consensus 7 ~C~~nA~C~~~~~---~~~C~C~-~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGG---SYTCTCK-PGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TT---SEEEEE--CEEECC
T ss_pred CCCCCcEeecCCC---CEEeECC-CCCccC
Confidence 4777789999853 8999999 899874
No 21
>KOG1225|consensus
Probab=95.87 E-value=0.02 Score=42.65 Aligned_cols=26 Identities=35% Similarity=0.974 Sum_probs=16.6
Q ss_pred CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 53 NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 53 ~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
..|.+.|.|++. .|.|. +||+|..|+
T Consensus 316 adC~g~G~Ci~G-------~C~C~-~Gy~G~~C~ 341 (525)
T KOG1225|consen 316 ADCSGHGKCIDG-------ECLCD-EGYTGELCI 341 (525)
T ss_pred ccCCCCCcccCC-------ceEeC-CCCcCCccc
Confidence 566777777632 46676 677776663
No 22
>KOG1217|consensus
Probab=95.72 E-value=0.016 Score=40.75 Aligned_cols=36 Identities=39% Similarity=1.042 Sum_probs=29.7
Q ss_pred cccCC--CCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACD--TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~--~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.++|. ..+|++++.|.+... .|.|.|+ ++|++..|+
T Consensus 169 ~~~C~~~~~~c~~~~~C~~~~~---~~~C~c~-~~~~~~~~~ 206 (487)
T KOG1217|consen 169 LDECIQYSSPCQNGGTCVNTGG---SYLCSCP-PGYTGSTCE 206 (487)
T ss_pred ccccccCCCCcCCCcccccCCC---CeeEeCC-CCccCCcCc
Confidence 46887 456999999999854 6999999 899998764
No 23
>KOG1225|consensus
Probab=95.00 E-value=0.044 Score=40.94 Aligned_cols=61 Identities=36% Similarity=0.990 Sum_probs=39.6
Q ss_pred CCCCCCCCCCCee----CCccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCC
Q psy6902 3 ACDPAHVRPGCRI----QDNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGT 78 (86)
Q Consensus 3 ~C~~~~c~~gC~~----~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~ 78 (86)
.|+...|.++|.. ....|.|+++ | +|..|+.. .|... |..++.+++ + .|.|. +
T Consensus 246 ~c~~~~C~~~c~~~g~c~~G~CIC~~G----f---~G~dC~e~-------~Cp~~-cs~~g~~~~------g-~CiC~-~ 302 (525)
T KOG1225|consen 246 LCSTIYCPGGCTGRGQCVEGRCICPPG----F---TGDDCDEL-------VCPVD-CSGGGVCVD------G-ECICN-P 302 (525)
T ss_pred ccccccCCCCCcccceEeCCeEeCCCC----C---cCCCCCcc-------cCCcc-cCCCceecC------C-EeecC-C
Confidence 4556666666642 3467888887 3 67777643 46544 766666664 3 78888 7
Q ss_pred CcCCCCCC
Q psy6902 79 GYYGKKCQ 86 (86)
Q Consensus 79 g~~G~~Ce 86 (86)
+|+|+.|+
T Consensus 303 g~~G~dCs 310 (525)
T KOG1225|consen 303 GYSGKDCS 310 (525)
T ss_pred Cccccccc
Confidence 88887763
No 24
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.09 E-value=0.05 Score=24.02 Aligned_cols=11 Identities=45% Similarity=1.340 Sum_probs=9.5
Q ss_pred CeEEeCCCCCcC
Q psy6902 70 GYKCRCSGTGYY 81 (86)
Q Consensus 70 ~~~C~C~~~g~~ 81 (86)
+|+|.|+ +||.
T Consensus 1 sy~C~C~-~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCP-PGYQ 11 (24)
T ss_pred CEEeeCC-CCCc
Confidence 5899999 8996
No 25
>KOG1836|consensus
Probab=91.42 E-value=0.41 Score=40.41 Aligned_cols=68 Identities=29% Similarity=0.759 Sum_probs=45.1
Q ss_pred CeeCCccCCCCCCccCCcccccccccccccCCcc----cccCCCCCCcCCceEecCCCCCCCeEEe-CCCCCcCCCCCC
Q psy6902 13 CRIQDNQCICGKGCYPEYKFITQEECMTALKGKR----QNACDTNPCYHGGRCTQTFQDRTGYKCR-CSGTGYYGKKCQ 86 (86)
Q Consensus 13 C~~~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~----~~~C~~~PC~ngg~C~~~~~~~~~~~C~-C~~~g~~G~~Ce 86 (86)
|......|.|...- ++..- ..|..+|.+.. ...|.+=||.+++.|..... .....|. |+ +||+|.+|+
T Consensus 740 Cd~~tG~C~C~~~t---~G~~C-~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~-~~~~iCk~Cp-~gytG~rCe 812 (1705)
T KOG1836|consen 740 CDPRTGQCKCKHNT---FGGQC-AQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPE-ILEVVCKNCP-PGYTGLRCE 812 (1705)
T ss_pred ccCCCCceecccCC---CCCch-hhhcCCCCCccccCCCCCCccCCCCCChhhcCcCc-ccceecCCCC-CCCcccccc
Confidence 54444566655431 11111 34567777653 22399999999999998753 3468898 99 999999985
No 26
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.62 E-value=0.29 Score=23.54 Aligned_cols=18 Identities=39% Similarity=0.997 Sum_probs=13.9
Q ss_pred eEecCCCCCCCeEEeCCCCCcC
Q psy6902 60 RCTQTFQDRTGYKCRCSGTGYY 81 (86)
Q Consensus 60 ~C~~~~~~~~~~~C~C~~~g~~ 81 (86)
+|++.. ++|+|.|+ +||+
T Consensus 11 ~C~~~~---g~~~C~C~-~Gy~ 28 (36)
T PF14670_consen 11 ICVNTP---GSYRCSCP-PGYK 28 (36)
T ss_dssp EEEEET---TSEEEE-S-TTEE
T ss_pred CCccCC---CceEeECC-CCCE
Confidence 688774 48999999 8996
No 27
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=89.41 E-value=0.42 Score=24.10 Aligned_cols=22 Identities=27% Similarity=0.804 Sum_probs=16.2
Q ss_pred ceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 59 GRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 59 g~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.+|... ...|.|+ ++|+|++|+
T Consensus 11 ~~C~~~-----~G~C~C~-~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPS-----TGQCVCK-PGTTGPRCD 32 (49)
T ss_dssp SSEEET-----CEEESBS-TTEESTTS-
T ss_pred CcccCC-----CCEEecc-ccccCCcCc
Confidence 356653 5689999 899999885
No 28
>smart00051 DSL delta serrate ligand.
Probab=89.38 E-value=0.56 Score=25.31 Aligned_cols=24 Identities=21% Similarity=0.545 Sum_probs=17.1
Q ss_pred CcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902 55 CYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC 85 (86)
Q Consensus 55 C~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C 85 (86)
+..+.+|... =.+.|+ +||+|..|
T Consensus 40 ~~~~~~Cd~~------G~~~C~-~Gw~G~~C 63 (63)
T smart00051 40 FFGHYTCDEN------GNKGCL-EGWMGPYC 63 (63)
T ss_pred ccCCccCCcC------CCEecC-CCCcCCCC
Confidence 4566777432 247799 89999987
No 29
>KOG1214|consensus
Probab=89.01 E-value=0.74 Score=36.73 Aligned_cols=60 Identities=27% Similarity=0.546 Sum_probs=32.0
Q ss_pred CCCCCCccCCcccccccccccccCCcc--------cccCCCCCCcCC-------c--eEecCCCCCCCeEEeCCCCCcCC
Q psy6902 20 CICGKGCYPEYKFITQEECMTALKGKR--------QNACDTNPCYHG-------G--RCTQTFQDRTGYKCRCSGTGYYG 82 (86)
Q Consensus 20 C~c~~~c~~~~~~~~g~~C~~~~~~~~--------~~~C~~~PC~ng-------g--~C~~~~~~~~~~~C~C~~~g~~G 82 (86)
|--.+.|++.-+.|. +.|+.+|...+ .+.=-.+||..+ | .|+... .+.|+|.|. +||.|
T Consensus 744 CGp~s~Cin~pg~~r-ceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG--gs~y~C~CL-PGfsG 819 (1289)
T KOG1214|consen 744 CGPNSVCINLPGSYR-CECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG--GSTYSCACL-PGFSG 819 (1289)
T ss_pred CCCCceeecCCCcee-EEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC--CceEEEeec-CCccC
Confidence 333345555554555 56665543221 122223455544 3 344443 247999999 99998
Q ss_pred C
Q psy6902 83 K 83 (86)
Q Consensus 83 ~ 83 (86)
.
T Consensus 820 D 820 (1289)
T KOG1214|consen 820 D 820 (1289)
T ss_pred C
Confidence 4
No 30
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=85.37 E-value=0.81 Score=22.25 Aligned_cols=29 Identities=28% Similarity=0.630 Sum_probs=19.2
Q ss_pred CCCCCCcCCceEecCCCCCCCeEEeCCCCCcC
Q psy6902 50 CDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYY 81 (86)
Q Consensus 50 C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~ 81 (86)
|...+|..++.|+...+ +.+.|+|. .||.
T Consensus 2 C~~~~cP~NA~C~~~~d--G~eecrCl-lgyk 30 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDD--GSEECRCL-LGYK 30 (37)
T ss_dssp -SSS---TTEEEEEETT--SEEEEEE--TTEE
T ss_pred ccCccCCCCcccEEcCC--CCEEEEee-CCcc
Confidence 55667888899998752 57999999 9996
No 31
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.99 E-value=1.5 Score=28.70 Aligned_cols=30 Identities=33% Similarity=0.699 Sum_probs=22.4
Q ss_pred cccCCCC--CCcCCceEecCCCCCCCeEEeCCCCCcCC
Q psy6902 47 QNACDTN--PCYHGGRCTQTFQDRTGYKCRCSGTGYYG 82 (86)
Q Consensus 47 ~~~C~~~--PC~ngg~C~~~~~~~~~~~C~C~~~g~~G 82 (86)
.++|... +|. ..|.+.. ++|.|.|+ .||+.
T Consensus 187 ~~~C~~~~~~c~--~~C~~~~---g~~~c~c~-~g~~~ 218 (224)
T cd01475 187 PDLCATLSHVCQ--QVCISTP---GSYLCACT-EGYAL 218 (224)
T ss_pred chhhcCCCCCcc--ceEEcCC---CCEEeECC-CCccC
Confidence 5677543 465 4799875 48999999 89975
No 32
>KOG1226|consensus
Probab=82.23 E-value=2.3 Score=33.38 Aligned_cols=33 Identities=27% Similarity=0.913 Sum_probs=21.0
Q ss_pred cccCCCC---CCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDTN---PCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~~---PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.+.|.+. -|...|+|.-. +|.|..++|.|..||
T Consensus 585 td~C~~~~G~iCSGrG~C~Cg-------~C~C~~~~~sG~~CE 620 (783)
T KOG1226|consen 585 TDTCESSDGQICSGRGTCECG-------RCKCTDPPYSGEFCE 620 (783)
T ss_pred CccccCCCCceeCCCceeeCC-------ceEcCCCCcCcchhh
Confidence 6777653 25555555532 578885559998886
No 33
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=81.94 E-value=1.8 Score=21.88 Aligned_cols=15 Identities=27% Similarity=0.935 Sum_probs=10.8
Q ss_pred eEEeCCCCCcCCCCCC
Q psy6902 71 YKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 71 ~~C~C~~~g~~G~~Ce 86 (86)
-+|.|+ ++++|..|+
T Consensus 19 G~C~C~-~~~~G~~C~ 33 (50)
T cd00055 19 GQCECK-PNTTGRRCD 33 (50)
T ss_pred CEEeCC-CcCCCCCCC
Confidence 467787 788887764
No 34
>KOG0994|consensus
Probab=77.19 E-value=5.9 Score=33.03 Aligned_cols=48 Identities=27% Similarity=0.758 Sum_probs=31.8
Q ss_pred ccccccCCcc----cccCCCCCCcCCc--------eEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 37 ECMTALKGKR----QNACDTNPCYHGG--------RCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 37 ~C~~~~~~~~----~~~C~~~PC~ngg--------~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
+|..+|.|.. -..|.+=||..+. .|..... .....|+|. .||+|.+|+
T Consensus 889 rCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~-t~~ivC~C~-~GY~G~RCe 948 (1758)
T KOG0994|consen 889 RCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTR-TQQIVCHCQ-EGYSGSRCE 948 (1758)
T ss_pred hhhccccCCcccCCCCCCCCCCCCCCCccchhcccccccccc-ccceeeecc-cCccccchh
Confidence 4556776643 3567777877653 3554322 235789999 899999985
No 35
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=75.64 E-value=2.6 Score=20.99 Aligned_cols=15 Identities=27% Similarity=0.953 Sum_probs=10.4
Q ss_pred eEEeCCCCCcCCCCCC
Q psy6902 71 YKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 71 ~~C~C~~~g~~G~~Ce 86 (86)
-.|.|+ ++++|++|+
T Consensus 18 G~C~C~-~~~~G~~C~ 32 (46)
T smart00180 18 GQCECK-PNVTGRRCD 32 (46)
T ss_pred CEEECC-CCCCCCCCC
Confidence 367777 777777764
No 36
>KOG4260|consensus
Probab=75.56 E-value=4.5 Score=28.39 Aligned_cols=33 Identities=36% Similarity=0.907 Sum_probs=25.7
Q ss_pred CCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902 52 TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC 85 (86)
Q Consensus 52 ~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C 85 (86)
..||...|.|.-.....++-.|.|. +||+|+.|
T Consensus 149 er~C~GnG~C~GdGsR~GsGkCkC~-~GY~Gp~C 181 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGSGKCKCE-TGYTGPLC 181 (350)
T ss_pred cCCcCCCCcccCCCCCCCCCccccc-CCCCCccc
Confidence 4578888999865433456789999 99999887
No 37
>KOG1226|consensus
Probab=73.10 E-value=5.8 Score=31.25 Aligned_cols=40 Identities=28% Similarity=0.755 Sum_probs=26.6
Q ss_pred cccccccccCCcccccCCCC---CCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 34 TQEECMTALKGKRQNACDTN---PCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 34 ~g~~C~~~~~~~~~~~C~~~---PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.|..|+-+ .-.|... -|...|+|.=. .|.|. +||+|..|+
T Consensus 538 ~G~fCECD-----nfsC~r~~g~lC~g~G~C~CG-------~CvC~-~GwtG~~C~ 580 (783)
T KOG1226|consen 538 YGKFCECD-----NFSCERHKGVLCGGHGRCECG-------RCVCN-PGWTGSACN 580 (783)
T ss_pred eeeeeecc-----CcccccccCcccCCCCeEeCC-------cEEcC-CCCccCCCC
Confidence 45666544 2234432 47777887643 69999 899999885
No 38
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=71.38 E-value=1.1 Score=29.50 Aligned_cols=64 Identities=25% Similarity=0.641 Sum_probs=37.8
Q ss_pred CCCCCC-CeeCC--ccCCCCCCccCCcccccccccccccCCcccccCCC-----CCCcCCceEecCCC--CCCCeEEeCC
Q psy6902 7 AHVRPG-CRIQD--NQCICGKGCYPEYKFITQEECMTALKGKRQNACDT-----NPCYHGGRCTQTFQ--DRTGYKCRCS 76 (86)
Q Consensus 7 ~~c~~g-C~~~~--~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~-----~PC~ngg~C~~~~~--~~~~~~C~C~ 76 (86)
..|.|| -..++ |-|.|..+++- -+-..|+.. .+|.. .||-.-+.|+.... ....|.|.|.
T Consensus 6 T~CKNG~LiQMSNHfEC~Cnegfvl----~~EntCE~k------v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~ 75 (197)
T PF06247_consen 6 TICKNGYLIQMSNHFECKCNEGFVL----KNENTCEEK------VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCI 75 (197)
T ss_dssp ---BTEEEEEESSEEEEEESTTEEE----EETTEEEE----------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-
T ss_pred ccccCCEEEEccCceEEEcCCCcEE----ccccccccc------eecCcccccCccccchhhhhcCCCcccceeEEEecc
Confidence 467777 43333 88999998432 233456654 34533 58999999997642 2457999999
Q ss_pred CCCcC
Q psy6902 77 GTGYY 81 (86)
Q Consensus 77 ~~g~~ 81 (86)
.||+
T Consensus 76 -~gY~ 79 (197)
T PF06247_consen 76 -NGYI 79 (197)
T ss_dssp -TTEE
T ss_pred -cCce
Confidence 8996
No 39
>KOG4260|consensus
Probab=68.16 E-value=3.8 Score=28.74 Aligned_cols=33 Identities=30% Similarity=0.638 Sum_probs=26.8
Q ss_pred ccccCC--CCCCcCCceEecCCCCCCCeEEeCCCCCcCC
Q psy6902 46 RQNACD--TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYG 82 (86)
Q Consensus 46 ~~~~C~--~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G 82 (86)
+||+|. ++||...-.|++.. ++|+|.+. +||.+
T Consensus 235 DvnEC~~ep~~c~~~qfCvNte---GSf~C~dk-~Gy~~ 269 (350)
T KOG4260|consen 235 DVNECQNEPAPCKAHQFCVNTE---GSFKCEDK-EGYKK 269 (350)
T ss_pred cHHHHhcCCCCCChhheeecCC---CceEeccc-ccccC
Confidence 378885 56899999999985 49999998 78865
No 40
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=64.96 E-value=10 Score=22.06 Aligned_cols=32 Identities=31% Similarity=0.717 Sum_probs=24.2
Q ss_pred cccCCC-CCCcCCceEecCCCCCCCeEEeCCCCCcCCC
Q psy6902 47 QNACDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGK 83 (86)
Q Consensus 47 ~~~C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~ 83 (86)
.|.|.. ..|-..|.|.... .-.|.|+ +||.-+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~~----~~~C~Cl-~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSNN----SPKCSCL-PGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCCCC----CCceECC-CCcCCC
Confidence 567864 7899999996532 4579999 899754
No 41
>KOG3509|consensus
Probab=64.17 E-value=6.1 Score=31.97 Aligned_cols=36 Identities=42% Similarity=1.046 Sum_probs=30.2
Q ss_pred cccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
.++|...|+.+.+.|.+... ...|.|+ ++|+|..|+
T Consensus 406 g~~c~~~p~~~~g~c~p~~~---~~~c~c~-~g~~G~~c~ 441 (964)
T KOG3509|consen 406 GDVCWRIPCQHDGPCLQTLE---GKQCLCP-PGYTGDSCE 441 (964)
T ss_pred CCccccccCCCCcccccccc---ccceecc-ccccCchhh
Confidence 46788899999999988753 7899999 899998764
No 42
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=63.90 E-value=10 Score=22.52 Aligned_cols=26 Identities=27% Similarity=0.819 Sum_probs=18.1
Q ss_pred CCCCcCCceEecCCCCC--CCeEEeCCCC
Q psy6902 52 TNPCYHGGRCTQTFQDR--TGYKCRCSGT 78 (86)
Q Consensus 52 ~~PC~ngg~C~~~~~~~--~~~~C~C~~~ 78 (86)
.+-|...|.|+...... .=|.|.|. +
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~-~ 39 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCK-P 39 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEee-c
Confidence 35688889999864311 24999998 5
No 43
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=63.22 E-value=2.6 Score=22.72 Aligned_cols=11 Identities=36% Similarity=1.047 Sum_probs=8.4
Q ss_pred eCCCCCcCCCCC
Q psy6902 74 RCSGTGYYGKKC 85 (86)
Q Consensus 74 ~C~~~g~~G~~C 85 (86)
.|. +||+|+.|
T Consensus 53 ~C~-~Gw~G~~C 63 (63)
T PF01414_consen 53 VCL-PGWTGPNC 63 (63)
T ss_dssp EE--TTEESTTS
T ss_pred CCC-CCCcCCCC
Confidence 467 89999987
No 44
>KOG0994|consensus
Probab=59.42 E-value=16 Score=30.70 Aligned_cols=30 Identities=27% Similarity=0.511 Sum_probs=18.0
Q ss_pred CeeCCccCCCCCCccCCcccccccccccccCCcc
Q psy6902 13 CRIQDNQCICGKGCYPEYKFITQEECMTALKGKR 46 (86)
Q Consensus 13 C~~~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~ 46 (86)
|.+....|.|.+| |++.+-..|+.-|+|..
T Consensus 1079 CN~ftGQCqCkpG----fGGR~C~qCqel~WGdP 1108 (1758)
T KOG0994|consen 1079 CNEFTGQCQCKPG----FGGRTCSQCQELYWGDP 1108 (1758)
T ss_pred ccccccceeccCC----CCCcchhHHHHhhcCCC
Confidence 5555577888887 43334234666677654
No 45
>KOG3514|consensus
Probab=52.96 E-value=13 Score=30.87 Aligned_cols=37 Identities=35% Similarity=0.935 Sum_probs=30.2
Q ss_pred cccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902 47 QNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ 86 (86)
Q Consensus 47 ~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce 86 (86)
+..|..+-|.|.|.|++... +|.|.|.-..|+|+.|.
T Consensus 1018 st~c~~~acanhG~c~q~w~---~~~c~csmtS~~Gp~C~ 1054 (1591)
T KOG3514|consen 1018 STTCSEDACANHGVCIQQWN---GIACDCSMTSYSGPRCN 1054 (1591)
T ss_pred Ccccchhhhhccceeeeeec---ceeeeccccccCCCccC
Confidence 45688888999999999864 89999975678888773
No 46
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=50.09 E-value=15 Score=17.44 Aligned_cols=24 Identities=25% Similarity=0.753 Sum_probs=13.7
Q ss_pred CCCCCCCCCCee-CCccCCCCCCcc
Q psy6902 4 CDPAHVRPGCRI-QDNQCICGKGCY 27 (86)
Q Consensus 4 C~~~~c~~gC~~-~~~~C~c~~~c~ 27 (86)
|+...|...|.. ....|.||.|++
T Consensus 3 Cn~t~CpA~CDpn~~~~C~CPeGyI 27 (34)
T PF09064_consen 3 CNQTECPADCDPNSPGQCFCPEGYI 27 (34)
T ss_pred cccccCCCccCCCCCCceeCCCceE
Confidence 344445545554 336788888743
No 47
>PF12020 TAFA: TAFA family; InterPro: IPR020350 This entry contains the protein FAM19A2 (also known as TAFA2); it is a secreted, 11 kDa member of the FAM19/TAFA family of chemokine-like proteins [], which are composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The FAM19/TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines that act as regulators of immune and nervous cells [].
Probab=41.24 E-value=21 Score=20.60 Aligned_cols=25 Identities=28% Similarity=0.828 Sum_probs=19.3
Q ss_pred cCCCCCCcCCceEecCCCCCCCeEEe
Q psy6902 49 ACDTNPCYHGGRCTQTFQDRTGYKCR 74 (86)
Q Consensus 49 ~C~~~PC~ngg~C~~~~~~~~~~~C~ 74 (86)
-|...||+.|..|....+ ..+++|.
T Consensus 56 WC~MePCl~gEeC~~Lpd-~sGWsCs 80 (90)
T PF12020_consen 56 WCQMEPCLEGEECKVLPD-LSGWSCS 80 (90)
T ss_pred eeecccccCCcceEEecc-CCCceec
Confidence 488999999999987654 3467765
No 48
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=40.00 E-value=42 Score=16.68 Aligned_cols=20 Identities=35% Similarity=1.104 Sum_probs=14.5
Q ss_pred CCcCCceEecCCCCCCCeEEeCCCCCcC
Q psy6902 54 PCYHGGRCTQTFQDRTGYKCRCSGTGYY 81 (86)
Q Consensus 54 PC~ngg~C~~~~~~~~~~~C~C~~~g~~ 81 (86)
.|..+..|++. +|.|+ .||.
T Consensus 27 qC~~~s~C~~g-------~C~C~-~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNG-------RCQCP-PGYV 46 (52)
T ss_pred CCCCcCEEcCC-------EeECC-CCCE
Confidence 46677778543 79999 7885
No 49
>KOG1836|consensus
Probab=38.32 E-value=82 Score=27.54 Aligned_cols=35 Identities=29% Similarity=0.638 Sum_probs=22.3
Q ss_pred CCCCCCCCCC--Cee----CCccCC-CCCCccCCcccccccccc---cccCC
Q psy6902 3 ACDPAHVRPG--CRI----QDNQCI-CGKGCYPEYKFITQEECM---TALKG 44 (86)
Q Consensus 3 ~C~~~~c~~g--C~~----~~~~C~-c~~~c~~~~~~~~g~~C~---~~~~~ 44 (86)
.|.+-||-++ |.. ....|. |+.+ |+|.+|+ .+|++
T Consensus 776 dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~g-------ytG~rCe~c~dgyfg 820 (1705)
T KOG1836|consen 776 DCQPCPCPNGGACGQTPEILEVVCKNCPPG-------YTGLRCEECADGYFG 820 (1705)
T ss_pred CCccCCCCCChhhcCcCcccceecCCCCCC-------CcccccccCCCcccc
Confidence 3677777666 533 337787 8888 3777765 45554
No 50
>KOG3512|consensus
Probab=28.62 E-value=1.2e+02 Score=23.17 Aligned_cols=68 Identities=28% Similarity=0.503 Sum_probs=35.8
Q ss_pred CCCCCCCCC-C--CeeCCccCCCCCCccCCccccccc---ccccccCCcc--cccCCCCC------CcCCceEecCCCCC
Q psy6902 3 ACDPAHVRP-G--CRIQDNQCICGKGCYPEYKFITQE---ECMTALKGKR--QNACDTNP------CYHGGRCTQTFQDR 68 (86)
Q Consensus 3 ~C~~~~c~~-g--C~~~~~~C~c~~~c~~~~~~~~g~---~C~~~~~~~~--~~~C~~~P------C~ngg~C~~~~~~~ 68 (86)
+|+.+|... | |.....+|.|..| . +|. +|..+|+-.. +-+|..-| +.++++ +
T Consensus 396 ~CdChpVGs~gktCNq~tGqCpCkeG----v---tG~tCnrCa~gyqqsrs~vapcik~p~~~~~~~~s~ve-------~ 461 (592)
T KOG3512|consen 396 ACDCHPVGSAGKTCNQTTGQCPCKEG----V---TGLTCNRCAPGYQQSRSPVAPCIKIPTDAPTLGSSGVE-------P 461 (592)
T ss_pred hcCCcccccccccccccCCcccCCCC----C---cccccccccchhhcccCCCcCceecCCCCccccCCCCc-------c
Confidence 345555532 2 6555678888887 3 444 3555654332 34443322 333333 2
Q ss_pred CCeEEeCCCCCcCCCCC
Q psy6902 69 TGYKCRCSGTGYYGKKC 85 (86)
Q Consensus 69 ~~~~C~C~~~g~~G~~C 85 (86)
..+-+.|+ .++.|.++
T Consensus 462 qd~~s~Ck-~~~~~~r~ 477 (592)
T KOG3512|consen 462 QDQCSKCK-ASPGGKRL 477 (592)
T ss_pred hhccccCC-CCCcceec
Confidence 24566788 77777654
Done!