Query         psy7854
Match_columns 153
No_of_seqs    171 out of 1527
Neff          9.6 
Searched_HMMs 46136
Date          Fri Aug 16 20:31:54 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7854.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7854hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.3 5.1E-12 1.1E-16  102.6   6.8  122   25-151   723-859 (1289)
  2 KOG1219|consensus               99.2 6.6E-11 1.4E-15  103.7   6.8  108   31-149  3858-3972(4289)
  3 PF07645 EGF_CA:  Calcium-bindi  98.8 2.7E-09 5.9E-14   56.8   2.8   33  117-149     1-34  (42)
  4 KOG1214|consensus               98.8 2.4E-08 5.2E-13   81.8   7.8   93   56-151   717-820 (1289)
  5 PF07645 EGF_CA:  Calcium-bindi  98.8   7E-09 1.5E-13   55.2   2.7   38   75-112     1-39  (42)
  6 KOG4289|consensus               98.7 2.2E-08 4.7E-13   85.8   5.6   77   63-147  1225-1308(2531)
  7 KOG1219|consensus               98.6 8.2E-08 1.8E-12   85.3   5.9   73   72-152  3859-3936(4289)
  8 PF12662 cEGF:  Complement Clr-  98.5 8.6E-08 1.9E-12   44.3   2.3   24   97-120     1-24  (24)
  9 KOG4260|consensus               98.4 1.8E-07 3.8E-12   67.9   2.0   72   70-149   230-304 (350)
 10 KOG4289|consensus               98.1 3.4E-06 7.3E-11   73.0   4.2   53   92-150  1216-1270(2531)
 11 PF14670 FXa_inhibition:  Coagu  98.0 3.3E-06 7.1E-11   43.1   2.0   31   84-115     6-36  (36)
 12 smart00179 EGF_CA Calcium-bind  97.9 1.6E-05 3.5E-10   40.9   3.7   32  117-149     1-33  (39)
 13 KOG4260|consensus               97.9 5.2E-06 1.1E-10   60.5   2.3   52   96-149   215-268 (350)
 14 PF00008 EGF:  EGF-like domain   97.8   1E-05 2.2E-10   40.2   1.5   29  124-152     2-32  (32)
 15 PF12947 EGF_3:  EGF domain;  I  97.8 1.1E-05 2.4E-10   41.2   1.6   32   82-115     4-36  (36)
 16 PF12947 EGF_3:  EGF domain;  I  97.7 1.9E-05   4E-10   40.3   1.0   30  122-151     2-32  (36)
 17 KOG1217|consensus               97.6 0.00031 6.7E-09   55.3   7.7   61   89-151   243-304 (487)
 18 smart00179 EGF_CA Calcium-bind  97.6 0.00015 3.2E-09   37.2   3.6   31   76-107     2-33  (39)
 19 cd00054 EGF_CA Calcium-binding  97.5 0.00017 3.8E-09   36.5   3.6   33  118-151     2-35  (38)
 20 PF14670 FXa_inhibition:  Coagu  97.5 6.6E-05 1.4E-09   38.3   1.4   25  125-150     5-29  (36)
 21 cd01475 vWA_Matrilin VWA_Matri  97.3 0.00025 5.4E-09   51.1   3.8   40  111-151   180-219 (224)
 22 cd01475 vWA_Matrilin VWA_Matri  97.3 0.00031 6.8E-09   50.6   3.8   42   70-112   181-222 (224)
 23 KOG1217|consensus               97.1  0.0036 7.7E-08   49.3   8.1  111   32-150   267-388 (487)
 24 smart00181 EGF Epidermal growt  97.0  0.0012 2.6E-08   33.0   3.4   25  126-150     6-30  (35)
 25 cd00053 EGF Epidermal growth f  97.0  0.0015 3.3E-08   32.4   3.5   27  125-151     5-32  (36)
 26 cd00054 EGF_CA Calcium-binding  96.9   0.002 4.3E-08   32.4   3.6   32   76-108     2-34  (38)
 27 PF00008 EGF:  EGF-like domain   96.8 0.00098 2.1E-08   33.0   1.9   25   84-108     4-30  (32)
 28 smart00181 EGF Epidermal growt  96.4  0.0069 1.5E-07   30.1   3.5   25   84-108     6-30  (35)
 29 cd00053 EGF Epidermal growth f  96.0   0.015 3.2E-07   28.6   3.5   25   84-108     6-31  (36)
 30 PF06247 Plasmod_Pvs28:  Plasmo  95.1   0.007 1.5E-07   42.1   0.4   66   84-150     6-80  (197)
 31 KOG1225|consensus               92.5    0.44 9.5E-06   38.7   6.1   69   61-149   266-336 (525)
 32 PF12946 EGF_MSP1_1:  MSP1 EGF   92.3   0.097 2.1E-06   26.6   1.4   25  125-149     4-30  (37)
 33 PF12661 hEGF:  Human growth fa  90.9    0.12 2.5E-06   20.1   0.7   10   99-108     1-10  (13)
 34 PF07974 EGF_2:  EGF-like domai  88.5     1.1 2.4E-05   22.0   3.3   24  126-151     6-30  (32)
 35 PHA03099 epidermal growth fact  86.7    0.77 1.7E-05   30.1   2.6   35  118-152    42-79  (139)
 36 PHA03099 epidermal growth fact  84.6     1.1 2.3E-05   29.5   2.5   32   84-119    51-84  (139)
 37 PF06247 Plasmod_Pvs28:  Plasmo  83.1    0.47   1E-05   33.2   0.5   88   53-150    68-161 (197)
 38 KOG1225|consensus               82.3     3.2 6.8E-05   34.0   4.9   61   64-149   300-362 (525)
 39 PHA02887 EGF-like protein; Pro  80.8     1.6 3.4E-05   28.2   2.2   27  126-152    92-120 (126)
 40 PHA02887 EGF-like protein; Pro  80.3     1.9 4.1E-05   27.9   2.4   31   84-118    92-124 (126)
 41 PF09064 Tme5_EGF_like:  Thromb  79.5     1.5 3.2E-05   21.8   1.4   25  125-151     5-29  (34)
 42 PF00954 S_locus_glycop:  S-loc  78.2       3 6.5E-05   26.4   3.0   33  117-151    76-109 (110)
 43 KOG1215|consensus               71.5       7 0.00015   34.0   4.4   71   76-150   325-396 (877)
 44 KOG0994|consensus               67.1      12 0.00027   33.7   4.8   59   90-150   831-896 (1758)
 45 KOG1226|consensus               64.4      50  0.0011   28.5   7.7   48   99-150   526-576 (783)
 46 KOG0994|consensus               58.9      40 0.00086   30.8   6.4   62   90-152   877-946 (1758)
 47 KOG3516|consensus               56.7      10 0.00022   34.0   2.6   39   72-116   541-581 (1306)
 48 PF01683 EB:  EB module;  Inter  56.1      23  0.0005   18.9   3.2   29  112-150    19-47  (52)
 49 KOG1226|consensus               51.6      53  0.0011   28.3   5.8   21   99-123   567-590 (783)
 50 KOG1836|consensus               50.5      25 0.00053   33.3   4.1   58   91-153   749-811 (1705)
 51 KOG3516|consensus               46.0      18 0.00039   32.6   2.5   36  113-150   540-577 (1306)
 52 KOG0196|consensus               45.0      23  0.0005   30.8   2.9   51   98-148   259-316 (996)
 53 smart00051 DSL delta serrate l  41.1      69  0.0015   18.2   3.7   45   97-151    16-61  (63)
 54 PF07359 LEAP-2:  Liver-express  35.8      26 0.00056   20.8   1.4   25  107-133    46-70  (77)
 55 PF12955 DUF3844:  Domain of un  29.6      34 0.00073   21.7   1.2   30  119-148     6-41  (103)
 56 KOG3512|consensus               28.8 1.6E+02  0.0035   24.2   5.0   15   95-109   369-383 (592)
 57 PF01826 TIL:  Trypsin Inhibito  23.0      28 0.00061   18.8   0.0   19  100-119    35-53  (55)
 58 KOG3509|consensus               21.8 2.7E+02  0.0058   25.1   5.5   60   84-147   412-473 (964)

No 1  
>KOG1214|consensus
Probab=99.30  E-value=5.1e-12  Score=102.59  Aligned_cols=122  Identities=25%  Similarity=0.363  Sum_probs=89.1

Q ss_pred             eecccCCCCcccCcCCCCCCCeeeCCCcee----cccCCCCccCCCCCCCCcCCc------ccCCCCCCCCCC-CC--ce
Q psy7854          25 RLAVALRRSISIDEFTVHSSTLIALSQESQ----DLERLKFGLEPPTSRSEVHGA------NHHTTENPGICT-NG--AC   91 (153)
Q Consensus        25 ~~~~~~~~c~~i~~c~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~C~~------~~~C~~~~~~C~-~~--~C   91 (153)
                      .+..+.+.|.|++||+.-...|..+..|.+    +.|.|..+|..... +.+|+.      ++.|..+.+.|. ++  .|
T Consensus       723 g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd-~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c  801 (1289)
T KOG1214|consen  723 GYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADD-RHTCVLITPPAPANPCEDGSHTCAIAGQARC  801 (1289)
T ss_pred             ccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccC-CcceEEecCCCCCCccccCccccCcCCceEE
Confidence            344578999999999887766666666643    33444445543332 345643      455666556673 33  66


Q ss_pred             eecC-CCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854          92 ENLM-GTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus        92 ~~~~-g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      .... ++|.|.|.+||.+  +|..|.++|||  .++.|+. ++|.|++|+|.|.|.+||-++
T Consensus       802 ~~hGgs~y~C~CLPGfsG--DG~~c~dvDeC--~psrChp~A~CyntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  802 VHHGGSTYSCACLPGFSG--DGHQCTDVDEC--SPSRCHPAATCYNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             EecCCceEEEeecCCccC--Ccccccccccc--CccccCCCceEecCCCcceeecccCccCC
Confidence            6554 5799999999988  89999999999  6999985 999999999999999999754


No 2  
>KOG1219|consensus
Probab=99.16  E-value=6.6e-11  Score=103.72  Aligned_cols=108  Identities=22%  Similarity=0.365  Sum_probs=82.9

Q ss_pred             CCCccc-CcCCCCCCCeeeCCCceec-ccCCCCccCCCCCCCCcC-CcccCCCCCCCCCCCC-ceeecCCCceeeCCCCc
Q psy7854          31 RRSISI-DEFTVHSSTLIALSQESQD-LERLKFGLEPPTSRSEVH-GANHHTTENPGICTNG-ACENLMGTYRCICNPGY  106 (153)
Q Consensus        31 ~~c~~i-~~c~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~C-~~~~~C~~~~~~C~~~-~C~~~~g~~~C~C~~g~  106 (153)
                      .-|.-. +.|..+   .|++++.+.. ...-..+.|++.|.++.| .+++.|..  +||..| +|....+.|.|.|+.||
T Consensus      3858 pgC~l~~d~C~~n---pCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~s--nPC~~GgtCip~~n~f~CnC~~gy 3932 (4289)
T KOG1219|consen 3858 PGCSLLTDPCNDN---PCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCAS--NPCLTGGTCIPFYNGFLCNCPNGY 3932 (4289)
T ss_pred             ccccccccccccC---cccCCCEecCCCCCceEEeCcccccCcccccccccccC--CCCCCCCEEEecCCCeeEeCCCCc
Confidence            334433 455444   4555554332 222334457778889999 57899988  899754 99999999999999999


Q ss_pred             ccCCCCCCCc--cCccccCCCCCCCC-CeeeecCCCcEEecCCCCC
Q psy7854         107 EGDPTGKLCS--DQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSP  149 (153)
Q Consensus       107 ~~~~~g~~C~--~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~  149 (153)
                          .|.+|+  .|+||  ..+.|.+ +.|+|.+|+|.|.|.+||.
T Consensus      3933 ----TG~~Ce~~Gi~eC--s~n~C~~gg~C~n~~gsf~CncT~g~~ 3972 (4289)
T KOG1219|consen 3933 ----TGKRCEARGISEC--SKNVCGTGGQCINIPGSFHCNCTPGIL 3972 (4289)
T ss_pred             ----cCceeeccccccc--ccccccCCceeeccCCceEeccChhHh
Confidence                888995  48999  7899997 9999999999999999985


No 3  
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.84  E-value=2.7e-09  Score=56.78  Aligned_cols=33  Identities=33%  Similarity=0.601  Sum_probs=30.5

Q ss_pred             cCccccCCCCCCC-CCeeeecCCCcEEecCCCCC
Q psy7854         117 DQNECLMDVMICS-GGTCKNTPGSFQSNFQDSSP  149 (153)
Q Consensus       117 ~i~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~  149 (153)
                      |||||....+.|. ++.|+|+.|+|.|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            6899998889998 49999999999999999996


No 4  
>KOG1214|consensus
Probab=98.78  E-value=2.4e-08  Score=81.80  Aligned_cols=93  Identities=20%  Similarity=0.361  Sum_probs=75.4

Q ss_pred             ccCCCCccCCCCCCCCcCCcccCCCCCCCCC-CCCceeecCCCceeeCCCCcccCCCCCCCc------cCccccCCCCCC
Q psy7854          56 LERLKFGLEPPTSRSEVHGANHHTTENPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLCS------DQNECLMDVMIC  128 (153)
Q Consensus        56 ~~~~~~~~~~~~~~~~~C~~~~~C~~~~~~C-~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~------~i~~C~~~~~~C  128 (153)
                      .|.|..++...   ++.|.|+++|+...+.| ++..|++.+++|+|.|..||....++.+|.      .++.|..+.+.|
T Consensus       717 tcecs~g~~gd---gr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C  793 (1289)
T KOG1214|consen  717 TCECSSGYQGD---GRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTC  793 (1289)
T ss_pred             EEEEeeccCCC---CCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCcccc
Confidence            44555555433   67899999999988889 588999999999999999999988888885      468899888999


Q ss_pred             C-C--Ceeeec-CCCcEEecCCCCCcc
Q psy7854         129 S-G--GTCKNT-PGSFQSNFQDSSPIL  151 (153)
Q Consensus       129 ~-~--~~C~~~-~g~~~C~C~~g~~~~  151 (153)
                      . +  ++|+.. .+.|.|.|-|||.++
T Consensus       794 ~i~g~a~c~~hGgs~y~C~CLPGfsGD  820 (1289)
T KOG1214|consen  794 AIAGQARCVHHGGSTYSCACLPGFSGD  820 (1289)
T ss_pred             CcCCceEEEecCCceEEEeecCCccCC
Confidence            7 4  567665 456999999999865


No 5  
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.75  E-value=7e-09  Score=55.17  Aligned_cols=38  Identities=42%  Similarity=0.905  Sum_probs=32.3

Q ss_pred             cccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCC
Q psy7854          75 ANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTG  112 (153)
Q Consensus        75 ~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g  112 (153)
                      |||||....+.|. ++.|+|+.|+|.|.|++||.....+
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~   39 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDG   39 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTS
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCC
Confidence            6899998888895 6799999999999999999754443


No 6  
>KOG4289|consensus
Probab=98.71  E-value=2.2e-08  Score=85.80  Aligned_cols=77  Identities=31%  Similarity=0.669  Sum_probs=65.8

Q ss_pred             cCCCCCCCCcC-CcccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCCCCCc-c--CccccCCCCCCCC-Ceeeec
Q psy7854          63 LEPPTSRSEVH-GANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTGKLCS-D--QNECLMDVMICSG-GTCKNT  136 (153)
Q Consensus        63 ~~~~~~~~~~C-~~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~-~--i~~C~~~~~~C~~-~~C~~~  136 (153)
                      -||++|.+..| ..+|.|-.  ++|. ++.|...+|+|+|.|.+||    .|..|+ +  .-.|  .++.|.| ++|++.
T Consensus      1225 rCPpGFTgd~CeTeiDlCYs--~pC~nng~C~srEggYtCeCrpg~----tGehCEvs~~agrC--vpGvC~nggtC~~~ 1296 (2531)
T KOG4289|consen 1225 RCPPGFTGDYCETEIDLCYS--GPCGNNGRCRSREGGYTCECRPGF----TGEHCEVSARAGRC--VPGVCKNGGTCVNL 1296 (2531)
T ss_pred             eCCCCCCcccccchhHhhhc--CCCCCCCceEEecCceeEEecCCc----cccceeeecccCcc--ccceecCCCEEeec
Confidence            47888999999 57999987  8995 7799999999999999999    778886 2  3457  7899997 899985


Q ss_pred             -CCCcEEecCCC
Q psy7854         137 -PGSFQSNFQDS  147 (153)
Q Consensus       137 -~g~~~C~C~~g  147 (153)
                       .|+|.|.|+.|
T Consensus      1297 ~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYG 1308 (2531)
T ss_pred             CCCceeccCCCc
Confidence             68899999987


No 7  
>KOG1219|consensus
Probab=98.59  E-value=8.2e-08  Score=85.26  Aligned_cols=73  Identities=25%  Similarity=0.523  Sum_probs=62.6

Q ss_pred             cCCcc-cCCCCCCCCCCCC-ceeecC-CCceeeCCCCcccCCCCCCCc-cCccccCCCCCCCC-CeeeecCCCcEEecCC
Q psy7854          72 VHGAN-HHTTENPGICTNG-ACENLM-GTYRCICNPGYEGDPTGKLCS-DQNECLMDVMICSG-GTCKNTPGSFQSNFQD  146 (153)
Q Consensus        72 ~C~~~-~~C~~~~~~C~~~-~C~~~~-g~~~C~C~~g~~~~~~g~~C~-~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~  146 (153)
                      .|... +.|..  .||+|+ .|...+ ++|.|.|++.|    .|+.|+ +++.|  ..++|.. ++|+...++|.|.|+.
T Consensus      3859 gC~l~~d~C~~--npCqhgG~C~~~~~ggy~CkCpsqy----sG~~CEi~~epC--~snPC~~GgtCip~~n~f~CnC~~ 3930 (4289)
T KOG1219|consen 3859 GCSLLTDPCND--NPCQHGGTCISQPKGGYKCKCPSQY----SGNHCEIDLEPC--ASNPCLTGGTCIPFYNGFLCNCPN 3930 (4289)
T ss_pred             ccccccccccc--CcccCCCEecCCCCCceEEeCcccc----cCcccccccccc--cCCCCCCCCEEEecCCCeeEeCCC
Confidence            45433 78877  899876 998775 78999999999    889995 89999  7899986 8999999999999999


Q ss_pred             CCCccc
Q psy7854         147 SSPILK  152 (153)
Q Consensus       147 g~~~~~  152 (153)
                      ||.++|
T Consensus      3931 gyTG~~ 3936 (4289)
T KOG1219|consen 3931 GYTGKR 3936 (4289)
T ss_pred             CccCce
Confidence            999875


No 8  
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=98.52  E-value=8.6e-08  Score=44.27  Aligned_cols=24  Identities=46%  Similarity=1.169  Sum_probs=22.5

Q ss_pred             CceeeCCCCcccCCCCCCCccCcc
Q psy7854          97 TYRCICNPGYEGDPTGKLCSDQNE  120 (153)
Q Consensus        97 ~~~C~C~~g~~~~~~g~~C~~i~~  120 (153)
                      +|.|.|++||.+.++++.|+||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            689999999999999999999987


No 9  
>KOG4260|consensus
Probab=98.37  E-value=1.8e-07  Score=67.94  Aligned_cols=72  Identities=21%  Similarity=0.461  Sum_probs=56.4

Q ss_pred             CCcCCcccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCC--CCeeeecCCCcEEecCC
Q psy7854          70 SEVHGANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICS--GGTCKNTPGSFQSNFQD  146 (153)
Q Consensus        70 ~~~C~~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~--~~~C~~~~g~~~C~C~~  146 (153)
                      ...|.|||||...+.+|. +..|+|+.|+|.|.+++||..        ++|+|..-...|.  +..|.++.+.|+|+|..
T Consensus       230 e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~--------g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~  301 (350)
T KOG4260|consen  230 EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK--------GVDECQFCADVCASKNRPCMNIDGQYRCVCFS  301 (350)
T ss_pred             ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC--------ChHHhhhhhhhcccCCCCcccCCccEEEEecc
Confidence            447899999998888894 559999999999999999953        2556643224453  47899999999999988


Q ss_pred             CCC
Q psy7854         147 SSP  149 (153)
Q Consensus       147 g~~  149 (153)
                      |+.
T Consensus       302 ~~~  304 (350)
T KOG4260|consen  302 GLI  304 (350)
T ss_pred             cce
Confidence            864


No 10 
>KOG4289|consensus
Probab=98.09  E-value=3.4e-06  Score=72.99  Aligned_cols=53  Identities=25%  Similarity=0.658  Sum_probs=46.5

Q ss_pred             eecCCCceeeCCCCcccCCCCCCC-ccCccccCCCCCCCC-CeeeecCCCcEEecCCCCCc
Q psy7854          92 ENLMGTYRCICNPGYEGDPTGKLC-SDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPI  150 (153)
Q Consensus        92 ~~~~g~~~C~C~~g~~~~~~g~~C-~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~  150 (153)
                      .+..++++|.|++||    .|..| ++||+|  ..++|.+ +.|....|+|.|.|++||.+
T Consensus      1216 i~pvnglrCrCPpGF----Tgd~CeTeiDlC--Ys~pC~nng~C~srEggYtCeCrpg~tG 1270 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGF----TGDYCETEIDLC--YSGPCGNNGRCRSREGGYTCECRPGFTG 1270 (2531)
T ss_pred             ccccCceeEeCCCCC----CcccccchhHhh--hcCCCCCCCceEEecCceeEEecCCccc
Confidence            344577999999999    66789 489999  8999995 99999999999999999975


No 11 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.04  E-value=3.3e-06  Score=43.10  Aligned_cols=31  Identities=45%  Similarity=1.113  Sum_probs=25.2

Q ss_pred             CCCCCCceeecCCCceeeCCCCcccCCCCCCC
Q psy7854          84 GICTNGACENLMGTYRCICNPGYEGDPTGKLC  115 (153)
Q Consensus        84 ~~C~~~~C~~~~g~~~C~C~~g~~~~~~g~~C  115 (153)
                      +.|.+ .|++.+++|.|.|++||.+.++++.|
T Consensus         6 GgC~h-~C~~~~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    6 GGCSH-ICVNTPGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             GGSSS-EEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred             CCcCC-CCccCCCceEeECCCCCEECcCCCCC
Confidence            56776 99999999999999999998887764


No 12 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.95  E-value=1.6e-05  Score=40.95  Aligned_cols=32  Identities=34%  Similarity=0.642  Sum_probs=26.6

Q ss_pred             cCccccCCCCCCCC-CeeeecCCCcEEecCCCCC
Q psy7854         117 DQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSP  149 (153)
Q Consensus       117 ~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~  149 (153)
                      ++++|... .+|.+ +.|+++.|+|.|.|++||.
T Consensus         1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179        1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence            46788532 78885 7999999999999999997


No 13 
>KOG4260|consensus
Probab=97.94  E-value=5.2e-06  Score=60.46  Aligned_cols=52  Identities=29%  Similarity=0.579  Sum_probs=45.1

Q ss_pred             CCcee-eCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcEEecCCCCC
Q psy7854          96 GTYRC-ICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSP  149 (153)
Q Consensus        96 g~~~C-~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~  149 (153)
                      ++..| .|+.||.+  +...|.|||||...+.+|.. ..|+|+.|+|.|.+++||.
T Consensus       215 ~~k~C~kCkkGW~l--de~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~  268 (350)
T KOG4260|consen  215 SSKGCSKCKKGWKL--DEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYK  268 (350)
T ss_pred             CCCChhhhccccee--cccccccHHHHhcCCCCCChhheeecCCCceEeccccccc
Confidence            34455 79999988  56789999999999999984 8999999999999999985


No 14 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.82  E-value=1e-05  Score=40.24  Aligned_cols=29  Identities=21%  Similarity=0.460  Sum_probs=24.5

Q ss_pred             CCCCCCC-CeeeecC-CCcEEecCCCCCccc
Q psy7854         124 DVMICSG-GTCKNTP-GSFQSNFQDSSPILK  152 (153)
Q Consensus       124 ~~~~C~~-~~C~~~~-g~~~C~C~~g~~~~~  152 (153)
                      .+++|.| ++|+... ++|.|.|++||.+.+
T Consensus         2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            3568886 9999988 999999999998753


No 15 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.81  E-value=1.1e-05  Score=41.17  Aligned_cols=32  Identities=53%  Similarity=1.246  Sum_probs=24.0

Q ss_pred             CCCCC-CCCceeecCCCceeeCCCCcccCCCCCCC
Q psy7854          82 NPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLC  115 (153)
Q Consensus        82 ~~~~C-~~~~C~~~~g~~~C~C~~g~~~~~~g~~C  115 (153)
                      ..+.| .++.|.++.++|.|+|++||.+  +|..|
T Consensus         4 ~~~~C~~nA~C~~~~~~~~C~C~~Gy~G--dG~~C   36 (36)
T PF12947_consen    4 NNGGCHPNATCTNTGGSYTCTCKPGYEG--DGFFC   36 (36)
T ss_dssp             GGGGS-TTCEEEE-TTSEEEEE-CEEEC--CSTCE
T ss_pred             CCCCCCCCcEeecCCCCEEeECCCCCcc--CCcCC
Confidence            34677 5789999999999999999987  55543


No 16 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.66  E-value=1.9e-05  Score=40.35  Aligned_cols=30  Identities=27%  Similarity=0.324  Sum_probs=22.1

Q ss_pred             cCCCCCCC-CCeeeecCCCcEEecCCCCCcc
Q psy7854         122 LMDVMICS-GGTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       122 ~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      ....+.|+ ++.|++++++|.|.|++||.+.
T Consensus         2 ~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    2 LENNGGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             TTGGGGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCCCCCcEeecCCCCEEeECCCCCccC
Confidence            33557787 5999999999999999999754


No 17 
>KOG1217|consensus
Probab=97.61  E-value=0.00031  Score=55.27  Aligned_cols=61  Identities=36%  Similarity=0.821  Sum_probs=49.3

Q ss_pred             CceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854          89 GACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus        89 ~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      +.|.+..++|+|.|++||.+... ..+.++++|..... |.+ ++|++..+.|.|.|++||.+.
T Consensus       243 ~~c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~  304 (487)
T KOG1217|consen  243 GTCVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGR  304 (487)
T ss_pred             CcccccCCceeeeCCCCcccccc-ceeeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCC
Confidence            57888889999999999966211 35789999975544 876 899999999999999999753


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.56  E-value=0.00015  Score=37.20  Aligned_cols=31  Identities=39%  Similarity=0.980  Sum_probs=25.2

Q ss_pred             ccCCCCCCCCCC-CCceeecCCCceeeCCCCcc
Q psy7854          76 NHHTTENPGICT-NGACENLMGTYRCICNPGYE  107 (153)
Q Consensus        76 ~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~  107 (153)
                      +++|... .+|. ++.|.+..++|.|.|++||.
T Consensus         2 ~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179        2 IDECASG-NPCQNGGTCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             cccCcCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence            5777653 5785 45999999999999999995


No 19 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.52  E-value=0.00017  Score=36.53  Aligned_cols=33  Identities=30%  Similarity=0.508  Sum_probs=26.3

Q ss_pred             CccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854         118 QNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       118 i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      +++|... .+|.+ +.|.+..++|.|.|++||.+.
T Consensus         2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g~   35 (38)
T cd00054           2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGR   35 (38)
T ss_pred             cccCCCC-CCcCCCCEeECCCCCeEeECCCCCcCC
Confidence            5677422 67875 899999999999999999763


No 20 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.45  E-value=6.6e-05  Score=38.26  Aligned_cols=25  Identities=32%  Similarity=0.542  Sum_probs=20.5

Q ss_pred             CCCCCCCeeeecCCCcEEecCCCCCc
Q psy7854         125 VMICSGGTCKNTPGSFQSNFQDSSPI  150 (153)
Q Consensus       125 ~~~C~~~~C~~~~g~~~C~C~~g~~~  150 (153)
                      .+.|. ..|++++++|+|.|++||.+
T Consensus         5 NGgC~-h~C~~~~g~~~C~C~~Gy~L   29 (36)
T PF14670_consen    5 NGGCS-HICVNTPGSYRCSCPPGYKL   29 (36)
T ss_dssp             GGGSS-SEEEEETTSEEEE-STTEEE
T ss_pred             CCCcC-CCCccCCCceEeECCCCCEE
Confidence            45566 89999999999999999965


No 21 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.34  E-value=0.00025  Score=51.14  Aligned_cols=40  Identities=28%  Similarity=0.639  Sum_probs=34.7

Q ss_pred             CCCCCccCccccCCCCCCCCCeeeecCCCcEEecCCCCCcc
Q psy7854         111 TGKLCSDQNECLMDVMICSGGTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       111 ~g~~C~~i~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      .+..|.++++|....+.|. ..|.++.|+|.|.|++||.+.
T Consensus       180 ~~~~C~~~~~C~~~~~~c~-~~C~~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         180 QGKICVVPDLCATLSHVCQ-QVCISTPGSYLCACTEGYALL  219 (224)
T ss_pred             ccccCcCchhhcCCCCCcc-ceEEcCCCCEEeECCCCccCC
Confidence            3567999999987778887 799999999999999999764


No 22 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.28  E-value=0.00031  Score=50.60  Aligned_cols=42  Identities=19%  Similarity=0.453  Sum_probs=36.0

Q ss_pred             CCcCCcccCCCCCCCCCCCCceeecCCCceeeCCCCcccCCCC
Q psy7854          70 SEVHGANHHTTENPGICTNGACENLMGTYRCICNPGYEGDPTG  112 (153)
Q Consensus        70 ~~~C~~~~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~~~~g  112 (153)
                      +..|.++++|...++.|.+ .|.+..|+|.|.|..||.+.+++
T Consensus       181 ~~~C~~~~~C~~~~~~c~~-~C~~~~g~~~c~c~~g~~~~~~~  222 (224)
T cd01475         181 GKICVVPDLCATLSHVCQQ-VCISTPGSYLCACTEGYALLEDN  222 (224)
T ss_pred             cccCcCchhhcCCCCCccc-eEEcCCCCEEeECCCCccCCCCC
Confidence            6789999999877678876 89999999999999999876554


No 23 
>KOG1217|consensus
Probab=97.08  E-value=0.0036  Score=49.28  Aligned_cols=111  Identities=28%  Similarity=0.519  Sum_probs=68.4

Q ss_pred             CCcccCcCCCCCCCeeeCCCceecccCCCCccCCCCCCCCc---CCcccCCCC--CCCCCCCC-ce--eecCCCceeeCC
Q psy7854          32 RSISIDEFTVHSSTLIALSQESQDLERLKFGLEPPTSRSEV---HGANHHTTE--NPGICTNG-AC--ENLMGTYRCICN  103 (153)
Q Consensus        32 ~c~~i~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---C~~~~~C~~--~~~~C~~~-~C--~~~~g~~~C~C~  103 (153)
                      .+.++++|..... +.+.+.|...... ....+++++.+..   |.+.++|..  ...+|.++ .|  ....+.+.|.|.
T Consensus       267 ~~~~~~~C~~~~~-c~~~~~C~~~~~~-~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~  344 (487)
T KOG1217|consen  267 TCVDVDSCALIAS-CPNGGTCVNVPGS-YRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACG  344 (487)
T ss_pred             eeeeccccCCCCc-cCCCCeeecCCCc-ceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCC
Confidence            4567777766542 3333444322211 1122444444444   445567753  23557544 67  334457889999


Q ss_pred             CCcccCCCCCCCccCc-cccCCCCCCCC-Ceeee-cCCCcEEecCCCCCc
Q psy7854         104 PGYEGDPTGKLCSDQN-ECLMDVMICSG-GTCKN-TPGSFQSNFQDSSPI  150 (153)
Q Consensus       104 ~g~~~~~~g~~C~~i~-~C~~~~~~C~~-~~C~~-~~g~~~C~C~~g~~~  150 (153)
                      .+|    .+..|+..+ +|.  ...+.+ +.|++ ..+.|.|.|+.+|.+
T Consensus       345 ~~~----~g~~C~~~~~~C~--~~~~~~~~~c~~~~~~~~~c~~~~~~~~  388 (487)
T KOG1217|consen  345 PGF----TGRRCEDSNDECA--SSPCCPGGTCVNETPGSYRCACPAGFAG  388 (487)
T ss_pred             CCC----CCCccccCCcccc--CCccccCCEeccCCCCCeEecCCCcccc
Confidence            886    788999884 895  343554 89998 799999999999864


No 24 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.03  E-value=0.0012  Score=32.98  Aligned_cols=25  Identities=40%  Similarity=0.636  Sum_probs=21.3

Q ss_pred             CCCCCCeeeecCCCcEEecCCCCCc
Q psy7854         126 MICSGGTCKNTPGSFQSNFQDSSPI  150 (153)
Q Consensus       126 ~~C~~~~C~~~~g~~~C~C~~g~~~  150 (153)
                      ++|.+++|+++.++|.|.|++||.+
T Consensus         6 ~~C~~~~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        6 GPCSNGTCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCCCCEEECCCCCeEeECCCCCcc
Confidence            5676338999999999999999976


No 25 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.97  E-value=0.0015  Score=32.38  Aligned_cols=27  Identities=37%  Similarity=0.529  Sum_probs=22.9

Q ss_pred             CCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854         125 VMICSG-GTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       125 ~~~C~~-~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      ..+|.+ +.|+++.+.|.|.|+.||.+.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            566765 899999999999999999764


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.89  E-value=0.002  Score=32.44  Aligned_cols=32  Identities=41%  Similarity=1.027  Sum_probs=24.4

Q ss_pred             ccCCCCCCCCCC-CCceeecCCCceeeCCCCccc
Q psy7854          76 NHHTTENPGICT-NGACENLMGTYRCICNPGYEG  108 (153)
Q Consensus        76 ~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~  108 (153)
                      +++|... .+|. ++.|.+..+.|.|.|+.||.+
T Consensus         2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054           2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             cccCCCC-CCcCCCCEeECCCCCeEeECCCCCcC
Confidence            4667542 4685 458999999999999999943


No 27 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.82  E-value=0.00098  Score=32.98  Aligned_cols=25  Identities=52%  Similarity=1.250  Sum_probs=21.5

Q ss_pred             CCCCC-CceeecC-CCceeeCCCCccc
Q psy7854          84 GICTN-GACENLM-GTYRCICNPGYEG  108 (153)
Q Consensus        84 ~~C~~-~~C~~~~-g~~~C~C~~g~~~  108 (153)
                      .+|.+ ++|+... +.|.|.|++||.+
T Consensus         4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G   30 (32)
T PF00008_consen    4 NPCQNGGTCIDLPGGGYTCECPPGYTG   30 (32)
T ss_dssp             TSSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred             CcCCCCeEEEeCCCCCEEeECCCCCcc
Confidence            68864 5999998 9999999999954


No 28 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.41  E-value=0.0069  Score=30.12  Aligned_cols=25  Identities=56%  Similarity=1.350  Sum_probs=20.4

Q ss_pred             CCCCCCceeecCCCceeeCCCCccc
Q psy7854          84 GICTNGACENLMGTYRCICNPGYEG  108 (153)
Q Consensus        84 ~~C~~~~C~~~~g~~~C~C~~g~~~  108 (153)
                      .+|.++.|.+..++|.|.|++||.+
T Consensus         6 ~~C~~~~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        6 GPCSNGTCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCCCCEEECCCCCeEeECCCCCcc
Confidence            4675338999999999999999955


No 29 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.01  E-value=0.015  Score=28.59  Aligned_cols=25  Identities=52%  Similarity=1.363  Sum_probs=21.1

Q ss_pred             CCCC-CCceeecCCCceeeCCCCccc
Q psy7854          84 GICT-NGACENLMGTYRCICNPGYEG  108 (153)
Q Consensus        84 ~~C~-~~~C~~~~g~~~C~C~~g~~~  108 (153)
                      .+|. ++.|.+..+.|.|.|+.||.+
T Consensus         6 ~~C~~~~~C~~~~~~~~C~C~~g~~g   31 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCVCPPGYTG   31 (36)
T ss_pred             CCCCCCCEEecCCCCeEeECCCCCcc
Confidence            6674 468999999999999999955


No 30 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.12  E-value=0.007  Score=42.12  Aligned_cols=66  Identities=21%  Similarity=0.521  Sum_probs=45.3

Q ss_pred             CCCCCCceeecCCCceeeCCCCcccCCCCCCCccCccccC---CCCCCCC-CeeeecC-----CCcEEecCCCCCc
Q psy7854          84 GICTNGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLM---DVMICSG-GTCKNTP-----GSFQSNFQDSSPI  150 (153)
Q Consensus        84 ~~C~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~---~~~~C~~-~~C~~~~-----g~~~C~C~~g~~~  150 (153)
                      ..|.++........|.|.|..||.+. +...|+...+|..   ...+|.. +.|++.+     ..|.|.|.+||.+
T Consensus         6 T~CKNG~LiQMSNHfEC~Cnegfvl~-~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~   80 (197)
T PF06247_consen    6 TICKNGYLIQMSNHFECKCNEGFVLK-NENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL   80 (197)
T ss_dssp             ---BTEEEEEESSEEEEEESTTEEEE-ETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE
T ss_pred             ccccCCEEEEccCceEEEcCCCcEEc-cccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCcee
Confidence            56777888888899999999999774 4568887777753   2366876 8998865     5699999999964


No 31 
>KOG1225|consensus
Probab=92.55  E-value=0.44  Score=38.75  Aligned_cols=69  Identities=35%  Similarity=0.709  Sum_probs=33.9

Q ss_pred             CccCCCCCCCCcCCcccCCCCCCCCCC-CCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCC
Q psy7854          61 FGLEPPTSRSEVHGANHHTTENPGICT-NGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPG  138 (153)
Q Consensus        61 ~~~~~~~~~~~~C~~~~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g  138 (153)
                      ...++++|.+..|... .|..   .|. ++.+.  .+  .|.|.+||    .|..|+... |   +..|.+ +.|+  .|
T Consensus       266 ~CIC~~Gf~G~dC~e~-~Cp~---~cs~~g~~~--~g--~CiC~~g~----~G~dCs~~~-c---padC~g~G~Ci--~G  327 (525)
T KOG1225|consen  266 RCICPPGFTGDDCDEL-VCPV---DCSGGGVCV--DG--ECICNPGY----SGKDCSIRR-C---PADCSGHGKCI--DG  327 (525)
T ss_pred             eEeCCCCCcCCCCCcc-cCCc---ccCCCceec--CC--EeecCCCc----ccccccccc-C---CccCCCCCccc--CC
Confidence            3346666666666431 2322   242 22222  22  67888888    555554221 3   344543 5664  22


Q ss_pred             CcEEecCCCCC
Q psy7854         139 SFQSNFQDSSP  149 (153)
Q Consensus       139 ~~~C~C~~g~~  149 (153)
                        +|.|.+||.
T Consensus       328 --~C~C~~Gy~  336 (525)
T KOG1225|consen  328 --ECLCDEGYT  336 (525)
T ss_pred             --ceEeCCCCc
Confidence              566666664


No 32 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.26  E-value=0.097  Score=26.62  Aligned_cols=25  Identities=16%  Similarity=0.142  Sum_probs=17.5

Q ss_pred             CCCCC-CCeeeecC-CCcEEecCCCCC
Q psy7854         125 VMICS-GGTCKNTP-GSFQSNFQDSSP  149 (153)
Q Consensus       125 ~~~C~-~~~C~~~~-g~~~C~C~~g~~  149 (153)
                      ...|. |+.|++.. |.+.|+|-+||.
T Consensus         4 ~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             SS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             CccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            35666 58999876 999999999995


No 33 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=90.87  E-value=0.12  Score=20.08  Aligned_cols=10  Identities=50%  Similarity=1.491  Sum_probs=6.8

Q ss_pred             eeeCCCCccc
Q psy7854          99 RCICNPGYEG  108 (153)
Q Consensus        99 ~C~C~~g~~~  108 (153)
                      .|.|++||.+
T Consensus         1 ~C~C~~G~~G   10 (13)
T PF12661_consen    1 TCQCPPGWTG   10 (13)
T ss_dssp             EEEE-TTEET
T ss_pred             CccCcCCCcC
Confidence            4889999943


No 34 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=88.46  E-value=1.1  Score=21.97  Aligned_cols=24  Identities=29%  Similarity=0.400  Sum_probs=18.0

Q ss_pred             CCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854         126 MICSG-GTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       126 ~~C~~-~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      ..|.+ ++|+..  ..+|.|.+||.+.
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~   30 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGP   30 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCC
Confidence            35764 899765  3589999999764


No 35 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=86.66  E-value=0.77  Score=30.12  Aligned_cols=35  Identities=11%  Similarity=0.186  Sum_probs=24.0

Q ss_pred             CccccCC-CCCCCCCeeeec--CCCcEEecCCCCCccc
Q psy7854         118 QNECLMD-VMICSGGTCKNT--PGSFQSNFQDSSPILK  152 (153)
Q Consensus       118 i~~C~~~-~~~C~~~~C~~~--~g~~~C~C~~g~~~~~  152 (153)
                      +.+|... .+.|-|+.|...  ...+.|.|..||.+.|
T Consensus        42 i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYtGeR   79 (139)
T PHA03099         42 IRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYTGIR   79 (139)
T ss_pred             cccCChhhCCEeECCEEEeeccCCCceeECCCCccccc
Confidence            4455322 356766788654  4778999999998876


No 36 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=84.58  E-value=1.1  Score=29.47  Aligned_cols=32  Identities=34%  Similarity=0.788  Sum_probs=24.7

Q ss_pred             CCCCCCceeecC--CCceeeCCCCcccCCCCCCCccCc
Q psy7854          84 GICTNGACENLM--GTYRCICNPGYEGDPTGKLCSDQN  119 (153)
Q Consensus        84 ~~C~~~~C~~~~--g~~~C~C~~g~~~~~~g~~C~~i~  119 (153)
                      +.|-||.|.-..  ..+.|.|..||    .|.+|+-.+
T Consensus        51 ~YClHG~C~yI~dl~~~~CrC~~GY----tGeRCEh~d   84 (139)
T PHA03099         51 GYCLHGDCIHARDIDGMYCRCSHGY----TGIRCQHVV   84 (139)
T ss_pred             CEeECCEEEeeccCCCceeECCCCc----cccccccee
Confidence            567777887654  78999999999    778887543


No 37 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.11  E-value=0.47  Score=33.23  Aligned_cols=88  Identities=19%  Similarity=0.314  Sum_probs=48.2

Q ss_pred             eecccCCCCccCCCCCCCCcCCcccCCCCCCCCCCCCceeecC---CCceeeCCCCcccCCCCCCCc--cCccccCCCCC
Q psy7854          53 SQDLERLKFGLEPPTSRSEVHGANHHTTENPGICTNGACENLM---GTYRCICNPGYEGDPTGKLCS--DQNECLMDVMI  127 (153)
Q Consensus        53 ~~~~~~~~~~~~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~~---g~~~C~C~~g~~~~~~g~~C~--~i~~C~~~~~~  127 (153)
                      ....|.|..+|.....   .|. .+.|..  -.|..+.|+-.+   ....|+|.-|+.. .+...|+  ...+|.   -.
T Consensus        68 ~~~~C~C~~gY~~~~~---vCv-p~~C~~--~~Cg~GKCI~d~~~~~~~~CSC~IGkV~-~dn~kCtk~G~T~C~---LK  137 (197)
T PF06247_consen   68 RAYKCDCINGYILKQG---VCV-PNKCNN--KDCGSGKCILDPDNPNNPTCSCNIGKVP-DDNKKCTKTGETKCS---LK  137 (197)
T ss_dssp             TSEEEEE-TTEEESSS---SEE-EGGGSS-----TTEEEEEEEGGGSEEEEEE-TEEET-TTTTESEEEE----------
T ss_pred             eeEEEecccCceeeCC---eEc-hhhcCc--eecCCCeEEecCCCCCCceeEeeeceEe-ccCCcccCCCcccee---ee
Confidence            3455666667655432   453 234544  457667786543   3458999999983 2566774  445673   44


Q ss_pred             CC-CCeeeecCCCcEEecCCCCCc
Q psy7854         128 CS-GGTCKNTPGSFQSNFQDSSPI  150 (153)
Q Consensus       128 C~-~~~C~~~~g~~~C~C~~g~~~  150 (153)
                      |. +..|....+-|+|.|.+||.+
T Consensus       138 Ck~nE~CK~~~~~Y~C~~~~~~~~  161 (197)
T PF06247_consen  138 CKENEECKLVDGYYKCVCKEGFPG  161 (197)
T ss_dssp             -TTTEEEEEETTEEEEEE-TT-EE
T ss_pred             cCCCcceeeeCcEEEeecCCCCCC
Confidence            65 479999999999999999864


No 38 
>KOG1225|consensus
Probab=82.29  E-value=3.2  Score=33.98  Aligned_cols=61  Identities=30%  Similarity=0.675  Sum_probs=34.9

Q ss_pred             CCCCCCCCcCCcccCCCCCCCCC-CCCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCC-CeeeecCCCcE
Q psy7854          64 EPPTSRSEVHGANHHTTENPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNTPGSFQ  141 (153)
Q Consensus        64 ~~~~~~~~~C~~~~~C~~~~~~C-~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~  141 (153)
                      +.++|.+..|.... |.   ..| .++.|+    .-+|.|.+||    .|..|....        |.+ +.|++.     
T Consensus       300 C~~g~~G~dCs~~~-cp---adC~g~G~Ci----~G~C~C~~Gy----~G~~C~~~~--------C~~~g~cv~g-----  354 (525)
T KOG1225|consen  300 CNPGYSGKDCSIRR-CP---ADCSGHGKCI----DGECLCDEGY----TGELCIQRA--------CSGGGQCVNG-----  354 (525)
T ss_pred             cCCCcccccccccc-CC---ccCCCCCccc----CCceEeCCCC----cCCcccccc--------cCCCceeccC-----
Confidence            44455555554222 32   456 366776    3478999999    566664432        444 566442     


Q ss_pred             EecCCCCC
Q psy7854         142 SNFQDSSP  149 (153)
Q Consensus       142 C~C~~g~~  149 (153)
                      |.|..||.
T Consensus       355 C~C~~Gw~  362 (525)
T KOG1225|consen  355 CKCKKGWR  362 (525)
T ss_pred             ceeccCcc
Confidence            66777775


No 39 
>PHA02887 EGF-like protein; Provisional
Probab=80.81  E-value=1.6  Score=28.19  Aligned_cols=27  Identities=15%  Similarity=0.215  Sum_probs=20.0

Q ss_pred             CCCCCCeeeec--CCCcEEecCCCCCccc
Q psy7854         126 MICSGGTCKNT--PGSFQSNFQDSSPILK  152 (153)
Q Consensus       126 ~~C~~~~C~~~--~g~~~C~C~~g~~~~~  152 (153)
                      +.|-++.|...  .....|.|+.||.+.|
T Consensus        92 ~YCiHG~C~yI~dL~epsCrC~~GYtG~R  120 (126)
T PHA02887         92 DFCINGECMNIIDLDEKFCICNKGYTGIR  120 (126)
T ss_pred             CEeeCCEEEccccCCCceeECCCCcccCC
Confidence            55656788653  4558899999998765


No 40 
>PHA02887 EGF-like protein; Provisional
Probab=80.28  E-value=1.9  Score=27.88  Aligned_cols=31  Identities=45%  Similarity=1.059  Sum_probs=23.6

Q ss_pred             CCCCCCceeec--CCCceeeCCCCcccCCCCCCCccC
Q psy7854          84 GICTNGACENL--MGTYRCICNPGYEGDPTGKLCSDQ  118 (153)
Q Consensus        84 ~~C~~~~C~~~--~g~~~C~C~~g~~~~~~g~~C~~i  118 (153)
                      +.|-||.|.-.  .....|.|..||    .|.+|+..
T Consensus        92 ~YCiHG~C~yI~dL~epsCrC~~GY----tG~RCE~v  124 (126)
T PHA02887         92 DFCINGECMNIIDLDEKFCICNKGY----TGIRCDEV  124 (126)
T ss_pred             CEeeCCEEEccccCCCceeECCCCc----ccCCCCcc
Confidence            56777888765  466899999999    77788654


No 41 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=79.53  E-value=1.5  Score=21.81  Aligned_cols=25  Identities=12%  Similarity=0.121  Sum_probs=15.5

Q ss_pred             CCCCCCCeeeecCCCcEEecCCCCCcc
Q psy7854         125 VMICSGGTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       125 ~~~C~~~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      ...|. +.|..... +.|.|+.||.+.
T Consensus         5 ~t~Cp-A~CDpn~~-~~C~CPeGyIld   29 (34)
T PF09064_consen    5 QTECP-ADCDPNSP-GQCFCPEGYILD   29 (34)
T ss_pred             cccCC-CccCCCCC-CceeCCCceEec
Confidence            34454 66643222 489999999753


No 42 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=78.20  E-value=3  Score=26.40  Aligned_cols=33  Identities=15%  Similarity=0.259  Sum_probs=23.1

Q ss_pred             cCccccCCCCCCCC-CeeeecCCCcEEecCCCCCcc
Q psy7854         117 DQNECLMDVMICSG-GTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus       117 ~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      ..+.|. ....|.. +.|. ......|.|.+||...
T Consensus        76 p~d~Cd-~y~~CG~~g~C~-~~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   76 PKDQCD-VYGFCGPNGICN-SNNSPKCSCLPGFEPK  109 (110)
T ss_pred             cccCCC-CccccCCccEeC-CCCCCceECCCCcCCC
Confidence            345675 3578884 9994 3345679999999754


No 43 
>KOG1215|consensus
Probab=71.55  E-value=7  Score=34.01  Aligned_cols=71  Identities=24%  Similarity=0.396  Sum_probs=45.3

Q ss_pred             ccCCCCCCCCCCCCceeecCCCceeeCCCCcccCCCCCCCccCccccCCCCCCCCCeee-ecCCCcEEecCCCCCc
Q psy7854          76 NHHTTENPGICTNGACENLMGTYRCICNPGYEGDPTGKLCSDQNECLMDVMICSGGTCK-NTPGSFQSNFQDSSPI  150 (153)
Q Consensus        76 ~~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~~~C~-~~~g~~~C~C~~g~~~  150 (153)
                      ++++......+.+ .+.+......|.|..++.....+..  +...|....+.|. ..|+ +.++.|.|.|..||.+
T Consensus       325 ~~~~~~~~~~~~~-~~~~~~v~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~Cs-q~C~~~~p~~~~c~c~~g~~~  396 (877)
T KOG1215|consen  325 LNECAERVLKCSH-KCPDVSVGPRCDCMGAKVLPLGART--DSNPCESDNGGCS-QLCVPNSPGTFKCACSPGYEL  396 (877)
T ss_pred             cccchhhcccccC-CCCccccCCcccCCccceecccccc--cCCcccccCCccc-eeccCCCCCceeEecCCCcEe
Confidence            3444433333433 5666677778888887765333333  3344554667776 7887 5689999999999964


No 44 
>KOG0994|consensus
Probab=67.11  E-value=12  Score=33.71  Aligned_cols=59  Identities=27%  Similarity=0.593  Sum_probs=37.6

Q ss_pred             ceeecCCCc--ee-eCCCCcccCCCCCCCc---cCccccCCCCCCCCCeeeecCCCcEEe-cCCCCCc
Q psy7854          90 ACENLMGTY--RC-ICNPGYEGDPTGKLCS---DQNECLMDVMICSGGTCKNTPGSFQSN-FQDSSPI  150 (153)
Q Consensus        90 ~C~~~~g~~--~C-~C~~g~~~~~~g~~C~---~i~~C~~~~~~C~~~~C~~~~g~~~C~-C~~g~~~  150 (153)
                      .|.=..|.|  .| .|.+||++.|.-+.|+   -.++|...++.|.  .|....+++.|. |-.||-+
T Consensus       831 QC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi--~CqD~T~G~~CdrCl~GyyG  896 (1758)
T KOG0994|consen  831 QCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHADTCDPITGACI--DCQDSTTGHSCDRCLDGYYG  896 (1758)
T ss_pred             ceeeccccchhhccccCCCccCCCcCccccccCcccccCccccccc--cccccccccchhhhhccccC
Confidence            343334444  56 6899999888766664   3466765556664  455666667775 8888743


No 45 
>KOG1226|consensus
Probab=64.37  E-value=50  Score=28.48  Aligned_cols=48  Identities=25%  Similarity=0.594  Sum_probs=25.2

Q ss_pred             eeeCCCCcccCCCCCCCccCc-cccC-CCCCCC-CCeeeecCCCcEEecCCCCCc
Q psy7854          99 RCICNPGYEGDPTGKLCSDQN-ECLM-DVMICS-GGTCKNTPGSFQSNFQDSSPI  150 (153)
Q Consensus        99 ~C~C~~g~~~~~~g~~C~~i~-~C~~-~~~~C~-~~~C~~~~g~~~C~C~~g~~~  150 (153)
                      .|.|.+...+..+|..|+.-| .|.. ....|. ++.|.  -|  +|+|.+||.+
T Consensus       526 qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~--CG--~CvC~~GwtG  576 (783)
T KOG1226|consen  526 QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCE--CG--RCVCNPGWTG  576 (783)
T ss_pred             ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEe--CC--cEEcCCCCcc
Confidence            466666554433577776322 2321 123455 35552  22  6888888865


No 46 
>KOG0994|consensus
Probab=58.94  E-value=40  Score=30.75  Aligned_cols=62  Identities=21%  Similarity=0.372  Sum_probs=32.7

Q ss_pred             ceeecCCCcee-eCCCCcccCCC---CCCCccCccccCCCCC-CCC-Ceeee--cCCCcEEecCCCCCccc
Q psy7854          90 ACENLMGTYRC-ICNPGYEGDPT---GKLCSDQNECLMDVMI-CSG-GTCKN--TPGSFQSNFQDSSPILK  152 (153)
Q Consensus        90 ~C~~~~g~~~C-~C~~g~~~~~~---g~~C~~i~~C~~~~~~-C~~-~~C~~--~~g~~~C~C~~g~~~~~  152 (153)
                      .|...-.++.| .|..||.++|.   |..|..- +|-..+.. -++ ..|..  ......|.|.+||.+.|
T Consensus       877 ~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPC-pCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~R  946 (1758)
T KOG0994|consen  877 DCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPC-PCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSR  946 (1758)
T ss_pred             cccccccccchhhhhccccCCcccCCCCCCCCC-CCCCCCccchhccccccccccccceeeecccCccccc
Confidence            45666677888 89999987653   2333221 11111100 011 13321  12235799999998765


No 47 
>KOG3516|consensus
Probab=56.67  E-value=10  Score=34.02  Aligned_cols=39  Identities=26%  Similarity=0.762  Sum_probs=30.5

Q ss_pred             cCCcccCCCCCCCCCCCC-ceeecCCCceeeCC-CCcccCCCCCCCc
Q psy7854          72 VHGANHHTTENPGICTNG-ACENLMGTYRCICN-PGYEGDPTGKLCS  116 (153)
Q Consensus        72 ~C~~~~~C~~~~~~C~~~-~C~~~~g~~~C~C~-~g~~~~~~g~~C~  116 (153)
                      .|..+|.|..  ++|+|+ .|......|.|.|. .||    .|..|.
T Consensus       541 ~C~i~drClP--N~CehgG~C~Qs~~~f~C~C~~TGY----~GatCH  581 (1306)
T KOG3516|consen  541 MCGISDRCLP--NPCEHGGKCSQSWDDFECNCELTGY----KGATCH  581 (1306)
T ss_pred             ccccccccCC--ccccCCCcccccccceeEecccccc----cccccc
Confidence            5666777766  899765 89888889999998 799    566664


No 48 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=56.10  E-value=23  Score=18.93  Aligned_cols=29  Identities=24%  Similarity=0.666  Sum_probs=17.8

Q ss_pred             CCCCccCccccCCCCCCCCCeeeecCCCcEEecCCCCCc
Q psy7854         112 GKLCSDQNECLMDVMICSGGTCKNTPGSFQSNFQDSSPI  150 (153)
Q Consensus       112 g~~C~~i~~C~~~~~~C~~~~C~~~~g~~~C~C~~g~~~  150 (153)
                      |..|..-.+|.      .++.|++  |  .|.|++||..
T Consensus        19 g~~C~~~~qC~------~~s~C~~--g--~C~C~~g~~~   47 (52)
T PF01683_consen   19 GESCESDEQCI------GGSVCVN--G--RCQCPPGYVE   47 (52)
T ss_pred             CCCCCCcCCCC------CcCEEcC--C--EeECCCCCEe
Confidence            44565555553      2367744  2  7999999854


No 49 
>KOG1226|consensus
Probab=51.55  E-value=53  Score=28.33  Aligned_cols=21  Identities=48%  Similarity=1.319  Sum_probs=13.9

Q ss_pred             eeeCCCCcccCCCCCCCc---cCccccC
Q psy7854          99 RCICNPGYEGDPTGKLCS---DQNECLM  123 (153)
Q Consensus        99 ~C~C~~g~~~~~~g~~C~---~i~~C~~  123 (153)
                      +|+|.+||    .|..|.   +.+.|..
T Consensus       567 ~CvC~~Gw----tG~~C~C~~std~C~~  590 (783)
T KOG1226|consen  567 RCVCNPGW----TGSACNCPLSTDTCES  590 (783)
T ss_pred             cEEcCCCC----ccCCCCCCCCCccccC
Confidence            67899999    555553   4555653


No 50 
>KOG1836|consensus
Probab=50.53  E-value=25  Score=33.26  Aligned_cols=58  Identities=22%  Similarity=0.393  Sum_probs=32.1

Q ss_pred             eeecCCCcee-eCCCCcccCCCCCCCccCccccCCCCCCCC-Ceeeec--CCCcEEe-cCCCCCcccC
Q psy7854          91 CENLMGTYRC-ICNPGYEGDPTGKLCSDQNECLMDVMICSG-GTCKNT--PGSFQSN-FQDSSPILKI  153 (153)
Q Consensus        91 C~~~~g~~~C-~C~~g~~~~~~g~~C~~i~~C~~~~~~C~~-~~C~~~--~g~~~C~-C~~g~~~~~~  153 (153)
                      |.....+-.| .|..||.+.+....-.|   |  .+=+|.+ +-|..+  .....|. |++||.++|+
T Consensus       749 C~~~t~G~~C~~C~~GfYg~~~~~~~~d---C--~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC  811 (1705)
T KOG1836|consen  749 CKHNTFGGQCAQCVDGFYGLPDLGTSGD---C--QPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRC  811 (1705)
T ss_pred             cccCCCCCchhhhcCCCCCccccCCCCC---C--ccCCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence            4333333344 78888855444322222   4  3344443 445443  4556888 9999998874


No 51 
>KOG3516|consensus
Probab=46.00  E-value=18  Score=32.58  Aligned_cols=36  Identities=19%  Similarity=0.449  Sum_probs=29.2

Q ss_pred             CCCccCccccCCCCCCCC-CeeeecCCCcEEecC-CCCCc
Q psy7854         113 KLCSDQNECLMDVMICSG-GTCKNTPGSFQSNFQ-DSSPI  150 (153)
Q Consensus       113 ~~C~~i~~C~~~~~~C~~-~~C~~~~g~~~C~C~-~g~~~  150 (153)
                      ..|.-++.|  .+++|++ +.|..+...|.|.|. .||.+
T Consensus       540 d~C~i~drC--lPN~CehgG~C~Qs~~~f~C~C~~TGY~G  577 (1306)
T KOG3516|consen  540 DMCGISDRC--LPNPCEHGGKCSQSWDDFECNCELTGYKG  577 (1306)
T ss_pred             ccccccccc--CCccccCCCcccccccceeEecccccccc
Confidence            356667788  7899997 899888889999998 68753


No 52 
>KOG0196|consensus
Probab=45.04  E-value=23  Score=30.83  Aligned_cols=51  Identities=20%  Similarity=0.472  Sum_probs=29.1

Q ss_pred             ceeeCCCCcccCCCCCCCccC----ccccCCC---CCCCCCeeeecCCCcEEecCCCC
Q psy7854          98 YRCICNPGYEGDPTGKLCSDQ----NECLMDV---MICSGGTCKNTPGSFQSNFQDSS  148 (153)
Q Consensus        98 ~~C~C~~g~~~~~~g~~C~~i----~~C~~~~---~~C~~~~C~~~~g~~~C~C~~g~  148 (153)
                      -.|.|.+||+....+..|+.-    -.-....   .+|....-...+|.-.|.|..||
T Consensus       259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gy  316 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGY  316 (996)
T ss_pred             CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCc
Confidence            368899999775556666521    1111112   23431111235788899999998


No 53 
>smart00051 DSL delta serrate ligand.
Probab=41.08  E-value=69  Score=18.19  Aligned_cols=45  Identities=20%  Similarity=0.270  Sum_probs=24.3

Q ss_pred             CceeeCCCCcccCCCCCCCccCccccCCCCCCC-CCeeeecCCCcEEecCCCCCcc
Q psy7854          97 TYRCICNPGYEGDPTGKLCSDQNECLMDVMICS-GGTCKNTPGSFQSNFQDSSPIL  151 (153)
Q Consensus        97 ~~~C~C~~g~~~~~~g~~C~~i~~C~~~~~~C~-~~~C~~~~g~~~C~C~~g~~~~  151 (153)
                      .+.-.|.++|    .|..|...  |.. ...+. +..|.. .|  .++|.+||.+.
T Consensus        16 ~~rv~C~~~~----yG~~C~~~--C~~-~~d~~~~~~Cd~-~G--~~~C~~Gw~G~   61 (63)
T smart00051       16 QIRVTCDENY----YGEGCNKF--CRP-RDDFFGHYTCDE-NG--NKGCLEGWMGP   61 (63)
T ss_pred             EEEeeCCCCC----cCCccCCE--eCc-CccccCCccCCc-CC--CEecCCCCcCC
Confidence            3455788888    55566422  321 11122 246632 34  47899999764


No 54 
>PF07359 LEAP-2:  Liver-expressed antimicrobial peptide 2 precursor (LEAP-2);  InterPro: IPR009955 This family consists of several mammalian liver-expressed antimicrobial peptide 2 (LEAP-2) sequences. LEAP-2 is a cysteine-rich, and cationic protein. LEAP-2 contains a core structure with two disulphide bonds formed by cysteine residues in relative 1-3 and 2-4 positions. LEAP-2 is synthesised as a 77-residue precursor, which is predominantly expressed in the liver and highly conserved among mammals. The largest native LEAP-2 form of 40 amino acid residues is generated from the precursor at a putative cleavage site for a furin-like endoprotease. In contrast to smaller LEAP-2 variants, this peptide exhibits dose-dependent antimicrobial activity against selected microbial model organisms []. The exact function of this family is unclear.; GO: 0042742 defense response to bacterium; PDB: 2L1Q_A.
Probab=35.84  E-value=26  Score=20.79  Aligned_cols=25  Identities=36%  Similarity=0.905  Sum_probs=16.7

Q ss_pred             ccCCCCCCCccCccccCCCCCCCCCee
Q psy7854         107 EGDPTGKLCSDQNECLMDVMICSGGTC  133 (153)
Q Consensus       107 ~~~~~g~~C~~i~~C~~~~~~C~~~~C  133 (153)
                      ...|.|..|.|-.||  .+..|.++.|
T Consensus        46 s~kP~GAsCrd~~EC--~T~lCR~~~C   70 (77)
T PF07359_consen   46 SLKPIGASCRDNSEC--STGLCRKGHC   70 (77)
T ss_dssp             SSB-TTSB-SSGGGB--TTS-EETTEE
T ss_pred             cCCcCcchhccchhh--hHHHhcCCcc
Confidence            334578899999999  6788876666


No 55 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=29.60  E-value=34  Score=21.71  Aligned_cols=30  Identities=20%  Similarity=0.443  Sum_probs=20.6

Q ss_pred             ccccCCCCCCC-CCeeeecC-----CCcEEecCCCC
Q psy7854         119 NECLMDVMICS-GGTCKNTP-----GSFQSNFQDSS  148 (153)
Q Consensus       119 ~~C~~~~~~C~-~~~C~~~~-----g~~~C~C~~g~  148 (153)
                      ++|....+.|. ++.|++..     .=|.|.|.+.+
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~   41 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTV   41 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeeccc
Confidence            45666677786 48888762     34889998743


No 56 
>KOG3512|consensus
Probab=28.77  E-value=1.6e+02  Score=24.21  Aligned_cols=15  Identities=33%  Similarity=0.578  Sum_probs=8.6

Q ss_pred             CCCceeeCCCCcccC
Q psy7854          95 MGTYRCICNPGYEGD  109 (153)
Q Consensus        95 ~g~~~C~C~~g~~~~  109 (153)
                      .|.+--.|..||..+
T Consensus       369 aGrhChyCreGyyRd  383 (592)
T KOG3512|consen  369 AGRHCHYCREGYYRD  383 (592)
T ss_pred             CCcccccccCccccC
Confidence            343333588888653


No 57 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=22.98  E-value=28  Score=18.79  Aligned_cols=19  Identities=32%  Similarity=0.901  Sum_probs=12.0

Q ss_pred             eeCCCCcccCCCCCCCccCc
Q psy7854         100 CICNPGYEGDPTGKLCSDQN  119 (153)
Q Consensus       100 C~C~~g~~~~~~g~~C~~i~  119 (153)
                      |.|++||.+... ..|....
T Consensus        35 C~C~~G~v~~~~-~~CV~~~   53 (55)
T PF01826_consen   35 CFCPPGYVRNDN-GRCVPPS   53 (55)
T ss_dssp             EEETTTEEEETT-SEEEEGG
T ss_pred             CCCCCCeeEcCC-CCEEcHH
Confidence            889999976433 3454443


No 58 
>KOG3509|consensus
Probab=21.77  E-value=2.7e+02  Score=25.13  Aligned_cols=60  Identities=30%  Similarity=0.563  Sum_probs=36.9

Q ss_pred             CCCC-CCceeecCCCceeeCCCCcccCCCCCCCcc-CccccCCCCCCCCCeeeecCCCcEEecCCC
Q psy7854          84 GICT-NGACENLMGTYRCICNPGYEGDPTGKLCSD-QNECLMDVMICSGGTCKNTPGSFQSNFQDS  147 (153)
Q Consensus        84 ~~C~-~~~C~~~~g~~~C~C~~g~~~~~~g~~C~~-i~~C~~~~~~C~~~~C~~~~g~~~C~C~~g  147 (153)
                      .+|+ .+.|........|.|++||    .|..|.+ .+.|...++.+.-++|....+...+.|.+|
T Consensus       412 ~p~~~~g~c~p~~~~~~c~c~~g~----~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg  473 (964)
T KOG3509|consen  412 IPCQHDGPCLQTLEGKQCLCPPGY----TGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG  473 (964)
T ss_pred             ccCCCCccccccccccceeccccc----cCchhhccCccccccCCccccceEeccCCCcceeecCC
Confidence            4564 3367777888899999999    4555543 223322333332267766666666678777


Done!