Query         psy5752
Match_columns 222
No_of_seqs    214 out of 2167
Neff          9.1 
Searched_HMMs 46136
Date          Fri Aug 16 17:08:53 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5752.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5752hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.8 6.5E-20 1.4E-24  163.1  15.3  144   10-165   692-864 (1289)
  2 KOG1219|consensus               99.5 1.9E-14 4.1E-19  138.3   9.4  118    5-165  3858-3977(4289)
  3 KOG1214|consensus               99.4 3.3E-12 7.2E-17  114.7   9.8  130   62-192   699-858 (1289)
  4 KOG4289|consensus               99.3 7.1E-12 1.5E-16  117.5   6.4  122    8-165  1177-1316(2531)
  5 KOG1219|consensus               98.9 4.5E-09 9.7E-14  102.6   9.2  107   51-192  3859-3973(4289)
  6 KOG4260|consensus               98.9 6.7E-09 1.4E-13   83.0   7.6  145   38-188   130-301 (350)
  7 KOG4289|consensus               98.7 2.6E-08 5.7E-13   94.3   7.2   69    8-87   1237-1308(2531)
  8 PF07645 EGF_CA:  Calcium-bindi  98.6 4.6E-08   1E-12   57.3   3.8   34  124-158     1-34  (42)
  9 KOG1217|consensus               98.6 1.6E-06 3.5E-11   76.3  13.4  167   11-192   170-388 (487)
 10 KOG1217|consensus               98.5 3.5E-06 7.6E-11   74.1  13.0  155   21-188   243-421 (487)
 11 PF12947 EGF_3:  EGF domain;  I  98.4 1.4E-07   3E-12   53.1   2.2   36  128-164     1-36  (36)
 12 KOG1225|consensus               98.2 2.5E-05 5.5E-10   69.3  11.4   44   37-93    266-309 (525)
 13 KOG4260|consensus               98.1 2.3E-06 4.9E-11   68.7   3.9  130   15-159   150-305 (350)
 14 PF12947 EGF_3:  EGF domain;  I  97.9   5E-06 1.1E-10   46.7   1.7   31   63-95      6-36  (36)
 15 PF07645 EGF_CA:  Calcium-bindi  97.9 3.9E-06 8.5E-11   49.0   1.3   31    9-45      1-34  (42)
 16 smart00179 EGF_CA Calcium-bind  97.9 2.3E-05 4.9E-10   44.6   4.2   38  124-164     1-38  (39)
 17 PF00008 EGF:  EGF-like domain   97.8 4.9E-06 1.1E-10   45.6   0.6   29   14-47      3-31  (32)
 18 smart00179 EGF_CA Calcium-bind  97.8 2.6E-05 5.5E-10   44.3   3.7   33    9-47      1-36  (39)
 19 PF00008 EGF:  EGF-like domain   97.8 2.3E-05   5E-10   42.9   2.7   27  134-160     5-31  (32)
 20 PF12662 cEGF:  Complement Clr-  97.7 2.6E-05 5.7E-10   39.4   2.2   22  148-169     1-24  (24)
 21 PF12662 cEGF:  Complement Clr-  97.6 4.5E-05 9.7E-10   38.6   2.1   22   79-100     1-24  (24)
 22 PF14670 FXa_inhibition:  Coagu  97.5 6.5E-05 1.4E-09   42.1   2.1   32  130-164     3-36  (36)
 23 PF06247 Plasmod_Pvs28:  Plasmo  97.5 5.4E-05 1.2E-09   57.8   1.8  130   16-160     7-162 (197)
 24 cd00054 EGF_CA Calcium-binding  97.4 0.00021 4.6E-09   40.0   3.6   33    9-47      1-35  (38)
 25 KOG1225|consensus               97.4 0.00047   1E-08   61.4   7.0   58   13-93    283-340 (525)
 26 cd00054 EGF_CA Calcium-binding  97.3 0.00049 1.1E-08   38.4   3.9   33  125-159     2-34  (38)
 27 cd00053 EGF Epidermal growth f  97.1  0.0013 2.7E-08   36.1   4.1   27  133-160     6-32  (36)
 28 smart00181 EGF Epidermal growt  96.9  0.0018 3.8E-08   35.7   4.0   26  133-160     6-31  (35)
 29 cd00053 EGF Epidermal growth f  96.7  0.0039 8.4E-08   34.0   3.9   27   63-91      6-32  (36)
 30 smart00181 EGF Epidermal growt  96.6  0.0031 6.8E-08   34.7   3.3   26   15-47      6-31  (35)
 31 PF12946 EGF_MSP1_1:  MSP1 EGF   96.1  0.0046   1E-07   34.5   2.1   31  134-164     6-36  (37)
 32 cd01475 vWA_Matrilin VWA_Matri  96.0   0.008 1.7E-07   48.1   3.7   39  118-159   180-218 (224)
 33 KOG1226|consensus               95.5   0.091   2E-06   48.5   8.7   69   16-103   515-590 (783)
 34 PF07974 EGF_2:  EGF-like domai  95.5   0.023 4.9E-07   30.9   3.1   25   15-47      6-30  (32)
 35 PF12661 hEGF:  Human growth fa  95.4  0.0062 1.4E-07   26.1   0.6   11   37-47      1-11  (13)
 36 PF14670 FXa_inhibition:  Coagu  95.4   0.011 2.4E-07   33.0   1.7   21   69-91     10-30  (36)
 37 PF12946 EGF_MSP1_1:  MSP1 EGF   95.2   0.015 3.2E-07   32.5   1.8   35   59-95      2-36  (37)
 38 KOG1226|consensus               95.2    0.21 4.6E-06   46.3  10.1  120   37-173   479-628 (783)
 39 KOG0994|consensus               94.5     0.1 2.2E-06   50.3   6.4   53   37-93    886-947 (1758)
 40 PF07974 EGF_2:  EGF-like domai  94.0     0.1 2.2E-06   28.3   3.3   23  134-159     7-29  (32)
 41 KOG0994|consensus               93.0    0.42 9.2E-06   46.4   7.6   64   81-163   831-900 (1758)
 42 PHA02887 EGF-like protein; Pro  92.6    0.13 2.9E-06   36.4   2.9   40    9-56     82-125 (126)
 43 PHA03099 epidermal growth fact  90.1    0.24 5.2E-06   35.7   2.2   40    9-56     41-84  (139)
 44 PF06247 Plasmod_Pvs28:  Plasmo  90.0    0.15 3.3E-06   39.3   1.2   75   63-164     6-85  (197)
 45 smart00051 DSL delta serrate l  89.2    0.82 1.8E-05   28.9   3.9   44   37-91     18-61  (63)
 46 cd01475 vWA_Matrilin VWA_Matri  87.8    0.63 1.4E-05   37.1   3.5   36   51-91    183-219 (224)
 47 PF00954 S_locus_glycop:  S-loc  85.9    0.88 1.9E-05   32.0   3.0   32    9-47     76-109 (110)
 48 PF00954 S_locus_glycop:  S-loc  82.1     1.8 3.9E-05   30.4   3.3   30   58-90     79-108 (110)
 49 PF01683 EB:  EB module;  Inter  77.8     3.9 8.4E-05   24.4   3.3   27   11-47     20-48  (52)
 50 PHA03099 epidermal growth fact  75.9     3.2   7E-05   30.0   2.9   32   62-94     50-81  (139)
 51 PHA02887 EGF-like protein; Pro  75.2       4 8.6E-05   29.1   3.1   35   58-93     85-121 (126)
 52 KOG3516|consensus               75.1     2.8   6E-05   41.1   3.1   41    7-56    542-584 (1306)
 53 PF09064 Tme5_EGF_like:  Thromb  63.7     7.5 0.00016   21.2   2.0   19   70-91     11-29  (34)
 54 KOG3514|consensus               62.8     5.5 0.00012   38.9   2.2   35   12-55    625-661 (1591)
 55 KOG0196|consensus               62.7      24 0.00052   33.8   6.2   72   81-172   260-336 (996)
 56 KOG1836|consensus               60.5      22 0.00047   37.1   6.0   49  108-159   758-808 (1705)
 57 KOG3516|consensus               60.3     7.6 0.00016   38.3   2.7   41  121-166   541-582 (1306)
 58 PF12955 DUF3844:  Domain of un  49.3      12 0.00026   26.1   1.6   33  126-158     6-42  (103)
 59 KOG3512|consensus               40.8      34 0.00074   30.6   3.4   14  174-187   527-540 (592)
 60 PF00053 Laminin_EGF:  Laminin   40.1      16 0.00034   21.4   1.0   14   37-53     19-32  (49)
 61 cd00055 EGF_Lam Laminin-type e  39.4      39 0.00084   19.9   2.6   14   37-53     20-33  (50)
 62 KOG3514|consensus               33.8      69  0.0015   31.9   4.5   35  127-166   625-660 (1591)
 63 KOG1836|consensus               33.4      56  0.0012   34.3   4.1   53   37-93    757-811 (1705)
 64 KOG3509|consensus               29.8 1.3E+02  0.0029   29.6   5.7   66   10-87    406-473 (964)
 65 PF02740 Colipase_C:  Colipase,  26.7      50  0.0011   19.1   1.5   22   21-45      3-24  (45)
 66 KOG1215|consensus               24.6 3.6E+02  0.0078   26.3   7.9   62  126-190   365-426 (877)
 67 smart00180 EGF_Lam Laminin-typ  23.8      61  0.0013   18.7   1.6   14   37-53     19-32  (46)
 68 KOG0196|consensus               23.3      93   0.002   30.1   3.4   46   37-89    260-317 (996)
 69 PF01826 TIL:  Trypsin Inhibito  22.0      47   0.001   19.8   0.9   19   82-100    35-54  (55)
 70 PF11403 Yeast_MT:  Yeast metal  20.6      94   0.002   16.8   1.7    9   37-45     23-31  (40)

No 1  
>KOG1214|consensus
Probab=99.84  E-value=6.5e-20  Score=163.11  Aligned_cols=144  Identities=30%  Similarity=0.805  Sum_probs=122.7

Q ss_pred             CCCC---CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCC-CCCCCCCCEeeecCCCCCeEEecC
Q psy5752          10 LSEC---DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNV-VKNCHANAECVYNATSAGYRCQCA   85 (222)
Q Consensus        10 ~~~C---~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~-~~~C~~~~~C~~~n~~g~~~C~C~   85 (222)
                      +++|   +.-|.-++.|...     .+..|.|.|..||.|+++ .|.+.++ |+. .+.|+.+++|+  |.+++|+|+|.
T Consensus       692 ~npCy~gsh~cdt~a~C~pg-----~~~~~tcecs~g~~gdgr-~c~d~~e-ca~~~~~CGp~s~Ci--n~pg~~rceC~  762 (1289)
T KOG1214|consen  692 VNPCYDGSHMCDTTARCHPG-----TGVDYTCECSSGYQGDGR-NCVDENE-CATGFHRCGPNSVCI--NLPGSYRCECR  762 (1289)
T ss_pred             cccceecCcccCCCccccCC-----CCcceEEEEeeccCCCCC-CCCChhh-hccCCCCCCCCceee--cCCCceeEEEe
Confidence            5667   6778888999984     555669999999999998 9998884 987 78999999997  99999999999


Q ss_pred             CCCc--cCCCeeec------CCCcCCCCCCCCC-----------------CCCCcccCCCccccCCCCCCCCCCCCCCCe
Q psy5752          86 QGYV--GNGVECHP------LKSCLEDRSLCGK-----------------DSQGYVGNGVECHPLKSCLEDRSLCGKDAS  140 (222)
Q Consensus        86 ~G~~--~~~~~C~~------~~~C~~~~~~C~~-----------------c~~g~~~~g~~C~~id~C~~~~~~C~~~~~  140 (222)
                      .||.  +++.+|..      ++.|.++.+.|..                 |.+||.+||..|.++|+|..  ..|++.+.
T Consensus       763 ~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~p--srChp~A~  840 (1289)
T KOG1214|consen  763 SGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSP--SRCHPAAT  840 (1289)
T ss_pred             ecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccCc--cccCCCce
Confidence            9986  77788874      4668877666643                 78999999999999999984  45999999


Q ss_pred             eeeCCCCeeEeecCCCCccCCCccc
Q psy5752         141 CVVASQGHFHCECNEGFTGNGITCK  165 (222)
Q Consensus       141 C~n~~~g~~~C~C~~G~~gd~~~C~  165 (222)
                      |.|++ |+|.|+|.+||.|||..|.
T Consensus       841 Cyntp-gsfsC~C~pGy~GDGf~CV  864 (1289)
T KOG1214|consen  841 CYNTP-GSFSCRCQPGYYGDGFQCV  864 (1289)
T ss_pred             EecCC-CcceeecccCccCCCceec
Confidence            99998 6699999999999987663


No 2  
>KOG1219|consensus
Probab=99.54  E-value=1.9e-14  Score=138.34  Aligned_cols=118  Identities=31%  Similarity=0.816  Sum_probs=98.8

Q ss_pred             CCccCC-CCC-CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEE
Q psy5752           5 PIRFSL-SEC-DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRC   82 (222)
Q Consensus         5 ~~~~~~-~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C   82 (222)
                      |.|.-. ++| .+||+|+|+|...+    .++| .|.|++.|.|.   .|+...++|.+ .+|..+++|+  ...++|.|
T Consensus      3858 pgC~l~~d~C~~npCqhgG~C~~~~----~ggy-~CkCpsqysG~---~CEi~~epC~s-nPC~~GgtCi--p~~n~f~C 3926 (4289)
T KOG1219|consen 3858 PGCSLLTDPCNDNPCQHGGTCISQP----KGGY-KCKCPSQYSGN---HCEIDLEPCAS-NPCLTGGTCI--PFYNGFLC 3926 (4289)
T ss_pred             ccccccccccccCcccCCCEecCCC----CCce-EEeCcccccCc---ccccccccccC-CCCCCCCEEE--ecCCCeeE
Confidence            345433 899 89999999999975    5666 99999999999   99988888998 9999999997  67889999


Q ss_pred             ecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCC
Q psy5752          83 QCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGI  162 (222)
Q Consensus        83 ~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~  162 (222)
                      .|+.||+|..++  .                         ..+++|+.+.  |.++|.|+|.. |+|.|.|.+||.|  .
T Consensus      3927 nC~~gyTG~~Ce--~-------------------------~Gi~eCs~n~--C~~gg~C~n~~-gsf~CncT~g~~g--r 3974 (4289)
T KOG1219|consen 3927 NCPNGYTGKRCE--A-------------------------RGISECSKNV--CGTGGQCINIP-GSFHCNCTPGILG--R 3974 (4289)
T ss_pred             eCCCCccCceee--c-------------------------cccccccccc--ccCCceeeccC-CceEeccChhHhc--c
Confidence            999999988543  1                         0278887655  99999999997 6699999999998  4


Q ss_pred             ccc
Q psy5752         163 TCK  165 (222)
Q Consensus       163 ~C~  165 (222)
                      .|.
T Consensus      3975 ~c~ 3977 (4289)
T KOG1219|consen 3975 TCC 3977 (4289)
T ss_pred             cCc
Confidence            543


No 3  
>KOG1214|consensus
Probab=99.36  E-value=3.3e-12  Score=114.66  Aligned_cols=130  Identities=26%  Similarity=0.595  Sum_probs=106.9

Q ss_pred             CCCCCCCCEeeecCCCCCeEEecCCCCccCCCeeecCCCcCCCCCCCCC--------------CCCCc--ccCCCcccc-
Q psy5752          62 VKNCHANAECVYNATSAGYRCQCAQGYVGNGVECHPLKSCLEDRSLCGK--------------DSQGY--VGNGVECHP-  124 (222)
Q Consensus        62 ~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~--------------c~~g~--~~~g~~C~~-  124 (222)
                      .+.|..++.|+ ..+.-.|+|.|..||.|+++.|.++++|.+.++.|+.              |..||  ..++.+|.. 
T Consensus       699 sh~cdt~a~C~-pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i  777 (1289)
T KOG1214|consen  699 SHMCDTTARCH-PGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLI  777 (1289)
T ss_pred             CcccCCCcccc-CCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEe
Confidence            36788888897 2555689999999999999999999999999888865              44555  467778874 


Q ss_pred             -----CCCCCCCCCCCCC--CCeeeeCCCCeeEeecCCCCccCCCccccCCCCC------CCeeeecCCcEEEEcCCCCC
Q psy5752         125 -----LKSCLEDRSLCGK--DASCVVASQGHFHCECNEGFTGNGITCKPVRKKE------SDFLLVNQGMFMLRVPYQPT  191 (222)
Q Consensus       125 -----id~C~~~~~~C~~--~~~C~n~~~g~~~C~C~~G~~gd~~~C~~~~~~~------~~~~~~~~g~~~~~~~~~~~  191 (222)
                           ++.|.++.+.|.-  .+.|+...+++|.|.|.+||.|||..|.++++|.      .++|.++.|++.|+|..++.
T Consensus       778 ~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~  857 (1289)
T KOG1214|consen  778 TPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYY  857 (1289)
T ss_pred             cCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccCccccCCCceEecCCCcceeecccCcc
Confidence                 4678888788964  4567788788899999999999999999998875      45799999999999987666


Q ss_pred             C
Q psy5752         192 R  192 (222)
Q Consensus       192 ~  192 (222)
                      +
T Consensus       858 G  858 (1289)
T KOG1214|consen  858 G  858 (1289)
T ss_pred             C
Confidence            4


No 4  
>KOG4289|consensus
Probab=99.26  E-value=7.1e-12  Score=117.46  Aligned_cols=122  Identities=29%  Similarity=0.695  Sum_probs=92.0

Q ss_pred             cCCCCC-CCCCCCCCeecCCCC-----------------CCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCC
Q psy5752           8 FSLSEC-DTPCHPNAQCTSPDE-----------------FNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANA   69 (222)
Q Consensus         8 ~~~~~C-~~~C~~~~~C~~~~~-----------------~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~   69 (222)
                      +|-+.| ..||.|..+|+.+-.                 -++.+++ .|+|++||+|+   .|+...++|.+ .+|.+++
T Consensus      1177 fdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvngl-rCrCPpGFTgd---~CeTeiDlCYs-~pC~nng 1251 (2531)
T KOG4289|consen 1177 FDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGL-RCRCPPGFTGD---YCETEIDLCYS-GPCGNNG 1251 (2531)
T ss_pred             ccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCce-eEeCCCCCCcc---cccchhHhhhc-CCCCCCC
Confidence            455678 889999998875300                 0113445 89999999999   99977778998 9999999


Q ss_pred             EeeecCCCCCeEEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCee
Q psy5752          70 ECVYNATSAGYRCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHF  149 (222)
Q Consensus        70 ~C~~~n~~g~~~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~  149 (222)
                      .|.  ...|+|+|.|.+||+|..++- +                         ...-.|.  |+.|.++++|+|...|.|
T Consensus      1252 ~C~--srEggYtCeCrpg~tGehCEv-s-------------------------~~agrCv--pGvC~nggtC~~~~nggf 1301 (2531)
T KOG4289|consen 1252 RCR--SREGGYTCECRPGFTGEHCEV-S-------------------------ARAGRCV--PGVCKNGGTCVNLLNGGF 1301 (2531)
T ss_pred             ceE--EecCceeEEecCCccccceee-e-------------------------cccCccc--cceecCCCEEeecCCCce
Confidence            996  788999999999999885430 0                         0122344  567999999999988889


Q ss_pred             EeecCCCCccCCCccc
Q psy5752         150 HCECNEGFTGNGITCK  165 (222)
Q Consensus       150 ~C~C~~G~~gd~~~C~  165 (222)
                      .|.|+.| .-.+..|+
T Consensus      1302 ~c~Cp~g-e~e~prC~ 1316 (2531)
T KOG4289|consen 1302 CCHCPYG-EFEDPRCE 1316 (2531)
T ss_pred             eccCCCc-ccCCCceE
Confidence            9999998 32224555


No 5  
>KOG1219|consensus
Probab=98.92  E-value=4.5e-09  Score=102.64  Aligned_cols=107  Identities=25%  Similarity=0.624  Sum_probs=84.0

Q ss_pred             CcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCC
Q psy5752          51 ECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLE  130 (222)
Q Consensus        51 ~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~  130 (222)
                      .|....+.|.. .+|++++.|. ....++|.|+|++-|+|+.++                            .++.+|..
T Consensus      3859 gC~l~~d~C~~-npCqhgG~C~-~~~~ggy~CkCpsqysG~~CE----------------------------i~~epC~s 3908 (4289)
T KOG1219|consen 3859 GCSLLTDPCND-NPCQHGGTCI-SQPKGGYKCKCPSQYSGNHCE----------------------------IDLEPCAS 3908 (4289)
T ss_pred             ccccccccccc-CcccCCCEec-CCCCCceEEeCcccccCcccc----------------------------cccccccC
Confidence            45544346888 9999999997 355678999999999888432                            25788987


Q ss_pred             CCCCCCCCCeeeeCCCCeeEeecCCCCccCCCcccc--CCCC------CCCeeeecCCcEEEEcCCCCCC
Q psy5752         131 DRSLCGKDASCVVASQGHFHCECNEGFTGNGITCKP--VRKK------ESDFLLVNQGMFMLRVPYQPTR  192 (222)
Q Consensus       131 ~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C~~--~~~~------~~~~~~~~~g~~~~~~~~~~~~  192 (222)
                      +|  |..+++|+...+| |.|.|+.||+|  .+|+.  +++|      .++.|.+..|+|.|.|..+..+
T Consensus      3909 nP--C~~GgtCip~~n~-f~CnC~~gyTG--~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~g 3973 (4289)
T KOG1219|consen 3909 NP--CLTGGTCIPFYNG-FLCNCPNGYTG--KRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILG 3973 (4289)
T ss_pred             CC--CCCCCEEEecCCC-eeEeCCCCccC--ceeecccccccccccccCCceeeccCCceEeccChhHhc
Confidence            76  9999999998766 99999999998  56653  3444      4667999999999999865553


No 6  
>KOG4260|consensus
Probab=98.88  E-value=6.7e-09  Score=82.98  Aligned_cols=145  Identities=23%  Similarity=0.475  Sum_probs=100.2

Q ss_pred             eecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeee-cCCCCCeEEecCCCCccCCCe-e----------------ecC-
Q psy5752          38 CICNFGYTGDGVTECNPESLGCNVVKNCHANAECVY-NATSAGYRCQCAQGYVGNGVE-C----------------HPL-   98 (222)
Q Consensus        38 C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~-~n~~g~~~C~C~~G~~~~~~~-C----------------~~~-   98 (222)
                      --|++|-.|.+--.|....     ..+|.-++.|.- -...|+-.|.|..||.|+.+. |                ..- 
T Consensus       130 vCCp~gtyGpdCl~Cpggs-----er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch  204 (350)
T KOG4260|consen  130 VCCPDGTYGPDCLQCPGGS-----ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACH  204 (350)
T ss_pred             eccCCCCcCCccccCCCCC-----cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhh
Confidence            3478888887222333322     267877888841 012456789999999987532 1                110 


Q ss_pred             ----CCcC-CCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCc---cccCCCC
Q psy5752          99 ----KSCL-EDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGIT---CKPVRKK  170 (222)
Q Consensus        99 ----~~C~-~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~---C~~~~~~  170 (222)
                          .-|. ..+..|..|..||..+-..|.|||||...+..|.....|+|+. |+|+|.+.+||.++...   |.++-..
T Consensus       205 ~~C~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNte-GSf~C~dk~Gy~~g~d~C~~~~d~~~~  283 (350)
T KOG4260|consen  205 EGCLGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTE-GSFKCEDKEGYKKGVDECQFCADVCAS  283 (350)
T ss_pred             hhhhcccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCC-CceEecccccccCChHHhhhhhhhccc
Confidence                0122 2345677789999988889999999998888898888999996 77999999999973222   3344444


Q ss_pred             CCCeeeecCCcEEEEcCC
Q psy5752         171 ESDFLLVNQGMFMLRVPY  188 (222)
Q Consensus       171 ~~~~~~~~~g~~~~~~~~  188 (222)
                      ....|.++.+.+.+.+..
T Consensus       284 kn~~c~ni~~~~r~v~f~  301 (350)
T KOG4260|consen  284 KNRPCMNIDGQYRCVCFS  301 (350)
T ss_pred             CCCCcccCCccEEEEecc
Confidence            556789999999888764


No 7  
>KOG4289|consensus
Probab=98.73  E-value=2.6e-08  Score=94.30  Aligned_cols=69  Identities=30%  Similarity=0.788  Sum_probs=58.6

Q ss_pred             cCCCCC-CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCC--CCCCCCCCCCCCCEeeecCCCCCeEEec
Q psy5752           8 FSLSEC-DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPES--LGCNVVKNCHANAECVYNATSAGYRCQC   84 (222)
Q Consensus         8 ~~~~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~--~~C~~~~~C~~~~~C~~~n~~g~~~C~C   84 (222)
                      ++||.| +.||.+|+.|..  .   .|+| .|.|.+||+|.   .|+...  ..|.+ ..|.++++|+ ....|+|.|.|
T Consensus      1237 TeiDlCYs~pC~nng~C~s--r---EggY-tCeCrpg~tGe---hCEvs~~agrCvp-GvC~nggtC~-~~~nggf~c~C 1305 (2531)
T KOG4289|consen 1237 TEIDLCYSGPCGNNGRCRS--R---EGGY-TCECRPGFTGE---HCEVSARAGRCVP-GVCKNGGTCV-NLLNGGFCCHC 1305 (2531)
T ss_pred             chhHhhhcCCCCCCCceEE--e---cCce-eEEecCCcccc---ceeeecccCcccc-ceecCCCEEe-ecCCCceeccC
Confidence            368999 999999999999  4   7888 99999999999   998443  23776 9999999998 45678899999


Q ss_pred             CCC
Q psy5752          85 AQG   87 (222)
Q Consensus        85 ~~G   87 (222)
                      +.|
T Consensus      1306 p~g 1308 (2531)
T KOG4289|consen 1306 PYG 1308 (2531)
T ss_pred             CCc
Confidence            988


No 8  
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.63  E-value=4.6e-08  Score=57.27  Aligned_cols=34  Identities=26%  Similarity=0.845  Sum_probs=31.0

Q ss_pred             cCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCc
Q psy5752         124 PLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFT  158 (222)
Q Consensus       124 ~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~  158 (222)
                      |||||+..++.|..++.|+|+. |+|+|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~-Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTE-GSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEET-TEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCC-CCEEeeCCCCcE
Confidence            6899998888998889999997 669999999998


No 9  
>KOG1217|consensus
Probab=98.55  E-value=1.6e-06  Score=76.30  Aligned_cols=167  Identities=29%  Similarity=0.706  Sum_probs=112.2

Q ss_pred             CCC---CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccC-------------------CCCCCC-CCCCCC
Q psy5752          11 SEC---DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPE-------------------SLGCNV-VKNCHA   67 (222)
Q Consensus        11 ~~C---~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~-------------------~~~C~~-~~~C~~   67 (222)
                      ++|   ..+|.+++.|.+.     .+.| .|.|++||.+.   .|+..                   ...|.. ...|..
T Consensus       170 ~~C~~~~~~c~~~~~C~~~-----~~~~-~C~c~~~~~~~---~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~  240 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNT-----GGSY-LCSCPPGYTGS---TCETTGNGGTCVDSVACSCPPGARGPECEVSIVECAS  240 (487)
T ss_pred             cccccCCCCcCCCcccccC-----CCCe-eEeCCCCccCC---cCcCCCCCceEecceeccCCCCCCCCCcccccccccC
Confidence            577   3459999999994     4555 99999999987   33322                   111222 233433


Q ss_pred             C-CEeeecCCCCCeEEecCCCCccCC-CeeecCCCcCCCCCCCCC--------------CCCCcccCCC-ccccCCCCCC
Q psy5752          68 N-AECVYNATSAGYRCQCAQGYVGNG-VECHPLKSCLEDRSLCGK--------------DSQGYVGNGV-ECHPLKSCLE  130 (222)
Q Consensus        68 ~-~~C~~~n~~g~~~C~C~~G~~~~~-~~C~~~~~C~~~~~~C~~--------------c~~g~~~~g~-~C~~id~C~~  130 (222)
                      . +.|+  +..++|+|.|++||.+.. ..+.++++|..... |.+              |..||.+..- .+.+..+|..
T Consensus       241 ~~~~c~--~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~  317 (487)
T KOG1217|consen  241 GDGTCV--NTVGSYTCRCPEGYTGDACVTCVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSP  317 (487)
T ss_pred             CCCccc--ccCCceeeeCCCCccccccceeeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccc
Confidence            3 7885  888899999999999887 57788888876543 322              6788876642 4556678852


Q ss_pred             --CCCCCCCCCeee--eCCCCeeEeecCCCCccCCCccccCC-CC------CCCeeee-cCCcEEEEcCCCCCC
Q psy5752         131 --DRSLCGKDASCV--VASQGHFHCECNEGFTGNGITCKPVR-KK------ESDFLLV-NQGMFMLRVPYQPTR  192 (222)
Q Consensus       131 --~~~~C~~~~~C~--n~~~g~~~C~C~~G~~gd~~~C~~~~-~~------~~~~~~~-~~g~~~~~~~~~~~~  192 (222)
                        ....|.++++|.  +.. +.+.|.|..||.|  ..|+... ++      ....+.+ +.+.+.|.++....+
T Consensus       318 ~~~~~~c~~g~~C~~~~~~-~~~~C~c~~~~~g--~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~  388 (487)
T KOG1217|consen  318 RNAGGPCANGGTCNTLGSF-GGFRCACGPGFTG--RRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG  388 (487)
T ss_pred             cccCCcCCCCcccccCCCC-CCCCcCCCCCCCC--CccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence              344588777882  222 3478999999776  6666552 33      3445777 789999998876443


No 10 
>KOG1217|consensus
Probab=98.46  E-value=3.5e-06  Score=74.13  Aligned_cols=155  Identities=26%  Similarity=0.638  Sum_probs=108.3

Q ss_pred             CeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCC-eeecCC
Q psy5752          21 AQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGV-ECHPLK   99 (222)
Q Consensus        21 ~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~-~C~~~~   99 (222)
                      +.|++.     .+.+ .|.|++||++.....|.+++. |.....|.+++.|+  +..+.|.|.|++||.+... .+.+..
T Consensus       243 ~~c~~~-----~~~~-~C~~~~g~~~~~~~~~~~~~~-C~~~~~c~~~~~C~--~~~~~~~C~C~~g~~g~~~~~~~~~~  313 (487)
T KOG1217|consen  243 GTCVNT-----VGSY-TCRCPEGYTGDACVTCVDVDS-CALIASCPNGGTCV--NVPGSYRCTCPPGFTGRLCTECVDVD  313 (487)
T ss_pred             Cccccc-----CCce-eeeCCCCccccccceeeeccc-cCCCCccCCCCeee--cCCCcceeeCCCCCCCCCCccccccc
Confidence            789884     5566 999999999984114666664 88722388899996  7888899999999998875 556666


Q ss_pred             CcCC--CCC------CC--------CCCCCCcccCCCccccC-CCCCCCCCCCCCCCeeee-CCCCeeEeecCCCCcc--
Q psy5752         100 SCLE--DRS------LC--------GKDSQGYVGNGVECHPL-KSCLEDRSLCGKDASCVV-ASQGHFHCECNEGFTG--  159 (222)
Q Consensus       100 ~C~~--~~~------~C--------~~c~~g~~~~g~~C~~i-d~C~~~~~~C~~~~~C~n-~~~g~~~C~C~~G~~g--  159 (222)
                      +|..  ...      .|        ..|..++...+..|+.. ++|...+  +..++.|++ .. ++|.|.|+.+|.+  
T Consensus       314 ~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~--~~~~~~c~~~~~-~~~~c~~~~~~~~~~  390 (487)
T KOG1217|consen  314 ECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSP--CCPGGTCVNETP-GSYRCACPAGFAGKA  390 (487)
T ss_pred             cccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCc--cccCCEeccCCC-CCeEecCCCccccCC
Confidence            7752  111      22        11222222466689888 4998655  777889998 54 5599999999885  


Q ss_pred             --CCCccccCCCCC-CCeeeecCCcEEEEcCC
Q psy5752         160 --NGITCKPVRKKE-SDFLLVNQGMFMLRVPY  188 (222)
Q Consensus       160 --d~~~C~~~~~~~-~~~~~~~~g~~~~~~~~  188 (222)
                        ++..+.+++++. ...+++..+.+.|. +.
T Consensus       391 ~~~~~~~~~~~~c~~~~~c~~~~~~~~c~-~~  421 (487)
T KOG1217|consen  391 NGDGVGCEDIDECSGCGDCVNGPGGGACT-PP  421 (487)
T ss_pred             ccccccccccccccCCcceeccCCCCccc-cC
Confidence              345677777775 34466667777766 55


No 11 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.43  E-value=1.4e-07  Score=53.07  Aligned_cols=36  Identities=33%  Similarity=0.995  Sum_probs=27.8

Q ss_pred             CCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCcc
Q psy5752         128 CLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITC  164 (222)
Q Consensus       128 C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C  164 (222)
                      |+.+++.|+.+|+|+++++ +|.|+|.+||.|||..|
T Consensus         1 C~~~~~~C~~nA~C~~~~~-~~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    1 CLENNGGCHPNATCTNTGG-SYTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             TTTGGGGS-TTCEEEE-TT-SEEEEE-CEEECCSTCE
T ss_pred             CCCCCCCCCCCcEeecCCC-CEEeECCCCCccCCcCC
Confidence            4455677999999999975 69999999999999876


No 12 
>KOG1225|consensus
Probab=98.17  E-value=2.5e-05  Score=69.29  Aligned_cols=44  Identities=41%  Similarity=1.115  Sum_probs=29.8

Q ss_pred             eeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCC
Q psy5752          37 KCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGV   93 (222)
Q Consensus        37 ~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~   93 (222)
                      .|+|++||+|.   .|....  |..  .|+.+..++    .+  .|.|++||.|..+
T Consensus       266 ~CIC~~Gf~G~---dC~e~~--Cp~--~cs~~g~~~----~g--~CiC~~g~~G~dC  309 (525)
T KOG1225|consen  266 RCICPPGFTGD---DCDELV--CPV--DCSGGGVCV----DG--ECICNPGYSGKDC  309 (525)
T ss_pred             eEeCCCCCcCC---CCCccc--CCc--ccCCCceec----CC--EeecCCCcccccc
Confidence            68888888888   776643  654  376666563    22  7888888877644


No 13 
>KOG4260|consensus
Probab=98.14  E-value=2.3e-06  Score=68.73  Aligned_cols=130  Identities=25%  Similarity=0.648  Sum_probs=82.7

Q ss_pred             CCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccC-----CCC----CCC-CCCCCCCCEeeecCCCCCeEE-e
Q psy5752          15 TPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPE-----SLG----CNV-VKNCHANAECVYNATSAGYRC-Q   83 (222)
Q Consensus        15 ~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~-----~~~----C~~-~~~C~~~~~C~~~n~~g~~~C-~   83 (222)
                      .||.-+|.|....++  .|+. +|.|.+||+|.--+.|...     .++    |.. ...|  .+.|   ....+-.| +
T Consensus       150 r~C~GnG~C~GdGsR--~GsG-kCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C--~~~C---sg~~~k~C~k  221 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSR--EGSG-KCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC--LGVC---SGESSKGCSK  221 (350)
T ss_pred             CCcCCCCcccCCCCC--CCCC-cccccCCCCCccccccchHHHHhhcccccchhhhhhhhh--hccc---CCCCCCChhh
Confidence            578888899875441  2455 8999999999822222211     000    111 1233  2256   34444455 6


Q ss_pred             cCCCCccCCCeeecCCCcCCCCCCCCC--------------CCCCcccCCCccccCCCCCCCCCCCC-CCCeeeeCCCCe
Q psy5752          84 CAQGYVGNGVECHPLKSCLEDRSLCGK--------------DSQGYVGNGVECHPLKSCLEDRSLCG-KDASCVVASQGH  148 (222)
Q Consensus        84 C~~G~~~~~~~C~~~~~C~~~~~~C~~--------------c~~g~~~~g~~C~~id~C~~~~~~C~-~~~~C~n~~~g~  148 (222)
                      |+.||..+...|.|||+|+..+..|..              +.+||..      ++|+|..-...|. .+..|+|+. +.
T Consensus       222 CkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~------g~d~C~~~~d~~~~kn~~c~ni~-~~  294 (350)
T KOG4260|consen  222 CKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK------GVDECQFCADVCASKNRPCMNID-GQ  294 (350)
T ss_pred             hcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccC------ChHHhhhhhhhcccCCCCcccCC-cc
Confidence            999999888889999999988877754              1233322      2677765333453 356899997 55


Q ss_pred             eEeecCCCCcc
Q psy5752         149 FHCECNEGFTG  159 (222)
Q Consensus       149 ~~C~C~~G~~g  159 (222)
                      |+|.|..|+.-
T Consensus       295 ~r~v~f~~~~~  305 (350)
T KOG4260|consen  295 YRCVCFSGLII  305 (350)
T ss_pred             EEEEeccccee
Confidence            99999999873


No 14 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.94  E-value=5e-06  Score=46.73  Aligned_cols=31  Identities=45%  Similarity=1.115  Sum_probs=24.8

Q ss_pred             CCCCCCCEeeecCCCCCeEEecCCCCccCCCee
Q psy5752          63 KNCHANAECVYNATSAGYRCQCAQGYVGNGVEC   95 (222)
Q Consensus        63 ~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C   95 (222)
                      +.|+.+|+|+  ++.++|.|+|++||.|++..|
T Consensus         6 ~~C~~nA~C~--~~~~~~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    6 GGCHPNATCT--NTGGSYTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             GGS-TTCEEE--E-TTSEEEEE-CEEECCSTCE
T ss_pred             CCCCCCcEee--cCCCCEEeECCCCCccCCcCC
Confidence            6788899997  888899999999999998754


No 15 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.93  E-value=3.9e-06  Score=49.00  Aligned_cols=31  Identities=29%  Similarity=0.775  Sum_probs=27.7

Q ss_pred             CCCCC---CCCCCCCCeecCCCCCCCCCCceeeecCCCCc
Q psy5752           9 SLSEC---DTPCHPNAQCTSPDEFNESREQAKCICNFGYT   45 (222)
Q Consensus         9 ~~~~C---~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~   45 (222)
                      |||||   .+.|..++.|+|  +   .|+| .|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N--~---~Gsy-~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVN--T---EGSY-SCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEE--E---TTEE-EEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEc--C---CCCE-EeeCCCCcE
Confidence            78999   468998999999  4   7888 999999998


No 16 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.90  E-value=2.3e-05  Score=44.56  Aligned_cols=38  Identities=29%  Similarity=0.882  Sum_probs=29.0

Q ss_pred             cCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCcc
Q psy5752         124 PLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITC  164 (222)
Q Consensus       124 ~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C  164 (222)
                      ++++|... ..|.++++|+++.++ |.|.|++||. ++..|
T Consensus         1 d~~~C~~~-~~C~~~~~C~~~~g~-~~C~C~~g~~-~g~~C   38 (39)
T smart00179        1 DIDECASG-NPCQNGGTCVNTVGS-YRCECPPGYT-DGRNC   38 (39)
T ss_pred             CcccCcCC-CCcCCCCEeECCCCC-eEeECCCCCc-cCCcC
Confidence            36778753 459888899999754 9999999998 33555


No 17 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.85  E-value=4.9e-06  Score=45.57  Aligned_cols=29  Identities=31%  Similarity=0.762  Sum_probs=24.3

Q ss_pred             CCCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752          14 DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGD   47 (222)
Q Consensus        14 ~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~   47 (222)
                      ++||.|+|+|+...    .+.| +|.|++||+|+
T Consensus         3 ~~~C~n~g~C~~~~----~~~y-~C~C~~G~~G~   31 (32)
T PF00008_consen    3 SNPCQNGGTCIDLP----GGGY-TCECPPGYTGK   31 (32)
T ss_dssp             TTSSTTTEEEEEES----TSEE-EEEEBTTEEST
T ss_pred             CCcCCCCeEEEeCC----CCCE-EeECCCCCccC
Confidence            67999999999942    2566 99999999986


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.84  E-value=2.6e-05  Score=44.34  Aligned_cols=33  Identities=33%  Similarity=0.916  Sum_probs=27.8

Q ss_pred             CCCCC-C-CCCCCCCeecCCCCCCCCCCceeeecCCCCc-cC
Q psy5752           9 SLSEC-D-TPCHPNAQCTSPDEFNESREQAKCICNFGYT-GD   47 (222)
Q Consensus         9 ~~~~C-~-~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~-g~   47 (222)
                      ++++| . .+|.++++|++.     .++| .|.|++||. |.
T Consensus         1 d~~~C~~~~~C~~~~~C~~~-----~g~~-~C~C~~g~~~g~   36 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNT-----VGSY-RCECPPGYTDGR   36 (39)
T ss_pred             CcccCcCCCCcCCCCEeECC-----CCCe-EeECCCCCccCC
Confidence            57899 5 799999999994     6666 999999999 65


No 19 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.78  E-value=2.3e-05  Score=42.87  Aligned_cols=27  Identities=33%  Similarity=0.960  Sum_probs=23.6

Q ss_pred             CCCCCCeeeeCCCCeeEeecCCCCccC
Q psy5752         134 LCGKDASCVVASQGHFHCECNEGFTGN  160 (222)
Q Consensus       134 ~C~~~~~C~n~~~g~~~C~C~~G~~gd  160 (222)
                      +|.++++|++...+.|.|.|++||+|+
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            599999999987466999999999984


No 20 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.74  E-value=2.6e-05  Score=39.44  Aligned_cols=22  Identities=23%  Similarity=0.915  Sum_probs=19.4

Q ss_pred             eeEeecCCCCc--cCCCccccCCC
Q psy5752         148 HFHCECNEGFT--GNGITCKPVRK  169 (222)
Q Consensus       148 ~~~C~C~~G~~--gd~~~C~~~~~  169 (222)
                      +|+|+|++||.  .|+.+|.+|++
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            49999999999  57889999975


No 21 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.63  E-value=4.5e-05  Score=38.59  Aligned_cols=22  Identities=32%  Similarity=0.973  Sum_probs=16.8

Q ss_pred             CeEEecCCCCc--cCCCeeecCCC
Q psy5752          79 GYRCQCAQGYV--GNGVECHPLKS  100 (222)
Q Consensus        79 ~~~C~C~~G~~--~~~~~C~~~~~  100 (222)
                      ||+|+|++||.  .++..|+||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            68999999998  45666666654


No 22 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.53  E-value=6.5e-05  Score=42.11  Aligned_cols=32  Identities=28%  Similarity=0.782  Sum_probs=23.1

Q ss_pred             CCCCCCCCCCeeeeCCCCeeEeecCCCCcc--CCCcc
Q psy5752         130 EDRSLCGKDASCVVASQGHFHCECNEGFTG--NGITC  164 (222)
Q Consensus       130 ~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g--d~~~C  164 (222)
                      ...+.|.+  .|+++++ +|+|.|++||..  |+++|
T Consensus         3 ~~NGgC~h--~C~~~~g-~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    3 VNNGGCSH--ICVNTPG-SYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             TGGGGSSS--EEEEETT-SEEEE-STTEEE-TTSSSE
T ss_pred             CCCCCcCC--CCccCCC-ceEeECCCCCEECcCCCCC
Confidence            34566865  8999975 599999999994  55554


No 23 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.48  E-value=5.4e-05  Score=57.81  Aligned_cols=130  Identities=26%  Similarity=0.674  Sum_probs=72.1

Q ss_pred             CCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCC----CCCCCCCCEeeec---CCCCCeEEecCCCC
Q psy5752          16 PCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNV----VKNCHANAECVYN---ATSAGYRCQCAQGY   88 (222)
Q Consensus        16 ~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~----~~~C~~~~~C~~~---n~~g~~~C~C~~G~   88 (222)
                      .|.| |.-+.      +...|.|.|.+||.-...++|+...+ |..    ..+|+..+.|+..   .....|.|.|.+||
T Consensus         7 ~CKN-G~LiQ------MSNHfEC~Cnegfvl~~EntCE~kv~-C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY   78 (197)
T PF06247_consen    7 ICKN-GYLIQ------MSNHFECKCNEGFVLKNENTCEEKVE-CDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGY   78 (197)
T ss_dssp             --BT-EEEEE------ESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTE
T ss_pred             cccC-CEEEE------ccCceEEEcCCCcEEcccccccccee-cCcccccCccccchhhhhcCCCcccceeEEEecccCc
Confidence            4553 45554      45556999999998544458988774 764    4789999999731   12367999999999


Q ss_pred             ccCCCeeecCCCcCCCCCCCCC----------------CCCCcc-cCCCcccc--CCCCCCCCCCCCCCCeeeeCCCCee
Q psy5752          89 VGNGVECHPLKSCLEDRSLCGK----------------DSQGYV-GNGVECHP--LKSCLEDRSLCGKDASCVVASQGHF  149 (222)
Q Consensus        89 ~~~~~~C~~~~~C~~~~~~C~~----------------c~~g~~-~~g~~C~~--id~C~~~~~~C~~~~~C~n~~~g~~  149 (222)
                      +.....|.+. .|..  -.|..                |.-|+. .+...|..  ..+|++   .|..+..|.... +.|
T Consensus        79 ~~~~~vCvp~-~C~~--~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~L---KCk~nE~CK~~~-~~Y  151 (197)
T PF06247_consen   79 ILKQGVCVPN-KCNN--KDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSL---KCKENEECKLVD-GYY  151 (197)
T ss_dssp             EESSSSEEEG-GGSS-----TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE-----------TTTEEEEEET-TEE
T ss_pred             eeeCCeEchh-hcCc--eecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceee---ecCCCcceeeeC-cEE
Confidence            9665555431 1111  11111                333444 34445542  355654   277778899884 779


Q ss_pred             EeecCCCCccC
Q psy5752         150 HCECNEGFTGN  160 (222)
Q Consensus       150 ~C~C~~G~~gd  160 (222)
                      +|.|.+||.++
T Consensus       152 ~C~~~~~~~~~  162 (197)
T PF06247_consen  152 KCVCKEGFPGD  162 (197)
T ss_dssp             EEEE-TT-EEE
T ss_pred             EeecCCCCCCC
Confidence            99999999864


No 24 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.44  E-value=0.00021  Score=39.98  Aligned_cols=33  Identities=33%  Similarity=0.925  Sum_probs=27.4

Q ss_pred             CCCCC-C-CCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752           9 SLSEC-D-TPCHPNAQCTSPDEFNESREQAKCICNFGYTGD   47 (222)
Q Consensus         9 ~~~~C-~-~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~   47 (222)
                      ++++| . .+|.+++.|++.     .+.| .|.|++||.|.
T Consensus         1 ~~~~C~~~~~C~~~~~C~~~-----~~~~-~C~C~~g~~g~   35 (38)
T cd00054           1 DIDECASGNPCQNGGTCVNT-----VGSY-RCSCPPGYTGR   35 (38)
T ss_pred             CcccCCCCCCcCCCCEeECC-----CCCe-EeECCCCCcCC
Confidence            46888 5 789999999984     5565 99999999986


No 25 
>KOG1225|consensus
Probab=97.41  E-value=0.00047  Score=61.41  Aligned_cols=58  Identities=33%  Similarity=0.995  Sum_probs=45.0

Q ss_pred             CCCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccCC
Q psy5752          13 CDTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGNG   92 (222)
Q Consensus        13 C~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~   92 (222)
                      |+..|..++.+++  .        .|.|++||.|.   .|+...  |.  ..|..++.|+    .  -+|.|.+||+|..
T Consensus       283 Cp~~cs~~g~~~~--g--------~CiC~~g~~G~---dCs~~~--cp--adC~g~G~Ci----~--G~C~C~~Gy~G~~  339 (525)
T KOG1225|consen  283 CPVDCSGGGVCVD--G--------ECICNPGYSGK---DCSIRR--CP--ADCSGHGKCI----D--GECLCDEGYTGEL  339 (525)
T ss_pred             CCcccCCCceecC--C--------EeecCCCcccc---cccccc--CC--ccCCCCCccc----C--CceEeCCCCcCCc
Confidence            5455877788876  2        79999999999   887654  43  6898899995    2  2799999999874


Q ss_pred             C
Q psy5752          93 V   93 (222)
Q Consensus        93 ~   93 (222)
                      +
T Consensus       340 C  340 (525)
T KOG1225|consen  340 C  340 (525)
T ss_pred             c
Confidence            3


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.28  E-value=0.00049  Score=38.43  Aligned_cols=33  Identities=27%  Similarity=0.869  Sum_probs=26.2

Q ss_pred             CCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCcc
Q psy5752         125 LKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTG  159 (222)
Q Consensus       125 id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g  159 (222)
                      +++|... ..|.+++.|++..++ |+|.|++||.|
T Consensus         2 ~~~C~~~-~~C~~~~~C~~~~~~-~~C~C~~g~~g   34 (38)
T cd00054           2 IDECASG-NPCQNGGTCVNTVGS-YRCSCPPGYTG   34 (38)
T ss_pred             cccCCCC-CCcCCCCEeECCCCC-eEeECCCCCcC
Confidence            5677652 358878899999755 99999999988


No 27 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.06  E-value=0.0013  Score=36.10  Aligned_cols=27  Identities=30%  Similarity=0.924  Sum_probs=22.8

Q ss_pred             CCCCCCCeeeeCCCCeeEeecCCCCccC
Q psy5752         133 SLCGKDASCVVASQGHFHCECNEGFTGN  160 (222)
Q Consensus       133 ~~C~~~~~C~n~~~g~~~C~C~~G~~gd  160 (222)
                      ..|.+++.|++..++ |+|.|++||.++
T Consensus         6 ~~C~~~~~C~~~~~~-~~C~C~~g~~g~   32 (36)
T cd00053           6 NPCSNGGTCVNTPGS-YRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCEEecCCCC-eEeECCCCCccc
Confidence            458878899999755 999999999985


No 28 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.95  E-value=0.0018  Score=35.73  Aligned_cols=26  Identities=31%  Similarity=1.048  Sum_probs=21.6

Q ss_pred             CCCCCCCeeeeCCCCeeEeecCCCCccC
Q psy5752         133 SLCGKDASCVVASQGHFHCECNEGFTGN  160 (222)
Q Consensus       133 ~~C~~~~~C~n~~~g~~~C~C~~G~~gd  160 (222)
                      +.|.++ .|+++. ++|.|.|++||.++
T Consensus         6 ~~C~~~-~C~~~~-~~~~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINTP-GSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECCC-CCeEeECCCCCccC
Confidence            458877 899996 55999999999984


No 29 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.65  E-value=0.0039  Score=34.04  Aligned_cols=27  Identities=41%  Similarity=1.078  Sum_probs=23.9

Q ss_pred             CCCCCCCEeeecCCCCCeEEecCCCCccC
Q psy5752          63 KNCHANAECVYNATSAGYRCQCAQGYVGN   91 (222)
Q Consensus        63 ~~C~~~~~C~~~n~~g~~~C~C~~G~~~~   91 (222)
                      .+|.++++|+  +..++|.|.|+.||.+.
T Consensus         6 ~~C~~~~~C~--~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           6 NPCSNGGTCV--NTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCEEe--cCCCCeEeECCCCCccc
Confidence            6788889997  88899999999999876


No 30 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.62  E-value=0.0031  Score=34.69  Aligned_cols=26  Identities=38%  Similarity=0.921  Sum_probs=21.5

Q ss_pred             CCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752          15 TPCHPNAQCTSPDEFNESREQAKCICNFGYTGD   47 (222)
Q Consensus        15 ~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~   47 (222)
                      .+|.++ .|++.     .+.| .|.|++||.|.
T Consensus         6 ~~C~~~-~C~~~-----~~~~-~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINT-----PGSY-TCSCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECC-----CCCe-EeECCCCCccC
Confidence            589888 99983     5666 99999999983


No 31 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=96.14  E-value=0.0046  Score=34.51  Aligned_cols=31  Identities=29%  Similarity=0.687  Sum_probs=23.3

Q ss_pred             CCCCCCeeeeCCCCeeEeecCCCCccCCCcc
Q psy5752         134 LCGKDASCVVASQGHFHCECNEGFTGNGITC  164 (222)
Q Consensus       134 ~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C  164 (222)
                      .|..++.|++..+|++.|.|..||..++..|
T Consensus         6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~~C   36 (37)
T PF12946_consen    6 KCPANAGCFRYDDGSEECRCLLGYKKVGGKC   36 (37)
T ss_dssp             ---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred             cCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence            4888899999987889999999999765555


No 32 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=95.98  E-value=0.008  Score=48.12  Aligned_cols=39  Identities=26%  Similarity=0.722  Sum_probs=32.9

Q ss_pred             CCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCcc
Q psy5752         118 NGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTG  159 (222)
Q Consensus       118 ~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g  159 (222)
                      .+..|.++++|...++.|.+  .|.++. |+|.|.|++||+.
T Consensus       180 ~~~~C~~~~~C~~~~~~c~~--~C~~~~-g~~~c~c~~g~~~  218 (224)
T cd01475         180 QGKICVVPDLCATLSHVCQQ--VCISTP-GSYLCACTEGYAL  218 (224)
T ss_pred             ccccCcCchhhcCCCCCccc--eEEcCC-CCEEeECCCCccC
Confidence            45689999999887888975  799997 6699999999984


No 33 
>KOG1226|consensus
Probab=95.48  E-value=0.091  Score=48.55  Aligned_cols=69  Identities=23%  Similarity=0.710  Sum_probs=49.1

Q ss_pred             CCCCCCeecCCCCCCCCCCceeeecCCCCc----cCCCCCcccCCCCCCC--CCCCCCCCEeeecCCCCCeEEecCCCCc
Q psy5752          16 PCHPNAQCTSPDEFNESREQAKCICNFGYT----GDGVTECNPESLGCNV--VKNCHANAECVYNATSAGYRCQCAQGYV   89 (222)
Q Consensus        16 ~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~----g~~~~~C~~~~~~C~~--~~~C~~~~~C~~~n~~g~~~C~C~~G~~   89 (222)
                      +|...|.|+=  .        .|.|.+...    |.   .|+-++-.|..  ...|..++.|.    =  -+|.|.+||+
T Consensus       515 vCSgrG~C~C--G--------qC~C~~~~~~~i~G~---fCECDnfsC~r~~g~lC~g~G~C~----C--G~CvC~~Gwt  575 (783)
T KOG1226|consen  515 VCSGRGDCVC--G--------QCVCHKPDNGKIYGK---FCECDNFSCERHKGVLCGGHGRCE----C--GRCVCNPGWT  575 (783)
T ss_pred             CcCCCCcEeC--C--------ceEecCCCCCceeee---eeeccCcccccccCcccCCCCeEe----C--CcEEcCCCCc
Confidence            7888888876  2        799987776    55   88866545665  34688888884    1  2799999999


Q ss_pred             cCCCeee-cCCCcCC
Q psy5752          90 GNGVECH-PLKSCLE  103 (222)
Q Consensus        90 ~~~~~C~-~~~~C~~  103 (222)
                      |..+.|. +.+.|..
T Consensus       576 G~~C~C~~std~C~~  590 (783)
T KOG1226|consen  576 GSACNCPLSTDTCES  590 (783)
T ss_pred             cCCCCCCCCCccccC
Confidence            9988775 3344543


No 34 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.47  E-value=0.023  Score=30.89  Aligned_cols=25  Identities=40%  Similarity=1.034  Sum_probs=20.5

Q ss_pred             CCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752          15 TPCHPNAQCTSPDEFNESREQAKCICNFGYTGD   47 (222)
Q Consensus        15 ~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~   47 (222)
                      ..|.++|+|+..       .. .|.|.+||+|+
T Consensus         6 ~~C~~~G~C~~~-------~g-~C~C~~g~~G~   30 (32)
T PF07974_consen    6 NICSGHGTCVSP-------CG-RCVCDSGYTGP   30 (32)
T ss_pred             CccCCCCEEeCC-------CC-EEECCCCCcCC
Confidence            369999999972       12 89999999998


No 35 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.44  E-value=0.0062  Score=26.06  Aligned_cols=11  Identities=45%  Similarity=1.078  Sum_probs=9.0

Q ss_pred             eeecCCCCccC
Q psy5752          37 KCICNFGYTGD   47 (222)
Q Consensus        37 ~C~C~~G~~g~   47 (222)
                      .|+|++||+|.
T Consensus         1 ~C~C~~G~~G~   11 (13)
T PF12661_consen    1 TCQCPPGWTGP   11 (13)
T ss_dssp             EEEE-TTEETT
T ss_pred             CccCcCCCcCC
Confidence            49999999998


No 36 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.40  E-value=0.011  Score=33.03  Aligned_cols=21  Identities=43%  Similarity=0.931  Sum_probs=16.5

Q ss_pred             CEeeecCCCCCeEEecCCCCccC
Q psy5752          69 AECVYNATSAGYRCQCAQGYVGN   91 (222)
Q Consensus        69 ~~C~~~n~~g~~~C~C~~G~~~~   91 (222)
                      .+|+  +++++|+|.|++||...
T Consensus        10 h~C~--~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen   10 HICV--NTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             SEEE--EETTSEEEE-STTEEE-
T ss_pred             CCCc--cCCCceEeECCCCCEEC
Confidence            4897  88999999999999843


No 37 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=95.21  E-value=0.015  Score=32.53  Aligned_cols=35  Identities=34%  Similarity=0.769  Sum_probs=23.3

Q ss_pred             CCCCCCCCCCCEeeecCCCCCeEEecCCCCccCCCee
Q psy5752          59 CNVVKNCHANAECVYNATSAGYRCQCAQGYVGNGVEC   95 (222)
Q Consensus        59 C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~C   95 (222)
                      |.. ..|..++.|+ +...|++.|+|..||..++..|
T Consensus         2 C~~-~~cP~NA~C~-~~~dG~eecrCllgyk~~~~~C   36 (37)
T PF12946_consen    2 CID-TKCPANAGCF-RYDDGSEECRCLLGYKKVGGKC   36 (37)
T ss_dssp             -SS-S---TTEEEE-EETTSEEEEEE-TTEEEETTEE
T ss_pred             ccC-ccCCCCcccE-EcCCCCEEEEeeCCccccCCCc
Confidence            444 6777899998 4666999999999998665544


No 38 
>KOG1226|consensus
Probab=95.20  E-value=0.21  Score=46.26  Aligned_cols=120  Identities=24%  Similarity=0.710  Sum_probs=63.0

Q ss_pred             eeecCCCCccCCCCCcccCC---------CCCCC--C-CCCCCCCEeeecCCCCCeEEecCCCCc----cCCCeeecCCC
Q psy5752          37 KCICNFGYTGDGVTECNPES---------LGCNV--V-KNCHANAECVYNATSAGYRCQCAQGYV----GNGVECHPLKS  100 (222)
Q Consensus        37 ~C~C~~G~~g~~~~~C~~~~---------~~C~~--~-~~C~~~~~C~~~n~~g~~~C~C~~G~~----~~~~~C~~~~~  100 (222)
                      .|.|.+||.|+   .|+=..         +.|..  . ..|...+.|+    =|  +|.|.+...    |..++|.+. .
T Consensus       479 ~C~C~~G~~G~---~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~----CG--qC~C~~~~~~~i~G~fCECDnf-s  548 (783)
T KOG1226|consen  479 QCRCDEGWLGK---KCECSTDELSSSEEEDKCRENSDSPVCSGRGDCV----CG--QCVCHKPDNGKIYGKFCECDNF-S  548 (783)
T ss_pred             ceecCCCCCCC---cccCCccccCcHhHHhhccCCCCCCCcCCCCcEe----CC--ceEecCCCCCceeeeeeeccCc-c
Confidence            57999999998   665211         12432  1 2566666664    11  577766554    333333222 1


Q ss_pred             cCCC-------CCC--CCC--CCCCcccCCCccc-cCCCCCCCC-CCCCCCCeeeeCCCCeeEeecCCC-CccCCCcccc
Q psy5752         101 CLED-------RSL--CGK--DSQGYVGNGVECH-PLKSCLEDR-SLCGKDASCVVASQGHFHCECNEG-FTGNGITCKP  166 (222)
Q Consensus       101 C~~~-------~~~--C~~--c~~g~~~~g~~C~-~id~C~~~~-~~C~~~~~C~n~~~g~~~C~C~~G-~~gd~~~C~~  166 (222)
                      |...       ++.  |+.  |..||+|..-.|. +.+.|.... ..|...++|.   =|  +|.|... |.|  .+|+.
T Consensus       549 C~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~---Cg--~C~C~~~~~sG--~~CE~  621 (783)
T KOG1226|consen  549 CERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCE---CG--RCKCTDPPYSG--EFCEK  621 (783)
T ss_pred             cccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceee---CC--ceEcCCCCcCc--chhhc
Confidence            2211       111  222  5778877633332 346664422 2355555554   13  4777665 777  77877


Q ss_pred             CCCCCCC
Q psy5752         167 VRKKESD  173 (222)
Q Consensus       167 ~~~~~~~  173 (222)
                      -+.+...
T Consensus       622 cptc~~~  628 (783)
T KOG1226|consen  622 CPTCPDP  628 (783)
T ss_pred             CCCCCCc
Confidence            6666554


No 39 
>KOG0994|consensus
Probab=94.51  E-value=0.1  Score=50.31  Aligned_cols=53  Identities=28%  Similarity=0.651  Sum_probs=29.8

Q ss_pred             ee-ecCCCCccCCCCCcccCCCCCCCCCCCC--------CCCEeeecCCCCCeEEecCCCCccCCC
Q psy5752          37 KC-ICNFGYTGDGVTECNPESLGCNVVKNCH--------ANAECVYNATSAGYRCQCAQGYVGNGV   93 (222)
Q Consensus        37 ~C-~C~~G~~g~~~~~C~~~~~~C~~~~~C~--------~~~~C~~~n~~g~~~C~C~~G~~~~~~   93 (222)
                      +| +|..||.|+++ .=..+  .|.+ -+|.        ....|.-.+......|.|.+||+|..+
T Consensus       886 ~CdrCl~GyyGdP~-lg~g~--~CrP-CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RC  947 (1758)
T KOG0994|consen  886 SCDRCLDGYYGDPR-LGSGI--GCRP-CPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRC  947 (1758)
T ss_pred             chhhhhccccCCcc-cCCCC--CCCC-CCCCCCCccchhccccccccccccceeeecccCccccch
Confidence            67 89999998754 11111  1333 2332        122453112334567999999998754


No 40 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.04  E-value=0.1  Score=28.33  Aligned_cols=23  Identities=35%  Similarity=1.014  Sum_probs=19.5

Q ss_pred             CCCCCCeeeeCCCCeeEeecCCCCcc
Q psy5752         134 LCGKDASCVVASQGHFHCECNEGFTG  159 (222)
Q Consensus       134 ~C~~~~~C~n~~~g~~~C~C~~G~~g  159 (222)
                      .|..+++|++..   .+|.|.+||+|
T Consensus         7 ~C~~~G~C~~~~---g~C~C~~g~~G   29 (32)
T PF07974_consen    7 ICSGHGTCVSPC---GRCVCDSGYTG   29 (32)
T ss_pred             ccCCCCEEeCCC---CEEECCCCCcC
Confidence            488899999762   57999999998


No 41 
>KOG0994|consensus
Probab=93.02  E-value=0.42  Score=46.36  Aligned_cols=64  Identities=28%  Similarity=0.687  Sum_probs=37.7

Q ss_pred             EEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCC--Cccc---cCCCCCCCCCCCCCCCeeeeCCCCeeEe-ecC
Q psy5752          81 RCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNG--VECH---PLKSCLEDRSLCGKDASCVVASQGHFHC-ECN  154 (222)
Q Consensus        81 ~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g--~~C~---~id~C~~~~~~C~~~~~C~n~~~g~~~C-~C~  154 (222)
                      +|.|.+|.-|..               |..|.+||++-.  +.|.   ..+.|....+.|..   |.+...| +.| .|.
T Consensus       831 QC~C~~g~ygrq---------------CnqCqpG~WgFPeCr~CqCNgHA~~Cd~~tGaCi~---CqD~T~G-~~CdrCl  891 (1758)
T KOG0994|consen  831 QCQCRPGTYGRQ---------------CNQCQPGYWGFPECRPCQCNGHADTCDPITGACID---CQDSTTG-HSCDRCL  891 (1758)
T ss_pred             ceeeccccchhh---------------ccccCCCccCCCcCccccccCcccccCcccccccc---ccccccc-cchhhhh
Confidence            677776654442               344566665321  2222   13556555555643   7777667 888 799


Q ss_pred             CCCccCCCc
Q psy5752         155 EGFTGNGIT  163 (222)
Q Consensus       155 ~G~~gd~~~  163 (222)
                      .||+||...
T Consensus       892 ~GyyGdP~l  900 (1758)
T KOG0994|consen  892 DGYYGDPRL  900 (1758)
T ss_pred             ccccCCccc
Confidence            999987643


No 42 
>PHA02887 EGF-like protein; Provisional
Probab=92.64  E-value=0.13  Score=36.35  Aligned_cols=40  Identities=38%  Similarity=0.787  Sum_probs=30.3

Q ss_pred             CCCCC----CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCC
Q psy5752           9 SLSEC----DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPES   56 (222)
Q Consensus         9 ~~~~C----~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~   56 (222)
                      -+.+|    .+-|- ||.|.....   .... .|.|+.||+|.   .|+.++
T Consensus        82 hf~pC~~eyk~YCi-HG~C~yI~d---L~ep-sCrC~~GYtG~---RCE~vs  125 (126)
T PHA02887         82 FFEKCKNDFNDFCI-NGECMNIID---LDEK-FCICNKGYTGI---RCDEVS  125 (126)
T ss_pred             CccccChHhhCEee-CCEEEcccc---CCCc-eeECCCCcccC---CCCccc
Confidence            46778    36787 579998644   5555 89999999999   887653


No 43 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=90.09  E-value=0.24  Score=35.69  Aligned_cols=40  Identities=28%  Similarity=0.665  Sum_probs=29.9

Q ss_pred             CCCCC----CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCC
Q psy5752           9 SLSEC----DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPES   56 (222)
Q Consensus         9 ~~~~C----~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~   56 (222)
                      ++.+|    .+-|-|+ .|.....   ...+ .|.|..||+|.   .|+..+
T Consensus        41 ~i~~Cp~ey~~YClHG-~C~yI~d---l~~~-~CrC~~GYtGe---RCEh~d   84 (139)
T PHA03099         41 AIRLCGPEGDGYCLHG-DCIHARD---IDGM-YCRCSHGYTGI---RCQHVV   84 (139)
T ss_pred             ccccCChhhCCEeECC-EEEeecc---CCCc-eeECCCCcccc---ccccee
Confidence            35667    3668764 8987654   5666 89999999999   888654


No 44 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=90.04  E-value=0.15  Score=39.27  Aligned_cols=75  Identities=23%  Similarity=0.594  Sum_probs=42.6

Q ss_pred             CCCCCCCEeeecCCCCCeEEecCCCCc-cCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCee
Q psy5752          63 KNCHANAECVYNATSAGYRCQCAQGYV-GNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASC  141 (222)
Q Consensus        63 ~~C~~~~~C~~~n~~g~~~C~C~~G~~-~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C  141 (222)
                      ..|.+ +..+  ...+.|.|.|.+||. -+..+|+...+|...                        ......|...++|
T Consensus         6 T~CKN-G~Li--QMSNHfEC~Cnegfvl~~EntCE~kv~C~~~------------------------e~~~K~Cgdya~C   58 (197)
T PF06247_consen    6 TICKN-GYLI--QMSNHFECKCNEGFVLKNENTCEEKVECDKL------------------------ENVNKPCGDYAKC   58 (197)
T ss_dssp             ---BT-EEEE--EESSEEEEEESTTEEEEETTEEEE----SG-------------------------GGTTSEEETTEEE
T ss_pred             ccccC-CEEE--EccCceEEEcCCCcEEccccccccceecCcc------------------------cccCccccchhhh
Confidence            45543 4454  456789999999997 344555555444321                        0112348778899


Q ss_pred             eeCCC----CeeEeecCCCCccCCCcc
Q psy5752         142 VVASQ----GHFHCECNEGFTGNGITC  164 (222)
Q Consensus       142 ~n~~~----g~~~C~C~~G~~gd~~~C  164 (222)
                      ++.+.    ..|.|.|.+||......|
T Consensus        59 ~~~~~~~~~~~~~C~C~~gY~~~~~vC   85 (197)
T PF06247_consen   59 INQANKGEERAYKCDCINGYILKQGVC   85 (197)
T ss_dssp             EE-SSTTSSTSEEEEE-TTEEESSSSE
T ss_pred             hcCCCcccceeEEEecccCceeeCCeE
Confidence            97763    459999999999543343


No 45 
>smart00051 DSL delta serrate ligand.
Probab=89.17  E-value=0.82  Score=28.93  Aligned_cols=44  Identities=23%  Similarity=0.650  Sum_probs=30.2

Q ss_pred             eeecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCccC
Q psy5752          37 KCICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQCAQGYVGN   91 (222)
Q Consensus        37 ~C~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~   91 (222)
                      .-.|+++|.|.   .|...   |.+.+.+..+..|   +..|  .+.|.+||.|.
T Consensus        18 rv~C~~~~yG~---~C~~~---C~~~~d~~~~~~C---d~~G--~~~C~~Gw~G~   61 (63)
T smart00051       18 RVTCDENYYGE---GCNKF---CRPRDDFFGHYTC---DENG--NKGCLEGWMGP   61 (63)
T ss_pred             EeeCCCCCcCC---ccCCE---eCcCccccCCccC---CcCC--CEecCCCCcCC
Confidence            56899999998   77532   5543445556678   4334  57899999876


No 46 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=87.79  E-value=0.63  Score=37.11  Aligned_cols=36  Identities=28%  Similarity=0.712  Sum_probs=28.0

Q ss_pred             CcccCCCCCCC-CCCCCCCCEeeecCCCCCeEEecCCCCccC
Q psy5752          51 ECNPESLGCNV-VKNCHANAECVYNATSAGYRCQCAQGYVGN   91 (222)
Q Consensus        51 ~C~~~~~~C~~-~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~   91 (222)
                      .|++.+ +|.. .+.|.  ..|.  ++.|+|.|.|.+||+..
T Consensus       183 ~C~~~~-~C~~~~~~c~--~~C~--~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         183 ICVVPD-LCATLSHVCQ--QVCI--STPGSYLCACTEGYALL  219 (224)
T ss_pred             cCcCch-hhcCCCCCcc--ceEE--cCCCCEEeECCCCccCC
Confidence            787666 4875 45674  4796  99999999999999843


No 47 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=85.91  E-value=0.88  Score=32.03  Aligned_cols=32  Identities=31%  Similarity=0.707  Sum_probs=24.9

Q ss_pred             CCCCC--CCCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752           9 SLSEC--DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGD   47 (222)
Q Consensus         9 ~~~~C--~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~   47 (222)
                      ..|+|  ...|+.++.|..      .... .|.|.+||+-+
T Consensus        76 p~d~Cd~y~~CG~~g~C~~------~~~~-~C~Cl~GF~P~  109 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS------NNSP-KCSCLPGFEPK  109 (110)
T ss_pred             cccCCCCccccCCccEeCC------CCCC-ceECCCCcCCC
Confidence            46789  689999999965      3344 79999999753


No 48 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=82.14  E-value=1.8  Score=30.41  Aligned_cols=30  Identities=27%  Similarity=0.734  Sum_probs=23.9

Q ss_pred             CCCCCCCCCCCCEeeecCCCCCeEEecCCCCcc
Q psy5752          58 GCNVVKNCHANAECVYNATSAGYRCQCAQGYVG   90 (222)
Q Consensus        58 ~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~~   90 (222)
                      .|.....|+..+.|   +...+..|.|.+||.-
T Consensus        79 ~Cd~y~~CG~~g~C---~~~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   79 QCDVYGFCGPNGIC---NSNNSPKCSCLPGFEP  108 (110)
T ss_pred             CCCCccccCCccEe---CCCCCCceECCCCcCC
Confidence            47766899999999   4555668999999963


No 49 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=77.81  E-value=3.9  Score=24.42  Aligned_cols=27  Identities=30%  Similarity=0.807  Sum_probs=20.6

Q ss_pred             CCC--CCCCCCCCeecCCCCCCCCCCceeeecCCCCccC
Q psy5752          11 SEC--DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGD   47 (222)
Q Consensus        11 ~~C--~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~   47 (222)
                      +.|  ...|..++.|++  .        .|.|++||.-.
T Consensus        20 ~~C~~~~qC~~~s~C~~--g--------~C~C~~g~~~~   48 (52)
T PF01683_consen   20 ESCESDEQCIGGSVCVN--G--------RCQCPPGYVEV   48 (52)
T ss_pred             CCCCCcCCCCCcCEEcC--C--------EeECCCCCEec
Confidence            446  567888889976  2        89999999744


No 50 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.95  E-value=3.2  Score=30.02  Aligned_cols=32  Identities=25%  Similarity=0.690  Sum_probs=24.8

Q ss_pred             CCCCCCCCEeeecCCCCCeEEecCCCCccCCCe
Q psy5752          62 VKNCHANAECVYNATSAGYRCQCAQGYVGNGVE   94 (222)
Q Consensus        62 ~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~~   94 (222)
                      .+.|-++ .|.+......+.|.|..||+|..++
T Consensus        50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             CCEeECC-EEEeeccCCCceeECCCCccccccc
Confidence            4677765 8975556678999999999998654


No 51 
>PHA02887 EGF-like protein; Provisional
Probab=75.17  E-value=4  Score=29.06  Aligned_cols=35  Identities=29%  Similarity=0.577  Sum_probs=25.6

Q ss_pred             CCCC--CCCCCCCCEeeecCCCCCeEEecCCCCccCCC
Q psy5752          58 GCNV--VKNCHANAECVYNATSAGYRCQCAQGYVGNGV   93 (222)
Q Consensus        58 ~C~~--~~~C~~~~~C~~~n~~g~~~C~C~~G~~~~~~   93 (222)
                      +|..  .+.|- ++.|.+......+.|.|.+||+|..+
T Consensus        85 pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~RC  121 (126)
T PHA02887         85 KCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIRC  121 (126)
T ss_pred             ccChHhhCEee-CCEEEccccCCCceeECCCCcccCCC
Confidence            3554  46776 56997545567789999999998855


No 52 
>KOG3516|consensus
Probab=75.14  E-value=2.8  Score=41.11  Aligned_cols=41  Identities=24%  Similarity=0.681  Sum_probs=32.7

Q ss_pred             ccCCCCC-CCCCCCCCeecCCCCCCCCCCceeeecC-CCCccCCCCCcccCC
Q psy5752           7 RFSLSEC-DTPCHPNAQCTSPDEFNESREQAKCICN-FGYTGDGVTECNPES   56 (222)
Q Consensus         7 ~~~~~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~-~G~~g~~~~~C~~~~   56 (222)
                      |.-++.| +++|++++.|...     ...+ .|.|. .||+|.   +|...+
T Consensus       542 C~i~drClPN~CehgG~C~Qs-----~~~f-~C~C~~TGY~Ga---tCHtsi  584 (1306)
T KOG3516|consen  542 CGISDRCLPNPCEHGGKCSQS-----WDDF-ECNCELTGYKGA---TCHTSI  584 (1306)
T ss_pred             cccccccCCccccCCCccccc-----ccce-eEeccccccccc---cccCCC
Confidence            4457788 9999999999983     4455 99998 999998   887543


No 53 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=63.70  E-value=7.5  Score=21.23  Aligned_cols=19  Identities=26%  Similarity=0.824  Sum_probs=13.3

Q ss_pred             EeeecCCCCCeEEecCCCCccC
Q psy5752          70 ECVYNATSAGYRCQCAQGYVGN   91 (222)
Q Consensus        70 ~C~~~n~~g~~~C~C~~G~~~~   91 (222)
                      .|.   ......|.|+.||+.+
T Consensus        11 ~CD---pn~~~~C~CPeGyIld   29 (34)
T PF09064_consen   11 DCD---PNSPGQCFCPEGYILD   29 (34)
T ss_pred             ccC---CCCCCceeCCCceEec
Confidence            673   3334589999999855


No 54 
>KOG3514|consensus
Probab=62.79  E-value=5.5  Score=38.89  Aligned_cols=35  Identities=29%  Similarity=0.825  Sum_probs=28.9

Q ss_pred             CC-CCCCCCCCeecCCCCCCCCCCceeeecC-CCCccCCCCCcccC
Q psy5752          12 EC-DTPCHPNAQCTSPDEFNESREQAKCICN-FGYTGDGVTECNPE   55 (222)
Q Consensus        12 ~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~-~G~~g~~~~~C~~~   55 (222)
                      .| ++||+|+|.|...     ...+ .|-|. .||.|.   +|+..
T Consensus       625 ~C~~nPC~N~g~C~eg-----wNrf-iCDCs~T~~~G~---~CerE  661 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEG-----WNRF-ICDCSGTGFEGR---TCERE  661 (1591)
T ss_pred             ccCCCcccCCCCcccc-----cccc-ccccccCcccCc---cccce
Confidence            68 9999999999984     5566 99995 788888   88754


No 55 
>KOG0196|consensus
Probab=62.66  E-value=24  Score=33.81  Aligned_cols=72  Identities=19%  Similarity=0.488  Sum_probs=39.7

Q ss_pred             EEecCCCCccCCCeeecCCCcCCCCCCCCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCcc-
Q psy5752          81 RCQCAQGYVGNGVECHPLKSCLEDRSLCGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVASQGHFHCECNEGFTG-  159 (222)
Q Consensus        81 ~C~C~~G~~~~~~~C~~~~~C~~~~~~C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~g-  159 (222)
                      .|.|++||....           ....|..|+.|+....   .....|..    |..+..-. . .|+-.|.|..||.. 
T Consensus       260 ~C~C~aGye~~~-----------~~~~C~aCp~G~yK~~---~~~~~C~~----CP~~S~s~-~-ega~~C~C~~gyyRA  319 (996)
T KOG0196|consen  260 GCVCKAGYEEAE-----------NGKACQACPPGTYKAS---QGDSLCLP----CPPNSHSS-S-EGATSCTCENGYYRA  319 (996)
T ss_pred             ceeecCCCCccc-----------CCCcceeCCCCcccCC---CCCCCCCC----CCCCCCCC-C-CCCCcccccCCcccC
Confidence            688999997421           1234666777764210   11234442    55443322 2 35578999999983 


Q ss_pred             --CCC--ccccCCCCCC
Q psy5752         160 --NGI--TCKPVRKKES  172 (222)
Q Consensus       160 --d~~--~C~~~~~~~~  172 (222)
                        |+.  -|+..+..+.
T Consensus       320 ~~Dp~~mpCT~PPSaP~  336 (996)
T KOG0196|consen  320 DSDPPSMPCTRPPSAPR  336 (996)
T ss_pred             CCCCCCCCCCCCCCccc
Confidence              332  3776554433


No 56 
>KOG1836|consensus
Probab=60.47  E-value=22  Score=37.07  Aligned_cols=49  Identities=22%  Similarity=0.494  Sum_probs=25.3

Q ss_pred             CCCCCCCcccCCCccccCCCCCCCCCCCCCCCeeeeCC-CCeeEee-cCCCCcc
Q psy5752         108 CGKDSQGYVGNGVECHPLKSCLEDRSLCGKDASCVVAS-QGHFHCE-CNEGFTG  159 (222)
Q Consensus       108 C~~c~~g~~~~g~~C~~id~C~~~~~~C~~~~~C~n~~-~g~~~C~-C~~G~~g  159 (222)
                      |..|..||.++...=..-| |..-+  |..++.|..+. ..+..|. |++||+|
T Consensus       758 C~~C~~GfYg~~~~~~~~d-C~~C~--Cp~~~~~~~~~~~~~~iCk~Cp~gytG  808 (1705)
T KOG1836|consen  758 CAQCVDGFYGLPDLGTSGD-CQPCP--CPNGGACGQTPEILEVVCKNCPPGYTG  808 (1705)
T ss_pred             hhhhcCCCCCccccCCCCC-CccCC--CCCChhhcCcCcccceecCCCCCCCcc
Confidence            4456677754311101111 44322  65555665544 2336787 8888887


No 57 
>KOG3516|consensus
Probab=60.26  E-value=7.6  Score=38.27  Aligned_cols=41  Identities=29%  Similarity=0.821  Sum_probs=32.2

Q ss_pred             ccccCCCCCCCCCCCCCCCeeeeCCCCeeEeecC-CCCccCCCcccc
Q psy5752         121 ECHPLKSCLEDRSLCGKDASCVVASQGHFHCECN-EGFTGNGITCKP  166 (222)
Q Consensus       121 ~C~~id~C~~~~~~C~~~~~C~n~~~g~~~C~C~-~G~~gd~~~C~~  166 (222)
                      .|.-+|.|..  +.|.+++.|..+. ..|.|.|. .||+|  .+|..
T Consensus       541 ~C~i~drClP--N~CehgG~C~Qs~-~~f~C~C~~TGY~G--atCHt  582 (1306)
T KOG3516|consen  541 MCGISDRCLP--NPCEHGGKCSQSW-DDFECNCELTGYKG--ATCHT  582 (1306)
T ss_pred             ccccccccCC--ccccCCCcccccc-cceeEecccccccc--ccccC
Confidence            4556678875  4599999999875 55999998 89998  77764


No 58 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=49.32  E-value=12  Score=26.14  Aligned_cols=33  Identities=24%  Similarity=0.616  Sum_probs=23.9

Q ss_pred             CCCCCCCCCCCCCCeeeeCC----CCeeEeecCCCCc
Q psy5752         126 KSCLEDRSLCGKDASCVVAS----QGHFHCECNEGFT  158 (222)
Q Consensus       126 d~C~~~~~~C~~~~~C~n~~----~g~~~C~C~~G~~  158 (222)
                      ++|....+.|..++.|++..    +.-|.|.|.+-+.
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~   42 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVV   42 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeecccc
Confidence            45656667798899999773    2458999987554


No 59 
>KOG3512|consensus
Probab=40.82  E-value=34  Score=30.55  Aligned_cols=14  Identities=29%  Similarity=0.370  Sum_probs=11.6

Q ss_pred             eeeecCCcEEEEcC
Q psy5752         174 FLLVNQGMFMLRVP  187 (222)
Q Consensus       174 ~~~~~~g~~~~~~~  187 (222)
                      .++...+...|+||
T Consensus       527 ~lwv~~~~i~c~cp  540 (592)
T KOG3512|consen  527 SLWVPAGDIACKCP  540 (592)
T ss_pred             eeeecCCCceecCC
Confidence            46888888889988


No 60 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=40.10  E-value=16  Score=21.37  Aligned_cols=14  Identities=43%  Similarity=1.232  Sum_probs=11.9

Q ss_pred             eeecCCCCccCCCCCcc
Q psy5752          37 KCICNFGYTGDGVTECN   53 (222)
Q Consensus        37 ~C~C~~G~~g~~~~~C~   53 (222)
                      .|.|+++|+|.   .|+
T Consensus        19 ~C~C~~~~~G~---~C~   32 (49)
T PF00053_consen   19 QCVCKPGTTGP---RCD   32 (49)
T ss_dssp             EESBSTTEEST---TS-
T ss_pred             EEeccccccCC---cCc
Confidence            89999999999   776


No 61 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=39.37  E-value=39  Score=19.86  Aligned_cols=14  Identities=36%  Similarity=0.956  Sum_probs=12.0

Q ss_pred             eeecCCCCccCCCCCcc
Q psy5752          37 KCICNFGYTGDGVTECN   53 (222)
Q Consensus        37 ~C~C~~G~~g~~~~~C~   53 (222)
                      .|.|+++|+|.   .|+
T Consensus        20 ~C~C~~~~~G~---~C~   33 (50)
T cd00055          20 QCECKPNTTGR---RCD   33 (50)
T ss_pred             EEeCCCcCCCC---CCC
Confidence            79999999998   664


No 62 
>KOG3514|consensus
Probab=33.81  E-value=69  Score=31.86  Aligned_cols=35  Identities=31%  Similarity=0.849  Sum_probs=27.5

Q ss_pred             CCCCCCCCCCCCCeeeeCCCCeeEeecC-CCCccCCCcccc
Q psy5752         127 SCLEDRSLCGKDASCVVASQGHFHCECN-EGFTGNGITCKP  166 (222)
Q Consensus       127 ~C~~~~~~C~~~~~C~n~~~g~~~C~C~-~G~~gd~~~C~~  166 (222)
                      .|..+|  |.++++|....+. |.|.|. .||.|  .+|+.
T Consensus       625 ~C~~nP--C~N~g~C~egwNr-fiCDCs~T~~~G--~~Cer  660 (1591)
T KOG3514|consen  625 ICESNP--CQNGGKCSEGWNR-FICDCSGTGFEG--RTCER  660 (1591)
T ss_pred             ccCCCc--ccCCCCccccccc-cccccccCcccC--ccccc
Confidence            476655  9999999977644 999996 58888  78875


No 63 
>KOG1836|consensus
Probab=33.41  E-value=56  Score=34.26  Aligned_cols=53  Identities=25%  Similarity=0.483  Sum_probs=32.8

Q ss_pred             ee-ecCCCCccCCCCCcccCCCCCCCCCCCCCCCEeeecCCCCCeEEe-cCCCCccCCC
Q psy5752          37 KC-ICNFGYTGDGVTECNPESLGCNVVKNCHANAECVYNATSAGYRCQ-CAQGYVGNGV   93 (222)
Q Consensus        37 ~C-~C~~G~~g~~~~~C~~~~~~C~~~~~C~~~~~C~~~n~~g~~~C~-C~~G~~~~~~   93 (222)
                      .| +|..||.|...   .+....|.. -+|.+++.|.-........|. |++||+|..+
T Consensus       757 ~C~~C~~GfYg~~~---~~~~~dC~~-C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC  811 (1705)
T KOG1836|consen  757 QCAQCVDGFYGLPD---LGTSGDCQP-CPCPNGGACGQTPEILEVVCKNCPPGYTGLRC  811 (1705)
T ss_pred             chhhhcCCCCCccc---cCCCCCCcc-CCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence            35 78888887632   111112665 677777777411124567898 9999998854


No 64 
>KOG3509|consensus
Probab=29.83  E-value=1.3e+02  Score=29.61  Aligned_cols=66  Identities=27%  Similarity=0.638  Sum_probs=41.5

Q ss_pred             CCCC-CCCCCCCCeecCCCCCCCCCCceeeecCCCCccCCCCCcccCCCCCCC-CCCCCCCCEeeecCCCCCeEEecCCC
Q psy5752          10 LSEC-DTPCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVTECNPESLGCNV-VKNCHANAECVYNATSAGYRCQCAQG   87 (222)
Q Consensus        10 ~~~C-~~~C~~~~~C~~~~~~~~~g~~~~C~C~~G~~g~~~~~C~~~~~~C~~-~~~C~~~~~C~~~n~~g~~~C~C~~G   87 (222)
                      .+.| ..+|.+.+.|...+     -.. .|.|++||+|+   .|++-.+.|.. ++.. -.++|.  ...+.....|.+|
T Consensus       406 g~~c~~~p~~~~g~c~p~~-----~~~-~c~c~~g~~G~---~c~d~~~~~~~~~~g~-y~~t~~--~~~~~~~~~c~pg  473 (964)
T KOG3509|consen  406 GDVCWRIPCQHDGPCLQTL-----EGK-QCLCPPGYTGD---SCEDCMNGCDRSPNGS-YLGTCV--PIQGKRCEYCGPG  473 (964)
T ss_pred             CCccccccCCCCccccccc-----ccc-ceeccccccCc---hhhccCccccccCCcc-ccceEe--ccCCCcceeecCC
Confidence            3566 67898888888853     333 89999999999   77665444443 1222 234553  3333455677777


No 65 
>PF02740 Colipase_C:  Colipase, C-terminal domain;  InterPro: IPR017914 This entry represents the C-terminal domain of colipase proteins. Colipase [, ] is a small protein cofactor needed by pancreatic lipase for efficient dietary lipid hydrolyisis. It also binds to the bile-salt covered triacylglycerol interface, thus allowing the enzyme to anchor itself to the water-lipid interface. Efficient absorption of dietary fats is dependent on the action of pancreatic triglyceride lipase. Colipase binds to the C-terminal, non-catalytic domain of lipase, thereby stabilising as active conformation and considerably increasing the overall hydrophobic binding site. Structural studies of the complex and of colipase alone have revealed the functionality of its architecture [, ]. Colipase is a small protein with five conserved disulphide bonds. Structural analogies have been recognised between a developmental protein (Dickkopf), the pancreatic lipase C-terminal domain, the N-terminal domains of lipoxygenases and the C-terminal domain of alpha-toxin. These non-catalytic domains in the latter enzymes are important for interaction with membrane. It has not been established if these domains are also involved in eventual protein cofactor binding as is the case for pancreatic lipase [].; GO: 0008047 enzyme activator activity, 0007586 digestion, 0016042 lipid catabolic process, 0005576 extracellular region; PDB: 1PCO_A 1PCN_A 1LPA_A 1N8S_C 1LPB_A 1ETH_D.
Probab=26.75  E-value=50  Score=19.12  Aligned_cols=22  Identities=27%  Similarity=0.523  Sum_probs=12.7

Q ss_pred             CeecCCCCCCCCCCceeeecCCCCc
Q psy5752          21 AQCTSPDEFNESREQAKCICNFGYT   45 (222)
Q Consensus        21 ~~C~~~~~~~~~g~~~~C~C~~G~~   45 (222)
                      +.|.....   -|.||.|.|.+|.+
T Consensus         3 ~~cs~~tl---yGvYy~CPCE~GLt   24 (45)
T PF02740_consen    3 SECSPKTL---YGVYYKCPCERGLT   24 (45)
T ss_dssp             SEEESS-S---SSCBSS--BSTTEE
T ss_pred             ccccccEe---eeEEEeccCCCCcE
Confidence            44554434   67888999988764


No 66 
>KOG1215|consensus
Probab=24.63  E-value=3.6e+02  Score=26.25  Aligned_cols=62  Identities=26%  Similarity=0.553  Sum_probs=36.6

Q ss_pred             CCCCCCCCCCCCCCeeeeCCCCeeEeecCCCCccCCCccccCCCCCCCeeeecCCcEEEEcCCCC
Q psy5752         126 KSCLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITCKPVRKKESDFLLVNQGMFMLRVPYQP  190 (222)
Q Consensus       126 d~C~~~~~~C~~~~~C~n~~~g~~~C~C~~G~~gd~~~C~~~~~~~~~~~~~~~g~~~~~~~~~~  190 (222)
                      ..|...++.|.+  .|.....+.|.|.|..||......|..... ...+.+.+...-+-..++..
T Consensus       365 ~~~~~~~g~Csq--~C~~~~p~~~~c~c~~g~~~~~~~c~~~~~-~~~~l~~s~~~~ir~~~~~~  426 (877)
T KOG1215|consen  365 NPCESDNGGCSQ--LCVPNSPGTFKCACSPGYELRLDKCEASDQ-PEAFLLFSNRHDIRRISLDC  426 (877)
T ss_pred             CcccccCCccce--eccCCCCCceeEecCCCcEeccCCceecCC-CCcEEEEecCccceecccCC
Confidence            455556677865  788433466999999999953333544332 23455555555544444433


No 67 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=23.82  E-value=61  Score=18.69  Aligned_cols=14  Identities=36%  Similarity=0.975  Sum_probs=12.2

Q ss_pred             eeecCCCCccCCCCCcc
Q psy5752          37 KCICNFGYTGDGVTECN   53 (222)
Q Consensus        37 ~C~C~~G~~g~~~~~C~   53 (222)
                      .|.|+++++|.   .|+
T Consensus        19 ~C~C~~~~~G~---~C~   32 (46)
T smart00180       19 QCECKPNVTGR---RCD   32 (46)
T ss_pred             EEECCCCCCCC---CCC
Confidence            79999999998   665


No 68 
>KOG0196|consensus
Probab=23.33  E-value=93  Score=30.07  Aligned_cols=46  Identities=28%  Similarity=0.803  Sum_probs=28.4

Q ss_pred             eeecCCCCcc--CCCCCcccC----------CCCCCCCCCCCCCCEeeecCCCCCeEEecCCCCc
Q psy5752          37 KCICNFGYTG--DGVTECNPE----------SLGCNVVKNCHANAECVYNATSAGYRCQCAQGYV   89 (222)
Q Consensus        37 ~C~C~~G~~g--~~~~~C~~~----------~~~C~~~~~C~~~~~C~~~n~~g~~~C~C~~G~~   89 (222)
                      .|.|.+||.-  .+. .|+.=          +..|   .+|..+..=   ...++..|.|..||-
T Consensus       260 ~C~C~aGye~~~~~~-~C~aCp~G~yK~~~~~~~C---~~CP~~S~s---~~ega~~C~C~~gyy  317 (996)
T KOG0196|consen  260 GCVCKAGYEEAENGK-ACQACPPGTYKASQGDSLC---LPCPPNSHS---SSEGATSCTCENGYY  317 (996)
T ss_pred             ceeecCCCCcccCCC-cceeCCCCcccCCCCCCCC---CCCCCCCCC---CCCCCCcccccCCcc
Confidence            6999999984  333 56531          1112   234444332   457888999999985


No 69 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=22.02  E-value=47  Score=19.78  Aligned_cols=19  Identities=42%  Similarity=0.955  Sum_probs=12.9

Q ss_pred             EecCCCCccCCC-eeecCCC
Q psy5752          82 CQCAQGYVGNGV-ECHPLKS  100 (222)
Q Consensus        82 C~C~~G~~~~~~-~C~~~~~  100 (222)
                      |.|++||..+.. .|+...+
T Consensus        35 C~C~~G~v~~~~~~CV~~~~   54 (55)
T PF01826_consen   35 CFCPPGYVRNDNGRCVPPSE   54 (55)
T ss_dssp             EEETTTEEEETTSEEEEGGG
T ss_pred             CCCCCCeeEcCCCCEEcHHH
Confidence            899999984433 6665543


No 70 
>PF11403 Yeast_MT:  Yeast metallothionein;  InterPro: IPR022710  Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=20.58  E-value=94  Score=16.83  Aligned_cols=9  Identities=33%  Similarity=0.918  Sum_probs=4.0

Q ss_pred             eeecCCCCc
Q psy5752          37 KCICNFGYT   45 (222)
Q Consensus        37 ~C~C~~G~~   45 (222)
                      +|.|+.|-.
T Consensus        23 scscptgcn   31 (40)
T PF11403_consen   23 SCSCPTGCN   31 (40)
T ss_dssp             S-SS-TTTT
T ss_pred             cCCCCCCCC
Confidence            567766543


Done!