Query         psy15668
Match_columns 365
No_of_seqs    333 out of 2518
Neff          9.4 
Searched_HMMs 46136
Date          Fri Aug 16 18:31:07 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15668.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15668hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.7   5E-17 1.1E-21  163.4  17.2  108    5-137  1177-1308(2531)
  2 KOG1214|consensus               99.6 5.4E-15 1.2E-19  142.8  14.4  163   67-266   692-865 (1289)
  3 KOG1214|consensus               99.6   2E-13 4.3E-18  132.2  18.6  204   12-269   699-919 (1289)
  4 KOG4289|consensus               99.5 3.8E-13 8.2E-18  136.1  15.2   90   83-196  1218-1308(2531)
  5 KOG1217|consensus               99.5 2.9E-12 6.2E-17  125.4  20.5  274   10-346    92-389 (487)
  6 KOG1219|consensus               99.4 4.4E-13 9.5E-18  140.2   8.6  109    7-141  3864-3974(4289)
  7 KOG1219|consensus               99.4 6.8E-13 1.5E-17  138.8   8.5  108  110-259  3865-3974(4289)
  8 KOG1217|consensus               99.3 4.4E-11 9.6E-16  117.0  17.2  202    8-259   170-389 (487)
  9 KOG1225|consensus               98.9 2.8E-08   6E-13   95.5  15.0  131   88-347   235-365 (525)
 10 KOG0994|consensus               98.9 1.9E-08   4E-13  101.3  13.5  233   80-355   878-1152(1758)
 11 KOG1225|consensus               98.9 1.7E-08 3.6E-13   97.0  11.7  132   28-260   234-365 (525)
 12 KOG4260|consensus               98.8   6E-09 1.3E-13   89.4   4.7  150   12-198   149-304 (350)
 13 KOG0994|consensus               98.7 2.2E-07 4.8E-12   93.8  14.7  231   88-352   842-1101(1758)
 14 KOG4260|consensus               98.5 3.1E-07 6.7E-12   79.1   5.9  135   74-257   152-304 (350)
 15 PF07645 EGF_CA:  Calcium-bindi  98.4 1.7E-07 3.8E-12   58.9   2.1   34    6-39      1-36  (42)
 16 PF00008 EGF:  EGF-like domain   98.3 3.9E-07 8.5E-12   53.4   1.6   30   10-39      1-31  (32)
 17 smart00179 EGF_CA Calcium-bind  98.2 2.4E-06 5.2E-11   52.6   4.0   34    6-39      1-36  (39)
 18 PF06247 Plasmod_Pvs28:  Plasmo  98.1 3.2E-07   7E-12   75.0  -1.3  141   77-260    10-163 (197)
 19 PF07645 EGF_CA:  Calcium-bindi  97.9 3.5E-06 7.6E-11   52.9   0.9   32  308-344     1-34  (42)
 20 PF00008 EGF:  EGF-like domain   97.9 3.6E-06 7.8E-11   49.3   0.6   32  312-347     1-32  (32)
 21 cd00054 EGF_CA Calcium-binding  97.9 2.5E-05 5.3E-10   47.6   4.0   34    6-39      1-35  (38)
 22 PF12947 EGF_3:  EGF domain;  I  97.8 1.3E-05 2.8E-10   48.1   2.0   29   13-41      6-34  (36)
 23 smart00179 EGF_CA Calcium-bind  97.7 6.1E-05 1.3E-09   46.2   3.8   34  309-347     2-37  (39)
 24 PF12947 EGF_3:  EGF domain;  I  97.6 3.2E-05   7E-10   46.4   1.7   29  233-261     6-34  (36)
 25 KOG1836|consensus               97.6   0.004 8.6E-08   68.5  18.5  242   80-355   749-1027(1705)
 26 cd00053 EGF Epidermal growth f  97.3 0.00028 6.1E-09   42.0   3.6   30   10-39      2-32  (36)
 27 KOG1226|consensus               97.3  0.0041 8.9E-08   61.9  12.9  143  131-362   479-635 (783)
 28 smart00181 EGF Epidermal growt  97.3 0.00037 7.9E-09   41.6   3.7   30    9-39      1-31  (35)
 29 PF06247 Plasmod_Pvs28:  Plasmo  97.2   5E-05 1.1E-09   62.4  -1.0  148   14-201     7-163 (197)
 30 cd00054 EGF_CA Calcium-binding  97.2 0.00053 1.2E-08   41.4   3.7   34  309-347     2-36  (38)
 31 KOG1226|consensus               97.2  0.0052 1.1E-07   61.2  11.9  128   88-267   479-622 (783)
 32 PF12662 cEGF:  Complement Clr-  97.1 0.00044 9.6E-09   37.2   2.5   24   27-54      1-24  (24)
 33 KOG1836|consensus               97.1   0.014   3E-07   64.4  15.1  174  171-353   777-977 (1705)
 34 cd00053 EGF Epidermal growth f  96.6  0.0027 5.9E-08   37.6   3.5   28  314-346     5-32  (36)
 35 PF12662 cEGF:  Complement Clr-  96.4  0.0033 7.1E-08   33.8   2.2   23  247-269     1-24  (24)
 36 smart00181 EGF Epidermal growt  96.4  0.0047   1E-07   36.6   3.2   29  312-346     2-31  (35)
 37 PF07974 EGF_2:  EGF-like domai  96.2  0.0082 1.8E-07   34.9   3.3   25   13-39      6-30  (32)
 38 PF14670 FXa_inhibition:  Coagu  96.1  0.0039 8.4E-08   37.3   1.9   23   15-39      8-30  (36)
 39 PF07974 EGF_2:  EGF-like domai  94.8   0.047   1E-06   31.7   3.3   24  234-259     7-30  (32)
 40 PF14670 FXa_inhibition:  Coagu  94.5   0.036 7.8E-07   33.2   2.3   22  238-259     9-30  (36)
 41 PF12661 hEGF:  Human growth fa  94.2    0.02 4.3E-07   26.0   0.6   11  249-259     1-11  (13)
 42 PF12946 EGF_MSP1_1:  MSP1 EGF   91.5     0.1 2.2E-06   31.2   1.1   30   10-39      2-32  (37)
 43 PF12946 EGF_MSP1_1:  MSP1 EGF   87.4    0.34 7.5E-06   28.9   1.2   28  173-200     4-32  (37)
 44 KOG3512|consensus               86.4     3.9 8.6E-05   39.0   8.2  158  180-347   285-476 (592)
 45 cd01475 vWA_Matrilin VWA_Matri  85.3    0.97 2.1E-05   39.5   3.6   38  303-347   181-220 (224)
 46 cd01475 vWA_Matrilin VWA_Matri  81.6     1.4 2.9E-05   38.6   3.0   21  239-259   199-219 (224)
 47 smart00051 DSL delta serrate l  80.5     2.6 5.6E-05   28.7   3.4   23  317-347    40-62  (63)
 48 KOG3516|consensus               79.6     1.3 2.9E-05   46.8   2.5   33    7-39    545-578 (1306)
 49 KOG1218|consensus               77.4      46   0.001   30.3  12.0   14   26-39     13-26  (316)
 50 PHA02887 EGF-like protein; Pro  75.9     2.9 6.2E-05   32.0   2.7   36    7-46     83-123 (126)
 51 PHA03099 epidermal growth fact  75.5     3.1 6.7E-05   32.4   2.9   37    6-46     41-82  (139)
 52 PF00053 Laminin_EGF:  Laminin   75.4     1.9   4E-05   27.6   1.5   22  333-354    16-37  (49)
 53 PF00954 S_locus_glycop:  S-loc  74.9     3.4 7.4E-05   31.6   3.1   32    6-38     76-108 (110)
 54 smart00051 DSL delta serrate l  74.1     4.9 0.00011   27.4   3.3   44   87-141    17-61  (63)
 55 cd00055 EGF_Lam Laminin-type e  72.7     5.7 0.00012   25.5   3.3   20  335-354    19-38  (50)
 56 PHA02887 EGF-like protein; Pro  71.7     3.5 7.6E-05   31.5   2.3   30  233-266    92-123 (126)
 57 PHA03099 epidermal growth fact  67.6     5.6 0.00012   31.0   2.7   31  233-267    51-83  (139)
 58 KOG3514|consensus               61.2     5.6 0.00012   41.9   2.2   31    9-39    625-656 (1591)
 59 PF09064 Tme5_EGF_like:  Thromb  59.8     8.8 0.00019   22.4   1.9   13  335-347    18-30  (34)
 60 PF01683 EB:  EB module;  Inter  58.8      24 0.00053   22.6   4.3   28  310-346    20-48  (52)
 61 smart00180 EGF_Lam Laminin-typ  58.7      12 0.00026   23.6   2.7   20  335-354    18-37  (46)
 62 KOG3516|consensus               55.0     9.7 0.00021   40.7   2.7   37  306-347   542-579 (1306)
 63 PF00954 S_locus_glycop:  S-loc  54.4      12 0.00025   28.6   2.5   24   74-98     86-109 (110)
 64 KOG3512|consensus               49.9      43 0.00094   32.3   5.8   62  285-353   364-432 (592)
 65 PF12955 DUF3844:  Domain of un  48.5     7.8 0.00017   29.2   0.7   35    5-39      3-44  (103)
 66 KOG1218|consensus               37.2 3.2E+02  0.0069   24.7  12.7   40   92-141    96-135 (316)
 67 KOG3514|consensus               36.9      25 0.00054   37.5   2.3   32  311-347   625-657 (1591)
 68 PF01414 DSL:  Delta serrate li  36.2      16 0.00035   24.8   0.7   14  188-201    16-29  (63)
 69 KOG3607|consensus               26.7      63  0.0014   33.5   3.3   49   52-106   603-658 (716)

No 1  
>KOG4289|consensus
Probab=99.74  E-value=5e-17  Score=163.42  Aligned_cols=108  Identities=31%  Similarity=0.749  Sum_probs=89.3

Q ss_pred             cCCCCCCCCCCCCCCeeeec----------------------CCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCccc
Q psy15668          5 MGGDPCSPNPCGSNTQCNVA----------------------SNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACK   62 (365)
Q Consensus         5 ~did~C~~~~C~~~~~C~~~----------------------~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~   62 (365)
                      .|-+.|...||.+..+|+.+                      .+++.|.||+||+|+   .|+.    .           
T Consensus      1177 fdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd---~CeT----e----------- 1238 (2531)
T KOG4289|consen 1177 FDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGD---YCET----E----------- 1238 (2531)
T ss_pred             ccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcc---cccc----h-----------
Confidence            45677999999998889743                      468899999999999   6653    2           


Q ss_pred             CCccccccccC-CCCCCeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeC-CCCceeeCCCC
Q psy15668         63 EYRCVDVCAGQ-CGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVI-NMVPTCSCLPG  137 (365)
Q Consensus        63 ~~~C~~~C~~~-C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~-~~~~~C~C~~G  137 (365)
                          +|.|... |+++++|...+|+|.|.|.+||+|.   .|+.......|.+..|.++++|++. .+++.|.|+.|
T Consensus      1239 ----iDlCYs~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1239 ----IDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             ----hHhhhcCCCCCCCceEEecCceeEEecCCcccc---ceeeecccCccccceecCCCEEeecCCCceeccCCCc
Confidence                4444444 8999999999999999999999999   9986644466888999999999987 57888999987


No 2  
>KOG1214|consensus
Probab=99.62  E-value=5.4e-15  Score=142.81  Aligned_cols=163  Identities=29%  Similarity=0.647  Sum_probs=117.1

Q ss_pred             ccccccC---CCCCCeeeecCC-CceeeCCCCCccCCCCCcccCCCCCCCCC--CCCCCCCeeeeCCCCceeeCCCCCCC
Q psy15668         67 VDVCAGQ---CGVNSECNVRNH-IPVCSCPPGYTGDPLTQCRRFDPQELCDR--SPCGVNTRCEVINMVPTCSCLPGYTG  140 (365)
Q Consensus        67 ~~~C~~~---C~~~~~C~~~~g-~~~C~C~~G~~g~~~~~C~~~~~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g  140 (365)
                      +++|...   |..++.|....+ .|.|.|..||.|+++ .|.++   ++|+.  ..|++++.|++.+++|+|.|..||..
T Consensus       692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr-~c~d~---~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F  767 (1289)
T KOG1214|consen  692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGR-NCVDE---NECATGFHRCGPNSVCINLPGSYRCECRSGYEF  767 (1289)
T ss_pred             cccceecCcccCCCccccCCCCcceEEEEeeccCCCCC-CCCCh---hhhccCCCCCCCCceeecCCCceeEEEeeccee
Confidence            4566543   888899987654 599999999999988 79887   78875  66999999999999999999998864


Q ss_pred             CCCCCCCCCCCCCCCCCCCCcccCCcccCCCCC--CCCCCCC--eeeeC-CCceeeeCCCCCccCCCCccCccccCCCCC
Q psy15668        141 SPLSGCRHECDSDYDCGPSQSCVNYKCANPCAS--GACAPTA--QCEVR-NHRAVCSCPVGYLGDPYTSCRAECLAHSDC  215 (365)
Q Consensus       141 ~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~~--~~C~~~~--~C~~~-~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C  215 (365)
                      .   .+...|....+=.+         ++.|..  ..|...+  +|+.. .+.|.|.|.+||.|++..     |.++   
T Consensus       768 ~---dd~~tCV~i~~pap---------~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-----c~dv---  827 (1289)
T KOG1214|consen  768 A---DDRHTCVLITPPAP---------ANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-----CTDV---  827 (1289)
T ss_pred             c---cCCcceEEecCCCC---------CCccccCccccCcCCceEEEecCCceEEEeecCCccCCccc-----cccc---
Confidence            4   12223332211111         233332  3355444  45544 456999999999999753     3333   


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCCCCcccc
Q psy15668        216 PTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDPFVRCRP  266 (365)
Q Consensus       216 ~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~  266 (365)
                          ++|+.        ..|+.+++|++++++|.|+|.+||+|+++ .|.+
T Consensus       828 ----DeC~p--------srChp~A~CyntpgsfsC~C~pGy~GDGf-~CVP  865 (1289)
T KOG1214|consen  828 ----DECSP--------SRCHPAATCYNTPGSFSCRCQPGYYGDGF-QCVP  865 (1289)
T ss_pred             ----cccCc--------cccCCCceEecCCCcceeecccCccCCCc-eecC
Confidence                44432        44788999999999999999999999987 5765


No 3  
>KOG1214|consensus
Probab=99.56  E-value=2e-13  Score=132.22  Aligned_cols=204  Identities=25%  Similarity=0.603  Sum_probs=141.3

Q ss_pred             CCCCCCCCeeeec-CCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccCCCCCCeeeecCCCceee
Q psy15668         12 PNPCGSNTQCNVA-SNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNVRNHIPVCS   90 (365)
Q Consensus        12 ~~~C~~~~~C~~~-~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~   90 (365)
                      +.-|..++.|... .-.|+|.|..||.|++      ..|.|.++|...            ...|+.++.|++.+++|+|.
T Consensus       699 sh~cdt~a~C~pg~~~~~tcecs~g~~gdg------r~c~d~~eca~~------------~~~CGp~s~Cin~pg~~rce  760 (1289)
T KOG1214|consen  699 SHMCDTTARCHPGTGVDYTCECSSGYQGDG------RNCVDENECATG------------FHRCGPNSVCINLPGSYRCE  760 (1289)
T ss_pred             CcccCCCccccCCCCcceEEEEeeccCCCC------CCCCChhhhccC------------CCCCCCCceeecCCCceeEE
Confidence            4446677778854 3569999999999994      456677766643            23488999999999999999


Q ss_pred             CCCCCc--cCCCCCcccCCC---CCCCCC--CCCCCCCe--eeeC-CCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668         91 CPPGYT--GDPLTQCRRFDP---QELCDR--SPCGVNTR--CEVI-NMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQ  160 (365)
Q Consensus        91 C~~G~~--g~~~~~C~~~~~---~~~C~~--~~C~~~~~--C~~~-~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~  160 (365)
                      |..||.  ++.. +|..+.+   .+.|..  +.|...+.  |+.. .+.|.|.|.+||+|++.     .|.+        
T Consensus       761 C~~gy~F~dd~~-tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~-----~c~d--------  826 (1289)
T KOG1214|consen  761 CRSGYEFADDRH-TCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH-----QCTD--------  826 (1289)
T ss_pred             EeecceeccCCc-ceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc-----cccc--------
Confidence            999885  3333 6876542   244542  44554444  4444 45799999999999865     4444        


Q ss_pred             cccCCcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCC----CCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668        161 SCVNYKCANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAH----SDCPTDRPSCLGNKCMNPCAGQCG  236 (365)
Q Consensus       161 ~C~~~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~----~~C~~~~~~C~~~~C~~~c~~~C~  236 (365)
                             +|+|.++.|...+.|.+++++|.|.|.+||.|+++.     |+..    ..|+...    -.|      ..|+
T Consensus       827 -------vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~-----CVP~~~~~T~C~~er----~hp------l~ch  884 (1289)
T KOG1214|consen  827 -------VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQ-----CVPDTSSLTPCEQER----FHP------LQCH  884 (1289)
T ss_pred             -------ccccCccccCCCceEecCCCcceeecccCccCCCce-----ecCCCccCCcccccc----ccc------eeec
Confidence                   688988889999999999999999999999999864     2222    1222210    001      2355


Q ss_pred             CCceeee--cCCCceeeCCCCCccCCCCccccCCC
Q psy15668        237 INAKCEV--RGATPICSCPRDMTGDPFVRCRPFDK  269 (365)
Q Consensus       237 ~~~~C~~--~~~~~~C~C~~G~~g~~~~~C~~~~~  269 (365)
                      .+..|.-  .+..+.+.|.++-.|++..+|.++++
T Consensus       885 g~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~  919 (1289)
T KOG1214|consen  885 GSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPE  919 (1289)
T ss_pred             cccceeEeeCCCcccCCCCCCCCCCCCCCCCCccc
Confidence            4443321  25678899888888887667876543


No 4  
>KOG4289|consensus
Probab=99.50  E-value=3.8e-13  Score=136.08  Aligned_cols=90  Identities=39%  Similarity=0.868  Sum_probs=75.8

Q ss_pred             cCCCceeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcc
Q psy15668         83 RNHIPVCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSC  162 (365)
Q Consensus        83 ~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C  162 (365)
                      ..++++|.|++||+|+   .|+..+  |+|-+.||.+++.|....++|+|.|++||+|.       .|+...   .    
T Consensus      1218 pvnglrCrCPpGFTgd---~CeTei--DlCYs~pC~nng~C~srEggYtCeCrpg~tGe-------hCEvs~---~---- 1278 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGD---YCETEI--DLCYSGPCGNNGRCRSREGGYTCECRPGFTGE-------HCEVSA---R---- 1278 (2531)
T ss_pred             ccCceeEeCCCCCCcc---cccchh--HhhhcCCCCCCCceEEecCceeEEecCCcccc-------ceeeec---c----
Confidence            4577899999999999   898764  89999999999999999999999999999999       887541   1    


Q ss_pred             cCCcccCCCCCCCCCCCCeeeeC-CCceeeeCCCC
Q psy15668        163 VNYKCANPCASGACAPTAQCEVR-NHRAVCSCPVG  196 (365)
Q Consensus       163 ~~~~c~~~C~~~~C~~~~~C~~~-~g~~~C~C~~G  196 (365)
                           .-.|.++.|.++++|++. .++|.|.|+.|
T Consensus      1279 -----agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1279 -----AGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             -----cCccccceecCCCEEeecCCCceeccCCCc
Confidence                 224556788889999864 46788999888


No 5  
>KOG1217|consensus
Probab=99.49  E-value=2.9e-12  Score=125.44  Aligned_cols=274  Identities=26%  Similarity=0.558  Sum_probs=168.3

Q ss_pred             CCCCCCCCCCeeeecCCCceeeCCCCCccCCCCCccCC-CccCCCCCCCCCcccCCccccccccCCCCCCeeee---cCC
Q psy15668         10 CSPNPCGSNTQCNVASNRPVCSCLPGHWGNPLTYCQRG-ECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNV---RNH   85 (365)
Q Consensus        10 C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~-~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~---~~g   85 (365)
                      +...+....+.+......|.|.|++||.|.   .++.. .|.....                  .+...+.|..   ...
T Consensus        92 ~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~---~~~~~~~C~~~~~------------------~~~~~~~c~~~~~~~~  150 (487)
T KOG1217|consen   92 CRSPCLLLCGECVDCVGSYECTCPPGYQGT---PCEGECECVTGPG------------------VCCIDGSCSNGPGSVG  150 (487)
T ss_pred             ccCCcccCCccccCCCCCceeeCCCccccC---cCCcceeecCCCC------------------CeeCchhhcCCCCCCC
Confidence            333344455667778899999999999998   33322 2322211                  0111233443   345


Q ss_pred             CceeeCCCCCccCCCCCcccCCCCCCCC--CCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCccc
Q psy15668         86 IPVCSCPPGYTGDPLTQCRRFDPQELCD--RSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCV  163 (365)
Q Consensus        86 ~~~C~C~~G~~g~~~~~C~~~~~~~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~  163 (365)
                      .|.|.|..||.+.   .+....  ++|.  ..+|.+.+.|.+..++|.|.|+++|.+.       .++..   .....|.
T Consensus       151 ~~~c~C~~g~~~~---~~~~~~--~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~-------~~~~~---~~~~~c~  215 (487)
T KOG1217|consen  151 PFRCSCTEGYEGE---PCETDL--DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS-------TCETT---GNGGTCV  215 (487)
T ss_pred             ceeeeeCCCcccc---cccccc--cccccCCCCcCCCcccccCCCCeeEeCCCCccCC-------cCcCC---CCCceEe
Confidence            7899999999998   665432  4676  3569989999999999999999999988       44332   1111111


Q ss_pred             CC-cc-------cCCCCC--CCCCCC-CeeeeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy15668        164 NY-KC-------ANPCAS--GACAPT-AQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCA  232 (365)
Q Consensus       164 ~~-~c-------~~~C~~--~~C~~~-~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~  232 (365)
                      .. .+       .+.|..  ..+... ++|++..++|.|.|++||.+...    ..+.+++.|.       ...      
T Consensus       216 ~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~----~~~~~~~~C~-------~~~------  278 (487)
T KOG1217|consen  216 DSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDAC----VTCVDVDSCA-------LIA------  278 (487)
T ss_pred             cceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCcccccc----ceeeeccccC-------CCC------
Confidence            00 00       111111  112222 78999999999999999998841    1122233333       221      


Q ss_pred             CCCCCCceeeecCCCceeeCCCCCccCCCCccccCCCC-----Ccccccccccceeee-cCCccceeeeeecCCCCCCCC
Q psy15668        233 GQCGINAKCEVRGATPICSCPRDMTGDPFVRCRPFDKY-----VAPLINDYLKIYWRY-QNNKTIFYVSLVSLNYPYVTP  306 (365)
Q Consensus       233 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~~-----~~~~~~~~~~~~~~c-~~~~~~~~~~~c~~~~~~~~~  306 (365)
                      . |.++++|++..+.|.|.|++||+|.....+......     ...|.++.     .| ..+....+.+.+..+|.+..|
T Consensus       279 ~-c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~-----~C~~~~~~~~~~C~c~~~~~g~~C  352 (487)
T KOG1217|consen  279 S-CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGG-----TCNTLGSFGGFRCACGPGFTGRRC  352 (487)
T ss_pred             c-cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCc-----ccccCCCCCCCCcCCCCCCCCCcc
Confidence            1 556899999998899999999999832011111110     01111111     11 111112233456667778778


Q ss_pred             CCC-CCCCCCCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668        307 LPD-DLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD  346 (365)
Q Consensus       307 ~~~-d~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~  346 (365)
                      ... ++|...++..++.|.+    ...++|.|.|+.+|.+.
T Consensus       353 ~~~~~~C~~~~~~~~~~c~~----~~~~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  353 EDSNDECASSPCCPGGTCVN----ETPGSYRCACPAGFAGK  389 (487)
T ss_pred             ccCCccccCCccccCCEecc----CCCCCeEecCCCccccC
Confidence            777 4998888889999997    24688999999999984


No 6  
>KOG1219|consensus
Probab=99.41  E-value=4.4e-13  Score=140.15  Aligned_cols=109  Identities=32%  Similarity=0.735  Sum_probs=96.1

Q ss_pred             CCCCCCCCCCCCCeeeecC-CCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccC-CCCCCeeeecC
Q psy15668          7 GDPCSPNPCGSNTQCNVAS-NRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQ-CGVNSECNVRN   84 (365)
Q Consensus         7 id~C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~-C~~~~~C~~~~   84 (365)
                      .+.|..+||+++|+|+.++ ++|.|.|++.|+|.   +|+.    +               +.+|... |..+++|+...
T Consensus      3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CEi----~---------------~epC~snPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCEI----D---------------LEPCASNPCLTGGTCIPFY 3921 (4289)
T ss_pred             ccccccCcccCCCEecCCCCCceEEeCcccccCc---cccc----c---------------cccccCCCCCCCCEEEecC
Confidence            3889999999999999776 77999999999999   7764    2               3344444 88899999999


Q ss_pred             CCceeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCC
Q psy15668         85 HIPVCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGS  141 (365)
Q Consensus        85 g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~  141 (365)
                      +.|.|.|+.||+|.   +|+.. .+++|+.++|..++.|++..|+|+|.|.+||.|.
T Consensus      3922 n~f~CnC~~gyTG~---~Ce~~-Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3922 NGFLCNCPNGYTGK---RCEAR-GISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred             CCeeEeCCCCccCc---eeecc-cccccccccccCCceeeccCCceEeccChhHhcc
Confidence            99999999999999   99875 1389999999999999999999999999999988


No 7  
>KOG1219|consensus
Probab=99.39  E-value=6.8e-13  Score=138.80  Aligned_cols=108  Identities=29%  Similarity=0.727  Sum_probs=97.0

Q ss_pred             CCCCCCCCCCCCeeeeC-CCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCCCCCCCCCCeeeeCCCc
Q psy15668        110 ELCDRSPCGVNTRCEVI-NMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNYKCANPCASGACAPTAQCEVRNHR  188 (365)
Q Consensus       110 ~~C~~~~C~~~~~C~~~-~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~~g~  188 (365)
                      +.|..+||++++.|... .++|.|.|++.|+|.       .|+..              +++|.++||..+++|+...++
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~-------~CEi~--------------~epC~snPC~~GgtCip~~n~ 3923 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN-------HCEID--------------LEPCASNPCLTGGTCIPFYNG 3923 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCc-------ccccc--------------cccccCCCCCCCCEEEecCCC
Confidence            67888999999999987 468999999999999       99987              789999999999999999999


Q ss_pred             eeeeCCCCCccCCCCccCccccCCCCCCCC-CCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccC
Q psy15668        189 AVCSCPVGYLGDPYTSCRAECLAHSDCPTD-RPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGD  259 (365)
Q Consensus       189 ~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~-~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~  259 (365)
                      |.|.|+.||+|.             .|+.. +++|...+        |..+|.|+|..|+|+|.|.+||.|+
T Consensus      3924 f~CnC~~gyTG~-------------~Ce~~Gi~eCs~n~--------C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3924 FLCNCPNGYTGK-------------RCEARGISECSKNV--------CGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred             eeEeCCCCccCc-------------eeeccccccccccc--------ccCCceeeccCCceEeccChhHhcc
Confidence            999999999999             56655 66776655        5569999999999999999999998


No 8  
>KOG1217|consensus
Probab=99.33  E-value=4.4e-11  Score=117.03  Aligned_cols=202  Identities=29%  Similarity=0.695  Sum_probs=135.5

Q ss_pred             CCCC--CCCCCCCCeeeecCCCceeeCCCCCccCCCCCccCC----CccCCCCCCCCCcccCCccccccccC---CCCC-
Q psy15668          8 DPCS--PNPCGSNTQCNVASNRPVCSCLPGHWGNPLTYCQRG----ECQDHSDCSHSKACKEYRCVDVCAGQ---CGVN-   77 (365)
Q Consensus         8 d~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~----~C~~~~~C~~~~~C~~~~C~~~C~~~---C~~~-   77 (365)
                      ++|.  ..+|.++++|.+..++|.|.|++||.+.   .++..    .|.+...|.....   .. ...|...   |... 
T Consensus       170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~---~~~~~~~~~~c~~~~~~~~~~g---~~-~~~c~~~~~~~~~~~  242 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS---TCETTGNGGTCVDSVACSCPPG---AR-GPECEVSIVECASGD  242 (487)
T ss_pred             cccccCCCCcCCCcccccCCCCeeEeCCCCccCC---cCcCCCCCceEecceeccCCCC---CC-CCCcccccccccCCC
Confidence            6786  4569999999999999999999999998   34322    2322211110000   00 1111111   3323 


Q ss_pred             CeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCCCC-CCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668         78 SECNVRNHIPVCSCPPGYTGDPLTQCRRFDPQELCDRSP-CGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDC  156 (365)
Q Consensus        78 ~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~~~-C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C  156 (365)
                      ++|++..++|.|.|++||++.....+.++   ++|.... |.+++.|++..+.|.|.|++||.+.       .+   ..+
T Consensus       243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~~---~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~-------~~---~~~  309 (487)
T KOG1217|consen  243 GTCVNTVGSYTCRCPEGYTGDACVTCVDV---DSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR-------LC---TEC  309 (487)
T ss_pred             CcccccCCceeeeCCCCccccccceeeec---cccCCCCccCCCCeeecCCCcceeeCCCCCCCC-------CC---ccc
Confidence            88999999999999999999831134555   7787753 8889999999998999999999998       43   011


Q ss_pred             CCCCcccCCcccCCC----CCCCCCCCCee--eeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCC
Q psy15668        157 GPSQSCVNYKCANPC----ASGACAPTAQC--EVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNP  230 (365)
Q Consensus       157 ~~~~~C~~~~c~~~C----~~~~C~~~~~C--~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~  230 (365)
                      ..         ..+|    ...+|..++.|  ....+.+.|.|..||.|.             .|+...++|...+    
T Consensus       310 ~~---------~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~-------------~C~~~~~~C~~~~----  363 (487)
T KOG1217|consen  310 VD---------VDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR-------------RCEDSNDECASSP----  363 (487)
T ss_pred             cc---------cccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC-------------ccccCCccccCCc----
Confidence            11         1233    23457777777  344557889999998888             4443223444333    


Q ss_pred             CCCCCCCCceeee-cCCCceeeCCCCCccC
Q psy15668        231 CAGQCGINAKCEV-RGATPICSCPRDMTGD  259 (365)
Q Consensus       231 c~~~C~~~~~C~~-~~~~~~C~C~~G~~g~  259 (365)
                          +..++.|++ ..++|.|.|+.+|.+.
T Consensus       364 ----~~~~~~c~~~~~~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  364 ----CCPGGTCVNETPGSYRCACPAGFAGK  389 (487)
T ss_pred             ----cccCCEeccCCCCCeEecCCCccccC
Confidence                445889998 6899999999999874


No 9  
>KOG1225|consensus
Probab=98.93  E-value=2.8e-08  Score=95.48  Aligned_cols=131  Identities=34%  Similarity=0.887  Sum_probs=96.4

Q ss_pred             eeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCcc
Q psy15668         88 VCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNYKC  167 (365)
Q Consensus        88 ~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c  167 (365)
                      .|.|..+|+|.   .|...    .|.. .|..++.|++.    +|.|++||+|.       .|...              
T Consensus       235 ic~c~~~~~g~---~c~~~----~C~~-~c~~~g~c~~G----~CIC~~Gf~G~-------dC~e~--------------  281 (525)
T KOG1225|consen  235 ICECPEGYFGP---LCSTI----YCPG-GCTGRGQCVEG----RCICPPGFTGD-------DCDEL--------------  281 (525)
T ss_pred             eeecCCceeCC---ccccc----cCCC-CCcccceEeCC----eEeCCCCCcCC-------CCCcc--------------
Confidence            79999999998   77743    3543 45556778766    89999999999       66642              


Q ss_pred             cCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeeecCCC
Q psy15668        168 ANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCAGQCGINAKCEVRGAT  247 (365)
Q Consensus       168 ~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~  247 (365)
                        .|... |..++.+++.    .|.|++||.|.             .|....  |         ...|+.++.|+    .
T Consensus       282 --~Cp~~-cs~~g~~~~g----~CiC~~g~~G~-------------dCs~~~--c---------padC~g~G~Ci----~  326 (525)
T KOG1225|consen  282 --VCPVD-CSGGGVCVDG----ECICNPGYSGK-------------DCSIRR--C---------PADCSGHGKCI----D  326 (525)
T ss_pred             --cCCcc-cCCCceecCC----EeecCCCcccc-------------cccccc--C---------CccCCCCCccc----C
Confidence              34333 6666666543    69999999999             555422  2         24577799998    2


Q ss_pred             ceeeCCCCCccCCCCccccCCCCCcccccccccceeeecCCccceeeeeecCCCCCCCCCCCCCCCCCCCCCCCeecCCC
Q psy15668        248 PICSCPRDMTGDPFVRCRPFDKYVAPLINDYLKIYWRYQNNKTIFYVSLVSLNYPYVTPLPDDLCEPNPCGENAKCQPGY  327 (365)
Q Consensus       248 ~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~~~~~~~~~~d~C~~~~C~~~~~C~~~~  327 (365)
                      -+|.|.+||+|.   .|+.                                           .     .|.+++.|++  
T Consensus       327 G~C~C~~Gy~G~---~C~~-------------------------------------------~-----~C~~~g~cv~--  353 (525)
T KOG1225|consen  327 GECLCDEGYTGE---LCIQ-------------------------------------------R-----ACSGGGQCVN--  353 (525)
T ss_pred             CceEeCCCCcCC---cccc-------------------------------------------c-----ccCCCceecc--
Confidence            379999999998   6763                                           1     3788889987  


Q ss_pred             CCCCCCCceeeCCCCCccCC
Q psy15668        328 DKSGKDRPVCTCLPGYVGDA  347 (365)
Q Consensus       328 ~~~~~~~~~C~C~~G~~g~~  347 (365)
                             . |.|..||.|..
T Consensus       354 -------g-C~C~~Gw~G~d  365 (525)
T KOG1225|consen  354 -------G-CKCKKGWRGPD  365 (525)
T ss_pred             -------C-ceeccCccCCC
Confidence                   3 99999999887


No 10 
>KOG0994|consensus
Probab=98.91  E-value=1.9e-08  Score=101.25  Aligned_cols=233  Identities=23%  Similarity=0.441  Sum_probs=124.7

Q ss_pred             eeecCCCcee-eCCCCCccCCCCCcccCCCCCCCCCCCCCCCC--------eeee--CCCCceeeCCCCCCCCCCCCCCC
Q psy15668         80 CNVRNHIPVC-SCPPGYTGDPLTQCRRFDPQELCDRSPCGVNT--------RCEV--INMVPTCSCLPGYTGSPLSGCRH  148 (365)
Q Consensus        80 C~~~~g~~~C-~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~--------~C~~--~~~~~~C~C~~G~~g~~~~~~~~  148 (365)
                      |.+...++.| .|..||.|+++..-.     ..|.+-||..+.        .|..  ......|.|.+||+|.       
T Consensus       878 CqD~T~G~~CdrCl~GyyGdP~lg~g-----~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~-------  945 (1758)
T KOG0994|consen  878 CQDSTTGHSCDRCLDGYYGDPRLGSG-----IGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGS-------  945 (1758)
T ss_pred             ccccccccchhhhhccccCCcccCCC-----CCCCCCCCCCCCccchhccccccccccccceeeecccCcccc-------
Confidence            5566777889 899999998752211     346655554421        2322  2334579999999998       


Q ss_pred             CCCCC-----CCCCCCCcccCCcc---cCCCCCCCCCC-CCee---eeCCCceee-eCCCCCccCCCCccCccccCCCCC
Q psy15668        149 ECDSD-----YDCGPSQSCVNYKC---ANPCASGACAP-TAQC---EVRNHRAVC-SCPVGYLGDPYTSCRAECLAHSDC  215 (365)
Q Consensus       149 ~C~~~-----~~C~~~~~C~~~~c---~~~C~~~~C~~-~~~C---~~~~g~~~C-~C~~G~~g~~~~~~~~~C~~~~~C  215 (365)
                      .|+.=     ..-..+++|..-.|   ||.-.+..|.. .|.|   .....+-+| .|.+||.|+.....-..|    .|
T Consensus       946 RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC----~C 1021 (1758)
T KOG0994|consen  946 RCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRC----VC 1021 (1758)
T ss_pred             chhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhh----ec
Confidence            55421     00112556654322   44444444542 3334   333334456 799999998643211111    11


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCCCCccccCCC------CCccc--------ccccccc
Q psy15668        216 PTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDPFVRCRPFDK------YVAPL--------INDYLKI  281 (365)
Q Consensus       216 ~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~------~~~~~--------~~~~~~~  281 (365)
                      ..-.              . .+..+|..  -+-+|.|.|...|.....|.....      .+.+|        ..+...+
T Consensus      1022 n~LG--------------T-n~~~~CDr--~tGQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG 1084 (1758)
T KOG0994|consen 1022 NFLG--------------T-NSTCHCDR--FTGQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG 1084 (1758)
T ss_pred             cccc--------------c-CCcccccc--ccCcCCCCcccccccccccccchhccccCCCCCccCCCccCCcccccccc
Confidence            1000              0 00112221  222455555555542222222111      11111        2333455


Q ss_pred             eeeecCCccceeeeeecCCCCCCCCCCCCCCCCCCCCCCC----eecCCCCCCCCCCceeeCCCCCccCCCCCCcCCC
Q psy15668        282 YWRYQNNKTIFYVSLVSLNYPYVTPLPDDLCEPNPCGENA----KCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGE  355 (365)
Q Consensus       282 ~~~c~~~~~~~~~~~c~~~~~~~~~~~~d~C~~~~C~~~~----~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~~~  355 (365)
                      .+.|.+++++..|+.|..-|||..-+   .|..-.|...|    .|+.       ...+|+|.+|-.|..+++|...-
T Consensus      1085 QCqCkpGfGGR~C~qCqel~WGdP~~---~C~aCdCd~rG~~tpQCdr-------~tG~C~C~~Gv~G~rCdqCaRgy 1152 (1758)
T KOG0994|consen 1085 QCQCKPGFGGRTCSQCQELYWGDPNE---KCRACDCDPRGIETPQCDR-------ATGRCVCRPGVGGPRCDQCARGY 1152 (1758)
T ss_pred             ceeccCCCCCcchhHHHHhhcCCCCC---CceecCCCCCCCCCCCccc-------cCCceeecCCCCCcchhhhhhhh
Confidence            78899999999999999999985422   34332344332    3554       22489999999999988888653


No 11 
>KOG1225|consensus
Probab=98.88  E-value=1.7e-08  Score=96.97  Aligned_cols=132  Identities=32%  Similarity=0.874  Sum_probs=92.4

Q ss_pred             ceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccCCCCCCeeeecCCCceeeCCCCCccCCCCCcccCC
Q psy15668         28 PVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRFD  107 (365)
Q Consensus        28 ~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~  107 (365)
                      +.|.|+.||+|.   .|....|                     +..|..++.|++.    +|.|++||+|.   .|... 
T Consensus       234 ~ic~c~~~~~g~---~c~~~~C---------------------~~~c~~~g~c~~G----~CIC~~Gf~G~---dC~e~-  281 (525)
T KOG1225|consen  234 GICECPEGYFGP---LCSTIYC---------------------PGGCTGRGQCVEG----RCICPPGFTGD---DCDEL-  281 (525)
T ss_pred             ceeecCCceeCC---ccccccC---------------------CCCCcccceEeCC----eEeCCCCCcCC---CCCcc-
Confidence            368888888887   4433222                     2224445566543    79999999999   88764 


Q ss_pred             CCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCCCCCCCCCCeeeeCCC
Q psy15668        108 PQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNYKCANPCASGACAPTAQCEVRNH  187 (365)
Q Consensus       108 ~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~~g  187 (365)
                         .|... |..++.+++.    .|.|.+||+|.       .|+..                .|. ..|..++.|+.  +
T Consensus       282 ---~Cp~~-cs~~g~~~~g----~CiC~~g~~G~-------dCs~~----------------~cp-adC~g~G~Ci~--G  327 (525)
T KOG1225|consen  282 ---VCPVD-CSGGGVCVDG----ECICNPGYSGK-------DCSIR----------------RCP-ADCSGHGKCID--G  327 (525)
T ss_pred             ---cCCcc-cCCCceecCC----EeecCCCcccc-------ccccc----------------cCC-ccCCCCCcccC--C
Confidence               26544 7777777665    89999999999       77643                233 56888899982  2


Q ss_pred             ceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCC
Q psy15668        188 RAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDP  260 (365)
Q Consensus       188 ~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~  260 (365)
                        +|.|.+||+|.             .|...               .|..++.|++  +   |.|..||+|..
T Consensus       328 --~C~C~~Gy~G~-------------~C~~~---------------~C~~~g~cv~--g---C~C~~Gw~G~d  365 (525)
T KOG1225|consen  328 --ECLCDEGYTGE-------------LCIQR---------------ACSGGGQCVN--G---CKCKKGWRGPD  365 (525)
T ss_pred             --ceEeCCCCcCC-------------ccccc---------------ccCCCceecc--C---ceeccCccCCC
Confidence              49999999999             55542               1445777765  2   89999999874


No 12 
>KOG4260|consensus
Probab=98.79  E-value=6e-09  Score=89.44  Aligned_cols=150  Identities=24%  Similarity=0.532  Sum_probs=95.6

Q ss_pred             CCCCCCCCeee---ecCCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccCCCCCCeeeecCCCce
Q psy15668         12 PNPCGSNTQCN---VASNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQCGVNSECNVRNHIPV   88 (365)
Q Consensus        12 ~~~C~~~~~C~---~~~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~   88 (365)
                      ..||..+|.|.   ...|+..|.|.+||+|..+..|....=+... =..+..|.  .|...|..      .|. ..++-.
T Consensus       149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~R-ne~~lvCt--~Ch~~C~~------~Cs-g~~~k~  218 (350)
T KOG4260|consen  149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSR-NEQHLVCT--ACHEGCLG------VCS-GESSKG  218 (350)
T ss_pred             cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhc-ccccchhh--hhhhhhhc------ccC-CCCCCC
Confidence            46799999999   4558899999999999954333210000000 00011111  11112221      232 223335


Q ss_pred             e-eCCCCCccCCCCCcccCCCCCCCC--CCCCCCCCeeeeCCCCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccCC
Q psy15668         89 C-SCPPGYTGDPLTQCRRFDPQELCD--RSPCGVNTRCEVINMVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVNY  165 (365)
Q Consensus        89 C-~C~~G~~g~~~~~C~~~~~~~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~~  165 (365)
                      | +|+.||..+.. .|.++   ++|.  +.||..+..|+|+.|+|.|..++||.+.     .+.|+.-            
T Consensus       219 C~kCkkGW~lde~-gCvDv---nEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d~C~~~------------  277 (350)
T KOG4260|consen  219 CSKCKKGWKLDEE-GCVDV---NECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VDECQFC------------  277 (350)
T ss_pred             hhhhcccceeccc-ccccH---HHHhcCCCCCChhheeecCCCceEecccccccCC-----hHHhhhh------------
Confidence            6 79999998755 79888   8896  4789999999999999999999999763     1133321            


Q ss_pred             cccCCCCCCCCCCCCeeeeCCCceeeeCCCCCc
Q psy15668        166 KCANPCASGACAPTAQCEVRNHRAVCSCPVGYL  198 (365)
Q Consensus       166 ~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~  198 (365)
                        .+.|.    ..+..|.++.++|+|.|..|+.
T Consensus       278 --~d~~~----~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  278 --ADVCA----SKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             --hhhcc----cCCCCcccCCccEEEEecccce
Confidence              12222    2456788999999999988874


No 13 
>KOG0994|consensus
Probab=98.74  E-value=2.2e-07  Score=93.77  Aligned_cols=231  Identities=25%  Similarity=0.567  Sum_probs=121.8

Q ss_pred             ee-eCCCCCccCCCCCcccCC---CCCCCCCCCCCCCC---eeeeCCCCcee-eCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy15668         88 VC-SCPPGYTGDPLTQCRRFD---PQELCDRSPCGVNT---RCEVINMVPTC-SCLPGYTGSPLSGCRHECDSDYDCGPS  159 (365)
Q Consensus        88 ~C-~C~~G~~g~~~~~C~~~~---~~~~C~~~~C~~~~---~C~~~~~~~~C-~C~~G~~g~~~~~~~~~C~~~~~C~~~  159 (365)
                      .| .|.+||+|.+  .|....   -.+.|.+.    .+   .|.+...+++| .|..||.|++.......|..       
T Consensus       842 qCnqCqpG~WgFP--eCr~CqCNgHA~~Cd~~----tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrP-------  908 (1758)
T KOG0994|consen  842 QCNQCQPGYWGFP--ECRPCQCNGHADTCDPI----TGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRP-------  908 (1758)
T ss_pred             hccccCCCccCCC--cCccccccCcccccCcc----ccccccccccccccchhhhhccccCCcccCCCCCCCC-------
Confidence            45 6888888875  343210   00222221    12   34556677889 89999999865332222221       


Q ss_pred             CcccCCcccCCCCCCCC---CCCCeee--eCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCC---
Q psy15668        160 QSCVNYKCANPCASGAC---APTAQCE--VRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPC---  231 (365)
Q Consensus       160 ~~C~~~~c~~~C~~~~C---~~~~~C~--~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c---  231 (365)
                               =+|...|-   .....|.  +......|.|.+||.|.+++.+.+.-...+.   ....|+.-.|.+-=   
T Consensus       909 ---------CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~---~GGtCq~CeC~~NiD~~  976 (1758)
T KOG0994|consen  909 ---------CPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPS---EGGTCQKCECSNNIDLY  976 (1758)
T ss_pred             ---------CCCCCCCccchhccccccccccccceeeecccCccccchhhhcccccCCcc---cCCccccccccCCcCcc
Confidence                     01111110   0111242  2334568999999999987654432111100   01112111111100   


Q ss_pred             -CCCCCC-Cce---eeecCCCcee-eCCCCCccCC-CCccccCCC----CCcccccccccceeeecCCccceeeeeecCC
Q psy15668        232 -AGQCGI-NAK---CEVRGATPIC-SCPRDMTGDP-FVRCRPFDK----YVAPLINDYLKIYWRYQNNKTIFYVSLVSLN  300 (365)
Q Consensus       232 -~~~C~~-~~~---C~~~~~~~~C-~C~~G~~g~~-~~~C~~~~~----~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~  300 (365)
                       .+.|.. .|.   |+....+-+| .|++||.|+. ...|..-..    ....+..+...+.+.|.++.-+..|.+|..+
T Consensus       977 d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CDqCA~N 1056 (1758)
T KOG0994|consen  977 DPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCDQCAEN 1056 (1758)
T ss_pred             CCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhheccccccCCccccccccCcCCCCcccccccccccccc
Confidence             011221 122   3333334466 7999999986 233443111    2334666777778999999999999999999


Q ss_pred             CCCCCCCCCCCCCCCCCCC--CCeecCCCCCCCCCCceeeCCCCCccCCCCCCc
Q psy15668        301 YPYVTPLPDDLCEPNPCGE--NAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCR  352 (365)
Q Consensus       301 ~~~~~~~~~d~C~~~~C~~--~~~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~  352 (365)
                      +|...-  ...|++-.|..  +-+|..       -..+|+|+|||-|..+++|.
T Consensus      1057 ~w~laS--G~GCe~C~Cd~~~~pqCN~-------ftGQCqCkpGfGGR~C~qCq 1101 (1758)
T KOG0994|consen 1057 HWNLAS--GEGCEPCNCDPIGGPQCNE-------FTGQCQCKPGFGGRTCSQCQ 1101 (1758)
T ss_pred             hhcccc--CCCCCccCCCccCCccccc-------cccceeccCCCCCcchhHHH
Confidence            985320  12333333333  224543       22489999999998877665


No 14 
>KOG4260|consensus
Probab=98.46  E-value=3.1e-07  Score=79.12  Aligned_cols=135  Identities=24%  Similarity=0.501  Sum_probs=84.8

Q ss_pred             CCCCCeeee---cCCCceeeCCCCCccCCCCCcccCCC------CCC----CCC--CCCCCCCeeeeCCCCcee-eCCCC
Q psy15668         74 CGVNSECNV---RNHIPVCSCPPGYTGDPLTQCRRFDP------QEL----CDR--SPCGVNTRCEVINMVPTC-SCLPG  137 (365)
Q Consensus        74 C~~~~~C~~---~~g~~~C~C~~G~~g~~~~~C~~~~~------~~~----C~~--~~C~~~~~C~~~~~~~~C-~C~~G  137 (365)
                      |..++.|.-   ..|+..|.|.+||+|.   .|.....      .++    |..  .+|.  +.|.. .+.-.| .|..|
T Consensus       152 C~GnG~C~GdGsR~GsGkCkC~~GY~Gp---~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~Csg-~~~k~C~kCkkG  225 (350)
T KOG4260|consen  152 CFGNGSCHGDGSREGSGKCKCETGYTGP---LCRYCGIEYFESSRNEQHLVCTACHEGCL--GVCSG-ESSKGCSKCKKG  225 (350)
T ss_pred             cCCCCcccCCCCCCCCCcccccCCCCCc---cccccchHHHHhhcccccchhhhhhhhhh--cccCC-CCCCChhhhccc
Confidence            444555542   4577899999999998   6653210      000    100  1121  23332 223345 78888


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCC--CCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCCCCC
Q psy15668        138 YTGSPLSGCRHECDSDYDCGPSQSCVNYKCANPCA--SGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDC  215 (365)
Q Consensus       138 ~~g~~~~~~~~~C~~~~~C~~~~~C~~~~c~~~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C  215 (365)
                      |.....     .|.+               ||+|.  +.||..+..|+|+.|+|.|..++||.+. .          ++|
T Consensus       226 W~lde~-----gCvD---------------vnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-~----------d~C  274 (350)
T KOG4260|consen  226 WKLDEE-----GCVD---------------VNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-V----------DEC  274 (350)
T ss_pred             ceeccc-----cccc---------------HHHHhcCCCCCChhheeecCCCceEecccccccCC-h----------HHh
Confidence            876521     3333               66775  4568888899999999999999999763 1          155


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCc
Q psy15668        216 PTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMT  257 (365)
Q Consensus       216 ~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~  257 (365)
                      +.-.+.|.            ..+..|.|+.+.|+|+|..|+.
T Consensus       275 ~~~~d~~~------------~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  275 QFCADVCA------------SKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             hhhhhhcc------------cCCCCcccCCccEEEEecccce
Confidence            54222232            2367899999999999998874


No 15 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.38  E-value=1.7e-07  Score=58.86  Aligned_cols=34  Identities=32%  Similarity=0.579  Sum_probs=30.3

Q ss_pred             CCCCCCC--CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668          6 GGDPCSP--NPCGSNTQCNVASNRPVCSCLPGHWGN   39 (365)
Q Consensus         6 did~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g~   39 (365)
                      |||||+.  ++|..+++|+|+.|+|+|.|++||...
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence            7999974  569889999999999999999999843


No 16 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.27  E-value=3.9e-07  Score=53.43  Aligned_cols=30  Identities=37%  Similarity=0.825  Sum_probs=27.9

Q ss_pred             CCCCCCCCCCeeeecC-CCceeeCCCCCccC
Q psy15668         10 CSPNPCGSNTQCNVAS-NRPVCSCLPGHWGN   39 (365)
Q Consensus        10 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~   39 (365)
                      |.++||.++|+|++.. ++|+|.|++||+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            6678999999999999 99999999999986


No 17 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.19  E-value=2.4e-06  Score=52.63  Aligned_cols=34  Identities=32%  Similarity=0.716  Sum_probs=31.2

Q ss_pred             CCCCCCC-CCCCCCCeeeecCCCceeeCCCCCc-cC
Q psy15668          6 GGDPCSP-NPCGSNTQCNVASNRPVCSCLPGHW-GN   39 (365)
Q Consensus         6 did~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~-g~   39 (365)
                      |+|+|.. ++|.++++|+++.++|.|.|++||. |.
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~   36 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR   36 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence            5899987 8999999999999999999999999 66


No 18 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=98.12  E-value=3.2e-07  Score=75.02  Aligned_cols=141  Identities=25%  Similarity=0.613  Sum_probs=86.7

Q ss_pred             CCeeeecCCCceeeCCCCCccCCCCCcccCCCCCCCCC-----CCCCCCCeeeeCC-----CCceeeCCCCCCCCCCCCC
Q psy15668         77 NSECNVRNHIPVCSCPPGYTGDPLTQCRRFDPQELCDR-----SPCGVNTRCEVIN-----MVPTCSCLPGYTGSPLSGC  146 (365)
Q Consensus        77 ~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~C~~-----~~C~~~~~C~~~~-----~~~~C~C~~G~~g~~~~~~  146 (365)
                      +|..+...+.|.|.|.+||......+|+..   .+|..     .+|+..+.|++..     ..|.|.|.+||....    
T Consensus        10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~k---v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~----   82 (197)
T PF06247_consen   10 NGYLIQMSNHFECKCNEGFVLKNENTCEEK---VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQ----   82 (197)
T ss_dssp             TEEEEEESSEEEEEESTTEEEEETTEEEE-------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS----
T ss_pred             CCEEEEccCceEEEcCCCcEEccccccccc---eecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeC----
Confidence            466677788899999999987555478877   45643     6799999998775     479999999998662    


Q ss_pred             CCCCCCCCCCCCCCcccCCcccCCCCCCCCCCCCeeeeC---CCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCC
Q psy15668        147 RHECDSDYDCGPSQSCVNYKCANPCASGACAPTAQCEVR---NHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCL  223 (365)
Q Consensus       147 ~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~---~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~  223 (365)
                                   ..|+    .+.|....|+ .|.|+..   .....|+|.-|+..+.          ...|....    
T Consensus        83 -------------~vCv----p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d----------n~kCtk~G----  130 (197)
T PF06247_consen   83 -------------GVCV----PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD----------NKKCTKTG----  130 (197)
T ss_dssp             -------------SSEE----EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTT----------TTESEEEE----
T ss_pred             -------------CeEc----hhhcCceecC-CCeEEecCCCCCCceeEeeeceEecc----------CCcccCCC----
Confidence                         1222    2345545676 5789732   3345999999987321          11232211    


Q ss_pred             CCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCC
Q psy15668        224 GNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDP  260 (365)
Q Consensus       224 ~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~  260 (365)
                          ..+|...|..+..|..+.+-|+|.+.+||.+++
T Consensus       131 ----~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  131 ----ETKCSLKCKENEECKLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             ------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred             ----ccceeeecCCCcceeeeCcEEEeecCCCCCCCC
Confidence                123345677789999999999999999998764


No 19 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.92  E-value=3.5e-06  Score=52.87  Aligned_cols=32  Identities=34%  Similarity=0.770  Sum_probs=28.4

Q ss_pred             CCCCCCC--CCCCCCCeecCCCCCCCCCCceeeCCCCCc
Q psy15668        308 PDDLCEP--NPCGENAKCQPGYDKSGKDRPVCTCLPGYV  344 (365)
Q Consensus       308 ~~d~C~~--~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~  344 (365)
                      |||||..  +.|..++.|+|     +.|+|+|.|++||+
T Consensus         1 DidEC~~~~~~C~~~~~C~N-----~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVN-----TEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEE-----ETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEc-----CCCCEEeeCCCCcE
Confidence            4899974  57988999999     77999999999998


No 20 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.89  E-value=3.6e-06  Score=49.33  Aligned_cols=32  Identities=34%  Similarity=0.886  Sum_probs=26.5

Q ss_pred             CCCCCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668        312 CEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA  347 (365)
Q Consensus       312 C~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~  347 (365)
                      |.+++|.++|+|++    ...++|+|+|++||+|..
T Consensus         1 C~~~~C~n~g~C~~----~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCID----LPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEE----ESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEe----CCCCCEEeECCCCCccCC
Confidence            45679999999998    223889999999999963


No 21 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.85  E-value=2.5e-05  Score=47.56  Aligned_cols=34  Identities=35%  Similarity=0.735  Sum_probs=31.0

Q ss_pred             CCCCCCC-CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668          6 GGDPCSP-NPCGSNTQCNVASNRPVCSCLPGHWGN   39 (365)
Q Consensus         6 did~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~   39 (365)
                      ++++|.. .+|.++++|++..++|.|.|++||.|.
T Consensus         1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~   35 (38)
T cd00054           1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR   35 (38)
T ss_pred             CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence            4788987 799988999999999999999999986


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.80  E-value=1.3e-05  Score=48.15  Aligned_cols=29  Identities=28%  Similarity=0.700  Sum_probs=23.6

Q ss_pred             CCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy15668         13 NPCGSNTQCNVASNRPVCSCLPGHWGNPL   41 (365)
Q Consensus        13 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~   41 (365)
                      ..|+.+++|+++.++|+|.|++||.|+++
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~   34 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF   34 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence            45999999999999999999999999963


No 23 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.67  E-value=6.1e-05  Score=46.19  Aligned_cols=34  Identities=35%  Similarity=0.818  Sum_probs=29.2

Q ss_pred             CCCCCC-CCCCCCCeecCCCCCCCCCCceeeCCCCCc-cCC
Q psy15668        309 DDLCEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYV-GDA  347 (365)
Q Consensus       309 ~d~C~~-~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~-g~~  347 (365)
                      +|+|.. ++|.++++|++     ..++|+|.|++||+ |..
T Consensus         2 ~~~C~~~~~C~~~~~C~~-----~~g~~~C~C~~g~~~g~~   37 (39)
T smart00179        2 IDECASGNPCQNGGTCVN-----TVGSYRCECPPGYTDGRN   37 (39)
T ss_pred             cccCcCCCCcCCCCEeEC-----CCCCeEeECCCCCccCCc
Confidence            688987 78999999998     66889999999998 543


No 24 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.61  E-value=3.2e-05  Score=46.42  Aligned_cols=29  Identities=38%  Similarity=0.836  Sum_probs=23.7

Q ss_pred             CCCCCCceeeecCCCceeeCCCCCccCCC
Q psy15668        233 GQCGINAKCEVRGATPICSCPRDMTGDPF  261 (365)
Q Consensus       233 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~  261 (365)
                      +.|+.+++|+++.++|.|+|++||+|++.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~   34 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF   34 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence            45888999999999999999999999975


No 25 
>KOG1836|consensus
Probab=97.60  E-value=0.004  Score=68.49  Aligned_cols=242  Identities=23%  Similarity=0.475  Sum_probs=120.8

Q ss_pred             eeecCCCcee-eCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCC--CCceee-CCCCCCCCCCCCCCCCCCCCC-
Q psy15668         80 CNVRNHIPVC-SCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVIN--MVPTCS-CLPGYTGSPLSGCRHECDSDY-  154 (365)
Q Consensus        80 C~~~~g~~~C-~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~--~~~~C~-C~~G~~g~~~~~~~~~C~~~~-  154 (365)
                      |+....+-.| +|..||.|....  ...   ..|.+-+|...+.|..+.  ....|. |++||+|.       .|+.-+ 
T Consensus       749 C~~~t~G~~C~~C~~GfYg~~~~--~~~---~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~-------rCe~c~d  816 (1705)
T KOG1836|consen  749 CKHNTFGGQCAQCVDGFYGLPDL--GTS---GDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL-------RCEECAD  816 (1705)
T ss_pred             cccCCCCCchhhhcCCCCCcccc--CCC---CCCccCCCCCChhhcCcCcccceecCCCCCCCccc-------ccccCCC
Confidence            4433444466 799999987431  111   227777888877777654  456787 99999998       444211 


Q ss_pred             -----CC---CCCCcccCCcc---cCCCCCCCCCC-CCee---eeCCCceee-eCCCCCccCCCCccCccccCCCCCCCC
Q psy15668        155 -----DC---GPSQSCVNYKC---ANPCASGACAP-TAQC---EVRNHRAVC-SCPVGYLGDPYTSCRAECLAHSDCPTD  218 (365)
Q Consensus       155 -----~C---~~~~~C~~~~c---~~~C~~~~C~~-~~~C---~~~~g~~~C-~C~~G~~g~~~~~~~~~C~~~~~C~~~  218 (365)
                           +=   .+...|..-.|   +|+=....|.. .+.|   +....+..| .|.+||.|+...-..            
T Consensus       817 gyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p------------  884 (1705)
T KOG1836|consen  817 GYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNP------------  884 (1705)
T ss_pred             ccccCCCCCCCCcccCccceeccccCccccccccccccceeeccCCcccccccccccCccccccCCCc------------
Confidence                 00   00122221111   22222223332 2233   322333445 799999988653110            


Q ss_pred             CCCCCCCCCCCCC----CCCCCC-Ccee--eecCCCcee-eCCCCCccCC-CCccccCCC---CCcccccccccceeeec
Q psy15668        219 RPSCLGNKCMNPC----AGQCGI-NAKC--EVRGATPIC-SCPRDMTGDP-FVRCRPFDK---YVAPLINDYLKIYWRYQ  286 (365)
Q Consensus       219 ~~~C~~~~C~~~c----~~~C~~-~~~C--~~~~~~~~C-~C~~G~~g~~-~~~C~~~~~---~~~~~~~~~~~~~~~c~  286 (365)
                      .+.|..--|...=    ...|.+ -|.|  .....+-.| .|.+||.+.. ...|+....   ............++.|.
T Consensus       885 ~~~c~~c~c~p~gs~~~~~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~~tGqc~c~  964 (1705)
T KOG1836|consen  885 EDKCFACGCVPAGSELPSLTCNPVTGQCECKPNVEGRDCLYCFKGFFNLNSGVGCEPCNCDPTGSESSDCDVGTGQCYCR  964 (1705)
T ss_pred             CCccccccCccCCcccccccCCCcccceeccCCCCccccccccccccccCCCCCcccccccccccccccccccCCceeee
Confidence            1111111110000    000111 1111  111111222 4555555442 112332111   00111222234467789


Q ss_pred             CCccceeeeeecCCCCCCCCCCCCCCCCCCCCCCC----eecCCCCCCCCCCceeeCCCCCccCCCCCCcCCC
Q psy15668        287 NNKTIFYVSLVSLNYPYVTPLPDDLCEPNPCGENA----KCQPGYDKSGKDRPVCTCLPGYVGDALTYCRRGE  355 (365)
Q Consensus       287 ~~~~~~~~~~c~~~~~~~~~~~~d~C~~~~C~~~~----~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~~~  355 (365)
                      ++.++..+.+|..+++++.-   ..|..--|...+    .|..       ...+|.|+++|.|...++|.++.
T Consensus       965 ~gVtgqrc~qc~~~~~~~~~---~gc~~c~c~~~Gs~~~qc~~-------~~G~c~c~~~~~g~~c~~c~~~~ 1027 (1705)
T KOG1836|consen  965 PGVTGQRCDQCETYHFGFQT---EGCGLCECDPLGSRGFQCDP-------EDGQCPCRPGFEGRRCDQCEEGF 1027 (1705)
T ss_pred             cCccccccCccccCcccccc---cCCcceecccCCcccceecc-------cCCeeeecCCCCCcccccccCCc
Confidence            99999999999999998763   333322244433    5765       23489999999999999998764


No 26 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.35  E-value=0.00028  Score=42.04  Aligned_cols=30  Identities=33%  Similarity=0.852  Sum_probs=27.0

Q ss_pred             CC-CCCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668         10 CS-PNPCGSNTQCNVASNRPVCSCLPGHWGN   39 (365)
Q Consensus        10 C~-~~~C~~~~~C~~~~~~~~C~C~~G~~g~   39 (365)
                      |. ..+|.++++|+++.++|+|.|++||.|.
T Consensus         2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            55 6789989999999999999999999887


No 27 
>KOG1226|consensus
Probab=97.31  E-value=0.0041  Score=61.87  Aligned_cols=143  Identities=29%  Similarity=0.656  Sum_probs=82.4

Q ss_pred             eeeCCCCCCCCCCCCCCCCCCCCCCCCCC----CcccCCcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCc----cCCC
Q psy15668        131 TCSCLPGYTGSPLSGCRHECDSDYDCGPS----QSCVNYKCANPCASGACAPTAQCEVRNHRAVCSCPVGYL----GDPY  202 (365)
Q Consensus       131 ~C~C~~G~~g~~~~~~~~~C~~~~~C~~~----~~C~~~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~----g~~~  202 (365)
                      .|.|.+||.|.       .|+-.......    ..|..     .-.+.+|...|.|.=.    .|+|.+...    |.  
T Consensus       479 ~C~C~~G~~G~-------~CEC~~~~~ss~~~~~~Cr~-----~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~--  540 (783)
T KOG1226|consen  479 QCRCDEGWLGK-------KCECSTDELSSSEEEDKCRE-----NSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK--  540 (783)
T ss_pred             ceecCCCCCCC-------cccCCccccCcHhHHhhccC-----CCCCCCcCCCCcEeCC----ceEecCCCCCceeee--
Confidence            57999999999       55532211111    11211     1112368888877532    378877765    44  


Q ss_pred             CccCccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeeecCCCceeeCCCCCccCCCCccccCCCCCcccccccccce
Q psy15668        203 TSCRAECLAHSDCPTDRPSCLGNKCMNPCAGQCGINAKCEVRGATPICSCPRDMTGDPFVRCRPFDKYVAPLINDYLKIY  282 (365)
Q Consensus       203 ~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~~~  282 (365)
                                 .|+-+.-.|.+..     ...|+.+++|.-    -+|+|.+||+|+   .|+--               
T Consensus       541 -----------fCECDnfsC~r~~-----g~lC~g~G~C~C----G~CvC~~GwtG~---~C~C~---------------  582 (783)
T KOG1226|consen  541 -----------FCECDNFSCERHK-----GVLCGGHGRCEC----GRCVCNPGWTGS---ACNCP---------------  582 (783)
T ss_pred             -----------eeeccCccccccc-----CcccCCCCeEeC----CcEEcCCCCccC---CCCCC---------------
Confidence                       2322222222111     134777888853    379999999999   66520               


Q ss_pred             eeecCCccceeeeeecCCCCCCCCCCCCCCCC---CCCCCCCeecCCCCCCCCCCceeeCCCC-CccCCCCCCcC--CCC
Q psy15668        283 WRYQNNKTIFYVSLVSLNYPYVTPLPDDLCEP---NPCGENAKCQPGYDKSGKDRPVCTCLPG-YVGDALTYCRR--GEC  356 (365)
Q Consensus       283 ~~c~~~~~~~~~~~c~~~~~~~~~~~~d~C~~---~~C~~~~~C~~~~~~~~~~~~~C~C~~G-~~g~~~~~C~~--~~C  356 (365)
                                              .+.+.|.+   ..|...|+|.=         .+|+|... |.|..++.|..  ++|
T Consensus       583 ------------------------~std~C~~~~G~iCSGrG~C~C---------g~C~C~~~~~sG~~CE~cptc~~~C  629 (783)
T KOG1226|consen  583 ------------------------LSTDTCESSDGQICSGRGTCEC---------GRCKCTDPPYSGEFCEKCPTCPDPC  629 (783)
T ss_pred             ------------------------CCCccccCCCCceeCCCceeeC---------CceEcCCCCcCcchhhcCCCCCCcc
Confidence                                    11466653   24777777775         37888765 89888666663  356


Q ss_pred             CCCCcc
Q psy15668        357 QSDAEC  362 (365)
Q Consensus       357 ~~~~~C  362 (365)
                      .....|
T Consensus       630 ~~~~~C  635 (783)
T KOG1226|consen  630 AENKSC  635 (783)
T ss_pred             cccccc
Confidence            655554


No 28 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.30  E-value=0.00037  Score=41.60  Aligned_cols=30  Identities=33%  Similarity=0.872  Sum_probs=26.2

Q ss_pred             CCCC-CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668          9 PCSP-NPCGSNTQCNVASNRPVCSCLPGHWGN   39 (365)
Q Consensus         9 ~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~   39 (365)
                      +|.. .+|.++ +|+++.++|+|.|++||.|.
T Consensus         1 ~C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence            3666 689888 99999999999999999983


No 29 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.22  E-value=5e-05  Score=62.40  Aligned_cols=148  Identities=24%  Similarity=0.561  Sum_probs=87.0

Q ss_pred             CCCCCCeeeecCCCceeeCCCCCccCCCCCccCCCccCCCCCCCCCcccCCccccccccC-CCCCCeeeecC-----CCc
Q psy15668         14 PCGSNTQCNVASNRPVCSCLPGHWGNPLTYCQRGECQDHSDCSHSKACKEYRCVDVCAGQ-CGVNSECNVRN-----HIP   87 (365)
Q Consensus        14 ~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~~C~~~~~C~~~~~C~~~~C~~~C~~~-C~~~~~C~~~~-----g~~   87 (365)
                      .|. +|.-+...+.|.|.|.+||....     +++|+...+|.....          ... |...++|++..     ..|
T Consensus         7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~-----EntCE~kv~C~~~e~----------~~K~Cgdya~C~~~~~~~~~~~~   70 (197)
T PF06247_consen    7 ICK-NGYLIQMSNHFECKCNEGFVLKN-----ENTCEEKVECDKLEN----------VNKPCGDYAKCINQANKGEERAY   70 (197)
T ss_dssp             --B-TEEEEEESSEEEEEESTTEEEEE-----TTEEEE----SG-GG----------TTSEEETTEEEEE-SSTTSSTSE
T ss_pred             ccc-CCEEEEccCceEEEcCCCcEEcc-----ccccccceecCcccc----------cCccccchhhhhcCCCcccceeE
Confidence            465 46888888999999999998762     344544555543100          112 77788898754     569


Q ss_pred             eeeCCCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCC---CCceeeCCCCCCCCCCCCCCCCCCCCCCCCCCCcccC
Q psy15668         88 VCSCPPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVIN---MVPTCSCLPGYTGSPLSGCRHECDSDYDCGPSQSCVN  164 (365)
Q Consensus        88 ~C~C~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~---~~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~C~~  164 (365)
                      .|.|.+||+.... .|..    +.|....|+ .+.|+..+   ....|+|.-|+....    ...|....          
T Consensus        71 ~C~C~~gY~~~~~-vCvp----~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d----n~kCtk~G----------  130 (197)
T PF06247_consen   71 KCDCINGYILKQG-VCVP----NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD----NKKCTKTG----------  130 (197)
T ss_dssp             EEEE-TTEEESSS-SEEE----GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTT----TTESEEEE----------
T ss_pred             EEecccCceeeCC-eEch----hhcCceecC-CCeEEecCCCCCCceeEeeeceEecc----CCcccCCC----------
Confidence            9999999987644 5765    457777788 58897543   345899999987221    11232210          


Q ss_pred             CcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCC
Q psy15668        165 YKCANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDP  201 (365)
Q Consensus       165 ~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~  201 (365)
                         ..+|+ -.|..+..|....+-|.|.+..||.+++
T Consensus       131 ---~T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  131 ---ETKCS-LKCKENEECKLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             ------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred             ---cccee-eecCCCcceeeeCcEEEeecCCCCCCCC
Confidence               11222 2366678999999999999999998774


No 30 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.20  E-value=0.00053  Score=41.44  Aligned_cols=34  Identities=35%  Similarity=0.834  Sum_probs=28.9

Q ss_pred             CCCCCC-CCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668        309 DDLCEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA  347 (365)
Q Consensus       309 ~d~C~~-~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~  347 (365)
                      +++|.. .+|.+++.|++     ..++|+|.|++||.|..
T Consensus         2 ~~~C~~~~~C~~~~~C~~-----~~~~~~C~C~~g~~g~~   36 (38)
T cd00054           2 IDECASGNPCQNGGTCVN-----TVGSYRCSCPPGYTGRN   36 (38)
T ss_pred             cccCCCCCCcCCCCEeEC-----CCCCeEeECCCCCcCCc
Confidence            678876 78998899998     56889999999999854


No 31 
>KOG1226|consensus
Probab=97.16  E-value=0.0052  Score=61.17  Aligned_cols=128  Identities=24%  Similarity=0.691  Sum_probs=77.3

Q ss_pred             eeeCCCCCccCCCCCcccCCC-------CCCCCC----CCCCCCCeeeeCCCCceeeCCCCCC----CCCCCCCCCCCCC
Q psy15668         88 VCSCPPGYTGDPLTQCRRFDP-------QELCDR----SPCGVNTRCEVINMVPTCSCLPGYT----GSPLSGCRHECDS  152 (365)
Q Consensus        88 ~C~C~~G~~g~~~~~C~~~~~-------~~~C~~----~~C~~~~~C~~~~~~~~C~C~~G~~----g~~~~~~~~~C~~  152 (365)
                      .|.|.+||.|+   .|+-..+       .+.|..    .+|...+.|.=.    .|.|.+...    |.       .|+-
T Consensus       479 ~C~C~~G~~G~---~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~-------fCEC  544 (783)
T KOG1226|consen  479 QCRCDEGWLGK---KCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK-------FCEC  544 (783)
T ss_pred             ceecCCCCCCC---cccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeee-------eeec
Confidence            57999999998   7763211       133432    267777777544    678877665    55       5553


Q ss_pred             CCCCCCCCcccCCcccCCCCCCCCCCCCeeeeCCCceeeeCCCCCccCCCCccCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy15668        153 DYDCGPSQSCVNYKCANPCASGACAPTAQCEVRNHRAVCSCPVGYLGDPYTSCRAECLAHSDCPTDRPSCLGNKCMNPCA  232 (365)
Q Consensus       153 ~~~C~~~~~C~~~~c~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~~~~~C~~~~~C~~~~~~C~~~~C~~~c~  232 (365)
                      ++          |+|... ....|..++.|.=.    +|.|.+||+|..+           .|....+.|.+..     .
T Consensus       545 Dn----------fsC~r~-~g~lC~g~G~C~CG----~CvC~~GwtG~~C-----------~C~~std~C~~~~-----G  593 (783)
T KOG1226|consen  545 DN----------FSCERH-KGVLCGGHGRCECG----RCVCNPGWTGSAC-----------NCPLSTDTCESSD-----G  593 (783)
T ss_pred             cC----------cccccc-cCcccCCCCeEeCC----cEEcCCCCccCCC-----------CCCCCCccccCCC-----C
Confidence            31          111110 11247777887532    4999999999965           3555555554321     1


Q ss_pred             CCCCCCceeeecCCCceeeCCCC-CccCCCCccccC
Q psy15668        233 GQCGINAKCEVRGATPICSCPRD-MTGDPFVRCRPF  267 (365)
Q Consensus       233 ~~C~~~~~C~~~~~~~~C~C~~G-~~g~~~~~C~~~  267 (365)
                      ..|+..|+|.=    -+|+|... |.|.   .|+.-
T Consensus       594 ~iCSGrG~C~C----g~C~C~~~~~sG~---~CE~c  622 (783)
T KOG1226|consen  594 QICSGRGTCEC----GRCKCTDPPYSGE---FCEKC  622 (783)
T ss_pred             ceeCCCceeeC----CceEcCCCCcCcc---hhhcC
Confidence            34666777754    25888777 8898   77753


No 32 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.15  E-value=0.00044  Score=37.16  Aligned_cols=24  Identities=33%  Similarity=0.708  Sum_probs=18.0

Q ss_pred             CceeeCCCCCccCCCCCccCCCccCCCC
Q psy15668         27 RPVCSCLPGHWGNPLTYCQRGECQDHSD   54 (365)
Q Consensus        27 ~~~C~C~~G~~g~~~~~C~~~~C~~~~~   54 (365)
                      +|+|+|++||...+    +...|+|++|
T Consensus         1 sy~C~C~~Gy~l~~----d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSP----DGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCC----CCCccccCCC
Confidence            69999999999763    2345777764


No 33 
>KOG1836|consensus
Probab=97.06  E-value=0.014  Score=64.36  Aligned_cols=174  Identities=22%  Similarity=0.466  Sum_probs=94.2

Q ss_pred             CCCCCCCCCCeeeeC--CCceeee-CCCCCccCCCCccCccccCCCCCC-CCCCCCCCCCCCC---CCC-CCCCC-Ccee
Q psy15668        171 CASGACAPTAQCEVR--NHRAVCS-CPVGYLGDPYTSCRAECLAHSDCP-TDRPSCLGNKCMN---PCA-GQCGI-NAKC  241 (365)
Q Consensus       171 C~~~~C~~~~~C~~~--~g~~~C~-C~~G~~g~~~~~~~~~C~~~~~C~-~~~~~C~~~~C~~---~c~-~~C~~-~~~C  241 (365)
                      |.+-+|...+.|...  .....|. |++||+|..++.+.+.......=. .+...|..-+|..   +=. ..|.. .+.|
T Consensus       777 C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c  856 (1705)
T KOG1836|consen  777 CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGEC  856 (1705)
T ss_pred             CccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCccccccccccccce
Confidence            555667777777533  4567887 999999998775544332211100 1122344433311   000 11221 2233


Q ss_pred             ---eecCCCcee-eCCCCCccCCCC-----ccccC-----CCCCcccccccccceeeecCCccceeeeeecCCCCCCCCC
Q psy15668        242 ---EVRGATPIC-SCPRDMTGDPFV-----RCRPF-----DKYVAPLINDYLKIYWRYQNNKTIFYVSLVSLNYPYVTPL  307 (365)
Q Consensus       242 ---~~~~~~~~C-~C~~G~~g~~~~-----~C~~~-----~~~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~~~~~~~~  307 (365)
                         +.....++| .|.+||.|+...     .|...     ........-....+.+.|.+...+..+..|..||++..  
T Consensus       857 ~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~--  934 (1705)
T KOG1836|consen  857 LKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTGQCECKPNVEGRDCLYCFKGFFNLN--  934 (1705)
T ss_pred             eeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCCcccceeccCCCCccccccccccccccC--
Confidence               333334555 799999888532     12211     10100111122234677788888888989999998843  


Q ss_pred             CCCCCCCCCCCCC----CeecCCCCCCCCCCceeeCCCCCccCCCCCCcC
Q psy15668        308 PDDLCEPNPCGEN----AKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRR  353 (365)
Q Consensus       308 ~~d~C~~~~C~~~----~~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~  353 (365)
                      .-..|+.-+|..-    ..|+.       ++.+|.|.+|-+|...+.|..
T Consensus       935 s~~gC~~c~c~~~gs~~~~c~~-------~tGqc~c~~gVtgqrc~qc~~  977 (1705)
T KOG1836|consen  935 SGVGCEPCNCDPTGSESSDCDV-------GTGQCYCRPGVTGQRCDQCET  977 (1705)
T ss_pred             CCCCcccccccccccccccccc-------cCCceeeecCccccccCcccc
Confidence            1125555445432    25654       446899999999888777764


No 34 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.65  E-value=0.0027  Score=37.60  Aligned_cols=28  Identities=39%  Similarity=0.973  Sum_probs=24.2

Q ss_pred             CCCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668        314 PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD  346 (365)
Q Consensus       314 ~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~  346 (365)
                      ..+|.+++.|++     ..++|+|.|+.||.|.
T Consensus         5 ~~~C~~~~~C~~-----~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVN-----TPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEec-----CCCCeEeECCCCCccc
Confidence            567888899998     5588999999999987


No 35 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.36  E-value=0.0033  Score=33.80  Aligned_cols=23  Identities=30%  Similarity=0.638  Sum_probs=17.7

Q ss_pred             CceeeCCCCCccCC-CCccccCCC
Q psy15668        247 TPICSCPRDMTGDP-FVRCRPFDK  269 (365)
Q Consensus       247 ~~~C~C~~G~~g~~-~~~C~~~~~  269 (365)
                      +|+|.|++||+... ...|++|++
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            58999999998654 457888753


No 36 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.35  E-value=0.0047  Score=36.64  Aligned_cols=29  Identities=38%  Similarity=1.062  Sum_probs=23.9

Q ss_pred             CCC-CCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668        312 CEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD  346 (365)
Q Consensus       312 C~~-~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~  346 (365)
                      |.. ++|.++ +|++     ..++|+|.|++||.|.
T Consensus         2 C~~~~~C~~~-~C~~-----~~~~~~C~C~~g~~g~   31 (35)
T smart00181        2 CASGGPCSNG-TCIN-----TPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCcCCCCCC-EEEC-----CCCCeEeECCCCCccC
Confidence            444 578888 9998     5689999999999984


No 37 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.16  E-value=0.0082  Score=34.91  Aligned_cols=25  Identities=28%  Similarity=0.553  Sum_probs=21.8

Q ss_pred             CCCCCCCeeeecCCCceeeCCCCCccC
Q psy15668         13 NPCGSNTQCNVASNRPVCSCLPGHWGN   39 (365)
Q Consensus        13 ~~C~~~~~C~~~~~~~~C~C~~G~~g~   39 (365)
                      ..|+++|+|+..  ..+|.|.+||+|.
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~   30 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGP   30 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCC
Confidence            469999999976  4599999999998


No 38 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.14  E-value=0.0039  Score=37.34  Aligned_cols=23  Identities=30%  Similarity=0.576  Sum_probs=19.2

Q ss_pred             CCCCCeeeecCCCceeeCCCCCccC
Q psy15668         15 CGSNTQCNVASNRPVCSCLPGHWGN   39 (365)
Q Consensus        15 C~~~~~C~~~~~~~~C~C~~G~~g~   39 (365)
                      |++  +|++++++|+|.|++||.+.
T Consensus         8 C~h--~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen    8 CSH--ICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             SSS--EEEEETTSEEEE-STTEEE-
T ss_pred             cCC--CCccCCCceEeECCCCCEEC
Confidence            655  89999999999999999987


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.84  E-value=0.047  Score=31.74  Aligned_cols=24  Identities=25%  Similarity=0.555  Sum_probs=20.1

Q ss_pred             CCCCCceeeecCCCceeeCCCCCccC
Q psy15668        234 QCGINAKCEVRGATPICSCPRDMTGD  259 (365)
Q Consensus       234 ~C~~~~~C~~~~~~~~C~C~~G~~g~  259 (365)
                      .|..+++|+..  ..+|+|.+||+|.
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G~   30 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTGP   30 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcCC
Confidence            47779999865  4689999999997


No 40 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.49  E-value=0.036  Score=33.17  Aligned_cols=22  Identities=23%  Similarity=0.415  Sum_probs=18.1

Q ss_pred             CceeeecCCCceeeCCCCCccC
Q psy15668        238 NAKCEVRGATPICSCPRDMTGD  259 (365)
Q Consensus       238 ~~~C~~~~~~~~C~C~~G~~g~  259 (365)
                      ...|++++++|+|.|++||+..
T Consensus         9 ~h~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen    9 SHICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             SSEEEEETTSEEEE-STTEEE-
T ss_pred             CCCCccCCCceEeECCCCCEEC
Confidence            3689999999999999999876


No 41 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.23  E-value=0.02  Score=25.99  Aligned_cols=11  Identities=45%  Similarity=1.114  Sum_probs=8.9

Q ss_pred             eeeCCCCCccC
Q psy15668        249 ICSCPRDMTGD  259 (365)
Q Consensus       249 ~C~C~~G~~g~  259 (365)
                      .|+|++||+|.
T Consensus         1 ~C~C~~G~~G~   11 (13)
T PF12661_consen    1 TCQCPPGWTGP   11 (13)
T ss_dssp             EEEE-TTEETT
T ss_pred             CccCcCCCcCC
Confidence            48999999998


No 42 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=91.47  E-value=0.1  Score=31.18  Aligned_cols=30  Identities=27%  Similarity=0.533  Sum_probs=21.7

Q ss_pred             CCCCCCCCCCeeeecC-CCceeeCCCCCccC
Q psy15668         10 CSPNPCGSNTQCNVAS-NRPVCSCLPGHWGN   39 (365)
Q Consensus        10 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~   39 (365)
                      |...+|..|+.|++.. |++.|.|.+||..+
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            5567788999999777 99999999999865


No 43 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=87.35  E-value=0.34  Score=28.92  Aligned_cols=28  Identities=25%  Similarity=0.549  Sum_probs=20.0

Q ss_pred             CCCCCCCCeeeeCC-CceeeeCCCCCccC
Q psy15668        173 SGACAPTAQCEVRN-HRAVCSCPVGYLGD  200 (365)
Q Consensus       173 ~~~C~~~~~C~~~~-g~~~C~C~~G~~g~  200 (365)
                      ...|+.++.|++.. |++.|.|..||..+
T Consensus         4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             SS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             CccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            35678899998766 99999999999765


No 44 
>KOG3512|consensus
Probab=86.45  E-value=3.9  Score=39.05  Aligned_cols=158  Identities=14%  Similarity=0.204  Sum_probs=81.3

Q ss_pred             CeeeeCC-CceeeeCCCCCccCCCCccCccccCCCC---CCCCCCCCCCCCCCC---CCC---------CCCCCCceeee
Q psy15668        180 AQCEVRN-HRAVCSCPVGYLGDPYTSCRAECLAHSD---CPTDRPSCLGNKCMN---PCA---------GQCGINAKCEV  243 (365)
Q Consensus       180 ~~C~~~~-g~~~C~C~~G~~g~~~~~~~~~C~~~~~---C~~~~~~C~~~~C~~---~c~---------~~C~~~~~C~~  243 (365)
                      ..|+... +.+.|.|..+..|..+..+.+.-.+..-   -....++|....|..   -|.         +.+ .+++|+|
T Consensus       285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~-SggvCln  363 (592)
T KOG3512|consen  285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRR-SGGVCLN  363 (592)
T ss_pred             ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCcc-ccceEee
Confidence            3576444 4489999999888866655443322210   001222332222211   000         112 2567764


Q ss_pred             c---CCCcee-eCCCCCccCCCC------ccccCCC---CCcccccccccceeeecCCccceeeeeecCCCCCCC-----
Q psy15668        244 R---GATPIC-SCPRDMTGDPFV------RCRPFDK---YVAPLINDYLKIYWRYQNNKTIFYVSLVSLNYPYVT-----  305 (365)
Q Consensus       244 ~---~~~~~C-~C~~G~~g~~~~------~C~~~~~---~~~~~~~~~~~~~~~c~~~~~~~~~~~c~~~~~~~~-----  305 (365)
                      -   ..+-+| .|++||..++..      .|..-+.   ..+.-..+-..+++.|.++.++..|-+|..||....     
T Consensus       364 CrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tGqCpCkeGvtG~tCnrCa~gyqqsrs~vap  443 (592)
T KOG3512|consen  364 CRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTGQCPCKEGVTGLTCNRCAPGYQQSRSPVAP  443 (592)
T ss_pred             cccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccccCCcccCCCCCcccccccccchhhcccCCCcC
Confidence            3   223445 699999876531      1221111   111111222345788999999999999999998532     


Q ss_pred             CCCCCCCCCCCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668        306 PLPDDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA  347 (365)
Q Consensus       306 ~~~~d~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~  347 (365)
                      |+.++.=.+..++++.+         +..+.+.|+.++.|..
T Consensus       444 cik~p~~~~~~~~s~ve---------~qd~~s~Ck~~~~~~r  476 (592)
T KOG3512|consen  444 CIKIPTDAPTLGSSGVE---------PQDQCSKCKASPGGKR  476 (592)
T ss_pred             ceecCCCCccccCCCCc---------chhccccCCCCCccee
Confidence            33222211222333333         2336677888887766


No 45 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.35  E-value=0.97  Score=39.51  Aligned_cols=38  Identities=24%  Similarity=0.485  Sum_probs=28.9

Q ss_pred             CCCCCCCCCCC--CCCCCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668        303 YVTPLPDDLCE--PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGDA  347 (365)
Q Consensus       303 ~~~~~~~d~C~--~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~  347 (365)
                      +..|.++++|.  .++|..  .|.+     ..|+|.|.|++||+...
T Consensus       181 ~~~C~~~~~C~~~~~~c~~--~C~~-----~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         181 GKICVVPDLCATLSHVCQQ--VCIS-----TPGSYLCACTEGYALLE  220 (224)
T ss_pred             cccCcCchhhcCCCCCccc--eEEc-----CCCCEEeECCCCccCCC
Confidence            34466788895  356764  6997     77999999999998643


No 46 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=81.61  E-value=1.4  Score=38.60  Aligned_cols=21  Identities=14%  Similarity=0.358  Sum_probs=18.9

Q ss_pred             ceeeecCCCceeeCCCCCccC
Q psy15668        239 AKCEVRGATPICSCPRDMTGD  259 (365)
Q Consensus       239 ~~C~~~~~~~~C~C~~G~~g~  259 (365)
                      ..|.++.|+|.|.|++||+..
T Consensus       199 ~~C~~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         199 QVCISTPGSYLCACTEGYALL  219 (224)
T ss_pred             ceEEcCCCCEEeECCCCccCC
Confidence            579999999999999999875


No 47 
>smart00051 DSL delta serrate ligand.
Probab=80.51  E-value=2.6  Score=28.74  Aligned_cols=23  Identities=22%  Similarity=0.413  Sum_probs=16.5

Q ss_pred             CCCCCeecCCCCCCCCCCceeeCCCCCccCC
Q psy15668        317 CGENAKCQPGYDKSGKDRPVCTCLPGYVGDA  347 (365)
Q Consensus       317 C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~~  347 (365)
                      ...+..|..    .    ..++|.+||.|..
T Consensus        40 ~~~~~~Cd~----~----G~~~C~~Gw~G~~   62 (63)
T smart00051       40 FFGHYTCDE----N----GNKGCLEGWMGPY   62 (63)
T ss_pred             ccCCccCCc----C----CCEecCCCCcCCC
Confidence            455667864    1    3689999999875


No 48 
>KOG3516|consensus
Probab=79.58  E-value=1.3  Score=46.82  Aligned_cols=33  Identities=33%  Similarity=0.780  Sum_probs=30.9

Q ss_pred             CCCCCCCCCCCCCeeeecCCCceeeCC-CCCccC
Q psy15668          7 GDPCSPNPCGSNTQCNVASNRPVCSCL-PGHWGN   39 (365)
Q Consensus         7 id~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~   39 (365)
                      +|.|.+|+|.++|.|......|.|.|. .||.|.
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga  578 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA  578 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEeccccccccc
Confidence            578889999999999999999999999 899998


No 49 
>KOG1218|consensus
Probab=77.39  E-value=46  Score=30.33  Aligned_cols=14  Identities=36%  Similarity=0.684  Sum_probs=11.5

Q ss_pred             CCceeeCCCCCccC
Q psy15668         26 NRPVCSCLPGHWGN   39 (365)
Q Consensus        26 ~~~~C~C~~G~~g~   39 (365)
                      ....|.|.+||+|.
T Consensus        13 ~~~~c~c~~~~~g~   26 (316)
T KOG1218|consen   13 GSGQCFCDPGYTGR   26 (316)
T ss_pred             CCCceecCCCcccc
Confidence            45689999999985


No 50 
>PHA02887 EGF-like protein; Provisional
Probab=75.90  E-value=2.9  Score=31.96  Aligned_cols=36  Identities=25%  Similarity=0.528  Sum_probs=25.1

Q ss_pred             CCCCCC---CCCCCCCeeeec--CCCceeeCCCCCccCCCCCccC
Q psy15668          7 GDPCSP---NPCGSNTQCNVA--SNRPVCSCLPGHWGNPLTYCQR   46 (365)
Q Consensus         7 id~C~~---~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~C~~   46 (365)
                      ..+|..   +=|- ||+|.-.  ...+.|.|++||+|.   +|+.
T Consensus        83 f~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~  123 (126)
T PHA02887         83 FEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGI---RCDE  123 (126)
T ss_pred             ccccChHhhCEee-CCEEEccccCCCceeECCCCcccC---CCCc
Confidence            355642   3376 4789843  356899999999999   6653


No 51 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.47  E-value=3.1  Score=32.37  Aligned_cols=37  Identities=30%  Similarity=0.548  Sum_probs=26.4

Q ss_pred             CCCCCCC---CCCCCCCeeeecC--CCceeeCCCCCccCCCCCccC
Q psy15668          6 GGDPCSP---NPCGSNTQCNVAS--NRPVCSCLPGHWGNPLTYCQR   46 (365)
Q Consensus         6 did~C~~---~~C~~~~~C~~~~--~~~~C~C~~G~~g~~~~~C~~   46 (365)
                      +|-+|.+   +=|-+ |+|.-..  ..+.|.|..||+|.   +|+.
T Consensus        41 ~i~~Cp~ey~~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh   82 (139)
T PHA03099         41 AIRLCGPEGDGYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQH   82 (139)
T ss_pred             ccccCChhhCCEeEC-CEEEeeccCCCceeECCCCcccc---cccc
Confidence            4556643   33766 4898444  77999999999999   6653


No 52 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=75.36  E-value=1.9  Score=27.62  Aligned_cols=22  Identities=32%  Similarity=0.794  Sum_probs=16.9

Q ss_pred             CCceeeCCCCCccCCCCCCcCC
Q psy15668        333 DRPVCTCLPGYVGDALTYCRRG  354 (365)
Q Consensus       333 ~~~~C~C~~G~~g~~~~~C~~~  354 (365)
                      ...+|.|+++|+|...++|.+.
T Consensus        16 ~~G~C~C~~~~~G~~C~~C~~g   37 (49)
T PF00053_consen   16 STGQCVCKPGTTGPRCDQCKPG   37 (49)
T ss_dssp             TCEEESBSTTEESTTS-EE-TT
T ss_pred             CCCEEeccccccCCcCcCCCCc
Confidence            3469999999999998888864


No 53 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=74.91  E-value=3.4  Score=31.56  Aligned_cols=32  Identities=47%  Similarity=0.923  Sum_probs=25.4

Q ss_pred             CCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcc
Q psy15668          6 GGDPCS-PNPCGSNTQCNVASNRPVCSCLPGHWG   38 (365)
Q Consensus         6 did~C~-~~~C~~~~~C~~~~~~~~C~C~~G~~g   38 (365)
                      ..|+|. ...|+.+|.|.. .....|.|.+||.-
T Consensus        76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP  108 (110)
T ss_pred             cccCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence            457897 578999999964 45678999999974


No 54 
>smart00051 DSL delta serrate ligand.
Probab=74.07  E-value=4.9  Score=27.39  Aligned_cols=44  Identities=27%  Similarity=0.652  Sum_probs=30.2

Q ss_pred             ceeeCCCCCccCCCCCcccCCCCCCCCC-CCCCCCCeeeeCCCCceeeCCCCCCCC
Q psy15668         87 PVCSCPPGYTGDPLTQCRRFDPQELCDR-SPCGVNTRCEVINMVPTCSCLPGYTGS  141 (365)
Q Consensus        87 ~~C~C~~G~~g~~~~~C~~~~~~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~  141 (365)
                      +.-.|.++|.|.   .|...     |.+ .....+..|.. .|  .++|.+||+|.
T Consensus        17 ~rv~C~~~~yG~---~C~~~-----C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~   61 (63)
T smart00051       17 IRVTCDENYYGE---GCNKF-----CRPRDDFFGHYTCDE-NG--NKGCLEGWMGP   61 (63)
T ss_pred             EEeeCCCCCcCC---ccCCE-----eCcCccccCCccCCc-CC--CEecCCCCcCC
Confidence            455899999999   88643     643 22445667743 23  57899999987


No 55 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=72.71  E-value=5.7  Score=25.50  Aligned_cols=20  Identities=30%  Similarity=0.713  Sum_probs=17.2

Q ss_pred             ceeeCCCCCccCCCCCCcCC
Q psy15668        335 PVCTCLPGYVGDALTYCRRG  354 (365)
Q Consensus       335 ~~C~C~~G~~g~~~~~C~~~  354 (365)
                      .+|.|+++|+|...+.|.+.
T Consensus        19 G~C~C~~~~~G~~C~~C~~g   38 (50)
T cd00055          19 GQCECKPNTTGRRCDRCAPG   38 (50)
T ss_pred             CEEeCCCcCCCCCCCCCCCC
Confidence            58999999999998888754


No 56 
>PHA02887 EGF-like protein; Provisional
Probab=71.67  E-value=3.5  Score=31.49  Aligned_cols=30  Identities=30%  Similarity=0.602  Sum_probs=23.2

Q ss_pred             CCCCCCceeeec--CCCceeeCCCCCccCCCCcccc
Q psy15668        233 GQCGINAKCEVR--GATPICSCPRDMTGDPFVRCRP  266 (365)
Q Consensus       233 ~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~C~~  266 (365)
                      +.|. +|+|...  .....|+|+.||+|.   +|+.
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~  123 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTGI---RCDE  123 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCcccC---CCCc
Confidence            4455 5788765  456889999999999   7875


No 57 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=67.58  E-value=5.6  Score=30.96  Aligned_cols=31  Identities=29%  Similarity=0.578  Sum_probs=24.0

Q ss_pred             CCCCCCceeeec--CCCceeeCCCCCccCCCCccccC
Q psy15668        233 GQCGINAKCEVR--GATPICSCPRDMTGDPFVRCRPF  267 (365)
Q Consensus       233 ~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~C~~~  267 (365)
                      +.|.+ |+|...  ...+.|+|..||+|.   +|+..
T Consensus        51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh~   83 (139)
T PHA03099         51 GYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQHV   83 (139)
T ss_pred             CEeEC-CEEEeeccCCCceeECCCCcccc---cccce
Confidence            45665 488765  477899999999999   88853


No 58 
>KOG3514|consensus
Probab=61.19  E-value=5.6  Score=41.93  Aligned_cols=31  Identities=35%  Similarity=0.833  Sum_probs=28.5

Q ss_pred             CCCCCCCCCCCeeeecCCCceeeCCC-CCccC
Q psy15668          9 PCSPNPCGSNTQCNVASNRPVCSCLP-GHWGN   39 (365)
Q Consensus         9 ~C~~~~C~~~~~C~~~~~~~~C~C~~-G~~g~   39 (365)
                      .|.++||.++|+|...+.+|.|.|.. ||.|.
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~  656 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR  656 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCc
Confidence            69999999999999999999999975 78887


No 59 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=59.79  E-value=8.8  Score=22.43  Aligned_cols=13  Identities=38%  Similarity=0.838  Sum_probs=11.1

Q ss_pred             ceeeCCCCCccCC
Q psy15668        335 PVCTCLPGYVGDA  347 (365)
Q Consensus       335 ~~C~C~~G~~g~~  347 (365)
                      +.|.||+||+.+.
T Consensus        18 ~~C~CPeGyIlde   30 (34)
T PF09064_consen   18 GQCFCPEGYILDE   30 (34)
T ss_pred             CceeCCCceEecC
Confidence            5899999998765


No 60 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=58.83  E-value=24  Score=22.59  Aligned_cols=28  Identities=36%  Similarity=0.828  Sum_probs=20.1

Q ss_pred             CCCC-CCCCCCCCeecCCCCCCCCCCceeeCCCCCccC
Q psy15668        310 DLCE-PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD  346 (365)
Q Consensus       310 d~C~-~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~g~  346 (365)
                      +.|. ...|..++.|++         .+|+|++||+-.
T Consensus        20 ~~C~~~~qC~~~s~C~~---------g~C~C~~g~~~~   48 (52)
T PF01683_consen   20 ESCESDEQCIGGSVCVN---------GRCQCPPGYVEV   48 (52)
T ss_pred             CCCCCcCCCCCcCEEcC---------CEeECCCCCEec
Confidence            3454 235668889987         389999999754


No 61 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=58.65  E-value=12  Score=23.55  Aligned_cols=20  Identities=30%  Similarity=0.727  Sum_probs=16.6

Q ss_pred             ceeeCCCCCccCCCCCCcCC
Q psy15668        335 PVCTCLPGYVGDALTYCRRG  354 (365)
Q Consensus       335 ~~C~C~~G~~g~~~~~C~~~  354 (365)
                      .+|.|+++|+|...+.|.+.
T Consensus        18 G~C~C~~~~~G~~C~~C~~g   37 (46)
T smart00180       18 GQCECKPNVTGRRCDRCAPG   37 (46)
T ss_pred             CEEECCCCCCCCCCCcCCCC
Confidence            58999999999888777653


No 62 
>KOG3516|consensus
Probab=54.99  E-value=9.7  Score=40.75  Aligned_cols=37  Identities=38%  Similarity=0.750  Sum_probs=31.8

Q ss_pred             CCCCCCCCCCCCCCCCeecCCCCCCCCCCceeeCC-CCCccCC
Q psy15668        306 PLPDDLCEPNPCGENAKCQPGYDKSGKDRPVCTCL-PGYVGDA  347 (365)
Q Consensus       306 ~~~~d~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~-~G~~g~~  347 (365)
                      |..+|.|.+++|.+++.|..     ....|.|.|. .||.|..
T Consensus       542 C~i~drClPN~CehgG~C~Q-----s~~~f~C~C~~TGY~Gat  579 (1306)
T KOG3516|consen  542 CGISDRCLPNPCEHGGKCSQ-----SWDDFECNCELTGYKGAT  579 (1306)
T ss_pred             cccccccCCccccCCCcccc-----cccceeEecccccccccc
Confidence            55578899999999999997     4577999999 9999987


No 63 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=54.37  E-value=12  Score=28.56  Aligned_cols=24  Identities=50%  Similarity=1.140  Sum_probs=18.6

Q ss_pred             CCCCCeeeecCCCceeeCCCCCccC
Q psy15668         74 CGVNSECNVRNHIPVCSCPPGYTGD   98 (365)
Q Consensus        74 C~~~~~C~~~~g~~~C~C~~G~~g~   98 (365)
                      |+.++.|.. .....|.|.+||.-+
T Consensus        86 CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   86 CGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             cCCccEeCC-CCCCceECCCCcCCC
Confidence            888999954 345689999999743


No 64 
>KOG3512|consensus
Probab=49.88  E-value=43  Score=32.34  Aligned_cols=62  Identities=16%  Similarity=0.294  Sum_probs=39.8

Q ss_pred             ecCCccceeeeeecCCCCCCCCC---CCCCCCCCCCCC----CCeecCCCCCCCCCCceeeCCCCCccCCCCCCcC
Q psy15668        285 YQNNKTIFYVSLVSLNYPYVTPL---PDDLCEPNPCGE----NAKCQPGYDKSGKDRPVCTCLPGYVGDALTYCRR  353 (365)
Q Consensus       285 c~~~~~~~~~~~c~~~~~~~~~~---~~d~C~~~~C~~----~~~C~~~~~~~~~~~~~C~C~~G~~g~~~~~C~~  353 (365)
                      |.-+.++..|..|-+||+-..-.   +...|..-.|++    +-+|..       .+.+|.|++|-+|..++.|.+
T Consensus       364 CrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq-------~tGqCpCkeGvtG~tCnrCa~  432 (592)
T KOG3512|consen  364 CRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQ-------TTGQCPCKEGVTGLTCNRCAP  432 (592)
T ss_pred             cccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccc-------cCCcccCCCCCcccccccccc
Confidence            34445678888999999843322   223354434544    235654       235899999999999888875


No 65 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=48.46  E-value=7.8  Score=29.20  Aligned_cols=35  Identities=23%  Similarity=0.530  Sum_probs=25.6

Q ss_pred             cCCCCCC--CCCCCCCCeeeecC-----CCceeeCCCCCccC
Q psy15668          5 MGGDPCS--PNPCGSNTQCNVAS-----NRPVCSCLPGHWGN   39 (365)
Q Consensus         5 ~did~C~--~~~C~~~~~C~~~~-----~~~~C~C~~G~~g~   39 (365)
                      ...++|.  .+.|..||.|+...     .=|.|.|.+.+...
T Consensus         3 ~S~~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~   44 (103)
T PF12955_consen    3 SSNDACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT   44 (103)
T ss_pred             CCHHHHHHhccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence            4455664  67899999999773     44899999966543


No 66 
>KOG1218|consensus
Probab=37.19  E-value=3.2e+02  Score=24.72  Aligned_cols=40  Identities=33%  Similarity=0.722  Sum_probs=21.7

Q ss_pred             CCCCccCCCCCcccCCCCCCCCCCCCCCCCeeeeCCCCceeeCCCCCCCC
Q psy15668         92 PPGYTGDPLTQCRRFDPQELCDRSPCGVNTRCEVINMVPTCSCLPGYTGS  141 (365)
Q Consensus        92 ~~G~~g~~~~~C~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~  141 (365)
                      ..+|.+.   .|..+   .++... |.. .+|.+...  .|.+..+|.+.
T Consensus        96 ~~~~~g~---~C~~~---~~~~~~-c~~-~~C~~~~~--~c~~~~~~~~~  135 (316)
T KOG1218|consen   96 LNGYEGP---QCESP---CPCGDG-CAE-KTCANPRR--ECRCGGGYIGE  135 (316)
T ss_pred             CCCCCcc---cccCC---CCcCCc-ccc-cccCCCcc--ceecCCcCccc
Confidence            5777777   77755   233222 222 34544432  46666677666


No 67 
>KOG3514|consensus
Probab=36.92  E-value=25  Score=37.48  Aligned_cols=32  Identities=41%  Similarity=1.076  Sum_probs=28.2

Q ss_pred             CCCCCCCCCCCeecCCCCCCCCCCceeeCC-CCCccCC
Q psy15668        311 LCEPNPCGENAKCQPGYDKSGKDRPVCTCL-PGYVGDA  347 (365)
Q Consensus       311 ~C~~~~C~~~~~C~~~~~~~~~~~~~C~C~-~G~~g~~  347 (365)
                      .|.++||.|+|+|..     +..+|.|.|. .||.|..
T Consensus       625 ~C~~nPC~N~g~C~e-----gwNrfiCDCs~T~~~G~~  657 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSE-----GWNRFICDCSGTGFEGRT  657 (1591)
T ss_pred             ccCCCcccCCCCccc-----cccccccccccCcccCcc
Confidence            688999999999997     7788999996 6898887


No 68 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=36.23  E-value=16  Score=24.84  Aligned_cols=14  Identities=29%  Similarity=0.548  Sum_probs=5.9

Q ss_pred             ceeeeCCCCCccCC
Q psy15668        188 RAVCSCPVGYLGDP  201 (365)
Q Consensus       188 ~~~C~C~~G~~g~~  201 (365)
                      .++-.|.+.|.|..
T Consensus        16 ~~rv~C~~nyyG~~   29 (63)
T PF01414_consen   16 RIRVVCDENYYGPN   29 (63)
T ss_dssp             -------TTEETTT
T ss_pred             EEEEECCCCCCCcc
Confidence            45678899999983


No 69 
>KOG3607|consensus
Probab=26.68  E-value=63  Score=33.53  Aligned_cols=49  Identities=29%  Similarity=0.766  Sum_probs=34.8

Q ss_pred             CCCCCCCCcccCCccc-------cccccCCCCCCeeeecCCCceeeCCCCCccCCCCCcccC
Q psy15668         52 HSDCSHSKACKEYRCV-------DVCAGQCGVNSECNVRNHIPVCSCPPGYTGDPLTQCRRF  106 (365)
Q Consensus        52 ~~~C~~~~~C~~~~C~-------~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~  106 (365)
                      ...|..+..|.+.+|+       ..|...|..++.|.+..   .|.|.+||.+.   .|...
T Consensus       603 Gt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~~---~ChC~~gwapp---~C~~~  658 (716)
T KOG3607|consen  603 GTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNEL---NCHCEPGWAPP---FCFIF  658 (716)
T ss_pred             CCccCCCceecCCcchhhhhhcccccccccCCCcccCCCc---ceeeCCCCCCC---ccccc
Confidence            3446666677777772       33445588889886654   89999999988   78754


Done!