Query         psy9427
Match_columns 324
No_of_seqs    289 out of 2381
Neff          8.5 
Searched_HMMs 46136
Date          Fri Aug 16 19:04:44 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9427.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9427hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.4 1.3E-12 2.8E-17  128.0  11.2  151   19-217   699-858 (1289)
  2 KOG1219|consensus               99.3 1.1E-11 2.3E-16  131.1   8.9  110   15-158  3865-3976(4289)
  3 KOG1219|consensus               99.2 2.5E-11 5.5E-16  128.3   8.9  110   77-219  3864-3975(4289)
  4 KOG1214|consensus               99.2 4.2E-11 9.1E-16  117.7   9.6  133    5-165   724-866 (1289)
  5 KOG1217|consensus               99.0 9.9E-09 2.2E-13   99.7  16.6  190   15-217   170-388 (487)
  6 KOG1217|consensus               99.0 2.7E-08 5.9E-13   96.6  18.1  178   16-218    91-304 (487)
  7 KOG4289|consensus               98.7 1.3E-08 2.8E-13  105.1   5.5  104   16-152  1181-1308(2531)
  8 KOG4260|consensus               98.5 5.2E-08 1.1E-12   85.1   3.7  152   20-217   150-305 (350)
  9 KOG4289|consensus               98.5 1.2E-07 2.6E-12   98.1   6.4   92   92-214  1216-1308(2531)
 10 KOG1225|consensus               98.2 1.4E-05   3E-10   77.8  11.5   20  194-218   345-364 (525)
 11 KOG1225|consensus               98.1 4.2E-05 9.2E-10   74.5  11.8  149   98-313   234-387 (525)
 12 KOG4260|consensus               98.0 1.2E-05 2.6E-10   70.6   5.3  105   83-217   150-269 (350)
 13 PF06247 Plasmod_Pvs28:  Plasmo  97.9 3.4E-06 7.5E-11   70.3   1.2  145   21-217     7-161 (197)
 14 PF00008 EGF:  EGF-like domain   97.9 1.1E-05 2.4E-10   48.2   2.9   30   17-47      1-31  (32)
 15 PF07645 EGF_CA:  Calcium-bindi  97.9 8.9E-06 1.9E-10   51.9   2.5   25  193-217    11-35  (42)
 16 PF07645 EGF_CA:  Calcium-bindi  97.7 2.3E-05   5E-10   50.0   2.4   34  121-157     3-37  (42)
 17 PF00008 EGF:  EGF-like domain   97.5 5.5E-05 1.2E-09   45.2   1.6   29   80-108     1-30  (32)
 18 smart00179 EGF_CA Calcium-bind  97.4 0.00024 5.2E-09   44.0   3.9   32   77-108     2-35  (39)
 19 PF12662 cEGF:  Complement Clr-  97.4 0.00017 3.7E-09   39.8   2.7   24  144-167     1-24  (24)
 20 PF12947 EGF_3:  EGF domain;  I  97.4 0.00021 4.5E-09   43.9   3.2   28   21-49      7-34  (36)
 21 smart00179 EGF_CA Calcium-bind  97.3 0.00043 9.2E-09   42.9   3.7   31   14-45      2-33  (39)
 22 PF12947 EGF_3:  EGF domain;  I  97.2 0.00024 5.1E-09   43.6   2.0   28   83-110     6-33  (36)
 23 PF12662 cEGF:  Complement Clr-  97.2 0.00038 8.1E-09   38.5   2.3   24   97-122     1-24  (24)
 24 cd00054 EGF_CA Calcium-binding  97.0  0.0013 2.8E-08   40.1   3.8   32   77-108     2-34  (38)
 25 cd00054 EGF_CA Calcium-binding  96.8  0.0015 3.2E-08   39.9   3.3   33   14-47      2-35  (38)
 26 cd00053 EGF Epidermal growth f  96.7  0.0025 5.3E-08   38.2   3.4   28   19-47      5-32  (36)
 27 cd00053 EGF Epidermal growth f  96.5  0.0048   1E-07   36.9   3.8   28   82-109     5-32  (36)
 28 smart00181 EGF Epidermal growt  96.2  0.0061 1.3E-07   36.7   3.2   29   17-47      2-31  (35)
 29 KOG1226|consensus               96.2   0.041 8.9E-07   55.5  10.4  142   99-283   479-636 (783)
 30 smart00181 EGF Epidermal growt  96.1  0.0092   2E-07   35.9   3.5   28   80-108     2-30  (35)
 31 PF14670 FXa_inhibition:  Coagu  95.3   0.013 2.8E-07   35.9   1.9   26  132-160     9-34  (36)
 32 PF14670 FXa_inhibition:  Coagu  95.2   0.014 3.1E-07   35.7   1.8   23   89-111    10-32  (36)
 33 KOG1226|consensus               95.1    0.43 9.4E-06   48.4  13.0  117   84-239   515-635 (783)
 34 KOG0994|consensus               92.9    0.48   1E-05   50.1   8.4   63   90-159   877-948 (1758)
 35 PF07974 EGF_2:  EGF-like domai  92.9    0.17 3.7E-06   30.0   3.3   25   20-47      6-30  (32)
 36 PF06247 Plasmod_Pvs28:  Plasmo  92.3     0.1 2.2E-06   43.9   2.4  106   20-158    50-164 (197)
 37 PF07974 EGF_2:  EGF-like domai  91.1    0.25 5.3E-06   29.3   2.5   23  193-217     7-29  (32)
 38 PF04706 Dickkopf_N:  Dickkopf   91.0    0.41 8.9E-06   31.8   3.8   21  295-315    31-51  (52)
 39 PF12946 EGF_MSP1_1:  MSP1 EGF   91.0    0.22 4.7E-06   30.5   2.3   30   17-47      2-32  (37)
 40 PF12661 hEGF:  Human growth fa  90.2    0.27 5.8E-06   22.9   1.7   11   36-47      1-11  (13)
 41 PF12946 EGF_MSP1_1:  MSP1 EGF   89.3    0.17 3.6E-06   31.0   0.7   30   81-110     3-33  (37)
 42 KOG0994|consensus               88.6       2 4.3E-05   45.8   8.2   75   32-112   931-1013(1758)
 43 cd01475 vWA_Matrilin VWA_Matri  88.3    0.44 9.6E-06   41.8   3.1   36  161-217   183-218 (224)
 44 cd01475 vWA_Matrilin VWA_Matri  85.1     1.2 2.5E-05   39.1   4.1   38  114-157   182-220 (224)
 45 PF04706 Dickkopf_N:  Dickkopf   84.8     1.4   3E-05   29.3   3.3   15  233-248     2-16  (52)
 46 PF00954 S_locus_glycop:  S-loc  79.8     1.8 3.9E-05   33.4   2.8   24  193-217    85-108 (110)
 47 PF00954 S_locus_glycop:  S-loc  78.4       2 4.2E-05   33.2   2.7   31   14-46     77-108 (110)
 48 KOG1836|consensus               71.2       7 0.00015   44.2   5.5   51  101-158   760-811 (1705)
 49 PF01683 EB:  EB module;  Inter  68.3     3.6 7.9E-05   27.0   1.7   18  297-314    22-39  (52)
 50 PF00095 WAP:  WAP-type (Whey A  60.7     3.2 6.9E-05   26.1   0.3   20  291-310    13-32  (43)
 51 smart00051 DSL delta serrate l  57.8      17 0.00036   25.2   3.5   47   97-157    16-62  (63)
 52 PF09064 Tme5_EGF_like:  Thromb  55.6      14 0.00031   22.1   2.4   21   27-49     11-31  (34)
 53 PF04885 Stig1:  Stigma-specifi  55.1      32  0.0007   27.7   5.2   34  242-275    68-108 (136)
 54 PHA02887 EGF-like protein; Pro  53.5      13 0.00028   29.0   2.5   24  193-217    93-118 (126)
 55 PHA02887 EGF-like protein; Pro  53.1      15 0.00033   28.6   2.8   31   83-119    92-124 (126)
 56 cd00055 EGF_Lam Laminin-type e  49.0      21 0.00044   23.2   2.7   25   27-54     13-37  (50)
 57 PF00053 Laminin_EGF:  Laminin   48.2      15 0.00032   23.7   1.9   26   26-54     11-36  (49)
 58 smart00217 WAP Four-disulfide   48.2      25 0.00054   22.7   2.9   24  287-310    12-35  (47)
 59 PHA03099 epidermal growth fact  42.1      22 0.00047   28.3   2.2   24  193-217    52-77  (139)
 60 PHA03099 epidermal growth fact  41.8      24 0.00051   28.1   2.4   31   83-119    51-83  (139)
 61 KOG3516|consensus               36.4      26 0.00056   37.9   2.4   37   77-118   545-582 (1306)
 62 KOG3516|consensus               30.9      34 0.00074   37.1   2.3   33   14-47    545-578 (1306)
 63 PF04885 Stig1:  Stigma-specifi  30.4 1.1E+02  0.0023   24.8   4.6   54  262-315    67-134 (136)
 64 cd00199 WAP whey acidic protei  29.6      64  0.0014   21.9   2.8   21  290-310    28-48  (60)
 65 smart00180 EGF_Lam Laminin-typ  28.5      57  0.0012   20.7   2.3   25   27-54     12-36  (46)
 66 PF08562 Crisp:  Crisp;  InterP  27.1      27 0.00058   23.5   0.5   15  306-320     5-19  (55)
 67 PF11403 Yeast_MT:  Yeast metal  23.1      93   0.002   18.5   2.2   12  293-304    26-37  (40)
 68 KOG3514|consensus               23.1      56  0.0012   35.2   2.1   35   79-118   625-660 (1591)
 69 cd01328 FSL_SPARC Follistatin-  21.3 1.2E+02  0.0026   22.4   3.1   27   16-42      1-28  (86)

No 1  
>KOG1214|consensus
Probab=99.41  E-value=1.3e-12  Score=128.03  Aligned_cols=151  Identities=25%  Similarity=0.564  Sum_probs=113.8

Q ss_pred             CCCCCCCCEeeeC-CCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCC--CCCCCCCEEeeCC
Q psy9427          19 GVSCGVNAVCVVN-NHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDS--GACGTNAVCIPED   95 (324)
Q Consensus        19 ~~~C~~~~~C~~~-~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~--~~C~~~~~C~~~~   95 (324)
                      +.-|..++.|... .-.|+|.|..| |.|++. .|.+                      +++|..  ..|+.++.|++.+
T Consensus       699 sh~cdt~a~C~pg~~~~~tcecs~g-~~gdgr-~c~d----------------------~~eca~~~~~CGp~s~Cin~p  754 (1289)
T KOG1214|consen  699 SHMCDTTARCHPGTGVDYTCECSSG-YQGDGR-NCVD----------------------ENECATGFHRCGPNSVCINLP  754 (1289)
T ss_pred             CcccCCCccccCCCCcceEEEEeec-cCCCCC-CCCC----------------------hhhhccCCCCCCCCceeecCC
Confidence            4567888889754 45799999999 888654 4555                      667753  5699999999999


Q ss_pred             CCcEeeCCCCCcCCCCCCCcccccC-----CCCCCC-CCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCC
Q psy9427          96 HRPVCHCPVGTHPSPSPDVHCARES-----NLCRDS-NPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCP  169 (324)
Q Consensus        96 ~~~~C~C~~G~~g~~~~~~~C~~~~-----~~C~~~-~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~  169 (324)
                      ++|.|.|..||...... .+|..+.     +.|... ..|+..++++-+...-+.|+|+|.+||.|++.  .|.+++||.
T Consensus       755 g~~rceC~~gy~F~dd~-~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~--~c~dvDeC~  831 (1289)
T KOG1214|consen  755 GSYRCECRSGYEFADDR-HTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH--QCTDVDECS  831 (1289)
T ss_pred             CceeEEEeecceeccCC-cceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc--ccccccccC
Confidence            99999999998754433 5676542     445332 35655554443222556799999999999984  499999983


Q ss_pred             CCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         170 RGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       170 ~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                        ..                   .|+.+|.|.+++|+|.|+|.+||.|
T Consensus       832 --ps-------------------rChp~A~CyntpgsfsC~C~pGy~G  858 (1289)
T KOG1214|consen  832 --PS-------------------RCHPAATCYNTPGSFSCRCQPGYYG  858 (1289)
T ss_pred             --cc-------------------ccCCCceEecCCCcceeecccCccC
Confidence              21                   5889999999999999999999997


No 2  
>KOG1219|consensus
Probab=99.27  E-value=1.1e-11  Score=131.12  Aligned_cols=110  Identities=26%  Similarity=0.704  Sum_probs=98.2

Q ss_pred             cCCCCCCCCCCCEeeeC-CCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCCCCCCCCCEEee
Q psy9427          15 NSCDGVSCGVNAVCVVN-NHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDSGACGTNAVCIP   93 (324)
Q Consensus        15 ~~C~~~~C~~~~~C~~~-~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~~~C~~~~~C~~   93 (324)
                      ++|..+||+++|+|... .+.|+|.|++. |+|.   .|+..                     +++|.++||..+++|+.
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsq-ysG~---~CEi~---------------------~epC~snPC~~GgtCip 3919 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQ-YSGN---HCEID---------------------LEPCASNPCLTGGTCIP 3919 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCccc-ccCc---ccccc---------------------cccccCCCCCCCCEEEe
Confidence            78999999999999965 67899999999 9995   34432                     78999999999999999


Q ss_pred             CCCCcEeeCCCCCcCCCCCCCccccc-CCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCC
Q psy9427          94 EDHRPVCHCPVGTHPSPSPDVHCARE-SNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPY  158 (324)
Q Consensus        94 ~~~~~~C~C~~G~~g~~~~~~~C~~~-~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~  158 (324)
                      ..+.|.|.|+.||+|     .+|+.. +++| ..++|..++.|+|   ++|+|.|.|.+||.|..+
T Consensus      3920 ~~n~f~CnC~~gyTG-----~~Ce~~Gi~eC-s~n~C~~gg~C~n---~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3920 FYNGFLCNCPNGYTG-----KRCEARGISEC-SKNVCGTGGQCIN---IPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred             cCCCeeEeCCCCccC-----ceeeccccccc-ccccccCCceeec---cCCceEeccChhHhcccC
Confidence            999999999999999     889764 5999 8899999999999   999999999999998763


No 3  
>KOG1219|consensus
Probab=99.22  E-value=2.5e-11  Score=128.34  Aligned_cols=110  Identities=24%  Similarity=0.605  Sum_probs=95.1

Q ss_pred             cCCCCCCCCCCCCEEeeC-CCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCcc
Q psy9427          77 IPACDSGACGTNAVCIPE-DHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVG  155 (324)
Q Consensus        77 i~~C~~~~C~~~~~C~~~-~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g  155 (324)
                      .++|..+||+++++|..+ .++|.|.|++-|.|     +.|+..++.| ..+||..+++|+.   ..+.|.|.|+.||+|
T Consensus      3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG-----~~CEi~~epC-~snPC~~GgtCip---~~n~f~CnC~~gyTG 3934 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSG-----NHCEIDLEPC-ASNPCLTGGTCIP---FYNGFLCNCPNGYTG 3934 (4289)
T ss_pred             ccccccCcccCCCEecCCCCCceEEeCcccccC-----cccccccccc-cCCCCCCCCEEEe---cCCCeeEeCCCCccC
Confidence            378888999999999966 46799999999999     9999766999 9999999999999   899999999999999


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCcc-CCCCCCeeeeCCCCceeeCCCCCccCC
Q psy9427         156 DPYKTGCVPQGQCPRGDADCPPSSRCSGGQCVSVCQG-ACGPNALCRAVGKRPVCACPDGFSAVG  219 (324)
Q Consensus       156 ~~~~~~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~-~C~~~~~C~~~~g~y~C~C~~Gy~g~~  219 (324)
                      ..++.  +.++||                      .. +|+++|.|+|.+|+|.|.|-+||.|+.
T Consensus      3935 ~~Ce~--~Gi~eC----------------------s~n~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3935 KRCEA--RGISEC----------------------SKNVCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred             ceeec--cccccc----------------------ccccccCCceeeccCCceEeccChhHhccc
Confidence            98541  124555                      22 699999999999999999999999844


No 4  
>KOG1214|consensus
Probab=99.22  E-value=4.2e-11  Score=117.67  Aligned_cols=133  Identities=27%  Similarity=0.635  Sum_probs=97.0

Q ss_pred             eeecCCcccccC--CCC--CCCCCCCEeeeCCCceeeeeCCCeeec-CCCCCCcccCCCCCCcCCCCCceeCCCCcccCC
Q psy9427           5 FVFYLTLTCWNS--CDG--VSCGVNAVCVVNNHEARCFCPPGSYVG-DPNIECRQVPCVHNHDCSTSHACDLLSHTCIPA   79 (324)
Q Consensus         5 ~~~~~~~~c~~~--C~~--~~C~~~~~C~~~~~~~~C~C~~G~y~g-~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~   79 (324)
                      |.+++ +.|+++  |..  ..|++++.|++.+|+|.|.|..| |.- ++...|..+.=      +..          +++
T Consensus       724 ~~gdg-r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~g-y~F~dd~~tCV~i~~------pap----------~n~  785 (1289)
T KOG1214|consen  724 YQGDG-RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSG-YEFADDRHTCVLITP------PAP----------ANP  785 (1289)
T ss_pred             cCCCC-CCCCChhhhccCCCCCCCCceeecCCCceeEEEeec-ceeccCCcceEEecC------CCC----------CCc
Confidence            33443 556554  654  46999999999999999999999 653 33344544200      000          444


Q ss_pred             CCC--CCCCCCC--EEeeC-CCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCc
Q psy9427          80 CDS--GACGTNA--VCIPE-DHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSV  154 (324)
Q Consensus        80 C~~--~~C~~~~--~C~~~-~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~  154 (324)
                      |..  +.|..++  .|+.. .+.|.|.|.|||.|++   ..|.++ |+| .++.|++.+.|.+   ++|+|.|+|.+||.
T Consensus       786 Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG---~~c~dv-DeC-~psrChp~A~Cyn---tpgsfsC~C~pGy~  857 (1289)
T KOG1214|consen  786 CEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG---HQCTDV-DEC-SPSRCHPAATCYN---TPGSFSCRCQPGYY  857 (1289)
T ss_pred             cccCccccCcCCceEEEecCCceEEEeecCCccCCc---cccccc-ccc-CccccCCCceEec---CCCcceeecccCcc
Confidence            542  3455444  44433 3569999999999998   789998 999 8899999999999   99999999999999


Q ss_pred             cCCCCCCCCCC
Q psy9427         155 GDPYKTGCVPQ  165 (324)
Q Consensus       155 g~~~~~~C~~~  165 (324)
                      |++.+  |.+.
T Consensus       858 GDGf~--CVP~  866 (1289)
T KOG1214|consen  858 GDGFQ--CVPD  866 (1289)
T ss_pred             CCCce--ecCC
Confidence            99854  7654


No 5  
>KOG1217|consensus
Probab=99.02  E-value=9.9e-09  Score=99.65  Aligned_cols=190  Identities=26%  Similarity=0.645  Sum_probs=124.8

Q ss_pred             cCCC--CCCCCCCCEeeeCCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCC--CCCCCC-C
Q psy9427          15 NSCD--GVSCGVNAVCVVNNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDS--GACGTN-A   89 (324)
Q Consensus        15 ~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~--~~C~~~-~   89 (324)
                      +.|.  ..+|.+++.|.+..++|.|.|++| |.+.....-     .+...|...+.|.+..+.=.+.|..  ..+... +
T Consensus       170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~-~~~~~~~~~-----~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~  243 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPG-YTGSTCETT-----GNGGTCVDSVACSCPPGARGPECEVSIVECASGDG  243 (487)
T ss_pred             cccccCCCCcCCCcccccCCCCeeEeCCCC-ccCCcCcCC-----CCCceEecceeccCCCCCCCCCcccccccccCCCC
Confidence            5787  346999999999999999999999 887533211     1122222222222221100111211  122222 8


Q ss_pred             EEeeCCCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCC
Q psy9427          90 VCIPEDHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCP  169 (324)
Q Consensus        90 ~C~~~~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~  169 (324)
                      +|++..++|.|.+++||++...  ..+.++ ++|+...+|..+++|++   ..+.|.|.|++||+|... ..+.+..+|.
T Consensus       244 ~c~~~~~~~~C~~~~g~~~~~~--~~~~~~-~~C~~~~~c~~~~~C~~---~~~~~~C~C~~g~~g~~~-~~~~~~~~C~  316 (487)
T KOG1217|consen  244 TCVNTVGSYTCRCPEGYTGDAC--VTCVDV-DSCALIASCPNGGTCVN---VPGSYRCTCPPGFTGRLC-TECVDVDECS  316 (487)
T ss_pred             cccccCCceeeeCCCCcccccc--ceeeec-cccCCCCccCCCCeeec---CCCcceeeCCCCCCCCCC-cccccccccc
Confidence            8999999999999999999541  245666 88844324888999999   888899999999999886 3455666663


Q ss_pred             C--CCCCCCCCCcccC-----------------Ccc--c-CCCcc-CCCCCCeeee-CCCCceeeCCCCCcc
Q psy9427         170 R--GDADCPPSSRCSG-----------------GQC--V-SVCQG-ACGPNALCRA-VGKRPVCACPDGFSA  217 (324)
Q Consensus       170 ~--~~~~C~~~~~C~~-----------------~~C--~-d~C~~-~C~~~~~C~~-~~g~y~C~C~~Gy~g  217 (324)
                      .  ....|.....|..                 .+|  . ++|.. .+..++.|++ ..++|.|.|+.+|.+
T Consensus       317 ~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~  388 (487)
T KOG1217|consen  317 PRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG  388 (487)
T ss_pred             ccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence            2  2333444334411                 234  2 36777 6888999999 789999999999986


No 6  
>KOG1217|consensus
Probab=98.99  E-value=2.7e-08  Score=96.57  Aligned_cols=178  Identities=28%  Similarity=0.626  Sum_probs=120.7

Q ss_pred             CCCCCCCCCCCEeeeCCCceeeeeCCCeeecCCCCC---CcccC--CCCCCcCCC------CCceeCCCCcc-------c
Q psy9427          16 SCDGVSCGVNAVCVVNNHEARCFCPPGSYVGDPNIE---CRQVP--CVHNHDCST------SHACDLLSHTC-------I   77 (324)
Q Consensus        16 ~C~~~~C~~~~~C~~~~~~~~C~C~~G~y~g~~~~~---C~~~~--C~~~~~C~~------~~~C~c~~~~C-------i   77 (324)
                      .+...+...+..+....++|.|.|++| |.+.....   |...+  +...+.|..      .+.|.|..+..       .
T Consensus        91 ~~~~~~~~~~~~~~~~~~~~~c~c~~g-~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~  169 (487)
T KOG1217|consen   91 PCRSPCLLLCGECVDCVGSYECTCPPG-YQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDL  169 (487)
T ss_pred             cccCCcccCCccccCCCCCceeeCCCc-cccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccc
Confidence            344434445556667788999999999 88865443   66654  345556642      56677776633       2


Q ss_pred             CCCC--CCCCCCCCEEeeCCCCcEeeCCCCCcCCCCCCC----ccccc----------CCCCCCC-CCCCCC-CeeecCC
Q psy9427          78 PACD--SGACGTNAVCIPEDHRPVCHCPVGTHPSPSPDV----HCARE----------SNLCRDS-NPCHPT-ARCEDTG  139 (324)
Q Consensus        78 ~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~----~C~~~----------~~~C~~~-~~C~~~-~~C~~~~  139 (324)
                      ++|.  ..+|.+++.|.+..++|.|.|++||.+......    .|...          ...|... ..+... ++|.+  
T Consensus       170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~--  247 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVN--  247 (487)
T ss_pred             cccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccc--
Confidence            5776  345888899999999999999999998542101    12110          0222111 123323 88999  


Q ss_pred             CCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCccC
Q psy9427         140 ATNGKVVCSCPELSVGDPYKTGCVPQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSAV  218 (324)
Q Consensus       140 ~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g~  218 (324)
                       +.++|+|.+++||++... ..+.++++|....                    .|.++++|++..+.|.|.|++||.|.
T Consensus       248 -~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~--------------------~c~~~~~C~~~~~~~~C~C~~g~~g~  304 (487)
T KOG1217|consen  248 -TVGSYTCRCPEGYTGDAC-VTCVDVDSCALIA--------------------SCPNGGTCVNVPGSYRCTCPPGFTGR  304 (487)
T ss_pred             -cCCceeeeCCCCcccccc-ceeeeccccCCCC--------------------ccCCCCeeecCCCcceeeCCCCCCCC
Confidence             889999999999998872 2366777774332                    15578999999999999999999983


No 7  
>KOG4289|consensus
Probab=98.72  E-value=1.3e-08  Score=105.09  Aligned_cols=104  Identities=33%  Similarity=0.742  Sum_probs=85.0

Q ss_pred             CCCCCCCCCCCEeee----------------------CCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCC
Q psy9427          16 SCDGVSCGVNAVCVV----------------------NNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLS   73 (324)
Q Consensus        16 ~C~~~~C~~~~~C~~----------------------~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~   73 (324)
                      .|...||.+-++|+.                      ..+.++|.||+| |+|+.   |...                  
T Consensus      1181 iClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpG-FTgd~---CeTe------------------ 1238 (2531)
T KOG4289|consen 1181 ICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPG-FTGDY---CETE------------------ 1238 (2531)
T ss_pred             hhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCC-CCccc---ccch------------------
Confidence            477778887777742                      246788999999 99963   3332                  


Q ss_pred             CcccCCCCCCCCCCCCEEeeCCCCcEeeCCCCCcCCCCCCCccccc--CCCCCCCCCCCCCCeeecCCCCCCCeeeeCCC
Q psy9427          74 HTCIPACDSGACGTNAVCIPEDHRPVCHCPVGTHPSPSPDVHCARE--SNLCRDSNPCHPTARCEDTGATNGKVVCSCPE  151 (324)
Q Consensus        74 ~~Ci~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~C~~~--~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~  151 (324)
                         ||+|-.+||+++++|....++|.|.|++||+|     ..|+..  ..-| .+..|.++++|++.  ..|.|.|.|+.
T Consensus      1239 ---iDlCYs~pC~nng~C~srEggYtCeCrpg~tG-----ehCEvs~~agrC-vpGvC~nggtC~~~--~nggf~c~Cp~ 1307 (2531)
T KOG4289|consen 1239 ---IDLCYSGPCGNNGRCRSREGGYTCECRPGFTG-----EHCEVSARAGRC-VPGVCKNGGTCVNL--LNGGFCCHCPY 1307 (2531)
T ss_pred             ---hHhhhcCCCCCCCceEEecCceeEEecCCccc-----cceeeecccCcc-ccceecCCCEEeec--CCCceeccCCC
Confidence               89999999999999999999999999999999     888753  1456 66789999999984  67889999998


Q ss_pred             C
Q psy9427         152 L  152 (324)
Q Consensus       152 G  152 (324)
                      |
T Consensus      1308 g 1308 (2531)
T KOG4289|consen 1308 G 1308 (2531)
T ss_pred             c
Confidence            8


No 8  
>KOG4260|consensus
Probab=98.55  E-value=5.2e-08  Score=85.14  Aligned_cols=152  Identities=22%  Similarity=0.458  Sum_probs=95.6

Q ss_pred             CCCCCCCEeee---CCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCCCCCCCCCEEeeCCC
Q psy9427          20 VSCGVNAVCVV---NNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDSGACGTNAVCIPEDH   96 (324)
Q Consensus        20 ~~C~~~~~C~~---~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~~~C~~~~~C~~~~~   96 (324)
                      .||..+++|..   ..|+-.|.|.+| |+|.....|-..--.. ..=.....|.    .|..     +|.  +.|.....
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~G-Y~Gp~C~~Cg~eyfes-~Rne~~lvCt----~Ch~-----~C~--~~Csg~~~  216 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETG-YTGPLCRYCGIEYFES-SRNEQHLVCT----ACHE-----GCL--GVCSGESS  216 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCC-CCCccccccchHHHHh-hcccccchhh----hhhh-----hhh--cccCCCCC
Confidence            57888899974   367779999999 9997554333200000 0000000000    1111     221  23433222


Q ss_pred             CcEeeCCCCCcCCCCCCCcccccCCCCCC-CCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCC
Q psy9427          97 RPVCHCPVGTHPSPSPDVHCARESNLCRD-SNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPRGDADC  175 (324)
Q Consensus        97 ~~~C~C~~G~~g~~~~~~~C~~~~~~C~~-~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~~~~C  175 (324)
                      +---.|+.||..+.   ..|.|+ |||.. +.+|..+..|+|   +.|+|+|...+||.+.        +++|..-...|
T Consensus       217 k~C~kCkkGW~lde---~gCvDv-nEC~~ep~~c~~~qfCvN---teGSf~C~dk~Gy~~g--------~d~C~~~~d~~  281 (350)
T KOG4260|consen  217 KGCSKCKKGWKLDE---EGCVDV-NECQNEPAPCKAHQFCVN---TEGSFKCEDKEGYKKG--------VDECQFCADVC  281 (350)
T ss_pred             CChhhhcccceecc---cccccH-HHHhcCCCCCChhheeec---CCCceEecccccccCC--------hHHhhhhhhhc
Confidence            22236899999886   789999 99954 458999999999   9999999999999863        33442211111


Q ss_pred             CCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         176 PPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       176 ~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                      .                  ..+..|.++.++|+|.|++|+.-
T Consensus       282 ~------------------~kn~~c~ni~~~~r~v~f~~~~~  305 (350)
T KOG4260|consen  282 A------------------SKNRPCMNIDGQYRCVCFSGLII  305 (350)
T ss_pred             c------------------cCCCCcccCCccEEEEeccccee
Confidence            1                  24567999999999999999863


No 9  
>KOG4289|consensus
Probab=98.54  E-value=1.2e-07  Score=98.12  Aligned_cols=92  Identities=29%  Similarity=0.673  Sum_probs=72.7

Q ss_pred             eeCCCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCCC
Q psy9427          92 IPEDHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPRG  171 (324)
Q Consensus        92 ~~~~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~  171 (324)
                      ++..++++|.|++||+|     ..|+..+|+| -..||.+++.|..   ..|.|+|.|.+||+|+.++.          .
T Consensus      1216 i~pvnglrCrCPpGFTg-----d~CeTeiDlC-Ys~pC~nng~C~s---rEggYtCeCrpg~tGehCEv----------s 1276 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTG-----DYCETEIDLC-YSGPCGNNGRCRS---REGGYTCECRPGFTGEHCEV----------S 1276 (2531)
T ss_pred             ccccCceeEeCCCCCCc-----ccccchhHhh-hcCCCCCCCceEE---ecCceeEEecCCccccceee----------e
Confidence            45678899999999999     7887655999 8889999999999   89999999999999988431          1


Q ss_pred             CCCCCCCCcccCCcccCCCccCCCCCCeeeeC-CCCceeeCCCC
Q psy9427         172 DADCPPSSRCSGGQCVSVCQGACGPNALCRAV-GKRPVCACPDG  214 (324)
Q Consensus       172 ~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~-~g~y~C~C~~G  214 (324)
                      .+    ..+|+++        .|.++++|++. .|+|.|.||.|
T Consensus      1277 ~~----agrCvpG--------vC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1277 AR----AGRCVPG--------VCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             cc----cCccccc--------eecCCCEEeecCCCceeccCCCc
Confidence            10    1222222        46689999955 67899999998


No 10 
>KOG1225|consensus
Probab=98.20  E-value=1.4e-05  Score=77.80  Aligned_cols=20  Identities=25%  Similarity=0.815  Sum_probs=14.2

Q ss_pred             CCCCCeeeeCCCCceeeCCCCCccC
Q psy9427         194 CGPNALCRAVGKRPVCACPDGFSAV  218 (324)
Q Consensus       194 C~~~~~C~~~~g~y~C~C~~Gy~g~  218 (324)
                      |.+++.|++  |   |.|..||+|.
T Consensus       345 C~~~g~cv~--g---C~C~~Gw~G~  364 (525)
T KOG1225|consen  345 CSGGGQCVN--G---CKCKKGWRGP  364 (525)
T ss_pred             cCCCceecc--C---ceeccCccCC
Confidence            556666765  2   8889999873


No 11 
>KOG1225|consensus
Probab=98.07  E-value=4.2e-05  Score=74.50  Aligned_cols=149  Identities=30%  Similarity=0.704  Sum_probs=89.8

Q ss_pred             cEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCC
Q psy9427          98 PVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPRGDADCPP  177 (324)
Q Consensus        98 ~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~~~~C~~  177 (324)
                      ++|.|+.+|.+     ..|+.  -.|  +..|..++.|+.       -+|.|++||+|+.+..            ..|+ 
T Consensus       234 ~ic~c~~~~~g-----~~c~~--~~C--~~~c~~~g~c~~-------G~CIC~~Gf~G~dC~e------------~~Cp-  284 (525)
T KOG1225|consen  234 GICECPEGYFG-----PLCST--IYC--PGGCTGRGQCVE-------GRCICPPGFTGDDCDE------------LVCP-  284 (525)
T ss_pred             ceeecCCceeC-----Ccccc--ccC--CCCCcccceEeC-------CeEeCCCCCcCCCCCc------------ccCC-
Confidence            47899999998     55653  234  234555677866       3899999999988431            1222 


Q ss_pred             CCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCCccccCCCCCCCCccCCCCCcCCCCcccCCCCC
Q psy9427         178 SSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSAVGGEARVGCFREMSACAGDQECLAGEEKCIGGRCRVACLS  257 (324)
Q Consensus       178 ~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~~~~~C~~~~~~C~~~~~C~~~~~~C~~~~C~~~C~~  257 (324)
                                    ..|+.++.+++.    +|.|++||+|....... |   +      ++|. ..-.|+.|+|+  |..
T Consensus       285 --------------~~cs~~g~~~~g----~CiC~~g~~G~dCs~~~-c---p------adC~-g~G~Ci~G~C~--C~~  333 (525)
T KOG1225|consen  285 --------------VDCSGGGVCVDG----ECICNPGYSGKDCSIRR-C---P------ADCS-GHGKCIDGECL--CDE  333 (525)
T ss_pred             --------------cccCCCceecCC----EeecCCCcccccccccc-C---C------ccCC-CCCcccCCceE--eCC
Confidence                          135556666553    69999999984333222 3   3      3444 22245555442  111


Q ss_pred             CCCCCCCCcccCCcccCCCCCCCCCCCCCcCCCCcc-----ccccccCCCCCCCCcccCCc
Q psy9427         258 HSDCSEGERCISSLCQSPCTSHSQCPTGQACQAGTC-----ELGCRQNEECSGQMICAGSK  313 (324)
Q Consensus       258 ~~~C~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~~c-----~~~C~~~~~C~~~~~C~~~~  313 (324)
                      +   +.|..|...    +|.++++|..+..|+.|+-     .+.+.-..+|+....++...
T Consensus       334 G---y~G~~C~~~----~C~~~g~cv~gC~C~~Gw~G~d~~~~~~~~~~~cs~~~~~~~~~  387 (525)
T KOG1225|consen  334 G---YTGELCIQR----ACSGGGQCVNGCKCKKGWRGPDVADPSLLLITECSPPSLCIAGV  387 (525)
T ss_pred             C---CcCCccccc----ccCCCceeccCceeccCccCCCcCCchhhcccccCCCceeeccc
Confidence            1   344555554    3777788877777777764     23466677777777776654


No 12 
>KOG4260|consensus
Probab=97.96  E-value=1.2e-05  Score=70.63  Aligned_cols=105  Identities=22%  Similarity=0.469  Sum_probs=69.7

Q ss_pred             CCCCCCCEEe---eCCCCcEeeCCCCCcCCCCCCCccccc--------CCCCC-CCCCCCC--CCeeecCCCCCCCeee-
Q psy9427          83 GACGTNAVCI---PEDHRPVCHCPVGTHPSPSPDVHCARE--------SNLCR-DSNPCHP--TARCEDTGATNGKVVC-  147 (324)
Q Consensus        83 ~~C~~~~~C~---~~~~~~~C~C~~G~~g~~~~~~~C~~~--------~~~C~-~~~~C~~--~~~C~~~~~~~g~~~C-  147 (324)
                      .+|.-++.|.   ...|+..|.|.+||.|     ..|..-        .++=. .-..|+.  .+.|.    ..++-.| 
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~G-----p~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Cs----g~~~k~C~  220 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTG-----PLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCS----GESSKGCS  220 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCC-----ccccccchHHHHhhcccccchhhhhhhhhhcccC----CCCCCChh
Confidence            3455566665   3457789999999999     444321        01100 0012321  12443    2444556 


Q ss_pred             eCCCCCccCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         148 SCPELSVGDPYKTGCVPQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       148 ~C~~G~~g~~~~~~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                      +|..||..+.  ..|.|++||.....                   +|..+..|+|+.|+|+|..++||.+
T Consensus       221 kCkkGW~lde--~gCvDvnEC~~ep~-------------------~c~~~qfCvNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  221 KCKKGWKLDE--EGCVDVNECQNEPA-------------------PCKAHQFCVNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             hhcccceecc--cccccHHHHhcCCC-------------------CCChhheeecCCCceEecccccccC
Confidence            6999999986  56999999965543                   5777889999999999999999985


No 13 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.92  E-value=3.4e-06  Score=70.25  Aligned_cols=145  Identities=26%  Similarity=0.577  Sum_probs=91.4

Q ss_pred             CCCCCCEeeeCCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCC-----CCCCCCCCEEeeCC
Q psy9427          21 SCGVNAVCVVNNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACD-----SGACGTNAVCIPED   95 (324)
Q Consensus        21 ~C~~~~~C~~~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~-----~~~C~~~~~C~~~~   95 (324)
                      +|.+ |.-+.+...|.|.|.+| |.....+.|..                      ..+|.     ..+|+..|.|++..
T Consensus         7 ~CKN-G~LiQMSNHfEC~Cneg-fvl~~EntCE~----------------------kv~C~~~e~~~K~Cgdya~C~~~~   62 (197)
T PF06247_consen    7 ICKN-GYLIQMSNHFECKCNEG-FVLKNENTCEE----------------------KVECDKLENVNKPCGDYAKCINQA   62 (197)
T ss_dssp             --BT-EEEEEESSEEEEEESTT-EEEEETTEEEE--------------------------SG-GGTTSEEETTEEEEE-S
T ss_pred             cccC-CEEEEccCceEEEcCCC-cEEcccccccc----------------------ceecCcccccCccccchhhhhcCC
Confidence            4654 57888889999999999 87655455665                      23332     25788899999765


Q ss_pred             -----CCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCC
Q psy9427          96 -----HRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPR  170 (324)
Q Consensus        96 -----~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~  170 (324)
                           ..|.|.|.+||+...   ..|..  +.| ....|+ .+.|+..+..+....|+|.-|++.+. ...|...++   
T Consensus        63 ~~~~~~~~~C~C~~gY~~~~---~vCvp--~~C-~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d-n~kCtk~G~---  131 (197)
T PF06247_consen   63 NKGEERAYKCDCINGYILKQ---GVCVP--NKC-NNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD-NKKCTKTGE---  131 (197)
T ss_dssp             STTSSTSEEEEE-TTEEESS---SSEEE--GGG-SS---T-TEEEEEEEGGGSEEEEEE-TEEETTT-TTESEEEE----
T ss_pred             CcccceeEEEecccCceeeC---CeEch--hhc-CceecC-CCeEEecCCCCCCceeEeeeceEecc-CCcccCCCc---
Confidence                 469999999999865   56765  567 556786 78998521133446999999998333 234543221   


Q ss_pred             CCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         171 GDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       171 ~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                                       .+|...|..+.+|..+.+-|+|.+.+||.+
T Consensus       132 -----------------T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~  161 (197)
T PF06247_consen  132 -----------------TKCSLKCKENEECKLVDGYYKCVCKEGFPG  161 (197)
T ss_dssp             ------------------------TTTEEEEEETTEEEEEE-TT-EE
T ss_pred             -----------------cceeeecCCCcceeeeCcEEEeecCCCCCC
Confidence                             122235778899999999999999999987


No 14 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.89  E-value=1.1e-05  Score=48.22  Aligned_cols=30  Identities=33%  Similarity=0.875  Sum_probs=27.1

Q ss_pred             CCCCCCCCCCEeeeCC-CceeeeeCCCeeecC
Q psy9427          17 CDGVSCGVNAVCVVNN-HEARCFCPPGSYVGD   47 (324)
Q Consensus        17 C~~~~C~~~~~C~~~~-~~~~C~C~~G~y~g~   47 (324)
                      |..+||.++|+|+... ++|+|.|++| |+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G-~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPG-YTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTT-EEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCC-CccC
Confidence            5677999999999888 9999999999 9884


No 15 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.89  E-value=8.9e-06  Score=51.94  Aligned_cols=25  Identities=28%  Similarity=0.757  Sum_probs=22.8

Q ss_pred             CCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         193 ACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       193 ~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                      .|..+++|+|+.|+|+|.|++||+.
T Consensus        11 ~C~~~~~C~N~~Gsy~C~C~~Gy~~   35 (42)
T PF07645_consen   11 NCPENGTCVNTEGSYSCSCPPGYEL   35 (42)
T ss_dssp             SSSTTSEEEEETTEEEEEESTTEEE
T ss_pred             cCCCCCEEEcCCCCEEeeCCCCcEE
Confidence            5777899999999999999999983


No 16 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.73  E-value=2.3e-05  Score=50.03  Aligned_cols=34  Identities=26%  Similarity=0.621  Sum_probs=28.9

Q ss_pred             CCCCCCC-CCCCCCeeecCCCCCCCeeeeCCCCCccCC
Q psy9427         121 NLCRDSN-PCHPTARCEDTGATNGKVVCSCPELSVGDP  157 (324)
Q Consensus       121 ~~C~~~~-~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~  157 (324)
                      |||+... .|..++.|+|   +.|+|+|.|++||....
T Consensus         3 dEC~~~~~~C~~~~~C~N---~~Gsy~C~C~~Gy~~~~   37 (42)
T PF07645_consen    3 DECAEGPHNCPENGTCVN---TEGSYSCSCPPGYELND   37 (42)
T ss_dssp             STTTTTSSSSSTTSEEEE---ETTEEEEEESTTEEECT
T ss_pred             cccCCCCCcCCCCCEEEc---CCCCEEeeCCCCcEECC
Confidence            8995533 7988999999   99999999999998443


No 17 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.49  E-value=5.5e-05  Score=45.20  Aligned_cols=29  Identities=31%  Similarity=0.742  Sum_probs=26.1

Q ss_pred             CCCCCCCCCCEEeeCC-CCcEeeCCCCCcC
Q psy9427          80 CDSGACGTNAVCIPED-HRPVCHCPVGTHP  108 (324)
Q Consensus        80 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g  108 (324)
                      |.+++|.++++|+... ++|.|.|++||+|
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G   30 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTG   30 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCcc
Confidence            4457899999999888 9999999999998


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.40  E-value=0.00024  Score=44.02  Aligned_cols=32  Identities=28%  Similarity=0.680  Sum_probs=28.4

Q ss_pred             cCCCCC-CCCCCCCEEeeCCCCcEeeCCCCCc-C
Q psy9427          77 IPACDS-GACGTNAVCIPEDHRPVCHCPVGTH-P  108 (324)
Q Consensus        77 i~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~-g  108 (324)
                      +++|.. .+|.++++|+++.++|.|.|++||+ |
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g   35 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDG   35 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccC
Confidence            567776 6898889999999999999999999 6


No 19 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.40  E-value=0.00017  Score=39.85  Aligned_cols=24  Identities=25%  Similarity=0.450  Sum_probs=21.3

Q ss_pred             CeeeeCCCCCccCCCCCCCCCCCC
Q psy9427         144 KVVCSCPELSVGDPYKTGCVPQGQ  167 (324)
Q Consensus       144 ~~~C~C~~G~~g~~~~~~C~~~~e  167 (324)
                      +|+|+|++||+....+..|.||+|
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            589999999998877788999986


No 20 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.38  E-value=0.00021  Score=43.89  Aligned_cols=28  Identities=39%  Similarity=0.814  Sum_probs=22.7

Q ss_pred             CCCCCCEeeeCCCceeeeeCCCeeecCCC
Q psy9427          21 SCGVNAVCVVNNHEARCFCPPGSYVGDPN   49 (324)
Q Consensus        21 ~C~~~~~C~~~~~~~~C~C~~G~y~g~~~   49 (324)
                      .|..+|+|+++.++|.|+|++| |.|+..
T Consensus         7 ~C~~nA~C~~~~~~~~C~C~~G-y~GdG~   34 (36)
T PF12947_consen    7 GCHPNATCTNTGGSYTCTCKPG-YEGDGF   34 (36)
T ss_dssp             GS-TTCEEEE-TTSEEEEE-CE-EECCST
T ss_pred             CCCCCcEeecCCCCEEeECCCC-CccCCc
Confidence            5889999999999999999999 999754


No 21 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.25  E-value=0.00043  Score=42.85  Aligned_cols=31  Identities=35%  Similarity=0.866  Sum_probs=27.5

Q ss_pred             ccCCCC-CCCCCCCEeeeCCCceeeeeCCCeee
Q psy9427          14 WNSCDG-VSCGVNAVCVVNNHEARCFCPPGSYV   45 (324)
Q Consensus        14 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~   45 (324)
                      +++|.. .+|.++++|++..++|.|.|++| |+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g-~~   33 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPG-YT   33 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCC-Cc
Confidence            466776 78999999999999999999999 87


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.20  E-value=0.00024  Score=43.62  Aligned_cols=28  Identities=29%  Similarity=0.613  Sum_probs=23.0

Q ss_pred             CCCCCCCEEeeCCCCcEeeCCCCCcCCC
Q psy9427          83 GACGTNAVCIPEDHRPVCHCPVGTHPSP  110 (324)
Q Consensus        83 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~  110 (324)
                      ..|..+|+|+++.++|.|+|++||.|++
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            4688899999999999999999999987


No 23 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.16  E-value=0.00038  Score=38.47  Aligned_cols=24  Identities=29%  Similarity=0.602  Sum_probs=20.4

Q ss_pred             CcEeeCCCCCcCCCCCCCcccccCCC
Q psy9427          97 RPVCHCPVGTHPSPSPDVHCARESNL  122 (324)
Q Consensus        97 ~~~C~C~~G~~g~~~~~~~C~~~~~~  122 (324)
                      +|.|.|++||+.++.. +.|++| ||
T Consensus         1 sy~C~C~~Gy~l~~d~-~~C~DI-dE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDG-RSCEDI-DE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCC-CccccC-CC
Confidence            6899999999987765 899998 75


No 24 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.95  E-value=0.0013  Score=40.11  Aligned_cols=32  Identities=28%  Similarity=0.674  Sum_probs=28.0

Q ss_pred             cCCCCC-CCCCCCCEEeeCCCCcEeeCCCCCcC
Q psy9427          77 IPACDS-GACGTNAVCIPEDHRPVCHCPVGTHP  108 (324)
Q Consensus        77 i~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g  108 (324)
                      +++|.. .+|.+++.|++..+.|.|.|++||+|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcC
Confidence            466765 68888899999999999999999998


No 25 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.83  E-value=0.0015  Score=39.89  Aligned_cols=33  Identities=36%  Similarity=0.888  Sum_probs=28.3

Q ss_pred             ccCCCC-CCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427          14 WNSCDG-VSCGVNAVCVVNNHEARCFCPPGSYVGD   47 (324)
Q Consensus        14 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~g~   47 (324)
                      +++|.. .+|.+++.|++..+.|+|.|++| |.|.
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g-~~g~   35 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPG-YTGR   35 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCC-CcCC
Confidence            456766 68988899999999999999999 8873


No 26 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.67  E-value=0.0025  Score=38.21  Aligned_cols=28  Identities=43%  Similarity=1.032  Sum_probs=25.2

Q ss_pred             CCCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427          19 GVSCGVNAVCVVNNHEARCFCPPGSYVGD   47 (324)
Q Consensus        19 ~~~C~~~~~C~~~~~~~~C~C~~G~y~g~   47 (324)
                      ..+|.++++|++..+.|.|.|++| |.+.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g-~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPG-YTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCC-Cccc
Confidence            567988999999999999999999 8875


No 27 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.48  E-value=0.0048  Score=36.90  Aligned_cols=28  Identities=25%  Similarity=0.625  Sum_probs=25.0

Q ss_pred             CCCCCCCCEEeeCCCCcEeeCCCCCcCC
Q psy9427          82 SGACGTNAVCIPEDHRPVCHCPVGTHPS  109 (324)
Q Consensus        82 ~~~C~~~~~C~~~~~~~~C~C~~G~~g~  109 (324)
                      ..+|.++++|++..+.|.|.|+.||.|+
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            4678888999999999999999999983


No 28 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.20  E-value=0.0061  Score=36.68  Aligned_cols=29  Identities=38%  Similarity=1.014  Sum_probs=24.8

Q ss_pred             CCC-CCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427          17 CDG-VSCGVNAVCVVNNHEARCFCPPGSYVGD   47 (324)
Q Consensus        17 C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~g~   47 (324)
                      |.. .+|.++ +|++..++|+|+|++| |.++
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g-~~g~   31 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPG-YTGD   31 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCC-CccC
Confidence            445 578888 9999999999999999 8873


No 29 
>KOG1226|consensus
Probab=96.17  E-value=0.041  Score=55.48  Aligned_cols=142  Identities=26%  Similarity=0.638  Sum_probs=78.0

Q ss_pred             EeeCCCCCcCCCCCCCccccc---------CCCCCCCC---CCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCC-C
Q psy9427          99 VCHCPVGTHPSPSPDVHCARE---------SNLCRDSN---PCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVP-Q  165 (324)
Q Consensus        99 ~C~C~~G~~g~~~~~~~C~~~---------~~~C~~~~---~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~-~  165 (324)
                      .|.|.+||.|     ..|+-.         .+.|+..+   +|...+.|.=       -+|+|.+...+.-++..|.- .
T Consensus       479 ~C~C~~G~~G-----~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-------GqC~C~~~~~~~i~G~fCECDn  546 (783)
T KOG1226|consen  479 QCRCDEGWLG-----KKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-------GQCVCHKPDNGKIYGKFCECDN  546 (783)
T ss_pred             ceecCCCCCC-----CcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-------CceEecCCCCCceeeeeeeccC
Confidence            5799999999     444321         13443222   5777777754       36788777664333333321 1


Q ss_pred             CCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCCccccCCCCCC--CCccCCCC
Q psy9427         166 GQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSAVGGEARVGCFREMSACAG--DQECLAGE  243 (324)
Q Consensus       166 ~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~~~~~C~~~~~~C~~--~~~C~~~~  243 (324)
                      -.|....+                  ..|+.+++|.=.    +|+|.+||+|.....    ....+.|.+  +..|+ +.
T Consensus       547 fsC~r~~g------------------~lC~g~G~C~CG----~CvC~~GwtG~~C~C----~~std~C~~~~G~iCS-Gr  599 (783)
T KOG1226|consen  547 FSCERHKG------------------VLCGGHGRCECG----RCVCNPGWTGSACNC----PLSTDTCESSDGQICS-GR  599 (783)
T ss_pred             cccccccC------------------cccCCCCeEeCC----cEEcCCCCccCCCCC----CCCCccccCCCCceeC-CC
Confidence            11211111                  157778888653    599999999854321    112444532  35666 33


Q ss_pred             CcCCCCcccCCCCCCCCCCCCCcccC-CcccCCCCCCCCCC
Q psy9427         244 EKCIGGRCRVACLSHSDCSEGERCIS-SLCQSPCTSHSQCP  283 (324)
Q Consensus       244 ~~C~~~~C~~~C~~~~~C~~~~~C~~-~~C~~~C~~~~~C~  283 (324)
                      ..|+=|+|+  |.+..  ..|..|.. ..|..+|..+-.|.
T Consensus       600 G~C~Cg~C~--C~~~~--~sG~~CE~cptc~~~C~~~~~Cv  636 (783)
T KOG1226|consen  600 GTCECGRCK--CTDPP--YSGEFCEKCPTCPDPCAENKSCV  636 (783)
T ss_pred             ceeeCCceE--cCCCC--cCcchhhcCCCCCCcccccccch
Confidence            356666553  33332  45666644 56666777766665


No 30 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.06  E-value=0.0092  Score=35.88  Aligned_cols=28  Identities=36%  Similarity=0.876  Sum_probs=24.0

Q ss_pred             CCC-CCCCCCCEEeeCCCCcEeeCCCCCcC
Q psy9427          80 CDS-GACGTNAVCIPEDHRPVCHCPVGTHP  108 (324)
Q Consensus        80 C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g  108 (324)
                      |.. .+|.++ +|+++.++|.|.|++||.|
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence            444 578777 9999999999999999998


No 31 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.28  E-value=0.013  Score=35.90  Aligned_cols=26  Identities=27%  Similarity=0.390  Sum_probs=20.7

Q ss_pred             CCeeecCCCCCCCeeeeCCCCCccCCCCC
Q psy9427         132 TARCEDTGATNGKVVCSCPELSVGDPYKT  160 (324)
Q Consensus       132 ~~~C~~~~~~~g~~~C~C~~G~~g~~~~~  160 (324)
                      ...|++   ++++|+|.|++||++....+
T Consensus         9 ~h~C~~---~~g~~~C~C~~Gy~L~~D~~   34 (36)
T PF14670_consen    9 SHICVN---TPGSYRCSCPPGYKLAEDGR   34 (36)
T ss_dssp             SSEEEE---ETTSEEEE-STTEEE-TTSS
T ss_pred             CCCCcc---CCCceEeECCCCCEECcCCC
Confidence            578999   99999999999999877543


No 32 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.15  E-value=0.014  Score=35.68  Aligned_cols=23  Identities=22%  Similarity=0.598  Sum_probs=19.1

Q ss_pred             CEEeeCCCCcEeeCCCCCcCCCC
Q psy9427          89 AVCIPEDHRPVCHCPVGTHPSPS  111 (324)
Q Consensus        89 ~~C~~~~~~~~C~C~~G~~g~~~  111 (324)
                      ..|++++++|+|.|++||++.+.
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L~~D   32 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKLAED   32 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE-TT
T ss_pred             CCCccCCCceEeECCCCCEECcC
Confidence            68999999999999999998654


No 33 
>KOG1226|consensus
Probab=95.11  E-value=0.43  Score=48.41  Aligned_cols=117  Identities=23%  Similarity=0.533  Sum_probs=63.6

Q ss_pred             CCCCCCEEeeCCCCcEeeCCCCCcCCCCCCCcccccCCCCC--CCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCC
Q psy9427          84 ACGTNAVCIPEDHRPVCHCPVGTHPSPSPDVHCARESNLCR--DSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTG  161 (324)
Q Consensus        84 ~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~--~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~  161 (324)
                      +|...+.|.=    ..|.|.+...+.-++ ..|+-+--.|.  ....|..+++|.=       -+|+|.+||+|..++  
T Consensus       515 vCSgrG~C~C----GqC~C~~~~~~~i~G-~fCECDnfsC~r~~g~lC~g~G~C~C-------G~CvC~~GwtG~~C~--  580 (783)
T KOG1226|consen  515 VCSGRGDCVC----GQCVCHKPDNGKIYG-KFCECDNFSCERHKGVLCGGHGRCEC-------GRCVCNPGWTGSACN--  580 (783)
T ss_pred             CcCCCCcEeC----CceEecCCCCCceee-eeeeccCcccccccCcccCCCCeEeC-------CcEEcCCCCccCCCC--
Confidence            5666666652    147777766631111 66765312231  1225888888865       389999999999875  


Q ss_pred             CC-CCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCC-CccCCCCCCCCccccCCCCCCCCcc
Q psy9427         162 CV-PQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDG-FSAVGGEARVGCFREMSACAGDQEC  239 (324)
Q Consensus       162 C~-~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~G-y~g~~~~~~~~C~~~~~~C~~~~~C  239 (324)
                      |. +.+.|...++.                  .|...++|.=.    +|.|-+. |.|...+.-..|   .+.|....+|
T Consensus       581 C~~std~C~~~~G~------------------iCSGrG~C~Cg----~C~C~~~~~sG~~CE~cptc---~~~C~~~~~C  635 (783)
T KOG1226|consen  581 CPLSTDTCESSDGQ------------------ICSGRGTCECG----RCKCTDPPYSGEFCEKCPTC---PDPCAENKSC  635 (783)
T ss_pred             CCCCCccccCCCCc------------------eeCCCceeeCC----ceEcCCCCcCcchhhcCCCC---CCcccccccc
Confidence            43 33444322211                  34445555432    4777666 876433333333   3445555544


No 34 
>KOG0994|consensus
Probab=92.87  E-value=0.48  Score=50.11  Aligned_cols=63  Identities=22%  Similarity=0.439  Sum_probs=36.4

Q ss_pred             EEeeCCCCcEe-eCCCCCcCCCCCCCcccccCCCCCCCCCCCCC--------CeeecCCCCCCCeeeeCCCCCccCCCC
Q psy9427          90 VCIPEDHRPVC-HCPVGTHPSPSPDVHCARESNLCRDSNPCHPT--------ARCEDTGATNGKVVCSCPELSVGDPYK  159 (324)
Q Consensus        90 ~C~~~~~~~~C-~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~--------~~C~~~~~~~g~~~C~C~~G~~g~~~~  159 (324)
                      .|.....++.| +|..||.|+|.-   =..  ..| .+-||..+        ..|.- +.......|.|.+||+|..+.
T Consensus       877 ~CqD~T~G~~CdrCl~GyyGdP~l---g~g--~~C-rPCpCP~gp~Sg~~~A~sC~~-d~~t~~ivC~C~~GY~G~RCe  948 (1758)
T KOG0994|consen  877 DCQDSTTGHSCDRCLDGYYGDPRL---GSG--IGC-RPCPCPDGPASGRQHADSCYL-DTRTQQIVCHCQEGYSGSRCE  948 (1758)
T ss_pred             cccccccccchhhhhccccCCccc---CCC--CCC-CCCCCCCCCccchhccccccc-cccccceeeecccCccccchh
Confidence            34555666777 799999998852   111  223 22233221        12221 113455789999999998863


No 35 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=92.86  E-value=0.17  Score=30.02  Aligned_cols=25  Identities=36%  Similarity=0.852  Sum_probs=20.7

Q ss_pred             CCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427          20 VSCGVNAVCVVNNHEARCFCPPGSYVGD   47 (324)
Q Consensus        20 ~~C~~~~~C~~~~~~~~C~C~~G~y~g~   47 (324)
                      ..|.++++|+...+  +|+|.+| |+|+
T Consensus         6 ~~C~~~G~C~~~~g--~C~C~~g-~~G~   30 (32)
T PF07974_consen    6 NICSGHGTCVSPCG--RCVCDSG-YTGP   30 (32)
T ss_pred             CccCCCCEEeCCCC--EEECCCC-CcCC
Confidence            35889999987644  8999999 8885


No 36 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=92.26  E-value=0.1  Score=43.94  Aligned_cols=106  Identities=28%  Similarity=0.710  Sum_probs=65.0

Q ss_pred             CCCCCCCEeeeCC-----CceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCCCCCCCCCEEeeC
Q psy9427          20 VSCGVNAVCVVNN-----HEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDSGACGTNAVCIPE   94 (324)
Q Consensus        20 ~~C~~~~~C~~~~-----~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~~~C~~~~~C~~~   94 (324)
                      .+|+.-|.|++..     ..|.|.|.+| |..... .|.+                       +.|....|+ .+.|+..
T Consensus        50 K~Cgdya~C~~~~~~~~~~~~~C~C~~g-Y~~~~~-vCvp-----------------------~~C~~~~Cg-~GKCI~d  103 (197)
T PF06247_consen   50 KPCGDYAKCINQANKGEERAYKCDCING-YILKQG-VCVP-----------------------NKCNNKDCG-SGKCILD  103 (197)
T ss_dssp             SEEETTEEEEE-SSTTSSTSEEEEE-TT-EEESSS-SEEE-----------------------GGGSS---T-TEEEEEE
T ss_pred             ccccchhhhhcCCCcccceeEEEecccC-ceeeCC-eEch-----------------------hhcCceecC-CCeEEec
Confidence            4799999998654     6799999999 876432 3444                       444445676 5788843


Q ss_pred             ---CCCcEeeCCCCCcCCCCCCCccccc-CCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCC
Q psy9427          95 ---DHRPVCHCPVGTHPSPSPDVHCARE-SNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPY  158 (324)
Q Consensus        95 ---~~~~~C~C~~G~~g~~~~~~~C~~~-~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~  158 (324)
                         +....|+|.-|+..+..  ..|... ..+|  .-.|..+..|..   +.+-|+|.+..||.++..
T Consensus       104 ~~~~~~~~CSC~IGkV~~dn--~kCtk~G~T~C--~LKCk~nE~CK~---~~~~Y~C~~~~~~~~~~~  164 (197)
T PF06247_consen  104 PDNPNNPTCSCNIGKVPDDN--KKCTKTGETKC--SLKCKENEECKL---VDGYYKCVCKEGFPGDGE  164 (197)
T ss_dssp             EGGGSEEEEEE-TEEETTTT--TESEEEE----------TTTEEEEE---ETTEEEEEE-TT-EEETT
T ss_pred             CCCCCCceeEeeeceEeccC--CcccCCCccce--eeecCCCcceee---eCcEEEeecCCCCCCCCC
Confidence               23459999999983221  566543 1344  224777889999   888899999999987763


No 37 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=91.11  E-value=0.25  Score=29.34  Aligned_cols=23  Identities=22%  Similarity=0.717  Sum_probs=19.3

Q ss_pred             CCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         193 ACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       193 ~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                      .|..+++|+..  ..+|.|.+||+|
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G   29 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTG   29 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcC
Confidence            47789999976  458999999997


No 38 
>PF04706 Dickkopf_N:  Dickkopf N-terminal cysteine-rich region;  InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=91.03  E-value=0.41  Score=31.82  Aligned_cols=21  Identities=24%  Similarity=0.702  Sum_probs=18.1

Q ss_pred             cccccCCCCCCCCcccCCccc
Q psy9427         295 LGCRQNEECSGQMICAGSKCT  315 (324)
Q Consensus       295 ~~C~~~~~C~~~~~C~~~~C~  315 (324)
                      ..|+.+++|-.++.|++|+|+
T Consensus        31 ~rC~Rd~~CC~g~~CvnG~C~   51 (52)
T PF04706_consen   31 KRCTRDAMCCPGNLCVNGVCT   51 (52)
T ss_pred             CCCCCCcccCCCCeeeCCEec
Confidence            468899999999999999886


No 39 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=91.02  E-value=0.22  Score=30.47  Aligned_cols=30  Identities=30%  Similarity=0.604  Sum_probs=20.9

Q ss_pred             CCCCCCCCCCEeeeCC-CceeeeeCCCeeecC
Q psy9427          17 CDGVSCGVNAVCVVNN-HEARCFCPPGSYVGD   47 (324)
Q Consensus        17 C~~~~C~~~~~C~~~~-~~~~C~C~~G~y~g~   47 (324)
                      |...+|..||.|++.. |++.|.|..| |..+
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllg-yk~~   32 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLG-YKKV   32 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TT-EEEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCC-cccc
Confidence            4556789999999765 9999999999 8763


No 40 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=90.20  E-value=0.27  Score=22.93  Aligned_cols=11  Identities=55%  Similarity=1.573  Sum_probs=8.6

Q ss_pred             eeeeCCCeeecC
Q psy9427          36 RCFCPPGSYVGD   47 (324)
Q Consensus        36 ~C~C~~G~y~g~   47 (324)
                      .|+|++| |+|.
T Consensus         1 ~C~C~~G-~~G~   11 (13)
T PF12661_consen    1 TCQCPPG-WTGP   11 (13)
T ss_dssp             EEEE-TT-EETT
T ss_pred             CccCcCC-CcCC
Confidence            4899999 9985


No 41 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.25  E-value=0.17  Score=30.97  Aligned_cols=30  Identities=27%  Similarity=0.483  Sum_probs=21.1

Q ss_pred             CCCCCCCCCEEeeCC-CCcEeeCCCCCcCCC
Q psy9427          81 DSGACGTNAVCIPED-HRPVCHCPVGTHPSP  110 (324)
Q Consensus        81 ~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~  110 (324)
                      ....|..+|.|++.. |++.|+|..||...+
T Consensus         3 ~~~~cP~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    3 IDTKCPANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             cCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence            335677889999766 999999999998644


No 42 
>KOG0994|consensus
Probab=88.60  E-value=2  Score=45.77  Aligned_cols=75  Identities=24%  Similarity=0.556  Sum_probs=40.7

Q ss_pred             CCceeeeeCCCeeecCCCCCCcccCCCC---CCcCCCCCceeCCCCcccCCCCCCCCC-CCCE---EeeCCCCcEe-eCC
Q psy9427          32 NHEARCFCPPGSYVGDPNIECRQVPCVH---NHDCSTSHACDLLSHTCIPACDSGACG-TNAV---CIPEDHRPVC-HCP  103 (324)
Q Consensus        32 ~~~~~C~C~~G~y~g~~~~~C~~~~C~~---~~~C~~~~~C~c~~~~Ci~~C~~~~C~-~~~~---C~~~~~~~~C-~C~  103 (324)
                      .....|+|.+| |+|.....|.++--.+   +++|.   .|.|.++  ||.-.+..|. ..+.   |+....+.+| .|+
T Consensus       931 t~~ivC~C~~G-Y~G~RCe~CA~~~fGnP~~GGtCq---~CeC~~N--iD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck 1004 (1758)
T KOG0994|consen  931 TQQIVCHCQEG-YSGSRCEICADNHFGNPSEGGTCQ---KCECSNN--IDLYDPGACDVATGACLKCLYHTEGDHCEHCK 1004 (1758)
T ss_pred             ccceeeecccC-ccccchhhhcccccCCcccCCccc---cccccCC--cCccCCCccchhhchhhhhhhcccccchhhcc
Confidence            34568999999 9998777776631111   33321   0111000  5555555553 1233   4433334455 699


Q ss_pred             CCCcCCCCC
Q psy9427         104 VGTHPSPSP  112 (324)
Q Consensus       104 ~G~~g~~~~  112 (324)
                      .||.|+...
T Consensus      1005 ~Gf~GdA~~ 1013 (1758)
T KOG0994|consen 1005 DGFYGDALR 1013 (1758)
T ss_pred             ccchhHHHH
Confidence            999997653


No 43 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.27  E-value=0.44  Score=41.78  Aligned_cols=36  Identities=25%  Similarity=0.637  Sum_probs=27.7

Q ss_pred             CCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         161 GCVPQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       161 ~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                      .|.+.++|....++                   |.  ..|.++.|+|.|.|++||+.
T Consensus       183 ~C~~~~~C~~~~~~-------------------c~--~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         183 ICVVPDLCATLSHV-------------------CQ--QVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             cCcCchhhcCCCCC-------------------cc--ceEEcCCCCEEeECCCCccC
Confidence            47777778554433                   43  46999999999999999985


No 44 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.13  E-value=1.2  Score=39.07  Aligned_cols=38  Identities=26%  Similarity=0.523  Sum_probs=29.8

Q ss_pred             CcccccCCCCCCCC-CCCCCCeeecCCCCCCCeeeeCCCCCccCC
Q psy9427         114 VHCARESNLCRDSN-PCHPTARCEDTGATNGKVVCSCPELSVGDP  157 (324)
Q Consensus       114 ~~C~~~~~~C~~~~-~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~  157 (324)
                      ..|.+. ++|...+ .|  ...|.+   +.|+|.|.|++||++..
T Consensus       182 ~~C~~~-~~C~~~~~~c--~~~C~~---~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         182 KICVVP-DLCATLSHVC--QQVCIS---TPGSYLCACTEGYALLE  220 (224)
T ss_pred             ccCcCc-hhhcCCCCCc--cceEEc---CCCCEEeECCCCccCCC
Confidence            678776 8884332 45  468999   99999999999998765


No 45 
>PF04706 Dickkopf_N:  Dickkopf N-terminal cysteine-rich region;  InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=84.85  E-value=1.4  Score=29.33  Aligned_cols=15  Identities=27%  Similarity=0.738  Sum_probs=6.7

Q ss_pred             CCCCCccCCCCCcCCC
Q psy9427         233 CAGDQECLAGEEKCIG  248 (324)
Q Consensus       233 C~~~~~C~~~~~~C~~  248 (324)
                      |..+.+|. .+..|+.
T Consensus         2 C~~D~dC~-~g~yC~~   16 (52)
T PF04706_consen    2 CSSDEDCG-YGKYCHS   16 (52)
T ss_pred             CcccccCC-CCCCcCC
Confidence            44445554 3444443


No 46 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=79.77  E-value=1.8  Score=33.39  Aligned_cols=24  Identities=42%  Similarity=1.186  Sum_probs=19.7

Q ss_pred             CCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427         193 ACGPNALCRAVGKRPVCACPDGFSA  217 (324)
Q Consensus       193 ~C~~~~~C~~~~g~y~C~C~~Gy~g  217 (324)
                      .|+..+.|.. .....|.|.+||.+
T Consensus        85 ~CG~~g~C~~-~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   85 FCGPNGICNS-NNSPKCSCLPGFEP  108 (110)
T ss_pred             ccCCccEeCC-CCCCceECCCCcCC
Confidence            6899999954 45678999999975


No 47 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=78.38  E-value=2  Score=33.18  Aligned_cols=31  Identities=35%  Similarity=0.856  Sum_probs=23.9

Q ss_pred             ccCCCC-CCCCCCCEeeeCCCceeeeeCCCeeec
Q psy9427          14 WNSCDG-VSCGVNAVCVVNNHEARCFCPPGSYVG   46 (324)
Q Consensus        14 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~g   46 (324)
                      .|.|+. ..|++++.|.. ..+..|.|++| |..
T Consensus        77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~G-F~P  108 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPG-FEP  108 (110)
T ss_pred             ccCCCCccccCCccEeCC-CCCCceECCCC-cCC
Confidence            356874 68999999954 45678999999 864


No 48 
>KOG1836|consensus
Probab=71.15  E-value=7  Score=44.18  Aligned_cols=51  Identities=27%  Similarity=0.486  Sum_probs=33.1

Q ss_pred             eCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeee-CCCCCccCCC
Q psy9427         101 HCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCS-CPELSVGDPY  158 (324)
Q Consensus       101 ~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~-C~~G~~g~~~  158 (324)
                      +|..||.|.+..+.    . +.| .+-+|..++.|.... ......|+ |++||+|..+
T Consensus       760 ~C~~GfYg~~~~~~----~-~dC-~~C~Cp~~~~~~~~~-~~~~~iCk~Cp~gytG~rC  811 (1705)
T KOG1836|consen  760 QCVDGFYGLPDLGT----S-GDC-QPCPCPNGGACGQTP-EILEVVCKNCPPGYTGLRC  811 (1705)
T ss_pred             hhcCCCCCccccCC----C-CCC-ccCCCCCChhhcCcC-cccceecCCCCCCCccccc
Confidence            79999998765311    1 225 444777777776510 13557788 9999999875


No 49 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=68.30  E-value=3.6  Score=26.96  Aligned_cols=18  Identities=33%  Similarity=1.169  Sum_probs=8.9

Q ss_pred             cccCCCCCCCCcccCCcc
Q psy9427         297 CRQNEECSGQMICAGSKC  314 (324)
Q Consensus       297 C~~~~~C~~~~~C~~~~C  314 (324)
                      |..+.+|..+..|++++|
T Consensus        22 C~~~~qC~~~s~C~~g~C   39 (52)
T PF01683_consen   22 CESDEQCIGGSVCVNGRC   39 (52)
T ss_pred             CCCcCCCCCcCEEcCCEe
Confidence            444444544555555444


No 50 
>PF00095 WAP:  WAP-type (Whey Acidic Protein) 'four-disulfide core';  InterPro: IPR008197 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  Whey acidic protein (WAP) is a major component of the whey fraction of milk, which contains a significant number of different proteins. WAP proteins share limited sequence identity, except for their conserved cysteine-rich regions, known as 4-disulphide core (4-DSC) domains, and the positional conservation of specific residues []. Several non-milk proteins also contain 4-DSC patterns, such as certain serine protease inhibitors. WAP itself appears to have a protease-inhibitor function, as seen with its inhibitory effect on the progression of cancer cells [].; GO: 0030414 peptidase inhibitor activity, 0005576 extracellular region; PDB: 1FLE_I 2REL_A 2Z7F_I 3NGG_B.
Probab=60.65  E-value=3.2  Score=26.11  Aligned_cols=20  Identities=30%  Similarity=0.944  Sum_probs=14.1

Q ss_pred             CccccccccCCCCCCCCccc
Q psy9427         291 GTCELGCRQNEECSGQMICA  310 (324)
Q Consensus       291 ~~c~~~C~~~~~C~~~~~C~  310 (324)
                      ..|...|.+|.||+..+.|=
T Consensus        13 ~~c~~~C~~D~dC~~~~KCC   32 (43)
T PF00095_consen   13 GDCRDECSSDSDCPGGQKCC   32 (43)
T ss_dssp             CEESBS-SSGGGSSTT-EEE
T ss_pred             CcccccCcCccccCCcCeEC
Confidence            44668899999998888883


No 51 
>smart00051 DSL delta serrate ligand.
Probab=57.76  E-value=17  Score=25.16  Aligned_cols=47  Identities=19%  Similarity=0.333  Sum_probs=28.7

Q ss_pred             CcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCC
Q psy9427          97 RPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDP  157 (324)
Q Consensus        97 ~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~  157 (324)
                      .+.-.|.++|.|     ..|..   .|...+....+.+|..    .  -.++|.+||+|..
T Consensus        16 ~~rv~C~~~~yG-----~~C~~---~C~~~~d~~~~~~Cd~----~--G~~~C~~Gw~G~~   62 (63)
T smart00051       16 QIRVTCDENYYG-----EGCNK---FCRPRDDFFGHYTCDE----N--GNKGCLEGWMGPY   62 (63)
T ss_pred             EEEeeCCCCCcC-----CccCC---EeCcCccccCCccCCc----C--CCEecCCCCcCCC
Confidence            345578999999     55542   3422223445566633    2  3578999999865


No 52 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=55.64  E-value=14  Score=22.06  Aligned_cols=21  Identities=43%  Similarity=1.012  Sum_probs=13.8

Q ss_pred             EeeeCCCceeeeeCCCeeecCCC
Q psy9427          27 VCVVNNHEARCFCPPGSYVGDPN   49 (324)
Q Consensus        27 ~C~~~~~~~~C~C~~G~y~g~~~   49 (324)
                      .|... ..+.|.||+| |..+..
T Consensus        11 ~CDpn-~~~~C~CPeG-yIlde~   31 (34)
T PF09064_consen   11 DCDPN-SPGQCFCPEG-YILDEG   31 (34)
T ss_pred             ccCCC-CCCceeCCCc-eEecCC
Confidence            56532 2238999999 876543


No 53 
>PF04885 Stig1:  Stigma-specific protein, Stig1;  InterPro: IPR006969 This family represents the Stig1 cysteine rich plant protein.The tobacco stigma-specific gene, STIG1 is developmentally regulated and expressed specifically in the stigmatic secretory zone. Pistils of transgenic STIG1-barnase tobacco plants undergo normal development, but lack the stigmatic secretory zone and are female sterile. Pollen grains are unable to penetrate the surface of the ablated pistils. Application of stigmatic exudate from wild-type pistils to the ablated surface increases the efficiency of pollen tube germination and growth and restores the capacity of pollen tubes to penetrate the style []. The function of STIG1 is unknown.
Probab=55.13  E-value=32  Score=27.75  Aligned_cols=34  Identities=18%  Similarity=0.505  Sum_probs=18.6

Q ss_pred             CCCcCCCCcccCCCCC-------CCCCCCCCcccCCcccCC
Q psy9427         242 GEEKCIGGRCRVACLS-------HSDCSEGERCISSLCQSP  275 (324)
Q Consensus       242 ~~~~C~~~~C~~~C~~-------~~~C~~~~~C~~~~C~~~  275 (324)
                      .+..|-+++|+++=.+       +..|..++.|-+++|+++
T Consensus        68 ~~~~CC~~~Cvdv~~d~~nCG~Cg~~C~~g~~cC~G~Cvd~  108 (136)
T PF04885_consen   68 PGPTCCNNKCVDVSSDRNNCGACGNKCPYGQTCCGGQCVDL  108 (136)
T ss_pred             CCCcccCCcCCccCCCccccHhhcCCCCCCceecCCEeECC
Confidence            3445656666553221       244566667777777653


No 54 
>PHA02887 EGF-like protein; Provisional
Probab=53.51  E-value=13  Score=29.01  Aligned_cols=24  Identities=25%  Similarity=0.699  Sum_probs=18.4

Q ss_pred             CCCCCCeeeeC--CCCceeeCCCCCcc
Q psy9427         193 ACGPNALCRAV--GKRPVCACPDGFSA  217 (324)
Q Consensus       193 ~C~~~~~C~~~--~g~y~C~C~~Gy~g  217 (324)
                      -|- +|+|.-.  ...+.|.|+.||+|
T Consensus        93 YCi-HG~C~yI~dL~epsCrC~~GYtG  118 (126)
T PHA02887         93 FCI-NGECMNIIDLDEKFCICNKGYTG  118 (126)
T ss_pred             Eee-CCEEEccccCCCceeECCCCccc
Confidence            465 4689744  45689999999998


No 55 
>PHA02887 EGF-like protein; Provisional
Probab=53.14  E-value=15  Score=28.63  Aligned_cols=31  Identities=23%  Similarity=0.578  Sum_probs=23.0

Q ss_pred             CCCCCCCEEee--CCCCcEeeCCCCCcCCCCCCCccccc
Q psy9427          83 GACGTNAVCIP--EDHRPVCHCPVGTHPSPSPDVHCARE  119 (324)
Q Consensus        83 ~~C~~~~~C~~--~~~~~~C~C~~G~~g~~~~~~~C~~~  119 (324)
                      +-|. +|+|.-  ....+.|.|.+||+|     .+|+.+
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG-----~RCE~v  124 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTG-----IRCDEV  124 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCccc-----CCCCcc
Confidence            3455 468874  345689999999999     788754


No 56 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=48.99  E-value=21  Score=23.18  Aligned_cols=25  Identities=24%  Similarity=0.558  Sum_probs=17.2

Q ss_pred             EeeeCCCceeeeeCCCeeecCCCCCCcc
Q psy9427          27 VCVVNNHEARCFCPPGSYVGDPNIECRQ   54 (324)
Q Consensus        27 ~C~~~~~~~~C~C~~G~y~g~~~~~C~~   54 (324)
                      .|....|  .|.|++| |+|...+.|.+
T Consensus        13 ~C~~~~G--~C~C~~~-~~G~~C~~C~~   37 (50)
T cd00055          13 QCDPGTG--QCECKPN-TTGRRCDRCAP   37 (50)
T ss_pred             cccCCCC--EEeCCCc-CCCCCCCCCCC
Confidence            3655445  7999999 89976655543


No 57 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=48.23  E-value=15  Score=23.68  Aligned_cols=26  Identities=27%  Similarity=0.744  Sum_probs=17.7

Q ss_pred             CEeeeCCCceeeeeCCCeeecCCCCCCcc
Q psy9427          26 AVCVVNNHEARCFCPPGSYVGDPNIECRQ   54 (324)
Q Consensus        26 ~~C~~~~~~~~C~C~~G~y~g~~~~~C~~   54 (324)
                      +.|....+  .|.|+++ |+|...+.|.+
T Consensus        11 ~~C~~~~G--~C~C~~~-~~G~~C~~C~~   36 (49)
T PF00053_consen   11 QTCDPSTG--QCVCKPG-TTGPRCDQCKP   36 (49)
T ss_dssp             SSEEETCE--EESBSTT-EESTTS-EE-T
T ss_pred             CcccCCCC--EEecccc-ccCCcCcCCCC
Confidence            36766444  8999999 99976665554


No 58 
>smart00217 WAP Four-disulfide core domains.
Probab=48.19  E-value=25  Score=22.66  Aligned_cols=24  Identities=25%  Similarity=0.732  Sum_probs=18.1

Q ss_pred             cCCCCccccccccCCCCCCCCccc
Q psy9427         287 ACQAGTCELGCRQNEECSGQMICA  310 (324)
Q Consensus       287 ~C~~~~c~~~C~~~~~C~~~~~C~  310 (324)
                      .|..+.|...|.+|.+|+..+.|=
T Consensus        12 ~c~~~~c~~~C~~D~~C~~~~KCC   35 (47)
T smart00217       12 SCPLGNPPNKCSSDSDCPGVKKCC   35 (47)
T ss_pred             ccccCCCCccCcCccccCCCCCcC
Confidence            455667777888888888877773


No 59 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=42.12  E-value=22  Score=28.26  Aligned_cols=24  Identities=21%  Similarity=0.515  Sum_probs=18.8

Q ss_pred             CCCCCCeeeeC--CCCceeeCCCCCcc
Q psy9427         193 ACGPNALCRAV--GKRPVCACPDGFSA  217 (324)
Q Consensus       193 ~C~~~~~C~~~--~g~y~C~C~~Gy~g  217 (324)
                      -|.+| +|.-.  ...+.|.|..||+|
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtG   77 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTG   77 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCccc
Confidence            46665 89744  47799999999998


No 60 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=41.83  E-value=24  Score=28.05  Aligned_cols=31  Identities=26%  Similarity=0.579  Sum_probs=23.5

Q ss_pred             CCCCCCCEEeeC--CCCcEeeCCCCCcCCCCCCCccccc
Q psy9427          83 GACGTNAVCIPE--DHRPVCHCPVGTHPSPSPDVHCARE  119 (324)
Q Consensus        83 ~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~~~C~~~  119 (324)
                      +-|.+ +.|.-.  ...+.|.|..||+|     .+|+..
T Consensus        51 ~YClH-G~C~yI~dl~~~~CrC~~GYtG-----eRCEh~   83 (139)
T PHA03099         51 GYCLH-GDCIHARDIDGMYCRCSHGYTG-----IRCQHV   83 (139)
T ss_pred             CEeEC-CEEEeeccCCCceeECCCCccc-----ccccce
Confidence            34655 488743  47899999999999     888765


No 61 
>KOG3516|consensus
Probab=36.40  E-value=26  Score=37.95  Aligned_cols=37  Identities=19%  Similarity=0.467  Sum_probs=32.5

Q ss_pred             cCCCCCCCCCCCCEEeeCCCCcEeeCC-CCCcCCCCCCCcccc
Q psy9427          77 IPACDSGACGTNAVCIPEDHRPVCHCP-VGTHPSPSPDVHCAR  118 (324)
Q Consensus        77 i~~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~~~~~~C~~  118 (324)
                      ++.|.+++|.+++.|......|+|.|. .||.|     .+|..
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~G-----atCHt  582 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKG-----ATCHT  582 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEecccccccc-----ccccC
Confidence            788999999999999998899999998 89998     56643


No 62 
>KOG3516|consensus
Probab=30.92  E-value=34  Score=37.08  Aligned_cols=33  Identities=24%  Similarity=0.573  Sum_probs=29.7

Q ss_pred             ccCCCCCCCCCCCEeeeCCCceeeeeC-CCeeecC
Q psy9427          14 WNSCDGVSCGVNAVCVVNNHEARCFCP-PGSYVGD   47 (324)
Q Consensus        14 ~~~C~~~~C~~~~~C~~~~~~~~C~C~-~G~y~g~   47 (324)
                      .|.|..++|..++.|.-....|.|.|. .| |.|.
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TG-Y~Ga  578 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTG-YKGA  578 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEeccccc-cccc
Confidence            567999999999999999999999999 89 8774


No 63 
>PF04885 Stig1:  Stigma-specific protein, Stig1;  InterPro: IPR006969 This family represents the Stig1 cysteine rich plant protein.The tobacco stigma-specific gene, STIG1 is developmentally regulated and expressed specifically in the stigmatic secretory zone. Pistils of transgenic STIG1-barnase tobacco plants undergo normal development, but lack the stigmatic secretory zone and are female sterile. Pollen grains are unable to penetrate the surface of the ablated pistils. Application of stigmatic exudate from wild-type pistils to the ablated surface increases the efficiency of pollen tube germination and growth and restores the capacity of pollen tubes to penetrate the style []. The function of STIG1 is unknown.
Probab=30.37  E-value=1.1e+02  Score=24.80  Aligned_cols=54  Identities=30%  Similarity=0.749  Sum_probs=32.0

Q ss_pred             CCCCcccCCcccCC------CC-CCCCCCCCCcCCCCcccc------cccc-CCCCCCCCcccCCccc
Q psy9427         262 SEGERCISSLCQSP------CT-SHSQCPTGQACQAGTCEL------GCRQ-NEECSGQMICAGSKCT  315 (324)
Q Consensus       262 ~~~~~C~~~~C~~~------C~-~~~~C~~~~~C~~~~c~~------~C~~-~~~C~~~~~C~~~~C~  315 (324)
                      ..+..|-++.|++.      |. =+..|+.++.|-.|.|..      +|-. +..|+.++.|+.|.|.
T Consensus        67 ~~~~~CC~~~Cvdv~~d~~nCG~Cg~~C~~g~~cC~G~Cvd~~~d~~~CG~Cg~~C~~G~~C~~G~C~  134 (136)
T PF04885_consen   67 SPGPTCCNNKCVDVSSDRNNCGACGNKCPYGQTCCGGQCVDLNSDPRHCGACGNKCPPGQKCVYGMCG  134 (136)
T ss_pred             CCCCcccCCcCCccCCCccccHhhcCCCCCCceecCCEeECCCCCccccCCCCCcCCCcCCcCCeECC
Confidence            44556666666642      10 123566677777776642      3432 4578888888888774


No 64 
>cd00199 WAP whey acidic protein-type four-disulfide core domains. Members of the family include whey acidic protein, elafin (elastase-specific inhibitor), caltrin-like protein (a calcium transport inhibitor) and other extracellular proteinase inhibitors. A group of proteins containing 8 characteristically-spaced cysteine residuesforming disulphide bonds, have been termed '4-disulphide core' proteins. Protease inhibition occurs by insertion of the inhibitory loop into the active site pocket and interference with the catalytic residues of the protease.
Probab=29.60  E-value=64  Score=21.94  Aligned_cols=21  Identities=24%  Similarity=0.662  Sum_probs=15.4

Q ss_pred             CCccccccccCCCCCCCCccc
Q psy9427         290 AGTCELGCRQNEECSGQMICA  310 (324)
Q Consensus       290 ~~~c~~~C~~~~~C~~~~~C~  310 (324)
                      .+.|...|.+|.+|+..+.|=
T Consensus        28 ~~~c~~~C~~D~~C~g~~KCC   48 (60)
T cd00199          28 LGIPPNRCSSDSDCPGDKKCC   48 (60)
T ss_pred             cCcccccCccccccCCCCCcC
Confidence            445667788888888777773


No 65 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=28.49  E-value=57  Score=20.70  Aligned_cols=25  Identities=24%  Similarity=0.612  Sum_probs=16.7

Q ss_pred             EeeeCCCceeeeeCCCeeecCCCCCCcc
Q psy9427          27 VCVVNNHEARCFCPPGSYVGDPNIECRQ   54 (324)
Q Consensus        27 ~C~~~~~~~~C~C~~G~y~g~~~~~C~~   54 (324)
                      .|....|  .|+|+++ |+|...+.|.+
T Consensus        12 ~C~~~~G--~C~C~~~-~~G~~C~~C~~   36 (46)
T smart00180       12 TCDPDTG--QCECKPN-VTGRRCDRCAP   36 (46)
T ss_pred             cccCCCC--EEECCCC-CCCCCCCcCCC
Confidence            5654444  8999999 88865554443


No 66 
>PF08562 Crisp:  Crisp;  InterPro: IPR013871  This entry is found on Crisp proteins which contain IPR001283 from INTERPRO and has been termed the Crisp domain. It is found in the mammalian reproductive tract and the venom of reptiles, and has been shown to regulate ryanodine receptor Ca2+ signalling []. It contains 10 conserved cysteines which are all involved in disulphide bonds and is structurally related to the ion channel inhibitor toxins BgK and ShK []. ; PDB: 3MZ8_B 1XX5_B 2GIZ_A 1XTA_A 1RC9_A 2A05_A 2CQ7_A 2DDA_C 2EPF_A 2DDB_C ....
Probab=27.06  E-value=27  Score=23.49  Aligned_cols=15  Identities=20%  Similarity=0.262  Sum_probs=10.4

Q ss_pred             CCcccCCcccccchh
Q psy9427         306 QMICAGSKCTGWVEE  320 (324)
Q Consensus       306 ~~~C~~~~C~np~~~  320 (324)
                      +..|.+|-|.|||.-
T Consensus         5 P~~CdngLCTNpC~y   19 (55)
T PF08562_consen    5 PSNCDNGLCTNPCPY   19 (55)
T ss_dssp             TTSEETTEES-B-SS
T ss_pred             chhhcCCCcCCCccc
Confidence            457899999999963


No 67 
>PF11403 Yeast_MT:  Yeast metallothionein;  InterPro: IPR022710  Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=23.10  E-value=93  Score=18.47  Aligned_cols=12  Identities=33%  Similarity=1.284  Sum_probs=5.4

Q ss_pred             cccccccCCCCC
Q psy9427         293 CELGCRQNEECS  304 (324)
Q Consensus       293 c~~~C~~~~~C~  304 (324)
                      |..+|.+++.|+
T Consensus        26 cptgcnsddkcp   37 (40)
T PF11403_consen   26 CPTGCNSDDKCP   37 (40)
T ss_dssp             S-TTTTSSTT--
T ss_pred             CCCCCCCCCcCC
Confidence            344566666665


No 68 
>KOG3514|consensus
Probab=23.09  E-value=56  Score=35.24  Aligned_cols=35  Identities=29%  Similarity=0.722  Sum_probs=30.0

Q ss_pred             CCCCCCCCCCCEEeeCCCCcEeeCC-CCCcCCCCCCCcccc
Q psy9427          79 ACDSGACGTNAVCIPEDHRPVCHCP-VGTHPSPSPDVHCAR  118 (324)
Q Consensus        79 ~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~~~~~~C~~  118 (324)
                      .|.++||.+++.|..-.++|.|.|. .||.|     +.|+.
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G-----~~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEG-----RTCER  660 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccC-----ccccc
Confidence            6888999999999999999999995 57777     77764


No 69 
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=21.34  E-value=1.2e+02  Score=22.37  Aligned_cols=27  Identities=30%  Similarity=0.898  Sum_probs=20.8

Q ss_pred             CCCCCCCCCCCEeee-CCCceeeeeCCC
Q psy9427          16 SCDGVSCGVNAVCVV-NNHEARCFCPPG   42 (324)
Q Consensus        16 ~C~~~~C~~~~~C~~-~~~~~~C~C~~G   42 (324)
                      ||+...|+.+.+|+. ..+...|+|.+-
T Consensus         1 pC~~v~C~~G~~C~~d~~~~p~CvC~~~   28 (86)
T cd01328           1 PCENHHCGAGKVCEVDDENTPKCVCIDP   28 (86)
T ss_pred             CCCCcCCCCCCEeeECCCCCeEEecCCc
Confidence            577778999999985 467788887654


Done!