Query         psy4097
Match_columns 129
No_of_seqs    181 out of 1834
Neff          10.4
Searched_HMMs 46136
Date          Fri Aug 16 16:52:38 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4097.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4097hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.6 7.9E-15 1.7E-19  108.4   7.9  111    3-124   746-865 (1289)
  2 KOG1219|consensus               99.6 7.7E-15 1.7E-19  116.5   7.7  103    3-121  3874-3977(4289)
  3 KOG4289|consensus               99.3 2.6E-12 5.7E-17   99.6   6.5  100    9-122  1217-1317(2531)
  4 KOG1219|consensus               99.1 1.1E-10 2.3E-15   94.0   6.0   82   29-123  3859-3940(4289)
  5 KOG1214|consensus               99.0 8.2E-10 1.8E-14   82.6   6.2  106    4-119   705-821 (1289)
  6 KOG4289|consensus               98.6 1.9E-08 4.1E-13   79.1   2.7   56   60-123  1221-1276(2531)
  7 KOG4260|consensus               98.4 1.7E-07 3.6E-12   62.4   3.0   89   13-116   216-304 (350)
  8 PF00008 EGF:  EGF-like domain   98.4 9.5E-08 2.1E-12   43.7   1.1   31   88-119     1-32  (32)
  9 KOG1217|consensus               98.4 1.9E-06 4.2E-11   62.0   8.2  104    3-120   181-306 (487)
 10 smart00179 EGF_CA Calcium-bind  98.4   1E-06 2.3E-11   41.8   4.0   35   86-121     3-39  (39)
 11 KOG1217|consensus               98.3 4.6E-06   1E-10   60.1   7.9  107    5-123   243-355 (487)
 12 PF07645 EGF_CA:  Calcium-bindi  98.2 8.8E-07 1.9E-11   43.1   2.2   32   35-70      3-34  (42)
 13 PF12947 EGF_3:  EGF domain;  I  98.2 6.3E-07 1.4E-11   42.0   1.5   28    2-30      9-36  (36)
 14 PF07645 EGF_CA:  Calcium-bindi  98.1 1.1E-06 2.3E-11   42.8   1.4   30   86-116     3-34  (42)
 15 cd00054 EGF_CA Calcium-binding  98.1 6.1E-06 1.3E-10   38.7   3.8   35   86-121     3-38  (38)
 16 PF12662 cEGF:  Complement Clr-  98.0   6E-06 1.3E-10   35.0   1.9   22   13-36      1-24  (24)
 17 PF12947 EGF_3:  EGF domain;  I  98.0 3.7E-06   8E-11   39.4   1.2   32   39-74      3-34  (36)
 18 KOG1225|consensus               97.9 2.9E-05 6.3E-10   56.8   5.8   78   15-123   266-343 (525)
 19 smart00181 EGF Epidermal growt  97.9 2.8E-05 6.1E-10   36.0   3.6   29   91-121     6-35  (35)
 20 cd00053 EGF Epidermal growth f  97.9 3.4E-05 7.5E-10   35.5   3.7   30   90-120     5-35  (36)
 21 KOG4260|consensus               97.9 1.2E-05 2.5E-10   53.8   2.6   92   11-117   165-269 (350)
 22 smart00179 EGF_CA Calcium-bind  97.5 0.00023   5E-09   33.5   3.6   33   35-72      3-36  (39)
 23 KOG1225|consensus               97.5 0.00065 1.4E-08   50.0   7.1   79   15-124   235-313 (525)
 24 PF14670 FXa_inhibition:  Coagu  97.3 0.00016 3.5E-09   33.8   1.8   25    5-30     10-36  (36)
 25 PF00008 EGF:  EGF-like domain   97.2 0.00023 5.1E-09   32.3   1.3   27   43-72      5-31  (32)
 26 PF12661 hEGF:  Human growth fa  97.1 0.00019   4E-09   25.9   0.7   13  108-120     1-13  (13)
 27 cd00054 EGF_CA Calcium-binding  96.9   0.002 4.3E-08   29.8   3.4   33   35-72      3-35  (38)
 28 PHA03099 epidermal growth fact  96.8  0.0013 2.9E-08   39.1   2.8   36   91-127    51-87  (139)
 29 PHA02887 EGF-like protein; Pro  96.5  0.0036 7.8E-08   36.8   3.0   38   86-124    84-125 (126)
 30 PF06247 Plasmod_Pvs28:  Plasmo  96.5 0.00072 1.6E-08   43.0   0.1  100    7-116    13-119 (197)
 31 smart00181 EGF Epidermal growt  96.4  0.0074 1.6E-07   27.5   3.4   25   42-71      6-30  (35)
 32 cd00053 EGF Epidermal growth f  96.3  0.0059 1.3E-07   27.6   2.7   21    3-24     10-30  (36)
 33 PF14670 FXa_inhibition:  Coagu  96.1  0.0029 6.4E-08   29.5   1.0   25   92-119     7-31  (36)
 34 PF07974 EGF_2:  EGF-like domai  96.0   0.013 2.8E-07   26.6   2.9   26   92-120     7-32  (32)
 35 KOG1226|consensus               95.7   0.075 1.6E-06   40.9   7.3   90   11-122   472-581 (783)
 36 PF06247 Plasmod_Pvs28:  Plasmo  95.4  0.0063 1.4E-07   38.9   0.8  104    4-119    55-163 (197)
 37 PF12946 EGF_MSP1_1:  MSP1 EGF   95.3  0.0074 1.6E-07   28.1   0.6   28    2-30      8-36  (37)
 38 KOG0994|consensus               94.5    0.15 3.3E-06   41.3   6.1   55   61-122   885-949 (1758)
 39 cd01475 vWA_Matrilin VWA_Matri  94.2   0.077 1.7E-06   35.0   3.6   36   78-119   183-220 (224)
 40 cd01475 vWA_Matrilin VWA_Matri  93.8   0.079 1.7E-06   35.0   3.1   40   26-72    180-219 (224)
 41 smart00051 DSL delta serrate l  93.4    0.21 4.7E-06   26.3   3.8   46   61-120    17-63  (63)
 42 KOG1836|consensus               92.4    0.41   9E-06   40.6   5.8   56   62-124   757-815 (1705)
 43 KOG3516|consensus               91.1    0.22 4.8E-06   40.3   2.8   36   86-122   546-582 (1306)
 44 KOG3514|consensus               86.2    0.61 1.3E-05   37.8   2.3   35   87-122   625-660 (1591)
 45 KOG1226|consensus               85.8     4.1 8.8E-05   32.0   6.3   63   43-124   556-623 (783)
 46 KOG0994|consensus               83.6     6.3 0.00014   32.8   6.7   17  107-123   530-546 (1758)
 47 PF00954 S_locus_glycop:  S-loc  82.9     2.2 4.7E-05   24.9   3.2   31   86-118    78-109 (110)
 48 PF00053 Laminin_EGF:  Laminin   76.7     1.8 3.9E-05   21.2   1.3   18  105-122    16-33  (49)
 49 PF09064 Tme5_EGF_like:  Thromb  76.4     3.5 7.6E-05   18.8   2.0   13  107-119    18-30  (34)
 50 KOG3516|consensus               73.5     3.9 8.6E-05   33.7   3.0   37   86-123   956-993 (1306)
 51 KOG3512|consensus               71.9      16 0.00034   27.4   5.4   18  104-121   461-478 (592)
 52 PF01414 DSL:  Delta serrate li  70.0     1.2 2.7E-05   23.4  -0.3   46   61-120    17-63  (63)
 53 cd00055 EGF_Lam Laminin-type e  67.0     4.7  0.0001   19.9   1.5   14  108-121    20-33  (50)
 54 smart00180 EGF_Lam Laminin-typ  66.4     5.1 0.00011   19.4   1.6   14  108-121    19-32  (46)
 55 PF12955 DUF3844:  Domain of un  66.0     6.2 0.00014   23.0   2.0   33   91-124    13-63  (103)
 56 PF01683 EB:  EB module;  Inter  60.2     8.7 0.00019   19.0   1.8    9   15-24     38-46  (52)
 57 cd01328 FSL_SPARC Follistatin-  51.7      25 0.00054   19.8   2.8    8  101-108    41-48  (86)
 58 PF01826 TIL:  Trypsin Inhibito  44.4     5.4 0.00012   20.0  -0.5   17   16-33     35-52  (55)
 59 KOG1836|consensus               42.2      43 0.00093   29.5   3.9   60    7-72    749-809 (1705)
 60 KOG3509|consensus               38.0      34 0.00073   28.1   2.6   38   86-124   407-444 (964)
 61 PF02822 Antistasin:  Antistasi  33.4      23  0.0005   15.0   0.7   14   18-31      9-22  (26)
 62 PF04863 EGF_alliinase:  Alliin  33.0      22 0.00049   18.1   0.7   21  103-123    32-52  (56)
 63 PF09289 FOLN:  Follistatin/Ost  29.3      43 0.00092   13.7   1.1    8   92-99      5-12  (22)
 64 smart00274 FOLN Follistatin-N-  28.7      59  0.0013   13.7   2.0   19   91-109     5-23  (26)
 65 KOG3514|consensus               26.0      49  0.0011   27.8   1.8   37   86-123  1019-1056(1591)
 66 KOG0196|consensus               24.4 1.5E+02  0.0033   24.3   4.1   16  101-116   302-317 (996)
 67 KOG1218|consensus               21.8 2.8E+02   0.006   19.0   6.9   15  108-122   163-177 (316)

No 1  
>KOG1214|consensus
Probab=99.58  E-value=7.9e-15  Score=108.43  Aligned_cols=111  Identities=30%  Similarity=0.715  Sum_probs=90.8

Q ss_pred             CCceeeeCCCceecccCCCCcc--cCCCCcccCC-----CCccCCCCCCCCCCeee--eCCCCCcceeeecCCCCCcCCC
Q psy4097           3 PYVECINTQGYRKCGQCPHGWV--GDGTTCRQGT-----TGCSVQNGGCHPLATCR--ETSDTVRSVISCTCPPGMGGSG   73 (129)
Q Consensus         3 ~~~~C~~~~g~~~C~~C~~g~~--~~~~~C~~~~-----~~C~~~~~~C~~~~~C~--~~~~g~~~~~~c~c~~g~~g~~   73 (129)
                      +++.|+|.+++|.| .|..||.  .++.+|..+.     +.|....+.|...++|.  ... ++  .|.|.|.+||.|+|
T Consensus       746 p~s~Cin~pg~~rc-eC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG-gs--~y~C~CLPGfsGDG  821 (1289)
T KOG1214|consen  746 PNSVCINLPGSYRC-ECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG-GS--TYSCACLPGFSGDG  821 (1289)
T ss_pred             CCceeecCCCceeE-EEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC-Cc--eEEEeecCCccCCc
Confidence            67899999999999 9999998  5667887552     55776666787655544  444 32  49999999999998


Q ss_pred             CCcCCcccCCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCcccc
Q psy4097          74 VGLMGCAYGMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESELLVP  124 (129)
Q Consensus        74 ~~~~~c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~~  124 (129)
                      .   .|. ++  |+|.+..|+..+.| .+.++++.|+|.+||.|++-+.++
T Consensus       822 ~---~c~-dv--DeC~psrChp~A~C-yntpgsfsC~C~pGy~GDGf~CVP  865 (1289)
T KOG1214|consen  822 H---QCT-DV--DECSPSRCHPAATC-YNTPGSFSCRCQPGYYGDGFQCVP  865 (1289)
T ss_pred             c---ccc-cc--cccCccccCCCceE-ecCCCcceeecccCccCCCceecC
Confidence            8   787 77  99999999999999 999999999999999998865444


No 2  
>KOG1219|consensus
Probab=99.57  E-value=7.7e-15  Score=116.47  Aligned_cols=103  Identities=28%  Similarity=0.663  Sum_probs=91.7

Q ss_pred             CCceeeeCC-CceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCccc
Q psy4097           3 PYVECINTQ-GYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAY   81 (129)
Q Consensus         3 ~~~~C~~~~-g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~   81 (129)
                      ++++|...+ |+|.| .|++.|.  |+.|+...++|.++  ||..+++|.... +.   |.|.|+.+|+|.     .|+ 
T Consensus      3874 hgG~C~~~~~ggy~C-kCpsqys--G~~CEi~~epC~sn--PC~~GgtCip~~-n~---f~CnC~~gyTG~-----~Ce- 3938 (4289)
T KOG1219|consen 3874 HGGTCISQPKGGYKC-KCPSQYS--GNHCEIDLEPCASN--PCLTGGTCIPFY-NG---FLCNCPNGYTGK-----RCE- 3938 (4289)
T ss_pred             CCCEecCCCCCceEE-eCccccc--CcccccccccccCC--CCCCCCEEEecC-CC---eeEeCCCCccCc-----eee-
Confidence            567888887 67999 9999999  99998766999988  999999999877 54   999999999999     998 


Q ss_pred             CCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCc
Q psy4097          82 GMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESEL  121 (129)
Q Consensus        82 ~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~  121 (129)
                      ....++|..++|.+++.| ++..|+|+|.|.+||.|..|.
T Consensus      3939 ~~Gi~eCs~n~C~~gg~C-~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3939 ARGISECSKNVCGTGGQC-INIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred             cccccccccccccCCcee-eccCCceEeccChhHhcccCc
Confidence            432289999999999999 999999999999999999874


No 3  
>KOG4289|consensus
Probab=99.34  E-value=2.6e-12  Score=99.55  Aligned_cols=100  Identities=26%  Similarity=0.684  Sum_probs=81.9

Q ss_pred             eCCCceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCC
Q psy4097           9 NTQGYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPC   88 (129)
Q Consensus         9 ~~~g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C   88 (129)
                      +..+.+.| .|++||+  +..|+..+|.|-+.  +|.+++.|...+ +.   |+|.|.++|+|.     +|+.+.....|
T Consensus      1217 ~pvnglrC-rCPpGFT--gd~CeTeiDlCYs~--pC~nng~C~srE-gg---YtCeCrpg~tGe-----hCEvs~~agrC 1282 (2531)
T KOG4289|consen 1217 HPVNGLRC-RCPPGFT--GDYCETEIDLCYSG--PCGNNGRCRSRE-GG---YTCECRPGFTGE-----HCEVSARAGRC 1282 (2531)
T ss_pred             cccCceeE-eCCCCCC--cccccchhHhhhcC--CCCCCCceEEec-Cc---eeEEecCCcccc-----ceeeecccCcc
Confidence            44567999 9999999  77998766999987  999999999888 76   999999999999     89732222467


Q ss_pred             CCCCCCCCCeeccCCCCceeeeCCCC-CccCCCcc
Q psy4097          89 GGVTCEHGGICAPIGDRGYRCQCEPG-FTGESELL  122 (129)
Q Consensus        89 ~~~~C~~~~~C~~~~~~~~~C~C~~g-~~g~~c~~  122 (129)
                      .+..|.+++.|+....+++.|.|+.| |.+..|+.
T Consensus      1283 vpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1283 VPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred             ccceecCCCEEeecCCCceeccCCCcccCCCceEE
Confidence            78889999999434568899999987 56777764


No 4  
>KOG1219|consensus
Probab=99.13  E-value=1.1e-10  Score=93.98  Aligned_cols=82  Identities=28%  Similarity=0.649  Sum_probs=70.4

Q ss_pred             CcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCCCCCeeccCCCCcee
Q psy4097          29 TCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCEHGGICAPIGDRGYR  108 (129)
Q Consensus        29 ~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~  108 (129)
                      .|....+.|..+  ||++++.|...+ ++  +|.|.|++.|.|.     +|+.+.  ++|..+||..++.| +...++|.
T Consensus      3859 gC~l~~d~C~~n--pCqhgG~C~~~~-~g--gy~CkCpsqysG~-----~CEi~~--epC~snPC~~GgtC-ip~~n~f~ 3925 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDN--PCQHGGTCISQP-KG--GYKCKCPSQYSGN-----HCEIDL--EPCASNPCLTGGTC-IPFYNGFL 3925 (4289)
T ss_pred             cccccccccccC--cccCCCEecCCC-CC--ceEEeCcccccCc-----cccccc--ccccCCCCCCCCEE-EecCCCee
Confidence            455444788877  999999999877 42  4999999999999     998444  89999999999999 88999999


Q ss_pred             eeCCCCCccCCCccc
Q psy4097         109 CQCEPGFTGESELLV  123 (129)
Q Consensus       109 C~C~~g~~g~~c~~~  123 (129)
                      |.|+.||+|++|+..
T Consensus      3926 CnC~~gyTG~~Ce~~ 3940 (4289)
T KOG1219|consen 3926 CNCPNGYTGKRCEAR 3940 (4289)
T ss_pred             EeCCCCccCceeecc
Confidence            999999999999854


No 5  
>KOG1214|consensus
Probab=99.01  E-value=8.2e-10  Score=82.59  Aligned_cols=106  Identities=30%  Similarity=0.770  Sum_probs=79.9

Q ss_pred             CceeeeCCC-ceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCc--CCCCCcCCcc
Q psy4097           4 YVECINTQG-YRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMG--GSGVGLMGCA   80 (129)
Q Consensus         4 ~~~C~~~~g-~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~--g~~~~~~~c~   80 (129)
                      ++.|....+ .|.| .|..||.+++++|.++ ++|+.....|..+++|++.+ ++   |.|.|..+|.  +.+.   .|.
T Consensus       705 ~a~C~pg~~~~~tc-ecs~g~~gdgr~c~d~-~eca~~~~~CGp~s~Cin~p-g~---~rceC~~gy~F~dd~~---tCV  775 (1289)
T KOG1214|consen  705 TARCHPGTGVDYTC-ECSSGYQGDGRNCVDE-NECATGFHRCGPNSVCINLP-GS---YRCECRSGYEFADDRH---TCV  775 (1289)
T ss_pred             CccccCCCCcceEE-EEeeccCCCCCCCCCh-hhhccCCCCCCCCceeecCC-Cc---eeEEEeecceeccCCc---ceE
Confidence            456666554 6999 9999999999999999 99999888999999999999 88   9999988875  3333   465


Q ss_pred             cCCC----CCCCC--CCCCCCCCe--eccCCCCceeeeCCCCCccCC
Q psy4097          81 YGMS----GNPCG--GVTCEHGGI--CAPIGDRGYRCQCEPGFTGES  119 (129)
Q Consensus        81 ~~~~----~~~C~--~~~C~~~~~--C~~~~~~~~~C~C~~g~~g~~  119 (129)
                       .+.    .+.|.  .+.|...+.  |+....+.|.|.|.+||.|++
T Consensus       776 -~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG  821 (1289)
T KOG1214|consen  776 -LITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG  821 (1289)
T ss_pred             -EecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCc
Confidence             332    14454  255654334  423456789999999999887


No 6  
>KOG4289|consensus
Probab=98.63  E-value=1.9e-08  Score=79.07  Aligned_cols=56  Identities=38%  Similarity=0.908  Sum_probs=50.4

Q ss_pred             eeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCccc
Q psy4097          60 VISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESELLV  123 (129)
Q Consensus        60 ~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~  123 (129)
                      ++.|+|++||+++     .|+ ... |.|-..||.+++.| ....++|.|.|.+||+|+.|+.+
T Consensus      1221 glrCrCPpGFTgd-----~Ce-Tei-DlCYs~pC~nng~C-~srEggYtCeCrpg~tGehCEvs 1276 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGD-----YCE-TEI-DLCYSGPCGNNGRC-RSREGGYTCECRPGFTGEHCEVS 1276 (2531)
T ss_pred             ceeEeCCCCCCcc-----ccc-chh-HhhhcCCCCCCCce-EEecCceeEEecCCccccceeee
Confidence            3889999999999     898 544 89999999999999 88999999999999999999864


No 7  
>KOG4260|consensus
Probab=98.44  E-value=1.7e-07  Score=62.36  Aligned_cols=89  Identities=26%  Similarity=0.620  Sum_probs=67.8

Q ss_pred             ceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCC
Q psy4097          13 YRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVT   92 (129)
Q Consensus        13 ~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~   92 (129)
                      +..|+.|..||..+...|.++ |+|...+.+|.....|+|+. ||   |.|...+||.. +..  .|++..  +.|.   
T Consensus       216 ~k~C~kCkkGW~lde~gCvDv-nEC~~ep~~c~~~qfCvNte-GS---f~C~dk~Gy~~-g~d--~C~~~~--d~~~---  282 (350)
T KOG4260|consen  216 SKGCSKCKKGWKLDEEGCVDV-NECQNEPAPCKAHQFCVNTE-GS---FKCEDKEGYKK-GVD--ECQFCA--DVCA---  282 (350)
T ss_pred             CCChhhhcccceecccccccH-HHHhcCCCCCChhheeecCC-Cc---eEecccccccC-ChH--Hhhhhh--hhcc---
Confidence            455668999999888899999 99998888999999999999 99   99999888876 222  444111  2232   


Q ss_pred             CCCCCeeccCCCCceeeeCCCCCc
Q psy4097          93 CEHGGICAPIGDRGYRCQCEPGFT  116 (129)
Q Consensus        93 C~~~~~C~~~~~~~~~C~C~~g~~  116 (129)
                       .....| .+..+.|+|.|..++.
T Consensus       283 -~kn~~c-~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  283 -SKNRPC-MNIDGQYRCVCFSGLI  304 (350)
T ss_pred             -cCCCCc-ccCCccEEEEecccce
Confidence             123357 8899999999988763


No 8  
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.42  E-value=9.5e-08  Score=43.72  Aligned_cols=31  Identities=42%  Similarity=1.117  Sum_probs=26.0

Q ss_pred             CCCCCCCCCCeeccCCC-CceeeeCCCCCccCC
Q psy4097          88 CGGVTCEHGGICAPIGD-RGYRCQCEPGFTGES  119 (129)
Q Consensus        88 C~~~~C~~~~~C~~~~~-~~~~C~C~~g~~g~~  119 (129)
                      |.+++|.+++.| +... +.|.|.|++||.|+.
T Consensus         1 C~~~~C~n~g~C-~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTC-IDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEE-EEESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEE-EeCCCCCEEeECCCCCccCC
Confidence            445689999999 6666 999999999999873


No 9  
>KOG1217|consensus
Probab=98.41  E-value=1.9e-06  Score=62.05  Aligned_cols=104  Identities=37%  Similarity=0.839  Sum_probs=73.1

Q ss_pred             CCceeeeCCCceecccCCCCcccCCCCcccC-------------------CCCccCCCCCCCCC-CeeeeCCCCCcceee
Q psy4097           3 PYVECINTQGYRKCGQCPHGWVGDGTTCRQG-------------------TTGCSVQNGGCHPL-ATCRETSDTVRSVIS   62 (129)
Q Consensus         3 ~~~~C~~~~g~~~C~~C~~g~~~~~~~C~~~-------------------~~~C~~~~~~C~~~-~~C~~~~~g~~~~~~   62 (129)
                      .+..|.+..++|.| .|.++|.  +..|...                   ...|......|... ++|.+.. ++   +.
T Consensus       181 ~~~~C~~~~~~~~C-~c~~~~~--~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~-~~---~~  253 (487)
T KOG1217|consen  181 NGGTCVNTGGSYLC-SCPPGYT--GSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTV-GS---YT  253 (487)
T ss_pred             CCcccccCCCCeeE-eCCCCcc--CCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccC-Cc---ee
Confidence            35678888888999 9999998  3333211                   01222221223322 6788777 66   89


Q ss_pred             ecCCCCCcCCC-CCcCCcccCCCCCCCCCC-CCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097          63 CTCPPGMGGSG-VGLMGCAYGMSGNPCGGV-TCEHGGICAPIGDRGYRCQCEPGFTGESE  120 (129)
Q Consensus        63 c~c~~g~~g~~-~~~~~c~~~~~~~~C~~~-~C~~~~~C~~~~~~~~~C~C~~g~~g~~c  120 (129)
                      |.++++|.+.. .   .+. ++  +.|... +|.+++.| ++..+.|.|.|++||.|..+
T Consensus       254 C~~~~g~~~~~~~---~~~-~~--~~C~~~~~c~~~~~C-~~~~~~~~C~C~~g~~g~~~  306 (487)
T KOG1217|consen  254 CRCPEGYTGDACV---TCV-DV--DSCALIASCPNGGTC-VNVPGSYRCTCPPGFTGRLC  306 (487)
T ss_pred             eeCCCCccccccc---eee-ec--cccCCCCccCCCCee-ecCCCcceeeCCCCCCCCCC
Confidence            99999998773 1   244 55  788754 38888899 88888899999999999987


No 10 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.35  E-value=1e-06  Score=41.84  Aligned_cols=35  Identities=37%  Similarity=0.970  Sum_probs=30.2

Q ss_pred             CCCCC-CCCCCCCeeccCCCCceeeeCCCCCc-cCCCc
Q psy4097          86 NPCGG-VTCEHGGICAPIGDRGYRCQCEPGFT-GESEL  121 (129)
Q Consensus        86 ~~C~~-~~C~~~~~C~~~~~~~~~C~C~~g~~-g~~c~  121 (129)
                      ++|.. .+|.+++.| ++..++|.|.|++||. |..|+
T Consensus         3 ~~C~~~~~C~~~~~C-~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        3 DECASGNPCQNGGTC-VNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             ccCcCCCCcCCCCEe-ECCCCCeEeECCCCCccCCcCC
Confidence            67776 689888899 8999999999999999 87774


No 11 
>KOG1217|consensus
Probab=98.29  E-value=4.6e-06  Score=60.06  Aligned_cols=107  Identities=36%  Similarity=0.853  Sum_probs=79.4

Q ss_pred             ceeeeCCCceecccCCCCcccCC-CCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCC
Q psy4097           5 VECINTQGYRKCGQCPHGWVGDG-TTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGM   83 (129)
Q Consensus         5 ~~C~~~~g~~~C~~C~~g~~~~~-~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~   83 (129)
                      ..|+++.+++.| .|.+||..+. ..|.++ ++|..... |.++++|.+.. +.   |.|.|+++|.+..-  ..+. +.
T Consensus       243 ~~c~~~~~~~~C-~~~~g~~~~~~~~~~~~-~~C~~~~~-c~~~~~C~~~~-~~---~~C~C~~g~~g~~~--~~~~-~~  312 (487)
T KOG1217|consen  243 GTCVNTVGSYTC-RCPEGYTGDACVTCVDV-DSCALIAS-CPNGGTCVNVP-GS---YRCTCPPGFTGRLC--TECV-DV  312 (487)
T ss_pred             CcccccCCceee-eCCCCccccccceeeec-cccCCCCc-cCCCCeeecCC-Cc---ceeeCCCCCCCCCC--cccc-cc
Confidence            678888888999 9999999554 456677 88988643 88889999988 76   89999999998832  0122 33


Q ss_pred             CCCCCC----CCCCCCCCeec-cCCCCceeeeCCCCCccCCCccc
Q psy4097          84 SGNPCG----GVTCEHGGICA-PIGDRGYRCQCEPGFTGESELLV  123 (129)
Q Consensus        84 ~~~~C~----~~~C~~~~~C~-~~~~~~~~C~C~~g~~g~~c~~~  123 (129)
                        ..|.    ..+|..++.|. ......+.|.|..+|.|..|+..
T Consensus       313 --~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~  355 (487)
T KOG1217|consen  313 --DECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDS  355 (487)
T ss_pred             --ccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccC
Confidence              5663    35577766671 34445788999999999988754


No 12 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.24  E-value=8.8e-07  Score=43.08  Aligned_cols=32  Identities=34%  Similarity=0.778  Sum_probs=29.7

Q ss_pred             CCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCc
Q psy4097          35 TGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMG   70 (129)
Q Consensus        35 ~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~   70 (129)
                      +||....+.|..++.|+|+. |+   |.|.|++||.
T Consensus         3 dEC~~~~~~C~~~~~C~N~~-Gs---y~C~C~~Gy~   34 (42)
T PF07645_consen    3 DECAEGPHNCPENGTCVNTE-GS---YSCSCPPGYE   34 (42)
T ss_dssp             STTTTTSSSSSTTSEEEEET-TE---EEEEESTTEE
T ss_pred             cccCCCCCcCCCCCEEEcCC-CC---EEeeCCCCcE
Confidence            89998888898899999999 98   9999999997


No 13 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.23  E-value=6.3e-07  Score=42.00  Aligned_cols=28  Identities=43%  Similarity=0.954  Sum_probs=22.9

Q ss_pred             CCCceeeeCCCceecccCCCCcccCCCCc
Q psy4097           2 TPYVECINTQGYRKCGQCPHGWVGDGTTC   30 (129)
Q Consensus         2 ~~~~~C~~~~g~~~C~~C~~g~~~~~~~C   30 (129)
                      +++|.|+++.++|.| .|.+||.++|..|
T Consensus         9 ~~nA~C~~~~~~~~C-~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    9 HPNATCTNTGGSYTC-TCKPGYEGDGFFC   36 (36)
T ss_dssp             -TTCEEEE-TTSEEE-EE-CEEECCSTCE
T ss_pred             CCCcEeecCCCCEEe-ECCCCCccCCcCC
Confidence            578999999999999 9999999888765


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.15  E-value=1.1e-06  Score=42.79  Aligned_cols=30  Identities=30%  Similarity=0.961  Sum_probs=26.6

Q ss_pred             CCCCC--CCCCCCCeeccCCCCceeeeCCCCCc
Q psy4097          86 NPCGG--VTCEHGGICAPIGDRGYRCQCEPGFT  116 (129)
Q Consensus        86 ~~C~~--~~C~~~~~C~~~~~~~~~C~C~~g~~  116 (129)
                      ++|..  +.|...+.| +|+.|+|.|.|++||.
T Consensus         3 dEC~~~~~~C~~~~~C-~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    3 DECAEGPHNCPENGTC-VNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             STTTTTSSSSSTTSEE-EEETTEEEEEESTTEE
T ss_pred             cccCCCCCcCCCCCEE-EcCCCCEEeeCCCCcE
Confidence            78873  569878899 9999999999999998


No 15 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.12  E-value=6.1e-06  Score=38.72  Aligned_cols=35  Identities=37%  Similarity=0.955  Sum_probs=29.7

Q ss_pred             CCCCC-CCCCCCCeeccCCCCceeeeCCCCCccCCCc
Q psy4097          86 NPCGG-VTCEHGGICAPIGDRGYRCQCEPGFTGESEL  121 (129)
Q Consensus        86 ~~C~~-~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~  121 (129)
                      ++|.. .+|.+++.| .+..+.|.|.|+.||.|..|+
T Consensus         3 ~~C~~~~~C~~~~~C-~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           3 DECASGNPCQNGGTC-VNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ccCCCCCCcCCCCEe-ECCCCCeEeECCCCCcCCcCC
Confidence            66765 688877899 888999999999999998774


No 16 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.98  E-value=6e-06  Score=34.95  Aligned_cols=22  Identities=32%  Similarity=0.815  Sum_probs=18.9

Q ss_pred             ceecccCCCCcc--cCCCCcccCCCC
Q psy4097          13 YRKCGQCPHGWV--GDGTTCRQGTTG   36 (129)
Q Consensus        13 ~~~C~~C~~g~~--~~~~~C~~~~~~   36 (129)
                      +|.| .|++||.  .++.+|+++ ||
T Consensus         1 sy~C-~C~~Gy~l~~d~~~C~DI-dE   24 (24)
T PF12662_consen    1 SYTC-SCPPGYQLSPDGRSCEDI-DE   24 (24)
T ss_pred             CEEe-eCCCCCcCCCCCCccccC-CC
Confidence            5899 9999999  577899998 65


No 17 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.96  E-value=3.7e-06  Score=39.38  Aligned_cols=32  Identities=53%  Similarity=1.129  Sum_probs=24.8

Q ss_pred             CCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCC
Q psy4097          39 VQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGV   74 (129)
Q Consensus        39 ~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~   74 (129)
                      ...+.|+.++.|.++. ++   +.|.|.+||.|+|+
T Consensus         3 ~~~~~C~~nA~C~~~~-~~---~~C~C~~Gy~GdG~   34 (36)
T PF12947_consen    3 ENNGGCHPNATCTNTG-GS---YTCTCKPGYEGDGF   34 (36)
T ss_dssp             TGGGGS-TTCEEEE-T-TS---EEEEE-CEEECCST
T ss_pred             CCCCCCCCCcEeecCC-CC---EEeECCCCCccCCc
Confidence            3355799999999999 87   99999999999976


No 18 
>KOG1225|consensus
Probab=97.93  E-value=2.9e-05  Score=56.81  Aligned_cols=78  Identities=35%  Similarity=0.912  Sum_probs=47.0

Q ss_pred             ecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCC
Q psy4097          15 KCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCE   94 (129)
Q Consensus        15 ~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~   94 (129)
                      +| .|++||.  +..|...  .|..   .|..++.+.+   +     .|.|.++|.|.     .|+ .   ..| +.+|.
T Consensus       266 ~C-IC~~Gf~--G~dC~e~--~Cp~---~cs~~g~~~~---g-----~CiC~~g~~G~-----dCs-~---~~c-padC~  319 (525)
T KOG1225|consen  266 RC-ICPPGFT--GDDCDEL--VCPV---DCSGGGVCVD---G-----ECICNPGYSGK-----DCS-I---RRC-PADCS  319 (525)
T ss_pred             eE-eCCCCCc--CCCCCcc--cCCc---ccCCCceecC---C-----EeecCCCcccc-----ccc-c---ccC-CccCC
Confidence            56 7777777  6666542  2432   2444433432   2     57788888777     665 3   234 35677


Q ss_pred             CCCeeccCCCCceeeeCCCCCccCCCccc
Q psy4097          95 HGGICAPIGDRGYRCQCEPGFTGESELLV  123 (129)
Q Consensus        95 ~~~~C~~~~~~~~~C~C~~g~~g~~c~~~  123 (129)
                      .++.| +  .+  +|.|.+||+|+.|...
T Consensus       320 g~G~C-i--~G--~C~C~~Gy~G~~C~~~  343 (525)
T KOG1225|consen  320 GHGKC-I--DG--ECLCDEGYTGELCIQR  343 (525)
T ss_pred             CCCcc-c--CC--ceEeCCCCcCCccccc
Confidence            77777 4  22  6778888888777653


No 19 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.89  E-value=2.8e-05  Score=35.96  Aligned_cols=29  Identities=34%  Similarity=0.842  Sum_probs=24.6

Q ss_pred             CCCCCCCeeccCCCCceeeeCCCCCcc-CCCc
Q psy4097          91 VTCEHGGICAPIGDRGYRCQCEPGFTG-ESEL  121 (129)
Q Consensus        91 ~~C~~~~~C~~~~~~~~~C~C~~g~~g-~~c~  121 (129)
                      .+|.++ .| ++..+.|.|.|++||.| ..|+
T Consensus         6 ~~C~~~-~C-~~~~~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        6 GPCSNG-TC-INTPGSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCCC-EE-ECCCCCeEeECCCCCccCCccC
Confidence            578877 89 88899999999999999 6553


No 20 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.87  E-value=3.4e-05  Score=35.50  Aligned_cols=30  Identities=43%  Similarity=1.022  Sum_probs=25.3

Q ss_pred             CCCCCCCCeeccCCCCceeeeCCCCCccC-CC
Q psy4097          90 GVTCEHGGICAPIGDRGYRCQCEPGFTGE-SE  120 (129)
Q Consensus        90 ~~~C~~~~~C~~~~~~~~~C~C~~g~~g~-~c  120 (129)
                      ..+|.+++.| ++..+.|.|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C-~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTC-VNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEE-ecCCCCeEeECCCCCcccCCc
Confidence            4567777799 888899999999999998 55


No 21 
>KOG4260|consensus
Probab=97.86  E-value=1.2e-05  Score=53.83  Aligned_cols=92  Identities=26%  Similarity=0.674  Sum_probs=60.0

Q ss_pred             CCceecccCCCCcccCCCCcccCC--------CCccCCCCCCCC--CCeeeeCCCCCcceeee-cCCCCCcCCCCCcCCc
Q psy4097          11 QGYRKCGQCPHGWVGDGTTCRQGT--------TGCSVQNGGCHP--LATCRETSDTVRSVISC-TCPPGMGGSGVGLMGC   79 (129)
Q Consensus        11 ~g~~~C~~C~~g~~~~~~~C~~~~--------~~C~~~~~~C~~--~~~C~~~~~g~~~~~~c-~c~~g~~g~~~~~~~c   79 (129)
                      .|+-+| .|.+||.  +..|..-.        ++-...-..|+.  .++|....  +   ..| .|..||.....   .|
T Consensus       165 ~GsGkC-kC~~GY~--Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~--~---k~C~kCkkGW~lde~---gC  233 (350)
T KOG4260|consen  165 EGSGKC-KCETGYT--GPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGES--S---KGCSKCKKGWKLDEE---GC  233 (350)
T ss_pred             CCCCcc-cccCCCC--CccccccchHHHHhhcccccchhhhhhhhhhcccCCCC--C---CChhhhcccceeccc---cc
Confidence            367789 9999999  77764310        110000001221  12343222  1   345 68889875533   68


Q ss_pred             ccCCCCCCCC--CCCCCCCCeeccCCCCceeeeCCCCCcc
Q psy4097          80 AYGMSGNPCG--GVTCEHGGICAPIGDRGYRCQCEPGFTG  117 (129)
Q Consensus        80 ~~~~~~~~C~--~~~C~~~~~C~~~~~~~~~C~C~~g~~g  117 (129)
                      . ++  |+|.  +.+|.....| +|+.++|.|.+.+||.+
T Consensus       234 v-Dv--nEC~~ep~~c~~~qfC-vNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  234 V-DV--NECQNEPAPCKAHQFC-VNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             c-cH--HHHhcCCCCCChhhee-ecCCCceEecccccccC
Confidence            8 88  9997  5789888899 99999999999999975


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.49  E-value=0.00023  Score=33.46  Aligned_cols=33  Identities=33%  Similarity=0.785  Sum_probs=26.9

Q ss_pred             CCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCc-CC
Q psy4097          35 TGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMG-GS   72 (129)
Q Consensus        35 ~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~-g~   72 (129)
                      ++|... .+|.+++.|.++. ++   |.|.|+.+|. +.
T Consensus         3 ~~C~~~-~~C~~~~~C~~~~-g~---~~C~C~~g~~~g~   36 (39)
T smart00179        3 DECASG-NPCQNGGTCVNTV-GS---YRCECPPGYTDGR   36 (39)
T ss_pred             ccCcCC-CCcCCCCEeECCC-CC---eEeECCCCCccCC
Confidence            677762 4898888999988 77   9999999998 55


No 23 
>KOG1225|consensus
Probab=97.48  E-value=0.00065  Score=49.98  Aligned_cols=79  Identities=37%  Similarity=0.988  Sum_probs=55.0

Q ss_pred             ecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCcccCCCCCCCCCCCCC
Q psy4097          15 KCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAYGMSGNPCGGVTCE   94 (129)
Q Consensus        15 ~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~   94 (129)
                      .| .|..+|.  +..|..  -.|..   .|...+.|..   |     .|.|++||.|.     .|. .   -.|... |.
T Consensus       235 ic-~c~~~~~--g~~c~~--~~C~~---~c~~~g~c~~---G-----~CIC~~Gf~G~-----dC~-e---~~Cp~~-cs  288 (525)
T KOG1225|consen  235 IC-ECPEGYF--GPLCST--IYCPG---GCTGRGQCVE---G-----RCICPPGFTGD-----DCD-E---LVCPVD-CS  288 (525)
T ss_pred             ee-ecCCcee--CCcccc--ccCCC---CCcccceEeC---C-----eEeCCCCCcCC-----CCC-c---ccCCcc-cC
Confidence            57 7888887  666763  34543   4555556653   3     59999999999     886 4   356443 77


Q ss_pred             CCCeeccCCCCceeeeCCCCCccCCCcccc
Q psy4097          95 HGGICAPIGDRGYRCQCEPGFTGESELLVP  124 (129)
Q Consensus        95 ~~~~C~~~~~~~~~C~C~~g~~g~~c~~~~  124 (129)
                      .++.+   ..+  .|.|++||.|+.|+...
T Consensus       289 ~~g~~---~~g--~CiC~~g~~G~dCs~~~  313 (525)
T KOG1225|consen  289 GGGVC---VDG--ECICNPGYSGKDCSIRR  313 (525)
T ss_pred             CCcee---cCC--EeecCCCcccccccccc
Confidence            76566   333  89999999999997544


No 24 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.35  E-value=0.00016  Score=33.78  Aligned_cols=25  Identities=48%  Similarity=1.127  Sum_probs=19.3

Q ss_pred             ceeeeCCCceecccCCCCcc--cCCCCc
Q psy4097           5 VECINTQGYRKCGQCPHGWV--GDGTTC   30 (129)
Q Consensus         5 ~~C~~~~g~~~C~~C~~g~~--~~~~~C   30 (129)
                      ..|++++++|.| .|++||.  .|+.+|
T Consensus        10 h~C~~~~g~~~C-~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen   10 HICVNTPGSYRC-SCPPGYKLAEDGRTC   36 (36)
T ss_dssp             SEEEEETTSEEE-E-STTEEE-TTSSSE
T ss_pred             CCCccCCCceEe-ECCCCCEECcCCCCC
Confidence            468999999999 9999998  444444


No 25 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.15  E-value=0.00023  Score=32.32  Aligned_cols=27  Identities=33%  Similarity=0.776  Sum_probs=22.3

Q ss_pred             CCCCCCeeeeCCCCCcceeeecCCCCCcCC
Q psy4097          43 GCHPLATCRETSDTVRSVISCTCPPGMGGS   72 (129)
Q Consensus        43 ~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~   72 (129)
                      +|.++++|+... ..  .|.|.|++||.|.
T Consensus         5 ~C~n~g~C~~~~-~~--~y~C~C~~G~~G~   31 (32)
T PF00008_consen    5 PCQNGGTCIDLP-GG--GYTCECPPGYTGK   31 (32)
T ss_dssp             SSTTTEEEEEES-TS--EEEEEEBTTEEST
T ss_pred             cCCCCeEEEeCC-CC--CEEeECCCCCccC
Confidence            899999999765 22  3999999999875


No 26 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.14  E-value=0.00019  Score=25.86  Aligned_cols=13  Identities=54%  Similarity=1.262  Sum_probs=10.4

Q ss_pred             eeeCCCCCccCCC
Q psy4097         108 RCQCEPGFTGESE  120 (129)
Q Consensus       108 ~C~C~~g~~g~~c  120 (129)
                      .|+|++||.|..|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4899999999876


No 27 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.92  E-value=0.002  Score=29.77  Aligned_cols=33  Identities=33%  Similarity=0.801  Sum_probs=26.2

Q ss_pred             CCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCC
Q psy4097          35 TGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGS   72 (129)
Q Consensus        35 ~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~   72 (129)
                      ++|... .+|.+++.|.+.. +.   |.|.|+.+|.+.
T Consensus         3 ~~C~~~-~~C~~~~~C~~~~-~~---~~C~C~~g~~g~   35 (38)
T cd00054           3 DECASG-NPCQNGGTCVNTV-GS---YRCSCPPGYTGR   35 (38)
T ss_pred             ccCCCC-CCcCCCCEeECCC-CC---eEeECCCCCcCC
Confidence            567652 3788888999888 76   999999999876


No 28 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=96.82  E-value=0.0013  Score=39.15  Aligned_cols=36  Identities=33%  Similarity=0.597  Sum_probs=28.4

Q ss_pred             CCCCCCCeec-cCCCCceeeeCCCCCccCCCccccccc
Q psy4097          91 VTCEHGGICA-PIGDRGYRCQCEPGFTGESELLVPLLS  127 (129)
Q Consensus        91 ~~C~~~~~C~-~~~~~~~~C~C~~g~~g~~c~~~~~~~  127 (129)
                      +.|.++ .|+ ......+.|.|..||.|++||...++.
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~dLl~   87 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVVLVD   87 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCcccccccceeeee
Confidence            568776 785 455678899999999999999876543


No 29 
>PHA02887 EGF-like protein; Provisional
Probab=96.49  E-value=0.0036  Score=36.77  Aligned_cols=38  Identities=29%  Similarity=0.484  Sum_probs=28.3

Q ss_pred             CCCC---CCCCCCCCeec-cCCCCceeeeCCCCCccCCCcccc
Q psy4097          86 NPCG---GVTCEHGGICA-PIGDRGYRCQCEPGFTGESELLVP  124 (129)
Q Consensus        86 ~~C~---~~~C~~~~~C~-~~~~~~~~C~C~~g~~g~~c~~~~  124 (129)
                      .+|.   .+.|.++ .|. ........|.|+.||.|.+|+.+.
T Consensus        84 ~pC~~eyk~YCiHG-~C~yI~dL~epsCrC~~GYtG~RCE~vs  125 (126)
T PHA02887         84 EKCKNDFNDFCING-ECMNIIDLDEKFCICNKGYTGIRCDEVS  125 (126)
T ss_pred             cccChHhhCEeeCC-EEEccccCCCceeECCCCcccCCCCccc
Confidence            5665   2567765 884 345677899999999999998653


No 30 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.48  E-value=0.00072  Score=43.04  Aligned_cols=100  Identities=24%  Similarity=0.528  Sum_probs=60.4

Q ss_pred             eeeCCCceecccCCCCcc-cCCCCcccCCCCccC---CCCCCCCCCeeeeCCC-CCcceeeecCCCCCcCCCCCcCCccc
Q psy4097           7 CINTQGYRKCGQCPHGWV-GDGTTCRQGTTGCSV---QNGGCHPLATCRETSD-TVRSVISCTCPPGMGGSGVGLMGCAY   81 (129)
Q Consensus         7 C~~~~g~~~C~~C~~g~~-~~~~~C~~~~~~C~~---~~~~C~~~~~C~~~~~-g~~~~~~c~c~~g~~g~~~~~~~c~~   81 (129)
                      .+.....|.| .|.+||. .+..+|+.. .+|..   ...+|...+.|.+... +....|.|.|..+|....-   .|. 
T Consensus        13 LiQMSNHfEC-~Cnegfvl~~EntCE~k-v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~---vCv-   86 (197)
T PF06247_consen   13 LIQMSNHFEC-KCNEGFVLKNENTCEEK-VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG---VCV-   86 (197)
T ss_dssp             EEEESSEEEE-EESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS---SEE-
T ss_pred             EEEccCceEE-EcCCCcEEccccccccc-eecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC---eEc-
Confidence            3444567999 9999998 667899887 66654   2346888889987651 1112489999999975422   465 


Q ss_pred             CCCCCCCCCCCCCCCCeecc--CCCCceeeeCCCCCc
Q psy4097          82 GMSGNPCGGVTCEHGGICAP--IGDRGYRCQCEPGFT  116 (129)
Q Consensus        82 ~~~~~~C~~~~C~~~~~C~~--~~~~~~~C~C~~g~~  116 (129)
                      .   +.|....|..+ .|++  .......|.|.-|+.
T Consensus        87 p---~~C~~~~Cg~G-KCI~d~~~~~~~~CSC~IGkV  119 (197)
T PF06247_consen   87 P---NKCNNKDCGSG-KCILDPDNPNNPTCSCNIGKV  119 (197)
T ss_dssp             E---GGGSS---TTE-EEEEEEGGGSEEEEEE-TEEE
T ss_pred             h---hhcCceecCCC-eEEecCCCCCCceeEeeeceE
Confidence            3   46766678744 8842  223345899998887


No 31 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.41  E-value=0.0074  Score=27.52  Aligned_cols=25  Identities=44%  Similarity=1.022  Sum_probs=21.2

Q ss_pred             CCCCCCCeeeeCCCCCcceeeecCCCCCcC
Q psy4097          42 GGCHPLATCRETSDTVRSVISCTCPPGMGG   71 (129)
Q Consensus        42 ~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g   71 (129)
                      .+|.++ +|.+.. +.   +.|.|+.||.+
T Consensus         6 ~~C~~~-~C~~~~-~~---~~C~C~~g~~g   30 (35)
T smart00181        6 GPCSNG-TCINTP-GS---YTCSCPPGYTG   30 (35)
T ss_pred             CCCCCC-EEECCC-CC---eEeECCCCCcc
Confidence            378877 899887 76   99999999987


No 32 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.31  E-value=0.0059  Score=27.64  Aligned_cols=21  Identities=38%  Similarity=0.899  Sum_probs=18.9

Q ss_pred             CCceeeeCCCceecccCCCCcc
Q psy4097           3 PYVECINTQGYRKCGQCPHGWV   24 (129)
Q Consensus         3 ~~~~C~~~~g~~~C~~C~~g~~   24 (129)
                      +++.|+++.+.|.| .|+.||.
T Consensus        10 ~~~~C~~~~~~~~C-~C~~g~~   30 (36)
T cd00053          10 NGGTCVNTPGSYRC-VCPPGYT   30 (36)
T ss_pred             CCCEEecCCCCeEe-ECCCCCc
Confidence            45889999999999 9999998


No 33 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.09  E-value=0.0029  Score=29.49  Aligned_cols=25  Identities=40%  Similarity=1.006  Sum_probs=18.8

Q ss_pred             CCCCCCeeccCCCCceeeeCCCCCccCC
Q psy4097          92 TCEHGGICAPIGDRGYRCQCEPGFTGES  119 (129)
Q Consensus        92 ~C~~~~~C~~~~~~~~~C~C~~g~~g~~  119 (129)
                      .|.+  .| ++.+++|+|.|++||....
T Consensus         7 gC~h--~C-~~~~g~~~C~C~~Gy~L~~   31 (36)
T PF14670_consen    7 GCSH--IC-VNTPGSYRCSCPPGYKLAE   31 (36)
T ss_dssp             GSSS--EE-EEETTSEEEE-STTEEE-T
T ss_pred             CcCC--CC-ccCCCceEeECCCCCEECc
Confidence            4655  79 8889999999999997544


No 34 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.00  E-value=0.013  Score=26.55  Aligned_cols=26  Identities=35%  Similarity=0.791  Sum_probs=20.5

Q ss_pred             CCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097          92 TCEHGGICAPIGDRGYRCQCEPGFTGESE  120 (129)
Q Consensus        92 ~C~~~~~C~~~~~~~~~C~C~~g~~g~~c  120 (129)
                      .|..+++| +..  ..+|.|.+||.|..|
T Consensus         7 ~C~~~G~C-~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTC-VSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEE-eCC--CCEEECCCCCcCCCC
Confidence            57777899 544  458999999999876


No 35 
>KOG1226|consensus
Probab=95.69  E-value=0.075  Score=40.88  Aligned_cols=90  Identities=30%  Similarity=0.706  Sum_probs=50.2

Q ss_pred             CCceecc--cCCCCcccCCCCcccC---------CCCccCC--CCCCCCCCeeeeCCCCCcceeeecCCCCCc----CCC
Q psy4097          11 QGYRKCG--QCPHGWVGDGTTCRQG---------TTGCSVQ--NGGCHPLATCRETSDTVRSVISCTCPPGMG----GSG   73 (129)
Q Consensus        11 ~g~~~C~--~C~~g~~~~~~~C~~~---------~~~C~~~--~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~----g~~   73 (129)
                      .|.+.|+  .|.+||.  |..|+=.         .+.|...  ..+|...+.|.-   |     +|.|.+...    |. 
T Consensus       472 ~G~~~CG~C~C~~G~~--G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C---G-----qC~C~~~~~~~i~G~-  540 (783)
T KOG1226|consen  472 NGTFVCGQCRCDEGWL--GKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC---G-----QCVCHKPDNGKIYGK-  540 (783)
T ss_pred             CCcEEecceecCCCCC--CCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC---C-----ceEecCCCCCceeee-
Confidence            4555443  8999999  7766522         0223221  115666666642   2     466666544    44 


Q ss_pred             CCcCCcccCCCCCCCCC---CCCCCCCeeccCCCCceeeeCCCCCccCCCcc
Q psy4097          74 VGLMGCAYGMSGNPCGG---VTCEHGGICAPIGDRGYRCQCEPGFTGESELL  122 (129)
Q Consensus        74 ~~~~~c~~~~~~~~C~~---~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~  122 (129)
                          .|+ -.- -.|..   ..|..++.|   .-|  +|.|.+||+|..|+-
T Consensus       541 ----fCE-CDn-fsC~r~~g~lC~g~G~C---~CG--~CvC~~GwtG~~C~C  581 (783)
T KOG1226|consen  541 ----FCE-CDN-FSCERHKGVLCGGHGRC---ECG--RCVCNPGWTGSACNC  581 (783)
T ss_pred             ----eee-ccC-cccccccCcccCCCCeE---eCC--cEEcCCCCccCCCCC
Confidence                565 320 23332   346655567   222  689999999999873


No 36 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.40  E-value=0.0063  Score=38.85  Aligned_cols=104  Identities=25%  Similarity=0.652  Sum_probs=58.7

Q ss_pred             CceeeeCC-----CceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCC
Q psy4097           4 YVECINTQ-----GYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMG   78 (129)
Q Consensus         4 ~~~C~~~~-----g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~   78 (129)
                      .+.|++..     ..|.| .|.+||......|..  +.|...  .|. .+.|+..+ .......|+|.-|+...  ....
T Consensus        55 ya~C~~~~~~~~~~~~~C-~C~~gY~~~~~vCvp--~~C~~~--~Cg-~GKCI~d~-~~~~~~~CSC~IGkV~~--dn~k  125 (197)
T PF06247_consen   55 YAKCINQANKGEERAYKC-DCINGYILKQGVCVP--NKCNNK--DCG-SGKCILDP-DNPNNPTCSCNIGKVPD--DNKK  125 (197)
T ss_dssp             TEEEEE-SSTTSSTSEEE-EE-TTEEESSSSEEE--GGGSS-----T-TEEEEEEE-GGGSEEEEEE-TEEETT--TTTE
T ss_pred             hhhhhcCCCcccceeEEE-ecccCceeeCCeEch--hhcCce--ecC-CCeEEecC-CCCCCceeEeeeceEec--cCCc
Confidence            46777665     36999 999999976677865  467655  777 46787544 22123689998887711  1113


Q ss_pred             cccCCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCccCC
Q psy4097          79 CAYGMSGNPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGES  119 (129)
Q Consensus        79 c~~~~~~~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~  119 (129)
                      |. .....+|. --|.....| -...+-|.|.|..++.+..
T Consensus       126 Ct-k~G~T~C~-LKCk~nE~C-K~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  126 CT-KTGETKCS-LKCKENEEC-KLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             SE-EEE---------TTTEEE-EEETTEEEEEE-TT-EEET
T ss_pred             cc-CCCcccee-eecCCCcce-eeeCcEEEeecCCCCCCCC
Confidence            44 22113453 246677789 8888999999999986444


No 37 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=95.27  E-value=0.0074  Score=28.15  Aligned_cols=28  Identities=25%  Similarity=0.700  Sum_probs=21.5

Q ss_pred             CCCceeeeCC-CceecccCCCCcccCCCCc
Q psy4097           2 TPYVECINTQ-GYRKCGQCPHGWVGDGTTC   30 (129)
Q Consensus         2 ~~~~~C~~~~-g~~~C~~C~~g~~~~~~~C   30 (129)
                      ..+|.|++.. |...| .|..||..++..|
T Consensus         8 P~NA~C~~~~dG~eec-rCllgyk~~~~~C   36 (37)
T PF12946_consen    8 PANAGCFRYDDGSEEC-RCLLGYKKVGGKC   36 (37)
T ss_dssp             -TTEEEEEETTSEEEE-EE-TTEEEETTEE
T ss_pred             CCCcccEEcCCCCEEE-EeeCCccccCCCc
Confidence            3578898887 89999 9999999666555


No 38 
>KOG0994|consensus
Probab=94.50  E-value=0.15  Score=41.30  Aligned_cols=55  Identities=25%  Similarity=0.573  Sum_probs=33.3

Q ss_pred             eee-cCCCCCcCCCCCcCCcccCCCCCCCCCCCCC--------CCCeec-cCCCCceeeeCCCCCccCCCcc
Q psy4097          61 ISC-TCPPGMGGSGVGLMGCAYGMSGNPCGGVTCE--------HGGICA-PIGDRGYRCQCEPGFTGESELL  122 (129)
Q Consensus        61 ~~c-~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~--------~~~~C~-~~~~~~~~C~C~~g~~g~~c~~  122 (129)
                      +.| +|..||.|++..   -. .   ..|.+-||.        +...|. -+......|.|.+||.|.+|+.
T Consensus       885 ~~CdrCl~GyyGdP~l---g~-g---~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~  949 (1758)
T KOG0994|consen  885 HSCDRCLDGYYGDPRL---GS-G---IGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEI  949 (1758)
T ss_pred             cchhhhhccccCCccc---CC-C---CCCCCCCCCCCCccchhccccccccccccceeeecccCccccchhh
Confidence            778 899999988652   11 1   233322332        222341 1234556799999999999874


No 39 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=94.18  E-value=0.077  Score=35.05  Aligned_cols=36  Identities=22%  Similarity=0.621  Sum_probs=27.9

Q ss_pred             CcccCCCCCCCC--CCCCCCCCeeccCCCCceeeeCCCCCccCC
Q psy4097          78 GCAYGMSGNPCG--GVTCEHGGICAPIGDRGYRCQCEPGFTGES  119 (129)
Q Consensus        78 ~c~~~~~~~~C~--~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~  119 (129)
                      .|. +.  ++|.  .++|.+  .| .+..++|.|.|+.||....
T Consensus       183 ~C~-~~--~~C~~~~~~c~~--~C-~~~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         183 ICV-VP--DLCATLSHVCQQ--VC-ISTPGSYLCACTEGYALLE  220 (224)
T ss_pred             cCc-Cc--hhhcCCCCCccc--eE-EcCCCCEEeECCCCccCCC
Confidence            566 55  7786  355765  69 9999999999999997643


No 40 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.77  E-value=0.079  Score=34.98  Aligned_cols=40  Identities=23%  Similarity=0.497  Sum_probs=32.0

Q ss_pred             CCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeecCCCCCcCC
Q psy4097          26 DGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCTCPPGMGGS   72 (129)
Q Consensus        26 ~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~   72 (129)
                      .+..|.+. ++|...++.|.  ..|.++. |+   |.|.|..||...
T Consensus       180 ~~~~C~~~-~~C~~~~~~c~--~~C~~~~-g~---~~c~c~~g~~~~  219 (224)
T cd01475         180 QGKICVVP-DLCATLSHVCQ--QVCISTP-GS---YLCACTEGYALL  219 (224)
T ss_pred             ccccCcCc-hhhcCCCCCcc--ceEEcCC-CC---EEeECCCCccCC
Confidence            35678877 89987766776  4699998 88   999999999754


No 41 
>smart00051 DSL delta serrate ligand.
Probab=93.44  E-value=0.21  Score=26.29  Aligned_cols=46  Identities=24%  Similarity=0.386  Sum_probs=30.2

Q ss_pred             eeecCCCCCcCCCCCcCCcccCCCCCCCCC-CCCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097          61 ISCTCPPGMGGSGVGLMGCAYGMSGNPCGG-VTCEHGGICAPIGDRGYRCQCEPGFTGESE  120 (129)
Q Consensus        61 ~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~-~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c  120 (129)
                      +.-.|..+|.|.     .|.     ..|.. +.+..+..| .. .|  .+.|.+||.|..|
T Consensus        17 ~rv~C~~~~yG~-----~C~-----~~C~~~~d~~~~~~C-d~-~G--~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYYGE-----GCN-----KFCRPRDDFFGHYTC-DE-NG--NKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCcCC-----ccC-----CEeCcCccccCCccC-Cc-CC--CEecCCCCcCCCC
Confidence            445788888888     776     44542 234445567 33 33  4689999999876


No 42 
>KOG1836|consensus
Probab=92.41  E-value=0.41  Score=40.61  Aligned_cols=56  Identities=30%  Similarity=0.607  Sum_probs=35.0

Q ss_pred             ee-cCCCCCcCCCCCcCCcccCCCCCCCCCCCCCCCCeec-cCCCCceeee-CCCCCccCCCcccc
Q psy4097          62 SC-TCPPGMGGSGVGLMGCAYGMSGNPCGGVTCEHGGICA-PIGDRGYRCQ-CEPGFTGESELLVP  124 (129)
Q Consensus        62 ~c-~c~~g~~g~~~~~~~c~~~~~~~~C~~~~C~~~~~C~-~~~~~~~~C~-C~~g~~g~~c~~~~  124 (129)
                      .| +|..||.+....      ... +.|..=+|..+..|. +.....+.|. |++||+|.+|+...
T Consensus       757 ~C~~C~~GfYg~~~~------~~~-~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~  815 (1705)
T KOG1836|consen  757 QCAQCVDGFYGLPDL------GTS-GDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECA  815 (1705)
T ss_pred             chhhhcCCCCCcccc------CCC-CCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCC
Confidence            45 688888776331      110 226555666665662 2225667898 99999999998543


No 43 
>KOG3516|consensus
Probab=91.05  E-value=0.22  Score=40.28  Aligned_cols=36  Identities=33%  Similarity=0.819  Sum_probs=32.4

Q ss_pred             CCCCCCCCCCCCeeccCCCCceeeeCC-CCCccCCCcc
Q psy4097          86 NPCGGVTCEHGGICAPIGDRGYRCQCE-PGFTGESELL  122 (129)
Q Consensus        86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~-~g~~g~~c~~  122 (129)
                      +.|.+++|++++.| ..+...|.|.|. .||.|..|..
T Consensus       546 drClPN~CehgG~C-~Qs~~~f~C~C~~TGY~GatCHt  582 (1306)
T KOG3516|consen  546 DRCLPNPCEHGGKC-SQSWDDFECNCELTGYKGATCHT  582 (1306)
T ss_pred             cccCCccccCCCcc-cccccceeEeccccccccccccC
Confidence            78889999999999 778889999998 7999999874


No 44 
>KOG3514|consensus
Probab=86.23  E-value=0.61  Score=37.77  Aligned_cols=35  Identities=31%  Similarity=0.771  Sum_probs=30.8

Q ss_pred             CCCCCCCCCCCeeccCCCCceeeeCCC-CCccCCCcc
Q psy4097          87 PCGGVTCEHGGICAPIGDRGYRCQCEP-GFTGESELL  122 (129)
Q Consensus        87 ~C~~~~C~~~~~C~~~~~~~~~C~C~~-g~~g~~c~~  122 (129)
                      .|.++||.+++.| ....+.|.|.|.. +|.|..|+.
T Consensus       625 ~C~~nPC~N~g~C-~egwNrfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQNGGKC-SEGWNRFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             ccCCCcccCCCCc-cccccccccccccCcccCccccc
Confidence            6888999999999 8888999999964 899999974


No 45 
>KOG1226|consensus
Probab=85.76  E-value=4.1  Score=31.98  Aligned_cols=63  Identities=30%  Similarity=0.720  Sum_probs=36.3

Q ss_pred             CCCCCCeeeeCCCCCcceeeecCCCCCcCCCCCcCCccc-CCCCCCCCC---CCCCCCCeeccCCCCceeeeCCCC-Ccc
Q psy4097          43 GCHPLATCRETSDTVRSVISCTCPPGMGGSGVGLMGCAY-GMSGNPCGG---VTCEHGGICAPIGDRGYRCQCEPG-FTG  117 (129)
Q Consensus        43 ~C~~~~~C~~~~~g~~~~~~c~c~~g~~g~~~~~~~c~~-~~~~~~C~~---~~C~~~~~C~~~~~~~~~C~C~~g-~~g  117 (129)
                      .|..+++|.-   |     .|+|.+||.|.     .|.- ... +.|.+   .-|...+.| .=  +  +|+|... |.|
T Consensus       556 lC~g~G~C~C---G-----~CvC~~GwtG~-----~C~C~~st-d~C~~~~G~iCSGrG~C-~C--g--~C~C~~~~~sG  616 (783)
T KOG1226|consen  556 LCGGHGRCEC---G-----RCVCNPGWTGS-----ACNCPLST-DTCESSDGQICSGRGTC-EC--G--RCKCTDPPYSG  616 (783)
T ss_pred             ccCCCCeEeC---C-----cEEcCCCCccC-----CCCCCCCC-ccccCCCCceeCCCcee-eC--C--ceEcCCCCcCc
Confidence            4666666653   2     58999999998     5531 111 45542   224444445 11  1  4666655 888


Q ss_pred             CCCcccc
Q psy4097         118 ESELLVP  124 (129)
Q Consensus       118 ~~c~~~~  124 (129)
                      ..|+..+
T Consensus       617 ~~CE~cp  623 (783)
T KOG1226|consen  617 EFCEKCP  623 (783)
T ss_pred             chhhcCC
Confidence            8887543


No 46 
>KOG0994|consensus
Probab=83.65  E-value=6.3  Score=32.79  Aligned_cols=17  Identities=24%  Similarity=0.468  Sum_probs=14.1

Q ss_pred             eeeeCCCCCccCCCccc
Q psy4097         107 YRCQCEPGFTGESELLV  123 (129)
Q Consensus       107 ~~C~C~~g~~g~~c~~~  123 (129)
                      -.|.|++++.|.+|+.+
T Consensus       530 GqC~CRe~~~GR~c~~~  546 (1758)
T KOG0994|consen  530 GQCECREHMLGRRCEQV  546 (1758)
T ss_pred             Ccccccccccccccccc
Confidence            37999999999998743


No 47 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=82.90  E-value=2.2  Score=24.88  Aligned_cols=31  Identities=32%  Similarity=0.829  Sum_probs=22.9

Q ss_pred             CCCC-CCCCCCCCeeccCCCCceeeeCCCCCccC
Q psy4097          86 NPCG-GVTCEHGGICAPIGDRGYRCQCEPGFTGE  118 (129)
Q Consensus        86 ~~C~-~~~C~~~~~C~~~~~~~~~C~C~~g~~g~  118 (129)
                      +.|. -..|...+.| .. .....|.|.+||..+
T Consensus        78 d~Cd~y~~CG~~g~C-~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   78 DQCDVYGFCGPNGIC-NS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             cCCCCccccCCccEe-CC-CCCCceECCCCcCCC
Confidence            6776 3679888899 43 345579999999754


No 48 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=76.72  E-value=1.8  Score=21.21  Aligned_cols=18  Identities=33%  Similarity=0.569  Sum_probs=13.8

Q ss_pred             CceeeeCCCCCccCCCcc
Q psy4097         105 RGYRCQCEPGFTGESELL  122 (129)
Q Consensus       105 ~~~~C~C~~g~~g~~c~~  122 (129)
                      ....|.|+++|.|..|+.
T Consensus        16 ~~G~C~C~~~~~G~~C~~   33 (49)
T PF00053_consen   16 STGQCVCKPGTTGPRCDQ   33 (49)
T ss_dssp             TCEEESBSTTEESTTS-E
T ss_pred             CCCEEeccccccCCcCcC
Confidence            344899999999999874


No 49 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=76.40  E-value=3.5  Score=18.81  Aligned_cols=13  Identities=23%  Similarity=0.680  Sum_probs=10.4

Q ss_pred             eeeeCCCCCccCC
Q psy4097         107 YRCQCEPGFTGES  119 (129)
Q Consensus       107 ~~C~C~~g~~g~~  119 (129)
                      +.|.|+.||+.+.
T Consensus        18 ~~C~CPeGyIlde   30 (34)
T PF09064_consen   18 GQCFCPEGYILDE   30 (34)
T ss_pred             CceeCCCceEecC
Confidence            4899999997554


No 50 
>KOG3516|consensus
Probab=73.47  E-value=3.9  Score=33.69  Aligned_cols=37  Identities=27%  Similarity=0.645  Sum_probs=30.4

Q ss_pred             CCCCCCCCCCCCeeccCCCCceeeeCCC-CCccCCCccc
Q psy4097          86 NPCGGVTCEHGGICAPIGDRGYRCQCEP-GFTGESELLV  123 (129)
Q Consensus        86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~~-g~~g~~c~~~  123 (129)
                      -.|.+.+|.+++.| +..-.+|.|.|.. .|.|..|...
T Consensus       956 GhCss~~C~NGG~C-very~gytCDCs~Tay~Gp~Cs~e  993 (1306)
T KOG3516|consen  956 GHCSSYPCLNGGHC-VERYDGYTCDCSRTAYDGPFCSKE  993 (1306)
T ss_pred             cccccccccCCCEE-EEecCceeeccccCcCCCCccccc
Confidence            45777899999999 7777899999964 7899998643


No 51 
>KOG3512|consensus
Probab=71.94  E-value=16  Score=27.43  Aligned_cols=18  Identities=11%  Similarity=0.167  Sum_probs=12.4

Q ss_pred             CCceeeeCCCCCccCCCc
Q psy4097         104 DRGYRCQCEPGFTGESEL  121 (129)
Q Consensus       104 ~~~~~C~C~~g~~g~~c~  121 (129)
                      +..+.+.|+.++.+.+++
T Consensus       461 ~qd~~s~Ck~~~~~~r~n  478 (592)
T KOG3512|consen  461 PQDQCSKCKASPGGKRLN  478 (592)
T ss_pred             chhccccCCCCCcceecc
Confidence            455667788888776654


No 52 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=70.01  E-value=1.2  Score=23.42  Aligned_cols=46  Identities=24%  Similarity=0.473  Sum_probs=18.8

Q ss_pred             eeecCCCCCcCCCCCcCCcccCCCCCCCCCC-CCCCCCeeccCCCCceeeeCCCCCccCCC
Q psy4097          61 ISCTCPPGMGGSGVGLMGCAYGMSGNPCGGV-TCEHGGICAPIGDRGYRCQCEPGFTGESE  120 (129)
Q Consensus        61 ~~c~c~~g~~g~~~~~~~c~~~~~~~~C~~~-~C~~~~~C~~~~~~~~~C~C~~g~~g~~c  120 (129)
                      +...|...|.|.     .|.     ..|.+. .-..+-.| . ..|..  .|.+||.|..|
T Consensus        17 ~rv~C~~nyyG~-----~C~-----~~C~~~~d~~ghy~C-d-~~G~~--~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYYGP-----NCS-----KFCKPRDDSFGHYTC-D-SNGNK--VCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEETT-----TT------EE---EEETTEEEEE---SS--E--EE-TTEESTTS
T ss_pred             EEEECCCCCCCc-----ccc-----CCcCCCcCCcCCccc-C-CCCCC--CCCCCCcCCCC
Confidence            556788888888     776     345421 01111245 3 33433  58899999876


No 53 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=66.99  E-value=4.7  Score=19.88  Aligned_cols=14  Identities=36%  Similarity=0.845  Sum_probs=10.8

Q ss_pred             eeeCCCCCccCCCc
Q psy4097         108 RCQCEPGFTGESEL  121 (129)
Q Consensus       108 ~C~C~~g~~g~~c~  121 (129)
                      +|.|++++.|..|+
T Consensus        20 ~C~C~~~~~G~~C~   33 (50)
T cd00055          20 QCECKPNTTGRRCD   33 (50)
T ss_pred             EEeCCCcCCCCCCC
Confidence            57788888888776


No 54 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=66.39  E-value=5.1  Score=19.42  Aligned_cols=14  Identities=36%  Similarity=0.899  Sum_probs=10.4

Q ss_pred             eeeCCCCCccCCCc
Q psy4097         108 RCQCEPGFTGESEL  121 (129)
Q Consensus       108 ~C~C~~g~~g~~c~  121 (129)
                      .|.|++++.|..|+
T Consensus        19 ~C~C~~~~~G~~C~   32 (46)
T smart00180       19 QCECKPNVTGRRCD   32 (46)
T ss_pred             EEECCCCCCCCCCC
Confidence            67777887777765


No 55 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=65.98  E-value=6.2  Score=23.00  Aligned_cols=33  Identities=21%  Similarity=0.539  Sum_probs=21.7

Q ss_pred             CCCCCCCeeccCC-----CCceeeeCCC-------------CCccCCCcccc
Q psy4097          91 VTCEHGGICAPIG-----DRGYRCQCEP-------------GFTGESELLVP  124 (129)
Q Consensus        91 ~~C~~~~~C~~~~-----~~~~~C~C~~-------------g~~g~~c~~~~  124 (129)
                      +.|..++.| ++.     ..=|.|.|.+             .|.|..|++.+
T Consensus        13 n~CsgHG~C-~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkD   63 (103)
T PF12955_consen   13 NNCSGHGSC-VKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKD   63 (103)
T ss_pred             cCCCCCceE-eeccCCCccceEEEEeeccccccccccCceeeeccccccccc
Confidence            557666678 443     2448899977             57777787543


No 56 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=60.19  E-value=8.7  Score=18.95  Aligned_cols=9  Identities=67%  Similarity=1.811  Sum_probs=6.0

Q ss_pred             ecccCCCCcc
Q psy4097          15 KCGQCPHGWV   24 (129)
Q Consensus        15 ~C~~C~~g~~   24 (129)
                      .| .|++||.
T Consensus        38 ~C-~C~~g~~   46 (52)
T PF01683_consen   38 RC-QCPPGYV   46 (52)
T ss_pred             Ee-ECCCCCE
Confidence            56 6777766


No 57 
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=51.67  E-value=25  Score=19.80  Aligned_cols=8  Identities=0%  Similarity=0.156  Sum_probs=3.2

Q ss_pred             cCCCCcee
Q psy4097         101 PIGDRGYR  108 (129)
Q Consensus       101 ~~~~~~~~  108 (129)
                      -+....|.
T Consensus        41 GsDg~TY~   48 (86)
T cd01328          41 TNDNETFD   48 (86)
T ss_pred             CCCCCCcc
Confidence            33334443


No 58 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=44.44  E-value=5.4  Score=19.95  Aligned_cols=17  Identities=35%  Similarity=0.986  Sum_probs=10.9

Q ss_pred             cccCCCCcccCCC-CcccC
Q psy4097          16 CGQCPHGWVGDGT-TCRQG   33 (129)
Q Consensus        16 C~~C~~g~~~~~~-~C~~~   33 (129)
                      | .|++||..+.. .|...
T Consensus        35 C-~C~~G~v~~~~~~CV~~   52 (55)
T PF01826_consen   35 C-FCPPGYVRNDNGRCVPP   52 (55)
T ss_dssp             E-EETTTEEEETTSEEEEG
T ss_pred             C-CCCCCeeEcCCCCEEcH
Confidence            7 88888884444 55543


No 59 
>KOG1836|consensus
Probab=42.18  E-value=43  Score=29.49  Aligned_cols=60  Identities=27%  Similarity=0.616  Sum_probs=31.1

Q ss_pred             eeeCCCceecccCCCCcccCCCCcccCCCCccCCCCCCCCCCeeeeCCCCCcceeeec-CCCCCcCC
Q psy4097           7 CINTQGYRKCGQCPHGWVGDGTTCRQGTTGCSVQNGGCHPLATCRETSDTVRSVISCT-CPPGMGGS   72 (129)
Q Consensus         7 C~~~~g~~~C~~C~~g~~~~~~~C~~~~~~C~~~~~~C~~~~~C~~~~~g~~~~~~c~-c~~g~~g~   72 (129)
                      |+....+-.|+.|..||.++...  +..+.|..=  +|.+.+.|..+. .. ....|. |+++|+|.
T Consensus       749 C~~~t~G~~C~~C~~GfYg~~~~--~~~~dC~~C--~Cp~~~~~~~~~-~~-~~~iCk~Cp~gytG~  809 (1705)
T KOG1836|consen  749 CKHNTFGGQCAQCVDGFYGLPDL--GTSGDCQPC--PCPNGGACGQTP-EI-LEVVCKNCPPGYTGL  809 (1705)
T ss_pred             cccCCCCCchhhhcCCCCCcccc--CCCCCCccC--CCCCChhhcCcC-cc-cceecCCCCCCCccc
Confidence            33333345676899999832211  100114432  555555555433 11 125687 89998877


No 60 
>KOG3509|consensus
Probab=38.04  E-value=34  Score=28.15  Aligned_cols=38  Identities=34%  Similarity=0.737  Sum_probs=29.4

Q ss_pred             CCCCCCCCCCCCeeccCCCCceeeeCCCCCccCCCcccc
Q psy4097          86 NPCGGVTCEHGGICAPIGDRGYRCQCEPGFTGESELLVP  124 (129)
Q Consensus        86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~~  124 (129)
                      +.|+..+|.+.+.| ........|.|++||.|..|+.-+
T Consensus       407 ~~c~~~p~~~~g~c-~p~~~~~~c~c~~g~~G~~c~d~~  444 (964)
T KOG3509|consen  407 DVCWRIPCQHDGPC-LQTLEGKQCLCPPGYTGDSCEDCM  444 (964)
T ss_pred             CccccccCCCCccc-cccccccceeccccccCchhhccC
Confidence            45666778777777 677777889999999999987543


No 61 
>PF02822 Antistasin:  Antistasin family;  InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=33.37  E-value=23  Score=14.98  Aligned_cols=14  Identities=36%  Similarity=0.869  Sum_probs=7.2

Q ss_pred             cCCCCcccCCCCcc
Q psy4097          18 QCPHGWVGDGTTCR   31 (129)
Q Consensus        18 ~C~~g~~~~~~~C~   31 (129)
                      .|+.||..|...|+
T Consensus         9 ~C~~Gf~~D~~GC~   22 (26)
T PF02822_consen    9 YCPYGFKTDENGCP   22 (26)
T ss_dssp             --TT-EEE-TTSSE
T ss_pred             cCCCcccCCCCCCC
Confidence            68899987666664


No 62 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=33.00  E-value=22  Score=18.15  Aligned_cols=21  Identities=19%  Similarity=0.469  Sum_probs=13.6

Q ss_pred             CCCceeeeCCCCCccCCCccc
Q psy4097         103 GDRGYRCQCEPGFTGESELLV  123 (129)
Q Consensus       103 ~~~~~~C~C~~g~~g~~c~~~  123 (129)
                      ..|...|.|..-|.|..|...
T Consensus        32 ~dG~p~CECn~Cy~GpdCS~~   52 (56)
T PF04863_consen   32 ADGSPVCECNSCYGGPDCSTL   52 (56)
T ss_dssp             ETTEE--EE-TTEESTTS-EE
T ss_pred             ccCCccccccCCcCCCCcccC
Confidence            456678999999999999754


No 63 
>PF09289 FOLN:  Follistatin/Osteonectin-like EGF domain;  InterPro: IPR015369 This domain is predominantly found in osteonectin and follistatin. They adopt an EGF-like structure [, ]. Follistatin is involved in diverse activities from embryonic development to cell secretion. ; GO: 0005515 protein binding; PDB: 1LR7_A 1LR8_A 1LR9_A 2ARP_F 3B4V_H 2KCX_A 3SEK_C 2P6A_D 3HH2_C 2B0U_D ....
Probab=29.27  E-value=43  Score=13.66  Aligned_cols=8  Identities=38%  Similarity=1.377  Sum_probs=3.0

Q ss_pred             CCCCCCee
Q psy4097          92 TCEHGGIC   99 (129)
Q Consensus        92 ~C~~~~~C   99 (129)
                      .|..+..|
T Consensus         5 ~Ck~GKvC   12 (22)
T PF09289_consen    5 HCKRGKVC   12 (22)
T ss_dssp             --BTTEEE
T ss_pred             ccCCCCEe
Confidence            44444455


No 64 
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=28.73  E-value=59  Score=13.72  Aligned_cols=19  Identities=32%  Similarity=0.725  Sum_probs=9.0

Q ss_pred             CCCCCCCeeccCCCCceee
Q psy4097          91 VTCEHGGICAPIGDRGYRC  109 (129)
Q Consensus        91 ~~C~~~~~C~~~~~~~~~C  109 (129)
                      ..|..+..|++...+.-+|
T Consensus         5 v~C~~G~~C~~d~~g~p~C   23 (26)
T smart00274        5 VQCPFGKVCVVDKGGNARC   23 (26)
T ss_pred             EECCCCCEEEeCCCCCEEE
Confidence            3455566663333444444


No 65 
>KOG3514|consensus
Probab=25.98  E-value=49  Score=27.81  Aligned_cols=37  Identities=22%  Similarity=0.538  Sum_probs=28.2

Q ss_pred             CCCCCCCCCCCCeeccCCCCceeeeCCC-CCccCCCccc
Q psy4097          86 NPCGGVTCEHGGICAPIGDRGYRCQCEP-GFTGESELLV  123 (129)
Q Consensus        86 ~~C~~~~C~~~~~C~~~~~~~~~C~C~~-g~~g~~c~~~  123 (129)
                      ..|..+-|.+++.| +....+|.|.|.. .|.|..|...
T Consensus      1019 t~c~~~acanhG~c-~q~w~~~~c~csmtS~~Gp~C~d~ 1056 (1591)
T KOG3514|consen 1019 TTCSEDACANHGVC-IQQWNGIACDCSMTSYSGPRCNDP 1056 (1591)
T ss_pred             cccchhhhhcccee-eeeecceeeeccccccCCCccCCC
Confidence            34555668788889 8888899999975 6889888643


No 66 
>KOG0196|consensus
Probab=24.35  E-value=1.5e+02  Score=24.32  Aligned_cols=16  Identities=25%  Similarity=0.721  Sum_probs=12.5

Q ss_pred             cCCCCceeeeCCCCCc
Q psy4097         101 PIGDRGYRCQCEPGFT  116 (129)
Q Consensus       101 ~~~~~~~~C~C~~g~~  116 (129)
                      ....++..|.|..||-
T Consensus       302 s~~ega~~C~C~~gyy  317 (996)
T KOG0196|consen  302 SSSEGATSCTCENGYY  317 (996)
T ss_pred             CCCCCCCcccccCCcc
Confidence            3567888899999973


No 67 
>KOG1218|consensus
Probab=21.76  E-value=2.8e+02  Score=19.04  Aligned_cols=15  Identities=40%  Similarity=0.919  Sum_probs=7.7

Q ss_pred             eeeCCCCCccCCCcc
Q psy4097         108 RCQCEPGFTGESELL  122 (129)
Q Consensus       108 ~C~C~~g~~g~~c~~  122 (129)
                      .|.|.+||.+..++.
T Consensus       163 ~c~c~~g~~g~~~~~  177 (316)
T KOG1218|consen  163 ICTCQPGFVGVFCVE  177 (316)
T ss_pred             ceeccCCcccccccc
Confidence            344555555555543


Done!