Query         psy9688
Match_columns 181
No_of_seqs    112 out of 1221
Neff          10.3
Searched_HMMs 46136
Date          Fri Aug 16 17:03:54 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9688.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9688hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1219|consensus               99.6 6.6E-16 1.4E-20  133.7   9.1  106   43-181  3865-3971(4289)
  2 KOG1214|consensus               99.4 8.1E-12 1.8E-16  101.3  10.8  139   16-181   692-856 (1289)
  3 KOG4289|consensus               99.3 9.8E-12 2.1E-16  105.4   6.2   91   57-180  1216-1308(2531)
  4 KOG1219|consensus               99.1 4.9E-10 1.1E-14   98.7   7.8   98   17-145  3865-3977(4289)
  5 KOG4289|consensus               99.0 1.5E-09 3.2E-14   92.6   8.2   74    2-77   1225-1316(2531)
  6 KOG1214|consensus               98.9 1.3E-08 2.7E-13   83.3   9.8  105    9-145   728-862 (1289)
  7 KOG1217|consensus               98.6 1.3E-06 2.8E-11   69.6  13.0  175    2-181   155-386 (487)
  8 KOG1225|consensus               98.6 4.5E-07 9.7E-12   72.2  10.1   80   50-145   256-341 (525)
  9 KOG1217|consensus               98.5 4.1E-06 8.9E-11   66.7  12.4  122   43-181   170-301 (487)
 10 PF07645 EGF_CA:  Calcium-bindi  98.3 7.3E-07 1.6E-11   46.9   2.6   32  148-181     1-33  (42)
 11 smart00179 EGF_CA Calcium-bind  98.0 1.7E-05 3.6E-10   40.7   4.4   36   41-76      1-38  (39)
 12 PF00008 EGF:  EGF-like domain   98.0 4.2E-06 9.1E-11   41.2   1.4   30   45-74      1-31  (32)
 13 PF00008 EGF:  EGF-like domain   97.9   9E-06   2E-10   40.0   2.0   31  112-143     1-32  (32)
 14 KOG4260|consensus               97.9 1.4E-05   3E-10   58.0   3.5  124   47-181   149-303 (350)
 15 KOG1225|consensus               97.8 0.00015 3.3E-09   58.1   8.9   96   63-181   234-335 (525)
 16 PF07645 EGF_CA:  Calcium-bindi  97.8 1.9E-05 4.1E-10   41.4   2.0   32   41-72      1-34  (42)
 17 smart00179 EGF_CA Calcium-bind  97.7 7.2E-05 1.6E-09   38.3   4.1   35  109-144     2-38  (39)
 18 cd00054 EGF_CA Calcium-binding  97.7 0.00011 2.3E-09   37.2   4.3   35   42-76      2-37  (38)
 19 cd00054 EGF_CA Calcium-binding  97.4 0.00033 7.2E-09   35.3   4.0   34  110-144     3-37  (38)
 20 KOG4260|consensus               97.3 0.00014 3.1E-09   52.9   2.1   69    3-72    222-304 (350)
 21 cd00053 EGF Epidermal growth f  97.2 0.00097 2.1E-08   33.0   4.2   30   47-76      5-35  (36)
 22 smart00181 EGF Epidermal growt  97.2   0.001 2.2E-08   33.1   4.1   31   45-76      2-34  (35)
 23 KOG1226|consensus               97.1  0.0097 2.1E-07   49.5  10.9  107   49-165   468-603 (783)
 24 PF12947 EGF_3:  EGF domain;  I  96.9 0.00058 1.3E-08   34.4   1.6   24  156-181     6-29  (36)
 25 cd00053 EGF Epidermal growth f  96.9  0.0025 5.3E-08   31.5   3.9   30  114-144     5-35  (36)
 26 PF12662 cEGF:  Complement Clr-  96.8  0.0014 3.1E-08   29.7   2.3   20   62-82      1-24  (24)
 27 smart00181 EGF Epidermal growt  96.8   0.003 6.4E-08   31.3   3.8   28  115-144     6-34  (35)
 28 PF12947 EGF_3:  EGF domain;  I  96.8  0.0013 2.7E-08   33.1   2.0   26  116-142     7-32  (36)
 29 KOG1226|consensus               96.7   0.026 5.6E-07   47.1   9.9   89   49-146   515-621 (783)
 30 PF12662 cEGF:  Complement Clr-  96.5  0.0027 5.7E-08   28.8   2.1   20  130-151     1-24  (24)
 31 PF12661 hEGF:  Human growth fa  96.3  0.0022 4.7E-08   24.6   0.9   13  132-144     1-13  (13)
 32 PF07974 EGF_2:  EGF-like domai  95.7    0.02 4.4E-07   27.9   3.1   27   48-76      6-32  (32)
 33 PF07974 EGF_2:  EGF-like domai  95.2   0.031 6.7E-07   27.2   2.8   26  116-144     7-32  (32)
 34 PF14670 FXa_inhibition:  Coagu  95.2   0.016 3.6E-07   29.0   1.8   21  157-181     7-27  (36)
 35 PF14670 FXa_inhibition:  Coagu  93.1   0.071 1.5E-06   26.7   1.5   19   54-72     10-28  (36)
 36 cd01475 vWA_Matrilin VWA_Matri  92.5    0.16 3.6E-06   36.7   3.5   35  141-181   181-216 (224)
 37 PF12946 EGF_MSP1_1:  MSP1 EGF   92.3   0.099 2.2E-06   26.2   1.4   28  113-140     3-30  (37)
 38 KOG0994|consensus               88.9     1.7 3.7E-05   38.8   6.5   17   62-78    933-949 (1758)
 39 cd01475 vWA_Matrilin VWA_Matri  88.7    0.56 1.2E-05   33.9   3.3   35   39-73    184-218 (224)
 40 PF06247 Plasmod_Pvs28:  Plasmo  88.0    0.17 3.7E-06   35.2   0.3   48  130-181   109-159 (197)
 41 KOG3516|consensus               87.7    0.59 1.3E-05   41.4   3.2   44   39-82    542-586 (1306)
 42 PHA03099 epidermal growth fact  87.4    0.81 1.8E-05   29.8   3.0   28   49-77     52-81  (139)
 43 PHA02887 EGF-like protein; Pro  87.2    0.86 1.9E-05   29.2   3.0   27   50-77     94-122 (126)
 44 smart00051 DSL delta serrate l  85.5     1.7 3.7E-05   24.7   3.4   42  131-181    17-58  (63)
 45 PF06247 Plasmod_Pvs28:  Plasmo  83.0    0.61 1.3E-05   32.6   1.0   88   48-140    50-160 (197)
 46 PHA03099 epidermal growth fact  80.2     2.1 4.6E-05   27.9   2.7   33  148-181    41-75  (139)
 47 PHA02887 EGF-like protein; Pro  79.1     2.6 5.6E-05   27.1   2.7   31  150-181    84-116 (126)
 48 KOG3514|consensus               68.2     4.3 9.2E-05   36.2   2.3   37   44-80    625-662 (1591)
 49 KOG0994|consensus               64.2      27 0.00059   31.9   6.3   58  123-181   878-942 (1758)
 50 KOG3516|consensus               62.4       7 0.00015   35.2   2.6   38  108-146   544-582 (1306)
 51 PF01683 EB:  EB module;  Inter  61.3      21 0.00045   19.0   3.6   20   49-72     27-46  (52)
 52 PF12955 DUF3844:  Domain of un  60.8      12 0.00026   23.6   2.8   22  116-137    14-39  (103)
 53 PF00954 S_locus_glycop:  S-loc  57.7      15 0.00033   23.2   3.0   31  148-181    76-106 (110)
 54 KOG1836|consensus               55.7      31 0.00066   33.0   5.5   45  134-181   760-806 (1705)
 55 cd00055 EGF_Lam Laminin-type e  53.8      19 0.00042   19.1   2.6   14   64-77     20-33  (50)
 56 PF00053 Laminin_EGF:  Laminin   51.5     9.7 0.00021   20.1   1.2   16  130-145    17-32  (49)
 57 KOG3514|consensus               47.6      16 0.00034   32.9   2.3   36  110-146   624-660 (1591)
 58 PF09064 Tme5_EGF_like:  Thromb  47.2      12 0.00026   18.4   1.0    8  174-181    19-26  (34)
 59 smart00180 EGF_Lam Laminin-typ  36.2      31 0.00067   18.0   1.6   15  131-145    18-32  (46)
 60 KOG1218|consensus               32.7      83  0.0018   23.7   4.1   39  131-170   162-200 (316)
 61 KOG3509|consensus               22.3 2.4E+02  0.0053   25.6   5.4   36  110-146   407-442 (964)
 62 KOG3512|consensus               21.7 4.6E+02    0.01   21.8   6.3   25   54-78    285-310 (592)
 63 PF04863 EGF_alliinase:  Alliin  21.1      49  0.0011   18.3   0.7   18  129-146    34-51  (56)

No 1  
>KOG1219|consensus
Probab=99.64  E-value=6.6e-16  Score=133.73  Aligned_cols=106  Identities=38%  Similarity=0.940  Sum_probs=92.1

Q ss_pred             cCCCCCCCCCCCEEecCC-CCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCC
Q psy9688          43 DHCLTDPCLGHGTCINRS-DRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDC  121 (181)
Q Consensus        43 ~~c~~~~c~~~~~C~~~~-~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~  121 (181)
                      ++|..+||+++|+|.... +.|.|.|++.|+|+.|+.++.+|                             ..+||..++
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC-----------------------------~snPC~~Gg 3915 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPC-----------------------------ASNPCLTGG 3915 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccc-----------------------------cCCCCCCCC
Confidence            789999999999998754 57999999999999998665444                             455667777


Q ss_pred             EEeeCCCCceEeecCCCCccCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         122 VCVDDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       122 ~C~~~~~~~~~C~C~~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      +|.. ..+.|.|.|+.||+|+.|+...+++|.. ++|.++|.|+|++|+  |+|.|-+||
T Consensus      3916 tCip-~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~-n~C~~gg~C~n~~gs--f~CncT~g~ 3971 (4289)
T KOG1219|consen 3916 TCIP-FYNGFLCNCPNGYTGKRCEARGISECSK-NVCGTGGQCINIPGS--FHCNCTPGI 3971 (4289)
T ss_pred             EEEe-cCCCeeEeCCCCccCceeeccccccccc-ccccCCceeeccCCc--eEeccChhH
Confidence            8877 7888999999999999999855899988 999999999999999  999998764


No 2  
>KOG1214|consensus
Probab=99.36  E-value=8.1e-12  Score=101.27  Aligned_cols=139  Identities=27%  Similarity=0.693  Sum_probs=86.6

Q ss_pred             cccccCCCCCC-CCceeecCCC-Cc--------------cccCcCCCC--CCCCCCCEEecCCCCeeEeCCCCCc----C
Q psy9688          16 INECETVKDVC-NYGICVNTNG-NG--------------NSKVDHCLT--DPCLGHGTCINRSDRYQCLCTPRYS----G   73 (181)
Q Consensus        16 ~~~c~~~~~~C-~~~~C~~~~~-~~--------------~~~~~~c~~--~~c~~~~~C~~~~~~~~C~c~~g~~----g   73 (181)
                      +++|..+.+.| ....|....+ .|              +.|.++|..  ..|..+..|++..++|.|.|..||.    +
T Consensus       692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~  771 (1289)
T KOG1214|consen  692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDR  771 (1289)
T ss_pred             cccceecCcccCCCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCC
Confidence            45555555556 5556654432 11              455555543  3477777777777777777777764    2


Q ss_pred             CCCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCC--CCEEeeCCCCceEeecCCCCcc--CCccccCC
Q psy9688          74 NNCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLN--DCVCVDDGNLSFYCNCTEDFTG--EYCQFENS  149 (181)
Q Consensus        74 ~~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~--~~~C~~~~~~~~~C~C~~g~~g--~~c~~~~~  149 (181)
                      ..|..-.+.-..++|..+                      ...|.-  .+.|+....+.|+|+|.+||+|  ..|.  ++
T Consensus       772 ~tCV~i~~pap~n~Ce~g----------------------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~--dv  827 (1289)
T KOG1214|consen  772 HTCVLITPPAPANPCEDG----------------------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT--DV  827 (1289)
T ss_pred             cceEEecCCCCCCccccC----------------------ccccCcCCceEEEecCCceEEEeecCCccCCccccc--cc
Confidence            333311111112233221                      112332  3455553456799999999985  5565  78


Q ss_pred             CCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         150 AACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       150 ~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      |+|.. +-|+..++|++++++  |.|.|++||
T Consensus       828 DeC~p-srChp~A~Cyntpgs--fsC~C~pGy  856 (1289)
T KOG1214|consen  828 DECSP-SRCHPAATCYNTPGS--FSCRCQPGY  856 (1289)
T ss_pred             cccCc-cccCCCceEecCCCc--ceeecccCc
Confidence            99987 889999999999998  889999997


No 3  
>KOG4289|consensus
Probab=99.25  E-value=9.8e-12  Score=105.36  Aligned_cols=91  Identities=32%  Similarity=0.775  Sum_probs=72.8

Q ss_pred             ecCCCCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecC
Q psy9688          57 INRSDRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCT  136 (181)
Q Consensus        57 ~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~  136 (181)
                      ++..+...|.|++||+|..|+.+|+.|                             -+.||.+++.|.. ..+.|+|.|.
T Consensus      1216 i~pvnglrCrCPpGFTgd~CeTeiDlC-----------------------------Ys~pC~nng~C~s-rEggYtCeCr 1265 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYCETEIDLC-----------------------------YSGPCGNNGRCRS-REGGYTCECR 1265 (2531)
T ss_pred             ccccCceeEeCCCCCCcccccchhHhh-----------------------------hcCCCCCCCceEE-ecCceeEEec
Confidence            345677899999999999999776555                             3456677778887 8999999999


Q ss_pred             CCCccCCccccCC--CCCCCCCCCCCCCEEeeCCCCcceEEecCCC
Q psy9688         137 EDFTGEYCQFENS--AACVTLNPCQNNATCVASPGEKQITCLCLKE  180 (181)
Q Consensus       137 ~g~~g~~c~~~~~--~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g  180 (181)
                      +||+|.+|+. +.  -.|.. ..|.++++|.+..-+ -|.|+|+.|
T Consensus      1266 pg~tGehCEv-s~~agrCvp-GvC~nggtC~~~~ng-gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1266 PGFTGEHCEV-SARAGRCVP-GVCKNGGTCVNLLNG-GFCCHCPYG 1308 (2531)
T ss_pred             CCccccceee-ecccCcccc-ceecCCCEEeecCCC-ceeccCCCc
Confidence            9999999985 33  24766 889999999876543 388999876


No 4  
>KOG1219|consensus
Probab=99.06  E-value=4.9e-10  Score=98.75  Aligned_cols=98  Identities=32%  Similarity=0.789  Sum_probs=82.6

Q ss_pred             ccccCCCCCC-CCceeecCCC-Cc-------------cccCcCCCCCCCCCCCEEecCCCCeeEeCCCCCcCCCCCCCCC
Q psy9688          17 NECETVKDVC-NYGICVNTNG-NG-------------NSKVDHCLTDPCLGHGTCINRSDRYQCLCTPRYSGNNCEKDNG   81 (181)
Q Consensus        17 ~~c~~~~~~C-~~~~C~~~~~-~~-------------~~~~~~c~~~~c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~~   81 (181)
                      ++|.  .++| .++.|...++ +|             ..++.+|.++||..+++|+...+.|.|.|+.||+|..|+.+  
T Consensus      3865 d~C~--~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~-- 3940 (4289)
T KOG1219|consen 3865 DPCN--DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR-- 3940 (4289)
T ss_pred             cccc--cCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc--
Confidence            6676  7889 9999998874 44             45778899999999999999999999999999999999854  


Q ss_pred             CCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecCCCCccCCcc
Q psy9688          82 DCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCTEDFTGEYCQ  145 (181)
Q Consensus        82 ~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c~  145 (181)
                                                .+++|+.++|..++.|.+ ..|+|+|.|.+|+.|..|.
T Consensus      3941 --------------------------Gi~eCs~n~C~~gg~C~n-~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3941 --------------------------GISECSKNVCGTGGQCIN-IPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred             --------------------------cccccccccccCCceeec-cCCceEeccChhHhcccCc
Confidence                                      134556667788888988 8999999999999998875


No 5  
>KOG4289|consensus
Probab=99.00  E-value=1.5e-09  Score=92.64  Aligned_cols=74  Identities=27%  Similarity=0.645  Sum_probs=60.3

Q ss_pred             CCCCCcccCCCCCCcccccCCCCCC-CCceeecCCCCc---c------------ccCcCCCCCCCCCCCEEecCC-CCee
Q psy9688           2 YFKPISLSTLCESDINECETVKDVC-NYGICVNTNGNG---N------------SKVDHCLTDPCLGHGTCINRS-DRYQ   64 (181)
Q Consensus         2 ~C~~g~~g~~C~~~~~~c~~~~~~C-~~~~C~~~~~~~---~------------~~~~~c~~~~c~~~~~C~~~~-~~~~   64 (181)
                      .|++||+|..|++.++.|.  ..+| +++.|+...|+|   |            .....|.+.-|.++++|++.. +.+.
T Consensus      1225 rCPpGFTgd~CeTeiDlCY--s~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~ 1302 (2531)
T KOG4289|consen 1225 RCPPGFTGDYCETEIDLCY--SGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFC 1302 (2531)
T ss_pred             eCCCCCCcccccchhHhhh--cCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCcee
Confidence            5899999999999999999  8999 999999999998   1            123457777799999999854 5789


Q ss_pred             EeCCCC-CcCCCCC
Q psy9688          65 CLCTPR-YSGNNCE   77 (181)
Q Consensus        65 C~c~~g-~~g~~c~   77 (181)
                      |.|+.| |++..|+
T Consensus      1303 c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1303 CHCPYGEFEDPRCE 1316 (2531)
T ss_pred             ccCCCcccCCCceE
Confidence            999988 3355554


No 6  
>KOG1214|consensus
Probab=98.90  E-value=1.3e-08  Score=83.28  Aligned_cols=105  Identities=29%  Similarity=0.733  Sum_probs=75.7

Q ss_pred             cCCCCCCcccccCCCCCC-CCceeecCCCCc----------------c------ccCcCCCC--CCCCCCC--EEec-CC
Q psy9688           9 STLCESDINECETVKDVC-NYGICVNTNGNG----------------N------SKVDHCLT--DPCLGHG--TCIN-RS   60 (181)
Q Consensus         9 g~~C~~~~~~c~~~~~~C-~~~~C~~~~~~~----------------~------~~~~~c~~--~~c~~~~--~C~~-~~   60 (181)
                      |..|. |+++|+...+.| .++.|++.+++|                |      ...+.|..  +.|...+  .|+. +.
T Consensus       728 gr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGg  806 (1289)
T KOG1214|consen  728 GRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGG  806 (1289)
T ss_pred             CCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCC
Confidence            57895 999999888899 999999999987                1      11233322  2244433  4554 44


Q ss_pred             CCeeEeCCCCCcCC--CCCCCCCCCCCCCCCCCcEEeeCCeeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecCCC
Q psy9688          61 DRYQCLCTPRYSGN--NCEKDNGDCSSNPCLNDGVCVDEVYKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCTED  138 (181)
Q Consensus        61 ~~~~C~c~~g~~g~--~c~~~~~~c~~~~c~~~~~C~~~~~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~~g  138 (181)
                      +.|+|.|.+||.|.  .|.                              ++++|.+.-|+..+.|.+ +.+++.|.|.+|
T Consensus       807 s~y~C~CLPGfsGDG~~c~------------------------------dvDeC~psrChp~A~Cyn-tpgsfsC~C~pG  855 (1289)
T KOG1214|consen  807 STYSCACLPGFSGDGHQCT------------------------------DVDECSPSRCHPAATCYN-TPGSFSCRCQPG  855 (1289)
T ss_pred             ceEEEeecCCccCCccccc------------------------------cccccCccccCCCceEec-CCCcceeecccC
Confidence            57999999999853  333                              446666666788889998 899999999999


Q ss_pred             CccCCcc
Q psy9688         139 FTGEYCQ  145 (181)
Q Consensus       139 ~~g~~c~  145 (181)
                      |.|+.-+
T Consensus       856 y~GDGf~  862 (1289)
T KOG1214|consen  856 YYGDGFQ  862 (1289)
T ss_pred             ccCCCce
Confidence            9865443


No 7  
>KOG1217|consensus
Probab=98.61  E-value=1.3e-06  Score=69.65  Aligned_cols=175  Identities=33%  Similarity=0.754  Sum_probs=117.3

Q ss_pred             CCCCCcccCCCCCCcccccCCCCCC-CCceeecCCCCc------------ccc--------------------CcCCCC-
Q psy9688           2 YFKPISLSTLCESDINECETVKDVC-NYGICVNTNGNG------------NSK--------------------VDHCLT-   47 (181)
Q Consensus         2 ~C~~g~~g~~C~~~~~~c~~~~~~C-~~~~C~~~~~~~------------~~~--------------------~~~c~~-   47 (181)
                      .|..||.+..+....++|......| ..+.|.+..++|            +..                    .+.|.. 
T Consensus       155 ~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~  234 (487)
T KOG1217|consen  155 SCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVS  234 (487)
T ss_pred             eeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccc
Confidence            4788888888875547887556678 888898887765            110                    011111 


Q ss_pred             -CCCCCC-CEEecCCCCeeEeCCCCCcCCCC--CCCCCCCCCCC-CCCCcEEeeCC----------eecccc-ccCCCCC
Q psy9688          48 -DPCLGH-GTCINRSDRYQCLCTPRYSGNNC--EKDNGDCSSNP-CLNDGVCVDEV----------YKGRYW-ELPEIRD  111 (181)
Q Consensus        48 -~~c~~~-~~C~~~~~~~~C~c~~g~~g~~c--~~~~~~c~~~~-c~~~~~C~~~~----------~~g~~c-~~~~~~~  111 (181)
                       ..+... +.|++..+.+.|.+.+||.+..+  ..+++.|.... |.+.+.|++..          |.+..+ ...+...
T Consensus       235 ~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~  314 (487)
T KOG1217|consen  235 IVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDE  314 (487)
T ss_pred             cccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCcccccccc
Confidence             112222 67888888899999999997762  23677787653 77788888764          555555 2223345


Q ss_pred             CC----CCCCCCCCEEee-CCCCceEeecCCCCccCCccccCC-CCCCCCCCCCCCCEEee-CCCCcceEEecCCCC
Q psy9688         112 CT----SNPCLNDCVCVD-DGNLSFYCNCTEDFTGEYCQFENS-AACVTLNPCQNNATCVA-SPGEKQITCLCLKEV  181 (181)
Q Consensus       112 c~----~~~c~~~~~C~~-~~~~~~~C~C~~g~~g~~c~~~~~-~~c~~~~~C~~~~~C~~-~~g~~~~~C~C~~g~  181 (181)
                      |.    ..+|..++.|.. .....+.|.|..+|.|..|+  .. ++|.. ..+..++.|++ ..+.  +.|.|..+|
T Consensus       315 C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~--~~~~~C~~-~~~~~~~~c~~~~~~~--~~c~~~~~~  386 (487)
T KOG1217|consen  315 CSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCE--DSNDECAS-SPCCPGGTCVNETPGS--YRCACPAGF  386 (487)
T ss_pred             ccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccc--cCCccccC-CccccCCEeccCCCCC--eEecCCCcc
Confidence            53    345777777721 13345779999999999998  44 47877 56888999998 5776  889988765


No 8  
>KOG1225|consensus
Probab=98.61  E-value=4.5e-07  Score=72.19  Aligned_cols=80  Identities=38%  Similarity=1.036  Sum_probs=42.0

Q ss_pred             CCCCCEEecCCCCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCC------eeccccccCCCCCCCCCCCCCCCEE
Q psy9688          50 CLGHGTCINRSDRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEV------YKGRYWELPEIRDCTSNPCLNDCVC  123 (181)
Q Consensus        50 c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~------~~g~~c~~~~~~~c~~~~c~~~~~C  123 (181)
                      |..++.|++.    +|+|++||+|..|.+  ..|... |+.++.+++..      |.|..|..   ..| +.+|..++.|
T Consensus       256 c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g~CiC~~g~~G~dCs~---~~c-padC~g~G~C  324 (525)
T KOG1225|consen  256 CTGRGQCVEG----RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDGECICNPGYSGKDCSI---RRC-PADCSGHGKC  324 (525)
T ss_pred             CcccceEeCC----eEeCCCCCcCCCCCc--ccCCcc-cCCCceecCCEeecCCCcccccccc---ccC-CccCCCCCcc
Confidence            4445566655    677777777776653  223222 33333333222      55555542   122 2456666666


Q ss_pred             eeCCCCceEeecCCCCccCCcc
Q psy9688         124 VDDGNLSFYCNCTEDFTGEYCQ  145 (181)
Q Consensus       124 ~~~~~~~~~C~C~~g~~g~~c~  145 (181)
                      +.     -+|.|.+||+|..|.
T Consensus       325 i~-----G~C~C~~Gy~G~~C~  341 (525)
T KOG1225|consen  325 ID-----GECLCDEGYTGELCI  341 (525)
T ss_pred             cC-----CceEeCCCCcCCccc
Confidence            53     247777777777665


No 9  
>KOG1217|consensus
Probab=98.47  E-value=4.1e-06  Score=66.75  Aligned_cols=122  Identities=37%  Similarity=0.895  Sum_probs=82.0

Q ss_pred             cCCC--CCCCCCCCEEecCCCCeeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeC------C-eeccccccCCCCCCC
Q psy9688          43 DHCL--TDPCLGHGTCINRSDRYQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDE------V-YKGRYWELPEIRDCT  113 (181)
Q Consensus        43 ~~c~--~~~c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~------~-~~g~~c~~~~~~~c~  113 (181)
                      ++|.  ..+|.+.+.|.+..+.|.|.|+++|.+..++..         .....|...      . +.+..+.. .+..+.
T Consensus       170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------~~~~~c~~~~~~~~~~g~~~~~c~~-~~~~~~  239 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------GNGGTCVDSVACSCPPGARGPECEV-SIVECA  239 (487)
T ss_pred             cccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC---------CCCceEecceeccCCCCCCCCCccc-cccccc
Confidence            5675  345888889998888899999999998887633         111222211      0 11222221 122222


Q ss_pred             CCCCCCCCEEeeCCCCceEeecCCCCccCCc-cccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         114 SNPCLNDCVCVDDGNLSFYCNCTEDFTGEYC-QFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       114 ~~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c-~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      ..   . ++|++ ..+.+.|.|++||.+..+ ...+++.|.....|.++++|++..+.  |.|.|++||
T Consensus       240 ~~---~-~~c~~-~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~--~~C~C~~g~  301 (487)
T KOG1217|consen  240 SG---D-GTCVN-TVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGS--YRCTCPPGF  301 (487)
T ss_pred             CC---C-Ccccc-cCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCc--ceeeCCCCC
Confidence            11   2 78888 788899999999998762 21267888772239999999999987  889999987


No 10 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.28  E-value=7.3e-07  Score=46.88  Aligned_cols=32  Identities=28%  Similarity=0.707  Sum_probs=28.1

Q ss_pred             CCCCCCC-CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         148 NSAACVT-LNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       148 ~~~~c~~-~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      ||++|.. .+.|..++.|+|+.|+  |.|.|++||
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gs--y~C~C~~Gy   33 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGS--YSCSCPPGY   33 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTE--EEEEESTTE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCC--EEeeCCCCc
Confidence            5788876 4679889999999999  899999997


No 11 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.00  E-value=1.7e-05  Score=40.70  Aligned_cols=36  Identities=44%  Similarity=1.231  Sum_probs=30.6

Q ss_pred             cCcCCCC-CCCCCCCEEecCCCCeeEeCCCCCc-CCCC
Q psy9688          41 KVDHCLT-DPCLGHGTCINRSDRYQCLCTPRYS-GNNC   76 (181)
Q Consensus        41 ~~~~c~~-~~c~~~~~C~~~~~~~~C~c~~g~~-g~~c   76 (181)
                      ++++|.. .+|.++++|++..+.|.|.|++||. |..|
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence            3567866 7899888999999999999999998 7765


No 12 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.95  E-value=4.2e-06  Score=41.16  Aligned_cols=30  Identities=43%  Similarity=1.166  Sum_probs=26.5

Q ss_pred             CCCCCCCCCCEEecCC-CCeeEeCCCCCcCC
Q psy9688          45 CLTDPCLGHGTCINRS-DRYQCLCTPRYSGN   74 (181)
Q Consensus        45 c~~~~c~~~~~C~~~~-~~~~C~c~~g~~g~   74 (181)
                      |.+.+|.++++|+... +.|.|.|++||+|+
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            4567999999999887 88999999999885


No 13 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.90  E-value=9e-06  Score=39.97  Aligned_cols=31  Identities=39%  Similarity=0.954  Sum_probs=25.8

Q ss_pred             CCCCCCCCCCEEeeCCC-CceEeecCCCCccCC
Q psy9688         112 CTSNPCLNDCVCVDDGN-LSFYCNCTEDFTGEY  143 (181)
Q Consensus       112 c~~~~c~~~~~C~~~~~-~~~~C~C~~g~~g~~  143 (181)
                      |.+.+|.++++|++ .. +.|+|.|++||+|+.
T Consensus         1 C~~~~C~n~g~C~~-~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCID-LPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEE-ESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEe-CCCCCEEeECCCCCccCC
Confidence            34568899999998 66 899999999999863


No 14 
>KOG4260|consensus
Probab=97.88  E-value=1.4e-05  Score=57.95  Aligned_cols=124  Identities=27%  Similarity=0.696  Sum_probs=74.9

Q ss_pred             CCCCCCCCEEecC---CCCeeEeCCCCCcCCCCCCCC--------CCCCC--CCCCC--CcEEeeCC----------e--
Q psy9688          47 TDPCLGHGTCINR---SDRYQCLCTPRYSGNNCEKDN--------GDCSS--NPCLN--DGVCVDEV----------Y--   99 (181)
Q Consensus        47 ~~~c~~~~~C~~~---~~~~~C~c~~g~~g~~c~~~~--------~~c~~--~~c~~--~~~C~~~~----------~--   99 (181)
                      ..+|+.++.|...   .++-.|.|.+||.|..|..-.        ++-..  ..|..  .+.|....          |  
T Consensus       149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~l  228 (350)
T KOG4260|consen  149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKL  228 (350)
T ss_pred             cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhccccee
Confidence            3568888888632   245689999999998885210        00000  00110  01221111          2  


Q ss_pred             eccccccCCCCCCC--CCCCCCCCEEeeCCCCceEeecCCCCccC--CccccCCCCCCCCCCCCCCCEEeeCCCCcceEE
Q psy9688         100 KGRYWELPEIRDCT--SNPCLNDCVCVDDGNLSFYCNCTEDFTGE--YCQFENSAACVTLNPCQNNATCVASPGEKQITC  175 (181)
Q Consensus       100 ~g~~c~~~~~~~c~--~~~c~~~~~C~~~~~~~~~C~C~~g~~g~--~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C  175 (181)
                      +-..|.  ++++|.  +.||.....|+| +.|+|.|.+.+||.+.  .|+. -.+.|.     ..+..|++++++  |.|
T Consensus       229 de~gCv--DvnEC~~ep~~c~~~qfCvN-teGSf~C~dk~Gy~~g~d~C~~-~~d~~~-----~kn~~c~ni~~~--~r~  297 (350)
T KOG4260|consen  229 DEEGCV--DVNECQNEPAPCKAHQFCVN-TEGSFKCEDKEGYKKGVDECQF-CADVCA-----SKNRPCMNIDGQ--YRC  297 (350)
T ss_pred             cccccc--cHHHHhcCCCCCChhheeec-CCCceEecccccccCChHHhhh-hhhhcc-----cCCCCcccCCcc--EEE
Confidence            122233  788886  357999999999 9999999999998742  2221 112222     244578899998  779


Q ss_pred             ecCCCC
Q psy9688         176 LCLKEV  181 (181)
Q Consensus       176 ~C~~g~  181 (181)
                      +|..|+
T Consensus       298 v~f~~~  303 (350)
T KOG4260|consen  298 VCFSGL  303 (350)
T ss_pred             Eecccc
Confidence            987764


No 15 
>KOG1225|consensus
Probab=97.83  E-value=0.00015  Score=58.08  Aligned_cols=96  Identities=30%  Similarity=0.761  Sum_probs=65.7

Q ss_pred             eeEeCCCCCcCCCCCCCCCCCCCCCCCCCcEEeeCC------eeccccccCCCCCCCCCCCCCCCEEeeCCCCceEeecC
Q psy9688          63 YQCLCTPRYSGNNCEKDNGDCSSNPCLNDGVCVDEV------YKGRYWELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCT  136 (181)
Q Consensus        63 ~~C~c~~g~~g~~c~~~~~~c~~~~c~~~~~C~~~~------~~g~~c~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~  136 (181)
                      ..|.|..+|.|..|+. + .| +..|...+.|+.-.      |.|..|..   ..|... |..++.+++   +  +|.|+
T Consensus       234 ~ic~c~~~~~g~~c~~-~-~C-~~~c~~~g~c~~G~CIC~~Gf~G~dC~e---~~Cp~~-cs~~g~~~~---g--~CiC~  301 (525)
T KOG1225|consen  234 GICECPEGYFGPLCST-I-YC-PGGCTGRGQCVEGRCICPPGFTGDDCDE---LVCPVD-CSGGGVCVD---G--ECICN  301 (525)
T ss_pred             ceeecCCceeCCcccc-c-cC-CCCCcccceEeCCeEeCCCCCcCCCCCc---ccCCcc-cCCCceecC---C--EeecC
Confidence            4789999999988872 2 24 34444445554432      77777752   344333 767677754   2  79999


Q ss_pred             CCCccCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         137 EDFTGEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       137 ~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      +||+|+.|+.   ..|.  .+|..++.|+  .|.    |.|.+||
T Consensus       302 ~g~~G~dCs~---~~cp--adC~g~G~Ci--~G~----C~C~~Gy  335 (525)
T KOG1225|consen  302 PGYSGKDCSI---RRCP--ADCSGHGKCI--DGE----CLCDEGY  335 (525)
T ss_pred             CCcccccccc---ccCC--ccCCCCCccc--CCc----eEeCCCC
Confidence            9999999963   4463  5899999998  544    9998887


No 16 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.75  E-value=1.9e-05  Score=41.41  Aligned_cols=32  Identities=38%  Similarity=1.030  Sum_probs=27.4

Q ss_pred             cCcCCCC--CCCCCCCEEecCCCCeeEeCCCCCc
Q psy9688          41 KVDHCLT--DPCLGHGTCINRSDRYQCLCTPRYS   72 (181)
Q Consensus        41 ~~~~c~~--~~c~~~~~C~~~~~~~~C~c~~g~~   72 (181)
                      |+++|..  ..|...+.|+|..++|.|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            5678855  3588889999999999999999997


No 17 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.74  E-value=7.2e-05  Score=38.26  Aligned_cols=35  Identities=43%  Similarity=1.019  Sum_probs=28.4

Q ss_pred             CCCCCC-CCCCCCCEEeeCCCCceEeecCCCCc-cCCc
Q psy9688         109 IRDCTS-NPCLNDCVCVDDGNLSFYCNCTEDFT-GEYC  144 (181)
Q Consensus       109 ~~~c~~-~~c~~~~~C~~~~~~~~~C~C~~g~~-g~~c  144 (181)
                      +++|.. .+|..+++|++ ..+.|.|.|++||. |..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~-~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVN-TVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeEC-CCCCeEeECCCCCccCCcC
Confidence            355655 57888889998 88899999999998 7766


No 18 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.70  E-value=0.00011  Score=37.22  Aligned_cols=35  Identities=46%  Similarity=1.268  Sum_probs=29.6

Q ss_pred             CcCCCC-CCCCCCCEEecCCCCeeEeCCCCCcCCCC
Q psy9688          42 VDHCLT-DPCLGHGTCINRSDRYQCLCTPRYSGNNC   76 (181)
Q Consensus        42 ~~~c~~-~~c~~~~~C~~~~~~~~C~c~~g~~g~~c   76 (181)
                      +++|.. .+|.+++.|++..+.|.|.|+.||.|..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            466765 68888889999999999999999998766


No 19 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.44  E-value=0.00033  Score=35.32  Aligned_cols=34  Identities=41%  Similarity=1.010  Sum_probs=27.4

Q ss_pred             CCCCC-CCCCCCCEEeeCCCCceEeecCCCCccCCc
Q psy9688         110 RDCTS-NPCLNDCVCVDDGNLSFYCNCTEDFTGEYC  144 (181)
Q Consensus       110 ~~c~~-~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c  144 (181)
                      ++|.. .+|..++.|++ ..+.|.|.|+.||.|..|
T Consensus         3 ~~C~~~~~C~~~~~C~~-~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           3 DECASGNPCQNGGTCVN-TVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             ccCCCCCCcCCCCEeEC-CCCCeEeECCCCCcCCcC
Confidence            45555 56777789988 888899999999998766


No 20 
>KOG4260|consensus
Probab=97.30  E-value=0.00014  Score=52.88  Aligned_cols=69  Identities=32%  Similarity=0.611  Sum_probs=44.4

Q ss_pred             CCCCcccC--CCCCCcccccCCCCCC-CCceeecCCCCc-ccc-------CcCCCC--CCC-CCCCEEecCCCCeeEeCC
Q psy9688           3 FKPISLST--LCESDINECETVKDVC-NYGICVNTNGNG-NSK-------VDHCLT--DPC-LGHGTCINRSDRYQCLCT   68 (181)
Q Consensus         3 C~~g~~g~--~C~~~~~~c~~~~~~C-~~~~C~~~~~~~-~~~-------~~~c~~--~~c-~~~~~C~~~~~~~~C~c~   68 (181)
                      |..||...  .|. |||+|...+.+| .+.+|+|+.|+| +.+       +|+|+.  .-| ..+..|+++.+.|.|+|.
T Consensus       222 CkkGW~lde~gCv-DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f  300 (350)
T KOG4260|consen  222 CKKGWKLDEEGCV-DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDGQYRCVCF  300 (350)
T ss_pred             hcccceecccccc-cHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCccEEEEec
Confidence            55666542  463 788887667777 777888888887 333       444532  112 344567777777888877


Q ss_pred             CCCc
Q psy9688          69 PRYS   72 (181)
Q Consensus        69 ~g~~   72 (181)
                      .++.
T Consensus       301 ~~~~  304 (350)
T KOG4260|consen  301 SGLI  304 (350)
T ss_pred             ccce
Confidence            7764


No 21 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.22  E-value=0.00097  Score=32.99  Aligned_cols=30  Identities=43%  Similarity=1.316  Sum_probs=25.6

Q ss_pred             CCCCCCCCEEecCCCCeeEeCCCCCcCC-CC
Q psy9688          47 TDPCLGHGTCINRSDRYQCLCTPRYSGN-NC   76 (181)
Q Consensus        47 ~~~c~~~~~C~~~~~~~~C~c~~g~~g~-~c   76 (181)
                      ..+|.+++.|++..+.|.|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            5678888999998889999999999887 44


No 22 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.20  E-value=0.001  Score=33.05  Aligned_cols=31  Identities=45%  Similarity=1.221  Sum_probs=25.4

Q ss_pred             CCC-CCCCCCCEEecCCCCeeEeCCCCCcC-CCC
Q psy9688          45 CLT-DPCLGHGTCINRSDRYQCLCTPRYSG-NNC   76 (181)
Q Consensus        45 c~~-~~c~~~~~C~~~~~~~~C~c~~g~~g-~~c   76 (181)
                      |.. .+|.++ .|++..+.|.|.|++||.| ..|
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            444 578877 9999888999999999988 554


No 23 
>KOG1226|consensus
Probab=97.12  E-value=0.0097  Score=49.52  Aligned_cols=107  Identities=28%  Similarity=0.628  Sum_probs=66.7

Q ss_pred             CCCCCCEEecCCCCeeEeCCCCCcCCCCCCCC---------CCCC----CCCCCCCcEEeeCC----------eeccccc
Q psy9688          49 PCLGHGTCINRSDRYQCLCTPRYSGNNCEKDN---------GDCS----SNPCLNDGVCVDEV----------YKGRYWE  105 (181)
Q Consensus        49 ~c~~~~~C~~~~~~~~C~c~~g~~g~~c~~~~---------~~c~----~~~c~~~~~C~~~~----------~~g~~c~  105 (181)
                      .|..+|+.+=+    .|.|.+||.|+.|+-..         +.|.    ..+|+..+.|.-..          +.|..|+
T Consensus       468 ~C~g~G~~~CG----~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCE  543 (783)
T KOG1226|consen  468 LCHGNGTFVCG----QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCE  543 (783)
T ss_pred             ccCCCCcEEec----ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeee
Confidence            35555555444    78999999999986322         1121    12577777665332          4577777


Q ss_pred             cCCCCCCCCC---CCCCCCEEeeCCCCceEeecCCCCccCCcccc-CCCCCCC--CCCCCCCCEEe
Q psy9688         106 LPEIRDCTSN---PCLNDCVCVDDGNLSFYCNCTEDFTGEYCQFE-NSAACVT--LNPCQNNATCV  165 (181)
Q Consensus       106 ~~~~~~c~~~---~c~~~~~C~~~~~~~~~C~C~~g~~g~~c~~~-~~~~c~~--~~~C~~~~~C~  165 (181)
                      . +.-.|...   .|..++.|.=     -.|+|.+||+|..|+.. +.+.|..  ...|...|.|.
T Consensus       544 C-DnfsC~r~~g~lC~g~G~C~C-----G~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~  603 (783)
T KOG1226|consen  544 C-DNFSCERHKGVLCGGHGRCEC-----GRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCE  603 (783)
T ss_pred             c-cCcccccccCcccCCCCeEeC-----CcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceee
Confidence            6 33344432   5888888843     26999999999998751 3455654  12366666664


No 24 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.93  E-value=0.00058  Score=34.38  Aligned_cols=24  Identities=38%  Similarity=0.779  Sum_probs=18.9

Q ss_pred             CCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         156 NPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       156 ~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      +.|+.+++|++++++  |.|.|++||
T Consensus         6 ~~C~~nA~C~~~~~~--~~C~C~~Gy   29 (36)
T PF12947_consen    6 GGCHPNATCTNTGGS--YTCTCKPGY   29 (36)
T ss_dssp             GGS-TTCEEEE-TTS--EEEEE-CEE
T ss_pred             CCCCCCcEeecCCCC--EEeECCCCC
Confidence            579999999999997  999999987


No 25 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.90  E-value=0.0025  Score=31.46  Aligned_cols=30  Identities=43%  Similarity=0.978  Sum_probs=24.2

Q ss_pred             CCCCCCCCEEeeCCCCceEeecCCCCccC-Cc
Q psy9688         114 SNPCLNDCVCVDDGNLSFYCNCTEDFTGE-YC  144 (181)
Q Consensus       114 ~~~c~~~~~C~~~~~~~~~C~C~~g~~g~-~c  144 (181)
                      ..+|..++.|++ ..+.|.|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~-~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVN-TPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEec-CCCCeEeECCCCCcccCCc
Confidence            346777789988 7788999999999877 44


No 26 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.83  E-value=0.0014  Score=29.65  Aligned_cols=20  Identities=45%  Similarity=1.178  Sum_probs=15.0

Q ss_pred             CeeEeCCCCCc----CCCCCCCCCC
Q psy9688          62 RYQCLCTPRYS----GNNCEKDNGD   82 (181)
Q Consensus        62 ~~~C~c~~g~~----g~~c~~~~~~   82 (181)
                      +|.|.|++||.    |..|. +|+|
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~-DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE-DIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc-cCCC
Confidence            58999999998    55665 5553


No 27 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.82  E-value=0.003  Score=31.31  Aligned_cols=28  Identities=36%  Similarity=1.020  Sum_probs=22.9

Q ss_pred             CCCCCCCEEeeCCCCceEeecCCCCcc-CCc
Q psy9688         115 NPCLNDCVCVDDGNLSFYCNCTEDFTG-EYC  144 (181)
Q Consensus       115 ~~c~~~~~C~~~~~~~~~C~C~~g~~g-~~c  144 (181)
                      .+|..+ .|++ ..+.|.|.|++||.+ ..|
T Consensus         6 ~~C~~~-~C~~-~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCIN-TPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEEC-CCCCeEeECCCCCccCCcc
Confidence            467776 8988 788999999999988 554


No 28 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.75  E-value=0.0013  Score=33.14  Aligned_cols=26  Identities=23%  Similarity=0.786  Sum_probs=20.5

Q ss_pred             CCCCCCEEeeCCCCceEeecCCCCccC
Q psy9688         116 PCLNDCVCVDDGNLSFYCNCTEDFTGE  142 (181)
Q Consensus       116 ~c~~~~~C~~~~~~~~~C~C~~g~~g~  142 (181)
                      .|+.+++|++ ..++|.|.|.+||.|+
T Consensus         7 ~C~~nA~C~~-~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    7 GCHPNATCTN-TGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GS-TTCEEEE--TTSEEEEE-CEEECC
T ss_pred             CCCCCcEeec-CCCCEEeECCCCCccC
Confidence            5788899999 8889999999999865


No 29 
>KOG1226|consensus
Probab=96.65  E-value=0.026  Score=47.10  Aligned_cols=89  Identities=31%  Similarity=0.723  Sum_probs=43.6

Q ss_pred             CCCCCCEEecCCCCeeEeCCCCCc----CCCCCCCCCCCCC---CCCCCCcEEeeCC------eeccccccC-CCCCCCC
Q psy9688          49 PCLGHGTCINRSDRYQCLCTPRYS----GNNCEKDNGDCSS---NPCLNDGVCVDEV------YKGRYWELP-EIRDCTS  114 (181)
Q Consensus        49 ~c~~~~~C~~~~~~~~C~c~~g~~----g~~c~~~~~~c~~---~~c~~~~~C~~~~------~~g~~c~~~-~~~~c~~  114 (181)
                      +|..+|.|+=+    .|.|.+...    |..|+-+.-.|..   ..|...+.|.-..      |+|..|.++ +.+.|..
T Consensus       515 vCSgrG~C~CG----qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~  590 (783)
T KOG1226|consen  515 VCSGRGDCVCG----QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCES  590 (783)
T ss_pred             CcCCCCcEeCC----ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccC
Confidence            45555555433    455544433    5555433222321   1244444443322      555555542 2344443


Q ss_pred             C---CCCCCCEEeeCCCCceEeecCCC-CccCCccc
Q psy9688         115 N---PCLNDCVCVDDGNLSFYCNCTED-FTGEYCQF  146 (181)
Q Consensus       115 ~---~c~~~~~C~~~~~~~~~C~C~~g-~~g~~c~~  146 (181)
                      .   -|..+++|.=     -+|.|... |.|..|+.
T Consensus       591 ~~G~iCSGrG~C~C-----g~C~C~~~~~sG~~CE~  621 (783)
T KOG1226|consen  591 SDGQICSGRGTCEC-----GRCKCTDPPYSGEFCEK  621 (783)
T ss_pred             CCCceeCCCceeeC-----CceEcCCCCcCcchhhc
Confidence            2   3666666643     14777665 88888863


No 30 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.53  E-value=0.0027  Score=28.78  Aligned_cols=20  Identities=25%  Similarity=0.874  Sum_probs=14.6

Q ss_pred             ceEeecCCCCc----cCCccccCCCC
Q psy9688         130 SFYCNCTEDFT----GEYCQFENSAA  151 (181)
Q Consensus       130 ~~~C~C~~g~~----g~~c~~~~~~~  151 (181)
                      +|+|.|++||.    |..|.  +|++
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~--DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE--DIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc--cCCC
Confidence            48999999997    45565  5553


No 31 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.28  E-value=0.0022  Score=24.62  Aligned_cols=13  Identities=38%  Similarity=1.122  Sum_probs=10.2

Q ss_pred             EeecCCCCccCCc
Q psy9688         132 YCNCTEDFTGEYC  144 (181)
Q Consensus       132 ~C~C~~g~~g~~c  144 (181)
                      +|+|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4899999999876


No 32 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.70  E-value=0.02  Score=27.90  Aligned_cols=27  Identities=41%  Similarity=1.153  Sum_probs=21.8

Q ss_pred             CCCCCCCEEecCCCCeeEeCCCCCcCCCC
Q psy9688          48 DPCLGHGTCINRSDRYQCLCTPRYSGNNC   76 (181)
Q Consensus        48 ~~c~~~~~C~~~~~~~~C~c~~g~~g~~c   76 (181)
                      ..|..+++|+..  ...|+|.+||.|..|
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            458899999854  349999999998765


No 33 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.24  E-value=0.031  Score=27.23  Aligned_cols=26  Identities=31%  Similarity=0.718  Sum_probs=20.9

Q ss_pred             CCCCCCEEeeCCCCceEeecCCCCccCCc
Q psy9688         116 PCLNDCVCVDDGNLSFYCNCTEDFTGEYC  144 (181)
Q Consensus       116 ~c~~~~~C~~~~~~~~~C~C~~g~~g~~c  144 (181)
                      .|.++++|+. .  ..+|.|.+||.|+.|
T Consensus         7 ~C~~~G~C~~-~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVS-P--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeC-C--CCEEECCCCCcCCCC
Confidence            5788899975 3  358999999999865


No 34 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.19  E-value=0.016  Score=29.05  Aligned_cols=21  Identities=33%  Similarity=0.895  Sum_probs=17.2

Q ss_pred             CCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         157 PCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       157 ~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      .|.+  +|++++++  |.|.|++||
T Consensus         7 gC~h--~C~~~~g~--~~C~C~~Gy   27 (36)
T PF14670_consen    7 GCSH--ICVNTPGS--YRCSCPPGY   27 (36)
T ss_dssp             GSSS--EEEEETTS--EEEE-STTE
T ss_pred             CcCC--CCccCCCc--eEeECCCCC
Confidence            4666  99999998  999999997


No 35 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.08  E-value=0.071  Score=26.72  Aligned_cols=19  Identities=37%  Similarity=1.074  Sum_probs=16.2

Q ss_pred             CEEecCCCCeeEeCCCCCc
Q psy9688          54 GTCINRSDRYQCLCTPRYS   72 (181)
Q Consensus        54 ~~C~~~~~~~~C~c~~g~~   72 (181)
                      ..|++..++|.|.|++||.
T Consensus        10 h~C~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             SEEEEETTSEEEE-STTEE
T ss_pred             CCCccCCCceEeECCCCCE
Confidence            4799989999999999997


No 36 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=92.52  E-value=0.16  Score=36.67  Aligned_cols=35  Identities=31%  Similarity=0.852  Sum_probs=27.6

Q ss_pred             cCCccccCCCCCCC-CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         141 GEYCQFENSAACVT-LNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       141 g~~c~~~~~~~c~~-~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      +..|.  ++++|.. .+.|..  .|+++.|+  |.|.|++||
T Consensus       181 ~~~C~--~~~~C~~~~~~c~~--~C~~~~g~--~~c~c~~g~  216 (224)
T cd01475         181 GKICV--VPDLCATLSHVCQQ--VCISTPGS--YLCACTEGY  216 (224)
T ss_pred             cccCc--CchhhcCCCCCccc--eEEcCCCC--EEeECCCCc
Confidence            56676  6778865 356765  89999999  899999997


No 37 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.31  E-value=0.099  Score=26.23  Aligned_cols=28  Identities=18%  Similarity=0.413  Sum_probs=18.9

Q ss_pred             CCCCCCCCCEEeeCCCCceEeecCCCCc
Q psy9688         113 TSNPCLNDCVCVDDGNLSFYCNCTEDFT  140 (181)
Q Consensus       113 ~~~~c~~~~~C~~~~~~~~~C~C~~g~~  140 (181)
                      ....|+.++.|++...|.+.|.|..||+
T Consensus         3 ~~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    3 IDTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             cCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            3446788889987344999999999997


No 38 
>KOG0994|consensus
Probab=88.87  E-value=1.7  Score=38.79  Aligned_cols=17  Identities=41%  Similarity=0.974  Sum_probs=14.4

Q ss_pred             CeeEeCCCCCcCCCCCC
Q psy9688          62 RYQCLCTPRYSGNNCEK   78 (181)
Q Consensus        62 ~~~C~c~~g~~g~~c~~   78 (181)
                      ...|.|.+||+|..|++
T Consensus       933 ~ivC~C~~GY~G~RCe~  949 (1758)
T KOG0994|consen  933 QIVCHCQEGYSGSRCEI  949 (1758)
T ss_pred             ceeeecccCccccchhh
Confidence            35899999999999874


No 39 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.72  E-value=0.56  Score=33.91  Aligned_cols=35  Identities=29%  Similarity=0.519  Sum_probs=25.8

Q ss_pred             cccCcCCCCCCCCCCCEEecCCCCeeEeCCCCCcC
Q psy9688          39 NSKVDHCLTDPCLGHGTCINRSDRYQCLCTPRYSG   73 (181)
Q Consensus        39 ~~~~~~c~~~~c~~~~~C~~~~~~~~C~c~~g~~g   73 (181)
                      |.+.++|...+......|.+..+.|.|.|++||+.
T Consensus       184 C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             CcCchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence            56677785433222357999999999999999974


No 40 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=88.05  E-value=0.17  Score=35.21  Aligned_cols=48  Identities=21%  Similarity=0.568  Sum_probs=25.9

Q ss_pred             ceEeecCCCCc---cCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         130 SFYCNCTEDFT---GEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       130 ~~~C~C~~g~~---g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      ...|+|..|+.   ...|.......|..  .|..+-.|-...+-  |.|.+.+||
T Consensus       109 ~~~CSC~IGkV~~dn~kCtk~G~T~C~L--KCk~nE~CK~~~~~--Y~C~~~~~~  159 (197)
T PF06247_consen  109 NPTCSCNIGKVPDDNKKCTKTGETKCSL--KCKENEECKLVDGY--YKCVCKEGF  159 (197)
T ss_dssp             EEEEEE-TEEETTTTTESEEEE----------TTTEEEEEETTE--EEEEE-TT-
T ss_pred             CceeEeeeceEeccCCcccCCCccceee--ecCCCcceeeeCcE--EEeecCCCC
Confidence            34899998875   34454323334543  56777789888775  889988775


No 41 
>KOG3516|consensus
Probab=87.70  E-value=0.59  Score=41.39  Aligned_cols=44  Identities=30%  Similarity=0.750  Sum_probs=37.9

Q ss_pred             cccCcCCCCCCCCCCCEEecCCCCeeEeCC-CCCcCCCCCCCCCC
Q psy9688          39 NSKVDHCLTDPCLGHGTCINRSDRYQCLCT-PRYSGNNCEKDNGD   82 (181)
Q Consensus        39 ~~~~~~c~~~~c~~~~~C~~~~~~~~C~c~-~g~~g~~c~~~~~~   82 (181)
                      |.-++.|.|++|.+++.|......+.|.|. .||.|..|...+.+
T Consensus       542 C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e  586 (1306)
T KOG3516|consen  542 CGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE  586 (1306)
T ss_pred             cccccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence            556788999999999999998888999998 99999999876644


No 42 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=87.37  E-value=0.81  Score=29.80  Aligned_cols=28  Identities=36%  Similarity=0.863  Sum_probs=22.1

Q ss_pred             CCCCCCEEecC--CCCeeEeCCCCCcCCCCC
Q psy9688          49 PCLGHGTCINR--SDRYQCLCTPRYSGNNCE   77 (181)
Q Consensus        49 ~c~~~~~C~~~--~~~~~C~c~~g~~g~~c~   77 (181)
                      -|.++ .|.-.  ...+.|.|..||+|.+|+
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCccccccc
Confidence            36664 78643  457899999999999997


No 43 
>PHA02887 EGF-like protein; Provisional
Probab=87.20  E-value=0.86  Score=29.18  Aligned_cols=27  Identities=37%  Similarity=1.085  Sum_probs=21.6

Q ss_pred             CCCCCEEecC--CCCeeEeCCCCCcCCCCC
Q psy9688          50 CLGHGTCINR--SDRYQCLCTPRYSGNNCE   77 (181)
Q Consensus        50 c~~~~~C~~~--~~~~~C~c~~g~~g~~c~   77 (181)
                      |. +|+|.-.  .....|.|..||+|.+|+
T Consensus        94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         94 CI-NGECMNIIDLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             ee-CCEEEccccCCCceeECCCCcccCCCC
Confidence            66 4688643  446899999999999997


No 44 
>smart00051 DSL delta serrate ligand.
Probab=85.50  E-value=1.7  Score=24.74  Aligned_cols=42  Identities=26%  Similarity=0.491  Sum_probs=28.6

Q ss_pred             eEeecCCCCccCCccccCCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         131 FYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       131 ~~C~C~~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      +.-.|.++|.|..|..    .|...+.+..+.+|-. .|.    ++|.+||
T Consensus        17 ~rv~C~~~~yG~~C~~----~C~~~~d~~~~~~Cd~-~G~----~~C~~Gw   58 (63)
T smart00051       17 IRVTCDENYYGEGCNK----FCRPRDDFFGHYTCDE-NGN----KGCLEGW   58 (63)
T ss_pred             EEeeCCCCCcCCccCC----EeCcCccccCCccCCc-CCC----EecCCCC
Confidence            4566888999998862    3433244566778853 565    8899987


No 45 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=82.95  E-value=0.61  Score=32.59  Aligned_cols=88  Identities=24%  Similarity=0.573  Sum_probs=46.0

Q ss_pred             CCCCCCCEEecCC-----CCeeEeCCCCCcCC--CCCCCCCCCCCCCCCCCcEEeeCC----------------eecccc
Q psy9688          48 DPCLGHGTCINRS-----DRYQCLCTPRYSGN--NCEKDNGDCSSNPCLNDGVCVDEV----------------YKGRYW  104 (181)
Q Consensus        48 ~~c~~~~~C~~~~-----~~~~C~c~~g~~g~--~c~~~~~~c~~~~c~~~~~C~~~~----------------~~g~~c  104 (181)
                      .+|...+.|++..     ..|.|.|.+||...  .|.  .+.|....|. .+.|+...                .+...|
T Consensus        50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kC  126 (197)
T PF06247_consen   50 KPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKC  126 (197)
T ss_dssp             SEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTES
T ss_pred             ccccchhhhhcCCCcccceeEEEecccCceeeCCeEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcc
Confidence            4588888898654     35899999999732  221  1334444555 45665433                011122


Q ss_pred             ccCCCCCCCCCCCCCCCEEeeCCCCceEeecCCCCc
Q psy9688         105 ELPEIRDCTSNPCLNDCVCVDDGNLSFYCNCTEDFT  140 (181)
Q Consensus       105 ~~~~~~~c~~~~c~~~~~C~~~~~~~~~C~C~~g~~  140 (181)
                      ..+....|+. .|..+..|.. ..+-|.|.+..+|.
T Consensus       127 tk~G~T~C~L-KCk~nE~CK~-~~~~Y~C~~~~~~~  160 (197)
T PF06247_consen  127 TKTGETKCSL-KCKENEECKL-VDGYYKCVCKEGFP  160 (197)
T ss_dssp             EEEE---------TTTEEEEE-ETTEEEEEE-TT-E
T ss_pred             cCCCccceee-ecCCCcceee-eCcEEEeecCCCCC
Confidence            2211233332 4778889988 78899999999986


No 46 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.15  E-value=2.1  Score=27.92  Aligned_cols=33  Identities=15%  Similarity=0.386  Sum_probs=23.5

Q ss_pred             CCCCCCC--CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         148 NSAACVT--LNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       148 ~~~~c~~--~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      ++..|..  .+.|.+| .|+..+.-..+.|.|..||
T Consensus        41 ~i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GY   75 (139)
T PHA03099         41 AIRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGY   75 (139)
T ss_pred             ccccCChhhCCEeECC-EEEeeccCCCceeECCCCc
Confidence            3445543  3568775 8987776666899999987


No 47 
>PHA02887 EGF-like protein; Provisional
Probab=79.06  E-value=2.6  Score=27.06  Aligned_cols=31  Identities=23%  Similarity=0.504  Sum_probs=23.1

Q ss_pred             CCCCC--CCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         150 AACVT--LNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       150 ~~c~~--~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      .+|.+  .+.|. +|+|...+.-...+|.|++||
T Consensus        84 ~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GY  116 (126)
T PHA02887         84 EKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGY  116 (126)
T ss_pred             cccChHhhCEee-CCEEEccccCCCceeECCCCc
Confidence            45654  35687 579987777667899999997


No 48 
>KOG3514|consensus
Probab=68.18  E-value=4.3  Score=36.15  Aligned_cols=37  Identities=30%  Similarity=0.898  Sum_probs=32.2

Q ss_pred             CCCCCCCCCCCEEecCCCCeeEeC-CCCCcCCCCCCCC
Q psy9688          44 HCLTDPCLGHGTCINRSDRYQCLC-TPRYSGNNCEKDN   80 (181)
Q Consensus        44 ~c~~~~c~~~~~C~~~~~~~~C~c-~~g~~g~~c~~~~   80 (181)
                      .|.++||.+++.|...-+.+.|-| -.+|.|+.|+.+.
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE~  662 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCEREA  662 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCcccccee
Confidence            688999999999999889999986 5789999998654


No 49 
>KOG0994|consensus
Probab=64.24  E-value=27  Score=31.85  Aligned_cols=58  Identities=21%  Similarity=0.487  Sum_probs=30.9

Q ss_pred             EeeCCCCceEe-ecCCCCccCCcccc----CCCCCCC-C-CCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         123 CVDDGNLSFYC-NCTEDFTGEYCQFE----NSAACVT-L-NPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       123 C~~~~~~~~~C-~C~~g~~g~~c~~~----~~~~c~~-~-~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      |.+ ....+.| .|..||.|+----.    ..-.|.+ + +.-.+...|...+-+.+..|+|++||
T Consensus       878 CqD-~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY  942 (1758)
T KOG0994|consen  878 CQD-STTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGY  942 (1758)
T ss_pred             ccc-cccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccccccccceeeecccCc
Confidence            444 4455666 58888865321100    1122332 1 11223346765666667899999997


No 50 
>KOG3516|consensus
Probab=62.38  E-value=7  Score=35.16  Aligned_cols=38  Identities=29%  Similarity=0.696  Sum_probs=33.5

Q ss_pred             CCCCCCCCCCCCCCEEeeCCCCceEeecC-CCCccCCccc
Q psy9688         108 EIRDCTSNPCLNDCVCVDDGNLSFYCNCT-EDFTGEYCQF  146 (181)
Q Consensus       108 ~~~~c~~~~c~~~~~C~~~~~~~~~C~C~-~g~~g~~c~~  146 (181)
                      -++.|.+++|..++.|.. ....|.|.|. .||+|..|..
T Consensus       544 i~drClPN~CehgG~C~Q-s~~~f~C~C~~TGY~GatCHt  582 (1306)
T KOG3516|consen  544 ISDRCLPNPCEHGGKCSQ-SWDDFECNCELTGYKGATCHT  582 (1306)
T ss_pred             cccccCCccccCCCcccc-cccceeEeccccccccccccC
Confidence            457888999999999998 8899999997 8999999875


No 51 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=61.28  E-value=21  Score=19.05  Aligned_cols=20  Identities=40%  Similarity=1.223  Sum_probs=15.7

Q ss_pred             CCCCCCEEecCCCCeeEeCCCCCc
Q psy9688          49 PCLGHGTCINRSDRYQCLCTPRYS   72 (181)
Q Consensus        49 ~c~~~~~C~~~~~~~~C~c~~g~~   72 (181)
                      .|..++.|++.    +|.|++||.
T Consensus        27 qC~~~s~C~~g----~C~C~~g~~   46 (52)
T PF01683_consen   27 QCIGGSVCVNG----RCQCPPGYV   46 (52)
T ss_pred             CCCCcCEEcCC----EeECCCCCE
Confidence            35577888776    999999986


No 52 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=60.75  E-value=12  Score=23.61  Aligned_cols=22  Identities=27%  Similarity=0.650  Sum_probs=13.8

Q ss_pred             CCCCCCEEeeCC----CCceEeecCC
Q psy9688         116 PCLNDCVCVDDG----NLSFYCNCTE  137 (181)
Q Consensus       116 ~c~~~~~C~~~~----~~~~~C~C~~  137 (181)
                      .|..++.|+...    ..-|.|.|.+
T Consensus        14 ~CsgHG~C~~~~~~~~~~C~~C~C~~   39 (103)
T PF12955_consen   14 NCSGHGSCVKKYGSGGGDCFACKCKP   39 (103)
T ss_pred             CCCCCceEeeccCCCccceEEEEeec
Confidence            466677776521    2447788876


No 53 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=57.65  E-value=15  Score=23.17  Aligned_cols=31  Identities=23%  Similarity=0.540  Sum_probs=22.1

Q ss_pred             CCCCCCCCCCCCCCCEEeeCCCCcceEEecCCCC
Q psy9688         148 NSAACVTLNPCQNNATCVASPGEKQITCLCLKEV  181 (181)
Q Consensus       148 ~~~~c~~~~~C~~~~~C~~~~g~~~~~C~C~~g~  181 (181)
                      ..+.|.....|...+.|... ..  ..|.|.+||
T Consensus        76 p~d~Cd~y~~CG~~g~C~~~-~~--~~C~Cl~GF  106 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNSN-NS--PKCSCLPGF  106 (110)
T ss_pred             cccCCCCccccCCccEeCCC-CC--CceECCCCc
Confidence            44567655789999999533 23  459999997


No 54 
>KOG1836|consensus
Probab=55.66  E-value=31  Score=32.97  Aligned_cols=45  Identities=24%  Similarity=0.626  Sum_probs=24.6

Q ss_pred             ecCCCCccCCccccCCCC-CCCCCCCCCCCEEeeCCCCcceEEe-cCCCC
Q psy9688         134 NCTEDFTGEYCQFENSAA-CVTLNPCQNNATCVASPGEKQITCL-CLKEV  181 (181)
Q Consensus       134 ~C~~g~~g~~c~~~~~~~-c~~~~~C~~~~~C~~~~g~~~~~C~-C~~g~  181 (181)
                      +|..||.|..-.  +... |.. -+|..++.|..+......-|. |++||
T Consensus       760 ~C~~GfYg~~~~--~~~~dC~~-C~Cp~~~~~~~~~~~~~~iCk~Cp~gy  806 (1705)
T KOG1836|consen  760 QCVDGFYGLPDL--GTSGDCQP-CPCPNGGACGQTPEILEVVCKNCPPGY  806 (1705)
T ss_pred             hhcCCCCCcccc--CCCCCCcc-CCCCCChhhcCcCcccceecCCCCCCC
Confidence            466666543221  1111 544 466677777655544445677 88876


No 55 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=53.81  E-value=19  Score=19.08  Aligned_cols=14  Identities=43%  Similarity=1.234  Sum_probs=12.6

Q ss_pred             eEeCCCCCcCCCCC
Q psy9688          64 QCLCTPRYSGNNCE   77 (181)
Q Consensus        64 ~C~c~~g~~g~~c~   77 (181)
                      .|.|++++.|..|+
T Consensus        20 ~C~C~~~~~G~~C~   33 (50)
T cd00055          20 QCECKPNTTGRRCD   33 (50)
T ss_pred             EEeCCCcCCCCCCC
Confidence            78899999999886


No 56 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=51.51  E-value=9.7  Score=20.10  Aligned_cols=16  Identities=31%  Similarity=0.669  Sum_probs=13.2

Q ss_pred             ceEeecCCCCccCCcc
Q psy9688         130 SFYCNCTEDFTGEYCQ  145 (181)
Q Consensus       130 ~~~C~C~~g~~g~~c~  145 (181)
                      ..+|.|.++|.|..|+
T Consensus        17 ~G~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   17 TGQCVCKPGTTGPRCD   32 (49)
T ss_dssp             CEEESBSTTEESTTS-
T ss_pred             CCEEeccccccCCcCc
Confidence            4589999999999996


No 57 
>KOG3514|consensus
Probab=47.58  E-value=16  Score=32.89  Aligned_cols=36  Identities=39%  Similarity=0.944  Sum_probs=31.2

Q ss_pred             CCCCCCCCCCCCEEeeCCCCceEeecC-CCCccCCccc
Q psy9688         110 RDCTSNPCLNDCVCVDDGNLSFYCNCT-EDFTGEYCQF  146 (181)
Q Consensus       110 ~~c~~~~c~~~~~C~~~~~~~~~C~C~-~g~~g~~c~~  146 (181)
                      ..|..+||.+.+.|.. ....+.|.|. .+|.|+.|..
T Consensus       624 ~~C~~nPC~N~g~C~e-gwNrfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  624 KICESNPCQNGGKCSE-GWNRFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             cccCCCcccCCCCccc-cccccccccccCcccCccccc
Confidence            3677889999999998 8899999995 5889999975


No 58 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=47.18  E-value=12  Score=18.37  Aligned_cols=8  Identities=25%  Similarity=0.916  Sum_probs=6.7

Q ss_pred             EEecCCCC
Q psy9688         174 TCLCLKEV  181 (181)
Q Consensus       174 ~C~C~~g~  181 (181)
                      .|.|++||
T Consensus        19 ~C~CPeGy   26 (34)
T PF09064_consen   19 QCFCPEGY   26 (34)
T ss_pred             ceeCCCce
Confidence            49999987


No 59 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=36.17  E-value=31  Score=17.97  Aligned_cols=15  Identities=33%  Similarity=0.866  Sum_probs=12.6

Q ss_pred             eEeecCCCCccCCcc
Q psy9688         131 FYCNCTEDFTGEYCQ  145 (181)
Q Consensus       131 ~~C~C~~g~~g~~c~  145 (181)
                      -+|.|++++.|..|+
T Consensus        18 G~C~C~~~~~G~~C~   32 (46)
T smart00180       18 GQCECKPNVTGRRCD   32 (46)
T ss_pred             CEEECCCCCCCCCCC
Confidence            479999999998885


No 60 
>KOG1218|consensus
Probab=32.70  E-value=83  Score=23.69  Aligned_cols=39  Identities=33%  Similarity=0.818  Sum_probs=25.7

Q ss_pred             eEeecCCCCccCCccccCCCCCCCCCCCCCCCEEeeCCCC
Q psy9688         131 FYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGE  170 (181)
Q Consensus       131 ~~C~C~~g~~g~~c~~~~~~~c~~~~~C~~~~~C~~~~g~  170 (181)
                      -.|.|.+||.+..+.. ....|.....|.+++.|....+.
T Consensus       162 ~~c~c~~g~~g~~~~~-~~~~c~~~~~~~~g~~C~~~~~~  200 (316)
T KOG1218|consen  162 GICTCQPGFVGVFCVE-SCSGCSPLTACENGAKCNRSTGS  200 (316)
T ss_pred             CceeccCCcccccccc-cCCCcCCCcccCCCCeeeccccc
Confidence            3588999999988874 22225443557677777766554


No 61 
>KOG3509|consensus
Probab=22.29  E-value=2.4e+02  Score=25.57  Aligned_cols=36  Identities=28%  Similarity=0.625  Sum_probs=26.2

Q ss_pred             CCCCCCCCCCCCEEeeCCCCceEeecCCCCccCCccc
Q psy9688         110 RDCTSNPCLNDCVCVDDGNLSFYCNCTEDFTGEYCQF  146 (181)
Q Consensus       110 ~~c~~~~c~~~~~C~~~~~~~~~C~C~~g~~g~~c~~  146 (181)
                      +.|+..++...+.|.. ......|.|+++|.|..|..
T Consensus       407 ~~c~~~p~~~~g~c~p-~~~~~~c~c~~g~~G~~c~d  442 (964)
T KOG3509|consen  407 DVCWRIPCQHDGPCLQ-TLEGKQCLCPPGYTGDSCED  442 (964)
T ss_pred             CccccccCCCCccccc-cccccceeccccccCchhhc
Confidence            3455556666666666 56667899999999998874


No 62 
>KOG3512|consensus
Probab=21.73  E-value=4.6e+02  Score=21.79  Aligned_cols=25  Identities=24%  Similarity=0.732  Sum_probs=17.9

Q ss_pred             CEEec-CCCCeeEeCCCCCcCCCCCC
Q psy9688          54 GTCIN-RSDRYQCLCTPRYSGNNCEK   78 (181)
Q Consensus        54 ~~C~~-~~~~~~C~c~~g~~g~~c~~   78 (181)
                      ..|+- ..+.++|.|...-.|..|++
T Consensus       285 s~Cv~d~~~~ltCdC~HNTaGPdCgr  310 (592)
T KOG3512|consen  285 SRCVMDESSHLTCDCEHNTAGPDCGR  310 (592)
T ss_pred             ceeeeccCCceEEecccCCCCCCccc
Confidence            35764 34458999998888888763


No 63 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=21.05  E-value=49  Score=18.26  Aligned_cols=18  Identities=28%  Similarity=0.576  Sum_probs=11.5

Q ss_pred             CceEeecCCCCccCCccc
Q psy9688         129 LSFYCNCTEDFTGEYCQF  146 (181)
Q Consensus       129 ~~~~C~C~~g~~g~~c~~  146 (181)
                      |.-.|.|..-|.|.+|+.
T Consensus        34 G~p~CECn~Cy~GpdCS~   51 (56)
T PF04863_consen   34 GSPVCECNSCYGGPDCST   51 (56)
T ss_dssp             TEE--EE-TTEESTTS-E
T ss_pred             CCccccccCCcCCCCccc
Confidence            446799999999999975


Done!