Query         psy11797
Match_columns 249
No_of_seqs    245 out of 1876
Neff          9.1 
Searched_HMMs 46136
Date          Fri Aug 16 19:14:39 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11797.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11797hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.3 6.5E-12 1.4E-16  115.1   8.6  158   31-224   692-860 (1289)
  2 KOG1214|consensus               99.3 4.5E-11 9.7E-16  109.8  12.3  107    1-131   724-865 (1289)
  3 KOG1219|consensus               99.1 1.6E-10 3.5E-15  114.6   6.6  108   32-180  3865-3974(4289)
  4 KOG1219|consensus               98.9 1.2E-09 2.7E-14  108.6   6.0   94   12-131  3865-3978(4289)
  5 KOG4289|consensus               98.9 2.1E-09 4.6E-14  103.5   5.3  127   10-176  1178-1308(2531)
  6 PF07645 EGF_CA:  Calcium-bindi  98.7 2.3E-08   5E-13   59.7   3.6   40   89-128     1-41  (42)
  7 PF07645 EGF_CA:  Calcium-bindi  98.6   6E-08 1.3E-12   57.9   3.3   34   30-63      1-36  (42)
  8 KOG1217|consensus               98.5 1.8E-06   4E-11   77.9  12.2  192   32-249   170-378 (487)
  9 KOG1217|consensus               98.4 4.7E-06   1E-10   75.3  11.6  170   43-242   140-330 (487)
 10 PF14670 FXa_inhibition:  Coagu  98.2   9E-07   2E-11   50.6   2.5   32   97-129     5-36  (36)
 11 KOG4260|consensus               98.1 1.5E-06 3.3E-11   71.0   3.0   67   29-121   234-304 (350)
 12 PF12662 cEGF:  Complement Clr-  98.1 2.9E-06 6.2E-11   43.7   2.1   24  111-134     1-24  (24)
 13 KOG4289|consensus               98.0 5.7E-06 1.2E-10   80.7   3.6   91  106-219  1216-1308(2531)
 14 PF12662 cEGF:  Complement Clr-  97.9 7.2E-06 1.6E-10   42.2   2.2   24   51-92      1-24  (24)
 15 PF14670 FXa_inhibition:  Coagu  97.8 1.5E-05 3.3E-10   45.5   2.7   33   34-66      1-33  (36)
 16 smart00179 EGF_CA Calcium-bind  97.8 2.7E-05 5.9E-10   45.1   3.4   35   30-64      1-37  (39)
 17 KOG4260|consensus               97.7 1.9E-05 4.2E-10   64.7   2.0   73   85-178   231-304 (350)
 18 PF12947 EGF_3:  EGF domain;  I  97.5 4.7E-05   1E-09   43.5   1.5   31   34-64      1-33  (36)
 19 PF00008 EGF:  EGF-like domain   97.5 6.2E-05 1.3E-09   41.9   1.9   30   34-63      1-31  (32)
 20 cd00054 EGF_CA Calcium-binding  97.4 0.00018 3.8E-09   41.2   3.4   36   30-65      1-37  (38)
 21 smart00179 EGF_CA Calcium-bind  97.4 0.00021 4.6E-09   41.2   3.5   37   89-129     1-38  (39)
 22 KOG1225|consensus               97.3  0.0011 2.3E-08   60.5   8.5  121   53-224   235-365 (525)
 23 PF12947 EGF_3:  EGF domain;  I  97.2 0.00021 4.5E-09   40.8   2.0   31   97-129     5-36  (36)
 24 PF00683 TB:  TB domain;  Inter  96.9 0.00017 3.7E-09   42.7  -0.7   30  183-212    11-40  (42)
 25 KOG1225|consensus               96.8  0.0091   2E-07   54.5   9.4   74  113-224   266-339 (525)
 26 cd00054 EGF_CA Calcium-binding  96.8  0.0019 4.2E-08   36.6   3.4   35   90-129     2-37  (38)
 27 PF00008 EGF:  EGF-like domain   96.6  0.0018 3.9E-08   35.9   2.3   24   98-121     4-29  (32)
 28 cd00053 EGF Epidermal growth f  96.3  0.0049 1.1E-07   34.3   3.1   21   43-63     12-32  (36)
 29 cd01475 vWA_Matrilin VWA_Matri  96.3  0.0038 8.2E-08   51.2   3.5   44   83-127   180-223 (224)
 30 smart00181 EGF Epidermal growt  96.3  0.0052 1.1E-07   34.5   3.1   19   44-62     12-30  (35)
 31 smart00181 EGF Epidermal growt  96.2   0.007 1.5E-07   33.9   3.2   24   98-121     6-29  (35)
 32 PF06247 Plasmod_Pvs28:  Plasmo  96.2  0.0014 3.1E-08   51.2   0.3  135   43-225    11-164 (197)
 33 cd00053 EGF Epidermal growth f  96.1  0.0089 1.9E-07   33.2   3.2   24   98-121     6-30  (36)
 34 cd01475 vWA_Matrilin VWA_Matri  95.8  0.0073 1.6E-07   49.5   3.0   37   29-65    185-221 (224)
 35 PF12661 hEGF:  Human growth fa  94.3   0.028 6.1E-07   24.4   1.2   12   53-64      1-12  (13)
 36 KOG0994|consensus               92.8     1.5 3.2E-05   43.8  11.0   60  162-224   878-946 (1758)
 37 KOG0994|consensus               92.6    0.34 7.4E-06   47.9   6.5   32   90-123   865-897 (1758)
 38 KOG1226|consensus               90.3     3.5 7.6E-05   39.4  10.4   23   43-66    467-492 (783)
 39 PF07974 EGF_2:  EGF-like domai  89.9    0.52 1.1E-05   26.0   3.0   25   99-129     7-32  (32)
 40 PF06247 Plasmod_Pvs28:  Plasmo  88.3    0.39 8.4E-06   37.8   2.4   99   43-180    56-162 (197)
 41 PF12946 EGF_MSP1_1:  MSP1 EGF   81.0     1.1 2.4E-05   25.5   1.4   25   40-64      8-33  (37)
 42 PHA03099 epidermal growth fact  76.8     2.3   5E-05   31.3   2.4   39   89-131    41-82  (139)
 43 KOG1226|consensus               75.5     7.1 0.00015   37.4   5.8   15  113-131   479-493 (783)
 44 KOG1836|consensus               74.3      10 0.00022   40.2   7.0   14   54-67    697-710 (1705)
 45 smart00051 DSL delta serrate l  70.3     9.1  0.0002   24.6   3.8   16   51-66     16-31  (63)
 46 KOG1836|consensus               65.9      19 0.00042   38.3   6.9   71   32-131   738-813 (1705)
 47 PHA02887 EGF-like protein; Pro  57.5     9.9 0.00021   27.6   2.3   38   90-131    83-123 (126)
 48 PHA03099 epidermal growth fact  56.6     9.4  0.0002   28.2   2.1   24   43-66     56-81  (139)
 49 PF09064 Tme5_EGF_like:  Thromb  56.3      13 0.00028   20.7   2.2   24   40-64      7-30  (34)
 50 PF00954 S_locus_glycop:  S-loc  56.2      12 0.00026   26.7   2.7   30   90-121    77-107 (110)
 51 KOG3512|consensus               43.0      90   0.002   28.6   6.4   25   43-67    285-310 (592)
 52 KOG1215|consensus               42.8      43 0.00094   33.3   5.1   66   40-126   334-400 (877)
 53 cd00055 EGF_Lam Laminin-type e  37.1      93   0.002   18.6   4.5   16  187-203    20-35  (50)
 54 KOG0196|consensus               33.4      51  0.0011   32.4   3.6   51  168-221   258-317 (996)
 55 KOG3516|consensus               32.5      33 0.00071   34.9   2.3   39   86-130   541-581 (1306)
 56 smart00180 EGF_Lam Laminin-typ  27.4 1.4E+02   0.003   17.5   3.7   16  187-203    19-34  (46)
 57 PF12955 DUF3844:  Domain of un  27.2      40 0.00087   24.0   1.4   31   91-121     6-42  (103)
 58 PF01826 TIL:  Trypsin Inhibito  24.0      40 0.00086   20.6   0.9   18  113-131    34-51  (55)
 59 KOG3516|consensus               21.7      77  0.0017   32.4   2.7   37   31-67    545-582 (1306)

No 1  
>KOG1214|consensus
Probab=99.31  E-value=6.5e-12  Score=115.15  Aligned_cols=158  Identities=27%  Similarity=0.556  Sum_probs=114.8

Q ss_pred             CCccCCCCCCCC--CCceecCC-CeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeee
Q psy11797         31 VDECRTPANTCK--FSCKNLIG-SYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCV  106 (249)
Q Consensus        31 id~C~~~~~~c~--~~C~n~~g-sy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~  106 (249)
                      ++.|..+.+.|.  +.|....+ .|+|.|..||.|++                    ..|.|+++|+.....|. +.+|+
T Consensus       692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdg--------------------r~c~d~~eca~~~~~CGp~s~Ci  751 (1289)
T KOG1214|consen  692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDG--------------------RNCVDENECATGFHRCGPNSVCI  751 (1289)
T ss_pred             cccceecCcccCCCccccCCCCcceEEEEeeccCCCC--------------------CCCCChhhhccCCCCCCCCceee
Confidence            567777777776  77876654 59999999999876                    67899999998788899 88999


Q ss_pred             eCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccC--CCCCcc--cccccc-CCCCcccCCCCCCccC
Q psy11797        107 NLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNN--NNNQRL--GFCYRS-LTNGRCVLPTGPALLM  181 (249)
Q Consensus       107 ~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~C~~~-~~~~~C~c~~g~~~~~  181 (249)
                      +.+++|+|.|..||...+++.+|..+..-+.          .+.|..  ..|.-.  ..|... .+.|.|.|.+||.++.
T Consensus       752 n~pg~~rceC~~gy~F~dd~~tCV~i~~pap----------~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG  821 (1289)
T KOG1214|consen  752 NLPGSYRCECRSGYEFADDRHTCVLITPPAP----------ANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG  821 (1289)
T ss_pred             cCCCceeEEEeecceeccCCcceEEecCCCC----------CCccccCccccCcCCceEEEecCCceEEEeecCCccCCc
Confidence            9999999999999999999999987432111          111211  223333  344444 3459999999998875


Q ss_pred             --CCCcceeecCCCCccCCCCccCCCCCCchhhccCCCCCccCCC
Q psy11797        182 --EVTRMDCCCTMGMAWGPQCQLCPTRGSQEYTDLCLESGLTVDG  224 (249)
Q Consensus       182 --~~~~~~C~C~~g~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~  224 (249)
                        +.+.++|.=+..+- .++|    ....++|.|.| .+||.||+
T Consensus       822 ~~c~dvDeC~psrChp-~A~C----yntpgsfsC~C-~pGy~GDG  860 (1289)
T KOG1214|consen  822 HQCTDVDECSPSRCHP-AATC----YNTPGSFSCRC-QPGYYGDG  860 (1289)
T ss_pred             cccccccccCccccCC-CceE----ecCCCcceeec-ccCccCCC
Confidence              45667775222222 4455    45557799999 89999988


No 2  
>KOG1214|consensus
Probab=99.28  E-value=4.5e-11  Score=109.79  Aligned_cols=107  Identities=34%  Similarity=0.768  Sum_probs=88.0

Q ss_pred             CCCCCCCCccCcccCCCCCCCC--cceeeC---------------------------CCCCccCCCCCCCC----CCcee
Q psy11797          1 MSQVTFICSDVDECRTPANTCK--FSCKNL---------------------------IDVDECRTPANTCK----FSCKN   47 (249)
Q Consensus         1 ~~~~g~~C~di~eC~~~~~~c~--~~C~~~---------------------------~did~C~~~~~~c~----~~C~n   47 (249)
                      +.+++++|.|++||+..+..|+  .+|++.                           ..++.|..+.+.|.    +.|+.
T Consensus       724 ~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~  803 (1289)
T KOG1214|consen  724 YQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVH  803 (1289)
T ss_pred             cCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEe
Confidence            3578999999999998877665  588887                           23688888877775    45665


Q ss_pred             cC-CCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCC
Q psy11797         48 LI-GSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLD  125 (249)
Q Consensus        48 ~~-gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~  125 (249)
                      .. ++|.|.|.+||.|++                    +.|.++|+|+  ++.|. ++.|.+++++|.|.|.+||.  .|
T Consensus       804 hGgs~y~C~CLPGfsGDG--------------------~~c~dvDeC~--psrChp~A~CyntpgsfsC~C~pGy~--GD  859 (1289)
T KOG1214|consen  804 HGGSTYSCACLPGFSGDG--------------------HQCTDVDECS--PSRCHPAATCYNTPGSFSCRCQPGYY--GD  859 (1289)
T ss_pred             cCCceEEEeecCCccCCc--------------------cccccccccC--ccccCCCceEecCCCcceeecccCcc--CC
Confidence            54 459999999999976                    6688999999  58898 77999999999999999998  56


Q ss_pred             CCCccc
Q psy11797        126 GKQCLG  131 (249)
Q Consensus       126 g~~C~~  131 (249)
                      |..|.+
T Consensus       860 Gf~CVP  865 (1289)
T KOG1214|consen  860 GFQCVP  865 (1289)
T ss_pred             CceecC
Confidence            778876


No 3  
>KOG1219|consensus
Probab=99.08  E-value=1.6e-10  Score=114.60  Aligned_cols=108  Identities=23%  Similarity=0.557  Sum_probs=94.6

Q ss_pred             CccCCCCCCCCCCceecC-CCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCC
Q psy11797         32 DECRTPANTCKFSCKNLI-GSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLE  109 (249)
Q Consensus        32 d~C~~~~~~c~~~C~n~~-gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~  109 (249)
                      +.|..+|++.+++|...+ |+|.|.|++.|.|..|+                     .++..|..  +||. .|.|+...
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE---------------------i~~epC~s--nPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE---------------------IDLEPCAS--NPCLTGGTCIPFY 3921 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccc---------------------cccccccC--CCCCCCCEEEecC
Confidence            788888888889998776 67999999999998887                     68899995  8999 55999999


Q ss_pred             CcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCcc
Q psy11797        110 GSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALL  180 (249)
Q Consensus       110 g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~  180 (249)
                      ++|.|.|+.||+    |++|+..              +.+.|..++|..++.|.+..++|.|.|..|+.+.
T Consensus      3922 n~f~CnC~~gyT----G~~Ce~~--------------Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3922 NGFLCNCPNGYT----GKRCEAR--------------GISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred             CCeeEeCCCCcc----Cceeecc--------------cccccccccccCCceeeccCCceEeccChhHhcc
Confidence            999999999999    9999872              2455778889999999999999999999998765


No 4  
>KOG1219|consensus
Probab=98.93  E-value=1.2e-09  Score=108.60  Aligned_cols=94  Identities=27%  Similarity=0.753  Sum_probs=83.3

Q ss_pred             cccCCCCCCCCcceeeC-------------------CCCCccCCCCCCCCCCceecCCCeeeecCCCccccCCCcccccc
Q psy11797         12 DECRTPANTCKFSCKNL-------------------IDVDECRTPANTCKFSCKNLIGSYMCTCPPGYQQVTHSTVAIAT   72 (249)
Q Consensus        12 ~eC~~~~~~c~~~C~~~-------------------~did~C~~~~~~c~~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~   72 (249)
                      +.|..+||.++++|...                   +++.+|.++|+.-+++|....++|.|.|+.||+|..|+.     
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~----- 3939 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA----- 3939 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec-----
Confidence            88999999999999776                   688999999999999999999999999999999988872     


Q ss_pred             cCCccccCCCCCCceecCcccccCCCCCCC-CeeeeCCCcceeecCCCceeCCCCCCccc
Q psy11797         73 TDTRTAESGGKSHECVDVNECELNLDSCAN-GRCVNLEGSYRCECERGFKLSLDGKQCLG  131 (249)
Q Consensus        73 ~~~~~~~~~~~~~~C~~i~~C~~~~~~C~~-g~C~~~~g~~~C~C~~G~~~~~~g~~C~~  131 (249)
                                     ..|++|+  .++|.+ |.|+|.+|+|.|.|.+||.    |++|.+
T Consensus      3940 ---------------~Gi~eCs--~n~C~~gg~C~n~~gsf~CncT~g~~----gr~c~~ 3978 (4289)
T KOG1219|consen 3940 ---------------RGISECS--KNVCGTGGQCINIPGSFHCNCTPGIL----GRTCCA 3978 (4289)
T ss_pred             ---------------ccccccc--cccccCCceeeccCCceEeccChhHh----cccCcc
Confidence                           2489998  489995 5999999999999999998    777743


No 5  
>KOG4289|consensus
Probab=98.87  E-value=2.1e-09  Score=103.53  Aligned_cols=127  Identities=24%  Similarity=0.525  Sum_probs=91.3

Q ss_pred             cCcccCCCCCCCCcceeeCCCCCccCCCCCCCC--CCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCce
Q psy11797         10 DVDECRTPANTCKFSCKNLIDVDECRTPANTCK--FSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHEC   87 (249)
Q Consensus        10 di~eC~~~~~~c~~~C~~~~did~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C   87 (249)
                      |-|-|...||.+-+.|+.....|.=+.....-.  ..=++..+++.|+||+||+|+.|+                     
T Consensus      1178 dDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~Ce--------------------- 1236 (2531)
T KOG4289|consen 1178 DDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCE--------------------- 1236 (2531)
T ss_pred             cCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCccccc---------------------
Confidence            346788888877788877632221111000000  122356788999999999998887                     


Q ss_pred             ecCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCcccccccc-
Q psy11797         88 VDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRS-  165 (249)
Q Consensus        88 ~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~-  165 (249)
                      +.||+|-.  ++|. +|.|...+|+|+|.|.+||+    |.+|+....             .-.|-..-|++.++|.+. 
T Consensus      1237 TeiDlCYs--~pC~nng~C~srEggYtCeCrpg~t----GehCEvs~~-------------agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1237 TEIDLCYS--GPCGNNGRCRSREGGYTCECRPGFT----GEHCEVSAR-------------AGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred             chhHhhhc--CCCCCCCceEEecCceeEEecCCcc----ccceeeecc-------------cCccccceecCCCEEeecC
Confidence            68999985  8999 78999999999999999999    999976211             112333447899999987 


Q ss_pred             CCCCcccCCCC
Q psy11797        166 LTNGRCVLPTG  176 (249)
Q Consensus       166 ~~~~~C~c~~g  176 (249)
                      .+++.|+|++|
T Consensus      1298 nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1298 NGGFCCHCPYG 1308 (2531)
T ss_pred             CCceeccCCCc
Confidence            45688999988


No 6  
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.68  E-value=2.3e-08  Score=59.68  Aligned_cols=40  Identities=48%  Similarity=1.066  Sum_probs=34.3

Q ss_pred             cCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCC
Q psy11797         89 DVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQ  128 (249)
Q Consensus        89 ~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~  128 (249)
                      |||||+...+.|. ++.|+|+.|+|+|.|++||.+...+..
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~   41 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTT   41 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCc
Confidence            6899998778898 679999999999999999996655544


No 7  
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.57  E-value=6e-08  Score=57.87  Aligned_cols=34  Identities=47%  Similarity=1.090  Sum_probs=29.8

Q ss_pred             CCCccCCCCCCCC--CCceecCCCeeeecCCCcccc
Q psy11797         30 DVDECRTPANTCK--FSCKNLIGSYMCTCPPGYQQV   63 (249)
Q Consensus        30 did~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~   63 (249)
                      |||||+..++.|.  +.|+|+.|+|+|.|++||+..
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence            6888888777776  899999999999999999843


No 8  
>KOG1217|consensus
Probab=98.48  E-value=1.8e-06  Score=77.94  Aligned_cols=192  Identities=28%  Similarity=0.498  Sum_probs=115.5

Q ss_pred             CccCCCCCCCC--CCceecCCCeeeecCCCccccCCCccc-cccc---CCccccCCCCCCce-ecCcccccCCCCCCCCe
Q psy11797         32 DECRTPANTCK--FSCKNLIGSYMCTCPPGYQQVTHSTVA-IATT---DTRTAESGGKSHEC-VDVNECELNLDSCANGR  104 (249)
Q Consensus        32 d~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~~~~~~-~~~~---~~~~~~~~~~~~~C-~~i~~C~~~~~~C~~g~  104 (249)
                      ++|......|.  +.|.+..++|.|.|++||.+..++... ...+   ....+..++.+..| .++.++..  .  . +.
T Consensus       170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~--~--~-~~  244 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECAS--G--D-GT  244 (487)
T ss_pred             cccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccC--C--C-Cc
Confidence            67775554444  789999999999999999998776320 0000   11222233333333 12333332  1  2 68


Q ss_pred             eeeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccCC-C
Q psy11797        105 CVNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLME-V  183 (249)
Q Consensus       105 C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~~-~  183 (249)
                      |+++.++|.|.|++||.+... ..+.++++|....               .+.++++|.+..+.|.|.|..+|.+... .
T Consensus       245 c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~---------------~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~  308 (487)
T KOG1217|consen  245 CVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALIA---------------SCPNGGTCVNVPGSYRCTCPPGFTGRLCTE  308 (487)
T ss_pred             ccccCCceeeeCCCCcccccc-ceeeeccccCCCC---------------ccCCCCeeecCCCcceeeCCCCCCCCCCcc
Confidence            999999999999999984321 2344444433211               1567899999988899999999988753 1


Q ss_pred             CcceeecCCC-----CccCCCCccCCCCCCchhhccCCCCCccCCC-CCC-cccccCCCCC-CCCcccc-cccCC
Q psy11797        184 TRMDCCCTMG-----MAWGPQCQLCPTRGSQEYTDLCLESGLTVDG-RDI-DECVTIPAVE-SSKLAKM-FLRAY  249 (249)
Q Consensus       184 ~~~~C~C~~g-----~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~-~di-deC~~~~~~C-~ng~c~~-~~~~y  249 (249)
                      ......|...     ..++.+|.  .......|.|.| ..||.+.. ++. ++|...+  + ..+.|++ ..++|
T Consensus       309 ~~~~~~C~~~~~~~~c~~g~~C~--~~~~~~~~~C~c-~~~~~g~~C~~~~~~C~~~~--~~~~~~c~~~~~~~~  378 (487)
T KOG1217|consen  309 CVDVDECSPRNAGGPCANGGTCN--TLGSFGGFRCAC-GPGFTGRRCEDSNDECASSP--CCPGGTCVNETPGSY  378 (487)
T ss_pred             ccccccccccccCCcCCCCcccc--cCCCCCCCCcCC-CCCCCCCccccCCccccCCc--cccCCEeccCCCCCe
Confidence            1111223221     12233551  133445678999 67877755 455 5998877  5 5577887 44443


No 9  
>KOG1217|consensus
Probab=98.35  E-value=4.7e-06  Score=75.26  Aligned_cols=170  Identities=26%  Similarity=0.476  Sum_probs=104.7

Q ss_pred             CCceec---CCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCCCcceeecCC
Q psy11797         43 FSCKNL---IGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLEGSYRCECER  118 (249)
Q Consensus        43 ~~C~n~---~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~  118 (249)
                      +.|.+.   ...|.|.|..||.+..+.                     ...++|......|. .+.|.+..++|.|.|++
T Consensus       140 ~~c~~~~~~~~~~~c~C~~g~~~~~~~---------------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~  198 (487)
T KOG1217|consen  140 GSCSNGPGSVGPFRCSCTEGYEGEPCE---------------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPP  198 (487)
T ss_pred             hhhcCCCCCCCceeeeeCCCccccccc---------------------ccccccccCCCCcCCCcccccCCCCeeEeCCC
Confidence            566654   357999999999987665                     23367875556788 45899999999999999


Q ss_pred             CceeCCCCCCcccC---CCccceeeeecCCc-ccccccC--CCCCcc-ccccccCCCCcccCCCCCCccC---CCCccee
Q psy11797        119 GFKLSLDGKQCLGK---GQFVEFRIILSMPK-AENSVNN--NNNQRL-GFCYRSLTNGRCVLPTGPALLM---EVTRMDC  188 (249)
Q Consensus       119 G~~~~~~g~~C~~~---~~~~~~~~~~~~~~-~~~~~~~--~~~~~~-~~C~~~~~~~~C~c~~g~~~~~---~~~~~~C  188 (249)
                      +|.    +..++..   ..+.........+. ....+..  ..+... +.|.+..+++.|.+..+|++..   .....+|
T Consensus       199 ~~~----~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C  274 (487)
T KOG1217|consen  199 GYT----GSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSC  274 (487)
T ss_pred             Ccc----CCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeecccc
Confidence            998    4444331   11111000000000 0001111  112222 8899999999999999998773   1245556


Q ss_pred             ecCCCCccCCCCccCCCCCCchhhccCCCCCccCCCC----CCccccc--CCCCCCCC-cc
Q psy11797        189 CCTMGMAWGPQCQLCPTRGSQEYTDLCLESGLTVDGR----DIDECVT--IPAVESSK-LA  242 (249)
Q Consensus       189 ~C~~g~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~~----dideC~~--~~~~C~ng-~c  242 (249)
                      .-.....++.+|    +...+.|.|.| .+||++...    ++++|..  ....|.++ .|
T Consensus       275 ~~~~~c~~~~~C----~~~~~~~~C~C-~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C  330 (487)
T KOG1217|consen  275 ALIASCPNGGTC----VNVPGSYRCTC-PPGFTGRLCTECVDVDECSPRNAGGPCANGGTC  330 (487)
T ss_pred             CCCCccCCCCee----ecCCCcceeeC-CCCCCCCCCccccccccccccccCCcCCCCccc
Confidence            533212346677    45555599999 699999653    5578864  34458665 66


No 10 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.23  E-value=9e-07  Score=50.55  Aligned_cols=32  Identities=50%  Similarity=1.161  Sum_probs=26.8

Q ss_pred             CCCCCCCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797         97 LDSCANGRCVNLEGSYRCECERGFKLSLDGKQC  129 (249)
Q Consensus        97 ~~~C~~g~C~~~~g~~~C~C~~G~~~~~~g~~C  129 (249)
                      .+.|.| .|++++++|+|.|++||.|..|+++|
T Consensus         5 NGgC~h-~C~~~~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    5 NGGCSH-ICVNTPGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             GGGSSS-EEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred             CCCcCC-CCccCCCceEeECCCCCEECcCCCCC
Confidence            466777 89999999999999999999999876


No 11 
>KOG4260|consensus
Probab=98.14  E-value=1.5e-06  Score=71.03  Aligned_cols=67  Identities=42%  Similarity=0.955  Sum_probs=54.7

Q ss_pred             CCCCccCCCCCCCC--CCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC--CCe
Q psy11797         29 IDVDECRTPANTCK--FSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA--NGR  104 (249)
Q Consensus        29 ~did~C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~--~g~  104 (249)
                      +|||||...+..|.  +.|+|+.|||.|...+||.+                          ++|+|+.-...|.  +..
T Consensus       234 vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~--------------------------g~d~C~~~~d~~~~kn~~  287 (350)
T KOG4260|consen  234 VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK--------------------------GVDECQFCADVCASKNRP  287 (350)
T ss_pred             ccHHHHhcCCCCCChhheeecCCCceEecccccccC--------------------------ChHHhhhhhhhcccCCCC
Confidence            79999999888887  79999999999999999975                          3455553223444  568


Q ss_pred             eeeCCCcceeecCCCce
Q psy11797        105 CVNLEGSYRCECERGFK  121 (249)
Q Consensus       105 C~~~~g~~~C~C~~G~~  121 (249)
                      |.|+.+.|+|.|..|+.
T Consensus       288 c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  288 CMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             cccCCccEEEEecccce
Confidence            99999999999999876


No 12 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=98.06  E-value=2.9e-06  Score=43.67  Aligned_cols=24  Identities=42%  Similarity=0.991  Sum_probs=22.1

Q ss_pred             cceeecCCCceeCCCCCCcccCCC
Q psy11797        111 SYRCECERGFKLSLDGKQCLGKGQ  134 (249)
Q Consensus       111 ~~~C~C~~G~~~~~~g~~C~~~~~  134 (249)
                      +|+|.|++||.+..++++|++|+|
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            689999999999999999999875


No 13 
>KOG4289|consensus
Probab=97.96  E-value=5.7e-06  Score=80.74  Aligned_cols=91  Identities=21%  Similarity=0.327  Sum_probs=69.1

Q ss_pred             eeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccC-CCC
Q psy11797        106 VNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLM-EVT  184 (249)
Q Consensus       106 ~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~-~~~  184 (249)
                      ++..++++|.||+||+    +..|+.               ..+.|...+|.+++.|....++|+|.|.++|++.. +++
T Consensus      1216 i~pvnglrCrCPpGFT----gd~CeT---------------eiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs 1276 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFT----GDYCET---------------EIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVS 1276 (2531)
T ss_pred             ccccCceeEeCCCCCC----cccccc---------------hhHhhhcCCCCCCCceEEecCceeEEecCCccccceeee
Confidence            4556789999999999    778865               44567888999999999999999999999999876 333


Q ss_pred             cceeecCCCC-ccCCCCccCCCCCCchhhccCCCCC
Q psy11797        185 RMDCCCTMGM-AWGPQCQLCPTRGSQEYTDLCLESG  219 (249)
Q Consensus       185 ~~~C~C~~g~-~~g~~C~~C~~~~~~~~~c~Cp~~G  219 (249)
                      ...=.|.+|+ .+|.+|+   ....++|.|.|| .|
T Consensus      1277 ~~agrCvpGvC~nggtC~---~~~nggf~c~Cp-~g 1308 (2531)
T KOG4289|consen 1277 ARAGRCVPGVCKNGGTCV---NLLNGGFCCHCP-YG 1308 (2531)
T ss_pred             cccCccccceecCCCEEe---ecCCCceeccCC-Cc
Confidence            2222344555 3477774   667788999995 44


No 14 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.95  E-value=7.2e-06  Score=42.17  Aligned_cols=24  Identities=50%  Similarity=1.227  Sum_probs=21.0

Q ss_pred             CeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcc
Q psy11797         51 SYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNE   92 (249)
Q Consensus        51 sy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~   92 (249)
                      ||+|+|++||+.....                  +.|+||||
T Consensus         1 sy~C~C~~Gy~l~~d~------------------~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDG------------------RSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCC------------------CccccCCC
Confidence            6999999999987766                  78999986


No 15 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.85  E-value=1.5e-05  Score=45.47  Aligned_cols=33  Identities=39%  Similarity=0.994  Sum_probs=25.9

Q ss_pred             cCCCCCCCCCCceecCCCeeeecCCCccccCCC
Q psy11797         34 CRTPANTCKFSCKNLIGSYMCTCPPGYQQVTHS   66 (249)
Q Consensus        34 C~~~~~~c~~~C~n~~gsy~C~C~~G~~g~~~~   66 (249)
                      |......|..+|++++++|+|.|++||++....
T Consensus         1 C~~~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~   33 (36)
T PF14670_consen    1 CSVNNGGCSHICVNTPGSYRCSCPPGYKLAEDG   33 (36)
T ss_dssp             CTTGGGGSSSEEEEETTSEEEE-STTEEE-TTS
T ss_pred             CCCCCCCcCCCCccCCCceEeECCCCCEECcCC
Confidence            344556788999999999999999999987755


No 16 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.80  E-value=2.7e-05  Score=45.12  Aligned_cols=35  Identities=43%  Similarity=0.950  Sum_probs=26.4

Q ss_pred             CCCccCC-CCCCCCCCceecCCCeeeecCCCcc-ccC
Q psy11797         30 DVDECRT-PANTCKFSCKNLIGSYMCTCPPGYQ-QVT   64 (249)
Q Consensus        30 did~C~~-~~~~c~~~C~n~~gsy~C~C~~G~~-g~~   64 (249)
                      ++|+|.. .++...++|+++.++|.|.|++||. |..
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~   37 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN   37 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCc
Confidence            4677776 3444446899999999999999998 543


No 17 
>KOG4260|consensus
Probab=97.69  E-value=1.9e-05  Score=64.74  Aligned_cols=73  Identities=33%  Similarity=0.585  Sum_probs=54.4

Q ss_pred             CceecCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCcccccc
Q psy11797         85 HECVDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCY  163 (249)
Q Consensus        85 ~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~  163 (249)
                      ..|+|||||...+.+|. +..|+|+.|+|.|...+||..+  ...|+.   ++                ..-...+..|.
T Consensus       231 ~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g--~d~C~~---~~----------------d~~~~kn~~c~  289 (350)
T KOG4260|consen  231 EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG--VDECQF---CA----------------DVCASKNRPCM  289 (350)
T ss_pred             cccccHHHHhcCCCCCChhheeecCCCceEecccccccCC--hHHhhh---hh----------------hhcccCCCCcc
Confidence            56899999998889998 7799999999999999999732  222221   00                00013467889


Q ss_pred             ccCCCCcccCCCCCC
Q psy11797        164 RSLTNGRCVLPTGPA  178 (249)
Q Consensus       164 ~~~~~~~C~c~~g~~  178 (249)
                      ++.++|+|.+..++.
T Consensus       290 ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  290 NIDGQYRCVCFSGLI  304 (350)
T ss_pred             cCCccEEEEecccce
Confidence            999999999988865


No 18 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.53  E-value=4.7e-05  Score=43.52  Aligned_cols=31  Identities=42%  Similarity=0.943  Sum_probs=22.8

Q ss_pred             cCCCCCCCC--CCceecCCCeeeecCCCccccC
Q psy11797         34 CRTPANTCK--FSCKNLIGSYMCTCPPGYQQVT   64 (249)
Q Consensus        34 C~~~~~~c~--~~C~n~~gsy~C~C~~G~~g~~   64 (249)
                      |+.+++.|.  ++|+++.++|.|.|++||+|++
T Consensus         1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             TTTGGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            344455565  8999999999999999999976


No 19 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.51  E-value=6.2e-05  Score=41.89  Aligned_cols=30  Identities=37%  Similarity=0.897  Sum_probs=23.3

Q ss_pred             cCCCCCCCCCCceecC-CCeeeecCCCcccc
Q psy11797         34 CRTPANTCKFSCKNLI-GSYMCTCPPGYQQV   63 (249)
Q Consensus        34 C~~~~~~c~~~C~n~~-gsy~C~C~~G~~g~   63 (249)
                      |.+.++..+++|++.. ++|.|.|++||+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            3444555567888887 88999999999985


No 20 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.45  E-value=0.00018  Score=41.15  Aligned_cols=36  Identities=42%  Similarity=0.926  Sum_probs=26.8

Q ss_pred             CCCccCC-CCCCCCCCceecCCCeeeecCCCccccCC
Q psy11797         30 DVDECRT-PANTCKFSCKNLIGSYMCTCPPGYQQVTH   65 (249)
Q Consensus        30 did~C~~-~~~~c~~~C~n~~gsy~C~C~~G~~g~~~   65 (249)
                      ++++|.. .++...+.|++..++|.|.|++||.|..+
T Consensus         1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            3566765 34443578999999999999999988543


No 21 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.41  E-value=0.00021  Score=41.24  Aligned_cols=37  Identities=51%  Similarity=1.296  Sum_probs=28.3

Q ss_pred             cCcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797         89 DVNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQC  129 (249)
Q Consensus        89 ~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C  129 (249)
                      ++++|... .+|. ++.|+++.++|.|.|++||.   +++.|
T Consensus         1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C   38 (39)
T smart00179        1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT---DGRNC   38 (39)
T ss_pred             CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCc---cCCcC
Confidence            35677642 6787 45999999999999999997   35554


No 22 
>KOG1225|consensus
Probab=97.33  E-value=0.0011  Score=60.51  Aligned_cols=121  Identities=27%  Similarity=0.601  Sum_probs=67.9

Q ss_pred             eeecCCCccccCCCccccc---------ccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeCCCcceeecCCCcee
Q psy11797         53 MCTCPPGYQQVTHSTVAIA---------TTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNLEGSYRCECERGFKL  122 (249)
Q Consensus        53 ~C~C~~G~~g~~~~~~~~~---------~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~  122 (249)
                      .|.|+.+|.+..+......         .-..++|..||++..|.. -.|.   ..|. ++.+++  +  .|.|++||. 
T Consensus       235 ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e-~~Cp---~~cs~~g~~~~--g--~CiC~~g~~-  305 (525)
T KOG1225|consen  235 ICECPEGYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDE-LVCP---VDCSGGGVCVD--G--ECICNPGYS-  305 (525)
T ss_pred             eeecCCceeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCc-ccCC---cccCCCceecC--C--EeecCCCcc-
Confidence            6777777777666522110         112245666666666643 2344   2355 445543  3  788999988 


Q ss_pred             CCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccCCCCcceeecCCCCccCCCCcc
Q psy11797        123 SLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLMEVTRMDCCCTMGMAWGPQCQL  202 (249)
Q Consensus       123 ~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~~~~~~~C~C~~g~~~g~~C~~  202 (249)
                         |+.|+...                 | ...|...+.|.    ...|.|.+||++....... |.      .+..|+ 
T Consensus       306 ---G~dCs~~~-----------------c-padC~g~G~Ci----~G~C~C~~Gy~G~~C~~~~-C~------~~g~cv-  352 (525)
T KOG1225|consen  306 ---GKDCSIRR-----------------C-PADCSGHGKCI----DGECLCDEGYTGELCIQRA-CS------GGGQCV-  352 (525)
T ss_pred             ---cccccccc-----------------C-CccCCCCCccc----CCceEeCCCCcCCcccccc-cC------CCceec-
Confidence               77776511                 1 12356777777    3458888888877533331 21      233441 


Q ss_pred             CCCCCCchhhccCCCCCccCCC
Q psy11797        203 CPTRGSQEYTDLCLESGLTVDG  224 (249)
Q Consensus       203 C~~~~~~~~~c~Cp~~G~~~~~  224 (249)
                            . - |.| ..||+|..
T Consensus       353 ------~-g-C~C-~~Gw~G~d  365 (525)
T KOG1225|consen  353 ------N-G-CKC-KKGWRGPD  365 (525)
T ss_pred             ------c-C-cee-ccCccCCC
Confidence                  1 2 777 78888753


No 23 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.25  E-value=0.00021  Score=40.85  Aligned_cols=31  Identities=42%  Similarity=0.972  Sum_probs=22.2

Q ss_pred             CCCCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797         97 LDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQC  129 (249)
Q Consensus        97 ~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C  129 (249)
                      .+.|. ++.|++++++|.|.|++||.  .+|..|
T Consensus         5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~--GdG~~C   36 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTCTCKPGYE--GDGFFC   36 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEEEE-CEEE--CCSTCE
T ss_pred             CCCCCCCcEeecCCCCEEeECCCCCc--cCCcCC
Confidence            45677 78999999999999999998  335443


No 24 
>PF00683 TB:  TB domain;  InterPro: IPR002212 Transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1[]. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs) which are localized to fibrillar structures in the extracellular matrix [].; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=96.87  E-value=0.00017  Score=42.66  Aligned_cols=30  Identities=47%  Similarity=1.367  Sum_probs=23.2

Q ss_pred             CCcceeecCCCCccCCCCccCCCCCCchhh
Q psy11797        183 VTRMDCCCTMGMAWGPQCQLCPTRGSQEYT  212 (249)
Q Consensus       183 ~~~~~C~C~~g~~~g~~C~~C~~~~~~~~~  212 (249)
                      +++.+|.|+.|.+||..|++||.+++..|.
T Consensus        11 ~tk~~CCCs~G~aWG~~Ce~CP~~~t~ef~   40 (42)
T PF00683_consen   11 VTKSECCCSVGRAWGSPCEPCPPPGTDEFN   40 (42)
T ss_dssp             EEHHHHHTTT-SEETTTTEE---TTSHHHH
T ss_pred             eeccccCCCCCCcCCCccccCCCCCChHHh
Confidence            577899999999999999999999998775


No 25 
>KOG1225|consensus
Probab=96.80  E-value=0.0091  Score=54.54  Aligned_cols=74  Identities=23%  Similarity=0.402  Sum_probs=42.3

Q ss_pred             eeecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCccCCCCcceeecCC
Q psy11797        113 RCECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALLMEVTRMDCCCTM  192 (249)
Q Consensus       113 ~C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~~~~~~~~C~C~~  192 (249)
                      +|.|++||+    |..|...                 .|... |..++.+++.    .|.|.++|.+.... ..+|  ..
T Consensus       266 ~CIC~~Gf~----G~dC~e~-----------------~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs-~~~c--pa  316 (525)
T KOG1225|consen  266 RCICPPGFT----GDDCDEL-----------------VCPVD-CSGGGVCVDG----ECICNPGYSGKDCS-IRRC--PA  316 (525)
T ss_pred             eEeCCCCCc----CCCCCcc-----------------cCCcc-cCCCceecCC----EeecCCCccccccc-cccC--Cc
Confidence            689999998    8777651                 12222 3444555432    58888888766421 1112  22


Q ss_pred             CCccCCCCccCCCCCCchhhccCCCCCccCCC
Q psy11797        193 GMAWGPQCQLCPTRGSQEYTDLCLESGLTVDG  224 (249)
Q Consensus       193 g~~~g~~C~~C~~~~~~~~~c~Cp~~G~~~~~  224 (249)
                      .....+.|    ++    -+|.| .+||+|+-
T Consensus       317 dC~g~G~C----i~----G~C~C-~~Gy~G~~  339 (525)
T KOG1225|consen  317 DCSGHGKC----ID----GECLC-DEGYTGEL  339 (525)
T ss_pred             cCCCCCcc----cC----CceEe-CCCCcCCc
Confidence            22123445    33    45999 79999964


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.79  E-value=0.0019  Score=36.65  Aligned_cols=35  Identities=46%  Similarity=1.249  Sum_probs=26.8

Q ss_pred             CcccccCCCCCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797         90 VNECELNLDSCA-NGRCVNLEGSYRCECERGFKLSLDGKQC  129 (249)
Q Consensus        90 i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C  129 (249)
                      +++|... .+|. ++.|++..+.|.|.|+.||.    ++.|
T Consensus         2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~----g~~C   37 (38)
T cd00054           2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYT----GRNC   37 (38)
T ss_pred             cccCCCC-CCcCCCCEeECCCCCeEeECCCCCc----CCcC
Confidence            4566532 5677 56999999999999999998    5555


No 27 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.63  E-value=0.0018  Score=35.90  Aligned_cols=24  Identities=42%  Similarity=1.284  Sum_probs=21.6

Q ss_pred             CCCC-CCeeeeCC-CcceeecCCCce
Q psy11797         98 DSCA-NGRCVNLE-GSYRCECERGFK  121 (249)
Q Consensus        98 ~~C~-~g~C~~~~-g~~~C~C~~G~~  121 (249)
                      ++|. +|+|++.. +.|.|.|++||.
T Consensus         4 ~~C~n~g~C~~~~~~~y~C~C~~G~~   29 (32)
T PF00008_consen    4 NPCQNGGTCIDLPGGGYTCECPPGYT   29 (32)
T ss_dssp             TSSTTTEEEEEESTSEEEEEEBTTEE
T ss_pred             CcCCCCeEEEeCCCCCEEeECCCCCc
Confidence            5888 46999998 999999999998


No 28 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.35  E-value=0.0049  Score=34.31  Aligned_cols=21  Identities=52%  Similarity=1.103  Sum_probs=19.1

Q ss_pred             CCceecCCCeeeecCCCcccc
Q psy11797         43 FSCKNLIGSYMCTCPPGYQQV   63 (249)
Q Consensus        43 ~~C~n~~gsy~C~C~~G~~g~   63 (249)
                      +.|++..++|.|.|+.||.+.
T Consensus        12 ~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053          12 GTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CEEecCCCCeEeECCCCCccc
Confidence            788888899999999999885


No 29 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=96.32  E-value=0.0038  Score=51.21  Aligned_cols=44  Identities=32%  Similarity=0.712  Sum_probs=36.7

Q ss_pred             CCCceecCcccccCCCCCCCCeeeeCCCcceeecCCCceeCCCCC
Q psy11797         83 KSHECVDVNECELNLDSCANGRCVNLEGSYRCECERGFKLSLDGK  127 (249)
Q Consensus        83 ~~~~C~~i~~C~~~~~~C~~g~C~~~~g~~~C~C~~G~~~~~~g~  127 (249)
                      .+..|.++++|....+.|.+ .|.++.|+|.|.|++||.+..+++
T Consensus       180 ~~~~C~~~~~C~~~~~~c~~-~C~~~~g~~~c~c~~g~~~~~~~~  223 (224)
T cd01475         180 QGKICVVPDLCATLSHVCQQ-VCISTPGSYLCACTEGYALLEDNK  223 (224)
T ss_pred             ccccCcCchhhcCCCCCccc-eEEcCCCCEEeECCCCccCCCCCC
Confidence            44568889999876677886 899999999999999999876654


No 30 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.32  E-value=0.0052  Score=34.46  Aligned_cols=19  Identities=58%  Similarity=1.399  Sum_probs=17.8

Q ss_pred             CceecCCCeeeecCCCccc
Q psy11797         44 SCKNLIGSYMCTCPPGYQQ   62 (249)
Q Consensus        44 ~C~n~~gsy~C~C~~G~~g   62 (249)
                      +|++..++|.|+|++||.|
T Consensus        12 ~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181       12 TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             EEECCCCCeEeECCCCCcc
Confidence            7888899999999999988


No 31 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.21  E-value=0.007  Score=33.91  Aligned_cols=24  Identities=46%  Similarity=1.247  Sum_probs=20.7

Q ss_pred             CCCCCCeeeeCCCcceeecCCCce
Q psy11797         98 DSCANGRCVNLEGSYRCECERGFK  121 (249)
Q Consensus        98 ~~C~~g~C~~~~g~~~C~C~~G~~  121 (249)
                      .+|.+++|+++.++|.|.|++||.
T Consensus         6 ~~C~~~~C~~~~~~~~C~C~~g~~   29 (35)
T smart00181        6 GPCSNGTCINTPGSYTCSCPPGYT   29 (35)
T ss_pred             CCCCCCEEECCCCCeEeECCCCCc
Confidence            467743899999999999999998


No 32 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.19  E-value=0.0014  Score=51.16  Aligned_cols=135  Identities=24%  Similarity=0.558  Sum_probs=72.6

Q ss_pred             CCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccC---CCCCC-CCeeeeCC-----Ccce
Q psy11797         43 FSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELN---LDSCA-NGRCVNLE-----GSYR  113 (249)
Q Consensus        43 ~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~---~~~C~-~g~C~~~~-----g~~~  113 (249)
                      +..+.+...|.|.|.+||.... +                  .+|+...+|...   ..+|. .+.|++.+     ..|.
T Consensus        11 G~LiQMSNHfEC~Cnegfvl~~-E------------------ntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~   71 (197)
T PF06247_consen   11 GYLIQMSNHFECKCNEGFVLKN-E------------------NTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYK   71 (197)
T ss_dssp             EEEEEESSEEEEEESTTEEEEE-T------------------TEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEE
T ss_pred             CEEEEccCceEEEcCCCcEEcc-c------------------cccccceecCcccccCccccchhhhhcCCCcccceeEE
Confidence            4455567789999999998763 2                  578888788752   34687 57898865     5799


Q ss_pred             eecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCcccccccc---CCCCcccCCCCCCccCCCCcceeec
Q psy11797        114 CECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRS---LTNGRCVLPTGPALLMEVTRMDCCC  190 (249)
Q Consensus       114 C~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~---~~~~~C~c~~g~~~~~~~~~~~C~C  190 (249)
                      |.|.+||.+..+  .|.+                 ..|+...|. .|.|+-.   .....|+|.-|++..   +...|. 
T Consensus        72 C~C~~gY~~~~~--vCvp-----------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~---dn~kCt-  127 (197)
T PF06247_consen   72 CDCINGYILKQG--VCVP-----------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPD---DNKKCT-  127 (197)
T ss_dssp             EEE-TTEEESSS--SEEE-----------------GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETT---TTTESE-
T ss_pred             EecccCceeeCC--eEch-----------------hhcCceecC-CCeEEecCCCCCCceeEeeeceEec---cCCccc-
Confidence            999999997643  5654                 122222233 4555432   223478888888722   233332 


Q ss_pred             CCCCccCCCCc-------cCCCCCCchhhccCCCCCccCCCC
Q psy11797        191 TMGMAWGPQCQ-------LCPTRGSQEYTDLCLESGLTVDGR  225 (249)
Q Consensus       191 ~~g~~~g~~C~-------~C~~~~~~~~~c~Cp~~G~~~~~~  225 (249)
                      ..|   --.|.       .| .....-|+|.| ..||.++++
T Consensus       128 k~G---~T~C~LKCk~nE~C-K~~~~~Y~C~~-~~~~~~~~~  164 (197)
T PF06247_consen  128 KTG---ETKCSLKCKENEEC-KLVDGYYKCVC-KEGFPGDGE  164 (197)
T ss_dssp             EEE-----------TTTEEE-EEETTEEEEEE--TT-EEETT
T ss_pred             CCC---ccceeeecCCCcce-eeeCcEEEeec-CCCCCCCCC
Confidence            111   11122       12 22234588999 888877543


No 33 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.07  E-value=0.0089  Score=33.21  Aligned_cols=24  Identities=50%  Similarity=1.272  Sum_probs=21.0

Q ss_pred             CCCC-CCeeeeCCCcceeecCCCce
Q psy11797         98 DSCA-NGRCVNLEGSYRCECERGFK  121 (249)
Q Consensus        98 ~~C~-~g~C~~~~g~~~C~C~~G~~  121 (249)
                      .+|. ++.|+++.+.|.|.|+.||.
T Consensus         6 ~~C~~~~~C~~~~~~~~C~C~~g~~   30 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCVCPPGYT   30 (36)
T ss_pred             CCCCCCCEEecCCCCeEeECCCCCc
Confidence            5676 47999999999999999997


No 34 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=95.84  E-value=0.0073  Score=49.49  Aligned_cols=37  Identities=32%  Similarity=0.782  Sum_probs=32.5

Q ss_pred             CCCCccCCCCCCCCCCceecCCCeeeecCCCccccCC
Q psy11797         29 IDVDECRTPANTCKFSCKNLIGSYMCTCPPGYQQVTH   65 (249)
Q Consensus        29 ~did~C~~~~~~c~~~C~n~~gsy~C~C~~G~~g~~~   65 (249)
                      .++++|...++.|...|.++.|+|.|.|+.||++...
T Consensus       185 ~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~  221 (224)
T cd01475         185 VVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLED  221 (224)
T ss_pred             cCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCC
Confidence            4678999888889999999999999999999987543


No 35 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.29  E-value=0.028  Score=24.40  Aligned_cols=12  Identities=42%  Similarity=1.276  Sum_probs=9.2

Q ss_pred             eeecCCCccccC
Q psy11797         53 MCTCPPGYQQVT   64 (249)
Q Consensus        53 ~C~C~~G~~g~~   64 (249)
                      .|.|++||+|..
T Consensus         1 ~C~C~~G~~G~~   12 (13)
T PF12661_consen    1 TCQCPPGWTGPN   12 (13)
T ss_dssp             EEEE-TTEETTT
T ss_pred             CccCcCCCcCCC
Confidence            489999999864


No 36 
>KOG0994|consensus
Probab=92.80  E-value=1.5  Score=43.78  Aligned_cols=60  Identities=25%  Similarity=0.348  Sum_probs=31.5

Q ss_pred             ccccCCCCcc-cCCCCCCccCC----CCcceeecCCCCccC----CCCccCCCCCCchhhccCCCCCccCCC
Q psy11797        162 CYRSLTNGRC-VLPTGPALLME----VTRMDCCCTMGMAWG----PQCQLCPTRGSQEYTDLCLESGLTVDG  224 (249)
Q Consensus       162 C~~~~~~~~C-~c~~g~~~~~~----~~~~~C~C~~g~~~g----~~C~~C~~~~~~~~~c~Cp~~G~~~~~  224 (249)
                      |.+..+++.| .|..||.++..    ..-..|.|..|.+.|    ..|.+  -+.+....|.| .+||+|..
T Consensus       878 CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~--d~~t~~ivC~C-~~GY~G~R  946 (1758)
T KOG0994|consen  878 CQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYL--DTRTQQIVCHC-QEGYSGSR  946 (1758)
T ss_pred             ccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccc--cccccceeeec-ccCccccc
Confidence            3445566666 67788877642    122344455443222    22311  11223446899 89999853


No 37 
>KOG0994|consensus
Probab=92.56  E-value=0.34  Score=47.91  Aligned_cols=32  Identities=19%  Similarity=0.568  Sum_probs=21.5

Q ss_pred             CcccccCCCCCCCCeeeeCCCccee-ecCCCceeC
Q psy11797         90 VNECELNLDSCANGRCVNLEGSYRC-ECERGFKLS  123 (249)
Q Consensus        90 i~~C~~~~~~C~~g~C~~~~g~~~C-~C~~G~~~~  123 (249)
                      .+.|....+.|.  .|.+...++.| .|..||+..
T Consensus       865 A~~Cd~~tGaCi--~CqD~T~G~~CdrCl~GyyGd  897 (1758)
T KOG0994|consen  865 ADTCDPITGACI--DCQDSTTGHSCDRCLDGYYGD  897 (1758)
T ss_pred             ccccCccccccc--cccccccccchhhhhccccCC
Confidence            345554445555  36677788888 699999843


No 38 
>KOG1226|consensus
Probab=90.28  E-value=3.5  Score=39.39  Aligned_cols=23  Identities=22%  Similarity=0.613  Sum_probs=16.9

Q ss_pred             CCceecCCCee---eecCCCccccCCC
Q psy11797         43 FSCKNLIGSYM---CTCPPGYQQVTHS   66 (249)
Q Consensus        43 ~~C~n~~gsy~---C~C~~G~~g~~~~   66 (249)
                      +.|. ..|.|.   |.|.+||.|..|+
T Consensus       467 ~~C~-g~G~~~CG~C~C~~G~~G~~CE  492 (783)
T KOG1226|consen  467 ALCH-GNGTFVCGQCRCDEGWLGKKCE  492 (783)
T ss_pred             cccC-CCCcEEecceecCCCCCCCccc
Confidence            4554 456666   4899999998887


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=89.90  E-value=0.52  Score=25.96  Aligned_cols=25  Identities=40%  Similarity=1.152  Sum_probs=19.3

Q ss_pred             CCC-CCeeeeCCCcceeecCCCceeCCCCCCc
Q psy11797         99 SCA-NGRCVNLEGSYRCECERGFKLSLDGKQC  129 (249)
Q Consensus        99 ~C~-~g~C~~~~g~~~C~C~~G~~~~~~g~~C  129 (249)
                      .|. +|+|+..  ..+|.|.+||.    |..|
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~----G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYT----GPDC   32 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCc----CCCC
Confidence            466 7899876  46899999998    6554


No 40 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=88.26  E-value=0.39  Score=37.83  Aligned_cols=99  Identities=25%  Similarity=0.475  Sum_probs=53.0

Q ss_pred             CCceecC-----CCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCCCCeeeeCC---Cccee
Q psy11797         43 FSCKNLI-----GSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCANGRCVNLE---GSYRC  114 (249)
Q Consensus        43 ~~C~n~~-----gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~~g~C~~~~---g~~~C  114 (249)
                      +.|++.+     ..|.|.|.+||.....                    .|. .+.|..  -.|..|.|+-.+   ....|
T Consensus        56 a~C~~~~~~~~~~~~~C~C~~gY~~~~~--------------------vCv-p~~C~~--~~Cg~GKCI~d~~~~~~~~C  112 (197)
T PF06247_consen   56 AKCINQANKGEERAYKCDCINGYILKQG--------------------VCV-PNKCNN--KDCGSGKCILDPDNPNNPTC  112 (197)
T ss_dssp             EEEEE-SSTTSSTSEEEEE-TTEEESSS--------------------SEE-EGGGSS-----TTEEEEEEEGGGSEEEE
T ss_pred             hhhhcCCCcccceeEEEecccCceeeCC--------------------eEc-hhhcCc--eecCCCeEEecCCCCCCcee
Confidence            7787665     4699999999987653                    243 234552  467778897543   34589


Q ss_pred             ecCCCceeCCCCCCcccCCCccceeeeecCCcccccccCCCCCccccccccCCCCcccCCCCCCcc
Q psy11797        115 ECERGFKLSLDGKQCLGKGQFVEFRIILSMPKAENSVNNNNNQRLGFCYRSLTNGRCVLPTGPALL  180 (249)
Q Consensus       115 ~C~~G~~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~C~c~~g~~~~  180 (249)
                      +|.-|+. -.+...|...++              ..|.+. |..+..|....+-|+|.+..++.+.
T Consensus       113 SC~IGkV-~~dn~kCtk~G~--------------T~C~LK-Ck~nE~CK~~~~~Y~C~~~~~~~~~  162 (197)
T PF06247_consen  113 SCNIGKV-PDDNKKCTKTGE--------------TKCSLK-CKENEECKLVDGYYKCVCKEGFPGD  162 (197)
T ss_dssp             EE-TEEE-TTTTTESEEEE-----------------------TTTEEEEEETTEEEEEE-TT-EEE
T ss_pred             EeeeceE-eccCCcccCCCc--------------cceeee-cCCCcceeeeCcEEEeecCCCCCCC
Confidence            9999998 344556655221              112111 2445566655555666666665444


No 41 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.00  E-value=1.1  Score=25.52  Aligned_cols=25  Identities=32%  Similarity=0.616  Sum_probs=18.4

Q ss_pred             CCCCCceecC-CCeeeecCCCccccC
Q psy11797         40 TCKFSCKNLI-GSYMCTCPPGYQQVT   64 (249)
Q Consensus        40 ~c~~~C~n~~-gsy~C~C~~G~~g~~   64 (249)
                      +-.+.|.+.. |++.|+|.+||+...
T Consensus         8 P~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    8 PANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             -TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             CCCcccEEcCCCCEEEEeeCCccccC
Confidence            3347888776 899999999998644


No 42 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=76.82  E-value=2.3  Score=31.34  Aligned_cols=39  Identities=23%  Similarity=0.732  Sum_probs=29.3

Q ss_pred             cCcccccC-CCCCCCCeeeeCC--CcceeecCCCceeCCCCCCccc
Q psy11797         89 DVNECELN-LDSCANGRCVNLE--GSYRCECERGFKLSLDGKQCLG  131 (249)
Q Consensus        89 ~i~~C~~~-~~~C~~g~C~~~~--g~~~C~C~~G~~~~~~g~~C~~  131 (249)
                      ++.+|... .+-|.||.|.-.+  ..+.|.|+.||.    |..|+.
T Consensus        41 ~i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~GYt----GeRCEh   82 (139)
T PHA03099         41 AIRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHGYT----GIRCQH   82 (139)
T ss_pred             ccccCChhhCCEeECCEEEeeccCCCceeECCCCcc----cccccc
Confidence            45566533 3567788887654  679999999999    999976


No 43 
>KOG1226|consensus
Probab=75.46  E-value=7.1  Score=37.41  Aligned_cols=15  Identities=40%  Similarity=1.201  Sum_probs=11.5

Q ss_pred             eeecCCCceeCCCCCCccc
Q psy11797        113 RCECERGFKLSLDGKQCLG  131 (249)
Q Consensus       113 ~C~C~~G~~~~~~g~~C~~  131 (249)
                      .|.|.+||.    |+.|+=
T Consensus       479 ~C~C~~G~~----G~~CEC  493 (783)
T KOG1226|consen  479 QCRCDEGWL----GKKCEC  493 (783)
T ss_pred             ceecCCCCC----CCcccC
Confidence            368999998    888753


No 44 
>KOG1836|consensus
Probab=74.25  E-value=10  Score=40.17  Aligned_cols=14  Identities=43%  Similarity=0.985  Sum_probs=12.2

Q ss_pred             eecCCCccccCCCc
Q psy11797         54 CTCPPGYQQVTHST   67 (249)
Q Consensus        54 C~C~~G~~g~~~~~   67 (249)
                      |.|+.||+|+.++.
T Consensus       697 c~C~~g~tG~~Ce~  710 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCES  710 (1705)
T ss_pred             ccCCCCcccchhhh
Confidence            89999999998873


No 45 
>smart00051 DSL delta serrate ligand.
Probab=70.29  E-value=9.1  Score=24.61  Aligned_cols=16  Identities=19%  Similarity=0.266  Sum_probs=12.5

Q ss_pred             CeeeecCCCccccCCC
Q psy11797         51 SYMCTCPPGYQQVTHS   66 (249)
Q Consensus        51 sy~C~C~~G~~g~~~~   66 (249)
                      .+.-.|.++|.|..|.
T Consensus        16 ~~rv~C~~~~yG~~C~   31 (63)
T smart00051       16 QIRVTCDENYYGEGCN   31 (63)
T ss_pred             EEEeeCCCCCcCCccC
Confidence            3556799999998877


No 46 
>KOG1836|consensus
Probab=65.94  E-value=19  Score=38.26  Aligned_cols=71  Identities=24%  Similarity=0.604  Sum_probs=39.8

Q ss_pred             CccCCCCCCCCCCceecCCCeee-ecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCC-CCeeeeC-
Q psy11797         32 DECRTPANTCKFSCKNLIGSYMC-TCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCA-NGRCVNL-  108 (249)
Q Consensus        32 d~C~~~~~~c~~~C~n~~gsy~C-~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~-~g~C~~~-  108 (249)
                      +.|......|.  |+....+-.| .|.+||+|......                    .-| |.  +=+|. .+.|..+ 
T Consensus       738 ~~Cd~~tG~C~--C~~~t~G~~C~~C~~GfYg~~~~~~--------------------~~d-C~--~C~Cp~~~~~~~~~  792 (1705)
T KOG1836|consen  738 NICDPRTGQCK--CKHNTFGGQCAQCVDGFYGLPDLGT--------------------SGD-CQ--PCPCPNGGACGQTP  792 (1705)
T ss_pred             ccccCCCCcee--cccCCCCCchhhhcCCCCCccccCC--------------------CCC-Cc--cCCCCCChhhcCcC
Confidence            34444443443  5544444566 68999988765421                    111 43  13343 2244443 


Q ss_pred             -CCcceee-cCCCceeCCCCCCccc
Q psy11797        109 -EGSYRCE-CERGFKLSLDGKQCLG  131 (249)
Q Consensus       109 -~g~~~C~-C~~G~~~~~~g~~C~~  131 (249)
                       .....|. |++||+    |..|+.
T Consensus       793 ~~~~~iCk~Cp~gyt----G~rCe~  813 (1705)
T KOG1836|consen  793 EILEVVCKNCPPGYT----GLRCEE  813 (1705)
T ss_pred             cccceecCCCCCCCc----cccccc
Confidence             3456787 999999    888876


No 47 
>PHA02887 EGF-like protein; Provisional
Probab=57.51  E-value=9.9  Score=27.62  Aligned_cols=38  Identities=32%  Similarity=0.832  Sum_probs=26.8

Q ss_pred             CcccccC-CCCCCCCeeeeCC--CcceeecCCCceeCCCCCCccc
Q psy11797         90 VNECELN-LDSCANGRCVNLE--GSYRCECERGFKLSLDGKQCLG  131 (249)
Q Consensus        90 i~~C~~~-~~~C~~g~C~~~~--g~~~C~C~~G~~~~~~g~~C~~  131 (249)
                      +.+|... .+-|.||.|.-..  ....|.|+.||.    |..|+.
T Consensus        83 f~pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GYt----G~RCE~  123 (126)
T PHA02887         83 FEKCKNDFNDFCINGECMNIIDLDEKFCICNKGYT----GIRCDE  123 (126)
T ss_pred             ccccChHhhCEeeCCEEEccccCCCceeECCCCcc----cCCCCc
Confidence            3455432 3457778887644  568899999999    888875


No 48 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=56.61  E-value=9.4  Score=28.21  Aligned_cols=24  Identities=25%  Similarity=0.641  Sum_probs=19.1

Q ss_pred             CCceec--CCCeeeecCCCccccCCC
Q psy11797         43 FSCKNL--IGSYMCTCPPGYQQVTHS   66 (249)
Q Consensus        43 ~~C~n~--~gsy~C~C~~G~~g~~~~   66 (249)
                      +.|.-.  ...+.|.|..||+|..|+
T Consensus        56 G~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         56 GDCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             CEEEeeccCCCceeECCCCccccccc
Confidence            567544  367999999999998876


No 49 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=56.34  E-value=13  Score=20.67  Aligned_cols=24  Identities=29%  Similarity=0.676  Sum_probs=15.8

Q ss_pred             CCCCCceecCCCeeeecCCCccccC
Q psy11797         40 TCKFSCKNLIGSYMCTCPPGYQQVT   64 (249)
Q Consensus        40 ~c~~~C~n~~gsy~C~C~~G~~g~~   64 (249)
                      .|.+.|... ..+.|.||.||..+.
T Consensus         7 ~CpA~CDpn-~~~~C~CPeGyIlde   30 (34)
T PF09064_consen    7 ECPADCDPN-SPGQCFCPEGYILDE   30 (34)
T ss_pred             cCCCccCCC-CCCceeCCCceEecC
Confidence            345666442 234899999998754


No 50 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=56.19  E-value=12  Score=26.75  Aligned_cols=30  Identities=33%  Similarity=0.923  Sum_probs=22.4

Q ss_pred             CcccccCCCCCC-CCeeeeCCCcceeecCCCce
Q psy11797         90 VNECELNLDSCA-NGRCVNLEGSYRCECERGFK  121 (249)
Q Consensus        90 i~~C~~~~~~C~-~g~C~~~~g~~~C~C~~G~~  121 (249)
                      .+.|.. .+.|. +|.|.. .....|.|.+||.
T Consensus        77 ~d~Cd~-y~~CG~~g~C~~-~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   77 KDQCDV-YGFCGPNGICNS-NNSPKCSCLPGFE  107 (110)
T ss_pred             ccCCCC-ccccCCccEeCC-CCCCceECCCCcC
Confidence            456775 47888 789943 4566799999997


No 51 
>KOG3512|consensus
Probab=42.96  E-value=90  Score=28.60  Aligned_cols=25  Identities=16%  Similarity=0.211  Sum_probs=19.5

Q ss_pred             CCceecCCC-eeeecCCCccccCCCc
Q psy11797         43 FSCKNLIGS-YMCTCPPGYQQVTHST   67 (249)
Q Consensus        43 ~~C~n~~gs-y~C~C~~G~~g~~~~~   67 (249)
                      ..|+-..++ ++|.|...-.|..|+.
T Consensus       285 s~Cv~d~~~~ltCdC~HNTaGPdCgr  310 (592)
T KOG3512|consen  285 SRCVMDESSHLTCDCEHNTAGPDCGR  310 (592)
T ss_pred             ceeeeccCCceEEecccCCCCCCccc
Confidence            467665555 9999999999988874


No 52 
>KOG1215|consensus
Probab=42.77  E-value=43  Score=33.26  Aligned_cols=66  Identities=29%  Similarity=0.697  Sum_probs=43.5

Q ss_pred             CCCCCceecCCCeeeecCCCccccCCCcccccccCCccccCCCCCCceecCcccccCCCCCCCCeee-eCCCcceeecCC
Q psy11797         40 TCKFSCKNLIGSYMCTCPPGYQQVTHSTVAIATTDTRTAESGGKSHECVDVNECELNLDSCANGRCV-NLEGSYRCECER  118 (249)
Q Consensus        40 ~c~~~C~n~~gsy~C~C~~G~~g~~~~~~~~~~~~~~~~~~~~~~~~C~~i~~C~~~~~~C~~g~C~-~~~g~~~C~C~~  118 (249)
                      .+.+.+.+......+.|..+++.....                  .  .+...|....+.|.+ .|+ +.++.|.|.|..
T Consensus       334 ~~~~~~~~~~v~~~~~~~~~~~~~~~~------------------~--~~~~~~~~~~g~Csq-~C~~~~p~~~~c~c~~  392 (877)
T KOG1215|consen  334 KCSHKCPDVSVGPRCDCMGAKVLPLGA------------------R--TDSNPCESDNGGCSQ-LCVPNSPGTFKCACSP  392 (877)
T ss_pred             cccCCCCccccCCcccCCccceecccc------------------c--ccCCcccccCCccce-eccCCCCCceeEecCC
Confidence            333455566666777777777664433                  1  122344444567776 788 568999999999


Q ss_pred             CceeCCCC
Q psy11797        119 GFKLSLDG  126 (249)
Q Consensus       119 G~~~~~~g  126 (249)
                      ||.+..++
T Consensus       393 g~~~~~~~  400 (877)
T KOG1215|consen  393 GYELRLDK  400 (877)
T ss_pred             CcEeccCC
Confidence            99987765


No 53 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=37.10  E-value=93  Score=18.56  Aligned_cols=16  Identities=31%  Similarity=0.997  Sum_probs=8.5

Q ss_pred             eeecCCCCccCCCCccC
Q psy11797        187 DCCCTMGMAWGPQCQLC  203 (249)
Q Consensus       187 ~C~C~~g~~~g~~C~~C  203 (249)
                      +|.|+.++. |..|+.|
T Consensus        20 ~C~C~~~~~-G~~C~~C   35 (50)
T cd00055          20 QCECKPNTT-GRRCDRC   35 (50)
T ss_pred             EEeCCCcCC-CCCCCCC
Confidence            455555544 5556443


No 54 
>KOG0196|consensus
Probab=33.36  E-value=51  Score=32.37  Aligned_cols=51  Identities=25%  Similarity=0.484  Sum_probs=32.9

Q ss_pred             CCcccCCCCCCccCCCCccee-ecCCCCc----cCCCCccCCCCC----CchhhccCCCCCcc
Q psy11797        168 NGRCVLPTGPALLMEVTRMDC-CCTMGMA----WGPQCQLCPTRG----SQEYTDLCLESGLT  221 (249)
Q Consensus       168 ~~~C~c~~g~~~~~~~~~~~C-~C~~g~~----~g~~C~~C~~~~----~~~~~c~Cp~~G~~  221 (249)
                      ...|.|.+||...  .....| .|..|+.    .-..|..||...    .+.-.|.| ..||.
T Consensus       258 iG~C~C~aGye~~--~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C-~~gyy  317 (996)
T KOG0196|consen  258 IGGCVCKAGYEEA--ENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTC-ENGYY  317 (996)
T ss_pred             cCceeecCCCCcc--cCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccc-cCCcc
Confidence            3568899998764  345566 3666651    135788887543    34567999 78864


No 55 
>KOG3516|consensus
Probab=32.45  E-value=33  Score=34.87  Aligned_cols=39  Identities=31%  Similarity=0.888  Sum_probs=30.5

Q ss_pred             ceecCcccccCCCCCCCC-eeeeCCCcceeecC-CCceeCCCCCCcc
Q psy11797         86 ECVDVNECELNLDSCANG-RCVNLEGSYRCECE-RGFKLSLDGKQCL  130 (249)
Q Consensus        86 ~C~~i~~C~~~~~~C~~g-~C~~~~g~~~C~C~-~G~~~~~~g~~C~  130 (249)
                      .|.-++.|.  +++|+|| .|......|.|.|. .||+    |.+|.
T Consensus       541 ~C~i~drCl--PN~CehgG~C~Qs~~~f~C~C~~TGY~----GatCH  581 (1306)
T KOG3516|consen  541 MCGISDRCL--PNPCEHGGKCSQSWDDFECNCELTGYK----GATCH  581 (1306)
T ss_pred             ccccccccC--CccccCCCcccccccceeEeccccccc----ccccc
Confidence            345566676  5899965 99988889999999 7898    77775


No 56 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=27.43  E-value=1.4e+02  Score=17.53  Aligned_cols=16  Identities=31%  Similarity=1.043  Sum_probs=8.3

Q ss_pred             eeecCCCCccCCCCccC
Q psy11797        187 DCCCTMGMAWGPQCQLC  203 (249)
Q Consensus       187 ~C~C~~g~~~g~~C~~C  203 (249)
                      +|.|+.++. |..|+.|
T Consensus        19 ~C~C~~~~~-G~~C~~C   34 (46)
T smart00180       19 QCECKPNVT-GRRCDRC   34 (46)
T ss_pred             EEECCCCCC-CCCCCcC
Confidence            455555544 5555443


No 57 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=27.20  E-value=40  Score=24.04  Aligned_cols=31  Identities=26%  Similarity=0.789  Sum_probs=21.6

Q ss_pred             cccccCCCCCC-CCeeeeCC-----CcceeecCCCce
Q psy11797         91 NECELNLDSCA-NGRCVNLE-----GSYRCECERGFK  121 (249)
Q Consensus        91 ~~C~~~~~~C~-~g~C~~~~-----g~~~C~C~~G~~  121 (249)
                      +.|....+.|. ||.|++..     .=|.|.|.+.+.
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~   42 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVV   42 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeecccc
Confidence            44555557787 89999863     338899988544


No 58 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=24.00  E-value=40  Score=20.56  Aligned_cols=18  Identities=22%  Similarity=0.661  Sum_probs=13.2

Q ss_pred             eeecCCCceeCCCCCCccc
Q psy11797        113 RCECERGFKLSLDGKQCLG  131 (249)
Q Consensus       113 ~C~C~~G~~~~~~g~~C~~  131 (249)
                      .|.|++||.+... ..|..
T Consensus        34 gC~C~~G~v~~~~-~~CV~   51 (55)
T PF01826_consen   34 GCFCPPGYVRNDN-GRCVP   51 (55)
T ss_dssp             EEEETTTEEEETT-SEEEE
T ss_pred             cCCCCCCeeEcCC-CCEEc
Confidence            3899999987654 46655


No 59 
>KOG3516|consensus
Probab=21.67  E-value=77  Score=32.39  Aligned_cols=37  Identities=24%  Similarity=0.516  Sum_probs=29.4

Q ss_pred             CCccCCCCCCCCCCceecCCCeeeecC-CCccccCCCc
Q psy11797         31 VDECRTPANTCKFSCKNLIGSYMCTCP-PGYQQVTHST   67 (249)
Q Consensus        31 id~C~~~~~~c~~~C~n~~gsy~C~C~-~G~~g~~~~~   67 (249)
                      +|.|.++++..++.|......|.|.|. .||+|..|..
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt  582 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT  582 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEeccccccccccccC
Confidence            366777777777889887788999998 8999988763


Done!