Query         psy5614
Match_columns 98
No_of_seqs    144 out of 1033
Neff          7.2 
Searched_HMMs 46136
Date          Fri Aug 16 22:04:34 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5614.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5614hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.5 4.6E-14 9.9E-19  115.1   5.4   71    6-97    798-873 (1289)
  2 PF12947 EGF_3:  EGF domain;  I  99.0 2.1E-10 4.5E-15   60.7   1.7   31   57-88      6-36  (36)
  3 KOG1214|consensus               98.9   2E-09 4.2E-14   88.6   6.6   88    3-91    702-827 (1289)
  4 PF12947 EGF_3:  EGF domain;  I  98.7 1.1E-08 2.3E-13   54.1   2.0   29    2-30      7-36  (36)
  5 KOG1219|consensus               98.5 1.2E-07 2.6E-12   84.2   5.6   79    3-84   3872-3975(4289)
  6 PF07645 EGF_CA:  Calcium-bindi  98.4 2.7E-07 5.7E-12   49.9   2.8   29   53-81      3-34  (42)
  7 PF00008 EGF:  EGF-like domain   98.3 3.8E-07 8.2E-12   46.7   2.0   27   57-83      4-31  (32)
  8 KOG4289|consensus               98.2 1.1E-06 2.3E-11   76.0   3.9   54   11-84   1218-1272(2531)
  9 smart00179 EGF_CA Calcium-bind  98.1 7.8E-06 1.7E-10   42.3   3.8   32   53-85      3-36  (39)
 10 KOG4260|consensus               97.7 2.7E-05 5.9E-10   57.8   2.9   78    3-82    152-269 (350)
 11 KOG4289|consensus               97.7   5E-05 1.1E-09   66.2   4.4   61    2-79   1246-1308(2531)
 12 cd00053 EGF Epidermal growth f  97.6 7.1E-05 1.5E-09   37.4   3.0   27   57-83      6-32  (36)
 13 cd00054 EGF_CA Calcium-binding  97.6 8.7E-05 1.9E-09   37.7   3.2   27   57-83      9-35  (38)
 14 PF00008 EGF:  EGF-like domain   97.5 5.3E-05 1.1E-09   38.6   1.6   25    2-26      5-31  (32)
 15 smart00181 EGF Epidermal growt  97.5 0.00018 3.9E-09   36.5   3.2   26   57-83      6-31  (35)
 16 KOG1225|consensus               97.4 0.00026 5.7E-09   56.5   5.4   62   14-83    264-338 (525)
 17 PF06247 Plasmod_Pvs28:  Plasmo  97.4 0.00041 8.9E-09   48.9   5.0   82    3-85     52-164 (197)
 18 PF12662 cEGF:  Complement Clr-  97.3 0.00022 4.9E-09   34.3   2.1   21   71-92      1-23  (24)
 19 PF07645 EGF_CA:  Calcium-bindi  97.3 0.00015 3.3E-09   38.9   1.6   24    2-25     11-35  (42)
 20 smart00179 EGF_CA Calcium-bind  97.2 0.00051 1.1E-08   35.2   2.9   27    3-30     11-38  (39)
 21 PF12946 EGF_MSP1_1:  MSP1 EGF   96.9 0.00039 8.5E-09   36.8   1.1   29    2-30      6-36  (37)
 22 cd00053 EGF Epidermal growth f  96.9   0.001 2.2E-08   33.0   2.7   25    2-26      7-32  (36)
 23 KOG1217|consensus               96.9  0.0034 7.3E-08   47.2   6.5   81    3-83    280-389 (487)
 24 KOG1217|consensus               96.9  0.0015 3.2E-08   49.2   4.4   60    7-85    243-306 (487)
 25 smart00181 EGF Epidermal growt  96.7  0.0023 4.9E-08   32.3   2.7   24    2-26      7-31  (35)
 26 cd00054 EGF_CA Calcium-binding  96.5  0.0029 6.3E-08   31.8   2.6   23    3-25     11-34  (38)
 27 PF12946 EGF_MSP1_1:  MSP1 EGF   96.5  0.0011 2.3E-08   35.1   0.8   31   57-88      5-36  (37)
 28 PF12662 cEGF:  Complement Clr-  96.4  0.0023 4.9E-08   30.8   1.5   19   14-32      1-21  (24)
 29 PF14670 FXa_inhibition:  Coagu  96.2  0.0045 9.8E-08   32.4   2.1   20   63-82     10-29  (36)
 30 KOG1225|consensus               96.1   0.015 3.3E-07   46.6   5.8   60   16-84    297-365 (525)
 31 PF07974 EGF_2:  EGF-like domai  96.1  0.0071 1.5E-07   30.8   2.5   24   58-83      7-30  (32)
 32 KOG1219|consensus               96.1  0.0088 1.9E-07   55.0   4.6   36   48-83   3896-3935(4289)
 33 PF12661 hEGF:  Human growth fa  95.5  0.0092   2E-07   24.6   1.3   10   16-25      1-10  (13)
 34 PF06247 Plasmod_Pvs28:  Plasmo  95.2   0.011 2.3E-07   41.8   1.5   69    6-91     10-88  (197)
 35 KOG4260|consensus               95.0   0.016 3.5E-07   43.3   2.1   57    3-81    246-304 (350)
 36 PF07974 EGF_2:  EGF-like domai  94.7   0.043 9.2E-07   27.9   2.6   21    3-25      8-29  (32)
 37 cd01475 vWA_Matrilin VWA_Matri  93.9   0.064 1.4E-06   37.8   3.1   32   50-83    183-219 (224)
 38 PF00954 S_locus_glycop:  S-loc  93.8   0.061 1.3E-06   34.1   2.6   29   53-82     78-108 (110)
 39 PF14670 FXa_inhibition:  Coagu  93.2   0.083 1.8E-06   27.6   2.0   18    8-25     11-29  (36)
 40 KOG1226|consensus               93.1    0.26 5.7E-06   41.2   5.7   72    8-85    468-579 (783)
 41 PF01683 EB:  EB module;  Inter  91.8    0.77 1.7E-05   25.1   4.9   25   57-85     26-50  (52)
 42 PHA02887 EGF-like protein; Pro  88.4    0.51 1.1E-05   31.1   2.6   23    6-30     96-121 (126)
 43 smart00051 DSL delta serrate l  87.2    0.82 1.8E-05   26.6   2.8   22   59-83     40-61  (63)
 44 PHA02887 EGF-like protein; Pro  86.9    0.72 1.6E-05   30.4   2.6   26   57-83     92-119 (126)
 45 KOG0994|consensus               82.8     1.8 3.8E-05   38.5   3.8   21   65-85    878-899 (1758)
 46 PHA03099 epidermal growth fact  78.0     2.3 4.9E-05   28.5   2.4   24    6-31     55-81  (139)
 47 PF00053 Laminin_EGF:  Laminin   77.5     2.2 4.7E-05   23.0   1.9   26   63-90     11-36  (49)
 48 PHA03099 epidermal growth fact  75.1     3.1 6.7E-05   27.9   2.4   30   57-90     51-82  (139)
 49 cd00055 EGF_Lam Laminin-type e  74.1     4.4 9.6E-05   22.0   2.6   19   72-90     19-37  (50)
 50 KOG0994|consensus               73.9     3.4 7.5E-05   36.8   3.0   19   11-29    881-900 (1758)
 51 PF09064 Tme5_EGF_like:  Thromb  73.5     3.2 6.8E-05   21.4   1.7   13   72-84     18-30  (34)
 52 KOG1226|consensus               69.4     7.4 0.00016   33.0   3.9   21    3-26    557-577 (783)
 53 KOG3607|consensus               62.0       9  0.0002   32.2   3.1   45   37-84    603-654 (716)
 54 smart00180 EGF_Lam Laminin-typ  60.8     8.5 0.00018   20.6   2.0   19   72-90     18-36  (46)
 55 KOG1836|consensus               59.4     9.9 0.00022   35.0   3.1   31   58-88    781-814 (1705)
 56 PF04863 EGF_alliinase:  Alliin  55.1     8.3 0.00018   22.1   1.3   28    1-30     17-49  (56)
 57 KOG3516|consensus               52.7      12 0.00026   33.3   2.4   31   53-83    546-578 (1306)
 58 cd00064 FU Furin-like repeats.  42.1      18 0.00039   19.2   1.4   23   14-36     14-37  (49)
 59 PF01826 TIL:  Trypsin Inhibito  34.1      13 0.00028   20.3  -0.1   19   73-91     34-52  (55)
 60 KOG0196|consensus               31.9      70  0.0015   27.9   3.8   58   15-82    259-318 (996)
 61 PF01414 DSL:  Delta serrate li  31.0      22 0.00049   20.5   0.6    8   18-25     53-60  (63)
 62 PF09402 MSC:  Man1-Src1p-C-ter  22.7      28 0.00061   25.8   0.0   24   57-83     53-76  (334)
 63 PTZ00214 high cysteine membran  21.3 3.2E+02  0.0069   23.6   5.9   23   69-92    748-770 (800)

No 1  
>KOG1214|consensus
Probab=99.47  E-value=4.6e-14  Score=115.13  Aligned_cols=71  Identities=34%  Similarity=0.762  Sum_probs=59.7

Q ss_pred             CcEEe--cCCCCeEEcCCCcEeCCCCCcc--cCcCCCCCCCCceeeCCeecCCCCCCCCCCCeeEecCCccEEeCCCCCe
Q psy5614           6 WIQCS--MQSNPSCSCLPNYIGAPPNCRP--ECVMNSECASNLACINEKCRDPCPGSCGYNAQCKVINHTPICTCPEGYI   81 (98)
Q Consensus         6 ~a~C~--~~g~~~C~C~~Gy~Gdg~~C~~--eC~~~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~   81 (98)
                      +++|+  ..+.|+|.|+|||.|||..|.+  ||..                    +.|+++|.|+|+.++|.|+|++||.
T Consensus       798 ~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~p--------------------srChp~A~CyntpgsfsC~C~pGy~  857 (1289)
T KOG1214|consen  798 QARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSP--------------------SRCHPAATCYNTPGSFSCRCQPGYY  857 (1289)
T ss_pred             ceEEEecCCceEEEeecCCccCCccccccccccCc--------------------cccCCCceEecCCCcceeecccCcc
Confidence            34566  6789999999999999998863  4542                    5799999999999999999999999


Q ss_pred             eCCCCCCccC-CCCCCC
Q psy5614          82 GDPFSSCFPK-PPERKC   97 (98)
Q Consensus        82 G~g~~~C~~~-~~~~~c   97 (98)
                      |||+ +|+|. .+-.+|
T Consensus       858 GDGf-~CVP~~~~~T~C  873 (1289)
T KOG1214|consen  858 GDGF-QCVPDTSSLTPC  873 (1289)
T ss_pred             CCCc-eecCCCccCCcc
Confidence            9999 89997 555555


No 2  
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.98  E-value=2.1e-10  Score=60.67  Aligned_cols=31  Identities=45%  Similarity=0.938  Sum_probs=25.9

Q ss_pred             CCCCCCCeeEecCCccEEeCCCCCeeCCCCCC
Q psy5614          57 GSCGYNAQCKVINHTPICTCPEGYIGDPFSSC   88 (98)
Q Consensus        57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~~~C   88 (98)
                      +.|+++|+|+++.++|+|+|++||.|||+ .|
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~-~C   36 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF-FC   36 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCST-CE
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCCc-CC
Confidence            57999999999999999999999999998 44


No 3  
>KOG1214|consensus
Probab=98.94  E-value=2e-09  Score=88.62  Aligned_cols=88  Identities=30%  Similarity=0.696  Sum_probs=68.0

Q ss_pred             ccCCcEEe--cCCCCeEEcCCCcEeCCCCCcc--cCcC-CCCCCCCceeeC-------------------Ceec------
Q psy5614           3 WLMWIQCS--MQSNPSCSCLPNYIGAPPNCRP--ECVM-NSECASNLACIN-------------------EKCR------   52 (98)
Q Consensus         3 ~~~~a~C~--~~g~~~C~C~~Gy~Gdg~~C~~--eC~~-~~~C~~~~~C~~-------------------~~C~------   52 (98)
                      |..++.|.  ..-.|+|.|..||.|+|+.|.+  ||.. ...|.++.+|++                   .+|+      
T Consensus       702 cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pa  781 (1289)
T KOG1214|consen  702 CDTTARCHPGTGVDYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPA  781 (1289)
T ss_pred             cCCCccccCCCCcceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCC
Confidence            67889999  6678999999999999999985  6664 356777777742                   4666      


Q ss_pred             --CCCC---CCCCCCC--eeEecCC-ccEEeCCCCCeeCCCCCCccC
Q psy5614          53 --DPCP---GSCGYNA--QCKVINH-TPICTCPEGYIGDPFSSCFPK   91 (98)
Q Consensus        53 --d~C~---~~C~~~a--~C~~~~g-~~~C~C~~Gy~G~g~~~C~~~   91 (98)
                        ++|.   +.|+.++  .|+..++ .|+|+|.|||.|||+ .|...
T Consensus       782 p~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~-~c~dv  827 (1289)
T KOG1214|consen  782 PANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH-QCTDV  827 (1289)
T ss_pred             CCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc-ccccc
Confidence              5673   6888755  5666555 899999999999998 56554


No 4  
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.68  E-value=1.1e-08  Score=54.06  Aligned_cols=29  Identities=24%  Similarity=0.536  Sum_probs=24.0

Q ss_pred             CccCCcEEe-cCCCCeEEcCCCcEeCCCCC
Q psy5614           2 SWLMWIQCS-MQSNPSCSCLPNYIGAPPNC   30 (98)
Q Consensus         2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C   30 (98)
                      .|+++|+|+ +.++|+|+|++||.|||+.|
T Consensus         7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    7 GCHPNATCTNTGGSYTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             GS-TTCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred             CCCCCcEeecCCCCEEeECCCCCccCCcCC
Confidence            489999999 88899999999999999875


No 5  
>KOG1219|consensus
Probab=98.54  E-value=1.2e-07  Score=84.19  Aligned_cols=79  Identities=25%  Similarity=0.665  Sum_probs=65.1

Q ss_pred             ccCCcEEe--cCCCCeEEcCCCcEeCCCCCcc---cCcCCCCCCCCcee---------------eCCeec----CCCC-C
Q psy5614           3 WLMWIQCS--MQSNPSCSCLPNYIGAPPNCRP---ECVMNSECASNLAC---------------INEKCR----DPCP-G   57 (98)
Q Consensus         3 ~~~~a~C~--~~g~~~C~C~~Gy~Gdg~~C~~---eC~~~~~C~~~~~C---------------~~~~C~----d~C~-~   57 (98)
                      |...++|.  ..++|.|.|++-|+|  .+|+.   .|.. .+|..+++|               ++.+|+    ++|+ .
T Consensus      3872 CqhgG~C~~~~~ggy~CkCpsqysG--~~CEi~~epC~s-nPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n 3948 (4289)
T KOG1219|consen 3872 CQHGGTCISQPKGGYKCKCPSQYSG--NHCEIDLEPCAS-NPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKN 3948 (4289)
T ss_pred             ccCCCEecCCCCCceEEeCcccccC--cccccccccccC-CCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccc
Confidence            67889999  778999999999999  78873   3543 356655555               256888    7897 8


Q ss_pred             CCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614          58 SCGYNAQCKVINHTPICTCPEGYIGDP   84 (98)
Q Consensus        58 ~C~~~a~C~~~~g~~~C~C~~Gy~G~g   84 (98)
                      +|..++.|+|..|+|.|.|-+||.|.+
T Consensus      3949 ~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3949 VCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred             cccCCceeeccCCceEeccChhHhccc
Confidence            999999999999999999999999865


No 6  
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.40  E-value=2.7e-07  Score=49.91  Aligned_cols=29  Identities=34%  Similarity=0.843  Sum_probs=26.0

Q ss_pred             CCCC---CCCCCCCeeEecCCccEEeCCCCCe
Q psy5614          53 DPCP---GSCGYNAQCKVINHTPICTCPEGYI   81 (98)
Q Consensus        53 d~C~---~~C~~~a~C~~~~g~~~C~C~~Gy~   81 (98)
                      |||.   +.|..++.|+|+.|+|+|.|++||.
T Consensus         3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             STTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            4563   6899999999999999999999998


No 7  
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.31  E-value=3.8e-07  Score=46.74  Aligned_cols=27  Identities=30%  Similarity=0.789  Sum_probs=25.2

Q ss_pred             CCCCCCCeeEecC-CccEEeCCCCCeeC
Q psy5614          57 GSCGYNAQCKVIN-HTPICTCPEGYIGD   83 (98)
Q Consensus        57 ~~C~~~a~C~~~~-g~~~C~C~~Gy~G~   83 (98)
                      .+|.++|+|+... +.|+|+|++||+|+
T Consensus         4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    4 NPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            5899999999998 89999999999996


No 8  
>KOG4289|consensus
Probab=98.22  E-value=1.1e-06  Score=76.02  Aligned_cols=54  Identities=31%  Similarity=0.843  Sum_probs=44.5

Q ss_pred             cCCCCeEEcCCCcEeCCCCCcccCcCCCCCCCCceeeCCeecCCC-CCCCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614          11 MQSNPSCSCLPNYIGAPPNCRPECVMNSECASNLACINEKCRDPC-PGSCGYNAQCKVINHTPICTCPEGYIGDP   84 (98)
Q Consensus        11 ~~g~~~C~C~~Gy~Gdg~~C~~eC~~~~~C~~~~~C~~~~C~d~C-~~~C~~~a~C~~~~g~~~C~C~~Gy~G~g   84 (98)
                      ..++++|.|++||+||  .|+.|    .              |+| +++|.+|+.|....|.|+|.|++||+|..
T Consensus      1218 pvnglrCrCPpGFTgd--~CeTe----i--------------DlCYs~pC~nng~C~srEggYtCeCrpg~tGeh 1272 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGD--YCETE----I--------------DLCYSGPCGNNGRCRSREGGYTCECRPGFTGEH 1272 (2531)
T ss_pred             ccCceeEeCCCCCCcc--cccch----h--------------HhhhcCCCCCCCceEEecCceeEEecCCccccc
Confidence            4578999999999995  66532    0              344 37999999999999999999999999965


No 9  
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.06  E-value=7.8e-06  Score=42.31  Aligned_cols=32  Identities=31%  Similarity=0.728  Sum_probs=26.5

Q ss_pred             CCCC--CCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614          53 DPCP--GSCGYNAQCKVINHTPICTCPEGYIGDPF   85 (98)
Q Consensus        53 d~C~--~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~   85 (98)
                      |+|.  .+|..++.|+++.++|+|.|++||+ +|.
T Consensus         3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~-~g~   36 (39)
T smart00179        3 DECASGNPCQNGGTCVNTVGSYRCECPPGYT-DGR   36 (39)
T ss_pred             ccCcCCCCcCCCCEeECCCCCeEeECCCCCc-cCC
Confidence            4553  5799999999999999999999999 443


No 10 
>KOG4260|consensus
Probab=97.71  E-value=2.7e-05  Score=57.81  Aligned_cols=78  Identities=27%  Similarity=0.606  Sum_probs=54.3

Q ss_pred             ccCCcEEe----cCCCCeEEcCCCcEeCCCCCcc-----------c-------CcCC--CCCCCC--c---eee------
Q psy5614           3 WLMWIQCS----MQSNPSCSCLPNYIGAPPNCRP-----------E-------CVMN--SECASN--L---ACI------   47 (98)
Q Consensus         3 ~~~~a~C~----~~g~~~C~C~~Gy~Gdg~~C~~-----------e-------C~~~--~~C~~~--~---~C~------   47 (98)
                      |.-+++|.    ..|+..|.|.+||+|  ..|.+           +       |...  ..|...  .   .|.      
T Consensus       152 C~GnG~C~GdGsR~GsGkCkC~~GY~G--p~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~ld  229 (350)
T KOG4260|consen  152 CFGNGSCHGDGSREGSGKCKCETGYTG--PLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLD  229 (350)
T ss_pred             cCCCCcccCCCCCCCCCcccccCCCCC--ccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceec
Confidence            56778887    567889999999999  45541           0       1110  122211  1   121      


Q ss_pred             CCeec--CCCC---CCCCCCCeeEecCCccEEeCCCCCee
Q psy5614          48 NEKCR--DPCP---GSCGYNAQCKVINHTPICTCPEGYIG   82 (98)
Q Consensus        48 ~~~C~--d~C~---~~C~~~a~C~~~~g~~~C~C~~Gy~G   82 (98)
                      -..|+  |+|.   .+|.+...|+|+.|||.|.+++||.+
T Consensus       230 e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  230 EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence            13566  7883   79999999999999999999999988


No 11 
>KOG4289|consensus
Probab=97.68  E-value=5e-05  Score=66.21  Aligned_cols=61  Identities=25%  Similarity=0.630  Sum_probs=47.4

Q ss_pred             CccCCcEEe-cCCCCeEEcCCCcEeCCCCCcccCcCCCCCCCCceeeCCeecCCCCCCCCCCCeeEec-CCccEEeCCCC
Q psy5614           2 SWLMWIQCS-MQSNPSCSCLPNYIGAPPNCRPECVMNSECASNLACINEKCRDPCPGSCGYNAQCKVI-NHTPICTCPEG   79 (98)
Q Consensus         2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C~~eC~~~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~-~g~~~C~C~~G   79 (98)
                      -|.+|++|. ..|+|+|+|++||+|  .+|+-. .....|.+              +.|.+.++|++. +|.|.|.|+.|
T Consensus      1246 pC~nng~C~srEggYtCeCrpg~tG--ehCEvs-~~agrCvp--------------GvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1246 PCGNNGRCRSREGGYTCECRPGFTG--EHCEVS-ARAGRCVP--------------GVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             CCCCCCceEEecCceeEEecCCccc--cceeee-cccCcccc--------------ceecCCCEEeecCCCceeccCCCc
Confidence            378999999 999999999999999  777521 01122222              578999999995 67999999998


No 12 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.63  E-value=7.1e-05  Score=37.43  Aligned_cols=27  Identities=33%  Similarity=0.840  Sum_probs=25.0

Q ss_pred             CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          57 GSCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      .+|.+++.|++..+.|+|.|++||.|+
T Consensus         6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCEEecCCCCeEeECCCCCccc
Confidence            578889999999999999999999997


No 13 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.61  E-value=8.7e-05  Score=37.68  Aligned_cols=27  Identities=30%  Similarity=0.764  Sum_probs=24.4

Q ss_pred             CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          57 GSCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      .+|.+++.|++..+.|+|.|++||.|.
T Consensus         9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~   35 (38)
T cd00054           9 NPCQNGGTCVNTVGSYRCSCPPGYTGR   35 (38)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCcCC
Confidence            478888999999999999999999883


No 14 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.53  E-value=5.3e-05  Score=38.63  Aligned_cols=25  Identities=24%  Similarity=0.546  Sum_probs=22.4

Q ss_pred             CccCCcEEe-cC-CCCeEEcCCCcEeC
Q psy5614           2 SWLMWIQCS-MQ-SNPSCSCLPNYIGA   26 (98)
Q Consensus         2 ~~~~~a~C~-~~-g~~~C~C~~Gy~Gd   26 (98)
                      .|..+|+|+ .. +.|+|+|++||+|.
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            488899999 66 99999999999994


No 15 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.47  E-value=0.00018  Score=36.50  Aligned_cols=26  Identities=38%  Similarity=0.982  Sum_probs=23.4

Q ss_pred             CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          57 GSCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      .+|..+ .|++..++|+|.|++||.|+
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccC
Confidence            378888 99999999999999999996


No 16 
>KOG1225|consensus
Probab=97.45  E-value=0.00026  Score=56.53  Aligned_cols=62  Identities=31%  Similarity=0.962  Sum_probs=37.8

Q ss_pred             CCeEEcCCCcEeCCCCCcc-cCcCCCCCCCCceeeCCeec------------CCCCCCCCCCCeeEecCCccEEeCCCCC
Q psy5614          14 NPSCSCLPNYIGAPPNCRP-ECVMNSECASNLACINEKCR------------DPCPGSCGYNAQCKVINHTPICTCPEGY   80 (98)
Q Consensus        14 ~~~C~C~~Gy~Gdg~~C~~-eC~~~~~C~~~~~C~~~~C~------------d~C~~~C~~~a~C~~~~g~~~C~C~~Gy   80 (98)
                      ..+|.|++||+|  ..|.. .|...  |+.+..+++..|+            ..|+..|..++.|+    .-+|.|.+||
T Consensus       264 ~G~CIC~~Gf~G--~dC~e~~Cp~~--cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~Ci----~G~C~C~~Gy  335 (525)
T KOG1225|consen  264 EGRCICPPGFTG--DDCDELVCPVD--CSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKCI----DGECLCDEGY  335 (525)
T ss_pred             CCeEeCCCCCcC--CCCCcccCCcc--cCCCceecCCEeecCCCccccccccccCCccCCCCCccc----CCceEeCCCC
Confidence            446889999988  56653 34322  5555555555554            22446677777777    2257777777


Q ss_pred             eeC
Q psy5614          81 IGD   83 (98)
Q Consensus        81 ~G~   83 (98)
                      +|+
T Consensus       336 ~G~  338 (525)
T KOG1225|consen  336 TGE  338 (525)
T ss_pred             cCC
Confidence            775


No 17 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.37  E-value=0.00041  Score=48.86  Aligned_cols=82  Identities=30%  Similarity=0.703  Sum_probs=51.0

Q ss_pred             ccCCcEEe-cC-----CCCeEEcCCCcEeCCCCCcc-cCcCCCCCCCC-----------cee---------eCCeec---
Q psy5614           3 WLMWIQCS-MQ-----SNPSCSCLPNYIGAPPNCRP-ECVMNSECASN-----------LAC---------INEKCR---   52 (98)
Q Consensus         3 ~~~~a~C~-~~-----g~~~C~C~~Gy~Gdg~~C~~-eC~~~~~C~~~-----------~~C---------~~~~C~---   52 (98)
                      |...|+|+ ..     ..|.|.|.+||+-....|.+ +|.. ..|...           ..|         .+.+|+   
T Consensus        52 Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~-~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G  130 (197)
T PF06247_consen   52 CGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNN-KDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTG  130 (197)
T ss_dssp             EETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS----TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE
T ss_pred             ccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCc-eecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCC
Confidence            67889998 33     68999999999876567774 5643 345421           123         135666   


Q ss_pred             -CCCCCCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614          53 -DPCPGSCGYNAQCKVINHTPICTCPEGYIGDPF   85 (98)
Q Consensus        53 -d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~   85 (98)
                       .+|+--|..+..|..+.+-|+|.|++||.+++-
T Consensus       131 ~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~  164 (197)
T PF06247_consen  131 ETKCSLKCKENEECKLVDGYYKCVCKEGFPGDGE  164 (197)
T ss_dssp             --------TTTEEEEEETTEEEEEE-TT-EEETT
T ss_pred             ccceeeecCCCcceeeeCcEEEeecCCCCCCCCC
Confidence             567667899999999999999999999987764


No 18 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.29  E-value=0.00022  Score=34.29  Aligned_cols=21  Identities=38%  Similarity=0.738  Sum_probs=17.7

Q ss_pred             ccEEeCCCCCe--eCCCCCCccCC
Q psy5614          71 TPICTCPEGYI--GDPFSSCFPKP   92 (98)
Q Consensus        71 ~~~C~C~~Gy~--G~g~~~C~~~~   92 (98)
                      +|+|+|++||+  .|+. .|+.++
T Consensus         1 sy~C~C~~Gy~l~~d~~-~C~DId   23 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGR-SCEDID   23 (24)
T ss_pred             CEEeeCCCCCcCCCCCC-ccccCC
Confidence            69999999998  6776 798875


No 19 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.28  E-value=0.00015  Score=38.94  Aligned_cols=24  Identities=29%  Similarity=0.433  Sum_probs=21.7

Q ss_pred             CccCCcEEe-cCCCCeEEcCCCcEe
Q psy5614           2 SWLMWIQCS-MQSNPSCSCLPNYIG   25 (98)
Q Consensus         2 ~~~~~a~C~-~~g~~~C~C~~Gy~G   25 (98)
                      .|..++.|+ +.|+|+|.|++||..
T Consensus        11 ~C~~~~~C~N~~Gsy~C~C~~Gy~~   35 (42)
T PF07645_consen   11 NCPENGTCVNTEGSYSCSCPPGYEL   35 (42)
T ss_dssp             SSSTTSEEEEETTEEEEEESTTEEE
T ss_pred             cCCCCCEEEcCCCCEEeeCCCCcEE
Confidence            377899999 999999999999984


No 20 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.17  E-value=0.00051  Score=35.25  Aligned_cols=27  Identities=26%  Similarity=0.515  Sum_probs=22.6

Q ss_pred             ccCCcEEe-cCCCCeEEcCCCcEeCCCCC
Q psy5614           3 WLMWIQCS-MQSNPSCSCLPNYIGAPPNC   30 (98)
Q Consensus         3 ~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C   30 (98)
                      |..++.|+ ..++|.|.|++||+ +|..|
T Consensus        11 C~~~~~C~~~~g~~~C~C~~g~~-~g~~C   38 (39)
T smart00179       11 CQNGGTCVNTVGSYRCECPPGYT-DGRNC   38 (39)
T ss_pred             cCCCCEeECCCCCeEeECCCCCc-cCCcC
Confidence            67788999 99999999999999 33555


No 21 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=96.95  E-value=0.00039  Score=36.76  Aligned_cols=29  Identities=21%  Similarity=0.426  Sum_probs=21.6

Q ss_pred             CccCCcEEe--cCCCCeEEcCCCcEeCCCCC
Q psy5614           2 SWLMWIQCS--MQSNPSCSCLPNYIGAPPNC   30 (98)
Q Consensus         2 ~~~~~a~C~--~~g~~~C~C~~Gy~Gdg~~C   30 (98)
                      .|..||.|+  .+|++.|.|+.||..++..|
T Consensus         6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~~C   36 (37)
T PF12946_consen    6 KCPANAGCFRYDDGSEECRCLLGYKKVGGKC   36 (37)
T ss_dssp             ---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred             cCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence            478999999  56999999999998876554


No 22 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.94  E-value=0.001  Score=33.01  Aligned_cols=25  Identities=24%  Similarity=0.474  Sum_probs=21.9

Q ss_pred             CccCCcEEe-cCCCCeEEcCCCcEeC
Q psy5614           2 SWLMWIQCS-MQSNPSCSCLPNYIGA   26 (98)
Q Consensus         2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gd   26 (98)
                      .|..++.|+ ..+.|+|.|+.||.|+
T Consensus         7 ~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           7 PCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCEEecCCCCeEeECCCCCccc
Confidence            366789999 8889999999999985


No 23 
>KOG1217|consensus
Probab=96.93  E-value=0.0034  Score=47.24  Aligned_cols=81  Identities=26%  Similarity=0.684  Sum_probs=57.3

Q ss_pred             ccCCcEEe-cCCCCeEEcCCCcEeCCC-CCc--ccCc---CCCCCCCCcee-----------------eCCeec---CCC
Q psy5614           3 WLMWIQCS-MQSNPSCSCLPNYIGAPP-NCR--PECV---MNSECASNLAC-----------------INEKCR---DPC   55 (98)
Q Consensus         3 ~~~~a~C~-~~g~~~C~C~~Gy~Gdg~-~C~--~eC~---~~~~C~~~~~C-----------------~~~~C~---d~C   55 (98)
                      |..+++|+ ..+.|.|.|++||.|..- .+.  .+|.   ....|....+|                 .+..|+   ++|
T Consensus       280 c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C  359 (487)
T KOG1217|consen  280 CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDEC  359 (487)
T ss_pred             cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccc
Confidence            67788999 777799999999999553 232  2443   12224443333                 235666   268


Q ss_pred             C-CCCCCCCeeEe-cCCccEEeCCCCCeeC
Q psy5614          56 P-GSCGYNAQCKV-INHTPICTCPEGYIGD   83 (98)
Q Consensus        56 ~-~~C~~~a~C~~-~~g~~~C~C~~Gy~G~   83 (98)
                      . .++..++.|++ ..++|+|.|+.+|.+.
T Consensus       360 ~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  360 ASSPCCPGGTCVNETPGSYRCACPAGFAGK  389 (487)
T ss_pred             cCCccccCCEeccCCCCCeEecCCCccccC
Confidence            5 56889999999 7889999999999873


No 24 
>KOG1217|consensus
Probab=96.91  E-value=0.0015  Score=49.20  Aligned_cols=60  Identities=28%  Similarity=0.658  Sum_probs=41.8

Q ss_pred             cEEe-cCCCCeEEcCCCcEeCC-CCCcccCcCCCCCCCCceeeCCeecCCCC-C-CCCCCCeeEecCCccEEeCCCCCee
Q psy5614           7 IQCS-MQSNPSCSCLPNYIGAP-PNCRPECVMNSECASNLACINEKCRDPCP-G-SCGYNAQCKVINHTPICTCPEGYIG   82 (98)
Q Consensus         7 a~C~-~~g~~~C~C~~Gy~Gdg-~~C~~eC~~~~~C~~~~~C~~~~C~d~C~-~-~C~~~a~C~~~~g~~~C~C~~Gy~G   82 (98)
                      ++|+ ..++|+|.|++||+++. ..+.+    .               ++|. . .|.++++|++..+.|+|.|++||+|
T Consensus       243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~----~---------------~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g  303 (487)
T KOG1217|consen  243 GTCVNTVGSYTCRCPEGYTGDACVTCVD----V---------------DSCALIASCPNGGTCVNVPGSYRCTCPPGFTG  303 (487)
T ss_pred             CcccccCCceeeeCCCCccccccceeee----c---------------cccCCCCccCCCCeeecCCCcceeeCCCCCCC
Confidence            5566 66667777777777743 12211    0               3442 2 3888999999999999999999999


Q ss_pred             CCC
Q psy5614          83 DPF   85 (98)
Q Consensus        83 ~g~   85 (98)
                      ...
T Consensus       304 ~~~  306 (487)
T KOG1217|consen  304 RLC  306 (487)
T ss_pred             CCC
Confidence            764


No 25 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.66  E-value=0.0023  Score=32.27  Aligned_cols=24  Identities=29%  Similarity=0.613  Sum_probs=20.9

Q ss_pred             CccCCcEEe-cCCCCeEEcCCCcEeC
Q psy5614           2 SWLMWIQCS-MQSNPSCSCLPNYIGA   26 (98)
Q Consensus         2 ~~~~~a~C~-~~g~~~C~C~~Gy~Gd   26 (98)
                      .|..+ +|+ ..++|+|.|++||.|+
T Consensus         7 ~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        7 PCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCC-EEECCCCCeEeECCCCCccC
Confidence            36677 899 8899999999999985


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.53  E-value=0.0029  Score=31.78  Aligned_cols=23  Identities=30%  Similarity=0.548  Sum_probs=20.7

Q ss_pred             ccCCcEEe-cCCCCeEEcCCCcEe
Q psy5614           3 WLMWIQCS-MQSNPSCSCLPNYIG   25 (98)
Q Consensus         3 ~~~~a~C~-~~g~~~C~C~~Gy~G   25 (98)
                      |..++.|+ ..+.|+|.|++||+|
T Consensus        11 C~~~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054          11 CQNGGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             cCCCCEeECCCCCeEeECCCCCcC
Confidence            66788999 888999999999998


No 27 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=96.51  E-value=0.0011  Score=35.10  Aligned_cols=31  Identities=29%  Similarity=0.509  Sum_probs=23.1

Q ss_pred             CCCCCCCeeEecC-CccEEeCCCCCeeCCCCCC
Q psy5614          57 GSCGYNAQCKVIN-HTPICTCPEGYIGDPFSSC   88 (98)
Q Consensus        57 ~~C~~~a~C~~~~-g~~~C~C~~Gy~G~g~~~C   88 (98)
                      ..|..||.|++.. |++.|.|..||..++. .|
T Consensus         5 ~~cP~NA~C~~~~dG~eecrCllgyk~~~~-~C   36 (37)
T PF12946_consen    5 TKCPANAGCFRYDDGSEECRCLLGYKKVGG-KC   36 (37)
T ss_dssp             S---TTEEEEEETTSEEEEEE-TTEEEETT-EE
T ss_pred             ccCCCCcccEEcCCCCEEEEeeCCccccCC-Cc
Confidence            4789999999976 8999999999998875 45


No 28 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.40  E-value=0.0023  Score=30.76  Aligned_cols=19  Identities=32%  Similarity=0.768  Sum_probs=15.1

Q ss_pred             CCeEEcCCCcE--eCCCCCcc
Q psy5614          14 NPSCSCLPNYI--GAPPNCRP   32 (98)
Q Consensus        14 ~~~C~C~~Gy~--Gdg~~C~~   32 (98)
                      +|+|.|++||.  .|+..|.+
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~D   21 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCED   21 (24)
T ss_pred             CEEeeCCCCCcCCCCCCcccc
Confidence            58999999997  56667753


No 29 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.19  E-value=0.0045  Score=32.42  Aligned_cols=20  Identities=30%  Similarity=0.625  Sum_probs=17.0

Q ss_pred             CeeEecCCccEEeCCCCCee
Q psy5614          63 AQCKVINHTPICTCPEGYIG   82 (98)
Q Consensus        63 a~C~~~~g~~~C~C~~Gy~G   82 (98)
                      -.|++++++|+|.|++||+-
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L   29 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKL   29 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE
T ss_pred             CCCccCCCceEeECCCCCEE
Confidence            37999999999999999974


No 30 
>KOG1225|consensus
Probab=96.13  E-value=0.015  Score=46.62  Aligned_cols=60  Identities=32%  Similarity=0.895  Sum_probs=42.8

Q ss_pred             eEEcCCCcEeCCCCCcc-cCcCCCCCCCCceeeCCeec-------CCCC-CCCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614          16 SCSCLPNYIGAPPNCRP-ECVMNSECASNLACINEKCR-------DPCP-GSCGYNAQCKVINHTPICTCPEGYIGDP   84 (98)
Q Consensus        16 ~C~C~~Gy~Gdg~~C~~-eC~~~~~C~~~~~C~~~~C~-------d~C~-~~C~~~a~C~~~~g~~~C~C~~Gy~G~g   84 (98)
                      .|.|++||.|  ..|+. +|.  ++|..++.|+..+|.       ..|. ..|..+..|++.     |+|+.||.|..
T Consensus       297 ~CiC~~g~~G--~dCs~~~cp--adC~g~G~Ci~G~C~C~~Gy~G~~C~~~~C~~~g~cv~g-----C~C~~Gw~G~d  365 (525)
T KOG1225|consen  297 ECICNPGYSG--KDCSIRRCP--ADCSGHGKCIDGECLCDEGYTGELCIQRACSGGGQCVNG-----CKCKKGWRGPD  365 (525)
T ss_pred             EeecCCCccc--cccccccCC--ccCCCCCcccCCceEeCCCCcCCcccccccCCCceeccC-----ceeccCccCCC
Confidence            7999999999  66652 343  567777888877776       3353 236666666532     99999999976


No 31 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.09  E-value=0.0071  Score=30.83  Aligned_cols=24  Identities=29%  Similarity=0.718  Sum_probs=20.5

Q ss_pred             CCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          58 SCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        58 ~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      .|..+++|+..  ..+|+|.+||+|+
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G~   30 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTGP   30 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcCC
Confidence            58889999876  4689999999996


No 32 
>KOG1219|consensus
Probab=96.08  E-value=0.0088  Score=55.02  Aligned_cols=36  Identities=31%  Similarity=0.944  Sum_probs=32.2

Q ss_pred             CCeec---CCCC-CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          48 NEKCR---DPCP-GSCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        48 ~~~C~---d~C~-~~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      +..|+   .+|. +||..+++|+...+.|.|.|+.||+|+
T Consensus      3896 G~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3896 GNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred             CcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence            45777   6785 899999999999999999999999995


No 33 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.53  E-value=0.0092  Score=24.57  Aligned_cols=10  Identities=40%  Similarity=1.348  Sum_probs=8.2

Q ss_pred             eEEcCCCcEe
Q psy5614          16 SCSCLPNYIG   25 (98)
Q Consensus        16 ~C~C~~Gy~G   25 (98)
                      +|+|++||+|
T Consensus         1 ~C~C~~G~~G   10 (13)
T PF12661_consen    1 TCQCPPGWTG   10 (13)
T ss_dssp             EEEE-TTEET
T ss_pred             CccCcCCCcC
Confidence            5899999999


No 34 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.17  E-value=0.011  Score=41.80  Aligned_cols=69  Identities=25%  Similarity=0.507  Sum_probs=44.6

Q ss_pred             CcEEe-cCCCCeEEcCCCcEe-CCCCCcc--cCcC-CCCCCCCceeeCCeecCCCCCCCCCCCeeEecC-----CccEEe
Q psy5614           6 WIQCS-MQSNPSCSCLPNYIG-APPNCRP--ECVM-NSECASNLACINEKCRDPCPGSCGYNAQCKVIN-----HTPICT   75 (98)
Q Consensus         6 ~a~C~-~~g~~~C~C~~Gy~G-dg~~C~~--eC~~-~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~~-----g~~~C~   75 (98)
                      |+.-+ +.+.|.|.|.+||.= +-..|++  +|.. ..     .           ..+|++.|.|++..     ..|+|.
T Consensus        10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~-----~-----------~K~Cgdya~C~~~~~~~~~~~~~C~   73 (197)
T PF06247_consen   10 NGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLEN-----V-----------NKPCGDYAKCINQANKGEERAYKCD   73 (197)
T ss_dssp             TEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GG-----T-----------TSEEETTEEEEE-SSTTSSTSEEEE
T ss_pred             CCEEEEccCceEEEcCCCcEEccccccccceecCcccc-----c-----------CccccchhhhhcCCCcccceeEEEe
Confidence            45556 778999999999964 1123432  2322 00     0           15899999999965     489999


Q ss_pred             CCCCCeeCCCCCCccC
Q psy5614          76 CPEGYIGDPFSSCFPK   91 (98)
Q Consensus        76 C~~Gy~G~g~~~C~~~   91 (98)
                      |.+||+-..- .|+|.
T Consensus        74 C~~gY~~~~~-vCvp~   88 (197)
T PF06247_consen   74 CINGYILKQG-VCVPN   88 (197)
T ss_dssp             E-TTEEESSS-SEEEG
T ss_pred             cccCceeeCC-eEchh
Confidence            9999987654 57653


No 35 
>KOG4260|consensus
Probab=95.01  E-value=0.016  Score=43.32  Aligned_cols=57  Identities=23%  Similarity=0.420  Sum_probs=42.7

Q ss_pred             ccCCcEEe-cCCCCeEEcCCCcEeCCCCCcccCcCCCCCCCCceeeCCeecCCCCCCC-CCCCeeEecCCccEEeCCCCC
Q psy5614           3 WLMWIQCS-MQSNPSCSCLPNYIGAPPNCRPECVMNSECASNLACINEKCRDPCPGSC-GYNAQCKVINHTPICTCPEGY   80 (98)
Q Consensus         3 ~~~~a~C~-~~g~~~C~C~~Gy~Gdg~~C~~eC~~~~~C~~~~~C~~~~C~d~C~~~C-~~~a~C~~~~g~~~C~C~~Gy   80 (98)
                      |.++..|+ +.|+|+|..++||.++    .++|+.-                  ...| ..+..|.++.++|+|+|..|+
T Consensus       246 c~~~qfCvNteGSf~C~dk~Gy~~g----~d~C~~~------------------~d~~~~kn~~c~ni~~~~r~v~f~~~  303 (350)
T KOG4260|consen  246 CKAHQFCVNTEGSFKCEDKEGYKKG----VDECQFC------------------ADVCASKNRPCMNIDGQYRCVCFSGL  303 (350)
T ss_pred             CChhheeecCCCceEecccccccCC----hHHhhhh------------------hhhcccCCCCcccCCccEEEEecccc
Confidence            56778899 9999999999999883    3344320                  0122 356788899999999999987


Q ss_pred             e
Q psy5614          81 I   81 (98)
Q Consensus        81 ~   81 (98)
                      .
T Consensus       304 ~  304 (350)
T KOG4260|consen  304 I  304 (350)
T ss_pred             e
Confidence            4


No 36 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.67  E-value=0.043  Score=27.91  Aligned_cols=21  Identities=24%  Similarity=0.401  Sum_probs=18.3

Q ss_pred             ccCCcEEe-cCCCCeEEcCCCcEe
Q psy5614           3 WLMWIQCS-MQSNPSCSCLPNYIG   25 (98)
Q Consensus         3 ~~~~a~C~-~~g~~~C~C~~Gy~G   25 (98)
                      |..+++|+ .  ..+|+|.+||+|
T Consensus         8 C~~~G~C~~~--~g~C~C~~g~~G   29 (32)
T PF07974_consen    8 CSGHGTCVSP--CGRCVCDSGYTG   29 (32)
T ss_pred             cCCCCEEeCC--CCEEECCCCCcC
Confidence            77888897 5  678999999999


No 37 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.91  E-value=0.064  Score=37.81  Aligned_cols=32  Identities=31%  Similarity=0.700  Sum_probs=24.5

Q ss_pred             eec--CCCC---CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          50 KCR--DPCP---GSCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        50 ~C~--d~C~---~~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      .|.  ++|.   +.|.  ..|.++.|+|.|.|++||+..
T Consensus       183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL  219 (224)
T ss_pred             cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence            454  5663   4564  579999999999999999753


No 38 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=93.83  E-value=0.061  Score=34.11  Aligned_cols=29  Identities=41%  Similarity=1.012  Sum_probs=23.2

Q ss_pred             CCCC--CCCCCCCeeEecCCccEEeCCCCCee
Q psy5614          53 DPCP--GSCGYNAQCKVINHTPICTCPEGYIG   82 (98)
Q Consensus        53 d~C~--~~C~~~a~C~~~~g~~~C~C~~Gy~G   82 (98)
                      |.|.  +.|++++.|. ...+..|.|.+||.-
T Consensus        78 d~Cd~y~~CG~~g~C~-~~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   78 DQCDVYGFCGPNGICN-SNNSPKCSCLPGFEP  108 (110)
T ss_pred             cCCCCccccCCccEeC-CCCCCceECCCCcCC
Confidence            5674  7899999995 445778999999964


No 39 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.16  E-value=0.083  Score=27.57  Aligned_cols=18  Identities=33%  Similarity=0.704  Sum_probs=14.8

Q ss_pred             EEe-cCCCCeEEcCCCcEe
Q psy5614           8 QCS-MQSNPSCSCLPNYIG   25 (98)
Q Consensus         8 ~C~-~~g~~~C~C~~Gy~G   25 (98)
                      .|+ ..++|+|.|++||+=
T Consensus        11 ~C~~~~g~~~C~C~~Gy~L   29 (36)
T PF14670_consen   11 ICVNTPGSYRCSCPPGYKL   29 (36)
T ss_dssp             EEEEETTSEEEE-STTEEE
T ss_pred             CCccCCCceEeECCCCCEE
Confidence            678 889999999999964


No 40 
>KOG1226|consensus
Probab=93.12  E-value=0.26  Score=41.21  Aligned_cols=72  Identities=26%  Similarity=0.766  Sum_probs=45.5

Q ss_pred             EEecCCCCe---EEcCCCcEeCCCCCc--c----------cCcCC---CCCCCCceee---------------CCeec-C
Q psy5614           8 QCSMQSNPS---CSCLPNYIGAPPNCR--P----------ECVMN---SECASNLACI---------------NEKCR-D   53 (98)
Q Consensus         8 ~C~~~g~~~---C~C~~Gy~Gdg~~C~--~----------eC~~~---~~C~~~~~C~---------------~~~C~-d   53 (98)
                      .|..+|.+.   |.|.+||.|  ..|+  .          .|+..   ..|+..+.|.               +..|+ |
T Consensus       468 ~C~g~G~~~CG~C~C~~G~~G--~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCECD  545 (783)
T KOG1226|consen  468 LCHGNGTFVCGQCRCDEGWLG--KKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCECD  545 (783)
T ss_pred             ccCCCCcEEecceecCCCCCC--CcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeeecc
Confidence            343345554   599999999  5554  1          25431   2577776552               34666 3


Q ss_pred             C--CC----CCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614          54 P--CP----GSCGYNAQCKVINHTPICTCPEGYIGDPF   85 (98)
Q Consensus        54 ~--C~----~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~   85 (98)
                      .  |.    ..|..+++|.=    -+|+|.+||+|.-.
T Consensus       546 nfsC~r~~g~lC~g~G~C~C----G~CvC~~GwtG~~C  579 (783)
T KOG1226|consen  546 NFSCERHKGVLCGGHGRCEC----GRCVCNPGWTGSAC  579 (783)
T ss_pred             CcccccccCcccCCCCeEeC----CcEEcCCCCccCCC
Confidence            3  43    35888888763    36999999999764


No 41 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=91.76  E-value=0.77  Score=25.13  Aligned_cols=25  Identities=28%  Similarity=0.575  Sum_probs=18.1

Q ss_pred             CCCCCCCeeEecCCccEEeCCCCCeeCCC
Q psy5614          57 GSCGYNAQCKVINHTPICTCPEGYIGDPF   85 (98)
Q Consensus        57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~g~   85 (98)
                      ..|..++.|++    -+|+|++||+-.+.
T Consensus        26 ~qC~~~s~C~~----g~C~C~~g~~~~~~   50 (52)
T PF01683_consen   26 EQCIGGSVCVN----GRCQCPPGYVEVGG   50 (52)
T ss_pred             CCCCCcCEEcC----CEeECCCCCEecCC
Confidence            45667888853    37999999976543


No 42 
>PHA02887 EGF-like protein; Provisional
Probab=88.38  E-value=0.51  Score=31.09  Aligned_cols=23  Identities=26%  Similarity=0.553  Sum_probs=19.2

Q ss_pred             CcEEe---cCCCCeEEcCCCcEeCCCCC
Q psy5614           6 WIQCS---MQSNPSCSCLPNYIGAPPNC   30 (98)
Q Consensus         6 ~a~C~---~~g~~~C~C~~Gy~Gdg~~C   30 (98)
                      |++|.   ......|.|..||+|  ..|
T Consensus        96 HG~C~yI~dL~epsCrC~~GYtG--~RC  121 (126)
T PHA02887         96 NGECMNIIDLDEKFCICNKGYTG--IRC  121 (126)
T ss_pred             CCEEEccccCCCceeECCCCccc--CCC
Confidence            78998   557899999999999  555


No 43 
>smart00051 DSL delta serrate ligand.
Probab=87.17  E-value=0.82  Score=26.60  Aligned_cols=22  Identities=27%  Similarity=0.406  Sum_probs=14.7

Q ss_pred             CCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          59 CGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        59 C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      ...+..|.. .|  .++|.+||+|+
T Consensus        40 ~~~~~~Cd~-~G--~~~C~~Gw~G~   61 (63)
T smart00051       40 FFGHYTCDE-NG--NKGCLEGWMGP   61 (63)
T ss_pred             ccCCccCCc-CC--CEecCCCCcCC
Confidence            345566643 23  47899999985


No 44 
>PHA02887 EGF-like protein; Provisional
Probab=86.89  E-value=0.72  Score=30.39  Aligned_cols=26  Identities=35%  Similarity=0.761  Sum_probs=20.9

Q ss_pred             CCCCCCCeeEecCC--ccEEeCCCCCeeC
Q psy5614          57 GSCGYNAQCKVINH--TPICTCPEGYIGD   83 (98)
Q Consensus        57 ~~C~~~a~C~~~~g--~~~C~C~~Gy~G~   83 (98)
                      +-|- +++|.....  .+.|.|.+||+|.
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~  119 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTGI  119 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCcccC
Confidence            5666 679988654  7899999999994


No 45 
>KOG0994|consensus
Probab=82.76  E-value=1.8  Score=38.48  Aligned_cols=21  Identities=38%  Similarity=0.906  Sum_probs=16.1

Q ss_pred             eEecCCccEE-eCCCCCeeCCC
Q psy5614          65 CKVINHTPIC-TCPEGYIGDPF   85 (98)
Q Consensus        65 C~~~~g~~~C-~C~~Gy~G~g~   85 (98)
                      |.....++.| .|..||.||..
T Consensus       878 CqD~T~G~~CdrCl~GyyGdP~  899 (1758)
T KOG0994|consen  878 CQDSTTGHSCDRCLDGYYGDPR  899 (1758)
T ss_pred             ccccccccchhhhhccccCCcc
Confidence            5555567777 89999999975


No 46 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=78.04  E-value=2.3  Score=28.52  Aligned_cols=24  Identities=25%  Similarity=0.577  Sum_probs=19.4

Q ss_pred             CcEEe---cCCCCeEEcCCCcEeCCCCCc
Q psy5614           6 WIQCS---MQSNPSCSCLPNYIGAPPNCR   31 (98)
Q Consensus         6 ~a~C~---~~g~~~C~C~~Gy~Gdg~~C~   31 (98)
                      |++|.   ....+.|.|..||+|  ..|+
T Consensus        55 HG~C~yI~dl~~~~CrC~~GYtG--eRCE   81 (139)
T PHA03099         55 HGDCIHARDIDGMYCRCSHGYTG--IRCQ   81 (139)
T ss_pred             CCEEEeeccCCCceeECCCCccc--cccc
Confidence            45898   458899999999999  5664


No 47 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=77.46  E-value=2.2  Score=23.00  Aligned_cols=26  Identities=31%  Similarity=0.691  Sum_probs=17.5

Q ss_pred             CeeEecCCccEEeCCCCCeeCCCCCCcc
Q psy5614          63 AQCKVINHTPICTCPEGYIGDPFSSCFP   90 (98)
Q Consensus        63 a~C~~~~g~~~C~C~~Gy~G~g~~~C~~   90 (98)
                      +.|...  ..+|.|+++|+|..-..|.+
T Consensus        11 ~~C~~~--~G~C~C~~~~~G~~C~~C~~   36 (49)
T PF00053_consen   11 QTCDPS--TGQCVCKPGTTGPRCDQCKP   36 (49)
T ss_dssp             SSEEET--CEEESBSTTEESTTS-EE-T
T ss_pred             CcccCC--CCEEeccccccCCcCcCCCC
Confidence            367663  44799999999987656654


No 48 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.07  E-value=3.1  Score=27.88  Aligned_cols=30  Identities=30%  Similarity=0.687  Sum_probs=22.5

Q ss_pred             CCCCCCCeeEecC--CccEEeCCCCCeeCCCCCCcc
Q psy5614          57 GSCGYNAQCKVIN--HTPICTCPEGYIGDPFSSCFP   90 (98)
Q Consensus        57 ~~C~~~a~C~~~~--g~~~C~C~~Gy~G~g~~~C~~   90 (98)
                      +-|-++ +|....  ..+.|.|..||+|.   +|.-
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGe---RCEh   82 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTGI---RCQH   82 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCcccc---cccc
Confidence            456554 898865  58999999999995   5643


No 49 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=74.14  E-value=4.4  Score=21.99  Aligned_cols=19  Identities=26%  Similarity=0.665  Sum_probs=13.6

Q ss_pred             cEEeCCCCCeeCCCCCCcc
Q psy5614          72 PICTCPEGYIGDPFSSCFP   90 (98)
Q Consensus        72 ~~C~C~~Gy~G~g~~~C~~   90 (98)
                      -+|.|+++|+|..-..|.+
T Consensus        19 G~C~C~~~~~G~~C~~C~~   37 (50)
T cd00055          19 GQCECKPNTTGRRCDRCAP   37 (50)
T ss_pred             CEEeCCCcCCCCCCCCCCC
Confidence            4688888888876656654


No 50 
>KOG0994|consensus
Probab=73.94  E-value=3.4  Score=36.78  Aligned_cols=19  Identities=37%  Similarity=0.793  Sum_probs=15.3

Q ss_pred             cCCCCeE-EcCCCcEeCCCC
Q psy5614          11 MQSNPSC-SCLPNYIGAPPN   29 (98)
Q Consensus        11 ~~g~~~C-~C~~Gy~Gdg~~   29 (98)
                      ..+++.| .|..||.||++.
T Consensus       881 ~T~G~~CdrCl~GyyGdP~l  900 (1758)
T KOG0994|consen  881 STTGHSCDRCLDGYYGDPRL  900 (1758)
T ss_pred             cccccchhhhhccccCCccc
Confidence            6677888 999999988653


No 51 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=73.48  E-value=3.2  Score=21.44  Aligned_cols=13  Identities=62%  Similarity=1.196  Sum_probs=10.8

Q ss_pred             cEEeCCCCCeeCC
Q psy5614          72 PICTCPEGYIGDP   84 (98)
Q Consensus        72 ~~C~C~~Gy~G~g   84 (98)
                      ..|.|+.||.-|-
T Consensus        18 ~~C~CPeGyIlde   30 (34)
T PF09064_consen   18 GQCFCPEGYILDE   30 (34)
T ss_pred             CceeCCCceEecC
Confidence            4899999998664


No 52 
>KOG1226|consensus
Probab=69.44  E-value=7.4  Score=32.96  Aligned_cols=21  Identities=24%  Similarity=0.534  Sum_probs=13.8

Q ss_pred             ccCCcEEecCCCCeEEcCCCcEeC
Q psy5614           3 WLMWIQCSMQSNPSCSCLPNYIGA   26 (98)
Q Consensus         3 ~~~~a~C~~~g~~~C~C~~Gy~Gd   26 (98)
                      |..|++|.   -.+|+|.+||+|+
T Consensus       557 C~g~G~C~---CG~CvC~~GwtG~  577 (783)
T KOG1226|consen  557 CGGHGRCE---CGRCVCNPGWTGS  577 (783)
T ss_pred             cCCCCeEe---CCcEEcCCCCccC
Confidence            34455553   2358999999994


No 53 
>KOG3607|consensus
Probab=61.98  E-value=9  Score=32.23  Aligned_cols=45  Identities=27%  Similarity=0.751  Sum_probs=34.0

Q ss_pred             CCCCCCCceeeCCeec-------CCCCCCCCCCCeeEecCCccEEeCCCCCeeCC
Q psy5614          37 NSECASNLACINEKCR-------DPCPGSCGYNAQCKVINHTPICTCPEGYIGDP   84 (98)
Q Consensus        37 ~~~C~~~~~C~~~~C~-------d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~G~g   84 (98)
                      ...|.+..+|++.+|.       +.|+..|+.++.|-+.   +.|.|.+||++..
T Consensus       603 Gt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~---~~ChC~~gwapp~  654 (716)
T KOG3607|consen  603 GTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNE---LNCHCEPGWAPPF  654 (716)
T ss_pred             CCccCCCceecCCcchhhhhhcccccccccCCCcccCCC---cceeeCCCCCCCc
Confidence            4557777788888885       4466668888888764   5699999998854


No 54 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=60.82  E-value=8.5  Score=20.57  Aligned_cols=19  Identities=26%  Similarity=0.679  Sum_probs=12.3

Q ss_pred             cEEeCCCCCeeCCCCCCcc
Q psy5614          72 PICTCPEGYIGDPFSSCFP   90 (98)
Q Consensus        72 ~~C~C~~Gy~G~g~~~C~~   90 (98)
                      -+|.|+++|+|..-..|.+
T Consensus        18 G~C~C~~~~~G~~C~~C~~   36 (46)
T smart00180       18 GQCECKPNVTGRRCDRCAP   36 (46)
T ss_pred             CEEECCCCCCCCCCCcCCC
Confidence            3688888888865444543


No 55 
>KOG1836|consensus
Probab=59.42  E-value=9.9  Score=35.01  Aligned_cols=31  Identities=29%  Similarity=0.786  Sum_probs=18.2

Q ss_pred             CCCCCCeeEecC--CccEEe-CCCCCeeCCCCCC
Q psy5614          58 SCGYNAQCKVIN--HTPICT-CPEGYIGDPFSSC   88 (98)
Q Consensus        58 ~C~~~a~C~~~~--g~~~C~-C~~Gy~G~g~~~C   88 (98)
                      +|.....|....  ....|+ |++||+|.-...|
T Consensus       781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c  814 (1705)
T KOG1836|consen  781 PCPNGGACGQTPEILEVVCKNCPPGYTGLRCEEC  814 (1705)
T ss_pred             CCCCChhhcCcCcccceecCCCCCCCcccccccC
Confidence            344444454433  466787 8888888544333


No 56 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=55.07  E-value=8.3  Score=22.05  Aligned_cols=28  Identities=29%  Similarity=0.597  Sum_probs=15.5

Q ss_pred             CCccCCcEEe-----cCCCCeEEcCCCcEeCCCCC
Q psy5614           1 MSWLMWIQCS-----MQSNPSCSCLPNYIGAPPNC   30 (98)
Q Consensus         1 ~~~~~~a~C~-----~~g~~~C~C~~Gy~Gdg~~C   30 (98)
                      ++|.-|++-.     ..|...|+|..-|.|  ..|
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~G--pdC   49 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGG--PDC   49 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEES--TTS
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCC--CCc
Confidence            3667777775     467788999999998  454


No 57 
>KOG3516|consensus
Probab=52.66  E-value=12  Score=33.32  Aligned_cols=31  Identities=29%  Similarity=0.697  Sum_probs=27.2

Q ss_pred             CCCC-CCCCCCCeeEecCCccEEeCC-CCCeeC
Q psy5614          53 DPCP-GSCGYNAQCKVINHTPICTCP-EGYIGD   83 (98)
Q Consensus        53 d~C~-~~C~~~a~C~~~~g~~~C~C~-~Gy~G~   83 (98)
                      |.|+ ++|...+.|......|.|.|. .||+|-
T Consensus       546 drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga  578 (1306)
T KOG3516|consen  546 DRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA  578 (1306)
T ss_pred             cccCCccccCCCcccccccceeEeccccccccc
Confidence            6674 899999999997789999999 999884


No 58 
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=42.08  E-value=18  Score=19.16  Aligned_cols=23  Identities=26%  Similarity=0.567  Sum_probs=15.1

Q ss_pred             CCeE-EcCCCcEeCCCCCcccCcC
Q psy5614          14 NPSC-SCLPNYIGAPPNCRPECVM   36 (98)
Q Consensus        14 ~~~C-~C~~Gy~Gdg~~C~~eC~~   36 (98)
                      ...| .|+.||..+...|.++|..
T Consensus        14 ~~~C~~C~~~~~~~~~~Cv~~C~~   37 (49)
T cd00064          14 PDQCTSCRHGFYLDGGTCVSECPE   37 (49)
T ss_pred             cCcCccCcCccCCCCCcccccCCC
Confidence            4567 8888886655677655543


No 59 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=34.14  E-value=13  Score=20.28  Aligned_cols=19  Identities=37%  Similarity=1.001  Sum_probs=13.9

Q ss_pred             EEeCCCCCeeCCCCCCccC
Q psy5614          73 ICTCPEGYIGDPFSSCFPK   91 (98)
Q Consensus        73 ~C~C~~Gy~G~g~~~C~~~   91 (98)
                      -|.|++||.-+....|++.
T Consensus        34 gC~C~~G~v~~~~~~CV~~   52 (55)
T PF01826_consen   34 GCFCPPGYVRNDNGRCVPP   52 (55)
T ss_dssp             EEEETTTEEEETTSEEEEG
T ss_pred             cCCCCCCeeEcCCCCEEcH
Confidence            3999999986554467764


No 60 
>KOG0196|consensus
Probab=31.94  E-value=70  Score=27.94  Aligned_cols=58  Identities=22%  Similarity=0.499  Sum_probs=30.8

Q ss_pred             CeEEcCCCcEe--CCCCCcccCcCCCCCCCCceeeCCeecCCCCCCCCCCCeeEecCCccEEeCCCCCee
Q psy5614          15 PSCSCLPNYIG--APPNCRPECVMNSECASNLACINEKCRDPCPGSCGYNAQCKVINHTPICTCPEGYIG   82 (98)
Q Consensus        15 ~~C~C~~Gy~G--dg~~C~~eC~~~~~C~~~~~C~~~~C~d~C~~~C~~~a~C~~~~g~~~C~C~~Gy~G   82 (98)
                      ..|.|++||.-  .+..|+ .|..+..=   ..=....    | .+|.+|..=. ..++..|.|..||.-
T Consensus       259 G~C~C~aGye~~~~~~~C~-aCp~G~yK---~~~~~~~----C-~~CP~~S~s~-~ega~~C~C~~gyyR  318 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQ-ACPPGTYK---ASQGDSL----C-LPCPPNSHSS-SEGATSCTCENGYYR  318 (996)
T ss_pred             CceeecCCCCcccCCCcce-eCCCCccc---CCCCCCC----C-CCCCCCCCCC-CCCCCcccccCCccc
Confidence            45899999965  455664 23221100   0000011    2 2455544322 467889999999963


No 61 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=30.99  E-value=22  Score=20.48  Aligned_cols=8  Identities=50%  Similarity=1.356  Sum_probs=5.4

Q ss_pred             EcCCCcEe
Q psy5614          18 SCLPNYIG   25 (98)
Q Consensus        18 ~C~~Gy~G   25 (98)
                      .|++||.|
T Consensus        53 ~C~~Gw~G   60 (63)
T PF01414_consen   53 VCLPGWTG   60 (63)
T ss_dssp             EE-TTEES
T ss_pred             CCCCCCcC
Confidence            67788887


No 62 
>PF09402 MSC:  Man1-Src1p-C-terminal domain;  InterPro: IPR018996 This entry represents the Inner nuclear membrane proteins MAN1 (also known as LEM domain-containing protein 3) and LEM domain-containing protein 2 (or LEM protein 2). Emerin and MAN1 are LEM domain-containing integral membrane proteins of the vertebrate nuclear envelope []. MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C-terminal nulceoplasmic region forms a DNA binding winged helix and binds to Smad []. LEM protein 2 is an essential protein involved in chromosome segregation and cell division, probably via its interaction with lmn-1, the main component of nuclear lamina. Has some overlapping function with emr-1.; GO: 0005639 integral to nuclear inner membrane; PDB: 2CH0_A.
Probab=22.70  E-value=28  Score=25.85  Aligned_cols=24  Identities=25%  Similarity=0.414  Sum_probs=0.0

Q ss_pred             CCCCCCCeeEecCCccEEeCCCCCeeC
Q psy5614          57 GSCGYNAQCKVINHTPICTCPEGYIGD   83 (98)
Q Consensus        57 ~~C~~~a~C~~~~g~~~C~C~~Gy~G~   83 (98)
                      -||.+||.|...   ..-.|.+||.-.
T Consensus        53 ~pCP~~a~C~~~---~~~~C~~~y~~~   76 (334)
T PF09402_consen   53 EPCPEHAICYPG---LKLECEPGYVLK   76 (334)
T ss_dssp             ---------------------------
T ss_pred             cccccccccccc---cccccccccccc
Confidence            389999999853   457899999865


No 63 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=21.33  E-value=3.2e+02  Score=23.57  Aligned_cols=23  Identities=17%  Similarity=0.446  Sum_probs=16.7

Q ss_pred             CCccEEeCCCCCeeCCCCCCccCC
Q psy5614          69 NHTPICTCPEGYIGDPFSSCFPKP   92 (98)
Q Consensus        69 ~g~~~C~C~~Gy~G~g~~~C~~~~   92 (98)
                      .....|+|..||.-.+- .|++..
T Consensus       748 ~~~~vC~C~~g~~l~~~-~c~~~~  770 (800)
T PTZ00214        748 SNQGVCMCELDAVLTKG-VCVPAK  770 (800)
T ss_pred             ccCCeEEeCCcceecCC-eeEecc
Confidence            34568999999986664 687653


Done!