Query         psy3697
Match_columns 117
No_of_seqs    162 out of 1315
Neff          8.1 
Searched_HMMs 46136
Date          Fri Aug 16 23:15:20 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3697.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3697hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1225|consensus               99.5 2.1E-14 4.6E-19  113.6   8.7   93    1-96    241-347 (525)
  2 KOG1225|consensus               99.3 7.9E-12 1.7E-16   99.2   8.7   83   19-104   242-324 (525)
  3 KOG1226|consensus               99.1 3.3E-10 7.3E-15   92.2   8.5   84    1-90    485-580 (783)
  4 KOG1226|consensus               98.7 7.2E-08 1.6E-12   78.9   7.2   71   29-102   553-633 (783)
  5 KOG4289|consensus               98.3 1.1E-06 2.4E-11   76.2   5.3   50   43-93   1223-1277(2531)
  6 KOG1219|consensus               98.3 1.9E-06 4.1E-11   77.4   6.1   76   31-107  3870-3958(4289)
  7 KOG4289|consensus               98.2 1.2E-06 2.5E-11   76.1   4.6   75    1-92   1229-1318(2531)
  8 PF07974 EGF_2:  EGF-like domai  98.1 2.6E-06 5.7E-11   43.4   2.9   24   32-55      7-32  (32)
  9 PF07974 EGF_2:  EGF-like domai  98.0 8.8E-06 1.9E-10   41.5   3.1   27   62-89      6-32  (32)
 10 KOG1219|consensus               97.9 2.4E-05 5.3E-10   70.7   5.3   71   19-90   3894-3977(4289)
 11 smart00051 DSL delta serrate l  97.7 5.3E-05 1.1E-09   44.5   3.0   44   43-89     18-63  (63)
 12 PF12661 hEGF:  Human growth fa  97.3 0.00014 3.1E-09   29.7   1.2   13   77-89      1-13  (13)
 13 smart00051 DSL delta serrate l  97.3 0.00034 7.3E-09   41.0   3.2   24   32-55     39-63  (63)
 14 PF00008 EGF:  EGF-like domain   96.9 0.00099 2.1E-08   33.7   2.2   25   62-87      4-31  (32)
 15 PF00008 EGF:  EGF-like domain   96.3  0.0023   5E-08   32.3   1.2   23   32-54      5-32  (32)
 16 smart00179 EGF_CA Calcium-bind  95.2   0.034 7.3E-07   28.3   3.2   21   69-89     15-38  (39)
 17 PF01414 DSL:  Delta serrate li  95.1  0.0047   1E-07   36.1  -0.4   46   42-89     17-63  (63)
 18 cd00055 EGF_Lam Laminin-type e  95.0   0.026 5.6E-07   31.2   2.4   25   70-96     13-37  (50)
 19 smart00179 EGF_CA Calcium-bind  94.9   0.058 1.3E-06   27.4   3.5   25   32-56     10-39  (39)
 20 PF00053 Laminin_EGF:  Laminin   94.8   0.022 4.7E-07   31.3   1.8   26   69-96     11-36  (49)
 21 cd00054 EGF_CA Calcium-binding  94.7    0.05 1.1E-06   27.2   3.0   13   77-89     25-37  (38)
 22 cd00053 EGF Epidermal growth f  94.3   0.089 1.9E-06   25.7   3.3   11   77-87     22-32  (36)
 23 PF01414 DSL:  Delta serrate li  94.3  0.0067 1.5E-07   35.5  -1.1   21   35-55     42-63  (63)
 24 cd00053 EGF Epidermal growth f  93.8   0.095 2.1E-06   25.6   2.9   24   32-55      7-35  (36)
 25 KOG4260|consensus               93.8   0.064 1.4E-06   40.2   3.0   42   46-90    132-182 (350)
 26 cd00054 EGF_CA Calcium-binding  93.8   0.081 1.7E-06   26.4   2.6   25   32-56     10-38  (38)
 27 KOG1217|consensus               93.6     0.3 6.4E-06   37.6   6.6   73   33-106   280-371 (487)
 28 KOG3607|consensus               93.5   0.097 2.1E-06   44.0   3.9   57   29-92    602-658 (716)
 29 smart00180 EGF_Lam Laminin-typ  93.2   0.081 1.8E-06   28.7   2.1   26   69-96     11-36  (46)
 30 KOG0994|consensus               93.2    0.23 4.9E-06   43.8   5.5   53   38-90   1078-1146(1758)
 31 cd00055 EGF_Lam Laminin-type e  93.0    0.11 2.4E-06   28.6   2.4   17   40-56     17-33  (50)
 32 KOG1218|consensus               91.9    0.82 1.8E-05   33.8   6.7   18   74-91    160-177 (316)
 33 smart00181 EGF Epidermal growt  91.2    0.29 6.2E-06   24.3   2.6   23   32-55      7-34  (35)
 34 KOG1218|consensus               91.0    0.92   2E-05   33.5   6.2   48   38-86    158-209 (316)
 35 KOG1217|consensus               90.7    0.72 1.6E-05   35.4   5.5   50   42-92    152-208 (487)
 36 PF01683 EB:  EB module;  Inter  90.7    0.37 8.1E-06   26.5   2.9   20   32-51     27-46  (52)
 37 PF12947 EGF_3:  EGF domain;  I  90.7    0.23 4.9E-06   25.7   1.9   24   63-87      7-32  (36)
 38 PHA02887 EGF-like protein; Pro  87.4    0.62 1.3E-05   30.6   2.5   27   63-91     93-123 (126)
 39 PF01683 EB:  EB module;  Inter  87.0     3.4 7.3E-05   22.5   5.2   39   40-85      8-46  (52)
 40 KOG4260|consensus               86.9    0.76 1.7E-05   34.6   3.1   27   31-57    150-183 (350)
 41 PF07645 EGF_CA:  Calcium-bindi  86.7     1.4 2.9E-05   23.2   3.3   23   62-85     10-34  (42)
 42 KOG1836|consensus               85.4     1.2 2.5E-05   41.1   4.0   59   32-91    732-813 (1705)
 43 KOG3607|consensus               85.3    0.87 1.9E-05   38.4   3.0   27   32-58    631-658 (716)
 44 KOG0994|consensus               85.2     2.2 4.8E-05   38.1   5.4   69    4-78    792-875 (1758)
 45 PHA02887 EGF-like protein; Pro  85.0     0.8 1.7E-05   30.1   2.1   26   32-58     93-124 (126)
 46 PF04863 EGF_alliinase:  Alliin  83.4    0.52 1.1E-05   26.7   0.7   14   77-90     37-50  (56)
 47 KOG1214|consensus               83.2     2.4 5.2E-05   36.7   4.7   45   42-87    809-859 (1289)
 48 PHA03099 epidermal growth fact  79.8     1.5 3.3E-05   29.2   2.0   17   42-58     67-83  (139)
 49 PF12955 DUF3844:  Domain of un  79.3     2.2 4.8E-05   27.3   2.6   10   83-92     53-62  (103)
 50 PF09064 Tme5_EGF_like:  Thromb  78.0     2.8   6E-05   21.4   2.2   27   55-87      3-29  (34)
 51 PHA03099 epidermal growth fact  77.9     2.1 4.5E-05   28.6   2.2   23   69-91     56-82  (139)
 52 KOG1836|consensus               59.5     7.6 0.00016   36.1   2.4   48   43-90    696-759 (1705)
 53 KOG3516|consensus               54.7      12 0.00026   33.6   2.7   27   32-58    962-993 (1306)
 54 PF12662 cEGF:  Complement Clr-  47.6      12 0.00026   17.5   1.0   10   77-86      3-12  (24)
 55 PF14670 FXa_inhibition:  Coagu  41.9      21 0.00045   18.2   1.5   18   70-87     11-30  (36)
 56 KOG3512|consensus               37.7      44 0.00095   27.3   3.3   22   69-90    407-428 (592)
 57 PF01826 TIL:  Trypsin Inhibito  28.0      57  0.0012   17.7   1.9   14   37-51     29-42  (55)
 58 PF00954 S_locus_glycop:  S-loc  25.7      96  0.0021   19.3   2.9   24   62-86     84-108 (110)
 59 KOG1214|consensus               24.3 1.8E+02  0.0039   25.9   4.9   44   42-86    716-767 (1289)
 60 PF05294 Toxin_5:  Scorpion sho  22.0   1E+02  0.0022   15.4   1.9   15   32-46     18-32  (32)

No 1  
>KOG1225|consensus
Probab=99.55  E-value=2.1e-14  Score=113.60  Aligned_cols=93  Identities=32%  Similarity=0.781  Sum_probs=82.1

Q ss_pred             CccCCCCCCCCCCCCCCC--------------CCCCCCCCCCCCCCCCCCcEEeCCcEEeCCCCCCCCCCCCCCCCCCCC
Q psy3697           1 MYTGEACDVPICPNNCSY--------------SSVNSCPSSKSDKPCSGQGVCIEGVCTCDAMYTGEACDVPICPNNCSY   66 (117)
Q Consensus         1 G~~G~~C~~~~C~~~c~~--------------~~~~~C~~~~~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~c~~~C~~   66 (117)
                      +|+|+.|+...|+.+|++              |.+.+|....++.+|++++.+++++|+|+++|+|..|++..|+.+|++
T Consensus       241 ~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g  320 (525)
T KOG1225|consen  241 GYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSG  320 (525)
T ss_pred             ceeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccCCccCCC
Confidence            688999998889999875              678888888888789999999999999999999999998889999999


Q ss_pred             CCCeEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697          67 SNGVCKHEFHRCECMDKYKEMKQLFDACDV   96 (117)
Q Consensus        67 ~~G~C~~~~g~C~C~~g~~G~~C~~~~C~~   96 (117)
                      + |.|.  .++|.|++||+|..|+.+.|..
T Consensus       321 ~-G~Ci--~G~C~C~~Gy~G~~C~~~~C~~  347 (525)
T KOG1225|consen  321 H-GKCI--DGECLCDEGYTGELCIQRACSG  347 (525)
T ss_pred             C-Cccc--CCceEeCCCCcCCcccccccCC
Confidence            8 9998  8999999999999999864443


No 2  
>KOG1225|consensus
Probab=99.32  E-value=7.9e-12  Score=99.17  Aligned_cols=83  Identities=30%  Similarity=0.688  Sum_probs=74.8

Q ss_pred             CCCCCCCCCCCCCCCCCCcEEeCCcEEeCCCCCCCCCCCCCCCCCCCCCCCeEecCCCeEeeCCCCcccCCCcCccCCCC
Q psy3697          19 SSVNSCPSSKSDKPCSGQGVCIEGVCTCDAMYTGEACDVPICPNNCSYSNGVCKHEFHRCECMDKYKEMKQLFDACDVSK   98 (117)
Q Consensus        19 ~~~~~C~~~~~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~G~~C~~~~C~~~~   98 (117)
                      |.++.|....++..|.++|.|+.|.|+|++||+|.+|+...|+.+|+.+ +.+.  .+.|+|+++|.|.+|+++.|+.+|
T Consensus       242 ~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~-g~~~--~g~CiC~~g~~G~dCs~~~cpadC  318 (525)
T KOG1225|consen  242 YFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGG-GVCV--DGECICNPGYSGKDCSIRRCPADC  318 (525)
T ss_pred             eeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCC-ceec--CCEeecCCCccccccccccCCccC
Confidence            5677787777788999999999999999999999999998899889987 8887  679999999999999998899999


Q ss_pred             CCCCCC
Q psy3697          99 NAHSVA  104 (117)
Q Consensus        99 ~~~g~~  104 (117)
                      .+||..
T Consensus       319 ~g~G~C  324 (525)
T KOG1225|consen  319 SGHGKC  324 (525)
T ss_pred             CCCCcc
Confidence            888853


No 3  
>KOG1226|consensus
Probab=99.11  E-value=3.3e-10  Score=92.22  Aligned_cols=84  Identities=26%  Similarity=0.655  Sum_probs=66.9

Q ss_pred             CccCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCcEEeCCcEEeCCCCC----CCCCCCCC--CC----CCCCCCC
Q psy3697           1 MYTGEACDVPICPNNCSYS--SVNSCPSSKSDKPCSGQGVCIEGVCTCDAMYT----GEACDVPI--CP----NNCSYSN   68 (117)
Q Consensus         1 G~~G~~C~~~~C~~~c~~~--~~~~C~~~~~~~~C~~~G~C~~g~C~C~~g~~----G~~C~~~~--c~----~~C~~~~   68 (117)
                      ||.|..|+   |+....+.  --+.|......++|+++|.|+.|+|+|++...    |.+|+.+.  |+    ..|+++ 
T Consensus       485 G~~G~~CE---C~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~-  560 (783)
T KOG1226|consen  485 GWLGKKCE---CSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGH-  560 (783)
T ss_pred             CCCCCccc---CCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeeeccCcccccccCcccCCC-
Confidence            78999995   44433222  13567776777899999999999999998877    99998653  54    349998 


Q ss_pred             CeEecCCCeEeeCCCCcccCCC
Q psy3697          69 GVCKHEFHRCECMDKYKEMKQL   90 (117)
Q Consensus        69 G~C~~~~g~C~C~~g~~G~~C~   90 (117)
                      |+|.  .|+|+|.+||+|..|+
T Consensus       561 G~C~--CG~CvC~~GwtG~~C~  580 (783)
T KOG1226|consen  561 GRCE--CGRCVCNPGWTGSACN  580 (783)
T ss_pred             CeEe--CCcEEcCCCCccCCCC
Confidence            9998  9999999999999997


No 4  
>KOG1226|consensus
Probab=98.66  E-value=7.2e-08  Score=78.89  Aligned_cols=71  Identities=32%  Similarity=0.753  Sum_probs=55.9

Q ss_pred             CCCCCCCCcEEeCCcEEeCCCCCCCCCCCCC----CC----CCCCCCCCeEecCCCeEeeCCC-CcccCCCc-CccCCCC
Q psy3697          29 SDKPCSGQGVCIEGVCTCDAMYTGEACDVPI----CP----NNCSYSNGVCKHEFHRCECMDK-YKEMKQLF-DACDVSK   98 (117)
Q Consensus        29 ~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~----c~----~~C~~~~G~C~~~~g~C~C~~g-~~G~~C~~-~~C~~~~   98 (117)
                      ....|++||+|..|+|+|.+||+|..|+.+.    |.    .-|+++ |+|.  .|+|+|... |.|+.|+. +.|+..|
T Consensus       553 ~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGr-G~C~--Cg~C~C~~~~~sG~~CE~cptc~~~C  629 (783)
T KOG1226|consen  553 KGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGR-GTCE--CGRCKCTDPPYSGEFCEKCPTCPDPC  629 (783)
T ss_pred             cCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCCC-ceee--CCceEcCCCCcCcchhhcCCCCCCcc
Confidence            4568999999999999999999999999753    32    247886 9998  899999776 99999985 3455544


Q ss_pred             CCCC
Q psy3697          99 NAHS  102 (117)
Q Consensus        99 ~~~g  102 (117)
                      ..+.
T Consensus       630 ~~~~  633 (783)
T KOG1226|consen  630 AENK  633 (783)
T ss_pred             cccc
Confidence            4433


No 5  
>KOG4289|consensus
Probab=98.29  E-value=1.1e-06  Score=76.19  Aligned_cols=50  Identities=28%  Similarity=0.730  Sum_probs=42.8

Q ss_pred             cEEeCCCCCCCCCCCCC--C-CCCCCCCCCeEecCCC--eEeeCCCCcccCCCcCc
Q psy3697          43 VCTCDAMYTGEACDVPI--C-PNNCSYSNGVCKHEFH--RCECMDKYKEMKQLFDA   93 (117)
Q Consensus        43 ~C~C~~g~~G~~C~~~~--c-~~~C~~~~G~C~~~~g--~C~C~~g~~G~~C~~~~   93 (117)
                      .|.|++||+|++|+..+  | ..+|.++ |+|...+|  +|+|.++|+|.+|++..
T Consensus      1223 rCrCPpGFTgd~CeTeiDlCYs~pC~nn-g~C~srEggYtCeCrpg~tGehCEvs~ 1277 (2531)
T KOG4289|consen 1223 RCRCPPGFTGDYCETEIDLCYSGPCGNN-GRCRSREGGYTCECRPGFTGEHCEVSA 1277 (2531)
T ss_pred             eEeCCCCCCcccccchhHhhhcCCCCCC-CceEEecCceeEEecCCccccceeeec
Confidence            59999999999999754  5 3689997 99987666  99999999999998643


No 6  
>KOG1219|consensus
Probab=98.26  E-value=1.9e-06  Score=77.36  Aligned_cols=76  Identities=26%  Similarity=0.738  Sum_probs=62.5

Q ss_pred             CCCCCCcEEeC-----CcEEeCCCCCCCCCCCCC--C-CCCCCCCCCeEecCCC--eEeeCCCCcccCCCcCc---cCCC
Q psy3697          31 KPCSGQGVCIE-----GVCTCDAMYTGEACDVPI--C-PNNCSYSNGVCKHEFH--RCECMDKYKEMKQLFDA---CDVS   97 (117)
Q Consensus        31 ~~C~~~G~C~~-----g~C~C~~g~~G~~C~~~~--c-~~~C~~~~G~C~~~~g--~C~C~~g~~G~~C~~~~---C~~~   97 (117)
                      .+|++.|+|..     ++|.|++-|.|..|++..  | +++|..+ |+|....+  .|.|+.||+|..|+...   |...
T Consensus      3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~G-gtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n 3948 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTG-GTCIPFYNGFLCNCPNGYTGKRCEARGISECSKN 3948 (4289)
T ss_pred             CcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCC-CEEEecCCCeeEeCCCCccCceeecccccccccc
Confidence            38999999953     579999999999999753  5 3789886 99987654  89999999999998653   8887


Q ss_pred             CCCCCCCCCC
Q psy3697          98 KNAHSVACPS  107 (117)
Q Consensus        98 ~~~~g~~c~~  107 (117)
                      .+.+|..|.+
T Consensus      3949 ~C~~gg~C~n 3958 (4289)
T KOG1219|consen 3949 VCGTGGQCIN 3958 (4289)
T ss_pred             cccCCceeec
Confidence            7777777754


No 7  
>KOG4289|consensus
Probab=98.25  E-value=1.2e-06  Score=76.09  Aligned_cols=75  Identities=29%  Similarity=0.829  Sum_probs=56.8

Q ss_pred             CccCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCcEEeC----CcEEeCCCCCCCCCCCCC----C-CCCCCCCCC
Q psy3697           1 MYTGEACDVP--ICPNNCSYSSVNSCPSSKSDKPCSGQGVCIE----GVCTCDAMYTGEACDVPI----C-PNNCSYSNG   69 (117)
Q Consensus         1 G~~G~~C~~~--~C~~~c~~~~~~~C~~~~~~~~C~~~G~C~~----g~C~C~~g~~G~~C~~~~----c-~~~C~~~~G   69 (117)
                      ||+|..|+..  .|..+                +|.++|+|+.    ++|.|.++|+|..|+...    | +..|.+. |
T Consensus      1229 GFTgd~CeTeiDlCYs~----------------pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~ng-g 1291 (2531)
T KOG4289|consen 1229 GFTGDYCETEIDLCYSG----------------PCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNG-G 1291 (2531)
T ss_pred             CCCcccccchhHhhhcC----------------CCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCC-C
Confidence            6888888654  35544                8999999954    579999999999999753    4 4568886 9


Q ss_pred             eEecCCC---eEeeCCC-CcccCCCcC
Q psy3697          70 VCKHEFH---RCECMDK-YKEMKQLFD   92 (117)
Q Consensus        70 ~C~~~~g---~C~C~~g-~~G~~C~~~   92 (117)
                      +|.....   .|.|++| |.++.|+..
T Consensus      1292 tC~~~~nggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1292 TCVNLLNGGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred             EEeecCCCceeccCCCcccCCCceEEE
Confidence            9975432   7899875 677888753


No 8  
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=98.14  E-value=2.6e-06  Score=43.45  Aligned_cols=24  Identities=54%  Similarity=1.312  Sum_probs=20.7

Q ss_pred             CCCCCcEEe--CCcEEeCCCCCCCCC
Q psy3697          32 PCSGQGVCI--EGVCTCDAMYTGEAC   55 (117)
Q Consensus        32 ~C~~~G~C~--~g~C~C~~g~~G~~C   55 (117)
                      .|+++|+|+  .++|+|+++|+|++|
T Consensus         7 ~C~~~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence            688999997  688999999998876


No 9  
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.99  E-value=8.8e-06  Score=41.48  Aligned_cols=27  Identities=33%  Similarity=0.605  Sum_probs=23.5

Q ss_pred             CCCCCCCCeEecCCCeEeeCCCCcccCC
Q psy3697          62 NNCSYSNGVCKHEFHRCECMDKYKEMKQ   89 (117)
Q Consensus        62 ~~C~~~~G~C~~~~g~C~C~~g~~G~~C   89 (117)
                      ..|+++ |+|+...++|+|++||+|++|
T Consensus         6 ~~C~~~-G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGH-GTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCC-CEEeCCCCEEECCCCCcCCCC
Confidence            358897 999876689999999999987


No 10 
>KOG1219|consensus
Probab=97.85  E-value=2.4e-05  Score=70.66  Aligned_cols=71  Identities=30%  Similarity=0.624  Sum_probs=55.4

Q ss_pred             CCCCCCCCCCC---CCCCCCCcEEeC----CcEEeCCCCCCCCCCCCC---CC-CCCCCCCCeEecCCC--eEeeCCCCc
Q psy3697          19 SSVNSCPSSKS---DKPCSGQGVCIE----GVCTCDAMYTGEACDVPI---CP-NNCSYSNGVCKHEFH--RCECMDKYK   85 (117)
Q Consensus        19 ~~~~~C~~~~~---~~~C~~~G~C~~----g~C~C~~g~~G~~C~~~~---c~-~~C~~~~G~C~~~~g--~C~C~~g~~   85 (117)
                      |.|..|.....   ..+|..+|+|+.    ..|.|+.+|+|..|+..-   |. +.|..+ |.|....|  .|.|.+||.
T Consensus      3894 ysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~g-g~C~n~~gsf~CncT~g~~ 3972 (4289)
T KOG1219|consen 3894 YSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTG-GQCINIPGSFHCNCTPGIL 3972 (4289)
T ss_pred             ccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCC-ceeeccCCceEeccChhHh
Confidence            44555543322   348999999954    469999999999999762   65 678886 99998888  999999999


Q ss_pred             ccCCC
Q psy3697          86 EMKQL   90 (117)
Q Consensus        86 G~~C~   90 (117)
                      |..|.
T Consensus      3973 gr~c~ 3977 (4289)
T KOG1219|consen 3973 GRTCC 3977 (4289)
T ss_pred             cccCc
Confidence            99984


No 11 
>smart00051 DSL delta serrate ligand.
Probab=97.65  E-value=5.3e-05  Score=44.45  Aligned_cols=44  Identities=23%  Similarity=0.440  Sum_probs=34.6

Q ss_pred             cEEeCCCCCCCCCCCCCCC--CCCCCCCCeEecCCCeEeeCCCCcccCC
Q psy3697          43 VCTCDAMYTGEACDVPICP--NNCSYSNGVCKHEFHRCECMDKYKEMKQ   89 (117)
Q Consensus        43 ~C~C~~g~~G~~C~~~~c~--~~C~~~~G~C~~~~g~C~C~~g~~G~~C   89 (117)
                      .=.|+++|.|..|+.. |.  ++..++ .+|+. .|.++|.+||+|++|
T Consensus        18 rv~C~~~~yG~~C~~~-C~~~~d~~~~-~~Cd~-~G~~~C~~Gw~G~~C   63 (63)
T smart00051       18 RVTCDENYYGEGCNKF-CRPRDDFFGH-YTCDE-NGNKGCLEGWMGPYC   63 (63)
T ss_pred             EeeCCCCCcCCccCCE-eCcCccccCC-ccCCc-CCCEecCCCCcCCCC
Confidence            3468899999999853 42  345665 88975 789999999999987


No 12 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.30  E-value=0.00014  Score=29.66  Aligned_cols=13  Identities=15%  Similarity=0.506  Sum_probs=7.9

Q ss_pred             eEeeCCCCcccCC
Q psy3697          77 RCECMDKYKEMKQ   89 (117)
Q Consensus        77 ~C~C~~g~~G~~C   89 (117)
                      +|+|++||+|++|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            3667777777665


No 13 
>smart00051 DSL delta serrate ligand.
Probab=97.28  E-value=0.00034  Score=41.02  Aligned_cols=24  Identities=25%  Similarity=0.454  Sum_probs=19.7

Q ss_pred             CCCCCcEE-eCCcEEeCCCCCCCCC
Q psy3697          32 PCSGQGVC-IEGVCTCDAMYTGEAC   55 (117)
Q Consensus        32 ~C~~~G~C-~~g~C~C~~g~~G~~C   55 (117)
                      +..++.+| ..|.++|.+||+|..|
T Consensus        39 d~~~~~~Cd~~G~~~C~~Gw~G~~C   63 (63)
T smart00051       39 DFFGHYTCDENGNKGCLEGWMGPYC   63 (63)
T ss_pred             cccCCccCCcCCCEecCCCCcCCCC
Confidence            45677888 5689999999999876


No 14 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.87  E-value=0.00099  Score=33.69  Aligned_cols=25  Identities=32%  Similarity=0.665  Sum_probs=15.3

Q ss_pred             CCCCCCCCeEecCC-C--eEeeCCCCccc
Q psy3697          62 NNCSYSNGVCKHEF-H--RCECMDKYKEM   87 (117)
Q Consensus        62 ~~C~~~~G~C~~~~-g--~C~C~~g~~G~   87 (117)
                      ++|.++ |+|.... +  +|+|++||+|+
T Consensus         4 ~~C~n~-g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    4 NPCQNG-GTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TSSTTT-EEEEEESTSEEEEEEBTTEEST
T ss_pred             CcCCCC-eEEEeCCCCCEEeECCCCCccC
Confidence            356665 6775544 2  67777777765


No 15 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.28  E-value=0.0023  Score=32.31  Aligned_cols=23  Identities=43%  Similarity=1.100  Sum_probs=19.4

Q ss_pred             CCCCCcEEeC-----CcEEeCCCCCCCC
Q psy3697          32 PCSGQGVCIE-----GVCTCDAMYTGEA   54 (117)
Q Consensus        32 ~C~~~G~C~~-----g~C~C~~g~~G~~   54 (117)
                      +|.++|+|+.     ..|+|++||+|..
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            8999999954     3699999999963


No 16 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.22  E-value=0.034  Score=28.28  Aligned_cols=21  Identities=33%  Similarity=0.748  Sum_probs=10.4

Q ss_pred             CeEecCCC--eEeeCCCCc-ccCC
Q psy3697          69 GVCKHEFH--RCECMDKYK-EMKQ   89 (117)
Q Consensus        69 G~C~~~~g--~C~C~~g~~-G~~C   89 (117)
                      |+|....+  .|.|.+||. |..|
T Consensus        15 ~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179       15 GTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             CEeECCCCCeEeECCCCCccCCcC
Confidence            45543332  455555555 5444


No 17 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=95.10  E-value=0.0047  Score=36.13  Aligned_cols=46  Identities=17%  Similarity=0.264  Sum_probs=24.0

Q ss_pred             CcEEeCCCCCCCCCCCCCCCC-CCCCCCCeEecCCCeEeeCCCCcccCC
Q psy3697          42 GVCTCDAMYTGEACDVPICPN-NCSYSNGVCKHEFHRCECMDKYKEMKQ   89 (117)
Q Consensus        42 g~C~C~~g~~G~~C~~~~c~~-~C~~~~G~C~~~~g~C~C~~g~~G~~C   89 (117)
                      .+-+|.+.|.|..|+...-+. .-..+ -+|+ ..|.=+|.+||+|++|
T Consensus        17 ~rv~C~~nyyG~~C~~~C~~~~d~~gh-y~Cd-~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYYGPNCSKFCKPRDDSFGH-YTCD-SNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEETTTT-EE---EEETTEE-EEE--SS--EEE-TTEESTTS
T ss_pred             EEEECCCCCCCccccCCcCCCcCCcCC-cccC-CCCCCCCCCCCcCCCC
Confidence            456789999999998642121 12233 5787 4788899999999987


No 18 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=94.97  E-value=0.026  Score=31.21  Aligned_cols=25  Identities=20%  Similarity=0.436  Sum_probs=20.1

Q ss_pred             eEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697          70 VCKHEFHRCECMDKYKEMKQLFDACDV   96 (117)
Q Consensus        70 ~C~~~~g~C~C~~g~~G~~C~~~~C~~   96 (117)
                      .|+...|+|.|.++|+|..|+  .|..
T Consensus        13 ~C~~~~G~C~C~~~~~G~~C~--~C~~   37 (50)
T cd00055          13 QCDPGTGQCECKPNTTGRRCD--RCAP   37 (50)
T ss_pred             cccCCCCEEeCCCcCCCCCCC--CCCC
Confidence            577678899999999999997  3543


No 19 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=94.86  E-value=0.058  Score=27.37  Aligned_cols=25  Identities=40%  Similarity=1.132  Sum_probs=20.2

Q ss_pred             CCCCCcEEeC----CcEEeCCCCC-CCCCC
Q psy3697          32 PCSGQGVCIE----GVCTCDAMYT-GEACD   56 (117)
Q Consensus        32 ~C~~~G~C~~----g~C~C~~g~~-G~~C~   56 (117)
                      +|.++++|+.    ..|.|+++|. |..|+
T Consensus        10 ~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179       10 PCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             CcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            6888889964    3599999999 88774


No 20 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=94.83  E-value=0.022  Score=31.25  Aligned_cols=26  Identities=15%  Similarity=0.319  Sum_probs=20.6

Q ss_pred             CeEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697          69 GVCKHEFHRCECMDKYKEMKQLFDACDV   96 (117)
Q Consensus        69 G~C~~~~g~C~C~~g~~G~~C~~~~C~~   96 (117)
                      .+|+...++|.|.++|+|+.|+.  |..
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~C~~--C~~   36 (49)
T PF00053_consen   11 QTCDPSTGQCVCKPGTTGPRCDQ--CKP   36 (49)
T ss_dssp             SSEEETCEEESBSTTEESTTS-E--E-T
T ss_pred             CcccCCCCEEeccccccCCcCcC--CCC
Confidence            58988889999999999999983  543


No 21 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=94.74  E-value=0.05  Score=27.17  Aligned_cols=13  Identities=31%  Similarity=0.705  Sum_probs=6.6

Q ss_pred             eEeeCCCCcccCC
Q psy3697          77 RCECMDKYKEMKQ   89 (117)
Q Consensus        77 ~C~C~~g~~G~~C   89 (117)
                      .|.|.++|.|..|
T Consensus        25 ~C~C~~g~~g~~C   37 (38)
T cd00054          25 RCSCPPGYTGRNC   37 (38)
T ss_pred             EeECCCCCcCCcC
Confidence            4555555555443


No 22 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=94.30  E-value=0.089  Score=25.75  Aligned_cols=11  Identities=36%  Similarity=0.778  Sum_probs=6.1

Q ss_pred             eEeeCCCCccc
Q psy3697          77 RCECMDKYKEM   87 (117)
Q Consensus        77 ~C~C~~g~~G~   87 (117)
                      +|.|+.||.|.
T Consensus        22 ~C~C~~g~~g~   32 (36)
T cd00053          22 RCVCPPGYTGD   32 (36)
T ss_pred             EeECCCCCccc
Confidence            55555555554


No 23 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=94.29  E-value=0.0067  Score=35.49  Aligned_cols=21  Identities=33%  Similarity=0.702  Sum_probs=14.7

Q ss_pred             CCcEE-eCCcEEeCCCCCCCCC
Q psy3697          35 GQGVC-IEGVCTCDAMYTGEAC   55 (117)
Q Consensus        35 ~~G~C-~~g~C~C~~g~~G~~C   55 (117)
                      +|-+| .+|+-+|.+||+|+.|
T Consensus        42 ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   42 GHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             EEEEE-SS--EEE-TTEESTTS
T ss_pred             CCcccCCCCCCCCCCCCcCCCC
Confidence            45678 5689999999999986


No 24 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=93.84  E-value=0.095  Score=25.64  Aligned_cols=24  Identities=46%  Similarity=1.222  Sum_probs=19.5

Q ss_pred             CCCCCcEEeC----CcEEeCCCCCCC-CC
Q psy3697          32 PCSGQGVCIE----GVCTCDAMYTGE-AC   55 (117)
Q Consensus        32 ~C~~~G~C~~----g~C~C~~g~~G~-~C   55 (117)
                      .|.+++.|+.    ..|.|+.+|.|. .|
T Consensus         7 ~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            6888888854    479999999998 55


No 25 
>KOG4260|consensus
Probab=93.82  E-value=0.064  Score=40.16  Aligned_cols=42  Identities=26%  Similarity=0.574  Sum_probs=31.8

Q ss_pred             eCCCCCCCCCCCCCCC----CCCCCCCCeEec-----CCCeEeeCCCCcccCCC
Q psy3697          46 CDAMYTGEACDVPICP----NNCSYSNGVCKH-----EFHRCECMDKYKEMKQL   90 (117)
Q Consensus        46 C~~g~~G~~C~~~~c~----~~C~~~~G~C~~-----~~g~C~C~~g~~G~~C~   90 (117)
                      |++|-.|++|..  |+    .+|.+. |.|+.     ..|.|.|.+||+|+.|.
T Consensus       132 Cp~gtyGpdCl~--Cpggser~C~Gn-G~C~GdGsR~GsGkCkC~~GY~Gp~C~  182 (350)
T KOG4260|consen  132 CPDGTYGPDCLQ--CPGGSERPCFGN-GSCHGDGSREGSGKCKCETGYTGPLCR  182 (350)
T ss_pred             cCCCCcCCcccc--CCCCCcCCcCCC-CcccCCCCCCCCCcccccCCCCCcccc
Confidence            678899999874  43    457776 78853     23589999999999885


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=93.80  E-value=0.081  Score=26.38  Aligned_cols=25  Identities=40%  Similarity=1.153  Sum_probs=19.9

Q ss_pred             CCCCCcEEeC----CcEEeCCCCCCCCCC
Q psy3697          32 PCSGQGVCIE----GVCTCDAMYTGEACD   56 (117)
Q Consensus        32 ~C~~~G~C~~----g~C~C~~g~~G~~C~   56 (117)
                      .|.+++.|+.    ..|.|.++|.|..|+
T Consensus        10 ~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054          10 PCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             CcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            6777888854    369999999998774


No 27 
>KOG1217|consensus
Probab=93.64  E-value=0.3  Score=37.56  Aligned_cols=73  Identities=26%  Similarity=0.649  Sum_probs=48.5

Q ss_pred             CCCCcEEeC----CcEEeCCCCCCCCC-CC---CCCC-----CCCCCCCCeEecC----CCeEeeCCCCcccCCCcC--c
Q psy3697          33 CSGQGVCIE----GVCTCDAMYTGEAC-DV---PICP-----NNCSYSNGVCKHE----FHRCECMDKYKEMKQLFD--A   93 (117)
Q Consensus        33 C~~~G~C~~----g~C~C~~g~~G~~C-~~---~~c~-----~~C~~~~G~C~~~----~g~C~C~~g~~G~~C~~~--~   93 (117)
                      |.++++|+.    ..|.|+++|+|..| ..   ..|.     ..|.+. +.|...    ...|.|.++|.|..|+..  .
T Consensus       280 c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g-~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~  358 (487)
T KOG1217|consen  280 CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANG-GTCNTLGSFGGFRCACGPGFTGRRCEDSNDE  358 (487)
T ss_pred             cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCC-cccccCCCCCCCCcCCCCCCCCCccccCCcc
Confidence            788899954    57999999999998 21   2232     235554 677321    126999999999999866  5


Q ss_pred             cCCCCCCCCCCCC
Q psy3697          94 CDVSKNAHSVACP  106 (117)
Q Consensus        94 C~~~~~~~g~~c~  106 (117)
                      |......++..|.
T Consensus       359 C~~~~~~~~~~c~  371 (487)
T KOG1217|consen  359 CASSPCCPGGTCV  371 (487)
T ss_pred             ccCCccccCCEec
Confidence            7665334444343


No 28 
>KOG3607|consensus
Probab=93.52  E-value=0.097  Score=43.96  Aligned_cols=57  Identities=23%  Similarity=0.611  Sum_probs=42.7

Q ss_pred             CCCCCCCCcEEeCCcEEeCCCCCCCCCCCCCCCCCCCCCCCeEecCCCeEeeCCCCcccCCCcC
Q psy3697          29 SDKPCSGQGVCIEGVCTCDAMYTGEACDVPICPNNCSYSNGVCKHEFHRCECMDKYKEMKQLFD   92 (117)
Q Consensus        29 ~~~~C~~~G~C~~g~C~C~~g~~G~~C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~G~~C~~~   92 (117)
                      .+..|...-.|++.+|+=.. ..+..+    |+..|+++ |.|+. ...|.|.+||.+++|++.
T Consensus       602 dGt~Cg~~~vC~~~~C~~~~-v~~~~~----~~~~C~g~-GVCnn-~~~ChC~~gwapp~C~~~  658 (716)
T KOG3607|consen  602 DGTSCGPGMICINHRCLSAS-VLNSSC----CPTTCNGH-GVCNN-ELNCHCEPGWAPPFCFIF  658 (716)
T ss_pred             CCCccCCCceecCCcchhhh-hhcccc----cccccCCC-cccCC-CcceeeCCCCCCCccccc
Confidence            34567777778888886555 555443    44569998 99974 669999999999999853


No 29 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=93.20  E-value=0.081  Score=28.73  Aligned_cols=26  Identities=23%  Similarity=0.577  Sum_probs=20.6

Q ss_pred             CeEecCCCeEeeCCCCcccCCCcCccCC
Q psy3697          69 GVCKHEFHRCECMDKYKEMKQLFDACDV   96 (117)
Q Consensus        69 G~C~~~~g~C~C~~g~~G~~C~~~~C~~   96 (117)
                      ..|+...|+|.|.++|+|.+|+  .|..
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~C~--~C~~   36 (46)
T smart00180       11 GTCDPDTGQCECKPNVTGRRCD--RCAP   36 (46)
T ss_pred             CcccCCCCEEECCCCCCCCCCC--cCCC
Confidence            4677667899999999999997  3543


No 30 
>KOG0994|consensus
Probab=93.16  E-value=0.23  Score=43.84  Aligned_cols=53  Identities=23%  Similarity=0.584  Sum_probs=39.2

Q ss_pred             EE--eCCcEEeCCCCCCCCCCCCC----------CC-CCCCCC---CCeEecCCCeEeeCCCCcccCCC
Q psy3697          38 VC--IEGVCTCDAMYTGEACDVPI----------CP-NNCSYS---NGVCKHEFHRCECMDKYKEMKQL   90 (117)
Q Consensus        38 ~C--~~g~C~C~~g~~G~~C~~~~----------c~-~~C~~~---~G~C~~~~g~C~C~~g~~G~~C~   90 (117)
                      .|  -+|+|+|.+||.|..|++-.          |- -.|...   .-.|+..+|+|+|.+|-.|+.|.
T Consensus      1078 qCN~ftGQCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1078 QCNEFTGQCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred             cccccccceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchh
Confidence            56  46899999999999997521          10 123332   13688888999999999999997


No 31 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=92.97  E-value=0.11  Score=28.60  Aligned_cols=17  Identities=41%  Similarity=0.886  Sum_probs=15.5

Q ss_pred             eCCcEEeCCCCCCCCCC
Q psy3697          40 IEGVCTCDAMYTGEACD   56 (117)
Q Consensus        40 ~~g~C~C~~g~~G~~C~   56 (117)
                      .+|+|.|.++|+|..|+
T Consensus        17 ~~G~C~C~~~~~G~~C~   33 (50)
T cd00055          17 GTGQCECKPNTTGRRCD   33 (50)
T ss_pred             CCCEEeCCCcCCCCCCC
Confidence            46899999999999998


No 32 
>KOG1218|consensus
Probab=91.91  E-value=0.82  Score=33.78  Aligned_cols=18  Identities=11%  Similarity=0.202  Sum_probs=11.4

Q ss_pred             CCCeEeeCCCCcccCCCc
Q psy3697          74 EFHRCECMDKYKEMKQLF   91 (117)
Q Consensus        74 ~~g~C~C~~g~~G~~C~~   91 (117)
                      ..+.|.|.+||.|..+..
T Consensus       160 ~~~~c~c~~g~~g~~~~~  177 (316)
T KOG1218|consen  160 KNGICTCQPGFVGVFCVE  177 (316)
T ss_pred             CCCceeccCCcccccccc
Confidence            355666777777766653


No 33 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=91.16  E-value=0.29  Score=24.27  Aligned_cols=23  Identities=48%  Similarity=1.197  Sum_probs=14.6

Q ss_pred             CCCCCcEEeC----CcEEeCCCCCC-CCC
Q psy3697          32 PCSGQGVCIE----GVCTCDAMYTG-EAC   55 (117)
Q Consensus        32 ~C~~~G~C~~----g~C~C~~g~~G-~~C   55 (117)
                      .|.++ +|+.    ..|.|++||.| ..|
T Consensus         7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        7 PCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            56655 6643    35777777777 554


No 34 
>KOG1218|consensus
Probab=91.04  E-value=0.92  Score=33.51  Aligned_cols=48  Identities=25%  Similarity=0.688  Sum_probs=35.3

Q ss_pred             EEeCCcEEeCCCCCCCCCCCCC--CC--CCCCCCCCeEecCCCeEeeCCCCcc
Q psy3697          38 VCIEGVCTCDAMYTGEACDVPI--CP--NNCSYSNGVCKHEFHRCECMDKYKE   86 (117)
Q Consensus        38 ~C~~g~C~C~~g~~G~~C~~~~--c~--~~C~~~~G~C~~~~g~C~C~~g~~G   86 (117)
                      .+..+.|.|.+||.|..+....  |.  ..+.+. +.|+...+.+.+.+.|.+
T Consensus       158 ~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g-~~C~~~~~~~~~~~~~~~  209 (316)
T KOG1218|consen  158 DCKNGICTCQPGFVGVFCVESCSGCSPLTACENG-AKCNRSTGSCLCYPGPSG  209 (316)
T ss_pred             CCCCCceeccCCcccccccccCCCcCCCcccCCC-CeeeccccccccCCCCcc
Confidence            3466889999999999998653  33  234553 689877778888777754


No 35 
>KOG1217|consensus
Probab=90.71  E-value=0.72  Score=35.44  Aligned_cols=50  Identities=26%  Similarity=0.590  Sum_probs=38.7

Q ss_pred             CcEEeCCCCCCCCCCCC--CCC---CCCCCCCCeEecCCC--eEeeCCCCcccCCCcC
Q psy3697          42 GVCTCDAMYTGEACDVP--ICP---NNCSYSNGVCKHEFH--RCECMDKYKEMKQLFD   92 (117)
Q Consensus        42 g~C~C~~g~~G~~C~~~--~c~---~~C~~~~G~C~~~~g--~C~C~~g~~G~~C~~~   92 (117)
                      ..|.|..+|.+..++..  .|.   ..|.+. +.|....+  .|.|.++|.+..++..
T Consensus       152 ~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~-~~C~~~~~~~~C~c~~~~~~~~~~~~  208 (487)
T KOG1217|consen  152 FRCSCTEGYEGEPCETDLDECIQYSSPCQNG-GTCVNTGGSYLCSCPPGYTGSTCETT  208 (487)
T ss_pred             eeeeeCCCcccccccccccccccCCCCcCCC-cccccCCCCeeEeCCCCccCCcCcCC
Confidence            45899999999999864  454   347775 88876544  7999999999998754


No 36 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=90.71  E-value=0.37  Score=26.49  Aligned_cols=20  Identities=40%  Similarity=1.097  Sum_probs=17.1

Q ss_pred             CCCCCcEEeCCcEEeCCCCC
Q psy3697          32 PCSGQGVCIEGVCTCDAMYT   51 (117)
Q Consensus        32 ~C~~~G~C~~g~C~C~~g~~   51 (117)
                      .|..+..|+.+.|+|+++|.
T Consensus        27 qC~~~s~C~~g~C~C~~g~~   46 (52)
T PF01683_consen   27 QCIGGSVCVNGRCQCPPGYV   46 (52)
T ss_pred             CCCCcCEEcCCEeECCCCCE
Confidence            67788889999999999875


No 37 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=90.69  E-value=0.23  Score=25.68  Aligned_cols=24  Identities=21%  Similarity=0.567  Sum_probs=14.1

Q ss_pred             CCCCCCCeEecCCC--eEeeCCCCccc
Q psy3697          63 NCSYSNGVCKHEFH--RCECMDKYKEM   87 (117)
Q Consensus        63 ~C~~~~G~C~~~~g--~C~C~~g~~G~   87 (117)
                      .|..+ ++|....+  +|+|.+||.|+
T Consensus         7 ~C~~n-A~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    7 GCHPN-ATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GS-TT-CEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCC-cEeecCCCCEEeECCCCCccC
Confidence            35554 67765444  78888888775


No 38 
>PHA02887 EGF-like protein; Provisional
Probab=87.40  E-value=0.62  Score=30.56  Aligned_cols=27  Identities=22%  Similarity=0.479  Sum_probs=20.0

Q ss_pred             CCCCCCCeEecCC----CeEeeCCCCcccCCCc
Q psy3697          63 NCSYSNGVCKHEF----HRCECMDKYKEMKQLF   91 (117)
Q Consensus        63 ~C~~~~G~C~~~~----g~C~C~~g~~G~~C~~   91 (117)
                      .|- + |+|.-..    -.|.|+.||+|..|+.
T Consensus        93 YCi-H-G~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         93 FCI-N-GECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             Eee-C-CEEEccccCCCceeECCCCcccCCCCc
Confidence            455 3 7885433    3899999999999974


No 39 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=86.96  E-value=3.4  Score=22.55  Aligned_cols=39  Identities=33%  Similarity=0.769  Sum_probs=25.8

Q ss_pred             eCCcEEeCCCCCCCCCCCCCCCCCCCCCCCeEecCCCeEeeCCCCc
Q psy3697          40 IEGVCTCDAMYTGEACDVPICPNNCSYSNGVCKHEFHRCECMDKYK   85 (117)
Q Consensus        40 ~~g~C~C~~g~~G~~C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~   85 (117)
                      +.++|+=. ...|..|+..   ..|..+ ..|.  .++|.|++||+
T Consensus         8 ~~~~C~~~-~~~g~~C~~~---~qC~~~-s~C~--~g~C~C~~g~~   46 (52)
T PF01683_consen    8 INGQCVPR-VQPGESCESD---EQCIGG-SVCV--NGRCQCPPGYV   46 (52)
T ss_pred             ECCEECcc-CCCCCCCCCc---CCCCCc-CEEc--CCEeECCCCCE
Confidence            34445433 4556678753   356665 8996  78999999985


No 40 
>KOG4260|consensus
Probab=86.91  E-value=0.76  Score=34.57  Aligned_cols=27  Identities=44%  Similarity=1.165  Sum_probs=22.2

Q ss_pred             CCCCCCcEEe-------CCcEEeCCCCCCCCCCC
Q psy3697          31 KPCSGQGVCI-------EGVCTCDAMYTGEACDV   57 (117)
Q Consensus        31 ~~C~~~G~C~-------~g~C~C~~g~~G~~C~~   57 (117)
                      .+|.++|.|.       +|.|.|.+||+|+.|..
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~  183 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY  183 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCCccccc
Confidence            4688889883       26899999999999864


No 41 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=86.66  E-value=1.4  Score=23.19  Aligned_cols=23  Identities=30%  Similarity=0.798  Sum_probs=16.7

Q ss_pred             CCCCCCCCeEecCCC--eEeeCCCCc
Q psy3697          62 NNCSYSNGVCKHEFH--RCECMDKYK   85 (117)
Q Consensus        62 ~~C~~~~G~C~~~~g--~C~C~~g~~   85 (117)
                      ..|... ++|....|  .|.|++||.
T Consensus        10 ~~C~~~-~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen   10 HNCPEN-GTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSSSTT-SEEEEETTEEEEEESTTEE
T ss_pred             CcCCCC-CEEEcCCCCEEeeCCCCcE
Confidence            456655 78877666  788888887


No 42 
>KOG1836|consensus
Probab=85.40  E-value=1.2  Score=41.08  Aligned_cols=59  Identities=25%  Similarity=0.656  Sum_probs=37.3

Q ss_pred             CCCCC-cEE--eCCcEEeCCCCCCCCCCCCC--------------CC-CCCCCCCCeEe----cCCCeEe-eCCCCcccC
Q psy3697          32 PCSGQ-GVC--IEGVCTCDAMYTGEACDVPI--------------CP-NNCSYSNGVCK----HEFHRCE-CMDKYKEMK   88 (117)
Q Consensus        32 ~C~~~-G~C--~~g~C~C~~g~~G~~C~~~~--------------c~-~~C~~~~G~C~----~~~g~C~-C~~g~~G~~   88 (117)
                      +|.+| ..|  .+|.|.|...-.|..|+.-.              |. -+|.+. +.|.    .....|. |+++|+|..
T Consensus       732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~-~~~~~~~~~~~~iCk~Cp~gytG~r  810 (1705)
T KOG1836|consen  732 DCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNG-GACGQTPEILEVVCKNCPPGYTGLR  810 (1705)
T ss_pred             ccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCCCccCCCCCC-hhhcCcCcccceecCCCCCCCcccc
Confidence            55554 557  45788888777777776410              11 124443 4443    2345898 999999999


Q ss_pred             CCc
Q psy3697          89 QLF   91 (117)
Q Consensus        89 C~~   91 (117)
                      |+.
T Consensus       811 Ce~  813 (1705)
T KOG1836|consen  811 CEE  813 (1705)
T ss_pred             ccc
Confidence            974


No 43 
>KOG3607|consensus
Probab=85.29  E-value=0.87  Score=38.44  Aligned_cols=27  Identities=33%  Similarity=0.800  Sum_probs=24.2

Q ss_pred             CCCCCcEEe-CCcEEeCCCCCCCCCCCC
Q psy3697          32 PCSGQGVCI-EGVCTCDAMYTGEACDVP   58 (117)
Q Consensus        32 ~C~~~G~C~-~g~C~C~~g~~G~~C~~~   58 (117)
                      .|.+||+|. ...|+|.++|.+++|+..
T Consensus       631 ~C~g~GVCnn~~~ChC~~gwapp~C~~~  658 (716)
T KOG3607|consen  631 TCNGHGVCNNELNCHCEPGWAPPFCFIF  658 (716)
T ss_pred             ccCCCcccCCCcceeeCCCCCCCccccc
Confidence            799999995 478999999999999975


No 44 
>KOG0994|consensus
Probab=85.20  E-value=2.2  Score=38.07  Aligned_cols=69  Identities=28%  Similarity=0.683  Sum_probs=37.9

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCc----EE--eCCcEEeCCCCCCCCCCCC--------CC-CCCCCCCC
Q psy3697           4 GEACDVPICPNNCSYSSVNSCPSSKSDKPCSGQG----VC--IEGVCTCDAMYTGEACDVP--------IC-PNNCSYSN   68 (117)
Q Consensus         4 G~~C~~~~C~~~c~~~~~~~C~~~~~~~~C~~~G----~C--~~g~C~C~~g~~G~~C~~~--------~c-~~~C~~~~   68 (117)
                      |..|+  .|...-.+|....|..-    +|...|    .|  ++|+|+|.+|-.|..|+.-        .| +-.|.+|.
T Consensus       792 GR~Cd--qCApGtyGFGPsGCk~C----dC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA  865 (1758)
T KOG0994|consen  792 GRRCD--QCAPGTYGFGPSGCKAC----DCNSIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHA  865 (1758)
T ss_pred             ccccc--ccCCcccCcCCccCccc----cccccccccccccccccceeeccccchhhccccCCCccCCCcCccccccCcc
Confidence            44443  35555444444445432    444333    35  6689999998888887741        12 11344443


Q ss_pred             CeEecCCCeE
Q psy3697          69 GVCKHEFHRC   78 (117)
Q Consensus        69 G~C~~~~g~C   78 (117)
                      .+|++..|.|
T Consensus       866 ~~Cd~~tGaC  875 (1758)
T KOG0994|consen  866 DTCDPITGAC  875 (1758)
T ss_pred             cccCcccccc
Confidence            4676666665


No 45 
>PHA02887 EGF-like protein; Provisional
Probab=84.95  E-value=0.8  Score=30.06  Aligned_cols=26  Identities=42%  Similarity=0.957  Sum_probs=20.0

Q ss_pred             CCCCCcEEe------CCcEEeCCCCCCCCCCCC
Q psy3697          32 PCSGQGVCI------EGVCTCDAMYTGEACDVP   58 (117)
Q Consensus        32 ~C~~~G~C~------~g~C~C~~g~~G~~C~~~   58 (117)
                      -|. ||+|.      ...|.|+.||+|..|+..
T Consensus        93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             Eee-CCEEEccccCCCceeECCCCcccCCCCcc
Confidence            355 68882      246999999999999863


No 46 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=83.41  E-value=0.52  Score=26.75  Aligned_cols=14  Identities=29%  Similarity=0.468  Sum_probs=4.9

Q ss_pred             eEeeCCCCcccCCC
Q psy3697          77 RCECMDKYKEMKQL   90 (117)
Q Consensus        77 ~C~C~~g~~G~~C~   90 (117)
                      .|+|+.-|.|++|+
T Consensus        37 ~CECn~Cy~GpdCS   50 (56)
T PF04863_consen   37 VCECNSCYGGPDCS   50 (56)
T ss_dssp             --EE-TTEESTTS-
T ss_pred             cccccCCcCCCCcc
Confidence            34444444444444


No 47 
>KOG1214|consensus
Probab=83.22  E-value=2.4  Score=36.65  Aligned_cols=45  Identities=27%  Similarity=0.718  Sum_probs=32.5

Q ss_pred             CcEEeCCCCCCCC--CC-CCCC-CCCCCCCCCeEecCCC--eEeeCCCCccc
Q psy3697          42 GVCTCDAMYTGEA--CD-VPIC-PNNCSYSNGVCKHEFH--RCECMDKYKEM   87 (117)
Q Consensus        42 g~C~C~~g~~G~~--C~-~~~c-~~~C~~~~G~C~~~~g--~C~C~~g~~G~   87 (117)
                      +.|.|-+||.|+-  |. ...| +..|... ++|....+  .|.|.+||.|+
T Consensus       809 y~C~CLPGfsGDG~~c~dvDeC~psrChp~-A~CyntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  809 YSCACLPGFSGDGHQCTDVDECSPSRCHPA-ATCYNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             EEEeecCCccCCccccccccccCccccCCC-ceEecCCCcceeecccCccCC
Confidence            4699999999863  33 2334 3456665 88876555  89999999987


No 48 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=79.78  E-value=1.5  Score=29.22  Aligned_cols=17  Identities=35%  Similarity=0.718  Sum_probs=13.6

Q ss_pred             CcEEeCCCCCCCCCCCC
Q psy3697          42 GVCTCDAMYTGEACDVP   58 (117)
Q Consensus        42 g~C~C~~g~~G~~C~~~   58 (117)
                      ..|.|+.||+|..||..
T Consensus        67 ~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         67 MYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             ceeECCCCcccccccce
Confidence            45888888888888864


No 49 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=79.30  E-value=2.2  Score=27.34  Aligned_cols=10  Identities=0%  Similarity=-0.223  Sum_probs=7.2

Q ss_pred             CCcccCCCcC
Q psy3697          83 KYKEMKQLFD   92 (117)
Q Consensus        83 g~~G~~C~~~   92 (117)
                      .|.|+.|+..
T Consensus        53 ~W~G~aCqKk   62 (103)
T PF12955_consen   53 HWGGPACQKK   62 (103)
T ss_pred             eecccccccc
Confidence            5888888753


No 50 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=78.01  E-value=2.8  Score=21.44  Aligned_cols=27  Identities=26%  Similarity=0.680  Sum_probs=15.2

Q ss_pred             CCCCCCCCCCCCCCCeEecCCCeEeeCCCCccc
Q psy3697          55 CDVPICPNNCSYSNGVCKHEFHRCECMDKYKEM   87 (117)
Q Consensus        55 C~~~~c~~~C~~~~G~C~~~~g~C~C~~g~~G~   87 (117)
                      |+...|+..|..+      ..+.|.|++||.-+
T Consensus         3 Cn~t~CpA~CDpn------~~~~C~CPeGyIld   29 (34)
T PF09064_consen    3 CNQTECPADCDPN------SPGQCFCPEGYILD   29 (34)
T ss_pred             cccccCCCccCCC------CCCceeCCCceEec
Confidence            4544555444432      24478888888543


No 51 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=77.94  E-value=2.1  Score=28.60  Aligned_cols=23  Identities=26%  Similarity=0.613  Sum_probs=17.9

Q ss_pred             CeEecCC----CeEeeCCCCcccCCCc
Q psy3697          69 GVCKHEF----HRCECMDKYKEMKQLF   91 (117)
Q Consensus        69 G~C~~~~----g~C~C~~g~~G~~C~~   91 (117)
                      |.|.-..    -.|.|..||+|..|+.
T Consensus        56 G~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         56 GDCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             CEEEeeccCCCceeECCCCcccccccc
Confidence            6774322    3799999999999984


No 52 
>KOG1836|consensus
Probab=59.52  E-value=7.6  Score=36.09  Aligned_cols=48  Identities=27%  Similarity=0.701  Sum_probs=34.0

Q ss_pred             cEEeCCCCCCCCCCCC-------------CC---CCCCCCCCCeEecCCCeEeeCCCCcccCCC
Q psy3697          43 VCTCDAMYTGEACDVP-------------IC---PNNCSYSNGVCKHEFHRCECMDKYKEMKQL   90 (117)
Q Consensus        43 ~C~C~~g~~G~~C~~~-------------~c---~~~C~~~~G~C~~~~g~C~C~~g~~G~~C~   90 (117)
                      .|.|+.+|+|.+|+.-             .+   +-+|.++..+|+..+|.|.|.+.-.|..|+
T Consensus       696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~  759 (1705)
T KOG1836|consen  696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCA  759 (1705)
T ss_pred             hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchh
Confidence            4999999999999851             01   113444225788888999888777777775


No 53 
>KOG3516|consensus
Probab=54.74  E-value=12  Score=33.58  Aligned_cols=27  Identities=37%  Similarity=0.987  Sum_probs=22.3

Q ss_pred             CCCCCcEEeC----CcEEeC-CCCCCCCCCCC
Q psy3697          32 PCSGQGVCIE----GVCTCD-AMYTGEACDVP   58 (117)
Q Consensus        32 ~C~~~G~C~~----g~C~C~-~g~~G~~C~~~   58 (117)
                      .|.|+|.|+.    +.|.|. ..|.|++|...
T Consensus       962 ~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~e  993 (1306)
T KOG3516|consen  962 PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKE  993 (1306)
T ss_pred             cccCCCEEEEecCceeeccccCcCCCCccccc
Confidence            8999999954    579986 68999999864


No 54 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=47.60  E-value=12  Score=17.47  Aligned_cols=10  Identities=30%  Similarity=0.843  Sum_probs=8.0

Q ss_pred             eEeeCCCCcc
Q psy3697          77 RCECMDKYKE   86 (117)
Q Consensus        77 ~C~C~~g~~G   86 (117)
                      +|.|++||+-
T Consensus         3 ~C~C~~Gy~l   12 (24)
T PF12662_consen    3 TCSCPPGYQL   12 (24)
T ss_pred             EeeCCCCCcC
Confidence            6889999963


No 55 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=41.94  E-value=21  Score=18.23  Aligned_cols=18  Identities=33%  Similarity=0.741  Sum_probs=10.3

Q ss_pred             eEecCCC--eEeeCCCCccc
Q psy3697          70 VCKHEFH--RCECMDKYKEM   87 (117)
Q Consensus        70 ~C~~~~g--~C~C~~g~~G~   87 (117)
                      .|....+  +|.|++||.-.
T Consensus        11 ~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen   11 ICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             EEEEETTSEEEE-STTEEE-
T ss_pred             CCccCCCceEeECCCCCEEC
Confidence            4544333  89999998644


No 56 
>KOG3512|consensus
Probab=37.68  E-value=44  Score=27.35  Aligned_cols=22  Identities=14%  Similarity=0.434  Sum_probs=20.0

Q ss_pred             CeEecCCCeEeeCCCCcccCCC
Q psy3697          69 GVCKHEFHRCECMDKYKEMKQL   90 (117)
Q Consensus        69 G~C~~~~g~C~C~~g~~G~~C~   90 (117)
                      -+|.+.+|+|.|.+|-+|..|.
T Consensus       407 ktCNq~tGqCpCkeGvtG~tCn  428 (592)
T KOG3512|consen  407 KTCNQTTGQCPCKEGVTGLTCN  428 (592)
T ss_pred             ccccccCCcccCCCCCcccccc
Confidence            5898889999999999999985


No 57 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=27.98  E-value=57  Score=17.66  Aligned_cols=14  Identities=43%  Similarity=1.115  Sum_probs=8.5

Q ss_pred             cEEeCCcEEeCCCCC
Q psy3697          37 GVCIEGVCTCDAMYT   51 (117)
Q Consensus        37 G~C~~g~C~C~~g~~   51 (117)
                      ..|+.| |.|++||.
T Consensus        29 ~~C~~g-C~C~~G~v   42 (55)
T PF01826_consen   29 EPCVEG-CFCPPGYV   42 (55)
T ss_dssp             SS-ESE-EEETTTEE
T ss_pred             CCCCcc-CCCCCCee
Confidence            345556 77777764


No 58 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=25.67  E-value=96  Score=19.34  Aligned_cols=24  Identities=21%  Similarity=0.808  Sum_probs=14.3

Q ss_pred             CCCCCCCCeEecCC-CeEeeCCCCcc
Q psy3697          62 NNCSYSNGVCKHEF-HRCECMDKYKE   86 (117)
Q Consensus        62 ~~C~~~~G~C~~~~-g~C~C~~g~~G   86 (117)
                      ..|... |.|.... ..|.|.+||.-
T Consensus        84 ~~CG~~-g~C~~~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   84 GFCGPN-GICNSNNSPKCSCLPGFEP  108 (110)
T ss_pred             cccCCc-cEeCCCCCCceECCCCcCC
Confidence            356665 7775322 26888887753


No 59 
>KOG1214|consensus
Probab=24.33  E-value=1.8e+02  Score=25.92  Aligned_cols=44  Identities=30%  Similarity=0.639  Sum_probs=30.2

Q ss_pred             CcEEeCCCCCCC--CCCCC----CCCCCCCCCCCeEecCCC--eEeeCCCCcc
Q psy3697          42 GVCTCDAMYTGE--ACDVP----ICPNNCSYSNGVCKHEFH--RCECMDKYKE   86 (117)
Q Consensus        42 g~C~C~~g~~G~--~C~~~----~c~~~C~~~~G~C~~~~g--~C~C~~g~~G   86 (117)
                      .+|.|..+|.|.  .|..+    .+...|..+ ..|....+  +|+|..+|.-
T Consensus       716 ~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~-s~Cin~pg~~rceC~~gy~F  767 (1289)
T KOG1214|consen  716 YTCECSSGYQGDGRNCVDENECATGFHRCGPN-SVCINLPGSYRCECRSGYEF  767 (1289)
T ss_pred             eEEEEeeccCCCCCCCCChhhhccCCCCCCCC-ceeecCCCceeEEEeeccee
Confidence            479999999875  44322    244578776 88976665  7888777643


No 60 
>PF05294 Toxin_5:  Scorpion short toxin;  InterPro: IPR007958 This family contains various secreted scorpion short toxins which seem to be unrelated to those described in IPR001947 from INTERPRO.; GO: 0009405 pathogenesis, 0005576 extracellular region; PDB: 1SIS_A 1CHL_A.
Probab=22.01  E-value=1e+02  Score=15.37  Aligned_cols=15  Identities=40%  Similarity=1.156  Sum_probs=10.9

Q ss_pred             CCCCCcEEeCCcEEe
Q psy3697          32 PCSGQGVCIEGVCTC   46 (117)
Q Consensus        32 ~C~~~G~C~~g~C~C   46 (117)
                      -|.++|.|.-.+|+|
T Consensus        18 CCgg~GkC~GpqClC   32 (32)
T PF05294_consen   18 CCGGRGKCFGPQCLC   32 (32)
T ss_dssp             HCTTSEEEETTEEEE
T ss_pred             HhCCCCeEcCCcccC
Confidence            366778887777775


Done!