Query         psy18122
Match_columns 188
No_of_seqs    219 out of 1499
Neff          8.5 
Searched_HMMs 46136
Date          Fri Aug 16 23:04:17 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy18122.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18122hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0994|consensus               99.3   1E-11 2.2E-16  113.4   8.3  121   17-144   776-904 (1758)
  2 KOG1219|consensus               99.2 2.4E-11 5.3E-16  116.3   7.6  106    5-132  3865-3982(4289)
  3 KOG1225|consensus               99.2 1.3E-10 2.8E-15  101.0   8.9   93    8-125   252-365 (525)
  4 KOG1226|consensus               98.8 1.1E-08 2.4E-13   91.0   8.4   31    6-36    463-493 (783)
  5 KOG0994|consensus               98.8 1.4E-08   3E-13   93.5   8.4  107   21-132   428-549 (1758)
  6 KOG4289|consensus               98.8   6E-09 1.3E-13   97.4   6.0   83   21-129  1222-1318(2531)
  7 KOG1225|consensus               98.7 3.2E-08 6.9E-13   86.3   7.0   64    9-80    284-365 (525)
  8 KOG4289|consensus               98.6 9.5E-08 2.1E-12   89.7   7.8  107    7-128  1719-1838(2531)
  9 KOG1226|consensus               98.4 1.1E-06 2.4E-11   78.7   9.3  100    9-128   513-621 (783)
 10 KOG1836|consensus               98.4 7.2E-07 1.6E-11   86.9   8.4  125   11-140   732-881 (1705)
 11 KOG1836|consensus               98.3 1.1E-06 2.4E-11   85.6   6.9  114   21-141   695-825 (1705)
 12 KOG1219|consensus               98.2 2.1E-06 4.5E-11   83.9   5.0   69    4-83   3903-3978(4289)
 13 smart00051 DSL delta serrate l  98.0 3.3E-06 7.2E-11   53.5   2.5   51   30-81     13-63  (63)
 14 KOG3512|consensus               98.0 3.4E-05 7.3E-10   65.9   7.9  107   21-134   295-434 (592)
 15 PF00008 EGF:  EGF-like domain   97.8 1.2E-05 2.5E-10   44.0   1.9   26    8-33      2-32  (32)
 16 PF07974 EGF_2:  EGF-like domai  97.7 5.7E-05 1.2E-09   41.2   3.1   25   10-34      6-32  (32)
 17 cd00055 EGF_Lam Laminin-type e  97.5 0.00018 3.9E-09   43.3   3.7   28   18-46     14-43  (50)
 18 PF00008 EGF:  EGF-like domain   97.3 7.4E-05 1.6E-09   40.7   0.7   26   55-80      3-32  (32)
 19 PF00053 Laminin_EGF:  Laminin   97.3  0.0001 2.2E-09   44.1   1.1   30   17-47     12-43  (49)
 20 smart00051 DSL delta serrate l  97.3 0.00013 2.8E-09   46.2   1.5   45   78-126    14-63  (63)
 21 PF07974 EGF_2:  EGF-like domai  97.2 0.00047   1E-08   37.5   2.8   25   57-81      7-32  (32)
 22 PF12661 hEGF:  Human growth fa  97.1 0.00021 4.6E-09   30.8   0.7   13   22-34      1-13  (13)
 23 smart00179 EGF_CA Calcium-bind  97.1  0.0011 2.3E-08   37.1   3.9   26   10-35      9-39  (39)
 24 cd00055 EGF_Lam Laminin-type e  97.0 0.00092   2E-08   40.2   3.3   30   63-93     13-43  (50)
 25 smart00180 EGF_Lam Laminin-typ  97.0 0.00094   2E-08   39.5   3.1   27   17-44     12-40  (46)
 26 PF00053 Laminin_EGF:  Laminin   96.9 0.00039 8.4E-09   41.6   0.9   32   62-94     11-43  (49)
 27 KOG1218|consensus               96.8  0.0079 1.7E-07   49.4   8.6   63   62-131   117-180 (316)
 28 KOG4260|consensus               96.8  0.0014   3E-08   52.7   3.4   53   25-91    132-190 (350)
 29 cd00054 EGF_CA Calcium-binding  96.7  0.0028 6.2E-08   34.8   3.8   26   10-35      9-38  (38)
 30 smart00180 EGF_Lam Laminin-typ  96.7  0.0016 3.4E-08   38.5   2.8   28   63-91     12-40  (46)
 31 KOG1217|consensus               96.6   0.021 4.4E-07   49.0   9.8  114    8-129   175-355 (487)
 32 PF01414 DSL:  Delta serrate li  96.4 0.00059 1.3E-08   43.1  -0.5   43   38-81     21-63  (63)
 33 cd00053 EGF Epidermal growth f  96.3  0.0075 1.6E-07   32.5   3.6   26    9-34      5-35  (36)
 34 KOG4260|consensus               96.2  0.0039 8.5E-08   50.2   2.6   33   10-43    150-189 (350)
 35 smart00181 EGF Epidermal growt  96.0   0.013 2.8E-07   31.9   3.6   25   10-35      6-35  (35)
 36 smart00179 EGF_CA Calcium-bind  95.9    0.01 2.2E-07   32.9   3.1   27   56-82      9-39  (39)
 37 KOG1217|consensus               95.7   0.087 1.9E-06   45.1   9.3  102   21-125   110-264 (487)
 38 KOG3512|consensus               95.6   0.019 4.2E-07   49.5   4.7   69   21-92    359-437 (592)
 39 cd00054 EGF_CA Calcium-binding  95.4   0.021 4.6E-07   31.2   3.0   27   56-82      9-38  (38)
 40 PF01414 DSL:  Delta serrate li  95.4   0.007 1.5E-07   38.2   1.0   39   84-126    20-63  (63)
 41 KOG1218|consensus               95.3    0.13 2.9E-06   42.0   8.6   26   13-38    154-179 (316)
 42 cd00053 EGF Epidermal growth f  95.2   0.027 5.8E-07   30.2   3.0   26   56-81      6-35  (36)
 43 KOG1214|consensus               94.8   0.062 1.4E-06   49.6   5.5   22   15-36    705-733 (1289)
 44 smart00181 EGF Epidermal growt  94.8   0.043 9.4E-07   29.7   3.0   25   56-81      6-34  (35)
 45 KOG0196|consensus               94.5    0.24 5.2E-06   45.9   8.5   92   58-153   248-351 (996)
 46 KOG1214|consensus               94.0    0.12 2.6E-06   47.9   5.5   47   21-80    809-860 (1289)
 47 PF07645 EGF_CA:  Calcium-bindi  91.2    0.18   4E-06   28.8   2.1   22    9-30      9-34  (42)
 48 KOG1388|consensus               91.0    0.44 9.6E-06   37.2   4.5   71   58-132    54-131 (217)
 49 PF07645 EGF_CA:  Calcium-bindi  89.7    0.21 4.5E-06   28.6   1.4   23   55-77      9-34  (42)
 50 PF12947 EGF_3:  EGF domain;  I  88.7    0.13 2.7E-06   28.7  -0.0   24   56-79      6-32  (36)
 51 PF12947 EGF_3:  EGF domain;  I  88.5    0.24 5.2E-06   27.5   1.0   23   10-32      6-32  (36)
 52 KOG3514|consensus               87.7    0.35 7.7E-06   46.0   2.1   36    2-37    621-661 (1591)
 53 PHA03099 epidermal growth fact  84.7    0.77 1.7E-05   33.0   2.1   25   12-37     53-83  (139)
 54 PF01683 EB:  EB module;  Inter  84.6     3.8 8.2E-05   24.2   5.0   20   57-77     27-46  (52)
 55 PF01683 EB:  EB module;  Inter  80.0     2.7 5.8E-05   24.9   3.0   25    6-30     22-46  (52)
 56 PF12955 DUF3844:  Domain of un  78.6     1.8 3.8E-05   30.1   2.1   25    8-32     11-44  (103)
 57 KOG0196|consensus               78.4     3.4 7.4E-05   38.7   4.5   52   21-77    259-317 (996)
 58 PHA02887 EGF-like protein; Pro  77.9       2 4.3E-05   30.4   2.2   26   57-83     93-123 (126)
 59 PF12662 cEGF:  Complement Clr-  75.0       3 6.4E-05   21.0   1.8   11   68-78      2-12  (24)
 60 KOG3516|consensus               73.7     2.9 6.2E-05   40.5   2.7   33    5-37    546-583 (1306)
 61 KOG3607|consensus               72.5     2.8 6.1E-05   38.8   2.4   29    9-37    629-658 (716)
 62 KOG3516|consensus               66.4     4.4 9.5E-05   39.3   2.3   38   46-83    541-582 (1306)
 63 PHA03099 epidermal growth fact  60.0      15 0.00033   26.5   3.6   17   67-83     66-82  (139)
 64 PHA02887 EGF-like protein; Pro  59.7      12 0.00027   26.5   3.0    8   12-20     94-101 (126)
 65 KOG3607|consensus               58.2      14 0.00029   34.5   3.9   28   56-83    630-657 (716)
 66 KOG3514|consensus               54.1     8.6 0.00019   37.2   2.0   32   52-83    625-660 (1591)
 67 PF04863 EGF_alliinase:  Alliin  51.4     6.5 0.00014   23.9   0.5   28    9-36     16-51  (56)
 68 KOG1388|consensus               47.7      19 0.00041   28.3   2.7   69   12-83     54-127 (217)
 69 PF00954 S_locus_glycop:  S-loc  47.4      21 0.00046   24.5   2.7   23   55-77     83-107 (110)
 70 PF14670 FXa_inhibition:  Coagu  42.5     8.4 0.00018   21.2   0.0   13   67-79     18-30  (36)
 71 KOG3509|consensus               37.1      44 0.00096   32.2   3.8   67   21-92    718-792 (964)
 72 PF09064 Tme5_EGF_like:  Thromb  35.2      20 0.00043   19.6   0.8   10   21-30     18-27  (34)
 73 KOG3509|consensus               27.3      81  0.0018   30.5   3.8   23   67-90    717-739 (964)
 74 PF11403 Yeast_MT:  Yeast metal  24.3      39 0.00085   18.3   0.7   18   56-75     12-29  (40)
 75 cd01475 vWA_Matrilin VWA_Matri  23.1      92   0.002   24.1   3.0   17   63-79    200-219 (224)
 76 PF01826 TIL:  Trypsin Inhibito  21.6   1E+02  0.0022   18.1   2.4    9   69-77     34-42  (55)

No 1  
>KOG0994|consensus
Probab=99.28  E-value=1e-11  Score=113.38  Aligned_cols=121  Identities=26%  Similarity=0.556  Sum_probs=97.0

Q ss_pred             eEec--ceEeCCCCCcCCCCCCCCCCCCCCCCCC--CCccCCCCCCCCCcEecC-CCceecCCCCCCCCCCCCCCCCccC
Q psy18122         17 VEMD--SMCLCTPGFAGKKCEIPCDRNNFGHKCS--MLCSKSPNQCKGMVLCTS-NFGCSCATGYQGEQCLEHCPEDYYG   91 (188)
Q Consensus        17 ~C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~C~--~~C~c~~~~C~~~g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G   91 (188)
                      .|.+  |+|+|+|+..|+.|+. |.+|+||..-+  ..|.|.... .-+..|.. ++.|.|.+|..|..|. .|.+||||
T Consensus       776 vCn~~GGqCqCkPnVVGR~Cdq-CApGtyGFGPsGCk~CdC~~~G-s~~~~Cd~~tGQC~C~~g~ygrqCn-qCqpG~Wg  852 (1758)
T KOG0994|consen  776 VCNPNGGQCQCKPNVVGRRCDQ-CAPGTYGFGPSGCKACDCNSIG-SLDKYCDKITGQCQCRPGTYGRQCN-QCQPGYWG  852 (1758)
T ss_pred             cccCCCceecccCccccccccc-cCCcccCcCCccCccccccccc-cccccccccccceeeccccchhhcc-ccCCCccC
Confidence            4444  9999999999999998 99999998866  567665321 12345766 7999999999999998 79999999


Q ss_pred             CCccCCCcccc---cceeccccccccccccCCccCCCCCCCCCCCCCCCeeecCcc
Q psy18122         92 ADCKQKCGECI---DGCDQYTGVCRGGCTTPYLIRPYCKQPHSYLEESPRVLDSSY  144 (188)
Q Consensus        92 ~~C~~~C~~C~---~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~~~~~~~~~~~  144 (188)
                      ..--.+|. |+   ..|++.+|.|+. |..-. +|.+|++ |..++++.+++.+.+
T Consensus       853 FPeCr~Cq-CNgHA~~Cd~~tGaCi~-CqD~T-~G~~Cdr-Cl~GyyGdP~lg~g~  904 (1758)
T KOG0994|consen  853 FPECRPCQ-CNGHADTCDPITGACID-CQDST-TGHSCDR-CLDGYYGDPRLGSGI  904 (1758)
T ss_pred             CCcCcccc-ccCcccccCcccccccc-ccccc-cccchhh-hhccccCCcccCCCC
Confidence            64334677 88   899999999986 88877 8999987 888888887776554


No 2  
>KOG1219|consensus
Probab=99.22  E-value=2.4e-11  Score=116.33  Aligned_cols=106  Identities=26%  Similarity=0.652  Sum_probs=84.5

Q ss_pred             CCCCCCCCCCCCeEec-----ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCC
Q psy18122          5 TNENSIECLNGGVEMD-----SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGY   76 (188)
Q Consensus         5 ~~~~~~~C~~~g~C~~-----~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~   76 (188)
                      +.++.+||+|+|+|..     |.|.|++.|.|..||..-.            .|.++||..||+|..   .+.|.|+.||
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~e------------pC~snPC~~GgtCip~~n~f~CnC~~gy 3932 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLE------------PCASNPCLTGGTCIPFYNGFLCNCPNGY 3932 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccccc------------cccCCCCCCCCEEEecCCCeeEeCCCCc
Confidence            5678899999999987     8999999999999997422            456889999999987   5669999999


Q ss_pred             CCCCCCCCCCCCccCCCccCCCcccc--cceecccc--ccccccccCCccCCCCCCCCCC
Q psy18122         77 QGEQCLEHCPEDYYGADCKQKCGECI--DGCDQYTG--VCRGGCTTPYLIRPYCKQPHSY  132 (188)
Q Consensus        77 ~G~~C~~~C~~g~~G~~C~~~C~~C~--~~C~~~~g--~C~~~C~~g~~~G~~C~~~c~~  132 (188)
                      +|.+|+..   |  =..|+..  .|.  +.|.+..|  .|.  |.+|| .|..|...-|.
T Consensus      3933 TG~~Ce~~---G--i~eCs~n--~C~~gg~C~n~~gsf~Cn--cT~g~-~gr~c~~~~pn 3982 (4289)
T KOG1219|consen 3933 TGKRCEAR---G--ISECSKN--VCGTGGQCINIPGSFHCN--CTPGI-LGRTCCAEKPN 3982 (4289)
T ss_pred             cCceeecc---c--ccccccc--cccCCceeeccCCceEec--cChhH-hcccCccccCc
Confidence            99999853   1  1223321  255  88888888  899  99999 99998764444


No 3  
>KOG1225|consensus
Probab=99.16  E-value=1.3e-10  Score=101.00  Aligned_cols=93  Identities=33%  Similarity=0.919  Sum_probs=65.4

Q ss_pred             CCCCCCCCCeEecceEeCCCCCcCCCCCCC------------------CCCCCCCCCCCC-CccCCCCCCCCCcEecCCC
Q psy18122          8 NSIECLNGGVEMDSMCLCTPGFAGKKCEIP------------------CDRNNFGHKCSM-LCSKSPNQCKGMVLCTSNF   68 (188)
Q Consensus         8 ~~~~C~~~g~C~~~~C~C~~G~~G~~C~~~------------------C~~g~~G~~C~~-~C~c~~~~C~~~g~C~~~~   68 (188)
                      .+.-|.++|.|.++.|+|++||+|.+|++.                  |.++|.|.+|+. +|   +..|.++|.|++ +
T Consensus       252 C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~c---padC~g~G~Ci~-G  327 (525)
T KOG1225|consen  252 CPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRC---PADCSGHGKCID-G  327 (525)
T ss_pred             CCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccC---CccCCCCCcccC-C
Confidence            344577778999999999999999999873                  666777777763 33   245777777774 7


Q ss_pred             ceecCCCCCCCCCCCCCCCCccCCCccCCCcccc--cceeccccccccccccCCccCCC
Q psy18122         69 GCSCATGYQGEQCLEHCPEDYYGADCKQKCGECI--DGCDQYTGVCRGGCTTPYLIRPY  125 (188)
Q Consensus        69 ~C~C~~g~~G~~C~~~C~~g~~G~~C~~~C~~C~--~~C~~~~g~C~~~C~~g~~~G~~  125 (188)
                      +|.|.+||+|..|.+.              . |.  +.|  +.| |+  |..|| .|.+
T Consensus       328 ~C~C~~Gy~G~~C~~~--------------~-C~~~g~c--v~g-C~--C~~Gw-~G~d  365 (525)
T KOG1225|consen  328 ECLCDEGYTGELCIQR--------------A-CSGGGQC--VNG-CK--CKKGW-RGPD  365 (525)
T ss_pred             ceEeCCCCcCCccccc--------------c-cCCCcee--ccC-ce--eccCc-cCCC
Confidence            7777777777776642              1 32  333  256 88  99999 8887


No 4  
>KOG1226|consensus
Probab=98.84  E-value=1.1e-08  Score=91.05  Aligned_cols=31  Identities=39%  Similarity=0.753  Sum_probs=26.6

Q ss_pred             CCCCCCCCCCCeEecceEeCCCCCcCCCCCC
Q psy18122          6 NENSIECLNGGVEMDSMCLCTPGFAGKKCEI   36 (188)
Q Consensus         6 ~~~~~~C~~~g~C~~~~C~C~~G~~G~~C~~   36 (188)
                      +.++.-|+.+|+...|.|.|.+||.|..|+-
T Consensus       463 e~~s~~C~g~G~~~CG~C~C~~G~~G~~CEC  493 (783)
T KOG1226|consen  463 EPNSALCHGNGTFVCGQCRCDEGWLGKKCEC  493 (783)
T ss_pred             CCCccccCCCCcEEecceecCCCCCCCcccC
Confidence            3456678889999999999999999999983


No 5  
>KOG0994|consensus
Probab=98.82  E-value=1.4e-08  Score=93.46  Aligned_cols=107  Identities=23%  Similarity=0.583  Sum_probs=85.0

Q ss_pred             ceEeCCCCCcCCCCCCCCCCCCCCCCCCC-----CccCCCCCCCCCcEecC-CCceecCCCCCCCCCCCCCCCCccCCCc
Q psy18122         21 SMCLCTPGFAGKKCEIPCDRNNFGHKCSM-----LCSKSPNQCKGMVLCTS-NFGCSCATGYQGEQCLEHCPEDYYGADC   94 (188)
Q Consensus        21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~~-----~C~c~~~~C~~~g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~C   94 (188)
                      |+|.|+++..|.+|+. |.+||||..-.+     .|.|.+-.-.+++.|+. ++.|.|+.-.+|..|. .|.|.+||..-
T Consensus       428 GqC~CK~~V~G~RCd~-Ck~Gywgl~~~dp~GC~~C~CN~lGT~~~s~CD~~TG~C~ckrlvTg~~cd-qclPeh~gLs~  505 (1758)
T KOG0994|consen  428 GQCRCKEHVAGRRCDR-CKDGYWGLTSADPYGCRPCDCNPLGTRNGSGCDPETGDCYCKRLVTGIDCD-QCLPEHWGLSN  505 (1758)
T ss_pred             cccccccCcCccccch-hccCcccCccCCCCCccccccccccccCCCCCCCCCCceEeeccccCCCcc-ccCccccccCC
Confidence            8999999999999998 999999988553     45554222234444877 7889999999999998 79999999654


Q ss_pred             cC-CCcc--cc------cceeccccccccccccCCccCCCCCCCCCC
Q psy18122         95 KQ-KCGE--CI------DGCDQYTGVCRGGCTTPYLIRPYCKQPHSY  132 (188)
Q Consensus        95 ~~-~C~~--C~------~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~  132 (188)
                      .. .|.+  |.      .+|...+|+|.  |.+++ .|.+|++.++.
T Consensus       506 ~~~gc~~cdcd~GGs~d~sc~~~sGqC~--CRe~~-~GR~c~~~~~~  549 (1758)
T KOG0994|consen  506 DLEGCRPCDCDQGGSYDNSCDLHSGQCE--CREHM-LGRRCEQVCPG  549 (1758)
T ss_pred             CCCCCcccccCCCCCCCcccccccCccc--ccccc-ccccccccCCc
Confidence            42 2443  44      57888999999  99999 99999998765


No 6  
>KOG4289|consensus
Probab=98.82  E-value=6e-09  Score=97.41  Aligned_cols=83  Identities=31%  Similarity=0.803  Sum_probs=62.7

Q ss_pred             ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCCCCCCCCC-----CCCCCccCC
Q psy18122         21 SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGYQGEQCLE-----HCPEDYYGA   92 (188)
Q Consensus        21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~~G~~C~~-----~C~~g~~G~   92 (188)
                      ..|+|||||+|..|++.-+            .|-.+||.++|.|..   .|+|.|.+||+|.+|+.     .|.+|+   
T Consensus      1222 lrCrCPpGFTgd~CeTeiD------------lCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGv--- 1286 (2531)
T KOG4289|consen 1222 LRCRCPPGFTGDYCETEID------------LCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGV--- 1286 (2531)
T ss_pred             eeEeCCCCCCcccccchhH------------hhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccce---
Confidence            5799999999999987532            344789999999986   68899999999999985     344442   


Q ss_pred             CccCCCcccc--cceeccc--c-ccccccccC-CccCCCCCCC
Q psy18122         93 DCKQKCGECI--DGCDQYT--G-VCRGGCTTP-YLIRPYCKQP  129 (188)
Q Consensus        93 ~C~~~C~~C~--~~C~~~~--g-~C~~~C~~g-~~~G~~C~~~  129 (188)
                              |.  ++|.+..  | .|.  |+.| | .++.|+..
T Consensus      1287 --------C~nggtC~~~~nggf~c~--Cp~ge~-e~prC~v~ 1318 (2531)
T KOG4289|consen 1287 --------CKNGGTCVNLLNGGFCCH--CPYGEF-EDPRCEVT 1318 (2531)
T ss_pred             --------ecCCCEEeecCCCceecc--CCCccc-CCCceEEE
Confidence                    54  6665421  2 567  8887 6 88999874


No 7  
>KOG1225|consensus
Probab=98.71  E-value=3.2e-08  Score=86.27  Aligned_cols=64  Identities=38%  Similarity=0.987  Sum_probs=56.0

Q ss_pred             CCCCCCCCeEecceEeCCCCCcCCCCCCC------------------CCCCCCCCCCCCCccCCCCCCCCCcEecCCCce
Q psy18122          9 SIECLNGGVEMDSMCLCTPGFAGKKCEIP------------------CDRNNFGHKCSMLCSKSPNQCKGMVLCTSNFGC   70 (188)
Q Consensus         9 ~~~C~~~g~C~~~~C~C~~G~~G~~C~~~------------------C~~g~~G~~C~~~C~c~~~~C~~~g~C~~~~~C   70 (188)
                      +.+|++|+.+.++.|+|++||+|..|++.                  |.+||+|..|..+      .|.+++.|.+ + |
T Consensus       284 p~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~Ci~G~C~C~~Gy~G~~C~~~------~C~~~g~cv~-g-C  355 (525)
T KOG1225|consen  284 PVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKCIDGECLCDEGYTGELCIQR------ACSGGGQCVN-G-C  355 (525)
T ss_pred             CcccCCCceecCCEeecCCCccccccccccCCccCCCCCcccCCceEeCCCCcCCccccc------ccCCCceecc-C-c
Confidence            44588999999999999999999999742                  7899999999866      2889999998 7 9


Q ss_pred             ecCCCCCCCC
Q psy18122         71 SCATGYQGEQ   80 (188)
Q Consensus        71 ~C~~g~~G~~   80 (188)
                      .|..||.|.+
T Consensus       356 ~C~~Gw~G~d  365 (525)
T KOG1225|consen  356 KCKKGWRGPD  365 (525)
T ss_pred             eeccCccCCC
Confidence            9999999988


No 8  
>KOG4289|consensus
Probab=98.63  E-value=9.5e-08  Score=89.69  Aligned_cols=107  Identities=30%  Similarity=0.658  Sum_probs=73.2

Q ss_pred             CCCCCCCCCCeEec------ceEeCCCCCcCCCCCC----CCCCCCCCCCCCCCccCCCCCCCCCcEecC-CCceecCCC
Q psy18122          7 ENSIECLNGGVEMD------SMCLCTPGFAGKKCEI----PCDRNNFGHKCSMLCSKSPNQCKGMVLCTS-NFGCSCATG   75 (188)
Q Consensus         7 ~~~~~C~~~g~C~~------~~C~C~~G~~G~~C~~----~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~-~~~C~C~~g   75 (188)
                      +.-+||.|.|+|..      |+|.|++||.|++|+.    .|+.||||..--.+|.|+... .-...|.+ ++.|.|+..
T Consensus      1719 C~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~CgpC~CavsK-gfdp~CnKt~G~CqCKe~ 1797 (2531)
T KOG4289|consen 1719 CSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCELRADQPCPRGWWGFPTCGPCNCAVSK-GFDPDCNKTNGQCQCKEN 1797 (2531)
T ss_pred             hcccccccCceeecCCCCCceeEECCCcccCcchhhhccCCCCCcccCCCCccCccccccC-CCCCCccccCcceeeccc
Confidence            45688999999986      8999999999999985    499999997655777776210 01123444 588988876


Q ss_pred             CCCCC--CCCCCCCCccCCCccCCCcccccceeccccccccccccCCccCCCCCC
Q psy18122         76 YQGEQ--CLEHCPEDYYGADCKQKCGECIDGCDQYTGVCRGGCTTPYLIRPYCKQ  128 (188)
Q Consensus        76 ~~G~~--C~~~C~~g~~G~~C~~~C~~C~~~C~~~~g~C~~~C~~g~~~G~~C~~  128 (188)
                      +.-..  |- .|.- |+|..-        ..|+ .+|+|.  |.+|- .|.+|++
T Consensus      1798 hy~~~~~Cl-~CdC-~~Gs~S--------r~C~-adGqC~--C~pga-iGRqCdr 1838 (2531)
T KOG4289|consen 1798 HYRPIGSCL-PCDC-YFGSDS--------RECD-ADGQCP--CKPGA-IGRQCDR 1838 (2531)
T ss_pred             cccCCCcce-eecc-ccCCCc--------cccc-CCCcCC--CCCcc-ccccccc
Confidence            54221  21 1211 122221        3555 689999  99999 9999987


No 9  
>KOG1226|consensus
Probab=98.45  E-value=1.1e-06  Score=78.66  Aligned_cols=100  Identities=32%  Similarity=0.783  Sum_probs=64.7

Q ss_pred             CCCCCCCCeEecceEeCCCCCc----CCCCCCCCCCCCCCCCCCCCccCC-CCCCCCCcEecCCCceecCCCCCCCCCCC
Q psy18122          9 SIECLNGGVEMDSMCLCTPGFA----GKKCEIPCDRNNFGHKCSMLCSKS-PNQCKGMVLCTSNFGCSCATGYQGEQCLE   83 (188)
Q Consensus         9 ~~~C~~~g~C~~~~C~C~~G~~----G~~C~~~C~~g~~G~~C~~~C~c~-~~~C~~~g~C~~~~~C~C~~g~~G~~C~~   83 (188)
                      +.+|++.|.|..|+|.|.+...    |+.|+-  +.        ..|.-. ..-|.++|.|.- ++|.|.+||+|..|+ 
T Consensus       513 ~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCEC--Dn--------fsC~r~~g~lC~g~G~C~C-G~CvC~~GwtG~~C~-  580 (783)
T KOG1226|consen  513 SPVCSGRGDCVCGQCVCHKPDNGKIYGKFCEC--DN--------FSCERHKGVLCGGHGRCEC-GRCVCNPGWTGSACN-  580 (783)
T ss_pred             CCCcCCCCcEeCCceEecCCCCCceeeeeeec--cC--------cccccccCcccCCCCeEeC-CcEEcCCCCccCCCC-
Confidence            3489999999999999988877    777763  21        222211 345889999988 999999999999996 


Q ss_pred             CCCCCccCCCccCCCc-ccc--cceeccccccccccccC-CccCCCCCC
Q psy18122         84 HCPEDYYGADCKQKCG-ECI--DGCDQYTGVCRGGCTTP-YLIRPYCKQ  128 (188)
Q Consensus        84 ~C~~g~~G~~C~~~C~-~C~--~~C~~~~g~C~~~C~~g-~~~G~~C~~  128 (188)
                       |+..  -..|...=. .|.  +.|.  -|+|+  |... | .|..|+.
T Consensus       581 -C~~s--td~C~~~~G~iCSGrG~C~--Cg~C~--C~~~~~-sG~~CE~  621 (783)
T KOG1226|consen  581 -CPLS--TDTCESSDGQICSGRGTCE--CGRCK--CTDPPY-SGEFCEK  621 (783)
T ss_pred             -CCCC--CccccCCCCceeCCCceee--CCceE--cCCCCc-Ccchhhc
Confidence             4421  011111100 033  3332  25666  7555 7 8888776


No 10 
>KOG1836|consensus
Probab=98.43  E-value=7.2e-07  Score=86.85  Aligned_cols=125  Identities=22%  Similarity=0.536  Sum_probs=90.3

Q ss_pred             CCCCC-CeEec--ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCcc-CCCCCCCCCcEecC-----CCcee-cCCCCCCCC
Q psy18122         11 ECLNG-GVEMD--SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCS-KSPNQCKGMVLCTS-----NFGCS-CATGYQGEQ   80 (188)
Q Consensus        11 ~C~~~-g~C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~-c~~~~C~~~g~C~~-----~~~C~-C~~g~~G~~   80 (188)
                      +|+++ .+|..  |.|.|.+...|..|+. |.+||||..=...=. |.+=+|.+++.|..     .+.|. |+++|+|.+
T Consensus       732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~~-C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~r  810 (1705)
T KOG1836|consen  732 DCNGHSNICDPRTGQCKCKHNTFGGQCAQ-CVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLR  810 (1705)
T ss_pred             ccCCccccccCCCCceecccCCCCCchhh-hcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccc
Confidence            34443 57776  9999999999999998 999999987542111 22335666666654     46697 999999999


Q ss_pred             CCCCCCCCccCCCccC-----CCcc--cc--------cceeccccccccccccCCccCCCCCCCCCCCCCCCeee
Q psy18122         81 CLEHCPEDYYGADCKQ-----KCGE--CI--------DGCDQYTGVCRGGCTTPYLIRPYCKQPHSYLEESPRVL  140 (188)
Q Consensus        81 C~~~C~~g~~G~~C~~-----~C~~--C~--------~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~~~~~~~  140 (188)
                      |+ .|..||+|..-..     .|..  |+        +.|+..+|.|.. |..+- .|.+|+. |.+++++..+.
T Consensus       811 Ce-~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~-ci~nT-~g~~cd~-c~~g~~gd~l~  881 (1705)
T KOG1836|consen  811 CE-ECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGECLK-CIHNT-AGEYCDL-CKEGYFGDPLA  881 (1705)
T ss_pred             cc-cCCCccccCCCCCCCCcccCccceeccccCccccccccccccceee-ccCCc-ccccccc-cccCccccccC
Confidence            98 7999999853221     1322  43        788889999843 87877 8888887 76666666655


No 11 
>KOG1836|consensus
Probab=98.32  E-value=1.1e-06  Score=85.57  Aligned_cols=114  Identities=25%  Similarity=0.603  Sum_probs=82.6

Q ss_pred             ceEeCCCCCcCCCCCCCCCCCCCCCCCC--CCccCCCCCCCCC-cEecC-CCceecCCCCCCCCCCCCCCCCccCCC---
Q psy18122         21 SMCLCTPGFAGKKCEIPCDRNNFGHKCS--MLCSKSPNQCKGM-VLCTS-NFGCSCATGYQGEQCLEHCPEDYYGAD---   93 (188)
Q Consensus        21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~--~~C~c~~~~C~~~-g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~---   93 (188)
                      -.|.|++||+|..|+. |.++|+...=.  ..+.|.+-.|.++ ..|+. ++.|.|.+...|..|+ .|.+||+|..   
T Consensus       695 e~c~C~~g~tG~~Ce~-C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~-~C~~GfYg~~~~~  772 (1705)
T KOG1836|consen  695 EQCTCPVGYTGQFCES-CAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCA-QCVDGFYGLPDLG  772 (1705)
T ss_pred             hhccCCCCcccchhhh-cchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchh-hhcCCCCCccccC
Confidence            5699999999999998 99999866533  1233332234443 67876 7889999999999998 7999999863   


Q ss_pred             ----ccCCCcccc--c----ceeccccccccccccCCccCCCCCCCCCCCCCCCeeec
Q psy18122         94 ----CKQKCGECI--D----GCDQYTGVCRGGCTTPYLIRPYCKQPHSYLEESPRVLD  141 (188)
Q Consensus        94 ----C~~~C~~C~--~----~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~~~~~~~~  141 (188)
                          |. +|. |.  +    .++...+.|.. |+++| +|..|+. |..+++++..-.
T Consensus       773 ~~~dC~-~C~-Cp~~~~~~~~~~~~~~iCk~-Cp~gy-tG~rCe~-c~dgyfg~p~~~  825 (1705)
T KOG1836|consen  773 TSGDCQ-PCP-CPNGGACGQTPEILEVVCKN-CPPGY-TGLRCEE-CADGYFGNPLGH  825 (1705)
T ss_pred             CCCCCc-cCC-CCCChhhcCcCcccceecCC-CCCCC-ccccccc-CCCccccCCCCC
Confidence                32 344 54  3    33344567775 99999 9999998 777776555443


No 12 
>KOG1219|consensus
Probab=98.17  E-value=2.1e-06  Score=83.93  Aligned_cols=69  Identities=33%  Similarity=0.739  Sum_probs=57.8

Q ss_pred             CCCCCCCCCCCCCeEec----ceEeCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCC
Q psy18122          4 NTNENSIECLNGGVEMD----SMCLCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGY   76 (188)
Q Consensus         4 ~~~~~~~~C~~~g~C~~----~~C~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~   76 (188)
                      .+.+.++||.+||+|.+    +.|.|+.||+|.+|+..      |     .-.|..++|.++|.|.+   ++.|.|.++|
T Consensus      3903 ~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~------G-----i~eCs~n~C~~gg~C~n~~gsf~CncT~g~ 3971 (4289)
T KOG1219|consen 3903 LEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR------G-----ISECSKNVCGTGGQCINIPGSFHCNCTPGI 3971 (4289)
T ss_pred             cccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc------c-----ccccccccccCCceeeccCCceEeccChhH
Confidence            35678999999999998    88999999999999875      1     11344678999999987   4679999999


Q ss_pred             CCCCCCC
Q psy18122         77 QGEQCLE   83 (188)
Q Consensus        77 ~G~~C~~   83 (188)
                      .|+.|..
T Consensus      3972 ~gr~c~~ 3978 (4289)
T KOG1219|consen 3972 LGRTCCA 3978 (4289)
T ss_pred             hcccCcc
Confidence            9999864


No 13 
>smart00051 DSL delta serrate ligand.
Probab=98.04  E-value=3.3e-06  Score=53.46  Aligned_cols=51  Identities=25%  Similarity=0.631  Sum_probs=36.6

Q ss_pred             cCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecCCCceecCCCCCCCCC
Q psy18122         30 AGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTSNFGCSCATGYQGEQC   81 (188)
Q Consensus        30 ~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~~~~C~C~~g~~G~~C   81 (188)
                      .|..++..|+++|||..|...|.+.. .+.++.+|...+.+.|.+||+|..|
T Consensus        13 l~~~~rv~C~~~~yG~~C~~~C~~~~-d~~~~~~Cd~~G~~~C~~Gw~G~~C   63 (63)
T smart00051       13 LEYQIRVTCDENYYGEGCNKFCRPRD-DFFGHYTCDENGNKGCLEGWMGPYC   63 (63)
T ss_pred             EEEEEEeeCCCCCcCCccCCEeCcCc-cccCCccCCcCCCEecCCCCcCCCC
Confidence            34445556777777777777775322 2567888887788999999999876


No 14 
>KOG3512|consensus
Probab=97.96  E-value=3.4e-05  Score=65.93  Aligned_cols=107  Identities=23%  Similarity=0.549  Sum_probs=73.6

Q ss_pred             ceEeCCCCCcCCCCCCCCCCCCCCCCCC----------CCccCC--CCCCCCCcEecC------CCce-ecCCCCCCCCC
Q psy18122         21 SMCLCTPGFAGKKCEIPCDRNNFGHKCS----------MLCSKS--PNQCKGMVLCTS------NFGC-SCATGYQGEQC   81 (188)
Q Consensus        21 ~~C~C~~G~~G~~C~~~C~~g~~G~~C~----------~~C~c~--~~~C~~~g~C~~------~~~C-~C~~g~~G~~C   81 (188)
                      .+|.|.-+.+|+.|+. |.+.|++..-.          ..|.|.  ...|..+..+-.      .+.| .|...-.|.+|
T Consensus       295 ltCdC~HNTaGPdCgr-CKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhC  373 (592)
T KOG3512|consen  295 LTCDCEHNTAGPDCGR-CKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHC  373 (592)
T ss_pred             eEEecccCCCCCCccc-ccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecccCCCCccc
Confidence            6899999999999998 99988766643          112222  112222222211      2456 69999999999


Q ss_pred             CCCCCCCccCCC-----ccC---CCcccc------cceeccccccccccccCCccCCCCCCCCCCCC
Q psy18122         82 LEHCPEDYYGAD-----CKQ---KCGECI------DGCDQYTGVCRGGCTTPYLIRPYCKQPHSYLE  134 (188)
Q Consensus        82 ~~~C~~g~~G~~-----C~~---~C~~C~------~~C~~~~g~C~~~C~~g~~~G~~C~~~c~~~~  134 (188)
                      . .|.+||+=..     -..   .|+ |.      .+|+..+|+|.  |.+|. +|..|.+ |.+++
T Consensus       374 h-yCreGyyRd~s~pl~hrkaCk~Cd-ChpVGs~gktCNq~tGqCp--CkeGv-tG~tCnr-Ca~gy  434 (592)
T KOG3512|consen  374 H-YCREGYYRDGSKPLTHRKACKACD-CHPVGSAGKTCNQTTGQCP--CKEGV-TGLTCNR-CAPGY  434 (592)
T ss_pred             c-cccCccccCCCCCCchhhhhhhcC-CcccccccccccccCCccc--CCCCC-ccccccc-ccchh
Confidence            8 8999997421     111   144 54      68998999999  99999 9999987 55443


No 15 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.82  E-value=1.2e-05  Score=43.95  Aligned_cols=26  Identities=46%  Similarity=1.039  Sum_probs=22.9

Q ss_pred             CCCCCCCCCeEec-----ceEeCCCCCcCCC
Q psy18122          8 NSIECLNGGVEMD-----SMCLCTPGFAGKK   33 (188)
Q Consensus         8 ~~~~C~~~g~C~~-----~~C~C~~G~~G~~   33 (188)
                      .++||.|+|+|+.     +.|+|++||+|++
T Consensus         2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            4569999999987     8899999999974


No 16 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.67  E-value=5.7e-05  Score=41.16  Aligned_cols=25  Identities=28%  Similarity=0.788  Sum_probs=22.9

Q ss_pred             CCCCCCCeEe--cceEeCCCCCcCCCC
Q psy18122         10 IECLNGGVEM--DSMCLCTPGFAGKKC   34 (188)
Q Consensus        10 ~~C~~~g~C~--~~~C~C~~G~~G~~C   34 (188)
                      ..|+++|+|.  .+.|.|++||+|+.|
T Consensus         6 ~~C~~~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCCCEEECCCCCcCCCC
Confidence            5799999999  599999999999986


No 17 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.47  E-value=0.00018  Score=43.32  Aligned_cols=28  Identities=25%  Similarity=0.735  Sum_probs=25.0

Q ss_pred             Eec--ceEeCCCCCcCCCCCCCCCCCCCCCC
Q psy18122         18 EMD--SMCLCTPGFAGKKCEIPCDRNNFGHK   46 (188)
Q Consensus        18 C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~   46 (188)
                      |..  ++|.|+++|.|..|+. |.++|||..
T Consensus        14 C~~~~G~C~C~~~~~G~~C~~-C~~g~~~~~   43 (50)
T cd00055          14 CDPGTGQCECKPNTTGRRCDR-CAPGYYGLP   43 (50)
T ss_pred             ccCCCCEEeCCCcCCCCCCCC-CCCCCccCC
Confidence            765  9999999999999996 999999764


No 18 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.33  E-value=7.4e-05  Score=40.72  Aligned_cols=26  Identities=31%  Similarity=0.928  Sum_probs=21.7

Q ss_pred             CCCCCCCcEecC----CCceecCCCCCCCC
Q psy18122         55 PNQCKGMVLCTS----NFGCSCATGYQGEQ   80 (188)
Q Consensus        55 ~~~C~~~g~C~~----~~~C~C~~g~~G~~   80 (188)
                      +.+|.++|+|.+    .+.|.|++||+|++
T Consensus         3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    3 SNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            558999999986    46799999999964


No 19 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=97.30  E-value=0.0001  Score=44.14  Aligned_cols=30  Identities=30%  Similarity=0.805  Sum_probs=25.1

Q ss_pred             eEec--ceEeCCCCCcCCCCCCCCCCCCCCCCC
Q psy18122         17 VEMD--SMCLCTPGFAGKKCEIPCDRNNFGHKC   47 (188)
Q Consensus        17 ~C~~--~~C~C~~G~~G~~C~~~C~~g~~G~~C   47 (188)
                      .|..  ++|.|+++|+|+.|++ |.++||+...
T Consensus        12 ~C~~~~G~C~C~~~~~G~~C~~-C~~g~~~~~~   43 (49)
T PF00053_consen   12 TCDPSTGQCVCKPGTTGPRCDQ-CKPGYFGLPS   43 (49)
T ss_dssp             SEEETCEEESBSTTEESTTS-E-E-TTEECSTT
T ss_pred             cccCCCCEEeccccccCCcCcC-CCCccccccC
Confidence            7776  9999999999999997 9999998765


No 20 
>smart00051 DSL delta serrate ligand.
Probab=97.27  E-value=0.00013  Score=46.17  Aligned_cols=45  Identities=33%  Similarity=0.810  Sum_probs=31.0

Q ss_pred             CCCCCCCCCCCccCCCccCCCcc---cc--cceeccccccccccccCCccCCCC
Q psy18122         78 GEQCLEHCPEDYYGADCKQKCGE---CI--DGCDQYTGVCRGGCTTPYLIRPYC  126 (188)
Q Consensus        78 G~~C~~~C~~g~~G~~C~~~C~~---C~--~~C~~~~g~C~~~C~~g~~~G~~C  126 (188)
                      +..+...|+++|+|..|...|.+   +.  ..|+. .|.+.  |.+|| +|..|
T Consensus        14 ~~~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~~--C~~Gw-~G~~C   63 (63)
T smart00051       14 EYQIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNKG--CLEGW-MGPYC   63 (63)
T ss_pred             EEEEEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCEe--cCCCC-cCCCC
Confidence            33444566677777777666652   22  57776 68999  99999 99876


No 21 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.17  E-value=0.00047  Score=37.52  Aligned_cols=25  Identities=40%  Similarity=0.974  Sum_probs=22.0

Q ss_pred             CCCCCcEecCC-CceecCCCCCCCCC
Q psy18122         57 QCKGMVLCTSN-FGCSCATGYQGEQC   81 (188)
Q Consensus        57 ~C~~~g~C~~~-~~C~C~~g~~G~~C   81 (188)
                      .|.++|+|+.. +.|.|.+||+|..|
T Consensus         7 ~C~~~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence            58899999964 89999999999876


No 22 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.08  E-value=0.00021  Score=30.85  Aligned_cols=13  Identities=46%  Similarity=1.474  Sum_probs=10.7

Q ss_pred             eEeCCCCCcCCCC
Q psy18122         22 MCLCTPGFAGKKC   34 (188)
Q Consensus        22 ~C~C~~G~~G~~C   34 (188)
                      .|+|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5899999999987


No 23 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.07  E-value=0.0011  Score=37.06  Aligned_cols=26  Identities=42%  Similarity=1.104  Sum_probs=22.9

Q ss_pred             CCCCCCCeEec----ceEeCCCCCc-CCCCC
Q psy18122         10 IECLNGGVEMD----SMCLCTPGFA-GKKCE   35 (188)
Q Consensus        10 ~~C~~~g~C~~----~~C~C~~G~~-G~~C~   35 (188)
                      .+|.+++.|.+    +.|.|++||. |..|+
T Consensus         9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            78999999987    7899999999 88774


No 24 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=97.01  E-value=0.00092  Score=40.17  Aligned_cols=30  Identities=30%  Similarity=0.825  Sum_probs=25.5

Q ss_pred             EecC-CCceecCCCCCCCCCCCCCCCCccCCC
Q psy18122         63 LCTS-NFGCSCATGYQGEQCLEHCPEDYYGAD   93 (188)
Q Consensus        63 ~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~   93 (188)
                      .|.. ++.|.|+++|+|..|+ .|.++|+|..
T Consensus        13 ~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~   43 (50)
T cd00055          13 QCDPGTGQCECKPNTTGRRCD-RCAPGYYGLP   43 (50)
T ss_pred             cccCCCCEEeCCCcCCCCCCC-CCCCCCccCC
Confidence            3554 6889999999999998 7999999854


No 25 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=96.97  E-value=0.00094  Score=39.45  Aligned_cols=27  Identities=26%  Similarity=0.821  Sum_probs=24.2

Q ss_pred             eEec--ceEeCCCCCcCCCCCCCCCCCCCC
Q psy18122         17 VEMD--SMCLCTPGFAGKKCEIPCDRNNFG   44 (188)
Q Consensus        17 ~C~~--~~C~C~~G~~G~~C~~~C~~g~~G   44 (188)
                      .|..  ++|.|+++|+|.+|+. |.++|||
T Consensus        12 ~C~~~~G~C~C~~~~~G~~C~~-C~~g~~g   40 (46)
T smart00180       12 TCDPDTGQCECKPNVTGRRCDR-CAPGYYG   40 (46)
T ss_pred             cccCCCCEEECCCCCCCCCCCc-CCCCcCC
Confidence            5655  8999999999999996 9999998


No 26 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=96.88  E-value=0.00039  Score=41.58  Aligned_cols=32  Identities=28%  Similarity=0.774  Sum_probs=25.2

Q ss_pred             cEecC-CCceecCCCCCCCCCCCCCCCCccCCCc
Q psy18122         62 VLCTS-NFGCSCATGYQGEQCLEHCPEDYYGADC   94 (188)
Q Consensus        62 g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G~~C   94 (188)
                      ..|.. ++.|.|+++|+|+.|+ .|.++|++...
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~C~-~C~~g~~~~~~   43 (49)
T PF00053_consen   11 QTCDPSTGQCVCKPGTTGPRCD-QCKPGYFGLPS   43 (49)
T ss_dssp             SSEEETCEEESBSTTEESTTS--EE-TTEECSTT
T ss_pred             CcccCCCCEEeccccccCCcCc-CCCCccccccC
Confidence            35655 6889999999999999 69999999754


No 27 
>KOG1218|consensus
Probab=96.83  E-value=0.0079  Score=49.36  Aligned_cols=63  Identities=37%  Similarity=0.971  Sum_probs=46.2

Q ss_pred             cEecCCC-ceecCCCCCCCCCCCCCCCCccCCCccCCCcccccceeccccccccccccCCccCCCCCCCCC
Q psy18122         62 VLCTSNF-GCSCATGYQGEQCLEHCPEDYYGADCKQKCGECIDGCDQYTGVCRGGCTTPYLIRPYCKQPHS  131 (188)
Q Consensus        62 g~C~~~~-~C~C~~g~~G~~C~~~C~~g~~G~~C~~~C~~C~~~C~~~~g~C~~~C~~g~~~G~~C~~~c~  131 (188)
                      -+|.... .|.+..+|.+..|..   ++|+|..|...|. +...++...+.|.  |.+|| .|.+|...++
T Consensus       117 ~~C~~~~~~c~~~~~~~~~~C~~---~~~~g~~C~~~c~-~~~~~~~~~~~c~--c~~g~-~g~~~~~~~~  180 (316)
T KOG1218|consen  117 KTCANPRRECRCGGGYIGEQCGE---ENLVGLKCQRDCQ-CTGGCDCKNGICT--CQPGF-VGVFCVESCS  180 (316)
T ss_pred             cccCCCccceecCCcCccccccc---cCCCCCCccCCCC-CccccCCCCCcee--ccCCc-ccccccccCC
Confidence            3455433 478888888888863   6888888887763 4456666678899  99999 9998887643


No 28 
>KOG4260|consensus
Probab=96.77  E-value=0.0014  Score=52.72  Aligned_cols=53  Identities=32%  Similarity=0.780  Sum_probs=35.9

Q ss_pred             CCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC------CCceecCCCCCCCCCCCCCCCCccC
Q psy18122         25 CTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS------NFGCSCATGYQGEQCLEHCPEDYYG   91 (188)
Q Consensus        25 C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~------~~~C~C~~g~~G~~C~~~C~~g~~G   91 (188)
                      |++|..|++|.. |+-+.            ..+|.++|.|.-      ++.|.|.+||+|+.|. .|.++|+-
T Consensus       132 Cp~gtyGpdCl~-Cpggs------------er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-~Cg~eyfe  190 (350)
T KOG4260|consen  132 CPDGTYGPDCLQ-CPGGS------------ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-YCGIEYFE  190 (350)
T ss_pred             cCCCCcCCcccc-CCCCC------------cCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-ccchHHHH
Confidence            566666666654 54221            235666666753      5789999999999997 67777764


No 29 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.75  E-value=0.0028  Score=34.81  Aligned_cols=26  Identities=42%  Similarity=1.113  Sum_probs=22.6

Q ss_pred             CCCCCCCeEec----ceEeCCCCCcCCCCC
Q psy18122         10 IECLNGGVEMD----SMCLCTPGFAGKKCE   35 (188)
Q Consensus        10 ~~C~~~g~C~~----~~C~C~~G~~G~~C~   35 (188)
                      .+|.+++.|..    +.|.|++||.|..|+
T Consensus         9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            68999999987    789999999998774


No 30 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=96.75  E-value=0.0016  Score=38.49  Aligned_cols=28  Identities=32%  Similarity=0.920  Sum_probs=24.5

Q ss_pred             EecC-CCceecCCCCCCCCCCCCCCCCccC
Q psy18122         63 LCTS-NFGCSCATGYQGEQCLEHCPEDYYG   91 (188)
Q Consensus        63 ~C~~-~~~C~C~~g~~G~~C~~~C~~g~~G   91 (188)
                      .|.. ++.|.|+++|+|..|+ .|++||+|
T Consensus        12 ~C~~~~G~C~C~~~~~G~~C~-~C~~g~~g   40 (46)
T smart00180       12 TCDPDTGQCECKPNVTGRRCD-RCAPGYYG   40 (46)
T ss_pred             cccCCCCEEECCCCCCCCCCC-cCCCCcCC
Confidence            4544 5889999999999998 79999998


No 31 
>KOG1217|consensus
Probab=96.58  E-value=0.021  Score=48.99  Aligned_cols=114  Identities=26%  Similarity=0.706  Sum_probs=67.9

Q ss_pred             CCCCCCCCCeEec----ceEeCCCCCcCCCCCCC-------------CCCCCCCCCCC----------------------
Q psy18122          8 NSIECLNGGVEMD----SMCLCTPGFAGKKCEIP-------------CDRNNFGHKCS----------------------   48 (188)
Q Consensus         8 ~~~~C~~~g~C~~----~~C~C~~G~~G~~C~~~-------------C~~g~~G~~C~----------------------   48 (188)
                      ...+|.+++.|..    +.|.|+++|.|..|+..             +..++.+..|.                      
T Consensus       175 ~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C  254 (487)
T KOG1217|consen  175 YSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTC  254 (487)
T ss_pred             CCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceee
Confidence            3456899999987    67999999999987642             23333333332                      


Q ss_pred             ---------------CCccCCCC-CCCCCcEecC---CCceecCCCCCCCCCC-----CCCCCCccCCCccCCCcccccc
Q psy18122         49 ---------------MLCSKSPN-QCKGMVLCTS---NFGCSCATGYQGEQCL-----EHCPEDYYGADCKQKCGECIDG  104 (188)
Q Consensus        49 ---------------~~C~c~~~-~C~~~g~C~~---~~~C~C~~g~~G~~C~-----~~C~~g~~G~~C~~~C~~C~~~  104 (188)
                                     ..-.|... +|.++++|..   .+.|.|+++|+|..|.     ..|...+.+..|...     ..
T Consensus       255 ~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g-----~~  329 (487)
T KOG1217|consen  255 RCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANG-----GT  329 (487)
T ss_pred             eCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCC-----cc
Confidence                           11122222 3777888876   2678888888888871     133332222222211     23


Q ss_pred             eec----cccccccccccCCccCCCCCCC
Q psy18122        105 CDQ----YTGVCRGGCTTPYLIRPYCKQP  129 (188)
Q Consensus       105 C~~----~~g~C~~~C~~g~~~G~~C~~~  129 (188)
                      |..    ....|.  |..+| .|..|+..
T Consensus       330 C~~~~~~~~~~C~--c~~~~-~g~~C~~~  355 (487)
T KOG1217|consen  330 CNTLGSFGGFRCA--CGPGF-TGRRCEDS  355 (487)
T ss_pred             cccCCCCCCCCcC--CCCCC-CCCccccC
Confidence            311    112588  88999 99999854


No 32 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=96.44  E-value=0.00059  Score=43.13  Aligned_cols=43  Identities=35%  Similarity=0.804  Sum_probs=20.9

Q ss_pred             CCCCCCCCCCCCCccCCCCCCCCCcEecCCCceecCCCCCCCCC
Q psy18122         38 CDRNNFGHKCSMLCSKSPNQCKGMVLCTSNFGCSCATGYQGEQC   81 (188)
Q Consensus        38 C~~g~~G~~C~~~C~c~~~~C~~~g~C~~~~~C~C~~g~~G~~C   81 (188)
                      |.++|||..|...|.-.. .-..+.+|...+.=+|.+||+|+.|
T Consensus        21 C~~nyyG~~C~~~C~~~~-d~~ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   21 CDENYYGPNCSKFCKPRD-DSFGHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             --TTEETTTT-EE---EE-ETTEEEEE-SS--EEE-TTEESTTS
T ss_pred             CCCCCCCccccCCcCCCc-CCcCCcccCCCCCCCCCCCCcCCCC
Confidence            666666666665554211 1135667777677788888888765


No 33 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.31  E-value=0.0075  Score=32.51  Aligned_cols=26  Identities=42%  Similarity=1.101  Sum_probs=22.5

Q ss_pred             CCCCCCCCeEec----ceEeCCCCCcCC-CC
Q psy18122          9 SIECLNGGVEMD----SMCLCTPGFAGK-KC   34 (188)
Q Consensus         9 ~~~C~~~g~C~~----~~C~C~~G~~G~-~C   34 (188)
                      ..+|.+++.|..    +.|.|++||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            578999999987    889999999998 55


No 34 
>KOG4260|consensus
Probab=96.15  E-value=0.0039  Score=50.18  Aligned_cols=33  Identities=27%  Similarity=0.765  Sum_probs=27.4

Q ss_pred             CCCCCCCeEec-------ceEeCCCCCcCCCCCCCCCCCCC
Q psy18122         10 IECLNGGVEMD-------SMCLCTPGFAGKKCEIPCDRNNF   43 (188)
Q Consensus        10 ~~C~~~g~C~~-------~~C~C~~G~~G~~C~~~C~~g~~   43 (188)
                      -||.++|.|+-       |.|.|.+||.|+.|.. |.++||
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~-Cg~eyf  189 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY-CGIEYF  189 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCCccccc-cchHHH
Confidence            56888888875       8999999999999986 666654


No 35 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.00  E-value=0.013  Score=31.91  Aligned_cols=25  Identities=44%  Similarity=1.099  Sum_probs=21.0

Q ss_pred             CCCCCCCeEec----ceEeCCCCCcC-CCCC
Q psy18122         10 IECLNGGVEMD----SMCLCTPGFAG-KKCE   35 (188)
Q Consensus        10 ~~C~~~g~C~~----~~C~C~~G~~G-~~C~   35 (188)
                      .+|.++ .|..    +.|.|++||.| ..|+
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence            689888 8986    89999999999 6653


No 36 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.94  E-value=0.01  Score=32.91  Aligned_cols=27  Identities=33%  Similarity=0.995  Sum_probs=21.2

Q ss_pred             CCCCCCcEecC---CCceecCCCCC-CCCCC
Q psy18122         56 NQCKGMVLCTS---NFGCSCATGYQ-GEQCL   82 (188)
Q Consensus        56 ~~C~~~g~C~~---~~~C~C~~g~~-G~~C~   82 (188)
                      .+|.+++.|.+   .+.|.|++||. |..|+
T Consensus         9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            46778888876   46799999998 87764


No 37 
>KOG1217|consensus
Probab=95.71  E-value=0.087  Score=45.11  Aligned_cols=102  Identities=30%  Similarity=0.774  Sum_probs=60.5

Q ss_pred             ceEeCCCCCcCCCCCCC-----------------------------CCCCCCCCCCCC---CccCCCCCCCCCcEecC--
Q psy18122         21 SMCLCTPGFAGKKCEIP-----------------------------CDRNNFGHKCSM---LCSKSPNQCKGMVLCTS--   66 (188)
Q Consensus        21 ~~C~C~~G~~G~~C~~~-----------------------------C~~g~~G~~C~~---~C~c~~~~C~~~g~C~~--   66 (188)
                      +.|.|++||.|..|+..                             |..+|.+..+..   .|.-...+|.+++.|.+  
T Consensus       110 ~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~  189 (487)
T KOG1217|consen  110 YECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTG  189 (487)
T ss_pred             ceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCC
Confidence            66788888877655541                             334444444442   23222446888888876  


Q ss_pred             -CCceecCCCCCCCCCCCC-------------CCCCccCCCccCCCcccc---cceeccc--cccccccccCCccCCC
Q psy18122         67 -NFGCSCATGYQGEQCLEH-------------CPEDYYGADCKQKCGECI---DGCDQYT--GVCRGGCTTPYLIRPY  125 (188)
Q Consensus        67 -~~~C~C~~g~~G~~C~~~-------------C~~g~~G~~C~~~C~~C~---~~C~~~~--g~C~~~C~~g~~~G~~  125 (188)
                       .+.|.|+++|.|..|+..             +..++.+..|......|.   +.|....  .+|.  |++|| .+..
T Consensus       190 ~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~--~~~g~-~~~~  264 (487)
T KOG1217|consen  190 GSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCR--CPEGY-TGDA  264 (487)
T ss_pred             CCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeee--CCCCc-cccc
Confidence             466999999999988742             455666666554321132   2332221  2566  77787 7665


No 38 
>KOG3512|consensus
Probab=95.63  E-value=0.019  Score=49.53  Aligned_cols=69  Identities=25%  Similarity=0.642  Sum_probs=51.5

Q ss_pred             ceEe-CCCCCcCCCCCCCCCCCCCCCCCC--------CCccCCCCCCCCCcEecC-CCceecCCCCCCCCCCCCCCCCcc
Q psy18122         21 SMCL-CTPGFAGKKCEIPCDRNNFGHKCS--------MLCSKSPNQCKGMVLCTS-NFGCSCATGYQGEQCLEHCPEDYY   90 (188)
Q Consensus        21 ~~C~-C~~G~~G~~C~~~C~~g~~G~~C~--------~~C~c~~~~C~~~g~C~~-~~~C~C~~g~~G~~C~~~C~~g~~   90 (188)
                      +.|+ |.-...|++|.- |.+|||...=.        ..|.|.+- =.-+-+|.. ++.|.|++|.+|..|. .|.+||.
T Consensus       359 gvClnCrHnTaGrhChy-CreGyyRd~s~pl~hrkaCk~CdChpV-Gs~gktCNq~tGqCpCkeGvtG~tCn-rCa~gyq  435 (592)
T KOG3512|consen  359 GVCLNCRHNTAGRHCHY-CREGYYRDGSKPLTHRKACKACDCHPV-GSAGKTCNQTTGQCPCKEGVTGLTCN-RCAPGYQ  435 (592)
T ss_pred             ceEeecccCCCCccccc-ccCccccCCCCCCchhhhhhhcCCccc-ccccccccccCCcccCCCCCcccccc-cccchhh
Confidence            6776 899999999987 99999966532        33544421 123446764 7899999999999998 6999987


Q ss_pred             CC
Q psy18122         91 GA   92 (188)
Q Consensus        91 G~   92 (188)
                      -.
T Consensus       436 qs  437 (592)
T KOG3512|consen  436 QS  437 (592)
T ss_pred             cc
Confidence            54


No 39 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.41  E-value=0.021  Score=31.16  Aligned_cols=27  Identities=37%  Similarity=1.020  Sum_probs=20.4

Q ss_pred             CCCCCCcEecC---CCceecCCCCCCCCCC
Q psy18122         56 NQCKGMVLCTS---NFGCSCATGYQGEQCL   82 (188)
Q Consensus        56 ~~C~~~g~C~~---~~~C~C~~g~~G~~C~   82 (188)
                      .+|.+++.|.+   .+.|.|+++|.|..|+
T Consensus         9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            35777778876   4669999999987763


No 40 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=95.40  E-value=0.007  Score=38.23  Aligned_cols=39  Identities=33%  Similarity=0.830  Sum_probs=19.8

Q ss_pred             CCCCCccCCCccCCCccc---c--cceeccccccccccccCCccCCCC
Q psy18122         84 HCPEDYYGADCKQKCGEC---I--DGCDQYTGVCRGGCTTPYLIRPYC  126 (188)
Q Consensus        84 ~C~~g~~G~~C~~~C~~C---~--~~C~~~~g~C~~~C~~g~~~G~~C  126 (188)
                      .|.+.|+|..|...|.+=   .  -+|+ .+|.-+  |.+|| +|+.|
T Consensus        20 ~C~~nyyG~~C~~~C~~~~d~~ghy~Cd-~~G~~~--C~~Gw-~G~~C   63 (63)
T PF01414_consen   20 VCDENYYGPNCSKFCKPRDDSFGHYTCD-SNGNKV--CLPGW-TGPNC   63 (63)
T ss_dssp             ---TTEETTTT-EE---EEETTEEEEE--SS--EE--E-TTE-ESTTS
T ss_pred             ECCCCCCCccccCCcCCCcCCcCCcccC-CCCCCC--CCCCC-cCCCC
Confidence            566666666666555431   1  4666 478888  99999 99987


No 41 
>KOG1218|consensus
Probab=95.27  E-value=0.13  Score=42.03  Aligned_cols=26  Identities=35%  Similarity=0.832  Sum_probs=18.2

Q ss_pred             CCCCeEecceEeCCCCCcCCCCCCCC
Q psy18122         13 LNGGVEMDSMCLCTPGFAGKKCEIPC   38 (188)
Q Consensus        13 ~~~g~C~~~~C~C~~G~~G~~C~~~C   38 (188)
                      ..+-.+..+.|.|.+||.|.+|...+
T Consensus       154 ~~~~~~~~~~c~c~~g~~g~~~~~~~  179 (316)
T KOG1218|consen  154 TGGCDCKNGICTCQPGFVGVFCVESC  179 (316)
T ss_pred             ccccCCCCCceeccCCcccccccccC
Confidence            33334445888899999999887653


No 42 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.24  E-value=0.027  Score=30.21  Aligned_cols=26  Identities=35%  Similarity=1.059  Sum_probs=20.0

Q ss_pred             CCCCCCcEecC---CCceecCCCCCCC-CC
Q psy18122         56 NQCKGMVLCTS---NFGCSCATGYQGE-QC   81 (188)
Q Consensus        56 ~~C~~~g~C~~---~~~C~C~~g~~G~-~C   81 (188)
                      .+|.+++.|..   .+.|.|++||.|. .|
T Consensus         6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            46777788876   5779999999887 54


No 43 
>KOG1214|consensus
Probab=94.76  E-value=0.062  Score=49.60  Aligned_cols=22  Identities=23%  Similarity=0.564  Sum_probs=14.5

Q ss_pred             CCeEec-----ceEeCCCCCcCC--CCCC
Q psy18122         15 GGVEMD-----SMCLCTPGFAGK--KCEI   36 (188)
Q Consensus        15 ~g~C~~-----~~C~C~~G~~G~--~C~~   36 (188)
                      +..|.+     ++|.|..||.|.  +|..
T Consensus       705 ~a~C~pg~~~~~tcecs~g~~gdgr~c~d  733 (1289)
T KOG1214|consen  705 TARCHPGTGVDYTCECSSGYQGDGRNCVD  733 (1289)
T ss_pred             CccccCCCCcceEEEEeeccCCCCCCCCC
Confidence            345655     589999988644  5543


No 44 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.75  E-value=0.043  Score=29.71  Aligned_cols=25  Identities=36%  Similarity=1.131  Sum_probs=18.6

Q ss_pred             CCCCCCcEecC---CCceecCCCCCC-CCC
Q psy18122         56 NQCKGMVLCTS---NFGCSCATGYQG-EQC   81 (188)
Q Consensus        56 ~~C~~~g~C~~---~~~C~C~~g~~G-~~C   81 (188)
                      .+|.++ +|.+   .+.|.|++||.| ..|
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            356676 7765   577999999998 655


No 45 
>KOG0196|consensus
Probab=94.49  E-value=0.24  Score=45.90  Aligned_cols=92  Identities=28%  Similarity=0.549  Sum_probs=53.3

Q ss_pred             CCCCcEecC-CCceecCCCCC----CCCCCCCCCCCccCCCc-cCCCcccc-ccee--ccccccccccccCCccCCC---
Q psy18122         58 CKGMVLCTS-NFGCSCATGYQ----GEQCLEHCPEDYYGADC-KQKCGECI-DGCD--QYTGVCRGGCTTPYLIRPY---  125 (188)
Q Consensus        58 C~~~g~C~~-~~~C~C~~g~~----G~~C~~~C~~g~~G~~C-~~~C~~C~-~~C~--~~~g~C~~~C~~g~~~G~~---  125 (188)
                      |..-|.... .+.|.|.+||.    |..|+ .|+.|++=..= ...|.+|. ..-.  .....|.  |..|| .-..   
T Consensus       248 C~~dGeWlvpiG~C~C~aGye~~~~~~~C~-aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~--C~~gy-yRA~~Dp  323 (996)
T KOG0196|consen  248 CSGDGEWLVPIGGCVCKAGYEEAENGKACQ-ACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCT--CENGY-YRADSDP  323 (996)
T ss_pred             EcCCCcEEEEcCceeecCCCCcccCCCcce-eCCCCcccCCCCCCCCCCCCCCCCCCCCCCCccc--ccCCc-ccCCCCC
Confidence            444444443 58899999994    67887 79999875433 33365565 2221  2233698  99998 4322   


Q ss_pred             CCCCCCCCCCCCeeecCccceEEEEEee
Q psy18122        126 CKQPHSYLEESPRVLDSSYNAVNLQIDL  153 (188)
Q Consensus       126 C~~~c~~~~~~~~~~~~~~~~~~~~~~~  153 (188)
                      =+.+|....+.|.-+-...+..++.++.
T Consensus       324 ~~mpCT~PPSaP~nlis~vn~Ts~~L~W  351 (996)
T KOG0196|consen  324 PSMPCTRPPSAPRNLISNVNGTSLILEW  351 (996)
T ss_pred             CCCCCCCCCCccceeeeecccceEEEEe
Confidence            2234654456666554444444444443


No 46 
>KOG1214|consensus
Probab=93.98  E-value=0.12  Score=47.87  Aligned_cols=47  Identities=36%  Similarity=0.957  Sum_probs=33.5

Q ss_pred             ceEeCCCCCcCC--CCCCCCCCCCCCCCCCCCccCCCCCCCCCcEecC---CCceecCCCCCCCC
Q psy18122         21 SMCLCTPGFAGK--KCEIPCDRNNFGHKCSMLCSKSPNQCKGMVLCTS---NFGCSCATGYQGEQ   80 (188)
Q Consensus        21 ~~C~C~~G~~G~--~C~~~C~~g~~G~~C~~~C~c~~~~C~~~g~C~~---~~~C~C~~g~~G~~   80 (188)
                      ++|.|.|||.|.  .|..             .-.|.+..|....+|..   ++.|.|.+||.|.-
T Consensus       809 y~C~CLPGfsGDG~~c~d-------------vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDG  860 (1289)
T KOG1214|consen  809 YSCACLPGFSGDGHQCTD-------------VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDG  860 (1289)
T ss_pred             EEEeecCCccCCcccccc-------------ccccCccccCCCceEecCCCcceeecccCccCCC
Confidence            789999999876  2322             11334566778888876   57799999998764


No 47 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=91.24  E-value=0.18  Score=28.79  Aligned_cols=22  Identities=27%  Similarity=0.818  Sum_probs=19.4

Q ss_pred             CCCCCCCCeEec----ceEeCCCCCc
Q psy18122          9 SIECLNGGVEMD----SMCLCTPGFA   30 (188)
Q Consensus         9 ~~~C~~~g~C~~----~~C~C~~G~~   30 (188)
                      +.+|..++.|++    |.|.|++||.
T Consensus         9 ~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    9 PHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            357988999998    8999999997


No 48 
>KOG1388|consensus
Probab=91.01  E-value=0.44  Score=37.22  Aligned_cols=71  Identities=30%  Similarity=0.735  Sum_probs=48.6

Q ss_pred             CCCCcEecCCCce-ecCCCCCCCCCCCCCCCCccCCCccCCCcc--cc---cceecccccccccc-ccCCccCCCCCCCC
Q psy18122         58 CKGMVLCTSNFGC-SCATGYQGEQCLEHCPEDYYGADCKQKCGE--CI---DGCDQYTGVCRGGC-TTPYLIRPYCKQPH  130 (188)
Q Consensus        58 C~~~g~C~~~~~C-~C~~g~~G~~C~~~C~~g~~G~~C~~~C~~--C~---~~C~~~~g~C~~~C-~~g~~~G~~C~~~c  130 (188)
                      |.+.+.|.....| .|..+-+|..|+ .|..+|+|..-...|.+  |.   ..|...+++|.  | ..++ .|..|.+-.
T Consensus        54 cNGh~~c~t~~v~~~~~N~~~g~~c~-kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~--c~~kgv-vgd~c~~~e  129 (217)
T KOG1388|consen   54 CNGHSDCNTQHVCWRCENGTTGAHCE-KCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCF--CTTKGV-VGDLCPKCE  129 (217)
T ss_pred             hcCCCCcccceeeeeccCccccccCC-ceEEEEEecCCCCccCHhhhcCCeeeeeccCCccc--cccceE-ecccCcccc
Confidence            4455556543445 678888999998 69999999432223433  43   67778889999  8 5677 888888744


Q ss_pred             CC
Q psy18122        131 SY  132 (188)
Q Consensus       131 ~~  132 (188)
                      .+
T Consensus       130 ~~  131 (217)
T KOG1388|consen  130 VP  131 (217)
T ss_pred             cc
Confidence            33


No 49 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=89.67  E-value=0.21  Score=28.56  Aligned_cols=23  Identities=35%  Similarity=1.061  Sum_probs=18.9

Q ss_pred             CCCCCCCcEecC---CCceecCCCCC
Q psy18122         55 PNQCKGMVLCTS---NFGCSCATGYQ   77 (188)
Q Consensus        55 ~~~C~~~g~C~~---~~~C~C~~g~~   77 (188)
                      ...|..++.|++   +|.|.|++||.
T Consensus         9 ~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    9 PHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            446878889987   57899999997


No 50 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.73  E-value=0.13  Score=28.67  Aligned_cols=24  Identities=33%  Similarity=1.053  Sum_probs=17.0

Q ss_pred             CCCCCCcEecC---CCceecCCCCCCC
Q psy18122         56 NQCKGMVLCTS---NFGCSCATGYQGE   79 (188)
Q Consensus        56 ~~C~~~g~C~~---~~~C~C~~g~~G~   79 (188)
                      ..|..+..|..   .+.|.|++||.|.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            35677788876   5779999999875


No 51 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.46  E-value=0.24  Score=27.52  Aligned_cols=23  Identities=26%  Similarity=0.704  Sum_probs=17.6

Q ss_pred             CCCCCCCeEec----ceEeCCCCCcCC
Q psy18122         10 IECLNGGVEMD----SMCLCTPGFAGK   32 (188)
Q Consensus        10 ~~C~~~g~C~~----~~C~C~~G~~G~   32 (188)
                      ..|..+..|..    +.|.|++||+|.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            46778888887    899999999886


No 52 
>KOG3514|consensus
Probab=87.67  E-value=0.35  Score=45.97  Aligned_cols=36  Identities=33%  Similarity=0.684  Sum_probs=29.6

Q ss_pred             CCCCCCCCCCCCCCCeEec----ceEeC-CCCCcCCCCCCC
Q psy18122          2 TSNTNENSIECLNGGVEMD----SMCLC-TPGFAGKKCEIP   37 (188)
Q Consensus         2 ~~~~~~~~~~C~~~g~C~~----~~C~C-~~G~~G~~C~~~   37 (188)
                      +++.-++++||.|+|.|..    +.|.| ..||.|+.|+..
T Consensus       621 ~~~~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE  661 (1591)
T KOG3514|consen  621 SNEKICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE  661 (1591)
T ss_pred             hhccccCCCcccCCCCccccccccccccccCcccCccccce
Confidence            3455689999999999987    78988 568999999863


No 53 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=84.67  E-value=0.77  Score=32.96  Aligned_cols=25  Identities=32%  Similarity=0.914  Sum_probs=18.5

Q ss_pred             CCCCCeEec------ceEeCCCCCcCCCCCCC
Q psy18122         12 CLNGGVEMD------SMCLCTPGFAGKKCEIP   37 (188)
Q Consensus        12 C~~~g~C~~------~~C~C~~G~~G~~C~~~   37 (188)
                      |.|| .|.-      ..|.|+.||+|.+|+..
T Consensus        53 ClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         53 CLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             eECC-EEEeeccCCCceeECCCCcccccccce
Confidence            5554 6643      77999999999999864


No 54 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=84.63  E-value=3.8  Score=24.25  Aligned_cols=20  Identities=40%  Similarity=0.977  Sum_probs=16.1

Q ss_pred             CCCCCcEecCCCceecCCCCC
Q psy18122         57 QCKGMVLCTSNFGCSCATGYQ   77 (188)
Q Consensus        57 ~C~~~g~C~~~~~C~C~~g~~   77 (188)
                      .|..+..|.+ +.|.|++||.
T Consensus        27 qC~~~s~C~~-g~C~C~~g~~   46 (52)
T PF01683_consen   27 QCIGGSVCVN-GRCQCPPGYV   46 (52)
T ss_pred             CCCCcCEEcC-CEeECCCCCE
Confidence            4567788876 8999999975


No 55 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=80.01  E-value=2.7  Score=24.94  Aligned_cols=25  Identities=32%  Similarity=0.902  Sum_probs=19.8

Q ss_pred             CCCCCCCCCCCeEecceEeCCCCCc
Q psy18122          6 NENSIECLNGGVEMDSMCLCTPGFA   30 (188)
Q Consensus         6 ~~~~~~C~~~g~C~~~~C~C~~G~~   30 (188)
                      +.....|..+..|..+.|.|++||.
T Consensus        22 C~~~~qC~~~s~C~~g~C~C~~g~~   46 (52)
T PF01683_consen   22 CESDEQCIGGSVCVNGRCQCPPGYV   46 (52)
T ss_pred             CCCcCCCCCcCEEcCCEeECCCCCE
Confidence            3455668888899889999999974


No 56 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=78.57  E-value=1.8  Score=30.05  Aligned_cols=25  Identities=20%  Similarity=0.476  Sum_probs=18.7

Q ss_pred             CCCCCCCCCeEec---------ceEeCCCCCcCC
Q psy18122          8 NSIECLNGGVEMD---------SMCLCTPGFAGK   32 (188)
Q Consensus         8 ~~~~C~~~g~C~~---------~~C~C~~G~~G~   32 (188)
                      ..+.|++||.|..         +.|.|.+.+...
T Consensus        11 ~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~   44 (103)
T PF12955_consen   11 ATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT   44 (103)
T ss_pred             hccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence            4577999999976         679998854433


No 57 
>KOG0196|consensus
Probab=78.44  E-value=3.4  Score=38.66  Aligned_cols=52  Identities=29%  Similarity=0.790  Sum_probs=34.2

Q ss_pred             ceEeCCCCCc----CCCCCCCCCCCCCCCCC-CCCccCCCCCCCCCcEecC--CCceecCCCCC
Q psy18122         21 SMCLCTPGFA----GKKCEIPCDRNNFGHKC-SMLCSKSPNQCKGMVLCTS--NFGCSCATGYQ   77 (188)
Q Consensus        21 ~~C~C~~G~~----G~~C~~~C~~g~~G~~C-~~~C~c~~~~C~~~g~C~~--~~~C~C~~g~~   77 (188)
                      |.|.|.+||.    |..|+. |++|+|-..- ...|.    +|..+..=..  ...|.|..||.
T Consensus       259 G~C~C~aGye~~~~~~~C~a-Cp~G~yK~~~~~~~C~----~CP~~S~s~~ega~~C~C~~gyy  317 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQA-CPPGTYKASQGDSLCL----PCPPNSHSSSEGATSCTCENGYY  317 (996)
T ss_pred             CceeecCCCCcccCCCccee-CCCCcccCCCCCCCCC----CCCCCCCCCCCCCCcccccCCcc
Confidence            9999999995    567876 9999885544 23443    3444332211  34588888875


No 58 
>PHA02887 EGF-like protein; Provisional
Probab=77.88  E-value=2  Score=30.41  Aligned_cols=26  Identities=35%  Similarity=0.700  Sum_probs=17.5

Q ss_pred             CCCCCcEecC-----CCceecCCCCCCCCCCC
Q psy18122         57 QCKGMVLCTS-----NFGCSCATGYQGEQCLE   83 (188)
Q Consensus        57 ~C~~~g~C~~-----~~~C~C~~g~~G~~C~~   83 (188)
                      -|- +|.|.-     ...|.|+.||+|.+|+.
T Consensus        93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             Eee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence            344 356753     35588888888888863


No 59 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=74.99  E-value=3  Score=20.95  Aligned_cols=11  Identities=55%  Similarity=1.280  Sum_probs=8.9

Q ss_pred             CceecCCCCCC
Q psy18122         68 FGCSCATGYQG   78 (188)
Q Consensus        68 ~~C~C~~g~~G   78 (188)
                      +.|.|++||.-
T Consensus         2 y~C~C~~Gy~l   12 (24)
T PF12662_consen    2 YTCSCPPGYQL   12 (24)
T ss_pred             EEeeCCCCCcC
Confidence            67899999863


No 60 
>KOG3516|consensus
Probab=73.67  E-value=2.9  Score=40.49  Aligned_cols=33  Identities=24%  Similarity=0.544  Sum_probs=28.8

Q ss_pred             CCCCCCCCCCCCeEec----ceEeCC-CCCcCCCCCCC
Q psy18122          5 TNENSIECLNGGVEMD----SMCLCT-PGFAGKKCEIP   37 (188)
Q Consensus         5 ~~~~~~~C~~~g~C~~----~~C~C~-~G~~G~~C~~~   37 (188)
                      +-+.+++|.++|.|..    +.|.|. .||+|..|...
T Consensus       546 drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts  583 (1306)
T KOG3516|consen  546 DRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS  583 (1306)
T ss_pred             cccCCccccCCCcccccccceeEeccccccccccccCC
Confidence            3468899999999987    999998 89999999864


No 61 
>KOG3607|consensus
Probab=72.45  E-value=2.8  Score=38.85  Aligned_cols=29  Identities=34%  Similarity=0.614  Sum_probs=25.7

Q ss_pred             CCCCCCCCeEec-ceEeCCCCCcCCCCCCC
Q psy18122          9 SIECLNGGVEMD-SMCLCTPGFAGKKCEIP   37 (188)
Q Consensus         9 ~~~C~~~g~C~~-~~C~C~~G~~G~~C~~~   37 (188)
                      +..|+.+|.|.. ..|.|.+||.+++|+..
T Consensus       629 ~~~C~g~GVCnn~~~ChC~~gwapp~C~~~  658 (716)
T KOG3607|consen  629 PTTCNGHGVCNNELNCHCEPGWAPPFCFIF  658 (716)
T ss_pred             ccccCCCcccCCCcceeeCCCCCCCccccc
Confidence            344999999999 99999999999999874


No 62 
>KOG3516|consensus
Probab=66.42  E-value=4.4  Score=39.28  Aligned_cols=38  Identities=34%  Similarity=0.843  Sum_probs=32.4

Q ss_pred             CCCCCccCCCCCCCCCcEecC---CCceecC-CCCCCCCCCC
Q psy18122         46 KCSMLCSKSPNQCKGMVLCTS---NFGCSCA-TGYQGEQCLE   83 (188)
Q Consensus        46 ~C~~~C~c~~~~C~~~g~C~~---~~~C~C~-~g~~G~~C~~   83 (188)
                      .|...+.|.+++|+++|.|..   .+.|.|. .||.|..|..
T Consensus       541 ~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt  582 (1306)
T KOG3516|consen  541 MCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT  582 (1306)
T ss_pred             ccccccccCCccccCCCcccccccceeEeccccccccccccC
Confidence            356677788999999999987   5789998 8999999985


No 63 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=59.97  E-value=15  Score=26.47  Aligned_cols=17  Identities=35%  Similarity=1.013  Sum_probs=14.1

Q ss_pred             CCceecCCCCCCCCCCC
Q psy18122         67 NFGCSCATGYQGEQCLE   83 (188)
Q Consensus        67 ~~~C~C~~g~~G~~C~~   83 (188)
                      ...|.|..||+|.+|+.
T Consensus        66 ~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         66 GMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             CceeECCCCcccccccc
Confidence            36789999999999985


No 64 
>PHA02887 EGF-like protein; Provisional
Probab=59.65  E-value=12  Score=26.48  Aligned_cols=8  Identities=38%  Similarity=0.953  Sum_probs=4.6

Q ss_pred             CCCCCeEec
Q psy18122         12 CLNGGVEMD   20 (188)
Q Consensus        12 C~~~g~C~~   20 (188)
                      |. ||.|.-
T Consensus        94 Ci-HG~C~y  101 (126)
T PHA02887         94 CI-NGECMN  101 (126)
T ss_pred             ee-CCEEEc
Confidence            55 467753


No 65 
>KOG3607|consensus
Probab=58.24  E-value=14  Score=34.51  Aligned_cols=28  Identities=25%  Similarity=0.860  Sum_probs=24.7

Q ss_pred             CCCCCCcEecCCCceecCCCCCCCCCCC
Q psy18122         56 NQCKGMVLCTSNFGCSCATGYQGEQCLE   83 (188)
Q Consensus        56 ~~C~~~g~C~~~~~C~C~~g~~G~~C~~   83 (188)
                      ..|.++|+|.....|.|.+||.+++|+.
T Consensus       630 ~~C~g~GVCnn~~~ChC~~gwapp~C~~  657 (716)
T KOG3607|consen  630 TTCNGHGVCNNELNCHCEPGWAPPFCFI  657 (716)
T ss_pred             cccCCCcccCCCcceeeCCCCCCCcccc
Confidence            3588999999888999999999999984


No 66 
>KOG3514|consensus
Probab=54.10  E-value=8.6  Score=37.19  Aligned_cols=32  Identities=31%  Similarity=0.799  Sum_probs=23.9

Q ss_pred             cCCCCCCCCCcEecC---CCceecC-CCCCCCCCCC
Q psy18122         52 SKSPNQCKGMVLCTS---NFGCSCA-TGYQGEQCLE   83 (188)
Q Consensus        52 ~c~~~~C~~~g~C~~---~~~C~C~-~g~~G~~C~~   83 (188)
                      .|.++||.|+|.|.+   .+.|-|. .+|.|+.|++
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence            456889999999987   4567775 4677777765


No 67 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=51.36  E-value=6.5  Score=23.94  Aligned_cols=28  Identities=25%  Similarity=0.717  Sum_probs=15.5

Q ss_pred             CCCCCCCCeEec--------ceEeCCCCCcCCCCCC
Q psy18122          9 SIECLNGGVEMD--------SMCLCTPGFAGKKCEI   36 (188)
Q Consensus         9 ~~~C~~~g~C~~--------~~C~C~~G~~G~~C~~   36 (188)
                      .-+|+.||..-.        -.|.|..-|.|++|++
T Consensus        16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~   51 (56)
T PF04863_consen   16 AISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCST   51 (56)
T ss_dssp             TS--TTSEE--TTS-EETTEE--EE-TTEESTTS-E
T ss_pred             cCCcCCCCeeeeccccccCCccccccCCcCCCCccc
Confidence            347888886642        4699999999999876


No 68 
>KOG1388|consensus
Probab=47.67  E-value=19  Score=28.29  Aligned_cols=69  Identities=28%  Similarity=0.639  Sum_probs=42.7

Q ss_pred             CCCCCeEec-ceE-eCCCCCcCCCCCCCCCCCCCCCCCCCCccCCCCCCCCC-cEec-CCCceec-CCCCCCCCCCC
Q psy18122         12 CLNGGVEMD-SMC-LCTPGFAGKKCEIPCDRNNFGHKCSMLCSKSPNQCKGM-VLCT-SNFGCSC-ATGYQGEQCLE   83 (188)
Q Consensus        12 C~~~g~C~~-~~C-~C~~G~~G~~C~~~C~~g~~G~~C~~~C~c~~~~C~~~-g~C~-~~~~C~C-~~g~~G~~C~~   83 (188)
                      |++++.|.. -.| .|..+-+|..|+. |..+|+|..=...|.  +..|.++ ..|. ..+.|.| .-++.|..|.+
T Consensus        54 cNGh~~c~t~~v~~~~~N~~~g~~c~k-c~~g~~GdtN~g~c~--~~~~~g~~~~~~~~~~~c~c~~kgvvgd~c~~  127 (217)
T KOG1388|consen   54 CNGHSDCNTQHVCWRCENGTTGAHCEK-CIVGFYGDTNGGKCQ--PCDCNGGASACVTLTGKCFCTTKGVVGDLCPK  127 (217)
T ss_pred             hcCCCCcccceeeeeccCccccccCCc-eEEEEEecCCCCccC--HhhhcCCeeeeeccCCccccccceEecccCcc
Confidence            455555554 334 3788999999987 899999963322332  1112222 1233 3678888 66888888875


No 69 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=47.35  E-value=21  Score=24.51  Aligned_cols=23  Identities=30%  Similarity=0.703  Sum_probs=17.5

Q ss_pred             CCCCCCCcEecC--CCceecCCCCC
Q psy18122         55 PNQCKGMVLCTS--NFGCSCATGYQ   77 (188)
Q Consensus        55 ~~~C~~~g~C~~--~~~C~C~~g~~   77 (188)
                      .+.|...+.|..  ...|.|.+||.
T Consensus        83 y~~CG~~g~C~~~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   83 YGFCGPNGICNSNNSPKCSCLPGFE  107 (110)
T ss_pred             ccccCCccEeCCCCCCceECCCCcC
Confidence            357888899975  34599998885


No 70 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=42.54  E-value=8.4  Score=21.25  Aligned_cols=13  Identities=38%  Similarity=1.078  Sum_probs=8.3

Q ss_pred             CCceecCCCCCCC
Q psy18122         67 NFGCSCATGYQGE   79 (188)
Q Consensus        67 ~~~C~C~~g~~G~   79 (188)
                      .+.|.|++||.-.
T Consensus        18 ~~~C~C~~Gy~L~   30 (36)
T PF14670_consen   18 SYRCSCPPGYKLA   30 (36)
T ss_dssp             SEEEE-STTEEE-
T ss_pred             ceEeECCCCCEEC
Confidence            4679999988643


No 71 
>KOG3509|consensus
Probab=37.06  E-value=44  Score=32.23  Aligned_cols=67  Identities=22%  Similarity=0.520  Sum_probs=38.5

Q ss_pred             ceEeCCCCCcCCCCCCCCCCCCCCC---CCC--CCccCCCCCCCCC-cEecC-CCce-ecCCCCCCCCCCCCCCCCccCC
Q psy18122         21 SMCLCTPGFAGKKCEIPCDRNNFGH---KCS--MLCSKSPNQCKGM-VLCTS-NFGC-SCATGYQGEQCLEHCPEDYYGA   92 (188)
Q Consensus        21 ~~C~C~~G~~G~~C~~~C~~g~~G~---~C~--~~C~c~~~~C~~~-g~C~~-~~~C-~C~~g~~G~~C~~~C~~g~~G~   92 (188)
                      -.|+|++++.|..|+. |.+++.-.   .+.  ....|   .|..+ ..|.. .+.| .|.....|.+|+ .|.+|++|.
T Consensus       718 ~~C~c~~g~~G~~ce~-c~e~~~ls~t~~~~~~~~~~c---~~~~h~~~c~~~~~~nt~~q~~~~~~~~~-~~~~g~~~d  792 (964)
T KOG3509|consen  718 EQCQCPKGLVGTSCED-CAEGYTLSTTGGLYPGLCEDC---ECNSHISQCEDDLGYNTDCQNNTEGDRCE-LCSPGTYGD  792 (964)
T ss_pred             cccccCccccCccccc-ccccccccccCCcCcccCccc---ccCCCcccccccccccccccccCccceee-ecCCCcccc
Confidence            6899999999999987 77765311   111  11111   12221 23333 2233 456666777776 677777764


No 72 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=35.23  E-value=20  Score=19.58  Aligned_cols=10  Identities=30%  Similarity=1.162  Sum_probs=7.9

Q ss_pred             ceEeCCCCCc
Q psy18122         21 SMCLCTPGFA   30 (188)
Q Consensus        21 ~~C~C~~G~~   30 (188)
                      ++|.|+.||.
T Consensus        18 ~~C~CPeGyI   27 (34)
T PF09064_consen   18 GQCFCPEGYI   27 (34)
T ss_pred             CceeCCCceE
Confidence            6888888884


No 73 
>KOG3509|consensus
Probab=27.31  E-value=81  Score=30.54  Aligned_cols=23  Identities=35%  Similarity=0.852  Sum_probs=19.2

Q ss_pred             CCceecCCCCCCCCCCCCCCCCcc
Q psy18122         67 NFGCSCATGYQGEQCLEHCPEDYY   90 (188)
Q Consensus        67 ~~~C~C~~g~~G~~C~~~C~~g~~   90 (188)
                      ...|.|+.++.|..|+ .|.+++.
T Consensus       717 ~~~C~c~~g~~G~~ce-~c~e~~~  739 (964)
T KOG3509|consen  717 VEQCQCPKGLVGTSCE-DCAEGYT  739 (964)
T ss_pred             ccccccCccccCcccc-ccccccc
Confidence            4579999999999998 6877764


No 74 
>PF11403 Yeast_MT:  Yeast metallothionein;  InterPro: IPR022710  Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=24.31  E-value=39  Score=18.32  Aligned_cols=18  Identities=44%  Similarity=1.047  Sum_probs=7.5

Q ss_pred             CCCCCCcEecCCCceecCCC
Q psy18122         56 NQCKGMVLCTSNFGCSCATG   75 (188)
Q Consensus        56 ~~C~~~g~C~~~~~C~C~~g   75 (188)
                      +.|.++-.|..  .|.|+.|
T Consensus        12 gscknneqcqk--scscptg   29 (40)
T PF11403_consen   12 GSCKNNEQCQK--SCSCPTG   29 (40)
T ss_dssp             STTTT-TTSTT--S-SS-TT
T ss_pred             CCccChHHHhh--cCCCCCC
Confidence            35666555543  4555544


No 75 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=23.09  E-value=92  Score=24.13  Aligned_cols=17  Identities=35%  Similarity=0.942  Sum_probs=12.4

Q ss_pred             EecC---CCceecCCCCCCC
Q psy18122         63 LCTS---NFGCSCATGYQGE   79 (188)
Q Consensus        63 ~C~~---~~~C~C~~g~~G~   79 (188)
                      .|..   +|.|.|++||+..
T Consensus       200 ~C~~~~g~~~c~c~~g~~~~  219 (224)
T cd01475         200 VCISTPGSYLCACTEGYALL  219 (224)
T ss_pred             eEEcCCCCEEeECCCCccCC
Confidence            4655   5779999998753


No 76 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=21.62  E-value=1e+02  Score=18.05  Aligned_cols=9  Identities=56%  Similarity=1.483  Sum_probs=6.3

Q ss_pred             ceecCCCCC
Q psy18122         69 GCSCATGYQ   77 (188)
Q Consensus        69 ~C~C~~g~~   77 (188)
                      .|.|++||.
T Consensus        34 gC~C~~G~v   42 (55)
T PF01826_consen   34 GCFCPPGYV   42 (55)
T ss_dssp             EEEETTTEE
T ss_pred             cCCCCCCee
Confidence            388887764


Done!