Query         psy10559
Match_columns 418
No_of_seqs    316 out of 1827
Neff          6.5 
Searched_HMMs 46136
Date          Fri Aug 16 21:30:01 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy10559.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10559hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4659|consensus              100.0 2.2E-55 4.9E-60  473.5  16.9  241  112-417     4-246 (1899)
  2 KOG1219|consensus               99.4 2.9E-13 6.2E-18  154.1   7.4  105    8-136  3865-3978(4289)
  3 KOG4289|consensus               99.2 1.1E-11 2.4E-16  137.4   5.3  107    9-137  1181-1318(2531)
  4 KOG1225|consensus               99.0 4.1E-10 8.8E-15  118.4   8.4  109    9-133   251-365 (525)
  5 KOG4289|consensus               98.9 1.1E-08 2.3E-13  114.4  14.0   71    6-76   1238-1319(2531)
  6 KOG1225|consensus               98.9 1.9E-09 4.2E-14  113.4   7.5   69    2-75    271-343 (525)
  7 KOG1226|consensus               98.5 1.6E-07 3.4E-12  101.2   7.6   96   13-136   515-621 (783)
  8 KOG1219|consensus               98.5 2.1E-07 4.6E-12  107.8   6.7   75   39-137  3865-3940(4289)
  9 KOG1226|consensus               98.4   5E-07 1.1E-11   97.3   6.7   95   13-137   468-582 (783)
 10 KOG1214|consensus               98.2 2.9E-06 6.2E-11   91.8   7.6  106    6-134   733-861 (1289)
 11 KOG1217|consensus               97.6 0.00014 3.1E-09   74.7   8.3  116   15-134   137-306 (487)
 12 KOG4260|consensus               97.6 5.4E-05 1.2E-09   73.1   4.6  111   12-129   150-303 (350)
 13 PF07974 EGF_2:  EGF-like domai  97.4  0.0002 4.3E-09   47.6   3.2   24   13-36      7-32  (32)
 14 KOG4260|consensus               97.3 0.00024 5.1E-09   68.8   4.8  100   27-131   132-269 (350)
 15 KOG1217|consensus               97.3 0.00074 1.6E-08   69.4   8.9  103   18-137   243-355 (487)
 16 PF00008 EGF:  EGF-like domain   97.3  0.0001 2.2E-09   48.9   1.4   28   42-70      3-31  (32)
 17 KOG1214|consensus               97.2  0.0015 3.2E-08   71.5   9.3   47   23-70    809-859 (1289)
 18 smart00179 EGF_CA Calcium-bind  97.0  0.0011 2.4E-08   44.8   3.9   34   39-73      3-39  (39)
 19 cd00054 EGF_CA Calcium-binding  96.5  0.0035 7.6E-08   41.6   3.8   34   39-73      3-38  (38)
 20 smart00179 EGF_CA Calcium-bind  96.3  0.0046 9.9E-08   41.7   3.4   32    6-37      1-39  (39)
 21 PF00008 EGF:  EGF-like domain   96.3  0.0017 3.7E-08   43.0   1.1   30  101-132     2-31  (32)
 22 PF07974 EGF_2:  EGF-like domai  96.2  0.0055 1.2E-07   40.8   3.2   27   43-72      6-32  (32)
 23 PF07645 EGF_CA:  Calcium-bindi  95.9  0.0045 9.8E-08   43.4   1.9   30   38-68      2-34  (42)
 24 KOG1836|consensus               95.9   0.014 2.9E-07   69.9   6.7  110   14-136   684-813 (1705)
 25 smart00181 EGF Epidermal growt  95.8   0.013 2.8E-07   38.7   3.7   28   43-72      6-34  (35)
 26 smart00051 DSL delta serrate l  95.8   0.011 2.4E-07   45.5   3.6   44   24-72     18-63  (63)
 27 PF12661 hEGF:  Human growth fa  95.8  0.0051 1.1E-07   32.7   1.2   13   24-36      1-13  (13)
 28 cd00054 EGF_CA Calcium-binding  95.8   0.012 2.6E-07   38.9   3.4   31    7-37      2-38  (38)
 29 cd00053 EGF Epidermal growth f  95.6   0.016 3.5E-07   37.6   3.6   29   43-72      6-35  (36)
 30 KOG0994|consensus               95.2   0.043 9.3E-07   62.3   7.1   88    2-94    848-964 (1758)
 31 PF12661 hEGF:  Human growth fa  94.6   0.016 3.4E-07   30.8   0.8   13   60-72      1-13  (13)
 32 KOG0994|consensus               94.3   0.052 1.1E-06   61.6   4.9   49   23-74   1037-1099(1758)
 33 PF07645 EGF_CA:  Calcium-bindi  93.6   0.041   9E-07   38.5   1.6   25  103-130    10-34  (42)
 34 PF13715 DUF4480:  Domain of un  93.4    0.56 1.2E-05   37.4   8.2   79  163-244     2-81  (88)
 35 cd00053 EGF Epidermal growth f  93.3   0.088 1.9E-06   34.0   2.9   30  102-134     5-35  (36)
 36 smart00181 EGF Epidermal growt  91.9    0.17 3.7E-06   33.3   2.8   28  103-134     6-34  (35)
 37 PF12947 EGF_3:  EGF domain;  I  91.9   0.082 1.8E-06   36.1   1.2   27   43-70      6-32  (36)
 38 smart00051 DSL delta serrate l  90.7    0.34 7.4E-06   37.2   3.7   47   59-134    17-63  (63)
 39 PF13620 CarboxypepD_reg:  Carb  90.5     1.5 3.2E-05   34.3   7.4   75  163-239     2-81  (82)
 40 KOG1836|consensus               90.3    0.94   2E-05   54.8   8.6   71   13-88    732-822 (1705)
 41 KOG3512|consensus               88.4     1.4 3.1E-05   46.2   7.3  102   23-136   295-429 (592)
 42 PF12947 EGF_3:  EGF domain;  I  86.1    0.34 7.4E-06   33.0   0.9   27  103-132     6-32  (36)
 43 PHA02887 EGF-like protein; Pro  84.2    0.99 2.2E-05   38.8   3.0   31   43-74     92-123 (126)
 44 PF01414 DSL:  Delta serrate li  83.8    0.28 6.1E-06   37.7  -0.4   45   23-72     17-63  (63)
 45 PHA02887 EGF-like protein; Pro  82.4    0.84 1.8E-05   39.3   1.9   33  103-137    92-124 (126)
 46 PHA03099 epidermal growth fact  81.7       1 2.2E-05   39.4   2.1   36  102-139    50-85  (139)
 47 cd03521 Link_domain_KIAA0527_l  81.7    0.32   7E-06   40.0  -0.9   23  267-289     9-34  (95)
 48 KOG1218|consensus               81.5     6.8 0.00015   38.4   8.4   31   21-51    160-194 (316)
 49 KOG3607|consensus               81.2     1.4   3E-05   49.3   3.5   55   13-76    605-659 (716)
 50 PF04863 EGF_alliinase:  Alliin  80.8     0.9   2E-05   33.9   1.3   33   44-76     18-53  (56)
 51 PHA03099 epidermal growth fact  80.3     1.3 2.8E-05   38.7   2.4   32   44-76     52-84  (139)
 52 PF12955 DUF3844:  Domain of un  79.9    0.97 2.1E-05   38.2   1.4   10   66-75     53-62  (103)
 53 PF00053 Laminin_EGF:  Laminin   78.4     1.4 3.1E-05   31.6   1.7   14   60-73     19-32  (49)
 54 KOG3512|consensus               76.8       6 0.00013   41.8   6.3   84   48-146   406-489 (592)
 55 KOG3516|consensus               76.7     1.8 3.9E-05   50.1   2.7   42   38-80    545-589 (1306)
 56 cd00055 EGF_Lam Laminin-type e  76.5       2 4.3E-05   31.1   2.1   15   23-37     19-33  (50)
 57 PF12955 DUF3844:  Domain of un  75.9       2 4.4E-05   36.2   2.2   31   42-72     12-46  (103)
 58 PF06247 Plasmod_Pvs28:  Plasmo  74.7    0.96 2.1E-05   42.1  -0.0   94   13-131    51-161 (197)
 59 KOG3607|consensus               74.4     2.1 4.5E-05   47.9   2.5   33    6-38    624-657 (716)
 60 KOG1218|consensus               72.9      10 0.00023   37.1   6.9   48   23-74     90-139 (316)
 61 PF04863 EGF_alliinase:  Alliin  69.2     3.2   7E-05   31.0   1.7   26   12-37     17-50  (56)
 62 cd00055 EGF_Lam Laminin-type e  68.7     5.4 0.00012   28.8   2.8   16   58-73     18-33  (50)
 63 PF00053 Laminin_EGF:  Laminin   67.2       3 6.5E-05   29.8   1.2   20   18-37     11-32  (49)
 64 PF01683 EB:  EB module;  Inter  67.0     6.7 0.00014   28.3   3.0   41   19-68      6-46  (52)
 65 PF12662 cEGF:  Complement Clr-  65.4     5.6 0.00012   24.7   1.9   11   58-68      1-11  (24)
 66 KOG3514|consensus               64.3     4.1 8.9E-05   46.8   2.1   36   40-76    625-662 (1591)
 67 cd06245 M14_CPD_III The third   60.2      33 0.00073   35.3   7.7   71  161-234   287-357 (363)
 68 PF06247 Plasmod_Pvs28:  Plasmo  60.1     4.6  0.0001   37.7   1.3   84   23-130    20-119 (197)
 69 PF14670 FXa_inhibition:  Coagu  60.0     4.7  0.0001   27.5   1.0   19   49-68     10-28  (36)
 70 cd01475 vWA_Matrilin VWA_Matri  58.4      10 0.00022   35.7   3.5   34   33-69    181-218 (224)
 71 KOG3516|consensus               54.2     9.4  0.0002   44.5   2.7   36   40-76    957-994 (1306)
 72 cd03863 M14_CPD_II The second   53.8      45 0.00098   34.5   7.5   73  160-234   296-369 (375)
 73 PF09064 Tme5_EGF_like:  Thromb  52.0      11 0.00024   25.4   1.7   11   58-68     17-27  (34)
 74 smart00180 EGF_Lam Laminin-typ  50.4      11 0.00024   26.7   1.7   20   18-37     11-32  (46)
 75 PRK15036 hydroxyisourate hydro  50.1      48   0.001   29.4   6.1   70  162-231    28-109 (137)
 76 PF01414 DSL:  Delta serrate li  49.9     7.8 0.00017   29.7   0.9   20   17-36     43-63  (63)
 77 PF01683 EB:  EB module;  Inter  49.6      49  0.0011   23.7   5.1   20   13-32     27-46  (52)
 78 KOG3514|consensus               46.5 1.1E+02  0.0024   35.9   9.3   37  100-139   626-663 (1591)
 79 cd01475 vWA_Matrilin VWA_Matri  45.8      16 0.00035   34.4   2.6   34   93-131   183-218 (224)
 80 cd03515 Link_domain_TSG_6_like  44.6     3.7   8E-05   34.1  -1.7   23  267-289     9-34  (93)
 81 cd03864 M14_CPN Peptidase M14   40.4   1E+02  0.0022   32.2   7.6   71  162-234   317-387 (392)
 82 cd03865 M14_CPE_H Peptidase M1  39.6      78  0.0017   33.2   6.6   71  163-235   328-398 (402)
 83 PF00954 S_locus_glycop:  S-loc  39.5      34 0.00074   28.6   3.3   29   39-69     78-108 (110)
 84 cd03867 M14_CPZ Peptidase M14-  37.8      91   0.002   32.4   6.8   64  161-226   318-381 (395)
 85 PF12946 EGF_MSP1_1:  MSP1 EGF   37.3      22 0.00049   24.5   1.5   27   42-68      4-30  (37)
 86 cd03858 M14_CP_N-E_like Carbox  37.1 1.1E+02  0.0024   31.4   7.3   71  161-233   298-369 (374)
 87 cd03518 Link_domain_HAPLN_modu  36.6     5.8 0.00013   33.1  -1.8   23  267-289     9-34  (95)
 88 cd03516 Link_domain_CD44_like   35.8     6.6 0.00014   35.2  -1.7   29  261-289     4-39  (144)
 89 cd03868 M14_CPD_I The first ca  35.0 1.1E+02  0.0024   31.4   6.9   74  159-234   294-368 (372)
 90 KOG0196|consensus               33.8      45 0.00097   37.9   3.9   44   23-69    259-318 (996)
 91 cd01102 Link_Domain The link d  32.0     7.9 0.00017   32.1  -1.7   23  267-289     9-34  (92)
 92 cd03866 M14_CPM Peptidase M14   29.7 1.8E+02  0.0039   30.1   7.4   70  159-229   293-363 (376)
 93 cd01204 IRS_PTB Insulin recept  29.2      96  0.0021   26.2   4.2   18  395-412    45-62  (104)
 94 PF00818 Ice_nucleation:  Ice n  26.5      26 0.00056   19.7   0.3   13  370-382     2-14  (16)
 95 smart00445 LINK Link (Hyaluron  23.2      12 0.00026   31.1  -2.1   22  268-289    11-35  (94)
 96 PF06339 Ectoine_synth:  Ectoin  21.8      71  0.0015   28.0   2.3   36  338-373    20-55  (126)

No 1  
>KOG4659|consensus
Probab=100.00  E-value=2.2e-55  Score=473.46  Aligned_cols=241  Identities=44%  Similarity=0.738  Sum_probs=220.9

Q ss_pred             ccCCCCCCceeecCCCCccccccceeeeeecCCceecccccCCCCCcccceeeceEEeecCCceeceEEEeeCCCCceEE
Q psy10559        112 VSAPKPIDILLRKQPPAITASFFERMKFLIEESSLQNYAKKDNFNESRSAVVRGRVVTSMGMGLVGVRVSTSTPLEGFTL  191 (418)
Q Consensus       112 ~~~~~~~~~~~C~C~~G~~g~~Ce~~~fli~~~s~~~~~~~~~~~~~~~~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~  191 (418)
                      ...+.|++.....|+ ++...|.++++|||.+.+++.|+....|++.+.+||+|+|++..|+|++||+||+...+.+||+
T Consensus         4 ~~a~~pi~vl~r~~p-~~~a~f~~r~~fli~e~s~q~y~~~~~fne~~~~vIrgrvv~~~~~pLVGVrVS~~~~~~yfTl   82 (1899)
T KOG4659|consen    4 STAASPIEVLMRMPP-IFNANFAQRVKFLIMEKSVQSYTDSSQFNENRISVIRGRVVWGGGVPLVGVRVSDAAHPLYFTL   82 (1899)
T ss_pred             CcccChHHHHhhcCC-ccchhHHHhhheeechhhhcccCchhhhccccceEEeccEeecCCcceEEEEeecccccceEEE
Confidence            333444444444444 4558999999999999999999999999999999999999999999999999999955555999


Q ss_pred             EecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEeccEEEeecCCCCCCC-CCcccccCcCcccCceeeccc
Q psy10559        192 TRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVIIDTITMQMGDDRPVST-TQHACKDHDYDTMKPVVLATW  270 (418)
Q Consensus       192 t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~id~v~m~~~~~~~~~~-~~~~C~~h~~~~~~pv~~~~~  270 (418)
                      ||+||+|||+||||.+++|+|.|+||.++.++||||||++++||+|+|++.+..++-. +.++|+.|.++.|.||++++|
T Consensus        83 TR~DG~FDL~vnGg~svtLqF~R~pF~~qkr~v~vpwnq~i~id~vvm~~~~~~~~~~~~~~~C~~~~~~~pdpvvissw  162 (1899)
T KOG4659|consen   83 TREDGYFDLTVNGGRSVTLQFLRTPFQSQKRSVFVPWNQIIHIDDVVMYRQEGGSPPAPARAKCSPTLRRIPDPVVISSW  162 (1899)
T ss_pred             EecCceEEEEEcccceEEEEEccCCCcccceeEEeChhhEEEEEeEEEEeecCCCCCCCCCCccChhhccCCCcEEecCc
Confidence            9999999999999999999999999999999999999999999999999988885432 468999999999999999999


Q ss_pred             cccccccCCCCCcccchhhhhhcccccccCceeeeeccchhhhcccceeeeecccceeeeeecCccccCCCCCcccCCCC
Q psy10559        271 KHGFQGACPDRSSILAESQFKQLGSKLAIMPTVLITSASIFYQIGRLWQVTLKGKGLVKLLRMGRKYLTPSSNISCASSN  350 (418)
Q Consensus       271 ~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  350 (418)
                      +++|+++||++++|+||+|                                                             
T Consensus       163 ~~tf~ga~pdr~~Iv~esq-------------------------------------------------------------  181 (1899)
T KOG4659|consen  163 QYTFDGACPDRTSIVVESQ-------------------------------------------------------------  181 (1899)
T ss_pred             ccccCCcCCCCcccccchh-------------------------------------------------------------
Confidence            9999999999999999999                                                             


Q ss_pred             CCceeeeeeeeCCCceEEEEecCCCCCCceeEEEeeCCCCCCcccceEEeEEeecccee-EEeecCCC
Q psy10559        351 PPQVVQESFQIPGTGLNLVYHSSRSAGYLSTIQLQLTPQVIPDTLQLRKSLCKDFTHTL-TFYEKRPT  417 (418)
Q Consensus       351 ~~~~~~e~~~i~gt~~~l~y~s~~~~gy~~~~~~~l~~~~ip~~l~~vh~~~~~~~~~~-~~~~~~~~  417 (418)
                         |+|||||||||++||+|+|||++||+|+|+|+||+++||.+|+||||+|++.+|.+ ++|+.||.
T Consensus       182 ---vi~Esl~ipgt~vrLvY~SsRs~gy~St~ki~lt~d~vp~~l~kVHl~i~ieG~lf~r~f~a~~n  246 (1899)
T KOG4659|consen  182 ---VIQESLPIPGTDVRLVYDSSRSPGYPSTMKIGLTYDRVPKELRKVHLNIRIEGRLFDRVFAARPN  246 (1899)
T ss_pred             ---hhhhhccCCCCceEEEEecccCCCcchheEEeecccccccceEEEEEEEEEeeEEeeeeccCCCC
Confidence               99999999999999999999999999999999999999999999999999999876 57888875


No 2  
>KOG1219|consensus
Probab=99.40  E-value=2.9e-13  Score=154.05  Aligned_cols=105  Identities=27%  Similarity=0.643  Sum_probs=93.0

Q ss_pred             cCCC-cCCCCCcEEec-----ceEeeCCCCcCCCCCc--CCC-CCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccccc
Q psy10559          8 DVDL-LWCNEHGQCKN-----GTCLCVTGWNGKHCTL--EGC-PNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQ   78 (418)
Q Consensus         8 d~C~-~pC~n~G~C~~-----g~C~C~~G~~G~~Ce~--~~C-~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~   78 (418)
                      +.|. +||+++|+|..     |.|.|++-|.|.+||+  ..| ++||..+|+| ....++|.|.|+.||+|..|+...  
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtC-ip~~n~f~CnC~~gyTG~~Ce~~G-- 3941 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTC-IPFYNGFLCNCPNGYTGKRCEARG-- 3941 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEE-EecCCCeeEeCCCCccCceeeccc--
Confidence            6785 69999999974     6999999999999994  589 9999999999 555788999999999999999541  


Q ss_pred             cCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCccccccce
Q psy10559         79 NCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFER  136 (418)
Q Consensus        79 ~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~  136 (418)
                                        .+.|+.++|.+++.|++.++   +|.|.|.+||.|..|..
T Consensus      3942 ------------------i~eCs~n~C~~gg~C~n~~g---sf~CncT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3942 ------------------ISECSKNVCGTGGQCINIPG---SFHCNCTPGILGRTCCA 3978 (4289)
T ss_pred             ------------------ccccccccccCCceeeccCC---ceEeccChhHhcccCcc
Confidence                              36778899999999999999   89999999999999864


No 3  
>KOG4289|consensus
Probab=99.20  E-value=1.1e-11  Score=137.41  Aligned_cols=107  Identities=23%  Similarity=0.611  Sum_probs=88.3

Q ss_pred             CCCc-CCCCCcEEec--------------------------ceEeeCCCCcCCCCC--cCCC-CCCCCCCcEEecCCCCC
Q psy10559          9 VDLL-WCNEHGQCKN--------------------------GTCLCVTGWNGKHCT--LEGC-PNSCSNHGQCRVNSDSQ   58 (418)
Q Consensus         9 ~C~~-pC~n~G~C~~--------------------------g~C~C~~G~~G~~Ce--~~~C-~~~C~~~G~C~~~~~g~   58 (418)
                      .|.+ ||.|.-+|..                          ..|.|++||+|..|+  ++.| .++|.+||+| ....|+
T Consensus      1181 iClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C-~srEgg 1259 (2531)
T KOG4289|consen 1181 ICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRC-RSREGG 1259 (2531)
T ss_pred             hhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCCCCCCce-EEecCc
Confidence            4765 8988888842                          279999999999997  7899 9999999999 666899


Q ss_pred             ceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCC-cccccccee
Q psy10559         59 WECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPA-ITASFFERM  137 (418)
Q Consensus        59 ~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G-~~g~~Ce~~  137 (418)
                      |.|.|.+||+|.+|++....             ..      |..+.|.|+++|++...  ..+.|.|+.| |++++|+-.
T Consensus      1260 YtCeCrpg~tGehCEvs~~a-------------gr------CvpGvC~nggtC~~~~n--ggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1260 YTCECRPGFTGEHCEVSARA-------------GR------CVPGVCKNGGTCVNLLN--GGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred             eeEEecCCccccceeeeccc-------------Cc------cccceecCCCEEeecCC--CceeccCCCcccCCCceEEE
Confidence            99999999999999987631             11      34567999999999865  4688999986 788888864


No 4  
>KOG1225|consensus
Probab=99.04  E-value=4.1e-10  Score=118.43  Aligned_cols=109  Identities=34%  Similarity=0.755  Sum_probs=85.5

Q ss_pred             CCCcCCCCCcEEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccc-cccCC-CCccC
Q psy10559          9 VDLLWCNEHGQCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLL-EQNCN-DGKDN   86 (418)
Q Consensus         9 ~C~~pC~n~G~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~-~~~C~-~g~c~   86 (418)
                      .|+..|.++|.|++|+|+|++||+|..|.+..|+..|++|+.| .+  +  .|.|++||.|..|+..- ...|. +|.|.
T Consensus       251 ~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~-~~--g--~CiC~~g~~G~dCs~~~cpadC~g~G~Ci  325 (525)
T KOG1225|consen  251 YCPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVC-VD--G--ECICNPGYSGKDCSIRRCPADCSGHGKCI  325 (525)
T ss_pred             cCCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCcee-cC--C--EeecCCCccccccccccCCccCCCCCccc
Confidence            5778899999999999999999999999988898889999998 33  2  89999999999998665 35564 56666


Q ss_pred             CCCCccccCCC----CCCCCCCCCCCCccccCCCCCCceeecCCCCccccc
Q psy10559         87 DKDGLVDCEDP----ECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASF  133 (418)
Q Consensus        87 ~~~G~~~C~dp----dCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~  133 (418)
                        +|.+.|..-    .|..+. |.+++.|++.        |+|..||.|+.
T Consensus       326 --~G~C~C~~Gy~G~~C~~~~-C~~~g~cv~g--------C~C~~Gw~G~d  365 (525)
T KOG1225|consen  326 --DGECLCDEGYTGELCIQRA-CSGGGQCVNG--------CKCKKGWRGPD  365 (525)
T ss_pred             --CCceEeCCCCcCCcccccc-cCCCceeccC--------ceeccCccCCC
Confidence              555556432    444443 8888877652        68899998866


No 5  
>KOG4289|consensus
Probab=98.93  E-value=1.1e-08  Score=114.38  Aligned_cols=71  Identities=32%  Similarity=0.830  Sum_probs=60.3

Q ss_pred             cccCC-CcCCCCCcEEec----ceEeeCCCCcCCCCCcC----CC-CCCCCCCcEEecCCCCCceEEcCCC-CCCCCCCc
Q psy10559          6 NIDVD-LLWCNEHGQCKN----GTCLCVTGWNGKHCTLE----GC-PNSCSNHGQCRVNSDSQWECKCSDG-WDGKDCSV   74 (418)
Q Consensus         6 ~id~C-~~pC~n~G~C~~----g~C~C~~G~~G~~Ce~~----~C-~~~C~~~G~C~~~~~g~~~C~C~~G-~~G~~C~~   74 (418)
                      .+|.| ..||.|||+|..    |+|+|.+||+|++||++    .| +..|.|+|+|.....|++.|.|+.| |.++.|+.
T Consensus      1238 eiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1238 EIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred             hhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCcccCCCceEE
Confidence            37788 469999999974    79999999999999964    68 9999999999555578899999987 77888976


Q ss_pred             cc
Q psy10559         75 LL   76 (418)
Q Consensus        75 ~~   76 (418)
                      ..
T Consensus      1318 ~t 1319 (2531)
T KOG4289|consen 1318 TT 1319 (2531)
T ss_pred             Ee
Confidence            53


No 6  
>KOG1225|consensus
Probab=98.92  E-value=1.9e-09  Score=113.36  Aligned_cols=69  Identities=39%  Similarity=1.022  Sum_probs=61.3

Q ss_pred             CCccccc----CCCcCCCCCcEEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCcc
Q psy10559          2 RGFSNID----VDLLWCNEHGQCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVL   75 (418)
Q Consensus         2 ~Gf~~id----~C~~pC~n~G~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~   75 (418)
                      .||++.|    .|+..|++|+.|.++.|+|++||.|+.|++..|+..|++||.| .    ...|.|.+||+|..|+..
T Consensus       271 ~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~C-i----~G~C~C~~Gy~G~~C~~~  343 (525)
T KOG1225|consen  271 PGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKC-I----DGECLCDEGYTGELCIQR  343 (525)
T ss_pred             CCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccCCccCCCCCcc-c----CCceEeCCCCcCCccccc
Confidence            4677543    4888899999999999999999999999998999999999999 4    248999999999999976


No 7  
>KOG1226|consensus
Probab=98.53  E-value=1.6e-07  Score=101.18  Aligned_cols=96  Identities=31%  Similarity=0.808  Sum_probs=71.5

Q ss_pred             CCCCCcEEecceEeeCCCCc----CCCCCcC--CCC----CCCCCCcEEecCCCCCceEEcCCCCCCCCCCccccccCCC
Q psy10559         13 WCNEHGQCKNGTCLCVTGWN----GKHCTLE--GCP----NSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCND   82 (418)
Q Consensus        13 pC~n~G~C~~g~C~C~~G~~----G~~Ce~~--~C~----~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~   82 (418)
                      +|+++|.|.-|+|+|++...    |..||.+  .|+    ..|.+||+|..+     +|+|.+||+|..|+-...     
T Consensus       515 vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG-----~CvC~~GwtG~~C~C~~s-----  584 (783)
T KOG1226|consen  515 VCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECG-----RCVCNPGWTGSACNCPLS-----  584 (783)
T ss_pred             CcCCCCcEeCCceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCC-----cEEcCCCCccCCCCCCCC-----
Confidence            79999999999999999988    9999855  453    349999999544     899999999999985431     


Q ss_pred             CccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCC-ccccccce
Q psy10559         83 GKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPA-ITASFFER  136 (418)
Q Consensus        83 g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G-~~g~~Ce~  136 (418)
                              ...|..++   ...|...|.|...       .|+|... |.|.+||.
T Consensus       585 --------td~C~~~~---G~iCSGrG~C~Cg-------~C~C~~~~~sG~~CE~  621 (783)
T KOG1226|consen  585 --------TDTCESSD---GQICSGRGTCECG-------RCKCTDPPYSGEFCEK  621 (783)
T ss_pred             --------CccccCCC---CceeCCCceeeCC-------ceEcCCCCcCcchhhc
Confidence                    12233332   2346677777664       3667654 88988886


No 8  
>KOG1219|consensus
Probab=98.46  E-value=2.1e-07  Score=107.78  Aligned_cols=75  Identities=25%  Similarity=0.615  Sum_probs=65.4

Q ss_pred             CCC-CCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCC
Q psy10559         39 EGC-PNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKP  117 (418)
Q Consensus        39 ~~C-~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~  117 (418)
                      +.| .++|+++|.|.-...++|.|.|++-|.|..|+..+                     +-|..++|.+|++|...++ 
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~---------------------epC~snPC~~GgtCip~~n- 3922 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDL---------------------EPCASNPCLTGGTCIPFYN- 3922 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCccccccc---------------------ccccCCCCCCCCEEEecCC-
Confidence            568 88999999995444688999999999999999766                     3467889999999999888 


Q ss_pred             CCceeecCCCCcccccccee
Q psy10559        118 IDILLRKQPPAITASFFERM  137 (418)
Q Consensus       118 ~~~~~C~C~~G~~g~~Ce~~  137 (418)
                        .|.|.|+.||+|.+||.-
T Consensus      3923 --~f~CnC~~gyTG~~Ce~~ 3940 (4289)
T KOG1219|consen 3923 --GFLCNCPNGYTGKRCEAR 3940 (4289)
T ss_pred             --CeeEeCCCCccCceeecc
Confidence              789999999999999863


No 9  
>KOG1226|consensus
Probab=98.38  E-value=5e-07  Score=97.33  Aligned_cols=95  Identities=28%  Similarity=0.626  Sum_probs=71.8

Q ss_pred             CCCCCcEEecceEeeCCCCcCCCCCc-----------CCC----C-CCCCCCcEEecCCCCCceEEcCCCCC----CCCC
Q psy10559         13 WCNEHGQCKNGTCLCVTGWNGKHCTL-----------EGC----P-NSCSNHGQCRVNSDSQWECKCSDGWD----GKDC   72 (418)
Q Consensus        13 pC~n~G~C~~g~C~C~~G~~G~~Ce~-----------~~C----~-~~C~~~G~C~~~~~g~~~C~C~~G~~----G~~C   72 (418)
                      .|+.+|++.-|.|.|.+||.|..|+-           +.|    . -.|+++|.|..+     +|.|.+...    |..|
T Consensus       468 ~C~g~G~~~CG~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG-----qC~C~~~~~~~i~G~fC  542 (783)
T KOG1226|consen  468 LCHGNGTFVCGQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG-----QCVCHKPDNGKIYGKFC  542 (783)
T ss_pred             ccCCCCcEEecceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC-----ceEecCCCCCceeeeee
Confidence            69999999999999999999999982           346    1 269999999544     899998877    8888


Q ss_pred             CccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccccee
Q psy10559         73 SVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFERM  137 (418)
Q Consensus        73 ~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~  137 (418)
                      +-+.               +.|...+   .-.|.+++.|...       +|.|.+||+|+.|+-.
T Consensus       543 ECDn---------------fsC~r~~---g~lC~g~G~C~CG-------~CvC~~GwtG~~C~C~  582 (783)
T KOG1226|consen  543 ECDN---------------FSCERHK---GVLCGGHGRCECG-------RCVCNPGWTGSACNCP  582 (783)
T ss_pred             eccC---------------ccccccc---CcccCCCCeEeCC-------cEEcCCCCccCCCCCC
Confidence            7322               2222211   2348888888763       5899999999999863


No 10 
>KOG1214|consensus
Probab=98.20  E-value=2.9e-06  Score=91.75  Aligned_cols=106  Identities=24%  Similarity=0.588  Sum_probs=78.4

Q ss_pred             cccCCC---cCCCCCcEEec----ceEeeCCCCc--C--CCCC-------cCCC---CCCCCCCcEE--ecCCCCCceEE
Q psy10559          6 NIDVDL---LWCNEHGQCKN----GTCLCVTGWN--G--KHCT-------LEGC---PNSCSNHGQC--RVNSDSQWECK   62 (418)
Q Consensus         6 ~id~C~---~pC~n~G~C~~----g~C~C~~G~~--G--~~Ce-------~~~C---~~~C~~~G~C--~~~~~g~~~C~   62 (418)
                      ++++|.   ..|..+..|++    ++|+|..||.  +  ..|-       .+.|   ...|.-.|+|  .....+.|.|.
T Consensus       733 d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~  812 (1289)
T KOG1214|consen  733 DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCA  812 (1289)
T ss_pred             ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEe
Confidence            455664   47999999986    6888888774  4  4564       2356   2457655544  33346779999


Q ss_pred             cCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccc
Q psy10559         63 CSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFF  134 (418)
Q Consensus        63 C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~C  134 (418)
                      |.+||.|+.                    ..|.+.|.|..+.|...+.|.+.++   ++.|+|.+||.|.-.
T Consensus       813 CLPGfsGDG--------------------~~c~dvDeC~psrChp~A~Cyntpg---sfsC~C~pGy~GDGf  861 (1289)
T KOG1214|consen  813 CLPGFSGDG--------------------HQCTDVDECSPSRCHPAATCYNTPG---SFSCRCQPGYYGDGF  861 (1289)
T ss_pred             ecCCccCCc--------------------cccccccccCccccCCCceEecCCC---cceeecccCccCCCc
Confidence            999999832                    4456678888889999999999997   899999999988543


No 11 
>KOG1217|consensus
Probab=97.65  E-value=0.00014  Score=74.69  Aligned_cols=116  Identities=28%  Similarity=0.700  Sum_probs=83.5

Q ss_pred             CCCcEEe-------cceEeeCCCCcCCCCCc--CCC---CCCCCCCcEEecCCCCCceEEcCCCCCCCCCCcc-------
Q psy10559         15 NEHGQCK-------NGTCLCVTGWNGKHCTL--EGC---PNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVL-------   75 (418)
Q Consensus        15 ~n~G~C~-------~g~C~C~~G~~G~~Ce~--~~C---~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~-------   75 (418)
                      ..++.|.       .+.|.|..||.|..|..  +.|   ...|.+++.| .+..+.|.|.|.+||.|..|+..       
T Consensus       137 ~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C-~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~  215 (487)
T KOG1217|consen  137 CIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTC-VNTGGSYLCSCPPGYTGSTCETTGNGGTCV  215 (487)
T ss_pred             eCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCccc-ccCCCCeeEeCCCCccCCcCcCCCCCceEe
Confidence            4556665       25799999999999984  478   3459999999 67677799999999999988764       


Q ss_pred             ------------cc------ccCCC--CccCCCCCccccC--------------CCCCCCCC-CCCCCCccccCCCCCCc
Q psy10559         76 ------------LE------QNCND--GKDNDKDGLVDCE--------------DPECCSNH-ICRSSQLCVSAPKPIDI  120 (418)
Q Consensus        76 ------------~~------~~C~~--g~c~~~~G~~~C~--------------dpdCc~~~-~C~~~~~C~~~~~~~~~  120 (418)
                                  ..      ..|..  +.|.+..+.+.|.              +.+-|... .|.+++.|.+..+   .
T Consensus       216 ~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~---~  292 (487)
T KOG1217|consen  216 DSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPG---S  292 (487)
T ss_pred             cceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCC---c
Confidence                        10      01211  4555555554443              33434443 3888999999877   4


Q ss_pred             eeecCCCCcccccc
Q psy10559        121 LLRKQPPAITASFF  134 (418)
Q Consensus       121 ~~C~C~~G~~g~~C  134 (418)
                      |.|.|++||+|..|
T Consensus       293 ~~C~C~~g~~g~~~  306 (487)
T KOG1217|consen  293 YRCTCPPGFTGRLC  306 (487)
T ss_pred             ceeeCCCCCCCCCC
Confidence            88999999999887


No 12 
>KOG4260|consensus
Probab=97.64  E-value=5.4e-05  Score=73.14  Aligned_cols=111  Identities=25%  Similarity=0.646  Sum_probs=66.3

Q ss_pred             cCCCCCcEEec-------ceEeeCCCCcCCCCCc-----------------CCCCCCCCCCcEEecCCCCCceEE-cCCC
Q psy10559         12 LWCNEHGQCKN-------GTCLCVTGWNGKHCTL-----------------EGCPNSCSNHGQCRVNSDSQWECK-CSDG   66 (418)
Q Consensus        12 ~pC~n~G~C~~-------g~C~C~~G~~G~~Ce~-----------------~~C~~~C~~~G~C~~~~~g~~~C~-C~~G   66 (418)
                      .+|.++|.|..       |.|.|.+||+|+.|..                 ..|...|.  |.| .. .+.-.|. |..|
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~C-sg-~~~k~C~kCkkG  225 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL--GVC-SG-ESSKGCSKCKKG  225 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--ccc-CC-CCCCChhhhccc
Confidence            48999999962       7999999999999962                 12334454  366 22 3344575 9999


Q ss_pred             CCCC--CCCccc-----cccCC-CCccCCCCCccccCC-------CCCCCC--CCCC-CCCccccCCCCCCceeecCCCC
Q psy10559         67 WDGK--DCSVLL-----EQNCN-DGKDNDKDGLVDCED-------PECCSN--HICR-SSQLCVSAPKPIDILLRKQPPA  128 (418)
Q Consensus        67 ~~G~--~C~~~~-----~~~C~-~g~c~~~~G~~~C~d-------pdCc~~--~~C~-~~~~C~~~~~~~~~~~C~C~~G  128 (418)
                      |.-.  .|-...     +.+|. +..|.|..|++.|.+       +|.|..  ..|. .+..|.+..+   .|.|.|..|
T Consensus       226 W~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~---~~r~v~f~~  302 (350)
T KOG4260|consen  226 WKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDG---QYRCVCFSG  302 (350)
T ss_pred             ceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCc---cEEEEeccc
Confidence            9633  232111     13343 456777777777776       444432  2232 3455666555   566666655


Q ss_pred             c
Q psy10559        129 I  129 (418)
Q Consensus       129 ~  129 (418)
                      +
T Consensus       303 ~  303 (350)
T KOG4260|consen  303 L  303 (350)
T ss_pred             c
Confidence            4


No 13 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.35  E-value=0.0002  Score=47.64  Aligned_cols=24  Identities=42%  Similarity=1.239  Sum_probs=16.0

Q ss_pred             CCCCCcEEe--cceEeeCCCCcCCCC
Q psy10559         13 WCNEHGQCK--NGTCLCVTGWNGKHC   36 (418)
Q Consensus        13 pC~n~G~C~--~g~C~C~~G~~G~~C   36 (418)
                      .|++||+|+  .++|+|.+||+|+.|
T Consensus         7 ~C~~~G~C~~~~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSPCGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence            466677776  467777777776665


No 14 
>KOG4260|consensus
Probab=97.35  E-value=0.00024  Score=68.82  Aligned_cols=100  Identities=23%  Similarity=0.516  Sum_probs=71.3

Q ss_pred             eCCCCcCCCCCcCCC----CCCCCCCcEEecCC--CCCceEEcCCCCCCCCCCcccc--------------cc----CC-
Q psy10559         27 CVTGWNGKHCTLEGC----PNSCSNHGQCRVNS--DSQWECKCSDGWDGKDCSVLLE--------------QN----CN-   81 (418)
Q Consensus        27 C~~G~~G~~Ce~~~C----~~~C~~~G~C~~~~--~g~~~C~C~~G~~G~~C~~~~~--------------~~----C~-   81 (418)
                      |+.|-.|+.|..  |    ..+|.++|.|.-+.  .|++.|.|..||+|+.|..-..              ..    |. 
T Consensus       132 Cp~gtyGpdCl~--Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~  209 (350)
T KOG4260|consen  132 CPDGTYGPDCLQ--CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLG  209 (350)
T ss_pred             cCCCCcCCcccc--CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhc
Confidence            899999999963  5    45799999994332  5778999999999999864431              11    21 


Q ss_pred             ---C------CccCCCCC--ccccCCCCCCC--CCCCCCCCccccCCCCCCceeecCCCCccc
Q psy10559         82 ---D------GKDNDKDG--LVDCEDPECCS--NHICRSSQLCVSAPKPIDILLRKQPPAITA  131 (418)
Q Consensus        82 ---~------g~c~~~~G--~~~C~dpdCc~--~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g  131 (418)
                         .      ..|..++-  .-.|.|.++|.  .++|..+..|+|..+   +|.|.+.+||.+
T Consensus       210 ~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteG---Sf~C~dk~Gy~~  269 (350)
T KOG4260|consen  210 VCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEG---SFKCEDKEGYKK  269 (350)
T ss_pred             ccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCC---ceEecccccccC
Confidence               1      12222222  23467777764  566999999999988   899999999866


No 15 
>KOG1217|consensus
Probab=97.34  E-value=0.00074  Score=69.38  Aligned_cols=103  Identities=29%  Similarity=0.676  Sum_probs=71.0

Q ss_pred             cEEec----ceEeeCCCCcCCCC----CcCCC-CC-CCCCCcEEecCCCCCceEEcCCCCCCCCCCccccccCCCCccCC
Q psy10559         18 GQCKN----GTCLCVTGWNGKHC----TLEGC-PN-SCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCNDGKDND   87 (418)
Q Consensus        18 G~C~~----g~C~C~~G~~G~~C----e~~~C-~~-~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~   87 (418)
                      ++|.+    ++|.|.+||.+..+    .++.| .. .|.++++| .+..+.|.|.|++||.|..| ..    |.+     
T Consensus       243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C-~~~~~~~~C~C~~g~~g~~~-~~----~~~-----  311 (487)
T KOG1217|consen  243 GTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTC-VNVPGSYRCTCPPGFTGRLC-TE----CVD-----  311 (487)
T ss_pred             CcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCee-ecCCCcceeeCCCCCCCCCC-cc----ccc-----
Confidence            78875    59999999999985    46788 43 39999999 66667799999999999998 11    100     


Q ss_pred             CCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccccee
Q psy10559         88 KDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFERM  137 (418)
Q Consensus        88 ~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~  137 (418)
                         ...|...  .....|.+++.|.. .+....+.|.|.++|.|..|+..
T Consensus       312 ---~~~C~~~--~~~~~c~~g~~C~~-~~~~~~~~C~c~~~~~g~~C~~~  355 (487)
T KOG1217|consen  312 ---VDECSPR--NAGGPCANGGTCNT-LGSFGGFRCACGPGFTGRRCEDS  355 (487)
T ss_pred             ---ccccccc--ccCCcCCCCccccc-CCCCCCCCcCCCCCCCCCccccC
Confidence               0011000  02344777778822 22234677999999999988864


No 16 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.31  E-value=0.0001  Score=48.92  Aligned_cols=28  Identities=36%  Similarity=1.036  Sum_probs=20.6

Q ss_pred             CCCCCCCcEEecCCC-CCceEEcCCCCCCC
Q psy10559         42 PNSCSNHGQCRVNSD-SQWECKCSDGWDGK   70 (418)
Q Consensus        42 ~~~C~~~G~C~~~~~-g~~~C~C~~G~~G~   70 (418)
                      +++|.++|+| ++.. +.|.|.|++||+|+
T Consensus         3 ~~~C~n~g~C-~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    3 SNPCQNGGTC-IDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTSSTTTEEE-EEESTSEEEEEEBTTEEST
T ss_pred             CCcCCCCeEE-EeCCCCCEEeECCCCCccC
Confidence            5678888888 4444 77888888888875


No 17 
>KOG1214|consensus
Probab=97.19  E-value=0.0015  Score=71.47  Aligned_cols=47  Identities=30%  Similarity=1.029  Sum_probs=41.7

Q ss_pred             ceEeeCCCCcCC--CC-CcCCC-CCCCCCCcEEecCCCCCceEEcCCCCCCC
Q psy10559         23 GTCLCVTGWNGK--HC-TLEGC-PNSCSNHGQCRVNSDSQWECKCSDGWDGK   70 (418)
Q Consensus        23 g~C~C~~G~~G~--~C-e~~~C-~~~C~~~G~C~~~~~g~~~C~C~~G~~G~   70 (418)
                      ++|.|.+||.|+  .| ..++| ++-|...++| .+..|++.|.|.+||.|+
T Consensus       809 y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~C-yntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  809 YSCACLPGFSGDGHQCTDVDECSPSRCHPAATC-YNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             EEEeecCCccCCccccccccccCccccCCCceE-ecCCCcceeecccCccCC
Confidence            699999999975  44 47899 9999999999 788999999999999875


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.97  E-value=0.0011  Score=44.77  Aligned_cols=34  Identities=38%  Similarity=1.164  Sum_probs=25.9

Q ss_pred             CCC-C-CCCCCCcEEecCCCCCceEEcCCCCC-CCCCC
Q psy10559         39 EGC-P-NSCSNHGQCRVNSDSQWECKCSDGWD-GKDCS   73 (418)
Q Consensus        39 ~~C-~-~~C~~~G~C~~~~~g~~~C~C~~G~~-G~~C~   73 (418)
                      ++| . .+|.++|.| .+..++|.|.|+.||. |..|+
T Consensus         3 ~~C~~~~~C~~~~~C-~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        3 DECASGNPCQNGGTC-VNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             ccCcCCCCcCCCCEe-ECCCCCeEeECCCCCccCCcCC
Confidence            456 3 578888888 6777888888888888 77764


No 19 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.54  E-value=0.0035  Score=41.63  Aligned_cols=34  Identities=38%  Similarity=1.154  Sum_probs=25.2

Q ss_pred             CCC-C-CCCCCCcEEecCCCCCceEEcCCCCCCCCCC
Q psy10559         39 EGC-P-NSCSNHGQCRVNSDSQWECKCSDGWDGKDCS   73 (418)
Q Consensus        39 ~~C-~-~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~   73 (418)
                      ++| . .+|.+++.| .+..+.|.|.|..||.|..|+
T Consensus         3 ~~C~~~~~C~~~~~C-~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           3 DECASGNPCQNGGTC-VNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ccCCCCCCcCCCCEe-ECCCCCeEeECCCCCcCCcCC
Confidence            456 3 578778888 566777888888888887663


No 20 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.33  E-value=0.0046  Score=41.65  Aligned_cols=32  Identities=34%  Similarity=0.790  Sum_probs=27.0

Q ss_pred             cccCCC--cCCCCCcEEec----ceEeeCCCCc-CCCCC
Q psy10559          6 NIDVDL--LWCNEHGQCKN----GTCLCVTGWN-GKHCT   37 (418)
Q Consensus         6 ~id~C~--~pC~n~G~C~~----g~C~C~~G~~-G~~Ce   37 (418)
                      ++++|.  .+|.++|+|.+    +.|.|.+||. |..|+
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            367886  58999999986    5899999999 98885


No 21 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.32  E-value=0.0017  Score=43.05  Aligned_cols=30  Identities=23%  Similarity=0.442  Sum_probs=25.7

Q ss_pred             CCCCCCCCCccccCCCCCCceeecCCCCcccc
Q psy10559        101 SNHICRSSQLCVSAPKPIDILLRKQPPAITAS  132 (418)
Q Consensus       101 ~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~  132 (418)
                      ...+|.++++|+...  .+.|.|.|++||+|.
T Consensus         2 ~~~~C~n~g~C~~~~--~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    2 SSNPCQNGGTCIDLP--GGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTSSTTTEEEEEES--TSEEEEEEBTTEEST
T ss_pred             CCCcCCCCeEEEeCC--CCCEEeECCCCCccC
Confidence            456899999999988  348999999999984


No 22 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.22  E-value=0.0055  Score=40.75  Aligned_cols=27  Identities=48%  Similarity=1.160  Sum_probs=22.1

Q ss_pred             CCCCCCcEEecCCCCCceEEcCCCCCCCCC
Q psy10559         43 NSCSNHGQCRVNSDSQWECKCSDGWDGKDC   72 (418)
Q Consensus        43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C   72 (418)
                      ..|++||+| ...  ...|.|.+||+|+.|
T Consensus         6 ~~C~~~G~C-~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTC-VSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEE-eCC--CCEEECCCCCcCCCC
Confidence            359999999 432  459999999999887


No 23 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.94  E-value=0.0045  Score=43.44  Aligned_cols=30  Identities=33%  Similarity=1.226  Sum_probs=25.9

Q ss_pred             cCCC---CCCCCCCcEEecCCCCCceEEcCCCCC
Q psy10559         38 LEGC---PNSCSNHGQCRVNSDSQWECKCSDGWD   68 (418)
Q Consensus        38 ~~~C---~~~C~~~G~C~~~~~g~~~C~C~~G~~   68 (418)
                      +++|   +..|..++.| ++..|+|.|.|++||.
T Consensus         2 idEC~~~~~~C~~~~~C-~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTC-VNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSTTTTTSSSSSTTSEE-EEETTEEEEEESTTEE
T ss_pred             ccccCCCCCcCCCCCEE-EcCCCCEEeeCCCCcE
Confidence            4677   4579889999 8889999999999997


No 24 
>KOG1836|consensus
Probab=95.90  E-value=0.014  Score=69.89  Aligned_cols=110  Identities=27%  Similarity=0.578  Sum_probs=75.5

Q ss_pred             CCCCcEEec--ceEeeCCCCcCCCCCc-------------C--CC-CCCCCCC-cEEecCCCCCceEEcCCCCCCCCCCc
Q psy10559         14 CNEHGQCKN--GTCLCVTGWNGKHCTL-------------E--GC-PNSCSNH-GQCRVNSDSQWECKCSDGWDGKDCSV   74 (418)
Q Consensus        14 C~n~G~C~~--g~C~C~~G~~G~~Ce~-------------~--~C-~~~C~~~-G~C~~~~~g~~~C~C~~G~~G~~C~~   74 (418)
                      +...+.+.+  ..|.|+.||+|..|+.             +  .| +..|++| ..|+   ..++.|.|...-.|..|+ 
T Consensus       684 ~~~~~~~~~wve~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd---~~tG~C~C~~~t~G~~C~-  759 (1705)
T KOG1836|consen  684 GDGAGATDTWVEQCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICD---PRTGQCKCKHNTFGGQCA-  759 (1705)
T ss_pred             CCCCCCccchhhhccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCcccccc---CCCCceecccCCCCCchh-
Confidence            444455533  4699999999999982             1  23 5567776 6784   345689999999999997 


Q ss_pred             cccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeec-CCCCccccccce
Q psy10559         75 LLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRK-QPPAITASFFER  136 (418)
Q Consensus        75 ~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~-C~~G~~g~~Ce~  136 (418)
                          .|.+|.-++-+....   .| |...+|.+++.|..... .....|+ |++||+|.+|+.
T Consensus       760 ----~C~~GfYg~~~~~~~---~d-C~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gytG~rCe~  813 (1705)
T KOG1836|consen  760 ----QCVDGFYGLPDLGTS---GD-CQPCPCPNGGACGQTPE-ILEVVCKNCPPGYTGLRCEE  813 (1705)
T ss_pred             ----hhcCCCCCccccCCC---CC-CccCCCCCChhhcCcCc-ccceecCCCCCCCccccccc
Confidence                466665543322111   12 45567778888887765 4456788 999999999985


No 25 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.84  E-value=0.013  Score=38.75  Aligned_cols=28  Identities=39%  Similarity=1.132  Sum_probs=22.8

Q ss_pred             CCCCCCcEEecCCCCCceEEcCCCCCC-CCC
Q psy10559         43 NSCSNHGQCRVNSDSQWECKCSDGWDG-KDC   72 (418)
Q Consensus        43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G-~~C   72 (418)
                      .+|.++ .| .+..+.|.|.|+.||.| ..|
T Consensus         6 ~~C~~~-~C-~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TC-INTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EE-ECCCCCeEeECCCCCccCCcc
Confidence            478877 89 56688899999999999 665


No 26 
>smart00051 DSL delta serrate ligand.
Probab=95.80  E-value=0.011  Score=45.50  Aligned_cols=44  Identities=25%  Similarity=0.634  Sum_probs=34.8

Q ss_pred             eEeeCCCCcCCCCCcCCC--CCCCCCCcEEecCCCCCceEEcCCCCCCCCC
Q psy10559         24 TCLCVTGWNGKHCTLEGC--PNSCSNHGQCRVNSDSQWECKCSDGWDGKDC   72 (418)
Q Consensus        24 ~C~C~~G~~G~~Ce~~~C--~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C   72 (418)
                      .-.|.++|.|..|+. .|  .+.+.+|.+|+ .   .+.|.|.+||+|+.|
T Consensus        18 rv~C~~~~yG~~C~~-~C~~~~d~~~~~~Cd-~---~G~~~C~~Gw~G~~C   63 (63)
T smart00051       18 RVTCDENYYGEGCNK-FCRPRDDFFGHYTCD-E---NGNKGCLEGWMGPYC   63 (63)
T ss_pred             EeeCCCCCcCCccCC-EeCcCccccCCccCC-c---CCCEecCCCCcCCCC
Confidence            456999999999973 45  45678899993 2   357999999999887


No 27 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.79  E-value=0.0051  Score=32.69  Aligned_cols=13  Identities=54%  Similarity=1.592  Sum_probs=7.6

Q ss_pred             eEeeCCCCcCCCC
Q psy10559         24 TCLCVTGWNGKHC   36 (418)
Q Consensus        24 ~C~C~~G~~G~~C   36 (418)
                      .|+|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            3666666666665


No 28 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.76  E-value=0.012  Score=38.94  Aligned_cols=31  Identities=35%  Similarity=0.800  Sum_probs=26.2

Q ss_pred             ccCCC--cCCCCCcEEec----ceEeeCCCCcCCCCC
Q psy10559          7 IDVDL--LWCNEHGQCKN----GTCLCVTGWNGKHCT   37 (418)
Q Consensus         7 id~C~--~pC~n~G~C~~----g~C~C~~G~~G~~Ce   37 (418)
                      +++|.  .+|.+++.|.+    +.|.|.+||.|..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            57786  48999999986    589999999998874


No 29 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.64  E-value=0.016  Score=37.64  Aligned_cols=29  Identities=45%  Similarity=1.174  Sum_probs=22.6

Q ss_pred             CCCCCCcEEecCCCCCceEEcCCCCCCC-CC
Q psy10559         43 NSCSNHGQCRVNSDSQWECKCSDGWDGK-DC   72 (418)
Q Consensus        43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G~-~C   72 (418)
                      .+|.+++.| .+..+.|.|.|+.||.|. .|
T Consensus         6 ~~C~~~~~C-~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           6 NPCSNGGTC-VNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCEE-ecCCCCeEeECCCCCcccCCc
Confidence            567778888 566678889999998887 54


No 30 
>KOG0994|consensus
Probab=95.24  E-value=0.043  Score=62.28  Aligned_cols=88  Identities=27%  Similarity=0.747  Sum_probs=50.8

Q ss_pred             CCcccccCCCc-CCCCCc-EEec--ceEe-eCCCCcCCCCCc--------------CCC-CCCCCCCc--------EEec
Q psy10559          2 RGFSNIDVDLL-WCNEHG-QCKN--GTCL-CVTGWNGKHCTL--------------EGC-PNSCSNHG--------QCRV   53 (418)
Q Consensus         2 ~Gf~~id~C~~-pC~n~G-~C~~--g~C~-C~~G~~G~~Ce~--------------~~C-~~~C~~~G--------~C~~   53 (418)
                      -||++.-+|.. .|+.|. +|..  |.|. |..--+|.+|+.              ..| |.+|..+-        .|..
T Consensus       848 pG~WgFPeCr~CqCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~  927 (1758)
T KOG0994|consen  848 PGYWGFPECRPCQCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYL  927 (1758)
T ss_pred             CCccCCCcCccccccCcccccCccccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccc
Confidence            47888777754 676663 3432  4443 444444555541              256 66675431        4644


Q ss_pred             CC-CCCceEEcCCCCCCCCCCccccccCCCCccCCCCCcccc
Q psy10559         54 NS-DSQWECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDC   94 (418)
Q Consensus        54 ~~-~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C   94 (418)
                      +. .....|.|.+||+|..|+.     |.++.-.+-..+..|
T Consensus       928 d~~t~~ivC~C~~GY~G~RCe~-----CA~~~fGnP~~GGtC  964 (1758)
T KOG0994|consen  928 DTRTQQIVCHCQEGYSGSRCEI-----CADNHFGNPSEGGTC  964 (1758)
T ss_pred             cccccceeeecccCccccchhh-----hcccccCCcccCCcc
Confidence            43 3455799999999999984     555544443333344


No 31 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.56  E-value=0.016  Score=30.82  Aligned_cols=13  Identities=46%  Similarity=1.653  Sum_probs=10.8

Q ss_pred             eEEcCCCCCCCCC
Q psy10559         60 ECKCSDGWDGKDC   72 (418)
Q Consensus        60 ~C~C~~G~~G~~C   72 (418)
                      .|.|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4999999999987


No 32 
>KOG0994|consensus
Probab=94.34  E-value=0.052  Score=61.65  Aligned_cols=49  Identities=37%  Similarity=0.998  Sum_probs=34.2

Q ss_pred             ceEeeCCCCcCCCCCc-----------CCC-CCCCC--CCcEEecCCCCCceEEcCCCCCCCCCCc
Q psy10559         23 GTCLCVTGWNGKHCTL-----------EGC-PNSCS--NHGQCRVNSDSQWECKCSDGWDGKDCSV   74 (418)
Q Consensus        23 g~C~C~~G~~G~~Ce~-----------~~C-~~~C~--~~G~C~~~~~g~~~C~C~~G~~G~~C~~   74 (418)
                      |+|-|.+.-.|..|+.           .+| +..|.  +.-+|. .  =.++|.|.+||-|..|+.
T Consensus      1037 GQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN-~--ftGQCqCkpGfGGR~C~q 1099 (1758)
T KOG0994|consen 1037 GQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCN-E--FTGQCQCKPGFGGRTCSQ 1099 (1758)
T ss_pred             CcCCCCcccccccccccccchhccccCCCCCccCCCccCCcccc-c--cccceeccCCCCCcchhH
Confidence            8999999988988862           245 44454  233662 2  234899999999999973


No 33 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=93.59  E-value=0.041  Score=38.50  Aligned_cols=25  Identities=24%  Similarity=0.453  Sum_probs=22.7

Q ss_pred             CCCCCCCccccCCCCCCceeecCCCCcc
Q psy10559        103 HICRSSQLCVSAPKPIDILLRKQPPAIT  130 (418)
Q Consensus       103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~  130 (418)
                      ..|..++.|++..+   +|.|.|++||.
T Consensus        10 ~~C~~~~~C~N~~G---sy~C~C~~Gy~   34 (42)
T PF07645_consen   10 HNCPENGTCVNTEG---SYSCSCPPGYE   34 (42)
T ss_dssp             SSSSTTSEEEEETT---EEEEEESTTEE
T ss_pred             CcCCCCCEEEcCCC---CEEeeCCCCcE
Confidence            45988999999999   99999999996


No 34 
>PF13715 DUF4480:  Domain of unknown function (DUF4480)
Probab=93.35  E-value=0.56  Score=37.40  Aligned_cols=79  Identities=23%  Similarity=0.309  Sum_probs=58.7

Q ss_pred             eeceEEeec-CCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEeccEEEee
Q psy10559        163 VRGRVVTSM-GMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVIIDTITMQM  241 (418)
Q Consensus       163 i~G~v~~~~-Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~id~v~m~~  241 (418)
                      ++|.|++.. +.|+.++.+...... -++.|..+|.|.+-+.-|. .++.|.-..|......+-...++...+ ++.|.+
T Consensus         2 i~G~V~d~~t~~pl~~a~V~~~~~~-~~~~Td~~G~F~i~~~~g~-~~l~is~~Gy~~~~~~i~~~~~~~~~~-~i~L~~   78 (88)
T PF13715_consen    2 ISGKVVDSDTGEPLPGATVYLKNTK-KGTVTDENGRFSIKLPEGD-YTLKISYIGYETKTITISVNSNKNTNL-NIYLEP   78 (88)
T ss_pred             EEEEEEECCCCCCccCeEEEEeCCc-ceEEECCCeEEEEEEcCCC-eEEEEEEeCEEEEEEEEEecCCCEEEE-EEEEee
Confidence            579999888 999999999887543 5689999999998765443 667777778888777777766554444 677765


Q ss_pred             cCC
Q psy10559        242 GDD  244 (418)
Q Consensus       242 ~~~  244 (418)
                      ...
T Consensus        79 ~~~   81 (88)
T PF13715_consen   79 KSN   81 (88)
T ss_pred             Ccc
Confidence            443


No 35 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=93.33  E-value=0.088  Score=34.03  Aligned_cols=30  Identities=23%  Similarity=0.416  Sum_probs=24.5

Q ss_pred             CCCCCCCCccccCCCCCCceeecCCCCcccc-cc
Q psy10559        102 NHICRSSQLCVSAPKPIDILLRKQPPAITAS-FF  134 (418)
Q Consensus       102 ~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~-~C  134 (418)
                      ..+|.+++.|.+..+   .+.|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~---~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPG---SYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCC---CeEeECCCCCcccCCc
Confidence            456888899998875   7899999999986 44


No 36 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=91.91  E-value=0.17  Score=33.26  Aligned_cols=28  Identities=21%  Similarity=0.434  Sum_probs=22.7

Q ss_pred             CCCCCCCccccCCCCCCceeecCCCCccc-ccc
Q psy10559        103 HICRSSQLCVSAPKPIDILLRKQPPAITA-SFF  134 (418)
Q Consensus       103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~g-~~C  134 (418)
                      .+|.++ .|.+..+   .|.|.|++||.| ..|
T Consensus         6 ~~C~~~-~C~~~~~---~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPG---SYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCC---CeEeECCCCCccCCcc
Confidence            468877 8988755   899999999998 554


No 37 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=91.87  E-value=0.082  Score=36.05  Aligned_cols=27  Identities=26%  Similarity=0.955  Sum_probs=20.1

Q ss_pred             CCCCCCcEEecCCCCCceEEcCCCCCCC
Q psy10559         43 NSCSNHGQCRVNSDSQWECKCSDGWDGK   70 (418)
Q Consensus        43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G~   70 (418)
                      ..|..++.| .+..++|.|.|.+||.|+
T Consensus         6 ~~C~~nA~C-~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATC-TNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEE-EE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEe-ecCCCCEEeECCCCCccC
Confidence            358889999 677789999999999875


No 38 
>smart00051 DSL delta serrate ligand.
Probab=90.71  E-value=0.34  Score=37.24  Aligned_cols=47  Identities=11%  Similarity=0.301  Sum_probs=30.7

Q ss_pred             ceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccc
Q psy10559         59 WECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFF  134 (418)
Q Consensus        59 ~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~C  134 (418)
                      +.-.|.++|.|..|+..                  |.     ....+..+..|... +     .|.|.+||+|+.|
T Consensus        17 ~rv~C~~~~yG~~C~~~------------------C~-----~~~d~~~~~~Cd~~-G-----~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYYGEGCNKF------------------CR-----PRDDFFGHYTCDEN-G-----NKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCcCCccCCE------------------eC-----cCccccCCccCCcC-C-----CEecCCCCcCCCC
Confidence            35579999999999732                  21     12234556666442 3     3789999999765


No 39 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=90.48  E-value=1.5  Score=34.27  Aligned_cols=75  Identities=24%  Similarity=0.362  Sum_probs=48.0

Q ss_pred             eeceEEeecCCceeceEEEeeCCCC---ceEEEecCCceeee-ecCCceEEEEEecCCCCCccc-eeeeeccceEEeccE
Q psy10559        163 VRGRVVTSMGMGLVGVRVSTSTPLE---GFTLTRDDGWFDLL-VNGGGAVTLQFGRSPFKPHNH-IVHVPWNEVVIIDTI  237 (418)
Q Consensus       163 i~G~v~~~~Gtp~~Gv~vs~~~~~~---G~t~t~~dg~fdl~-vNGG~~~~l~f~r~Pf~~~~~-~V~vpwn~~~~id~v  237 (418)
                      ++|+|.+..|.|+.|+.|.......   ..+.+..+|.|.+- +-- +.-.+.+....|.+... .|-+..++...+ ++
T Consensus         2 I~G~V~d~~g~pv~~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~-g~Y~l~v~~~g~~~~~~~~v~v~~~~~~~~-~i   79 (82)
T PF13620_consen    2 ISGTVTDATGQPVPGATVTLTDQDGGTVYTTTTDSDGRFSFEGLPP-GTYTLRVSAPGYQPQTQENVTVTAGQTTTV-DI   79 (82)
T ss_dssp             EEEEEEETTSCBHTT-EEEET--TTTECCEEE--TTSEEEEEEE-S-EEEEEEEEBTTEE-EEEEEEEESSSSEEE---E
T ss_pred             EEEEEEcCCCCCcCCEEEEEEEeeCCCEEEEEECCCceEEEEccCC-EeEEEEEEECCcceEEEEEEEEeCCCEEEE-EE
Confidence            6799999999999999999884322   66788889999774 333 45666677777777664 466666666554 44


Q ss_pred             EE
Q psy10559        238 TM  239 (418)
Q Consensus       238 ~m  239 (418)
                      .|
T Consensus        80 ~L   81 (82)
T PF13620_consen   80 TL   81 (82)
T ss_dssp             EE
T ss_pred             EE
Confidence            44


No 40 
>KOG1836|consensus
Probab=90.28  E-value=0.94  Score=54.84  Aligned_cols=71  Identities=34%  Similarity=0.806  Sum_probs=49.7

Q ss_pred             CCCCC-cEEec--ceEeeCCCCcCCCCCc--------------CCC-CCCCCCCcEEecCC-CCCceEE-cCCCCCCCCC
Q psy10559         13 WCNEH-GQCKN--GTCLCVTGWNGKHCTL--------------EGC-PNSCSNHGQCRVNS-DSQWECK-CSDGWDGKDC   72 (418)
Q Consensus        13 pC~n~-G~C~~--g~C~C~~G~~G~~Ce~--------------~~C-~~~C~~~G~C~~~~-~g~~~C~-C~~G~~G~~C   72 (418)
                      .|++| .+|..  |.|.|..--.|.+|+.              ..| +.+|.+.+.|.... .....|. |++||+|..|
T Consensus       732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC  811 (1705)
T KOG1836|consen  732 DCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRC  811 (1705)
T ss_pred             ccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence            45554 45653  7777777777777752              138 88899888884332 4667899 9999999999


Q ss_pred             CccccccCCCCccCCC
Q psy10559         73 SVLLEQNCNDGKDNDK   88 (418)
Q Consensus        73 ~~~~~~~C~~g~c~~~   88 (418)
                      +     .|.+|+-.+.
T Consensus       812 e-----~c~dgyfg~p  822 (1705)
T KOG1836|consen  812 E-----ECADGYFGNP  822 (1705)
T ss_pred             c-----cCCCccccCC
Confidence            8     4666655543


No 41 
>KOG3512|consensus
Probab=88.44  E-value=1.4  Score=46.22  Aligned_cols=102  Identities=24%  Similarity=0.511  Sum_probs=59.3

Q ss_pred             ceEeeCCCCcCCCCCc------------------CCC-CCCCCCCcE-EecCC--------CCCceEE-cCCCCCCCCCC
Q psy10559         23 GTCLCVTGWNGKHCTL------------------EGC-PNSCSNHGQ-CRVNS--------DSQWECK-CSDGWDGKDCS   73 (418)
Q Consensus        23 g~C~C~~G~~G~~Ce~------------------~~C-~~~C~~~G~-C~~~~--------~g~~~C~-C~~G~~G~~C~   73 (418)
                      .+|.|..+-+|+.|+.                  ++| .+.|..|+. |..+.        ..+..|. |.-.-.|..|.
T Consensus       295 ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhCh  374 (592)
T KOG3512|consen  295 LTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHCH  374 (592)
T ss_pred             eEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecccCCCCcccc
Confidence            4899999999999872                  366 667776653 42221        1223453 66666777774


Q ss_pred             ccccccCCCCccCCCCCc----cccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCccccccce
Q psy10559         74 VLLEQNCNDGKDNDKDGL----VDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFER  136 (418)
Q Consensus        74 ~~~~~~C~~g~c~~~~G~----~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~  136 (418)
                           -|..|+-.|..-.    -.|.  .|-++..=..+-+|....+     +|.|.+|.+|..|++
T Consensus       375 -----yCreGyyRd~s~pl~hrkaCk--~CdChpVGs~gktCNq~tG-----qCpCkeGvtG~tCnr  429 (592)
T KOG3512|consen  375 -----YCREGYYRDGSKPLTHRKACK--ACDCHPVGSAGKTCNQTTG-----QCPCKEGVTGLTCNR  429 (592)
T ss_pred             -----cccCccccCCCCCCchhhhhh--hcCCcccccccccccccCC-----cccCCCCCccccccc
Confidence                 4555554443211    1122  2222333344556665555     688888888888775


No 42 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=86.13  E-value=0.34  Score=32.99  Aligned_cols=27  Identities=11%  Similarity=0.204  Sum_probs=21.0

Q ss_pred             CCCCCCCccccCCCCCCceeecCCCCcccc
Q psy10559        103 HICRSSQLCVSAPKPIDILLRKQPPAITAS  132 (418)
Q Consensus       103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~  132 (418)
                      ..|..++.|.+.++   .+.|.|.+||.|.
T Consensus         6 ~~C~~nA~C~~~~~---~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGG---SYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TT---SEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCC---CEEeECCCCCccC
Confidence            45889999999988   8999999999884


No 43 
>PHA02887 EGF-like protein; Provisional
Probab=84.16  E-value=0.99  Score=38.83  Aligned_cols=31  Identities=26%  Similarity=0.667  Sum_probs=23.2

Q ss_pred             CCCCCCcEEecC-CCCCceEEcCCCCCCCCCCc
Q psy10559         43 NSCSNHGQCRVN-SDSQWECKCSDGWDGKDCSV   74 (418)
Q Consensus        43 ~~C~~~G~C~~~-~~g~~~C~C~~G~~G~~C~~   74 (418)
                      +.|- ||+|..- ......|.|+.||+|..|+.
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence            4577 6899432 24567899999999999984


No 44 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=83.79  E-value=0.28  Score=37.68  Aligned_cols=45  Identities=29%  Similarity=0.718  Sum_probs=24.2

Q ss_pred             ceEeeCCCCcCCCCCcCCC-CC-CCCCCcEEecCCCCCceEEcCCCCCCCCC
Q psy10559         23 GTCLCVTGWNGKHCTLEGC-PN-SCSNHGQCRVNSDSQWECKCSDGWDGKDC   72 (418)
Q Consensus        23 g~C~C~~G~~G~~Ce~~~C-~~-~C~~~G~C~~~~~g~~~C~C~~G~~G~~C   72 (418)
                      ..-.|.+.|-|+.|.. -| +. .-.+|-+|+  .  .+.-.|.+||+|+.|
T Consensus        17 ~rv~C~~nyyG~~C~~-~C~~~~d~~ghy~Cd--~--~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYYGPNCSK-FCKPRDDSFGHYTCD--S--NGNKVCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEETTTT-E-E---EEETTEEEEE---S--S--EEE-TTEESTTS
T ss_pred             EEEECCCCCCCccccC-CcCCCcCCcCCcccC--C--CCCCCCCCCCcCCCC
Confidence            4678999999999974 35 32 235677883  2  346789999999987


No 45 
>PHA02887 EGF-like protein; Provisional
Probab=82.44  E-value=0.84  Score=39.27  Aligned_cols=33  Identities=12%  Similarity=0.173  Sum_probs=26.6

Q ss_pred             CCCCCCCccccCCCCCCceeecCCCCcccccccee
Q psy10559        103 HICRSSQLCVSAPKPIDILLRKQPPAITASFFERM  137 (418)
Q Consensus       103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~  137 (418)
                      +.|. +|.|....+ .+...|.|+.||+|.+|+.+
T Consensus        92 ~YCi-HG~C~yI~d-L~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         92 DFCI-NGECMNIID-LDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             CEee-CCEEEcccc-CCCceeECCCCcccCCCCcc
Confidence            4577 468988776 55678999999999999975


No 46 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=81.66  E-value=1  Score=39.43  Aligned_cols=36  Identities=17%  Similarity=0.257  Sum_probs=29.0

Q ss_pred             CCCCCCCCccccCCCCCCceeecCCCCccccccceeee
Q psy10559        102 NHICRSSQLCVSAPKPIDILLRKQPPAITASFFERMKF  139 (418)
Q Consensus       102 ~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~~f  139 (418)
                      .+.|.+| .|...++ .+.+.|.|..||+|.+||....
T Consensus        50 ~~YClHG-~C~yI~d-l~~~~CrC~~GYtGeRCEh~dL   85 (139)
T PHA03099         50 DGYCLHG-DCIHARD-IDGMYCRCSHGYTGIRCQHVVL   85 (139)
T ss_pred             CCEeECC-EEEeecc-CCCceeECCCCcccccccceee
Confidence            4558876 7988776 5677899999999999998643


No 47 
>cd03521 Link_domain_KIAA0527_like Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, it is highly similar to the link domain. The link domain is a hyaluronan-binding (HA) domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.
Probab=81.66  E-value=0.32  Score=40.00  Aligned_cols=23  Identities=17%  Similarity=0.294  Sum_probs=19.3

Q ss_pred             eccccccc---cccCCCCCcccchhh
Q psy10559        267 LATWKHGF---QGACPDRSSILAESQ  289 (418)
Q Consensus       267 ~~~~~~~~---~~~~~~~~~~~~~~q  289 (418)
                      .+.|+++|   ..+|.++||.||+..
T Consensus         9 ~G~y~l~f~eA~~AC~~~gA~lAs~~   34 (95)
T cd03521           9 NGSQGLGLRAARQSCASLGARLASAA   34 (95)
T ss_pred             CCccccCHHHHHHHHHHcCCEeccHH
Confidence            46789999   578999999999754


No 48 
>KOG1218|consensus
Probab=81.52  E-value=6.8  Score=38.40  Aligned_cols=31  Identities=42%  Similarity=1.074  Sum_probs=16.6

Q ss_pred             ecceEeeCCCCcCCCCCcC--CC--CCCCCCCcEE
Q psy10559         21 KNGTCLCVTGWNGKHCTLE--GC--PNSCSNHGQC   51 (418)
Q Consensus        21 ~~g~C~C~~G~~G~~Ce~~--~C--~~~C~~~G~C   51 (418)
                      .++.|.|.+||.|.+|...  .|  ...|.+++.|
T Consensus       160 ~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C  194 (316)
T KOG1218|consen  160 KNGICTCQPGFVGVFCVESCSGCSPLTACENGAKC  194 (316)
T ss_pred             CCCceeccCCcccccccccCCCcCCCcccCCCCee
Confidence            3456666666666666532  14  2344554566


No 49 
>KOG3607|consensus
Probab=81.23  E-value=1.4  Score=49.29  Aligned_cols=55  Identities=29%  Similarity=0.791  Sum_probs=37.1

Q ss_pred             CCCCCcEEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccc
Q psy10559         13 WCNEHGQCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLL   76 (418)
Q Consensus        13 pC~n~G~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~   76 (418)
                      .|..+-.|.+.+|.=.. ..+..    .|+..|++||.|    ++...|.|.+||.++.|+...
T Consensus       605 ~Cg~~~vC~~~~C~~~~-v~~~~----~~~~~C~g~GVC----nn~~~ChC~~gwapp~C~~~~  659 (716)
T KOG3607|consen  605 SCGPGMICINHRCLSAS-VLNSS----CCPTTCNGHGVC----NNELNCHCEPGWAPPFCFIFG  659 (716)
T ss_pred             ccCCCceecCCcchhhh-hhccc----ccccccCCCccc----CCCcceeeCCCCCCCcccccc
Confidence            35555556666554222 22222    356779999999    345689999999999999654


No 50 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=80.79  E-value=0.9  Score=33.85  Aligned_cols=33  Identities=39%  Similarity=0.904  Sum_probs=15.8

Q ss_pred             CCCCCcEEecCC---CCCceEEcCCCCCCCCCCccc
Q psy10559         44 SCSNHGQCRVNS---DSQWECKCSDGWDGKDCSVLL   76 (418)
Q Consensus        44 ~C~~~G~C~~~~---~g~~~C~C~~G~~G~~C~~~~   76 (418)
                      .|++||+.-++.   .|...|.|..-|.|++|+..+
T Consensus        18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred             CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence            577777764332   344567777777887777655


No 51 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.32  E-value=1.3  Score=38.71  Aligned_cols=32  Identities=28%  Similarity=0.756  Sum_probs=19.4

Q ss_pred             CCCCCcEEec-CCCCCceEEcCCCCCCCCCCccc
Q psy10559         44 SCSNHGQCRV-NSDSQWECKCSDGWDGKDCSVLL   76 (418)
Q Consensus        44 ~C~~~G~C~~-~~~g~~~C~C~~G~~G~~C~~~~   76 (418)
                      .|.+ |.|.. .....+.|.|..||+|.+|+...
T Consensus        52 YClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh~d   84 (139)
T PHA03099         52 YCLH-GDCIHARDIDGMYCRCSHGYTGIRCQHVV   84 (139)
T ss_pred             EeEC-CEEEeeccCCCceeECCCCccccccccee
Confidence            3553 46632 12355678888888888887543


No 52 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=79.85  E-value=0.97  Score=38.19  Aligned_cols=10  Identities=30%  Similarity=0.883  Sum_probs=7.3

Q ss_pred             CCCCCCCCcc
Q psy10559         66 GWDGKDCSVL   75 (418)
Q Consensus        66 G~~G~~C~~~   75 (418)
                      .|.|..|+..
T Consensus        53 ~W~G~aCqKk   62 (103)
T PF12955_consen   53 HWGGPACQKK   62 (103)
T ss_pred             eecccccccc
Confidence            5788888754


No 53 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=78.40  E-value=1.4  Score=31.58  Aligned_cols=14  Identities=36%  Similarity=1.028  Sum_probs=6.7

Q ss_pred             eEEcCCCCCCCCCC
Q psy10559         60 ECKCSDGWDGKDCS   73 (418)
Q Consensus        60 ~C~C~~G~~G~~C~   73 (418)
                      .|.|.++|+|+.|+
T Consensus        19 ~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   19 QCVCKPGTTGPRCD   32 (49)
T ss_dssp             EESBSTTEESTTS-
T ss_pred             EEeccccccCCcCc
Confidence            45555555555554


No 54 
>KOG3512|consensus
Probab=76.75  E-value=6  Score=41.78  Aligned_cols=84  Identities=15%  Similarity=0.302  Sum_probs=44.8

Q ss_pred             CcEEecCCCCCceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCC
Q psy10559         48 HGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPP  127 (418)
Q Consensus        48 ~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~  127 (418)
                      +-+|.   ..+++|.|.+|-+|..|+.     |..|+-....-..-|.-.+=-....|.++..       .+.+.+.|++
T Consensus       406 gktCN---q~tGqCpCkeGvtG~tCnr-----Ca~gyqqsrs~vapcik~p~~~~~~~~s~ve-------~qd~~s~Ck~  470 (592)
T KOG3512|consen  406 GKTCN---QTTGQCPCKEGVTGLTCNR-----CAPGYQQSRSPVAPCIKIPTDAPTLGSSGVE-------PQDQCSKCKA  470 (592)
T ss_pred             ccccc---ccCCcccCCCCCccccccc-----ccchhhcccCCCcCceecCCCCccccCCCCc-------chhccccCCC
Confidence            44672   3456999999999999974     3333221111111111000000111333222       2234568999


Q ss_pred             CccccccceeeeeecCCce
Q psy10559        128 AITASFFERMKFLIEESSL  146 (418)
Q Consensus       128 G~~g~~Ce~~~fli~~~s~  146 (418)
                      ++.|..+++.+|-..+..+
T Consensus       471 ~~~~~r~n~kkfc~~Dyav  489 (592)
T KOG3512|consen  471 SPGGKRLNQKKFCKKDYAV  489 (592)
T ss_pred             CCcceeccccccCccccce
Confidence            9999888887777665544


No 55 
>KOG3516|consensus
Probab=76.75  E-value=1.8  Score=50.07  Aligned_cols=42  Identities=36%  Similarity=0.997  Sum_probs=34.5

Q ss_pred             cCCC-CCCCCCCcEEecCCCCCceEEcC-CCCCCCCCCccc-cccC
Q psy10559         38 LEGC-PNSCSNHGQCRVNSDSQWECKCS-DGWDGKDCSVLL-EQNC   80 (418)
Q Consensus        38 ~~~C-~~~C~~~G~C~~~~~g~~~C~C~-~G~~G~~C~~~~-~~~C   80 (418)
                      .+.| ||+|..+|.|.. +-..+.|.|. .||.|..|...+ +..|
T Consensus       545 ~drClPN~CehgG~C~Q-s~~~f~C~C~~TGY~GatCHtsi~e~SC  589 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQ-SWDDFECNCELTGYKGATCHTSIYELSC  589 (1306)
T ss_pred             ccccCCccccCCCcccc-cccceeEeccccccccccccCCCcchhh
Confidence            4678 999999999944 5677899998 899999999776 5555


No 56 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=76.54  E-value=2  Score=31.07  Aligned_cols=15  Identities=33%  Similarity=0.977  Sum_probs=8.3

Q ss_pred             ceEeeCCCCcCCCCC
Q psy10559         23 GTCLCVTGWNGKHCT   37 (418)
Q Consensus        23 g~C~C~~G~~G~~Ce   37 (418)
                      |+|.|.++|.|.+|+
T Consensus        19 G~C~C~~~~~G~~C~   33 (50)
T cd00055          19 GQCECKPNTTGRRCD   33 (50)
T ss_pred             CEEeCCCcCCCCCCC
Confidence            555555555555553


No 57 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=75.92  E-value=2  Score=36.25  Aligned_cols=31  Identities=29%  Similarity=0.628  Sum_probs=19.3

Q ss_pred             CCCCCCCcEEecCC----CCCceEEcCCCCCCCCC
Q psy10559         42 PNSCSNHGQCRVNS----DSQWECKCSDGWDGKDC   72 (418)
Q Consensus        42 ~~~C~~~G~C~~~~----~g~~~C~C~~G~~G~~C   72 (418)
                      .+.|++||.|....    ..=|.|.|.+.+.....
T Consensus        12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~   46 (103)
T PF12955_consen   12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGS   46 (103)
T ss_pred             ccCCCCCceEeeccCCCccceEEEEeecccccccc
Confidence            46688888883221    23477888887665433


No 58 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=74.66  E-value=0.96  Score=42.12  Aligned_cols=94  Identities=20%  Similarity=0.509  Sum_probs=56.0

Q ss_pred             CCCCCcEEec---------ceEeeCCCCc--CCCCCcCCC-CCCCCCCcEEecCC--CCCceEEcCCCCC---CCCCCcc
Q psy10559         13 WCNEHGQCKN---------GTCLCVTGWN--GKHCTLEGC-PNSCSNHGQCRVNS--DSQWECKCSDGWD---GKDCSVL   75 (418)
Q Consensus        13 pC~n~G~C~~---------g~C~C~~G~~--G~~Ce~~~C-~~~C~~~G~C~~~~--~g~~~C~C~~G~~---G~~C~~~   75 (418)
                      +|.+-++|.+         +.|.|.+||.  ...|-.+.| ...|. .|.|.++.  .....|+|.-|..   ...|...
T Consensus        51 ~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~  129 (197)
T PF06247_consen   51 PCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKT  129 (197)
T ss_dssp             EEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEE
T ss_pred             cccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCC
Confidence            6888889964         5899999996  345655667 44566 78995543  2345899998876   2223322


Q ss_pred             ccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCccc
Q psy10559         76 LEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITA  131 (418)
Q Consensus        76 ~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g  131 (418)
                      .              ...|       .-.|..+..|....+   .|.|.|.+|+.+
T Consensus       130 G--------------~T~C-------~LKCk~nE~CK~~~~---~Y~C~~~~~~~~  161 (197)
T PF06247_consen  130 G--------------ETKC-------SLKCKENEECKLVDG---YYKCVCKEGFPG  161 (197)
T ss_dssp             E-----------------------------TTTEEEEEETT---EEEEEE-TT-EE
T ss_pred             C--------------ccce-------eeecCCCcceeeeCc---EEEeecCCCCCC
Confidence            2              2222       234777888988877   899999999866


No 59 
>KOG3607|consensus
Probab=74.45  E-value=2.1  Score=47.87  Aligned_cols=33  Identities=36%  Similarity=0.804  Sum_probs=28.4

Q ss_pred             cccCCCcCCCCCcEEec-ceEeeCCCCcCCCCCc
Q psy10559          6 NIDVDLLWCNEHGQCKN-GTCLCVTGWNGKHCTL   38 (418)
Q Consensus         6 ~id~C~~pC~n~G~C~~-g~C~C~~G~~G~~Ce~   38 (418)
                      +...|+..|++||.|.+ ..|.|.+||.+.+|++
T Consensus       624 ~~~~~~~~C~g~GVCnn~~~ChC~~gwapp~C~~  657 (716)
T KOG3607|consen  624 NSSCCPTTCNGHGVCNNELNCHCEPGWAPPFCFI  657 (716)
T ss_pred             cccccccccCCCcccCCCcceeeCCCCCCCcccc
Confidence            34567778999999987 7999999999999985


No 60 
>KOG1218|consensus
Probab=72.93  E-value=10  Score=37.08  Aligned_cols=48  Identities=31%  Similarity=0.946  Sum_probs=30.4

Q ss_pred             ceEee-CCCCcCCCCCcC-CCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCc
Q psy10559         23 GTCLC-VTGWNGKHCTLE-GCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSV   74 (418)
Q Consensus        23 g~C~C-~~G~~G~~Ce~~-~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~   74 (418)
                      .+..| ..+|.|..|+.+ .|...|.. -+| .+...  .|.|..+|.+..|..
T Consensus        90 ~~~~~~~~~~~g~~C~~~~~~~~~c~~-~~C-~~~~~--~c~~~~~~~~~~C~~  139 (316)
T KOG1218|consen   90 STGYCHLNGYEGPQCESPCPCGDGCAE-KTC-ANPRR--ECRCGGGYIGEQCGE  139 (316)
T ss_pred             CCCcccCCCCCcccccCCCCcCCcccc-ccc-CCCcc--ceecCCcCccccccc
Confidence            34455 688999999742 34222544 566 33211  688888999888875


No 61 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=69.17  E-value=3.2  Score=30.98  Aligned_cols=26  Identities=31%  Similarity=0.930  Sum_probs=15.5

Q ss_pred             cCCCCCcEEe-c-----c--eEeeCCCCcCCCCC
Q psy10559         12 LWCNEHGQCK-N-----G--TCLCVTGWNGKHCT   37 (418)
Q Consensus        12 ~pC~n~G~C~-~-----g--~C~C~~G~~G~~Ce   37 (418)
                      .+|++||+.. +     |  .|+|+.-|.|++|.
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS   50 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS   50 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence            3799999885 3     3  79999999999997


No 62 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=68.68  E-value=5.4  Score=28.78  Aligned_cols=16  Identities=25%  Similarity=0.817  Sum_probs=14.5

Q ss_pred             CceEEcCCCCCCCCCC
Q psy10559         58 QWECKCSDGWDGKDCS   73 (418)
Q Consensus        58 ~~~C~C~~G~~G~~C~   73 (418)
                      ..+|.|.++|.|..|+
T Consensus        18 ~G~C~C~~~~~G~~C~   33 (50)
T cd00055          18 TGQCECKPNTTGRRCD   33 (50)
T ss_pred             CCEEeCCCcCCCCCCC
Confidence            4589999999999997


No 63 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=67.25  E-value=3  Score=29.84  Aligned_cols=20  Identities=35%  Similarity=1.049  Sum_probs=16.6

Q ss_pred             cEEe--cceEeeCCCCcCCCCC
Q psy10559         18 GQCK--NGTCLCVTGWNGKHCT   37 (418)
Q Consensus        18 G~C~--~g~C~C~~G~~G~~Ce   37 (418)
                      ..|.  +|+|.|.++|+|.+|+
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   11 QTCDPSTGQCVCKPGTTGPRCD   32 (49)
T ss_dssp             SSEEETCEEESBSTTEESTTS-
T ss_pred             CcccCCCCEEeccccccCCcCc
Confidence            3786  4899999999999996


No 64 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=67.02  E-value=6.7  Score=28.34  Aligned_cols=41  Identities=29%  Similarity=0.708  Sum_probs=22.1

Q ss_pred             EEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCC
Q psy10559         19 QCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWD   68 (418)
Q Consensus        19 ~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~   68 (418)
                      ..+++.|. +....|..|+.+   ..|..+..| .+    ..|.|++||.
T Consensus         6 ~~~~~~C~-~~~~~g~~C~~~---~qC~~~s~C-~~----g~C~C~~g~~   46 (52)
T PF01683_consen    6 VAINGQCV-PRVQPGESCESD---EQCIGGSVC-VN----GRCQCPPGYV   46 (52)
T ss_pred             EEECCEEC-ccCCCCCCCCCc---CCCCCcCEE-cC----CEeECCCCCE
Confidence            33445553 224556667642   334466777 32    2677777763


No 65 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=65.39  E-value=5.6  Score=24.73  Aligned_cols=11  Identities=27%  Similarity=1.168  Sum_probs=7.6

Q ss_pred             CceEEcCCCCC
Q psy10559         58 QWECKCSDGWD   68 (418)
Q Consensus        58 ~~~C~C~~G~~   68 (418)
                      +|.|.|++||.
T Consensus         1 sy~C~C~~Gy~   11 (24)
T PF12662_consen    1 SYTCSCPPGYQ   11 (24)
T ss_pred             CEEeeCCCCCc
Confidence            46777777775


No 66 
>KOG3514|consensus
Probab=64.29  E-value=4.1  Score=46.80  Aligned_cols=36  Identities=33%  Similarity=0.900  Sum_probs=30.5

Q ss_pred             CC-CCCCCCCcEEecCCCCCceEEcC-CCCCCCCCCccc
Q psy10559         40 GC-PNSCSNHGQCRVNSDSQWECKCS-DGWDGKDCSVLL   76 (418)
Q Consensus        40 ~C-~~~C~~~G~C~~~~~g~~~C~C~-~G~~G~~C~~~~   76 (418)
                      .| ++||.|+|.| ...-+.+.|.|. .||.|+.|+...
T Consensus       625 ~C~~nPC~N~g~C-~egwNrfiCDCs~T~~~G~~CerE~  662 (1591)
T KOG3514|consen  625 ICESNPCQNGGKC-SEGWNRFICDCSGTGFEGRTCEREA  662 (1591)
T ss_pred             ccCCCcccCCCCc-cccccccccccccCcccCcccccee
Confidence            68 8999999999 555678899997 689999998764


No 67 
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=60.15  E-value=33  Score=35.29  Aligned_cols=71  Identities=18%  Similarity=0.286  Sum_probs=54.0

Q ss_pred             ceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEe
Q psy10559        161 AVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVII  234 (418)
Q Consensus       161 ~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~i  234 (418)
                      --++|.|.+..|.|+.|+.|......  -+.|..+|.|.+.+--|. -++.+.-..|+++...|-|.-++...+
T Consensus       287 ~gI~G~V~d~~g~pi~~A~V~v~g~~--~~~T~~~G~y~~~L~pG~-y~v~vs~~Gy~~~~~~V~v~~~~~~~~  357 (363)
T cd06245         287 KGVHGVVTDKAGKPISGATIVLNGGH--RVYTKEGGYFHVLLAPGQ-HNINVIAEGYQQEHLPVVVSHDEASSV  357 (363)
T ss_pred             cEEEEEEEcCCCCCccceEEEEeCCC--ceEeCCCcEEEEecCCce-EEEEEEEeCceeEEEEEEEcCCCeEEE
Confidence            45689999989999999999988533  366889999988655543 455667777888888888877776655


No 68 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=60.08  E-value=4.6  Score=37.70  Aligned_cols=84  Identities=21%  Similarity=0.510  Sum_probs=48.4

Q ss_pred             ceEeeCCCCc---CCCCC-cCCC------CCCCCCCcEEecCC----CCCceEEcCCCCCCC--CCCccccccCCCCccC
Q psy10559         23 GTCLCVTGWN---GKHCT-LEGC------PNSCSNHGQCRVNS----DSQWECKCSDGWDGK--DCSVLLEQNCNDGKDN   86 (418)
Q Consensus        23 g~C~C~~G~~---G~~Ce-~~~C------~~~C~~~G~C~~~~----~g~~~C~C~~G~~G~--~C~~~~~~~C~~g~c~   86 (418)
                      +.|.|.+||.   -..|+ ...|      ..+|...+.|....    ...|.|.|..||.-.  .|-..   .       
T Consensus        20 fEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~---~-------   89 (197)
T PF06247_consen   20 FECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPN---K-------   89 (197)
T ss_dssp             EEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEG---G-------
T ss_pred             eEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchh---h-------
Confidence            6899999995   55676 3456      23688899994332    367899999999632  23211   1       


Q ss_pred             CCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcc
Q psy10559         87 DKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAIT  130 (418)
Q Consensus        87 ~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~  130 (418)
                             |      ..-.|. .|.|+-.+.......|+|.-|+.
T Consensus        90 -------C------~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV  119 (197)
T PF06247_consen   90 -------C------NNKDCG-SGKCILDPDNPNNPTCSCNIGKV  119 (197)
T ss_dssp             -------G------SS---T-TEEEEEEEGGGSEEEEEE-TEEE
T ss_pred             -------c------CceecC-CCeEEecCCCCCCceeEeeeceE
Confidence                   2      222343 56677666544455788887764


No 69 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=60.02  E-value=4.7  Score=27.47  Aligned_cols=19  Identities=32%  Similarity=0.969  Sum_probs=14.4

Q ss_pred             cEEecCCCCCceEEcCCCCC
Q psy10559         49 GQCRVNSDSQWECKCSDGWD   68 (418)
Q Consensus        49 G~C~~~~~g~~~C~C~~G~~   68 (418)
                      -.| ++..++|+|.|++||.
T Consensus        10 h~C-~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   10 HIC-VNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             SEE-EEETTSEEEE-STTEE
T ss_pred             CCC-ccCCCceEeECCCCCE
Confidence            367 5668899999999985


No 70 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=58.44  E-value=10  Score=35.73  Aligned_cols=34  Identities=26%  Similarity=0.856  Sum_probs=24.6

Q ss_pred             CCCCC-cCCC---CCCCCCCcEEecCCCCCceEEcCCCCCC
Q psy10559         33 GKHCT-LEGC---PNSCSNHGQCRVNSDSQWECKCSDGWDG   69 (418)
Q Consensus        33 G~~Ce-~~~C---~~~C~~~G~C~~~~~g~~~C~C~~G~~G   69 (418)
                      +..|+ .++|   +..|.  ..| .+..|+|.|.|.+||+.
T Consensus       181 ~~~C~~~~~C~~~~~~c~--~~C-~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         181 GKICVVPDLCATLSHVCQ--QVC-ISTPGSYLCACTEGYAL  218 (224)
T ss_pred             cccCcCchhhcCCCCCcc--ceE-EcCCCCEEeECCCCccC
Confidence            45564 4567   23465  479 78899999999999975


No 71 
>KOG3516|consensus
Probab=54.17  E-value=9.4  Score=44.47  Aligned_cols=36  Identities=36%  Similarity=0.869  Sum_probs=29.8

Q ss_pred             CC-CCCCCCCcEEecCCCCCceEEcC-CCCCCCCCCccc
Q psy10559         40 GC-PNSCSNHGQCRVNSDSQWECKCS-DGWDGKDCSVLL   76 (418)
Q Consensus        40 ~C-~~~C~~~G~C~~~~~g~~~C~C~-~G~~G~~C~~~~   76 (418)
                      .| +.+|.|+|+| +..-.+|.|.|. ..|.|+.|...+
T Consensus       957 hCss~~C~NGG~C-very~gytCDCs~Tay~Gp~Cs~ei  994 (1306)
T KOG3516|consen  957 HCSSYPCLNGGHC-VERYDGYTCDCSRTAYDGPFCSKEI  994 (1306)
T ss_pred             ccccccccCCCEE-EEecCceeeccccCcCCCCcccccc
Confidence            46 6789999999 555678999997 679999998765


No 72 
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of  Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=53.80  E-value=45  Score=34.53  Aligned_cols=73  Identities=22%  Similarity=0.256  Sum_probs=54.7

Q ss_pred             cceeeceEEee-cCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEe
Q psy10559        160 SAVVRGRVVTS-MGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVII  234 (418)
Q Consensus       160 ~~vi~G~v~~~-~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~i  234 (418)
                      .--++|.|.+. .|.|+.|+.|++.....| +.|..||.|-+.+--| .-++++.-..|+++...|-|.-++.+.+
T Consensus       296 ~~gI~G~V~D~~~g~pl~~AtV~V~g~~~~-~~Td~~G~f~~~l~pG-~ytl~vs~~GY~~~~~~v~V~~~~~~~~  369 (375)
T cd03863         296 HRGVRGFVLDATDGRGILNATISVADINHP-VTTYKDGDYWRLLVPG-TYKVTASARGYDPVTKTVEVDSKGAVQV  369 (375)
T ss_pred             cCeEEEEEEeCCCCCCCCCeEEEEecCcCc-eEECCCccEEEccCCe-eEEEEEEEcCcccEEEEEEEcCCCcEEE
Confidence            34678999986 699999999999753334 5578899998866665 4667777788888877777776666544


No 73 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=51.98  E-value=11  Score=25.40  Aligned_cols=11  Identities=27%  Similarity=0.824  Sum_probs=7.2

Q ss_pred             CceEEcCCCCC
Q psy10559         58 QWECKCSDGWD   68 (418)
Q Consensus        58 ~~~C~C~~G~~   68 (418)
                      .++|.|++||.
T Consensus        17 ~~~C~CPeGyI   27 (34)
T PF09064_consen   17 PGQCFCPEGYI   27 (34)
T ss_pred             CCceeCCCceE
Confidence            34677777774


No 74 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=50.44  E-value=11  Score=26.74  Aligned_cols=20  Identities=35%  Similarity=1.066  Sum_probs=16.6

Q ss_pred             cEEe--cceEeeCCCCcCCCCC
Q psy10559         18 GQCK--NGTCLCVTGWNGKHCT   37 (418)
Q Consensus        18 G~C~--~g~C~C~~G~~G~~Ce   37 (418)
                      ..|.  +|+|.|.++|+|.+|+
T Consensus        11 ~~C~~~~G~C~C~~~~~G~~C~   32 (46)
T smart00180       11 GTCDPDTGQCECKPNVTGRRCD   32 (46)
T ss_pred             CcccCCCCEEECCCCCCCCCCC
Confidence            3564  4899999999999996


No 75 
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=50.15  E-value=48  Score=29.41  Aligned_cols=70  Identities=17%  Similarity=0.169  Sum_probs=46.5

Q ss_pred             eeeceEEee-cCCceeceEEEeeCC-CC-----ceEEEecCCceeeeecC----CceEEEEEecCCCCCc-cceeeeecc
Q psy10559        162 VVRGRVVTS-MGMGLVGVRVSTSTP-LE-----GFTLTRDDGWFDLLVNG----GGAVTLQFGRSPFKPH-NHIVHVPWN  229 (418)
Q Consensus       162 vi~G~v~~~-~Gtp~~Gv~vs~~~~-~~-----G~t~t~~dg~fdl~vNG----G~~~~l~f~r~Pf~~~-~~~V~vpwn  229 (418)
                      .+++.|++. .|+|..||.|..... ..     +.+.|..||.|..+..+    -+.-.+.|..++|-.. ...-+.|+-
T Consensus        28 ~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~~~Td~dGR~~~l~~~~~~~~G~Y~L~F~t~~Yf~~~~~~~F~p~v  107 (137)
T PRK15036         28 ILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTATTGDYRVVFKTGDYFKKQNLESFFPEI  107 (137)
T ss_pred             CeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEEEEECCCCCCccccCcccCCCeeEEEEEEcchhhhccCCCCCccee
Confidence            588888865 889999999998843 22     55788899999753221    2456777888886653 222334443


Q ss_pred             ce
Q psy10559        230 EV  231 (418)
Q Consensus       230 ~~  231 (418)
                      ++
T Consensus       108 ~v  109 (137)
T PRK15036        108 PV  109 (137)
T ss_pred             EE
Confidence            33


No 76 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=49.89  E-value=7.8  Score=29.73  Aligned_cols=20  Identities=45%  Similarity=1.286  Sum_probs=13.2

Q ss_pred             CcEEe-cceEeeCCCCcCCCC
Q psy10559         17 HGQCK-NGTCLCVTGWNGKHC   36 (418)
Q Consensus        17 ~G~C~-~g~C~C~~G~~G~~C   36 (418)
                      |-+|. +|.-.|.+||.|+.|
T Consensus        43 hy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   43 HYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             EEEE-SS--EEE-TTEESTTS
T ss_pred             CcccCCCCCCCCCCCCcCCCC
Confidence            45676 488999999999987


No 77 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=49.63  E-value=49  Score=23.71  Aligned_cols=20  Identities=35%  Similarity=0.848  Sum_probs=17.7

Q ss_pred             CCCCCcEEecceEeeCCCCc
Q psy10559         13 WCNEHGQCKNGTCLCVTGWN   32 (418)
Q Consensus        13 pC~n~G~C~~g~C~C~~G~~   32 (418)
                      .|..+..|.++.|.|++||.
T Consensus        27 qC~~~s~C~~g~C~C~~g~~   46 (52)
T PF01683_consen   27 QCIGGSVCVNGRCQCPPGYV   46 (52)
T ss_pred             CCCCcCEEcCCEeECCCCCE
Confidence            68888899999999999985


No 78 
>KOG3514|consensus
Probab=46.52  E-value=1.1e+02  Score=35.92  Aligned_cols=37  Identities=14%  Similarity=0.262  Sum_probs=31.2

Q ss_pred             CCCCCCCCCCccccCCCCCCceeecCCC-Cccccccceeee
Q psy10559        100 CSNHICRSSQLCVSAPKPIDILLRKQPP-AITASFFERMKF  139 (418)
Q Consensus       100 c~~~~C~~~~~C~~~~~~~~~~~C~C~~-G~~g~~Ce~~~f  139 (418)
                      |..++|.|++.|.....   .+-|-|.. +|.|..|++..-
T Consensus       626 C~~nPC~N~g~C~egwN---rfiCDCs~T~~~G~~CerE~t  663 (1591)
T KOG3514|consen  626 CESNPCQNGGKCSEGWN---RFICDCSGTGFEGRTCEREAT  663 (1591)
T ss_pred             cCCCcccCCCCcccccc---ccccccccCcccCccccceee
Confidence            77889999999988766   78899975 999999998543


No 79 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=45.82  E-value=16  Score=34.36  Aligned_cols=34  Identities=29%  Similarity=0.699  Sum_probs=23.2

Q ss_pred             ccCCCCCCCC--CCCCCCCccccCCCCCCceeecCCCCccc
Q psy10559         93 DCEDPECCSN--HICRSSQLCVSAPKPIDILLRKQPPAITA  131 (418)
Q Consensus        93 ~C~dpdCc~~--~~C~~~~~C~~~~~~~~~~~C~C~~G~~g  131 (418)
                      .|.+.+.|..  ..|.  ..|.+..+   +|.|.|++||+.
T Consensus       183 ~C~~~~~C~~~~~~c~--~~C~~~~g---~~~c~c~~g~~~  218 (224)
T cd01475         183 ICVVPDLCATLSHVCQ--QVCISTPG---SYLCACTEGYAL  218 (224)
T ss_pred             cCcCchhhcCCCCCcc--ceEEcCCC---CEEeECCCCccC
Confidence            3544444432  3354  46988888   799999999975


No 80 
>cd03515 Link_domain_TSG_6_like This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accum
Probab=44.56  E-value=3.7  Score=34.08  Aligned_cols=23  Identities=22%  Similarity=0.397  Sum_probs=19.8

Q ss_pred             eccccccc---cccCCCCCcccchhh
Q psy10559        267 LATWKHGF---QGACPDRSSILAESQ  289 (418)
Q Consensus       267 ~~~~~~~~---~~~~~~~~~~~~~~q  289 (418)
                      .+.|++||   +++|.+.|+.||+..
T Consensus         9 ~grY~l~f~eA~~aC~~~ga~lAs~~   34 (93)
T cd03515           9 SGKYKLTYTEAKAACEAEGAHLATYS   34 (93)
T ss_pred             CCccccCHHHHHHHHHHcCCccCCHH
Confidence            56789999   578999999999865


No 81 
>cd03864 M14_CPN Peptidase M14 Carboxypeptidase N (CPN, also known as kininase I, creatine kinase conversion factor, plasma carboxypeptidase B, arginine carboxypeptidase, and protaminase; EC 3.4.17.3) is an extracellular glycoprotein synthesized in the liver and released into the blood, where it is present in high concentrations. CPN belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPN plays an important role in protecting the body from excessive buildup of potentially deleterious peptides that normally act as local autocrine or paracrine hormones. It specifically removes C-terminal basic residues. As CPN can cleave lysine more avidly than arginine residues it is also called lysine carboxypeptidase. CPN substrates inclu
Probab=40.44  E-value=1e+02  Score=32.23  Aligned_cols=71  Identities=18%  Similarity=0.195  Sum_probs=49.0

Q ss_pred             eeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEe
Q psy10559        162 VVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVII  234 (418)
Q Consensus       162 vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~i  234 (418)
                      -|+|.|.+..|.|+.|+.|.+.....+ +.|..+|.|=.++.-| .-++.+.-..|+++..+|-|.-++...+
T Consensus       317 gI~G~V~D~~g~pi~~A~V~v~g~~~~-~~T~~~G~y~r~l~pG-~Y~l~vs~~Gy~~~t~~v~V~~~~~~~~  387 (392)
T cd03864         317 GIKGMVTDENNNGIANAVISVSGISHD-VTSGTLGDYFRLLLPG-TYTVTASAPGYQPSTVTVTVGPAEATLV  387 (392)
T ss_pred             eEEEEEECCCCCccCCeEEEEECCccc-eEECCCCcEEecCCCe-eEEEEEEEcCceeEEEEEEEcCCCcEEE
Confidence            678999999999999999998744333 6677888882223333 3455566777888877777766654433


No 82 
>cd03865 M14_CPE_H Peptidase M14 Carboxypeptidase (CP) E (CPE, also known as carboxypeptidase H, and enkephalin convertase; EC 3.4.17.10) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPE is an important enzyme responsible for the proteolytic processing of prohormone intermediates (such as pro-insulin, pro-opiomelanocortin, or pro-gonadotropin-releasing hormone) by specifically removing C-terminal basic residues. In addition, it has been proposed that the regulated secretory pathway (RSP) of the nervous and endocrine systems utilizes membrane-bound CPE as a sorting receptor. A naturally occurring point mutation in CPE reduces the stability of the enzyme and causes its degradation, leading to an accumulation of numerous neuroendocrine pe
Probab=39.57  E-value=78  Score=33.16  Aligned_cols=71  Identities=21%  Similarity=0.236  Sum_probs=52.0

Q ss_pred             eeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEec
Q psy10559        163 VRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVIID  235 (418)
Q Consensus       163 i~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~id  235 (418)
                      |+|.|.+..|.|+.|+.|++.....+ +.|..+|.|.+.+--| .-++...-..|+++...|-|.-++.+.+|
T Consensus       328 I~G~V~D~~g~pI~~AtV~V~g~~~~-~~T~~~G~Y~~~L~pG-~Ytv~vsa~Gy~~~~~~V~V~~~~~~~vd  398 (402)
T cd03865         328 VKGFVKDLQGNPIANATISVEGIDHD-ITSAKDGDYWRLLAPG-NYKLTASAPGYLAVVKKVAVPYSPAVRVD  398 (402)
T ss_pred             eEEEEECCCCCcCCCeEEEEEcCccc-cEECCCeeEEECCCCE-EEEEEEEecCcccEEEEEEEcCCCcEEEe
Confidence            78999998899999999999854334 4677889998754444 34555667778888777888777665443


No 83 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=39.52  E-value=34  Score=28.56  Aligned_cols=29  Identities=24%  Similarity=0.764  Sum_probs=20.9

Q ss_pred             CCC--CCCCCCCcEEecCCCCCceEEcCCCCCC
Q psy10559         39 EGC--PNSCSNHGQCRVNSDSQWECKCSDGWDG   69 (418)
Q Consensus        39 ~~C--~~~C~~~G~C~~~~~g~~~C~C~~G~~G   69 (418)
                      +.|  ...|..+|.|..  .....|.|.+||..
T Consensus        78 d~Cd~y~~CG~~g~C~~--~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   78 DQCDVYGFCGPNGICNS--NNSPKCSCLPGFEP  108 (110)
T ss_pred             cCCCCccccCCccEeCC--CCCCceECCCCcCC
Confidence            456  577988999943  33456999999875


No 84 
>cd03867 M14_CPZ Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis.  That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
Probab=37.81  E-value=91  Score=32.44  Aligned_cols=64  Identities=19%  Similarity=0.171  Sum_probs=46.1

Q ss_pred             ceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeee
Q psy10559        161 AVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHV  226 (418)
Q Consensus       161 ~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~v  226 (418)
                      --++|.|.+..|.|++|+.|.+.....+ +.|..+|.|...+.-| .-++.+.-..|+++..+|-+
T Consensus       318 ~~i~G~V~D~~g~pi~~A~V~v~g~~~~-~~Td~~G~y~~~l~~G-~y~l~vs~~Gy~~~~~~v~v  381 (395)
T cd03867         318 RGIKGFVKDKDGNPIKGARISVRGIRHD-ITTAEDGDYWRLLPPG-IHIVSAQAPGYTKVMKRVTL  381 (395)
T ss_pred             ceeEEEEEcCCCCccCCeEEEEeccccc-eEECCCceEEEecCCC-cEEEEEEecCeeeEEEEEEe
Confidence            3578999999999999999988754334 6677889996544444 34466666777777666665


No 85 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=37.26  E-value=22  Score=24.46  Aligned_cols=27  Identities=26%  Similarity=0.655  Sum_probs=17.8

Q ss_pred             CCCCCCCcEEecCCCCCceEEcCCCCC
Q psy10559         42 PNSCSNHGQCRVNSDSQWECKCSDGWD   68 (418)
Q Consensus        42 ~~~C~~~G~C~~~~~g~~~C~C~~G~~   68 (418)
                      ...|..|+.|-....|...|.|..||.
T Consensus         4 ~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             SS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             CccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            356788899944446899999999995


No 86 
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=37.10  E-value=1.1e+02  Score=31.38  Aligned_cols=71  Identities=21%  Similarity=0.266  Sum_probs=49.7

Q ss_pred             ceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeec-cceEE
Q psy10559        161 AVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPW-NEVVI  233 (418)
Q Consensus       161 ~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpw-n~~~~  233 (418)
                      ..++|.|.+..|.|+.|+.|.......+ +.|..+|.|.+.+ -.+.-++.+....|+++...+-+-- ++.+.
T Consensus       298 ~~i~G~V~d~~g~pl~~A~V~i~~~~~~-~~Td~~G~f~~~l-~~G~y~l~vs~~Gy~~~~~~v~v~~~g~~~~  369 (374)
T cd03858         298 RGIKGFVRDANGNPIANATISVEGINHD-VTTAEDGDYWRLL-LPGTYNVTASAPGYEPQTKSVVVPNDNSAVV  369 (374)
T ss_pred             CceEEEEECCCCCccCCeEEEEecceee-eEECCCceEEEec-CCEeEEEEEEEcCcceEEEEEEEecCCceEE
Confidence            3789999999999999999988543223 6788999997654 3455566666677777766665543 44443


No 87 
>cd03518 Link_domain_HAPLN_module_1 Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=36.62  E-value=5.8  Score=33.07  Aligned_cols=23  Identities=30%  Similarity=0.588  Sum_probs=20.0

Q ss_pred             eccccccc---cccCCCCCcccchhh
Q psy10559        267 LATWKHGF---QGACPDRSSILAESQ  289 (418)
Q Consensus       267 ~~~~~~~~---~~~~~~~~~~~~~~q  289 (418)
                      .+.|++||   +++|.+.|+.||+..
T Consensus         9 ~grY~l~f~eA~~aC~~~ga~lAs~~   34 (95)
T cd03518           9 LGRYNLNFHEAQQACEEQDATLASFE   34 (95)
T ss_pred             CCccccCHHHHHHHHHHcCCeeCCHH
Confidence            57899999   578999999999865


No 88 
>cd03516 Link_domain_CD44_like This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marke
Probab=35.77  E-value=6.6  Score=35.16  Aligned_cols=29  Identities=21%  Similarity=0.291  Sum_probs=23.3

Q ss_pred             ccCcee----eccccccc---cccCCCCCcccchhh
Q psy10559        261 TMKPVV----LATWKHGF---QGACPDRSSILAESQ  289 (418)
Q Consensus       261 ~~~pv~----~~~~~~~~---~~~~~~~~~~~~~~q  289 (418)
                      +.++||    .+.|+++|   +++|.+.|+.||+..
T Consensus         4 rv~GVFh~~~~grY~lnf~eA~~aC~~~ga~lAs~~   39 (144)
T cd03516           4 RIMGVFLVEKNGRYSLNFTEAKEACRALGLTLASKA   39 (144)
T ss_pred             ceEEEEEeccCCcccCCHHHHHHHHHHcCCeeCCHH
Confidence            455666    37888999   578999999999865


No 89 
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of  Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=35.00  E-value=1.1e+02  Score=31.43  Aligned_cols=74  Identities=16%  Similarity=0.072  Sum_probs=50.9

Q ss_pred             ccceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCcccee-eeeccceEEe
Q psy10559        159 RSAVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIV-HVPWNEVVII  234 (418)
Q Consensus       159 ~~~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V-~vpwn~~~~i  234 (418)
                      ....++|.|.+..|.|+.|+.|.......+ +.+..+|.|...+.-| .-++.+....|+++...+ -|.-++.+.+
T Consensus       294 ~~~~i~G~V~d~~g~pv~~A~V~v~~~~~~-~~td~~G~y~~~l~~G-~Y~l~vs~~Gf~~~~~~~v~v~~g~~~~~  368 (372)
T cd03868         294 VHIGVKGFVRDASGNPIEDATIMVAGIDHN-VTTAKFGDYWRLLLPG-TYTITAVAPGYEPSTVTDVVVKEGEATSV  368 (372)
T ss_pred             hCCceEEEEEcCCCCcCCCcEEEEEecccc-eEeCCCceEEecCCCE-EEEEEEEecCCCceEEeeEEEcCCCeEEE
Confidence            445678999999999999999998754333 6788899996544333 456666777788875543 3544554433


No 90 
>KOG0196|consensus
Probab=33.83  E-value=45  Score=37.94  Aligned_cols=44  Identities=30%  Similarity=0.853  Sum_probs=28.2

Q ss_pred             ceEeeCCCCc----CCCCCc------------CCCCCCCCCCcEEecCCCCCceEEcCCCCCC
Q psy10559         23 GTCLCVTGWN----GKHCTL------------EGCPNSCSNHGQCRVNSDSQWECKCSDGWDG   69 (418)
Q Consensus        23 g~C~C~~G~~----G~~Ce~------------~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G   69 (418)
                      |.|.|.+||.    |..|+.            ..| .+|..|..  ....+.-.|.|..||.-
T Consensus       259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C-~~CP~~S~--s~~ega~~C~C~~gyyR  318 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLC-LPCPPNSH--SSSEGATSCTCENGYYR  318 (996)
T ss_pred             CceeecCCCCcccCCCcceeCCCCcccCCCCCCCC-CCCCCCCC--CCCCCCCcccccCCccc
Confidence            7999999994    667752            112 13444433  22356678999999854


No 91 
>cd01102 Link_Domain The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of
Probab=32.04  E-value=7.9  Score=32.09  Aligned_cols=23  Identities=30%  Similarity=0.388  Sum_probs=19.9

Q ss_pred             eccccccc---cccCCCCCcccchhh
Q psy10559        267 LATWKHGF---QGACPDRSSILAESQ  289 (418)
Q Consensus       267 ~~~~~~~~---~~~~~~~~~~~~~~q  289 (418)
                      .+.|++||   +++|.+.|+.||+..
T Consensus         9 ~g~y~l~f~eA~~aC~~~ga~lAs~~   34 (92)
T cd01102           9 NGRYKLTFAEAALACKARGAHLATPG   34 (92)
T ss_pred             CCCcccCHHHHHHHHHHcCCEeCCHH
Confidence            47799999   578999999999866


No 92 
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=29.68  E-value=1.8e+02  Score=30.12  Aligned_cols=70  Identities=23%  Similarity=0.183  Sum_probs=47.8

Q ss_pred             ccceeeceEEeecCCceeceEEEeeCCCCc-eEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeecc
Q psy10559        159 RSAVVRGRVVTSMGMGLVGVRVSTSTPLEG-FTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWN  229 (418)
Q Consensus       159 ~~~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G-~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn  229 (418)
                      ...-++|.|.+..|.|++|+.|.......+ .+.|..+|.|-+.+--| .-++.+.-..|+++...|.++-+
T Consensus       293 ~~~gI~G~V~D~~g~pi~~A~V~v~g~~~~~~~~T~~~G~y~~~l~pG-~Y~v~vsa~Gy~~~~~~v~v~~~  363 (376)
T cd03866         293 VHLGVKGQVFDSNGNPIPNAIVEVKGRKHICPYRTNVNGEYFLLLLPG-KYMINVTAPGFKTVITNVIIPYN  363 (376)
T ss_pred             hcCceEEEEECCCCCccCCeEEEEEcCCceeEEEECCCceEEEecCCe-eEEEEEEeCCcceEEEEEEeCCC
Confidence            344679999999999999999988754333 34688889996543333 23445566677777666666543


No 93 
>cd01204 IRS_PTB Insulin receptor substrate (IRS) Phosphotyrosine-binding domain(PTB). Insulin receptor substrate (IRS) Phosphotyrosine-binding domain(PTB). This domain has a PH-like fold and is found in insulin receptor substrate molecules. IRS molecules have an N-terminal PH domain , which is followed by an IRS-like PTB domain. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs in the insulin receptor, IGF-I receptor and the IL-4 receptor.
Probab=29.15  E-value=96  Score=26.15  Aligned_cols=18  Identities=17%  Similarity=0.289  Sum_probs=13.4

Q ss_pred             cceEEeEEeeccceeEEe
Q psy10559        395 LQLRKSLCKDFTHTLTFY  412 (418)
Q Consensus       395 l~~vh~~~~~~~~~~~~~  412 (418)
                      +.+--+.|||.+|+-.||
T Consensus        45 v~l~L~~IRRCGHs~~~F   62 (104)
T cd01204          45 VELLLMNIRRCGHSENFF   62 (104)
T ss_pred             EEEeeeeeeeccCcccEE
Confidence            333345699999998887


No 94 
>PF00818 Ice_nucleation:  Ice nucleation protein repeat;  InterPro: IPR000258 Certain Gram-negative bacteria express proteins that enable them to promote nucleation of ice at relatively high temperatures (above -5C) [, ]. These proteins are localised at the outer membrane surface and can cause frost damage to many plants. The primary structure of the proteins contains a highly repetitive domain that dominates the sequence. The domain comprises a number of 48-residue repeats, which themselves contain 3 blocks of 16 residues, the first 8 of which are identical. It is thought that the repetitive domain may be responsible for aligning water molecules in the seed crystal.  [.........48.residues.repeated.domain..........] / / | | \ \ AGYGSTxTagxxssli AGYGSTxTagxxsxlt AGYGSTxTaqxxsxlt [16.residues...] [16.residues...] [16.residues...] ; GO: 0009279 cell outer membrane
Probab=26.48  E-value=26  Score=19.65  Aligned_cols=13  Identities=38%  Similarity=0.562  Sum_probs=11.1

Q ss_pred             EecCCCCCCceeE
Q psy10559        370 YHSSRSAGYLSTI  382 (418)
Q Consensus       370 y~s~~~~gy~~~~  382 (418)
                      |-|-+++|+.|.|
T Consensus         2 YGSTqTA~~~S~L   14 (16)
T PF00818_consen    2 YGSTQTAGEDSSL   14 (16)
T ss_pred             CCcccccCCCccc
Confidence            7788999999876


No 95 
>smart00445 LINK Link (Hyaluronan-binding).
Probab=23.16  E-value=12  Score=31.11  Aligned_cols=22  Identities=27%  Similarity=0.481  Sum_probs=19.0

Q ss_pred             ccccccc---cccCCCCCcccchhh
Q psy10559        268 ATWKHGF---QGACPDRSSILAESQ  289 (418)
Q Consensus       268 ~~~~~~~---~~~~~~~~~~~~~~q  289 (418)
                      +.|++||   +++|.+.|+.||+..
T Consensus        11 g~y~l~f~eA~~aC~~~ga~lAs~~   35 (94)
T smart00445       11 GRYKLTFAEAREACRAQGATLATVG   35 (94)
T ss_pred             CCCccCHHHHHHHHHHcCCEeCCHH
Confidence            4799999   578999999999865


No 96 
>PF06339 Ectoine_synth:  Ectoine synthase;  InterPro: IPR010462 This family consists of several bacterial ectoine synthase proteins. The ectABC genes encode the diaminobutyric acid acetyltransferase (EctA), the diaminobutyric acid aminotransferase (EctB), and the ectoine synthase (EctC). Together these proteins constitute the ectoine biosynthetic pathway [].; GO: 0016836 hydro-lyase activity, 0006596 polyamine biosynthetic process
Probab=21.82  E-value=71  Score=28.03  Aligned_cols=36  Identities=14%  Similarity=0.075  Sum_probs=26.5

Q ss_pred             cCCCCCcccCCCCCCceeeeeeeeCCCceEEEEecC
Q psy10559        338 LTPSSNISCASSNPPQVVQESFQIPGTGLNLVYHSS  373 (418)
Q Consensus       338 ~~~~~~~~~~~~~~~~~~~e~~~i~gt~~~l~y~s~  373 (418)
                      .-.|-|++++.+.=-=++||.+--|||..+|.|.--
T Consensus        20 ~w~SrRlll~~DgmGFS~h~T~i~aGtet~~~YknH   55 (126)
T PF06339_consen   20 NWESRRLLLKDDGMGFSFHETTIYAGTETHIHYKNH   55 (126)
T ss_pred             CceEEEEEEccCCCCEEEEEEEEeCCCeeEEEecCc
Confidence            334555555555555599999999999999999754


Done!