Query         psy7015
Match_columns 284
No_of_seqs    328 out of 1734
Neff          9.7 
Searched_HMMs 46136
Date          Sat Aug 17 00:37:33 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7015.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7015hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.7 8.5E-17 1.8E-21  152.9  15.9   90   42-131  1217-1309(2531)
  2 KOG1217|consensus               99.7 1.3E-15 2.8E-20  139.5  17.4  235   42-282   105-362 (487)
  3 KOG1217|consensus               99.6 1.2E-14 2.6E-19  133.1  17.8  214   44-270   149-389 (487)
  4 KOG1219|consensus               99.6 9.5E-15 2.1E-19  143.8   9.8  117  154-273  3859-3977(4289)
  5 KOG1219|consensus               99.5 2.8E-14 6.1E-19  140.6   9.6  117   98-235  3860-3977(4289)
  6 KOG1214|consensus               99.4 1.6E-12 3.4E-17  119.2  14.0  185   72-269   702-908 (1289)
  7 KOG1214|consensus               99.4 9.8E-12 2.1E-16  114.1  15.4  206   46-265   715-947 (1289)
  8 KOG4289|consensus               99.4 1.6E-12 3.4E-17  124.5  10.5   88   81-187  1218-1308(2531)
  9 KOG1225|consensus               99.3 2.9E-11 6.4E-16  108.8  12.8  131   86-271   235-365 (525)
 10 KOG1225|consensus               99.3 5.6E-11 1.2E-15  107.0  14.2  131   48-233   235-365 (525)
 11 KOG4260|consensus               99.0 4.2E-10   9E-15   91.2   5.0  165   50-230   131-304 (350)
 12 KOG0994|consensus               99.0 2.5E-09 5.4E-14  101.4   9.4  224   40-274   775-1052(1758)
 13 KOG4260|consensus               99.0 9.5E-10 2.1E-14   89.1   5.2  163   88-268   131-304 (350)
 14 KOG0994|consensus               98.9 6.4E-09 1.4E-13   98.8  10.9   57  213-273  1078-1146(1758)
 15 KOG1226|consensus               98.5 9.5E-07 2.1E-11   81.8  10.1  120  143-274   479-621 (783)
 16 KOG1226|consensus               98.4 3.3E-06 7.2E-11   78.3  12.7  146   72-253   469-636 (783)
 17 KOG1836|consensus               98.2 4.9E-05 1.1E-09   77.9  15.1   57  213-273   953-1021(1705)
 18 PF00008 EGF:  EGF-like domain   98.1 2.7E-06 5.8E-11   47.1   2.2   29  242-270     2-31  (32)
 19 smart00179 EGF_CA Calcium-bind  98.1 8.4E-06 1.8E-10   47.3   4.4   36   64-99      2-39  (39)
 20 PF00008 EGF:  EGF-like domain   98.0 5.6E-06 1.2E-10   45.8   2.1   31  203-233     1-32  (32)
 21 smart00179 EGF_CA Calcium-bind  97.9 1.8E-05 3.9E-10   45.9   4.2   29  206-234     9-38  (39)
 22 PF07645 EGF_CA:  Calcium-bindi  97.9 5.8E-06 1.3E-10   49.0   1.6   32   63-94      1-34  (42)
 23 cd00054 EGF_CA Calcium-binding  97.8 5.3E-05 1.2E-09   43.4   4.3   35   64-98      2-37  (38)
 24 PF07645 EGF_CA:  Calcium-bindi  97.8 2.2E-05 4.8E-10   46.4   2.5   31  200-230     2-34  (42)
 25 cd00054 EGF_CA Calcium-binding  97.6 0.00011 2.3E-09   42.1   4.0   29  244-272     9-37  (38)
 26 cd00053 EGF Epidermal growth f  97.4 0.00036 7.8E-09   39.2   4.1   30   69-98      5-35  (36)
 27 PF06247 Plasmod_Pvs28:  Plasmo  97.4 7.2E-05 1.6E-09   58.1   1.5  135   76-232    11-162 (197)
 28 smart00181 EGF Epidermal growt  97.4  0.0004 8.7E-09   39.1   4.0   28   70-98      6-34  (35)
 29 cd00053 EGF Epidermal growth f  97.3 0.00038 8.2E-09   39.1   3.9   30  243-272     5-35  (36)
 30 PF12947 EGF_3:  EGF domain;  I  97.3 0.00016 3.5E-09   41.0   2.2   27  244-270     6-32  (36)
 31 PF06247 Plasmod_Pvs28:  Plasmo  97.3 0.00011 2.4E-09   57.1   1.2  102  165-271    50-163 (197)
 32 smart00181 EGF Epidermal growt  97.2 0.00057 1.2E-08   38.4   3.8   28  244-272     6-34  (35)
 33 PF07974 EGF_2:  EGF-like domai  97.2 0.00052 1.1E-08   37.7   3.2   26  245-272     7-32  (32)
 34 KOG1836|consensus               97.0  0.0062 1.3E-07   63.1  11.4  176   87-274   697-925 (1705)
 35 PF12947 EGF_3:  EGF domain;  I  97.0 0.00055 1.2E-08   38.8   2.0   28  206-233     6-33  (36)
 36 PF12662 cEGF:  Complement Clr-  96.9 0.00071 1.5E-08   34.4   2.0   20  258-278     1-24  (24)
 37 PF12662 cEGF:  Complement Clr-  96.9 0.00085 1.8E-08   34.1   2.2   11   46-56      1-11  (24)
 38 PF12661 hEGF:  Human growth fa  96.9 0.00046   1E-08   29.7   1.0   13  260-272     1-13  (13)
 39 PF07974 EGF_2:  EGF-like domai  96.8   0.002 4.3E-08   35.4   3.2   26  207-234     7-32  (32)
 40 smart00051 DSL delta serrate l  95.6    0.02 4.3E-07   36.9   3.7   46  221-272    17-63  (63)
 41 KOG3512|consensus               95.6   0.079 1.7E-06   47.3   8.5   87  181-274   372-479 (592)
 42 KOG1218|consensus               95.5     1.1 2.3E-05   38.8  15.7  182   44-257    12-199 (316)
 43 smart00051 DSL delta serrate l  94.3   0.094   2E-06   33.8   4.1   47   46-98     16-63  (63)
 44 PF14670 FXa_inhibition:  Coagu  93.9   0.037   8E-07   31.3   1.4   18   39-56     11-28  (36)
 45 KOG3512|consensus               93.4    0.19 4.2E-06   44.9   5.8  105  167-274   280-429 (592)
 46 KOG1218|consensus               92.9     5.5 0.00012   34.3  15.6  148   45-219    47-199 (316)
 47 PF14670 FXa_inhibition:  Coagu  92.8   0.096 2.1E-06   29.6   2.0   20  212-231    10-29  (36)
 48 PF12946 EGF_MSP1_1:  MSP1 EGF   92.5   0.082 1.8E-06   29.8   1.4   26  243-268     4-30  (37)
 49 PHA02887 EGF-like protein; Pro  92.3    0.13 2.8E-06   36.9   2.6   28  246-274    94-123 (126)
 50 PHA03099 epidermal growth fact  91.9    0.15 3.1E-06   37.3   2.5   28  246-274    53-82  (139)
 51 PF12946 EGF_MSP1_1:  MSP1 EGF   91.7   0.068 1.5E-06   30.1   0.5   27   68-94      3-30  (37)
 52 cd01475 vWA_Matrilin VWA_Matri  89.3    0.46   1E-05   39.1   3.7   38   57-95    181-218 (224)
 53 PF00053 Laminin_EGF:  Laminin   89.3    0.25 5.5E-06   29.9   1.6   22  251-274    12-33  (49)
 54 PHA02887 EGF-like protein; Pro  88.5    0.51 1.1E-05   34.0   2.9   28   72-100    94-123 (126)
 55 cd00055 EGF_Lam Laminin-type e  87.9    0.54 1.2E-05   28.6   2.4   17  257-273    17-33  (50)
 56 PF04863 EGF_alliinase:  Alliin  87.8    0.32   7E-06   29.9   1.3   36  244-279    17-56  (56)
 57 PF01414 DSL:  Delta serrate li  87.7    0.16 3.5E-06   32.7  -0.0   47   46-98     16-63  (63)
 58 PHA03099 epidermal growth fact  85.9    0.78 1.7E-05   33.6   2.6   28   72-100    53-82  (139)
 59 cd01475 vWA_Matrilin VWA_Matri  84.0     1.6 3.5E-05   35.8   4.2   38  152-190   181-218 (224)
 60 PF00053 Laminin_EGF:  Laminin   82.3     1.1 2.3E-05   27.1   1.9   22  212-235    11-32  (49)
 61 smart00180 EGF_Lam Laminin-typ  80.2     1.6 3.4E-05   26.1   2.0   16  258-273    17-32  (46)
 62 cd00055 EGF_Lam Laminin-type e  79.9     1.9 4.1E-05   26.2   2.4   20  214-235    14-33  (50)
 63 KOG3516|consensus               78.4       2 4.2E-05   43.2   3.1   39   63-101   544-583 (1306)
 64 KOG3516|consensus               75.5     2.4 5.2E-05   42.6   2.8   41  238-278   545-586 (1306)
 65 PF09064 Tme5_EGF_like:  Thromb  67.9     6.4 0.00014   21.7   2.2   22  172-194    11-32  (34)
 66 PF00954 S_locus_glycop:  S-loc  67.8     7.5 0.00016   27.9   3.4   31   64-95     77-108 (110)
 67 KOG3514|consensus               67.3       4 8.7E-05   40.7   2.3   35   66-100   625-660 (1591)
 68 KOG3514|consensus               62.7     5.9 0.00013   39.6   2.5   36  240-275   625-661 (1591)
 69 PF12955 DUF3844:  Domain of un  59.7     9.6 0.00021   27.1   2.5    9  266-274    53-61  (103)
 70 KOG3509|consensus               53.4      29 0.00064   34.8   5.5   71  201-272   407-478 (964)
 71 PF00954 S_locus_glycop:  S-loc  52.9      18 0.00039   25.9   3.2   24  244-268    84-107 (110)
 72 PF01683 EB:  EB module;  Inter  46.4      48   0.001   19.9   3.9   30  151-190    18-47  (52)
 73 KOG0196|consensus               45.5      43 0.00094   33.0   5.1   67  208-279   248-328 (996)
 74 KOG0196|consensus               38.9      84  0.0018   31.1   5.9   60   47-109   259-332 (996)
 75 KOG3509|consensus               23.9      72  0.0016   32.2   2.9   43  240-282   408-450 (964)
 76 KOG3607|consensus               22.2      67  0.0015   31.5   2.4   28  245-275   631-658 (716)
 77 KOG3607|consensus               20.3      76  0.0017   31.1   2.3   27   71-100   631-657 (716)

No 1  
>KOG4289|consensus
Probab=99.73  E-value=8.5e-17  Score=152.88  Aligned_cols=90  Identities=41%  Similarity=0.973  Sum_probs=79.9

Q ss_pred             cCCCceEEeCCCCCccCCCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCC--CCCCCCCCCCCCEEeeC
Q psy7015          42 AVPSSYTCYCIDGYTGVHCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNV--DECGSNPCQNNGTCHDL  119 (284)
Q Consensus        42 ~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~--~~C~~~~C~~~~~C~~~  119 (284)
                      ...++++|.|++||+|+.|+..+|.|...||.++|+|....|.|+|.|.+||+|..||.+.  ..|.+..|.++++|++.
T Consensus      1217 ~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~ 1296 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNL 1296 (2531)
T ss_pred             cccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeec
Confidence            3457889999999999999999999999999999999999999999999999999998643  45788889999999976


Q ss_pred             C-CCceEeCCCCC
Q psy7015         120 L-NGFVCSCHPGF  131 (284)
Q Consensus       120 ~-~~~~C~C~~g~  131 (284)
                      . +++.|+|+.|-
T Consensus      1297 ~nggf~c~Cp~ge 1309 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYGE 1309 (2531)
T ss_pred             CCCceeccCCCcc
Confidence            4 56888999873


No 2  
>KOG1217|consensus
Probab=99.69  E-value=1.3e-15  Score=139.54  Aligned_cols=235  Identities=40%  Similarity=0.979  Sum_probs=178.7

Q ss_pred             cCCCceEEeCCCCCccCCCCCCCCCCCCCC--CCCCCeEecC---CCCeeeeCCCCCcCCCcccCCCCCC--CCCCCCCC
Q psy7015          42 AVPSSYTCYCIDGYTGVHCQTNWDECWSNP--CHNGGSCIDG---IAAYNCSCPPGYTGPSCESNVDECG--SNPCQNNG  114 (284)
Q Consensus        42 ~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~--C~~~g~C~~~---~g~~~C~C~~G~~G~~C~~~~~~C~--~~~C~~~~  114 (284)
                      ...+++.|.|.+||.|..++.. .+|...+  +...+.|...   ...+.|.|..||.+..+....++|.  ..+|.+.+
T Consensus       105 ~~~~~~~c~c~~g~~~~~~~~~-~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~  183 (487)
T KOG1217|consen  105 DCVGSYECTCPPGYQGTPCEGE-CECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGG  183 (487)
T ss_pred             CCCCCceeeCCCccccCcCCcc-eeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCc
Confidence            3567899999999999988742 1466555  3566777764   4588999999999999976557886  44599899


Q ss_pred             EEeeCCCCceEeCCCCCeeeeecCC-------CCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCC
Q psy7015         115 TCHDLLNGFVCSCHPGFTGNCIDGI-------AAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPG  187 (284)
Q Consensus       115 ~C~~~~~~~~C~C~~g~~g~c~~~~-------~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g  187 (284)
                      .|.+..+.|.|.|+.+|.+.-....       ..+.|.+.+++.+..+...+.++...   . ++|.+..++++|.|++|
T Consensus       184 ~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g  259 (487)
T KOG1217|consen  184 TCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASG---D-GTCVNTVGSYTCRCPEG  259 (487)
T ss_pred             ccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCC---C-CcccccCCceeeeCCCC
Confidence            9999999999999999987632221       11457788999988888766665444   4 88999999999999999


Q ss_pred             CccCCCCCccCCCCCCCCCC-CCCCCEEeeCCCCeeeecCCCCccCCC--cccCCcC----CCCCCCCCCEE--eecCCC
Q psy7015         188 FTGWTGSLCQSATNECESSP-CQNGGVCVDLHAAYTCACLFGFTGRNC--DIELKIC----ENSPCLNEALC--LEEEEE  258 (284)
Q Consensus       188 ~~g~~~~~c~~~~~~C~~~~-C~~~g~C~~~~g~~~C~C~~G~~g~~C--~~~~~~C----~~~~C~~~~~C--~~~~~~  258 (284)
                      |.+... ....++++|.... |.++++|++..+.|.|.|++||+|..+  ......|    ...+|.+++.|  ......
T Consensus       260 ~~~~~~-~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~  338 (487)
T KOG1217|consen  260 YTGDAC-VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGG  338 (487)
T ss_pred             cccccc-ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCC
Confidence            988431 1234677887654 899999999999899999999999998  2233566    34568888888  334456


Q ss_pred             eeeecCCCCcCCCccccccccccC
Q psy7015         259 QVCYCVPDYHGNRCQYQYDECQIT  282 (284)
Q Consensus       259 ~~C~C~~G~~G~~C~~~~~~C~~~  282 (284)
                      +.|.|..+|.|..|+...++|...
T Consensus       339 ~~C~c~~~~~g~~C~~~~~~C~~~  362 (487)
T KOG1217|consen  339 FRCACGPGFTGRRCEDSNDECASS  362 (487)
T ss_pred             CCcCCCCCCCCCccccCCccccCC
Confidence            789999999999999655688664


No 3  
>KOG1217|consensus
Probab=99.64  E-value=1.2e-14  Score=133.12  Aligned_cols=214  Identities=42%  Similarity=1.029  Sum_probs=161.0

Q ss_pred             CCceEEeCCCCCccCCCCCCCCCCC--CCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCC
Q psy7015          44 PSSYTCYCIDGYTGVHCQTNWDECW--SNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLN  121 (284)
Q Consensus        44 ~g~~~C~C~~G~~g~~C~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~  121 (284)
                      ...+.|.|..||.+..++...++|.  ..+|.+.+.|.+..+.|.|.|++||.+..++..         ...+.|+..  
T Consensus       149 ~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------~~~~~c~~~--  217 (487)
T KOG1217|consen  149 VGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------GNGGTCVDS--  217 (487)
T ss_pred             CCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC---------CCCceEecc--
Confidence            3589999999999999987667887  445999999999999999999999999988643         112223221  


Q ss_pred             CceEeCCCCCe---------------eeeecCCCCeeeeCCCCccCCC--CccCCCCCCCCC-CCCCCEeccCCCCeeEe
Q psy7015         122 GFVCSCHPGFT---------------GNCIDGIAAYNCSCPPGYTGPS--CESNVDECGSNP-CQNNGTCHDLLNGFVCS  183 (284)
Q Consensus       122 ~~~C~C~~g~~---------------g~c~~~~~~~~C~C~~g~~g~~--C~~~~~~C~~~~-C~~~~~C~~~~g~~~C~  183 (284)
                       +.|.+..++.               +.|++..+.++|++++||.+..  ...+++.|.... |..+++|++..+.|.|.
T Consensus       218 -~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~  296 (487)
T KOG1217|consen  218 -VACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCT  296 (487)
T ss_pred             -eeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceee
Confidence             1122222211               5667777788999999999876  233678887754 88899999999889999


Q ss_pred             CCCCCccCCCCCccCCCCCC----CCCCCCCCCEE--eeCCCCeeeecCCCCccCCCcccCCcCCCCCCCCCCEEee-cC
Q psy7015         184 CHPGFTGWTGSLCQSATNEC----ESSPCQNGGVC--VDLHAAYTCACLFGFTGRNCDIELKICENSPCLNEALCLE-EE  256 (284)
Q Consensus       184 C~~g~~g~~~~~c~~~~~~C----~~~~C~~~g~C--~~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~C~~~~~C~~-~~  256 (284)
                      |++||.+..... ..+..+|    ...+|.+++.|  ....+.+.|.|..+|.|..|+...+.|...++..++.|++ ..
T Consensus       297 C~~g~~g~~~~~-~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~  375 (487)
T KOG1217|consen  297 CPPGFTGRLCTE-CVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETP  375 (487)
T ss_pred             CCCCCCCCCCcc-ccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCC
Confidence            999999843211 2234566    34558888888  3344467899999999999985445888888999999998 68


Q ss_pred             CCeeeecCCCCcCC
Q psy7015         257 EEQVCYCVPDYHGN  270 (284)
Q Consensus       257 ~~~~C~C~~G~~G~  270 (284)
                      +++.|.|..+|.+.
T Consensus       376 ~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  376 GSYRCACPAGFAGK  389 (487)
T ss_pred             CCeEecCCCccccC
Confidence            89999999999874


No 4  
>KOG1219|consensus
Probab=99.56  E-value=9.5e-15  Score=143.77  Aligned_cols=117  Identities=31%  Similarity=0.869  Sum_probs=108.3

Q ss_pred             CCccCCCCCCCCCCCCCCEeccCC-CCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCccC
Q psy7015         154 SCESNVDECGSNPCQNNGTCHDLL-NGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLHAAYTCACLFGFTGR  232 (284)
Q Consensus       154 ~C~~~~~~C~~~~C~~~~~C~~~~-g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~  232 (284)
                      .|..-.+.|..+||.++|+|.... ++|.|.|++-|.|   ..|+.++..|..+||..+|+|+...+.|.|.|+.||+|.
T Consensus      3859 gC~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG---~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3859 GCSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSG---NHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred             cccccccccccCcccCCCEecCCCCCceEEeCcccccC---cccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence            454444789999999999999765 6799999999998   999999999999999999999999999999999999999


Q ss_pred             CCccc-CCcCCCCCCCCCCEEeecCCCeeeecCCCCcCCCcc
Q psy7015         233 NCDIE-LKICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       233 ~C~~~-~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~  273 (284)
                      +|+.. +++|...+|..+|+|++..++|.|.|.+||.|..|.
T Consensus      3936 ~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3936 RCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred             eeecccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence            99987 889999999999999999999999999999999985


No 5  
>KOG1219|consensus
Probab=99.53  E-value=2.8e-14  Score=140.58  Aligned_cols=117  Identities=40%  Similarity=1.071  Sum_probs=105.0

Q ss_pred             cccCCCCCCCCCCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCC
Q psy7015          98 CESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLL  177 (284)
Q Consensus        98 C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~  177 (284)
                      |..-.+.|..+||+++|.|+..++                  ++|.|.|++-|.|..|+.++..|.++||..+++|+...
T Consensus      3860 C~l~~d~C~~npCqhgG~C~~~~~------------------ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQPK------------------GGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFY 3921 (4289)
T ss_pred             ccccccccccCcccCCCEecCCCC------------------CceEEeCcccccCcccccccccccCCCCCCCCEEEecC
Confidence            433347888999999999987643                  46678888999999999999999999999999999999


Q ss_pred             CCeeEeCCCCCccCCCCCccCC-CCCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCc
Q psy7015         178 NGFVCSCHPGFTGWTGSLCQSA-TNECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCD  235 (284)
Q Consensus       178 g~~~C~C~~g~~g~~~~~c~~~-~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~  235 (284)
                      ++|.|.|+.||+|   .+|+.+ +++|+.++|..+|.|++..|+|.|.|.+||.|+.|.
T Consensus      3922 n~f~CnC~~gyTG---~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3922 NGFLCNCPNGYTG---KRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred             CCeeEeCCCCccC---ceeecccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence            9999999999998   888877 899999999999999999999999999999999985


No 6  
>KOG1214|consensus
Probab=99.45  E-value=1.6e-12  Score=119.18  Aligned_cols=185  Identities=26%  Similarity=0.678  Sum_probs=126.0

Q ss_pred             CCCCCeEecCCC-CeeeeCCCCCcC--CCcccCCCCCCCC--CCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeC
Q psy7015          72 CHNGGSCIDGIA-AYNCSCPPGYTG--PSCESNVDECGSN--PCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSC  146 (284)
Q Consensus        72 C~~~g~C~~~~g-~~~C~C~~G~~G--~~C~~~~~~C~~~--~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C  146 (284)
                      |..+..|....+ .|+|.|..||.|  .+|. ++++|...  .|.++..|++.+++|+|.|..||.-.    ...++|+-
T Consensus       702 cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~----dd~~tCV~  776 (1289)
T KOG1214|consen  702 CDTTARCHPGTGVDYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFA----DDRHTCVL  776 (1289)
T ss_pred             cCCCccccCCCCcceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceec----cCCcceEE
Confidence            455667776644 689999999985  5675 66788543  49999999999999999999887521    11122321


Q ss_pred             CCCccCCCCccCCCCCC--CCCCCCCCE--eccC-CCCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCCCCe
Q psy7015         147 PPGYTGPSCESNVDECG--SNPCQNNGT--CHDL-LNGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLHAAY  221 (284)
Q Consensus       147 ~~g~~g~~C~~~~~~C~--~~~C~~~~~--C~~~-~g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~g~~  221 (284)
                      ..--      ..++.|.  .+.|.-.+.  |+.. .++|.|.|.+||.| ++..|. ++++|.++.|...+.|.++++++
T Consensus       777 i~~p------ap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsG-DG~~c~-dvDeC~psrChp~A~Cyntpgsf  848 (1289)
T KOG1214|consen  777 ITPP------APANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSG-DGHQCT-DVDECSPSRCHPAATCYNTPGSF  848 (1289)
T ss_pred             ecCC------CCCCccccCccccCcCCceEEEecCCceEEEeecCCccC-Cccccc-cccccCccccCCCceEecCCCcc
Confidence            0000      1233342  234555554  4433 35799999999999 566665 67999999999999999999999


Q ss_pred             eeecCCCCccCC--Ccc---cCCcCCC-----CCCCCCCEEee--cCCCeeeecCCCCcC
Q psy7015         222 TCACLFGFTGRN--CDI---ELKICEN-----SPCLNEALCLE--EEEEQVCYCVPDYHG  269 (284)
Q Consensus       222 ~C~C~~G~~g~~--C~~---~~~~C~~-----~~C~~~~~C~~--~~~~~~C~C~~G~~G  269 (284)
                      .|+|.+||+|+.  |..   ....|..     ..|+.+..|.+  ++..+.+.|.++--|
T Consensus       849 sC~C~pGy~GDGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG  908 (1289)
T KOG1214|consen  849 SCRCQPGYYGDGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPG  908 (1289)
T ss_pred             eeecccCccCCCceecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCC
Confidence            999999999764  432   1233432     23666665543  455678888776666


No 7  
>KOG1214|consensus
Probab=99.40  E-value=9.8e-12  Score=114.10  Aligned_cols=206  Identities=28%  Similarity=0.673  Sum_probs=125.1

Q ss_pred             ceEEeCCCCCcc--CCCCCCCCCCCCC--CCCCCCeEecCCCCeeeeCCCCCc----CCCcccCCCCCCCCCCCCC-CEE
Q psy7015          46 SYTCYCIDGYTG--VHCQTNWDECWSN--PCHNGGSCIDGIAAYNCSCPPGYT----GPSCESNVDECGSNPCQNN-GTC  116 (284)
Q Consensus        46 ~~~C~C~~G~~g--~~C~~~~~~C~~~--~C~~~g~C~~~~g~~~C~C~~G~~----G~~C~~~~~~C~~~~C~~~-~~C  116 (284)
                      .|+|.|..||.|  ..|. ++++|...  .|..+..|++.+++|+|.|..||.    +-+|....+.-..++|... ..|
T Consensus       715 ~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C  793 (1289)
T KOG1214|consen  715 DYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTC  793 (1289)
T ss_pred             ceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCcccc
Confidence            589999999984  5665 67788754  499999999999999999999885    3456433332233334322 122


Q ss_pred             eeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCC--CCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCC
Q psy7015         117 HDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGP--SCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGS  194 (284)
Q Consensus       117 ~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~--~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~  194 (284)
                      . ..+  +++|.       ..+.+.|.|.|.+||.|+  .|. ++++|.++-|...++|.++.+++.|+|.+||.| ++.
T Consensus       794 ~-i~g--~a~c~-------~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~G-DGf  861 (1289)
T KOG1214|consen  794 A-IAG--QARCV-------HHGGSTYSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYG-DGF  861 (1289)
T ss_pred             C-cCC--ceEEE-------ecCCceEEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCccC-CCc
Confidence            1 111  11111       123356777788888754  565 679999999999999999999999999999999 566


Q ss_pred             CccCC---CCCCCC-----CCCCCCCEEe--eCCCCeeeecCCCCcc---CCCcccCCcCCCCCCCCCCEEeec---CCC
Q psy7015         195 LCQSA---TNECES-----SPCQNGGVCV--DLHAAYTCACLFGFTG---RNCDIELKICENSPCLNEALCLEE---EEE  258 (284)
Q Consensus       195 ~c~~~---~~~C~~-----~~C~~~g~C~--~~~g~~~C~C~~G~~g---~~C~~~~~~C~~~~C~~~~~C~~~---~~~  258 (284)
                      .|..+   ...|..     ..|+.+..|.  ..+..+.+.|.++=.|   ..|.... +----.|..++.+...   ..+
T Consensus       862 ~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~-~~~vp~Cd~hgh~ap~qchG~~  940 (1289)
T KOG1214|consen  862 QCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSP-EQYVPQCDDHGHFAPLQCHGKS  940 (1289)
T ss_pred             eecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCCCcc-cccCCCccccccccccccCCCc
Confidence            66543   223322     2255544332  2234466766555444   3443211 1001125555555432   234


Q ss_pred             eeeecCC
Q psy7015         259 QVCYCVP  265 (284)
Q Consensus       259 ~~C~C~~  265 (284)
                      ++|.|..
T Consensus       941 ~~CwCvd  947 (1289)
T KOG1214|consen  941 DFCWCVD  947 (1289)
T ss_pred             ceeEEec
Confidence            6677755


No 8  
>KOG4289|consensus
Probab=99.39  E-value=1.6e-12  Score=124.51  Aligned_cols=88  Identities=45%  Similarity=1.143  Sum_probs=66.8

Q ss_pred             CCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCC-
Q psy7015          81 GIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNV-  159 (284)
Q Consensus        81 ~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~-  159 (284)
                      ..+.+.|.|++||+|..|+.++|.|...||.+++.|....++|.|.|+                   +||+|..|+.+. 
T Consensus      1218 pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCr-------------------pg~tGehCEvs~~ 1278 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECR-------------------PGFTGEHCEVSAR 1278 (2531)
T ss_pred             ccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEec-------------------CCccccceeeecc
Confidence            345689999999999999999999999999999999988777666655                   555555665322 


Q ss_pred             -CCCCCCCCCCCCEeccCC-CCeeEeCCCC
Q psy7015         160 -DECGSNPCQNNGTCHDLL-NGFVCSCHPG  187 (284)
Q Consensus       160 -~~C~~~~C~~~~~C~~~~-g~~~C~C~~g  187 (284)
                       -.|.+..|.++++|++.. +.+.|.|+.|
T Consensus      1279 agrCvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1279 AGRCVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             cCccccceecCCCEEeecCCCceeccCCCc
Confidence             346666677777887653 5677777776


No 9  
>KOG1225|consensus
Probab=99.31  E-value=2.9e-11  Score=108.80  Aligned_cols=131  Identities=35%  Similarity=0.982  Sum_probs=93.8

Q ss_pred             eeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCC
Q psy7015          86 NCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSN  165 (284)
Q Consensus        86 ~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~  165 (284)
                      .|.|+.+|+|..|.  ...|... |..++.|++                       .+|.|++||+|..|..  -.|...
T Consensus       235 ic~c~~~~~g~~c~--~~~C~~~-c~~~g~c~~-----------------------G~CIC~~Gf~G~dC~e--~~Cp~~  286 (525)
T KOG1225|consen  235 ICECPEGYFGPLCS--TIYCPGG-CTGRGQCVE-----------------------GRCICPPGFTGDDCDE--LVCPVD  286 (525)
T ss_pred             eeecCCceeCCccc--cccCCCC-CcccceEeC-----------------------CeEeCCCCCcCCCCCc--ccCCcc
Confidence            68888888888875  2233222 444445543                       2677888889999963  345544


Q ss_pred             CCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCCcCCCCC
Q psy7015         166 PCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELKICENSP  245 (284)
Q Consensus       166 ~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~  245 (284)
                       |+.++.+++    ..|.|++||+|   ..|..  ..|. ..|.++|.|+  .|  +|.|.+||+|..|+..      . 
T Consensus       287 -cs~~g~~~~----g~CiC~~g~~G---~dCs~--~~cp-adC~g~G~Ci--~G--~C~C~~Gy~G~~C~~~------~-  344 (525)
T KOG1225|consen  287 -CSGGGVCVD----GECICNPGYSG---KDCSI--RRCP-ADCSGHGKCI--DG--ECLCDEGYTGELCIQR------A-  344 (525)
T ss_pred             -cCCCceecC----CEeecCCCccc---ccccc--ccCC-ccCCCCCccc--CC--ceEeCCCCcCCccccc------c-
Confidence             777777765    38999999988   66642  2343 5699999998  33  7999999999999753      3 


Q ss_pred             CCCCCEEeecCCCeeeecCCCCcCCC
Q psy7015         246 CLNEALCLEEEEEQVCYCVPDYHGNR  271 (284)
Q Consensus       246 C~~~~~C~~~~~~~~C~C~~G~~G~~  271 (284)
                      |++++.|++  +   |+|..||.|.+
T Consensus       345 C~~~g~cv~--g---C~C~~Gw~G~d  365 (525)
T KOG1225|consen  345 CSGGGQCVN--G---CKCKKGWRGPD  365 (525)
T ss_pred             cCCCceecc--C---ceeccCccCCC
Confidence            888899854  2   99999999998


No 10 
>KOG1225|consensus
Probab=99.30  E-value=5.6e-11  Score=107.04  Aligned_cols=131  Identities=40%  Similarity=1.079  Sum_probs=99.6

Q ss_pred             EEeCCCCCccCCCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceEeC
Q psy7015          48 TCYCIDGYTGVHCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSC  127 (284)
Q Consensus        48 ~C~C~~G~~g~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C  127 (284)
                      .|.|..+|.|..|+.  ..|. ..|..++.|++.    +|+|++||+|.+|..  -.|... |+.++.+++.        
T Consensus       235 ic~c~~~~~g~~c~~--~~C~-~~c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g--------  296 (525)
T KOG1225|consen  235 ICECPEGYFGPLCST--IYCP-GGCTGRGQCVEG----RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDG--------  296 (525)
T ss_pred             eeecCCceeCCcccc--ccCC-CCCcccceEeCC----eEeCCCCCcCCCCCc--ccCCcc-cCCCceecCC--------
Confidence            799999999999973  2343 347777889887    599999999999963  345444 7666666542        


Q ss_pred             CCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCCCCCCCCC
Q psy7015         128 HPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSATNECESSP  207 (284)
Q Consensus       128 ~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~  207 (284)
                                     +|+|.+||.|..|+..  .|. ..|..+|.|++    .+|.|.+||+|   ..|...       .
T Consensus       297 ---------------~CiC~~g~~G~dCs~~--~cp-adC~g~G~Ci~----G~C~C~~Gy~G---~~C~~~-------~  344 (525)
T KOG1225|consen  297 ---------------ECICNPGYSGKDCSIR--RCP-ADCSGHGKCID----GECLCDEGYTG---ELCIQR-------A  344 (525)
T ss_pred             ---------------EeecCCCccccccccc--cCC-ccCCCCCcccC----CceEeCCCCcC---Cccccc-------c
Confidence                           5667888888888643  343 45999999982    48999999998   666532       3


Q ss_pred             CCCCCEEeeCCCCeeeecCCCCccCC
Q psy7015         208 CQNGGVCVDLHAAYTCACLFGFTGRN  233 (284)
Q Consensus       208 C~~~g~C~~~~g~~~C~C~~G~~g~~  233 (284)
                      |.+++.|++  +   |.|..||.|.+
T Consensus       345 C~~~g~cv~--g---C~C~~Gw~G~d  365 (525)
T KOG1225|consen  345 CSGGGQCVN--G---CKCKKGWRGPD  365 (525)
T ss_pred             cCCCceecc--C---ceeccCccCCC
Confidence            888899985  2   99999999998


No 11 
>KOG4260|consensus
Probab=99.00  E-value=4.2e-10  Score=91.18  Aligned_cols=165  Identities=29%  Similarity=0.747  Sum_probs=95.8

Q ss_pred             eCCCCCccCCCCCCCCCCCCCCCCCCCeEec---CCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCCceE-
Q psy7015          50 YCIDGYTGVHCQTNWDECWSNPCHNGGSCID---GIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVC-  125 (284)
Q Consensus        50 ~C~~G~~g~~C~~~~~~C~~~~C~~~g~C~~---~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C-  125 (284)
                      -|++|-.|++|.. ...-...||..+|.|.-   ..|+..|.|.+||+|..|..    |...--.   . ........| 
T Consensus       131 CCp~gtyGpdCl~-Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~----Cg~eyfe---s-~Rne~~lvCt  201 (350)
T KOG4260|consen  131 CCPDGTYGPDCLQ-CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY----CGIEYFE---S-SRNEQHLVCT  201 (350)
T ss_pred             ccCCCCcCCcccc-CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc----cchHHHH---h-hcccccchhh
Confidence            3889999998863 22223467999999973   34677899999999998852    2100000   0 000000111 


Q ss_pred             eCCCCCeeeeecCCCCeee-eCCCCcc--CCCCccCCCCCCC--CCCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCC
Q psy7015         126 SCHPGFTGNCIDGIAAYNC-SCPPGYT--GPSCESNVDECGS--NPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSAT  200 (284)
Q Consensus       126 ~C~~g~~g~c~~~~~~~~C-~C~~g~~--g~~C~~~~~~C~~--~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~  200 (284)
                      .|..+-.|.|... .+..| .|..||.  -..|. ++++|..  .||.....|+|+.|+|.|.+++||.+ ....|+...
T Consensus       202 ~Ch~~C~~~Csg~-~~k~C~kCkkGW~lde~gCv-DvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~-g~d~C~~~~  278 (350)
T KOG4260|consen  202 ACHEGCLGVCSGE-SSKGCSKCKKGWKLDEEGCV-DVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK-GVDECQFCA  278 (350)
T ss_pred             hhhhhhhcccCCC-CCCChhhhcccceecccccc-cHHHHhcCCCCCChhheeecCCCceEecccccccC-ChHHhhhhh
Confidence            1222222222111 12223 3666665  23565 7888843  56877788999999999988888876 222232111


Q ss_pred             CCCCCCCCCCCCEEeeCCCCeeeecCCCCc
Q psy7015         201 NECESSPCQNGGVCVDLHAAYTCACLFGFT  230 (284)
Q Consensus       201 ~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~  230 (284)
                      +.|    -..+..|.++++.|+|+|..|+.
T Consensus       279 d~~----~~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  279 DVC----ASKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             hhc----ccCCCCcccCCccEEEEecccce
Confidence            111    12356788888889998888764


No 12 
>KOG0994|consensus
Probab=98.97  E-value=2.5e-09  Score=101.43  Aligned_cols=224  Identities=25%  Similarity=0.602  Sum_probs=108.6

Q ss_pred             eecCCCceEEeCCCCCccCCCCCCC--------CCCCCCCCCCCC----eEecCCCCeeeeCCCCCcCCCccc------C
Q psy7015          40 IFAVPSSYTCYCIDGYTGVHCQTNW--------DECWSNPCHNGG----SCIDGIAAYNCSCPPGYTGPSCES------N  101 (284)
Q Consensus        40 ~~~~~g~~~C~C~~G~~g~~C~~~~--------~~C~~~~C~~~g----~C~~~~g~~~C~C~~G~~G~~C~~------~  101 (284)
                      ....+.+.+|.|+|+-.|..|....        ..|....|...|    .|....|  +|.|.+|-+|..|..      .
T Consensus       775 ~vCn~~GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tG--QC~C~~g~ygrqCnqCqpG~Wg  852 (1758)
T KOG0994|consen  775 SVCNPNGGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITG--QCQCRPGTYGRQCNQCQPGYWG  852 (1758)
T ss_pred             ccccCCCceecccCccccccccccCCcccCcCCccCcccccccccccccccccccc--ceeeccccchhhccccCCCccC
Confidence            3444667799999999998886311        112222233322    3444444  488888888877653      2


Q ss_pred             CCCCCCCCCCCCC-EEeeCCCCceEeCCCCCeeeeecCCCCeee-eCCCCccCCCCccCCCCCCCCCCCCCC--------
Q psy7015         102 VDECGSNPCQNNG-TCHDLLNGFVCSCHPGFTGNCIDGIAAYNC-SCPPGYTGPSCESNVDECGSNPCQNNG--------  171 (284)
Q Consensus       102 ~~~C~~~~C~~~~-~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C-~C~~g~~g~~C~~~~~~C~~~~C~~~~--------  171 (284)
                      ..+|.+..|+.|+ .|...         .|.--.|.+...++.| .|..||+|+.-...-..|.+-||..+-        
T Consensus       853 FPeCr~CqCNgHA~~Cd~~---------tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~  923 (1758)
T KOG0994|consen  853 FPECRPCQCNGHADTCDPI---------TGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHAD  923 (1758)
T ss_pred             CCcCccccccCcccccCcc---------ccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccc
Confidence            3344443344332 22221         1222234444445555 366666654322222233333333211        


Q ss_pred             Eec--cCCCCeeEeCCCCCccCCCCCccCC------------CCCCC-------CCCCCC-CCE---EeeCCCCeee-ec
Q psy7015         172 TCH--DLLNGFVCSCHPGFTGWTGSLCQSA------------TNECE-------SSPCQN-GGV---CVDLHAAYTC-AC  225 (284)
Q Consensus       172 ~C~--~~~g~~~C~C~~g~~g~~~~~c~~~------------~~~C~-------~~~C~~-~g~---C~~~~g~~~C-~C  225 (284)
                      .|.  +......|.|.+||.|.....|...            .-+|.       +..|.. .|.   |...+.+.+| .|
T Consensus       924 sC~~d~~t~~ivC~C~~GY~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~C 1003 (1758)
T KOG0994|consen  924 SCYLDTRTQQIVCHCQEGYSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHC 1003 (1758)
T ss_pred             cccccccccceeeecccCccccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhc
Confidence            232  2233456888888877322222100            00111       111211 122   2222333455 58


Q ss_pred             CCCCccCCCcccCCcCCCCCCCCCCEEeecCCCeeeecCCCCcCCCccc
Q psy7015         226 LFGFTGRNCDIELKICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQY  274 (284)
Q Consensus       226 ~~G~~g~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~  274 (284)
                      .+||+|+.-......|.-..-..+.+|.-+..+++|.|.+..+|.+|+.
T Consensus      1004 k~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CDq 1052 (1758)
T KOG0994|consen 1004 KDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCDQ 1052 (1758)
T ss_pred             cccchhHHHHhhhhhheccccccCCccccccccCcCCCCcccccccccc
Confidence            9999987543333333222111223354477788999999999999953


No 13 
>KOG4260|consensus
Probab=98.95  E-value=9.5e-10  Score=89.13  Aligned_cols=163  Identities=25%  Similarity=0.596  Sum_probs=100.0

Q ss_pred             eCCCCCcCCCcccCCCCCCCCCCCCCCEEee---CCCCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCC
Q psy7015          88 SCPPGYTGPSCESNVDECGSNPCQNNGTCHD---LLNGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGS  164 (284)
Q Consensus        88 ~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~---~~~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~  164 (284)
                      -|++|-+|++|..- ..-+..+|..++.|.-   ..|+.+|.|.+||.|.-..       .|..+|+-..=....-.|..
T Consensus       131 CCp~gtyGpdCl~C-pggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-------~Cg~eyfes~Rne~~lvCt~  202 (350)
T KOG4260|consen  131 CCPDGTYGPDCLQC-PGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-------YCGIEYFESSRNEQHLVCTA  202 (350)
T ss_pred             ccCCCCcCCccccC-CCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-------ccchHHHHhhcccccchhhh
Confidence            38999999999642 2335667999999863   2344566666666654211       14444431100000001110


Q ss_pred             --CCCCCCCEeccCCCCeeE-eCCCCCccCCCCCccCCCCCCC--CCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCC
Q psy7015         165 --NPCQNNGTCHDLLNGFVC-SCHPGFTGWTGSLCQSATNECE--SSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELK  239 (284)
Q Consensus       165 --~~C~~~~~C~~~~g~~~C-~C~~g~~g~~~~~c~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~  239 (284)
                        ..|.  +.|..... -.| .|..||... ...| .|+++|.  +.+|.....|+|+.|+|.|.+++||.+.     ++
T Consensus       203 Ch~~C~--~~Csg~~~-k~C~kCkkGW~ld-e~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d  272 (350)
T KOG4260|consen  203 CHEGCL--GVCSGESS-KGCSKCKKGWKLD-EEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VD  272 (350)
T ss_pred             hhhhhh--cccCCCCC-CChhhhcccceec-cccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccCC-----hH
Confidence              0121  23432222 234 689999874 3455 4899995  5679999999999999999999999873     22


Q ss_pred             cC---CCCCCCCCCEEeecCCCeeeecCCCCc
Q psy7015         240 IC---ENSPCLNEALCLEEEEEQVCYCVPDYH  268 (284)
Q Consensus       240 ~C---~~~~C~~~~~C~~~~~~~~C~C~~G~~  268 (284)
                      .|   ...-=..+..|.+.++.|+|+|.+|+.
T Consensus       273 ~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  273 ECQFCADVCASKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             HhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence            22   222223567788899999999998874


No 14 
>KOG0994|consensus
Probab=98.94  E-value=6.4e-09  Score=98.75  Aligned_cols=57  Identities=30%  Similarity=0.780  Sum_probs=36.6

Q ss_pred             EEeeCCCCeeeecCCCCccCCCccc--------CCcCCCCCCCCCC----EEeecCCCeeeecCCCCcCCCcc
Q psy7015         213 VCVDLHAAYTCACLFGFTGRNCDIE--------LKICENSPCLNEA----LCLEEEEEQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       213 ~C~~~~g~~~C~C~~G~~g~~C~~~--------~~~C~~~~C~~~~----~C~~~~~~~~C~C~~G~~G~~C~  273 (284)
                      +|...+|  .|.|.+||-|+.|+..        ...|....|...|    +|  +..++.|+|.+|..|.+|+
T Consensus      1078 qCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQC--dr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQC--DRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred             ccccccc--ceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCc--cccCCceeecCCCCCcchh
Confidence            5655555  8999999999998732        1123222233332    44  3345679999999998884


No 15 
>KOG1226|consensus
Probab=98.48  E-value=9.5e-07  Score=81.80  Aligned_cols=120  Identities=32%  Similarity=0.765  Sum_probs=80.5

Q ss_pred             eeeCCCCccCCCCccCC---------CCCC----CCCCCCCCEeccCCCCeeEeCCCCCcc-CCCCCccCCCCCCCCC--
Q psy7015         143 NCSCPPGYTGPSCESNV---------DECG----SNPCQNNGTCHDLLNGFVCSCHPGFTG-WTGSLCQSATNECESS--  206 (284)
Q Consensus       143 ~C~C~~g~~g~~C~~~~---------~~C~----~~~C~~~~~C~~~~g~~~C~C~~g~~g-~~~~~c~~~~~~C~~~--  206 (284)
                      .|.|.+||.|+.|+-..         +.|.    ..+|...|.|.=    .+|+|.+...+ ..|..|.-+.-.|..+  
T Consensus       479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~~i~G~fCECDnfsC~r~~g  554 (783)
T KOG1226|consen  479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNGKIYGKFCECDNFSCERHKG  554 (783)
T ss_pred             ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCCceeeeeeeccCcccccccC
Confidence            45677777777775322         2231    236888888764    47888876552 1236676554445433  


Q ss_pred             -CCCCCCEEeeCCCCeeeecCCCCccCCCcc--cCCcCCC---CCCCCCCEEeecCCCeeeecCCC-CcCCCccc
Q psy7015         207 -PCQNGGVCVDLHAAYTCACLFGFTGRNCDI--ELKICEN---SPCLNEALCLEEEEEQVCYCVPD-YHGNRCQY  274 (284)
Q Consensus       207 -~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~--~~~~C~~---~~C~~~~~C~~~~~~~~C~C~~G-~~G~~C~~  274 (284)
                       .|.++|.|.-.    +|+|.+||+|..|+-  +.+.|..   ..|+..|.|.-.    +|+|... |.|..||.
T Consensus       555 ~lC~g~G~C~CG----~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~  621 (783)
T KOG1226|consen  555 VLCGGHGRCECG----RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK  621 (783)
T ss_pred             cccCCCCeEeCC----cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence             49999998643    799999999998863  4455643   248888888543    4999776 99999985


No 16 
>KOG1226|consensus
Probab=98.45  E-value=3.3e-06  Score=78.30  Aligned_cols=146  Identities=34%  Similarity=0.819  Sum_probs=87.9

Q ss_pred             CCCCCeEecCCCCeeeeCCCCCcCCCcccCC---------CCCCC----CCCCCCCEEeeCCCCceEeCCCCCeeeeecC
Q psy7015          72 CHNGGSCIDGIAAYNCSCPPGYTGPSCESNV---------DECGS----NPCQNNGTCHDLLNGFVCSCHPGFTGNCIDG  138 (284)
Q Consensus        72 C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~---------~~C~~----~~C~~~~~C~~~~~~~~C~C~~g~~g~c~~~  138 (284)
                      |+.+|+.+-+    .|.|.+||.|+.|+-..         +.|+.    .+|.++|.|+=    .+|.|.+...+     
T Consensus       469 C~g~G~~~CG----~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~-----  535 (783)
T KOG1226|consen  469 CHGNGTFVCG----QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNG-----  535 (783)
T ss_pred             cCCCCcEEec----ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCC-----
Confidence            6666665544    48999999999997322         22321    14555555542    13333332221     


Q ss_pred             CCCeeeeCCCCccCCCCccCCCCCCC---CCCCCCCEeccCCCCeeEeCCCCCccCCCCCcc--CCCCCCCC---CCCCC
Q psy7015         139 IAAYNCSCPPGYTGPSCESNVDECGS---NPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQ--SATNECES---SPCQN  210 (284)
Q Consensus       139 ~~~~~C~C~~g~~g~~C~~~~~~C~~---~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~--~~~~~C~~---~~C~~  210 (284)
                                -++|..|+-+.-.|..   ..|..+|.|.-    .+|+|.+||+|   ..|.  .+.+.|.+   ..|.+
T Consensus       536 ----------~i~G~fCECDnfsC~r~~g~lC~g~G~C~C----G~CvC~~GwtG---~~C~C~~std~C~~~~G~iCSG  598 (783)
T KOG1226|consen  536 ----------KIYGKFCECDNFSCERHKGVLCGGHGRCEC----GRCVCNPGWTG---SACNCPLSTDTCESSDGQICSG  598 (783)
T ss_pred             ----------ceeeeeeeccCcccccccCcccCCCCeEeC----CcEEcCCCCcc---CCCCCCCCCccccCCCCceeCC
Confidence                      1236777644434432   35888888864    48999999998   5443  34555643   23888


Q ss_pred             CCEEeeCCCCeeeecCCC-CccCCCcccCCcCCCCCCCCCCEEe
Q psy7015         211 GGVCVDLHAAYTCACLFG-FTGRNCDIELKICENSPCLNEALCL  253 (284)
Q Consensus       211 ~g~C~~~~g~~~C~C~~G-~~g~~C~~~~~~C~~~~C~~~~~C~  253 (284)
                      .|+|.-.    +|.|... |.|..|++... | ..+|..+..|+
T Consensus       599 rG~C~Cg----~C~C~~~~~sG~~CE~cpt-c-~~~C~~~~~Cv  636 (783)
T KOG1226|consen  599 RGTCECG----RCKCTDPPYSGEFCEKCPT-C-PDPCAENKSCV  636 (783)
T ss_pred             CceeeCC----ceEcCCCCcCcchhhcCCC-C-CCcccccccch
Confidence            8888643    6888777 99999985432 2 23366665553


No 17 
>KOG1836|consensus
Probab=98.17  E-value=4.9e-05  Score=77.88  Aligned_cols=57  Identities=32%  Similarity=0.674  Sum_probs=37.5

Q ss_pred             EEeeCCCCeeeecCCCCccCCCcccC--------CcCCCCCCCCCC----EEeecCCCeeeecCCCCcCCCcc
Q psy7015         213 VCVDLHAAYTCACLFGFTGRNCDIEL--------KICENSPCLNEA----LCLEEEEEQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       213 ~C~~~~g~~~C~C~~G~~g~~C~~~~--------~~C~~~~C~~~~----~C~~~~~~~~C~C~~G~~G~~C~  273 (284)
                      .|...  +..|.|.+|.+|..|+...        ..|....|...|    +|  .+..++|.|++++.|.+|.
T Consensus       953 ~c~~~--tGqc~c~~gVtgqrc~qc~~~~~~~~~~gc~~c~c~~~Gs~~~qc--~~~~G~c~c~~~~~g~~c~ 1021 (1705)
T KOG1836|consen  953 DCDVG--TGQCYCRPGVTGQRCDQCETYHFGFQTEGCGLCECDPLGSRGFQC--DPEDGQCPCRPGFEGRRCD 1021 (1705)
T ss_pred             ccccc--CCceeeecCccccccCccccCcccccccCCcceecccCCccccee--cccCCeeeecCCCCCcccc
Confidence            45433  3489999999998887321        223333355555    56  3445689999999998775


No 18 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.07  E-value=2.7e-06  Score=47.14  Aligned_cols=29  Identities=31%  Similarity=0.841  Sum_probs=18.5

Q ss_pred             CCCCCCCCCEEeecC-CCeeeecCCCCcCC
Q psy7015         242 ENSPCLNEALCLEEE-EEQVCYCVPDYHGN  270 (284)
Q Consensus       242 ~~~~C~~~~~C~~~~-~~~~C~C~~G~~G~  270 (284)
                      .+.+|.++|+|+... .+|.|+|++||+|+
T Consensus         2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            344666666666665 66666666666665


No 19 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.06  E-value=8.4e-06  Score=47.30  Aligned_cols=36  Identities=61%  Similarity=1.522  Sum_probs=30.8

Q ss_pred             CCCCCC-CCCCCCCeEecCCCCeeeeCCCCCc-CCCcc
Q psy7015          64 WDECWS-NPCHNGGSCIDGIAAYNCSCPPGYT-GPSCE   99 (284)
Q Consensus        64 ~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~-G~~C~   99 (284)
                      +++|.. .+|.++++|+++.++|.|.|++||. |..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            567776 7899889999999999999999998 87763


No 20 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.96  E-value=5.6e-06  Score=45.82  Aligned_cols=31  Identities=58%  Similarity=1.372  Sum_probs=27.1

Q ss_pred             CCCCCCCCCCEEeeCC-CCeeeecCCCCccCC
Q psy7015         203 CESSPCQNGGVCVDLH-AAYTCACLFGFTGRN  233 (284)
Q Consensus       203 C~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~~  233 (284)
                      |.+++|.++|+|+... ++|.|.|++||+|++
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            4456899999999988 899999999999964


No 21 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.93  E-value=1.8e-05  Score=45.86  Aligned_cols=29  Identities=59%  Similarity=1.382  Sum_probs=14.9

Q ss_pred             CCCCCCCEEeeCCCCeeeecCCCCc-cCCC
Q psy7015         206 SPCQNGGVCVDLHAAYTCACLFGFT-GRNC  234 (284)
Q Consensus       206 ~~C~~~g~C~~~~g~~~C~C~~G~~-g~~C  234 (284)
                      .+|.++++|+++.++|.|.|++||. |..|
T Consensus         9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        9 NPCQNGGTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence            3455555555555555555555555 4443


No 22 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.89  E-value=5.8e-06  Score=48.97  Aligned_cols=32  Identities=44%  Similarity=1.187  Sum_probs=24.9

Q ss_pred             CCCCCCCC--CCCCCCeEecCCCCeeeeCCCCCc
Q psy7015          63 NWDECWSN--PCHNGGSCIDGIAAYNCSCPPGYT   94 (284)
Q Consensus        63 ~~~~C~~~--~C~~~g~C~~~~g~~~C~C~~G~~   94 (284)
                      |++||...  .|..++.|+|+.|+|+|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            46777654  477788888888888888888887


No 23 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.77  E-value=5.3e-05  Score=43.41  Aligned_cols=35  Identities=63%  Similarity=1.551  Sum_probs=29.7

Q ss_pred             CCCCCC-CCCCCCCeEecCCCCeeeeCCCCCcCCCc
Q psy7015          64 WDECWS-NPCHNGGSCIDGIAAYNCSCPPGYTGPSC   98 (284)
Q Consensus        64 ~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~G~~C   98 (284)
                      +++|.. .+|..++.|++..+.|.|.|++||.|..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            456766 67888899999999999999999998776


No 24 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.75  E-value=2.2e-05  Score=46.45  Aligned_cols=31  Identities=32%  Similarity=0.985  Sum_probs=21.7

Q ss_pred             CCCCCC--CCCCCCCEEeeCCCCeeeecCCCCc
Q psy7015         200 TNECES--SPCQNGGVCVDLHAAYTCACLFGFT  230 (284)
Q Consensus       200 ~~~C~~--~~C~~~g~C~~~~g~~~C~C~~G~~  230 (284)
                      ++||..  +.|..++.|+|+.|+|.|.|++||+
T Consensus         2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            566643  3476677777777777777777776


No 25 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.61  E-value=0.00011  Score=42.08  Aligned_cols=29  Identities=34%  Similarity=0.938  Sum_probs=16.4

Q ss_pred             CCCCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015         244 SPCLNEALCLEEEEEQVCYCVPDYHGNRC  272 (284)
Q Consensus       244 ~~C~~~~~C~~~~~~~~C~C~~G~~G~~C  272 (284)
                      .+|.+++.|++..++|.|.|+.||.|..|
T Consensus         9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           9 NPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             CCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            34555555655555556666666665554


No 26 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.38  E-value=0.00036  Score=39.18  Aligned_cols=30  Identities=63%  Similarity=1.552  Sum_probs=25.4

Q ss_pred             CCCCCCCCeEecCCCCeeeeCCCCCcCC-Cc
Q psy7015          69 SNPCHNGGSCIDGIAAYNCSCPPGYTGP-SC   98 (284)
Q Consensus        69 ~~~C~~~g~C~~~~g~~~C~C~~G~~G~-~C   98 (284)
                      ..+|..++.|++..+.|+|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            4668888999998889999999999887 44


No 27 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.38  E-value=7.2e-05  Score=58.07  Aligned_cols=135  Identities=27%  Similarity=0.674  Sum_probs=72.7

Q ss_pred             CeEecCCCCeeeeCCCCCc---CCCcccCCCCC-----CCCCCCCCCEEeeCCC-----CceEeCCCCCeeeeecCCCCe
Q psy7015          76 GSCIDGIAAYNCSCPPGYT---GPSCESNVDEC-----GSNPCQNNGTCHDLLN-----GFVCSCHPGFTGNCIDGIAAY  142 (284)
Q Consensus        76 g~C~~~~g~~~C~C~~G~~---G~~C~~~~~~C-----~~~~C~~~~~C~~~~~-----~~~C~C~~g~~g~c~~~~~~~  142 (284)
                      |.-+...+.|.|.|++||.   -..|+.. .+|     ...+|...+.|++...     .|.|.|..||...        
T Consensus        11 G~LiQMSNHfEC~Cnegfvl~~EntCE~k-v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~--------   81 (197)
T PF06247_consen   11 GYLIQMSNHFECKCNEGFVLKNENTCEEK-VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILK--------   81 (197)
T ss_dssp             EEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES--------
T ss_pred             CEEEEccCceEEEcCCCcEEccccccccc-eecCcccccCccccchhhhhcCCCcccceeEEEecccCceee--------
Confidence            4444445667888888886   3445532 234     2356888888886642     3455555544421        


Q ss_pred             eeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCC---CCeeEeCCCCCccCCCCCccCCC-CCCCCCCCCCCCEEeeCC
Q psy7015         143 NCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLL---NGFVCSCHPGFTGWTGSLCQSAT-NECESSPCQNGGVCVDLH  218 (284)
Q Consensus       143 ~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~---g~~~C~C~~g~~g~~~~~c~~~~-~~C~~~~C~~~g~C~~~~  218 (284)
                               ...|.  ...|....|. .|.|+-.+   ....|.|.-|+...+...|..+. .+|. -.|..+.+|..+.
T Consensus        82 ---------~~vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~  148 (197)
T PF06247_consen   82 ---------QGVCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVD  148 (197)
T ss_dssp             ---------SSSEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEET
T ss_pred             ---------CCeEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeC
Confidence                     12232  2345555576 57887332   34589999998854445554322 2332 2377788999999


Q ss_pred             CCeeeecCCCCccC
Q psy7015         219 AAYTCACLFGFTGR  232 (284)
Q Consensus       219 g~~~C~C~~G~~g~  232 (284)
                      +-|+|.+..||.+.
T Consensus       149 ~~Y~C~~~~~~~~~  162 (197)
T PF06247_consen  149 GYYKCVCKEGFPGD  162 (197)
T ss_dssp             TEEEEEE-TT-EEE
T ss_pred             cEEEeecCCCCCCC
Confidence            99999999998754


No 28 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.35  E-value=0.0004  Score=39.06  Aligned_cols=28  Identities=61%  Similarity=1.541  Sum_probs=23.4

Q ss_pred             CCCCCCCeEecCCCCeeeeCCCCCcC-CCc
Q psy7015          70 NPCHNGGSCIDGIAAYNCSCPPGYTG-PSC   98 (284)
Q Consensus        70 ~~C~~~g~C~~~~g~~~C~C~~G~~G-~~C   98 (284)
                      .+|.++ +|++..+.|+|.|++||.| ..|
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            568777 8998888999999999988 555


No 29 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.35  E-value=0.00038  Score=39.11  Aligned_cols=30  Identities=33%  Similarity=0.951  Sum_probs=19.7

Q ss_pred             CCCCCCCCEEeecCCCeeeecCCCCcCC-Cc
Q psy7015         243 NSPCLNEALCLEEEEEQVCYCVPDYHGN-RC  272 (284)
Q Consensus       243 ~~~C~~~~~C~~~~~~~~C~C~~G~~G~-~C  272 (284)
                      ..+|.+++.|++..+.+.|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            3456666777666666777777777766 44


No 30 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.33  E-value=0.00016  Score=40.97  Aligned_cols=27  Identities=30%  Similarity=0.759  Sum_probs=18.8

Q ss_pred             CCCCCCCEEeecCCCeeeecCCCCcCC
Q psy7015         244 SPCLNEALCLEEEEEQVCYCVPDYHGN  270 (284)
Q Consensus       244 ~~C~~~~~C~~~~~~~~C~C~~G~~G~  270 (284)
                      ..|+.+++|++..+++.|+|++||.|+
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            347778888888888888888888876


No 31 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.26  E-value=0.00011  Score=57.11  Aligned_cols=102  Identities=24%  Similarity=0.643  Sum_probs=62.0

Q ss_pred             CCCCCCCEeccCC-----CCeeEeCCCCCccCCCCCccCCCCCCCCCCCCCCCEEeeCC---CCeeeecCCCCc---cCC
Q psy7015         165 NPCQNNGTCHDLL-----NGFVCSCHPGFTGWTGSLCQSATNECESSPCQNGGVCVDLH---AAYTCACLFGFT---GRN  233 (284)
Q Consensus       165 ~~C~~~~~C~~~~-----g~~~C~C~~g~~g~~~~~c~~~~~~C~~~~C~~~g~C~~~~---g~~~C~C~~G~~---g~~  233 (284)
                      .+|...++|++..     ..|.|.|.+||....+ .|.  ...|....|. .|.|+-.+   ....|.|.-|+.   ...
T Consensus        50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~k  125 (197)
T PF06247_consen   50 KPCGDYAKCINQANKGEERAYKCDCINGYILKQG-VCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKK  125 (197)
T ss_dssp             SEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-SEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTE
T ss_pred             ccccchhhhhcCCCcccceeEEEecccCceeeCC-eEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCc
Confidence            5688889998654     5799999999987433 442  3456666677 68997432   235899999987   223


Q ss_pred             CcccCC-cCCCCCCCCCCEEeecCCCeeeecCCCCcCCC
Q psy7015         234 CDIELK-ICENSPCLNEALCLEEEEEQVCYCVPDYHGNR  271 (284)
Q Consensus       234 C~~~~~-~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~  271 (284)
                      |..+-+ .| ...|..+-.|.....-|.|.|.+++.++.
T Consensus       126 Ctk~G~T~C-~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  126 CTKTGETKC-SLKCKENEECKLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             SEEEE---------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred             ccCCCccce-eeecCCCcceeeeCcEEEeecCCCCCCCC
Confidence            432211 22 23377788999999999999999997543


No 32 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.23  E-value=0.00057  Score=38.41  Aligned_cols=28  Identities=39%  Similarity=1.064  Sum_probs=17.2

Q ss_pred             CCCCCCCEEeecCCCeeeecCCCCcC-CCc
Q psy7015         244 SPCLNEALCLEEEEEQVCYCVPDYHG-NRC  272 (284)
Q Consensus       244 ~~C~~~~~C~~~~~~~~C~C~~G~~G-~~C  272 (284)
                      .+|.++ +|++..+++.|.|++||.| ..|
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            345555 6666666666666666666 444


No 33 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.21  E-value=0.00052  Score=37.74  Aligned_cols=26  Identities=27%  Similarity=0.740  Sum_probs=19.4

Q ss_pred             CCCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015         245 PCLNEALCLEEEEEQVCYCVPDYHGNRC  272 (284)
Q Consensus       245 ~C~~~~~C~~~~~~~~C~C~~G~~G~~C  272 (284)
                      .|+++|+|+..  .++|+|.+||+|+.|
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            47888888654  456888888888776


No 34 
>KOG1836|consensus
Probab=97.04  E-value=0.0062  Score=63.13  Aligned_cols=176  Identities=32%  Similarity=0.730  Sum_probs=91.7

Q ss_pred             eeCCCCCcCCCcccC-----------CCCCCCCCCCCCC---EEeeCCCCceEeCCCCCeeeeecCCCCeee-eCCCCcc
Q psy7015          87 CSCPPGYTGPSCESN-----------VDECGSNPCQNNG---TCHDLLNGFVCSCHPGFTGNCIDGIAAYNC-SCPPGYT  151 (284)
Q Consensus        87 C~C~~G~~G~~C~~~-----------~~~C~~~~C~~~~---~C~~~~~~~~C~C~~g~~g~c~~~~~~~~C-~C~~g~~  151 (284)
                      |.|++||+|..|+.-           .+.+...+|..++   .|...  +..|.|.....|        .+| +|..||+
T Consensus       697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~--tG~C~C~~~t~G--------~~C~~C~~GfY  766 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPR--TGQCKCKHNTFG--------GQCAQCVDGFY  766 (1705)
T ss_pred             ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCC--CCceecccCCCC--------CchhhhcCCCC
Confidence            899999999988731           1112222333333   23322  234555443332        344 4889999


Q ss_pred             CCCCccCCCCCCCCCCCCCCEeccCC--CCeeEe-CCCCCccCCCCCccC-----------CCCCCCCCCCCC-------
Q psy7015         152 GPSCESNVDECGSNPCQNNGTCHDLL--NGFVCS-CHPGFTGWTGSLCQS-----------ATNECESSPCQN-------  210 (284)
Q Consensus       152 g~~C~~~~~~C~~~~C~~~~~C~~~~--g~~~C~-C~~g~~g~~~~~c~~-----------~~~~C~~~~C~~-------  210 (284)
                      |..-......|..-+|...+.|....  ....|. |++||+|.....|..           ++..|.+.+|..       
T Consensus       767 g~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~  846 (1705)
T KOG1836|consen  767 GLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAF  846 (1705)
T ss_pred             CccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCcccc
Confidence            86554333347777788777776443  456787 999998843333321           111233322222       


Q ss_pred             ------CCEE---eeCCCCeee-ecCCCCccCCCc-ccCCcCCCCCCCCC------CEEeecCCCeeeecCCCCcCCCcc
Q psy7015         211 ------GGVC---VDLHAAYTC-ACLFGFTGRNCD-IELKICENSPCLNE------ALCLEEEEEQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       211 ------~g~C---~~~~g~~~C-~C~~G~~g~~C~-~~~~~C~~~~C~~~------~~C~~~~~~~~C~C~~G~~G~~C~  273 (284)
                            .+.|   +.......| .|.+||.|+.-. .+.+.|...-|...      .+|  ...++.|.|.+...|..|.
T Consensus       847 g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c--~~~tGQcec~~~v~g~~c~  924 (1705)
T KOG1836|consen  847 GNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTC--NPVTGQCECKPNVEGRDCL  924 (1705)
T ss_pred             ccccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccC--CCcccceeccCCCCccccc
Confidence                  1222   111222233 577777766443 11222322222221      234  4456678888888888775


Q ss_pred             c
Q psy7015         274 Y  274 (284)
Q Consensus       274 ~  274 (284)
                      .
T Consensus       925 ~  925 (1705)
T KOG1836|consen  925 Y  925 (1705)
T ss_pred             c
Confidence            3


No 35 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.99  E-value=0.00055  Score=38.79  Aligned_cols=28  Identities=29%  Similarity=0.838  Sum_probs=22.2

Q ss_pred             CCCCCCCEEeeCCCCeeeecCCCCccCC
Q psy7015         206 SPCQNGGVCVDLHAAYTCACLFGFTGRN  233 (284)
Q Consensus       206 ~~C~~~g~C~~~~g~~~C~C~~G~~g~~  233 (284)
                      ..|+.+++|+++.++++|.|++||+|+.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            3588899999999999999999999864


No 36 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.95  E-value=0.00071  Score=34.40  Aligned_cols=20  Identities=40%  Similarity=1.047  Sum_probs=13.1

Q ss_pred             CeeeecCCCCc----CCCccccccc
Q psy7015         258 EQVCYCVPDYH----GNRCQYQYDE  278 (284)
Q Consensus       258 ~~~C~C~~G~~----G~~C~~~~~~  278 (284)
                      +|.|.|++||.    |..|+ ||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~-DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE-DIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc-cCCC
Confidence            46777777775    45675 6665


No 37 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.93  E-value=0.00085  Score=34.11  Aligned_cols=11  Identities=64%  Similarity=1.352  Sum_probs=8.3

Q ss_pred             ceEEeCCCCCc
Q psy7015          46 SYTCYCIDGYT   56 (284)
Q Consensus        46 ~~~C~C~~G~~   56 (284)
                      ||+|+|++||.
T Consensus         1 sy~C~C~~Gy~   11 (24)
T PF12662_consen    1 SYTCSCPPGYQ   11 (24)
T ss_pred             CEEeeCCCCCc
Confidence            57788888876


No 38 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.91  E-value=0.00046  Score=29.69  Aligned_cols=13  Identities=38%  Similarity=1.242  Sum_probs=7.9

Q ss_pred             eeecCCCCcCCCc
Q psy7015         260 VCYCVPDYHGNRC  272 (284)
Q Consensus       260 ~C~C~~G~~G~~C  272 (284)
                      +|+|++||+|++|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            3667777777665


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.79  E-value=0.002  Score=35.45  Aligned_cols=26  Identities=38%  Similarity=0.925  Sum_probs=22.2

Q ss_pred             CCCCCCEEeeCCCCeeeecCCCCccCCC
Q psy7015         207 PCQNGGVCVDLHAAYTCACLFGFTGRNC  234 (284)
Q Consensus       207 ~C~~~g~C~~~~g~~~C~C~~G~~g~~C  234 (284)
                      .|.++|+|+...  .+|.|.+||+|..|
T Consensus         7 ~C~~~G~C~~~~--g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSPC--GRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCCC--CEEECCCCCcCCCC
Confidence            589999998763  38999999999876


No 40 
>smart00051 DSL delta serrate ligand.
Probab=95.60  E-value=0.02  Score=36.94  Aligned_cols=46  Identities=20%  Similarity=0.503  Sum_probs=33.4

Q ss_pred             eeeecCCCCccCCCcccCCcCCC-CCCCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015         221 YTCACLFGFTGRNCDIELKICEN-SPCLNEALCLEEEEEQVCYCVPDYHGNRC  272 (284)
Q Consensus       221 ~~C~C~~G~~g~~C~~~~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~G~~C  272 (284)
                      +.-.|.++|.|..|+.   .|.+ .-...+..|..   .+.++|.+||+|..|
T Consensus        17 ~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE---NGNKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCcCCccCC---EeCcCccccCCccCCc---CCCEecCCCCcCCCC
Confidence            4558999999999974   3322 12566788843   356999999999987


No 41 
>KOG3512|consensus
Probab=95.59  E-value=0.079  Score=47.28  Aligned_cols=87  Identities=22%  Similarity=0.487  Sum_probs=47.1

Q ss_pred             eE-eCCCCCccCCCCCccCCCCCCCCCCCCC----CCEEeeCCCCeeeecCCCCccCCCccc----------CCcCCCCC
Q psy7015         181 VC-SCHPGFTGWTGSLCQSATNECESSPCQN----GGVCVDLHAAYTCACLFGFTGRNCDIE----------LKICENSP  245 (284)
Q Consensus       181 ~C-~C~~g~~g~~~~~c~~~~~~C~~~~C~~----~g~C~~~~g~~~C~C~~G~~g~~C~~~----------~~~C~~~~  245 (284)
                      +| .|++||.-..+..- .+...|..-.|+.    +-+|..+.|  +|.|.+|.+|..|...          +.+|...|
T Consensus       372 hChyCreGyyRd~s~pl-~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnrCa~gyqqsrs~vapcik~p  448 (592)
T KOG3512|consen  372 HCHYCREGYYRDGSKPL-THRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNRCAPGYQQSRSPVAPCIKIP  448 (592)
T ss_pred             ccccccCccccCCCCCC-chhhhhhhcCCcccccccccccccCC--cccCCCCCcccccccccchhhcccCCCcCceecC
Confidence            45 48888765322111 1122333333443    345665555  8999999999888631          12222111


Q ss_pred             ------CCCCCEEeecCCCeeeecCCCCcCCCccc
Q psy7015         246 ------CLNEALCLEEEEEQVCYCVPDYHGNRCQY  274 (284)
Q Consensus       246 ------C~~~~~C~~~~~~~~C~C~~G~~G~~C~~  274 (284)
                            ++++.+    .....+.|+.++.|.+++.
T Consensus       449 ~~~~~~~~s~ve----~qd~~s~Ck~~~~~~r~n~  479 (592)
T KOG3512|consen  449 TDAPTLGSSGVE----PQDQCSKCKASPGGKRLNQ  479 (592)
T ss_pred             CCCccccCCCCc----chhccccCCCCCcceeccc
Confidence                  222222    2344578999998887764


No 42 
>KOG1218|consensus
Probab=95.55  E-value=1.1  Score=38.76  Aligned_cols=182  Identities=30%  Similarity=0.679  Sum_probs=82.4

Q ss_pred             CCceEEeCCCCCccC-CCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCCCCCCCEEeeCCCC
Q psy7015          44 PSSYTCYCIDGYTGV-HCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNG  122 (284)
Q Consensus        44 ~g~~~C~C~~G~~g~-~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~  122 (284)
                      ..+..|.|.+||.|. .+.. ....  .++...-.+  .....+|.+..+|.+..|..... .  ..  .++.|..    
T Consensus        12 ~~~~~c~c~~~~~g~~~~~~-~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~c~~~~~-~--~~--~~~~c~~----   77 (316)
T KOG1218|consen   12 GGSGQCFCDPGYTGRLQCEH-QAVT--SACSGICPC--EVNSGECGLGYGFVGSVCRIECV-C--GN--AGGGCSQ----   77 (316)
T ss_pred             CCCCceecCCCccccccccC-CCCC--ccccccCCc--cCCceeEecccccCCCccccccc-c--CC--CCCcccC----
Confidence            356789999999995 2221 1111  111111111  22344688899999887653211 1  00  1222221    


Q ss_pred             ceEeCCCCCeeeeecCCCCeeeeC-CCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCCCccC---
Q psy7015         123 FVCSCHPGFTGNCIDGIAAYNCSC-PPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQS---  198 (284)
Q Consensus       123 ~~C~C~~g~~g~c~~~~~~~~C~C-~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~---  198 (284)
                       .+.|..++.-.      .....+ ..+|.|..|.. ..++... |.. .+|.+...  .|.+..+|.+   ..|..   
T Consensus        78 -~~~c~~~~~~~------~~~~~~~~~~~~g~~C~~-~~~~~~~-c~~-~~C~~~~~--~c~~~~~~~~---~~C~~~~~  142 (316)
T KOG1218|consen   78 -PCRCKNGGTCV------SSTGYCHLNGYEGPQCES-PCPCGDG-CAE-KTCANPRR--ECRCGGGYIG---EQCGEENL  142 (316)
T ss_pred             -ccccCCCCccc------CCCCcccCCCCCcccccC-CCCcCCc-ccc-cccCCCcc--ceecCCcCcc---ccccccCC
Confidence             11122222111      111123 46777777763 3333222 222 34443322  4555555544   33332   


Q ss_pred             CCCCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCCcCCC-CCCCCCCEEeecCC
Q psy7015         199 ATNECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELKICEN-SPCLNEALCLEEEE  257 (284)
Q Consensus       199 ~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~~C~~-~~C~~~~~C~~~~~  257 (284)
                      ....|... |.....+...  ...|.|.+||+|..+......|.. ..+.+++.|....+
T Consensus       143 ~g~~C~~~-c~~~~~~~~~--~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~  199 (316)
T KOG1218|consen  143 VGLKCQRD-CQCTGGCDCK--NGICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTG  199 (316)
T ss_pred             CCCCccCC-CCCccccCCC--CCceeccCCcccccccccCCCcCCCcccCCCCeeecccc
Confidence            11112111 2111222212  236889999999988754433442 34666667755443


No 43 
>smart00051 DSL delta serrate ligand.
Probab=94.28  E-value=0.094  Score=33.80  Aligned_cols=47  Identities=26%  Similarity=0.605  Sum_probs=32.5

Q ss_pred             ceEEeCCCCCccCCCCCCCCCCCC-CCCCCCCeEecCCCCeeeeCCCCCcCCCc
Q psy7015          46 SYTCYCIDGYTGVHCQTNWDECWS-NPCHNGGSCIDGIAAYNCSCPPGYTGPSC   98 (284)
Q Consensus        46 ~~~C~C~~G~~g~~C~~~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~G~~C   98 (284)
                      .+.=.|.++|.|..|+.   .|.+ .....+.+|.. .|  .+.|.+||+|..|
T Consensus        16 ~~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~C   63 (63)
T smart00051       16 QIRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE-NG--NKGCLEGWMGPYC   63 (63)
T ss_pred             EEEeeCCCCCcCCccCC---EeCcCccccCCccCCc-CC--CEecCCCCcCCCC
Confidence            34557999999999974   3332 12456677754 34  4899999999876


No 44 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.86  E-value=0.037  Score=31.27  Aligned_cols=18  Identities=39%  Similarity=0.855  Sum_probs=8.3

Q ss_pred             ceecCCCceEEeCCCCCc
Q psy7015          39 PIFAVPSSYTCYCIDGYT   56 (284)
Q Consensus        39 ~~~~~~g~~~C~C~~G~~   56 (284)
                      .+.+++++|+|.|++||.
T Consensus        11 ~C~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   11 ICVNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             EEEEETTSEEEE-STTEE
T ss_pred             CCccCCCceEeECCCCCE
Confidence            344445555555555554


No 45 
>KOG3512|consensus
Probab=93.43  E-value=0.19  Score=44.93  Aligned_cols=105  Identities=24%  Similarity=0.605  Sum_probs=56.1

Q ss_pred             CCCCC-EeccCCC-CeeEeCCCCCccCCCCCccC-------------CCCCCCCCCCCC-------------------CC
Q psy7015         167 CQNNG-TCHDLLN-GFVCSCHPGFTGWTGSLCQS-------------ATNECESSPCQN-------------------GG  212 (284)
Q Consensus       167 C~~~~-~C~~~~g-~~~C~C~~g~~g~~~~~c~~-------------~~~~C~~~~C~~-------------------~g  212 (284)
                      |..++ .|+.... ..+|.|..+-.|.+...|..             +.++|....|..                   +|
T Consensus       280 CNgHAs~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~Sgg  359 (592)
T KOG3512|consen  280 CNGHASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGG  359 (592)
T ss_pred             ecCccceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccc
Confidence            44333 4665444 48888888877744333321             234444333322                   34


Q ss_pred             EEee----CCCCeee-ecCCCCccCCCcc--cCCcCCCCCCCC----CCEEeecCCCeeeecCCCCcCCCccc
Q psy7015         213 VCVD----LHAAYTC-ACLFGFTGRNCDI--ELKICENSPCLN----EALCLEEEEEQVCYCVPDYHGNRCQY  274 (284)
Q Consensus       213 ~C~~----~~g~~~C-~C~~G~~g~~C~~--~~~~C~~~~C~~----~~~C~~~~~~~~C~C~~G~~G~~C~~  274 (284)
                      +|+|    +.|. .| .|++||+-+.-..  .-..|....|+.    +-+|  +..+++|.|++|.+|..|..
T Consensus       360 vClnCrHnTaGr-hChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktC--Nq~tGqCpCkeGvtG~tCnr  429 (592)
T KOG3512|consen  360 VCLNCRHNTAGR-HCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTC--NQTTGQCPCKEGVTGLTCNR  429 (592)
T ss_pred             eEeecccCCCCc-ccccccCccccCCCCCCchhhhhhhcCCcccccccccc--cccCCcccCCCCCccccccc
Confidence            5543    3332 34 5888887433221  112233333433    3456  44577899999999998853


No 46 
>KOG1218|consensus
Probab=92.95  E-value=5.5  Score=34.30  Aligned_cols=148  Identities=30%  Similarity=0.724  Sum_probs=68.4

Q ss_pred             CceEEeCCCCCccCCCCCCCC-CCCCCCCCCCCeEecCC--CCeeeeC-CCCCcCCCcccCCCCCCCCCCCCCCEEeeCC
Q psy7015          45 SSYTCYCIDGYTGVHCQTNWD-ECWSNPCHNGGSCIDGI--AAYNCSC-PPGYTGPSCESNVDECGSNPCQNNGTCHDLL  120 (284)
Q Consensus        45 g~~~C~C~~G~~g~~C~~~~~-~C~~~~C~~~g~C~~~~--g~~~C~C-~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~  120 (284)
                      .+..|.+..+|.+..|..... ......|.....|....  ..+...| ..+|.|..|+. ..++... |.. ..|.+..
T Consensus        47 ~~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~g~~C~~-~~~~~~~-c~~-~~C~~~~  123 (316)
T KOG1218|consen   47 NSGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKNGGTCVSSTGYCHLNGYEGPQCES-PCPCGDG-CAE-KTCANPR  123 (316)
T ss_pred             CceeEecccccCCCccccccccCCCCCcccCccccCCCCcccCCCCcccCCCCCcccccC-CCCcCCc-ccc-cccCCCc
Confidence            456788999999888764211 11222244444443221  1222344 68888888863 2333222 222 3444322


Q ss_pred             CCceEeCCCCCeeeeecCCCCeeeeCCCCccCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCccCCCCCccCCC
Q psy7015         121 NGFVCSCHPGFTGNCIDGIAAYNCSCPPGYTGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTGWTGSLCQSAT  200 (284)
Q Consensus       121 ~~~~C~C~~g~~g~c~~~~~~~~C~C~~g~~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~c~~~~  200 (284)
                      .  .|.+..+|.+.        .|.. +++.|..|..+        |.....+..  ....|.|.+||.+   ..+....
T Consensus       124 ~--~c~~~~~~~~~--------~C~~-~~~~g~~C~~~--------c~~~~~~~~--~~~~c~c~~g~~g---~~~~~~~  179 (316)
T KOG1218|consen  124 R--ECRCGGGYIGE--------QCGE-ENLVGLKCQRD--------CQCTGGCDC--KNGICTCQPGFVG---VFCVESC  179 (316)
T ss_pred             c--ceecCCcCccc--------cccc-cCCCCCCccCC--------CCCccccCC--CCCceeccCCccc---ccccccC
Confidence            1  34444444322        1111 25556666422        111111211  1236778888877   4443222


Q ss_pred             CCCC-CCCCCCCCEEeeCCC
Q psy7015         201 NECE-SSPCQNGGVCVDLHA  219 (284)
Q Consensus       201 ~~C~-~~~C~~~g~C~~~~g  219 (284)
                      ..|. ...+.+++.|....+
T Consensus       180 ~~c~~~~~~~~g~~C~~~~~  199 (316)
T KOG1218|consen  180 SGCSPLTACENGAKCNRSTG  199 (316)
T ss_pred             CCcCCCcccCCCCeeecccc
Confidence            2233 234556667776554


No 47 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.82  E-value=0.096  Score=29.57  Aligned_cols=20  Identities=30%  Similarity=0.850  Sum_probs=16.3

Q ss_pred             CEEeeCCCCeeeecCCCCcc
Q psy7015         212 GVCVDLHAAYTCACLFGFTG  231 (284)
Q Consensus       212 g~C~~~~g~~~C~C~~G~~g  231 (284)
                      ..|++++++|+|.|++||+-
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L   29 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKL   29 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE
T ss_pred             CCCccCCCceEeECCCCCEE
Confidence            38899999999999999974


No 48 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=92.50  E-value=0.082  Score=29.80  Aligned_cols=26  Identities=23%  Similarity=0.682  Sum_probs=13.4

Q ss_pred             CCCCCCCCEEeecC-CCeeeecCCCCc
Q psy7015         243 NSPCLNEALCLEEE-EEQVCYCVPDYH  268 (284)
Q Consensus       243 ~~~C~~~~~C~~~~-~~~~C~C~~G~~  268 (284)
                      ...|..++.|++.. ++..|.|..||.
T Consensus         4 ~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             SS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             CccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            34456666665544 666666666664


No 49 
>PHA02887 EGF-like protein; Provisional
Probab=92.30  E-value=0.13  Score=36.93  Aligned_cols=28  Identities=32%  Similarity=0.966  Sum_probs=20.6

Q ss_pred             CCCCCEEee--cCCCeeeecCCCCcCCCccc
Q psy7015         246 CLNEALCLE--EEEEQVCYCVPDYHGNRCQY  274 (284)
Q Consensus       246 C~~~~~C~~--~~~~~~C~C~~G~~G~~C~~  274 (284)
                      |. +|+|.-  ......|.|.+||+|.+|+.
T Consensus        94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             ee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence            55 467754  34557799999999999984


No 50 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=91.92  E-value=0.15  Score=37.31  Aligned_cols=28  Identities=36%  Similarity=0.891  Sum_probs=20.8

Q ss_pred             CCCCCEEee--cCCCeeeecCCCCcCCCccc
Q psy7015         246 CLNEALCLE--EEEEQVCYCVPDYHGNRCQY  274 (284)
Q Consensus       246 C~~~~~C~~--~~~~~~C~C~~G~~G~~C~~  274 (284)
                      |.+ |+|.-  +...+.|.|..||+|.+||.
T Consensus        53 ClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         53 CLH-GDCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             eEC-CEEEeeccCCCceeECCCCcccccccc
Confidence            554 47754  34677899999999999985


No 51 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=91.70  E-value=0.068  Score=30.13  Aligned_cols=27  Identities=22%  Similarity=0.580  Sum_probs=15.9

Q ss_pred             CCCCCCCCCeEecCC-CCeeeeCCCCCc
Q psy7015          68 WSNPCHNGGSCIDGI-AAYNCSCPPGYT   94 (284)
Q Consensus        68 ~~~~C~~~g~C~~~~-g~~~C~C~~G~~   94 (284)
                      ....|..++.|++.. |+++|.|.+||.
T Consensus         3 ~~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    3 IDTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             cCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            334566777777654 777788888775


No 52 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=89.34  E-value=0.46  Score=39.09  Aligned_cols=38  Identities=29%  Similarity=0.519  Sum_probs=25.6

Q ss_pred             cCCCCCCCCCCCCCCCCCCCeEecCCCCeeeeCCCCCcC
Q psy7015          57 GVHCQTNWDECWSNPCHNGGSCIDGIAAYNCSCPPGYTG   95 (284)
Q Consensus        57 g~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G   95 (284)
                      +..|+ +.++|...+......|.++.|+|.|.|++||+.
T Consensus       181 ~~~C~-~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         181 GKICV-VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             cccCc-CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence            34454 566775443333357888888888888888874


No 53 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=89.34  E-value=0.25  Score=29.90  Aligned_cols=22  Identities=32%  Similarity=0.759  Sum_probs=15.3

Q ss_pred             EEeecCCCeeeecCCCCcCCCccc
Q psy7015         251 LCLEEEEEQVCYCVPDYHGNRCQY  274 (284)
Q Consensus       251 ~C~~~~~~~~C~C~~G~~G~~C~~  274 (284)
                      .|..  .+++|.|+++|+|.+|+.
T Consensus        12 ~C~~--~~G~C~C~~~~~G~~C~~   33 (49)
T PF00053_consen   12 TCDP--STGQCVCKPGTTGPRCDQ   33 (49)
T ss_dssp             SEEE--TCEEESBSTTEESTTS-E
T ss_pred             cccC--CCCEEeccccccCCcCcC
Confidence            5533  556788888888888864


No 54 
>PHA02887 EGF-like protein; Provisional
Probab=88.52  E-value=0.51  Score=33.96  Aligned_cols=28  Identities=36%  Similarity=1.010  Sum_probs=21.9

Q ss_pred             CCCCCeEec--CCCCeeeeCCCCCcCCCccc
Q psy7015          72 CHNGGSCID--GIAAYNCSCPPGYTGPSCES  100 (284)
Q Consensus        72 C~~~g~C~~--~~g~~~C~C~~G~~G~~C~~  100 (284)
                      |- +|+|.-  ....+.|.|+.||+|.+|+.
T Consensus        94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         94 CI-NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             ee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence            55 578873  34567899999999999973


No 55 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=87.88  E-value=0.54  Score=28.64  Aligned_cols=17  Identities=35%  Similarity=0.917  Sum_probs=13.9

Q ss_pred             CCeeeecCCCCcCCCcc
Q psy7015         257 EEQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       257 ~~~~C~C~~G~~G~~C~  273 (284)
                      .+++|.|+++|+|.+|+
T Consensus        17 ~~G~C~C~~~~~G~~C~   33 (50)
T cd00055          17 GTGQCECKPNTTGRRCD   33 (50)
T ss_pred             CCCEEeCCCcCCCCCCC
Confidence            45678899999999886


No 56 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=87.77  E-value=0.32  Score=29.88  Aligned_cols=36  Identities=22%  Similarity=0.541  Sum_probs=18.5

Q ss_pred             CCCCCCCEEee----cCCCeeeecCCCCcCCCcccccccc
Q psy7015         244 SPCLNEALCLE----EEEEQVCYCVPDYHGNRCQYQYDEC  279 (284)
Q Consensus       244 ~~C~~~~~C~~----~~~~~~C~C~~G~~G~~C~~~~~~C  279 (284)
                      .+|+.||....    ..+...|.|..-|.|.+|+..+..|
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~~C   56 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIPNC   56 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-TT-
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCCCCcccCCCCC
Confidence            35777776642    3455779999999999998766544


No 57 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=87.70  E-value=0.16  Score=32.68  Aligned_cols=47  Identities=28%  Similarity=0.648  Sum_probs=19.7

Q ss_pred             ceEEeCCCCCccCCCCCCCCCCCCC-CCCCCCeEecCCCCeeeeCCCCCcCCCc
Q psy7015          46 SYTCYCIDGYTGVHCQTNWDECWSN-PCHNGGSCIDGIAAYNCSCPPGYTGPSC   98 (284)
Q Consensus        46 ~~~C~C~~G~~g~~C~~~~~~C~~~-~C~~~g~C~~~~g~~~C~C~~G~~G~~C   98 (284)
                      .+.-.|.+.|.|..|..   -|.+. .-..+-+|.. .|.  =.|.+||+|..|
T Consensus        16 ~~rv~C~~nyyG~~C~~---~C~~~~d~~ghy~Cd~-~G~--~~C~~Gw~G~~C   63 (63)
T PF01414_consen   16 RIRVVCDENYYGPNCSK---FCKPRDDSFGHYTCDS-NGN--KVCLPGWTGPNC   63 (63)
T ss_dssp             -------TTEETTTT-E---E---EEETTEEEEE-S-S----EEE-TTEESTTS
T ss_pred             EEEEECCCCCCCccccC---CcCCCcCCcCCcccCC-CCC--CCCCCCCcCCCC
Confidence            45668999999998874   22221 0122334442 232  478899998876


No 58 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=85.91  E-value=0.78  Score=33.62  Aligned_cols=28  Identities=43%  Similarity=1.098  Sum_probs=21.5

Q ss_pred             CCCCCeEec--CCCCeeeeCCCCCcCCCccc
Q psy7015          72 CHNGGSCID--GIAAYNCSCPPGYTGPSCES  100 (284)
Q Consensus        72 C~~~g~C~~--~~g~~~C~C~~G~~G~~C~~  100 (284)
                      |-+ |.|.-  ....+.|.|..||+|.+|+.
T Consensus        53 ClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         53 CLH-GDCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             eEC-CEEEeeccCCCceeECCCCcccccccc
Confidence            444 47863  34678899999999999973


No 59 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.04  E-value=1.6  Score=35.83  Aligned_cols=38  Identities=21%  Similarity=0.552  Sum_probs=27.9

Q ss_pred             CCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCcc
Q psy7015         152 GPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTG  190 (284)
Q Consensus       152 g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g  190 (284)
                      +..|. +.++|...+......|.++.|+|.|.|++||+.
T Consensus       181 ~~~C~-~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         181 GKICV-VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             cccCc-CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence            45565 567775433222358999999999999999986


No 60 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=82.33  E-value=1.1  Score=27.07  Aligned_cols=22  Identities=36%  Similarity=0.709  Sum_probs=16.9

Q ss_pred             CEEeeCCCCeeeecCCCCccCCCc
Q psy7015         212 GVCVDLHAAYTCACLFGFTGRNCD  235 (284)
Q Consensus       212 g~C~~~~g~~~C~C~~G~~g~~C~  235 (284)
                      ..|....  .+|.|+++|+|..|+
T Consensus        11 ~~C~~~~--G~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   11 QTCDPST--GQCVCKPGTTGPRCD   32 (49)
T ss_dssp             SSEEETC--EEESBSTTEESTTS-
T ss_pred             CcccCCC--CEEeccccccCCcCc
Confidence            3676644  489999999999997


No 61 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=80.24  E-value=1.6  Score=26.08  Aligned_cols=16  Identities=38%  Similarity=1.007  Sum_probs=11.5

Q ss_pred             CeeeecCCCCcCCCcc
Q psy7015         258 EQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       258 ~~~C~C~~G~~G~~C~  273 (284)
                      +++|.|+++|+|.+|+
T Consensus        17 ~G~C~C~~~~~G~~C~   32 (46)
T smart00180       17 TGQCECKPNVTGRRCD   32 (46)
T ss_pred             CCEEECCCCCCCCCCC
Confidence            4567777777777775


No 62 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=79.92  E-value=1.9  Score=26.19  Aligned_cols=20  Identities=40%  Similarity=0.821  Sum_probs=15.5

Q ss_pred             EeeCCCCeeeecCCCCccCCCc
Q psy7015         214 CVDLHAAYTCACLFGFTGRNCD  235 (284)
Q Consensus       214 C~~~~g~~~C~C~~G~~g~~C~  235 (284)
                      |....|  +|.|+++|+|..|+
T Consensus        14 C~~~~G--~C~C~~~~~G~~C~   33 (50)
T cd00055          14 CDPGTG--QCECKPNTTGRRCD   33 (50)
T ss_pred             ccCCCC--EEeCCCcCCCCCCC
Confidence            543333  89999999999996


No 63 
>KOG3516|consensus
Probab=78.39  E-value=2  Score=43.18  Aligned_cols=39  Identities=36%  Similarity=1.053  Sum_probs=34.6

Q ss_pred             CCCCCCCCCCCCCCeEecCCCCeeeeCC-CCCcCCCcccC
Q psy7015          63 NWDECWSNPCHNGGSCIDGIAAYNCSCP-PGYTGPSCESN  101 (284)
Q Consensus        63 ~~~~C~~~~C~~~g~C~~~~g~~~C~C~-~G~~G~~C~~~  101 (284)
                      .++.|.+++|.++|.|......|.|.|. .||+|..|...
T Consensus       544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHts  583 (1306)
T KOG3516|consen  544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTS  583 (1306)
T ss_pred             cccccCCccccCCCcccccccceeEeccccccccccccCC
Confidence            4678999999999999998889999997 99999999753


No 64 
>KOG3516|consensus
Probab=75.53  E-value=2.4  Score=42.57  Aligned_cols=41  Identities=24%  Similarity=0.596  Sum_probs=31.7

Q ss_pred             CCcCCCCCCCCCCEEeecCCCeeeecC-CCCcCCCccccccc
Q psy7015         238 LKICENSPCLNEALCLEEEEEQVCYCV-PDYHGNRCQYQYDE  278 (284)
Q Consensus       238 ~~~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~G~~C~~~~~~  278 (284)
                      .+.|.+.+|..+|.|...-..++|.|. .||.|..|...|-|
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e  586 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE  586 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence            467778888888888777777888887 88888888766544


No 65 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=67.92  E-value=6.4  Score=21.70  Aligned_cols=22  Identities=27%  Similarity=0.415  Sum_probs=14.5

Q ss_pred             EeccCCCCeeEeCCCCCccCCCC
Q psy7015         172 TCHDLLNGFVCSCHPGFTGWTGS  194 (284)
Q Consensus       172 ~C~~~~g~~~C~C~~g~~g~~~~  194 (284)
                      .|..... ..|.|++||....+.
T Consensus        11 ~CDpn~~-~~C~CPeGyIlde~~   32 (34)
T PF09064_consen   11 DCDPNSP-GQCFCPEGYILDEGS   32 (34)
T ss_pred             ccCCCCC-CceeCCCceEecCCc
Confidence            5554332 489999999875443


No 66 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=67.78  E-value=7.5  Score=27.89  Aligned_cols=31  Identities=32%  Similarity=0.720  Sum_probs=22.5

Q ss_pred             CCCCC-CCCCCCCCeEecCCCCeeeeCCCCCcC
Q psy7015          64 WDECW-SNPCHNGGSCIDGIAAYNCSCPPGYTG   95 (284)
Q Consensus        64 ~~~C~-~~~C~~~g~C~~~~g~~~C~C~~G~~G   95 (284)
                      .+.|. ...|+..|.|... ....|.|.+||.-
T Consensus        77 ~d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNSN-NSPKCSCLPGFEP  108 (110)
T ss_pred             ccCCCCccccCCccEeCCC-CCCceECCCCcCC
Confidence            35565 4669999999643 4557999999964


No 67 
>KOG3514|consensus
Probab=67.28  E-value=4  Score=40.70  Aligned_cols=35  Identities=46%  Similarity=1.162  Sum_probs=31.8

Q ss_pred             CCCCCCCCCCCeEecCCCCeeeeC-CCCCcCCCccc
Q psy7015          66 ECWSNPCHNGGSCIDGIAAYNCSC-PPGYTGPSCES  100 (284)
Q Consensus        66 ~C~~~~C~~~g~C~~~~g~~~C~C-~~G~~G~~C~~  100 (284)
                      .|.++||.++|.|...-..|.|.| ..||.|+.|+.
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence            689999999999999999999999 46899999984


No 68 
>KOG3514|consensus
Probab=62.74  E-value=5.9  Score=39.60  Aligned_cols=36  Identities=33%  Similarity=0.851  Sum_probs=32.1

Q ss_pred             cCCCCCCCCCCEEeecCCCeeeec-CCCCcCCCcccc
Q psy7015         240 ICENSPCLNEALCLEEEEEQVCYC-VPDYHGNRCQYQ  275 (284)
Q Consensus       240 ~C~~~~C~~~~~C~~~~~~~~C~C-~~G~~G~~C~~~  275 (284)
                      .|+..||.|+|.|...-..+.|.| ..||.|..||..
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE  661 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE  661 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCccccce
Confidence            789999999999988888999999 579999999864


No 69 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=59.68  E-value=9.6  Score=27.11  Aligned_cols=9  Identities=33%  Similarity=0.752  Sum_probs=6.4

Q ss_pred             CCcCCCccc
Q psy7015         266 DYHGNRCQY  274 (284)
Q Consensus       266 G~~G~~C~~  274 (284)
                      .|.|+.|+.
T Consensus        53 ~W~G~aCqK   61 (103)
T PF12955_consen   53 HWGGPACQK   61 (103)
T ss_pred             eeccccccc
Confidence            577777874


No 70 
>KOG3509|consensus
Probab=53.43  E-value=29  Score=34.84  Aligned_cols=71  Identities=28%  Similarity=0.658  Sum_probs=51.0

Q ss_pred             CCCCCCCCCCCCEEeeCCCCeeeecCCCCccCCCcccCCcCCCCC-CCCCCEEeecCCCeeeecCCCCcCCCc
Q psy7015         201 NECESSPCQNGGVCVDLHAAYTCACLFGFTGRNCDIELKICENSP-CLNEALCLEEEEEQVCYCVPDYHGNRC  272 (284)
Q Consensus       201 ~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~C~~~~~~C~~~~-C~~~~~C~~~~~~~~C~C~~G~~G~~C  272 (284)
                      +.|...++...+.|..+.....|.|++||+|..|+...+.+...+ =.-.++|....+.....|.+| .|...
T Consensus       407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg-~g~~~  478 (964)
T KOG3509|consen  407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG-AGAPT  478 (964)
T ss_pred             CccccccCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC-CCCcc
Confidence            355666777778888777778899999999999987666665432 223466766655667788888 66655


No 71 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=52.88  E-value=18  Score=25.91  Aligned_cols=24  Identities=21%  Similarity=0.721  Sum_probs=11.5

Q ss_pred             CCCCCCCEEeecCCCeeeecCCCCc
Q psy7015         244 SPCLNEALCLEEEEEQVCYCVPDYH  268 (284)
Q Consensus       244 ~~C~~~~~C~~~~~~~~C~C~~G~~  268 (284)
                      ..|...+.|.. .....|.|.+||.
T Consensus        84 ~~CG~~g~C~~-~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   84 GFCGPNGICNS-NNSPKCSCLPGFE  107 (110)
T ss_pred             cccCCccEeCC-CCCCceECCCCcC
Confidence            34555555532 2233455555554


No 72 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=46.40  E-value=48  Score=19.93  Aligned_cols=30  Identities=37%  Similarity=0.938  Sum_probs=20.2

Q ss_pred             cCCCCccCCCCCCCCCCCCCCEeccCCCCeeEeCCCCCcc
Q psy7015         151 TGPSCESNVDECGSNPCQNNGTCHDLLNGFVCSCHPGFTG  190 (284)
Q Consensus       151 ~g~~C~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~g~~g  190 (284)
                      .|..|..+      ..|..++.|++    .+|+|++||..
T Consensus        18 ~g~~C~~~------~qC~~~s~C~~----g~C~C~~g~~~   47 (52)
T PF01683_consen   18 PGESCESD------EQCIGGSVCVN----GRCQCPPGYVE   47 (52)
T ss_pred             CCCCCCCc------CCCCCcCEEcC----CEeECCCCCEe
Confidence            35666532      23667788864    48999999865


No 73 
>KOG0196|consensus
Probab=45.55  E-value=43  Score=32.98  Aligned_cols=67  Identities=22%  Similarity=0.535  Sum_probs=36.7

Q ss_pred             CCCCCEEeeCCCCeeeecCCCCc----cCCCccc----------CCcCCCCCCCCCCEEeecCCCeeeecCCCCcCCCcc
Q psy7015         208 CQNGGVCVDLHAAYTCACLFGFT----GRNCDIE----------LKICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQ  273 (284)
Q Consensus       208 C~~~g~C~~~~g~~~C~C~~G~~----g~~C~~~----------~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~  273 (284)
                      |...|.-+...|  .|.|.+||.    +..|+..          ...|.  +|..+.. ...+++..|.|..||.-..=+
T Consensus       248 C~~dGeWlvpiG--~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~--~CP~~S~-s~~ega~~C~C~~gyyRA~~D  322 (996)
T KOG0196|consen  248 CSGDGEWLVPIG--GCVCKAGYEEAENGKACQACPPGTYKASQGDSLCL--PCPPNSH-SSSEGATSCTCENGYYRADSD  322 (996)
T ss_pred             EcCCCcEEEEcC--ceeecCCCCcccCCCcceeCCCCcccCCCCCCCCC--CCCCCCC-CCCCCCCcccccCCcccCCCC
Confidence            555555544444  688888886    3444411          11222  2332221 124567789999999866555


Q ss_pred             cccccc
Q psy7015         274 YQYDEC  279 (284)
Q Consensus       274 ~~~~~C  279 (284)
                      .+--+|
T Consensus       323 p~~mpC  328 (996)
T KOG0196|consen  323 PPSMPC  328 (996)
T ss_pred             CCCCCC
Confidence            444445


No 74 
>KOG0196|consensus
Probab=38.91  E-value=84  Score=31.11  Aligned_cols=60  Identities=28%  Similarity=0.651  Sum_probs=33.1

Q ss_pred             eEEeCCCCCc----cCCCCCC----------CCCCCCCCCCCCCeEecCCCCeeeeCCCCCcCCCcccCCCCCCCCC
Q psy7015          47 YTCYCIDGYT----GVHCQTN----------WDECWSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCESNVDECGSNP  109 (284)
Q Consensus        47 ~~C~C~~G~~----g~~C~~~----------~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~  109 (284)
                      ..|.|.+||+    +..|+..          ...|.  +|..+..= ..+++..|.|..||+-..-+.....|...|
T Consensus       259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~--~CP~~S~s-~~ega~~C~C~~gyyRA~~Dp~~mpCT~PP  332 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCL--PCPPNSHS-SSEGATSCTCENGYYRADSDPPSMPCTRPP  332 (996)
T ss_pred             CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCC--CCCCCCCC-CCCCCCcccccCCcccCCCCCCCCCCCCCC
Confidence            3699999996    4556521          11222  23333321 235667899999998554433333454433


No 75 
>KOG3509|consensus
Probab=23.93  E-value=72  Score=32.25  Aligned_cols=43  Identities=30%  Similarity=0.862  Sum_probs=32.5

Q ss_pred             cCCCCCCCCCCEEeecCCCeeeecCCCCcCCCccccccccccC
Q psy7015         240 ICENSPCLNEALCLEEEEEQVCYCVPDYHGNRCQYQYDECQIT  282 (284)
Q Consensus       240 ~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~  282 (284)
                      .|...++...+.|-.......|.|++||+|+.|+...+.|...
T Consensus       408 ~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~  450 (964)
T KOG3509|consen  408 VCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRS  450 (964)
T ss_pred             ccccccCCCCccccccccccceeccccccCchhhccCcccccc
Confidence            4445566666777667777889999999999999777776543


No 76 
>KOG3607|consensus
Probab=22.18  E-value=67  Score=31.47  Aligned_cols=28  Identities=25%  Similarity=0.604  Sum_probs=23.2

Q ss_pred             CCCCCCEEeecCCCeeeecCCCCcCCCcccc
Q psy7015         245 PCLNEALCLEEEEEQVCYCVPDYHGNRCQYQ  275 (284)
Q Consensus       245 ~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~  275 (284)
                      .|+.+|+|.+.   +.|+|.+||.+..|++.
T Consensus       631 ~C~g~GVCnn~---~~ChC~~gwapp~C~~~  658 (716)
T KOG3607|consen  631 TCNGHGVCNNE---LNCHCEPGWAPPFCFIF  658 (716)
T ss_pred             ccCCCcccCCC---cceeeCCCCCCCccccc
Confidence            38899999554   46999999999999864


No 77 
>KOG3607|consensus
Probab=20.26  E-value=76  Score=31.11  Aligned_cols=27  Identities=37%  Similarity=1.040  Sum_probs=20.1

Q ss_pred             CCCCCCeEecCCCCeeeeCCCCCcCCCccc
Q psy7015          71 PCHNGGSCIDGIAAYNCSCPPGYTGPSCES  100 (284)
Q Consensus        71 ~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~  100 (284)
                      .|..+|+|.+..   .|+|.+||.+..|+.
T Consensus       631 ~C~g~GVCnn~~---~ChC~~gwapp~C~~  657 (716)
T KOG3607|consen  631 TCNGHGVCNNEL---NCHCEPGWAPPFCFI  657 (716)
T ss_pred             ccCCCcccCCCc---ceeeCCCCCCCcccc
Confidence            377788886553   588888888888863


Done!