Query         psy6349
Match_columns 145
No_of_seqs    118 out of 1951
Neff          11.0
Searched_HMMs 46136
Date          Sat Aug 17 00:08:12 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6349.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6349hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1219|consensus               99.6 1.4E-14   3E-19  114.5   9.5  108   10-117  3865-3975(4289)
  2 KOG1214|consensus               99.3 2.6E-11 5.7E-16   89.6  10.7  137    6-143   691-860 (1289)
  3 KOG1219|consensus               99.2 4.2E-11 9.1E-16   95.8   8.6   96   46-141  3864-3973(4289)
  4 KOG4289|consensus               99.1   7E-10 1.5E-14   86.2   7.5   85   28-112  1221-1308(2531)
  5 KOG1214|consensus               99.0 3.6E-09 7.9E-14   78.7   7.9  114    2-116   729-859 (1289)
  6 KOG4289|consensus               98.8 8.3E-09 1.8E-13   80.5   4.9   76   63-138  1218-1308(2531)
  7 KOG1225|consensus               98.7 2.1E-07 4.5E-12   67.3   9.7   95   30-143   266-365 (525)
  8 KOG4260|consensus               98.5   4E-07 8.6E-12   60.0   6.2  124   15-140   150-304 (350)
  9 KOG1225|consensus               98.0 6.7E-05 1.5E-09   54.7   8.9   95   30-142   235-338 (525)
 10 PF07645 EGF_CA:  Calcium-bindi  98.0 9.9E-06 2.1E-10   38.8   3.3   33    6-38      1-34  (42)
 11 KOG1217|consensus               98.0 0.00011 2.5E-09   53.1  10.2  124   20-143   140-305 (487)
 12 PF06247 Plasmod_Pvs28:  Plasmo  97.8 2.5E-05 5.5E-10   48.9   2.9  125   15-143     6-163 (197)
 13 KOG1217|consensus               97.7 0.00054 1.2E-08   49.6   9.6  132    4-137   268-421 (487)
 14 PF07645 EGF_CA:  Calcium-bindi  97.6 4.2E-05 9.1E-10   36.6   2.1   19  122-140    16-34  (42)
 15 PF12947 EGF_3:  EGF domain;  I  97.4 0.00013 2.9E-09   33.6   1.9   26  118-143     7-33  (36)
 16 PF00008 EGF:  EGF-like domain   97.3 0.00017 3.6E-09   32.3   1.8   23   92-114     6-29  (32)
 17 PF00008 EGF:  EGF-like domain   97.3 0.00027 5.9E-09   31.6   2.2   27   14-40      3-31  (32)
 18 smart00179 EGF_CA Calcium-bind  97.2   0.001 2.2E-08   30.9   4.0   18   21-38     16-33  (39)
 19 PF14670 FXa_inhibition:  Coagu  97.2 0.00024 5.2E-09   32.7   1.5   23  122-144    10-32  (36)
 20 smart00179 EGF_CA Calcium-bind  97.2  0.0012 2.5E-08   30.7   3.9   35   46-80      2-38  (39)
 21 PF12947 EGF_3:  EGF domain;  I  97.1 0.00049 1.1E-08   31.7   2.0   27   13-39      4-31  (36)
 22 PF12662 cEGF:  Complement Clr-  97.0 0.00074 1.6E-08   28.0   1.7   15  130-144     1-15  (24)
 23 cd00054 EGF_CA Calcium-binding  96.8   0.004 8.6E-08   28.4   4.0   19   21-39     16-34  (38)
 24 smart00181 EGF Epidermal growt  96.5  0.0098 2.1E-07   26.8   3.9   25   15-39      6-30  (35)
 25 cd00054 EGF_CA Calcium-binding  96.2   0.014 3.1E-07   26.5   3.7   24   92-115    11-34  (38)
 26 KOG1226|consensus               96.1   0.068 1.5E-06   41.0   8.5   79   30-116   479-577 (783)
 27 KOG4260|consensus               96.0   0.006 1.3E-07   40.9   2.4   34    5-38    234-268 (350)
 28 cd00053 EGF Epidermal growth f  96.0   0.023 4.9E-07   25.3   3.8   20  122-141    12-31  (36)
 29 smart00181 EGF Epidermal growt  95.9   0.024 5.2E-07   25.4   3.7   19   59-77     12-30  (35)
 30 PF14670 FXa_inhibition:  Coagu  95.8  0.0089 1.9E-07   27.4   1.9   23   94-116     8-30  (36)
 31 PF12661 hEGF:  Human growth fa  95.8  0.0061 1.3E-07   21.4   1.0   12   31-42      2-13  (13)
 32 cd00053 EGF Epidermal growth f  95.6    0.04 8.6E-07   24.5   3.9   20   20-39     12-31  (36)
 33 PF12946 EGF_MSP1_1:  MSP1 EGF   95.2  0.0086 1.9E-07   27.5   0.8   24  121-144    10-34  (37)
 34 PF07974 EGF_2:  EGF-like domai  94.8   0.054 1.2E-06   24.1   2.7   26   15-42      6-32  (32)
 35 KOG0994|consensus               94.0    0.38 8.3E-06   39.1   7.3   13   30-42    783-795 (1758)
 36 PHA02887 EGF-like protein; Pro  92.0     0.2 4.4E-06   29.3   2.6   27   17-43     94-122 (126)
 37 KOG1226|consensus               92.0    0.58 1.3E-05   36.2   5.6   67   68-142   479-577 (783)
 38 cd01475 vWA_Matrilin VWA_Matri  90.0    0.31 6.8E-06   32.1   2.5   23  122-144   199-221 (224)
 39 PHA03099 epidermal growth fact  89.9    0.35 7.7E-06   28.8   2.3   27   17-43     53-81  (139)
 40 cd01475 vWA_Matrilin VWA_Matri  88.9    0.69 1.5E-05   30.5   3.6   33   83-115   186-218 (224)
 41 KOG0994|consensus               88.0     1.2 2.6E-05   36.5   4.6   18   97-116  1078-1095(1758)
 42 PF09064 Tme5_EGF_like:  Thromb  83.1    0.92   2E-05   20.4   1.3   13  132-144    19-31  (34)
 43 KOG3512|consensus               83.0      14 0.00031   27.5   7.7   28   16-43    279-309 (592)
 44 smart00051 DSL delta serrate l  71.0      13 0.00028   19.3   4.1   45   30-80     18-63  (63)
 45 PF00954 S_locus_glycop:  S-loc  67.0     5.8 0.00012   23.0   2.1   19  121-140    89-107 (110)
 46 KOG1836|consensus               61.3      22 0.00048   31.1   5.0   30   87-116   777-809 (1705)
 47 KOG3516|consensus               60.6     9.9 0.00021   31.6   2.8   35   12-46    548-584 (1306)
 48 PF00053 Laminin_EGF:  Laminin   55.6      14  0.0003   17.8   2.0   22   20-43     11-32  (49)
 49 cd00055 EGF_Lam Laminin-type e  54.6      25 0.00054   17.0   3.1   13   30-42     20-32  (50)
 50 PF01683 EB:  EB module;  Inter  43.8      25 0.00054   17.1   1.9    9  106-114    38-46  (52)
 51 KOG3516|consensus               41.7      32  0.0007   28.9   3.0   37   46-82    545-582 (1306)
 52 KOG3514|consensus               34.4      35 0.00077   28.5   2.2   35   48-82    625-660 (1591)
 53 smart00180 EGF_Lam Laminin-typ  30.7      70  0.0015   15.2   3.1   13   30-42     19-31  (46)
 54 PF12955 DUF3844:  Domain of un  26.7      97  0.0021   18.0   2.6   26   90-115    13-43  (103)
 55 PF04863 EGF_alliinase:  Alliin  25.2      50  0.0011   16.7   1.1   17   65-81     34-50  (56)

No 1  
>KOG1219|consensus
Probab=99.57  E-value=1.4e-14  Score=114.49  Aligned_cols=108  Identities=31%  Similarity=0.912  Sum_probs=97.6

Q ss_pred             CCCCCCCCCC-CEeeeCC-CceeeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCcccc-CCC
Q psy6349          10 GLKREALCHN-GTCEDFG-NSHRCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELN-VDD   86 (145)
Q Consensus        10 ~~~~~~~c~~-~~c~~~~-~~~~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~-~~~   86 (145)
                      -.|...+|+| |.|.... +.|.|.|.+.|+|..|+.++..|.+.||..++.|....+.+.|.|+.+|.|..|+.. +++
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~e 3944 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISE 3944 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccc
Confidence            3446678887 7999775 679999999999999998899999999999999999999999999999999999876 789


Q ss_pred             CcCCCCCCCCEEeeCCCCeEeeCCCCCccCC
Q psy6349          87 CKPNPCQNGGICHDLINHFSCSCPSGLKREA  117 (145)
Q Consensus        87 c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~~  117 (145)
                      |+...|.+++.|+++.|+|+|.|.++|.+..
T Consensus      3945 Cs~n~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3945 CSKNVCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred             cccccccCCceeeccCCceEeccChhHhccc
Confidence            9999999999999999999999999998763


No 2  
>KOG1214|consensus
Probab=99.32  E-value=2.6e-11  Score=89.61  Aligned_cols=137  Identities=27%  Similarity=0.707  Sum_probs=99.9

Q ss_pred             ecCCCCCCCCCCC-CCEeeeCCC-ceeeecCCCcc--cCCCCccCCCCCCC--CCCCCCeeeeCCCceEEeCCCCcc--c
Q psy6349           6 VSSPGLKREALCH-NGTCEDFGN-SHRCHCMEGYT--GSYCSIEINECESD--PCQNGATCQDLIGGYSCQCARGFQ--G   77 (145)
Q Consensus         6 ~~~~~~~~~~~c~-~~~c~~~~~-~~~C~c~~g~~--~~~c~~~~~~c~~~--~c~~~~~c~~~~~~~~c~c~~g~~--g   77 (145)
                      ..++|.++.+.|. ++.|.+.++ .|.|.|..+|.  +..|. ++++|...  .|+.+..|++..+.+.|.|..+|.  +
T Consensus       691 ~~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~d  769 (1289)
T KOG1214|consen  691 PVNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFAD  769 (1289)
T ss_pred             ccccceecCcccCCCccccCCCCcceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceecc
Confidence            4467788888885 568887765 58999999997  66774 67777653  488899999999999998877764  2


Q ss_pred             --CCccc-----cCCCCc--CCCCCCCC--EEeeCC-CCeEeeCCCCCccC------------CCCC-CCeeeeCCCcee
Q psy6349          78 --QNCEL-----NVDDCK--PNPCQNGG--ICHDLI-NHFSCSCPSGLKRE------------ALCH-NGTCEDFGNSHR  132 (145)
Q Consensus        78 --~~c~~-----~~~~c~--~~~~~~~~--~c~~~~-~~~~C~c~~g~~~~------------~~C~-~~~c~~~~g~~~  132 (145)
                        -.|-.     ..+.|.  .+.|...+  .|+... +.|.|.|+|||.++            ..|+ ++.|+++.|++.
T Consensus       770 d~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~Cyntpgsfs  849 (1289)
T KOG1214|consen  770 DRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFS  849 (1289)
T ss_pred             CCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccccccccCccccCCCceEecCCCcce
Confidence              23421     223443  23454444  344443 56999999999876            2465 789999999999


Q ss_pred             EecCCCCCcCC
Q psy6349         133 CHCICWDDLRN  143 (145)
Q Consensus       133 c~c~~~~~~~~  143 (145)
                      |+|++||.+++
T Consensus       850 C~C~pGy~GDG  860 (1289)
T KOG1214|consen  850 CRCQPGYYGDG  860 (1289)
T ss_pred             eecccCccCCC
Confidence            99999998865


No 3  
>KOG1219|consensus
Probab=99.24  E-value=4.2e-11  Score=95.76  Aligned_cols=96  Identities=35%  Similarity=0.957  Sum_probs=85.9

Q ss_pred             CCCCCCCCCCCCCeeeeCC-CceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCccC--------
Q psy6349          46 INECESDPCQNGATCQDLI-GGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSCPSGLKRE--------  116 (145)
Q Consensus        46 ~~~c~~~~c~~~~~c~~~~-~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~--------  116 (145)
                      .+.|.+.||.+++.|.... +.|.|.|+..|.|+.|..+...|.+.+|..++.|+...+.|.|.|+.+|++.        
T Consensus      3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~ 3943 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGIS 3943 (4289)
T ss_pred             ccccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeeccccc
Confidence            3778999999999998764 6799999999999999988999999999999999999999999999999875        


Q ss_pred             ----CCCC-CCeeeeCCCceeEecCCCCCc
Q psy6349         117 ----ALCH-NGTCEDFGNSHRCHCICWDDL  141 (145)
Q Consensus       117 ----~~C~-~~~c~~~~g~~~c~c~~~~~~  141 (145)
                          +.|. +|+|.+..|+|+|-|-++|.+
T Consensus      3944 eCs~n~C~~gg~C~n~~gsf~CncT~g~~g 3973 (4289)
T KOG1219|consen 3944 ECSKNVCGTGGQCINIPGSFHCNCTPGILG 3973 (4289)
T ss_pred             ccccccccCCceeeccCCceEeccChhHhc
Confidence                3455 679999999999999988864


No 4  
>KOG4289|consensus
Probab=99.05  E-value=7e-10  Score=86.18  Aligned_cols=85  Identities=46%  Similarity=1.194  Sum_probs=75.6

Q ss_pred             ceeeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCccccC--CCCcCCCCCCCCEEeeCC-CC
Q psy6349          28 SHRCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELNV--DDCKPNPCQNGGICHDLI-NH  104 (145)
Q Consensus        28 ~~~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~~--~~c~~~~~~~~~~c~~~~-~~  104 (145)
                      ..+|.|++||+|..|+++++.|-+.||.+++.|....+.|.|.|.++|.|..|..+.  ..|.+..|.+++.|++.. +.
T Consensus      1221 glrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ngg 1300 (2531)
T KOG4289|consen 1221 GLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGG 1300 (2531)
T ss_pred             ceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCc
Confidence            468999999999999999999999999999999999999999999999999997654  357788899999998754 67


Q ss_pred             eEeeCCCC
Q psy6349         105 FSCSCPSG  112 (145)
Q Consensus       105 ~~C~c~~g  112 (145)
                      +.|.|+.|
T Consensus      1301 f~c~Cp~g 1308 (2531)
T KOG4289|consen 1301 FCCHCPYG 1308 (2531)
T ss_pred             eeccCCCc
Confidence            88999987


No 5  
>KOG1214|consensus
Probab=98.96  E-value=3.6e-09  Score=78.70  Aligned_cols=114  Identities=32%  Similarity=0.752  Sum_probs=83.0

Q ss_pred             ceeeecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcc----cCCCCcc-----CCCCCCC--CCCCCC--eeeeC-CCc
Q psy6349           2 RNLIVSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYT----GSYCSIE-----INECESD--PCQNGA--TCQDL-IGG   66 (145)
Q Consensus         2 ~~~~~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~----~~~c~~~-----~~~c~~~--~c~~~~--~c~~~-~~~   66 (145)
                      ++|..++.++...+-|- +..|++-+++|+|.|..+|.    +..|..-     ++.|...  .|...+  .|+.. .+.
T Consensus       729 r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~  808 (1289)
T KOG1214|consen  729 RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGST  808 (1289)
T ss_pred             CCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCce
Confidence            45667777888888885 68999999999888877764    3455321     1223221  232223  33333 357


Q ss_pred             eEEeCCCCccc--CCccccCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCccC
Q psy6349          67 YSCQCARGFQG--QNCELNVDDCKPNPCQNGGICHDLINHFSCSCPSGLKRE  116 (145)
Q Consensus        67 ~~c~c~~g~~g--~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~  116 (145)
                      |.|.|.+||.|  ..|. ++++|.++.|.....|.++.+++.|+|.+||.++
T Consensus       809 y~C~CLPGfsGDG~~c~-dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  809 YSCACLPGFSGDGHQCT-DVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             EEEeecCCccCCccccc-cccccCccccCCCceEecCCCcceeecccCccCC
Confidence            99999999984  4565 6799999999999999999999999999999876


No 6  
>KOG4289|consensus
Probab=98.79  E-value=8.3e-09  Score=80.54  Aligned_cols=76  Identities=34%  Similarity=0.895  Sum_probs=63.4

Q ss_pred             CCCceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCccC--------CCCCC------CeeeeC-
Q psy6349          63 LIGGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSCPSGLKRE--------ALCHN------GTCEDF-  127 (145)
Q Consensus        63 ~~~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~~--------~~C~~------~~c~~~-  127 (145)
                      ..+.+.|.|++||.|..|...++.|-..+|.+++.|....+.|+|.|.++|+++        ..|.+      +.|.+. 
T Consensus      1218 pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred             ccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence            346789999999999999988899988899999999999999999999999987        23433      345443 


Q ss_pred             CCceeEecCCC
Q psy6349         128 GNSHRCHCICW  138 (145)
Q Consensus       128 ~g~~~c~c~~~  138 (145)
                      .|+++|+|+.|
T Consensus      1298 nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1298 NGGFCCHCPYG 1308 (2531)
T ss_pred             CCceeccCCCc
Confidence            48889999988


No 7  
>KOG1225|consensus
Probab=98.71  E-value=2.1e-07  Score=67.29  Aligned_cols=95  Identities=34%  Similarity=0.877  Sum_probs=69.2

Q ss_pred             eeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeC
Q psy6349          30 RCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSC  109 (145)
Q Consensus        30 ~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c  109 (145)
                      +|+|++||+|..|  ++..|... |+.++.++  .+  .|+|+++|.|..|+.  ..|. ..|..++.|+..    .|.|
T Consensus       266 ~CIC~~Gf~G~dC--~e~~Cp~~-cs~~g~~~--~g--~CiC~~g~~G~dCs~--~~cp-adC~g~G~Ci~G----~C~C  331 (525)
T KOG1225|consen  266 RCICPPGFTGDDC--DELVCPVD-CSGGGVCV--DG--ECICNPGYSGKDCSI--RRCP-ADCSGHGKCIDG----ECLC  331 (525)
T ss_pred             eEeCCCCCcCCCC--CcccCCcc-cCCCceec--CC--EeecCCCcccccccc--ccCC-ccCCCCCcccCC----ceEe
Confidence            7999999999999  34456544 66666664  22  699999999998852  3343 568888999733    7999


Q ss_pred             CCCCccC----CCCC-CCeeeeCCCceeEecCCCCCcCC
Q psy6349         110 PSGLKRE----ALCH-NGTCEDFGNSHRCHCICWDDLRN  143 (145)
Q Consensus       110 ~~g~~~~----~~C~-~~~c~~~~g~~~c~c~~~~~~~~  143 (145)
                      .+||.++    ..|+ +++|++  |   |.|++||++++
T Consensus       332 ~~Gy~G~~C~~~~C~~~g~cv~--g---C~C~~Gw~G~d  365 (525)
T KOG1225|consen  332 DEGYTGELCIQRACSGGGQCVN--G---CKCKKGWRGPD  365 (525)
T ss_pred             CCCCcCCcccccccCCCceecc--C---ceeccCccCCC
Confidence            9999986    2255 456653  2   99999998765


No 8  
>KOG4260|consensus
Probab=98.52  E-value=4e-07  Score=59.97  Aligned_cols=124  Identities=31%  Similarity=0.779  Sum_probs=77.6

Q ss_pred             CCCC-CCEee-eC--CCceeeecCCCcccCCCCc-cCCCCCC-----C----CCC--CCCeeeeCCCceEE-eCCCCcc-
Q psy6349          15 ALCH-NGTCE-DF--GNSHRCHCMEGYTGSYCSI-EINECES-----D----PCQ--NGATCQDLIGGYSC-QCARGFQ-   76 (145)
Q Consensus        15 ~~c~-~~~c~-~~--~~~~~C~c~~g~~~~~c~~-~~~~c~~-----~----~c~--~~~~c~~~~~~~~c-~c~~g~~-   76 (145)
                      .+|. +|.|. ..  .|+..|.|..||.|+.|.. ++....+     .    .|.  ..+.|.... .-.| .|..||. 
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~-~k~C~kCkkGW~l  228 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGES-SKGCSKCKKGWKL  228 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCC-CCChhhhccccee
Confidence            4575 57886 22  3456899999999999841 1100000     0    011  111232111 1122 4666765 


Q ss_pred             -cCCccccCCCCc--CCCCCCCCEEeeCCCCeEeeCCCCCccC--------CCCC--CCeeeeCCCceeEecCCCCC
Q psy6349          77 -GQNCELNVDDCK--PNPCQNGGICHDLINHFSCSCPSGLKRE--------ALCH--NGTCEDFGNSHRCHCICWDD  140 (145)
Q Consensus        77 -g~~c~~~~~~c~--~~~~~~~~~c~~~~~~~~C~c~~g~~~~--------~~C~--~~~c~~~~g~~~c~c~~~~~  140 (145)
                       ...| .++++|.  +.+|..+..|+|+.|+|.|...+||...        ..|.  +..|.++.++|+|+|+.++.
T Consensus       229 de~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  229 DEEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             ccccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence             2234 3677775  4567788899999999999999999753        2343  67899999999999988764


No 9  
>KOG1225|consensus
Probab=98.02  E-value=6.7e-05  Score=54.69  Aligned_cols=95  Identities=35%  Similarity=0.828  Sum_probs=63.0

Q ss_pred             eeecCCCcccCCCCccCCCCCCCCCCCCCeeeeCCCceEEeCCCCcccCCccccCCCCcCCCCCCCCEEeeCCCCeEeeC
Q psy6349          30 RCHCMEGYTGSYCSIEINECESDPCQNGATCQDLIGGYSCQCARGFQGQNCELNVDDCKPNPCQNGGICHDLINHFSCSC  109 (145)
Q Consensus        30 ~C~c~~g~~~~~c~~~~~~c~~~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~~~~c~~~~~~~~~~c~~~~~~~~C~c  109 (145)
                      .|.|..+|.++.|.  ...|.. .|...+.|+    .-+|+|.++|.|.+|..  ..|... |..++.+++.    +|.|
T Consensus       235 ic~c~~~~~g~~c~--~~~C~~-~c~~~g~c~----~G~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g----~CiC  300 (525)
T KOG1225|consen  235 ICECPEGYFGPLCS--TIYCPG-GCTGRGQCV----EGRCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDG----ECIC  300 (525)
T ss_pred             eeecCCceeCCccc--cccCCC-CCcccceEe----CCeEeCCCCCcCCCCCc--ccCCcc-cCCCceecCC----Eeec
Confidence            46777777777663  233322 233334554    23699999999999952  445444 6666676544    8999


Q ss_pred             CCCCccC--------CCCC-CCeeeeCCCceeEecCCCCCcC
Q psy6349         110 PSGLKRE--------ALCH-NGTCEDFGNSHRCHCICWDDLR  142 (145)
Q Consensus       110 ~~g~~~~--------~~C~-~~~c~~~~g~~~c~c~~~~~~~  142 (145)
                      .++|.++        ..|. ++.|.+  |  +|.|.+||++.
T Consensus       301 ~~g~~G~dCs~~~cpadC~g~G~Ci~--G--~C~C~~Gy~G~  338 (525)
T KOG1225|consen  301 NPGYSGKDCSIRRCPADCSGHGKCID--G--ECLCDEGYTGE  338 (525)
T ss_pred             CCCccccccccccCCccCCCCCcccC--C--ceEeCCCCcCC
Confidence            9999986        2355 678873  3  69999999864


No 10 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.02  E-value=9.9e-06  Score=38.83  Aligned_cols=33  Identities=30%  Similarity=0.551  Sum_probs=27.6

Q ss_pred             ecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcc
Q psy6349           6 VSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYT   38 (145)
Q Consensus         6 ~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~   38 (145)
                      ++++|....+.|. ++.|++..|+|.|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            4677888888886 57899999999999999887


No 11 
>KOG1217|consensus
Probab=98.01  E-value=0.00011  Score=53.07  Aligned_cols=124  Identities=37%  Similarity=0.906  Sum_probs=83.0

Q ss_pred             CEeeeC---CCceeeecCCCcccCCCCccCCCCCC--CCCCCCCeeeeCCCceEEeCCCCcccCCcccc-----------
Q psy6349          20 GTCEDF---GNSHRCHCMEGYTGSYCSIEINECES--DPCQNGATCQDLIGGYSCQCARGFQGQNCELN-----------   83 (145)
Q Consensus        20 ~~c~~~---~~~~~C~c~~g~~~~~c~~~~~~c~~--~~c~~~~~c~~~~~~~~c~c~~g~~g~~c~~~-----------   83 (145)
                      +.|...   ...+.|.|..+|.+..+....+.|..  .+|..+..|.+..+.|.|.|..+|.+..+...           
T Consensus       140 ~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~  219 (487)
T KOG1217|consen  140 GSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVA  219 (487)
T ss_pred             hhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEeccee
Confidence            455543   24678999999998877533356763  34777888888888899999999986655421           


Q ss_pred             --------CCCCcC--CCCCCC-CEEeeCCCCeEeeCCCCCccCC--------------CCC-CCeeeeCCCceeEecCC
Q psy6349          84 --------VDDCKP--NPCQNG-GICHDLINHFSCSCPSGLKREA--------------LCH-NGTCEDFGNSHRCHCIC  137 (145)
Q Consensus        84 --------~~~c~~--~~~~~~-~~c~~~~~~~~C~c~~g~~~~~--------------~C~-~~~c~~~~g~~~c~c~~  137 (145)
                              ...+..  ..+... +.|.+..+.+.|.++++|.+..              .|. ++.|.+..+.|.|.|++
T Consensus       220 ~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~  299 (487)
T KOG1217|consen  220 CSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPP  299 (487)
T ss_pred             ccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCC
Confidence                    011110  112211 6677777778898888876431              255 56888888889999999


Q ss_pred             CCCcCC
Q psy6349         138 WDDLRN  143 (145)
Q Consensus       138 ~~~~~~  143 (145)
                      +|.+..
T Consensus       300 g~~g~~  305 (487)
T KOG1217|consen  300 GFTGRL  305 (487)
T ss_pred             CCCCCC
Confidence            998764


No 12 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.77  E-value=2.5e-05  Score=48.92  Aligned_cols=125  Identities=28%  Similarity=0.654  Sum_probs=75.0

Q ss_pred             CCCCCCEeeeCCCceeeecCCCcc---cCCCCccCCCCC-----CCCCCCCCeeeeCC-----CceEEeCCCCcccC--C
Q psy6349          15 ALCHNGTCEDFGNSHRCHCMEGYT---GSYCSIEINECE-----SDPCQNGATCQDLI-----GGYSCQCARGFQGQ--N   79 (145)
Q Consensus        15 ~~c~~~~c~~~~~~~~C~c~~g~~---~~~c~~~~~~c~-----~~~c~~~~~c~~~~-----~~~~c~c~~g~~g~--~   79 (145)
                      ..|.+|..++++..|.|.|..||+   ...|+ ....|.     ..+|...+.|....     ..|.|.|..+|.-.  .
T Consensus         6 T~CKNG~LiQMSNHfEC~Cnegfvl~~EntCE-~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~v   84 (197)
T PF06247_consen    6 TICKNGYLIQMSNHFECKCNEGFVLKNENTCE-EKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGV   84 (197)
T ss_dssp             ---BTEEEEEESSEEEEEESTTEEEEETTEEE-E----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSS
T ss_pred             ccccCCEEEEccCceEEEcCCCcEEccccccc-cceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCe
Confidence            467889999999999999999997   34454 333443     24677777787654     57999999999622  2


Q ss_pred             ccccCCCCcCCCCCCCCEEeeCC---CCeEeeCCCCCccC----------CC----CC-CCeeeeCCCceeEecCCCCCc
Q psy6349          80 CELNVDDCKPNPCQNGGICHDLI---NHFSCSCPSGLKRE----------AL----CH-NGTCEDFGNSHRCHCICWDDL  141 (145)
Q Consensus        80 c~~~~~~c~~~~~~~~~~c~~~~---~~~~C~c~~g~~~~----------~~----C~-~~~c~~~~g~~~c~c~~~~~~  141 (145)
                      |.  ...|....|. .+.|+...   ....|+|.-|+..+          ..    |. +..|....+-|+|.+..++.+
T Consensus        85 Cv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~  161 (197)
T PF06247_consen   85 CV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCKEGFPG  161 (197)
T ss_dssp             EE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-TT-EE
T ss_pred             Ec--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecCCCCCC
Confidence            32  2344445565 56775332   23488888887622          23    33 568888999999999999876


Q ss_pred             CC
Q psy6349         142 RN  143 (145)
Q Consensus       142 ~~  143 (145)
                      ..
T Consensus       162 ~~  163 (197)
T PF06247_consen  162 DG  163 (197)
T ss_dssp             ET
T ss_pred             CC
Confidence            54


No 13 
>KOG1217|consensus
Probab=97.71  E-value=0.00054  Score=49.57  Aligned_cols=132  Identities=33%  Similarity=0.872  Sum_probs=87.5

Q ss_pred             eeecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcccCCC--CccCCCCC----CCCCCCCCeee--eCCCceEEeCCCC
Q psy6349           4 LIVSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYTGSYC--SIEINECE----SDPCQNGATCQ--DLIGGYSCQCARG   74 (145)
Q Consensus         4 ~~~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~~~~c--~~~~~~c~----~~~c~~~~~c~--~~~~~~~c~c~~g   74 (145)
                      +..++.+....+ |. ++.|++..+.|.|.|.++|.+..+  ..+...|.    ..+|..+..|.  .....+.|.+..+
T Consensus       268 ~~~~~~C~~~~~-c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~  346 (487)
T KOG1217|consen  268 CVDVDSCALIAS-CPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPG  346 (487)
T ss_pred             eeeccccCCCCc-cCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCC
Confidence            344556655544 65 479999888899999999998886  22334553    33455555662  2234567889888


Q ss_pred             cccCCccccCCCCcCCCCCCCCEEee-CCCCeEeeCCCCCcc-C----------CCCC-CCeeeeCCCceeEecCC
Q psy6349          75 FQGQNCELNVDDCKPNPCQNGGICHD-LINHFSCSCPSGLKR-E----------ALCH-NGTCEDFGNSHRCHCIC  137 (145)
Q Consensus        75 ~~g~~c~~~~~~c~~~~~~~~~~c~~-~~~~~~C~c~~g~~~-~----------~~C~-~~~c~~~~g~~~c~c~~  137 (145)
                      +.|..|....+.|....+...+.|.+ ..+.+.|.++.+|.+ .          ..|. .+.|.+..+++.|. +.
T Consensus       347 ~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~~~~c~~~~~~~~c~-~~  421 (487)
T KOG1217|consen  347 FTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSGCGDCVNGPGGGACT-PP  421 (487)
T ss_pred             CCCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccCCcceeccCCCCccc-cC
Confidence            88888863223676666777888988 688899999998765 2          1222 23455666667776 55


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.65  E-value=4.2e-05  Score=36.62  Aligned_cols=19  Identities=32%  Similarity=0.546  Sum_probs=9.4

Q ss_pred             CeeeeCCCceeEecCCCCC
Q psy6349         122 GTCEDFGNSHRCHCICWDD  140 (145)
Q Consensus       122 ~~c~~~~g~~~c~c~~~~~  140 (145)
                      +.|.|+.|+|+|.|++||.
T Consensus        16 ~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen   16 GTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SEEEEETTEEEEEESTTEE
T ss_pred             CEEEcCCCCEEeeCCCCcE
Confidence            4455555555555555444


No 15 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.40  E-value=0.00013  Score=33.59  Aligned_cols=26  Identities=35%  Similarity=0.605  Sum_probs=16.4

Q ss_pred             CCC-CCeeeeCCCceeEecCCCCCcCC
Q psy6349         118 LCH-NGTCEDFGNSHRCHCICWDDLRN  143 (145)
Q Consensus       118 ~C~-~~~c~~~~g~~~c~c~~~~~~~~  143 (145)
                      .|+ ++.|.++.++|.|+|++||.+++
T Consensus         7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    7 GCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             GS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCcEeecCCCCEEeECCCCCccCC
Confidence            455 67777777777777777777665


No 16 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.34  E-value=0.00017  Score=32.33  Aligned_cols=23  Identities=52%  Similarity=1.295  Sum_probs=10.0

Q ss_pred             CCCCCEEeeCC-CCeEeeCCCCCc
Q psy6349          92 CQNGGICHDLI-NHFSCSCPSGLK  114 (145)
Q Consensus        92 ~~~~~~c~~~~-~~~~C~c~~g~~  114 (145)
                      |.+++.|+... ..|.|.|++||.
T Consensus         6 C~n~g~C~~~~~~~y~C~C~~G~~   29 (32)
T PF00008_consen    6 CQNGGTCIDLPGGGYTCECPPGYT   29 (32)
T ss_dssp             STTTEEEEEESTSEEEEEEBTTEE
T ss_pred             CCCCeEEEeCCCCCEEeECCCCCc
Confidence            44444444433 344444444443


No 17 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.28  E-value=0.00027  Score=31.63  Aligned_cols=27  Identities=44%  Similarity=0.982  Sum_probs=17.6

Q ss_pred             CCCCCC-CEeeeCC-CceeeecCCCcccC
Q psy6349          14 EALCHN-GTCEDFG-NSHRCHCMEGYTGS   40 (145)
Q Consensus        14 ~~~c~~-~~c~~~~-~~~~C~c~~g~~~~   40 (145)
                      ..+|.| +.|++.. +.|.|.|.+||.|.
T Consensus         3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    3 SNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            346654 5777666 66777777777654


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.23  E-value=0.001  Score=30.88  Aligned_cols=18  Identities=50%  Similarity=0.949  Sum_probs=9.0

Q ss_pred             EeeeCCCceeeecCCCcc
Q psy6349          21 TCEDFGNSHRCHCMEGYT   38 (145)
Q Consensus        21 ~c~~~~~~~~C~c~~g~~   38 (145)
                      .|++..+.|.|.|..+|.
T Consensus        16 ~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179       16 TCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             EeECCCCCeEeECCCCCc
Confidence            455444445555555544


No 19 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.19  E-value=0.00024  Score=32.68  Aligned_cols=23  Identities=26%  Similarity=0.388  Sum_probs=16.2

Q ss_pred             CeeeeCCCceeEecCCCCCcCCC
Q psy6349         122 GTCEDFGNSHRCHCICWDDLRNS  144 (145)
Q Consensus       122 ~~c~~~~g~~~c~c~~~~~~~~~  144 (145)
                      ..|.+..++|+|.|++||.|..|
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L~~D   32 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKLAED   32 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE-TT
T ss_pred             CCCccCCCceEeECCCCCEECcC
Confidence            36777888888888888887765


No 20 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.18  E-value=0.0012  Score=30.66  Aligned_cols=35  Identities=54%  Similarity=1.414  Sum_probs=28.5

Q ss_pred             CCCCCC-CCCCCCCeeeeCCCceEEeCCCCcc-cCCc
Q psy6349          46 INECES-DPCQNGATCQDLIGGYSCQCARGFQ-GQNC   80 (145)
Q Consensus        46 ~~~c~~-~~c~~~~~c~~~~~~~~c~c~~g~~-g~~c   80 (145)
                      +++|.. .+|..++.|.+..++|.|.|..+|. |..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence            355655 6788778999999999999999998 7766


No 21 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.08  E-value=0.00049  Score=31.68  Aligned_cols=27  Identities=44%  Similarity=1.005  Sum_probs=18.1

Q ss_pred             CCCCCC-CCEeeeCCCceeeecCCCccc
Q psy6349          13 REALCH-NGTCEDFGNSHRCHCMEGYTG   39 (145)
Q Consensus        13 ~~~~c~-~~~c~~~~~~~~C~c~~g~~~   39 (145)
                      ..+.|+ ++.|.+..++|.|.|++||.|
T Consensus         4 ~~~~C~~nA~C~~~~~~~~C~C~~Gy~G   31 (36)
T PF12947_consen    4 NNGGCHPNATCTNTGGSYTCTCKPGYEG   31 (36)
T ss_dssp             GGGGS-TTCEEEE-TTSEEEEE-CEEEC
T ss_pred             CCCCCCCCcEeecCCCCEEeECCCCCcc
Confidence            445665 678888888888888888875


No 22 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.95  E-value=0.00074  Score=28.02  Aligned_cols=15  Identities=27%  Similarity=0.321  Sum_probs=11.7

Q ss_pred             ceeEecCCCCCcCCC
Q psy6349         130 SHRCHCICWDDLRNS  144 (145)
Q Consensus       130 ~~~c~c~~~~~~~~~  144 (145)
                      +|+|.|++||+|.++
T Consensus         1 sy~C~C~~Gy~l~~d   15 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPD   15 (24)
T ss_pred             CEEeeCCCCCcCCCC
Confidence            578899999988654


No 23 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.81  E-value=0.004  Score=28.44  Aligned_cols=19  Identities=53%  Similarity=1.004  Sum_probs=8.4

Q ss_pred             EeeeCCCceeeecCCCccc
Q psy6349          21 TCEDFGNSHRCHCMEGYTG   39 (145)
Q Consensus        21 ~c~~~~~~~~C~c~~g~~~   39 (145)
                      .|.+..+.|.|.|..+|.|
T Consensus        16 ~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054          16 TCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             EeECCCCCeEeECCCCCcC
Confidence            4444444444444444443


No 24 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.46  E-value=0.0098  Score=26.79  Aligned_cols=25  Identities=48%  Similarity=0.975  Sum_probs=16.2

Q ss_pred             CCCCCCEeeeCCCceeeecCCCccc
Q psy6349          15 ALCHNGTCEDFGNSHRCHCMEGYTG   39 (145)
Q Consensus        15 ~~c~~~~c~~~~~~~~C~c~~g~~~   39 (145)
                      .+|.++.|.+..+.|.|.|..||.+
T Consensus         6 ~~C~~~~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        6 GPCSNGTCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCCCCEEECCCCCeEeECCCCCcc
Confidence            3454336776666777777777766


No 25 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.19  E-value=0.014  Score=26.45  Aligned_cols=24  Identities=46%  Similarity=1.222  Sum_probs=11.8

Q ss_pred             CCCCCEEeeCCCCeEeeCCCCCcc
Q psy6349          92 CQNGGICHDLINHFSCSCPSGLKR  115 (145)
Q Consensus        92 ~~~~~~c~~~~~~~~C~c~~g~~~  115 (145)
                      |..++.|.+..+.|.|.|+++|.+
T Consensus        11 C~~~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054          11 CQNGGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             cCCCCEeECCCCCeEeECCCCCcC
Confidence            333444554445555555555543


No 26 
>KOG1226|consensus
Probab=96.12  E-value=0.068  Score=40.96  Aligned_cols=79  Identities=27%  Similarity=0.683  Sum_probs=46.7

Q ss_pred             eeecCCCcccCCCCccCC---------CCCC----CCCCCCCeeeeCCCceEEeCCCCcc----cCCccccCCCCcC---
Q psy6349          30 RCHCMEGYTGSYCSIEIN---------ECES----DPCQNGATCQDLIGGYSCQCARGFQ----GQNCELNVDDCKP---   89 (145)
Q Consensus        30 ~C~c~~g~~~~~c~~~~~---------~c~~----~~c~~~~~c~~~~~~~~c~c~~g~~----g~~c~~~~~~c~~---   89 (145)
                      .|.|.+||.|..|+...+         .|+.    .+|+..+.|.=    ..|.|.+...    |+.|.-+.-.|..   
T Consensus       479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C----GqC~C~~~~~~~i~G~fCECDnfsC~r~~g  554 (783)
T KOG1226|consen  479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC----GQCVCHKPDNGKIYGKFCECDNFSCERHKG  554 (783)
T ss_pred             ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC----CceEecCCCCCceeeeeeeccCcccccccC
Confidence            479999999999874322         2321    24555555531    1366655443    6666533333322   


Q ss_pred             CCCCCCCEEeeCCCCeEeeCCCCCccC
Q psy6349          90 NPCQNGGICHDLINHFSCSCPSGLKRE  116 (145)
Q Consensus        90 ~~~~~~~~c~~~~~~~~C~c~~g~~~~  116 (145)
                      ..|..++.|.-.    +|.|.+||++.
T Consensus       555 ~lC~g~G~C~CG----~CvC~~GwtG~  577 (783)
T KOG1226|consen  555 VLCGGHGRCECG----RCVCNPGWTGS  577 (783)
T ss_pred             cccCCCCeEeCC----cEEcCCCCccC
Confidence            246777777443    78999999886


No 27 
>KOG4260|consensus
Probab=96.00  E-value=0.006  Score=40.87  Aligned_cols=34  Identities=24%  Similarity=0.455  Sum_probs=17.6

Q ss_pred             eecCCCCCCCCCCC-CCEeeeCCCceeeecCCCcc
Q psy6349           5 IVSSPGLKREALCH-NGTCEDFGNSHRCHCMEGYT   38 (145)
Q Consensus         5 ~~~~~~~~~~~~c~-~~~c~~~~~~~~C~c~~g~~   38 (145)
                      .+++.|...+.+|. +..|+|+.|+|.|....||.
T Consensus       234 vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~  268 (350)
T KOG4260|consen  234 VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYK  268 (350)
T ss_pred             ccHHHHhcCCCCCChhheeecCCCceEeccccccc
Confidence            34445554555553 34555555555555555554


No 28 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.97  E-value=0.023  Score=25.33  Aligned_cols=20  Identities=35%  Similarity=0.439  Sum_probs=9.6

Q ss_pred             CeeeeCCCceeEecCCCCCc
Q psy6349         122 GTCEDFGNSHRCHCICWDDL  141 (145)
Q Consensus       122 ~~c~~~~g~~~c~c~~~~~~  141 (145)
                      +.|.+..+.|.|.|+.||.+
T Consensus        12 ~~C~~~~~~~~C~C~~g~~g   31 (36)
T cd00053          12 GTCVNTPGSYRCVCPPGYTG   31 (36)
T ss_pred             CEEecCCCCeEeECCCCCcc
Confidence            44444444455555555443


No 29 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.93  E-value=0.024  Score=25.44  Aligned_cols=19  Identities=42%  Similarity=1.184  Sum_probs=8.9

Q ss_pred             eeeeCCCceEEeCCCCccc
Q psy6349          59 TCQDLIGGYSCQCARGFQG   77 (145)
Q Consensus        59 ~c~~~~~~~~c~c~~g~~g   77 (145)
                      .|.+..+.+.|.|..+|.+
T Consensus        12 ~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181       12 TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             EEECCCCCeEeECCCCCcc
Confidence            4444444445555554443


No 30 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.80  E-value=0.0089  Score=27.45  Aligned_cols=23  Identities=35%  Similarity=0.795  Sum_probs=17.8

Q ss_pred             CCCEEeeCCCCeEeeCCCCCccC
Q psy6349          94 NGGICHDLINHFSCSCPSGLKRE  116 (145)
Q Consensus        94 ~~~~c~~~~~~~~C~c~~g~~~~  116 (145)
                      ..+.|++..++|+|.|++||...
T Consensus         8 C~h~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen    8 CSHICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             SSSEEEEETTSEEEE-STTEEE-
T ss_pred             cCCCCccCCCceEeECCCCCEEC
Confidence            45688888999999999999864


No 31 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.77  E-value=0.0061  Score=21.38  Aligned_cols=12  Identities=50%  Similarity=1.431  Sum_probs=6.7

Q ss_pred             eecCCCcccCCC
Q psy6349          31 CHCMEGYTGSYC   42 (145)
Q Consensus        31 C~c~~g~~~~~c   42 (145)
                      |.|++||+|..|
T Consensus         2 C~C~~G~~G~~C   13 (13)
T PF12661_consen    2 CQCPPGWTGPNC   13 (13)
T ss_dssp             EEE-TTEETTTT
T ss_pred             ccCcCCCcCCCC
Confidence            566666666544


No 32 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.63  E-value=0.04  Score=24.48  Aligned_cols=20  Identities=55%  Similarity=1.062  Sum_probs=9.5

Q ss_pred             CEeeeCCCceeeecCCCccc
Q psy6349          20 GTCEDFGNSHRCHCMEGYTG   39 (145)
Q Consensus        20 ~~c~~~~~~~~C~c~~g~~~   39 (145)
                      +.|.+..+.|.|.|..+|.+
T Consensus        12 ~~C~~~~~~~~C~C~~g~~g   31 (36)
T cd00053          12 GTCVNTPGSYRCVCPPGYTG   31 (36)
T ss_pred             CEEecCCCCeEeECCCCCcc
Confidence            44544444444555544443


No 33 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=95.25  E-value=0.0086  Score=27.49  Aligned_cols=24  Identities=21%  Similarity=0.479  Sum_probs=15.1

Q ss_pred             CCeeeeCC-CceeEecCCCCCcCCC
Q psy6349         121 NGTCEDFG-NSHRCHCICWDDLRNS  144 (145)
Q Consensus       121 ~~~c~~~~-g~~~c~c~~~~~~~~~  144 (145)
                      ++.|.+.. |.+.|+|.+||+..++
T Consensus        10 NA~C~~~~dG~eecrCllgyk~~~~   34 (37)
T PF12946_consen   10 NAGCFRYDDGSEECRCLLGYKKVGG   34 (37)
T ss_dssp             TEEEEEETTSEEEEEE-TTEEEETT
T ss_pred             CcccEEcCCCCEEEEeeCCccccCC
Confidence            66676655 6666777777766543


No 34 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.80  E-value=0.054  Score=24.12  Aligned_cols=26  Identities=46%  Similarity=1.057  Sum_probs=19.0

Q ss_pred             CCCC-CCEeeeCCCceeeecCCCcccCCC
Q psy6349          15 ALCH-NGTCEDFGNSHRCHCMEGYTGSYC   42 (145)
Q Consensus        15 ~~c~-~~~c~~~~~~~~C~c~~g~~~~~c   42 (145)
                      ..|. +|+|+...  ..|.|.++|.|..|
T Consensus         6 ~~C~~~G~C~~~~--g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPC--GRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCC--CEEECCCCCcCCCC
Confidence            3575 58898652  37999999988765


No 35 
>KOG0994|consensus
Probab=93.99  E-value=0.38  Score=39.06  Aligned_cols=13  Identities=31%  Similarity=1.004  Sum_probs=10.9

Q ss_pred             eeecCCCcccCCC
Q psy6349          30 RCHCMEGYTGSYC   42 (145)
Q Consensus        30 ~C~c~~g~~~~~c   42 (145)
                      .|.|++..+|..|
T Consensus       783 qCqCkPnVVGR~C  795 (1758)
T KOG0994|consen  783 QCQCKPNVVGRRC  795 (1758)
T ss_pred             eecccCccccccc
Confidence            6888888888877


No 36 
>PHA02887 EGF-like protein; Provisional
Probab=91.97  E-value=0.2  Score=29.28  Aligned_cols=27  Identities=41%  Similarity=0.979  Sum_probs=15.2

Q ss_pred             CCCCEeeeC--CCceeeecCCCcccCCCC
Q psy6349          17 CHNGTCEDF--GNSHRCHCMEGYTGSYCS   43 (145)
Q Consensus        17 c~~~~c~~~--~~~~~C~c~~g~~~~~c~   43 (145)
                      |.||.|.-.  .....|.|..||.|.+|+
T Consensus        94 CiHG~C~yI~dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         94 CINGECMNIIDLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             eeCCEEEccccCCCceeECCCCcccCCCC
Confidence            445555422  234567777777776664


No 37 
>KOG1226|consensus
Probab=91.96  E-value=0.58  Score=36.21  Aligned_cols=67  Identities=31%  Similarity=0.728  Sum_probs=38.0

Q ss_pred             EEeCCCCcccCCccccC---------CCCc----CCCCCCCCEEeeCCCCeEeeCCCCCc----cC--------------
Q psy6349          68 SCQCARGFQGQNCELNV---------DDCK----PNPCQNGGICHDLINHFSCSCPSGLK----RE--------------  116 (145)
Q Consensus        68 ~c~c~~g~~g~~c~~~~---------~~c~----~~~~~~~~~c~~~~~~~~C~c~~g~~----~~--------------  116 (145)
                      .|.|.+||.|+.|.-..         +.|.    ...|...+.|.=.    +|.|.+...    +.              
T Consensus       479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~fCECDnfsC~r~~g  554 (783)
T KOG1226|consen  479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGKFCECDNFSCERHKG  554 (783)
T ss_pred             ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeeeeeeccCcccccccC
Confidence            47899999888764211         1221    1246666666432    566655433    11              


Q ss_pred             CCCC-CCeeeeCCCceeEecCCCCCcC
Q psy6349         117 ALCH-NGTCEDFGNSHRCHCICWDDLR  142 (145)
Q Consensus       117 ~~C~-~~~c~~~~g~~~c~c~~~~~~~  142 (145)
                      ..|. +++|.=  |  +|+|.+||++.
T Consensus       555 ~lC~g~G~C~C--G--~CvC~~GwtG~  577 (783)
T KOG1226|consen  555 VLCGGHGRCEC--G--RCVCNPGWTGS  577 (783)
T ss_pred             cccCCCCeEeC--C--cEEcCCCCccC
Confidence            2455 555532  2  59999999875


No 38 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=90.04  E-value=0.31  Score=32.10  Aligned_cols=23  Identities=22%  Similarity=0.254  Sum_probs=20.0

Q ss_pred             CeeeeCCCceeEecCCCCCcCCC
Q psy6349         122 GTCEDFGNSHRCHCICWDDLRNS  144 (145)
Q Consensus       122 ~~c~~~~g~~~c~c~~~~~~~~~  144 (145)
                      ..|.+..|+|.|.|..||++..+
T Consensus       199 ~~C~~~~g~~~c~c~~g~~~~~~  221 (224)
T cd01475         199 QVCISTPGSYLCACTEGYALLED  221 (224)
T ss_pred             ceEEcCCCCEEeECCCCccCCCC
Confidence            47999999999999999998654


No 39 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=89.88  E-value=0.35  Score=28.80  Aligned_cols=27  Identities=37%  Similarity=0.907  Sum_probs=18.0

Q ss_pred             CCCCEeeeC--CCceeeecCCCcccCCCC
Q psy6349          17 CHNGTCEDF--GNSHRCHCMEGYTGSYCS   43 (145)
Q Consensus        17 c~~~~c~~~--~~~~~C~c~~g~~~~~c~   43 (145)
                      |.||.|.-.  ...+.|.|..||+|.+|+
T Consensus        53 ClHG~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         53 CLHGDCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             eECCEEEeeccCCCceeECCCCccccccc
Confidence            455666533  245678888888888885


No 40 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.93  E-value=0.69  Score=30.47  Aligned_cols=33  Identities=18%  Similarity=0.398  Sum_probs=24.4

Q ss_pred             cCCCCcCCCCCCCCEEeeCCCCeEeeCCCCCcc
Q psy6349          83 NVDDCKPNPCQNGGICHDLINHFSCSCPSGLKR  115 (145)
Q Consensus        83 ~~~~c~~~~~~~~~~c~~~~~~~~C~c~~g~~~  115 (145)
                      +.++|..........|.++.|+|.|.|.+||..
T Consensus       186 ~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         186 VPDLCATLSHVCQQVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             CchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence            455665433334468999999999999999975


No 41 
>KOG0994|consensus
Probab=87.99  E-value=1.2  Score=36.51  Aligned_cols=18  Identities=22%  Similarity=0.840  Sum_probs=11.2

Q ss_pred             EEeeCCCCeEeeCCCCCccC
Q psy6349          97 ICHDLINHFSCSCPSGLKRE  116 (145)
Q Consensus        97 ~c~~~~~~~~C~c~~g~~~~  116 (145)
                      .|....|  +|.|.+||.+.
T Consensus      1078 qCN~ftG--QCqCkpGfGGR 1095 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGR 1095 (1758)
T ss_pred             ccccccc--ceeccCCCCCc
Confidence            4433334  68888888653


No 42 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=83.11  E-value=0.92  Score=20.38  Aligned_cols=13  Identities=23%  Similarity=0.340  Sum_probs=8.3

Q ss_pred             eEecCCCCCcCCC
Q psy6349         132 RCHCICWDDLRNS  144 (145)
Q Consensus       132 ~c~c~~~~~~~~~  144 (145)
                      .|.|++||-|..+
T Consensus        19 ~C~CPeGyIlde~   31 (34)
T PF09064_consen   19 QCFCPEGYILDEG   31 (34)
T ss_pred             ceeCCCceEecCC
Confidence            5777777766544


No 43 
>KOG3512|consensus
Probab=83.03  E-value=14  Score=27.50  Aligned_cols=28  Identities=21%  Similarity=0.597  Sum_probs=19.2

Q ss_pred             CCC-C-CEeeeC-CCceeeecCCCcccCCCC
Q psy6349          16 LCH-N-GTCEDF-GNSHRCHCMEGYTGSYCS   43 (145)
Q Consensus        16 ~c~-~-~~c~~~-~~~~~C~c~~g~~~~~c~   43 (145)
                      +|+ | ..|+-- .+...|.|...-.|+.|.
T Consensus       279 KCNgHAs~Cv~d~~~~ltCdC~HNTaGPdCg  309 (592)
T KOG3512|consen  279 KCNGHASRCVMDESSHLTCDCEHNTAGPDCG  309 (592)
T ss_pred             eecCccceeeeccCCceEEecccCCCCCCcc
Confidence            354 4 367744 445889999888888874


No 44 
>smart00051 DSL delta serrate ligand.
Probab=71.02  E-value=13  Score=19.34  Aligned_cols=45  Identities=31%  Similarity=0.700  Sum_probs=22.8

Q ss_pred             eeecCCCcccCCCCccCCCCCCC-CCCCCCeeeeCCCceEEeCCCCcccCCc
Q psy6349          30 RCHCMEGYTGSYCSIEINECESD-PCQNGATCQDLIGGYSCQCARGFQGQNC   80 (145)
Q Consensus        30 ~C~c~~g~~~~~c~~~~~~c~~~-~c~~~~~c~~~~~~~~c~c~~g~~g~~c   80 (145)
                      .-.|..+|.|..|.   ..|... ....+..|.. .+  .++|.+||.|+.|
T Consensus        18 rv~C~~~~yG~~C~---~~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~C   63 (63)
T smart00051       18 RVTCDENYYGEGCN---KFCRPRDDFFGHYTCDE-NG--NKGCLEGWMGPYC   63 (63)
T ss_pred             EeeCCCCCcCCccC---CEeCcCccccCCccCCc-CC--CEecCCCCcCCCC
Confidence            44677777777773   233221 1223344532 12  3567777766543


No 45 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=66.98  E-value=5.8  Score=22.97  Aligned_cols=19  Identities=37%  Similarity=0.773  Sum_probs=9.0

Q ss_pred             CCeeeeCCCceeEecCCCCC
Q psy6349         121 NGTCEDFGNSHRCHCICWDD  140 (145)
Q Consensus       121 ~~~c~~~~g~~~c~c~~~~~  140 (145)
                      ++.|. ......|.|.+||+
T Consensus        89 ~g~C~-~~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   89 NGICN-SNNSPKCSCLPGFE  107 (110)
T ss_pred             ccEeC-CCCCCceECCCCcC
Confidence            44552 22333466666654


No 46 
>KOG1836|consensus
Probab=61.33  E-value=22  Score=31.13  Aligned_cols=30  Identities=40%  Similarity=1.016  Sum_probs=19.7

Q ss_pred             CcCCCCCCCCEEeeCC--CCeEee-CCCCCccC
Q psy6349          87 CKPNPCQNGGICHDLI--NHFSCS-CPSGLKRE  116 (145)
Q Consensus        87 c~~~~~~~~~~c~~~~--~~~~C~-c~~g~~~~  116 (145)
                      |.+-.|...+.|..+.  ....|. |+++|++.
T Consensus       777 C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~  809 (1705)
T KOG1836|consen  777 CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL  809 (1705)
T ss_pred             CccCCCCCChhhcCcCcccceecCCCCCCCccc
Confidence            6666666666665433  345787 88888765


No 47 
>KOG3516|consensus
Probab=60.61  E-value=9.9  Score=31.57  Aligned_cols=35  Identities=29%  Similarity=0.741  Sum_probs=28.1

Q ss_pred             CCCCCCCC-CEeeeCCCceeeecC-CCcccCCCCccC
Q psy6349          12 KREALCHN-GTCEDFGNSHRCHCM-EGYTGSYCSIEI   46 (145)
Q Consensus        12 ~~~~~c~~-~~c~~~~~~~~C~c~-~g~~~~~c~~~~   46 (145)
                      |.+++|.| |.|.+.-..|.|.|. .||.|..|.+.+
T Consensus       548 ClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi  584 (1306)
T KOG3516|consen  548 CLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSI  584 (1306)
T ss_pred             cCCccccCCCcccccccceeEeccccccccccccCCC
Confidence            47788986 689987778999997 889998886544


No 48 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=55.59  E-value=14  Score=17.81  Aligned_cols=22  Identities=36%  Similarity=0.828  Sum_probs=14.6

Q ss_pred             CEeeeCCCceeeecCCCcccCCCC
Q psy6349          20 GTCEDFGNSHRCHCMEGYTGSYCS   43 (145)
Q Consensus        20 ~~c~~~~~~~~C~c~~g~~~~~c~   43 (145)
                      ..|....  ..|.|.++|.|..|.
T Consensus        11 ~~C~~~~--G~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   11 QTCDPST--GQCVCKPGTTGPRCD   32 (49)
T ss_dssp             SSEEETC--EEESBSTTEESTTS-
T ss_pred             CcccCCC--CEEeccccccCCcCc
Confidence            3565543  368888888888874


No 49 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=54.56  E-value=25  Score=17.04  Aligned_cols=13  Identities=38%  Similarity=1.027  Sum_probs=10.4

Q ss_pred             eeecCCCcccCCC
Q psy6349          30 RCHCMEGYTGSYC   42 (145)
Q Consensus        30 ~C~c~~g~~~~~c   42 (145)
                      .|.|.+++.|..|
T Consensus        20 ~C~C~~~~~G~~C   32 (50)
T cd00055          20 QCECKPNTTGRRC   32 (50)
T ss_pred             EEeCCCcCCCCCC
Confidence            5788888888877


No 50 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=43.83  E-value=25  Score=17.10  Aligned_cols=9  Identities=44%  Similarity=1.327  Sum_probs=4.9

Q ss_pred             EeeCCCCCc
Q psy6349         106 SCSCPSGLK  114 (145)
Q Consensus       106 ~C~c~~g~~  114 (145)
                      .|.|+++|.
T Consensus        38 ~C~C~~g~~   46 (52)
T PF01683_consen   38 RCQCPPGYV   46 (52)
T ss_pred             EeECCCCCE
Confidence            455555554


No 51 
>KOG3516|consensus
Probab=41.71  E-value=32  Score=28.88  Aligned_cols=37  Identities=27%  Similarity=0.987  Sum_probs=30.5

Q ss_pred             CCCCCCCCCCCCCeeeeCCCceEEeCC-CCcccCCccc
Q psy6349          46 INECESDPCQNGATCQDLIGGYSCQCA-RGFQGQNCEL   82 (145)
Q Consensus        46 ~~~c~~~~c~~~~~c~~~~~~~~c~c~-~g~~g~~c~~   82 (145)
                      .+.|.+++|..++.|.-+-..+.|.|. .||.|..|..
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHt  582 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHT  582 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEeccccccccccccC
Confidence            466888999999999887788999996 8898877753


No 52 
>KOG3514|consensus
Probab=34.36  E-value=35  Score=28.53  Aligned_cols=35  Identities=46%  Similarity=1.161  Sum_probs=29.8

Q ss_pred             CCCCCCCCCCCeeeeCCCceEEeCC-CCcccCCccc
Q psy6349          48 ECESDPCQNGATCQDLIGGYSCQCA-RGFQGQNCEL   82 (145)
Q Consensus        48 ~c~~~~c~~~~~c~~~~~~~~c~c~-~g~~g~~c~~   82 (145)
                      .|.++||.+++.|...-+.+.|-|. .+|.|+.|.+
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence            6888999999999888888999985 6888988875


No 53 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=30.67  E-value=70  Score=15.18  Aligned_cols=13  Identities=38%  Similarity=1.048  Sum_probs=10.2

Q ss_pred             eeecCCCcccCCC
Q psy6349          30 RCHCMEGYTGSYC   42 (145)
Q Consensus        30 ~C~c~~g~~~~~c   42 (145)
                      .|.|.+++.|..|
T Consensus        19 ~C~C~~~~~G~~C   31 (46)
T smart00180       19 QCECKPNVTGRRC   31 (46)
T ss_pred             EEECCCCCCCCCC
Confidence            5788888888777


No 54 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=26.72  E-value=97  Score=18.03  Aligned_cols=26  Identities=27%  Similarity=0.759  Sum_probs=16.6

Q ss_pred             CCCCCCCEEeeCC-----CCeEeeCCCCCcc
Q psy6349          90 NPCQNGGICHDLI-----NHFSCSCPSGLKR  115 (145)
Q Consensus        90 ~~~~~~~~c~~~~-----~~~~C~c~~g~~~  115 (145)
                      ..|..++.|++..     .-|.|.|.+.+..
T Consensus        13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~   43 (103)
T PF12955_consen   13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVK   43 (103)
T ss_pred             cCCCCCceEeeccCCCccceEEEEeeccccc
Confidence            4577788887653     2367777765543


No 55 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=25.21  E-value=50  Score=16.71  Aligned_cols=17  Identities=29%  Similarity=0.970  Sum_probs=7.8

Q ss_pred             CceEEeCCCCcccCCcc
Q psy6349          65 GGYSCQCARGFQGQNCE   81 (145)
Q Consensus        65 ~~~~c~c~~g~~g~~c~   81 (145)
                      +...|.|..-|.|+.|+
T Consensus        34 G~p~CECn~Cy~GpdCS   50 (56)
T PF04863_consen   34 GSPVCECNSCYGGPDCS   50 (56)
T ss_dssp             TEE--EE-TTEESTTS-
T ss_pred             CCccccccCCcCCCCcc
Confidence            33456666666666665


Done!