Query         psy2857
Match_columns 332
No_of_seqs    188 out of 1483
Neff          10.3
Searched_HMMs 46136
Date          Fri Aug 16 18:38:55 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2857.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2857hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.6 9.9E-15 2.2E-19  134.0  14.5  199   27-320   702-913 (1289)
  2 KOG1214|consensus               99.6 2.3E-14 5.1E-19  131.6  11.8  140  123-317   715-860 (1289)
  3 KOG1217|consensus               99.5 5.7E-13 1.2E-17  124.4  18.9  270    5-329   110-403 (487)
  4 KOG1217|consensus               99.4 3.3E-11 7.3E-16  112.5  18.5  262    4-328   151-434 (487)
  5 KOG1219|consensus               99.4 1.1E-12 2.3E-17  130.9   8.3  111  143-318  3865-3976(4289)
  6 KOG1219|consensus               99.2 4.4E-11 9.6E-16  119.8   8.9  111   20-177  3865-3976(4289)
  7 KOG1225|consensus               99.1   2E-09 4.3E-14   97.8  14.0  155   81-302   233-387 (525)
  8 KOG1225|consensus               99.1 6.7E-10 1.5E-14  100.8   9.8  131    6-176   235-365 (525)
  9 KOG4260|consensus               98.9 1.5E-09 3.1E-14   88.2   6.1  135   31-175   122-270 (350)
 10 KOG4289|consensus               98.9 1.5E-09 3.3E-14  105.5   6.6   79  227-312  1229-1308(2531)
 11 KOG4289|consensus               98.9 1.3E-09 2.9E-14  105.9   5.8   83    5-89   1222-1308(2531)
 12 PF07645 EGF_CA:  Calcium-bindi  98.8   4E-09 8.7E-14   62.6   2.6   38  281-318     1-38  (42)
 13 KOG4260|consensus               98.7 1.8E-08 3.9E-13   82.0   5.8  161  128-316   132-306 (350)
 14 KOG0994|consensus               98.7 7.7E-08 1.7E-12   92.3  10.8  138  161-318   996-1157(1758)
 15 PF07645 EGF_CA:  Calcium-bindi  98.3 1.1E-06 2.4E-11   52.1   3.3   34  241-274     1-36  (42)
 16 PF00008 EGF:  EGF-like domain   98.1 2.1E-06 4.6E-11   47.4   2.7   30  245-274     1-31  (32)
 17 smart00179 EGF_CA Calcium-bind  98.0   1E-05 2.2E-10   47.1   4.2   35  281-316     1-36  (39)
 18 smart00179 EGF_CA Calcium-bind  98.0 1.2E-05 2.7E-10   46.7   4.2   35   18-52      1-37  (39)
 19 PF00008 EGF:  EGF-like domain   98.0 4.3E-06 9.2E-11   46.2   1.7   30   22-51      1-31  (32)
 20 PF12947 EGF_3:  EGF domain;  I  97.9 4.7E-06   1E-10   47.1   1.3   32  286-317     2-33  (36)
 21 KOG1226|consensus               97.9 0.00016 3.5E-09   68.0  11.5  138   27-179   469-621 (783)
 22 PF12662 cEGF:  Complement Clr-  97.9 1.2E-05 2.5E-10   40.6   2.1   23  262-284     1-24  (24)
 23 cd00054 EGF_CA Calcium-binding  97.7 9.2E-05   2E-09   42.5   4.1   35   18-52      1-36  (38)
 24 PF12662 cEGF:  Complement Clr-  97.6 4.8E-05   1E-09   38.4   2.4   24  304-328     1-24  (24)
 25 KOG1836|consensus               97.6  0.0016 3.6E-08   68.0  15.4  140  160-320   863-1022(1705)
 26 cd00054 EGF_CA Calcium-binding  97.6  0.0001 2.2E-09   42.3   3.6   34  242-275     2-36  (38)
 27 KOG1226|consensus               97.5 0.00044 9.5E-09   65.1   8.8  134    6-158   479-636 (783)
 28 PF14670 FXa_inhibition:  Coagu  97.5 6.1E-05 1.3E-09   42.5   1.7   29  290-320     6-34  (36)
 29 PF12947 EGF_3:  EGF domain;  I  97.4  0.0001 2.2E-09   41.7   2.1   29  248-276     6-34  (36)
 30 KOG0994|consensus               97.4  0.0016 3.4E-08   63.9  10.4   34  157-190   878-915 (1758)
 31 PF06247 Plasmod_Pvs28:  Plasmo  97.2 9.3E-05   2E-09   57.6   0.6  140   27-178     8-165 (197)
 32 cd00053 EGF Epidermal growth f  97.2 0.00071 1.5E-08   38.1   3.8   28  289-316     5-32  (36)
 33 cd00053 EGF Epidermal growth f  97.1 0.00096 2.1E-08   37.5   4.0   28   24-51      5-32  (36)
 34 PF06247 Plasmod_Pvs28:  Plasmo  97.1 0.00082 1.8E-08   52.4   4.2  131  124-319    20-165 (197)
 35 smart00181 EGF Epidermal growt  97.0  0.0011 2.5E-08   37.2   3.6   26  290-316     6-31  (35)
 36 smart00181 EGF Epidermal growt  96.9  0.0017 3.6E-08   36.5   3.9   29   22-51      2-31  (35)
 37 PF14670 FXa_inhibition:  Coagu  96.5  0.0028 6.2E-08   35.7   2.4   24  253-276     9-32  (36)
 38 cd01475 vWA_Matrilin VWA_Matri  96.2  0.0063 1.4E-07   50.8   4.1   41  278-320   183-223 (224)
 39 PF07974 EGF_2:  EGF-like domai  96.1   0.013 2.8E-07   32.1   3.7   26  290-317     6-31  (32)
 40 PF12661 hEGF:  Human growth fa  96.1  0.0039 8.4E-08   26.5   1.3   13  264-276     1-13  (13)
 41 PF07974 EGF_2:  EGF-like domai  96.0  0.0091   2E-07   32.7   2.8   26  248-275     6-31  (32)
 42 KOG1836|consensus               94.3    0.79 1.7E-05   48.9  13.0   51    8-58    760-816 (1705)
 43 PF12946 EGF_MSP1_1:  MSP1 EGF   94.1   0.043 9.3E-07   30.8   1.9   31  245-275     2-33  (37)
 44 PF12946 EGF_MSP1_1:  MSP1 EGF   93.1   0.053 1.2E-06   30.4   1.2   30   22-51      2-32  (37)
 45 cd01475 vWA_Matrilin VWA_Matri  93.0   0.099 2.1E-06   43.6   3.3   38  238-275   183-220 (224)
 46 KOG1218|consensus               91.2     9.7 0.00021   33.3  17.3   97   38-145    13-110 (316)
 47 smart00051 DSL delta serrate l  90.9     0.5 1.1E-05   30.5   4.0   47  262-317    16-62  (63)
 48 smart00051 DSL delta serrate l  90.1    0.56 1.2E-05   30.2   3.7   46    4-52     16-62  (63)
 49 KOG1218|consensus               90.0     6.2 0.00013   34.5  11.7   40  121-162   159-199 (316)
 50 PF01683 EB:  EB module;  Inter  83.6     1.5 3.3E-05   26.8   3.0   27  109-135    22-48  (52)
 51 PF00053 Laminin_EGF:  Laminin   83.3     1.1 2.4E-05   27.1   2.2   24  255-280    12-35  (49)
 52 PF00954 S_locus_glycop:  S-loc  81.3     1.8 3.9E-05   31.4   3.1   33  282-316    77-109 (110)
 53 cd00055 EGF_Lam Laminin-type e  80.1     2.2 4.8E-05   25.9   2.7   22  256-279    14-35  (50)
 54 PF01683 EB:  EB module;  Inter  78.0     4.5 9.7E-05   24.7   3.7   24  148-175    25-48  (52)
 55 cd00055 EGF_Lam Laminin-type e  76.2     4.8  0.0001   24.4   3.4   22  306-329    20-41  (50)
 56 PHA03099 epidermal growth fact  73.6     4.1 8.9E-05   30.0   2.9   36   18-54     41-81  (139)
 57 smart00180 EGF_Lam Laminin-typ  73.4     3.9 8.4E-05   24.4   2.4   21  255-277    12-32  (46)
 58 PF00954 S_locus_glycop:  S-loc  71.3     4.8  0.0001   29.2   3.0   34  141-175    76-109 (110)
 59 PHA02887 EGF-like protein; Pro  70.4     4.2 9.1E-05   29.4   2.4   27  250-277    94-122 (126)
 60 PF01414 DSL:  Delta serrate li  70.3     2.3   5E-05   27.4   1.0   46  262-317    16-62  (63)
 61 PHA02887 EGF-like protein; Pro  69.6     5.4 0.00012   28.9   2.8   27   27-54     94-122 (126)
 62 PF09064 Tme5_EGF_like:  Thromb  69.0     3.9 8.4E-05   22.5   1.5   13  264-276    19-31  (34)
 63 PHA03099 epidermal growth fact  65.9     5.5 0.00012   29.4   2.2   27  250-277    53-81  (139)
 64 PF12955 DUF3844:  Domain of un  64.7     6.8 0.00015   28.0   2.5   35  282-316     5-44  (103)
 65 KOG3516|consensus               63.4     6.5 0.00014   40.1   2.9   36   19-54    545-581 (1306)
 66 KOG3512|consensus               61.9      23 0.00049   32.6   5.7   25  253-279   406-430 (592)
 67 KOG3516|consensus               55.4     9.9 0.00021   38.9   2.7   39  278-318   541-580 (1306)
 68 KOG3514|consensus               53.4     9.7 0.00021   38.5   2.3   34   21-54    625-659 (1591)
 69 KOG3509|consensus               47.5      36 0.00078   34.7   5.1   71  243-317   407-477 (964)
 70 KOG3514|consensus               36.3      24 0.00051   36.0   2.0   34  244-277   625-659 (1591)
 71 PF01826 TIL:  Trypsin Inhibito  33.5      20 0.00044   22.0   0.8   21  264-284    34-54  (55)
 72 PF04863 EGF_alliinase:  Alliin  32.6      23  0.0005   21.9   0.8   30   25-54     17-50  (56)
 73 KOG3512|consensus               29.8      69  0.0015   29.6   3.6   59  259-320   368-429 (592)
 74 PF05092 PIF:  Per os infectivi  25.5 1.5E+02  0.0032   28.1   5.0   49    3-51    130-182 (522)
 75 KOG0196|consensus               24.0 1.1E+02  0.0025   30.6   4.2   56  264-324   260-329 (996)
 76 KOG0196|consensus               20.3 2.2E+02  0.0048   28.7   5.2   17  160-176   304-320 (996)

No 1  
>KOG1214|consensus
Probab=99.62  E-value=9.9e-15  Score=133.97  Aligned_cols=199  Identities=28%  Similarity=0.676  Sum_probs=137.9

Q ss_pred             CCCCCeeeeCCC-CeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeeeCCCCCCCCCCCccceeeccC
Q psy2857          27 CGVNATCIDTQG-SYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAKVACEQVDV  105 (332)
Q Consensus        27 C~~~g~C~~~~g-~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~~c~~~~~  105 (332)
                      |..++.|....+ .|.|.|..||.|+. +.|.++++|+...+.|++++.|++..+.|+|.|..||......- .|..+..
T Consensus       702 cdt~a~C~pg~~~~~tcecs~g~~gdg-r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~-tCV~i~~  779 (1289)
T KOG1214|consen  702 CDTTARCHPGTGVDYTCECSSGYQGDG-RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRH-TCVLITP  779 (1289)
T ss_pred             cCCCccccCCCCcceEEEEeeccCCCC-CCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCc-ceEEecC
Confidence            666677887744 79999999999987 67999999999889999999999999999999999987654322 3432210


Q ss_pred             CCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCC--CceecC-CCceEeeCCCCCccCCCCcccc
Q psy2857         106 TSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQ--AQCTNT-PGSFRCDCVEGYVGAPPRIKCK  182 (332)
Q Consensus       106 ~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~--~~C~~~-~~~~~C~C~~G~~g~~~~~~c~  182 (332)
                        . .+                        ...|.+.     .+.|...  .+|+.. .++|.|.|.+||.|+.      
T Consensus       780 --p-ap------------------------~n~Ce~g-----~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG------  821 (1289)
T KOG1214|consen  780 --P-AP------------------------ANPCEDG-----SHTCAIAGQARCVHHGGSTYSCACLPGFSGDG------  821 (1289)
T ss_pred             --C-CC------------------------CCccccC-----ccccCcCCceEEEecCCceEEEeecCCccCCc------
Confidence              0 00                        1223221     1333333  345544 4679999999999862      


Q ss_pred             ccCcccceeeccccCcccccccCCcccccceecccccceecccccEEecccCccccccccCccCCCCCCCCCeeeecCCc
Q psy2857         183 DVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRVDINECQSNPCGVNATCIDTQGS  262 (332)
Q Consensus       183 ~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~~  262 (332)
                                                                            ..+.++|+|.++-|..++.|++++++
T Consensus       822 ------------------------------------------------------~~c~dvDeC~psrChp~A~Cyntpgs  847 (1289)
T KOG1214|consen  822 ------------------------------------------------------HQCTDVDECSPSRCHPAATCYNTPGS  847 (1289)
T ss_pred             ------------------------------------------------------cccccccccCccccCCCceEecCCCc
Confidence                                                                  13456799999999999999999999


Q ss_pred             eeeecCCCcccCCCCcccc----cccccCC---CCCCCCCC-eee-cCCCceeeeCCCCCCCCCCCC
Q psy2857         263 YSCVCKEHYTGDPYQACSD----IDECKAL---DKPCGLRA-ICE-NTVPGFNCLCPKGYSGKPDAK  320 (332)
Q Consensus       263 ~~C~C~~G~~g~~~~~C~~----~d~C~~~---~~~C~~~~-~C~-~~~g~~~C~C~~g~~g~~~~~  320 (332)
                      |.|+|.+||.|++.. |..    .-.|...   +-.|..+. .|. ..+.+|.|.|.++-.|+...+
T Consensus       848 fsC~C~pGy~GDGf~-CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~  913 (1289)
T KOG1214|consen  848 FSCRCQPGYYGDGFQ-CVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPH  913 (1289)
T ss_pred             ceeecccCccCCCce-ecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCCCCCCC
Confidence            999999999999754 332    1223222   23354332 222 234567888888777766554


No 2  
>KOG1214|consensus
Probab=99.56  E-value=2.3e-14  Score=131.58  Aligned_cols=140  Identities=38%  Similarity=0.868  Sum_probs=109.0

Q ss_pred             CceecCCCCccCCCCeeecCCCCC-CCCCCCCCCceecCCCceEeeCCCCCccCCCCccccccCcccceeeccccCcccc
Q psy2857         123 DGLCYCRPGFDARGSVCVDVDECQ-LGDPCGPQAQCTNTPGSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLH  201 (332)
Q Consensus       123 ~~~c~C~~g~~~~g~~c~~~~~C~-~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~  201 (332)
                      .+.|.|..||.+.++.|.+.++|+ ....|.++.+|++.+++|+|.|..||.......+|..+.-               
T Consensus       715 ~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~---------------  779 (1289)
T KOG1214|consen  715 DYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITP---------------  779 (1289)
T ss_pred             ceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecC---------------
Confidence            458999999999999999999998 3467999999999999999999999987665544433210               


Q ss_pred             cccCCcccccceecccccceecccccEEecccCccccccccCccCC--CCCCCCC--eeeecC-CceeeecCCCcccCCC
Q psy2857         202 SVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRVDINECQS--NPCGVNA--TCIDTQ-GSYSCVCKEHYTGDPY  276 (332)
Q Consensus       202 ~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~--~~C~~~~--~C~~~~-~~~~C~C~~G~~g~~~  276 (332)
                                                    .       ..++.|+.  +.|..++  .|+... ++|.|+|.+||.|++.
T Consensus       780 ------------------------------p-------ap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~  822 (1289)
T KOG1214|consen  780 ------------------------------P-------APANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH  822 (1289)
T ss_pred             ------------------------------C-------CCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc
Confidence                                          0       01122322  2343333  455544 4799999999999986


Q ss_pred             CcccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857         277 QACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP  317 (332)
Q Consensus       277 ~~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~  317 (332)
                      . |.|+|+|.  +..|.++++|+|++++|.|+|.+||.|+.
T Consensus       823 ~-c~dvDeC~--psrChp~A~CyntpgsfsC~C~pGy~GDG  860 (1289)
T KOG1214|consen  823 Q-CTDVDECS--PSRCHPAATCYNTPGSFSCRCQPGYYGDG  860 (1289)
T ss_pred             c-cccccccC--ccccCCCceEecCCCcceeecccCccCCC
Confidence            4 88999997  47899999999999999999999999985


No 3  
>KOG1217|consensus
Probab=99.53  E-value=5.7e-13  Score=124.36  Aligned_cols=270  Identities=32%  Similarity=0.707  Sum_probs=186.9

Q ss_pred             EEEEecCCceecccCC--cCCCCC--CCCCCeeeeC---CCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceee
Q psy2857           5 VLVRILLGVRAIVDIN--ECQSNP--CGVNATCIDT---QGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICEN   77 (332)
Q Consensus         5 ~~c~c~~g~~~~~~~~--~C~~~~--C~~~g~C~~~---~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~   77 (332)
                      ..|.|..||.+..+..  .|...+  +..++.|.+.   ...|.|.|..||.+..+..  ..++|.....+|.+.+.|.+
T Consensus       110 ~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~--~~~~C~~~~~~c~~~~~C~~  187 (487)
T KOG1217|consen  110 YECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCET--DLDECIQYSSPCQNGGTCVN  187 (487)
T ss_pred             ceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccc--cccccccCCCCcCCCccccc
Confidence            4578999999986655  477766  3556778774   3589999999999976642  22677655667999999999


Q ss_pred             CCCCeeeeCCCCCCCCCCCccceeeccCCCCCCCCccccCCCcccC-ceecCCCCccCCCCee-ecCCCCCCCCCCCCCC
Q psy2857          78 TVPGFNCLCPKGYSGKPDAKVACEQVDVTSECSSNFECVNNAECVD-GLCYCRPGFDARGSVC-VDVDECQLGDPCGPQA  155 (332)
Q Consensus        78 ~~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~~~~c~~-~~c~C~~g~~~~g~~c-~~~~~C~~~~~C~~~~  155 (332)
                      ..++|.|.|..+|.+......                 .+...|.. ..+.+.++|.  +..| .++.++...    . +
T Consensus       188 ~~~~~~C~c~~~~~~~~~~~~-----------------~~~~~c~~~~~~~~~~g~~--~~~c~~~~~~~~~~----~-~  243 (487)
T KOG1217|consen  188 TGGSYLCSCPPGYTGSTCETT-----------------GNGGTCVDSVACSCPPGAR--GPECEVSIVECASG----D-G  243 (487)
T ss_pred             CCCCeeEeCCCCccCCcCcCC-----------------CCCceEecceeccCCCCCC--CCCcccccccccCC----C-C
Confidence            999999999999998743321                 11122222 3567777776  4444 334444311    4 8


Q ss_pred             ceecCCCceEeeCCCCCccCC-----CCccccccC-cccceeeccccCcccccccCCcccccceecccccceecccccEE
Q psy2857         156 QCTNTPGSFRCDCVEGYVGAP-----PRIKCKDVR-WEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFV  229 (332)
Q Consensus       156 ~C~~~~~~~~C~C~~G~~g~~-----~~~~c~~~~-c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~  229 (332)
                      +|++..+.+.|.|++||.+..     ....|.... |.++..|.....           .+.|.|..          +|+
T Consensus       244 ~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~-----------~~~C~C~~----------g~~  302 (487)
T KOG1217|consen  244 TCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPG-----------SYRCTCPP----------GFT  302 (487)
T ss_pred             cccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCC-----------cceeeCCC----------CCC
Confidence            899999999999999999987     234555543 666666653222           26777777          888


Q ss_pred             ecccCccccccccCcc----CCCCCCCCCee--eecCCceeeecCCCcccCCCCccccc-ccccCCCCCCCCCCeeec-C
Q psy2857         230 IEDAKRNLNRVDINEC----QSNPCGVNATC--IDTQGSYSCVCKEHYTGDPYQACSDI-DECKALDKPCGLRAICEN-T  301 (332)
Q Consensus       230 ~~~~~~~~~~~~~~~C----~~~~C~~~~~C--~~~~~~~~C~C~~G~~g~~~~~C~~~-d~C~~~~~~C~~~~~C~~-~  301 (332)
                      +..+   ..+.+..+|    ...+|.+++.|  .+..+.+.|.|..||.|..+   ++. ++|..  .++..++.|++ .
T Consensus       303 g~~~---~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C---~~~~~~C~~--~~~~~~~~c~~~~  374 (487)
T KOG1217|consen  303 GRLC---TECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRC---EDSNDECAS--SPCCPGGTCVNET  374 (487)
T ss_pred             CCCC---ccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCcc---ccCCccccC--CccccCCEeccCC
Confidence            7766   234455677    34668887788  34445788999999888754   455 48876  34778899999 7


Q ss_pred             CCceeeeCCCCCCCC-CCCCccccccccC
Q psy2857         302 VPGFNCLCPKGYSGK-PDAKVACEQEKAG  329 (332)
Q Consensus       302 ~g~~~C~C~~g~~g~-~~~~~~c~~~~~~  329 (332)
                      .++|.|.|+.+|.+. ......+..+.+.
T Consensus       375 ~~~~~c~~~~~~~~~~~~~~~~~~~~~~c  403 (487)
T KOG1217|consen  375 PGSYRCACPAGFAGKANGDGVGCEDIDEC  403 (487)
T ss_pred             CCCeEecCCCccccCCccccccccccccc
Confidence            899999999999984 2222345555443


No 4  
>KOG1217|consensus
Probab=99.38  E-value=3.3e-11  Score=112.47  Aligned_cols=262  Identities=31%  Similarity=0.619  Sum_probs=171.7

Q ss_pred             EEEEEecCCceecccC---CcCCC--CCCCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeC
Q psy2857           4 VVLVRILLGVRAIVDI---NECQS--NPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT   78 (332)
Q Consensus         4 ~~~c~c~~g~~~~~~~---~~C~~--~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~   78 (332)
                      ...|.|..||.+....   ++|..  .+|.+.+.|.+..++|.|.|.+||.+..+..-             ...+.|++.
T Consensus       151 ~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~-------------~~~~~c~~~  217 (487)
T KOG1217|consen  151 PFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT-------------GNGGTCVDS  217 (487)
T ss_pred             ceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC-------------CCCceEecc
Confidence            4678999999999432   68874  34998999999999999999999999765311             223455444


Q ss_pred             CCCeeeeCCCCCCCCCCCccceeeccCCCCCCCC-ccccCCCcccCceecCCCCccCCC-CeeecCCCCCCCCCCCCCCc
Q psy2857          79 VPGFNCLCPKGYSGKPDAKVACEQVDVTSECSSN-FECVNNAECVDGLCYCRPGFDARG-SVCVDVDECQLGDPCGPQAQ  156 (332)
Q Consensus        79 ~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~-~~C~~~~~c~~~~c~C~~g~~~~g-~~c~~~~~C~~~~~C~~~~~  156 (332)
                         +.|.+..++.+........       .+... ..|.+...  .+.|.+++||.+.. ..+.++++|.....|.++++
T Consensus       218 ---~~~~~~~g~~~~~c~~~~~-------~~~~~~~~c~~~~~--~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~  285 (487)
T KOG1217|consen  218 ---VACSCPPGARGPECEVSIV-------ECASGDGTCVNTVG--SYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGT  285 (487)
T ss_pred             ---eeccCCCCCCCCCcccccc-------cccCCCCcccccCC--ceeeeCCCCccccccceeeeccccCCCCccCCCCe
Confidence               5778888887653321100       11111 12222111  25888999998655 46778888974324888999


Q ss_pred             eecCCCceEeeCCCCCccCCCCccccc-cC---------cccceeeccccCcccccccCCcccccceecccccceecccc
Q psy2857         157 CTNTPGSFRCDCVEGYVGAPPRIKCKD-VR---------WEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLK  226 (332)
Q Consensus       157 C~~~~~~~~C~C~~G~~g~~~~~~c~~-~~---------c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~  226 (332)
                      |++..+.|.|.|++||.+..+ ..+.+ ..         |.++..|.         .......+.|.+..          
T Consensus       286 C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~g~~C~---------~~~~~~~~~C~c~~----------  345 (487)
T KOG1217|consen  286 CVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCANGGTCN---------TLGSFGGFRCACGP----------  345 (487)
T ss_pred             eecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCCCcccc---------cCCCCCCCCcCCCC----------
Confidence            999998899999999999876 22222 11         33333331         00111234455554          


Q ss_pred             cEEecccCcccccccc-CccCCCCCCCCCeeee-cCCceeeecCCCcccC---CCCcccccccccCCCCCCCCCCeeecC
Q psy2857         227 LFVIEDAKRNLNRVDI-NECQSNPCGVNATCID-TQGSYSCVCKEHYTGD---PYQACSDIDECKALDKPCGLRAICENT  301 (332)
Q Consensus       227 g~~~~~~~~~~~~~~~-~~C~~~~C~~~~~C~~-~~~~~~C~C~~G~~g~---~~~~C~~~d~C~~~~~~C~~~~~C~~~  301 (332)
                      +|++.      .+... ++|...++..++.|++ ..++|.|.|+.+|.+.   ....+.++++|..       .+.|++.
T Consensus       346 ~~~g~------~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~-------~~~c~~~  412 (487)
T KOG1217|consen  346 GFTGR------RCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG-------CGDCVNG  412 (487)
T ss_pred             CCCCC------ccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC-------Ccceecc
Confidence            44433      23334 4888888889999999 7899999999999984   2345777888754       4578888


Q ss_pred             CCceeeeCCCCCCCCCCCCcccccccc
Q psy2857         302 VPGFNCLCPKGYSGKPDAKVACEQEKA  328 (332)
Q Consensus       302 ~g~~~C~C~~g~~g~~~~~~~c~~~~~  328 (332)
                      .++|.|. ++ + .....  .|.++.+
T Consensus       413 ~~~~~c~-~~-~-~~~~~--~~~~~~~  434 (487)
T KOG1217|consen  413 PGGGACT-PP-G-LVSPG--TCDDIDE  434 (487)
T ss_pred             CCCCccc-cC-c-ccCCc--ceecccc
Confidence            9999999 77 4 33222  4555444


No 5  
>KOG1219|consensus
Probab=99.37  E-value=1.1e-12  Score=130.89  Aligned_cols=111  Identities=34%  Similarity=0.844  Sum_probs=92.8

Q ss_pred             CCCCCCCCCCCCCceecCC-CceEeeCCCCCccCCCCccccccCcccceeeccccCcccccccCCcccccceecccccce
Q psy2857         143 DECQLGDPCGPQAQCTNTP-GSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIF  221 (332)
Q Consensus       143 ~~C~~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~  221 (332)
                      +.|. .++|+++|+|...+ +.|.|.|++-|.|..                                             
T Consensus      3865 d~C~-~npCqhgG~C~~~~~ggy~CkCpsqysG~~--------------------------------------------- 3898 (4289)
T KOG1219|consen 3865 DPCN-DNPCQHGGTCISQPKGGYKCKCPSQYSGNH--------------------------------------------- 3898 (4289)
T ss_pred             cccc-cCcccCCCEecCCCCCceEEeCcccccCcc---------------------------------------------
Confidence            5565 57788888887665 667788887777643                                             


Q ss_pred             ecccccEEecccCccccccccCccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecC
Q psy2857         222 SKHLKLFVIEDAKRNLNRVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT  301 (332)
Q Consensus       222 ~~~~~g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~  301 (332)
                                      +..++..|.++||..+++|+...++|.|.|+.||+|..|+. ..+++|+.  ++|.++|.|+|+
T Consensus      3899 ----------------CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~-~Gi~eCs~--n~C~~gg~C~n~ 3959 (4289)
T KOG1219|consen 3899 ----------------CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA-RGISECSK--NVCGTGGQCINI 3959 (4289)
T ss_pred             ----------------cccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec-cccccccc--ccccCCceeecc
Confidence                            34566789999999999999999999999999999998873 23999985  799999999999


Q ss_pred             CCceeeeCCCCCCCCCC
Q psy2857         302 VPGFNCLCPKGYSGKPD  318 (332)
Q Consensus       302 ~g~~~C~C~~g~~g~~~  318 (332)
                      .|+|+|.|-+||.|..+
T Consensus      3960 ~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3960 PGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred             CCceEeccChhHhcccC
Confidence            99999999999998853


No 6  
>KOG1219|consensus
Probab=99.20  E-value=4.4e-11  Score=119.81  Aligned_cols=111  Identities=40%  Similarity=1.008  Sum_probs=96.3

Q ss_pred             CcCCCCCCCCCCeeeeC-CCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeeeCCCCCCCCCCCcc
Q psy2857          20 NECQSNPCGVNATCIDT-QGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAKV   98 (332)
Q Consensus        20 ~~C~~~~C~~~g~C~~~-~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~   98 (332)
                      ++|..+||.++|+|+.. .++|.|.|++.|.|..|+  .++.+|.  +.||..+++|+-..++|.|.|+.||+|.     
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE--i~~epC~--snPC~~GgtCip~~n~f~CnC~~gyTG~----- 3935 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE--IDLEPCA--SNPCLTGGTCIPFYNGFLCNCPNGYTGK----- 3935 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccc--ccccccc--CCCCCCCCEEEecCCCeeEeCCCCccCc-----
Confidence            78999999999999998 478999999999998876  4778886  4899999999999999999999999987     


Q ss_pred             ceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCCCceecCCCceEeeCCCCCccCCC
Q psy2857          99 ACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQAQCTNTPGSFRCDCVEGYVGAPP  177 (332)
Q Consensus        99 ~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~  177 (332)
                      +|+.-                                     .+++|+ .++|.+++.|++..|+|.|.|-+||.|..+
T Consensus      3936 ~Ce~~-------------------------------------Gi~eCs-~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3936 RCEAR-------------------------------------GISECS-KNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred             eeecc-------------------------------------cccccc-cccccCCceeeccCCceEeccChhHhcccC
Confidence            44320                                     266886 789999999999999999999999998654


No 7  
>KOG1225|consensus
Probab=99.10  E-value=2e-09  Score=97.82  Aligned_cols=155  Identities=25%  Similarity=0.550  Sum_probs=110.6

Q ss_pred             CeeeeCCCCCCCCCCCccceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCCCceecC
Q psy2857          81 GFNCLCPKGYSGKPDAKVACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQAQCTNT  160 (332)
Q Consensus        81 ~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~~~C~~~  160 (332)
                      .+.|.|..+|.+..+....|+.           .|..++.|+.+.|.|++||.  |..|.. ..|.  ..|+.++.+++.
T Consensus       233 ~~ic~c~~~~~g~~c~~~~C~~-----------~c~~~g~c~~G~CIC~~Gf~--G~dC~e-~~Cp--~~cs~~g~~~~g  296 (525)
T KOG1225|consen  233 DGICECPEGYFGPLCSTIYCPG-----------GCTGRGQCVEGRCICPPGFT--GDDCDE-LVCP--VDCSGGGVCVDG  296 (525)
T ss_pred             CceeecCCceeCCccccccCCC-----------CCcccceEeCCeEeCCCCCc--CCCCCc-ccCC--cccCCCceecCC
Confidence            4479999999998655433322           56667889999999999999  888865 3463  448887887755


Q ss_pred             CCceEeeCCCCCccCCCCccccccCcccceeeccccCcccccccCCcccccceecccccceecccccEEecccCcccccc
Q psy2857         161 PGSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRV  240 (332)
Q Consensus       161 ~~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~  240 (332)
                          .|.|++||+|..+..+=-..+|..++.|.               ...|.|.+          ||++..|..     
T Consensus       297 ----~CiC~~g~~G~dCs~~~cpadC~g~G~Ci---------------~G~C~C~~----------Gy~G~~C~~-----  342 (525)
T KOG1225|consen  297 ----ECICNPGYSGKDCSIRRCPADCSGHGKCI---------------DGECLCDE----------GYTGELCIQ-----  342 (525)
T ss_pred             ----EeecCCCccccccccccCCccCCCCCccc---------------CCceEeCC----------CCcCCcccc-----
Confidence                89999999999886532335677777776               67899999          999875542     


Q ss_pred             ccCccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecCC
Q psy2857         241 DINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTV  302 (332)
Q Consensus       241 ~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~~  302 (332)
                             ..|..++.|++     .|.|..||.|.+ .   .-+.+.. ...|.....|+...
T Consensus       343 -------~~C~~~g~cv~-----gC~C~~Gw~G~d-~---~~~~~~~-~~~cs~~~~~~~~~  387 (525)
T KOG1225|consen  343 -------RACSGGGQCVN-----GCKCKKGWRGPD-V---ADPSLLL-ITECSPPSLCIAGV  387 (525)
T ss_pred             -------cccCCCceecc-----CceeccCccCCC-c---CCchhhc-ccccCCCceeeccc
Confidence                   23778888887     399999999986 1   1222222 23566666776665


No 8  
>KOG1225|consensus
Probab=99.07  E-value=6.7e-10  Score=100.85  Aligned_cols=131  Identities=27%  Similarity=0.744  Sum_probs=100.7

Q ss_pred             EEEecCCceecccCCcCCCCCCCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeee
Q psy2857           6 LVRILLGVRAIVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCL   85 (332)
Q Consensus         6 ~c~c~~g~~~~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~   85 (332)
                      +|.|..+|++......=.+..|..++.|++..    |+|++||+|.++..    ..|   +..|+.++.+++.    .|.
T Consensus       235 ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~----CIC~~Gf~G~dC~e----~~C---p~~cs~~g~~~~g----~Ci  299 (525)
T KOG1225|consen  235 ICECPEGYFGPLCSTIYCPGGCTGRGQCVEGR----CICPPGFTGDDCDE----LVC---PVDCSGGGVCVDG----ECI  299 (525)
T ss_pred             eeecCCceeCCccccccCCCCCcccceEeCCe----EeCCCCCcCCCCCc----ccC---CcccCCCceecCC----Eee
Confidence            58899999998655444445577778898875    99999999987742    223   3448777877766    899


Q ss_pred             CCCCCCCCCCCccceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCCCceecCCCceE
Q psy2857          86 CPKGYSGKPDAKVACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQAQCTNTPGSFR  165 (332)
Q Consensus        86 C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~~~C~~~~~~~~  165 (332)
                      |.+||+|..++..+|.           ..|..++.|++++|.|.+||+  |..|...       .|.+++.|++.     
T Consensus       300 C~~g~~G~dCs~~~cp-----------adC~g~G~Ci~G~C~C~~Gy~--G~~C~~~-------~C~~~g~cv~g-----  354 (525)
T KOG1225|consen  300 CNPGYSGKDCSIRRCP-----------ADCSGHGKCIDGECLCDEGYT--GELCIQR-------ACSGGGQCVNG-----  354 (525)
T ss_pred             cCCCccccccccccCC-----------ccCCCCCcccCCceEeCCCCc--CCccccc-------ccCCCceeccC-----
Confidence            9999999977654432           378999999999999999998  7777533       27788888752     


Q ss_pred             eeCCCCCccCC
Q psy2857         166 CDCVEGYVGAP  176 (332)
Q Consensus       166 C~C~~G~~g~~  176 (332)
                      |.|..||.|.+
T Consensus       355 C~C~~Gw~G~d  365 (525)
T KOG1225|consen  355 CKCKKGWRGPD  365 (525)
T ss_pred             ceeccCccCCC
Confidence            89999999876


No 9  
>KOG4260|consensus
Probab=98.95  E-value=1.5e-09  Score=88.18  Aligned_cols=135  Identities=29%  Similarity=0.623  Sum_probs=89.6

Q ss_pred             CeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceee---CCCCeeeeCCCCCCCCCCCccceeeccC--
Q psy2857          31 ATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICEN---TVPGFNCLCPKGYSGKPDAKVACEQVDV--  105 (332)
Q Consensus        31 g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~---~~~~~~C~C~~g~~~~~~~~~~c~~~~~--  105 (332)
                      ..|+...   .--|+.|..|.++..|...     ...+|..++.|..   ..|+-.|.|..||.|..+..---+....  
T Consensus       122 WlCvdqL---kvCCp~gtyGpdCl~Cpgg-----ser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~R  193 (350)
T KOG4260|consen  122 WLCVDQL---KVCCPDGTYGPDCLQCPGG-----SERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSR  193 (350)
T ss_pred             Hhhhhhh---eeccCCCCcCCccccCCCC-----CcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhc
Confidence            3454443   3447889999888766542     2367998999963   3456799999999998543210000000  


Q ss_pred             ---CCCCCC-CccccCCCccc----CceecCCCCccCCCCeeecCCCCC-CCCCCCCCCceecCCCceEeeCCCCCccC
Q psy2857         106 ---TSECSS-NFECVNNAECV----DGLCYCRPGFDARGSVCVDVDECQ-LGDPCGPQAQCTNTPGSFRCDCVEGYVGA  175 (332)
Q Consensus       106 ---~~~c~~-~~~C~~~~~c~----~~~c~C~~g~~~~g~~c~~~~~C~-~~~~C~~~~~C~~~~~~~~C~C~~G~~g~  175 (332)
                         ...|.+ ...|..  .|.    ..--.|..||..+...|+|+++|. .+.+|..+..|+|+.|+|.|...+||.+.
T Consensus       194 ne~~lvCt~Ch~~C~~--~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g  270 (350)
T KOG4260|consen  194 NEQHLVCTACHEGCLG--VCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG  270 (350)
T ss_pred             ccccchhhhhhhhhhc--ccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC
Confidence               001111 011211  111    123468999999889999999997 67899999999999999999999999863


No 10 
>KOG4289|consensus
Probab=98.93  E-value=1.5e-09  Score=105.48  Aligned_cols=79  Identities=32%  Similarity=0.745  Sum_probs=65.8

Q ss_pred             cEEecccCccccccccCccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecC-CCce
Q psy2857         227 LFVIEDAKRNLNRVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT-VPGF  305 (332)
Q Consensus       227 g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~-~g~~  305 (332)
                      |||+.+     +..++|+|-+.||.++++|....|+|+|.|++||+|.+|+.-...-.|..  +.|.++++|++. .|+|
T Consensus      1229 GFTgd~-----CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvp--GvC~nggtC~~~~nggf 1301 (2531)
T KOG4289|consen 1229 GFTGDY-----CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVP--GVCKNGGTCVNLLNGGF 1301 (2531)
T ss_pred             CCCccc-----ccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcccc--ceecCCCEEeecCCCce
Confidence            666653     35688999999999999999999999999999999998873333455654  689999999865 6889


Q ss_pred             eeeCCCC
Q psy2857         306 NCLCPKG  312 (332)
Q Consensus       306 ~C~C~~g  312 (332)
                      .|.|+.|
T Consensus      1302 ~c~Cp~g 1308 (2531)
T KOG4289|consen 1302 CCHCPYG 1308 (2531)
T ss_pred             eccCCCc
Confidence            9999988


No 11 
>KOG4289|consensus
Probab=98.92  E-value=1.3e-09  Score=105.88  Aligned_cols=83  Identities=31%  Similarity=0.740  Sum_probs=73.1

Q ss_pred             EEEEecCCceec---ccCCcCCCCCCCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeC-CC
Q psy2857           5 VLVRILLGVRAI---VDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT-VP   80 (332)
Q Consensus         5 ~~c~c~~g~~~~---~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~-~~   80 (332)
                      ..|+|++||+|+   +.||.|-+.||+++|.|..-.|+|.|.|.+||+|..|+.-.....|  .+..|+++++|.+. .+
T Consensus      1222 lrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrC--vpGvC~nggtC~~~~ng 1299 (2531)
T KOG4289|consen 1222 LRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRC--VPGVCKNGGTCVNLLNG 1299 (2531)
T ss_pred             eeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcc--ccceecCCCEEeecCCC
Confidence            579999999999   7799999999999999999999999999999999887644455566  36899999999886 56


Q ss_pred             CeeeeCCCC
Q psy2857          81 GFNCLCPKG   89 (332)
Q Consensus        81 ~~~C~C~~g   89 (332)
                      +|.|.|+.|
T Consensus      1300 gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1300 GFCCHCPYG 1308 (2531)
T ss_pred             ceeccCCCc
Confidence            899999987


No 12 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, with disulphide bonds linking cysteines 1-3, 2-4 and 5-6. 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.79  E-value=4e-09  Score=62.62  Aligned_cols=38  Identities=39%  Similarity=0.804  Sum_probs=34.3

Q ss_pred             ccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCC
Q psy2857         281 DIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPD  318 (332)
Q Consensus       281 ~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~  318 (332)
                      |||||....+.|..++.|+|+.|+|+|.|++||+....
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~   38 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDD   38 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTT
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCC
Confidence            68999998899998999999999999999999995443


No 13 
>KOG4260|consensus
Probab=98.74  E-value=1.8e-08  Score=81.97  Aligned_cols=161  Identities=30%  Similarity=0.588  Sum_probs=98.1

Q ss_pred             CCCCccCCCCeeecCCCCC--CCCCCCCCCceec---CCCceEeeCCCCCccCCCCcccc-----ccCcccceeeccccC
Q psy2857         128 CRPGFDARGSVCVDVDECQ--LGDPCGPQAQCTN---TPGSFRCDCVEGYVGAPPRIKCK-----DVRWEFNVTLLFYET  197 (332)
Q Consensus       128 C~~g~~~~g~~c~~~~~C~--~~~~C~~~~~C~~---~~~~~~C~C~~G~~g~~~~~~c~-----~~~c~~~~~c~~~~~  197 (332)
                      |++|-.  |..|.   .|.  ...+|..++.|..   ..|+..|.|.+||.|..+.. |.     ..+=..+..|.-...
T Consensus       132 Cp~gty--GpdCl---~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~-Cg~eyfes~Rne~~lvCt~Ch~  205 (350)
T KOG4260|consen  132 CPDGTY--GPDCL---QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY-CGIEYFESSRNEQHLVCTACHE  205 (350)
T ss_pred             cCCCCc--CCccc---cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc-cchHHHHhhcccccchhhhhhh
Confidence            555554  55553   221  1245777777763   34778999999999976532 10     000001111110000


Q ss_pred             cccccccCCcccccceecc-cccceecccccEEecccCccccccccCccCC--CCCCCCCeeeecCCceeeecCCCcccC
Q psy2857         198 DYLHSVASDISDILTIIHE-FSRIFSKHLKLFVIEDAKRNLNRVDINECQS--NPCGVNATCIDTQGSYSCVCKEHYTGD  274 (332)
Q Consensus       198 ~~~~~~~~~~~~~~c~c~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g~  274 (332)
                      ...         .  .|.. .....+.+..||...    ...|+|||+|..  .||..+..|+|+.|||.|.+++||.+.
T Consensus       206 ~C~---------~--~Csg~~~k~C~kCkkGW~ld----e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g  270 (350)
T KOG4260|consen  206 GCL---------G--VCSGESSKGCSKCKKGWKLD----EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG  270 (350)
T ss_pred             hhh---------c--ccCCCCCCChhhhcccceec----ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC
Confidence            000         0  0111 122344556688665    346899999965  679999999999999999999999762


Q ss_pred             CCCcccccccccCCCCCCC-CCCeeecCCCceeeeCCCCCCCC
Q psy2857         275 PYQACSDIDECKALDKPCG-LRAICENTVPGFNCLCPKGYSGK  316 (332)
Q Consensus       275 ~~~~C~~~d~C~~~~~~C~-~~~~C~~~~g~~~C~C~~g~~g~  316 (332)
                             +|+|......|. .+..|.|+.++|+|+|..|+.-.
T Consensus       271 -------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~~  306 (350)
T KOG4260|consen  271 -------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLIII  306 (350)
T ss_pred             -------hHHhhhhhhhcccCCCCcccCCccEEEEecccceee
Confidence                   466654333443 25789999999999999887543


No 14 
>KOG0994|consensus
Probab=98.74  E-value=7.7e-08  Score=92.32  Aligned_cols=138  Identities=22%  Similarity=0.440  Sum_probs=68.8

Q ss_pred             CCceEeeCCCCCccCCCCccccccCcccceeeccccCcccccccCCcccccceecccccceecccccEEecccCc---cc
Q psy2857         161 PGSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKR---NL  237 (332)
Q Consensus       161 ~~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~---~~  237 (332)
                      .|.+--.|..||.|+.....|+...|..-++-.        ....+..+..|.|.+          .+.+..|.+   +.
T Consensus       996 eG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~--------~~~CDr~tGQCpClp----------Nv~G~~CDqCA~N~ 1057 (1758)
T KOG0994|consen  996 EGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNS--------TCHCDRFTGQCPCLP----------NVQGVRCDQCAENH 1057 (1758)
T ss_pred             cccchhhccccchhHHHHhhhhhheccccccCC--------ccccccccCcCCCCc----------ccccccccccccch
Confidence            344434789999999888777766654433321        122344455566665          222222210   00


Q ss_pred             ccc-ccCccCCCCCC--CCCeeeecCCceeeecCCCcccCCCCcccccc-----------cccCC---CCCCCC-CC--e
Q psy2857         238 NRV-DINECQSNPCG--VNATCIDTQGSYSCVCKEHYTGDPYQACSDID-----------ECKAL---DKPCGL-RA--I  297 (332)
Q Consensus       238 ~~~-~~~~C~~~~C~--~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d-----------~C~~~---~~~C~~-~~--~  297 (332)
                      --. .-..|++-.|.  ..-+|....|  +|+|++||-|..|..|++.-           +|..-   .-.|.. .|  .
T Consensus      1058 w~laSG~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~ 1135 (1758)
T KOG0994|consen 1058 WNLASGEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCV 1135 (1758)
T ss_pred             hccccCCCCCccCCCccCCcccccccc--ceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCcee
Confidence            000 00012211121  1225655555  89999999998776665421           12110   012332 22  3


Q ss_pred             eecCCCceeee-CCCCCCCCCC
Q psy2857         298 CENTVPGFNCL-CPKGYSGKPD  318 (332)
Q Consensus       298 C~~~~g~~~C~-C~~g~~g~~~  318 (332)
                      |....++++|. |..||.|.-.
T Consensus      1136 C~~Gv~G~rCdqCaRgy~G~fP 1157 (1758)
T KOG0994|consen 1136 CRPGVGGPRCDQCARGYSGQFP 1157 (1758)
T ss_pred             ecCCCCCcchhhhhhhhcCCCC
Confidence            44566667773 7777777643


No 15 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, with disulphide bonds linking cysteines 1-3, 2-4 and 5-6. 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.25  E-value=1.1e-06  Score=52.07  Aligned_cols=34  Identities=47%  Similarity=1.037  Sum_probs=29.7

Q ss_pred             ccCccCC--CCCCCCCeeeecCCceeeecCCCcccC
Q psy2857         241 DINECQS--NPCGVNATCIDTQGSYSCVCKEHYTGD  274 (332)
Q Consensus       241 ~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g~  274 (332)
                      |||||..  ..|..++.|+|+.|+|+|.|++||...
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~   36 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN   36 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence            5789976  469889999999999999999999843


No 16 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.14  E-value=2.1e-06  Score=47.40  Aligned_cols=30  Identities=53%  Similarity=1.198  Sum_probs=27.0

Q ss_pred             cCCCCCCCCCeeeecC-CceeeecCCCcccC
Q psy2857         245 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGD  274 (332)
Q Consensus       245 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~  274 (332)
                      |.++||.++++|++.. ++|+|.|++||+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            4457899999999999 99999999999986


No 17 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.02  E-value=1e-05  Score=47.08  Aligned_cols=35  Identities=49%  Similarity=1.099  Sum_probs=29.9

Q ss_pred             ccccccCCCCCCCCCCeeecCCCceeeeCCCCCC-CC
Q psy2857         281 DIDECKALDKPCGLRAICENTVPGFNCLCPKGYS-GK  316 (332)
Q Consensus       281 ~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~-g~  316 (332)
                      ++|+|... .+|.++++|+++.++|.|.|++||+ |.
T Consensus         1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~~g~   36 (39)
T smart00179        1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYTDGR   36 (39)
T ss_pred             CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCccCC
Confidence            46888754 6899888999999999999999999 54


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.99  E-value=1.2e-05  Score=46.70  Aligned_cols=35  Identities=54%  Similarity=1.138  Sum_probs=30.6

Q ss_pred             cCCcCCC-CCCCCCCeeeeCCCCeEeeCCCCCc-cCC
Q psy2857          18 DINECQS-NPCGVNATCIDTQGSYSCVCKEHYT-GDP   52 (332)
Q Consensus        18 ~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~-g~~   52 (332)
                      ++++|.. .+|..+++|+++.++|.|.|++||. |..
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~   37 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN   37 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCc
Confidence            4688887 7899889999999999999999998 643


No 19 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.97  E-value=4.3e-06  Score=46.17  Aligned_cols=30  Identities=53%  Similarity=1.198  Sum_probs=27.2

Q ss_pred             CCCCCCCCCCeeeeCC-CCeEeeCCCCCccC
Q psy2857          22 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGD   51 (332)
Q Consensus        22 C~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~   51 (332)
                      |.+.+|.++|+|++.. +.|.|+|++||+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            5667899999999998 99999999999985


No 20 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.91  E-value=4.7e-06  Score=47.11  Aligned_cols=32  Identities=31%  Similarity=0.664  Sum_probs=25.4

Q ss_pred             cCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857         286 KALDKPCGLRAICENTVPGFNCLCPKGYSGKP  317 (332)
Q Consensus       286 ~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~  317 (332)
                      ....+.|..+++|+++.++|.|+|++||+|+.
T Consensus         2 ~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    2 LENNGGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             TTGGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            34457899999999999999999999999985


No 21 
>KOG1226|consensus
Probab=97.89  E-value=0.00016  Score=67.96  Aligned_cols=138  Identities=30%  Similarity=0.671  Sum_probs=87.9

Q ss_pred             CCCCCeeeeCCCCeEeeCCCCCccCCCCCCC-------CcccccCC--CCCCCCCCceeeCCCCeeeeCCCCCCCCCCCc
Q psy2857          27 CGVNATCIDTQGSYSCVCKEHYTGDPYQACS-------DIDECKAL--DKPCGLRAICENTVPGFNCLCPKGYSGKPDAK   97 (332)
Q Consensus        27 C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~-------~~~~c~~~--~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~   97 (332)
                      |+.+|...-..    |.|.+||.|..++--.       ..+.|...  ..+|+.+|.|.=.    .|.|.+...+.-.++
T Consensus       469 C~g~G~~~CG~----C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~  540 (783)
T KOG1226|consen  469 CHGNGTFVCGQ----CRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK  540 (783)
T ss_pred             cCCCCcEEecc----eecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeee
Confidence            55555554443    8999999998774111       12334321  2379999999776    789988776432222


Q ss_pred             cceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeee---cCCCCC--CCCCCCCCCceecCCCceEeeCCCC-
Q psy2857          98 VACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCV---DVDECQ--LGDPCGPQAQCTNTPGSFRCDCVEG-  171 (332)
Q Consensus        98 ~~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~---~~~~C~--~~~~C~~~~~C~~~~~~~~C~C~~G-  171 (332)
                       .|+.-++.=+-.....|..++.|.-++|.|.+||+  |..|.   +.+.|.  ....|..+++|.=.    +|.|... 
T Consensus       541 -fCECDnfsC~r~~g~lC~g~G~C~CG~CvC~~Gwt--G~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~  613 (783)
T KOG1226|consen  541 -FCECDNFSCERHKGVLCGGHGRCECGRCVCNPGWT--GSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPP  613 (783)
T ss_pred             -eeeccCcccccccCcccCCCCeEeCCcEEcCCCCc--cCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCC
Confidence             56544432111234578889999999999999999  66663   455565  23467777777643    6777655 


Q ss_pred             CccCCCCc
Q psy2857         172 YVGAPPRI  179 (332)
Q Consensus       172 ~~g~~~~~  179 (332)
                      |.|..++.
T Consensus       614 ~sG~~CE~  621 (783)
T KOG1226|consen  614 YSGEFCEK  621 (783)
T ss_pred             cCcchhhc
Confidence            88876654


No 22 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.86  E-value=1.2e-05  Score=40.64  Aligned_cols=23  Identities=48%  Similarity=1.028  Sum_probs=19.3

Q ss_pred             ceeeecCCCcccCCC-Cccccccc
Q psy2857         262 SYSCVCKEHYTGDPY-QACSDIDE  284 (332)
Q Consensus       262 ~~~C~C~~G~~g~~~-~~C~~~d~  284 (332)
                      ||+|+|++||++... ..|+||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            689999999997644 67999986


No 23 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.65  E-value=9.2e-05  Score=42.53  Aligned_cols=35  Identities=54%  Similarity=1.146  Sum_probs=30.2

Q ss_pred             cCCcCCC-CCCCCCCeeeeCCCCeEeeCCCCCccCC
Q psy2857          18 DINECQS-NPCGVNATCIDTQGSYSCVCKEHYTGDP   52 (332)
Q Consensus        18 ~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~g~~   52 (332)
                      ++++|.. .+|..++.|++..+.|.|.|++||.|..
T Consensus         1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~   36 (38)
T cd00054           1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN   36 (38)
T ss_pred             CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCc
Confidence            3578877 7898889999999999999999999853


No 24 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.65  E-value=4.8e-05  Score=38.43  Aligned_cols=24  Identities=38%  Similarity=1.035  Sum_probs=21.1

Q ss_pred             ceeeeCCCCCCCCCCCCcccccccc
Q psy2857         304 GFNCLCPKGYSGKPDAKVACEQEKA  328 (332)
Q Consensus       304 ~~~C~C~~g~~g~~~~~~~c~~~~~  328 (332)
                      ||+|.|++||...+.++ .|.+|+|
T Consensus         1 sy~C~C~~Gy~l~~d~~-~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGR-SCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCC-ccccCCC
Confidence            68999999999998775 8999875


No 25 
>KOG1836|consensus
Probab=97.64  E-value=0.0016  Score=67.96  Aligned_cols=140  Identities=21%  Similarity=0.359  Sum_probs=70.0

Q ss_pred             CCCceEeeCCCCCccCCCC----ccccccCcccceeeccccCcccccccCCcccccceeccccc--ceecccccEEeccc
Q psy2857         160 TPGSFRCDCVEGYVGAPPR----IKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSR--IFSKHLKLFVIEDA  233 (332)
Q Consensus       160 ~~~~~~C~C~~G~~g~~~~----~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~--~~~~~~~g~~~~~~  233 (332)
                      +.+.+.=.|.+||.|++-.    ..|....|...+.-..       ....+....-|.|.+...  ....++.|+..-..
T Consensus       863 T~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~-------~~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~s  935 (1705)
T KOG1836|consen  863 TAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELP-------SLTCNPVTGQCECKPNVEGRDCLYCFKGFFNLNS  935 (1705)
T ss_pred             cccccccccccCccccccCCCcCCccccccCccCCcccc-------cccCCCcccceeccCCCCccccccccccccccCC
Confidence            3344444788999987654    3455544443333220       112223344555555332  23444555543321


Q ss_pred             CccccccccCccCCCCCCC----CCeeeecCCceeeecCCCcccCCCCcccccc------cccCCCCCCCCC----Ceee
Q psy2857         234 KRNLNRVDINECQSNPCGV----NATCIDTQGSYSCVCKEHYTGDPYQACSDID------ECKALDKPCGLR----AICE  299 (332)
Q Consensus       234 ~~~~~~~~~~~C~~~~C~~----~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d------~C~~~~~~C~~~----~~C~  299 (332)
                      .        ..|++-+|..    +..|+...|  +|.|.+|-+|.++..|....      .|..  -.|...    ..|.
T Consensus       936 ~--------~gC~~c~c~~~gs~~~~c~~~tG--qc~c~~gVtgqrc~qc~~~~~~~~~~gc~~--c~c~~~Gs~~~qc~ 1003 (1705)
T KOG1836|consen  936 G--------VGCEPCNCDPTGSESSDCDVGTG--QCYCRPGVTGQRCDQCETYHFGFQTEGCGL--CECDPLGSRGFQCD 1003 (1705)
T ss_pred             C--------CCcccccccccccccccccccCC--ceeeecCccccccCccccCcccccccCCcc--eecccCCcccceec
Confidence            1        1233333322    235665555  89999999998876554321      1110  112222    2455


Q ss_pred             cCCCceeeeCCCCCCCCCCCC
Q psy2857         300 NTVPGFNCLCPKGYSGKPDAK  320 (332)
Q Consensus       300 ~~~g~~~C~C~~g~~g~~~~~  320 (332)
                      ...|  +|.|++++.|..+..
T Consensus      1004 ~~~G--~c~c~~~~~g~~c~~ 1022 (1705)
T KOG1836|consen 1004 PEDG--QCPCRPGFEGRRCDQ 1022 (1705)
T ss_pred             ccCC--eeeecCCCCCccccc
Confidence            5455  677888877765544


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.58  E-value=0.0001  Score=42.32  Aligned_cols=34  Identities=53%  Similarity=1.134  Sum_probs=29.2

Q ss_pred             cCccCC-CCCCCCCeeeecCCceeeecCCCcccCC
Q psy2857         242 INECQS-NPCGVNATCIDTQGSYSCVCKEHYTGDP  275 (332)
Q Consensus       242 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~~  275 (332)
                      +++|.. .+|.+++.|++..++|.|.|++||.|..
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~   36 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN   36 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCc
Confidence            467776 6888889999999999999999999864


No 27 
>KOG1226|consensus
Probab=97.54  E-value=0.00044  Score=65.15  Aligned_cols=134  Identities=28%  Similarity=0.656  Sum_probs=86.5

Q ss_pred             EEEecCCceeccc------------CCcCCCC----CCCCCCeeeeCCCCeEeeCCCCCc----cCCCCCCCCcccccCC
Q psy2857           6 LVRILLGVRAIVD------------INECQSN----PCGVNATCIDTQGSYSCVCKEHYT----GDPYQACSDIDECKAL   65 (332)
Q Consensus         6 ~c~c~~g~~~~~~------------~~~C~~~----~C~~~g~C~~~~g~~~C~C~~G~~----g~~~~~C~~~~~c~~~   65 (332)
                      .|+|.+||.|...            .+.|...    +|+.+|.|.=..    |+|.+...    |..++ | +-..|...
T Consensus       479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGq----C~C~~~~~~~i~G~fCE-C-DnfsC~r~  552 (783)
T KOG1226|consen  479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQ----CVCHKPDNGKIYGKFCE-C-DNFSCERH  552 (783)
T ss_pred             ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCc----eEecCCCCCceeeeeee-c-cCcccccc
Confidence            6899999999811            1334332    599999998876    99988766    54432 1 11223221


Q ss_pred             -CCCCCCCCceeeCCCCeeeeCCCCCCCCCCCccceeeccCCCCCCC--CccccCCCcccCceecCCCC-ccCCCCeeec
Q psy2857          66 -DKPCGLRAICENTVPGFNCLCPKGYSGKPDAKVACEQVDVTSECSS--NFECVNNAECVDGLCYCRPG-FDARGSVCVD  141 (332)
Q Consensus        66 -~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~--~~~C~~~~~c~~~~c~C~~g-~~~~g~~c~~  141 (332)
                       -..|+.++.|.-.    +|.|.+||+|..+.   |+  .-+..|..  ...|+..+.|.-++|.|.+. |.  |..|..
T Consensus       553 ~g~lC~g~G~C~CG----~CvC~~GwtG~~C~---C~--~std~C~~~~G~iCSGrG~C~Cg~C~C~~~~~s--G~~CE~  621 (783)
T KOG1226|consen  553 KGVLCGGHGRCECG----RCVCNPGWTGSACN---CP--LSTDTCESSDGQICSGRGTCECGRCKCTDPPYS--GEFCEK  621 (783)
T ss_pred             cCcccCCCCeEeCC----cEEcCCCCccCCCC---CC--CCCccccCCCCceeCCCceeeCCceEcCCCCcC--cchhhc
Confidence             2358889999776    79999999999553   21  11223332  34788888888889999865 77  788854


Q ss_pred             CCCCCCCCCCCCCCcee
Q psy2857         142 VDECQLGDPCGPQAQCT  158 (332)
Q Consensus       142 ~~~C~~~~~C~~~~~C~  158 (332)
                      -..|  +.+|.....|+
T Consensus       622 cptc--~~~C~~~~~Cv  636 (783)
T KOG1226|consen  622 CPTC--PDPCAENKSCV  636 (783)
T ss_pred             CCCC--CCcccccccch
Confidence            3334  34566555554


No 28 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.50  E-value=6.1e-05  Score=42.49  Aligned_cols=29  Identities=38%  Similarity=0.870  Sum_probs=23.3

Q ss_pred             CCCCCCCeeecCCCceeeeCCCCCCCCCCCC
Q psy2857         290 KPCGLRAICENTVPGFNCLCPKGYSGKPDAK  320 (332)
Q Consensus       290 ~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~  320 (332)
                      +.|.  .+|++.+++|+|.|++||+..++.+
T Consensus         6 GgC~--h~C~~~~g~~~C~C~~Gy~L~~D~~   34 (36)
T PF14670_consen    6 GGCS--HICVNTPGSYRCSCPPGYKLAEDGR   34 (36)
T ss_dssp             GGSS--SEEEEETTSEEEE-STTEEE-TTSS
T ss_pred             CCcC--CCCccCCCceEeECCCCCEECcCCC
Confidence            4563  6999999999999999999998765


No 29 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.43  E-value=0.0001  Score=41.68  Aligned_cols=29  Identities=52%  Similarity=1.094  Sum_probs=23.4

Q ss_pred             CCCCCCCeeeecCCceeeecCCCcccCCC
Q psy2857         248 NPCGVNATCIDTQGSYSCVCKEHYTGDPY  276 (332)
Q Consensus       248 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~  276 (332)
                      ..|..+++|+++.++|.|+|++||.|++.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~   34 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF   34 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence            46888999999999999999999999864


No 30 
>KOG0994|consensus
Probab=97.37  E-value=0.0016  Score=63.86  Aligned_cols=34  Identities=26%  Similarity=0.734  Sum_probs=22.7

Q ss_pred             eecCCCceEe-eCCCCCccCCC---CccccccCcccce
Q psy2857         157 CTNTPGSFRC-DCVEGYVGAPP---RIKCKDVRWEFNV  190 (332)
Q Consensus       157 C~~~~~~~~C-~C~~G~~g~~~---~~~c~~~~c~~~~  190 (332)
                      |.+...++.| .|..||.|++.   ...|+..+|-.+.
T Consensus       878 CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp  915 (1758)
T KOG0994|consen  878 CQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGP  915 (1758)
T ss_pred             ccccccccchhhhhccccCCcccCCCCCCCCCCCCCCC
Confidence            4455566777 79999999864   3467766665443


No 31 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.23  E-value=9.3e-05  Score=57.55  Aligned_cols=140  Identities=27%  Similarity=0.697  Sum_probs=86.9

Q ss_pred             CCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccC---CCCCCCCCCceeeCC-----CCeeeeCCCCCCCCCCCcc
Q psy2857          27 CGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKA---LDKPCGLRAICENTV-----PGFNCLCPKGYSGKPDAKV   98 (332)
Q Consensus        27 C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~---~~~~C~~~~~C~~~~-----~~~~C~C~~g~~~~~~~~~   98 (332)
                      |. +|..+.+...|.|.|.+||......+|+...+|..   ...+|..-+.|++..     ..|.|.|.+||.....   
T Consensus         8 CK-NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~---   83 (197)
T PF06247_consen    8 CK-NGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG---   83 (197)
T ss_dssp             -B-TEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS---
T ss_pred             cc-CCEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC---
Confidence            54 67788888899999999999876677877777654   345798889998765     3699999999987654   


Q ss_pred             ceeeccCCCCCCCCccccCCCccc-------CceecCCCCcc-CCCCeeec--CCCCCCCCCCCCCCceecCCCceEeeC
Q psy2857          99 ACEQVDVTSECSSNFECVNNAECV-------DGLCYCRPGFD-ARGSVCVD--VDECQLGDPCGPQAQCTNTPGSFRCDC  168 (332)
Q Consensus        99 ~c~~~~~~~~c~~~~~C~~~~~c~-------~~~c~C~~g~~-~~g~~c~~--~~~C~~~~~C~~~~~C~~~~~~~~C~C  168 (332)
                      .|...    .|.. ..|. .+.|+       ...|+|.-|.. .+...|..  ...|+  -.|..+..|....+-|.|.+
T Consensus        84 vCvp~----~C~~-~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~--LKCk~nE~CK~~~~~Y~C~~  155 (197)
T PF06247_consen   84 VCVPN----KCNN-KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS--LKCKENEECKLVDGYYKCVC  155 (197)
T ss_dssp             SEEEG----GGSS----T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE----------TTTEEEEEETTEEEEEE
T ss_pred             eEchh----hcCc-eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee--eecCCCcceeeeCcEEEeec
Confidence            23211    1111 1333 23332       23799998887 56777753  34454  46778889999999999999


Q ss_pred             CCCCccCCCC
Q psy2857         169 VEGYVGAPPR  178 (332)
Q Consensus       169 ~~G~~g~~~~  178 (332)
                      ..+|.++...
T Consensus       156 ~~~~~~~~~~  165 (197)
T PF06247_consen  156 KEGFPGDGEG  165 (197)
T ss_dssp             -TT-EEETTT
T ss_pred             CCCCCCCCCc
Confidence            9999886544


No 32 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF), present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.17  E-value=0.00071  Score=38.07  Aligned_cols=28  Identities=39%  Similarity=1.068  Sum_probs=25.3

Q ss_pred             CCCCCCCCeeecCCCceeeeCCCCCCCC
Q psy2857         289 DKPCGLRAICENTVPGFNCLCPKGYSGK  316 (332)
Q Consensus       289 ~~~C~~~~~C~~~~g~~~C~C~~g~~g~  316 (332)
                      ..+|.+++.|++..++|.|.|+.||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            3578888999999999999999999988


No 33 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF), present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.11  E-value=0.00096  Score=37.52  Aligned_cols=28  Identities=61%  Similarity=1.280  Sum_probs=25.4

Q ss_pred             CCCCCCCCeeeeCCCCeEeeCCCCCccC
Q psy2857          24 SNPCGVNATCIDTQGSYSCVCKEHYTGD   51 (332)
Q Consensus        24 ~~~C~~~g~C~~~~g~~~C~C~~G~~g~   51 (332)
                      ..+|..++.|++..+.|.|.|+.||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            5678888999999999999999999986


No 34 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.05  E-value=0.00082  Score=52.44  Aligned_cols=131  Identities=27%  Similarity=0.751  Sum_probs=76.3

Q ss_pred             ceecCCCCccC-CCCeeecCCCCCC----CCCCCCCCceecCC-----CceEeeCCCCCccCCCCccccccCcccceeec
Q psy2857         124 GLCYCRPGFDA-RGSVCVDVDECQL----GDPCGPQAQCTNTP-----GSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLL  193 (332)
Q Consensus       124 ~~c~C~~g~~~-~g~~c~~~~~C~~----~~~C~~~~~C~~~~-----~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~  193 (332)
                      +.|.|.+||-. +...|....+|..    ..+|...++|++..     ..|.|.|.+||.....  .|..          
T Consensus        20 fEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~--vCvp----------   87 (197)
T PF06247_consen   20 FECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG--VCVP----------   87 (197)
T ss_dssp             EEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS--SEEE----------
T ss_pred             eEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC--eEch----------
Confidence            58999999953 3567777667752    35688888998765     5688999999886421  1111          


Q ss_pred             cccCcccccccCCcccccceecccccceecccccEEecccCccccccccCccCCCCCCCCCeeeec---CCceeeecCCC
Q psy2857         194 FYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRVDINECQSNPCGVNATCIDT---QGSYSCVCKEH  270 (332)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~---~~~~~C~C~~G  270 (332)
                                                                       ..|....|. .|.|+-.   +....|+|.-|
T Consensus        88 -------------------------------------------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IG  117 (197)
T PF06247_consen   88 -------------------------------------------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIG  117 (197)
T ss_dssp             -------------------------------------------------GGGSS---T-TEEEEEEEGGGSEEEEEE-TE
T ss_pred             -------------------------------------------------hhcCceecC-CCeEEecCCCCCCceeEeeec
Confidence                                                             133334454 5667532   23458999999


Q ss_pred             cccCCCCccccc--ccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCCC
Q psy2857         271 YTGDPYQACSDI--DECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDA  319 (332)
Q Consensus       271 ~~g~~~~~C~~~--d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~  319 (332)
                      +..+....|...  -+|.+   .|..+..|....+-|.|.|.++|.++...
T Consensus       118 kV~~dn~kCtk~G~T~C~L---KCk~nE~CK~~~~~Y~C~~~~~~~~~~~~  165 (197)
T PF06247_consen  118 KVPDDNKKCTKTGETKCSL---KCKENEECKLVDGYYKCVCKEGFPGDGEG  165 (197)
T ss_dssp             EETTTTTESEEEE-----------TTTEEEEEETTEEEEEE-TT-EEETTT
T ss_pred             eEeccCCcccCCCccceee---ecCCCcceeeeCcEEEeecCCCCCCCCCc
Confidence            984444445432  33543   67788999999999999999999877654


No 35 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.00  E-value=0.0011  Score=37.19  Aligned_cols=26  Identities=42%  Similarity=1.055  Sum_probs=23.5

Q ss_pred             CCCCCCCeeecCCCceeeeCCCCCCCC
Q psy2857         290 KPCGLRAICENTVPGFNCLCPKGYSGK  316 (332)
Q Consensus       290 ~~C~~~~~C~~~~g~~~C~C~~g~~g~  316 (332)
                      .+|.++ +|++..++|.|.|++||.|+
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccC
Confidence            578877 99999999999999999995


No 36 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.94  E-value=0.0017  Score=36.52  Aligned_cols=29  Identities=59%  Similarity=1.300  Sum_probs=24.7

Q ss_pred             CCC-CCCCCCCeeeeCCCCeEeeCCCCCccC
Q psy2857          22 CQS-NPCGVNATCIDTQGSYSCVCKEHYTGD   51 (332)
Q Consensus        22 C~~-~~C~~~g~C~~~~g~~~C~C~~G~~g~   51 (332)
                      |.. .+|..+ .|++..++|.|.|++||.|.
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~   31 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGD   31 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence            445 578877 99999999999999999983


No 37 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.47  E-value=0.0028  Score=35.71  Aligned_cols=24  Identities=33%  Similarity=0.755  Sum_probs=19.3

Q ss_pred             CCeeeecCCceeeecCCCcccCCC
Q psy2857         253 NATCIDTQGSYSCVCKEHYTGDPY  276 (332)
Q Consensus       253 ~~~C~~~~~~~~C~C~~G~~g~~~  276 (332)
                      ...|++.+++|+|.|++||++...
T Consensus         9 ~h~C~~~~g~~~C~C~~Gy~L~~D   32 (36)
T PF14670_consen    9 SHICVNTPGSYRCSCPPGYKLAED   32 (36)
T ss_dssp             SSEEEEETTSEEEE-STTEEE-TT
T ss_pred             CCCCccCCCceEeECCCCCEECcC
Confidence            468999999999999999998743


No 38 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=96.16  E-value=0.0063  Score=50.78  Aligned_cols=41  Identities=32%  Similarity=0.691  Sum_probs=35.4

Q ss_pred             cccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCCCC
Q psy2857         278 ACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAK  320 (332)
Q Consensus       278 ~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~  320 (332)
                      .|.++++|....++|.  ..|+++.|+|.|.|+.||++....+
T Consensus       183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~~~~  223 (224)
T cd01475         183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLEDNK  223 (224)
T ss_pred             cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCCCCC
Confidence            5888999988777886  5899999999999999999887654


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.10  E-value=0.013  Score=32.08  Aligned_cols=26  Identities=27%  Similarity=0.781  Sum_probs=21.7

Q ss_pred             CCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857         290 KPCGLRAICENTVPGFNCLCPKGYSGKP  317 (332)
Q Consensus       290 ~~C~~~~~C~~~~g~~~C~C~~g~~g~~  317 (332)
                      ..|.++|+|+...+  +|+|.+||+|..
T Consensus         6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~   31 (32)
T PF07974_consen    6 NICSGHGTCVSPCG--RCVCDSGYTGPD   31 (32)
T ss_pred             CccCCCCEEeCCCC--EEECCCCCcCCC
Confidence            46888999987644  999999999974


No 40 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.09  E-value=0.0039  Score=26.53  Aligned_cols=13  Identities=31%  Similarity=0.810  Sum_probs=10.0

Q ss_pred             eeecCCCcccCCC
Q psy2857         264 SCVCKEHYTGDPY  276 (332)
Q Consensus       264 ~C~C~~G~~g~~~  276 (332)
                      +|+|++||+|..+
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5899999999753


No 41 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.99  E-value=0.0091  Score=32.68  Aligned_cols=26  Identities=42%  Similarity=0.953  Sum_probs=21.5

Q ss_pred             CCCCCCCeeeecCCceeeecCCCcccCC
Q psy2857         248 NPCGVNATCIDTQGSYSCVCKEHYTGDP  275 (332)
Q Consensus       248 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~  275 (332)
                      ..|..+++|+...+  +|+|.+||+|..
T Consensus         6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~   31 (32)
T PF07974_consen    6 NICSGHGTCVSPCG--RCVCDSGYTGPD   31 (32)
T ss_pred             CccCCCCEEeCCCC--EEECCCCCcCCC
Confidence            35889999997644  999999999975


No 42 
>KOG1836|consensus
Probab=94.33  E-value=0.79  Score=48.93  Aligned_cols=51  Identities=29%  Similarity=0.619  Sum_probs=37.7

Q ss_pred             EecCCceeccc--C-CcCCCCCCCCCCeeeeCC--CCeEee-CCCCCccCCCCCCCC
Q psy2857           8 RILLGVRAIVD--I-NECQSNPCGVNATCIDTQ--GSYSCV-CKEHYTGDPYQACSD   58 (332)
Q Consensus         8 ~c~~g~~~~~~--~-~~C~~~~C~~~g~C~~~~--g~~~C~-C~~G~~g~~~~~C~~   58 (332)
                      +|.+||+|+.+  . +.|.+=+|...+.|..+.  ..+.|. |++||+|..++.|.+
T Consensus       760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d  816 (1705)
T KOG1836|consen  760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD  816 (1705)
T ss_pred             hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence            47899999922  2 227777788888888774  456797 999999987776654


No 43 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=94.10  E-value=0.043  Score=30.80  Aligned_cols=31  Identities=32%  Similarity=0.641  Sum_probs=21.9

Q ss_pred             cCCCCCCCCCeeeecC-CceeeecCCCcccCC
Q psy2857         245 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGDP  275 (332)
Q Consensus       245 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~  275 (332)
                      |...+|..++.|++.. |++.|.|..||....
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence            4556788899999887 899999999998653


No 44 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.06  E-value=0.053  Score=30.42  Aligned_cols=30  Identities=33%  Similarity=0.680  Sum_probs=21.6

Q ss_pred             CCCCCCCCCCeeeeCC-CCeEeeCCCCCccC
Q psy2857          22 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGD   51 (332)
Q Consensus        22 C~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~   51 (332)
                      |....|..|+.|++.. |++.|.|..||...
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            4556788999999986 99999999999864


No 45 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.02  E-value=0.099  Score=43.59  Aligned_cols=38  Identities=32%  Similarity=0.493  Sum_probs=29.6

Q ss_pred             cccccCccCCCCCCCCCeeeecCCceeeecCCCcccCC
Q psy2857         238 NRVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDP  275 (332)
Q Consensus       238 ~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~  275 (332)
                      .|.+.++|...+......|.++.|+|.|.|++||++.+
T Consensus       183 ~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         183 ICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLE  220 (224)
T ss_pred             cCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCC
Confidence            45677888654443456899999999999999998754


No 46 
>KOG1218|consensus
Probab=91.19  E-value=9.7  Score=33.30  Aligned_cols=97  Identities=29%  Similarity=0.572  Sum_probs=46.7

Q ss_pred             CCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeeeCCCCCCCCCCCccceeeccCCCCCCCCccccC
Q psy2857          38 GSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAKVACEQVDVTSECSSNFECVN  117 (332)
Q Consensus        38 g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~  117 (332)
                      .+..|.|.++|+|. ..... .....    .+..  .+.......+|.+..+|.+..+.. .+........|.....|..
T Consensus        13 ~~~~c~c~~~~~g~-~~~~~-~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~c~~-~~~~~~~~~~c~~~~~c~~   83 (316)
T KOG1218|consen   13 GSGQCFCDPGYTGR-LQCEH-QAVTS----ACSG--ICPCEVNSGECGLGYGFVGSVCRI-ECVCGNAGGGCSQPCRCKN   83 (316)
T ss_pred             CCCceecCCCcccc-ccccC-CCCCc----cccc--cCCccCCceeEecccccCCCcccc-ccccCCCCCcccCccccCC
Confidence            35579999999985 11111 11111    1111  111122234778888888775433 2222222334444445555


Q ss_pred             CCcccCceecC-CCCccCCCCeeecCCCC
Q psy2857         118 NAECVDGLCYC-RPGFDARGSVCVDVDEC  145 (332)
Q Consensus       118 ~~~c~~~~c~C-~~g~~~~g~~c~~~~~C  145 (332)
                      ..........+ ..+|.  +..|....++
T Consensus        84 ~~~~~~~~~~~~~~~~~--g~~C~~~~~~  110 (316)
T KOG1218|consen   84 GGTCVSSTGYCHLNGYE--GPQCESPCPC  110 (316)
T ss_pred             CCcccCCCCcccCCCCC--cccccCCCCc
Confidence            55555544455 46665  5666544443


No 47 
>smart00051 DSL delta serrate ligand.
Probab=90.94  E-value=0.5  Score=30.49  Aligned_cols=47  Identities=21%  Similarity=0.415  Sum_probs=30.6

Q ss_pred             ceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857         262 SYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP  317 (332)
Q Consensus       262 ~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~  317 (332)
                      .++-.|+++|.|..|.     ..|.. .+....+..|.. .|  .++|++||+|..
T Consensus        16 ~~rv~C~~~~yG~~C~-----~~C~~-~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~   62 (63)
T smart00051       16 QIRVTCDENYYGEGCN-----KFCRP-RDDFFGHYTCDE-NG--NKGCLEGWMGPY   62 (63)
T ss_pred             EEEeeCCCCCcCCccC-----CEeCc-CccccCCccCCc-CC--CEecCCCCcCCC
Confidence            3466899999999874     22322 123444567743 34  789999999864


No 48 
>smart00051 DSL delta serrate ligand.
Probab=90.06  E-value=0.56  Score=30.23  Aligned_cols=46  Identities=15%  Similarity=0.042  Sum_probs=31.5

Q ss_pred             EEEEEecCCceecccCCcCCCCC-CCCCCeeeeCCCCeEeeCCCCCccCC
Q psy2857           4 VVLVRILLGVRAIVDINECQSNP-CGVNATCIDTQGSYSCVCKEHYTGDP   52 (332)
Q Consensus         4 ~~~c~c~~g~~~~~~~~~C~~~~-C~~~g~C~~~~g~~~C~C~~G~~g~~   52 (332)
                      ...-.|.++|.|......|.+.. ...+..|....   .++|.+||+|..
T Consensus        16 ~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~~G---~~~C~~Gw~G~~   62 (63)
T smart00051       16 QIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDENG---NKGCLEGWMGPY   62 (63)
T ss_pred             EEEeeCCCCCcCCccCCEeCcCccccCCccCCcCC---CEecCCCCcCCC
Confidence            44567789999986556665432 45677775432   688999999864


No 49 
>KOG1218|consensus
Probab=90.04  E-value=6.2  Score=34.55  Aligned_cols=40  Identities=38%  Similarity=1.000  Sum_probs=23.9

Q ss_pred             ccCceecCCCCccCCCCeeecCCC-CCCCCCCCCCCceecCCC
Q psy2857         121 CVDGLCYCRPGFDARGSVCVDVDE-CQLGDPCGPQAQCTNTPG  162 (332)
Q Consensus       121 c~~~~c~C~~g~~~~g~~c~~~~~-C~~~~~C~~~~~C~~~~~  162 (332)
                      .....|.|.+||.  +..+..... |.....+.+++.|....+
T Consensus       159 ~~~~~c~c~~g~~--g~~~~~~~~~c~~~~~~~~g~~C~~~~~  199 (316)
T KOG1218|consen  159 CKNGICTCQPGFV--GVFCVESCSGCSPLTACENGAKCNRSTG  199 (316)
T ss_pred             CCCCceeccCCcc--cccccccCCCcCCCcccCCCCeeecccc
Confidence            3455788999998  555543322 444455666667765543


No 50 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=83.56  E-value=1.5  Score=26.84  Aligned_cols=27  Identities=37%  Similarity=1.046  Sum_probs=16.3

Q ss_pred             CCCCccccCCCcccCceecCCCCccCC
Q psy2857         109 CSSNFECVNNAECVDGLCYCRPGFDAR  135 (332)
Q Consensus       109 c~~~~~C~~~~~c~~~~c~C~~g~~~~  135 (332)
                      |.....|..++.|+.++|.|++||...
T Consensus        22 C~~~~qC~~~s~C~~g~C~C~~g~~~~   48 (52)
T PF01683_consen   22 CESDEQCIGGSVCVNGRCQCPPGYVEV   48 (52)
T ss_pred             CCCcCCCCCcCEEcCCEeECCCCCEec
Confidence            333345556666667777777777543


No 51 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. Consensus pattern of the LE domain, which contains four disulphide bonds: xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss ('C': conserved cysteine involved in a disulphide bond; 'a': conserved aromatic residue; 'G': conserved glycine, lower case = less conserved; 's': region similar to the EGF-like domain). In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=83.32  E-value=1.1  Score=27.12  Aligned_cols=24  Identities=42%  Similarity=0.873  Sum_probs=16.8

Q ss_pred             eeeecCCceeeecCCCcccCCCCccc
Q psy2857         255 TCIDTQGSYSCVCKEHYTGDPYQACS  280 (332)
Q Consensus       255 ~C~~~~~~~~C~C~~G~~g~~~~~C~  280 (332)
                      .|....+  +|.|+++|+|..++.|.
T Consensus        12 ~C~~~~G--~C~C~~~~~G~~C~~C~   35 (49)
T PF00053_consen   12 TCDPSTG--QCVCKPGTTGPRCDQCK   35 (49)
T ss_dssp             SEEETCE--EESBSTTEESTTS-EE-
T ss_pred             cccCCCC--EEeccccccCCcCcCCC
Confidence            5655444  89999999999876544


No 52 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=81.26  E-value=1.8  Score=31.42  Aligned_cols=33  Identities=30%  Similarity=0.719  Sum_probs=25.0

Q ss_pred             cccccCCCCCCCCCCeeecCCCceeeeCCCCCCCC
Q psy2857         282 IDECKALDKPCGLRAICENTVPGFNCLCPKGYSGK  316 (332)
Q Consensus       282 ~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~  316 (332)
                      .|.|+.. ..|+..+.|.. .....|.|++||...
T Consensus        77 ~d~Cd~y-~~CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   77 KDQCDVY-GFCGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             ccCCCCc-cccCCccEeCC-CCCCceECCCCcCCC
Confidence            4677764 58999999953 345579999999865


No 53 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Probab=80.12  E-value=2.2  Score=25.94  Aligned_cols=22  Identities=36%  Similarity=0.833  Sum_probs=15.3

Q ss_pred             eeecCCceeeecCCCcccCCCCcc
Q psy2857         256 CIDTQGSYSCVCKEHYTGDPYQAC  279 (332)
Q Consensus       256 C~~~~~~~~C~C~~G~~g~~~~~C  279 (332)
                      |....|  +|.|+++|+|..++.|
T Consensus        14 C~~~~G--~C~C~~~~~G~~C~~C   35 (50)
T cd00055          14 CDPGTG--QCECKPNTTGRRCDRC   35 (50)
T ss_pred             ccCCCC--EEeCCCcCCCCCCCCC
Confidence            544444  8999999998876533


No 54 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=77.98  E-value=4.5  Score=24.71  Aligned_cols=24  Identities=38%  Similarity=0.891  Sum_probs=17.1

Q ss_pred             CCCCCCCCceecCCCceEeeCCCCCccC
Q psy2857         148 GDPCGPQAQCTNTPGSFRCDCVEGYVGA  175 (332)
Q Consensus       148 ~~~C~~~~~C~~~~~~~~C~C~~G~~g~  175 (332)
                      ...|..++.|++.    .|.|++||...
T Consensus        25 ~~qC~~~s~C~~g----~C~C~~g~~~~   48 (52)
T PF01683_consen   25 DEQCIGGSVCVNG----RCQCPPGYVEV   48 (52)
T ss_pred             cCCCCCcCEEcCC----EeECCCCCEec
Confidence            3455567788653    89999998753


No 55 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Probab=76.19  E-value=4.8  Score=24.41  Aligned_cols=22  Identities=23%  Similarity=0.608  Sum_probs=14.9

Q ss_pred             eeeCCCCCCCCCCCCccccccccC
Q psy2857         306 NCLCPKGYSGKPDAKVACEQEKAG  329 (332)
Q Consensus       306 ~C~C~~g~~g~~~~~~~c~~~~~~  329 (332)
                      +|.|+++|.|.....  |.+.-.+
T Consensus        20 ~C~C~~~~~G~~C~~--C~~g~~~   41 (50)
T cd00055          20 QCECKPNTTGRRCDR--CAPGYYG   41 (50)
T ss_pred             EEeCCCcCCCCCCCC--CCCCCcc
Confidence            788888888887764  6554433


No 56 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=73.59  E-value=4.1  Score=30.00  Aligned_cols=36  Identities=31%  Similarity=0.659  Sum_probs=25.1

Q ss_pred             cCCcCCCCC---CCCCCeeeeCC--CCeEeeCCCCCccCCCC
Q psy2857          18 DINECQSNP---CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ   54 (332)
Q Consensus        18 ~~~~C~~~~---C~~~g~C~~~~--g~~~C~C~~G~~g~~~~   54 (332)
                      ++.+|.+.-   |- ||.|.-..  ..++|.|..||+|..|+
T Consensus        41 ~i~~Cp~ey~~YCl-HG~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         41 AIRLCGPEGDGYCL-HGDCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             ccccCChhhCCEeE-CCEEEeeccCCCceeECCCCccccccc
Confidence            345554432   54 46888764  67899999999997664


No 57 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=73.43  E-value=3.9  Score=24.38  Aligned_cols=21  Identities=38%  Similarity=0.770  Sum_probs=14.6

Q ss_pred             eeeecCCceeeecCCCcccCCCC
Q psy2857         255 TCIDTQGSYSCVCKEHYTGDPYQ  277 (332)
Q Consensus       255 ~C~~~~~~~~C~C~~G~~g~~~~  277 (332)
                      .|....|  +|.|+++|+|..++
T Consensus        12 ~C~~~~G--~C~C~~~~~G~~C~   32 (46)
T smart00180       12 TCDPDTG--QCECKPNVTGRRCD   32 (46)
T ss_pred             cccCCCC--EEECCCCCCCCCCC
Confidence            3444344  89999999987654


No 58 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=71.32  E-value=4.8  Score=29.19  Aligned_cols=34  Identities=29%  Similarity=0.800  Sum_probs=25.8

Q ss_pred             cCCCCCCCCCCCCCCceecCCCceEeeCCCCCccC
Q psy2857         141 DVDECQLGDPCGPQAQCTNTPGSFRCDCVEGYVGA  175 (332)
Q Consensus       141 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~G~~g~  175 (332)
                      ..+.|.....|+..+.|.. .....|.|.+||...
T Consensus        76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence            3467776788999999954 345579999999753


No 59 
>PHA02887 EGF-like protein; Provisional
Probab=70.41  E-value=4.2  Score=29.43  Aligned_cols=27  Identities=30%  Similarity=0.736  Sum_probs=20.6

Q ss_pred             CCCCCeeeecC--CceeeecCCCcccCCCC
Q psy2857         250 CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ  277 (332)
Q Consensus       250 C~~~~~C~~~~--~~~~C~C~~G~~g~~~~  277 (332)
                      |- +|+|.-..  ..+.|.|++||+|..|+
T Consensus        94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         94 CI-NGECMNIIDLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             ee-CCEEEccccCCCceeECCCCcccCCCC
Confidence            44 57886544  46899999999999775


No 60 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=70.26  E-value=2.3  Score=27.38  Aligned_cols=46  Identities=26%  Similarity=0.524  Sum_probs=17.8

Q ss_pred             ceeeecCCCcccCCCC-cccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857         262 SYSCVCKEHYTGDPYQ-ACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP  317 (332)
Q Consensus       262 ~~~C~C~~G~~g~~~~-~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~  317 (332)
                      .++-.|.+.|.|..|. .|...|.-       ..+-.|. ..|  .=+|.+||+|..
T Consensus        16 ~~rv~C~~nyyG~~C~~~C~~~~d~-------~ghy~Cd-~~G--~~~C~~Gw~G~~   62 (63)
T PF01414_consen   16 RIRVVCDENYYGPNCSKFCKPRDDS-------FGHYTCD-SNG--NKVCLPGWTGPN   62 (63)
T ss_dssp             -------TTEETTTT-EE---EEET-------TEEEEE--SS----EEE-TTEESTT
T ss_pred             EEEEECCCCCCCccccCCcCCCcCC-------cCCcccC-CCC--CCCCCCCCcCCC
Confidence            4577899999999875 23322210       1122343 233  335889998864


No 61 
>PHA02887 EGF-like protein; Provisional
Probab=69.64  E-value=5.4  Score=28.88  Aligned_cols=27  Identities=30%  Similarity=0.736  Sum_probs=21.3

Q ss_pred             CCCCCeeeeCC--CCeEeeCCCCCccCCCC
Q psy2857          27 CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ   54 (332)
Q Consensus        27 C~~~g~C~~~~--g~~~C~C~~G~~g~~~~   54 (332)
                      |- ||+|.-..  ..++|.|.+||+|..|+
T Consensus        94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         94 CI-NGECMNIIDLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             ee-CCEEEccccCCCceeECCCCcccCCCC
Confidence            65 68898764  46899999999997653


No 62 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=68.98  E-value=3.9  Score=22.46  Aligned_cols=13  Identities=38%  Similarity=0.715  Sum_probs=11.1

Q ss_pred             eeecCCCcccCCC
Q psy2857         264 SCVCKEHYTGDPY  276 (332)
Q Consensus       264 ~C~C~~G~~g~~~  276 (332)
                      +|.|++||.++..
T Consensus        19 ~C~CPeGyIlde~   31 (34)
T PF09064_consen   19 QCFCPEGYILDEG   31 (34)
T ss_pred             ceeCCCceEecCC
Confidence            8999999988743


No 63 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=65.91  E-value=5.5  Score=29.38  Aligned_cols=27  Identities=33%  Similarity=0.698  Sum_probs=20.5

Q ss_pred             CCCCCeeeecC--CceeeecCCCcccCCCC
Q psy2857         250 CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ  277 (332)
Q Consensus       250 C~~~~~C~~~~--~~~~C~C~~G~~g~~~~  277 (332)
                      |.+ |.|.-..  ..+.|.|..||+|.+|+
T Consensus        53 ClH-G~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         53 CLH-GDCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             eEC-CEEEeeccCCCceeECCCCccccccc
Confidence            444 4786544  57899999999999775


No 64 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=64.70  E-value=6.8  Score=27.95  Aligned_cols=35  Identities=20%  Similarity=0.465  Sum_probs=27.1

Q ss_pred             cccccCCCCCCCCCCeeecCC-----CceeeeCCCCCCCC
Q psy2857         282 IDECKALDKPCGLRAICENTV-----PGFNCLCPKGYSGK  316 (332)
Q Consensus       282 ~d~C~~~~~~C~~~~~C~~~~-----g~~~C~C~~g~~g~  316 (332)
                      .+.|...++.|..+|.|++..     .=|.|.|.+.+...
T Consensus         5 ~~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~   44 (103)
T PF12955_consen    5 NDACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT   44 (103)
T ss_pred             HHHHHHhccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence            466777778999999999773     33899999876654


No 65 
>KOG3516|consensus
Probab=63.38  E-value=6.5  Score=40.12  Aligned_cols=36  Identities=25%  Similarity=0.768  Sum_probs=32.3

Q ss_pred             CCcCCCCCCCCCCeeeeCCCCeEeeCC-CCCccCCCC
Q psy2857          19 INECQSNPCGVNATCIDTQGSYSCVCK-EHYTGDPYQ   54 (332)
Q Consensus        19 ~~~C~~~~C~~~g~C~~~~g~~~C~C~-~G~~g~~~~   54 (332)
                      ++.|.+++|...|.|......|.|.|. .||.|..|.
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH  581 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCH  581 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEecccccccccccc
Confidence            578999999999999998899999997 899997664


No 66 
>KOG3512|consensus
Probab=61.94  E-value=23  Score=32.57  Aligned_cols=25  Identities=44%  Similarity=0.842  Sum_probs=18.9

Q ss_pred             CCeeeecCCceeeecCCCcccCCCCcc
Q psy2857         253 NATCIDTQGSYSCVCKEHYTGDPYQAC  279 (332)
Q Consensus       253 ~~~C~~~~~~~~C~C~~G~~g~~~~~C  279 (332)
                      +.+|..+.|  +|.|++|-+|..|..|
T Consensus       406 gktCNq~tG--qCpCkeGvtG~tCnrC  430 (592)
T KOG3512|consen  406 GKTCNQTTG--QCPCKEGVTGLTCNRC  430 (592)
T ss_pred             cccccccCC--cccCCCCCcccccccc
Confidence            446776766  8999999999876544


No 67 
>KOG3516|consensus
Probab=55.39  E-value=9.9  Score=38.91  Aligned_cols=39  Identities=31%  Similarity=0.776  Sum_probs=34.1

Q ss_pred             cccccccccCCCCCCCCCCeeecCCCceeeeCC-CCCCCCCC
Q psy2857         278 ACSDIDECKALDKPCGLRAICENTVPGFNCLCP-KGYSGKPD  318 (332)
Q Consensus       278 ~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~-~g~~g~~~  318 (332)
                      +|..+|.|..  ++|.+++.|.-....|.|.|. .||.|..+
T Consensus       541 ~C~i~drClP--N~CehgG~C~Qs~~~f~C~C~~TGY~GatC  580 (1306)
T KOG3516|consen  541 MCGISDRCLP--NPCEHGGKCSQSWDDFECNCELTGYKGATC  580 (1306)
T ss_pred             ccccccccCC--ccccCCCcccccccceeEeccccccccccc
Confidence            5778888974  799999999988889999999 89999865


No 68 
>KOG3514|consensus
Probab=53.44  E-value=9.7  Score=38.54  Aligned_cols=34  Identities=26%  Similarity=0.741  Sum_probs=30.1

Q ss_pred             cCCCCCCCCCCeeeeCCCCeEeeC-CCCCccCCCC
Q psy2857          21 ECQSNPCGVNATCIDTQGSYSCVC-KEHYTGDPYQ   54 (332)
Q Consensus        21 ~C~~~~C~~~g~C~~~~g~~~C~C-~~G~~g~~~~   54 (332)
                      .|.++||.++|.|...+.+|.|.| ..||.|..|+
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce  659 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE  659 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence            699999999999999999999999 5689987663


No 69 
>KOG3509|consensus
Probab=47.55  E-value=36  Score=34.66  Aligned_cols=71  Identities=25%  Similarity=0.564  Sum_probs=49.1

Q ss_pred             CccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857         243 NECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP  317 (332)
Q Consensus       243 ~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~  317 (332)
                      +.|+..++...+.|-......+|.|++||+|..+..|...  +...++-+. .++|....+.+...|.+| .|..
T Consensus       407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~--~~~~~~g~y-~~t~~~~~~~~~~~c~pg-~g~~  477 (964)
T KOG3509|consen  407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNG--CDRSPNGSY-LGTCVPIQGKRCEYCGPG-AGAP  477 (964)
T ss_pred             CccccccCCCCccccccccccceeccccccCchhhccCcc--ccccCCccc-cceEeccCCCcceeecCC-CCCc
Confidence            4566667777777777888889999999999988755443  333333332 467777766677888888 5554


No 70 
>KOG3514|consensus
Probab=36.33  E-value=24  Score=36.00  Aligned_cols=34  Identities=26%  Similarity=0.753  Sum_probs=29.4

Q ss_pred             ccCCCCCCCCCeeeecCCceeeecC-CCcccCCCC
Q psy2857         244 ECQSNPCGVNATCIDTQGSYSCVCK-EHYTGDPYQ  277 (332)
Q Consensus       244 ~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~~~  277 (332)
                      .|.++||.+++.|....+.|.|.|. .||.|..|+
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce  659 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE  659 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence            5888999999999999999999996 578777553


No 71 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm), Acp62F protein from Drosophila melanogaster, Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad), Bombyx subtilisin inhibitor from Bombyx mori (silk moth), and von Willebrand factor; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=33.51  E-value=20  Score=22.00  Aligned_cols=21  Identities=24%  Similarity=0.597  Sum_probs=14.0

Q ss_pred             eeecCCCcccCCCCccccccc
Q psy2857         264 SCVCKEHYTGDPYQACSDIDE  284 (332)
Q Consensus       264 ~C~C~~G~~g~~~~~C~~~d~  284 (332)
                      -|.|++||..+....|...++
T Consensus        34 gC~C~~G~v~~~~~~CV~~~~   54 (55)
T PF01826_consen   34 GCFCPPGYVRNDNGRCVPPSE   54 (55)
T ss_dssp             EEEETTTEEEETTSEEEEGGG
T ss_pred             cCCCCCCeeEcCCCCEEcHHH
Confidence            499999998765444544443


No 72 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=32.59  E-value=23  Score=21.88  Aligned_cols=30  Identities=23%  Similarity=0.488  Sum_probs=16.8

Q ss_pred             CCCCCCCeeee----CCCCeEeeCCCCCccCCCC
Q psy2857          25 NPCGVNATCID----TQGSYSCVCKEHYTGDPYQ   54 (332)
Q Consensus        25 ~~C~~~g~C~~----~~g~~~C~C~~G~~g~~~~   54 (332)
                      .+|+.||....    ..|...|.|..-|.|.++.
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS   50 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS   50 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence            46888887663    2466889999999998764


No 73 
>KOG3512|consensus
Probab=29.81  E-value=69  Score=29.60  Aligned_cols=59  Identities=31%  Similarity=0.636  Sum_probs=31.4

Q ss_pred             cCCceee-ecCCCcccCCCCcccccccccCC-CCCCC-CCCeeecCCCceeeeCCCCCCCCCCCC
Q psy2857         259 TQGSYSC-VCKEHYTGDPYQACSDIDECKAL-DKPCG-LRAICENTVPGFNCLCPKGYSGKPDAK  320 (332)
Q Consensus       259 ~~~~~~C-~C~~G~~g~~~~~C~~~d~C~~~-~~~C~-~~~~C~~~~g~~~C~C~~g~~g~~~~~  320 (332)
                      +.|. .| .|++||.-+....=.+-..|..- -|+-+ .+.+|..+.|  +|.|.+|.+|..+..
T Consensus       368 TaGr-hChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnr  429 (592)
T KOG3512|consen  368 TAGR-HCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNR  429 (592)
T ss_pred             CCCc-ccccccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCccccccc
Confidence            4443 35 69999987654211122223210 01211 2456654555  788888888886654


No 74 
>PF05092 PIF:  Per os infectivity;  InterPro: IPR007784 This entry represents a group of dsDNA Baculovirus proteins. It is required for the infectivity of the OBs or occlusion bodies. It is a structural protein of the ODV envelope required only in the first steps of per os larva infection, as viruses being produced in cells expressing the gene for this protein but not containing it in their genomes are able to produce successful infections. Baculoviruses are large DNA viruses that infect arthropods, mainly members of the order Lepidoptera. In their life cycle, they produce two kinds of particles, a budded, non-occluded virus (BV), which buds out of the infected cell and is responsible for the cell-to-cell transmission of the virus, and an occluded form, the occlusion body (OB), which is responsible for protecting the virus between encounters with larvae. A variable number of virions are included in the para-crystalline structure of the OB, mainly constituted by the virus-encoded polyhedrin protein; these virions are called occlusion body-derived virions or ODVs [].
Probab=25.47  E-value=1.5e+02  Score=28.06  Aligned_cols=49  Identities=24%  Similarity=0.582  Sum_probs=36.5

Q ss_pred             eEEEEEe-cCCceecccC-CcCCCCC-CCCCCeeeeCC-CCeEeeCCCCCccC
Q psy2857           3 FVVLVRI-LLGVRAIVDI-NECQSNP-CGVNATCIDTQ-GSYSCVCKEHYTGD   51 (332)
Q Consensus         3 ~~~~c~c-~~g~~~~~~~-~~C~~~~-C~~~g~C~~~~-g~~~C~C~~G~~g~   51 (332)
                      +..+|.| .||+.+...+ +.|...- |.+||.-.+.. ....|.|..||..+
T Consensus       130 fsLlCsC~~PGlVtqlniy~DC~vpVGC~PhG~I~din~~pi~C~Cd~GyVsd  182 (522)
T PF05092_consen  130 FSLLCSCLRPGLVTQLNIYEDCDVPVGCQPHGRIADINESPIRCVCDDGYVSD  182 (522)
T ss_pred             eEEEEEcCCCCeEeeeehhccCCCcEecCCCCEEeeecCCceEeECCCCcccc
Confidence            5578888 7888888654 4454432 88899988874 46789999999765


No 75 
>KOG0196|consensus
Probab=23.99  E-value=1.1e+02  Score=30.59  Aligned_cols=56  Identities=29%  Similarity=0.688  Sum_probs=33.1

Q ss_pred             eeecCCCccc----CCCCccc--------ccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCC--CCcccc
Q psy2857         264 SCVCKEHYTG----DPYQACS--------DIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPD--AKVACE  324 (332)
Q Consensus       264 ~C~C~~G~~g----~~~~~C~--------~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~--~~~~c~  324 (332)
                      .|.|.+||.-    ..|..|.        ....|    .+|+.+.. ....++..|.|..||...+.  ..+.|.
T Consensus       260 ~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C----~~CP~~S~-s~~ega~~C~C~~gyyRA~~Dp~~mpCT  329 (996)
T KOG0196|consen  260 GCVCKAGYEEAENGKACQACPPGTYKASQGDSLC----LPCPPNSH-SSSEGATSCTCENGYYRADSDPPSMPCT  329 (996)
T ss_pred             ceeecCCCCcccCCCcceeCCCCcccCCCCCCCC----CCCCCCCC-CCCCCCCcccccCCcccCCCCCCCCCCC
Confidence            6889999864    2222231        12223    25665543 24567789999999987754  234554


No 76 
>KOG0196|consensus
Probab=20.26  E-value=2.2e+02  Score=28.73  Aligned_cols=17  Identities=35%  Similarity=0.829  Sum_probs=12.1

Q ss_pred             CCCceEeeCCCCCccCC
Q psy2857         160 TPGSFRCDCVEGYVGAP  176 (332)
Q Consensus       160 ~~~~~~C~C~~G~~g~~  176 (332)
                      ..+.-.|.|..||.-++
T Consensus       304 ~ega~~C~C~~gyyRA~  320 (996)
T KOG0196|consen  304 SEGATSCTCENGYYRAD  320 (996)
T ss_pred             CCCCCcccccCCcccCC
Confidence            34666888999987643


Done!