Query         psy9424
Match_columns 535
No_of_seqs    287 out of 2382
Neff          8.6 
Searched_HMMs 46136
Date          Fri Aug 16 18:59:56 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9424.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9424hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1217|consensus               99.4 1.7E-11 3.7E-16  131.2  24.0  275   41-483    97-389 (487)
  2 KOG4289|consensus               99.4 8.6E-12 1.9E-16  137.1  17.7  110  144-315  1180-1316(2531)
  3 KOG1214|consensus               99.4 6.2E-12 1.3E-16  132.5  15.3  123  143-315   734-862 (1289)
  4 KOG1214|consensus               99.3 1.1E-11 2.3E-16  130.8  12.1  136   10-176   713-860 (1289)
  5 KOG4289|consensus               99.3 2.8E-11 6.1E-16  133.2  13.5   85  257-345  1218-1303(2531)
  6 KOG1217|consensus               99.2 2.5E-10 5.4E-15  122.2  18.8  246   12-345   151-418 (487)
  7 KOG1219|consensus               99.1 1.4E-10   3E-15  132.2   8.3  107  245-402  3869-3976(4289)
  8 KOG1219|consensus               99.1 1.6E-10 3.4E-15  131.8   8.1  110  144-315  3865-3977(4289)
  9 KOG1225|consensus               99.0 3.4E-09 7.4E-14  111.0  12.2  105   14-178   235-341 (525)
 10 KOG1225|consensus               98.9 1.7E-08 3.6E-13  105.9  13.9  121  130-340   267-387 (525)
 11 KOG4260|consensus               98.6 5.7E-08 1.2E-12   91.0   5.4   71  219-310   231-304 (350)
 12 KOG4260|consensus               98.4 4.4E-07 9.6E-12   85.1   5.7  137  246-400   150-306 (350)
 13 KOG0994|consensus               98.3 2.8E-05 6.2E-10   85.8  18.3  106  165-315   831-948 (1758)
 14 KOG0994|consensus               98.0 0.00018 3.9E-09   79.7  15.2   26  381-408  1078-1103(1758)
 15 KOG1226|consensus               97.8 0.00025 5.4E-09   76.4  12.1   62  246-316   555-621 (783)
 16 PF07645 EGF_CA:  Calcium-bindi  97.7   2E-05 4.3E-10   54.1   2.0   33   32-64      2-35  (42)
 17 PF00008 EGF:  EGF-like domain   97.7 1.4E-05 3.1E-10   51.2   1.2   30  454-483     1-31  (32)
 18 smart00179 EGF_CA Calcium-bind  97.6 9.6E-05 2.1E-09   49.6   3.9   35  451-485     2-38  (39)
 19 PF07645 EGF_CA:  Calcium-bindi  97.6 3.7E-05 8.1E-10   52.8   1.7   31  451-481     2-34  (42)
 20 PF00008 EGF:  EGF-like domain   97.5   5E-05 1.1E-09   48.7   1.3   28   38-65      3-31  (32)
 21 KOG1226|consensus               97.4  0.0015 3.3E-08   70.5  12.0   60  246-315   514-580 (783)
 22 smart00179 EGF_CA Calcium-bind  97.3 0.00037   8E-09   46.7   3.8   35  143-177     2-38  (39)
 23 cd00054 EGF_CA Calcium-binding  97.2  0.0005 1.1E-08   45.5   3.8   35  451-485     2-37  (38)
 24 PF12947 EGF_3:  EGF domain;  I  97.0 0.00026 5.7E-09   46.5   1.0   29  373-401     5-33  (36)
 25 KOG1836|consensus               97.0   0.048   1E-06   65.7  20.2   49  131-179   760-813 (1705)
 26 PF12947 EGF_3:  EGF domain;  I  97.0  0.0003 6.6E-09   46.3   1.0   29   38-66      5-33  (36)
 27 cd00054 EGF_CA Calcium-binding  96.9  0.0014   3E-08   43.3   3.6   35  143-177     2-37  (38)
 28 cd00053 EGF Epidermal growth f  96.8  0.0021 4.5E-08   41.7   3.8   30  456-485     5-35  (36)
 29 smart00181 EGF Epidermal growt  96.7  0.0024 5.1E-08   41.6   3.7   28  457-485     6-34  (35)
 30 PF12662 cEGF:  Complement Clr-  96.6  0.0021 4.6E-08   37.9   2.6   24  300-323     1-24  (24)
 31 cd00053 EGF Epidermal growth f  96.3   0.005 1.1E-07   39.9   3.6   28  148-175     5-32  (36)
 32 PF06247 Plasmod_Pvs28:  Plasmo  96.2 0.00096 2.1E-08   60.1  -0.7  106  149-315    50-165 (197)
 33 PF12662 cEGF:  Complement Clr-  96.2  0.0045 9.8E-08   36.5   2.4   24   53-87      1-24  (24)
 34 PF06247 Plasmod_Pvs28:  Plasmo  96.1 0.00087 1.9E-08   60.3  -1.4  146  154-401    10-163 (197)
 35 smart00181 EGF Epidermal growt  96.1  0.0079 1.7E-07   39.1   3.4   28  149-177     6-34  (35)
 36 PF07974 EGF_2:  EGF-like domai  95.9   0.009 1.9E-07   38.1   2.9   26   39-66      6-31  (32)
 37 PF07974 EGF_2:  EGF-like domai  95.8   0.012 2.5E-07   37.6   3.1   27  457-485     6-32  (32)
 38 PF12661 hEGF:  Human growth fa  94.1    0.02 4.3E-07   28.6   0.5   13  473-485     1-13  (13)
 39 smart00051 DSL delta serrate l  94.1   0.064 1.4E-06   40.2   3.5   45   14-66     18-62  (63)
 40 PF14670 FXa_inhibition:  Coagu  92.2   0.083 1.8E-06   34.7   1.4   21  293-313    11-31  (36)
 41 PF14670 FXa_inhibition:  Coagu  92.1    0.08 1.7E-06   34.8   1.3   25   39-65      6-30  (36)
 42 KOG1836|consensus               90.8    0.46 9.9E-06   57.7   6.6   53  262-316   757-813 (1705)
 43 smart00051 DSL delta serrate l  82.1     1.8 3.9E-05   32.4   3.3   43  131-177    20-63  (63)
 44 PHA02887 EGF-like protein; Pro  82.0     1.4 2.9E-05   36.8   2.9   36   32-68     83-122 (126)
 45 PF12946 EGF_MSP1_1:  MSP1 EGF   78.2    0.72 1.6E-05   30.3   0.1   31  146-176     2-33  (37)
 46 cd01475 vWA_Matrilin VWA_Matri  77.8     2.3   5E-05   40.6   3.6   38  442-484   181-220 (224)
 47 cd01475 vWA_Matrilin VWA_Matri  76.6     2.8 6.1E-05   40.0   3.8   39  276-314   183-221 (224)
 48 KOG1218|consensus               71.6      30 0.00065   34.6  10.1   56    2-66     81-136 (316)
 49 PF00954 S_locus_glycop:  S-loc  71.1     4.2   9E-05   34.1   3.1   32   32-64     77-108 (110)
 50 PHA02887 EGF-like protein; Pro  69.5     4.8  0.0001   33.7   2.9   30  456-486    91-122 (126)
 51 PHA03099 epidermal growth fact  69.5     4.7  0.0001   34.3   2.9   36   32-68     42-81  (139)
 52 PF12946 EGF_MSP1_1:  MSP1 EGF   67.7     2.3 5.1E-05   27.9   0.6   29  456-484     4-33  (37)
 53 cd00055 EGF_Lam Laminin-type e  66.0       7 0.00015   27.6   2.9   17  390-406    20-36  (50)
 54 PF00053 Laminin_EGF:  Laminin   65.1     3.8 8.2E-05   28.7   1.4   26  380-407    11-36  (49)
 55 PHA03099 epidermal growth fact  64.9     6.4 0.00014   33.5   2.8   30  456-486    50-81  (139)
 56 cd00055 EGF_Lam Laminin-type e  64.9     8.5 0.00019   27.1   3.2   21  464-486    13-33  (50)
 57 PF00954 S_locus_glycop:  S-loc  63.9     7.8 0.00017   32.4   3.3   32  368-400    78-109 (110)
 58 KOG1218|consensus               63.4 1.6E+02  0.0035   29.2  20.4   40  302-344   163-202 (316)
 59 PF00053 Laminin_EGF:  Laminin   62.1     5.7 0.00012   27.8   1.8   22  463-486    11-32  (49)
 60 PF01414 DSL:  Delta serrate li  55.2     2.2 4.9E-05   31.9  -1.3   48   11-66     15-62  (63)
 61 smart00180 EGF_Lam Laminin-typ  52.6      15 0.00032   25.5   2.5   16  390-405    19-34  (46)
 62 PF01683 EB:  EB module;  Inter  47.9      23 0.00051   25.0   3.1   24  456-483    25-48  (52)
 63 KOG3516|consensus               46.5      15 0.00032   42.8   2.7   36  451-486   545-581 (1306)
 64 KOG3516|consensus               46.0      15 0.00033   42.6   2.8   36   32-68    545-581 (1306)
 65 PF01683 EB:  EB module;  Inter  44.9      24 0.00053   24.9   2.8   24   38-65     25-48  (52)
 66 KOG3512|consensus               41.3      78  0.0017   33.3   6.7   28  380-407   285-313 (592)
 67 KOG3514|consensus               39.7      20 0.00043   41.3   2.3   34  453-486   625-659 (1591)
 68 PF09064 Tme5_EGF_like:  Thromb  33.9      47   0.001   21.4   2.4   13  389-401    18-30  (34)
 69 KOG3514|consensus               32.8      30 0.00065   39.9   2.4   36  144-179   624-660 (1591)
 70 PF12955 DUF3844:  Domain of un  29.7      20 0.00044   29.6   0.4   32   33-64      6-43  (103)
 71 KOG3512|consensus               26.4 2.8E+02   0.006   29.5   7.7   27  156-182   286-313 (592)
 72 PF04863 EGF_alliinase:  Alliin  20.8      40 0.00087   24.3   0.5   31  456-486    16-50  (56)

No 1  
>KOG1217|consensus
Probab=99.44  E-value=1.7e-11  Score=131.20  Aligned_cols=275  Identities=28%  Similarity=0.602  Sum_probs=177.9

Q ss_pred             CCCCCeeeecCCCceeeCCCCCcCCCCCCCCCCCCCCCCCCCCCCCccccCccCCCCCCCCCCCCCCCccCCCCCCCCcc
Q psy9424          41 CGRNAECAVVNHTPRCTCVAGTVGDPKYQSGVGTSCTSSRDCIGEQQCISGLCQPTCRSNTTCPAQHYCNSGLCVLEMQC  120 (535)
Q Consensus        41 C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~~~~~C~~~~c~~~C~~~~~C~~~~~c~~~~C~~g~~C  120 (535)
                      ...++.++.....+.|.|++||.|..+..   .      ..|+....        .+.....|.....            
T Consensus        97 ~~~~~~~~~~~~~~~c~c~~g~~~~~~~~---~------~~C~~~~~--------~~~~~~~c~~~~~------------  147 (487)
T KOG1217|consen   97 LLLCGECVDCVGSYECTCPPGYQGTPCEG---E------CECVTGPG--------VCCIDGSCSNGPG------------  147 (487)
T ss_pred             ccCCccccCCCCCceeeCCCccccCcCCc---c------eeecCCCC--------CeeCchhhcCCCC------------
Confidence            34456667777889999999999985541   0      01222111        0001111111110            


Q ss_pred             ccCCCCCCCCccCCCCCCCCccc--CCCCC--CCCCCCCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9424         121 TTHDQCSATEQCRSNDMGQMQCR--PACEG--ILCGRNALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHG  196 (535)
Q Consensus       121 ~~~~~c~~~~~C~~~~~G~~~c~--~~C~~--~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~  196 (535)
                         ........|..+|.+.....  ++|..  .+|.+.+.|.+..++|.|.|++||.+..++.                 
T Consensus       148 ---~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~-----------------  207 (487)
T KOG1217|consen  148 ---SVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET-----------------  207 (487)
T ss_pred             ---CCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC-----------------
Confidence               00123346666777665443  58873  5699999999999889999999999988431                 


Q ss_pred             CCCCCCCCCCCCCCCCCCccccccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCC
Q psy9424         197 PGLSPGATSHSSHSGGPVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFG  276 (535)
Q Consensus       197 ~~c~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~  276 (535)
                                                                       .  .+++.|++.   +.|.+.++|.+..+. 
T Consensus       208 -------------------------------------------------~--~~~~~c~~~---~~~~~~~g~~~~~c~-  232 (487)
T KOG1217|consen  208 -------------------------------------------------T--GNGGTCVDS---VACSCPPGARGPECE-  232 (487)
T ss_pred             -------------------------------------------------C--CCCceEecc---eeccCCCCCCCCCcc-
Confidence                                                             0  122344333   568888998876432 


Q ss_pred             CeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCCCCCccCCCCccCcccC
Q psy9424         277 CHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTDGVPKCKASCQSDEECG  356 (535)
Q Consensus       277 C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~C~~~C~~~~eC~  356 (535)
                       ..+.++...   . ++|.+..++|+|.+++||++... ..+.++++|...                             
T Consensus       233 -~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~-----------------------------  277 (487)
T KOG1217|consen  233 -VSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALI-----------------------------  277 (487)
T ss_pred             -cccccccCC---C-CcccccCCceeeeCCCCcccccc-ceeeeccccCCC-----------------------------
Confidence             334444333   4 78999999999999999988762 112233333221                             


Q ss_pred             CCCcccCCCcCCCCCCCCCCCCCceeeecCCCeeeeCCCCCcCCCC------CCCC----ccCCCCCCcccCCC---CCc
Q psy9424         357 LGEKCLQGQCNNPCERQGACGVNSLCNVLTHRKVCFCPRGFTGDPE------TECV----RITCLSHADCYPGG---GSL  423 (535)
Q Consensus       357 ~~~~C~~~~C~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~------~~C~----~~~C~~~~~C~~~~---g~~  423 (535)
                                      . .|.++++|+++.+.|.|.|++||+|..+      .+|.    ..+|.++++|....   .+.
T Consensus       278 ----------------~-~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~  340 (487)
T KOG1217|consen  278 ----------------A-SCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFR  340 (487)
T ss_pred             ----------------C-ccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCC
Confidence                            1 1455667777777799999999999986      2453    22577777883322   245


Q ss_pred             cCCCCCCCCCcCCCCCCCCCCcCCCCCcCCCCCCCCCCCCeeee-CCCCceeeCCCCCcCC
Q psy9424         424 CLANLCTRGCSADTDCPAALSCRSAECVDPCSPAPCGPNAQCSV-ANHRPLCSCPAGLMGL  483 (535)
Q Consensus       424 C~~g~C~~g~~~~~~C~~g~~C~~~~c~d~C~~~~C~~~~~C~~-~~g~~~C~C~~G~~G~  483 (535)
                      |.   +..++       .|..|+..  .++|...++.+++.|++ ..++|.|.|+.+|.+.
T Consensus       341 C~---c~~~~-------~g~~C~~~--~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  341 CA---CGPGF-------TGRRCEDS--NDECASSPCCPGGTCVNETPGSYRCACPAGFAGK  389 (487)
T ss_pred             cC---CCCCC-------CCCccccC--CccccCCccccCCEeccCCCCCeEecCCCccccC
Confidence            76   66665       78888873  25898778999999999 6899999999999985


No 2  
>KOG4289|consensus
Probab=99.39  E-value=8.6e-12  Score=137.13  Aligned_cols=110  Identities=28%  Similarity=0.709  Sum_probs=83.1

Q ss_pred             CCCCCCCCCCCCeec----------------------cCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9424         144 PACEGILCGRNALCT----------------------ASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSP  201 (535)
Q Consensus       144 ~~C~~~~C~~~g~C~----------------------~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~  201 (535)
                      +.|+..||.+..+|+                      +..+++.|.|++||+|+.|++.                     
T Consensus      1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTe--------------------- 1238 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETE--------------------- 1238 (2531)
T ss_pred             chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccch---------------------
Confidence            566667777766663                      3346788999999999997632                     


Q ss_pred             CCCCCCCCCCCCCccccccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCee--
Q psy9424         202 GATSHSSHSGGPVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHL--  279 (535)
Q Consensus       202 ~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~--  279 (535)
                                            ++.|.                +.+|.++++|....|.|+|.|.+||+|+   .|+.  
T Consensus      1239 ----------------------iDlCY----------------s~pC~nng~C~srEggYtCeCrpg~tGe---hCEvs~ 1277 (2531)
T KOG4289|consen 1239 ----------------------IDLCY----------------SGPCGNNGRCRSREGGYTCECRPGFTGE---HCEVSA 1277 (2531)
T ss_pred             ----------------------hHhhh----------------cCCCCCCCceEEecCceeEEecCCcccc---ceeeec
Confidence                                  34443                5789999999999999999999999999   4543  


Q ss_pred             -CCcCCCCCCCCCCeeecCC-CCeeeeCCCC-CccCCCC
Q psy9424         280 -IDFCAAKPCGPGARCDNSR-GSYKCLCPLG-LVGDPYG  315 (535)
Q Consensus       280 -~~~C~~~~C~~~~~C~~~~-g~~~C~C~~G-y~g~~c~  315 (535)
                       .-.|.+..|.++++|++.. +.+.|.|+.| |++..|+
T Consensus      1278 ~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1278 RAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred             ccCccccceecCCCEEeecCCCceeccCCCcccCCCceE
Confidence             3457777899999998875 7888999887 4444443


No 3  
>KOG1214|consensus
Probab=99.38  E-value=6.2e-12  Score=132.48  Aligned_cols=123  Identities=28%  Similarity=0.682  Sum_probs=88.4

Q ss_pred             cCCCCC--CCCCCCCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc
Q psy9424         143 RPACEG--ILCGRNALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVE  220 (535)
Q Consensus       143 ~~~C~~--~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~  220 (535)
                      +++|+.  +.|+.+++|++.+++|+|.|..||.-..                           .+        .+|..+.
T Consensus       734 ~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~d---------------------------d~--------~tCV~i~  778 (1289)
T KOG1214|consen  734 ENECATGFHRCGPNSVCINLPGSYRCECRSGYEFAD---------------------------DR--------HTCVLIT  778 (1289)
T ss_pred             hhhhccCCCCCCCCceeecCCCceeEEEeecceecc---------------------------CC--------cceEEec
Confidence            455654  6789999999999999999999875433                           00        1222221


Q ss_pred             C-CCCCCCCCCCcccCCCcccCCCCCCCCCCC--CccccCC-CCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeec
Q psy9424         221 C-NSHADCSGDKVCEDHRCKISCLANNPCGPN--ALCSAEK-HKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDN  296 (535)
Q Consensus       221 C-~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~--~~C~~~~-g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~  296 (535)
                      = .....|..+              +..|...  ++|+... ++|.|.|.|||.|++.. |.++|+|.++.|...++|.+
T Consensus       779 ~pap~n~Ce~g--------------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c~dvDeC~psrChp~A~Cyn  843 (1289)
T KOG1214|consen  779 PPAPANPCEDG--------------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-CTDVDECSPSRCHPAATCYN  843 (1289)
T ss_pred             CCCCCCccccC--------------ccccCcCCceEEEecCCceEEEeecCCccCCccc-cccccccCccccCCCceEec
Confidence            0 011122111              2445444  3555544 56999999999999875 88899999999999999999


Q ss_pred             CCCCeeeeCCCCCccCCCC
Q psy9424         297 SRGSYKCLCPLGLVGDPYG  315 (535)
Q Consensus       297 ~~g~~~C~C~~Gy~g~~c~  315 (535)
                      +++++.|+|.+||.|+...
T Consensus       844 tpgsfsC~C~pGy~GDGf~  862 (1289)
T KOG1214|consen  844 TPGSFSCRCQPGYYGDGFQ  862 (1289)
T ss_pred             CCCcceeecccCccCCCce
Confidence            9999999999999998765


No 4  
>KOG1214|consensus
Probab=99.32  E-value=1.1e-11  Score=130.76  Aligned_cols=136  Identities=24%  Similarity=0.503  Sum_probs=102.4

Q ss_pred             ccCCCccCCCCCC--CcccCcCCcCCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcCCCCCCCCCCCCCCCCCCCC---
Q psy9424          10 QISQHLQQVPGLA--SAACVDGRCRNPCEA-DEVCGRNAECAVVNHTPRCTCVAGTVGDPKYQSGVGTSCTSSRDCI---   83 (535)
Q Consensus        10 ~~~~~c~c~~g~~--g~~C~~~~~~d~C~~-~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~---   83 (535)
                      .++..|+|..||.  |++|++   +++|+. ...|..+.+|++..++|+|.|..||.-.+           +...|+   
T Consensus       713 ~~~~tcecs~g~~gdgr~c~d---~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~d-----------d~~tCV~i~  778 (1289)
T KOG1214|consen  713 GVDYTCECSSGYQGDGRNCVD---ENECATGFHRCGPNSVCINLPGSYRCECRSGYEFAD-----------DRHTCVLIT  778 (1289)
T ss_pred             CcceEEEEeeccCCCCCCCCC---hhhhccCCCCCCCCceeecCCCceeEEEeecceecc-----------CCcceEEec
Confidence            3556799999997  788998   899997 77899999999999999999999986432           112343   


Q ss_pred             ---CCCccccCccCCCCCCCCCC--CC-CCCccCCCCCCCCccccCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCee
Q psy9424          84 ---GEQQCISGLCQPTCRSNTTC--PA-QHYCNSGLCVLEMQCTTHDQCSATEQCRSNDMGQMQCRPACEGILCGRNALC  157 (535)
Q Consensus        84 ---~~~~C~~~~c~~~C~~~~~C--~~-~~~c~~~~C~~g~~C~~~~~c~~~~~C~~~~~G~~~c~~~C~~~~C~~~g~C  157 (535)
                         .++.|..+.  |.|...+.+  +. +..-|.+.|.+||.       +++..|.        ++|+|+.+.|..++.|
T Consensus       779 ~pap~n~Ce~g~--h~C~i~g~a~c~~hGgs~y~C~CLPGfs-------GDG~~c~--------dvDeC~psrChp~A~C  841 (1289)
T KOG1214|consen  779 PPAPANPCEDGS--HTCAIAGQARCVHHGGSTYSCACLPGFS-------GDGHQCT--------DVDECSPSRCHPAATC  841 (1289)
T ss_pred             CCCCCCccccCc--cccCcCCceEEEecCCceEEEeecCCcc-------CCccccc--------cccccCccccCCCceE
Confidence               344566666  677765544  22 23344555555555       4444443        4899999999999999


Q ss_pred             ccCCCCceecCCCCCCCCC
Q psy9424         158 TASDHHATCSCKPGYVGHP  176 (535)
Q Consensus       158 ~~~~~~~~C~C~~Gf~g~~  176 (535)
                      ++++++|.|+|.+||.|++
T Consensus       842 yntpgsfsC~C~pGy~GDG  860 (1289)
T KOG1214|consen  842 YNTPGSFSCRCQPGYYGDG  860 (1289)
T ss_pred             ecCCCcceeecccCccCCC
Confidence            9999999999999999998


No 5  
>KOG4289|consensus
Probab=99.28  E-value=2.8e-11  Score=133.21  Aligned_cols=85  Identities=34%  Similarity=0.802  Sum_probs=69.1

Q ss_pred             CCCCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEE
Q psy9424         257 EKHKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHC  336 (535)
Q Consensus       257 ~~g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C  336 (535)
                      +.+.++|.|+|||+|+.++  +.+|.|.+.||.++++|...+|+|+|.|++||+|..|+..- ..-.|. +..|.++++|
T Consensus      1218 pvnglrCrCPpGFTgd~Ce--TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~-~agrCv-pGvC~nggtC 1293 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCE--TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSA-RAGRCV-PGVCKNGGTC 1293 (2531)
T ss_pred             ccCceeEeCCCCCCccccc--chhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeec-ccCccc-cceecCCCEE
Confidence            3467899999999999554  45999999999999999999999999999999999988421 123454 3689999999


Q ss_pred             ec-CCCCCcc
Q psy9424         337 VK-TDGVPKC  345 (535)
Q Consensus       337 ~~-~~g~~~C  345 (535)
                      ++ ..|.|.|
T Consensus      1294 ~~~~nggf~c 1303 (2531)
T KOG4289|consen 1294 VNLLNGGFCC 1303 (2531)
T ss_pred             eecCCCceec
Confidence            98 4555555


No 6  
>KOG1217|consensus
Probab=99.25  E-value=2.5e-10  Score=122.22  Aligned_cols=246  Identities=28%  Similarity=0.644  Sum_probs=160.1

Q ss_pred             CCCccCCCCCCCcccCcCCcCCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcCCCCCCCCCCCCCCCCCCCCCCC--cc
Q psy9424          12 SQHLQQVPGLASAACVDGRCRNPCEA-DEVCGRNAECAVVNHTPRCTCVAGTVGDPKYQSGVGTSCTSSRDCIGEQ--QC   88 (535)
Q Consensus        12 ~~~c~c~~g~~g~~C~~~~~~d~C~~-~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~~~~--~C   88 (535)
                      ...++|..||.+..+...  .++|.. ..+|.+++.|.+..++|.|.|++||++..+..   .   .....|++..  .+
T Consensus       151 ~~~c~C~~g~~~~~~~~~--~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~---~---~~~~~c~~~~~~~~  222 (487)
T KOG1217|consen  151 PFRCSCTEGYEGEPCETD--LDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET---T---GNGGTCVDSVACSC  222 (487)
T ss_pred             ceeeeeCCCccccccccc--ccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC---C---CCCceEecceeccC
Confidence            445999999999999862  379984 66799999999999999999999999985541   1   0111222210  00


Q ss_pred             ccCccCCCCC-------CC-CCCCCCCCccCCCCCCCCccccCCCCCCCCccCCCCCCCC----cccCCCCCCC-CCCCC
Q psy9424          89 ISGLCQPTCR-------SN-TTCPAQHYCNSGLCVLEMQCTTHDQCSATEQCRSNDMGQM----QCRPACEGIL-CGRNA  155 (535)
Q Consensus        89 ~~~~c~~~C~-------~~-~~C~~~~~c~~~~C~~g~~C~~~~~c~~~~~C~~~~~G~~----~c~~~C~~~~-C~~~g  155 (535)
                      ..+.-...|.       .+ ..|.+..+.+                  .+.|.++|.+..    ..+++|+... |.+++
T Consensus       223 ~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~------------------~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~  284 (487)
T KOG1217|consen  223 PPGARGPECEVSIVECASGDGTCVNTVGSY------------------TCRCPEGYTGDACVTCVDVDSCALIASCPNGG  284 (487)
T ss_pred             CCCCCCCCcccccccccCCCCcccccCCce------------------eeeCCCCccccccceeeeccccCCCCccCCCC
Confidence            0000000111       11 2222222222                  234455666654    2478898753 99999


Q ss_pred             eeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCCCCCCCCCCcccC
Q psy9424         156 LCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNSHADCSGDKVCED  235 (535)
Q Consensus       156 ~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C~~  235 (535)
                      +|++..+.|.|.|++||.|..+                                         ..+.+..+|....    
T Consensus       285 ~C~~~~~~~~C~C~~g~~g~~~-----------------------------------------~~~~~~~~C~~~~----  319 (487)
T KOG1217|consen  285 TCVNVPGSYRCTCPPGFTGRLC-----------------------------------------TECVDVDECSPRN----  319 (487)
T ss_pred             eeecCCCcceeeCCCCCCCCCC-----------------------------------------ccccccccccccc----
Confidence            9999988899999999999983                                         1122223332100    


Q ss_pred             CCcccCCCCCCCCCCCCcc--ccCCCCeeeeCCCCCccCCCCCCeeC-CcCCCCCCCCCCeeec-CCCCeeeeCCCCCcc
Q psy9424         236 HRCKISCLANNPCGPNALC--SAEKHKQICYCQPGYTGDAYFGCHLI-DFCAAKPCGPGARCDN-SRGSYKCLCPLGLVG  311 (535)
Q Consensus       236 ~~c~~~c~~~~~C~~~~~C--~~~~g~~~C~C~~G~~G~~~~~C~~~-~~C~~~~C~~~~~C~~-~~g~~~C~C~~Gy~g  311 (535)
                              ...+|.+++.|  ......+.|.|.++|.|.   .|+.. ++|...++..++.|++ ..+.|.|.++.+|.+
T Consensus       320 --------~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~---~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~  388 (487)
T KOG1217|consen  320 --------AGGPCANGGTCNTLGSFGGFRCACGPGFTGR---RCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG  388 (487)
T ss_pred             --------cCCcCCCCcccccCCCCCCCCcCCCCCCCCC---ccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence                    13557777777  233456789999999999   67776 4898888999999999 689999999999997


Q ss_pred             C--CCCCCCCCCCCCCCCCCCCCCCEEecCCCCCcc
Q psy9424         312 D--PYGAGCVSASQCTRDDQCPPGAHCVKTDGVPKC  345 (535)
Q Consensus       312 ~--~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~C  345 (535)
                      .  .....+.++++|..      .+.|++..+++.|
T Consensus       389 ~~~~~~~~~~~~~~c~~------~~~c~~~~~~~~c  418 (487)
T KOG1217|consen  389 KANGDGVGCEDIDECSG------CGDCVNGPGGGAC  418 (487)
T ss_pred             CCccccccccccccccC------CcceeccCCCCcc
Confidence            4  33344555666642      4456666666554


No 7  
>KOG1219|consensus
Probab=99.10  E-value=1.4e-10  Score=132.22  Aligned_cols=107  Identities=27%  Similarity=0.642  Sum_probs=85.5

Q ss_pred             CCCCCCCCccccCC-CCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCC
Q psy9424         245 NNPCGPNALCSAEK-HKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQ  323 (535)
Q Consensus       245 ~~~C~~~~~C~~~~-g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~  323 (535)
                      .+||+++|+|...+ +.|.|.|++.|+|..|+  .++++|.++||..+++|+...++|.|.|+.||+|..|+..  .+++
T Consensus      3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE--i~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~--Gi~e 3944 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE--IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR--GISE 3944 (4289)
T ss_pred             cCcccCCCEecCCCCCceEEeCcccccCcccc--cccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc--cccc
Confidence            58999999998876 67999999999999555  4689999999999999999999999999999999988731  2444


Q ss_pred             CCCCCCCCCCCEEecCCCCCccCCCCccCcccCCCCcccCCCcCCCCCCCCCCCCCceeeecCCCeeeeCCCCCcCCCC
Q psy9424         324 CTRDDQCPPGAHCVKTDGVPKCKASCQSDEECGLGEKCLQGQCNNPCERQGACGVNSLCNVLTHRKVCFCPRGFTGDPE  402 (535)
Q Consensus       324 C~~~~~C~~~~~C~~~~g~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~  402 (535)
                      |+.                                               ..|..+|+|++..|+|.|.|-+||.|..+
T Consensus      3945 Cs~-----------------------------------------------n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3945 CSK-----------------------------------------------NVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred             ccc-----------------------------------------------ccccCCceeeccCCceEeccChhHhcccC
Confidence            432                                               34555666677777777777777776653


No 8  
>KOG1219|consensus
Probab=99.09  E-value=1.6e-10  Score=131.84  Aligned_cols=110  Identities=29%  Similarity=0.712  Sum_probs=96.7

Q ss_pred             CCCCCCCCCCCCeeccCC-CCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCC
Q psy9424         144 PACEGILCGRNALCTASD-HHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECN  222 (535)
Q Consensus       144 ~~C~~~~C~~~g~C~~~~-~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~  222 (535)
                      +.|..+||+++|+|.... ++|.|.|++-|+|..||+.                                          
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~------------------------------------------ 3902 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID------------------------------------------ 3902 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccc------------------------------------------
Confidence            789999999999999876 6799999999999997632                                          


Q ss_pred             CCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCee--CCcCCCCCCCCCCeeecCCCC
Q psy9424         223 SHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHL--IDFCAAKPCGPGARCDNSRGS  300 (535)
Q Consensus       223 ~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~--~~~C~~~~C~~~~~C~~~~g~  300 (535)
                       ...|.                ++||..+++|+...+.|.|.|+.||+|.   +|+.  +++|..++|.++|.|++..|+
T Consensus      3903 -~epC~----------------snPC~~GgtCip~~n~f~CnC~~gyTG~---~Ce~~Gi~eCs~n~C~~gg~C~n~~gs 3962 (4289)
T KOG1219|consen 3903 -LEPCA----------------SNPCLTGGTCIPFYNGFLCNCPNGYTGK---RCEARGISECSKNVCGTGGQCINIPGS 3962 (4289)
T ss_pred             -ccccc----------------CCCCCCCCEEEecCCCeeEeCCCCccCc---eeecccccccccccccCCceeeccCCc
Confidence             22333                5899999999999999999999999999   5553  899999999999999999999


Q ss_pred             eeeeCCCCCccCCCC
Q psy9424         301 YKCLCPLGLVGDPYG  315 (535)
Q Consensus       301 ~~C~C~~Gy~g~~c~  315 (535)
                      |+|.|.+||.|..|.
T Consensus      3963 f~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3963 FHCNCTPGILGRTCC 3977 (4289)
T ss_pred             eEeccChhHhcccCc
Confidence            999999999988764


No 9  
>KOG1225|consensus
Probab=98.98  E-value=3.4e-09  Score=111.00  Aligned_cols=105  Identities=25%  Similarity=0.522  Sum_probs=74.2

Q ss_pred             CccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCCCCC--CCCCCCCCCCCCCCCCCccccC
Q psy9424          14 HLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDPKYQ--SGVGTSCTSSRDCIGEQQCISG   91 (535)
Q Consensus        14 ~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~c~~--C~~~~~C~~~~~C~~~~~C~~~   91 (535)
                      .|+++.+|+|..|+..    .|+  +.|..++.|++.    +|+|++||+|.+|++  |+      .  .|.....+.++
T Consensus       235 ic~c~~~~~g~~c~~~----~C~--~~c~~~g~c~~G----~CIC~~Gf~G~dC~e~~Cp------~--~cs~~g~~~~g  296 (525)
T KOG1225|consen  235 ICECPEGYFGPLCSTI----YCP--GGCTGRGQCVEG----RCICPPGFTGDDCDELVCP------V--DCSGGGVCVDG  296 (525)
T ss_pred             eeecCCceeCCccccc----cCC--CCCcccceEeCC----eEeCCCCCcCCCCCcccCC------c--ccCCCceecCC
Confidence            6999999999999964    554  556666888865    799999999997662  21      0  01111111111


Q ss_pred             ccCCCCCCCCCCCCCCCccCCCCCCCCccccCCCCCCCCccCCCCCCCCcccCCCCCCCCCCCCeeccCCCCceecCCCC
Q psy9424          92 LCQPTCRSNTTCPAQHYCNSGLCVLEMQCTTHDQCSATEQCRSNDMGQMQCRPACEGILCGRNALCTASDHHATCSCKPG  171 (535)
Q Consensus        92 ~c~~~C~~~~~C~~~~~c~~~~C~~g~~C~~~~~c~~~~~C~~~~~G~~~c~~~C~~~~C~~~g~C~~~~~~~~C~C~~G  171 (535)
                                                           .+.|.++|+|+.+.+..|. .+|..+|.|++    ..|.|.+|
T Consensus       297 -------------------------------------~CiC~~g~~G~dCs~~~cp-adC~g~G~Ci~----G~C~C~~G  334 (525)
T KOG1225|consen  297 -------------------------------------ECICNPGYSGKDCSIRRCP-ADCSGHGKCID----GECLCDEG  334 (525)
T ss_pred             -------------------------------------EeecCCCccccccccccCC-ccCCCCCcccC----CceEeCCC
Confidence                                                 1367788888887777776 68888999984    37999999


Q ss_pred             CCCCCCC
Q psy9424         172 YVGHPGP  178 (535)
Q Consensus       172 f~g~~c~  178 (535)
                      |+|..|+
T Consensus       335 y~G~~C~  341 (525)
T KOG1225|consen  335 YTGELCI  341 (525)
T ss_pred             CcCCccc
Confidence            9998853


No 10 
>KOG1225|consensus
Probab=98.91  E-value=1.7e-08  Score=105.89  Aligned_cols=121  Identities=26%  Similarity=0.620  Sum_probs=86.4

Q ss_pred             CccCCCCCCCCcccCCCCCCCCCCCCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy9424         130 EQCRSNDMGQMQCRPACEGILCGRNALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSH  209 (535)
Q Consensus       130 ~~C~~~~~G~~~c~~~C~~~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~  209 (535)
                      +.|+++|+|.++.+-.|... |+.++.+++.    .|+|++||+|..|+                               
T Consensus       267 CIC~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs-------------------------------  310 (525)
T KOG1225|consen  267 CICPPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPGYSGKDCS-------------------------------  310 (525)
T ss_pred             EeCCCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCCccccccc-------------------------------
Confidence            47788999998888778755 8888888874    89999999999853                               


Q ss_pred             CCCCCccccccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCeeCCcCCCCCCC
Q psy9424         210 SGGPVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHLIDFCAAKPCG  289 (535)
Q Consensus       210 ~~~~~~C~~~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~  289 (535)
                              ...|                       +..|.+++.|+..    +|.|.+||+|.   .|...      .|.
T Consensus       311 --------~~~c-----------------------padC~g~G~Ci~G----~C~C~~Gy~G~---~C~~~------~C~  346 (525)
T KOG1225|consen  311 --------IRRC-----------------------PADCSGHGKCIDG----ECLCDEGYTGE---LCIQR------ACS  346 (525)
T ss_pred             --------cccC-----------------------CccCCCCCcccCC----ceEeCCCCcCC---ccccc------ccC
Confidence                    1222                       3668889999833    59999999999   44432      388


Q ss_pred             CCCeeecCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCC
Q psy9424         290 PGARCDNSRGSYKCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTD  340 (535)
Q Consensus       290 ~~~~C~~~~g~~~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~  340 (535)
                      +++.|++.     |+|..||.|.. .    .-+.+.....|.....++...
T Consensus       347 ~~g~cv~g-----C~C~~Gw~G~d-~----~~~~~~~~~~cs~~~~~~~~~  387 (525)
T KOG1225|consen  347 GGGQCVNG-----CKCKKGWRGPD-V----ADPSLLLITECSPPSLCIAGV  387 (525)
T ss_pred             CCceeccC-----ceeccCccCCC-c----CCchhhcccccCCCceeeccc
Confidence            88888763     99999999987 1    222333233455555555443


No 11 
>KOG4260|consensus
Probab=98.59  E-value=5.7e-08  Score=90.98  Aligned_cols=71  Identities=25%  Similarity=0.651  Sum_probs=55.5

Q ss_pred             ccCCCCCCCCCCCcccCCCcccCCCCCCCCCCCCccccCCCCeeeeCCCCCccCCCCCCeeCCcCCC--CC-CCCCCeee
Q psy9424         219 VECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICYCQPGYTGDAYFGCHLIDFCAA--KP-CGPGARCD  295 (535)
Q Consensus       219 ~~C~~~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~C~~--~~-C~~~~~C~  295 (535)
                      ..|.++++|...              +.+|..+..|+|+.|+|.|...+||.+.       +++|..  .. -..+..|+
T Consensus       231 ~gCvDvnEC~~e--------------p~~c~~~qfCvNteGSf~C~dk~Gy~~g-------~d~C~~~~d~~~~kn~~c~  289 (350)
T KOG4260|consen  231 EGCVDVNECQNE--------------PAPCKAHQFCVNTEGSFKCEDKEGYKKG-------VDECQFCADVCASKNRPCM  289 (350)
T ss_pred             cccccHHHHhcC--------------CCCCChhheeecCCCceEecccccccCC-------hHHhhhhhhhcccCCCCcc
Confidence            347888888754              5889999999999999999999999863       344432  22 23456889


Q ss_pred             cCCCCeeeeCCCCCc
Q psy9424         296 NSRGSYKCLCPLGLV  310 (535)
Q Consensus       296 ~~~g~~~C~C~~Gy~  310 (535)
                      ++++.|+|.|..|+.
T Consensus       290 ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  290 NIDGQYRCVCFSGLI  304 (350)
T ss_pred             cCCccEEEEecccce
Confidence            999999999999975


No 12 
>KOG4260|consensus
Probab=98.39  E-value=4.4e-07  Score=85.12  Aligned_cols=137  Identities=28%  Similarity=0.669  Sum_probs=88.4

Q ss_pred             CCCCCCCccccC---CCCeeeeCCCCCccCCCCCCeeC------Cc----CCC--CCCCCCCeeecCCCCeee-eCCCCC
Q psy9424         246 NPCGPNALCSAE---KHKQICYCQPGYTGDAYFGCHLI------DF----CAA--KPCGPGARCDNSRGSYKC-LCPLGL  309 (535)
Q Consensus       246 ~~C~~~~~C~~~---~g~~~C~C~~G~~G~~~~~C~~~------~~----C~~--~~C~~~~~C~~~~g~~~C-~C~~Gy  309 (535)
                      .+|..++.|...   .|+-+|.|.+||+|+.+..|.+-      ++    |..  .+|.  +.|..... -.| +|+.||
T Consensus       150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~Csg~~~-k~C~kCkkGW  226 (350)
T KOG4260|consen  150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL--GVCSGESS-KGCSKCKKGW  226 (350)
T ss_pred             CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--cccCCCCC-CChhhhcccc
Confidence            567777777532   35678999999999976555420      00    100  1222  24543322 234 799999


Q ss_pred             ccCCCCCCCCCCCCCCCC-CCCCCCCEEecCCCCCccCC--CCc-cCcccCCCCcccCCCcCCCCCCCCCCCCCceeeec
Q psy9424         310 VGDPYGAGCVSASQCTRD-DQCPPGAHCVKTDGVPKCKA--SCQ-SDEECGLGEKCLQGQCNNPCERQGACGVNSLCNVL  385 (535)
Q Consensus       310 ~g~~c~~~C~~~~~C~~~-~~C~~~~~C~~~~g~~~C~~--~C~-~~~eC~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~  385 (535)
                      ..+.  ..|+|+++|... .+|.....|+|+.|+|.|..  ++. .+++|+.        |.+.|.     ..+..|+++
T Consensus       227 ~lde--~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~--------~~d~~~-----~kn~~c~ni  291 (350)
T KOG4260|consen  227 KLDE--EGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQF--------CADVCA-----SKNRPCMNI  291 (350)
T ss_pred             eecc--cccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhh--------hhhhcc-----cCCCCcccC
Confidence            9884  459999999876 68999999999999999852  121 1233321        011111     235678899


Q ss_pred             CCCeeeeCCCCCcCC
Q psy9424         386 THRKVCFCPRGFTGD  400 (535)
Q Consensus       386 ~g~~~C~C~~G~~g~  400 (535)
                      +++|+|+|..|+.-.
T Consensus       292 ~~~~r~v~f~~~~~~  306 (350)
T KOG4260|consen  292 DGQYRCVCFSGLIII  306 (350)
T ss_pred             CccEEEEecccceee
Confidence            999999999887644


No 13 
>KOG0994|consensus
Probab=98.32  E-value=2.8e-05  Score=85.76  Aligned_cols=106  Identities=27%  Similarity=0.603  Sum_probs=65.7

Q ss_pred             eecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCCC-CCCCCCCcccCCCcccCCC
Q psy9424         165 TCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNSH-ADCSGDKVCEDHRCKISCL  243 (535)
Q Consensus       165 ~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~~-~~C~~~~~C~~~~c~~~c~  243 (535)
                      .|+|.+|-+|..|.                   -|+|||+|..       .|+.-.|..+ ++|...             
T Consensus       831 QC~C~~g~ygrqCn-------------------qCqpG~WgFP-------eCr~CqCNgHA~~Cd~~-------------  871 (1758)
T KOG0994|consen  831 QCQCRPGTYGRQCN-------------------QCQPGYWGFP-------ECRPCQCNGHADTCDPI-------------  871 (1758)
T ss_pred             ceeeccccchhhcc-------------------ccCCCccCCC-------cCccccccCcccccCcc-------------
Confidence            78898998888864                   4788888853       4665566544 344322             


Q ss_pred             CCCCCCCCCccccCCCCeee-eCCCCCccCCCCCCeeCCcCCCCCCCCCC--------eeecC--CCCeeeeCCCCCccC
Q psy9424         244 ANNPCGPNALCSAEKHKQIC-YCQPGYTGDAYFGCHLIDFCAAKPCGPGA--------RCDNS--RGSYKCLCPLGLVGD  312 (535)
Q Consensus       244 ~~~~C~~~~~C~~~~g~~~C-~C~~G~~G~~~~~C~~~~~C~~~~C~~~~--------~C~~~--~g~~~C~C~~Gy~g~  312 (535)
                       ...|.   .|.+....+.| +|..||.|++..  -.-..|.+-||..+-        .|...  .....|.|.+||+|.
T Consensus       872 -tGaCi---~CqD~T~G~~CdrCl~GyyGdP~l--g~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~  945 (1758)
T KOG0994|consen  872 -TGACI---DCQDSTTGHSCDRCLDGYYGDPRL--GSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGS  945 (1758)
T ss_pred             -ccccc---cccccccccchhhhhccccCCccc--CCCCCCCCCCCCCCCccchhccccccccccccceeeecccCcccc
Confidence             12222   24455567788 899999998754  122345444443321        23222  234579999999999


Q ss_pred             CCC
Q psy9424         313 PYG  315 (535)
Q Consensus       313 ~c~  315 (535)
                      .|+
T Consensus       946 RCe  948 (1758)
T KOG0994|consen  946 RCE  948 (1758)
T ss_pred             chh
Confidence            877


No 14 
>KOG0994|consensus
Probab=97.95  E-value=0.00018  Score=79.68  Aligned_cols=26  Identities=35%  Similarity=0.828  Sum_probs=18.5

Q ss_pred             eeeecCCCeeeeCCCCCcCCCCCCCCcc
Q psy9424         381 LCNVLTHRKVCFCPRGFTGDPETECVRI  408 (535)
Q Consensus       381 ~C~~~~g~~~C~C~~G~~g~~~~~C~~~  408 (535)
                      +|..-.|  +|+|.+||-|..+++|...
T Consensus      1078 qCN~ftG--QCqCkpGfGGR~C~qCqel 1103 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGRTCSQCQEL 1103 (1758)
T ss_pred             ccccccc--ceeccCCCCCcchhHHHHh
Confidence            3443344  8999999999988777643


No 15 
>KOG1226|consensus
Probab=97.75  E-value=0.00025  Score=76.41  Aligned_cols=62  Identities=29%  Similarity=0.717  Sum_probs=45.3

Q ss_pred             CCCCCCCccccCCCCeeeeCCCCCccCCCCCCe-eCCcCCCC---CCCCCCeeecCCCCeeeeCCCC-CccCCCCC
Q psy9424         246 NPCGPNALCSAEKHKQICYCQPGYTGDAYFGCH-LIDFCAAK---PCGPGARCDNSRGSYKCLCPLG-LVGDPYGA  316 (535)
Q Consensus       246 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~-~~~~C~~~---~C~~~~~C~~~~g~~~C~C~~G-y~g~~c~~  316 (535)
                      ..|..+|+|.=.    +|+|.+||+|..+. |. +.+.|.+.   .|...|+|.=.    +|+|... |+|..|++
T Consensus       555 ~lC~g~G~C~CG----~CvC~~GwtG~~C~-C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~  621 (783)
T KOG1226|consen  555 VLCGGHGRCECG----RCVCNPGWTGSACN-CPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEK  621 (783)
T ss_pred             cccCCCCeEeCC----cEEcCCCCccCCCC-CCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhc
Confidence            568888887543    49999999999774 54 35667542   37777777654    6888776 99998874


No 16 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.71  E-value=2e-05  Score=54.14  Aligned_cols=33  Identities=21%  Similarity=0.519  Sum_probs=30.6

Q ss_pred             CCCCCC-CCCCCCCCeeeecCCCceeeCCCCCcC
Q psy9424          32 RNPCEA-DEVCGRNAECAVVNHTPRCTCVAGTVG   64 (535)
Q Consensus        32 ~d~C~~-~~~C~~~g~C~~~~~~~~C~C~~Gf~G   64 (535)
                      ||||.. .+.|..++.|+++.|+|+|+|++||..
T Consensus         2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~   35 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL   35 (42)
T ss_dssp             SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred             ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence            899997 678999999999999999999999983


No 17 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.70  E-value=1.4e-05  Score=51.22  Aligned_cols=30  Identities=33%  Similarity=0.793  Sum_probs=27.0

Q ss_pred             CCCCCCCCCCeeeeCC-CCceeeCCCCCcCC
Q psy9424         454 CSPAPCGPNAQCSVAN-HRPLCSCPAGLMGL  483 (535)
Q Consensus       454 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~G~  483 (535)
                      |.+++|.++|+|++.. ++|+|+|++||+|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            3467999999999998 99999999999996


No 18 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.57  E-value=9.6e-05  Score=49.57  Aligned_cols=35  Identities=29%  Similarity=0.748  Sum_probs=31.2

Q ss_pred             cCCCCC-CCCCCCCeeeeCCCCceeeCCCCCc-CCCC
Q psy9424         451 VDPCSP-APCGPNAQCSVANHRPLCSCPAGLM-GLPS  485 (535)
Q Consensus       451 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~-G~~c  485 (535)
                      +|+|.. .+|.++++|+++.++|.|.|++||+ |..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence            678876 7899999999999999999999999 7765


No 19 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.56  E-value=3.7e-05  Score=52.76  Aligned_cols=31  Identities=35%  Similarity=0.862  Sum_probs=29.0

Q ss_pred             cCCCC--CCCCCCCCeeeeCCCCceeeCCCCCc
Q psy9424         451 VDPCS--PAPCGPNAQCSVANHRPLCSCPAGLM  481 (535)
Q Consensus       451 ~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~  481 (535)
                      ||||+  +++|..++.|+|+.|+|+|.|++||.
T Consensus         2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            78998  56899999999999999999999998


No 20 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.47  E-value=5e-05  Score=48.72  Aligned_cols=28  Identities=21%  Similarity=0.564  Sum_probs=25.9

Q ss_pred             CCCCCCCCeeeecC-CCceeeCCCCCcCC
Q psy9424          38 DEVCGRNAECAVVN-HTPRCTCVAGTVGD   65 (535)
Q Consensus        38 ~~~C~~~g~C~~~~-~~~~C~C~~Gf~G~   65 (535)
                      .++|.++|+|++.. ++|+|+|++||+|.
T Consensus         3 ~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    3 SNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            46899999999998 89999999999996


No 21 
>KOG1226|consensus
Probab=97.39  E-value=0.0015  Score=70.52  Aligned_cols=60  Identities=25%  Similarity=0.616  Sum_probs=42.5

Q ss_pred             CCCCCCCccccCCCCeeeeCCCCCc----cCCCCCCeeCCcCCC---CCCCCCCeeecCCCCeeeeCCCCCccCCCC
Q psy9424         246 NPCGPNALCSAEKHKQICYCQPGYT----GDAYFGCHLIDFCAA---KPCGPGARCDNSRGSYKCLCPLGLVGDPYG  315 (535)
Q Consensus       246 ~~C~~~~~C~~~~g~~~C~C~~G~~----G~~~~~C~~~~~C~~---~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c~  315 (535)
                      .+|+..|.|+=.    +|+|.+...    |..++ |.+ -.|..   ..|..+|+|.=.    +|+|.+||+|..|+
T Consensus       514 ~vCSgrG~C~CG----qC~C~~~~~~~i~G~fCE-CDn-fsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~  580 (783)
T KOG1226|consen  514 PVCSGRGDCVCG----QCVCHKPDNGKIYGKFCE-CDN-FSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACN  580 (783)
T ss_pred             CCcCCCCcEeCC----ceEecCCCCCceeeeeee-ccC-cccccccCcccCCCCeEeCC----cEEcCCCCccCCCC
Confidence            478888888644    488887766    66443 322 23433   348889998765    79999999999987


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.26  E-value=0.00037  Score=46.66  Aligned_cols=35  Identities=26%  Similarity=0.656  Sum_probs=30.4

Q ss_pred             cCCCCC-CCCCCCCeeccCCCCceecCCCCCC-CCCC
Q psy9424         143 RPACEG-ILCGRNALCTASDHHATCSCKPGYV-GHPG  177 (535)
Q Consensus       143 ~~~C~~-~~C~~~g~C~~~~~~~~C~C~~Gf~-g~~c  177 (535)
                      +++|.. .+|.++++|+++.++|.|.|++||+ |..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C   38 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC   38 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence            567776 7899899999999999999999999 7664


No 23 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.19  E-value=0.0005  Score=45.51  Aligned_cols=35  Identities=31%  Similarity=0.767  Sum_probs=30.9

Q ss_pred             cCCCCC-CCCCCCCeeeeCCCCceeeCCCCCcCCCC
Q psy9424         451 VDPCSP-APCGPNAQCSVANHRPLCSCPAGLMGLPS  485 (535)
Q Consensus       451 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~G~~c  485 (535)
                      +++|.. .+|.+++.|++..++|+|.|++||.|..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            577765 78999999999999999999999999765


No 24 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.02  E-value=0.00026  Score=46.55  Aligned_cols=29  Identities=31%  Similarity=0.736  Sum_probs=24.1

Q ss_pred             CCCCCCCceeeecCCCeeeeCCCCCcCCC
Q psy9424         373 QGACGVNSLCNVLTHRKVCFCPRGFTGDP  401 (535)
Q Consensus       373 ~~~C~~~~~C~~~~g~~~C~C~~G~~g~~  401 (535)
                      .+.|+.+|+|+++.++|+|+|++||.|++
T Consensus         5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            46899999999999999999999999986


No 25 
>KOG1836|consensus
Probab=97.00  E-value=0.048  Score=65.70  Aligned_cols=49  Identities=24%  Similarity=0.558  Sum_probs=35.7

Q ss_pred             ccCCCCCCCCcc--cCCCCCCCCCCCCeeccCC--CCceec-CCCCCCCCCCCC
Q psy9424         131 QCRSNDMGQMQC--RPACEGILCGRNALCTASD--HHATCS-CKPGYVGHPGPS  179 (535)
Q Consensus       131 ~C~~~~~G~~~c--~~~C~~~~C~~~g~C~~~~--~~~~C~-C~~Gf~g~~c~~  179 (535)
                      .|.++|+|...-  ...|..-+|...+.|..+.  ....|. |++||+|..|+.
T Consensus       760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~  813 (1705)
T KOG1836|consen  760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE  813 (1705)
T ss_pred             hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence            566777776422  1227777888888887765  457898 999999999864


No 26 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.97  E-value=0.0003  Score=46.26  Aligned_cols=29  Identities=34%  Similarity=0.657  Sum_probs=23.8

Q ss_pred             CCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424          38 DEVCGRNAECAVVNHTPRCTCVAGTVGDP   66 (535)
Q Consensus        38 ~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~   66 (535)
                      .+.|..+++|+++.++|.|+|++||+|+.
T Consensus         5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGSYTCTCKPGYEGDG   33 (36)
T ss_dssp             GGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred             CCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence            46799999999999999999999999984


No 27 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.85  E-value=0.0014  Score=43.28  Aligned_cols=35  Identities=29%  Similarity=0.674  Sum_probs=30.1

Q ss_pred             cCCCCC-CCCCCCCeeccCCCCceecCCCCCCCCCC
Q psy9424         143 RPACEG-ILCGRNALCTASDHHATCSCKPGYVGHPG  177 (535)
Q Consensus       143 ~~~C~~-~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c  177 (535)
                      +++|.. .+|.++++|+++.+.|.|.|++||.|..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence            466766 68988899999999999999999999764


No 28 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.75  E-value=0.0021  Score=41.75  Aligned_cols=30  Identities=27%  Similarity=0.664  Sum_probs=26.7

Q ss_pred             CCCCCCCCeeeeCCCCceeeCCCCCcCC-CC
Q psy9424         456 PAPCGPNAQCSVANHRPLCSCPAGLMGL-PS  485 (535)
Q Consensus       456 ~~~C~~~~~C~~~~g~~~C~C~~G~~G~-~c  485 (535)
                      ..+|.+++.|++..+.|+|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            5678889999999999999999999998 44


No 29 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.69  E-value=0.0024  Score=41.64  Aligned_cols=28  Identities=32%  Similarity=0.756  Sum_probs=24.7

Q ss_pred             CCCCCCCeeeeCCCCceeeCCCCCcC-CCC
Q psy9424         457 APCGPNAQCSVANHRPLCSCPAGLMG-LPS  485 (535)
Q Consensus       457 ~~C~~~~~C~~~~g~~~C~C~~G~~G-~~c  485 (535)
                      .+|.++ +|++..++|+|.|++||.| ..|
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            578888 9999999999999999999 544


No 30 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.61  E-value=0.0021  Score=37.92  Aligned_cols=24  Identities=38%  Similarity=0.836  Sum_probs=20.7

Q ss_pred             CeeeeCCCCCccCCCCCCCCCCCC
Q psy9424         300 SYKCLCPLGLVGDPYGAGCVSASQ  323 (535)
Q Consensus       300 ~~~C~C~~Gy~g~~c~~~C~~~~~  323 (535)
                      +|+|+|++||......++|++|+|
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            589999999998887888888875


No 31 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.34  E-value=0.005  Score=39.90  Aligned_cols=28  Identities=29%  Similarity=0.780  Sum_probs=25.4

Q ss_pred             CCCCCCCCeeccCCCCceecCCCCCCCC
Q psy9424         148 GILCGRNALCTASDHHATCSCKPGYVGH  175 (535)
Q Consensus       148 ~~~C~~~g~C~~~~~~~~C~C~~Gf~g~  175 (535)
                      ..+|.++++|+++.++|.|.|+.||.|.
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence            4678889999999999999999999987


No 32 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.20  E-value=0.00096  Score=60.05  Aligned_cols=106  Identities=29%  Similarity=0.663  Sum_probs=66.0

Q ss_pred             CCCCCCCeeccCC-----CCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCC
Q psy9424         149 ILCGRNALCTASD-----HHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNS  223 (535)
Q Consensus       149 ~~C~~~g~C~~~~-----~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~  223 (535)
                      .+|+..++|++..     ..|.|.|.+||+...                                     ..|.+..|.+
T Consensus        50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~-------------------------------------~vCvp~~C~~   92 (197)
T PF06247_consen   50 KPCGDYAKCINQANKGEERAYKCDCINGYILKQ-------------------------------------GVCVPNKCNN   92 (197)
T ss_dssp             SEEETTEEEEE-SSTTSSTSEEEEE-TTEEESS-------------------------------------SSEEEGGGSS
T ss_pred             ccccchhhhhcCCCcccceeEEEecccCceeeC-------------------------------------CeEchhhcCc
Confidence            5677778887755     469999999999887                                     5677777764


Q ss_pred             CCCCCCCCcccCCCcccCCCCCCCCCCCCccccCC---CCeeeeCCCCCccCCCCCCee--CCcCCCCCCCCCCeeecCC
Q psy9424         224 HADCSGDKVCEDHRCKISCLANNPCGPNALCSAEK---HKQICYCQPGYTGDAYFGCHL--IDFCAAKPCGPGARCDNSR  298 (535)
Q Consensus       224 ~~~C~~~~~C~~~~c~~~c~~~~~C~~~~~C~~~~---g~~~C~C~~G~~G~~~~~C~~--~~~C~~~~C~~~~~C~~~~  298 (535)
                                            ..|. .|.|+..+   ....|+|.-|+.-+....|+.  ..+|. -.|..+-.|....
T Consensus        93 ----------------------~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~  148 (197)
T PF06247_consen   93 ----------------------KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVD  148 (197)
T ss_dssp             -------------------------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEET
T ss_pred             ----------------------eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeC
Confidence                                  3355 56886432   345899999998333334553  23443 3477778999999


Q ss_pred             CCeeeeCCCCCccCCCC
Q psy9424         299 GSYKCLCPLGLVGDPYG  315 (535)
Q Consensus       299 g~~~C~C~~Gy~g~~c~  315 (535)
                      +-|+|.+..||.+..-+
T Consensus       149 ~~Y~C~~~~~~~~~~~~  165 (197)
T PF06247_consen  149 GYYKCVCKEGFPGDGEG  165 (197)
T ss_dssp             TEEEEEE-TT-EEETTT
T ss_pred             cEEEeecCCCCCCCCCc
Confidence            99999999999876544


No 33 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.20  E-value=0.0045  Score=36.52  Aligned_cols=24  Identities=21%  Similarity=0.494  Sum_probs=18.7

Q ss_pred             CceeeCCCCCcCCCCCCCCCCCCCCCCCCCCCCCc
Q psy9424          53 TPRCTCVAGTVGDPKYQSGVGTSCTSSRDCIGEQQ   87 (535)
Q Consensus        53 ~~~C~C~~Gf~G~~c~~C~~~~~C~~~~~C~~~~~   87 (535)
                      +|+|.|++||+..           .+...|.||+|
T Consensus         1 sy~C~C~~Gy~l~-----------~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLS-----------PDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCC-----------CCCCccccCCC
Confidence            5899999999976           33467888765


No 34 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.12  E-value=0.00087  Score=60.31  Aligned_cols=146  Identities=29%  Similarity=0.653  Sum_probs=83.9

Q ss_pred             CCeeccCCCCceecCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccCCCCCCCCCCCcc
Q psy9424         154 NALCTASDHHATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGGPVGCHRVECNSHADCSGDKVC  233 (535)
Q Consensus       154 ~g~C~~~~~~~~C~C~~Gf~g~~c~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~C  233 (535)
                      +|..+...+.|.|.|.+||....                                         ..+|+...+|....  
T Consensus        10 NG~LiQMSNHfEC~Cnegfvl~~-----------------------------------------EntCE~kv~C~~~e--   46 (197)
T PF06247_consen   10 NGYLIQMSNHFECKCNEGFVLKN-----------------------------------------ENTCEEKVECDKLE--   46 (197)
T ss_dssp             TEEEEEESSEEEEEESTTEEEEE-----------------------------------------TTEEEE----SG-G--
T ss_pred             CCEEEEccCceEEEcCCCcEEcc-----------------------------------------ccccccceecCccc--
Confidence            57777777889999999998765                                         12233233332100  


Q ss_pred             cCCCcccCCCCCCCCCCCCccccCC-----CCeeeeCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCC---CCeeeeC
Q psy9424         234 EDHRCKISCLANNPCGPNALCSAEK-----HKQICYCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSR---GSYKCLC  305 (535)
Q Consensus       234 ~~~~c~~~c~~~~~C~~~~~C~~~~-----g~~~C~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~---g~~~C~C  305 (535)
                               ....+|...++|++..     ..|.|.|.+||+.... .|. .+.|....|. .|.|+..+   ....|+|
T Consensus        47 ---------~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-vCv-p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC  114 (197)
T PF06247_consen   47 ---------NVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG-VCV-PNKCNNKDCG-SGKCILDPDNPNNPTCSC  114 (197)
T ss_dssp             ---------GTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-SEE-EGGGSS---T-TEEEEEEEGGGSEEEEEE
T ss_pred             ---------ccCccccchhhhhcCCCcccceeEEEecccCceeeCC-eEc-hhhcCceecC-CCeEEecCCCCCCceeEe
Confidence                     0136788899998765     4699999999997654 354 3566666677 56887433   3458999


Q ss_pred             CCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCCCCCccCCCCccCcccCCCCcccCCCcCCCCCCCCCCCCCceeeec
Q psy9424         306 PLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTDGVPKCKASCQSDEECGLGEKCLQGQCNNPCERQGACGVNSLCNVL  385 (535)
Q Consensus       306 ~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~  385 (535)
                      .-|+..+. ..                  .|+. +|.-.|                         +  -.|..+-.|..+
T Consensus       115 ~IGkV~~d-n~------------------kCtk-~G~T~C-------------------------~--LKCk~nE~CK~~  147 (197)
T PF06247_consen  115 NIGKVPDD-NK------------------KCTK-TGETKC-------------------------S--LKCKENEECKLV  147 (197)
T ss_dssp             -TEEETTT-TT------------------ESEE-EE-----------------------------------TTTEEEEEE
T ss_pred             eeceEecc-CC------------------cccC-CCccce-------------------------e--eecCCCcceeee
Confidence            99998221 11                  1211 011111                         1  134456678889


Q ss_pred             CCCeeeeCCCCCcCCC
Q psy9424         386 THRKVCFCPRGFTGDP  401 (535)
Q Consensus       386 ~g~~~C~C~~G~~g~~  401 (535)
                      .+-|+|++.+||.++.
T Consensus       148 ~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  148 DGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             TTEEEEEE-TT-EEET
T ss_pred             CcEEEeecCCCCCCCC
Confidence            9999999999998775


No 35 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.07  E-value=0.0079  Score=39.09  Aligned_cols=28  Identities=36%  Similarity=0.780  Sum_probs=24.2

Q ss_pred             CCCCCCCeeccCCCCceecCCCCCCC-CCC
Q psy9424         149 ILCGRNALCTASDHHATCSCKPGYVG-HPG  177 (535)
Q Consensus       149 ~~C~~~g~C~~~~~~~~C~C~~Gf~g-~~c  177 (535)
                      .+|..+ +|+++.++|+|.|++||.| ..|
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            578777 9999999999999999999 553


No 36 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.90  E-value=0.009  Score=38.14  Aligned_cols=26  Identities=27%  Similarity=0.756  Sum_probs=22.4

Q ss_pred             CCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424          39 EVCGRNAECAVVNHTPRCTCVAGTVGDP   66 (535)
Q Consensus        39 ~~C~~~g~C~~~~~~~~C~C~~Gf~G~~   66 (535)
                      ..|+++|+|+..  .++|+|.+||+|..
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~   31 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGPD   31 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCCC
Confidence            469999999976  56899999999984


No 37 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.77  E-value=0.012  Score=37.62  Aligned_cols=27  Identities=22%  Similarity=0.564  Sum_probs=22.8

Q ss_pred             CCCCCCCeeeeCCCCceeeCCCCCcCCCC
Q psy9424         457 APCGPNAQCSVANHRPLCSCPAGLMGLPS  485 (535)
Q Consensus       457 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~c  485 (535)
                      ..|+++|+|+...+  +|+|.+||+|..|
T Consensus         6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPCG--RCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence            46899999997644  9999999999875


No 38 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.08  E-value=0.02  Score=28.59  Aligned_cols=13  Identities=38%  Similarity=0.989  Sum_probs=10.4

Q ss_pred             eeeCCCCCcCCCC
Q psy9424         473 LCSCPAGLMGLPS  485 (535)
Q Consensus       473 ~C~C~~G~~G~~c  485 (535)
                      +|+|++||+|..|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5899999999875


No 39 
>smart00051 DSL delta serrate ligand.
Probab=94.07  E-value=0.064  Score=40.18  Aligned_cols=45  Identities=13%  Similarity=0.229  Sum_probs=34.4

Q ss_pred             CccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424          14 HLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDP   66 (535)
Q Consensus        14 ~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~   66 (535)
                      .-.|.++|.|..|..     .|...+....+.+|..   .+.++|.+||+|..
T Consensus        18 rv~C~~~~yG~~C~~-----~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~   62 (63)
T smart00051       18 RVTCDENYYGEGCNK-----FCRPRDDFFGHYTCDE---NGNKGCLEGWMGPY   62 (63)
T ss_pred             EeeCCCCCcCCccCC-----EeCcCccccCCccCCc---CCCEecCCCCcCCC
Confidence            456889999999976     5654455677788854   35799999999984


No 40 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.20  E-value=0.083  Score=34.69  Aligned_cols=21  Identities=43%  Similarity=0.885  Sum_probs=10.1

Q ss_pred             eeecCCCCeeeeCCCCCccCC
Q psy9424         293 RCDNSRGSYKCLCPLGLVGDP  313 (535)
Q Consensus       293 ~C~~~~g~~~C~C~~Gy~g~~  313 (535)
                      +|++.+++|+|.|++||+...
T Consensus        11 ~C~~~~g~~~C~C~~Gy~L~~   31 (36)
T PF14670_consen   11 ICVNTPGSYRCSCPPGYKLAE   31 (36)
T ss_dssp             EEEEETTSEEEE-STTEEE-T
T ss_pred             CCccCCCceEeECCCCCEECc
Confidence            455555555555555555444


No 41 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.12  E-value=0.08  Score=34.77  Aligned_cols=25  Identities=24%  Similarity=0.497  Sum_probs=19.4

Q ss_pred             CCCCCCCeeeecCCCceeeCCCCCcCC
Q psy9424          39 EVCGRNAECAVVNHTPRCTCVAGTVGD   65 (535)
Q Consensus        39 ~~C~~~g~C~~~~~~~~C~C~~Gf~G~   65 (535)
                      +.|++  .|+++.++|+|.|++||+..
T Consensus         6 GgC~h--~C~~~~g~~~C~C~~Gy~L~   30 (36)
T PF14670_consen    6 GGCSH--ICVNTPGSYRCSCPPGYKLA   30 (36)
T ss_dssp             GGSSS--EEEEETTSEEEE-STTEEE-
T ss_pred             CCcCC--CCccCCCceEeECCCCCEEC
Confidence            34554  79999999999999999876


No 42 
>KOG1836|consensus
Probab=90.78  E-value=0.46  Score=57.70  Aligned_cols=53  Identities=26%  Similarity=0.574  Sum_probs=38.8

Q ss_pred             ee-eCCCCCccCCCCCCeeCCcCCCCCCCCCCeeecCC--CCeeee-CCCCCccCCCCC
Q psy9424         262 IC-YCQPGYTGDAYFGCHLIDFCAAKPCGPGARCDNSR--GSYKCL-CPLGLVGDPYGA  316 (535)
Q Consensus       262 ~C-~C~~G~~G~~~~~C~~~~~C~~~~C~~~~~C~~~~--g~~~C~-C~~Gy~g~~c~~  316 (535)
                      +| +|..||.|.+-.  -....|.+-+|.+++.|....  ....|+ |++||+|..|+.
T Consensus       757 ~C~~C~~GfYg~~~~--~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~  813 (1705)
T KOG1836|consen  757 QCAQCVDGFYGLPDL--GTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE  813 (1705)
T ss_pred             chhhhcCCCCCcccc--CCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence            56 899999987532  112338777888888776654  567898 999999998773


No 43 
>smart00051 DSL delta serrate ligand.
Probab=82.08  E-value=1.8  Score=32.43  Aligned_cols=43  Identities=19%  Similarity=0.305  Sum_probs=28.0

Q ss_pred             ccCCCCCCCCcccCCCCC-CCCCCCCeeccCCCCceecCCCCCCCCCC
Q psy9424         131 QCRSNDMGQMQCRPACEG-ILCGRNALCTASDHHATCSCKPGYVGHPG  177 (535)
Q Consensus       131 ~C~~~~~G~~~c~~~C~~-~~C~~~g~C~~~~~~~~C~C~~Gf~g~~c  177 (535)
                      .|.++|.|..+ ...|.. +....+.+|.. .  ..++|.+||+|+.|
T Consensus        20 ~C~~~~yG~~C-~~~C~~~~d~~~~~~Cd~-~--G~~~C~~Gw~G~~C   63 (63)
T smart00051       20 TCDENYYGEGC-NKFCRPRDDFFGHYTCDE-N--GNKGCLEGWMGPYC   63 (63)
T ss_pred             eCCCCCcCCcc-CCEeCcCccccCCccCCc-C--CCEecCCCCcCCCC
Confidence            56677777763 345543 33456777754 2  36889999999863


No 44 
>PHA02887 EGF-like protein; Provisional
Probab=81.97  E-value=1.4  Score=36.79  Aligned_cols=36  Identities=25%  Similarity=0.516  Sum_probs=28.2

Q ss_pred             CCCCCC--CCCCCCCCeeeecC--CCceeeCCCCCcCCCCC
Q psy9424          32 RNPCEA--DEVCGRNAECAVVN--HTPRCTCVAGTVGDPKY   68 (535)
Q Consensus        32 ~d~C~~--~~~C~~~g~C~~~~--~~~~C~C~~Gf~G~~c~   68 (535)
                      ..+|..  .+-|- ||+|.-..  ..+.|.|++||+|.+|.
T Consensus        83 f~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         83 FEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             ccccChHhhCEee-CCEEEccccCCCceeECCCCcccCCCC
Confidence            667775  56788 57997754  46899999999999654


No 45 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=78.25  E-value=0.72  Score=30.26  Aligned_cols=31  Identities=32%  Similarity=0.576  Sum_probs=21.9

Q ss_pred             CCCCCCCCCCeeccCC-CCceecCCCCCCCCC
Q psy9424         146 CEGILCGRNALCTASD-HHATCSCKPGYVGHP  176 (535)
Q Consensus       146 C~~~~C~~~g~C~~~~-~~~~C~C~~Gf~g~~  176 (535)
                      |....|..|+.|++.. |+++|.|..||....
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence            4446788899999876 899999999998655


No 46 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=77.81  E-value=2.3  Score=40.61  Aligned_cols=38  Identities=26%  Similarity=0.543  Sum_probs=29.5

Q ss_pred             CCCcCCCCCcCCCC--CCCCCCCCeeeeCCCCceeeCCCCCcCCC
Q psy9424         442 ALSCRSAECVDPCS--PAPCGPNAQCSVANHRPLCSCPAGLMGLP  484 (535)
Q Consensus       442 g~~C~~~~c~d~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~G~~  484 (535)
                      +..|.+   +++|.  +++|.  ..|.++.|+|.|.|++||+...
T Consensus       181 ~~~C~~---~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~  220 (224)
T cd01475         181 GKICVV---PDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLE  220 (224)
T ss_pred             cccCcC---chhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCC
Confidence            455655   67786  45565  5799999999999999998754


No 47 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=76.59  E-value=2.8  Score=40.04  Aligned_cols=39  Identities=28%  Similarity=0.433  Sum_probs=29.4

Q ss_pred             CCeeCCcCCCCCCCCCCeeecCCCCeeeeCCCCCccCCC
Q psy9424         276 GCHLIDFCAAKPCGPGARCDNSRGSYKCLCPLGLVGDPY  314 (535)
Q Consensus       276 ~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~c  314 (535)
                      .|+++++|...+......|.+..|+|.|.|++||+....
T Consensus       183 ~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~  221 (224)
T cd01475         183 ICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLED  221 (224)
T ss_pred             cCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCC
Confidence            577788886533222358999999999999999987543


No 48 
>KOG1218|consensus
Probab=71.64  E-value=30  Score=34.56  Aligned_cols=56  Identities=21%  Similarity=0.585  Sum_probs=27.5

Q ss_pred             cccccchhccCCCccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424           2 CREQVQWQQISQHLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDP   66 (535)
Q Consensus         2 ~~~~~~~~~~~~~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~   66 (535)
                      |+.++..+...+.+. ..+|.|..|..   +.+|...  |.. -+|.+...  .|.+..+|.+..
T Consensus        81 c~~~~~~~~~~~~~~-~~~~~g~~C~~---~~~~~~~--c~~-~~C~~~~~--~c~~~~~~~~~~  136 (316)
T KOG1218|consen   81 CKNGGTCVSSTGYCH-LNGYEGPQCES---PCPCGDG--CAE-KTCANPRR--ECRCGGGYIGEQ  136 (316)
T ss_pred             cCCCCcccCCCCccc-CCCCCcccccC---CCCcCCc--ccc-cccCCCcc--ceecCCcCcccc
Confidence            344555555555554 56666666665   3333211  222 33443321  466666666553


No 49 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=71.14  E-value=4.2  Score=34.08  Aligned_cols=32  Identities=31%  Similarity=0.924  Sum_probs=26.1

Q ss_pred             CCCCCCCCCCCCCCeeeecCCCceeeCCCCCcC
Q psy9424          32 RNPCEADEVCGRNAECAVVNHTPRCTCVAGTVG   64 (535)
Q Consensus        32 ~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G   64 (535)
                      .|+|...+.|+.+|.|.. .....|.|.+||.-
T Consensus        77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP  108 (110)
T ss_pred             ccCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence            568877789999999954 45678999999974


No 50 
>PHA02887 EGF-like protein; Provisional
Probab=69.52  E-value=4.8  Score=33.65  Aligned_cols=30  Identities=23%  Similarity=0.540  Sum_probs=24.0

Q ss_pred             CCCCCCCCeeeeC--CCCceeeCCCCCcCCCCC
Q psy9424         456 PAPCGPNAQCSVA--NHRPLCSCPAGLMGLPSA  486 (535)
Q Consensus       456 ~~~C~~~~~C~~~--~g~~~C~C~~G~~G~~c~  486 (535)
                      .+.|- +|+|...  ...+.|.|+.||+|..|+
T Consensus        91 k~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE  122 (126)
T PHA02887         91 NDFCI-NGECMNIIDLDEKFCICNKGYTGIRCD  122 (126)
T ss_pred             hCEee-CCEEEccccCCCceeECCCCcccCCCC
Confidence            45676 5799765  356899999999999986


No 51 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=69.48  E-value=4.7  Score=34.26  Aligned_cols=36  Identities=22%  Similarity=0.441  Sum_probs=27.5

Q ss_pred             CCCCCC--CCCCCCCCeeeecC--CCceeeCCCCCcCCCCC
Q psy9424          32 RNPCEA--DEVCGRNAECAVVN--HTPRCTCVAGTVGDPKY   68 (535)
Q Consensus        32 ~d~C~~--~~~C~~~g~C~~~~--~~~~C~C~~Gf~G~~c~   68 (535)
                      +-+|..  .+-|-++ +|.-..  ..+.|.|..||+|.+|+
T Consensus        42 i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         42 IRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             cccCChhhCCEeECC-EEEeeccCCCceeECCCCccccccc
Confidence            556664  5678764 997754  58899999999999654


No 52 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=67.74  E-value=2.3  Score=27.92  Aligned_cols=29  Identities=24%  Similarity=0.462  Sum_probs=20.6

Q ss_pred             CCCCCCCCeeeeCC-CCceeeCCCCCcCCC
Q psy9424         456 PAPCGPNAQCSVAN-HRPLCSCPAGLMGLP  484 (535)
Q Consensus       456 ~~~C~~~~~C~~~~-g~~~C~C~~G~~G~~  484 (535)
                      ...|..++.|++.. |++.|.|..||....
T Consensus         4 ~~~cP~NA~C~~~~dG~eecrCllgyk~~~   33 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGSEECRCLLGYKKVG   33 (37)
T ss_dssp             SS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred             CccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence            45677889998876 999999999997643


No 53 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=66.04  E-value=7  Score=27.57  Aligned_cols=17  Identities=29%  Similarity=0.612  Sum_probs=14.4

Q ss_pred             eeeCCCCCcCCCCCCCC
Q psy9424         390 VCFCPRGFTGDPETECV  406 (535)
Q Consensus       390 ~C~C~~G~~g~~~~~C~  406 (535)
                      +|.|+++|+|..++.|.
T Consensus        20 ~C~C~~~~~G~~C~~C~   36 (50)
T cd00055          20 QCECKPNTTGRRCDRCA   36 (50)
T ss_pred             EEeCCCcCCCCCCCCCC
Confidence            89999999999876553


No 54 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=65.15  E-value=3.8  Score=28.73  Aligned_cols=26  Identities=31%  Similarity=0.634  Sum_probs=18.1

Q ss_pred             ceeeecCCCeeeeCCCCCcCCCCCCCCc
Q psy9424         380 SLCNVLTHRKVCFCPRGFTGDPETECVR  407 (535)
Q Consensus       380 ~~C~~~~g~~~C~C~~G~~g~~~~~C~~  407 (535)
                      .+|....|  +|.|+++|+|..+++|.+
T Consensus        11 ~~C~~~~G--~C~C~~~~~G~~C~~C~~   36 (49)
T PF00053_consen   11 QTCDPSTG--QCVCKPGTTGPRCDQCKP   36 (49)
T ss_dssp             SSEEETCE--EESBSTTEESTTS-EE-T
T ss_pred             CcccCCCC--EEeccccccCCcCcCCCC
Confidence            35665444  999999999999776543


No 55 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=64.95  E-value=6.4  Score=33.50  Aligned_cols=30  Identities=23%  Similarity=0.536  Sum_probs=24.3

Q ss_pred             CCCCCCCCeeeeCC--CCceeeCCCCCcCCCCC
Q psy9424         456 PAPCGPNAQCSVAN--HRPLCSCPAGLMGLPSA  486 (535)
Q Consensus       456 ~~~C~~~~~C~~~~--g~~~C~C~~G~~G~~c~  486 (535)
                      .+-|-++ +|....  ..+.|.|..||+|..|+
T Consensus        50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCE   81 (139)
T PHA03099         50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQ   81 (139)
T ss_pred             CCEeECC-EEEeeccCCCceeECCCCccccccc
Confidence            4567764 897653  68999999999999997


No 56 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=64.91  E-value=8.5  Score=27.11  Aligned_cols=21  Identities=24%  Similarity=0.566  Sum_probs=16.6

Q ss_pred             eeeeCCCCceeeCCCCCcCCCCC
Q psy9424         464 QCSVANHRPLCSCPAGLMGLPSA  486 (535)
Q Consensus       464 ~C~~~~g~~~C~C~~G~~G~~c~  486 (535)
                      .|....|  +|.|+++|.|..|+
T Consensus        13 ~C~~~~G--~C~C~~~~~G~~C~   33 (50)
T cd00055          13 QCDPGTG--QCECKPNTTGRRCD   33 (50)
T ss_pred             cccCCCC--EEeCCCcCCCCCCC
Confidence            3655555  89999999999985


No 57 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=63.87  E-value=7.8  Score=32.41  Aligned_cols=32  Identities=34%  Similarity=0.864  Sum_probs=24.2

Q ss_pred             CCCCCCCCCCCCceeeecCCCeeeeCCCCCcCC
Q psy9424         368 NPCERQGACGVNSLCNVLTHRKVCFCPRGFTGD  400 (535)
Q Consensus       368 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~G~~g~  400 (535)
                      +.|...+.|++++.|.. .....|.|++||...
T Consensus        78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             cCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence            35555789999999954 346689999999753


No 58 
>KOG1218|consensus
Probab=63.40  E-value=1.6e+02  Score=29.18  Aligned_cols=40  Identities=30%  Similarity=0.650  Sum_probs=25.8

Q ss_pred             eeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCEEecCCCCCc
Q psy9424         302 KCLCPLGLVGDPYGAGCVSASQCTRDDQCPPGAHCVKTDGVPK  344 (535)
Q Consensus       302 ~C~C~~Gy~g~~c~~~C~~~~~C~~~~~C~~~~~C~~~~g~~~  344 (535)
                      .|.|.+||.+..+...+..   |.....+.+++.|....+...
T Consensus       163 ~c~c~~g~~g~~~~~~~~~---c~~~~~~~~g~~C~~~~~~~~  202 (316)
T KOG1218|consen  163 ICTCQPGFVGVFCVESCSG---CSPLTACENGAKCNRSTGSCL  202 (316)
T ss_pred             ceeccCCcccccccccCCC---cCCCcccCCCCeeeccccccc
Confidence            6889999999887743221   544456666667776555433


No 59 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=62.08  E-value=5.7  Score=27.83  Aligned_cols=22  Identities=23%  Similarity=0.579  Sum_probs=18.0

Q ss_pred             CeeeeCCCCceeeCCCCCcCCCCC
Q psy9424         463 AQCSVANHRPLCSCPAGLMGLPSA  486 (535)
Q Consensus       463 ~~C~~~~g~~~C~C~~G~~G~~c~  486 (535)
                      .+|....+  +|+|+++|+|..|+
T Consensus        11 ~~C~~~~G--~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   11 QTCDPSTG--QCVCKPGTTGPRCD   32 (49)
T ss_dssp             SSEEETCE--EESBSTTEESTTS-
T ss_pred             CcccCCCC--EEeccccccCCcCc
Confidence            47877666  99999999999995


No 60 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=55.24  E-value=2.2  Score=31.88  Aligned_cols=48  Identities=13%  Similarity=0.203  Sum_probs=21.0

Q ss_pred             cCCCccCCCCCCCcccCcCCcCCCCCCCCCCCCCCeeeecCCCceeeCCCCCcCCC
Q psy9424          11 ISQHLQQVPGLASAACVDGRCRNPCEADEVCGRNAECAVVNHTPRCTCVAGTVGDP   66 (535)
Q Consensus        11 ~~~~c~c~~g~~g~~C~~~~~~d~C~~~~~C~~~g~C~~~~~~~~C~C~~Gf~G~~   66 (535)
                      +.-...|.+.|.|..|..     .|.....=..+-+|...   +.=+|.+||+|..
T Consensus        15 ~~~rv~C~~nyyG~~C~~-----~C~~~~d~~ghy~Cd~~---G~~~C~~Gw~G~~   62 (63)
T PF01414_consen   15 YRIRVVCDENYYGPNCSK-----FCKPRDDSFGHYTCDSN---GNKVCLPGWTGPN   62 (63)
T ss_dssp             --------TTEETTTT-E-----E---EEETTEEEEE-SS-----EEE-TTEESTT
T ss_pred             EEEEEECCCCCCCccccC-----CcCCCcCCcCCcccCCC---CCCCCCCCCcCCC
Confidence            344578899999999986     55422111223355532   3558999999984


No 61 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=52.60  E-value=15  Score=25.47  Aligned_cols=16  Identities=31%  Similarity=0.696  Sum_probs=13.6

Q ss_pred             eeeCCCCCcCCCCCCC
Q psy9424         390 VCFCPRGFTGDPETEC  405 (535)
Q Consensus       390 ~C~C~~G~~g~~~~~C  405 (535)
                      +|.|+++|+|..++.|
T Consensus        19 ~C~C~~~~~G~~C~~C   34 (46)
T smart00180       19 QCECKPNVTGRRCDRC   34 (46)
T ss_pred             EEECCCCCCCCCCCcC
Confidence            8999999999886644


No 62 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=47.92  E-value=23  Score=24.96  Aligned_cols=24  Identities=25%  Similarity=0.714  Sum_probs=17.8

Q ss_pred             CCCCCCCCeeeeCCCCceeeCCCCCcCC
Q psy9424         456 PAPCGPNAQCSVANHRPLCSCPAGLMGL  483 (535)
Q Consensus       456 ~~~C~~~~~C~~~~g~~~C~C~~G~~G~  483 (535)
                      ...|..++.|++.    +|+|++||+-.
T Consensus        25 ~~qC~~~s~C~~g----~C~C~~g~~~~   48 (52)
T PF01683_consen   25 DEQCIGGSVCVNG----RCQCPPGYVEV   48 (52)
T ss_pred             cCCCCCcCEEcCC----EeECCCCCEec
Confidence            4456678889653    99999998643


No 63 
>KOG3516|consensus
Probab=46.52  E-value=15  Score=42.77  Aligned_cols=36  Identities=31%  Similarity=0.707  Sum_probs=33.3

Q ss_pred             cCCCCCCCCCCCCeeeeCCCCceeeCC-CCCcCCCCC
Q psy9424         451 VDPCSPAPCGPNAQCSVANHRPLCSCP-AGLMGLPSA  486 (535)
Q Consensus       451 ~d~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~G~~c~  486 (535)
                      +|.|.+++|.++|.|.-....|.|.|. .||.|..|.
T Consensus       545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH  581 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCH  581 (1306)
T ss_pred             ccccCCccccCCCcccccccceeEecccccccccccc
Confidence            688889999999999988889999998 899999986


No 64 
>KOG3516|consensus
Probab=46.03  E-value=15  Score=42.64  Aligned_cols=36  Identities=19%  Similarity=0.480  Sum_probs=31.9

Q ss_pred             CCCCCCCCCCCCCCeeeecCCCceeeCC-CCCcCCCCC
Q psy9424          32 RNPCEADEVCGRNAECAVVNHTPRCTCV-AGTVGDPKY   68 (535)
Q Consensus        32 ~d~C~~~~~C~~~g~C~~~~~~~~C~C~-~Gf~G~~c~   68 (535)
                      +|.|. +++|.++|.|.-....|.|.|. .||+|..|.
T Consensus       545 ~drCl-PN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH  581 (1306)
T KOG3516|consen  545 SDRCL-PNPCEHGGKCSQSWDDFECNCELTGYKGATCH  581 (1306)
T ss_pred             ccccC-CccccCCCcccccccceeEecccccccccccc
Confidence            67777 8999999999998889999998 999999654


No 65 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=44.94  E-value=24  Score=24.88  Aligned_cols=24  Identities=38%  Similarity=0.682  Sum_probs=17.6

Q ss_pred             CCCCCCCCeeeecCCCceeeCCCCCcCC
Q psy9424          38 DEVCGRNAECAVVNHTPRCTCVAGTVGD   65 (535)
Q Consensus        38 ~~~C~~~g~C~~~~~~~~C~C~~Gf~G~   65 (535)
                      ...|..++.|++.    +|+|++||+-.
T Consensus        25 ~~qC~~~s~C~~g----~C~C~~g~~~~   48 (52)
T PF01683_consen   25 DEQCIGGSVCVNG----RCQCPPGYVEV   48 (52)
T ss_pred             cCCCCCcCEEcCC----EeECCCCCEec
Confidence            3456677888653    89999999744


No 66 
>KOG3512|consensus
Probab=41.34  E-value=78  Score=33.31  Aligned_cols=28  Identities=21%  Similarity=0.383  Sum_probs=20.4

Q ss_pred             ceeeecCCC-eeeeCCCCCcCCCCCCCCc
Q psy9424         380 SLCNVLTHR-KVCFCPRGFTGDPETECVR  407 (535)
Q Consensus       380 ~~C~~~~g~-~~C~C~~G~~g~~~~~C~~  407 (535)
                      +.|+-...+ ++|.|..+-+|..|..|.+
T Consensus       285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKp  313 (592)
T KOG3512|consen  285 SRCVMDESSHLTCDCEHNTAGPDCGRCKP  313 (592)
T ss_pred             ceeeeccCCceEEecccCCCCCCcccccc
Confidence            457665544 8999999999988755543


No 67 
>KOG3514|consensus
Probab=39.74  E-value=20  Score=41.27  Aligned_cols=34  Identities=29%  Similarity=0.766  Sum_probs=31.3

Q ss_pred             CCCCCCCCCCCeeeeCCCCceeeCC-CCCcCCCCC
Q psy9424         453 PCSPAPCGPNAQCSVANHRPLCSCP-AGLMGLPSA  486 (535)
Q Consensus       453 ~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~G~~c~  486 (535)
                      .|.++||.|+|.|.....+|.|.|. .||.|..|+
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce  659 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE  659 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence            6889999999999999999999997 599999887


No 68 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=33.88  E-value=47  Score=21.40  Aligned_cols=13  Identities=46%  Similarity=1.186  Sum_probs=11.0

Q ss_pred             eeeeCCCCCcCCC
Q psy9424         389 KVCFCPRGFTGDP  401 (535)
Q Consensus       389 ~~C~C~~G~~g~~  401 (535)
                      +.|.|++||..+.
T Consensus        18 ~~C~CPeGyIlde   30 (34)
T PF09064_consen   18 GQCFCPEGYILDE   30 (34)
T ss_pred             CceeCCCceEecC
Confidence            3899999998775


No 69 
>KOG3514|consensus
Probab=32.80  E-value=30  Score=39.92  Aligned_cols=36  Identities=22%  Similarity=0.591  Sum_probs=32.8

Q ss_pred             CCCCCCCCCCCCeeccCCCCceecCC-CCCCCCCCCC
Q psy9424         144 PACEGILCGRNALCTASDHHATCSCK-PGYVGHPGPS  179 (535)
Q Consensus       144 ~~C~~~~C~~~g~C~~~~~~~~C~C~-~Gf~g~~c~~  179 (535)
                      ..|.++||.++|+|....+.|.|.|. .||.|..|+.
T Consensus       624 ~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  624 KICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             cccCCCcccCCCCccccccccccccccCcccCccccc
Confidence            47888999999999999999999995 8999999985


No 70 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=29.71  E-value=20  Score=29.56  Aligned_cols=32  Identities=22%  Similarity=0.563  Sum_probs=24.0

Q ss_pred             CCCCC-CCCCCCCCeeeecC-----CCceeeCCCCCcC
Q psy9424          33 NPCEA-DEVCGRNAECAVVN-----HTPRCTCVAGTVG   64 (535)
Q Consensus        33 d~C~~-~~~C~~~g~C~~~~-----~~~~C~C~~Gf~G   64 (535)
                      +.|.. .+.|+.||.|+...     .=|.|.|.+.+..
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~   43 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK   43 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence            35554 67899999999872     4588999996653


No 71 
>KOG3512|consensus
Probab=26.41  E-value=2.8e+02  Score=29.46  Aligned_cols=27  Identities=19%  Similarity=0.421  Sum_probs=20.7

Q ss_pred             eeccCCCC-ceecCCCCCCCCCCCCCCC
Q psy9424         156 LCTASDHH-ATCSCKPGYVGHPGPSMGT  182 (535)
Q Consensus       156 ~C~~~~~~-~~C~C~~Gf~g~~c~~~~~  182 (535)
                      +|+....+ .+|.|+..-.|+.|+.+.+
T Consensus       286 ~Cv~d~~~~ltCdC~HNTaGPdCgrCKp  313 (592)
T KOG3512|consen  286 RCVMDESSHLTCDCEHNTAGPDCGRCKP  313 (592)
T ss_pred             eeeeccCCceEEecccCCCCCCcccccc
Confidence            57765544 9999999999999875433


No 72 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=20.85  E-value=40  Score=24.26  Aligned_cols=31  Identities=19%  Similarity=0.517  Sum_probs=17.7

Q ss_pred             CCCCCCCCeeee----CCCCceeeCCCCCcCCCCC
Q psy9424         456 PAPCGPNAQCSV----ANHRPLCSCPAGLMGLPSA  486 (535)
Q Consensus       456 ~~~C~~~~~C~~----~~g~~~C~C~~G~~G~~c~  486 (535)
                      ..+|+.||+-..    ..|...|.|..-|.|..|+
T Consensus        16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS   50 (56)
T PF04863_consen   16 AISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS   50 (56)
T ss_dssp             TS--TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred             cCCcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence            346777777642    3567899999999999986


Done!