Query         psy11059
Match_columns 429
No_of_seqs    331 out of 2310
Neff          8.9 
Searched_HMMs 46136
Date          Fri Aug 16 17:01:21 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11059.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11059hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.8 8.6E-18 1.9E-22  176.0  16.0  115  292-424  1717-1839(2531)
  2 KOG4289|consensus               99.7   2E-16 4.3E-21  166.0  16.2   90    9-117  1179-1297(2531)
  3 KOG1217|consensus               99.5 1.3E-12 2.7E-17  133.7  24.5  212  175-419   155-389 (487)
  4 KOG1217|consensus               99.5 3.2E-12   7E-17  130.7  23.3  298   15-414    93-421 (487)
  5 KOG1219|consensus               99.5   1E-13 2.2E-18  150.8   9.2  115  291-424  3864-3979(4289)
  6 KOG1219|consensus               99.5   1E-13 2.2E-18  150.8   8.5  114    9-187  3864-3978(4289)
  7 KOG1214|consensus               99.3 2.2E-10 4.8E-15  116.4  19.3  129  273-414   808-947 (1289)
  8 KOG0994|consensus               99.3 8.6E-11 1.9E-15  122.5  15.1   99  314-423  1033-1147(1758)
  9 KOG1214|consensus               99.1 2.9E-10 6.2E-15  115.5  11.0  143    8-184   691-860 (1289)
 10 KOG1225|consensus               99.0 3.4E-09 7.3E-14  106.3  10.9  126  249-420   234-365 (525)
 11 KOG0994|consensus               98.8 7.1E-08 1.5E-12  101.4  13.9   95   33-131   830-949 (1758)
 12 KOG1225|consensus               98.8 6.3E-08 1.4E-12   97.3  12.0  118    3-184   243-365 (525)
 13 KOG4260|consensus               98.6   1E-07 2.2E-12   85.7   6.6  131  277-417   131-304 (350)
 14 KOG1836|consensus               98.4 1.2E-05 2.5E-10   91.4  16.9  135  272-423   864-1022(1705)
 15 PF07645 EGF_CA:  Calcium-bindi  98.3 3.5E-07 7.7E-12   60.2   2.0   34    8-42      1-34  (42)
 16 KOG1226|consensus               98.2 1.1E-05 2.3E-10   83.1  10.6  144  233-424   467-622 (783)
 17 PF00008 EGF:  EGF-like domain   97.9 4.9E-06 1.1E-10   51.2   1.6   27   15-42      2-29  (32)
 18 KOG4260|consensus               97.9 1.5E-05 3.3E-10   72.0   4.9   96   55-158   138-262 (350)
 19 PF12947 EGF_3:  EGF domain;  I  97.8 6.3E-06 1.4E-10   51.9   0.9   32   12-44      1-32  (36)
 20 smart00179 EGF_CA Calcium-bind  97.8 3.2E-05   7E-10   49.7   4.2   32    8-42      1-33  (39)
 21 PF00008 EGF:  EGF-like domain   97.8 1.1E-05 2.4E-10   49.6   1.4   29  391-419     2-31  (32)
 22 smart00179 EGF_CA Calcium-bind  97.8   4E-05 8.6E-10   49.3   4.2   36  387-422     2-39  (39)
 23 PF07645 EGF_CA:  Calcium-bindi  97.7 1.8E-05 3.9E-10   52.0   1.7   33   61-97      1-34  (42)
 24 cd00054 EGF_CA Calcium-binding  97.6 8.3E-05 1.8E-09   47.3   4.1   32    8-42      1-33  (38)
 25 KOG1226|consensus               97.5  0.0011 2.4E-08   68.7  11.7   99   16-138   466-589 (783)
 26 cd00054 EGF_CA Calcium-binding  97.4 0.00025 5.4E-09   45.0   4.1   36  387-422     2-38  (38)
 27 KOG1836|consensus               97.4 0.00072 1.6E-08   77.3  10.1  112  251-385   697-813 (1705)
 28 cd00053 EGF Epidermal growth f  97.1 0.00074 1.6E-08   42.1   3.9   26   16-42      5-30  (36)
 29 smart00181 EGF Epidermal growt  97.0 0.00094   2E-08   41.7   3.8   28   11-42      1-29  (35)
 30 cd00053 EGF Epidermal growth f  96.8  0.0021 4.6E-08   39.9   4.1   30  392-421     5-35  (36)
 31 PF12662 cEGF:  Complement Clr-  96.8  0.0013 2.8E-08   37.1   2.6   23   32-63      1-23  (24)
 32 smart00181 EGF Epidermal growt  96.8  0.0021 4.5E-08   40.1   3.9   28  393-421     6-34  (35)
 33 PF07974 EGF_2:  EGF-like domai  96.5  0.0047   1E-07   37.7   3.6   23   17-42      6-28  (32)
 34 PF12947 EGF_3:  EGF domain;  I  96.4  0.0018 3.9E-08   40.8   1.5   27  393-419     6-32  (36)
 35 PF12662 cEGF:  Complement Clr-  96.0  0.0066 1.4E-07   34.3   2.3   10  408-417     2-11  (24)
 36 PF12661 hEGF:  Human growth fa  95.6  0.0047   1E-07   29.5   0.6   13  409-421     1-13  (13)
 37 PF07974 EGF_2:  EGF-like domai  95.6   0.017 3.6E-07   35.3   3.1   26  394-421     7-32  (32)
 38 KOG3512|consensus               95.4    0.15 3.2E-06   50.3  10.4  163  239-423   285-479 (592)
 39 PF06247 Plasmod_Pvs28:  Plasmo  95.1  0.0032 6.9E-08   54.4  -1.7  136  267-419    13-162 (197)
 40 PF14670 FXa_inhibition:  Coagu  94.9   0.012 2.6E-07   37.0   0.9   18   24-42     11-28  (36)
 41 KOG1218|consensus               94.4     4.4 9.4E-05   38.9  18.3   65  255-327   140-208 (316)
 42 PF06247 Plasmod_Pvs28:  Plasmo  93.4   0.024 5.2E-07   49.1   0.2  121   23-158    11-154 (197)
 43 smart00051 DSL delta serrate l  92.6    0.21 4.5E-06   35.8   4.1   45  276-331    19-63  (63)
 44 KOG3512|consensus               92.4    0.37 8.1E-06   47.6   6.7  109  311-423   288-429 (592)
 45 PF14670 FXa_inhibition:  Coagu  92.4   0.059 1.3E-06   33.9   0.8   22   72-97      7-28  (36)
 46 KOG1218|consensus               92.0     3.3 7.2E-05   39.7  13.1   42  247-288    13-63  (316)
 47 PF00053 Laminin_EGF:  Laminin   90.9    0.14 3.1E-06   34.5   1.6   29  315-345    15-43  (49)
 48 cd00055 EGF_Lam Laminin-type e  90.6    0.39 8.5E-06   32.6   3.5   28  315-344    16-43  (50)
 49 smart00051 DSL delta serrate l  90.1    0.44 9.5E-06   34.1   3.6   13  371-383    51-63  (63)
 50 smart00180 EGF_Lam Laminin-typ  89.2    0.47   1E-05   31.6   3.0   25  316-342    16-40  (46)
 51 PF12946 EGF_MSP1_1:  MSP1 EGF   89.1    0.22 4.7E-06   31.3   1.2   30   15-44      3-32  (37)
 52 cd00055 EGF_Lam Laminin-type e  88.2    0.71 1.5E-05   31.3   3.4   27  247-286    17-43  (50)
 53 PF00053 Laminin_EGF:  Laminin   85.7    0.34 7.4E-06   32.6   0.7   27  247-286    16-42  (49)
 54 PHA02887 EGF-like protein; Pro  84.7    0.85 1.8E-05   36.4   2.6   31  233-264    92-123 (126)
 55 cd01475 vWA_Matrilin VWA_Matri  84.1     1.1 2.4E-05   40.9   3.6   39   52-97    178-217 (224)
 56 PF12946 EGF_MSP1_1:  MSP1 EGF   83.8    0.47   1E-05   29.8   0.7   26   70-97      4-30  (37)
 57 PHA02887 EGF-like protein; Pro  82.8     1.2 2.7E-05   35.5   2.8   29  394-423    93-123 (126)
 58 PHA03099 epidermal growth fact  82.2     1.1 2.5E-05   36.3   2.4   29  394-423    52-82  (139)
 59 smart00180 EGF_Lam Laminin-typ  82.2     1.7 3.8E-05   28.8   3.0   24  248-284    17-40  (46)
 60 KOG3516|consensus               80.5     1.5 3.3E-05   48.4   3.4   42  386-427   544-586 (1306)
 61 PHA03099 epidermal growth fact  79.6     1.8 3.9E-05   35.2   2.8   31  234-265    52-83  (139)
 62 KOG3516|consensus               78.0     1.9 4.1E-05   47.7   3.2   47    7-68    543-590 (1306)
 63 cd01475 vWA_Matrilin VWA_Matri  75.9     3.4 7.4E-05   37.7   4.0   36  380-418   181-218 (224)
 64 KOG3514|consensus               74.5       2 4.3E-05   46.9   2.2   36   11-61    625-661 (1591)
 65 PF00954 S_locus_glycop:  S-loc  69.9     4.5 9.7E-05   32.4   2.9   29   66-97     79-107 (110)
 66 PF12955 DUF3844:  Domain of un  69.4     2.3   5E-05   33.5   1.0   51    9-60      5-61  (103)
 67 PF01414 DSL:  Delta serrate li  66.8     1.6 3.4E-05   31.3  -0.4   39  248-287    16-63  (63)
 68 PF00954 S_locus_glycop:  S-loc  63.0     8.2 0.00018   30.8   3.1   31    9-42     77-107 (110)
 69 KOG3514|consensus               60.1     6.5 0.00014   43.2   2.5   36  389-424   625-661 (1591)
 70 PF01683 EB:  EB module;  Inter  57.3      23  0.0005   23.8   4.2   27   10-42     20-46  (52)
 71 PF04863 EGF_alliinase:  Alliin  48.6     6.8 0.00015   26.9   0.3   35   16-62     16-53  (56)
 72 PF09064 Tme5_EGF_like:  Thromb  34.5      30 0.00065   21.3   1.5   13   32-44     17-29  (34)
 73 KOG3509|consensus               31.0      93   0.002   34.7   5.6   71  350-421   407-478 (964)
 74 KOG3509|consensus               21.2 1.8E+02   0.004   32.5   5.7   68    8-95    405-473 (964)
 75 KOG0196|consensus               20.5 1.3E+02  0.0029   32.8   4.3   57  248-328   258-318 (996)

No 1  
>KOG4289|consensus
Probab=99.76  E-value=8.6e-18  Score=175.96  Aligned_cols=115  Identities=30%  Similarity=0.693  Sum_probs=72.5

Q ss_pred             CCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCc---ccccCCCccCC-CCccCCCcCCCCCc-CCCCccccC
Q psy11059        292 TGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICE---KCFCRPGFAGD-HCDVDFDECLSNPC-FNGATCQNK  366 (429)
Q Consensus       292 ~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~---~C~C~~g~~g~-~C~~~i~~C~~~~C-~~~~~C~~~  366 (429)
                      +.|.- ++|.+.++|...   +...+|.|.|++||.|..|+   .=.|+.||+|. .|    ..|.-..- .....|..+
T Consensus      1717 ~vC~l-npc~~~g~Cv~s---p~a~GY~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~C----gpC~CavsKgfdp~CnKt 1788 (2531)
T KOG4289|consen 1717 DVCSL-NPCENQGTCVRS---PGAHGYTCECPPGYTGPYCELRADQPCPRGWWGFPTC----GPCNCAVSKGFDPDCNKT 1788 (2531)
T ss_pred             chhcc-cccccCceeecC---CCCCceeEECCCcccCcchhhhccCCCCCcccCCCCc----cCccccccCCCCCCcccc
Confidence            34544 889999999632   24578999999999999998   34689999985 22    11210000 123456666


Q ss_pred             CCceEEecCCCCCCCCccCCCCCCCCCCCCCCC---EEccCCCCeeeeCCCCCCCCCCCCC
Q psy11059        367 INGYTCVCAPGYSGKECSININECESSPCLHGA---TCIDEVATFSCVCPKGLTGRLCETN  424 (429)
Q Consensus       367 ~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~---~C~~~~~~~~C~C~~g~~G~~C~~~  424 (429)
                      .|  .|.|+..+.-.     +..|.+..|..++   +|.   ...+|.|++|-.|+.|+.-
T Consensus      1789 ~G--~CqCKe~hy~~-----~~~Cl~CdC~~Gs~Sr~C~---adGqC~C~pgaiGRqCdrC 1839 (2531)
T KOG4289|consen 1789 NG--QCQCKENHYRP-----IGSCLPCDCYFGSDSRECD---ADGQCPCKPGAIGRQCDRC 1839 (2531)
T ss_pred             Cc--ceeeccccccC-----CCcceeeccccCCCccccc---CCCcCCCCCcccccccccc
Confidence            55  78888765321     2223433344332   353   4558999999999988753


No 2  
>KOG4289|consensus
Probab=99.71  E-value=2e-16  Score=165.99  Aligned_cols=90  Identities=36%  Similarity=0.801  Sum_probs=79.3

Q ss_pred             CCCCCCCCCCCCCCCEecc---------------------CCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCC
Q psy11059          9 SSPCDAQRNPCQNGGKCNE---------------------DETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQIC   67 (429)
Q Consensus         9 ~~~C~~~~~~C~~~g~C~~---------------------~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C   67 (429)
                      .+-|  ...||.|..+|+.                     ...+.++|.|++||+            |.+||+.+|  +|
T Consensus      1179 DniC--lrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFT------------gd~CeTeiD--lC 1242 (2531)
T KOG4289|consen 1179 DNIC--LREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFT------------GDYCETEID--LC 1242 (2531)
T ss_pred             Cchh--hcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCC------------cccccchhH--hh
Confidence            5679  8999999999985                     234578999999999            999999999  99


Q ss_pred             CCCCCCCCCCeEeeCCCCCCeeeeCCCCCc--------ccCCCCCCCCCCCCeEecCC
Q psy11059         68 TTAPPCLNGATCRPQLTEQLYECVCPPGYK--------EIRDCTSNPCLNDGVCVWMF  117 (429)
Q Consensus        68 ~~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~--------~~~~C~~~~C~~~g~C~~~~  117 (429)
                      . +.+|.++|+|.  ...+.|+|.|.+||+        ....|.+..|.++|+|++..
T Consensus      1243 Y-s~pC~nng~C~--srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1243 Y-SGPCGNNGRCR--SREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred             h-cCCCCCCCceE--EecCceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence            9 99999999999  999999999999998        34568888899999998765


No 3  
>KOG1217|consensus
Probab=99.55  E-value=1.3e-12  Score=133.70  Aligned_cols=212  Identities=38%  Similarity=0.991  Sum_probs=159.4

Q ss_pred             eecCCcccCCcccCCCCCCC--CCCCCCCCEEeeCCCCe-eeccCCCC--CCCCCCCCCCCCCCCCCCCcEeccCCCCCe
Q psy11059        175 VCVDVYKGRYWELPEIRDCT--SNPCLNDGVCVDEVYKG-RYWELPEI--RDCTSNPCLNDCVNPCQNGGKCNEDETGNY  249 (429)
Q Consensus       175 ~C~~~~~G~~c~~~~~~~C~--~~~C~~~~~C~~~~~~~-C~C~~~g~--~~C~~~~C~~~~c~~C~~~g~C~~~~~~~~  249 (429)
                      .|..+|.+..++. ..++|.  ..+|.++++|.+..++| |.| +++.  ..+...          .++++| ...   +
T Consensus       155 ~C~~g~~~~~~~~-~~~~C~~~~~~c~~~~~C~~~~~~~~C~c-~~~~~~~~~~~~----------~~~~~c-~~~---~  218 (487)
T KOG1217|consen  155 SCTEGYEGEPCET-DLDECIQYSSPCQNGGTCVNTGGSYLCSC-PPGYTGSTCETT----------GNGGTC-VDS---V  218 (487)
T ss_pred             eeCCCcccccccc-cccccccCCCCcCCCcccccCCCCeeEeC-CCCccCCcCcCC----------CCCceE-ecc---e
Confidence            3999999999886 557898  35699999999999999 999 8772  222211          234567 222   6


Q ss_pred             eEeCCCCCcCCCCCC---------CccccCCCCeeeeCCCCCCCCC--CccCCCCCCCCCCCCCCCeeCCccccCCCCCe
Q psy11059        250 DCTCDALHTGDPCKH---------GSCVDKRAGYFCDCPPTYGGKN--CSVELTGCVGPDTCLNGGTCKPYLVDETQHRF  318 (429)
Q Consensus       250 ~C~C~~g~~G~~C~~---------~~C~~~~~~~~C~C~~G~~g~~--c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~  318 (429)
                      .|.+..++.+..|+.         +.|++..++|+|.|++||.+..  ...+++.|....+|.++++|    .+ ..+.|
T Consensus       219 ~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C----~~-~~~~~  293 (487)
T KOG1217|consen  219 ACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTC----VN-VPGSY  293 (487)
T ss_pred             eccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCee----ec-CCCcc
Confidence            789999999887765         5688888899999999999886  23456788873348888999    44 55559


Q ss_pred             eeeCCCCccCCCCcccccCCCccCCCCccCCCcC----CCCCcCCCCcc--ccCCCceEEecCCCCCCCCccCCCCCCCC
Q psy11059        319 NCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDEC----LSNPCFNGATC--QNKINGYTCVCAPGYSGKECSININECES  392 (429)
Q Consensus       319 ~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C----~~~~C~~~~~C--~~~~g~~~C~C~~G~~G~~C~~~~~~C~~  392 (429)
                      .|.|++||.|..+.           .+ .+..+|    ...+|.++++|  ....+.+.|.|..+|.|..|+...++|..
T Consensus       294 ~C~C~~g~~g~~~~-----------~~-~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~  361 (487)
T KOG1217|consen  294 RCTCPPGFTGRLCT-----------EC-VDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECAS  361 (487)
T ss_pred             eeeCCCCCCCCCCc-----------cc-cccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccC
Confidence            99999999988541           11 133445    33457777788  34445688999999999999744458998


Q ss_pred             CCCCCCCEEcc-CCCCeeeeCCCCCCCC
Q psy11059        393 SPCLHGATCID-EVATFSCVCPKGLTGR  419 (429)
Q Consensus       393 ~~C~~~~~C~~-~~~~~~C~C~~g~~G~  419 (429)
                      .++..++.|++ ..++|+|.++.+|.+.
T Consensus       362 ~~~~~~~~c~~~~~~~~~c~~~~~~~~~  389 (487)
T KOG1217|consen  362 SPCCPGGTCVNETPGSYRCACPAGFAGK  389 (487)
T ss_pred             CccccCCEeccCCCCCeEecCCCccccC
Confidence            88999999999 6889999999998874


No 4  
>KOG1217|consensus
Probab=99.50  E-value=3.2e-12  Score=130.68  Aligned_cols=298  Identities=34%  Similarity=0.776  Sum_probs=194.0

Q ss_pred             CCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCCCCC--CCCCCeEeeCCC---CCCee
Q psy11059         15 QRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTTAPP--CLNGATCRPQLT---EQLYE   89 (429)
Q Consensus        15 ~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~~~~--C~~~g~C~~~~~---~~~~~   89 (429)
                      ...+....+.++ ....++.|.|++||.            |..|+...   +|. ..+  +...+.|.  ..   ...+.
T Consensus        93 ~~~~~~~~~~~~-~~~~~~~c~c~~g~~------------~~~~~~~~---~C~-~~~~~~~~~~~c~--~~~~~~~~~~  153 (487)
T KOG1217|consen   93 RSPCLLLCGECV-DCVGSYECTCPPGYQ------------GTPCEGEC---ECV-TGPGVCCIDGSCS--NGPGSVGPFR  153 (487)
T ss_pred             cCCcccCCcccc-CCCCCceeeCCCccc------------cCcCCcce---eec-CCCCCeeCchhhc--CCCCCCCcee
Confidence            344445566777 788899999999999            88776432   265 332  35557777  53   46899


Q ss_pred             eeCCCCCc------ccCCCC--CCCCCCCCeEecCCC---ce-eCCeeccCCCCCCCCCCCCCCCCCCCeEeeCCCCccc
Q psy11059         90 CVCPPGYK------EIRDCT--SNPCLNDGVCVWMFD---VT-IQVYKGRYCELPEIGDCSSNPCLNDGVCVDVYKGRYC  157 (429)
Q Consensus        90 C~C~~Gy~------~~~~C~--~~~C~~~g~C~~~~~---C~-~~g~~G~~C~~~~i~~C~~~~C~~~g~C~~~~~g~~C  157 (429)
                      |.|..||.      ..++|.  ..+|.+++.|.+..+   |. +++|.|..|+. .         .+++.|++.   +.|
T Consensus       154 c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~-~---------~~~~~c~~~---~~~  220 (487)
T KOG1217|consen  154 CSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET-T---------GNGGTCVDS---VAC  220 (487)
T ss_pred             eeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC-C---------CCCceEecc---eec
Confidence            99999998      226887  556999999998876   89 99999999985 2         445566654   333


Q ss_pred             cCCCCCCCCCCCCCCCceecCCcccCCcccCCCCCCCCCCCCCCCEEeeCCCCe-eeccCCCC--CCC----CCCCCCCC
Q psy11059        158 ELPEIGDCSSNPCLNDGVCVDVYKGRYWELPEIRDCTSNPCLNDGVCVDEVYKG-RYWELPEI--RDC----TSNPCLND  230 (429)
Q Consensus       158 ~~~~~~~C~~~~C~~~~~C~~~~~G~~c~~~~~~~C~~~~C~~~~~C~~~~~~~-C~C~~~g~--~~C----~~~~C~~~  230 (429)
                      .                 +..+|.+..++. .+.++...   + ++|++..++| |.+ ++|.  ..+    ....|...
T Consensus       221 ~-----------------~~~g~~~~~c~~-~~~~~~~~---~-~~c~~~~~~~~C~~-~~g~~~~~~~~~~~~~~C~~~  277 (487)
T KOG1217|consen  221 S-----------------CPPGARGPECEV-SIVECASG---D-GTCVNTVGSYTCRC-PEGYTGDACVTCVDVDSCALI  277 (487)
T ss_pred             c-----------------CCCCCCCCCccc-ccccccCC---C-CcccccCCceeeeC-CCCccccccceeeeccccCCC
Confidence            3                 667788888887 77777655   4 8899988888 998 7772  221    11222211


Q ss_pred             CCCCCCCCcEeccCCCCCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCcc
Q psy11059        231 CVNPCQNGGKCNEDETGNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYL  310 (429)
Q Consensus       231 ~c~~C~~~g~C~~~~~~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~  310 (429)
                      .  +|.++++| .+..+.|.|.|++||+|..|  ..+...     ..|.+.+.+             ..|.+++.|.   
T Consensus       278 ~--~c~~~~~C-~~~~~~~~C~C~~g~~g~~~--~~~~~~-----~~C~~~~~~-------------~~c~~g~~C~---  331 (487)
T KOG1217|consen  278 A--SCPNGGTC-VNVPGSYRCTCPPGFTGRLC--TECVDV-----DECSPRNAG-------------GPCANGGTCN---  331 (487)
T ss_pred             C--ccCCCCee-ecCCCcceeeCCCCCCCCCC--cccccc-----ccccccccC-------------CcCCCCcccc---
Confidence            1  27777788 55555577777777777766  111110     122222111             3355555551   


Q ss_pred             ccCCCCCeeeeCCCCccCCCCcccccCCCccCCCCccCCCcCCCCCcCCCCcccc-CCCceEEecCCCCCCC------Cc
Q psy11059        311 VDETQHRFNCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQN-KINGYTCVCAPGYSGK------EC  383 (429)
Q Consensus       311 ~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~-~~g~~~C~C~~G~~G~------~C  383 (429)
                      .......+.|.|..+|.              |..|+...++|...++..++.|++ ..++|.|.++.+|.+.      .+
T Consensus       332 ~~~~~~~~~C~c~~~~~--------------g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~  397 (487)
T KOG1217|consen  332 TLGSFGGFRCACGPGFT--------------GRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGC  397 (487)
T ss_pred             cCCCCCCCCcCCCCCCC--------------CCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccc
Confidence            11133455666666655              445542225888888999999999 6899999999999874      12


Q ss_pred             cCCCCCCCCCCCCCCCEEccCCCCeeeeCCC
Q psy11059        384 SININECESSPCLHGATCIDEVATFSCVCPK  414 (429)
Q Consensus       384 ~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~  414 (429)
                       .++++|..     .+.|++..+++.|. .+
T Consensus       398 -~~~~~c~~-----~~~c~~~~~~~~c~-~~  421 (487)
T KOG1217|consen  398 -EDIDECSG-----CGDCVNGPGGGACT-PP  421 (487)
T ss_pred             -cccccccC-----CcceeccCCCCccc-cC
Confidence             24444443     55687788888888 66


No 5  
>KOG1219|consensus
Probab=99.47  E-value=1e-13  Score=150.82  Aligned_cols=115  Identities=40%  Similarity=1.055  Sum_probs=103.4

Q ss_pred             CCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCcccccCCCccCCCCccCCCcCCCCCcCCCCccccCCCce
Q psy11059        291 LTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQNKINGY  370 (429)
Q Consensus       291 ~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~~~g~~  370 (429)
                      .+.|.+ +||+++|+|    .....++|.|.|++-|.|.              +|+.++..|.++||..+++|+...++|
T Consensus      3864 ~d~C~~-npCqhgG~C----~~~~~ggy~CkCpsqysG~--------------~CEi~~epC~snPC~~GgtCip~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPCND-NPCQHGGTC----ISQPKGGYKCKCPSQYSGN--------------HCEIDLEPCASNPCLTGGTCIPFYNGF 3924 (4289)
T ss_pred             cccccc-CcccCCCEe----cCCCCCceEEeCcccccCc--------------ccccccccccCCCCCCCCEEEecCCCe
Confidence            367888 999999999    4446678999988877655              555789999999999999999999999


Q ss_pred             EEecCCCCCCCCccCC-CCCCCCCCCCCCCEEccCCCCeeeeCCCCCCCCCCCCC
Q psy11059        371 TCVCAPGYSGKECSIN-INECESSPCLHGATCIDEVATFSCVCPKGLTGRLCETN  424 (429)
Q Consensus       371 ~C~C~~G~~G~~C~~~-~~~C~~~~C~~~~~C~~~~~~~~C~C~~g~~G~~C~~~  424 (429)
                      .|.|+.||+|.+|+.+ +++|..++|.++|.|++..|+|.|.|.+||.|..|...
T Consensus      3925 ~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~ 3979 (4289)
T KOG1219|consen 3925 LCNCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCCAE 3979 (4289)
T ss_pred             eEeCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhcccCccc
Confidence            9999999999999988 99999999999999999999999999999999998643


No 6  
>KOG1219|consensus
Probab=99.46  E-value=1e-13  Score=150.83  Aligned_cols=114  Identities=37%  Similarity=0.885  Sum_probs=103.3

Q ss_pred             CCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCCCCCCCCCCeEeeCCCCCCe
Q psy11059          9 SSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTTAPPCLNGATCRPQLTEQLY   88 (429)
Q Consensus         9 ~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~~~~C~~~g~C~~~~~~~~~   88 (429)
                      .|+|  ..+||+++|+|+....|.|+|.|++-|+            |..||+++.  +|. ++||..+|+|+  ...+.|
T Consensus      3864 ~d~C--~~npCqhgG~C~~~~~ggy~CkCpsqys------------G~~CEi~~e--pC~-snPC~~GgtCi--p~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPC--NDNPCQHGGTCISQPKGGYKCKCPSQYS------------GNHCEIDLE--PCA-SNPCLTGGTCI--PFYNGF 3924 (4289)
T ss_pred             cccc--ccCcccCCCEecCCCCCceEEeCccccc------------Ccccccccc--ccc-CCCCCCCCEEE--ecCCCe
Confidence            4889  7999999999995556889999999999            999999999  999 99999999999  888999


Q ss_pred             eeeCCCCCcccCCCCCCCCCCCCeEecCCCceeCCeeccCCCCCC-CCCCCCCCCCCCCeEeeCCCCccccCCCCCCCCC
Q psy11059         89 ECVCPPGYKEIRDCTSNPCLNDGVCVWMFDVTIQVYKGRYCELPE-IGDCSSNPCLNDGVCVDVYKGRYCELPEIGDCSS  167 (429)
Q Consensus        89 ~C~C~~Gy~~~~~C~~~~C~~~g~C~~~~~C~~~g~~G~~C~~~~-i~~C~~~~C~~~g~C~~~~~g~~C~~~~~~~C~~  167 (429)
                      .|.|+.|                            |+|.+||. + |++|..++|.++|.|++..+.+.|.         
T Consensus      3925 ~CnC~~g----------------------------yTG~~Ce~-~Gi~eCs~n~C~~gg~C~n~~gsf~Cn--------- 3966 (4289)
T KOG1219|consen 3925 LCNCPNG----------------------------YTGKRCEA-RGISECSKNVCGTGGQCINIPGSFHCN--------- 3966 (4289)
T ss_pred             eEeCCCC----------------------------ccCceeec-ccccccccccccCCceeeccCCceEec---------
Confidence            9999877                            77888988 5 9999999999999999999999998         


Q ss_pred             CCCCCCceecCCcccCCccc
Q psy11059        168 NPCLNDGVCVDVYKGRYWEL  187 (429)
Q Consensus       168 ~~C~~~~~C~~~~~G~~c~~  187 (429)
                              |..+|.|..|..
T Consensus      3967 --------cT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3967 --------CTPGILGRTCCA 3978 (4289)
T ss_pred             --------cChhHhcccCcc
Confidence                    888898888754


No 7  
>KOG1214|consensus
Probab=99.29  E-value=2.2e-10  Score=116.35  Aligned_cols=129  Identities=22%  Similarity=0.462  Sum_probs=71.4

Q ss_pred             CeeeeCCCCCCCC--CCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCcccccCCCcc-CCCCccCC
Q psy11059        273 GYFCDCPPTYGGK--NCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPGFA-GDHCDVDF  349 (429)
Q Consensus       273 ~~~C~C~~G~~g~--~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~-g~~C~~~i  349 (429)
                      .|.|.|.|||.|+  .|. +.++|.+ +.|...++|    ++ +.+++.|+|.+||.|+.-   +|.|+-. -..|+...
T Consensus       808 ~y~C~CLPGfsGDG~~c~-dvDeC~p-srChp~A~C----yn-tpgsfsC~C~pGy~GDGf---~CVP~~~~~T~C~~er  877 (1289)
T KOG1214|consen  808 TYSCACLPGFSGDGHQCT-DVDECSP-SRCHPAATC----YN-TPGSFSCRCQPGYYGDGF---QCVPDTSSLTPCEQER  877 (1289)
T ss_pred             eEEEeecCCccCCccccc-cccccCc-cccCCCceE----ec-CCCcceeecccCccCCCc---eecCCCccCCcccccc
Confidence            4666666666554  333 5588887 899999999    66 889999999999998721   2333311 11232110


Q ss_pred             CcCCCCCcCCCCccc--cCCCceEEecCCCCCC---CCccCCCCCCCCCCCCCCCEEccC---CCCeeeeCCC
Q psy11059        350 DECLSNPCFNGATCQ--NKINGYTCVCAPGYSG---KECSININECESSPCLHGATCIDE---VATFSCVCPK  414 (429)
Q Consensus       350 ~~C~~~~C~~~~~C~--~~~g~~~C~C~~G~~G---~~C~~~~~~C~~~~C~~~~~C~~~---~~~~~C~C~~  414 (429)
                      -  -+..|...+.+.  ..+.+|.+.+.++-.|   ..|. .+.+=---.|..++.+..+   ..+++|+|..
T Consensus       878 ~--hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~-~~~~~~vp~Cd~hgh~ap~qchG~~~~CwCvd  947 (1289)
T KOG1214|consen  878 F--HPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCG-PSPEQYVPQCDDHGHFAPLQCHGKSDFCWCVD  947 (1289)
T ss_pred             c--cceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCC-CcccccCCCccccccccccccCCCcceeEEec
Confidence            0  011255444332  1345678887766555   3453 1111011235555555433   2247788865


No 8  
>KOG0994|consensus
Probab=99.26  E-value=8.6e-11  Score=122.53  Aligned_cols=99  Identities=30%  Similarity=0.804  Sum_probs=62.2

Q ss_pred             CCCCeeeeCCCCccCCCCcccccCCCcc----CCCCccCCCcCCCCCcCCCCccccCCCceEEecCCCCCCCCccC----
Q psy11059        314 TQHRFNCTCPSGYHGKICEKCFCRPGFA----GDHCDVDFDECLSNPCFNGATCQNKINGYTCVCAPGYSGKECSI----  385 (429)
Q Consensus       314 ~~~~~~C~C~~G~~G~~C~~C~C~~g~~----g~~C~~~i~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~----  385 (429)
                      ...+++|.|.+...|.+|+  .|.+.++    |..|+    .|.-+| ..+.+|....|  .|.|++||.|+.|..    
T Consensus      1033 Dr~tGQCpClpNv~G~~CD--qCA~N~w~laSG~GCe----~C~Cd~-~~~pqCN~ftG--QCqCkpGfGGR~C~qCqel 1103 (1758)
T KOG0994|consen 1033 DRFTGQCPCLPNVQGVRCD--QCAENHWNLASGEGCE----PCNCDP-IGGPQCNEFTG--QCQCKPGFGGRTCSQCQEL 1103 (1758)
T ss_pred             ccccCcCCCCccccccccc--ccccchhccccCCCCC----ccCCCc-cCCcccccccc--ceeccCCCCCcchhHHHHh
Confidence            4456788888888888888  5566654    55564    233222 23346766665  899999999998862    


Q ss_pred             ---CCC-CCCCCCCCCCC----EEccCCCCeeeeCCCCCCCCCCCC
Q psy11059        386 ---NIN-ECESSPCLHGA----TCIDEVATFSCVCPKGLTGRLCET  423 (429)
Q Consensus       386 ---~~~-~C~~~~C~~~~----~C~~~~~~~~C~C~~g~~G~~C~~  423 (429)
                         +.+ .|..-.|...|    .|..  .+.+|+|.+|..|.+|++
T Consensus      1104 ~WGdP~~~C~aCdCd~rG~~tpQCdr--~tG~C~C~~Gv~G~rCdq 1147 (1758)
T KOG0994|consen 1104 YWGDPNEKCRACDCDPRGIETPQCDR--ATGRCVCRPGVGGPRCDQ 1147 (1758)
T ss_pred             hcCCCCCCceecCCCCCCCCCCCccc--cCCceeecCCCCCcchhh
Confidence               111 23333344433    2322  245789999999988863


No 9  
>KOG1214|consensus
Probab=99.13  E-value=2.9e-10  Score=115.54  Aligned_cols=143  Identities=22%  Similarity=0.544  Sum_probs=117.7

Q ss_pred             CCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCC-CCCCCCCCCeEeeCCCCC
Q psy11059          8 LSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICT-TAPPCLNGATCRPQLTEQ   86 (429)
Q Consensus         8 ~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~-~~~~C~~~g~C~~~~~~~   86 (429)
                      .+++|-.+++-|..++.|.....-.|+|.|..||.|+          |.+|. +++  +|+ ..+.|..+++|+  +.++
T Consensus       691 ~~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gd----------gr~c~-d~~--eca~~~~~CGp~s~Ci--n~pg  755 (1289)
T KOG1214|consen  691 PVNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGD----------GRNCV-DEN--ECATGFHRCGPNSVCI--NLPG  755 (1289)
T ss_pred             ccccceecCcccCCCccccCCCCcceEEEEeeccCCC----------CCCCC-Chh--hhccCCCCCCCCceee--cCCC
Confidence            4678877888899999999444557999999999988          99995 777  898 577899999999  9999


Q ss_pred             CeeeeCCCCCc---------------ccCCCCC--CCCCCCC--eEecCCC----ce-eCCeec--cCCCCCCCCCCCCC
Q psy11059         87 LYECVCPPGYK---------------EIRDCTS--NPCLNDG--VCVWMFD----VT-IQVYKG--RYCELPEIGDCSSN  140 (429)
Q Consensus        87 ~~~C~C~~Gy~---------------~~~~C~~--~~C~~~g--~C~~~~~----C~-~~g~~G--~~C~~~~i~~C~~~  140 (429)
                      +|+|.|..||.               .++.|..  ..|.-.|  .|+...+    |. .+||.|  ..|.  ++|+|..+
T Consensus       756 ~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~--dvDeC~ps  833 (1289)
T KOG1214|consen  756 SYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT--DVDECSPS  833 (1289)
T ss_pred             ceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccc--cccccCcc
Confidence            99999999997               3445552  3454443  4555544    99 999986  5677  88999999


Q ss_pred             CCCCCCeEeeCCCCccccCCCCCCCCCCCCCCCceecCCcccCC
Q psy11059        141 PCLNDGVCVDVYKGRYCELPEIGDCSSNPCLNDGVCVDVYKGRY  184 (429)
Q Consensus       141 ~C~~~g~C~~~~~g~~C~~~~~~~C~~~~C~~~~~C~~~~~G~~  184 (429)
                      .|...++|++..+.+.|+                 |++||.|+.
T Consensus       834 rChp~A~CyntpgsfsC~-----------------C~pGy~GDG  860 (1289)
T KOG1214|consen  834 RCHPAATCYNTPGSFSCR-----------------CQPGYYGDG  860 (1289)
T ss_pred             ccCCCceEecCCCcceee-----------------cccCccCCC
Confidence            999999999999999998                 999999864


No 10 
>KOG1225|consensus
Probab=98.96  E-value=3.4e-09  Score=106.35  Aligned_cols=126  Identities=33%  Similarity=0.815  Sum_probs=83.9

Q ss_pred             eeEeCCCCCcCCCCCCCccccCC------CCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeC
Q psy11059        249 YDCTCDALHTGDPCKHGSCVDKR------AGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTC  322 (429)
Q Consensus       249 ~~C~C~~g~~G~~C~~~~C~~~~------~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C  322 (429)
                      +.|.|+.+|.|..|+...|...-      ..-+|.|++||+|..|+.  ..|.. . |..++.+    +   .  ..|+|
T Consensus       234 ~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e--~~Cp~-~-cs~~g~~----~---~--g~CiC  300 (525)
T KOG1225|consen  234 GICECPEGYFGPLCSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDE--LVCPV-D-CSGGGVC----V---D--GECIC  300 (525)
T ss_pred             ceeecCCceeCCccccccCCCCCcccceEeCCeEeCCCCCcCCCCCc--ccCCc-c-cCCCcee----c---C--CEeec
Confidence            36777777777777655554431      113577777777777743  34543 2 6555555    2   1  26666


Q ss_pred             CCCccCCCCcccccCCCccCCCCccCCCcCCCCCcCCCCccccCCCceEEecCCCCCCCCccCCCCCCCCCCCCCCCEEc
Q psy11059        323 PSGYHGKICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQNKINGYTCVCAPGYSGKECSININECESSPCLHGATCI  402 (429)
Q Consensus       323 ~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~  402 (429)
                      ++||+|..|+                +..|. ..|.+++.|++.    +|.|.+||+|..|...       +|.+++.|+
T Consensus       301 ~~g~~G~dCs----------------~~~cp-adC~g~G~Ci~G----~C~C~~Gy~G~~C~~~-------~C~~~g~cv  352 (525)
T KOG1225|consen  301 NPGYSGKDCS----------------IRRCP-ADCSGHGKCIDG----ECLCDEGYTGELCIQR-------ACSGGGQCV  352 (525)
T ss_pred             CCCccccccc----------------cccCC-ccCCCCCcccCC----ceEeCCCCcCCccccc-------ccCCCceec
Confidence            6666655443                23343 459999999833    8999999999999643       388888885


Q ss_pred             cCCCCeeeeCCCCCCCCC
Q psy11059        403 DEVATFSCVCPKGLTGRL  420 (429)
Q Consensus       403 ~~~~~~~C~C~~g~~G~~  420 (429)
                      +.     |+|..||.|++
T Consensus       353 ~g-----C~C~~Gw~G~d  365 (525)
T KOG1225|consen  353 NG-----CKCKKGWRGPD  365 (525)
T ss_pred             cC-----ceeccCccCCC
Confidence            43     99999999987


No 11 
>KOG0994|consensus
Probab=98.80  E-value=7.1e-08  Score=101.35  Aligned_cols=95  Identities=21%  Similarity=0.382  Sum_probs=59.0

Q ss_pred             eEEecCCCCcccccccccCcCCC-CCCCCCCCCCCCC-CCCCCCC-CCeEee-CCCCCCeee-eCCCCCc------ccCC
Q psy11059         33 YDCTCDALHTVCCVGLANQTLGS-IHCETPISNQICT-TAPPCLN-GATCRP-QLTEQLYEC-VCPPGYK------EIRD  101 (429)
Q Consensus        33 ~~C~C~~g~~g~~~~~~~~~~~G-~~C~~~~~~~~C~-~~~~C~~-~g~C~~-~~~~~~~~C-~C~~Gy~------~~~~  101 (429)
                      .+|+|.+|-.|.++..|..+|=| +.|..-    .|. -.+.|.. -|.|+- .+....+.| .|..||+      ....
T Consensus       830 GQC~C~~g~ygrqCnqCqpG~WgFPeCr~C----qCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~  905 (1758)
T KOG0994|consen  830 GQCQCRPGTYGRQCNQCQPGYWGFPECRPC----QCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIG  905 (1758)
T ss_pred             cceeeccccchhhccccCCCccCCCcCccc----cccCcccccCccccccccccccccccchhhhhccccCCcccCCCCC
Confidence            46888888777777777776655 333311    121 0222322 244441 045566778 6999999      3467


Q ss_pred             CCCCCCCCCC--------eEecCCC-----ce-eCCeeccCCCC
Q psy11059        102 CTSNPCLNDG--------VCVWMFD-----VT-IQVYKGRYCEL  131 (429)
Q Consensus       102 C~~~~C~~~g--------~C~~~~~-----C~-~~g~~G~~C~~  131 (429)
                      |.+.||..+-        .|.-...     |. .+||+|.+|+.
T Consensus       906 CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~  949 (1758)
T KOG0994|consen  906 CRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEI  949 (1758)
T ss_pred             CCCCCCCCCCccchhccccccccccccceeeecccCccccchhh
Confidence            8888887652        3543332     89 99999999984


No 12 
>KOG1225|consensus
Probab=98.76  E-value=6.3e-08  Score=97.35  Aligned_cols=118  Identities=31%  Similarity=0.785  Sum_probs=85.6

Q ss_pred             cccCCCCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCCCCCCCCCCeEeeC
Q psy11059          3 FKPISLSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTTAPPCLNGATCRPQ   82 (429)
Q Consensus         3 ~~~~~~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~~~~C~~~g~C~~~   82 (429)
                      +.|....--|   ++-|.++|.|+     ..+|+|++||+            |.+|...    .|. .. |+.++.++  
T Consensus       243 ~g~~c~~~~C---~~~c~~~g~c~-----~G~CIC~~Gf~------------G~dC~e~----~Cp-~~-cs~~g~~~--  294 (525)
T KOG1225|consen  243 FGPLCSTIYC---PGGCTGRGQCV-----EGRCICPPGFT------------GDDCDEL----VCP-VD-CSGGGVCV--  294 (525)
T ss_pred             eCCccccccC---CCCCcccceEe-----CCeEeCCCCCc------------CCCCCcc----cCC-cc-cCCCceec--
Confidence            3444444445   66677778888     56899999999            9999753    366 33 88888888  


Q ss_pred             CCCCCeeeeCCCCCc----ccCCCCCCCCCCCCeEecCCCce-eCCeeccCCCCCCCCCCCCCCCCCCCeEeeCCCCccc
Q psy11059         83 LTEQLYECVCPPGYK----EIRDCTSNPCLNDGVCVWMFDVT-IQVYKGRYCELPEIGDCSSNPCLNDGVCVDVYKGRYC  157 (429)
Q Consensus        83 ~~~~~~~C~C~~Gy~----~~~~C~~~~C~~~g~C~~~~~C~-~~g~~G~~C~~~~i~~C~~~~C~~~g~C~~~~~g~~C  157 (429)
                      +.    .|+|++||+    .+..|. .+|..+|.|+ ...|. .+||+|..|+. .       +|.+++.|++.     |
T Consensus       295 ~g----~CiC~~g~~G~dCs~~~cp-adC~g~G~Ci-~G~C~C~~Gy~G~~C~~-~-------~C~~~g~cv~g-----C  355 (525)
T KOG1225|consen  295 DG----ECICNPGYSGKDCSIRRCP-ADCSGHGKCI-DGECLCDEGYTGELCIQ-R-------ACSGGGQCVNG-----C  355 (525)
T ss_pred             CC----EeecCCCccccccccccCC-ccCCCCCccc-CCceEeCCCCcCCcccc-c-------ccCCCceeccC-----c
Confidence            32    699999998    334444 6699999999 33399 99999999985 2       37777787752     4


Q ss_pred             cCCCCCCCCCCCCCCCceecCCcccCC
Q psy11059        158 ELPEIGDCSSNPCLNDGVCVDVYKGRY  184 (429)
Q Consensus       158 ~~~~~~~C~~~~C~~~~~C~~~~~G~~  184 (429)
                      .                 |..||.|..
T Consensus       356 ~-----------------C~~Gw~G~d  365 (525)
T KOG1225|consen  356 K-----------------CKKGWRGPD  365 (525)
T ss_pred             e-----------------eccCccCCC
Confidence            4                 889999987


No 13 
>KOG4260|consensus
Probab=98.58  E-value=1e-07  Score=85.72  Aligned_cols=131  Identities=29%  Similarity=0.749  Sum_probs=88.7

Q ss_pred             eCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCc------------------------
Q psy11059        277 DCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICE------------------------  332 (429)
Q Consensus       277 ~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~------------------------  332 (429)
                      -|++|.+|..|..-... + ..+|..++.|..  -..+.++..|.|.+||.|..|.                        
T Consensus       131 CCp~gtyGpdCl~Cpgg-s-er~C~GnG~C~G--dGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~  206 (350)
T KOG4260|consen  131 CCPDGTYGPDCLQCPGG-S-ERPCFGNGSCHG--DGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEG  206 (350)
T ss_pred             ccCCCCcCCccccCCCC-C-cCCcCCCCcccC--CCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhh
Confidence            36777777776421110 1 256777778742  1125688999999999999886                        


Q ss_pred             -----------cc-ccCCCccC--CCCccCCCcCCC--CCcCCCCccccCCCceEEecCCCCCCCCccCCCCCCCC--CC
Q psy11059        333 -----------KC-FCRPGFAG--DHCDVDFDECLS--NPCFNGATCQNKINGYTCVCAPGYSGKECSININECES--SP  394 (429)
Q Consensus       333 -----------~C-~C~~g~~g--~~C~~~i~~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~--~~  394 (429)
                                 .| .|..||.-  ..| +||++|..  .||.....|+|+.|+|.|..++||.+.     +|+|..  ..
T Consensus       207 C~~~Csg~~~k~C~kCkkGW~lde~gC-vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-----~d~C~~~~d~  280 (350)
T KOG4260|consen  207 CLGVCSGESSKGCSKCKKGWKLDEEGC-VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-----VDECQFCADV  280 (350)
T ss_pred             hhcccCCCCCCChhhhcccceeccccc-ccHHHHhcCCCCCChhheeecCCCceEecccccccCC-----hHHhhhhhhh
Confidence                       12 24555542  234 48888864  568888899999999999999998762     444442  23


Q ss_pred             C-CCCCEEccCCCCeeeeCCCCCC
Q psy11059        395 C-LHGATCIDEVATFSCVCPKGLT  417 (429)
Q Consensus       395 C-~~~~~C~~~~~~~~C~C~~g~~  417 (429)
                      | ..+..|.++.+.|+|+|..++.
T Consensus       281 ~~~kn~~c~ni~~~~r~v~f~~~~  304 (350)
T KOG4260|consen  281 CASKNRPCMNIDGQYRCVCFSGLI  304 (350)
T ss_pred             cccCCCCcccCCccEEEEecccce
Confidence            3 2345688889999999988753


No 14 
>KOG1836|consensus
Probab=98.35  E-value=1.2e-05  Score=91.41  Aligned_cols=135  Identities=27%  Similarity=0.594  Sum_probs=78.1

Q ss_pred             CCeee-eCCCCCCCCCCc-cCCCCCCCCCCCCCC------CeeCCccccCCCCCeeeeCCCCccCCCCcccccCCCccCC
Q psy11059        272 AGYFC-DCPPTYGGKNCS-VELTGCVGPDTCLNG------GTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPGFAGD  343 (429)
Q Consensus       272 ~~~~C-~C~~G~~g~~c~-~~~~~C~~~~~C~~~------~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~  343 (429)
                      .+.+| .|.+||.|..-. ...+.|.. .-|...      .+|       ..-...|.|.+.-.|..|.  .|.+||++.
T Consensus       864 ~g~~cd~c~~g~~gd~l~~~p~~~c~~-c~c~p~gs~~~~~~c-------~~~tGQcec~~~v~g~~c~--~c~~g~fnl  933 (1705)
T KOG1836|consen  864 AGEYCDLCKEGYFGDPLAPNPEDKCFA-CGCVPAGSELPSLTC-------NPVTGQCECKPNVEGRDCL--YCFKGFFNL  933 (1705)
T ss_pred             ccccccccccCccccccCCCcCCcccc-ccCccCCcccccccC-------CCcccceeccCCCCccccc--ccccccccc
Confidence            33445 688888887543 12233433 223222      223       3456789999999999888  677888866


Q ss_pred             CCccCCCcCCCCCcCCC----CccccCCCceEEecCCCCCCCCccCC--------CCCCCCCCCCCCC----EEccCCCC
Q psy11059        344 HCDVDFDECLSNPCFNG----ATCQNKINGYTCVCAPGYSGKECSIN--------INECESSPCLHGA----TCIDEVAT  407 (429)
Q Consensus       344 ~C~~~i~~C~~~~C~~~----~~C~~~~g~~~C~C~~G~~G~~C~~~--------~~~C~~~~C~~~~----~C~~~~~~  407 (429)
                      .-   -..|+.-.|...    ..|....  ..|.|.+|-+|.+|..-        +..|..--|...|    .|...  .
T Consensus       934 ~s---~~gC~~c~c~~~gs~~~~c~~~t--Gqc~c~~gVtgqrc~qc~~~~~~~~~~gc~~c~c~~~Gs~~~qc~~~--~ 1006 (1705)
T KOG1836|consen  934 NS---GVGCEPCNCDPTGSESSDCDVGT--GQCYCRPGVTGQRCDQCETYHFGFQTEGCGLCECDPLGSRGFQCDPE--D 1006 (1705)
T ss_pred             CC---CCCcccccccccccccccccccC--CceeeecCccccccCccccCcccccccCCcceecccCCcccceeccc--C
Confidence            51   122333334322    2454444  48999999999887521        1112222243333    45432  3


Q ss_pred             eeeeCCCCCCCCCCCC
Q psy11059        408 FSCVCPKGLTGRLCET  423 (429)
Q Consensus       408 ~~C~C~~g~~G~~C~~  423 (429)
                      .+|.|+++|.|.+|..
T Consensus      1007 G~c~c~~~~~g~~c~~ 1022 (1705)
T KOG1836|consen 1007 GQCPCRPGFEGRRCDQ 1022 (1705)
T ss_pred             CeeeecCCCCCccccc
Confidence            4899999999987763


No 15 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.29  E-value=3.5e-07  Score=60.22  Aligned_cols=34  Identities=24%  Similarity=0.663  Sum_probs=32.2

Q ss_pred             CCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059          8 LSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus         8 ~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      |||||+..+++|..+++|+ ++.|+|+|.|++||+
T Consensus         1 DidEC~~~~~~C~~~~~C~-N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCV-NTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEE-EETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEE-cCCCCEEeeCCCCcE
Confidence            5899998888999999999 999999999999998


No 16 
>KOG1226|consensus
Probab=98.17  E-value=1.1e-05  Score=83.10  Aligned_cols=144  Identities=28%  Similarity=0.650  Sum_probs=84.9

Q ss_pred             CCCCCCcEeccCCCCCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCCccCCCCCCC---CCCCCCCCeeCCc
Q psy11059        233 NPCQNGGKCNEDETGNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNCSVELTGCVG---PDTCLNGGTCKPY  309 (429)
Q Consensus       233 ~~C~~~g~C~~~~~~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~~~~~C~~---~~~C~~~~~C~~~  309 (429)
                      ..|+.+|+.     .-..|.|.+||.|..|+-.              ..-.... + ..+.|..   ..+|...|.|.  
T Consensus       467 ~~C~g~G~~-----~CG~C~C~~G~~G~~CEC~--------------~~~~ss~-~-~~~~Cr~~~~~~vCSgrG~C~--  523 (783)
T KOG1226|consen  467 ALCHGNGTF-----VCGQCRCDEGWLGKKCECS--------------TDELSSS-E-EEDKCRENSDSPVCSGRGDCV--  523 (783)
T ss_pred             cccCCCCcE-----EecceecCCCCCCCcccCC--------------ccccCcH-h-HHhhccCCCCCCCcCCCCcEe--
Confidence            356655554     2246888888888887632              1111110 0 0122321   13577777772  


Q ss_pred             cccCCCCCeeeeCCCCccCCCCcccccCCCccCCCCccCCCcCCC---CCcCCCCccccCCCceEEecCCCCCCCCcc--
Q psy11059        310 LVDETQHRFNCTCPSGYHGKICEKCFCRPGFAGDHCDVDFDECLS---NPCFNGATCQNKINGYTCVCAPGYSGKECS--  384 (429)
Q Consensus       310 ~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C~~~i~~C~~---~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~--  384 (429)
                             =++|+|.+...|.          ++|..|+-|--.|..   .-|..++.|.-.    +|+|.+||+|..|+  
T Consensus       524 -------CGqC~C~~~~~~~----------i~G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~  582 (783)
T KOG1226|consen  524 -------CGQCVCHKPDNGK----------IYGKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCP  582 (783)
T ss_pred             -------CCceEecCCCCCc----------eeeeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence                   1356776655421          235566533333433   237788887654    79999999999875  


Q ss_pred             CCCCCCCCC---CCCCCCEEccCCCCeeeeCCCC-CCCCCCCCC
Q psy11059        385 ININECESS---PCLHGATCIDEVATFSCVCPKG-LTGRLCETN  424 (429)
Q Consensus       385 ~~~~~C~~~---~C~~~~~C~~~~~~~~C~C~~g-~~G~~C~~~  424 (429)
                      .+.+.|.+.   .|...|+|.-.    +|+|... |.|..||..
T Consensus       583 ~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~c  622 (783)
T KOG1226|consen  583 LSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEKC  622 (783)
T ss_pred             CCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhcC
Confidence            455666542   47777777544    6888866 999999864


No 17 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.92  E-value=4.9e-06  Score=51.16  Aligned_cols=27  Identities=52%  Similarity=1.068  Sum_probs=25.0

Q ss_pred             CCCCCCCCCEeccCCC-CceEEecCCCCc
Q psy11059         15 QRNPCQNGGKCNEDET-GNYDCTCDALHT   42 (429)
Q Consensus        15 ~~~~C~~~g~C~~~~~-~~~~C~C~~g~~   42 (429)
                      .++||+|+|+|+ +.. ++|+|+|++||+
T Consensus         2 ~~~~C~n~g~C~-~~~~~~y~C~C~~G~~   29 (32)
T PF00008_consen    2 SSNPCQNGGTCI-DLPGGGYTCECPPGYT   29 (32)
T ss_dssp             TTTSSTTTEEEE-EESTSEEEEEEBTTEE
T ss_pred             CCCcCCCCeEEE-eCCCCCEEeECCCCCc
Confidence            578999999999 777 999999999999


No 18 
>KOG4260|consensus
Probab=97.89  E-value=1.5e-05  Score=72.00  Aligned_cols=96  Identities=26%  Similarity=0.600  Sum_probs=64.4

Q ss_pred             CCCCCCCCCCCCCC--CCCCCCCCCeEeeC-CCCCCeeeeCCCCCc--ccCCCCC-----------C---CCCC--CCeE
Q psy11059         55 SIHCETPISNQICT--TAPPCLNGATCRPQ-LTEQLYECVCPPGYK--EIRDCTS-----------N---PCLN--DGVC  113 (429)
Q Consensus        55 G~~C~~~~~~~~C~--~~~~C~~~g~C~~~-~~~~~~~C~C~~Gy~--~~~~C~~-----------~---~C~~--~g~C  113 (429)
                      |+.|.      .|.  +..+|..+|.|.-. ...|+..|.|.+||+  .-..|..           .   .|..  .++|
T Consensus       138 GpdCl------~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~C  211 (350)
T KOG4260|consen  138 GPDCL------QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVC  211 (350)
T ss_pred             CCccc------cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhccc
Confidence            99997      453  35678788888721 145678999999998  1122221           0   1211  1245


Q ss_pred             ecCCC--ce--eCCeec--cCCCCCCCCCCC--CCCCCCCCeEeeCCCCcccc
Q psy11059        114 VWMFD--VT--IQVYKG--RYCELPEIGDCS--SNPCLNDGVCVDVYKGRYCE  158 (429)
Q Consensus       114 ~~~~~--C~--~~g~~G--~~C~~~~i~~C~--~~~C~~~g~C~~~~~g~~C~  158 (429)
                      .....  |.  ..||.-  ..|.  |||||.  +.||.....|+|+.++|.|+
T Consensus       212 sg~~~k~C~kCkkGW~lde~gCv--DvnEC~~ep~~c~~~qfCvNteGSf~C~  262 (350)
T KOG4260|consen  212 SGESSKGCSKCKKGWKLDEEGCV--DVNECQNEPAPCKAHQFCVNTEGSFKCE  262 (350)
T ss_pred             CCCCCCChhhhcccceecccccc--cHHHHhcCCCCCChhheeecCCCceEec
Confidence            43332  77  788874  4576  899998  57799999999999999998


No 19 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.82  E-value=6.3e-06  Score=51.94  Aligned_cols=32  Identities=25%  Similarity=0.641  Sum_probs=25.4

Q ss_pred             CCCCCCCCCCCCEeccCCCCceEEecCCCCccc
Q psy11059         12 CDAQRNPCQNGGKCNEDETGNYDCTCDALHTVC   44 (429)
Q Consensus        12 C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~   44 (429)
                      |+...+.|+.+++|+ ++.++|+|+|++||.|+
T Consensus         1 C~~~~~~C~~nA~C~-~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    1 CLENNGGCHPNATCT-NTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             TTTGGGGS-TTCEEE-E-TTSEEEEE-CEEECC
T ss_pred             CCCCCCCCCCCcEee-cCCCCEEeECCCCCccC
Confidence            344567899999999 99999999999999988


No 20 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.81  E-value=3.2e-05  Score=49.72  Aligned_cols=32  Identities=44%  Similarity=0.966  Sum_probs=29.0

Q ss_pred             CCCCCCCCC-CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059          8 LSSPCDAQR-NPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus         8 ~~~~C~~~~-~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      ++|+|  .. ++|.++|+|+ +..++|+|.|++||.
T Consensus         1 d~~~C--~~~~~C~~~~~C~-~~~g~~~C~C~~g~~   33 (39)
T smart00179        1 DIDEC--ASGNPCQNGGTCV-NTVGSYRCECPPGYT   33 (39)
T ss_pred             CcccC--cCCCCcCCCCEeE-CCCCCeEeECCCCCc
Confidence            47899  45 7999999999 999999999999998


No 21 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.78  E-value=1.1e-05  Score=49.59  Aligned_cols=29  Identities=48%  Similarity=1.145  Sum_probs=19.1

Q ss_pred             CCCCCCCCCEEccCC-CCeeeeCCCCCCCC
Q psy11059        391 ESSPCLHGATCIDEV-ATFSCVCPKGLTGR  419 (429)
Q Consensus       391 ~~~~C~~~~~C~~~~-~~~~C~C~~g~~G~  419 (429)
                      .+++|.++|+|++.. ++|+|.|++||+|+
T Consensus         2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            445666667776666 66777777777665


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.77  E-value=4e-05  Score=49.28  Aligned_cols=36  Identities=53%  Similarity=1.297  Sum_probs=24.0

Q ss_pred             CCCCCC-CCCCCCCEEccCCCCeeeeCCCCCC-CCCCC
Q psy11059        387 INECES-SPCLHGATCIDEVATFSCVCPKGLT-GRLCE  422 (429)
Q Consensus       387 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~g~~-G~~C~  422 (429)
                      +++|.. .+|.++++|+++.++|+|.|++||+ |.+|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            455655 5676666777777777777777777 66653


No 23 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.70  E-value=1.8e-05  Score=52.03  Aligned_cols=33  Identities=42%  Similarity=0.951  Sum_probs=29.3

Q ss_pred             CCCCCCCC-CCCCCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059         61 PISNQICT-TAPPCLNGATCRPQLTEQLYECVCPPGYK   97 (429)
Q Consensus        61 ~~~~~~C~-~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~   97 (429)
                      |||  ||+ ..+.|..+++|+  |+.|+|+|.|++||+
T Consensus         1 Did--EC~~~~~~C~~~~~C~--N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DID--ECAEGPHNCPENGTCV--NTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESS--TTTTTSSSSSTTSEEE--EETTEEEEEESTTEE
T ss_pred             Ccc--ccCCCCCcCCCCCEEE--cCCCCEEeeCCCCcE
Confidence            466  898 466899899999  999999999999998


No 24 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.64  E-value=8.3e-05  Score=47.26  Aligned_cols=32  Identities=44%  Similarity=0.982  Sum_probs=28.7

Q ss_pred             CCCCCCCCC-CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059          8 LSSPCDAQR-NPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus         8 ~~~~C~~~~-~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      ++|+|  .. .+|.++++|+ +..+.|+|.|++||.
T Consensus         1 ~~~~C--~~~~~C~~~~~C~-~~~~~~~C~C~~g~~   33 (38)
T cd00054           1 DIDEC--ASGNPCQNGGTCV-NTVGSYRCSCPPGYT   33 (38)
T ss_pred             CcccC--CCCCCcCCCCEeE-CCCCCeEeECCCCCc
Confidence            36889  45 7999999999 999999999999999


No 25 
>KOG1226|consensus
Probab=97.47  E-value=0.0011  Score=68.69  Aligned_cols=99  Identities=22%  Similarity=0.513  Sum_probs=64.1

Q ss_pred             CCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCC-------CCCC---CCCCCCCCCeEeeCCCC
Q psy11059         16 RNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISN-------QICT---TAPPCLNGATCRPQLTE   85 (429)
Q Consensus        16 ~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~-------~~C~---~~~~C~~~g~C~~~~~~   85 (429)
                      ...|+.+|+.+     =.+|.|.+||.            |..||-+.+.       +.|.   +..+|.++|.|+  =. 
T Consensus       466 s~~C~g~G~~~-----CG~C~C~~G~~------------G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~--CG-  525 (783)
T KOG1226|consen  466 SALCHGNGTFV-----CGQCRCDEGWL------------GKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCV--CG-  525 (783)
T ss_pred             ccccCCCCcEE-----ecceecCCCCC------------CCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEe--CC-
Confidence            44577777665     45799999999            7777754321       1343   133788999888  22 


Q ss_pred             CCeeeeCCCCCc----------ccCCCC---CCCCCCCCeEecCCCce-eCCeeccCCCCC-CCCCCC
Q psy11059         86 QLYECVCPPGYK----------EIRDCT---SNPCLNDGVCVWMFDVT-IQVYKGRYCELP-EIGDCS  138 (429)
Q Consensus        86 ~~~~C~C~~Gy~----------~~~~C~---~~~C~~~g~C~~~~~C~-~~g~~G~~C~~~-~i~~C~  138 (429)
                         .|+|.+...          +.-.|.   ...|..+|.|.-.. |. .+||+|..|+-+ +.+.|.
T Consensus       526 ---qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG~-CvC~~GwtG~~C~C~~std~C~  589 (783)
T KOG1226|consen  526 ---QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECGR-CVCNPGWTGSACNCPLSTDTCE  589 (783)
T ss_pred             ---ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCCc-EEcCCCCccCCCCCCCCCcccc
Confidence               378876554          222333   24588888886444 99 999999998741 334444


No 26 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.41  E-value=0.00025  Score=44.98  Aligned_cols=36  Identities=53%  Similarity=1.305  Sum_probs=23.5

Q ss_pred             CCCCCC-CCCCCCCEEccCCCCeeeeCCCCCCCCCCC
Q psy11059        387 INECES-SPCLHGATCIDEVATFSCVCPKGLTGRLCE  422 (429)
Q Consensus       387 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~g~~G~~C~  422 (429)
                      +++|.. .+|.++++|++..++|+|.|++||+|.+|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            345554 566666677777777777777777776653


No 27 
>KOG1836|consensus
Probab=97.39  E-value=0.00072  Score=77.31  Aligned_cols=112  Identities=29%  Similarity=0.669  Sum_probs=78.4

Q ss_pred             EeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCCcc-CCCCCCCCCCCCC-CCeeCCccccCCCCCeeeeCCCCccC
Q psy11059        251 CTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNCSV-ELTGCVGPDTCLN-GGTCKPYLVDETQHRFNCTCPSGYHG  328 (429)
Q Consensus       251 C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~-~~~~C~~~~~C~~-~~~C~~~~~~~~~~~~~C~C~~G~~G  328 (429)
                      |.|+.||+|..|+             .|.+||....-.. +...|.+ ..|.. ..+|       ...+..|.|.+...|
T Consensus       697 c~C~~g~tG~~Ce-------------~C~~gfrr~~~~~~~~~~c~~-C~cngh~~~C-------d~~tG~C~C~~~t~G  755 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCE-------------SCAPGFRRLSPQLGPFCPCIP-CDCNGHSNIC-------DPRTGQCKCKHNTFG  755 (1705)
T ss_pred             ccCCCCcccchhh-------------hcchhhhcccccCCCCCcccc-cccCCccccc-------cCCCCceecccCCCC
Confidence            9999999999998             5888885432111 1122322 23322 2345       556788999999999


Q ss_pred             CCCcccccCCCccCCCCccCCCcCCCCCcCCCCccccCC--CceEEe-cCCCCCCCCccC
Q psy11059        329 KICEKCFCRPGFAGDHCDVDFDECLSNPCFNGATCQNKI--NGYTCV-CAPGYSGKECSI  385 (429)
Q Consensus       329 ~~C~~C~C~~g~~g~~C~~~i~~C~~~~C~~~~~C~~~~--g~~~C~-C~~G~~G~~C~~  385 (429)
                      ..|+  +|..||+|..=.-....|.+-+|.+++.|....  ....|. |++||+|.+|+.
T Consensus       756 ~~C~--~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~  813 (1705)
T KOG1836|consen  756 GQCA--QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE  813 (1705)
T ss_pred             Cchh--hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence            9998  789999987533122237777888888887554  567898 999999999974


No 28 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.13  E-value=0.00074  Score=42.05  Aligned_cols=26  Identities=46%  Similarity=1.088  Sum_probs=24.6

Q ss_pred             CCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059         16 RNPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus        16 ~~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      ..+|.++++|+ +..+.|+|.|+.||.
T Consensus         5 ~~~C~~~~~C~-~~~~~~~C~C~~g~~   30 (36)
T cd00053           5 SNPCSNGGTCV-NTPGSYRCVCPPGYT   30 (36)
T ss_pred             CCCCCCCCEEe-cCCCCeEeECCCCCc
Confidence            67999999999 988999999999999


No 29 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.05  E-value=0.00094  Score=41.70  Aligned_cols=28  Identities=39%  Similarity=1.035  Sum_probs=24.8

Q ss_pred             CCCCCC-CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059         11 PCDAQR-NPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus        11 ~C~~~~-~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      +|  .. ++|.++ +|+ +..++|+|.|++||.
T Consensus         1 ~C--~~~~~C~~~-~C~-~~~~~~~C~C~~g~~   29 (35)
T smart00181        1 EC--ASGGPCSNG-TCI-NTPGSYTCSCPPGYT   29 (35)
T ss_pred             CC--CCcCCCCCC-EEE-CCCCCeEeECCCCCc
Confidence            46  44 789998 999 999999999999999


No 30 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.82  E-value=0.0021  Score=39.91  Aligned_cols=30  Identities=47%  Similarity=1.230  Sum_probs=21.6

Q ss_pred             CCCCCCCCEEccCCCCeeeeCCCCCCCC-CC
Q psy11059        392 SSPCLHGATCIDEVATFSCVCPKGLTGR-LC  421 (429)
Q Consensus       392 ~~~C~~~~~C~~~~~~~~C~C~~g~~G~-~C  421 (429)
                      ..+|.++++|++..++|+|.|+.||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            4567667777777777778888777777 54


No 31 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=96.81  E-value=0.0013  Score=37.14  Aligned_cols=23  Identities=26%  Similarity=0.589  Sum_probs=19.2

Q ss_pred             ceEEecCCCCcccccccccCcCCCCCCCCCCC
Q psy11059         32 NYDCTCDALHTVCCVGLANQTLGSIHCETPIS   63 (429)
Q Consensus        32 ~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~   63 (429)
                      +|+|+|++||+        ...+|..|+ |||
T Consensus         1 sy~C~C~~Gy~--------l~~d~~~C~-DId   23 (24)
T PF12662_consen    1 SYTCSCPPGYQ--------LSPDGRSCE-DID   23 (24)
T ss_pred             CEEeeCCCCCc--------CCCCCCccc-cCC
Confidence            69999999999        445689996 877


No 32 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.80  E-value=0.0021  Score=40.07  Aligned_cols=28  Identities=46%  Similarity=1.293  Sum_probs=20.4

Q ss_pred             CCCCCCCEEccCCCCeeeeCCCCCCC-CCC
Q psy11059        393 SPCLHGATCIDEVATFSCVCPKGLTG-RLC  421 (429)
Q Consensus       393 ~~C~~~~~C~~~~~~~~C~C~~g~~G-~~C  421 (429)
                      .+|.++ +|+++.++|+|.|++||+| ..|
T Consensus         6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            467666 7777777778888888877 555


No 33 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.47  E-value=0.0047  Score=37.71  Aligned_cols=23  Identities=35%  Similarity=0.674  Sum_probs=20.0

Q ss_pred             CCCCCCCEeccCCCCceEEecCCCCc
Q psy11059         17 NPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus        17 ~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      ..|+++|+|+ ..  ..+|+|.+||+
T Consensus         6 ~~C~~~G~C~-~~--~g~C~C~~g~~   28 (32)
T PF07974_consen    6 NICSGHGTCV-SP--CGRCVCDSGYT   28 (32)
T ss_pred             CccCCCCEEe-CC--CCEEECCCCCc
Confidence            4699999999 54  57999999999


No 34 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.38  E-value=0.0018  Score=40.76  Aligned_cols=27  Identities=30%  Similarity=0.808  Sum_probs=18.2

Q ss_pred             CCCCCCCEEccCCCCeeeeCCCCCCCC
Q psy11059        393 SPCLHGATCIDEVATFSCVCPKGLTGR  419 (429)
Q Consensus       393 ~~C~~~~~C~~~~~~~~C~C~~g~~G~  419 (429)
                      ..|+.+|+|+++.++|+|+|++||+|+
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            357777888888778888888888765


No 35 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.98  E-value=0.0066  Score=34.30  Aligned_cols=10  Identities=40%  Similarity=1.404  Sum_probs=4.8

Q ss_pred             eeeeCCCCCC
Q psy11059        408 FSCVCPKGLT  417 (429)
Q Consensus       408 ~~C~C~~g~~  417 (429)
                      |+|.|++||+
T Consensus         2 y~C~C~~Gy~   11 (24)
T PF12662_consen    2 YTCSCPPGYQ   11 (24)
T ss_pred             EEeeCCCCCc
Confidence            4445555543


No 36 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.64  E-value=0.0047  Score=29.51  Aligned_cols=13  Identities=54%  Similarity=1.347  Sum_probs=7.4

Q ss_pred             eeeCCCCCCCCCC
Q psy11059        409 SCVCPKGLTGRLC  421 (429)
Q Consensus       409 ~C~C~~g~~G~~C  421 (429)
                      +|+|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            3666666666655


No 37 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.56  E-value=0.017  Score=35.30  Aligned_cols=26  Identities=38%  Similarity=0.818  Sum_probs=18.6

Q ss_pred             CCCCCCEEccCCCCeeeeCCCCCCCCCC
Q psy11059        394 PCLHGATCIDEVATFSCVCPKGLTGRLC  421 (429)
Q Consensus       394 ~C~~~~~C~~~~~~~~C~C~~g~~G~~C  421 (429)
                      .|.++|+|+..  ..+|+|.+||+|+.|
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            47777788654  347888888888765


No 38 
>KOG3512|consensus
Probab=95.36  E-value=0.15  Score=50.35  Aligned_cols=163  Identities=23%  Similarity=0.574  Sum_probs=91.1

Q ss_pred             cEeccCCCCCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCCC----ccCCCCCCCCCCCCCCCe-eCC----c
Q psy11059        239 GKCNEDETGNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKNC----SVELTGCVGPDTCLNGGT-CKP----Y  309 (429)
Q Consensus       239 g~C~~~~~~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~c----~~~~~~C~~~~~C~~~~~-C~~----~  309 (429)
                      ..|+.+..+..+|.|..+-+|..|+             .|.+-|.....    ..++.+|.. ..|..++. |.-    .
T Consensus       285 s~Cv~d~~~~ltCdC~HNTaGPdCg-------------rCKpfy~dRPW~raT~~~a~~c~a-c~Cn~harrcrfn~Ely  350 (592)
T KOG3512|consen  285 SRCVMDESSHLTCDCEHNTAGPDCG-------------RCKPFYYDRPWGRATALPANECVA-CNCNGHARRCRFNMELY  350 (592)
T ss_pred             ceeeeccCCceEEecccCCCCCCcc-------------cccccccCCCccccccCCCccccc-cccchhhhhcccchhhh
Confidence            4676777777999999999999997             35565533322    124455554 44433322 210    0


Q ss_pred             cccCCCCCeee-eCCCCccCCCCcccccCCCccCCCCc--cCCCcCCCCCcC----CCCccccCCCceEEecCCCCCCCC
Q psy11059        310 LVDETQHRFNC-TCPSGYHGKICEKCFCRPGFAGDHCD--VDFDECLSNPCF----NGATCQNKINGYTCVCAPGYSGKE  382 (429)
Q Consensus       310 ~~~~~~~~~~C-~C~~G~~G~~C~~C~C~~g~~g~~C~--~~i~~C~~~~C~----~~~~C~~~~g~~~C~C~~G~~G~~  382 (429)
                      .......+..| .|.....|..|.  -|..||+-+.-.  .+...|..-.|+    .+-+|....|  +|.|++|-+|.+
T Consensus       351 ~lSgr~SggvClnCrHnTaGrhCh--yCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~t  426 (592)
T KOG3512|consen  351 RLSGRRSGGVCLNCRHNTAGRHCH--YCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLT  426 (592)
T ss_pred             cccCccccceEeecccCCCCcccc--cccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCcccc
Confidence            00112233466 788888888887  678888743211  122233322232    3446765655  899999999988


Q ss_pred             ccC----------CCCCCCCC------CCCCCCEEccCCCCeeeeCCCCCCCCCCCC
Q psy11059        383 CSI----------NINECESS------PCLHGATCIDEVATFSCVCPKGLTGRLCET  423 (429)
Q Consensus       383 C~~----------~~~~C~~~------~C~~~~~C~~~~~~~~C~C~~g~~G~~C~~  423 (429)
                      |..          .+-+|.-.      .+.++.+    +..+.+.|+.++.|.+++.
T Consensus       427 CnrCa~gyqqsrs~vapcik~p~~~~~~~~s~ve----~qd~~s~Ck~~~~~~r~n~  479 (592)
T KOG3512|consen  427 CNRCAPGYQQSRSPVAPCIKIPTDAPTLGSSGVE----PQDQCSKCKASPGGKRLNQ  479 (592)
T ss_pred             cccccchhhcccCCCcCceecCCCCccccCCCCc----chhccccCCCCCcceeccc
Confidence            741          11122111      1222222    3345678999998887764


No 39 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=95.06  E-value=0.0032  Score=54.42  Aligned_cols=136  Identities=22%  Similarity=0.536  Sum_probs=73.1

Q ss_pred             cccCCCCeeeeCCCCCCC---CCCccCCCCCCC----CCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCCcccccCCC
Q psy11059        267 CVDKRAGYFCDCPPTYGG---KNCSVELTGCVG----PDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICEKCFCRPG  339 (429)
Q Consensus       267 C~~~~~~~~C~C~~G~~g---~~c~~~~~~C~~----~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C~~C~C~~g  339 (429)
                      .+...+.|.|.|.+||..   ..|+.. ..|..    ..+|...+.|...........|.|.|.+||.-.          
T Consensus        13 LiQMSNHfEC~Cnegfvl~~EntCE~k-v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~----------   81 (197)
T PF06247_consen   13 LIQMSNHFECKCNEGFVLKNENTCEEK-VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILK----------   81 (197)
T ss_dssp             EEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES----------
T ss_pred             EEEccCceEEEcCCCcEEccccccccc-eecCcccccCccccchhhhhcCCCcccceeEEEecccCceee----------
Confidence            444556677888888742   344422 23432    145777788843211124578999999999743          


Q ss_pred             ccCCCCccCCCcCCCCCcCCCCccccCC---CceEEecCCCCC---CCCccCCCC-CCCCCCCCCCCEEccCCCCeeeeC
Q psy11059        340 FAGDHCDVDFDECLSNPCFNGATCQNKI---NGYTCVCAPGYS---GKECSININ-ECESSPCLHGATCIDEVATFSCVC  412 (429)
Q Consensus       340 ~~g~~C~~~i~~C~~~~C~~~~~C~~~~---g~~~C~C~~G~~---G~~C~~~~~-~C~~~~C~~~~~C~~~~~~~~C~C  412 (429)
                        ...|.  ...|....|. .|.|+..+   ....|+|.-|+.   ...|..+-+ +|. --|..+.+|....+-|+|.+
T Consensus        82 --~~vCv--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~~~Y~C~~  155 (197)
T PF06247_consen   82 --QGVCV--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVDGYYKCVC  155 (197)
T ss_dssp             --SSSEE--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEEE
T ss_pred             --CCeEc--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeCcEEEeec
Confidence              11221  2445544565 56776433   345999999986   334432211 222 24777889999999999999


Q ss_pred             CCCCCCC
Q psy11059        413 PKGLTGR  419 (429)
Q Consensus       413 ~~g~~G~  419 (429)
                      .++|.+.
T Consensus       156 ~~~~~~~  162 (197)
T PF06247_consen  156 KEGFPGD  162 (197)
T ss_dssp             -TT-EEE
T ss_pred             CCCCCCC
Confidence            9998643


No 40 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.87  E-value=0.012  Score=37.01  Aligned_cols=18  Identities=28%  Similarity=0.778  Sum_probs=16.0

Q ss_pred             EeccCCCCceEEecCCCCc
Q psy11059         24 KCNEDETGNYDCTCDALHT   42 (429)
Q Consensus        24 ~C~~~~~~~~~C~C~~g~~   42 (429)
                      .|+ +.+++|+|.|++||.
T Consensus        11 ~C~-~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   11 ICV-NTPGSYRCSCPPGYK   28 (36)
T ss_dssp             EEE-EETTSEEEE-STTEE
T ss_pred             CCc-cCCCceEeECCCCCE
Confidence            788 889999999999999


No 41 
>KOG1218|consensus
Probab=94.45  E-value=4.4  Score=38.88  Aligned_cols=65  Identities=25%  Similarity=0.553  Sum_probs=37.3

Q ss_pred             CCCcCCCCCCCccccC----CCCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCcc
Q psy11059        255 ALHTGDPCKHGSCVDK----RAGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYH  327 (429)
Q Consensus       255 ~g~~G~~C~~~~C~~~----~~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~  327 (429)
                      ++|.|..|.... ...    .....|.|.+||.|..+......|.....+.+++.|.       .....+.+.+.+.
T Consensus       140 ~~~~g~~C~~~c-~~~~~~~~~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~-------~~~~~~~~~~~~~  208 (316)
T KOG1218|consen  140 ENLVGLKCQRDC-QCTGGCDCKNGICTCQPGFVGVFCVESCSGCSPLTACENGAKCN-------RSTGSCLCYPGPS  208 (316)
T ss_pred             cCCCCCCccCCC-CCccccCCCCCceeccCCcccccccccCCCcCCCcccCCCCeee-------ccccccccCCCCc
Confidence            356666665422 111    2234678899998888765544466556677777772       2334455555554


No 42 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=93.38  E-value=0.024  Score=49.13  Aligned_cols=121  Identities=24%  Similarity=0.494  Sum_probs=70.4

Q ss_pred             CEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCC----CCCCCCCCCeEeeCC---CCCCeeeeCCCC
Q psy11059         23 GKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICT----TAPPCLNGATCRPQL---TEQLYECVCPPG   95 (429)
Q Consensus        23 g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~----~~~~C~~~g~C~~~~---~~~~~~C~C~~G   95 (429)
                      |.-+ ...+.|.|.|.+||.-.         .-.+||.-+   +|.    ...+|.+-++|+...   ....|+|.|.+|
T Consensus        11 G~Li-QMSNHfEC~Cnegfvl~---------~EntCE~kv---~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~g   77 (197)
T PF06247_consen   11 GYLI-QMSNHFECKCNEGFVLK---------NENTCEEKV---ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCING   77 (197)
T ss_dssp             EEEE-EESSEEEEEESTTEEEE---------ETTEEEE-------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TT
T ss_pred             CEEE-EccCceEEEcCCCcEEc---------cccccccce---ecCcccccCccccchhhhhcCCCcccceeEEEecccC
Confidence            6666 66778999999999822         235676433   565    256799999999432   246799999999


Q ss_pred             Cc-c-----cCCCCCCCCCCCCeEecCCC------ce-eCCee---ccCCCCCCCCCCCCCCCCCCCeEeeCCCCcccc
Q psy11059         96 YK-E-----IRDCTSNPCLNDGVCVWMFD------VT-IQVYK---GRYCELPEIGDCSSNPCLNDGVCVDVYKGRYCE  158 (429)
Q Consensus        96 y~-~-----~~~C~~~~C~~~g~C~~~~~------C~-~~g~~---G~~C~~~~i~~C~~~~C~~~g~C~~~~~g~~C~  158 (429)
                      |. .     .+.|..-.|. .|.|+-.+.      |. .-|+.   ...|...--.+|. ..|..+..|....+-|.|.
T Consensus        78 Y~~~~~vCvp~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~-LKCk~nE~CK~~~~~Y~C~  154 (197)
T PF06247_consen   78 YILKQGVCVPNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS-LKCKENEECKLVDGYYKCV  154 (197)
T ss_dssp             EEESSSSEEEGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEE
T ss_pred             ceeeCCeEchhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-eecCCCcceeeeCcEEEee
Confidence            99 2     3455555666 688876543      77 77776   1223210011222 2366777888887777887


No 43 
>smart00051 DSL delta serrate ligand.
Probab=92.62  E-value=0.21  Score=35.80  Aligned_cols=45  Identities=29%  Similarity=0.625  Sum_probs=27.1

Q ss_pred             eeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCCCCccCCCC
Q psy11059        276 CDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKIC  331 (429)
Q Consensus       276 C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~~G~~G~~C  331 (429)
                      -.|+++|.|..|+   ..|.+.+....+.+|       .. ...++|.+||+|..|
T Consensus        19 v~C~~~~yG~~C~---~~C~~~~d~~~~~~C-------d~-~G~~~C~~Gw~G~~C   63 (63)
T smart00051       19 VTCDENYYGEGCN---KFCRPRDDFFGHYTC-------DE-NGNKGCLEGWMGPYC   63 (63)
T ss_pred             eeCCCCCcCCccC---CEeCcCccccCCccC-------Cc-CCCEecCCCCcCCCC
Confidence            3677777777774   234433344556666       22 356788888887654


No 44 
>KOG3512|consensus
Probab=92.38  E-value=0.37  Score=47.65  Aligned_cols=109  Identities=26%  Similarity=0.642  Sum_probs=61.1

Q ss_pred             ccCCCCCeeeeCCCCccCCCCcccccCCCccCC----CCccCCCcCCCCCcCCC-------------------Ccccc--
Q psy11059        311 VDETQHRFNCTCPSGYHGKICEKCFCRPGFAGD----HCDVDFDECLSNPCFNG-------------------ATCQN--  365 (429)
Q Consensus       311 ~~~~~~~~~C~C~~G~~G~~C~~C~C~~g~~g~----~C~~~i~~C~~~~C~~~-------------------~~C~~--  365 (429)
                      +.+..+..+|.|..+..|..|+  .|.+-|.+.    .-..++++|....|..+                   ++|++  
T Consensus       288 v~d~~~~ltCdC~HNTaGPdCg--rCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCr  365 (592)
T KOG3512|consen  288 VMDESSHLTCDCEHNTAGPDCG--RCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCR  365 (592)
T ss_pred             eeccCCceEEecccCCCCCCcc--cccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecc
Confidence            3435566999999999999998  566666532    11124455554333222                   23332  


Q ss_pred             -CCCceEE-ecCCCCCCCCcc--CCCCCCCCCCCCC----CCEEccCCCCeeeeCCCCCCCCCCCC
Q psy11059        366 -KINGYTC-VCAPGYSGKECS--ININECESSPCLH----GATCIDEVATFSCVCPKGLTGRLCET  423 (429)
Q Consensus       366 -~~g~~~C-~C~~G~~G~~C~--~~~~~C~~~~C~~----~~~C~~~~~~~~C~C~~g~~G~~C~~  423 (429)
                       ...+-+| .|++||.-+.=.  .+...|..-.|+.    +.+|..+.  .+|.|++|.+|..|+.
T Consensus       366 HnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~t--GqCpCkeGvtG~tCnr  429 (592)
T KOG3512|consen  366 HNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTT--GQCPCKEGVTGLTCNR  429 (592)
T ss_pred             cCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccccC--CcccCCCCCccccccc
Confidence             1122234 477777522111  1223344444543    34675553  4899999999998874


No 45 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.35  E-value=0.059  Score=33.86  Aligned_cols=22  Identities=50%  Similarity=1.162  Sum_probs=18.1

Q ss_pred             CCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059         72 PCLNGATCRPQLTEQLYECVCPPGYK   97 (429)
Q Consensus        72 ~C~~~g~C~~~~~~~~~~C~C~~Gy~   97 (429)
                      .|.+  +|+  +++++|+|.|++||+
T Consensus         7 gC~h--~C~--~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen    7 GCSH--ICV--NTPGSYRCSCPPGYK   28 (36)
T ss_dssp             GSSS--EEE--EETTSEEEE-STTEE
T ss_pred             CcCC--CCc--cCCCceEeECCCCCE
Confidence            4555  899  889999999999998


No 46 
>KOG1218|consensus
Probab=92.02  E-value=3.3  Score=39.69  Aligned_cols=42  Identities=31%  Similarity=0.600  Sum_probs=22.7

Q ss_pred             CCeeEeCCCCCcCC-CCCC--------CccccCCCCeeeeCCCCCCCCCCc
Q psy11059        247 GNYDCTCDALHTGD-PCKH--------GSCVDKRAGYFCDCPPTYGGKNCS  288 (429)
Q Consensus       247 ~~~~C~C~~g~~G~-~C~~--------~~C~~~~~~~~C~C~~G~~g~~c~  288 (429)
                      ....|.|.++|+|. .+..        ..+........|.+..+|.+..|.
T Consensus        13 ~~~~c~c~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~   63 (316)
T KOG1218|consen   13 GSGQCFCDPGYTGRLQCEHQAVTSACSGICPCEVNSGECGLGYGFVGSVCR   63 (316)
T ss_pred             CCCceecCCCccccccccCCCCCccccccCCccCCceeEecccccCCCccc
Confidence            45567788888773 2222        012222234456777777776654


No 47 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=90.86  E-value=0.14  Score=34.53  Aligned_cols=29  Identities=31%  Similarity=0.805  Sum_probs=22.5

Q ss_pred             CCCeeeeCCCCccCCCCcccccCCCccCCCC
Q psy11059        315 QHRFNCTCPSGYHGKICEKCFCRPGFAGDHC  345 (429)
Q Consensus       315 ~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~C  345 (429)
                      ...+.|.|+++|.|..|+  +|.++|++..-
T Consensus        15 ~~~G~C~C~~~~~G~~C~--~C~~g~~~~~~   43 (49)
T PF00053_consen   15 PSTGQCVCKPGTTGPRCD--QCKPGYFGLPS   43 (49)
T ss_dssp             ETCEEESBSTTEESTTS---EE-TTEECSTT
T ss_pred             CCCCEEeccccccCCcCc--CCCCccccccC
Confidence            356899999999999999  68899987643


No 48 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=90.56  E-value=0.39  Score=32.57  Aligned_cols=28  Identities=29%  Similarity=0.738  Sum_probs=22.9

Q ss_pred             CCCeeeeCCCCccCCCCcccccCCCccCCC
Q psy11059        315 QHRFNCTCPSGYHGKICEKCFCRPGFAGDH  344 (429)
Q Consensus       315 ~~~~~C~C~~G~~G~~C~~C~C~~g~~g~~  344 (429)
                      ..+.+|.|+++|.|..|+  .|.++|++..
T Consensus        16 ~~~G~C~C~~~~~G~~C~--~C~~g~~~~~   43 (50)
T cd00055          16 PGTGQCECKPNTTGRRCD--RCAPGYYGLP   43 (50)
T ss_pred             CCCCEEeCCCcCCCCCCC--CCCCCCccCC
Confidence            345789999999999999  6788888753


No 49 
>smart00051 DSL delta serrate ligand.
Probab=90.09  E-value=0.44  Score=34.12  Aligned_cols=13  Identities=31%  Similarity=0.726  Sum_probs=6.5

Q ss_pred             EEecCCCCCCCCc
Q psy11059        371 TCVCAPGYSGKEC  383 (429)
Q Consensus       371 ~C~C~~G~~G~~C  383 (429)
                      .++|.+||+|..|
T Consensus        51 ~~~C~~Gw~G~~C   63 (63)
T smart00051       51 NKGCLEGWMGPYC   63 (63)
T ss_pred             CEecCCCCcCCCC
Confidence            3455555555443


No 50 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=89.19  E-value=0.47  Score=31.55  Aligned_cols=25  Identities=32%  Similarity=0.931  Sum_probs=21.9

Q ss_pred             CCeeeeCCCCccCCCCcccccCCCccC
Q psy11059        316 HRFNCTCPSGYHGKICEKCFCRPGFAG  342 (429)
Q Consensus       316 ~~~~C~C~~G~~G~~C~~C~C~~g~~g  342 (429)
                      .+.+|.|+++|+|..|+  .|++||+|
T Consensus        16 ~~G~C~C~~~~~G~~C~--~C~~g~~g   40 (46)
T smart00180       16 DTGQCECKPNVTGRRCD--RCAPGYYG   40 (46)
T ss_pred             CCCEEECCCCCCCCCCC--cCCCCcCC
Confidence            35689999999999999  78999998


No 51 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.13  E-value=0.22  Score=31.25  Aligned_cols=30  Identities=17%  Similarity=0.380  Sum_probs=21.3

Q ss_pred             CCCCCCCCCEeccCCCCceEEecCCCCccc
Q psy11059         15 QRNPCQNGGKCNEDETGNYDCTCDALHTVC   44 (429)
Q Consensus        15 ~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~   44 (429)
                      ...+|..|+.|+....|++.|.|.+||..+
T Consensus         3 ~~~~cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    3 IDTKCPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             cCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence            456899999999555599999999999944


No 52 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=88.22  E-value=0.71  Score=31.27  Aligned_cols=27  Identities=37%  Similarity=0.860  Sum_probs=21.0

Q ss_pred             CCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCC
Q psy11059        247 GNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKN  286 (429)
Q Consensus       247 ~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~  286 (429)
                      .+.+|.|+++|+|..|+             .|.+||++..
T Consensus        17 ~~G~C~C~~~~~G~~C~-------------~C~~g~~~~~   43 (50)
T cd00055          17 GTGQCECKPNTTGRRCD-------------RCAPGYYGLP   43 (50)
T ss_pred             CCCEEeCCCcCCCCCCC-------------CCCCCCccCC
Confidence            45678999999999987             4778887653


No 53 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=85.70  E-value=0.34  Score=32.64  Aligned_cols=27  Identities=33%  Similarity=0.752  Sum_probs=20.2

Q ss_pred             CCeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCCCC
Q psy11059        247 GNYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGGKN  286 (429)
Q Consensus       247 ~~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g~~  286 (429)
                      .+.+|.|.++|+|..|+             .|.++|++..
T Consensus        16 ~~G~C~C~~~~~G~~C~-------------~C~~g~~~~~   42 (49)
T PF00053_consen   16 STGQCVCKPGTTGPRCD-------------QCKPGYFGLP   42 (49)
T ss_dssp             TCEEESBSTTEESTTS--------------EE-TTEECST
T ss_pred             CCCEEeccccccCCcCc-------------CCCCcccccc
Confidence            56789999999999998             4777877653


No 54 
>PHA02887 EGF-like protein; Provisional
Probab=84.72  E-value=0.85  Score=36.39  Aligned_cols=31  Identities=26%  Similarity=0.530  Sum_probs=21.5

Q ss_pred             CCCCCCcEec-cCCCCCeeEeCCCCCcCCCCCC
Q psy11059        233 NPCQNGGKCN-EDETGNYDCTCDALHTGDPCKH  264 (429)
Q Consensus       233 ~~C~~~g~C~-~~~~~~~~C~C~~g~~G~~C~~  264 (429)
                      +.|. +|+|. ........|.|+.||+|..|++
T Consensus        92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             CEee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence            4566 46783 2344567888888888888875


No 55 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.05  E-value=1.1  Score=40.90  Aligned_cols=39  Identities=26%  Similarity=0.623  Sum_probs=31.6

Q ss_pred             cCCCCCCCCCCCCCCCC-CCCCCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059         52 TLGSIHCETPISNQICT-TAPPCLNGATCRPQLTEQLYECVCPPGYK   97 (429)
Q Consensus        52 ~~~G~~C~~~~~~~~C~-~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~   97 (429)
                      .+.+..|+ +++  +|. ..++|.+  .|.  ++.++|.|.|.+||+
T Consensus       178 ~l~~~~C~-~~~--~C~~~~~~c~~--~C~--~~~g~~~c~c~~g~~  217 (224)
T cd01475         178 KFQGKICV-VPD--LCATLSHVCQQ--VCI--STPGSYLCACTEGYA  217 (224)
T ss_pred             hcccccCc-Cch--hhcCCCCCccc--eEE--cCCCCEEeECCCCcc
Confidence            34578897 677  897 4566765  799  999999999999997


No 56 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=83.84  E-value=0.47  Score=29.78  Aligned_cols=26  Identities=35%  Similarity=0.622  Sum_probs=18.8

Q ss_pred             CCCCCCCCeEeeCCCC-CCeeeeCCCCCc
Q psy11059         70 APPCLNGATCRPQLTE-QLYECVCPPGYK   97 (429)
Q Consensus        70 ~~~C~~~g~C~~~~~~-~~~~C~C~~Gy~   97 (429)
                      ...|..++.|+  +.. |+++|.|..||.
T Consensus         4 ~~~cP~NA~C~--~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    4 DTKCPANAGCF--RYDDGSEECRCLLGYK   30 (37)
T ss_dssp             SS---TTEEEE--EETTSEEEEEE-TTEE
T ss_pred             CccCCCCcccE--EcCCCCEEEEeeCCcc
Confidence            46778889999  655 899999999998


No 57 
>PHA02887 EGF-like protein; Provisional
Probab=82.83  E-value=1.2  Score=35.47  Aligned_cols=29  Identities=31%  Similarity=0.918  Sum_probs=21.4

Q ss_pred             CCCCCCEEcc--CCCCeeeeCCCCCCCCCCCC
Q psy11059        394 PCLHGATCID--EVATFSCVCPKGLTGRLCET  423 (429)
Q Consensus       394 ~C~~~~~C~~--~~~~~~C~C~~g~~G~~C~~  423 (429)
                      -|.+ |+|.-  ......|+|.+||+|.+|+.
T Consensus        93 YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         93 FCIN-GECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             EeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence            4654 58854  34567899999999999984


No 58 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=82.25  E-value=1.1  Score=36.32  Aligned_cols=29  Identities=41%  Similarity=1.045  Sum_probs=21.6

Q ss_pred             CCCCCCEEcc--CCCCeeeeCCCCCCCCCCCC
Q psy11059        394 PCLHGATCID--EVATFSCVCPKGLTGRLCET  423 (429)
Q Consensus       394 ~C~~~~~C~~--~~~~~~C~C~~g~~G~~C~~  423 (429)
                      -|.++ +|.-  ....+.|+|..||+|.+||.
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCcccccccc
Confidence            46664 7854  34678899999999999985


No 59 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=82.23  E-value=1.7  Score=28.81  Aligned_cols=24  Identities=38%  Similarity=0.894  Sum_probs=20.0

Q ss_pred             CeeEeCCCCCcCCCCCCCccccCCCCeeeeCCCCCCC
Q psy11059        248 NYDCTCDALHTGDPCKHGSCVDKRAGYFCDCPPTYGG  284 (429)
Q Consensus       248 ~~~C~C~~g~~G~~C~~~~C~~~~~~~~C~C~~G~~g  284 (429)
                      +.+|.|+++|+|..|+             .|.+||+|
T Consensus        17 ~G~C~C~~~~~G~~C~-------------~C~~g~~g   40 (46)
T smart00180       17 TGQCECKPNVTGRRCD-------------RCAPGYYG   40 (46)
T ss_pred             CCEEECCCCCCCCCCC-------------cCCCCcCC
Confidence            5688999999999987             47888877


No 60 
>KOG3516|consensus
Probab=80.46  E-value=1.5  Score=48.37  Aligned_cols=42  Identities=33%  Similarity=0.822  Sum_probs=35.4

Q ss_pred             CCCCCCCCCCCCCCEEccCCCCeeeeCC-CCCCCCCCCCCCCC
Q psy11059        386 NINECESSPCLHGATCIDEVATFSCVCP-KGLTGRLCETNIDD  427 (429)
Q Consensus       386 ~~~~C~~~~C~~~~~C~~~~~~~~C~C~-~g~~G~~C~~~i~~  427 (429)
                      .++.|.+++|.+++.|......|.|.|. .||.|..|...|.|
T Consensus       544 i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e  586 (1306)
T KOG3516|consen  544 ISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYE  586 (1306)
T ss_pred             cccccCCccccCCCcccccccceeEeccccccccccccCCCcc
Confidence            4677888999999999888888999998 89999999877654


No 61 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=79.64  E-value=1.8  Score=35.19  Aligned_cols=31  Identities=29%  Similarity=0.636  Sum_probs=21.9

Q ss_pred             CCCCCcEec-cCCCCCeeEeCCCCCcCCCCCCC
Q psy11059        234 PCQNGGKCN-EDETGNYDCTCDALHTGDPCKHG  265 (429)
Q Consensus       234 ~C~~~g~C~-~~~~~~~~C~C~~g~~G~~C~~~  265 (429)
                      -|.++ +|. ......+.|.|..||+|..|++.
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCcccccccce
Confidence            46554 773 23446788999999999998863


No 62 
>KOG3516|consensus
Probab=78.00  E-value=1.9  Score=47.68  Aligned_cols=47  Identities=32%  Similarity=0.863  Sum_probs=39.4

Q ss_pred             CCCCCCCCCCCCCCCCCEeccCCCCceEEecC-CCCcccccccccCcCCCCCCCCCCCCCCCC
Q psy11059          7 SLSSPCDAQRNPCQNGGKCNEDETGNYDCTCD-ALHTVCCVGLANQTLGSIHCETPISNQICT   68 (429)
Q Consensus         7 ~~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~-~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~   68 (429)
                      -.+|.|  .+|+|.++|.|. -....|.|.|. .||.            |.+|+..+.+..|.
T Consensus       543 ~i~drC--lPN~CehgG~C~-Qs~~~f~C~C~~TGY~------------GatCHtsi~e~SCe  590 (1306)
T KOG3516|consen  543 GISDRC--LPNPCEHGGKCS-QSWDDFECNCELTGYK------------GATCHTSIYELSCE  590 (1306)
T ss_pred             cccccc--CCccccCCCccc-ccccceeEeccccccc------------cccccCCCcchhhH
Confidence            346889  899999999999 58889999998 9999            99999877633454


No 63 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=75.90  E-value=3.4  Score=37.70  Aligned_cols=36  Identities=28%  Similarity=0.733  Sum_probs=26.0

Q ss_pred             CCCccCCCCCCCC--CCCCCCCEEccCCCCeeeeCCCCCCC
Q psy11059        380 GKECSININECES--SPCLHGATCIDEVATFSCVCPKGLTG  418 (429)
Q Consensus       380 G~~C~~~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~g~~G  418 (429)
                      +..|. ++++|..  ++|.  ..|.++.|+|.|.|++||+.
T Consensus       181 ~~~C~-~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         181 GKICV-VPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             cccCc-CchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence            55664 6677753  3454  47888999999999999874


No 64 
>KOG3514|consensus
Probab=74.52  E-value=2  Score=46.90  Aligned_cols=36  Identities=39%  Similarity=0.927  Sum_probs=32.0

Q ss_pred             CCCCCCCCCCCCCEeccCCCCceEEec-CCCCcccccccccCcCCCCCCCCC
Q psy11059         11 PCDAQRNPCQNGGKCNEDETGNYDCTC-DALHTVCCVGLANQTLGSIHCETP   61 (429)
Q Consensus        11 ~C~~~~~~C~~~g~C~~~~~~~~~C~C-~~g~~g~~~~~~~~~~~G~~C~~~   61 (429)
                      .|  .++||+|+|+|. ...+.|.|.| ..||.            |+.||..
T Consensus       625 ~C--~~nPC~N~g~C~-egwNrfiCDCs~T~~~------------G~~CerE  661 (1591)
T KOG3514|consen  625 IC--ESNPCQNGGKCS-EGWNRFICDCSGTGFE------------GRTCERE  661 (1591)
T ss_pred             cc--CCCcccCCCCcc-ccccccccccccCccc------------Cccccce
Confidence            68  899999999999 9999999999 57888            8888854


No 65 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=69.86  E-value=4.5  Score=32.37  Aligned_cols=29  Identities=24%  Similarity=0.639  Sum_probs=23.7

Q ss_pred             CCCCCCCCCCCCeEeeCCCCCCeeeeCCCCCc
Q psy11059         66 ICTTAPPCLNGATCRPQLTEQLYECVCPPGYK   97 (429)
Q Consensus        66 ~C~~~~~C~~~g~C~~~~~~~~~~C~C~~Gy~   97 (429)
                      .|.....|...|.|.  .. ....|.|++||+
T Consensus        79 ~Cd~y~~CG~~g~C~--~~-~~~~C~Cl~GF~  107 (110)
T PF00954_consen   79 QCDVYGFCGPNGICN--SN-NSPKCSCLPGFE  107 (110)
T ss_pred             CCCCccccCCccEeC--CC-CCCceECCCCcC
Confidence            787678999999997  33 456799999996


No 66 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=69.44  E-value=2.3  Score=33.50  Aligned_cols=51  Identities=20%  Similarity=0.537  Sum_probs=33.7

Q ss_pred             CCCCCCCCCCCCCCCEeccCC-----CCceEEecCCCCcccccc-cccCcCCCCCCCC
Q psy11059          9 SSPCDAQRNPCQNGGKCNEDE-----TGNYDCTCDALHTVCCVG-LANQTLGSIHCET   60 (429)
Q Consensus         9 ~~~C~~~~~~C~~~g~C~~~~-----~~~~~C~C~~g~~g~~~~-~~~~~~~G~~C~~   60 (429)
                      .+.|..+++.|++||.|+ ..     ..=|.|.|.+.+.....+ .=...|.|..|+.
T Consensus         5 ~~aC~~~Tn~CsgHG~C~-~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqK   61 (103)
T PF12955_consen    5 NDACENATNNCSGHGSCV-KKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQK   61 (103)
T ss_pred             HHHHHHhccCCCCCceEe-eccCCCccceEEEEeeccccccccccCceeeeccccccc
Confidence            456877899999999999 44     244899999866522100 0112455888873


No 67 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=66.83  E-value=1.6  Score=31.29  Aligned_cols=39  Identities=31%  Similarity=0.724  Sum_probs=19.1

Q ss_pred             CeeEeCCCCCcCCCCCCCccccC---CCCeee------eCCCCCCCCCC
Q psy11059        248 NYDCTCDALHTGDPCKHGSCVDK---RAGYFC------DCPPTYGGKNC  287 (429)
Q Consensus       248 ~~~C~C~~g~~G~~C~~~~C~~~---~~~~~C------~C~~G~~g~~c  287 (429)
                      .++-.|.+.|.|..|.. .|...   .+.|+|      +|.+||+|..|
T Consensus        16 ~~rv~C~~nyyG~~C~~-~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   16 RIRVVCDENYYGPNCSK-FCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             -------TTEETTTT-E-E---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred             EEEEECCCCCCCccccC-CcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence            45678899999998864 35443   344555      57888888765


No 68 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=63.04  E-value=8.2  Score=30.84  Aligned_cols=31  Identities=26%  Similarity=0.598  Sum_probs=24.5

Q ss_pred             CCCCCCCCCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059          9 SSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus         9 ~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      .|+|. ....|+.+|.|. . .....|.|.+||.
T Consensus        77 ~d~Cd-~y~~CG~~g~C~-~-~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   77 KDQCD-VYGFCGPNGICN-S-NNSPKCSCLPGFE  107 (110)
T ss_pred             ccCCC-CccccCCccEeC-C-CCCCceECCCCcC
Confidence            46884 457999999997 3 3456799999997


No 69 
>KOG3514|consensus
Probab=60.09  E-value=6.5  Score=43.19  Aligned_cols=36  Identities=42%  Similarity=1.018  Sum_probs=31.6

Q ss_pred             CCCCCCCCCCCEEccCCCCeeeeCC-CCCCCCCCCCC
Q psy11059        389 ECESSPCLHGATCIDEVATFSCVCP-KGLTGRLCETN  424 (429)
Q Consensus       389 ~C~~~~C~~~~~C~~~~~~~~C~C~-~g~~G~~C~~~  424 (429)
                      .|.++||.++|+|......|.|.|. .+|.|+.|+..
T Consensus       625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~CerE  661 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCERE  661 (1591)
T ss_pred             ccCCCcccCCCCccccccccccccccCcccCccccce
Confidence            6888999999999998899999997 58999999864


No 70 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=57.35  E-value=23  Score=23.84  Aligned_cols=27  Identities=26%  Similarity=0.685  Sum_probs=21.0

Q ss_pred             CCCCCCCCCCCCCCEeccCCCCceEEecCCCCc
Q psy11059         10 SPCDAQRNPCQNGGKCNEDETGNYDCTCDALHT   42 (429)
Q Consensus        10 ~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~   42 (429)
                      +.|. ....|..++.|+     ..+|+|++||.
T Consensus        20 ~~C~-~~~qC~~~s~C~-----~g~C~C~~g~~   46 (52)
T PF01683_consen   20 ESCE-SDEQCIGGSVCV-----NGRCQCPPGYV   46 (52)
T ss_pred             CCCC-CcCCCCCcCEEc-----CCEeECCCCCE
Confidence            4564 455678889998     56999999998


No 71 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=48.60  E-value=6.8  Score=26.87  Aligned_cols=35  Identities=26%  Similarity=0.501  Sum_probs=19.8

Q ss_pred             CCCCCCCCEeccCC---CCceEEecCCCCcccccccccCcCCCCCCCCCC
Q psy11059         16 RNPCQNGGKCNEDE---TGNYDCTCDALHTVCCVGLANQTLGSIHCETPI   62 (429)
Q Consensus        16 ~~~C~~~g~C~~~~---~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~   62 (429)
                      .-+|+.||+-..+.   .|...|+|..-|.            |+.|.+.+
T Consensus        16 ai~CSGHGr~flDg~~~dG~p~CECn~Cy~------------GpdCS~~~   53 (56)
T PF04863_consen   16 AISCSGHGRAFLDGLIADGSPVCECNSCYG------------GPDCSTLI   53 (56)
T ss_dssp             TS--TTSEE--TTS-EETTEE--EE-TTEE------------STTS-EE-
T ss_pred             cCCcCCCCeeeeccccccCCccccccCCcC------------CCCcccCC
Confidence            44788999876432   5678899999999            99997543


No 72 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=34.53  E-value=30  Score=21.29  Aligned_cols=13  Identities=15%  Similarity=0.194  Sum_probs=10.5

Q ss_pred             ceEEecCCCCccc
Q psy11059         32 NYDCTCDALHTVC   44 (429)
Q Consensus        32 ~~~C~C~~g~~g~   44 (429)
                      .++|.|++||..+
T Consensus        17 ~~~C~CPeGyIld   29 (34)
T PF09064_consen   17 PGQCFCPEGYILD   29 (34)
T ss_pred             CCceeCCCceEec
Confidence            4589999999844


No 73 
>KOG3509|consensus
Probab=31.00  E-value=93  Score=34.72  Aligned_cols=71  Identities=31%  Similarity=0.724  Sum_probs=50.6

Q ss_pred             CcCCCCCcCCCCccccCCCceEEecCCCCCCCCccCCCCCCCCCC-CCCCCEEccCCCCeeeeCCCCCCCCCC
Q psy11059        350 DECLSNPCFNGATCQNKINGYTCVCAPGYSGKECSININECESSP-CLHGATCIDEVATFSCVCPKGLTGRLC  421 (429)
Q Consensus       350 ~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~~~-C~~~~~C~~~~~~~~C~C~~g~~G~~C  421 (429)
                      +.|...++...+.|....-...|.|++||+|..|....+.+...+ =...++|....+.....|.++ .|...
T Consensus       407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg-~g~~~  478 (964)
T KOG3509|consen  407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG-AGAPT  478 (964)
T ss_pred             CccccccCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC-CCCcc
Confidence            456667788888888888888999999999999975555554322 223456766655667788888 66554


No 74 
>KOG3509|consensus
Probab=21.15  E-value=1.8e+02  Score=32.51  Aligned_cols=68  Identities=28%  Similarity=0.501  Sum_probs=49.0

Q ss_pred             CCCCCCCCCCCCCCCCEeccCCCCceEEecCCCCcccccccccCcCCCCCCCCCCCCCCCCC-CCCCCCCCeEeeCCCCC
Q psy11059          8 LSSPCDAQRNPCQNGGKCNEDETGNYDCTCDALHTVCCVGLANQTLGSIHCETPISNQICTT-APPCLNGATCRPQLTEQ   86 (429)
Q Consensus         8 ~~~~C~~~~~~C~~~g~C~~~~~~~~~C~C~~g~~g~~~~~~~~~~~G~~C~~~~~~~~C~~-~~~C~~~g~C~~~~~~~   86 (429)
                      ..++|  ...||+..+.|. ...-..+|.|++||+            |..|+...+  .+.. .+-.. .++|.  ...+
T Consensus       405 ~g~~c--~~~p~~~~g~c~-p~~~~~~c~c~~g~~------------G~~c~d~~~--~~~~~~~g~y-~~t~~--~~~~  464 (964)
T KOG3509|consen  405 LGDVC--WRIPCQHDGPCL-QTLEGKQCLCPPGYT------------GDSCEDCMN--GCDRSPNGSY-LGTCV--PIQG  464 (964)
T ss_pred             CCCcc--ccccCCCCcccc-ccccccceecccccc------------CchhhccCc--cccccCCccc-cceEe--ccCC
Confidence            45677  788999999998 888889999999999            888875544  4441 22222 26777  5555


Q ss_pred             CeeeeCCCC
Q psy11059         87 LYECVCPPG   95 (429)
Q Consensus        87 ~~~C~C~~G   95 (429)
                      .....|.+|
T Consensus       465 ~~~~~c~pg  473 (964)
T KOG3509|consen  465 KRCEYCGPG  473 (964)
T ss_pred             CcceeecCC
Confidence            566788888


No 75 
>KOG0196|consensus
Probab=20.45  E-value=1.3e+02  Score=32.77  Aligned_cols=57  Identities=25%  Similarity=0.541  Sum_probs=34.3

Q ss_pred             CeeEeCCCCCcC----CCCCCCccccCCCCeeeeCCCCCCCCCCccCCCCCCCCCCCCCCCeeCCccccCCCCCeeeeCC
Q psy11059        248 NYDCTCDALHTG----DPCKHGSCVDKRAGYFCDCPPTYGGKNCSVELTGCVGPDTCLNGGTCKPYLVDETQHRFNCTCP  323 (429)
Q Consensus       248 ~~~C~C~~g~~G----~~C~~~~C~~~~~~~~C~C~~G~~g~~c~~~~~~C~~~~~C~~~~~C~~~~~~~~~~~~~C~C~  323 (429)
                      ...|.|.+||.-    ..|+             .|++|+.-..-  ....|.   +|..+.      .....++..|.|.
T Consensus       258 iG~C~C~aGye~~~~~~~C~-------------aCp~G~yK~~~--~~~~C~---~CP~~S------~s~~ega~~C~C~  313 (996)
T KOG0196|consen  258 IGGCVCKAGYEEAENGKACQ-------------ACPPGTYKASQ--GDSLCL---PCPPNS------HSSSEGATSCTCE  313 (996)
T ss_pred             cCceeecCCCCcccCCCcce-------------eCCCCcccCCC--CCCCCC---CCCCCC------CCCCCCCCccccc
Confidence            457999999963    3333             68888853321  123343   233332      2236778899999


Q ss_pred             CCccC
Q psy11059        324 SGYHG  328 (429)
Q Consensus       324 ~G~~G  328 (429)
                      .||.-
T Consensus       314 ~gyyR  318 (996)
T KOG0196|consen  314 NGYYR  318 (996)
T ss_pred             CCccc
Confidence            99863


Done!