Query         psy6902
Match_columns 86
No_of_seqs    195 out of 1261
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 21:23:54 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6902.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6902hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1219|consensus               99.5 1.6E-14 3.6E-19  115.9   4.8   69    2-86   3865-3938(4289)
  2 KOG1219|consensus               99.1 1.3E-10 2.9E-15   94.3   4.6   70    2-86   3904-3977(4289)
  3 PF00008 EGF:  EGF-like domain   98.9 9.7E-10 2.1E-14   52.4   2.2   32   50-84      1-32  (32)
  4 KOG4289|consensus               98.8   4E-09 8.7E-14   83.6   3.2   53   18-86   1222-1274(2531)
  5 smart00179 EGF_CA Calcium-bind  98.6 6.8E-08 1.5E-12   47.0   4.0   36   47-86      2-39  (39)
  6 cd00054 EGF_CA Calcium-binding  98.5 3.6E-07 7.7E-12   43.8   4.0   36   47-86      2-38  (38)
  7 KOG4289|consensus               98.2 1.5E-06 3.3E-11   69.6   3.9   72    3-86   1241-1316(2531)
  8 cd00053 EGF Epidermal growth f  98.1 5.9E-06 1.3E-10   38.8   4.0   33   50-86      2-36  (36)
  9 smart00181 EGF Epidermal growt  98.0 1.1E-05 2.4E-10   38.3   3.9   32   50-86      2-35  (35)
 10 PF07645 EGF_CA:  Calcium-bindi  97.9 1.5E-05 3.3E-10   39.8   2.5   31   47-81      2-34  (42)
 11 KOG1214|consensus               97.6 8.7E-05 1.9E-09   57.5   4.4   51   18-83    809-859 (1289)
 12 PF07974 EGF_2:  EGF-like domai  97.3  0.0006 1.3E-08   32.3   3.4   27   53-85      6-32  (32)
 13 KOG3514|consensus               97.3 0.00017 3.7E-09   57.0   2.2   35   49-86    625-659 (1591)
 14 PF12661 hEGF:  Human growth fa  97.1 0.00025 5.3E-09   27.1   1.0   13   72-85      1-13  (13)
 15 KOG3516|consensus               97.1 0.00041   9E-09   55.1   2.7   37   47-86    545-581 (1306)
 16 PHA03099 epidermal growth fact  96.6  0.0019 4.1E-08   39.8   2.6   37   47-86     42-81  (139)
 17 KOG1217|consensus               96.6    0.01 2.2E-07   41.8   6.4   35   47-85    271-306 (487)
 18 KOG3516|consensus               96.3  0.0034 7.4E-08   50.2   3.1   35   49-86    957-991 (1306)
 19 PHA02887 EGF-like protein; Pro  96.3  0.0055 1.2E-07   37.2   3.0   37   47-86     83-122 (126)
 20 PF12947 EGF_3:  EGF domain;  I  96.1  0.0047   1E-07   29.9   1.8   26   54-83      7-32  (36)
 21 KOG1225|consensus               95.9    0.02 4.4E-07   42.6   5.0   26   53-86    316-341 (525)
 22 KOG1217|consensus               95.7   0.016 3.6E-07   40.8   3.9   36   47-86    169-206 (487)
 23 KOG1225|consensus               95.0   0.044 9.4E-07   40.9   4.3   61    3-86    246-310 (525)
 24 PF12662 cEGF:  Complement Clr-  94.1    0.05 1.1E-06   24.0   1.8   11   70-81      1-11  (24)
 25 KOG1836|consensus               91.4    0.41 8.8E-06   40.4   4.8   68   13-86    740-812 (1705)
 26 PF14670 FXa_inhibition:  Coagu  89.6    0.29 6.3E-06   23.5   1.6   18   60-81     11-28  (36)
 27 PF00053 Laminin_EGF:  Laminin   89.4    0.42   9E-06   24.1   2.2   22   59-86     11-32  (49)
 28 smart00051 DSL delta serrate l  89.4    0.56 1.2E-05   25.3   2.8   24   55-85     40-63  (63)
 29 KOG1214|consensus               89.0    0.74 1.6E-05   36.7   4.2   60   20-83    744-820 (1289)
 30 PF12946 EGF_MSP1_1:  MSP1 EGF   85.4    0.81 1.8E-05   22.3   1.8   29   50-81      2-30  (37)
 31 cd01475 vWA_Matrilin VWA_Matri  85.0     1.5 3.3E-05   28.7   3.6   30   47-82    187-218 (224)
 32 KOG1226|consensus               82.2     2.3 4.9E-05   33.4   3.9   33   47-86    585-620 (783)
 33 cd00055 EGF_Lam Laminin-type e  81.9     1.8 3.9E-05   21.9   2.4   15   71-86     19-33  (50)
 34 KOG0994|consensus               77.2     5.9 0.00013   33.0   4.8   48   37-86    889-948 (1758)
 35 smart00180 EGF_Lam Laminin-typ  75.6     2.6 5.7E-05   21.0   1.8   15   71-86     18-32  (46)
 36 KOG4260|consensus               75.6     4.5 9.7E-05   28.4   3.4   33   52-85    149-181 (350)
 37 KOG1226|consensus               73.1     5.8 0.00013   31.2   3.7   40   34-86    538-580 (783)
 38 PF06247 Plasmod_Pvs28:  Plasmo  71.4     1.1 2.4E-05   29.5  -0.3   64    7-81      6-79  (197)
 39 KOG4260|consensus               68.2     3.8 8.3E-05   28.7   1.7   33   46-82    235-269 (350)
 40 PF00954 S_locus_glycop:  S-loc  65.0      10 0.00022   22.1   3.0   32   47-83     77-109 (110)
 41 KOG3509|consensus               64.2     6.1 0.00013   32.0   2.3   36   47-86    406-441 (964)
 42 PF12955 DUF3844:  Domain of un  63.9      10 0.00023   22.5   2.8   26   52-78     12-39  (103)
 43 PF01414 DSL:  Delta serrate li  63.2     2.6 5.6E-05   22.7   0.1   11   74-85     53-63  (63)
 44 KOG0994|consensus               59.4      16 0.00035   30.7   3.9   30   13-46   1079-1108(1758)
 45 KOG3514|consensus               53.0      13 0.00029   30.9   2.5   37   47-86   1018-1054(1591)
 46 PF09064 Tme5_EGF_like:  Thromb  50.1      15 0.00033   17.4   1.6   24    4-27      3-27  (34)
 47 PF12020 TAFA:  TAFA family;  I  41.2      21 0.00045   20.6   1.5   25   49-74     56-80  (90)
 48 PF01683 EB:  EB module;  Inter  40.0      42  0.0009   16.7   2.5   20   54-81     27-46  (52)
 49 KOG1836|consensus               38.3      82  0.0018   27.5   5.0   35    3-44    776-820 (1705)
 50 KOG3512|consensus               28.6 1.2E+02  0.0026   23.2   4.1   68    3-85    396-477 (592)

No 1  
>KOG1219|consensus
Probab=99.50  E-value=1.6e-14  Score=115.87  Aligned_cols=69  Identities=32%  Similarity=0.925  Sum_probs=62.3

Q ss_pred             CCCCCCCCCCC--CeeC---CccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCC
Q psy6902           2 SACDPAHVRPG--CRIQ---DNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCS   76 (86)
Q Consensus         2 ~~C~~~~c~~g--C~~~---~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~   76 (86)
                      .+|..+||+||  |...   .|.|.|++.       |+|.+||.+     +.+|.++||++||+|+...   ++|.|.|+
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsq-------ysG~~CEi~-----~epC~snPC~~GgtCip~~---n~f~CnC~ 3929 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQ-------YSGNHCEID-----LEPCASNPCLTGGTCIPFY---NGFLCNCP 3929 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCccc-------ccCcccccc-----cccccCCCCCCCCEEEecC---CCeeEeCC
Confidence            47999999999  8763   399999999       489999999     8999999999999999875   49999999


Q ss_pred             CCCcCCCCCC
Q psy6902          77 GTGYYGKKCQ   86 (86)
Q Consensus        77 ~~g~~G~~Ce   86 (86)
                       .||+|++||
T Consensus      3930 -~gyTG~~Ce 3938 (4289)
T KOG1219|consen 3930 -NGYTGKRCE 3938 (4289)
T ss_pred             -CCccCceee
Confidence             899999997


No 2  
>KOG1219|consensus
Probab=99.07  E-value=1.3e-10  Score=94.32  Aligned_cols=70  Identities=27%  Similarity=0.688  Sum_probs=60.8

Q ss_pred             CCCCCCCCCCC--CeeC--CccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCC
Q psy6902           2 SACDPAHVRPG--CRIQ--DNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSG   77 (86)
Q Consensus         2 ~~C~~~~c~~g--C~~~--~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~   77 (86)
                      .+|.++||++|  |...  .|.|.|+.+       |||.+|+..-    +++|..++|.+||.|++..   +.|.|.|. 
T Consensus      3904 epC~snPC~~GgtCip~~n~f~CnC~~g-------yTG~~Ce~~G----i~eCs~n~C~~gg~C~n~~---gsf~CncT- 3968 (4289)
T KOG1219|consen 3904 EPCASNPCLTGGTCIPFYNGFLCNCPNG-------YTGKRCEARG----ISECSKNVCGTGGQCINIP---GSFHCNCT- 3968 (4289)
T ss_pred             ccccCCCCCCCCEEEecCCCeeEeCCCC-------ccCceeeccc----ccccccccccCCceeeccC---CceEeccC-
Confidence            36999999999  8763  499999999       4999998641    8999999999999999985   48999999 


Q ss_pred             CCcCCCCCC
Q psy6902          78 TGYYGKKCQ   86 (86)
Q Consensus        78 ~g~~G~~Ce   86 (86)
                      +||.|+.|+
T Consensus      3969 ~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3969 PGILGRTCC 3977 (4289)
T ss_pred             hhHhcccCc
Confidence            999999873


No 3  
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.90  E-value=9.7e-10  Score=52.41  Aligned_cols=32  Identities=47%  Similarity=1.216  Sum_probs=27.9

Q ss_pred             CCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCC
Q psy6902          50 CDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKK   84 (86)
Q Consensus        50 C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~   84 (86)
                      |.++||+|+|+|++..  .+.|+|.|+ +||+|++
T Consensus         1 C~~~~C~n~g~C~~~~--~~~y~C~C~-~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLP--GGGYTCECP-PGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEES--TSEEEEEEB-TTEESTT
T ss_pred             CCCCcCCCCeEEEeCC--CCCEEeECC-CCCccCC
Confidence            6788999999999885  258999999 8999975


No 4  
>KOG4289|consensus
Probab=98.78  E-value=4e-09  Score=83.61  Aligned_cols=53  Identities=36%  Similarity=0.950  Sum_probs=48.6

Q ss_pred             ccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          18 NQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        18 ~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      +.|.||+|       |||..|++.     +|.|.++||.|+|+|....   ++|+|.|. +||+|.+||
T Consensus      1222 lrCrCPpG-------FTgd~CeTe-----iDlCYs~pC~nng~C~srE---ggYtCeCr-pg~tGehCE 1274 (2531)
T KOG4289|consen 1222 LRCRCPPG-------FTGDYCETE-----IDLCYSGPCGNNGRCRSRE---GGYTCECR-PGFTGEHCE 1274 (2531)
T ss_pred             eeEeCCCC-------CCcccccch-----hHhhhcCCCCCCCceEEec---CceeEEec-CCcccccee
Confidence            78999999       388899998     9999999999999999874   59999999 999999997


No 5  
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.64  E-value=6.8e-08  Score=46.97  Aligned_cols=36  Identities=42%  Similarity=1.227  Sum_probs=31.7

Q ss_pred             cccCCC-CCCcCCceEecCCCCCCCeEEeCCCCCcC-CCCCC
Q psy6902          47 QNACDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYY-GKKCQ   86 (86)
Q Consensus        47 ~~~C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~-G~~Ce   86 (86)
                      +++|.. +||.++++|++..   ++|.|.|+ +||+ |..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~---g~~~C~C~-~g~~~g~~C~   39 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTV---GSYRCECP-PGYTDGRNCE   39 (39)
T ss_pred             cccCcCCCCcCCCCEeECCC---CCeEeECC-CCCccCCcCC
Confidence            678888 8999999999874   48999999 8999 99886


No 6  
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.47  E-value=3.6e-07  Score=43.80  Aligned_cols=36  Identities=42%  Similarity=1.221  Sum_probs=31.2

Q ss_pred             cccCCC-CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      +++|.. .||.+++.|++..   ++|.|.|+ .||.|+.|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~---~~~~C~C~-~g~~g~~C~   38 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTV---GSYRCSCP-PGYTGRNCE   38 (38)
T ss_pred             cccCCCCCCcCCCCEeECCC---CCeEeECC-CCCcCCcCC
Confidence            578887 7999999999875   48999999 899999885


No 7  
>KOG4289|consensus
Probab=98.20  E-value=1.5e-06  Score=69.61  Aligned_cols=72  Identities=26%  Similarity=0.721  Sum_probs=55.6

Q ss_pred             CCCCCCCCCC--Cee--CCccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCC
Q psy6902           3 ACDPAHVRPG--CRI--QDNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGT   78 (86)
Q Consensus         3 ~C~~~~c~~g--C~~--~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~   78 (86)
                      .|-+.||.++  |..  ..|.|.|.++    |   +|.+||...   +.-.|.+.-|+|||+|++...  ++|.|+|+..
T Consensus      1241 lCYs~pC~nng~C~srEggYtCeCrpg----~---tGehCEvs~---~agrCvpGvC~nggtC~~~~n--ggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1241 LCYSGPCGNNGRCRSREGGYTCECRPG----F---TGEHCEVSA---RAGRCVPGVCKNGGTCVNLLN--GGFCCHCPYG 1308 (2531)
T ss_pred             hhhcCCCCCCCceEEecCceeEEecCC----c---cccceeeec---ccCccccceecCCCEEeecCC--CceeccCCCc
Confidence            4667888876  865  4499999888    4   889998761   123599999999999998863  6899999944


Q ss_pred             CcCCCCCC
Q psy6902          79 GYYGKKCQ   86 (86)
Q Consensus        79 g~~G~~Ce   86 (86)
                      .|.+++||
T Consensus      1309 e~e~prC~ 1316 (2531)
T KOG4289|consen 1309 EFEDPRCE 1316 (2531)
T ss_pred             ccCCCceE
Confidence            57788775


No 8  
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=98.14  E-value=5.9e-06  Score=38.81  Aligned_cols=33  Identities=45%  Similarity=1.241  Sum_probs=27.9

Q ss_pred             CC-CCCCcCCceEecCCCCCCCeEEeCCCCCcCCC-CCC
Q psy6902          50 CD-TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGK-KCQ   86 (86)
Q Consensus        50 C~-~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~-~Ce   86 (86)
                      |. .++|.+++.|++..   ++|.|.|+ .||.|. .|+
T Consensus         2 C~~~~~C~~~~~C~~~~---~~~~C~C~-~g~~g~~~C~   36 (36)
T cd00053           2 CAASNPCSNGGTCVNTP---GSYRCVCP-PGYTGDRSCE   36 (36)
T ss_pred             CCCCCCCCCCCEEecCC---CCeEeECC-CCCcccCCcC
Confidence            44 67899999999875   48999999 899998 775


No 9  
>smart00181 EGF Epidermal growth factor-like domain.
Probab=98.05  E-value=1.1e-05  Score=38.26  Aligned_cols=32  Identities=44%  Similarity=1.216  Sum_probs=27.0

Q ss_pred             CCC-CCCcCCceEecCCCCCCCeEEeCCCCCcCC-CCCC
Q psy6902          50 CDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYYG-KKCQ   86 (86)
Q Consensus        50 C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G-~~Ce   86 (86)
                      |.. ++|.++ .|++..   ++|.|.|+ .||.| +.|+
T Consensus         2 C~~~~~C~~~-~C~~~~---~~~~C~C~-~g~~g~~~C~   35 (35)
T smart00181        2 CASGGPCSNG-TCINTP---GSYTCSCP-PGYTGDKRCE   35 (35)
T ss_pred             CCCcCCCCCC-EEECCC---CCeEeECC-CCCccCCccC
Confidence            556 689998 999874   48999999 89999 8875


No 10 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.85  E-value=1.5e-05  Score=39.81  Aligned_cols=31  Identities=32%  Similarity=0.854  Sum_probs=26.7

Q ss_pred             cccCCCC--CCcCCceEecCCCCCCCeEEeCCCCCcC
Q psy6902          47 QNACDTN--PCYHGGRCTQTFQDRTGYKCRCSGTGYY   81 (86)
Q Consensus        47 ~~~C~~~--PC~ngg~C~~~~~~~~~~~C~C~~~g~~   81 (86)
                      ||||...  +|..++.|+++.   ++|.|.|+ +||.
T Consensus         2 idEC~~~~~~C~~~~~C~N~~---Gsy~C~C~-~Gy~   34 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTCVNTE---GSYSCSCP-PGYE   34 (42)
T ss_dssp             SSTTTTTSSSSSTTSEEEEET---TEEEEEES-TTEE
T ss_pred             ccccCCCCCcCCCCCEEEcCC---CCEEeeCC-CCcE
Confidence            7888764  598899999985   59999999 8997


No 11 
>KOG1214|consensus
Probab=97.61  E-value=8.7e-05  Score=57.46  Aligned_cols=51  Identities=29%  Similarity=0.801  Sum_probs=40.9

Q ss_pred             ccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCC
Q psy6902          18 NQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGK   83 (86)
Q Consensus        18 ~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~   83 (86)
                      |.|.|.+|+.+     .|..|.      ++|+|.++-|...++|.+..   ++|.|.|. +||.|.
T Consensus       809 y~C~CLPGfsG-----DG~~c~------dvDeC~psrChp~A~Cyntp---gsfsC~C~-pGy~GD  859 (1289)
T KOG1214|consen  809 YSCACLPGFSG-----DGHQCT------DVDECSPSRCHPAATCYNTP---GSFSCRCQ-PGYYGD  859 (1289)
T ss_pred             EEEeecCCccC-----Cccccc------cccccCccccCCCceEecCC---Ccceeecc-cCccCC
Confidence            88888888544     223443      37999999999999999985   59999999 999974


No 12 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.27  E-value=0.0006  Score=32.28  Aligned_cols=27  Identities=37%  Similarity=0.942  Sum_probs=23.1

Q ss_pred             CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902          53 NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC   85 (86)
Q Consensus        53 ~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C   85 (86)
                      ..|.+.|+|+..     ..+|.|. +||+|+.|
T Consensus         6 ~~C~~~G~C~~~-----~g~C~C~-~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP-----CGRCVCD-SGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCC-----CCEEECC-CCCcCCCC
Confidence            469999999975     4689999 89999987


No 13 
>KOG3514|consensus
Probab=97.26  E-value=0.00017  Score=56.98  Aligned_cols=35  Identities=43%  Similarity=1.319  Sum_probs=32.5

Q ss_pred             cCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          49 ACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        49 ~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .|.++||+|+|.|.+..+   .|.|.|.+.+|.|+.||
T Consensus       625 ~C~~nPC~N~g~C~egwN---rfiCDCs~T~~~G~~Ce  659 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWN---RFICDCSGTGFEGRTCE  659 (1591)
T ss_pred             ccCCCcccCCCCcccccc---ccccccccCcccCcccc
Confidence            799999999999999874   89999998999999996


No 14 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.12  E-value=0.00025  Score=27.11  Aligned_cols=13  Identities=38%  Similarity=1.336  Sum_probs=10.6

Q ss_pred             EEeCCCCCcCCCCC
Q psy6902          72 KCRCSGTGYYGKKC   85 (86)
Q Consensus        72 ~C~C~~~g~~G~~C   85 (86)
                      +|.|+ +||+|++|
T Consensus         1 ~C~C~-~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCP-PGWTGPNC   13 (13)
T ss_dssp             EEEE--TTEETTTT
T ss_pred             CccCc-CCCcCCCC
Confidence            58999 89999987


No 15 
>KOG3516|consensus
Probab=97.07  E-value=0.00041  Score=55.12  Aligned_cols=37  Identities=43%  Similarity=1.192  Sum_probs=33.1

Q ss_pred             cccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      +|.|.+|||+|||.|.+...   .|.|.|...||.|..|+
T Consensus       545 ~drClPN~CehgG~C~Qs~~---~f~C~C~~TGY~GatCH  581 (1306)
T KOG3516|consen  545 SDRCLPNPCEHGGKCSQSWD---DFECNCELTGYKGATCH  581 (1306)
T ss_pred             ccccCCccccCCCccccccc---ceeEecccccccccccc
Confidence            68899999999999999754   89999987899999885


No 16 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=96.62  E-value=0.0019  Score=39.76  Aligned_cols=37  Identities=41%  Similarity=0.952  Sum_probs=28.0

Q ss_pred             cccCCC---CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDT---NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~---~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      +.+|.+   +-|.|| .|.-... ...+.|+|. .||+|.+||
T Consensus        42 i~~Cp~ey~~YClHG-~C~yI~d-l~~~~CrC~-~GYtGeRCE   81 (139)
T PHA03099         42 IRLCGPEGDGYCLHG-DCIHARD-IDGMYCRCS-HGYTGIRCQ   81 (139)
T ss_pred             cccCChhhCCEeECC-EEEeecc-CCCceeECC-CCccccccc
Confidence            344543   469997 8986543 457999999 899999997


No 17 
>KOG1217|consensus
Probab=96.58  E-value=0.01  Score=41.77  Aligned_cols=35  Identities=31%  Similarity=0.970  Sum_probs=30.1

Q ss_pred             cccCCCCC-CcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902          47 QNACDTNP-CYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC   85 (86)
Q Consensus        47 ~~~C~~~P-C~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C   85 (86)
                      ++.|...+ |.++++|++...   .|.|.|+ +||+|..|
T Consensus       271 ~~~C~~~~~c~~~~~C~~~~~---~~~C~C~-~g~~g~~~  306 (487)
T KOG1217|consen  271 VDSCALIASCPNGGTCVNVPG---SYRCTCP-PGFTGRLC  306 (487)
T ss_pred             ccccCCCCccCCCCeeecCCC---cceeeCC-CCCCCCCC
Confidence            78898875 999999999853   5999999 99999875


No 18 
>KOG3516|consensus
Probab=96.35  E-value=0.0034  Score=50.20  Aligned_cols=35  Identities=43%  Similarity=1.191  Sum_probs=31.1

Q ss_pred             cCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          49 ACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        49 ~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      -|.+.||+|||.|++...   +|+|.|....|.|..|.
T Consensus       957 hCss~~C~NGG~Cvery~---gytCDCs~Tay~Gp~Cs  991 (1306)
T KOG3516|consen  957 HCSSYPCLNGGHCVERYD---GYTCDCSRTAYDGPFCS  991 (1306)
T ss_pred             ccccccccCCCEEEEecC---ceeeccccCcCCCCccc
Confidence            588999999999999864   99999987899999883


No 19 
>PHA02887 EGF-like protein; Provisional
Probab=96.26  E-value=0.0055  Score=37.19  Aligned_cols=37  Identities=27%  Similarity=0.775  Sum_probs=28.3

Q ss_pred             cccCCC---CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDT---NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~---~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .++|..   +-|.| |+|.-... .....|.|. .||+|.+||
T Consensus        83 f~pC~~eyk~YCiH-G~C~yI~d-L~epsCrC~-~GYtG~RCE  122 (126)
T PHA02887         83 FEKCKNDFNDFCIN-GECMNIID-LDEKFCICN-KGYTGIRCD  122 (126)
T ss_pred             ccccChHhhCEeeC-CEEEcccc-CCCceeECC-CCcccCCCC
Confidence            456754   46886 59986554 457899999 899999996


No 20 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.09  E-value=0.0047  Score=29.90  Aligned_cols=26  Identities=38%  Similarity=0.876  Sum_probs=19.2

Q ss_pred             CCcCCceEecCCCCCCCeEEeCCCCCcCCC
Q psy6902          54 PCYHGGRCTQTFQDRTGYKCRCSGTGYYGK   83 (86)
Q Consensus        54 PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~   83 (86)
                      .|...++|++...   +|+|.|+ +||.|.
T Consensus         7 ~C~~nA~C~~~~~---~~~C~C~-~Gy~Gd   32 (36)
T PF12947_consen    7 GCHPNATCTNTGG---SYTCTCK-PGYEGD   32 (36)
T ss_dssp             GS-TTCEEEE-TT---SEEEEE--CEEECC
T ss_pred             CCCCCcEeecCCC---CEEeECC-CCCccC
Confidence            4777789999853   8999999 899874


No 21 
>KOG1225|consensus
Probab=95.87  E-value=0.02  Score=42.65  Aligned_cols=26  Identities=35%  Similarity=0.974  Sum_probs=16.6

Q ss_pred             CCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          53 NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        53 ~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      ..|.+.|.|++.       .|.|. +||+|..|+
T Consensus       316 adC~g~G~Ci~G-------~C~C~-~Gy~G~~C~  341 (525)
T KOG1225|consen  316 ADCSGHGKCIDG-------ECLCD-EGYTGELCI  341 (525)
T ss_pred             ccCCCCCcccCC-------ceEeC-CCCcCCccc
Confidence            566777777632       46676 677776663


No 22 
>KOG1217|consensus
Probab=95.72  E-value=0.016  Score=40.75  Aligned_cols=36  Identities=39%  Similarity=1.042  Sum_probs=29.7

Q ss_pred             cccCC--CCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACD--TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~--~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .++|.  ..+|++++.|.+...   .|.|.|+ ++|++..|+
T Consensus       169 ~~~C~~~~~~c~~~~~C~~~~~---~~~C~c~-~~~~~~~~~  206 (487)
T KOG1217|consen  169 LDECIQYSSPCQNGGTCVNTGG---SYLCSCP-PGYTGSTCE  206 (487)
T ss_pred             ccccccCCCCcCCCcccccCCC---CeeEeCC-CCccCCcCc
Confidence            46887  456999999999854   6999999 899998764


No 23 
>KOG1225|consensus
Probab=95.00  E-value=0.044  Score=40.94  Aligned_cols=61  Identities=36%  Similarity=0.990  Sum_probs=39.6

Q ss_pred             CCCCCCCCCCCee----CCccCCCCCCccCCcccccccccccccCCcccccCCCCCCcCCceEecCCCCCCCeEEeCCCC
Q psy6902           3 ACDPAHVRPGCRI----QDNQCICGKGCYPEYKFITQEECMTALKGKRQNACDTNPCYHGGRCTQTFQDRTGYKCRCSGT   78 (86)
Q Consensus         3 ~C~~~~c~~gC~~----~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~   78 (86)
                      .|+...|.++|..    ....|.|+++    |   +|..|+..       .|... |..++.+++      + .|.|. +
T Consensus       246 ~c~~~~C~~~c~~~g~c~~G~CIC~~G----f---~G~dC~e~-------~Cp~~-cs~~g~~~~------g-~CiC~-~  302 (525)
T KOG1225|consen  246 LCSTIYCPGGCTGRGQCVEGRCICPPG----F---TGDDCDEL-------VCPVD-CSGGGVCVD------G-ECICN-P  302 (525)
T ss_pred             ccccccCCCCCcccceEeCCeEeCCCC----C---cCCCCCcc-------cCCcc-cCCCceecC------C-EeecC-C
Confidence            4556666666642    3467888887    3   67777643       46544 766666664      3 78888 7


Q ss_pred             CcCCCCCC
Q psy6902          79 GYYGKKCQ   86 (86)
Q Consensus        79 g~~G~~Ce   86 (86)
                      +|+|+.|+
T Consensus       303 g~~G~dCs  310 (525)
T KOG1225|consen  303 GYSGKDCS  310 (525)
T ss_pred             Cccccccc
Confidence            88887763


No 24 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=94.09  E-value=0.05  Score=24.02  Aligned_cols=11  Identities=45%  Similarity=1.340  Sum_probs=9.5

Q ss_pred             CeEEeCCCCCcC
Q psy6902          70 GYKCRCSGTGYY   81 (86)
Q Consensus        70 ~~~C~C~~~g~~   81 (86)
                      +|+|.|+ +||.
T Consensus         1 sy~C~C~-~Gy~   11 (24)
T PF12662_consen    1 SYTCSCP-PGYQ   11 (24)
T ss_pred             CEEeeCC-CCCc
Confidence            5899999 8996


No 25 
>KOG1836|consensus
Probab=91.42  E-value=0.41  Score=40.41  Aligned_cols=68  Identities=29%  Similarity=0.759  Sum_probs=45.1

Q ss_pred             CeeCCccCCCCCCccCCcccccccccccccCCcc----cccCCCCCCcCCceEecCCCCCCCeEEe-CCCCCcCCCCCC
Q psy6902          13 CRIQDNQCICGKGCYPEYKFITQEECMTALKGKR----QNACDTNPCYHGGRCTQTFQDRTGYKCR-CSGTGYYGKKCQ   86 (86)
Q Consensus        13 C~~~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~----~~~C~~~PC~ngg~C~~~~~~~~~~~C~-C~~~g~~G~~Ce   86 (86)
                      |......|.|...-   ++..- ..|..+|.+..    ...|.+=||.+++.|..... .....|. |+ +||+|.+|+
T Consensus       740 Cd~~tG~C~C~~~t---~G~~C-~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~-~~~~iCk~Cp-~gytG~rCe  812 (1705)
T KOG1836|consen  740 CDPRTGQCKCKHNT---FGGQC-AQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPE-ILEVVCKNCP-PGYTGLRCE  812 (1705)
T ss_pred             ccCCCCceecccCC---CCCch-hhhcCCCCCccccCCCCCCccCCCCCChhhcCcCc-ccceecCCCC-CCCcccccc
Confidence            54444566655431   11111 34567777653    22399999999999998753 3468898 99 999999985


No 26 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.62  E-value=0.29  Score=23.54  Aligned_cols=18  Identities=39%  Similarity=0.997  Sum_probs=13.9

Q ss_pred             eEecCCCCCCCeEEeCCCCCcC
Q psy6902          60 RCTQTFQDRTGYKCRCSGTGYY   81 (86)
Q Consensus        60 ~C~~~~~~~~~~~C~C~~~g~~   81 (86)
                      +|++..   ++|+|.|+ +||+
T Consensus        11 ~C~~~~---g~~~C~C~-~Gy~   28 (36)
T PF14670_consen   11 ICVNTP---GSYRCSCP-PGYK   28 (36)
T ss_dssp             EEEEET---TSEEEE-S-TTEE
T ss_pred             CCccCC---CceEeECC-CCCE
Confidence            688774   48999999 8996


No 27 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=89.41  E-value=0.42  Score=24.10  Aligned_cols=22  Identities=27%  Similarity=0.804  Sum_probs=16.2

Q ss_pred             ceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          59 GRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        59 g~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .+|...     ...|.|+ ++|+|++|+
T Consensus        11 ~~C~~~-----~G~C~C~-~~~~G~~C~   32 (49)
T PF00053_consen   11 QTCDPS-----TGQCVCK-PGTTGPRCD   32 (49)
T ss_dssp             SSEEET-----CEEESBS-TTEESTTS-
T ss_pred             CcccCC-----CCEEecc-ccccCCcCc
Confidence            356653     5689999 899999885


No 28 
>smart00051 DSL delta serrate ligand.
Probab=89.38  E-value=0.56  Score=25.31  Aligned_cols=24  Identities=21%  Similarity=0.545  Sum_probs=17.1

Q ss_pred             CcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902          55 CYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC   85 (86)
Q Consensus        55 C~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C   85 (86)
                      +..+.+|...      =.+.|+ +||+|..|
T Consensus        40 ~~~~~~Cd~~------G~~~C~-~Gw~G~~C   63 (63)
T smart00051       40 FFGHYTCDEN------GNKGCL-EGWMGPYC   63 (63)
T ss_pred             ccCCccCCcC------CCEecC-CCCcCCCC
Confidence            4566777432      247799 89999987


No 29 
>KOG1214|consensus
Probab=89.01  E-value=0.74  Score=36.73  Aligned_cols=60  Identities=27%  Similarity=0.546  Sum_probs=32.0

Q ss_pred             CCCCCCccCCcccccccccccccCCcc--------cccCCCCCCcCC-------c--eEecCCCCCCCeEEeCCCCCcCC
Q psy6902          20 CICGKGCYPEYKFITQEECMTALKGKR--------QNACDTNPCYHG-------G--RCTQTFQDRTGYKCRCSGTGYYG   82 (86)
Q Consensus        20 C~c~~~c~~~~~~~~g~~C~~~~~~~~--------~~~C~~~PC~ng-------g--~C~~~~~~~~~~~C~C~~~g~~G   82 (86)
                      |--.+.|++.-+.|. +.|+.+|...+        .+.=-.+||..+       |  .|+...  .+.|+|.|. +||.|
T Consensus       744 CGp~s~Cin~pg~~r-ceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hG--gs~y~C~CL-PGfsG  819 (1289)
T KOG1214|consen  744 CGPNSVCINLPGSYR-CECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHG--GSTYSCACL-PGFSG  819 (1289)
T ss_pred             CCCCceeecCCCcee-EEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecC--CceEEEeec-CCccC
Confidence            333345555554555 56665543221        122223455544       3  344443  247999999 99998


Q ss_pred             C
Q psy6902          83 K   83 (86)
Q Consensus        83 ~   83 (86)
                      .
T Consensus       820 D  820 (1289)
T KOG1214|consen  820 D  820 (1289)
T ss_pred             C
Confidence            4


No 30 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=85.37  E-value=0.81  Score=22.25  Aligned_cols=29  Identities=28%  Similarity=0.630  Sum_probs=19.2

Q ss_pred             CCCCCCcCCceEecCCCCCCCeEEeCCCCCcC
Q psy6902          50 CDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYY   81 (86)
Q Consensus        50 C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~   81 (86)
                      |...+|..++.|+...+  +.+.|+|. .||.
T Consensus         2 C~~~~cP~NA~C~~~~d--G~eecrCl-lgyk   30 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDD--GSEECRCL-LGYK   30 (37)
T ss_dssp             -SSS---TTEEEEEETT--SEEEEEE--TTEE
T ss_pred             ccCccCCCCcccEEcCC--CCEEEEee-CCcc
Confidence            55667888899998752  57999999 9996


No 31 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.99  E-value=1.5  Score=28.70  Aligned_cols=30  Identities=33%  Similarity=0.699  Sum_probs=22.4

Q ss_pred             cccCCCC--CCcCCceEecCCCCCCCeEEeCCCCCcCC
Q psy6902          47 QNACDTN--PCYHGGRCTQTFQDRTGYKCRCSGTGYYG   82 (86)
Q Consensus        47 ~~~C~~~--PC~ngg~C~~~~~~~~~~~C~C~~~g~~G   82 (86)
                      .++|...  +|.  ..|.+..   ++|.|.|+ .||+.
T Consensus       187 ~~~C~~~~~~c~--~~C~~~~---g~~~c~c~-~g~~~  218 (224)
T cd01475         187 PDLCATLSHVCQ--QVCISTP---GSYLCACT-EGYAL  218 (224)
T ss_pred             chhhcCCCCCcc--ceEEcCC---CCEEeECC-CCccC
Confidence            5677543  465  4799875   48999999 89975


No 32 
>KOG1226|consensus
Probab=82.23  E-value=2.3  Score=33.38  Aligned_cols=33  Identities=27%  Similarity=0.913  Sum_probs=21.0

Q ss_pred             cccCCCC---CCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDTN---PCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~~---PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .+.|.+.   -|...|+|.-.       +|.|..++|.|..||
T Consensus       585 td~C~~~~G~iCSGrG~C~Cg-------~C~C~~~~~sG~~CE  620 (783)
T KOG1226|consen  585 TDTCESSDGQICSGRGTCECG-------RCKCTDPPYSGEFCE  620 (783)
T ss_pred             CccccCCCCceeCCCceeeCC-------ceEcCCCCcCcchhh
Confidence            6777653   25555555532       578885559998886


No 33 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=81.94  E-value=1.8  Score=21.88  Aligned_cols=15  Identities=27%  Similarity=0.935  Sum_probs=10.8

Q ss_pred             eEEeCCCCCcCCCCCC
Q psy6902          71 YKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        71 ~~C~C~~~g~~G~~Ce   86 (86)
                      -+|.|+ ++++|..|+
T Consensus        19 G~C~C~-~~~~G~~C~   33 (50)
T cd00055          19 GQCECK-PNTTGRRCD   33 (50)
T ss_pred             CEEeCC-CcCCCCCCC
Confidence            467787 788887764


No 34 
>KOG0994|consensus
Probab=77.19  E-value=5.9  Score=33.03  Aligned_cols=48  Identities=27%  Similarity=0.758  Sum_probs=31.8

Q ss_pred             ccccccCCcc----cccCCCCCCcCCc--------eEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          37 ECMTALKGKR----QNACDTNPCYHGG--------RCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        37 ~C~~~~~~~~----~~~C~~~PC~ngg--------~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      +|..+|.|..    -..|.+=||..+.        .|..... .....|+|. .||+|.+|+
T Consensus       889 rCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~-t~~ivC~C~-~GY~G~RCe  948 (1758)
T KOG0994|consen  889 RCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTR-TQQIVCHCQ-EGYSGSRCE  948 (1758)
T ss_pred             hhhccccCCcccCCCCCCCCCCCCCCCccchhcccccccccc-ccceeeecc-cCccccchh
Confidence            4556776643    3567777877653        3554322 235789999 899999985


No 35 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=75.64  E-value=2.6  Score=20.99  Aligned_cols=15  Identities=27%  Similarity=0.953  Sum_probs=10.4

Q ss_pred             eEEeCCCCCcCCCCCC
Q psy6902          71 YKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        71 ~~C~C~~~g~~G~~Ce   86 (86)
                      -.|.|+ ++++|++|+
T Consensus        18 G~C~C~-~~~~G~~C~   32 (46)
T smart00180       18 GQCECK-PNVTGRRCD   32 (46)
T ss_pred             CEEECC-CCCCCCCCC
Confidence            367777 777777764


No 36 
>KOG4260|consensus
Probab=75.56  E-value=4.5  Score=28.39  Aligned_cols=33  Identities=36%  Similarity=0.907  Sum_probs=25.7

Q ss_pred             CCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCC
Q psy6902          52 TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKC   85 (86)
Q Consensus        52 ~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~C   85 (86)
                      ..||...|.|.-.....++-.|.|. +||+|+.|
T Consensus       149 er~C~GnG~C~GdGsR~GsGkCkC~-~GY~Gp~C  181 (350)
T KOG4260|consen  149 ERPCFGNGSCHGDGSREGSGKCKCE-TGYTGPLC  181 (350)
T ss_pred             cCCcCCCCcccCCCCCCCCCccccc-CCCCCccc
Confidence            4578888999865433456789999 99999887


No 37 
>KOG1226|consensus
Probab=73.10  E-value=5.8  Score=31.25  Aligned_cols=40  Identities=28%  Similarity=0.755  Sum_probs=26.6

Q ss_pred             cccccccccCCcccccCCCC---CCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          34 TQEECMTALKGKRQNACDTN---PCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        34 ~g~~C~~~~~~~~~~~C~~~---PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .|..|+-+     .-.|...   -|...|+|.=.       .|.|. +||+|..|+
T Consensus       538 ~G~fCECD-----nfsC~r~~g~lC~g~G~C~CG-------~CvC~-~GwtG~~C~  580 (783)
T KOG1226|consen  538 YGKFCECD-----NFSCERHKGVLCGGHGRCECG-------RCVCN-PGWTGSACN  580 (783)
T ss_pred             eeeeeecc-----CcccccccCcccCCCCeEeCC-------cEEcC-CCCccCCCC
Confidence            45666544     2234432   47777887643       69999 899999885


No 38 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=71.38  E-value=1.1  Score=29.50  Aligned_cols=64  Identities=25%  Similarity=0.641  Sum_probs=37.8

Q ss_pred             CCCCCC-CeeCC--ccCCCCCCccCCcccccccccccccCCcccccCCC-----CCCcCCceEecCCC--CCCCeEEeCC
Q psy6902           7 AHVRPG-CRIQD--NQCICGKGCYPEYKFITQEECMTALKGKRQNACDT-----NPCYHGGRCTQTFQ--DRTGYKCRCS   76 (86)
Q Consensus         7 ~~c~~g-C~~~~--~~C~c~~~c~~~~~~~~g~~C~~~~~~~~~~~C~~-----~PC~ngg~C~~~~~--~~~~~~C~C~   76 (86)
                      ..|.|| -..++  |-|.|..+++-    -+-..|+..      .+|..     .||-.-+.|+....  ....|.|.|.
T Consensus         6 T~CKNG~LiQMSNHfEC~Cnegfvl----~~EntCE~k------v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~   75 (197)
T PF06247_consen    6 TICKNGYLIQMSNHFECKCNEGFVL----KNENTCEEK------VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCI   75 (197)
T ss_dssp             ---BTEEEEEESSEEEEEESTTEEE----EETTEEEE----------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-
T ss_pred             ccccCCEEEEccCceEEEcCCCcEE----ccccccccc------eecCcccccCccccchhhhhcCCCcccceeEEEecc
Confidence            467777 43333  88999998432    233456654      34533     58999999997642  2457999999


Q ss_pred             CCCcC
Q psy6902          77 GTGYY   81 (86)
Q Consensus        77 ~~g~~   81 (86)
                       .||+
T Consensus        76 -~gY~   79 (197)
T PF06247_consen   76 -NGYI   79 (197)
T ss_dssp             -TTEE
T ss_pred             -cCce
Confidence             8996


No 39 
>KOG4260|consensus
Probab=68.16  E-value=3.8  Score=28.74  Aligned_cols=33  Identities=30%  Similarity=0.638  Sum_probs=26.8

Q ss_pred             ccccCC--CCCCcCCceEecCCCCCCCeEEeCCCCCcCC
Q psy6902          46 RQNACD--TNPCYHGGRCTQTFQDRTGYKCRCSGTGYYG   82 (86)
Q Consensus        46 ~~~~C~--~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G   82 (86)
                      +||+|.  ++||...-.|++..   ++|+|.+. +||.+
T Consensus       235 DvnEC~~ep~~c~~~qfCvNte---GSf~C~dk-~Gy~~  269 (350)
T KOG4260|consen  235 DVNECQNEPAPCKAHQFCVNTE---GSFKCEDK-EGYKK  269 (350)
T ss_pred             cHHHHhcCCCCCChhheeecCC---CceEeccc-ccccC
Confidence            378885  56899999999985   49999998 78865


No 40 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=64.96  E-value=10  Score=22.06  Aligned_cols=32  Identities=31%  Similarity=0.717  Sum_probs=24.2

Q ss_pred             cccCCC-CCCcCCceEecCCCCCCCeEEeCCCCCcCCC
Q psy6902          47 QNACDT-NPCYHGGRCTQTFQDRTGYKCRCSGTGYYGK   83 (86)
Q Consensus        47 ~~~C~~-~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~   83 (86)
                      .|.|.. ..|-..|.|....    .-.|.|+ +||.-+
T Consensus        77 ~d~Cd~y~~CG~~g~C~~~~----~~~C~Cl-~GF~P~  109 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNSNN----SPKCSCL-PGFEPK  109 (110)
T ss_pred             ccCCCCccccCCccEeCCCC----CCceECC-CCcCCC
Confidence            567864 7899999996532    4579999 899754


No 41 
>KOG3509|consensus
Probab=64.17  E-value=6.1  Score=31.97  Aligned_cols=36  Identities=42%  Similarity=1.046  Sum_probs=30.2

Q ss_pred             cccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      .++|...|+.+.+.|.+...   ...|.|+ ++|+|..|+
T Consensus       406 g~~c~~~p~~~~g~c~p~~~---~~~c~c~-~g~~G~~c~  441 (964)
T KOG3509|consen  406 GDVCWRIPCQHDGPCLQTLE---GKQCLCP-PGYTGDSCE  441 (964)
T ss_pred             CCccccccCCCCcccccccc---ccceecc-ccccCchhh
Confidence            46788899999999988753   7899999 899998764


No 42 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=63.90  E-value=10  Score=22.52  Aligned_cols=26  Identities=27%  Similarity=0.819  Sum_probs=18.1

Q ss_pred             CCCCcCCceEecCCCCC--CCeEEeCCCC
Q psy6902          52 TNPCYHGGRCTQTFQDR--TGYKCRCSGT   78 (86)
Q Consensus        52 ~~PC~ngg~C~~~~~~~--~~~~C~C~~~   78 (86)
                      .+-|...|.|+......  .=|.|.|. +
T Consensus        12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~-~   39 (103)
T PF12955_consen   12 TNNCSGHGSCVKKYGSGGGDCFACKCK-P   39 (103)
T ss_pred             ccCCCCCceEeeccCCCccceEEEEee-c
Confidence            35688889999864311  24999998 5


No 43 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=63.22  E-value=2.6  Score=22.72  Aligned_cols=11  Identities=36%  Similarity=1.047  Sum_probs=8.4

Q ss_pred             eCCCCCcCCCCC
Q psy6902          74 RCSGTGYYGKKC   85 (86)
Q Consensus        74 ~C~~~g~~G~~C   85 (86)
                      .|. +||+|+.|
T Consensus        53 ~C~-~Gw~G~~C   63 (63)
T PF01414_consen   53 VCL-PGWTGPNC   63 (63)
T ss_dssp             EE--TTEESTTS
T ss_pred             CCC-CCCcCCCC
Confidence            467 89999987


No 44 
>KOG0994|consensus
Probab=59.42  E-value=16  Score=30.70  Aligned_cols=30  Identities=27%  Similarity=0.511  Sum_probs=18.0

Q ss_pred             CeeCCccCCCCCCccCCcccccccccccccCCcc
Q psy6902          13 CRIQDNQCICGKGCYPEYKFITQEECMTALKGKR   46 (86)
Q Consensus        13 C~~~~~~C~c~~~c~~~~~~~~g~~C~~~~~~~~   46 (86)
                      |.+....|.|.+|    |++.+-..|+.-|+|..
T Consensus      1079 CN~ftGQCqCkpG----fGGR~C~qCqel~WGdP 1108 (1758)
T KOG0994|consen 1079 CNEFTGQCQCKPG----FGGRTCSQCQELYWGDP 1108 (1758)
T ss_pred             ccccccceeccCC----CCCcchhHHHHhhcCCC
Confidence            5555577888887    43334234666677654


No 45 
>KOG3514|consensus
Probab=52.96  E-value=13  Score=30.87  Aligned_cols=37  Identities=35%  Similarity=0.935  Sum_probs=30.2

Q ss_pred             cccCCCCCCcCCceEecCCCCCCCeEEeCCCCCcCCCCCC
Q psy6902          47 QNACDTNPCYHGGRCTQTFQDRTGYKCRCSGTGYYGKKCQ   86 (86)
Q Consensus        47 ~~~C~~~PC~ngg~C~~~~~~~~~~~C~C~~~g~~G~~Ce   86 (86)
                      +..|..+-|.|.|.|++...   +|.|.|.-..|+|+.|.
T Consensus      1018 st~c~~~acanhG~c~q~w~---~~~c~csmtS~~Gp~C~ 1054 (1591)
T KOG3514|consen 1018 STTCSEDACANHGVCIQQWN---GIACDCSMTSYSGPRCN 1054 (1591)
T ss_pred             Ccccchhhhhccceeeeeec---ceeeeccccccCCCccC
Confidence            45688888999999999864   89999975678888773


No 46 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=50.09  E-value=15  Score=17.44  Aligned_cols=24  Identities=25%  Similarity=0.753  Sum_probs=13.7

Q ss_pred             CCCCCCCCCCee-CCccCCCCCCcc
Q psy6902           4 CDPAHVRPGCRI-QDNQCICGKGCY   27 (86)
Q Consensus         4 C~~~~c~~gC~~-~~~~C~c~~~c~   27 (86)
                      |+...|...|.. ....|.||.|++
T Consensus         3 Cn~t~CpA~CDpn~~~~C~CPeGyI   27 (34)
T PF09064_consen    3 CNQTECPADCDPNSPGQCFCPEGYI   27 (34)
T ss_pred             cccccCCCccCCCCCCceeCCCceE
Confidence            344445545554 336788888743


No 47 
>PF12020 TAFA:  TAFA family;  InterPro: IPR020350 This entry contains the protein FAM19A2 (also known as TAFA2); it is a secreted, 11 kDa member of the FAM19/TAFA family of chemokine-like proteins [], which are composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The FAM19/TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines that act as regulators of immune and nervous cells []. 
Probab=41.24  E-value=21  Score=20.60  Aligned_cols=25  Identities=28%  Similarity=0.828  Sum_probs=19.3

Q ss_pred             cCCCCCCcCCceEecCCCCCCCeEEe
Q psy6902          49 ACDTNPCYHGGRCTQTFQDRTGYKCR   74 (86)
Q Consensus        49 ~C~~~PC~ngg~C~~~~~~~~~~~C~   74 (86)
                      -|...||+.|..|....+ ..+++|.
T Consensus        56 WC~MePCl~gEeC~~Lpd-~sGWsCs   80 (90)
T PF12020_consen   56 WCQMEPCLEGEECKVLPD-LSGWSCS   80 (90)
T ss_pred             eeecccccCCcceEEecc-CCCceec
Confidence            488999999999987654 3467765


No 48 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=40.00  E-value=42  Score=16.68  Aligned_cols=20  Identities=35%  Similarity=1.104  Sum_probs=14.5

Q ss_pred             CCcCCceEecCCCCCCCeEEeCCCCCcC
Q psy6902          54 PCYHGGRCTQTFQDRTGYKCRCSGTGYY   81 (86)
Q Consensus        54 PC~ngg~C~~~~~~~~~~~C~C~~~g~~   81 (86)
                      .|..+..|++.       +|.|+ .||.
T Consensus        27 qC~~~s~C~~g-------~C~C~-~g~~   46 (52)
T PF01683_consen   27 QCIGGSVCVNG-------RCQCP-PGYV   46 (52)
T ss_pred             CCCCcCEEcCC-------EeECC-CCCE
Confidence            46677778543       79999 7885


No 49 
>KOG1836|consensus
Probab=38.32  E-value=82  Score=27.54  Aligned_cols=35  Identities=29%  Similarity=0.638  Sum_probs=22.3

Q ss_pred             CCCCCCCCCC--Cee----CCccCC-CCCCccCCcccccccccc---cccCC
Q psy6902           3 ACDPAHVRPG--CRI----QDNQCI-CGKGCYPEYKFITQEECM---TALKG   44 (86)
Q Consensus         3 ~C~~~~c~~g--C~~----~~~~C~-c~~~c~~~~~~~~g~~C~---~~~~~   44 (86)
                      .|.+-||-++  |..    ....|. |+.+       |+|.+|+   .+|++
T Consensus       776 dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~g-------ytG~rCe~c~dgyfg  820 (1705)
T KOG1836|consen  776 DCQPCPCPNGGACGQTPEILEVVCKNCPPG-------YTGLRCEECADGYFG  820 (1705)
T ss_pred             CCccCCCCCChhhcCcCcccceecCCCCCC-------CcccccccCCCcccc
Confidence            3677777666  533    337787 8888       3777765   45554


No 50 
>KOG3512|consensus
Probab=28.62  E-value=1.2e+02  Score=23.17  Aligned_cols=68  Identities=28%  Similarity=0.503  Sum_probs=35.8

Q ss_pred             CCCCCCCCC-C--CeeCCccCCCCCCccCCccccccc---ccccccCCcc--cccCCCCC------CcCCceEecCCCCC
Q psy6902           3 ACDPAHVRP-G--CRIQDNQCICGKGCYPEYKFITQE---ECMTALKGKR--QNACDTNP------CYHGGRCTQTFQDR   68 (86)
Q Consensus         3 ~C~~~~c~~-g--C~~~~~~C~c~~~c~~~~~~~~g~---~C~~~~~~~~--~~~C~~~P------C~ngg~C~~~~~~~   68 (86)
                      +|+.+|... |  |.....+|.|..|    .   +|.   +|..+|+-..  +-+|..-|      +.++++       +
T Consensus       396 ~CdChpVGs~gktCNq~tGqCpCkeG----v---tG~tCnrCa~gyqqsrs~vapcik~p~~~~~~~~s~ve-------~  461 (592)
T KOG3512|consen  396 ACDCHPVGSAGKTCNQTTGQCPCKEG----V---TGLTCNRCAPGYQQSRSPVAPCIKIPTDAPTLGSSGVE-------P  461 (592)
T ss_pred             hcCCcccccccccccccCCcccCCCC----C---cccccccccchhhcccCCCcCceecCCCCccccCCCCc-------c
Confidence            345555532 2  6555678888887    3   444   3555654332  34443322      333333       2


Q ss_pred             CCeEEeCCCCCcCCCCC
Q psy6902          69 TGYKCRCSGTGYYGKKC   85 (86)
Q Consensus        69 ~~~~C~C~~~g~~G~~C   85 (86)
                      ..+-+.|+ .++.|.++
T Consensus       462 qd~~s~Ck-~~~~~~r~  477 (592)
T KOG3512|consen  462 QDQCSKCK-ASPGGKRL  477 (592)
T ss_pred             hhccccCC-CCCcceec
Confidence            24566788 77777654


Done!