Query         psy11297
Match_columns 132
No_of_seqs    153 out of 1321
Neff          8.8 
Searched_HMMs 46136
Date          Fri Aug 16 23:19:09 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11297.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11297hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus               99.8 2.1E-19 4.6E-24  150.2   9.2  117   11-132  1220-1371(2531)
  2 KOG3514|consensus               99.3 1.7E-11 3.6E-16  101.7   8.5  104   21-132   612-719 (1591)
  3 KOG1219|consensus               99.2 1.5E-11 3.3E-16  107.5   7.1   72   17-90   3889-3961(4289)
  4 KOG4289|consensus               99.1 6.7E-10 1.5E-14   94.6  10.1  103   21-131  1491-1595(2531)
  5 KOG1219|consensus               99.1 2.3E-10 4.9E-15  100.4   5.6   67   22-89   3855-3921(4289)
  6 KOG3516|consensus               99.0 1.5E-09 3.3E-14   91.1   9.7   44   84-131   798-841 (1306)
  7 KOG3516|consensus               99.0 2.4E-09 5.1E-14   90.0  10.2   96   24-132   951-1066(1306)
  8 PF00008 EGF:  EGF-like domain   98.9 2.4E-09 5.2E-14   54.1   2.7   32   34-65      1-32  (32)
  9 KOG3514|consensus               98.7 4.1E-08   9E-13   82.1   8.1   89   36-132   215-306 (1591)
 10 smart00179 EGF_CA Calcium-bind  98.5 2.3E-07   5E-12   48.2   4.1   36   31-67      2-39  (39)
 11 cd00054 EGF_CA Calcium-binding  98.3 1.6E-06 3.4E-11   44.5   4.1   36   31-67      2-38  (38)
 12 cd00110 LamG Laminin G domain;  98.1 2.1E-05 4.5E-10   52.3   7.2   52   79-132     3-55  (151)
 13 cd00053 EGF Epidermal growth f  98.0 1.5E-05 3.3E-10   40.1   4.0   32   34-66      2-35  (36)
 14 PF07645 EGF_CA:  Calcium-bindi  98.0   1E-05 2.2E-10   43.1   3.2   32   30-62      1-34  (42)
 15 smart00181 EGF Epidermal growt  97.9 2.1E-05 4.7E-10   39.8   3.9   32   34-67      2-35  (35)
 16 PF12661 hEGF:  Human growth fa  97.1 0.00023   5E-09   28.4   0.9   13   54-66      1-13  (13)
 17 PF07974 EGF_2:  EGF-like domai  97.1   0.001 2.3E-08   33.2   3.3   27   37-66      6-32  (32)
 18 PF12947 EGF_3:  EGF domain;  I  95.8  0.0075 1.6E-07   30.9   1.8   26   38-64      7-32  (36)
 19 PHA03099 epidermal growth fact  95.1   0.019 4.1E-07   37.7   2.4   39   31-70     42-84  (139)
 20 KOG1214|consensus               95.1    0.03 6.5E-07   47.3   3.9   42   19-63    815-858 (1289)
 21 PF12662 cEGF:  Complement Clr-  95.0   0.027 5.9E-07   26.2   2.1   16   52-67      1-20  (24)
 22 PHA02887 EGF-like protein; Pro  94.8    0.03 6.6E-07   36.2   2.7   36   32-68     84-123 (126)
 23 KOG1225|consensus               94.3    0.07 1.5E-06   43.1   4.3   44   18-69    300-343 (525)
 24 KOG1217|consensus               94.0   0.088 1.9E-06   40.8   4.4   53   16-69    154-208 (487)
 25 PF12946 EGF_MSP1_1:  MSP1 EGF   94.0   0.057 1.2E-06   27.8   2.1   29   34-62      2-30  (37)
 26 KOG1217|consensus               94.0    0.17 3.6E-06   39.3   5.7   47   19-66    257-306 (487)
 27 KOG4260|consensus               93.1    0.11 2.5E-06   38.5   3.2   47   18-68    132-183 (350)
 28 smart00051 DSL delta serrate l  92.6    0.19   4E-06   29.0   3.1   42   18-66     21-63  (63)
 29 cd01475 vWA_Matrilin VWA_Matri  91.8    0.24 5.2E-06   35.4   3.6   36   24-63    181-218 (224)
 30 KOG1225|consensus               91.0    0.59 1.3E-05   37.9   5.3   45   18-70    269-313 (525)
 31 PF14670 FXa_inhibition:  Coagu  90.7    0.22 4.8E-06   25.4   1.8   18   44-62     11-28  (36)
 32 KOG1214|consensus               87.5     1.9 4.1E-05   37.1   5.9   31   43-74    799-831 (1289)
 33 PF00053 Laminin_EGF:  Laminin   86.5    0.88 1.9E-05   24.5   2.5   23   43-68     11-33  (49)
 34 cd00055 EGF_Lam Laminin-type e  84.8     1.1 2.4E-05   24.3   2.4   16   53-68     19-34  (50)
 35 PF12955 DUF3844:  Domain of un  83.0    0.71 1.5E-05   29.3   1.2   32   37-68     13-61  (103)
 36 KOG1226|consensus               81.9       2 4.3E-05   36.3   3.7   40   24-68    539-581 (783)
 37 KOG1836|consensus               81.5       3 6.5E-05   38.6   4.9   60   11-71    755-816 (1705)
 38 PF01414 DSL:  Delta serrate li  79.3    0.98 2.1E-05   26.0   0.9   11   56-66     53-63  (63)
 39 KOG3509|consensus               78.0     6.5 0.00014   34.4   5.7   56   15-72    158-213 (964)
 40 PF00954 S_locus_glycop:  S-loc  75.7     3.4 7.4E-05   26.1   2.8   32   30-63     76-108 (110)
 41 smart00180 EGF_Lam Laminin-typ  74.0     2.7 5.9E-05   22.3   1.7   16   53-68     18-33  (46)
 42 KOG3509|consensus               71.1     6.7 0.00015   34.3   4.1   42   31-73    406-447 (964)
 43 KOG4260|consensus               71.1     2.4 5.2E-05   31.7   1.3   33   30-63    235-269 (350)
 44 PF09064 Tme5_EGF_like:  Thromb  66.5     5.1 0.00011   20.1   1.5   25   34-62      3-27  (34)
 45 PF04863 EGF_alliinase:  Alliin  60.0     5.3 0.00011   22.4   1.0   35   37-71     17-54  (56)
 46 KOG0994|consensus               52.4      16 0.00034   33.0   3.1   32   33-68   1068-1099(1758)
 47 cd01328 FSL_SPARC Follistatin-  50.6      29 0.00063   21.3   3.3   27   33-59      1-27  (86)
 48 PHA02885 putative interleukin   43.5      35 0.00076   21.8   2.9   24  106-129    57-80  (135)
 49 smart00274 FOLN Follistatin-N-  41.1      41 0.00089   15.6   2.6   23   33-55      1-23  (26)
 50 PF12020 TAFA:  TAFA family;  I  34.4      52  0.0011   20.2   2.6   39    9-47     27-70  (90)
 51 KOG1226|consensus               34.3      80  0.0017   27.2   4.4   50   18-70    570-623 (783)
 52 KOG0994|consensus               31.2 1.1E+02  0.0025   28.0   4.9   37   33-69    905-950 (1758)
 53 PF02973 Sialidase:  Sialidase,  30.9 1.8E+02   0.004   20.5   5.3   41   91-132    29-71  (190)
 54 KOG3607|consensus               30.6      38 0.00082   28.9   2.1   29   38-70    631-659 (716)
 55 KOG1218|consensus               25.5 2.5E+02  0.0055   20.5   5.6   30   53-82    162-191 (316)
 56 KOG4004|consensus               22.0      96  0.0021   22.4   2.5   31   29-59     48-78  (259)
 57 PF06247 Plasmod_Pvs28:  Plasmo  21.9      48   0.001   23.5   1.0   26   37-62     50-79  (197)
 58 PF07966 A1_Propeptide:  A1 Pro  20.6      32  0.0007   16.4  -0.0    8    2-9       4-11  (29)

No 1  
>KOG4289|consensus
Probab=99.80  E-value=2.1e-19  Score=150.20  Aligned_cols=117  Identities=28%  Similarity=0.539  Sum_probs=98.0

Q ss_pred             CccCCeecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCC------------------
Q psy11297         11 GCLSASVVQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPP------------------   72 (132)
Q Consensus        11 ~~~~~~~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~------------------   72 (132)
                      +.++.. ||..|+|..|+.++|.|-+.||.|+|+|....+ +|+|.|.+||+|++||.+...                  
T Consensus      1220 nglrCr-CPpGFTgd~CeTeiDlCYs~pC~nng~C~srEg-gYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1220 NGLRCR-CPPGFTGDYCETEIDLCYSGPCGNNGRCRSREG-GYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred             CceeEe-CCCCCCcccccchhHhhhcCCCCCCCceEEecC-ceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence            334433 455566999999999999999999999999985 599999999999999986532                  


Q ss_pred             -----------------CCCCCCCCCCcceEEecccCCceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297         73 -----------------KLYDTPAFNGSSHIVMKTLKAYNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN  132 (132)
Q Consensus        73 -----------------c~~~~~~~~g~~~~~~~~~~~~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~  132 (132)
                                       |+....+|.+.+|+.|.+++.+.++.++|.|.|...+|+|+|+|.+   ++||++|++++
T Consensus      1298 nggf~c~Cp~ge~e~prC~v~trSFp~~sfv~frglrqRfh~TlslsfaT~~~nGlL~ynGne---khDFvalevVd 1371 (2531)
T KOG4289|consen 1298 NGGFCCHCPYGEFEDPRCEVTTRSFPPESFVTFRGLRQRFHFTLSLSFATIERNGLLLYNGNE---KHDFVALEVVD 1371 (2531)
T ss_pred             CCceeccCCCcccCCCceEEEeeccCchheEEEeccccceEEEEEEEEEEeeecceEEecCCc---ccceEeeeeee
Confidence                             1123345777889999999999999999999999999999999954   89999999975


No 2  
>KOG3514|consensus
Probab=99.27  E-value=1.7e-11  Score=101.69  Aligned_cols=104  Identities=27%  Similarity=0.551  Sum_probs=85.2

Q ss_pred             CCCCCCcccCCC-CCCCCCCCCCCeEeeCCCCceEEeC-CCCccCCcccccCCCCCCCCCCCCCcceEEecc--cCCcee
Q psy11297         21 SQASPNLRLEES-PCLSHPCQNGATCQDEEDGLFECLC-SPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKT--LKAYNK   96 (132)
Q Consensus        21 ~~~g~~C~~~~~-~C~~~pC~ngg~C~~~~~~~~~C~C-~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~--~~~~~~   96 (132)
                      ..+-+.|..... .|.++||+|+|+|...|+. |.|.| ..+|.|+.|+.+.     ....++|.-|+.+-.  .+....
T Consensus       612 agvkpsCs~~~~~~C~~nPC~N~g~C~egwNr-fiCDCs~T~~~G~~CerE~-----t~ls~nGs~~m~i~L~~~~~tq~  685 (1591)
T KOG3514|consen  612 AGVKPSCSLSNEKICESNPCQNGGKCSEGWNR-FICDCSGTGFEGRTCEREA-----TALSYNGSMSMKIVLPHTMHTQA  685 (1591)
T ss_pred             cccCcccchhhccccCCCcccCCCCccccccc-cccccccCcccCcccccee-----eeEEEcCeeeEEEEecccceeec
Confidence            445668875544 7999999999999999985 99999 6789999999877     678889988887542  234456


Q ss_pred             EEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297         97 LSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN  132 (132)
Q Consensus        97 ~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~  132 (132)
                      .+++|+|||..+.|+||-.+++  +..|-|.|+|.+
T Consensus       686 E~v~iRF~t~r~~Gll~~Tta~--~s~D~l~l~L~~  719 (1591)
T KOG3514|consen  686 EDVSIRFRTQRAYGLLFATTAR--GSADTLRLELDA  719 (1591)
T ss_pred             ceEEEEEEecccceeEEEeccC--CCCceEEEEEec
Confidence            7899999999999999988886  358999999864


No 3  
>KOG1219|consensus
Probab=99.25  E-value=1.5e-11  Score=107.45  Aligned_cols=72  Identities=26%  Similarity=0.510  Sum_probs=65.0

Q ss_pred             ecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCccccc-CCCCCCCCCCCCCcceEEecc
Q psy11297         17 VVQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTR-APPKLYDTPAFNGSSHIVMKT   90 (132)
Q Consensus        17 ~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~-~~~c~~~~~~~~g~~~~~~~~   90 (132)
                      -||..|.|.+||.++.+|.++||.+||+|+...++ |.|.|+.||+|.+||.+ +++|. .++|.+|+.|+..++
T Consensus      3889 kCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~-f~CnC~~gyTG~~Ce~~Gi~eCs-~n~C~~gg~C~n~~g 3961 (4289)
T KOG1219|consen 3889 KCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNG-FLCNCPNGYTGKRCEARGISECS-KNVCGTGGQCINIPG 3961 (4289)
T ss_pred             eCcccccCcccccccccccCCCCCCCCEEEecCCC-eeEeCCCCccCceeecccccccc-cccccCCceeeccCC
Confidence            46677889999999999999999999999999865 99999999999999998 89997 889999998888764


No 4  
>KOG4289|consensus
Probab=99.10  E-value=6.7e-10  Score=94.60  Aligned_cols=103  Identities=22%  Similarity=0.401  Sum_probs=80.1

Q ss_pred             CCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCCCCCCCCCCCcceEEecccC--CceeEE
Q psy11297         21 SQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKTLK--AYNKLS   98 (132)
Q Consensus        21 ~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~~~--~~~~~~   98 (132)
                      ..+...|....+-|+..+|.|+|+|++.|+ +|.|.||.+|.|..|++.+.    ...-|.|.+-+....+.  ....+.
T Consensus      1491 ngt~eGC~ark~fCdsg~C~n~g~CvnrWg-~~~C~CP~~fggk~c~~~m~----~pq~frG~sl~sw~~~~~~vSvPwy 1565 (2531)
T KOG4289|consen 1491 NGTHEGCKARKNFCDSGQCSNGGTCVNRWG-GFSCECPLGFGGKGCCQGMA----HPQHFRGHSLVSWEGLPSQVSVPWY 1565 (2531)
T ss_pred             cCcccCchhhhcccCCCccCCCCeeecccC-cEeecCccccCCcchhhccC----CchhccccceeeecCCCcceecceE
Confidence            344678988889999999999999999995 59999999999999998773    55677887777655443  234678


Q ss_pred             EEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEc
Q psy11297         99 IEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIV  131 (132)
Q Consensus        99 i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~  131 (132)
                      ++|+|||+..+|+||-....   ...-+.|+|.
T Consensus      1566 lsl~FRTr~ad~vl~~~~~~---~rst~~lqld 1595 (2531)
T KOG4289|consen 1566 LSLMFRTRRADGVLMQAEFG---GRSTYNLQLD 1595 (2531)
T ss_pred             EEEEEEeeccccEEEEEEeC---CCceEEEEEc
Confidence            99999999999999976653   1223555553


No 5  
>KOG1219|consensus
Probab=99.05  E-value=2.3e-10  Score=100.40  Aligned_cols=67  Identities=28%  Similarity=0.713  Sum_probs=60.6

Q ss_pred             CCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCCCCCCCCCCCcceEEec
Q psy11297         22 QASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMK   89 (132)
Q Consensus        22 ~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~   89 (132)
                      ..-+.|....++|+.+||+|||+|.....++|.|.|+..|+|.+||.++..|. ++||..|++|+.+.
T Consensus      3855 ~l~pgC~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~-snPC~~GgtCip~~ 3921 (4289)
T KOG1219|consen 3855 GLQPGCSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCA-SNPCLTGGTCIPFY 3921 (4289)
T ss_pred             cccccccccccccccCcccCCCEecCCCCCceEEeCcccccCccccccccccc-CCCCCCCCEEEecC
Confidence            34567876669999999999999999887889999999999999999999998 99999999999875


No 6  
>KOG3516|consensus
Probab=99.04  E-value=1.5e-09  Score=91.11  Aligned_cols=44  Identities=23%  Similarity=0.381  Sum_probs=40.1

Q ss_pred             ceEEecccCCceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEc
Q psy11297         84 SHIVMKTLKAYNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIV  131 (132)
Q Consensus        84 ~~~~~~~~~~~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~  131 (132)
                      +|++|+..+.....+|+|.|||..+.|++|.+.+    ..|||.|+|.
T Consensus       798 syL~fp~f~~~~saDIsf~FrTt~~~gvflen~g----~~dfir~eL~  841 (1306)
T KOG3516|consen  798 SYLHFPPFHNELSADISFFFRTTASSGVFLENHG----INDFIRLELS  841 (1306)
T ss_pred             ceeecCcccCcccccEEEEEEecCCceEeeeccC----CCceEEEEEc
Confidence            5888888888888999999999999999999998    5999999986


No 7  
>KOG3516|consensus
Probab=99.02  E-value=2.4e-09  Score=90.01  Aligned_cols=96  Identities=26%  Similarity=0.537  Sum_probs=77.3

Q ss_pred             CCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeC-CCCccCCcccccCCCCCCCCCCCCCcceEEeccc-----------
Q psy11297         24 SPNLRLEESPCLSHPCQNGATCQDEEDGLFECLC-SPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKTL-----------   91 (132)
Q Consensus        24 g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C-~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~~-----------   91 (132)
                      -+.|+   ..|++.||.|||.|+..+.+ |+|.| ...|.|+.|..++      ...|..+++++|+-.           
T Consensus       951 ~~GC~---GhCss~~C~NGG~Cvery~g-ytCDCs~Tay~Gp~Cs~ei------g~~fe~gs~i~y~fq~~~~~a~~~~~ 1020 (1306)
T KOG3516|consen  951 SPGCE---GHCSSYPCLNGGHCVERYDG-YTCDCSRTAYDGPFCSKEI------GVFFERGSSIRYNFQKPMRSAVFESS 1020 (1306)
T ss_pred             cCCCc---cccccccccCCCEEEEecCc-eeeccccCcCCCCcccccc------ceEecCCceEEEeccchHHHhhhhhh
Confidence            45665   47999999999999999975 99999 5669999999876      456777888887521           


Q ss_pred             --------CCceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297         92 --------KAYNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN  132 (132)
Q Consensus        92 --------~~~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~  132 (132)
                              .......|.|.|+|+...++|||.++.   ..||+++.|.+
T Consensus      1021 ~~~~~~~~~~~~~e~i~~sftTt~~ps~LLfvssF---~~~y~~V~v~~ 1066 (1306)
T KOG3516|consen 1021 RVKQKLEIEINPNEEINFSFTTTRAPSDLLFVSSF---TDDYLAVLVKD 1066 (1306)
T ss_pred             hhhhccccccCccceEEEEEEeccCceEEEEeecc---ccceEEEEEeC
Confidence                    123467899999999999999999987   57899888754


No 8  
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.85  E-value=2.4e-09  Score=54.07  Aligned_cols=32  Identities=50%  Similarity=1.078  Sum_probs=28.1

Q ss_pred             CCCCCCCCCCeEeeCCCCceEEeCCCCccCCc
Q psy11297         34 CLSHPCQNGATCQDEEDGLFECLCSPEFTGYL   65 (132)
Q Consensus        34 C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~   65 (132)
                      |.++||.|+|+|++...++|+|.|++||+|++
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            67889999999998874469999999999974


No 9  
>KOG3514|consensus
Probab=98.74  E-value=4.1e-08  Score=82.14  Aligned_cols=89  Identities=24%  Similarity=0.531  Sum_probs=75.9

Q ss_pred             CCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCCCCCCCCCCCcceEEecccC---CceeEEEEEEEEeCCCCeEE
Q psy11297         36 SHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPKLYDTPAFNGSSHIVMKTLK---AYNKLSIEIEFKTNKNDGIL  112 (132)
Q Consensus        36 ~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g~~~~~~~~~~---~~~~~~i~~~f~t~~~~g~l  112 (132)
                      ..+|+|++.|.....+ -.|.|...+.|.+|+...++   ....|.|..|+-|..-.   .....+|+|.|||...+|+|
T Consensus       215 ~~~c~~~g~c~s~d~g-p~c~c~~~~dgq~cekeK~~---~eaTF~G~ef~~YDls~npI~s~~d~itl~FrT~q~ngll  290 (1591)
T KOG3514|consen  215 QFVCLNDGECYSSDDG-PHCDCQFDHDGQNCEKEKND---GEATFGGDEFVGYDLSQNPIRSKKDNITLTFRTVQGNGLL  290 (1591)
T ss_pred             cceeccCCeEecCCCC-CccccccccCccccccccCc---ceEEecCceEEEeeccCCcccccccceEEEEEEecCceeE
Confidence            3679999999998754 99999999999999998865   67889999999886322   23466899999999999999


Q ss_pred             EEeccCCCCCCCeEEEEEcC
Q psy11297        113 LYNQQNLDGTGDFVSLAIVN  132 (132)
Q Consensus       113 l~~~~~~~~~~d~~~l~l~~  132 (132)
                      ||.|.    ..||+.|+|+|
T Consensus       291 fytG~----~~dYlnlaL~d  306 (1591)
T KOG3514|consen  291 FYTGD----EKDYLNLALQD  306 (1591)
T ss_pred             EEccC----CcceeeEeecC
Confidence            99999    48999999986


No 10 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.50  E-value=2.3e-07  Score=48.24  Aligned_cols=36  Identities=42%  Similarity=1.005  Sum_probs=31.1

Q ss_pred             CCCCCC-CCCCCCCeEeeCCCCceEEeCCCCcc-CCccc
Q psy11297         31 ESPCLS-HPCQNGATCQDEEDGLFECLCSPEFT-GYLCH   67 (132)
Q Consensus        31 ~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~-G~~C~   67 (132)
                      +++|.. +||.++++|++... +|.|.|+.||. |..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g-~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVG-SYRCECPPGYTDGRNCE   39 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCC-CeEeECCCCCccCCcCC
Confidence            577887 89999999998874 59999999999 88874


No 11 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.29  E-value=1.6e-06  Score=44.47  Aligned_cols=36  Identities=42%  Similarity=1.011  Sum_probs=30.7

Q ss_pred             CCCCCC-CCCCCCCeEeeCCCCceEEeCCCCccCCccc
Q psy11297         31 ESPCLS-HPCQNGATCQDEEDGLFECLCSPEFTGYLCH   67 (132)
Q Consensus        31 ~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~   67 (132)
                      +++|.. .+|.+++.|.+... .|.|.|+.+|.|..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~-~~~C~C~~g~~g~~C~   38 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVG-SYRCSCPPGYTGRNCE   38 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCC-CeEeECCCCCcCCcCC
Confidence            467877 79999999998874 5999999999998874


No 12 
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=98.08  E-value=2.1e-05  Score=52.31  Aligned_cols=52  Identities=38%  Similarity=0.676  Sum_probs=43.8

Q ss_pred             CCCCcceEEecccCC-ceeEEEEEEEEeCCCCeEEEEeccCCCCCCCeEEEEEcC
Q psy11297         79 AFNGSSHIVMKTLKA-YNKLSIEIEFKTNKNDGILLYNQQNLDGTGDFVSLAIVN  132 (132)
Q Consensus        79 ~~~g~~~~~~~~~~~-~~~~~i~~~f~t~~~~g~ll~~~~~~~~~~d~~~l~l~~  132 (132)
                      .|.+.+|+.++.... ...++|+++|||..++|+|||.+...  ..+|++|+|.+
T Consensus         3 ~F~g~~~i~~~~~~~~~~~~~i~~~frt~~~~g~l~~~~~~~--~~~~~~l~l~~   55 (151)
T cd00110           3 SFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQN--GGDFLALELED   55 (151)
T ss_pred             EeCCCceEEecCCCCCcceeEEEEEEEeCCCCeEEEEecCCC--CCCEEEEEEEC
Confidence            477789999986654 67889999999999999999999853  48999999874


No 13 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.99  E-value=1.5e-05  Score=40.06  Aligned_cols=32  Identities=44%  Similarity=1.189  Sum_probs=27.2

Q ss_pred             CC-CCCCCCCCeEeeCCCCceEEeCCCCccCC-cc
Q psy11297         34 CL-SHPCQNGATCQDEEDGLFECLCSPEFTGY-LC   66 (132)
Q Consensus        34 C~-~~pC~ngg~C~~~~~~~~~C~C~~g~~G~-~C   66 (132)
                      |. ..+|.+++.|++... .|.|.|+.||.|. .|
T Consensus         2 C~~~~~C~~~~~C~~~~~-~~~C~C~~g~~g~~~C   35 (36)
T cd00053           2 CAASNPCSNGGTCVNTPG-SYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCCCCEEecCCC-CeEeECCCCCcccCCc
Confidence            44 678999999999884 5999999999998 65


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.97  E-value=1e-05  Score=43.15  Aligned_cols=32  Identities=31%  Similarity=0.935  Sum_probs=27.2

Q ss_pred             CCCCCCC--CCCCCCCeEeeCCCCceEEeCCCCcc
Q psy11297         30 EESPCLS--HPCQNGATCQDEEDGLFECLCSPEFT   62 (132)
Q Consensus        30 ~~~~C~~--~pC~ngg~C~~~~~~~~~C~C~~g~~   62 (132)
                      ++++|+.  ++|..++.|++.. |+|+|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~-Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTE-GSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEET-TEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCC-CCEEeeCCCCcE
Confidence            4678876  4698899999998 679999999997


No 15 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.93  E-value=2.1e-05  Score=39.80  Aligned_cols=32  Identities=47%  Similarity=1.185  Sum_probs=26.6

Q ss_pred             CCC-CCCCCCCeEeeCCCCceEEeCCCCccC-Cccc
Q psy11297         34 CLS-HPCQNGATCQDEEDGLFECLCSPEFTG-YLCH   67 (132)
Q Consensus        34 C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G-~~C~   67 (132)
                      |.. .+|.++ .|++.. ++|+|.|+.||.| ..|+
T Consensus         2 C~~~~~C~~~-~C~~~~-~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        2 CASGGPCSNG-TCINTP-GSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCcCCCCCC-EEECCC-CCeEeECCCCCccCCccC
Confidence            556 689998 999886 4699999999999 7664


No 16 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.10  E-value=0.00023  Score=28.41  Aligned_cols=13  Identities=46%  Similarity=1.193  Sum_probs=10.8

Q ss_pred             EEeCCCCccCCcc
Q psy11297         54 ECLCSPEFTGYLC   66 (132)
Q Consensus        54 ~C~C~~g~~G~~C   66 (132)
                      .|.|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5899999999987


No 17 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.07  E-value=0.001  Score=33.25  Aligned_cols=27  Identities=30%  Similarity=0.716  Sum_probs=23.3

Q ss_pred             CCCCCCCeEeeCCCCceEEeCCCCccCCcc
Q psy11297         37 HPCQNGATCQDEEDGLFECLCSPEFTGYLC   66 (132)
Q Consensus        37 ~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C   66 (132)
                      ..|.++|+|+...   .+|.|..||+|+.|
T Consensus         6 ~~C~~~G~C~~~~---g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPC---GRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCC---CEEECCCCCcCCCC
Confidence            4689999999772   69999999999887


No 18 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=95.77  E-value=0.0075  Score=30.93  Aligned_cols=26  Identities=31%  Similarity=0.880  Sum_probs=18.9

Q ss_pred             CCCCCCeEeeCCCCceEEeCCCCccCC
Q psy11297         38 PCQNGATCQDEEDGLFECLCSPEFTGY   64 (132)
Q Consensus        38 pC~ngg~C~~~~~~~~~C~C~~g~~G~   64 (132)
                      .|-..++|++... +|+|.|++||.|.
T Consensus         7 ~C~~nA~C~~~~~-~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    7 GCHPNATCTNTGG-SYTCTCKPGYEGD   32 (36)
T ss_dssp             GS-TTCEEEE-TT-SEEEEE-CEEECC
T ss_pred             CCCCCcEeecCCC-CEEeECCCCCccC
Confidence            4666789999985 6999999999873


No 19 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=95.13  E-value=0.019  Score=37.68  Aligned_cols=39  Identities=26%  Similarity=0.510  Sum_probs=28.0

Q ss_pred             CCCCCC---CCCCCCCeEeeCC-CCceEEeCCCCccCCcccccC
Q psy11297         31 ESPCLS---HPCQNGATCQDEE-DGLFECLCSPEFTGYLCHTRA   70 (132)
Q Consensus        31 ~~~C~~---~pC~ngg~C~~~~-~~~~~C~C~~g~~G~~C~~~~   70 (132)
                      +.+|.+   +=|.|| +|.-.. ...+.|.|..||+|.+||...
T Consensus        42 i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d   84 (139)
T PHA03099         42 IRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV   84 (139)
T ss_pred             cccCChhhCCEeECC-EEEeeccCCCceeECCCCccccccccee
Confidence            445653   458887 897331 125999999999999998743


No 20 
>KOG1214|consensus
Probab=95.06  E-value=0.03  Score=47.29  Aligned_cols=42  Identities=26%  Similarity=0.656  Sum_probs=34.7

Q ss_pred             CCCCCCC--CcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297         19 QSSQASP--NLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTG   63 (132)
Q Consensus        19 p~~~~g~--~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G   63 (132)
                      |++. |.  .| .+.|.|+++.|-..++|.+.. ++|+|.|.+||.|
T Consensus       815 PGfs-GDG~~c-~dvDeC~psrChp~A~Cyntp-gsfsC~C~pGy~G  858 (1289)
T KOG1214|consen  815 PGFS-GDGHQC-TDVDECSPSRCHPAATCYNTP-GSFSCRCQPGYYG  858 (1289)
T ss_pred             CCcc-CCcccc-ccccccCccccCCCceEecCC-CcceeecccCccC
Confidence            6655 44  44 467999999999999999998 5699999999976


No 21 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.00  E-value=0.027  Score=26.19  Aligned_cols=16  Identities=31%  Similarity=1.055  Sum_probs=12.1

Q ss_pred             ceEEeCCCCcc----CCccc
Q psy11297         52 LFECLCSPEFT----GYLCH   67 (132)
Q Consensus        52 ~~~C~C~~g~~----G~~C~   67 (132)
                      +|+|.|++||.    |..|+
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~   20 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE   20 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc
Confidence            49999999985    45553


No 22 
>PHA02887 EGF-like protein; Provisional
Probab=94.82  E-value=0.03  Score=36.16  Aligned_cols=36  Identities=25%  Similarity=0.575  Sum_probs=26.1

Q ss_pred             CCCCC---CCCCCCCeEeeCC-CCceEEeCCCCccCCcccc
Q psy11297         32 SPCLS---HPCQNGATCQDEE-DGLFECLCSPEFTGYLCHT   68 (132)
Q Consensus        32 ~~C~~---~pC~ngg~C~~~~-~~~~~C~C~~g~~G~~C~~   68 (132)
                      .+|.+   +=|.| |+|.-.. .....|.|+.||+|.+|+.
T Consensus        84 ~pC~~eyk~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         84 EKCKNDFNDFCIN-GECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             cccChHhhCEeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence            45653   45885 5998321 1248999999999999986


No 23 
>KOG1225|consensus
Probab=94.33  E-value=0.07  Score=43.08  Aligned_cols=44  Identities=30%  Similarity=0.773  Sum_probs=29.8

Q ss_pred             cCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCccccc
Q psy11297         18 VQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTR   69 (132)
Q Consensus        18 cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~   69 (132)
                      |+..|.|..|+...  |. ..|.++|.|+  . +  +|.|.+||+|..|++.
T Consensus       300 C~~g~~G~dCs~~~--cp-adC~g~G~Ci--~-G--~C~C~~Gy~G~~C~~~  343 (525)
T KOG1225|consen  300 CNPGYSGKDCSIRR--CP-ADCSGHGKCI--D-G--ECLCDEGYTGELCIQR  343 (525)
T ss_pred             cCCCcccccccccc--CC-ccCCCCCccc--C-C--ceEeCCCCcCCccccc
Confidence            44555577775432  43 5677778887  2 3  7888888888888876


No 24 
>KOG1217|consensus
Probab=94.04  E-value=0.088  Score=40.81  Aligned_cols=53  Identities=30%  Similarity=0.698  Sum_probs=43.4

Q ss_pred             eecCCCCCCCCcccCCCCCC--CCCCCCCCeEeeCCCCceEEeCCCCccCCccccc
Q psy11297         16 SVVQSSQASPNLRLEESPCL--SHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTR   69 (132)
Q Consensus        16 ~~cp~~~~g~~C~~~~~~C~--~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~   69 (132)
                      ..|...+.+..|....+.|.  ..+|.+++.|.+... .|.|.|+.+|.|..|+..
T Consensus       154 c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~-~~~C~c~~~~~~~~~~~~  208 (487)
T KOG1217|consen  154 CSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGG-SYLCSCPPGYTGSTCETT  208 (487)
T ss_pred             eeeCCCcccccccccccccccCCCCcCCCcccccCCC-CeeEeCCCCccCCcCcCC
Confidence            45666666888876667898  456999999999885 499999999999998865


No 25 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.96  E-value=0.057  Score=27.78  Aligned_cols=29  Identities=31%  Similarity=0.746  Sum_probs=21.0

Q ss_pred             CCCCCCCCCCeEeeCCCCceEEeCCCCcc
Q psy11297         34 CLSHPCQNGATCQDEEDGLFECLCSPEFT   62 (132)
Q Consensus        34 C~~~pC~ngg~C~~~~~~~~~C~C~~g~~   62 (132)
                      |...+|..++.|....+|++.|.|.+||.
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            66678889999998877889999999985


No 26 
>KOG1217|consensus
Probab=93.96  E-value=0.17  Score=39.29  Aligned_cols=47  Identities=30%  Similarity=0.721  Sum_probs=36.6

Q ss_pred             CCCCCCCC--cccCCCCCCCCC-CCCCCeEeeCCCCceEEeCCCCccCCcc
Q psy11297         19 QSSQASPN--LRLEESPCLSHP-CQNGATCQDEEDGLFECLCSPEFTGYLC   66 (132)
Q Consensus        19 p~~~~g~~--C~~~~~~C~~~p-C~ngg~C~~~~~~~~~C~C~~g~~G~~C   66 (132)
                      +..+.+..  ...+++.|...+ |.++++|....+ .|.|.|+++|.|..|
T Consensus       257 ~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~-~~~C~C~~g~~g~~~  306 (487)
T KOG1217|consen  257 PEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPG-SYRCTCPPGFTGRLC  306 (487)
T ss_pred             CCCccccccceeeeccccCCCCccCCCCeeecCCC-cceeeCCCCCCCCCC
Confidence            33343554  234678899874 999999999875 499999999999998


No 27 
>KOG4260|consensus
Probab=93.08  E-value=0.11  Score=38.53  Aligned_cols=47  Identities=26%  Similarity=0.703  Sum_probs=35.4

Q ss_pred             cCCCCCCCCcccCCCCC---CCCCCCCCCeEeeC--CCCceEEeCCCCccCCcccc
Q psy11297         18 VQSSQASPNLRLEESPC---LSHPCQNGATCQDE--EDGLFECLCSPEFTGYLCHT   68 (132)
Q Consensus        18 cp~~~~g~~C~~~~~~C---~~~pC~ngg~C~~~--~~~~~~C~C~~g~~G~~C~~   68 (132)
                      ||...-|++|.    .|   +..||..+|.|...  +.|+-.|.|.+||+|+.|..
T Consensus       132 Cp~gtyGpdCl----~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~  183 (350)
T KOG4260|consen  132 CPDGTYGPDCL----QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY  183 (350)
T ss_pred             cCCCCcCCccc----cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc
Confidence            45444499997    45   46789999999843  33567999999999999865


No 28 
>smart00051 DSL delta serrate ligand.
Probab=92.60  E-value=0.19  Score=29.04  Aligned_cols=42  Identities=17%  Similarity=0.311  Sum_probs=27.5

Q ss_pred             cCCCCCCCCcccCCCCCCC-CCCCCCCeEeeCCCCceEEeCCCCccCCcc
Q psy11297         18 VQSSQASPNLRLEESPCLS-HPCQNGATCQDEEDGLFECLCSPEFTGYLC   66 (132)
Q Consensus        18 cp~~~~g~~C~~~~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C   66 (132)
                      |+..+-|..|..   .|.+ +-+.++.+|...  |  .|.|++||+|..|
T Consensus        21 C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~~--G--~~~C~~Gw~G~~C   63 (63)
T smart00051       21 CDENYYGEGCNK---FCRPRDDFFGHYTCDEN--G--NKGCLEGWMGPYC   63 (63)
T ss_pred             CCCCCcCCccCC---EeCcCccccCCccCCcC--C--CEecCCCCcCCCC
Confidence            343444999963   5643 224567788532  3  6789999999887


No 29 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.82  E-value=0.24  Score=35.37  Aligned_cols=36  Identities=22%  Similarity=0.546  Sum_probs=26.2

Q ss_pred             CCCcccCCCCCCC--CCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297         24 SPNLRLEESPCLS--HPCQNGATCQDEEDGLFECLCSPEFTG   63 (132)
Q Consensus        24 g~~C~~~~~~C~~--~pC~ngg~C~~~~~~~~~C~C~~g~~G   63 (132)
                      +..|+ +.++|..  ++|.  ..|.+.. |+|.|.|+.||+.
T Consensus       181 ~~~C~-~~~~C~~~~~~c~--~~C~~~~-g~~~c~c~~g~~~  218 (224)
T cd01475         181 GKICV-VPDLCATLSHVCQ--QVCISTP-GSYLCACTEGYAL  218 (224)
T ss_pred             cccCc-CchhhcCCCCCcc--ceEEcCC-CCEEeECCCCccC
Confidence            55665 5667864  4565  5799887 5699999999974


No 30 
>KOG1225|consensus
Probab=90.99  E-value=0.59  Score=37.94  Aligned_cols=45  Identities=29%  Similarity=0.742  Sum_probs=33.0

Q ss_pred             cCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccC
Q psy11297         18 VQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRA   70 (132)
Q Consensus        18 cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~   70 (132)
                      ||..|.|.+|.+  -.|... |.+++.|++   +  .|.|+++|.|..|+...
T Consensus       269 C~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~---g--~CiC~~g~~G~dCs~~~  313 (525)
T KOG1225|consen  269 CPPGFTGDDCDE--LVCPVD-CSGGGVCVD---G--ECICNPGYSGKDCSIRR  313 (525)
T ss_pred             CCCCCcCCCCCc--ccCCcc-cCCCceecC---C--EeecCCCcccccccccc
Confidence            666777888875  357665 777676663   2  89999999999997543


No 31 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.67  E-value=0.22  Score=25.41  Aligned_cols=18  Identities=28%  Similarity=1.005  Sum_probs=14.4

Q ss_pred             eEeeCCCCceEEeCCCCcc
Q psy11297         44 TCQDEEDGLFECLCSPEFT   62 (132)
Q Consensus        44 ~C~~~~~~~~~C~C~~g~~   62 (132)
                      .|++.. ++|+|.|+.||.
T Consensus        11 ~C~~~~-g~~~C~C~~Gy~   28 (36)
T PF14670_consen   11 ICVNTP-GSYRCSCPPGYK   28 (36)
T ss_dssp             EEEEET-TSEEEE-STTEE
T ss_pred             CCccCC-CceEeECCCCCE
Confidence            788887 469999999985


No 32 
>KOG1214|consensus
Probab=87.47  E-value=1.9  Score=37.10  Aligned_cols=31  Identities=26%  Similarity=0.639  Sum_probs=22.3

Q ss_pred             CeEeeCCCCceEEeCCCCccC--CcccccCCCCC
Q psy11297         43 ATCQDEEDGLFECLCSPEFTG--YLCHTRAPPKL   74 (132)
Q Consensus        43 g~C~~~~~~~~~C~C~~g~~G--~~C~~~~~~c~   74 (132)
                      ..|+....++|+|.|.+||.|  ..|. +.++|.
T Consensus       799 a~c~~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~  831 (1289)
T KOG1214|consen  799 ARCVHHGGSTYSCACLPGFSGDGHQCT-DVDECS  831 (1289)
T ss_pred             eEEEecCCceEEEeecCCccCCccccc-cccccC
Confidence            366666656899999999986  4554 347775


No 33 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=86.51  E-value=0.88  Score=24.47  Aligned_cols=23  Identities=35%  Similarity=0.791  Sum_probs=17.9

Q ss_pred             CeEeeCCCCceEEeCCCCccCCcccc
Q psy11297         43 ATCQDEEDGLFECLCSPEFTGYLCHT   68 (132)
Q Consensus        43 g~C~~~~~~~~~C~C~~g~~G~~C~~   68 (132)
                      ..|...   +..|.|.++++|++|++
T Consensus        11 ~~C~~~---~G~C~C~~~~~G~~C~~   33 (49)
T PF00053_consen   11 QTCDPS---TGQCVCKPGTTGPRCDQ   33 (49)
T ss_dssp             SSEEET---CEEESBSTTEESTTS-E
T ss_pred             CcccCC---CCEEeccccccCCcCcC
Confidence            467664   26999999999999986


No 34 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=84.85  E-value=1.1  Score=24.31  Aligned_cols=16  Identities=38%  Similarity=0.831  Sum_probs=14.3

Q ss_pred             eEEeCCCCccCCcccc
Q psy11297         53 FECLCSPEFTGYLCHT   68 (132)
Q Consensus        53 ~~C~C~~g~~G~~C~~   68 (132)
                      -.|.|.+++.|..|++
T Consensus        19 G~C~C~~~~~G~~C~~   34 (50)
T cd00055          19 GQCECKPNTTGRRCDR   34 (50)
T ss_pred             CEEeCCCcCCCCCCCC
Confidence            5899999999999974


No 35 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=82.98  E-value=0.71  Score=29.33  Aligned_cols=32  Identities=25%  Similarity=0.774  Sum_probs=22.2

Q ss_pred             CCCCCCCeEeeCCC----CceEEeCCC-------------CccCCcccc
Q psy11297         37 HPCQNGATCQDEED----GLFECLCSP-------------EFTGYLCHT   68 (132)
Q Consensus        37 ~pC~ngg~C~~~~~----~~~~C~C~~-------------g~~G~~C~~   68 (132)
                      +-|.++|.|+....    .=|.|.|.+             .|.|..|+.
T Consensus        13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqK   61 (103)
T PF12955_consen   13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQK   61 (103)
T ss_pred             cCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccc
Confidence            45677899997621    238999976             466777764


No 36 
>KOG1226|consensus
Probab=81.90  E-value=2  Score=36.30  Aligned_cols=40  Identities=25%  Similarity=0.640  Sum_probs=22.2

Q ss_pred             CCCcccCCCCCCCC---CCCCCCeEeeCCCCceEEeCCCCccCCcccc
Q psy11297         24 SPNLRLEESPCLSH---PCQNGATCQDEEDGLFECLCSPEFTGYLCHT   68 (132)
Q Consensus        24 g~~C~~~~~~C~~~---pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~   68 (132)
                      |+.|+-+.-.|..+   .|.++|+|.=     -.|.|.+||+|..|+-
T Consensus       539 G~fCECDnfsC~r~~g~lC~g~G~C~C-----G~CvC~~GwtG~~C~C  581 (783)
T KOG1226|consen  539 GKFCECDNFSCERHKGVLCGGHGRCEC-----GRCVCNPGWTGSACNC  581 (783)
T ss_pred             eeeeeccCcccccccCcccCCCCeEeC-----CcEEcCCCCccCCCCC
Confidence            66666555555542   4555555541     2566666666665543


No 37 
>KOG1836|consensus
Probab=81.47  E-value=3  Score=38.58  Aligned_cols=60  Identities=27%  Similarity=0.452  Sum_probs=40.2

Q ss_pred             CccCCeecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCC-CCceEEe-CCCCccCCcccccCC
Q psy11297         11 GCLSASVVQSSQASPNLRLEESPCLSHPCQNGATCQDEE-DGLFECL-CSPEFTGYLCHTRAP   71 (132)
Q Consensus        11 ~~~~~~~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~-~~~~~C~-C~~g~~G~~C~~~~~   71 (132)
                      |-.-++..+++| |..=......|.+=||.+++.|.... .....|. |+++|+|.+|+...+
T Consensus       755 G~~C~~C~~GfY-g~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d  816 (1705)
T KOG1836|consen  755 GGQCAQCVDGFY-GLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD  816 (1705)
T ss_pred             CCchhhhcCCCC-CccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence            334444555666 33322222338888999999998654 2357899 999999999998554


No 38 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=79.34  E-value=0.98  Score=26.02  Aligned_cols=11  Identities=45%  Similarity=1.093  Sum_probs=8.0

Q ss_pred             eCCCCccCCcc
Q psy11297         56 LCSPEFTGYLC   66 (132)
Q Consensus        56 ~C~~g~~G~~C   66 (132)
                      .|.+||+|+.|
T Consensus        53 ~C~~Gw~G~~C   63 (63)
T PF01414_consen   53 VCLPGWTGPNC   63 (63)
T ss_dssp             EE-TTEESTTS
T ss_pred             CCCCCCcCCCC
Confidence            47789999877


No 39 
>KOG3509|consensus
Probab=78.01  E-value=6.5  Score=34.36  Aligned_cols=56  Identities=21%  Similarity=0.426  Sum_probs=41.4

Q ss_pred             CeecCCCCCCCCcccCCCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCC
Q psy11297         15 ASVVQSSQASPNLRLEESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPP   72 (132)
Q Consensus        15 ~~~cp~~~~g~~C~~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~   72 (132)
                      -..||.+..+..++  .-.|...+|+-+++|......+|.+.|+..-.+..|+.....
T Consensus       158 ~~~~~~~~~~~~~~--~~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r  213 (964)
T KOG3509|consen  158 KRCCPTSEGGPGTE--PIHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPR  213 (964)
T ss_pred             hhcCcccCCCCccc--cccccCcccccceeEEecCCcceeeeccccccceehhhccCc
Confidence            34567777666665  357889999999999977544688888888777778775543


No 40 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=75.72  E-value=3.4  Score=26.07  Aligned_cols=32  Identities=22%  Similarity=0.552  Sum_probs=24.5

Q ss_pred             CCCCCCC-CCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297         30 EESPCLS-HPCQNGATCQDEEDGLFECLCSPEFTG   63 (132)
Q Consensus        30 ~~~~C~~-~pC~ngg~C~~~~~~~~~C~C~~g~~G   63 (132)
                      ..+.|.. ..|...|.|....  ...|.|++||.-
T Consensus        76 p~d~Cd~y~~CG~~g~C~~~~--~~~C~Cl~GF~P  108 (110)
T PF00954_consen   76 PKDQCDVYGFCGPNGICNSNN--SPKCSCLPGFEP  108 (110)
T ss_pred             cccCCCCccccCCccEeCCCC--CCceECCCCcCC
Confidence            3467874 8899999997553  367999999963


No 41 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=74.03  E-value=2.7  Score=22.35  Aligned_cols=16  Identities=38%  Similarity=0.879  Sum_probs=14.1

Q ss_pred             eEEeCCCCccCCcccc
Q psy11297         53 FECLCSPEFTGYLCHT   68 (132)
Q Consensus        53 ~~C~C~~g~~G~~C~~   68 (132)
                      -.|.|+++++|++|++
T Consensus        18 G~C~C~~~~~G~~C~~   33 (46)
T smart00180       18 GQCECKPNVTGRRCDR   33 (46)
T ss_pred             CEEECCCCCCCCCCCc
Confidence            4899999999999974


No 42 
>KOG3509|consensus
Probab=71.12  E-value=6.7  Score=34.30  Aligned_cols=42  Identities=31%  Similarity=0.737  Sum_probs=35.9

Q ss_pred             CCCCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccccCCCC
Q psy11297         31 ESPCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHTRAPPK   73 (132)
Q Consensus        31 ~~~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~~~c   73 (132)
                      .+.|...||.+.+.|.+...+ ..|.|+++|.|..|+...+.|
T Consensus       406 g~~c~~~p~~~~g~c~p~~~~-~~c~c~~g~~G~~c~d~~~~~  447 (964)
T KOG3509|consen  406 GDVCWRIPCQHDGPCLQTLEG-KQCLCPPGYTGDSCEDCMNGC  447 (964)
T ss_pred             CCccccccCCCCccccccccc-cceeccccccCchhhccCccc
Confidence            467889999999999988754 999999999999998876654


No 43 
>KOG4260|consensus
Probab=71.09  E-value=2.4  Score=31.71  Aligned_cols=33  Identities=24%  Similarity=0.647  Sum_probs=27.6

Q ss_pred             CCCCCC--CCCCCCCCeEeeCCCCceEEeCCCCccC
Q psy11297         30 EESPCL--SHPCQNGATCQDEEDGLFECLCSPEFTG   63 (132)
Q Consensus        30 ~~~~C~--~~pC~ngg~C~~~~~~~~~C~C~~g~~G   63 (132)
                      ++++|.  +.||.....|++.. |+|+|.+..||.+
T Consensus       235 DvnEC~~ep~~c~~~qfCvNte-GSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  235 DVNECQNEPAPCKAHQFCVNTE-GSFKCEDKEGYKK  269 (350)
T ss_pred             cHHHHhcCCCCCChhheeecCC-CceEecccccccC
Confidence            677775  57899999999987 6799999888865


No 44 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=66.53  E-value=5.1  Score=20.13  Aligned_cols=25  Identities=24%  Similarity=0.763  Sum_probs=16.3

Q ss_pred             CCCCCCCCCCeEeeCCCCceEEeCCCCcc
Q psy11297         34 CLSHPCQNGATCQDEEDGLFECLCSPEFT   62 (132)
Q Consensus        34 C~~~pC~ngg~C~~~~~~~~~C~C~~g~~   62 (132)
                      |....|.  +.|.+..  .+.|.||.||.
T Consensus         3 Cn~t~Cp--A~CDpn~--~~~C~CPeGyI   27 (34)
T PF09064_consen    3 CNQTECP--ADCDPNS--PGQCFCPEGYI   27 (34)
T ss_pred             cccccCC--CccCCCC--CCceeCCCceE
Confidence            4444554  3666654  37999999984


No 45 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=59.98  E-value=5.3  Score=22.36  Aligned_cols=35  Identities=26%  Similarity=0.474  Sum_probs=17.9

Q ss_pred             CCCCCCCeEeeC---CCCceEEeCCCCccCCcccccCC
Q psy11297         37 HPCQNGATCQDE---EDGLFECLCSPEFTGYLCHTRAP   71 (132)
Q Consensus        37 ~pC~ngg~C~~~---~~~~~~C~C~~g~~G~~C~~~~~   71 (132)
                      .+|..+|+....   ..|.-.|.|..-|.|+.|.+.+.
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~   54 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIP   54 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-T
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCCCCcccCCC
Confidence            456666665421   12457899999999999988653


No 46 
>KOG0994|consensus
Probab=52.36  E-value=16  Score=32.98  Aligned_cols=32  Identities=34%  Similarity=0.895  Sum_probs=20.6

Q ss_pred             CCCCCCCCCCCeEeeCCCCceEEeCCCCccCCcccc
Q psy11297         33 PCLSHPCQNGATCQDEEDGLFECLCSPEFTGYLCHT   68 (132)
Q Consensus        33 ~C~~~pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~   68 (132)
                      +|.=+| ..+..|..-.   -.|.|.+||.|+.|.+
T Consensus      1068 ~C~Cd~-~~~pqCN~ft---GQCqCkpGfGGR~C~q 1099 (1758)
T KOG0994|consen 1068 PCNCDP-IGGPQCNEFT---GQCQCKPGFGGRTCSQ 1099 (1758)
T ss_pred             ccCCCc-cCCccccccc---cceeccCCCCCcchhH
Confidence            444444 2333555443   3799999999999876


No 47 
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=50.58  E-value=29  Score=21.26  Aligned_cols=27  Identities=30%  Similarity=0.934  Sum_probs=21.1

Q ss_pred             CCCCCCCCCCCeEeeCCCCceEEeCCC
Q psy11297         33 PCLSHPCQNGATCQDEEDGLFECLCSP   59 (132)
Q Consensus        33 ~C~~~pC~ngg~C~~~~~~~~~C~C~~   59 (132)
                      +|....|..|.+|+-...+.-.|+|..
T Consensus         1 pC~~v~C~~G~~C~~d~~~~p~CvC~~   27 (86)
T cd01328           1 PCENHHCGAGKVCEVDDENTPKCVCID   27 (86)
T ss_pred             CCCCcCCCCCCEeeECCCCCeEEecCC
Confidence            577888999999986544568999864


No 48 
>PHA02885 putative interleukin binding protein; Provisional
Probab=43.49  E-value=35  Score=21.80  Aligned_cols=24  Identities=21%  Similarity=0.516  Sum_probs=19.6

Q ss_pred             CCCCeEEEEeccCCCCCCCeEEEE
Q psy11297        106 NKNDGILLYNQQNLDGTGDFVSLA  129 (132)
Q Consensus       106 ~~~~g~ll~~~~~~~~~~d~~~l~  129 (132)
                      .+.+|.|+|.+..+...+.||.++
T Consensus        57 nsedgyiywi~pnntspgeyif~e   80 (135)
T PHA02885         57 NSEDGYIYWIGPNNTSPGEYIFIE   80 (135)
T ss_pred             cCCCceEEEEeCCCCCCcceeeee
Confidence            468899999999877778888765


No 49 
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=41.06  E-value=41  Score=15.64  Aligned_cols=23  Identities=26%  Similarity=0.683  Sum_probs=14.2

Q ss_pred             CCCCCCCCCCCeEeeCCCCceEE
Q psy11297         33 PCLSHPCQNGATCQDEEDGLFEC   55 (132)
Q Consensus        33 ~C~~~pC~ngg~C~~~~~~~~~C   55 (132)
                      +|...-|..|.+|.....+.-.|
T Consensus         1 ~C~~v~C~~G~~C~~d~~g~p~C   23 (26)
T smart00274        1 SCRNVQCPFGKVCVVDKGGNARC   23 (26)
T ss_pred             CCCCEECCCCCEEEeCCCCCEEE
Confidence            35666677777887654344555


No 50 
>PF12020 TAFA:  TAFA family;  InterPro: IPR020350 This entry contains the protein FAM19A2 (also known as TAFA2); it is a secreted, 11 kDa member of the FAM19/TAFA family of chemokine-like proteins [], which are composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The FAM19/TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines that act as regulators of immune and nervous cells []. 
Probab=34.43  E-value=52  Score=20.17  Aligned_cols=39  Identities=26%  Similarity=0.571  Sum_probs=20.8

Q ss_pred             cCCccCCeecCCCCCCCCccc-----CCCCCCCCCCCCCCeEee
Q psy11297          9 RMGCLSASVVQSSQASPNLRL-----EESPCLSHPCQNGATCQD   47 (132)
Q Consensus         9 ~~~~~~~~~cp~~~~g~~C~~-----~~~~C~~~pC~ngg~C~~   47 (132)
                      ++.|..+++.=-.-+-+.|-.     ...-|.-.||..|..|.-
T Consensus        27 ~C~C~~GqvaGTTrA~Pscvdasiv~~rwWC~MePCl~gEeC~~   70 (90)
T PF12020_consen   27 RCSCFPGQVAGTTRARPSCVDASIVLQRWWCQMEPCLEGEECKV   70 (90)
T ss_pred             EEEecCCeecccccCCCccccceeeeeeeeeecccccCCcceEE
Confidence            456666666543333333421     122366777777777763


No 51 
>KOG1226|consensus
Probab=34.25  E-value=80  Score=27.20  Aligned_cols=50  Identities=18%  Similarity=0.461  Sum_probs=28.6

Q ss_pred             cCCCCCCCCcc--cCCCCCCCCCCCCCCeEeeCCCC-ceEEeCCCC-ccCCcccccC
Q psy11297         18 VQSSQASPNLR--LEESPCLSHPCQNGATCQDEEDG-LFECLCSPE-FTGYLCHTRA   70 (132)
Q Consensus        18 cp~~~~g~~C~--~~~~~C~~~pC~ngg~C~~~~~~-~~~C~C~~g-~~G~~C~~~~   70 (132)
                      |...+.|..|+  ...+.|.+.   +|+.|.....- =-+|.|... |.|..||...
T Consensus       570 C~~GwtG~~C~C~~std~C~~~---~G~iCSGrG~C~Cg~C~C~~~~~sG~~CE~cp  623 (783)
T KOG1226|consen  570 CNPGWTGSACNCPLSTDTCESS---DGQICSGRGTCECGRCKCTDPPYSGEFCEKCP  623 (783)
T ss_pred             cCCCCccCCCCCCCCCccccCC---CCceeCCCceeeCCceEcCCCCcCcchhhcCC
Confidence            34455588775  456667653   44455433100 135667444 9999998854


No 52 
>KOG0994|consensus
Probab=31.19  E-value=1.1e+02  Score=27.99  Aligned_cols=37  Identities=24%  Similarity=0.713  Sum_probs=24.2

Q ss_pred             CCCCCCCCCCC--------eEeeCCC-CceEEeCCCCccCCccccc
Q psy11297         33 PCLSHPCQNGA--------TCQDEED-GLFECLCSPEFTGYLCHTR   69 (132)
Q Consensus        33 ~C~~~pC~ngg--------~C~~~~~-~~~~C~C~~g~~G~~C~~~   69 (132)
                      .|.+=||..+.        .|..... -...|.|..||.|.+|+.=
T Consensus       905 ~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C  950 (1758)
T KOG0994|consen  905 GCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC  950 (1758)
T ss_pred             CCCCCCCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence            46655666543        4543321 1367999999999999873


No 53 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=30.89  E-value=1.8e+02  Score=20.53  Aligned_cols=41  Identities=29%  Similarity=0.448  Sum_probs=25.1

Q ss_pred             cCCceeEEEEEEEEeCCCCe--EEEEeccCCCCCCCeEEEEEcC
Q psy11297         91 LKAYNKLSIEIEFKTNKNDG--ILLYNQQNLDGTGDFVSLAIVN  132 (132)
Q Consensus        91 ~~~~~~~~i~~~f~t~~~~g--~ll~~~~~~~~~~d~~~l~l~~  132 (132)
                      ++...+..|.++|++....+  .||....... ...|+.|.+.|
T Consensus        29 lk~L~~gTI~i~Fk~~~~~~~~sLfsiSn~~~-~n~YF~lyv~~   71 (190)
T PF02973_consen   29 LKKLEEGTIVIRFKSDSNSGIQSLFSISNSTK-GNEYFSLYVSN   71 (190)
T ss_dssp             CCT-SSEEEEEEEEESS-SSEEEEEEEE-TST-TSEEEEEEEET
T ss_pred             HhcccccEEEEEEecCCCcceeEEEEecCCCC-ccceEEEEEEC
Confidence            44566789999999865544  3454444433 45788887764


No 54 
>KOG3607|consensus
Probab=30.60  E-value=38  Score=28.93  Aligned_cols=29  Identities=24%  Similarity=0.809  Sum_probs=23.0

Q ss_pred             CCCCCCeEeeCCCCceEEeCCCCccCCcccccC
Q psy11297         38 PCQNGATCQDEEDGLFECLCSPEFTGYLCHTRA   70 (132)
Q Consensus        38 pC~ngg~C~~~~~~~~~C~C~~g~~G~~C~~~~   70 (132)
                      .|...|.|.+.    +.|.|.+||.++.|+..-
T Consensus       631 ~C~g~GVCnn~----~~ChC~~gwapp~C~~~~  659 (716)
T KOG3607|consen  631 TCNGHGVCNNE----LNCHCEPGWAPPFCFIFG  659 (716)
T ss_pred             ccCCCcccCCC----cceeeCCCCCCCcccccc
Confidence            37667777643    789999999999998754


No 55 
>KOG1218|consensus
Probab=25.47  E-value=2.5e+02  Score=20.51  Aligned_cols=30  Identities=30%  Similarity=0.463  Sum_probs=19.6

Q ss_pred             eEEeCCCCccCCcccccCCCCCCCCCCCCC
Q psy11297         53 FECLCSPEFTGYLCHTRAPPKLYDTPAFNG   82 (132)
Q Consensus        53 ~~C~C~~g~~G~~C~~~~~~c~~~~~~~~g   82 (132)
                      -.|.|++||.|..|......|.....+.++
T Consensus       162 ~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g  191 (316)
T KOG1218|consen  162 GICTCQPGFVGVFCVESCSGCSPLTACENG  191 (316)
T ss_pred             CceeccCCcccccccccCCCcCCCcccCCC
Confidence            567899999999988765434323334444


No 56 
>KOG4004|consensus
Probab=22.04  E-value=96  Score=22.39  Aligned_cols=31  Identities=26%  Similarity=0.692  Sum_probs=24.7

Q ss_pred             cCCCCCCCCCCCCCCeEeeCCCCceEEeCCC
Q psy11297         29 LEESPCLSHPCQNGATCQDEEDGLFECLCSP   59 (132)
Q Consensus        29 ~~~~~C~~~pC~ngg~C~~~~~~~~~C~C~~   59 (132)
                      ...++|+.+.|..|-.|.-...+.-+|+|..
T Consensus        48 ~a~npC~dh~Cg~gk~C~vd~~~~P~Cvc~~   78 (259)
T KOG4004|consen   48 EARNPCADHKCGPGKNCLVDLQTQPRCVCCR   78 (259)
T ss_pred             cccCccccccCCCCceeeecCCCCceeEEec
Confidence            3468999999999999986655668899855


No 57 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=21.86  E-value=48  Score=23.50  Aligned_cols=26  Identities=23%  Similarity=0.759  Sum_probs=19.8

Q ss_pred             CCCCCCCeEeeCCC----CceEEeCCCCcc
Q psy11297         37 HPCQNGATCQDEED----GLFECLCSPEFT   62 (132)
Q Consensus        37 ~pC~ngg~C~~~~~----~~~~C~C~~g~~   62 (132)
                      .+|...+.|+...+    ..|.|.|.+||.
T Consensus        50 K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~   79 (197)
T PF06247_consen   50 KPCGDYAKCINQANKGEERAYKCDCINGYI   79 (197)
T ss_dssp             SEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred             ccccchhhhhcCCCcccceeEEEecccCce
Confidence            68999999997653    369999999985


No 58 
>PF07966 A1_Propeptide:  A1 Propeptide ;  InterPro: IPR012848 Most eukaryotic endopeptidases (MEROPS peptidase family A1) are synthesised with signal and propeptides. The animal pepsin-like endopeptidase propeptides form a distinct family of propeptides, which contain a conserved motif approximately 30 residues long. In pepsinogen A, the first 11 residues of the mature pepsin sequence are displaced by residues of the propeptide. The propeptide contains two helices that block the active site cleft, in particular the conserved Asp11 residue, in pepsin, hydrogen bonds to a conserved Arg residue in the propeptide. This hydrogen bond stabilises the propeptide conformation and is probably responsible for triggering the conversion of pepsinogen to pepsin under acidic conditions [, ]. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1AVF_Q 1HTR_P 3PSG_A 2PSG_A 3VCM_Q 1TZS_P.
Probab=20.65  E-value=32  Score=16.38  Aligned_cols=8  Identities=38%  Similarity=0.845  Sum_probs=3.3

Q ss_pred             CceeEeec
Q psy11297          2 VPLVRMKR    9 (132)
Q Consensus         2 ~~~~~~~~    9 (132)
                      |||.|+.+
T Consensus         4 IPL~K~kS   11 (29)
T PF07966_consen    4 IPLKKFKS   11 (29)
T ss_dssp             EEEEE---
T ss_pred             EeccCCch
Confidence            56766654


Done!