Query         psy8491
Match_columns 273
No_of_seqs    215 out of 1559
Neff          7.1 
Searched_HMMs 46136
Date          Fri Aug 16 20:14:20 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8491.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8491hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1225|consensus               99.2 9.7E-11 2.1E-15  114.0  10.9  126   26-194   233-364 (525)
  2 KOG1225|consensus               98.8 2.4E-08 5.3E-13   97.4   8.7   72   25-103   263-338 (525)
  3 KOG1214|consensus               98.0 1.3E-05 2.8E-10   80.9   8.1  135   53-201   705-866 (1289)
  4 KOG1214|consensus               97.7 5.9E-05 1.3E-09   76.3   5.6   91   18-111   748-865 (1289)
  5 KOG1226|consensus               97.6 0.00028   6E-09   71.2   9.6  118   25-164   476-620 (783)
  6 KOG4289|consensus               97.5 6.9E-05 1.5E-09   79.3   4.2   75   22-99   1217-1308(2531)
  7 KOG4260|consensus               97.5 7.1E-05 1.5E-09   67.2   3.6  148   30-200   131-312 (350)
  8 KOG1226|consensus               97.4 0.00075 1.6E-08   68.1   9.7   47  153-202   569-623 (783)
  9 KOG1219|consensus               97.4 0.00028   6E-09   77.7   6.0   94   81-200  3870-3978(4289)
 10 KOG1217|consensus               97.2  0.0043 9.2E-08   59.0  11.3  156   22-195   105-305 (487)
 11 KOG4289|consensus               96.9 0.00081 1.7E-08   71.6   4.3   73   60-137  1222-1308(2531)
 12 KOG1219|consensus               96.7  0.0023   5E-08   70.9   5.5   80   49-137  3870-3970(4289)
 13 PF07974 EGF_2:  EGF-like domai  96.2  0.0073 1.6E-07   37.0   3.3   20   51-70      8-29  (32)
 14 KOG1217|consensus               96.1   0.062 1.3E-06   51.0  11.3  155   19-193   184-388 (487)
 15 KOG0994|consensus               96.0   0.021 4.6E-07   60.1   7.8   44  153-200   889-949 (1758)
 16 smart00051 DSL delta serrate l  95.6   0.011 2.5E-07   41.9   2.9   43  153-198    20-63  (63)
 17 PF06247 Plasmod_Pvs28:  Plasmo  94.9    0.01 2.2E-07   50.8   1.1   93   15-109     7-126 (197)
 18 PF07974 EGF_2:  EGF-like domai  94.7   0.033 7.2E-07   34.1   2.6   21  176-198    11-32  (32)
 19 smart00051 DSL delta serrate l  94.5   0.034 7.4E-07   39.4   2.8   42   26-70     16-60  (63)
 20 KOG0994|consensus               93.8    0.13 2.9E-06   54.5   6.3   48  148-200  1084-1147(1758)
 21 PF12661 hEGF:  Human growth fa  93.7   0.033 7.1E-07   27.2   0.9   10  184-193     1-10  (13)
 22 PF12947 EGF_3:  EGF domain;  I  93.6   0.041 8.9E-07   34.6   1.4   23  176-198    11-36  (36)
 23 PF00008 EGF:  EGF-like domain   93.0   0.044 9.5E-07   33.4   0.8   21  173-193     5-30  (32)
 24 KOG4260|consensus               91.4    0.17 3.7E-06   45.9   3.0   46  153-200   221-276 (350)
 25 PF00008 EGF:  EGF-like domain   90.9   0.086 1.9E-06   32.0   0.4   21   50-70      5-30  (32)
 26 KOG1218|consensus               90.8      11 0.00025   33.9  14.6   54   22-79     10-65  (316)
 27 PF12947 EGF_3:  EGF domain;  I  90.4    0.12 2.6E-06   32.5   0.7   20   51-70      8-31  (36)
 28 PF12662 cEGF:  Complement Clr-  90.1    0.23   5E-06   28.4   1.7   10   61-70      3-12  (24)
 29 PF14670 FXa_inhibition:  Coagu  89.6    0.23 4.9E-06   31.2   1.6   25   15-39      7-31  (36)
 30 PF12662 cEGF:  Complement Clr-  88.6    0.34 7.5E-06   27.7   1.7   21   26-46      1-22  (24)
 31 PF01414 DSL:  Delta serrate li  87.4    0.23   5E-06   35.2   0.6   44  153-198    20-63  (63)
 32 PF07645 EGF_CA:  Calcium-bindi  86.3    0.51 1.1E-05   30.3   1.8   10   60-69     25-34  (42)
 33 smart00179 EGF_CA Calcium-bind  86.0    0.81 1.8E-05   27.9   2.5   19   51-69     11-33  (39)
 34 smart00179 EGF_CA Calcium-bind  85.9     1.2 2.6E-05   27.2   3.3   18   19-36     16-33  (39)
 35 PF07645 EGF_CA:  Calcium-bindi  85.5    0.33 7.1E-06   31.2   0.6   20   18-37     16-35  (42)
 36 PF06247 Plasmod_Pvs28:  Plasmo  84.2    0.59 1.3E-05   40.2   1.6   82   22-104    65-163 (197)
 37 KOG1836|consensus               83.4     1.9 4.2E-05   48.4   5.6   48  149-200   757-813 (1705)
 38 cd00053 EGF Epidermal growth f  83.1     1.4 3.1E-05   25.9   2.7   18   53-70     11-31  (36)
 39 KOG1218|consensus               80.8      13 0.00029   33.5   9.4   12   91-102   161-172 (316)
 40 PF01414 DSL:  Delta serrate li  78.9    0.82 1.8E-05   32.3   0.6   43   25-70     15-60  (63)
 41 PHA02887 EGF-like protein; Pro  78.6     1.7 3.7E-05   34.6   2.3   27  173-201    93-124 (126)
 42 cd00054 EGF_CA Calcium-binding  77.8     2.4 5.1E-05   25.3   2.4   18   53-70     14-34  (38)
 43 smart00181 EGF Epidermal growt  77.4     1.9 4.1E-05   25.8   1.9   11   60-70     20-30  (35)
 44 PHA03099 epidermal growth fact  74.8     2.2 4.8E-05   34.5   2.1   28  173-202    52-84  (139)
 45 PF01683 EB:  EB module;  Inter  63.2      17 0.00037   24.0   4.3   22  176-198    31-52  (52)
 46 PF09064 Tme5_EGF_like:  Thromb  61.5     7.6 0.00017   24.0   2.0   16   25-40     16-31  (34)
 47 PTZ00214 high cysteine membran  59.4      51  0.0011   34.8   8.9   20  182-201   750-769 (800)
 48 PF03302 VSP:  Giardia variant-  53.5      69  0.0015   30.8   8.2   18   63-80     41-58  (397)
 49 PHA02887 EGF-like protein; Pro  52.7      12 0.00025   29.9   2.3   24   51-77     94-122 (126)
 50 PF02363 C_tripleX:  Cysteine r  39.2      14  0.0003   18.9   0.6   12   42-53      2-13  (17)
 51 KOG1836|consensus               38.7      49  0.0011   37.8   5.1   16   29-44    697-712 (1705)
 52 PHA03099 epidermal growth fact  35.6      33 0.00071   27.9   2.4   24   51-77     53-81  (139)
 53 PF12946 EGF_MSP1_1:  MSP1 EGF   35.6      16 0.00036   23.0   0.6   24  176-199    10-37  (37)
 54 PF01826 TIL:  Trypsin Inhibito  32.6      32 0.00068   22.9   1.7   16  184-199    34-50  (55)
 55 smart00180 EGF_Lam Laminin-typ  30.0      39 0.00083   21.9   1.7   17  182-200    17-33  (46)
 56 PF00053 Laminin_EGF:  Laminin   29.8      23 0.00051   23.0   0.6   21  178-200    12-33  (49)
 57 PF00954 S_locus_glycop:  S-loc  29.4      39 0.00085   25.9   1.9   22   49-70     84-108 (110)
 58 cd00055 EGF_Lam Laminin-type e  27.4      50  0.0011   21.7   1.9   14   25-38     17-30  (50)
 59 KOG1388|consensus               26.0      67  0.0014   28.3   2.9   47  153-201    80-128 (217)
 60 KOG0196|consensus               21.6      96  0.0021   32.9   3.5   44   60-103   259-319 (996)

No 1  
>KOG1225|consensus
Probab=99.19  E-value=9.7e-11  Score=114.02  Aligned_cols=126  Identities=27%  Similarity=0.601  Sum_probs=84.3

Q ss_pred             CceeeeCCCCeeCCCCCCCCCCCCCCC-CCEEcCCCeEEcCCCCeeCCCCCcc-cCC-CCCC-CceecCCCceeeCCCcc
Q psy8491          26 RQAVICRFDLKFVNASYCAPVCSSPCI-NGACVFPEQCHCSPGFQFVNETYCE-PYC-ENCQ-HGTCTAPSVCECESGFV  101 (273)
Q Consensus        26 ~~~C~C~~Gy~~~~~~~C~~~C~~~C~-~g~C~~~~~C~C~~Gy~g~~~~~C~-~~C-~~C~-~G~C~~~~~C~C~~Gy~  101 (273)
                      .++|.|..+|.+.+..  ...|++.|. +|.|+. +.|+|++||+|   ..|. ..| ..|. |+.++. ++|+|++||+
T Consensus       233 ~~ic~c~~~~~g~~c~--~~~C~~~c~~~g~c~~-G~CIC~~Gf~G---~dC~e~~Cp~~cs~~g~~~~-g~CiC~~g~~  305 (525)
T KOG1225|consen  233 DGICECPEGYFGPLCS--TIYCPGGCTGRGQCVE-GRCICPPGFTG---DDCDELVCPVDCSGGGVCVD-GECICNPGYS  305 (525)
T ss_pred             CceeecCCceeCCccc--cccCCCCCcccceEeC-CeEeCCCCCcC---CCCCcccCCcccCCCceecC-CEeecCCCcc
Confidence            3489999999987654  356777777 788886 89999999999   4554 345 3365 455555 5999999999


Q ss_pred             cCCCCCcc-ccccCCCCCCCCCCeecCCCCceeeCCCCCcCCCCCCCCcccccCCCCeeeCCCCCCCCCCCCCCCC-Cee
Q psy8491         102 HNNETNAC-ERICEEPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSNPYCIIQCDVGYTLNPRTLYCEPECFNCTR-GYC  179 (273)
Q Consensus       102 g~~~~~~C-~~~C~~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~ng~C~~~C~~Gy~g~~c~~~C~~~C~~C~n-G~C  179 (273)
                      |.    .| ++.|...|..  +|.|+ .++ |.                    |.+||+|..|..+  +    |.+ |.|
T Consensus       306 G~----dCs~~~cpadC~g--~G~Ci-~G~-C~--------------------C~~Gy~G~~C~~~--~----C~~~g~c  351 (525)
T KOG1225|consen  306 GK----DCSIRRCPADCSG--HGKCI-DGE-CL--------------------CDEGYTGELCIQR--A----CSGGGQC  351 (525)
T ss_pred             cc----ccccccCCccCCC--CCccc-CCc-eE--------------------eCCCCcCCccccc--c----cCCCcee
Confidence            95    45 2456667766  78877 344 54                    5556666655533  1    334 333


Q ss_pred             cCCCeeECCCCceeC
Q psy8491         180 LEPNECSCPYNFHLE  194 (273)
Q Consensus       180 ~~~~~C~C~~Gy~G~  194 (273)
                      + .+ |+|..||.|.
T Consensus       352 v-~g-C~C~~Gw~G~  364 (525)
T KOG1225|consen  352 V-NG-CKCKKGWRGP  364 (525)
T ss_pred             c-cC-ceeccCccCC
Confidence            3 34 6677776663


No 2  
>KOG1225|consensus
Probab=98.76  E-value=2.4e-08  Score=97.42  Aligned_cols=72  Identities=29%  Similarity=0.781  Sum_probs=58.7

Q ss_pred             CCceeeeCCCCeeCCCCCCCCCCCCCCC-CCEEcCCCeEEcCCCCeeCCCCCcc-cCC-CCCC-CceecCCCceeeCCCc
Q psy8491          25 LRQAVICRFDLKFVNASYCAPVCSSPCI-NGACVFPEQCHCSPGFQFVNETYCE-PYC-ENCQ-HGTCTAPSVCECESGF  100 (273)
Q Consensus        25 ~~~~C~C~~Gy~~~~~~~C~~~C~~~C~-~g~C~~~~~C~C~~Gy~g~~~~~C~-~~C-~~C~-~G~C~~~~~C~C~~Gy  100 (273)
                      ..+.|+|++||.|.+.+.  ..|+.+|+ ++.+++ ++|+|.+||+|   ..|. ..| ..|. ||.|+ .++|.|.+||
T Consensus       263 ~~G~CIC~~Gf~G~dC~e--~~Cp~~cs~~g~~~~-g~CiC~~g~~G---~dCs~~~cpadC~g~G~Ci-~G~C~C~~Gy  335 (525)
T KOG1225|consen  263 VEGRCICPPGFTGDDCDE--LVCPVDCSGGGVCVD-GECICNPGYSG---KDCSIRRCPADCSGHGKCI-DGECLCDEGY  335 (525)
T ss_pred             eCCeEeCCCCCcCCCCCc--ccCCcccCCCceecC-CEeecCCCccc---cccccccCCccCCCCCccc-CCceEeCCCC
Confidence            467899999999977555  57777787 666766 69999999999   6665 445 5676 79999 7999999999


Q ss_pred             ccC
Q psy8491         101 VHN  103 (273)
Q Consensus       101 ~g~  103 (273)
                      +|.
T Consensus       336 ~G~  338 (525)
T KOG1225|consen  336 TGE  338 (525)
T ss_pred             cCC
Confidence            995


No 3  
>KOG1214|consensus
Probab=98.05  E-value=1.3e-05  Score=80.93  Aligned_cols=135  Identities=25%  Similarity=0.594  Sum_probs=83.5

Q ss_pred             CCEEcC----CCeEEcCCCCeeCCCCCcc--cCC----CCCC-CceecCC---CceeeCCCcccCCCCCccccccC----
Q psy8491          53 NGACVF----PEQCHCSPGFQFVNETYCE--PYC----ENCQ-HGTCTAP---SVCECESGFVHNNETNACERICE----  114 (273)
Q Consensus        53 ~g~C~~----~~~C~C~~Gy~g~~~~~C~--~~C----~~C~-~G~C~~~---~~C~C~~Gy~g~~~~~~C~~~C~----  114 (273)
                      ++.|..    .++|.|..||.|+ ...|.  .+|    +.|. |..|++.   ..|.|..||.-.+....|.+.=.    
T Consensus       705 ~a~C~pg~~~~~tcecs~g~~gd-gr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~  783 (1289)
T KOG1214|consen  705 TARCHPGTGVDYTCECSSGYQGD-GRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPA  783 (1289)
T ss_pred             CccccCCCCcceEEEEeeccCCC-CCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCC
Confidence            445543    3699999999993 35565  233    5565 7889863   48999999977655566743211    


Q ss_pred             CCCCCCCC-CeecCCCCceeeCCCCCcCCCCCCCCcccccCCCCeeeCCCCCCCC--CCCC--CC-CCCeec---CCCee
Q psy8491         115 EPKPPLYP-GLGTIPNTERTCLDNKCHCSEGDSNPYCIIQCDVGYTLNPRTLYCE--PECF--NC-TRGYCL---EPNEC  185 (273)
Q Consensus       115 ~~C~~~~~-g~C~~~~~~C~C~~G~C~C~~~C~ng~C~~~C~~Gy~g~~c~~~C~--~~C~--~C-~nG~C~---~~~~C  185 (273)
                      ++|..  + ..|..... +.|...      .=..-.|  .|.|||.|+.  ..|.  ++|+  .| .++.|.   +++.|
T Consensus       784 n~Ce~--g~h~C~i~g~-a~c~~h------Ggs~y~C--~CLPGfsGDG--~~c~dvDeC~psrChp~A~CyntpgsfsC  850 (1289)
T KOG1214|consen  784 NPCED--GSHTCAIAGQ-ARCVHH------GGSTYSC--ACLPGFSGDG--HQCTDVDECSPSRCHPAATCYNTPGSFSC  850 (1289)
T ss_pred             Ccccc--CccccCcCCc-eEEEec------CCceEEE--eecCCccCCc--cccccccccCccccCCCceEecCCCccee
Confidence            23433  1 12322222 332210      0000145  8999999983  4443  4665  56 367886   35899


Q ss_pred             ECCCCceeCCCCcccc
Q psy8491         186 SCPYNFHLEGGMCMIA  201 (273)
Q Consensus       186 ~C~~Gy~G~~~~C~~~  201 (273)
                      .|.+||.|+|.+|.+.
T Consensus       851 ~C~pGy~GDGf~CVP~  866 (1289)
T KOG1214|consen  851 RCQPGYYGDGFQCVPD  866 (1289)
T ss_pred             ecccCccCCCceecCC
Confidence            9999999999999886


No 4  
>KOG1214|consensus
Probab=97.67  E-value=5.9e-05  Score=76.29  Aligned_cols=91  Identities=26%  Similarity=0.667  Sum_probs=66.9

Q ss_pred             cccccCCCCceeeeCCCCeeCC-CCCCCCC--------CC---CCCC-CC--EEcC----CCeEEcCCCCeeCCCCCcc-
Q psy8491          18 STSMSQDLRQAVICRFDLKFVN-ASYCAPV--------CS---SPCI-NG--ACVF----PEQCHCSPGFQFVNETYCE-   77 (273)
Q Consensus        18 ~~~~~~~~~~~C~C~~Gy~~~~-~~~C~~~--------C~---~~C~-~g--~C~~----~~~C~C~~Gy~g~~~~~C~-   77 (273)
                      .++++-++++.|+|..||.+.+ +..|+++        |+   ..|. +|  +|+.    .|.|+|.|||.|+. ..|. 
T Consensus       748 s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG-~~c~d  826 (1289)
T KOG1214|consen  748 SVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG-HQCTD  826 (1289)
T ss_pred             ceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCc-ccccc
Confidence            5777889999999999999865 5888644        22   2455 44  4442    58999999999943 5555 


Q ss_pred             -cCC--CCCC-CceecCC---CceeeCCCcccCCCCCcccc
Q psy8491          78 -PYC--ENCQ-HGTCTAP---SVCECESGFVHNNETNACER  111 (273)
Q Consensus        78 -~~C--~~C~-~G~C~~~---~~C~C~~Gy~g~~~~~~C~~  111 (273)
                       ++|  +.|. ++.|.+.   ..|.|.+||.|++.  .|.|
T Consensus       827 vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf--~CVP  865 (1289)
T KOG1214|consen  827 VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF--QCVP  865 (1289)
T ss_pred             ccccCccccCCCceEecCCCcceeecccCccCCCc--eecC
Confidence             567  6776 6899753   48999999999753  6654


No 5  
>KOG1226|consensus
Probab=97.63  E-value=0.00028  Score=71.16  Aligned_cols=118  Identities=22%  Similarity=0.490  Sum_probs=71.4

Q ss_pred             CCceeeeCCCCeeCCCCCC----------CCCCC-----CCCC-CCEEcCCCeEEcCCCCee-CCCCCcc-c--CC----
Q psy8491          25 LRQAVICRFDLKFVNASYC----------APVCS-----SPCI-NGACVFPEQCHCSPGFQF-VNETYCE-P--YC----   80 (273)
Q Consensus        25 ~~~~C~C~~Gy~~~~~~~C----------~~~C~-----~~C~-~g~C~~~~~C~C~~Gy~g-~~~~~C~-~--~C----   80 (273)
                      ..+.|.|.+||.|.-. +|          ...|+     ++|. +|.|+= |+|+|.+...+ ..+..|+ +  .|    
T Consensus       476 ~CG~C~C~~G~~G~~C-EC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDnfsC~r~~  553 (783)
T KOG1226|consen  476 VCGQCRCDEGWLGKKC-ECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDNFSCERHK  553 (783)
T ss_pred             EecceecCCCCCCCcc-cCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccCccccccc
Confidence            4678999999997421 11          01222     3576 788875 89999888873 2225555 1  12    


Q ss_pred             -CCCC-CceecCCCceeeCCCcccCCCCCccccccCCCCCCCCCCeecCCCCceeeCCCCCcCCCCCCCCcccccCCCC-
Q psy8491          81 -ENCQ-HGTCTAPSVCECESGFVHNNETNACERICEEPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSNPYCIIQCDVG-  157 (273)
Q Consensus        81 -~~C~-~G~C~~~~~C~C~~Gy~g~~~~~~C~~~C~~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~ng~C~~~C~~G-  157 (273)
                       ..|. ||+|.- +.|+|.+||+|+.+  .|. .=...|...++..|...+. |.|.  .           |  .|... 
T Consensus       554 g~lC~g~G~C~C-G~CvC~~GwtG~~C--~C~-~std~C~~~~G~iCSGrG~-C~Cg--~-----------C--~C~~~~  613 (783)
T KOG1226|consen  554 GVLCGGHGRCEC-GRCVCNPGWTGSAC--NCP-LSTDTCESSDGQICSGRGT-CECG--R-----------C--KCTDPP  613 (783)
T ss_pred             CcccCCCCeEeC-CcEEcCCCCccCCC--CCC-CCCccccCCCCceeCCCce-eeCC--c-----------e--EcCCCC
Confidence             2354 688744 78999999999644  221 1112455555666666655 5544  2           2  45444 


Q ss_pred             eeeCCCC
Q psy8491         158 YTLNPRT  164 (273)
Q Consensus       158 y~g~~c~  164 (273)
                      |+|..|+
T Consensus       614 ~sG~~CE  620 (783)
T KOG1226|consen  614 YSGEFCE  620 (783)
T ss_pred             cCcchhh
Confidence            8998776


No 6  
>KOG4289|consensus
Probab=97.54  E-value=6.9e-05  Score=79.25  Aligned_cols=75  Identities=31%  Similarity=0.617  Sum_probs=54.5

Q ss_pred             cCCCCceeeeCCCCeeCCCCCCCCCC-CCCCC-CCEEcC---CCeEEcCCCCeeCCCCCcc-----cCC--CCCCC-cee
Q psy8491          22 SQDLRQAVICRFDLKFVNASYCAPVC-SSPCI-NGACVF---PEQCHCSPGFQFVNETYCE-----PYC--ENCQH-GTC   88 (273)
Q Consensus        22 ~~~~~~~C~C~~Gy~~~~~~~C~~~C-~~~C~-~g~C~~---~~~C~C~~Gy~g~~~~~C~-----~~C--~~C~~-G~C   88 (273)
                      .+.....|.|+|||.|+..+.=...| +.+|+ ||.|.+   .|+|+|.+||+|   ..|+     ..|  ..|.| |+|
T Consensus      1217 ~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tG---ehCEvs~~agrCvpGvC~nggtC 1293 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTG---EHCEVSARAGRCVPGVCKNGGTC 1293 (2531)
T ss_pred             cccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccc---cceeeecccCccccceecCCCEE
Confidence            45578899999999986433322344 35788 899986   689999999999   7777     234  45665 677


Q ss_pred             cCC----CceeeCCC
Q psy8491          89 TAP----SVCECESG   99 (273)
Q Consensus        89 ~~~----~~C~C~~G   99 (273)
                      ++.    ..|+|+.|
T Consensus      1294 ~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1294 VNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             eecCCCceeccCCCc
Confidence            642    37889988


No 7  
>KOG4260|consensus
Probab=97.53  E-value=7.1e-05  Score=67.17  Aligned_cols=148  Identities=22%  Similarity=0.542  Sum_probs=82.7

Q ss_pred             eeCCCCeeCCCCCCCCCCCCCCC-CCEEcC------CCeEEcCCCCeeCCCCCcc------------cCCCCCC---Cce
Q psy8491          30 ICRFDLKFVNASYCAPVCSSPCI-NGACVF------PEQCHCSPGFQFVNETYCE------------PYCENCQ---HGT   87 (273)
Q Consensus        30 ~C~~Gy~~~~~~~C~~~C~~~C~-~g~C~~------~~~C~C~~Gy~g~~~~~C~------------~~C~~C~---~G~   87 (273)
                      -|++|-+|++...|...-+.+|. +|.|..      ++.|.|.+||+|..-.+|.            .+|..|.   -|+
T Consensus       131 CCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~  210 (350)
T KOG4260|consen  131 CCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGV  210 (350)
T ss_pred             ccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcc
Confidence            48999999888887655556787 888874      5799999999994334443            1343343   135


Q ss_pred             ecCC--Ccee-eCCCcccCCCCCccccccCCCCCCCCCCeecCCCCceeeCCCCCcCCCCCCC--C--cccccCCCCeee
Q psy8491          88 CTAP--SVCE-CESGFVHNNETNACERICEEPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSN--P--YCIIQCDVGYTL  160 (273)
Q Consensus        88 C~~~--~~C~-C~~Gy~g~~~~~~C~~~C~~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~n--g--~C~~~C~~Gy~g  160 (273)
                      |...  ..|. |..||..+.  ..|                +..+. |.=.+..|.-.+.|.|  |  .|  ..++||.+
T Consensus       211 Csg~~~k~C~kCkkGW~lde--~gC----------------vDvnE-C~~ep~~c~~~qfCvNteGSf~C--~dk~Gy~~  269 (350)
T KOG4260|consen  211 CSGESSKGCSKCKKGWKLDE--EGC----------------VDVNE-CQNEPAPCKAHQFCVNTEGSFKC--EDKEGYKK  269 (350)
T ss_pred             cCCCCCCChhhhcccceecc--ccc----------------ccHHH-HhcCCCCCChhheeecCCCceEe--cccccccC
Confidence            6543  2564 999999852  223                22211 1100000100122333  2  34  56788887


Q ss_pred             CCCC-CCCCCCCC-CCCCCeec---CCCeeECCCCceeCCCCccc
Q psy8491         161 NPRT-LYCEPECF-NCTRGYCL---EPNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       161 ~~c~-~~C~~~C~-~C~nG~C~---~~~~C~C~~Gy~G~~~~C~~  200 (273)
                      ...+ ..|.+.|. .  |+.|.   +.++|+|..|+.-....|..
T Consensus       270 g~d~C~~~~d~~~~k--n~~c~ni~~~~r~v~f~~~~~~~g~cV~  312 (350)
T KOG4260|consen  270 GVDECQFCADVCASK--NRPCMNIDGQYRCVCFSGLIIIEGFCVW  312 (350)
T ss_pred             ChHHhhhhhhhcccC--CCCcccCCccEEEEecccceeeeeeeec
Confidence            4211 12223333 1  45554   35789999988764455554


No 8  
>KOG1226|consensus
Probab=97.44  E-value=0.00075  Score=68.14  Aligned_cols=47  Identities=17%  Similarity=0.412  Sum_probs=29.0

Q ss_pred             cCCCCeeeCCCCCC-CCCCCCCCCC-CeecCC-----CeeECCCC-ceeCCCCccccC
Q psy8491         153 QCDVGYTLNPRTLY-CEPECFNCTR-GYCLEP-----NECSCPYN-FHLEGGMCMIAE  202 (273)
Q Consensus       153 ~C~~Gy~g~~c~~~-C~~~C~~C~n-G~C~~~-----~~C~C~~G-y~G~~~~C~~~~  202 (273)
                      .|.+||+|..|+-. -.+.|.. .+ +.|.+-     ++|+|... |.|  ..|++-+
T Consensus       569 vC~~GwtG~~C~C~~std~C~~-~~G~iCSGrG~C~Cg~C~C~~~~~sG--~~CE~cp  623 (783)
T KOG1226|consen  569 VCNPGWTGSACNCPLSTDTCES-SDGQICSGRGTCECGRCKCTDPPYSG--EFCEKCP  623 (783)
T ss_pred             EcCCCCccCCCCCCCCCccccC-CCCceeCCCceeeCCceEcCCCCcCc--chhhcCC
Confidence            89999999988621 1122320 02 255543     56777665 899  8888753


No 9  
>KOG1219|consensus
Probab=97.36  E-value=0.00028  Score=77.68  Aligned_cols=94  Identities=30%  Similarity=0.611  Sum_probs=67.9

Q ss_pred             CCCCC-ceecCC----CceeeCCCcccCCCCCccc---cccC-CCCCCCCCCeecCCCCceeeCCCCCcCCCCCCCCccc
Q psy8491          81 ENCQH-GTCTAP----SVCECESGFVHNNETNACE---RICE-EPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSNPYCI  151 (273)
Q Consensus        81 ~~C~~-G~C~~~----~~C~C~~Gy~g~~~~~~C~---~~C~-~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~ng~C~  151 (273)
                      .+|+| |.|+..    ++|.|++-|.|.    .|+   ..|. .||..  ||+|+..+.                ...| 
T Consensus      3870 npCqhgG~C~~~~~ggy~CkCpsqysG~----~CEi~~epC~snPC~~--GgtCip~~n----------------~f~C- 3926 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGN----HCEIDLEPCASNPCLT--GGTCIPFYN----------------GFLC- 3926 (4289)
T ss_pred             CcccCCCEecCCCCCceEEeCcccccCc----ccccccccccCCCCCC--CCEEEecCC----------------CeeE-
Confidence            57888 789642    489999999995    553   2343 47876  888876543                0134 


Q ss_pred             ccCCCCeeeCCCCCCCCCCCC--CCCC-CeecC---CCeeECCCCceeCCCCccc
Q psy8491         152 IQCDVGYTLNPRTLYCEPECF--NCTR-GYCLE---PNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       152 ~~C~~Gy~g~~c~~~C~~~C~--~C~n-G~C~~---~~~C~C~~Gy~G~~~~C~~  200 (273)
                       .|+.||+|..|+..=+.+|.  .|.+ |.|+.   ++.|.|.+||.|  ..|..
T Consensus      3927 -nC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~g--r~c~~ 3978 (4289)
T KOG1219|consen 3927 -NCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILG--RTCCA 3978 (4289)
T ss_pred             -eCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhc--ccCcc
Confidence             78899999988855455676  7866 79974   479999999999  76644


No 10 
>KOG1217|consensus
Probab=97.16  E-value=0.0043  Score=58.96  Aligned_cols=156  Identities=26%  Similarity=0.585  Sum_probs=90.1

Q ss_pred             cCCCCceeeeCCCCeeCCCCC---CCCCCCCCCCCCEEcC------CCeEEcCCCCeeCCCCCcc---cCC----CCCCC
Q psy8491          22 SQDLRQAVICRFDLKFVNASY---CAPVCSSPCINGACVF------PEQCHCSPGFQFVNETYCE---PYC----ENCQH   85 (273)
Q Consensus        22 ~~~~~~~C~C~~Gy~~~~~~~---C~~~C~~~C~~g~C~~------~~~C~C~~Gy~g~~~~~C~---~~C----~~C~~   85 (273)
                      .....+.|.|++||.+.....   |...-...+.++.|..      .+.|.|.+||.+   ..+.   +.|    ..|.+
T Consensus       105 ~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~---~~~~~~~~~C~~~~~~c~~  181 (487)
T KOG1217|consen  105 DCVGSYECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEG---EPCETDLDECIQYSSPCQN  181 (487)
T ss_pred             CCCCCceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCccc---ccccccccccccCCCCcCC
Confidence            456788999999999865333   4332110123556653      468999999999   3333   234    24665


Q ss_pred             -ceecCC---CceeeCCCcccCCCCCccccccCCCCCCCCCCeecCCCCceeeCCC----CC-----cCC---CCCCC--
Q psy8491          86 -GTCTAP---SVCECESGFVHNNETNACERICEEPKPPLYPGLGTIPNTERTCLDN----KC-----HCS---EGDSN--  147 (273)
Q Consensus        86 -G~C~~~---~~C~C~~Gy~g~~~~~~C~~~C~~~C~~~~~g~C~~~~~~C~C~~G----~C-----~C~---~~C~n--  147 (273)
                       +.|.+.   ..|.|.+||.+......           .+++.|+.... |.+.+|    .|     .+.   ..|.+  
T Consensus       182 ~~~C~~~~~~~~C~c~~~~~~~~~~~~-----------~~~~~c~~~~~-~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~  249 (487)
T KOG1217|consen  182 GGTCVNTGGSYLCSCPPGYTGSTCETT-----------GNGGTCVDSVA-CSCPPGARGPECEVSIVECASGDGTCVNTV  249 (487)
T ss_pred             CcccccCCCCeeEeCCCCccCCcCcCC-----------CCCceEeccee-ccCCCCCCCCCcccccccccCCCCcccccC
Confidence             678664   37999999998532111           11344443322 444433    22     111   22332  


Q ss_pred             --CcccccCCCCeeeCCCCCCC--CCCCC---CCCC-CeecCC---CeeECCCCceeCC
Q psy8491         148 --PYCIIQCDVGYTLNPRTLYC--EPECF---NCTR-GYCLEP---NECSCPYNFHLEG  195 (273)
Q Consensus       148 --g~C~~~C~~Gy~g~~c~~~C--~~~C~---~C~n-G~C~~~---~~C~C~~Gy~G~~  195 (273)
                        ..|  .|++||++..+ ..+  ..+|.   .|.+ ++|+..   +.|.|++||.|..
T Consensus       250 ~~~~C--~~~~g~~~~~~-~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~  305 (487)
T KOG1217|consen  250 GSYTC--RCPEGYTGDAC-VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRL  305 (487)
T ss_pred             Cceee--eCCCCcccccc-ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCC
Confidence              145  78999999862 011  12333   2544 788754   8999999999943


No 11 
>KOG4289|consensus
Probab=96.94  E-value=0.00081  Score=71.60  Aligned_cols=73  Identities=27%  Similarity=0.585  Sum_probs=51.2

Q ss_pred             CeEEcCCCCeeCCCCCcc---cCC--CCCC-CceecC---CCceeeCCCcccCCCCCcc-ccccCC-CCCCCCCCeecCC
Q psy8491          60 EQCHCSPGFQFVNETYCE---PYC--ENCQ-HGTCTA---PSVCECESGFVHNNETNAC-ERICEE-PKPPLYPGLGTIP  128 (273)
Q Consensus        60 ~~C~C~~Gy~g~~~~~C~---~~C--~~C~-~G~C~~---~~~C~C~~Gy~g~~~~~~C-~~~C~~-~C~~~~~g~C~~~  128 (273)
                      .+|.|+|||+|   ..|+   +.|  .+|. ||+|.+   .++|.|.+||+|..+...- .-.|.+ .|.+  +|+|+..
T Consensus      1222 lrCrCPpGFTg---d~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~n--ggtC~~~ 1296 (2531)
T KOG4289|consen 1222 LRCRCPPGFTG---DYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKN--GGTCVNL 1296 (2531)
T ss_pred             eeEeCCCCCCc---ccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecC--CCEEeec
Confidence            58999999999   7898   567  6887 599964   4699999999995321110 023433 4666  7888653


Q ss_pred             C---CceeeCCC
Q psy8491         129 N---TERTCLDN  137 (273)
Q Consensus       129 ~---~~C~C~~G  137 (273)
                      .   ..|.|++|
T Consensus      1297 ~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYG 1308 (2531)
T ss_pred             CCCceeccCCCc
Confidence            2   23899998


No 12 
>KOG1219|consensus
Probab=96.70  E-value=0.0023  Score=70.91  Aligned_cols=80  Identities=26%  Similarity=0.676  Sum_probs=57.8

Q ss_pred             CCCC-CCEEcC----CCeEEcCCCCeeCCCCCcc---cCC--CCCCC-ceecC---CCceeeCCCcccCCCCCccc----
Q psy8491          49 SPCI-NGACVF----PEQCHCSPGFQFVNETYCE---PYC--ENCQH-GTCTA---PSVCECESGFVHNNETNACE----  110 (273)
Q Consensus        49 ~~C~-~g~C~~----~~~C~C~~Gy~g~~~~~C~---~~C--~~C~~-G~C~~---~~~C~C~~Gy~g~~~~~~C~----  110 (273)
                      .+|. .|.|..    .|.|.|++-|.|   ..|+   ..|  .+|.+ |+|+-   .+.|.|+.||+|.    .|+    
T Consensus      3870 npCqhgG~C~~~~~ggy~CkCpsqysG---~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~----~Ce~~Gi 3942 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSG---NHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK----RCEARGI 3942 (4289)
T ss_pred             CcccCCCEecCCCCCceEEeCcccccC---cccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc----eeecccc
Confidence            4677 678986    479999999999   8888   346  67886 78963   4589999999995    453    


Q ss_pred             cccC-CCCCCCCCCeecCCCC--ceeeCCC
Q psy8491         111 RICE-EPKPPLYPGLGTIPNT--ERTCLDN  137 (273)
Q Consensus       111 ~~C~-~~C~~~~~g~C~~~~~--~C~C~~G  137 (273)
                      ..|+ +.|.+  +|.|++..+  .|.|.+|
T Consensus      3943 ~eCs~n~C~~--gg~C~n~~gsf~CncT~g 3970 (4289)
T KOG1219|consen 3943 SECSKNVCGT--GGQCINIPGSFHCNCTPG 3970 (4289)
T ss_pred             cccccccccC--CceeeccCCceEeccChh
Confidence            3366 36777  788876543  3655555


No 13 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.16  E-value=0.0073  Score=37.01  Aligned_cols=20  Identities=40%  Similarity=1.114  Sum_probs=13.5

Q ss_pred             CC-CCEEcCC-CeEEcCCCCee
Q psy8491          51 CI-NGACVFP-EQCHCSPGFQF   70 (273)
Q Consensus        51 C~-~g~C~~~-~~C~C~~Gy~g   70 (273)
                      |. +|+|+.+ ++|+|.+||+|
T Consensus         8 C~~~G~C~~~~g~C~C~~g~~G   29 (32)
T PF07974_consen    8 CSGHGTCVSPCGRCVCDSGYTG   29 (32)
T ss_pred             cCCCCEEeCCCCEEECCCCCcC
Confidence            54 6777754 67777777776


No 14 
>KOG1217|consensus
Probab=96.14  E-value=0.062  Score=50.96  Aligned_cols=155  Identities=25%  Similarity=0.543  Sum_probs=84.7

Q ss_pred             ccccCCCCceeeeCCCCeeCCCCC------CC-------------CCCCC---CCC-C-CEEcC---CCeEEcCCCCeeC
Q psy8491          19 TSMSQDLRQAVICRFDLKFVNASY------CA-------------PVCSS---PCI-N-GACVF---PEQCHCSPGFQFV   71 (273)
Q Consensus        19 ~~~~~~~~~~C~C~~Gy~~~~~~~------C~-------------~~C~~---~C~-~-g~C~~---~~~C~C~~Gy~g~   71 (273)
                      .+.+...++.|.|++||.+.....      |.             +.|..   .|. . +.|.+   .++|.|.+||.+.
T Consensus       184 ~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~  263 (487)
T KOG1217|consen  184 TCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGD  263 (487)
T ss_pred             ccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCcccc
Confidence            455666779999999998754221      21             11111   122 2 56665   3689999999982


Q ss_pred             CCCCcc--cCC--C-CCCC-ceecCC---CceeeCCCcccCCCCCccc--cccCC-----CCCCCCCCeecCCCCceeeC
Q psy8491          72 NETYCE--PYC--E-NCQH-GTCTAP---SVCECESGFVHNNETNACE--RICEE-----PKPPLYPGLGTIPNTERTCL  135 (273)
Q Consensus        72 ~~~~C~--~~C--~-~C~~-G~C~~~---~~C~C~~Gy~g~~~~~~C~--~~C~~-----~C~~~~~g~C~~~~~~C~C~  135 (273)
                      ....+.  ..|  . .|.+ ++|+..   ..|.|++||.|..+ ..+.  ..|..     .|.+  ++.|.....     
T Consensus       264 ~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~--g~~C~~~~~-----  335 (487)
T KOG1217|consen  264 ACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCAN--GGTCNTLGS-----  335 (487)
T ss_pred             ccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCC--CcccccCCC-----
Confidence            201122  334  2 2664 788754   58999999998643 2221  12210     1222  122210000     


Q ss_pred             CCCCcCCCCCCCCcccccCCCCeeeCCCCCCCCCCCC--CCC-CCeecC----CCeeECCCCcee
Q psy8491         136 DNKCHCSEGDSNPYCIIQCDVGYTLNPRTLYCEPECF--NCT-RGYCLE----PNECSCPYNFHL  193 (273)
Q Consensus       136 ~G~C~C~~~C~ng~C~~~C~~Gy~g~~c~~~C~~~C~--~C~-nG~C~~----~~~C~C~~Gy~G  193 (273)
                               .....|  .|.+||.|..|+... ++|.  ++. .+.|..    .+.|.|+.+|.+
T Consensus       336 ---------~~~~~C--~c~~~~~g~~C~~~~-~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~  388 (487)
T KOG1217|consen  336 ---------FGGFRC--ACGPGFTGRRCEDSN-DECASSPCCPGGTCVNETPGSYRCACPAGFAG  388 (487)
T ss_pred             ---------CCCCCc--CCCCCCCCCccccCC-ccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence                     000124  777888887766331 2343  343 478875    478999999987


No 15 
>KOG0994|consensus
Probab=96.02  E-value=0.021  Score=60.14  Aligned_cols=44  Identities=34%  Similarity=0.826  Sum_probs=27.1

Q ss_pred             cCCCCeeeCCCC---CCCCCCCCCCCC----C-----eecC-----CCeeECCCCceeCCCCccc
Q psy8491         153 QCDVGYTLNPRT---LYCEPECFNCTR----G-----YCLE-----PNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       153 ~C~~Gy~g~~c~---~~C~~~C~~C~n----G-----~C~~-----~~~C~C~~Gy~G~~~~C~~  200 (273)
                      .|..||+|++--   ..|.| | +|..    |     +|.-     .-.|.|.+||+|  .+|+.
T Consensus       889 rCl~GyyGdP~lg~g~~CrP-C-pCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G--~RCe~  949 (1758)
T KOG0994|consen  889 RCLDGYYGDPRLGSGIGCRP-C-PCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSG--SRCEI  949 (1758)
T ss_pred             hhhccccCCcccCCCCCCCC-C-CCCCCCccchhccccccccccccceeeecccCccc--cchhh
Confidence            899999998532   23333 1 1211    1     3421     237999999999  77765


No 16 
>smart00051 DSL delta serrate ligand.
Probab=95.60  E-value=0.011  Score=41.87  Aligned_cols=43  Identities=21%  Similarity=0.345  Sum_probs=32.8

Q ss_pred             cCCCCeeeCCCCCCCCCCCCCCC-CCeecCCCeeECCCCceeCCCCc
Q psy8491         153 QCDVGYTLNPRTLYCEPECFNCT-RGYCLEPNECSCPYNFHLEGGMC  198 (273)
Q Consensus       153 ~C~~Gy~g~~c~~~C~~~C~~C~-nG~C~~~~~C~C~~Gy~G~~~~C  198 (273)
                      .|.++|+|..|...|.+.-. .. +.+|...+.++|.+||+|  ..|
T Consensus        20 ~C~~~~yG~~C~~~C~~~~d-~~~~~~Cd~~G~~~C~~Gw~G--~~C   63 (63)
T smart00051       20 TCDENYYGEGCNKFCRPRDD-FFGHYTCDENGNKGCLEGWMG--PYC   63 (63)
T ss_pred             eCCCCCcCCccCCEeCcCcc-ccCCccCCcCCCEecCCCCcC--CCC
Confidence            88999999988877765321 22 358888899999999999  544


No 17 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=94.95  E-value=0.01  Score=50.77  Aligned_cols=93  Identities=23%  Similarity=0.673  Sum_probs=59.6

Q ss_pred             ccccccc-cCCCCceeeeCCCCeeCCCCCCCCC--CC------CCCC-CCEEcC--------CCeEEcCCCCeeCCCCCc
Q psy8491          15 VIQSTSM-SQDLRQAVICRFDLKFVNASYCAPV--CS------SPCI-NGACVF--------PEQCHCSPGFQFVNETYC   76 (273)
Q Consensus        15 ~~~~~~~-~~~~~~~C~C~~Gy~~~~~~~C~~~--C~------~~C~-~g~C~~--------~~~C~C~~Gy~g~~~~~C   76 (273)
                      .||+.-+ ....-+.|.|.+||.....+.|.+.  |.      .+|+ .+.|+.        .+.|.|.+||+. ....|
T Consensus         7 ~CKNG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~-~~~vC   85 (197)
T PF06247_consen    7 ICKNGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL-KQGVC   85 (197)
T ss_dssp             --BTEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE-SSSSE
T ss_pred             cccCCEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCcee-eCCeE
Confidence            3454433 4445689999999998888888642  32      2466 578864        368999999998 33678


Q ss_pred             c-cCC--CCCCCceecC-C-----CceeeCCCcccCCCCCcc
Q psy8491          77 E-PYC--ENCQHGTCTA-P-----SVCECESGFVHNNETNAC  109 (273)
Q Consensus        77 ~-~~C--~~C~~G~C~~-~-----~~C~C~~Gy~g~~~~~~C  109 (273)
                      . ..|  ..|..|.|+. +     ..|+|.-|+.- +....|
T Consensus        86 vp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~-~dn~kC  126 (197)
T PF06247_consen   86 VPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKVP-DDNKKC  126 (197)
T ss_dssp             EEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEET-TTTTES
T ss_pred             chhhcCceecCCCeEEecCCCCCCceeEeeeceEe-ccCCcc
Confidence            8 457  5788899962 1     28999999983 344556


No 18 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.69  E-value=0.033  Score=34.07  Aligned_cols=21  Identities=29%  Similarity=0.861  Sum_probs=18.2

Q ss_pred             CCeecCC-CeeECCCCceeCCCCc
Q psy8491         176 RGYCLEP-NECSCPYNFHLEGGMC  198 (273)
Q Consensus       176 nG~C~~~-~~C~C~~Gy~G~~~~C  198 (273)
                      +|+|+.. ++|+|++||+|  ..|
T Consensus        11 ~G~C~~~~g~C~C~~g~~G--~~C   32 (32)
T PF07974_consen   11 HGTCVSPCGRCVCDSGYTG--PDC   32 (32)
T ss_pred             CCEEeCCCCEEECCCCCcC--CCC
Confidence            6999987 99999999999  544


No 19 
>smart00051 DSL delta serrate ligand.
Probab=94.54  E-value=0.034  Score=39.43  Aligned_cols=42  Identities=17%  Similarity=0.276  Sum_probs=24.3

Q ss_pred             CceeeeCCCCeeCCCCCCCCCCCC--CCC-CCEEcCCCeEEcCCCCee
Q psy8491          26 RQAVICRFDLKFVNASYCAPVCSS--PCI-NGACVFPEQCHCSPGFQF   70 (273)
Q Consensus        26 ~~~C~C~~Gy~~~~~~~C~~~C~~--~C~-~g~C~~~~~C~C~~Gy~g   70 (273)
                      .+.-.|.++|+|..   |...|.+  +.. +..|...|.++|.+||+|
T Consensus        16 ~~rv~C~~~~yG~~---C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G   60 (63)
T smart00051       16 QIRVTCDENYYGEG---CNKFCRPRDDFFGHYTCDENGNKGCLEGWMG   60 (63)
T ss_pred             EEEeeCCCCCcCCc---cCCEeCcCccccCCccCCcCCCEecCCCCcC
Confidence            34455777777643   3334432  233 566766677777777777


No 20 
>KOG0994|consensus
Probab=93.80  E-value=0.13  Score=54.50  Aligned_cols=48  Identities=27%  Similarity=0.662  Sum_probs=27.5

Q ss_pred             CcccccCCCCeeeCCCCCCCC--------CCCC--CC-CCC----eec-CCCeeECCCCceeCCCCccc
Q psy8491         148 PYCIIQCDVGYTLNPRTLYCE--------PECF--NC-TRG----YCL-EPNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       148 g~C~~~C~~Gy~g~~c~~~C~--------~~C~--~C-~nG----~C~-~~~~C~C~~Gy~G~~~~C~~  200 (273)
                      |.|  .|+|||-|..|. .|+        ..|.  .| ..|    .|. ..+.|+|.+|-.|  .+|..
T Consensus      1084 GQC--qCkpGfGGR~C~-qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G--~rCdq 1147 (1758)
T KOG0994|consen 1084 GQC--QCKPGFGGRTCS-QCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGG--PRCDQ 1147 (1758)
T ss_pred             cce--eccCCCCCcchh-HHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCC--cchhh
Confidence            566  888888776553 222        1233  22 123    343 3478888888877  66644


No 21 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.67  E-value=0.033  Score=27.18  Aligned_cols=10  Identities=30%  Similarity=0.896  Sum_probs=6.7

Q ss_pred             eeECCCCcee
Q psy8491         184 ECSCPYNFHL  193 (273)
Q Consensus       184 ~C~C~~Gy~G  193 (273)
                      +|+|++||+|
T Consensus         1 ~C~C~~G~~G   10 (13)
T PF12661_consen    1 TCQCPPGWTG   10 (13)
T ss_dssp             EEEE-TTEET
T ss_pred             CccCcCCCcC
Confidence            4777888877


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=93.57  E-value=0.041  Score=34.57  Aligned_cols=23  Identities=22%  Similarity=0.677  Sum_probs=17.7

Q ss_pred             CCeec---CCCeeECCCCceeCCCCc
Q psy8491         176 RGYCL---EPNECSCPYNFHLEGGMC  198 (273)
Q Consensus       176 nG~C~---~~~~C~C~~Gy~G~~~~C  198 (273)
                      +++|.   +.++|+|++||.|+|..|
T Consensus        11 nA~C~~~~~~~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen   11 NATCTNTGGSYTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             TCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred             CcEeecCCCCEEeECCCCCccCCcCC
Confidence            67775   358999999999988765


No 23 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=92.97  E-value=0.044  Score=33.36  Aligned_cols=21  Identities=29%  Similarity=0.920  Sum_probs=17.0

Q ss_pred             CCCC-CeecC----CCeeECCCCcee
Q psy8491         173 NCTR-GYCLE----PNECSCPYNFHL  193 (273)
Q Consensus       173 ~C~n-G~C~~----~~~C~C~~Gy~G  193 (273)
                      +|+| |+|+.    .+.|+|++||+|
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G   30 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTG   30 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCcc
Confidence            5766 79963    479999999999


No 24 
>KOG4260|consensus
Probab=91.44  E-value=0.17  Score=45.88  Aligned_cols=46  Identities=22%  Similarity=0.602  Sum_probs=30.4

Q ss_pred             cCCCCeeeCCCCCCCC--CCCC----CCCC-Ceec---CCCeeECCCCceeCCCCccc
Q psy8491         153 QCDVGYTLNPRTLYCE--PECF----NCTR-GYCL---EPNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       153 ~C~~Gy~g~~c~~~C~--~~C~----~C~n-G~C~---~~~~C~C~~Gy~G~~~~C~~  200 (273)
                      .|+.||..+  +..|.  ++|.    +|.. --|+   ++|+|...+||.+..+.|+.
T Consensus       221 kCkkGW~ld--e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~  276 (350)
T KOG4260|consen  221 KCKKGWKLD--EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQF  276 (350)
T ss_pred             hhcccceec--ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhh
Confidence            889999987  33343  3443    3422 3444   46899999999985556765


No 25 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=90.89  E-value=0.086  Score=32.04  Aligned_cols=21  Identities=38%  Similarity=1.086  Sum_probs=11.1

Q ss_pred             CCC-CCEEcC----CCeEEcCCCCee
Q psy8491          50 PCI-NGACVF----PEQCHCSPGFQF   70 (273)
Q Consensus        50 ~C~-~g~C~~----~~~C~C~~Gy~g   70 (273)
                      +|. +|+|+.    .++|+|.+||+|
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G   30 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTG   30 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCcc
Confidence            344 455543    245666666655


No 26 
>KOG1218|consensus
Probab=90.82  E-value=11  Score=33.94  Aligned_cols=54  Identities=15%  Similarity=0.040  Sum_probs=29.2

Q ss_pred             cCCCCceeeeCCCCeeC-CCCCCCCCCCCCCCCCE-EcCCCeEEcCCCCeeCCCCCcccC
Q psy8491          22 SQDLRQAVICRFDLKFV-NASYCAPVCSSPCINGA-CVFPEQCHCSPGFQFVNETYCEPY   79 (273)
Q Consensus        22 ~~~~~~~C~C~~Gy~~~-~~~~C~~~C~~~C~~g~-C~~~~~C~C~~Gy~g~~~~~C~~~   79 (273)
                      ....+..|.|.++|.+. .... .+........-. -....+|.+..+|.+   ..|.+.
T Consensus        10 ~~~~~~~c~c~~~~~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~c~~~   65 (316)
T KOG1218|consen   10 CLGGSGQCFCDPGYTGRLQCEH-QAVTSACSGICPCEVNSGECGLGYGFVG---SVCRIE   65 (316)
T ss_pred             cCCCCCceecCCCccccccccC-CCCCccccccCCccCCceeEecccccCC---Cccccc
Confidence            34567889999999984 1111 111111111111 123567888899988   445533


No 27 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=90.36  E-value=0.12  Score=32.48  Aligned_cols=20  Identities=35%  Similarity=0.981  Sum_probs=11.4

Q ss_pred             CC-CCEEcC---CCeEEcCCCCee
Q psy8491          51 CI-NGACVF---PEQCHCSPGFQF   70 (273)
Q Consensus        51 C~-~g~C~~---~~~C~C~~Gy~g   70 (273)
                      |. ++.|.+   .++|+|++||+|
T Consensus         8 C~~nA~C~~~~~~~~C~C~~Gy~G   31 (36)
T PF12947_consen    8 CHPNATCTNTGGSYTCTCKPGYEG   31 (36)
T ss_dssp             S-TTCEEEE-TTSEEEEE-CEEEC
T ss_pred             CCCCcEeecCCCCEEeECCCCCcc
Confidence            44 666654   456777777776


No 28 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=90.05  E-value=0.23  Score=28.36  Aligned_cols=10  Identities=50%  Similarity=1.570  Sum_probs=4.4

Q ss_pred             eEEcCCCCee
Q psy8491          61 QCHCSPGFQF   70 (273)
Q Consensus        61 ~C~C~~Gy~g   70 (273)
                      +|+|++||+.
T Consensus         3 ~C~C~~Gy~l   12 (24)
T PF12662_consen    3 TCSCPPGYQL   12 (24)
T ss_pred             EeeCCCCCcC
Confidence            4444444443


No 29 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.62  E-value=0.23  Score=31.22  Aligned_cols=25  Identities=8%  Similarity=0.020  Sum_probs=19.6

Q ss_pred             ccccccccCCCCceeeeCCCCeeCC
Q psy8491          15 VIQSTSMSQDLRQAVICRFDLKFVN   39 (273)
Q Consensus        15 ~~~~~~~~~~~~~~C~C~~Gy~~~~   39 (273)
                      .|.+.++++++++.|.|++||.+.+
T Consensus         7 gC~h~C~~~~g~~~C~C~~Gy~L~~   31 (36)
T PF14670_consen    7 GCSHICVNTPGSYRCSCPPGYKLAE   31 (36)
T ss_dssp             GSSSEEEEETTSEEEE-STTEEE-T
T ss_pred             CcCCCCccCCCceEeECCCCCEECc
Confidence            4778888889999999999998754


No 30 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=88.58  E-value=0.34  Score=27.66  Aligned_cols=21  Identities=10%  Similarity=0.331  Sum_probs=15.4

Q ss_pred             CceeeeCCCCeeC-CCCCCCCC
Q psy8491          26 RQAVICRFDLKFV-NASYCAPV   46 (273)
Q Consensus        26 ~~~C~C~~Gy~~~-~~~~C~~~   46 (273)
                      +|.|.|++||+.. +...|+.+
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DI   22 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDI   22 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccC
Confidence            5899999999974 35566543


No 31 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=87.37  E-value=0.23  Score=35.18  Aligned_cols=44  Identities=18%  Similarity=0.299  Sum_probs=22.8

Q ss_pred             cCCCCeeeCCCCCCCCCCCCCCCCCeecCCCeeECCCCceeCCCCc
Q psy8491         153 QCDVGYTLNPRTLYCEPECFNCTRGYCLEPNECSCPYNFHLEGGMC  198 (273)
Q Consensus       153 ~C~~Gy~g~~c~~~C~~~C~~C~nG~C~~~~~C~C~~Gy~G~~~~C  198 (273)
                      .|.+.|+|..|...|.+.=..=.+=+|...+.=+|.+||+|  ..|
T Consensus        20 ~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G--~~C   63 (63)
T PF01414_consen   20 VCDENYYGPNCSKFCKPRDDSFGHYTCDSNGNKVCLPGWTG--PNC   63 (63)
T ss_dssp             ---TTEETTTT-EE---EEETTEEEEE-SS--EEE-TTEES--TTS
T ss_pred             ECCCCCCCccccCCcCCCcCCcCCcccCCCCCCCCCCCCcC--CCC
Confidence            78899999988777665211100238888889999999999  544


No 32 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=86.34  E-value=0.51  Score=30.28  Aligned_cols=10  Identities=40%  Similarity=1.388  Sum_probs=4.5

Q ss_pred             CeEEcCCCCe
Q psy8491          60 EQCHCSPGFQ   69 (273)
Q Consensus        60 ~~C~C~~Gy~   69 (273)
                      |+|.|++||+
T Consensus        25 y~C~C~~Gy~   34 (42)
T PF07645_consen   25 YSCSCPPGYE   34 (42)
T ss_dssp             EEEEESTTEE
T ss_pred             EEeeCCCCcE
Confidence            3444444444


No 33 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=85.96  E-value=0.81  Score=27.92  Aligned_cols=19  Identities=42%  Similarity=1.249  Sum_probs=10.6

Q ss_pred             CC-CCEEcC---CCeEEcCCCCe
Q psy8491          51 CI-NGACVF---PEQCHCSPGFQ   69 (273)
Q Consensus        51 C~-~g~C~~---~~~C~C~~Gy~   69 (273)
                      |. ++.|++   .+.|.|.+||+
T Consensus        11 C~~~~~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179       11 CQNGGTCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             cCCCCEeECCCCCeEeECCCCCc
Confidence            44 445553   34566666666


No 34 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=85.93  E-value=1.2  Score=27.15  Aligned_cols=18  Identities=11%  Similarity=-0.005  Sum_probs=9.5

Q ss_pred             ccccCCCCceeeeCCCCe
Q psy8491          19 TSMSQDLRQAVICRFDLK   36 (273)
Q Consensus        19 ~~~~~~~~~~C~C~~Gy~   36 (273)
                      +++++.+.+.|.|++||.
T Consensus        16 ~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179       16 TCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             EeECCCCCeEeECCCCCc
Confidence            444555555555555554


No 35 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=85.55  E-value=0.33  Score=31.20  Aligned_cols=20  Identities=10%  Similarity=0.190  Sum_probs=18.4

Q ss_pred             cccccCCCCceeeeCCCCee
Q psy8491          18 STSMSQDLRQAVICRFDLKF   37 (273)
Q Consensus        18 ~~~~~~~~~~~C~C~~Gy~~   37 (273)
                      ..++++.++|.|.|++||+.
T Consensus        16 ~~C~N~~Gsy~C~C~~Gy~~   35 (42)
T PF07645_consen   16 GTCVNTEGSYSCSCPPGYEL   35 (42)
T ss_dssp             SEEEEETTEEEEEESTTEEE
T ss_pred             CEEEcCCCCEEeeCCCCcEE
Confidence            77889999999999999994


No 36 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=84.19  E-value=0.59  Score=40.20  Aligned_cols=82  Identities=21%  Similarity=0.570  Sum_probs=51.6

Q ss_pred             cCCCCceeeeCCCCeeCCCCCCC-CCCCC-CCCCCEEcC------CCeEEcCCCCeeCCCCCcc----cCC-CCCC-Cce
Q psy8491          22 SQDLRQAVICRFDLKFVNASYCA-PVCSS-PCINGACVF------PEQCHCSPGFQFVNETYCE----PYC-ENCQ-HGT   87 (273)
Q Consensus        22 ~~~~~~~C~C~~Gy~~~~~~~C~-~~C~~-~C~~g~C~~------~~~C~C~~Gy~g~~~~~C~----~~C-~~C~-~G~   87 (273)
                      ..+..+.|.|-+||.... ..|. ..|.. .|++|.|+.      ...|+|.=|+..++..+|.    ..| ..|. |..
T Consensus        65 ~~~~~~~C~C~~gY~~~~-~vCvp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~  143 (197)
T PF06247_consen   65 GEERAYKCDCINGYILKQ-GVCVPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEE  143 (197)
T ss_dssp             TSSTSEEEEE-TTEEESS-SSEEEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEE
T ss_pred             ccceeEEEecccCceeeC-CeEchhhcCceecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcc
Confidence            566789999999999754 4665 34553 588999973      2389999999955556776    345 5565 577


Q ss_pred             ecC---CCceeeCCCcccCC
Q psy8491          88 CTA---PSVCECESGFVHNN  104 (273)
Q Consensus        88 C~~---~~~C~C~~Gy~g~~  104 (273)
                      |..   .++|++.+|+.++.
T Consensus       144 CK~~~~~Y~C~~~~~~~~~~  163 (197)
T PF06247_consen  144 CKLVDGYYKCVCKEGFPGDG  163 (197)
T ss_dssp             EEEETTEEEEEE-TT-EEET
T ss_pred             eeeeCcEEEeecCCCCCCCC
Confidence            854   34899999998864


No 37 
>KOG1836|consensus
Probab=83.45  E-value=1.9  Score=48.42  Aligned_cols=48  Identities=27%  Similarity=0.495  Sum_probs=30.0

Q ss_pred             cccccCCCCeeeCCCCCCCCCCCC--CCCC-Ceec-----CCCeeE-CCCCceeCCCCccc
Q psy8491         149 YCIIQCDVGYTLNPRTLYCEPECF--NCTR-GYCL-----EPNECS-CPYNFHLEGGMCMI  200 (273)
Q Consensus       149 ~C~~~C~~Gy~g~~c~~~C~~~C~--~C~n-G~C~-----~~~~C~-C~~Gy~G~~~~C~~  200 (273)
                      .|. +|..||+|..-...-.+ |.  +|-+ +.|.     ....|. ||+||+|  .+|+.
T Consensus       757 ~C~-~C~~GfYg~~~~~~~~d-C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG--~rCe~  813 (1705)
T KOG1836|consen  757 QCA-QCVDGFYGLPDLGTSGD-CQPCPCPNGGACGQTPEILEVVCKNCPPGYTG--LRCEE  813 (1705)
T ss_pred             chh-hhcCCCCCccccCCCCC-CccCCCCCChhhcCcCcccceecCCCCCCCcc--ccccc
Confidence            444 89999999753322111 44  3434 4453     236898 9999999  77765


No 38 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=83.15  E-value=1.4  Score=25.85  Aligned_cols=18  Identities=39%  Similarity=1.001  Sum_probs=10.9

Q ss_pred             CCEEcC---CCeEEcCCCCee
Q psy8491          53 NGACVF---PEQCHCSPGFQF   70 (273)
Q Consensus        53 ~g~C~~---~~~C~C~~Gy~g   70 (273)
                      ++.|++   .++|.|++||.|
T Consensus        11 ~~~C~~~~~~~~C~C~~g~~g   31 (36)
T cd00053          11 GGTCVNTPGSYRCVCPPGYTG   31 (36)
T ss_pred             CCEEecCCCCeEeECCCCCcc
Confidence            455554   356666666666


No 39 
>KOG1218|consensus
Probab=80.76  E-value=13  Score=33.52  Aligned_cols=12  Identities=42%  Similarity=1.198  Sum_probs=6.3

Q ss_pred             CCceeeCCCccc
Q psy8491          91 PSVCECESGFVH  102 (273)
Q Consensus        91 ~~~C~C~~Gy~g  102 (273)
                      .+.|.|.+||.+
T Consensus       161 ~~~c~c~~g~~g  172 (316)
T KOG1218|consen  161 NGICTCQPGFVG  172 (316)
T ss_pred             CCceeccCCccc
Confidence            345555555555


No 40 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=78.86  E-value=0.82  Score=32.35  Aligned_cols=43  Identities=23%  Similarity=0.443  Sum_probs=13.7

Q ss_pred             CCceeeeCCCCeeCCCCCCCCCCCCC--CC-CCEEcCCCeEEcCCCCee
Q psy8491          25 LRQAVICRFDLKFVNASYCAPVCSSP--CI-NGACVFPEQCHCSPGFQF   70 (273)
Q Consensus        25 ~~~~C~C~~Gy~~~~~~~C~~~C~~~--C~-~g~C~~~~~C~C~~Gy~g   70 (273)
                      .+.+-.|.+.|+|..   |...|.+.  =. +.+|...|.-.|.+||+|
T Consensus        15 ~~~rv~C~~nyyG~~---C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G   60 (63)
T PF01414_consen   15 YRIRVVCDENYYGPN---CSKFCKPRDDSFGHYTCDSNGNKVCLPGWTG   60 (63)
T ss_dssp             --------TTEETTT---T-EE---EEETTEEEEE-SS--EEE-TTEES
T ss_pred             EEEEEECCCCCCCcc---ccCCcCCCcCCcCCcccCCCCCCCCCCCCcC
Confidence            344555666666533   22233221  11 455655566666666666


No 41 
>PHA02887 EGF-like protein; Provisional
Probab=78.61  E-value=1.7  Score=34.57  Aligned_cols=27  Identities=22%  Similarity=0.545  Sum_probs=21.5

Q ss_pred             CCCCCeec-----CCCeeECCCCceeCCCCcccc
Q psy8491         173 NCTRGYCL-----EPNECSCPYNFHLEGGMCMIA  201 (273)
Q Consensus       173 ~C~nG~C~-----~~~~C~C~~Gy~G~~~~C~~~  201 (273)
                      -|.||+|.     ....|.|++||+|  .+|+..
T Consensus        93 YCiHG~C~yI~dL~epsCrC~~GYtG--~RCE~v  124 (126)
T PHA02887         93 FCINGECMNIIDLDEKFCICNKGYTG--IRCDEV  124 (126)
T ss_pred             EeeCCEEEccccCCCceeECCCCccc--CCCCcc
Confidence            47789995     2368999999999  888753


No 42 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=77.77  E-value=2.4  Score=25.29  Aligned_cols=18  Identities=39%  Similarity=1.034  Sum_probs=10.6

Q ss_pred             CCEEcC---CCeEEcCCCCee
Q psy8491          53 NGACVF---PEQCHCSPGFQF   70 (273)
Q Consensus        53 ~g~C~~---~~~C~C~~Gy~g   70 (273)
                      ++.|.+   .+.|.|.+||.|
T Consensus        14 ~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054          14 GGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             CCEeECCCCCeEeECCCCCcC
Confidence            445543   346677777766


No 43 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=77.40  E-value=1.9  Score=25.81  Aligned_cols=11  Identities=36%  Similarity=1.047  Sum_probs=5.6

Q ss_pred             CeEEcCCCCee
Q psy8491          60 EQCHCSPGFQF   70 (273)
Q Consensus        60 ~~C~C~~Gy~g   70 (273)
                      ++|.|.+||+|
T Consensus        20 ~~C~C~~g~~g   30 (35)
T smart00181       20 YTCSCPPGYTG   30 (35)
T ss_pred             eEeECCCCCcc
Confidence            44555555544


No 44 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=74.78  E-value=2.2  Score=34.45  Aligned_cols=28  Identities=21%  Similarity=0.546  Sum_probs=22.7

Q ss_pred             CCCCCeec-----CCCeeECCCCceeCCCCccccC
Q psy8491         173 NCTRGYCL-----EPNECSCPYNFHLEGGMCMIAE  202 (273)
Q Consensus       173 ~C~nG~C~-----~~~~C~C~~Gy~G~~~~C~~~~  202 (273)
                      -|.||+|.     ....|.|+.||+|  .+||...
T Consensus        52 YClHG~C~yI~dl~~~~CrC~~GYtG--eRCEh~d   84 (139)
T PHA03099         52 YCLHGDCIHARDIDGMYCRCSHGYTG--IRCQHVV   84 (139)
T ss_pred             EeECCEEEeeccCCCceeECCCCccc--cccccee
Confidence            46688885     3478999999999  9998764


No 45 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=63.20  E-value=17  Score=24.00  Aligned_cols=22  Identities=32%  Similarity=0.924  Sum_probs=16.6

Q ss_pred             CCeecCCCeeECCCCceeCCCCc
Q psy8491         176 RGYCLEPNECSCPYNFHLEGGMC  198 (273)
Q Consensus       176 nG~C~~~~~C~C~~Gy~G~~~~C  198 (273)
                      +..|+ .+.|.|++||+-.+.+|
T Consensus        31 ~s~C~-~g~C~C~~g~~~~~~~C   52 (52)
T PF01683_consen   31 GSVCV-NGRCQCPPGYVEVGGRC   52 (52)
T ss_pred             cCEEc-CCEeECCCCCEecCCCC
Confidence            45774 47999999998766654


No 46 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=61.53  E-value=7.6  Score=24.03  Aligned_cols=16  Identities=6%  Similarity=0.044  Sum_probs=9.8

Q ss_pred             CCceeeeCCCCeeCCC
Q psy8491          25 LRQAVICRFDLKFVNA   40 (273)
Q Consensus        25 ~~~~C~C~~Gy~~~~~   40 (273)
                      ..+.|.|++||..+..
T Consensus        16 ~~~~C~CPeGyIlde~   31 (34)
T PF09064_consen   16 SPGQCFCPEGYILDEG   31 (34)
T ss_pred             CCCceeCCCceEecCC
Confidence            4456777777766543


No 47 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=59.37  E-value=51  Score=34.80  Aligned_cols=20  Identities=30%  Similarity=0.687  Sum_probs=15.3

Q ss_pred             CCeeECCCCceeCCCCcccc
Q psy8491         182 PNECSCPYNFHLEGGMCMIA  201 (273)
Q Consensus       182 ~~~C~C~~Gy~G~~~~C~~~  201 (273)
                      .+.|+|..||...+..|...
T Consensus       750 ~~vC~C~~g~~l~~~~c~~~  769 (800)
T PTZ00214        750 QGVCMCELDAVLTKGVCVPA  769 (800)
T ss_pred             CCeEEeCCcceecCCeeEec
Confidence            36889999998877777654


No 48 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=53.50  E-value=69  Score=30.78  Aligned_cols=18  Identities=22%  Similarity=0.741  Sum_probs=9.0

Q ss_pred             EcCCCCeeCCCCCcccCC
Q psy8491          63 HCSPGFQFVNETYCEPYC   80 (273)
Q Consensus        63 ~C~~Gy~g~~~~~C~~~C   80 (273)
                      .|.++|.+.....|...|
T Consensus        41 ~C~~~~~lt~t~~Ci~~C   58 (397)
T PF03302_consen   41 ECNSGYYLTPTNQCIEDC   58 (397)
T ss_pred             cCCCCCcCCCCCccccCc
Confidence            366666653234454443


No 49 
>PHA02887 EGF-like protein; Provisional
Probab=52.69  E-value=12  Score=29.90  Aligned_cols=24  Identities=38%  Similarity=1.018  Sum_probs=16.7

Q ss_pred             CCCCEEcC-----CCeEEcCCCCeeCCCCCcc
Q psy8491          51 CINGACVF-----PEQCHCSPGFQFVNETYCE   77 (273)
Q Consensus        51 C~~g~C~~-----~~~C~C~~Gy~g~~~~~C~   77 (273)
                      |.||.|.-     ...|.|.+||+|   ..|+
T Consensus        94 CiHG~C~yI~dL~epsCrC~~GYtG---~RCE  122 (126)
T PHA02887         94 CINGECMNIIDLDEKFCICNKGYTG---IRCD  122 (126)
T ss_pred             eeCCEEEccccCCCceeECCCCccc---CCCC
Confidence            55777763     357888888888   5554


No 50 
>PF02363 C_tripleX:  Cysteine rich repeat;  InterPro: IPR003341 This signature describes a cysteine repeat C-X3-C-X3-C the function of which is unknown as is the function of the proteins in which they occur.
Probab=39.16  E-value=14  Score=18.86  Aligned_cols=12  Identities=42%  Similarity=1.140  Sum_probs=6.2

Q ss_pred             CCCCCCCCCCCC
Q psy8491          42 YCAPVCSSPCIN   53 (273)
Q Consensus        42 ~C~~~C~~~C~~   53 (273)
                      .|.|.|...|.+
T Consensus         2 ~C~p~C~~~C~~   13 (17)
T PF02363_consen    2 QCVPQCEPSCEN   13 (17)
T ss_pred             cchhhccCcccc
Confidence            355555555544


No 51 
>KOG1836|consensus
Probab=38.69  E-value=49  Score=37.82  Aligned_cols=16  Identities=19%  Similarity=0.277  Sum_probs=11.8

Q ss_pred             eeeCCCCeeCCCCCCC
Q psy8491          29 VICRFDLKFVNASYCA   44 (273)
Q Consensus        29 C~C~~Gy~~~~~~~C~   44 (273)
                      |.|+.||.|...+.|.
T Consensus       697 c~C~~g~tG~~Ce~C~  712 (1705)
T KOG1836|consen  697 CTCPVGYTGQFCESCA  712 (1705)
T ss_pred             ccCCCCcccchhhhcc
Confidence            8899999876555553


No 52 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=35.64  E-value=33  Score=27.87  Aligned_cols=24  Identities=33%  Similarity=1.013  Sum_probs=14.3

Q ss_pred             CCCCEEcC-----CCeEEcCCCCeeCCCCCcc
Q psy8491          51 CINGACVF-----PEQCHCSPGFQFVNETYCE   77 (273)
Q Consensus        51 C~~g~C~~-----~~~C~C~~Gy~g~~~~~C~   77 (273)
                      |.||.|.-     ...|.|..||+|   .+|+
T Consensus        53 ClHG~C~yI~dl~~~~CrC~~GYtG---eRCE   81 (139)
T PHA03099         53 CLHGDCIHARDIDGMYCRCSHGYTG---IRCQ   81 (139)
T ss_pred             eECCEEEeeccCCCceeECCCCccc---cccc
Confidence            45666642     345777777777   4554


No 53 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=35.62  E-value=16  Score=23.05  Aligned_cols=24  Identities=29%  Similarity=0.859  Sum_probs=16.8

Q ss_pred             CCeec----CCCeeECCCCceeCCCCcc
Q psy8491         176 RGYCL----EPNECSCPYNFHLEGGMCM  199 (273)
Q Consensus       176 nG~C~----~~~~C~C~~Gy~G~~~~C~  199 (273)
                      |+.|.    +..+|.|..||..++..|+
T Consensus        10 NA~C~~~~dG~eecrCllgyk~~~~~C~   37 (37)
T PF12946_consen   10 NAGCFRYDDGSEECRCLLGYKKVGGKCV   37 (37)
T ss_dssp             TEEEEEETTSEEEEEE-TTEEEETTEEE
T ss_pred             CcccEEcCCCCEEEEeeCCccccCCCcC
Confidence            66675    3468999999998777663


No 54 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=32.58  E-value=32  Score=22.94  Aligned_cols=16  Identities=25%  Similarity=0.791  Sum_probs=11.5

Q ss_pred             eeECCCCceeCCC-Ccc
Q psy8491         184 ECSCPYNFHLEGG-MCM  199 (273)
Q Consensus       184 ~C~C~~Gy~G~~~-~C~  199 (273)
                      .|.|++||.-++. .|.
T Consensus        34 gC~C~~G~v~~~~~~CV   50 (55)
T PF01826_consen   34 GCFCPPGYVRNDNGRCV   50 (55)
T ss_dssp             EEEETTTEEEETTSEEE
T ss_pred             cCCCCCCeeEcCCCCEE
Confidence            3999999987544 553


No 55 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=29.95  E-value=39  Score=21.93  Aligned_cols=17  Identities=24%  Similarity=0.477  Sum_probs=12.8

Q ss_pred             CCeeECCCCceeCCCCccc
Q psy8491         182 PNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       182 ~~~C~C~~Gy~G~~~~C~~  200 (273)
                      .+.|.|+++|+|  ..|+.
T Consensus        17 ~G~C~C~~~~~G--~~C~~   33 (46)
T smart00180       17 TGQCECKPNVTG--RRCDR   33 (46)
T ss_pred             CCEEECCCCCCC--CCCCc
Confidence            467888888888  77654


No 56 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=29.77  E-value=23  Score=23.04  Aligned_cols=21  Identities=19%  Similarity=0.357  Sum_probs=13.5

Q ss_pred             eecC-CCeeECCCCceeCCCCccc
Q psy8491         178 YCLE-PNECSCPYNFHLEGGMCMI  200 (273)
Q Consensus       178 ~C~~-~~~C~C~~Gy~G~~~~C~~  200 (273)
                      .|.. .+.|.|+++|+|  ..|+.
T Consensus        12 ~C~~~~G~C~C~~~~~G--~~C~~   33 (49)
T PF00053_consen   12 TCDPSTGQCVCKPGTTG--PRCDQ   33 (49)
T ss_dssp             SEEETCEEESBSTTEES--TTS-E
T ss_pred             cccCCCCEEeccccccC--CcCcC
Confidence            4443 467888888888  76664


No 57 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=29.41  E-value=39  Score=25.93  Aligned_cols=22  Identities=41%  Similarity=0.922  Sum_probs=16.8

Q ss_pred             CCCC-CCEEcC--CCeEEcCCCCee
Q psy8491          49 SPCI-NGACVF--PEQCHCSPGFQF   70 (273)
Q Consensus        49 ~~C~-~g~C~~--~~~C~C~~Gy~g   70 (273)
                      ..|+ +|.|..  ...|.|.+||.-
T Consensus        84 ~~CG~~g~C~~~~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   84 GFCGPNGICNSNNSPKCSCLPGFEP  108 (110)
T ss_pred             cccCCccEeCCCCCCceECCCCcCC
Confidence            4587 899975  347999999963


No 58 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=27.38  E-value=50  Score=21.68  Aligned_cols=14  Identities=7%  Similarity=-0.100  Sum_probs=8.3

Q ss_pred             CCceeeeCCCCeeC
Q psy8491          25 LRQAVICRFDLKFV   38 (273)
Q Consensus        25 ~~~~C~C~~Gy~~~   38 (273)
                      ..+.|.|.++|.|.
T Consensus        17 ~~G~C~C~~~~~G~   30 (50)
T cd00055          17 GTGQCECKPNTTGR   30 (50)
T ss_pred             CCCEEeCCCcCCCC
Confidence            35566666666653


No 59 
>KOG1388|consensus
Probab=26.04  E-value=67  Score=28.34  Aligned_cols=47  Identities=17%  Similarity=0.327  Sum_probs=31.0

Q ss_pred             cCCCCeeeCCCCCCCCCCCC-CCCCCeecCCCeeEC-CCCceeCCCCcccc
Q psy8491         153 QCDVGYTLNPRTLYCEPECF-NCTRGYCLEPNECSC-PYNFHLEGGMCMIA  201 (273)
Q Consensus       153 ~C~~Gy~g~~c~~~C~~~C~-~C~nG~C~~~~~C~C-~~Gy~G~~~~C~~~  201 (273)
                      .|..||+|+..+..|+++.. ....+.....++|.| ..|+.|  +.|..-
T Consensus        80 kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~c~~kgvvg--d~c~~~  128 (217)
T KOG1388|consen   80 KCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCFCTTKGVVG--DLCPKC  128 (217)
T ss_pred             ceEEEEEecCCCCccCHhhhcCCeeeeeccCCccccccceEec--ccCccc
Confidence            68899999755667776422 222344456789999 678888  555443


No 60 
>KOG0196|consensus
Probab=21.58  E-value=96  Score=32.88  Aligned_cols=44  Identities=25%  Similarity=0.782  Sum_probs=25.8

Q ss_pred             CeEEcCCCCee-CCCCCcc-------------cCCCCCC-Cceec--CCCceeeCCCcccC
Q psy8491          60 EQCHCSPGFQF-VNETYCE-------------PYCENCQ-HGTCT--APSVCECESGFVHN  103 (273)
Q Consensus        60 ~~C~C~~Gy~g-~~~~~C~-------------~~C~~C~-~G~C~--~~~~C~C~~Gy~g~  103 (273)
                      |.|.|.+||.. .....|+             ..|..|. |-.=.  +...|.|..||...
T Consensus       259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyyRA  319 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYYRA  319 (996)
T ss_pred             CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcccC
Confidence            78999999986 3334554             1233332 22211  22378999998764


Done!