Query         psy11799
Match_columns 483
No_of_seqs    326 out of 2758
Neff          9.1 
Searched_HMMs 46136
Date          Fri Aug 16 19:17:43 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy11799.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11799hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.7 3.5E-15 7.6E-20  152.0  17.7  190  192-396   693-910 (1289)
  2 KOG1217|consensus               99.5 3.1E-13 6.7E-18  141.0  20.6  270  113-473   155-436 (487)
  3 KOG1217|consensus               99.5 8.1E-13 1.8E-17  137.8  20.2  269  140-474    98-405 (487)
  4 KOG1214|consensus               99.4 4.8E-13   1E-17  136.7  11.4  155  276-468   694-865 (1289)
  5 KOG1219|consensus               99.3 9.5E-12   2E-16  137.6   8.6  109  315-468  3865-3978(4289)
  6 KOG4289|consensus               99.1 1.5E-10 3.4E-15  124.1  10.5   88  289-391  1217-1308(2531)
  7 KOG4289|consensus               99.1 7.3E-10 1.6E-14  119.1  14.1   87  206-301  1216-1308(2531)
  8 KOG0994|consensus               99.0 1.4E-09 3.1E-14  115.0  12.5   73  334-421  1077-1160(1758)
  9 KOG1219|consensus               99.0 5.1E-10 1.1E-14  124.5   8.4  106  275-396  3865-3975(4289)
 10 KOG0994|consensus               98.7 3.8E-07 8.2E-12   97.2  15.1   41   25-72    757-797 (1758)
 11 KOG1225|consensus               98.5 5.9E-07 1.3E-11   91.6  10.5   76  293-396   233-308 (525)
 12 KOG4260|consensus               98.4 3.1E-07 6.7E-12   83.5   5.3  164  112-304   130-305 (350)
 13 KOG4260|consensus               98.4 6.6E-07 1.4E-11   81.4   6.2  156  216-395   132-306 (350)
 14 PF07645 EGF_CA:  Calcium-bindi  98.4 2.8E-07 6.1E-12   61.3   2.8   42  422-466     1-42  (42)
 15 KOG1225|consensus               98.3 1.3E-06 2.8E-11   89.1   7.6  102   31-170   234-341 (525)
 16 PF07645 EGF_CA:  Calcium-bindi  98.3 7.3E-07 1.6E-11   59.3   3.6   40  125-167     1-40  (42)
 17 PF12662 cEGF:  Complement Clr-  98.1 3.2E-06   7E-11   47.9   2.6   24  448-471     1-24  (24)
 18 PF14670 FXa_inhibition:  Coagu  97.7 3.2E-05 6.9E-10   49.0   2.8   33  197-229     4-36  (36)
 19 KOG1226|consensus               97.7 0.00038 8.3E-09   72.9  12.0  146  286-473   467-626 (783)
 20 PF12662 cEGF:  Complement Clr-  97.7 3.8E-05 8.1E-10   43.6   2.3   24  211-234     1-24  (24)
 21 PF12947 EGF_3:  EGF domain;  I  97.6 3.9E-05 8.5E-10   48.7   2.2   29  139-169     8-36  (36)
 22 smart00179 EGF_CA Calcium-bind  97.6 9.8E-05 2.1E-09   47.9   4.1   38  423-467     2-39  (39)
 23 PF00008 EGF:  EGF-like domain   97.6   4E-05 8.7E-10   47.4   1.9   26  435-460     5-31  (32)
 24 PF12947 EGF_3:  EGF domain;  I  97.5 6.2E-05 1.4E-09   47.8   2.0   30  435-466     7-36  (36)
 25 KOG1836|consensus               97.5  0.0023 5.1E-08   74.4  15.6   57  382-459   957-1017(1705)
 26 PF06247 Plasmod_Pvs28:  Plasmo  97.4 0.00019 4.2E-09   62.4   4.5  138  290-464    16-166 (197)
 27 PF00008 EGF:  EGF-like domain   97.2  0.0002 4.2E-09   44.3   2.0   30  317-354     1-31  (32)
 28 cd00054 EGF_CA Calcium-binding  97.1 0.00086 1.9E-08   42.9   4.0   28  435-466    10-37  (38)
 29 PF06247 Plasmod_Pvs28:  Plasmo  97.0  0.0012 2.6E-08   57.6   5.5  138  203-357    11-165 (197)
 30 PF14670 FXa_inhibition:  Coagu  97.0 0.00068 1.5E-08   43.0   2.8   28  142-169     9-36  (36)
 31 smart00179 EGF_CA Calcium-bind  97.0  0.0012 2.5E-08   42.7   4.0   31  314-352     2-33  (39)
 32 KOG1226|consensus               96.9  0.0024 5.2E-08   67.1   7.7   94   57-164   478-578 (783)
 33 cd00053 EGF Epidermal growth f  96.3  0.0066 1.4E-07   38.0   3.9   26  435-460     7-32  (36)
 34 smart00181 EGF Epidermal growt  96.2  0.0073 1.6E-07   38.0   3.8   24  435-459     7-30  (35)
 35 cd00054 EGF_CA Calcium-binding  96.2  0.0075 1.6E-07   38.4   3.9   32  314-353     2-34  (38)
 36 PF12661 hEGF:  Human growth fa  95.9  0.0017 3.7E-08   31.2  -0.2   13   32-44      1-13  (13)
 37 KOG1836|consensus               95.1    0.24 5.2E-06   58.3  12.9   67  192-263   738-808 (1705)
 38 cd00053 EGF Epidermal growth f  95.1    0.04 8.7E-07   34.3   4.0   22  333-354    11-32  (36)
 39 PF07974 EGF_2:  EGF-like domai  94.9   0.032   7E-07   34.3   3.0   26  435-466     7-32  (32)
 40 smart00181 EGF Epidermal growt  94.8    0.05 1.1E-06   34.0   3.8   20  333-353    11-30  (35)
 41 PF07974 EGF_2:  EGF-like domai  94.6   0.048   1E-06   33.6   3.1   23  139-163     8-30  (32)
 42 cd01475 vWA_Matrilin VWA_Matri  94.0    0.06 1.3E-06   50.2   4.0   42  186-227   182-223 (224)
 43 smart00051 DSL delta serrate l  92.9    0.09 1.9E-06   38.1   2.5   40   30-70     16-63  (63)
 44 cd01475 vWA_Matrilin VWA_Matri  91.9    0.18 3.8E-06   47.0   3.9   42  120-166   181-222 (224)
 45 PF12946 EGF_MSP1_1:  MSP1 EGF   89.4    0.23 4.9E-06   31.4   1.4   25  139-163     7-32  (37)
 46 KOG1218|consensus               86.2      29 0.00062   33.7  15.1   15  382-396   160-174 (316)
 47 smart00051 DSL delta serrate l  85.1     1.2 2.5E-05   32.3   3.3   44  294-353    17-60  (63)
 48 PF12946 EGF_MSP1_1:  MSP1 EGF   82.9     1.1 2.5E-05   28.3   2.1   22  332-353     9-31  (37)
 49 KOG3512|consensus               79.2     7.6 0.00016   39.3   7.5  112  333-467   358-478 (592)
 50 PF09064 Tme5_EGF_like:  Thromb  78.1     2.5 5.4E-05   26.1   2.4   25  370-396     6-30  (34)
 51 PF01414 DSL:  Delta serrate li  77.0    0.34 7.4E-06   35.0  -1.9   39   31-70     17-63  (63)
 52 KOG1218|consensus               75.6      62  0.0013   31.3  13.2   16  293-308    48-63  (316)
 53 PHA03099 epidermal growth fact  75.0       3 6.4E-05   34.3   2.8   41  422-469    41-83  (139)
 54 PF00053 Laminin_EGF:  Laminin   72.7     1.1 2.4E-05   30.4  -0.1   21   29-49     16-36  (49)
 55 smart00180 EGF_Lam Laminin-typ  72.0     2.6 5.7E-05   28.2   1.6   20   29-48     16-35  (46)
 56 cd00055 EGF_Lam Laminin-type e  69.6     2.9 6.3E-05   28.6   1.4   20   29-48     17-36  (50)
 57 PHA02887 EGF-like protein; Pro  68.9     5.4 0.00012   32.3   3.0   30  435-469    93-124 (126)
 58 PF00954 S_locus_glycop:  S-loc  57.5     9.2  0.0002   31.0   2.5   32  125-161    76-107 (110)
 59 PHA02887 EGF-like protein; Pro  55.4     5.9 0.00013   32.0   1.0   17   30-46    107-123 (126)
 60 PF01683 EB:  EB module;  Inter  53.0      38 0.00083   23.0   4.8   17  333-353    31-47  (52)
 61 PF01724 DUF29:  Domain of unkn  49.4     7.7 0.00017   33.1   0.9   19    3-21     16-34  (139)
 62 PHA03099 epidermal growth fact  48.1      17 0.00037   30.0   2.6   36  125-163    41-78  (139)
 63 PF00954 S_locus_glycop:  S-loc  42.4      20 0.00043   29.0   2.3   23  435-458    85-107 (110)
 64 PF12955 DUF3844:  Domain of un  39.4      21 0.00045   28.5   1.8   44  424-470     6-63  (103)
 65 PF12955 DUF3844:  Domain of un  38.7      23 0.00051   28.3   2.0   33  127-162     6-43  (103)
 66 PTZ00214 high cysteine membran  36.9 2.5E+02  0.0054   31.5  10.2  120  254-403   647-769 (800)
 67 KOG3516|consensus               35.2      29 0.00063   39.5   2.7   40  420-468   542-582 (1306)
 68 KOG3512|consensus               34.1 1.5E+02  0.0033   30.4   7.1   53  294-353   371-424 (592)
 69 KOG3516|consensus               33.7      34 0.00073   39.0   2.9   41  310-362   541-582 (1306)
 70 KOG0196|consensus               30.4      54  0.0012   36.0   3.6   58  384-458   259-317 (996)
 71 KOG3514|consensus               28.9      35 0.00076   38.4   2.0   35  316-362   625-660 (1591)
 72 PF04863 EGF_alliinase:  Alliin  25.4      24 0.00052   24.5   0.1   25   22-46     26-51  (56)
 73 PTZ00214 high cysteine membran  24.2 3.1E+02  0.0068   30.8   8.3   25  212-236   682-706 (800)
 74 PF01826 TIL:  Trypsin Inhibito  23.4      60  0.0013   22.3   1.8   18  386-403    35-52  (55)

No 1  
>KOG1214|consensus
Probab=99.66  E-value=3.5e-15  Score=152.04  Aligned_cols=190  Identities=32%  Similarity=0.758  Sum_probs=137.8

Q ss_pred             chhccCCCCCC--CeEEeCCC-CeEEeCCCCceeCCCCCcccccCcccCCCCCC--CCeeeeCCCCceEeCCCCceecCC
Q psy11799        192 NECLELSNQCA--FRCHNVPG-SFRCICPYGYALAPDGRHCIDINECEQFPGLC--AHTCTNTEGSYTCGCHPGFELSSD  266 (483)
Q Consensus       192 ~~C~~~~~~C~--~~C~~~~g-s~~C~C~~G~~~~~~g~~C~~~~~C~~~~~~~--~~~C~~~~g~~~C~C~~G~~~~~~  266 (483)
                      +.|....+.|.  +.|...++ .|+|.|..||.  .++++|.++++|+..+..+  +..|++.+++|+|+|..||.+..+
T Consensus       693 npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~--gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd  770 (1289)
T KOG1214|consen  693 NPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQ--GDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADD  770 (1289)
T ss_pred             ccceecCcccCCCccccCCCCcceEEEEeeccC--CCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccC
Confidence            33443334443  45655544 58999999997  4489999999998655444  789999999999999999998876


Q ss_pred             CCCcccc------CccccCCCCccc--c--cc--cCCCCeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCC
Q psy11799        267 GSGTCLD------SDECATGRHICQ--Q--IC--NLLGGYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGG  334 (483)
Q Consensus       267 g~~~C~~------~~~C~~~~~~c~--~--~C--~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g  334 (483)
                      +. +|+.      .+.|..+.+.|.  +  .|  ...+.|.|.|.|||.|+  +..|.++|+|..+.|.+        .+
T Consensus       771 ~~-tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD--G~~c~dvDeC~psrChp--------~A  839 (1289)
T KOG1214|consen  771 RH-TCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGD--GHQCTDVDECSPSRCHP--------AA  839 (1289)
T ss_pred             Cc-ceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCC--ccccccccccCccccCC--------Cc
Confidence            65 7763      467888777775  3  34  45568999999999998  46788999999999999        89


Q ss_pred             EEeeCCCCeeEECCCCCeeccCCCceeeCC---CccCC-----CCCCCC---cEECCCCCeeEeCCCCeeecC
Q psy11799        335 KCFNTAGSYRCGCPDGFQFEAALQLCTQLS---SSCLS-----SPCSFG---CTPLGISGFSCECPSGYQRIG  396 (483)
Q Consensus       335 ~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~---~~C~~-----~~C~~~---c~~~~~~~~~C~C~~Gy~g~~  396 (483)
                      +|.+++++|.|.|.+||.|  +|..|....   ..|..     ..|...   |.+.+.+.+.+.|.++-.|..
T Consensus       840 ~CyntpgsfsC~C~pGy~G--DGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~  910 (1289)
T KOG1214|consen  840 TCYNTPGSFSCRCQPGYYG--DGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGST  910 (1289)
T ss_pred             eEecCCCcceeecccCccC--CCceecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCCCC
Confidence            9999999999999999998  577775431   13331     123321   344444666777666655544


No 2  
>KOG1217|consensus
Probab=99.55  E-value=3.1e-13  Score=140.98  Aligned_cols=270  Identities=36%  Similarity=0.772  Sum_probs=184.1

Q ss_pred             eecccCcccccccc-cccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCCCccccCCCCCCCCCccCCCcccccc
Q psy11799        113 SLKKKKFYSVFLDM-DECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLDQDECTDDSKCREGCQTCEDI  191 (483)
Q Consensus       113 ~~~~~~~~~~C~~~-~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g~~C~~~~~C~~~~~C~~g~~~C~~~  191 (483)
                      .|..++.+..+... ++|... ...  |.+++.|.+..++|.|.|++||.+.    .|+..                   
T Consensus       155 ~C~~g~~~~~~~~~~~~C~~~-~~~--c~~~~~C~~~~~~~~C~c~~~~~~~----~~~~~-------------------  208 (487)
T KOG1217|consen  155 SCTEGYEGEPCETDLDECIQY-SSP--CQNGGTCVNTGGSYLCSCPPGYTGS----TCETT-------------------  208 (487)
T ss_pred             eeCCCcccccccccccccccC-CCC--cCCCcccccCCCCeeEeCCCCccCC----cCcCC-------------------
Confidence            44455555555532 677743 334  9899999999999999999999844    33211                   


Q ss_pred             chhccCCCCCCCeEEeCCCCeEEeCCCCceeCCCCCcc-cccCcccCCCCCCCCeeeeCCCCceEeCCCCceecCCCCCc
Q psy11799        192 NECLELSNQCAFRCHNVPGSFRCICPYGYALAPDGRHC-IDINECEQFPGLCAHTCTNTEGSYTCGCHPGFELSSDGSGT  270 (483)
Q Consensus       192 ~~C~~~~~~C~~~C~~~~gs~~C~C~~G~~~~~~g~~C-~~~~~C~~~~~~~~~~C~~~~g~~~C~C~~G~~~~~~g~~~  270 (483)
                              ...+.|++.   +.|.+.+++.    +..| ..+.++...   . +.|++..+.++|.|++||.+... . .
T Consensus       209 --------~~~~~c~~~---~~~~~~~g~~----~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~-~-~  267 (487)
T KOG1217|consen  209 --------GNGGTCVDS---VACSCPPGAR----GPECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC-V-T  267 (487)
T ss_pred             --------CCCceEecc---eeccCCCCCC----CCCcccccccccCC---C-CcccccCCceeeeCCCCcccccc-c-e
Confidence                    011345544   5788889887    6666 334444433   3 78999988899999999996532 1 5


Q ss_pred             cccCccccCCCC-cccccc-cCCCCeeEeCCCCceeCCCCCccccCCcCC----CCCCCCCCCCCCCCCCEE--eeCCCC
Q psy11799        271 CLDSDECATGRH-ICQQIC-NLLGGYKCGCPEGYVRHPFSNECIDDNECA----RNPCEPSSNTTGERGGKC--FNTAGS  342 (483)
Q Consensus       271 C~~~~~C~~~~~-~c~~~C-~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~----~~~C~~~~~~~c~~~g~C--~~~~g~  342 (483)
                      +.++++|....+ ...+.| +..+.|.|.|++||++..+ ..+.+..+|.    ..+|.+        ++.|  ....+.
T Consensus       268 ~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~--------g~~C~~~~~~~~  338 (487)
T KOG1217|consen  268 CVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCAN--------GGTCNTLGSFGG  338 (487)
T ss_pred             eeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCC--------CcccccCCCCCC
Confidence            677888888753 334678 6666699999999999887 4556666774    346666        7788  344557


Q ss_pred             eeEECCCCCeeccCCCceeeCCCccCCCCCCCC--cEECCCCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCc
Q psy11799        343 YRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFG--CTPLGISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNL  420 (483)
Q Consensus       343 ~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~--c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C  420 (483)
                      +.|.|..+|    .++.|+...+.|...++.++  |.......+.|.|+.+|.+...                 .....+
T Consensus       339 ~~C~c~~~~----~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~-----------------~~~~~~  397 (487)
T KOG1217|consen  339 FRCACGPGF----TGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKAN-----------------GDGVGC  397 (487)
T ss_pred             CCcCCCCCC----CCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCc-----------------cccccc
Confidence            889999998    77888876447887777766  4441237899999999987410                 001235


Q ss_pred             ccccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCCCCcCCC
Q psy11799        421 GDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLDQDECT  473 (483)
Q Consensus       421 ~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~C~  473 (483)
                      .++++|..          .+.|++..+++.|. ++ + ..... .|.+++++.
T Consensus       398 ~~~~~c~~----------~~~c~~~~~~~~c~-~~-~-~~~~~-~~~~~~~~~  436 (487)
T KOG1217|consen  398 EDIDECSG----------CGDCVNGPGGGACT-PP-G-LVSPG-TCDDIDECP  436 (487)
T ss_pred             cccccccC----------CcceeccCCCCccc-cC-c-ccCCc-ceecccccc
Confidence            56666642          46778788999999 88 5 44344 777777664


No 3  
>KOG1217|consensus
Probab=99.51  E-value=8.1e-13  Score=137.85  Aligned_cols=269  Identities=40%  Similarity=0.882  Sum_probs=182.8

Q ss_pred             CCCCeEeeCCCceEeeCCCCCeeCCCCCc--cccCCCCCCCCCccCCCccccccchhccCCCCCCCeEEe---CCCCeEE
Q psy11799        140 PKPGTCINTMGSFRCLCPRGFKLDRTGTF--CLDQDECTDDSKCREGCQTCEDINECLELSNQCAFRCHN---VPGSFRC  214 (483)
Q Consensus       140 ~~~g~C~~~~g~~~C~C~~G~~g~~~g~~--C~~~~~C~~~~~C~~g~~~C~~~~~C~~~~~~C~~~C~~---~~gs~~C  214 (483)
                      ...+.+.....+|.|.|.+||.+..+...  |.....            .+..           .+.|..   ....+.|
T Consensus        98 ~~~~~~~~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~------------~~~~-----------~~~c~~~~~~~~~~~c  154 (487)
T KOG1217|consen   98 LLCGECVDCVGSYECTCPPGYQGTPCEGECECVTGPG------------VCCI-----------DGSCSNGPGSVGPFRC  154 (487)
T ss_pred             cCCccccCCCCCceeeCCCccccCcCCcceeecCCCC------------CeeC-----------chhhcCCCCCCCceee
Confidence            35667777888999999999996655442  321110            0011           134444   3457899


Q ss_pred             eCCCCceeCCCCCccccc-CcccC--CCCCCCCeeeeCCCCceEeCCCCceecCCCCC----cccc-----------Ccc
Q psy11799        215 ICPYGYALAPDGRHCIDI-NECEQ--FPGLCAHTCTNTEGSYTCGCHPGFELSSDGSG----TCLD-----------SDE  276 (483)
Q Consensus       215 ~C~~G~~~~~~g~~C~~~-~~C~~--~~~~~~~~C~~~~g~~~C~C~~G~~~~~~g~~----~C~~-----------~~~  276 (483)
                      .|..||.    +..+... ++|..  .++..++.|.+..+.|.|.|+++|.+......    .|..           ...
T Consensus       155 ~C~~g~~----~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~  230 (487)
T KOG1217|consen  155 SCTEGYE----GEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPE  230 (487)
T ss_pred             eeCCCcc----cccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCC
Confidence            9999999    6666533 67773  34777889999999999999999986644321    2221           122


Q ss_pred             ccCCCCccc---ccc-cCCCCeeEeCCCCceeCCCCCccccCCcCCCCC-CCCCCCCCCCCCCEEeeCCCCeeEECCCCC
Q psy11799        277 CATGRHICQ---QIC-NLLGGYKCGCPEGYVRHPFSNECIDDNECARNP-CEPSSNTTGERGGKCFNTAGSYRCGCPDGF  351 (483)
Q Consensus       277 C~~~~~~c~---~~C-~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~-C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~  351 (483)
                      |......+.   +.| +..++|.|.|++||.+... ..+.++++|.... |.+        +++|++..+.|.|.|++||
T Consensus       231 c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~~c~~--------~~~C~~~~~~~~C~C~~g~  301 (487)
T KOG1217|consen  231 CEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDAC-VTCVDVDSCALIASCPN--------GGTCVNVPGSYRCTCPPGF  301 (487)
T ss_pred             cccccccccCCCCcccccCCceeeeCCCCcccccc-ceeeeccccCCCCccCC--------CCeeecCCCcceeeCCCCC
Confidence            222222222   456 6677899999999999753 3567788888753 777        8999999999999999999


Q ss_pred             eeccCCCceeeCCCccC----CCCCCCCcEE--CC-CCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccc-
Q psy11799        352 QFEAALQLCTQLSSSCL----SSPCSFGCTP--LG-ISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDI-  423 (483)
Q Consensus       352 ~~~~~g~~C~~~~~~C~----~~~C~~~c~~--~~-~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~-  423 (483)
                      .+... ..+.... .|.    ..+|.++..+  .. ...+.|.|.+||.|..                       |+.. 
T Consensus       302 ~g~~~-~~~~~~~-~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~-----------------------C~~~~  356 (487)
T KOG1217|consen  302 TGRLC-TECVDVD-ECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRR-----------------------CEDSN  356 (487)
T ss_pred             CCCCC-ccccccc-cccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCc-----------------------cccCC
Confidence            66443 2232223 563    4557665222  11 1356788888876654                       6666 


Q ss_pred             cccccCCCCccCCCCCCeEee-CCCCeeeeCCCCcccC--CCCCCCCCCcCCCC
Q psy11799        424 DECKEEINGVAVCPKPGTCIN-TMGSFRCLCPRGFKLD--RTGTFCLDQDECTD  474 (483)
Q Consensus       424 ~~C~~~~~~~~~C~~~g~C~~-~~g~~~C~C~~G~~~~--~~g~~C~~~~~C~~  474 (483)
                      ++|...     ++.+++.|++ ..++|.|.|+.+|.+.  ..+..+.++++|..
T Consensus       357 ~~C~~~-----~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~  405 (487)
T KOG1217|consen  357 DECASS-----PCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG  405 (487)
T ss_pred             ccccCC-----ccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC
Confidence            378765     6889999999 7899999999999864  57788888888863


No 4  
>KOG1214|consensus
Probab=99.44  E-value=4.8e-13  Score=136.72  Aligned_cols=155  Identities=38%  Similarity=0.919  Sum_probs=120.6

Q ss_pred             cccCCCCccc--ccc--cCCCCeeEeCCCCceeCCCCCccccCCcCCC--CCCCCCCCCCCCCCCEEeeCCCCeeEECCC
Q psy11799        276 ECATGRHICQ--QIC--NLLGGYKCGCPEGYVRHPFSNECIDDNECAR--NPCEPSSNTTGERGGKCFNTAGSYRCGCPD  349 (483)
Q Consensus       276 ~C~~~~~~c~--~~C--~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~--~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~  349 (483)
                      +|-...+.|.  +.|  ...-.|+|.|..||.|+  ...|.++++|+.  ..|.+        ++.|++.+++|+|.|..
T Consensus       694 pCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gd--gr~c~d~~eca~~~~~CGp--------~s~Cin~pg~~rceC~~  763 (1289)
T KOG1214|consen  694 PCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGD--GRNCVDENECATGFHRCGP--------NSVCINLPGSYRCECRS  763 (1289)
T ss_pred             cceecCcccCCCccccCCCCcceEEEEeeccCCC--CCCCCChhhhccCCCCCCC--------CceeecCCCceeEEEee
Confidence            3444444444  445  33457999999999987  467889999987  36766        89999999999999999


Q ss_pred             CCeeccCCCceeeCCC-----ccC--CCCCCC-C---cEECCCCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCC
Q psy11799        350 GFQFEAALQLCTQLSS-----SCL--SSPCSF-G---CTPLGISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERP  418 (483)
Q Consensus       350 G~~~~~~g~~C~~~~~-----~C~--~~~C~~-~---c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~  418 (483)
                      ||.....+-+|..+..     .|+  .+.|.. +   |...+.+.|+|.|.+||.|++                     .
T Consensus       764 gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG---------------------~  822 (1289)
T KOG1214|consen  764 GYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG---------------------H  822 (1289)
T ss_pred             cceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCc---------------------c
Confidence            9998878788975432     455  345642 2   445555789999999999976                     3


Q ss_pred             CcccccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCCC
Q psy11799        419 NLGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLD  468 (483)
Q Consensus       419 ~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~  468 (483)
                      .|.++|+|..+     -|...+.|++++++|.|.|.+||+||  |.+|..
T Consensus       823 ~c~dvDeC~ps-----rChp~A~CyntpgsfsC~C~pGy~GD--Gf~CVP  865 (1289)
T KOG1214|consen  823 QCTDVDECSPS-----RCHPAATCYNTPGSFSCRCQPGYYGD--GFQCVP  865 (1289)
T ss_pred             ccccccccCcc-----ccCCCceEecCCCcceeecccCccCC--CceecC
Confidence            47888999865     79999999999999999999999986  777864


No 5  
>KOG1219|consensus
Probab=99.26  E-value=9.5e-12  Score=137.65  Aligned_cols=109  Identities=40%  Similarity=1.010  Sum_probs=94.9

Q ss_pred             CcCCCCCCCCCCCCCCCCCCEEeeCC-CCeeEECCCCCeeccCCCceeeCCCccCCCCCCCC--cEECCCCCeeEeCCCC
Q psy11799        315 NECARNPCEPSSNTTGERGGKCFNTA-GSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFG--CTPLGISGFSCECPSG  391 (483)
Q Consensus       315 ~~C~~~~C~~~~~~~c~~~g~C~~~~-g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~--c~~~~~~~~~C~C~~G  391 (483)
                      +.|..+||++        +|+|...+ ++|.|.|+.-|    .|++|+....+|.++||.+|  |.... ++|.|.|+.|
T Consensus      3865 d~C~~npCqh--------gG~C~~~~~ggy~CkCpsqy----sG~~CEi~~epC~snPC~~GgtCip~~-n~f~CnC~~g 3931 (4289)
T KOG1219|consen 3865 DPCNDNPCQH--------GGTCISQPKGGYKCKCPSQY----SGNHCEIDLEPCASNPCLTGGTCIPFY-NGFLCNCPNG 3931 (4289)
T ss_pred             cccccCcccC--------CCEecCCCCCceEEeCcccc----cCcccccccccccCCCCCCCCEEEecC-CCeeEeCCCC
Confidence            7899999999        99999866 67999999999    88999987779999999988  55555 8999999999


Q ss_pred             eeecCCcceecccCCCCCCCcccCCCCCcc--cccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCCC
Q psy11799        392 YQRIGQGHCLATISSGLYPGFRSYERPNLG--DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCLD  468 (483)
Q Consensus       392 y~g~~~g~C~~~~~~~C~~g~~g~~~~~C~--~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~  468 (483)
                      |+|.+                       |+  .|++|..+     +|.++|.|++..|+|.|.|.+||    -|..|..
T Consensus      3932 yTG~~-----------------------Ce~~Gi~eCs~n-----~C~~gg~C~n~~gsf~CncT~g~----~gr~c~~ 3978 (4289)
T KOG1219|consen 3932 YTGKR-----------------------CEARGISECSKN-----VCGTGGQCINIPGSFHCNCTPGI----LGRTCCA 3978 (4289)
T ss_pred             ccCce-----------------------eecccccccccc-----cccCCceeeccCCceEeccChhH----hcccCcc
Confidence            99986                       32  27889765     89999999999999999999999    6777753


No 6  
>KOG4289|consensus
Probab=99.15  E-value=1.5e-10  Score=124.11  Aligned_cols=88  Identities=32%  Similarity=0.866  Sum_probs=76.7

Q ss_pred             cCCCCeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCeeccCCCceeeCC--Cc
Q psy11799        289 NLLGGYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLS--SS  366 (483)
Q Consensus       289 ~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~--~~  366 (483)
                      +..++++|+|++||+|+.|+   ..+|.|-+.||.+        +|+|....|+|+|.|.+||    +|++|+...  ..
T Consensus      1217 ~pvnglrCrCPpGFTgd~Ce---TeiDlCYs~pC~n--------ng~C~srEggYtCeCrpg~----tGehCEvs~~agr 1281 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCE---TEIDLCYSGPCGN--------NGRCRSREGGYTCECRPGF----TGEHCEVSARAGR 1281 (2531)
T ss_pred             cccCceeEeCCCCCCccccc---chhHhhhcCCCCC--------CCceEEecCceeEEecCCc----cccceeeecccCc
Confidence            45578899999999999888   7799999999998        9999999999999999999    778897532  26


Q ss_pred             cCCCCCCCC--cEECCCCCeeEeCCCC
Q psy11799        367 CLSSPCSFG--CTPLGISGFSCECPSG  391 (483)
Q Consensus       367 C~~~~C~~~--c~~~~~~~~~C~C~~G  391 (483)
                      |.+..|.++  |+....+.+.|.|+.|
T Consensus      1282 CvpGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1282 CVPGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             cccceecCCCEEeecCCCceeccCCCc
Confidence            889999988  6777778899999998


No 7  
>KOG4289|consensus
Probab=99.12  E-value=7.3e-10  Score=119.08  Aligned_cols=87  Identities=37%  Similarity=0.852  Sum_probs=70.2

Q ss_pred             EeCCCCeEEeCCCCceeCCCCCcc-cccCcccCCCCCCCCeeeeCCCCceEeCCCCceecCCCCCccc---cCccccCCC
Q psy11799        206 HNVPGSFRCICPYGYALAPDGRHC-IDINECEQFPGLCAHTCTNTEGSYTCGCHPGFELSSDGSGTCL---DSDECATGR  281 (483)
Q Consensus       206 ~~~~gs~~C~C~~G~~~~~~g~~C-~~~~~C~~~~~~~~~~C~~~~g~~~C~C~~G~~~~~~g~~~C~---~~~~C~~~~  281 (483)
                      ++..+.++|.|++||+    |..| .+++.|-..||.+++.|....++|+|.|.+||+    |. .|+   ....|.++-
T Consensus      1216 i~pvnglrCrCPpGFT----gd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~t----Ge-hCEvs~~agrCvpGv 1286 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFT----GDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFT----GE-HCEVSARAGRCVPGV 1286 (2531)
T ss_pred             ccccCceeEeCCCCCC----cccccchhHhhhcCCCCCCCceEEecCceeEEecCCcc----cc-ceeeecccCccccce
Confidence            3455678999999999    8899 579999999999999999999999999999999    44 443   234566665


Q ss_pred             Ccccccc--cCCCCeeEeCCCC
Q psy11799        282 HICQQIC--NLLGGYKCGCPEG  301 (483)
Q Consensus       282 ~~c~~~C--~~~g~~~C~C~~G  301 (483)
                      +...+.|  ...+++.|.|+.|
T Consensus      1287 C~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1287 CKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred             ecCCCEEeecCCCceeccCCCc
Confidence            5555788  5667889999876


No 8  
>KOG0994|consensus
Probab=99.05  E-value=1.4e-09  Score=115.02  Aligned_cols=73  Identities=21%  Similarity=0.383  Sum_probs=47.1

Q ss_pred             CEEeeCCCCeeEECCCCCeeccCCCceee--------CCCccCCCCCCCCcEECC---CCCeeEeCCCCeeecCCcceec
Q psy11799        334 GKCFNTAGSYRCGCPDGFQFEAALQLCTQ--------LSSSCLSSPCSFGCTPLG---ISGFSCECPSGYQRIGQGHCLA  402 (483)
Q Consensus       334 g~C~~~~g~~~C~C~~G~~~~~~g~~C~~--------~~~~C~~~~C~~~c~~~~---~~~~~C~C~~Gy~g~~~g~C~~  402 (483)
                      -+|....|  +|+|.+||    .|+.|..        .+..|..-.|....+.++   ....+|+|.+|-.|....    
T Consensus      1077 pqCN~ftG--QCqCkpGf----GGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd---- 1146 (1758)
T KOG0994|consen 1077 PQCNEFTG--QCQCKPGF----GGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCD---- 1146 (1758)
T ss_pred             cccccccc--ceeccCCC----CCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchh----
Confidence            36666666  99999999    6777641        222455555554322221   256789999998887644    


Q ss_pred             ccCCCCCCCcccCCCCCcc
Q psy11799        403 TISSGLYPGFRSYERPNLG  421 (483)
Q Consensus       403 ~~~~~C~~g~~g~~~~~C~  421 (483)
                          .|.+||+|.. +.|.
T Consensus      1147 ----qCaRgy~G~f-P~C~ 1160 (1758)
T KOG0994|consen 1147 ----QCARGYSGQF-PVCV 1160 (1758)
T ss_pred             ----hhhhhhcCCC-CCCc
Confidence                4778999876 4444


No 9  
>KOG1219|consensus
Probab=99.03  E-value=5.1e-10  Score=124.48  Aligned_cols=106  Identities=33%  Similarity=0.801  Sum_probs=95.0

Q ss_pred             ccccCCCCcccccc--cCCCCeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCe
Q psy11799        275 DECATGRHICQQIC--NLLGGYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ  352 (483)
Q Consensus       275 ~~C~~~~~~c~~~C--~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~  352 (483)
                      +.|...++...+.|  ...++|.|.|++-|+|..|+   .++..|..+||..        ||+|+...++|.|.|+.|| 
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE---i~~epC~snPC~~--------GgtCip~~n~f~CnC~~gy- 3932 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE---IDLEPCASNPCLT--------GGTCIPFYNGFLCNCPNGY- 3932 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccc---cccccccCCCCCC--------CCEEEecCCCeeEeCCCCc-
Confidence            78888877777999  56678999999999999998   7788999999998        9999999999999999999 


Q ss_pred             eccCCCceeeC-CCccCCCCCCCC--cEECCCCCeeEeCCCCeeecC
Q psy11799        353 FEAALQLCTQL-SSSCLSSPCSFG--CTPLGISGFSCECPSGYQRIG  396 (483)
Q Consensus       353 ~~~~g~~C~~~-~~~C~~~~C~~~--c~~~~~~~~~C~C~~Gy~g~~  396 (483)
                         +|++|+.. .++|+.++|.++  |.+.. ++|+|.|.+||.|..
T Consensus      3933 ---TG~~Ce~~Gi~eCs~n~C~~gg~C~n~~-gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3933 ---TGKRCEARGISECSKNVCGTGGQCINIP-GSFHCNCTPGILGRT 3975 (4289)
T ss_pred             ---cCceeecccccccccccccCCceeeccC-CceEeccChhHhccc
Confidence               88899875 568999999987  77777 899999999999876


No 10 
>KOG0994|consensus
Probab=98.68  E-value=3.8e-07  Score=97.20  Aligned_cols=41  Identities=10%  Similarity=0.136  Sum_probs=31.1

Q ss_pred             ccCCCceEEEeCCCCeeeEeeecCCCCcccCCceeeeccCcceeeecc
Q psy11799         25 NMLAGIFVWSLASQWTGEILTNVSPYPTSVNMASVSTRKGLTGVSVTR   72 (483)
Q Consensus        25 ~~~~g~~~C~C~~g~~G~~C~~c~~~~~~~~~~~C~c~~g~~g~~C~~   72 (483)
                      -|+.|...|.|.|.  |..     +.-|..+++.|.|.++..|.+|+.
T Consensus       757 ~l~n~a~~CnCnpt--GSl-----S~vCn~~GGqCqCkPnVVGR~Cdq  797 (1758)
T KOG0994|consen  757 LLHNGASMCNCNPT--GSL-----SSVCNPNGGQCQCKPNVVGRRCDQ  797 (1758)
T ss_pred             HHhcCccccccCCC--ccc-----cccccCCCceecccCccccccccc
Confidence            35566677888875  443     445677888999999999999985


No 11 
>KOG1225|consensus
Probab=98.50  E-value=5.9e-07  Score=91.61  Aligned_cols=76  Identities=36%  Similarity=0.749  Sum_probs=51.5

Q ss_pred             CeeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCeeccCCCceeeCCCccCCCCC
Q psy11799        293 GYKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPC  372 (483)
Q Consensus       293 ~~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C  372 (483)
                      .+.|.|..+|.+..+.     ...|.. -|.+        ++.|++.    +|+|++||    +|..|..+.  |... |
T Consensus       233 ~~ic~c~~~~~g~~c~-----~~~C~~-~c~~--------~g~c~~G----~CIC~~Gf----~G~dC~e~~--Cp~~-c  287 (525)
T KOG1225|consen  233 DGICECPEGYFGPLCS-----TIYCPG-GCTG--------RGQCVEG----RCICPPGF----TGDDCDELV--CPVD-C  287 (525)
T ss_pred             CceeecCCceeCCccc-----cccCCC-CCcc--------cceEeCC----eEeCCCCC----cCCCCCccc--CCcc-c
Confidence            3489999999986443     112221 2333        5788876    99999999    666776543  5544 7


Q ss_pred             CCCcEECCCCCeeEeCCCCeeecC
Q psy11799        373 SFGCTPLGISGFSCECPSGYQRIG  396 (483)
Q Consensus       373 ~~~c~~~~~~~~~C~C~~Gy~g~~  396 (483)
                      +.+....+   ..|+|++||.|..
T Consensus       288 s~~g~~~~---g~CiC~~g~~G~d  308 (525)
T KOG1225|consen  288 SGGGVCVD---GECICNPGYSGKD  308 (525)
T ss_pred             CCCceecC---CEeecCCCccccc
Confidence            66655554   3999999999976


No 12 
>KOG4260|consensus
Probab=98.42  E-value=3.1e-07  Score=83.48  Aligned_cols=164  Identities=29%  Similarity=0.661  Sum_probs=100.7

Q ss_pred             EeecccCcccccccccccccCCCCccCCCCCCeEeeC---CCceEeeCCCCCeeCCCCCccccC--CCCCCCCCccCCCc
Q psy11799        112 ISLKKKKFYSVFLDMDECKEEINGVAVCPKPGTCINT---MGSFRCLCPRGFKLDRTGTFCLDQ--DECTDDSKCREGCQ  186 (483)
Q Consensus       112 ~~~~~~~~~~~C~~~~~C~~~~~~~~~C~~~g~C~~~---~g~~~C~C~~G~~g~~~g~~C~~~--~~C~~~~~C~~g~~  186 (483)
                      ..|+.+.||.+|.   .|......  ||..+|+|.-.   .|+-.|.|.+||+    |..|..-  .......  ..-..
T Consensus       130 vCCp~gtyGpdCl---~Cpggser--~C~GnG~C~GdGsR~GsGkCkC~~GY~----Gp~C~~Cg~eyfes~R--ne~~l  198 (350)
T KOG4260|consen  130 VCCPDGTYGPDCL---QCPGGSER--PCFGNGSCHGDGSREGSGKCKCETGYT----GPLCRYCGIEYFESSR--NEQHL  198 (350)
T ss_pred             eccCCCCcCCccc---cCCCCCcC--CcCCCCcccCCCCCCCCCcccccCCCC----CccccccchHHHHhhc--ccccc
Confidence            3567888899987   34332122  38888999743   5677999999999    5555311  0000000  00000


Q ss_pred             cccccchhccCCCCCCCeEEeCCCCeEE-eCCCCceeCCCCCcccccCcccC--CCCCCCCeeeeCCCCceEeCCCCcee
Q psy11799        187 TCEDINECLELSNQCAFRCHNVPGSFRC-ICPYGYALAPDGRHCIDINECEQ--FPGLCAHTCTNTEGSYTCGCHPGFEL  263 (483)
Q Consensus       187 ~C~~~~~C~~~~~~C~~~C~~~~gs~~C-~C~~G~~~~~~g~~C~~~~~C~~--~~~~~~~~C~~~~g~~~C~C~~G~~~  263 (483)
                      .|+.   |   ...|.+.|... ++-.| .|..||.++  ...|+||++|..  .|+.....|+|+.|+|.|...+||..
T Consensus       199 vCt~---C---h~~C~~~Csg~-~~k~C~kCkkGW~ld--e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  199 VCTA---C---HEGCLGVCSGE-SSKGCSKCKKGWKLD--EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             hhhh---h---hhhhhcccCCC-CCCChhhhcccceec--ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence            1110   0   11122344332 22345 599999987  556999999985  45555779999999999999999973


Q ss_pred             cCCCCCccccCccccCCCCccc---ccc-cCCCCeeEeCCCCcee
Q psy11799        264 SSDGSGTCLDSDECATGRHICQ---QIC-NLLGGYKCGCPEGYVR  304 (483)
Q Consensus       264 ~~~g~~~C~~~~~C~~~~~~c~---~~C-~~~g~~~C~C~~G~~g  304 (483)
                      .         +|+|+.-...+.   ..| +..+.|+|.|..|+.-
T Consensus       270 g---------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~  305 (350)
T KOG4260|consen  270 G---------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLII  305 (350)
T ss_pred             C---------hHHhhhhhhhcccCCCCcccCCccEEEEeccccee
Confidence            1         345543222232   456 8889999999988863


No 13 
>KOG4260|consensus
Probab=98.37  E-value=6.6e-07  Score=81.41  Aligned_cols=156  Identities=27%  Similarity=0.628  Sum_probs=101.7

Q ss_pred             CCCCceeCCCCCcccccCcccCCCCCCCCeeee---CCCCceEeCCCCceecCCCCCcccc------C----ccccCCCC
Q psy11799        216 CPYGYALAPDGRHCIDINECEQFPGLCAHTCTN---TEGSYTCGCHPGFELSSDGSGTCLD------S----DECATGRH  282 (483)
Q Consensus       216 C~~G~~~~~~g~~C~~~~~C~~~~~~~~~~C~~---~~g~~~C~C~~G~~~~~~g~~~C~~------~----~~C~~~~~  282 (483)
                      |++|-.    |+.|..-.--...||.-++.|.-   ..|+-+|.|.+||.+..+..  |..      .    -.|..-..
T Consensus       132 Cp~gty----GpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~--Cg~eyfes~Rne~~lvCt~Ch~  205 (350)
T KOG4260|consen  132 CPDGTY----GPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY--CGIEYFESSRNEQHLVCTACHE  205 (350)
T ss_pred             cCCCCc----CCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc--cchHHHHhhcccccchhhhhhh
Confidence            777776    88774321112356666777753   24667899999999554321  211      0    11221112


Q ss_pred             cccccccCCCCeeEe-CCCCceeCCCCCccccCCcCCC--CCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCeeccCCCc
Q psy11799        283 ICQQICNLLGGYKCG-CPEGYVRHPFSNECIDDNECAR--NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQL  359 (483)
Q Consensus       283 ~c~~~C~~~g~~~C~-C~~G~~g~~~~~~C~~~~~C~~--~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~~~~g~~  359 (483)
                      .|.+.|...++..|. |..||..+  +..|.|+++|..  .+|..        +-.|+|+.|+|.|...+||...  -+.
T Consensus       206 ~C~~~Csg~~~k~C~kCkkGW~ld--e~gCvDvnEC~~ep~~c~~--------~qfCvNteGSf~C~dk~Gy~~g--~d~  273 (350)
T KOG4260|consen  206 GCLGVCSGESSKGCSKCKKGWKLD--EEGCVDVNECQNEPAPCKA--------HQFCVNTEGSFKCEDKEGYKKG--VDE  273 (350)
T ss_pred             hhhcccCCCCCCChhhhcccceec--ccccccHHHHhcCCCCCCh--------hheeecCCCceEecccccccCC--hHH
Confidence            334456556666675 99999988  678999999986  47776        7899999999999999999652  222


Q ss_pred             eeeCCCccCCCCCC--CC-cEECCCCCeeEeCCCCeeec
Q psy11799        360 CTQLSSSCLSSPCS--FG-CTPLGISGFSCECPSGYQRI  395 (483)
Q Consensus       360 C~~~~~~C~~~~C~--~~-c~~~~~~~~~C~C~~Gy~g~  395 (483)
                      |+..     ...|.  +. |.+++ +.|+|.|..|+.-.
T Consensus       274 C~~~-----~d~~~~kn~~c~ni~-~~~r~v~f~~~~~~  306 (350)
T KOG4260|consen  274 CQFC-----ADVCASKNRPCMNID-GQYRCVCFSGLIII  306 (350)
T ss_pred             hhhh-----hhhcccCCCCcccCC-ccEEEEecccceee
Confidence            3221     13332  22 67777 89999999997643


No 14 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.36  E-value=2.8e-07  Score=61.27  Aligned_cols=42  Identities=55%  Similarity=1.214  Sum_probs=34.7

Q ss_pred             cccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799        422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC  466 (483)
Q Consensus       422 ~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C  466 (483)
                      |||||....+   +|..+++|+|+.|+|+|+|++||+....+..|
T Consensus         1 DidEC~~~~~---~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C   42 (42)
T PF07645_consen    1 DIDECAEGPH---NCPENGTCVNTEGSYSCSCPPGYELNDDGTTC   42 (42)
T ss_dssp             ESSTTTTTSS---SSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred             CccccCCCCC---cCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence            6889987654   89999999999999999999999855555443


No 15 
>KOG1225|consensus
Probab=98.31  E-value=1.3e-06  Score=89.12  Aligned_cols=102  Identities=17%  Similarity=0.273  Sum_probs=70.7

Q ss_pred             eEEEeCCCCeeeEeee--cCCC----CcccCCceeeeccCcceeeecccCcceeeeeccccccCceEEEeeCceeEEEEE
Q psy11799         31 FVWSLASQWTGEILTN--VSPY----PTSVNMASVSTRKGLTGVSVTRDTSYTMAILNVEQDKQGILIFSMLQTRYFNLF  104 (483)
Q Consensus        31 ~~C~C~~g~~G~~C~~--c~~~----~~~~~~~~C~c~~g~~g~~C~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~  104 (483)
                      +.|.|..+|.|+.|+.  |.++    ..++. +.|.|++||+|..|+....... ......+.+|               
T Consensus       234 ~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~-G~CIC~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g---------------  296 (525)
T KOG1225|consen  234 GICECPEGYFGPLCSTIYCPGGCTGRGQCVE-GRCICPPGFTGDDCDELVCPVD-CSGGGVCVDG---------------  296 (525)
T ss_pred             ceeecCCceeCCccccccCCCCCcccceEeC-CeEeCCCCCcCCCCCcccCCcc-cCCCceecCC---------------
Confidence            4799999999999994  4433    22333 5999999999999997433333 1111222222               


Q ss_pred             EEEEEEEEeecccCcccccccccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCCCccc
Q psy11799        105 YVIIFYVISLKKKKFYSVFLDMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCL  170 (483)
Q Consensus       105 ~~~~~~~~~~~~~~~~~~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g~~C~  170 (483)
                            ++.|..++++..|+. ..|..+      |.++|.|++.    +|.|.+||+    |..|.
T Consensus       297 ------~CiC~~g~~G~dCs~-~~cpad------C~g~G~Ci~G----~C~C~~Gy~----G~~C~  341 (525)
T KOG1225|consen  297 ------ECICNPGYSGKDCSI-RRCPAD------CSGHGKCIDG----ECLCDEGYT----GELCI  341 (525)
T ss_pred             ------EeecCCCcccccccc-ccCCcc------CCCCCcccCC----ceEeCCCCc----CCccc
Confidence                  235678888999973 347654      9999999933    699999999    55554


No 16 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.29  E-value=7.3e-07  Score=59.27  Aligned_cols=40  Identities=53%  Similarity=1.178  Sum_probs=33.4

Q ss_pred             cccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCCC
Q psy11799        125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGT  167 (483)
Q Consensus       125 ~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g~  167 (483)
                      |||||... ++.  |..+++|+|+.|+|+|.|++||.....+.
T Consensus         1 DidEC~~~-~~~--C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~   40 (42)
T PF07645_consen    1 DIDECAEG-PHN--CPENGTCVNTEGSYSCSCPPGYELNDDGT   40 (42)
T ss_dssp             ESSTTTTT-SSS--SSTTSEEEEETTEEEEEESTTEEECTTSS
T ss_pred             CccccCCC-CCc--CCCCCEEEcCCCCEEeeCCCCcEECCCCC
Confidence            68999976 555  98899999999999999999998554443


No 17 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=98.07  E-value=3.2e-06  Score=47.94  Aligned_cols=24  Identities=46%  Similarity=1.039  Sum_probs=22.8

Q ss_pred             CeeeeCCCCcccCCCCCCCCCCcC
Q psy11799        448 SFRCLCPRGFKLDRTGTFCLDQDE  471 (483)
Q Consensus       448 ~~~C~C~~G~~~~~~g~~C~~~~~  471 (483)
                      ||+|+|++||++++++++|+||||
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            699999999999999999999986


No 18 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.70  E-value=3.2e-05  Score=49.00  Aligned_cols=33  Identities=55%  Similarity=1.207  Sum_probs=27.4

Q ss_pred             CCCCCCCeEEeCCCCeEEeCCCCceeCCCCCcc
Q psy11799        197 LSNQCAFRCHNVPGSFRCICPYGYALAPDGRHC  229 (483)
Q Consensus       197 ~~~~C~~~C~~~~gs~~C~C~~G~~~~~~g~~C  229 (483)
                      .+..|...|++.+++|+|.|++||.|.+|+++|
T Consensus         4 ~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    4 NNGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             GGGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred             CCCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence            356678999999999999999999999998876


No 19 
>KOG1226|consensus
Probab=97.69  E-value=0.00038  Score=72.86  Aligned_cols=146  Identities=22%  Similarity=0.506  Sum_probs=89.6

Q ss_pred             ccccCCCCe---eEeCCCCceeCCCCCc--cc----cCCcCCCCCCCCCCCC-CCCCCCEEeeCCCCeeEECCCCCeecc
Q psy11799        286 QICNLLGGY---KCGCPEGYVRHPFSNE--CI----DDNECARNPCEPSSNT-TGERGGKCFNTAGSYRCGCPDGFQFEA  355 (483)
Q Consensus       286 ~~C~~~g~~---~C~C~~G~~g~~~~~~--C~----~~~~C~~~~C~~~~~~-~c~~~g~C~~~~g~~~C~C~~G~~~~~  355 (483)
                      ..|+..|.+   .|.|.+||.|..|+-.  ..    ..+.|..     .... .|...|.|+=.    +|+|.+...+..
T Consensus       467 ~~C~g~G~~~CG~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~-----~~~~~vCSgrG~C~CG----qC~C~~~~~~~i  537 (783)
T KOG1226|consen  467 ALCHGNGTFVCGQCRCDEGWLGKKCECSTDELSSSEEEDKCRE-----NSDSPVCSGRGDCVCG----QCVCHKPDNGKI  537 (783)
T ss_pred             cccCCCCcEEecceecCCCCCCCcccCCccccCcHhHHhhccC-----CCCCCCcCCCCcEeCC----ceEecCCCCCce
Confidence            445544444   5789999999887611  10    1123322     2221 44448888766    899998887666


Q ss_pred             CCCceeeCCCccC---CCCCCCCcEECCCCCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccccccccCCCC
Q psy11799        356 ALQLCTQLSSSCL---SSPCSFGCTPLGISGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDIDECKEEING  432 (483)
Q Consensus       356 ~g~~C~~~~~~C~---~~~C~~~c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~  432 (483)
                      .|+.|+-.+-.|.   ...|.....+.-   ..|.|.+||+|...+         |+           .+.+.|..... 
T Consensus       538 ~G~fCECDnfsC~r~~g~lC~g~G~C~C---G~CvC~~GwtG~~C~---------C~-----------~std~C~~~~G-  593 (783)
T KOG1226|consen  538 YGKFCECDNFSCERHKGVLCGGHGRCEC---GRCVCNPGWTGSACN---------CP-----------LSTDTCESSDG-  593 (783)
T ss_pred             eeeeeeccCcccccccCcccCCCCeEeC---CcEEcCCCCccCCCC---------CC-----------CCCccccCCCC-
Confidence            7899976554555   335664433322   469999999987621         11           24566655322 


Q ss_pred             ccCCCCCCeEeeCCCCeeeeCCCC-cccCCCCCCCCCCcCCC
Q psy11799        433 VAVCPKPGTCINTMGSFRCLCPRG-FKLDRTGTFCLDQDECT  473 (483)
Q Consensus       433 ~~~C~~~g~C~~~~g~~~C~C~~G-~~~~~~g~~C~~~~~C~  473 (483)
                      . .|...|+|.=.    +|+|... |    .|..|+.-..|.
T Consensus       594 ~-iCSGrG~C~Cg----~C~C~~~~~----sG~~CE~cptc~  626 (783)
T KOG1226|consen  594 Q-ICSGRGTCECG----RCKCTDPPY----SGEFCEKCPTCP  626 (783)
T ss_pred             c-eeCCCceeeCC----ceEcCCCCc----CcchhhcCCCCC
Confidence            2 68888888644    3888877 8    788888544443


No 20 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=97.66  E-value=3.8e-05  Score=43.61  Aligned_cols=24  Identities=63%  Similarity=1.370  Sum_probs=22.1

Q ss_pred             CeEEeCCCCceeCCCCCcccccCc
Q psy11799        211 SFRCICPYGYALAPDGRHCIDINE  234 (483)
Q Consensus       211 s~~C~C~~G~~~~~~g~~C~~~~~  234 (483)
                      +|+|+|++||.+.++++.|++|++
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccccCCC
Confidence            589999999999999999999875


No 21 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.61  E-value=3.9e-05  Score=48.70  Aligned_cols=29  Identities=48%  Similarity=1.098  Sum_probs=23.1

Q ss_pred             CCCCCeEeeCCCceEeeCCCCCeeCCCCCcc
Q psy11799        139 CPKPGTCINTMGSFRCLCPRGFKLDRTGTFC  169 (483)
Q Consensus       139 C~~~g~C~~~~g~~~C~C~~G~~g~~~g~~C  169 (483)
                      |+.+++|+++.++|.|+|++||.|+  |..|
T Consensus         8 C~~nA~C~~~~~~~~C~C~~Gy~Gd--G~~C   36 (36)
T PF12947_consen    8 CHPNATCTNTGGSYTCTCKPGYEGD--GFFC   36 (36)
T ss_dssp             S-TTCEEEE-TTSEEEEE-CEEECC--STCE
T ss_pred             CCCCcEeecCCCCEEeECCCCCccC--CcCC
Confidence            9999999999999999999999987  5543


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.59  E-value=9.8e-05  Score=47.94  Aligned_cols=38  Identities=50%  Similarity=1.157  Sum_probs=29.9

Q ss_pred             ccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCC
Q psy11799        423 IDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCL  467 (483)
Q Consensus       423 ~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~  467 (483)
                      +++|...    .+|.++++|++..++|+|.|++||.   .|..|+
T Consensus         2 ~~~C~~~----~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C~   39 (39)
T smart00179        2 IDECASG----NPCQNGGTCVNTVGSYRCECPPGYT---DGRNCE   39 (39)
T ss_pred             cccCcCC----CCcCCCCEeECCCCCeEeECCCCCc---cCCcCC
Confidence            5667642    2899999999999999999999994   266653


No 23 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.58  E-value=4e-05  Score=47.43  Aligned_cols=26  Identities=38%  Similarity=1.005  Sum_probs=23.7

Q ss_pred             CCCCCCeEeeCC-CCeeeeCCCCcccC
Q psy11799        435 VCPKPGTCINTM-GSFRCLCPRGFKLD  460 (483)
Q Consensus       435 ~C~~~g~C~~~~-g~~~C~C~~G~~~~  460 (483)
                      +|.++|+|++.. ++|+|+|++||+|.
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            899999999988 99999999999553


No 24 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.50  E-value=6.2e-05  Score=47.77  Aligned_cols=30  Identities=47%  Similarity=1.025  Sum_probs=23.6

Q ss_pred             CCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799        435 VCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC  466 (483)
Q Consensus       435 ~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C  466 (483)
                      .|+.+|+|+++.++|+|+|++||.|+  |..|
T Consensus         7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd--G~~C   36 (36)
T PF12947_consen    7 GCHPNATCTNTGGSYTCTCKPGYEGD--GFFC   36 (36)
T ss_dssp             GS-TTCEEEE-TTSEEEEE-CEEECC--STCE
T ss_pred             CCCCCcEeecCCCCEEeECCCCCccC--CcCC
Confidence            79999999999999999999999886  5543


No 25 
>KOG1836|consensus
Probab=97.47  E-value=0.0023  Score=74.40  Aligned_cols=57  Identities=25%  Similarity=0.524  Sum_probs=34.2

Q ss_pred             CCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccccccccCCCCccCCCCCC----eEeeCCCCeeeeCCCCc
Q psy11799        382 SGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDIDECKEEINGVAVCPKPG----TCINTMGSFRCLCPRGF  457 (483)
Q Consensus       382 ~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~~~~C~~~g----~C~~~~g~~~C~C~~G~  457 (483)
                      ....|.|.+|-+|.+.++|        ..++++.....|..   |        .|...|    +|....|  +|.|.++|
T Consensus       957 ~tGqc~c~~gVtgqrc~qc--------~~~~~~~~~~gc~~---c--------~c~~~Gs~~~qc~~~~G--~c~c~~~~ 1015 (1705)
T KOG1836|consen  957 GTGQCYCRPGVTGQRCDQC--------ETYHFGFQTEGCGL---C--------ECDPLGSRGFQCDPEDG--QCPCRPGF 1015 (1705)
T ss_pred             cCCceeeecCccccccCcc--------ccCcccccccCCcc---e--------ecccCCcccceecccCC--eeeecCCC
Confidence            5678999999988876654        44444433222322   2        233333    6766555  58888888


Q ss_pred             cc
Q psy11799        458 KL  459 (483)
Q Consensus       458 ~~  459 (483)
                      .|
T Consensus      1016 ~g 1017 (1705)
T KOG1836|consen 1016 EG 1017 (1705)
T ss_pred             CC
Confidence            43


No 26 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.40  E-value=0.00019  Score=62.40  Aligned_cols=138  Identities=28%  Similarity=0.700  Sum_probs=85.5

Q ss_pred             CCCCeeEeCCCCceeCCCCCccccCCcCCC-----CCCCCCCCCCCCCCCEEeeCC-----CCeeEECCCCCeeccCCCc
Q psy11799        290 LLGGYKCGCPEGYVRHPFSNECIDDNECAR-----NPCEPSSNTTGERGGKCFNTA-----GSYRCGCPDGFQFEAALQL  359 (483)
Q Consensus       290 ~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~-----~~C~~~~~~~c~~~g~C~~~~-----g~~~C~C~~G~~~~~~g~~  359 (483)
                      ....|.|.|.+||.... +.+|+...+|..     .+|..        -+.|++..     ..|.|.|.+||.+.  ...
T Consensus        16 MSNHfEC~Cnegfvl~~-EntCE~kv~C~~~e~~~K~Cgd--------ya~C~~~~~~~~~~~~~C~C~~gY~~~--~~v   84 (197)
T PF06247_consen   16 MSNHFECKCNEGFVLKN-ENTCEEKVECDKLENVNKPCGD--------YAKCINQANKGEERAYKCDCINGYILK--QGV   84 (197)
T ss_dssp             ESSEEEEEESTTEEEEE-TTEEEE----SG-GGTTSEEET--------TEEEEE-SSTTSSTSEEEEE-TTEEES--SSS
T ss_pred             ccCceEEEcCCCcEEcc-ccccccceecCcccccCccccc--------hhhhhcCCCcccceeEEEecccCceee--CCe
Confidence            34568999999999864 567877777754     25554        88998876     46999999999884  346


Q ss_pred             eeeCCCccCCCCCCCC-cEECCC--CCeeEeCCCCeeecCCcceecccCCCCCCCcccCCCCCcccccccccCCCCccCC
Q psy11799        360 CTQLSSSCLSSPCSFG-CTPLGI--SGFSCECPSGYQRIGQGHCLATISSGLYPGFRSYERPNLGDIDECKEEINGVAVC  436 (483)
Q Consensus       360 C~~~~~~C~~~~C~~~-c~~~~~--~~~~C~C~~Gy~g~~~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~~~~C  436 (483)
                      |...  .|....|..| |..++.  ...+|+|.-|+.-...+.|.-..                  -..|+.      -|
T Consensus        85 Cvp~--~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G------------------~T~C~L------KC  138 (197)
T PF06247_consen   85 CVPN--KCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTG------------------ETKCSL------KC  138 (197)
T ss_dssp             EEEG--GGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------------------------------
T ss_pred             Echh--hcCceecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCC------------------ccceee------ec
Confidence            7644  4777777777 775542  34599999999844444443211                  124544      47


Q ss_pred             CCCCeEeeCCCCeeeeCCCCcccCCCCC
Q psy11799        437 PKPGTCINTMGSFRCLCPRGFKLDRTGT  464 (483)
Q Consensus       437 ~~~g~C~~~~g~~~C~C~~G~~~~~~g~  464 (483)
                      ..+-+|....+-|+|+|.+||.++..+.
T Consensus       139 k~nE~CK~~~~~Y~C~~~~~~~~~~~~~  166 (197)
T PF06247_consen  139 KENEECKLVDGYYKCVCKEGFPGDGEGE  166 (197)
T ss_dssp             TTTEEEEEETTEEEEEE-TT-EEETTT-
T ss_pred             CCCcceeeeCcEEEeecCCCCCCCCCcc
Confidence            7788999999999999999998776554


No 27 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.24  E-value=0.0002  Score=44.35  Aligned_cols=30  Identities=43%  Similarity=1.270  Sum_probs=25.3

Q ss_pred             CCCCCCCCCCCCCCCCCCEEeeCC-CCeeEECCCCCeec
Q psy11799        317 CARNPCEPSSNTTGERGGKCFNTA-GSYRCGCPDGFQFE  354 (483)
Q Consensus       317 C~~~~C~~~~~~~c~~~g~C~~~~-g~~~C~C~~G~~~~  354 (483)
                      |..++|.+        +|+|++.. ++|.|.|++||.|.
T Consensus         1 C~~~~C~n--------~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    1 CSSNPCQN--------GGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             TTTTSSTT--------TEEEEEESTSEEEEEEBTTEEST
T ss_pred             CCCCcCCC--------CeEEEeCCCCCEEeECCCCCccC
Confidence            44567887        99999999 99999999999653


No 28 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.07  E-value=0.00086  Score=42.94  Aligned_cols=28  Identities=57%  Similarity=1.373  Sum_probs=24.7

Q ss_pred             CCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799        435 VCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC  466 (483)
Q Consensus       435 ~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C  466 (483)
                      +|.+++.|++..++|+|.|++||    .|..|
T Consensus        10 ~C~~~~~C~~~~~~~~C~C~~g~----~g~~C   37 (38)
T cd00054          10 PCQNGGTCVNTVGSYRCSCPPGY----TGRNC   37 (38)
T ss_pred             CcCCCCEeECCCCCeEeECCCCC----cCCcC
Confidence            79889999999999999999999    55555


No 29 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.04  E-value=0.0012  Score=57.61  Aligned_cols=138  Identities=25%  Similarity=0.619  Sum_probs=83.6

Q ss_pred             CeEEeCCCCeEEeCCCCceeCCCCCcccccCcccC-----CCCCCCCeeeeCC-----CCceEeCCCCceecCCCCCccc
Q psy11799        203 FRCHNVPGSFRCICPYGYALAPDGRHCIDINECEQ-----FPGLCAHTCTNTE-----GSYTCGCHPGFELSSDGSGTCL  272 (483)
Q Consensus       203 ~~C~~~~gs~~C~C~~G~~~~~~g~~C~~~~~C~~-----~~~~~~~~C~~~~-----g~~~C~C~~G~~~~~~g~~~C~  272 (483)
                      |.-+.....|.|.|.+||.+. +..+|+...+|..     .+|...+.|++..     ..|.|.|.+||.+...   .|.
T Consensus        11 G~LiQMSNHfEC~Cnegfvl~-~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~---vCv   86 (197)
T PF06247_consen   11 GYLIQMSNHFECKCNEGFVLK-NENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG---VCV   86 (197)
T ss_dssp             EEEEEESSEEEEEESTTEEEE-ETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS---SEE
T ss_pred             CEEEEccCceEEEcCCCcEEc-cccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC---eEc
Confidence            566777788999999999875 3567876666653     4566678898765     5789999999997532   455


Q ss_pred             cCccccCCCCccc-ccc----cCCCCeeEeCCCCceeCCCCCcccc--CCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeE
Q psy11799        273 DSDECATGRHICQ-QIC----NLLGGYKCGCPEGYVRHPFSNECID--DNECARNPCEPSSNTTGERGGKCFNTAGSYRC  345 (483)
Q Consensus       273 ~~~~C~~~~~~c~-~~C----~~~g~~~C~C~~G~~g~~~~~~C~~--~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C  345 (483)
                      . ..|...  .|. +.|    .......|.|.-|++ ......|..  ..+|.. .|..        +..|....+-|+|
T Consensus        87 p-~~C~~~--~Cg~GKCI~d~~~~~~~~CSC~IGkV-~~dn~kCtk~G~T~C~L-KCk~--------nE~CK~~~~~Y~C  153 (197)
T PF06247_consen   87 P-NKCNNK--DCGSGKCILDPDNPNNPTCSCNIGKV-PDDNKKCTKTGETKCSL-KCKE--------NEECKLVDGYYKC  153 (197)
T ss_dssp             E-GGGSS-----TTEEEEEEEGGGSEEEEEE-TEEE-TTTTTESEEEE---------TT--------TEEEEEETTEEEE
T ss_pred             h-hhcCce--ecCCCeEEecCCCCCCceeEeeeceE-eccCCcccCCCccceee-ecCC--------CcceeeeCcEEEe
Confidence            3 345443  343 666    333355999999999 333344432  122322 3333        7899999999999


Q ss_pred             ECCCCCeeccCC
Q psy11799        346 GCPDGFQFEAAL  357 (483)
Q Consensus       346 ~C~~G~~~~~~g  357 (483)
                      .+.+||.++..+
T Consensus       154 ~~~~~~~~~~~~  165 (197)
T PF06247_consen  154 VCKEGFPGDGEG  165 (197)
T ss_dssp             EE-TT-EEETTT
T ss_pred             ecCCCCCCCCCc
Confidence            999999876544


No 30 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.01  E-value=0.00068  Score=42.96  Aligned_cols=28  Identities=50%  Similarity=1.163  Sum_probs=23.2

Q ss_pred             CCeEeeCCCceEeeCCCCCeeCCCCCcc
Q psy11799        142 PGTCINTMGSFRCLCPRGFKLDRTGTFC  169 (483)
Q Consensus       142 ~g~C~~~~g~~~C~C~~G~~g~~~g~~C  169 (483)
                      ...|++.+++|+|.|++||++..+++.|
T Consensus         9 ~h~C~~~~g~~~C~C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen    9 SHICVNTPGSYRCSCPPGYKLAEDGRTC   36 (36)
T ss_dssp             SSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred             CCCCccCCCceEeECCCCCEECcCCCCC
Confidence            5699999999999999999999888754


No 31 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.99  E-value=0.0012  Score=42.75  Aligned_cols=31  Identities=61%  Similarity=1.426  Sum_probs=26.5

Q ss_pred             CCcCCC-CCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCe
Q psy11799        314 DNECAR-NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ  352 (483)
Q Consensus       314 ~~~C~~-~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~  352 (483)
                      +++|.. .+|.+        +++|++..++|.|.|++||.
T Consensus         2 ~~~C~~~~~C~~--------~~~C~~~~g~~~C~C~~g~~   33 (39)
T smart00179        2 IDECASGNPCQN--------GGTCVNTVGSYRCECPPGYT   33 (39)
T ss_pred             cccCcCCCCcCC--------CCEeECCCCCeEeECCCCCc
Confidence            466776 67877        88999999999999999995


No 32 
>KOG1226|consensus
Probab=96.94  E-value=0.0024  Score=67.10  Aligned_cols=94  Identities=14%  Similarity=0.154  Sum_probs=53.4

Q ss_pred             ceeeeccCcceeeecccCcceeeeecccccc--CceEEEeeCceeEEEEEEEEEEEEEeecccCc----ccccc-ccccc
Q psy11799         57 ASVSTRKGLTGVSVTRDTSYTMAILNVEQDK--QGILIFSMLQTRYFNLFYVIIFYVISLKKKKF----YSVFL-DMDEC  129 (483)
Q Consensus        57 ~~C~c~~g~~g~~C~~~~~~~~~~~~~~~~~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~C~-~~~~C  129 (483)
                      +.|.|.+||.|..|+++..-.........|.  ++...+ .+.+...|       ..+.|....+    |..|+ |...|
T Consensus       478 G~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vC-SgrG~C~C-------GqC~C~~~~~~~i~G~fCECDnfsC  549 (783)
T KOG1226|consen  478 GQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVC-SGRGDCVC-------GQCVCHKPDNGKIYGKFCECDNFSC  549 (783)
T ss_pred             cceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCc-CCCCcEeC-------CceEecCCCCCceeeeeeeccCccc
Confidence            6999999999999999755433311111121  111000 11111111       1223444444    77787 55566


Q ss_pred             ccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCC
Q psy11799        130 KEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDR  164 (483)
Q Consensus       130 ~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~  164 (483)
                      ...  ....|..+|+|.=.    +|+|.+||+|..
T Consensus       550 ~r~--~g~lC~g~G~C~CG----~CvC~~GwtG~~  578 (783)
T KOG1226|consen  550 ERH--KGVLCGGHGRCECG----RCVCNPGWTGSA  578 (783)
T ss_pred             ccc--cCcccCCCCeEeCC----cEEcCCCCccCC
Confidence            653  33349999999765    599999999543


No 33 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.30  E-value=0.0066  Score=38.03  Aligned_cols=26  Identities=54%  Similarity=1.269  Sum_probs=23.3

Q ss_pred             CCCCCCeEeeCCCCeeeeCCCCcccC
Q psy11799        435 VCPKPGTCINTMGSFRCLCPRGFKLD  460 (483)
Q Consensus       435 ~C~~~g~C~~~~g~~~C~C~~G~~~~  460 (483)
                      +|.++++|++..++|+|.|+.||.+.
T Consensus         7 ~C~~~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053           7 PCSNGGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCCCCCEEecCCCCeEeECCCCCccc
Confidence            78899999999999999999999543


No 34 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.24  E-value=0.0073  Score=37.96  Aligned_cols=24  Identities=50%  Similarity=1.108  Sum_probs=21.8

Q ss_pred             CCCCCCeEeeCCCCeeeeCCCCccc
Q psy11799        435 VCPKPGTCINTMGSFRCLCPRGFKL  459 (483)
Q Consensus       435 ~C~~~g~C~~~~g~~~C~C~~G~~~  459 (483)
                      +|.++ +|++..++|+|.|++||.+
T Consensus         7 ~C~~~-~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181        7 PCSNG-TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             CCCCC-EEECCCCCeEeECCCCCcc
Confidence            78888 9999999999999999954


No 35 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.23  E-value=0.0075  Score=38.40  Aligned_cols=32  Identities=59%  Similarity=1.340  Sum_probs=25.9

Q ss_pred             CCcCCC-CCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCee
Q psy11799        314 DNECAR-NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQF  353 (483)
Q Consensus       314 ~~~C~~-~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~  353 (483)
                      +++|.. .+|.+        ++.|++..++|.|.|++||.|
T Consensus         2 ~~~C~~~~~C~~--------~~~C~~~~~~~~C~C~~g~~g   34 (38)
T cd00054           2 IDECASGNPCQN--------GGTCVNTVGSYRCSCPPGYTG   34 (38)
T ss_pred             cccCCCCCCcCC--------CCEeECCCCCeEeECCCCCcC
Confidence            455665 57766        789999999999999999954


No 36 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.94  E-value=0.0017  Score=31.18  Aligned_cols=13  Identities=23%  Similarity=0.409  Sum_probs=8.1

Q ss_pred             EEEeCCCCeeeEe
Q psy11799         32 VWSLASQWTGEIL   44 (483)
Q Consensus        32 ~C~C~~g~~G~~C   44 (483)
                      +|+|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4777777777654


No 37 
>KOG1836|consensus
Probab=95.10  E-value=0.24  Score=58.34  Aligned_cols=67  Identities=27%  Similarity=0.545  Sum_probs=36.2

Q ss_pred             chhccCCCCCCCeEEeCCCCeEE-eCCCCceeCCCCCcccccCcccCCCCCCCCeeeeCC--CCceEe-CCCCcee
Q psy11799        192 NECLELSNQCAFRCHNVPGSFRC-ICPYGYALAPDGRHCIDINECEQFPGLCAHTCTNTE--GSYTCG-CHPGFEL  263 (483)
Q Consensus       192 ~~C~~~~~~C~~~C~~~~gs~~C-~C~~G~~~~~~g~~C~~~~~C~~~~~~~~~~C~~~~--g~~~C~-C~~G~~~  263 (483)
                      +.|.+....|.  |......-+| +|..||...++...   .+.|..-+|..++.|..+.  ....|. |++||+|
T Consensus       738 ~~Cd~~tG~C~--C~~~t~G~~C~~C~~GfYg~~~~~~---~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG  808 (1705)
T KOG1836|consen  738 NICDPRTGQCK--CKHNTFGGQCAQCVDGFYGLPDLGT---SGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTG  808 (1705)
T ss_pred             ccccCCCCcee--cccCCCCCchhhhcCCCCCccccCC---CCCCccCCCCCChhhcCcCcccceecCCCCCCCcc
Confidence            44444444332  4444444455 58888874332211   1225555555566665543  445688 9999983


No 38 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.08  E-value=0.04  Score=34.28  Aligned_cols=22  Identities=59%  Similarity=1.340  Sum_probs=19.9

Q ss_pred             CCEEeeCCCCeeEECCCCCeec
Q psy11799        333 GGKCFNTAGSYRCGCPDGFQFE  354 (483)
Q Consensus       333 ~g~C~~~~g~~~C~C~~G~~~~  354 (483)
                      ++.|++..++|.|.|+.||.+.
T Consensus        11 ~~~C~~~~~~~~C~C~~g~~g~   32 (36)
T cd00053          11 GGTCVNTPGSYRCVCPPGYTGD   32 (36)
T ss_pred             CCEEecCCCCeEeECCCCCccc
Confidence            7899999999999999999653


No 39 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.94  E-value=0.032  Score=34.30  Aligned_cols=26  Identities=42%  Similarity=1.125  Sum_probs=21.0

Q ss_pred             CCCCCCeEeeCCCCeeeeCCCCcccCCCCCCC
Q psy11799        435 VCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC  466 (483)
Q Consensus       435 ~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C  466 (483)
                      .|.++|+|+...  .+|+|.+||    .|..|
T Consensus         7 ~C~~~G~C~~~~--g~C~C~~g~----~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSPC--GRCVCDSGY----TGPDC   32 (32)
T ss_pred             ccCCCCEEeCCC--CEEECCCCC----cCCCC
Confidence            699999999763  479999999    56554


No 40 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=94.82  E-value=0.05  Score=34.01  Aligned_cols=20  Identities=55%  Similarity=1.315  Sum_probs=18.5

Q ss_pred             CCEEeeCCCCeeEECCCCCee
Q psy11799        333 GGKCFNTAGSYRCGCPDGFQF  353 (483)
Q Consensus       333 ~g~C~~~~g~~~C~C~~G~~~  353 (483)
                      + .|++..++|.|.|++||.+
T Consensus        11 ~-~C~~~~~~~~C~C~~g~~g   30 (35)
T smart00181       11 G-TCINTPGSYTCSCPPGYTG   30 (35)
T ss_pred             C-EEECCCCCeEeECCCCCcc
Confidence            6 9999999999999999965


No 41 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.58  E-value=0.048  Score=33.55  Aligned_cols=23  Identities=39%  Similarity=1.091  Sum_probs=19.4

Q ss_pred             CCCCCeEeeCCCceEeeCCCCCeeC
Q psy11799        139 CPKPGTCINTMGSFRCLCPRGFKLD  163 (483)
Q Consensus       139 C~~~g~C~~~~g~~~C~C~~G~~g~  163 (483)
                      |+++|+|+...+  +|+|.+||+|.
T Consensus         8 C~~~G~C~~~~g--~C~C~~g~~G~   30 (32)
T PF07974_consen    8 CSGHGTCVSPCG--RCVCDSGYTGP   30 (32)
T ss_pred             cCCCCEEeCCCC--EEECCCCCcCC
Confidence            999999997633  89999999953


No 42 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.99  E-value=0.06  Score=50.16  Aligned_cols=42  Identities=38%  Similarity=0.819  Sum_probs=34.6

Q ss_pred             ccccccchhccCCCCCCCeEEeCCCCeEEeCCCCceeCCCCC
Q psy11799        186 QTCEDINECLELSNQCAFRCHNVPGSFRCICPYGYALAPDGR  227 (483)
Q Consensus       186 ~~C~~~~~C~~~~~~C~~~C~~~~gs~~C~C~~G~~~~~~g~  227 (483)
                      ..|.++++|...++.|...|.+..|+|.|.|.+||++..+++
T Consensus       182 ~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~  223 (224)
T cd01475         182 KICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNK  223 (224)
T ss_pred             ccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCC
Confidence            456677778777777888999999999999999999766654


No 43 
>smart00051 DSL delta serrate ligand.
Probab=92.93  E-value=0.09  Score=38.07  Aligned_cols=40  Identities=13%  Similarity=0.063  Sum_probs=30.3

Q ss_pred             ceEEEeCCCCeeeEee-ecCCC-------CcccCCceeeeccCcceeee
Q psy11799         30 IFVWSLASQWTGEILT-NVSPY-------PTSVNMASVSTRKGLTGVSV   70 (483)
Q Consensus        30 ~~~C~C~~g~~G~~C~-~c~~~-------~~~~~~~~C~c~~g~~g~~C   70 (483)
                      .+.-.|.++|.|..|+ .|.+.       .|.. .+.+.|.+||+|..|
T Consensus        16 ~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~-~G~~~C~~Gw~G~~C   63 (63)
T smart00051       16 QIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDE-NGNKGCLEGWMGPYC   63 (63)
T ss_pred             EEEeeCCCCCcCCccCCEeCcCccccCCccCCc-CCCEecCCCCcCCCC
Confidence            3568899999999998 36542       1433 468999999999876


No 44 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.94  E-value=0.18  Score=46.98  Aligned_cols=42  Identities=31%  Similarity=0.641  Sum_probs=34.1

Q ss_pred             ccccccccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCeeCCCC
Q psy11799        120 YSVFLDMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTG  166 (483)
Q Consensus       120 ~~~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~g  166 (483)
                      +..|.++++|... ++.  |.  ..|.++.|+|.|.|++||++.+.+
T Consensus       181 ~~~C~~~~~C~~~-~~~--c~--~~C~~~~g~~~c~c~~g~~~~~~~  222 (224)
T cd01475         181 GKICVVPDLCATL-SHV--CQ--QVCISTPGSYLCACTEGYALLEDN  222 (224)
T ss_pred             cccCcCchhhcCC-CCC--cc--ceEEcCCCCEEeECCCCccCCCCC
Confidence            5678889999865 544  83  589999999999999999876554


No 45 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.35  E-value=0.23  Score=31.40  Aligned_cols=25  Identities=36%  Similarity=0.754  Sum_probs=19.7

Q ss_pred             CCCCCeEeeCC-CceEeeCCCCCeeC
Q psy11799        139 CPKPGTCINTM-GSFRCLCPRGFKLD  163 (483)
Q Consensus       139 C~~~g~C~~~~-g~~~C~C~~G~~g~  163 (483)
                      |..++.|++.. |++.|+|..||..+
T Consensus         7 cP~NA~C~~~~dG~eecrCllgyk~~   32 (37)
T PF12946_consen    7 CPANAGCFRYDDGSEECRCLLGYKKV   32 (37)
T ss_dssp             --TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred             CCCCcccEEcCCCCEEEEeeCCcccc
Confidence            88899999885 99999999999854


No 46 
>KOG1218|consensus
Probab=86.20  E-value=29  Score=33.74  Aligned_cols=15  Identities=20%  Similarity=0.527  Sum_probs=11.8

Q ss_pred             CCeeEeCCCCeeecC
Q psy11799        382 SGFSCECPSGYQRIG  396 (483)
Q Consensus       382 ~~~~C~C~~Gy~g~~  396 (483)
                      ....|.|++||.+..
T Consensus       160 ~~~~c~c~~g~~g~~  174 (316)
T KOG1218|consen  160 KNGICTCQPGFVGVF  174 (316)
T ss_pred             CCCceeccCCccccc
Confidence            456788999998875


No 47 
>smart00051 DSL delta serrate ligand.
Probab=85.12  E-value=1.2  Score=32.31  Aligned_cols=44  Identities=20%  Similarity=0.413  Sum_probs=30.3

Q ss_pred             eeEeCCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCee
Q psy11799        294 YKCGCPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQF  353 (483)
Q Consensus       294 ~~C~C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~  353 (483)
                      +.-.|.++|.|..|.+.|...+...     .        +.+|.. .|  .++|.+||.|
T Consensus        17 ~rv~C~~~~yG~~C~~~C~~~~d~~-----~--------~~~Cd~-~G--~~~C~~Gw~G   60 (63)
T smart00051       17 IRVTCDENYYGEGCNKFCRPRDDFF-----G--------HYTCDE-NG--NKGCLEGWMG   60 (63)
T ss_pred             EEeeCCCCCcCCccCCEeCcCcccc-----C--------CccCCc-CC--CEecCCCCcC
Confidence            4456899999998887776544332     2        456654 34  7899999954


No 48 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=82.94  E-value=1.1  Score=28.28  Aligned_cols=22  Identities=32%  Similarity=0.726  Sum_probs=18.4

Q ss_pred             CCCEEeeCC-CCeeEECCCCCee
Q psy11799        332 RGGKCFNTA-GSYRCGCPDGFQF  353 (483)
Q Consensus       332 ~~g~C~~~~-g~~~C~C~~G~~~  353 (483)
                      .++.|++.. |++.|.|..||..
T Consensus         9 ~NA~C~~~~dG~eecrCllgyk~   31 (37)
T PF12946_consen    9 ANAGCFRYDDGSEECRCLLGYKK   31 (37)
T ss_dssp             TTEEEEEETTSEEEEEE-TTEEE
T ss_pred             CCcccEEcCCCCEEEEeeCCccc
Confidence            388999877 8999999999976


No 49 
>KOG3512|consensus
Probab=79.19  E-value=7.6  Score=39.27  Aligned_cols=112  Identities=20%  Similarity=0.292  Sum_probs=59.0

Q ss_pred             CCEEee----CCCCeeEECCCCCeeccCCCceeeCCCccCCCCCCCC----cEECCCCCeeEeCCCCeeecCCcceeccc
Q psy11799        333 GGKCFN----TAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFG----CTPLGISGFSCECPSGYQRIGQGHCLATI  404 (483)
Q Consensus       333 ~g~C~~----~~g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~----c~~~~~~~~~C~C~~Gy~g~~~g~C~~~~  404 (483)
                      +|+|++    +.|.+-=.|.+||.-+.... -.... .|..-.|+..    =++.. .+..|.|.+|-+|....      
T Consensus       358 ggvClnCrHnTaGrhChyCreGyyRd~s~p-l~hrk-aCk~CdChpVGs~gktCNq-~tGqCpCkeGvtG~tCn------  428 (592)
T KOG3512|consen  358 GGVCLNCRHNTAGRHCHYCREGYYRDGSKP-LTHRK-ACKACDCHPVGSAGKTCNQ-TTGQCPCKEGVTGLTCN------  428 (592)
T ss_pred             cceEeecccCCCCcccccccCccccCCCCC-Cchhh-hhhhcCCcccccccccccc-cCCcccCCCCCcccccc------
Confidence            456654    33433335899996542211 11111 3444455542    23333 66789999999987643      


Q ss_pred             CCCCCCCcccCCCCCcccccccccCC-CCccCCCCCCeEeeCCCCeeeeCCCCcccCCCCCCCC
Q psy11799        405 SSGLYPGFRSYERPNLGDIDECKEEI-NGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFCL  467 (483)
Q Consensus       405 ~~~C~~g~~g~~~~~C~~~~~C~~~~-~~~~~C~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~  467 (483)
                        .|.+||.....+++    .|...+ ..+..++++.+    ...+.+.|+.++    .++.+.
T Consensus       429 --rCa~gyqqsrs~va----pcik~p~~~~~~~~s~ve----~qd~~s~Ck~~~----~~~r~n  478 (592)
T KOG3512|consen  429 --RCAPGYQQSRSPVA----PCIKIPTDAPTLGSSGVE----PQDQCSKCKASP----GGKRLN  478 (592)
T ss_pred             --cccchhhcccCCCc----CceecCCCCccccCCCCc----chhccccCCCCC----cceecc
Confidence              46778876553322    222111 11124555554    233457888888    555554


No 50 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=78.09  E-value=2.5  Score=26.08  Aligned_cols=25  Identities=36%  Similarity=0.733  Sum_probs=16.6

Q ss_pred             CCCCCCcEECCCCCeeEeCCCCeeecC
Q psy11799        370 SPCSFGCTPLGISGFSCECPSGYQRIG  396 (483)
Q Consensus       370 ~~C~~~c~~~~~~~~~C~C~~Gy~g~~  396 (483)
                      ..|...|...  ..+.|.||+||..+.
T Consensus         6 t~CpA~CDpn--~~~~C~CPeGyIlde   30 (34)
T PF09064_consen    6 TECPADCDPN--SPGQCFCPEGYILDE   30 (34)
T ss_pred             ccCCCccCCC--CCCceeCCCceEecC
Confidence            3344434332  567999999998765


No 51 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=76.97  E-value=0.34  Score=35.04  Aligned_cols=39  Identities=13%  Similarity=0.107  Sum_probs=19.1

Q ss_pred             eEEEeCCCCeeeEeee-cCCCC-------cccCCceeeeccCcceeee
Q psy11799         31 FVWSLASQWTGEILTN-VSPYP-------TSVNMASVSTRKGLTGVSV   70 (483)
Q Consensus        31 ~~C~C~~g~~G~~C~~-c~~~~-------~~~~~~~C~c~~g~~g~~C   70 (483)
                      +...|...|.|..|++ |.+.-       |.. .+.=.|.+||+|..|
T Consensus        17 ~rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~-~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYYGPNCSKFCKPRDDSFGHYTCDS-NGNKVCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEETTTT-EE---EEETTEEEEE-S-S--EEE-TTEESTTS
T ss_pred             EEEECCCCCCCccccCCcCCCcCCcCCcccCC-CCCCCCCCCCcCCCC
Confidence            5578999999999994 66541       332 246678999999876


No 52 
>KOG1218|consensus
Probab=75.60  E-value=62  Score=31.34  Aligned_cols=16  Identities=25%  Similarity=0.439  Sum_probs=11.2

Q ss_pred             CeeEeCCCCceeCCCC
Q psy11799        293 GYKCGCPEGYVRHPFS  308 (483)
Q Consensus       293 ~~~C~C~~G~~g~~~~  308 (483)
                      ...|.+..+|.+..+.
T Consensus        48 ~~~~~~~~~~~~~~c~   63 (316)
T KOG1218|consen   48 SGECGLGYGFVGSVCR   63 (316)
T ss_pred             ceeEecccccCCCccc
Confidence            4467778888877654


No 53 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=75.01  E-value=3  Score=34.29  Aligned_cols=41  Identities=29%  Similarity=0.703  Sum_probs=28.1

Q ss_pred             cccccccCCCCccCCCCCCeEee--CCCCeeeeCCCCcccCCCCCCCCCC
Q psy11799        422 DIDECKEEINGVAVCPKPGTCIN--TMGSFRCLCPRGFKLDRTGTFCLDQ  469 (483)
Q Consensus       422 ~~~~C~~~~~~~~~C~~~g~C~~--~~g~~~C~C~~G~~~~~~g~~C~~~  469 (483)
                      ++..|...-.  +-|.+| +|.-  ....+.|.|..||    .|..|+..
T Consensus        41 ~i~~Cp~ey~--~YClHG-~C~yI~dl~~~~CrC~~GY----tGeRCEh~   83 (139)
T PHA03099         41 AIRLCGPEGD--GYCLHG-DCIHARDIDGMYCRCSHGY----TGIRCQHV   83 (139)
T ss_pred             ccccCChhhC--CEeECC-EEEeeccCCCceeECCCCc----ccccccce
Confidence            3455544321  268764 8973  4467899999999    88888854


No 54 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=72.69  E-value=1.1  Score=30.43  Aligned_cols=21  Identities=14%  Similarity=-0.026  Sum_probs=12.5

Q ss_pred             CceEEEeCCCCeeeEeeecCC
Q psy11799         29 GIFVWSLASQWTGEILTNVSP   49 (483)
Q Consensus        29 g~~~C~C~~g~~G~~C~~c~~   49 (483)
                      ...+|.|+++|+|+.|+.|.+
T Consensus        16 ~~G~C~C~~~~~G~~C~~C~~   36 (49)
T PF00053_consen   16 STGQCVCKPGTTGPRCDQCKP   36 (49)
T ss_dssp             TCEEESBSTTEESTTS-EE-T
T ss_pred             CCCEEeccccccCCcCcCCCC
Confidence            456677777777777766553


No 55 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=71.96  E-value=2.6  Score=28.25  Aligned_cols=20  Identities=10%  Similarity=-0.071  Sum_probs=13.7

Q ss_pred             CceEEEeCCCCeeeEeeecC
Q psy11799         29 GIFVWSLASQWTGEILTNVS   48 (483)
Q Consensus        29 g~~~C~C~~g~~G~~C~~c~   48 (483)
                      ...+|.|+++|+|+.|+.|.
T Consensus        16 ~~G~C~C~~~~~G~~C~~C~   35 (46)
T smart00180       16 DTGQCECKPNVTGRRCDRCA   35 (46)
T ss_pred             CCCEEECCCCCCCCCCCcCC
Confidence            35677777777777776554


No 56 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=69.57  E-value=2.9  Score=28.55  Aligned_cols=20  Identities=15%  Similarity=0.022  Sum_probs=12.2

Q ss_pred             CceEEEeCCCCeeeEeeecC
Q psy11799         29 GIFVWSLASQWTGEILTNVS   48 (483)
Q Consensus        29 g~~~C~C~~g~~G~~C~~c~   48 (483)
                      ...+|.|+++|+|..|+.|.
T Consensus        17 ~~G~C~C~~~~~G~~C~~C~   36 (50)
T cd00055          17 GTGQCECKPNTTGRRCDRCA   36 (50)
T ss_pred             CCCEEeCCCcCCCCCCCCCC
Confidence            44566666666666665543


No 57 
>PHA02887 EGF-like protein; Provisional
Probab=68.91  E-value=5.4  Score=32.27  Aligned_cols=30  Identities=30%  Similarity=0.841  Sum_probs=23.3

Q ss_pred             CCCCCCeEe--eCCCCeeeeCCCCcccCCCCCCCCCC
Q psy11799        435 VCPKPGTCI--NTMGSFRCLCPRGFKLDRTGTFCLDQ  469 (483)
Q Consensus       435 ~C~~~g~C~--~~~g~~~C~C~~G~~~~~~g~~C~~~  469 (483)
                      -|. +|+|.  .....+.|.|+.||    +|..|+.+
T Consensus        93 YCi-HG~C~yI~dL~epsCrC~~GY----tG~RCE~v  124 (126)
T PHA02887         93 FCI-NGECMNIIDLDEKFCICNKGY----TGIRCDEV  124 (126)
T ss_pred             Eee-CCEEEccccCCCceeECCCCc----ccCCCCcc
Confidence            577 57997  34456899999999    78888754


No 58 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=57.51  E-value=9.2  Score=30.97  Aligned_cols=32  Identities=34%  Similarity=0.786  Sum_probs=23.8

Q ss_pred             cccccccCCCCccCCCCCCeEeeCCCceEeeCCCCCe
Q psy11799        125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK  161 (483)
Q Consensus       125 ~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~  161 (483)
                      ..+.|..  .+.  |..+|.|.. ..+..|.|.+||.
T Consensus        76 p~d~Cd~--y~~--CG~~g~C~~-~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   76 PKDQCDV--YGF--CGPNGICNS-NNSPKCSCLPGFE  107 (110)
T ss_pred             cccCCCC--ccc--cCCccEeCC-CCCCceECCCCcC
Confidence            3467775  344  999999964 3456799999996


No 59 
>PHA02887 EGF-like protein; Provisional
Probab=55.35  E-value=5.9  Score=32.04  Aligned_cols=17  Identities=12%  Similarity=-0.074  Sum_probs=14.1

Q ss_pred             ceEEEeCCCCeeeEeee
Q psy11799         30 IFVWSLASQWTGEILTN   46 (483)
Q Consensus        30 ~~~C~C~~g~~G~~C~~   46 (483)
                      ...|.|..||+|.+|+.
T Consensus       107 epsCrC~~GYtG~RCE~  123 (126)
T PHA02887        107 EKFCICNKGYTGIRCDE  123 (126)
T ss_pred             CceeECCCCcccCCCCc
Confidence            37899999999998873


No 60 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=53.00  E-value=38  Score=23.01  Aligned_cols=17  Identities=47%  Similarity=1.165  Sum_probs=14.0

Q ss_pred             CCEEeeCCCCeeEECCCCCee
Q psy11799        333 GGKCFNTAGSYRCGCPDGFQF  353 (483)
Q Consensus       333 ~g~C~~~~g~~~C~C~~G~~~  353 (483)
                      ++.|++.    +|.|++||..
T Consensus        31 ~s~C~~g----~C~C~~g~~~   47 (52)
T PF01683_consen   31 GSVCVNG----RCQCPPGYVE   47 (52)
T ss_pred             cCEEcCC----EeECCCCCEe
Confidence            7788664    9999999965


No 61 
>PF01724 DUF29:  Domain of unknown function DUF29;  InterPro: IPR002636 This entry is represented by Ralstonia phage RSS1, Orf11. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of various hypothetical proteins from cyanobacteria, none of which are functionally described. The aligned region is approximately 120-140 amino acids long corresponding to almost the entire length of the proteins in the family.; PDB: 3FCN_A.
Probab=49.43  E-value=7.7  Score=33.06  Aligned_cols=19  Identities=37%  Similarity=0.576  Sum_probs=15.4

Q ss_pred             ccccccccchhHHHHHhhh
Q psy11799          3 LLKARKLETFDRELLIELL   21 (483)
Q Consensus         3 ~~~~~~~~~~~~~~~~~~~   21 (483)
                      |||+|++..+|.|+|+|+.
T Consensus        16 lLr~~~~~~lD~enLiEEi   34 (139)
T PF01724_consen   16 LLREGDFDALDWENLIEEI   34 (139)
T ss_dssp             HHHHSSSTT--HHHHHHHH
T ss_pred             HHHcCCChHhhHHHHHHHH
Confidence            7999999999999998875


No 62 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=48.12  E-value=17  Score=30.01  Aligned_cols=36  Identities=28%  Similarity=0.704  Sum_probs=23.1

Q ss_pred             cccccccCCCCccCCCCCCeEeeC--CCceEeeCCCCCeeC
Q psy11799        125 DMDECKEEINGVAVCPKPGTCINT--MGSFRCLCPRGFKLD  163 (483)
Q Consensus       125 ~~~~C~~~~~~~~~C~~~g~C~~~--~g~~~C~C~~G~~g~  163 (483)
                      ++.+|.....+-  |- ||+|.-.  ...+.|.|..||+|.
T Consensus        41 ~i~~Cp~ey~~Y--Cl-HG~C~yI~dl~~~~CrC~~GYtGe   78 (139)
T PHA03099         41 AIRLCGPEGDGY--CL-HGDCIHARDIDGMYCRCSHGYTGI   78 (139)
T ss_pred             ccccCChhhCCE--eE-CCEEEeeccCCCceeECCCCcccc
Confidence            345565432233  63 4588754  467899999999944


No 63 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=42.39  E-value=20  Score=28.98  Aligned_cols=23  Identities=35%  Similarity=0.810  Sum_probs=18.9

Q ss_pred             CCCCCCeEeeCCCCeeeeCCCCcc
Q psy11799        435 VCPKPGTCINTMGSFRCLCPRGFK  458 (483)
Q Consensus       435 ~C~~~g~C~~~~g~~~C~C~~G~~  458 (483)
                      .|+..|.|.. ..+..|.|.+||+
T Consensus        85 ~CG~~g~C~~-~~~~~C~Cl~GF~  107 (110)
T PF00954_consen   85 FCGPNGICNS-NNSPKCSCLPGFE  107 (110)
T ss_pred             ccCCccEeCC-CCCCceECCCCcC
Confidence            7999999963 4566799999994


No 64 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=39.38  E-value=21  Score=28.54  Aligned_cols=44  Identities=27%  Similarity=0.696  Sum_probs=29.0

Q ss_pred             cccccCCCCccCCCCCCeEeeCC-----CCeeeeCCCCcccC---------CCCCCCCCCc
Q psy11799        424 DECKEEINGVAVCPKPGTCINTM-----GSFRCLCPRGFKLD---------RTGTFCLDQD  470 (483)
Q Consensus       424 ~~C~~~~~~~~~C~~~g~C~~~~-----g~~~C~C~~G~~~~---------~~g~~C~~~~  470 (483)
                      +.|....+   .|..||.|++..     .=|.|.|.+.+...         ..|..|+..|
T Consensus         6 ~aC~~~Tn---~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkD   63 (103)
T PF12955_consen    6 DACENATN---NCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKD   63 (103)
T ss_pred             HHHHHhcc---CCCCCceEeeccCCCccceEEEEeeccccccccccCceeeeccccccccc
Confidence            44554433   799999999762     34999999965332         2566776544


No 65 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=38.69  E-value=23  Score=28.26  Aligned_cols=33  Identities=24%  Similarity=0.720  Sum_probs=23.9

Q ss_pred             cccccCCCCccCCCCCCeEeeCC-----CceEeeCCCCCee
Q psy11799        127 DECKEEINGVAVCPKPGTCINTM-----GSFRCLCPRGFKL  162 (483)
Q Consensus       127 ~~C~~~~~~~~~C~~~g~C~~~~-----g~~~C~C~~G~~g  162 (483)
                      +.|... .+.  |+.+|.|+...     .=|.|.|.+.+..
T Consensus         6 ~aC~~~-Tn~--CsgHG~C~~~~~~~~~~C~~C~C~~T~~~   43 (103)
T PF12955_consen    6 DACENA-TNN--CSGHGSCVKKYGSGGGDCFACKCKPTVVK   43 (103)
T ss_pred             HHHHHh-ccC--CCCCceEeeccCCCccceEEEEeeccccc
Confidence            445544 555  99999999872     3499999996653


No 66 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=36.86  E-value=2.5e+02  Score=31.52  Aligned_cols=120  Identities=23%  Similarity=0.571  Sum_probs=58.3

Q ss_pred             eEe-CCCCceecCCCCCccccCccccCCCCcccccccCCCCeeEeCCCCceeCCCCCccccCCcCCCC--CCCCCCCCCC
Q psy11799        254 TCG-CHPGFELSSDGSGTCLDSDECATGRHICQQICNLLGGYKCGCPEGYVRHPFSNECIDDNECARN--PCEPSSNTTG  330 (483)
Q Consensus       254 ~C~-C~~G~~~~~~g~~~C~~~~~C~~~~~~c~~~C~~~g~~~C~C~~G~~g~~~~~~C~~~~~C~~~--~C~~~~~~~c  330 (483)
                      .|. |..+|.+...+. .|+.  +|.... . .+. ...+..+|.|..||.....+..|.....|...  -|..     |
T Consensus       647 ~C~~C~~~~~ls~~~~-~C~~--~C~~~~-~-~~~-~~~~~~~C~C~~g~~p~~~~~~C~~~~~C~~~~~gC~~-----C  715 (800)
T PTZ00214        647 FCTECAGELFVSLDGQ-SCLE--ECTGDK-V-VGE-VSGGVRRCWCERGFLPALDRSGCVLPTECPPDMPSCAA-----C  715 (800)
T ss_pred             CcCcCCCCceecCcCC-chhh--cCCCcc-c-ccc-ccCCcceeEecCCcccccCCCccccccCCCcccccccc-----c
Confidence            464 777776655444 4543  332110 0 000 00124589999999977766666654445321  1211     1


Q ss_pred             CCCCEEeeCCCCeeEECCCCCeeccCCCceeeCCCccCCCCCCCCcEECCCCCeeEeCCCCeeecCCcceecc
Q psy11799        331 ERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSFGCTPLGISGFSCECPSGYQRIGQGHCLAT  403 (483)
Q Consensus       331 ~~~g~C~~~~g~~~C~C~~G~~~~~~g~~C~~~~~~C~~~~C~~~c~~~~~~~~~C~C~~Gy~g~~~g~C~~~  403 (483)
                      ...++|+      .|. ..++.....++.|..   .     |..+-  . .....|+|..||... .+.|++.
T Consensus       716 ~~~g~C~------~C~-~~~~~vq~~~~~C~~---~-----C~~~~--~-~~~~vC~C~~g~~l~-~~~c~~~  769 (800)
T PTZ00214        716 DESGRCL------LCV-TSGHNVQVDQRTCAE---G-----CGARA--S-SNQGVCMCELDAVLT-KGVCVPA  769 (800)
T ss_pred             CCCCcee------ecc-ccCcccccCCCcccc---C-----CCCCc--c-ccCCeEEeCCcceec-CCeeEec
Confidence            1244554      232 223322224444422   1     11111  1 256799999999764 4567653


No 67 
>KOG3516|consensus
Probab=35.24  E-value=29  Score=39.46  Aligned_cols=40  Identities=30%  Similarity=0.782  Sum_probs=32.6

Q ss_pred             cccccccccCCCCccCCCCCCeEeeCCCCeeeeCC-CCcccCCCCCCCCC
Q psy11799        420 LGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCP-RGFKLDRTGTFCLD  468 (483)
Q Consensus       420 C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~-~G~~~~~~g~~C~~  468 (483)
                      |.-++.|..+     +|.++|.|.-.-..|.|.|. .||    .|.+|..
T Consensus       542 C~i~drClPN-----~CehgG~C~Qs~~~f~C~C~~TGY----~GatCHt  582 (1306)
T KOG3516|consen  542 CGISDRCLPN-----PCEHGGKCSQSWDDFECNCELTGY----KGATCHT  582 (1306)
T ss_pred             cccccccCCc-----cccCCCcccccccceeEecccccc----ccccccC
Confidence            5555666654     99999999987788999999 999    7888864


No 68 
>KOG3512|consensus
Probab=34.09  E-value=1.5e+02  Score=30.43  Aligned_cols=53  Identities=32%  Similarity=0.755  Sum_probs=31.3

Q ss_pred             eeEe-CCCCceeCCCCCccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECCCCCee
Q psy11799        294 YKCG-CPEGYVRHPFSNECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQF  353 (483)
Q Consensus       294 ~~C~-C~~G~~g~~~~~~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~~G~~~  353 (483)
                      ..|. |++||..+.-. .-.+...|..-.|++    ....+.+|..+.|  +|.|.+|-+|
T Consensus       371 rhChyCreGyyRd~s~-pl~hrkaCk~CdChp----VGs~gktCNq~tG--qCpCkeGvtG  424 (592)
T KOG3512|consen  371 RHCHYCREGYYRDGSK-PLTHRKACKACDCHP----VGSAGKTCNQTTG--QCPCKEGVTG  424 (592)
T ss_pred             cccccccCccccCCCC-CCchhhhhhhcCCcc----cccccccccccCC--cccCCCCCcc
Confidence            3465 99999876432 222333454444554    2223567877766  8999998833


No 69 
>KOG3516|consensus
Probab=33.72  E-value=34  Score=38.96  Aligned_cols=41  Identities=34%  Similarity=0.881  Sum_probs=34.0

Q ss_pred             ccccCCcCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECC-CCCeeccCCCceee
Q psy11799        310 ECIDDNECARNPCEPSSNTTGERGGKCFNTAGSYRCGCP-DGFQFEAALQLCTQ  362 (483)
Q Consensus       310 ~C~~~~~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~-~G~~~~~~g~~C~~  362 (483)
                      .|...+.|.+++|++        +|.|......|.|.|. .||    .|.+|..
T Consensus       541 ~C~i~drClPN~Ceh--------gG~C~Qs~~~f~C~C~~TGY----~GatCHt  582 (1306)
T KOG3516|consen  541 MCGISDRCLPNPCEH--------GGKCSQSWDDFECNCELTGY----KGATCHT  582 (1306)
T ss_pred             ccccccccCCccccC--------CCcccccccceeEecccccc----ccccccC
Confidence            355677889999999        9999998899999998 899    5666653


No 70 
>KOG0196|consensus
Probab=30.35  E-value=54  Score=36.04  Aligned_cols=58  Identities=24%  Similarity=0.564  Sum_probs=37.3

Q ss_pred             eeEeCCCCeeecC-CcceecccCCCCCCCcccCCCCCcccccccccCCCCccCCCCCCeEeeCCCCeeeeCCCCcc
Q psy11799        384 FSCECPSGYQRIG-QGHCLATISSGLYPGFRSYERPNLGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK  458 (483)
Q Consensus       384 ~~C~C~~Gy~g~~-~g~C~~~~~~~C~~g~~g~~~~~C~~~~~C~~~~~~~~~C~~~g~C~~~~g~~~C~C~~G~~  458 (483)
                      ..|.|.+||.-.. ...|.     .|++|+.-..    .....|.       +|+.+..= ...++-.|.|..||+
T Consensus       259 G~C~C~aGye~~~~~~~C~-----aCp~G~yK~~----~~~~~C~-------~CP~~S~s-~~ega~~C~C~~gyy  317 (996)
T KOG0196|consen  259 GGCVCKAGYEEAENGKACQ-----ACPPGTYKAS----QGDSLCL-------PCPPNSHS-SSEGATSCTCENGYY  317 (996)
T ss_pred             CceeecCCCCcccCCCcce-----eCCCCcccCC----CCCCCCC-------CCCCCCCC-CCCCCCcccccCCcc
Confidence            4699999998632 33343     6888876433    1223343       67666552 356778899999995


No 71 
>KOG3514|consensus
Probab=28.90  E-value=35  Score=38.38  Aligned_cols=35  Identities=37%  Similarity=0.973  Sum_probs=29.7

Q ss_pred             cCCCCCCCCCCCCCCCCCCEEeeCCCCeeEECC-CCCeeccCCCceee
Q psy11799        316 ECARNPCEPSSNTTGERGGKCFNTAGSYRCGCP-DGFQFEAALQLCTQ  362 (483)
Q Consensus       316 ~C~~~~C~~~~~~~c~~~g~C~~~~g~~~C~C~-~G~~~~~~g~~C~~  362 (483)
                      .|..+||+|        +|+|....+.|.|.|. .+|    .|+.|++
T Consensus       625 ~C~~nPC~N--------~g~C~egwNrfiCDCs~T~~----~G~~Cer  660 (1591)
T KOG3514|consen  625 ICESNPCQN--------GGKCSEGWNRFICDCSGTGF----EGRTCER  660 (1591)
T ss_pred             ccCCCcccC--------CCCccccccccccccccCcc----cCccccc
Confidence            588889998        9999999999999997 688    6677763


No 72 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=25.40  E-value=24  Score=24.48  Aligned_cols=25  Identities=24%  Similarity=0.272  Sum_probs=13.7

Q ss_pred             cccccC-CCceEEEeCCCCeeeEeee
Q psy11799         22 FLINML-AGIFVWSLASQWTGEILTN   46 (483)
Q Consensus        22 ~~~~~~-~g~~~C~C~~g~~G~~C~~   46 (483)
                      ||..+. .|...|.|..-|.|++|++
T Consensus        26 flDg~~~dG~p~CECn~Cy~GpdCS~   51 (56)
T PF04863_consen   26 FLDGLIADGSPVCECNSCYGGPDCST   51 (56)
T ss_dssp             -TTS-EETTEE--EE-TTEESTTS-E
T ss_pred             eeccccccCCccccccCCcCCCCccc
Confidence            444444 4557799999999998874


No 73 
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=24.21  E-value=3.1e+02  Score=30.75  Aligned_cols=25  Identities=36%  Similarity=0.719  Sum_probs=18.4

Q ss_pred             eEEeCCCCceeCCCCCcccccCccc
Q psy11799        212 FRCICPYGYALAPDGRHCIDINECE  236 (483)
Q Consensus       212 ~~C~C~~G~~~~~~g~~C~~~~~C~  236 (483)
                      -+|+|..||....++..|+....|.
T Consensus       682 ~~C~C~~g~~p~~~~~~C~~~~~C~  706 (800)
T PTZ00214        682 RRCWCERGFLPALDRSGCVLPTECP  706 (800)
T ss_pred             ceeEecCCcccccCCCccccccCCC
Confidence            4789999998777787886555554


No 74 
>PF01826 TIL:  Trypsin Inhibitor like cysteine rich domain;  InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are:  chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster  Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=23.44  E-value=60  Score=22.33  Aligned_cols=18  Identities=44%  Similarity=1.176  Sum_probs=15.1

Q ss_pred             EeCCCCeeecCCcceecc
Q psy11799        386 CECPSGYQRIGQGHCLAT  403 (483)
Q Consensus       386 C~C~~Gy~g~~~g~C~~~  403 (483)
                      |.|++||..+..++||..
T Consensus        35 C~C~~G~v~~~~~~CV~~   52 (55)
T PF01826_consen   35 CFCPPGYVRNDNGRCVPP   52 (55)
T ss_dssp             EEETTTEEEETTSEEEEG
T ss_pred             CCCCCCeeEcCCCCEEcH
Confidence            999999998877788764


Done!