Query         psy2060
Match_columns 193
No_of_seqs    194 out of 2072
Neff          8.8 
Searched_HMMs 46136
Date          Fri Aug 16 23:12:15 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2060.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2060hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 5.9E-28 1.3E-32  218.3  14.5  160   20-189  1217-1389(2531)
  2 KOG4289|consensus               99.5 1.5E-12 3.2E-17  119.9  14.5   97   73-179  1502-1604(2531)
  3 KOG1219|consensus               99.4 1.3E-13 2.8E-18  130.5   5.6   88   16-113  3877-3978(4289)
  4 KOG1219|consensus               99.2 2.4E-10 5.1E-15  109.2  11.2  122   51-190  3631-3756(4289)
  5 PF00054 Laminin_G_1:  Laminin   98.9 3.8E-09 8.2E-14   76.3   7.7   50  141-193     1-50  (131)
  6 KOG3516|consensus               98.9 2.2E-08 4.7E-13   92.0  13.9  108   74-191   957-1088(1306)
  7 KOG3514|consensus               98.9 1.6E-08 3.6E-13   91.8  10.7  109   64-181   616-729 (1591)
  8 KOG3514|consensus               98.9 1.9E-08 4.1E-13   91.5  10.9  104   74-188   211-323 (1591)
  9 cd00110 LamG Laminin G domain;  98.8 4.6E-08   1E-12   71.5  10.3   73  116-191     2-75  (151)
 10 KOG3516|consensus               98.8 1.7E-07 3.8E-12   86.3  13.3  128   48-188   726-860 (1306)
 11 smart00282 LamG Laminin G doma  98.4   2E-06 4.4E-11   61.9   8.5   53  135-190     3-55  (135)
 12 KOG1225|consensus               98.2 1.6E-06 3.5E-11   75.3   4.9   70   26-113   266-342 (525)
 13 PF00008 EGF:  EGF-like domain   98.1 1.8E-06 3.9E-11   46.8   1.4   28   75-103     1-29  (32)
 14 PF02210 Laminin_G_2:  Laminin   98.0 3.5E-05 7.6E-10   54.3   8.0   48  141-191     1-49  (128)
 15 PF07645 EGF_CA:  Calcium-bindi  97.6 3.5E-05 7.5E-10   44.4   2.1   28   74-103     4-34  (42)
 16 KOG1214|consensus               97.5 8.2E-05 1.8E-09   67.2   4.1   55   50-111   806-863 (1289)
 17 PF12661 hEGF:  Human growth fa  97.4 4.5E-05 9.7E-10   32.6   0.3   13   54-66      1-13  (13)
 18 KOG1217|consensus               97.4 0.00037   8E-09   59.7   5.8   57   47-111   246-306 (487)
 19 smart00179 EGF_CA Calcium-bind  97.3 0.00037   8E-09   38.8   3.4   33   74-111     4-38  (39)
 20 KOG1225|consensus               97.3 0.00041 8.8E-09   60.7   5.1   72   26-113   235-311 (525)
 21 KOG4260|consensus               97.3  0.0003 6.4E-09   56.0   3.7   75   26-104   130-269 (350)
 22 cd00054 EGF_CA Calcium-binding  97.2 0.00047   1E-08   37.9   3.1   33   74-112     4-38  (38)
 23 KOG1217|consensus               97.1 0.00097 2.1E-08   57.1   5.9   83   21-113   106-207 (487)
 24 cd00053 EGF Epidermal growth f  97.0  0.0012 2.5E-08   35.7   3.3   25   77-103     6-30  (36)
 25 smart00181 EGF Epidermal growt  96.8  0.0018   4E-08   35.1   3.2   29   77-112     6-35  (35)
 26 KOG1214|consensus               96.7  0.0026 5.6E-08   57.9   5.1   77   18-105   686-768 (1289)
 27 PF12947 EGF_3:  EGF domain;  I  96.3  0.0019 4.2E-08   35.7   1.1   31   77-111     6-36  (36)
 28 PF00008 EGF:  EGF-like domain   96.2 0.00094   2E-08   36.0  -0.6   23   43-65      7-32  (32)
 29 PF14670 FXa_inhibition:  Coagu  95.8  0.0054 1.2E-07   33.9   1.5   26   84-111    11-36  (36)
 30 smart00179 EGF_CA Calcium-bind  95.8  0.0082 1.8E-07   33.1   2.1   21   47-67     18-39  (39)
 31 KOG1836|consensus               95.7   0.016 3.5E-07   57.3   5.1   84   27-116   697-816 (1705)
 32 smart00051 DSL delta serrate l  95.6   0.014 3.1E-07   36.5   3.0   37   28-66     21-63  (63)
 33 PF12662 cEGF:  Complement Clr-  95.6   0.009   2E-07   29.9   1.6   16   52-67      1-20  (24)
 34 cd00054 EGF_CA Calcium-binding  95.1   0.018 3.8E-07   31.3   2.1   21   47-67     18-38  (38)
 35 PF12662 cEGF:  Complement Clr-  95.1   0.015 3.2E-07   29.1   1.5   21   92-113     1-21  (24)
 36 smart00181 EGF Epidermal growt  95.1   0.019 4.2E-07   30.9   2.1   20   48-67     15-35  (35)
 37 KOG1226|consensus               95.0   0.037 7.9E-07   50.2   4.6   51   53-113   566-621 (783)
 38 PF14670 FXa_inhibition:  Coagu  94.9  0.0096 2.1E-07   32.9   0.5   16   47-62     13-28  (36)
 39 cd00053 EGF Epidermal growth f  94.1   0.044 9.5E-07   29.1   2.1   20   47-66     15-35  (36)
 40 KOG0994|consensus               94.1   0.093   2E-06   49.8   5.2   83   28-118   846-954 (1758)
 41 PHA03099 epidermal growth fact  94.0   0.043 9.2E-07   39.0   2.3   39   25-69     42-83  (139)
 42 PF07645 EGF_CA:  Calcium-bindi  93.8   0.015 3.3E-07   33.2  -0.2   16   47-62     19-34  (42)
 43 PHA02887 EGF-like protein; Pro  93.5   0.053 1.2E-06   37.8   2.0   23   47-69    102-124 (126)
 44 PF07974 EGF_2:  EGF-like domai  93.4   0.085 1.8E-06   28.3   2.3   27   77-111     6-32  (32)
 45 PHA03099 epidermal growth fact  92.9   0.083 1.8E-06   37.5   2.3   33   77-115    51-84  (139)
 46 smart00051 DSL delta serrate l  92.9    0.13 2.8E-06   32.2   2.9   46   52-111    16-63  (63)
 47 PHA02887 EGF-like protein; Pro  92.9   0.093   2E-06   36.7   2.4   32   77-114    92-124 (126)
 48 KOG1226|consensus               92.7    0.18 3.9E-06   45.9   4.6   67   27-103   480-575 (783)
 49 PF07974 EGF_2:  EGF-like domai  91.6    0.13 2.8E-06   27.6   1.5   15   52-66     18-32  (32)
 50 KOG4260|consensus               91.4    0.22 4.7E-06   40.1   3.3   46   57-113   132-183 (350)
 51 KOG3509|consensus               90.6    0.41   9E-06   45.1   4.7  121   57-189   161-307 (964)
 52 PF01414 DSL:  Delta serrate li  88.6    0.15 3.4E-06   31.8   0.3   37   28-66     21-63  (63)
 53 KOG1836|consensus               87.2     1.2 2.5E-05   44.9   5.4   50   22-71    754-816 (1705)
 54 PF12946 EGF_MSP1_1:  MSP1 EGF   87.0    0.34 7.3E-06   26.8   1.0   34   75-111     2-36  (37)
 55 PF06247 Plasmod_Pvs28:  Plasmo  84.0    0.38 8.3E-06   36.6   0.4   52   49-104    16-80  (197)
 56 cd01475 vWA_Matrilin VWA_Matri  83.6     1.4 3.1E-05   34.3   3.5   35   63-104   181-218 (224)
 57 PF06247 Plasmod_Pvs28:  Plasmo  83.1    0.56 1.2E-05   35.7   0.9   76   22-103    17-119 (197)
 58 KOG0994|consensus               83.1    0.81 1.8E-05   43.8   2.1   20   23-44    932-952 (1758)
 59 PF00954 S_locus_glycop:  S-loc  77.1     2.5 5.3E-05   29.2   2.6   28   73-103    78-107 (110)
 60 PF00053 Laminin_EGF:  Laminin   74.3     1.6 3.4E-05   25.4   0.9   18   51-68     16-33  (49)
 61 cd00055 EGF_Lam Laminin-type e  73.4     2.5 5.5E-05   24.8   1.6   16   52-67     18-33  (50)
 62 KOG1218|consensus               73.1     8.1 0.00018   31.4   5.1   38   50-89    159-198 (316)
 63 smart00180 EGF_Lam Laminin-typ  73.0     2.7 5.9E-05   24.2   1.7   16   52-67     17-32  (46)
 64 PF01683 EB:  EB module;  Inter  73.0     5.7 0.00012   23.3   3.1   21   77-103    26-46  (52)
 65 KOG3509|consensus               69.6      26 0.00056   33.6   7.9   39  139-179   572-612 (964)
 66 cd01475 vWA_Matrilin VWA_Matri  63.6     6.1 0.00013   30.8   2.4   17   47-63    202-218 (224)
 67 KOG0196|consensus               63.0      11 0.00023   35.4   4.0   13   91-104   306-318 (996)
 68 PF04863 EGF_alliinase:  Alliin  62.2     2.6 5.6E-05   25.4   0.0   23   49-71     32-54  (56)
 69 PF09064 Tme5_EGF_like:  Thromb  46.7      13 0.00029   20.0   1.2   12   51-62     16-27  (34)
 70 PF12955 DUF3844:  Domain of un  38.3      15 0.00032   25.3   0.7   26   74-100     7-39  (103)
 71 PF02973 Sialidase:  Sialidase,  30.9 2.5E+02  0.0054   21.6   9.1   50  132-181    32-83  (190)
 72 PF08300 HCV_NS5a_1a:  Hepatiti  28.3      16 0.00036   22.5  -0.3   13   24-36      2-14  (62)
 73 TIGR01492 CPW_WPC Plasmodium f  27.5      58  0.0013   20.0   2.1   16   52-68     20-35  (62)
 74 PF12599 DUF3768:  Protein of u  26.0      97  0.0021   20.5   3.0   34  157-192    34-67  (84)
 75 smart00274 FOLN Follistatin-N-  25.3      86  0.0019   15.7   2.1   21   75-95      2-23  (26)
 76 smart00564 PQQ beta-propeller   24.6 1.1E+02  0.0023   15.3   3.8   19  158-178    14-32  (33)
 77 cd01328 FSL_SPARC Follistatin-  23.9      99  0.0021   20.5   2.8   25   74-99      1-26  (86)
 78 PHA03399 pif3 per os infectivi  23.8      57  0.0012   25.3   1.8   10   53-62    159-168 (200)
 79 COG4939 Major membrane immunog  23.4 1.2E+02  0.0025   21.8   3.2   34  157-191    44-77  (147)
 80 PF07546 EMI:  EMI domain;  Int  22.3      83  0.0018   19.8   2.1    6   30-35     66-71  (72)
 81 PHA02151 hypothetical protein   21.9 1.5E+02  0.0032   22.2   3.6   31  145-177   140-172 (217)

No 1  
>KOG4289|consensus
Probab=99.95  E-value=5.9e-28  Score=218.31  Aligned_cols=160  Identities=36%  Similarity=0.648  Sum_probs=142.7

Q ss_pred             CCCCCcee-cCCCCccCCCCcceeec---------CC--ccCCCCceeeCCCCCCCCcccccCCCCCCC-CCCCCCcEEe
Q psy2060          20 GPFQPYLS-GLPDNEGLEKRVQTFVD---------GG--HRTLDLLHDMGARYHWGDTCETDITKSDCK-GVCSPGATCS   86 (193)
Q Consensus        20 ~~~~~~~c-C~~g~~g~~~~c~~~~~---------gg--~~~~~~~~C~C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~   86 (193)
                      .|...+.| |||||+|..  ||+.+|         +|  ..-.++|.|.|.+||+|.+||.+.....|. ..|.|||+|+
T Consensus      1217 ~pvnglrCrCPpGFTgd~--CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~ 1294 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDY--CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCV 1294 (2531)
T ss_pred             cccCceeEeCCCCCCccc--ccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEe
Confidence            45667789 999999998  999888         33  455789999999999999999887557899 9999999999


Q ss_pred             eCCCCceecccCCCCCCccccccccceecccCCCcceEEecCCCCCceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEE
Q psy2060          87 LNPSGGYICDDCSLAGSYEHYDRLCRLRARSFPKNSFLTFPALKQRNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALE  166 (193)
Q Consensus        87 ~~~~~~~~C~~C~~g~~~~~~g~~C~~~~~sF~g~syl~~~~~~~~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~  166 (193)
                      +...++|.|+ ||.+ ..+  +++|+..+.+|.+.+|+.+.....+.++.++|.|.|...+|+|+|+|  ++++||++|+
T Consensus      1295 ~~~nggf~c~-Cp~g-e~e--~prC~v~trSFp~~sfv~frglrqRfh~TlslsfaT~~~nGlL~ynG--nekhDFvale 1368 (2531)
T KOG4289|consen 1295 NLLNGGFCCH-CPYG-EFE--DPRCEVTTRSFPPESFVTFRGLRQRFHFTLSLSFATIERNGLLLYNG--NEKHDFVALE 1368 (2531)
T ss_pred             ecCCCceecc-CCCc-ccC--CCceEEEeeccCchheEEEeccccceEEEEEEEEEEeeecceEEecC--CcccceEeee
Confidence            9887999999 9987 333  99999999999999999999999999999999999999999999999  7899999999


Q ss_pred             ecCCeeEEEEEEeCCCceEEEEc
Q psy2060         167 LTDSGRSVQFSWSLGSEVARLNR  189 (193)
Q Consensus       167 l~~G~~~l~~~~~~g~g~~~l~s  189 (193)
                      ++++.  |+++|+.|...+++..
T Consensus      1369 vVd~q--vqltfS~Ges~t~v~p 1389 (2531)
T KOG4289|consen 1369 VVDEQ--VQLTFSAGESTTTVSP 1389 (2531)
T ss_pred             eeeee--EEEEEecccccceecC
Confidence            99999  9999999966555543


No 2  
>KOG4289|consensus
Probab=99.46  E-value=1.5e-12  Score=119.87  Aligned_cols=97  Identities=26%  Similarity=0.496  Sum_probs=76.9

Q ss_pred             CCCC-CCCCCCcEEeeCCCCceecccCCCCCCccccccccceecc---cCCCcceEEecCCCC--CceeEEEEEEEeCCC
Q psy2060          73 SDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLRAR---SFPKNSFLTFPALKQ--RNRLHIKLQFSTLHD  146 (193)
Q Consensus        73 ~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~~~---sF~g~syl~~~~~~~--~~~~~i~~~f~T~~~  146 (193)
                      +-|. .+|.|+|+|++.+ ++|.|+ ||.++.    |..|+..+.   -|.|.|-+.+.....  .-.+.++|.|||+..
T Consensus      1502 ~fCdsg~C~n~g~CvnrW-g~~~C~-CP~~fg----gk~c~~~m~~pq~frG~sl~sw~~~~~~vSvPwylsl~FRTr~a 1575 (2531)
T KOG4289|consen 1502 NFCDSGQCSNGGTCVNRW-GGFSCE-CPLGFG----GKGCCQGMAHPQHFRGHSLVSWEGLPSQVSVPWYLSLMFRTRRA 1575 (2531)
T ss_pred             cccCCCccCCCCeeeccc-CcEeec-CccccC----CcchhhccCCchhccccceeeecCCCcceecceEEEEEEEeecc
Confidence            4567 8999999999998 999999 999998    899988653   588888888765443  334678999999999


Q ss_pred             CeEEEEeecCCCCCceEEEEecCCeeEEEEEEe
Q psy2060         147 SGLLLYNGRYNERHDFIALELTDSGRSVQFSWS  179 (193)
Q Consensus       147 ~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~  179 (193)
                      +++||-... .+.+ -+.|+|.+|.  |++...
T Consensus      1576 d~vl~~~~~-~~rs-t~~lqld~g~--l~~~v~ 1604 (2531)
T KOG4289|consen 1576 DGVLMQAEF-GGRS-TYNLQLDDGT--LKYNVG 1604 (2531)
T ss_pred             ccEEEEEEe-CCCc-eEEEEEcCCE--EEEEec
Confidence            999997642 2333 4889999999  776543


No 3  
>KOG1219|consensus
Probab=99.43  E-value=1.3e-13  Score=130.54  Aligned_cols=88  Identities=23%  Similarity=0.425  Sum_probs=77.7

Q ss_pred             eccCCCCCCcee-cCCCCccCCCCcceeec---------CC--ccCCCCceeeCCCCCCCCccccc-CCCCCCC-CCCCC
Q psy2060          16 DLTNGPFQPYLS-GLPDNEGLEKRVQTFVD---------GG--HRTLDLLHDMGARYHWGDTCETD-ITKSDCK-GVCSP   81 (193)
Q Consensus        16 ~~~~~~~~~~~c-C~~g~~g~~~~c~~~~~---------gg--~~~~~~~~C~C~~g~~G~~C~~~-~~~~~C~-~pC~~   81 (193)
                      .+...|.+.|.| |++.|+|++  ||.+++         ||  .-..++|.|.||.||||.+||.. ++  +|. ++|.+
T Consensus      3877 ~C~~~~~ggy~CkCpsqysG~~--CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~--eCs~n~C~~ 3952 (4289)
T KOG1219|consen 3877 TCISQPKGGYKCKCPSQYSGNH--CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGIS--ECSKNVCGT 3952 (4289)
T ss_pred             EecCCCCCceEEeCcccccCcc--cccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeeccccc--ccccccccC
Confidence            345667888999 999999998  998776         66  45568999999999999999988 66  999 99999


Q ss_pred             CcEEeeCCCCceecccCCCCCCccccccccce
Q psy2060          82 GATCSLNPSGGYICDDCSLAGSYEHYDRLCRL  113 (193)
Q Consensus        82 gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~  113 (193)
                      ||.|++.+ ++|.|. |.+++.    |+.|..
T Consensus      3953 gg~C~n~~-gsf~Cn-cT~g~~----gr~c~~ 3978 (4289)
T KOG1219|consen 3953 GGQCINIP-GSFHCN-CTPGIL----GRTCCA 3978 (4289)
T ss_pred             CceeeccC-CceEec-cChhHh----cccCcc
Confidence            99999999 999999 999998    888865


No 4  
>KOG1219|consensus
Probab=99.16  E-value=2.4e-10  Score=109.23  Aligned_cols=122  Identities=20%  Similarity=0.425  Sum_probs=101.3

Q ss_pred             CCceeeCCCCCCCCcccccCCCCCCC-CCCCCCcEEeeCC-CCceecccCCCCCCccccccccce-ecccCCCcceEEec
Q psy2060          51 DLLHDMGARYHWGDTCETDITKSDCK-GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRL-RARSFPKNSFLTFP  127 (193)
Q Consensus        51 ~~~~C~C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~-~~~sF~g~syl~~~  127 (193)
                      ....|.|+.||    |+..-  +.|. .||..+..|++.. ...|+|. ||.|.     -..|+- ...++.|.||..++
T Consensus      3631 r~a~ClC~~G~----Cp~~~--~~C~~~pcp~~~~Cvs~~~~~~~~cV-cP~gr-----~g~C~g~~elS~tGnSYveyr 3698 (4289)
T KOG1219|consen 3631 RTAACLCNRGF----CPVET--NQCAKSPCPAGNLCVSSVHNSTYTCV-CPIGR-----FGFCQGDFELSSTGNSYVEYR 3698 (4289)
T ss_pred             ccceeeecCCc----CCccc--CccccCCCcccCcccccccccceeEe-ccCcc-----cccCCCcceEeecCceeEEEE
Confidence            45789999998    88774  4999 9999999999875 4679999 99874     345655 35788899999997


Q ss_pred             CCCC-CceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcC
Q psy2060         128 ALKQ-RNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRA  190 (193)
Q Consensus       128 ~~~~-~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~  190 (193)
                      .... ...+.+.|+++|..++|++||..    +.|++.|.|.+|.  +++.|+.|+|+..+.+.
T Consensus      3699 lse~~n~~~kl~frLkT~~sngIiM~tr----~~d~~iLkLv~G~--~~l~~~cgsG~Givg~q 3756 (4289)
T KOG1219|consen 3699 LSENQNTRMKLGFRLKTLQSNGIIMYTR----KTDLAILKLVGGS--PQLLADCGSGPGIVGSQ 3756 (4289)
T ss_pred             cccccccceEEEEEEEecccCcEEEEEc----CCceEEEEecCCc--EEEEEecCCCCCccccc
Confidence            5544 22378999999999999999973    6799999999999  99999999998877765


No 5  
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=98.95  E-value=3.8e-09  Score=76.34  Aligned_cols=50  Identities=46%  Similarity=0.743  Sum_probs=45.1

Q ss_pred             EEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCCCC
Q psy2060         141 FSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRADSI  193 (193)
Q Consensus       141 f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~~i  193 (193)
                      |||..++|+|||.++ ....||++|+|.+|+  |+|+|++|+++..+.+++.|
T Consensus         1 frT~~~~Gllly~g~-~~~~dfial~L~~G~--l~~~~~~G~~~~~~~~~~~i   50 (131)
T PF00054_consen    1 FRTSEPNGLLLYLGS-KDGKDFIALELRDGR--LEFRYNLGSGPASLRSPQKI   50 (131)
T ss_dssp             EEESSSSEEEEEEES-STTSSEEEEEEETTE--EEEEEESSSEEEEEEESSET
T ss_pred             CccCCCCceEEECCc-CCCCCEEEEEEECCE--EEEEEeCCCccceecCCCcc
Confidence            899999999999986 445599999999999  99999999999999988754


No 6  
>KOG3516|consensus
Probab=98.94  E-value=2.2e-08  Score=92.04  Aligned_cols=108  Identities=25%  Similarity=0.432  Sum_probs=84.2

Q ss_pred             CCC-CCCCCCcEEeeCCCCceecccCCC-CCCccccccccceec-ccCCCcceEEecCCCC-------------------
Q psy2060          74 DCK-GVCSPGATCSLNPSGGYICDDCSL-AGSYEHYDRLCRLRA-RSFPKNSFLTFPALKQ-------------------  131 (193)
Q Consensus        74 ~C~-~pC~~gg~C~~~~~~~~~C~~C~~-g~~~~~~g~~C~~~~-~sF~g~syl~~~~~~~-------------------  131 (193)
                      .|. .+|.|||+|+... .+|.|. |.. .|.    |+.|.... .-|...+++.|....+                   
T Consensus       957 hCss~~C~NGG~Cvery-~gytCD-Cs~Tay~----Gp~Cs~eig~~fe~gs~i~y~fq~~~~~a~~~~~~~~~~~~~~~ 1030 (1306)
T KOG3516|consen  957 HCSSYPCLNGGHCVERY-DGYTCD-CSRTAYD----GPFCSKEIGVFFERGSSIRYNFQKPMRSAVFESSRVKQKLEIEI 1030 (1306)
T ss_pred             ccccccccCCCEEEEec-Cceeec-cccCcCC----CCccccccceEecCCceEEEeccchHHHhhhhhhhhhhcccccc
Confidence            788 9999999999998 999999 975 344    99998864 4466677777653221                   


Q ss_pred             CceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEec-CCeeEEEEEEeCCC-ceEEEEcCC
Q psy2060         132 RNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELT-DSGRSVQFSWSLGS-EVARLNRAD  191 (193)
Q Consensus       132 ~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~-~G~~~l~~~~~~g~-g~~~l~s~~  191 (193)
                      .....|.|.|+|....++|||.+  .-..||+++.|. +|.  |+++|.+|. .+.+++.++
T Consensus      1031 ~~~e~i~~sftTt~~ps~LLfvs--sF~~~y~~V~v~~nGs--Lq~ry~lg~~e~~~~~~~~ 1088 (1306)
T KOG3516|consen 1031 NPNEEINFSFTTTRAPSDLLFVS--SFTDDYLAVLVKDNGS--LQTRYMLGFREPFEYQFKD 1088 (1306)
T ss_pred             CccceEEEEEEeccCceEEEEee--ccccceEEEEEeCCCc--eEEEEecCCcCceEEeccc
Confidence            11236889999999999999987  456789999888 799  999999998 566665543


No 7  
>KOG3514|consensus
Probab=98.87  E-value=1.6e-08  Score=91.84  Aligned_cols=109  Identities=25%  Similarity=0.418  Sum_probs=85.8

Q ss_pred             CcccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccccee--cccCCCcceEEecCCCC--CceeEEE
Q psy2060          64 DTCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLR--ARSFPKNSFLTFPALKQ--RNRLHIK  138 (193)
Q Consensus        64 ~~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~--~~sF~g~syl~~~~~~~--~~~~~i~  138 (193)
                      +.|...-. ..|+ +||+|+|+|...+ +.|.|. |......   |+.|+..  +.+|+|..|+..-....  ....++.
T Consensus       616 psCs~~~~-~~C~~nPC~N~g~C~egw-NrfiCD-Cs~T~~~---G~~CerE~t~ls~nGs~~m~i~L~~~~~tq~E~v~  689 (1591)
T KOG3514|consen  616 PSCSLSNE-KICESNPCQNGGKCSEGW-NRFICD-CSGTGFE---GRTCEREATALSYNGSMSMKIVLPHTMHTQAEDVS  689 (1591)
T ss_pred             cccchhhc-cccCCCcccCCCCccccc-cccccc-cccCccc---CccccceeeeEEEcCeeeEEEEecccceeecceEE
Confidence            34554432 3899 9999999999988 999999 9753332   9999884  67899988876532222  2234799


Q ss_pred             EEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCC
Q psy2060         139 LQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLG  181 (193)
Q Consensus       139 ~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g  181 (193)
                      ++|+|..+.||||-.+. .++.|-+.|+|.+|+  +++..++.
T Consensus       690 iRF~t~r~~Gll~~Tta-~~s~D~l~l~L~~g~--vkl~v~ls  729 (1591)
T KOG3514|consen  690 IRFRTQRAYGLLFATTA-RGSADTLRLELDAGQ--VKLFVNLS  729 (1591)
T ss_pred             EEEEecccceeEEEecc-CCCCceEEEEEecce--EEEEEecC
Confidence            99999999999998765 578899999999999  99988886


No 8  
>KOG3514|consensus
Probab=98.87  E-value=1.9e-08  Score=91.46  Aligned_cols=104  Identities=26%  Similarity=0.536  Sum_probs=86.6

Q ss_pred             CCC--CCCCCCcEEeeCCCCceecccCCCCCCcccccccccee----cccCCCcceEEecCCCC---CceeEEEEEEEeC
Q psy2060          74 DCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLR----ARSFPKNSFLTFPALKQ---RNRLHIKLQFSTL  144 (193)
Q Consensus        74 ~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~----~~sF~g~syl~~~~~~~---~~~~~i~~~f~T~  144 (193)
                      .|.  .+|.+++.|.... .+..|. |...+.    |+.|++.    ...|.|+-|+-|....+   ...-.|+|.|||.
T Consensus       211 ~~~~~~~c~~~g~c~s~d-~gp~c~-c~~~~d----gq~cekeK~~~eaTF~G~ef~~YDls~npI~s~~d~itl~FrT~  284 (1591)
T KOG3514|consen  211 SSREQFVCLNDGECYSSD-DGPHCD-CQFDHD----GQNCEKEKNDGEATFGGDEFVGYDLSQNPIRSKKDNITLTFRTV  284 (1591)
T ss_pred             cccccceeccCCeEecCC-CCCccc-cccccC----ccccccccCcceEEecCceEEEeeccCCcccccccceEEEEEEe
Confidence            555  8999999999887 888899 977444    9999984    56899999998864433   3335799999999


Q ss_pred             CCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEE
Q psy2060         145 HDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLN  188 (193)
Q Consensus       145 ~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~  188 (193)
                      +.+|+|||.|   +..||+.|.|.+|.  |.+..++++|..++.
T Consensus       285 q~ngllfytG---~~~dYlnlaL~dGa--V~l~~~l~~g~~e~~  323 (1591)
T KOG3514|consen  285 QGNGLLFYTG---DEKDYLNLALQDGA--VSLSSKLDGGDAEII  323 (1591)
T ss_pred             cCceeEEEcc---CCcceeeEeecCCc--EEEEEecCCccceeE
Confidence            9999999995   57899999999999  999999998877654


No 9  
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=98.83  E-value=4.6e-08  Score=71.53  Aligned_cols=73  Identities=38%  Similarity=0.643  Sum_probs=62.3

Q ss_pred             ccCCCcceEEecCCCC-CceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCC
Q psy2060         116 RSFPKNSFLTFPALKQ-RNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRAD  191 (193)
Q Consensus       116 ~sF~g~syl~~~~~~~-~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~  191 (193)
                      ..|.+.+|+.++.... ....+|+++|+|..++|+||+.+. ....+|++|+|.+|+  |.+.++.|.....++++.
T Consensus         2 ~~F~g~~~i~~~~~~~~~~~~~i~~~frt~~~~g~l~~~~~-~~~~~~~~l~l~~g~--l~~~~~~g~~~~~~~~~~   75 (151)
T cd00110           2 VSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGS-QNGGDFLALELEDGR--LVLRYDLGSGSLVLSSKT   75 (151)
T ss_pred             eEeCCCceEEecCCCCCcceeEEEEEEEeCCCCeEEEEecC-CCCCCEEEEEEECCE--EEEEEcCCcccEEEEccC
Confidence            4688899999987665 667899999999999999999864 225899999999999  999999997788887764


No 10 
>KOG3516|consensus
Probab=98.76  E-value=1.7e-07  Score=86.27  Aligned_cols=128  Identities=22%  Similarity=0.191  Sum_probs=95.7

Q ss_pred             cCCCCceeeCCCCCCCCcccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccccee-----cccCC-C
Q psy2060          48 RTLDLLHDMGARYHWGDTCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLR-----ARSFP-K  120 (193)
Q Consensus        48 ~~~~~~~C~C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~-----~~sF~-g  120 (193)
                      ..+..+.|.|.++..-=.|.     .-|. .-++.+.+|+... ..-++. |...+...  +-+|+..     +++|+ +
T Consensus       726 C~d~~~~CNCDa~~~ewt~D-----tg~l~~k~hLPVt~vv~g-dTg~~~-sea~~~lg--PLrC~gDr~~wnsvSF~~~  796 (1306)
T KOG3516|consen  726 CLDPQLYCNCDADEKEWTTD-----TGCLAYKDHLPVTQVVIG-DTGRSQ-SEAPYVLG--PLRCEGDRNFWNSVSFHTG  796 (1306)
T ss_pred             ccCcceeeeccCCCcccccc-----ccccchhhcCCeeEEEEc-cCCCcc-cccceeec--ceEeecccccccceEeecC
Confidence            34567899998775432222     1444 6677888888665 333366 65544432  6678763     56787 4


Q ss_pred             cceEEecCCCCCceeEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEE
Q psy2060         121 NSFLTFPALKQRNRLHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLN  188 (193)
Q Consensus       121 ~syl~~~~~~~~~~~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~  188 (193)
                      .+|+.++.....++..|+|.|||..+.|++|.+.   +..|||.|+|.... ++.|.|+.|+|+.+++
T Consensus       797 ~syL~fp~f~~~~saDIsf~FrTt~~~gvflen~---g~~dfir~eL~~~~-~vtf~~dvgnGp~~~~  860 (1306)
T KOG3516|consen  797 ASYLHFPPFHNELSADISFFFRTTASSGVFLENH---GINDFIRLELSSPV-EVTFAFDVGNGPSQLT  860 (1306)
T ss_pred             cceeecCcccCcccccEEEEEEecCCceEeeecc---CCCceEEEEEcCCC-ceEEEEEcCCCceeEE
Confidence            7899999999999999999999999999999983   47899999999765 5999999999977664


No 11 
>smart00282 LamG Laminin G domain.
Probab=98.41  E-value=2e-06  Score=61.94  Aligned_cols=53  Identities=42%  Similarity=0.682  Sum_probs=46.4

Q ss_pred             eEEEEEEEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEcC
Q psy2060         135 LHIKLQFSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNRA  190 (193)
Q Consensus       135 ~~i~~~f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~  190 (193)
                      .+|+|.|+|..++|+|||.+. ....+|++|+|.+|+  +.+.++.|+++..++++
T Consensus         3 ~~i~~~frt~~~~g~l~~~~~-~~~~~~l~l~l~~g~--l~~~~~~g~~~~~~~~~   55 (135)
T smart00282        3 LSISFSFRTTSPNGLLLYAGS-KNGGDYLALELRDGR--LVLRYDLGSGPARLTSD   55 (135)
T ss_pred             eEEEEEEEeCCCCEEEEEeCC-CCCCCEEEEEEECCE--EEEEEECCCCCEEEEEC
Confidence            579999999999999999853 246799999999999  99999999888887765


No 12 
>KOG1225|consensus
Probab=98.21  E-value=1.6e-06  Score=75.35  Aligned_cols=70  Identities=20%  Similarity=0.270  Sum_probs=54.3

Q ss_pred             ee-cCCCCccCCCCcce--eec---CC-ccCCCCceeeCCCCCCCCcccccCCCCCCCCCCCCCcEEeeCCCCceecccC
Q psy2060          26 LS-GLPDNEGLEKRVQT--FVD---GG-HRTLDLLHDMGARYHWGDTCETDITKSDCKGVCSPGATCSLNPSGGYICDDC   98 (193)
Q Consensus        26 ~c-C~~g~~g~~~~c~~--~~~---gg-~~~~~~~~C~C~~g~~G~~C~~~~~~~~C~~pC~~gg~C~~~~~~~~~C~~C   98 (193)
                      .| |+|||+|..  |+.  +..   ++ ....+  .|+|++||.|..|+..    .|...|.+.|.|++ .    +|. |
T Consensus       266 ~CIC~~Gf~G~d--C~e~~Cp~~cs~~g~~~~g--~CiC~~g~~G~dCs~~----~cpadC~g~G~Ci~-G----~C~-C  331 (525)
T KOG1225|consen  266 RCICPPGFTGDD--CDELVCPVDCSGGGVCVDG--ECICNPGYSGKDCSIR----RCPADCSGHGKCID-G----ECL-C  331 (525)
T ss_pred             eEeCCCCCcCCC--CCcccCCcccCCCceecCC--EeecCCCccccccccc----cCCccCCCCCcccC-C----ceE-e
Confidence            48 999999998  876  211   33 44444  9999999999999865    67788999999983 3    588 8


Q ss_pred             CCCCCccccccccce
Q psy2060          99 SLAGSYEHYDRLCRL  113 (193)
Q Consensus        99 ~~g~~~~~~g~~C~~  113 (193)
                      .+||+    |..|++
T Consensus       332 ~~Gy~----G~~C~~  342 (525)
T KOG1225|consen  332 DEGYT----GELCIQ  342 (525)
T ss_pred             CCCCc----CCcccc
Confidence            89998    666665


No 13 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.06  E-value=1.8e-06  Score=46.79  Aligned_cols=28  Identities=39%  Similarity=0.993  Sum_probs=25.1

Q ss_pred             CC-CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060          75 CK-GVCSPGATCSLNPSGGYICDDCSLAGS  103 (193)
Q Consensus        75 C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~  103 (193)
                      |. +||.|+|+|++...++|.|. |+++|.
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~-C~~G~~   29 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCE-CPPGYT   29 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEE-EBTTEE
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeE-CCCCCc
Confidence            55 89999999998766999999 999987


No 14 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=98.03  E-value=3.5e-05  Score=54.30  Aligned_cols=48  Identities=38%  Similarity=0.605  Sum_probs=38.7

Q ss_pred             EEeCCCCeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCC-CceEEEEcCC
Q psy2060         141 FSTLHDSGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLG-SEVARLNRAD  191 (193)
Q Consensus       141 f~T~~~~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g-~g~~~l~s~~  191 (193)
                      |||..++|+|||.+... ..+|++|+|.+|+  |++.|++| .....+.++.
T Consensus         1 Frt~~~~g~Ll~~~~~~-~~~~l~l~l~~g~--l~~~~~~g~~~~~~~~~~~   49 (128)
T PF02210_consen    1 FRTRSPNGLLLYIGSED-NGDFLSLELVDGR--LVVRYNLGGSEIVTTFSNS   49 (128)
T ss_dssp             EEESSSSEEEEEEEEST-TSEEEEEEEETTE--EEEEEESSSSEEEEEECSS
T ss_pred             CccCCCCEeEEEEcCCC-CCEEEEEEEECCE--EEEEEEccccceeeeccCc
Confidence            89999999999997522 2689999999999  99999999 4455555444


No 15 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.64  E-value=3.5e-05  Score=44.36  Aligned_cols=28  Identities=29%  Similarity=0.759  Sum_probs=25.8

Q ss_pred             CCC---CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060          74 DCK---GVCSPGATCSLNPSGGYICDDCSLAGS  103 (193)
Q Consensus        74 ~C~---~pC~~gg~C~~~~~~~~~C~~C~~g~~  103 (193)
                      +|.   +.|..++.|++.. ++|.|. |++||.
T Consensus         4 EC~~~~~~C~~~~~C~N~~-Gsy~C~-C~~Gy~   34 (42)
T PF07645_consen    4 ECAEGPHNCPENGTCVNTE-GSYSCS-CPPGYE   34 (42)
T ss_dssp             TTTTTSSSSSTTSEEEEET-TEEEEE-ESTTEE
T ss_pred             ccCCCCCcCCCCCEEEcCC-CCEEee-CCCCcE
Confidence            888   6899899999998 999999 999986


No 16 
>KOG1214|consensus
Probab=97.55  E-value=8.2e-05  Score=67.15  Aligned_cols=55  Identities=25%  Similarity=0.496  Sum_probs=46.1

Q ss_pred             CCCceeeCCCCCCCC--cccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          50 LDLLHDMGARYHWGD--TCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        50 ~~~~~C~C~~g~~G~--~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      .+.|.|.|.|||.|+  .|... +  +|. +.|...++|.+.+ ++|.|. |.+||.++  |..|
T Consensus       806 gs~y~C~CLPGfsGDG~~c~dv-D--eC~psrChp~A~Cyntp-gsfsC~-C~pGy~GD--Gf~C  863 (1289)
T KOG1214|consen  806 GSTYSCACLPGFSGDGHQCTDV-D--ECSPSRCHPAATCYNTP-GSFSCR-CQPGYYGD--GFQC  863 (1289)
T ss_pred             CceEEEeecCCccCCccccccc-c--ccCccccCCCceEecCC-Ccceee-cccCccCC--Ccee
Confidence            457999999999964  56543 3  999 9999999999999 999999 99999865  5555


No 17 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=97.41  E-value=4.5e-05  Score=32.62  Aligned_cols=13  Identities=15%  Similarity=0.067  Sum_probs=8.9

Q ss_pred             eeeCCCCCCCCcc
Q psy2060          54 HDMGARYHWGDTC   66 (193)
Q Consensus        54 ~C~C~~g~~G~~C   66 (193)
                      .|.|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4778888887776


No 18 
>KOG1217|consensus
Probab=97.36  E-value=0.00037  Score=59.73  Aligned_cols=57  Identities=32%  Similarity=0.605  Sum_probs=45.8

Q ss_pred             ccCCCCceeeCCCCCCCCcc--cccCCCCCCC-C-CCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          47 HRTLDLLHDMGARYHWGDTC--ETDITKSDCK-G-VCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~G~~C--~~~~~~~~C~-~-pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      .+..++|.|.|++||++..+  ...+  +.|. . +|.++++|++.. +.|.|. |+++|.    |..|
T Consensus       246 ~~~~~~~~C~~~~g~~~~~~~~~~~~--~~C~~~~~c~~~~~C~~~~-~~~~C~-C~~g~~----g~~~  306 (487)
T KOG1217|consen  246 VNTVGSYTCRCPEGYTGDACVTCVDV--DSCALIASCPNGGTCVNVP-GSYRCT-CPPGFT----GRLC  306 (487)
T ss_pred             cccCCceeeeCCCCccccccceeeec--cccCCCCccCCCCeeecCC-Ccceee-CCCCCC----CCCC
Confidence            45566799999999999874  3333  3898 5 499999999887 779999 999998    6665


No 19 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.29  E-value=0.00037  Score=38.79  Aligned_cols=33  Identities=33%  Similarity=0.900  Sum_probs=27.4

Q ss_pred             CCC--CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          74 DCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        74 ~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      +|.  .+|.+++.|++.. ++|.|. |+.+|. .  |..|
T Consensus         4 ~C~~~~~C~~~~~C~~~~-g~~~C~-C~~g~~-~--g~~C   38 (39)
T smart00179        4 ECASGNPCQNGGTCVNTV-GSYRCE-CPPGYT-D--GRNC   38 (39)
T ss_pred             cCcCCCCcCCCCEeECCC-CCeEeE-CCCCCc-c--CCcC
Confidence            777  5899999999987 999999 999886 1  5555


No 20 
>KOG1225|consensus
Probab=97.28  E-value=0.00041  Score=60.68  Aligned_cols=72  Identities=24%  Similarity=0.307  Sum_probs=52.7

Q ss_pred             ee-cCCCCccCCCCcceeec-CC-cc--CCCCceeeCCCCCCCCcccccCCCCCCCCCCCCCcEEeeCCCCceecccCCC
Q psy2060          26 LS-GLPDNEGLEKRVQTFVD-GG-HR--TLDLLHDMGARYHWGDTCETDITKSDCKGVCSPGATCSLNPSGGYICDDCSL  100 (193)
Q Consensus        26 ~c-C~~g~~g~~~~c~~~~~-gg-~~--~~~~~~C~C~~g~~G~~C~~~~~~~~C~~pC~~gg~C~~~~~~~~~C~~C~~  100 (193)
                      .| |+-+|+|..  |..-.- ++ .+  .-..-+|+|++||+|..|...    .|...|..++.|++    + .|. |++
T Consensus       235 ic~c~~~~~g~~--c~~~~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~----~Cp~~cs~~g~~~~----g-~Ci-C~~  302 (525)
T KOG1225|consen  235 ICECPEGYFGPL--CSTIYCPGGCTGRGQCVEGRCICPPGFTGDDCDEL----VCPVDCSGGGVCVD----G-ECI-CNP  302 (525)
T ss_pred             eeecCCceeCCc--cccccCCCCCcccceEeCCeEeCCCCCcCCCCCcc----cCCcccCCCceecC----C-Eee-cCC
Confidence            47 999999997  763221 11 11  112358999999999999875    78844998888873    3 799 999


Q ss_pred             CCCccccccccce
Q psy2060         101 AGSYEHYDRLCRL  113 (193)
Q Consensus       101 g~~~~~~g~~C~~  113 (193)
                      +|.    |+.|+.
T Consensus       303 g~~----G~dCs~  311 (525)
T KOG1225|consen  303 GYS----GKDCSI  311 (525)
T ss_pred             Ccc----cccccc
Confidence            999    888864


No 21 
>KOG4260|consensus
Probab=97.26  E-value=0.0003  Score=56.04  Aligned_cols=75  Identities=12%  Similarity=0.130  Sum_probs=56.3

Q ss_pred             eecCCCCccCCCCcceeec------CC-------ccCCCCceeeCCCCCCCCccccc-----------------------
Q psy2060          26 LSGLPDNEGLEKRVQTFVD------GG-------HRTLDLLHDMGARYHWGDTCETD-----------------------   69 (193)
Q Consensus        26 ~cC~~g~~g~~~~c~~~~~------gg-------~~~~~~~~C~C~~g~~G~~C~~~-----------------------   69 (193)
                      +|||+|-.|+.  |..+..      +|       ..-.++-.|.|.+||+|+.|..-                       
T Consensus       130 vCCp~gtyGpd--Cl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C  207 (350)
T KOG4260|consen  130 VCCPDGTYGPD--CLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC  207 (350)
T ss_pred             eccCCCCcCCc--cccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhh
Confidence            47999999998  887743      11       22345678999999999888431                       


Q ss_pred             --------------------------CCCCCCC---CCCCCCcEEeeCCCCceecccCCCCCCc
Q psy2060          70 --------------------------ITKSDCK---GVCSPGATCSLNPSGGYICDDCSLAGSY  104 (193)
Q Consensus        70 --------------------------~~~~~C~---~pC~~gg~C~~~~~~~~~C~~C~~g~~~  104 (193)
                                                +|.++|.   .||...-.|++.. ++|+|. +.+||..
T Consensus       208 ~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNte-GSf~C~-dk~Gy~~  269 (350)
T KOG4260|consen  208 LGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTE-GSFKCE-DKEGYKK  269 (350)
T ss_pred             hcccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCC-CceEec-ccccccC
Confidence                                      1124665   7899989999997 999999 9888874


No 22 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.19  E-value=0.00047  Score=37.88  Aligned_cols=33  Identities=33%  Similarity=0.855  Sum_probs=28.1

Q ss_pred             CCC--CCCCCCcEEeeCCCCceecccCCCCCCccccccccc
Q psy2060          74 DCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCR  112 (193)
Q Consensus        74 ~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~  112 (193)
                      +|.  .+|.+++.|++.. ++|.|. |+.+|.    |..|+
T Consensus         4 ~C~~~~~C~~~~~C~~~~-~~~~C~-C~~g~~----g~~C~   38 (38)
T cd00054           4 ECASGNPCQNGGTCVNTV-GSYRCS-CPPGYT----GRNCE   38 (38)
T ss_pred             cCCCCCCcCCCCEeECCC-CCeEeE-CCCCCc----CCcCC
Confidence            676  5899999999887 899999 999988    77774


No 23 
>KOG1217|consensus
Probab=97.13  E-value=0.00097  Score=57.13  Aligned_cols=83  Identities=24%  Similarity=0.507  Sum_probs=63.9

Q ss_pred             CCCCcee-cCCCCccCCCCcce---eec-------CC--cc---CCCCceeeCCCCCCCCcccccCCCCCCC---CCCCC
Q psy2060          21 PFQPYLS-GLPDNEGLEKRVQT---FVD-------GG--HR---TLDLLHDMGARYHWGDTCETDITKSDCK---GVCSP   81 (193)
Q Consensus        21 ~~~~~~c-C~~g~~g~~~~c~~---~~~-------gg--~~---~~~~~~C~C~~g~~G~~C~~~~~~~~C~---~pC~~   81 (193)
                      ....+.| |++||.|..  |+.   +..       .+  .+   ....+.|.|..||.+..|+...+  .|.   .+|.+
T Consensus       106 ~~~~~~c~c~~g~~~~~--~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~--~C~~~~~~c~~  181 (487)
T KOG1217|consen  106 CVGSYECTCPPGYQGTP--CEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLD--ECIQYSSPCQN  181 (487)
T ss_pred             CCCCceeeCCCccccCc--CCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCccccccccccc--ccccCCCCcCC
Confidence            4456778 999999986  555   222       12  22   34589999999999999987644  887   57999


Q ss_pred             CcEEeeCCCCceecccCCCCCCccccccccce
Q psy2060          82 GATCSLNPSGGYICDDCSLAGSYEHYDRLCRL  113 (193)
Q Consensus        82 gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~  113 (193)
                      ++.|.+.. ++|.|. |+.++.    +..|+.
T Consensus       182 ~~~C~~~~-~~~~C~-c~~~~~----~~~~~~  207 (487)
T KOG1217|consen  182 GGTCVNTG-GSYLCS-CPPGYT----GSTCET  207 (487)
T ss_pred             CcccccCC-CCeeEe-CCCCcc----CCcCcC
Confidence            99999987 789999 999998    666654


No 24 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.96  E-value=0.0012  Score=35.68  Aligned_cols=25  Identities=40%  Similarity=0.869  Sum_probs=22.7

Q ss_pred             CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060          77 GVCSPGATCSLNPSGGYICDDCSLAGS  103 (193)
Q Consensus        77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~  103 (193)
                      .+|.+++.|++.. ++|.|. |+.+|.
T Consensus         6 ~~C~~~~~C~~~~-~~~~C~-C~~g~~   30 (36)
T cd00053           6 NPCSNGGTCVNTP-GSYRCV-CPPGYT   30 (36)
T ss_pred             CCCCCCCEEecCC-CCeEeE-CCCCCc
Confidence            6788899999988 899999 999998


No 25 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.79  E-value=0.0018  Score=35.14  Aligned_cols=29  Identities=41%  Similarity=1.008  Sum_probs=24.2

Q ss_pred             CCCCCCcEEeeCCCCceecccCCCCCCccccc-cccc
Q psy2060          77 GVCSPGATCSLNPSGGYICDDCSLAGSYEHYD-RLCR  112 (193)
Q Consensus        77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g-~~C~  112 (193)
                      .+|.++ +|++.. ++|.|. |+.+|.    | ..|+
T Consensus         6 ~~C~~~-~C~~~~-~~~~C~-C~~g~~----g~~~C~   35 (35)
T smart00181        6 GPCSNG-TCINTP-GSYTCS-CPPGYT----GDKRCE   35 (35)
T ss_pred             CCCCCC-EEECCC-CCeEeE-CCCCCc----cCCccC
Confidence            378888 999886 999999 999998    6 6653


No 26 
>KOG1214|consensus
Probab=96.72  E-value=0.0026  Score=57.90  Aligned_cols=77  Identities=18%  Similarity=0.311  Sum_probs=59.8

Q ss_pred             cCCCCCCcee-cCCCCccCCCCcceeecCCccCCCCceeeCCCCCC--CCcccccCCCCCCC---CCCCCCcEEeeCCCC
Q psy2060          18 TNGPFQPYLS-GLPDNEGLEKRVQTFVDGGHRTLDLLHDMGARYHW--GDTCETDITKSDCK---GVCSPGATCSLNPSG   91 (193)
Q Consensus        18 ~~~~~~~~~c-C~~g~~g~~~~c~~~~~gg~~~~~~~~C~C~~g~~--G~~C~~~~~~~~C~---~pC~~gg~C~~~~~~   91 (193)
                      .+.|...+.| |..++++..++|..      .+...|.|.|..||.  |.+|-..   ++|.   ..|.....|++.+ +
T Consensus       686 ~S~~~~~npCy~gsh~cdt~a~C~p------g~~~~~tcecs~g~~gdgr~c~d~---~eca~~~~~CGp~s~Cin~p-g  755 (1289)
T KOG1214|consen  686 DSDPTPVNPCYDGSHMCDTTARCHP------GTGVDYTCECSSGYQGDGRNCVDE---NECATGFHRCGPNSVCINLP-G  755 (1289)
T ss_pred             CCCCcccccceecCcccCCCccccC------CCCcceEEEEeeccCCCCCCCCCh---hhhccCCCCCCCCceeecCC-C
Confidence            4455666678 88888888777752      334579999999998  5678643   3887   8899999999999 9


Q ss_pred             ceecccCCCCCCcc
Q psy2060          92 GYICDDCSLAGSYE  105 (193)
Q Consensus        92 ~~~C~~C~~g~~~~  105 (193)
                      +|+|+ |..+|...
T Consensus       756 ~~rce-C~~gy~F~  768 (1289)
T KOG1214|consen  756 SYRCE-CRSGYEFA  768 (1289)
T ss_pred             ceeEE-Eeecceec
Confidence            99999 98776544


No 27 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.28  E-value=0.0019  Score=35.74  Aligned_cols=31  Identities=35%  Similarity=0.830  Sum_probs=22.9

Q ss_pred             CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          77 GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      ..|...++|++.. ++|.|. |++||.++  |..|
T Consensus         6 ~~C~~nA~C~~~~-~~~~C~-C~~Gy~Gd--G~~C   36 (36)
T PF12947_consen    6 GGCHPNATCTNTG-GSYTCT-CKPGYEGD--GFFC   36 (36)
T ss_dssp             GGS-TTCEEEE-T-TSEEEE-E-CEEECC--STCE
T ss_pred             CCCCCCcEeecCC-CCEEeE-CCCCCccC--CcCC
Confidence            5688889999998 899999 99998854  5544


No 28 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.17  E-value=0.00094  Score=35.97  Aligned_cols=23  Identities=13%  Similarity=-0.104  Sum_probs=16.7

Q ss_pred             ecCC--ccCC-CCceeeCCCCCCCCc
Q psy2060          43 VDGG--HRTL-DLLHDMGARYHWGDT   65 (193)
Q Consensus        43 ~~gg--~~~~-~~~~C~C~~g~~G~~   65 (193)
                      .|+|  .+.. ++|.|.|++||+|.+
T Consensus         7 ~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    7 QNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCeEEEeCCCCCEEeECCCCCccCC
Confidence            3455  3444 889999999999864


No 29 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.83  E-value=0.0054  Score=33.95  Aligned_cols=26  Identities=31%  Similarity=0.746  Sum_probs=19.7

Q ss_pred             EEeeCCCCceecccCCCCCCcccccccc
Q psy2060          84 TCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        84 ~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      .|++.+ ++|+|. ||.||....+++.|
T Consensus        11 ~C~~~~-g~~~C~-C~~Gy~L~~D~~tC   36 (36)
T PF14670_consen   11 ICVNTP-GSYRCS-CPPGYKLAEDGRTC   36 (36)
T ss_dssp             EEEEET-TSEEEE--STTEEE-TTSSSE
T ss_pred             CCccCC-CceEeE-CCCCCEECcCCCCC
Confidence            789998 999999 99999876555544


No 30 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.77  E-value=0.0082  Score=33.13  Aligned_cols=21  Identities=19%  Similarity=0.091  Sum_probs=18.3

Q ss_pred             ccCCCCceeeCCCCCC-CCccc
Q psy2060          47 HRTLDLLHDMGARYHW-GDTCE   67 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~-G~~C~   67 (193)
                      .+..++|.|.|++||. |..|+
T Consensus        18 ~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179       18 VNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             ECCCCCeEeECCCCCccCCcCC
Confidence            5778899999999999 88874


No 31 
>KOG1836|consensus
Probab=95.71  E-value=0.016  Score=57.33  Aligned_cols=84  Identities=17%  Similarity=0.309  Sum_probs=58.8

Q ss_pred             e-cCCCCccCCCCcceeec--------------------CC---ccCCCCceeeCCCCCCCCcccccCC----------C
Q psy2060          27 S-GLPDNEGLEKRVQTFVD--------------------GG---HRTLDLLHDMGARYHWGDTCETDIT----------K   72 (193)
Q Consensus        27 c-C~~g~~g~~~~c~~~~~--------------------gg---~~~~~~~~C~C~~g~~G~~C~~~~~----------~   72 (193)
                      | |++||+|.+  |+.+-+                    .|   .-...+-.|.|.+.--|+.|+...+          .
T Consensus       697 c~C~~g~tG~~--Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~  774 (1705)
T KOG1836|consen  697 CTCPVGYTGQF--CESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTS  774 (1705)
T ss_pred             ccCCCCcccch--hhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCC
Confidence            8 999999998  986654                    11   1112345566666666666665322          1


Q ss_pred             CCCC-CCCCCCcEEeeCC-CCceecccCCCCCCccccccccceecc
Q psy2060          73 SDCK-GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLRAR  116 (193)
Q Consensus        73 ~~C~-~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~~~  116 (193)
                      ..|. -+|.+++.|.... .....|..||++|+    |.+|+.-..
T Consensus       775 ~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gyt----G~rCe~c~d  816 (1705)
T KOG1836|consen  775 GDCQPCPCPNGGACGQTPEILEVVCKNCPPGYT----GLRCEECAD  816 (1705)
T ss_pred             CCCccCCCCCChhhcCcCcccceecCCCCCCCc----ccccccCCC
Confidence            2388 8999999998765 47778989999999    999988543


No 32 
>smart00051 DSL delta serrate ligand.
Probab=95.63  E-value=0.014  Score=36.52  Aligned_cols=37  Identities=11%  Similarity=-0.013  Sum_probs=23.7

Q ss_pred             cCCCCccCCCCcceeecCCccCCCC------ceeeCCCCCCCCcc
Q psy2060          28 GLPDNEGLEKRVQTFVDGGHRTLDL------LHDMGARYHWGDTC   66 (193)
Q Consensus        28 C~~g~~g~~~~c~~~~~gg~~~~~~------~~C~C~~g~~G~~C   66 (193)
                      |.++|.|..  |+..-....+..+.      -.|.|++||+|+.|
T Consensus        21 C~~~~yG~~--C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G~~C   63 (63)
T smart00051       21 CDENYYGEG--CNKFCRPRDDFFGHYTCDENGNKGCLEGWMGPYC   63 (63)
T ss_pred             CCCCCcCCc--cCCEeCcCccccCCccCCcCCCEecCCCCcCCCC
Confidence            899999997  76432211112223      34669999999887


No 33 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.57  E-value=0.009  Score=29.90  Aligned_cols=16  Identities=19%  Similarity=0.094  Sum_probs=12.4

Q ss_pred             CceeeCCCCCC----CCccc
Q psy2060          52 LLHDMGARYHW----GDTCE   67 (193)
Q Consensus        52 ~~~C~C~~g~~----G~~C~   67 (193)
                      +|+|.|++||.    |..|+
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~   20 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCE   20 (24)
T ss_pred             CEEeeCCCCCcCCCCCCccc
Confidence            58999999997    45554


No 34 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.13  E-value=0.018  Score=31.30  Aligned_cols=21  Identities=19%  Similarity=0.101  Sum_probs=17.9

Q ss_pred             ccCCCCceeeCCCCCCCCccc
Q psy2060          47 HRTLDLLHDMGARYHWGDTCE   67 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~G~~C~   67 (193)
                      .+..++|.|.|++||.|..|+
T Consensus        18 ~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054          18 VNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ECCCCCeEeECCCCCcCCcCC
Confidence            567788999999999998774


No 35 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=95.11  E-value=0.015  Score=29.11  Aligned_cols=21  Identities=24%  Similarity=0.521  Sum_probs=17.4

Q ss_pred             ceecccCCCCCCccccccccce
Q psy2060          92 GYICDDCSLAGSYEHYDRLCRL  113 (193)
Q Consensus        92 ~~~C~~C~~g~~~~~~g~~C~~  113 (193)
                      +|+|. |++||.....|..|+.
T Consensus         1 sy~C~-C~~Gy~l~~d~~~C~D   21 (24)
T PF12662_consen    1 SYTCS-CPPGYQLSPDGRSCED   21 (24)
T ss_pred             CEEee-CCCCCcCCCCCCcccc
Confidence            58999 9999987667888864


No 36 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.07  E-value=0.019  Score=30.95  Aligned_cols=20  Identities=20%  Similarity=0.057  Sum_probs=16.3

Q ss_pred             cCCCCceeeCCCCCCC-Cccc
Q psy2060          48 RTLDLLHDMGARYHWG-DTCE   67 (193)
Q Consensus        48 ~~~~~~~C~C~~g~~G-~~C~   67 (193)
                      +..++|.|.|++||.| ..|+
T Consensus        15 ~~~~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181       15 NTPGSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCCeEeECCCCCccCCccC
Confidence            5578899999999998 6663


No 37 
>KOG1226|consensus
Probab=94.98  E-value=0.037  Score=50.18  Aligned_cols=51  Identities=22%  Similarity=0.496  Sum_probs=36.8

Q ss_pred             ceeeCCCCCCCCcccccCCCCCCC----CCCCCCcEEeeCCCCceecccCCCC-CCccccccccce
Q psy2060          53 LHDMGARYHWGDTCETDITKSDCK----GVCSPGATCSLNPSGGYICDDCSLA-GSYEHYDRLCRL  113 (193)
Q Consensus        53 ~~C~C~~g~~G~~C~~~~~~~~C~----~pC~~gg~C~~~~~~~~~C~~C~~g-~~~~~~g~~C~~  113 (193)
                      -+|+|.+||+|..|+.+.+.+.|.    ..|...|+|+=.     +|. |..+ |.    |+.||.
T Consensus       566 G~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg-----~C~-C~~~~~s----G~~CE~  621 (783)
T KOG1226|consen  566 GRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG-----RCK-CTDPPYS----GEFCEK  621 (783)
T ss_pred             CcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC-----ceE-cCCCCcC----cchhhc
Confidence            478999999999998877667887    578887877632     355 5444 44    666665


No 38 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.90  E-value=0.0096  Score=32.93  Aligned_cols=16  Identities=6%  Similarity=-0.408  Sum_probs=13.1

Q ss_pred             ccCCCCceeeCCCCCC
Q psy2060          47 HRTLDLLHDMGARYHW   62 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~   62 (193)
                      .+++++|+|.|++||.
T Consensus        13 ~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   13 VNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             EEETTSEEEE-STTEE
T ss_pred             ccCCCceEeECCCCCE
Confidence            5778899999999996


No 39 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=94.14  E-value=0.044  Score=29.14  Aligned_cols=20  Identities=20%  Similarity=0.127  Sum_probs=16.5

Q ss_pred             ccCCCCceeeCCCCCCCC-cc
Q psy2060          47 HRTLDLLHDMGARYHWGD-TC   66 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~G~-~C   66 (193)
                      .+..++|.|.|++||.|. .|
T Consensus        15 ~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053          15 VNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             ecCCCCeEeECCCCCcccCCc
Confidence            466678999999999988 55


No 40 
>KOG0994|consensus
Probab=94.10  E-value=0.093  Score=49.77  Aligned_cols=83  Identities=18%  Similarity=0.297  Sum_probs=50.9

Q ss_pred             cCCCCccCCCCcceeecCC---------------ccCCCCcee-eCCCCCCCCcccccCCCCCCC-CCCCCC--------
Q psy2060          28 GLPDNEGLEKRVQTFVDGG---------------HRTLDLLHD-MGARYHWGDTCETDITKSDCK-GVCSPG--------   82 (193)
Q Consensus        28 C~~g~~g~~~~c~~~~~gg---------------~~~~~~~~C-~C~~g~~G~~C~~~~~~~~C~-~pC~~g--------   82 (193)
                      |.|||+|-. .|.+++-.|               .+...++.| .|..||.|+.=--.-  ..|. -||..|        
T Consensus       846 CqpG~WgFP-eCr~CqCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g--~~CrPCpCP~gp~Sg~~~A  922 (1758)
T KOG0994|consen  846 CQPGYWGFP-ECRPCQCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSG--IGCRPCPCPDGPASGRQHA  922 (1758)
T ss_pred             cCCCccCCC-cCccccccCcccccCccccccccccccccccchhhhhccccCCcccCCC--CCCCCCCCCCCCccchhcc
Confidence            888887764 466665322               455567888 488998865322111  2454 455543        


Q ss_pred             cEEeeCC-CCceecccCCCCCCccccccccceecccC
Q psy2060          83 ATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLRARSF  118 (193)
Q Consensus        83 g~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~~~sF  118 (193)
                      ..|..+. .....|. |.+||.    |.+|+.-...|
T Consensus       923 ~sC~~d~~t~~ivC~-C~~GY~----G~RCe~CA~~~  954 (1758)
T KOG0994|consen  923 DSCYLDTRTQQIVCH-CQEGYS----GSRCEICADNH  954 (1758)
T ss_pred             ccccccccccceeee-cccCcc----ccchhhhcccc
Confidence            2455433 2556699 999999    89998754443


No 41 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=94.03  E-value=0.043  Score=38.96  Aligned_cols=39  Identities=18%  Similarity=0.190  Sum_probs=28.5

Q ss_pred             ceecCCCCccCCCCcceeecCC---ccCCCCceeeCCCCCCCCccccc
Q psy2060          25 YLSGLPDNEGLEKRVQTFVDGG---HRTLDLLHDMGARYHWGDTCETD   69 (193)
Q Consensus        25 ~~cC~~g~~g~~~~c~~~~~gg---~~~~~~~~C~C~~g~~G~~C~~~   69 (193)
                      +.-|++.|.+-      |++|.   ......+.|.|+.||+|..||+.
T Consensus        42 i~~Cp~ey~~Y------ClHG~C~yI~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         42 IRLCGPEGDGY------CLHGDCIHARDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             cccCChhhCCE------eECCEEEeeccCCCceeECCCCcccccccce
Confidence            33488877654      45554   34456899999999999999865


No 42 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=93.81  E-value=0.015  Score=33.17  Aligned_cols=16  Identities=6%  Similarity=-0.452  Sum_probs=15.1

Q ss_pred             ccCCCCceeeCCCCCC
Q psy2060          47 HRTLDLLHDMGARYHW   62 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~   62 (193)
                      .|+.|+|.|.|++||.
T Consensus        19 ~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen   19 VNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             EEETTEEEEEESTTEE
T ss_pred             EcCCCCEEeeCCCCcE
Confidence            7999999999999997


No 43 
>PHA02887 EGF-like protein; Provisional
Probab=93.52  E-value=0.053  Score=37.85  Aligned_cols=23  Identities=17%  Similarity=0.044  Sum_probs=18.9

Q ss_pred             ccCCCCceeeCCCCCCCCccccc
Q psy2060          47 HRTLDLLHDMGARYHWGDTCETD   69 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~G~~C~~~   69 (193)
                      ........|.|++||+|..|+..
T Consensus       102 I~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887        102 IIDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             cccCCCceeECCCCcccCCCCcc
Confidence            34456799999999999999853


No 44 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.44  E-value=0.085  Score=28.29  Aligned_cols=27  Identities=26%  Similarity=0.488  Sum_probs=21.4

Q ss_pred             CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          77 GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      ..|.+.|+|+..   ..+|. |..+|.    |+.|
T Consensus         6 ~~C~~~G~C~~~---~g~C~-C~~g~~----G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP---CGRCV-CDSGYT----GPDC   32 (32)
T ss_pred             CccCCCCEEeCC---CCEEE-CCCCCc----CCCC
Confidence            368899999843   35899 999998    7766


No 45 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=92.94  E-value=0.083  Score=37.52  Aligned_cols=33  Identities=27%  Similarity=0.573  Sum_probs=27.7

Q ss_pred             CCCCCCcEEeeCC-CCceecccCCCCCCccccccccceec
Q psy2060          77 GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLRA  115 (193)
Q Consensus        77 ~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~~  115 (193)
                      +.|.|| +|.-.+ ...+.|. |+.||.    |.+||...
T Consensus        51 ~YClHG-~C~yI~dl~~~~Cr-C~~GYt----GeRCEh~d   84 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCR-CSHGYT----GIRCQHVV   84 (139)
T ss_pred             CEeECC-EEEeeccCCCceeE-CCCCcc----ccccccee
Confidence            789997 897654 3788999 999999    99999864


No 46 
>smart00051 DSL delta serrate ligand.
Probab=92.90  E-value=0.13  Score=32.17  Aligned_cols=46  Identities=20%  Similarity=0.325  Sum_probs=32.2

Q ss_pred             CceeeCCCCCCCCcccccCCCCCCC--CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          52 LLHDMGARYHWGDTCETDITKSDCK--GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        52 ~~~C~C~~g~~G~~C~~~~~~~~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      .+.-.|+++|.|..|+.     .|.  +-+.++.+|...  +  .|. |++|+.    |..|
T Consensus        16 ~~rv~C~~~~yG~~C~~-----~C~~~~d~~~~~~Cd~~--G--~~~-C~~Gw~----G~~C   63 (63)
T smart00051       16 QIRVTCDENYYGEGCNK-----FCRPRDDFFGHYTCDEN--G--NKG-CLEGWM----GPYC   63 (63)
T ss_pred             EEEeeCCCCCcCCccCC-----EeCcCccccCCccCCcC--C--CEe-cCCCCc----CCCC
Confidence            34557999999999963     565  446677778432  2  356 879998    7766


No 47 
>PHA02887 EGF-like protein; Provisional
Probab=92.88  E-value=0.093  Score=36.66  Aligned_cols=32  Identities=16%  Similarity=0.263  Sum_probs=25.9

Q ss_pred             CCCCCCcEEeeCC-CCceecccCCCCCCcccccccccee
Q psy2060          77 GVCSPGATCSLNP-SGGYICDDCSLAGSYEHYDRLCRLR  114 (193)
Q Consensus        77 ~pC~~gg~C~~~~-~~~~~C~~C~~g~~~~~~g~~C~~~  114 (193)
                      +.|.+ |+|.-.. .....|. |+.||.    |.+|+..
T Consensus        92 ~YCiH-G~C~yI~dL~epsCr-C~~GYt----G~RCE~v  124 (126)
T PHA02887         92 DFCIN-GECMNIIDLDEKFCI-CNKGYT----GIRCDEV  124 (126)
T ss_pred             CEeeC-CEEEccccCCCceeE-CCCCcc----cCCCCcc
Confidence            78886 5998654 3778899 999999    9999864


No 48 
>KOG1226|consensus
Probab=92.69  E-value=0.18  Score=45.88  Aligned_cols=67  Identities=16%  Similarity=0.232  Sum_probs=42.6

Q ss_pred             e-cCCCCccCCCCcceeecC------------CccC---CCC-----ceeeCCCCCC----CCcccccCCCCCCC----C
Q psy2060          27 S-GLPDNEGLEKRVQTFVDG------------GHRT---LDL-----LHDMGARYHW----GDTCETDITKSDCK----G   77 (193)
Q Consensus        27 c-C~~g~~g~~~~c~~~~~g------------g~~~---~~~-----~~C~C~~g~~----G~~C~~~~~~~~C~----~   77 (193)
                      | |.+||.|..  ||-..+.            ..+.   .+.     -.|.|.+...    |.+||-+.-  .|+    .
T Consensus       480 C~C~~G~~G~~--CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnf--sC~r~~g~  555 (783)
T KOG1226|consen  480 CRCDEGWLGKK--CECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNF--SCERHKGV  555 (783)
T ss_pred             eecCCCCCCCc--ccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeeeeeeccCc--ccccccCc
Confidence            6 999999997  8743320            0111   122     3566765555    999997744  777    6


Q ss_pred             CCCCCcEEeeCCCCceecccCCCCCC
Q psy2060          78 VCSPGATCSLNPSGGYICDDCSLAGS  103 (193)
Q Consensus        78 pC~~gg~C~~~~~~~~~C~~C~~g~~  103 (193)
                      .|.+.|.|.=.     +|. |.+|++
T Consensus       556 lC~g~G~C~CG-----~Cv-C~~Gwt  575 (783)
T KOG1226|consen  556 LCGGHGRCECG-----RCV-CNPGWT  575 (783)
T ss_pred             ccCCCCeEeCC-----cEE-cCCCCc
Confidence            89988888522     366 666665


No 49 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=91.56  E-value=0.13  Score=27.59  Aligned_cols=15  Identities=13%  Similarity=-0.035  Sum_probs=13.3

Q ss_pred             CceeeCCCCCCCCcc
Q psy2060          52 LLHDMGARYHWGDTC   66 (193)
Q Consensus        52 ~~~C~C~~g~~G~~C   66 (193)
                      ..+|.|.+||+|+.|
T Consensus        18 ~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen   18 CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CCEEECCCCCcCCCC
Confidence            579999999999887


No 50 
>KOG4260|consensus
Probab=91.45  E-value=0.22  Score=40.10  Aligned_cols=46  Identities=24%  Similarity=0.575  Sum_probs=37.1

Q ss_pred             CCCCCCCCcccccCCCCCCC----CCCCCCcEEeeCC--CCceecccCCCCCCccccccccce
Q psy2060          57 GARYHWGDTCETDITKSDCK----GVCSPGATCSLNP--SGGYICDDCSLAGSYEHYDRLCRL  113 (193)
Q Consensus        57 C~~g~~G~~C~~~~~~~~C~----~pC~~gg~C~~~~--~~~~~C~~C~~g~~~~~~g~~C~~  113 (193)
                      ||+|-.|+.|.      .|.    .||...|.|...-  .++-+|. |.+||+    |+.|..
T Consensus       132 Cp~gtyGpdCl------~Cpggser~C~GnG~C~GdGsR~GsGkCk-C~~GY~----Gp~C~~  183 (350)
T KOG4260|consen  132 CPDGTYGPDCL------QCPGGSERPCFGNGSCHGDGSREGSGKCK-CETGYT----GPLCRY  183 (350)
T ss_pred             cCCCCcCCccc------cCCCCCcCCcCCCCcccCCCCCCCCCccc-ccCCCC----Cccccc
Confidence            89999999997      564    7899999998542  3666899 999999    777754


No 51 
>KOG3509|consensus
Probab=90.58  E-value=0.41  Score=45.07  Aligned_cols=121  Identities=17%  Similarity=0.162  Sum_probs=64.3

Q ss_pred             CCCCCCCCcccccCCCCCCC-CCCCCCcEEeeCCCCceecccCCCCCCccccccccceec----ccCC------------
Q psy2060          57 GARYHWGDTCETDITKSDCK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLCRLRA----RSFP------------  119 (193)
Q Consensus        57 C~~g~~G~~C~~~~~~~~C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C~~~~----~sF~------------  119 (193)
                      |+..+.+..++..    .|. .+|..+++|......+|.+. |+....    ...|+...    ..|.            
T Consensus       161 ~~~~~~~~~~~~~----~~~~~~~q~g~tC~~~~~~~~~~~-~~~~~~----~~~c~~~~~r~~~~f~~~~~g~~~~~~~  231 (964)
T KOG3509|consen  161 CPTSEGGPGTEPI----HCAQPVCQGGATCEVRNGKGYSLE-CPDCKV----RVVCEACKPRAFCPFEKSVEGCLKCFCF  231 (964)
T ss_pred             CcccCCCCccccc----cccCcccccceeEEecCCcceeee-cccccc----ceehhhccCceecccccccccccceeec
Confidence            4444455556544    667 99999999998877888888 876544    44455421    1111            


Q ss_pred             -------CcceEEecCCCCCceeEEEEEEEeCCC--CeEEEEeecCCCCCceEEEEecCCeeEEEEEEeCCCceEEEEc
Q psy2060         120 -------KNSFLTFPALKQRNRLHIKLQFSTLHD--SGLLLYNGRYNERHDFIALELTDSGRSVQFSWSLGSEVARLNR  189 (193)
Q Consensus       120 -------g~syl~~~~~~~~~~~~i~~~f~T~~~--~glLl~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s  189 (193)
                             ...++.+.......+......|.+...  +.++.+... ....+|+++.+..|.  +-++|..+.++..+..
T Consensus       232 ~vp~~~e~S~~~~~~~h~~~~~~~~~~~~~~~s~~~~~~~~~~~~-~~~~~f~~lt~~~g~--~g~~~~~~~~~~~~~~  307 (964)
T KOG3509|consen  232 GVPRPSESSLHAFRAIHGATLHVDSLGVFFSDSEHRDILGNFLFS-SFKDGFRALTLDGGT--DGVRYDCGLPQREDRL  307 (964)
T ss_pred             CCCccccchhhhHhhhccchhccchheeecccccccccccccccc-ccccceeeeccCCCC--ccccccccCcchhhhh
Confidence                   011111111111111111122333322  233333322 456789999988888  7777777766555443


No 52 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=88.65  E-value=0.15  Score=31.82  Aligned_cols=37  Identities=11%  Similarity=0.069  Sum_probs=19.1

Q ss_pred             cCCCCccCCCCcceeecCCccCCCCcee------eCCCCCCCCcc
Q psy2060          28 GLPDNEGLEKRVQTFVDGGHRTLDLLHD------MGARYHWGDTC   66 (193)
Q Consensus        28 C~~g~~g~~~~c~~~~~gg~~~~~~~~C------~C~~g~~G~~C   66 (193)
                      |.+.|.|..  |.+--..-.+..+.|.|      +|.+||+|+.|
T Consensus        21 C~~nyyG~~--C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   21 CDENYYGPN--CSKFCKPRDDSFGHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             --TTEETTT--T-EE---EEETTEEEEE-SS--EEE-TTEESTTS
T ss_pred             CCCCCCCcc--ccCCcCCCcCCcCCcccCCCCCCCCCCCCcCCCC
Confidence            788899997  76532211122233443      57899999887


No 53 
>KOG1836|consensus
Probab=87.18  E-value=1.2  Score=44.86  Aligned_cols=50  Identities=18%  Similarity=0.076  Sum_probs=32.5

Q ss_pred             CCCcee-cCCCCccCC-----CCcceee--cCC----ccCCCCceee-CCCCCCCCcccccCC
Q psy2060          22 FQPYLS-GLPDNEGLE-----KRVQTFV--DGG----HRTLDLLHDM-GARYHWGDTCETDIT   71 (193)
Q Consensus        22 ~~~~~c-C~~g~~g~~-----~~c~~~~--~gg----~~~~~~~~C~-C~~g~~G~~C~~~~~   71 (193)
                      .++..- |.+||+|..     ..|..+-  +++    ........|+ ||++|+|..|+...+
T Consensus       754 ~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d  816 (1705)
T KOG1836|consen  754 FGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD  816 (1705)
T ss_pred             CCCchhhhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence            444543 888888774     1133322  123    2335678999 999999999997644


No 54 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=86.96  E-value=0.34  Score=26.83  Aligned_cols=34  Identities=26%  Similarity=0.517  Sum_probs=22.3

Q ss_pred             CC-CCCCCCcEEeeCCCCceecccCCCCCCcccccccc
Q psy2060          75 CK-GVCSPGATCSLNPSGGYICDDCSLAGSYEHYDRLC  111 (193)
Q Consensus        75 C~-~pC~~gg~C~~~~~~~~~C~~C~~g~~~~~~g~~C  111 (193)
                      |. ..|...+.|+...++.+.|. |-+||...  |..|
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecr-Cllgyk~~--~~~C   36 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECR-CLLGYKKV--GGKC   36 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEE-E-TTEEEE--TTEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEE-eeCCcccc--CCCc
Confidence            55 67888899998766999999 99988643  4444


No 55 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.98  E-value=0.38  Score=36.61  Aligned_cols=52  Identities=21%  Similarity=0.395  Sum_probs=37.4

Q ss_pred             CCCCceeeCCCCCC---CCcccccCCCCCCC------CCCCCCcEEeeCC----CCceecccCCCCCCc
Q psy2060          49 TLDLLHDMGARYHW---GDTCETDITKSDCK------GVCSPGATCSLNP----SGGYICDDCSLAGSY  104 (193)
Q Consensus        49 ~~~~~~C~C~~g~~---G~~C~~~~~~~~C~------~pC~~gg~C~~~~----~~~~~C~~C~~g~~~  104 (193)
                      ..+.|.|.|.+||.   -..||...   .|.      .+|..-++|+..+    ...|+|. |..+|..
T Consensus        16 MSNHfEC~Cnegfvl~~EntCE~kv---~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~-C~~gY~~   80 (197)
T PF06247_consen   16 MSNHFECKCNEGFVLKNENTCEEKV---ECDKLENVNKPCGDYAKCINQANKGEERAYKCD-CINGYIL   80 (197)
T ss_dssp             ESSEEEEEESTTEEEEETTEEEE-------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEE-E-TTEEE
T ss_pred             ccCceEEEcCCCcEEccccccccce---ecCcccccCccccchhhhhcCCCcccceeEEEe-cccCcee
Confidence            35679999999997   56787654   554      6999999999765    2789999 9998864


No 56 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=83.58  E-value=1.4  Score=34.30  Aligned_cols=35  Identities=31%  Similarity=0.800  Sum_probs=26.2

Q ss_pred             CCcccccCCCCCCC---CCCCCCcEEeeCCCCceecccCCCCCCc
Q psy2060          63 GDTCETDITKSDCK---GVCSPGATCSLNPSGGYICDDCSLAGSY  104 (193)
Q Consensus        63 G~~C~~~~~~~~C~---~pC~~gg~C~~~~~~~~~C~~C~~g~~~  104 (193)
                      +..|+ +.  ++|.   .+|.  ..|.+.. ++|.|. |+.||..
T Consensus       181 ~~~C~-~~--~~C~~~~~~c~--~~C~~~~-g~~~c~-c~~g~~~  218 (224)
T cd01475         181 GKICV-VP--DLCATLSHVCQ--QVCISTP-GSYLCA-CTEGYAL  218 (224)
T ss_pred             cccCc-Cc--hhhcCCCCCcc--ceEEcCC-CCEEeE-CCCCccC
Confidence            56675 33  3786   4566  4899887 999999 9999874


No 57 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.08  E-value=0.56  Score=35.72  Aligned_cols=76  Identities=12%  Similarity=0.204  Sum_probs=45.1

Q ss_pred             CCCcee-cCCCCccCC-CCcceeec-------------------CC-ccCCCCceeeCCCCCC--CCcccccCCCCCCC-
Q psy2060          22 FQPYLS-GLPDNEGLE-KRVQTFVD-------------------GG-HRTLDLLHDMGARYHW--GDTCETDITKSDCK-   76 (193)
Q Consensus        22 ~~~~~c-C~~g~~g~~-~~c~~~~~-------------------gg-~~~~~~~~C~C~~g~~--G~~C~~~~~~~~C~-   76 (193)
                      ...|.| |.+||.-.. ..||.-++                   .+ ......|.|.|.+||+  ...|-..    .|. 
T Consensus        17 SNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~----~C~~   92 (197)
T PF06247_consen   17 SNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPN----KCNN   92 (197)
T ss_dssp             SSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEG----GGSS
T ss_pred             cCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchh----hcCc
Confidence            356889 999995321 12553322                   11 2445789999999997  3456543    788 


Q ss_pred             CCCCCCcEEeeCC--CCceecccCCCCCC
Q psy2060          77 GVCSPGATCSLNP--SGGYICDDCSLAGS  103 (193)
Q Consensus        77 ~pC~~gg~C~~~~--~~~~~C~~C~~g~~  103 (193)
                      ..|. .|.|+..+  .....|. |.-|..
T Consensus        93 ~~Cg-~GKCI~d~~~~~~~~CS-C~IGkV  119 (197)
T PF06247_consen   93 KDCG-SGKCILDPDNPNNPTCS-CNIGKV  119 (197)
T ss_dssp             ---T-TEEEEEEEGGGSEEEEE-E-TEEE
T ss_pred             eecC-CCeEEecCCCCCCceeE-eeeceE
Confidence            8888 58998664  2456899 977654


No 58 
>KOG0994|consensus
Probab=83.06  E-value=0.81  Score=43.81  Aligned_cols=20  Identities=15%  Similarity=0.144  Sum_probs=16.3

Q ss_pred             CCcee-cCCCCccCCCCcceeec
Q psy2060          23 QPYLS-GLPDNEGLEKRVQTFVD   44 (193)
Q Consensus        23 ~~~~c-C~~g~~g~~~~c~~~~~   44 (193)
                      +...| |.+||+|.-  |+.+-+
T Consensus       932 ~~ivC~C~~GY~G~R--Ce~CA~  952 (1758)
T KOG0994|consen  932 QQIVCHCQEGYSGSR--CEICAD  952 (1758)
T ss_pred             cceeeecccCccccc--hhhhcc
Confidence            55568 999999995  998765


No 59 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=77.06  E-value=2.5  Score=29.16  Aligned_cols=28  Identities=29%  Similarity=0.764  Sum_probs=22.6

Q ss_pred             CCCC--CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060          73 SDCK--GVCSPGATCSLNPSGGYICDDCSLAGS  103 (193)
Q Consensus        73 ~~C~--~pC~~gg~C~~~~~~~~~C~~C~~g~~  103 (193)
                      +.|+  ..|...|.|...  ....|. |++||.
T Consensus        78 d~Cd~y~~CG~~g~C~~~--~~~~C~-Cl~GF~  107 (110)
T PF00954_consen   78 DQCDVYGFCGPNGICNSN--NSPKCS-CLPGFE  107 (110)
T ss_pred             cCCCCccccCCccEeCCC--CCCceE-CCCCcC
Confidence            4888  999999999643  455699 999986


No 60 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=74.32  E-value=1.6  Score=25.43  Aligned_cols=18  Identities=11%  Similarity=0.062  Sum_probs=14.5

Q ss_pred             CCceeeCCCCCCCCcccc
Q psy2060          51 DLLHDMGARYHWGDTCET   68 (193)
Q Consensus        51 ~~~~C~C~~g~~G~~C~~   68 (193)
                      ....|.|+++|+|+.|+.
T Consensus        16 ~~G~C~C~~~~~G~~C~~   33 (49)
T PF00053_consen   16 STGQCVCKPGTTGPRCDQ   33 (49)
T ss_dssp             TCEEESBSTTEESTTS-E
T ss_pred             CCCEEeccccccCCcCcC
Confidence            456899999999999974


No 61 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=73.41  E-value=2.5  Score=24.77  Aligned_cols=16  Identities=13%  Similarity=0.046  Sum_probs=14.0

Q ss_pred             CceeeCCCCCCCCccc
Q psy2060          52 LLHDMGARYHWGDTCE   67 (193)
Q Consensus        52 ~~~C~C~~g~~G~~C~   67 (193)
                      .-.|.|+++++|..|+
T Consensus        18 ~G~C~C~~~~~G~~C~   33 (50)
T cd00055          18 TGQCECKPNTTGRRCD   33 (50)
T ss_pred             CCEEeCCCcCCCCCCC
Confidence            4689999999999997


No 62 
>KOG1218|consensus
Probab=73.12  E-value=8.1  Score=31.37  Aligned_cols=38  Identities=18%  Similarity=0.260  Sum_probs=30.3

Q ss_pred             CCCceeeCCCCCCCCcccccCCCCCCC--CCCCCCcEEeeCC
Q psy2060          50 LDLLHDMGARYHWGDTCETDITKSDCK--GVCSPGATCSLNP   89 (193)
Q Consensus        50 ~~~~~C~C~~g~~G~~C~~~~~~~~C~--~pC~~gg~C~~~~   89 (193)
                      ...-.|.|++||+|.+|.....  .|.  ..|.+++.|....
T Consensus       159 ~~~~~c~c~~g~~g~~~~~~~~--~c~~~~~~~~g~~C~~~~  198 (316)
T KOG1218|consen  159 CKNGICTCQPGFVGVFCVESCS--GCSPLTACENGAKCNRST  198 (316)
T ss_pred             CCCCceeccCCcccccccccCC--CcCCCcccCCCCeeeccc
Confidence            3456789999999999998754  477  8888988998653


No 63 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=73.01  E-value=2.7  Score=24.22  Aligned_cols=16  Identities=13%  Similarity=0.011  Sum_probs=13.5

Q ss_pred             CceeeCCCCCCCCccc
Q psy2060          52 LLHDMGARYHWGDTCE   67 (193)
Q Consensus        52 ~~~C~C~~g~~G~~C~   67 (193)
                      .-.|.|+++++|..|+
T Consensus        17 ~G~C~C~~~~~G~~C~   32 (46)
T smart00180       17 TGQCECKPNVTGRRCD   32 (46)
T ss_pred             CCEEECCCCCCCCCCC
Confidence            4589999999999886


No 64 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=72.98  E-value=5.7  Score=23.30  Aligned_cols=21  Identities=24%  Similarity=0.580  Sum_probs=15.4

Q ss_pred             CCCCCCcEEeeCCCCceecccCCCCCC
Q psy2060          77 GVCSPGATCSLNPSGGYICDDCSLAGS  103 (193)
Q Consensus        77 ~pC~~gg~C~~~~~~~~~C~~C~~g~~  103 (193)
                      ..|..++.|++.     +|. |+.++.
T Consensus        26 ~qC~~~s~C~~g-----~C~-C~~g~~   46 (52)
T PF01683_consen   26 EQCIGGSVCVNG-----RCQ-CPPGYV   46 (52)
T ss_pred             CCCCCcCEEcCC-----EeE-CCCCCE
Confidence            567788888643     599 988865


No 65 
>KOG3509|consensus
Probab=69.62  E-value=26  Score=33.59  Aligned_cols=39  Identities=18%  Similarity=0.109  Sum_probs=22.8

Q ss_pred             EEEEeCCCCeEEEEeec--CCCCCceEEEEecCCeeEEEEEEe
Q psy2060         139 LQFSTLHDSGLLLYNGR--YNERHDFIALELTDSGRSVQFSWS  179 (193)
Q Consensus       139 ~~f~T~~~~glLl~~g~--~~~~~dfi~l~l~~G~~~l~~~~~  179 (193)
                      .+|+.....+.+++..+  .....|+++.......  ..+.|.
T Consensus       572 ~s~~v~a~~~~~~~~~~~P~~~vad~vs~~g~~~~--~~~s~~  612 (964)
T KOG3509|consen  572 ISFSVSAEAGGLLLRWSLPAKPVADKVSPLGGALS--QTHSYK  612 (964)
T ss_pred             ceeeeeeeccceEEEecCCCCCCcceeccccceee--eeeccC
Confidence            45666666666665543  2334566666555555  666676


No 66 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=63.62  E-value=6.1  Score=30.76  Aligned_cols=17  Identities=6%  Similarity=-0.490  Sum_probs=15.0

Q ss_pred             ccCCCCceeeCCCCCCC
Q psy2060          47 HRTLDLLHDMGARYHWG   63 (193)
Q Consensus        47 ~~~~~~~~C~C~~g~~G   63 (193)
                      .++.++|.|.|++||+.
T Consensus       202 ~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         202 ISTPGSYLCACTEGYAL  218 (224)
T ss_pred             EcCCCCEEeECCCCccC
Confidence            57889999999999974


No 67 
>KOG0196|consensus
Probab=63.04  E-value=11  Score=35.42  Aligned_cols=13  Identities=23%  Similarity=0.422  Sum_probs=8.7

Q ss_pred             CceecccCCCCCCc
Q psy2060          91 GGYICDDCSLAGSY  104 (193)
Q Consensus        91 ~~~~C~~C~~g~~~  104 (193)
                      +...|. |..||..
T Consensus       306 ga~~C~-C~~gyyR  318 (996)
T KOG0196|consen  306 GATSCT-CENGYYR  318 (996)
T ss_pred             CCCccc-ccCCccc
Confidence            556688 8777653


No 68 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=62.19  E-value=2.6  Score=25.42  Aligned_cols=23  Identities=17%  Similarity=-0.054  Sum_probs=14.6

Q ss_pred             CCCCceeeCCCCCCCCcccccCC
Q psy2060          49 TLDLLHDMGARYHWGDTCETDIT   71 (193)
Q Consensus        49 ~~~~~~C~C~~g~~G~~C~~~~~   71 (193)
                      ..|...|.|..-|+|++|...+.
T Consensus        32 ~dG~p~CECn~Cy~GpdCS~~~~   54 (56)
T PF04863_consen   32 ADGSPVCECNSCYGGPDCSTLIP   54 (56)
T ss_dssp             ETTEE--EE-TTEESTTS-EE-T
T ss_pred             ccCCccccccCCcCCCCcccCCC
Confidence            45668999999999999987653


No 69 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=46.71  E-value=13  Score=20.04  Aligned_cols=12  Identities=0%  Similarity=-0.438  Sum_probs=9.8

Q ss_pred             CCceeeCCCCCC
Q psy2060          51 DLLHDMGARYHW   62 (193)
Q Consensus        51 ~~~~C~C~~g~~   62 (193)
                      ....|.||.||.
T Consensus        16 ~~~~C~CPeGyI   27 (34)
T PF09064_consen   16 SPGQCFCPEGYI   27 (34)
T ss_pred             CCCceeCCCceE
Confidence            456999999995


No 70 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=38.30  E-value=15  Score=25.30  Aligned_cols=26  Identities=23%  Similarity=0.686  Sum_probs=19.6

Q ss_pred             CCC---CCCCCCcEEeeCCC----CceecccCCC
Q psy2060          74 DCK---GVCSPGATCSLNPS----GGYICDDCSL  100 (193)
Q Consensus        74 ~C~---~pC~~gg~C~~~~~----~~~~C~~C~~  100 (193)
                      .|.   +-|.+.|.|+....    .=|.|. |.+
T Consensus         7 aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~-C~~   39 (103)
T PF12955_consen    7 ACENATNNCSGHGSCVKKYGSGGGDCFACK-CKP   39 (103)
T ss_pred             HHHHhccCCCCCceEeeccCCCccceEEEE-eec
Confidence            566   78999999998741    347899 966


No 71 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=30.90  E-value=2.5e+02  Score=21.60  Aligned_cols=50  Identities=14%  Similarity=0.270  Sum_probs=32.2

Q ss_pred             CceeEEEEEEEeCCCCeE--EEEeecCCCCCceEEEEecCCeeEEEEEEeCC
Q psy2060         132 RNRLHIKLQFSTLHDSGL--LLYNGRYNERHDFIALELTDSGRSVQFSWSLG  181 (193)
Q Consensus       132 ~~~~~i~~~f~T~~~~gl--Ll~~g~~~~~~dfi~l~l~~G~~~l~~~~~~g  181 (193)
                      .....|.++|++....++  ||-.+.......|+.|-+.++.+.+.+|=.-|
T Consensus        32 L~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~   83 (190)
T PF02973_consen   32 LEEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKG   83 (190)
T ss_dssp             -SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTT
T ss_pred             ccccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCC
Confidence            445688999999766665  44433324556899999999985555554444


No 72 
>PF08300 HCV_NS5a_1a:  Hepatitis C virus non-structural 5a zinc finger domain;  InterPro: IPR013192 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in the non-structural 5a protein (NS5a) in Hepatitis C virus. The molecular function of NS5a is uncertain, but it is phosphorylated when expressed in mammalian cells. It is thought to interact with the dsRNA dependent (interferon inducible) kinase PKR, P19525 from SWISSPROT [, ]. This region corresponds to the N-terminal zinc binding domain (1a) []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003968 RNA-directed RNA polymerase activity, 0004252 serine-type endopeptidase activity, 0008270 zinc ion binding, 0017111 nucleoside-triphosphatase activity, 0006355 regulation of transcription, DNA-dependent, 0006915 apoptosis, 0030683 evasion by virus of host immune response, 0005789 endoplasmic reticulum membrane, 0016021 integral to membrane; PDB: 1ZH1_B 3FQM_A 3FQQ_B.
Probab=28.30  E-value=16  Score=22.51  Aligned_cols=13  Identities=31%  Similarity=0.473  Sum_probs=6.1

Q ss_pred             CceecCCCCccCC
Q psy2060          24 PYLSGLPDNEGLE   36 (193)
Q Consensus        24 ~~~cC~~g~~g~~   36 (193)
                      ||..|..||.|.+
T Consensus         2 Pf~sCqrGy~G~W   14 (62)
T PF08300_consen    2 PFLSCQRGYKGVW   14 (62)
T ss_dssp             TTTS----B-SEE
T ss_pred             CccccCCCCCcEE
Confidence            6777888888886


No 73 
>TIGR01492 CPW_WPC Plasmodium falciparum CPW-WPC domain. This model represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats, and is known so far only in Plasmodium falciparum. It is named for motifs of CPxxW and (less well conserved) WPC.
Probab=27.54  E-value=58  Score=19.95  Aligned_cols=16  Identities=19%  Similarity=0.210  Sum_probs=10.3

Q ss_pred             CceeeCCCCCCCCcccc
Q psy2060          52 LLHDMGARYHWGDTCET   68 (193)
Q Consensus        52 ~~~C~C~~g~~G~~C~~   68 (193)
                      ++.|.-|..|.|+ |..
T Consensus        20 ~~~C~ap~~Y~Gp-C~~   35 (62)
T TIGR01492        20 KYGCIAPDNYKGP-CER   35 (62)
T ss_pred             CCeeCCCcccCCC-CCC
Confidence            4447777777776 653


No 74 
>PF12599 DUF3768:  Protein of unknown function (DUF3768);  InterPro: IPR022243  This family of proteins is found in bacteria. Proteins in this family are typically between 108 and 129 amino acids in length. There are two conserved sequence motifs: NDP and RVLT. 
Probab=26.02  E-value=97  Score=20.48  Aligned_cols=34  Identities=21%  Similarity=0.232  Sum_probs=24.8

Q ss_pred             CCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCCC
Q psy2060         157 NERHDFIALELTDSGRSVQFSWSLGSEVARLNRADS  192 (193)
Q Consensus       157 ~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~~  192 (193)
                      .+.+||=++++.+-+  +.++++.-+...+.-|+++
T Consensus        34 ~gEHDFG~v~~~g~~--vfWKIDyYD~d~~~gS~dP   67 (84)
T PF12599_consen   34 YGEHDFGAVEFDGEK--VFWKIDYYDPDLEFGSPDP   67 (84)
T ss_pred             cccccceEEEECCEE--EEEEEEEEcCccccCCCCC
Confidence            356899999988888  8888777666666666554


No 75 
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=25.34  E-value=86  Score=15.71  Aligned_cols=21  Identities=29%  Similarity=0.824  Sum_probs=12.2

Q ss_pred             CC-CCCCCCcEEeeCCCCceec
Q psy2060          75 CK-GVCSPGATCSLNPSGGYIC   95 (193)
Q Consensus        75 C~-~pC~~gg~C~~~~~~~~~C   95 (193)
                      |. ..|..|..|+.+..+...|
T Consensus         2 C~~v~C~~G~~C~~d~~g~p~C   23 (26)
T smart00274        2 CRNVQCPFGKVCVVDKGGNARC   23 (26)
T ss_pred             CCCEECCCCCEEEeCCCCCEEE
Confidence            45 5677777777643244444


No 76 
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=24.57  E-value=1.1e+02  Score=15.30  Aligned_cols=19  Identities=11%  Similarity=0.314  Sum_probs=13.8

Q ss_pred             CCCceEEEEecCCeeEEEEEE
Q psy2060         158 ERHDFIALELTDSGRSVQFSW  178 (193)
Q Consensus       158 ~~~dfi~l~l~~G~~~l~~~~  178 (193)
                      ..+.+.++...+|.  +..++
T Consensus        14 ~~g~l~a~d~~~G~--~~W~~   32 (33)
T smart00564       14 TDGTLYALDAKTGE--ILWTY   32 (33)
T ss_pred             CCCEEEEEEcccCc--EEEEc
Confidence            35678888888888  76654


No 77 
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=23.91  E-value=99  Score=20.50  Aligned_cols=25  Identities=24%  Similarity=0.728  Sum_probs=17.7

Q ss_pred             CCC-CCCCCCcEEeeCCCCceecccCC
Q psy2060          74 DCK-GVCSPGATCSLNPSGGYICDDCS   99 (193)
Q Consensus        74 ~C~-~pC~~gg~C~~~~~~~~~C~~C~   99 (193)
                      +|. ..|..|.+|+....+...|. |.
T Consensus         1 pC~~v~C~~G~~C~~d~~~~p~Cv-C~   26 (86)
T cd01328           1 PCENHHCGAGKVCEVDDENTPKCV-CI   26 (86)
T ss_pred             CCCCcCCCCCCEeeECCCCCeEEe-cC
Confidence            466 77888888886533667787 64


No 78 
>PHA03399 pif3 per os infectivity factor 3; Provisional
Probab=23.82  E-value=57  Score=25.29  Aligned_cols=10  Identities=10%  Similarity=-0.486  Sum_probs=7.1

Q ss_pred             ceeeCCCCCC
Q psy2060          53 LHDMGARYHW   62 (193)
Q Consensus        53 ~~C~C~~g~~   62 (193)
                      .-|+|.+||+
T Consensus       159 ~dC~C~~gyt  168 (200)
T PHA03399        159 DDCVCASGYT  168 (200)
T ss_pred             hhcCcCCCCE
Confidence            4577777776


No 79 
>COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown]
Probab=23.35  E-value=1.2e+02  Score=21.83  Aligned_cols=34  Identities=12%  Similarity=0.325  Sum_probs=23.8

Q ss_pred             CCCCceEEEEecCCeeEEEEEEeCCCceEEEEcCC
Q psy2060         157 NERHDFIALELTDSGRSVQFSWSLGSEVARLNRAD  191 (193)
Q Consensus       157 ~~~~dfi~l~l~~G~~~l~~~~~~g~g~~~l~s~~  191 (193)
                      .+..+|+.+.+.+|.+ ++..|+.-.+...+.|.|
T Consensus        44 ~gwk~f~~iti~dGKi-v~~~ydy~~k~G~~Ks~D   77 (147)
T COG4939          44 HGWKAFVTITIQDGKI-VACTYDYRDKKGNIKSDD   77 (147)
T ss_pred             cCccceEEEEEeCCEE-EEEEeeeecCCCCccccc
Confidence            4567899999999982 345676666666666654


No 80 
>PF07546 EMI:  EMI domain;  InterPro: IPR011489 The EMI domain, first named after its presence in proteins of the EMILIN family, is a small cysteine-rich module of around 75 amino acids. The EMI domain is most often found at the N terminus of metazoan extracellular proteins that are forming or are compatible with multimer formation []. It is found in association with other domains, such as C1q, laminin-type EGF-like, collagen-like, FN3, WAP, ZP or FAS1 []. It has been suggested that the EMI domain could be a protein-protein interaction module, as the EMI domain of EMILIN-1 was found to interact with the C1q domain of EMILIN-2 [].  The EMI domain possesses six highly conserved cysteines residues, which likely form disulphide bonds. Other key features of the EMI domain are the C-C-x-G-[WYFH] pattern, a hydrophobic position just preceding the first cysteine (Cys1) of the domain and a cluster of hydrophobic residues between Cys3 and Cys4. The EMI domain could be made of two sub-domains, the fold of the second one sharing similarities with the C-terminal sub-module characteristic of EGF-like domains [].  Proteins known to contain a EMI domain include:   Vertebrate Emilins, extracellular matrix glycoproteins. Vertebrate Multimerins, extracellular matrix glycoproteins.  Vetebrate Emu proteins, which could interact with several different extracellular matrix components and serve to connect and integrate the function of multiple partner molecules.  Vertebrate beta-IG-H3.  Vertebrate osteoblast-specific factor 2 (OSF-2).  Mammalian NEU1/NG3 proteins.  Drosophila midline fasciclin.   Caenorhabditis elegans ced-1, a transmembrane receptor that mediates cell corpse engulfment.     The Pfam alignment for this domain is truncated at the C terminus and does not include the final cysteine []. This is to stop the family overlapping with other domains.; GO: 0005515 protein binding
Probab=22.26  E-value=83  Score=19.79  Aligned_cols=6  Identities=33%  Similarity=0.656  Sum_probs=3.5

Q ss_pred             CCCccC
Q psy2060          30 PDNEGL   35 (193)
Q Consensus        30 ~g~~g~   35 (193)
                      |||.|.
T Consensus        66 pGy~~~   71 (72)
T PF07546_consen   66 PGYSGT   71 (72)
T ss_pred             cCcccC
Confidence            366664


No 81 
>PHA02151 hypothetical protein
Probab=21.92  E-value=1.5e+02  Score=22.18  Aligned_cols=31  Identities=29%  Similarity=0.534  Sum_probs=22.7

Q ss_pred             CCCeEEEEeecCCCCCceEEEEec--CCeeEEEEE
Q psy2060         145 HDSGLLLYNGRYNERHDFIALELT--DSGRSVQFS  177 (193)
Q Consensus       145 ~~~glLl~~g~~~~~~dfi~l~l~--~G~~~l~~~  177 (193)
                      .++||.||.|-.+...-|+.|.+.  +|+  +.++
T Consensus       140 ~~~glcly~g~~g~adv~lvl~~~~~~g~--~~~~  172 (217)
T PHA02151        140 RADGLCLYAGYKGNADVFLVLHMENVDGQ--TYFN  172 (217)
T ss_pred             ccCceEEecccCCCCcEEEEEeeecccce--EEec
Confidence            578999999753455668888776  677  7664


Done!