Query         psy7014
Match_columns 500
No_of_seqs    256 out of 2648
Neff          7.3 
Searched_HMMs 46136
Date          Sat Aug 17 00:36:32 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7014.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7014hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3514|consensus              100.0 3.6E-37 7.8E-42  331.7  24.6  344    2-491   356-729 (1591)
  2 KOG1219|consensus              100.0 2.8E-35 6.2E-40  331.1  25.6  314   23-422  3645-3977(4289)
  3 KOG4289|consensus              100.0 1.5E-31 3.3E-36  293.5  25.5  293   41-489  1298-1604(2531)
  4 KOG3516|consensus              100.0 2.8E-31   6E-36  291.9  25.9  310   39-492   763-1078(1306)
  5 KOG3514|consensus              100.0 1.6E-30 3.5E-35  280.7  24.2  314   38-496   807-1133(1591)
  6 KOG3516|consensus              100.0 3.3E-27 7.2E-32  259.9  29.9  208    9-297   123-338 (1306)
  7 PF00054 Laminin_G_1:  Laminin   99.9 4.4E-23 9.6E-28  184.0  14.8  130   97-298     1-131 (131)
  8 cd00110 LamG Laminin G domain;  99.9 3.1E-20 6.8E-25  167.9  18.0  149   67-293     3-151 (151)
  9 smart00282 LamG Laminin G doma  99.8 3.8E-20 8.3E-25  165.1  17.1  134   90-295     2-135 (135)
 10 KOG4289|consensus               99.8 6.6E-20 1.4E-24  202.9  18.3  226   41-360  1519-1762(2531)
 11 PF02210 Laminin_G_2:  Laminin   99.8 5.9E-18 1.3E-22  148.2  14.0  127   97-295     1-128 (128)
 12 KOG1219|consensus               99.3 6.6E-12 1.4E-16  145.0  12.6  108  286-395  3867-3980(4289)
 13 KOG3509|consensus               99.1 2.3E-10 4.9E-15  128.5  11.7  166  216-391   312-478 (964)
 14 PF00054 Laminin_G_1:  Laminin   98.4 4.3E-07 9.3E-12   80.9   5.9   42  454-495     1-42  (131)
 15 cd00110 LamG Laminin G domain;  98.3 1.8E-06 3.9E-11   77.6   8.1   46  446-491    20-65  (151)
 16 smart00210 TSPN Thrombospondin  98.3 1.7E-05 3.8E-10   74.8  14.9  130   87-293    50-181 (184)
 17 smart00282 LamG Laminin G doma  98.1 8.7E-06 1.9E-10   72.3   8.1   47  448-494     3-49  (135)
 18 PF00008 EGF:  EGF-like domain   97.8   8E-06 1.7E-10   54.5   1.8   26  364-389     5-31  (32)
 19 PF00008 EGF:  EGF-like domain   97.8 1.3E-05 2.7E-10   53.5   1.6   31  320-350     1-32  (32)
 20 PF13385 Laminin_G_3:  Concanav  97.7 0.00067 1.5E-08   60.2  12.3  147   66-296     3-150 (157)
 21 PF02210 Laminin_G_2:  Laminin   97.6 6.3E-05 1.4E-09   65.2   4.3   40  454-493     1-40  (128)
 22 smart00179 EGF_CA Calcium-bind  97.6 8.8E-05 1.9E-09   51.0   4.0   37  355-392     2-39  (39)
 23 PF07645 EGF_CA:  Calcium-bindi  97.4 0.00011 2.5E-09   52.0   2.3   34  354-387     1-34  (42)
 24 cd00054 EGF_CA Calcium-binding  97.2 0.00043 9.3E-09   46.9   3.8   36  356-392     3-38  (38)
 25 cd00152 PTX Pentraxins are pla  97.1   0.017 3.7E-07   55.2  14.5   76  216-297    88-165 (201)
 26 smart00159 PTX Pentraxin / C-r  97.0   0.016 3.4E-07   55.7  13.9   78  214-297    86-165 (206)
 27 smart00179 EGF_CA Calcium-bind  97.0 0.00077 1.7E-08   46.2   3.5   35  318-352     3-39  (39)
 28 cd00053 EGF Epidermal growth f  96.9  0.0016 3.4E-08   43.3   3.9   30  363-392     6-36  (36)
 29 PF02973 Sialidase:  Sialidase,  96.8   0.035 7.5E-07   52.5  13.9  140   89-297    33-177 (190)
 30 cd00054 EGF_CA Calcium-binding  96.8  0.0014 3.1E-08   44.2   3.4   35  318-352     3-38  (38)
 31 smart00181 EGF Epidermal growt  96.7  0.0023 5.1E-08   42.9   3.8   28  364-392     7-35  (35)
 32 KOG1225|consensus               96.6  0.0033 7.2E-08   67.9   5.9   75  306-394   269-343 (525)
 33 KOG1214|consensus               96.5   0.016 3.4E-07   64.4  10.7  102  317-418   734-858 (1289)
 34 cd00053 EGF Epidermal growth f  96.5  0.0035 7.7E-08   41.5   3.5   32  320-351     2-35  (36)
 35 KOG1214|consensus               96.4  0.0045 9.8E-08   68.5   5.7   57  330-389   800-859 (1289)
 36 KOG1217|consensus               96.3  0.0056 1.2E-07   65.0   6.0   78  317-394   271-355 (487)
 37 smart00181 EGF Epidermal growt  96.3  0.0045 9.8E-08   41.5   3.3   31  320-351     2-34  (35)
 38 KOG4260|consensus               96.0  0.0068 1.5E-07   59.3   4.0   93  322-414   149-301 (350)
 39 KOG1225|consensus               96.0   0.013 2.8E-07   63.5   6.5   58  325-395   256-313 (525)
 40 KOG1217|consensus               95.9   0.014 3.1E-07   61.9   6.4   76  319-394   128-208 (487)
 41 PF00354 Pentaxin:  Pentaxin fa  95.3     0.2 4.2E-06   47.8  11.2   78  214-297    80-159 (195)
 42 PF12947 EGF_3:  EGF domain;  I  95.2    0.01 2.2E-07   40.7   1.4   27  363-389     6-32  (36)
 43 PF07974 EGF_2:  EGF-like domai  95.2   0.026 5.6E-07   37.6   3.4   26  364-391     7-32  (32)
 44 PF12661 hEGF:  Human growth fa  95.2  0.0066 1.4E-07   32.0   0.4   13   45-57      1-13  (13)
 45 PF12661 hEGF:  Human growth fa  95.2  0.0064 1.4E-07   32.1   0.3   13  379-391     1-13  (13)
 46 PF07645 EGF_CA:  Calcium-bindi  94.1   0.043 9.4E-07   38.7   2.5   30  318-347     3-34  (42)
 47 PF07974 EGF_2:  EGF-like domai  93.7   0.072 1.6E-06   35.4   2.9   27  323-351     6-32  (32)
 48 KOG1226|consensus               92.3    0.26 5.7E-06   54.9   6.3   56  339-398   567-626 (783)
 49 PHA03099 epidermal growth fact  92.1    0.12 2.6E-06   45.1   2.7   31  364-395    52-84  (139)
 50 PF06439 DUF1080:  Domain of Un  90.8    0.45 9.7E-06   44.1   5.5   37  210-247   119-155 (185)
 51 PHA02887 EGF-like protein; Pro  90.8    0.19   4E-06   43.3   2.5   30  364-394    93-124 (126)
 52 smart00051 DSL delta serrate l  90.8    0.33 7.1E-06   37.6   3.7   47  338-391    17-63  (63)
 53 smart00560 LamGL LamG-like jel  90.8    0.71 1.5E-05   40.9   6.5   67  218-296    61-129 (133)
 54 KOG3509|consensus               90.3     3.5 7.5E-05   48.0  12.8  170  218-395   654-844 (964)
 55 KOG1226|consensus               89.4    0.76 1.6E-05   51.4   6.5   43  348-394   539-582 (783)
 56 PF12947 EGF_3:  EGF domain;  I  88.7    0.25 5.4E-06   33.8   1.4   27  323-349     6-32  (36)
 57 PF14670 FXa_inhibition:  Coagu  88.5    0.34 7.5E-06   33.1   2.0   18  370-387    11-28  (36)
 58 KOG4260|consensus               87.8    0.51 1.1E-05   46.6   3.4   48  342-393   132-183 (350)
 59 cd01475 vWA_Matrilin VWA_Matri  87.5    0.53 1.1E-05   45.6   3.5   32  355-388   187-218 (224)
 60 PHA03099 epidermal growth fact  86.5    0.45 9.7E-06   41.7   2.0   31  323-354    51-83  (139)
 61 PF12662 cEGF:  Complement Clr-  85.6    0.75 1.6E-05   28.5   2.1   11  377-387     1-11  (24)
 62 PHA02887 EGF-like protein; Pro  85.4    0.65 1.4E-05   40.0   2.4   22   39-60    103-124 (126)
 63 PF01414 DSL:  Delta serrate li  84.7     0.3 6.5E-06   37.8   0.1   41  338-391    17-63  (63)
 64 KOG1836|consensus               75.8     1.4   3E-05   54.3   1.7   80  212-301  1611-1690(1705)
 65 KOG3546|consensus               74.4      11 0.00025   41.5   7.8   66  217-293   156-223 (1167)
 66 PF14670 FXa_inhibition:  Coagu  72.4     2.3 4.9E-05   29.1   1.4   18  330-347    11-28  (36)
 67 KOG1834|consensus               70.4      75  0.0016   35.6  12.8  147   87-296   364-518 (952)
 68 KOG1836|consensus               67.8     4.7  0.0001   50.0   3.6   52  341-394   760-814 (1705)
 69 PF12955 DUF3844:  Domain of un  65.4     3.9 8.4E-05   34.8   1.7   39  358-396     8-64  (103)
 70 PF00954 S_locus_glycop:  S-loc  64.7     6.9 0.00015   33.4   3.2   31  356-388    78-108 (110)
 71 KOG0994|consensus               63.8     6.5 0.00014   46.1   3.5   58  335-394   882-950 (1758)
 72 PF13385 Laminin_G_3:  Concanav  60.7      19 0.00042   31.1   5.5   46  445-492    21-70  (157)
 73 PF06247 Plasmod_Pvs28:  Plasmo  58.0     5.8 0.00012   37.3   1.5   63  323-387     6-79  (197)
 74 PF12946 EGF_MSP1_1:  MSP1 EGF   56.3     5.7 0.00012   27.3   0.9   24  364-387     6-30  (37)
 75 PF12946 EGF_MSP1_1:  MSP1 EGF   56.1     6.2 0.00013   27.1   1.1   28  320-347     2-30  (37)
 76 PF00053 Laminin_EGF:  Laminin   52.9     9.2  0.0002   27.6   1.6   22  370-393    12-33  (49)
 77 PF04863 EGF_alliinase:  Alliin  51.5     3.7   8E-05   30.7  -0.6   22   40-61     32-53  (56)
 78 cd00055 EGF_Lam Laminin-type e  45.1      21 0.00044   25.9   2.5   16  378-393    19-34  (50)
 79 PF02973 Sialidase:  Sialidase,  44.5      95  0.0021   29.5   7.4   49  445-493    32-83  (190)
 80 cd06899 lectin_legume_LecRK_Ar  43.5 2.7E+02  0.0059   27.1  10.9   26  214-240   159-186 (236)
 81 cd01475 vWA_Matrilin VWA_Matri  43.1      21 0.00044   34.4   2.9   34  313-348   182-218 (224)
 82 smart00210 TSPN Thrombospondin  41.9      70  0.0015   29.8   6.2   45  446-490    52-97  (184)
 83 PF00139 Lectin_legB:  Legume l  39.8 2.4E+02  0.0051   27.3   9.8   29  211-240   160-190 (236)
 84 PF14099 Polysacc_lyase:  Polys  38.9 1.2E+02  0.0026   28.9   7.5   22  120-141   112-133 (224)
 85 cd01951 lectin_L-type legume l  38.7 3.4E+02  0.0074   25.8  10.7   23  218-241   154-178 (223)
 86 PF04863 EGF_alliinase:  Alliin  32.4      21 0.00045   26.8   0.7   33  323-355    17-53  (56)
 87 PF01683 EB:  EB module;  Inter  32.3      62  0.0013   23.4   3.3   20  364-387    27-46  (52)
 88 smart00180 EGF_Lam Laminin-typ  29.1      51  0.0011   23.5   2.3   18   43-60     17-34  (46)
 89 PF12955 DUF3844:  Domain of un  28.9      42 0.00091   28.6   2.1   23  322-344    12-39  (103)
 90 PF11250 DUF3049:  Protein of u  24.5   2E+02  0.0044   21.7   4.9   38  451-488    17-55  (56)
 91 PF14607 GxDLY:  N-terminus of   23.8 2.4E+02  0.0051   25.7   6.1   13  216-229    91-103 (147)
 92 KOG1218|consensus               23.3   1E+02  0.0023   30.7   4.3   56  338-394   162-223 (316)
 93 KOG1218|consensus               22.8 1.7E+02  0.0037   29.2   5.7   53  339-396   125-180 (316)
 94 cd00152 PTX Pentraxins are pla  22.1 2.5E+02  0.0054   26.4   6.4   46  444-489    29-77  (201)
 95 PF07622 DUF1583:  Protein of u  21.9 4.9E+02   0.011   27.6   8.7   33  213-246    85-117 (399)

No 1  
>KOG3514|consensus
Probab=100.00  E-value=3.6e-37  Score=331.75  Aligned_cols=344  Identities=21%  Similarity=0.367  Sum_probs=246.6

Q ss_pred             cccccccCCCCCCCCccee--cc--ccccc-cCCeeeeecccc-----cCCCc-eeecCCC--------CCCC---CCCC
Q psy7014           2 YAHHLQSCPELTNPDNIKI--LG--RHRQE-ENDRIVNFQHYF-----DTNQP-INQLLSI--------IFTN---FLPP   59 (500)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~--~~--~~~~~-~~~~~~~~~~~~-----~~~~~-~c~c~~~--------~~G~---~C~~   59 (500)
                      |.+|+|++||||++.|...  +|  .|... +..| |.|-.|.     +++.. +-.-...        ..|.   .|+.
T Consensus       356 ~t~~~~~a~~~tmlsss~~fyvgg~~~~~~l~gsr-VsF~GClkkV~y~~d~~rl~L~~LAk~g~~~~k~~G~l~y~C~n  434 (1591)
T KOG3514|consen  356 RTEIRQYAPELTMLSSSDFFYVGGSPNTADLPGSR-VSFMGCLKKVVYKNDDTRLELSRLAKQGDSKMKTEGDLSYSCEN  434 (1591)
T ss_pred             EecccccccceeEeeccceEEecCCCCccccCCCc-eeeeeeeeeeEeccCceeehhhHHhhcCCceeEeeceEEEecCC
Confidence            7899999999999999984  44  33333 3334 3577762     33321 1111111        2222   4888


Q ss_pred             CCccccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEc
Q psy7014          60 DIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNL  139 (500)
Q Consensus        60 ~~~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~  139 (500)
                      .......+|...    .||+.+|.+. .....+|+|.|||+.++  |||||++..  .....||+|+||.||++.+.+++
T Consensus       435 ~~~~DpvtFtt~----es~l~LP~Wn-t~~~gSiSf~FRTtepn--Glil~~~g~--~~~~~d~~A~ELldghlyl~ldl  505 (1591)
T KOG3514|consen  435 VAQLDPVTFTTP----ESYLTLPRWN-TKKSGSISFDFRTTEPN--GLILFHGGP--QANATDYFAIELLDGHLYLLLDL  505 (1591)
T ss_pred             CCccCceeeecc----cceeeccccc-cCCcceeEEEEeecCCC--ceEEEccCc--ccccccEEEEEEeCCeEEEEEec
Confidence            776666789876    8999999964 56789999999988777  999999752  45778999999999999999999


Q ss_pred             CCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCC
Q psy7014         140 GSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKR  219 (500)
Q Consensus       140 G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~  219 (500)
                      |+|...             |                                                +. ...++|||.
T Consensus       506 GSG~ik-------------l------------------------------------------------ra-s~rkv~DGe  523 (1591)
T KOG3514|consen  506 GSGVIK-------------L------------------------------------------------RA-SSRKVNDGE  523 (1591)
T ss_pred             CCceEE-------------e------------------------------------------------ee-ecccccCCc
Confidence            997542             2                                                21 122669999


Q ss_pred             ccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCC---cCCCCCccccccceeecccc
Q psy7014         220 GGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHD---LPLHSGFSGCIFDVELSAGN  296 (500)
Q Consensus       220 WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~---~~~~~gF~GCIr~v~ing~~  296 (500)
                      | |+|.+.|+++.+.++||... .....||....|+++.++|+|-.+  +....|.+   +..+.||+||||++.|+|..
T Consensus       524 W-hhv~l~R~gR~gsvsVd~~~-~df~tpG~s~iL~ld~~mylG~~~--n~l~~P~~vWta~L~~GyvGCirdl~i~G~s  599 (1591)
T KOG3514|consen  524 W-HHVDLQRDGRTGSVSVDAIK-TDFSTPGDSEILDLDDPMYLGEVP--NNLVYPSEVWTAALRKGYVGCIRDLFIDGVS  599 (1591)
T ss_pred             e-EEEEeeccCccceEEEeeee-cCccCCCcceeEeecCceeeccCC--CCccCcHHHHHHHHhccchheehhheeccee
Confidence            9 99999999999999999976 567778999999999999999553  33344433   45778999999999999999


Q ss_pred             cccccccccccCCCCcc-ccCC---ccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEe
Q psy7014         297 VGINLYKTRAAEGRGVG-QCGT---SQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCV  372 (500)
Q Consensus       297 ~~l~~~~~~~~~~~~v~-~C~~---~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~  372 (500)
                      .++..... ...+.++. .|..   ..|.++||+|+|+|...|+.                                   
T Consensus       600 ~di~q~ae-~q~sagvkpsCs~~~~~~C~~nPC~N~g~C~egwNr-----------------------------------  643 (1591)
T KOG3514|consen  600 TDIRQEAE-AQNSAGVKPSCSLSNEKICESNPCQNGGKCSEGWNR-----------------------------------  643 (1591)
T ss_pred             hhhHHHhh-hccccccCcccchhhccccCCCcccCCCCccccccc-----------------------------------
Confidence            88754321 22333343 3431   34555555555555555544                                   


Q ss_pred             eCCCCeeeeCCC-CCCCCCcCCCCCCcCccccCCCceEEecCCCccccccccccccccccccccccCCccccccceeeEE
Q psy7014         373 PLTHSYECDCPP-GRTGKFCEKDESLSDISFSGRRSYISLPSSELHLIINESLSDISFSGRRSYISLPSSELHLHEACID  451 (500)
Q Consensus       373 ~~~~g~~C~C~~-G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~~~~~e~~~~~~f~~~~~~~~~p~~~~~~~~~~i~  451 (500)
                           |.|+|.. +|.|+.||.+.  ..+.|+|. .|+.+-.+..                         . ..+.+.|.
T Consensus       644 -----fiCDCs~T~~~G~~CerE~--t~ls~nGs-~~m~i~L~~~-------------------------~-~tq~E~v~  689 (1591)
T KOG3514|consen  644 -----FICDCSGTGFEGRTCEREA--TALSYNGS-MSMKIVLPHT-------------------------M-HTQAEDVS  689 (1591)
T ss_pred             -----cccccccCcccCcccccee--eeEEEcCe-eeEEEEeccc-------------------------c-eeecceEE
Confidence                 5555543 56666666543  45789996 5655543311                         1 12677999


Q ss_pred             EEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcC
Q psy7014         452 LEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLML  491 (500)
Q Consensus       452 l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g  491 (500)
                      ++|||..+-||||-.+.....|-+.|+|.+|+|++.+++.
T Consensus       690 iRF~t~r~~Gll~~Tta~~s~D~l~l~L~~g~vkl~v~ls  729 (1591)
T KOG3514|consen  690 IRFRTQRAYGLLFATTARGSADTLRLELDAGQVKLFVNLS  729 (1591)
T ss_pred             EEEEecccceeEEEeccCCCCceEEEEEecceEEEEEecC
Confidence            9999999999999998887899999999999999999976


No 2  
>KOG1219|consensus
Probab=100.00  E-value=2.8e-35  Score=331.10  Aligned_cols=314  Identities=21%  Similarity=0.354  Sum_probs=250.9

Q ss_pred             ccccccCCeeeeecccc---cCCCceeecCCCCCCCCCCCCCccccccccCCCCCCcceEEeecCCCCceEEEEEEEEee
Q psy7014          23 RHRQEENDRIVNFQHYF---DTNQPINQLLSIIFTNFLPPDIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVP   99 (500)
Q Consensus        23 ~~~~~~~~~~~~~~~~~---~~~~~~c~c~~~~~G~~C~~~~~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt   99 (500)
                      .|.|=+.+++-.-+.|.   -+..+.|.||.|..| .|+.+.   ..++.|     +||.+|....+...++.+.|++||
T Consensus      3645 ~~~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~~---elS~tG-----nSYveyrlse~~n~~~kl~frLkT 3715 (4289)
T KOG1219|consen 3645 ETNQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGDF---ELSSTG-----NSYVEYRLSENQNTRMKLGFRLKT 3715 (4289)
T ss_pred             ccCccccCCCcccCcccccccccceeEeccCcccc-cCCCcc---eEeecC-----ceeEEEEcccccccceEEEEEEEe
Confidence            45555666665555443   255789999999665 499874   558888     999999998776566899999988


Q ss_pred             CCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhcccccccccccccccccccc
Q psy7014         100 NSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVD  179 (500)
Q Consensus       100 ~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d  179 (500)
                      .+.+  |++||..       ..|+..|.|.+|.+.+.|+.|+|.+                                   
T Consensus      3716 ~~sn--gIiM~tr-------~~d~~iLkLv~G~~~l~~~cgsG~G----------------------------------- 3751 (4289)
T KOG1219|consen 3716 LQSN--GIIMYTR-------KTDLAILKLVGGSPQLLADCGSGPG----------------------------------- 3751 (4289)
T ss_pred             cccC--cEEEEEc-------CCceEEEEecCCcEEEEEecCCCCC-----------------------------------
Confidence            8666  9999995       3499999999999999999999643                                   


Q ss_pred             ceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCc
Q psy7014         180 FLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPM  259 (500)
Q Consensus       180 ~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~  259 (500)
                                                 +.+.++..+|||+| |.|.+.|+++.+.|+||+.......+|+....|+++.-
T Consensus      3752 ---------------------------ivg~q~~~VnDgqW-Hsialerrr~~irlsvDd~~~~~atvPg~~~tln~d~h 3803 (4289)
T KOG1219|consen 3752 ---------------------------IVGSQKRTVNDGQW-HSIALERRRNHIRLSVDDDTYDSATVPGMKSTLNLDTH 3803 (4289)
T ss_pred             ---------------------------cccccceEeecCce-eEEEeeccCCceEEEEcccCceeeecccceeeccccce
Confidence                                       22344457799999 99999999999999999999899999999999999999


Q ss_pred             eEEcccccccCcCCCCCcCCCCCccccccceeecccccccccccccc---cCCCCcc-ccC--CccCCCCCCCCCCEEee
Q psy7014         260 LYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNVGINLYKTRA---AEGRGVG-QCG--TSQCHNHTCSHGGACMN  333 (500)
Q Consensus       260 lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~---~~~~~v~-~C~--~~~C~~~pC~ngg~Ci~  333 (500)
                      ||+||.-..   .-.+......||.|||+.+.+||..+++.......   ....... .|.  .++|..+||+|||+|..
T Consensus      3804 iy~Ga~vrl---r~~~~tqvs~Gf~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~npCqhgG~C~~ 3880 (4289)
T KOG1219|consen 3804 IYLGALVRL---RHQRSTQVSYGFDGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCNDNPCQHGGTCIS 3880 (4289)
T ss_pred             EEEeeEeee---ccCCCccccccccceeeeEEEccccccccCCCchhhhhhhhhcccccccccccccccCcccCCCEecC
Confidence            999997420   11122456789999999999999888764322111   2223333 343  38999999999999998


Q ss_pred             CC-CceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCC---CCc------Ccccc
Q psy7014         334 HG-ATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDE---SLS------DISFS  403 (500)
Q Consensus       334 ~~-~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~---~~~------~~~F~  403 (500)
                      .. +.|.|.|+..|.|.+|+.++.+|.++  ||.+||+|++..++|.|.|+.||+|++||.+.   +..      +.|.+
T Consensus      3881 ~~~ggy~CkCpsqysG~~CEi~~epC~sn--PC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n 3958 (4289)
T KOG1219|consen 3881 QPKGGYKCKCPSQYSGNHCEIDLEPCASN--PCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECSKNVCGTGGQCIN 3958 (4289)
T ss_pred             CCCCceEEeCcccccCcccccccccccCC--CCCCCCEEEecCCCeeEeCCCCccCceeecccccccccccccCCceeec
Confidence            75 78999999999999999999999996  89999999999999999999999999999883   221      23444


Q ss_pred             CCCceEEecCCCccccccc
Q psy7014         404 GRRSYISLPSSELHLIINE  422 (500)
Q Consensus       404 g~~sy~~~~~~~~~~~~~e  422 (500)
                      -.++|.|-+.+++.+..|+
T Consensus      3959 ~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3959 IPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred             cCCceEeccChhHhcccCc
Confidence            4578999999888866554


No 3  
>KOG4289|consensus
Probab=100.00  E-value=1.5e-31  Score=293.50  Aligned_cols=293  Identities=23%  Similarity=0.324  Sum_probs=217.6

Q ss_pred             CCCceeecCCC-CCCCCCCCCCccccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCC
Q psy7014          41 TNQPINQLLSI-IFTNFLPPDIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDA  119 (500)
Q Consensus        41 ~~~~~c~c~~~-~~G~~C~~~~~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~  119 (500)
                      ++...|+||.| |++++|+-.    +.+|.+     .||+.|..... +..+.++|+|-|.  ..+|||+|+|+     .
T Consensus      1298 nggf~c~Cp~ge~e~prC~v~----trSFp~-----~sfv~frglrq-Rfh~TlslsfaT~--~~nGlL~ynGn-----e 1360 (2531)
T KOG4289|consen 1298 NGGFCCHCPYGEFEDPRCEVT----TRSFPP-----ESFVTFRGLRQ-RFHFTLSLSFATI--ERNGLLLYNGN-----E 1360 (2531)
T ss_pred             CCceeccCCCcccCCCceEEE----eeccCc-----hheEEEecccc-ceEEEEEEEEEEe--eecceEEecCC-----c
Confidence            56788999998 888999974    578998     89999997643 4566677777554  55599999994     5


Q ss_pred             CCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCccccc
Q psy7014         120 ITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVD  199 (500)
Q Consensus       120 ~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~  199 (500)
                      ..||++|+++++.|+++|.+|.-                                                         
T Consensus      1361 khDFvalevVd~qvqltfS~Ges--------------------------------------------------------- 1383 (2531)
T KOG4289|consen 1361 KHDFVALEVVDEQVQLTFSAGES--------------------------------------------------------- 1383 (2531)
T ss_pred             ccceEeeeeeeeeEEEEEecccc---------------------------------------------------------
Confidence            67999999999999999999962                                                         


Q ss_pred             ccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCccee-------------eeCCCccccccCCCceEEcccc
Q psy7014         200 TYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVT-------------SKSPGRLTQLNTKPMLYLGGHF  266 (500)
Q Consensus       200 ~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~-------------~~~~~~~~~L~~~~~lyIGG~~  266 (500)
                          ..++....+-.++||+| |+|.+++.++.++++||++....             +...+....|++..+|++||+|
T Consensus      1384 ----~t~v~p~Vp~gvsDGqW-HtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvP 1458 (2531)
T KOG4289|consen 1384 ----TTTVSPDVPGGVSDGQW-HTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVP 1458 (2531)
T ss_pred             ----cceecCCCCCCcccCce-eEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCC
Confidence                12233344446789999 99999999999999999976421             1223455679999999999998


Q ss_pred             cccCcCCCCCcCCCCCccccccceeecccccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCC
Q psy7014         267 SKNFSILPHDLPLHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGW  346 (500)
Q Consensus       267 ~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy  346 (500)
                      +..       ......|.|||+++.++++.+++..+...  .+ ...                                 
T Consensus      1459 e~f-------pv~~k~FvGCmrdLsvD~~~VDma~fian--ng-t~e--------------------------------- 1495 (2531)
T KOG4289|consen 1459 ETF-------PVIEKQFVGCMRDLSVDGRDVDMATFIAN--NG-THE--------------------------------- 1495 (2531)
T ss_pred             Ccc-------hhhHhHhhhhhhhcccccccccHHHHHhh--cC-ccc---------------------------------
Confidence            421       12345799999999999999988654321  11 112                                 


Q ss_pred             CCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCCCCcCccccCCCceEEecCCCccccccccccc
Q psy7014         347 FGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDESLSDISFSGRRSYISLPSSELHLIINESLSD  426 (500)
Q Consensus       347 ~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~~~~~e~~~~  426 (500)
                         .|....+.|.+.  +|.|+|+|+++|++|.|.||.+|.|+.|+..... .-.|.|. |-+.+...            
T Consensus      1496 ---GC~ark~fCdsg--~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~-pq~frG~-sl~sw~~~------------ 1556 (2531)
T KOG4289|consen 1496 ---GCKARKNFCDSG--QCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAH-PQHFRGH-SLVSWEGL------------ 1556 (2531)
T ss_pred             ---CchhhhcccCCC--ccCCCCeeecccCcEeecCccccCCcchhhccCC-chhcccc-ceeeecCC------------
Confidence               244455667775  6999999999999999999999999999876432 2357774 65554311            


Q ss_pred             cccccccccccCCccccccceeeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEE
Q psy7014         427 ISFSGRRSYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVL  489 (500)
Q Consensus       427 ~~f~~~~~~~~~p~~~~~~~~~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~  489 (500)
                            ++-++        ....++|+|||++.+|+||-....+ ..-+.|+|.+|+|++.+.
T Consensus      1557 ------~~~vS--------vPwylsl~FRTr~ad~vl~~~~~~~-rst~~lqld~g~l~~~v~ 1604 (2531)
T KOG4289|consen 1557 ------PSQVS--------VPWYLSLMFRTRRADGVLMQAEFGG-RSTYNLQLDDGTLKYNVG 1604 (2531)
T ss_pred             ------Cccee--------cceEEEEEEEeeccccEEEEEEeCC-CceEEEEEcCCEEEEEec
Confidence                  11111        3468999999999999999775443 345999999999998764


No 4  
>KOG3516|consensus
Probab=100.00  E-value=2.8e-31  Score=291.87  Aligned_cols=310  Identities=20%  Similarity=0.285  Sum_probs=231.1

Q ss_pred             ccCCCceeecCCCCCCCCCCCCCc-cccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCC
Q psy7014          39 FDTNQPINQLLSIIFTNFLPPDIE-IGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQH  117 (500)
Q Consensus        39 ~~~~~~~c~c~~~~~G~~C~~~~~-~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~  117 (500)
                      +|++...|+-+....+-+|+.+.. ...++|.+.    .||+.|+...+ ..+.+|+|.|||+.++  |++|.+-     
T Consensus       763 gdTg~~~sea~~~lgPLrC~gDr~~wnsvSF~~~----~syL~fp~f~~-~~saDIsf~FrTt~~~--gvflen~-----  830 (1306)
T KOG3516|consen  763 GDTGRSQSEAPYVLGPLRCEGDRNFWNSVSFHTG----ASYLHFPPFHN-ELSADISFFFRTTASS--GVFLENH-----  830 (1306)
T ss_pred             ccCCCcccccceeecceEeecccccccceEeecC----cceeecCcccC-cccccEEEEEEecCCc--eEeeecc-----
Confidence            566665555566677889999765 466789874    78999999876 4789999999999777  9999884     


Q ss_pred             CCCCCeEEEEEECCE-EEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcc
Q psy7014         118 DAITDHLAVSFIKGY-VVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHM  196 (500)
Q Consensus       118 ~~~~df~~l~l~~G~-l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~  196 (500)
                       +..||+.|+|..+. |.|.++.|+|+.                                                    
T Consensus       831 -g~~dfir~eL~~~~~vtf~~dvgnGp~----------------------------------------------------  857 (1306)
T KOG3516|consen  831 -GINDFIRLELSSPVEVTFAFDVGNGPS----------------------------------------------------  857 (1306)
T ss_pred             -CCCceEEEEEcCCCceEEEEEcCCCce----------------------------------------------------
Confidence             46799999998764 999999999653                                                    


Q ss_pred             cccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCC-CccccccCCCceEEcccccccCcCCCC
Q psy7014         197 FVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSP-GRLTQLNTKPMLYLGGHFSKNFSILPH  275 (500)
Q Consensus       197 ~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~-~~~~~L~~~~~lyIGG~~~~~~~~~~~  275 (500)
                               .+....+..+||++| |+|+++|+.+.+.|+||+.......+| .....|.+...+||||...        
T Consensus       858 ---------~~~V~s~t~~nD~qW-H~V~~Ern~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vGgt~~--------  919 (1306)
T KOG3516|consen  858 ---------QLTVRSPTELNDNQW-HQVRAERNSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVGGTVS--------  919 (1306)
T ss_pred             ---------eEEEcCCcccCCCce-EEEEEEeccccceEEEcCcccceecCCCCCEEEEEeccceecccccc--------
Confidence                     333334457899999 999999999999999999987776665 4456788999999999632        


Q ss_pred             CcCCCCCccccccceeecccccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCCCCccccccc
Q psy7014         276 DLPLHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRY  355 (500)
Q Consensus       276 ~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i  355 (500)
                         -+.||.||||.+.|||..++|.   .++....++..-..                                      
T Consensus       920 ---~~~gF~GCIRsl~LNGv~ldLe---~ra~~~~gv~~GC~--------------------------------------  955 (1306)
T KOG3516|consen  920 ---RQRGFLGCIRSLQLNGVMLDLE---YRAYGTAGVSPGCE--------------------------------------  955 (1306)
T ss_pred             ---CcCcceeeeeeeeecceeeeeh---hhhccCCcccCCCc--------------------------------------
Confidence               3459999999999999998883   22222223321111                                      


Q ss_pred             ccccCCCccCCCCCEEeeCCCCeeeeCCC-CCCCCCcCCCCCCcCccccCCCceEEecCCCcc-cccccccccccccccc
Q psy7014         356 NLCDSTRHNCSFGATCVPLTHSYECDCPP-GRTGKFCEKDESLSDISFSGRRSYISLPSSELH-LIINESLSDISFSGRR  433 (500)
Q Consensus       356 ~~C~~~p~pC~ngg~C~~~~~g~~C~C~~-G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~-~~~~e~~~~~~f~~~~  433 (500)
                      -.|.+.  ||.|||+|+..+.+|.|+|.. .|.|+.|.++..   +.| .+++++.|...+.. ..+++....-++.   
T Consensus       956 GhCss~--~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~eig---~~f-e~gs~i~y~fq~~~~~a~~~~~~~~~~~--- 1026 (1306)
T KOG3516|consen  956 GHCSSY--PCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEIG---VFF-ERGSSIRYNFQKPMRSAVFESSRVKQKL--- 1026 (1306)
T ss_pred             cccccc--cccCCCEEEEecCceeeccccCcCCCCccccccc---eEe-cCCceEEEeccchHHHhhhhhhhhhhcc---
Confidence            234443  688888888888889999976 699999988753   334 45799988754322 1122211111111   


Q ss_pred             ccccCCccccccceeeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEE-CCEEEEEEEcCC
Q psy7014         434 SYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQ-GGVLELRVLMLG  492 (500)
Q Consensus       434 ~~~~~p~~~~~~~~~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~-~G~l~~~~~~g~  492 (500)
                             .........|.|.|+|+.+.++|+|+++- ..||+++-|+ +|.|+++|.+|.
T Consensus      1027 -------~~~~~~~e~i~~sftTt~~ps~LLfvssF-~~~y~~V~v~~nGsLq~ry~lg~ 1078 (1306)
T KOG3516|consen 1027 -------EIEINPNEEINFSFTTTRAPSDLLFVSSF-TDDYLAVLVKDNGSLQTRYMLGF 1078 (1306)
T ss_pred             -------ccccCccceEEEEEEeccCceEEEEeecc-ccceEEEEEeCCCceEEEEecCC
Confidence                   11233567999999999999999999887 4899999999 799999999998


No 5  
>KOG3514|consensus
Probab=99.97  E-value=1.6e-30  Score=280.66  Aligned_cols=314  Identities=20%  Similarity=0.311  Sum_probs=231.7

Q ss_pred             cccCCCceeecCCCCCCCCCCCCC-------ccccccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEE
Q psy7014          38 YFDTNQPINQLLSIIFTNFLPPDI-------EIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAF  110 (500)
Q Consensus        38 ~~~~~~~~c~c~~~~~G~~C~~~~-------~~~~~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly  110 (500)
                      .|++..++-+|+++-- .+|+...       -...+.|...    +||+.+...+. +.+++|.|+|||++++  |||+|
T Consensus       807 vFNG~~Yld~~K~~~~-~ls~l~a~fkl~~iv~~paTf~sk----~Sy~~la~L~a-y~s~~l~Fqfkt~sp~--gll~f  878 (1591)
T KOG3514|consen  807 VFNGQDYLDKCKMGDI-QLSELSARFKLRAIVADPATFKSK----SSYVKLATLQA-YFSMHLFFQFKTTSPD--GLLLF  878 (1591)
T ss_pred             EECcHHHHHHHhcCCc-chhhcchhhCceEEeeccceeeec----hhhhhhhhhhe-eeEEEEEEEEeecCCC--eEEEe
Confidence            3777777777776632 3455431       1233457664    79999988764 6889999999999888  99999


Q ss_pred             ecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeeccccc
Q psy7014         111 IGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYL  190 (500)
Q Consensus       111 ~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~  190 (500)
                      .+.     ..+||++|||++|+|+++|++|+|+                                               
T Consensus       879 n~g-----d~ndfi~velvnG~ihYtfdlg~gp-----------------------------------------------  906 (1591)
T KOG3514|consen  879 NSG-----DGNDFIAVELVNGYIHYTFDLGNGP-----------------------------------------------  906 (1591)
T ss_pred             cCC-----CCCceEEEEEeCcEEEEEEEcCCCc-----------------------------------------------
Confidence            974     4679999999999999999999964                                               


Q ss_pred             CCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCc-EEEEEEcCCcceeeeCCCccccccCCCceEEccccccc
Q psy7014         191 QPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQ-QCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKN  269 (500)
Q Consensus       191 ~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~-~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~  269 (500)
                                    ..++-+....+||++| |.|.|.|++. .-+|.||..... ....+ ...|++.+.|||||+..++
T Consensus       907 --------------~~~k~~sr~hlnDnrW-HnV~I~rd~~~~HtL~vD~s~~t-~~~~g-~~~l~l~g~LyiGGv~k~m  969 (1591)
T KOG3514|consen  907 --------------TSMKGPSRQHLNDNRW-HNVLIYRDKTNTHTLKVDNSSTT-QIIDG-AVNLDLKGKLYIGGVSKPM  969 (1591)
T ss_pred             --------------ccccCcccCcCccccc-eeEEEEcCCCCceEEEecCceEE-EEecC-ccccccccceecccccccc
Confidence                          3333344557789999 9999999865 468999998643 33333 6778999999999999888


Q ss_pred             CcCCCCCcCCCCCccccccceeecccccccccccccccCCCCc-cccC--CccCCCCCCCCCCEEeeCCCceeecCCCCC
Q psy7014         270 FSILPHDLPLHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGV-GQCG--TSQCHNHTCSHGGACMNHGATFSCLCADGW  346 (500)
Q Consensus       270 ~~~~~~~~~~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v-~~C~--~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy  346 (500)
                      ...++.....+.+|.||...+-+++....+.....  .....+ ..|.  ...|..+                       
T Consensus       970 ~~~~p~~~asR~g~~g~~~s~dl~~r~p~L~~~a~--~~s~lv~~~~sgpst~c~~~----------------------- 1024 (1591)
T KOG3514|consen  970 YSFLPKLVASRSGFQGCLASLDLGGRLPDLISDAL--FESGLVEVGCSGPSTTCSED----------------------- 1024 (1591)
T ss_pred             cccccceeeccCCCCCCcCccCccccchhHHHHhh--hhccceeeeccCCCcccchh-----------------------
Confidence            88888888889999999999999987665432211  111111 1222  1333333                       


Q ss_pred             CCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCC-CCCCCCcCCCCCCcCccccCCCceEEecCCCcccccccccc
Q psy7014         347 FGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPP-GRTGKFCEKDESLSDISFSGRRSYISLPSSELHLIINESLS  425 (500)
Q Consensus       347 ~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~-G~~G~~Ce~~~~~~~~~F~g~~sy~~~~~~~~~~~~~e~~~  425 (500)
                                       .|.|.|.|+..|.+|.|.|++ .|+|+.|....  ..+.|.+.++.+.|..|..         
T Consensus      1025 -----------------acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~g--tTYiFgk~gglI~YtwPpN--------- 1076 (1591)
T KOG3514|consen 1025 -----------------ACANHGVCIQQWNGIACDCSMTSYSGPRCNDPG--TTYIFGKSGGLITYTWPPN--------- 1076 (1591)
T ss_pred             -----------------hhhccceeeeeecceeeeccccccCCCccCCCc--eEEEECCCCceEEEecCCC---------
Confidence                             466666666666666666665 57777776653  3467888888888765533         


Q ss_pred             ccccccccccccCCccccccceeeEEEEEeeCCCCcEEEEcCCCC-CCCeEEEEEECCEEEEEEEcCCCCCC
Q psy7014         426 DISFSGRRSYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQ-KNSMMTLSLQGGVLELRVLMLGDRPK  496 (500)
Q Consensus       426 ~~~f~~~~~~~~~p~~~~~~~~~~i~l~frT~~~~GlLl~~~~~~-~~dfi~l~l~~G~l~~~~~~g~~~~~  496 (500)
                                     +++...+.+|.+.|+|++++|+|+-+.+.. .+||++|+|..|+|-+.||.|.....
T Consensus      1077 ---------------dRpsTr~DrlAvGFsTtq~daVLvRVdSAsglgDYlqLhI~qG~igvvfNiGt~Dit 1133 (1591)
T KOG3514|consen 1077 ---------------DRPSTRKDRLAVGFSTTQPDAVLVRVDSASGLGDYLQLHINQGKIGVVFNIGTDDIT 1133 (1591)
T ss_pred             ---------------CCCCcccceEEEEEEeccCceEEEEEeccCCCCceEEEEEeccEEEEEEeccCcccc
Confidence                           345557889999999999999999997664 58999999999999999999976543


No 6  
>KOG3516|consensus
Probab=99.96  E-value=3.3e-27  Score=259.93  Aligned_cols=208  Identities=18%  Similarity=0.198  Sum_probs=150.4

Q ss_pred             CCCCCCCCcceeccccccc-cCCeeeeecccccCCCceeecCCCCCCCC-----CCCCCccccccccCCCCCCcceEEee
Q psy7014           9 CPELTNPDNIKILGRHRQE-ENDRIVNFQHYFDTNQPINQLLSIIFTNF-----LPPDIEIGQASYSSSMSGLSSFSAYV   82 (500)
Q Consensus         9 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~c~c~~~~~G~~-----C~~~~~~~~~~f~g~~~~~~sy~~~~   82 (500)
                      -+.-+.++|+-+-++++.+ +.--+--|-.+    .|+-.||.|..|-+     |.-..  ....|+|     .|++.|+
T Consensus       123 ~~~wtf~Gn~n~~sVv~~~l~~~~~ar~vr~----~pl~wnp~grig~rVevygc~y~s--~vi~fdg-----~s~~~yr  191 (1306)
T KOG3516|consen  123 GSSWTFVGNVNADSVVYHELEPPIEARFVRI----LPLDWNPKGRIGMRVEVYGCSYKS--PVIYFDG-----SSSLLYR  191 (1306)
T ss_pred             CCccccccccccceEEeccccCcccceEEee----eeeeeCCCCcceeEEEEEeccccC--ceeEECC-----ccceeee
Confidence            4556677777776655444 22111111122    45778898888865     44432  4457999     7888888


Q ss_pred             cCCC--CceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhc
Q psy7014          83 IPAN--IHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLR  160 (500)
Q Consensus        83 ~~~~--~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~  160 (500)
                      ....  ......|+|+|||...+  |+|||..     ...+||+.|+|++|++++.+|+|+-               .+.
T Consensus       192 ~~~~~m~s~~d~is~~Fkt~~sd--Gvllh~e-----g~QGd~itlql~~~kl~l~ld~G~~---------------~~~  249 (1306)
T KOG3516|consen  192 FHRKLMSSLKDVISLKFKTMQSD--GVLLHGE-----GQQGDYITLQLIGGKLVLILDLGNS---------------KLP  249 (1306)
T ss_pred             ccccccccccceeEEEEEeeccc--eeEEEcc-----cCCCCEEEEEEeCCEEEEEEecCCc---------------cCc
Confidence            5433  34577899999887666  9999995     2578999999999999999999972               111


Q ss_pred             cccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCC
Q psy7014         161 SAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNM  240 (500)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~  240 (500)
                                                             .-+++..|.+..  .++|.+| |.|+|.|.++.+.++||+.
T Consensus       250 ---------------------------------------~s~~~~sis~Gs--lLdD~hW-HsV~i~r~~~~vnftvD~~  287 (1306)
T KOG3516|consen  250 ---------------------------------------SSRTPTSISAGS--LLDDQHW-HSVRIERQGRQVNFTVDGV  287 (1306)
T ss_pred             ---------------------------------------cccCcceeeccc--ccCCCcc-eEEEEEecCcEEEEEEccc
Confidence                                                   112445554444  3478899 9999999999999999997


Q ss_pred             cceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeeccccc
Q psy7014         241 GNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNV  297 (500)
Q Consensus       241 ~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~  297 (500)
                      . ......|.++.|+++..+++||+|.+..+     ......|.|||.+|.+|+..+
T Consensus       288 ~-~~fr~~Ge~~~Ldld~e~~~GGiP~~~~~-----~~~~~nF~GCienly~N~vdi  338 (1306)
T KOG3516|consen  288 V-HHFRATGEFDALDLDTEISFGGIPNDGKS-----VGFEKNFTGCLENLYYNGVDI  338 (1306)
T ss_pred             e-EeecccCccceeecceEEEECCccCCCcc-----cceeeeeeeeeeeeeecCcee
Confidence            6 45777899999999999999999875543     223478999999999997554


No 7  
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.90  E-value=4.4e-23  Score=184.02  Aligned_cols=130  Identities=33%  Similarity=0.549  Sum_probs=103.4

Q ss_pred             EeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccc
Q psy7014          97 FVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLIL  176 (500)
Q Consensus        97 Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~  176 (500)
                      |||..++  |||||.+++    ...||++|+|.+|+|+|+|++|+|                                  
T Consensus         1 frT~~~~--Gllly~g~~----~~~dfial~L~~G~l~~~~~~G~~----------------------------------   40 (131)
T PF00054_consen    1 FRTSEPN--GLLLYLGSK----DGKDFIALELRDGRLEFRYNLGSG----------------------------------   40 (131)
T ss_dssp             EEESSSS--EEEEEEESS----TTSSEEEEEEETTEEEEEEESSSE----------------------------------
T ss_pred             CccCCCC--ceEEECCcC----CCCCEEEEEEECCEEEEEEeCCCc----------------------------------
Confidence            7888777  999999864    344999999999999999999994                                  


Q ss_pred             cccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCcccc-cc
Q psy7014         177 GVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQ-LN  255 (500)
Q Consensus       177 ~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~-L~  255 (500)
                                                 +..+.+..+  ++||+| |+|.+.|.++.+.|+||+...+...++..... ++
T Consensus        41 ---------------------------~~~~~~~~~--i~dg~w-h~v~~~r~~~~~~L~Vd~~~~~~~~s~~~~~~~l~   90 (131)
T PF00054_consen   41 ---------------------------PASLRSPQK--INDGKW-HTVSVSRNGRNGSLSVDGEEVVTGESPSGATQSLD   90 (131)
T ss_dssp             ---------------------------EEEEEESSE--TTSSSE-EEEEEEEETTEEEEEETTSEEEEEEECSSSSSSCE
T ss_pred             ---------------------------cceecCCCc--cCCCcc-eEEEEEEcCcEEEEEECCccceeeecCCccccccc
Confidence                                       334444554  689999 99999999999999999988766777755555 88


Q ss_pred             CCCceEEcccccccCcCCCCCcCCCCCccccccceeecccccc
Q psy7014         256 TKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNVG  298 (500)
Q Consensus       256 ~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~~  298 (500)
                      ...+|||||+|..  ...+.......+|.|||+++.+|++.++
T Consensus        91 ~~~~lyvGG~p~~--~~~~~~~~~~~~f~GCi~~~~in~~~ld  131 (131)
T PF00054_consen   91 VDGPLYVGGLPSS--SSRPRPLPISPGFKGCIRNLSINGKPLD  131 (131)
T ss_dssp             ECSEEEESSSSTT--TGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred             cccCEEEccCCch--hhcccccccCCCeeEEEEEeEECCEECc
Confidence            8999999999822  2223345567799999999999987653


No 8  
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.85  E-value=3.1e-20  Score=167.90  Aligned_cols=149  Identities=32%  Similarity=0.456  Sum_probs=116.0

Q ss_pred             cccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEE
Q psy7014          67 SYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLV  146 (500)
Q Consensus        67 ~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~  146 (500)
                      +|.|     +||+.|+.+......+.|+|+|||+.++  |+|||.+..    ...+|++|+|.+|++++.++.|.+    
T Consensus         3 ~F~g-----~~~i~~~~~~~~~~~~~i~~~frt~~~~--g~l~~~~~~----~~~~~~~l~l~~g~l~~~~~~g~~----   67 (151)
T cd00110           3 SFSG-----SSYVRLPTLPAPRTRLSISFSFRTTSPN--GLLLYAGSQ----NGGDFLALELEDGRLVLRYDLGSG----   67 (151)
T ss_pred             EeCC-----CceEEecCCCCCcceeEEEEEEEeCCCC--eEEEEecCC----CCCCEEEEEEECCEEEEEEcCCcc----
Confidence            6777     7999999876546789999999998776  999999853    257999999999999999999852    


Q ss_pred             EeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEE
Q psy7014         147 YFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRV  226 (500)
Q Consensus       147 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v  226 (500)
                                                                               ...+.+..  .++||+| |+|.+
T Consensus        68 ---------------------------------------------------------~~~~~~~~--~v~dg~W-h~v~i   87 (151)
T cd00110          68 ---------------------------------------------------------SLVLSSKT--PLNDGQW-HSVSV   87 (151)
T ss_pred             ---------------------------------------------------------cEEEEccC--ccCCCCE-EEEEE
Confidence                                                                     22333333  5789999 99999


Q ss_pred             EEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014         227 GKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS  293 (500)
Q Consensus       227 ~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in  293 (500)
                      .+.++.+.|.||+........+.....++...++||||.|.....   .......+|.|||+++++|
T Consensus        88 ~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~---~~~~~~~~F~Gci~~v~in  151 (151)
T cd00110          88 ERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKS---PGLPVSPGFVGCIRDLKVN  151 (151)
T ss_pred             EECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhc---ccccccCCCceEeeEeEeC
Confidence            999999999999985444444433335677889999999753321   1234567999999999986


No 9  
>smart00282 LamG Laminin G domain.
Probab=99.85  E-value=3.8e-20  Score=165.12  Aligned_cols=134  Identities=32%  Similarity=0.469  Sum_probs=106.8

Q ss_pred             EEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhcccccccccc
Q psy7014          90 CFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCC  169 (500)
Q Consensus        90 ~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~  169 (500)
                      .++|+|.|||++++  |+|||.++.    ...+|++|+|.+|++++.++.|++..                         
T Consensus         2 ~~~i~~~frt~~~~--g~l~~~~~~----~~~~~l~l~l~~g~l~~~~~~g~~~~-------------------------   50 (135)
T smart00282        2 RLSISFSFRTTSPN--GLLLYAGSK----NGGDYLALELRDGRLVLRYDLGSGPA-------------------------   50 (135)
T ss_pred             ceEEEEEEEeCCCC--EEEEEeCCC----CCCCEEEEEEECCEEEEEEECCCCCE-------------------------
Confidence            46799999999777  999999742    35799999999999999999998432                         


Q ss_pred             ccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCC
Q psy7014         170 LPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPG  249 (500)
Q Consensus       170 ~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~  249 (500)
                                                          .+++ ....++||+| |+|.+.+.++.+.|.||+........++
T Consensus        51 ------------------------------------~~~~-~~~~~~dg~W-H~v~i~~~~~~~~l~VD~~~~~~~~~~~   92 (135)
T smart00282       51 ------------------------------------RLTS-DPTPLNDGQW-HRVAVERNGRRVTLSVDGENPVSGESPG   92 (135)
T ss_pred             ------------------------------------EEEE-CCeEeCCCCE-EEEEEEEeCCEEEEEECCCccccEECCC
Confidence                                                2222 2246799999 9999999999999999997655566666


Q ss_pred             ccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeeccc
Q psy7014         250 RLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAG  295 (500)
Q Consensus       250 ~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~  295 (500)
                      ....+++...+||||+|.....   .......+|.|||++|++|+.
T Consensus        93 ~~~~l~~~~~l~iGG~p~~~~~---~~~~~~~~F~GCi~~v~in~~  135 (135)
T smart00282       93 GLTILNLDGPLYLGGLPEDLKL---PPLLVTPGFRGCIRNLKVNGK  135 (135)
T ss_pred             CceEEecCCCcEEccCCchhcc---cccccCCCCeeEeeEEEECCC
Confidence            6677888899999999864321   224456799999999999973


No 10 
>KOG4289|consensus
Probab=99.83  E-value=6.6e-20  Score=202.91  Aligned_cols=226  Identities=16%  Similarity=0.248  Sum_probs=155.7

Q ss_pred             CCCceeecCCCCCCCCCCCCCccccc-cccCCCCCCcceEEeecCCC-CceEEEEEEEEeeCCCCCceEEEEecccCCCC
Q psy7014          41 TNQPINQLLSIIFTNFLPPDIEIGQA-SYSSSMSGLSSFSAYVIPAN-IHHCFELKFRFVPNSFDQIALLAFIGQDYQHD  118 (500)
Q Consensus        41 ~~~~~c~c~~~~~G~~C~~~~~~~~~-~f~g~~~~~~sy~~~~~~~~-~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~  118 (500)
                      -|.++|+||.+|.|+.|+..+  .-| -|.|     +|.+++..... ..-.+.++|.|||+..+  |+||-..-     
T Consensus      1519 Wg~~~C~CP~~fggk~c~~~m--~~pq~frG-----~sl~sw~~~~~~vSvPwylsl~FRTr~ad--~vl~~~~~----- 1584 (2531)
T KOG4289|consen 1519 WGGFSCECPLGFGGKGCCQGM--AHPQHFRG-----HSLVSWEGLPSQVSVPWYLSLMFRTRRAD--GVLMQAEF----- 1584 (2531)
T ss_pred             cCcEeecCccccCCcchhhcc--CCchhccc-----cceeeecCCCcceecceEEEEEEEeeccc--cEEEEEEe-----
Confidence            357899999999999999976  334 5999     78887774332 44578999999999888  99987742     


Q ss_pred             CCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccc
Q psy7014         119 AITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFV  198 (500)
Q Consensus       119 ~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~  198 (500)
                      +...-+.|+|.+|++.+  ++|....                                                      
T Consensus      1585 ~~rst~~lqld~g~l~~--~v~~s~v------------------------------------------------------ 1608 (2531)
T KOG4289|consen 1585 GGRSTYNLQLDDGTLKY--NVGDSSV------------------------------------------------------ 1608 (2531)
T ss_pred             CCCceEEEEEcCCEEEE--EecCceE------------------------------------------------------
Confidence            22234889999999876  4443111                                                      


Q ss_pred             cccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcC
Q psy7014         199 DTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLP  278 (500)
Q Consensus       199 ~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~  278 (500)
                               . ....+++||+| |.+.++.... ..++.|..... .........|++.. ||+||.|.         ..
T Consensus      1609 ---------~-L~~~~vtdg~W-h~~~i~l~~d-~~~t~d~g~~~-aea~~gl~gl~l~s-l~vGgap~---------~g 1665 (2531)
T KOG4289|consen 1609 ---------E-LPAPRVTDGHW-HHLVIELEAD-SVATLDYGIYQ-AEAKAGLSGLNLES-LYVGGAPA---------TG 1665 (2531)
T ss_pred             ---------E-ccCccccCCch-hheeeeeccC-eEEEEechhhh-hhhhcCCCCceeeE-EEEccccC---------CC
Confidence                     0 12336689999 9999998764 66677654322 22222245566665 99999873         33


Q ss_pred             CCCCccccccceeecccccccccccccccCCCCc-c------c--cC----CccCCCCCCCCCCEEeeCC--CceeecCC
Q psy7014         279 LHSGFSGCIFDVELSAGNVGINLYKTRAAEGRGV-G------Q--CG----TSQCHNHTCSHGGACMNHG--ATFSCLCA  343 (500)
Q Consensus       279 ~~~gF~GCIr~v~ing~~~~l~~~~~~~~~~~~v-~------~--C~----~~~C~~~pC~ngg~Ci~~~--~~~~C~C~  343 (500)
                      ...||+|||++|++.|..+..+.. .....+-.+ .      .  |+    .+.|.-+||.|.|+|+..+  ..|+|.|+
T Consensus      1666 ~p~gf~GCiqgV~v~g~~~l~~~k-v~~~~GCvvpn~C~~d~sC~c~~~~C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~ 1744 (2531)
T KOG4289|consen 1666 VPRGFRGCIQGVRVGGVSILVPKK-VNVEAGCVVPNPCSVDSSCPCDPYNCVDVCSLNPCENQGTCVRSPGAHGYTCECP 1744 (2531)
T ss_pred             ccccchhhhhceEECCEeeccccc-cccccCcccCCccccCCcccCCCCCccchhcccccccCceeecCCCCCceeEECC
Confidence            456999999999999877654421 110000000 1      1  22    3667789999999998765  57999999


Q ss_pred             CCCCCcccccccc-cccC
Q psy7014         344 DGWFGPLCASRYN-LCDS  360 (500)
Q Consensus       344 ~Gy~G~~C~~~i~-~C~~  360 (500)
                      +||.|+.|+.+.+ +|.+
T Consensus      1745 ~g~~G~~Ce~~~dq~CPr 1762 (2531)
T KOG4289|consen 1745 PGYTGPYCELRADQPCPR 1762 (2531)
T ss_pred             CcccCcchhhhccCCCCC
Confidence            9999999998764 4544


No 11 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.77  E-value=5.9e-18  Score=148.18  Aligned_cols=127  Identities=24%  Similarity=0.384  Sum_probs=96.8

Q ss_pred             EeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccccccccccc
Q psy7014          97 FVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLCCLPLHLIL  176 (500)
Q Consensus        97 Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~  176 (500)
                      |||+.++  |+|||.+..    ...+|++|+|.+|+|++.|++|++..                                
T Consensus         1 Frt~~~~--g~Ll~~~~~----~~~~~l~l~l~~g~l~~~~~~g~~~~--------------------------------   42 (128)
T PF02210_consen    1 FRTRSPN--GLLLYIGSE----DNGDFLSLELVDGRLVVRYNLGGSEI--------------------------------   42 (128)
T ss_dssp             EEESSSS--EEEEEEEES----TTSEEEEEEEETTEEEEEEESSSSEE--------------------------------
T ss_pred             CccCCCC--EeEEEEcCC----CCCEEEEEEEECCEEEEEEEccccce--------------------------------
Confidence            7888777  999999854    22689999999999999999995321                                


Q ss_pred             cccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccc-ccc
Q psy7014         177 GVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLT-QLN  255 (500)
Q Consensus       177 ~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~-~L~  255 (500)
                                                  ..+  .....++|++| |+|.+.|.++.+.|.||+............. .++
T Consensus        43 ----------------------------~~~--~~~~~~~dg~w-h~v~i~~~~~~~~l~Vd~~~~~~~~~~~~~~~~~~   91 (128)
T PF02210_consen   43 ----------------------------VTT--FSNSNLNDGQW-HKVSISRDGNRVTLTVDGQSVSSESLPSSSSDSLD   91 (128)
T ss_dssp             ----------------------------EEE--ECSSSSTSSSE-EEEEEEEETTEEEEEETTSEEEEEESSSTTHHCBE
T ss_pred             ----------------------------eee--ccCccccccce-eEEEEEEeeeeEEEEecCccceEEeccccceeccc
Confidence                                        112  22335689999 9999999999999999999876666655443 677


Q ss_pred             CCCceEEcccccccCcCCCCCcCCCCCccccccceeeccc
Q psy7014         256 TKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAG  295 (500)
Q Consensus       256 ~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~  295 (500)
                      ....+||||.|.......   .....+|.|||+++++||+
T Consensus        92 ~~~~l~iGg~~~~~~~~~---~~~~~~f~Gci~~l~vng~  128 (128)
T PF02210_consen   92 PDGSLYIGGLPESNQPSG---SVDTPGFVGCIRDLRVNGQ  128 (128)
T ss_dssp             SEEEEEESSTTTTCTCTT---SSTTSB-EEEEEEEEETTE
T ss_pred             CCCCEEEecccCcccccc---ccCCCCcEEEcCeEEECCC
Confidence            777899999976443221   1126799999999999974


No 12 
>KOG1219|consensus
Probab=99.33  E-value=6.6e-12  Score=144.97  Aligned_cols=108  Identities=26%  Similarity=0.679  Sum_probs=93.0

Q ss_pred             cccceeeccccccc---ccccccccCCCCccccCC--ccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccc-ccccc
Q psy7014         286 CIFDVELSAGNVGI---NLYKTRAAEGRGVGQCGT--SQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASR-YNLCD  359 (500)
Q Consensus       286 CIr~v~ing~~~~l---~~~~~~~~~~~~v~~C~~--~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~-i~~C~  359 (500)
                      |-++-..+|+.+.-   .-+.|.|...+....|+.  .+|+++||.+||+|++..+.|.|.|+.||+|.+|+.+ +++|.
T Consensus      3867 C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs 3946 (4289)
T KOG1219|consen 3867 CNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEARGISECS 3946 (4289)
T ss_pred             cccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecccccccc
Confidence            55666667666532   124566677777888984  9999999999999999999999999999999999998 89999


Q ss_pred             CCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCC
Q psy7014         360 STRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDE  395 (500)
Q Consensus       360 ~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~  395 (500)
                      .+  +|++||.|++..++|.|.|.+||.|.+|+...
T Consensus      3947 ~n--~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~~ 3980 (4289)
T KOG1219|consen 3947 KN--VCGTGGQCINIPGSFHCNCTPGILGRTCCAEK 3980 (4289)
T ss_pred             cc--cccCCceeeccCCceEeccChhHhcccCcccc
Confidence            85  89999999999999999999999999997763


No 13 
>KOG3509|consensus
Probab=99.13  E-value=2.3e-10  Score=128.49  Aligned_cols=166  Identities=27%  Similarity=0.510  Sum_probs=124.0

Q ss_pred             CCCCccEEEEEEEeCcEEEEEEcC-CcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeecc
Q psy7014         216 KSKRGGYTVRVGKNGQQCWLMVDN-MGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSA  294 (500)
Q Consensus       216 ~dg~WwH~V~v~r~~~~~~L~VD~-~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing  294 (500)
                      -.++| |.+.+.|   .-.+.+++ ..+....+......+...+.+|.||+  -+...+.+..+...||.|||+++.+++
T Consensus       312 ~~~E~-~~~~i~r---~s~~~~~g~~~~l~g~~~~~~~~i~~ee~v~lg~i--~ni~~l~~~~~~~eGf~gci~~~~~~~  385 (964)
T KOG3509|consen  312 YIGEW-RFGIIFR---GSGLSVSGHKGVLQGNSNILVSRITNEESVFLGGI--INIETLQHNLPLPEGFAGCIRDLVMNL  385 (964)
T ss_pred             cccee-eeeEeee---cccccccCcceeecccccccccceeecccccCCce--eeeccccccCCCccCccceehhhhhhc
Confidence            35699 9999988   44455555 44445555566666777888999996  445556666778889999999999999


Q ss_pred             cccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeC
Q psy7014         295 GNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPL  374 (500)
Q Consensus       295 ~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~  374 (500)
                      +.+...+.....+.  ....|..+.|...||.+.+.|.+..-...|.|+.+|.|..|+...+.|...++-+ ..++|...
T Consensus       386 k~l~~~~~~~~~v~--~~~~c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~-y~~t~~~~  462 (964)
T KOG3509|consen  386 KDLRVTLQRASYVA--AQGTCLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGS-YLGTCVPI  462 (964)
T ss_pred             cccccccccccccc--cccccCCCccccccCCCCccccccccccceeccccccCchhhccCccccccCCcc-ccceEecc
Confidence            88876554322121  1226778999999999999999999999999999999999999988888775333 34677776


Q ss_pred             CCCeeeeCCCCCCCCCc
Q psy7014         375 THSYECDCPPGRTGKFC  391 (500)
Q Consensus       375 ~~g~~C~C~~G~~G~~C  391 (500)
                      .....+.|-+| .|..+
T Consensus       463 ~~~~~~~c~pg-~g~~~  478 (964)
T KOG3509|consen  463 QGKRCEYCGPG-AGAPT  478 (964)
T ss_pred             CCCcceeecCC-CCCcc
Confidence            66677788888 55555


No 14 
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=98.40  E-value=4.3e-07  Score=80.88  Aligned_cols=42  Identities=29%  Similarity=0.559  Sum_probs=39.1

Q ss_pred             EeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcCCCCC
Q psy7014         454 IRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGDRP  495 (500)
Q Consensus       454 frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g~~~~  495 (500)
                      |||.+++|||||.+.....|||+|+|++|+|+++|++|+++.
T Consensus         1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~~~   42 (131)
T PF00054_consen    1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSGPA   42 (131)
T ss_dssp             EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSEEE
T ss_pred             CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCccc
Confidence            899999999999998877899999999999999999998843


No 15 
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=98.32  E-value=1.8e-06  Score=77.64  Aligned_cols=46  Identities=30%  Similarity=0.490  Sum_probs=42.8

Q ss_pred             ceeeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcC
Q psy7014         446 HEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLML  491 (500)
Q Consensus       446 ~~~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g  491 (500)
                      ...+|+|+|||.+++|+|||.+.....+|++|+|++|+|.+.++.|
T Consensus        20 ~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~~~~g   65 (151)
T cd00110          20 TRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLG   65 (151)
T ss_pred             ceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEEEcCC
Confidence            6789999999999999999998875689999999999999999987


No 16 
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.31  E-value=1.7e-05  Score=74.80  Aligned_cols=130  Identities=12%  Similarity=0.099  Sum_probs=80.9

Q ss_pred             CceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEc-CCceeEEEeechhhhhhhhhcccccc
Q psy7014          87 IHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNL-GSGWYLVYFEHTYLFILSRLRSAQDT  165 (500)
Q Consensus        87 ~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~-G~g~~~~~~~~~~~~~~~~l~~~~~~  165 (500)
                      ....+.|.+.||+.. ...|.||-..+.    ....++.|.+..++..+.+.. +..                       
T Consensus        50 ~~~~fsi~~~~r~~~-~~~g~L~si~~~----~~~~~l~v~l~g~~~~~~~~~~~~~-----------------------  101 (184)
T smart00210       50 LPEDFSLLTTFRQTP-KSRGVLFAIYDA----QNVRQFGLEVDGRANTLLLRYQGVD-----------------------  101 (184)
T ss_pred             CCCCeEEEEEEEeCC-CCCeEEEEEEcC----CCcEEEEEEEeCCccEEEEEECCCC-----------------------
Confidence            345788899998874 445888766532    344688999887664554443 220                       


Q ss_pred             ccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceee
Q psy7014         166 RLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTS  245 (500)
Q Consensus       166 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~  245 (500)
                                                           |.....+-..+.+.||+| |+|.+...+..++|.||.......
T Consensus       102 -------------------------------------g~~~~~~f~~~~l~dg~W-H~lal~V~~~~v~LyvDC~~~~~~  143 (184)
T smart00210      102 -------------------------------------GKQHTVSFRNLPLADGQW-HKLALSVSGSSATLYVDCNEIDSR  143 (184)
T ss_pred             -------------------------------------CcEEEEeecCCccccCCc-eEEEEEEeCCEEEEEECCccccce
Confidence                                                 111111112245789999 999999999999999999875544


Q ss_pred             eCCCcc-ccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014         246 KSPGRL-TQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS  293 (500)
Q Consensus       246 ~~~~~~-~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in  293 (500)
                      ..+... ..++.++ +.++|...          .....|.|||++++|.
T Consensus       144 ~l~~~~~~~~~~~g-~~~~g~~~----------~~~~~f~G~lq~l~i~  181 (184)
T smart00210      144 PLDRPGQPPIDTDG-IEVRGAQA----------ADRKPFQGDLQQLKIV  181 (184)
T ss_pred             ecCCcccccccccc-eEEEeecc----------CCCCcceEEeEEEEEe
Confidence            333222 1333333 44554321          1124799999999984


No 17 
>smart00282 LamG Laminin G domain.
Probab=98.13  E-value=8.7e-06  Score=72.26  Aligned_cols=47  Identities=30%  Similarity=0.459  Sum_probs=42.6

Q ss_pred             eeEEEEEeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcCCCC
Q psy7014         448 ACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGDR  494 (500)
Q Consensus       448 ~~i~l~frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g~~~  494 (500)
                      .+|+|.|||.+++|+|||.+.....+|++|+|.+|+|.+.++.|++.
T Consensus         3 ~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~   49 (135)
T smart00282        3 LSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSGP   49 (135)
T ss_pred             eEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCCC
Confidence            47999999999999999998755689999999999999999998754


No 18 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.84  E-value=8e-06  Score=54.46  Aligned_cols=26  Identities=54%  Similarity=1.280  Sum_probs=16.7

Q ss_pred             cCCCCCEEeeCC-CCeeeeCCCCCCCC
Q psy7014         364 NCSFGATCVPLT-HSYECDCPPGRTGK  389 (500)
Q Consensus       364 pC~ngg~C~~~~-~g~~C~C~~G~~G~  389 (500)
                      ||.|+|+|++.. .+|.|.|++||+|+
T Consensus         5 ~C~n~g~C~~~~~~~y~C~C~~G~~G~   31 (32)
T PF00008_consen    5 PCQNGGTCIDLPGGGYTCECPPGYTGK   31 (32)
T ss_dssp             SSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred             cCCCCeEEEeCCCCCEEeECCCCCccC
Confidence            566666666655 66666666666665


No 19 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.75  E-value=1.3e-05  Score=53.51  Aligned_cols=31  Identities=29%  Similarity=0.978  Sum_probs=29.1

Q ss_pred             CCCCCCCCCCEEeeCC-CceeecCCCCCCCcc
Q psy7014         320 CHNHTCSHGGACMNHG-ATFSCLCADGWFGPL  350 (500)
Q Consensus       320 C~~~pC~ngg~Ci~~~-~~~~C~C~~Gy~G~~  350 (500)
                      |.++||+|+|+|++.. +.|.|.|+.||+|++
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            7889999999999999 999999999999963


No 20 
>PF13385 Laminin_G_3:  Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=97.68  E-value=0.00067  Score=60.24  Aligned_cols=147  Identities=16%  Similarity=0.194  Sum_probs=82.8

Q ss_pred             ccccCCCCCCcceEEeecCCCCceEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEEC-CEEEEEEEcCCcee
Q psy7014          66 ASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIK-GYVVLTWNLGSGWY  144 (500)
Q Consensus        66 ~~f~g~~~~~~sy~~~~~~~~~~~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~-G~l~~~~~~G~g~~  144 (500)
                      ..|+|.    ++|+.++...-....+.|++.||+....+...++....     ...+.+.|.+.+ |++++.+..+.+. 
T Consensus         3 ~~f~g~----~~~i~~~~~~~~~~~fTi~~w~~~~~~~~~~~~~~~~~-----~~~~~~~l~~~~~~~l~~~~~~~~~~-   72 (157)
T PF13385_consen    3 LYFDGS----NDYISIPNSDFPSGSFTISFWVKPDSPSSSQSFVFMDS-----SGSGGFGLFINNNGRLRFYIGNGGGG-   72 (157)
T ss_dssp             EEE-ST----T-EEEEESGGGGGTEEEEEEEEEESS--SSEEEEEESS-----SSSEEEEEEEETTSEEEEEETTSEEE-
T ss_pred             EEECCC----CCEEEECCcCCCCCCEEEEEEEEeCCCCCCceEEEEec-----CCCCEEEEEEECCCEEEEEEeCCCce-
Confidence            356664    78999985322256899999999887666554444311     122366777764 6666644444311 


Q ss_pred             EEEeechhhhhhhhhccccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEE
Q psy7014         145 LVYFEHTYLFILSRLRSAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTV  224 (500)
Q Consensus       145 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V  224 (500)
                                                                                 ...+.+..  .+.+++| |+|
T Consensus        73 -----------------------------------------------------------~~~~~~~~--~~~~~~W-~~l   90 (157)
T PF13385_consen   73 -----------------------------------------------------------NYSFSSDS--NLPDNKW-HHL   90 (157)
T ss_dssp             -----------------------------------------------------------SS-EE-BS-----TT-E-EEE
T ss_pred             -----------------------------------------------------------eEEEecCc--ccCCCCE-EEE
Confidence                                                                       01122223  3456799 999


Q ss_pred             EEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeecccc
Q psy7014         225 RVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGN  296 (500)
Q Consensus       225 ~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~  296 (500)
                      .+..++..+.|.||+........+.. ........++||+...           ....|.|-|.++++=.+.
T Consensus        91 ~~~~~~~~~~lyvnG~~~~~~~~~~~-~~~~~~~~~~iG~~~~-----------~~~~~~g~i~~~~i~~~a  150 (157)
T PF13385_consen   91 ALTYDGSTVTLYVNGELVGSSTIPSN-ISLNSNGPLFIGGSGG-----------GSSPFNGYIDDLRIYNRA  150 (157)
T ss_dssp             EEEEETTEEEEEETTEEETTCTEESS-SSTTSCCEEEESS-ST-----------T--B-EEEEEEEEEESS-
T ss_pred             EEEEECCeEEEEECCEEEEeEeccCC-cCCCCcceEEEeecCC-----------CCCceEEEEEEEEEECcc
Confidence            99999999999999976433322222 1234556799998631           135799999999985443


No 21 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=97.61  E-value=6.3e-05  Score=65.24  Aligned_cols=40  Identities=30%  Similarity=0.566  Sum_probs=36.3

Q ss_pred             EeeCCCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEcCCC
Q psy7014         454 IRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGD  493 (500)
Q Consensus       454 frT~~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~g~~  493 (500)
                      |||++++|+|||.+.....+|++|+|.+|+|++.+++|+.
T Consensus         1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~   40 (128)
T PF02210_consen    1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGS   40 (128)
T ss_dssp             EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSS
T ss_pred             CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEcccc
Confidence            8999999999999877546899999999999999999944


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.59  E-value=8.8e-05  Score=50.98  Aligned_cols=37  Identities=49%  Similarity=1.086  Sum_probs=29.4

Q ss_pred             cccccCCCccCCCCCEEeeCCCCeeeeCCCCCC-CCCcC
Q psy7014         355 YNLCDSTRHNCSFGATCVPLTHSYECDCPPGRT-GKFCE  392 (500)
Q Consensus       355 i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~-G~~Ce  392 (500)
                      +++|... .||.+++.|++..++|.|.|+.||. |..|+
T Consensus         2 ~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        2 IDECASG-NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             cccCcCC-CCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            3556542 3788888999988899999999998 88885


No 23 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.37  E-value=0.00011  Score=52.02  Aligned_cols=34  Identities=41%  Similarity=0.919  Sum_probs=29.9

Q ss_pred             ccccccCCCccCCCCCEEeeCCCCeeeeCCCCCC
Q psy7014         354 RYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRT  387 (500)
Q Consensus       354 ~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~  387 (500)
                      +|++|...+++|..++.|++..++|.|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            4678888777898899999999999999999986


No 24 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.22  E-value=0.00043  Score=46.88  Aligned_cols=36  Identities=50%  Similarity=1.102  Sum_probs=27.2

Q ss_pred             ccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcC
Q psy7014         356 NLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCE  392 (500)
Q Consensus       356 ~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce  392 (500)
                      ++|... .+|.+++.|.+..++|.|.|+.+|.|..|+
T Consensus         3 ~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ccCCCC-CCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            445441 268888888888888889998888888774


No 25 
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=97.06  E-value=0.017  Score=55.21  Aligned_cols=76  Identities=14%  Similarity=0.055  Sum_probs=48.4

Q ss_pred             CCCCccEEEEEEEe--CcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014         216 KSKRGGYTVRVGKN--GQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS  293 (500)
Q Consensus       216 ~dg~WwH~V~v~r~--~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in  293 (500)
                      .+++| |+|.+..+  .+.+.|.||+....... ......+...+.|.||.....    +-........|.|-|.++++=
T Consensus        88 ~~g~W-~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~----~gg~~~~~~~f~G~I~~v~iw  161 (201)
T cd00152          88 SDGAW-HHICVTWESTSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDS----YGGGFDATQSFVGEISDVNMW  161 (201)
T ss_pred             CCCCE-EEEEEEEECCCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccC----CCCCCCCCcceEEEEceeEEE
Confidence            78899 99999988  44678999997643332 122234455667888854211    111122345799999999886


Q ss_pred             cccc
Q psy7014         294 AGNV  297 (500)
Q Consensus       294 g~~~  297 (500)
                      ++.+
T Consensus       162 ~~~L  165 (201)
T cd00152         162 DSVL  165 (201)
T ss_pred             cccC
Confidence            5544


No 26 
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=97.02  E-value=0.016  Score=55.69  Aligned_cols=78  Identities=13%  Similarity=0.067  Sum_probs=48.8

Q ss_pred             eeCCCCccEEEEEEEeC--cEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCcccccccee
Q psy7014         214 SAKSKRGGYTVRVGKNG--QQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVE  291 (500)
Q Consensus       214 ~~~dg~WwH~V~v~r~~--~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~  291 (500)
                      .+.|++| |+|-+..+.  +.+.|.|||... ..........+.....|.||-....    .-........|.|-|.++.
T Consensus        86 ~~~~g~W-~hvc~tw~~~~g~~~lyvnG~~~-~~~~~~~g~~i~~~G~lvlGq~qd~----~gg~f~~~~~f~G~i~~v~  159 (206)
T smart00159       86 PESDGKW-HHICTTWESSSGIAELWVDGKPG-VRKGLAKGYTVKPGGSIILGQEQDS----YGGGFDATQSFVGEIGDLN  159 (206)
T ss_pred             cccCCce-EEEEEEEECCCCcEEEEECCEEc-ccccccCCcEECCCCEEEEEecccC----CCCCCCCCcceeEEEeeeE
Confidence            4578899 999999884  467899999763 2221122233455566888864221    1111234457999999998


Q ss_pred             eccccc
Q psy7014         292 LSAGNV  297 (500)
Q Consensus       292 ing~~~  297 (500)
                      +=++.+
T Consensus       160 iw~~~L  165 (206)
T smart00159      160 MWDSVL  165 (206)
T ss_pred             EecccC
Confidence            865544


No 27 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.01  E-value=0.00077  Score=46.17  Aligned_cols=35  Identities=31%  Similarity=0.994  Sum_probs=30.8

Q ss_pred             ccCCC-CCCCCCCEEeeCCCceeecCCCCCC-Ccccc
Q psy7014         318 SQCHN-HTCSHGGACMNHGATFSCLCADGWF-GPLCA  352 (500)
Q Consensus       318 ~~C~~-~pC~ngg~Ci~~~~~~~C~C~~Gy~-G~~C~  352 (500)
                      ++|.. .||.+++.|++..+.|.|.|+.||. |..|+
T Consensus         3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             ccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            56776 7999999999999999999999999 88774


No 28 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.87  E-value=0.0016  Score=43.32  Aligned_cols=30  Identities=57%  Similarity=1.157  Sum_probs=24.4

Q ss_pred             ccCCCCCEEeeCCCCeeeeCCCCCCCC-CcC
Q psy7014         363 HNCSFGATCVPLTHSYECDCPPGRTGK-FCE  392 (500)
Q Consensus       363 ~pC~ngg~C~~~~~g~~C~C~~G~~G~-~Ce  392 (500)
                      .+|.+++.|++..++|.|.|+.||.|. .|+
T Consensus         6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~   36 (36)
T cd00053           6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE   36 (36)
T ss_pred             CCCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence            368888888888888889999888887 663


No 29 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=96.82  E-value=0.035  Score=52.55  Aligned_cols=140  Identities=15%  Similarity=0.079  Sum_probs=82.3

Q ss_pred             eEEEEEEEEeeCCCCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhccccccccc
Q psy7014          89 HCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLRSAQDTRLC  168 (500)
Q Consensus        89 ~~~~i~~~Frt~~~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~~~~~~~~~  168 (500)
                      ....|.++|++.+....--||-.++.   .....|+.|++.++++-+.++-..|.....+.                   
T Consensus        33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~---~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~-------------------   90 (190)
T PF02973_consen   33 EEGTIVIRFKSDSNSGIQSLFSISNS---TKGNEYFSLYVSNNKLGFELRDTKGNQNYNFS-------------------   90 (190)
T ss_dssp             SSEEEEEEEEESS-SSEEEEEEEE-T---STTSEEEEEEEETTEEEEEEEETTTTCEEEEE-------------------
T ss_pred             cccEEEEEEecCCCcceeEEEEecCC---CCccceEEEEEECCEEEEEEecCCCCcccccc-------------------
Confidence            46789999999766643446666543   34558999999999988888776653211110                   


Q ss_pred             cccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEe--CcEEEEEEcCCcceeee
Q psy7014         169 CLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKN--GQQCWLMVDNMGNVTSK  246 (500)
Q Consensus       169 ~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~--~~~~~L~VD~~~~~~~~  246 (500)
                                                        .+..+.   +...++..| |+|.+.-+  ....+|.|||.......
T Consensus        91 ----------------------------------~~~~v~---~~~~~~~~~-~tva~~ad~~~~~ykly~NG~~v~~~~  132 (190)
T PF02973_consen   91 ----------------------------------RPAKVR---GGYKNNVTF-NTVAFVADSKNKGYKLYVNGELVSTLS  132 (190)
T ss_dssp             ----------------------------------ESSE-----SEETTEES--EEEEEEEETTTTEEEEEETTCEEEEEE
T ss_pred             ----------------------------------cccEec---ccccCCceE-EEEEEEEecCCCeEEEEeCCeeEEEec
Confidence                                              111111   112233344 99999997  88999999996544443


Q ss_pred             CC-CccccccCC--CceEEcccccccCcCCCCCcCCCCCccccccceeeccccc
Q psy7014         247 SP-GRLTQLNTK--PMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGNV  297 (500)
Q Consensus       247 ~~-~~~~~L~~~--~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~~  297 (500)
                      .+ +++ .-++.  ..++|||.....        ...-+|.|=|++|.+=++.+
T Consensus       133 ~~~~~F-is~i~~~n~~~iG~t~R~g--------~~~y~f~G~I~~l~iYn~aL  177 (190)
T PF02973_consen  133 SKSGNF-ISDIPGLNSVQIGGTNRAG--------SNAYPFNGTIDNLKIYNRAL  177 (190)
T ss_dssp             ECTSS--GGGSTT--EEEESSEEETT--------EEES--EEEEEEEEEESS--
T ss_pred             cccccH-hhcCcCCceEEEcceEeCC--------CceecccceEEEEEEEcCcC
Confidence            33 222 11222  249999984322        12348999999999976554


No 30 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.80  E-value=0.0014  Score=44.21  Aligned_cols=35  Identities=31%  Similarity=1.001  Sum_probs=30.6

Q ss_pred             ccCCC-CCCCCCCEEeeCCCceeecCCCCCCCcccc
Q psy7014         318 SQCHN-HTCSHGGACMNHGATFSCLCADGWFGPLCA  352 (500)
Q Consensus       318 ~~C~~-~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~  352 (500)
                      ++|.. .+|.+++.|.+..+.|.|.|+.||.|..|+
T Consensus         3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            56776 789999999999999999999999997773


No 31 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.70  E-value=0.0023  Score=42.88  Aligned_cols=28  Identities=61%  Similarity=1.278  Sum_probs=22.6

Q ss_pred             cCCCCCEEeeCCCCeeeeCCCCCCC-CCcC
Q psy7014         364 NCSFGATCVPLTHSYECDCPPGRTG-KFCE  392 (500)
Q Consensus       364 pC~ngg~C~~~~~g~~C~C~~G~~G-~~Ce  392 (500)
                      +|.++ .|++..++|.|.|+.||.| ..|+
T Consensus         7 ~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        7 PCSNG-TCINTPGSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence            67777 8888888888888888888 7664


No 32 
>KOG1225|consensus
Probab=96.58  E-value=0.0033  Score=67.88  Aligned_cols=75  Identities=33%  Similarity=0.821  Sum_probs=58.8

Q ss_pred             ccCCCCccccCCccCCCCCCCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCC
Q psy7014         306 AAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPG  385 (500)
Q Consensus       306 ~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G  385 (500)
                      +..++....|..-.|... |..++.|++.    .|.|.+||.|+.|+..-  |..   .|.++|.|++.    .|.|.+|
T Consensus       269 C~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs~~~--cpa---dC~g~G~Ci~G----~C~C~~G  334 (525)
T KOG1225|consen  269 CPPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPGYSGKDCSIRR--CPA---DCSGHGKCIDG----ECLCDEG  334 (525)
T ss_pred             CCCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCCcccccccccc--CCc---cCCCCCcccCC----ceEeCCC
Confidence            345556666666566655 8888888766    89999999999997544  654   69999999933    7999999


Q ss_pred             CCCCCcCCC
Q psy7014         386 RTGKFCEKD  394 (500)
Q Consensus       386 ~~G~~Ce~~  394 (500)
                      |+|..|++.
T Consensus       335 y~G~~C~~~  343 (525)
T KOG1225|consen  335 YTGELCIQR  343 (525)
T ss_pred             CcCCccccc
Confidence            999999986


No 33 
>KOG1214|consensus
Probab=96.55  E-value=0.016  Score=64.42  Aligned_cols=102  Identities=21%  Similarity=0.522  Sum_probs=69.9

Q ss_pred             CccCCCC--CCCCCCEEeeCCCceeecCCCCCC----Cccccc-----ccccccCCCccCCCCCEEe--eC-CCCeeeeC
Q psy7014         317 TSQCHNH--TCSHGGACMNHGATFSCLCADGWF----GPLCAS-----RYNLCDSTRHNCSFGATCV--PL-THSYECDC  382 (500)
Q Consensus       317 ~~~C~~~--pC~ngg~Ci~~~~~~~C~C~~Gy~----G~~C~~-----~i~~C~~~p~pC~ngg~C~--~~-~~g~~C~C  382 (500)
                      .+.|+..  -|-...+|+...+.|+|.|..+|.    +-.|.-     ..++|....+.|...|.|.  .. .+.|.|.|
T Consensus       734 ~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~C  813 (1289)
T KOG1214|consen  734 ENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCAC  813 (1289)
T ss_pred             hhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEee
Confidence            3667643  499999999999999999998876    345643     3466766656677666554  33 47899999


Q ss_pred             CCCCCCC--CcCCC-CCC------cCccccCCCceEEecCCCccc
Q psy7014         383 PPGRTGK--FCEKD-ESL------SDISFSGRRSYISLPSSELHL  418 (500)
Q Consensus       383 ~~G~~G~--~Ce~~-~~~------~~~~F~g~~sy~~~~~~~~~~  418 (500)
                      -+||+|+  .|... ++.      .+.+++..++|.+-+.+++.+
T Consensus       814 LPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~G  858 (1289)
T KOG1214|consen  814 LPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYG  858 (1289)
T ss_pred             cCCccCCccccccccccCccccCCCceEecCCCcceeecccCccC
Confidence            9999876  34433 221      134566667777777776654


No 34 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.47  E-value=0.0035  Score=41.55  Aligned_cols=32  Identities=38%  Similarity=1.162  Sum_probs=28.2

Q ss_pred             CC-CCCCCCCCEEeeCCCceeecCCCCCCCc-cc
Q psy7014         320 CH-NHTCSHGGACMNHGATFSCLCADGWFGP-LC  351 (500)
Q Consensus       320 C~-~~pC~ngg~Ci~~~~~~~C~C~~Gy~G~-~C  351 (500)
                      |. ..+|.+++.|++..+.|.|.|+.||.|. .|
T Consensus         2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            45 6789999999999999999999999997 55


No 35 
>KOG1214|consensus
Probab=96.42  E-value=0.0045  Score=68.49  Aligned_cols=57  Identities=37%  Similarity=0.938  Sum_probs=47.0

Q ss_pred             EEeeC-CCceeecCCCCCCCc--ccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCC
Q psy7014         330 ACMNH-GATFSCLCADGWFGP--LCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGK  389 (500)
Q Consensus       330 ~Ci~~-~~~~~C~C~~Gy~G~--~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~  389 (500)
                      .|+.. ...|+|.|-+||.|.  .|. ++++|.+.  -|...++|.+.+++|.|.|.+||.|+
T Consensus       800 ~c~~hGgs~y~C~CLPGfsGDG~~c~-dvDeC~ps--rChp~A~CyntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  800 RCVHHGGSTYSCACLPGFSGDGHQCT-DVDECSPS--RCHPAATCYNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             EEEecCCceEEEeecCCccCCccccc-cccccCcc--ccCCCceEecCCCcceeecccCccCC
Confidence            34444 468999999999965  443 56999875  79999999999999999999999865


No 36 
>KOG1217|consensus
Probab=96.35  E-value=0.0056  Score=65.00  Aligned_cols=78  Identities=32%  Similarity=0.842  Sum_probs=63.7

Q ss_pred             CccCCCCC-CCCCCEEeeCCCceeecCCCCCCCccc--ccccccccC--CCccCCCCCEEee--CCCCeeeeCCCCCCCC
Q psy7014         317 TSQCHNHT-CSHGGACMNHGATFSCLCADGWFGPLC--ASRYNLCDS--TRHNCSFGATCVP--LTHSYECDCPPGRTGK  389 (500)
Q Consensus       317 ~~~C~~~p-C~ngg~Ci~~~~~~~C~C~~Gy~G~~C--~~~i~~C~~--~p~pC~ngg~C~~--~~~g~~C~C~~G~~G~  389 (500)
                      .+.|...+ |.+++.|+...+.|.|.|++||.|..|  ..+...|..  ...+|.++++|..  ....+.|.|..++.|.
T Consensus       271 ~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~  350 (487)
T KOG1217|consen  271 VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR  350 (487)
T ss_pred             ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC
Confidence            47788764 999999999999999999999999999  234467742  2357999999933  3468889999999999


Q ss_pred             CcCCC
Q psy7014         390 FCEKD  394 (500)
Q Consensus       390 ~Ce~~  394 (500)
                      .|+..
T Consensus       351 ~C~~~  355 (487)
T KOG1217|consen  351 RCEDS  355 (487)
T ss_pred             ccccC
Confidence            99976


No 37 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.33  E-value=0.0045  Score=41.47  Aligned_cols=31  Identities=35%  Similarity=1.087  Sum_probs=27.6

Q ss_pred             CCC-CCCCCCCEEeeCCCceeecCCCCCCC-ccc
Q psy7014         320 CHN-HTCSHGGACMNHGATFSCLCADGWFG-PLC  351 (500)
Q Consensus       320 C~~-~pC~ngg~Ci~~~~~~~C~C~~Gy~G-~~C  351 (500)
                      |.. .+|.++ .|++.++.|.|.|+.||.| +.|
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            566 689999 9999999999999999999 666


No 38 
>KOG4260|consensus
Probab=96.01  E-value=0.0068  Score=59.27  Aligned_cols=93  Identities=23%  Similarity=0.475  Sum_probs=67.8

Q ss_pred             CCCCCCCCEEeeC---CCceeecCCCCCCCccccc---------------------------------------------
Q psy7014         322 NHTCSHGGACMNH---GATFSCLCADGWFGPLCAS---------------------------------------------  353 (500)
Q Consensus       322 ~~pC~ngg~Ci~~---~~~~~C~C~~Gy~G~~C~~---------------------------------------------  353 (500)
                      ..||...|.|.-+   .++-.|.|.+||+|+.|..                                             
T Consensus       149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~l  228 (350)
T KOG4260|consen  149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKL  228 (350)
T ss_pred             cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhccccee
Confidence            4678888888765   3567899999999888653                                             


Q ss_pred             ------ccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCC--CcCCCCCC----cCccccCCCceEEecCC
Q psy7014         354 ------RYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGK--FCEKDESL----SDISFSGRRSYISLPSS  414 (500)
Q Consensus       354 ------~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~--~Ce~~~~~----~~~~F~g~~sy~~~~~~  414 (500)
                            ++++|...|.||.....|++..++|.|.+.+||.+.  .|+.-..+    ...+.+-.++|..++..
T Consensus       229 de~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~  301 (350)
T KOG4260|consen  229 DEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDGQYRCVCFS  301 (350)
T ss_pred             cccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCccEEEEecc
Confidence                  268888889999999999999999999999998753  34442111    12345555677766543


No 39 
>KOG1225|consensus
Probab=96.00  E-value=0.013  Score=63.45  Aligned_cols=58  Identities=38%  Similarity=0.937  Sum_probs=47.2

Q ss_pred             CCCCCEEeeCCCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCC
Q psy7014         325 CSHGGACMNHGATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDE  395 (500)
Q Consensus       325 C~ngg~Ci~~~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~  395 (500)
                      |.+++.|++.    .|.|++||+|..|+.  ..|..   +|..++.|++.    .|.|+++|+|+.|+...
T Consensus       256 c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~---~cs~~g~~~~g----~CiC~~g~~G~dCs~~~  313 (525)
T KOG1225|consen  256 CTGRGQCVEG----RCICPPGFTGDDCDE--LVCPV---DCSGGGVCVDG----ECICNPGYSGKDCSIRR  313 (525)
T ss_pred             CcccceEeCC----eEeCCCCCcCCCCCc--ccCCc---ccCCCceecCC----EeecCCCcccccccccc
Confidence            5555667765    899999999999986  45755   48888888875    89999999999998764


No 40 
>KOG1217|consensus
Probab=95.89  E-value=0.014  Score=61.86  Aligned_cols=76  Identities=37%  Similarity=0.864  Sum_probs=62.2

Q ss_pred             cCCCCC--CCCCCEEeeC---CCceeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCC
Q psy7014         319 QCHNHT--CSHGGACMNH---GATFSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEK  393 (500)
Q Consensus       319 ~C~~~p--C~ngg~Ci~~---~~~~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~  393 (500)
                      .|...+  +...+.|...   ...+.|.|..||.+..|....+.|.....+|.+++.|.+...+|.|.|+.+|.|..|+.
T Consensus       128 ~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~  207 (487)
T KOG1217|consen  128 ECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET  207 (487)
T ss_pred             eecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcC
Confidence            355555  3455566654   35889999999999999988789986667899999999999889999999999999987


Q ss_pred             C
Q psy7014         394 D  394 (500)
Q Consensus       394 ~  394 (500)
                      .
T Consensus       208 ~  208 (487)
T KOG1217|consen  208 T  208 (487)
T ss_pred             C
Confidence            6


No 41 
>PF00354 Pentaxin:  Pentaxin family;  InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=95.34  E-value=0.2  Score=47.82  Aligned_cols=78  Identities=15%  Similarity=0.109  Sum_probs=42.8

Q ss_pred             eeCCCCccEEEEEEEeC--cEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCcccccccee
Q psy7014         214 SAKSKRGGYTVRVGKNG--QQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVE  291 (500)
Q Consensus       214 ~~~dg~WwH~V~v~r~~--~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~  291 (500)
                      .+.+++| |+|-+..+.  ....|.+||.. ...........+.-++.+.||--..    .+.........|.|=|.++.
T Consensus        80 ~~~~~~W-hh~C~tW~s~~G~~~ly~dG~~-~~~~~~~~g~~i~~gG~~vlGQeQd----~~gG~fd~~q~F~G~i~~~~  153 (195)
T PF00354_consen   80 PIRDGQW-HHICVTWDSSTGRWQLYVDGVR-LSSTGLATGHSIPGGGTLVLGQEQD----SYGGGFDESQAFVGEISDFN  153 (195)
T ss_dssp             CS-TSS--EEEEEEEETTTTEEEEEETTEE-EEEEESSTT--B-SSEEEEESS-BS----BTTBTCSGGGB--EEEEEEE
T ss_pred             ccCCCCc-EEEEEEEecCCcEEEEEECCEe-cccccccCCceECCCCEEEECcccc----ccCCCcCCccEeeEEEeceE
Confidence            4568899 999999875  67888899984 2222222234454555677774321    12222334568999999998


Q ss_pred             eccccc
Q psy7014         292 LSAGNV  297 (500)
Q Consensus       292 ing~~~  297 (500)
                      +=++.+
T Consensus       154 iWd~vL  159 (195)
T PF00354_consen  154 IWDRVL  159 (195)
T ss_dssp             EESS--
T ss_pred             EEeeeC
Confidence            855544


No 42 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=95.23  E-value=0.01  Score=40.66  Aligned_cols=27  Identities=41%  Similarity=0.867  Sum_probs=20.8

Q ss_pred             ccCCCCCEEeeCCCCeeeeCCCCCCCC
Q psy7014         363 HNCSFGATCVPLTHSYECDCPPGRTGK  389 (500)
Q Consensus       363 ~pC~ngg~C~~~~~g~~C~C~~G~~G~  389 (500)
                      ..|...+.|++..++|.|.|.+||.|.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            468889999999999999999998764


No 43 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.22  E-value=0.026  Score=37.58  Aligned_cols=26  Identities=42%  Similarity=0.729  Sum_probs=21.8

Q ss_pred             cCCCCCEEeeCCCCeeeeCCCCCCCCCc
Q psy7014         364 NCSFGATCVPLTHSYECDCPPGRTGKFC  391 (500)
Q Consensus       364 pC~ngg~C~~~~~g~~C~C~~G~~G~~C  391 (500)
                      .|.++|+|+..  ...|.|.+||+|+.|
T Consensus         7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    7 ICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            58888999866  558999999999887


No 44 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.21  E-value=0.0066  Score=32.00  Aligned_cols=13  Identities=8%  Similarity=-0.463  Sum_probs=11.1

Q ss_pred             eeecCCCCCCCCC
Q psy7014          45 INQLLSIIFTNFL   57 (500)
Q Consensus        45 ~c~c~~~~~G~~C   57 (500)
                      .|+|++||+|++|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5999999999998


No 45 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.18  E-value=0.0064  Score=32.06  Aligned_cols=13  Identities=62%  Similarity=1.559  Sum_probs=8.2

Q ss_pred             eeeCCCCCCCCCc
Q psy7014         379 ECDCPPGRTGKFC  391 (500)
Q Consensus       379 ~C~C~~G~~G~~C  391 (500)
                      .|.|++||+|++|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            3677777777666


No 46 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=94.07  E-value=0.043  Score=38.69  Aligned_cols=30  Identities=33%  Similarity=1.028  Sum_probs=25.7

Q ss_pred             ccCCC--CCCCCCCEEeeCCCceeecCCCCCC
Q psy7014         318 SQCHN--HTCSHGGACMNHGATFSCLCADGWF  347 (500)
Q Consensus       318 ~~C~~--~pC~ngg~Ci~~~~~~~C~C~~Gy~  347 (500)
                      +.|..  ++|..++.|++..++|.|.|++||.
T Consensus         3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             STTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            45553  4698899999999999999999997


No 47 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.75  E-value=0.072  Score=35.44  Aligned_cols=27  Identities=37%  Similarity=0.998  Sum_probs=23.4

Q ss_pred             CCCCCCCEEeeCCCceeecCCCCCCCccc
Q psy7014         323 HTCSHGGACMNHGATFSCLCADGWFGPLC  351 (500)
Q Consensus       323 ~pC~ngg~Ci~~~~~~~C~C~~Gy~G~~C  351 (500)
                      ..|.++|+|+..  ..+|.|..||.|+.|
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            469999999876  678999999999876


No 48 
>KOG1226|consensus
Probab=92.33  E-value=0.26  Score=54.95  Aligned_cols=56  Identities=34%  Similarity=0.816  Sum_probs=35.1

Q ss_pred             eecCCCCCCCcccccc--cccccCCC-ccCCCCCEEeeCCCCeeeeCCCC-CCCCCcCCCCCCc
Q psy7014         339 SCLCADGWFGPLCASR--YNLCDSTR-HNCSFGATCVPLTHSYECDCPPG-RTGKFCEKDESLS  398 (500)
Q Consensus       339 ~C~C~~Gy~G~~C~~~--i~~C~~~p-~pC~ngg~C~~~~~g~~C~C~~G-~~G~~Ce~~~~~~  398 (500)
                      .|.|.+||+|..|+.+  .+.|.+.- .-|...|+|.=.    +|.|... |+|..||+.....
T Consensus       567 ~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~~CE~cptc~  626 (783)
T KOG1226|consen  567 RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPPYSGEFCEKCPTCP  626 (783)
T ss_pred             cEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCCcCcchhhcCCCCC
Confidence            5566788888887654  35665531 234444444433    6778764 8899998875533


No 49 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=92.05  E-value=0.12  Score=45.14  Aligned_cols=31  Identities=29%  Similarity=0.685  Sum_probs=25.0

Q ss_pred             cCCCCCEEeeC--CCCeeeeCCCCCCCCCcCCCC
Q psy7014         364 NCSFGATCVPL--THSYECDCPPGRTGKFCEKDE  395 (500)
Q Consensus       364 pC~ngg~C~~~--~~g~~C~C~~G~~G~~Ce~~~  395 (500)
                      -|.|| .|.-.  ...+.|.|+.||+|.+||...
T Consensus        52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d   84 (139)
T PHA03099         52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV   84 (139)
T ss_pred             EeECC-EEEeeccCCCceeECCCCccccccccee
Confidence            47786 88643  478999999999999998763


No 50 
>PF06439 DUF1080:  Domain of Unknown Function (DUF1080);  InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=90.83  E-value=0.45  Score=44.11  Aligned_cols=37  Identities=14%  Similarity=0.083  Sum_probs=27.6

Q ss_pred             CceeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeC
Q psy7014         210 PNTISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKS  247 (500)
Q Consensus       210 ~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~  247 (500)
                      ........++| |+++|...+..+++.||+..+.....
T Consensus       119 ~~~~~~~~~~W-~~~~I~~~g~~i~v~vnG~~v~~~~d  155 (185)
T PF06439_consen  119 SVNVAIPPGEW-NTVRIVVKGNRITVWVNGKPVADFTD  155 (185)
T ss_dssp             SS--S--TTSE-EEEEEEEETTEEEEEETTEEEEEEET
T ss_pred             cccccCCCCce-EEEEEEEECCEEEEEECCEEEEEEEc
Confidence            34445677899 99999999999999999987655544


No 51 
>PHA02887 EGF-like protein; Provisional
Probab=90.82  E-value=0.19  Score=43.30  Aligned_cols=30  Identities=27%  Similarity=0.492  Sum_probs=22.5

Q ss_pred             cCCCCCEEee--CCCCeeeeCCCCCCCCCcCCC
Q psy7014         364 NCSFGATCVP--LTHSYECDCPPGRTGKFCEKD  394 (500)
Q Consensus       364 pC~ngg~C~~--~~~g~~C~C~~G~~G~~Ce~~  394 (500)
                      -|-+ |+|.-  ....+.|.|+.||+|.+||..
T Consensus        93 YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         93 FCIN-GECMNIIDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             EeeC-CEEEccccCCCceeECCCCcccCCCCcc
Confidence            4664 48864  346788999999999999864


No 52 
>smart00051 DSL delta serrate ligand.
Probab=90.82  E-value=0.33  Score=37.60  Aligned_cols=47  Identities=21%  Similarity=0.580  Sum_probs=32.5

Q ss_pred             eeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCc
Q psy7014         338 FSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFC  391 (500)
Q Consensus       338 ~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~C  391 (500)
                      +.-.|+.+|.|..|+   ..|.+. +-+..+.+|..   .-.|.|.+||+|..|
T Consensus        17 ~rv~C~~~~yG~~C~---~~C~~~-~d~~~~~~Cd~---~G~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYYGEGCN---KFCRPR-DDFFGHYTCDE---NGNKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCcCCccC---CEeCcC-ccccCCccCCc---CCCEecCCCCcCCCC
Confidence            345688999999996   345432 23556667743   135789999999987


No 53 
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=90.79  E-value=0.71  Score=40.86  Aligned_cols=67  Identities=18%  Similarity=0.083  Sum_probs=43.4

Q ss_pred             CCccEEEEEEEeC--cEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeeccc
Q psy7014         218 KRGGYTVRVGKNG--QQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAG  295 (500)
Q Consensus       218 g~WwH~V~v~r~~--~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~  295 (500)
                      ++| |+|.+..++  ..+.|.||+.........    ......++.||.....       .......|.|.|.+++|-++
T Consensus        61 ~~W-~hva~v~d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~~~~~-------~~~~~~~f~G~Idevriy~~  128 (133)
T smart00560       61 GVW-VHLAGVYDGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGGRILL-------GGAGGENFSGRLDEVRVYNR  128 (133)
T ss_pred             CCE-EEEEEEEECCCCeEEEEECCEEccccccC----CcccCCceEEeeeccC-------CCCCCCCceEEeeEEEEecc
Confidence            689 999999998  789999999653322111    1233457888842100       01123579999999998654


Q ss_pred             c
Q psy7014         296 N  296 (500)
Q Consensus       296 ~  296 (500)
                      .
T Consensus       129 a  129 (133)
T smart00560      129 A  129 (133)
T ss_pred             c
Confidence            3


No 54 
>KOG3509|consensus
Probab=90.35  E-value=3.5  Score=48.02  Aligned_cols=170  Identities=20%  Similarity=0.239  Sum_probs=83.5

Q ss_pred             CCccEEEEEEEeC-cEEEEEEcCCcceeeeCCCc-------cccccCCCceEEcccccccCcCCCCCcCCCCCccc----
Q psy7014         218 KRGGYTVRVGKNG-QQCWLMVDNMGNVTSKSPGR-------LTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSG----  285 (500)
Q Consensus       218 g~WwH~V~v~r~~-~~~~L~VD~~~~~~~~~~~~-------~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~G----  285 (500)
                      +.| |++.-.+.+ ..+.+.+++...+...+...       ...++++...+.++.....-...........+++|    
T Consensus       654 ~~a-~~~~~~~~~~~~~~m~~a~~~~~l~~st~~~~~~p~~~~~~~~~ga~~~~~g~~~~~~~~~~~C~c~~g~~G~~ce  732 (964)
T KOG3509|consen  654 GSA-HRVDGIRARGQHILMTVADTTTVLIKSTVSTDCTPSECSSANLEGALCYGGGKTDIIAAEVEQCQCPKGLVGTSCE  732 (964)
T ss_pred             ccc-ccccceecCCceeccccccccceeeeecccCCCChHHhhhhccCcccccCCCCCchhhhhccccccCccccCcccc
Confidence            467 999988887 55667777776666655432       33444555555555432211222233445678999    


Q ss_pred             -cccceeecccccccccccccccCCCCccccCCccCCCCCCCCCCEEeeCCCceee-cCCCCCCCccccccccc----cc
Q psy7014         286 -CIFDVELSAGNVGINLYKTRAAEGRGVGQCGTSQCHNHTCSHGGACMNHGATFSC-LCADGWFGPLCASRYNL----CD  359 (500)
Q Consensus       286 -CIr~v~ing~~~~l~~~~~~~~~~~~v~~C~~~~C~~~pC~ngg~Ci~~~~~~~C-~C~~Gy~G~~C~~~i~~----C~  359 (500)
                       |.....+....-..+.....+..+..+..|..+.=..      ..|........| .|..|+.|..=......    |.
T Consensus       733 ~c~e~~~ls~t~~~~~~~~~~c~~~~h~~~c~~~~~~n------t~~q~~~~~~~~~~~~~g~~~da~~g~~~D~~p~~~  806 (964)
T KOG3509|consen  733 DCAEGYTLSTTGGLYPGLCEDCECNSHISQCEDDLGYN------TDCQNNTEGDRCELCSPGTYGDARRGTPEDCRPATA  806 (964)
T ss_pred             cccccccccccCCcCcccCcccccCCCccccccccccc------ccccccCccceeeecCCCccccCccCCcccCCccch
Confidence             7776666542111111112222222223332211001      234445556666 47788765432211111    11


Q ss_pred             CCCccCCCCC-EEeeC-CCCeee-eCCCCCCCCCcCCCC
Q psy7014         360 STRHNCSFGA-TCVPL-THSYEC-DCPPGRTGKFCEKDE  395 (500)
Q Consensus       360 ~~p~pC~ngg-~C~~~-~~g~~C-~C~~G~~G~~Ce~~~  395 (500)
                      .. .+|.-+. .+... ..++.| .|+.+++|.+|+...
T Consensus       807 l~-~~~~~~~r~~l~~~~~~~~~~~~p~~~~g~~~~~~~  844 (964)
T KOG3509|consen  807 LT-IQCSCNNRSPLSCDGFGPGCLLCPHNTEGTTCERVK  844 (964)
T ss_pred             hh-hhhhhcccCccccccCCCCcccCCCCccccchhhhc
Confidence            11 1222111 22222 245567 489999999998863


No 55 
>KOG1226|consensus
Probab=89.44  E-value=0.76  Score=51.43  Aligned_cols=43  Identities=28%  Similarity=0.590  Sum_probs=25.0

Q ss_pred             CcccccccccccCC-CccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCC
Q psy7014         348 GPLCASRYNLCDST-RHNCSFGATCVPLTHSYECDCPPGRTGKFCEKD  394 (500)
Q Consensus       348 G~~C~~~i~~C~~~-p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~  394 (500)
                      |+.|+.+--.|... ...|...|.|.=.    .|.|.+||+|..|+=.
T Consensus       539 G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~  582 (783)
T KOG1226|consen  539 GKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCP  582 (783)
T ss_pred             eeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence            44444443334332 1356666767655    6778888877776654


No 56 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.66  E-value=0.25  Score=33.77  Aligned_cols=27  Identities=30%  Similarity=0.890  Sum_probs=21.0

Q ss_pred             CCCCCCCEEeeCCCceeecCCCCCCCc
Q psy7014         323 HTCSHGGACMNHGATFSCLCADGWFGP  349 (500)
Q Consensus       323 ~pC~ngg~Ci~~~~~~~C~C~~Gy~G~  349 (500)
                      ..|...+.|++..+.|.|.|.+||.|.
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            358888999999999999999999863


No 57 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=88.47  E-value=0.34  Score=33.10  Aligned_cols=18  Identities=50%  Similarity=1.198  Sum_probs=14.9

Q ss_pred             EEeeCCCCeeeeCCCCCC
Q psy7014         370 TCVPLTHSYECDCPPGRT  387 (500)
Q Consensus       370 ~C~~~~~g~~C~C~~G~~  387 (500)
                      .|++.+++|.|.|+.||.
T Consensus        11 ~C~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   11 ICVNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             EEEEETTSEEEE-STTEE
T ss_pred             CCccCCCceEeECCCCCE
Confidence            788889999999999874


No 58 
>KOG4260|consensus
Probab=87.76  E-value=0.51  Score=46.57  Aligned_cols=48  Identities=33%  Similarity=0.762  Sum_probs=37.3

Q ss_pred             CCCCCCCcccccccccccCC-CccCCCCCEEeeC---CCCeeeeCCCCCCCCCcCC
Q psy7014         342 CADGWFGPLCASRYNLCDST-RHNCSFGATCVPL---THSYECDCPPGRTGKFCEK  393 (500)
Q Consensus       342 C~~Gy~G~~C~~~i~~C~~~-p~pC~ngg~C~~~---~~g~~C~C~~G~~G~~Ce~  393 (500)
                      |++|..|+.|.    .|... ..||...|.|.-.   .++-.|.|.+||.|+.|..
T Consensus       132 Cp~gtyGpdCl----~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~  183 (350)
T KOG4260|consen  132 CPDGTYGPDCL----QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY  183 (350)
T ss_pred             cCCCCcCCccc----cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc
Confidence            78899999995    34222 1389999999843   5778999999999999865


No 59 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=87.52  E-value=0.53  Score=45.56  Aligned_cols=32  Identities=31%  Similarity=0.730  Sum_probs=23.9

Q ss_pred             cccccCCCccCCCCCEEeeCCCCeeeeCCCCCCC
Q psy7014         355 YNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTG  388 (500)
Q Consensus       355 i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G  388 (500)
                      .++|...+++|..  .|.+..++|.|.|+.||+.
T Consensus       187 ~~~C~~~~~~c~~--~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         187 PDLCATLSHVCQQ--VCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             chhhcCCCCCccc--eEEcCCCCEEeECCCCccC
Confidence            3555554456763  7999999999999999874


No 60 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=86.51  E-value=0.45  Score=41.67  Aligned_cols=31  Identities=29%  Similarity=0.747  Sum_probs=25.7

Q ss_pred             CCCCCCCEEee--CCCceeecCCCCCCCcccccc
Q psy7014         323 HTCSHGGACMN--HGATFSCLCADGWFGPLCASR  354 (500)
Q Consensus       323 ~pC~ngg~Ci~--~~~~~~C~C~~Gy~G~~C~~~  354 (500)
                      +-|.|| +|.-  +.+.+.|.|..||+|.+|+..
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~   83 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV   83 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCcccccccce
Confidence            468897 8964  457899999999999999853


No 61 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=85.63  E-value=0.75  Score=28.52  Aligned_cols=11  Identities=64%  Similarity=1.615  Sum_probs=6.3

Q ss_pred             CeeeeCCCCCC
Q psy7014         377 SYECDCPPGRT  387 (500)
Q Consensus       377 g~~C~C~~G~~  387 (500)
                      +|.|.|++||.
T Consensus         1 sy~C~C~~Gy~   11 (24)
T PF12662_consen    1 SYTCSCPPGYQ   11 (24)
T ss_pred             CEEeeCCCCCc
Confidence            35666666653


No 62 
>PHA02887 EGF-like protein; Provisional
Probab=85.43  E-value=0.65  Score=40.03  Aligned_cols=22  Identities=5%  Similarity=-0.292  Sum_probs=19.6

Q ss_pred             ccCCCceeecCCCCCCCCCCCC
Q psy7014          39 FDTNQPINQLLSIIFTNFLPPD   60 (500)
Q Consensus        39 ~~~~~~~c~c~~~~~G~~C~~~   60 (500)
                      .+-++|.|.|+.||+|.+|+..
T Consensus       103 ~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887        103 IDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             ccCCCceeECCCCcccCCCCcc
Confidence            5677999999999999999974


No 63 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=84.72  E-value=0.3  Score=37.81  Aligned_cols=41  Identities=32%  Similarity=0.891  Sum_probs=20.9

Q ss_pred             eeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeee------eCCCCCCCCCc
Q psy7014         338 FSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYEC------DCPPGRTGKFC  391 (500)
Q Consensus       338 ~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C------~C~~G~~G~~C  391 (500)
                      ++-.|...|.|+.|..   .|.+.          .+..+.|.|      .|.+||+|+.|
T Consensus        17 ~rv~C~~nyyG~~C~~---~C~~~----------~d~~ghy~Cd~~G~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYYGPNCSK---FCKPR----------DDSFGHYTCDSNGNKVCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEETTTT-E---E---E----------EETTEEEEE-SS--EEE-TTEESTTS
T ss_pred             EEEECCCCCCCccccC---CcCCC----------cCCcCCcccCCCCCCCCCCCCcCCCC
Confidence            4567889999999973   23221          012356666      48999999887


No 64 
>KOG1836|consensus
Probab=75.85  E-value=1.4  Score=54.34  Aligned_cols=80  Identities=19%  Similarity=0.173  Sum_probs=56.5

Q ss_pred             eeeeCCCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCcccccccee
Q psy7014         212 TISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVE  291 (500)
Q Consensus       212 ~~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~  291 (500)
                      .++.=++.| |.|.+.+....+.+.+|. . +  ..+......+...++++||+|+.....   ....-.+|.|||  ++
T Consensus      1611 ~~~~~~~~~-~~~~~~~~~~v~~~~~~~-~-~--~~~~~~~~~~~~~p~~~~~~~~s~~~~---~~~~~~~~~~~~--~~ 1680 (1705)
T KOG1836|consen 1611 IVSLLPGGC-HSVTSSTDPGVVQLEDDT-Y-T--VGEIPPPPADTQEPIKLGGYPSSLTTL---RIAVLKSFTGCI--FV 1680 (1705)
T ss_pred             hhhhcCCcc-eeeeeecCCccccccccc-e-e--cccCCCCchhccCCcccCCccccccce---eeecccccccce--EE
Confidence            334457789 999999999999998888 2 1  122233456778899999998643322   233456899999  88


Q ss_pred             eccccccccc
Q psy7014         292 LSAGNVGINL  301 (500)
Q Consensus       292 ing~~~~l~~  301 (500)
                      +++..+++..
T Consensus      1681 ~~~~~~~~~~ 1690 (1705)
T KOG1836|consen 1681 VMGIRVDVTL 1690 (1705)
T ss_pred             ecCCCCcHHH
Confidence            8887777654


No 65 
>KOG3546|consensus
Probab=74.35  E-value=11  Score=41.50  Aligned_cols=66  Identities=14%  Similarity=0.063  Sum_probs=45.0

Q ss_pred             CCCccEEEEEEEeCcEEEEEEcCCcceeeeCCCccccccCC--CceEEcccccccCcCCCCCcCCCCCccccccceeec
Q psy7014         217 SKRGGYTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTK--PMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELS  293 (500)
Q Consensus       217 dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~~~~~~~~L~~~--~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~in  293 (500)
                      .++| .++.+...+..+.|.||-.+-.+.........|.+.  .-||+|-.-.          .-...|.|-|.++.+.
T Consensus       156 ~~~w-~~~a~~v~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~----------~~~~~f~g~~~~l~v~  223 (1167)
T KOG3546|consen  156 VGQW-THLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGG----------ADPDKFQGVIAELKVR  223 (1167)
T ss_pred             hchh-hheeeeecCceEEEEechHHhcccchhccccceeecCCcceEEeccCC----------CChHhhhhhhhheeec
Confidence            4689 999999999999999997654443333333445554  3588875421          1224699999998885


No 66 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=72.40  E-value=2.3  Score=29.08  Aligned_cols=18  Identities=28%  Similarity=0.907  Sum_probs=15.5

Q ss_pred             EEeeCCCceeecCCCCCC
Q psy7014         330 ACMNHGATFSCLCADGWF  347 (500)
Q Consensus       330 ~Ci~~~~~~~C~C~~Gy~  347 (500)
                      .|++..+.|+|.|+.||.
T Consensus        11 ~C~~~~g~~~C~C~~Gy~   28 (36)
T PF14670_consen   11 ICVNTPGSYRCSCPPGYK   28 (36)
T ss_dssp             EEEEETTSEEEE-STTEE
T ss_pred             CCccCCCceEeECCCCCE
Confidence            788889999999999996


No 67 
>KOG1834|consensus
Probab=70.38  E-value=75  Score=35.59  Aligned_cols=147  Identities=14%  Similarity=0.126  Sum_probs=83.3

Q ss_pred             CceEEEEEEEEeeCC------CCCceEEEEecccCCCCCCCCeEEEEEECCEEEEEEEcCCceeEEEeechhhhhhhhhc
Q psy7014          87 IHHCFELKFRFVPNS------FDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFEHTYLFILSRLR  160 (500)
Q Consensus        87 ~~~~~~i~~~Frt~~------~~~~GlLly~~~~~~~~~~~df~~l~l~~G~l~~~~~~G~g~~~~~~~~~~~~~~~~l~  160 (500)
                      ....+.|+|..|--.      .+. --||-..++  .+-+..+.+|++..=|+.|.++-..|.            ++..|
T Consensus       364 l~dhFTlSfwMkHg~~p~~~~~ek-etIlCnsdk--~emnrhHyslyvh~Crl~fllr~d~~~------------~~~fR  428 (952)
T KOG1834|consen  364 LPDHFTLSFWMKHGPGPKDEQSEK-ETILCNSDK--TEMNRHHYSLYVHGCRLEFLLRRDAGA------------TSDFR  428 (952)
T ss_pred             CCCceEEEEeeecCCCCccccccc-eeEEecccc--cccccceeEEEEeccEEEEEEccCccc------------ccccc
Confidence            445677777765221      011 235555543  244567899999999999988775532            11222


Q ss_pred             cccccccccccccccccccceeeeecccccCCCCcccccccCCceEEecCceeeeCCCCccEEEEEEEeCcEEEEEEcCC
Q psy7014         161 SAQDTRLCCLPLHLILGVDFLCMSIYTSYLQPTGHMFVDTYRGPRRIFTPNTISAKSKRGGYTVRVGKNGQQCWLMVDNM  240 (500)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~dg~WwH~V~v~r~~~~~~L~VD~~  240 (500)
                      +|                +|.|                            +.-.+.|..| |.-.+....-.++|.|||.
T Consensus       429 pa----------------ef~W----------------------------kl~qVCD~EW-H~Y~ln~efp~VtlyvDG~  463 (952)
T KOG1834|consen  429 PA----------------EFHW----------------------------KLPQVCDNEW-HHYVLNVEFPDVTLYVDGK  463 (952)
T ss_pred             ch----------------heec----------------------------cchhhhhhhh-heeEEeecCceEEEEEcCc
Confidence            22                1211                            1124578899 9999999999999999996


Q ss_pred             ccee--eeCCCccccccCCCceEEcccccccCcCCCCCcCCCCCccccccceeecccc
Q psy7014         241 GNVT--SKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHSGFSGCIFDVELSAGN  296 (500)
Q Consensus       241 ~~~~--~~~~~~~~~L~~~~~lyIGG~~~~~~~~~~~~~~~~~gF~GCIr~v~ing~~  296 (500)
                      .-..  ..-......-.+.+.|.||.-=....   ........-|+|=+..+.+..+.
T Consensus       464 Sfep~~i~ddwplHpsk~~tqLvVGACW~g~~---~~~l~~aqfFrG~LasltlrsGk  518 (952)
T KOG1834|consen  464 SFEPPLITDDWPLHPSKIETQLVVGACWQGRQ---QKPLKLAQFFRGQLASLTLRSGK  518 (952)
T ss_pred             ccCCceeccCCccCcccccceeEEeeeccCcc---ccchhHHHHhhcccceeEEeccc
Confidence            5221  11112222333566788885411110   01122345688888877775433


No 68 
>KOG1836|consensus
Probab=67.84  E-value=4.7  Score=50.01  Aligned_cols=52  Identities=33%  Similarity=0.717  Sum_probs=38.0

Q ss_pred             cCCCCCCCcccccccccccCCCccCCCCCEEeeCC--CCeeee-CCCCCCCCCcCCC
Q psy7014         341 LCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLT--HSYECD-CPPGRTGKFCEKD  394 (500)
Q Consensus       341 ~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~--~g~~C~-C~~G~~G~~Ce~~  394 (500)
                      .|..||.|..=......|.+  .||.+++.|....  ....|. |+++|+|++|+.-
T Consensus       760 ~C~~GfYg~~~~~~~~dC~~--C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c  814 (1705)
T KOG1836|consen  760 QCVDGFYGLPDLGTSGDCQP--CPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEEC  814 (1705)
T ss_pred             hhcCCCCCccccCCCCCCcc--CCCCCChhhcCcCcccceecCCCCCCCcccccccC
Confidence            36667766544333333665  4799999998654  678898 9999999999985


No 69 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=65.39  E-value=3.9  Score=34.82  Aligned_cols=39  Identities=31%  Similarity=0.805  Sum_probs=24.4

Q ss_pred             ccCCCccCCCCCEEeeCC-----CCeeeeCCC-------------CCCCCCcCCCCC
Q psy7014         358 CDSTRHNCSFGATCVPLT-----HSYECDCPP-------------GRTGKFCEKDES  396 (500)
Q Consensus       358 C~~~p~pC~ngg~C~~~~-----~g~~C~C~~-------------G~~G~~Ce~~~~  396 (500)
                      |....+-|..+|.|+...     .=|.|.|.+             .|.|..|++..-
T Consensus         8 C~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkDv   64 (103)
T PF12955_consen    8 CENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKKDV   64 (103)
T ss_pred             HHHhccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccccc
Confidence            333334566677776652     346777765             477888888743


No 70 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=64.69  E-value=6.9  Score=33.36  Aligned_cols=31  Identities=29%  Similarity=0.608  Sum_probs=22.5

Q ss_pred             ccccCCCccCCCCCEEeeCCCCeeeeCCCCCCC
Q psy7014         356 NLCDSTRHNCSFGATCVPLTHSYECDCPPGRTG  388 (500)
Q Consensus       356 ~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G  388 (500)
                      +.|... ..|+..|.|... ....|.|.+||.-
T Consensus        78 d~Cd~y-~~CG~~g~C~~~-~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   78 DQCDVY-GFCGPNGICNSN-NSPKCSCLPGFEP  108 (110)
T ss_pred             cCCCCc-cccCCccEeCCC-CCCceECCCCcCC
Confidence            455544 479999999653 4567999999864


No 71 
>KOG0994|consensus
Probab=63.83  E-value=6.5  Score=46.14  Aligned_cols=58  Identities=29%  Similarity=0.577  Sum_probs=35.4

Q ss_pred             CCceeec-CCCCCCCcccccccccccCCCccCCCC--------CEEeeC--CCCeeeeCCCCCCCCCcCCC
Q psy7014         335 GATFSCL-CADGWFGPLCASRYNLCDSTRHNCSFG--------ATCVPL--THSYECDCPPGRTGKFCEKD  394 (500)
Q Consensus       335 ~~~~~C~-C~~Gy~G~~C~~~i~~C~~~p~pC~ng--------g~C~~~--~~g~~C~C~~G~~G~~Ce~~  394 (500)
                      ...+.|+ |..||.|..---.-..|.+  .||..+        ..|...  .....|.|..||+|.+|+.=
T Consensus       882 T~G~~CdrCl~GyyGdP~lg~g~~CrP--CpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C  950 (1758)
T KOG0994|consen  882 TTGHSCDRCLDGYYGDPRLGSGIGCRP--CPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC  950 (1758)
T ss_pred             ccccchhhhhccccCCcccCCCCCCCC--CCCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence            3456664 7777775432222234433  355433        245433  35788999999999999874


No 72 
>PF13385 Laminin_G_3:  Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=60.70  E-value=19  Score=31.13  Aligned_cols=46  Identities=13%  Similarity=0.189  Sum_probs=28.3

Q ss_pred             cceeeEEEEEeeCCCCc---EEEEcCCCCCCCeEEEEEE-CCEEEEEEEcCC
Q psy7014         445 LHEACIDLEIRPTKDKG---LLMYFGHPQKNSMMTLSLQ-GGVLELRVLMLG  492 (500)
Q Consensus       445 ~~~~~i~l~frT~~~~G---lLl~~~~~~~~dfi~l~l~-~G~l~~~~~~g~  492 (500)
                      ...++|++.||.....+   .+++  .....+.+.|.+. +|.+.+.+..++
T Consensus        21 ~~~fTi~~w~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~l~~~~~~~~   70 (157)
T PF13385_consen   21 SGSFTISFWVKPDSPSSSQSFVFM--DSSGSGGFGLFINNNGRLRFYIGNGG   70 (157)
T ss_dssp             GTEEEEEEEEEESS--SSEEEEEE--SSSSSEEEEEEEETTSEEEEEETTSE
T ss_pred             CCCEEEEEEEEeCCCCCCceEEEE--ecCCCCEEEEEEECCCEEEEEEeCCC
Confidence            36788999999886433   4343  1112347777777 577777766553


No 73 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=57.97  E-value=5.8  Score=37.34  Aligned_cols=63  Identities=27%  Similarity=0.734  Sum_probs=42.5

Q ss_pred             CCCCCCCEEeeCCCceeecCCCCCC---CcccccccccccC---CCccCCCCCEEeeCC-----CCeeeeCCCCCC
Q psy7014         323 HTCSHGGACMNHGATFSCLCADGWF---GPLCASRYNLCDS---TRHNCSFGATCVPLT-----HSYECDCPPGRT  387 (500)
Q Consensus       323 ~pC~ngg~Ci~~~~~~~C~C~~Gy~---G~~C~~~i~~C~~---~p~pC~ngg~C~~~~-----~g~~C~C~~G~~  387 (500)
                      ..|.| |..++-.+.|.|.|..||.   -..|+..+ .|..   .-.+|+.-+.|+...     ..|.|.|-.||.
T Consensus         6 T~CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE~kv-~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~   79 (197)
T PF06247_consen    6 TICKN-GYLIQMSNHFECKCNEGFVLKNENTCEEKV-ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYI   79 (197)
T ss_dssp             ---BT-EEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEE
T ss_pred             ccccC-CEEEEccCceEEEcCCCcEEccccccccce-ecCcccccCccccchhhhhcCCCcccceeEEEecccCce
Confidence            34555 4778888999999999997   45676554 4443   124899999998764     699999999985


No 74 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=56.31  E-value=5.7  Score=27.34  Aligned_cols=24  Identities=33%  Similarity=0.660  Sum_probs=15.3

Q ss_pred             cCCCCCEEeeCC-CCeeeeCCCCCC
Q psy7014         364 NCSFGATCVPLT-HSYECDCPPGRT  387 (500)
Q Consensus       364 pC~ngg~C~~~~-~g~~C~C~~G~~  387 (500)
                      +|..++.|.... +.+.|.|-+||.
T Consensus         6 ~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    6 KCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             ---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             cCCCCcccEEcCCCCEEEEeeCCcc
Confidence            577777887765 778888888874


No 75 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=56.09  E-value=6.2  Score=27.14  Aligned_cols=28  Identities=21%  Similarity=0.672  Sum_probs=19.9

Q ss_pred             CCCCCCCCCCEEeeCC-CceeecCCCCCC
Q psy7014         320 CHNHTCSHGGACMNHG-ATFSCLCADGWF  347 (500)
Q Consensus       320 C~~~pC~ngg~Ci~~~-~~~~C~C~~Gy~  347 (500)
                      |...+|..++.|.... +...|.|-.||.
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk   30 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYK   30 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence            6667888999999876 789999999996


No 76 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=52.88  E-value=9.2  Score=27.56  Aligned_cols=22  Identities=45%  Similarity=1.093  Sum_probs=15.9

Q ss_pred             EEeeCCCCeeeeCCCCCCCCCcCC
Q psy7014         370 TCVPLTHSYECDCPPGRTGKFCEK  393 (500)
Q Consensus       370 ~C~~~~~g~~C~C~~G~~G~~Ce~  393 (500)
                      .|..  ....|.|.++|+|+.|+.
T Consensus        12 ~C~~--~~G~C~C~~~~~G~~C~~   33 (49)
T PF00053_consen   12 TCDP--STGQCVCKPGTTGPRCDQ   33 (49)
T ss_dssp             SEEE--TCEEESBSTTEESTTS-E
T ss_pred             cccC--CCCEEeccccccCCcCcC
Confidence            4544  345899999999999985


No 77 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=51.53  E-value=3.7  Score=30.65  Aligned_cols=22  Identities=14%  Similarity=-0.087  Sum_probs=14.2

Q ss_pred             cCCCceeecCCCCCCCCCCCCC
Q psy7014          40 DTNQPINQLLSIIFTNFLPPDI   61 (500)
Q Consensus        40 ~~~~~~c~c~~~~~G~~C~~~~   61 (500)
                      ..|.|.|+|..-|+|+.|...+
T Consensus        32 ~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   32 ADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             ETTEE--EE-TTEESTTS-EE-
T ss_pred             ccCCccccccCCcCCCCcccCC
Confidence            3456999999999999998754


No 78 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=45.09  E-value=21  Score=25.93  Aligned_cols=16  Identities=38%  Similarity=1.167  Sum_probs=13.7

Q ss_pred             eeeeCCCCCCCCCcCC
Q psy7014         378 YECDCPPGRTGKFCEK  393 (500)
Q Consensus       378 ~~C~C~~G~~G~~Ce~  393 (500)
                      -.|.|.++++|..|+.
T Consensus        19 G~C~C~~~~~G~~C~~   34 (50)
T cd00055          19 GQCECKPNTTGRRCDR   34 (50)
T ss_pred             CEEeCCCcCCCCCCCC
Confidence            3789999999999984


No 79 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=44.47  E-value=95  Score=29.52  Aligned_cols=49  Identities=16%  Similarity=0.310  Sum_probs=37.3

Q ss_pred             cceeeEEEEEeeCCCCcE--EEEcCCC-CCCCeEEEEEECCEEEEEEEcCCC
Q psy7014         445 LHEACIDLEIRPTKDKGL--LMYFGHP-QKNSMMTLSLQGGVLELRVLMLGD  493 (500)
Q Consensus       445 ~~~~~i~l~frT~~~~Gl--Ll~~~~~-~~~dfi~l~l~~G~l~~~~~~g~~  493 (500)
                      +...+|.++|++.+.+++  ||-++.. ..+.|+.|.+.++.+-+.+.-..+
T Consensus        32 L~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~   83 (190)
T PF02973_consen   32 LEEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKG   83 (190)
T ss_dssp             -SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTT
T ss_pred             ccccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCC
Confidence            467799999999877774  6666554 357999999999998888876665


No 80 
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor.  Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids.  Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family.  Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins.  PHA agglutinates most mammalian red blood cell types by bindin
Probab=43.45  E-value=2.7e+02  Score=27.07  Aligned_cols=26  Identities=8%  Similarity=0.059  Sum_probs=19.4

Q ss_pred             eeCCCCccEEEEEEEeC--cEEEEEEcCC
Q psy7014         214 SAKSKRGGYTVRVGKNG--QQCWLMVDNM  240 (500)
Q Consensus       214 ~~~dg~WwH~V~v~r~~--~~~~L~VD~~  240 (500)
                      .+.+|++ |+|.|.+++  +.+.+.|+..
T Consensus       159 ~l~~g~~-~~v~I~Y~~~~~~L~V~l~~~  186 (236)
T cd06899         159 KLKSGKP-MQAWIDYDSSSKRLSVTLAYS  186 (236)
T ss_pred             cccCCCe-EEEEEEEcCCCCEEEEEEEeC
Confidence            3578999 999999995  5666666554


No 81 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=43.07  E-value=21  Score=34.42  Aligned_cols=34  Identities=24%  Similarity=0.781  Sum_probs=24.9

Q ss_pred             cccC-CccCC--CCCCCCCCEEeeCCCceeecCCCCCCC
Q psy7014         313 GQCG-TSQCH--NHTCSHGGACMNHGATFSCLCADGWFG  348 (500)
Q Consensus       313 ~~C~-~~~C~--~~pC~ngg~Ci~~~~~~~C~C~~Gy~G  348 (500)
                      ..|. .+.|.  +.+|.  ..|.+..+.|.|.|+.||+.
T Consensus       182 ~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         182 KICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             ccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence            3453 35665  34566  47999999999999999973


No 82 
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=41.88  E-value=70  Score=29.82  Aligned_cols=45  Identities=20%  Similarity=0.250  Sum_probs=34.2

Q ss_pred             ceeeEEEEEeeC-CCCcEEEEcCCCCCCCeEEEEEECCEEEEEEEc
Q psy7014         446 HEACIDLEIRPT-KDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLM  490 (500)
Q Consensus       446 ~~~~i~l~frT~-~~~GlLl~~~~~~~~dfi~l~l~~G~l~~~~~~  490 (500)
                      ..++|.+.||+. ...|.||-..+.+...++.|.|.++...+.|..
T Consensus        52 ~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~   97 (184)
T smart00210       52 EDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRY   97 (184)
T ss_pred             CCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEE
Confidence            567899999998 688888877665556799999987765555543


No 83 
>PF00139 Lectin_legB:  Legume lectin domain;  InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=39.77  E-value=2.4e+02  Score=27.35  Aligned_cols=29  Identities=14%  Similarity=0.151  Sum_probs=22.6

Q ss_pred             ceeeeCCCCccEEEEEEEeC--cEEEEEEcCC
Q psy7014         211 NTISAKSKRGGYTVRVGKNG--QQCWLMVDNM  240 (500)
Q Consensus       211 ~~~~~~dg~WwH~V~v~r~~--~~~~L~VD~~  240 (500)
                      ....+.+|+| |+|.|.++.  +.+.+.++..
T Consensus       160 ~~~~l~~g~~-~~v~I~Yd~~~~~L~V~l~~~  190 (236)
T PF00139_consen  160 PSFSLSDGKW-HTVWIDYDASTKRLSVYLDDN  190 (236)
T ss_dssp             EEHHHGTTSE-EEEEEEEETTTTEEEEEEEET
T ss_pred             ccccccCCcE-EEEEEEEcCCccEEEEEEecc
Confidence            3456789999 999999998  5666666665


No 84 
>PF14099 Polysacc_lyase:  Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=38.87  E-value=1.2e+02  Score=28.87  Aligned_cols=22  Identities=14%  Similarity=0.031  Sum_probs=17.2

Q ss_pred             CCCeEEEEEECCEEEEEEEcCC
Q psy7014         120 ITDHLAVSFIKGYVVLTWNLGS  141 (500)
Q Consensus       120 ~~df~~l~l~~G~l~~~~~~G~  141 (500)
                      ....++|.+.+|++.+.++.+.
T Consensus       112 ~~P~~~l~~~~~~l~~~~~~~~  133 (224)
T PF14099_consen  112 GSPPFALRIKGGRLYLRVRGDE  133 (224)
T ss_dssp             EEECEEEEEETTEEEEEEEEE-
T ss_pred             CCCcEEEEEeCCEEEEEEEcCC
Confidence            4567899999999998877765


No 85 
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind.  This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers.  Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=38.72  E-value=3.4e+02  Score=25.82  Aligned_cols=23  Identities=22%  Similarity=0.180  Sum_probs=19.2

Q ss_pred             CCccEEEEEEEe--CcEEEEEEcCCc
Q psy7014         218 KRGGYTVRVGKN--GQQCWLMVDNMG  241 (500)
Q Consensus       218 g~WwH~V~v~r~--~~~~~L~VD~~~  241 (500)
                      |+| |+|+|.++  .+.+.+.++...
T Consensus       154 g~~-~~v~I~Y~~~~~~L~v~l~~~~  178 (223)
T cd01951         154 GNE-HTVRITYDPTTNTLTVYLDNGS  178 (223)
T ss_pred             CCE-EEEEEEEeCCCCEEEEEECCCC
Confidence            789 99999999  477888888764


No 86 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=32.36  E-value=21  Score=26.79  Aligned_cols=33  Identities=24%  Similarity=0.523  Sum_probs=13.8

Q ss_pred             CCCCCCCEEeeC----CCceeecCCCCCCCccccccc
Q psy7014         323 HTCSHGGACMNH----GATFSCLCADGWFGPLCASRY  355 (500)
Q Consensus       323 ~pC~ngg~Ci~~----~~~~~C~C~~Gy~G~~C~~~i  355 (500)
                      .+|..+|....+    .+...|.|-.-|.|++|+..+
T Consensus        17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             S--TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred             CCcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence            345555554322    234567777777777776544


No 87 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=32.31  E-value=62  Score=23.41  Aligned_cols=20  Identities=45%  Similarity=1.112  Sum_probs=14.7

Q ss_pred             cCCCCCEEeeCCCCeeeeCCCCCC
Q psy7014         364 NCSFGATCVPLTHSYECDCPPGRT  387 (500)
Q Consensus       364 pC~ngg~C~~~~~g~~C~C~~G~~  387 (500)
                      .|..+..|++.    .|.|+.||.
T Consensus        27 qC~~~s~C~~g----~C~C~~g~~   46 (52)
T PF01683_consen   27 QCIGGSVCVNG----RCQCPPGYV   46 (52)
T ss_pred             CCCCcCEEcCC----EeECCCCCE
Confidence            47777788553    899999863


No 88 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=29.10  E-value=51  Score=23.48  Aligned_cols=18  Identities=0%  Similarity=-0.348  Sum_probs=15.6

Q ss_pred             CceeecCCCCCCCCCCCC
Q psy7014          43 QPINQLLSIIFTNFLPPD   60 (500)
Q Consensus        43 ~~~c~c~~~~~G~~C~~~   60 (500)
                      .-.|.|+.+++|+.|++-
T Consensus        17 ~G~C~C~~~~~G~~C~~C   34 (46)
T smart00180       17 TGQCECKPNVTGRRCDRC   34 (46)
T ss_pred             CCEEECCCCCCCCCCCcC
Confidence            457999999999999964


No 89 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=28.88  E-value=42  Score=28.63  Aligned_cols=23  Identities=30%  Similarity=0.846  Sum_probs=17.8

Q ss_pred             CCCCCCCCEEeeCC-----CceeecCCC
Q psy7014         322 NHTCSHGGACMNHG-----ATFSCLCAD  344 (500)
Q Consensus       322 ~~pC~ngg~Ci~~~-----~~~~C~C~~  344 (500)
                      .+-|..+|.|+...     +=|.|.|.+
T Consensus        12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~   39 (103)
T PF12955_consen   12 TNNCSGHGSCVKKYGSGGGDCFACKCKP   39 (103)
T ss_pred             ccCCCCCceEeeccCCCccceEEEEeec
Confidence            35699999999873     448999965


No 90 
>PF11250 DUF3049:  Protein of unknown function (DUF3049);  InterPro: IPR021410  This eukaryotic family of proteins has no known function. 
Probab=24.50  E-value=2e+02  Score=21.68  Aligned_cols=38  Identities=18%  Similarity=0.349  Sum_probs=27.7

Q ss_pred             EEEEeeCCCCcEEEEcC-CCCCCCeEEEEEECCEEEEEE
Q psy7014         451 DLEIRPTKDKGLLMYFG-HPQKNSMMTLSLQGGVLELRV  488 (500)
Q Consensus       451 ~l~frT~~~~GlLl~~~-~~~~~dfi~l~l~~G~l~~~~  488 (500)
                      .+.+|+...||=|.-.. .-...+++..+=.||+|.+.+
T Consensus        17 ~~~~r~~r~dGRLvl~~v~v~~~~~~~A~R~~GRL~L~~   55 (56)
T PF11250_consen   17 SVLMRPHREDGRLVLEEVRVPSHEYFHAEREDGRLRLQF   55 (56)
T ss_pred             cEEEEEEccCCEEEEEEEEcCCcceEEEEccCCEEEEEe
Confidence            46788888888555442 222367999888999999875


No 91 
>PF14607 GxDLY:  N-terminus of Esterase_SGNH_hydro-type
Probab=23.81  E-value=2.4e+02  Score=25.71  Aligned_cols=13  Identities=15%  Similarity=0.143  Sum_probs=10.1

Q ss_pred             CCCCccEEEEEEEe
Q psy7014         216 KSKRGGYTVRVGKN  229 (500)
Q Consensus       216 ~dg~WwH~V~v~r~  229 (500)
                      +||+| +-+.+.+-
T Consensus        91 ~~G~W-~~~~~g~p  103 (147)
T PF14607_consen   91 DDGKW-RFAGVGRP  103 (147)
T ss_pred             CCCCE-EEEEeccc
Confidence            38999 98887764


No 92 
>KOG1218|consensus
Probab=23.27  E-value=1e+02  Score=30.72  Aligned_cols=56  Identities=30%  Similarity=0.634  Sum_probs=37.7

Q ss_pred             eeecCCCCCCCcccccccccccCCCccCCCCCEEeeCCCCeee------eCCCCCCCCCcCCC
Q psy7014         338 FSCLCADGWFGPLCASRYNLCDSTRHNCSFGATCVPLTHSYEC------DCPPGRTGKFCEKD  394 (500)
Q Consensus       338 ~~C~C~~Gy~G~~C~~~i~~C~~~p~pC~ngg~C~~~~~g~~C------~C~~G~~G~~Ce~~  394 (500)
                      -.|.|+.||.|..|......|... ..|.+++.|......-.|      .|..++.|..|...
T Consensus       162 ~~c~c~~g~~g~~~~~~~~~c~~~-~~~~~g~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  223 (316)
T KOG1218|consen  162 GICTCQPGFVGVFCVESCSGCSPL-TACENGAKCNRSTGSCLCYPGPSGACKGGFHGCACLRM  223 (316)
T ss_pred             CceeccCCcccccccccCCCcCCC-cccCCCCeeeccccccccCCCCcccccCCccCCcCccc
Confidence            467799999999998776656654 478888899876542222      34444566666654


No 93 
>KOG1218|consensus
Probab=22.81  E-value=1.7e+02  Score=29.18  Aligned_cols=53  Identities=30%  Similarity=0.744  Sum_probs=31.5

Q ss_pred             eecCCCCCCCccccc---ccccccCCCccCCCCCEEeeCCCCeeeeCCCCCCCCCcCCCCC
Q psy7014         339 SCLCADGWFGPLCAS---RYNLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCEKDES  396 (500)
Q Consensus       339 ~C~C~~Gy~G~~C~~---~i~~C~~~p~pC~ngg~C~~~~~g~~C~C~~G~~G~~Ce~~~~  396 (500)
                      .|.|..++.+..|..   ....|...   |.+...|  ....-.|.|++||.|..|+....
T Consensus       125 ~c~~~~~~~~~~C~~~~~~g~~C~~~---c~~~~~~--~~~~~~c~c~~g~~g~~~~~~~~  180 (316)
T KOG1218|consen  125 ECRCGGGYIGEQCGEENLVGLKCQRD---CQCTGGC--DCKNGICTCQPGFVGVFCVESCS  180 (316)
T ss_pred             ceecCCcCccccccccCCCCCCccCC---CCCcccc--CCCCCceeccCCcccccccccCC
Confidence            566777777777765   12334332   3111111  12344788999999999988753


No 94 
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=22.09  E-value=2.5e+02  Score=26.45  Aligned_cols=46  Identities=17%  Similarity=0.136  Sum_probs=26.0

Q ss_pred             ccceeeEEEEEeeCC--CCcEEE-EcCCCCCCCeEEEEEECCEEEEEEE
Q psy7014         444 HLHEACIDLEIRPTK--DKGLLM-YFGHPQKNSMMTLSLQGGVLELRVL  489 (500)
Q Consensus       444 ~~~~~~i~l~frT~~--~~GlLl-~~~~~~~~dfi~l~l~~G~l~~~~~  489 (500)
                      .+..+++.+.+|+..  ..+.|| |.+..+.++++...-.+|.+.|.++
T Consensus        29 ~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i~   77 (201)
T cd00152          29 PLQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYIG   77 (201)
T ss_pred             ChhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEEc
Confidence            346788888888864  444455 5544333334333223567777664


No 95 
>PF07622 DUF1583:  Protein of unknown function (DUF1583);  InterPro: IPR011475  Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM. 
Probab=21.91  E-value=4.9e+02  Score=27.60  Aligned_cols=33  Identities=15%  Similarity=0.142  Sum_probs=26.9

Q ss_pred             eeeCCCCccEEEEEEEeCcEEEEEEcCCcceeee
Q psy7014         213 ISAKSKRGGYTVRVGKNGQQCWLMVDNMGNVTSK  246 (500)
Q Consensus       213 ~~~~dg~WwH~V~v~r~~~~~~L~VD~~~~~~~~  246 (500)
                      +.+++..| .+|++.+.+..+.|.+++.......
T Consensus        85 ~~l~~~~w-N~v~l~~~g~~v~l~LN~~~i~~~~  117 (399)
T PF07622_consen   85 LPLKVNAW-NRVRLQRRGDKVQLHLNGQLIYERP  117 (399)
T ss_pred             CCCCcccc-ceEEEEEeCCEEEEEeCCceeEecc
Confidence            34566789 9999999999999999998754443


Done!