Query         psy5025
Match_columns 533
No_of_seqs    194 out of 2040
Neff          8.9 
Searched_HMMs 46136
Date          Fri Aug 16 23:54:40 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5025.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5025hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 1.8E-50 3.8E-55  429.8  30.6  397   35-509  1227-1687(2531)
  2 KOG3516|consensus              100.0 1.2E-48 2.7E-53  416.4  42.7  375   86-513   790-1179(1306)
  3 KOG3514|consensus              100.0 3.9E-49 8.4E-54  412.0  37.2  445    4-532    98-631 (1591)
  4 KOG3514|consensus              100.0   2E-47 4.2E-52  399.3  34.6  343  107-511   860-1217(1591)
  5 KOG3516|consensus              100.0 1.8E-40 3.9E-45  354.2  40.3  459    2-530   257-966 (1306)
  6 KOG4289|consensus              100.0 8.8E-33 1.9E-37  295.5  24.0  272    5-295  1392-1754(2531)
  7 KOG1219|consensus              100.0   2E-27 4.3E-32  262.5  20.8  245   41-296  3641-3940(4289)
  8 PF00054 Laminin_G_1:  Laminin   99.9 9.2E-23   2E-27  178.1  18.3  129  345-507     1-131 (131)
  9 PF00054 Laminin_G_1:  Laminin   99.9 3.8E-22 8.3E-27  174.1  17.3  130  114-245     1-131 (131)
 10 smart00282 LamG Laminin G doma  99.9 1.6E-21 3.4E-26  171.6  18.8  132  338-501     2-133 (135)
 11 cd00110 LamG Laminin G domain;  99.9 6.7E-21 1.5E-25  170.7  20.2  150  299-501     2-151 (151)
 12 cd00110 LamG Laminin G domain;  99.8 5.1E-20 1.1E-24  165.0  18.5  150   87-240     2-151 (151)
 13 smart00282 LamG Laminin G doma  99.8 9.9E-20 2.1E-24  160.1  18.1  133  107-242     2-135 (135)
 14 PF02210 Laminin_G_2:  Laminin   99.8 3.1E-19 6.8E-24  155.0  16.6  126  345-503     1-127 (128)
 15 KOG1219|consensus               99.8 5.6E-19 1.2E-23  196.7  19.1  195  261-512  3647-3844(4289)
 16 PF02210 Laminin_G_2:  Laminin   99.8 9.4E-18   2E-22  145.7  16.1  126  114-242     1-128 (128)
 17 smart00210 TSPN Thrombospondin  99.1 7.3E-09 1.6E-13   95.8  16.7  125  336-500    51-180 (184)
 18 KOG3509|consensus               98.7 2.8E-08   6E-13  109.4   8.8  250   40-295   168-442 (964)
 19 smart00210 TSPN Thrombospondin  98.7 6.2E-07 1.3E-11   82.9  15.7  124  106-240    52-181 (184)
 20 PF00008 EGF:  EGF-like domain   98.7 5.9E-09 1.3E-13   66.3   1.4   31   47-77      1-32  (32)
 21 PF13385 Laminin_G_3:  Concanav  98.7 5.4E-07 1.2E-11   80.2  14.0  149  298-508     2-153 (157)
 22 smart00159 PTX Pentraxin / C-r  98.5 5.5E-06 1.2E-10   78.0  15.2  140  336-513    30-172 (206)
 23 PF00008 EGF:  EGF-like domain   98.4   2E-07 4.4E-12   59.2   2.3   30  263-292     1-32  (32)
 24 cd00152 PTX Pentraxins are pla  98.2 5.1E-05 1.1E-09   71.2  16.2  139  336-512    30-171 (201)
 25 smart00179 EGF_CA Calcium-bind  98.2 1.5E-06 3.2E-11   58.2   3.7   36   44-79      2-39  (39)
 26 PF13385 Laminin_G_3:  Concanav  98.2 3.3E-05 7.2E-10   68.5  13.6  142   89-245     6-152 (157)
 27 smart00159 PTX Pentraxin / C-r  98.2 8.6E-05 1.9E-09   69.9  16.7  148   90-245    15-166 (206)
 28 PF00354 Pentaxin:  Pentaxin fa  98.2 6.5E-05 1.4E-09   69.9  15.2  140  336-513    24-166 (195)
 29 cd00152 PTX Pentraxins are pla  98.1  0.0002 4.3E-09   67.2  16.7  146   90-245    15-166 (201)
 30 cd00054 EGF_CA Calcium-binding  98.0   6E-06 1.3E-10   54.6   3.7   36   44-79      2-38  (38)
 31 PF00354 Pentaxin:  Pentaxin fa  98.0 0.00031 6.7E-09   65.4  15.1  148   90-245     9-160 (195)
 32 cd00053 EGF Epidermal growth f  97.7 3.5E-05 7.6E-10   50.1   3.6   33   47-79      2-36  (36)
 33 PF02973 Sialidase:  Sialidase,  97.7  0.0007 1.5E-08   61.8  13.1  136  337-508    33-179 (190)
 34 smart00181 EGF Epidermal growt  97.6 6.2E-05 1.3E-09   48.9   3.5   32   47-79      2-35  (35)
 35 PF07645 EGF_CA:  Calcium-bindi  97.5 6.1E-05 1.3E-09   51.2   2.0   31   44-74      2-34  (42)
 36 PF02973 Sialidase:  Sialidase,  97.5  0.0033 7.1E-08   57.5  13.6  133  106-245    33-178 (190)
 37 smart00179 EGF_CA Calcium-bind  97.4 0.00018   4E-09   47.8   3.7   34  261-294     3-39  (39)
 38 cd00054 EGF_CA Calcium-binding  97.3 0.00036 7.7E-09   45.9   3.6   34  261-294     3-38  (38)
 39 KOG3509|consensus               97.2 0.00097 2.1E-08   74.4   8.8  209   41-281   403-612 (964)
 40 smart00560 LamGL LamG-like jel  97.1   0.012 2.7E-07   51.1  13.2   68  398-506    61-130 (133)
 41 PF07974 EGF_2:  EGF-like domai  97.1 0.00058 1.3E-08   43.1   3.1   27   50-78      6-32  (32)
 42 cd00053 EGF Epidermal growth f  97.1 0.00077 1.7E-08   43.5   3.7   31  263-293     2-35  (36)
 43 smart00181 EGF Epidermal growt  96.8  0.0017 3.8E-08   41.9   3.6   31  262-293     1-34  (35)
 44 PF12661 hEGF:  Human growth fa  96.5 0.00083 1.8E-08   33.1   0.2   13   66-78      1-13  (13)
 45 smart00560 LamGL LamG-like jel  96.3    0.12 2.7E-06   44.7  13.2   69  168-245    61-131 (133)
 46 KOG1834|consensus               96.1   0.084 1.8E-06   55.8  12.6  135  335-501   364-515 (952)
 47 PF07645 EGF_CA:  Calcium-bindi  95.9  0.0089 1.9E-07   40.5   3.1   29  261-289     3-34  (42)
 48 PF07974 EGF_2:  EGF-like domai  95.7   0.013 2.8E-07   37.0   2.9   26  266-293     6-32  (32)
 49 KOG1836|consensus               95.2  0.0062 1.4E-07   72.6   0.8  135  338-512  1557-1692(1705)
 50 PF12947 EGF_3:  EGF domain;  I  95.0   0.011 2.3E-07   38.5   1.0   27   50-76      6-32  (36)
 51 PF12947 EGF_3:  EGF domain;  I  94.4   0.029 6.2E-07   36.5   2.0   26  266-291     6-32  (36)
 52 PHA03099 epidermal growth fact  94.4   0.027 5.9E-07   46.9   2.3   33   49-82     50-84  (139)
 53 PHA02887 EGF-like protein; Pro  94.0   0.048   1E-06   44.7   2.9   36   45-81     84-124 (126)
 54 KOG1834|consensus               93.3     1.5 3.2E-05   46.9  12.8  156   88-244   346-519 (952)
 55 PHA03099 epidermal growth fact  93.1   0.095 2.1E-06   43.8   3.2   35  266-301    51-88  (139)
 56 PHA02887 EGF-like protein; Pro  93.0   0.092   2E-06   43.1   2.9   35  262-296    85-124 (126)
 57 PF12662 cEGF:  Complement Clr-  92.3   0.091   2E-06   30.7   1.5   17   64-80      1-21  (24)
 58 PF06439 DUF1080:  Domain of Un  91.2    0.29 6.3E-06   44.8   4.5   91  338-460    54-157 (185)
 59 PF12946 EGF_MSP1_1:  MSP1 EGF   90.9   0.093   2E-06   34.0   0.6   32   47-78      2-36  (37)
 60 KOG1214|consensus               90.2    0.37 7.9E-06   52.6   4.7   42   35-76    814-859 (1289)
 61 smart00051 DSL delta serrate l  90.1    0.49 1.1E-05   35.1   4.0   36   39-78     27-63  (63)
 62 PF14670 FXa_inhibition:  Coagu  90.1    0.21 4.7E-06   32.4   1.8   18   56-73     10-27  (36)
 63 KOG3546|consensus               88.8       1 2.2E-05   47.8   6.5  127  336-501    86-223 (1167)
 64 PF06439 DUF1080:  Domain of Un  87.0    0.69 1.5E-05   42.3   3.8  101   92-196    43-155 (185)
 65 PF14099 Polysacc_lyase:  Polys  86.5      10 0.00022   35.9  11.6   67  363-456   113-183 (224)
 66 KOG3546|consensus               86.0     2.7 5.9E-05   44.7   7.7  129  106-243    87-226 (1167)
 67 KOG1225|consensus               84.7    0.67 1.5E-05   49.2   2.7   40   35-79    270-310 (525)
 68 KOG1217|consensus               84.2    0.84 1.8E-05   48.4   3.3   35   44-78    271-306 (487)
 69 KOG1217|consensus               83.6    0.94   2E-05   48.0   3.3   47   35-81    157-208 (487)
 70 cd01475 vWA_Matrilin VWA_Matri  82.4     1.1 2.5E-05   42.5   3.1   35   39-75    181-218 (224)
 71 KOG1225|consensus               81.0     1.3 2.9E-05   47.0   3.2   41   35-80    239-280 (525)
 72 KOG4260|consensus               79.6     1.3 2.7E-05   42.3   2.2   59   23-81    119-184 (350)
 73 PF00053 Laminin_EGF:  Laminin   78.5     1.6 3.6E-05   30.3   2.0   24   56-81     11-34  (49)
 74 KOG1214|consensus               76.9     1.9   4E-05   47.4   2.8   31  261-291   828-859 (1289)
 75 PF14670 FXa_inhibition:  Coagu  76.9     1.9 4.2E-05   28.0   1.8   20  271-290    10-29  (36)
 76 KOG1836|consensus               76.7     2.8 6.1E-05   51.0   4.4  105  134-245  1581-1687(1705)
 77 PF00954 S_locus_glycop:  S-loc  75.3      10 0.00022   31.4   6.4   32   44-76     77-109 (110)
 78 KOG0994|consensus               68.3      16 0.00034   42.2   7.3  136   45-180   508-690 (1758)
 79 cd00055 EGF_Lam Laminin-type e  64.8       6 0.00013   27.7   2.3   22   58-81     14-35  (50)
 80 PF09064 Tme5_EGF_like:  Thromb  62.8     6.6 0.00014   24.9   1.9   13  278-290    16-28  (34)
 81 PF02057 Glyco_hydro_59:  Glyco  59.4 1.9E+02  0.0041   32.1  13.4   60  363-456   578-637 (669)
 82 PF12955 DUF3844:  Domain of un  59.2       7 0.00015   31.9   2.0   37   45-81      6-62  (103)
 83 cd01475 vWA_Matrilin VWA_Matri  56.2     9.7 0.00021   36.1   2.9   29  261-290   188-218 (224)
 84 PF14099 Polysacc_lyase:  Polys  54.1      53  0.0012   30.9   7.6   59  134-192   113-181 (224)
 85 PF09264 Sial-lect-inser:  Vibr  53.0      98  0.0021   28.2   8.3   83  339-455    34-119 (198)
 86 cd01951 lectin_L-type legume l  52.4   2E+02  0.0044   26.9  12.1   10  398-407   154-163 (223)
 87 KOG4260|consensus               51.5     7.7 0.00017   37.2   1.3   35   41-75    232-269 (350)
 88 PF00954 S_locus_glycop:  S-loc  49.9      15 0.00033   30.4   2.8   29  261-290    78-108 (110)
 89 cd00413 Glyco_hydrolase_16 gly  49.4 2.1E+02  0.0046   26.3  12.3   30  396-457   140-169 (210)
 90 PF12946 EGF_MSP1_1:  MSP1 EGF   48.3      16 0.00034   23.9   1.9   28  263-290     2-31  (37)
 91 PF04863 EGF_alliinase:  Alliin  47.7     6.6 0.00014   27.8   0.2   32   51-82     18-53  (56)
 92 smart00180 EGF_Lam Laminin-typ  47.7      13 0.00027   25.5   1.6   18   64-81     17-34  (46)
 93 PF07622 DUF1583:  Protein of u  43.7      28 0.00061   35.4   3.9   31    4-34     83-113 (399)
 94 PF01683 EB:  EB module;  Inter  39.9      38 0.00083   23.6   3.1   26   45-74     20-46  (52)
 95 PF12955 DUF3844:  Domain of un  37.3      29 0.00062   28.5   2.3   22  266-287    13-40  (103)
 96 cd06899 lectin_legume_LecRK_Ar  36.3 3.9E+02  0.0084   25.4  12.0   13  395-407   160-172 (236)
 97 PF07953 Toxin_R_bind_N:  Clost  35.4 3.5E+02  0.0077   24.7  12.2   88  336-457    54-156 (195)
 98 PTZ00334 trans-sialidase; Prov  31.9      68  0.0015   36.2   4.9   78  393-507   638-715 (780)
 99 KOG1226|consensus               31.3      43 0.00094   37.1   3.2   27   51-81    556-582 (783)
100 PF02057 Glyco_hydro_59:  Glyco  29.1 5.3E+02   0.011   28.8  10.9   56  135-192   579-635 (669)
101 cd01951 lectin_L-type legume l  28.7 4.1E+02  0.0088   24.8   9.2   23  168-190   154-178 (223)
102 PF05910 DUF868:  Plant protein  27.5 1.3E+02  0.0027   29.5   5.4   38  441-480   173-210 (274)
103 PF03425 CBM_11:  Carbohydrate   26.5 1.7E+02  0.0037   26.5   5.9   49  350-409    73-121 (178)
104 PTZ00334 trans-sialidase; Prov  24.8 5.7E+02   0.012   29.2  10.4   74  165-245   640-715 (780)
105 smart00289 WR1 Worm-specific r  23.0      78  0.0017   20.2   2.2   23   46-70     15-38  (38)
106 cd02178 GH16_beta_agarase Beta  22.7 1.9E+02  0.0042   27.9   5.9   29  397-457   178-207 (258)
107 PF08787 Alginate_lyase2:  Algi  22.2 6.8E+02   0.015   23.8  12.4   52  371-455   135-189 (236)
108 PF09313 DUF1971:  Domain of un  20.7 4.2E+02  0.0091   20.7   7.1   47  361-407    22-74  (82)
109 KOG0994|consensus               20.2   1E+02  0.0023   36.0   3.6   38   44-81    903-950 (1758)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=1.8e-50  Score=429.83  Aligned_cols=397  Identities=25%  Similarity=0.533  Sum_probs=314.2

Q ss_pred             ceee-cCCCC--CCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccccccc-------c-------------------
Q psy5025          35 IKGR-SKECL--EDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDL-------Q-------------------   85 (533)
Q Consensus        35 ~~G~-~~~C~--~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~-------~-------------------   85 (533)
                      ++|| +.-|+  +|.|-+.||.|+|+|....++|+|.|.++|+|.+||.+...       |                   
T Consensus      1227 PpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp 1306 (2531)
T KOG4289|consen 1227 PPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCP 1306 (2531)
T ss_pred             CCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCC
Confidence            7888 88888  79999999999999999999999999999999999987421       1                   


Q ss_pred             ------------ccccCCceeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcC
Q psy5025          86 ------------VPQFNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLG  153 (533)
Q Consensus        86 ------------~~~F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g  153 (533)
                                  +.+|.+.||+.++.....+  +++++|+|.|...+|+|+|.|+.   +.||++|++.++++++.|+.|
T Consensus      1307 ~ge~e~prC~v~trSFp~~sfv~frglrqRf--h~TlslsfaT~~~nGlL~ynGne---khDFvalevVd~qvqltfS~G 1381 (2531)
T KOG4289|consen 1307 YGEFEDPRCEVTTRSFPPESFVTFRGLRQRF--HFTLSLSFATIERNGLLLYNGNE---KHDFVALEVVDEQVQLTFSAG 1381 (2531)
T ss_pred             CcccCCCceEEEeeccCchheEEEeccccce--EEEEEEEEEEeeecceEEecCCc---ccceEeeeeeeeeEEEEEecc
Confidence                        5579999999999876655  78999999999999999999954   499999999999999999999


Q ss_pred             CCceEEE--eeccccCCCcEEEEEEEECCEEEEEEcCccCcee-------------eCCCCceeeecCCCEEEcCcCCCC
Q psy5025         154 TGAATLR--SSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQI-------------LSPGPFTQLSLSLSLYLGGVPDYN  218 (533)
Q Consensus       154 ~~~~~v~--s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~-------------~~~~~~~~l~~~~~l~vGG~p~~~  218 (533)
                      ....++.  .+..++||+||+|.++..++.+.++||++.....             ...+....+++.+++++||+|...
T Consensus      1382 es~t~v~p~Vp~gvsDGqWHtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~f 1461 (2531)
T KOG4289|consen 1382 ESTTTVSPDVPGGVSDGQWHTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETF 1461 (2531)
T ss_pred             cccceecCCCCCCcccCceeEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcc
Confidence            6665554  3578999999999999999999999999875432             223456678888999999999654


Q ss_pred             ccCCCcccccCceeeeeeEEECcEEcccccccccC-CCccCCC---CCCCCCCC-CCCeeeecCCceEEecCCCCCcCcc
Q psy5025         219 IVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQG-VNVVNCA---HPCITQPC-GSGSCVPLYDGYKCVCTQSCKESHL  293 (533)
Q Consensus       219 ~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~-~~~~~C~---~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~~c  293 (533)
                      .     .....|.|||+++.++++.+| ++..... ....+|+   +.|.+.+| |+|+|++.|++|.|+||.++.|..|
T Consensus      1462 p-----v~~k~FvGCmrdLsvD~~~VD-ma~fianngt~eGC~ark~fCdsg~C~n~g~CvnrWg~~~C~CP~~fggk~c 1535 (2531)
T KOG4289|consen 1462 P-----VIEKQFVGCMRDLSVDGRDVD-MATFIANNGTHEGCKARKNFCDSGQCSNGGTCVNRWGGFSCECPLGFGGKGC 1535 (2531)
T ss_pred             h-----hhHhHhhhhhhhccccccccc-HHHHHhhcCcccCchhhhcccCCCccCCCCeeecccCcEeecCccccCCcch
Confidence            2     233789999999999999999 4443332 2357894   78999999 9999999999999999999999888


Q ss_pred             cce--eeeeecCCCeeEEecCccccccccccceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEE
Q psy5025         294 TEV--MCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIH  371 (533)
Q Consensus       294 ~~~--~~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~  371 (533)
                      ...  .+-.| .|.|.+.+...+.                  ..+..+.++|.|||++.+|+|+.+...+. .-+.+.|.
T Consensus      1536 ~~~m~~pq~f-rG~sl~sw~~~~~------------------~vSvPwylsl~FRTr~ad~vl~~~~~~~r-st~~lqld 1595 (2531)
T KOG4289|consen 1536 CQGMAHPQHF-RGHSLVSWEGLPS------------------QVSVPWYLSLMFRTRRADGVLMQAEFGGR-STYNLQLD 1595 (2531)
T ss_pred             hhccCCchhc-cccceeeecCCCc------------------ceecceEEEEEEEeeccccEEEEEEeCCC-ceEEEEEc
Confidence            653  33455 7788777764433                  23567899999999999999999875554 44889999


Q ss_pred             CCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCc
Q psy5025         372 NGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTD  451 (533)
Q Consensus       372 ~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~  451 (533)
                      +|+|.+.+  +...  +.+...+++||+||++.+..                                ... ..++.|..
T Consensus      1596 ~g~l~~~v--~~s~--v~L~~~~vtdg~Wh~~~i~l--------------------------------~~d-~~~t~d~g 1638 (2531)
T KOG4289|consen 1596 DGTLKYNV--GDSS--VELPAPRVTDGHWHHLVIEL--------------------------------EAD-SVATLDYG 1638 (2531)
T ss_pred             CCEEEEEe--cCce--EEccCccccCCchhheeeee--------------------------------ccC-eEEEEech
Confidence            99987765  3333  34478899999999999998                                433 45555543


Q ss_pred             -eeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEEEEEcCceEEeec
Q psy5025         452 -YSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSNDYQVKLP  509 (533)
Q Consensus       452 -~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~~~  509 (533)
                       ....... + ...++..+ ||+||+|...      ...+|.|||+++++ ++...-.+
T Consensus      1639 ~~~aea~~-g-l~gl~l~s-l~vGgap~~g------~p~gf~GCiqgV~v-~g~~~l~~ 1687 (2531)
T KOG4289|consen 1639 IYQAEAKA-G-LSGLNLES-LYVGGAPATG------VPRGFRGCIQGVRV-GGVSILVP 1687 (2531)
T ss_pred             hhhhhhhc-C-CCCceeeE-EEEccccCCC------ccccchhhhhceEE-CCEeeccc
Confidence             2222111 1 33455554 9999999542      46799999999999 66655443


No 2  
>KOG3516|consensus
Probab=100.00  E-value=1.2e-48  Score=416.42  Aligned_cols=375  Identities=24%  Similarity=0.416  Sum_probs=289.2

Q ss_pred             ccccC-CceeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECC-EEEEEEEcCCCceEEE--e
Q psy5025          86 VPQFN-GSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDR-YVDFTFDLGTGAATLR--S  161 (533)
Q Consensus        86 ~~~F~-g~syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G-~l~~~~~~g~~~~~v~--s  161 (533)
                      .++|. +.+|+.+++.....  ...|+|.|||+.+.|++|..-.    ..||+.|+|... .+.|.++.|+++..++  +
T Consensus       790 svSF~~~~syL~fp~f~~~~--saDIsf~FrTt~~~gvflen~g----~~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s  863 (1306)
T KOG3516|consen  790 SVSFHTGASYLHFPPFHNEL--SADISFFFRTTASSGVFLENHG----INDFIRLELSSPVEVTFAFDVGNGPSQLTVRS  863 (1306)
T ss_pred             ceEeecCcceeecCcccCcc--cccEEEEEEecCCceEeeeccC----CCceEEEEEcCCCceEEEEEcCCCceeEEEcC
Confidence            44565 45799998876654  6799999999999999999854    489999999875 8999999999886554  6


Q ss_pred             eccccCCCcEEEEEEEECCEEEEEEcCccCceeeCC-CCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEEC
Q psy5025         162 SNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSP-GPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN  240 (533)
Q Consensus       162 ~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~-~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in  240 (533)
                      +..+||++||+|+++++.+.+.|+||+........+ .....+.+.+++||||....         ..+|.||||.+++|
T Consensus       864 ~t~~nD~qWH~V~~Ern~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vGgt~~~---------~~gF~GCIRsl~LN  934 (1306)
T KOG3516|consen  864 PTELNDNQWHQVRAERNSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVGGTVSR---------QRGFLGCIRSLQLN  934 (1306)
T ss_pred             CcccCCCceEEEEEEeccccceEEEcCcccceecCCCCCEEEEEeccceeccccccC---------cCcceeeeeeeeec
Confidence            678999999999999999999999999765554433 34667888899999996643         37999999999999


Q ss_pred             cEEcccccccccCC-CccCCCCCCCCCCC-CCCeeeecCCceEEecCC-CCCcCcccceeeeeecCCCeeEEecCccccc
Q psy5025         241 NHELNILGEAIQGV-NVVNCAHPCITQPC-GSGSCVPLYDGYKCVCTQ-SCKESHLTEVMCASFDHGASFLHYLDPEIIH  317 (533)
Q Consensus       241 ~~~l~~~~~~~~~~-~~~~C~~~C~~~pC-ngg~C~~~~~~~~C~C~~-g~~G~~c~~~~~~~f~~~~s~~~~~~~~~~~  317 (533)
                      |+.+++...+.... ...+|+..|++.|| |||+|++.|++|+|+|.. .|.|+.|.....+.| ..++++.|+.....+
T Consensus       935 Gv~ldLe~ra~~~~gv~~GC~GhCss~~C~NGG~Cvery~gytCDCs~Tay~Gp~Cs~eig~~f-e~gs~i~y~fq~~~~ 1013 (1306)
T KOG3516|consen  935 GVMLDLEYRAYGTAGVSPGCEGHCSSYPCLNGGHCVERYDGYTCDCSRTAYDGPFCSKEIGVFF-ERGSSIRYNFQKPMR 1013 (1306)
T ss_pred             ceeeeehhhhccCCcccCCCccccccccccCCCEEEEecCceeeccccCcCCCCccccccceEe-cCCceEEEeccchHH
Confidence            99999643332222 24689999999999 999999999999999997 799999998888888 677888887765431


Q ss_pred             cccc--cceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEE-CCEEEEEEEcCC-ceEEEeeCce
Q psy5025         318 SSGL--LHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIH-NGYAHVMYNLGN-GEVLLENNQT  393 (533)
Q Consensus       318 ~~~~--~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~-~g~l~~~~~~g~-~~~~~~~~~~  393 (533)
                      ....  .+..+ ..  .-.....-.|.|.|+|+...++|||..+ ..++|+.|-|. ||.|.+++.+|. .+..+.+...
T Consensus      1014 ~a~~~~~~~~~-~~--~~~~~~~e~i~~sftTt~~ps~LLfvss-F~~~y~~V~v~~nGsLq~ry~lg~~e~~~~~~~~k 1089 (1306)
T KOG3516|consen 1014 SAVFESSRVKQ-KL--EIEINPNEEINFSFTTTRAPSDLLFVSS-FTDDYLAVLVKDNGSLQTRYMLGFREPFEYQFKDK 1089 (1306)
T ss_pred             Hhhhhhhhhhh-cc--ccccCccceEEEEEEeccCceEEEEeec-cccceEEEEEeCCCceEEEEecCCcCceEEecccc
Confidence            1100  00000 00  1112344689999999999999998764 45789999886 899999999987 5667777888


Q ss_pred             eecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEE
Q psy5025         394 KINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYV  473 (533)
Q Consensus       394 ~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~l  473 (533)
                      ++.||+.|+|.+.|                                ..+.+.|+||+.......... ...++....|++
T Consensus      1090 n~~~gq~H~i~i~r--------------------------------~~~~~~i~vD~y~~~~y~~~~-~~~~~~~ksl~l 1136 (1306)
T KOG3516|consen 1090 NIALGQPHDINITR--------------------------------GPRTVFLEVDGYLKVEYTFSI-DVDFQSPKSLTL 1136 (1306)
T ss_pred             cccCCCceEEEEec--------------------------------CCceEEEEecCccceeeeccc-ceeecccchhhc
Confidence            99999999999999                                889999999997766544322 233444444666


Q ss_pred             cCCCCcccc---cccccccCceeeEEEEEEcCceEEeecccCc
Q psy5025         474 GGVEDIEEL---TRSKYHHSFKGWISNLILSNDYQVKLPLSRS  513 (533)
Q Consensus       474 Gg~~~~~~~---~~~~~~~~f~GCi~~l~i~~~~~l~~~~~~~  513 (533)
                      |-+-.....   .......+|.||++.|++++-.||+.+....
T Consensus      1137 g~v~e~~~~d~~~~k~~t~gF~GClS~Vqf~~vaPLK~~~~~~ 1179 (1306)
T KOG3516|consen 1137 GPVTETANIDHEISKYNTPGFGGCLSRVQFNDVAPLKAAFRTE 1179 (1306)
T ss_pred             cceeeccCCChhHHhhcCCCccceeeEEEccccccchhccccC
Confidence            654333221   1123567899999999998888998877655


No 3  
>KOG3514|consensus
Probab=100.00  E-value=3.9e-49  Score=412.04  Aligned_cols=445  Identities=22%  Similarity=0.405  Sum_probs=346.8

Q ss_pred             CCccccCCCceEEEEEEeCceeEEEEcCCCC-------------------------------------------------
Q psy5025           4 SDNQVILDDWNTVRVFRHRWDAWIQLNDGNK-------------------------------------------------   34 (533)
Q Consensus         4 s~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~-------------------------------------------------   34 (533)
                      -..+|+|++||+|++.|+-..+.|.||+...                                                 
T Consensus        98 ~~~~i~D~~WH~v~i~r~~e~t~L~vDgv~~~~~~~~~~f~fg~iasdvfVGGlP~~~~la~l~lp~v~yep~frg~~rn  177 (1591)
T KOG3514|consen   98 RRVRIDDDKWHTVTIFRSWENTKLEVDGVLVFKILNQRSFVFGNIASDVFVGGLPNMHMLAVLSLPLVRYEPRFRGNVRN  177 (1591)
T ss_pred             ecceecCCceeEEEEEeccccceEEechhhhhhhhhcceeeeeeeehheeecCCChHHhhhhhcCcccccccccCcccee
Confidence            3468999999999999999999999999443                                                 


Q ss_pred             ------ceee----------cCCCCC------CCCC-CCCCCCCCeeeeCCCCceecCCCCCCCCCccccccccccccCC
Q psy5025          35 ------IKGR----------SKECLE------DKCN-KTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDLQVPQFNG   91 (533)
Q Consensus        35 ------~~G~----------~~~C~~------d~C~-~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g   91 (533)
                            +.|.          ..+|..      ..|. ..+|+|+|.|.....+.+|.|...+.|++||+.-....+.|.|
T Consensus       178 l~y~~~p~g~t~~q~l~~~~d~~c~d~~~~~~~~~~~~~~c~~~g~c~s~d~gp~c~c~~~~dgq~cekeK~~~eaTF~G  257 (1591)
T KOG3514|consen  178 LMYRQYPQGVTSPQLLEVGTDTNCDDHCKSKSMSSREQFVCLNDGECYSSDDGPHCDCQFDHDGQNCEKEKNDGEATFGG  257 (1591)
T ss_pred             eeeecCCCCcCChhhhhcccCCCCcCCCCCccccccccceeccCCeEecCCCCCccccccccCccccccccCcceEEecC
Confidence                  1111          123331      2233 3689999999999999999999999999999988777889999


Q ss_pred             ceeEEEcCCcccc-cceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE---eeccccC
Q psy5025          92 SSYLRYQGLGEQY-LSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR---SSNPISL  167 (533)
Q Consensus        92 ~syl~~~~~~~~~-~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~---s~~~lnD  167 (533)
                      +.|+.|+....+. +....|+|.|||.+++|+|||++..    .||+.|.|++|.+.+...++++..++.   ...+++|
T Consensus       258 ~ef~~YDls~npI~s~~d~itl~FrT~q~ngllfytG~~----~dYlnlaL~dGaV~l~~~l~~g~~e~~~~p~~~rfdD  333 (1591)
T KOG3514|consen  258 DEFVGYDLSQNPIRSKKDNITLTFRTVQGNGLLFYTGDE----KDYLNLALQDGAVSLSSKLDGGDAEIIRMPNSFRFDD  333 (1591)
T ss_pred             ceEEEeeccCCcccccccceEEEEEEecCceeEEEccCC----cceeeEeecCCcEEEEEecCCccceeEEccccccccC
Confidence            9999998655443 3567899999999999999999875    799999999999999999987764432   4578999


Q ss_pred             CCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcccc
Q psy5025         168 GEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNIL  247 (533)
Q Consensus       168 g~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~  247 (533)
                      ..||.|.++|+-.-+.+.||+.........+.+.++..+.-+|+||.|....++..   ...|.||++++.+....+.+.
T Consensus       334 ~~WH~V~v~R~~~m~t~~VDg~~t~~~~~a~~~tmlsss~~fyvgg~~~~~~l~gs---rVsF~GClkkV~y~~d~~rl~  410 (1591)
T KOG3514|consen  334 DSWHTVIVERSLQMMTLIVDGRRTEIRQYAPELTMLSSSDFFYVGGSPNTADLPGS---RVSFMGCLKKVVYKNDDTRLE  410 (1591)
T ss_pred             CcceEEEEEeeeEEEEEEEccEEecccccccceeEeeccceEEecCCCCccccCCC---ceeeeeeeeeeEeccCceeeh
Confidence            99999999999999999999988777777778888887777999999987665432   234999999999976654431


Q ss_pred             cc--cccCCCccCCCCCCCCCCCCCCeeeecCCceEEecCCCCCcCcccceeeeeecCCCeeEEecCcccccccccccee
Q psy5025         248 GE--AIQGVNVVNCAHPCITQPCGSGSCVPLYDGYKCVCTQSCKESHLTEVMCASFDHGASFLHYLDPEIIHSSGLLHYK  325 (533)
Q Consensus       248 ~~--~~~~~~~~~C~~~C~~~pCngg~C~~~~~~~~C~C~~g~~G~~c~~~~~~~f~~~~s~~~~~~~~~~~~~~~~~~~  325 (533)
                      -.  +.++.....               +...-+|.|  +.+      ....+++|....||+.++.|+..         
T Consensus       411 L~~LAk~g~~~~k---------------~~G~l~y~C--~n~------~~~DpvtFtt~es~l~LP~Wnt~---------  458 (1591)
T KOG3514|consen  411 LSRLAKQGDSKMK---------------TEGDLSYSC--ENV------AQLDPVTFTTPESYLTLPRWNTK---------  458 (1591)
T ss_pred             hhHHhhcCCceeE---------------eeceEEEec--CCC------CccCceeeecccceeeccccccC---------
Confidence            11  111111000               000112333  211      12356899889999999988753         


Q ss_pred             eeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCC--CCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEE
Q psy5025         326 FQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQ--SKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRL  403 (533)
Q Consensus       326 ~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~--~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v  403 (533)
                                 ..-+|+|.|||+++||+|||.+..+  ..+|++++|.||+|.+.+++|++.+++..+..+++||+||+|
T Consensus       459 -----------~~gSiSf~FRTtepnGlil~~~g~~~~~~d~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv  527 (1591)
T KOG3514|consen  459 -----------KSGSISFDFRTTEPNGLILFHGGPQANATDYFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHV  527 (1591)
T ss_pred             -----------CcceeEEEEeecCCCceEEEccCcccccccEEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEE
Confidence                       3468999999999999999987554  379999999999999999999999999988999999999999


Q ss_pred             EEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccc-
Q psy5025         404 TVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEEL-  482 (533)
Q Consensus       404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~-  482 (533)
                      .+.|                                +++...+.||...... ..++..+.|+.++++|||-++..... 
T Consensus       528 ~l~R--------------------------------~gR~gsvsVd~~~~df-~tpG~s~iL~ld~~mylG~~~n~l~~P  574 (1591)
T KOG3514|consen  528 DLQR--------------------------------DGRTGSVSVDAIKTDF-STPGDSEILDLDDPMYLGEVPNNLVYP  574 (1591)
T ss_pred             Eeec--------------------------------cCccceEEEeeeecCc-cCCCcceeEeecCceeeccCCCCccCc
Confidence            9999                                9999999999987776 35677788999999999966655322 


Q ss_pred             ---cccccccCceeeEEEEEEcCceEEeecccCc--CCCCcccceee---ecCCceee
Q psy5025         483 ---TRSKYHHSFKGWISNLILSNDYQVKLPLSRS--QGSSCQLTCET---LNTGQVTC  532 (533)
Q Consensus       483 ---~~~~~~~~f~GCi~~l~i~~~~~l~~~~~~~--~~~~c~~~~~~---~~~~~~~~  532 (533)
                         |......+|.||||+|+| +++..|+.+.+.  .+.+...+|.-   +.|..++|
T Consensus       575 ~~vWta~L~~GyvGCirdl~i-~G~s~di~q~ae~q~sagvkpsCs~~~~~~C~~nPC  631 (1591)
T KOG3514|consen  575 SEVWTAALRKGYVGCIRDLFI-DGVSTDIRQEAEAQNSAGVKPSCSLSNEKICESNPC  631 (1591)
T ss_pred             HHHHHHHHhccchheehhhee-cceehhhHHHhhhccccccCcccchhhccccCCCcc
Confidence               344567899999999999 899999877654  33344444432   24666665


No 4  
>KOG3514|consensus
Probab=100.00  E-value=2e-47  Score=399.32  Aligned_cols=343  Identities=24%  Similarity=0.480  Sum_probs=278.6

Q ss_pred             eeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE--eeccccCCCcEEEEEEEECC-EEE
Q psy5025         107 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR--SSNPISLGEWHKLRLTRTGR-HAY  183 (533)
Q Consensus       107 ~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~--s~~~lnDg~Wh~V~v~~~~~-~~~  183 (533)
                      ...|-|+|+|++++|+|+|.+...   +||++|+|.+|+|.+.|++|+++..++  +...+||++||.|.|.|... .-.
T Consensus       860 s~~l~Fqfkt~sp~gll~fn~gd~---ndfi~velvnG~ihYtfdlg~gp~~~k~~sr~hlnDnrWHnV~I~rd~~~~Ht  936 (1591)
T KOG3514|consen  860 SMHLFFQFKTTSPDGLLLFNSGDG---NDFIAVELVNGYIHYTFDLGNGPTSMKGPSRQHLNDNRWHNVLIYRDKTNTHT  936 (1591)
T ss_pred             EEEEEEEEeecCCCeEEEecCCCC---CceEEEEEeCcEEEEEEEcCCCcccccCcccCcCccccceeEEEEcCCCCceE
Confidence            456778899999999999987653   899999999999999999999987776  45889999999999999865 458


Q ss_pred             EEEcCccCceeeCCCCceeeecCCCEEEcCcCCCC--ccCCCcccccCceeeeeeEEECcEEcccccccccCC-C-ccCC
Q psy5025         184 LQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYN--IVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQGV-N-VVNC  259 (533)
Q Consensus       184 L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~--~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~~-~-~~~C  259 (533)
                      |.||...... ...+ ...+++.+.+|+||+-...  .++.......+|.||+..+-+++...+++..++... . ..+|
T Consensus       937 L~vD~s~~t~-~~~g-~~~l~l~g~LyiGGv~k~m~~~~p~~~asR~g~~g~~~s~dl~~r~p~L~~~a~~~s~lv~~~~ 1014 (1591)
T KOG3514|consen  937 LKVDNSSTTQ-IIDG-AVNLDLKGKLYIGGVSKPMYSFLPKLVASRSGFQGCLASLDLGGRLPDLISDALFESGLVEVGC 1014 (1591)
T ss_pred             EEecCceEEE-EecC-ccccccccceecccccccccccccceeeccCCCCCCcCccCccccchhHHHHhhhhccceeeec
Confidence            9999865333 3333 6678888999999997654  334455567899999999999988766544433221 2 2356


Q ss_pred             --C-CCCCCCCC-CCCeeeecCCceEEecCC-CCCcCcccce-eeeeecCCCeeEEecCccccccccccceeeeeeeeee
Q psy5025         260 --A-HPCITQPC-GSGSCVPLYDGYKCVCTQ-SCKESHLTEV-MCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFR  333 (533)
Q Consensus       260 --~-~~C~~~pC-ngg~C~~~~~~~~C~C~~-g~~G~~c~~~-~~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~  333 (533)
                        | ..|..+-| |.|.|+..|+.|.|+|++ .|.|+.|... ..+-|..+.+.+.|..++.                .|
T Consensus      1015 sgpst~c~~~acanhG~c~q~w~~~~c~csmtS~~Gp~C~d~gtTYiFgk~gglI~YtwPpN----------------dR 1078 (1591)
T KOG3514|consen 1015 SGPSTTCSEDACANHGVCIQQWNGIACDCSMTSYSGPRCNDPGTTYIFGKSGGLITYTWPPN----------------DR 1078 (1591)
T ss_pred             cCCCcccchhhhhccceeeeeecceeeeccccccCCCccCCCceEEEECCCCceEEEecCCC----------------CC
Confidence              3 57999999 999999999999999998 7999999753 4578877777777766554                34


Q ss_pred             ecceeeEEEEEEEeCCCCeEEEEeccCC-CCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025         334 ITGYEFNIDMRIKTSSSSGLILWTGSSQ-SKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV  412 (533)
Q Consensus       334 ~~~~~~~i~~~~rt~~~~g~ll~~~~~~-~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~  412 (533)
                      ....+.++.+.|+|.+++|+|+.+.+.. ..+||+|+|..|++-+.||+|...+.+......+|||++|.|++.|     
T Consensus      1079 psTr~DrlAvGFsTtq~daVLvRVdSAsglgDYlqLhI~qG~igvvfNiGt~Diti~E~~~ivNDgkYHVVRFtR----- 1153 (1591)
T KOG3514|consen 1079 PSTRKDRLAVGFSTTQPDAVLVRVDSASGLGDYLQLHINQGKIGVVFNIGTDDITISEHNAIVNDGKYHVVRFTR----- 1153 (1591)
T ss_pred             CCcccceEEEEEEeccCceEEEEEeccCCCCceEEEEEeccEEEEEEeccCcccccccccccccCCceEEEEEEe-----
Confidence            5567789999999999999999986554 4899999999999999999998888887788999999999999999     


Q ss_pred             ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEec-CCcceeeccCCceEEcCCCCcccccccccccCc
Q psy5025         413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSIS-GGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSF  491 (533)
Q Consensus       413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~-~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f  491 (533)
                                                 .+..++|+||...+..... ......++....|-|||....         ..|
T Consensus      1154 ---------------------------~GGNATLQVD~wpV~e~yP~grqlTiFNtqa~I~IGGk~qG---------RpF 1197 (1591)
T KOG3514|consen 1154 ---------------------------SGGNATLQVDSWPVNERYPAGRQLTIFNSQARISIGGKFQG---------RPF 1197 (1591)
T ss_pred             ---------------------------cCCceEEEecccchhhccCCCceeEEeeccceEEecccccC---------Ccc
Confidence                                       9999999999988765433 344666788888999997632         379


Q ss_pred             eeeEEEEEEcCceEEeeccc
Q psy5025         492 KGWISNLILSNDYQVKLPLS  511 (533)
Q Consensus       492 ~GCi~~l~i~~~~~l~~~~~  511 (533)
                      +|=|+.|.+|+.+.|++...
T Consensus      1198 qGqiSGLyyNgLkVLdlAaE 1217 (1591)
T KOG3514|consen 1198 QGQISGLYYNGLKVLDLAAE 1217 (1591)
T ss_pred             cceecceEEcceeeeehhhc
Confidence            99999999988788888654


No 5  
>KOG3516|consensus
Probab=100.00  E-value=1.8e-40  Score=354.23  Aligned_cols=459  Identities=21%  Similarity=0.352  Sum_probs=331.6

Q ss_pred             ccCCccccCCCceEEEEEEeCceeEEEEcCCCC------------------ceee---c--CCCC--CCCCCCCCCCCCC
Q psy5025           2 VKSDNQVILDDWNTVRVFRHRWDAWIQLNDGNK------------------IKGR---S--KECL--EDKCNKTPCEHGG   56 (533)
Q Consensus         2 ~~s~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~------------------~~G~---~--~~C~--~d~C~~~pC~Ngg   56 (533)
                      |.-.+.|+|++||.|.++|.++.+.|+||+.-.                  +.|.   +  ..-+  .-.|..|.-.|+.
T Consensus       257 is~GslLdD~hWHsV~i~r~~~~vnftvD~~~~~fr~~Ge~~~Ldld~e~~~GGiP~~~~~~~~~~nF~GCienly~N~v  336 (1306)
T KOG3516|consen  257 ISAGSLLDDQHWHSVRIERQGRQVNFTVDGVVHHFRATGEFDALDLDTEISFGGIPNDGKSVGFEKNFTGCLENLYYNGV  336 (1306)
T ss_pred             eecccccCCCcceEEEEEecCcEEEEEEccceEeecccCccceeecceEEEECCccCCCcccceeeeeeeeeeeeeecCc
Confidence            445678999999999999999999999999544                  1111   0  0000  2346666666766


Q ss_pred             eeeeCCCCce-ecCCCCCCCCCccccccccccccCCc-eeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCC
Q psy5025          57 KCITTPGSSY-CLCPLGFTGDLCQTRLDLQVPQFNGS-SYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVG  134 (533)
Q Consensus        57 tC~~~~~~y~-C~C~~g~~G~~Ce~~~~~~~~~F~g~-syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~  134 (533)
                      .=+++.-.-. =.=..|-.--.|+....+ ...|.+. +|+++|......  ...++|+|||...+|+|++......  .
T Consensus       337 diidLa~~~~~~~~~~gnv~f~C~~P~~~-pvtF~~sss~~~lpg~~~~~--~l~vSF~FRtw~~~G~ll~~~~~e~--~  411 (1306)
T KOG3516|consen  337 DIIDLAKRRKSQISAMGNVSFSCSDPQII-PVTFGNSSSYLRLPGNPNPD--RLSVSFQFRTWNKTGLLLFSELKEG--S  411 (1306)
T ss_pred             eeEeeecccccceecccceeEeccCCCCC-CeEecccceeEEcCCCCCCC--ceeeEEEEEeccccCceeeeeeccC--C
Confidence            6555431110 000001122245554332 3356544 599987655443  7899999999999999999876543  6


Q ss_pred             CeEEEEEECCEEEEEEEc-CCCceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcC
Q psy5025         135 DFIALYLNDRYVDFTFDL-GTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGG  213 (533)
Q Consensus       135 ~fl~l~L~~G~l~~~~~~-g~~~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG  213 (533)
                      ..+.|.|++|++.+.+.. +.....+.....+|||+||.|.+.+..+.+.+.||+......... ....+......|+||
T Consensus       412 g~v~~fl~eg~~~~~i~~~~r~~~~~~~g~~lnDG~WHsv~~~ak~n~~~~~iDd~~~~~~~~~-~p~~V~tg~tY~fgg  490 (1306)
T KOG3516|consen  412 GEVLLFLKEGKKFLQITQIGRSKADAYAGLKLNDGAWHSVSFNAKKNRLVLMIDDGEAEIAPDS-KPLQVYTGTTYYFGG  490 (1306)
T ss_pred             ceEEEEEeCCeEEEEEeccccchhhhcccccCCCCceEEEEEEeecceeEEEEcCcccccccCC-ccEEEEeCCeeEecc
Confidence            899999999998887764 222344556688999999999999999999999999654322221 123344456899999


Q ss_pred             cCCCCccCCCcccccCceeeeeeEEECcEEccccccccc-------CCCccCC--CCCCCCCCC-CCCeeeecCCceEEe
Q psy5025         214 VPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQ-------GVNVVNC--AHPCITQPC-GSGSCVPLYDGYKCV  283 (533)
Q Consensus       214 ~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~-------~~~~~~C--~~~C~~~pC-ngg~C~~~~~~~~C~  283 (533)
                      .|.....-.-.....+|.||++-++++++..++.. ...       .+.+..|  .+.|.+||| +||.|...|+.|.|.
T Consensus       491 ~~~~~~~~~~~~~~~~f~GCmrli~vd~~~~~l~~-v~~~~~g~~~~v~id~C~i~drClPN~CehgG~C~Qs~~~f~C~  569 (1306)
T KOG3516|consen  491 CPDKFNSWQCASPIKGFQGCMRLIKVDGQLKDLID-VKQGSLGNFSDVQIDMCGISDRCLPNPCEHGGKCSQSWDDFECN  569 (1306)
T ss_pred             ccccccchhhccccccccceeEEEEECCeEeeeee-eeccccccccceeecccccccccCCccccCCCcccccccceeEe
Confidence            98774322222345789999999999999877322 111       1224567  589999999 999999999999999


Q ss_pred             cC-CCCCcCcccce------------------------------------------------------------------
Q psy5025         284 CT-QSCKESHLTEV------------------------------------------------------------------  296 (533)
Q Consensus       284 C~-~g~~G~~c~~~------------------------------------------------------------------  296 (533)
                      |. .||.|..|...                                                                  
T Consensus       570 C~~TGY~GatCHtsi~e~SCeay~~~~~t~~~~~iD~DGsGpl~Pl~v~C~~~ed~awTvv~H~~~~~t~V~g~n~~g~~  649 (1306)
T KOG3516|consen  570 CELTGYKGATCHTSIYELSCEAYKNIGQTSGNFLIDSDGSGPLEPLQVYCNITEDRAWTVVQHDNLGTTRVRGSNPEGPV  649 (1306)
T ss_pred             ccccccccccccCCCcchhhHHhhhhccccceEEEccCCCCcccceEEEEecccCceEEEEEeCCccceEEeccCCCCce
Confidence            99 47777211100                                                                  


Q ss_pred             --------------------------------------------------------------------------------
Q psy5025         297 --------------------------------------------------------------------------------  296 (533)
Q Consensus       297 --------------------------------------------------------------------------------  296 (533)
                                                                                                      
T Consensus       650 ~~s~~y~as~eQ~~al~n~se~CeQ~i~y~C~~sRllnt~~g~P~Swwigr~ne~~~yWGGs~Pg~qkC~Cgi~~nC~d~  729 (1306)
T KOG3516|consen  650 AISLFYAASMEQLQALLNRSEHCEQEIEYSCRESRLLNTPDGTPFSWWIGRSNEGHVYWGGSGPGLQKCECGLLGNCLDP  729 (1306)
T ss_pred             eEeeehhccHHHHHHHhhhhhhhheeeeeeeccceeeeCCCCCeeEEEecccCCccceecCCCCccceeeccccccccCc
Confidence                                                                                            


Q ss_pred             -----------------------------------------------------------eeeeecCCCeeEEecCccccc
Q psy5025         297 -----------------------------------------------------------MCASFDHGASFLHYLDPEIIH  317 (533)
Q Consensus       297 -----------------------------------------------------------~~~~f~~~~s~~~~~~~~~~~  317 (533)
                                                                                 .+++|..+++|++|+.-..  
T Consensus       730 ~~~CNCDa~~~ewt~Dtg~l~~k~hLPVt~vv~gdTg~~~sea~~~lgPLrC~gDr~~wnsvSF~~~~syL~fp~f~~--  807 (1306)
T KOG3516|consen  730 QLYCNCDADEKEWTTDTGCLAYKDHLPVTQVVIGDTGRSQSEAPYVLGPLRCEGDRNFWNSVSFHTGASYLHFPPFHN--  807 (1306)
T ss_pred             ceeeeccCCCccccccccccchhhcCCeeEEEEccCCCcccccceeecceEeecccccccceEeecCcceeecCcccC--
Confidence                                                                       1177766777888875433  


Q ss_pred             cccccceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEEC-CEEEEEEEcCCceEEEee-Cceee
Q psy5025         318 SSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHN-GYAHVMYNLGNGEVLLEN-NQTKI  395 (533)
Q Consensus       318 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~-g~l~~~~~~g~~~~~~~~-~~~~~  395 (533)
                                        .....|+|.|||+.++|++|.....  .+|+.|+|.. -.+.+.++.|+++..+.. +++.+
T Consensus       808 ------------------~~saDIsf~FrTt~~~gvflen~g~--~dfir~eL~~~~~vtf~~dvgnGp~~~~V~s~t~~  867 (1306)
T KOG3516|consen  808 ------------------ELSADISFFFRTTASSGVFLENHGI--NDFIRLELSSPVEVTFAFDVGNGPSQLTVRSPTEL  867 (1306)
T ss_pred             ------------------cccccEEEEEEecCCceEeeeccCC--CceEEEEEcCCCceEEEEEcCCCceeEEEcCCccc
Confidence                              3357899999999999999998654  8999999984 479999999998765554 55799


Q ss_pred             cCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecC-CcceeeccCCceEEc
Q psy5025         396 NNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISG-GKFEQLNTDSGLYVG  474 (533)
Q Consensus       396 ~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~-~~~~~l~~~~~l~lG  474 (533)
                      ||++||+|++.|                                +.+...|+||+.......++ ....-|...+.+|+|
T Consensus       868 nD~qWH~V~~Er--------------------------------n~K~a~LqVD~~~~~~r~sp~~~~~~L~l~s~l~vG  915 (1306)
T KOG3516|consen  868 NDNQWHQVRAER--------------------------------NSKEASLQVDGLPKSIRTSPIPGTRLLQLYSSLFVG  915 (1306)
T ss_pred             CCCceEEEEEEe--------------------------------ccccceEEEcCcccceecCCCCCEEEEEeccceecc
Confidence            999999999999                                99999999999876554443 334556778889999


Q ss_pred             CCCCcccccccccccCceeeEEEEEEcCceEEeecccCc----CCCCcccceeeecCCce
Q psy5025         475 GVEDIEELTRSKYHHSFKGWISNLILSNDYQVKLPLSRS----QGSSCQLTCETLNTGQV  530 (533)
Q Consensus       475 g~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~~~~~~~----~~~~c~~~~~~~~~~~~  530 (533)
                      |...        ...+|.||||.|++ |+..|||...+.    ...+|.-.|++.+|.++
T Consensus       916 gt~~--------~~~gF~GCIRsl~L-NGv~ldLe~ra~~~~gv~~GC~GhCss~~C~NG  966 (1306)
T KOG3516|consen  916 GTVS--------RQRGFLGCIRSLQL-NGVMLDLEYRAYGTAGVSPGCEGHCSSYPCLNG  966 (1306)
T ss_pred             cccc--------CcCcceeeeeeeee-cceeeeehhhhccCCcccCCCccccccccccCC
Confidence            9553        34699999999999 899999976653    33589999999987654


No 6  
>KOG4289|consensus
Probab=100.00  E-value=8.8e-33  Score=295.53  Aligned_cols=272  Identities=25%  Similarity=0.515  Sum_probs=210.8

Q ss_pred             CccccCCCceEEEEEEeCceeEEEEcCCCC--------------------------------------------------
Q psy5025           5 DNQVILDDWNTVRVFRHRWDAWIQLNDGNK--------------------------------------------------   34 (533)
Q Consensus         5 ~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~--------------------------------------------------   34 (533)
                      ++.|+||+||+|.++...|.+.++||++++                                                  
T Consensus      1392 p~gvsDGqWHtV~l~YyNK~av~svDdCdt~~al~fg~~gNCAa~g~q~~sKKsLDltgpLlLGGvPe~fpv~~k~FvGC 1471 (2531)
T KOG4289|consen 1392 PGGVSDGQWHTVQLEYYNKVAVVSVDDCDTNVALRFGTIGNCAAQGTQTGSKKSLDLTGPLLLGGVPETFPVIEKQFVGC 1471 (2531)
T ss_pred             CCCcccCceeEEEEEEeceEEEEEeccccccceeeecCccchHhhhhccCcceeeeccCceeecCCCCcchhhHhHhhhh
Confidence            568999999999999999999999999988                                                  


Q ss_pred             ------------------ceeecCCCC--CCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccccccccccccCCcee
Q psy5025          35 ------------------IKGRSKECL--EDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDLQVPQFNGSSY   94 (533)
Q Consensus        35 ------------------~~G~~~~C~--~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~sy   94 (533)
                                        ..|...+|.  .+.|+..+|+|+|+|++.|++|.|.||.+|.|+.|++.... .-.|.|.|-
T Consensus      1472 mrdLsvD~~~VDma~fianngt~eGC~ark~fCdsg~C~n~g~CvnrWg~~~C~CP~~fggk~c~~~m~~-pq~frG~sl 1550 (2531)
T KOG4289|consen 1472 MRDLSVDGRDVDMATFIANNGTHEGCKARKNFCDSGQCSNGGTCVNRWGGFSCECPLGFGGKGCCQGMAH-PQHFRGHSL 1550 (2531)
T ss_pred             hhhcccccccccHHHHHhhcCcccCchhhhcccCCCccCCCCeeecccCcEeecCccccCCcchhhccCC-chhccccce
Confidence                              122256888  68999999999999999999999999999999999987431 236999999


Q ss_pred             EEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE-eeccccCCCcEEE
Q psy5025          95 LRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR-SSNPISLGEWHKL  173 (533)
Q Consensus        95 l~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~-s~~~lnDg~Wh~V  173 (533)
                      +.+...+...+..+.++|.|||++.||+||.+....   ..-+.|+|.+|.|.+.+..    ..+. ....++||+||++
T Consensus      1551 ~sw~~~~~~vSvPwylsl~FRTr~ad~vl~~~~~~~---rst~~lqld~g~l~~~v~~----s~v~L~~~~vtdg~Wh~~ 1623 (2531)
T KOG4289|consen 1551 VSWEGLPSQVSVPWYLSLMFRTRRADGVLMQAEFGG---RSTYNLQLDDGTLKYNVGD----SSVELPAPRVTDGHWHHL 1623 (2531)
T ss_pred             eeecCCCcceecceEEEEEEEeeccccEEEEEEeCC---CceEEEEEcCCEEEEEecC----ceEEccCccccCCchhhe
Confidence            999877766667899999999999999999986543   4559999999999987743    3333 3578999999999


Q ss_pred             EEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEccccccccc-
Q psy5025         174 RLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQ-  252 (533)
Q Consensus       174 ~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~-  252 (533)
                      .|+.... ..+.+|...-+.....+ ...+.+ ..+|+||.|...       ...+|.|||+++.++|..+. .++... 
T Consensus      1624 ~i~l~~d-~~~t~d~g~~~aea~~g-l~gl~l-~sl~vGgap~~g-------~p~gf~GCiqgV~v~g~~~l-~~~kv~~ 1692 (2531)
T KOG4289|consen 1624 VIELEAD-SVATLDYGIYQAEAKAG-LSGLNL-ESLYVGGAPATG-------VPRGFRGCIQGVRVGGVSIL-VPKKVNV 1692 (2531)
T ss_pred             eeeeccC-eEEEEechhhhhhhhcC-CCCcee-eEEEEccccCCC-------ccccchhhhhceEECCEeec-ccccccc
Confidence            9999875 45566653322211111 334444 369999999442       34899999999999998864 221110 


Q ss_pred             --CCCc--------------cCCCCCCCCCCC-CCCeeeecC--CceEEecCCCCCcCcccc
Q psy5025         253 --GVNV--------------VNCAHPCITQPC-GSGSCVPLY--DGYKCVCTQSCKESHLTE  295 (533)
Q Consensus       253 --~~~~--------------~~C~~~C~~~pC-ngg~C~~~~--~~~~C~C~~g~~G~~c~~  295 (533)
                        +-.+              ..|-+.|.-+|| |.|+|+...  .+|+|+|++||.|..|+.
T Consensus      1693 ~~GCvvpn~C~~d~sC~c~~~~C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~ 1754 (2531)
T KOG4289|consen 1693 EAGCVVPNPCSVDSSCPCDPYNCVDVCSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCEL 1754 (2531)
T ss_pred             ccCcccCCccccCCcccCCCCCccchhcccccccCceeecCCCCCceeEECCCcccCcchhh
Confidence              0000              123356778999 999999876  579999999999998875


No 7  
>KOG1219|consensus
Probab=99.95  E-value=2e-27  Score=262.48  Aligned_cols=245  Identities=27%  Similarity=0.613  Sum_probs=198.9

Q ss_pred             CCC--CCCCCCCCCCCCCeeeeCC--CCceecCCCCCCCCCccccccccccccCCceeEEEcCCcccccceeEEEEEEEe
Q psy5025          41 ECL--EDKCNKTPCEHGGKCITTP--GSSYCLCPLGFTGDLCQTRLDLQVPQFNGSSYLRYQGLGEQYLSLLDLTIVFKA  116 (533)
Q Consensus        41 ~C~--~d~C~~~pC~NggtC~~~~--~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~syl~~~~~~~~~~~~~~ls~~FrT  116 (533)
                      .|+  .+.|...||+.+..|+...  ..|.|.||.|..| .|+...   ..++.|+||++|....... ..+.+.|+.||
T Consensus      3641 ~Cp~~~~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g-~C~g~~---elS~tGnSYveyrlse~~n-~~~kl~frLkT 3715 (4289)
T KOG1219|consen 3641 FCPVETNQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFG-FCQGDF---ELSSTGNSYVEYRLSENQN-TRMKLGFRLKT 3715 (4289)
T ss_pred             cCCcccCccccCCCcccCcccccccccceeEeccCcccc-cCCCcc---eEeecCceeEEEEcccccc-cceEEEEEEEe
Confidence            476  6899999999999999886  7899999998555 488774   5789999999999755443 34899999999


Q ss_pred             cCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEee-ccccCCCcEEEEEEEECCEEEEEEcCccCceee
Q psy5025         117 IEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWHKLRLTRTGRHAYLQVDRFPSSQIL  195 (533)
Q Consensus       117 ~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~-~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~  195 (533)
                      ..++|+++|...     .++..|.|.+|.+.+.++.|+++-.+.+. ..++||+||.|.+.+.++++.|.||+....+..
T Consensus      3716 ~~sngIiM~tr~-----~d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsialerrr~~irlsvDd~~~~~at 3790 (4289)
T KOG1219|consen 3716 LQSNGIIMYTRK-----TDLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIALERRRNHIRLSVDDDTYDSAT 3790 (4289)
T ss_pred             cccCcEEEEEcC-----CceEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEeeccCCceEEEEcccCceeee
Confidence            999999999852     79999999999999999999988666665 789999999999999999999999998888888


Q ss_pred             CCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEccccccc------ccCCC-ccCC---------
Q psy5025         196 SPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEA------IQGVN-VVNC---------  259 (533)
Q Consensus       196 ~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~------~~~~~-~~~C---------  259 (533)
                      .|+....++.+..||+|+.-..... .......+|.|||+.+.+|+..+++..+.      +.... ..+|         
T Consensus      3791 vPg~~~tln~d~hiy~Ga~vrlr~~-~~tqvs~Gf~GCldsiyLng~el~l~~k~~s~a~~~el~~l~pgC~l~~d~C~~ 3869 (4289)
T KOG1219|consen 3791 VPGMKSTLNLDTHIYLGALVRLRHQ-RSTQVSYGFDGCLDSIYLNGMELPLTRKGKSVAGLMELFGLQPGCSLLTDPCND 3869 (4289)
T ss_pred             cccceeeccccceEEEeeEeeeccC-CCccccccccceeeeEEEccccccccCCCchhhhhhhhhccccccccccccccc
Confidence            9998888988899999998752211 22245689999999999999998864332      11000 1122         


Q ss_pred             ---------------------C------------CCCCCCCC-CCCeeeecCCceEEecCCCCCcCcccce
Q psy5025         260 ---------------------A------------HPCITQPC-GSGSCVPLYDGYKCVCTQSCKESHLTEV  296 (533)
Q Consensus       260 ---------------------~------------~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~~c~~~  296 (533)
                                           |            .+|.++|| +||+|+...++|.|.||.||+|.+|+..
T Consensus      3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~~ 3940 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEAR 3940 (4289)
T ss_pred             CcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeecc
Confidence                                 1            26888888 8888888888888888888888888764


No 8  
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.90  E-value=9.2e-23  Score=178.06  Aligned_cols=129  Identities=35%  Similarity=0.595  Sum_probs=106.8

Q ss_pred             EEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCc
Q psy5025         345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKS  424 (533)
Q Consensus       345 ~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~  424 (533)
                      |||.+++|+|||.+.....+|++|+|++|+|+++++.|+++..+... .+++||+||+|.+.|                 
T Consensus         1 frT~~~~Gllly~g~~~~~dfial~L~~G~l~~~~~~G~~~~~~~~~-~~i~dg~wh~v~~~r-----------------   62 (131)
T PF00054_consen    1 FRTSEPNGLLLYLGSKDGKDFIALELRDGRLEFRYNLGSGPASLRSP-QKINDGKWHTVSVSR-----------------   62 (131)
T ss_dssp             EEESSSSEEEEEEESSTTSSEEEEEEETTEEEEEEESSSEEEEEEES-SETTSSSEEEEEEEE-----------------
T ss_pred             CccCCCCceEEECCcCCCCCEEEEEEECCEEEEEEeCCCccceecCC-CccCCCcceEEEEEE-----------------
Confidence            89999999999998777789999999999999999999998777744 459999999999999                 


Q ss_pred             chhhhhhhhhccccccCcEEEEEECCceeeeEecCCccee-eccCCceEEcCCCCc-ccccccccccCceeeEEEEEEcC
Q psy5025         425 SKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQ-LNTDSGLYVGGVEDI-EELTRSKYHHSFKGWISNLILSN  502 (533)
Q Consensus       425 ~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~-l~~~~~l~lGg~~~~-~~~~~~~~~~~f~GCi~~l~i~~  502 (533)
                                     .++.+.|+||+.......++..... ++.+..|||||+|.. ..........+|+|||+++.| +
T Consensus        63 ---------------~~~~~~L~Vd~~~~~~~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~i-n  126 (131)
T PF00054_consen   63 ---------------NGRNGSLSVDGEEVVTGESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSI-N  126 (131)
T ss_dssp             ---------------ETTEEEEEETTSEEEEEEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEE-T
T ss_pred             ---------------cCcEEEEEECCccceeeecCCccccccccccCEEEccCCchhhcccccccCCCeeEEEEEeEE-C
Confidence                           9999999999998855555544334 788888999999933 333445577899999999999 7


Q ss_pred             ceEEe
Q psy5025         503 DYQVK  507 (533)
Q Consensus       503 ~~~l~  507 (533)
                      +.++|
T Consensus       127 ~~~ld  131 (131)
T PF00054_consen  127 GKPLD  131 (131)
T ss_dssp             TEEC-
T ss_pred             CEECc
Confidence            77765


No 9  
>PF00054 Laminin_G_1:  Laminin G domain;  InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=99.89  E-value=3.8e-22  Score=174.14  Aligned_cols=130  Identities=40%  Similarity=0.704  Sum_probs=108.8

Q ss_pred             EEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCce
Q psy5025         114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQ  193 (533)
Q Consensus       114 FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~  193 (533)
                      |||..++|+|||.+....  .||++|+|.+|+|+++++.|.++..+.++..++||+||+|.+.|....+.|.||+.....
T Consensus         1 frT~~~~Gllly~g~~~~--~dfial~L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~~~   78 (131)
T PF00054_consen    1 FRTSEPNGLLLYLGSKDG--KDFIALELRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGEEVVT   78 (131)
T ss_dssp             EEESSSSEEEEEEESSTT--SSEEEEEEETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTSEEEE
T ss_pred             CccCCCCceEEECCcCCC--CCEEEEEEECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCcccee
Confidence            899999999999987654  599999999999999999999988888888899999999999999999999999977634


Q ss_pred             eeCCCCcee-eecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025         194 ILSPGPFTQ-LSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN  245 (533)
Q Consensus       194 ~~~~~~~~~-l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~  245 (533)
                      ...+..... ++....+||||+|.....+.......+|.|||+++.+|+++++
T Consensus        79 ~~s~~~~~~~l~~~~~lyvGG~p~~~~~~~~~~~~~~f~GCi~~~~in~~~ld  131 (131)
T PF00054_consen   79 GESPSGATQSLDVDGPLYVGGLPSSSSRPRPLPISPGFKGCIRNLSINGKPLD  131 (131)
T ss_dssp             EEECSSSSSSCEECSEEEESSSSTTTGCGSSCSCCSB-EEEEEEEEETTEEC-
T ss_pred             eecCCccccccccccCEEEccCCchhhcccccccCCCeeEEEEEeEECCEECc
Confidence            444444444 7777889999999444444455667899999999999999875


No 10 
>smart00282 LamG Laminin G domain.
Probab=99.88  E-value=1.6e-21  Score=171.60  Aligned_cols=132  Identities=33%  Similarity=0.628  Sum_probs=110.6

Q ss_pred             eeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCc
Q psy5025         338 EFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSS  417 (533)
Q Consensus       338 ~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~  417 (533)
                      .++|+|.|||.+++|+||+..+....+|+.|+|.+|++++.++.+++...+......++||+||+|.+.+          
T Consensus         2 ~~~i~~~frt~~~~g~l~~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~----------   71 (135)
T smart00282        2 RLSISFSFRTTSPNGLLLYAGSKNGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVER----------   71 (135)
T ss_pred             ceEEEEEEEeCCCCEEEEEeCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEE----------
Confidence            3689999999999999999976445899999999999999999988777766555899999999999999          


Q ss_pred             ccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEE
Q psy5025         418 HTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISN  497 (533)
Q Consensus       418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~  497 (533)
                                            .++.+.|+||+........++....++....+||||+|+...........+|+|||++
T Consensus        72 ----------------------~~~~~~l~VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~~~~~~~~~F~GCi~~  129 (135)
T smart00282       72 ----------------------NGRRVTLSVDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLLVTPGFRGCIRN  129 (135)
T ss_pred             ----------------------eCCEEEEEECCCccccEECCCCceEEecCCCcEEccCCchhcccccccCCCCeeEeeE
Confidence                                  8999999999976554444444566777788999999987544344567899999999


Q ss_pred             EEEc
Q psy5025         498 LILS  501 (533)
Q Consensus       498 l~i~  501 (533)
                      |+||
T Consensus       130 v~in  133 (135)
T smart00282      130 LKVN  133 (135)
T ss_pred             EEEC
Confidence            9994


No 11 
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.87  E-value=6.7e-21  Score=170.74  Aligned_cols=150  Identities=31%  Similarity=0.558  Sum_probs=119.9

Q ss_pred             eeecCCCeeEEecCccccccccccceeeeeeeeeeecceeeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEE
Q psy5025         299 ASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVM  378 (533)
Q Consensus       299 ~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~  378 (533)
                      +.| .+.+|+.++.+...                   ...++|+|+|||.+++|+||+.+.....+|+.|+|.+|++++.
T Consensus         2 ~~F-~g~~~i~~~~~~~~-------------------~~~~~i~~~frt~~~~g~l~~~~~~~~~~~~~l~l~~g~l~~~   61 (151)
T cd00110           2 VSF-SGSSYVRLPTLPAP-------------------RTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLR   61 (151)
T ss_pred             eEe-CCCceEEecCCCCC-------------------cceeEEEEEEEeCCCCeEEEEecCCCCCCEEEEEEECCEEEEE
Confidence            567 66799999976541                   4568999999999999999999876568999999999999999


Q ss_pred             EEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEec
Q psy5025         379 YNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSIS  458 (533)
Q Consensus       379 ~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~  458 (533)
                      ++.+.....+. ...+++||+||+|.+.+                                .++.+.|+||+........
T Consensus        62 ~~~g~~~~~~~-~~~~v~dg~Wh~v~i~~--------------------------------~~~~~~l~VD~~~~~~~~~  108 (151)
T cd00110          62 YDLGSGSLVLS-SKTPLNDGQWHSVSVER--------------------------------NGRSVTLSVDGERVVESGS  108 (151)
T ss_pred             EcCCcccEEEE-ccCccCCCCEEEEEEEE--------------------------------CCCEEEEEECCccEEeeeC
Confidence            99985555555 33479999999999999                                8899999999985444433


Q ss_pred             CCcceeeccCCceEEcCCCCcccccccccccCceeeEEEEEEc
Q psy5025         459 GGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILS  501 (533)
Q Consensus       459 ~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~  501 (533)
                      +.....+.....+||||+|............+|+|||++|+||
T Consensus       109 ~~~~~~~~~~~~~~iGg~~~~~~~~~~~~~~~F~Gci~~v~in  151 (151)
T cd00110         109 PGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN  151 (151)
T ss_pred             CCCceeecCCCCeEEcCCCCchhcccccccCCCceEeeEeEeC
Confidence            3332246677889999999865433345678999999999983


No 12 
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=99.85  E-value=5.1e-20  Score=165.01  Aligned_cols=150  Identities=37%  Similarity=0.691  Sum_probs=120.5

Q ss_pred             cccCCceeEEEcCCcccccceeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeecccc
Q psy5025          87 PQFNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPIS  166 (533)
Q Consensus        87 ~~F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~ln  166 (533)
                      ..|+|.+|+.++.+... ...++|+|+|||.+++|+||+.+...  ..+|+.|+|.+|+|.+.++.+.....+.+...++
T Consensus         2 ~~F~g~~~i~~~~~~~~-~~~~~i~~~frt~~~~g~l~~~~~~~--~~~~~~l~l~~g~l~~~~~~g~~~~~~~~~~~v~   78 (151)
T cd00110           2 VSFSGSSYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQN--GGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLN   78 (151)
T ss_pred             eEeCCCceEEecCCCCC-cceeEEEEEEEeCCCCeEEEEecCCC--CCCEEEEEEECCEEEEEEcCCcccEEEEccCccC
Confidence            46899999999986654 45799999999999999999998753  3899999999999999999986666777666899


Q ss_pred             CCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEEC
Q psy5025         167 LGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN  240 (533)
Q Consensus       167 Dg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in  240 (533)
                      ||+||+|.+.+.++.+.|.||+........+.....+.....+||||.|..... .......+|.|||+++++|
T Consensus        79 dg~Wh~v~i~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~-~~~~~~~~F~Gci~~v~in  151 (151)
T cd00110          79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKS-PGLPVSPGFVGCIRDLKVN  151 (151)
T ss_pred             CCCEEEEEEEECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhc-ccccccCCCceEeeEeEeC
Confidence            999999999999999999999974333333322223556688999999976432 2234568999999999987


No 13 
>smart00282 LamG Laminin G domain.
Probab=99.84  E-value=9.9e-20  Score=160.10  Aligned_cols=133  Identities=38%  Similarity=0.719  Sum_probs=111.4

Q ss_pred             eeEEEEEEEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEee-ccccCCCcEEEEEEEECCEEEEE
Q psy5025         107 LLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWHKLRLTRTGRHAYLQ  185 (533)
Q Consensus       107 ~~~ls~~FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~-~~lnDg~Wh~V~v~~~~~~~~L~  185 (533)
                      .++|+|+|||.+++|+|||.++..  ..+|+.|+|.+|+|.+.++.+++...+... ..++||+||+|.+.+.++.+.|.
T Consensus         2 ~~~i~~~frt~~~~g~l~~~~~~~--~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~WH~v~i~~~~~~~~l~   79 (135)
T smart00282        2 RLSISFSFRTTSPNGLLLYAGSKN--GGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRRVTLS   79 (135)
T ss_pred             ceEEEEEEEeCCCCEEEEEeCCCC--CCCEEEEEEECCEEEEEEECCCCCEEEEECCeEeCCCCEEEEEEEEeCCEEEEE
Confidence            468999999999999999997642  389999999999999999998877777776 89999999999999999999999


Q ss_pred             EcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcE
Q psy5025         186 VDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNH  242 (533)
Q Consensus       186 VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~  242 (533)
                      ||+........++....++....+||||.|..... .......+|.|||+++++|+.
T Consensus        80 VD~~~~~~~~~~~~~~~l~~~~~l~iGG~p~~~~~-~~~~~~~~F~GCi~~v~in~~  135 (135)
T smart00282       80 VDGENPVSGESPGGLTILNLDGPLYLGGLPEDLKL-PPLLVTPGFRGCIRNLKVNGK  135 (135)
T ss_pred             ECCCccccEECCCCceEEecCCCcEEccCCchhcc-cccccCCCCeeEeeEEEECCC
Confidence            99976555555555566777789999999976432 223456899999999999973


No 14 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.82  E-value=3.1e-19  Score=154.97  Aligned_cols=126  Identities=29%  Similarity=0.570  Sum_probs=104.4

Q ss_pred             EEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCc
Q psy5025         345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKS  424 (533)
Q Consensus       345 ~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~  424 (533)
                      |||.+++|+|||.......+|+.|+|.+|+|++.++.++...........++||+||.|.+.+                 
T Consensus         1 Frt~~~~g~Ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~-----------------   63 (128)
T PF02210_consen    1 FRTRSPNGLLLYIGSEDNGDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISR-----------------   63 (128)
T ss_dssp             EEESSSSEEEEEEEESTTSEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEE-----------------
T ss_pred             CccCCCCEeEEEEcCCCCCEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEE-----------------
Confidence            899999999999987654579999999999999999996666666688889999999999999                 


Q ss_pred             chhhhhhhhhccccccCcEEEEEECCceeeeEecCCcce-eeccCCceEEcCCCCcccccccccccCceeeEEEEEEcCc
Q psy5025         425 SKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFE-QLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSND  503 (533)
Q Consensus       425 ~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~-~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~  503 (533)
                                     .+..++|.||+............. .++....+||||.|............+|+|||++|+| |+
T Consensus        64 ---------------~~~~~~l~Vd~~~~~~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~~~~f~Gci~~l~v-ng  127 (128)
T PF02210_consen   64 ---------------DGNRVTLTVDGQSVSSESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVDTPGFVGCIRDLRV-NG  127 (128)
T ss_dssp             ---------------ETTEEEEEETTSEEEEEESSSTTHHCBESEEEEEESSTTTTCTCTTSSTTSB-EEEEEEEEE-TT
T ss_pred             ---------------eeeeEEEEecCccceEEeccccceecccCCCCEEEecccCccccccccCCCCcEEEcCeEEE-CC
Confidence                           889999999999888766554433 5666667999999988654443348899999999999 54


No 15 
>KOG1219|consensus
Probab=99.81  E-value=5.6e-19  Score=196.73  Aligned_cols=195  Identities=23%  Similarity=0.451  Sum_probs=161.0

Q ss_pred             CCCCCCCC-CCCeeeec-C-CceEEecCCCCCcCcccceeeeeecCCCeeEEecCccccccccccceeeeeeeeeeecce
Q psy5025         261 HPCITQPC-GSGSCVPL-Y-DGYKCVCTQSCKESHLTEVMCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGY  337 (533)
Q Consensus       261 ~~C~~~pC-ngg~C~~~-~-~~~~C~C~~g~~G~~c~~~~~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  337 (533)
                      ++|...|| .+..|+.. | ..|.|.||.|-.|. |+.-...++ .|.||+++...+..                   ..
T Consensus      3647 ~~C~~~pcp~~~~Cvs~~~~~~~~cVcP~gr~g~-C~g~~elS~-tGnSYveyrlse~~-------------------n~ 3705 (4289)
T KOG1219|consen 3647 NQCAKSPCPAGNLCVSSVHNSTYTCVCPIGRFGF-CQGDFELSS-TGNSYVEYRLSENQ-------------------NT 3705 (4289)
T ss_pred             CccccCCCcccCcccccccccceeEeccCccccc-CCCcceEee-cCceeEEEEccccc-------------------cc
Confidence            68999999 88999976 4 45999999985554 554456777 89999999987653                   23


Q ss_pred             eeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCc
Q psy5025         338 EFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSS  417 (533)
Q Consensus       338 ~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~  417 (533)
                      .+.+.|++||.+++|+++|+..   .++..|.|.+|.+.+.++-|+++-.+....+.++||+||.|.+.|          
T Consensus      3706 ~~kl~frLkT~~sngIiM~tr~---~d~~iLkLv~G~~~l~~~cgsG~Givg~q~~~VnDgqWHsialer---------- 3772 (4289)
T KOG1219|consen 3706 RMKLGFRLKTLQSNGIIMYTRK---TDLAILKLVGGSPQLLADCGSGPGIVGSQKRTVNDGQWHSIALER---------- 3772 (4289)
T ss_pred             ceEEEEEEEecccCcEEEEEcC---CceEEEEecCCcEEEEEecCCCCCcccccceEeecCceeEEEeec----------
Confidence            4889999999999999999862   689999999999999999999886666455899999999999999          


Q ss_pred             ccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEE
Q psy5025         418 HTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISN  497 (533)
Q Consensus       418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~  497 (533)
                                            .++.++|.||+........++....++.+..||+||+-.....+.-....+|.|||..
T Consensus      3773 ----------------------rr~~irlsvDd~~~~~atvPg~~~tln~d~hiy~Ga~vrlr~~~~tqvs~Gf~GClds 3830 (4289)
T KOG1219|consen 3773 ----------------------RRNHIRLSVDDDTYDSATVPGMKSTLNLDTHIYLGALVRLRHQRSTQVSYGFDGCLDS 3830 (4289)
T ss_pred             ----------------------cCCceEEEEcccCceeeecccceeeccccceEEEeeEeeeccCCCccccccccceeee
Confidence                                  8899999999999888888888888998999999998762221222467899999999


Q ss_pred             EEEcCceEEeecccC
Q psy5025         498 LILSNDYQVKLPLSR  512 (533)
Q Consensus       498 l~i~~~~~l~~~~~~  512 (533)
                      +.+ |+..|++....
T Consensus      3831 iyL-ng~el~l~~k~ 3844 (4289)
T KOG1219|consen 3831 IYL-NGMELPLTRKG 3844 (4289)
T ss_pred             EEE-ccccccccCCC
Confidence            999 77777775543


No 16 
>PF02210 Laminin_G_2:  Laminin G domain;  InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin [].  Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=99.77  E-value=9.4e-18  Score=145.66  Aligned_cols=126  Identities=35%  Similarity=0.635  Sum_probs=102.4

Q ss_pred             EEecCCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCc-eEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCc
Q psy5025         114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-ATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSS  192 (533)
Q Consensus       114 FrT~~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~-~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~  192 (533)
                      |||++++|+|||.+....  .+|+.|+|.+|+|++.++.+... .....+..++||+||.|.+.+....+.|.||+....
T Consensus         1 Frt~~~~g~Ll~~~~~~~--~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wh~v~i~~~~~~~~l~Vd~~~~~   78 (128)
T PF02210_consen    1 FRTRSPNGLLLYIGSEDN--GDFLSLELVDGRLVVRYNLGGSEIVTTFSNSNLNDGQWHKVSISRDGNRVTLTVDGQSVS   78 (128)
T ss_dssp             EEESSSSEEEEEEEESTT--SEEEEEEEETTEEEEEEESSSSEEEEEECSSSSTSSSEEEEEEEEETTEEEEEETTSEEE
T ss_pred             CccCCCCEeEEEEcCCCC--CEEEEEEEECCEEEEEEEccccceeeeccCccccccceeEEEEEEeeeeEEEEecCccce
Confidence            899999999999988753  68999999999999999999433 445567889999999999999999999999998766


Q ss_pred             eeeCCCCce-eeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcE
Q psy5025         193 QILSPGPFT-QLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNH  242 (533)
Q Consensus       193 ~~~~~~~~~-~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~  242 (533)
                      ......... .++....+||||.|.....+.... ..+|.|||+++++|++
T Consensus        79 ~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~-~~~f~Gci~~l~vng~  128 (128)
T PF02210_consen   79 SESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVD-TPGFVGCIRDLRVNGQ  128 (128)
T ss_dssp             EEESSSTTHHCBESEEEEEESSTTTTCTCTTSST-TSB-EEEEEEEEETTE
T ss_pred             EEeccccceecccCCCCEEEecccCccccccccC-CCCcEEEcCeEEECCC
Confidence            655554443 555667799999998765333222 6899999999999985


No 17 
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=99.06  E-value=7.3e-09  Score=95.76  Aligned_cols=125  Identities=15%  Similarity=0.281  Sum_probs=88.7

Q ss_pred             ceeeEEEEEEEeC-CCCeEEEEeccCCCCCeEEEEEECCEEEEEEEc--CCc-eEEEeeCceeecCCCeEEEEEEEeeee
Q psy5025         336 GYEFNIDMRIKTS-SSSGLILWTGSSQSKDFLALGIHNGYAHVMYNL--GNG-EVLLENNQTKINNGVWHRLTVHRNFMV  411 (533)
Q Consensus       336 ~~~~~i~~~~rt~-~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~--g~~-~~~~~~~~~~~~dg~wH~v~~~~~~~~  411 (533)
                      ..++.|.+++|+. +..|.||...+.+...+|.|.+..+...+.+..  ..+ .....+....+.||+||+|.+..    
T Consensus        51 ~~~fsi~~~~r~~~~~~g~L~si~~~~~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal~V----  126 (184)
T smart00210       51 PEDFSLLTTFRQTPKSRGVLFAIYDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLALSV----  126 (184)
T ss_pred             CCCeEEEEEEEeCCCCCeEEEEEEcCCCcEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEEEE----
Confidence            4578999999997 678899988765567789998876664555442  222 33444455789999999999999    


Q ss_pred             eecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcc-eeeccCCceEEcCCCCcccccccccccC
Q psy5025         412 VMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKF-EQLNTDSGLYVGGVEDIEELTRSKYHHS  490 (533)
Q Consensus       412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~-~~l~~~~~l~lGg~~~~~~~~~~~~~~~  490 (533)
                                                  .+..++|+||............. ..+..++ ++++|....       ....
T Consensus       127 ----------------------------~~~~v~LyvDC~~~~~~~l~~~~~~~~~~~g-~~~~g~~~~-------~~~~  170 (184)
T smart00210      127 ----------------------------SGSSATLYVDCNEIDSRPLDRPGQPPIDTDG-IEVRGAQAA-------DRKP  170 (184)
T ss_pred             ----------------------------eCCEEEEEECCccccceecCCcccccccccc-eEEEeeccC-------CCCc
Confidence                                        88999999999987665432222 1344444 556554432       1348


Q ss_pred             ceeeEEEEEE
Q psy5025         491 FKGWISNLIL  500 (533)
Q Consensus       491 f~GCi~~l~i  500 (533)
                      |+|+|++|+|
T Consensus       171 f~G~lq~l~i  180 (184)
T smart00210      171 FQGDLQQLKI  180 (184)
T ss_pred             ceEEeEEEEE
Confidence            9999999998


No 18 
>KOG3509|consensus
Probab=98.73  E-value=2.8e-08  Score=109.38  Aligned_cols=250  Identities=19%  Similarity=0.296  Sum_probs=159.9

Q ss_pred             CCCCCCCCCCCCCCCCCeeeeCC-CCceecCCCCCCCCCccccccccccccCCc-----eeEEEcC-------------C
Q psy5025          40 KECLEDKCNKTPCEHGGKCITTP-GSSYCLCPLGFTGDLCQTRLDLQVPQFNGS-----SYLRYQG-------------L  100 (533)
Q Consensus        40 ~~C~~d~C~~~pC~NggtC~~~~-~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~-----syl~~~~-------------~  100 (533)
                      ..++...|...+|+-|++|.... -+|.+.|+..-.+..|+.....++..|...     ....+..             +
T Consensus       168 ~~~~~~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g~~~~~~~~vp~~~e~S~~~~~~~  247 (964)
T KOG3509|consen  168 PGTEPIHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEGCLKCFCFGVPRPSESSLHAFRAI  247 (964)
T ss_pred             CccccccccCcccccceeEEecCCcceeeeccccccceehhhccCceecccccccccccceeecCCCccccchhhhHhhh
Confidence            34556678889999999999986 788888888777777888776555544321     1111111             0


Q ss_pred             cccccceeEEEEEEEec--CCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccCCCcEEEEEEEE
Q psy5025         101 GEQYLSLLDLTIVFKAI--EPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRT  178 (533)
Q Consensus       101 ~~~~~~~~~ls~~FrT~--~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnDg~Wh~V~v~~~  178 (533)
                      .........+.. |++.  ..+.++.+....  ...+|+++.+..|-+.+.+..+.+...+......-.++|+.+.+.| 
T Consensus       248 h~~~~~~~~~~~-~~~~s~~~~~~~~~~~~~--~~~~f~~lt~~~g~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r-  323 (964)
T KOG3509|consen  248 HGATLHVDSLGV-FFSDSEHRDILGNFLFSS--FKDGFRALTLDGGTDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFR-  323 (964)
T ss_pred             ccchhccchhee-eccccccccccccccccc--cccceeeeccCCCCccccccccCcchhhhhccccccceeeeeEeee-
Confidence            111111112222 2232  234444444332  2378888888777777777766665555566777889999999998 


Q ss_pred             CCEEEEEEcCcc-CceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcccccccccCCCc-
Q psy5025         179 GRHAYLQVDRFP-SSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQGVNV-  256 (533)
Q Consensus       179 ~~~~~L~VD~~~-~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~~~~-  256 (533)
                        .-.+.+++.. ............+.....+|+|++-....+........+|.|||+++.++.+.++...+....+.. 
T Consensus       324 --~s~~~~~g~~~~l~g~~~~~~~~i~~ee~v~lg~i~ni~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~  401 (964)
T KOG3509|consen  324 --GSGLSVSGHKGVLQGNSNILVSRITNEESVFLGGIINIETLQHNLPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQ  401 (964)
T ss_pred             --cccccccCcceeecccccccccceeecccccCCceeeeccccccCCCccCccceehhhhhhccccccccccccccccc
Confidence              2233444421 111222233334445556888885555445555566789999999999999988754433333322 


Q ss_pred             cCC-CCCCCCCCC-CCCeeeecCCceEEecCCCCCcCcccc
Q psy5025         257 VNC-AHPCITQPC-GSGSCVPLYDGYKCVCTQSCKESHLTE  295 (533)
Q Consensus       257 ~~C-~~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~~c~~  295 (533)
                      ..| .+.|...|| ..+.|........|.|+++|.|..|+.
T Consensus       402 ~~c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d  442 (964)
T KOG3509|consen  402 GTCLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCED  442 (964)
T ss_pred             cccCCCccccccCCCCccccccccccceeccccccCchhhc
Confidence            245 578999999 999999999999999999999976654


No 19 
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=98.70  E-value=6.2e-07  Score=82.91  Aligned_cols=124  Identities=21%  Similarity=0.262  Sum_probs=85.8

Q ss_pred             ceeEEEEEEEec-CCCeeEEEccccCCCCCCeEEEEEECCEEEEEEEc--CCC-ceEEEee-ccccCCCcEEEEEEEECC
Q psy5025         106 SLLDLTIVFKAI-EPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDL--GTG-AATLRSS-NPISLGEWHKLRLTRTGR  180 (533)
Q Consensus       106 ~~~~ls~~FrT~-~~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~--g~~-~~~v~s~-~~lnDg~Wh~V~v~~~~~  180 (533)
                      ..+.|.++||+. ...|.||-..+..  ...++.|.+..++..+.+..  ..+ ...+... ..+.||+||+|.+...+.
T Consensus        52 ~~fsi~~~~r~~~~~~g~L~si~~~~--~~~~l~v~l~g~~~~~~~~~~~~~g~~~~~~f~~~~l~dg~WH~lal~V~~~  129 (184)
T smart00210       52 EDFSLLTTFRQTPKSRGVLFAIYDAQ--NVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRNLPLADGQWHKLALSVSGS  129 (184)
T ss_pred             CCeEEEEEEEeCCCCCeEEEEEEcCC--CcEEEEEEEeCCccEEEEEECCCCCcEEEEeecCCccccCCceEEEEEEeCC
Confidence            468999999986 7788888886642  26789999987765555553  222 2333444 779999999999999999


Q ss_pred             EEEEEEcCccCceeeCCCCc-eeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEEC
Q psy5025         181 HAYLQVDRFPSSQILSPGPF-TQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN  240 (533)
Q Consensus       181 ~~~L~VD~~~~~~~~~~~~~-~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in  240 (533)
                      .++|+||............. ..+.. ..++++|.....        ...|.|+|++++|-
T Consensus       130 ~v~LyvDC~~~~~~~l~~~~~~~~~~-~g~~~~g~~~~~--------~~~f~G~lq~l~i~  181 (184)
T smart00210      130 SATLYVDCNEIDSRPLDRPGQPPIDT-DGIEVRGAQAAD--------RKPFQGDLQQLKIV  181 (184)
T ss_pred             EEEEEECCccccceecCCcccccccc-cceEEEeeccCC--------CCcceEEeEEEEEe
Confidence            99999999776554433222 12333 335555544321        25899999999884


No 20 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.69  E-value=5.9e-09  Score=66.33  Aligned_cols=31  Identities=48%  Similarity=1.266  Sum_probs=29.2

Q ss_pred             CCCCCCCCCCeeeeCC-CCceecCCCCCCCCC
Q psy5025          47 CNKTPCEHGGKCITTP-GSSYCLCPLGFTGDL   77 (533)
Q Consensus        47 C~~~pC~NggtC~~~~-~~y~C~C~~g~~G~~   77 (533)
                      |.++||+|+|+|++.. ++|+|.|++||+|++
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            7789999999999999 999999999999974


No 21 
>PF13385 Laminin_G_3:  Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.67  E-value=5.4e-07  Score=80.19  Aligned_cols=149  Identities=19%  Similarity=0.350  Sum_probs=89.7

Q ss_pred             eeeecCCCeeEEecCccccccccccceeeeeeeeeeecceeeEEEEEEEeCCCCe--EEEEeccCCCCCeEEEEEE-CCE
Q psy5025         298 CASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSG--LILWTGSSQSKDFLALGIH-NGY  374 (533)
Q Consensus       298 ~~~f~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~rt~~~~g--~ll~~~~~~~~~~l~l~l~-~g~  374 (533)
                      ++.|+.+.+|+.++....                   ....++|+++||.....+  .++.. .....+.+.|.+. +|.
T Consensus         2 a~~f~g~~~~i~~~~~~~-------------------~~~~fTi~~w~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~   61 (157)
T PF13385_consen    2 ALYFDGSNDYISIPNSDF-------------------PSGSFTISFWVKPDSPSSSQSFVFM-DSSGSGGFGLFINNNGR   61 (157)
T ss_dssp             EEEE-STT-EEEEESGGG-------------------GGTEEEEEEEEEESS--SSEEEEEE-SSSSSEEEEEEEETTSE
T ss_pred             EEEECCCCCEEEECCcCC-------------------CCCCEEEEEEEEeCCCCCCceEEEE-ecCCCCEEEEEEECCCE
Confidence            467767889999886332                   145789999999986332  23322 2222457777776 577


Q ss_pred             EEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceee
Q psy5025         375 AHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSM  454 (533)
Q Consensus       375 l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~  454 (533)
                      +.+.+..+.+..........+.+++||+|.+.+                                .+..+.|+|||....
T Consensus        62 l~~~~~~~~~~~~~~~~~~~~~~~~W~~l~~~~--------------------------------~~~~~~lyvnG~~~~  109 (157)
T PF13385_consen   62 LRFYIGNGGGGNYSFSSDSNLPDNKWHHLALTY--------------------------------DGSTVTLYVNGELVG  109 (157)
T ss_dssp             EEEEETTSEEESS-EE-BS---TT-EEEEEEEE--------------------------------ETTEEEEEETTEEET
T ss_pred             EEEEEeCCCceeEEEecCcccCCCCEEEEEEEE--------------------------------ECCeEEEEECCEEEE
Confidence            777766553211122256789999999999999                                888999999999776


Q ss_pred             eEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEEEEEEcCceEEee
Q psy5025         455 QSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSNDYQVKL  508 (533)
Q Consensus       455 ~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~~  508 (533)
                      ....... ........++||+...        ....|.|-|.+|+| ..+.|.-
T Consensus       110 ~~~~~~~-~~~~~~~~~~iG~~~~--------~~~~~~g~i~~~~i-~~~aLt~  153 (157)
T PF13385_consen  110 SSTIPSN-ISLNSNGPLFIGGSGG--------GSSPFNGYIDDLRI-YNRALTA  153 (157)
T ss_dssp             TCTEESS-SSTTSCCEEEESS-ST--------T--B-EEEEEEEEE-ESS---H
T ss_pred             eEeccCC-cCCCCcceEEEeecCC--------CCCceEEEEEEEEE-ECccCCH
Confidence            5433222 1124556799998662        24589999999999 6666543


No 22 
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.46  E-value=5.5e-06  Score=78.02  Aligned_cols=140  Identities=14%  Similarity=0.117  Sum_probs=89.1

Q ss_pred             ceeeEEEEEEEeCC--CCeEEEEeccCCCCCeEEEEE-ECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025         336 GYEFNIDMRIKTSS--SSGLILWTGSSQSKDFLALGI-HNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV  412 (533)
Q Consensus       336 ~~~~~i~~~~rt~~--~~g~ll~~~~~~~~~~l~l~l-~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~  412 (533)
                      ...++|.+++|+..  ..+.||........+-+.+.. .++.+.+.+  ++....   ....+.||+||+|.+.++.   
T Consensus        30 l~~fTvc~W~k~~~~~~~~~ifSy~~~~~~ne~~~~~~~~~~~~l~i--~g~~~~---~~~~~~~g~W~hvc~tw~~---  101 (206)
T smart00159       30 LQAFTVCLWFYSDLSPRGYSLFSYATKGQDNELLLYKEKQGEYSLYI--GGKKVQ---FPVPESDGKWHHICTTWES---  101 (206)
T ss_pred             hhHEEEEEEEEecCCCCceEEEEEeCCCCCCeEEEEEcCCcEEEEEE--cCeEEE---ecccccCCceEEEEEEEEC---
Confidence            45789999999975  456676554443344444444 355555554  333222   2356899999999999932   


Q ss_pred             ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCce
Q psy5025         413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFK  492 (533)
Q Consensus       413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~  492 (533)
                                                 ..+.+.|+|||..... ........+...+.|.||.-.+..- ........|.
T Consensus       102 ---------------------------~~g~~~lyvnG~~~~~-~~~~~g~~i~~~G~lvlGq~qd~~g-g~f~~~~~f~  152 (206)
T smart00159      102 ---------------------------SSGIAELWVDGKPGVR-KGLAKGYTVKPGGSIILGQEQDSYG-GGFDATQSFV  152 (206)
T ss_pred             ---------------------------CCCcEEEEECCEEccc-ccccCCcEECCCCEEEEEecccCCC-CCCCCCccee
Confidence                                       4667999999987622 2222233455666788887543211 0112456899


Q ss_pred             eeEEEEEEcCceEEeecccCc
Q psy5025         493 GWISNLILSNDYQVKLPLSRS  513 (533)
Q Consensus       493 GCi~~l~i~~~~~l~~~~~~~  513 (533)
                      |-|.+++| +++.|...+...
T Consensus       153 G~i~~v~i-w~~~Ls~~eI~~  172 (206)
T smart00159      153 GEIGDLNM-WDSVLSPEEIKS  172 (206)
T ss_pred             EEEeeeEE-ecccCCHHHHHH
Confidence            99999999 888887766543


No 23 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.38  E-value=2e-07  Score=59.23  Aligned_cols=30  Identities=33%  Similarity=1.001  Sum_probs=27.6

Q ss_pred             CCCCCC-CCCeeeecC-CceEEecCCCCCcCc
Q psy5025         263 CITQPC-GSGSCVPLY-DGYKCVCTQSCKESH  292 (533)
Q Consensus       263 C~~~pC-ngg~C~~~~-~~~~C~C~~g~~G~~  292 (533)
                      |.++|| |+|+|++.. ++|.|.|++||+|++
T Consensus         1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence            788999 999999999 999999999999863


No 24 
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.24  E-value=5.1e-05  Score=71.19  Aligned_cols=139  Identities=16%  Similarity=0.141  Sum_probs=89.3

Q ss_pred             ceeeEEEEEEEeCC--CCeEEEEeccCCCCCeEEEEEE-CCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025         336 GYEFNIDMRIKTSS--SSGLILWTGSSQSKDFLALGIH-NGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV  412 (533)
Q Consensus       336 ~~~~~i~~~~rt~~--~~g~ll~~~~~~~~~~l~l~l~-~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~  412 (533)
                      ...++|.+++|+..  ..+.||........+.+.+... +|.+.+.+.  +....+.   ....||+||+|.+.+..   
T Consensus        30 l~~fTv~~Wv~~~~~~~~~~ifSy~~~~~~~~~~l~~~~~g~~~~~i~--~~~~~~~---~~~~~g~W~hv~~t~d~---  101 (201)
T cd00152          30 LQAFTLCLWVYTDLSTREYSLFSYATKGQDNELLLYKEKDGGYSLYIG--GKEVTFK---VPESDGAWHHICVTWES---  101 (201)
T ss_pred             hhhEEEEEEEEecCCCCCeEEEEEeCCCCCCeEEEEEcCCCeEEEEEc--CEEEEEe---ccCCCCCEEEEEEEEEC---
Confidence            45789999999874  4556665443333445555553 567766653  3333332   34599999999999922   


Q ss_pred             ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCce
Q psy5025         413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFK  492 (533)
Q Consensus       413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~  492 (533)
                                                 ..+.+.|+|||....... ......+...+.|.||...+... ........|.
T Consensus       102 ---------------------------~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~~g-g~~~~~~~f~  152 (201)
T cd00152         102 ---------------------------TSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDSYG-GGFDATQSFV  152 (201)
T ss_pred             ---------------------------CCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccCCC-CCCCCCcceE
Confidence                                       456799999998765543 22233455666788886543210 0112346899


Q ss_pred             eeEEEEEEcCceEEeecccC
Q psy5025         493 GWISNLILSNDYQVKLPLSR  512 (533)
Q Consensus       493 GCi~~l~i~~~~~l~~~~~~  512 (533)
                      |-|.+++| +++.|...+..
T Consensus       153 G~I~~v~i-w~~~Ls~~eI~  171 (201)
T cd00152         153 GEISDVNM-WDSVLSPEEIK  171 (201)
T ss_pred             EEEceeEE-EcccCCHHHHH
Confidence            99999999 88878766654


No 25 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.21  E-value=1.5e-06  Score=58.18  Aligned_cols=36  Identities=47%  Similarity=1.178  Sum_probs=33.1

Q ss_pred             CCCCCC-CCCCCCCeeeeCCCCceecCCCCCC-CCCcc
Q psy5025          44 EDKCNK-TPCEHGGKCITTPGSSYCLCPLGFT-GDLCQ   79 (533)
Q Consensus        44 ~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~-G~~Ce   79 (533)
                      +++|.. +||.++|+|++..++|.|.|+.||. |..|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            477887 8999999999999999999999999 99885


No 26 
>PF13385 Laminin_G_3:  Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=98.21  E-value=3.3e-05  Score=68.52  Aligned_cols=142  Identities=20%  Similarity=0.308  Sum_probs=84.4

Q ss_pred             cCCceeEEEcCCcccccceeEEEEEEEecCCCe---eEEEccccCCCCCCeEEEEEE-CCEEEEEEEcCC-CceEEEeec
Q psy5025          89 FNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNG---ILLYNGHRADGVGDFIALYLN-DRYVDFTFDLGT-GAATLRSSN  163 (533)
Q Consensus        89 F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~dG---lLl~~~~~~~~~~~fl~l~L~-~G~l~~~~~~g~-~~~~v~s~~  163 (533)
                      +.+.+|+.++..... ...++|+++||.....+   .++. ..   ...+.+.+.+. ++++.+.+..+. ....+....
T Consensus         6 ~g~~~~i~~~~~~~~-~~~fTi~~w~~~~~~~~~~~~~~~-~~---~~~~~~~l~~~~~~~l~~~~~~~~~~~~~~~~~~   80 (157)
T PF13385_consen    6 DGSNDYISIPNSDFP-SGSFTISFWVKPDSPSSSQSFVFM-DS---SGSGGFGLFINNNGRLRFYIGNGGGGNYSFSSDS   80 (157)
T ss_dssp             -STT-EEEEESGGGG-GTEEEEEEEEEESS--SSEEEEEE-SS---SSSEEEEEEEETTSEEEEEETTSEEESS-EE-BS
T ss_pred             CCCCCEEEECCcCCC-CCCEEEEEEEEeCCCCCCceEEEE-ec---CCCCEEEEEEECCCEEEEEEeCCCceeEEEecCc
Confidence            345789998863333 56899999999865332   3333 11   11346777776 477777766543 233555677


Q ss_pred             cccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEE
Q psy5025         164 PISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHE  243 (533)
Q Consensus       164 ~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~  243 (533)
                      .+.+++||+|.+...+..+.|+||+............ .......++||+....         ...|.|-|.+++|.+..
T Consensus        81 ~~~~~~W~~l~~~~~~~~~~lyvnG~~~~~~~~~~~~-~~~~~~~~~iG~~~~~---------~~~~~g~i~~~~i~~~a  150 (157)
T PF13385_consen   81 NLPDNKWHHLALTYDGSTVTLYVNGELVGSSTIPSNI-SLNSNGPLFIGGSGGG---------SSPFNGYIDDLRIYNRA  150 (157)
T ss_dssp             ---TT-EEEEEEEEETTEEEEEETTEEETTCTEESSS-STTSCCEEEESS-STT-----------B-EEEEEEEEEESS-
T ss_pred             ccCCCCEEEEEEEEECCeEEEEECCEEEEeEeccCCc-CCCCcceEEEeecCCC---------CCceEEEEEEEEEECcc
Confidence            8889999999999999999999999764433222111 1123457899987622         37899999999998877


Q ss_pred             cc
Q psy5025         244 LN  245 (533)
Q Consensus       244 l~  245 (533)
                      +.
T Consensus       151 Lt  152 (157)
T PF13385_consen  151 LT  152 (157)
T ss_dssp             --
T ss_pred             CC
Confidence            64


No 27 
>smart00159 PTX Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
Probab=98.20  E-value=8.6e-05  Score=69.92  Aligned_cols=148  Identities=17%  Similarity=0.136  Sum_probs=91.2

Q ss_pred             CCceeEEEcCCcccccceeEEEEEEEecC--CCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccC
Q psy5025          90 NGSSYLRYQGLGEQYLSLLDLTIVFKAIE--PNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISL  167 (533)
Q Consensus        90 ~g~syl~~~~~~~~~~~~~~ls~~FrT~~--~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnD  167 (533)
                      +...|+++.+........+++.+++|+..  .++.||-...... .+.++...-.++.+.+.+.  ..  .+..+..+.+
T Consensus        15 s~~~yv~l~~~~~~~l~~fTvc~W~k~~~~~~~~~ifSy~~~~~-~ne~~~~~~~~~~~~l~i~--g~--~~~~~~~~~~   89 (206)
T smart00159       15 SDTSYVKLKPELPKPLQAFTVCLWFYSDLSPRGYSLFSYATKGQ-DNELLLYKEKQGEYSLYIG--GK--KVQFPVPESD   89 (206)
T ss_pred             CCCCeEEEccCCCCChhHEEEEEEEEecCCCCceEEEEEeCCCC-CCeEEEEEcCCcEEEEEEc--Ce--EEEecccccC
Confidence            45689998765443567899999999864  4555553332211 1333322223455555543  22  4445567899


Q ss_pred             CCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025         168 GEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN  245 (533)
Q Consensus       168 g~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~  245 (533)
                      |+||+|.+++..  +.+.|++|+.... .........+...+.+.||.......  ..+.....|.|-|.++.+.++.+.
T Consensus        90 g~W~hvc~tw~~~~g~~~lyvnG~~~~-~~~~~~g~~i~~~G~lvlGq~qd~~g--g~f~~~~~f~G~i~~v~iw~~~Ls  166 (206)
T smart00159       90 GKWHHICTTWESSSGIAELWVDGKPGV-RKGLAKGYTVKPGGSIILGQEQDSYG--GGFDATQSFVGEIGDLNMWDSVLS  166 (206)
T ss_pred             CceEEEEEEEECCCCcEEEEECCEEcc-cccccCCcEECCCCEEEEEecccCCC--CCCCCCcceeEEEeeeEEecccCC
Confidence            999999999985  4678999997642 11111122233445677886543221  223345789999999999999886


No 28 
>PF00354 Pentaxin:  Pentaxin family;  InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=98.18  E-value=6.5e-05  Score=69.88  Aligned_cols=140  Identities=19%  Similarity=0.185  Sum_probs=82.5

Q ss_pred             ceeeEEEEEEEeCCC--CeEEEEeccCCCCCeEEEEEE-CCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeee
Q psy5025         336 GYEFNIDMRIKTSSS--SGLILWTGSSQSKDFLALGIH-NGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVV  412 (533)
Q Consensus       336 ~~~~~i~~~~rt~~~--~g~ll~~~~~~~~~~l~l~l~-~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~  412 (533)
                      ..+++|.|++|+...  .+.||....+...+-|.+... .+.+.+.+  ++....+   ...+.||+||++.+.++.   
T Consensus        24 L~~fTvC~w~k~~~~~~~~tifSYat~~~~nell~~~~~~~~~~l~i--~~~~~~~---~~~~~~~~Whh~C~tW~s---   95 (195)
T PF00354_consen   24 LSAFTVCFWVKTDDSSNDGTIFSYATSSQDNELLLFGSSSGSLRLYI--NGSSVSF---SGPIRDGQWHHICVTWDS---   95 (195)
T ss_dssp             BSEEEEEEEEEESGSGS-EEEEEEEETTEEEEEEEEEETTTEEEEEE--TTEEEEE---EECS-TSS-EEEEEEEET---
T ss_pred             cccEEEEEEEEeccCCCceEEEEEccCCCCccEEEEEeCCceEEEEE--CCeEeEe---ccccCCCCcEEEEEEEec---
Confidence            346899999999864  788886654433343444443 46665554  3333333   246799999999999943   


Q ss_pred             ecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCce
Q psy5025         413 MHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFK  492 (533)
Q Consensus       413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~  492 (533)
                                                 ....+.|++||...... .......+...+.+.||--.+... -......+|.
T Consensus        96 ---------------------------~~G~~~ly~dG~~~~~~-~~~~g~~i~~gG~~vlGQeQd~~g-G~fd~~q~F~  146 (195)
T PF00354_consen   96 ---------------------------STGRWQLYVDGVRLSST-GLATGHSIPGGGTLVLGQEQDSYG-GGFDESQAFV  146 (195)
T ss_dssp             ---------------------------TTTEEEEEETTEEEEEE-ESSTT--B-SSEEEEESS-BSBTT-BTCSGGGB--
T ss_pred             ---------------------------CCcEEEEEECCEecccc-cccCCceECCCCEEEECccccccC-CCcCCccEee
Confidence                                       56889999999954433 233445666667777775433211 1122456899


Q ss_pred             eeEEEEEEcCceEEeecccCc
Q psy5025         493 GWISNLILSNDYQVKLPLSRS  513 (533)
Q Consensus       493 GCi~~l~i~~~~~l~~~~~~~  513 (533)
                      |=|.++++ +++.|...+...
T Consensus       147 G~i~~~~i-Wd~vLs~~eI~~  166 (195)
T PF00354_consen  147 GEISDFNI-WDRVLSPEEIRA  166 (195)
T ss_dssp             EEEEEEEE-ESS---HHHHHH
T ss_pred             EEEeceEE-EeeeCCHHHHHH
Confidence            99999999 888887665543


No 29 
>cd00152 PTX Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Probab=98.09  E-value=0.0002  Score=67.23  Aligned_cols=146  Identities=20%  Similarity=0.139  Sum_probs=91.0

Q ss_pred             CCceeEEEcCCcccccceeEEEEEEEecC--CCeeEE-EccccCCCCCCeEEEEE-ECCEEEEEEEcCCCceEEEeeccc
Q psy5025          90 NGSSYLRYQGLGEQYLSLLDLTIVFKAIE--PNGILL-YNGHRADGVGDFIALYL-NDRYVDFTFDLGTGAATLRSSNPI  165 (533)
Q Consensus        90 ~g~syl~~~~~~~~~~~~~~ls~~FrT~~--~dGlLl-~~~~~~~~~~~fl~l~L-~~G~l~~~~~~g~~~~~v~s~~~l  165 (533)
                      +...|++++.........+++.+++|+..  ..+.+| |....   ..+.+.+.. ..|++.+.+..  .  ........
T Consensus        15 s~~~yv~l~~~~~~~l~~fTv~~Wv~~~~~~~~~~ifSy~~~~---~~~~~~l~~~~~g~~~~~i~~--~--~~~~~~~~   87 (201)
T cd00152          15 SDTSYVKLKPELPKPLQAFTLCLWVYTDLSTREYSLFSYATKG---QDNELLLYKEKDGGYSLYIGG--K--EVTFKVPE   87 (201)
T ss_pred             CCCceEEEccCCCCChhhEEEEEEEEecCCCCCeEEEEEeCCC---CCCeEEEEEcCCCeEEEEEcC--E--EEEEeccC
Confidence            34578998765444567899999999864  444455 33332   133444444 34677776642  2  22233455


Q ss_pred             cCCCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEE
Q psy5025         166 SLGEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHE  243 (533)
Q Consensus       166 nDg~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~  243 (533)
                      .+|+||+|.+++..  +.+.|+||+....... ......+.....+.||.......  ..+.....|.|-|.++++.+..
T Consensus        88 ~~g~W~hv~~t~d~~~g~~~lyvnG~~~~~~~-~~~~~~~~~~g~l~lG~~q~~~g--g~~~~~~~f~G~I~~v~iw~~~  164 (201)
T cd00152          88 SDGAWHHICVTWESTSGIAELWVNGKLSVRKS-LKKGYTVGPGGSIILGQEQDSYG--GGFDATQSFVGEISDVNMWDSV  164 (201)
T ss_pred             CCCCEEEEEEEEECCCCcEEEEECCEEecccc-ccCCCEECCCCeEEEeecccCCC--CCCCCCcceEEEEceeEEEccc
Confidence            99999999999984  4679999997643332 11122233445678886543211  1122347899999999999988


Q ss_pred             cc
Q psy5025         244 LN  245 (533)
Q Consensus       244 l~  245 (533)
                      ++
T Consensus       165 Ls  166 (201)
T cd00152         165 LS  166 (201)
T ss_pred             CC
Confidence            86


No 30 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=98.02  E-value=6e-06  Score=54.64  Aligned_cols=36  Identities=47%  Similarity=1.185  Sum_probs=32.8

Q ss_pred             CCCCCC-CCCCCCCeeeeCCCCceecCCCCCCCCCcc
Q psy5025          44 EDKCNK-TPCEHGGKCITTPGSSYCLCPLGFTGDLCQ   79 (533)
Q Consensus        44 ~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce   79 (533)
                      .+.|.. +||.+++.|++..++|.|.|+.+|.|+.|+
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            467887 799999999999999999999999999885


No 31 
>PF00354 Pentaxin:  Pentaxin family;  InterPro: IPR001759 Pentaxins (or pentraxins) [, ] are a family of proteins which show, under electron microscopy, a discoid arrangement of five noncovalently bound subunits. Proteins of the pentaxin family are involved in acute immunological responses []. Three of the principal members of the pentaxin family are serum proteins: namely, C-reactive protein (CRP) [], serum amyloid P component protein (SAP) [], and female protein (FP) []. CRP is expressed during acute phase response to tissue injury or inflammation in mammals. The protein resembles antibody and performs several functions associated with host defence: it promotes agglutination, bacterial capsular swelling and phagocytosis, and activates the classical complement pathway through its calcium-dependent binding to phosphocholine. CRPs have also been sequenced in an invertebrate, Limulus polyphemus (Atlantic horseshoe crab), where they are a normal constituent of the hemolymph. SAP is a vertebrate protein that is a precursor of amyloid component P. It is found in all types of amyloid deposits, in glomerular basement menbrane and in elastic fibres in blood vessels. SAP binds to various lipoprotein ligands in a calcium-dependent manner, and it has been suggested that, in mammals, this may have important implications in atherosclerosis and amyloidosis. FP is a SAP homologue found in Mesocricetus auratus (Golden hamster). The concentration of this plasma protein is altered by sex steroids and stimuli that elicit an acute phase response. Pentaxin proteins expressed in the nervous system are neural pentaxin I (NPI) and II (NPII) []. NPI and NPII are homologous and can exist within one species. It is suggested that both proteins mediate the uptake of synaptic macromolecules and play a role in synaptic plasticity. Apexin, a sperm acrosomal protein, is a homologue of NPII found in Cavia porcellus (Guinea pig) []. PTX3 (or TSG-14) protein is a cytokine-induced protein that is homologous to CRPs and SAPs, but its function is not yet known.; PDB: 2A3W_F 3KQR_C 3D5O_D 2A3X_G 1SAC_D 2W08_B 1GYK_B 1LGN_A 2A3Y_A 1B09_D ....
Probab=97.95  E-value=0.00031  Score=65.36  Aligned_cols=148  Identities=18%  Similarity=0.198  Sum_probs=86.0

Q ss_pred             CCceeEEEcCCcccccceeEEEEEEEecCC--CeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccC
Q psy5025          90 NGSSYLRYQGLGEQYLSLLDLTIVFKAIEP--NGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISL  167 (533)
Q Consensus        90 ~g~syl~~~~~~~~~~~~~~ls~~FrT~~~--dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnD  167 (533)
                      +...|+++.+........+++.+++|+...  .+.||--..... .++++.+.-..+.+.+.+..  .  .+.....+.+
T Consensus         9 s~~~yv~l~~~~~~pL~~fTvC~w~k~~~~~~~~tifSYat~~~-~nell~~~~~~~~~~l~i~~--~--~~~~~~~~~~   83 (195)
T PF00354_consen    9 STTDYVRLKPSVPLPLSAFTVCFWVKTDDSSNDGTIFSYATSSQ-DNELLLFGSSSGSLRLYING--S--SVSFSGPIRD   83 (195)
T ss_dssp             BSSBEEEEEESS-S-BSEEEEEEEEEESGSGS-EEEEEEEETTE-EEEEEEEEETTTEEEEEETT--E--EEEEEECS-T
T ss_pred             CCcceEEEecCCCCCcccEEEEEEEEeccCCCceEEEEEccCCC-CccEEEEEeCCceEEEEECC--e--EeEeccccCC
Confidence            346788887654333678999999999764  666666543321 13333332234666655532  2  3344556789


Q ss_pred             CCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025         168 GEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN  245 (533)
Q Consensus       168 g~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~  245 (533)
                      ++||++.+++..  +...|++||....... ......+...+.+.||.-....  ...+...+.|.|=|.++.+.++.++
T Consensus        84 ~~Whh~C~tW~s~~G~~~ly~dG~~~~~~~-~~~g~~i~~gG~~vlGQeQd~~--gG~fd~~q~F~G~i~~~~iWd~vLs  160 (195)
T PF00354_consen   84 GQWHHICVTWDSSTGRWQLYVDGVRLSSTG-LATGHSIPGGGTLVLGQEQDSY--GGGFDESQAFVGEISDFNIWDRVLS  160 (195)
T ss_dssp             SS-EEEEEEEETTTTEEEEEETTEEEEEEE-SSTT--B-SSEEEEESS-BSBT--TBTCSGGGB--EEEEEEEEESS---
T ss_pred             CCcEEEEEEEecCCcEEEEEECCEeccccc-ccCCceECCCCEEEECcccccc--CCCcCCccEeeEEEeceEEEeeeCC
Confidence            999999999987  5789999997432222 2223334445567788655432  2344566899999999999998886


No 32 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.74  E-value=3.5e-05  Score=50.09  Aligned_cols=33  Identities=55%  Similarity=1.438  Sum_probs=29.9

Q ss_pred             CC-CCCCCCCCeeeeCCCCceecCCCCCCCC-Ccc
Q psy5025          47 CN-KTPCEHGGKCITTPGSSYCLCPLGFTGD-LCQ   79 (533)
Q Consensus        47 C~-~~pC~NggtC~~~~~~y~C~C~~g~~G~-~Ce   79 (533)
                      |. .++|.+++.|++..++|.|.|+.||.|. .|+
T Consensus         2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C~   36 (36)
T cd00053           2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE   36 (36)
T ss_pred             CCCCCCCCCCCEEecCCCCeEeECCCCCcccCCcC
Confidence            55 7899999999999999999999999998 774


No 33 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.73  E-value=0.0007  Score=61.81  Aligned_cols=136  Identities=18%  Similarity=0.228  Sum_probs=85.4

Q ss_pred             eeeEEEEEEEeCCCCeE--EEEeccCC-CCCeEEEEEECCEEEEEEEcCCc--eEEEe----eCceeecCCCeEEEEEEE
Q psy5025         337 YEFNIDMRIKTSSSSGL--ILWTGSSQ-SKDFLALGIHNGYAHVMYNLGNG--EVLLE----NNQTKINNGVWHRLTVHR  407 (533)
Q Consensus       337 ~~~~i~~~~rt~~~~g~--ll~~~~~~-~~~~l~l~l~~g~l~~~~~~g~~--~~~~~----~~~~~~~dg~wH~v~~~~  407 (533)
                      ...+|-++|++...+++  ||.+.+.. ...||.|++.++.+-+.++...+  .+...    ..+...++-.||.|++..
T Consensus        33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~~~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~a  112 (190)
T PF02973_consen   33 EEGTIVIRFKSDSNSGIQSLFSISNSTKGNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFVA  112 (190)
T ss_dssp             SSEEEEEEEEESS-SSEEEEEEEE-TSTTSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEEE
T ss_pred             cccEEEEEEecCCCcceeEEEEecCCCCccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEEE
Confidence            45789999999776664  56554332 36999999999999999987655  22221    122345777899999998


Q ss_pred             eeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecC-Ccc-eeeccCCceEEcCCCCccccccc
Q psy5025         408 NFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISG-GKF-EQLNTDSGLYVGGVEDIEELTRS  485 (533)
Q Consensus       408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~-~~~-~~l~~~~~l~lGg~~~~~~~~~~  485 (533)
                      ..                              ......|++||......... ..+ ..++.-..++|||.....     
T Consensus       113 d~------------------------------~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g-----  157 (190)
T PF02973_consen  113 DS------------------------------KNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAG-----  157 (190)
T ss_dssp             ET------------------------------TTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETT-----
T ss_pred             ec------------------------------CCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCC-----
Confidence            11                              47899999999766654332 222 223223359999975432     


Q ss_pred             ccccCceeeEEEEEEcCceEEee
Q psy5025         486 KYHHSFKGWISNLILSNDYQVKL  508 (533)
Q Consensus       486 ~~~~~f~GCi~~l~i~~~~~l~~  508 (533)
                      ....+|.|-|++|+| -+..|+-
T Consensus       158 ~~~y~f~G~I~~l~i-Yn~aLsd  179 (190)
T PF02973_consen  158 SNAYPFNGTIDNLKI-YNRALSD  179 (190)
T ss_dssp             EEES--EEEEEEEEE-ESS---H
T ss_pred             CceecccceEEEEEE-EcCcCCH
Confidence            245689999999999 5555543


No 34 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.63  E-value=6.2e-05  Score=48.89  Aligned_cols=32  Identities=53%  Similarity=1.321  Sum_probs=29.1

Q ss_pred             CCC-CCCCCCCeeeeCCCCceecCCCCCCC-CCcc
Q psy5025          47 CNK-TPCEHGGKCITTPGSSYCLCPLGFTG-DLCQ   79 (533)
Q Consensus        47 C~~-~pC~NggtC~~~~~~y~C~C~~g~~G-~~Ce   79 (533)
                      |.. ++|.++ .|++..++|.|.|+.||.| +.|+
T Consensus         2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C~   35 (35)
T smart00181        2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCcCCCCCC-EEECCCCCeEeECCCCCccCCccC
Confidence            666 799999 9999999999999999999 8875


No 35 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.49  E-value=6.1e-05  Score=51.21  Aligned_cols=31  Identities=39%  Similarity=0.997  Sum_probs=27.7

Q ss_pred             CCCCCC--CCCCCCCeeeeCCCCceecCCCCCC
Q psy5025          44 EDKCNK--TPCEHGGKCITTPGSSYCLCPLGFT   74 (533)
Q Consensus        44 ~d~C~~--~pC~NggtC~~~~~~y~C~C~~g~~   74 (533)
                      +|+|..  ++|.+++.|+++.++|.|.|++||.
T Consensus         2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            577874  5799999999999999999999996


No 36 
>PF02973 Sialidase:  Sialidase, N-terminal domain;  InterPro: IPR004124 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Sialidases (GH33 from CAZY) hydrolyse alpha-(2->3)-, alpha-(2->6)-, alpha-(2->8)-glycosidic linkages of terminal sialic residues in oligosaccharides, glycoproteins, glycolipids, colominic acid and synthetic substrates. Sialidases may act as pathogenic factors in microbial infections [].  The 1.8 A structure of trans-sialidase from leech (Macrobdella decora, Q27701 from SWISSPROT) in complex with 2-deoxy-2, 3-didehydro-NeuAc was solved. The refined model comprising residues 81-769 has a catalytic beta-propeller domain, a N-terminal lectin-like domain and an irregular beta-stranded domain inserted into the catalytic domain [].; GO: 0004308 exo-alpha-sialidase activity, 0005975 carbohydrate metabolic process; PDB: 2JKB_A 2VW2_A 2VW0_A 2VW1_A 2V73_B 2V72_A 1SLI_A 1SLL_A 2SLI_A 4SLI_A ....
Probab=97.47  E-value=0.0033  Score=57.47  Aligned_cols=133  Identities=17%  Similarity=0.201  Sum_probs=86.0

Q ss_pred             ceeEEEEEEEecCCCee--EEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEE--ee-----ccccCCCcEEEEEE
Q psy5025         106 SLLDLTIVFKAIEPNGI--LLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLR--SS-----NPISLGEWHKLRLT  176 (533)
Q Consensus       106 ~~~~ls~~FrT~~~dGl--Ll~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~--s~-----~~lnDg~Wh~V~v~  176 (533)
                      ...+|.++|++...+++  ||-..+... ...|+.|++.++++-+.+....+.....  .+     ...++-.||.|.+.
T Consensus        33 ~~gTI~i~Fk~~~~~~~~sLfsiSn~~~-~n~YF~lyv~~~~~G~E~R~~~~~~~y~~~~~~~v~~~~~~~~~~~tva~~  111 (190)
T PF02973_consen   33 EEGTIVIRFKSDSNSGIQSLFSISNSTK-GNEYFSLYVSNNKLGFELRDTKGNQNYNFSRPAKVRGGYKNNVTFNTVAFV  111 (190)
T ss_dssp             SSEEEEEEEEESS-SSEEEEEEEE-TST-TSEEEEEEEETTEEEEEEEETTTTCEEEEEESSE--SEETTEES-EEEEEE
T ss_pred             cccEEEEEEecCCCcceeEEEEecCCCC-ccceEEEEEECCEEEEEEecCCCCcccccccccEecccccCCceEEEEEEE
Confidence            45789999999776666  666665543 3699999999999999998765533322  22     22456689999999


Q ss_pred             EE--CCEEEEEEcCccCceeeCCCCce--eeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025         177 RT--GRHAYLQVDRFPSSQILSPGPFT--QLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN  245 (533)
Q Consensus       177 ~~--~~~~~L~VD~~~~~~~~~~~~~~--~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~  245 (533)
                      ..  ....+|++||.............  .+.--..++||+.....      ...-+|.|-|+.+.+-+++++
T Consensus       112 ad~~~~~ykly~NG~~v~~~~~~~~~Fis~i~~~n~~~iG~t~R~g------~~~y~f~G~I~~l~iYn~aLs  178 (190)
T PF02973_consen  112 ADSKNKGYKLYVNGELVSTLSSKSGNFISDIPGLNSVQIGGTNRAG------SNAYPFNGTIDNLKIYNRALS  178 (190)
T ss_dssp             EETTTTEEEEEETTCEEEEEEECTSS-GGGSTT--EEEESSEEETT------EEES--EEEEEEEEEESS---
T ss_pred             EecCCCeEEEEeCCeeEEEeccccccHhhcCcCCceEEEcceEeCC------CceecccceEEEEEEEcCcCC
Confidence            97  78999999996544332222111  11112469999986532      234789999999999888775


No 37 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.42  E-value=0.00018  Score=47.77  Aligned_cols=34  Identities=26%  Similarity=0.813  Sum_probs=29.4

Q ss_pred             CCCCC-CCC-CCCeeeecCCceEEecCCCCC-cCccc
Q psy5025         261 HPCIT-QPC-GSGSCVPLYDGYKCVCTQSCK-ESHLT  294 (533)
Q Consensus       261 ~~C~~-~pC-ngg~C~~~~~~~~C~C~~g~~-G~~c~  294 (533)
                      +.|.. +|| ++|+|++...+|.|.|+++|. |..|+
T Consensus         3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~   39 (39)
T smart00179        3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE   39 (39)
T ss_pred             ccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence            45776 799 999999999999999999998 87763


No 38 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.26  E-value=0.00036  Score=45.88  Aligned_cols=34  Identities=26%  Similarity=0.822  Sum_probs=29.7

Q ss_pred             CCCCC-CCC-CCCeeeecCCceEEecCCCCCcCccc
Q psy5025         261 HPCIT-QPC-GSGSCVPLYDGYKCVCTQSCKESHLT  294 (533)
Q Consensus       261 ~~C~~-~pC-ngg~C~~~~~~~~C~C~~g~~G~~c~  294 (533)
                      ++|.. .|| +++.|.+....|.|.|+++|.|..|+
T Consensus         3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~   38 (38)
T cd00054           3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE   38 (38)
T ss_pred             ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence            46776 799 99999999999999999999997763


No 39 
>KOG3509|consensus
Probab=97.24  E-value=0.00097  Score=74.37  Aligned_cols=209  Identities=22%  Similarity=0.272  Sum_probs=125.8

Q ss_pred             CCCCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccccccccccccCCceeEEEcCCcccccceeEEEEEEEecCCC
Q psy5025          41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTRLDLQVPQFNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPN  120 (533)
Q Consensus        41 ~C~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~~~~~~~~F~g~syl~~~~~~~~~~~~~~ls~~FrT~~~d  120 (533)
                      -|..+.|.+.||+..+.|.+......|.|+.+|+|+.|+...+.+...+.+.......+...                  
T Consensus       403 ~c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~------------------  464 (964)
T KOG3509|consen  403 TCLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQG------------------  464 (964)
T ss_pred             ccCCCccccccCCCCccccccccccceeccccccCchhhccCccccccCCccccceEeccCC------------------
Confidence            44467899999999999999999999999999999999998766555555443333322111                  


Q ss_pred             eeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCc
Q psy5025         121 GILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPF  200 (533)
Q Consensus       121 GlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~  200 (533)
                      ..+.|-+.. .  +++.+..|          ...+.....+.......+||+..+.+......+.++-............
T Consensus       465 ~~~~~c~pg-~--g~~~~~~l----------g~~~~~a~~s~~s~~~~~~~~f~~~~t~~~~~~~c~~~~~~~s~s~~~~  531 (964)
T KOG3509|consen  465 KRCEYCGPG-A--GAPTAGAL----------GQCGCKASASGPSTSRCMPHHFGISPTNHQGCLPCFCSELTLSISSSTY  531 (964)
T ss_pred             CcceeecCC-C--CCccchhc----------cccCccceeccccccccccccCCcccccccccccccchhcccccccccc
Confidence            233333222 1  33333311          1111122223344556789999998888777777666443333333333


Q ss_pred             eeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcccccccccCCCccCCCCCCCCCCC-CCCeeeecCCc
Q psy5025         201 TQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELNILGEAIQGVNVVNCAHPCITQPC-GSGSCVPLYDG  279 (533)
Q Consensus       201 ~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~~~~~~~~~~~~~~C~~~C~~~pC-ngg~C~~~~~~  279 (533)
                      ........+.+|+......+.+.......+.|+.+.+-. ...+....+.........+|..+...++ -.|.++..|..
T Consensus       532 ~~~s~~~~~~~~~~~~i~~~~~~l~~r~~~l~~~~~i~~-~~s~~v~a~~~~~~~~~~~P~~~vad~vs~~g~~~~~~~s  610 (964)
T KOG3509|consen  532 RCSSVSIDYARGQFDLIELLRPDLRQRYASLGRDQLINQ-AISFSVSAEAGGLLLRWSLPAKPVADKVSPLGGALSQTHS  610 (964)
T ss_pred             cccchheecccchhhhhhhccccccccccccceeeeccc-cceeeeeeeccceEEEecCCCCCCcceeccccceeeeeec
Confidence            333344456677776665555666666677777654433 1111112222222334567888888899 88888888877


Q ss_pred             eE
Q psy5025         280 YK  281 (533)
Q Consensus       280 ~~  281 (533)
                      |.
T Consensus       611 ~~  612 (964)
T KOG3509|consen  611 YK  612 (964)
T ss_pred             cC
Confidence            65


No 40 
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=97.14  E-value=0.012  Score=51.07  Aligned_cols=68  Identities=16%  Similarity=0.202  Sum_probs=45.6

Q ss_pred             CCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccC--cEEEEEECCceeeeEecCCcceeeccCCceEEcC
Q psy5025         398 GVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTG--KQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGG  475 (533)
Q Consensus       398 g~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg  475 (533)
                      |+||+|.+..                                ++  ..++|+|||.........    .......+.||.
T Consensus        61 ~~W~hva~v~--------------------------------d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~  104 (133)
T smart00560       61 GVWVHLAGVY--------------------------------DGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGG  104 (133)
T ss_pred             CCEEEEEEEE--------------------------------ECCCCeEEEEECCEEccccccC----CcccCCceEEee
Confidence            9999999999                                55  789999999876543221    112345688884


Q ss_pred             CCCcccccccccccCceeeEEEEEEcCceEE
Q psy5025         476 VEDIEELTRSKYHHSFKGWISNLILSNDYQV  506 (533)
Q Consensus       476 ~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l  506 (533)
                      ....    .......|.|.|.+++| .+..|
T Consensus       105 ~~~~----~~~~~~~f~G~Idevri-y~~aL  130 (133)
T smart00560      105 RILL----GGAGGENFSGRLDEVRV-YNRAL  130 (133)
T ss_pred             eccC----CCCCCCCceEEeeEEEE-ecccc
Confidence            1111    11245689999999999 55544


No 41 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.10  E-value=0.00058  Score=43.13  Aligned_cols=27  Identities=33%  Similarity=0.962  Sum_probs=24.3

Q ss_pred             CCCCCCCeeeeCCCCceecCCCCCCCCCc
Q psy5025          50 TPCEHGGKCITTPGSSYCLCPLGFTGDLC   78 (533)
Q Consensus        50 ~pC~NggtC~~~~~~y~C~C~~g~~G~~C   78 (533)
                      ..|.+.|+|+..  ...|.|..+|+|+.|
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            479999999976  679999999999987


No 42 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=97.07  E-value=0.00077  Score=43.54  Aligned_cols=31  Identities=32%  Similarity=0.922  Sum_probs=27.6

Q ss_pred             CC-CCCC-CCCeeeecCCceEEecCCCCCcC-cc
Q psy5025         263 CI-TQPC-GSGSCVPLYDGYKCVCTQSCKES-HL  293 (533)
Q Consensus       263 C~-~~pC-ngg~C~~~~~~~~C~C~~g~~G~-~c  293 (533)
                      |. ..+| +++.|++....|.|.|+.||.|. .|
T Consensus         2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C   35 (36)
T cd00053           2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence            55 7899 99999999999999999999987 54


No 43 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.79  E-value=0.0017  Score=41.90  Aligned_cols=31  Identities=23%  Similarity=0.754  Sum_probs=26.7

Q ss_pred             CCCC-CCC-CCCeeeecCCceEEecCCCCCc-Ccc
Q psy5025         262 PCIT-QPC-GSGSCVPLYDGYKCVCTQSCKE-SHL  293 (533)
Q Consensus       262 ~C~~-~pC-ngg~C~~~~~~~~C~C~~g~~G-~~c  293 (533)
                      +|.. .+| ++ .|++...+|.|.|++||.| ..|
T Consensus         1 ~C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C   34 (35)
T smart00181        1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence            3666 699 77 9999999999999999998 555


No 44 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.47  E-value=0.00083  Score=33.11  Aligned_cols=13  Identities=54%  Similarity=1.329  Sum_probs=11.0

Q ss_pred             eecCCCCCCCCCc
Q psy5025          66 YCLCPLGFTGDLC   78 (533)
Q Consensus        66 ~C~C~~g~~G~~C   78 (533)
                      .|.|++||+|++|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5999999999988


No 45 
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=96.30  E-value=0.12  Score=44.75  Aligned_cols=69  Identities=22%  Similarity=0.179  Sum_probs=46.4

Q ss_pred             CCcEEEEEEEEC--CEEEEEEcCccCceeeCCCCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025         168 GEWHKLRLTRTG--RHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN  245 (533)
Q Consensus       168 g~Wh~V~v~~~~--~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~  245 (533)
                      ++||+|.+..+.  ..++|+||+.........    .......+.||.....     .......|.|.|.+++|.+..+.
T Consensus        61 ~~W~hva~v~d~~~g~~~lYvnG~~~~~~~~~----~~~~~~~~~iG~~~~~-----~~~~~~~f~G~Idevriy~~aLs  131 (133)
T smart00560       61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQ----PSPSSGNLPQGGRILL-----GGAGGENFSGRLDEVRVYNRALT  131 (133)
T ss_pred             CCEEEEEEEEECCCCeEEEEECCEEccccccC----CcccCCceEEeeeccC-----CCCCCCCceEEeeEEEEeccccC
Confidence            899999999998  789999999654322211    1122356788841111     11123689999999999887764


No 46 
>KOG1834|consensus
Probab=96.10  E-value=0.084  Score=55.82  Aligned_cols=135  Identities=13%  Similarity=0.197  Sum_probs=90.0

Q ss_pred             cceeeEEEEEEEeCC-------CCeEEEEeccC--CCCCeEEEEEECCEEEEEEEcCCce------EEEeeCceeecCCC
Q psy5025         335 TGYEFNIDMRIKTSS-------SSGLILWTGSS--QSKDFLALGIHNGYAHVMYNLGNGE------VLLENNQTKINNGV  399 (533)
Q Consensus       335 ~~~~~~i~~~~rt~~-------~~g~ll~~~~~--~~~~~l~l~l~~g~l~~~~~~g~~~------~~~~~~~~~~~dg~  399 (533)
                      ....|+|+|+.|-..       ..-.||-....  .+...++|++++=+|.|.++...+.      ....-.-..+.|.+
T Consensus       364 l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~~fRpaef~Wkl~qVCD~E  443 (952)
T KOG1834|consen  364 LPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATSDFRPAEFHWKLPQVCDNE  443 (952)
T ss_pred             CCCceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccccccchheeccchhhhhhh
Confidence            356789999998643       22366654322  2367889999999999998764321      11111345699999


Q ss_pred             eEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeE--ecCCcceeeccCCceEEcCCC
Q psy5025         400 WHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQS--ISGGKFEQLNTDSGLYVGGVE  477 (533)
Q Consensus       400 wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~--~~~~~~~~l~~~~~l~lGg~~  477 (533)
                      ||+-.+..                                ....++|++||......  ......+.-.....|.||.-.
T Consensus       444 WH~Y~ln~--------------------------------efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW  491 (952)
T KOG1834|consen  444 WHHYVLNV--------------------------------EFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACW  491 (952)
T ss_pred             hheeEEee--------------------------------cCceEEEEEcCcccCCceeccCCccCcccccceeEEeeec
Confidence            99999999                                77889999999864321  122222333345568898765


Q ss_pred             CcccccccccccCceeeEEEEEEc
Q psy5025         478 DIEELTRSKYHHSFKGWISNLILS  501 (533)
Q Consensus       478 ~~~~~~~~~~~~~f~GCi~~l~i~  501 (533)
                      .............|.|-+..|.+-
T Consensus       492 ~g~~~~~l~~aqfFrG~Lasltlr  515 (952)
T KOG1834|consen  492 QGRQQKPLKLAQFFRGQLASLTLR  515 (952)
T ss_pred             cCccccchhHHHHhhcccceeEEe
Confidence            544333345677899999999883


No 47 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.87  E-value=0.0089  Score=40.48  Aligned_cols=29  Identities=28%  Similarity=0.826  Sum_probs=24.9

Q ss_pred             CCCCC--CCC-CCCeeeecCCceEEecCCCCC
Q psy5025         261 HPCIT--QPC-GSGSCVPLYDGYKCVCTQSCK  289 (533)
Q Consensus       261 ~~C~~--~pC-ngg~C~~~~~~~~C~C~~g~~  289 (533)
                      ++|..  ++| .++.|++...+|.|.|++||.
T Consensus         3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~   34 (42)
T PF07645_consen    3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYE   34 (42)
T ss_dssp             STTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred             cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence            35664  569 899999999999999999996


No 48 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.69  E-value=0.013  Score=37.04  Aligned_cols=26  Identities=31%  Similarity=0.755  Sum_probs=22.7

Q ss_pred             CCC-CCCeeeecCCceEEecCCCCCcCcc
Q psy5025         266 QPC-GSGSCVPLYDGYKCVCTQSCKESHL  293 (533)
Q Consensus       266 ~pC-ngg~C~~~~~~~~C~C~~g~~G~~c  293 (533)
                      ..| ++|+|+..  ...|.|+++|.|+.|
T Consensus         6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSP--CGRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence            479 99999976  789999999999865


No 49 
>KOG1836|consensus
Probab=95.24  E-value=0.0062  Score=72.58  Aligned_cols=135  Identities=17%  Similarity=0.149  Sum_probs=89.1

Q ss_pred             eeEEEEEEEeCCCCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEe-eCceeecCCCeEEEEEEEeeeeeecCC
Q psy5025         338 EFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLE-NNQTKINNGVWHRLTVHRNFMVVMHGS  416 (533)
Q Consensus       338 ~~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~-~~~~~~~dg~wH~v~~~~~~~~~~~~~  416 (533)
                      .+.+.+..+..+..|.+-...+ ....+..+....+.....+..+-...... .....+-++.||.+.+.+         
T Consensus      1557 ~~~~~~~~~~~~~~~~l~~~~s-~~~~~~~~~~~~~~~~~~~~~gi~~~~~s~~~~~~~~~~~~~~~~~~~--------- 1626 (1705)
T KOG1836|consen 1557 AFALVFSERNVSSTGGLTHHLS-KLGTELLVQENPIGVTEKFESGITDLSTSSTPIVSLLPGGCHSVTSST--------- 1626 (1705)
T ss_pred             hHHhhhcccccccCCCcccccc-ccchHHhhhhcccccchhhhhhhhhhhhcchhhhhhcCCcceeeeeec---------
Confidence            3455566666665555544332 22567777777777777665543211111 144678999999999999         


Q ss_pred             cccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcccccccccccCceeeEE
Q psy5025         417 SHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWIS  496 (533)
Q Consensus       417 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~~~~~~~~~~~f~GCi~  496 (533)
                                             ....+.+..|. .+.   .+..........++++||.|.....+......+|.||| 
T Consensus      1627 -----------------------~~~v~~~~~~~-~~~---~~~~~~~~~~~~p~~~~~~~~s~~~~~~~~~~~~~~~~- 1678 (1705)
T KOG1836|consen 1627 -----------------------DPGVVQLEDDT-YTV---GEIPPPPADTQEPIKLGGYPSSLTTLRIAVLKSFTGCI- 1678 (1705)
T ss_pred             -----------------------CCccccccccc-eec---ccCCCCchhccCCcccCCccccccceeeecccccccce-
Confidence                                   77778888887 222   22222344567789999999887666666778999999 


Q ss_pred             EEEEcCceEEeecccC
Q psy5025         497 NLILSNDYQVKLPLSR  512 (533)
Q Consensus       497 ~l~i~~~~~l~~~~~~  512 (533)
                       +.+ .+.++++..+.
T Consensus      1679 -~~~-~~~~~~~~~a~ 1692 (1705)
T KOG1836|consen 1679 -FVV-MGIRVDVTLAG 1692 (1705)
T ss_pred             -EEe-cCCCCcHHHHH
Confidence             777 77888776654


No 50 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.99  E-value=0.011  Score=38.51  Aligned_cols=27  Identities=37%  Similarity=0.840  Sum_probs=21.3

Q ss_pred             CCCCCCCeeeeCCCCceecCCCCCCCC
Q psy5025          50 TPCEHGGKCITTPGSSYCLCPLGFTGD   76 (533)
Q Consensus        50 ~pC~NggtC~~~~~~y~C~C~~g~~G~   76 (533)
                      ..|...++|++..++|.|.|++||.|+
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            467778999999999999999999775


No 51 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.45  E-value=0.029  Score=36.53  Aligned_cols=26  Identities=19%  Similarity=0.611  Sum_probs=21.4

Q ss_pred             CCC-CCCeeeecCCceEEecCCCCCcC
Q psy5025         266 QPC-GSGSCVPLYDGYKCVCTQSCKES  291 (533)
Q Consensus       266 ~pC-ngg~C~~~~~~~~C~C~~g~~G~  291 (533)
                      ..| ..++|++...+|+|.|.+||.|+
T Consensus         6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd   32 (36)
T PF12947_consen    6 GGCHPNATCTNTGGSYTCTCKPGYEGD   32 (36)
T ss_dssp             GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred             CCCCCCcEeecCCCCEEeECCCCCccC
Confidence            358 88999999999999999999885


No 52 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=94.43  E-value=0.027  Score=46.91  Aligned_cols=33  Identities=39%  Similarity=0.885  Sum_probs=28.8

Q ss_pred             CCCCCCCCeeeeCC--CCceecCCCCCCCCCccccc
Q psy5025          49 KTPCEHGGKCITTP--GSSYCLCPLGFTGDLCQTRL   82 (533)
Q Consensus        49 ~~pC~NggtC~~~~--~~y~C~C~~g~~G~~Ce~~~   82 (533)
                      .+-|.|| +|.-..  +.+.|.|+.||+|..||...
T Consensus        50 ~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d   84 (139)
T PHA03099         50 DGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV   84 (139)
T ss_pred             CCEeECC-EEEeeccCCCceeECCCCccccccccee
Confidence            4689997 898765  99999999999999999863


No 53 
>PHA02887 EGF-like protein; Provisional
Probab=94.01  E-value=0.048  Score=44.72  Aligned_cols=36  Identities=31%  Similarity=0.904  Sum_probs=29.2

Q ss_pred             CCCC---CCCCCCCCeeeeCC--CCceecCCCCCCCCCcccc
Q psy5025          45 DKCN---KTPCEHGGKCITTP--GSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        45 d~C~---~~pC~NggtC~~~~--~~y~C~C~~g~~G~~Ce~~   81 (533)
                      ..|.   .+-|.| |+|.-..  ..+.|.|+.||+|..||..
T Consensus        84 ~pC~~eyk~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         84 EKCKNDFNDFCIN-GECMNIIDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             cccChHhhCEeeC-CEEEccccCCCceeECCCCcccCCCCcc
Confidence            4565   367886 6998765  8899999999999999974


No 54 
>KOG1834|consensus
Probab=93.26  E-value=1.5  Score=46.91  Aligned_cols=156  Identities=17%  Similarity=0.234  Sum_probs=94.9

Q ss_pred             ccCCceeEEEcCCcc--cccceeEEEEEEEecC-------CCeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCc-e
Q psy5025          88 QFNGSSYLRYQGLGE--QYLSLLDLTIVFKAIE-------PNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-A  157 (533)
Q Consensus        88 ~F~g~syl~~~~~~~--~~~~~~~ls~~FrT~~-------~dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~-~  157 (533)
                      .|+|..-+.++.-..  .....|+|+|++|-..       ..-.|+-..+.......-++|++.+.||.|.+....+. .
T Consensus       346 eFdG~qgv~vpdg~~~g~l~dhFTlSfwMkHg~~p~~~~~eketIlCnsdk~emnrhHyslyvh~Crl~fllr~d~~~~~  425 (952)
T KOG1834|consen  346 EFDGTQGVTVPDGNVSGSLPDHFTLSFWMKHGPGPKDEQSEKETILCNSDKTEMNRHHYSLYVHGCRLEFLLRRDAGATS  425 (952)
T ss_pred             EEcCceeeEccCCCCCCCCCCceEEEEeeecCCCCccccccceeEEecccccccccceeEEEEeccEEEEEEccCccccc
Confidence            466666666553221  2335789999998321       23356665554444567899999999999999864332 1


Q ss_pred             EEE-----ee-ccccCCCcEEEEEEEECCEEEEEEcCccCce--eeCCCCceeeecCCCEEEcCcCCCCccCCCcccccC
Q psy5025         158 TLR-----SS-NPISLGEWHKLRLTRTGRHAYLQVDRFPSSQ--ILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSS  229 (533)
Q Consensus       158 ~v~-----s~-~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~--~~~~~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~  229 (533)
                      ..+     -. ..+.|.+||+-.+....-.+.|++||..-..  .....+.........+.||.-=.... ...+...+.
T Consensus       426 ~fRpaef~Wkl~qVCD~EWH~Y~ln~efp~VtlyvDG~Sfep~~i~ddwplHpsk~~tqLvVGACW~g~~-~~~l~~aqf  504 (952)
T KOG1834|consen  426 DFRPAEFHWKLPQVCDNEWHHYVLNVEFPDVTLYVDGKSFEPPLITDDWPLHPSKIETQLVVGACWQGRQ-QKPLKLAQF  504 (952)
T ss_pred             cccchheeccchhhhhhhhheeEEeecCceEEEEEcCcccCCceeccCCccCcccccceeEEeeeccCcc-ccchhHHHH
Confidence            222     12 4578999999999999999999999954211  11111222222345677774322211 011234578


Q ss_pred             ceeeeeeEEECcEEc
Q psy5025         230 FIGCIQKVLINNHEL  244 (533)
Q Consensus       230 F~GCI~~l~in~~~l  244 (533)
                      |.|-+..+.+....+
T Consensus       505 FrG~LasltlrsGkl  519 (952)
T KOG1834|consen  505 FRGQLASLTLRSGKL  519 (952)
T ss_pred             hhcccceeEEecccc
Confidence            999998888865544


No 55 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=93.11  E-value=0.095  Score=43.78  Aligned_cols=35  Identities=20%  Similarity=0.522  Sum_probs=27.7

Q ss_pred             CCC-CCCeee--ecCCceEEecCCCCCcCcccceeeeee
Q psy5025         266 QPC-GSGSCV--PLYDGYKCVCTQSCKESHLTEVMCASF  301 (533)
Q Consensus       266 ~pC-ngg~C~--~~~~~~~C~C~~g~~G~~c~~~~~~~f  301 (533)
                      +-| || +|.  ...+.+.|.|+.||.|.+|+...-..+
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~dLl~~   88 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVVLVDY   88 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCcccccccceeeeee
Confidence            569 86 896  445789999999999999997654333


No 56 
>PHA02887 EGF-like protein; Provisional
Probab=92.95  E-value=0.092  Score=43.12  Aligned_cols=35  Identities=26%  Similarity=0.550  Sum_probs=26.6

Q ss_pred             CCCC---CCCCCCeee--ecCCceEEecCCCCCcCcccce
Q psy5025         262 PCIT---QPCGSGSCV--PLYDGYKCVCTQSCKESHLTEV  296 (533)
Q Consensus       262 ~C~~---~pCngg~C~--~~~~~~~C~C~~g~~G~~c~~~  296 (533)
                      +|.+   +-|-+|+|.  ...+.+.|.|+.||.|.+|+..
T Consensus        85 pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         85 KCKNDFNDFCINGECMNIIDLDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             ccChHhhCEeeCCEEEccccCCCceeECCCCcccCCCCcc
Confidence            4553   568236996  4557899999999999999864


No 57 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=92.30  E-value=0.091  Score=30.68  Aligned_cols=17  Identities=41%  Similarity=1.052  Sum_probs=13.6

Q ss_pred             CceecCCCCCC----CCCccc
Q psy5025          64 SSYCLCPLGFT----GDLCQT   80 (533)
Q Consensus        64 ~y~C~C~~g~~----G~~Ce~   80 (533)
                      +|+|.|++||.    |+.|+.
T Consensus         1 sy~C~C~~Gy~l~~d~~~C~D   21 (24)
T PF12662_consen    1 SYTCSCPPGYQLSPDGRSCED   21 (24)
T ss_pred             CEEeeCCCCCcCCCCCCcccc
Confidence            69999999994    667764


No 58 
>PF06439 DUF1080:  Domain of Unknown Function (DUF1080);  InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=91.20  E-value=0.29  Score=44.83  Aligned_cols=91  Identities=13%  Similarity=0.236  Sum_probs=51.2

Q ss_pred             eeEEEEEEE--eCCCCeEEEEecc----CCCCCeEEEEEECCEEEEEEEcCCceEE-------EeeCceeecCCCeEEEE
Q psy5025         338 EFNIDMRIK--TSSSSGLILWTGS----SQSKDFLALGIHNGYAHVMYNLGNGEVL-------LENNQTKINNGVWHRLT  404 (533)
Q Consensus       338 ~~~i~~~~r--t~~~~g~ll~~~~----~~~~~~l~l~l~~g~l~~~~~~g~~~~~-------~~~~~~~~~dg~wH~v~  404 (533)
                      .+.+++.||  .....|++|....    ........+.|.++.-........+...       .......+..|+||+++
T Consensus        54 df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~W~~~~  133 (185)
T PF06439_consen   54 DFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEPSVNVAIPPGEWNTVR  133 (185)
T ss_dssp             SEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-SSS--S--TTSEEEEE
T ss_pred             cEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccccccccCCCCceEEEE
Confidence            455666666  3345577777651    1124556676664332211111112211       22244669999999999


Q ss_pred             EEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCC
Q psy5025         405 VHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGG  460 (533)
Q Consensus       405 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~  460 (533)
                      |+.                                .+.++++.||+..+.....+.
T Consensus       134 I~~--------------------------------~g~~i~v~vnG~~v~~~~d~~  157 (185)
T PF06439_consen  134 IVV--------------------------------KGNRITVWVNGKPVADFTDPS  157 (185)
T ss_dssp             EEE--------------------------------ETTEEEEEETTEEEEEEETTS
T ss_pred             EEE--------------------------------ECCEEEEEECCEEEEEEEcCC
Confidence            999                                999999999999888765543


No 59 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.92  E-value=0.093  Score=33.98  Aligned_cols=32  Identities=38%  Similarity=0.910  Sum_probs=23.2

Q ss_pred             CCCCCCCCCCeeeeCC-CCceecCCCCC--CCCCc
Q psy5025          47 CNKTPCEHGGKCITTP-GSSYCLCPLGF--TGDLC   78 (533)
Q Consensus        47 C~~~pC~NggtC~~~~-~~y~C~C~~g~--~G~~C   78 (533)
                      |...+|+.++.|+... +.+.|.|-+||  .|..|
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C   36 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGGKC   36 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence            7778999999999998 99999999999  45555


No 60 
>KOG1214|consensus
Probab=90.23  E-value=0.37  Score=52.61  Aligned_cols=42  Identities=36%  Similarity=0.865  Sum_probs=36.1

Q ss_pred             ceee---cCCC-CCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCC
Q psy5025          35 IKGR---SKEC-LEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGD   76 (533)
Q Consensus        35 ~~G~---~~~C-~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~   76 (533)
                      ++||   +--| ..|+|.++-|...++|.++.+++.|.|.+||.|+
T Consensus       814 LPGfsGDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  814 LPGFSGDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             cCCccCCccccccccccCccccCCCceEecCCCcceeecccCccCC
Confidence            6788   3334 4799999999999999999999999999999764


No 61 
>smart00051 DSL delta serrate ligand.
Probab=90.12  E-value=0.49  Score=35.12  Aligned_cols=36  Identities=19%  Similarity=0.365  Sum_probs=26.9

Q ss_pred             cCCCCCCCCCC-CCCCCCCeeeeCCCCceecCCCCCCCCCc
Q psy5025          39 SKECLEDKCNK-TPCEHGGKCITTPGSSYCLCPLGFTGDLC   78 (533)
Q Consensus        39 ~~~C~~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~G~~C   78 (533)
                      +..|. ..|.+ +.+.++.+|..   .-.|.|++||+|++|
T Consensus        27 G~~C~-~~C~~~~d~~~~~~Cd~---~G~~~C~~Gw~G~~C   63 (63)
T smart00051       27 GEGCN-KFCRPRDDFFGHYTCDE---NGNKGCLEGWMGPYC   63 (63)
T ss_pred             CCccC-CEeCcCccccCCccCCc---CCCEecCCCCcCCCC
Confidence            77786 56653 45778888843   356899999999988


No 62 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=90.06  E-value=0.21  Score=32.41  Aligned_cols=18  Identities=50%  Similarity=1.196  Sum_probs=15.8

Q ss_pred             CeeeeCCCCceecCCCCC
Q psy5025          56 GKCITTPGSSYCLCPLGF   73 (533)
Q Consensus        56 gtC~~~~~~y~C~C~~g~   73 (533)
                      -.|++..++|+|.|+.||
T Consensus        10 h~C~~~~g~~~C~C~~Gy   27 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGY   27 (36)
T ss_dssp             SEEEEETTSEEEE-STTE
T ss_pred             CCCccCCCceEeECCCCC
Confidence            489999999999999999


No 63 
>KOG3546|consensus
Probab=88.76  E-value=1  Score=47.76  Aligned_cols=127  Identities=17%  Similarity=0.302  Sum_probs=77.2

Q ss_pred             ceeeEEEEEEEeCCCC-eEEEEeccCC-CCCeEEEEE---ECCEEEEE--EEc-CCceE-EEeeCceeecCCCeEEEEEE
Q psy5025         336 GYEFNIDMRIKTSSSS-GLILWTGSSQ-SKDFLALGI---HNGYAHVM--YNL-GNGEV-LLENNQTKINNGVWHRLTVH  406 (533)
Q Consensus       336 ~~~~~i~~~~rt~~~~-g~ll~~~~~~-~~~~l~l~l---~~g~l~~~--~~~-g~~~~-~~~~~~~~~~dg~wH~v~~~  406 (533)
                      -..|.|.|.+|+.... |+||...+.. ..-||-|.|   +||+-.+.  |.. +.+.. ..-.....+..++|.++.+.
T Consensus        86 frdf~~~~~i~p~s~~~gvlfaitd~~q~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a~~  165 (1167)
T KOG3546|consen   86 FRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALS  165 (1167)
T ss_pred             hccceEEEEeeccCCCCceEEEechhhhhhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhheeee
Confidence            3467899999988644 5666654332 234555554   36754433  322 22211 11112356788999999999


Q ss_pred             EeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCccee--eccCCceEEcCCCCcccccc
Q psy5025         407 RNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQ--LNTDSGLYVGGVEDIEELTR  484 (533)
Q Consensus       407 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~--l~~~~~l~lGg~~~~~~~~~  484 (533)
                      .                                .+..+.|+||-+....+......+.  +.....||+|-.-..     
T Consensus       166 v--------------------------------~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~~-----  208 (1167)
T KOG3546|consen  166 V--------------------------------AGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGA-----  208 (1167)
T ss_pred             e--------------------------------cCceEEEEechHHhcccchhccccceeecCCcceEEeccCCC-----
Confidence            9                                9999999999776444332222223  334445888754322     


Q ss_pred             cccccCceeeEEEEEEc
Q psy5025         485 SKYHHSFKGWISNLILS  501 (533)
Q Consensus       485 ~~~~~~f~GCi~~l~i~  501 (533)
                        -...|+|.|..|++.
T Consensus       209 --~~~~f~g~~~~l~v~  223 (1167)
T KOG3546|consen  209 --DPDKFQGVIAELKVR  223 (1167)
T ss_pred             --ChHhhhhhhhheeec
Confidence              234699999999994


No 64 
>PF06439 DUF1080:  Domain of Unknown Function (DUF1080);  InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=87.00  E-value=0.69  Score=42.29  Aligned_cols=101  Identities=18%  Similarity=0.249  Sum_probs=52.7

Q ss_pred             ceeEEEcCCcccccceeEEEEEEEe-c-CCCeeEEEcccc--CCCCCCeEEEEEECCEEEEEEEcCCCceE--------E
Q psy5025          92 SSYLRYQGLGEQYLSLLDLTIVFKA-I-EPNGILLYNGHR--ADGVGDFIALYLNDRYVDFTFDLGTGAAT--------L  159 (533)
Q Consensus        92 ~syl~~~~~~~~~~~~~~ls~~FrT-~-~~dGlLl~~~~~--~~~~~~fl~l~L~~G~l~~~~~~g~~~~~--------v  159 (533)
                      .+++....    ....+.|++.||. . ...|++|.....  .......+.+.|.++.-........+...        .
T Consensus        43 ~~~l~~~~----~~~df~l~~d~k~~~~~~sGi~~r~~~~~~~~~~~~gy~~~i~~~~~~~~~~~~~G~~~~~~~~~~~~  118 (185)
T PF06439_consen   43 GGYLYTDK----KFSDFELEVDFKITPGGNSGIFFRAQSPGDGQDWNNGYEFQIDNSGGGTGLPNSTGSLYDEPPWQLEP  118 (185)
T ss_dssp             S--EEESS----EBSSEEEEEEEEE-TT-EEEEEEEESSECCSSGGGTSEEEEEE-TTTCSTTTTSTTSBTTTB-TCB-S
T ss_pred             cceEEECC----ccccEEEEEEEEECCCCCeEEEEEeccccCCCCcceEEEEEEECCCCccCCCCccceEEEeccccccc
Confidence            45555542    2346888888884 3 344555554401  11124556777765432210011112111        1


Q ss_pred             EeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeC
Q psy5025         160 RSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILS  196 (533)
Q Consensus       160 ~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~  196 (533)
                      ........++||+++|+..+..+++.||+........
T Consensus       119 ~~~~~~~~~~W~~~~I~~~g~~i~v~vnG~~v~~~~d  155 (185)
T PF06439_consen  119 SVNVAIPPGEWNTVRIVVKGNRITVWVNGKPVADFTD  155 (185)
T ss_dssp             SS--S--TTSEEEEEEEEETTEEEEEETTEEEEEEET
T ss_pred             cccccCCCCceEEEEEEEECCEEEEEECCEEEEEEEc
Confidence            1235577899999999999999999999976554443


No 65 
>PF14099 Polysacc_lyase:  Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=86.49  E-value=10  Score=35.89  Aligned_cols=67  Identities=13%  Similarity=0.193  Sum_probs=45.4

Q ss_pred             CCeEEEEEECCEEEEEEEcCC----ceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhcccc
Q psy5025         363 KDFLALGIHNGYAHVMYNLGN----GEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLH  438 (533)
Q Consensus       363 ~~~l~l~l~~g~l~~~~~~g~----~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  438 (533)
                      .-.++|.+.++.+.+.+....    ...........+.-|+||++.+...                      +-..    
T Consensus       113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~----------------------~s~~----  166 (224)
T PF14099_consen  113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVK----------------------WSPD----  166 (224)
T ss_dssp             EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEE----------------------E-CC----
T ss_pred             CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEE----------------------ECCC----
Confidence            457889999999999987655    1222223456688899999998871                      1111    


Q ss_pred             ccCcEEEEEECCceeeeE
Q psy5025         439 RTGKQCLIRIDTDYSMQS  456 (533)
Q Consensus       439 ~~~~~~~l~vD~~~~~~~  456 (533)
                       ....+.|.+||+.+...
T Consensus       167 -~~G~~~vw~nG~~v~~~  183 (224)
T PF14099_consen  167 -SDGFLEVWLNGKLVVDY  183 (224)
T ss_dssp             -CTEEEEEEECCEECCEE
T ss_pred             -CCEEEEEEECCEEEEEE
Confidence             45789999999876554


No 66 
>KOG3546|consensus
Probab=85.99  E-value=2.7  Score=44.71  Aligned_cols=129  Identities=18%  Similarity=0.251  Sum_probs=78.6

Q ss_pred             ceeEEEEEEEecC-CCeeEEEccccCCCCCCeEEEEE---ECCEEEEEE--Ec-CCCceE--EEeeccccCCCcEEEEEE
Q psy5025         106 SLLDLTIVFKAIE-PNGILLYNGHRADGVGDFIALYL---NDRYVDFTF--DL-GTGAAT--LRSSNPISLGEWHKLRLT  176 (533)
Q Consensus       106 ~~~~ls~~FrT~~-~dGlLl~~~~~~~~~~~fl~l~L---~~G~l~~~~--~~-g~~~~~--v~s~~~lnDg~Wh~V~v~  176 (533)
                      +.+.|.|-.|... .-|+||...+... .--|+-|.|   +||+-.+.+  .. +++...  ......+--++|.++.++
T Consensus        87 rdf~~~~~i~p~s~~~gvlfaitd~~q-~~i~lg~~lsgv~dghq~i~l~ytepg~~~s~~aa~f~~p~~~~~w~~~a~~  165 (1167)
T KOG3546|consen   87 RDFSLLFHIRPATEGPGVLFAITDSAQ-AMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALS  165 (1167)
T ss_pred             ccceEEEEeeccCCCCceEEEechhhh-hhheeeeeeeccccCcceeEEEeccCCCCccchhheeccchhhchhhheeee
Confidence            4566777777654 5566666654421 123444444   467544433  32 222211  123456678999999999


Q ss_pred             EECCEEEEEEcCccCceeeCCCCceeee--cCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcEE
Q psy5025         177 RTGRHAYLQVDRFPSSQILSPGPFTQLS--LSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNHE  243 (533)
Q Consensus       177 ~~~~~~~L~VD~~~~~~~~~~~~~~~l~--~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~  243 (533)
                      ..+..+.|.||-+.-+..........+.  ....||++..-...        ...|.|-|..+++...+
T Consensus       166 v~g~~v~l~v~cee~~r~p~~rss~~l~~e~~ag~f~~~ag~~~--------~~~f~g~~~~l~v~~dp  226 (1167)
T KOG3546|consen  166 VAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGAD--------PDKFQGVIAELKVRRDP  226 (1167)
T ss_pred             ecCceEEEEechHHhcccchhccccceeecCCcceEEeccCCCC--------hHhhhhhhhheeecCCC
Confidence            9999999999986644433333333333  34568988665443        26799999999886543


No 67 
>KOG1225|consensus
Probab=84.70  E-value=0.67  Score=49.24  Aligned_cols=40  Identities=33%  Similarity=0.844  Sum_probs=23.7

Q ss_pred             ceee-cCCCCCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCcc
Q psy5025          35 IKGR-SKECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQ   79 (533)
Q Consensus        35 ~~G~-~~~C~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce   79 (533)
                      ..|| +..|.+-.|... |..++.|++.    .|.|+++|.|+.|+
T Consensus       270 ~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs  310 (525)
T KOG1225|consen  270 PPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPGYSGKDCS  310 (525)
T ss_pred             CCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCCccccccc
Confidence            5666 666665555554 6555665554    56666666666665


No 68 
>KOG1217|consensus
Probab=84.24  E-value=0.84  Score=48.36  Aligned_cols=35  Identities=51%  Similarity=1.235  Sum_probs=32.6

Q ss_pred             CCCCCCCC-CCCCCeeeeCCCCceecCCCCCCCCCc
Q psy5025          44 EDKCNKTP-CEHGGKCITTPGSSYCLCPLGFTGDLC   78 (533)
Q Consensus        44 ~d~C~~~p-C~NggtC~~~~~~y~C~C~~g~~G~~C   78 (533)
                      .+.|...+ |.|+++|++..+.|.|.|+++|+|..|
T Consensus       271 ~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~  306 (487)
T KOG1217|consen  271 VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC  306 (487)
T ss_pred             ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC
Confidence            78899874 999999999999999999999999998


No 69 
>KOG1217|consensus
Probab=83.56  E-value=0.94  Score=47.97  Aligned_cols=47  Identities=43%  Similarity=1.071  Sum_probs=39.6

Q ss_pred             ceee-cCCCCC--CCCC--CCCCCCCCeeeeCCCCceecCCCCCCCCCcccc
Q psy5025          35 IKGR-SKECLE--DKCN--KTPCEHGGKCITTPGSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        35 ~~G~-~~~C~~--d~C~--~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~   81 (533)
                      ..|| +..|..  +.|.  .++|.|+++|.+...+|.|.|+.+|.|..|+..
T Consensus       157 ~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~  208 (487)
T KOG1217|consen  157 TEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT  208 (487)
T ss_pred             CCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC
Confidence            4566 666663  5898  567999999999999999999999999999875


No 70 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=82.40  E-value=1.1  Score=42.52  Aligned_cols=35  Identities=40%  Similarity=0.923  Sum_probs=27.2

Q ss_pred             cCCCC-CCCCCC--CCCCCCCeeeeCCCCceecCCCCCCC
Q psy5025          39 SKECL-EDKCNK--TPCEHGGKCITTPGSSYCLCPLGFTG   75 (533)
Q Consensus        39 ~~~C~-~d~C~~--~pC~NggtC~~~~~~y~C~C~~g~~G   75 (533)
                      ..-|. .++|..  ++|.  ..|.+..++|.|.|+.||+.
T Consensus       181 ~~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         181 GKICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             cccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccC
Confidence            45675 577853  4565  58999999999999999964


No 71 
>KOG1225|consensus
Probab=81.04  E-value=1.3  Score=47.04  Aligned_cols=41  Identities=37%  Similarity=0.875  Sum_probs=34.2

Q ss_pred             ceee-cCCCCCCCCCCCCCCCCCeeeeCCCCceecCCCCCCCCCccc
Q psy5025          35 IKGR-SKECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQT   80 (533)
Q Consensus        35 ~~G~-~~~C~~d~C~~~pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~   80 (533)
                      ..|| ++.|+.-.|..+ |.+.|.|++.    +|+|++||+|++|+.
T Consensus       239 ~~~~~g~~c~~~~C~~~-c~~~g~c~~G----~CIC~~Gf~G~dC~e  280 (525)
T KOG1225|consen  239 PEGYFGPLCSTIYCPGG-CTGRGQCVEG----RCICPPGFTGDDCDE  280 (525)
T ss_pred             CCceeCCccccccCCCC-CcccceEeCC----eEeCCCCCcCCCCCc
Confidence            6677 888886677764 7677889887    899999999999998


No 72 
>KOG4260|consensus
Probab=79.58  E-value=1.3  Score=42.29  Aligned_cols=59  Identities=29%  Similarity=0.655  Sum_probs=42.8

Q ss_pred             ceeEEEEcCCCC--cee-ecCCCCC-CCCCCCCCCCCCeeeeCC---CCceecCCCCCCCCCcccc
Q psy5025          23 WDAWIQLNDGNK--IKG-RSKECLE-DKCNKTPCEHGGKCITTP---GSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        23 ~~~~l~vd~~~~--~~G-~~~~C~~-d~C~~~pC~NggtC~~~~---~~y~C~C~~g~~G~~Ce~~   81 (533)
                      ...||-||+.-.  +.| |++.|.. ..=+..||.-.|.|.-..   ++-.|.|.+||+|+.|.+-
T Consensus       119 l~~WlCvdqLkvCCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~C  184 (350)
T KOG4260|consen  119 LFNWLCVDQLKVCCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYC  184 (350)
T ss_pred             HHhHhhhhhheeccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccccc
Confidence            345666666554  444 4888873 111356899889999775   7889999999999999875


No 73 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=78.52  E-value=1.6  Score=30.34  Aligned_cols=24  Identities=29%  Similarity=0.697  Sum_probs=19.3

Q ss_pred             CeeeeCCCCceecCCCCCCCCCcccc
Q psy5025          56 GKCITTPGSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        56 gtC~~~~~~y~C~C~~g~~G~~Ce~~   81 (533)
                      .+|..  ....|.|+.+|+|+.|+.-
T Consensus        11 ~~C~~--~~G~C~C~~~~~G~~C~~C   34 (49)
T PF00053_consen   11 QTCDP--STGQCVCKPGTTGPRCDQC   34 (49)
T ss_dssp             SSEEE--TCEEESBSTTEESTTS-EE
T ss_pred             CcccC--CCCEEeccccccCCcCcCC
Confidence            47766  6779999999999999974


No 74 
>KOG1214|consensus
Probab=76.94  E-value=1.9  Score=47.43  Aligned_cols=31  Identities=16%  Similarity=0.550  Sum_probs=28.8

Q ss_pred             CCCCCCCC-CCCeeeecCCceEEecCCCCCcC
Q psy5025         261 HPCITQPC-GSGSCVPLYDGYKCVCTQSCKES  291 (533)
Q Consensus       261 ~~C~~~pC-ngg~C~~~~~~~~C~C~~g~~G~  291 (533)
                      +.|.++-| ..+.|+...++|.|.|.+||.|+
T Consensus       828 DeC~psrChp~A~CyntpgsfsC~C~pGy~GD  859 (1289)
T KOG1214|consen  828 DECSPSRCHPAATCYNTPGSFSCRCQPGYYGD  859 (1289)
T ss_pred             cccCccccCCCceEecCCCcceeecccCccCC
Confidence            57999999 89999999999999999999993


No 75 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=76.87  E-value=1.9  Score=27.96  Aligned_cols=20  Identities=30%  Similarity=0.660  Sum_probs=16.0

Q ss_pred             CeeeecCCceEEecCCCCCc
Q psy5025         271 GSCVPLYDGYKCVCTQSCKE  290 (533)
Q Consensus       271 g~C~~~~~~~~C~C~~g~~G  290 (533)
                      -.|++...+|.|.|++||.-
T Consensus        10 h~C~~~~g~~~C~C~~Gy~L   29 (36)
T PF14670_consen   10 HICVNTPGSYRCSCPPGYKL   29 (36)
T ss_dssp             SEEEEETTSEEEE-STTEEE
T ss_pred             CCCccCCCceEeECCCCCEE
Confidence            36888889999999999854


No 76 
>KOG1836|consensus
Probab=76.71  E-value=2.8  Score=50.95  Aligned_cols=105  Identities=18%  Similarity=0.250  Sum_probs=68.7

Q ss_pred             CCeEEEEEECCEEEEEEEcCCC--ceEEEeeccccCCCcEEEEEEEECCEEEEEEcCccCceeeCCCCceeeecCCCEEE
Q psy5025         134 GDFIALYLNDRYVDFTFDLGTG--AATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYL  211 (533)
Q Consensus       134 ~~fl~l~L~~G~l~~~~~~g~~--~~~v~s~~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~~~~~~~~~~~l~~~~~l~v  211 (533)
                      ..+..+....+.....+..+-.  ...+.....+-++.||.+...+....+.+..|... .   .+......+...++++
T Consensus      1581 ~~~~~~~~~~~~~~~~~~~gi~~~~~s~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~-~---~~~~~~~~~~~~p~~~ 1656 (1705)
T KOG1836|consen 1581 GTELLVQENPIGVTEKFESGITDLSTSSTPIVSLLPGGCHSVTSSTDPGVVQLEDDTYT-V---GEIPPPPADTQEPIKL 1656 (1705)
T ss_pred             chHHhhhhcccccchhhhhhhhhhhhcchhhhhhcCCcceeeeeecCCcccccccccee-c---ccCCCCchhccCCccc
Confidence            4555555555555555444322  22333446778999999999999998888888721 1   1222333445578999


Q ss_pred             cCcCCCCccCCCcccccCceeeeeeEEECcEEcc
Q psy5025         212 GGVPDYNIVSPKVKIKSSFIGCIQKVLINNHELN  245 (533)
Q Consensus       212 GG~p~~~~~~~~~~~~~~F~GCI~~l~in~~~l~  245 (533)
                      ||++..... ........|.|||  +++++.+.+
T Consensus      1657 ~~~~~s~~~-~~~~~~~~~~~~~--~~~~~~~~~ 1687 (1705)
T KOG1836|consen 1657 GGYPSSLTT-LRIAVLKSFTGCI--FVVMGIRVD 1687 (1705)
T ss_pred             CCccccccc-eeeecccccccce--EEecCCCCc
Confidence            999987653 2334457899999  888888776


No 77 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=75.31  E-value=10  Score=31.43  Aligned_cols=32  Identities=31%  Similarity=0.707  Sum_probs=25.4

Q ss_pred             CCCCCC-CCCCCCCeeeeCCCCceecCCCCCCCC
Q psy5025          44 EDKCNK-TPCEHGGKCITTPGSSYCLCPLGFTGD   76 (533)
Q Consensus        44 ~d~C~~-~pC~NggtC~~~~~~y~C~C~~g~~G~   76 (533)
                      .|.|+. ..|...|.|.. .....|.|.+||.-+
T Consensus        77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~  109 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK  109 (110)
T ss_pred             ccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence            578874 79999999954 366789999999643


No 78 
>KOG0994|consensus
Probab=68.34  E-value=16  Score=42.17  Aligned_cols=136  Identities=18%  Similarity=0.311  Sum_probs=77.1

Q ss_pred             CCCCCCCCCCCCeeeeC--CCCceecCCCCCCCCCcccccc---------------------------------cccccc
Q psy5025          45 DKCNKTPCEHGGKCITT--PGSSYCLCPLGFTGDLCQTRLD---------------------------------LQVPQF   89 (533)
Q Consensus        45 d~C~~~pC~NggtC~~~--~~~y~C~C~~g~~G~~Ce~~~~---------------------------------~~~~~F   89 (533)
                      +.|.+-.|.-||.=-+.  ..+-.|.|..++.|++|++.-.                                 ...++|
T Consensus       508 ~gc~~cdcd~GGs~d~sc~~~sGqC~CRe~~~GR~c~~~~~~yy~~~l~h~i~eAe~~~~~~~~~v~~r~~~~~~~~~sf  587 (1758)
T KOG0994|consen  508 EGCRPCDCDQGGSYDNSCDLHSGQCECREHMLGRRCEQVCPGYYSPVLDHYIYEAEDAGTGVEVNVKERKVLKSTKLPSF  587 (1758)
T ss_pred             CCCcccccCCCCCCCcccccccCccccccccccccccccCCcccccccchhhhhhhhccccceeeeeeeeecccCCCccc
Confidence            44666666666432222  2567899999999999997521                                 014578


Q ss_pred             CCceeEEEcCCcc--------cccceeEEEEEEEecCC----CeeEEEccccCCCCCCeEEEEEECCEEEEEEEcCCCce
Q psy5025          90 NGSSYLRYQGLGE--------QYLSLLDLTIVFKAIEP----NGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAA  157 (533)
Q Consensus        90 ~g~syl~~~~~~~--------~~~~~~~ls~~FrT~~~----dGlLl~~~~~~~~~~~fl~l~L~~G~l~~~~~~g~~~~  157 (533)
                      .|.+|++.+-...        +.+..+.+-+.+..+.+    +..|+.......+.+.--.+.+.+.++.+.+..++.-+
T Consensus       588 tG~gf~r~~e~~~l~f~~~~ip~sm~Ydv~ir~~~~~~~~wen~~itvqrp~~p~~g~c~~~~~~dd~~~~sl~p~sRyv  667 (1758)
T KOG0994|consen  588 TGKGFVRVPEGTTLEFTVPIIPPSMEYDVLIRYDPRTPKLWENAKITVQRPGQPSLGRCGMAIPKDDRIPFSLPPGSRYV  667 (1758)
T ss_pred             cccceeecCCCceeeeecCCCCcccccchheeccCCCcchhhhheEEeecCCCCcccccccccccccccccccCCCceee
Confidence            8888887764311        11123344444443332    22233322221222333444556677777777665444


Q ss_pred             EEEeeccccCCCcEEEEEEEECC
Q psy5025         158 TLRSSNPISLGEWHKLRLTRTGR  180 (533)
Q Consensus       158 ~v~s~~~lnDg~Wh~V~v~~~~~  180 (533)
                      .+..+.-+..|.-+.++|....+
T Consensus       668 v~~~~vClE~G~~Yklri~~~~~  690 (1758)
T KOG0994|consen  668 VAPNPVCLEAGKVYKLRIYFERK  690 (1758)
T ss_pred             ecCCchhhccCcceEEEEEeccc
Confidence            44455678889999888887653


No 79 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=64.81  E-value=6  Score=27.68  Aligned_cols=22  Identities=32%  Similarity=0.708  Sum_probs=17.6

Q ss_pred             eeeCCCCceecCCCCCCCCCcccc
Q psy5025          58 CITTPGSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        58 C~~~~~~y~C~C~~g~~G~~Ce~~   81 (533)
                      |..  ..-.|.|+.+++|+.|+.-
T Consensus        14 C~~--~~G~C~C~~~~~G~~C~~C   35 (50)
T cd00055          14 CDP--GTGQCECKPNTTGRRCDRC   35 (50)
T ss_pred             ccC--CCCEEeCCCcCCCCCCCCC
Confidence            544  4567999999999999964


No 80 
>PF09064 Tme5_EGF_like:  Thrombomodulin like fifth domain, EGF-like;  InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=62.79  E-value=6.6  Score=24.93  Aligned_cols=13  Identities=15%  Similarity=0.419  Sum_probs=9.9

Q ss_pred             CceEEecCCCCCc
Q psy5025         278 DGYKCVCTQSCKE  290 (533)
Q Consensus       278 ~~~~C~C~~g~~G  290 (533)
                      +...|.||.||--
T Consensus        16 ~~~~C~CPeGyIl   28 (34)
T PF09064_consen   16 SPGQCFCPEGYIL   28 (34)
T ss_pred             CCCceeCCCceEe
Confidence            4568999999853


No 81 
>PF02057 Glyco_hydro_59:  Glycosyl hydrolase family 59;  InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=59.42  E-value=1.9e+02  Score=32.10  Aligned_cols=60  Identities=22%  Similarity=0.324  Sum_probs=39.1

Q ss_pred             CCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCc
Q psy5025         363 KDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGK  442 (533)
Q Consensus       363 ~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  442 (533)
                      +-||.| ..+|.-.|.-++.+.. .+......+.-++||+|++..                                .+.
T Consensus       578 G~~f~v-~~~G~w~vt~d~~~~~-~l~~G~~~~~~~~WhtltL~~--------------------------------~g~  623 (669)
T PF02057_consen  578 GYFFWV-YANGTWSVTSDLAGTT-TLASGTADIGAGKWHTLTLTI--------------------------------SGS  623 (669)
T ss_dssp             EEEEEE-ETTTEEEEEEETTS-S-EEEEEE-S--TT-EEEEEEEE--------------------------------ETT
T ss_pred             eEEEEE-EcCCcEEEeccCCCcE-EEeeeeecccCCeEEEEEEEE--------------------------------ECC
Confidence            345544 5588877776665432 332233458889999999999                                889


Q ss_pred             EEEEEECCceeeeE
Q psy5025         443 QCLIRIDTDYSMQS  456 (533)
Q Consensus       443 ~~~l~vD~~~~~~~  456 (533)
                      .+.-.+|+......
T Consensus       624 ~~ta~lng~~l~~~  637 (669)
T PF02057_consen  624 TATAMLNGTVLWTD  637 (669)
T ss_dssp             EEEEEETTEEEEEE
T ss_pred             EEEEEECCEEeEEe
Confidence            99999999877653


No 82 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=59.17  E-value=7  Score=31.95  Aligned_cols=37  Identities=27%  Similarity=0.718  Sum_probs=28.5

Q ss_pred             CCCC--CCCCCCCCeeeeCC-----CCceecCCC-------------CCCCCCcccc
Q psy5025          45 DKCN--KTPCEHGGKCITTP-----GSSYCLCPL-------------GFTGDLCQTR   81 (533)
Q Consensus        45 d~C~--~~pC~NggtC~~~~-----~~y~C~C~~-------------g~~G~~Ce~~   81 (533)
                      +.|.  .+-|...|.|++..     +=|.|.|.+             .|.|+.|++.
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~~~~ktt~W~G~aCqKk   62 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGSGKGKTTHWGGPACQKK   62 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeeccccccccccCceeeecccccccc
Confidence            3454  46788899999984     559999987             5778888875


No 83 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=56.15  E-value=9.7  Score=36.08  Aligned_cols=29  Identities=28%  Similarity=0.671  Sum_probs=22.5

Q ss_pred             CCCCCC--CCCCCeeeecCCceEEecCCCCCc
Q psy5025         261 HPCITQ--PCGSGSCVPLYDGYKCVCTQSCKE  290 (533)
Q Consensus       261 ~~C~~~--pCngg~C~~~~~~~~C~C~~g~~G  290 (533)
                      +.|...  +| ...|.+...+|.|.|+.||..
T Consensus       188 ~~C~~~~~~c-~~~C~~~~g~~~c~c~~g~~~  218 (224)
T cd01475         188 DLCATLSHVC-QQVCISTPGSYLCACTEGYAL  218 (224)
T ss_pred             hhhcCCCCCc-cceEEcCCCCEEeECCCCccC
Confidence            456543  45 347999999999999999975


No 84 
>PF14099 Polysacc_lyase:  Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=54.10  E-value=53  Score=30.86  Aligned_cols=59  Identities=12%  Similarity=0.122  Sum_probs=41.0

Q ss_pred             CCeEEEEEECCEEEEEEEcCC-C----ceEEEeeccccCCCcEEEEEEEE-----CCEEEEEEcCccCc
Q psy5025         134 GDFIALYLNDRYVDFTFDLGT-G----AATLRSSNPISLGEWHKLRLTRT-----GRHAYLQVDRFPSS  192 (533)
Q Consensus       134 ~~fl~l~L~~G~l~~~~~~g~-~----~~~v~s~~~lnDg~Wh~V~v~~~-----~~~~~L~VD~~~~~  192 (533)
                      ...++|.+.++++.+.+.... .    .........+.-|+||+|.+...     ...+.+.+|+....
T Consensus       113 ~P~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~  181 (224)
T PF14099_consen  113 SPPFALRIKGGRLYLRVRGDEPSDSGNKAYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVV  181 (224)
T ss_dssp             EECEEEEEETTEEEEEEEEE-TCEEEEEEEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECC
T ss_pred             CCcEEEEEeCCEEEEEEEcCCCCcccceeEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEE
Confidence            668999999999999988765 1    11222345677799999887643     35789999996543


No 85 
>PF09264 Sial-lect-inser:  Vibrio cholerae sialidase, lectin insertion;  InterPro: IPR015344 This domain is predominantly found in Vibrio cholerae sialidase, and adopt a beta sandwich structure consisting of 12-14 strands arranged in two beta-sheets. It binds to lectins with high affinity helping to target the protein to sialic acid-rich environments, thereby enhancing the catalytic efficiency of the enzyme []. ; PDB: 1W0P_A 1W0O_A 1KIT_A 2W68_B.
Probab=53.01  E-value=98  Score=28.23  Aligned_cols=83  Identities=11%  Similarity=0.134  Sum_probs=48.0

Q ss_pred             eEEEEEEEeCCCCeEEEEeccCCCCCeEEE-EEE-CCEEEEEEEcCCceEEEeeCceeecC-CCeEEEEEEEeeeeeecC
Q psy5025         339 FNIDMRIKTSSSSGLILWTGSSQSKDFLAL-GIH-NGYAHVMYNLGNGEVLLENNQTKINN-GVWHRLTVHRNFMVVMHG  415 (533)
Q Consensus       339 ~~i~~~~rt~~~~g~ll~~~~~~~~~~l~l-~l~-~g~l~~~~~~g~~~~~~~~~~~~~~d-g~wH~v~~~~~~~~~~~~  415 (533)
                      ++++-..|-.+....-.|.++.. ..||.. .|. +|.|++.++-.++...+.   ....| ...|..++....      
T Consensus        34 W~ls~~~RV~~G~~n~~yyAnG~-~r~l~~lsvn~sG~LvA~L~g~ss~~~~~---~~~~di~gyH~Y~i~~~p------  103 (198)
T PF09264_consen   34 WSLSWESRVVSGGCNTNYYANGS-KRYLPILSVNESGSLVAELEGQSSNTLLA---TTGADIHGYHKYEIVFSP------  103 (198)
T ss_dssp             EEEEEEEEEEEES-EEEEEEESS-EEEEEEEEE-TTS-EEEEETTS-S-EEEE----CHHHHCSEEEEEEEEET------
T ss_pred             cceeeeEEEecCcceeEEEcCCc-eEEEEEEEEcCCCCEEEEEecCCCcEEEe---cccccccceeEEEEEecC------
Confidence            44444444444333333333332 456644 555 788999987766665554   22223 678999998822      


Q ss_pred             CcccccCCcchhhhhhhhhccccccCcEEEEEECCceeee
Q psy5025         416 SSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQ  455 (533)
Q Consensus       416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~  455 (533)
                                              .....+++|||..+.+
T Consensus       104 ------------------------~~~tASfy~DG~lI~t  119 (198)
T PF09264_consen  104 ------------------------LTNTASFYFDGTLIAT  119 (198)
T ss_dssp             ------------------------TTTEEEEEETTEEEEE
T ss_pred             ------------------------CCCceEEEECCEEEee
Confidence                                    4478999999998876


No 86 
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind.  This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers.  Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=52.41  E-value=2e+02  Score=26.90  Aligned_cols=10  Identities=20%  Similarity=0.285  Sum_probs=9.7

Q ss_pred             CCeEEEEEEE
Q psy5025         398 GVWHRLTVHR  407 (533)
Q Consensus       398 g~wH~v~~~~  407 (533)
                      |+||++.|..
T Consensus       154 g~~~~v~I~Y  163 (223)
T cd01951         154 GNEHTVRITY  163 (223)
T ss_pred             CCEEEEEEEE
Confidence            9999999999


No 87 
>KOG4260|consensus
Probab=51.45  E-value=7.7  Score=37.16  Aligned_cols=35  Identities=29%  Similarity=0.703  Sum_probs=29.8

Q ss_pred             CCC-CCCCC--CCCCCCCCeeeeCCCCceecCCCCCCC
Q psy5025          41 ECL-EDKCN--KTPCEHGGKCITTPGSSYCLCPLGFTG   75 (533)
Q Consensus        41 ~C~-~d~C~--~~pC~NggtC~~~~~~y~C~C~~g~~G   75 (533)
                      +|- +|+|.  ++||....-|++..++|.|.+..||.+
T Consensus       232 gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~  269 (350)
T KOG4260|consen  232 GCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK  269 (350)
T ss_pred             ccccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence            444 57775  789999999999999999999999965


No 88 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=49.92  E-value=15  Score=30.38  Aligned_cols=29  Identities=24%  Similarity=0.541  Sum_probs=22.0

Q ss_pred             CCCC-CCCC-CCCeeeecCCceEEecCCCCCc
Q psy5025         261 HPCI-TQPC-GSGSCVPLYDGYKCVCTQSCKE  290 (533)
Q Consensus       261 ~~C~-~~pC-ngg~C~~~~~~~~C~C~~g~~G  290 (533)
                      +.|. ...| ..|.|... ....|.|.+||..
T Consensus        78 d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P  108 (110)
T PF00954_consen   78 DQCDVYGFCGPNGICNSN-NSPKCSCLPGFEP  108 (110)
T ss_pred             cCCCCccccCCccEeCCC-CCCceECCCCcCC
Confidence            3566 4789 88999543 5667999999964


No 89 
>cd00413 Glyco_hydrolase_16 glycosyl hydrolase family 16. The O-Glycosyl hydrolases are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A glycosyl hydrolase classification system based on sequence similarity has led to the definition of more than 95 different families inlcuding glycosyl hydrolase family 16. Family 16 includes lichenase, xyloglucan endotransglycosylase (XET), beta-agarase, kappa-carrageenase, endo-beta-1,3-glucanase, endo-beta-1,3-1,4-glucanase, and endo-beta-galactosidase, all of which have a conserved jelly roll fold with a deep active site channel harboring the catalytic residues.
Probab=49.44  E-value=2.1e+02  Score=26.26  Aligned_cols=30  Identities=13%  Similarity=0.062  Sum_probs=26.5

Q ss_pred             cCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEe
Q psy5025         396 NNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSI  457 (533)
Q Consensus       396 ~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~  457 (533)
                      ..+.||...+.+                                ....++.+|||..+....
T Consensus       140 ~~~~~H~Y~~~W--------------------------------~~~~i~~yvDG~~~~~~~  169 (210)
T cd00413         140 PADDFHTYRVDW--------------------------------TPGEITFYVDGVLVATIT  169 (210)
T ss_pred             CccCeEEEEEEE--------------------------------eCCEEEEEECCEEEEEEC
Confidence            578999999999                                889999999999887753


No 90 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=48.30  E-value=16  Score=23.86  Aligned_cols=28  Identities=25%  Similarity=0.691  Sum_probs=20.5

Q ss_pred             CCCCCC-CCCeeeecC-CceEEecCCCCCc
Q psy5025         263 CITQPC-GSGSCVPLY-DGYKCVCTQSCKE  290 (533)
Q Consensus       263 C~~~pC-ngg~C~~~~-~~~~C~C~~g~~G  290 (533)
                      |...+| -++.|.... ..+.|.|-+||..
T Consensus         2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~   31 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDGSEECRCLLGYKK   31 (37)
T ss_dssp             -SSS---TTEEEEEETTSEEEEEE-TTEEE
T ss_pred             ccCccCCCCcccEEcCCCCEEEEeeCCccc
Confidence            677789 889999887 6799999999864


No 91 
>PF04863 EGF_alliinase:  Alliinase EGF-like domain;  InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=47.71  E-value=6.6  Score=27.84  Aligned_cols=32  Identities=28%  Similarity=0.647  Sum_probs=16.4

Q ss_pred             CCCCCCeeeeC----CCCceecCCCCCCCCCccccc
Q psy5025          51 PCEHGGKCITT----PGSSYCLCPLGFTGDLCQTRL   82 (533)
Q Consensus        51 pC~NggtC~~~----~~~y~C~C~~g~~G~~Ce~~~   82 (533)
                      +|.-.|...-.    .+...|.|..-|.|++|++.+
T Consensus        18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~   53 (56)
T PF04863_consen   18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI   53 (56)
T ss_dssp             --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred             CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence            45444544422    255899999999999999864


No 92 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=47.69  E-value=13  Score=25.54  Aligned_cols=18  Identities=28%  Similarity=0.641  Sum_probs=15.6

Q ss_pred             CceecCCCCCCCCCcccc
Q psy5025          64 SSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        64 ~y~C~C~~g~~G~~Ce~~   81 (533)
                      .-.|.|+++++|+.|+.-
T Consensus        17 ~G~C~C~~~~~G~~C~~C   34 (46)
T smart00180       17 TGQCECKPNVTGRRCDRC   34 (46)
T ss_pred             CCEEECCCCCCCCCCCcC
Confidence            458999999999999964


No 93 
>PF07622 DUF1583:  Protein of unknown function (DUF1583);  InterPro: IPR011475  Most of the Rhodopirellula baltica hypothetical proteins that have this domain also match PF07619 from PFAM. 
Probab=43.71  E-value=28  Score=35.44  Aligned_cols=31  Identities=23%  Similarity=0.308  Sum_probs=28.4

Q ss_pred             CCccccCCCceEEEEEEeCceeEEEEcCCCC
Q psy5025           4 SDNQVILDDWNTVRVFRHRWDAWIQLNDGNK   34 (533)
Q Consensus         4 s~~~v~d~~WH~v~~~r~~~~~~l~vd~~~~   34 (533)
                      ....+.++.|.+|.++|.+.++.+.||++..
T Consensus        83 ~~~~l~~~~wN~v~l~~~g~~v~l~LN~~~i  113 (399)
T PF07622_consen   83 PTLPLKVNAWNRVRLQRRGDKVQLHLNGQLI  113 (399)
T ss_pred             CCCCCCccccceEEEEEeCCEEEEEeCCcee
Confidence            3568899999999999999999999999887


No 94 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=39.95  E-value=38  Score=23.64  Aligned_cols=26  Identities=31%  Similarity=1.008  Sum_probs=18.5

Q ss_pred             CCCC-CCCCCCCCeeeeCCCCceecCCCCCC
Q psy5025          45 DKCN-KTPCEHGGKCITTPGSSYCLCPLGFT   74 (533)
Q Consensus        45 d~C~-~~pC~NggtC~~~~~~y~C~C~~g~~   74 (533)
                      +.|. ...|..+..|++.    +|.|+.+|.
T Consensus        20 ~~C~~~~qC~~~s~C~~g----~C~C~~g~~   46 (52)
T PF01683_consen   20 ESCESDEQCIGGSVCVNG----RCQCPPGYV   46 (52)
T ss_pred             CCCCCcCCCCCcCEEcCC----EeECCCCCE
Confidence            3354 3467778899664    899999883


No 95 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=37.27  E-value=29  Score=28.46  Aligned_cols=22  Identities=45%  Similarity=1.159  Sum_probs=18.0

Q ss_pred             CCC-CCCeeeecC-----CceEEecCCC
Q psy5025         266 QPC-GSGSCVPLY-----DGYKCVCTQS  287 (533)
Q Consensus       266 ~pC-ngg~C~~~~-----~~~~C~C~~g  287 (533)
                      +-| ++|.|+...     +=|.|.|.+.
T Consensus        13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T   40 (103)
T PF12955_consen   13 NNCSGHGSCVKKYGSGGGDCFACKCKPT   40 (103)
T ss_pred             cCCCCCceEeeccCCCccceEEEEeecc
Confidence            679 889999985     3499999983


No 96 
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor.  Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids.  Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family.  Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins.  PHA agglutinates most mammalian red blood cell types by bindin
Probab=36.27  E-value=3.9e+02  Score=25.44  Aligned_cols=13  Identities=8%  Similarity=0.043  Sum_probs=11.9

Q ss_pred             ecCCCeEEEEEEE
Q psy5025         395 INNGVWHRLTVHR  407 (533)
Q Consensus       395 ~~dg~wH~v~~~~  407 (533)
                      +.||++|++.|..
T Consensus       160 l~~g~~~~v~I~Y  172 (236)
T cd06899         160 LKSGKPMQAWIDY  172 (236)
T ss_pred             ccCCCeEEEEEEE
Confidence            5799999999999


No 97 
>PF07953 Toxin_R_bind_N:  Clostridium neurotoxin, N-terminal receptor binding;  InterPro: IPR012928 The Clostridium neurotoxin family is composed of tetanus neurotoxin and seven serotypes of botulinum neurotoxin. The structure of the botulinum neurotoxin reveals a four domain protein. The N-terminal catalytic domain (IPR000395 from INTERPRO), the central translocation domain and two receptor binding domains []. This domain is the N-terminal receptor binding domain, which is comprised of two seven-stranded beta-sheets sandwiched together to form a jelly role motif []. The role of this domain in receptor binding appears to be indirect. ; GO: 0004222 metalloendopeptidase activity, 0050827 toxin receptor binding, 0009405 pathogenesis, 0051609 inhibition of neurotransmitter uptake, 0005576 extracellular region; PDB: 3RSJ_B 3FUQ_A 3FFZ_B 1G9B_A 1S0F_A 1Z0H_B 1F31_A 1G9D_A 1S0C_A 1S0D_A ....
Probab=35.38  E-value=3.5e+02  Score=24.75  Aligned_cols=88  Identities=15%  Similarity=0.136  Sum_probs=56.4

Q ss_pred             ceeeEEEEEEEeCCCC---------eEEEEeccCCCCCeEEEEEECCEEEEEEEcCCc-eEEEee--Cc-eeecC--CCe
Q psy5025         336 GYEFNIDMRIKTSSSS---------GLILWTGSSQSKDFLALGIHNGYAHVMYNLGNG-EVLLEN--NQ-TKINN--GVW  400 (533)
Q Consensus       336 ~~~~~i~~~~rt~~~~---------g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~-~~~~~~--~~-~~~~d--g~w  400 (533)
                      ...|+|+|++|..+..         .+|=....   ..=-.++++++.+...+.-.++ ...+.+  +. ..++|  .+|
T Consensus        54 ~~nFSIsFWlRipk~~~~~~~~neytII~~~~n---NsGWkI~l~~n~iiwtl~D~ng~~k~i~f~y~~~~~~SdyiNkW  130 (195)
T PF07953_consen   54 YNNFSISFWLRIPKYDNNINLNNEYTIINCMKN---NSGWKISLNNNGIIWTLIDSNGNEKSIYFNYSIMDNISDYINKW  130 (195)
T ss_dssp             SSEEEEEEEEEEECHHCCHHTTSEEEEEEEEET---TEEEEEEEETTEEEEEEEETTSEEEEEEEESSSTSSTTSSTTSE
T ss_pred             ccceEEEEEEEccCcccccccCcceEEEEeecC---CCceEEEEeCCcEEEEEEeCCCCEEEEEEEcccccchhhhcccE
Confidence            3468999999997522         23333321   2445788999999988765444 333333  11 22333  799


Q ss_pred             EEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEe
Q psy5025         401 HRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSI  457 (533)
Q Consensus       401 H~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~  457 (533)
                      |.|+++-.+                              - +...|+|+|....+..
T Consensus       131 ~fITITnnr------------------------------L-~~~~IyINg~Li~~~~  156 (195)
T PF07953_consen  131 FFITITNNR------------------------------L-GNSKIYINGNLIDNES  156 (195)
T ss_dssp             EEEEEEEET------------------------------T-SEEEEEETTEEEEEEE
T ss_pred             EEEEEEccc------------------------------C-ccceEEECCEEEcccc
Confidence            999999832                              3 4449999999887754


No 98 
>PTZ00334 trans-sialidase; Provisional
Probab=31.90  E-value=68  Score=36.23  Aligned_cols=78  Identities=17%  Similarity=0.149  Sum_probs=45.5

Q ss_pred             eeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEEECCceeeeEecCCcceeeccCCceE
Q psy5025         393 TKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQLNTDSGLY  472 (533)
Q Consensus       393 ~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~  472 (533)
                      ....-++-|+|.|..                               +++++..++|||+.+.....+.......--+.+|
T Consensus       638 stWe~~k~yqVal~L-------------------------------~~G~~gsvYVDG~~vg~~~~~l~~~~~~~IshFy  686 (780)
T PTZ00334        638 SNWEPETTHQVAIVL-------------------------------RNGKQGSAYVDGQRVGDASCELKNTDSKGISHFY  686 (780)
T ss_pred             ccccCCCeEEEEEEE-------------------------------eCCCeEEEEECCEEecCcccccCCCCCcccceEE
Confidence            346678889999887                               1567999999999774422221111222224699


Q ss_pred             EcCCCCcccccccccccCceeeEEEEEEcCceEEe
Q psy5025         473 VGGVEDIEELTRSKYHHSFKGWISNLILSNDYQVK  507 (533)
Q Consensus       473 lGg~~~~~~~~~~~~~~~f~GCi~~l~i~~~~~l~  507 (533)
                      |||-.....     ....-.=-++|+.+ -.++|+
T Consensus       687 iGgdg~~~~-----~~~~~~VTV~NVlL-YNRpL~  715 (780)
T PTZ00334        687 IGGDGGSAG-----SKEDVPVTATNVLL-YNRPLD  715 (780)
T ss_pred             ECCCccccc-----cCCCCCEEEeEeEE-eCCCCC
Confidence            998543211     01112225677777 566665


No 99 
>KOG1226|consensus
Probab=31.27  E-value=43  Score=37.12  Aligned_cols=27  Identities=33%  Similarity=0.880  Sum_probs=21.8

Q ss_pred             CCCCCCeeeeCCCCceecCCCCCCCCCcccc
Q psy5025          51 PCEHGGKCITTPGSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        51 pC~NggtC~~~~~~y~C~C~~g~~G~~Ce~~   81 (533)
                      .|...|.|.=.    .|.|.+||+|+.|+-+
T Consensus       556 lC~g~G~C~CG----~CvC~~GwtG~~C~C~  582 (783)
T KOG1226|consen  556 LCGGHGRCECG----RCVCNPGWTGSACNCP  582 (783)
T ss_pred             ccCCCCeEeCC----cEEcCCCCccCCCCCC
Confidence            67777878554    7999999999998865


No 100
>PF02057 Glyco_hydro_59:  Glycosyl hydrolase family 59;  InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=29.08  E-value=5.3e+02  Score=28.75  Aligned_cols=56  Identities=29%  Similarity=0.332  Sum_probs=36.3

Q ss_pred             CeEEEEEECCEEEEEEEcCCCceEEEee-ccccCCCcEEEEEEEECCEEEEEEcCccCc
Q psy5025         135 DFIALYLNDRYVDFTFDLGTGAATLRSS-NPISLGEWHKLRLTRTGRHAYLQVDRFPSS  192 (533)
Q Consensus       135 ~fl~l~L~~G~l~~~~~~g~~~~~v~s~-~~lnDg~Wh~V~v~~~~~~~~L~VD~~~~~  192 (533)
                      =|+.| ..+|.-.+.-++.. ...+.+. ..+.-++||++.|...+..+.-.+|+....
T Consensus       579 ~~f~v-~~~G~w~vt~d~~~-~~~l~~G~~~~~~~~WhtltL~~~g~~~ta~lng~~l~  635 (669)
T PF02057_consen  579 YFFWV-YANGTWSVTSDLAG-TTTLASGTADIGAGKWHTLTLTISGSTATAMLNGTVLW  635 (669)
T ss_dssp             EEEEE-ETTTEEEEEEETTS--SEEEEEE-S--TT-EEEEEEEEETTEEEEEETTEEEE
T ss_pred             EEEEE-EcCCcEEEeccCCC-cEEEeeeeecccCCeEEEEEEEEECCEEEEEECCEEeE
Confidence            34444 56787777656553 3344433 457779999999999999999999996543


No 101
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind.  This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor.  L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face".  This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers.  Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=28.69  E-value=4.1e+02  Score=24.84  Aligned_cols=23  Identities=22%  Similarity=0.251  Sum_probs=19.5

Q ss_pred             CCcEEEEEEEE--CCEEEEEEcCcc
Q psy5025         168 GEWHKLRLTRT--GRHAYLQVDRFP  190 (533)
Q Consensus       168 g~Wh~V~v~~~--~~~~~L~VD~~~  190 (533)
                      |+||++.|...  .+.+.+.++...
T Consensus       154 g~~~~v~I~Y~~~~~~L~v~l~~~~  178 (223)
T cd01951         154 GNEHTVRITYDPTTNTLTVYLDNGS  178 (223)
T ss_pred             CCEEEEEEEEeCCCCEEEEEECCCC
Confidence            99999999999  478888888754


No 102
>PF05910 DUF868:  Plant protein of unknown function (DUF868);  InterPro: IPR008586 This family consists of several hypothetical proteins from plants. The function of this family is unknown.
Probab=27.53  E-value=1.3e+02  Score=29.49  Aligned_cols=38  Identities=11%  Similarity=0.131  Sum_probs=27.4

Q ss_pred             CcEEEEEECCceeeeEecCCcceeeccCCceEEcCCCCcc
Q psy5025         441 GKQCLIRIDTDYSMQSISGGKFEQLNTDSGLYVGGVEDIE  480 (533)
Q Consensus       441 ~~~~~l~vD~~~~~~~~~~~~~~~l~~~~~l~lGg~~~~~  480 (533)
                      ...+.|.||+..+.+..-...  .+..+..|+|+|+|...
T Consensus       173 dp~l~V~VDgk~v~~VkrL~W--kFRGNqti~vdg~~V~V  210 (274)
T PF05910_consen  173 DPELWVSVDGKKVVQVKRLRW--KFRGNQTIFVDGLPVQV  210 (274)
T ss_pred             CceEEEEECCEEEEEEEEeee--cccCceEEEECCeEEEE
Confidence            458999999998877543221  24567779999988764


No 103
>PF03425 CBM_11:  Carbohydrate binding domain (family 11);  InterPro: IPR005087 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM11 from CAZY which binds both beta-1,4-glucan and beta-1,3-1,4-mixed linked glucans.; GO: 0008810 cellulase activity, 0030245 cellulose catabolic process; PDB: 1V0A_A.
Probab=26.47  E-value=1.7e+02  Score=26.45  Aligned_cols=49  Identities=16%  Similarity=0.253  Sum_probs=19.1

Q ss_pred             CCeEEEEeccCCCCCeEEEEEECCEEEEEEEcCCceEEEeeCceeecCCCeEEEEEEEee
Q psy5025         350 SSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLTVHRNF  409 (533)
Q Consensus       350 ~~g~ll~~~~~~~~~~l~l~l~~g~l~~~~~~g~~~~~~~~~~~~~~dg~wH~v~~~~~~  409 (533)
                      -+|+=|++....+..-|.|.|.++.          ...+.......+.. |++|+|-.+.
T Consensus        73 ~~gl~Fw~k~dgs~~~l~vqi~d~~----------~~e~~~~~~~~~~~-W~~V~IPF~~  121 (178)
T PF03425_consen   73 YGGLSFWIKGDGSGNKLRVQIKDGG----------DYEYWEASFTDSST-WKTVEIPFSD  121 (178)
T ss_dssp             --EEEEEEEE------EEEEEEEE-----------EEEEEEEEE---SS--EEEEEEGGG
T ss_pred             CCcEEEEEEcCCCCcEEEEEEecCC----------cceeeEeecCCCCc-CEEEEEEHHH
Confidence            3455555543333555666665554          11111123445555 9999998843


No 104
>PTZ00334 trans-sialidase; Provisional
Probab=24.78  E-value=5.7e+02  Score=29.18  Aligned_cols=74  Identities=15%  Similarity=0.138  Sum_probs=44.5

Q ss_pred             ccCCCcEEEEEEE-ECCEEEEEEcCccCceeeCC-CCceeeecCCCEEEcCcCCCCccCCCcccccCceeeeeeEEECcE
Q psy5025         165 ISLGEWHKLRLTR-TGRHAYLQVDRFPSSQILSP-GPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIGCIQKVLINNH  242 (533)
Q Consensus       165 lnDg~Wh~V~v~~-~~~~~~L~VD~~~~~~~~~~-~~~~~l~~~~~l~vGG~p~~~~~~~~~~~~~~F~GCI~~l~in~~  242 (533)
                      -..++-|+|.|.. +++...++||+........+ ......++ ..|||||-.......      ....=-++++.+-++
T Consensus       640 We~~k~yqVal~L~~G~~gsvYVDG~~vg~~~~~l~~~~~~~I-shFyiGgdg~~~~~~------~~~~VTV~NVlLYNR  712 (780)
T PTZ00334        640 WEPETTHQVAIVLRNGKQGSAYVDGQRVGDASCELKNTDSKGI-SHFYIGGDGGSAGSK------EDVPVTATNVLLYNR  712 (780)
T ss_pred             ccCCCeEEEEEEEeCCCeEEEEECCEEecCcccccCCCCCccc-ceEEECCCccccccC------CCCCEEEeEeEEeCC
Confidence            4567889999887 56789999999654221111 11122222 579999976321110      112224788888888


Q ss_pred             Ecc
Q psy5025         243 ELN  245 (533)
Q Consensus       243 ~l~  245 (533)
                      +++
T Consensus       713 pL~  715 (780)
T PTZ00334        713 PLD  715 (780)
T ss_pred             CCC
Confidence            886


No 105
>smart00289 WR1 Worm-specific repeat type 1. Worm-specific repeat type 1. Cysteine-rich domain apparently unique (so far) to C. elegans. Often appears with KU domains. About 3 dozen worm proteins contain this domain.
Probab=22.99  E-value=78  Score=20.21  Aligned_cols=23  Identities=30%  Similarity=1.009  Sum_probs=16.8

Q ss_pred             CCC-CCCCCCCCeeeeCCCCceecCC
Q psy5025          46 KCN-KTPCEHGGKCITTPGSSYCLCP   70 (533)
Q Consensus        46 ~C~-~~pC~NggtC~~~~~~y~C~C~   70 (533)
                      .|. ..+|..|-.|+.. ..+.| |+
T Consensus        15 ~C~~~~~CP~g~~C~~~-~~~~C-C~   38 (38)
T smart00289       15 RCSPNGSCPSGYSCQNS-KQGIC-CP   38 (38)
T ss_pred             ECCCCCCCCCCCEEecC-CCccc-Cc
Confidence            465 6788888899887 66666 53


No 106
>cd02178 GH16_beta_agarase Beta-agarase, member of glycosyl hydrolase family 16. Beta-agarase is a glycosyl hydrolase family 16 (GH16) member that hydrolyzes the internal beta-1,4-linkage of agarose, a hydrophilic polysaccharide found in the cell wall of Rhodophyceaea, marine red algae. Agarose is a linear chain of galactose units linked by alternating L-alpha-1,3- and D-beta-1,4-linkages that are additionally modified by a 3,6-anhydro-bridge. Agarose forms thermo-reversible gels that are widely used in the food industry or as a laboratory medium. While beta-agarases are also found in two other families derived from the sequence-based classification of glycosyl hydrolases (GH50, and GH86) the GH16 members are most abundant.  This domain adopts a curved  beta-sandwich conformation, with a tunnel-shaped active site cavity, referred to as a jellyroll fold.
Probab=22.66  E-value=1.9e+02  Score=27.89  Aligned_cols=29  Identities=14%  Similarity=-0.041  Sum_probs=25.0

Q ss_pred             CCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhcccccc-CcEEEEEECCceeeeEe
Q psy5025         397 NGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRT-GKQCLIRIDTDYSMQSI  457 (533)
Q Consensus       397 dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~vD~~~~~~~~  457 (533)
                      .+.||...|.+                                + ...+..+|||..+....
T Consensus       178 ~~~fHtY~veW--------------------------------~~p~~i~fyvDG~~~~~~~  207 (258)
T cd02178         178 ADDFHVYGVYW--------------------------------KDPDTIRFYIDGVLVRTVE  207 (258)
T ss_pred             ccCeEEEEEEE--------------------------------cCCCeEEEEECCEEEEEEc
Confidence            46899999999                                8 89999999999876653


No 107
>PF08787 Alginate_lyase2:  Alginate lyase;  InterPro: IPR014895 Alginate lyases are enzymes that degrade the linear polysaccharide alignate. They cleave the glycosidic linkage of alignate through a beta-elimination reaction. This region forms an all beta fold, which is different to the all alpha fold of IPR008397 from INTERPRO. ; PDB: 1VAV_B 1UAI_A 1J1T_A 2Z42_A 2ZAC_A 2ZAB_A 2ZAA_A 2ZA9_A 2CWS_A.
Probab=22.24  E-value=6.8e+02  Score=23.77  Aligned_cols=52  Identities=15%  Similarity=0.210  Sum_probs=36.9

Q ss_pred             ECCEEEEEEEc---CCceEEEeeCceeecCCCeEEEEEEEeeeeeecCCcccccCCcchhhhhhhhhccccccCcEEEEE
Q psy5025         371 HNGYAHVMYNL---GNGEVLLENNQTKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIR  447 (533)
Q Consensus       371 ~~g~l~~~~~~---g~~~~~~~~~~~~~~dg~wH~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~  447 (533)
                      ..|.|.+.++.   .+...... .-..+.=|+|..+.|..                                .+..+.|+
T Consensus       135 ~~G~l~~~~~~~~~~~~~~~~~-~~~~i~LG~~F~y~I~v--------------------------------~~~~l~V~  181 (236)
T PF08787_consen  135 EKGSLYVYVRQSNPDGGDQEYT-IYGGIPLGEWFSYEIEV--------------------------------SGGTLTVT  181 (236)
T ss_dssp             ETTEEEEEEESSTTTTSEEEEE-EEEEEETT-EEEEEEEE--------------------------------ETTEEEEE
T ss_pred             cCCeEEEEEeccCCCCCcEEee-eEcceeCCCEEEEEEEE--------------------------------ECCEEEEE
Confidence            78999999982   12222222 22467778999999999                                88899999


Q ss_pred             ECCceeee
Q psy5025         448 IDTDYSMQ  455 (533)
Q Consensus       448 vD~~~~~~  455 (533)
                      |++.....
T Consensus       182 ing~~~~~  189 (236)
T PF08787_consen  182 INGEGKTT  189 (236)
T ss_dssp             ETTEEEEE
T ss_pred             EECCcceE
Confidence            99986655


No 108
>PF09313 DUF1971:  Domain of unknown function (DUF1971);  InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=20.74  E-value=4.2e+02  Score=20.73  Aligned_cols=47  Identities=17%  Similarity=0.197  Sum_probs=31.2

Q ss_pred             CCCCeEEEEEECCEEEEEEEcCCc----eEEEeeCc--eeecCCCeEEEEEEE
Q psy5025         361 QSKDFLALGIHNGYAHVMYNLGNG----EVLLENNQ--TKINNGVWHRLTVHR  407 (533)
Q Consensus       361 ~~~~~l~l~l~~g~l~~~~~~g~~----~~~~~~~~--~~~~dg~wH~v~~~~  407 (533)
                      ..+.|-.|.+..|.|.+....+.+    ...+...+  .-+.-..||+|...-
T Consensus        22 K~GtWg~l~Vl~G~L~f~~~~~~~~~~~~~~~~~~~~~~~i~Pq~wH~V~p~s   74 (82)
T PF09313_consen   22 KAGTWGKLRVLEGELKFYGLDEEGEEPEEEVFIPAGQPPVIEPQQWHRVEPLS   74 (82)
T ss_dssp             STTEEEEEEEEESEEEEEEESSTT-SESEEEEEETTEEEEE-TT-EEEEEESS
T ss_pred             CCCeEEEEEEEeeEEEEEEECCCCCceeEEEEeCCCCCceeCCCceEEEEECC
Confidence            347899999999999999876642    22222222  348899999998644


No 109
>KOG0994|consensus
Probab=20.16  E-value=1e+02  Score=35.96  Aligned_cols=38  Identities=26%  Similarity=0.716  Sum_probs=29.5

Q ss_pred             CCCCCCCCCCCCC--------eeeeCC--CCceecCCCCCCCCCcccc
Q psy5025          44 EDKCNKTPCEHGG--------KCITTP--GSSYCLCPLGFTGDLCQTR   81 (533)
Q Consensus        44 ~d~C~~~pC~Ngg--------tC~~~~--~~y~C~C~~g~~G~~Ce~~   81 (533)
                      ...|.+-||+-|-        +|....  ....|+|..||.|..|+.-
T Consensus       903 g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~C  950 (1758)
T KOG0994|consen  903 GIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEIC  950 (1758)
T ss_pred             CCCCCCCCCCCCCccchhccccccccccccceeeecccCccccchhhh
Confidence            4568888897653        555443  6788999999999999976


Done!