Query         psy9686
Match_columns 769
No_of_seqs    538 out of 2838
Neff          6.1 
Searched_HMMs 46136
Date          Fri Aug 16 17:00:42 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9686.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9686hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00119 LYZ1 C-type lysozyme ( 100.0 2.8E-46   6E-51  347.8   5.2  119  279-405     1-122 (123)
  2 PF00062 Lys:  C-type lysozyme/ 100.0 9.7E-47 2.1E-51  352.9   2.1  117  279-403     1-122 (125)
  3 smart00263 LYZ1 Alpha-lactalbu 100.0 2.2E-45 4.8E-50  343.9   5.3  121  279-407     1-126 (127)
  4 smart00263 LYZ1 Alpha-lactalbu  99.9 2.8E-28 6.2E-33  228.7   5.5   79   12-97     46-127 (127)
  5 cd00119 LYZ1 C-type lysozyme (  99.9 2.8E-28   6E-33  227.4   5.0   77   12-94     44-122 (123)
  6 PF00062 Lys:  C-type lysozyme/  99.9 7.9E-28 1.7E-32  225.6   3.3   77   12-96     46-125 (125)
  7 KOG1214|consensus               99.6 1.5E-15 3.2E-20  173.3  15.4  250  413-725   639-908 (1289)
  8 KOG1219|consensus               99.5 3.5E-14 7.7E-19  172.9   8.2  120  509-687  3857-3977(4289)
  9 KOG1219|consensus               99.4 1.8E-13 3.8E-18  167.0   8.3  112  609-729  3865-3977(4289)
 10 KOG1214|consensus               99.3   9E-12   2E-16  142.9  11.6  148  520-726   699-859 (1289)
 11 KOG4289|consensus               99.3 9.1E-12   2E-16  148.2  10.6  109  515-678  1179-1308(2531)
 12 KOG1217|consensus               99.2 2.5E-10 5.4E-15  128.2  16.1  204  516-761   170-388 (487)
 13 KOG1217|consensus               99.0 3.3E-09 7.1E-14  119.2  15.8  228  516-762    90-350 (487)
 14 KOG4289|consensus               99.0 4.7E-10   1E-14  134.0   7.0  100  623-730  1216-1317(2531)
 15 KOG1225|consensus               98.8 2.6E-08 5.6E-13  113.9  10.7  158  539-757   234-395 (525)
 16 KOG1225|consensus               98.5 6.1E-07 1.3E-11  102.9  10.8   88  616-731   256-343 (525)
 17 KOG1226|consensus               98.3 4.7E-06   1E-10   97.4  12.2  101  630-746   526-636 (783)
 18 KOG4260|consensus               98.2 1.4E-06 3.1E-11   90.6   5.4  150  520-724   149-304 (350)
 19 KOG4260|consensus               98.2 2.3E-06   5E-11   89.0   6.5  120  626-757   165-300 (350)
 20 KOG0994|consensus               97.9 4.6E-05   1E-09   91.4  10.0  208  498-729   890-1146(1758)
 21 PF00008 EGF:  EGF-like domain   97.6 4.1E-05 8.9E-10   55.6   2.0   31  518-551     1-32  (32)
 22 smart00179 EGF_CA Calcium-bind  97.5 0.00016 3.4E-09   53.7   4.2   36  514-552     1-38  (39)
 23 KOG1226|consensus               97.4 0.00047   1E-08   81.2   8.7   95  630-737   479-590 (783)
 24 PF00008 EGF:  EGF-like domain   97.2 0.00015 3.2E-09   52.7   1.7   30  697-727     3-32  (32)
 25 smart00179 EGF_CA Calcium-bind  97.2 0.00044 9.6E-09   51.2   4.3   36  691-729     2-39  (39)
 26 KOG0994|consensus               97.1  0.0013 2.7E-08   79.7   8.5   62  671-745   885-964 (1758)
 27 cd00054 EGF_CA Calcium-binding  97.1 0.00082 1.8E-08   49.0   4.2   35  515-552     2-37  (38)
 28 PRK15328 invasion protein IagB  97.0 0.00012 2.7E-09   72.5  -0.5   45  294-342    29-73  (160)
 29 PRK13722 lytic transglycosylas  97.0 0.00018   4E-09   71.7   0.0   44  294-341    30-73  (169)
 30 PF07645 EGF_CA:  Calcium-bindi  96.9 0.00051 1.1E-08   52.8   2.3   32  514-548     1-34  (42)
 31 PF07645 EGF_CA:  Calcium-bindi  96.9 0.00074 1.6E-08   51.9   2.6   33  646-680     1-34  (42)
 32 cd00054 EGF_CA Calcium-binding  96.8  0.0021 4.6E-08   46.8   4.3   36  691-729     2-38  (38)
 33 PRK13888 conjugal transfer pro  96.6 0.00056 1.2E-08   69.6   0.4   31  307-339    35-65  (206)
 34 PF12947 EGF_3:  EGF domain;  I  96.5  0.0019 4.2E-08   48.2   2.3   32  653-686     5-36  (36)
 35 cd00053 EGF Epidermal growth f  96.4  0.0054 1.2E-07   43.9   4.1   32  518-552     2-35  (36)
 36 PF06247 Plasmod_Pvs28:  Plasmo  96.1  0.0016 3.5E-08   65.3   0.5  145  521-726     6-162 (197)
 37 smart00181 EGF Epidermal growt  96.1   0.009   2E-07   43.4   4.0   32  517-552     1-34  (35)
 38 PF01464 SLT:  Transglycosylase  95.7  0.0021 4.6E-08   59.8  -0.6   64  307-377    20-83  (121)
 39 cd00053 EGF Epidermal growth f  95.7   0.017 3.6E-07   41.3   4.2   30  697-728     5-35  (36)
 40 smart00181 EGF Epidermal growt  95.4   0.022 4.9E-07   41.2   4.0   29  698-729     6-35  (35)
 41 KOG1836|consensus               95.3    0.18 3.9E-06   65.9  14.0  197  518-729   777-1021(1705)
 42 PRK13864 type IV secretion sys  95.1   0.007 1.5E-07   63.8   0.5   33  307-340    49-104 (245)
 43 cd00254 LT_GEWL Lytic Transgly  94.9    0.01 2.2E-07   53.8   0.9   61  307-377     7-67  (113)
 44 PF12662 cEGF:  Complement Clr-  94.8    0.02 4.3E-07   39.0   2.0   19  539-557     2-24  (24)
 45 PF12662 cEGF:  Complement Clr-  94.8   0.022 4.9E-07   38.8   2.3   19  715-734     2-24  (24)
 46 PF06247 Plasmod_Pvs28:  Plasmo  94.2   0.013 2.8E-07   59.0  -0.1  129  622-757    13-156 (197)
 47 PF12947 EGF_3:  EGF domain;  I  94.1   0.025 5.5E-07   42.3   1.3   29  697-727     5-33  (36)
 48 PF07974 EGF_2:  EGF-like domai  93.6   0.088 1.9E-06   38.4   3.4   27  698-728     6-32  (32)
 49 PF07974 EGF_2:  EGF-like domai  93.4    0.11 2.4E-06   37.9   3.6   27  521-552     6-32  (32)
 50 PF12661 hEGF:  Human growth fa  93.2   0.038 8.3E-07   32.2   0.8   13  716-728     1-13  (13)
 51 PF06737 Transglycosylas:  Tran  92.6   0.024 5.1E-07   49.4  -1.0   71  301-390     4-76  (77)
 52 PF12661 hEGF:  Human growth fa  91.7   0.075 1.6E-06   31.0   0.7   13  540-552     1-13  (13)
 53 cd01021 GEWL Goose Egg White L  90.6    0.13 2.8E-06   51.5   1.7   49  287-339    35-83  (166)
 54 smart00051 DSL delta serrate l  90.0    0.39 8.4E-06   40.5   3.8   47  671-728    17-63  (63)
 55 KOG1836|consensus               89.6    0.64 1.4E-05   61.1   7.1   56  674-733   760-816 (1705)
 56 KOG1218|consensus               87.3     5.1 0.00011   43.2  11.2   80  621-711   118-199 (316)
 57 PRK13888 conjugal transfer pro  85.9    0.26 5.7E-06   50.6   0.4   18   11-28     48-65  (206)
 58 PHA03099 epidermal growth fact  85.6    0.75 1.6E-05   43.8   3.1   32  698-730    51-82  (139)
 59 PF14670 FXa_inhibition:  Coagu  85.4    0.88 1.9E-05   34.1   2.8   20  660-681    10-29  (36)
 60 PHA02887 EGF-like protein; Pro  84.3    0.93   2E-05   42.5   3.1   32  698-730    92-123 (126)
 61 PF14670 FXa_inhibition:  Coagu  83.4    0.91   2E-05   34.0   2.2   23  522-549     7-29  (36)
 62 smart00051 DSL delta serrate l  81.4     1.6 3.5E-05   36.8   3.2   47  629-686    17-63  (63)
 63 cd01475 vWA_Matrilin VWA_Matri  81.1     1.5 3.2E-05   45.5   3.6   34  511-549   183-218 (224)
 64 PHA03099 epidermal growth fact  80.8     1.4   3E-05   42.1   2.8   40  515-556    42-84  (139)
 65 PRK15470 emtA lytic murein end  80.8    0.63 1.4E-05   48.1   0.7   38  294-339    50-87  (203)
 66 KOG3516|consensus               80.6     1.1 2.5E-05   55.9   2.8   40  512-554   542-582 (1306)
 67 PF12946 EGF_MSP1_1:  MSP1 EGF   79.6     1.3 2.8E-05   33.4   1.8   32  653-686     4-36  (37)
 68 KOG3516|consensus               79.5     1.4 3.1E-05   55.0   3.2   45  685-734   541-586 (1306)
 69 cd01475 vWA_Matrilin VWA_Matri  78.3     2.3   5E-05   44.1   3.9   38  683-726   181-219 (224)
 70 PHA02887 EGF-like protein; Pro  77.1     2.2 4.7E-05   40.2   2.9   39  515-555    83-124 (126)
 71 PRK11671 mltC murein transglyc  72.3     1.6 3.5E-05   48.9   1.0   46  286-339   195-240 (359)
 72 KOG1218|consensus               71.3      22 0.00047   38.3   9.5  107  634-757    96-206 (316)
 73 KOG3512|consensus               69.4     7.6 0.00017   44.5   5.4   53  671-729   371-428 (592)
 74 PF01683 EB:  EB module;  Inter  66.3      12 0.00026   29.8   4.6   22  697-724    25-46  (52)
 75 KOG3514|consensus               65.0     3.8 8.3E-05   50.8   2.1   34  517-553   625-659 (1591)
 76 PRK15328 invasion protein IagB  63.7     1.9 4.2E-05   43.0  -0.6   19   12-30     54-72  (160)
 77 KOG3512|consensus               61.4      40 0.00087   39.0   9.0   25  527-553   285-309 (592)
 78 KOG3514|consensus               54.4     8.8 0.00019   47.9   2.7   44  684-730   616-660 (1591)
 79 PRK10783 mltD membrane-bound l  49.4     7.2 0.00016   45.3   0.9   44  286-339   108-151 (456)
 80 PF00053 Laminin_EGF:  Laminin   44.4      14 0.00029   29.1   1.5   21  705-729    12-32  (49)
 81 PF01414 DSL:  Delta serrate li  43.4       8 0.00017   32.6   0.0   47  671-728    17-63  (63)
 82 cd00055 EGF_Lam Laminin-type e  42.5      23 0.00051   28.0   2.6   14  716-729    20-33  (50)
 83 PF12946 EGF_MSP1_1:  MSP1 EGF   41.7      21 0.00046   27.1   2.0   30  518-549     2-31  (37)
 84 PF00954 S_locus_glycop:  S-loc  41.5      26 0.00056   32.3   3.1   32  515-550    77-109 (110)
 85 cd00254 LT_GEWL Lytic Transgly  37.7      16 0.00034   32.7   1.1   47   14-66     21-67  (113)
 86 PF00954 S_locus_glycop:  S-loc  37.7      34 0.00075   31.4   3.3   33  691-726    77-109 (110)
 87 PF01683 EB:  EB module;  Inter  35.8      49  0.0011   26.2   3.5   25  654-684    26-50  (52)
 88 PRK11619 lytic murein transgly  35.6      16 0.00036   44.3   1.1   33  303-340   495-528 (644)
 89 PF01464 SLT:  Transglycosylase  34.5      10 0.00022   35.0  -0.8   48   13-66     36-83  (121)
 90 PRK10859 membrane-bound lytic   33.2      18 0.00039   42.2   0.8   31  304-339   306-336 (482)
 91 PF05497 Destabilase:  Destabil  31.0     4.8  0.0001   38.3  -3.5   62  306-376     8-72  (119)
 92 PRK13722 lytic transglycosylas  30.2      15 0.00033   37.0  -0.4   18   12-29     55-72  (169)
 93 PHA00368 internal virion prote  29.9      24 0.00052   44.2   1.1   39  293-339    21-59  (1315)
 94 COG4764 Uncharacterized protei  29.2      43 0.00093   33.4   2.5   93  287-392    48-150 (197)
 95 smart00180 EGF_Lam Laminin-typ  26.8      56  0.0012   25.5   2.4   14  716-729    19-32  (46)
 96 PF01414 DSL:  Delta serrate li  24.4      29 0.00064   29.3   0.4   48  628-686    16-63  (63)
 97 PF12955 DUF3844:  Domain of un  24.3      70  0.0015   29.8   2.8   31  516-546     6-40  (103)
 98 PF12955 DUF3844:  Domain of un  20.9      89  0.0019   29.1   2.8   26  697-722    12-40  (103)

No 1  
>cd00119 LYZ1 C-type lysozyme (1, 4-beta-N-acetylmuramidase, LYZ) and alpha-lactalbumin (lactose synthase B protein, LA). They have a close evolutionary relationship and similar tertiary structure, however, functionally they are quite different. Lysozymes have primarily bacteriolytic function; hydrolysis of peptidoglycan of prokaryotic cell walls and transglycosylation. LA is a calcium-binding metalloprotein that is expressed exclusively in the mammary gland during lactation. LA is the regulatory subunit of the enzyme lactose synthase. The association of LA with the catalytic component of lactose synthase, galactosyltransferase, alters the acceptor substrate specificity of this glycosyltransferase, facilitating biosynthesis of lactose.
Probab=100.00  E-value=2.8e-46  Score=347.78  Aligned_cols=119  Identities=42%  Similarity=0.834  Sum_probs=112.5

Q ss_pred             eeecchHHHHHHHHhcCC-CCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCC-Cccccccccc
Q psy9686         279 KRFGACELAKFLVRQRGI-ARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGP-AGKECHAKCS  356 (769)
Q Consensus       279 k~~~rCela~~l~~~~g~-~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~-~~~~C~~~C~  356 (769)
                      |||+|||||++|++ +|| ++.+|++|||||++||+|||+|+++ |.|||+||||||||++|||+++++ ++|.|+|+|+
T Consensus         1 k~~~rCeLa~~L~~-~g~~~~~~l~~Wvcia~~ES~~nt~a~~~-n~~gs~dyGlfQINs~~WC~~~~~~~~n~C~i~C~   78 (123)
T cd00119           1 KIFTRCELAKELKR-LGLYPGISLANWVCLAEHESGFNTQAVNN-NNDGSTDYGLFQINSKYWCNDGKTPSGNICHISCS   78 (123)
T ss_pred             CcCcHhHHHHHHHH-cCCCCCCCHHHHHHHHHHHHCCCccceec-CCCCCeeEeEEEecchhhccCCCCCCCCCcCCCHH
Confidence            79999999999998 576 9999999999999999999999995 679999999999999999999876 5899999999


Q ss_pred             cccCCChhHHHHHHHHHHhcccccccCcccccccCc-cccCCCCCccccc
Q psy9686         357 SFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKVSTYDH  405 (769)
Q Consensus       357 ~ll~~di~~~i~Cak~i~~~~~~~~~~G~~aW~~w~-~C~g~~~l~~~~~  405 (769)
                      +||+|||+|||+|||+|+++|     +||+||++|+ +|+|+ +|++|..
T Consensus        79 ~Ll~ddI~~~i~CAk~I~~~~-----~G~~aW~~w~~~C~g~-~L~~~~~  122 (123)
T cd00119          79 KLLDDDITDDIKCAKKIVKEH-----QGFDAWVAWKNHCQGR-DLSQWVK  122 (123)
T ss_pred             HhccccHHHHHHHHHHHHhcC-----CCchhHHHHHHhCCCC-CCccccC
Confidence            999999999999999999996     7999999999 99997 9999963


No 2  
>PF00062 Lys:  C-type lysozyme/alpha-lactalbumin family;  InterPro: IPR001916 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 22 GH22 from CAZY comprises enzymes with two known activities; lysozyme type C (3.2.1.17 from EC) and alpha-lactalbumins. Asp and/or the carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base.   Alpha-lactalbumin [, ] is a milk protein that acts as the regulatory subunit of lactose synthetase, acting to promote the conversion of galactosyltransferase to lactose synthase, which is essential for milk production. In the mammary gland, alpha-lactalbumin changes the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose.  Lysozymes (3.2.1.17 from EC) act as bacteriolytic enzymes by hydrolyzing the beta(1->4) bonds between N-acetylglucosamine and N-acetylmuramic acid in the peptidoglycan of prokaryotic cell walls. It has also been recruited for a digestive role in certain ruminants and colobine monkeys []. There are at least five different classes of lysozymes []: C (chicken type), G (goose type), phage-type (T4), fungi (Chalaropsis), and bacterial (Bacillus subtilis). There are few similarities in the sequences of the different types of lysozymes. Lysozyme type C and alpha-lactalbumin are similar both in terms of primary sequence and structure, and probably evolved from a common ancestral protein []. Around 35 to 40% of the residues are conserved in both proteins as well as the positions of the four disulphide bonds. There is, however, no similarity in function. Another significant difference between the two enzymes is that all lactalbumins have the ability to bind calcium [], while this property is restricted to only a few lysozymes [].  The binding site was deduced using high resolution X-ray structure analysis and was shown to consist of three aspartic acid residues. It was first suggested that calcium bound to lactalbumin stabilised the structure, but recently it has been claimed that calcium controls the release of lactalbumin from the golgi membrane and that the pattern of ion binding may also affect the catalytic properties of the lactose synthetase complex.; PDB: 1HFX_A 1HMK_A 1HFY_B 1FKQ_A 1FKV_A 3B0K_B 1HML_A 1B9O_A 1A4V_A 3B0O_A ....
Probab=100.00  E-value=9.7e-47  Score=352.87  Aligned_cols=117  Identities=39%  Similarity=0.806  Sum_probs=105.8

Q ss_pred             eeecchHHHHHHHHh--cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCC--Cccccccc
Q psy9686         279 KRFGACELAKFLVRQ--RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGP--AGKECHAK  354 (769)
Q Consensus       279 k~~~rCela~~l~~~--~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~--~~~~C~~~  354 (769)
                      |||+|||||++|++.  .||++++|+||||||+|||+|||+|+++++ |||+|||||||||+|||++++.  ++|.|+|+
T Consensus         1 Kv~~rCeLa~~L~~~g~~g~~~~~l~~WvCla~~eS~f~T~a~~~~~-~gS~dyGIFQIns~~WC~~~~~~~~~n~C~i~   79 (125)
T PF00062_consen    1 KVFTRCELARELKNAGLDGYPGYSLANWVCLAYHESGFNTSAVNYNN-DGSTDYGIFQINSRYWCSDGKTPGSKNICNIS   79 (125)
T ss_dssp             -BC-HHHHHHHHHHTTGTTBTCBGHHHHHHHHHHHHTTBTT-EEEET-TSEEEETTTTEETTTTSBCSSSTTSSTTTSSB
T ss_pred             CcccHhHHHHHHHHcCCCCcchhhhhheeeeEecccCceeeccccCC-CCccccceeecCchhhccCCCCCCCCCccCCc
Confidence            899999999999973  345589999999999999999999999865 9999999999999999999875  47999999


Q ss_pred             cccccCCChhHHHHHHHHHHhcccccccCcccccccCc-cccCCCCCccc
Q psy9686         355 CSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKVSTY  403 (769)
Q Consensus       355 C~~ll~~di~~~i~Cak~i~~~~~~~~~~G~~aW~~w~-~C~g~~~l~~~  403 (769)
                      |++||||||+|||+|||+|++++     +||+||++|+ +|+ + +|+.|
T Consensus        80 C~~ll~ddI~ddi~Cak~I~~~~-----~G~~aW~~w~~~C~-~-~L~~~  122 (125)
T PF00062_consen   80 CSDLLDDDITDDIRCAKKIYREP-----QGFNAWVAWKNHCR-K-DLSSW  122 (125)
T ss_dssp             GGGGGSSSTHHHHHHHHHHHTHH-----TGGGGSHHHHHHTT-S-SGGGG
T ss_pred             cccccccchHHHHHHHHHHHcCc-----ccchhhHHHHHhhc-C-CCccc
Confidence            99999999999999999999775     7999999999 999 5 99999


No 3  
>smart00263 LYZ1 Alpha-lactalbumin / lysozyme C.
Probab=100.00  E-value=2.2e-45  Score=343.93  Aligned_cols=121  Identities=40%  Similarity=0.745  Sum_probs=112.7

Q ss_pred             eeecchHHHHHHHHh--cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCC--Cccccccc
Q psy9686         279 KRFGACELAKFLVRQ--RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGP--AGKECHAK  354 (769)
Q Consensus       279 k~~~rCela~~l~~~--~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~--~~~~C~~~  354 (769)
                      |+|+|||||++|++.  +|||+.+|++|||||++||+|||+|+++ |.|||+||||||||++|||+++++  ++|.|+|+
T Consensus         1 k~~~rCeLa~~L~~~gl~g~~~~~l~~WvCia~~ES~~~t~a~~~-n~~gs~dyGLfQINs~~WC~~~~~~~~~n~C~i~   79 (127)
T smart00263        1 KVFTRCELARELKRLGLDGYRGISLANWVCLAFHESGYNTQATNY-NNGGSTDYGIFQINSKYWCNDGKTPGSVNACGIS   79 (127)
T ss_pred             CcCcHhHHHHHHHHhCcCCCCCcCHHHHHHHHHHHhcCCCcceec-CCCCCceeeEEEecccccccCCCCCCCCCcCCCC
Confidence            789999999999972  4999999999999999999999999996 679999999999999999998864  47999999


Q ss_pred             cccccCCChhHHHHHHHHHHhcccccccCcccccccCc-cccCCCCCccccccC
Q psy9686         355 CSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKVSTYDHAD  407 (769)
Q Consensus       355 C~~ll~~di~~~i~Cak~i~~~~~~~~~~G~~aW~~w~-~C~g~~~l~~~~~~~  407 (769)
                      |++||+|||+|||+|||+|+++      +||+||++|+ +|+|+ +|+.|...|
T Consensus        80 C~~Ll~ddI~ddv~CAk~I~~~------~G~~aW~~w~~~C~g~-~l~~~~~~C  126 (127)
T smart00263       80 CSKLLDDDITDDVKCAKKIVSD------QGIDAWVAWKNHCQGE-DLSQWIRGC  126 (127)
T ss_pred             HhhhcCCChHHHHHHHHHHHHH------cCchHHHHHHHhCCCC-CCChhhccC
Confidence            9999999999999999999999      6999999999 99996 999997555


No 4  
>smart00263 LYZ1 Alpha-lactalbumin / lysozyme C.
Probab=99.94  E-value=2.8e-28  Score=228.73  Aligned_cols=79  Identities=39%  Similarity=0.780  Sum_probs=73.8

Q ss_pred             cCCCCccceEeeeCCcccCCCCCC--CcCccccccchhccCChhHHHHHHHHHhhccccccCCcccccccCc-cccCCCC
Q psy9686          12 DCRGSRDHGIFQINDKYWCTASGP--AGKECHAKCSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSK   88 (769)
Q Consensus        12 ~~~gs~~yGiFQIn~~~wC~~~~~--~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~~~~~~~g~~aW~~w~-~C~~~~~   88 (769)
                      |.|||+||||||||++|||+++++  ++|.|+|+|++||+|||+|||+|||+|+++      +||+||++|+ +|+| ++
T Consensus        46 n~~gs~dyGLfQINs~~WC~~~~~~~~~n~C~i~C~~Ll~ddI~ddv~CAk~I~~~------~G~~aW~~w~~~C~g-~~  118 (127)
T smart00263       46 NNGGSTDYGIFQINSKYWCNDGKTPGSVNACGISCSKLLDDDITDDVKCAKKIVSD------QGIDAWVAWKNHCQG-ED  118 (127)
T ss_pred             CCCCCceeeEEEecccccccCCCCCCCCCcCCCCHhhhcCCChHHHHHHHHHHHHH------cCchHHHHHHHhCCC-CC
Confidence            789999999999999999999874  479999999999999999999999999998      4999999999 9997 79


Q ss_pred             cccccccCc
Q psy9686          89 VSTYYSSSF   97 (769)
Q Consensus        89 ~~~~~~~~~   97 (769)
                      |+.||++|+
T Consensus       119 l~~~~~~C~  127 (127)
T smart00263      119 LSQWIRGCG  127 (127)
T ss_pred             CChhhccCC
Confidence            999998874


No 5  
>cd00119 LYZ1 C-type lysozyme (1, 4-beta-N-acetylmuramidase, LYZ) and alpha-lactalbumin (lactose synthase B protein, LA). They have a close evolutionary relationship and similar tertiary structure, however, functionally they are quite different. Lysozymes have primarily bacteriolytic function; hydrolysis of peptidoglycan of prokaryotic cell walls and transglycosylation. LA is a calcium-binding metalloprotein that is expressed exclusively in the mammary gland during lactation. LA is the regulatory subunit of the enzyme lactose synthase. The association of LA with the catalytic component of lactose synthase, galactosyltransferase, alters the acceptor substrate specificity of this glycosyltransferase, facilitating biosynthesis of lactose.
Probab=99.94  E-value=2.8e-28  Score=227.41  Aligned_cols=77  Identities=40%  Similarity=0.896  Sum_probs=72.6

Q ss_pred             cCCCCccceEeeeCCcccCCCCCC-CcCccccccchhccCChhHHHHHHHHHhhccccccCCcccccccCc-cccCCCCc
Q psy9686          12 DCRGSRDHGIFQINDKYWCTASGP-AGKECHAKCSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKV   89 (769)
Q Consensus        12 ~~~gs~~yGiFQIn~~~wC~~~~~-~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~~~~~~~g~~aW~~w~-~C~~~~~~   89 (769)
                      |.|||+||||||||++|||+++++ ++|.|+|+|++||+|||+|||+|||+|+++|     +||+||++|+ ||+| ++|
T Consensus        44 n~~gs~dyGlfQINs~~WC~~~~~~~~n~C~i~C~~Ll~ddI~~~i~CAk~I~~~~-----~G~~aW~~w~~~C~g-~~L  117 (123)
T cd00119          44 NNDGSTDYGLFQINSKYWCNDGKTPSGNICHISCSKLLDDDITDDIKCAKKIVKEH-----QGFDAWVAWKNHCQG-RDL  117 (123)
T ss_pred             CCCCCeeEeEEEecchhhccCCCCCCCCCcCCCHHHhccccHHHHHHHHHHHHhcC-----CCchhHHHHHHhCCC-CCC
Confidence            788999999999999999999875 5899999999999999999999999999997     4999999999 9997 799


Q ss_pred             ccccc
Q psy9686          90 STYYS   94 (769)
Q Consensus        90 ~~~~~   94 (769)
                      ++|++
T Consensus       118 ~~~~~  122 (123)
T cd00119         118 SQWVK  122 (123)
T ss_pred             ccccC
Confidence            99984


No 6  
>PF00062 Lys:  C-type lysozyme/alpha-lactalbumin family;  InterPro: IPR001916 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 22 GH22 from CAZY comprises enzymes with two known activities; lysozyme type C (3.2.1.17 from EC) and alpha-lactalbumins. Asp and/or the carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base.   Alpha-lactalbumin [, ] is a milk protein that acts as the regulatory subunit of lactose synthetase, acting to promote the conversion of galactosyltransferase to lactose synthase, which is essential for milk production. In the mammary gland, alpha-lactalbumin changes the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose.  Lysozymes (3.2.1.17 from EC) act as bacteriolytic enzymes by hydrolyzing the beta(1->4) bonds between N-acetylglucosamine and N-acetylmuramic acid in the peptidoglycan of prokaryotic cell walls. It has also been recruited for a digestive role in certain ruminants and colobine monkeys []. There are at least five different classes of lysozymes []: C (chicken type), G (goose type), phage-type (T4), fungi (Chalaropsis), and bacterial (Bacillus subtilis). There are few similarities in the sequences of the different types of lysozymes. Lysozyme type C and alpha-lactalbumin are similar both in terms of primary sequence and structure, and probably evolved from a common ancestral protein []. Around 35 to 40% of the residues are conserved in both proteins as well as the positions of the four disulphide bonds. There is, however, no similarity in function. Another significant difference between the two enzymes is that all lactalbumins have the ability to bind calcium [], while this property is restricted to only a few lysozymes [].  The binding site was deduced using high resolution X-ray structure analysis and was shown to consist of three aspartic acid residues. It was first suggested that calcium bound to lactalbumin stabilised the structure, but recently it has been claimed that calcium controls the release of lactalbumin from the golgi membrane and that the pattern of ion binding may also affect the catalytic properties of the lactose synthetase complex.; PDB: 1HFX_A 1HMK_A 1HFY_B 1FKQ_A 1FKV_A 3B0K_B 1HML_A 1B9O_A 1A4V_A 3B0O_A ....
Probab=99.94  E-value=7.9e-28  Score=225.60  Aligned_cols=77  Identities=38%  Similarity=0.867  Sum_probs=71.1

Q ss_pred             cCCCCccceEeeeCCcccCCCCCC--CcCccccccchhccCChhHHHHHHHHHhhccccccCCcccccccCc-cccCCCC
Q psy9686          12 DCRGSRDHGIFQINDKYWCTASGP--AGKECHAKCSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSK   88 (769)
Q Consensus        12 ~~~gs~~yGiFQIn~~~wC~~~~~--~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~~~~~~~g~~aW~~w~-~C~~~~~   88 (769)
                      +.|||+|||||||||+|||++++.  ++|.|+|+|++||||||+|||.|||+|++++     +||+||++|+ +|+  ++
T Consensus        46 ~~~gS~dyGIFQIns~~WC~~~~~~~~~n~C~i~C~~ll~ddI~ddi~Cak~I~~~~-----~G~~aW~~w~~~C~--~~  118 (125)
T PF00062_consen   46 NNDGSTDYGIFQINSRYWCSDGKTPGSKNICNISCSDLLDDDITDDIRCAKKIYREP-----QGFNAWVAWKNHCR--KD  118 (125)
T ss_dssp             ETTSEEEETTTTEETTTTSBCSSSTTSSTTTSSBGGGGGSSSTHHHHHHHHHHHTHH-----TGGGGSHHHHHHTT--SS
T ss_pred             CCCCccccceeecCchhhccCCCCCCCCCccCCccccccccchHHHHHHHHHHHcCc-----ccchhhHHHHHhhc--CC
Confidence            449999999999999999999884  4699999999999999999999999999776     4999999999 999  59


Q ss_pred             cccccccC
Q psy9686          89 VSTYYSSS   96 (769)
Q Consensus        89 ~~~~~~~~   96 (769)
                      |++| ++|
T Consensus       119 L~~~-~~C  125 (125)
T PF00062_consen  119 LSSW-KGC  125 (125)
T ss_dssp             GGGG-TTS
T ss_pred             Cccc-CCC
Confidence            9999 887


No 7  
>KOG1214|consensus
Probab=99.65  E-value=1.5e-15  Score=173.28  Aligned_cols=250  Identities=18%  Similarity=0.361  Sum_probs=177.5

Q ss_pred             cccccccccCcc--CccceeeeEE--ecCcceeEEeecccccccccccccccccccccCCCCceeeccceeehhccCCCC
Q psy9686         413 SACACYMHVNYL--GSSLKLETVL--MSSEKIFISYSSTHHISKMANKFKFNLFKDHADDMGTCTWEKLALALAKKKTHP  488 (769)
Q Consensus       413 ~~C~~~~~~~~~--~~~~~~e~v~--~~~~e~~i~y~~~~~i~~~~~~~~~~~~~~~~~~~~~c~~~~l~~~~~s~~~~~  488 (769)
                      -.|.|+++.+..  ++|++++.|+  +..++.+++++-+..|..+-+.......++++++.+.|.        ......+
T Consensus       639 q~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~igpV~E~S~~~~~npCy~gsh~cd--------t~a~C~p  710 (1289)
T KOG1214|consen  639 QVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQIGPVKEDSDPTPVNPCYDGSHMCD--------TTARCHP  710 (1289)
T ss_pred             EEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhcccceecCCCCcccccceecCcccC--------CCccccC
Confidence            358887776633  8899999977  777799999999999999999988888899999999888        2233344


Q ss_pred             CccccceeeeccccccccccCCCccccCCCCCC--CCCCCcEEeeCCCCccceEeeCCCCCc----CCCCcccccchhcc
Q psy9686         489 MFGKLAHFESQELFLSSALKKPGAAALGGCAGR--PCRNNGTCTPVSGGVVNFTCTCPSGGQ----GQSGLEITPVYLWL  562 (769)
Q Consensus       489 ~~~~~~~~~~~~~~~~~~~~~~gc~dideC~s~--PC~ngGtCvn~~gs~~~y~C~C~~Gft----G~~CedideC~~~~  562 (769)
                      ..+....++|..+|.+   ....|.|+++|+..  .|..+.+|++.+++   |.|+|..||.    +-+|..+..-+   
T Consensus       711 g~~~~~tcecs~g~~g---dgr~c~d~~eca~~~~~CGp~s~Cin~pg~---~rceC~~gy~F~dd~~tCV~i~~pa---  781 (1289)
T KOG1214|consen  711 GTGVDYTCECSSGYQG---DGRNCVDENECATGFHRCGPNSVCINLPGS---YRCECRSGYEFADDRHTCVLITPPA---  781 (1289)
T ss_pred             CCCcceEEEEeeccCC---CCCCCCChhhhccCCCCCCCCceeecCCCc---eeEEEeecceeccCCcceEEecCCC---
Confidence            4555667778777643   44557888999654  49999999999997   9999999984    34464432211   


Q ss_pred             ccCceeeeeeeecccCCCCCCCCC-CCcCCCcccccccccccccccceeeccccccCCCcccc-CCCCcccccCCCCCCC
Q psy9686         563 MKLKTLVMVQYYFRRKSTSPHKSR-GQGQSGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKD-GNLSFYCNCTEDFTGE  640 (769)
Q Consensus       563 ~~~~~~~~~~~~~~~Cs~~pC~~~-g~C~~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n-~~gsy~C~C~~Gy~G~  640 (769)
                                      ..++|..+ ..|.      +                   ...+.|+. +.++|.|.|.|||.|+
T Consensus       782 ----------------p~n~Ce~g~h~C~------i-------------------~g~a~c~~hGgs~y~C~CLPGfsGD  820 (1289)
T KOG1214|consen  782 ----------------PANPCEDGSHTCA------I-------------------AGQARCVHHGGSTYSCACLPGFSGD  820 (1289)
T ss_pred             ----------------CCCccccCccccC------c-------------------CCceEEEecCCceEEEeecCCccCC
Confidence                            24555443 1121      0                   11234554 4567999999999876


Q ss_pred             --ccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc--CCccccC----CCCCCCCCEEecCCCC
Q psy9686         641 --YCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE--NSAACVT----LNPCQNNATCVASPGD  712 (769)
Q Consensus       641 --~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~--~ideC~s----~~pC~ngg~Cvnt~Gs  712 (769)
                        .|.  ++|+|. ++.|+..++|.+++|+  |.|+|.+||.|+|..|...  ....|+.    +-.|+..+.|.-....
T Consensus       821 G~~c~--dvDeC~-psrChp~A~Cyntpgs--fsC~C~pGy~GDGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp  895 (1289)
T KOG1214|consen  821 GHQCT--DVDECS-PSRCHPAATCYNTPGS--FSCRCQPGYYGDGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDP  895 (1289)
T ss_pred             ccccc--cccccC-ccccCCCceEecCCCc--ceeecccCccCCCceecCCCccCCccccccccceeeccccceeEeeCC
Confidence              666  689998 8899999999999987  9999999999999999741  1234541    1236554433332223


Q ss_pred             CceEEecCCCCCC
Q psy9686         713 KQITCLCLKGFEG  725 (769)
Q Consensus       713 ~~y~C~C~~GytG  725 (769)
                      ++|.|.|.++-.|
T Consensus       896 ~~~e~p~~~~ppG  908 (1289)
T KOG1214|consen  896 DGHEVPGTQTPPG  908 (1289)
T ss_pred             CcccCCCCCCCCC
Confidence            4488888766655


No 8  
>KOG1219|consensus
Probab=99.49  E-value=3.5e-14  Score=172.87  Aligned_cols=120  Identities=28%  Similarity=0.662  Sum_probs=102.3

Q ss_pred             CCCcccc-CCCCCCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCC
Q psy9686         509 KPGAAAL-GGCAGRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRG  587 (769)
Q Consensus       509 ~~gc~di-deC~s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g  587 (769)
                      .+||... +.|..+||+|+|+|+..+++  +|.|.|++.|+|+.||.                   ....|.++||.++ 
T Consensus      3857 ~pgC~l~~d~C~~npCqhgG~C~~~~~g--gy~CkCpsqysG~~CEi-------------------~~epC~snPC~~G- 3914 (4289)
T KOG1219|consen 3857 QPGCSLLTDPCNDNPCQHGGTCISQPKG--GYKCKCPSQYSGNHCEI-------------------DLEPCASNPCLTG- 3914 (4289)
T ss_pred             cccccccccccccCcccCCCEecCCCCC--ceEEeCcccccCccccc-------------------ccccccCCCCCCC-
Confidence            3455544 89999999999999999886  79999999999999943                   2344578888554 


Q ss_pred             CcCCCcccccccccccccccceeeccccccCCCccccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCC
Q psy9686         588 QGQSGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPG  667 (769)
Q Consensus       588 ~C~~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~G  667 (769)
                                                      |+|+...+.|.|.|+.||+|..|+...+++|+ .++|.++|.|++..|
T Consensus      3915 --------------------------------gtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs-~n~C~~gg~C~n~~g 3961 (4289)
T KOG1219|consen 3915 --------------------------------GTCIPFYNGFLCNCPNGYTGKRCEARGISECS-KNVCGTGGQCINIPG 3961 (4289)
T ss_pred             --------------------------------CEEEecCCCeeEeCCCCccCceeecccccccc-cccccCCceeeccCC
Confidence                                            55677778899999999999999986689998 899999999999999


Q ss_pred             CCceEeecCCCCeecCCCcc
Q psy9686         668 DKQITCLCLKGGTVSSEYCQ  687 (769)
Q Consensus       668 s~~y~C~C~~Gftg~G~~C~  687 (769)
                      +  |.|.|.+||.  |.+|.
T Consensus      3962 s--f~CncT~g~~--gr~c~ 3977 (4289)
T KOG1219|consen 3962 S--FHCNCTPGIL--GRTCC 3977 (4289)
T ss_pred             c--eEeccChhHh--cccCc
Confidence            8  9999999999  77775


No 9  
>KOG1219|consensus
Probab=99.43  E-value=1.8e-13  Score=166.98  Aligned_cols=112  Identities=23%  Similarity=0.548  Sum_probs=102.0

Q ss_pred             eeeccccccCCCccccCC-CCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcc
Q psy9686         609 VMVQYYLQQNISPLKDGN-LSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQ  687 (769)
Q Consensus       609 ~~~~~~~c~ngg~C~n~~-gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~  687 (769)
                      ..+..+.|+++|+|...+ +.|.|.|++.|+|..|+.+ +.+|. ++||..+++|+...++  |.|.|+.||+  |.+|+
T Consensus      3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~-~epC~-snPC~~GgtCip~~n~--f~CnC~~gyT--G~~Ce 3938 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID-LEPCA-SNPCLTGGTCIPFYNG--FLCNCPNGYT--GKRCE 3938 (4289)
T ss_pred             cccccCcccCCCEecCCCCCceEEeCcccccCcccccc-ccccc-CCCCCCCCEEEecCCC--eeEeCCCCcc--Cceee
Confidence            456677899999999865 6699999999999999985 88998 9999999999998876  9999999999  99999


Q ss_pred             ccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCCC
Q psy9686         688 FENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHRE  729 (769)
Q Consensus       688 ~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce  729 (769)
                      .+-+++|. .++|.++|+|++.+|+  |.|.|.+||.|..|.
T Consensus      3939 ~~Gi~eCs-~n~C~~gg~C~n~~gs--f~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3939 ARGISECS-KNVCGTGGQCINIPGS--FHCNCTPGILGRTCC 3977 (4289)
T ss_pred             cccccccc-cccccCCceeeccCCc--eEeccChhHhcccCc
Confidence            85589995 9999999999999998  999999999999995


No 10 
>KOG1214|consensus
Probab=99.30  E-value=9e-12  Score=142.87  Aligned_cols=148  Identities=25%  Similarity=0.590  Sum_probs=112.9

Q ss_pred             CCCCCCCcEEeeCCCCccceEeeCCCCCc--CCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccccc
Q psy9686         520 GRPCRNNGTCTPVSGGVVNFTCTCPSGGQ--GQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEITP  597 (769)
Q Consensus       520 s~PC~ngGtCvn~~gs~~~y~C~C~~Gft--G~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C~~  597 (769)
                      ++-|..+..|...++.  .|+|.|..||.  |++|.++++|+.                  ....|              
T Consensus       699 sh~cdt~a~C~pg~~~--~~tcecs~g~~gdgr~c~d~~eca~------------------~~~~C--------------  744 (1289)
T KOG1214|consen  699 SHMCDTTARCHPGTGV--DYTCECSSGYQGDGRNCVDENECAT------------------GFHRC--------------  744 (1289)
T ss_pred             CcccCCCccccCCCCc--ceEEEEeeccCCCCCCCCChhhhcc------------------CCCCC--------------
Confidence            4457777788877664  79999999996  567988888753                  12223              


Q ss_pred             ccccccccccceeeccccccCCCccccCCCCcccccCCCCC----CCccccc----ccccccc-CCCCCCCCe--EeeCC
Q psy9686         598 VYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFT----GEYCQFE----NSAACVT-LNPCQNNAT--CVASP  666 (769)
Q Consensus       598 gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~----G~~Ce~~----~ideC~~-~~pC~n~gt--Cvnt~  666 (769)
                                         ..++.|++.+++|.|.|..||.    +-+|-..    .++.|.. .+.|...+.  |+.. 
T Consensus       745 -------------------Gp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~h-  804 (1289)
T KOG1214|consen  745 -------------------GPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHH-  804 (1289)
T ss_pred             -------------------CCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEec-
Confidence                               3456789999999999999984    3355421    2456664 466776654  4444 


Q ss_pred             CCCceEeecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686         667 GDKQITCLCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGP  726 (769)
Q Consensus       667 Gs~~y~C~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~  726 (769)
                      |.+.|.|.|.|||.|+|..|.  ++|+|. ++.|+..+.|++++|+  +.|+|.+||.|.
T Consensus       805 Ggs~y~C~CLPGfsGDG~~c~--dvDeC~-psrChp~A~Cyntpgs--fsC~C~pGy~GD  859 (1289)
T KOG1214|consen  805 GGSTYSCACLPGFSGDGHQCT--DVDECS-PSRCHPAATCYNTPGS--FSCRCQPGYYGD  859 (1289)
T ss_pred             CCceEEEeecCCccCCccccc--cccccC-ccccCCCceEecCCCc--ceeecccCccCC
Confidence            334799999999999999998  789996 8999999999999988  999999999854


No 11 
>KOG4289|consensus
Probab=99.28  E-value=9.1e-12  Score=148.16  Aligned_cols=109  Identities=31%  Similarity=0.641  Sum_probs=85.8

Q ss_pred             cCCCCCCCCCCCcEEeeCC-------------------CCccceEeeCCCCCcCCCC-cccccchhccccCceeeeeeee
Q psy9686         515 LGGCAGRPCRNNGTCTPVS-------------------GGVVNFTCTCPSGGQGQSG-LEITPVYLWLMKLKTLVMVQYY  574 (769)
Q Consensus       515 ideC~s~PC~ngGtCvn~~-------------------gs~~~y~C~C~~GftG~~C-edideC~~~~~~~~~~~~~~~~  574 (769)
                      -+.|...||.|...|+...                   ..+.++.|+||+||+|..| ++||+|+               
T Consensus      1179 DniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCY--------------- 1243 (2531)
T KOG4289|consen 1179 DNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCY--------------- 1243 (2531)
T ss_pred             CchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhh---------------
Confidence            4679999999999896421                   0124599999999999999 5678875               


Q ss_pred             cccCCCCCCCCCCCcCCCcccccccccccccccceeeccccccCCCccccCCCCcccccCCCCCCCcccccc-ccccccC
Q psy9686         575 FRRKSTSPHKSRGQGQSGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFTGEYCQFEN-SAACVTL  653 (769)
Q Consensus       575 ~~~Cs~~pC~~~g~C~~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~-ideC~~~  653 (769)
                           +.||+                                 +++.|....|+|+|.|.+||+|++|+.+. .-.|. +
T Consensus      1244 -----s~pC~---------------------------------nng~C~srEggYtCeCrpg~tGehCEvs~~agrCv-p 1284 (2531)
T KOG4289|consen 1244 -----SGPCG---------------------------------NNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCV-P 1284 (2531)
T ss_pred             -----cCCCC---------------------------------CCCceEEecCceeEEecCCccccceeeecccCccc-c
Confidence                 57774                                 44567777899999999999999999541 12476 7


Q ss_pred             CCCCCCCeEeeCCCCCceEeecCCC
Q psy9686         654 NPCQNNATCVASPGDKQITCLCLKG  678 (769)
Q Consensus       654 ~pC~n~gtCvnt~Gs~~y~C~C~~G  678 (769)
                      ..|.++++|++...+ +|.|.|+.|
T Consensus      1285 GvC~nggtC~~~~ng-gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1285 GVCKNGGTCVNLLNG-GFCCHCPYG 1308 (2531)
T ss_pred             ceecCCCEEeecCCC-ceeccCCCc
Confidence            889999999987643 699999987


No 12 
>KOG1217|consensus
Probab=99.20  E-value=2.5e-10  Score=128.22  Aligned_cols=204  Identities=25%  Similarity=0.494  Sum_probs=141.6

Q ss_pred             CCCC--CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCccc---ccchhccccCceeeeeeeecccCCCCCCCCCCCcC
Q psy9686         516 GGCA--GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEI---TPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQ  590 (769)
Q Consensus       516 deC~--s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~Cedi---deC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~  590 (769)
                      ++|.  ..+|.+++.|.+..++   |.|.|++||.+..|+..   ..|..          .    ..             
T Consensus       170 ~~C~~~~~~c~~~~~C~~~~~~---~~C~c~~~~~~~~~~~~~~~~~c~~----------~----~~-------------  219 (487)
T KOG1217|consen  170 DECIQYSSPCQNGGTCVNTGGS---YLCSCPPGYTGSTCETTGNGGTCVD----------S----VA-------------  219 (487)
T ss_pred             cccccCCCCcCCCcccccCCCC---eeEeCCCCccCCcCcCCCCCceEec----------c----ee-------------
Confidence            5786  4569999999999887   99999999999998542   11110          0    00             


Q ss_pred             CCcccccccccccccccceeeccccccCCCccccCCCCcccccCCCCCCCc---cccccccccccCCCCCCCCeEeeCCC
Q psy9686         591 SGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFTGEY---CQFENSAACVTLNPCQNNATCVASPG  667 (769)
Q Consensus       591 ~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~---Ce~~~ideC~~~~pC~n~gtCvnt~G  667 (769)
                        ..+.+++.+......   .......+ +.|.+..++|.|.|++||.+..   |.  ++++|....+|.++++|++..+
T Consensus       220 --~~~~~g~~~~~c~~~---~~~~~~~~-~~c~~~~~~~~C~~~~g~~~~~~~~~~--~~~~C~~~~~c~~~~~C~~~~~  291 (487)
T KOG1217|consen  220 --CSCPPGARGPECEVS---IVECASGD-GTCVNTVGSYTCRCPEGYTGDACVTCV--DVDSCALIASCPNGGTCVNVPG  291 (487)
T ss_pred             --ccCCCCCCCCCcccc---cccccCCC-CcccccCCceeeeCCCCccccccceee--eccccCCCCccCCCCeeecCCC
Confidence              112222221111100   00000113 7899999999999999999986   33  5788983324999999999987


Q ss_pred             CCceEeecCCCCeecCCCc-cccCCcccc---CCCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCCCCCCCCCCC-CC
Q psy9686         668 DKQITCLCLKGGTVSSEYC-QFENSAACV---TLNPCQNNATCVASPGDKQITCLCLKGFEGPHRELPVESVDEPPS-ED  742 (769)
Q Consensus       668 s~~y~C~C~~Gftg~G~~C-~~~~ideC~---s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~~npC-~~  742 (769)
                      .  |.|.|++||+  |..| ...+.++|.   ...+|.+++.|........+.|.|..||.|..|+...++|...++ .+
T Consensus       292 ~--~~C~C~~g~~--g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~  367 (487)
T KOG1217|consen  292 S--YRCTCPPGFT--GRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPG  367 (487)
T ss_pred             c--ceeeCCCCCC--CCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccC
Confidence            6  9999999999  6665 221446774   256799988994333333488999999999999854458998866 78


Q ss_pred             Cceee-CCCCccccCCC-Ccc
Q psy9686         743 ETSVD-LQLGSQANSYN-WAH  761 (769)
Q Consensus       743 GtC~n-~~ggy~C~~~~-w~~  761 (769)
                      ++|++ ..++|.|.... +.+
T Consensus       368 ~~c~~~~~~~~~c~~~~~~~~  388 (487)
T KOG1217|consen  368 GTCVNETPGSYRCACPAGFAG  388 (487)
T ss_pred             CEeccCCCCCeEecCCCcccc
Confidence            99999 79999998763 443


No 13 
>KOG1217|consensus
Probab=99.03  E-value=3.3e-09  Score=119.18  Aligned_cols=228  Identities=24%  Similarity=0.390  Sum_probs=144.7

Q ss_pred             CCCCCCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccc
Q psy9686         516 GGCAGRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEI  595 (769)
Q Consensus       516 deC~s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C  595 (769)
                      ..|...+....+.|......   |.|.|++||.|..|+...+|...    ..   .......|...+.   ........|
T Consensus        90 ~~~~~~~~~~~~~~~~~~~~---~~c~c~~g~~~~~~~~~~~C~~~----~~---~~~~~~~c~~~~~---~~~~~~c~C  156 (487)
T KOG1217|consen   90 PPCRSPCLLLCGECVDCVGS---YECTCPPGYQGTPCEGECECVTG----PG---VCCIDGSCSNGPG---SVGPFRCSC  156 (487)
T ss_pred             ccccCCcccCCccccCCCCC---ceeeCCCccccCcCCcceeecCC----CC---CeeCchhhcCCCC---CCCceeeee
Confidence            34555555556666666665   99999999999988443234320    00   0011122222222   112222467


Q ss_pred             ccccccccccccceeec--cccccCCCccccCCCCcccccCCCCCCCccccc-------c-----------ccccc-cCC
Q psy9686         596 TPVYLWLMKMKTLVMVQ--YYLQQNISPLKDGNLSFYCNCTEDFTGEYCQFE-------N-----------SAACV-TLN  654 (769)
Q Consensus       596 ~~gy~~~~~~~~C~~~~--~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~-------~-----------ideC~-~~~  654 (769)
                      ..+|.+.........+.  ...|.+++.|.+..++|.|.|+++|.+..|+..       +           .+.|. ...
T Consensus       157 ~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~  236 (487)
T KOG1217|consen  157 TEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIV  236 (487)
T ss_pred             CCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccc
Confidence            78887765553322222  234888999999999999999999999887732       0           11222 122


Q ss_pred             CCCCC-CeEeeCCCCCceEeecCCCCeecC-CCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCC--CC
Q psy9686         655 PCQNN-ATCVASPGDKQITCLCLKGGTVSS-EYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHR--EL  730 (769)
Q Consensus       655 pC~n~-gtCvnt~Gs~~y~C~C~~Gftg~G-~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C--e~  730 (769)
                      .|..+ ++|++..++  |.|.|++||++.. ..|.  +++.|....+|.++++|++..+.  |.|.|++||+|..|  ..
T Consensus       237 ~~~~~~~~c~~~~~~--~~C~~~~g~~~~~~~~~~--~~~~C~~~~~c~~~~~C~~~~~~--~~C~C~~g~~g~~~~~~~  310 (487)
T KOG1217|consen  237 ECASGDGTCVNTVGS--YTCRCPEGYTGDACVTCV--DVDSCALIASCPNGGTCVNVPGS--YRCTCPPGFTGRLCTECV  310 (487)
T ss_pred             cccCCCCcccccCCc--eeeeCCCCccccccceee--eccccCCCCccCCCCeeecCCCc--ceeeCCCCCCCCCCcccc
Confidence            34433 889988877  9999999999443 2344  67889733249999999999876  99999999999998  23


Q ss_pred             CCCCCC----CCCC-CCCce--eeCCCCccccCCC-Cccc
Q psy9686         731 PVESVD----EPPS-EDETS--VDLQLGSQANSYN-WAHM  762 (769)
Q Consensus       731 dideC~----~npC-~~GtC--~n~~ggy~C~~~~-w~~~  762 (769)
                      +.++|.    ..+| ++++|  .+..+.+.|...+ |.++
T Consensus       311 ~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~  350 (487)
T KOG1217|consen  311 DVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR  350 (487)
T ss_pred             ccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC
Confidence            456784    4566 45578  4555567777664 3443


No 14 
>KOG4289|consensus
Probab=98.99  E-value=4.7e-10  Score=134.03  Aligned_cols=100  Identities=36%  Similarity=0.766  Sum_probs=85.8

Q ss_pred             ccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc-CCccccCCCCCC
Q psy9686         623 KDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE-NSAACVTLNPCQ  701 (769)
Q Consensus       623 ~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~-~ideC~s~~pC~  701 (769)
                      ++..+++.|.|++||+|+.|+.. +|.|- +.||.++++|....|+  |+|.|.+||+  |++|+.. ..-.|. +..|.
T Consensus      1216 i~pvnglrCrCPpGFTgd~CeTe-iDlCY-s~pC~nng~C~srEgg--YtCeCrpg~t--GehCEvs~~agrCv-pGvC~ 1288 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYCETE-IDLCY-SGPCGNNGRCRSREGG--YTCECRPGFT--GEHCEVSARAGRCV-PGVCK 1288 (2531)
T ss_pred             ccccCceeEeCCCCCCcccccch-hHhhh-cCCCCCCCceEEecCc--eeEEecCCcc--ccceeeecccCccc-cceec
Confidence            45567899999999999999985 99998 8999999999999887  9999999999  9999852 123575 88899


Q ss_pred             CCCEEecCCCCCceEEecCCC-CCCCCCCC
Q psy9686         702 NNATCVASPGDKQITCLCLKG-FEGPHREL  730 (769)
Q Consensus       702 ngg~Cvnt~Gs~~y~C~C~~G-ytG~~Ce~  730 (769)
                      |+++|++... +++.|+|+.| |++++|+.
T Consensus      1289 nggtC~~~~n-ggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1289 NGGTCVNLLN-GGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred             CCCEEeecCC-CceeccCCCcccCCCceEE
Confidence            9999998754 3599999998 67899985


No 15 
>KOG1225|consensus
Probab=98.77  E-value=2.6e-08  Score=113.94  Aligned_cols=158  Identities=24%  Similarity=0.423  Sum_probs=109.4

Q ss_pred             eEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCC-cccccccccccccc-cceeeccccc
Q psy9686         539 FTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSG-LEITPVYLWLMKMK-TLVMVQYYLQ  616 (769)
Q Consensus       539 y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~-~~C~~gy~~~~~~~-~C~~~~~~~c  616 (769)
                      +.|.|+.+|.|..|+. -.|                     +.-|...+.|.++ +.|.+||.+..+.. .|..   . |
T Consensus       234 ~ic~c~~~~~g~~c~~-~~C---------------------~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~---~-c  287 (525)
T KOG1225|consen  234 GICECPEGYFGPLCST-IYC---------------------PGGCTGRGQCVEGRCICPPGFTGDDCDELVCPV---D-C  287 (525)
T ss_pred             ceeecCCceeCCcccc-ccC---------------------CCCCcccceEeCCeEeCCCCCcCCCCCcccCCc---c-c
Confidence            5899999999999963 333                     4455556777776 57899987744332 3332   2 6


Q ss_pred             cCCCccccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCccccCCccccC
Q psy9686         617 QNISPLKDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFENSAACVT  696 (769)
Q Consensus       617 ~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~~ideC~s  696 (769)
                      +.++.|++.    .|.|.+||.|+.|+.   ..|  +..|.++|.|++  |    +|.|.+||+  |..|..        
T Consensus       288 s~~g~~~~g----~CiC~~g~~G~dCs~---~~c--padC~g~G~Ci~--G----~C~C~~Gy~--G~~C~~--------  342 (525)
T KOG1225|consen  288 SGGGVCVDG----ECICNPGYSGKDCSI---RRC--PADCSGHGKCID--G----ECLCDEGYT--GELCIQ--------  342 (525)
T ss_pred             CCCceecCC----EeecCCCcccccccc---ccC--CccCCCCCcccC--C----ceEeCCCCc--CCcccc--------
Confidence            677777766    899999999999984   357  678999999983  3    599999999  888874        


Q ss_pred             CCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCCCCCCC--CCCCCCCceeeCCCCccccCC
Q psy9686         697 LNPCQNNATCVASPGDKQITCLCLKGFEGPHRELPVESVD--EPPSEDETSVDLQLGSQANSY  757 (769)
Q Consensus       697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~--~npC~~GtC~n~~ggy~C~~~  757 (769)
                      . .|.+++.|++      - |.|..||.|.+  ...+.+.  ..+...+.|+...+...+..+
T Consensus       343 ~-~C~~~g~cv~------g-C~C~~Gw~G~d--~~~~~~~~~~~cs~~~~~~~~~~~~~~~~~  395 (525)
T KOG1225|consen  343 R-ACSGGGQCVN------G-CKCKKGWRGPD--VADPSLLLITECSPPSLCIAGVGRRRVTHC  395 (525)
T ss_pred             c-ccCCCceecc------C-ceeccCccCCC--cCCchhhcccccCCCceeeccccccccccc
Confidence            1 3788888886      2 99999999998  2222222  122345566666644444443


No 16 
>KOG1225|consensus
Probab=98.47  E-value=6.1e-07  Score=102.86  Aligned_cols=88  Identities=27%  Similarity=0.722  Sum_probs=72.2

Q ss_pred             ccCCCccccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCccccCCcccc
Q psy9686         616 QQNISPLKDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFENSAACV  695 (769)
Q Consensus       616 c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~~ideC~  695 (769)
                      |.+.+.|+++    +|.|++||+|.+|+.   -.|  +..|+.++.|++.      .|.|.+||+  |+.|+.+   .| 
T Consensus       256 c~~~g~c~~G----~CIC~~Gf~G~dC~e---~~C--p~~cs~~g~~~~g------~CiC~~g~~--G~dCs~~---~c-  314 (525)
T KOG1225|consen  256 CTGRGQCVEG----RCICPPGFTGDDCDE---LVC--PVDCSGGGVCVDG------ECICNPGYS--GKDCSIR---RC-  314 (525)
T ss_pred             CcccceEeCC----eEeCCCCCcCCCCCc---ccC--CcccCCCceecCC------EeecCCCcc--ccccccc---cC-
Confidence            3444678777    899999999999984   358  4559888888764      699999999  9999853   47 


Q ss_pred             CCCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCC
Q psy9686         696 TLNPCQNNATCVASPGDKQITCLCLKGFEGPHRELP  731 (769)
Q Consensus       696 s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~d  731 (769)
                       +.+|.++|.|++  |    +|.|.+||+|..|+..
T Consensus       315 -padC~g~G~Ci~--G----~C~C~~Gy~G~~C~~~  343 (525)
T KOG1225|consen  315 -PADCSGHGKCID--G----ECLCDEGYTGELCIQR  343 (525)
T ss_pred             -CccCCCCCcccC--C----ceEeCCCCcCCccccc
Confidence             688999999993  3    8999999999999865


No 17 
>KOG1226|consensus
Probab=98.28  E-value=4.7e-06  Score=97.44  Aligned_cols=101  Identities=24%  Similarity=0.613  Sum_probs=67.8

Q ss_pred             ccccCCCCC----CCcccccccccccc--CCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc-CCccccCC--CCC
Q psy9686         630 YCNCTEDFT----GEYCQFENSAACVT--LNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE-NSAACVTL--NPC  700 (769)
Q Consensus       630 ~C~C~~Gy~----G~~Ce~~~ideC~~--~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~-~ideC~s~--~pC  700 (769)
                      .|.|.+...    |.+|+.++ -.|..  ...|..+|+|.-.      +|+|.+||+  |..|+.. +.+.|.++  ..|
T Consensus       526 qC~C~~~~~~~i~G~fCECDn-fsC~r~~g~lC~g~G~C~CG------~CvC~~Gwt--G~~C~C~~std~C~~~~G~iC  596 (783)
T KOG1226|consen  526 QCVCHKPDNGKIYGKFCECDN-FSCERHKGVLCGGHGRCECG------RCVCNPGWT--GSACNCPLSTDTCESSDGQIC  596 (783)
T ss_pred             ceEecCCCCCceeeeeeeccC-cccccccCcccCCCCeEeCC------cEEcCCCCc--cCCCCCCCCCccccCCCCcee
Confidence            577776665    88999753 24641  2348888999764      599999999  6666532 56678643  248


Q ss_pred             CCCCEEecCCCCCceEEecCCC-CCCCCCCCCCCCCCCCCCCCCcee
Q psy9686         701 QNNATCVASPGDKQITCLCLKG-FEGPHRELPVESVDEPPSEDETSV  746 (769)
Q Consensus       701 ~ngg~Cvnt~Gs~~y~C~C~~G-ytG~~Ce~dideC~~npC~~GtC~  746 (769)
                      ...|+|.-.      +|+|... |.|..||... .|...+-.+..|+
T Consensus       597 SGrG~C~Cg------~C~C~~~~~sG~~CE~cp-tc~~~C~~~~~Cv  636 (783)
T KOG1226|consen  597 SGRGTCECG------RCKCTDPPYSGEFCEKCP-TCPDPCAENKSCV  636 (783)
T ss_pred             CCCceeeCC------ceEcCCCCcCcchhhcCC-CCCCcccccccch
Confidence            888888864      7999876 9999998533 3443322444453


No 18 
>KOG4260|consensus
Probab=98.21  E-value=1.4e-06  Score=90.61  Aligned_cols=150  Identities=22%  Similarity=0.468  Sum_probs=92.1

Q ss_pred             CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccccccc
Q psy9686         520 GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEITPVY  599 (769)
Q Consensus       520 s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C~~gy  599 (769)
                      ..||..+|.|.-.....+.-.|.|.+||+|..|.+-+.=+               +.   +..-..+             
T Consensus       149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~ey---------------fe---s~Rne~~-------------  197 (350)
T KOG4260|consen  149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEY---------------FE---SSRNEQH-------------  197 (350)
T ss_pred             cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHH---------------HH---hhccccc-------------
Confidence            4578888999754333234689999999999984311100               00   0000000             


Q ss_pred             ccccccccceeeccccccCCCccccCCCCccc-ccCCCCCCC--cccccccccccc-CCCCCCCCeEeeCCCCCceEeec
Q psy9686         600 LWLMKMKTLVMVQYYLQQNISPLKDGNLSFYC-NCTEDFTGE--YCQFENSAACVT-LNPCQNNATCVASPGDKQITCLC  675 (769)
Q Consensus       600 ~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C-~C~~Gy~G~--~Ce~~~ideC~~-~~pC~n~gtCvnt~Gs~~y~C~C  675 (769)
                            ..|..+.+.+   .+.|... ++-.| .|..||...  .|-  |+|+|.. +.+|.....|+|+.|+  |.|.+
T Consensus       198 ------lvCt~Ch~~C---~~~Csg~-~~k~C~kCkkGW~lde~gCv--DvnEC~~ep~~c~~~qfCvNteGS--f~C~d  263 (350)
T KOG4260|consen  198 ------LVCTACHEGC---LGVCSGE-SSKGCSKCKKGWKLDEEGCV--DVNECQNEPAPCKAHQFCVNTEGS--FKCED  263 (350)
T ss_pred             ------chhhhhhhhh---hcccCCC-CCCChhhhcccceecccccc--cHHHHhcCCCCCChhheeecCCCc--eEecc
Confidence                  1111111100   1234332 23345 599999754  666  8999983 6789988899999988  99999


Q ss_pred             CCCCeecCCCccccCCccccC-CCCCC-CCCEEecCCCCCceEEecCCCCC
Q psy9686         676 LKGGTVSSEYCQFENSAACVT-LNPCQ-NNATCVASPGDKQITCLCLKGFE  724 (769)
Q Consensus       676 ~~Gftg~G~~C~~~~ideC~s-~~pC~-ngg~Cvnt~Gs~~y~C~C~~Gyt  724 (769)
                      .+||.+ |       +|+|.. ...|. .+..|.++++.  |+|.|..|+.
T Consensus       264 k~Gy~~-g-------~d~C~~~~d~~~~kn~~c~ni~~~--~r~v~f~~~~  304 (350)
T KOG4260|consen  264 KEGYKK-G-------VDECQFCADVCASKNRPCMNIDGQ--YRCVCFSGLI  304 (350)
T ss_pred             cccccC-C-------hHHhhhhhhhcccCCCCcccCCcc--EEEEecccce
Confidence            999974 2       334430 13343 23468888877  9999999873


No 19 
>KOG4260|consensus
Probab=98.20  E-value=2.3e-06  Score=89.03  Aligned_cols=120  Identities=19%  Similarity=0.466  Sum_probs=82.5

Q ss_pred             CCCcccccCCCCCCCcccccccccccc-C----C---CCCC--CCeEeeCCCCCceEe-ecCCCCeecCCCccccCCccc
Q psy9686         626 NLSFYCNCTEDFTGEYCQFENSAACVT-L----N---PCQN--NATCVASPGDKQITC-LCLKGGTVSSEYCQFENSAAC  694 (769)
Q Consensus       626 ~gsy~C~C~~Gy~G~~Ce~~~ideC~~-~----~---pC~n--~gtCvnt~Gs~~y~C-~C~~Gftg~G~~C~~~~ideC  694 (769)
                      .|+-.|.|.+||.|+.|..-.+..=.+ .    .   .|+.  .+.|.... +  -.| .|..||..+-..|.  |+|+|
T Consensus       165 ~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~-~--k~C~kCkkGW~lde~gCv--DvnEC  239 (350)
T KOG4260|consen  165 EGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGES-S--KGCSKCKKGWKLDEEGCV--DVNEC  239 (350)
T ss_pred             CCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCC-C--CChhhhcccceecccccc--cHHHH
Confidence            456689999999999887311100000 0    0   1222  12453321 1  345 59999998877888  89999


Q ss_pred             cC-CCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCCCCCCCC--CCC--CCCceeeCCCCccccCC
Q psy9686         695 VT-LNPCQNNATCVASPGDKQITCLCLKGFEGPHRELPVESVDE--PPS--EDETSVDLQLGSQANSY  757 (769)
Q Consensus       695 ~s-~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~~--npC--~~GtC~n~~ggy~C~~~  757 (769)
                      .. +.+|.....|+|+.||  |.|.+++||.+.     +|+|.-  .-|  .++.|-|+.++|+|-.+
T Consensus       240 ~~ep~~c~~~qfCvNteGS--f~C~dk~Gy~~g-----~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f  300 (350)
T KOG4260|consen  240 QNEPAPCKAHQFCVNTEGS--FKCEDKEGYKKG-----VDECQFCADVCASKNRPCMNIDGQYRCVCF  300 (350)
T ss_pred             hcCCCCCChhheeecCCCc--eEecccccccCC-----hHHhhhhhhhcccCCCCcccCCccEEEEec
Confidence            73 5679888899999998  999999999873     455554  223  57889999999998765


No 20 
>KOG0994|consensus
Probab=97.88  E-value=4.6e-05  Score=91.40  Aligned_cols=208  Identities=16%  Similarity=0.293  Sum_probs=113.4

Q ss_pred             eccccccccccCCCccccCCCCCCCCCCC--------cEEeeCCCCccceEeeCCCCCcCCCCcccccchhc----cccC
Q psy9686         498 SQELFLSSALKKPGAAALGGCAGRPCRNN--------GTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLW----LMKL  565 (769)
Q Consensus       498 ~~~~~~~~~~~~~gc~dideC~s~PC~ng--------GtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~----~~~~  565 (769)
                      |..+|++.-.-..+    ..|.+-||..+        -.|...+.. ....|.|.+||+|.+|+.   |+.+    ...+
T Consensus       890 Cl~GyyGdP~lg~g----~~CrPCpCP~gp~Sg~~~A~sC~~d~~t-~~ivC~C~~GY~G~RCe~---CA~~~fGnP~~G  961 (1758)
T KOG0994|consen  890 CLDGYYGDPRLGSG----IGCRPCPCPDGPASGRQHADSCYLDTRT-QQIVCHCQEGYSGSRCEI---CADNHFGNPSEG  961 (1758)
T ss_pred             hhccccCCcccCCC----CCCCCCCCCCCCccchhccccccccccc-cceeeecccCccccchhh---hcccccCCcccC
Confidence            55566554333322    34555555443        246544332 457899999999999953   5544    3347


Q ss_pred             ceeeeee--eecccCCCCCCCC-CCCcCC---------CcccccccccccccccceeeccccccCCCccccCCCCccccc
Q psy9686         566 KTLVMVQ--YYFRRKSTSPHKS-RGQGQS---------GLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNC  633 (769)
Q Consensus       566 ~~~~~~~--~~~~~Cs~~pC~~-~g~C~~---------~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C  633 (769)
                      ++|..|+  ..++.=.+..|.. .|.|..         +--|.+||.+......|..+.-+.-.-+.+|.=..-+-.|-|
T Consensus       962 GtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpC 1041 (1758)
T KOG0994|consen  962 GTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPC 1041 (1758)
T ss_pred             CccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhheccccccCCccccccccCcCCC
Confidence            8885443  3444444555543 244433         236899998876666665544333222222222223347888


Q ss_pred             CCCCCCCcccccccccccc------------CCCCCC--CCeEeeCCCCCceEeecCCCCeecCCCcccc------CCc-
Q psy9686         634 TEDFTGEYCQFENSAACVT------------LNPCQN--NATCVASPGDKQITCLCLKGGTVSSEYCQFE------NSA-  692 (769)
Q Consensus       634 ~~Gy~G~~Ce~~~ideC~~------------~~pC~n--~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~------~id-  692 (769)
                      .+...|..|+.     |+.            +-.|..  +-+|....|    .|.|.|||-  |.+|..-      +.+ 
T Consensus      1042 lpNv~G~~CDq-----CA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG----QCqCkpGfG--GR~C~qCqel~WGdP~~ 1110 (1758)
T KOG0994|consen 1042 LPNVQGVRCDQ-----CAENHWNLASGEGCEPCNCDPIGGPQCNEFTG----QCQCKPGFG--GRTCSQCQELYWGDPNE 1110 (1758)
T ss_pred             Ccccccccccc-----cccchhccccCCCCCccCCCccCCcccccccc----ceeccCCCC--CcchhHHHHhhcCCCCC
Confidence            88888887762     431            112222  225544444    599999997  7777620      000 


Q ss_pred             cccCCCCCCCCC----EEecCCCCCceEEecCCCCCCCCCC
Q psy9686         693 ACVTLNPCQNNA----TCVASPGDKQITCLCLKGFEGPHRE  729 (769)
Q Consensus       693 eC~s~~pC~ngg----~Cvnt~Gs~~y~C~C~~GytG~~Ce  729 (769)
                      .|. .-.|...|    +|....|    +|.|.+|-.|..|.
T Consensus      1111 ~C~-aCdCd~rG~~tpQCdr~tG----~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1111 KCR-ACDCDPRGIETPQCDRATG----RCVCRPGVGGPRCD 1146 (1758)
T ss_pred             Cce-ecCCCCCCCCCCCccccCC----ceeecCCCCCcchh
Confidence            121 11233222    3444433    78888888888884


No 21 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.56  E-value=4.1e-05  Score=55.55  Aligned_cols=31  Identities=45%  Similarity=1.241  Sum_probs=27.4

Q ss_pred             CCCCCCCCCcEEeeCC-CCccceEeeCCCCCcCCC
Q psy9686         518 CAGRPCRNNGTCTPVS-GGVVNFTCTCPSGGQGQS  551 (769)
Q Consensus       518 C~s~PC~ngGtCvn~~-gs~~~y~C~C~~GftG~~  551 (769)
                      |.++||.|+|+|++.. +   .|.|.|++||+|++
T Consensus         1 C~~~~C~n~g~C~~~~~~---~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    1 CSSNPCQNGGTCIDLPGG---GYTCECPPGYTGKR   32 (32)
T ss_dssp             TTTTSSTTTEEEEEESTS---EEEEEEBTTEESTT
T ss_pred             CCCCcCCCCeEEEeCCCC---CEEeECCCCCccCC
Confidence            6678999999999988 4   49999999999974


No 22 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.47  E-value=0.00016  Score=53.66  Aligned_cols=36  Identities=39%  Similarity=1.039  Sum_probs=32.1

Q ss_pred             ccCCCCC-CCCCCCcEEeeCCCCccceEeeCCCCCc-CCCC
Q psy9686         514 ALGGCAG-RPCRNNGTCTPVSGGVVNFTCTCPSGGQ-GQSG  552 (769)
Q Consensus       514 dideC~s-~PC~ngGtCvn~~gs~~~y~C~C~~Gft-G~~C  552 (769)
                      ++++|.. .||.++++|++..++   |.|.|++||. |..|
T Consensus         1 d~~~C~~~~~C~~~~~C~~~~g~---~~C~C~~g~~~g~~C   38 (39)
T smart00179        1 DIDECASGNPCQNGGTCVNTVGS---YRCECPPGYTDGRNC   38 (39)
T ss_pred             CcccCcCCCCcCCCCEeECCCCC---eEeECCCCCccCCcC
Confidence            3678987 799999999999887   9999999999 8877


No 23 
>KOG1226|consensus
Probab=97.38  E-value=0.00047  Score=81.16  Aligned_cols=95  Identities=21%  Similarity=0.651  Sum_probs=66.6

Q ss_pred             ccccCCCCCCCcccccc--------cccccc---CCCCCCCCeEeeCCCCCceEeecCCCCee--cCCCccccCCccccC
Q psy9686         630 YCNCTEDFTGEYCQFEN--------SAACVT---LNPCQNNATCVASPGDKQITCLCLKGGTV--SSEYCQFENSAACVT  696 (769)
Q Consensus       630 ~C~C~~Gy~G~~Ce~~~--------ideC~~---~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg--~G~~C~~~~ideC~s  696 (769)
                      .|.|.+||.|+.|+...        .+.|..   .-+|...|.|+=.      +|.|.+...+  .|+.|+. +.-.|..
T Consensus       479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG------qC~C~~~~~~~i~G~fCEC-DnfsC~r  551 (783)
T KOG1226|consen  479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG------QCVCHKPDNGKIYGKFCEC-DNFSCER  551 (783)
T ss_pred             ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC------ceEecCCCCCceeeeeeec-cCccccc
Confidence            67899999999998631        133431   2268888888764      5999876442  4888986 3335642


Q ss_pred             --CCCCCCCCEEecCCCCCceEEecCCCCCCCCCCC--CCCCCCC
Q psy9686         697 --LNPCQNNATCVASPGDKQITCLCLKGFEGPHREL--PVESVDE  737 (769)
Q Consensus       697 --~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~--dideC~~  737 (769)
                        ...|..+|.|.-.      +|.|.+||+|..|+-  +.+.|.+
T Consensus       552 ~~g~lC~g~G~C~CG------~CvC~~GwtG~~C~C~~std~C~~  590 (783)
T KOG1226|consen  552 HKGVLCGGHGRCECG------RCVCNPGWTGSACNCPLSTDTCES  590 (783)
T ss_pred             ccCcccCCCCeEeCC------cEEcCCCCccCCCCCCCCCccccC
Confidence              2359888898764      899999999998763  4555665


No 24 
>PF00008 EGF:  EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry;  InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.24  E-value=0.00015  Score=52.67  Aligned_cols=30  Identities=43%  Similarity=0.953  Sum_probs=25.9

Q ss_pred             CCCCCCCCEEecCCCCCceEEecCCCCCCCC
Q psy9686         697 LNPCQNNATCVASPGDKQITCLCLKGFEGPH  727 (769)
Q Consensus       697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~  727 (769)
                      +++|.|+|+|++.. .++|+|+|++||+|++
T Consensus         3 ~~~C~n~g~C~~~~-~~~y~C~C~~G~~G~~   32 (32)
T PF00008_consen    3 SNPCQNGGTCIDLP-GGGYTCECPPGYTGKR   32 (32)
T ss_dssp             TTSSTTTEEEEEES-TSEEEEEEBTTEESTT
T ss_pred             CCcCCCCeEEEeCC-CCCEEeECCCCCccCC
Confidence            67999999999988 2349999999999974


No 25 
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.24  E-value=0.00044  Score=51.20  Aligned_cols=36  Identities=42%  Similarity=0.906  Sum_probs=31.3

Q ss_pred             CccccCC-CCCCCCCEEecCCCCCceEEecCCCCC-CCCCC
Q psy9686         691 SAACVTL-NPCQNNATCVASPGDKQITCLCLKGFE-GPHRE  729 (769)
Q Consensus       691 ideC~s~-~pC~ngg~Cvnt~Gs~~y~C~C~~Gyt-G~~Ce  729 (769)
                      +++|. . .+|.+++.|+++.++  |.|.|++||. |..|+
T Consensus         2 ~~~C~-~~~~C~~~~~C~~~~g~--~~C~C~~g~~~g~~C~   39 (39)
T smart00179        2 IDECA-SGNPCQNGGTCVNTVGS--YRCECPPGYTDGRNCE   39 (39)
T ss_pred             cccCc-CCCCcCCCCEeECCCCC--eEeECCCCCccCCcCC
Confidence            57785 4 789999999999887  9999999999 98884


No 26 
>KOG0994|consensus
Probab=97.11  E-value=0.0013  Score=79.67  Aligned_cols=62  Identities=31%  Similarity=0.674  Sum_probs=39.5

Q ss_pred             eEe-ecCCCCeec-----CCCccccCCccccCCCCCCC--------CCEEecCCCCCceEEecCCCCCCCCCCCCCCCCC
Q psy9686         671 ITC-LCLKGGTVS-----SEYCQFENSAACVTLNPCQN--------NATCVASPGDKQITCLCLKGFEGPHRELPVESVD  736 (769)
Q Consensus       671 y~C-~C~~Gftg~-----G~~C~~~~ideC~s~~pC~n--------gg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~  736 (769)
                      +.| +|..||.|+     |..|.         +-||..        .-.|.-........|.|.+||+|.+|+.    |+
T Consensus       885 ~~CdrCl~GyyGdP~lg~g~~Cr---------PCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~----CA  951 (1758)
T KOG0994|consen  885 HSCDRCLDGYYGDPRLGSGIGCR---------PCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEI----CA  951 (1758)
T ss_pred             cchhhhhccccCCcccCCCCCCC---------CCCCCCCCccchhccccccccccccceeeecccCccccchhh----hc
Confidence            677 699999853     44444         223432        2245443333457899999999999974    55


Q ss_pred             C----CCCCCCce
Q psy9686         737 E----PPSEDETS  745 (769)
Q Consensus       737 ~----npC~~GtC  745 (769)
                      +    +|-++|+|
T Consensus       952 ~~~fGnP~~GGtC  964 (1758)
T KOG0994|consen  952 DNHFGNPSEGGTC  964 (1758)
T ss_pred             ccccCCcccCCcc
Confidence            4    56666665


No 27 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.06  E-value=0.00082  Score=49.03  Aligned_cols=35  Identities=40%  Similarity=1.086  Sum_probs=31.2

Q ss_pred             cCCCCC-CCCCCCcEEeeCCCCccceEeeCCCCCcCCCC
Q psy9686         515 LGGCAG-RPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSG  552 (769)
Q Consensus       515 ideC~s-~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~C  552 (769)
                      +++|.. .+|.+++.|.+..++   |.|.|++||.|..|
T Consensus         2 ~~~C~~~~~C~~~~~C~~~~~~---~~C~C~~g~~g~~C   37 (38)
T cd00054           2 IDECASGNPCQNGGTCVNTVGS---YRCSCPPGYTGRNC   37 (38)
T ss_pred             cccCCCCCCcCCCCEeECCCCC---eEeECCCCCcCCcC
Confidence            578877 799999999998887   99999999999877


No 28 
>PRK15328 invasion protein IagB; Provisional
Probab=97.04  E-value=0.00012  Score=72.53  Aligned_cols=45  Identities=31%  Similarity=0.299  Sum_probs=37.3

Q ss_pred             cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccC
Q psy9686         294 RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCT  342 (769)
Q Consensus       294 ~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~  342 (769)
                      +|++-..|   ..|++.||+||+.|++. |.+|+.|+||+|||+.+|..
T Consensus        29 y~Idp~Ll---~AIa~~ES~f~p~avs~-n~~gs~d~GLMQIn~~~~~~   73 (160)
T PRK15328         29 FNIESELL---YAIAQQESAMKPGAIGH-NRDGSTDLGLMQINSFHMKR   73 (160)
T ss_pred             cCCCHHHH---HHHHHHHcCCCCCCEec-CCCCCcccChhhcCHHHHHH
Confidence            56665444   35999999999999985 67899999999999988753


No 29 
>PRK13722 lytic transglycosylase; Provisional
Probab=96.97  E-value=0.00018  Score=71.72  Aligned_cols=44  Identities=23%  Similarity=0.192  Sum_probs=35.4

Q ss_pred             cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccccc
Q psy9686         294 RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWC  341 (769)
Q Consensus       294 ~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC  341 (769)
                      ++++-.-|.-   |+..||+||+.|++. |.+|+.|+||.|||+.++=
T Consensus        30 y~vdp~LL~A---Ii~~ES~fnp~Avs~-n~~gs~d~GLMQIn~~~~~   73 (169)
T PRK13722         30 YKIDPDLLRA---ISWKESRYRVNAIGI-NPVTGYGSGLMQVDSQHFN   73 (169)
T ss_pred             cCCCHHHHHH---HHHHHcCCCCCCeec-CCCCCeeecCCcccHHHHH
Confidence            5665544443   899999999999985 6789999999999997754


No 30 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.94  E-value=0.00051  Score=52.83  Aligned_cols=32  Identities=38%  Similarity=1.043  Sum_probs=28.5

Q ss_pred             ccCCCCC--CCCCCCcEEeeCCCCccceEeeCCCCCc
Q psy9686         514 ALGGCAG--RPCRNNGTCTPVSGGVVNFTCTCPSGGQ  548 (769)
Q Consensus       514 dideC~s--~PC~ngGtCvn~~gs~~~y~C~C~~Gft  548 (769)
                      |||||..  ++|..++.|+|+.|+   |.|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gs---y~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGS---YSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTE---EEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCC---EEeeCCCCcE
Confidence            6899964  469989999999998   9999999997


No 31 
>PF07645 EGF_CA:  Calcium-binding EGF domain;  InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes [].  +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.87  E-value=0.00074  Score=51.94  Aligned_cols=33  Identities=30%  Similarity=0.733  Sum_probs=29.1

Q ss_pred             ccccccc-CCCCCCCCeEeeCCCCCceEeecCCCCe
Q psy9686         646 NSAACVT-LNPCQNNATCVASPGDKQITCLCLKGGT  680 (769)
Q Consensus       646 ~ideC~~-~~pC~n~gtCvnt~Gs~~y~C~C~~Gft  680 (769)
                      |||||.. .+.|..++.|+|+.|+  |+|.|++||.
T Consensus         1 DidEC~~~~~~C~~~~~C~N~~Gs--y~C~C~~Gy~   34 (42)
T PF07645_consen    1 DIDECAEGPHNCPENGTCVNTEGS--YSCSCPPGYE   34 (42)
T ss_dssp             ESSTTTTTSSSSSTTSEEEEETTE--EEEEESTTEE
T ss_pred             CccccCCCCCcCCCCCEEEcCCCC--EEeeCCCCcE
Confidence            4789984 5679989999999988  9999999998


No 32 
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.76  E-value=0.0021  Score=46.79  Aligned_cols=36  Identities=42%  Similarity=0.912  Sum_probs=30.5

Q ss_pred             CccccCC-CCCCCCCEEecCCCCCceEEecCCCCCCCCCC
Q psy9686         691 SAACVTL-NPCQNNATCVASPGDKQITCLCLKGFEGPHRE  729 (769)
Q Consensus       691 ideC~s~-~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce  729 (769)
                      +++|. . .+|.+++.|++..++  |.|.|++||.|..|+
T Consensus         2 ~~~C~-~~~~C~~~~~C~~~~~~--~~C~C~~g~~g~~C~   38 (38)
T cd00054           2 IDECA-SGNPCQNGGTCVNTVGS--YRCSCPPGYTGRNCE   38 (38)
T ss_pred             cccCC-CCCCcCCCCEeECCCCC--eEeECCCCCcCCcCC
Confidence            46775 4 689988999999876  999999999998884


No 33 
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=96.62  E-value=0.00056  Score=69.61  Aligned_cols=31  Identities=35%  Similarity=0.531  Sum_probs=28.1

Q ss_pred             eeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         307 IATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       307 la~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      |+..||+ +..+++. |.|||.|+||+|||+.|
T Consensus        35 Ia~~ESg-~pgai~~-N~NGT~DvGlMQINS~h   65 (206)
T PRK13888         35 VAEKEGG-KPGQWVR-NTNGTHDVGPMQFNTAY   65 (206)
T ss_pred             HHHHhCC-CCCCEEe-CCCCCEEeCCcccccHh
Confidence            8999996 9999995 78999999999999965


No 34 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.46  E-value=0.0019  Score=48.24  Aligned_cols=32  Identities=34%  Similarity=0.709  Sum_probs=25.6

Q ss_pred             CCCCCCCCeEeeCCCCCceEeecCCCCeecCCCc
Q psy9686         653 LNPCQNNATCVASPGDKQITCLCLKGGTVSSEYC  686 (769)
Q Consensus       653 ~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C  686 (769)
                      ...|+.+++|.++.++  |.|+|.+||.|+|..|
T Consensus         5 ~~~C~~nA~C~~~~~~--~~C~C~~Gy~GdG~~C   36 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGS--YTCTCKPGYEGDGFFC   36 (36)
T ss_dssp             GGGS-TTCEEEE-TTS--EEEEE-CEEECCSTCE
T ss_pred             CCCCCCCcEeecCCCC--EEeECCCCCccCCcCC
Confidence            4578999999999886  9999999999998876


No 35 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=96.35  E-value=0.0054  Score=43.89  Aligned_cols=32  Identities=47%  Similarity=1.155  Sum_probs=27.7

Q ss_pred             CC-CCCCCCCcEEeeCCCCccceEeeCCCCCcCC-CC
Q psy9686         518 CA-GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQ-SG  552 (769)
Q Consensus       518 C~-s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~-~C  552 (769)
                      |. ..+|.+++.|++..++   |.|.|+.||.|. .|
T Consensus         2 C~~~~~C~~~~~C~~~~~~---~~C~C~~g~~g~~~C   35 (36)
T cd00053           2 CAASNPCSNGGTCVNTPGS---YRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCCCCEEecCCCC---eEeECCCCCcccCCc
Confidence            44 6789999999998876   999999999998 65


No 36 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.15  E-value=0.0016  Score=65.32  Aligned_cols=145  Identities=22%  Similarity=0.518  Sum_probs=80.0

Q ss_pred             CCCCCCcEEeeCCCCccceEeeCCCCCcC---CCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccccc
Q psy9686         521 RPCRNNGTCTPVSGGVVNFTCTCPSGGQG---QSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEITP  597 (769)
Q Consensus       521 ~PC~ngGtCvn~~gs~~~y~C~C~~GftG---~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C~~  597 (769)
                      ..|.| |..++....   |.|.|.+||.-   .+|+...+|..               -+-...+|++.+          
T Consensus         6 T~CKN-G~LiQMSNH---fEC~Cnegfvl~~EntCE~kv~C~~---------------~e~~~K~Cgdya----------   56 (197)
T PF06247_consen    6 TICKN-GYLIQMSNH---FECKCNEGFVLKNENTCEEKVECDK---------------LENVNKPCGDYA----------   56 (197)
T ss_dssp             ---BT-EEEEEESSE---EEEEESTTEEEEETTEEEE----SG----------------GGTTSEEETTE----------
T ss_pred             ccccC-CEEEEccCc---eEEEcCCCcEEccccccccceecCc---------------ccccCccccchh----------
Confidence            35666 566665554   99999999953   34776666621               122344554433          


Q ss_pred             ccccccccccceeeccccccCCCccccC-----CCCcccccCCCCCCC--ccccccccccccCCCCCCCCeEeeCCCC-C
Q psy9686         598 VYLWLMKMKTLVMVQYYLQQNISPLKDG-----NLSFYCNCTEDFTGE--YCQFENSAACVTLNPCQNNATCVASPGD-K  669 (769)
Q Consensus       598 gy~~~~~~~~C~~~~~~~c~ngg~C~n~-----~gsy~C~C~~Gy~G~--~Ce~~~ideC~~~~pC~n~gtCvnt~Gs-~  669 (769)
                                             .|++.     ...|.|.|.+||.-.  .|.   .+.|. ...|. .|.|+-.+.. .
T Consensus        57 -----------------------~C~~~~~~~~~~~~~C~C~~gY~~~~~vCv---p~~C~-~~~Cg-~GKCI~d~~~~~  108 (197)
T PF06247_consen   57 -----------------------KCINQANKGEERAYKCDCINGYILKQGVCV---PNKCN-NKDCG-SGKCILDPDNPN  108 (197)
T ss_dssp             -----------------------EEEE-SSTTSSTSEEEEE-TTEEESSSSEE---EGGGS-S---T-TEEEEEEEGGGS
T ss_pred             -----------------------hhhcCCCcccceeEEEecccCceeeCCeEc---hhhcC-ceecC-CCeEEecCCCCC
Confidence                                   34332     356999999999743  554   34665 45576 5788754322 2


Q ss_pred             ceEeecCCCCe-ecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686         670 QITCLCLKGGT-VSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGP  726 (769)
Q Consensus       670 ~y~C~C~~Gft-g~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~  726 (769)
                      ...|+|.-|+. .++..|+..-...|  +--|..+-.|..+.+.  |+|.|.+|+.|.
T Consensus       109 ~~~CSC~IGkV~~dn~kCtk~G~T~C--~LKCk~nE~CK~~~~~--Y~C~~~~~~~~~  162 (197)
T PF06247_consen  109 NPTCSCNIGKVPDDNKKCTKTGETKC--SLKCKENEECKLVDGY--YKCVCKEGFPGD  162 (197)
T ss_dssp             EEEEEE-TEEETTTTTESEEEE----------TTTEEEEEETTE--EEEEE-TT-EEE
T ss_pred             CceeEeeeceEeccCCcccCCCccce--eeecCCCcceeeeCcE--EEeecCCCCCCC
Confidence            36899999987 45567765333456  3557777799999877  999999999643


No 37 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.06  E-value=0.009  Score=43.36  Aligned_cols=32  Identities=44%  Similarity=1.083  Sum_probs=27.1

Q ss_pred             CCCC-CCCCCCcEEeeCCCCccceEeeCCCCCcC-CCC
Q psy9686         517 GCAG-RPCRNNGTCTPVSGGVVNFTCTCPSGGQG-QSG  552 (769)
Q Consensus       517 eC~s-~PC~ngGtCvn~~gs~~~y~C~C~~GftG-~~C  552 (769)
                      +|.. .+|.++ +|++..++   |.|.|++||.| ..|
T Consensus         1 ~C~~~~~C~~~-~C~~~~~~---~~C~C~~g~~g~~~C   34 (35)
T smart00181        1 ECASGGPCSNG-TCINTPGS---YTCSCPPGYTGDKRC   34 (35)
T ss_pred             CCCCcCCCCCC-EEECCCCC---eEeECCCCCccCCcc
Confidence            3555 689998 99998776   99999999999 766


No 38 
>PF01464 SLT:  Transglycosylase SLT domain;  InterPro: IPR008258 Bacterial lytic transglycosylases degrade murein via cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid and N-acetylglucosamine, with the concomitant formation of a 1,6-anhydrobond in the muramic acid residue. There are both soluble (Slt enzymes) and membrane-bound (Mlt enzymes) lytic transglycosylases that differ in size, sequence, activity, specificity and location. The multi-domain structure of the 70 Kd soluble lytic transglycosylase Slt70 is known []. Slt70 has 3 distinct domains, each rich in alpha helices: an N-terminal superhelical U-shaped domain (U-domain; IPR008939 from INTERPRO), a superhelical linker domain (L-domain, IPR012289 from INTERPRO), and a C-terminal catalytic domain (IPR008258 from INTERPRO). Both the U- and L-domain share a similar superhelical structure. These two domains are connected, and together form a closed ring with a large central hole; the catalytic domain is packed on top of, and interacts with, this ring. The catalytic domain has a lysosome-like fold.  This entry represents the catalytic domain, which is structurally conserved in some membrane-bound lytic glycosylases and in bacteriophage transglycosylases, even though their sequences can differ considerably proteins []. The most conserved part of this domain is its N-terminal extremity that contains two conserved serines and a glutamate, which have been shown [] to be involved in the catalytic mechanism. This family is distantly related to IPR001916 from INTERPRO.; PDB: 2Y8P_B 3MGW_A 1QTE_A 1SLY_A 1QSA_A 153L_A 154L_A 3BKV_A 3BKH_A 1LSP_A ....
Probab=95.74  E-value=0.0021  Score=59.84  Aligned_cols=64  Identities=30%  Similarity=0.274  Sum_probs=45.3

Q ss_pred             eeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCCccccccccccccCCChhHHHHHHHHHHhcc
Q psy9686         307 IATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT  377 (769)
Q Consensus       307 la~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~~~~C~~~C~~ll~~di~~~i~Cak~i~~~~  377 (769)
                      |++.||+|++.+++.. .+++.++|||||+...|=.-....+     .-.+++ .|....|..+-+++++.
T Consensus        20 i~~~ES~~~~~a~~~~-~~~~~a~GlmQi~~~t~~~~~~~~~-----~~~~~~-~dp~~ni~~g~~~L~~~   83 (121)
T PF01464_consen   20 IAYQESGFNPNAVSYN-SNGSGAYGLMQINPATWKDYGRKGG-----LSPDDL-FDPEDNIQAGAKYLKWL   83 (121)
T ss_dssp             HHHHHHTTCTTTBTTB-ETTTTEETTTTEETTTSHHHSHTHC-----TSHHHH-HHHHHHHHHHHHHHHHH
T ss_pred             HHHHhcCCCCCCcCCC-CCCCcceeccccCCcccchhhhccc-----ccchhh-hCHHHHHHHHHHHHHHH
Confidence            8999999999999853 6788899999999988865321100     122333 35566777877777763


No 39 
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at  least  one  is  present  in  most EGF-like domains; a subset of these bind calcium.
Probab=95.73  E-value=0.017  Score=41.26  Aligned_cols=30  Identities=43%  Similarity=1.056  Sum_probs=25.9

Q ss_pred             CCCCCCCCEEecCCCCCceEEecCCCCCCC-CC
Q psy9686         697 LNPCQNNATCVASPGDKQITCLCLKGFEGP-HR  728 (769)
Q Consensus       697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~-~C  728 (769)
                      ..+|.+++.|++..+.  |+|.|++||.|. .|
T Consensus         5 ~~~C~~~~~C~~~~~~--~~C~C~~g~~g~~~C   35 (36)
T cd00053           5 SNPCSNGGTCVNTPGS--YRCVCPPGYTGDRSC   35 (36)
T ss_pred             CCCCCCCCEEecCCCC--eEeECCCCCcccCCc
Confidence            4578888999998866  999999999998 66


No 40 
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.43  E-value=0.022  Score=41.25  Aligned_cols=29  Identities=45%  Similarity=1.106  Sum_probs=24.7

Q ss_pred             CCCCCCCEEecCCCCCceEEecCCCCCC-CCCC
Q psy9686         698 NPCQNNATCVASPGDKQITCLCLKGFEG-PHRE  729 (769)
Q Consensus       698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG-~~Ce  729 (769)
                      .+|.++ .|+++.++  |+|.|++||.| ..|+
T Consensus         6 ~~C~~~-~C~~~~~~--~~C~C~~g~~g~~~C~   35 (35)
T smart00181        6 GPCSNG-TCINTPGS--YTCSCPPGYTGDKRCE   35 (35)
T ss_pred             CCCCCC-EEECCCCC--eEeECCCCCccCCccC
Confidence            578888 99998766  99999999999 7763


No 41 
>KOG1836|consensus
Probab=95.29  E-value=0.18  Score=65.94  Aligned_cols=197  Identities=21%  Similarity=0.416  Sum_probs=98.5

Q ss_pred             CCCCCCCCCcEEeeCCCCccceEee-CCCCCcCCCCcccccchh-ccc-cC---ceeeeeeeec--ccCCCCCCCCC-CC
Q psy9686         518 CAGRPCRNNGTCTPVSGGVVNFTCT-CPSGGQGQSGLEITPVYL-WLM-KL---KTLVMVQYYF--RRKSTSPHKSR-GQ  588 (769)
Q Consensus       518 C~s~PC~ngGtCvn~~gs~~~y~C~-C~~GftG~~CedideC~~-~~~-~~---~~~~~~~~~~--~~Cs~~pC~~~-g~  588 (769)
                      |.+-||.+++.|..... .....|+ |++||+|.+|+.-++=+. +.. .+   ..|..|+...  +.-....|... +.
T Consensus       777 C~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~  855 (1705)
T KOG1836|consen  777 CQPCPCPNGGACGQTPE-ILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGE  855 (1705)
T ss_pred             CccCCCCCChhhcCcCc-ccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCccccccccccccc
Confidence            88888999999988764 2457899 999999999954222111 111 11   2333222222  21122223221 22


Q ss_pred             c---------CCCccccccccccccc----ccceeeccccc---cCCCccccCCCCcccccCCCCCCCccccc-----cc
Q psy9686         589 G---------QSGLEITPVYLWLMKM----KTLVMVQYYLQ---QNISPLKDGNLSFYCNCTEDFTGEYCQFE-----NS  647 (769)
Q Consensus       589 C---------~~~~~C~~gy~~~~~~----~~C~~~~~~~c---~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~-----~i  647 (769)
                      |         ..+..|.++|.+..-.    ..|..++-..-   ....+|..-.|  .|.|.+.-.|..|..-     ++
T Consensus       856 c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tG--Qcec~~~v~g~~c~~c~~g~fnl  933 (1705)
T KOG1836|consen  856 CLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTG--QCECKPNVEGRDCLYCFKGFFNL  933 (1705)
T ss_pred             eeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCCccc--ceeccCCCCcccccccccccccc
Confidence            3         2233677887764322    33443332110   00122433322  5556555555544320     00


Q ss_pred             c---ccccCCCCCCC----CeEeeCCCCCceEeecCCCCeecCCCcccc-------CCccccCCCCCCCCC----EEecC
Q psy9686         648 A---ACVTLNPCQNN----ATCVASPGDKQITCLCLKGGTVSSEYCQFE-------NSAACVTLNPCQNNA----TCVAS  709 (769)
Q Consensus       648 d---eC~~~~pC~n~----gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~-------~ideC~s~~pC~ngg----~Cvnt  709 (769)
                      +   .|. +..|..-    ..|....|    .|.|.+|-+  |..|..-       ....|. .-.|...|    +|...
T Consensus       934 ~s~~gC~-~c~c~~~gs~~~~c~~~tG----qc~c~~gVt--gqrc~qc~~~~~~~~~~gc~-~c~c~~~Gs~~~qc~~~ 1005 (1705)
T KOG1836|consen  934 NSGVGCE-PCNCDPTGSESSDCDVGTG----QCYCRPGVT--GQRCDQCETYHFGFQTEGCG-LCECDPLGSRGFQCDPE 1005 (1705)
T ss_pred             CCCCCcc-cccccccccccccccccCC----ceeeecCcc--ccccCccccCcccccccCCc-ceecccCCcccceeccc
Confidence            1   222 2233322    25655544    488888887  6555420       001121 11244433    67765


Q ss_pred             CCCCceEEecCCCCCCCCCC
Q psy9686         710 PGDKQITCLCLKGFEGPHRE  729 (769)
Q Consensus       710 ~Gs~~y~C~C~~GytG~~Ce  729 (769)
                      .|    +|.|++|+.|.+|.
T Consensus      1006 ~G----~c~c~~~~~g~~c~ 1021 (1705)
T KOG1836|consen 1006 DG----QCPCRPGFEGRRCD 1021 (1705)
T ss_pred             CC----eeeecCCCCCcccc
Confidence            44    89999999998874


No 42 
>PRK13864 type IV secretion system lytic transglycosylase VirB1; Provisional
Probab=95.08  E-value=0.007  Score=63.76  Aligned_cols=33  Identities=42%  Similarity=0.473  Sum_probs=28.7

Q ss_pred             eeccccCCCcccccCCCCCC-----------------------Cceeeeeeeccccc
Q psy9686         307 IATKESNRNSNARSPKNGNG-----------------------SRDHGIFQINDKYW  340 (769)
Q Consensus       307 la~~eS~~~t~a~~~~~~~g-----------------------s~~yGifQIn~~~W  340 (769)
                      ||..||+||+-||+. |.+|                       +.|.||.|||+++.
T Consensus        49 Ia~vESg~nP~AI~~-N~~G~~l~~qs~~EAi~~a~~l~~~g~s~DvGLmQINs~nl  104 (245)
T PRK13864         49 IAKVESRFDPLAVHD-NTTGETLHWQNHAQATQSVRHRLEARHSLDVGLMQINSKNF  104 (245)
T ss_pred             HHHHhccCCCcceee-cCCCCccCCCCHHHHHHHHHHHHhcCCceeeechhccHHHH
Confidence            999999999999984 5566                       79999999999663


No 43 
>cd00254 LT_GEWL Lytic Transglycosylase (LT)  and Goose Egg White Lysozyme (GEWL) domain. Members include the soluble and  insoluble membrane-bound LTs in bacteria, the LTs in bacteriophage lambda, as well as, the eukaryotic "goose-type" lysozymes (GEWL).  LTs catalyze the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetyl-D-glucosamine (GlcNAc), as do "goose-type" lysozymes. However, in addition to this, they also make a new glycosidic bond with the C6 hydroxyl group of the same muramic acid residue.
Probab=94.86  E-value=0.01  Score=53.82  Aligned_cols=61  Identities=18%  Similarity=0.151  Sum_probs=43.2

Q ss_pred             eeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCCccccccccccccCCChhHHHHHHHHHHhcc
Q psy9686         307 IATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT  377 (769)
Q Consensus       307 la~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~~~~C~~~C~~ll~~di~~~i~Cak~i~~~~  377 (769)
                      |++.||+|+..+++    +++.++|||||+...|-.-.....    .  ....--|..+.|.-|-++++..
T Consensus         7 i~~~ES~~~~~a~~----~~~~a~GlmQi~~~~~~~~~~~~~----~--~~~~~~dp~~ni~~~~~~l~~~   67 (113)
T cd00254           7 IIRQESGFNPNAVN----SGAGAVGLMQFMPSTARAYGVDGG----D--GADDLFDPEDNIRAGARYLKEL   67 (113)
T ss_pred             HHHHHCCCCCCCcc----CCCCCeeeeeeCchHHHHHHHhcC----C--ChhhcCCHHHHHHHHHHHHHHH
Confidence            78899999999984    468899999999999975332110    0  0112246777888888887764


No 44 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=94.85  E-value=0.02  Score=39.02  Aligned_cols=19  Identities=47%  Similarity=1.001  Sum_probs=16.5

Q ss_pred             eEeeCCCCCc----CCCCccccc
Q psy9686         539 FTCTCPSGGQ----GQSGLEITP  557 (769)
Q Consensus       539 y~C~C~~Gft----G~~Cedide  557 (769)
                      |+|.|++||.    |.+|+||||
T Consensus         2 y~C~C~~Gy~l~~d~~~C~DIdE   24 (24)
T PF12662_consen    2 YTCSCPPGYQLSPDGRSCEDIDE   24 (24)
T ss_pred             EEeeCCCCCcCCCCCCccccCCC
Confidence            9999999995    667999886


No 45 
>PF12662 cEGF:  Complement Clr-like EGF-like
Probab=94.84  E-value=0.022  Score=38.77  Aligned_cols=19  Identities=32%  Similarity=0.796  Sum_probs=16.6

Q ss_pred             eEEecCCCCC----CCCCCCCCCC
Q psy9686         715 ITCLCLKGFE----GPHRELPVES  734 (769)
Q Consensus       715 y~C~C~~Gyt----G~~Ce~dide  734 (769)
                      |+|.|++||.    |..|+ ||||
T Consensus         2 y~C~C~~Gy~l~~d~~~C~-DIdE   24 (24)
T PF12662_consen    2 YTCSCPPGYQLSPDGRSCE-DIDE   24 (24)
T ss_pred             EEeeCCCCCcCCCCCCccc-cCCC
Confidence            9999999996    78895 8886


No 46 
>PF06247 Plasmod_Pvs28:  Plasmodium ookinete surface protein Pvs28;  InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=94.20  E-value=0.013  Score=59.03  Aligned_cols=129  Identities=21%  Similarity=0.426  Sum_probs=79.8

Q ss_pred             cccCCCCcccccCCCCC---CCcccccccccccc----CCCCCCCCeEeeCCC---CCceEeecCCCCeecCCCccccCC
Q psy9686         622 LKDGNLSFYCNCTEDFT---GEYCQFENSAACVT----LNPCQNNATCVASPG---DKQITCLCLKGGTVSSEYCQFENS  691 (769)
Q Consensus       622 C~n~~gsy~C~C~~Gy~---G~~Ce~~~ideC~~----~~pC~n~gtCvnt~G---s~~y~C~C~~Gftg~G~~C~~~~i  691 (769)
                      .+...+-|.|.|.+||.   -.+|+.  ..+|..    ..+|..-+.|++...   ...|.|.|.+||+.....|..   
T Consensus        13 LiQMSNHfEC~Cnegfvl~~EntCE~--kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp---   87 (197)
T PF06247_consen   13 LIQMSNHFECKCNEGFVLKNENTCEE--KVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVP---   87 (197)
T ss_dssp             EEEESSEEEEEESTTEEEEETTEEEE------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEE---
T ss_pred             EEEccCceEEEcCCCcEEcccccccc--ceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEch---
Confidence            33445678999999996   457884  346752    346988999998763   246999999999976667763   


Q ss_pred             ccccCCCCCCCCCEEecCCC-CCceEEecCCCCC---CCCCCCCCC-CCCCCCCCCCceeeCCCCccccCC
Q psy9686         692 AACVTLNPCQNNATCVASPG-DKQITCLCLKGFE---GPHRELPVE-SVDEPPSEDETSVDLQLGSQANSY  757 (769)
Q Consensus       692 deC~s~~pC~ngg~Cvnt~G-s~~y~C~C~~Gyt---G~~Ce~did-eC~~npC~~GtC~n~~ggy~C~~~  757 (769)
                      +.|. ...|. .|.|+-.+. .+...|.|.-|+.   ...|..+-+ .|+.-+-.+..|+...+=|+|..-
T Consensus        88 ~~C~-~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~  156 (197)
T PF06247_consen   88 NKCN-NKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCK  156 (197)
T ss_dssp             GGGS-S---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-
T ss_pred             hhcC-ceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecC
Confidence            4785 66787 458975432 2346999999986   444654322 255554467899999999999765


No 47 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.07  E-value=0.025  Score=42.26  Aligned_cols=29  Identities=41%  Similarity=0.863  Sum_probs=22.2

Q ss_pred             CCCCCCCCEEecCCCCCceEEecCCCCCCCC
Q psy9686         697 LNPCQNNATCVASPGDKQITCLCLKGFEGPH  727 (769)
Q Consensus       697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~  727 (769)
                      ...|+.+|+|+++.++  |.|.|++||.|.-
T Consensus         5 ~~~C~~nA~C~~~~~~--~~C~C~~Gy~GdG   33 (36)
T PF12947_consen    5 NGGCHPNATCTNTGGS--YTCTCKPGYEGDG   33 (36)
T ss_dssp             GGGS-TTCEEEE-TTS--EEEEE-CEEECCS
T ss_pred             CCCCCCCcEeecCCCC--EEeECCCCCccCC
Confidence            3568999999999886  9999999998763


No 48 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.61  E-value=0.088  Score=38.37  Aligned_cols=27  Identities=37%  Similarity=0.876  Sum_probs=22.8

Q ss_pred             CCCCCCCEEecCCCCCceEEecCCCCCCCCC
Q psy9686         698 NPCQNNATCVASPGDKQITCLCLKGFEGPHR  728 (769)
Q Consensus       698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C  728 (769)
                      ..|.++|+|+...    .+|.|.+||+|..|
T Consensus         6 ~~C~~~G~C~~~~----g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPC----GRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCC----CEEECCCCCcCCCC
Confidence            3588999999763    48999999999987


No 49 
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.39  E-value=0.11  Score=37.89  Aligned_cols=27  Identities=33%  Similarity=0.698  Sum_probs=22.8

Q ss_pred             CCCCCCcEEeeCCCCccceEeeCCCCCcCCCC
Q psy9686         521 RPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSG  552 (769)
Q Consensus       521 ~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~C  552 (769)
                      ..|.++|+|+...     ..|.|++||+|..|
T Consensus         6 ~~C~~~G~C~~~~-----g~C~C~~g~~G~~C   32 (32)
T PF07974_consen    6 NICSGHGTCVSPC-----GRCVCDSGYTGPDC   32 (32)
T ss_pred             CccCCCCEEeCCC-----CEEECCCCCcCCCC
Confidence            4699999999752     48999999999876


No 50 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.20  E-value=0.038  Score=32.24  Aligned_cols=13  Identities=46%  Similarity=1.135  Sum_probs=10.9

Q ss_pred             EEecCCCCCCCCC
Q psy9686         716 TCLCLKGFEGPHR  728 (769)
Q Consensus       716 ~C~C~~GytG~~C  728 (769)
                      .|+|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            5999999999987


No 51 
>PF06737 Transglycosylas:  Transglycosylase-like domain;  InterPro: IPR010618 This family of proteins is very likely to act as transglycosylase enzymes related to IPR001916 from INTERPRO and IPR008258 from INTERPRO. These other families are weakly matched by this family, and include the known active site residues.; GO: 0005576 extracellular region; PDB: 1XSF_A 3EO5_A.
Probab=92.59  E-value=0.024  Score=49.45  Aligned_cols=71  Identities=24%  Similarity=0.419  Sum_probs=47.8

Q ss_pred             cccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCC--ccccccccccccCCChhHHHHHHHHHHhccc
Q psy9686         301 VPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPA--GKECHAKCSSFEDNNITDDVACVVKIHSQTQ  378 (769)
Q Consensus       301 l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~--~~~C~~~C~~ll~~di~~~i~Cak~i~~~~~  378 (769)
                      -.+|--||+-||+-|- ++|.    |+..||+||+...-|=..++..  .+      -++  .--..-|.=|++|+..  
T Consensus         4 ~~~Wd~ia~cESgGn~-~~nt----gnG~yGg~Qf~~sTW~a~Gg~~yap~------~~~--As~~eQi~~A~~l~~~--   68 (77)
T PF06737_consen    4 ASDWDAIAQCESGGNW-AINT----GNGYYGGLQFSQSTWRAYGGSGYAPR------PDQ--ASRAEQIAVAEKLYAR--   68 (77)
T ss_dssp             HHHHHHHHHHHCTT-T-T-------SSSBBTTTTBBHHHHHHTTGGGT-SS------CCC--S-HHHHHHHHHHHHHH--
T ss_pred             hHHHHHHHHHHcCCCC-ccCC----CCCccceeccCHHHHHHhCCCcCCCC------hhh--CCHHHHHHHHHHHHHH--
Confidence            3578889999999999 5542    5668999999999997654321  00      011  1234557788999998  


Q ss_pred             ccccCccccccc
Q psy9686         379 RARGNGFQAWST  390 (769)
Q Consensus       379 ~~~~~G~~aW~~  390 (769)
                          +|+.+|+.
T Consensus        69 ----qG~gaWp~   76 (77)
T PF06737_consen   69 ----QGWGAWPV   76 (77)
T ss_dssp             ----HSSTTSTC
T ss_pred             ----cCCCcCCC
Confidence                79999984


No 52 
>PF12661 hEGF:  Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=91.67  E-value=0.075  Score=31.04  Aligned_cols=13  Identities=46%  Similarity=1.060  Sum_probs=10.7

Q ss_pred             EeeCCCCCcCCCC
Q psy9686         540 TCTCPSGGQGQSG  552 (769)
Q Consensus       540 ~C~C~~GftG~~C  552 (769)
                      .|.|++||+|.+|
T Consensus         1 ~C~C~~G~~G~~C   13 (13)
T PF12661_consen    1 TCQCPPGWTGPNC   13 (13)
T ss_dssp             EEEE-TTEETTTT
T ss_pred             CccCcCCCcCCCC
Confidence            4999999999987


No 53 
>cd01021 GEWL Goose Egg White Lysozyme domain. Eukaryotic "goose-type" lysozymes (GEWL). These enzymes catalyze the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetylglucosamine (GlcNAc).  Members include tunicate, Japanese flounder, ostrich, and mouse.
Probab=90.62  E-value=0.13  Score=51.53  Aligned_cols=49  Identities=16%  Similarity=0.284  Sum_probs=34.7

Q ss_pred             HHHHHHhcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         287 AKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       287 a~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      .++-.++++++-.-|.-   |+..||+|++.|++. +.+...++||+||+...
T Consensus        35 i~~aa~~y~idp~Li~A---ii~~ES~f~~~a~~~-~~s~~ga~GLMQi~p~t   83 (166)
T cd01021          35 IKKVGQKKCIDPAVIAA---IISRESRAGKVLKNG-WGGAGNGFGLMQVDKRS   83 (166)
T ss_pred             HHHHHHHhCcCHHHHHH---HHHHHcCCCcccccC-CCCCCCCeEEeeECccc
Confidence            33333345555443333   889999999999863 45677899999999864


No 54 
>smart00051 DSL delta serrate ligand.
Probab=89.98  E-value=0.39  Score=40.48  Aligned_cols=47  Identities=28%  Similarity=0.441  Sum_probs=35.5

Q ss_pred             eEeecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCC
Q psy9686         671 ITCLCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHR  728 (769)
Q Consensus       671 y~C~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C  728 (769)
                      +.-.|.++|.  |..|..    .|...+.+..+.+|.. .|    .|.|.+||+|.+|
T Consensus        17 ~rv~C~~~~y--G~~C~~----~C~~~~d~~~~~~Cd~-~G----~~~C~~Gw~G~~C   63 (63)
T smart00051       17 IRVTCDENYY--GEGCNK----FCRPRDDFFGHYTCDE-NG----NKGCLEGWMGPYC   63 (63)
T ss_pred             EEeeCCCCCc--CCccCC----EeCcCccccCCccCCc-CC----CEecCCCCcCCCC
Confidence            5567999998  888873    5653455677788864 34    6999999999987


No 55 
>KOG1836|consensus
Probab=89.65  E-value=0.64  Score=61.06  Aligned_cols=56  Identities=25%  Similarity=0.498  Sum_probs=37.7

Q ss_pred             ecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEe-cCCCCCCCCCCCCCC
Q psy9686         674 LCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCL-CLKGFEGPHRELPVE  733 (769)
Q Consensus       674 ~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~-C~~GytG~~Ce~did  733 (769)
                      +|..||.++..  .- ..+.|. +-+|.+++.|..+.......|. |++||+|.+|+...+
T Consensus       760 ~C~~GfYg~~~--~~-~~~dC~-~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d  816 (1705)
T KOG1836|consen  760 QCVDGFYGLPD--LG-TSGDCQ-PCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD  816 (1705)
T ss_pred             hhcCCCCCccc--cC-CCCCCc-cCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence            57777763211  11 112264 6778888888877644557998 999999999986333


No 56 
>KOG1218|consensus
Probab=87.26  E-value=5.1  Score=43.22  Aligned_cols=80  Identities=28%  Similarity=0.609  Sum_probs=44.8

Q ss_pred             ccccCCCCcccccCCCCCCCccccc--cccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCccccCCccccCCC
Q psy9686         621 PLKDGNLSFYCNCTEDFTGEYCQFE--NSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFENSAACVTLN  698 (769)
Q Consensus       621 ~C~n~~gsy~C~C~~Gy~G~~Ce~~--~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~~ideC~s~~  698 (769)
                      +|.+...  .|.+..++.+..|...  ....|.  ..|.+...+....    -.|.|.+||.  |..|... ...|....
T Consensus       118 ~C~~~~~--~c~~~~~~~~~~C~~~~~~g~~C~--~~c~~~~~~~~~~----~~c~c~~g~~--g~~~~~~-~~~c~~~~  186 (316)
T KOG1218|consen  118 TCANPRR--ECRCGGGYIGEQCGEENLVGLKCQ--RDCQCTGGCDCKN----GICTCQPGFV--GVFCVES-CSGCSPLT  186 (316)
T ss_pred             ccCCCcc--ceecCCcCccccccccCCCCCCcc--CCCCCccccCCCC----CceeccCCcc--ccccccc-CCCcCCCc
Confidence            4544432  5677777777777630  012232  2232222332222    2589999999  7777742 22265456


Q ss_pred             CCCCCCEEecCCC
Q psy9686         699 PCQNNATCVASPG  711 (769)
Q Consensus       699 pC~ngg~Cvnt~G  711 (769)
                      .|.+++.|....+
T Consensus       187 ~~~~g~~C~~~~~  199 (316)
T KOG1218|consen  187 ACENGAKCNRSTG  199 (316)
T ss_pred             ccCCCCeeecccc
Confidence            7888888887655


No 57 
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=85.88  E-value=0.26  Score=50.57  Aligned_cols=18  Identities=33%  Similarity=0.588  Sum_probs=16.6

Q ss_pred             ecCCCCccceEeeeCCcc
Q psy9686          11 LDCRGSRDHGIFQINDKY   28 (769)
Q Consensus        11 ~~~~gs~~yGiFQIn~~~   28 (769)
                      .|.|||.|+||.|||+.|
T Consensus        48 ~N~NGT~DvGlMQINS~h   65 (206)
T PRK13888         48 RNTNGTHDVGPMQFNTAY   65 (206)
T ss_pred             eCCCCCEEeCCcccccHh
Confidence            589999999999999974


No 58 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=85.57  E-value=0.75  Score=43.82  Aligned_cols=32  Identities=19%  Similarity=0.545  Sum_probs=26.5

Q ss_pred             CCCCCCCEEecCCCCCceEEecCCCCCCCCCCC
Q psy9686         698 NPCQNNATCVASPGDKQITCLCLKGFEGPHREL  730 (769)
Q Consensus       698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~  730 (769)
                      +-|.|| +|.-...-+.+.|.|..||+|.+||.
T Consensus        51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh   82 (139)
T PHA03099         51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH   82 (139)
T ss_pred             CEeECC-EEEeeccCCCceeECCCCcccccccc
Confidence            458886 89877665668999999999999984


No 59 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=85.38  E-value=0.88  Score=34.11  Aligned_cols=20  Identities=35%  Similarity=0.850  Sum_probs=16.6

Q ss_pred             CeEeeCCCCCceEeecCCCCee
Q psy9686         660 ATCVASPGDKQITCLCLKGGTV  681 (769)
Q Consensus       660 gtCvnt~Gs~~y~C~C~~Gftg  681 (769)
                      ..|++.+++  |+|.|++||..
T Consensus        10 h~C~~~~g~--~~C~C~~Gy~L   29 (36)
T PF14670_consen   10 HICVNTPGS--YRCSCPPGYKL   29 (36)
T ss_dssp             SEEEEETTS--EEEE-STTEEE
T ss_pred             CCCccCCCc--eEeECCCCCEE
Confidence            389999877  99999999985


No 60 
>PHA02887 EGF-like protein; Provisional
Probab=84.28  E-value=0.93  Score=42.55  Aligned_cols=32  Identities=25%  Similarity=0.568  Sum_probs=26.2

Q ss_pred             CCCCCCCEEecCCCCCceEEecCCCCCCCCCCC
Q psy9686         698 NPCQNNATCVASPGDKQITCLCLKGFEGPHREL  730 (769)
Q Consensus       698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~  730 (769)
                      +-|.| |+|.-........|.|.+||+|.+|+.
T Consensus        92 ~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~  123 (126)
T PHA02887         92 DFCIN-GECMNIIDLDEKFCICNKGYTGIRCDE  123 (126)
T ss_pred             CEeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence            45775 599877666668999999999999974


No 61 
>PF14670 FXa_inhibition:  Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=83.39  E-value=0.91  Score=34.04  Aligned_cols=23  Identities=30%  Similarity=0.947  Sum_probs=18.4

Q ss_pred             CCCCCcEEeeCCCCccceEeeCCCCCcC
Q psy9686         522 PCRNNGTCTPVSGGVVNFTCTCPSGGQG  549 (769)
Q Consensus       522 PC~ngGtCvn~~gs~~~y~C~C~~GftG  549 (769)
                      .|++  +|++++++   |+|.|++||..
T Consensus         7 gC~h--~C~~~~g~---~~C~C~~Gy~L   29 (36)
T PF14670_consen    7 GCSH--ICVNTPGS---YRCSCPPGYKL   29 (36)
T ss_dssp             GSSS--EEEEETTS---EEEE-STTEEE
T ss_pred             CcCC--CCccCCCc---eEeECCCCCEE
Confidence            3665  99999987   99999999963


No 62 
>smart00051 DSL delta serrate ligand.
Probab=81.39  E-value=1.6  Score=36.80  Aligned_cols=47  Identities=30%  Similarity=0.622  Sum_probs=33.5

Q ss_pred             cccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCc
Q psy9686         629 FYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYC  686 (769)
Q Consensus       629 y~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C  686 (769)
                      +.-.|.++|.|..|+.    .|...+.+..+.+|.. .|.    |+|++||+  |..|
T Consensus        17 ~rv~C~~~~yG~~C~~----~C~~~~d~~~~~~Cd~-~G~----~~C~~Gw~--G~~C   63 (63)
T smart00051       17 IRVTCDENYYGEGCNK----FCRPRDDFFGHYTCDE-NGN----KGCLEGWM--GPYC   63 (63)
T ss_pred             EEeeCCCCCcCCccCC----EeCcCccccCCccCCc-CCC----EecCCCCc--CCCC
Confidence            3446999999999973    4753344666778854 354    89999999  6655


No 63 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=81.14  E-value=1.5  Score=45.55  Aligned_cols=34  Identities=24%  Similarity=0.538  Sum_probs=26.7

Q ss_pred             CccccCCCCCC--CCCCCcEEeeCCCCccceEeeCCCCCcC
Q psy9686         511 GAAALGGCAGR--PCRNNGTCTPVSGGVVNFTCTCPSGGQG  549 (769)
Q Consensus       511 gc~dideC~s~--PC~ngGtCvn~~gs~~~y~C~C~~GftG  549 (769)
                      .|.++++|...  +|.  ..|.+++++   |.|.|++||+.
T Consensus       183 ~C~~~~~C~~~~~~c~--~~C~~~~g~---~~c~c~~g~~~  218 (224)
T cd01475         183 ICVVPDLCATLSHVCQ--QVCISTPGS---YLCACTEGYAL  218 (224)
T ss_pred             cCcCchhhcCCCCCcc--ceEEcCCCC---EEeECCCCccC
Confidence            35567888544  465  489999998   99999999975


No 64 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.79  E-value=1.4  Score=42.10  Aligned_cols=40  Identities=18%  Similarity=0.471  Sum_probs=29.9

Q ss_pred             cCCCC---CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccc
Q psy9686         515 LGGCA---GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEIT  556 (769)
Q Consensus       515 ideC~---s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~Cedid  556 (769)
                      +.+|.   .+-|-|| +|.-.+. ...+.|.|+.||+|.+|+..+
T Consensus        42 i~~Cp~ey~~YClHG-~C~yI~d-l~~~~CrC~~GYtGeRCEh~d   84 (139)
T PHA03099         42 IRLCGPEGDGYCLHG-DCIHARD-IDGMYCRCSHGYTGIRCQHVV   84 (139)
T ss_pred             cccCChhhCCEeECC-EEEeecc-CCCceeECCCCccccccccee
Confidence            45563   3458885 8987664 345899999999999998765


No 65 
>PRK15470 emtA lytic murein endotransglycosylase E; Provisional
Probab=80.78  E-value=0.63  Score=48.09  Aligned_cols=38  Identities=29%  Similarity=0.355  Sum_probs=28.5

Q ss_pred             cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         294 RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       294 ~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      +|++-.-|.-   |++.||+||+.|+++     +..+||+||....
T Consensus        50 y~iDp~Ll~A---ii~qES~Fnp~A~S~-----aGA~GLMQImP~T   87 (203)
T PRK15470         50 WGVDPQLITA---IIAIESGGNPNAVSK-----SNAIGLMQLKAST   87 (203)
T ss_pred             cCCCHHHHHH---HHHHHcCCCCCCcCC-----CCCeeeccCCHHH
Confidence            4555433332   889999999999974     5699999998755


No 66 
>KOG3516|consensus
Probab=80.57  E-value=1.1  Score=55.88  Aligned_cols=40  Identities=25%  Similarity=0.684  Sum_probs=35.0

Q ss_pred             ccccCCCCCCCCCCCcEEeeCCCCccceEeeCC-CCCcCCCCcc
Q psy9686         512 AAALGGCAGRPCRNNGTCTPVSGGVVNFTCTCP-SGGQGQSGLE  554 (769)
Q Consensus       512 c~dideC~s~PC~ngGtCvn~~gs~~~y~C~C~-~GftG~~Ced  554 (769)
                      |.-++.|.++||+++|.|..+...   |.|.|. .||.|.+|+.
T Consensus       542 C~i~drClPN~CehgG~C~Qs~~~---f~C~C~~TGY~GatCHt  582 (1306)
T KOG3516|consen  542 CGISDRCLPNPCEHGGKCSQSWDD---FECNCELTGYKGATCHT  582 (1306)
T ss_pred             cccccccCCccccCCCcccccccc---eeEeccccccccccccC
Confidence            556788999999999999996665   999999 9999999953


No 67 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=79.55  E-value=1.3  Score=33.43  Aligned_cols=32  Identities=31%  Similarity=0.588  Sum_probs=22.4

Q ss_pred             CCCCCCCCeEeeCC-CCCceEeecCCCCeecCCCc
Q psy9686         653 LNPCQNNATCVASP-GDKQITCLCLKGGTVSSEYC  686 (769)
Q Consensus       653 ~~pC~n~gtCvnt~-Gs~~y~C~C~~Gftg~G~~C  686 (769)
                      ...|..++.|++.. |+  +.|+|..||..+|..|
T Consensus         4 ~~~cP~NA~C~~~~dG~--eecrCllgyk~~~~~C   36 (37)
T PF12946_consen    4 DTKCPANAGCFRYDDGS--EECRCLLGYKKVGGKC   36 (37)
T ss_dssp             SS---TTEEEEEETTSE--EEEEE-TTEEEETTEE
T ss_pred             CccCCCCcccEEcCCCC--EEEEeeCCccccCCCc
Confidence            45688899999887 54  9999999998666555


No 68 
>KOG3516|consensus
Probab=79.52  E-value=1.4  Score=55.03  Aligned_cols=45  Identities=29%  Similarity=0.700  Sum_probs=37.2

Q ss_pred             CccccCCccccCCCCCCCCCEEecCCCCCceEEecC-CCCCCCCCCCCCCC
Q psy9686         685 YCQFENSAACVTLNPCQNNATCVASPGDKQITCLCL-KGFEGPHRELPVES  734 (769)
Q Consensus       685 ~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~-~GytG~~Ce~dide  734 (769)
                      .|.  .+|.|. +++|+++|.|......  |.|.|. .||.|..|+..+.|
T Consensus       541 ~C~--i~drCl-PN~CehgG~C~Qs~~~--f~C~C~~TGY~GatCHtsi~e  586 (1306)
T KOG3516|consen  541 MCG--ISDRCL-PNPCEHGGKCSQSWDD--FECNCELTGYKGATCHTSIYE  586 (1306)
T ss_pred             ccc--cccccC-CccccCCCcccccccc--eeEeccccccccccccCCCcc
Confidence            455  467886 9999999999986555  999999 89999999976654


No 69 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=78.26  E-value=2.3  Score=44.14  Aligned_cols=38  Identities=29%  Similarity=0.794  Sum_probs=28.1

Q ss_pred             CCCccccCCccccC-CCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686         683 SEYCQFENSAACVT-LNPCQNNATCVASPGDKQITCLCLKGFEGP  726 (769)
Q Consensus       683 G~~C~~~~ideC~s-~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~  726 (769)
                      +..|.  +.++|.. .++|.+  .|.++.|+  |.|.|++||+..
T Consensus       181 ~~~C~--~~~~C~~~~~~c~~--~C~~~~g~--~~c~c~~g~~~~  219 (224)
T cd01475         181 GKICV--VPDLCATLSHVCQQ--VCISTPGS--YLCACTEGYALL  219 (224)
T ss_pred             cccCc--CchhhcCCCCCccc--eEEcCCCC--EEeECCCCccCC
Confidence            45565  5567742 356765  79999988  999999999753


No 70 
>PHA02887 EGF-like protein; Provisional
Probab=77.08  E-value=2.2  Score=40.18  Aligned_cols=39  Identities=26%  Similarity=0.531  Sum_probs=28.8

Q ss_pred             cCCCC---CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCccc
Q psy9686         515 LGGCA---GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEI  555 (769)
Q Consensus       515 ideC~---s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~Cedi  555 (769)
                      ..+|.   .+-|-+ |+|.-.+. ...+.|.|++||+|.+|+.+
T Consensus        83 f~pC~~eyk~YCiH-G~C~yI~d-L~epsCrC~~GYtG~RCE~v  124 (126)
T PHA02887         83 FEKCKNDFNDFCIN-GECMNIID-LDEKFCICNKGYTGIRCDEV  124 (126)
T ss_pred             ccccChHhhCEeeC-CEEEcccc-CCCceeECCCCcccCCCCcc
Confidence            34664   234874 69987655 35689999999999999764


No 71 
>PRK11671 mltC murein transglycosylase C; Provisional
Probab=72.30  E-value=1.6  Score=48.92  Aligned_cols=46  Identities=24%  Similarity=0.268  Sum_probs=31.8

Q ss_pred             HHHHHHHhcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         286 LAKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       286 la~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      ++++-.+++|++-.-|.-   |++.||+||+.|+.+     +..+||+||...-
T Consensus       195 lI~~aa~~ygvdp~LI~A---Ii~qES~FnP~AvS~-----agA~GLMQImP~T  240 (359)
T PRK11671        195 MVRKASRKYGVDESLILA---IMQTESSFNPYAVSR-----SDALGLMQVVQHT  240 (359)
T ss_pred             HHHHHHHHcCCCHHHHHH---HHHHhcCCCCCCccC-----CCCceeeeeCHhh
Confidence            334443345666533322   788999999999975     4589999998854


No 72 
>KOG1218|consensus
Probab=71.27  E-value=22  Score=38.30  Aligned_cols=107  Identities=20%  Similarity=0.382  Sum_probs=57.9

Q ss_pred             CCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc--CCccccCCCCCCCCCEEecCCC
Q psy9686         634 TEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE--NSAACVTLNPCQNNATCVASPG  711 (769)
Q Consensus       634 ~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~--~ideC~s~~pC~ngg~Cvnt~G  711 (769)
                      ..+|.|..|+.  ..+|.  ..|.. .+|.+...    .|.|..+|.  +..|...  ....|.  ..|.+...+.... 
T Consensus        96 ~~~~~g~~C~~--~~~~~--~~c~~-~~C~~~~~----~c~~~~~~~--~~~C~~~~~~g~~C~--~~c~~~~~~~~~~-  161 (316)
T KOG1218|consen   96 LNGYEGPQCES--PCPCG--DGCAE-KTCANPRR----ECRCGGGYI--GEQCGEENLVGLKCQ--RDCQCTGGCDCKN-  161 (316)
T ss_pred             CCCCCcccccC--CCCcC--Ccccc-cccCCCcc----ceecCCcCc--cccccccCCCCCCcc--CCCCCccccCCCC-
Confidence            68889998883  34443  11433 45555421    367777766  5555520  122342  2232222232222 


Q ss_pred             CCceEEecCCCCCCCCCCCCCCCCCC-CCC-CCCceeeCCCCccccCC
Q psy9686         712 DKQITCLCLKGFEGPHRELPVESVDE-PPS-EDETSVDLQLGSQANSY  757 (769)
Q Consensus       712 s~~y~C~C~~GytG~~Ce~dideC~~-npC-~~GtC~n~~ggy~C~~~  757 (769)
                         -.|.|.+||.|.+|......|.. ..+ +++.|....+.=.+.+.
T Consensus       162 ---~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~~~~~~~~  206 (316)
T KOG1218|consen  162 ---GICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTGSCLCYPG  206 (316)
T ss_pred             ---CceeccCCcccccccccCCCcCCCcccCCCCeeeccccccccCCC
Confidence               37899999999999754433554 334 34477666665555444


No 73 
>KOG3512|consensus
Probab=69.44  E-value=7.6  Score=44.51  Aligned_cols=53  Identities=25%  Similarity=0.503  Sum_probs=29.2

Q ss_pred             eEe-ecCCCCeecCCCccccCCccccCCCCCCC----CCEEecCCCCCceEEecCCCCCCCCCC
Q psy9686         671 ITC-LCLKGGTVSSEYCQFENSAACVTLNPCQN----NATCVASPGDKQITCLCLKGFEGPHRE  729 (769)
Q Consensus       671 y~C-~C~~Gftg~G~~C~~~~ideC~s~~pC~n----gg~Cvnt~Gs~~y~C~C~~GytG~~Ce  729 (769)
                      -+| .|.+||.-++..=.. +...|. .-.|+.    |-+|..+.|    +|.|++|-+|.+|.
T Consensus       371 rhChyCreGyyRd~s~pl~-hrkaCk-~CdChpVGs~gktCNq~tG----qCpCkeGvtG~tCn  428 (592)
T KOG3512|consen  371 RHCHYCREGYYRDGSKPLT-HRKACK-ACDCHPVGSAGKTCNQTTG----QCPCKEGVTGLTCN  428 (592)
T ss_pred             cccccccCccccCCCCCCc-hhhhhh-hcCCcccccccccccccCC----cccCCCCCcccccc
Confidence            356 388888744321110 223342 233442    335665544    78888888888875


No 74 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=66.35  E-value=12  Score=29.75  Aligned_cols=22  Identities=27%  Similarity=0.777  Sum_probs=16.7

Q ss_pred             CCCCCCCCEEecCCCCCceEEecCCCCC
Q psy9686         697 LNPCQNNATCVASPGDKQITCLCLKGFE  724 (769)
Q Consensus       697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~Gyt  724 (769)
                      ...|..++.|++.      +|.|++||.
T Consensus        25 ~~qC~~~s~C~~g------~C~C~~g~~   46 (52)
T PF01683_consen   25 DEQCIGGSVCVNG------RCQCPPGYV   46 (52)
T ss_pred             cCCCCCcCEEcCC------EeECCCCCE
Confidence            4456677889653      899999985


No 75 
>KOG3514|consensus
Probab=64.97  E-value=3.8  Score=50.83  Aligned_cols=34  Identities=32%  Similarity=0.820  Sum_probs=31.0

Q ss_pred             CCCCCCCCCCcEEeeCCCCccceEeeCC-CCCcCCCCc
Q psy9686         517 GCAGRPCRNNGTCTPVSGGVVNFTCTCP-SGGQGQSGL  553 (769)
Q Consensus       517 eC~s~PC~ngGtCvn~~gs~~~y~C~C~-~GftG~~Ce  553 (769)
                      .|.++||+|+|+|......   |.|.|. .||.|+.|+
T Consensus       625 ~C~~nPC~N~g~C~egwNr---fiCDCs~T~~~G~~Ce  659 (1591)
T KOG3514|consen  625 ICESNPCQNGGKCSEGWNR---FICDCSGTGFEGRTCE  659 (1591)
T ss_pred             ccCCCcccCCCCccccccc---cccccccCcccCcccc
Confidence            6999999999999998877   999996 789999995


No 76 
>PRK15328 invasion protein IagB; Provisional
Probab=63.66  E-value=1.9  Score=43.01  Aligned_cols=19  Identities=37%  Similarity=0.419  Sum_probs=16.8

Q ss_pred             cCCCCccceEeeeCCcccC
Q psy9686          12 DCRGSRDHGIFQINDKYWC   30 (769)
Q Consensus        12 ~~~gs~~yGiFQIn~~~wC   30 (769)
                      |.+|+.|+||+|||+.+|.
T Consensus        54 n~~gs~d~GLMQIn~~~~~   72 (160)
T PRK15328         54 NRDGSTDLGLMQINSFHMK   72 (160)
T ss_pred             CCCCCcccChhhcCHHHHH
Confidence            5689999999999998874


No 77 
>KOG3512|consensus
Probab=61.44  E-value=40  Score=38.98  Aligned_cols=25  Identities=20%  Similarity=0.468  Sum_probs=19.2

Q ss_pred             cEEeeCCCCccceEeeCCCCCcCCCCc
Q psy9686         527 GTCTPVSGGVVNFTCTCPSGGQGQSGL  553 (769)
Q Consensus       527 GtCvn~~gs~~~y~C~C~~GftG~~Ce  553 (769)
                      ..|+-...+  .++|.|.-+-+|..|+
T Consensus       285 s~Cv~d~~~--~ltCdC~HNTaGPdCg  309 (592)
T KOG3512|consen  285 SRCVMDESS--HLTCDCEHNTAGPDCG  309 (592)
T ss_pred             ceeeeccCC--ceEEecccCCCCCCcc
Confidence            468765544  4899999999999884


No 78 
>KOG3514|consensus
Probab=54.40  E-value=8.8  Score=47.87  Aligned_cols=44  Identities=36%  Similarity=0.751  Sum_probs=34.2

Q ss_pred             CCccccCCccccCCCCCCCCCEEecCCCCCceEEecC-CCCCCCCCCC
Q psy9686         684 EYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCL-KGFEGPHREL  730 (769)
Q Consensus       684 ~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~-~GytG~~Ce~  730 (769)
                      ..|..+....|. ++||.|+|.|.....  +|.|.|. .||.|+.||.
T Consensus       616 psCs~~~~~~C~-~nPC~N~g~C~egwN--rfiCDCs~T~~~G~~Cer  660 (1591)
T KOG3514|consen  616 PSCSLSNEKICE-SNPCQNGGKCSEGWN--RFICDCSGTGFEGRTCER  660 (1591)
T ss_pred             cccchhhccccC-CCcccCCCCcccccc--ccccccccCcccCccccc
Confidence            456543333686 999999999998765  4999996 6999999985


No 79 
>PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional
Probab=49.45  E-value=7.2  Score=45.31  Aligned_cols=44  Identities=23%  Similarity=0.277  Sum_probs=33.1

Q ss_pred             HHHHHHHhcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         286 LAKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       286 la~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      +.+++++ .|+|..    |+=|+..||+||+.|+.+     ....||+||-..-
T Consensus       108 i~~~~~~-~~lp~~----l~~l~~~ES~f~p~A~S~-----~gA~Gl~Q~mp~T  151 (456)
T PRK10783        108 IVGQIKK-RNMPME----LVLLPIVESAFDPHATSG-----ANAAGIWQIIPST  151 (456)
T ss_pred             HHHHHHH-cCCCHH----HHHHHHHhcCCCCCCccC-----CCccceeeecHHH
Confidence            4566766 699854    333899999999999975     4578999997643


No 80 
>PF00053 Laminin_EGF:  Laminin EGF-like (Domains III and V);  InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below.  +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain  In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=44.45  E-value=14  Score=29.12  Aligned_cols=21  Identities=38%  Similarity=0.808  Sum_probs=16.5

Q ss_pred             EEecCCCCCceEEecCCCCCCCCCC
Q psy9686         705 TCVASPGDKQITCLCLKGFEGPHRE  729 (769)
Q Consensus       705 ~Cvnt~Gs~~y~C~C~~GytG~~Ce  729 (769)
                      +|....    .+|.|+++|+|..|+
T Consensus        12 ~C~~~~----G~C~C~~~~~G~~C~   32 (49)
T PF00053_consen   12 TCDPST----GQCVCKPGTTGPRCD   32 (49)
T ss_dssp             SEEETC----EEESBSTTEESTTS-
T ss_pred             cccCCC----CEEeccccccCCcCc
Confidence            566643    489999999999996


No 81 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=43.35  E-value=8  Score=32.64  Aligned_cols=47  Identities=26%  Similarity=0.533  Sum_probs=21.7

Q ss_pred             eEeecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCC
Q psy9686         671 ITCLCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHR  728 (769)
Q Consensus       671 y~C~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C  728 (769)
                      ++-.|.+.|.  |..|..    .|...+.-..+-+|.. .|    .=.|.+||+|.+|
T Consensus        17 ~rv~C~~nyy--G~~C~~----~C~~~~d~~ghy~Cd~-~G----~~~C~~Gw~G~~C   63 (63)
T PF01414_consen   17 IRVVCDENYY--GPNCSK----FCKPRDDSFGHYTCDS-NG----NKVCLPGWTGPNC   63 (63)
T ss_dssp             ------TTEE--TTTT-E----E---EEETTEEEEE-S-S------EEE-TTEESTTS
T ss_pred             EEEECCCCCC--CccccC----CcCCCcCCcCCcccCC-CC----CCCCCCCCcCCCC
Confidence            5778999999  888874    4642211222335663 23    4579999999987


No 82 
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=42.49  E-value=23  Score=28.03  Aligned_cols=14  Identities=21%  Similarity=0.534  Sum_probs=13.2

Q ss_pred             EEecCCCCCCCCCC
Q psy9686         716 TCLCLKGFEGPHRE  729 (769)
Q Consensus       716 ~C~C~~GytG~~Ce  729 (769)
                      +|.|++|++|..|+
T Consensus        20 ~C~C~~~~~G~~C~   33 (50)
T cd00055          20 QCECKPNTTGRRCD   33 (50)
T ss_pred             EEeCCCcCCCCCCC
Confidence            89999999999996


No 83 
>PF12946 EGF_MSP1_1:  MSP1 EGF domain 1;  InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=41.71  E-value=21  Score=27.09  Aligned_cols=30  Identities=27%  Similarity=0.626  Sum_probs=20.2

Q ss_pred             CCCCCCCCCcEEeeCCCCccceEeeCCCCCcC
Q psy9686         518 CAGRPCRNNGTCTPVSGGVVNFTCTCPSGGQG  549 (769)
Q Consensus       518 C~s~PC~ngGtCvn~~gs~~~y~C~C~~GftG  549 (769)
                      |...+|..++.|++...+  .+.|+|.+||..
T Consensus         2 C~~~~cP~NA~C~~~~dG--~eecrCllgyk~   31 (37)
T PF12946_consen    2 CIDTKCPANAGCFRYDDG--SEECRCLLGYKK   31 (37)
T ss_dssp             -SSS---TTEEEEEETTS--EEEEEE-TTEEE
T ss_pred             ccCccCCCCcccEEcCCC--CEEEEeeCCccc
Confidence            556779899999998743  499999999963


No 84 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=41.55  E-value=26  Score=32.30  Aligned_cols=32  Identities=25%  Similarity=0.544  Sum_probs=23.9

Q ss_pred             cCCCC-CCCCCCCcEEeeCCCCccceEeeCCCCCcCC
Q psy9686         515 LGGCA-GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQ  550 (769)
Q Consensus       515 ideC~-s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~  550 (769)
                      .+.|. -..|+..|.|.....    ..|.|.+||.-+
T Consensus        77 ~d~Cd~y~~CG~~g~C~~~~~----~~C~Cl~GF~P~  109 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNSNNS----PKCSCLPGFEPK  109 (110)
T ss_pred             ccCCCCccccCCccEeCCCCC----CceECCCCcCCC
Confidence            35785 467999999965432    579999999754


No 85 
>cd00254 LT_GEWL Lytic Transglycosylase (LT)  and Goose Egg White Lysozyme (GEWL) domain. Members include the soluble and  insoluble membrane-bound LTs in bacteria, the LTs in bacteriophage lambda, as well as, the eukaryotic "goose-type" lysozymes (GEWL).  LTs catalyze the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetyl-D-glucosamine (GlcNAc), as do "goose-type" lysozymes. However, in addition to this, they also make a new glycosidic bond with the C6 hydroxyl group of the same muramic acid residue.
Probab=37.72  E-value=16  Score=32.66  Aligned_cols=47  Identities=11%  Similarity=0.069  Sum_probs=32.0

Q ss_pred             CCCccceEeeeCCcccCCCCCCCcCccccccchhccCChhHHHHHHHHHhhcc
Q psy9686          14 RGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT   66 (769)
Q Consensus        14 ~gs~~yGiFQIn~~~wC~~~~~~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~   66 (769)
                      +++.++||+||+...|-.-+....    .  ....-.|..+.|.-+-+++++.
T Consensus        21 ~~~~a~GlmQi~~~~~~~~~~~~~----~--~~~~~~dp~~ni~~~~~~l~~~   67 (113)
T cd00254          21 SGAGAVGLMQFMPSTARAYGVDGG----D--GADDLFDPEDNIRAGARYLKEL   67 (113)
T ss_pred             CCCCCeeeeeeCchHHHHHHHhcC----C--ChhhcCCHHHHHHHHHHHHHHH
Confidence            678999999999999976433111    0  0112247778888888888764


No 86 
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=37.67  E-value=34  Score=31.45  Aligned_cols=33  Identities=30%  Similarity=0.664  Sum_probs=24.6

Q ss_pred             CccccCCCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686         691 SAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGP  726 (769)
Q Consensus       691 ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~  726 (769)
                      .|.|.....|...|.|.....   ..|.|.+||.-+
T Consensus        77 ~d~Cd~y~~CG~~g~C~~~~~---~~C~Cl~GF~P~  109 (110)
T PF00954_consen   77 KDQCDVYGFCGPNGICNSNNS---PKCSCLPGFEPK  109 (110)
T ss_pred             ccCCCCccccCCccEeCCCCC---CceECCCCcCCC
Confidence            456764578999999965433   489999999754


No 87 
>PF01683 EB:  EB module;  InterPro: IPR006149  The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO 
Probab=35.81  E-value=49  Score=26.22  Aligned_cols=25  Identities=24%  Similarity=0.598  Sum_probs=17.6

Q ss_pred             CCCCCCCeEeeCCCCCceEeecCCCCeecCC
Q psy9686         654 NPCQNNATCVASPGDKQITCLCLKGGTVSSE  684 (769)
Q Consensus       654 ~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~  684 (769)
                      ..|..++.|++.      +|.|++||...+.
T Consensus        26 ~qC~~~s~C~~g------~C~C~~g~~~~~~   50 (52)
T PF01683_consen   26 EQCIGGSVCVNG------RCQCPPGYVEVGG   50 (52)
T ss_pred             CCCCCcCEEcCC------EeECCCCCEecCC
Confidence            345677889653      6999999974443


No 88 
>PRK11619 lytic murein transglycosylase; Provisional
Probab=35.65  E-value=16  Score=44.26  Aligned_cols=33  Identities=36%  Similarity=0.460  Sum_probs=26.4

Q ss_pred             cce-EeeccccCCCcccccCCCCCCCceeeeeeeccccc
Q psy9686         303 TWV-CIATKESNRNSNARSPKNGNGSRDHGIFQINDKYW  340 (769)
Q Consensus       303 ~wv-Cla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~W  340 (769)
                      .|| =|+..||+||+.|+++     ....||+||....+
T Consensus       495 ~lv~ai~rqES~f~p~a~S~-----~gA~GLMQimP~Ta  528 (644)
T PRK11619        495 SYAMAIARQESAWNPKARSP-----VGASGLMQIMPGTA  528 (644)
T ss_pred             HHHHHHHHHhcCCCCCCccC-----CCCeeeeeecHHHH
Confidence            444 3899999999999974     34899999998654


No 89 
>PF01464 SLT:  Transglycosylase SLT domain;  InterPro: IPR008258 Bacterial lytic transglycosylases degrade murein via cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid and N-acetylglucosamine, with the concomitant formation of a 1,6-anhydrobond in the muramic acid residue. There are both soluble (Slt enzymes) and membrane-bound (Mlt enzymes) lytic transglycosylases that differ in size, sequence, activity, specificity and location. The multi-domain structure of the 70 Kd soluble lytic transglycosylase Slt70 is known []. Slt70 has 3 distinct domains, each rich in alpha helices: an N-terminal superhelical U-shaped domain (U-domain; IPR008939 from INTERPRO), a superhelical linker domain (L-domain, IPR012289 from INTERPRO), and a C-terminal catalytic domain (IPR008258 from INTERPRO). Both the U- and L-domain share a similar superhelical structure. These two domains are connected, and together form a closed ring with a large central hole; the catalytic domain is packed on top of, and interacts with, this ring. The catalytic domain has a lysosome-like fold.  This entry represents the catalytic domain, which is structurally conserved in some membrane-bound lytic glycosylases and in bacteriophage transglycosylases, even though their sequences can differ considerably proteins []. The most conserved part of this domain is its N-terminal extremity that contains two conserved serines and a glutamate, which have been shown [] to be involved in the catalytic mechanism. This family is distantly related to IPR001916 from INTERPRO.; PDB: 2Y8P_B 3MGW_A 1QTE_A 1SLY_A 1QSA_A 153L_A 154L_A 3BKV_A 3BKH_A 1LSP_A ....
Probab=34.47  E-value=10  Score=35.03  Aligned_cols=48  Identities=21%  Similarity=0.218  Sum_probs=31.6

Q ss_pred             CCCCccceEeeeCCcccCCCCCCCcCccccccchhccCChhHHHHHHHHHhhcc
Q psy9686          13 CRGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT   66 (769)
Q Consensus        13 ~~gs~~yGiFQIn~~~wC~~~~~~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~   66 (769)
                      .+++.++|||||+...|=.-+....     .-.+++ .|....|..+-+++++.
T Consensus        36 ~~~~~a~GlmQi~~~t~~~~~~~~~-----~~~~~~-~dp~~ni~~g~~~L~~~   83 (121)
T PF01464_consen   36 SNGSGAYGLMQINPATWKDYGRKGG-----LSPDDL-FDPEDNIQAGAKYLKWL   83 (121)
T ss_dssp             ETTTTEETTTTEETTTSHHHSHTHC-----TSHHHH-HHHHHHHHHHHHHHHHH
T ss_pred             CCCCcceeccccCCcccchhhhccc-----ccchhh-hCHHHHHHHHHHHHHHH
Confidence            5688899999999998865332111     223434 45667777777777764


No 90 
>PRK10859 membrane-bound lytic transglycosylase F; Provisional
Probab=33.20  E-value=18  Score=42.18  Aligned_cols=31  Identities=29%  Similarity=0.398  Sum_probs=25.0

Q ss_pred             ceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         304 WVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       304 wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      ..=+++.||+||+.|+++.     ...||+||...-
T Consensus       306 l~a~~~~ES~~~~~a~s~~-----ga~Glmq~~p~t  336 (482)
T PRK10859        306 LAAIAYQESHWNPQATSPT-----GVRGLMMLTRNT  336 (482)
T ss_pred             HHHHHHHhCCCCCCCCcCC-----CCEecceeCHHH
Confidence            3448899999999999752     369999999754


No 91 
>PF05497 Destabilase:  Destabilase;  InterPro: IPR008597 Destabilase (3.5.1.44 from EC) is an endo-epsilon(gamma-Glu)-Lys isopeptidase, which cleaves isopeptide bonds formed by transglutaminase (Factor XIIIa) between glutamine gamma-carboxamide and the epsilon-amino group of lysine [].; GO: 0003796 lysozyme activity; PDB: 2DQA_B 3AB6_A 3AYQ_A.
Probab=31.02  E-value=4.8  Score=38.30  Aligned_cols=62  Identities=32%  Similarity=0.461  Sum_probs=37.8

Q ss_pred             EeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCCccccccccccccCCCh---hHHHHHHHHHHhc
Q psy9686         306 CIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNI---TDDVACVVKIHSQ  376 (769)
Q Consensus       306 Cla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~~~~C~~~C~~ll~~di---~~~i~Cak~i~~~  376 (769)
                      ||.+.||+=|..+.=. +..||..-|+|||+-.||=+-++. +.       +++.++-   .+|..||.++++.
T Consensus         8 CiC~~~SgC~~~~~C~-~~~~s~~CG~f~Is~~YW~Dag~~-g~-------~~~~~a~~~Ca~d~~Ca~~tV~~   72 (119)
T PF05497_consen    8 CICEAESGCNMPAGCR-MDVGSLSCGPFQISWPYWQDAGKP-GD-------SPSRDAYERCANDPYCAERTVQN   72 (119)
T ss_dssp             HHHHHHCSS-S---EE-EETTEEEETTTTEEHHHHHHTT---SS-------S-----HHHHHTSHHHHHHHHHH
T ss_pred             HHhHHhhCCCCCCCCC-CCCCCceeeeeEecHHHHHHcCCC-CC-------CchhhhHHHHcCCHHHHHHHHHH
Confidence            7888899877666643 345788999999999999554432 21       1111122   3578899999885


No 92 
>PRK13722 lytic transglycosylase; Provisional
Probab=30.17  E-value=15  Score=37.05  Aligned_cols=18  Identities=11%  Similarity=0.314  Sum_probs=15.9

Q ss_pred             cCCCCccceEeeeCCccc
Q psy9686          12 DCRGSRDHGIFQINDKYW   29 (769)
Q Consensus        12 ~~~gs~~yGiFQIn~~~w   29 (769)
                      |.+|+.|+||-|||+.++
T Consensus        55 n~~gs~d~GLMQIn~~~~   72 (169)
T PRK13722         55 NPVTGYGSGLMQVDSQHF   72 (169)
T ss_pred             CCCCCeeecCCcccHHHH
Confidence            678899999999999765


No 93 
>PHA00368 internal virion protein D
Probab=29.88  E-value=24  Score=44.23  Aligned_cols=39  Identities=23%  Similarity=0.332  Sum_probs=28.0

Q ss_pred             hcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686         293 QRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY  339 (769)
Q Consensus       293 ~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~  339 (769)
                      ++|++...|.   =|++.||+||+.|+.+.     ...||+||-..-
T Consensus        21 kyGVppdLL~---ALIrQES~FNP~AvS~A-----GA~GLMQLMPaT   59 (1315)
T PHA00368         21 AHGVSYDLLR---KVGWDESRFNPTAKSPT-----GPKGLMQFTKAT   59 (1315)
T ss_pred             HcCCCHHHHH---HHHHHcCCCCCCCEeCC-----CCEeeeeecHHH
Confidence            3566632221   38899999999999753     478999998644


No 94 
>COG4764 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.17  E-value=43  Score=33.39  Aligned_cols=93  Identities=24%  Similarity=0.253  Sum_probs=54.7

Q ss_pred             HHHHHHhcCCCCCCcccceEeeccccCCCcccccCCC--------CCCCceeeeeeeccccccCCCC-CCcccccccccc
Q psy9686         287 AKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKN--------GNGSRDHGIFQINDKYWCTASG-PAGKECHAKCSS  357 (769)
Q Consensus       287 a~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~--------~~gs~~yGifQIn~~~WC~~~~-~~~~~C~~~C~~  357 (769)
                      |....+++|+|-   +-.+=+++.||+|...|.-+..        .--|+.||-=|--|.-|=.-.. +++.       .
T Consensus        48 A~~~er~~GvPV---pVlMAtiy~ESgFk~~ArPPRtklLgfIPW~rvSsAyGysQAlDgTWa~YqretG~W-------~  117 (197)
T COG4764          48 ALRTEREYGVPV---PVLMATIYTESGFKHNARPPRTKLLGFIPWKRVSSAYGYSQALDGTWARYQRETGRW-------S  117 (197)
T ss_pred             HHHHHHHhCCCc---hHHHHHHHHhhcCCCCCCCCcceeEeeeeccCccchhhhhHhhcccHHHHHHHhCCc-------c
Confidence            444455578874   4445578999999999875421        1348899999999988865211 1111       1


Q ss_pred             ccCCChhHHHHHHH-HHHhcccccccCcccccccCc
Q psy9686         358 FEDNNITDDVACVV-KIHSQTQRARGNGFQAWSTYH  392 (769)
Q Consensus       358 ll~~di~~~i~Cak-~i~~~~~~~~~~G~~aW~~w~  392 (769)
                      -.-+|..|.|.=+- -+.+.+   ..+|..-|.+|.
T Consensus       118 AsRtdFaDAi~FvgWYh~ks~---~~nGva~nD~y~  150 (197)
T COG4764         118 ASRTDFADAIDFVGWYHQKSQ---RVNGVARNDAYN  150 (197)
T ss_pred             cccchHHHHHHHHHHHHHhHH---hhcCccccchHH
Confidence            12367777766543 222222   235666665543


No 95 
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=26.76  E-value=56  Score=25.53  Aligned_cols=14  Identities=21%  Similarity=0.589  Sum_probs=12.9

Q ss_pred             EEecCCCCCCCCCC
Q psy9686         716 TCLCLKGFEGPHRE  729 (769)
Q Consensus       716 ~C~C~~GytG~~Ce  729 (769)
                      +|.|++|++|.+|+
T Consensus        19 ~C~C~~~~~G~~C~   32 (46)
T smart00180       19 QCECKPNVTGRRCD   32 (46)
T ss_pred             EEECCCCCCCCCCC
Confidence            89999999999996


No 96 
>PF01414 DSL:  Delta serrate ligand;  InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=24.37  E-value=29  Score=29.26  Aligned_cols=48  Identities=27%  Similarity=0.579  Sum_probs=20.4

Q ss_pred             CcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCc
Q psy9686         628 SFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYC  686 (769)
Q Consensus       628 sy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C  686 (769)
                      .+.-.|.+.|.|..|..    .|...+.-..+-+|.. .|.    -+|.+||+  |..|
T Consensus        16 ~~rv~C~~nyyG~~C~~----~C~~~~d~~ghy~Cd~-~G~----~~C~~Gw~--G~~C   63 (63)
T PF01414_consen   16 RIRVVCDENYYGPNCSK----FCKPRDDSFGHYTCDS-NGN----KVCLPGWT--GPNC   63 (63)
T ss_dssp             -------TTEETTTT-E----E---EEETTEEEEE-S-S------EEE-TTEE--STTS
T ss_pred             EEEEECCCCCCCccccC----CcCCCcCCcCCcccCC-CCC----CCCCCCCc--CCCC
Confidence            35667999999999983    4652111122235643 332    47999999  6665


No 97 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=24.26  E-value=70  Score=29.80  Aligned_cols=31  Identities=29%  Similarity=0.747  Sum_probs=21.9

Q ss_pred             CCC--CCCCCCCCcEEeeCCCCc--cceEeeCCCC
Q psy9686         516 GGC--AGRPCRNNGTCTPVSGGV--VNFTCTCPSG  546 (769)
Q Consensus       516 deC--~s~PC~ngGtCvn~~gs~--~~y~C~C~~G  546 (769)
                      ++|  ..+.|.++|.|++...+.  .-|.|.|.+.
T Consensus         6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T   40 (103)
T PF12955_consen    6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPT   40 (103)
T ss_pred             HHHHHhccCCCCCceEeeccCCCccceEEEEeecc
Confidence            345  345699999999874321  3599999983


No 98 
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=20.90  E-value=89  Score=29.14  Aligned_cols=26  Identities=27%  Similarity=0.671  Sum_probs=19.4

Q ss_pred             CCCCCCCCEEecCCCC---CceEEecCCC
Q psy9686         697 LNPCQNNATCVASPGD---KQITCLCLKG  722 (769)
Q Consensus       697 ~~pC~ngg~Cvnt~Gs---~~y~C~C~~G  722 (769)
                      .+.|..+|.|+...+.   .=|.|.|.+.
T Consensus        12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T   40 (103)
T PF12955_consen   12 TNNCSGHGSCVKKYGSGGGDCFACKCKPT   40 (103)
T ss_pred             ccCCCCCceEeeccCCCccceEEEEeecc
Confidence            5778999999987431   3488999873


Done!