Query psy9686
Match_columns 769
No_of_seqs 538 out of 2838
Neff 6.1
Searched_HMMs 46136
Date Fri Aug 16 17:00:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9686.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9686hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00119 LYZ1 C-type lysozyme ( 100.0 2.8E-46 6E-51 347.8 5.2 119 279-405 1-122 (123)
2 PF00062 Lys: C-type lysozyme/ 100.0 9.7E-47 2.1E-51 352.9 2.1 117 279-403 1-122 (125)
3 smart00263 LYZ1 Alpha-lactalbu 100.0 2.2E-45 4.8E-50 343.9 5.3 121 279-407 1-126 (127)
4 smart00263 LYZ1 Alpha-lactalbu 99.9 2.8E-28 6.2E-33 228.7 5.5 79 12-97 46-127 (127)
5 cd00119 LYZ1 C-type lysozyme ( 99.9 2.8E-28 6E-33 227.4 5.0 77 12-94 44-122 (123)
6 PF00062 Lys: C-type lysozyme/ 99.9 7.9E-28 1.7E-32 225.6 3.3 77 12-96 46-125 (125)
7 KOG1214|consensus 99.6 1.5E-15 3.2E-20 173.3 15.4 250 413-725 639-908 (1289)
8 KOG1219|consensus 99.5 3.5E-14 7.7E-19 172.9 8.2 120 509-687 3857-3977(4289)
9 KOG1219|consensus 99.4 1.8E-13 3.8E-18 167.0 8.3 112 609-729 3865-3977(4289)
10 KOG1214|consensus 99.3 9E-12 2E-16 142.9 11.6 148 520-726 699-859 (1289)
11 KOG4289|consensus 99.3 9.1E-12 2E-16 148.2 10.6 109 515-678 1179-1308(2531)
12 KOG1217|consensus 99.2 2.5E-10 5.4E-15 128.2 16.1 204 516-761 170-388 (487)
13 KOG1217|consensus 99.0 3.3E-09 7.1E-14 119.2 15.8 228 516-762 90-350 (487)
14 KOG4289|consensus 99.0 4.7E-10 1E-14 134.0 7.0 100 623-730 1216-1317(2531)
15 KOG1225|consensus 98.8 2.6E-08 5.6E-13 113.9 10.7 158 539-757 234-395 (525)
16 KOG1225|consensus 98.5 6.1E-07 1.3E-11 102.9 10.8 88 616-731 256-343 (525)
17 KOG1226|consensus 98.3 4.7E-06 1E-10 97.4 12.2 101 630-746 526-636 (783)
18 KOG4260|consensus 98.2 1.4E-06 3.1E-11 90.6 5.4 150 520-724 149-304 (350)
19 KOG4260|consensus 98.2 2.3E-06 5E-11 89.0 6.5 120 626-757 165-300 (350)
20 KOG0994|consensus 97.9 4.6E-05 1E-09 91.4 10.0 208 498-729 890-1146(1758)
21 PF00008 EGF: EGF-like domain 97.6 4.1E-05 8.9E-10 55.6 2.0 31 518-551 1-32 (32)
22 smart00179 EGF_CA Calcium-bind 97.5 0.00016 3.4E-09 53.7 4.2 36 514-552 1-38 (39)
23 KOG1226|consensus 97.4 0.00047 1E-08 81.2 8.7 95 630-737 479-590 (783)
24 PF00008 EGF: EGF-like domain 97.2 0.00015 3.2E-09 52.7 1.7 30 697-727 3-32 (32)
25 smart00179 EGF_CA Calcium-bind 97.2 0.00044 9.6E-09 51.2 4.3 36 691-729 2-39 (39)
26 KOG0994|consensus 97.1 0.0013 2.7E-08 79.7 8.5 62 671-745 885-964 (1758)
27 cd00054 EGF_CA Calcium-binding 97.1 0.00082 1.8E-08 49.0 4.2 35 515-552 2-37 (38)
28 PRK15328 invasion protein IagB 97.0 0.00012 2.7E-09 72.5 -0.5 45 294-342 29-73 (160)
29 PRK13722 lytic transglycosylas 97.0 0.00018 4E-09 71.7 0.0 44 294-341 30-73 (169)
30 PF07645 EGF_CA: Calcium-bindi 96.9 0.00051 1.1E-08 52.8 2.3 32 514-548 1-34 (42)
31 PF07645 EGF_CA: Calcium-bindi 96.9 0.00074 1.6E-08 51.9 2.6 33 646-680 1-34 (42)
32 cd00054 EGF_CA Calcium-binding 96.8 0.0021 4.6E-08 46.8 4.3 36 691-729 2-38 (38)
33 PRK13888 conjugal transfer pro 96.6 0.00056 1.2E-08 69.6 0.4 31 307-339 35-65 (206)
34 PF12947 EGF_3: EGF domain; I 96.5 0.0019 4.2E-08 48.2 2.3 32 653-686 5-36 (36)
35 cd00053 EGF Epidermal growth f 96.4 0.0054 1.2E-07 43.9 4.1 32 518-552 2-35 (36)
36 PF06247 Plasmod_Pvs28: Plasmo 96.1 0.0016 3.5E-08 65.3 0.5 145 521-726 6-162 (197)
37 smart00181 EGF Epidermal growt 96.1 0.009 2E-07 43.4 4.0 32 517-552 1-34 (35)
38 PF01464 SLT: Transglycosylase 95.7 0.0021 4.6E-08 59.8 -0.6 64 307-377 20-83 (121)
39 cd00053 EGF Epidermal growth f 95.7 0.017 3.6E-07 41.3 4.2 30 697-728 5-35 (36)
40 smart00181 EGF Epidermal growt 95.4 0.022 4.9E-07 41.2 4.0 29 698-729 6-35 (35)
41 KOG1836|consensus 95.3 0.18 3.9E-06 65.9 14.0 197 518-729 777-1021(1705)
42 PRK13864 type IV secretion sys 95.1 0.007 1.5E-07 63.8 0.5 33 307-340 49-104 (245)
43 cd00254 LT_GEWL Lytic Transgly 94.9 0.01 2.2E-07 53.8 0.9 61 307-377 7-67 (113)
44 PF12662 cEGF: Complement Clr- 94.8 0.02 4.3E-07 39.0 2.0 19 539-557 2-24 (24)
45 PF12662 cEGF: Complement Clr- 94.8 0.022 4.9E-07 38.8 2.3 19 715-734 2-24 (24)
46 PF06247 Plasmod_Pvs28: Plasmo 94.2 0.013 2.8E-07 59.0 -0.1 129 622-757 13-156 (197)
47 PF12947 EGF_3: EGF domain; I 94.1 0.025 5.5E-07 42.3 1.3 29 697-727 5-33 (36)
48 PF07974 EGF_2: EGF-like domai 93.6 0.088 1.9E-06 38.4 3.4 27 698-728 6-32 (32)
49 PF07974 EGF_2: EGF-like domai 93.4 0.11 2.4E-06 37.9 3.6 27 521-552 6-32 (32)
50 PF12661 hEGF: Human growth fa 93.2 0.038 8.3E-07 32.2 0.8 13 716-728 1-13 (13)
51 PF06737 Transglycosylas: Tran 92.6 0.024 5.1E-07 49.4 -1.0 71 301-390 4-76 (77)
52 PF12661 hEGF: Human growth fa 91.7 0.075 1.6E-06 31.0 0.7 13 540-552 1-13 (13)
53 cd01021 GEWL Goose Egg White L 90.6 0.13 2.8E-06 51.5 1.7 49 287-339 35-83 (166)
54 smart00051 DSL delta serrate l 90.0 0.39 8.4E-06 40.5 3.8 47 671-728 17-63 (63)
55 KOG1836|consensus 89.6 0.64 1.4E-05 61.1 7.1 56 674-733 760-816 (1705)
56 KOG1218|consensus 87.3 5.1 0.00011 43.2 11.2 80 621-711 118-199 (316)
57 PRK13888 conjugal transfer pro 85.9 0.26 5.7E-06 50.6 0.4 18 11-28 48-65 (206)
58 PHA03099 epidermal growth fact 85.6 0.75 1.6E-05 43.8 3.1 32 698-730 51-82 (139)
59 PF14670 FXa_inhibition: Coagu 85.4 0.88 1.9E-05 34.1 2.8 20 660-681 10-29 (36)
60 PHA02887 EGF-like protein; Pro 84.3 0.93 2E-05 42.5 3.1 32 698-730 92-123 (126)
61 PF14670 FXa_inhibition: Coagu 83.4 0.91 2E-05 34.0 2.2 23 522-549 7-29 (36)
62 smart00051 DSL delta serrate l 81.4 1.6 3.5E-05 36.8 3.2 47 629-686 17-63 (63)
63 cd01475 vWA_Matrilin VWA_Matri 81.1 1.5 3.2E-05 45.5 3.6 34 511-549 183-218 (224)
64 PHA03099 epidermal growth fact 80.8 1.4 3E-05 42.1 2.8 40 515-556 42-84 (139)
65 PRK15470 emtA lytic murein end 80.8 0.63 1.4E-05 48.1 0.7 38 294-339 50-87 (203)
66 KOG3516|consensus 80.6 1.1 2.5E-05 55.9 2.8 40 512-554 542-582 (1306)
67 PF12946 EGF_MSP1_1: MSP1 EGF 79.6 1.3 2.8E-05 33.4 1.8 32 653-686 4-36 (37)
68 KOG3516|consensus 79.5 1.4 3.1E-05 55.0 3.2 45 685-734 541-586 (1306)
69 cd01475 vWA_Matrilin VWA_Matri 78.3 2.3 5E-05 44.1 3.9 38 683-726 181-219 (224)
70 PHA02887 EGF-like protein; Pro 77.1 2.2 4.7E-05 40.2 2.9 39 515-555 83-124 (126)
71 PRK11671 mltC murein transglyc 72.3 1.6 3.5E-05 48.9 1.0 46 286-339 195-240 (359)
72 KOG1218|consensus 71.3 22 0.00047 38.3 9.5 107 634-757 96-206 (316)
73 KOG3512|consensus 69.4 7.6 0.00017 44.5 5.4 53 671-729 371-428 (592)
74 PF01683 EB: EB module; Inter 66.3 12 0.00026 29.8 4.6 22 697-724 25-46 (52)
75 KOG3514|consensus 65.0 3.8 8.3E-05 50.8 2.1 34 517-553 625-659 (1591)
76 PRK15328 invasion protein IagB 63.7 1.9 4.2E-05 43.0 -0.6 19 12-30 54-72 (160)
77 KOG3512|consensus 61.4 40 0.00087 39.0 9.0 25 527-553 285-309 (592)
78 KOG3514|consensus 54.4 8.8 0.00019 47.9 2.7 44 684-730 616-660 (1591)
79 PRK10783 mltD membrane-bound l 49.4 7.2 0.00016 45.3 0.9 44 286-339 108-151 (456)
80 PF00053 Laminin_EGF: Laminin 44.4 14 0.00029 29.1 1.5 21 705-729 12-32 (49)
81 PF01414 DSL: Delta serrate li 43.4 8 0.00017 32.6 0.0 47 671-728 17-63 (63)
82 cd00055 EGF_Lam Laminin-type e 42.5 23 0.00051 28.0 2.6 14 716-729 20-33 (50)
83 PF12946 EGF_MSP1_1: MSP1 EGF 41.7 21 0.00046 27.1 2.0 30 518-549 2-31 (37)
84 PF00954 S_locus_glycop: S-loc 41.5 26 0.00056 32.3 3.1 32 515-550 77-109 (110)
85 cd00254 LT_GEWL Lytic Transgly 37.7 16 0.00034 32.7 1.1 47 14-66 21-67 (113)
86 PF00954 S_locus_glycop: S-loc 37.7 34 0.00075 31.4 3.3 33 691-726 77-109 (110)
87 PF01683 EB: EB module; Inter 35.8 49 0.0011 26.2 3.5 25 654-684 26-50 (52)
88 PRK11619 lytic murein transgly 35.6 16 0.00036 44.3 1.1 33 303-340 495-528 (644)
89 PF01464 SLT: Transglycosylase 34.5 10 0.00022 35.0 -0.8 48 13-66 36-83 (121)
90 PRK10859 membrane-bound lytic 33.2 18 0.00039 42.2 0.8 31 304-339 306-336 (482)
91 PF05497 Destabilase: Destabil 31.0 4.8 0.0001 38.3 -3.5 62 306-376 8-72 (119)
92 PRK13722 lytic transglycosylas 30.2 15 0.00033 37.0 -0.4 18 12-29 55-72 (169)
93 PHA00368 internal virion prote 29.9 24 0.00052 44.2 1.1 39 293-339 21-59 (1315)
94 COG4764 Uncharacterized protei 29.2 43 0.00093 33.4 2.5 93 287-392 48-150 (197)
95 smart00180 EGF_Lam Laminin-typ 26.8 56 0.0012 25.5 2.4 14 716-729 19-32 (46)
96 PF01414 DSL: Delta serrate li 24.4 29 0.00064 29.3 0.4 48 628-686 16-63 (63)
97 PF12955 DUF3844: Domain of un 24.3 70 0.0015 29.8 2.8 31 516-546 6-40 (103)
98 PF12955 DUF3844: Domain of un 20.9 89 0.0019 29.1 2.8 26 697-722 12-40 (103)
No 1
>cd00119 LYZ1 C-type lysozyme (1, 4-beta-N-acetylmuramidase, LYZ) and alpha-lactalbumin (lactose synthase B protein, LA). They have a close evolutionary relationship and similar tertiary structure, however, functionally they are quite different. Lysozymes have primarily bacteriolytic function; hydrolysis of peptidoglycan of prokaryotic cell walls and transglycosylation. LA is a calcium-binding metalloprotein that is expressed exclusively in the mammary gland during lactation. LA is the regulatory subunit of the enzyme lactose synthase. The association of LA with the catalytic component of lactose synthase, galactosyltransferase, alters the acceptor substrate specificity of this glycosyltransferase, facilitating biosynthesis of lactose.
Probab=100.00 E-value=2.8e-46 Score=347.78 Aligned_cols=119 Identities=42% Similarity=0.834 Sum_probs=112.5
Q ss_pred eeecchHHHHHHHHhcCC-CCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCC-Cccccccccc
Q psy9686 279 KRFGACELAKFLVRQRGI-ARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGP-AGKECHAKCS 356 (769)
Q Consensus 279 k~~~rCela~~l~~~~g~-~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~-~~~~C~~~C~ 356 (769)
|||+|||||++|++ +|| ++.+|++|||||++||+|||+|+++ |.|||+||||||||++|||+++++ ++|.|+|+|+
T Consensus 1 k~~~rCeLa~~L~~-~g~~~~~~l~~Wvcia~~ES~~nt~a~~~-n~~gs~dyGlfQINs~~WC~~~~~~~~n~C~i~C~ 78 (123)
T cd00119 1 KIFTRCELAKELKR-LGLYPGISLANWVCLAEHESGFNTQAVNN-NNDGSTDYGLFQINSKYWCNDGKTPSGNICHISCS 78 (123)
T ss_pred CcCcHhHHHHHHHH-cCCCCCCCHHHHHHHHHHHHCCCccceec-CCCCCeeEeEEEecchhhccCCCCCCCCCcCCCHH
Confidence 79999999999998 576 9999999999999999999999995 679999999999999999999876 5899999999
Q ss_pred cccCCChhHHHHHHHHHHhcccccccCcccccccCc-cccCCCCCccccc
Q psy9686 357 SFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKVSTYDH 405 (769)
Q Consensus 357 ~ll~~di~~~i~Cak~i~~~~~~~~~~G~~aW~~w~-~C~g~~~l~~~~~ 405 (769)
+||+|||+|||+|||+|+++| +||+||++|+ +|+|+ +|++|..
T Consensus 79 ~Ll~ddI~~~i~CAk~I~~~~-----~G~~aW~~w~~~C~g~-~L~~~~~ 122 (123)
T cd00119 79 KLLDDDITDDIKCAKKIVKEH-----QGFDAWVAWKNHCQGR-DLSQWVK 122 (123)
T ss_pred HhccccHHHHHHHHHHHHhcC-----CCchhHHHHHHhCCCC-CCccccC
Confidence 999999999999999999996 7999999999 99997 9999963
No 2
>PF00062 Lys: C-type lysozyme/alpha-lactalbumin family; InterPro: IPR001916 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 22 GH22 from CAZY comprises enzymes with two known activities; lysozyme type C (3.2.1.17 from EC) and alpha-lactalbumins. Asp and/or the carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base. Alpha-lactalbumin [, ] is a milk protein that acts as the regulatory subunit of lactose synthetase, acting to promote the conversion of galactosyltransferase to lactose synthase, which is essential for milk production. In the mammary gland, alpha-lactalbumin changes the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. Lysozymes (3.2.1.17 from EC) act as bacteriolytic enzymes by hydrolyzing the beta(1->4) bonds between N-acetylglucosamine and N-acetylmuramic acid in the peptidoglycan of prokaryotic cell walls. It has also been recruited for a digestive role in certain ruminants and colobine monkeys []. There are at least five different classes of lysozymes []: C (chicken type), G (goose type), phage-type (T4), fungi (Chalaropsis), and bacterial (Bacillus subtilis). There are few similarities in the sequences of the different types of lysozymes. Lysozyme type C and alpha-lactalbumin are similar both in terms of primary sequence and structure, and probably evolved from a common ancestral protein []. Around 35 to 40% of the residues are conserved in both proteins as well as the positions of the four disulphide bonds. There is, however, no similarity in function. Another significant difference between the two enzymes is that all lactalbumins have the ability to bind calcium [], while this property is restricted to only a few lysozymes []. The binding site was deduced using high resolution X-ray structure analysis and was shown to consist of three aspartic acid residues. It was first suggested that calcium bound to lactalbumin stabilised the structure, but recently it has been claimed that calcium controls the release of lactalbumin from the golgi membrane and that the pattern of ion binding may also affect the catalytic properties of the lactose synthetase complex.; PDB: 1HFX_A 1HMK_A 1HFY_B 1FKQ_A 1FKV_A 3B0K_B 1HML_A 1B9O_A 1A4V_A 3B0O_A ....
Probab=100.00 E-value=9.7e-47 Score=352.87 Aligned_cols=117 Identities=39% Similarity=0.806 Sum_probs=105.8
Q ss_pred eeecchHHHHHHHHh--cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCC--Cccccccc
Q psy9686 279 KRFGACELAKFLVRQ--RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGP--AGKECHAK 354 (769)
Q Consensus 279 k~~~rCela~~l~~~--~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~--~~~~C~~~ 354 (769)
|||+|||||++|++. .||++++|+||||||+|||+|||+|+++++ |||+|||||||||+|||++++. ++|.|+|+
T Consensus 1 Kv~~rCeLa~~L~~~g~~g~~~~~l~~WvCla~~eS~f~T~a~~~~~-~gS~dyGIFQIns~~WC~~~~~~~~~n~C~i~ 79 (125)
T PF00062_consen 1 KVFTRCELARELKNAGLDGYPGYSLANWVCLAYHESGFNTSAVNYNN-DGSTDYGIFQINSRYWCSDGKTPGSKNICNIS 79 (125)
T ss_dssp -BC-HHHHHHHHHHTTGTTBTCBGHHHHHHHHHHHHTTBTT-EEEET-TSEEEETTTTEETTTTSBCSSSTTSSTTTSSB
T ss_pred CcccHhHHHHHHHHcCCCCcchhhhhheeeeEecccCceeeccccCC-CCccccceeecCchhhccCCCCCCCCCccCCc
Confidence 899999999999973 345589999999999999999999999865 9999999999999999999875 47999999
Q ss_pred cccccCCChhHHHHHHHHHHhcccccccCcccccccCc-cccCCCCCccc
Q psy9686 355 CSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKVSTY 403 (769)
Q Consensus 355 C~~ll~~di~~~i~Cak~i~~~~~~~~~~G~~aW~~w~-~C~g~~~l~~~ 403 (769)
|++||||||+|||+|||+|++++ +||+||++|+ +|+ + +|+.|
T Consensus 80 C~~ll~ddI~ddi~Cak~I~~~~-----~G~~aW~~w~~~C~-~-~L~~~ 122 (125)
T PF00062_consen 80 CSDLLDDDITDDIRCAKKIYREP-----QGFNAWVAWKNHCR-K-DLSSW 122 (125)
T ss_dssp GGGGGSSSTHHHHHHHHHHHTHH-----TGGGGSHHHHHHTT-S-SGGGG
T ss_pred cccccccchHHHHHHHHHHHcCc-----ccchhhHHHHHhhc-C-CCccc
Confidence 99999999999999999999775 7999999999 999 5 99999
No 3
>smart00263 LYZ1 Alpha-lactalbumin / lysozyme C.
Probab=100.00 E-value=2.2e-45 Score=343.93 Aligned_cols=121 Identities=40% Similarity=0.745 Sum_probs=112.7
Q ss_pred eeecchHHHHHHHHh--cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCC--Cccccccc
Q psy9686 279 KRFGACELAKFLVRQ--RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGP--AGKECHAK 354 (769)
Q Consensus 279 k~~~rCela~~l~~~--~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~--~~~~C~~~ 354 (769)
|+|+|||||++|++. +|||+.+|++|||||++||+|||+|+++ |.|||+||||||||++|||+++++ ++|.|+|+
T Consensus 1 k~~~rCeLa~~L~~~gl~g~~~~~l~~WvCia~~ES~~~t~a~~~-n~~gs~dyGLfQINs~~WC~~~~~~~~~n~C~i~ 79 (127)
T smart00263 1 KVFTRCELARELKRLGLDGYRGISLANWVCLAFHESGYNTQATNY-NNGGSTDYGIFQINSKYWCNDGKTPGSVNACGIS 79 (127)
T ss_pred CcCcHhHHHHHHHHhCcCCCCCcCHHHHHHHHHHHhcCCCcceec-CCCCCceeeEEEecccccccCCCCCCCCCcCCCC
Confidence 789999999999972 4999999999999999999999999996 679999999999999999998864 47999999
Q ss_pred cccccCCChhHHHHHHHHHHhcccccccCcccccccCc-cccCCCCCccccccC
Q psy9686 355 CSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKVSTYDHAD 407 (769)
Q Consensus 355 C~~ll~~di~~~i~Cak~i~~~~~~~~~~G~~aW~~w~-~C~g~~~l~~~~~~~ 407 (769)
|++||+|||+|||+|||+|+++ +||+||++|+ +|+|+ +|+.|...|
T Consensus 80 C~~Ll~ddI~ddv~CAk~I~~~------~G~~aW~~w~~~C~g~-~l~~~~~~C 126 (127)
T smart00263 80 CSKLLDDDITDDVKCAKKIVSD------QGIDAWVAWKNHCQGE-DLSQWIRGC 126 (127)
T ss_pred HhhhcCCChHHHHHHHHHHHHH------cCchHHHHHHHhCCCC-CCChhhccC
Confidence 9999999999999999999999 6999999999 99996 999997555
No 4
>smart00263 LYZ1 Alpha-lactalbumin / lysozyme C.
Probab=99.94 E-value=2.8e-28 Score=228.73 Aligned_cols=79 Identities=39% Similarity=0.780 Sum_probs=73.8
Q ss_pred cCCCCccceEeeeCCcccCCCCCC--CcCccccccchhccCChhHHHHHHHHHhhccccccCCcccccccCc-cccCCCC
Q psy9686 12 DCRGSRDHGIFQINDKYWCTASGP--AGKECHAKCSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSK 88 (769)
Q Consensus 12 ~~~gs~~yGiFQIn~~~wC~~~~~--~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~~~~~~~g~~aW~~w~-~C~~~~~ 88 (769)
|.|||+||||||||++|||+++++ ++|.|+|+|++||+|||+|||+|||+|+++ +||+||++|+ +|+| ++
T Consensus 46 n~~gs~dyGLfQINs~~WC~~~~~~~~~n~C~i~C~~Ll~ddI~ddv~CAk~I~~~------~G~~aW~~w~~~C~g-~~ 118 (127)
T smart00263 46 NNGGSTDYGIFQINSKYWCNDGKTPGSVNACGISCSKLLDDDITDDVKCAKKIVSD------QGIDAWVAWKNHCQG-ED 118 (127)
T ss_pred CCCCCceeeEEEecccccccCCCCCCCCCcCCCCHhhhcCCChHHHHHHHHHHHHH------cCchHHHHHHHhCCC-CC
Confidence 789999999999999999999874 479999999999999999999999999998 4999999999 9997 79
Q ss_pred cccccccCc
Q psy9686 89 VSTYYSSSF 97 (769)
Q Consensus 89 ~~~~~~~~~ 97 (769)
|+.||++|+
T Consensus 119 l~~~~~~C~ 127 (127)
T smart00263 119 LSQWIRGCG 127 (127)
T ss_pred CChhhccCC
Confidence 999998874
No 5
>cd00119 LYZ1 C-type lysozyme (1, 4-beta-N-acetylmuramidase, LYZ) and alpha-lactalbumin (lactose synthase B protein, LA). They have a close evolutionary relationship and similar tertiary structure, however, functionally they are quite different. Lysozymes have primarily bacteriolytic function; hydrolysis of peptidoglycan of prokaryotic cell walls and transglycosylation. LA is a calcium-binding metalloprotein that is expressed exclusively in the mammary gland during lactation. LA is the regulatory subunit of the enzyme lactose synthase. The association of LA with the catalytic component of lactose synthase, galactosyltransferase, alters the acceptor substrate specificity of this glycosyltransferase, facilitating biosynthesis of lactose.
Probab=99.94 E-value=2.8e-28 Score=227.41 Aligned_cols=77 Identities=40% Similarity=0.896 Sum_probs=72.6
Q ss_pred cCCCCccceEeeeCCcccCCCCCC-CcCccccccchhccCChhHHHHHHHHHhhccccccCCcccccccCc-cccCCCCc
Q psy9686 12 DCRGSRDHGIFQINDKYWCTASGP-AGKECHAKCSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSKV 89 (769)
Q Consensus 12 ~~~gs~~yGiFQIn~~~wC~~~~~-~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~~~~~~~g~~aW~~w~-~C~~~~~~ 89 (769)
|.|||+||||||||++|||+++++ ++|.|+|+|++||+|||+|||+|||+|+++| +||+||++|+ ||+| ++|
T Consensus 44 n~~gs~dyGlfQINs~~WC~~~~~~~~n~C~i~C~~Ll~ddI~~~i~CAk~I~~~~-----~G~~aW~~w~~~C~g-~~L 117 (123)
T cd00119 44 NNDGSTDYGLFQINSKYWCNDGKTPSGNICHISCSKLLDDDITDDIKCAKKIVKEH-----QGFDAWVAWKNHCQG-RDL 117 (123)
T ss_pred CCCCCeeEeEEEecchhhccCCCCCCCCCcCCCHHHhccccHHHHHHHHHHHHhcC-----CCchhHHHHHHhCCC-CCC
Confidence 788999999999999999999875 5899999999999999999999999999997 4999999999 9997 799
Q ss_pred ccccc
Q psy9686 90 STYYS 94 (769)
Q Consensus 90 ~~~~~ 94 (769)
++|++
T Consensus 118 ~~~~~ 122 (123)
T cd00119 118 SQWVK 122 (123)
T ss_pred ccccC
Confidence 99984
No 6
>PF00062 Lys: C-type lysozyme/alpha-lactalbumin family; InterPro: IPR001916 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 22 GH22 from CAZY comprises enzymes with two known activities; lysozyme type C (3.2.1.17 from EC) and alpha-lactalbumins. Asp and/or the carbonyl oxygen of the C-2 acetamido group of the substrate acts as the catalytic nucleophile/base. Alpha-lactalbumin [, ] is a milk protein that acts as the regulatory subunit of lactose synthetase, acting to promote the conversion of galactosyltransferase to lactose synthase, which is essential for milk production. In the mammary gland, alpha-lactalbumin changes the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. Lysozymes (3.2.1.17 from EC) act as bacteriolytic enzymes by hydrolyzing the beta(1->4) bonds between N-acetylglucosamine and N-acetylmuramic acid in the peptidoglycan of prokaryotic cell walls. It has also been recruited for a digestive role in certain ruminants and colobine monkeys []. There are at least five different classes of lysozymes []: C (chicken type), G (goose type), phage-type (T4), fungi (Chalaropsis), and bacterial (Bacillus subtilis). There are few similarities in the sequences of the different types of lysozymes. Lysozyme type C and alpha-lactalbumin are similar both in terms of primary sequence and structure, and probably evolved from a common ancestral protein []. Around 35 to 40% of the residues are conserved in both proteins as well as the positions of the four disulphide bonds. There is, however, no similarity in function. Another significant difference between the two enzymes is that all lactalbumins have the ability to bind calcium [], while this property is restricted to only a few lysozymes []. The binding site was deduced using high resolution X-ray structure analysis and was shown to consist of three aspartic acid residues. It was first suggested that calcium bound to lactalbumin stabilised the structure, but recently it has been claimed that calcium controls the release of lactalbumin from the golgi membrane and that the pattern of ion binding may also affect the catalytic properties of the lactose synthetase complex.; PDB: 1HFX_A 1HMK_A 1HFY_B 1FKQ_A 1FKV_A 3B0K_B 1HML_A 1B9O_A 1A4V_A 3B0O_A ....
Probab=99.94 E-value=7.9e-28 Score=225.60 Aligned_cols=77 Identities=38% Similarity=0.867 Sum_probs=71.1
Q ss_pred cCCCCccceEeeeCCcccCCCCCC--CcCccccccchhccCChhHHHHHHHHHhhccccccCCcccccccCc-cccCCCC
Q psy9686 12 DCRGSRDHGIFQINDKYWCTASGP--AGKECHAKCSSFEDNNITDDVACVVKIHSQTQRARGNGFQAWSTYH-YCNTNSK 88 (769)
Q Consensus 12 ~~~gs~~yGiFQIn~~~wC~~~~~--~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~~~~~~~g~~aW~~w~-~C~~~~~ 88 (769)
+.|||+|||||||||+|||++++. ++|.|+|+|++||||||+|||.|||+|++++ +||+||++|+ +|+ ++
T Consensus 46 ~~~gS~dyGIFQIns~~WC~~~~~~~~~n~C~i~C~~ll~ddI~ddi~Cak~I~~~~-----~G~~aW~~w~~~C~--~~ 118 (125)
T PF00062_consen 46 NNDGSTDYGIFQINSRYWCSDGKTPGSKNICNISCSDLLDDDITDDIRCAKKIYREP-----QGFNAWVAWKNHCR--KD 118 (125)
T ss_dssp ETTSEEEETTTTEETTTTSBCSSSTTSSTTTSSBGGGGGSSSTHHHHHHHHHHHTHH-----TGGGGSHHHHHHTT--SS
T ss_pred CCCCccccceeecCchhhccCCCCCCCCCccCCccccccccchHHHHHHHHHHHcCc-----ccchhhHHHHHhhc--CC
Confidence 449999999999999999999884 4699999999999999999999999999776 4999999999 999 59
Q ss_pred cccccccC
Q psy9686 89 VSTYYSSS 96 (769)
Q Consensus 89 ~~~~~~~~ 96 (769)
|++| ++|
T Consensus 119 L~~~-~~C 125 (125)
T PF00062_consen 119 LSSW-KGC 125 (125)
T ss_dssp GGGG-TTS
T ss_pred Cccc-CCC
Confidence 9999 887
No 7
>KOG1214|consensus
Probab=99.65 E-value=1.5e-15 Score=173.28 Aligned_cols=250 Identities=18% Similarity=0.361 Sum_probs=177.5
Q ss_pred cccccccccCcc--CccceeeeEE--ecCcceeEEeecccccccccccccccccccccCCCCceeeccceeehhccCCCC
Q psy9686 413 SACACYMHVNYL--GSSLKLETVL--MSSEKIFISYSSTHHISKMANKFKFNLFKDHADDMGTCTWEKLALALAKKKTHP 488 (769)
Q Consensus 413 ~~C~~~~~~~~~--~~~~~~e~v~--~~~~e~~i~y~~~~~i~~~~~~~~~~~~~~~~~~~~~c~~~~l~~~~~s~~~~~ 488 (769)
-.|.|+++.+.. ++|++++.|+ +..++.+++++-+..|..+-+.......++++++.+.|. ......+
T Consensus 639 q~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~igpV~E~S~~~~~npCy~gsh~cd--------t~a~C~p 710 (1289)
T KOG1214|consen 639 QVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQIGPVKEDSDPTPVNPCYDGSHMCD--------TTARCHP 710 (1289)
T ss_pred EEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhcccceecCCCCcccccceecCcccC--------CCccccC
Confidence 358887776633 8899999977 777799999999999999999988888899999999888 2233344
Q ss_pred CccccceeeeccccccccccCCCccccCCCCCC--CCCCCcEEeeCCCCccceEeeCCCCCc----CCCCcccccchhcc
Q psy9686 489 MFGKLAHFESQELFLSSALKKPGAAALGGCAGR--PCRNNGTCTPVSGGVVNFTCTCPSGGQ----GQSGLEITPVYLWL 562 (769)
Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~gc~dideC~s~--PC~ngGtCvn~~gs~~~y~C~C~~Gft----G~~CedideC~~~~ 562 (769)
..+....++|..+|.+ ....|.|+++|+.. .|..+.+|++.+++ |.|+|..||. +-+|..+..-+
T Consensus 711 g~~~~~tcecs~g~~g---dgr~c~d~~eca~~~~~CGp~s~Cin~pg~---~rceC~~gy~F~dd~~tCV~i~~pa--- 781 (1289)
T KOG1214|consen 711 GTGVDYTCECSSGYQG---DGRNCVDENECATGFHRCGPNSVCINLPGS---YRCECRSGYEFADDRHTCVLITPPA--- 781 (1289)
T ss_pred CCCcceEEEEeeccCC---CCCCCCChhhhccCCCCCCCCceeecCCCc---eeEEEeecceeccCCcceEEecCCC---
Confidence 4555667778777643 44557888999654 49999999999997 9999999984 34464432211
Q ss_pred ccCceeeeeeeecccCCCCCCCCC-CCcCCCcccccccccccccccceeeccccccCCCcccc-CCCCcccccCCCCCCC
Q psy9686 563 MKLKTLVMVQYYFRRKSTSPHKSR-GQGQSGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKD-GNLSFYCNCTEDFTGE 640 (769)
Q Consensus 563 ~~~~~~~~~~~~~~~Cs~~pC~~~-g~C~~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n-~~gsy~C~C~~Gy~G~ 640 (769)
..++|..+ ..|. + ...+.|+. +.++|.|.|.|||.|+
T Consensus 782 ----------------p~n~Ce~g~h~C~------i-------------------~g~a~c~~hGgs~y~C~CLPGfsGD 820 (1289)
T KOG1214|consen 782 ----------------PANPCEDGSHTCA------I-------------------AGQARCVHHGGSTYSCACLPGFSGD 820 (1289)
T ss_pred ----------------CCCccccCccccC------c-------------------CCceEEEecCCceEEEeecCCccCC
Confidence 24555443 1121 0 11234554 4567999999999876
Q ss_pred --ccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc--CCccccC----CCCCCCCCEEecCCCC
Q psy9686 641 --YCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE--NSAACVT----LNPCQNNATCVASPGD 712 (769)
Q Consensus 641 --~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~--~ideC~s----~~pC~ngg~Cvnt~Gs 712 (769)
.|. ++|+|. ++.|+..++|.+++|+ |.|+|.+||.|+|..|... ....|+. +-.|+..+.|.-....
T Consensus 821 G~~c~--dvDeC~-psrChp~A~Cyntpgs--fsC~C~pGy~GDGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp 895 (1289)
T KOG1214|consen 821 GHQCT--DVDECS-PSRCHPAATCYNTPGS--FSCRCQPGYYGDGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDP 895 (1289)
T ss_pred ccccc--cccccC-ccccCCCceEecCCCc--ceeecccCccCCCceecCCCccCCccccccccceeeccccceeEeeCC
Confidence 666 689998 8899999999999987 9999999999999999741 1234541 1236554433332223
Q ss_pred CceEEecCCCCCC
Q psy9686 713 KQITCLCLKGFEG 725 (769)
Q Consensus 713 ~~y~C~C~~GytG 725 (769)
++|.|.|.++-.|
T Consensus 896 ~~~e~p~~~~ppG 908 (1289)
T KOG1214|consen 896 DGHEVPGTQTPPG 908 (1289)
T ss_pred CcccCCCCCCCCC
Confidence 4488888766655
No 8
>KOG1219|consensus
Probab=99.49 E-value=3.5e-14 Score=172.87 Aligned_cols=120 Identities=28% Similarity=0.662 Sum_probs=102.3
Q ss_pred CCCcccc-CCCCCCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCC
Q psy9686 509 KPGAAAL-GGCAGRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRG 587 (769)
Q Consensus 509 ~~gc~di-deC~s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g 587 (769)
.+||... +.|..+||+|+|+|+..+++ +|.|.|++.|+|+.||. ....|.++||.++
T Consensus 3857 ~pgC~l~~d~C~~npCqhgG~C~~~~~g--gy~CkCpsqysG~~CEi-------------------~~epC~snPC~~G- 3914 (4289)
T KOG1219|consen 3857 QPGCSLLTDPCNDNPCQHGGTCISQPKG--GYKCKCPSQYSGNHCEI-------------------DLEPCASNPCLTG- 3914 (4289)
T ss_pred cccccccccccccCcccCCCEecCCCCC--ceEEeCcccccCccccc-------------------ccccccCCCCCCC-
Confidence 3455544 89999999999999999886 79999999999999943 2344578888554
Q ss_pred CcCCCcccccccccccccccceeeccccccCCCccccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCC
Q psy9686 588 QGQSGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPG 667 (769)
Q Consensus 588 ~C~~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~G 667 (769)
|+|+...+.|.|.|+.||+|..|+...+++|+ .++|.++|.|++..|
T Consensus 3915 --------------------------------gtCip~~n~f~CnC~~gyTG~~Ce~~Gi~eCs-~n~C~~gg~C~n~~g 3961 (4289)
T KOG1219|consen 3915 --------------------------------GTCIPFYNGFLCNCPNGYTGKRCEARGISECS-KNVCGTGGQCINIPG 3961 (4289)
T ss_pred --------------------------------CEEEecCCCeeEeCCCCccCceeecccccccc-cccccCCceeeccCC
Confidence 55677778899999999999999986689998 899999999999999
Q ss_pred CCceEeecCCCCeecCCCcc
Q psy9686 668 DKQITCLCLKGGTVSSEYCQ 687 (769)
Q Consensus 668 s~~y~C~C~~Gftg~G~~C~ 687 (769)
+ |.|.|.+||. |.+|.
T Consensus 3962 s--f~CncT~g~~--gr~c~ 3977 (4289)
T KOG1219|consen 3962 S--FHCNCTPGIL--GRTCC 3977 (4289)
T ss_pred c--eEeccChhHh--cccCc
Confidence 8 9999999999 77775
No 9
>KOG1219|consensus
Probab=99.43 E-value=1.8e-13 Score=166.98 Aligned_cols=112 Identities=23% Similarity=0.548 Sum_probs=102.0
Q ss_pred eeeccccccCCCccccCC-CCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcc
Q psy9686 609 VMVQYYLQQNISPLKDGN-LSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQ 687 (769)
Q Consensus 609 ~~~~~~~c~ngg~C~n~~-gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~ 687 (769)
..+..+.|+++|+|...+ +.|.|.|++.|+|..|+.+ +.+|. ++||..+++|+...++ |.|.|+.||+ |.+|+
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~-~epC~-snPC~~GgtCip~~n~--f~CnC~~gyT--G~~Ce 3938 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID-LEPCA-SNPCLTGGTCIPFYNG--FLCNCPNGYT--GKRCE 3938 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccc-ccccc-CCCCCCCCEEEecCCC--eeEeCCCCcc--Cceee
Confidence 456677899999999865 6699999999999999985 88998 9999999999998876 9999999999 99999
Q ss_pred ccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCCC
Q psy9686 688 FENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHRE 729 (769)
Q Consensus 688 ~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce 729 (769)
.+-+++|. .++|.++|+|++.+|+ |.|.|.+||.|..|.
T Consensus 3939 ~~Gi~eCs-~n~C~~gg~C~n~~gs--f~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3939 ARGISECS-KNVCGTGGQCINIPGS--FHCNCTPGILGRTCC 3977 (4289)
T ss_pred cccccccc-cccccCCceeeccCCc--eEeccChhHhcccCc
Confidence 85589995 9999999999999998 999999999999995
No 10
>KOG1214|consensus
Probab=99.30 E-value=9e-12 Score=142.87 Aligned_cols=148 Identities=25% Similarity=0.590 Sum_probs=112.9
Q ss_pred CCCCCCCcEEeeCCCCccceEeeCCCCCc--CCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccccc
Q psy9686 520 GRPCRNNGTCTPVSGGVVNFTCTCPSGGQ--GQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEITP 597 (769)
Q Consensus 520 s~PC~ngGtCvn~~gs~~~y~C~C~~Gft--G~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C~~ 597 (769)
++-|..+..|...++. .|+|.|..||. |++|.++++|+. ....|
T Consensus 699 sh~cdt~a~C~pg~~~--~~tcecs~g~~gdgr~c~d~~eca~------------------~~~~C-------------- 744 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGV--DYTCECSSGYQGDGRNCVDENECAT------------------GFHRC-------------- 744 (1289)
T ss_pred CcccCCCccccCCCCc--ceEEEEeeccCCCCCCCCChhhhcc------------------CCCCC--------------
Confidence 4457777788877664 79999999996 567988888753 12223
Q ss_pred ccccccccccceeeccccccCCCccccCCCCcccccCCCCC----CCccccc----ccccccc-CCCCCCCCe--EeeCC
Q psy9686 598 VYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFT----GEYCQFE----NSAACVT-LNPCQNNAT--CVASP 666 (769)
Q Consensus 598 gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~----G~~Ce~~----~ideC~~-~~pC~n~gt--Cvnt~ 666 (769)
..++.|++.+++|.|.|..||. +-+|-.. .++.|.. .+.|...+. |+..
T Consensus 745 -------------------Gp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~h- 804 (1289)
T KOG1214|consen 745 -------------------GPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHH- 804 (1289)
T ss_pred -------------------CCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEec-
Confidence 3456789999999999999984 3355421 2456664 466776654 4444
Q ss_pred CCCceEeecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686 667 GDKQITCLCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGP 726 (769)
Q Consensus 667 Gs~~y~C~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~ 726 (769)
|.+.|.|.|.|||.|+|..|. ++|+|. ++.|+..+.|++++|+ +.|+|.+||.|.
T Consensus 805 Ggs~y~C~CLPGfsGDG~~c~--dvDeC~-psrChp~A~Cyntpgs--fsC~C~pGy~GD 859 (1289)
T KOG1214|consen 805 GGSTYSCACLPGFSGDGHQCT--DVDECS-PSRCHPAATCYNTPGS--FSCRCQPGYYGD 859 (1289)
T ss_pred CCceEEEeecCCccCCccccc--cccccC-ccccCCCceEecCCCc--ceeecccCccCC
Confidence 334799999999999999998 789996 8999999999999988 999999999854
No 11
>KOG4289|consensus
Probab=99.28 E-value=9.1e-12 Score=148.16 Aligned_cols=109 Identities=31% Similarity=0.641 Sum_probs=85.8
Q ss_pred cCCCCCCCCCCCcEEeeCC-------------------CCccceEeeCCCCCcCCCC-cccccchhccccCceeeeeeee
Q psy9686 515 LGGCAGRPCRNNGTCTPVS-------------------GGVVNFTCTCPSGGQGQSG-LEITPVYLWLMKLKTLVMVQYY 574 (769)
Q Consensus 515 ideC~s~PC~ngGtCvn~~-------------------gs~~~y~C~C~~GftG~~C-edideC~~~~~~~~~~~~~~~~ 574 (769)
-+.|...||.|...|+... ..+.++.|+||+||+|..| ++||+|+
T Consensus 1179 DniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCY--------------- 1243 (2531)
T KOG4289|consen 1179 DNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCY--------------- 1243 (2531)
T ss_pred CchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhh---------------
Confidence 4679999999999896421 0124599999999999999 5678875
Q ss_pred cccCCCCCCCCCCCcCCCcccccccccccccccceeeccccccCCCccccCCCCcccccCCCCCCCcccccc-ccccccC
Q psy9686 575 FRRKSTSPHKSRGQGQSGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFTGEYCQFEN-SAACVTL 653 (769)
Q Consensus 575 ~~~Cs~~pC~~~g~C~~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~-ideC~~~ 653 (769)
+.||+ +++.|....|+|+|.|.+||+|++|+.+. .-.|. +
T Consensus 1244 -----s~pC~---------------------------------nng~C~srEggYtCeCrpg~tGehCEvs~~agrCv-p 1284 (2531)
T KOG4289|consen 1244 -----SGPCG---------------------------------NNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCV-P 1284 (2531)
T ss_pred -----cCCCC---------------------------------CCCceEEecCceeEEecCCccccceeeecccCccc-c
Confidence 57774 44567777899999999999999999541 12476 7
Q ss_pred CCCCCCCeEeeCCCCCceEeecCCC
Q psy9686 654 NPCQNNATCVASPGDKQITCLCLKG 678 (769)
Q Consensus 654 ~pC~n~gtCvnt~Gs~~y~C~C~~G 678 (769)
..|.++++|++...+ +|.|.|+.|
T Consensus 1285 GvC~nggtC~~~~ng-gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1285 GVCKNGGTCVNLLNG-GFCCHCPYG 1308 (2531)
T ss_pred ceecCCCEEeecCCC-ceeccCCCc
Confidence 889999999987643 699999987
No 12
>KOG1217|consensus
Probab=99.20 E-value=2.5e-10 Score=128.22 Aligned_cols=204 Identities=25% Similarity=0.494 Sum_probs=141.6
Q ss_pred CCCC--CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCccc---ccchhccccCceeeeeeeecccCCCCCCCCCCCcC
Q psy9686 516 GGCA--GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEI---TPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQ 590 (769)
Q Consensus 516 deC~--s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~Cedi---deC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~ 590 (769)
++|. ..+|.+++.|.+..++ |.|.|++||.+..|+.. ..|.. . ..
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~---~~C~c~~~~~~~~~~~~~~~~~c~~----------~----~~------------- 219 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGS---YLCSCPPGYTGSTCETTGNGGTCVD----------S----VA------------- 219 (487)
T ss_pred cccccCCCCcCCCcccccCCCC---eeEeCCCCccCCcCcCCCCCceEec----------c----ee-------------
Confidence 5786 4569999999999887 99999999999998542 11110 0 00
Q ss_pred CCcccccccccccccccceeeccccccCCCccccCCCCcccccCCCCCCCc---cccccccccccCCCCCCCCeEeeCCC
Q psy9686 591 SGLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNCTEDFTGEY---CQFENSAACVTLNPCQNNATCVASPG 667 (769)
Q Consensus 591 ~~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~---Ce~~~ideC~~~~pC~n~gtCvnt~G 667 (769)
..+.+++.+...... .......+ +.|.+..++|.|.|++||.+.. |. ++++|....+|.++++|++..+
T Consensus 220 --~~~~~g~~~~~c~~~---~~~~~~~~-~~c~~~~~~~~C~~~~g~~~~~~~~~~--~~~~C~~~~~c~~~~~C~~~~~ 291 (487)
T KOG1217|consen 220 --CSCPPGARGPECEVS---IVECASGD-GTCVNTVGSYTCRCPEGYTGDACVTCV--DVDSCALIASCPNGGTCVNVPG 291 (487)
T ss_pred --ccCCCCCCCCCcccc---cccccCCC-CcccccCCceeeeCCCCccccccceee--eccccCCCCccCCCCeeecCCC
Confidence 112222221111100 00000113 7899999999999999999986 33 5788983324999999999987
Q ss_pred CCceEeecCCCCeecCCCc-cccCCcccc---CCCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCCCCCCCCCCC-CC
Q psy9686 668 DKQITCLCLKGGTVSSEYC-QFENSAACV---TLNPCQNNATCVASPGDKQITCLCLKGFEGPHRELPVESVDEPPS-ED 742 (769)
Q Consensus 668 s~~y~C~C~~Gftg~G~~C-~~~~ideC~---s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~~npC-~~ 742 (769)
. |.|.|++||+ |..| ...+.++|. ...+|.+++.|........+.|.|..||.|..|+...++|...++ .+
T Consensus 292 ~--~~C~C~~g~~--g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~ 367 (487)
T KOG1217|consen 292 S--YRCTCPPGFT--GRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPG 367 (487)
T ss_pred c--ceeeCCCCCC--CCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccC
Confidence 6 9999999999 6665 221446774 256799988994333333488999999999999854458998866 78
Q ss_pred Cceee-CCCCccccCCC-Ccc
Q psy9686 743 ETSVD-LQLGSQANSYN-WAH 761 (769)
Q Consensus 743 GtC~n-~~ggy~C~~~~-w~~ 761 (769)
++|++ ..++|.|.... +.+
T Consensus 368 ~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 368 GTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred CEeccCCCCCeEecCCCcccc
Confidence 99999 79999998763 443
No 13
>KOG1217|consensus
Probab=99.03 E-value=3.3e-09 Score=119.18 Aligned_cols=228 Identities=24% Similarity=0.390 Sum_probs=144.7
Q ss_pred CCCCCCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccc
Q psy9686 516 GGCAGRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEI 595 (769)
Q Consensus 516 deC~s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C 595 (769)
..|...+....+.|...... |.|.|++||.|..|+...+|... .. .......|...+. ........|
T Consensus 90 ~~~~~~~~~~~~~~~~~~~~---~~c~c~~g~~~~~~~~~~~C~~~----~~---~~~~~~~c~~~~~---~~~~~~c~C 156 (487)
T KOG1217|consen 90 PPCRSPCLLLCGECVDCVGS---YECTCPPGYQGTPCEGECECVTG----PG---VCCIDGSCSNGPG---SVGPFRCSC 156 (487)
T ss_pred ccccCCcccCCccccCCCCC---ceeeCCCccccCcCCcceeecCC----CC---CeeCchhhcCCCC---CCCceeeee
Confidence 34555555556666666665 99999999999988443234320 00 0011122222222 112222467
Q ss_pred ccccccccccccceeec--cccccCCCccccCCCCcccccCCCCCCCccccc-------c-----------ccccc-cCC
Q psy9686 596 TPVYLWLMKMKTLVMVQ--YYLQQNISPLKDGNLSFYCNCTEDFTGEYCQFE-------N-----------SAACV-TLN 654 (769)
Q Consensus 596 ~~gy~~~~~~~~C~~~~--~~~c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~-------~-----------ideC~-~~~ 654 (769)
..+|.+.........+. ...|.+++.|.+..++|.|.|+++|.+..|+.. + .+.|. ...
T Consensus 157 ~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~ 236 (487)
T KOG1217|consen 157 TEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIV 236 (487)
T ss_pred CCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccc
Confidence 78887765553322222 234888999999999999999999999887732 0 11222 122
Q ss_pred CCCCC-CeEeeCCCCCceEeecCCCCeecC-CCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCC--CC
Q psy9686 655 PCQNN-ATCVASPGDKQITCLCLKGGTVSS-EYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHR--EL 730 (769)
Q Consensus 655 pC~n~-gtCvnt~Gs~~y~C~C~~Gftg~G-~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C--e~ 730 (769)
.|..+ ++|++..++ |.|.|++||++.. ..|. +++.|....+|.++++|++..+. |.|.|++||+|..| ..
T Consensus 237 ~~~~~~~~c~~~~~~--~~C~~~~g~~~~~~~~~~--~~~~C~~~~~c~~~~~C~~~~~~--~~C~C~~g~~g~~~~~~~ 310 (487)
T KOG1217|consen 237 ECASGDGTCVNTVGS--YTCRCPEGYTGDACVTCV--DVDSCALIASCPNGGTCVNVPGS--YRCTCPPGFTGRLCTECV 310 (487)
T ss_pred cccCCCCcccccCCc--eeeeCCCCccccccceee--eccccCCCCccCCCCeeecCCCc--ceeeCCCCCCCCCCcccc
Confidence 34433 889988877 9999999999443 2344 67889733249999999999876 99999999999998 23
Q ss_pred CCCCCC----CCCC-CCCce--eeCCCCccccCCC-Cccc
Q psy9686 731 PVESVD----EPPS-EDETS--VDLQLGSQANSYN-WAHM 762 (769)
Q Consensus 731 dideC~----~npC-~~GtC--~n~~ggy~C~~~~-w~~~ 762 (769)
+.++|. ..+| ++++| .+..+.+.|...+ |.++
T Consensus 311 ~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~ 350 (487)
T KOG1217|consen 311 DVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR 350 (487)
T ss_pred ccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC
Confidence 456784 4566 45578 4555567777664 3443
No 14
>KOG4289|consensus
Probab=98.99 E-value=4.7e-10 Score=134.03 Aligned_cols=100 Identities=36% Similarity=0.766 Sum_probs=85.8
Q ss_pred ccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc-CCccccCCCCCC
Q psy9686 623 KDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE-NSAACVTLNPCQ 701 (769)
Q Consensus 623 ~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~-~ideC~s~~pC~ 701 (769)
++..+++.|.|++||+|+.|+.. +|.|- +.||.++++|....|+ |+|.|.+||+ |++|+.. ..-.|. +..|.
T Consensus 1216 i~pvnglrCrCPpGFTgd~CeTe-iDlCY-s~pC~nng~C~srEgg--YtCeCrpg~t--GehCEvs~~agrCv-pGvC~ 1288 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYCETE-IDLCY-SGPCGNNGRCRSREGG--YTCECRPGFT--GEHCEVSARAGRCV-PGVCK 1288 (2531)
T ss_pred ccccCceeEeCCCCCCcccccch-hHhhh-cCCCCCCCceEEecCc--eeEEecCCcc--ccceeeecccCccc-cceec
Confidence 45567899999999999999985 99998 8999999999999887 9999999999 9999852 123575 88899
Q ss_pred CCCEEecCCCCCceEEecCCC-CCCCCCCC
Q psy9686 702 NNATCVASPGDKQITCLCLKG-FEGPHREL 730 (769)
Q Consensus 702 ngg~Cvnt~Gs~~y~C~C~~G-ytG~~Ce~ 730 (769)
|+++|++... +++.|+|+.| |++++|+.
T Consensus 1289 nggtC~~~~n-ggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1289 NGGTCVNLLN-GGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred CCCEEeecCC-CceeccCCCcccCCCceEE
Confidence 9999998754 3599999998 67899985
No 15
>KOG1225|consensus
Probab=98.77 E-value=2.6e-08 Score=113.94 Aligned_cols=158 Identities=24% Similarity=0.423 Sum_probs=109.4
Q ss_pred eEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCC-cccccccccccccc-cceeeccccc
Q psy9686 539 FTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSG-LEITPVYLWLMKMK-TLVMVQYYLQ 616 (769)
Q Consensus 539 y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~-~~C~~gy~~~~~~~-~C~~~~~~~c 616 (769)
+.|.|+.+|.|..|+. -.| +.-|...+.|.++ +.|.+||.+..+.. .|.. . |
T Consensus 234 ~ic~c~~~~~g~~c~~-~~C---------------------~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~---~-c 287 (525)
T KOG1225|consen 234 GICECPEGYFGPLCST-IYC---------------------PGGCTGRGQCVEGRCICPPGFTGDDCDELVCPV---D-C 287 (525)
T ss_pred ceeecCCceeCCcccc-ccC---------------------CCCCcccceEeCCeEeCCCCCcCCCCCcccCCc---c-c
Confidence 5899999999999963 333 4455556777776 57899987744332 3332 2 6
Q ss_pred cCCCccccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCccccCCccccC
Q psy9686 617 QNISPLKDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFENSAACVT 696 (769)
Q Consensus 617 ~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~~ideC~s 696 (769)
+.++.|++. .|.|.+||.|+.|+. ..| +..|.++|.|++ | +|.|.+||+ |..|..
T Consensus 288 s~~g~~~~g----~CiC~~g~~G~dCs~---~~c--padC~g~G~Ci~--G----~C~C~~Gy~--G~~C~~-------- 342 (525)
T KOG1225|consen 288 SGGGVCVDG----ECICNPGYSGKDCSI---RRC--PADCSGHGKCID--G----ECLCDEGYT--GELCIQ-------- 342 (525)
T ss_pred CCCceecCC----EeecCCCcccccccc---ccC--CccCCCCCcccC--C----ceEeCCCCc--CCcccc--------
Confidence 677777766 899999999999984 357 678999999983 3 599999999 888874
Q ss_pred CCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCCCCCCC--CCCCCCCceeeCCCCccccCC
Q psy9686 697 LNPCQNNATCVASPGDKQITCLCLKGFEGPHRELPVESVD--EPPSEDETSVDLQLGSQANSY 757 (769)
Q Consensus 697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~--~npC~~GtC~n~~ggy~C~~~ 757 (769)
. .|.+++.|++ - |.|..||.|.+ ...+.+. ..+...+.|+...+...+..+
T Consensus 343 ~-~C~~~g~cv~------g-C~C~~Gw~G~d--~~~~~~~~~~~cs~~~~~~~~~~~~~~~~~ 395 (525)
T KOG1225|consen 343 R-ACSGGGQCVN------G-CKCKKGWRGPD--VADPSLLLITECSPPSLCIAGVGRRRVTHC 395 (525)
T ss_pred c-ccCCCceecc------C-ceeccCccCCC--cCCchhhcccccCCCceeeccccccccccc
Confidence 1 3788888886 2 99999999998 2222222 122345566666644444443
No 16
>KOG1225|consensus
Probab=98.47 E-value=6.1e-07 Score=102.86 Aligned_cols=88 Identities=27% Similarity=0.722 Sum_probs=72.2
Q ss_pred ccCCCccccCCCCcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCccccCCcccc
Q psy9686 616 QQNISPLKDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFENSAACV 695 (769)
Q Consensus 616 c~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~~ideC~ 695 (769)
|.+.+.|+++ +|.|++||+|.+|+. -.| +..|+.++.|++. .|.|.+||+ |+.|+.+ .|
T Consensus 256 c~~~g~c~~G----~CIC~~Gf~G~dC~e---~~C--p~~cs~~g~~~~g------~CiC~~g~~--G~dCs~~---~c- 314 (525)
T KOG1225|consen 256 CTGRGQCVEG----RCICPPGFTGDDCDE---LVC--PVDCSGGGVCVDG------ECICNPGYS--GKDCSIR---RC- 314 (525)
T ss_pred CcccceEeCC----eEeCCCCCcCCCCCc---ccC--CcccCCCceecCC------EeecCCCcc--ccccccc---cC-
Confidence 3444678777 899999999999984 358 4559888888764 699999999 9999853 47
Q ss_pred CCCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCC
Q psy9686 696 TLNPCQNNATCVASPGDKQITCLCLKGFEGPHRELP 731 (769)
Q Consensus 696 s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~d 731 (769)
+.+|.++|.|++ | +|.|.+||+|..|+..
T Consensus 315 -padC~g~G~Ci~--G----~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 315 -PADCSGHGKCID--G----ECLCDEGYTGELCIQR 343 (525)
T ss_pred -CccCCCCCcccC--C----ceEeCCCCcCCccccc
Confidence 688999999993 3 8999999999999865
No 17
>KOG1226|consensus
Probab=98.28 E-value=4.7e-06 Score=97.44 Aligned_cols=101 Identities=24% Similarity=0.613 Sum_probs=67.8
Q ss_pred ccccCCCCC----CCcccccccccccc--CCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc-CCccccCC--CCC
Q psy9686 630 YCNCTEDFT----GEYCQFENSAACVT--LNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE-NSAACVTL--NPC 700 (769)
Q Consensus 630 ~C~C~~Gy~----G~~Ce~~~ideC~~--~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~-~ideC~s~--~pC 700 (769)
.|.|.+... |.+|+.++ -.|.. ...|..+|+|.-. +|+|.+||+ |..|+.. +.+.|.++ ..|
T Consensus 526 qC~C~~~~~~~i~G~fCECDn-fsC~r~~g~lC~g~G~C~CG------~CvC~~Gwt--G~~C~C~~std~C~~~~G~iC 596 (783)
T KOG1226|consen 526 QCVCHKPDNGKIYGKFCECDN-FSCERHKGVLCGGHGRCECG------RCVCNPGWT--GSACNCPLSTDTCESSDGQIC 596 (783)
T ss_pred ceEecCCCCCceeeeeeeccC-cccccccCcccCCCCeEeCC------cEEcCCCCc--cCCCCCCCCCccccCCCCcee
Confidence 577776665 88999753 24641 2348888999764 599999999 6666532 56678643 248
Q ss_pred CCCCEEecCCCCCceEEecCCC-CCCCCCCCCCCCCCCCCCCCCcee
Q psy9686 701 QNNATCVASPGDKQITCLCLKG-FEGPHRELPVESVDEPPSEDETSV 746 (769)
Q Consensus 701 ~ngg~Cvnt~Gs~~y~C~C~~G-ytG~~Ce~dideC~~npC~~GtC~ 746 (769)
...|+|.-. +|+|... |.|..||... .|...+-.+..|+
T Consensus 597 SGrG~C~Cg------~C~C~~~~~sG~~CE~cp-tc~~~C~~~~~Cv 636 (783)
T KOG1226|consen 597 SGRGTCECG------RCKCTDPPYSGEFCEKCP-TCPDPCAENKSCV 636 (783)
T ss_pred CCCceeeCC------ceEcCCCCcCcchhhcCC-CCCCcccccccch
Confidence 888888864 7999876 9999998533 3443322444453
No 18
>KOG4260|consensus
Probab=98.21 E-value=1.4e-06 Score=90.61 Aligned_cols=150 Identities=22% Similarity=0.468 Sum_probs=92.1
Q ss_pred CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccccccc
Q psy9686 520 GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEITPVY 599 (769)
Q Consensus 520 s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C~~gy 599 (769)
..||..+|.|.-.....+.-.|.|.+||+|..|.+-+.=+ +. +..-..+
T Consensus 149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~ey---------------fe---s~Rne~~------------- 197 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEY---------------FE---SSRNEQH------------- 197 (350)
T ss_pred cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHH---------------HH---hhccccc-------------
Confidence 4578888999754333234689999999999984311100 00 0000000
Q ss_pred ccccccccceeeccccccCCCccccCCCCccc-ccCCCCCCC--cccccccccccc-CCCCCCCCeEeeCCCCCceEeec
Q psy9686 600 LWLMKMKTLVMVQYYLQQNISPLKDGNLSFYC-NCTEDFTGE--YCQFENSAACVT-LNPCQNNATCVASPGDKQITCLC 675 (769)
Q Consensus 600 ~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C-~C~~Gy~G~--~Ce~~~ideC~~-~~pC~n~gtCvnt~Gs~~y~C~C 675 (769)
..|..+.+.+ .+.|... ++-.| .|..||... .|- |+|+|.. +.+|.....|+|+.|+ |.|.+
T Consensus 198 ------lvCt~Ch~~C---~~~Csg~-~~k~C~kCkkGW~lde~gCv--DvnEC~~ep~~c~~~qfCvNteGS--f~C~d 263 (350)
T KOG4260|consen 198 ------LVCTACHEGC---LGVCSGE-SSKGCSKCKKGWKLDEEGCV--DVNECQNEPAPCKAHQFCVNTEGS--FKCED 263 (350)
T ss_pred ------chhhhhhhhh---hcccCCC-CCCChhhhcccceecccccc--cHHHHhcCCCCCChhheeecCCCc--eEecc
Confidence 1111111100 1234332 23345 599999754 666 8999983 6789988899999988 99999
Q ss_pred CCCCeecCCCccccCCccccC-CCCCC-CCCEEecCCCCCceEEecCCCCC
Q psy9686 676 LKGGTVSSEYCQFENSAACVT-LNPCQ-NNATCVASPGDKQITCLCLKGFE 724 (769)
Q Consensus 676 ~~Gftg~G~~C~~~~ideC~s-~~pC~-ngg~Cvnt~Gs~~y~C~C~~Gyt 724 (769)
.+||.+ | +|+|.. ...|. .+..|.++++. |+|.|..|+.
T Consensus 264 k~Gy~~-g-------~d~C~~~~d~~~~kn~~c~ni~~~--~r~v~f~~~~ 304 (350)
T KOG4260|consen 264 KEGYKK-G-------VDECQFCADVCASKNRPCMNIDGQ--YRCVCFSGLI 304 (350)
T ss_pred cccccC-C-------hHHhhhhhhhcccCCCCcccCCcc--EEEEecccce
Confidence 999974 2 334430 13343 23468888877 9999999873
No 19
>KOG4260|consensus
Probab=98.20 E-value=2.3e-06 Score=89.03 Aligned_cols=120 Identities=19% Similarity=0.466 Sum_probs=82.5
Q ss_pred CCCcccccCCCCCCCcccccccccccc-C----C---CCCC--CCeEeeCCCCCceEe-ecCCCCeecCCCccccCCccc
Q psy9686 626 NLSFYCNCTEDFTGEYCQFENSAACVT-L----N---PCQN--NATCVASPGDKQITC-LCLKGGTVSSEYCQFENSAAC 694 (769)
Q Consensus 626 ~gsy~C~C~~Gy~G~~Ce~~~ideC~~-~----~---pC~n--~gtCvnt~Gs~~y~C-~C~~Gftg~G~~C~~~~ideC 694 (769)
.|+-.|.|.+||.|+.|..-.+..=.+ . . .|+. .+.|.... + -.| .|..||..+-..|. |+|+|
T Consensus 165 ~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~-~--k~C~kCkkGW~lde~gCv--DvnEC 239 (350)
T KOG4260|consen 165 EGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGES-S--KGCSKCKKGWKLDEEGCV--DVNEC 239 (350)
T ss_pred CCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcccCCCC-C--CChhhhcccceecccccc--cHHHH
Confidence 456689999999999887311100000 0 0 1222 12453321 1 345 59999998877888 89999
Q ss_pred cC-CCCCCCCCEEecCCCCCceEEecCCCCCCCCCCCCCCCCCC--CCC--CCCceeeCCCCccccCC
Q psy9686 695 VT-LNPCQNNATCVASPGDKQITCLCLKGFEGPHRELPVESVDE--PPS--EDETSVDLQLGSQANSY 757 (769)
Q Consensus 695 ~s-~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~~--npC--~~GtC~n~~ggy~C~~~ 757 (769)
.. +.+|.....|+|+.|| |.|.+++||.+. +|+|.- .-| .++.|-|+.++|+|-.+
T Consensus 240 ~~ep~~c~~~qfCvNteGS--f~C~dk~Gy~~g-----~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f 300 (350)
T KOG4260|consen 240 QNEPAPCKAHQFCVNTEGS--FKCEDKEGYKKG-----VDECQFCADVCASKNRPCMNIDGQYRCVCF 300 (350)
T ss_pred hcCCCCCChhheeecCCCc--eEecccccccCC-----hHHhhhhhhhcccCCCCcccCCccEEEEec
Confidence 73 5679888899999998 999999999873 455554 223 57889999999998765
No 20
>KOG0994|consensus
Probab=97.88 E-value=4.6e-05 Score=91.40 Aligned_cols=208 Identities=16% Similarity=0.293 Sum_probs=113.4
Q ss_pred eccccccccccCCCccccCCCCCCCCCCC--------cEEeeCCCCccceEeeCCCCCcCCCCcccccchhc----cccC
Q psy9686 498 SQELFLSSALKKPGAAALGGCAGRPCRNN--------GTCTPVSGGVVNFTCTCPSGGQGQSGLEITPVYLW----LMKL 565 (769)
Q Consensus 498 ~~~~~~~~~~~~~gc~dideC~s~PC~ng--------GtCvn~~gs~~~y~C~C~~GftG~~CedideC~~~----~~~~ 565 (769)
|..+|++.-.-..+ ..|.+-||..+ -.|...+.. ....|.|.+||+|.+|+. |+.+ ...+
T Consensus 890 Cl~GyyGdP~lg~g----~~CrPCpCP~gp~Sg~~~A~sC~~d~~t-~~ivC~C~~GY~G~RCe~---CA~~~fGnP~~G 961 (1758)
T KOG0994|consen 890 CLDGYYGDPRLGSG----IGCRPCPCPDGPASGRQHADSCYLDTRT-QQIVCHCQEGYSGSRCEI---CADNHFGNPSEG 961 (1758)
T ss_pred hhccccCCcccCCC----CCCCCCCCCCCCccchhccccccccccc-cceeeecccCccccchhh---hcccccCCcccC
Confidence 55566554333322 34555555443 246544332 457899999999999953 5544 3347
Q ss_pred ceeeeee--eecccCCCCCCCC-CCCcCC---------CcccccccccccccccceeeccccccCCCccccCCCCccccc
Q psy9686 566 KTLVMVQ--YYFRRKSTSPHKS-RGQGQS---------GLEITPVYLWLMKMKTLVMVQYYLQQNISPLKDGNLSFYCNC 633 (769)
Q Consensus 566 ~~~~~~~--~~~~~Cs~~pC~~-~g~C~~---------~~~C~~gy~~~~~~~~C~~~~~~~c~ngg~C~n~~gsy~C~C 633 (769)
++|..|+ ..++.=.+..|.. .|.|.. +--|.+||.+......|..+.-+.-.-+.+|.=..-+-.|-|
T Consensus 962 GtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpC 1041 (1758)
T KOG0994|consen 962 GTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTGQCPC 1041 (1758)
T ss_pred CccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhheccccccCCccccccccCcCCC
Confidence 8885443 3444444555543 244433 236899998876666665544333222222222223347888
Q ss_pred CCCCCCCcccccccccccc------------CCCCCC--CCeEeeCCCCCceEeecCCCCeecCCCcccc------CCc-
Q psy9686 634 TEDFTGEYCQFENSAACVT------------LNPCQN--NATCVASPGDKQITCLCLKGGTVSSEYCQFE------NSA- 692 (769)
Q Consensus 634 ~~Gy~G~~Ce~~~ideC~~------------~~pC~n--~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~------~id- 692 (769)
.+...|..|+. |+. +-.|.. +-+|....| .|.|.|||- |.+|..- +.+
T Consensus 1042 lpNv~G~~CDq-----CA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG----QCqCkpGfG--GR~C~qCqel~WGdP~~ 1110 (1758)
T KOG0994|consen 1042 LPNVQGVRCDQ-----CAENHWNLASGEGCEPCNCDPIGGPQCNEFTG----QCQCKPGFG--GRTCSQCQELYWGDPNE 1110 (1758)
T ss_pred Ccccccccccc-----cccchhccccCCCCCccCCCccCCcccccccc----ceeccCCCC--CcchhHHHHhhcCCCCC
Confidence 88888887762 431 112222 225544444 599999997 7777620 000
Q ss_pred cccCCCCCCCCC----EEecCCCCCceEEecCCCCCCCCCC
Q psy9686 693 ACVTLNPCQNNA----TCVASPGDKQITCLCLKGFEGPHRE 729 (769)
Q Consensus 693 eC~s~~pC~ngg----~Cvnt~Gs~~y~C~C~~GytG~~Ce 729 (769)
.|. .-.|...| +|....| +|.|.+|-.|..|.
T Consensus 1111 ~C~-aCdCd~rG~~tpQCdr~tG----~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1111 KCR-ACDCDPRGIETPQCDRATG----RCVCRPGVGGPRCD 1146 (1758)
T ss_pred Cce-ecCCCCCCCCCCCccccCC----ceeecCCCCCcchh
Confidence 121 11233222 3444433 78888888888884
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.56 E-value=4.1e-05 Score=55.55 Aligned_cols=31 Identities=45% Similarity=1.241 Sum_probs=27.4
Q ss_pred CCCCCCCCCcEEeeCC-CCccceEeeCCCCCcCCC
Q psy9686 518 CAGRPCRNNGTCTPVS-GGVVNFTCTCPSGGQGQS 551 (769)
Q Consensus 518 C~s~PC~ngGtCvn~~-gs~~~y~C~C~~GftG~~ 551 (769)
|.++||.|+|+|++.. + .|.|.|++||+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~---~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGG---GYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTS---EEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCC---CEEeECCCCCccCC
Confidence 6678999999999988 4 49999999999974
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.47 E-value=0.00016 Score=53.66 Aligned_cols=36 Identities=39% Similarity=1.039 Sum_probs=32.1
Q ss_pred ccCCCCC-CCCCCCcEEeeCCCCccceEeeCCCCCc-CCCC
Q psy9686 514 ALGGCAG-RPCRNNGTCTPVSGGVVNFTCTCPSGGQ-GQSG 552 (769)
Q Consensus 514 dideC~s-~PC~ngGtCvn~~gs~~~y~C~C~~Gft-G~~C 552 (769)
++++|.. .||.++++|++..++ |.|.|++||. |..|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~---~~C~C~~g~~~g~~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGS---YRCECPPGYTDGRNC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCC---eEeECCCCCccCCcC
Confidence 3678987 799999999999887 9999999999 8877
No 23
>KOG1226|consensus
Probab=97.38 E-value=0.00047 Score=81.16 Aligned_cols=95 Identities=21% Similarity=0.651 Sum_probs=66.6
Q ss_pred ccccCCCCCCCcccccc--------cccccc---CCCCCCCCeEeeCCCCCceEeecCCCCee--cCCCccccCCccccC
Q psy9686 630 YCNCTEDFTGEYCQFEN--------SAACVT---LNPCQNNATCVASPGDKQITCLCLKGGTV--SSEYCQFENSAACVT 696 (769)
Q Consensus 630 ~C~C~~Gy~G~~Ce~~~--------ideC~~---~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg--~G~~C~~~~ideC~s 696 (769)
.|.|.+||.|+.|+... .+.|.. .-+|...|.|+=. +|.|.+...+ .|+.|+. +.-.|..
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG------qC~C~~~~~~~i~G~fCEC-DnfsC~r 551 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG------QCVCHKPDNGKIYGKFCEC-DNFSCER 551 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC------ceEecCCCCCceeeeeeec-cCccccc
Confidence 67899999999998631 133431 2268888888764 5999876442 4888986 3335642
Q ss_pred --CCCCCCCCEEecCCCCCceEEecCCCCCCCCCCC--CCCCCCC
Q psy9686 697 --LNPCQNNATCVASPGDKQITCLCLKGFEGPHREL--PVESVDE 737 (769)
Q Consensus 697 --~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~--dideC~~ 737 (769)
...|..+|.|.-. +|.|.+||+|..|+- +.+.|.+
T Consensus 552 ~~g~lC~g~G~C~CG------~CvC~~GwtG~~C~C~~std~C~~ 590 (783)
T KOG1226|consen 552 HKGVLCGGHGRCECG------RCVCNPGWTGSACNCPLSTDTCES 590 (783)
T ss_pred ccCcccCCCCeEeCC------cEEcCCCCccCCCCCCCCCccccC
Confidence 2359888898764 899999999998763 4555665
No 24
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.24 E-value=0.00015 Score=52.67 Aligned_cols=30 Identities=43% Similarity=0.953 Sum_probs=25.9
Q ss_pred CCCCCCCCEEecCCCCCceEEecCCCCCCCC
Q psy9686 697 LNPCQNNATCVASPGDKQITCLCLKGFEGPH 727 (769)
Q Consensus 697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~ 727 (769)
+++|.|+|+|++.. .++|+|+|++||+|++
T Consensus 3 ~~~C~n~g~C~~~~-~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 3 SNPCQNGGTCIDLP-GGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTSSTTTEEEEEES-TSEEEEEEBTTEESTT
T ss_pred CCcCCCCeEEEeCC-CCCEEeECCCCCccCC
Confidence 67999999999988 2349999999999974
No 25
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.24 E-value=0.00044 Score=51.20 Aligned_cols=36 Identities=42% Similarity=0.906 Sum_probs=31.3
Q ss_pred CccccCC-CCCCCCCEEecCCCCCceEEecCCCCC-CCCCC
Q psy9686 691 SAACVTL-NPCQNNATCVASPGDKQITCLCLKGFE-GPHRE 729 (769)
Q Consensus 691 ideC~s~-~pC~ngg~Cvnt~Gs~~y~C~C~~Gyt-G~~Ce 729 (769)
+++|. . .+|.+++.|+++.++ |.|.|++||. |..|+
T Consensus 2 ~~~C~-~~~~C~~~~~C~~~~g~--~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECA-SGNPCQNGGTCVNTVGS--YRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCc-CCCCcCCCCEeECCCCC--eEeECCCCCccCCcCC
Confidence 57785 4 789999999999887 9999999999 98884
No 26
>KOG0994|consensus
Probab=97.11 E-value=0.0013 Score=79.67 Aligned_cols=62 Identities=31% Similarity=0.674 Sum_probs=39.5
Q ss_pred eEe-ecCCCCeec-----CCCccccCCccccCCCCCCC--------CCEEecCCCCCceEEecCCCCCCCCCCCCCCCCC
Q psy9686 671 ITC-LCLKGGTVS-----SEYCQFENSAACVTLNPCQN--------NATCVASPGDKQITCLCLKGFEGPHRELPVESVD 736 (769)
Q Consensus 671 y~C-~C~~Gftg~-----G~~C~~~~ideC~s~~pC~n--------gg~Cvnt~Gs~~y~C~C~~GytG~~Ce~dideC~ 736 (769)
+.| +|..||.|+ |..|. +-||.. .-.|.-........|.|.+||+|.+|+. |+
T Consensus 885 ~~CdrCl~GyyGdP~lg~g~~Cr---------PCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~----CA 951 (1758)
T KOG0994|consen 885 HSCDRCLDGYYGDPRLGSGIGCR---------PCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEI----CA 951 (1758)
T ss_pred cchhhhhccccCCcccCCCCCCC---------CCCCCCCCccchhccccccccccccceeeecccCccccchhh----hc
Confidence 677 699999853 44444 223432 2245443333457899999999999974 55
Q ss_pred C----CCCCCCce
Q psy9686 737 E----PPSEDETS 745 (769)
Q Consensus 737 ~----npC~~GtC 745 (769)
+ +|-++|+|
T Consensus 952 ~~~fGnP~~GGtC 964 (1758)
T KOG0994|consen 952 DNHFGNPSEGGTC 964 (1758)
T ss_pred ccccCCcccCCcc
Confidence 4 56666665
No 27
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.06 E-value=0.00082 Score=49.03 Aligned_cols=35 Identities=40% Similarity=1.086 Sum_probs=31.2
Q ss_pred cCCCCC-CCCCCCcEEeeCCCCccceEeeCCCCCcCCCC
Q psy9686 515 LGGCAG-RPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSG 552 (769)
Q Consensus 515 ideC~s-~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~C 552 (769)
+++|.. .+|.+++.|.+..++ |.|.|++||.|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~---~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGS---YRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCC---eEeECCCCCcCCcC
Confidence 578877 799999999998887 99999999999877
No 28
>PRK15328 invasion protein IagB; Provisional
Probab=97.04 E-value=0.00012 Score=72.53 Aligned_cols=45 Identities=31% Similarity=0.299 Sum_probs=37.3
Q ss_pred cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccC
Q psy9686 294 RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCT 342 (769)
Q Consensus 294 ~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~ 342 (769)
+|++-..| ..|++.||+||+.|++. |.+|+.|+||+|||+.+|..
T Consensus 29 y~Idp~Ll---~AIa~~ES~f~p~avs~-n~~gs~d~GLMQIn~~~~~~ 73 (160)
T PRK15328 29 FNIESELL---YAIAQQESAMKPGAIGH-NRDGSTDLGLMQINSFHMKR 73 (160)
T ss_pred cCCCHHHH---HHHHHHHcCCCCCCEec-CCCCCcccChhhcCHHHHHH
Confidence 56665444 35999999999999985 67899999999999988753
No 29
>PRK13722 lytic transglycosylase; Provisional
Probab=96.97 E-value=0.00018 Score=71.72 Aligned_cols=44 Identities=23% Similarity=0.192 Sum_probs=35.4
Q ss_pred cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccccc
Q psy9686 294 RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWC 341 (769)
Q Consensus 294 ~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC 341 (769)
++++-.-|.- |+..||+||+.|++. |.+|+.|+||.|||+.++=
T Consensus 30 y~vdp~LL~A---Ii~~ES~fnp~Avs~-n~~gs~d~GLMQIn~~~~~ 73 (169)
T PRK13722 30 YKIDPDLLRA---ISWKESRYRVNAIGI-NPVTGYGSGLMQVDSQHFN 73 (169)
T ss_pred cCCCHHHHHH---HHHHHcCCCCCCeec-CCCCCeeecCCcccHHHHH
Confidence 5665544443 899999999999985 6789999999999997754
No 30
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.94 E-value=0.00051 Score=52.83 Aligned_cols=32 Identities=38% Similarity=1.043 Sum_probs=28.5
Q ss_pred ccCCCCC--CCCCCCcEEeeCCCCccceEeeCCCCCc
Q psy9686 514 ALGGCAG--RPCRNNGTCTPVSGGVVNFTCTCPSGGQ 548 (769)
Q Consensus 514 dideC~s--~PC~ngGtCvn~~gs~~~y~C~C~~Gft 548 (769)
|||||.. ++|..++.|+|+.|+ |.|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gs---y~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGS---YSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTE---EEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCC---EEeeCCCCcE
Confidence 6899964 469989999999998 9999999997
No 31
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=96.87 E-value=0.00074 Score=51.94 Aligned_cols=33 Identities=30% Similarity=0.733 Sum_probs=29.1
Q ss_pred ccccccc-CCCCCCCCeEeeCCCCCceEeecCCCCe
Q psy9686 646 NSAACVT-LNPCQNNATCVASPGDKQITCLCLKGGT 680 (769)
Q Consensus 646 ~ideC~~-~~pC~n~gtCvnt~Gs~~y~C~C~~Gft 680 (769)
|||||.. .+.|..++.|+|+.|+ |+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gs--y~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGS--YSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTE--EEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCC--EEeeCCCCcE
Confidence 4789984 5679989999999988 9999999998
No 32
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.76 E-value=0.0021 Score=46.79 Aligned_cols=36 Identities=42% Similarity=0.912 Sum_probs=30.5
Q ss_pred CccccCC-CCCCCCCEEecCCCCCceEEecCCCCCCCCCC
Q psy9686 691 SAACVTL-NPCQNNATCVASPGDKQITCLCLKGFEGPHRE 729 (769)
Q Consensus 691 ideC~s~-~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce 729 (769)
+++|. . .+|.+++.|++..++ |.|.|++||.|..|+
T Consensus 2 ~~~C~-~~~~C~~~~~C~~~~~~--~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECA-SGNPCQNGGTCVNTVGS--YRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCC-CCCCcCCCCEeECCCCC--eEeECCCCCcCCcCC
Confidence 46775 4 689988999999876 999999999998884
No 33
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=96.62 E-value=0.00056 Score=69.61 Aligned_cols=31 Identities=35% Similarity=0.531 Sum_probs=28.1
Q ss_pred eeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 307 IATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 307 la~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
|+..||+ +..+++. |.|||.|+||+|||+.|
T Consensus 35 Ia~~ESg-~pgai~~-N~NGT~DvGlMQINS~h 65 (206)
T PRK13888 35 VAEKEGG-KPGQWVR-NTNGTHDVGPMQFNTAY 65 (206)
T ss_pred HHHHhCC-CCCCEEe-CCCCCEEeCCcccccHh
Confidence 8999996 9999995 78999999999999965
No 34
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.46 E-value=0.0019 Score=48.24 Aligned_cols=32 Identities=34% Similarity=0.709 Sum_probs=25.6
Q ss_pred CCCCCCCCeEeeCCCCCceEeecCCCCeecCCCc
Q psy9686 653 LNPCQNNATCVASPGDKQITCLCLKGGTVSSEYC 686 (769)
Q Consensus 653 ~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C 686 (769)
...|+.+++|.++.++ |.|+|.+||.|+|..|
T Consensus 5 ~~~C~~nA~C~~~~~~--~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGS--YTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGGS-TTCEEEE-TTS--EEEEE-CEEECCSTCE
T ss_pred CCCCCCCcEeecCCCC--EEeECCCCCccCCcCC
Confidence 4578999999999886 9999999999998876
No 35
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.35 E-value=0.0054 Score=43.89 Aligned_cols=32 Identities=47% Similarity=1.155 Sum_probs=27.7
Q ss_pred CC-CCCCCCCcEEeeCCCCccceEeeCCCCCcCC-CC
Q psy9686 518 CA-GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQ-SG 552 (769)
Q Consensus 518 C~-s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~-~C 552 (769)
|. ..+|.+++.|++..++ |.|.|+.||.|. .|
T Consensus 2 C~~~~~C~~~~~C~~~~~~---~~C~C~~g~~g~~~C 35 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGS---YRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCCCCEEecCCCC---eEeECCCCCcccCCc
Confidence 44 6789999999998876 999999999998 65
No 36
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.15 E-value=0.0016 Score=65.32 Aligned_cols=145 Identities=22% Similarity=0.518 Sum_probs=80.0
Q ss_pred CCCCCCcEEeeCCCCccceEeeCCCCCcC---CCCcccccchhccccCceeeeeeeecccCCCCCCCCCCCcCCCccccc
Q psy9686 521 RPCRNNGTCTPVSGGVVNFTCTCPSGGQG---QSGLEITPVYLWLMKLKTLVMVQYYFRRKSTSPHKSRGQGQSGLEITP 597 (769)
Q Consensus 521 ~PC~ngGtCvn~~gs~~~y~C~C~~GftG---~~CedideC~~~~~~~~~~~~~~~~~~~Cs~~pC~~~g~C~~~~~C~~ 597 (769)
..|.| |..++.... |.|.|.+||.- .+|+...+|.. -+-...+|++.+
T Consensus 6 T~CKN-G~LiQMSNH---fEC~Cnegfvl~~EntCE~kv~C~~---------------~e~~~K~Cgdya---------- 56 (197)
T PF06247_consen 6 TICKN-GYLIQMSNH---FECKCNEGFVLKNENTCEEKVECDK---------------LENVNKPCGDYA---------- 56 (197)
T ss_dssp ---BT-EEEEEESSE---EEEEESTTEEEEETTEEEE----SG----------------GGTTSEEETTE----------
T ss_pred ccccC-CEEEEccCc---eEEEcCCCcEEccccccccceecCc---------------ccccCccccchh----------
Confidence 35666 566665554 99999999953 34776666621 122344554433
Q ss_pred ccccccccccceeeccccccCCCccccC-----CCCcccccCCCCCCC--ccccccccccccCCCCCCCCeEeeCCCC-C
Q psy9686 598 VYLWLMKMKTLVMVQYYLQQNISPLKDG-----NLSFYCNCTEDFTGE--YCQFENSAACVTLNPCQNNATCVASPGD-K 669 (769)
Q Consensus 598 gy~~~~~~~~C~~~~~~~c~ngg~C~n~-----~gsy~C~C~~Gy~G~--~Ce~~~ideC~~~~pC~n~gtCvnt~Gs-~ 669 (769)
.|++. ...|.|.|.+||.-. .|. .+.|. ...|. .|.|+-.+.. .
T Consensus 57 -----------------------~C~~~~~~~~~~~~~C~C~~gY~~~~~vCv---p~~C~-~~~Cg-~GKCI~d~~~~~ 108 (197)
T PF06247_consen 57 -----------------------KCINQANKGEERAYKCDCINGYILKQGVCV---PNKCN-NKDCG-SGKCILDPDNPN 108 (197)
T ss_dssp -----------------------EEEE-SSTTSSTSEEEEE-TTEEESSSSEE---EGGGS-S---T-TEEEEEEEGGGS
T ss_pred -----------------------hhhcCCCcccceeEEEecccCceeeCCeEc---hhhcC-ceecC-CCeEEecCCCCC
Confidence 34332 356999999999743 554 34665 45576 5788754322 2
Q ss_pred ceEeecCCCCe-ecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686 670 QITCLCLKGGT-VSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGP 726 (769)
Q Consensus 670 ~y~C~C~~Gft-g~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~ 726 (769)
...|+|.-|+. .++..|+..-...| +--|..+-.|..+.+. |+|.|.+|+.|.
T Consensus 109 ~~~CSC~IGkV~~dn~kCtk~G~T~C--~LKCk~nE~CK~~~~~--Y~C~~~~~~~~~ 162 (197)
T PF06247_consen 109 NPTCSCNIGKVPDDNKKCTKTGETKC--SLKCKENEECKLVDGY--YKCVCKEGFPGD 162 (197)
T ss_dssp EEEEEE-TEEETTTTTESEEEE----------TTTEEEEEETTE--EEEEE-TT-EEE
T ss_pred CceeEeeeceEeccCCcccCCCccce--eeecCCCcceeeeCcE--EEeecCCCCCCC
Confidence 36899999987 45567765333456 3557777799999877 999999999643
No 37
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.06 E-value=0.009 Score=43.36 Aligned_cols=32 Identities=44% Similarity=1.083 Sum_probs=27.1
Q ss_pred CCCC-CCCCCCcEEeeCCCCccceEeeCCCCCcC-CCC
Q psy9686 517 GCAG-RPCRNNGTCTPVSGGVVNFTCTCPSGGQG-QSG 552 (769)
Q Consensus 517 eC~s-~PC~ngGtCvn~~gs~~~y~C~C~~GftG-~~C 552 (769)
+|.. .+|.++ +|++..++ |.|.|++||.| ..|
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~---~~C~C~~g~~g~~~C 34 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGS---YTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCcCCCCCC-EEECCCCC---eEeECCCCCccCCcc
Confidence 3555 689998 99998776 99999999999 766
No 38
>PF01464 SLT: Transglycosylase SLT domain; InterPro: IPR008258 Bacterial lytic transglycosylases degrade murein via cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid and N-acetylglucosamine, with the concomitant formation of a 1,6-anhydrobond in the muramic acid residue. There are both soluble (Slt enzymes) and membrane-bound (Mlt enzymes) lytic transglycosylases that differ in size, sequence, activity, specificity and location. The multi-domain structure of the 70 Kd soluble lytic transglycosylase Slt70 is known []. Slt70 has 3 distinct domains, each rich in alpha helices: an N-terminal superhelical U-shaped domain (U-domain; IPR008939 from INTERPRO), a superhelical linker domain (L-domain, IPR012289 from INTERPRO), and a C-terminal catalytic domain (IPR008258 from INTERPRO). Both the U- and L-domain share a similar superhelical structure. These two domains are connected, and together form a closed ring with a large central hole; the catalytic domain is packed on top of, and interacts with, this ring. The catalytic domain has a lysosome-like fold. This entry represents the catalytic domain, which is structurally conserved in some membrane-bound lytic glycosylases and in bacteriophage transglycosylases, even though their sequences can differ considerably proteins []. The most conserved part of this domain is its N-terminal extremity that contains two conserved serines and a glutamate, which have been shown [] to be involved in the catalytic mechanism. This family is distantly related to IPR001916 from INTERPRO.; PDB: 2Y8P_B 3MGW_A 1QTE_A 1SLY_A 1QSA_A 153L_A 154L_A 3BKV_A 3BKH_A 1LSP_A ....
Probab=95.74 E-value=0.0021 Score=59.84 Aligned_cols=64 Identities=30% Similarity=0.274 Sum_probs=45.3
Q ss_pred eeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCCccccccccccccCCChhHHHHHHHHHHhcc
Q psy9686 307 IATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT 377 (769)
Q Consensus 307 la~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~~~~C~~~C~~ll~~di~~~i~Cak~i~~~~ 377 (769)
|++.||+|++.+++.. .+++.++|||||+...|=.-....+ .-.+++ .|....|..+-+++++.
T Consensus 20 i~~~ES~~~~~a~~~~-~~~~~a~GlmQi~~~t~~~~~~~~~-----~~~~~~-~dp~~ni~~g~~~L~~~ 83 (121)
T PF01464_consen 20 IAYQESGFNPNAVSYN-SNGSGAYGLMQINPATWKDYGRKGG-----LSPDDL-FDPEDNIQAGAKYLKWL 83 (121)
T ss_dssp HHHHHHTTCTTTBTTB-ETTTTEETTTTEETTTSHHHSHTHC-----TSHHHH-HHHHHHHHHHHHHHHHH
T ss_pred HHHHhcCCCCCCcCCC-CCCCcceeccccCCcccchhhhccc-----ccchhh-hCHHHHHHHHHHHHHHH
Confidence 8999999999999853 6788899999999988865321100 122333 35566777877777763
No 39
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.73 E-value=0.017 Score=41.26 Aligned_cols=30 Identities=43% Similarity=1.056 Sum_probs=25.9
Q ss_pred CCCCCCCCEEecCCCCCceEEecCCCCCCC-CC
Q psy9686 697 LNPCQNNATCVASPGDKQITCLCLKGFEGP-HR 728 (769)
Q Consensus 697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~-~C 728 (769)
..+|.+++.|++..+. |+|.|++||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~--~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGS--YRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCC--eEeECCCCCcccCCc
Confidence 4578888999998866 999999999998 66
No 40
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.43 E-value=0.022 Score=41.25 Aligned_cols=29 Identities=45% Similarity=1.106 Sum_probs=24.7
Q ss_pred CCCCCCCEEecCCCCCceEEecCCCCCC-CCCC
Q psy9686 698 NPCQNNATCVASPGDKQITCLCLKGFEG-PHRE 729 (769)
Q Consensus 698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG-~~Ce 729 (769)
.+|.++ .|+++.++ |+|.|++||.| ..|+
T Consensus 6 ~~C~~~-~C~~~~~~--~~C~C~~g~~g~~~C~ 35 (35)
T smart00181 6 GPCSNG-TCINTPGS--YTCSCPPGYTGDKRCE 35 (35)
T ss_pred CCCCCC-EEECCCCC--eEeECCCCCccCCccC
Confidence 578888 99998766 99999999999 7763
No 41
>KOG1836|consensus
Probab=95.29 E-value=0.18 Score=65.94 Aligned_cols=197 Identities=21% Similarity=0.416 Sum_probs=98.5
Q ss_pred CCCCCCCCCcEEeeCCCCccceEee-CCCCCcCCCCcccccchh-ccc-cC---ceeeeeeeec--ccCCCCCCCCC-CC
Q psy9686 518 CAGRPCRNNGTCTPVSGGVVNFTCT-CPSGGQGQSGLEITPVYL-WLM-KL---KTLVMVQYYF--RRKSTSPHKSR-GQ 588 (769)
Q Consensus 518 C~s~PC~ngGtCvn~~gs~~~y~C~-C~~GftG~~CedideC~~-~~~-~~---~~~~~~~~~~--~~Cs~~pC~~~-g~ 588 (769)
|.+-||.+++.|..... .....|+ |++||+|.+|+.-++=+. +.. .+ ..|..|+... +.-....|... +.
T Consensus 777 C~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~n~dp~~~g~c~~~tg~ 855 (1705)
T KOG1836|consen 777 CQPCPCPNGGACGQTPE-ILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNFNVDPNAFGNCNRLTGE 855 (1705)
T ss_pred CccCCCCCChhhcCcCc-ccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceeccccCccccccccccccc
Confidence 88888999999988764 2457899 999999999954222111 111 11 2333222222 21122223221 22
Q ss_pred c---------CCCccccccccccccc----ccceeeccccc---cCCCccccCCCCcccccCCCCCCCccccc-----cc
Q psy9686 589 G---------QSGLEITPVYLWLMKM----KTLVMVQYYLQ---QNISPLKDGNLSFYCNCTEDFTGEYCQFE-----NS 647 (769)
Q Consensus 589 C---------~~~~~C~~gy~~~~~~----~~C~~~~~~~c---~ngg~C~n~~gsy~C~C~~Gy~G~~Ce~~-----~i 647 (769)
| ..+..|.++|.+..-. ..|..++-..- ....+|..-.| .|.|.+.-.|..|..- ++
T Consensus 856 c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tG--Qcec~~~v~g~~c~~c~~g~fnl 933 (1705)
T KOG1836|consen 856 CLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTG--QCECKPNVEGRDCLYCFKGFFNL 933 (1705)
T ss_pred eeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCCccc--ceeccCCCCcccccccccccccc
Confidence 3 2233677887764322 33443332110 00122433322 5556555555544320 00
Q ss_pred c---ccccCCCCCCC----CeEeeCCCCCceEeecCCCCeecCCCcccc-------CCccccCCCCCCCCC----EEecC
Q psy9686 648 A---ACVTLNPCQNN----ATCVASPGDKQITCLCLKGGTVSSEYCQFE-------NSAACVTLNPCQNNA----TCVAS 709 (769)
Q Consensus 648 d---eC~~~~pC~n~----gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~-------~ideC~s~~pC~ngg----~Cvnt 709 (769)
+ .|. +..|..- ..|....| .|.|.+|-+ |..|..- ....|. .-.|...| +|...
T Consensus 934 ~s~~gC~-~c~c~~~gs~~~~c~~~tG----qc~c~~gVt--gqrc~qc~~~~~~~~~~gc~-~c~c~~~Gs~~~qc~~~ 1005 (1705)
T KOG1836|consen 934 NSGVGCE-PCNCDPTGSESSDCDVGTG----QCYCRPGVT--GQRCDQCETYHFGFQTEGCG-LCECDPLGSRGFQCDPE 1005 (1705)
T ss_pred CCCCCcc-cccccccccccccccccCC----ceeeecCcc--ccccCccccCcccccccCCc-ceecccCCcccceeccc
Confidence 1 222 2233322 25655544 488888887 6555420 001121 11244433 67765
Q ss_pred CCCCceEEecCCCCCCCCCC
Q psy9686 710 PGDKQITCLCLKGFEGPHRE 729 (769)
Q Consensus 710 ~Gs~~y~C~C~~GytG~~Ce 729 (769)
.| +|.|++|+.|.+|.
T Consensus 1006 ~G----~c~c~~~~~g~~c~ 1021 (1705)
T KOG1836|consen 1006 DG----QCPCRPGFEGRRCD 1021 (1705)
T ss_pred CC----eeeecCCCCCcccc
Confidence 44 89999999998874
No 42
>PRK13864 type IV secretion system lytic transglycosylase VirB1; Provisional
Probab=95.08 E-value=0.007 Score=63.76 Aligned_cols=33 Identities=42% Similarity=0.473 Sum_probs=28.7
Q ss_pred eeccccCCCcccccCCCCCC-----------------------Cceeeeeeeccccc
Q psy9686 307 IATKESNRNSNARSPKNGNG-----------------------SRDHGIFQINDKYW 340 (769)
Q Consensus 307 la~~eS~~~t~a~~~~~~~g-----------------------s~~yGifQIn~~~W 340 (769)
||..||+||+-||+. |.+| +.|.||.|||+++.
T Consensus 49 Ia~vESg~nP~AI~~-N~~G~~l~~qs~~EAi~~a~~l~~~g~s~DvGLmQINs~nl 104 (245)
T PRK13864 49 IAKVESRFDPLAVHD-NTTGETLHWQNHAQATQSVRHRLEARHSLDVGLMQINSKNF 104 (245)
T ss_pred HHHHhccCCCcceee-cCCCCccCCCCHHHHHHHHHHHHhcCCceeeechhccHHHH
Confidence 999999999999984 5566 79999999999663
No 43
>cd00254 LT_GEWL Lytic Transglycosylase (LT) and Goose Egg White Lysozyme (GEWL) domain. Members include the soluble and insoluble membrane-bound LTs in bacteria, the LTs in bacteriophage lambda, as well as, the eukaryotic "goose-type" lysozymes (GEWL). LTs catalyze the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetyl-D-glucosamine (GlcNAc), as do "goose-type" lysozymes. However, in addition to this, they also make a new glycosidic bond with the C6 hydroxyl group of the same muramic acid residue.
Probab=94.86 E-value=0.01 Score=53.82 Aligned_cols=61 Identities=18% Similarity=0.151 Sum_probs=43.2
Q ss_pred eeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCCccccccccccccCCChhHHHHHHHHHHhcc
Q psy9686 307 IATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT 377 (769)
Q Consensus 307 la~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~~~~C~~~C~~ll~~di~~~i~Cak~i~~~~ 377 (769)
|++.||+|+..+++ +++.++|||||+...|-.-..... . ....--|..+.|.-|-++++..
T Consensus 7 i~~~ES~~~~~a~~----~~~~a~GlmQi~~~~~~~~~~~~~----~--~~~~~~dp~~ni~~~~~~l~~~ 67 (113)
T cd00254 7 IIRQESGFNPNAVN----SGAGAVGLMQFMPSTARAYGVDGG----D--GADDLFDPEDNIRAGARYLKEL 67 (113)
T ss_pred HHHHHCCCCCCCcc----CCCCCeeeeeeCchHHHHHHHhcC----C--ChhhcCCHHHHHHHHHHHHHHH
Confidence 78899999999984 468899999999999975332110 0 0112246777888888887764
No 44
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.85 E-value=0.02 Score=39.02 Aligned_cols=19 Identities=47% Similarity=1.001 Sum_probs=16.5
Q ss_pred eEeeCCCCCc----CCCCccccc
Q psy9686 539 FTCTCPSGGQ----GQSGLEITP 557 (769)
Q Consensus 539 y~C~C~~Gft----G~~Cedide 557 (769)
|+|.|++||. |.+|+||||
T Consensus 2 y~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 2 YTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred EEeeCCCCCcCCCCCCccccCCC
Confidence 9999999995 667999886
No 45
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.84 E-value=0.022 Score=38.77 Aligned_cols=19 Identities=32% Similarity=0.796 Sum_probs=16.6
Q ss_pred eEEecCCCCC----CCCCCCCCCC
Q psy9686 715 ITCLCLKGFE----GPHRELPVES 734 (769)
Q Consensus 715 y~C~C~~Gyt----G~~Ce~dide 734 (769)
|+|.|++||. |..|+ ||||
T Consensus 2 y~C~C~~Gy~l~~d~~~C~-DIdE 24 (24)
T PF12662_consen 2 YTCSCPPGYQLSPDGRSCE-DIDE 24 (24)
T ss_pred EEeeCCCCCcCCCCCCccc-cCCC
Confidence 9999999996 78895 8886
No 46
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=94.20 E-value=0.013 Score=59.03 Aligned_cols=129 Identities=21% Similarity=0.426 Sum_probs=79.8
Q ss_pred cccCCCCcccccCCCCC---CCcccccccccccc----CCCCCCCCeEeeCCC---CCceEeecCCCCeecCCCccccCC
Q psy9686 622 LKDGNLSFYCNCTEDFT---GEYCQFENSAACVT----LNPCQNNATCVASPG---DKQITCLCLKGGTVSSEYCQFENS 691 (769)
Q Consensus 622 C~n~~gsy~C~C~~Gy~---G~~Ce~~~ideC~~----~~pC~n~gtCvnt~G---s~~y~C~C~~Gftg~G~~C~~~~i 691 (769)
.+...+-|.|.|.+||. -.+|+. ..+|.. ..+|..-+.|++... ...|.|.|.+||+.....|..
T Consensus 13 LiQMSNHfEC~Cnegfvl~~EntCE~--kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp--- 87 (197)
T PF06247_consen 13 LIQMSNHFECKCNEGFVLKNENTCEE--KVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVP--- 87 (197)
T ss_dssp EEEESSEEEEEESTTEEEEETTEEEE------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEE---
T ss_pred EEEccCceEEEcCCCcEEcccccccc--ceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEch---
Confidence 33445678999999996 457884 346752 346988999998763 246999999999976667763
Q ss_pred ccccCCCCCCCCCEEecCCC-CCceEEecCCCCC---CCCCCCCCC-CCCCCCCCCCceeeCCCCccccCC
Q psy9686 692 AACVTLNPCQNNATCVASPG-DKQITCLCLKGFE---GPHRELPVE-SVDEPPSEDETSVDLQLGSQANSY 757 (769)
Q Consensus 692 deC~s~~pC~ngg~Cvnt~G-s~~y~C~C~~Gyt---G~~Ce~did-eC~~npC~~GtC~n~~ggy~C~~~ 757 (769)
+.|. ...|. .|.|+-.+. .+...|.|.-|+. ...|..+-+ .|+.-+-.+..|+...+=|+|..-
T Consensus 88 ~~C~-~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~ 156 (197)
T PF06247_consen 88 NKCN-NKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCK 156 (197)
T ss_dssp GGGS-S---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-
T ss_pred hhcC-ceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecC
Confidence 4785 66787 458975432 2346999999986 444654322 255554467899999999999765
No 47
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=94.07 E-value=0.025 Score=42.26 Aligned_cols=29 Identities=41% Similarity=0.863 Sum_probs=22.2
Q ss_pred CCCCCCCCEEecCCCCCceEEecCCCCCCCC
Q psy9686 697 LNPCQNNATCVASPGDKQITCLCLKGFEGPH 727 (769)
Q Consensus 697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~ 727 (769)
...|+.+|+|+++.++ |.|.|++||.|.-
T Consensus 5 ~~~C~~nA~C~~~~~~--~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGS--YTCTCKPGYEGDG 33 (36)
T ss_dssp GGGS-TTCEEEE-TTS--EEEEE-CEEECCS
T ss_pred CCCCCCCcEeecCCCC--EEeECCCCCccCC
Confidence 3568999999999886 9999999998763
No 48
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.61 E-value=0.088 Score=38.37 Aligned_cols=27 Identities=37% Similarity=0.876 Sum_probs=22.8
Q ss_pred CCCCCCCEEecCCCCCceEEecCCCCCCCCC
Q psy9686 698 NPCQNNATCVASPGDKQITCLCLKGFEGPHR 728 (769)
Q Consensus 698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C 728 (769)
..|.++|+|+... .+|.|.+||+|..|
T Consensus 6 ~~C~~~G~C~~~~----g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPC----GRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCC----CEEECCCCCcCCCC
Confidence 3588999999763 48999999999987
No 49
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.39 E-value=0.11 Score=37.89 Aligned_cols=27 Identities=33% Similarity=0.698 Sum_probs=22.8
Q ss_pred CCCCCCcEEeeCCCCccceEeeCCCCCcCCCC
Q psy9686 521 RPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSG 552 (769)
Q Consensus 521 ~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~C 552 (769)
..|.++|+|+... ..|.|++||+|..|
T Consensus 6 ~~C~~~G~C~~~~-----g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPC-----GRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCC-----CEEECCCCCcCCCC
Confidence 4699999999752 48999999999876
No 50
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.20 E-value=0.038 Score=32.24 Aligned_cols=13 Identities=46% Similarity=1.135 Sum_probs=10.9
Q ss_pred EEecCCCCCCCCC
Q psy9686 716 TCLCLKGFEGPHR 728 (769)
Q Consensus 716 ~C~C~~GytG~~C 728 (769)
.|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999987
No 51
>PF06737 Transglycosylas: Transglycosylase-like domain; InterPro: IPR010618 This family of proteins is very likely to act as transglycosylase enzymes related to IPR001916 from INTERPRO and IPR008258 from INTERPRO. These other families are weakly matched by this family, and include the known active site residues.; GO: 0005576 extracellular region; PDB: 1XSF_A 3EO5_A.
Probab=92.59 E-value=0.024 Score=49.45 Aligned_cols=71 Identities=24% Similarity=0.419 Sum_probs=47.8
Q ss_pred cccceEeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCC--ccccccccccccCCChhHHHHHHHHHHhccc
Q psy9686 301 VPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPA--GKECHAKCSSFEDNNITDDVACVVKIHSQTQ 378 (769)
Q Consensus 301 l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~--~~~C~~~C~~ll~~di~~~i~Cak~i~~~~~ 378 (769)
-.+|--||+-||+-|- ++|. |+..||+||+...-|=..++.. .+ -++ .--..-|.=|++|+..
T Consensus 4 ~~~Wd~ia~cESgGn~-~~nt----gnG~yGg~Qf~~sTW~a~Gg~~yap~------~~~--As~~eQi~~A~~l~~~-- 68 (77)
T PF06737_consen 4 ASDWDAIAQCESGGNW-AINT----GNGYYGGLQFSQSTWRAYGGSGYAPR------PDQ--ASRAEQIAVAEKLYAR-- 68 (77)
T ss_dssp HHHHHHHHHHHCTT-T-T-------SSSBBTTTTBBHHHHHHTTGGGT-SS------CCC--S-HHHHHHHHHHHHHH--
T ss_pred hHHHHHHHHHHcCCCC-ccCC----CCCccceeccCHHHHHHhCCCcCCCC------hhh--CCHHHHHHHHHHHHHH--
Confidence 3578889999999999 5542 5668999999999997654321 00 011 1234557788999998
Q ss_pred ccccCccccccc
Q psy9686 379 RARGNGFQAWST 390 (769)
Q Consensus 379 ~~~~~G~~aW~~ 390 (769)
+|+.+|+.
T Consensus 69 ----qG~gaWp~ 76 (77)
T PF06737_consen 69 ----QGWGAWPV 76 (77)
T ss_dssp ----HSSTTSTC
T ss_pred ----cCCCcCCC
Confidence 79999984
No 52
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=91.67 E-value=0.075 Score=31.04 Aligned_cols=13 Identities=46% Similarity=1.060 Sum_probs=10.7
Q ss_pred EeeCCCCCcCCCC
Q psy9686 540 TCTCPSGGQGQSG 552 (769)
Q Consensus 540 ~C~C~~GftG~~C 552 (769)
.|.|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4999999999987
No 53
>cd01021 GEWL Goose Egg White Lysozyme domain. Eukaryotic "goose-type" lysozymes (GEWL). These enzymes catalyze the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetylglucosamine (GlcNAc). Members include tunicate, Japanese flounder, ostrich, and mouse.
Probab=90.62 E-value=0.13 Score=51.53 Aligned_cols=49 Identities=16% Similarity=0.284 Sum_probs=34.7
Q ss_pred HHHHHHhcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 287 AKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 287 a~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
.++-.++++++-.-|.- |+..||+|++.|++. +.+...++||+||+...
T Consensus 35 i~~aa~~y~idp~Li~A---ii~~ES~f~~~a~~~-~~s~~ga~GLMQi~p~t 83 (166)
T cd01021 35 IKKVGQKKCIDPAVIAA---IISRESRAGKVLKNG-WGGAGNGFGLMQVDKRS 83 (166)
T ss_pred HHHHHHHhCcCHHHHHH---HHHHHcCCCcccccC-CCCCCCCeEEeeECccc
Confidence 33333345555443333 889999999999863 45677899999999864
No 54
>smart00051 DSL delta serrate ligand.
Probab=89.98 E-value=0.39 Score=40.48 Aligned_cols=47 Identities=28% Similarity=0.441 Sum_probs=35.5
Q ss_pred eEeecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCC
Q psy9686 671 ITCLCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHR 728 (769)
Q Consensus 671 y~C~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C 728 (769)
+.-.|.++|. |..|.. .|...+.+..+.+|.. .| .|.|.+||+|.+|
T Consensus 17 ~rv~C~~~~y--G~~C~~----~C~~~~d~~~~~~Cd~-~G----~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYY--GEGCNK----FCRPRDDFFGHYTCDE-NG----NKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCc--CCccCC----EeCcCccccCCccCCc-CC----CEecCCCCcCCCC
Confidence 5567999998 888873 5653455677788864 34 6999999999987
No 55
>KOG1836|consensus
Probab=89.65 E-value=0.64 Score=61.06 Aligned_cols=56 Identities=25% Similarity=0.498 Sum_probs=37.7
Q ss_pred ecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEe-cCCCCCCCCCCCCCC
Q psy9686 674 LCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCL-CLKGFEGPHRELPVE 733 (769)
Q Consensus 674 ~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~-C~~GytG~~Ce~did 733 (769)
+|..||.++.. .- ..+.|. +-+|.+++.|..+.......|. |++||+|.+|+...+
T Consensus 760 ~C~~GfYg~~~--~~-~~~dC~-~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 760 QCVDGFYGLPD--LG-TSGDCQ-PCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred hhcCCCCCccc--cC-CCCCCc-cCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence 57777763211 11 112264 6778888888877644557998 999999999986333
No 56
>KOG1218|consensus
Probab=87.26 E-value=5.1 Score=43.22 Aligned_cols=80 Identities=28% Similarity=0.609 Sum_probs=44.8
Q ss_pred ccccCCCCcccccCCCCCCCccccc--cccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCccccCCccccCCC
Q psy9686 621 PLKDGNLSFYCNCTEDFTGEYCQFE--NSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFENSAACVTLN 698 (769)
Q Consensus 621 ~C~n~~gsy~C~C~~Gy~G~~Ce~~--~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~~ideC~s~~ 698 (769)
+|.+... .|.+..++.+..|... ....|. ..|.+...+.... -.|.|.+||. |..|... ...|....
T Consensus 118 ~C~~~~~--~c~~~~~~~~~~C~~~~~~g~~C~--~~c~~~~~~~~~~----~~c~c~~g~~--g~~~~~~-~~~c~~~~ 186 (316)
T KOG1218|consen 118 TCANPRR--ECRCGGGYIGEQCGEENLVGLKCQ--RDCQCTGGCDCKN----GICTCQPGFV--GVFCVES-CSGCSPLT 186 (316)
T ss_pred ccCCCcc--ceecCCcCccccccccCCCCCCcc--CCCCCccccCCCC----CceeccCCcc--ccccccc-CCCcCCCc
Confidence 4544432 5677777777777630 012232 2232222332222 2589999999 7777742 22265456
Q ss_pred CCCCCCEEecCCC
Q psy9686 699 PCQNNATCVASPG 711 (769)
Q Consensus 699 pC~ngg~Cvnt~G 711 (769)
.|.+++.|....+
T Consensus 187 ~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 187 ACENGAKCNRSTG 199 (316)
T ss_pred ccCCCCeeecccc
Confidence 7888888887655
No 57
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=85.88 E-value=0.26 Score=50.57 Aligned_cols=18 Identities=33% Similarity=0.588 Sum_probs=16.6
Q ss_pred ecCCCCccceEeeeCCcc
Q psy9686 11 LDCRGSRDHGIFQINDKY 28 (769)
Q Consensus 11 ~~~~gs~~yGiFQIn~~~ 28 (769)
.|.|||.|+||.|||+.|
T Consensus 48 ~N~NGT~DvGlMQINS~h 65 (206)
T PRK13888 48 RNTNGTHDVGPMQFNTAY 65 (206)
T ss_pred eCCCCCEEeCCcccccHh
Confidence 589999999999999974
No 58
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=85.57 E-value=0.75 Score=43.82 Aligned_cols=32 Identities=19% Similarity=0.545 Sum_probs=26.5
Q ss_pred CCCCCCCEEecCCCCCceEEecCCCCCCCCCCC
Q psy9686 698 NPCQNNATCVASPGDKQITCLCLKGFEGPHREL 730 (769)
Q Consensus 698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~ 730 (769)
+-|.|| +|.-...-+.+.|.|..||+|.+||.
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccccccc
Confidence 458886 89877665668999999999999984
No 59
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=85.38 E-value=0.88 Score=34.11 Aligned_cols=20 Identities=35% Similarity=0.850 Sum_probs=16.6
Q ss_pred CeEeeCCCCCceEeecCCCCee
Q psy9686 660 ATCVASPGDKQITCLCLKGGTV 681 (769)
Q Consensus 660 gtCvnt~Gs~~y~C~C~~Gftg 681 (769)
..|++.+++ |+|.|++||..
T Consensus 10 h~C~~~~g~--~~C~C~~Gy~L 29 (36)
T PF14670_consen 10 HICVNTPGS--YRCSCPPGYKL 29 (36)
T ss_dssp SEEEEETTS--EEEE-STTEEE
T ss_pred CCCccCCCc--eEeECCCCCEE
Confidence 389999877 99999999985
No 60
>PHA02887 EGF-like protein; Provisional
Probab=84.28 E-value=0.93 Score=42.55 Aligned_cols=32 Identities=25% Similarity=0.568 Sum_probs=26.2
Q ss_pred CCCCCCCEEecCCCCCceEEecCCCCCCCCCCC
Q psy9686 698 NPCQNNATCVASPGDKQITCLCLKGFEGPHREL 730 (769)
Q Consensus 698 ~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~Ce~ 730 (769)
+-|.| |+|.-........|.|.+||+|.+|+.
T Consensus 92 ~YCiH-G~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 92 DFCIN-GECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred CEeeC-CEEEccccCCCceeECCCCcccCCCCc
Confidence 45775 599877666668999999999999974
No 61
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=83.39 E-value=0.91 Score=34.04 Aligned_cols=23 Identities=30% Similarity=0.947 Sum_probs=18.4
Q ss_pred CCCCCcEEeeCCCCccceEeeCCCCCcC
Q psy9686 522 PCRNNGTCTPVSGGVVNFTCTCPSGGQG 549 (769)
Q Consensus 522 PC~ngGtCvn~~gs~~~y~C~C~~GftG 549 (769)
.|++ +|++++++ |+|.|++||..
T Consensus 7 gC~h--~C~~~~g~---~~C~C~~Gy~L 29 (36)
T PF14670_consen 7 GCSH--ICVNTPGS---YRCSCPPGYKL 29 (36)
T ss_dssp GSSS--EEEEETTS---EEEE-STTEEE
T ss_pred CcCC--CCccCCCc---eEeECCCCCEE
Confidence 3665 99999987 99999999963
No 62
>smart00051 DSL delta serrate ligand.
Probab=81.39 E-value=1.6 Score=36.80 Aligned_cols=47 Identities=30% Similarity=0.622 Sum_probs=33.5
Q ss_pred cccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCc
Q psy9686 629 FYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYC 686 (769)
Q Consensus 629 y~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C 686 (769)
+.-.|.++|.|..|+. .|...+.+..+.+|.. .|. |+|++||+ |..|
T Consensus 17 ~rv~C~~~~yG~~C~~----~C~~~~d~~~~~~Cd~-~G~----~~C~~Gw~--G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNK----FCRPRDDFFGHYTCDE-NGN----KGCLEGWM--GPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCC----EeCcCccccCCccCCc-CCC----EecCCCCc--CCCC
Confidence 3446999999999973 4753344666778854 354 89999999 6655
No 63
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=81.14 E-value=1.5 Score=45.55 Aligned_cols=34 Identities=24% Similarity=0.538 Sum_probs=26.7
Q ss_pred CccccCCCCCC--CCCCCcEEeeCCCCccceEeeCCCCCcC
Q psy9686 511 GAAALGGCAGR--PCRNNGTCTPVSGGVVNFTCTCPSGGQG 549 (769)
Q Consensus 511 gc~dideC~s~--PC~ngGtCvn~~gs~~~y~C~C~~GftG 549 (769)
.|.++++|... +|. ..|.+++++ |.|.|++||+.
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~---~~c~c~~g~~~ 218 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGS---YLCACTEGYAL 218 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCC---EEeECCCCccC
Confidence 35567888544 465 489999998 99999999975
No 64
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.79 E-value=1.4 Score=42.10 Aligned_cols=40 Identities=18% Similarity=0.471 Sum_probs=29.9
Q ss_pred cCCCC---CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCcccc
Q psy9686 515 LGGCA---GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEIT 556 (769)
Q Consensus 515 ideC~---s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~Cedid 556 (769)
+.+|. .+-|-|| +|.-.+. ...+.|.|+.||+|.+|+..+
T Consensus 42 i~~Cp~ey~~YClHG-~C~yI~d-l~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 42 IRLCGPEGDGYCLHG-DCIHARD-IDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred cccCChhhCCEeECC-EEEeecc-CCCceeECCCCccccccccee
Confidence 45563 3458885 8987664 345899999999999998765
No 65
>PRK15470 emtA lytic murein endotransglycosylase E; Provisional
Probab=80.78 E-value=0.63 Score=48.09 Aligned_cols=38 Identities=29% Similarity=0.355 Sum_probs=28.5
Q ss_pred cCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 294 RGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 294 ~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
+|++-.-|.- |++.||+||+.|+++ +..+||+||....
T Consensus 50 y~iDp~Ll~A---ii~qES~Fnp~A~S~-----aGA~GLMQImP~T 87 (203)
T PRK15470 50 WGVDPQLITA---IIAIESGGNPNAVSK-----SNAIGLMQLKAST 87 (203)
T ss_pred cCCCHHHHHH---HHHHHcCCCCCCcCC-----CCCeeeccCCHHH
Confidence 4555433332 889999999999974 5699999998755
No 66
>KOG3516|consensus
Probab=80.57 E-value=1.1 Score=55.88 Aligned_cols=40 Identities=25% Similarity=0.684 Sum_probs=35.0
Q ss_pred ccccCCCCCCCCCCCcEEeeCCCCccceEeeCC-CCCcCCCCcc
Q psy9686 512 AAALGGCAGRPCRNNGTCTPVSGGVVNFTCTCP-SGGQGQSGLE 554 (769)
Q Consensus 512 c~dideC~s~PC~ngGtCvn~~gs~~~y~C~C~-~GftG~~Ced 554 (769)
|.-++.|.++||+++|.|..+... |.|.|. .||.|.+|+.
T Consensus 542 C~i~drClPN~CehgG~C~Qs~~~---f~C~C~~TGY~GatCHt 582 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQSWDD---FECNCELTGYKGATCHT 582 (1306)
T ss_pred cccccccCCccccCCCcccccccc---eeEeccccccccccccC
Confidence 556788999999999999996665 999999 9999999953
No 67
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=79.55 E-value=1.3 Score=33.43 Aligned_cols=32 Identities=31% Similarity=0.588 Sum_probs=22.4
Q ss_pred CCCCCCCCeEeeCC-CCCceEeecCCCCeecCCCc
Q psy9686 653 LNPCQNNATCVASP-GDKQITCLCLKGGTVSSEYC 686 (769)
Q Consensus 653 ~~pC~n~gtCvnt~-Gs~~y~C~C~~Gftg~G~~C 686 (769)
...|..++.|++.. |+ +.|+|..||..+|..|
T Consensus 4 ~~~cP~NA~C~~~~dG~--eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGS--EECRCLLGYKKVGGKC 36 (37)
T ss_dssp SS---TTEEEEEETTSE--EEEEE-TTEEEETTEE
T ss_pred CccCCCCcccEEcCCCC--EEEEeeCCccccCCCc
Confidence 45688899999887 54 9999999998666555
No 68
>KOG3516|consensus
Probab=79.52 E-value=1.4 Score=55.03 Aligned_cols=45 Identities=29% Similarity=0.700 Sum_probs=37.2
Q ss_pred CccccCCccccCCCCCCCCCEEecCCCCCceEEecC-CCCCCCCCCCCCCC
Q psy9686 685 YCQFENSAACVTLNPCQNNATCVASPGDKQITCLCL-KGFEGPHRELPVES 734 (769)
Q Consensus 685 ~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~-~GytG~~Ce~dide 734 (769)
.|. .+|.|. +++|+++|.|...... |.|.|. .||.|..|+..+.|
T Consensus 541 ~C~--i~drCl-PN~CehgG~C~Qs~~~--f~C~C~~TGY~GatCHtsi~e 586 (1306)
T KOG3516|consen 541 MCG--ISDRCL-PNPCEHGGKCSQSWDD--FECNCELTGYKGATCHTSIYE 586 (1306)
T ss_pred ccc--cccccC-CccccCCCcccccccc--eeEeccccccccccccCCCcc
Confidence 455 467886 9999999999986555 999999 89999999976654
No 69
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=78.26 E-value=2.3 Score=44.14 Aligned_cols=38 Identities=29% Similarity=0.794 Sum_probs=28.1
Q ss_pred CCCccccCCccccC-CCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686 683 SEYCQFENSAACVT-LNPCQNNATCVASPGDKQITCLCLKGFEGP 726 (769)
Q Consensus 683 G~~C~~~~ideC~s-~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~ 726 (769)
+..|. +.++|.. .++|.+ .|.++.|+ |.|.|++||+..
T Consensus 181 ~~~C~--~~~~C~~~~~~c~~--~C~~~~g~--~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICV--VPDLCATLSHVCQQ--VCISTPGS--YLCACTEGYALL 219 (224)
T ss_pred cccCc--CchhhcCCCCCccc--eEEcCCCC--EEeECCCCccCC
Confidence 45565 5567742 356765 79999988 999999999753
No 70
>PHA02887 EGF-like protein; Provisional
Probab=77.08 E-value=2.2 Score=40.18 Aligned_cols=39 Identities=26% Similarity=0.531 Sum_probs=28.8
Q ss_pred cCCCC---CCCCCCCcEEeeCCCCccceEeeCCCCCcCCCCccc
Q psy9686 515 LGGCA---GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQSGLEI 555 (769)
Q Consensus 515 ideC~---s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~~Cedi 555 (769)
..+|. .+-|-+ |+|.-.+. ...+.|.|++||+|.+|+.+
T Consensus 83 f~pC~~eyk~YCiH-G~C~yI~d-L~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 83 FEKCKNDFNDFCIN-GECMNIID-LDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred ccccChHhhCEeeC-CEEEcccc-CCCceeECCCCcccCCCCcc
Confidence 34664 234874 69987655 35689999999999999764
No 71
>PRK11671 mltC murein transglycosylase C; Provisional
Probab=72.30 E-value=1.6 Score=48.92 Aligned_cols=46 Identities=24% Similarity=0.268 Sum_probs=31.8
Q ss_pred HHHHHHHhcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 286 LAKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 286 la~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
++++-.+++|++-.-|.- |++.||+||+.|+.+ +..+||+||...-
T Consensus 195 lI~~aa~~ygvdp~LI~A---Ii~qES~FnP~AvS~-----agA~GLMQImP~T 240 (359)
T PRK11671 195 MVRKASRKYGVDESLILA---IMQTESSFNPYAVSR-----SDALGLMQVVQHT 240 (359)
T ss_pred HHHHHHHHcCCCHHHHHH---HHHHhcCCCCCCccC-----CCCceeeeeCHhh
Confidence 334443345666533322 788999999999975 4589999998854
No 72
>KOG1218|consensus
Probab=71.27 E-value=22 Score=38.30 Aligned_cols=107 Identities=20% Similarity=0.382 Sum_probs=57.9
Q ss_pred CCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCcccc--CCccccCCCCCCCCCEEecCCC
Q psy9686 634 TEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYCQFE--NSAACVTLNPCQNNATCVASPG 711 (769)
Q Consensus 634 ~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C~~~--~ideC~s~~pC~ngg~Cvnt~G 711 (769)
..+|.|..|+. ..+|. ..|.. .+|.+... .|.|..+|. +..|... ....|. ..|.+...+....
T Consensus 96 ~~~~~g~~C~~--~~~~~--~~c~~-~~C~~~~~----~c~~~~~~~--~~~C~~~~~~g~~C~--~~c~~~~~~~~~~- 161 (316)
T KOG1218|consen 96 LNGYEGPQCES--PCPCG--DGCAE-KTCANPRR----ECRCGGGYI--GEQCGEENLVGLKCQ--RDCQCTGGCDCKN- 161 (316)
T ss_pred CCCCCcccccC--CCCcC--Ccccc-cccCCCcc----ceecCCcCc--cccccccCCCCCCcc--CCCCCccccCCCC-
Confidence 68889998883 34443 11433 45555421 367777766 5555520 122342 2232222232222
Q ss_pred CCceEEecCCCCCCCCCCCCCCCCCC-CCC-CCCceeeCCCCccccCC
Q psy9686 712 DKQITCLCLKGFEGPHRELPVESVDE-PPS-EDETSVDLQLGSQANSY 757 (769)
Q Consensus 712 s~~y~C~C~~GytG~~Ce~dideC~~-npC-~~GtC~n~~ggy~C~~~ 757 (769)
-.|.|.+||.|.+|......|.. ..+ +++.|....+.=.+.+.
T Consensus 162 ---~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~~~~~~~~ 206 (316)
T KOG1218|consen 162 ---GICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTGSCLCYPG 206 (316)
T ss_pred ---CceeccCCcccccccccCCCcCCCcccCCCCeeeccccccccCCC
Confidence 37899999999999754433554 334 34477666665555444
No 73
>KOG3512|consensus
Probab=69.44 E-value=7.6 Score=44.51 Aligned_cols=53 Identities=25% Similarity=0.503 Sum_probs=29.2
Q ss_pred eEe-ecCCCCeecCCCccccCCccccCCCCCCC----CCEEecCCCCCceEEecCCCCCCCCCC
Q psy9686 671 ITC-LCLKGGTVSSEYCQFENSAACVTLNPCQN----NATCVASPGDKQITCLCLKGFEGPHRE 729 (769)
Q Consensus 671 y~C-~C~~Gftg~G~~C~~~~ideC~s~~pC~n----gg~Cvnt~Gs~~y~C~C~~GytG~~Ce 729 (769)
-+| .|.+||.-++..=.. +...|. .-.|+. |-+|..+.| +|.|++|-+|.+|.
T Consensus 371 rhChyCreGyyRd~s~pl~-hrkaCk-~CdChpVGs~gktCNq~tG----qCpCkeGvtG~tCn 428 (592)
T KOG3512|consen 371 RHCHYCREGYYRDGSKPLT-HRKACK-ACDCHPVGSAGKTCNQTTG----QCPCKEGVTGLTCN 428 (592)
T ss_pred cccccccCccccCCCCCCc-hhhhhh-hcCCcccccccccccccCC----cccCCCCCcccccc
Confidence 356 388888744321110 223342 233442 335665544 78888888888875
No 74
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=66.35 E-value=12 Score=29.75 Aligned_cols=22 Identities=27% Similarity=0.777 Sum_probs=16.7
Q ss_pred CCCCCCCCEEecCCCCCceEEecCCCCC
Q psy9686 697 LNPCQNNATCVASPGDKQITCLCLKGFE 724 (769)
Q Consensus 697 ~~pC~ngg~Cvnt~Gs~~y~C~C~~Gyt 724 (769)
...|..++.|++. +|.|++||.
T Consensus 25 ~~qC~~~s~C~~g------~C~C~~g~~ 46 (52)
T PF01683_consen 25 DEQCIGGSVCVNG------RCQCPPGYV 46 (52)
T ss_pred cCCCCCcCEEcCC------EeECCCCCE
Confidence 4456677889653 899999985
No 75
>KOG3514|consensus
Probab=64.97 E-value=3.8 Score=50.83 Aligned_cols=34 Identities=32% Similarity=0.820 Sum_probs=31.0
Q ss_pred CCCCCCCCCCcEEeeCCCCccceEeeCC-CCCcCCCCc
Q psy9686 517 GCAGRPCRNNGTCTPVSGGVVNFTCTCP-SGGQGQSGL 553 (769)
Q Consensus 517 eC~s~PC~ngGtCvn~~gs~~~y~C~C~-~GftG~~Ce 553 (769)
.|.++||+|+|+|...... |.|.|. .||.|+.|+
T Consensus 625 ~C~~nPC~N~g~C~egwNr---fiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNR---FICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCccccccc---cccccccCcccCcccc
Confidence 6999999999999998877 999996 789999995
No 76
>PRK15328 invasion protein IagB; Provisional
Probab=63.66 E-value=1.9 Score=43.01 Aligned_cols=19 Identities=37% Similarity=0.419 Sum_probs=16.8
Q ss_pred cCCCCccceEeeeCCcccC
Q psy9686 12 DCRGSRDHGIFQINDKYWC 30 (769)
Q Consensus 12 ~~~gs~~yGiFQIn~~~wC 30 (769)
|.+|+.|+||+|||+.+|.
T Consensus 54 n~~gs~d~GLMQIn~~~~~ 72 (160)
T PRK15328 54 NRDGSTDLGLMQINSFHMK 72 (160)
T ss_pred CCCCCcccChhhcCHHHHH
Confidence 5689999999999998874
No 77
>KOG3512|consensus
Probab=61.44 E-value=40 Score=38.98 Aligned_cols=25 Identities=20% Similarity=0.468 Sum_probs=19.2
Q ss_pred cEEeeCCCCccceEeeCCCCCcCCCCc
Q psy9686 527 GTCTPVSGGVVNFTCTCPSGGQGQSGL 553 (769)
Q Consensus 527 GtCvn~~gs~~~y~C~C~~GftG~~Ce 553 (769)
..|+-...+ .++|.|.-+-+|..|+
T Consensus 285 s~Cv~d~~~--~ltCdC~HNTaGPdCg 309 (592)
T KOG3512|consen 285 SRCVMDESS--HLTCDCEHNTAGPDCG 309 (592)
T ss_pred ceeeeccCC--ceEEecccCCCCCCcc
Confidence 468765544 4899999999999884
No 78
>KOG3514|consensus
Probab=54.40 E-value=8.8 Score=47.87 Aligned_cols=44 Identities=36% Similarity=0.751 Sum_probs=34.2
Q ss_pred CCccccCCccccCCCCCCCCCEEecCCCCCceEEecC-CCCCCCCCCC
Q psy9686 684 EYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCL-KGFEGPHREL 730 (769)
Q Consensus 684 ~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~-~GytG~~Ce~ 730 (769)
..|..+....|. ++||.|+|.|..... +|.|.|. .||.|+.||.
T Consensus 616 psCs~~~~~~C~-~nPC~N~g~C~egwN--rfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 616 PSCSLSNEKICE-SNPCQNGGKCSEGWN--RFICDCSGTGFEGRTCER 660 (1591)
T ss_pred cccchhhccccC-CCcccCCCCcccccc--ccccccccCcccCccccc
Confidence 456543333686 999999999998765 4999996 6999999985
No 79
>PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional
Probab=49.45 E-value=7.2 Score=45.31 Aligned_cols=44 Identities=23% Similarity=0.277 Sum_probs=33.1
Q ss_pred HHHHHHHhcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 286 LAKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 286 la~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
+.+++++ .|+|.. |+=|+..||+||+.|+.+ ....||+||-..-
T Consensus 108 i~~~~~~-~~lp~~----l~~l~~~ES~f~p~A~S~-----~gA~Gl~Q~mp~T 151 (456)
T PRK10783 108 IVGQIKK-RNMPME----LVLLPIVESAFDPHATSG-----ANAAGIWQIIPST 151 (456)
T ss_pred HHHHHHH-cCCCHH----HHHHHHHhcCCCCCCccC-----CCccceeeecHHH
Confidence 4566766 699854 333899999999999975 4578999997643
No 80
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=44.45 E-value=14 Score=29.12 Aligned_cols=21 Identities=38% Similarity=0.808 Sum_probs=16.5
Q ss_pred EEecCCCCCceEEecCCCCCCCCCC
Q psy9686 705 TCVASPGDKQITCLCLKGFEGPHRE 729 (769)
Q Consensus 705 ~Cvnt~Gs~~y~C~C~~GytG~~Ce 729 (769)
+|.... .+|.|+++|+|..|+
T Consensus 12 ~C~~~~----G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 12 TCDPST----GQCVCKPGTTGPRCD 32 (49)
T ss_dssp SEEETC----EEESBSTTEESTTS-
T ss_pred cccCCC----CEEeccccccCCcCc
Confidence 566643 489999999999996
No 81
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=43.35 E-value=8 Score=32.64 Aligned_cols=47 Identities=26% Similarity=0.533 Sum_probs=21.7
Q ss_pred eEeecCCCCeecCCCccccCCccccCCCCCCCCCEEecCCCCCceEEecCCCCCCCCC
Q psy9686 671 ITCLCLKGGTVSSEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGPHR 728 (769)
Q Consensus 671 y~C~C~~Gftg~G~~C~~~~ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~~C 728 (769)
++-.|.+.|. |..|.. .|...+.-..+-+|.. .| .=.|.+||+|.+|
T Consensus 17 ~rv~C~~nyy--G~~C~~----~C~~~~d~~ghy~Cd~-~G----~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYY--GPNCSK----FCKPRDDSFGHYTCDS-NG----NKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEE--TTTT-E----E---EEETTEEEEE-S-S------EEE-TTEESTTS
T ss_pred EEEECCCCCC--CccccC----CcCCCcCCcCCcccCC-CC----CCCCCCCCcCCCC
Confidence 5778999999 888874 4642211222335663 23 4579999999987
No 82
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=42.49 E-value=23 Score=28.03 Aligned_cols=14 Identities=21% Similarity=0.534 Sum_probs=13.2
Q ss_pred EEecCCCCCCCCCC
Q psy9686 716 TCLCLKGFEGPHRE 729 (769)
Q Consensus 716 ~C~C~~GytG~~Ce 729 (769)
+|.|++|++|..|+
T Consensus 20 ~C~C~~~~~G~~C~ 33 (50)
T cd00055 20 QCECKPNTTGRRCD 33 (50)
T ss_pred EEeCCCcCCCCCCC
Confidence 89999999999996
No 83
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=41.71 E-value=21 Score=27.09 Aligned_cols=30 Identities=27% Similarity=0.626 Sum_probs=20.2
Q ss_pred CCCCCCCCCcEEeeCCCCccceEeeCCCCCcC
Q psy9686 518 CAGRPCRNNGTCTPVSGGVVNFTCTCPSGGQG 549 (769)
Q Consensus 518 C~s~PC~ngGtCvn~~gs~~~y~C~C~~GftG 549 (769)
|...+|..++.|++...+ .+.|+|.+||..
T Consensus 2 C~~~~cP~NA~C~~~~dG--~eecrCllgyk~ 31 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDG--SEECRCLLGYKK 31 (37)
T ss_dssp -SSS---TTEEEEEETTS--EEEEEE-TTEEE
T ss_pred ccCccCCCCcccEEcCCC--CEEEEeeCCccc
Confidence 556779899999998743 499999999963
No 84
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=41.55 E-value=26 Score=32.30 Aligned_cols=32 Identities=25% Similarity=0.544 Sum_probs=23.9
Q ss_pred cCCCC-CCCCCCCcEEeeCCCCccceEeeCCCCCcCC
Q psy9686 515 LGGCA-GRPCRNNGTCTPVSGGVVNFTCTCPSGGQGQ 550 (769)
Q Consensus 515 ideC~-s~PC~ngGtCvn~~gs~~~y~C~C~~GftG~ 550 (769)
.+.|. -..|+..|.|..... ..|.|.+||.-+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~~~----~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSNNS----PKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCCCCC----CceECCCCcCCC
Confidence 35785 467999999965432 579999999754
No 85
>cd00254 LT_GEWL Lytic Transglycosylase (LT) and Goose Egg White Lysozyme (GEWL) domain. Members include the soluble and insoluble membrane-bound LTs in bacteria, the LTs in bacteriophage lambda, as well as, the eukaryotic "goose-type" lysozymes (GEWL). LTs catalyze the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetyl-D-glucosamine (GlcNAc), as do "goose-type" lysozymes. However, in addition to this, they also make a new glycosidic bond with the C6 hydroxyl group of the same muramic acid residue.
Probab=37.72 E-value=16 Score=32.66 Aligned_cols=47 Identities=11% Similarity=0.069 Sum_probs=32.0
Q ss_pred CCCccceEeeeCCcccCCCCCCCcCccccccchhccCChhHHHHHHHHHhhcc
Q psy9686 14 RGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT 66 (769)
Q Consensus 14 ~gs~~yGiFQIn~~~wC~~~~~~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~ 66 (769)
+++.++||+||+...|-.-+.... . ....-.|..+.|.-+-+++++.
T Consensus 21 ~~~~a~GlmQi~~~~~~~~~~~~~----~--~~~~~~dp~~ni~~~~~~l~~~ 67 (113)
T cd00254 21 SGAGAVGLMQFMPSTARAYGVDGG----D--GADDLFDPEDNIRAGARYLKEL 67 (113)
T ss_pred CCCCCeeeeeeCchHHHHHHHhcC----C--ChhhcCCHHHHHHHHHHHHHHH
Confidence 678999999999999976433111 0 0112247778888888888764
No 86
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=37.67 E-value=34 Score=31.45 Aligned_cols=33 Identities=30% Similarity=0.664 Sum_probs=24.6
Q ss_pred CccccCCCCCCCCCEEecCCCCCceEEecCCCCCCC
Q psy9686 691 SAACVTLNPCQNNATCVASPGDKQITCLCLKGFEGP 726 (769)
Q Consensus 691 ideC~s~~pC~ngg~Cvnt~Gs~~y~C~C~~GytG~ 726 (769)
.|.|.....|...|.|..... ..|.|.+||.-+
T Consensus 77 ~d~Cd~y~~CG~~g~C~~~~~---~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNSNNS---PKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCccccCCccEeCCCCC---CceECCCCcCCC
Confidence 456764578999999965433 489999999754
No 87
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=35.81 E-value=49 Score=26.22 Aligned_cols=25 Identities=24% Similarity=0.598 Sum_probs=17.6
Q ss_pred CCCCCCCeEeeCCCCCceEeecCCCCeecCC
Q psy9686 654 NPCQNNATCVASPGDKQITCLCLKGGTVSSE 684 (769)
Q Consensus 654 ~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~ 684 (769)
..|..++.|++. +|.|++||...+.
T Consensus 26 ~qC~~~s~C~~g------~C~C~~g~~~~~~ 50 (52)
T PF01683_consen 26 EQCIGGSVCVNG------RCQCPPGYVEVGG 50 (52)
T ss_pred CCCCCcCEEcCC------EeECCCCCEecCC
Confidence 345677889653 6999999974443
No 88
>PRK11619 lytic murein transglycosylase; Provisional
Probab=35.65 E-value=16 Score=44.26 Aligned_cols=33 Identities=36% Similarity=0.460 Sum_probs=26.4
Q ss_pred cce-EeeccccCCCcccccCCCCCCCceeeeeeeccccc
Q psy9686 303 TWV-CIATKESNRNSNARSPKNGNGSRDHGIFQINDKYW 340 (769)
Q Consensus 303 ~wv-Cla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~W 340 (769)
.|| =|+..||+||+.|+++ ....||+||....+
T Consensus 495 ~lv~ai~rqES~f~p~a~S~-----~gA~GLMQimP~Ta 528 (644)
T PRK11619 495 SYAMAIARQESAWNPKARSP-----VGASGLMQIMPGTA 528 (644)
T ss_pred HHHHHHHHHhcCCCCCCccC-----CCCeeeeeecHHHH
Confidence 444 3899999999999974 34899999998654
No 89
>PF01464 SLT: Transglycosylase SLT domain; InterPro: IPR008258 Bacterial lytic transglycosylases degrade murein via cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid and N-acetylglucosamine, with the concomitant formation of a 1,6-anhydrobond in the muramic acid residue. There are both soluble (Slt enzymes) and membrane-bound (Mlt enzymes) lytic transglycosylases that differ in size, sequence, activity, specificity and location. The multi-domain structure of the 70 Kd soluble lytic transglycosylase Slt70 is known []. Slt70 has 3 distinct domains, each rich in alpha helices: an N-terminal superhelical U-shaped domain (U-domain; IPR008939 from INTERPRO), a superhelical linker domain (L-domain, IPR012289 from INTERPRO), and a C-terminal catalytic domain (IPR008258 from INTERPRO). Both the U- and L-domain share a similar superhelical structure. These two domains are connected, and together form a closed ring with a large central hole; the catalytic domain is packed on top of, and interacts with, this ring. The catalytic domain has a lysosome-like fold. This entry represents the catalytic domain, which is structurally conserved in some membrane-bound lytic glycosylases and in bacteriophage transglycosylases, even though their sequences can differ considerably proteins []. The most conserved part of this domain is its N-terminal extremity that contains two conserved serines and a glutamate, which have been shown [] to be involved in the catalytic mechanism. This family is distantly related to IPR001916 from INTERPRO.; PDB: 2Y8P_B 3MGW_A 1QTE_A 1SLY_A 1QSA_A 153L_A 154L_A 3BKV_A 3BKH_A 1LSP_A ....
Probab=34.47 E-value=10 Score=35.03 Aligned_cols=48 Identities=21% Similarity=0.218 Sum_probs=31.6
Q ss_pred CCCCccceEeeeCCcccCCCCCCCcCccccccchhccCChhHHHHHHHHHhhcc
Q psy9686 13 CRGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNITDDVACVVKIHSQT 66 (769)
Q Consensus 13 ~~gs~~yGiFQIn~~~wC~~~~~~~~~C~~~C~~ll~~di~~~v~Ca~~i~~~~ 66 (769)
.+++.++|||||+...|=.-+.... .-.+++ .|....|..+-+++++.
T Consensus 36 ~~~~~a~GlmQi~~~t~~~~~~~~~-----~~~~~~-~dp~~ni~~g~~~L~~~ 83 (121)
T PF01464_consen 36 SNGSGAYGLMQINPATWKDYGRKGG-----LSPDDL-FDPEDNIQAGAKYLKWL 83 (121)
T ss_dssp ETTTTEETTTTEETTTSHHHSHTHC-----TSHHHH-HHHHHHHHHHHHHHHHH
T ss_pred CCCCcceeccccCCcccchhhhccc-----ccchhh-hCHHHHHHHHHHHHHHH
Confidence 5688899999999998865332111 223434 45667777777777764
No 90
>PRK10859 membrane-bound lytic transglycosylase F; Provisional
Probab=33.20 E-value=18 Score=42.18 Aligned_cols=31 Identities=29% Similarity=0.398 Sum_probs=25.0
Q ss_pred ceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 304 WVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 304 wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
..=+++.||+||+.|+++. ...||+||...-
T Consensus 306 l~a~~~~ES~~~~~a~s~~-----ga~Glmq~~p~t 336 (482)
T PRK10859 306 LAAIAYQESHWNPQATSPT-----GVRGLMMLTRNT 336 (482)
T ss_pred HHHHHHHhCCCCCCCCcCC-----CCEecceeCHHH
Confidence 3448899999999999752 369999999754
No 91
>PF05497 Destabilase: Destabilase; InterPro: IPR008597 Destabilase (3.5.1.44 from EC) is an endo-epsilon(gamma-Glu)-Lys isopeptidase, which cleaves isopeptide bonds formed by transglutaminase (Factor XIIIa) between glutamine gamma-carboxamide and the epsilon-amino group of lysine [].; GO: 0003796 lysozyme activity; PDB: 2DQA_B 3AB6_A 3AYQ_A.
Probab=31.02 E-value=4.8 Score=38.30 Aligned_cols=62 Identities=32% Similarity=0.461 Sum_probs=37.8
Q ss_pred EeeccccCCCcccccCCCCCCCceeeeeeeccccccCCCCCCccccccccccccCCCh---hHHHHHHHHHHhc
Q psy9686 306 CIATKESNRNSNARSPKNGNGSRDHGIFQINDKYWCTASGPAGKECHAKCSSFEDNNI---TDDVACVVKIHSQ 376 (769)
Q Consensus 306 Cla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~WC~~~~~~~~~C~~~C~~ll~~di---~~~i~Cak~i~~~ 376 (769)
||.+.||+=|..+.=. +..||..-|+|||+-.||=+-++. +. +++.++- .+|..||.++++.
T Consensus 8 CiC~~~SgC~~~~~C~-~~~~s~~CG~f~Is~~YW~Dag~~-g~-------~~~~~a~~~Ca~d~~Ca~~tV~~ 72 (119)
T PF05497_consen 8 CICEAESGCNMPAGCR-MDVGSLSCGPFQISWPYWQDAGKP-GD-------SPSRDAYERCANDPYCAERTVQN 72 (119)
T ss_dssp HHHHHHCSS-S---EE-EETTEEEETTTTEEHHHHHHTT---SS-------S-----HHHHHTSHHHHHHHHHH
T ss_pred HHhHHhhCCCCCCCCC-CCCCCceeeeeEecHHHHHHcCCC-CC-------CchhhhHHHHcCCHHHHHHHHHH
Confidence 7888899877666643 345788999999999999554432 21 1111122 3578899999885
No 92
>PRK13722 lytic transglycosylase; Provisional
Probab=30.17 E-value=15 Score=37.05 Aligned_cols=18 Identities=11% Similarity=0.314 Sum_probs=15.9
Q ss_pred cCCCCccceEeeeCCccc
Q psy9686 12 DCRGSRDHGIFQINDKYW 29 (769)
Q Consensus 12 ~~~gs~~yGiFQIn~~~w 29 (769)
|.+|+.|+||-|||+.++
T Consensus 55 n~~gs~d~GLMQIn~~~~ 72 (169)
T PRK13722 55 NPVTGYGSGLMQVDSQHF 72 (169)
T ss_pred CCCCCeeecCCcccHHHH
Confidence 678899999999999765
No 93
>PHA00368 internal virion protein D
Probab=29.88 E-value=24 Score=44.23 Aligned_cols=39 Identities=23% Similarity=0.332 Sum_probs=28.0
Q ss_pred hcCCCCCCcccceEeeccccCCCcccccCCCCCCCceeeeeeecccc
Q psy9686 293 QRGIARRDVPTWVCIATKESNRNSNARSPKNGNGSRDHGIFQINDKY 339 (769)
Q Consensus 293 ~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~~~gs~~yGifQIn~~~ 339 (769)
++|++...|. =|++.||+||+.|+.+. ...||+||-..-
T Consensus 21 kyGVppdLL~---ALIrQES~FNP~AvS~A-----GA~GLMQLMPaT 59 (1315)
T PHA00368 21 AHGVSYDLLR---KVGWDESRFNPTAKSPT-----GPKGLMQFTKAT 59 (1315)
T ss_pred HcCCCHHHHH---HHHHHcCCCCCCCEeCC-----CCEeeeeecHHH
Confidence 3566632221 38899999999999753 478999998644
No 94
>COG4764 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.17 E-value=43 Score=33.39 Aligned_cols=93 Identities=24% Similarity=0.253 Sum_probs=54.7
Q ss_pred HHHHHHhcCCCCCCcccceEeeccccCCCcccccCCC--------CCCCceeeeeeeccccccCCCC-CCcccccccccc
Q psy9686 287 AKFLVRQRGIARRDVPTWVCIATKESNRNSNARSPKN--------GNGSRDHGIFQINDKYWCTASG-PAGKECHAKCSS 357 (769)
Q Consensus 287 a~~l~~~~g~~~~~l~~wvCla~~eS~~~t~a~~~~~--------~~gs~~yGifQIn~~~WC~~~~-~~~~~C~~~C~~ 357 (769)
|....+++|+|- +-.+=+++.||+|...|.-+.. .--|+.||-=|--|.-|=.-.. +++. .
T Consensus 48 A~~~er~~GvPV---pVlMAtiy~ESgFk~~ArPPRtklLgfIPW~rvSsAyGysQAlDgTWa~YqretG~W-------~ 117 (197)
T COG4764 48 ALRTEREYGVPV---PVLMATIYTESGFKHNARPPRTKLLGFIPWKRVSSAYGYSQALDGTWARYQRETGRW-------S 117 (197)
T ss_pred HHHHHHHhCCCc---hHHHHHHHHhhcCCCCCCCCcceeEeeeeccCccchhhhhHhhcccHHHHHHHhCCc-------c
Confidence 444455578874 4445578999999999875421 1348899999999988865211 1111 1
Q ss_pred ccCCChhHHHHHHH-HHHhcccccccCcccccccCc
Q psy9686 358 FEDNNITDDVACVV-KIHSQTQRARGNGFQAWSTYH 392 (769)
Q Consensus 358 ll~~di~~~i~Cak-~i~~~~~~~~~~G~~aW~~w~ 392 (769)
-.-+|..|.|.=+- -+.+.+ ..+|..-|.+|.
T Consensus 118 AsRtdFaDAi~FvgWYh~ks~---~~nGva~nD~y~ 150 (197)
T COG4764 118 ASRTDFADAIDFVGWYHQKSQ---RVNGVARNDAYN 150 (197)
T ss_pred cccchHHHHHHHHHHHHHhHH---hhcCccccchHH
Confidence 12367777766543 222222 235666665543
No 95
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=26.76 E-value=56 Score=25.53 Aligned_cols=14 Identities=21% Similarity=0.589 Sum_probs=12.9
Q ss_pred EEecCCCCCCCCCC
Q psy9686 716 TCLCLKGFEGPHRE 729 (769)
Q Consensus 716 ~C~C~~GytG~~Ce 729 (769)
+|.|++|++|.+|+
T Consensus 19 ~C~C~~~~~G~~C~ 32 (46)
T smart00180 19 QCECKPNVTGRRCD 32 (46)
T ss_pred EEECCCCCCCCCCC
Confidence 89999999999996
No 96
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=24.37 E-value=29 Score=29.26 Aligned_cols=48 Identities=27% Similarity=0.579 Sum_probs=20.4
Q ss_pred CcccccCCCCCCCccccccccccccCCCCCCCCeEeeCCCCCceEeecCCCCeecCCCc
Q psy9686 628 SFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCVASPGDKQITCLCLKGGTVSSEYC 686 (769)
Q Consensus 628 sy~C~C~~Gy~G~~Ce~~~ideC~~~~pC~n~gtCvnt~Gs~~y~C~C~~Gftg~G~~C 686 (769)
.+.-.|.+.|.|..|.. .|...+.-..+-+|.. .|. -+|.+||+ |..|
T Consensus 16 ~~rv~C~~nyyG~~C~~----~C~~~~d~~ghy~Cd~-~G~----~~C~~Gw~--G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSK----FCKPRDDSFGHYTCDS-NGN----KVCLPGWT--GPNC 63 (63)
T ss_dssp -------TTEETTTT-E----E---EEETTEEEEE-S-S------EEE-TTEE--STTS
T ss_pred EEEEECCCCCCCccccC----CcCCCcCCcCCcccCC-CCC----CCCCCCCc--CCCC
Confidence 35667999999999983 4652111122235643 332 47999999 6665
No 97
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=24.26 E-value=70 Score=29.80 Aligned_cols=31 Identities=29% Similarity=0.747 Sum_probs=21.9
Q ss_pred CCC--CCCCCCCCcEEeeCCCCc--cceEeeCCCC
Q psy9686 516 GGC--AGRPCRNNGTCTPVSGGV--VNFTCTCPSG 546 (769)
Q Consensus 516 deC--~s~PC~ngGtCvn~~gs~--~~y~C~C~~G 546 (769)
++| ..+.|.++|.|++...+. .-|.|.|.+.
T Consensus 6 ~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T 40 (103)
T PF12955_consen 6 DACENATNNCSGHGSCVKKYGSGGGDCFACKCKPT 40 (103)
T ss_pred HHHHHhccCCCCCceEeeccCCCccceEEEEeecc
Confidence 345 345699999999874321 3599999983
No 98
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=20.90 E-value=89 Score=29.14 Aligned_cols=26 Identities=27% Similarity=0.671 Sum_probs=19.4
Q ss_pred CCCCCCCCEEecCCCC---CceEEecCCC
Q psy9686 697 LNPCQNNATCVASPGD---KQITCLCLKG 722 (769)
Q Consensus 697 ~~pC~ngg~Cvnt~Gs---~~y~C~C~~G 722 (769)
.+.|..+|.|+...+. .=|.|.|.+.
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T 40 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPT 40 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeecc
Confidence 5778999999987431 3488999873
Done!