Query psy10559
Match_columns 418
No_of_seqs 316 out of 1827
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 21:30:01 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10559.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10559hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4659|consensus 100.0 2.2E-55 4.9E-60 473.5 16.9 241 112-417 4-246 (1899)
2 KOG1219|consensus 99.4 2.9E-13 6.2E-18 154.1 7.4 105 8-136 3865-3978(4289)
3 KOG4289|consensus 99.2 1.1E-11 2.4E-16 137.4 5.3 107 9-137 1181-1318(2531)
4 KOG1225|consensus 99.0 4.1E-10 8.8E-15 118.4 8.4 109 9-133 251-365 (525)
5 KOG4289|consensus 98.9 1.1E-08 2.3E-13 114.4 14.0 71 6-76 1238-1319(2531)
6 KOG1225|consensus 98.9 1.9E-09 4.2E-14 113.4 7.5 69 2-75 271-343 (525)
7 KOG1226|consensus 98.5 1.6E-07 3.4E-12 101.2 7.6 96 13-136 515-621 (783)
8 KOG1219|consensus 98.5 2.1E-07 4.6E-12 107.8 6.7 75 39-137 3865-3940(4289)
9 KOG1226|consensus 98.4 5E-07 1.1E-11 97.3 6.7 95 13-137 468-582 (783)
10 KOG1214|consensus 98.2 2.9E-06 6.2E-11 91.8 7.6 106 6-134 733-861 (1289)
11 KOG1217|consensus 97.6 0.00014 3.1E-09 74.7 8.3 116 15-134 137-306 (487)
12 KOG4260|consensus 97.6 5.4E-05 1.2E-09 73.1 4.6 111 12-129 150-303 (350)
13 PF07974 EGF_2: EGF-like domai 97.4 0.0002 4.3E-09 47.6 3.2 24 13-36 7-32 (32)
14 KOG4260|consensus 97.3 0.00024 5.1E-09 68.8 4.8 100 27-131 132-269 (350)
15 KOG1217|consensus 97.3 0.00074 1.6E-08 69.4 8.9 103 18-137 243-355 (487)
16 PF00008 EGF: EGF-like domain 97.3 0.0001 2.2E-09 48.9 1.4 28 42-70 3-31 (32)
17 KOG1214|consensus 97.2 0.0015 3.2E-08 71.5 9.3 47 23-70 809-859 (1289)
18 smart00179 EGF_CA Calcium-bind 97.0 0.0011 2.4E-08 44.8 3.9 34 39-73 3-39 (39)
19 cd00054 EGF_CA Calcium-binding 96.5 0.0035 7.6E-08 41.6 3.8 34 39-73 3-38 (38)
20 smart00179 EGF_CA Calcium-bind 96.3 0.0046 9.9E-08 41.7 3.4 32 6-37 1-39 (39)
21 PF00008 EGF: EGF-like domain 96.3 0.0017 3.7E-08 43.0 1.1 30 101-132 2-31 (32)
22 PF07974 EGF_2: EGF-like domai 96.2 0.0055 1.2E-07 40.8 3.2 27 43-72 6-32 (32)
23 PF07645 EGF_CA: Calcium-bindi 95.9 0.0045 9.8E-08 43.4 1.9 30 38-68 2-34 (42)
24 KOG1836|consensus 95.9 0.014 2.9E-07 69.9 6.7 110 14-136 684-813 (1705)
25 smart00181 EGF Epidermal growt 95.8 0.013 2.8E-07 38.7 3.7 28 43-72 6-34 (35)
26 smart00051 DSL delta serrate l 95.8 0.011 2.4E-07 45.5 3.6 44 24-72 18-63 (63)
27 PF12661 hEGF: Human growth fa 95.8 0.0051 1.1E-07 32.7 1.2 13 24-36 1-13 (13)
28 cd00054 EGF_CA Calcium-binding 95.8 0.012 2.6E-07 38.9 3.4 31 7-37 2-38 (38)
29 cd00053 EGF Epidermal growth f 95.6 0.016 3.5E-07 37.6 3.6 29 43-72 6-35 (36)
30 KOG0994|consensus 95.2 0.043 9.3E-07 62.3 7.1 88 2-94 848-964 (1758)
31 PF12661 hEGF: Human growth fa 94.6 0.016 3.4E-07 30.8 0.8 13 60-72 1-13 (13)
32 KOG0994|consensus 94.3 0.052 1.1E-06 61.6 4.9 49 23-74 1037-1099(1758)
33 PF07645 EGF_CA: Calcium-bindi 93.6 0.041 9E-07 38.5 1.6 25 103-130 10-34 (42)
34 PF13715 DUF4480: Domain of un 93.4 0.56 1.2E-05 37.4 8.2 79 163-244 2-81 (88)
35 cd00053 EGF Epidermal growth f 93.3 0.088 1.9E-06 34.0 2.9 30 102-134 5-35 (36)
36 smart00181 EGF Epidermal growt 91.9 0.17 3.7E-06 33.3 2.8 28 103-134 6-34 (35)
37 PF12947 EGF_3: EGF domain; I 91.9 0.082 1.8E-06 36.1 1.2 27 43-70 6-32 (36)
38 smart00051 DSL delta serrate l 90.7 0.34 7.4E-06 37.2 3.7 47 59-134 17-63 (63)
39 PF13620 CarboxypepD_reg: Carb 90.5 1.5 3.2E-05 34.3 7.4 75 163-239 2-81 (82)
40 KOG1836|consensus 90.3 0.94 2E-05 54.8 8.6 71 13-88 732-822 (1705)
41 KOG3512|consensus 88.4 1.4 3.1E-05 46.2 7.3 102 23-136 295-429 (592)
42 PF12947 EGF_3: EGF domain; I 86.1 0.34 7.4E-06 33.0 0.9 27 103-132 6-32 (36)
43 PHA02887 EGF-like protein; Pro 84.2 0.99 2.2E-05 38.8 3.0 31 43-74 92-123 (126)
44 PF01414 DSL: Delta serrate li 83.8 0.28 6.1E-06 37.7 -0.4 45 23-72 17-63 (63)
45 PHA02887 EGF-like protein; Pro 82.4 0.84 1.8E-05 39.3 1.9 33 103-137 92-124 (126)
46 PHA03099 epidermal growth fact 81.7 1 2.2E-05 39.4 2.1 36 102-139 50-85 (139)
47 cd03521 Link_domain_KIAA0527_l 81.7 0.32 7E-06 40.0 -0.9 23 267-289 9-34 (95)
48 KOG1218|consensus 81.5 6.8 0.00015 38.4 8.4 31 21-51 160-194 (316)
49 KOG3607|consensus 81.2 1.4 3E-05 49.3 3.5 55 13-76 605-659 (716)
50 PF04863 EGF_alliinase: Alliin 80.8 0.9 2E-05 33.9 1.3 33 44-76 18-53 (56)
51 PHA03099 epidermal growth fact 80.3 1.3 2.8E-05 38.7 2.4 32 44-76 52-84 (139)
52 PF12955 DUF3844: Domain of un 79.9 0.97 2.1E-05 38.2 1.4 10 66-75 53-62 (103)
53 PF00053 Laminin_EGF: Laminin 78.4 1.4 3.1E-05 31.6 1.7 14 60-73 19-32 (49)
54 KOG3512|consensus 76.8 6 0.00013 41.8 6.3 84 48-146 406-489 (592)
55 KOG3516|consensus 76.7 1.8 3.9E-05 50.1 2.7 42 38-80 545-589 (1306)
56 cd00055 EGF_Lam Laminin-type e 76.5 2 4.3E-05 31.1 2.1 15 23-37 19-33 (50)
57 PF12955 DUF3844: Domain of un 75.9 2 4.4E-05 36.2 2.2 31 42-72 12-46 (103)
58 PF06247 Plasmod_Pvs28: Plasmo 74.7 0.96 2.1E-05 42.1 -0.0 94 13-131 51-161 (197)
59 KOG3607|consensus 74.4 2.1 4.5E-05 47.9 2.5 33 6-38 624-657 (716)
60 KOG1218|consensus 72.9 10 0.00023 37.1 6.9 48 23-74 90-139 (316)
61 PF04863 EGF_alliinase: Alliin 69.2 3.2 7E-05 31.0 1.7 26 12-37 17-50 (56)
62 cd00055 EGF_Lam Laminin-type e 68.7 5.4 0.00012 28.8 2.8 16 58-73 18-33 (50)
63 PF00053 Laminin_EGF: Laminin 67.2 3 6.5E-05 29.8 1.2 20 18-37 11-32 (49)
64 PF01683 EB: EB module; Inter 67.0 6.7 0.00014 28.3 3.0 41 19-68 6-46 (52)
65 PF12662 cEGF: Complement Clr- 65.4 5.6 0.00012 24.7 1.9 11 58-68 1-11 (24)
66 KOG3514|consensus 64.3 4.1 8.9E-05 46.8 2.1 36 40-76 625-662 (1591)
67 cd06245 M14_CPD_III The third 60.2 33 0.00073 35.3 7.7 71 161-234 287-357 (363)
68 PF06247 Plasmod_Pvs28: Plasmo 60.1 4.6 0.0001 37.7 1.3 84 23-130 20-119 (197)
69 PF14670 FXa_inhibition: Coagu 60.0 4.7 0.0001 27.5 1.0 19 49-68 10-28 (36)
70 cd01475 vWA_Matrilin VWA_Matri 58.4 10 0.00022 35.7 3.5 34 33-69 181-218 (224)
71 KOG3516|consensus 54.2 9.4 0.0002 44.5 2.7 36 40-76 957-994 (1306)
72 cd03863 M14_CPD_II The second 53.8 45 0.00098 34.5 7.5 73 160-234 296-369 (375)
73 PF09064 Tme5_EGF_like: Thromb 52.0 11 0.00024 25.4 1.7 11 58-68 17-27 (34)
74 smart00180 EGF_Lam Laminin-typ 50.4 11 0.00024 26.7 1.7 20 18-37 11-32 (46)
75 PRK15036 hydroxyisourate hydro 50.1 48 0.001 29.4 6.1 70 162-231 28-109 (137)
76 PF01414 DSL: Delta serrate li 49.9 7.8 0.00017 29.7 0.9 20 17-36 43-63 (63)
77 PF01683 EB: EB module; Inter 49.6 49 0.0011 23.7 5.1 20 13-32 27-46 (52)
78 KOG3514|consensus 46.5 1.1E+02 0.0024 35.9 9.3 37 100-139 626-663 (1591)
79 cd01475 vWA_Matrilin VWA_Matri 45.8 16 0.00035 34.4 2.6 34 93-131 183-218 (224)
80 cd03515 Link_domain_TSG_6_like 44.6 3.7 8E-05 34.1 -1.7 23 267-289 9-34 (93)
81 cd03864 M14_CPN Peptidase M14 40.4 1E+02 0.0022 32.2 7.6 71 162-234 317-387 (392)
82 cd03865 M14_CPE_H Peptidase M1 39.6 78 0.0017 33.2 6.6 71 163-235 328-398 (402)
83 PF00954 S_locus_glycop: S-loc 39.5 34 0.00074 28.6 3.3 29 39-69 78-108 (110)
84 cd03867 M14_CPZ Peptidase M14- 37.8 91 0.002 32.4 6.8 64 161-226 318-381 (395)
85 PF12946 EGF_MSP1_1: MSP1 EGF 37.3 22 0.00049 24.5 1.5 27 42-68 4-30 (37)
86 cd03858 M14_CP_N-E_like Carbox 37.1 1.1E+02 0.0024 31.4 7.3 71 161-233 298-369 (374)
87 cd03518 Link_domain_HAPLN_modu 36.6 5.8 0.00013 33.1 -1.8 23 267-289 9-34 (95)
88 cd03516 Link_domain_CD44_like 35.8 6.6 0.00014 35.2 -1.7 29 261-289 4-39 (144)
89 cd03868 M14_CPD_I The first ca 35.0 1.1E+02 0.0024 31.4 6.9 74 159-234 294-368 (372)
90 KOG0196|consensus 33.8 45 0.00097 37.9 3.9 44 23-69 259-318 (996)
91 cd01102 Link_Domain The link d 32.0 7.9 0.00017 32.1 -1.7 23 267-289 9-34 (92)
92 cd03866 M14_CPM Peptidase M14 29.7 1.8E+02 0.0039 30.1 7.4 70 159-229 293-363 (376)
93 cd01204 IRS_PTB Insulin recept 29.2 96 0.0021 26.2 4.2 18 395-412 45-62 (104)
94 PF00818 Ice_nucleation: Ice n 26.5 26 0.00056 19.7 0.3 13 370-382 2-14 (16)
95 smart00445 LINK Link (Hyaluron 23.2 12 0.00026 31.1 -2.1 22 268-289 11-35 (94)
96 PF06339 Ectoine_synth: Ectoin 21.8 71 0.0015 28.0 2.3 36 338-373 20-55 (126)
No 1
>KOG4659|consensus
Probab=100.00 E-value=2.2e-55 Score=473.46 Aligned_cols=241 Identities=44% Similarity=0.738 Sum_probs=220.9
Q ss_pred ccCCCCCCceeecCCCCccccccceeeeeecCCceecccccCCCCCcccceeeceEEeecCCceeceEEEeeCCCCceEE
Q psy10559 112 VSAPKPIDILLRKQPPAITASFFERMKFLIEESSLQNYAKKDNFNESRSAVVRGRVVTSMGMGLVGVRVSTSTPLEGFTL 191 (418)
Q Consensus 112 ~~~~~~~~~~~C~C~~G~~g~~Ce~~~fli~~~s~~~~~~~~~~~~~~~~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~ 191 (418)
...+.|++.....|+ ++...|.++++|||.+.+++.|+....|++.+.+||+|+|++..|+|++||+||+...+.+||+
T Consensus 4 ~~a~~pi~vl~r~~p-~~~a~f~~r~~fli~e~s~q~y~~~~~fne~~~~vIrgrvv~~~~~pLVGVrVS~~~~~~yfTl 82 (1899)
T KOG4659|consen 4 STAASPIEVLMRMPP-IFNANFAQRVKFLIMEKSVQSYTDSSQFNENRISVIRGRVVWGGGVPLVGVRVSDAAHPLYFTL 82 (1899)
T ss_pred CcccChHHHHhhcCC-ccchhHHHhhheeechhhhcccCchhhhccccceEEeccEeecCCcceEEEEeecccccceEEE
Confidence 333444444444444 4558999999999999999999999999999999999999999999999999999955555999
Q ss_pred EecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEeccEEEeecCCCCCCC-CCcccccCcCcccCceeeccc
Q psy10559 192 TRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVIIDTITMQMGDDRPVST-TQHACKDHDYDTMKPVVLATW 270 (418)
Q Consensus 192 t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~id~v~m~~~~~~~~~~-~~~~C~~h~~~~~~pv~~~~~ 270 (418)
||+||+|||+||||.+++|+|.|+||.++.++||||||++++||+|+|++.+..++-. +.++|+.|.++.|.||++++|
T Consensus 83 TR~DG~FDL~vnGg~svtLqF~R~pF~~qkr~v~vpwnq~i~id~vvm~~~~~~~~~~~~~~~C~~~~~~~pdpvvissw 162 (1899)
T KOG4659|consen 83 TREDGYFDLTVNGGRSVTLQFLRTPFQSQKRSVFVPWNQIIHIDDVVMYRQEGGSPPAPARAKCSPTLRRIPDPVVISSW 162 (1899)
T ss_pred EecCceEEEEEcccceEEEEEccCCCcccceeEEeChhhEEEEEeEEEEeecCCCCCCCCCCccChhhccCCCcEEecCc
Confidence 9999999999999999999999999999999999999999999999999988885432 468999999999999999999
Q ss_pred cccccccCCCCCcccchhhhhhcccccccCceeeeeccchhhhcccceeeeecccceeeeeecCccccCCCCCcccCCCC
Q psy10559 271 KHGFQGACPDRSSILAESQFKQLGSKLAIMPTVLITSASIFYQIGRLWQVTLKGKGLVKLLRMGRKYLTPSSNISCASSN 350 (418)
Q Consensus 271 ~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (418)
+++|+++||++++|+||+|
T Consensus 163 ~~tf~ga~pdr~~Iv~esq------------------------------------------------------------- 181 (1899)
T KOG4659|consen 163 QYTFDGACPDRTSIVVESQ------------------------------------------------------------- 181 (1899)
T ss_pred ccccCCcCCCCcccccchh-------------------------------------------------------------
Confidence 9999999999999999999
Q ss_pred CCceeeeeeeeCCCceEEEEecCCCCCCceeEEEeeCCCCCCcccceEEeEEeecccee-EEeecCCC
Q psy10559 351 PPQVVQESFQIPGTGLNLVYHSSRSAGYLSTIQLQLTPQVIPDTLQLRKSLCKDFTHTL-TFYEKRPT 417 (418)
Q Consensus 351 ~~~~~~e~~~i~gt~~~l~y~s~~~~gy~~~~~~~l~~~~ip~~l~~vh~~~~~~~~~~-~~~~~~~~ 417 (418)
|+|||||||||++||+|+|||++||+|+|+|+||+++||.+|+||||+|++.+|.+ ++|+.||.
T Consensus 182 ---vi~Esl~ipgt~vrLvY~SsRs~gy~St~ki~lt~d~vp~~l~kVHl~i~ieG~lf~r~f~a~~n 246 (1899)
T KOG4659|consen 182 ---VIQESLPIPGTDVRLVYDSSRSPGYPSTMKIGLTYDRVPKELRKVHLNIRIEGRLFDRVFAARPN 246 (1899)
T ss_pred ---hhhhhccCCCCceEEEEecccCCCcchheEEeecccccccceEEEEEEEEEeeEEeeeeccCCCC
Confidence 99999999999999999999999999999999999999999999999999999876 57888875
No 2
>KOG1219|consensus
Probab=99.40 E-value=2.9e-13 Score=154.05 Aligned_cols=105 Identities=27% Similarity=0.643 Sum_probs=93.0
Q ss_pred cCCC-cCCCCCcEEec-----ceEeeCCCCcCCCCCc--CCC-CCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccccc
Q psy10559 8 DVDL-LWCNEHGQCKN-----GTCLCVTGWNGKHCTL--EGC-PNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQ 78 (418)
Q Consensus 8 d~C~-~pC~n~G~C~~-----g~C~C~~G~~G~~Ce~--~~C-~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~ 78 (418)
+.|. +||+++|+|.. |.|.|++-|.|.+||+ ..| ++||..+|+| ....++|.|.|+.||+|..|+...
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtC-ip~~n~f~CnC~~gyTG~~Ce~~G-- 3941 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTC-IPFYNGFLCNCPNGYTGKRCEARG-- 3941 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEE-EecCCCeeEeCCCCccCceeeccc--
Confidence 6785 69999999974 6999999999999994 589 9999999999 555788999999999999999541
Q ss_pred cCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCccccccce
Q psy10559 79 NCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFER 136 (418)
Q Consensus 79 ~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~ 136 (418)
.+.|+.++|.+++.|++.++ +|.|.|.+||.|..|..
T Consensus 3942 ------------------i~eCs~n~C~~gg~C~n~~g---sf~CncT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3942 ------------------ISECSKNVCGTGGQCINIPG---SFHCNCTPGILGRTCCA 3978 (4289)
T ss_pred ------------------ccccccccccCCceeeccCC---ceEeccChhHhcccCcc
Confidence 36778899999999999999 89999999999999864
No 3
>KOG4289|consensus
Probab=99.20 E-value=1.1e-11 Score=137.41 Aligned_cols=107 Identities=23% Similarity=0.611 Sum_probs=88.3
Q ss_pred CCCc-CCCCCcEEec--------------------------ceEeeCCCCcCCCCC--cCCC-CCCCCCCcEEecCCCCC
Q psy10559 9 VDLL-WCNEHGQCKN--------------------------GTCLCVTGWNGKHCT--LEGC-PNSCSNHGQCRVNSDSQ 58 (418)
Q Consensus 9 ~C~~-pC~n~G~C~~--------------------------g~C~C~~G~~G~~Ce--~~~C-~~~C~~~G~C~~~~~g~ 58 (418)
.|.+ ||.|.-+|.. ..|.|++||+|..|+ ++.| .++|.+||+| ....|+
T Consensus 1181 iClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C-~srEgg 1259 (2531)
T KOG4289|consen 1181 ICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRC-RSREGG 1259 (2531)
T ss_pred hhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcccccchhHhhhcCCCCCCCce-EEecCc
Confidence 4765 8988888842 279999999999997 7899 9999999999 666899
Q ss_pred ceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCC-cccccccee
Q psy10559 59 WECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPA-ITASFFERM 137 (418)
Q Consensus 59 ~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G-~~g~~Ce~~ 137 (418)
|.|.|.+||+|.+|++.... .. |..+.|.|+++|++... ..+.|.|+.| |++++|+-.
T Consensus 1260 YtCeCrpg~tGehCEvs~~a-------------gr------CvpGvC~nggtC~~~~n--ggf~c~Cp~ge~e~prC~v~ 1318 (2531)
T KOG4289|consen 1260 YTCECRPGFTGEHCEVSARA-------------GR------CVPGVCKNGGTCVNLLN--GGFCCHCPYGEFEDPRCEVT 1318 (2531)
T ss_pred eeEEecCCccccceeeeccc-------------Cc------cccceecCCCEEeecCC--CceeccCCCcccCCCceEEE
Confidence 99999999999999987631 11 34567999999999865 4688999986 788888864
No 4
>KOG1225|consensus
Probab=99.04 E-value=4.1e-10 Score=118.43 Aligned_cols=109 Identities=34% Similarity=0.755 Sum_probs=85.5
Q ss_pred CCCcCCCCCcEEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccc-cccCC-CCccC
Q psy10559 9 VDLLWCNEHGQCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLL-EQNCN-DGKDN 86 (418)
Q Consensus 9 ~C~~pC~n~G~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~-~~~C~-~g~c~ 86 (418)
.|+..|.++|.|++|+|+|++||+|..|.+..|+..|++|+.| .+ + .|.|++||.|..|+..- ...|. +|.|.
T Consensus 251 ~C~~~c~~~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~-~~--g--~CiC~~g~~G~dCs~~~cpadC~g~G~Ci 325 (525)
T KOG1225|consen 251 YCPGGCTGRGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVC-VD--G--ECICNPGYSGKDCSIRRCPADCSGHGKCI 325 (525)
T ss_pred cCCCCCcccceEeCCeEeCCCCCcCCCCCcccCCcccCCCcee-cC--C--EeecCCCccccccccccCCccCCCCCccc
Confidence 5778899999999999999999999999988898889999998 33 2 89999999999998665 35564 56666
Q ss_pred CCCCccccCCC----CCCCCCCCCCCCccccCCCCCCceeecCCCCccccc
Q psy10559 87 DKDGLVDCEDP----ECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASF 133 (418)
Q Consensus 87 ~~~G~~~C~dp----dCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~ 133 (418)
+|.+.|..- .|..+. |.+++.|++. |+|..||.|+.
T Consensus 326 --~G~C~C~~Gy~G~~C~~~~-C~~~g~cv~g--------C~C~~Gw~G~d 365 (525)
T KOG1225|consen 326 --DGECLCDEGYTGELCIQRA-CSGGGQCVNG--------CKCKKGWRGPD 365 (525)
T ss_pred --CCceEeCCCCcCCcccccc-cCCCceeccC--------ceeccCccCCC
Confidence 555556432 444443 8888877652 68899998866
No 5
>KOG4289|consensus
Probab=98.93 E-value=1.1e-08 Score=114.38 Aligned_cols=71 Identities=32% Similarity=0.830 Sum_probs=60.3
Q ss_pred cccCC-CcCCCCCcEEec----ceEeeCCCCcCCCCCcC----CC-CCCCCCCcEEecCCCCCceEEcCCC-CCCCCCCc
Q psy10559 6 NIDVD-LLWCNEHGQCKN----GTCLCVTGWNGKHCTLE----GC-PNSCSNHGQCRVNSDSQWECKCSDG-WDGKDCSV 74 (418)
Q Consensus 6 ~id~C-~~pC~n~G~C~~----g~C~C~~G~~G~~Ce~~----~C-~~~C~~~G~C~~~~~g~~~C~C~~G-~~G~~C~~ 74 (418)
.+|.| ..||.|||+|.. |+|+|.+||+|++||++ .| +..|.|+|+|.....|++.|.|+.| |.++.|+.
T Consensus 1238 eiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1238 EIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred hhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecCCCceeccCCCcccCCCceEE
Confidence 37788 469999999974 79999999999999964 68 9999999999555578899999987 77888976
Q ss_pred cc
Q psy10559 75 LL 76 (418)
Q Consensus 75 ~~ 76 (418)
..
T Consensus 1318 ~t 1319 (2531)
T KOG4289|consen 1318 TT 1319 (2531)
T ss_pred Ee
Confidence 53
No 6
>KOG1225|consensus
Probab=98.92 E-value=1.9e-09 Score=113.36 Aligned_cols=69 Identities=39% Similarity=1.022 Sum_probs=61.3
Q ss_pred CCccccc----CCCcCCCCCcEEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCcc
Q psy10559 2 RGFSNID----VDLLWCNEHGQCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVL 75 (418)
Q Consensus 2 ~Gf~~id----~C~~pC~n~G~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~ 75 (418)
.||++.| .|+..|++|+.|.++.|+|++||.|+.|++..|+..|++||.| . ...|.|.+||+|..|+..
T Consensus 271 ~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G~C-i----~G~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 271 PGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYSGKDCSIRRCPADCSGHGKC-I----DGECLCDEGYTGELCIQR 343 (525)
T ss_pred CCCcCCCCCcccCCcccCCCceecCCEeecCCCccccccccccCCccCCCCCcc-c----CCceEeCCCCcCCccccc
Confidence 4677543 4888899999999999999999999999998999999999999 4 248999999999999976
No 7
>KOG1226|consensus
Probab=98.53 E-value=1.6e-07 Score=101.18 Aligned_cols=96 Identities=31% Similarity=0.808 Sum_probs=71.5
Q ss_pred CCCCCcEEecceEeeCCCCc----CCCCCcC--CCC----CCCCCCcEEecCCCCCceEEcCCCCCCCCCCccccccCCC
Q psy10559 13 WCNEHGQCKNGTCLCVTGWN----GKHCTLE--GCP----NSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCND 82 (418)
Q Consensus 13 pC~n~G~C~~g~C~C~~G~~----G~~Ce~~--~C~----~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~ 82 (418)
+|+++|.|.-|+|+|++... |..||.+ .|+ ..|.+||+|..+ +|+|.+||+|..|+-...
T Consensus 515 vCSgrG~C~CGqC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG-----~CvC~~GwtG~~C~C~~s----- 584 (783)
T KOG1226|consen 515 VCSGRGDCVCGQCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECG-----RCVCNPGWTGSACNCPLS----- 584 (783)
T ss_pred CcCCCCcEeCCceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCC-----cEEcCCCCccCCCCCCCC-----
Confidence 79999999999999999988 9999855 453 349999999544 899999999999985431
Q ss_pred CccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCC-ccccccce
Q psy10559 83 GKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPA-ITASFFER 136 (418)
Q Consensus 83 g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G-~~g~~Ce~ 136 (418)
...|..++ ...|...|.|... .|+|... |.|.+||.
T Consensus 585 --------td~C~~~~---G~iCSGrG~C~Cg-------~C~C~~~~~sG~~CE~ 621 (783)
T KOG1226|consen 585 --------TDTCESSD---GQICSGRGTCECG-------RCKCTDPPYSGEFCEK 621 (783)
T ss_pred --------CccccCCC---CceeCCCceeeCC-------ceEcCCCCcCcchhhc
Confidence 12233332 2346677777664 3667654 88988886
No 8
>KOG1219|consensus
Probab=98.46 E-value=2.1e-07 Score=107.78 Aligned_cols=75 Identities=25% Similarity=0.615 Sum_probs=65.4
Q ss_pred CCC-CCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCC
Q psy10559 39 EGC-PNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKP 117 (418)
Q Consensus 39 ~~C-~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~ 117 (418)
+.| .++|+++|.|.-...++|.|.|++-|.|..|+..+ +-|..++|.+|++|...++
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~---------------------epC~snPC~~GgtCip~~n- 3922 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDL---------------------EPCASNPCLTGGTCIPFYN- 3922 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCccccccc---------------------ccccCCCCCCCCEEEecCC-
Confidence 568 88999999995444688999999999999999766 3467889999999999888
Q ss_pred CCceeecCCCCcccccccee
Q psy10559 118 IDILLRKQPPAITASFFERM 137 (418)
Q Consensus 118 ~~~~~C~C~~G~~g~~Ce~~ 137 (418)
.|.|.|+.||+|.+||.-
T Consensus 3923 --~f~CnC~~gyTG~~Ce~~ 3940 (4289)
T KOG1219|consen 3923 --GFLCNCPNGYTGKRCEAR 3940 (4289)
T ss_pred --CeeEeCCCCccCceeecc
Confidence 789999999999999863
No 9
>KOG1226|consensus
Probab=98.38 E-value=5e-07 Score=97.33 Aligned_cols=95 Identities=28% Similarity=0.626 Sum_probs=71.8
Q ss_pred CCCCCcEEecceEeeCCCCcCCCCCc-----------CCC----C-CCCCCCcEEecCCCCCceEEcCCCCC----CCCC
Q psy10559 13 WCNEHGQCKNGTCLCVTGWNGKHCTL-----------EGC----P-NSCSNHGQCRVNSDSQWECKCSDGWD----GKDC 72 (418)
Q Consensus 13 pC~n~G~C~~g~C~C~~G~~G~~Ce~-----------~~C----~-~~C~~~G~C~~~~~g~~~C~C~~G~~----G~~C 72 (418)
.|+.+|++.-|.|.|.+||.|..|+- +.| . -.|+++|.|..+ +|.|.+... |..|
T Consensus 468 ~C~g~G~~~CG~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG-----qC~C~~~~~~~i~G~fC 542 (783)
T KOG1226|consen 468 LCHGNGTFVCGQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG-----QCVCHKPDNGKIYGKFC 542 (783)
T ss_pred ccCCCCcEEecceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC-----ceEecCCCCCceeeeee
Confidence 69999999999999999999999982 346 1 269999999544 899998877 8888
Q ss_pred CccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccccee
Q psy10559 73 SVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFERM 137 (418)
Q Consensus 73 ~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~ 137 (418)
+-+. +.|...+ .-.|.+++.|... +|.|.+||+|+.|+-.
T Consensus 543 ECDn---------------fsC~r~~---g~lC~g~G~C~CG-------~CvC~~GwtG~~C~C~ 582 (783)
T KOG1226|consen 543 ECDN---------------FSCERHK---GVLCGGHGRCECG-------RCVCNPGWTGSACNCP 582 (783)
T ss_pred eccC---------------ccccccc---CcccCCCCeEeCC-------cEEcCCCCccCCCCCC
Confidence 7322 2222211 2348888888763 5899999999999863
No 10
>KOG1214|consensus
Probab=98.20 E-value=2.9e-06 Score=91.75 Aligned_cols=106 Identities=24% Similarity=0.588 Sum_probs=78.4
Q ss_pred cccCCC---cCCCCCcEEec----ceEeeCCCCc--C--CCCC-------cCCC---CCCCCCCcEE--ecCCCCCceEE
Q psy10559 6 NIDVDL---LWCNEHGQCKN----GTCLCVTGWN--G--KHCT-------LEGC---PNSCSNHGQC--RVNSDSQWECK 62 (418)
Q Consensus 6 ~id~C~---~pC~n~G~C~~----g~C~C~~G~~--G--~~Ce-------~~~C---~~~C~~~G~C--~~~~~g~~~C~ 62 (418)
++++|. ..|..+..|++ ++|+|..||. + ..|- .+.| ...|.-.|+| .....+.|.|.
T Consensus 733 d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~ 812 (1289)
T KOG1214|consen 733 DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCA 812 (1289)
T ss_pred ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEe
Confidence 455664 47999999986 6888888774 4 4564 2356 2457655544 33346779999
Q ss_pred cCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccc
Q psy10559 63 CSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFF 134 (418)
Q Consensus 63 C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~C 134 (418)
|.+||.|+. ..|.+.|.|..+.|...+.|.+.++ ++.|+|.+||.|.-.
T Consensus 813 CLPGfsGDG--------------------~~c~dvDeC~psrChp~A~Cyntpg---sfsC~C~pGy~GDGf 861 (1289)
T KOG1214|consen 813 CLPGFSGDG--------------------HQCTDVDECSPSRCHPAATCYNTPG---SFSCRCQPGYYGDGF 861 (1289)
T ss_pred ecCCccCCc--------------------cccccccccCccccCCCceEecCCC---cceeecccCccCCCc
Confidence 999999832 4456678888889999999999997 899999999988543
No 11
>KOG1217|consensus
Probab=97.65 E-value=0.00014 Score=74.69 Aligned_cols=116 Identities=28% Similarity=0.700 Sum_probs=83.5
Q ss_pred CCCcEEe-------cceEeeCCCCcCCCCCc--CCC---CCCCCCCcEEecCCCCCceEEcCCCCCCCCCCcc-------
Q psy10559 15 NEHGQCK-------NGTCLCVTGWNGKHCTL--EGC---PNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVL------- 75 (418)
Q Consensus 15 ~n~G~C~-------~g~C~C~~G~~G~~Ce~--~~C---~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~------- 75 (418)
..++.|. .+.|.|..||.|..|.. +.| ...|.+++.| .+..+.|.|.|.+||.|..|+..
T Consensus 137 ~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C-~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~ 215 (487)
T KOG1217|consen 137 CIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTC-VNTGGSYLCSCPPGYTGSTCETTGNGGTCV 215 (487)
T ss_pred eCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCccc-ccCCCCeeEeCCCCccCCcCcCCCCCceEe
Confidence 4556665 25799999999999984 478 3459999999 67677799999999999988764
Q ss_pred ------------cc------ccCCC--CccCCCCCccccC--------------CCCCCCCC-CCCCCCccccCCCCCCc
Q psy10559 76 ------------LE------QNCND--GKDNDKDGLVDCE--------------DPECCSNH-ICRSSQLCVSAPKPIDI 120 (418)
Q Consensus 76 ------------~~------~~C~~--g~c~~~~G~~~C~--------------dpdCc~~~-~C~~~~~C~~~~~~~~~ 120 (418)
.. ..|.. +.|.+..+.+.|. +.+-|... .|.+++.|.+..+ .
T Consensus 216 ~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~---~ 292 (487)
T KOG1217|consen 216 DSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPG---S 292 (487)
T ss_pred cceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCC---c
Confidence 10 01211 4555555554443 33434443 3888999999877 4
Q ss_pred eeecCCCCcccccc
Q psy10559 121 LLRKQPPAITASFF 134 (418)
Q Consensus 121 ~~C~C~~G~~g~~C 134 (418)
|.|.|++||+|..|
T Consensus 293 ~~C~C~~g~~g~~~ 306 (487)
T KOG1217|consen 293 YRCTCPPGFTGRLC 306 (487)
T ss_pred ceeeCCCCCCCCCC
Confidence 88999999999887
No 12
>KOG4260|consensus
Probab=97.64 E-value=5.4e-05 Score=73.14 Aligned_cols=111 Identities=25% Similarity=0.646 Sum_probs=66.3
Q ss_pred cCCCCCcEEec-------ceEeeCCCCcCCCCCc-----------------CCCCCCCCCCcEEecCCCCCceEE-cCCC
Q psy10559 12 LWCNEHGQCKN-------GTCLCVTGWNGKHCTL-----------------EGCPNSCSNHGQCRVNSDSQWECK-CSDG 66 (418)
Q Consensus 12 ~pC~n~G~C~~-------g~C~C~~G~~G~~Ce~-----------------~~C~~~C~~~G~C~~~~~g~~~C~-C~~G 66 (418)
.+|.++|.|.. |.|.|.+||+|+.|.. ..|...|. |.| .. .+.-.|. |..|
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~--~~C-sg-~~~k~C~kCkkG 225 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL--GVC-SG-ESSKGCSKCKKG 225 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--ccc-CC-CCCCChhhhccc
Confidence 48999999962 7999999999999962 12334454 366 22 3344575 9999
Q ss_pred CCCC--CCCccc-----cccCC-CCccCCCCCccccCC-------CCCCCC--CCCC-CCCccccCCCCCCceeecCCCC
Q psy10559 67 WDGK--DCSVLL-----EQNCN-DGKDNDKDGLVDCED-------PECCSN--HICR-SSQLCVSAPKPIDILLRKQPPA 128 (418)
Q Consensus 67 ~~G~--~C~~~~-----~~~C~-~g~c~~~~G~~~C~d-------pdCc~~--~~C~-~~~~C~~~~~~~~~~~C~C~~G 128 (418)
|.-. .|-... +.+|. +..|.|..|++.|.+ +|.|.. ..|. .+..|.+..+ .|.|.|..|
T Consensus 226 W~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~~~d~~~~kn~~c~ni~~---~~r~v~f~~ 302 (350)
T KOG4260|consen 226 WKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQFCADVCASKNRPCMNIDG---QYRCVCFSG 302 (350)
T ss_pred ceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhhhhhhcccCCCCcccCCc---cEEEEeccc
Confidence 9633 232111 13343 456777777777776 444432 2232 3455666555 566666655
Q ss_pred c
Q psy10559 129 I 129 (418)
Q Consensus 129 ~ 129 (418)
+
T Consensus 303 ~ 303 (350)
T KOG4260|consen 303 L 303 (350)
T ss_pred c
Confidence 4
No 13
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=97.35 E-value=0.0002 Score=47.64 Aligned_cols=24 Identities=42% Similarity=1.239 Sum_probs=16.0
Q ss_pred CCCCCcEEe--cceEeeCCCCcCCCC
Q psy10559 13 WCNEHGQCK--NGTCLCVTGWNGKHC 36 (418)
Q Consensus 13 pC~n~G~C~--~g~C~C~~G~~G~~C 36 (418)
.|++||+|+ .++|+|.+||+|+.|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 466677776 467777777776665
No 14
>KOG4260|consensus
Probab=97.35 E-value=0.00024 Score=68.82 Aligned_cols=100 Identities=23% Similarity=0.516 Sum_probs=71.3
Q ss_pred eCCCCcCCCCCcCCC----CCCCCCCcEEecCC--CCCceEEcCCCCCCCCCCcccc--------------cc----CC-
Q psy10559 27 CVTGWNGKHCTLEGC----PNSCSNHGQCRVNS--DSQWECKCSDGWDGKDCSVLLE--------------QN----CN- 81 (418)
Q Consensus 27 C~~G~~G~~Ce~~~C----~~~C~~~G~C~~~~--~g~~~C~C~~G~~G~~C~~~~~--------------~~----C~- 81 (418)
|+.|-.|+.|.. | ..+|.++|.|.-+. .|++.|.|..||+|+.|..-.. .. |.
T Consensus 132 Cp~gtyGpdCl~--Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~ 209 (350)
T KOG4260|consen 132 CPDGTYGPDCLQ--CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLG 209 (350)
T ss_pred cCCCCcCCcccc--CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhc
Confidence 899999999963 5 45799999994332 5778999999999999864431 11 21
Q ss_pred ---C------CccCCCCC--ccccCCCCCCC--CCCCCCCCccccCCCCCCceeecCCCCccc
Q psy10559 82 ---D------GKDNDKDG--LVDCEDPECCS--NHICRSSQLCVSAPKPIDILLRKQPPAITA 131 (418)
Q Consensus 82 ---~------g~c~~~~G--~~~C~dpdCc~--~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g 131 (418)
. ..|..++- .-.|.|.++|. .++|..+..|+|..+ +|.|.+.+||.+
T Consensus 210 ~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteG---Sf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 210 VCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEG---SFKCEDKEGYKK 269 (350)
T ss_pred ccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCC---ceEecccccccC
Confidence 1 12222222 23467777764 566999999999988 899999999866
No 15
>KOG1217|consensus
Probab=97.34 E-value=0.00074 Score=69.38 Aligned_cols=103 Identities=29% Similarity=0.676 Sum_probs=71.0
Q ss_pred cEEec----ceEeeCCCCcCCCC----CcCCC-CC-CCCCCcEEecCCCCCceEEcCCCCCCCCCCccccccCCCCccCC
Q psy10559 18 GQCKN----GTCLCVTGWNGKHC----TLEGC-PN-SCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCNDGKDND 87 (418)
Q Consensus 18 G~C~~----g~C~C~~G~~G~~C----e~~~C-~~-~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~ 87 (418)
++|.+ ++|.|.+||.+..+ .++.| .. .|.++++| .+..+.|.|.|++||.|..| .. |.+
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C-~~~~~~~~C~C~~g~~g~~~-~~----~~~----- 311 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTC-VNVPGSYRCTCPPGFTGRLC-TE----CVD----- 311 (487)
T ss_pred CcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCee-ecCCCcceeeCCCCCCCCCC-cc----ccc-----
Confidence 78875 59999999999985 46788 43 39999999 66667799999999999998 11 100
Q ss_pred CCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccccee
Q psy10559 88 KDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFERM 137 (418)
Q Consensus 88 ~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~ 137 (418)
...|... .....|.+++.|.. .+....+.|.|.++|.|..|+..
T Consensus 312 ---~~~C~~~--~~~~~c~~g~~C~~-~~~~~~~~C~c~~~~~g~~C~~~ 355 (487)
T KOG1217|consen 312 ---VDECSPR--NAGGPCANGGTCNT-LGSFGGFRCACGPGFTGRRCEDS 355 (487)
T ss_pred ---ccccccc--ccCCcCCCCccccc-CCCCCCCCcCCCCCCCCCccccC
Confidence 0011000 02344777778822 22234677999999999988864
No 16
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.31 E-value=0.0001 Score=48.92 Aligned_cols=28 Identities=36% Similarity=1.036 Sum_probs=20.6
Q ss_pred CCCCCCCcEEecCCC-CCceEEcCCCCCCC
Q psy10559 42 PNSCSNHGQCRVNSD-SQWECKCSDGWDGK 70 (418)
Q Consensus 42 ~~~C~~~G~C~~~~~-g~~~C~C~~G~~G~ 70 (418)
+++|.++|+| ++.. +.|.|.|++||+|+
T Consensus 3 ~~~C~n~g~C-~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 3 SNPCQNGGTC-IDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTSSTTTEEE-EEESTSEEEEEEBTTEEST
T ss_pred CCcCCCCeEE-EeCCCCCEEeECCCCCccC
Confidence 5678888888 4444 77888888888875
No 17
>KOG1214|consensus
Probab=97.19 E-value=0.0015 Score=71.47 Aligned_cols=47 Identities=30% Similarity=1.029 Sum_probs=41.7
Q ss_pred ceEeeCCCCcCC--CC-CcCCC-CCCCCCCcEEecCCCCCceEEcCCCCCCC
Q psy10559 23 GTCLCVTGWNGK--HC-TLEGC-PNSCSNHGQCRVNSDSQWECKCSDGWDGK 70 (418)
Q Consensus 23 g~C~C~~G~~G~--~C-e~~~C-~~~C~~~G~C~~~~~g~~~C~C~~G~~G~ 70 (418)
++|.|.+||.|+ .| ..++| ++-|...++| .+..|++.|.|.+||.|+
T Consensus 809 y~C~CLPGfsGDG~~c~dvDeC~psrChp~A~C-yntpgsfsC~C~pGy~GD 859 (1289)
T KOG1214|consen 809 YSCACLPGFSGDGHQCTDVDECSPSRCHPAATC-YNTPGSFSCRCQPGYYGD 859 (1289)
T ss_pred EEEeecCCccCCccccccccccCccccCCCceE-ecCCCcceeecccCccCC
Confidence 699999999975 44 47899 9999999999 788999999999999875
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.97 E-value=0.0011 Score=44.77 Aligned_cols=34 Identities=38% Similarity=1.164 Sum_probs=25.9
Q ss_pred CCC-C-CCCCCCcEEecCCCCCceEEcCCCCC-CCCCC
Q psy10559 39 EGC-P-NSCSNHGQCRVNSDSQWECKCSDGWD-GKDCS 73 (418)
Q Consensus 39 ~~C-~-~~C~~~G~C~~~~~g~~~C~C~~G~~-G~~C~ 73 (418)
++| . .+|.++|.| .+..++|.|.|+.||. |..|+
T Consensus 3 ~~C~~~~~C~~~~~C-~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 3 DECASGNPCQNGGTC-VNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred ccCcCCCCcCCCCEe-ECCCCCeEeECCCCCccCCcCC
Confidence 456 3 578888888 6777888888888888 77764
No 19
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.54 E-value=0.0035 Score=41.63 Aligned_cols=34 Identities=38% Similarity=1.154 Sum_probs=25.2
Q ss_pred CCC-C-CCCCCCcEEecCCCCCceEEcCCCCCCCCCC
Q psy10559 39 EGC-P-NSCSNHGQCRVNSDSQWECKCSDGWDGKDCS 73 (418)
Q Consensus 39 ~~C-~-~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~ 73 (418)
++| . .+|.+++.| .+..+.|.|.|..||.|..|+
T Consensus 3 ~~C~~~~~C~~~~~C-~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 3 DECASGNPCQNGGTC-VNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred ccCCCCCCcCCCCEe-ECCCCCeEeECCCCCcCCcCC
Confidence 456 3 578778888 566777888888888887663
No 20
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.33 E-value=0.0046 Score=41.65 Aligned_cols=32 Identities=34% Similarity=0.790 Sum_probs=27.0
Q ss_pred cccCCC--cCCCCCcEEec----ceEeeCCCCc-CCCCC
Q psy10559 6 NIDVDL--LWCNEHGQCKN----GTCLCVTGWN-GKHCT 37 (418)
Q Consensus 6 ~id~C~--~pC~n~G~C~~----g~C~C~~G~~-G~~Ce 37 (418)
++++|. .+|.++|+|.+ +.|.|.+||. |..|+
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 367886 58999999986 5899999999 98885
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=96.32 E-value=0.0017 Score=43.05 Aligned_cols=30 Identities=23% Similarity=0.442 Sum_probs=25.7
Q ss_pred CCCCCCCCCccccCCCCCCceeecCCCCcccc
Q psy10559 101 SNHICRSSQLCVSAPKPIDILLRKQPPAITAS 132 (418)
Q Consensus 101 ~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~ 132 (418)
...+|.++++|+... .+.|.|.|++||+|.
T Consensus 2 ~~~~C~n~g~C~~~~--~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLP--GGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTSSTTTEEEEEES--TSEEEEEEBTTEEST
T ss_pred CCCcCCCCeEEEeCC--CCCEEeECCCCCccC
Confidence 456899999999988 348999999999984
No 22
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.22 E-value=0.0055 Score=40.75 Aligned_cols=27 Identities=48% Similarity=1.160 Sum_probs=22.1
Q ss_pred CCCCCCcEEecCCCCCceEEcCCCCCCCCC
Q psy10559 43 NSCSNHGQCRVNSDSQWECKCSDGWDGKDC 72 (418)
Q Consensus 43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C 72 (418)
..|++||+| ... ...|.|.+||+|+.|
T Consensus 6 ~~C~~~G~C-~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTC-VSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEE-eCC--CCEEECCCCCcCCCC
Confidence 359999999 432 459999999999887
No 23
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.94 E-value=0.0045 Score=43.44 Aligned_cols=30 Identities=33% Similarity=1.226 Sum_probs=25.9
Q ss_pred cCCC---CCCCCCCcEEecCCCCCceEEcCCCCC
Q psy10559 38 LEGC---PNSCSNHGQCRVNSDSQWECKCSDGWD 68 (418)
Q Consensus 38 ~~~C---~~~C~~~G~C~~~~~g~~~C~C~~G~~ 68 (418)
+++| +..|..++.| ++..|+|.|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C-~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTC-VNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEE-EEETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEE-EcCCCCEEeeCCCCcE
Confidence 4677 4579889999 8889999999999997
No 24
>KOG1836|consensus
Probab=95.90 E-value=0.014 Score=69.89 Aligned_cols=110 Identities=27% Similarity=0.578 Sum_probs=75.5
Q ss_pred CCCCcEEec--ceEeeCCCCcCCCCCc-------------C--CC-CCCCCCC-cEEecCCCCCceEEcCCCCCCCCCCc
Q psy10559 14 CNEHGQCKN--GTCLCVTGWNGKHCTL-------------E--GC-PNSCSNH-GQCRVNSDSQWECKCSDGWDGKDCSV 74 (418)
Q Consensus 14 C~n~G~C~~--g~C~C~~G~~G~~Ce~-------------~--~C-~~~C~~~-G~C~~~~~g~~~C~C~~G~~G~~C~~ 74 (418)
+...+.+.+ ..|.|+.||+|..|+. + .| +..|++| ..|+ ..++.|.|...-.|..|+
T Consensus 684 ~~~~~~~~~wve~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd---~~tG~C~C~~~t~G~~C~- 759 (1705)
T KOG1836|consen 684 GDGAGATDTWVEQCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICD---PRTGQCKCKHNTFGGQCA- 759 (1705)
T ss_pred CCCCCCccchhhhccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCcccccc---CCCCceecccCCCCCchh-
Confidence 444455533 4699999999999982 1 23 5567776 6784 345689999999999997
Q ss_pred cccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeec-CCCCccccccce
Q psy10559 75 LLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRK-QPPAITASFFER 136 (418)
Q Consensus 75 ~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~-C~~G~~g~~Ce~ 136 (418)
.|.+|.-++-+.... .| |...+|.+++.|..... .....|+ |++||+|.+|+.
T Consensus 760 ----~C~~GfYg~~~~~~~---~d-C~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 760 ----QCVDGFYGLPDLGTS---GD-CQPCPCPNGGACGQTPE-ILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred ----hhcCCCCCccccCCC---CC-CccCCCCCChhhcCcCc-ccceecCCCCCCCccccccc
Confidence 466665543322111 12 45567778888887765 4456788 999999999985
No 25
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.84 E-value=0.013 Score=38.75 Aligned_cols=28 Identities=39% Similarity=1.132 Sum_probs=22.8
Q ss_pred CCCCCCcEEecCCCCCceEEcCCCCCC-CCC
Q psy10559 43 NSCSNHGQCRVNSDSQWECKCSDGWDG-KDC 72 (418)
Q Consensus 43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G-~~C 72 (418)
.+|.++ .| .+..+.|.|.|+.||.| ..|
T Consensus 6 ~~C~~~-~C-~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TC-INTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EE-ECCCCCeEeECCCCCccCCcc
Confidence 478877 89 56688899999999999 665
No 26
>smart00051 DSL delta serrate ligand.
Probab=95.80 E-value=0.011 Score=45.50 Aligned_cols=44 Identities=25% Similarity=0.634 Sum_probs=34.8
Q ss_pred eEeeCCCCcCCCCCcCCC--CCCCCCCcEEecCCCCCceEEcCCCCCCCCC
Q psy10559 24 TCLCVTGWNGKHCTLEGC--PNSCSNHGQCRVNSDSQWECKCSDGWDGKDC 72 (418)
Q Consensus 24 ~C~C~~G~~G~~Ce~~~C--~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C 72 (418)
.-.|.++|.|..|+. .| .+.+.+|.+|+ . .+.|.|.+||+|+.|
T Consensus 18 rv~C~~~~yG~~C~~-~C~~~~d~~~~~~Cd-~---~G~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCNK-FCRPRDDFFGHYTCD-E---NGNKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccCC-EeCcCccccCCccCC-c---CCCEecCCCCcCCCC
Confidence 456999999999973 45 45678899993 2 357999999999887
No 27
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.79 E-value=0.0051 Score=32.69 Aligned_cols=13 Identities=54% Similarity=1.592 Sum_probs=7.6
Q ss_pred eEeeCCCCcCCCC
Q psy10559 24 TCLCVTGWNGKHC 36 (418)
Q Consensus 24 ~C~C~~G~~G~~C 36 (418)
.|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 3666666666665
No 28
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=95.76 E-value=0.012 Score=38.94 Aligned_cols=31 Identities=35% Similarity=0.800 Sum_probs=26.2
Q ss_pred ccCCC--cCCCCCcEEec----ceEeeCCCCcCCCCC
Q psy10559 7 IDVDL--LWCNEHGQCKN----GTCLCVTGWNGKHCT 37 (418)
Q Consensus 7 id~C~--~pC~n~G~C~~----g~C~C~~G~~G~~Ce 37 (418)
+++|. .+|.+++.|.+ +.|.|.+||.|..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C~ 38 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcCC
Confidence 57786 48999999986 589999999998874
No 29
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.64 E-value=0.016 Score=37.64 Aligned_cols=29 Identities=45% Similarity=1.174 Sum_probs=22.6
Q ss_pred CCCCCCcEEecCCCCCceEEcCCCCCCC-CC
Q psy10559 43 NSCSNHGQCRVNSDSQWECKCSDGWDGK-DC 72 (418)
Q Consensus 43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G~-~C 72 (418)
.+|.+++.| .+..+.|.|.|+.||.|. .|
T Consensus 6 ~~C~~~~~C-~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 6 NPCSNGGTC-VNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCEE-ecCCCCeEeECCCCCcccCCc
Confidence 567778888 566678889999998887 54
No 30
>KOG0994|consensus
Probab=95.24 E-value=0.043 Score=62.28 Aligned_cols=88 Identities=27% Similarity=0.747 Sum_probs=50.8
Q ss_pred CCcccccCCCc-CCCCCc-EEec--ceEe-eCCCCcCCCCCc--------------CCC-CCCCCCCc--------EEec
Q psy10559 2 RGFSNIDVDLL-WCNEHG-QCKN--GTCL-CVTGWNGKHCTL--------------EGC-PNSCSNHG--------QCRV 53 (418)
Q Consensus 2 ~Gf~~id~C~~-pC~n~G-~C~~--g~C~-C~~G~~G~~Ce~--------------~~C-~~~C~~~G--------~C~~ 53 (418)
-||++.-+|.. .|+.|. +|.. |.|. |..--+|.+|+. ..| |.+|..+- .|..
T Consensus 848 pG~WgFPeCr~CqCNgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~ 927 (1758)
T KOG0994|consen 848 PGYWGFPECRPCQCNGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYL 927 (1758)
T ss_pred CCccCCCcCccccccCcccccCccccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccc
Confidence 47888777754 676663 3432 4443 444444555541 256 66675431 4644
Q ss_pred CC-CCCceEEcCCCCCCCCCCccccccCCCCccCCCCCcccc
Q psy10559 54 NS-DSQWECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDC 94 (418)
Q Consensus 54 ~~-~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C 94 (418)
+. .....|.|.+||+|..|+. |.++.-.+-..+..|
T Consensus 928 d~~t~~ivC~C~~GY~G~RCe~-----CA~~~fGnP~~GGtC 964 (1758)
T KOG0994|consen 928 DTRTQQIVCHCQEGYSGSRCEI-----CADNHFGNPSEGGTC 964 (1758)
T ss_pred cccccceeeecccCccccchhh-----hcccccCCcccCCcc
Confidence 43 3455799999999999984 555544443333344
No 31
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.56 E-value=0.016 Score=30.82 Aligned_cols=13 Identities=46% Similarity=1.653 Sum_probs=10.8
Q ss_pred eEEcCCCCCCCCC
Q psy10559 60 ECKCSDGWDGKDC 72 (418)
Q Consensus 60 ~C~C~~G~~G~~C 72 (418)
.|.|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4999999999987
No 32
>KOG0994|consensus
Probab=94.34 E-value=0.052 Score=61.65 Aligned_cols=49 Identities=37% Similarity=0.998 Sum_probs=34.2
Q ss_pred ceEeeCCCCcCCCCCc-----------CCC-CCCCC--CCcEEecCCCCCceEEcCCCCCCCCCCc
Q psy10559 23 GTCLCVTGWNGKHCTL-----------EGC-PNSCS--NHGQCRVNSDSQWECKCSDGWDGKDCSV 74 (418)
Q Consensus 23 g~C~C~~G~~G~~Ce~-----------~~C-~~~C~--~~G~C~~~~~g~~~C~C~~G~~G~~C~~ 74 (418)
|+|-|.+.-.|..|+. .+| +..|. +.-+|. . =.++|.|.+||-|..|+.
T Consensus 1037 GQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN-~--ftGQCqCkpGfGGR~C~q 1099 (1758)
T KOG0994|consen 1037 GQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCN-E--FTGQCQCKPGFGGRTCSQ 1099 (1758)
T ss_pred CcCCCCcccccccccccccchhccccCCCCCccCCCccCCcccc-c--cccceeccCCCCCcchhH
Confidence 8999999988988862 245 44454 233662 2 234899999999999973
No 33
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=93.59 E-value=0.041 Score=38.50 Aligned_cols=25 Identities=24% Similarity=0.453 Sum_probs=22.7
Q ss_pred CCCCCCCccccCCCCCCceeecCCCCcc
Q psy10559 103 HICRSSQLCVSAPKPIDILLRKQPPAIT 130 (418)
Q Consensus 103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~ 130 (418)
..|..++.|++..+ +|.|.|++||.
T Consensus 10 ~~C~~~~~C~N~~G---sy~C~C~~Gy~ 34 (42)
T PF07645_consen 10 HNCPENGTCVNTEG---SYSCSCPPGYE 34 (42)
T ss_dssp SSSSTTSEEEEETT---EEEEEESTTEE
T ss_pred CcCCCCCEEEcCCC---CEEeeCCCCcE
Confidence 45988999999999 99999999996
No 34
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=93.35 E-value=0.56 Score=37.40 Aligned_cols=79 Identities=23% Similarity=0.309 Sum_probs=58.7
Q ss_pred eeceEEeec-CCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEeccEEEee
Q psy10559 163 VRGRVVTSM-GMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVIIDTITMQM 241 (418)
Q Consensus 163 i~G~v~~~~-Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~id~v~m~~ 241 (418)
++|.|++.. +.|+.++.+...... -++.|..+|.|.+-+.-|. .++.|.-..|......+-...++...+ ++.|.+
T Consensus 2 i~G~V~d~~t~~pl~~a~V~~~~~~-~~~~Td~~G~F~i~~~~g~-~~l~is~~Gy~~~~~~i~~~~~~~~~~-~i~L~~ 78 (88)
T PF13715_consen 2 ISGKVVDSDTGEPLPGATVYLKNTK-KGTVTDENGRFSIKLPEGD-YTLKISYIGYETKTITISVNSNKNTNL-NIYLEP 78 (88)
T ss_pred EEEEEEECCCCCCccCeEEEEeCCc-ceEEECCCeEEEEEEcCCC-eEEEEEEeCEEEEEEEEEecCCCEEEE-EEEEee
Confidence 579999888 999999999887543 5689999999998765443 667777778888777777766554444 677765
Q ss_pred cCC
Q psy10559 242 GDD 244 (418)
Q Consensus 242 ~~~ 244 (418)
...
T Consensus 79 ~~~ 81 (88)
T PF13715_consen 79 KSN 81 (88)
T ss_pred Ccc
Confidence 443
No 35
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=93.33 E-value=0.088 Score=34.03 Aligned_cols=30 Identities=23% Similarity=0.416 Sum_probs=24.5
Q ss_pred CCCCCCCCccccCCCCCCceeecCCCCcccc-cc
Q psy10559 102 NHICRSSQLCVSAPKPIDILLRKQPPAITAS-FF 134 (418)
Q Consensus 102 ~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~-~C 134 (418)
..+|.+++.|.+..+ .+.|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~---~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPG---SYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCC---CeEeECCCCCcccCCc
Confidence 456888899998875 7899999999986 44
No 36
>smart00181 EGF Epidermal growth factor-like domain.
Probab=91.91 E-value=0.17 Score=33.26 Aligned_cols=28 Identities=21% Similarity=0.434 Sum_probs=22.7
Q ss_pred CCCCCCCccccCCCCCCceeecCCCCccc-ccc
Q psy10559 103 HICRSSQLCVSAPKPIDILLRKQPPAITA-SFF 134 (418)
Q Consensus 103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~g-~~C 134 (418)
.+|.++ .|.+..+ .|.|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~---~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPG---SYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCC---CeEeECCCCCccCCcc
Confidence 468877 8988755 899999999998 554
No 37
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=91.87 E-value=0.082 Score=36.05 Aligned_cols=27 Identities=26% Similarity=0.955 Sum_probs=20.1
Q ss_pred CCCCCCcEEecCCCCCceEEcCCCCCCC
Q psy10559 43 NSCSNHGQCRVNSDSQWECKCSDGWDGK 70 (418)
Q Consensus 43 ~~C~~~G~C~~~~~g~~~C~C~~G~~G~ 70 (418)
..|..++.| .+..++|.|.|.+||.|+
T Consensus 6 ~~C~~nA~C-~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATC-TNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEE-EE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEe-ecCCCCEEeECCCCCccC
Confidence 358889999 677789999999999875
No 38
>smart00051 DSL delta serrate ligand.
Probab=90.71 E-value=0.34 Score=37.24 Aligned_cols=47 Identities=11% Similarity=0.301 Sum_probs=30.7
Q ss_pred ceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcccccc
Q psy10559 59 WECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFF 134 (418)
Q Consensus 59 ~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~C 134 (418)
+.-.|.++|.|..|+.. |. ....+..+..|... + .|.|.+||+|+.|
T Consensus 17 ~rv~C~~~~yG~~C~~~------------------C~-----~~~d~~~~~~Cd~~-G-----~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNKF------------------CR-----PRDDFFGHYTCDEN-G-----NKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCCE------------------eC-----cCccccCCccCCcC-C-----CEecCCCCcCCCC
Confidence 35579999999999732 21 12234556666442 3 3789999999765
No 39
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=90.48 E-value=1.5 Score=34.27 Aligned_cols=75 Identities=24% Similarity=0.362 Sum_probs=48.0
Q ss_pred eeceEEeecCCceeceEEEeeCCCC---ceEEEecCCceeee-ecCCceEEEEEecCCCCCccc-eeeeeccceEEeccE
Q psy10559 163 VRGRVVTSMGMGLVGVRVSTSTPLE---GFTLTRDDGWFDLL-VNGGGAVTLQFGRSPFKPHNH-IVHVPWNEVVIIDTI 237 (418)
Q Consensus 163 i~G~v~~~~Gtp~~Gv~vs~~~~~~---G~t~t~~dg~fdl~-vNGG~~~~l~f~r~Pf~~~~~-~V~vpwn~~~~id~v 237 (418)
++|+|.+..|.|+.|+.|....... ..+.+..+|.|.+- +-- +.-.+.+....|.+... .|-+..++...+ ++
T Consensus 2 I~G~V~d~~g~pv~~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~-g~Y~l~v~~~g~~~~~~~~v~v~~~~~~~~-~i 79 (82)
T PF13620_consen 2 ISGTVTDATGQPVPGATVTLTDQDGGTVYTTTTDSDGRFSFEGLPP-GTYTLRVSAPGYQPQTQENVTVTAGQTTTV-DI 79 (82)
T ss_dssp EEEEEEETTSCBHTT-EEEET--TTTECCEEE--TTSEEEEEEE-S-EEEEEEEEBTTEE-EEEEEEEESSSSEEE---E
T ss_pred EEEEEEcCCCCCcCCEEEEEEEeeCCCEEEEEECCCceEEEEccCC-EeEEEEEEECCcceEEEEEEEEeCCCEEEE-EE
Confidence 6799999999999999999884322 66788889999774 333 45666677777777664 466666666554 44
Q ss_pred EE
Q psy10559 238 TM 239 (418)
Q Consensus 238 ~m 239 (418)
.|
T Consensus 80 ~L 81 (82)
T PF13620_consen 80 TL 81 (82)
T ss_dssp EE
T ss_pred EE
Confidence 44
No 40
>KOG1836|consensus
Probab=90.28 E-value=0.94 Score=54.84 Aligned_cols=71 Identities=34% Similarity=0.806 Sum_probs=49.7
Q ss_pred CCCCC-cEEec--ceEeeCCCCcCCCCCc--------------CCC-CCCCCCCcEEecCC-CCCceEE-cCCCCCCCCC
Q psy10559 13 WCNEH-GQCKN--GTCLCVTGWNGKHCTL--------------EGC-PNSCSNHGQCRVNS-DSQWECK-CSDGWDGKDC 72 (418)
Q Consensus 13 pC~n~-G~C~~--g~C~C~~G~~G~~Ce~--------------~~C-~~~C~~~G~C~~~~-~g~~~C~-C~~G~~G~~C 72 (418)
.|++| .+|.. |.|.|..--.|.+|+. ..| +.+|.+.+.|.... .....|. |++||+|..|
T Consensus 732 ~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rC 811 (1705)
T KOG1836|consen 732 DCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRC 811 (1705)
T ss_pred ccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccc
Confidence 45554 45653 7777777777777752 138 88899888884332 4667899 9999999999
Q ss_pred CccccccCCCCccCCC
Q psy10559 73 SVLLEQNCNDGKDNDK 88 (418)
Q Consensus 73 ~~~~~~~C~~g~c~~~ 88 (418)
+ .|.+|+-.+.
T Consensus 812 e-----~c~dgyfg~p 822 (1705)
T KOG1836|consen 812 E-----ECADGYFGNP 822 (1705)
T ss_pred c-----cCCCccccCC
Confidence 8 4666655543
No 41
>KOG3512|consensus
Probab=88.44 E-value=1.4 Score=46.22 Aligned_cols=102 Identities=24% Similarity=0.511 Sum_probs=59.3
Q ss_pred ceEeeCCCCcCCCCCc------------------CCC-CCCCCCCcE-EecCC--------CCCceEE-cCCCCCCCCCC
Q psy10559 23 GTCLCVTGWNGKHCTL------------------EGC-PNSCSNHGQ-CRVNS--------DSQWECK-CSDGWDGKDCS 73 (418)
Q Consensus 23 g~C~C~~G~~G~~Ce~------------------~~C-~~~C~~~G~-C~~~~--------~g~~~C~-C~~G~~G~~C~ 73 (418)
.+|.|..+-+|+.|+. ++| .+.|..|+. |..+. ..+..|. |.-.-.|..|.
T Consensus 295 ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhCh 374 (592)
T KOG3512|consen 295 LTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHCH 374 (592)
T ss_pred eEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecccCCCCcccc
Confidence 4899999999999872 366 667776653 42221 1223453 66666777774
Q ss_pred ccccccCCCCccCCCCCc----cccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCccccccce
Q psy10559 74 VLLEQNCNDGKDNDKDGL----VDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITASFFER 136 (418)
Q Consensus 74 ~~~~~~C~~g~c~~~~G~----~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~ 136 (418)
-|..|+-.|..-. -.|. .|-++..=..+-+|....+ +|.|.+|.+|..|++
T Consensus 375 -----yCreGyyRd~s~pl~hrkaCk--~CdChpVGs~gktCNq~tG-----qCpCkeGvtG~tCnr 429 (592)
T KOG3512|consen 375 -----YCREGYYRDGSKPLTHRKACK--ACDCHPVGSAGKTCNQTTG-----QCPCKEGVTGLTCNR 429 (592)
T ss_pred -----cccCccccCCCCCCchhhhhh--hcCCcccccccccccccCC-----cccCCCCCccccccc
Confidence 4555554443211 1122 2222333344556665555 688888888888775
No 42
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=86.13 E-value=0.34 Score=32.99 Aligned_cols=27 Identities=11% Similarity=0.204 Sum_probs=21.0
Q ss_pred CCCCCCCccccCCCCCCceeecCCCCcccc
Q psy10559 103 HICRSSQLCVSAPKPIDILLRKQPPAITAS 132 (418)
Q Consensus 103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~ 132 (418)
..|..++.|.+.++ .+.|.|.+||.|.
T Consensus 6 ~~C~~nA~C~~~~~---~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGG---SYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TT---SEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCC---CEEeECCCCCccC
Confidence 45889999999988 8999999999884
No 43
>PHA02887 EGF-like protein; Provisional
Probab=84.16 E-value=0.99 Score=38.83 Aligned_cols=31 Identities=26% Similarity=0.667 Sum_probs=23.2
Q ss_pred CCCCCCcEEecC-CCCCceEEcCCCCCCCCCCc
Q psy10559 43 NSCSNHGQCRVN-SDSQWECKCSDGWDGKDCSV 74 (418)
Q Consensus 43 ~~C~~~G~C~~~-~~g~~~C~C~~G~~G~~C~~ 74 (418)
+.|- ||+|..- ......|.|+.||+|..|+.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 4577 6899432 24567899999999999984
No 44
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=83.79 E-value=0.28 Score=37.68 Aligned_cols=45 Identities=29% Similarity=0.718 Sum_probs=24.2
Q ss_pred ceEeeCCCCcCCCCCcCCC-CC-CCCCCcEEecCCCCCceEEcCCCCCCCCC
Q psy10559 23 GTCLCVTGWNGKHCTLEGC-PN-SCSNHGQCRVNSDSQWECKCSDGWDGKDC 72 (418)
Q Consensus 23 g~C~C~~G~~G~~Ce~~~C-~~-~C~~~G~C~~~~~g~~~C~C~~G~~G~~C 72 (418)
..-.|.+.|-|+.|.. -| +. .-.+|-+|+ . .+.-.|.+||+|+.|
T Consensus 17 ~rv~C~~nyyG~~C~~-~C~~~~d~~ghy~Cd--~--~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSK-FCKPRDDSFGHYTCD--S--NGNKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-E-E---EEETTEEEEE---S--S--EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccC-CcCCCcCCcCCcccC--C--CCCCCCCCCCcCCCC
Confidence 4678999999999974 35 32 235677883 2 346789999999987
No 45
>PHA02887 EGF-like protein; Provisional
Probab=82.44 E-value=0.84 Score=39.27 Aligned_cols=33 Identities=12% Similarity=0.173 Sum_probs=26.6
Q ss_pred CCCCCCCccccCCCCCCceeecCCCCcccccccee
Q psy10559 103 HICRSSQLCVSAPKPIDILLRKQPPAITASFFERM 137 (418)
Q Consensus 103 ~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~ 137 (418)
+.|. +|.|....+ .+...|.|+.||+|.+|+.+
T Consensus 92 ~YCi-HG~C~yI~d-L~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 92 DFCI-NGECMNIID-LDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred CEee-CCEEEcccc-CCCceeECCCCcccCCCCcc
Confidence 4577 468988776 55678999999999999975
No 46
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=81.66 E-value=1 Score=39.43 Aligned_cols=36 Identities=17% Similarity=0.257 Sum_probs=29.0
Q ss_pred CCCCCCCCccccCCCCCCceeecCCCCccccccceeee
Q psy10559 102 NHICRSSQLCVSAPKPIDILLRKQPPAITASFFERMKF 139 (418)
Q Consensus 102 ~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g~~Ce~~~f 139 (418)
.+.|.+| .|...++ .+.+.|.|..||+|.+||....
T Consensus 50 ~~YClHG-~C~yI~d-l~~~~CrC~~GYtGeRCEh~dL 85 (139)
T PHA03099 50 DGYCLHG-DCIHARD-IDGMYCRCSHGYTGIRCQHVVL 85 (139)
T ss_pred CCEeECC-EEEeecc-CCCceeECCCCcccccccceee
Confidence 4558876 7988776 5677899999999999998643
No 47
>cd03521 Link_domain_KIAA0527_like Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, it is highly similar to the link domain. The link domain is a hyaluronan-binding (HA) domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.
Probab=81.66 E-value=0.32 Score=40.00 Aligned_cols=23 Identities=17% Similarity=0.294 Sum_probs=19.3
Q ss_pred eccccccc---cccCCCCCcccchhh
Q psy10559 267 LATWKHGF---QGACPDRSSILAESQ 289 (418)
Q Consensus 267 ~~~~~~~~---~~~~~~~~~~~~~~q 289 (418)
.+.|+++| ..+|.++||.||+..
T Consensus 9 ~G~y~l~f~eA~~AC~~~gA~lAs~~ 34 (95)
T cd03521 9 NGSQGLGLRAARQSCASLGARLASAA 34 (95)
T ss_pred CCccccCHHHHHHHHHHcCCEeccHH
Confidence 46789999 578999999999754
No 48
>KOG1218|consensus
Probab=81.52 E-value=6.8 Score=38.40 Aligned_cols=31 Identities=42% Similarity=1.074 Sum_probs=16.6
Q ss_pred ecceEeeCCCCcCCCCCcC--CC--CCCCCCCcEE
Q psy10559 21 KNGTCLCVTGWNGKHCTLE--GC--PNSCSNHGQC 51 (418)
Q Consensus 21 ~~g~C~C~~G~~G~~Ce~~--~C--~~~C~~~G~C 51 (418)
.++.|.|.+||.|.+|... .| ...|.+++.|
T Consensus 160 ~~~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C 194 (316)
T KOG1218|consen 160 KNGICTCQPGFVGVFCVESCSGCSPLTACENGAKC 194 (316)
T ss_pred CCCceeccCCcccccccccCCCcCCCcccCCCCee
Confidence 3456666666666666532 14 2344554566
No 49
>KOG3607|consensus
Probab=81.23 E-value=1.4 Score=49.29 Aligned_cols=55 Identities=29% Similarity=0.791 Sum_probs=37.1
Q ss_pred CCCCCcEEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCccc
Q psy10559 13 WCNEHGQCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSVLL 76 (418)
Q Consensus 13 pC~n~G~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~~~ 76 (418)
.|..+-.|.+.+|.=.. ..+.. .|+..|++||.| ++...|.|.+||.++.|+...
T Consensus 605 ~Cg~~~vC~~~~C~~~~-v~~~~----~~~~~C~g~GVC----nn~~~ChC~~gwapp~C~~~~ 659 (716)
T KOG3607|consen 605 SCGPGMICINHRCLSAS-VLNSS----CCPTTCNGHGVC----NNELNCHCEPGWAPPFCFIFG 659 (716)
T ss_pred ccCCCceecCCcchhhh-hhccc----ccccccCCCccc----CCCcceeeCCCCCCCcccccc
Confidence 35555556666554222 22222 356779999999 345689999999999999654
No 50
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=80.79 E-value=0.9 Score=33.85 Aligned_cols=33 Identities=39% Similarity=0.904 Sum_probs=15.8
Q ss_pred CCCCCcEEecCC---CCCceEEcCCCCCCCCCCccc
Q psy10559 44 SCSNHGQCRVNS---DSQWECKCSDGWDGKDCSVLL 76 (418)
Q Consensus 44 ~C~~~G~C~~~~---~g~~~C~C~~G~~G~~C~~~~ 76 (418)
.|++||+.-++. .|...|.|..-|.|++|+..+
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~ 53 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLI 53 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcccCC
Confidence 577777764332 344567777777887777655
No 51
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=80.32 E-value=1.3 Score=38.71 Aligned_cols=32 Identities=28% Similarity=0.756 Sum_probs=19.4
Q ss_pred CCCCCcEEec-CCCCCceEEcCCCCCCCCCCccc
Q psy10559 44 SCSNHGQCRV-NSDSQWECKCSDGWDGKDCSVLL 76 (418)
Q Consensus 44 ~C~~~G~C~~-~~~g~~~C~C~~G~~G~~C~~~~ 76 (418)
.|.+ |.|.. .....+.|.|..||+|.+|+...
T Consensus 52 YClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 52 YCLH-GDCIHARDIDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred EeEC-CEEEeeccCCCceeECCCCccccccccee
Confidence 3553 46632 12355678888888888887543
No 52
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=79.85 E-value=0.97 Score=38.19 Aligned_cols=10 Identities=30% Similarity=0.883 Sum_probs=7.3
Q ss_pred CCCCCCCCcc
Q psy10559 66 GWDGKDCSVL 75 (418)
Q Consensus 66 G~~G~~C~~~ 75 (418)
.|.|..|+..
T Consensus 53 ~W~G~aCqKk 62 (103)
T PF12955_consen 53 HWGGPACQKK 62 (103)
T ss_pred eecccccccc
Confidence 5788888754
No 53
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=78.40 E-value=1.4 Score=31.58 Aligned_cols=14 Identities=36% Similarity=1.028 Sum_probs=6.7
Q ss_pred eEEcCCCCCCCCCC
Q psy10559 60 ECKCSDGWDGKDCS 73 (418)
Q Consensus 60 ~C~C~~G~~G~~C~ 73 (418)
.|.|.++|+|+.|+
T Consensus 19 ~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 19 QCVCKPGTTGPRCD 32 (49)
T ss_dssp EESBSTTEESTTS-
T ss_pred EEeccccccCCcCc
Confidence 45555555555554
No 54
>KOG3512|consensus
Probab=76.75 E-value=6 Score=41.78 Aligned_cols=84 Identities=15% Similarity=0.302 Sum_probs=44.8
Q ss_pred CcEEecCCCCCceEEcCCCCCCCCCCccccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCC
Q psy10559 48 HGQCRVNSDSQWECKCSDGWDGKDCSVLLEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPP 127 (418)
Q Consensus 48 ~G~C~~~~~g~~~C~C~~G~~G~~C~~~~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~ 127 (418)
+-+|. ..+++|.|.+|-+|..|+. |..|+-....-..-|.-.+=-....|.++.. .+.+.+.|++
T Consensus 406 gktCN---q~tGqCpCkeGvtG~tCnr-----Ca~gyqqsrs~vapcik~p~~~~~~~~s~ve-------~qd~~s~Ck~ 470 (592)
T KOG3512|consen 406 GKTCN---QTTGQCPCKEGVTGLTCNR-----CAPGYQQSRSPVAPCIKIPTDAPTLGSSGVE-------PQDQCSKCKA 470 (592)
T ss_pred ccccc---ccCCcccCCCCCccccccc-----ccchhhcccCCCcCceecCCCCccccCCCCc-------chhccccCCC
Confidence 44672 3456999999999999974 3333221111111111000000111333222 2234568999
Q ss_pred CccccccceeeeeecCCce
Q psy10559 128 AITASFFERMKFLIEESSL 146 (418)
Q Consensus 128 G~~g~~Ce~~~fli~~~s~ 146 (418)
++.|..+++.+|-..+..+
T Consensus 471 ~~~~~r~n~kkfc~~Dyav 489 (592)
T KOG3512|consen 471 SPGGKRLNQKKFCKKDYAV 489 (592)
T ss_pred CCcceeccccccCccccce
Confidence 9999888887777665544
No 55
>KOG3516|consensus
Probab=76.75 E-value=1.8 Score=50.07 Aligned_cols=42 Identities=36% Similarity=0.997 Sum_probs=34.5
Q ss_pred cCCC-CCCCCCCcEEecCCCCCceEEcC-CCCCCCCCCccc-cccC
Q psy10559 38 LEGC-PNSCSNHGQCRVNSDSQWECKCS-DGWDGKDCSVLL-EQNC 80 (418)
Q Consensus 38 ~~~C-~~~C~~~G~C~~~~~g~~~C~C~-~G~~G~~C~~~~-~~~C 80 (418)
.+.| ||+|..+|.|.. +-..+.|.|. .||.|..|...+ +..|
T Consensus 545 ~drClPN~CehgG~C~Q-s~~~f~C~C~~TGY~GatCHtsi~e~SC 589 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQ-SWDDFECNCELTGYKGATCHTSIYELSC 589 (1306)
T ss_pred ccccCCccccCCCcccc-cccceeEeccccccccccccCCCcchhh
Confidence 4678 999999999944 5677899998 899999999776 5555
No 56
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=76.54 E-value=2 Score=31.07 Aligned_cols=15 Identities=33% Similarity=0.977 Sum_probs=8.3
Q ss_pred ceEeeCCCCcCCCCC
Q psy10559 23 GTCLCVTGWNGKHCT 37 (418)
Q Consensus 23 g~C~C~~G~~G~~Ce 37 (418)
|+|.|.++|.|.+|+
T Consensus 19 G~C~C~~~~~G~~C~ 33 (50)
T cd00055 19 GQCECKPNTTGRRCD 33 (50)
T ss_pred CEEeCCCcCCCCCCC
Confidence 555555555555553
No 57
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=75.92 E-value=2 Score=36.25 Aligned_cols=31 Identities=29% Similarity=0.628 Sum_probs=19.3
Q ss_pred CCCCCCCcEEecCC----CCCceEEcCCCCCCCCC
Q psy10559 42 PNSCSNHGQCRVNS----DSQWECKCSDGWDGKDC 72 (418)
Q Consensus 42 ~~~C~~~G~C~~~~----~g~~~C~C~~G~~G~~C 72 (418)
.+.|++||.|.... ..=|.|.|.+.+.....
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~~~ 46 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKTGS 46 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeecccccccc
Confidence 46688888883221 23477888887665433
No 58
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=74.66 E-value=0.96 Score=42.12 Aligned_cols=94 Identities=20% Similarity=0.509 Sum_probs=56.0
Q ss_pred CCCCCcEEec---------ceEeeCCCCc--CCCCCcCCC-CCCCCCCcEEecCC--CCCceEEcCCCCC---CCCCCcc
Q psy10559 13 WCNEHGQCKN---------GTCLCVTGWN--GKHCTLEGC-PNSCSNHGQCRVNS--DSQWECKCSDGWD---GKDCSVL 75 (418)
Q Consensus 13 pC~n~G~C~~---------g~C~C~~G~~--G~~Ce~~~C-~~~C~~~G~C~~~~--~g~~~C~C~~G~~---G~~C~~~ 75 (418)
+|.+-++|.+ +.|.|.+||. ...|-.+.| ...|. .|.|.++. .....|+|.-|.. ...|...
T Consensus 51 ~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~ 129 (197)
T PF06247_consen 51 PCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKT 129 (197)
T ss_dssp EEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEE
T ss_pred cccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCC
Confidence 6888889964 5899999996 345655667 44566 78995543 2345899998876 2223322
Q ss_pred ccccCCCCccCCCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCccc
Q psy10559 76 LEQNCNDGKDNDKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAITA 131 (418)
Q Consensus 76 ~~~~C~~g~c~~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~g 131 (418)
. ...| .-.|..+..|....+ .|.|.|.+|+.+
T Consensus 130 G--------------~T~C-------~LKCk~nE~CK~~~~---~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 130 G--------------ETKC-------SLKCKENEECKLVDG---YYKCVCKEGFPG 161 (197)
T ss_dssp E-----------------------------TTTEEEEEETT---EEEEEE-TT-EE
T ss_pred C--------------ccce-------eeecCCCcceeeeCc---EEEeecCCCCCC
Confidence 2 2222 234777888988877 899999999866
No 59
>KOG3607|consensus
Probab=74.45 E-value=2.1 Score=47.87 Aligned_cols=33 Identities=36% Similarity=0.804 Sum_probs=28.4
Q ss_pred cccCCCcCCCCCcEEec-ceEeeCCCCcCCCCCc
Q psy10559 6 NIDVDLLWCNEHGQCKN-GTCLCVTGWNGKHCTL 38 (418)
Q Consensus 6 ~id~C~~pC~n~G~C~~-g~C~C~~G~~G~~Ce~ 38 (418)
+...|+..|++||.|.+ ..|.|.+||.+.+|++
T Consensus 624 ~~~~~~~~C~g~GVCnn~~~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 624 NSSCCPTTCNGHGVCNNELNCHCEPGWAPPFCFI 657 (716)
T ss_pred cccccccccCCCcccCCCcceeeCCCCCCCcccc
Confidence 34567778999999987 7999999999999985
No 60
>KOG1218|consensus
Probab=72.93 E-value=10 Score=37.08 Aligned_cols=48 Identities=31% Similarity=0.946 Sum_probs=30.4
Q ss_pred ceEee-CCCCcCCCCCcC-CCCCCCCCCcEEecCCCCCceEEcCCCCCCCCCCc
Q psy10559 23 GTCLC-VTGWNGKHCTLE-GCPNSCSNHGQCRVNSDSQWECKCSDGWDGKDCSV 74 (418)
Q Consensus 23 g~C~C-~~G~~G~~Ce~~-~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G~~C~~ 74 (418)
.+..| ..+|.|..|+.+ .|...|.. -+| .+... .|.|..+|.+..|..
T Consensus 90 ~~~~~~~~~~~g~~C~~~~~~~~~c~~-~~C-~~~~~--~c~~~~~~~~~~C~~ 139 (316)
T KOG1218|consen 90 STGYCHLNGYEGPQCESPCPCGDGCAE-KTC-ANPRR--ECRCGGGYIGEQCGE 139 (316)
T ss_pred CCCcccCCCCCcccccCCCCcCCcccc-ccc-CCCcc--ceecCCcCccccccc
Confidence 34455 688999999742 34222544 566 33211 688888999888875
No 61
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=69.17 E-value=3.2 Score=30.98 Aligned_cols=26 Identities=31% Similarity=0.930 Sum_probs=15.5
Q ss_pred cCCCCCcEEe-c-----c--eEeeCCCCcCCCCC
Q psy10559 12 LWCNEHGQCK-N-----G--TCLCVTGWNGKHCT 37 (418)
Q Consensus 12 ~pC~n~G~C~-~-----g--~C~C~~G~~G~~Ce 37 (418)
.+|++||+.. + | .|+|+.-|.|++|.
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS 50 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence 3799999885 3 3 79999999999997
No 62
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=68.68 E-value=5.4 Score=28.78 Aligned_cols=16 Identities=25% Similarity=0.817 Sum_probs=14.5
Q ss_pred CceEEcCCCCCCCCCC
Q psy10559 58 QWECKCSDGWDGKDCS 73 (418)
Q Consensus 58 ~~~C~C~~G~~G~~C~ 73 (418)
..+|.|.++|.|..|+
T Consensus 18 ~G~C~C~~~~~G~~C~ 33 (50)
T cd00055 18 TGQCECKPNTTGRRCD 33 (50)
T ss_pred CCEEeCCCcCCCCCCC
Confidence 4589999999999997
No 63
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=67.25 E-value=3 Score=29.84 Aligned_cols=20 Identities=35% Similarity=1.049 Sum_probs=16.6
Q ss_pred cEEe--cceEeeCCCCcCCCCC
Q psy10559 18 GQCK--NGTCLCVTGWNGKHCT 37 (418)
Q Consensus 18 G~C~--~g~C~C~~G~~G~~Ce 37 (418)
..|. +|+|.|.++|+|.+|+
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPSTGQCVCKPGTTGPRCD 32 (49)
T ss_dssp SSEEETCEEESBSTTEESTTS-
T ss_pred CcccCCCCEEeccccccCCcCc
Confidence 3786 4899999999999996
No 64
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=67.02 E-value=6.7 Score=28.34 Aligned_cols=41 Identities=29% Similarity=0.708 Sum_probs=22.1
Q ss_pred EEecceEeeCCCCcCCCCCcCCCCCCCCCCcEEecCCCCCceEEcCCCCC
Q psy10559 19 QCKNGTCLCVTGWNGKHCTLEGCPNSCSNHGQCRVNSDSQWECKCSDGWD 68 (418)
Q Consensus 19 ~C~~g~C~C~~G~~G~~Ce~~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~ 68 (418)
..+++.|. +....|..|+.+ ..|..+..| .+ ..|.|++||.
T Consensus 6 ~~~~~~C~-~~~~~g~~C~~~---~qC~~~s~C-~~----g~C~C~~g~~ 46 (52)
T PF01683_consen 6 VAINGQCV-PRVQPGESCESD---EQCIGGSVC-VN----GRCQCPPGYV 46 (52)
T ss_pred EEECCEEC-ccCCCCCCCCCc---CCCCCcCEE-cC----CEeECCCCCE
Confidence 33445553 224556667642 334466777 32 2677777763
No 65
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=65.39 E-value=5.6 Score=24.73 Aligned_cols=11 Identities=27% Similarity=1.168 Sum_probs=7.6
Q ss_pred CceEEcCCCCC
Q psy10559 58 QWECKCSDGWD 68 (418)
Q Consensus 58 ~~~C~C~~G~~ 68 (418)
+|.|.|++||.
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 46777777775
No 66
>KOG3514|consensus
Probab=64.29 E-value=4.1 Score=46.80 Aligned_cols=36 Identities=33% Similarity=0.900 Sum_probs=30.5
Q ss_pred CC-CCCCCCCcEEecCCCCCceEEcC-CCCCCCCCCccc
Q psy10559 40 GC-PNSCSNHGQCRVNSDSQWECKCS-DGWDGKDCSVLL 76 (418)
Q Consensus 40 ~C-~~~C~~~G~C~~~~~g~~~C~C~-~G~~G~~C~~~~ 76 (418)
.| ++||.|+|.| ...-+.+.|.|. .||.|+.|+...
T Consensus 625 ~C~~nPC~N~g~C-~egwNrfiCDCs~T~~~G~~CerE~ 662 (1591)
T KOG3514|consen 625 ICESNPCQNGGKC-SEGWNRFICDCSGTGFEGRTCEREA 662 (1591)
T ss_pred ccCCCcccCCCCc-cccccccccccccCcccCcccccee
Confidence 68 8999999999 555678899997 689999998764
No 67
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=60.15 E-value=33 Score=35.29 Aligned_cols=71 Identities=18% Similarity=0.286 Sum_probs=54.0
Q ss_pred ceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEe
Q psy10559 161 AVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVII 234 (418)
Q Consensus 161 ~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~i 234 (418)
--++|.|.+..|.|+.|+.|...... -+.|..+|.|.+.+--|. -++.+.-..|+++...|-|.-++...+
T Consensus 287 ~gI~G~V~d~~g~pi~~A~V~v~g~~--~~~T~~~G~y~~~L~pG~-y~v~vs~~Gy~~~~~~V~v~~~~~~~~ 357 (363)
T cd06245 287 KGVHGVVTDKAGKPISGATIVLNGGH--RVYTKEGGYFHVLLAPGQ-HNINVIAEGYQQEHLPVVVSHDEASSV 357 (363)
T ss_pred cEEEEEEEcCCCCCccceEEEEeCCC--ceEeCCCcEEEEecCCce-EEEEEEEeCceeEEEEEEEcCCCeEEE
Confidence 45689999989999999999988533 366889999988655543 455667777888888888877776655
No 68
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=60.08 E-value=4.6 Score=37.70 Aligned_cols=84 Identities=21% Similarity=0.510 Sum_probs=48.4
Q ss_pred ceEeeCCCCc---CCCCC-cCCC------CCCCCCCcEEecCC----CCCceEEcCCCCCCC--CCCccccccCCCCccC
Q psy10559 23 GTCLCVTGWN---GKHCT-LEGC------PNSCSNHGQCRVNS----DSQWECKCSDGWDGK--DCSVLLEQNCNDGKDN 86 (418)
Q Consensus 23 g~C~C~~G~~---G~~Ce-~~~C------~~~C~~~G~C~~~~----~g~~~C~C~~G~~G~--~C~~~~~~~C~~g~c~ 86 (418)
+.|.|.+||. -..|+ ...| ..+|...+.|.... ...|.|.|..||.-. .|-.. .
T Consensus 20 fEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~---~------- 89 (197)
T PF06247_consen 20 FECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPN---K------- 89 (197)
T ss_dssp EEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEG---G-------
T ss_pred eEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchh---h-------
Confidence 6899999995 55676 3456 23688899994332 367899999999632 23211 1
Q ss_pred CCCCccccCCCCCCCCCCCCCCCccccCCCCCCceeecCCCCcc
Q psy10559 87 DKDGLVDCEDPECCSNHICRSSQLCVSAPKPIDILLRKQPPAIT 130 (418)
Q Consensus 87 ~~~G~~~C~dpdCc~~~~C~~~~~C~~~~~~~~~~~C~C~~G~~ 130 (418)
| ..-.|. .|.|+-.+.......|+|.-|+.
T Consensus 90 -------C------~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV 119 (197)
T PF06247_consen 90 -------C------NNKDCG-SGKCILDPDNPNNPTCSCNIGKV 119 (197)
T ss_dssp -------G------SS---T-TEEEEEEEGGGSEEEEEE-TEEE
T ss_pred -------c------CceecC-CCeEEecCCCCCCceeEeeeceE
Confidence 2 222343 56677666544455788887764
No 69
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=60.02 E-value=4.7 Score=27.47 Aligned_cols=19 Identities=32% Similarity=0.969 Sum_probs=14.4
Q ss_pred cEEecCCCCCceEEcCCCCC
Q psy10559 49 GQCRVNSDSQWECKCSDGWD 68 (418)
Q Consensus 49 G~C~~~~~g~~~C~C~~G~~ 68 (418)
-.| ++..++|+|.|++||.
T Consensus 10 h~C-~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 10 HIC-VNTPGSYRCSCPPGYK 28 (36)
T ss_dssp SEE-EEETTSEEEE-STTEE
T ss_pred CCC-ccCCCceEeECCCCCE
Confidence 367 5668899999999985
No 70
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=58.44 E-value=10 Score=35.73 Aligned_cols=34 Identities=26% Similarity=0.856 Sum_probs=24.6
Q ss_pred CCCCC-cCCC---CCCCCCCcEEecCCCCCceEEcCCCCCC
Q psy10559 33 GKHCT-LEGC---PNSCSNHGQCRVNSDSQWECKCSDGWDG 69 (418)
Q Consensus 33 G~~Ce-~~~C---~~~C~~~G~C~~~~~g~~~C~C~~G~~G 69 (418)
+..|+ .++| +..|. ..| .+..|+|.|.|.+||+.
T Consensus 181 ~~~C~~~~~C~~~~~~c~--~~C-~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICVVPDLCATLSHVCQ--QVC-ISTPGSYLCACTEGYAL 218 (224)
T ss_pred cccCcCchhhcCCCCCcc--ceE-EcCCCCEEeECCCCccC
Confidence 45564 4567 23465 479 78899999999999975
No 71
>KOG3516|consensus
Probab=54.17 E-value=9.4 Score=44.47 Aligned_cols=36 Identities=36% Similarity=0.869 Sum_probs=29.8
Q ss_pred CC-CCCCCCCcEEecCCCCCceEEcC-CCCCCCCCCccc
Q psy10559 40 GC-PNSCSNHGQCRVNSDSQWECKCS-DGWDGKDCSVLL 76 (418)
Q Consensus 40 ~C-~~~C~~~G~C~~~~~g~~~C~C~-~G~~G~~C~~~~ 76 (418)
.| +.+|.|+|+| +..-.+|.|.|. ..|.|+.|...+
T Consensus 957 hCss~~C~NGG~C-very~gytCDCs~Tay~Gp~Cs~ei 994 (1306)
T KOG3516|consen 957 HCSSYPCLNGGHC-VERYDGYTCDCSRTAYDGPFCSKEI 994 (1306)
T ss_pred ccccccccCCCEE-EEecCceeeccccCcCCCCcccccc
Confidence 46 6789999999 555678999997 679999998765
No 72
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=53.80 E-value=45 Score=34.53 Aligned_cols=73 Identities=22% Similarity=0.256 Sum_probs=54.7
Q ss_pred cceeeceEEee-cCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEe
Q psy10559 160 SAVVRGRVVTS-MGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVII 234 (418)
Q Consensus 160 ~~vi~G~v~~~-~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~i 234 (418)
.--++|.|.+. .|.|+.|+.|++.....| +.|..||.|-+.+--| .-++++.-..|+++...|-|.-++.+.+
T Consensus 296 ~~gI~G~V~D~~~g~pl~~AtV~V~g~~~~-~~Td~~G~f~~~l~pG-~ytl~vs~~GY~~~~~~v~V~~~~~~~~ 369 (375)
T cd03863 296 HRGVRGFVLDATDGRGILNATISVADINHP-VTTYKDGDYWRLLVPG-TYKVTASARGYDPVTKTVEVDSKGAVQV 369 (375)
T ss_pred cCeEEEEEEeCCCCCCCCCeEEEEecCcCc-eEECCCccEEEccCCe-eEEEEEEEcCcccEEEEEEEcCCCcEEE
Confidence 34678999986 699999999999753334 5578899998866665 4667777788888877777776666544
No 73
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=51.98 E-value=11 Score=25.40 Aligned_cols=11 Identities=27% Similarity=0.824 Sum_probs=7.2
Q ss_pred CceEEcCCCCC
Q psy10559 58 QWECKCSDGWD 68 (418)
Q Consensus 58 ~~~C~C~~G~~ 68 (418)
.++|.|++||.
T Consensus 17 ~~~C~CPeGyI 27 (34)
T PF09064_consen 17 PGQCFCPEGYI 27 (34)
T ss_pred CCceeCCCceE
Confidence 34677777774
No 74
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=50.44 E-value=11 Score=26.74 Aligned_cols=20 Identities=35% Similarity=1.066 Sum_probs=16.6
Q ss_pred cEEe--cceEeeCCCCcCCCCC
Q psy10559 18 GQCK--NGTCLCVTGWNGKHCT 37 (418)
Q Consensus 18 G~C~--~g~C~C~~G~~G~~Ce 37 (418)
..|. +|+|.|.++|+|.+|+
T Consensus 11 ~~C~~~~G~C~C~~~~~G~~C~ 32 (46)
T smart00180 11 GTCDPDTGQCECKPNVTGRRCD 32 (46)
T ss_pred CcccCCCCEEECCCCCCCCCCC
Confidence 3564 4899999999999996
No 75
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=50.15 E-value=48 Score=29.41 Aligned_cols=70 Identities=17% Similarity=0.169 Sum_probs=46.5
Q ss_pred eeeceEEee-cCCceeceEEEeeCC-CC-----ceEEEecCCceeeeecC----CceEEEEEecCCCCCc-cceeeeecc
Q psy10559 162 VVRGRVVTS-MGMGLVGVRVSTSTP-LE-----GFTLTRDDGWFDLLVNG----GGAVTLQFGRSPFKPH-NHIVHVPWN 229 (418)
Q Consensus 162 vi~G~v~~~-~Gtp~~Gv~vs~~~~-~~-----G~t~t~~dg~fdl~vNG----G~~~~l~f~r~Pf~~~-~~~V~vpwn 229 (418)
.+++.|++. .|+|..||.|..... .. +.+.|..||.|..+..+ -+.-.+.|..++|-.. ...-+.|+-
T Consensus 28 ~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~~~Td~dGR~~~l~~~~~~~~G~Y~L~F~t~~Yf~~~~~~~F~p~v 107 (137)
T PRK15036 28 ILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTATTGDYRVVFKTGDYFKKQNLESFFPEI 107 (137)
T ss_pred CeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEEEEECCCCCCccccCcccCCCeeEEEEEEcchhhhccCCCCCccee
Confidence 588888865 889999999998843 22 55788899999753221 2456777888886653 222334443
Q ss_pred ce
Q psy10559 230 EV 231 (418)
Q Consensus 230 ~~ 231 (418)
++
T Consensus 108 ~v 109 (137)
T PRK15036 108 PV 109 (137)
T ss_pred EE
Confidence 33
No 76
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=49.89 E-value=7.8 Score=29.73 Aligned_cols=20 Identities=45% Similarity=1.286 Sum_probs=13.2
Q ss_pred CcEEe-cceEeeCCCCcCCCC
Q psy10559 17 HGQCK-NGTCLCVTGWNGKHC 36 (418)
Q Consensus 17 ~G~C~-~g~C~C~~G~~G~~C 36 (418)
|-+|. +|.-.|.+||.|+.|
T Consensus 43 hy~Cd~~G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 43 HYTCDSNGNKVCLPGWTGPNC 63 (63)
T ss_dssp EEEE-SS--EEE-TTEESTTS
T ss_pred CcccCCCCCCCCCCCCcCCCC
Confidence 45676 488999999999987
No 77
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=49.63 E-value=49 Score=23.71 Aligned_cols=20 Identities=35% Similarity=0.848 Sum_probs=17.7
Q ss_pred CCCCCcEEecceEeeCCCCc
Q psy10559 13 WCNEHGQCKNGTCLCVTGWN 32 (418)
Q Consensus 13 pC~n~G~C~~g~C~C~~G~~ 32 (418)
.|..+..|.++.|.|++||.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcCCEeECCCCCE
Confidence 68888899999999999985
No 78
>KOG3514|consensus
Probab=46.52 E-value=1.1e+02 Score=35.92 Aligned_cols=37 Identities=14% Similarity=0.262 Sum_probs=31.2
Q ss_pred CCCCCCCCCCccccCCCCCCceeecCCC-Cccccccceeee
Q psy10559 100 CSNHICRSSQLCVSAPKPIDILLRKQPP-AITASFFERMKF 139 (418)
Q Consensus 100 c~~~~C~~~~~C~~~~~~~~~~~C~C~~-G~~g~~Ce~~~f 139 (418)
|..++|.|++.|..... .+-|-|.. +|.|..|++..-
T Consensus 626 C~~nPC~N~g~C~egwN---rfiCDCs~T~~~G~~CerE~t 663 (1591)
T KOG3514|consen 626 CESNPCQNGGKCSEGWN---RFICDCSGTGFEGRTCEREAT 663 (1591)
T ss_pred cCCCcccCCCCcccccc---ccccccccCcccCccccceee
Confidence 77889999999988766 78899975 999999998543
No 79
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=45.82 E-value=16 Score=34.36 Aligned_cols=34 Identities=29% Similarity=0.699 Sum_probs=23.2
Q ss_pred ccCCCCCCCC--CCCCCCCccccCCCCCCceeecCCCCccc
Q psy10559 93 DCEDPECCSN--HICRSSQLCVSAPKPIDILLRKQPPAITA 131 (418)
Q Consensus 93 ~C~dpdCc~~--~~C~~~~~C~~~~~~~~~~~C~C~~G~~g 131 (418)
.|.+.+.|.. ..|. ..|.+..+ +|.|.|++||+.
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g---~~~c~c~~g~~~ 218 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPG---SYLCACTEGYAL 218 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCC---CEEeECCCCccC
Confidence 3544444432 3354 46988888 799999999975
No 80
>cd03515 Link_domain_TSG_6_like This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accum
Probab=44.56 E-value=3.7 Score=34.08 Aligned_cols=23 Identities=22% Similarity=0.397 Sum_probs=19.8
Q ss_pred eccccccc---cccCCCCCcccchhh
Q psy10559 267 LATWKHGF---QGACPDRSSILAESQ 289 (418)
Q Consensus 267 ~~~~~~~~---~~~~~~~~~~~~~~q 289 (418)
.+.|++|| +++|.+.|+.||+..
T Consensus 9 ~grY~l~f~eA~~aC~~~ga~lAs~~ 34 (93)
T cd03515 9 SGKYKLTYTEAKAACEAEGAHLATYS 34 (93)
T ss_pred CCccccCHHHHHHHHHHcCCccCCHH
Confidence 56789999 578999999999865
No 81
>cd03864 M14_CPN Peptidase M14 Carboxypeptidase N (CPN, also known as kininase I, creatine kinase conversion factor, plasma carboxypeptidase B, arginine carboxypeptidase, and protaminase; EC 3.4.17.3) is an extracellular glycoprotein synthesized in the liver and released into the blood, where it is present in high concentrations. CPN belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPN plays an important role in protecting the body from excessive buildup of potentially deleterious peptides that normally act as local autocrine or paracrine hormones. It specifically removes C-terminal basic residues. As CPN can cleave lysine more avidly than arginine residues it is also called lysine carboxypeptidase. CPN substrates inclu
Probab=40.44 E-value=1e+02 Score=32.23 Aligned_cols=71 Identities=18% Similarity=0.195 Sum_probs=49.0
Q ss_pred eeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEe
Q psy10559 162 VVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVII 234 (418)
Q Consensus 162 vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~i 234 (418)
-|+|.|.+..|.|+.|+.|.+.....+ +.|..+|.|=.++.-| .-++.+.-..|+++..+|-|.-++...+
T Consensus 317 gI~G~V~D~~g~pi~~A~V~v~g~~~~-~~T~~~G~y~r~l~pG-~Y~l~vs~~Gy~~~t~~v~V~~~~~~~~ 387 (392)
T cd03864 317 GIKGMVTDENNNGIANAVISVSGISHD-VTSGTLGDYFRLLLPG-TYTVTASAPGYQPSTVTVTVGPAEATLV 387 (392)
T ss_pred eEEEEEECCCCCccCCeEEEEECCccc-eEECCCCcEEecCCCe-eEEEEEEEcCceeEEEEEEEcCCCcEEE
Confidence 678999999999999999998744333 6677888882223333 3455566777888877777766654433
No 82
>cd03865 M14_CPE_H Peptidase M14 Carboxypeptidase (CP) E (CPE, also known as carboxypeptidase H, and enkephalin convertase; EC 3.4.17.10) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPE is an important enzyme responsible for the proteolytic processing of prohormone intermediates (such as pro-insulin, pro-opiomelanocortin, or pro-gonadotropin-releasing hormone) by specifically removing C-terminal basic residues. In addition, it has been proposed that the regulated secretory pathway (RSP) of the nervous and endocrine systems utilizes membrane-bound CPE as a sorting receptor. A naturally occurring point mutation in CPE reduces the stability of the enzyme and causes its degradation, leading to an accumulation of numerous neuroendocrine pe
Probab=39.57 E-value=78 Score=33.16 Aligned_cols=71 Identities=21% Similarity=0.236 Sum_probs=52.0
Q ss_pred eeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeeccceEEec
Q psy10559 163 VRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWNEVVIID 235 (418)
Q Consensus 163 i~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn~~~~id 235 (418)
|+|.|.+..|.|+.|+.|++.....+ +.|..+|.|.+.+--| .-++...-..|+++...|-|.-++.+.+|
T Consensus 328 I~G~V~D~~g~pI~~AtV~V~g~~~~-~~T~~~G~Y~~~L~pG-~Ytv~vsa~Gy~~~~~~V~V~~~~~~~vd 398 (402)
T cd03865 328 VKGFVKDLQGNPIANATISVEGIDHD-ITSAKDGDYWRLLAPG-NYKLTASAPGYLAVVKKVAVPYSPAVRVD 398 (402)
T ss_pred eEEEEECCCCCcCCCeEEEEEcCccc-cEECCCeeEEECCCCE-EEEEEEEecCcccEEEEEEEcCCCcEEEe
Confidence 78999998899999999999854334 4677889998754444 34555667778888777888777665443
No 83
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=39.52 E-value=34 Score=28.56 Aligned_cols=29 Identities=24% Similarity=0.764 Sum_probs=20.9
Q ss_pred CCC--CCCCCCCcEEecCCCCCceEEcCCCCCC
Q psy10559 39 EGC--PNSCSNHGQCRVNSDSQWECKCSDGWDG 69 (418)
Q Consensus 39 ~~C--~~~C~~~G~C~~~~~g~~~C~C~~G~~G 69 (418)
+.| ...|..+|.|.. .....|.|.+||..
T Consensus 78 d~Cd~y~~CG~~g~C~~--~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS--NNSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCccccCCccEeCC--CCCCceECCCCcCC
Confidence 456 577988999943 33456999999875
No 84
>cd03867 M14_CPZ Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis. That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
Probab=37.81 E-value=91 Score=32.44 Aligned_cols=64 Identities=19% Similarity=0.171 Sum_probs=46.1
Q ss_pred ceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeee
Q psy10559 161 AVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHV 226 (418)
Q Consensus 161 ~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~v 226 (418)
--++|.|.+..|.|++|+.|.+.....+ +.|..+|.|...+.-| .-++.+.-..|+++..+|-+
T Consensus 318 ~~i~G~V~D~~g~pi~~A~V~v~g~~~~-~~Td~~G~y~~~l~~G-~y~l~vs~~Gy~~~~~~v~v 381 (395)
T cd03867 318 RGIKGFVKDKDGNPIKGARISVRGIRHD-ITTAEDGDYWRLLPPG-IHIVSAQAPGYTKVMKRVTL 381 (395)
T ss_pred ceeEEEEEcCCCCccCCeEEEEeccccc-eEECCCceEEEecCCC-cEEEEEEecCeeeEEEEEEe
Confidence 3578999999999999999988754334 6677889996544444 34466666777777666665
No 85
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=37.26 E-value=22 Score=24.46 Aligned_cols=27 Identities=26% Similarity=0.655 Sum_probs=17.8
Q ss_pred CCCCCCCcEEecCCCCCceEEcCCCCC
Q psy10559 42 PNSCSNHGQCRVNSDSQWECKCSDGWD 68 (418)
Q Consensus 42 ~~~C~~~G~C~~~~~g~~~C~C~~G~~ 68 (418)
...|..|+.|-....|...|.|..||.
T Consensus 4 ~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 4 DTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp SS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred CccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 356788899944446899999999995
No 86
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=37.10 E-value=1.1e+02 Score=31.38 Aligned_cols=71 Identities=21% Similarity=0.266 Sum_probs=49.7
Q ss_pred ceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeec-cceEE
Q psy10559 161 AVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPW-NEVVI 233 (418)
Q Consensus 161 ~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpw-n~~~~ 233 (418)
..++|.|.+..|.|+.|+.|.......+ +.|..+|.|.+.+ -.+.-++.+....|+++...+-+-- ++.+.
T Consensus 298 ~~i~G~V~d~~g~pl~~A~V~i~~~~~~-~~Td~~G~f~~~l-~~G~y~l~vs~~Gy~~~~~~v~v~~~g~~~~ 369 (374)
T cd03858 298 RGIKGFVRDANGNPIANATISVEGINHD-VTTAEDGDYWRLL-LPGTYNVTASAPGYEPQTKSVVVPNDNSAVV 369 (374)
T ss_pred CceEEEEECCCCCccCCeEEEEecceee-eEECCCceEEEec-CCEeEEEEEEEcCcceEEEEEEEecCCceEE
Confidence 3789999999999999999988543223 6788999997654 3455566666677777766665543 44443
No 87
>cd03518 Link_domain_HAPLN_module_1 Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=36.62 E-value=5.8 Score=33.07 Aligned_cols=23 Identities=30% Similarity=0.588 Sum_probs=20.0
Q ss_pred eccccccc---cccCCCCCcccchhh
Q psy10559 267 LATWKHGF---QGACPDRSSILAESQ 289 (418)
Q Consensus 267 ~~~~~~~~---~~~~~~~~~~~~~~q 289 (418)
.+.|++|| +++|.+.|+.||+..
T Consensus 9 ~grY~l~f~eA~~aC~~~ga~lAs~~ 34 (95)
T cd03518 9 LGRYNLNFHEAQQACEEQDATLASFE 34 (95)
T ss_pred CCccccCHHHHHHHHHHcCCeeCCHH
Confidence 57899999 578999999999865
No 88
>cd03516 Link_domain_CD44_like This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marke
Probab=35.77 E-value=6.6 Score=35.16 Aligned_cols=29 Identities=21% Similarity=0.291 Sum_probs=23.3
Q ss_pred ccCcee----eccccccc---cccCCCCCcccchhh
Q psy10559 261 TMKPVV----LATWKHGF---QGACPDRSSILAESQ 289 (418)
Q Consensus 261 ~~~pv~----~~~~~~~~---~~~~~~~~~~~~~~q 289 (418)
+.++|| .+.|+++| +++|.+.|+.||+..
T Consensus 4 rv~GVFh~~~~grY~lnf~eA~~aC~~~ga~lAs~~ 39 (144)
T cd03516 4 RIMGVFLVEKNGRYSLNFTEAKEACRALGLTLASKA 39 (144)
T ss_pred ceEEEEEeccCCcccCCHHHHHHHHHHcCCeeCCHH
Confidence 455666 37888999 578999999999865
No 89
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=35.00 E-value=1.1e+02 Score=31.43 Aligned_cols=74 Identities=16% Similarity=0.072 Sum_probs=50.9
Q ss_pred ccceeeceEEeecCCceeceEEEeeCCCCceEEEecCCceeeeecCCceEEEEEecCCCCCcccee-eeeccceEEe
Q psy10559 159 RSAVVRGRVVTSMGMGLVGVRVSTSTPLEGFTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIV-HVPWNEVVII 234 (418)
Q Consensus 159 ~~~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V-~vpwn~~~~i 234 (418)
....++|.|.+..|.|+.|+.|.......+ +.+..+|.|...+.-| .-++.+....|+++...+ -|.-++.+.+
T Consensus 294 ~~~~i~G~V~d~~g~pv~~A~V~v~~~~~~-~~td~~G~y~~~l~~G-~Y~l~vs~~Gf~~~~~~~v~v~~g~~~~~ 368 (372)
T cd03868 294 VHIGVKGFVRDASGNPIEDATIMVAGIDHN-VTTAKFGDYWRLLLPG-TYTITAVAPGYEPSTVTDVVVKEGEATSV 368 (372)
T ss_pred hCCceEEEEEcCCCCcCCCcEEEEEecccc-eEeCCCceEEecCCCE-EEEEEEEecCCCceEEeeEEEcCCCeEEE
Confidence 445678999999999999999998754333 6788899996544333 456666777788875543 3544554433
No 90
>KOG0196|consensus
Probab=33.83 E-value=45 Score=37.94 Aligned_cols=44 Identities=30% Similarity=0.853 Sum_probs=28.2
Q ss_pred ceEeeCCCCc----CCCCCc------------CCCCCCCCCCcEEecCCCCCceEEcCCCCCC
Q psy10559 23 GTCLCVTGWN----GKHCTL------------EGCPNSCSNHGQCRVNSDSQWECKCSDGWDG 69 (418)
Q Consensus 23 g~C~C~~G~~----G~~Ce~------------~~C~~~C~~~G~C~~~~~g~~~C~C~~G~~G 69 (418)
|.|.|.+||. |..|+. ..| .+|..|.. ....+.-.|.|..||.-
T Consensus 259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C-~~CP~~S~--s~~ega~~C~C~~gyyR 318 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLC-LPCPPNSH--SSSEGATSCTCENGYYR 318 (996)
T ss_pred CceeecCCCCcccCCCcceeCCCCcccCCCCCCCC-CCCCCCCC--CCCCCCCcccccCCccc
Confidence 7999999994 667752 112 13444433 22356678999999854
No 91
>cd01102 Link_Domain The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of
Probab=32.04 E-value=7.9 Score=32.09 Aligned_cols=23 Identities=30% Similarity=0.388 Sum_probs=19.9
Q ss_pred eccccccc---cccCCCCCcccchhh
Q psy10559 267 LATWKHGF---QGACPDRSSILAESQ 289 (418)
Q Consensus 267 ~~~~~~~~---~~~~~~~~~~~~~~q 289 (418)
.+.|++|| +++|.+.|+.||+..
T Consensus 9 ~g~y~l~f~eA~~aC~~~ga~lAs~~ 34 (92)
T cd01102 9 NGRYKLTFAEAALACKARGAHLATPG 34 (92)
T ss_pred CCCcccCHHHHHHHHHHcCCEeCCHH
Confidence 47799999 578999999999866
No 92
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=29.68 E-value=1.8e+02 Score=30.12 Aligned_cols=70 Identities=23% Similarity=0.183 Sum_probs=47.8
Q ss_pred ccceeeceEEeecCCceeceEEEeeCCCCc-eEEEecCCceeeeecCCceEEEEEecCCCCCccceeeeecc
Q psy10559 159 RSAVVRGRVVTSMGMGLVGVRVSTSTPLEG-FTLTRDDGWFDLLVNGGGAVTLQFGRSPFKPHNHIVHVPWN 229 (418)
Q Consensus 159 ~~~vi~G~v~~~~Gtp~~Gv~vs~~~~~~G-~t~t~~dg~fdl~vNGG~~~~l~f~r~Pf~~~~~~V~vpwn 229 (418)
...-++|.|.+..|.|++|+.|.......+ .+.|..+|.|-+.+--| .-++.+.-..|+++...|.++-+
T Consensus 293 ~~~gI~G~V~D~~g~pi~~A~V~v~g~~~~~~~~T~~~G~y~~~l~pG-~Y~v~vsa~Gy~~~~~~v~v~~~ 363 (376)
T cd03866 293 VHLGVKGQVFDSNGNPIPNAIVEVKGRKHICPYRTNVNGEYFLLLLPG-KYMINVTAPGFKTVITNVIIPYN 363 (376)
T ss_pred hcCceEEEEECCCCCccCCeEEEEEcCCceeEEEECCCceEEEecCCe-eEEEEEEeCCcceEEEEEEeCCC
Confidence 344679999999999999999988754333 34688889996543333 23445566677777666666543
No 93
>cd01204 IRS_PTB Insulin receptor substrate (IRS) Phosphotyrosine-binding domain(PTB). Insulin receptor substrate (IRS) Phosphotyrosine-binding domain(PTB). This domain has a PH-like fold and is found in insulin receptor substrate molecules. IRS molecules have an N-terminal PH domain , which is followed by an IRS-like PTB domain. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs in the insulin receptor, IGF-I receptor and the IL-4 receptor.
Probab=29.15 E-value=96 Score=26.15 Aligned_cols=18 Identities=17% Similarity=0.289 Sum_probs=13.4
Q ss_pred cceEEeEEeeccceeEEe
Q psy10559 395 LQLRKSLCKDFTHTLTFY 412 (418)
Q Consensus 395 l~~vh~~~~~~~~~~~~~ 412 (418)
+.+--+.|||.+|+-.||
T Consensus 45 v~l~L~~IRRCGHs~~~F 62 (104)
T cd01204 45 VELLLMNIRRCGHSENFF 62 (104)
T ss_pred EEEeeeeeeeccCcccEE
Confidence 333345699999998887
No 94
>PF00818 Ice_nucleation: Ice nucleation protein repeat; InterPro: IPR000258 Certain Gram-negative bacteria express proteins that enable them to promote nucleation of ice at relatively high temperatures (above -5C) [, ]. These proteins are localised at the outer membrane surface and can cause frost damage to many plants. The primary structure of the proteins contains a highly repetitive domain that dominates the sequence. The domain comprises a number of 48-residue repeats, which themselves contain 3 blocks of 16 residues, the first 8 of which are identical. It is thought that the repetitive domain may be responsible for aligning water molecules in the seed crystal. [.........48.residues.repeated.domain..........] / / | | \ \ AGYGSTxTagxxssli AGYGSTxTagxxsxlt AGYGSTxTaqxxsxlt [16.residues...] [16.residues...] [16.residues...] ; GO: 0009279 cell outer membrane
Probab=26.48 E-value=26 Score=19.65 Aligned_cols=13 Identities=38% Similarity=0.562 Sum_probs=11.1
Q ss_pred EecCCCCCCceeE
Q psy10559 370 YHSSRSAGYLSTI 382 (418)
Q Consensus 370 y~s~~~~gy~~~~ 382 (418)
|-|-+++|+.|.|
T Consensus 2 YGSTqTA~~~S~L 14 (16)
T PF00818_consen 2 YGSTQTAGEDSSL 14 (16)
T ss_pred CCcccccCCCccc
Confidence 7788999999876
No 95
>smart00445 LINK Link (Hyaluronan-binding).
Probab=23.16 E-value=12 Score=31.11 Aligned_cols=22 Identities=27% Similarity=0.481 Sum_probs=19.0
Q ss_pred ccccccc---cccCCCCCcccchhh
Q psy10559 268 ATWKHGF---QGACPDRSSILAESQ 289 (418)
Q Consensus 268 ~~~~~~~---~~~~~~~~~~~~~~q 289 (418)
+.|++|| +++|.+.|+.||+..
T Consensus 11 g~y~l~f~eA~~aC~~~ga~lAs~~ 35 (94)
T smart00445 11 GRYKLTFAEAREACRAQGATLATVG 35 (94)
T ss_pred CCCccCHHHHHHHHHHcCCEeCCHH
Confidence 4799999 578999999999865
No 96
>PF06339 Ectoine_synth: Ectoine synthase; InterPro: IPR010462 This family consists of several bacterial ectoine synthase proteins. The ectABC genes encode the diaminobutyric acid acetyltransferase (EctA), the diaminobutyric acid aminotransferase (EctB), and the ectoine synthase (EctC). Together these proteins constitute the ectoine biosynthetic pathway [].; GO: 0016836 hydro-lyase activity, 0006596 polyamine biosynthetic process
Probab=21.82 E-value=71 Score=28.03 Aligned_cols=36 Identities=14% Similarity=0.075 Sum_probs=26.5
Q ss_pred cCCCCCcccCCCCCCceeeeeeeeCCCceEEEEecC
Q psy10559 338 LTPSSNISCASSNPPQVVQESFQIPGTGLNLVYHSS 373 (418)
Q Consensus 338 ~~~~~~~~~~~~~~~~~~~e~~~i~gt~~~l~y~s~ 373 (418)
.-.|-|++++.+.=-=++||.+--|||..+|.|.--
T Consensus 20 ~w~SrRlll~~DgmGFS~h~T~i~aGtet~~~YknH 55 (126)
T PF06339_consen 20 NWESRRLLLKDDGMGFSFHETTIYAGTETHIHYKNH 55 (126)
T ss_pred CceEEEEEEccCCCCEEEEEEEEeCCCeeEEEecCc
Confidence 334555555555555599999999999999999754
Done!