Query psy2857
Match_columns 332
No_of_seqs 188 out of 1483
Neff 10.3
Searched_HMMs 46136
Date Fri Aug 16 18:38:55 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2857.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2857hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.6 9.9E-15 2.2E-19 134.0 14.5 199 27-320 702-913 (1289)
2 KOG1214|consensus 99.6 2.3E-14 5.1E-19 131.6 11.8 140 123-317 715-860 (1289)
3 KOG1217|consensus 99.5 5.7E-13 1.2E-17 124.4 18.9 270 5-329 110-403 (487)
4 KOG1217|consensus 99.4 3.3E-11 7.3E-16 112.5 18.5 262 4-328 151-434 (487)
5 KOG1219|consensus 99.4 1.1E-12 2.3E-17 130.9 8.3 111 143-318 3865-3976(4289)
6 KOG1219|consensus 99.2 4.4E-11 9.6E-16 119.8 8.9 111 20-177 3865-3976(4289)
7 KOG1225|consensus 99.1 2E-09 4.3E-14 97.8 14.0 155 81-302 233-387 (525)
8 KOG1225|consensus 99.1 6.7E-10 1.5E-14 100.8 9.8 131 6-176 235-365 (525)
9 KOG4260|consensus 98.9 1.5E-09 3.1E-14 88.2 6.1 135 31-175 122-270 (350)
10 KOG4289|consensus 98.9 1.5E-09 3.3E-14 105.5 6.6 79 227-312 1229-1308(2531)
11 KOG4289|consensus 98.9 1.3E-09 2.9E-14 105.9 5.8 83 5-89 1222-1308(2531)
12 PF07645 EGF_CA: Calcium-bindi 98.8 4E-09 8.7E-14 62.6 2.6 38 281-318 1-38 (42)
13 KOG4260|consensus 98.7 1.8E-08 3.9E-13 82.0 5.8 161 128-316 132-306 (350)
14 KOG0994|consensus 98.7 7.7E-08 1.7E-12 92.3 10.8 138 161-318 996-1157(1758)
15 PF07645 EGF_CA: Calcium-bindi 98.3 1.1E-06 2.4E-11 52.1 3.3 34 241-274 1-36 (42)
16 PF00008 EGF: EGF-like domain 98.1 2.1E-06 4.6E-11 47.4 2.7 30 245-274 1-31 (32)
17 smart00179 EGF_CA Calcium-bind 98.0 1E-05 2.2E-10 47.1 4.2 35 281-316 1-36 (39)
18 smart00179 EGF_CA Calcium-bind 98.0 1.2E-05 2.7E-10 46.7 4.2 35 18-52 1-37 (39)
19 PF00008 EGF: EGF-like domain 98.0 4.3E-06 9.2E-11 46.2 1.7 30 22-51 1-31 (32)
20 PF12947 EGF_3: EGF domain; I 97.9 4.7E-06 1E-10 47.1 1.3 32 286-317 2-33 (36)
21 KOG1226|consensus 97.9 0.00016 3.5E-09 68.0 11.5 138 27-179 469-621 (783)
22 PF12662 cEGF: Complement Clr- 97.9 1.2E-05 2.5E-10 40.6 2.1 23 262-284 1-24 (24)
23 cd00054 EGF_CA Calcium-binding 97.7 9.2E-05 2E-09 42.5 4.1 35 18-52 1-36 (38)
24 PF12662 cEGF: Complement Clr- 97.6 4.8E-05 1E-09 38.4 2.4 24 304-328 1-24 (24)
25 KOG1836|consensus 97.6 0.0016 3.6E-08 68.0 15.4 140 160-320 863-1022(1705)
26 cd00054 EGF_CA Calcium-binding 97.6 0.0001 2.2E-09 42.3 3.6 34 242-275 2-36 (38)
27 KOG1226|consensus 97.5 0.00044 9.5E-09 65.1 8.8 134 6-158 479-636 (783)
28 PF14670 FXa_inhibition: Coagu 97.5 6.1E-05 1.3E-09 42.5 1.7 29 290-320 6-34 (36)
29 PF12947 EGF_3: EGF domain; I 97.4 0.0001 2.2E-09 41.7 2.1 29 248-276 6-34 (36)
30 KOG0994|consensus 97.4 0.0016 3.4E-08 63.9 10.4 34 157-190 878-915 (1758)
31 PF06247 Plasmod_Pvs28: Plasmo 97.2 9.3E-05 2E-09 57.6 0.6 140 27-178 8-165 (197)
32 cd00053 EGF Epidermal growth f 97.2 0.00071 1.5E-08 38.1 3.8 28 289-316 5-32 (36)
33 cd00053 EGF Epidermal growth f 97.1 0.00096 2.1E-08 37.5 4.0 28 24-51 5-32 (36)
34 PF06247 Plasmod_Pvs28: Plasmo 97.1 0.00082 1.8E-08 52.4 4.2 131 124-319 20-165 (197)
35 smart00181 EGF Epidermal growt 97.0 0.0011 2.5E-08 37.2 3.6 26 290-316 6-31 (35)
36 smart00181 EGF Epidermal growt 96.9 0.0017 3.6E-08 36.5 3.9 29 22-51 2-31 (35)
37 PF14670 FXa_inhibition: Coagu 96.5 0.0028 6.2E-08 35.7 2.4 24 253-276 9-32 (36)
38 cd01475 vWA_Matrilin VWA_Matri 96.2 0.0063 1.4E-07 50.8 4.1 41 278-320 183-223 (224)
39 PF07974 EGF_2: EGF-like domai 96.1 0.013 2.8E-07 32.1 3.7 26 290-317 6-31 (32)
40 PF12661 hEGF: Human growth fa 96.1 0.0039 8.4E-08 26.5 1.3 13 264-276 1-13 (13)
41 PF07974 EGF_2: EGF-like domai 96.0 0.0091 2E-07 32.7 2.8 26 248-275 6-31 (32)
42 KOG1836|consensus 94.3 0.79 1.7E-05 48.9 13.0 51 8-58 760-816 (1705)
43 PF12946 EGF_MSP1_1: MSP1 EGF 94.1 0.043 9.3E-07 30.8 1.9 31 245-275 2-33 (37)
44 PF12946 EGF_MSP1_1: MSP1 EGF 93.1 0.053 1.2E-06 30.4 1.2 30 22-51 2-32 (37)
45 cd01475 vWA_Matrilin VWA_Matri 93.0 0.099 2.1E-06 43.6 3.3 38 238-275 183-220 (224)
46 KOG1218|consensus 91.2 9.7 0.00021 33.3 17.3 97 38-145 13-110 (316)
47 smart00051 DSL delta serrate l 90.9 0.5 1.1E-05 30.5 4.0 47 262-317 16-62 (63)
48 smart00051 DSL delta serrate l 90.1 0.56 1.2E-05 30.2 3.7 46 4-52 16-62 (63)
49 KOG1218|consensus 90.0 6.2 0.00013 34.5 11.7 40 121-162 159-199 (316)
50 PF01683 EB: EB module; Inter 83.6 1.5 3.3E-05 26.8 3.0 27 109-135 22-48 (52)
51 PF00053 Laminin_EGF: Laminin 83.3 1.1 2.4E-05 27.1 2.2 24 255-280 12-35 (49)
52 PF00954 S_locus_glycop: S-loc 81.3 1.8 3.9E-05 31.4 3.1 33 282-316 77-109 (110)
53 cd00055 EGF_Lam Laminin-type e 80.1 2.2 4.8E-05 25.9 2.7 22 256-279 14-35 (50)
54 PF01683 EB: EB module; Inter 78.0 4.5 9.7E-05 24.7 3.7 24 148-175 25-48 (52)
55 cd00055 EGF_Lam Laminin-type e 76.2 4.8 0.0001 24.4 3.4 22 306-329 20-41 (50)
56 PHA03099 epidermal growth fact 73.6 4.1 8.9E-05 30.0 2.9 36 18-54 41-81 (139)
57 smart00180 EGF_Lam Laminin-typ 73.4 3.9 8.4E-05 24.4 2.4 21 255-277 12-32 (46)
58 PF00954 S_locus_glycop: S-loc 71.3 4.8 0.0001 29.2 3.0 34 141-175 76-109 (110)
59 PHA02887 EGF-like protein; Pro 70.4 4.2 9.1E-05 29.4 2.4 27 250-277 94-122 (126)
60 PF01414 DSL: Delta serrate li 70.3 2.3 5E-05 27.4 1.0 46 262-317 16-62 (63)
61 PHA02887 EGF-like protein; Pro 69.6 5.4 0.00012 28.9 2.8 27 27-54 94-122 (126)
62 PF09064 Tme5_EGF_like: Thromb 69.0 3.9 8.4E-05 22.5 1.5 13 264-276 19-31 (34)
63 PHA03099 epidermal growth fact 65.9 5.5 0.00012 29.4 2.2 27 250-277 53-81 (139)
64 PF12955 DUF3844: Domain of un 64.7 6.8 0.00015 28.0 2.5 35 282-316 5-44 (103)
65 KOG3516|consensus 63.4 6.5 0.00014 40.1 2.9 36 19-54 545-581 (1306)
66 KOG3512|consensus 61.9 23 0.00049 32.6 5.7 25 253-279 406-430 (592)
67 KOG3516|consensus 55.4 9.9 0.00021 38.9 2.7 39 278-318 541-580 (1306)
68 KOG3514|consensus 53.4 9.7 0.00021 38.5 2.3 34 21-54 625-659 (1591)
69 KOG3509|consensus 47.5 36 0.00078 34.7 5.1 71 243-317 407-477 (964)
70 KOG3514|consensus 36.3 24 0.00051 36.0 2.0 34 244-277 625-659 (1591)
71 PF01826 TIL: Trypsin Inhibito 33.5 20 0.00044 22.0 0.8 21 264-284 34-54 (55)
72 PF04863 EGF_alliinase: Alliin 32.6 23 0.0005 21.9 0.8 30 25-54 17-50 (56)
73 KOG3512|consensus 29.8 69 0.0015 29.6 3.6 59 259-320 368-429 (592)
74 PF05092 PIF: Per os infectivi 25.5 1.5E+02 0.0032 28.1 5.0 49 3-51 130-182 (522)
75 KOG0196|consensus 24.0 1.1E+02 0.0025 30.6 4.2 56 264-324 260-329 (996)
76 KOG0196|consensus 20.3 2.2E+02 0.0048 28.7 5.2 17 160-176 304-320 (996)
No 1
>KOG1214|consensus
Probab=99.62 E-value=9.9e-15 Score=133.97 Aligned_cols=199 Identities=28% Similarity=0.676 Sum_probs=137.9
Q ss_pred CCCCCeeeeCCC-CeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeeeCCCCCCCCCCCccceeeccC
Q psy2857 27 CGVNATCIDTQG-SYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAKVACEQVDV 105 (332)
Q Consensus 27 C~~~g~C~~~~g-~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~~c~~~~~ 105 (332)
|..++.|....+ .|.|.|..||.|+. +.|.++++|+...+.|++++.|++..+.|+|.|..||......- .|..+..
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gdg-r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~-tCV~i~~ 779 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGDG-RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRH-TCVLITP 779 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCCC-CCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCc-ceEEecC
Confidence 666677887744 79999999999987 67999999999889999999999999999999999987654322 3432210
Q ss_pred CCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCC--CceecC-CCceEeeCCCCCccCCCCcccc
Q psy2857 106 TSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQ--AQCTNT-PGSFRCDCVEGYVGAPPRIKCK 182 (332)
Q Consensus 106 ~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~--~~C~~~-~~~~~C~C~~G~~g~~~~~~c~ 182 (332)
. .+ ...|.+. .+.|... .+|+.. .++|.|.|.+||.|+.
T Consensus 780 --p-ap------------------------~n~Ce~g-----~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG------ 821 (1289)
T KOG1214|consen 780 --P-AP------------------------ANPCEDG-----SHTCAIAGQARCVHHGGSTYSCACLPGFSGDG------ 821 (1289)
T ss_pred --C-CC------------------------CCccccC-----ccccCcCCceEEEecCCceEEEeecCCccCCc------
Confidence 0 00 1223221 1333333 345544 4679999999999862
Q ss_pred ccCcccceeeccccCcccccccCCcccccceecccccceecccccEEecccCccccccccCccCCCCCCCCCeeeecCCc
Q psy2857 183 DVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRVDINECQSNPCGVNATCIDTQGS 262 (332)
Q Consensus 183 ~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~~ 262 (332)
..+.++|+|.++-|..++.|++++++
T Consensus 822 ------------------------------------------------------~~c~dvDeC~psrChp~A~Cyntpgs 847 (1289)
T KOG1214|consen 822 ------------------------------------------------------HQCTDVDECSPSRCHPAATCYNTPGS 847 (1289)
T ss_pred ------------------------------------------------------cccccccccCccccCCCceEecCCCc
Confidence 13456799999999999999999999
Q ss_pred eeeecCCCcccCCCCcccc----cccccCC---CCCCCCCC-eee-cCCCceeeeCCCCCCCCCCCC
Q psy2857 263 YSCVCKEHYTGDPYQACSD----IDECKAL---DKPCGLRA-ICE-NTVPGFNCLCPKGYSGKPDAK 320 (332)
Q Consensus 263 ~~C~C~~G~~g~~~~~C~~----~d~C~~~---~~~C~~~~-~C~-~~~g~~~C~C~~g~~g~~~~~ 320 (332)
|.|+|.+||.|++.. |.. .-.|... +-.|..+. .|. ..+.+|.|.|.++-.|+...+
T Consensus 848 fsC~C~pGy~GDGf~-CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~ 913 (1289)
T KOG1214|consen 848 FSCRCQPGYYGDGFQ-CVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPH 913 (1289)
T ss_pred ceeecccCccCCCce-ecCCCccCCccccccccceeeccccceeEeeCCCcccCCCCCCCCCCCCCC
Confidence 999999999999754 332 1223222 23354332 222 234567888888777766554
No 2
>KOG1214|consensus
Probab=99.56 E-value=2.3e-14 Score=131.58 Aligned_cols=140 Identities=38% Similarity=0.868 Sum_probs=109.0
Q ss_pred CceecCCCCccCCCCeeecCCCCC-CCCCCCCCCceecCCCceEeeCCCCCccCCCCccccccCcccceeeccccCcccc
Q psy2857 123 DGLCYCRPGFDARGSVCVDVDECQ-LGDPCGPQAQCTNTPGSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLH 201 (332)
Q Consensus 123 ~~~c~C~~g~~~~g~~c~~~~~C~-~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~ 201 (332)
.+.|.|..||.+.++.|.+.++|+ ....|.++.+|++.+++|+|.|..||.......+|..+.-
T Consensus 715 ~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~--------------- 779 (1289)
T KOG1214|consen 715 DYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITP--------------- 779 (1289)
T ss_pred ceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecC---------------
Confidence 458999999999999999999998 3467999999999999999999999987665544433210
Q ss_pred cccCCcccccceecccccceecccccEEecccCccccccccCccCC--CCCCCCC--eeeecC-CceeeecCCCcccCCC
Q psy2857 202 SVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRVDINECQS--NPCGVNA--TCIDTQ-GSYSCVCKEHYTGDPY 276 (332)
Q Consensus 202 ~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~--~~C~~~~--~C~~~~-~~~~C~C~~G~~g~~~ 276 (332)
. ..++.|+. +.|..++ .|+... ++|.|+|.+||.|++.
T Consensus 780 ------------------------------p-------ap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~ 822 (1289)
T KOG1214|consen 780 ------------------------------P-------APANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH 822 (1289)
T ss_pred ------------------------------C-------CCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc
Confidence 0 01122322 2343333 455544 4799999999999986
Q ss_pred CcccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857 277 QACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP 317 (332)
Q Consensus 277 ~~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 317 (332)
. |.|+|+|. +..|.++++|+|++++|.|+|.+||.|+.
T Consensus 823 ~-c~dvDeC~--psrChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 823 Q-CTDVDECS--PSRCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred c-cccccccC--ccccCCCceEecCCCcceeecccCccCCC
Confidence 4 88999997 47899999999999999999999999985
No 3
>KOG1217|consensus
Probab=99.53 E-value=5.7e-13 Score=124.36 Aligned_cols=270 Identities=32% Similarity=0.707 Sum_probs=186.9
Q ss_pred EEEEecCCceecccCC--cCCCCC--CCCCCeeeeC---CCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceee
Q psy2857 5 VLVRILLGVRAIVDIN--ECQSNP--CGVNATCIDT---QGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICEN 77 (332)
Q Consensus 5 ~~c~c~~g~~~~~~~~--~C~~~~--C~~~g~C~~~---~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~ 77 (332)
..|.|..||.+..+.. .|...+ +..++.|.+. ...|.|.|..||.+..+.. ..++|.....+|.+.+.|.+
T Consensus 110 ~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~--~~~~C~~~~~~c~~~~~C~~ 187 (487)
T KOG1217|consen 110 YECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCET--DLDECIQYSSPCQNGGTCVN 187 (487)
T ss_pred ceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccc--cccccccCCCCcCCCccccc
Confidence 4578999999986655 477766 3556778774 3589999999999976642 22677655667999999999
Q ss_pred CCCCeeeeCCCCCCCCCCCccceeeccCCCCCCCCccccCCCcccC-ceecCCCCccCCCCee-ecCCCCCCCCCCCCCC
Q psy2857 78 TVPGFNCLCPKGYSGKPDAKVACEQVDVTSECSSNFECVNNAECVD-GLCYCRPGFDARGSVC-VDVDECQLGDPCGPQA 155 (332)
Q Consensus 78 ~~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~~~~c~~-~~c~C~~g~~~~g~~c-~~~~~C~~~~~C~~~~ 155 (332)
..++|.|.|..+|.+...... .+...|.. ..+.+.++|. +..| .++.++... . +
T Consensus 188 ~~~~~~C~c~~~~~~~~~~~~-----------------~~~~~c~~~~~~~~~~g~~--~~~c~~~~~~~~~~----~-~ 243 (487)
T KOG1217|consen 188 TGGSYLCSCPPGYTGSTCETT-----------------GNGGTCVDSVACSCPPGAR--GPECEVSIVECASG----D-G 243 (487)
T ss_pred CCCCeeEeCCCCccCCcCcCC-----------------CCCceEecceeccCCCCCC--CCCcccccccccCC----C-C
Confidence 999999999999998743321 11122222 3567777776 4444 334444311 4 8
Q ss_pred ceecCCCceEeeCCCCCccCC-----CCccccccC-cccceeeccccCcccccccCCcccccceecccccceecccccEE
Q psy2857 156 QCTNTPGSFRCDCVEGYVGAP-----PRIKCKDVR-WEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFV 229 (332)
Q Consensus 156 ~C~~~~~~~~C~C~~G~~g~~-----~~~~c~~~~-c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~ 229 (332)
+|++..+.+.|.|++||.+.. ....|.... |.++..|..... .+.|.|.. +|+
T Consensus 244 ~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~-----------~~~C~C~~----------g~~ 302 (487)
T KOG1217|consen 244 TCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPG-----------SYRCTCPP----------GFT 302 (487)
T ss_pred cccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCC-----------cceeeCCC----------CCC
Confidence 899999999999999999987 234555543 666666653222 26777777 888
Q ss_pred ecccCccccccccCcc----CCCCCCCCCee--eecCCceeeecCCCcccCCCCccccc-ccccCCCCCCCCCCeeec-C
Q psy2857 230 IEDAKRNLNRVDINEC----QSNPCGVNATC--IDTQGSYSCVCKEHYTGDPYQACSDI-DECKALDKPCGLRAICEN-T 301 (332)
Q Consensus 230 ~~~~~~~~~~~~~~~C----~~~~C~~~~~C--~~~~~~~~C~C~~G~~g~~~~~C~~~-d~C~~~~~~C~~~~~C~~-~ 301 (332)
+..+ ..+.+..+| ...+|.+++.| .+..+.+.|.|..||.|..+ ++. ++|.. .++..++.|++ .
T Consensus 303 g~~~---~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C---~~~~~~C~~--~~~~~~~~c~~~~ 374 (487)
T KOG1217|consen 303 GRLC---TECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRC---EDSNDECAS--SPCCPGGTCVNET 374 (487)
T ss_pred CCCC---ccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCcc---ccCCccccC--CccccCCEeccCC
Confidence 7766 234455677 34668887788 34445788999999888754 455 48876 34778899999 7
Q ss_pred CCceeeeCCCCCCCC-CCCCccccccccC
Q psy2857 302 VPGFNCLCPKGYSGK-PDAKVACEQEKAG 329 (332)
Q Consensus 302 ~g~~~C~C~~g~~g~-~~~~~~c~~~~~~ 329 (332)
.++|.|.|+.+|.+. ......+..+.+.
T Consensus 375 ~~~~~c~~~~~~~~~~~~~~~~~~~~~~c 403 (487)
T KOG1217|consen 375 PGSYRCACPAGFAGKANGDGVGCEDIDEC 403 (487)
T ss_pred CCCeEecCCCccccCCccccccccccccc
Confidence 899999999999984 2222345555443
No 4
>KOG1217|consensus
Probab=99.38 E-value=3.3e-11 Score=112.47 Aligned_cols=262 Identities=31% Similarity=0.619 Sum_probs=171.7
Q ss_pred EEEEEecCCceecccC---CcCCC--CCCCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeC
Q psy2857 4 VVLVRILLGVRAIVDI---NECQS--NPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT 78 (332)
Q Consensus 4 ~~~c~c~~g~~~~~~~---~~C~~--~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~ 78 (332)
...|.|..||.+.... ++|.. .+|.+.+.|.+..++|.|.|.+||.+..+..- ...+.|++.
T Consensus 151 ~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~-------------~~~~~c~~~ 217 (487)
T KOG1217|consen 151 PFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT-------------GNGGTCVDS 217 (487)
T ss_pred ceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC-------------CCCceEecc
Confidence 4678999999999432 68874 34998999999999999999999999765311 223455444
Q ss_pred CCCeeeeCCCCCCCCCCCccceeeccCCCCCCCC-ccccCCCcccCceecCCCCccCCC-CeeecCCCCCCCCCCCCCCc
Q psy2857 79 VPGFNCLCPKGYSGKPDAKVACEQVDVTSECSSN-FECVNNAECVDGLCYCRPGFDARG-SVCVDVDECQLGDPCGPQAQ 156 (332)
Q Consensus 79 ~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~-~~C~~~~~c~~~~c~C~~g~~~~g-~~c~~~~~C~~~~~C~~~~~ 156 (332)
+.|.+..++.+........ .+... ..|.+... .+.|.+++||.+.. ..+.++++|.....|.++++
T Consensus 218 ---~~~~~~~g~~~~~c~~~~~-------~~~~~~~~c~~~~~--~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~ 285 (487)
T KOG1217|consen 218 ---VACSCPPGARGPECEVSIV-------ECASGDGTCVNTVG--SYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGT 285 (487)
T ss_pred ---eeccCCCCCCCCCcccccc-------cccCCCCcccccCC--ceeeeCCCCccccccceeeeccccCCCCccCCCCe
Confidence 5778888887653321100 11111 12222111 25888999998655 46778888974324888999
Q ss_pred eecCCCceEeeCCCCCccCCCCccccc-cC---------cccceeeccccCcccccccCCcccccceecccccceecccc
Q psy2857 157 CTNTPGSFRCDCVEGYVGAPPRIKCKD-VR---------WEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLK 226 (332)
Q Consensus 157 C~~~~~~~~C~C~~G~~g~~~~~~c~~-~~---------c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~ 226 (332)
|++..+.|.|.|++||.+..+ ..+.+ .. |.++..|. .......+.|.+..
T Consensus 286 C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~g~~C~---------~~~~~~~~~C~c~~---------- 345 (487)
T KOG1217|consen 286 CVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCANGGTCN---------TLGSFGGFRCACGP---------- 345 (487)
T ss_pred eecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCCCcccc---------cCCCCCCCCcCCCC----------
Confidence 999998899999999999876 22222 11 33333331 00111234455554
Q ss_pred cEEecccCcccccccc-CccCCCCCCCCCeeee-cCCceeeecCCCcccC---CCCcccccccccCCCCCCCCCCeeecC
Q psy2857 227 LFVIEDAKRNLNRVDI-NECQSNPCGVNATCID-TQGSYSCVCKEHYTGD---PYQACSDIDECKALDKPCGLRAICENT 301 (332)
Q Consensus 227 g~~~~~~~~~~~~~~~-~~C~~~~C~~~~~C~~-~~~~~~C~C~~G~~g~---~~~~C~~~d~C~~~~~~C~~~~~C~~~ 301 (332)
+|++. .+... ++|...++..++.|++ ..++|.|.|+.+|.+. ....+.++++|.. .+.|++.
T Consensus 346 ~~~g~------~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~-------~~~c~~~ 412 (487)
T KOG1217|consen 346 GFTGR------RCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG-------CGDCVNG 412 (487)
T ss_pred CCCCC------ccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC-------Ccceecc
Confidence 44433 23334 4888888889999999 7899999999999984 2345777888754 4578888
Q ss_pred CCceeeeCCCCCCCCCCCCcccccccc
Q psy2857 302 VPGFNCLCPKGYSGKPDAKVACEQEKA 328 (332)
Q Consensus 302 ~g~~~C~C~~g~~g~~~~~~~c~~~~~ 328 (332)
.++|.|. ++ + ..... .|.++.+
T Consensus 413 ~~~~~c~-~~-~-~~~~~--~~~~~~~ 434 (487)
T KOG1217|consen 413 PGGGACT-PP-G-LVSPG--TCDDIDE 434 (487)
T ss_pred CCCCccc-cC-c-ccCCc--ceecccc
Confidence 9999999 77 4 33222 4555444
No 5
>KOG1219|consensus
Probab=99.37 E-value=1.1e-12 Score=130.89 Aligned_cols=111 Identities=34% Similarity=0.844 Sum_probs=92.8
Q ss_pred CCCCCCCCCCCCCceecCC-CceEeeCCCCCccCCCCccccccCcccceeeccccCcccccccCCcccccceecccccce
Q psy2857 143 DECQLGDPCGPQAQCTNTP-GSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIF 221 (332)
Q Consensus 143 ~~C~~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~ 221 (332)
+.|. .++|+++|+|...+ +.|.|.|++-|.|..
T Consensus 3865 d~C~-~npCqhgG~C~~~~~ggy~CkCpsqysG~~--------------------------------------------- 3898 (4289)
T KOG1219|consen 3865 DPCN-DNPCQHGGTCISQPKGGYKCKCPSQYSGNH--------------------------------------------- 3898 (4289)
T ss_pred cccc-cCcccCCCEecCCCCCceEEeCcccccCcc---------------------------------------------
Confidence 5565 57788888887665 667788887777643
Q ss_pred ecccccEEecccCccccccccCccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecC
Q psy2857 222 SKHLKLFVIEDAKRNLNRVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT 301 (332)
Q Consensus 222 ~~~~~g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~ 301 (332)
+..++..|.++||..+++|+...++|.|.|+.||+|..|+. ..+++|+. ++|.++|.|+|+
T Consensus 3899 ----------------CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~-~Gi~eCs~--n~C~~gg~C~n~ 3959 (4289)
T KOG1219|consen 3899 ----------------CEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA-RGISECSK--NVCGTGGQCINI 3959 (4289)
T ss_pred ----------------cccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec-cccccccc--ccccCCceeecc
Confidence 34566789999999999999999999999999999998873 23999985 799999999999
Q ss_pred CCceeeeCCCCCCCCCC
Q psy2857 302 VPGFNCLCPKGYSGKPD 318 (332)
Q Consensus 302 ~g~~~C~C~~g~~g~~~ 318 (332)
.|+|+|.|-+||.|..+
T Consensus 3960 ~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3960 PGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred CCceEeccChhHhcccC
Confidence 99999999999998853
No 6
>KOG1219|consensus
Probab=99.20 E-value=4.4e-11 Score=119.81 Aligned_cols=111 Identities=40% Similarity=1.008 Sum_probs=96.3
Q ss_pred CcCCCCCCCCCCeeeeC-CCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeeeCCCCCCCCCCCcc
Q psy2857 20 NECQSNPCGVNATCIDT-QGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAKV 98 (332)
Q Consensus 20 ~~C~~~~C~~~g~C~~~-~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~ 98 (332)
++|..+||.++|+|+.. .++|.|.|++.|.|..|+ .++.+|. +.||..+++|+-..++|.|.|+.||+|.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CE--i~~epC~--snPC~~GgtCip~~n~f~CnC~~gyTG~----- 3935 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCE--IDLEPCA--SNPCLTGGTCIPFYNGFLCNCPNGYTGK----- 3935 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccc--ccccccc--CCCCCCCCEEEecCCCeeEeCCCCccCc-----
Confidence 78999999999999998 478999999999998876 4778886 4899999999999999999999999987
Q ss_pred ceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCCCceecCCCceEeeCCCCCccCCC
Q psy2857 99 ACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQAQCTNTPGSFRCDCVEGYVGAPP 177 (332)
Q Consensus 99 ~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~ 177 (332)
+|+.- .+++|+ .++|.+++.|++..|+|.|.|-+||.|..+
T Consensus 3936 ~Ce~~-------------------------------------Gi~eCs-~n~C~~gg~C~n~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3936 RCEAR-------------------------------------GISECS-KNVCGTGGQCINIPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred eeecc-------------------------------------cccccc-cccccCCceeeccCCceEeccChhHhcccC
Confidence 44320 266886 789999999999999999999999998654
No 7
>KOG1225|consensus
Probab=99.10 E-value=2e-09 Score=97.82 Aligned_cols=155 Identities=25% Similarity=0.550 Sum_probs=110.6
Q ss_pred CeeeeCCCCCCCCCCCccceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCCCceecC
Q psy2857 81 GFNCLCPKGYSGKPDAKVACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQAQCTNT 160 (332)
Q Consensus 81 ~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~~~C~~~ 160 (332)
.+.|.|..+|.+..+....|+. .|..++.|+.+.|.|++||. |..|.. ..|. ..|+.++.+++.
T Consensus 233 ~~ic~c~~~~~g~~c~~~~C~~-----------~c~~~g~c~~G~CIC~~Gf~--G~dC~e-~~Cp--~~cs~~g~~~~g 296 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCSTIYCPG-----------GCTGRGQCVEGRCICPPGFT--GDDCDE-LVCP--VDCSGGGVCVDG 296 (525)
T ss_pred CceeecCCceeCCccccccCCC-----------CCcccceEeCCeEeCCCCCc--CCCCCc-ccCC--cccCCCceecCC
Confidence 4479999999998655433322 56667889999999999999 888865 3463 448887887755
Q ss_pred CCceEeeCCCCCccCCCCccccccCcccceeeccccCcccccccCCcccccceecccccceecccccEEecccCcccccc
Q psy2857 161 PGSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRV 240 (332)
Q Consensus 161 ~~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~ 240 (332)
.|.|++||+|..+..+=-..+|..++.|. ...|.|.+ ||++..|..
T Consensus 297 ----~CiC~~g~~G~dCs~~~cpadC~g~G~Ci---------------~G~C~C~~----------Gy~G~~C~~----- 342 (525)
T KOG1225|consen 297 ----ECICNPGYSGKDCSIRRCPADCSGHGKCI---------------DGECLCDE----------GYTGELCIQ----- 342 (525)
T ss_pred ----EeecCCCccccccccccCCccCCCCCccc---------------CCceEeCC----------CCcCCcccc-----
Confidence 89999999999886532335677777776 67899999 999875542
Q ss_pred ccCccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecCC
Q psy2857 241 DINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTV 302 (332)
Q Consensus 241 ~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~~ 302 (332)
..|..++.|++ .|.|..||.|.+ . .-+.+.. ...|.....|+...
T Consensus 343 -------~~C~~~g~cv~-----gC~C~~Gw~G~d-~---~~~~~~~-~~~cs~~~~~~~~~ 387 (525)
T KOG1225|consen 343 -------RACSGGGQCVN-----GCKCKKGWRGPD-V---ADPSLLL-ITECSPPSLCIAGV 387 (525)
T ss_pred -------cccCCCceecc-----CceeccCccCCC-c---CCchhhc-ccccCCCceeeccc
Confidence 23778888887 399999999986 1 1222222 23566666776665
No 8
>KOG1225|consensus
Probab=99.07 E-value=6.7e-10 Score=100.85 Aligned_cols=131 Identities=27% Similarity=0.744 Sum_probs=100.7
Q ss_pred EEEecCCceecccCCcCCCCCCCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeee
Q psy2857 6 LVRILLGVRAIVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCL 85 (332)
Q Consensus 6 ~c~c~~g~~~~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~ 85 (332)
+|.|..+|++......=.+..|..++.|++.. |+|++||+|.++.. ..| +..|+.++.+++. .|.
T Consensus 235 ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~G~----CIC~~Gf~G~dC~e----~~C---p~~cs~~g~~~~g----~Ci 299 (525)
T KOG1225|consen 235 ICECPEGYFGPLCSTIYCPGGCTGRGQCVEGR----CICPPGFTGDDCDE----LVC---PVDCSGGGVCVDG----ECI 299 (525)
T ss_pred eeecCCceeCCccccccCCCCCcccceEeCCe----EeCCCCCcCCCCCc----ccC---CcccCCCceecCC----Eee
Confidence 58899999998655444445577778898875 99999999987742 223 3448777877766 899
Q ss_pred CCCCCCCCCCCccceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeeecCCCCCCCCCCCCCCceecCCCceE
Q psy2857 86 CPKGYSGKPDAKVACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCVDVDECQLGDPCGPQAQCTNTPGSFR 165 (332)
Q Consensus 86 C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~~~~~C~~~~~C~~~~~C~~~~~~~~ 165 (332)
|.+||+|..++..+|. ..|..++.|++++|.|.+||+ |..|... .|.+++.|++.
T Consensus 300 C~~g~~G~dCs~~~cp-----------adC~g~G~Ci~G~C~C~~Gy~--G~~C~~~-------~C~~~g~cv~g----- 354 (525)
T KOG1225|consen 300 CNPGYSGKDCSIRRCP-----------ADCSGHGKCIDGECLCDEGYT--GELCIQR-------ACSGGGQCVNG----- 354 (525)
T ss_pred cCCCccccccccccCC-----------ccCCCCCcccCCceEeCCCCc--CCccccc-------ccCCCceeccC-----
Confidence 9999999977654432 378999999999999999998 7777533 27788888752
Q ss_pred eeCCCCCccCC
Q psy2857 166 CDCVEGYVGAP 176 (332)
Q Consensus 166 C~C~~G~~g~~ 176 (332)
|.|..||.|.+
T Consensus 355 C~C~~Gw~G~d 365 (525)
T KOG1225|consen 355 CKCKKGWRGPD 365 (525)
T ss_pred ceeccCccCCC
Confidence 89999999876
No 9
>KOG4260|consensus
Probab=98.95 E-value=1.5e-09 Score=88.18 Aligned_cols=135 Identities=29% Similarity=0.623 Sum_probs=89.6
Q ss_pred CeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceee---CCCCeeeeCCCCCCCCCCCccceeeccC--
Q psy2857 31 ATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICEN---TVPGFNCLCPKGYSGKPDAKVACEQVDV-- 105 (332)
Q Consensus 31 g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~---~~~~~~C~C~~g~~~~~~~~~~c~~~~~-- 105 (332)
..|+... .--|+.|..|.++..|... ...+|..++.|.. ..|+-.|.|..||.|..+..---+....
T Consensus 122 WlCvdqL---kvCCp~gtyGpdCl~Cpgg-----ser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~R 193 (350)
T KOG4260|consen 122 WLCVDQL---KVCCPDGTYGPDCLQCPGG-----SERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSR 193 (350)
T ss_pred Hhhhhhh---eeccCCCCcCCccccCCCC-----CcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhc
Confidence 3454443 3447889999888766542 2367998999963 3456799999999998543210000000
Q ss_pred ---CCCCCC-CccccCCCccc----CceecCCCCccCCCCeeecCCCCC-CCCCCCCCCceecCCCceEeeCCCCCccC
Q psy2857 106 ---TSECSS-NFECVNNAECV----DGLCYCRPGFDARGSVCVDVDECQ-LGDPCGPQAQCTNTPGSFRCDCVEGYVGA 175 (332)
Q Consensus 106 ---~~~c~~-~~~C~~~~~c~----~~~c~C~~g~~~~g~~c~~~~~C~-~~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 175 (332)
...|.+ ...|.. .|. ..--.|..||..+...|+|+++|. .+.+|..+..|+|+.|+|.|...+||.+.
T Consensus 194 ne~~lvCt~Ch~~C~~--~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g 270 (350)
T KOG4260|consen 194 NEQHLVCTACHEGCLG--VCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG 270 (350)
T ss_pred ccccchhhhhhhhhhc--ccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC
Confidence 001111 011211 111 123468999999889999999997 67899999999999999999999999863
No 10
>KOG4289|consensus
Probab=98.93 E-value=1.5e-09 Score=105.48 Aligned_cols=79 Identities=32% Similarity=0.745 Sum_probs=65.8
Q ss_pred cEEecccCccccccccCccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecC-CCce
Q psy2857 227 LFVIEDAKRNLNRVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT-VPGF 305 (332)
Q Consensus 227 g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~-~g~~ 305 (332)
|||+.+ +..++|+|-+.||.++++|....|+|+|.|++||+|.+|+.-...-.|.. +.|.++++|++. .|+|
T Consensus 1229 GFTgd~-----CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvp--GvC~nggtC~~~~nggf 1301 (2531)
T KOG4289|consen 1229 GFTGDY-----CETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVP--GVCKNGGTCVNLLNGGF 1301 (2531)
T ss_pred CCCccc-----ccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcccc--ceecCCCEEeecCCCce
Confidence 666653 35688999999999999999999999999999999998873333455654 689999999865 6889
Q ss_pred eeeCCCC
Q psy2857 306 NCLCPKG 312 (332)
Q Consensus 306 ~C~C~~g 312 (332)
.|.|+.|
T Consensus 1302 ~c~Cp~g 1308 (2531)
T KOG4289|consen 1302 CCHCPYG 1308 (2531)
T ss_pred eccCCCc
Confidence 9999988
No 11
>KOG4289|consensus
Probab=98.92 E-value=1.3e-09 Score=105.88 Aligned_cols=83 Identities=31% Similarity=0.740 Sum_probs=73.1
Q ss_pred EEEEecCCceec---ccCCcCCCCCCCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeC-CC
Q psy2857 5 VLVRILLGVRAI---VDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENT-VP 80 (332)
Q Consensus 5 ~~c~c~~g~~~~---~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~-~~ 80 (332)
..|+|++||+|+ +.||.|-+.||+++|.|..-.|+|.|.|.+||+|..|+.-.....| .+..|+++++|.+. .+
T Consensus 1222 lrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrC--vpGvC~nggtC~~~~ng 1299 (2531)
T KOG4289|consen 1222 LRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRC--VPGVCKNGGTCVNLLNG 1299 (2531)
T ss_pred eeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCcc--ccceecCCCEEeecCCC
Confidence 579999999999 7799999999999999999999999999999999887644455566 36899999999886 56
Q ss_pred CeeeeCCCC
Q psy2857 81 GFNCLCPKG 89 (332)
Q Consensus 81 ~~~C~C~~g 89 (332)
+|.|.|+.|
T Consensus 1300 gf~c~Cp~g 1308 (2531)
T KOG4289|consen 1300 GFCCHCPYG 1308 (2531)
T ss_pred ceeccCCCc
Confidence 899999987
No 12
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx (disulphide bonds: C1-C3, C2-C4, C5-C6); 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.79 E-value=4e-09 Score=62.62 Aligned_cols=38 Identities=39% Similarity=0.804 Sum_probs=34.3
Q ss_pred ccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCC
Q psy2857 281 DIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPD 318 (332)
Q Consensus 281 ~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~ 318 (332)
|||||....+.|..++.|+|+.|+|+|.|++||+....
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~ 38 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDD 38 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTT
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCC
Confidence 68999998899998999999999999999999995443
No 13
>KOG4260|consensus
Probab=98.74 E-value=1.8e-08 Score=81.97 Aligned_cols=161 Identities=30% Similarity=0.588 Sum_probs=98.1
Q ss_pred CCCCccCCCCeeecCCCCC--CCCCCCCCCceec---CCCceEeeCCCCCccCCCCcccc-----ccCcccceeeccccC
Q psy2857 128 CRPGFDARGSVCVDVDECQ--LGDPCGPQAQCTN---TPGSFRCDCVEGYVGAPPRIKCK-----DVRWEFNVTLLFYET 197 (332)
Q Consensus 128 C~~g~~~~g~~c~~~~~C~--~~~~C~~~~~C~~---~~~~~~C~C~~G~~g~~~~~~c~-----~~~c~~~~~c~~~~~ 197 (332)
|++|-. |..|. .|. ...+|..++.|.. ..|+..|.|.+||.|..+.. |. ..+=..+..|.-...
T Consensus 132 Cp~gty--GpdCl---~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~-Cg~eyfes~Rne~~lvCt~Ch~ 205 (350)
T KOG4260|consen 132 CPDGTY--GPDCL---QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRY-CGIEYFESSRNEQHLVCTACHE 205 (350)
T ss_pred cCCCCc--CCccc---cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccc-cchHHHHhhcccccchhhhhhh
Confidence 555554 55553 221 1245777777763 34778999999999976532 10 000001111110000
Q ss_pred cccccccCCcccccceecc-cccceecccccEEecccCccccccccCccCC--CCCCCCCeeeecCCceeeecCCCcccC
Q psy2857 198 DYLHSVASDISDILTIIHE-FSRIFSKHLKLFVIEDAKRNLNRVDINECQS--NPCGVNATCIDTQGSYSCVCKEHYTGD 274 (332)
Q Consensus 198 ~~~~~~~~~~~~~~c~c~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g~ 274 (332)
... . .|.. .....+.+..||... ...|+|||+|.. .||..+..|+|+.|||.|.+++||.+.
T Consensus 206 ~C~---------~--~Csg~~~k~C~kCkkGW~ld----e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g 270 (350)
T KOG4260|consen 206 GCL---------G--VCSGESSKGCSKCKKGWKLD----EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG 270 (350)
T ss_pred hhh---------c--ccCCCCCCChhhhcccceec----ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC
Confidence 000 0 0111 122344556688665 346899999965 679999999999999999999999762
Q ss_pred CCCcccccccccCCCCCCC-CCCeeecCCCceeeeCCCCCCCC
Q psy2857 275 PYQACSDIDECKALDKPCG-LRAICENTVPGFNCLCPKGYSGK 316 (332)
Q Consensus 275 ~~~~C~~~d~C~~~~~~C~-~~~~C~~~~g~~~C~C~~g~~g~ 316 (332)
+|+|......|. .+..|.|+.++|+|+|..|+.-.
T Consensus 271 -------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 271 -------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLIII 306 (350)
T ss_pred -------hHHhhhhhhhcccCCCCcccCCccEEEEecccceee
Confidence 466654333443 25789999999999999887543
No 14
>KOG0994|consensus
Probab=98.74 E-value=7.7e-08 Score=92.32 Aligned_cols=138 Identities=22% Similarity=0.440 Sum_probs=68.8
Q ss_pred CCceEeeCCCCCccCCCCccccccCcccceeeccccCcccccccCCcccccceecccccceecccccEEecccCc---cc
Q psy2857 161 PGSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKR---NL 237 (332)
Q Consensus 161 ~~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~---~~ 237 (332)
.|.+--.|..||.|+.....|+...|..-++-. ....+..+..|.|.+ .+.+..|.+ +.
T Consensus 996 eG~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~--------~~~CDr~tGQCpClp----------Nv~G~~CDqCA~N~ 1057 (1758)
T KOG0994|consen 996 EGDHCEHCKDGFYGDALRQNCQRCVCNFLGTNS--------TCHCDRFTGQCPCLP----------NVQGVRCDQCAENH 1057 (1758)
T ss_pred cccchhhccccchhHHHHhhhhhheccccccCC--------ccccccccCcCCCCc----------ccccccccccccch
Confidence 344434789999999888777766654433321 122344455566665 222222210 00
Q ss_pred ccc-ccCccCCCCCC--CCCeeeecCCceeeecCCCcccCCCCcccccc-----------cccCC---CCCCCC-CC--e
Q psy2857 238 NRV-DINECQSNPCG--VNATCIDTQGSYSCVCKEHYTGDPYQACSDID-----------ECKAL---DKPCGL-RA--I 297 (332)
Q Consensus 238 ~~~-~~~~C~~~~C~--~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d-----------~C~~~---~~~C~~-~~--~ 297 (332)
--. .-..|++-.|. ..-+|....| +|+|++||-|..|..|++.- +|..- .-.|.. .| .
T Consensus 1058 w~laSG~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~ 1135 (1758)
T KOG0994|consen 1058 WNLASGEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCV 1135 (1758)
T ss_pred hccccCCCCCccCCCccCCcccccccc--ceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCcee
Confidence 000 00012211121 1225655555 89999999998776665421 12110 012332 22 3
Q ss_pred eecCCCceeee-CCCCCCCCCC
Q psy2857 298 CENTVPGFNCL-CPKGYSGKPD 318 (332)
Q Consensus 298 C~~~~g~~~C~-C~~g~~g~~~ 318 (332)
|....++++|. |..||.|.-.
T Consensus 1136 C~~Gv~G~rCdqCaRgy~G~fP 1157 (1758)
T KOG0994|consen 1136 CRPGVGGPRCDQCARGYSGQFP 1157 (1758)
T ss_pred ecCCCCCcchhhhhhhhcCCCC
Confidence 44566667773 7777777643
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, with disulphide bonds C1-C3, C2-C4 and C5-C6. 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.25 E-value=1.1e-06 Score=52.07 Aligned_cols=34 Identities=47% Similarity=1.037 Sum_probs=29.7
Q ss_pred ccCccCC--CCCCCCCeeeecCCceeeecCCCcccC
Q psy2857 241 DINECQS--NPCGVNATCIDTQGSYSCVCKEHYTGD 274 (332)
Q Consensus 241 ~~~~C~~--~~C~~~~~C~~~~~~~~C~C~~G~~g~ 274 (332)
|||||.. ..|..++.|+|+.|+|+|.|++||...
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 5789976 469889999999999999999999843
No 16
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=98.14 E-value=2.1e-06 Score=47.40 Aligned_cols=30 Identities=53% Similarity=1.198 Sum_probs=27.0
Q ss_pred cCCCCCCCCCeeeecC-CceeeecCCCcccC
Q psy2857 245 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGD 274 (332)
Q Consensus 245 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~ 274 (332)
|.++||.++++|++.. ++|+|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4457899999999999 99999999999986
No 17
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=98.02 E-value=1e-05 Score=47.08 Aligned_cols=35 Identities=49% Similarity=1.099 Sum_probs=29.9
Q ss_pred ccccccCCCCCCCCCCeeecCCCceeeeCCCCCC-CC
Q psy2857 281 DIDECKALDKPCGLRAICENTVPGFNCLCPKGYS-GK 316 (332)
Q Consensus 281 ~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~-g~ 316 (332)
++|+|... .+|.++++|+++.++|.|.|++||+ |.
T Consensus 1 d~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCC-CCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 46888754 6899888999999999999999999 54
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.99 E-value=1.2e-05 Score=46.70 Aligned_cols=35 Identities=54% Similarity=1.138 Sum_probs=30.6
Q ss_pred cCCcCCC-CCCCCCCeeeeCCCCeEeeCCCCCc-cCC
Q psy2857 18 DINECQS-NPCGVNATCIDTQGSYSCVCKEHYT-GDP 52 (332)
Q Consensus 18 ~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~-g~~ 52 (332)
++++|.. .+|..+++|+++.++|.|.|++||. |..
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~ 37 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN 37 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCc
Confidence 4688887 7899889999999999999999998 643
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.97 E-value=4.3e-06 Score=46.17 Aligned_cols=30 Identities=53% Similarity=1.198 Sum_probs=27.2
Q ss_pred CCCCCCCCCCeeeeCC-CCeEeeCCCCCccC
Q psy2857 22 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGD 51 (332)
Q Consensus 22 C~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~ 51 (332)
|.+.+|.++|+|++.. +.|.|+|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5667899999999998 99999999999985
No 20
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.91 E-value=4.7e-06 Score=47.11 Aligned_cols=32 Identities=31% Similarity=0.664 Sum_probs=25.4
Q ss_pred cCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857 286 KALDKPCGLRAICENTVPGFNCLCPKGYSGKP 317 (332)
Q Consensus 286 ~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 317 (332)
....+.|..+++|+++.++|.|+|++||+|+.
T Consensus 2 ~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 2 LENNGGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp TTGGGGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 34457899999999999999999999999985
No 21
>KOG1226|consensus
Probab=97.89 E-value=0.00016 Score=67.96 Aligned_cols=138 Identities=30% Similarity=0.671 Sum_probs=87.9
Q ss_pred CCCCCeeeeCCCCeEeeCCCCCccCCCCCCC-------CcccccCC--CCCCCCCCceeeCCCCeeeeCCCCCCCCCCCc
Q psy2857 27 CGVNATCIDTQGSYSCVCKEHYTGDPYQACS-------DIDECKAL--DKPCGLRAICENTVPGFNCLCPKGYSGKPDAK 97 (332)
Q Consensus 27 C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~-------~~~~c~~~--~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~ 97 (332)
|+.+|...-.. |.|.+||.|..++--. ..+.|... ..+|+.+|.|.=. .|.|.+...+.-.++
T Consensus 469 C~g~G~~~CG~----C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~ 540 (783)
T KOG1226|consen 469 CHGNGTFVCGQ----CRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK 540 (783)
T ss_pred cCCCCcEEecc----eecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeee
Confidence 55555554443 8999999998774111 12334321 2379999999776 789988776432222
Q ss_pred cceeeccCCCCCCCCccccCCCcccCceecCCCCccCCCCeee---cCCCCC--CCCCCCCCCceecCCCceEeeCCCC-
Q psy2857 98 VACEQVDVTSECSSNFECVNNAECVDGLCYCRPGFDARGSVCV---DVDECQ--LGDPCGPQAQCTNTPGSFRCDCVEG- 171 (332)
Q Consensus 98 ~~c~~~~~~~~c~~~~~C~~~~~c~~~~c~C~~g~~~~g~~c~---~~~~C~--~~~~C~~~~~C~~~~~~~~C~C~~G- 171 (332)
.|+.-++.=+-.....|..++.|.-++|.|.+||+ |..|. +.+.|. ....|..+++|.=. +|.|...
T Consensus 541 -fCECDnfsC~r~~g~lC~g~G~C~CG~CvC~~Gwt--G~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~~~~ 613 (783)
T KOG1226|consen 541 -FCECDNFSCERHKGVLCGGHGRCECGRCVCNPGWT--GSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCTDPP 613 (783)
T ss_pred -eeeccCcccccccCcccCCCCeEeCCcEEcCCCCc--cCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcCCCC
Confidence 56544432111234578889999999999999999 66663 455565 23467777777643 6777655
Q ss_pred CccCCCCc
Q psy2857 172 YVGAPPRI 179 (332)
Q Consensus 172 ~~g~~~~~ 179 (332)
|.|..++.
T Consensus 614 ~sG~~CE~ 621 (783)
T KOG1226|consen 614 YSGEFCEK 621 (783)
T ss_pred cCcchhhc
Confidence 88876654
No 22
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.86 E-value=1.2e-05 Score=40.64 Aligned_cols=23 Identities=48% Similarity=1.028 Sum_probs=19.3
Q ss_pred ceeeecCCCcccCCC-Cccccccc
Q psy2857 262 SYSCVCKEHYTGDPY-QACSDIDE 284 (332)
Q Consensus 262 ~~~C~C~~G~~g~~~-~~C~~~d~ 284 (332)
||+|+|++||++... ..|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 689999999997644 67999986
No 23
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.65 E-value=9.2e-05 Score=42.53 Aligned_cols=35 Identities=54% Similarity=1.146 Sum_probs=30.2
Q ss_pred cCCcCCC-CCCCCCCeeeeCCCCeEeeCCCCCccCC
Q psy2857 18 DINECQS-NPCGVNATCIDTQGSYSCVCKEHYTGDP 52 (332)
Q Consensus 18 ~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~G~~g~~ 52 (332)
++++|.. .+|..++.|++..+.|.|.|++||.|..
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCc
Confidence 3578877 7898889999999999999999999853
No 24
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.65 E-value=4.8e-05 Score=38.43 Aligned_cols=24 Identities=38% Similarity=1.035 Sum_probs=21.1
Q ss_pred ceeeeCCCCCCCCCCCCcccccccc
Q psy2857 304 GFNCLCPKGYSGKPDAKVACEQEKA 328 (332)
Q Consensus 304 ~~~C~C~~g~~g~~~~~~~c~~~~~ 328 (332)
||+|.|++||...+.++ .|.+|+|
T Consensus 1 sy~C~C~~Gy~l~~d~~-~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGR-SCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCC-ccccCCC
Confidence 68999999999998775 8999875
No 25
>KOG1836|consensus
Probab=97.64 E-value=0.0016 Score=67.96 Aligned_cols=140 Identities=21% Similarity=0.359 Sum_probs=70.0
Q ss_pred CCCceEeeCCCCCccCCCC----ccccccCcccceeeccccCcccccccCCcccccceeccccc--ceecccccEEeccc
Q psy2857 160 TPGSFRCDCVEGYVGAPPR----IKCKDVRWEFNVTLLFYETDYLHSVASDISDILTIIHEFSR--IFSKHLKLFVIEDA 233 (332)
Q Consensus 160 ~~~~~~C~C~~G~~g~~~~----~~c~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~c~c~~~~~--~~~~~~~g~~~~~~ 233 (332)
+.+.+.=.|.+||.|++-. ..|....|...+.-.. ....+....-|.|.+... ....++.|+..-..
T Consensus 863 T~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~-------~~~c~~~tGQcec~~~v~g~~c~~c~~g~fnl~s 935 (1705)
T KOG1836|consen 863 TAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELP-------SLTCNPVTGQCECKPNVEGRDCLYCFKGFFNLNS 935 (1705)
T ss_pred cccccccccccCccccccCCCcCCccccccCccCCcccc-------cccCCCcccceeccCCCCccccccccccccccCC
Confidence 3344444788999987654 3455544443333220 112223344555555332 23444555543321
Q ss_pred CccccccccCccCCCCCCC----CCeeeecCCceeeecCCCcccCCCCcccccc------cccCCCCCCCCC----Ceee
Q psy2857 234 KRNLNRVDINECQSNPCGV----NATCIDTQGSYSCVCKEHYTGDPYQACSDID------ECKALDKPCGLR----AICE 299 (332)
Q Consensus 234 ~~~~~~~~~~~C~~~~C~~----~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d------~C~~~~~~C~~~----~~C~ 299 (332)
. ..|++-+|.. +..|+...| +|.|.+|-+|.++..|.... .|.. -.|... ..|.
T Consensus 936 ~--------~gC~~c~c~~~gs~~~~c~~~tG--qc~c~~gVtgqrc~qc~~~~~~~~~~gc~~--c~c~~~Gs~~~qc~ 1003 (1705)
T KOG1836|consen 936 G--------VGCEPCNCDPTGSESSDCDVGTG--QCYCRPGVTGQRCDQCETYHFGFQTEGCGL--CECDPLGSRGFQCD 1003 (1705)
T ss_pred C--------CCcccccccccccccccccccCC--ceeeecCccccccCccccCcccccccCCcc--eecccCCcccceec
Confidence 1 1233333322 235665555 89999999998876554321 1110 112222 2455
Q ss_pred cCCCceeeeCCCCCCCCCCCC
Q psy2857 300 NTVPGFNCLCPKGYSGKPDAK 320 (332)
Q Consensus 300 ~~~g~~~C~C~~g~~g~~~~~ 320 (332)
...| +|.|++++.|..+..
T Consensus 1004 ~~~G--~c~c~~~~~g~~c~~ 1022 (1705)
T KOG1836|consen 1004 PEDG--QCPCRPGFEGRRCDQ 1022 (1705)
T ss_pred ccCC--eeeecCCCCCccccc
Confidence 5455 677888877765544
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.58 E-value=0.0001 Score=42.32 Aligned_cols=34 Identities=53% Similarity=1.134 Sum_probs=29.2
Q ss_pred cCccCC-CCCCCCCeeeecCCceeeecCCCcccCC
Q psy2857 242 INECQS-NPCGVNATCIDTQGSYSCVCKEHYTGDP 275 (332)
Q Consensus 242 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~~ 275 (332)
+++|.. .+|.+++.|++..++|.|.|++||.|..
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~ 36 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCc
Confidence 467776 6888889999999999999999999864
No 27
>KOG1226|consensus
Probab=97.54 E-value=0.00044 Score=65.15 Aligned_cols=134 Identities=28% Similarity=0.656 Sum_probs=86.5
Q ss_pred EEEecCCceeccc------------CCcCCCC----CCCCCCeeeeCCCCeEeeCCCCCc----cCCCCCCCCcccccCC
Q psy2857 6 LVRILLGVRAIVD------------INECQSN----PCGVNATCIDTQGSYSCVCKEHYT----GDPYQACSDIDECKAL 65 (332)
Q Consensus 6 ~c~c~~g~~~~~~------------~~~C~~~----~C~~~g~C~~~~g~~~C~C~~G~~----g~~~~~C~~~~~c~~~ 65 (332)
.|+|.+||.|... .+.|... +|+.+|.|.=.. |+|.+... |..++ | +-..|...
T Consensus 479 ~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGq----C~C~~~~~~~i~G~fCE-C-DnfsC~r~ 552 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQ----CVCHKPDNGKIYGKFCE-C-DNFSCERH 552 (783)
T ss_pred ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCc----eEecCCCCCceeeeeee-c-cCcccccc
Confidence 6899999999811 1334332 599999998876 99988766 54432 1 11223221
Q ss_pred -CCCCCCCCceeeCCCCeeeeCCCCCCCCCCCccceeeccCCCCCCC--CccccCCCcccCceecCCCC-ccCCCCeeec
Q psy2857 66 -DKPCGLRAICENTVPGFNCLCPKGYSGKPDAKVACEQVDVTSECSS--NFECVNNAECVDGLCYCRPG-FDARGSVCVD 141 (332)
Q Consensus 66 -~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~--~~~C~~~~~c~~~~c~C~~g-~~~~g~~c~~ 141 (332)
-..|+.++.|.-. +|.|.+||+|..+. |+ .-+..|.. ...|+..+.|.-++|.|.+. |. |..|..
T Consensus 553 ~g~lC~g~G~C~CG----~CvC~~GwtG~~C~---C~--~std~C~~~~G~iCSGrG~C~Cg~C~C~~~~~s--G~~CE~ 621 (783)
T KOG1226|consen 553 KGVLCGGHGRCECG----RCVCNPGWTGSACN---CP--LSTDTCESSDGQICSGRGTCECGRCKCTDPPYS--GEFCEK 621 (783)
T ss_pred cCcccCCCCeEeCC----cEEcCCCCccCCCC---CC--CCCccccCCCCceeCCCceeeCCceEcCCCCcC--cchhhc
Confidence 2358889999776 79999999999553 21 11223332 34788888888889999865 77 788854
Q ss_pred CCCCCCCCCCCCCCcee
Q psy2857 142 VDECQLGDPCGPQAQCT 158 (332)
Q Consensus 142 ~~~C~~~~~C~~~~~C~ 158 (332)
-..| +.+|.....|+
T Consensus 622 cptc--~~~C~~~~~Cv 636 (783)
T KOG1226|consen 622 CPTC--PDPCAENKSCV 636 (783)
T ss_pred CCCC--CCcccccccch
Confidence 3334 34566555554
No 28
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=97.50 E-value=6.1e-05 Score=42.49 Aligned_cols=29 Identities=38% Similarity=0.870 Sum_probs=23.3
Q ss_pred CCCCCCCeeecCCCceeeeCCCCCCCCCCCC
Q psy2857 290 KPCGLRAICENTVPGFNCLCPKGYSGKPDAK 320 (332)
Q Consensus 290 ~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~ 320 (332)
+.|. .+|++.+++|+|.|++||+..++.+
T Consensus 6 GgC~--h~C~~~~g~~~C~C~~Gy~L~~D~~ 34 (36)
T PF14670_consen 6 GGCS--HICVNTPGSYRCSCPPGYKLAEDGR 34 (36)
T ss_dssp GGSS--SEEEEETTSEEEE-STTEEE-TTSS
T ss_pred CCcC--CCCccCCCceEeECCCCCEECcCCC
Confidence 4563 6999999999999999999998765
No 29
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.43 E-value=0.0001 Score=41.68 Aligned_cols=29 Identities=52% Similarity=1.094 Sum_probs=23.4
Q ss_pred CCCCCCCeeeecCCceeeecCCCcccCCC
Q psy2857 248 NPCGVNATCIDTQGSYSCVCKEHYTGDPY 276 (332)
Q Consensus 248 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~~ 276 (332)
..|..+++|+++.++|.|+|++||.|++.
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 46888999999999999999999999864
No 30
>KOG0994|consensus
Probab=97.37 E-value=0.0016 Score=63.86 Aligned_cols=34 Identities=26% Similarity=0.734 Sum_probs=22.7
Q ss_pred eecCCCceEe-eCCCCCccCCC---CccccccCcccce
Q psy2857 157 CTNTPGSFRC-DCVEGYVGAPP---RIKCKDVRWEFNV 190 (332)
Q Consensus 157 C~~~~~~~~C-~C~~G~~g~~~---~~~c~~~~c~~~~ 190 (332)
|.+...++.| .|..||.|++. ...|+..+|-.+.
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp 915 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGP 915 (1758)
T ss_pred ccccccccchhhhhccccCCcccCCCCCCCCCCCCCCC
Confidence 4455566777 79999999864 3467766665443
No 31
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.23 E-value=9.3e-05 Score=57.55 Aligned_cols=140 Identities=27% Similarity=0.697 Sum_probs=86.9
Q ss_pred CCCCCeeeeCCCCeEeeCCCCCccCCCCCCCCcccccC---CCCCCCCCCceeeCC-----CCeeeeCCCCCCCCCCCcc
Q psy2857 27 CGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKA---LDKPCGLRAICENTV-----PGFNCLCPKGYSGKPDAKV 98 (332)
Q Consensus 27 C~~~g~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~c~~---~~~~C~~~~~C~~~~-----~~~~C~C~~g~~~~~~~~~ 98 (332)
|. +|..+.+...|.|.|.+||......+|+...+|.. ...+|..-+.|++.. ..|.|.|.+||.....
T Consensus 8 CK-NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~--- 83 (197)
T PF06247_consen 8 CK-NGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG--- 83 (197)
T ss_dssp -B-TEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS---
T ss_pred cc-CCEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC---
Confidence 54 67788888899999999999876677877777654 345798889998765 3699999999987654
Q ss_pred ceeeccCCCCCCCCccccCCCccc-------CceecCCCCcc-CCCCeeec--CCCCCCCCCCCCCCceecCCCceEeeC
Q psy2857 99 ACEQVDVTSECSSNFECVNNAECV-------DGLCYCRPGFD-ARGSVCVD--VDECQLGDPCGPQAQCTNTPGSFRCDC 168 (332)
Q Consensus 99 ~c~~~~~~~~c~~~~~C~~~~~c~-------~~~c~C~~g~~-~~g~~c~~--~~~C~~~~~C~~~~~C~~~~~~~~C~C 168 (332)
.|... .|.. ..|. .+.|+ ...|+|.-|.. .+...|.. ...|+ -.|..+..|....+-|.|.+
T Consensus 84 vCvp~----~C~~-~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~--LKCk~nE~CK~~~~~Y~C~~ 155 (197)
T PF06247_consen 84 VCVPN----KCNN-KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS--LKCKENEECKLVDGYYKCVC 155 (197)
T ss_dssp SEEEG----GGSS----T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE----------TTTEEEEEETTEEEEEE
T ss_pred eEchh----hcCc-eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee--eecCCCcceeeeCcEEEeec
Confidence 23211 1111 1333 23332 23799998887 56777753 34454 46778889999999999999
Q ss_pred CCCCccCCCC
Q psy2857 169 VEGYVGAPPR 178 (332)
Q Consensus 169 ~~G~~g~~~~ 178 (332)
..+|.++...
T Consensus 156 ~~~~~~~~~~ 165 (197)
T PF06247_consen 156 KEGFPGDGEG 165 (197)
T ss_dssp -TT-EEETTT
T ss_pred CCCCCCCCCc
Confidence 9999886544
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.17 E-value=0.00071 Score=38.07 Aligned_cols=28 Identities=39% Similarity=1.068 Sum_probs=25.3
Q ss_pred CCCCCCCCeeecCCCceeeeCCCCCCCC
Q psy2857 289 DKPCGLRAICENTVPGFNCLCPKGYSGK 316 (332)
Q Consensus 289 ~~~C~~~~~C~~~~g~~~C~C~~g~~g~ 316 (332)
..+|.+++.|++..++|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 3578888999999999999999999988
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.11 E-value=0.00096 Score=37.52 Aligned_cols=28 Identities=61% Similarity=1.280 Sum_probs=25.4
Q ss_pred CCCCCCCCeeeeCCCCeEeeCCCCCccC
Q psy2857 24 SNPCGVNATCIDTQGSYSCVCKEHYTGD 51 (332)
Q Consensus 24 ~~~C~~~g~C~~~~g~~~C~C~~G~~g~ 51 (332)
..+|..++.|++..+.|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5678888999999999999999999986
No 34
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.05 E-value=0.00082 Score=52.44 Aligned_cols=131 Identities=27% Similarity=0.751 Sum_probs=76.3
Q ss_pred ceecCCCCccC-CCCeeecCCCCCC----CCCCCCCCceecCC-----CceEeeCCCCCccCCCCccccccCcccceeec
Q psy2857 124 GLCYCRPGFDA-RGSVCVDVDECQL----GDPCGPQAQCTNTP-----GSFRCDCVEGYVGAPPRIKCKDVRWEFNVTLL 193 (332)
Q Consensus 124 ~~c~C~~g~~~-~g~~c~~~~~C~~----~~~C~~~~~C~~~~-----~~~~C~C~~G~~g~~~~~~c~~~~c~~~~~c~ 193 (332)
+.|.|.+||-. +...|....+|.. ..+|...++|++.. ..|.|.|.+||..... .|..
T Consensus 20 fEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~--vCvp---------- 87 (197)
T PF06247_consen 20 FECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG--VCVP---------- 87 (197)
T ss_dssp EEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS--SEEE----------
T ss_pred eEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC--eEch----------
Confidence 58999999953 3567777667752 35688888998765 5688999999886421 1111
Q ss_pred cccCcccccccCCcccccceecccccceecccccEEecccCccccccccCccCCCCCCCCCeeeec---CCceeeecCCC
Q psy2857 194 FYETDYLHSVASDISDILTIIHEFSRIFSKHLKLFVIEDAKRNLNRVDINECQSNPCGVNATCIDT---QGSYSCVCKEH 270 (332)
Q Consensus 194 ~~~~~~~~~~~~~~~~~~c~c~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~---~~~~~C~C~~G 270 (332)
..|....|. .|.|+-. +....|+|.-|
T Consensus 88 -------------------------------------------------~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IG 117 (197)
T PF06247_consen 88 -------------------------------------------------NKCNNKDCG-SGKCILDPDNPNNPTCSCNIG 117 (197)
T ss_dssp -------------------------------------------------GGGSS---T-TEEEEEEEGGGSEEEEEE-TE
T ss_pred -------------------------------------------------hhcCceecC-CCeEEecCCCCCCceeEeeec
Confidence 133334454 5667532 23458999999
Q ss_pred cccCCCCccccc--ccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCCC
Q psy2857 271 YTGDPYQACSDI--DECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDA 319 (332)
Q Consensus 271 ~~g~~~~~C~~~--d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~ 319 (332)
+..+....|... -+|.+ .|..+..|....+-|.|.|.++|.++...
T Consensus 118 kV~~dn~kCtk~G~T~C~L---KCk~nE~CK~~~~~Y~C~~~~~~~~~~~~ 165 (197)
T PF06247_consen 118 KVPDDNKKCTKTGETKCSL---KCKENEECKLVDGYYKCVCKEGFPGDGEG 165 (197)
T ss_dssp EETTTTTESEEEE-----------TTTEEEEEETTEEEEEE-TT-EEETTT
T ss_pred eEeccCCcccCCCccceee---ecCCCcceeeeCcEEEeecCCCCCCCCCc
Confidence 984444445432 33543 67788999999999999999999877654
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=97.00 E-value=0.0011 Score=37.19 Aligned_cols=26 Identities=42% Similarity=1.055 Sum_probs=23.5
Q ss_pred CCCCCCCeeecCCCceeeeCCCCCCCC
Q psy2857 290 KPCGLRAICENTVPGFNCLCPKGYSGK 316 (332)
Q Consensus 290 ~~C~~~~~C~~~~g~~~C~C~~g~~g~ 316 (332)
.+|.++ +|++..++|.|.|++||.|+
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccC
Confidence 578877 99999999999999999995
No 36
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.94 E-value=0.0017 Score=36.52 Aligned_cols=29 Identities=59% Similarity=1.300 Sum_probs=24.7
Q ss_pred CCC-CCCCCCCeeeeCCCCeEeeCCCCCccC
Q psy2857 22 CQS-NPCGVNATCIDTQGSYSCVCKEHYTGD 51 (332)
Q Consensus 22 C~~-~~C~~~g~C~~~~g~~~C~C~~G~~g~ 51 (332)
|.. .+|..+ .|++..++|.|.|++||.|.
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence 445 578877 99999999999999999983
No 37
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.47 E-value=0.0028 Score=35.71 Aligned_cols=24 Identities=33% Similarity=0.755 Sum_probs=19.3
Q ss_pred CCeeeecCCceeeecCCCcccCCC
Q psy2857 253 NATCIDTQGSYSCVCKEHYTGDPY 276 (332)
Q Consensus 253 ~~~C~~~~~~~~C~C~~G~~g~~~ 276 (332)
...|++.+++|+|.|++||++...
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L~~D 32 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKLAED 32 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE-TT
T ss_pred CCCCccCCCceEeECCCCCEECcC
Confidence 468999999999999999998743
No 38
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=96.16 E-value=0.0063 Score=50.78 Aligned_cols=41 Identities=32% Similarity=0.691 Sum_probs=35.4
Q ss_pred cccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCCCC
Q psy2857 278 ACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAK 320 (332)
Q Consensus 278 ~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~~~ 320 (332)
.|.++++|....++|. ..|+++.|+|.|.|+.||++....+
T Consensus 183 ~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~~~~~ 223 (224)
T cd01475 183 ICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALLEDNK 223 (224)
T ss_pred cCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCCCCCC
Confidence 5888999988777886 5899999999999999999887654
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.10 E-value=0.013 Score=32.08 Aligned_cols=26 Identities=27% Similarity=0.781 Sum_probs=21.7
Q ss_pred CCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857 290 KPCGLRAICENTVPGFNCLCPKGYSGKP 317 (332)
Q Consensus 290 ~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 317 (332)
..|.++|+|+...+ +|+|.+||+|..
T Consensus 6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~ 31 (32)
T PF07974_consen 6 NICSGHGTCVSPCG--RCVCDSGYTGPD 31 (32)
T ss_pred CccCCCCEEeCCCC--EEECCCCCcCCC
Confidence 46888999987644 999999999974
No 40
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.09 E-value=0.0039 Score=26.53 Aligned_cols=13 Identities=31% Similarity=0.810 Sum_probs=10.0
Q ss_pred eeecCCCcccCCC
Q psy2857 264 SCVCKEHYTGDPY 276 (332)
Q Consensus 264 ~C~C~~G~~g~~~ 276 (332)
+|+|++||+|..+
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999753
No 41
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.99 E-value=0.0091 Score=32.68 Aligned_cols=26 Identities=42% Similarity=0.953 Sum_probs=21.5
Q ss_pred CCCCCCCeeeecCCceeeecCCCcccCC
Q psy2857 248 NPCGVNATCIDTQGSYSCVCKEHYTGDP 275 (332)
Q Consensus 248 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~ 275 (332)
..|..+++|+...+ +|+|.+||+|..
T Consensus 6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~ 31 (32)
T PF07974_consen 6 NICSGHGTCVSPCG--RCVCDSGYTGPD 31 (32)
T ss_pred CccCCCCEEeCCCC--EEECCCCCcCCC
Confidence 35889999997644 999999999975
No 42
>KOG1836|consensus
Probab=94.33 E-value=0.79 Score=48.93 Aligned_cols=51 Identities=29% Similarity=0.619 Sum_probs=37.7
Q ss_pred EecCCceeccc--C-CcCCCCCCCCCCeeeeCC--CCeEee-CCCCCccCCCCCCCC
Q psy2857 8 RILLGVRAIVD--I-NECQSNPCGVNATCIDTQ--GSYSCV-CKEHYTGDPYQACSD 58 (332)
Q Consensus 8 ~c~~g~~~~~~--~-~~C~~~~C~~~g~C~~~~--g~~~C~-C~~G~~g~~~~~C~~ 58 (332)
+|.+||+|+.+ . +.|.+=+|...+.|..+. ..+.|. |++||+|..++.|.+
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence 47899999922 2 227777788888888774 456797 999999987776654
No 43
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=94.10 E-value=0.043 Score=30.80 Aligned_cols=31 Identities=32% Similarity=0.641 Sum_probs=21.9
Q ss_pred cCCCCCCCCCeeeecC-CceeeecCCCcccCC
Q psy2857 245 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGDP 275 (332)
Q Consensus 245 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~ 275 (332)
|...+|..++.|++.. |++.|.|..||....
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 4556788899999887 899999999998653
No 44
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.06 E-value=0.053 Score=30.42 Aligned_cols=30 Identities=33% Similarity=0.680 Sum_probs=21.6
Q ss_pred CCCCCCCCCCeeeeCC-CCeEeeCCCCCccC
Q psy2857 22 CQSNPCGVNATCIDTQ-GSYSCVCKEHYTGD 51 (332)
Q Consensus 22 C~~~~C~~~g~C~~~~-g~~~C~C~~G~~g~ 51 (332)
|....|..|+.|++.. |++.|.|..||...
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 4556788999999986 99999999999864
No 45
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.02 E-value=0.099 Score=43.59 Aligned_cols=38 Identities=32% Similarity=0.493 Sum_probs=29.6
Q ss_pred cccccCccCCCCCCCCCeeeecCCceeeecCCCcccCC
Q psy2857 238 NRVDINECQSNPCGVNATCIDTQGSYSCVCKEHYTGDP 275 (332)
Q Consensus 238 ~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~ 275 (332)
.|.+.++|...+......|.++.|+|.|.|++||++.+
T Consensus 183 ~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 183 ICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLE 220 (224)
T ss_pred cCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCC
Confidence 45677888654443456899999999999999998754
No 46
>KOG1218|consensus
Probab=91.19 E-value=9.7 Score=33.30 Aligned_cols=97 Identities=29% Similarity=0.572 Sum_probs=46.7
Q ss_pred CCeEeeCCCCCccCCCCCCCCcccccCCCCCCCCCCceeeCCCCeeeeCCCCCCCCCCCccceeeccCCCCCCCCccccC
Q psy2857 38 GSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPDAKVACEQVDVTSECSSNFECVN 117 (332)
Q Consensus 38 g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~C~~~~~C~~~~~~~~C~C~~g~~~~~~~~~~c~~~~~~~~c~~~~~C~~ 117 (332)
.+..|.|.++|+|. ..... ..... .+.. .+.......+|.+..+|.+..+.. .+........|.....|..
T Consensus 13 ~~~~c~c~~~~~g~-~~~~~-~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~c~~-~~~~~~~~~~c~~~~~c~~ 83 (316)
T KOG1218|consen 13 GSGQCFCDPGYTGR-LQCEH-QAVTS----ACSG--ICPCEVNSGECGLGYGFVGSVCRI-ECVCGNAGGGCSQPCRCKN 83 (316)
T ss_pred CCCceecCCCcccc-ccccC-CCCCc----cccc--cCCccCCceeEecccccCCCcccc-ccccCCCCCcccCccccCC
Confidence 35579999999985 11111 11111 1111 111122234778888888775433 2222222334444445555
Q ss_pred CCcccCceecC-CCCccCCCCeeecCCCC
Q psy2857 118 NAECVDGLCYC-RPGFDARGSVCVDVDEC 145 (332)
Q Consensus 118 ~~~c~~~~c~C-~~g~~~~g~~c~~~~~C 145 (332)
..........+ ..+|. +..|....++
T Consensus 84 ~~~~~~~~~~~~~~~~~--g~~C~~~~~~ 110 (316)
T KOG1218|consen 84 GGTCVSSTGYCHLNGYE--GPQCESPCPC 110 (316)
T ss_pred CCcccCCCCcccCCCCC--cccccCCCCc
Confidence 55555544455 46665 5666544443
No 47
>smart00051 DSL delta serrate ligand.
Probab=90.94 E-value=0.5 Score=30.49 Aligned_cols=47 Identities=21% Similarity=0.415 Sum_probs=30.6
Q ss_pred ceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857 262 SYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP 317 (332)
Q Consensus 262 ~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 317 (332)
.++-.|+++|.|..|. ..|.. .+....+..|.. .| .++|++||+|..
T Consensus 16 ~~rv~C~~~~yG~~C~-----~~C~~-~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~ 62 (63)
T smart00051 16 QIRVTCDENYYGEGCN-----KFCRP-RDDFFGHYTCDE-NG--NKGCLEGWMGPY 62 (63)
T ss_pred EEEeeCCCCCcCCccC-----CEeCc-CccccCCccCCc-CC--CEecCCCCcCCC
Confidence 3466899999999874 22322 123444567743 34 789999999864
No 48
>smart00051 DSL delta serrate ligand.
Probab=90.06 E-value=0.56 Score=30.23 Aligned_cols=46 Identities=15% Similarity=0.042 Sum_probs=31.5
Q ss_pred EEEEEecCCceecccCCcCCCCC-CCCCCeeeeCCCCeEeeCCCCCccCC
Q psy2857 4 VVLVRILLGVRAIVDINECQSNP-CGVNATCIDTQGSYSCVCKEHYTGDP 52 (332)
Q Consensus 4 ~~~c~c~~g~~~~~~~~~C~~~~-C~~~g~C~~~~g~~~C~C~~G~~g~~ 52 (332)
...-.|.++|.|......|.+.. ...+..|.... .++|.+||+|..
T Consensus 16 ~~rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~~G---~~~C~~Gw~G~~ 62 (63)
T smart00051 16 QIRVTCDENYYGEGCNKFCRPRDDFFGHYTCDENG---NKGCLEGWMGPY 62 (63)
T ss_pred EEEeeCCCCCcCCccCCEeCcCccccCCccCCcCC---CEecCCCCcCCC
Confidence 44567789999986556665432 45677775432 688999999864
No 49
>KOG1218|consensus
Probab=90.04 E-value=6.2 Score=34.55 Aligned_cols=40 Identities=38% Similarity=1.000 Sum_probs=23.9
Q ss_pred ccCceecCCCCccCCCCeeecCCC-CCCCCCCCCCCceecCCC
Q psy2857 121 CVDGLCYCRPGFDARGSVCVDVDE-CQLGDPCGPQAQCTNTPG 162 (332)
Q Consensus 121 c~~~~c~C~~g~~~~g~~c~~~~~-C~~~~~C~~~~~C~~~~~ 162 (332)
.....|.|.+||. +..+..... |.....+.+++.|....+
T Consensus 159 ~~~~~c~c~~g~~--g~~~~~~~~~c~~~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 159 CKNGICTCQPGFV--GVFCVESCSGCSPLTACENGAKCNRSTG 199 (316)
T ss_pred CCCCceeccCCcc--cccccccCCCcCCCcccCCCCeeecccc
Confidence 3455788999998 555543322 444455666667765543
No 50
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=83.56 E-value=1.5 Score=26.84 Aligned_cols=27 Identities=37% Similarity=1.046 Sum_probs=16.3
Q ss_pred CCCCccccCCCcccCceecCCCCccCC
Q psy2857 109 CSSNFECVNNAECVDGLCYCRPGFDAR 135 (332)
Q Consensus 109 c~~~~~C~~~~~c~~~~c~C~~g~~~~ 135 (332)
|.....|..++.|+.++|.|++||...
T Consensus 22 C~~~~qC~~~s~C~~g~C~C~~g~~~~ 48 (52)
T PF01683_consen 22 CESDEQCIGGSVCVNGRCQCPPGYVEV 48 (52)
T ss_pred CCCcCCCCCcCEEcCCEeECCCCCEec
Confidence 333345556666667777777777543
No 51
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=83.32 E-value=1.1 Score=27.12 Aligned_cols=24 Identities=42% Similarity=0.873 Sum_probs=16.8
Q ss_pred eeeecCCceeeecCCCcccCCCCccc
Q psy2857 255 TCIDTQGSYSCVCKEHYTGDPYQACS 280 (332)
Q Consensus 255 ~C~~~~~~~~C~C~~G~~g~~~~~C~ 280 (332)
.|....+ +|.|+++|+|..++.|.
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C~ 35 (49)
T PF00053_consen 12 TCDPSTG--QCVCKPGTTGPRCDQCK 35 (49)
T ss_dssp SEEETCE--EESBSTTEESTTS-EE-
T ss_pred cccCCCC--EEeccccccCCcCcCCC
Confidence 5655444 89999999999876544
No 52
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=81.26 E-value=1.8 Score=31.42 Aligned_cols=33 Identities=30% Similarity=0.719 Sum_probs=25.0
Q ss_pred cccccCCCCCCCCCCeeecCCCceeeeCCCCCCCC
Q psy2857 282 IDECKALDKPCGLRAICENTVPGFNCLCPKGYSGK 316 (332)
Q Consensus 282 ~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~ 316 (332)
.|.|+.. ..|+..+.|.. .....|.|++||...
T Consensus 77 ~d~Cd~y-~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 77 KDQCDVY-GFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred ccCCCCc-cccCCccEeCC-CCCCceECCCCcCCC
Confidence 4677764 58999999953 345579999999865
No 53
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=80.12 E-value=2.2 Score=25.94 Aligned_cols=22 Identities=36% Similarity=0.833 Sum_probs=15.3
Q ss_pred eeecCCceeeecCCCcccCCCCcc
Q psy2857 256 CIDTQGSYSCVCKEHYTGDPYQAC 279 (332)
Q Consensus 256 C~~~~~~~~C~C~~G~~g~~~~~C 279 (332)
|....| +|.|+++|+|..++.|
T Consensus 14 C~~~~G--~C~C~~~~~G~~C~~C 35 (50)
T cd00055 14 CDPGTG--QCECKPNTTGRRCDRC 35 (50)
T ss_pred ccCCCC--EEeCCCcCCCCCCCCC
Confidence 544444 8999999998876533
No 54
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=77.98 E-value=4.5 Score=24.71 Aligned_cols=24 Identities=38% Similarity=0.891 Sum_probs=17.1
Q ss_pred CCCCCCCCceecCCCceEeeCCCCCccC
Q psy2857 148 GDPCGPQAQCTNTPGSFRCDCVEGYVGA 175 (332)
Q Consensus 148 ~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 175 (332)
...|..++.|++. .|.|++||...
T Consensus 25 ~~qC~~~s~C~~g----~C~C~~g~~~~ 48 (52)
T PF01683_consen 25 DEQCIGGSVCVNG----RCQCPPGYVEV 48 (52)
T ss_pred cCCCCCcCEEcCC----EeECCCCCEec
Confidence 3455567788653 89999998753
No 55
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=76.19 E-value=4.8 Score=24.41 Aligned_cols=22 Identities=23% Similarity=0.608 Sum_probs=14.9
Q ss_pred eeeCCCCCCCCCCCCccccccccC
Q psy2857 306 NCLCPKGYSGKPDAKVACEQEKAG 329 (332)
Q Consensus 306 ~C~C~~g~~g~~~~~~~c~~~~~~ 329 (332)
+|.|+++|.|..... |.+.-.+
T Consensus 20 ~C~C~~~~~G~~C~~--C~~g~~~ 41 (50)
T cd00055 20 QCECKPNTTGRRCDR--CAPGYYG 41 (50)
T ss_pred EEeCCCcCCCCCCCC--CCCCCcc
Confidence 788888888887764 6554433
No 56
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=73.59 E-value=4.1 Score=30.00 Aligned_cols=36 Identities=31% Similarity=0.659 Sum_probs=25.1
Q ss_pred cCCcCCCCC---CCCCCeeeeCC--CCeEeeCCCCCccCCCC
Q psy2857 18 DINECQSNP---CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ 54 (332)
Q Consensus 18 ~~~~C~~~~---C~~~g~C~~~~--g~~~C~C~~G~~g~~~~ 54 (332)
++.+|.+.- |- ||.|.-.. ..++|.|..||+|..|+
T Consensus 41 ~i~~Cp~ey~~YCl-HG~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 41 AIRLCGPEGDGYCL-HGDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred ccccCChhhCCEeE-CCEEEeeccCCCceeECCCCccccccc
Confidence 345554432 54 46888764 67899999999997664
No 57
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=73.43 E-value=3.9 Score=24.38 Aligned_cols=21 Identities=38% Similarity=0.770 Sum_probs=14.6
Q ss_pred eeeecCCceeeecCCCcccCCCC
Q psy2857 255 TCIDTQGSYSCVCKEHYTGDPYQ 277 (332)
Q Consensus 255 ~C~~~~~~~~C~C~~G~~g~~~~ 277 (332)
.|....| +|.|+++|+|..++
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~ 32 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCD 32 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCC
Confidence 3444344 89999999987654
No 58
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=71.32 E-value=4.8 Score=29.19 Aligned_cols=34 Identities=29% Similarity=0.800 Sum_probs=25.8
Q ss_pred cCCCCCCCCCCCCCCceecCCCceEeeCCCCCccC
Q psy2857 141 DVDECQLGDPCGPQAQCTNTPGSFRCDCVEGYVGA 175 (332)
Q Consensus 141 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 175 (332)
..+.|.....|+..+.|.. .....|.|.+||...
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 3467776788999999954 345579999999753
No 59
>PHA02887 EGF-like protein; Provisional
Probab=70.41 E-value=4.2 Score=29.43 Aligned_cols=27 Identities=30% Similarity=0.736 Sum_probs=20.6
Q ss_pred CCCCCeeeecC--CceeeecCCCcccCCCC
Q psy2857 250 CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ 277 (332)
Q Consensus 250 C~~~~~C~~~~--~~~~C~C~~G~~g~~~~ 277 (332)
|- +|+|.-.. ..+.|.|++||+|..|+
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 44 57886544 46899999999999775
No 60
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=70.26 E-value=2.3 Score=27.38 Aligned_cols=46 Identities=26% Similarity=0.524 Sum_probs=17.8
Q ss_pred ceeeecCCCcccCCCC-cccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857 262 SYSCVCKEHYTGDPYQ-ACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP 317 (332)
Q Consensus 262 ~~~C~C~~G~~g~~~~-~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 317 (332)
.++-.|.+.|.|..|. .|...|.- ..+-.|. ..| .=+|.+||+|..
T Consensus 16 ~~rv~C~~nyyG~~C~~~C~~~~d~-------~ghy~Cd-~~G--~~~C~~Gw~G~~ 62 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSKFCKPRDDS-------FGHYTCD-SNG--NKVCLPGWTGPN 62 (63)
T ss_dssp -------TTEETTTT-EE---EEET-------TEEEEE--SS----EEE-TTEESTT
T ss_pred EEEEECCCCCCCccccCCcCCCcCC-------cCCcccC-CCC--CCCCCCCCcCCC
Confidence 4577899999999875 23322210 1122343 233 335889998864
No 61
>PHA02887 EGF-like protein; Provisional
Probab=69.64 E-value=5.4 Score=28.88 Aligned_cols=27 Identities=30% Similarity=0.736 Sum_probs=21.3
Q ss_pred CCCCCeeeeCC--CCeEeeCCCCCccCCCC
Q psy2857 27 CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ 54 (332)
Q Consensus 27 C~~~g~C~~~~--g~~~C~C~~G~~g~~~~ 54 (332)
|- ||+|.-.. ..++|.|.+||+|..|+
T Consensus 94 Ci-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 94 CI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred ee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 65 68898764 46899999999997653
No 62
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=68.98 E-value=3.9 Score=22.46 Aligned_cols=13 Identities=38% Similarity=0.715 Sum_probs=11.1
Q ss_pred eeecCCCcccCCC
Q psy2857 264 SCVCKEHYTGDPY 276 (332)
Q Consensus 264 ~C~C~~G~~g~~~ 276 (332)
+|.|++||.++..
T Consensus 19 ~C~CPeGyIlde~ 31 (34)
T PF09064_consen 19 QCFCPEGYILDEG 31 (34)
T ss_pred ceeCCCceEecCC
Confidence 8999999988743
No 63
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=65.91 E-value=5.5 Score=29.38 Aligned_cols=27 Identities=33% Similarity=0.698 Sum_probs=20.5
Q ss_pred CCCCCeeeecC--CceeeecCCCcccCCCC
Q psy2857 250 CGVNATCIDTQ--GSYSCVCKEHYTGDPYQ 277 (332)
Q Consensus 250 C~~~~~C~~~~--~~~~C~C~~G~~g~~~~ 277 (332)
|.+ |.|.-.. ..+.|.|..||+|.+|+
T Consensus 53 ClH-G~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 53 CLH-GDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred eEC-CEEEeeccCCCceeECCCCccccccc
Confidence 444 4786544 57899999999999775
No 64
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=64.70 E-value=6.8 Score=27.95 Aligned_cols=35 Identities=20% Similarity=0.465 Sum_probs=27.1
Q ss_pred cccccCCCCCCCCCCeeecCC-----CceeeeCCCCCCCC
Q psy2857 282 IDECKALDKPCGLRAICENTV-----PGFNCLCPKGYSGK 316 (332)
Q Consensus 282 ~d~C~~~~~~C~~~~~C~~~~-----g~~~C~C~~g~~g~ 316 (332)
.+.|...++.|..+|.|++.. .=|.|.|.+.+...
T Consensus 5 ~~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~ 44 (103)
T PF12955_consen 5 NDACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT 44 (103)
T ss_pred HHHHHHhccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence 466777778999999999773 33899999876654
No 65
>KOG3516|consensus
Probab=63.38 E-value=6.5 Score=40.12 Aligned_cols=36 Identities=25% Similarity=0.768 Sum_probs=32.3
Q ss_pred CCcCCCCCCCCCCeeeeCCCCeEeeCC-CCCccCCCC
Q psy2857 19 INECQSNPCGVNATCIDTQGSYSCVCK-EHYTGDPYQ 54 (332)
Q Consensus 19 ~~~C~~~~C~~~g~C~~~~g~~~C~C~-~G~~g~~~~ 54 (332)
++.|.+++|...|.|......|.|.|. .||.|..|.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCH 581 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCH 581 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccccccc
Confidence 578999999999999998899999997 899997664
No 66
>KOG3512|consensus
Probab=61.94 E-value=23 Score=32.57 Aligned_cols=25 Identities=44% Similarity=0.842 Sum_probs=18.9
Q ss_pred CCeeeecCCceeeecCCCcccCCCCcc
Q psy2857 253 NATCIDTQGSYSCVCKEHYTGDPYQAC 279 (332)
Q Consensus 253 ~~~C~~~~~~~~C~C~~G~~g~~~~~C 279 (332)
+.+|..+.| +|.|++|-+|..|..|
T Consensus 406 gktCNq~tG--qCpCkeGvtG~tCnrC 430 (592)
T KOG3512|consen 406 GKTCNQTTG--QCPCKEGVTGLTCNRC 430 (592)
T ss_pred cccccccCC--cccCCCCCcccccccc
Confidence 446776766 8999999999876544
No 67
>KOG3516|consensus
Probab=55.39 E-value=9.9 Score=38.91 Aligned_cols=39 Identities=31% Similarity=0.776 Sum_probs=34.1
Q ss_pred cccccccccCCCCCCCCCCeeecCCCceeeeCC-CCCCCCCC
Q psy2857 278 ACSDIDECKALDKPCGLRAICENTVPGFNCLCP-KGYSGKPD 318 (332)
Q Consensus 278 ~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~-~g~~g~~~ 318 (332)
+|..+|.|.. ++|.+++.|.-....|.|.|. .||.|..+
T Consensus 541 ~C~i~drClP--N~CehgG~C~Qs~~~f~C~C~~TGY~GatC 580 (1306)
T KOG3516|consen 541 MCGISDRCLP--NPCEHGGKCSQSWDDFECNCELTGYKGATC 580 (1306)
T ss_pred ccccccccCC--ccccCCCcccccccceeEeccccccccccc
Confidence 5778888974 799999999988889999999 89999865
No 68
>KOG3514|consensus
Probab=53.44 E-value=9.7 Score=38.54 Aligned_cols=34 Identities=26% Similarity=0.741 Sum_probs=30.1
Q ss_pred cCCCCCCCCCCeeeeCCCCeEeeC-CCCCccCCCC
Q psy2857 21 ECQSNPCGVNATCIDTQGSYSCVC-KEHYTGDPYQ 54 (332)
Q Consensus 21 ~C~~~~C~~~g~C~~~~g~~~C~C-~~G~~g~~~~ 54 (332)
.|.++||.++|.|...+.+|.|.| ..||.|..|+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence 699999999999999999999999 5689987663
No 69
>KOG3509|consensus
Probab=47.55 E-value=36 Score=34.66 Aligned_cols=71 Identities=25% Similarity=0.564 Sum_probs=49.1
Q ss_pred CccCCCCCCCCCeeeecCCceeeecCCCcccCCCCcccccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCC
Q psy2857 243 NECQSNPCGVNATCIDTQGSYSCVCKEHYTGDPYQACSDIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKP 317 (332)
Q Consensus 243 ~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~C~~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~ 317 (332)
+.|+..++...+.|-......+|.|++||+|..+..|... +...++-+. .++|....+.+...|.+| .|..
T Consensus 407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~--~~~~~~g~y-~~t~~~~~~~~~~~c~pg-~g~~ 477 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNG--CDRSPNGSY-LGTCVPIQGKRCEYCGPG-AGAP 477 (964)
T ss_pred CccccccCCCCccccccccccceeccccccCchhhccCcc--ccccCCccc-cceEeccCCCcceeecCC-CCCc
Confidence 4566667777777777888889999999999988755443 333333332 467777766677888888 5554
No 70
>KOG3514|consensus
Probab=36.33 E-value=24 Score=36.00 Aligned_cols=34 Identities=26% Similarity=0.753 Sum_probs=29.4
Q ss_pred ccCCCCCCCCCeeeecCCceeeecC-CCcccCCCC
Q psy2857 244 ECQSNPCGVNATCIDTQGSYSCVCK-EHYTGDPYQ 277 (332)
Q Consensus 244 ~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~~~ 277 (332)
.|.++||.+++.|....+.|.|.|. .||.|..|+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Ce 659 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCE 659 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCcccc
Confidence 5888999999999999999999996 578777553
No 71
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=33.51 E-value=20 Score=22.00 Aligned_cols=21 Identities=24% Similarity=0.597 Sum_probs=14.0
Q ss_pred eeecCCCcccCCCCccccccc
Q psy2857 264 SCVCKEHYTGDPYQACSDIDE 284 (332)
Q Consensus 264 ~C~C~~G~~g~~~~~C~~~d~ 284 (332)
-|.|++||..+....|...++
T Consensus 34 gC~C~~G~v~~~~~~CV~~~~ 54 (55)
T PF01826_consen 34 GCFCPPGYVRNDNGRCVPPSE 54 (55)
T ss_dssp EEEETTTEEEETTSEEEEGGG
T ss_pred cCCCCCCeeEcCCCCEEcHHH
Confidence 499999998765444544443
No 72
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=32.59 E-value=23 Score=21.88 Aligned_cols=30 Identities=23% Similarity=0.488 Sum_probs=16.8
Q ss_pred CCCCCCCeeee----CCCCeEeeCCCCCccCCCC
Q psy2857 25 NPCGVNATCID----TQGSYSCVCKEHYTGDPYQ 54 (332)
Q Consensus 25 ~~C~~~g~C~~----~~g~~~C~C~~G~~g~~~~ 54 (332)
.+|+.||.... ..|...|.|..-|.|.++.
T Consensus 17 i~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 17 ISCSGHGRAFLDGLIADGSPVCECNSCYGGPDCS 50 (56)
T ss_dssp S--TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred CCcCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence 46888887663 2466889999999998764
No 73
>KOG3512|consensus
Probab=29.81 E-value=69 Score=29.60 Aligned_cols=59 Identities=31% Similarity=0.636 Sum_probs=31.4
Q ss_pred cCCceee-ecCCCcccCCCCcccccccccCC-CCCCC-CCCeeecCCCceeeeCCCCCCCCCCCC
Q psy2857 259 TQGSYSC-VCKEHYTGDPYQACSDIDECKAL-DKPCG-LRAICENTVPGFNCLCPKGYSGKPDAK 320 (332)
Q Consensus 259 ~~~~~~C-~C~~G~~g~~~~~C~~~d~C~~~-~~~C~-~~~~C~~~~g~~~C~C~~g~~g~~~~~ 320 (332)
+.|. .| .|++||.-+....=.+-..|..- -|+-+ .+.+|..+.| +|.|.+|.+|..+..
T Consensus 368 TaGr-hChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnr 429 (592)
T KOG3512|consen 368 TAGR-HCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNR 429 (592)
T ss_pred CCCc-ccccccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCccccccc
Confidence 4443 35 69999987654211122223210 01211 2456654555 788888888886654
No 74
>PF05092 PIF: Per os infectivity; InterPro: IPR007784 This entry represents a group of dsDNA baculovirus proteins that are required for the infectivity of occlusion bodies (OBs). The protein is a structural component of the ODV envelope and is required only in the first steps of per os larval infection: viruses produced in cells that express the gene for this protein, but that lack the gene in their own genomes, are still able to produce successful infections. Baculoviruses are large DNA viruses that infect arthropods, mainly members of the order Lepidoptera. In their life cycle, they produce two kinds of particles: a budded, non-occluded virus (BV), which buds out of the infected cell and is responsible for the cell-to-cell transmission of the virus, and an occluded form, the occlusion body (OB), which protects the virus between encounters with larvae. A variable number of virions are included in the para-crystalline structure of the OB, which is mainly constituted by the virus-encoded polyhedrin protein; these virions are called occlusion body-derived virions (ODVs).
Probab=25.47 E-value=1.5e+02 Score=28.06 Aligned_cols=49 Identities=24% Similarity=0.582 Sum_probs=36.5
Q ss_pred eEEEEEe-cCCceecccC-CcCCCCC-CCCCCeeeeCC-CCeEeeCCCCCccC
Q psy2857 3 FVVLVRI-LLGVRAIVDI-NECQSNP-CGVNATCIDTQ-GSYSCVCKEHYTGD 51 (332)
Q Consensus 3 ~~~~c~c-~~g~~~~~~~-~~C~~~~-C~~~g~C~~~~-g~~~C~C~~G~~g~ 51 (332)
+..+|.| .||+.+...+ +.|...- |.+||.-.+.. ....|.|..||..+
T Consensus 130 fsLlCsC~~PGlVtqlniy~DC~vpVGC~PhG~I~din~~pi~C~Cd~GyVsd 182 (522)
T PF05092_consen 130 FSLLCSCLRPGLVTQLNIYEDCDVPVGCQPHGRIADINESPIRCVCDDGYVSD 182 (522)
T ss_pred eEEEEEcCCCCeEeeeehhccCCCcEecCCCCEEeeecCCceEeECCCCcccc
Confidence 5578888 7888888654 4454432 88899988874 46789999999765
No 75
>KOG0196|consensus
Probab=23.99 E-value=1.1e+02 Score=30.59 Aligned_cols=56 Identities=29% Similarity=0.688 Sum_probs=33.1
Q ss_pred eeecCCCccc----CCCCccc--------ccccccCCCCCCCCCCeeecCCCceeeeCCCCCCCCCC--CCcccc
Q psy2857 264 SCVCKEHYTG----DPYQACS--------DIDECKALDKPCGLRAICENTVPGFNCLCPKGYSGKPD--AKVACE 324 (332)
Q Consensus 264 ~C~C~~G~~g----~~~~~C~--------~~d~C~~~~~~C~~~~~C~~~~g~~~C~C~~g~~g~~~--~~~~c~ 324 (332)
.|.|.+||.- ..|..|. ....| .+|+.+.. ....++..|.|..||...+. ..+.|.
T Consensus 260 ~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C----~~CP~~S~-s~~ega~~C~C~~gyyRA~~Dp~~mpCT 329 (996)
T KOG0196|consen 260 GCVCKAGYEEAENGKACQACPPGTYKASQGDSLC----LPCPPNSH-SSSEGATSCTCENGYYRADSDPPSMPCT 329 (996)
T ss_pred ceeecCCCCcccCCCcceeCCCCcccCCCCCCCC----CCCCCCCC-CCCCCCCcccccCCcccCCCCCCCCCCC
Confidence 6889999864 2222231 12223 25665543 24567789999999987754 234554
No 76
>KOG0196|consensus
Probab=20.26 E-value=2.2e+02 Score=28.73 Aligned_cols=17 Identities=35% Similarity=0.829 Sum_probs=12.1
Q ss_pred CCCceEeeCCCCCccCC
Q psy2857 160 TPGSFRCDCVEGYVGAP 176 (332)
Q Consensus 160 ~~~~~~C~C~~G~~g~~ 176 (332)
..+.-.|.|..||.-++
T Consensus 304 ~ega~~C~C~~gyyRA~ 320 (996)
T KOG0196|consen 304 SEGATSCTCENGYYRAD 320 (996)
T ss_pred CCCCCcccccCCcccCC
Confidence 34666888999987643
Done!