Query psy13159
Match_columns 660
No_of_seqs 427 out of 2327
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 21:40:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13159.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13159hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.8 4.3E-17 9.4E-22 175.6 26.8 209 394-643 732-951 (1289)
2 KOG1217|consensus 99.6 2.2E-14 4.7E-19 159.6 24.0 262 216-594 149-422 (487)
3 KOG1217|consensus 99.6 9.7E-14 2.1E-18 154.4 22.2 292 247-643 91-388 (487)
4 KOG4289|consensus 99.6 5.9E-14 1.3E-18 158.2 17.8 107 199-330 1180-1308(2531)
5 KOG4289|consensus 99.5 1.2E-13 2.5E-18 155.9 15.8 102 36-167 1177-1308(2531)
6 KOG1214|consensus 99.5 2.4E-13 5.2E-18 147.1 16.5 223 38-302 692-936 (1289)
7 KOG1219|consensus 99.3 4.2E-12 9.1E-17 148.4 8.4 111 397-546 3865-3977(4289)
8 KOG1219|consensus 99.2 2.1E-11 4.5E-16 142.8 8.6 107 459-601 3870-3977(4289)
9 KOG0994|consensus 99.1 1.5E-09 3.2E-14 121.7 17.9 63 466-541 1031-1095(1758)
10 KOG1225|consensus 99.0 1.3E-09 2.8E-14 118.0 12.6 130 116-335 234-365 (525)
11 KOG1225|consensus 99.0 1.9E-09 4E-14 116.7 12.5 129 59-277 234-364 (525)
12 KOG4260|consensus 98.9 2.1E-09 4.6E-14 103.4 5.4 163 62-277 131-306 (350)
13 KOG0994|consensus 98.6 6.5E-07 1.4E-11 101.1 14.7 199 410-645 878-1096(1758)
14 KOG4260|consensus 98.5 1.3E-07 2.9E-12 91.2 5.4 160 120-332 132-304 (350)
15 KOG1226|consensus 98.2 1.3E-05 2.9E-10 88.7 12.3 137 100-277 466-617 (783)
16 PF07645 EGF_CA: Calcium-bindi 98.1 1.2E-06 2.6E-11 62.2 1.4 34 37-70 1-36 (42)
17 smart00179 EGF_CA Calcium-bind 98.0 8.3E-06 1.8E-10 56.8 3.6 36 37-74 1-38 (39)
18 PF07645 EGF_CA: Calcium-bindi 97.9 7.8E-06 1.7E-10 58.1 2.9 34 80-125 1-34 (42)
19 KOG1226|consensus 97.9 4.9E-05 1.1E-09 84.3 9.5 128 10-176 480-622 (783)
20 PF00008 EGF: EGF-like domain 97.9 3.9E-06 8.5E-11 55.6 0.6 30 41-70 1-31 (32)
21 PF00008 EGF: EGF-like domain 97.7 3.1E-05 6.8E-10 51.3 2.3 30 248-277 1-31 (32)
22 PF12947 EGF_3: EGF domain; I 97.7 1.3E-05 2.9E-10 54.4 0.6 31 44-74 6-36 (36)
23 cd00054 EGF_CA Calcium-binding 97.6 6.7E-05 1.5E-09 51.6 3.6 36 37-74 1-37 (38)
24 smart00179 EGF_CA Calcium-bind 97.5 0.00011 2.4E-09 51.0 3.7 33 509-541 2-36 (39)
25 KOG1836|consensus 97.4 0.0096 2.1E-07 73.9 21.4 50 219-277 756-809 (1705)
26 PF12947 EGF_3: EGF domain; I 97.4 8.5E-05 1.8E-09 50.5 1.7 30 100-129 5-34 (36)
27 PF06247 Plasmod_Pvs28: Plasmo 97.3 9.1E-05 2E-09 68.6 1.5 142 460-643 7-161 (197)
28 PF06247 Plasmod_Pvs28: Plasmo 97.1 0.0001 2.3E-09 68.2 -0.7 147 408-601 11-165 (197)
29 cd00054 EGF_CA Calcium-binding 97.0 0.00089 1.9E-08 45.8 3.7 35 396-432 2-37 (38)
30 smart00181 EGF Epidermal growt 96.8 0.0013 2.8E-08 44.4 3.2 30 40-70 1-31 (35)
31 cd00053 EGF Epidermal growth f 96.8 0.0014 3.1E-08 44.1 3.1 30 41-70 2-32 (36)
32 KOG1836|consensus 96.7 0.062 1.3E-06 67.0 18.4 97 324-433 696-812 (1705)
33 cd00053 EGF Epidermal growth f 96.2 0.0062 1.3E-07 40.8 3.6 27 572-598 6-32 (36)
34 PF12662 cEGF: Complement Clr- 96.2 0.0046 9.9E-08 37.7 2.4 24 58-83 1-24 (24)
35 smart00181 EGF Epidermal growt 95.9 0.0094 2E-07 40.1 3.3 28 399-427 2-30 (35)
36 PF12662 cEGF: Complement Clr- 95.7 0.0099 2.1E-07 36.2 2.3 23 529-553 1-23 (24)
37 PF07974 EGF_2: EGF-like domai 95.4 0.018 3.9E-07 38.0 2.9 27 101-129 6-32 (32)
38 PF07974 EGF_2: EGF-like domai 95.3 0.018 3.8E-07 38.0 2.7 23 150-172 7-31 (32)
39 PF12661 hEGF: Human growth fa 93.6 0.028 6.2E-07 28.9 0.6 13 588-600 1-13 (13)
40 PF14670 FXa_inhibition: Coagu 92.7 0.059 1.3E-06 36.6 1.2 23 46-70 8-30 (36)
41 KOG3512|consensus 91.2 1.9 4.1E-05 46.0 10.8 163 465-647 285-478 (592)
42 PF14670 FXa_inhibition: Coagu 91.2 0.13 2.9E-06 34.8 1.7 21 521-541 10-30 (36)
43 smart00051 DSL delta serrate l 90.6 0.39 8.4E-06 37.2 3.9 48 58-129 16-63 (63)
44 PF12946 EGF_MSP1_1: MSP1 EGF 90.4 0.12 2.7E-06 34.9 0.9 34 248-281 2-36 (37)
45 PF12946 EGF_MSP1_1: MSP1 EGF 90.1 0.058 1.3E-06 36.5 -0.9 31 41-71 2-33 (37)
46 smart00051 DSL delta serrate l 88.7 0.62 1.3E-05 36.1 3.8 48 529-600 16-63 (63)
47 cd01475 vWA_Matrilin VWA_Matri 83.2 0.98 2.1E-05 44.8 3.1 38 31-70 180-219 (224)
48 cd01475 vWA_Matrilin VWA_Matri 82.2 1.3 2.8E-05 44.0 3.5 37 77-127 183-219 (224)
49 KOG1218|consensus 81.4 18 0.00038 37.7 12.0 45 588-641 163-207 (316)
50 KOG1218|consensus 80.5 49 0.0011 34.3 15.0 14 114-127 13-26 (316)
51 PF00053 Laminin_EGF: Laminin 79.1 1.9 4.1E-05 31.4 2.6 22 107-130 11-32 (49)
52 PF01683 EB: EB module; Inter 76.2 2.8 6E-05 30.9 2.8 11 324-334 38-48 (52)
53 PF00053 Laminin_EGF: Laminin 71.9 2.4 5.3E-05 30.8 1.6 27 578-606 11-37 (49)
54 cd00055 EGF_Lam Laminin-type e 70.5 4.8 0.0001 29.5 2.9 21 587-607 19-39 (50)
55 PF01414 DSL: Delta serrate li 69.9 1.8 3.9E-05 33.6 0.5 48 58-129 16-63 (63)
56 PF01683 EB: EB module; Inter 69.2 7.2 0.00016 28.7 3.6 23 101-127 26-48 (52)
57 PHA02887 EGF-like protein; Pro 64.4 6 0.00013 34.2 2.6 29 572-601 92-122 (126)
58 cd00055 EGF_Lam Laminin-type e 64.1 7.4 0.00016 28.4 2.8 17 116-132 19-35 (50)
59 PHA03099 epidermal growth fact 61.1 7 0.00015 34.4 2.4 29 572-601 51-81 (139)
60 PF00954 S_locus_glycop: S-loc 58.4 8.8 0.00019 33.3 2.7 33 37-70 76-109 (110)
61 KOG3516|consensus 57.7 7.3 0.00016 46.6 2.6 38 36-75 543-581 (1306)
62 smart00180 EGF_Lam Laminin-typ 57.4 13 0.00028 26.7 3.0 21 587-607 18-38 (46)
63 KOG3512|consensus 53.1 52 0.0011 35.7 7.7 150 9-173 296-477 (592)
64 PHA03099 epidermal growth fact 53.0 12 0.00026 33.0 2.5 30 614-647 52-81 (139)
65 PHA02887 EGF-like protein; Pro 52.7 14 0.0003 32.0 2.8 30 614-647 93-122 (126)
66 PF09064 Tme5_EGF_like: Thromb 47.0 21 0.00045 23.8 2.3 14 586-599 17-30 (34)
67 PF00954 S_locus_glycop: S-loc 46.0 19 0.00042 31.1 2.9 25 101-126 84-108 (110)
68 KOG3514|consensus 43.9 16 0.00034 43.3 2.4 35 40-76 625-660 (1591)
69 KOG3516|consensus 38.2 23 0.0005 42.6 2.7 41 504-546 540-581 (1306)
70 KOG3514|consensus 34.9 26 0.00057 41.6 2.4 36 511-548 625-661 (1591)
71 PF12955 DUF3844: Domain of un 31.0 20 0.00044 30.6 0.5 33 38-70 5-44 (103)
72 PF12955 DUF3844: Domain of un 25.0 30 0.00066 29.6 0.6 26 100-125 12-42 (103)
73 KOG3607|consensus 20.7 84 0.0018 36.8 3.3 47 552-601 603-656 (716)
No 1
>KOG1214|consensus
Probab=99.79 E-value=4.3e-17 Score=175.59 Aligned_cols=209 Identities=27% Similarity=0.496 Sum_probs=144.2
Q ss_pred CCCCCCCC--CCCCCCCceeeCCCCceeecCCCcc--CCCCCCccCC--cCCCCCCCCcccccCCCCCCCCCCCCCCCeE
Q psy13159 394 EYVNPCIP--SPCGPYSQCRDIGGSPSCSCLPNYI--GSPPNCRPEC--VMNSECPSNEACINEKCGDPCPGSCGYNAQC 467 (660)
Q Consensus 394 ~~~d~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~--g~~~~C~~~C--~~~~~C~~~~~C~~~~c~~~C~~~C~~~~~C 467 (660)
.++++|+. ..|.+++.|++.+++|+|.|..||. +++.+|...= ..++.|... .+.|+.++.+
T Consensus 732 ~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g------------~h~C~i~g~a 799 (1289)
T KOG1214|consen 732 VDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDG------------SHTCAIAGQA 799 (1289)
T ss_pred CChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccC------------ccccCcCCce
Confidence 46677774 4599999999999999999999875 3433443210 123445544 3567666654
Q ss_pred --eec-CCceeeeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeecCCCceeeCCCCcccCCCC
Q psy13159 468 --KVI-NHTPICTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPN 544 (660)
Q Consensus 468 --~~~-~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~~~C~~~g~C~~~~g~y~C~C~~G~~g~~~~ 544 (660)
+.. .++|.|.|.|||.|++.. | .++|||.++-|.++++|.+++|+|.|+|.+||+|+|..
T Consensus 800 ~c~~hGgs~y~C~CLPGfsGDG~~-c----------------~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~ 862 (1289)
T KOG1214|consen 800 RCVHHGGSTYSCACLPGFSGDGHQ-C----------------TDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQ 862 (1289)
T ss_pred EEEecCCceEEEeecCCccCCccc-c----------------ccccccCccccCCCceEecCCCcceeecccCccCCCce
Confidence 433 347999999999999853 3 67899999999999999999999999999999999987
Q ss_pred CccCCccCCCCCCCCccccccccCCCCCCCCCCCeEEe--cCCceEeeCCCCCccCCCcCCCCCCCCCCCCCCCCCCeEe
Q psy13159 545 CRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKV--INHTPICTCPDGYTGDAFSGCYPKPPEQQQLKRDRGGILV 622 (660)
Q Consensus 545 C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~--~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~C~~~g~C~ 622 (660)
|.+.=.....|... .+-+..|+.++.|.- .+.+|.+.|.++-.|++-..|-++++ .--..|+.+|.+.
T Consensus 863 CVP~~~~~T~C~~e---------r~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~-~~vp~Cd~hgh~a 932 (1289)
T KOG1214|consen 863 CVPDTSSLTPCEQE---------RFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPE-QYVPQCDDHGHFA 932 (1289)
T ss_pred ecCCCccCCccccc---------cccceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCCCccc-ccCCCcccccccc
Confidence 76321111223221 011345666654321 25579999999998888777766543 2223588888888
Q ss_pred ecCCcCCCCceeeecCCCCCC
Q psy13159 623 LLPITRRKIKYECRCRRRRGR 643 (660)
Q Consensus 623 ~~~~~~~~~~~~C~C~~Gy~g 643 (660)
.++- ...++.|+|.++-++
T Consensus 933 p~qc--hG~~~~CwCvd~dGr 951 (1289)
T KOG1214|consen 933 PLQC--HGKSDFCWCVDKDGR 951 (1289)
T ss_pred cccc--CCCcceeEEecCCCc
Confidence 6644 344599999887643
No 2
>KOG1217|consensus
Probab=99.64 E-value=2.2e-14 Score=159.63 Aligned_cols=262 Identities=29% Similarity=0.607 Sum_probs=185.4
Q ss_pred CCceEeeCCCCCccCCCccccCCCCCCCcCCCCC--CCCCCCCCeeecCCCceeeecCCCccCCCCCCcCCCccCCCCCC
Q psy13159 216 NHAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPCQ--PSPCGPNSQCREVNKQAVCSCLPNYFGSPPNCRPECTVNTDCPL 293 (660)
Q Consensus 216 ~~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C~--~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~ 293 (660)
...|.|.|..||.+..+. ...++|. ..+|.+++.|.+..++|.|.|++||.+. .|+..
T Consensus 149 ~~~~~c~C~~g~~~~~~~---------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~------~~~~~----- 208 (487)
T KOG1217|consen 149 VGPFRCSCTEGYEGEPCE---------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS------TCETT----- 208 (487)
T ss_pred CCceeeeeCCCccccccc---------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC------cCcCC-----
Confidence 467899999999999753 2336786 4569999999999999999999999998 33322
Q ss_pred CCCccCCcccCCCCCCCCCCccccccCceeeEecCCCcccCCcccccCCccCCCCCCCCcccccCCCCCCCCCCccCCCC
Q psy13159 294 NKACVNQKCVDPCPGSCGENRELDAQRFLVSSVCLPDYYGDGYVSCRPECVLNSDCPSNKACIRNKCKNPCVPGTCGEGA 373 (660)
Q Consensus 294 ~~~C~~~~C~~~c~g~C~~~~~~~~~~~~~~C~C~~G~~g~~~~~c~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~g~ 373 (660)
.++ +.|... +.|.+.+||.+..+...+.++. .+ + +
T Consensus 209 ----~~~-------~~c~~~---------~~~~~~~g~~~~~c~~~~~~~~---------------------~~---~-~ 243 (487)
T KOG1217|consen 209 ----GNG-------GTCVDS---------VACSCPPGARGPECEVSIVECA---------------------SG---D-G 243 (487)
T ss_pred ----CCC-------ceEecc---------eeccCCCCCCCCCccccccccc---------------------CC---C-C
Confidence 011 122111 4688899988774332222211 11 3 6
Q ss_pred eeeecCCCeeecCCCCCCCCC-----CCCCCCCCC-CCCCCceeeCCCCceeecCCCccCCCCCCccCCcCCCCCCCCcc
Q psy13159 374 ICDVFLLSFTAPPPPLESPPE-----YVNPCIPSP-CGPYSQCRDIGGSPSCSCLPNYIGSPPNCRPECVMNSECPSNEA 447 (660)
Q Consensus 374 ~C~~~~~~~~C~c~~g~~~~~-----~~d~C~~~~-C~~~~~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~ 447 (660)
+|.+..++|+|.++.||.... ++++|...+ |.++++|++..++|.|.|++||+|. .+ ..+.+..+|....
T Consensus 244 ~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~--~~-~~~~~~~~C~~~~- 319 (487)
T KOG1217|consen 244 TCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR--LC-TECVDVDECSPRN- 319 (487)
T ss_pred cccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCC--CC-ccccccccccccc-
Confidence 778888889999999986655 899999764 9999999999999999999999999 65 3344455664310
Q ss_pred cccCCCCCCCCCCCCCCCeE--eecCCceeeeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCcee
Q psy13159 448 CINEKCGDPCPGSCGYNAQC--KVINHTPICTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEPV-NPCYPSPCGPYSQCR 524 (660)
Q Consensus 448 C~~~~c~~~C~~~C~~~~~C--~~~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~i-~eC~~~~C~~~g~C~ 524 (660)
. ...|.++++| ....+.+.|.|..||.|..++ .. ++|...++.+++.|+
T Consensus 320 -~--------~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~-------------------~~~~~C~~~~~~~~~~c~ 371 (487)
T KOG1217|consen 320 -A--------GGPCANGGTCNTLGSFGGFRCACGPGFTGRRCE-------------------DSNDECASSPCCPGGTCV 371 (487)
T ss_pred -c--------CCcCCCCcccccCCCCCCCCcCCCCCCCCCccc-------------------cCCccccCCccccCCEec
Confidence 0 2357777777 334457889999999998652 34 588888899999999
Q ss_pred e-cCCCceeeCCCCcccCCCCCccCCccCCCCCCCCccccccccCCCCCCCCCCCeEEecCCceEeeCCCC
Q psy13159 525 D-IGGSPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDG 594 (660)
Q Consensus 525 ~-~~g~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~G 594 (660)
+ ..++|.|.|+.+|.+....-...+.++++|.. .+.|++..++|.|. +.+
T Consensus 372 ~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~-------------------~~~c~~~~~~~~c~-~~~ 422 (487)
T KOG1217|consen 372 NETPGSYRCACPAGFAGKANGDGVGCEDIDECSG-------------------CGDCVNGPGGGACT-PPG 422 (487)
T ss_pred cCCCCCeEecCCCccccCCccccccccccccccC-------------------CcceeccCCCCccc-cCc
Confidence 9 79999999999999841111122333333321 45677788899998 773
No 3
>KOG1217|consensus
Probab=99.59 E-value=9.7e-14 Score=154.43 Aligned_cols=292 Identities=26% Similarity=0.543 Sum_probs=190.5
Q ss_pred CCCCCCCCCCCeeecCCCceeeecCCCccCCCCCCcC--CCccCCCCCCCCCccCCcccCCCCCCCCCCccccccCceee
Q psy13159 247 PCQPSPCGPNSQCREVNKQAVCSCLPNYFGSPPNCRP--ECTVNTDCPLNKACVNQKCVDPCPGSCGENRELDAQRFLVS 324 (660)
Q Consensus 247 ~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~--~C~~~~~C~~~~~C~~~~C~~~c~g~C~~~~~~~~~~~~~~ 324 (660)
.+...+....+.+......|.|.|++||.+. .++. +|..... ..+..+. |..... ....+.
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~--~~~~~~~C~~~~~----~~~~~~~--------c~~~~~---~~~~~~ 153 (487)
T KOG1217|consen 91 PCRSPCLLLCGECVDCVGSYECTCPPGYQGT--PCEGECECVTGPG----VCCIDGS--------CSNGPG---SVGPFR 153 (487)
T ss_pred cccCCcccCCccccCCCCCceeeCCCccccC--cCCcceeecCCCC----CeeCchh--------hcCCCC---CCCcee
Confidence 3444444556677778899999999999998 4432 2332211 0122222 221100 134688
Q ss_pred EecCCCcccCCcccccCCccCCCCCCCCcccccCCCCCCCCCCccCCCCeeeecCCCeeecCCCCCCCCCCCCCCCCCCC
Q psy13159 325 SVCLPDYYGDGYVSCRPECVLNSDCPSNKACIRNKCKNPCVPGTCGEGAICDVFLLSFTAPPPPLESPPEYVNPCIPSPC 404 (660)
Q Consensus 325 C~C~~G~~g~~~~~c~~~C~~~~~C~~~~~C~~~~c~~~C~~~~C~~g~~C~~~~~~~~C~c~~g~~~~~~~d~C~~~~C 404 (660)
|.|..||.+.......++|... ...|.+++.|.+..++|.|.|+.+|........
T Consensus 154 c~C~~g~~~~~~~~~~~~C~~~-------------------~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~------ 208 (487)
T KOG1217|consen 154 CSCTEGYEGEPCETDLDECIQY-------------------SSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT------ 208 (487)
T ss_pred eeeCCCcccccccccccccccC-------------------CCCcCCCcccccCCCCeeEeCCCCccCCcCcCC------
Confidence 9999999998543322233211 223667777888888888888888754322111
Q ss_pred CCCCceeeCCCCceeecCCCccCCCCCCccCCcCCCCCCCCcccccCCCCCCCCCCCCCCCeEeecCCceeeeCCCCCcc
Q psy13159 405 GPYSQCRDIGGSPSCSCLPNYIGSPPNCRPECVMNSECPSNEACINEKCGDPCPGSCGYNAQCKVINHTPICTCPDGFIG 484 (660)
Q Consensus 405 ~~~~~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G 484 (660)
.+++.|++. +.|.+.+||.+. .+.. .+.++.. . + +.|++..++|+|.|++||++
T Consensus 209 ~~~~~c~~~---~~~~~~~g~~~~--~c~~---~~~~~~~---------------~--~-~~c~~~~~~~~C~~~~g~~~ 262 (487)
T KOG1217|consen 209 GNGGTCVDS---VACSCPPGARGP--ECEV---SIVECAS---------------G--D-GTCVNTVGSYTCRCPEGYTG 262 (487)
T ss_pred CCCceEecc---eeccCCCCCCCC--Cccc---ccccccC---------------C--C-CcccccCCceeeeCCCCccc
Confidence 344556554 667888888766 5542 1222221 0 3 78899999999999999999
Q ss_pred CCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCceeecCCCceeeCCCCcccCCCCCccCCccCCCCCCCCcccc
Q psy13159 485 DPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSP-CGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACIN 563 (660)
Q Consensus 485 ~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~~~-C~~~g~C~~~~g~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~ 563 (660)
.....+ .++++|.... |.++++|++..+.|.|.|++||+|. .+ ..+.+...|.... .
T Consensus 263 ~~~~~~----------------~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~--~~-~~~~~~~~C~~~~--~- 320 (487)
T KOG1217|consen 263 DACVTC----------------VDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGR--LC-TECVDVDECSPRN--A- 320 (487)
T ss_pred ccccee----------------eeccccCCCCccCCCCeeecCCCcceeeCCCCCCCC--CC-ccccccccccccc--c-
Confidence 852112 5789998764 9999999999999999999999999 54 2333333443110 0
Q ss_pred ccccCCCCCCCCCCCeE--EecCCceEeeCCCCCccCCCcCCCCCCCCCCCCCCCCCCeEee-cCCcCCCCceeeecCCC
Q psy13159 564 EKCQDPCPGSCGYNALC--KVINHTPICTCPDGYTGDAFSGCYPKPPEQQQLKRDRGGILVL-LPITRRKIKYECRCRRR 640 (660)
Q Consensus 564 ~~C~~~c~~~C~~~~~C--~~~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~C~~~g~C~~-~~~~~~~~~~~C~C~~G 640 (660)
...|.+++.| .+..+.|.|.|..||.|..|+ .....+...++.+++.|++ . .++|+|.|+.+
T Consensus 321 -------~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~---~~~~~C~~~~~~~~~~c~~~~-----~~~~~c~~~~~ 385 (487)
T KOG1217|consen 321 -------GGPCANGGTCNTLGSFGGFRCACGPGFTGRRCE---DSNDECASSPCCPGGTCVNET-----PGSYRCACPAG 385 (487)
T ss_pred -------CCcCCCCcccccCCCCCCCCcCCCCCCCCCccc---cCCccccCCccccCCEeccCC-----CCCeEecCCCc
Confidence 2457777778 344557899999999999987 2223455556888999998 4 56999999999
Q ss_pred CCC
Q psy13159 641 RGR 643 (660)
Q Consensus 641 y~g 643 (660)
|.+
T Consensus 386 ~~~ 388 (487)
T KOG1217|consen 386 FAG 388 (487)
T ss_pred ccc
Confidence 865
No 4
>KOG4289|consensus
Probab=99.57 E-value=5.9e-14 Score=158.21 Aligned_cols=107 Identities=28% Similarity=0.616 Sum_probs=85.4
Q ss_pred CCCCCCCCCCCCeEee----------------------eCCceEeeCCCCCccCCCccccCCCCCCCcCCCCCCCCCCCC
Q psy13159 199 NPCVPGTCGEGAICDV----------------------VNHAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPCQPSPCGPN 256 (660)
Q Consensus 199 ~~C~~~~C~~~~~C~~----------------------~~~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C~~~~C~~~ 256 (660)
+.|...||.+..+|+. ..++++|+|++||+|+.|+ ..+|+|.+.||.++
T Consensus 1180 niClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~Ce---------TeiDlCYs~pC~nn 1250 (2531)
T KOG4289|consen 1180 NICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCE---------TEIDLCYSGPCGNN 1250 (2531)
T ss_pred chhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCccccc---------chhHhhhcCCCCCC
Confidence 4566677777777733 2367999999999999765 78999999999999
Q ss_pred CeeecCCCceeeecCCCccCCCCCCcCCCccCCCCCCCCCccCCcccCCCCCCCCCCccccccCceeeEecCCC
Q psy13159 257 SQCREVNKQAVCSCLPNYFGSPPNCRPECTVNTDCPLNKACVNQKCVDPCPGSCGENRELDAQRFLVSSVCLPD 330 (660)
Q Consensus 257 g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~g~C~~~~~~~~~~~~~~C~C~~G 330 (660)
|+|....|+|+|.|.+||+|. .|+... ..+.|+++.|.+ .|+|... ..+.|.|.|+.|
T Consensus 1251 g~C~srEggYtCeCrpg~tGe------hCEvs~---~agrCvpGvC~n--ggtC~~~-----~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1251 GRCRSREGGYTCECRPGFTGE------HCEVSA---RAGRCVPGVCKN--GGTCVNL-----LNGGFCCHCPYG 1308 (2531)
T ss_pred CceEEecCceeEEecCCcccc------ceeeec---ccCccccceecC--CCEEeec-----CCCceeccCCCc
Confidence 999999999999999999999 566543 235677777766 4777654 356789999988
No 5
>KOG4289|consensus
Probab=99.53 E-value=1.2e-13 Score=155.91 Aligned_cols=102 Identities=26% Similarity=0.678 Sum_probs=82.0
Q ss_pred cccCCCCCCCCCCCCccccC----------------------CCCceeeCCCCCccCCCCCcccCCCCCCCCCCcccccc
Q psy13159 36 EYVNPCVPSPCGPYSQCRDI----------------------GGSPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIRE 93 (660)
Q Consensus 36 ~~~d~C~~~~C~~~g~C~~~----------------------~g~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~ 93 (660)
.|-+.|+..||.|..+|+.. .+++.|.|++||+|+ .|+ ..+|+|-
T Consensus 1177 fdDniClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd--~Ce---TeiDlCY-------- 1243 (2531)
T KOG4289|consen 1177 FDDNICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGD--YCE---TEIDLCY-------- 1243 (2531)
T ss_pred ccCchhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCCCCcc--ccc---chhHhhh--------
Confidence 45677999999999999732 367889999999999 898 4566776
Q ss_pred cccCCCCCCCCCCCeeEecCCCceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCc---CCCCCCEeeCc-----eeecC
Q psy13159 94 KCADPCPGSCGYNAQCKVINHTPICTCPDGFIGDAFLSCHPKPPEPVQPIIQEDTC---NCVPNAECRDG-----VCVCL 165 (660)
Q Consensus 94 ~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C---~C~~~g~C~~~-----~C~C~ 165 (660)
..+|.++|+|....|+|+|.|++||+|..|+- . .....| .|.++|+|++. .|.|+
T Consensus 1244 ------s~pC~nng~C~srEggYtCeCrpg~tGehCEv---s--------~~agrCvpGvC~nggtC~~~~nggf~c~Cp 1306 (2531)
T KOG4289|consen 1244 ------SGPCGNNGRCRSREGGYTCECRPGFTGEHCEV---S--------ARAGRCVPGVCKNGGTCVNLLNGGFCCHCP 1306 (2531)
T ss_pred ------cCCCCCCCceEEecCceeEEecCCccccceee---e--------cccCccccceecCCCEEeecCCCceeccCC
Confidence 47899999999999999999999999999761 0 122344 48899999862 58998
Q ss_pred CC
Q psy13159 166 PD 167 (660)
Q Consensus 166 ~G 167 (660)
.|
T Consensus 1307 ~g 1308 (2531)
T KOG4289|consen 1307 YG 1308 (2531)
T ss_pred Cc
Confidence 87
No 6
>KOG1214|consensus
Probab=99.52 E-value=2.4e-13 Score=147.12 Aligned_cols=223 Identities=26% Similarity=0.595 Sum_probs=149.4
Q ss_pred cCCCC--CCCCCCCCccccCCC-CceeeCCCCCccCCCCCcccCCCCCCCCCCcccccccccCCCCCCCCCCCeeEecCC
Q psy13159 38 VNPCV--PSPCGPYSQCRDIGG-SPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINH 114 (660)
Q Consensus 38 ~d~C~--~~~C~~~g~C~~~~g-~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~g 114 (660)
+++|. ++-|..++.|....+ .|+|.|..||.|++++| .+.++|... .+.|..+++|++.+|
T Consensus 692 ~npCy~gsh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c----~d~~eca~~------------~~~CGp~s~Cin~pg 755 (1289)
T KOG1214|consen 692 VNPCYDGSHMCDTTARCHPGTGVDYTCECSSGYQGDGRNC----VDENECATG------------FHRCGPNSVCINLPG 755 (1289)
T ss_pred cccceecCcccCCCccccCCCCcceEEEEeeccCCCCCCC----CChhhhccC------------CCCCCCCceeecCCC
Confidence 45553 566888889987654 69999999999998776 578888875 688999999999999
Q ss_pred CceeeCCCCCc--cCCCccCCCCCC-CCCCCCCCCCCcCCCCCCEe--eC-----ceeecCCCcccCCcccCCCCCccCC
Q psy13159 115 TPICTCPDGFI--GDAFLSCHPKPP-EPVQPIIQEDTCNCVPNAEC--RD-----GVCVCLPDYYGDGYVSCRPECVVNS 184 (660)
Q Consensus 115 s~~C~C~~Gy~--G~~c~~C~~~~~-~~~~~~~~~~~C~C~~~g~C--~~-----~~C~C~~G~~G~~c~~~~~~C~~~~ 184 (660)
+|+|.|..||. ++.. .|.+..+ .+..+|.+. .-.|..++.+ +. +.|+|.|||.|++.. |.+.
T Consensus 756 ~~rceC~~gy~F~dd~~-tCV~i~~pap~n~Ce~g-~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-----c~dv- 827 (1289)
T KOG1214|consen 756 SYRCECRSGYEFADDRH-TCVLITPPAPANPCEDG-SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-----CTDV- 827 (1289)
T ss_pred ceeEEEeecceeccCCc-ceEEecCCCCCCccccC-ccccCcCCceEEEecCCceEEEeecCCccCCccc-----cccc-
Confidence 99999999985 4432 3654422 223333322 0146555554 42 479999999999843 4443
Q ss_pred CCCCCcccccCcccCCCCCCCCCCCCeEeeeCCceEeeCCCCCccCCCccccCCCCCCCcCCCCCCC-----CCCCCCee
Q psy13159 185 ECPRNKACIKYKCKNPCVPGTCGEGAICDVVNHAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPCQPS-----PCGPNSQC 259 (660)
Q Consensus 185 ~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C~~~-----~C~~~g~C 259 (660)
++|....|+.+++|.+++++|.|.|.+||.|++. .|.+.. .....|... -|+.+..|
T Consensus 828 --------------DeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf-~CVP~~---~~~T~C~~er~hpl~chg~t~~ 889 (1289)
T KOG1214|consen 828 --------------DECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF-QCVPDT---SSLTPCEQERFHPLQCHGSTGF 889 (1289)
T ss_pred --------------cccCccccCCCceEecCCCcceeecccCccCCCc-eecCCC---ccCCccccccccceeeccccce
Confidence 5666778999999999999999999999999987 476541 334455432 25444433
Q ss_pred e--cCCCceeeecCCCccCCCC-CCcCC-CccCCCCCCCCCccCCcc
Q psy13159 260 R--EVNKQAVCSCLPNYFGSPP-NCRPE-CTVNTDCPLNKACVNQKC 302 (660)
Q Consensus 260 ~--~~~g~y~C~C~~Gy~g~~~-~C~~~-C~~~~~C~~~~~C~~~~C 302 (660)
. -.+..|.+.+.++-.|++. +|... =..-.+|..++.+....|
T Consensus 890 ~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~~~vp~Cd~hgh~ap~qc 936 (1289)
T KOG1214|consen 890 CWCVDPDGHEVPGTQTPPGSTPPHCGPSPEQYVPQCDDHGHFAPLQC 936 (1289)
T ss_pred eEeeCCCcccCCCCCCCCCCCCCCCCCcccccCCCcccccccccccc
Confidence 2 1246688888877776532 34310 011123555555555433
No 7
>KOG1219|consensus
Probab=99.29 E-value=4.2e-12 Score=148.39 Aligned_cols=111 Identities=31% Similarity=0.773 Sum_probs=100.7
Q ss_pred CCCCCCCCCCCCceeeC-CCCceeecCCCccCCCCCCccCCcCCCCCCCCcccccCCCCCCCCCCCCCCCeEeecCCcee
Q psy13159 397 NPCIPSPCGPYSQCRDI-GGSPSCSCLPNYIGSPPNCRPECVMNSECPSNEACINEKCGDPCPGSCGYNAQCKVINHTPI 475 (660)
Q Consensus 397 d~C~~~~C~~~~~C~~~-~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~C~~~C~~~~~C~~~~g~~~ 475 (660)
+.|..+||+++|+|... .++|.|.|++-|+|. +|+. ++.+|..+ +|..+++|+...++|.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~--~CEi---~~epC~sn--------------PC~~GgtCip~~n~f~ 3925 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN--HCEI---DLEPCASN--------------PCLTGGTCIPFYNGFL 3925 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCc--cccc---ccccccCC--------------CCCCCCEEEecCCCee
Confidence 88999999999999985 478999999999999 9985 47788765 7999999999999999
Q ss_pred eeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCceeecCCCceeeCCCCcccCCCCCc
Q psy13159 476 CTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEP-VNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCR 546 (660)
Q Consensus 476 C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~-i~eC~~~~C~~~g~C~~~~g~y~C~C~~G~~g~~~~C~ 546 (660)
|.|+.||+|.+|+ .+ |+||+.++|.++|.|+|++|+|.|.|.+||.|. .|.
T Consensus 3926 CnC~~gyTG~~Ce------------------~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr--~c~ 3977 (4289)
T KOG1219|consen 3926 CNCPNGYTGKRCE------------------ARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR--TCC 3977 (4289)
T ss_pred EeCCCCccCceee------------------cccccccccccccCCceeeccCCceEeccChhHhcc--cCc
Confidence 9999999999764 44 999999999999999999999999999999999 654
No 8
>KOG1219|consensus
Probab=99.21 E-value=2.1e-11 Score=142.79 Aligned_cols=107 Identities=24% Similarity=0.598 Sum_probs=95.3
Q ss_pred CCCCCCCeEeecC-CceeeeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeecCCCceeeCCCC
Q psy13159 459 GSCGYNAQCKVIN-HTPICTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPN 537 (660)
Q Consensus 459 ~~C~~~~~C~~~~-g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~~~C~~~g~C~~~~g~y~C~C~~G 537 (660)
.+|+++|+|.... ++|.|.|++-|+|..|+ .++..|.++||..+|+|+...++|.|.|+.|
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~CE------------------i~~epC~snPC~~GgtCip~~n~f~CnC~~g 3931 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNHCE------------------IDLEPCASNPCLTGGTCIPFYNGFLCNCPNG 3931 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccCcccc------------------cccccccCCCCCCCCEEEecCCCeeEeCCCC
Confidence 4899999999875 58999999999999754 7899999999999999999999999999999
Q ss_pred cccCCCCCccCCccCCCCCCCCccccccccCCCCCCCCCCCeEEecCCceEeeCCCCCccCCCc
Q psy13159 538 YIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAFS 601 (660)
Q Consensus 538 ~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c~ 601 (660)
|+|. +|+.+ .+++|.. ++|.++|+|+|++|+|.|.|.+||.|..|.
T Consensus 3932 yTG~--~Ce~~--Gi~eCs~--------------n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3932 YTGK--RCEAR--GISECSK--------------NVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred ccCc--eeecc--ccccccc--------------ccccCCceeeccCCceEeccChhHhcccCc
Confidence 9999 88742 2556654 489999999999999999999999999974
No 9
>KOG0994|consensus
Probab=99.13 E-value=1.5e-09 Score=121.65 Aligned_cols=63 Identities=25% Similarity=0.655 Sum_probs=37.5
Q ss_pred eEeecCCceeeeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CceeecCCCceeeCCCCcccC
Q psy13159 466 QCKVINHTPICTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSPCGPY--SQCRDIGGSPSCSCLPNYIGA 541 (660)
Q Consensus 466 ~C~~~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~~~C~~~--g~C~~~~g~y~C~C~~G~~g~ 541 (660)
.|....| +|-|.+...|..|..|.++.+.... -.-|.+-.|.+. -+|....| .|+|++||-|.
T Consensus 1031 ~CDr~tG--QCpClpNv~G~~CDqCA~N~w~laS---------G~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR 1095 (1758)
T KOG0994|consen 1031 HCDRFTG--QCPCLPNVQGVRCDQCAENHWNLAS---------GEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGR 1095 (1758)
T ss_pred ccccccC--cCCCCcccccccccccccchhcccc---------CCCCCccCCCccCCcccccccc--ceeccCCCCCc
Confidence 3444444 5889999999988777766554311 112222233321 25655555 68888888887
No 10
>KOG1225|consensus
Probab=99.05 E-value=1.3e-09 Score=117.95 Aligned_cols=130 Identities=30% Similarity=0.790 Sum_probs=95.9
Q ss_pred ceeeCCCCCccCCCc--cCCCCCCCCCCCCCCCCCcCCCCCCEeeCceeecCCCcccCCcccCCCCCccCCCCCCCcccc
Q psy13159 116 PICTCPDGFIGDAFL--SCHPKPPEPVQPIIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCRPECVVNSECPRNKACI 193 (660)
Q Consensus 116 ~~C~C~~Gy~G~~c~--~C~~~~~~~~~~~~~~~~C~C~~~g~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~~~C~ 193 (660)
++|.|..+|+|..+. .|.. .|..++.|++++|+|++||+|+.|..
T Consensus 234 ~ic~c~~~~~g~~c~~~~C~~---------------~c~~~g~c~~G~CIC~~Gf~G~dC~e------------------ 280 (525)
T KOG1225|consen 234 GICECPEGYFGPLCSTIYCPG---------------GCTGRGQCVEGRCICPPGFTGDDCDE------------------ 280 (525)
T ss_pred ceeecCCceeCCccccccCCC---------------CCcccceEeCCeEeCCCCCcCCCCCc------------------
Confidence 489999999999864 2221 36778999999999999999997652
Q ss_pred cCcccCCCCCCCCCCCCeEeeeCCceEeeCCCCCccCCCccccCCCCCCCcCCCCCCCCCCCCCeeecCCCceeeecCCC
Q psy13159 194 KYKCKNPCVPGTCGEGAICDVVNHAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPCQPSPCGPNSQCREVNKQAVCSCLPN 273 (660)
Q Consensus 194 ~~~C~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~y~C~C~~G 273 (660)
-.|... |+.++.+++. .|+|++||+|..|+ +..| +.+|+++|.|++. +|.|.+|
T Consensus 281 -----~~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs-----------~~~c-padC~g~G~Ci~G----~C~C~~G 334 (525)
T KOG1225|consen 281 -----LVCPVD-CSGGGVCVDG----ECICNPGYSGKDCS-----------IRRC-PADCSGHGKCIDG----ECLCDEG 334 (525)
T ss_pred -----ccCCcc-cCCCceecCC----EeecCCCccccccc-----------cccC-CccCCCCCcccCC----ceEeCCC
Confidence 224223 6667766643 69999999999764 2224 3679999999944 9999999
Q ss_pred ccCCCCCCcCCCccCCCCCCCCCccCCcccCCCCCCCCCCccccccCceeeEecCCCcccCC
Q psy13159 274 YFGSPPNCRPECTVNTDCPLNKACVNQKCVDPCPGSCGENRELDAQRFLVSSVCLPDYYGDG 335 (660)
Q Consensus 274 y~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~g~C~~~~~~~~~~~~~~C~C~~G~~g~~ 335 (660)
|+|. .|+... |+....|+++ |+|..||.|..
T Consensus 335 y~G~------~C~~~~-C~~~g~cv~g------------------------C~C~~Gw~G~d 365 (525)
T KOG1225|consen 335 YTGE------LCIQRA-CSGGGQCVNG------------------------CKCKKGWRGPD 365 (525)
T ss_pred CcCC------cccccc-cCCCceeccC------------------------ceeccCccCCC
Confidence 9999 666542 4444444332 89999999985
No 11
>KOG1225|consensus
Probab=99.02 E-value=1.9e-09 Score=116.72 Aligned_cols=129 Identities=34% Similarity=0.819 Sum_probs=99.1
Q ss_pred ceeeCCCCCccCCCCCcccCCCCCCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCccCCCccCCCCCCC
Q psy13159 59 PSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFIGDAFLSCHPKPPE 138 (660)
Q Consensus 59 y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~ 138 (660)
+.|.|..||.|. .|+ . -.| ++.|..+++|++ .+|+|++||+|+.|.
T Consensus 234 ~ic~c~~~~~g~--~c~----~-~~C---------------~~~c~~~g~c~~----G~CIC~~Gf~G~dC~-------- 279 (525)
T KOG1225|consen 234 GICECPEGYFGP--LCS----T-IYC---------------PGGCTGRGQCVE----GRCICPPGFTGDDCD-------- 279 (525)
T ss_pred ceeecCCceeCC--ccc----c-ccC---------------CCCCcccceEeC----CeEeCCCCCcCCCCC--------
Confidence 379999999998 664 1 122 355777788887 589999999999974
Q ss_pred CCCCCCCCCCc--CCCCCCEeeCceeecCCCcccCCcccCCCCCccCCCCCCCcccccCcccCCCCCCCCCCCCeEeeeC
Q psy13159 139 PVQPIIQEDTC--NCVPNAECRDGVCVCLPDYYGDGYVSCRPECVVNSECPRNKACIKYKCKNPCVPGTCGEGAICDVVN 216 (660)
Q Consensus 139 ~~~~~~~~~~C--~C~~~g~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~ 216 (660)
+-.| .|+.++.+++++|+|++||+|..|+. ..| ...|..++.|++
T Consensus 280 -------e~~Cp~~cs~~g~~~~g~CiC~~g~~G~dCs~-----------------------~~c-padC~g~G~Ci~-- 326 (525)
T KOG1225|consen 280 -------ELVCPVDCSGGGVCVDGECICNPGYSGKDCSI-----------------------RRC-PADCSGHGKCID-- 326 (525)
T ss_pred -------cccCCcccCCCceecCCEeecCCCcccccccc-----------------------ccC-CccCCCCCcccC--
Confidence 2223 47889999999999999999996541 223 356888899982
Q ss_pred CceEeeCCCCCccCCCccccCCCCCCCcCCCCCCCCCCCCCeeecCCCceeeecCCCccCC
Q psy13159 217 HAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPCQPSPCGPNSQCREVNKQAVCSCLPNYFGS 277 (660)
Q Consensus 217 ~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~ 277 (660)
+ +|.|.+||+|..|. . . +|.+++.|++. |+|..||.|.
T Consensus 327 G--~C~C~~Gy~G~~C~---------~--~-----~C~~~g~cv~g-----C~C~~Gw~G~ 364 (525)
T KOG1225|consen 327 G--ECLCDEGYTGELCI---------Q--R-----ACSGGGQCVNG-----CKCKKGWRGP 364 (525)
T ss_pred C--ceEeCCCCcCCccc---------c--c-----ccCCCceeccC-----ceeccCccCC
Confidence 2 69999999999653 1 1 38889999873 9999999998
No 12
>KOG4260|consensus
Probab=98.87 E-value=2.1e-09 Score=103.42 Aligned_cols=163 Identities=23% Similarity=0.526 Sum_probs=110.5
Q ss_pred eCCCCCccCCCCCcccCCCCCCCCCCcccccccccCCCCCCCCCCCeeE---ecCCCceeeCCCCCccCCCccCCCCCCC
Q psy13159 62 SCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCK---VINHTPICTCPDGFIGDAFLSCHPKPPE 138 (660)
Q Consensus 62 ~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~---~~~gs~~C~C~~Gy~G~~c~~C~~~~~~ 138 (660)
-|++|-+|. +|. .|..+. ..+|..+|.|. ...|+..|.|.+||+|..|..|.+.+.+
T Consensus 131 CCp~gtyGp--dCl-------~Cpggs-----------er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfe 190 (350)
T KOG4260|consen 131 CCPDGTYGP--DCL-------QCPGGS-----------ERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFE 190 (350)
T ss_pred ccCCCCcCC--ccc-------cCCCCC-----------cCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHH
Confidence 388999998 674 333221 24677788887 3667899999999999999888765544
Q ss_pred CCCCCCCCCCc-CCCC--CCEeeCc---ee-ecCCCcccCCcccCCCCCccCCCCCCCcccccCcccCCCCCCCCCCCCe
Q psy13159 139 PVQPIIQEDTC-NCVP--NAECRDG---VC-VCLPDYYGDGYVSCRPECVVNSECPRNKACIKYKCKNPCVPGTCGEGAI 211 (660)
Q Consensus 139 ~~~~~~~~~~C-~C~~--~g~C~~~---~C-~C~~G~~G~~c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~ 211 (660)
..+.- ..-.| .|+. .++|... .| +|+.||..+. ..|++++||.+. +.+|..+..
T Consensus 191 s~Rne-~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~lde-----~gCvDvnEC~~e-------------p~~c~~~qf 251 (350)
T KOG4260|consen 191 SSRNE-QHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLDE-----EGCVDVNECQNE-------------PAPCKAHQF 251 (350)
T ss_pred hhccc-ccchhhhhhhhhhcccCCCCCCChhhhcccceecc-----cccccHHHHhcC-------------CCCCChhhe
Confidence 32210 11112 1221 1245432 35 7899998873 358887666542 567889999
Q ss_pred EeeeCCceEeeCCCCCccCCCccccCCCCCCCcCCCCCC--CCC-CCCCeeecCCCceeeecCCCccCC
Q psy13159 212 CDVVNHAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPCQP--SPC-GPNSQCREVNKQAVCSCLPNYFGS 277 (660)
Q Consensus 212 C~~~~~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C~~--~~C-~~~g~C~~~~g~y~C~C~~Gy~g~ 277 (660)
|+|+.|||+|..++||.+. - ++|.. ..| ..+..|.|+.++|+|+|..|+.-.
T Consensus 252 CvNteGSf~C~dk~Gy~~g-~-------------d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 252 CVNTEGSFKCEDKEGYKKG-V-------------DECQFCADVCASKNRPCMNIDGQYRCVCFSGLIII 306 (350)
T ss_pred eecCCCceEecccccccCC-h-------------HHhhhhhhhcccCCCCcccCCccEEEEecccceee
Confidence 9999999999999999873 1 23321 122 256789999999999999998643
No 13
>KOG0994|consensus
Probab=98.59 E-value=6.5e-07 Score=101.12 Aligned_cols=199 Identities=20% Similarity=0.421 Sum_probs=109.0
Q ss_pred eeeCCCCcee-ecCCCccCCCCCCccCCcCCCCCCCCcccccCCCCCCCCCCCCCCCeEe--ecCCceeeeCCCCCccCC
Q psy13159 410 CRDIGGSPSC-SCLPNYIGSPPNCRPECVMNSECPSNEACINEKCGDPCPGSCGYNAQCK--VINHTPICTCPDGFIGDP 486 (660)
Q Consensus 410 C~~~~g~~~C-~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~C~~~C~~~~~C~--~~~g~~~C~C~~Gy~G~~ 486 (660)
|.+...++.| .|..||.|++..-. ...|.+= .|..+. ..-=++.-.|. +......|.|.+||+|..
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg~-----g~~CrPC-pCP~gp-----~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~R 946 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLGS-----GIGCRPC-PCPDGP-----ASGRQHADSCYLDTRTQQIVCHCQEGYSGSR 946 (1758)
T ss_pred ccccccccchhhhhccccCCcccCC-----CCCCCCC-CCCCCC-----ccchhccccccccccccceeeecccCccccc
Confidence 5566677888 59999999853211 1111110 010000 00011112343 223457899999999999
Q ss_pred CCcCCCCCCCC---CCCCC-CCCCCCCCCCCCCCCCCC-C---ceeecCCCcee-eCCCCcccCCCCCccCCccCCCCCC
Q psy13159 487 FTLCSPKPPEP---RPPPQ-EDVPEPVNPCYPSPCGPY-S---QCRDIGGSPSC-SCLPNYIGAPPNCRPECVQNNDCSN 557 (660)
Q Consensus 487 c~~C~~~~~~~---~~~~~-~~~~~~i~eC~~~~C~~~-g---~C~~~~g~y~C-~C~~G~~g~~~~C~~~C~~~~~C~~ 557 (660)
|..|.++.+.. +++++ -+|...||.=.+..|... | +|+....+-+| .|++||.|+.. ...|.. -.|..
T Consensus 947 Ce~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~--~q~Cqr-C~Cn~ 1023 (1758)
T KOG0994|consen 947 CEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDAL--RQNCQR-CVCNF 1023 (1758)
T ss_pred hhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHH--Hhhhhh-heccc
Confidence 99998765433 33333 233344554444455432 2 35544455567 69999999832 111110 01111
Q ss_pred CCccccccccCCCCCCCCCCCeEEecCCceEeeCCCCCccCCCcCCCCCCCCCCCC----CC--C--CCCeEeecCCcCC
Q psy13159 558 DKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAFSGCYPKPPEQQQL----KR--D--RGGILVLLPITRR 629 (660)
Q Consensus 558 ~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~----~C--~--~~g~C~~~~~~~~ 629 (660)
-+ .+ +.+.|... +.+|.|.+...|..|+.|.++.+-..+. || . .+-.|.
T Consensus 1024 LG--Tn------------~~~~CDr~--tGQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN------- 1080 (1758)
T KOG0994|consen 1024 LG--TN------------STCHCDRF--TGQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCN------- 1080 (1758)
T ss_pred cc--cC------------Cccccccc--cCcCCCCcccccccccccccchhccccCCCCCccCCCccCCcccc-------
Confidence 10 00 11234333 4489999999999999998876544333 22 2 233555
Q ss_pred CCceeeecCCCCCCcc
Q psy13159 630 KIKYECRCRRRRGRKR 645 (660)
Q Consensus 630 ~~~~~C~C~~Gy~g~c 645 (660)
..+++|+|++||+|+-
T Consensus 1081 ~ftGQCqCkpGfGGR~ 1096 (1758)
T KOG0994|consen 1081 EFTGQCQCKPGFGGRT 1096 (1758)
T ss_pred ccccceeccCCCCCcc
Confidence 4589999999997753
No 14
>KOG4260|consensus
Probab=98.50 E-value=1.3e-07 Score=91.24 Aligned_cols=160 Identities=27% Similarity=0.575 Sum_probs=97.4
Q ss_pred CCCCCccCCCccCCCCCCCCCCCCCCCCCcCCCCCCEee-------CceeecCCCcccCCcccCCCCCccCCCCCCCccc
Q psy13159 120 CPDGFIGDAFLSCHPKPPEPVQPIIQEDTCNCVPNAECR-------DGVCVCLPDYYGDGYVSCRPECVVNSECPRNKAC 192 (660)
Q Consensus 120 C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C~C~~~g~C~-------~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~~~C 192 (660)
|++|-.|..|..|.-. .+=+|..+|.|. ++.|.|.+||+|..|..+..+--....-..+-.|
T Consensus 132 Cp~gtyGpdCl~Cpgg-----------ser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvC 200 (350)
T KOG4260|consen 132 CPDGTYGPDCLQCPGG-----------SERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVC 200 (350)
T ss_pred cCCCCcCCccccCCCC-----------CcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchh
Confidence 8899999998766522 222478888886 2469999999999877543210000000000001
Q ss_pred ccCcccCCCCCCCCCCCCeEeeeCCceEe-eCCCCCccCCCccccCCCCCCCcCCCCC--CCCCCCCCeeecCCCceeee
Q psy13159 193 IKYKCKNPCVPGTCGEGAICDVVNHAVVC-TCPPGTTGSPLVLCRPIQNEPVYTNPCQ--PSPCGPNSQCREVNKQAVCS 269 (660)
Q Consensus 193 ~~~~C~~~C~~~~C~~~~~C~~~~~~~~C-~C~~G~~G~~c~~C~~~~~~~~~~~~C~--~~~C~~~g~C~~~~g~y~C~ 269 (660)
. .| ..+|. +.|... ++-.| .|..||..+.- -| +|||+|. +.||..+..|+|+.|+|+|.
T Consensus 201 t------~C-h~~C~--~~Csg~-~~k~C~kCkkGW~lde~-gC-------vDvnEC~~ep~~c~~~qfCvNteGSf~C~ 262 (350)
T KOG4260|consen 201 T------AC-HEGCL--GVCSGE-SSKGCSKCKKGWKLDEE-GC-------VDVNECQNEPAPCKAHQFCVNTEGSFKCE 262 (350)
T ss_pred h------hh-hhhhh--cccCCC-CCCChhhhcccceeccc-cc-------ccHHHHhcCCCCCChhheeecCCCceEec
Confidence 0 01 11121 133221 22335 59999998754 36 5667885 67899999999999999999
Q ss_pred cCCCccCCCCCCc---CCCccCCCCCCCCCccCCcccCCCCCCCCCCccccccCceeeEecCCCcc
Q psy13159 270 CLPNYFGSPPNCR---PECTVNTDCPLNKACVNQKCVDPCPGSCGENRELDAQRFLVSSVCLPDYY 332 (660)
Q Consensus 270 C~~Gy~g~~~~C~---~~C~~~~~C~~~~~C~~~~C~~~c~g~C~~~~~~~~~~~~~~C~C~~G~~ 332 (660)
.++||.+....|+ +-|... +.+|.|. .+.|+|+|..|+.
T Consensus 263 dk~Gy~~g~d~C~~~~d~~~~k-----n~~c~ni-------------------~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 263 DKEGYKKGVDECQFCADVCASK-----NRPCMNI-------------------DGQYRCVCFSGLI 304 (350)
T ss_pred ccccccCChHHhhhhhhhcccC-----CCCcccC-------------------CccEEEEecccce
Confidence 9999987423333 222221 2233332 6789999998875
No 15
>KOG1226|consensus
Probab=98.18 E-value=1.3e-05 Score=88.70 Aligned_cols=137 Identities=28% Similarity=0.601 Sum_probs=91.7
Q ss_pred CCCCCCCCeeEecCCCceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCc-------CCCCCCEeeCceeecCCCcc---
Q psy13159 100 PGSCGYNAQCKVINHTPICTCPDGFIGDAFLSCHPKPPEPVQPIIQEDTC-------NCVPNAECRDGVCVCLPDYY--- 169 (660)
Q Consensus 100 ~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C-------~C~~~g~C~~~~C~C~~G~~--- 169 (660)
...|+.+|+.+- ..|.|.+||.|..|+ |........ ...+.| +|+.+|.|+=++|+|.+...
T Consensus 466 s~~C~g~G~~~C----G~C~C~~G~~G~~CE-C~~~~~ss~---~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i 537 (783)
T KOG1226|consen 466 SALCHGNGTFVC----GQCRCDEGWLGKKCE-CSTDELSSS---EEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKI 537 (783)
T ss_pred ccccCCCCcEEe----cceecCCCCCCCccc-CCccccCcH---hHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCce
Confidence 356776666554 478999999999986 543211100 001333 69999999999999998877
Q ss_pred -cCCcccCCCCCccCCCCCCCcccccCcccCCCCCCCCCCCCeEeeeCCceEeeCCCCCccCCCccccCCCCCCCcCCCC
Q psy13159 170 -GDGYVSCRPECVVNSECPRNKACIKYKCKNPCVPGTCGEGAICDVVNHAVVCTCPPGTTGSPLVLCRPIQNEPVYTNPC 248 (660)
Q Consensus 170 -G~~c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~c~~C~~~~~~~~~~~~C 248 (660)
|..|+-+.-.|+.++ ...|..++.|.-. +|+|.+||+|..|. |. ...+.|
T Consensus 538 ~G~fCECDnfsC~r~~------------------g~lC~g~G~C~CG----~CvC~~GwtG~~C~-C~------~std~C 588 (783)
T KOG1226|consen 538 YGKFCECDNFSCERHK------------------GVLCGGHGRCECG----RCVCNPGWTGSACN-CP------LSTDTC 588 (783)
T ss_pred eeeeeeccCccccccc------------------CcccCCCCeEeCC----cEEcCCCCccCCCC-CC------CCCccc
Confidence 776652222222210 1247777777543 59999999999884 53 556777
Q ss_pred CCC---CCCCCCeeecCCCceeeecCCC-ccCC
Q psy13159 249 QPS---PCGPNSQCREVNKQAVCSCLPN-YFGS 277 (660)
Q Consensus 249 ~~~---~C~~~g~C~~~~g~y~C~C~~G-y~g~ 277 (660)
.+. -|...|+|.=. +|+|.+. |.|.
T Consensus 589 ~~~~G~iCSGrG~C~Cg----~C~C~~~~~sG~ 617 (783)
T KOG1226|consen 589 ESSDGQICSGRGTCECG----RCKCTDPPYSGE 617 (783)
T ss_pred cCCCCceeCCCceeeCC----ceEcCCCCcCcc
Confidence 542 47778888755 7899876 9998
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.10 E-value=1.2e-06 Score=62.25 Aligned_cols=34 Identities=32% Similarity=0.685 Sum_probs=30.5
Q ss_pred ccCCCCC--CCCCCCCccccCCCCceeeCCCCCccC
Q psy13159 37 YVNPCVP--SPCGPYSQCRDIGGSPSCSCLPNYIGA 70 (660)
Q Consensus 37 ~~d~C~~--~~C~~~g~C~~~~g~y~C~C~~G~~g~ 70 (660)
|||||+. ++|..+++|+|+.|+|+|+|++||+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 7999984 569989999999999999999999943
No 17
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.96 E-value=8.3e-06 Score=56.77 Aligned_cols=36 Identities=36% Similarity=0.918 Sum_probs=32.4
Q ss_pred ccCCCCC-CCCCCCCccccCCCCceeeCCCCCc-cCCCCC
Q psy13159 37 YVNPCVP-SPCGPYSQCRDIGGSPSCSCLPNYI-GAPPNC 74 (660)
Q Consensus 37 ~~d~C~~-~~C~~~g~C~~~~g~y~C~C~~G~~-g~~~~C 74 (660)
++|+|.. ++|.++++|+++.++|+|.|++||+ |. .|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~--~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR--NC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC--cC
Confidence 5789987 8999999999999999999999999 66 55
No 18
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.92 E-value=7.8e-06 Score=58.05 Aligned_cols=34 Identities=29% Similarity=0.756 Sum_probs=30.3
Q ss_pred CCCCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCc
Q psy13159 80 QNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFI 125 (660)
Q Consensus 80 ~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~ 125 (660)
|+|||..+ ++.|..+++|+|+.|+|+|.|++||+
T Consensus 1 DidEC~~~------------~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEG------------PHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTT------------SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCC------------CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 46788765 57898899999999999999999998
No 19
>KOG1226|consensus
Probab=97.87 E-value=4.9e-05 Score=84.32 Aligned_cols=128 Identities=27% Similarity=0.565 Sum_probs=92.1
Q ss_pred eeecCCCceeeeeecCCCCCCCCCcccccCCCC----CCCCCCCCccccCCCCceeeCCCCCc----cCCCCCcccCCCC
Q psy13159 10 FTASSGNTWKNIRFKNAPPPPQQDVQEYVNPCV----PSPCGPYSQCRDIGGSPSCSCLPNYI----GAPPNCRPECLQN 81 (660)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~C~----~~~C~~~g~C~~~~g~y~C~C~~G~~----g~~~~C~~~C~~~ 81 (660)
..|..||.|..|+.+....... ...+.|. ..+|+..|.|+=. +|+|.+... |. .|+ | |.
T Consensus 480 C~C~~G~~G~~CEC~~~~~ss~----~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~G~--fCE--C-Dn 546 (783)
T KOG1226|consen 480 CRCDEGWLGKKCECSTDELSSS----EEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIYGK--FCE--C-DN 546 (783)
T ss_pred eecCCCCCCCcccCCccccCcH----hHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCceeee--eee--c-cC
Confidence 4678999999999977653322 2255675 2379999999843 899998887 55 675 2 22
Q ss_pred CCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCc------CCCCCC
Q psy13159 82 SECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFIGDAFLSCHPKPPEPVQPIIQEDTC------NCVPNA 155 (660)
Q Consensus 82 ~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C------~C~~~g 155 (660)
-.|.... ...|+.+|+|.= .+|+|.+||+|..|. |.. +.+.| .|+.+|
T Consensus 547 fsC~r~~-----------g~lC~g~G~C~C----G~CvC~~GwtG~~C~-C~~----------std~C~~~~G~iCSGrG 600 (783)
T KOG1226|consen 547 FSCERHK-----------GVLCGGHGRCEC----GRCVCNPGWTGSACN-CPL----------STDTCESSDGQICSGRG 600 (783)
T ss_pred ccccccc-----------CcccCCCCeEeC----CcEEcCCCCccCCCC-CCC----------CCccccCCCCceeCCCc
Confidence 2232210 257999999975 489999999999985 552 44555 689999
Q ss_pred EeeCceeecCCC-cccCCcccC
Q psy13159 156 ECRDGVCVCLPD-YYGDGYVSC 176 (660)
Q Consensus 156 ~C~~~~C~C~~G-~~G~~c~~~ 176 (660)
+|.=++|+|... |+|..|+.+
T Consensus 601 ~C~Cg~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 601 TCECGRCKCTDPPYSGEFCEKC 622 (783)
T ss_pred eeeCCceEcCCCCcCcchhhcC
Confidence 999999999765 999988744
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.87 E-value=3.9e-06 Score=55.62 Aligned_cols=30 Identities=37% Similarity=0.978 Sum_probs=27.9
Q ss_pred CCCCCCCCCCccccCC-CCceeeCCCCCccC
Q psy13159 41 CVPSPCGPYSQCRDIG-GSPSCSCLPNYIGA 70 (660)
Q Consensus 41 C~~~~C~~~g~C~~~~-g~y~C~C~~G~~g~ 70 (660)
|.++||+++|+|++.. ++|+|+|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5678999999999988 99999999999997
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.66 E-value=3.1e-05 Score=51.28 Aligned_cols=30 Identities=30% Similarity=0.895 Sum_probs=27.4
Q ss_pred CCCCCCCCCCeeecCC-CceeeecCCCccCC
Q psy13159 248 CQPSPCGPNSQCREVN-KQAVCSCLPNYFGS 277 (660)
Q Consensus 248 C~~~~C~~~g~C~~~~-g~y~C~C~~Gy~g~ 277 (660)
|.++||.++|+|++.. ++|+|.|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4567999999999998 99999999999996
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.65 E-value=1.3e-05 Score=54.37 Aligned_cols=31 Identities=39% Similarity=0.942 Sum_probs=25.0
Q ss_pred CCCCCCCccccCCCCceeeCCCCCccCCCCC
Q psy13159 44 SPCGPYSQCRDIGGSPSCSCLPNYIGAPPNC 74 (660)
Q Consensus 44 ~~C~~~g~C~~~~g~y~C~C~~G~~g~~~~C 74 (660)
..|+.+|+|+++.++|+|+|++||+|+|..|
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcCC
Confidence 4699999999999999999999999998643
No 23
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.60 E-value=6.7e-05 Score=51.60 Aligned_cols=36 Identities=39% Similarity=0.936 Sum_probs=32.3
Q ss_pred ccCCCCC-CCCCCCCccccCCCCceeeCCCCCccCCCCC
Q psy13159 37 YVNPCVP-SPCGPYSQCRDIGGSPSCSCLPNYIGAPPNC 74 (660)
Q Consensus 37 ~~d~C~~-~~C~~~g~C~~~~g~y~C~C~~G~~g~~~~C 74 (660)
++|+|.. .+|.++++|++..++|+|.|++||.|. .|
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~--~C 37 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR--NC 37 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC--cC
Confidence 4788987 899999999999999999999999997 55
No 24
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.52 E-value=0.00011 Score=51.02 Aligned_cols=33 Identities=33% Similarity=0.852 Sum_probs=29.8
Q ss_pred CCCCCC-CCCCCCCceeecCCCceeeCCCCcc-cC
Q psy13159 509 VNPCYP-SPCGPYSQCRDIGGSPSCSCLPNYI-GA 541 (660)
Q Consensus 509 i~eC~~-~~C~~~g~C~~~~g~y~C~C~~G~~-g~ 541 (660)
+++|.. .+|.++++|+++.++|.|.|++||+ |.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 678877 7899989999999999999999999 65
No 25
>KOG1836|consensus
Probab=97.45 E-value=0.0096 Score=73.90 Aligned_cols=50 Identities=34% Similarity=0.787 Sum_probs=34.6
Q ss_pred eEe-eCCCCCccCCCccccCCCCCCCcCCCCCCCCCCCCCeeecCC--Cceeee-cCCCccCC
Q psy13159 219 VVC-TCPPGTTGSPLVLCRPIQNEPVYTNPCQPSPCGPNSQCREVN--KQAVCS-CLPNYFGS 277 (660)
Q Consensus 219 ~~C-~C~~G~~G~~c~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~--g~y~C~-C~~Gy~g~ 277 (660)
-+| +|..||.|..-. ...-+ |.+-+|.+++.|..+. ....|+ |++||+|.
T Consensus 756 ~~C~~C~~GfYg~~~~--------~~~~d-C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~ 809 (1705)
T KOG1836|consen 756 GQCAQCVDGFYGLPDL--------GTSGD-CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL 809 (1705)
T ss_pred CchhhhcCCCCCcccc--------CCCCC-CccCCCCCChhhcCcCcccceecCCCCCCCccc
Confidence 345 577888876431 01122 8888888888887554 577898 99999998
No 26
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.38 E-value=8.5e-05 Score=50.46 Aligned_cols=30 Identities=40% Similarity=0.854 Sum_probs=24.5
Q ss_pred CCCCCCCCeeEecCCCceeeCCCCCccCCC
Q psy13159 100 PGSCGYNAQCKVINHTPICTCPDGFIGDAF 129 (660)
Q Consensus 100 ~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c 129 (660)
.+.|+.+|+|+++.++|.|+|++||+|++.
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 367999999999999999999999999874
No 27
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.31 E-value=9.1e-05 Score=68.62 Aligned_cols=142 Identities=27% Similarity=0.587 Sum_probs=82.8
Q ss_pred CCCCCCeEeecCCceeeeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCC-----CCCCCCCceeecC-----CC
Q psy13159 460 SCGYNAQCKVINHTPICTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYP-----SPCGPYSQCRDIG-----GS 529 (660)
Q Consensus 460 ~C~~~~~C~~~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~-----~~C~~~g~C~~~~-----g~ 529 (660)
.|. +|..+...+.|.|.|.+||....-..| +...+|.. .+|+..++|++.. ..
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~EntC----------------E~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~ 69 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLKNENTC----------------EEKVECDKLENVNKPCGDYAKCINQANKGEERA 69 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEEETTEE----------------EE----SG-GGTTSEEETTEEEEE-SSTTSSTS
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEcccccc----------------ccceecCcccccCccccchhhhhcCCCccccee
Confidence 344 367888889999999999976543344 33445542 3799999999765 57
Q ss_pred ceeeCCCCcccCCCCCccCCccCCCCCCCCccccccccCCCCCCCCCCCeEEecC---CceEeeCCCCCccCCCcCCCCC
Q psy13159 530 PSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVIN---HTPICTCPDGYTGDAFSGCYPK 606 (660)
Q Consensus 530 y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~---g~~~C~C~~Gy~G~~c~~C~~~ 606 (660)
|.|.|.+||+.....|.+ +.|.. -.|. .|.|+-.+ ....|.|.-|+.-+.-..|.-.
T Consensus 70 ~~C~C~~gY~~~~~vCvp-----~~C~~--------------~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~ 129 (197)
T PF06247_consen 70 YKCDCINGYILKQGVCVP-----NKCNN--------------KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKT 129 (197)
T ss_dssp EEEEE-TTEEESSSSEEE-----GGGSS-----------------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEE
T ss_pred EEEecccCceeeCCeEch-----hhcCc--------------eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCC
Confidence 999999999998333321 12221 1355 67886553 2459999999982221223222
Q ss_pred CCCCCCCCCCCCCeEeecCCcCCCCceeeecCCCCCC
Q psy13159 607 PPEQQQLKRDRGGILVLLPITRRKIKYECRCRRRRGR 643 (660)
Q Consensus 607 ~~~~~~~~C~~~g~C~~~~~~~~~~~~~C~C~~Gy~g 643 (660)
-+-.-...|..+..|..+ .+-|+|.+.+||.+
T Consensus 130 G~T~C~LKCk~nE~CK~~-----~~~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 130 GETKCSLKCKENEECKLV-----DGYYKCVCKEGFPG 161 (197)
T ss_dssp E--------TTTEEEEEE-----TTEEEEEE-TT-EE
T ss_pred CccceeeecCCCcceeee-----CcEEEeecCCCCCC
Confidence 111223357888999998 77999999999943
No 28
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.07 E-value=0.0001 Score=68.23 Aligned_cols=147 Identities=24% Similarity=0.572 Sum_probs=84.9
Q ss_pred CceeeCCCCceeecCCCccCCCCCCccCCcCCCCCCCCcccccCCCCCCCCCCCCCCCeEeecC-----CceeeeCCCCC
Q psy13159 408 SQCRDIGGSPSCSCLPNYIGSPPNCRPECVMNSECPSNEACINEKCGDPCPGSCGYNAQCKVIN-----HTPICTCPDGF 482 (660)
Q Consensus 408 ~~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~C~~~C~~~~~C~~~~-----g~~~C~C~~Gy 482 (660)
|.-+...+.|.|.|.+||.... +..|....+|.... =+ ..+|...+.|++.. ..|.|.|.+||
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~~---EntCE~kv~C~~~e-~~--------~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY 78 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLKN---ENTCEEKVECDKLE-NV--------NKPCGDYAKCINQANKGEERAYKCDCINGY 78 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEEE---TTEEEE----SG-G-GT--------TSEEETTEEEEE-SSTTSSTSEEEEE-TTE
T ss_pred CEEEEccCceEEEcCCCcEEcc---ccccccceecCccc-cc--------CccccchhhhhcCCCcccceeEEEecccCc
Confidence 4566677889999999998752 22233345554410 01 24678889999876 47999999999
Q ss_pred ccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCceeec---CCCceeeCCCCcccCCCCCccCCccCCCCCCCC
Q psy13159 483 IGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSPCGPYSQCRDI---GGSPSCSCLPNYIGAPPNCRPECVQNNDCSNDK 559 (660)
Q Consensus 483 ~G~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~~~C~~~g~C~~~---~g~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~ 559 (660)
+-..- .|. .++|....|+ .|.|+-. +....|+|.-|+..+ +.+.|...+
T Consensus 79 ~~~~~-vCv-----------------p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~---------dn~kCtk~G 130 (197)
T PF06247_consen 79 ILKQG-VCV-----------------PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPD---------DNKKCTKTG 130 (197)
T ss_dssp EESSS-SEE-----------------EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETT---------TTTESEEEE
T ss_pred eeeCC-eEc-----------------hhhcCceecC-CCeEEecCCCCCCceeEeeeceEec---------cCCcccCCC
Confidence 86632 231 3566666788 5899833 334599999999832 112222111
Q ss_pred ccccccccCCCCCCCCCCCeEEecCCceEeeCCCCCccCCCc
Q psy13159 560 ACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAFS 601 (660)
Q Consensus 560 ~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c~ 601 (660)
+.+|...|..+..|..+.+-|+|.+.+||.++.-.
T Consensus 131 -------~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~~ 165 (197)
T PF06247_consen 131 -------ETKCSLKCKENEECKLVDGYYKCVCKEGFPGDGEG 165 (197)
T ss_dssp ---------------TTTEEEEEETTEEEEEE-TT-EEETTT
T ss_pred -------ccceeeecCCCcceeeeCcEEEeecCCCCCCCCCc
Confidence 11234567778999999999999999999876643
No 29
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.99 E-value=0.00089 Score=45.82 Aligned_cols=35 Identities=40% Similarity=0.985 Sum_probs=30.7
Q ss_pred CCCCCC-CCCCCCCceeeCCCCceeecCCCccCCCCCC
Q psy13159 396 VNPCIP-SPCGPYSQCRDIGGSPSCSCLPNYIGSPPNC 432 (660)
Q Consensus 396 ~d~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~C 432 (660)
+++|.. .+|.++++|++..++|.|.|++||.|. .|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~--~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR--NC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC--cC
Confidence 577876 789888999999999999999999997 55
No 30
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.82 E-value=0.0013 Score=44.43 Aligned_cols=30 Identities=40% Similarity=0.968 Sum_probs=26.2
Q ss_pred CCCC-CCCCCCCccccCCCCceeeCCCCCccC
Q psy13159 40 PCVP-SPCGPYSQCRDIGGSPSCSCLPNYIGA 70 (660)
Q Consensus 40 ~C~~-~~C~~~g~C~~~~g~y~C~C~~G~~g~ 70 (660)
+|.. ++|.++ +|+++.++|+|.|++||.|.
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence 3556 689988 99999999999999999993
No 31
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.76 E-value=0.0014 Score=44.07 Aligned_cols=30 Identities=37% Similarity=0.934 Sum_probs=27.1
Q ss_pred CC-CCCCCCCCccccCCCCceeeCCCCCccC
Q psy13159 41 CV-PSPCGPYSQCRDIGGSPSCSCLPNYIGA 70 (660)
Q Consensus 41 C~-~~~C~~~g~C~~~~g~y~C~C~~G~~g~ 70 (660)
|. ..+|.++++|++..++|+|.|++||.|.
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 44 6789999999999999999999999987
No 32
>KOG1836|consensus
Probab=96.69 E-value=0.062 Score=67.00 Aligned_cols=97 Identities=26% Similarity=0.481 Sum_probs=55.1
Q ss_pred eEecCCCcccCCcccccC-----------CccCCCCCCCC---cccccCCCCCCCCCCccCCCCeeeecCCCeeecCCCC
Q psy13159 324 SSVCLPDYYGDGYVSCRP-----------ECVLNSDCPSN---KACIRNKCKNPCVPGTCGEGAICDVFLLSFTAPPPPL 389 (660)
Q Consensus 324 ~C~C~~G~~g~~~~~c~~-----------~C~~~~~C~~~---~~C~~~~c~~~C~~~~C~~g~~C~~~~~~~~C~c~~g 389 (660)
.|.|++||+|..++.|.+ .+.- .+|..+ ..|....+...|+... .|..|. .|..|
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c-~~C~cngh~~~Cd~~tG~C~C~~~t--~G~~C~--------~C~~G 764 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPC-IPCDCNGHSNICDPRTGQCKCKHNT--FGGQCA--------QCVDG 764 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcc-cccccCCccccccCCCCceecccCC--CCCchh--------hhcCC
Confidence 399999999997765531 0000 111111 2333333332233333 233343 34444
Q ss_pred CCCCCC---CCCCCCCCCCCCCceeeCC--CCceee-cCCCccCCCCCCc
Q psy13159 390 ESPPEY---VNPCIPSPCGPYSQCRDIG--GSPSCS-CLPNYIGSPPNCR 433 (660)
Q Consensus 390 ~~~~~~---~d~C~~~~C~~~~~C~~~~--g~~~C~-C~~G~~g~~~~C~ 433 (660)
|....+ ...|.+-+|.+++.|..+. ....|+ |++||+|. +|+
T Consensus 765 fYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~--rCe 812 (1705)
T KOG1836|consen 765 FYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL--RCE 812 (1705)
T ss_pred CCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccc--ccc
Confidence 433221 1228888898888887643 577899 99999999 887
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.22 E-value=0.0062 Score=40.85 Aligned_cols=27 Identities=37% Similarity=0.860 Sum_probs=24.7
Q ss_pred CCCCCCCeEEecCCceEeeCCCCCccC
Q psy13159 572 GSCGYNALCKVINHTPICTCPDGYTGD 598 (660)
Q Consensus 572 ~~C~~~~~C~~~~g~~~C~C~~Gy~G~ 598 (660)
.+|.++++|++..++|+|.|+.||.|.
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCccc
Confidence 467778999999999999999999998
No 34
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.19 E-value=0.0046 Score=37.68 Aligned_cols=24 Identities=38% Similarity=0.663 Sum_probs=17.1
Q ss_pred CceeeCCCCCccCCCCCcccCCCCCC
Q psy13159 58 SPSCSCLPNYIGAPPNCRPECLQNSE 83 (660)
Q Consensus 58 ~y~C~C~~G~~g~~~~C~~~C~~~~~ 83 (660)
||+|+|++||+... -...|.|+||
T Consensus 1 sy~C~C~~Gy~l~~--d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP--DGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCC--CCCccccCCC
Confidence 69999999999752 1234567764
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.92 E-value=0.0094 Score=40.09 Aligned_cols=28 Identities=43% Similarity=1.036 Sum_probs=24.1
Q ss_pred CCC-CCCCCCCceeeCCCCceeecCCCccC
Q psy13159 399 CIP-SPCGPYSQCRDIGGSPSCSCLPNYIG 427 (660)
Q Consensus 399 C~~-~~C~~~~~C~~~~g~~~C~C~~G~~g 427 (660)
|.. .+|.++ +|++..++|.|.|++||.|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence 444 578887 9999999999999999998
No 36
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.67 E-value=0.0099 Score=36.24 Aligned_cols=23 Identities=30% Similarity=0.554 Sum_probs=16.5
Q ss_pred CceeeCCCCcccCCCCCccCCccCC
Q psy13159 529 SPSCSCLPNYIGAPPNCRPECVQNN 553 (660)
Q Consensus 529 ~y~C~C~~G~~g~~~~C~~~C~~~~ 553 (660)
||+|+|++||+.. .-...|.+||
T Consensus 1 sy~C~C~~Gy~l~--~d~~~C~DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLS--PDGRSCEDID 23 (24)
T ss_pred CEEeeCCCCCcCC--CCCCccccCC
Confidence 6999999999976 3234455554
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.35 E-value=0.018 Score=37.98 Aligned_cols=27 Identities=22% Similarity=0.601 Sum_probs=22.5
Q ss_pred CCCCCCCeeEecCCCceeeCCCCCccCCC
Q psy13159 101 GSCGYNAQCKVINHTPICTCPDGFIGDAF 129 (660)
Q Consensus 101 ~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c 129 (660)
..|+++|+|+.. ..+|+|.+||+|..|
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 468899999976 458999999999864
No 38
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.31 E-value=0.018 Score=38.03 Aligned_cols=23 Identities=35% Similarity=0.774 Sum_probs=20.6
Q ss_pred CCCCCCEee--CceeecCCCcccCC
Q psy13159 150 NCVPNAECR--DGVCVCLPDYYGDG 172 (660)
Q Consensus 150 ~C~~~g~C~--~~~C~C~~G~~G~~ 172 (660)
.|+++|+|+ .++|+|++||+|..
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPD 31 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCC
Confidence 388999999 67999999999985
No 39
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.58 E-value=0.028 Score=28.94 Aligned_cols=13 Identities=46% Similarity=1.211 Sum_probs=10.4
Q ss_pred EeeCCCCCccCCC
Q psy13159 588 ICTCPDGYTGDAF 600 (660)
Q Consensus 588 ~C~C~~Gy~G~~c 600 (660)
+|+|++||+|+.|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999875
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=92.68 E-value=0.059 Score=36.57 Aligned_cols=23 Identities=43% Similarity=0.847 Sum_probs=19.1
Q ss_pred CCCCCccccCCCCceeeCCCCCccC
Q psy13159 46 CGPYSQCRDIGGSPSCSCLPNYIGA 70 (660)
Q Consensus 46 C~~~g~C~~~~g~y~C~C~~G~~g~ 70 (660)
|++ .|++++++|+|.|++||+..
T Consensus 8 C~h--~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 8 CSH--ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSS--EEEEETTSEEEE-STTEEE-
T ss_pred cCC--CCccCCCceEeECCCCCEEC
Confidence 555 89999999999999999987
No 41
>KOG3512|consensus
Probab=91.21 E-value=1.9 Score=46.04 Aligned_cols=163 Identities=18% Similarity=0.337 Sum_probs=85.7
Q ss_pred CeEeecCCc-eeeeCCCCCccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-------------------CCcee
Q psy13159 465 AQCKVINHT-PICTCPDGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSPCGP-------------------YSQCR 524 (660)
Q Consensus 465 ~~C~~~~g~-~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~i~eC~~~~C~~-------------------~g~C~ 524 (660)
..|+-...+ ++|.|..+-+|..|..|.+....-+ +. ..-..++++|....|.. +++|+
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRP-W~-raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvCl 362 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRP-WG-RATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCL 362 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccccccCCC-cc-ccccCCCccccccccchhhhhcccchhhhcccCccccceEe
Confidence 357766554 9999999999999888876422110 00 00113456665444432 23454
Q ss_pred ----ecCCCceeeCCCCcccCCCCCccCCccCCCCCCCCccccccccCCCCCCCCCCCeEEecCCceEeeCCCCCccCCC
Q psy13159 525 ----DIGGSPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAF 600 (660)
Q Consensus 525 ----~~~g~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c 600 (660)
|+.|.+-=.|++||.-++..=. .+...|..-.|. ..=+.+.+|..+.| +|.|++|-+|..|
T Consensus 363 nCrHnTaGrhChyCreGyyRd~s~pl---------~hrkaCk~CdCh----pVGs~gktCNq~tG--qCpCkeGvtG~tC 427 (592)
T KOG3512|consen 363 NCRHNTAGRHCHYCREGYYRDGSKPL---------THRKACKACDCH----PVGSAGKTCNQTTG--QCPCKEGVTGLTC 427 (592)
T ss_pred ecccCCCCcccccccCccccCCCCCC---------chhhhhhhcCCc----ccccccccccccCC--cccCCCCCccccc
Confidence 3444432358999887642100 001111111110 01112345654444 8999999999999
Q ss_pred cCCCCCCCCCCC--CCCCC---C--CeEeecCCcCCCCceeeecCCCCCCccCc
Q psy13159 601 SGCYPKPPEQQQ--LKRDR---G--GILVLLPITRRKIKYECRCRRRRGRKRST 647 (660)
Q Consensus 601 ~~C~~~~~~~~~--~~C~~---~--g~C~~~~~~~~~~~~~C~C~~Gy~g~c~~ 647 (660)
..|.+...-..+ .||.. . -.+.+.. ....+.+.|+.+++++...
T Consensus 428 nrCa~gyqqsrs~vapcik~p~~~~~~~~s~v---e~qd~~s~Ck~~~~~~r~n 478 (592)
T KOG3512|consen 428 NRCAPGYQQSRSPVAPCIKIPTDAPTLGSSGV---EPQDQCSKCKASPGGKRLN 478 (592)
T ss_pred ccccchhhcccCCCcCceecCCCCccccCCCC---cchhccccCCCCCcceecc
Confidence 988776433322 23311 0 0111110 1235778899999886654
No 42
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=91.20 E-value=0.13 Score=34.85 Aligned_cols=21 Identities=43% Similarity=0.754 Sum_probs=17.8
Q ss_pred CceeecCCCceeeCCCCcccC
Q psy13159 521 SQCRDIGGSPSCSCLPNYIGA 541 (660)
Q Consensus 521 g~C~~~~g~y~C~C~~G~~g~ 541 (660)
..|++++++|+|.|++||+..
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-
T ss_pred CCCccCCCceEeECCCCCEEC
Confidence 479999999999999999987
No 43
>smart00051 DSL delta serrate ligand.
Probab=90.57 E-value=0.39 Score=37.24 Aligned_cols=48 Identities=23% Similarity=0.443 Sum_probs=33.6
Q ss_pred CceeeCCCCCccCCCCCcccCCCCCCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCccCCC
Q psy13159 58 SPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFIGDAF 129 (660)
Q Consensus 58 ~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c 129 (660)
.|.=.|.++|.|. .|...|...+ ....+.+|.. . ..++|.+||+|..|
T Consensus 16 ~~rv~C~~~~yG~--~C~~~C~~~~-------------------d~~~~~~Cd~-~--G~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGE--GCNKFCRPRD-------------------DFFGHYTCDE-N--GNKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCC--ccCCEeCcCc-------------------cccCCccCCc-C--CCEecCCCCcCCCC
Confidence 3456799999999 8876665432 2345667743 2 36899999999863
No 44
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.41 E-value=0.12 Score=34.92 Aligned_cols=34 Identities=26% Similarity=0.660 Sum_probs=22.6
Q ss_pred CCCCCCCCCCeeecCC-CceeeecCCCccCCCCCC
Q psy13159 248 CQPSPCGPNSQCREVN-KQAVCSCLPNYFGSPPNC 281 (660)
Q Consensus 248 C~~~~C~~~g~C~~~~-g~y~C~C~~Gy~g~~~~C 281 (660)
|...+|..++.|++.. |++.|+|..||..++..|
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 3345677899999776 999999999998654333
No 45
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.07 E-value=0.058 Score=36.46 Aligned_cols=31 Identities=29% Similarity=0.727 Sum_probs=22.4
Q ss_pred CCCCCCCCCCccccCC-CCceeeCCCCCccCC
Q psy13159 41 CVPSPCGPYSQCRDIG-GSPSCSCLPNYIGAP 71 (660)
Q Consensus 41 C~~~~C~~~g~C~~~~-g~y~C~C~~G~~g~~ 71 (660)
|...+|..|+.|++.. |++.|.|..||..++
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 5566788899999866 999999999998874
No 46
>smart00051 DSL delta serrate ligand.
Probab=88.67 E-value=0.62 Score=36.09 Aligned_cols=48 Identities=25% Similarity=0.459 Sum_probs=33.1
Q ss_pred CceeeCCCCcccCCCCCccCCccCCCCCCCCccccccccCCCCCCCCCCCeEEecCCceEeeCCCCCccCCC
Q psy13159 529 SPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAF 600 (660)
Q Consensus 529 ~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c 600 (660)
.+.-.|.++|.|. .|.+.|...+. ...+.+|.. .| .++|.+||+|..|
T Consensus 16 ~~rv~C~~~~yG~--~C~~~C~~~~d-------------------~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGE--GCNKFCRPRDD-------------------FFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCC--ccCCEeCcCcc-------------------ccCCccCCc-CC--CEecCCCCcCCCC
Confidence 3456899999999 88776654332 233556643 23 6899999999865
No 47
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=83.20 E-value=0.98 Score=44.81 Aligned_cols=38 Identities=24% Similarity=0.456 Sum_probs=30.3
Q ss_pred CCCcccccCCCC--CCCCCCCCccccCCCCceeeCCCCCccC
Q psy13159 31 QQDVQEYVNPCV--PSPCGPYSQCRDIGGSPSCSCLPNYIGA 70 (660)
Q Consensus 31 ~~~~~~~~d~C~--~~~C~~~g~C~~~~g~y~C~C~~G~~g~ 70 (660)
....+.++++|. .++|.. .|.++.|+|.|.|++||+..
T Consensus 180 ~~~~C~~~~~C~~~~~~c~~--~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQQ--VCISTPGSYLCACTEGYALL 219 (224)
T ss_pred ccccCcCchhhcCCCCCccc--eEEcCCCCEEeECCCCccCC
Confidence 334446889997 455764 79999999999999999885
No 48
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=82.21 E-value=1.3 Score=43.96 Aligned_cols=37 Identities=19% Similarity=0.511 Sum_probs=28.7
Q ss_pred cCCCCCCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCccC
Q psy13159 77 ECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFIGD 127 (660)
Q Consensus 77 ~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~ 127 (660)
.|.++++|... .+.|. ..|.++.|+|.|.|++||+..
T Consensus 183 ~C~~~~~C~~~------------~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 183 ICVVPDLCATL------------SHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cCcCchhhcCC------------CCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 45566777643 45675 589999999999999999864
No 49
>KOG1218|consensus
Probab=81.39 E-value=18 Score=37.67 Aligned_cols=45 Identities=20% Similarity=0.277 Sum_probs=28.2
Q ss_pred EeeCCCCCccCCCcCCCCCCCCCCCCCCCCCCeEeecCCcCCCCceeeecCCCC
Q psy13159 588 ICTCPDGYTGDAFSGCYPKPPEQQQLKRDRGGILVLLPITRRKIKYECRCRRRR 641 (660)
Q Consensus 588 ~C~C~~Gy~G~~c~~C~~~~~~~~~~~C~~~g~C~~~~~~~~~~~~~C~C~~Gy 641 (660)
.|.|++||.|..+.. . ...+.....+.+++.|+.. ...+.+.+++
T Consensus 163 ~c~c~~g~~g~~~~~-~-~~~c~~~~~~~~g~~C~~~-------~~~~~~~~~~ 207 (316)
T KOG1218|consen 163 ICTCQPGFVGVFCVE-S-CSGCSPLTACENGAKCNRS-------TGSCLCYPGP 207 (316)
T ss_pred ceeccCCcccccccc-c-CCCcCCCcccCCCCeeecc-------ccccccCCCC
Confidence 688999999988762 1 1113445577777788854 4455544444
No 50
>KOG1218|consensus
Probab=80.52 E-value=49 Score=34.29 Aligned_cols=14 Identities=29% Similarity=0.795 Sum_probs=11.8
Q ss_pred CCceeeCCCCCccC
Q psy13159 114 HTPICTCPDGFIGD 127 (660)
Q Consensus 114 gs~~C~C~~Gy~G~ 127 (660)
.+..|.|.+||+|.
T Consensus 13 ~~~~c~c~~~~~g~ 26 (316)
T KOG1218|consen 13 GSGQCFCDPGYTGR 26 (316)
T ss_pred CCCceecCCCcccc
Confidence 45689999999995
No 51
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled-coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. The consensus pattern of the LE domain, whose eight conserved cysteines form four disulphide bonds, is: xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx ('C': conserved cysteine involved in a disulphide bond; 'a': conserved aromatic residue; 'G': conserved glycine; lower case = less conserved). In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=79.09 E-value=1.9 Score=31.39 Aligned_cols=22 Identities=27% Similarity=0.504 Sum_probs=16.9
Q ss_pred CeeEecCCCceeeCCCCCccCCCc
Q psy13159 107 AQCKVINHTPICTCPDGFIGDAFL 130 (660)
Q Consensus 107 g~C~~~~gs~~C~C~~Gy~G~~c~ 130 (660)
++|... ..+|.|+++|+|..|+
T Consensus 11 ~~C~~~--~G~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPS--TGQCVCKPGTTGPRCD 32 (49)
T ss_dssp SSEEET--CEEESBSTTEESTTS-
T ss_pred CcccCC--CCEEeccccccCCcCc
Confidence 467663 3589999999999975
No 52
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with Kunitz domains (IPR002223 from INTERPRO)
Probab=76.18 E-value=2.8 Score=30.93 Aligned_cols=11 Identities=27% Similarity=0.555 Sum_probs=9.1
Q ss_pred eEecCCCcccC
Q psy13159 324 SSVCLPDYYGD 334 (660)
Q Consensus 324 ~C~C~~G~~g~ 334 (660)
+|+|++||+-.
T Consensus 38 ~C~C~~g~~~~ 48 (52)
T PF01683_consen 38 RCQCPPGYVEV 48 (52)
T ss_pred EeECCCCCEec
Confidence 89999998744
No 53
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consists of a long arm and three short globular arms. The long arm consists of a coiled-coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Besides different types of globular domains, each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines. The tertiary structure of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. The consensus pattern of the LE domain, whose eight conserved cysteines form four disulphide bonds, is: xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx ('C': conserved cysteine involved in a disulphide bond; 'a': conserved aromatic residue; 'G': conserved glycine; lower case = less conserved). In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen. The binding sites are located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility, which determine the spacing in the formation of laminin networks of basement membranes.; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=71.85 E-value=2.4 Score=30.81 Aligned_cols=27 Identities=33% Similarity=0.676 Sum_probs=19.8
Q ss_pred CeEEecCCceEeeCCCCCccCCCcCCCCC
Q psy13159 578 ALCKVINHTPICTCPDGYTGDAFSGCYPK 606 (660)
Q Consensus 578 ~~C~~~~g~~~C~C~~Gy~G~~c~~C~~~ 606 (660)
.+|... +.+|+|+++|+|..|+.|.+.
T Consensus 11 ~~C~~~--~G~C~C~~~~~G~~C~~C~~g 37 (49)
T PF00053_consen 11 QTCDPS--TGQCVCKPGTTGPRCDQCKPG 37 (49)
T ss_dssp SSEEET--CEEESBSTTEESTTS-EE-TT
T ss_pred CcccCC--CCEEeccccccCCcCcCCCCc
Confidence 467664 449999999999999977754
No 54
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), of which the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Probab=70.47 E-value=4.8 Score=29.45 Aligned_cols=21 Identities=29% Similarity=0.575 Sum_probs=17.6
Q ss_pred eEeeCCCCCccCCCcCCCCCC
Q psy13159 587 PICTCPDGYTGDAFSGCYPKP 607 (660)
Q Consensus 587 ~~C~C~~Gy~G~~c~~C~~~~ 607 (660)
.+|.|+++|+|..|+.|.+..
T Consensus 19 G~C~C~~~~~G~~C~~C~~g~ 39 (50)
T cd00055 19 GQCECKPNTTGRRCDRCAPGY 39 (50)
T ss_pred CEEeCCCcCCCCCCCCCCCCC
Confidence 389999999999999876553
No 55
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain.; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=69.87 E-value=1.8 Score=33.56 Aligned_cols=48 Identities=25% Similarity=0.450 Sum_probs=21.6
Q ss_pred CceeeCCCCCccCCCCCcccCCCCCCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCccCCC
Q psy13159 58 SPSCSCLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFIGDAF 129 (660)
Q Consensus 58 ~y~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c 129 (660)
.++-+|.+.|.|. .|...|...+.= ..+-+|.. .| .=+|.+||+|..|
T Consensus 16 ~~rv~C~~nyyG~--~C~~~C~~~~d~-------------------~ghy~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGP--NCSKFCKPRDDS-------------------FGHYTCDS-NG--NKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETT--TT-EE---EEET-------------------TEEEEE-S-S----EEE-TTEESTTS
T ss_pred EEEEECCCCCCCc--cccCCcCCCcCC-------------------cCCcccCC-CC--CCCCCCCCcCCCC
Confidence 5677899999999 887666543210 12345552 23 3479999999864
No 56
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with Kunitz domains (IPR002223 from INTERPRO)
Probab=69.16 E-value=7.2 Score=28.69 Aligned_cols=23 Identities=26% Similarity=0.681 Sum_probs=16.7
Q ss_pred CCCCCCCeeEecCCCceeeCCCCCccC
Q psy13159 101 GSCGYNAQCKVINHTPICTCPDGFIGD 127 (660)
Q Consensus 101 ~~C~~~g~C~~~~gs~~C~C~~Gy~G~ 127 (660)
..|..++.|++ .+|+|++||.-.
T Consensus 26 ~qC~~~s~C~~----g~C~C~~g~~~~ 48 (52)
T PF01683_consen 26 EQCIGGSVCVN----GRCQCPPGYVEV 48 (52)
T ss_pred CCCCCcCEEcC----CEeECCCCCEec
Confidence 44556778865 489999998643
No 57
>PHA02887 EGF-like protein; Provisional
Probab=64.36 E-value=6 Score=34.19 Aligned_cols=29 Identities=34% Similarity=0.646 Sum_probs=22.4
Q ss_pred CCCCCCCeEEec--CCceEeeCCCCCccCCCc
Q psy13159 572 GSCGYNALCKVI--NHTPICTCPDGYTGDAFS 601 (660)
Q Consensus 572 ~~C~~~~~C~~~--~g~~~C~C~~Gy~G~~c~ 601 (660)
+.|- +|+|.-. ...+.|.|+.||+|.+|+
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCD 122 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccCCCC
Confidence 3455 5788555 446899999999999987
No 58
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), of which the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies
Probab=64.08 E-value=7.4 Score=28.44 Aligned_cols=17 Identities=24% Similarity=0.548 Sum_probs=14.1
Q ss_pred ceeeCCCCCccCCCccC
Q psy13159 116 PICTCPDGFIGDAFLSC 132 (660)
Q Consensus 116 ~~C~C~~Gy~G~~c~~C 132 (660)
.+|.|++||+|..|+.|
T Consensus 19 G~C~C~~~~~G~~C~~C 35 (50)
T cd00055 19 GQCECKPNTTGRRCDRC 35 (50)
T ss_pred CEEeCCCcCCCCCCCCC
Confidence 47999999999987543
No 59
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=61.08 E-value=7 Score=34.37 Aligned_cols=29 Identities=31% Similarity=0.554 Sum_probs=22.7
Q ss_pred CCCCCCCeEEec--CCceEeeCCCCCccCCCc
Q psy13159 572 GSCGYNALCKVI--NHTPICTCPDGYTGDAFS 601 (660)
Q Consensus 572 ~~C~~~~~C~~~--~g~~~C~C~~Gy~G~~c~ 601 (660)
+.|.+ |+|.-. ...+.|.|..||+|.+|+
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCccccccc
Confidence 34554 488655 468999999999999987
No 60
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles. Most of the proteins within this family contain an apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=58.42 E-value=8.8 Score=33.27 Aligned_cols=33 Identities=36% Similarity=0.835 Sum_probs=26.3
Q ss_pred ccCCCC-CCCCCCCCccccCCCCceeeCCCCCccC
Q psy13159 37 YVNPCV-PSPCGPYSQCRDIGGSPSCSCLPNYIGA 70 (660)
Q Consensus 37 ~~d~C~-~~~C~~~g~C~~~~g~y~C~C~~G~~g~ 70 (660)
..|+|. ...|+.+|.|.. ..+..|.|.+||.-.
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 467887 588999999964 456689999999754
No 61
>KOG3516|consensus
Probab=57.66 E-value=7.3 Score=46.57 Aligned_cols=38 Identities=29% Similarity=0.772 Sum_probs=34.0
Q ss_pred cccCCCCCCCCCCCCccccCCCCceeeCC-CCCccCCCCCc
Q psy13159 36 EYVNPCVPSPCGPYSQCRDIGGSPSCSCL-PNYIGAPPNCR 75 (660)
Q Consensus 36 ~~~d~C~~~~C~~~g~C~~~~g~y~C~C~-~G~~g~~~~C~ 75 (660)
.-+|.|.+++|+++|.|......|.|.|. .||.|. .|.
T Consensus 543 ~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH 581 (1306)
T KOG3516|consen 543 GISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH 581 (1306)
T ss_pred ccccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence 34678889999999999998889999999 999999 776
No 62
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domain.
Probab=57.38 E-value=13 Score=26.65 Aligned_cols=21 Identities=29% Similarity=0.588 Sum_probs=17.3
Q ss_pred eEeeCCCCCccCCCcCCCCCC
Q psy13159 587 PICTCPDGYTGDAFSGCYPKP 607 (660)
Q Consensus 587 ~~C~C~~Gy~G~~c~~C~~~~ 607 (660)
.+|+|+++|+|..|+.|.+..
T Consensus 18 G~C~C~~~~~G~~C~~C~~g~ 38 (46)
T smart00180 18 GQCECKPNVTGRRCDRCAPGY 38 (46)
T ss_pred CEEECCCCCCCCCCCcCCCCc
Confidence 389999999999998776543
No 63
>KOG3512|consensus
Probab=53.07 E-value=52 Score=35.66 Aligned_cols=150 Identities=19% Similarity=0.269 Sum_probs=80.3
Q ss_pred eeeecCCCceeeeeecCCCCCC---CCCcccccCCCCCCCCCCC-------------------Cccc----cCCCCceee
Q psy13159 9 AFTASSGNTWKNIRFKNAPPPP---QQDVQEYVNPCVPSPCGPY-------------------SQCR----DIGGSPSCS 62 (660)
Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~d~C~~~~C~~~-------------------g~C~----~~~g~y~C~ 62 (660)
...|-+..+|..|....+.... ++.-..++++|....|..+ |+|+ |+.|.+-=.
T Consensus 296 tCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhChy 375 (592)
T KOG3512|consen 296 TCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHCHY 375 (592)
T ss_pred EEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeecccCCCCccccc
Confidence 3456677788887777666443 4455567788876555433 3343 233333335
Q ss_pred CCCCCccCCCCCcccCCCCCCCCCCcccccccccCCCCCCCCCCCeeEecCCCceeeCCCCCccCCCccCCCCCCC---C
Q psy13159 63 CLPNYIGAPPNCRPECLQNSECPNDKACIREKCADPCPGSCGYNAQCKVINHTPICTCPDGFIGDAFLSCHPKPPE---P 139 (660)
Q Consensus 63 C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~---~ 139 (660)
|.+||+-++..= . ....+|+.-.|. ..=+.+-+|..+.| .|.|++|-+|..|..|.+..-- +
T Consensus 376 CreGyyRd~s~p------l---~hrkaCk~CdCh----pVGs~gktCNq~tG--qCpCkeGvtG~tCnrCa~gyqqsrs~ 440 (592)
T KOG3512|consen 376 CREGYYRDGSKP------L---THRKACKACDCH----PVGSAGKTCNQTTG--QCPCKEGVTGLTCNRCAPGYQQSRSP 440 (592)
T ss_pred ccCccccCCCCC------C---chhhhhhhcCCc----ccccccccccccCC--cccCCCCCcccccccccchhhcccCC
Confidence 899998764210 0 011112111010 01122456765555 7999999999999878764321 1
Q ss_pred CCCCCCCCCc---CCCCCCEeeCceeecCCCcccCCc
Q psy13159 140 VQPIIQEDTC---NCVPNAECRDGVCVCLPDYYGDGY 173 (660)
Q Consensus 140 ~~~~~~~~~C---~C~~~g~C~~~~C~C~~G~~G~~c 173 (660)
+.+++.++.= .++++.+=.+..+.|+.++.|-.+
T Consensus 441 vapcik~p~~~~~~~~s~ve~qd~~s~Ck~~~~~~r~ 477 (592)
T KOG3512|consen 441 VAPCIKIPTDAPTLGSSGVEPQDQCSKCKASPGGKRL 477 (592)
T ss_pred CcCceecCCCCccccCCCCcchhccccCCCCCcceec
Confidence 2233332221 244444422334788888877653
No 64
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=52.98 E-value=12 Score=32.97 Aligned_cols=30 Identities=17% Similarity=0.057 Sum_probs=24.4
Q ss_pred CCCCCCeEeecCCcCCCCceeeecCCCCCCccCc
Q psy13159 614 KRDRGGILVLLPITRRKIKYECRCRRRRGRKRST 647 (660)
Q Consensus 614 ~C~~~g~C~~~~~~~~~~~~~C~C~~Gy~g~c~~ 647 (660)
-|.+| .|...+. ...+.|.|..||.|.+++
T Consensus 52 YClHG-~C~yI~d---l~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 52 YCLHG-DCIHARD---IDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred EeECC-EEEeecc---CCCceeECCCCccccccc
Confidence 47664 8998876 789999999999886654
No 65
>PHA02887 EGF-like protein; Provisional
Probab=52.69 E-value=14 Score=32.03 Aligned_cols=30 Identities=13% Similarity=-0.006 Sum_probs=24.1
Q ss_pred CCCCCCeEeecCCcCCCCceeeecCCCCCCccCc
Q psy13159 614 KRDRGGILVLLPITRRKIKYECRCRRRRGRKRST 647 (660)
Q Consensus 614 ~C~~~g~C~~~~~~~~~~~~~C~C~~Gy~g~c~~ 647 (660)
-|- +|+|...+. .....|.|..||.|.+++
T Consensus 93 YCi-HG~C~yI~d---L~epsCrC~~GYtG~RCE 122 (126)
T PHA02887 93 FCI-NGECMNIID---LDEKFCICNKGYTGIRCD 122 (126)
T ss_pred Eee-CCEEEcccc---CCCceeECCCCcccCCCC
Confidence 466 579998876 889999999999886543
No 66
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=46.98 E-value=21 Score=23.81 Aligned_cols=14 Identities=43% Similarity=0.994 Sum_probs=11.4
Q ss_pred ceEeeCCCCCccCC
Q psy13159 586 TPICTCPDGYTGDA 599 (660)
Q Consensus 586 ~~~C~C~~Gy~G~~ 599 (660)
.++|.||+||..+.
T Consensus 17 ~~~C~CPeGyIlde 30 (34)
T PF09064_consen 17 PGQCFCPEGYILDE 30 (34)
T ss_pred CCceeCCCceEecC
Confidence 34899999998765
No 67
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=46.00 E-value=19 Score=31.12 Aligned_cols=25 Identities=44% Similarity=1.003 Sum_probs=20.2
Q ss_pred CCCCCCCeeEecCCCceeeCCCCCcc
Q psy13159 101 GSCGYNAQCKVINHTPICTCPDGFIG 126 (660)
Q Consensus 101 ~~C~~~g~C~~~~gs~~C~C~~Gy~G 126 (660)
..|+.+|.|.. ..+..|.|.+||.-
T Consensus 84 ~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 84 GFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCccEeCC-CCCCceECCCCcCC
Confidence 67999999953 45678999999963
No 68
>KOG3514|consensus
Probab=43.89 E-value=16 Score=43.31 Aligned_cols=35 Identities=23% Similarity=0.622 Sum_probs=31.6
Q ss_pred CCCCCCCCCCCccccCCCCceeeCC-CCCccCCCCCcc
Q psy13159 40 PCVPSPCGPYSQCRDIGGSPSCSCL-PNYIGAPPNCRP 76 (660)
Q Consensus 40 ~C~~~~C~~~g~C~~~~g~y~C~C~-~G~~g~~~~C~~ 76 (660)
.|+++||.|+|+|...-..|.|.|. .||.|. .|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccc
Confidence 7999999999999999999999996 688888 8873
No 69
>KOG3516|consensus
Probab=38.17 E-value=23 Score=42.62 Aligned_cols=41 Identities=29% Similarity=0.711 Sum_probs=35.7
Q ss_pred CCCCCCCCCCCCCCCCCCceeecCCCceeeCC-CCcccCCCCCc
Q psy13159 504 DVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCL-PNYIGAPPNCR 546 (660)
Q Consensus 504 ~~~~~i~eC~~~~C~~~g~C~~~~g~y~C~C~-~G~~g~~~~C~ 546 (660)
+.+.-++.|.+++|.++|.|......|.|.|. .||+|. .|.
T Consensus 540 d~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH 581 (1306)
T KOG3516|consen 540 DMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH 581 (1306)
T ss_pred cccccccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence 34466788999999999999999899999999 899998 665
No 70
>KOG3514|consensus
Probab=34.87 E-value=26 Score=41.60 Aligned_cols=36 Identities=25% Similarity=0.631 Sum_probs=32.1
Q ss_pred CCCCCCCCCCCceeecCCCceeeCC-CCcccCCCCCccC
Q psy13159 511 PCYPSPCGPYSQCRDIGGSPSCSCL-PNYIGAPPNCRPE 548 (660)
Q Consensus 511 eC~~~~C~~~g~C~~~~g~y~C~C~-~G~~g~~~~C~~~ 548 (660)
.|.++||.|+|+|.....+|.|.|. .||.|. .|+.+
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccce
Confidence 6889999999999999999999997 589999 88743
No 71
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=30.95 E-value=20 Score=30.63 Aligned_cols=33 Identities=18% Similarity=0.513 Sum_probs=24.3
Q ss_pred cCCCC--CCCCCCCCccccCC-----CCceeeCCCCCccC
Q psy13159 38 VNPCV--PSPCGPYSQCRDIG-----GSPSCSCLPNYIGA 70 (660)
Q Consensus 38 ~d~C~--~~~C~~~g~C~~~~-----g~y~C~C~~G~~g~ 70 (660)
.+.|. .+.|+.||.|++.. .=|.|+|.+.+...
T Consensus 5 ~~aC~~~Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~~ 44 (103)
T PF12955_consen 5 NDACENATNNCSGHGSCVKKYGSGGGDCFACKCKPTVVKT 44 (103)
T ss_pred HHHHHHhccCCCCCceEeeccCCCccceEEEEeecccccc
Confidence 34554 57799999999863 34999999966543
No 72
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=25.00 E-value=30 Score=29.57 Aligned_cols=26 Identities=15% Similarity=0.450 Sum_probs=20.2
Q ss_pred CCCCCCCCeeEecC-----CCceeeCCCCCc
Q psy13159 100 PGSCGYNAQCKVIN-----HTPICTCPDGFI 125 (660)
Q Consensus 100 ~~~C~~~g~C~~~~-----gs~~C~C~~Gy~ 125 (660)
.+.|+.||.|++.. .=|.|.|.+.+.
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~ 42 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTVV 42 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeecccc
Confidence 57899999999863 338999998543
No 73
>KOG3607|consensus
Probab=20.70 E-value=84 Score=36.79 Aligned_cols=47 Identities=26% Similarity=0.689 Sum_probs=36.5
Q ss_pred CCCCCCCCcccccccc-------CCCCCCCCCCCeEEecCCceEeeCCCCCccCCCc
Q psy13159 552 NNDCSNDKACINEKCQ-------DPCPGSCGYNALCKVINHTPICTCPDGYTGDAFS 601 (660)
Q Consensus 552 ~~~C~~~~~C~~~~C~-------~~c~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~c~ 601 (660)
...|.....|++..|+ +.|+..|+.+|+|.+.. .|.|.+||.+..|.
T Consensus 603 Gt~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn~~---~ChC~~gwapp~C~ 656 (716)
T KOG3607|consen 603 GTSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNNEL---NCHCEPGWAPPFCF 656 (716)
T ss_pred CCccCCCceecCCcchhhhhhcccccccccCCCcccCCCc---ceeeCCCCCCCccc
Confidence 3456667777777774 23577799999998766 69999999999887
Done!