Query psy17084
Match_columns 721
No_of_seqs 521 out of 2731
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 21:26:46 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17084.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17084hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1217|consensus 99.8 5.7E-18 1.2E-22 190.8 28.0 256 384-708 150-421 (487)
2 KOG1217|consensus 99.8 3.3E-18 7.1E-23 192.8 25.9 280 405-713 91-389 (487)
3 KOG4289|consensus 99.8 3.4E-18 7.3E-23 192.2 19.3 132 150-299 1179-1316(2531)
4 KOG4289|consensus 99.8 7.6E-18 1.6E-22 189.4 18.4 73 381-453 1217-1291(2531)
5 KOG0994|consensus 99.6 2.4E-15 5.3E-20 167.1 16.2 351 62-601 775-1146(1758)
6 KOG0994|consensus 99.6 2.3E-13 4.9E-18 151.8 26.9 275 284-602 781-1099(1758)
7 KOG1219|consensus 99.5 1.9E-14 4.1E-19 168.0 9.9 116 562-677 3860-3977(4289)
8 KOG1219|consensus 99.5 1.9E-14 4.1E-19 168.0 9.8 119 600-718 3860-3979(4289)
9 KOG1214|consensus 99.5 7.1E-13 1.5E-17 143.9 15.1 151 18-209 701-860 (1289)
10 KOG1214|consensus 99.4 1.4E-11 3E-16 134.1 16.0 218 58-328 702-947 (1289)
11 KOG1836|consensus 99.3 9E-10 1.9E-14 134.3 31.3 211 491-718 760-1023(1705)
12 KOG1836|consensus 99.3 7.5E-10 1.6E-14 135.0 30.3 244 323-602 696-974 (1705)
13 KOG1225|consensus 99.3 2.1E-11 4.7E-16 132.0 14.6 131 550-714 235-365 (525)
14 KOG1225|consensus 99.3 5.1E-11 1.1E-15 129.2 15.4 132 488-675 234-365 (525)
15 KOG4260|consensus 98.9 1.1E-09 2.4E-14 105.3 5.1 161 37-255 132-304 (350)
16 KOG4260|consensus 98.7 1.6E-08 3.4E-13 97.5 6.7 147 117-294 150-304 (350)
17 KOG1226|consensus 98.5 7E-07 1.5E-11 98.8 11.3 131 572-718 467-622 (783)
18 KOG1226|consensus 98.4 1.4E-06 3E-11 96.5 11.5 153 511-696 467-636 (783)
19 PF00008 EGF: EGF-like domain 97.8 1.2E-05 2.5E-10 53.2 2.6 29 54-82 2-31 (32)
20 PF00008 EGF: EGF-like domain 97.8 1.2E-05 2.6E-10 53.2 2.3 30 684-713 1-31 (32)
21 PF07645 EGF_CA: Calcium-bindi 97.8 1.8E-05 3.9E-10 56.2 2.8 32 680-711 1-34 (42)
22 PF07645 EGF_CA: Calcium-bindi 97.7 3E-05 6.4E-10 55.0 3.1 32 49-80 1-34 (42)
23 smart00179 EGF_CA Calcium-bind 97.7 4.8E-05 1E-09 53.0 4.1 34 51-84 3-38 (39)
24 smart00179 EGF_CA Calcium-bind 97.7 4.7E-05 1E-09 53.0 4.0 36 681-716 2-39 (39)
25 PF06247 Plasmod_Pvs28: Plasmo 97.4 0.00011 2.4E-09 68.0 2.8 136 574-715 8-164 (197)
26 cd00054 EGF_CA Calcium-binding 97.4 0.00026 5.6E-09 48.7 4.0 34 51-84 3-37 (38)
27 PF12947 EGF_3: EGF domain; I 97.3 0.00014 3.1E-09 49.2 1.8 28 688-715 7-34 (36)
28 cd00054 EGF_CA Calcium-binding 97.3 0.00038 8.2E-09 47.9 3.9 35 681-715 2-37 (38)
29 PF12662 cEGF: Complement Clr- 97.1 0.00052 1.1E-08 41.6 2.7 24 196-227 1-24 (24)
30 cd00053 EGF Epidermal growth f 96.9 0.0014 3E-08 44.3 3.8 28 686-713 5-32 (36)
31 cd00053 EGF Epidermal growth f 96.9 0.0015 3.3E-08 44.1 3.9 28 55-82 5-32 (36)
32 smart00181 EGF Epidermal growt 96.8 0.0017 3.8E-08 43.8 3.8 28 56-84 6-34 (35)
33 smart00181 EGF Epidermal growt 96.7 0.002 4.3E-08 43.5 3.8 31 684-715 2-34 (35)
34 PF12947 EGF_3: EGF domain; I 96.5 0.0017 3.7E-08 44.0 1.9 27 511-537 6-32 (36)
35 PF06247 Plasmod_Pvs28: Plasmo 96.4 0.00064 1.4E-08 63.1 -0.8 134 188-332 11-161 (197)
36 KOG1218|consensus 96.3 0.52 1.1E-05 49.7 21.1 47 488-536 162-209 (316)
37 PF07974 EGF_2: EGF-like domai 96.3 0.0043 9.3E-08 40.8 3.0 25 18-45 7-31 (32)
38 PF07974 EGF_2: EGF-like domai 96.3 0.0048 1E-07 40.6 3.0 26 57-84 7-32 (32)
39 KOG1218|consensus 96.3 0.62 1.3E-05 49.2 21.1 160 245-441 14-179 (316)
40 KOG3512|consensus 95.8 0.064 1.4E-06 56.8 10.3 94 491-602 375-479 (592)
41 PF12661 hEGF: Human growth fa 95.4 0.0073 1.6E-07 30.9 0.9 12 73-84 2-13 (13)
42 PF12662 cEGF: Complement Clr- 95.0 0.019 4.1E-07 34.9 2.1 11 321-331 1-11 (24)
43 PF12661 hEGF: Human growth fa 94.7 0.014 3E-07 29.9 0.8 8 628-635 3-10 (13)
44 KOG3512|consensus 94.6 0.22 4.9E-06 52.9 10.1 48 389-438 375-428 (592)
45 PF14670 FXa_inhibition: Coagu 94.4 0.023 5E-07 38.5 1.5 23 187-209 9-31 (36)
46 PF14670 FXa_inhibition: Coagu 94.1 0.033 7.1E-07 37.7 1.7 22 692-713 9-30 (36)
47 smart00051 DSL delta serrate l 92.4 0.21 4.5E-06 38.7 4.2 47 71-146 17-63 (63)
48 smart00051 DSL delta serrate l 92.3 0.17 3.7E-06 39.2 3.5 47 32-84 16-63 (63)
49 cd00055 EGF_Lam Laminin-type e 91.4 0.23 4.9E-06 36.6 3.2 31 518-561 13-43 (50)
50 PF00053 Laminin_EGF: Laminin 91.1 0.13 2.8E-06 37.7 1.6 33 517-562 11-43 (49)
51 smart00180 EGF_Lam Laminin-typ 89.3 0.39 8.5E-06 34.6 2.8 29 518-559 12-40 (46)
52 PF12946 EGF_MSP1_1: MSP1 EGF 88.8 0.3 6.6E-06 32.9 1.8 28 53-80 2-30 (37)
53 cd00055 EGF_Lam Laminin-type e 86.0 1.2 2.7E-05 32.6 3.9 21 63-85 13-33 (50)
54 PF00053 Laminin_EGF: Laminin 85.6 0.53 1.1E-05 34.4 1.8 32 351-398 11-42 (49)
55 cd01475 vWA_Matrilin VWA_Matri 84.7 0.78 1.7E-05 45.9 3.1 39 170-208 181-219 (224)
56 PF12946 EGF_MSP1_1: MSP1 EGF 83.9 0.78 1.7E-05 31.0 1.8 29 684-712 2-31 (37)
57 smart00180 EGF_Lam Laminin-typ 81.3 1.7 3.6E-05 31.3 2.9 22 361-396 19-40 (46)
58 cd01475 vWA_Matrilin VWA_Matri 79.0 2 4.4E-05 42.9 3.7 38 219-256 181-218 (224)
59 KOG3516|consensus 77.2 1.9 4.2E-05 51.4 3.2 44 6-51 541-585 (1306)
60 PF01414 DSL: Delta serrate li 74.9 1.3 2.9E-05 34.3 0.8 17 196-212 16-32 (63)
61 PHA03099 epidermal growth fact 71.0 3.8 8.2E-05 36.0 2.7 39 679-718 40-83 (139)
62 PF01414 DSL: Delta serrate li 70.2 1.3 2.8E-05 34.4 -0.3 14 488-501 17-30 (63)
63 PHA02887 EGF-like protein; Pro 70.2 4.2 9.1E-05 35.1 2.8 37 11-48 84-123 (126)
64 PHA02887 EGF-like protein; Pro 66.4 5.8 0.00013 34.3 2.8 16 624-639 107-122 (126)
65 PHA03099 epidermal growth fact 64.0 5.8 0.00012 34.9 2.4 29 56-85 51-81 (139)
66 KOG3516|consensus 60.3 7.5 0.00016 46.7 3.3 44 221-264 541-585 (1306)
67 PF00954 S_locus_glycop: S-loc 58.2 10 0.00022 33.0 3.2 33 9-43 76-108 (110)
68 KOG3514|consensus 57.5 6.3 0.00014 46.6 2.0 35 12-48 625-660 (1591)
69 KOG3514|consensus 55.1 7.2 0.00016 46.1 2.0 35 568-602 625-660 (1591)
70 PF01683 EB: EB module; Inter 54.6 18 0.00038 26.6 3.4 30 41-80 17-46 (52)
71 PF04863 EGF_alliinase: Alliin 49.4 10 0.00022 28.0 1.4 33 410-442 18-54 (56)
72 PF09064 Tme5_EGF_like: Thromb 46.9 12 0.00025 24.9 1.2 12 702-713 18-29 (34)
73 PF00954 S_locus_glycop: S-loc 43.4 23 0.00049 30.8 2.9 33 109-143 76-108 (110)
74 PF01683 EB: EB module; Inter 41.5 31 0.00066 25.3 2.9 27 111-142 20-46 (52)
75 KOG0196|consensus 35.9 77 0.0017 37.2 6.2 78 58-154 248-329 (996)
76 PF12955 DUF3844: Domain of un 32.8 36 0.00078 29.2 2.3 25 609-633 12-41 (103)
77 PF12955 DUF3844: Domain of un 32.7 41 0.00088 28.9 2.6 27 571-597 12-43 (103)
78 KOG3509|consensus 20.6 1.6E+02 0.0035 35.7 5.5 72 10-84 406-478 (964)
No 1
>KOG1217|consensus
Probab=99.81 E-value=5.7e-18 Score=190.82 Aligned_cols=256 Identities=36% Similarity=0.970 Sum_probs=201.8
Q ss_pred CccccccCCCCCCCCCCcCCCCCC--CCCCCCCCEEeeCCCceEEecCCCccCCCCcccCcccCCCCCCCCceeeccCCC
Q psy17084 384 DLYSCICKEGFEGPDCGQDINDCS--PQPCYNGGKCVDGVNWFLCECAPGFAGPDCRININECASNPCGYGKEILTVQSR 461 (721)
Q Consensus 384 ~~~~C~C~~G~~G~~C~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~C~~~~~~C~~~~C~~g~~~~~~~~~ 461 (721)
..+.|.|..||.+..+....++|. ..+|.+++.|.+..++|.|.|+++|++..++.. ..+
T Consensus 150 ~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~---------~~~--------- 211 (487)
T KOG1217|consen 150 GPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETT---------GNG--------- 211 (487)
T ss_pred CceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCC---------CCC---------
Confidence 467788888888888876657776 456999999999999999999999999988642 111
Q ss_pred CCCCCccccCCCccccceeeecCCCceEeccCCCCCCCcccccCCCCCCCCCCCCCEEecCCCCeeeecCCCcccccccC
Q psy17084 462 SRSPAYLCVSNPAYLVARCVSQSGGSFKCSCDAGFSGKYCHENINDCKHNPCQNGGTCVDKVNSFQCICRDGWEGEICAN 541 (721)
Q Consensus 462 ~~~~~~~C~~~~~~~~~~C~~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~ 541 (721)
+.|... +.|.+..||.+..+...+.++... + ++|.+..++|+|.|++||.+..+
T Consensus 212 ----------------~~c~~~----~~~~~~~g~~~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~-- 265 (487)
T KOG1217|consen 212 ----------------GTCVDS----VACSCPPGARGPECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDAC-- 265 (487)
T ss_pred ----------------ceEecc----eeccCCCCCCCCCcccccccccCC---C-CcccccCCceeeeCCCCcccccc--
Confidence 233332 578999999999998888777655 5 89999999999999999987642
Q ss_pred CCCCCCCceeeCCCCccCCccccCCCCCCCCC-CCCCCEEeecCCCeEEeccCCCccCcc--cccCCCC----CCCCCCC
Q psy17084 542 SNQSGGSFKCSCDAGFSGKYCHENINDCKHNP-CQNGGTCVDKVNSFQCICRDGWEGEIC--ANNKNEC----EPNPCKN 614 (721)
Q Consensus 542 ~~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~-C~~~g~C~~~~~~~~C~C~~G~~G~~C--~~~~~~C----~~~~C~~ 614 (721)
....++++|...+ |.++++|++..+.|.|.|++||+|..+ .....+| ...+|.+
T Consensus 266 -------------------~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~ 326 (487)
T KOG1217|consen 266 -------------------VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCAN 326 (487)
T ss_pred -------------------ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCC
Confidence 1124678887764 999999999998899999999999998 2344677 4567998
Q ss_pred CcEE--eeCCCCeEEecCCCCccccccCCCCCCCCCCCCCCCeeee-cCCCceeeCCCCCCCC-c---cccCccCCCCCC
Q psy17084 615 NGTC--IDGHADFTCLCKNGWKGKTCTSKNGHCDRGTCKHGGTCAD-LGSSFFCHCPPDWEGT-S---CHIGKLNACKSN 687 (721)
Q Consensus 615 ~g~C--~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~-~~~~~~C~C~~G~~G~-~---C~~~~~~~C~~~ 687 (721)
+++| ......+.|.|..||.|..|+...++|...++..++.|.+ ..+.|.|.|+.+|.+. . -...++++|..
T Consensus 327 g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~c~~- 405 (487)
T KOG1217|consen 327 GGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCEDIDECSG- 405 (487)
T ss_pred CcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCCCCeEecCCCccccCCccccccccccccccC-
Confidence 8899 3344578899999999999987656898888999999999 6889999999999984 1 11224455544
Q ss_pred CCCCCCeEeeCCCceeeecCC
Q psy17084 688 PCKNGGTCVNTGDLYSCICKE 708 (721)
Q Consensus 688 ~C~~~~~C~~~~~~~~C~C~~ 708 (721)
.+.|++..+++.|. ++
T Consensus 406 ----~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 406 ----CGDCVNGPGGGACT-PP 421 (487)
T ss_pred ----CcceeccCCCCccc-cC
Confidence 56799999999999 87
No 2
>KOG1217|consensus
Probab=99.81 E-value=3.3e-18 Score=192.77 Aligned_cols=280 Identities=36% Similarity=0.905 Sum_probs=217.2
Q ss_pred CCCCCCCCCCCEEeeCCCceEEecCCCccCCCCcccCcccCCCCCCCCceeeccCCCCCCCCccccCCCccccceeeec-
Q psy17084 405 DCSPQPCYNGGKCVDGVNWFLCECAPGFAGPDCRININECASNPCGYGKEILTVQSRSRSPAYLCVSNPAYLVARCVSQ- 483 (721)
Q Consensus 405 ~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~C~~~~~~C~~~~C~~g~~~~~~~~~~~~~~~~C~~~~~~~~~~C~~~- 483 (721)
.+...+....+.+......+.|.|++||.|..++... +|...+. . ....+.|...
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~~~~~~~-~C~~~~~------------~-----------~~~~~~c~~~~ 146 (487)
T KOG1217|consen 91 PCRSPCLLLCGECVDCVGSYECTCPPGYQGTPCEGEC-ECVTGPG------------V-----------CCIDGSCSNGP 146 (487)
T ss_pred cccCCcccCCccccCCCCCceeeCCCccccCcCCcce-eecCCCC------------C-----------eeCchhhcCCC
Confidence 3444444556677778888999999999999886422 3433321 0 0011223322
Q ss_pred -CCCceEeccCCCCCCCcccccCCCCC--CCCCCCCCEEecCCCCeeeecCCCcccccccCC---CCCCCCceeeCCCCc
Q psy17084 484 -SGGSFKCSCDAGFSGKYCHENINDCK--HNPCQNGGTCVDKVNSFQCICRDGWEGEICANS---NQSGGSFKCSCDAGF 557 (721)
Q Consensus 484 -~~~~~~C~C~~G~~G~~C~~~~~~C~--~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~---~~~~~~~~C~C~~G~ 557 (721)
....+.|.|..||.+..+....++|. ..+|.+++.|.+..++|.|.|+++|.+..++.. ......+.|.+..++
T Consensus 147 ~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~ 226 (487)
T KOG1217|consen 147 GSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGA 226 (487)
T ss_pred CCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCC
Confidence 23458999999999999987768897 456999999999999999999999999988765 111222568899999
Q ss_pred cCCccccCCCCCCCCCCCCCCEEeecCCCeEEeccCCCccCc--ccccCCCCCCCC-CCCCcEEeeCCCCeEEecCCCCc
Q psy17084 558 SGKYCHENINDCKHNPCQNGGTCVDKVNSFQCICRDGWEGEI--CANNKNECEPNP-CKNNGTCIDGHADFTCLCKNGWK 634 (721)
Q Consensus 558 ~G~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~--C~~~~~~C~~~~-C~~~g~C~~~~~~~~C~C~~G~~ 634 (721)
.+..+...+.++... + ++|.+..++|+|+|++||++.. ...++++|...+ |.++++|++..+.|.|+|++||+
T Consensus 227 ~~~~c~~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~ 302 (487)
T KOG1217|consen 227 RGPECEVSIVECASG---D-GTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFT 302 (487)
T ss_pred CCCCcccccccccCC---C-CcccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCC
Confidence 999998888777655 5 8999999999999999999997 345688898874 99999999998889999999999
Q ss_pred cccc--cCCCCCC----CCCCCCCCCeee--ecCCCceeeCCCCCCCCccccCccCCCCCCCCCCCCeEee-CCCceeee
Q psy17084 635 GKTC--TSKNGHC----DRGTCKHGGTCA--DLGSSFFCHCPPDWEGTSCHIGKLNACKSNPCKNGGTCVN-TGDLYSCI 705 (721)
Q Consensus 635 G~~C--~~~~~~C----~~~~C~~~~~C~--~~~~~~~C~C~~G~~G~~C~~~~~~~C~~~~C~~~~~C~~-~~~~~~C~ 705 (721)
|..| ......| ...+|.++++|. .....+.|.|..+|.|..|+... ++|...++..++.|++ ..++|.|.
T Consensus 303 g~~~~~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~-~~C~~~~~~~~~~c~~~~~~~~~c~ 381 (487)
T KOG1217|consen 303 GRLCTECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSN-DECASSPCCPGGTCVNETPGSYRCA 381 (487)
T ss_pred CCCCccccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCC-ccccCCccccCCEeccCCCCCeEec
Confidence 9998 2234566 345688888993 33446889999999999999322 5899888999999999 79999999
Q ss_pred cCCCCccC
Q psy17084 706 CKEGFVHA 713 (721)
Q Consensus 706 C~~G~~g~ 713 (721)
|+.+|.+.
T Consensus 382 ~~~~~~~~ 389 (487)
T KOG1217|consen 382 CPAGFAGK 389 (487)
T ss_pred CCCccccC
Confidence 99999873
No 3
>KOG4289|consensus
Probab=99.79 E-value=3.4e-18 Score=192.17 Aligned_cols=132 Identities=31% Similarity=0.716 Sum_probs=97.6
Q ss_pred CCCCCCCCCCCCCeeeecCCceeeccCCCCCCC--CCCCCCceeeCCCCeEeeCCCCCccCcCCCCCccCCCCCCCCCCC
Q psy17084 150 DNPCMMGPCGNGGQCKETAGQFQCVDHDHCNPN--PCLNGAPCFNTQADYYCHCTEDWEGKNCSFPRYKCDNPPCDDIDE 227 (721)
Q Consensus 150 ~~~C~~~~C~~~~~C~~~~~~~~C~~~~~C~~~--~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~c~~~~C~~~~~ 227 (721)
.+.|...||.+-.+|.....- |.-++- .=.--..=++..+.++|.|++||+|+.|+. .||+
T Consensus 1179 DniClrEPCenymkCvsvlrF------dssapf~~s~s~lfRpi~pvnglrCrCPpGFTgd~CeT-----------eiDl 1241 (2531)
T KOG4289|consen 1179 DNICLREPCENYMKCVSVLRF------DSSAPFLASDSVLFRPIHPVNGLRCRCPPGFTGDYCET-----------EIDL 1241 (2531)
T ss_pred CchhhcchhHHHHhhhhheee------cccCccccccceeeeeccccCceeEeCCCCCCcccccc-----------hhHh
Confidence 456777777777777554310 000000 000011224566789999999999999986 8899
Q ss_pred CCCCCCCCCCEEeeCCCCeEeecCCCCccCCCccCC--CCCCCCCCCCCCcceEec-CCCcEEeCCCC-CccCCCC
Q psy17084 228 CVSNPCQNGGTCVDLVDGYKCECTQAWEGSNCQYDA--DECQKSPCVNAALGCTNL-VGDYRCNCSPG-WTGHNCD 299 (721)
Q Consensus 228 C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~--~~C~~~~C~~~~~~C~~~-~g~~~C~C~~G-~~G~~C~ 299 (721)
|.+.||.++++|....|+|+|+|.+||+|..|+++. -.|.+..|.++ +.|++. .|.+.|.|+.| |++..|+
T Consensus 1242 CYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~ng-gtC~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1242 CYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNG-GTCVNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred hhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCC-CEEeecCCCceeccCCCcccCCCceE
Confidence 999999999999999999999999999999999765 45778888888 789886 67899999998 3444443
No 4
>KOG4289|consensus
Probab=99.77 E-value=7.6e-18 Score=189.40 Aligned_cols=73 Identities=33% Similarity=0.856 Sum_probs=63.1
Q ss_pred ccCCccccccCCCCCCCCCCcCCCCCCCCCCCCCCEEeeCCCceEEecCCCccCCCCcccC--cccCCCCCCCCc
Q psy17084 381 NTGDLYSCICKEGFEGPDCGQDINDCSPQPCYNGGKCVDGVNWFLCECAPGFAGPDCRINI--NECASNPCGYGK 453 (721)
Q Consensus 381 ~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~C~~~~--~~C~~~~C~~g~ 453 (721)
+..+.++|.|++||+|+.|+.+|+.|...||.++++|....|+|+|+|.+||+|.+||++. -.|...-|.+|.
T Consensus 1217 ~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~ngg 1291 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGG 1291 (2531)
T ss_pred cccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCC
Confidence 4456789999999999999999999999999999999999999999999999999999754 346555566554
No 5
>KOG0994|consensus
Probab=99.64 E-value=2.4e-15 Score=167.12 Aligned_cols=351 Identities=26% Similarity=0.653 Sum_probs=179.2
Q ss_pred CeeeecCCceEEecCCCCCCCccCccCCCCCcccccccccCccccc-cCCCCCCCCCCCCCCCeeccCCCCceeeeCCCC
Q psy17084 62 GQCKETAGQFQCVCAPGWTGPTCKIKHNFFPYLQFIPLTTSLFRLL-SDLNYCGTHEPCQNGGTCENTAPDQYLCTCPEG 140 (721)
Q Consensus 62 g~C~~~~g~~~C~C~~Gy~G~~C~~~~~~~~~~~c~c~~~~~~~~~-~~~~~C~~~~~C~~~g~C~~~~~~~~~C~C~~G 140 (721)
++|....| +|.|+|+-.|..|. .|. +|++|.. +--..|.-+..=..+..|.. -++.|.|.+|
T Consensus 775 ~vCn~~GG--qCqCkPnVVGR~Cd-----------qCA-pGtyGFGPsGCk~CdC~~~Gs~~~~Cd~---~tGQC~C~~g 837 (1758)
T KOG0994|consen 775 SVCNPNGG--QCQCKPNVVGRRCD-----------QCA-PGTYGFGPSGCKACDCNSIGSLDKYCDK---ITGQCQCRPG 837 (1758)
T ss_pred ccccCCCc--eecccCcccccccc-----------ccC-CcccCcCCccCccccccccccccccccc---cccceeeccc
Confidence 45655555 79999999999887 333 4444432 11111111111112235543 4568999999
Q ss_pred CcCCcccccCCCCCCCCCCCCCeeeecCCceeeccCCCCCCCCCCCCCceeeCCCCeEeeCCCCCccCcCCCCCccCCCC
Q psy17084 141 FSGINCEVVDNPCMMGPCGNGGQCKETAGQFQCVDHDHCNPNPCLNGAPCFNTQADYYCHCTEDWEGKNCSFPRYKCDNP 220 (721)
Q Consensus 141 y~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~c~~~ 220 (721)
-+|..|. .|.+||||-+ .|..-
T Consensus 838 ~ygrqCn----------------------------------------------------qCqpG~WgFP------eCr~C 859 (1758)
T KOG0994|consen 838 TYGRQCN----------------------------------------------------QCQPGYWGFP------ECRPC 859 (1758)
T ss_pred cchhhcc----------------------------------------------------ccCCCccCCC------cCccc
Confidence 9888776 6889998832 11111
Q ss_pred CCCC-CCCCCCC--CCCCCCEEeeCCCCeEe-ecCCCCccCCCccCCCCCCCCCCCCCC-------cceEe--cCCCcEE
Q psy17084 221 PCDD-IDECVSN--PCQNGGTCVDLVDGYKC-ECTQAWEGSNCQYDADECQKSPCVNAA-------LGCTN--LVGDYRC 287 (721)
Q Consensus 221 ~C~~-~~~C~~~--~C~~~~~C~~~~g~~~C-~C~~G~~G~~C~~~~~~C~~~~C~~~~-------~~C~~--~~g~~~C 287 (721)
.|.+ .+.|.+. .|. .|.+...++.| .|.+||.|+.=--....|.+-||..+. ..|.. ......|
T Consensus 860 qCNgHA~~Cd~~tGaCi---~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC 936 (1758)
T KOG0994|consen 860 QCNGHADTCDPITGACI---DCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVC 936 (1758)
T ss_pred cccCcccccCccccccc---cccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccccccccceee
Confidence 1111 1333321 121 35566777888 699999987433233455555554431 12321 1234578
Q ss_pred eCCCCCccCCCCcCCCCCCCCCCCCCEEeeCCCCeeeecCCCCcCCCcccCCCccCCCCCCCC-CEEeeCCCCceEecCC
Q psy17084 288 NCSPGWTGHNCDVNINDCVGQCRHGSTCIDLVNDFHCACLPGYTGRTCQTDINDCESSPCVNG-GECVDQVNGFRCICPV 366 (721)
Q Consensus 288 ~C~~G~~G~~C~~~~~~c~~~C~~~~~C~~~~~~~~C~C~~Gy~G~~C~~~~~~C~~~~C~~~-g~C~~~~~~~~C~C~~ 366 (721)
.|.+||+|..|+...+.-.+.=+.+|+|. .|.|.. .|+.-.+..|... |.|. +|..
T Consensus 937 ~C~~GY~G~RCe~CA~~~fGnP~~GGtCq------~CeC~~---------NiD~~d~~aCD~~TG~CL--------kCL~ 993 (1758)
T KOG0994|consen 937 HCQEGYSGSRCEICADNHFGNPSEGGTCQ------KCECSN---------NIDLYDPGACDVATGACL--------KCLY 993 (1758)
T ss_pred ecccCccccchhhhcccccCCcccCCccc------cccccC---------CcCccCCCccchhhchhh--------hhhh
Confidence 88888888888732211111112233332 233322 2222222222211 2221 2333
Q ss_pred CCcCCccCCCCeeeccCCccccccCCCCCCCCCCcCCCCCCCCCCC--CCCEEeeCCCceEEecCCCccCCCCcccCccc
Q psy17084 367 GFAGQLCENGGTCVNTGDLYSCICKEGFEGPDCGQDINDCSPQPCY--NGGKCVDGVNWFLCECAPGFAGPDCRININEC 444 (721)
Q Consensus 367 gy~g~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~--~~g~C~~~~g~~~C~C~~Gy~G~~C~~~~~~C 444 (721)
--.|..|+ .|.+||+|+.-.++...|.-+-=. +.+.|....| +|.|.+...|..|. .|
T Consensus 994 hTeG~hCe--------------~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tG--QCpClpNv~G~~CD----qC 1053 (1758)
T KOG0994|consen 994 HTEGDHCE--------------HCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTG--QCPCLPNVQGVRCD----QC 1053 (1758)
T ss_pred cccccchh--------------hccccchhHHHHhhhhhheccccccCCccccccccC--cCCCCccccccccc----cc
Confidence 44566665 688888887544332222100000 1123433334 78888888888875 24
Q ss_pred CCCCCCCCceeeccCCCCCCCCccccCCCccccceeeecCCCceEeccCCCCCCCcccccCCCCCCCCCCCCCEEecCCC
Q psy17084 445 ASNPCGYGKEILTVQSRSRSPAYLCVSNPAYLVARCVSQSGGSFKCSCDAGFSGKYCHENINDCKHNPCQNGGTCVDKVN 524 (721)
Q Consensus 445 ~~~~C~~g~~~~~~~~~~~~~~~~C~~~~~~~~~~C~~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~g 524 (721)
+.+...-. ...|--.|.|.+ ..+-+|..-.|
T Consensus 1054 A~N~w~la------------------------------SG~GCe~C~Cd~-------------------~~~pqCN~ftG 1084 (1758)
T KOG0994|consen 1054 AENHWNLA------------------------------SGEGCEPCNCDP-------------------IGGPQCNEFTG 1084 (1758)
T ss_pred ccchhccc------------------------------cCCCCCccCCCc-------------------cCCcccccccc
Confidence 43321100 000001122222 11225655555
Q ss_pred CeeeecCCCcccccccCCCCCCCCceeeCCCCccCCccccCCCCCCCCCCCCCC----EEeecCCCeEEeccCCCccCcc
Q psy17084 525 SFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSGKYCHENINDCKHNPCQNGG----TCVDKVNSFQCICRDGWEGEIC 600 (721)
Q Consensus 525 ~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g----~C~~~~~~~~C~C~~G~~G~~C 600 (721)
+|+|+|||-|..|+. |..-|+|..=. .|..-.|...| +|...+| +|+|.+|..|.+|
T Consensus 1085 --QCqCkpGfGGR~C~q-----------Cqel~WGdP~~----~C~aCdCd~rG~~tpQCdr~tG--~C~C~~Gv~G~rC 1145 (1758)
T KOG0994|consen 1085 --QCQCKPGFGGRTCSQ-----------CQELYWGDPNE----KCRACDCDPRGIETPQCDRATG--RCVCRPGVGGPRC 1145 (1758)
T ss_pred --ceeccCCCCCcchhH-----------HHHhhcCCCCC----CceecCCCCCCCCCCCccccCC--ceeecCCCCCcch
Confidence 899999999999987 88889886421 23222233322 4554444 7888888888887
Q ss_pred c
Q psy17084 601 A 601 (721)
Q Consensus 601 ~ 601 (721)
+
T Consensus 1146 d 1146 (1758)
T KOG0994|consen 1146 D 1146 (1758)
T ss_pred h
Confidence 4
No 6
>KOG0994|consensus
Probab=99.61 E-value=2.3e-13 Score=151.78 Aligned_cols=275 Identities=27% Similarity=0.643 Sum_probs=141.4
Q ss_pred CcEEeCCCCCccCCCCcCCCCC--------C-CCCC----CCCEEeeCCCCeeeecCCCCcCCCccc------CCCccCC
Q psy17084 284 DYRCNCSPGWTGHNCDVNINDC--------V-GQCR----HGSTCIDLVNDFHCACLPGYTGRTCQT------DINDCES 344 (721)
Q Consensus 284 ~~~C~C~~G~~G~~C~~~~~~c--------~-~~C~----~~~~C~~~~~~~~C~C~~Gy~G~~C~~------~~~~C~~ 344 (721)
+..|+|+|+-.|..|+.....- . -.|. -+..|....| .|.|.+|-+|+.|.+ ...+|.+
T Consensus 781 GGqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tG--QC~C~~g~ygrqCnqCqpG~WgFPeCr~ 858 (1758)
T KOG0994|consen 781 GGQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITG--QCQCRPGTYGRQCNQCQPGYWGFPECRP 858 (1758)
T ss_pred CceecccCccccccccccCCcccCcCCccCcccccccccccccccccccc--ceeeccccchhhccccCCCccCCCcCcc
Confidence 3467777777777766422110 0 0111 1223433333 677777766666542 2344555
Q ss_pred CCCCCCC-EEeeCCCCceE-ecCCCCcCCccCCCCeeeccCCccccccCCCCCCCCCCcCCCCCCCCCCCCCC-------
Q psy17084 345 SPCVNGG-ECVDQVNGFRC-ICPVGFAGQLCENGGTCVNTGDLYSCICKEGFEGPDCGQDINDCSPQPCYNGG------- 415 (721)
Q Consensus 345 ~~C~~~g-~C~~~~~~~~C-~C~~gy~g~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g------- 415 (721)
..|..++ +|....| .| .|..--+|..|+ .|..||+|+.-.-.-..|.|-||..+-
T Consensus 859 CqCNgHA~~Cd~~tG--aCi~CqD~T~G~~Cd--------------rCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A 922 (1758)
T KOG0994|consen 859 CQCNGHADTCDPITG--ACIDCQDSTTGHSCD--------------RCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHA 922 (1758)
T ss_pred ccccCcccccCcccc--ccccccccccccchh--------------hhhccccCCcccCCCCCCCCCCCCCCCccchhcc
Confidence 5554443 3433333 22 244444444444 688999987654444566776765431
Q ss_pred -EEee--CCCceEEecCCCccCCCCcccCcccCCCCCCCCceeeccCCCCCCC------CccccCCCccccceeeecCCC
Q psy17084 416 -KCVD--GVNWFLCECAPGFAGPDCRININECASNPCGYGKEILTVQSRSRSP------AYLCVSNPAYLVARCVSQSGG 486 (721)
Q Consensus 416 -~C~~--~~g~~~C~C~~Gy~G~~C~~~~~~C~~~~C~~g~~~~~~~~~~~~~------~~~C~~~~~~~~~~C~~~~~~ 486 (721)
.|.. ......|.|.+||+|.+|++ |+.+.-++....-.++.-..+. ...|....+ .--.|...+.|
T Consensus 923 ~sC~~d~~t~~ivC~C~~GY~G~RCe~----CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG-~CLkCL~hTeG 997 (1758)
T KOG0994|consen 923 DSCYLDTRTQQIVCHCQEGYSGSRCEI----CADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATG-ACLKCLYHTEG 997 (1758)
T ss_pred ccccccccccceeeecccCccccchhh----hcccccCCcccCCccccccccCCcCccCCCccchhhc-hhhhhhhcccc
Confidence 4532 22345899999999999974 5544332221111111111100 001111000 00122333334
Q ss_pred ceEe-ccCCCCCCCcccccCCCCCCCCCC--CCCEEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCcc----C
Q psy17084 487 SFKC-SCDAGFSGKYCHENINDCKHNPCQ--NGGTCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFS----G 559 (721)
Q Consensus 487 ~~~C-~C~~G~~G~~C~~~~~~C~~~~C~--~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~----G 559 (721)
..| .|.+||+|+.=.++...|.-+.=. +.+.|....| +|-|.|...|..|+. |.+.++ |
T Consensus 998 -~hCe~Ck~Gf~GdA~~q~CqrC~Cn~LGTn~~~~CDr~tG--QCpClpNv~G~~CDq-----------CA~N~w~laSG 1063 (1758)
T KOG0994|consen 998 -DHCEHCKDGFYGDALRQNCQRCVCNFLGTNSTCHCDRFTG--QCPCLPNVQGVRCDQ-----------CAENHWNLASG 1063 (1758)
T ss_pred -cchhhccccchhHHHHhhhhhheccccccCCccccccccC--cCCCCcccccccccc-----------cccchhccccC
Confidence 344 589999997543333333211000 1245555555 899999999999887 666554 4
Q ss_pred CccccCCCCCCCCCCCCCCEEeecCCCeEEeccCCCccCcccc
Q psy17084 560 KYCHENINDCKHNPCQNGGTCVDKVNSFQCICRDGWEGEICAN 602 (721)
Q Consensus 560 ~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~ 602 (721)
..|+ .|.-+| ..+-+|...+| +|+|.|||.|..|..
T Consensus 1064 ~GCe----~C~Cd~-~~~pqCN~ftG--QCqCkpGfGGR~C~q 1099 (1758)
T KOG0994|consen 1064 EGCE----PCNCDP-IGGPQCNEFTG--QCQCKPGFGGRTCSQ 1099 (1758)
T ss_pred CCCC----ccCCCc-cCCcccccccc--ceeccCCCCCcchhH
Confidence 4443 232222 22336666555 899999999998863
No 7
>KOG1219|consensus
Probab=99.53 E-value=1.9e-14 Score=167.98 Aligned_cols=116 Identities=36% Similarity=0.982 Sum_probs=98.5
Q ss_pred cccCCCCCCCCCCCCCCEEeecC-CCeEEeccCCCccCcccccCCCCCCCCCCCCcEEeeCCCCeEEecCCCCccccccC
Q psy17084 562 CHENINDCKHNPCQNGGTCVDKV-NSFQCICRDGWEGEICANNKNECEPNPCKNNGTCIDGHADFTCLCKNGWKGKTCTS 640 (721)
Q Consensus 562 C~~~~~~C~~~~C~~~g~C~~~~-~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~ 640 (721)
|....+.|..+||+++|+|.... ++|+|.|++-|+|++|+..+..|.++||..+|+|+...++|.|.|+.||+|.+||.
T Consensus 3860 C~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~ 3939 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA 3939 (4289)
T ss_pred ccccccccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec
Confidence 43334778888999999998764 56889999999999998888889989999989998888888999999999999987
Q ss_pred C-CCCCCCCCCCCCCeeeecCCCceeeCCCCCCCCccc
Q psy17084 641 K-NGHCDRGTCKHGGTCADLGSSFFCHCPPDWEGTSCH 677 (721)
Q Consensus 641 ~-~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~ 677 (721)
. +++|+.++|.++|.|+++.++|.|.|.+||.|..|.
T Consensus 3940 ~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~ 3977 (4289)
T KOG1219|consen 3940 RGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCC 3977 (4289)
T ss_pred ccccccccccccCCceeeccCCceEeccChhHhcccCc
Confidence 7 788888888888888888888888888888888875
No 8
>KOG1219|consensus
Probab=99.53 E-value=1.9e-14 Score=167.98 Aligned_cols=119 Identities=29% Similarity=0.845 Sum_probs=109.8
Q ss_pred ccccCCCCCCCCCCCCcEEeeCC-CCeEEecCCCCccccccCCCCCCCCCCCCCCCeeeecCCCceeeCCCCCCCCcccc
Q psy17084 600 CANNKNECEPNPCKNNGTCIDGH-ADFTCLCKNGWKGKTCTSKNGHCDRGTCKHGGTCADLGSSFFCHCPPDWEGTSCHI 678 (721)
Q Consensus 600 C~~~~~~C~~~~C~~~g~C~~~~-~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~ 678 (721)
|..-.+.|..+||.++|+|+..+ ++|.|.|++-|.|.+||..++.|.++||..+|+|+...++|.|.|+.||+|++|+.
T Consensus 3860 C~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~~Ce~ 3939 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGKRCEA 3939 (4289)
T ss_pred ccccccccccCcccCCCEecCCCCCceEEeCcccccCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCceeec
Confidence 44344789999999999999776 67999999999999999999999999999999999999999999999999999996
Q ss_pred CccCCCCCCCCCCCCeEeeCCCceeeecCCCCccCCcccC
Q psy17084 679 GKLNACKSNPCKNGGTCVNTGDLYSCICKEGFVHALLFTR 718 (721)
Q Consensus 679 ~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~c~~~ 718 (721)
..+++|+.++|.++|+|+|..|+|.|.|.+||.|..|...
T Consensus 3940 ~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~ 3979 (4289)
T KOG1219|consen 3940 RGISECSKNVCGTGGQCINIPGSFHCNCTPGILGRTCCAE 3979 (4289)
T ss_pred ccccccccccccCCceeeccCCceEeccChhHhcccCccc
Confidence 5599999999999999999999999999999999987543
No 9
>KOG1214|consensus
Probab=99.46 E-value=7.1e-13 Score=143.91 Aligned_cols=151 Identities=30% Similarity=0.786 Sum_probs=118.4
Q ss_pred CCCCCCEeccCCCCceeEeCCCCCc--cCccCCCCCCCCCC--CCCCCCeeeecCCceEEecCCCCC--CCccCccCCCC
Q psy17084 18 PCQNGGTCENTAPDQYLCTCPEGFS--GINCEVVDNPCMMG--PCGNGGQCKETAGQFQCVCAPGWT--GPTCKIKHNFF 91 (721)
Q Consensus 18 ~C~~~g~C~~~~~~~~~C~C~~G~~--G~~C~~~~~~C~~~--~C~~~g~C~~~~g~~~C~C~~Gy~--G~~C~~~~~~~ 91 (721)
-|.-+..|...+...|+|.|..||. |.+|. ++++|+.. .|+.+.+|++..|+|+|.|..||. ++
T Consensus 701 ~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~-d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd--------- 770 (1289)
T KOG1214|consen 701 MCDTTARCHPGTGVDYTCECSSGYQGDGRNCV-DENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADD--------- 770 (1289)
T ss_pred ccCCCccccCCCCcceEEEEeeccCCCCCCCC-ChhhhccCCCCCCCCceeecCCCceeEEEeecceeccC---------
Confidence 4666777876444579999999997 57886 67889765 499999999999999999999986 22
Q ss_pred CcccccccccCccccccCCCCCCCC-CCCCCCC--eeccCCCCceeeeCCCCCcCCcccccCCCCCCCCCCCCCeeeecC
Q psy17084 92 PYLQFIPLTTSLFRLLSDLNYCGTH-EPCQNGG--TCENTAPDQYLCTCPEGFSGINCEVVDNPCMMGPCGNGGQCKETA 168 (721)
Q Consensus 92 ~~~~c~c~~~~~~~~~~~~~~C~~~-~~C~~~g--~C~~~~~~~~~C~C~~Gy~G~~C~~~~~~C~~~~C~~~~~C~~~~ 168 (721)
.+.|..+.+ ...++.|+.. ..|...+ .|+....++|+|.|.|||.|+
T Consensus 771 -~~tCV~i~~-----pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD------------------------ 820 (1289)
T KOG1214|consen 771 -RHTCVLITP-----PAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGD------------------------ 820 (1289)
T ss_pred -CcceEEecC-----CCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCC------------------------
Confidence 122222211 1245667764 6676554 666666778999999999983
Q ss_pred CceeeccCCCCCCCCCCCCCceeeCCCCeEeeCCCCCccCc
Q psy17084 169 GQFQCVDHDHCNPNPCLNGAPCFNTQADYYCHCTEDWEGKN 209 (721)
Q Consensus 169 ~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~ 209 (721)
++.|.++|+|.++.|..+|.|+++.++|.|.|.+||+|+.
T Consensus 821 -G~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 821 -GHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred -ccccccccccCccccCCCceEecCCCcceeecccCccCCC
Confidence 5788899999999999999999999999999999999964
No 10
>KOG1214|consensus
Probab=99.35 E-value=1.4e-11 Score=134.09 Aligned_cols=218 Identities=25% Similarity=0.629 Sum_probs=133.0
Q ss_pred CCCCCeeeecCC-ceEEecCCCCCCCccCccCCCCCcccccccccCccccccCCCCCCC-CCCCCCCCeeccCCCCceee
Q psy17084 58 CGNGGQCKETAG-QFQCVCAPGWTGPTCKIKHNFFPYLQFIPLTTSLFRLLSDLNYCGT-HEPCQNGGTCENTAPDQYLC 135 (721)
Q Consensus 58 C~~~g~C~~~~g-~~~C~C~~Gy~G~~C~~~~~~~~~~~c~c~~~~~~~~~~~~~~C~~-~~~C~~~g~C~~~~~~~~~C 135 (721)
|..++.|....+ .|+|.|..||.|+ .+...|+++|+. ...|..++.|++ .+++|.|
T Consensus 702 cdt~a~C~pg~~~~~tcecs~g~~gd---------------------gr~c~d~~eca~~~~~CGp~s~Cin-~pg~~rc 759 (1289)
T KOG1214|consen 702 CDTTARCHPGTGVDYTCECSSGYQGD---------------------GRNCVDENECATGFHRCGPNSVCIN-LPGSYRC 759 (1289)
T ss_pred cCCCccccCCCCcceEEEEeeccCCC---------------------CCCCCChhhhccCCCCCCCCceeec-CCCceeE
Confidence 444555655443 3666666666543 233367788876 467999999999 7899999
Q ss_pred eCCCCCcC----CcccccCCCCCCCCCCCCCeeeecCCceeeccCCCCCC--CCCCCCC--ceeeC-CCCeEeeCCCCCc
Q psy17084 136 TCPEGFSG----INCEVVDNPCMMGPCGNGGQCKETAGQFQCVDHDHCNP--NPCLNGA--PCFNT-QADYYCHCTEDWE 206 (721)
Q Consensus 136 ~C~~Gy~G----~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~~~~~C~~--~~C~~~~--~C~~~-~g~~~C~C~~G~~ 206 (721)
.|..||.- .+|....++= .++.|.. +.|.-.+ .|+.. .++|+|.|.+||.
T Consensus 760 eC~~gy~F~dd~~tCV~i~~pa---------------------p~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfs 818 (1289)
T KOG1214|consen 760 ECRSGYEFADDRHTCVLITPPA---------------------PANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFS 818 (1289)
T ss_pred EEeecceeccCCcceEEecCCC---------------------CCCccccCccccCcCCceEEEecCCceEEEeecCCcc
Confidence 99999863 2333222221 1223322 2344433 34433 3579999999999
Q ss_pred cCcCCCCCccCCCCCCCCCCCCCCCCCCCCCEEeeCCCCeEeecCCCCccCCCc--cC---CCCCCCC---C--CCCCCc
Q psy17084 207 GKNCSFPRYKCDNPPCDDIDECVSNPCQNGGTCVDLVDGYKCECTQAWEGSNCQ--YD---ADECQKS---P--CVNAAL 276 (721)
Q Consensus 207 G~~C~~~~~~c~~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~--~~---~~~C~~~---~--C~~~~~ 276 (721)
| |+..|.|+|||.++.|..+++|.++.++|.|.|.+||.|+.-+ .+ ...|... | |....+
T Consensus 819 G----------DG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er~hpl~chg~t~ 888 (1289)
T KOG1214|consen 819 G----------DGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQERFHPLQCHGSTG 888 (1289)
T ss_pred C----------CccccccccccCccccCCCceEecCCCcceeecccCccCCCceecCCCccCCccccccccceeeccccc
Confidence 9 5677889999999999999999999999999999999987433 11 1223221 1 222222
Q ss_pred ceEe-cCCCcEEeCCCCCcc---CCCCcCCCCCCCCCCCCCEEeeCC--C-CeeeecCC
Q psy17084 277 GCTN-LVGDYRCNCSPGWTG---HNCDVNINDCVGQCRHGSTCIDLV--N-DFHCACLP 328 (721)
Q Consensus 277 ~C~~-~~g~~~C~C~~G~~G---~~C~~~~~~c~~~C~~~~~C~~~~--~-~~~C~C~~ 328 (721)
-|.. ....|++.+.++-.| .+|......-...|..++.+..+. | ++.|.|..
T Consensus 889 ~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~~~vp~Cd~hgh~ap~qchG~~~~CwCvd 947 (1289)
T KOG1214|consen 889 FCWCVDPDGHEVPGTQTPPGSTPPHCGPSPEQYVPQCDDHGHFAPLQCHGKSDFCWCVD 947 (1289)
T ss_pred eeEeeCCCcccCCCCCCCCCCCCCCCCCcccccCCCccccccccccccCCCcceeEEec
Confidence 2221 234567776666555 345422111113455555554432 2 36677765
No 11
>KOG1836|consensus
Probab=99.33 E-value=9e-10 Score=134.34 Aligned_cols=211 Identities=26% Similarity=0.617 Sum_probs=119.7
Q ss_pred ccCCCCCCCcccccCCCCCCCCCCCCCEEecCC--CCeeee-cCCCcccccccCCCCCCCCceeeCCCCccCCccccC--
Q psy17084 491 SCDAGFSGKYCHENINDCKHNPCQNGGTCVDKV--NSFQCI-CRDGWEGEICANSNQSGGSFKCSCDAGFSGKYCHEN-- 565 (721)
Q Consensus 491 ~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~--g~~~C~-C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~C~~~-- 565 (721)
+|..||+|..-......|.+-+|.+++.|.... ....|. |++||+|..|+. |..||+|..=..+
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~-----------c~dgyfg~p~~~~~~ 828 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE-----------CADGYFGNPLGHDGD 828 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCccccccc-----------CCCccccCCCCCCCC
Confidence 577888876543333348888888888886553 567899 999999999997 8899987643222
Q ss_pred CCCCCCCCCCCC------CEE----------eecCCCeEE-eccCCCccCccc-ccCCCCCCCCCCCC------cEEeeC
Q psy17084 566 INDCKHNPCQNG------GTC----------VDKVNSFQC-ICRDGWEGEICA-NNKNECEPNPCKNN------GTCIDG 621 (721)
Q Consensus 566 ~~~C~~~~C~~~------g~C----------~~~~~~~~C-~C~~G~~G~~C~-~~~~~C~~~~C~~~------g~C~~~ 621 (721)
...|.+-+|..+ +.| +....+..| .|.+||.|+.=. .+.+.|....|..- .+|..
T Consensus 829 ~~~c~~c~c~~n~dp~~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~- 907 (1705)
T KOG1836|consen 829 VRPCQSCQCNFNVDPNAFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNP- 907 (1705)
T ss_pred cccCccceeccccCccccccccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCC-
Confidence 223433333221 122 222223333 477777776433 22233332222221 23432
Q ss_pred CCCeEEecCCCCccccccCC---------CCCCCCCCCCCC----CeeeecCCCceeeCCCCCCCCccccCc-------c
Q psy17084 622 HADFTCLCKNGWKGKTCTSK---------NGHCDRGTCKHG----GTCADLGSSFFCHCPPDWEGTSCHIGK-------L 681 (721)
Q Consensus 622 ~~~~~C~C~~G~~G~~C~~~---------~~~C~~~~C~~~----~~C~~~~~~~~C~C~~G~~G~~C~~~~-------~ 681 (721)
-...|.|++.-.|..|... ...|.+-+|..- ..|.. ++.+|.|.+|-+|.+|.... +
T Consensus 908 -~tGQcec~~~v~g~~c~~c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~--~tGqc~c~~gVtgqrc~qc~~~~~~~~~ 984 (1705)
T KOG1836|consen 908 -VTGQCECKPNVEGRDCLYCFKGFFNLNSGVGCEPCNCDPTGSESSDCDV--GTGQCYCRPGVTGQRCDQCETYHFGFQT 984 (1705)
T ss_pred -cccceeccCCCCccccccccccccccCCCCCcccccccccccccccccc--cCCceeeecCccccccCccccCcccccc
Confidence 2346777777766655311 112333334322 24543 34579999999998776211 1
Q ss_pred CCCCCCCCCCCC----eEeeCCCceeeecCCCCccCCcccC
Q psy17084 682 NACKSNPCKNGG----TCVNTGDLYSCICKEGFVHALLFTR 718 (721)
Q Consensus 682 ~~C~~~~C~~~~----~C~~~~~~~~C~C~~G~~g~~c~~~ 718 (721)
.-|..-.|...| +|....| +|.|++||.|..|..+
T Consensus 985 ~gc~~c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~c~~c 1023 (1705)
T KOG1836|consen 985 EGCGLCECDPLGSRGFQCDPEDG--QCPCRPGFEGRRCDQC 1023 (1705)
T ss_pred cCCcceecccCCcccceecccCC--eeeecCCCCCcccccc
Confidence 223333355555 6776444 7999999998766543
No 12
>KOG1836|consensus
Probab=99.32 E-value=7.5e-10 Score=134.98 Aligned_cols=244 Identities=28% Similarity=0.601 Sum_probs=148.3
Q ss_pred eeecCCCCcCCCccc-------------CCCccCCCCCCCC-CEEeeCCCCceEecCCCCcCCccCCCCeeeccCCcccc
Q psy17084 323 HCACLPGYTGRTCQT-------------DINDCESSPCVNG-GECVDQVNGFRCICPVGFAGQLCENGGTCVNTGDLYSC 388 (721)
Q Consensus 323 ~C~C~~Gy~G~~C~~-------------~~~~C~~~~C~~~-g~C~~~~~~~~C~C~~gy~g~~C~~~~~C~~~~~~~~C 388 (721)
.|.|++||+|+.|+. +...|.+.+|..+ .+|....| .|.|.+.-.|..|+
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG--~C~C~~~t~G~~C~-------------- 759 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTG--QCKCKHNTFGGQCA-------------- 759 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCC--ceecccCCCCCchh--------------
Confidence 399999999998873 1122333334333 35655444 78899888888887
Q ss_pred ccCCCCCCCCCCcCCCCCCCCCCCCCCEEeeC--CCceEEe-cCCCccCCCCcccCcccCCCCCCCCceeeccCCCCCCC
Q psy17084 389 ICKEGFEGPDCGQDINDCSPQPCYNGGKCVDG--VNWFLCE-CAPGFAGPDCRININECASNPCGYGKEILTVQSRSRSP 465 (721)
Q Consensus 389 ~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~--~g~~~C~-C~~Gy~G~~C~~~~~~C~~~~C~~g~~~~~~~~~~~~~ 465 (721)
+|..||+|..-.....+|.+.+|.+.+.|... .....|. |++||+|.+|+...+-..-+|=.+......++......
T Consensus 760 ~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c~~ 839 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQCNF 839 (1705)
T ss_pred hhcCCCCCccccCCCCCCccCCCCCChhhcCcCcccceecCCCCCCCcccccccCCCccccCCCCCCCCcccCccceecc
Confidence 79999999876555555899999998888644 4567899 99999999998655444333322221111111100000
Q ss_pred ------CccccCCCccccceeeecCCCceEe-ccCCCCCCCccc-ccCCCCCCCCCCC------CCEEecCCCCeeeecC
Q psy17084 466 ------AYLCVSNPAYLVARCVSQSGGSFKC-SCDAGFSGKYCH-ENINDCKHNPCQN------GGTCVDKVNSFQCICR 531 (721)
Q Consensus 466 ------~~~C~~~~~~~~~~C~~~~~~~~~C-~C~~G~~G~~C~-~~~~~C~~~~C~~------~g~C~~~~g~~~C~C~ 531 (721)
...|... ....-.|+.++.+ ..| .|.+||+|+.-. ...+.|....|.. ..+|....| +|.|.
T Consensus 840 n~dp~~~g~c~~~-tg~c~~ci~nT~g-~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tG--Qcec~ 915 (1705)
T KOG1836|consen 840 NVDPNAFGNCNRL-TGECLKCIHNTAG-EYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTG--QCECK 915 (1705)
T ss_pred ccCcccccccccc-ccceeeccCCccc-ccccccccCccccccCCCcCCccccccCccCCcccccccCCCccc--ceecc
Confidence 0011110 0011233444444 334 688999887543 2223343322322 235776766 89999
Q ss_pred CCcccccccCCCCCCCCceeeCCCCccCCccccCCCCCCCCCCCC----CCEEeecCCCeEEeccCCCccCcccc
Q psy17084 532 DGWEGEICANSNQSGGSFKCSCDAGFSGKYCHENINDCKHNPCQN----GGTCVDKVNSFQCICRDGWEGEICAN 602 (721)
Q Consensus 532 ~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~----~g~C~~~~~~~~C~C~~G~~G~~C~~ 602 (721)
+.-.|..|.. |.+||++..- -..|..-.|.. ...|....| +|.|.+|.+|.+|..
T Consensus 916 ~~v~g~~c~~-----------c~~g~fnl~s---~~gC~~c~c~~~gs~~~~c~~~tG--qc~c~~gVtgqrc~q 974 (1705)
T KOG1836|consen 916 PNVEGRDCLY-----------CFKGFFNLNS---GVGCEPCNCDPTGSESSDCDVGTG--QCYCRPGVTGQRCDQ 974 (1705)
T ss_pred CCCCcccccc-----------ccccccccCC---CCCcccccccccccccccccccCC--ceeeecCccccccCc
Confidence 9999999987 8889987651 12233333432 236665554 899999999998863
No 13
>KOG1225|consensus
Probab=99.31 E-value=2.1e-11 Score=132.04 Aligned_cols=131 Identities=35% Similarity=0.981 Sum_probs=90.8
Q ss_pred eeeCCCCccCCccccCCCCCCCCCCCCCCEEeecCCCeEEeccCCCccCcccccCCCCCCCCCCCCcEEeeCCCCeEEec
Q psy17084 550 KCSCDAGFSGKYCHENINDCKHNPCQNGGTCVDKVNSFQCICRDGWEGEICANNKNECEPNPCKNNGTCIDGHADFTCLC 629 (721)
Q Consensus 550 ~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C 629 (721)
.|.|..+|+|..|. ...|..+ |...+.|++. +|+|++||+|..|+. -.|... |+.++.+++. +|+|
T Consensus 235 ic~c~~~~~g~~c~--~~~C~~~-c~~~g~c~~G----~CIC~~Gf~G~dC~e--~~Cp~~-cs~~g~~~~g----~CiC 300 (525)
T KOG1225|consen 235 ICECPEGYFGPLCS--TIYCPGG-CTGRGQCVEG----RCICPPGFTGDDCDE--LVCPVD-CSGGGVCVDG----ECIC 300 (525)
T ss_pred eeecCCceeCCccc--cccCCCC-CcccceEeCC----eEeCCCCCcCCCCCc--ccCCcc-cCCCceecCC----Eeec
Confidence 67777778777775 2234333 5566777766 688888888888864 235444 7777777755 7888
Q ss_pred CCCCccccccCCCCCCCCCCCCCCCeeeecCCCceeeCCCCCCCCccccCccCCCCCCCCCCCCeEeeCCCceeeecCCC
Q psy17084 630 KNGWKGKTCTSKNGHCDRGTCKHGGTCADLGSSFFCHCPPDWEGTSCHIGKLNACKSNPCKNGGTCVNTGDLYSCICKEG 709 (721)
Q Consensus 630 ~~G~~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G 709 (721)
++||.|..|+... | +..|.++|.|+ ...|.|.+||+|..|+.. +|.+++.|++. |.|.+|
T Consensus 301 ~~g~~G~dCs~~~--c-padC~g~G~Ci----~G~C~C~~Gy~G~~C~~~--------~C~~~g~cv~g-----C~C~~G 360 (525)
T KOG1225|consen 301 NPGYSGKDCSIRR--C-PADCSGHGKCI----DGECLCDEGYTGELCIQR--------ACSGGGQCVNG-----CKCKKG 360 (525)
T ss_pred CCCcccccccccc--C-CccCCCCCccc----CCceEeCCCCcCCccccc--------ccCCCceeccC-----ceeccC
Confidence 8888888886543 3 35688888887 224888888888887732 37777888752 888888
Q ss_pred CccCC
Q psy17084 710 FVHAL 714 (721)
Q Consensus 710 ~~g~~ 714 (721)
|.|..
T Consensus 361 w~G~d 365 (525)
T KOG1225|consen 361 WRGPD 365 (525)
T ss_pred ccCCC
Confidence 88765
No 14
>KOG1225|consensus
Probab=99.28 E-value=5.1e-11 Score=129.16 Aligned_cols=132 Identities=39% Similarity=1.125 Sum_probs=88.6
Q ss_pred eEeccCCCCCCCcccccCCCCCCCCCCCCCEEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCccCCccccCCC
Q psy17084 488 FKCSCDAGFSGKYCHENINDCKHNPCQNGGTCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSGKYCHENIN 567 (721)
Q Consensus 488 ~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~C~~~~~ 567 (721)
+.|.|+.+|+|..|.. ..| +..|..++.|++. +|+|++||+|..|+..
T Consensus 234 ~ic~c~~~~~g~~c~~--~~C-~~~c~~~g~c~~G----~CIC~~Gf~G~dC~e~------------------------- 281 (525)
T KOG1225|consen 234 GICECPEGYFGPLCST--IYC-PGGCTGRGQCVEG----RCICPPGFTGDDCDEL------------------------- 281 (525)
T ss_pred ceeecCCceeCCcccc--ccC-CCCCcccceEeCC----eEeCCCCCcCCCCCcc-------------------------
Confidence 4799999999998862 334 3347777889876 7888888888776542
Q ss_pred CCCCCCCCCCCEEeecCCCeEEeccCCCccCcccccCCCCCCCCCCCCcEEeeCCCCeEEecCCCCccccccCCCCCCCC
Q psy17084 568 DCKHNPCQNGGTCVDKVNSFQCICRDGWEGEICANNKNECEPNPCKNNGTCIDGHADFTCLCKNGWKGKTCTSKNGHCDR 647 (721)
Q Consensus 568 ~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~~~~~C~~ 647 (721)
.|... |+.++.+++. +|+|++||+|+.|+... | +.+|+++|.|+++ +|+|.+||+|..|+..
T Consensus 282 ~Cp~~-cs~~g~~~~g----~CiC~~g~~G~dCs~~~--c-padC~g~G~Ci~G----~C~C~~Gy~G~~C~~~------ 343 (525)
T KOG1225|consen 282 VCPVD-CSGGGVCVDG----ECICNPGYSGKDCSIRR--C-PADCSGHGKCIDG----ECLCDEGYTGELCIQR------ 343 (525)
T ss_pred cCCcc-cCCCceecCC----EeecCCCcccccccccc--C-CccCCCCCcccCC----ceEeCCCCcCCccccc------
Confidence 23333 6666666655 67777777777775432 3 3557777777743 6777777777777543
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCCCCc
Q psy17084 648 GTCKHGGTCADLGSSFFCHCPPDWEGTS 675 (721)
Q Consensus 648 ~~C~~~~~C~~~~~~~~C~C~~G~~G~~ 675 (721)
.|.+++.|++ + |.|..||.|..
T Consensus 344 -~C~~~g~cv~---g--C~C~~Gw~G~d 365 (525)
T KOG1225|consen 344 -ACSGGGQCVN---G--CKCKKGWRGPD 365 (525)
T ss_pred -ccCCCceecc---C--ceeccCccCCC
Confidence 2666667763 2 77777777766
No 15
>KOG4260|consensus
Probab=98.91 E-value=1.1e-09 Score=105.27 Aligned_cols=161 Identities=29% Similarity=0.692 Sum_probs=109.8
Q ss_pred CCCCCccCccCCCCCCCCCCCCCCCCeeeec---CCceEEecCCCCCCCccCccCCCCCcccccccccCccccccC--CC
Q psy17084 37 CPEGFSGINCEVVDNPCMMGPCGNGGQCKET---AGQFQCVCAPGWTGPTCKIKHNFFPYLQFIPLTTSLFRLLSD--LN 111 (721)
Q Consensus 37 C~~G~~G~~C~~~~~~C~~~~C~~~g~C~~~---~g~~~C~C~~Gy~G~~C~~~~~~~~~~~c~c~~~~~~~~~~~--~~ 111 (721)
|++|.+|.+|.. -+.=...||..+|.|.-. .|+-.|.|.+||+|..|. .|. ++|+-...+ ..
T Consensus 132 Cp~gtyGpdCl~-Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~-----------~Cg-~eyfes~Rne~~l 198 (350)
T KOG4260|consen 132 CPDGTYGPDCLQ-CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCR-----------YCG-IEYFESSRNEQHL 198 (350)
T ss_pred cCCCCcCCcccc-CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcccc-----------ccc-hHHHHhhcccccc
Confidence 899999999973 111134589999999742 356689999999999886 233 444432211 11
Q ss_pred CCCC-CCCCCCCCeeccCCCCceeee-CCCCCcCCcccccCCCCCCCCCCCCCeeeecCCceeeccCCCCCC--CCCCCC
Q psy17084 112 YCGT-HEPCQNGGTCENTAPDQYLCT-CPEGFSGINCEVVDNPCMMGPCGNGGQCKETAGQFQCVDHDHCNP--NPCLNG 187 (721)
Q Consensus 112 ~C~~-~~~C~~~g~C~~~~~~~~~C~-C~~Gy~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~~~~~C~~--~~C~~~ 187 (721)
.|.. +.+|. ++|.. ++.-.|. |..||... ...|+|||||.. .||...
T Consensus 199 vCt~Ch~~C~--~~Csg--~~~k~C~kCkkGW~ld-------------------------e~gCvDvnEC~~ep~~c~~~ 249 (350)
T KOG4260|consen 199 VCTACHEGCL--GVCSG--ESSKGCSKCKKGWKLD-------------------------EEGCVDVNECQNEPAPCKAH 249 (350)
T ss_pred hhhhhhhhhh--cccCC--CCCCChhhhcccceec-------------------------ccccccHHHHhcCCCCCChh
Confidence 2321 12233 24543 3344574 89999763 134889999964 679999
Q ss_pred CceeeCCCCeEeeCCCCCccCcCCCCCccCCCCCCCCCCCCCC--CCC-CCCCEEeeCCCCeEeecCCCCc
Q psy17084 188 APCFNTQADYYCHCTEDWEGKNCSFPRYKCDNPPCDDIDECVS--NPC-QNGGTCVDLVDGYKCECTQAWE 255 (721)
Q Consensus 188 ~~C~~~~g~~~C~C~~G~~G~~C~~~~~~c~~~~C~~~~~C~~--~~C-~~~~~C~~~~g~~~C~C~~G~~ 255 (721)
..|+|+.|||.|...+||.+ ++|+|.. ..| ..+..|.++.+.|+|+|..|+.
T Consensus 250 qfCvNteGSf~C~dk~Gy~~----------------g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 250 QFCVNTEGSFKCEDKEGYKK----------------GVDECQFCADVCASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred heeecCCCceEecccccccC----------------ChHHhhhhhhhcccCCCCcccCCccEEEEecccce
Confidence 99999999999999999986 2355543 233 2456788999999999988874
No 16
>KOG4260|consensus
Probab=98.74 E-value=1.6e-08 Score=97.48 Aligned_cols=147 Identities=27% Similarity=0.723 Sum_probs=91.3
Q ss_pred CCCCCCCeecc--CCCCceeeeCCCCCcCCcccccCCCCCCCCCCCCCeeeecCCceeecc-CCCCCCCCCCCCCceeeC
Q psy17084 117 EPCQNGGTCEN--TAPDQYLCTCPEGFSGINCEVVDNPCMMGPCGNGGQCKETAGQFQCVD-HDHCNPNPCLNGAPCFNT 193 (721)
Q Consensus 117 ~~C~~~g~C~~--~~~~~~~C~C~~Gy~G~~C~~~~~~C~~~~C~~~~~C~~~~~~~~C~~-~~~C~~~~C~~~~~C~~~ 193 (721)
.+|..+|.|.- +..|+..|.|.+||+|..|..-..+=... ..+.....|.. ...|. +.|..
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes--------~Rne~~lvCt~Ch~~C~-------~~Csg- 213 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFES--------SRNEQHLVCTACHEGCL-------GVCSG- 213 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHh--------hcccccchhhhhhhhhh-------cccCC-
Confidence 34555555542 24588999999999999886311100000 00000111110 00111 11211
Q ss_pred CCCeEe-eCCCCCccCcCCCCCccCCCCCCCCCCCCC--CCCCCCCCEEeeCCCCeEeecCCCCccCCCccCCCCCCC--
Q psy17084 194 QADYYC-HCTEDWEGKNCSFPRYKCDNPPCDDIDECV--SNPCQNGGTCVDLVDGYKCECTQAWEGSNCQYDADECQK-- 268 (721)
Q Consensus 194 ~g~~~C-~C~~G~~G~~C~~~~~~c~~~~C~~~~~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~C~~-- 268 (721)
.++-.| .|..||..+ ...|+|||||. +.||..+..|+|+.|+|+|...+||.+ ++|+|+.
T Consensus 214 ~~~k~C~kCkkGW~ld----------e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~-----g~d~C~~~~ 278 (350)
T KOG4260|consen 214 ESSKGCSKCKKGWKLD----------EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK-----GVDECQFCA 278 (350)
T ss_pred CCCCChhhhcccceec----------ccccccHHHHhcCCCCCChhheeecCCCceEecccccccC-----ChHHhhhhh
Confidence 112223 589999874 45789999998 578999999999999999999999986 3566664
Q ss_pred CCCCCCCcceEecCCCcEEeCCCCCc
Q psy17084 269 SPCVNAALGCTNLVGDYRCNCSPGWT 294 (721)
Q Consensus 269 ~~C~~~~~~C~~~~g~~~C~C~~G~~ 294 (721)
..|......|.++.+.|+|+|..|+.
T Consensus 279 d~~~~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 279 DVCASKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred hhcccCCCCcccCCccEEEEecccce
Confidence 44444445688888888888887763
No 17
>KOG1226|consensus
Probab=98.49 E-value=7e-07 Score=98.82 Aligned_cols=131 Identities=31% Similarity=0.819 Sum_probs=98.5
Q ss_pred CCCCCCCEEeecCCCeEEeccCCCccCccccc---------CCCCCC----CCCCCCcEEeeCCCCeEEecCCCCc----
Q psy17084 572 NPCQNGGTCVDKVNSFQCICRDGWEGEICANN---------KNECEP----NPCKNNGTCIDGHADFTCLCKNGWK---- 634 (721)
Q Consensus 572 ~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~~---------~~~C~~----~~C~~~g~C~~~~~~~~C~C~~G~~---- 634 (721)
..|+.+|+.+-. +|.|.+||.|+.|+-. .+.|.. .+|+.+|.|+=+ +|+|.+...
T Consensus 467 ~~C~g~G~~~CG----~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CG----qC~C~~~~~~~i~ 538 (783)
T KOG1226|consen 467 ALCHGNGTFVCG----QCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCG----QCVCHKPDNGKIY 538 (783)
T ss_pred cccCCCCcEEec----ceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCC----ceEecCCCCCcee
Confidence 347766776665 6999999999998732 234432 279999999865 799987776
Q ss_pred cccccCCCCCCCCC---CCCCCCeeeecCCCceeeCCCCCCCCccccC-ccCCCCCC---CCCCCCeEeeCCCceeeecC
Q psy17084 635 GKTCTSKNGHCDRG---TCKHGGTCADLGSSFFCHCPPDWEGTSCHIG-KLNACKSN---PCKNGGTCVNTGDLYSCICK 707 (721)
Q Consensus 635 G~~C~~~~~~C~~~---~C~~~~~C~~~~~~~~C~C~~G~~G~~C~~~-~~~~C~~~---~C~~~~~C~~~~~~~~C~C~ 707 (721)
|.+||-+.-.|.+. .|..+|+|. -.+|+|.+||+|..|.+. +.+.|.+. .|+..|+|.=. +|+|.
T Consensus 539 G~fCECDnfsC~r~~g~lC~g~G~C~----CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg----~C~C~ 610 (783)
T KOG1226|consen 539 GKFCECDNFSCERHKGVLCGGHGRCE----CGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECG----RCKCT 610 (783)
T ss_pred eeeeeccCcccccccCcccCCCCeEe----CCcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCC----ceEcC
Confidence 99998777667654 599999987 447999999999998864 45667543 27888888743 68998
Q ss_pred CC-CccCCcccC
Q psy17084 708 EG-FVHALLFTR 718 (721)
Q Consensus 708 ~G-~~g~~c~~~ 718 (721)
+. |+|.+|+..
T Consensus 611 ~~~~sG~~CE~c 622 (783)
T KOG1226|consen 611 DPPYSGEFCEKC 622 (783)
T ss_pred CCCcCcchhhcC
Confidence 87 999998854
No 18
>KOG1226|consensus
Probab=98.42 E-value=1.4e-06 Score=96.52 Aligned_cols=153 Identities=31% Similarity=0.830 Sum_probs=102.6
Q ss_pred CCCCCCCEEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCccCCccccCCCCCC----CCCCCCCCEEeecCCC
Q psy17084 511 NPCQNGGTCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSGKYCHENINDCK----HNPCQNGGTCVDKVNS 586 (721)
Q Consensus 511 ~~C~~~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~C~~~~~~C~----~~~C~~~g~C~~~~~~ 586 (721)
..|+.+|+.+-. +|.|.+||.|..|+- +..-.... + ..+.|. .-+|+..|.|+=.
T Consensus 467 ~~C~g~G~~~CG----~C~C~~G~~G~~CEC------------~~~~~ss~-~-~~~~Cr~~~~~~vCSgrG~C~CG--- 525 (783)
T KOG1226|consen 467 ALCHGNGTFVCG----QCRCDEGWLGKKCEC------------STDELSSS-E-EEDKCRENSDSPVCSGRGDCVCG--- 525 (783)
T ss_pred cccCCCCcEEec----ceecCCCCCCCcccC------------CccccCcH-h-HHhhccCCCCCCCcCCCCcEeCC---
Confidence 446666666554 689999999987763 32222211 0 122232 2279999999876
Q ss_pred eEEeccCCCc----cCcccccCCCCCC---CCCCCCcEEeeCCCCeEEecCCCCcccccc--CCCCCCCCC---CCCCCC
Q psy17084 587 FQCICRDGWE----GEICANNKNECEP---NPCKNNGTCIDGHADFTCLCKNGWKGKTCT--SKNGHCDRG---TCKHGG 654 (721)
Q Consensus 587 ~~C~C~~G~~----G~~C~~~~~~C~~---~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~--~~~~~C~~~---~C~~~~ 654 (721)
+|+|.+... |..|+-+.-.|.. ..|..+|.|.-+ +|+|.+||+|..|+ .+.+.|.+. .|...|
T Consensus 526 -qC~C~~~~~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG 600 (783)
T KOG1226|consen 526 -QCVCHKPDNGKIYGKFCECDNFSCERHKGVLCGGHGRCECG----RCVCNPGWTGSACNCPLSTDTCESSDGQICSGRG 600 (783)
T ss_pred -ceEecCCCCCceeeeeeeccCcccccccCcccCCCCeEeCC----cEEcCCCCccCCCCCCCCCccccCCCCceeCCCc
Confidence 799988776 8899866666654 359999999865 89999999999885 344556532 466777
Q ss_pred eeeecCCCceeeCCCC-CCCCccccCccCCCCCCCCCCCCeEe
Q psy17084 655 TCADLGSSFFCHCPPD-WEGTSCHIGKLNACKSNPCKNGGTCV 696 (721)
Q Consensus 655 ~C~~~~~~~~C~C~~G-~~G~~C~~~~~~~C~~~~C~~~~~C~ 696 (721)
+|. -.+|+|... |.|..|++ ...| ..+|..+..|+
T Consensus 601 ~C~----Cg~C~C~~~~~sG~~CE~--cptc-~~~C~~~~~Cv 636 (783)
T KOG1226|consen 601 TCE----CGRCKCTDPPYSGEFCEK--CPTC-PDPCAENKSCV 636 (783)
T ss_pred eee----CCceEcCCCCcCcchhhc--CCCC-CCcccccccch
Confidence 776 346888876 99999983 2233 33466666665
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.85 E-value=1.2e-05 Score=53.23 Aligned_cols=29 Identities=45% Similarity=1.179 Sum_probs=17.8
Q ss_pred CCCCCCCCCeeeecC-CceEEecCCCCCCC
Q psy17084 54 MMGPCGNGGQCKETA-GQFQCVCAPGWTGP 82 (721)
Q Consensus 54 ~~~~C~~~g~C~~~~-g~~~C~C~~Gy~G~ 82 (721)
.+.||.++|+|++.. ++|+|+|++||+|.
T Consensus 2 ~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 2 SSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 345666666666665 56666666666654
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal, proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.82 E-value=1.2e-05 Score=53.19 Aligned_cols=30 Identities=47% Similarity=1.173 Sum_probs=22.2
Q ss_pred CCCCCCCCCCeEeeCC-CceeeecCCCCccC
Q psy17084 684 CKSNPCKNGGTCVNTG-DLYSCICKEGFVHA 713 (721)
Q Consensus 684 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~ 713 (721)
|.++||.++|+|++.. ++|+|+|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4566777777887776 77778888887775
No 21
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n' is a negatively charged or polar residue [DEQN], 'b' a possibly beta-hydroxylated residue [DN], 'a' an aromatic amino acid, 'C' a cysteine involved in a disulphide bond, and 'x' any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.78 E-value=1.8e-05 Score=56.16 Aligned_cols=32 Identities=41% Similarity=0.998 Sum_probs=28.8
Q ss_pred ccCCCCCC--CCCCCCeEeeCCCceeeecCCCCc
Q psy17084 680 KLNACKSN--PCKNGGTCVNTGDLYSCICKEGFV 711 (721)
Q Consensus 680 ~~~~C~~~--~C~~~~~C~~~~~~~~C~C~~G~~ 711 (721)
|||||+.. .|..+++|+|+.|+|+|+|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 57899754 598899999999999999999998
No 22
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n' is a negatively charged or polar residue [DEQN], 'b' a possibly beta-hydroxylated residue [DN], 'a' an aromatic amino acid, 'C' a cysteine involved in a disulphide bond, and 'x' any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.71 E-value=3e-05 Score=55.03 Aligned_cols=32 Identities=34% Similarity=0.969 Sum_probs=27.8
Q ss_pred CCCCCCCC--CCCCCCeeeecCCceEEecCCCCC
Q psy17084 49 VDNPCMMG--PCGNGGQCKETAGQFQCVCAPGWT 80 (721)
Q Consensus 49 ~~~~C~~~--~C~~~g~C~~~~g~~~C~C~~Gy~ 80 (721)
||+||... +|..+++|+|+.|+|+|.|++||+
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 57888654 598889999999999999999997
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.71 E-value=4.8e-05 Score=52.97 Aligned_cols=34 Identities=47% Similarity=1.304 Sum_probs=18.9
Q ss_pred CCCCC-CCCCCCCeeeecCCceEEecCCCCC-CCcc
Q psy17084 51 NPCMM-GPCGNGGQCKETAGQFQCVCAPGWT-GPTC 84 (721)
Q Consensus 51 ~~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~-G~~C 84 (721)
++|.. .+|.++++|+++.++|+|.|++||+ |..|
T Consensus 3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C 38 (39)
T smart00179 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38 (39)
T ss_pred ccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcC
Confidence 44544 4555555566555556666666665 5444
No 24
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.70 E-value=4.7e-05 Score=53.03 Aligned_cols=36 Identities=47% Similarity=0.982 Sum_probs=28.5
Q ss_pred cCCCCC-CCCCCCCeEeeCCCceeeecCCCCc-cCCcc
Q psy17084 681 LNACKS-NPCKNGGTCVNTGDLYSCICKEGFV-HALLF 716 (721)
Q Consensus 681 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~-g~~c~ 716 (721)
+++|.. .+|.++++|+++.++|.|.|++||+ |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 567776 6788888888888888888888888 77663
No 25
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.37 E-value=0.00011 Score=68.05 Aligned_cols=136 Identities=29% Similarity=0.715 Sum_probs=84.8
Q ss_pred CCCCCEEeecCCCeEEeccCCCcc---CcccccCCCCCC-----CCCCCCcEEeeCC-----CCeEEecCCCCccc--cc
Q psy17084 574 CQNGGTCVDKVNSFQCICRDGWEG---EICANNKNECEP-----NPCKNNGTCIDGH-----ADFTCLCKNGWKGK--TC 638 (721)
Q Consensus 574 C~~~g~C~~~~~~~~C~C~~G~~G---~~C~~~~~~C~~-----~~C~~~g~C~~~~-----~~~~C~C~~G~~G~--~C 638 (721)
|.+ |.-+...+.|.|.|.+||.- +.|+..+ +|.. .+|...++|++.. ..|+|.|.+||+-. .|
T Consensus 8 CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE~kv-~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vC 85 (197)
T PF06247_consen 8 CKN-GYLIQMSNHFECKCNEGFVLKNENTCEEKV-ECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVC 85 (197)
T ss_dssp -BT-EEEEEESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSE
T ss_pred ccC-CEEEEccCceEEEcCCCcEEccccccccce-ecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeE
Confidence 544 67777788899999999974 4565433 4532 4799999998765 46999999999732 34
Q ss_pred cCCCCCCCCCCCCCCCeeeecC---CCceeeCCCCCC---CCccccCccCCCCCCCCCCCCeEeeCCCceeeecCCCCcc
Q psy17084 639 TSKNGHCDRGTCKHGGTCADLG---SSFFCHCPPDWE---GTSCHIGKLNACKSNPCKNGGTCVNTGDLYSCICKEGFVH 712 (721)
Q Consensus 639 ~~~~~~C~~~~C~~~~~C~~~~---~~~~C~C~~G~~---G~~C~~~~~~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g 712 (721)
. ...|....|. .|.|+-.+ ....|+|.-|+. ...|+..-.-+|+. -|..+..|..+.+-|+|.+.+||.+
T Consensus 86 v--p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~L-KCk~nE~CK~~~~~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 86 V--PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSL-KCKENEECKLVDGYYKCVCKEGFPG 161 (197)
T ss_dssp E--EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEEE-TT-EE
T ss_pred c--hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceee-ecCCCcceeeeCcEEEeecCCCCCC
Confidence 2 2345555676 67898433 245899999997 33455444455643 4788899999999999999999987
Q ss_pred CCc
Q psy17084 713 ALL 715 (721)
Q Consensus 713 ~~c 715 (721)
+.=
T Consensus 162 ~~~ 164 (197)
T PF06247_consen 162 DGE 164 (197)
T ss_dssp ETT
T ss_pred CCC
Confidence 643
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.36 E-value=0.00026 Score=48.71 Aligned_cols=34 Identities=47% Similarity=1.313 Sum_probs=19.5
Q ss_pred CCCCC-CCCCCCCeeeecCCceEEecCCCCCCCcc
Q psy17084 51 NPCMM-GPCGNGGQCKETAGQFQCVCAPGWTGPTC 84 (721)
Q Consensus 51 ~~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~G~~C 84 (721)
++|.. .+|.+++.|++..++|+|.|++||+|..|
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 44544 45655556666666666666666665544
No 27
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.28 E-value=0.00014 Score=49.21 Aligned_cols=28 Identities=39% Similarity=0.898 Sum_probs=22.2
Q ss_pred CCCCCCeEeeCCCceeeecCCCCccCCc
Q psy17084 688 PCKNGGTCVNTGDLYSCICKEGFVHALL 715 (721)
Q Consensus 688 ~C~~~~~C~~~~~~~~C~C~~G~~g~~c 715 (721)
.|+.+|+|++++++|+|+|++||+|+--
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 4888999999999999999999999743
No 28
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.26 E-value=0.00038 Score=47.87 Aligned_cols=35 Identities=49% Similarity=1.056 Sum_probs=26.4
Q ss_pred cCCCCC-CCCCCCCeEeeCCCceeeecCCCCccCCc
Q psy17084 681 LNACKS-NPCKNGGTCVNTGDLYSCICKEGFVHALL 715 (721)
Q Consensus 681 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g~~c 715 (721)
+++|.. .+|.+++.|++..++|.|.|++||+|..|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 466765 67777778888888888888888887655
No 29
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.09 E-value=0.00052 Score=41.57 Aligned_cols=24 Identities=38% Similarity=0.906 Sum_probs=20.5
Q ss_pred CeEeeCCCCCccCcCCCCCccCCCCCCCCCCC
Q psy17084 196 DYYCHCTEDWEGKNCSFPRYKCDNPPCDDIDE 227 (721)
Q Consensus 196 ~~~C~C~~G~~G~~C~~~~~~c~~~~C~~~~~ 227 (721)
||+|+|++||+... +++.|+||||
T Consensus 1 sy~C~C~~Gy~l~~--------d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP--------DGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCC--------CCCccccCCC
Confidence 68999999998764 6788999986
No 30
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.88 E-value=0.0014 Score=44.28 Aligned_cols=28 Identities=57% Similarity=1.247 Sum_probs=21.8
Q ss_pred CCCCCCCCeEeeCCCceeeecCCCCccC
Q psy17084 686 SNPCKNGGTCVNTGDLYSCICKEGFVHA 713 (721)
Q Consensus 686 ~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 713 (721)
..+|.++++|+++.+.|+|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4567777888888778888888888877
No 31
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.86 E-value=0.0015 Score=44.07 Aligned_cols=28 Identities=54% Similarity=1.376 Sum_probs=16.7
Q ss_pred CCCCCCCCeeeecCCceEEecCCCCCCC
Q psy17084 55 MGPCGNGGQCKETAGQFQCVCAPGWTGP 82 (721)
Q Consensus 55 ~~~C~~~g~C~~~~g~~~C~C~~Gy~G~ 82 (721)
..+|.++++|++..++|+|.|++||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 3455555666666666666666666654
No 32
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.80 E-value=0.0017 Score=43.79 Aligned_cols=28 Identities=54% Similarity=1.449 Sum_probs=19.3
Q ss_pred CCCCCCCeeeecCCceEEecCCCCCC-Ccc
Q psy17084 56 GPCGNGGQCKETAGQFQCVCAPGWTG-PTC 84 (721)
Q Consensus 56 ~~C~~~g~C~~~~g~~~C~C~~Gy~G-~~C 84 (721)
.+|.++ +|+++.++|+|.|++||+| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 466666 7777777777777777776 444
No 33
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.75 E-value=0.002 Score=43.50 Aligned_cols=31 Identities=45% Similarity=1.060 Sum_probs=22.4
Q ss_pred CCC-CCCCCCCeEeeCCCceeeecCCCCcc-CCc
Q psy17084 684 CKS-NPCKNGGTCVNTGDLYSCICKEGFVH-ALL 715 (721)
Q Consensus 684 C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g-~~c 715 (721)
|.. .+|.++ +|+++.++|+|.|++||.| ..|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 444 567776 7888878888888888887 444
No 34
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.48 E-value=0.0017 Score=44.01 Aligned_cols=27 Identities=33% Similarity=0.989 Sum_probs=22.1
Q ss_pred CCCCCCCEEecCCCCeeeecCCCcccc
Q psy17084 511 NPCQNGGTCVDKVNSFQCICRDGWEGE 537 (721)
Q Consensus 511 ~~C~~~g~C~~~~g~~~C~C~~G~~G~ 537 (721)
..|+.+|+|+++.++|.|+|++||+|+
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccC
Confidence 468999999999999999999999986
No 35
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.38 E-value=0.00064 Score=63.13 Aligned_cols=134 Identities=22% Similarity=0.607 Sum_probs=71.9
Q ss_pred CceeeCCCCeEeeCCCCCccCcCCCCCccCCCCCCCCCCCCCC-----CCCCCCCEEeeCC-----CCeEeecCCCCccC
Q psy17084 188 APCFNTQADYYCHCTEDWEGKNCSFPRYKCDNPPCDDIDECVS-----NPCQNGGTCVDLV-----DGYKCECTQAWEGS 257 (721)
Q Consensus 188 ~~C~~~~g~~~C~C~~G~~G~~C~~~~~~c~~~~C~~~~~C~~-----~~C~~~~~C~~~~-----g~~~C~C~~G~~G~ 257 (721)
+..+.....|.|.|++||...+ ..+|+...+|.. .+|...++|++.. ..|+|.|.+||+-.
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~~---------EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~ 81 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLKN---------ENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILK 81 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEEE---------TTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES
T ss_pred CEEEEccCceEEEcCCCcEEcc---------ccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceee
Confidence 3455566789999999998642 233444555543 5799999999875 56999999999743
Q ss_pred CCccCCCCCCCCCCCCCCcceEec---CCCcEEeCCCCCc---cCCCCcC-CCCCCCCCCCCCEEeeCCCCeeeecCCCC
Q psy17084 258 NCQYDADECQKSPCVNAALGCTNL---VGDYRCNCSPGWT---GHNCDVN-INDCVGQCRHGSTCIDLVNDFHCACLPGY 330 (721)
Q Consensus 258 ~C~~~~~~C~~~~C~~~~~~C~~~---~g~~~C~C~~G~~---G~~C~~~-~~~c~~~C~~~~~C~~~~~~~~C~C~~Gy 330 (721)
.=.--...|....|. .+.|+-. .....|+|.-|+. ...|..+ ...|...|..+..|....+-|+|.+..||
T Consensus 82 ~~vCvp~~C~~~~Cg--~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~ 159 (197)
T PF06247_consen 82 QGVCVPNKCNNKDCG--SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKLVDGYYKCVCKEGF 159 (197)
T ss_dssp SSSEEEGGGSS---T--TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEEETTEEEEEE-TT-
T ss_pred CCeEchhhcCceecC--CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceeeeCcEEEeecCCCC
Confidence 211112345555554 3567543 2234888888886 1122211 12344556666677777777777777777
Q ss_pred cC
Q psy17084 331 TG 332 (721)
Q Consensus 331 ~G 332 (721)
.+
T Consensus 160 ~~ 161 (197)
T PF06247_consen 160 PG 161 (197)
T ss_dssp EE
T ss_pred CC
Confidence 64
No 36
>KOG1218|consensus
Probab=96.34 E-value=0.52 Score=49.74 Aligned_cols=47 Identities=34% Similarity=0.928 Sum_probs=34.0
Q ss_pred eEeccCCCCCCCcccccCCCCC-CCCCCCCCEEecCCCCeeeecCCCccc
Q psy17084 488 FKCSCDAGFSGKYCHENINDCK-HNPCQNGGTCVDKVNSFQCICRDGWEG 536 (721)
Q Consensus 488 ~~C~C~~G~~G~~C~~~~~~C~-~~~C~~~g~C~~~~g~~~C~C~~G~~G 536 (721)
..|.|.+||.|..+......|. ...+.+++.|+...+ .+.+.+.+.+
T Consensus 162 ~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~--~~~~~~~~~~ 209 (316)
T KOG1218|consen 162 GICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTG--SCLCYPGPSG 209 (316)
T ss_pred CceeccCCcccccccccCCCcCCCcccCCCCeeecccc--ccccCCCCcc
Confidence 4688999999998876554455 456777789988766 5666666654
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.31 E-value=0.0043 Score=40.80 Aligned_cols=25 Identities=32% Similarity=0.804 Sum_probs=12.4
Q ss_pred CCCCCCEeccCCCCceeEeCCCCCccCc
Q psy17084 18 PCQNGGTCENTAPDQYLCTCPEGFSGIN 45 (721)
Q Consensus 18 ~C~~~g~C~~~~~~~~~C~C~~G~~G~~ 45 (721)
.|.++|+|+. ...+|+|.+||+|..
T Consensus 7 ~C~~~G~C~~---~~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 7 ICSGHGTCVS---PCGRCVCDSGYTGPD 31 (32)
T ss_pred ccCCCCEEeC---CCCEEECCCCCcCCC
Confidence 3555555553 134555555555544
No 38
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.26 E-value=0.0048 Score=40.59 Aligned_cols=26 Identities=46% Similarity=1.267 Sum_probs=22.7
Q ss_pred CCCCCCeeeecCCceEEecCCCCCCCcc
Q psy17084 57 PCGNGGQCKETAGQFQCVCAPGWTGPTC 84 (721)
Q Consensus 57 ~C~~~g~C~~~~g~~~C~C~~Gy~G~~C 84 (721)
.|+++|+|+...+ +|+|++||+|+.|
T Consensus 7 ~C~~~G~C~~~~g--~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCG--RCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence 4888999998855 7999999999976
No 39
>KOG1218|consensus
Probab=96.25 E-value=0.62 Score=49.16 Aligned_cols=160 Identities=26% Similarity=0.638 Sum_probs=83.6
Q ss_pred CeEeecCCCCccC-CCccCCCCCCCCCCCCCCcceEecCCCcEEeCCCCCccCCCCcCCCCCC--CCCCCCCEEeeCC--
Q psy17084 245 GYKCECTQAWEGS-NCQYDADECQKSPCVNAALGCTNLVGDYRCNCSPGWTGHNCDVNINDCV--GQCRHGSTCIDLV-- 319 (721)
Q Consensus 245 ~~~C~C~~G~~G~-~C~~~~~~C~~~~C~~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~c~--~~C~~~~~C~~~~-- 319 (721)
+..|.|.++|+|. .+.. ..+.. ++. ..+.......+|.+..+|.|..|......-. ..|.....|....
T Consensus 14 ~~~c~c~~~~~g~~~~~~-~~~~~--~~~---~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~~~~~ 87 (316)
T KOG1218|consen 14 SGQCFCDPGYTGRLQCEH-QAVTS--ACS---GICPCEVNSGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKNGGTC 87 (316)
T ss_pred CCceecCCCccccccccC-CCCCc--ccc---ccCCccCCceeEecccccCCCccccccccCCCCCcccCccccCCCCcc
Confidence 4589999999995 2222 11111 111 1111133456889999999988764322111 2233222222211
Q ss_pred CCeeeec-CCCCcCCCcccCCCccCCCCCCCCCEEeeCCCCceEecCCCCcCCccCCCCeeeccCCccccccCCCCCCCC
Q psy17084 320 NDFHCAC-LPGYTGRTCQTDINDCESSPCVNGGECVDQVNGFRCICPVGFAGQLCENGGTCVNTGDLYSCICKEGFEGPD 398 (721)
Q Consensus 320 ~~~~C~C-~~Gy~G~~C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~gy~g~~C~~~~~C~~~~~~~~C~C~~G~~G~~ 398 (721)
......+ ..+|.|..|+. +.++... |.. .+|.+... .|.+..+|.+..|.. ++|+|..
T Consensus 88 ~~~~~~~~~~~~~g~~C~~-~~~~~~~-c~~-~~C~~~~~--~c~~~~~~~~~~C~~----------------~~~~g~~ 146 (316)
T KOG1218|consen 88 VSSTGYCHLNGYEGPQCES-PCPCGDG-CAE-KTCANPRR--ECRCGGGYIGEQCGE----------------ENLVGLK 146 (316)
T ss_pred cCCCCcccCCCCCcccccC-CCCcCCc-ccc-cccCCCcc--ceecCCcCccccccc----------------cCCCCCC
Confidence 1122344 67888888873 3333322 322 44544432 477777777665542 4677777
Q ss_pred CCcCCCCCCCCCCCCCCEEeeCCCceEEecCCCccCCCCcccC
Q psy17084 399 CGQDINDCSPQPCYNGGKCVDGVNWFLCECAPGFAGPDCRINI 441 (721)
Q Consensus 399 C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~C~~~~ 441 (721)
|..+. . + ...+.-..+ .|.|++||.|..+....
T Consensus 147 C~~~c-~-----~--~~~~~~~~~--~c~c~~g~~g~~~~~~~ 179 (316)
T KOG1218|consen 147 CQRDC-Q-----C--TGGCDCKNG--ICTCQPGFVGVFCVESC 179 (316)
T ss_pred ccCCC-C-----C--ccccCCCCC--ceeccCCcccccccccC
Confidence 76543 1 1 112211222 68899999999887543
No 40
>KOG3512|consensus
Probab=95.80 E-value=0.064 Score=56.81 Aligned_cols=94 Identities=22% Similarity=0.492 Sum_probs=55.7
Q ss_pred ccCCCCCCCccc--ccCCCCCCCCCCC----CCEEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCccCCc---
Q psy17084 491 SCDAGFSGKYCH--ENINDCKHNPCQN----GGTCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSGKY--- 561 (721)
Q Consensus 491 ~C~~G~~G~~C~--~~~~~C~~~~C~~----~g~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~--- 561 (721)
.|..||+-+.-. .+.+.|..-.|+. +.+|..+.| +|.|++|.+|..|.. |.+||+-..
T Consensus 375 yCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnr-----------Ca~gyqqsrs~v 441 (592)
T KOG3512|consen 375 YCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNR-----------CAPGYQQSRSPV 441 (592)
T ss_pred cccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCccccccc-----------ccchhhcccCCC
Confidence 578888633211 1223344434443 457887777 999999999999987 888886332
Q ss_pred --cccCCCCCCCCCCCCCCEEeecCCCeEEeccCCCccCcccc
Q psy17084 562 --CHENINDCKHNPCQNGGTCVDKVNSFQCICRDGWEGEICAN 602 (721)
Q Consensus 562 --C~~~~~~C~~~~C~~~g~C~~~~~~~~C~C~~G~~G~~C~~ 602 (721)
|. .++.=....++++.+ .-.+.+.|+.++.|.+++.
T Consensus 442 apci-k~p~~~~~~~~s~ve----~qd~~s~Ck~~~~~~r~n~ 479 (592)
T KOG3512|consen 442 APCI-KIPTDAPTLGSSGVE----PQDQCSKCKASPGGKRLNQ 479 (592)
T ss_pred cCce-ecCCCCccccCCCCc----chhccccCCCCCcceeccc
Confidence 21 111111122444443 1224578999998887753
No 41
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.40 E-value=0.0073 Score=30.92 Aligned_cols=12 Identities=75% Similarity=2.187 Sum_probs=6.3
Q ss_pred EecCCCCCCCcc
Q psy17084 73 CVCAPGWTGPTC 84 (721)
Q Consensus 73 C~C~~Gy~G~~C 84 (721)
|+|++||+|+.|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 566666666543
No 42
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.04 E-value=0.019 Score=34.88 Aligned_cols=11 Identities=45% Similarity=1.485 Sum_probs=6.2
Q ss_pred CeeeecCCCCc
Q psy17084 321 DFHCACLPGYT 331 (721)
Q Consensus 321 ~~~C~C~~Gy~ 331 (721)
+|+|.|++||+
T Consensus 1 sy~C~C~~Gy~ 11 (24)
T PF12662_consen 1 SYTCSCPPGYQ 11 (24)
T ss_pred CEEeeCCCCCc
Confidence 35556666654
No 43
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=94.73 E-value=0.014 Score=29.90 Aligned_cols=8 Identities=50% Similarity=1.493 Sum_probs=2.6
Q ss_pred ecCCCCcc
Q psy17084 628 LCKNGWKG 635 (721)
Q Consensus 628 ~C~~G~~G 635 (721)
+|++||+|
T Consensus 3 ~C~~G~~G 10 (13)
T PF12661_consen 3 QCPPGWTG 10 (13)
T ss_dssp EE-TTEET
T ss_pred cCcCCCcC
Confidence 33333333
No 44
>KOG3512|consensus
Probab=94.64 E-value=0.22 Score=52.89 Aligned_cols=48 Identities=25% Similarity=0.679 Sum_probs=29.6
Q ss_pred ccCCCCCCCCCC--cCCCCCCCCCCCC----CCEEeeCCCceEEecCCCccCCCCc
Q psy17084 389 ICKEGFEGPDCG--QDINDCSPQPCYN----GGKCVDGVNWFLCECAPGFAGPDCR 438 (721)
Q Consensus 389 ~C~~G~~G~~C~--~~~~~C~~~~C~~----~g~C~~~~g~~~C~C~~Gy~G~~C~ 438 (721)
.|.+||+-+.-. .+-..|....|.. +-+|....| +|.|.+|.+|..|.
T Consensus 375 yCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCn 428 (592)
T KOG3512|consen 375 YCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCN 428 (592)
T ss_pred cccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCcccccc
Confidence 588898754321 1122233333332 447876766 89999999998884
No 45
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.38 E-value=0.023 Score=38.49 Aligned_cols=23 Identities=26% Similarity=0.661 Sum_probs=18.6
Q ss_pred CCceeeCCCCeEeeCCCCCccCc
Q psy17084 187 GAPCFNTQADYYCHCTEDWEGKN 209 (721)
Q Consensus 187 ~~~C~~~~g~~~C~C~~G~~G~~ 209 (721)
...|++++++|+|.|++||+...
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE-T
T ss_pred CCCCccCCCceEeECCCCCEECc
Confidence 46899999999999999998754
No 46
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.05 E-value=0.033 Score=37.73 Aligned_cols=22 Identities=41% Similarity=0.821 Sum_probs=17.7
Q ss_pred CCeEeeCCCceeeecCCCCccC
Q psy17084 692 GGTCVNTGDLYSCICKEGFVHA 713 (721)
Q Consensus 692 ~~~C~~~~~~~~C~C~~G~~g~ 713 (721)
...|++++++|+|.|++||+.+
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE-
T ss_pred CCCCccCCCceEeECCCCCEEC
Confidence 3589999999999999999865
No 47
>smart00051 DSL delta serrate ligand.
Probab=92.45 E-value=0.21 Score=38.73 Aligned_cols=47 Identities=23% Similarity=0.617 Sum_probs=32.2
Q ss_pred eEEecCCCCCCCccCccCCCCCcccccccccCccccccCCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCcCCcc
Q psy17084 71 FQCVCAPGWTGPTCKIKHNFFPYLQFIPLTTSLFRLLSDLNYCGTHEPCQNGGTCENTAPDQYLCTCPEGFSGINC 146 (721)
Q Consensus 71 ~~C~C~~Gy~G~~C~~~~~~~~~~~c~c~~~~~~~~~~~~~~C~~~~~C~~~g~C~~~~~~~~~C~C~~Gy~G~~C 146 (721)
|.=.|+++|.|..|+ ..|...+....+.+|.. .+.++|.+||+|.+|
T Consensus 17 ~rv~C~~~~yG~~C~-------------------------~~C~~~~d~~~~~~Cd~----~G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCN-------------------------KFCRPRDDFFGHYTCDE----NGNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccC-------------------------CEeCcCccccCCccCCc----CCCEecCCCCcCCCC
Confidence 344899999999885 12333234455667743 257999999999875
No 48
>smart00051 DSL delta serrate ligand.
Probab=92.28 E-value=0.17 Score=39.18 Aligned_cols=47 Identities=34% Similarity=0.692 Sum_probs=34.7
Q ss_pred ceeEeCCCCCccCccCCCCCCCCC-CCCCCCCeeeecCCceEEecCCCCCCCcc
Q psy17084 32 QYLCTCPEGFSGINCEVVDNPCMM-GPCGNGGQCKETAGQFQCVCAPGWTGPTC 84 (721)
Q Consensus 32 ~~~C~C~~G~~G~~C~~~~~~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~G~~C 84 (721)
.+.=.|+++|.|..|+. .|.. +....+.+|.. .| .++|.+||+|+.|
T Consensus 16 ~~rv~C~~~~yG~~C~~---~C~~~~d~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNK---FCRPRDDFFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCC---EeCcCccccCCccCCc-CC--CEecCCCCcCCCC
Confidence 46667999999999973 3532 23566788865 45 5899999999875
No 49
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=91.37 E-value=0.23 Score=36.58 Aligned_cols=31 Identities=29% Similarity=0.729 Sum_probs=24.8
Q ss_pred EEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCccCCc
Q psy17084 518 TCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSGKY 561 (721)
Q Consensus 518 ~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~ 561 (721)
.|....| +|.|+++|+|..|+. |.+||+|..
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~~-----------C~~g~~~~~ 43 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCDR-----------CAPGYYGLP 43 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCCC-----------CCCCCccCC
Confidence 3554444 899999999999986 899998864
No 50
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=91.10 E-value=0.13 Score=37.70 Aligned_cols=33 Identities=33% Similarity=0.828 Sum_probs=24.9
Q ss_pred CEEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCccCCcc
Q psy17084 517 GTCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSGKYC 562 (721)
Q Consensus 517 g~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G~~C 562 (721)
.+|....| +|.|+++|+|..|+. |.+||++...
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~-----------C~~g~~~~~~ 43 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQ-----------CKPGYFGLPS 43 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-E-----------E-TTEECSTT
T ss_pred CcccCCCC--EEeccccccCCcCcC-----------CCCccccccC
Confidence 36766544 999999999999996 8999988743
No 51
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=89.27 E-value=0.39 Score=34.62 Aligned_cols=29 Identities=34% Similarity=0.896 Sum_probs=23.5
Q ss_pred EEecCCCCeeeecCCCcccccccCCCCCCCCceeeCCCCccC
Q psy17084 518 TCVDKVNSFQCICRDGWEGEICANSNQSGGSFKCSCDAGFSG 559 (721)
Q Consensus 518 ~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~~~~C~C~~G~~G 559 (721)
.|....| +|.|+++|+|..|+. |++||+|
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~-----------C~~g~~g 40 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDR-----------CAPGYYG 40 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCc-----------CCCCcCC
Confidence 4544444 899999999999986 8999998
No 52
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=88.78 E-value=0.3 Score=32.94 Aligned_cols=28 Identities=25% Similarity=0.692 Sum_probs=17.9
Q ss_pred CCCCCCCCCCeeeecC-CceEEecCCCCC
Q psy17084 53 CMMGPCGNGGQCKETA-GQFQCVCAPGWT 80 (721)
Q Consensus 53 C~~~~C~~~g~C~~~~-g~~~C~C~~Gy~ 80 (721)
|....|..++.|++.. |+++|.|..||.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 4455677788888765 888888888886
No 53
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=85.97 E-value=1.2 Score=32.63 Aligned_cols=21 Identities=48% Similarity=1.081 Sum_probs=16.9
Q ss_pred eeeecCCceEEecCCCCCCCccC
Q psy17084 63 QCKETAGQFQCVCAPGWTGPTCK 85 (721)
Q Consensus 63 ~C~~~~g~~~C~C~~Gy~G~~C~ 85 (721)
.|....| +|.|+++|+|..|+
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~ 33 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCD 33 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCC
Confidence 4666556 79999999999987
No 54
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=85.64 E-value=0.53 Score=34.41 Aligned_cols=32 Identities=31% Similarity=0.830 Sum_probs=22.2
Q ss_pred CEEeeCCCCceEecCCCCcCCccCCCCeeeccCCccccccCCCCCCCC
Q psy17084 351 GECVDQVNGFRCICPVGFAGQLCENGGTCVNTGDLYSCICKEGFEGPD 398 (721)
Q Consensus 351 g~C~~~~~~~~C~C~~gy~g~~C~~~~~C~~~~~~~~C~C~~G~~G~~ 398 (721)
..|....+ +|.|+++|+|..|+ +|.+||++..
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~--------------~C~~g~~~~~ 42 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCD--------------QCKPGYFGLP 42 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS---------------EE-TTEECST
T ss_pred CcccCCCC--EEeccccccCCcCc--------------CCCCcccccc
Confidence 35655433 88899999999888 6888888764
No 55
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=84.66 E-value=0.78 Score=45.88 Aligned_cols=39 Identities=28% Similarity=0.534 Sum_probs=31.5
Q ss_pred ceeeccCCCCCCCCCCCCCceeeCCCCeEeeCCCCCccC
Q psy17084 170 QFQCVDHDHCNPNPCLNGAPCFNTQADYYCHCTEDWEGK 208 (721)
Q Consensus 170 ~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~ 208 (721)
+..|.++++|...+......|.++.|+|.|.|.+||+..
T Consensus 181 ~~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCC
Confidence 445888899976554445789999999999999999864
No 56
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=83.86 E-value=0.78 Score=31.02 Aligned_cols=29 Identities=21% Similarity=0.599 Sum_probs=20.4
Q ss_pred CCCCCCCCCCeEeeCC-CceeeecCCCCcc
Q psy17084 684 CKSNPCKNGGTCVNTG-DLYSCICKEGFVH 712 (721)
Q Consensus 684 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g 712 (721)
|...+|..++.|++.. |++.|.|..||..
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~ 31 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKK 31 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccc
Confidence 4456788889999886 8999999999974
No 57
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=81.32 E-value=1.7 Score=31.31 Aligned_cols=22 Identities=32% Similarity=0.975 Sum_probs=18.5
Q ss_pred eEecCCCCcCCccCCCCeeeccCCccccccCCCCCC
Q psy17084 361 RCICPVGFAGQLCENGGTCVNTGDLYSCICKEGFEG 396 (721)
Q Consensus 361 ~C~C~~gy~g~~C~~~~~C~~~~~~~~C~C~~G~~G 396 (721)
+|.|+++++|..|+ .|++||+|
T Consensus 19 ~C~C~~~~~G~~C~--------------~C~~g~~g 40 (46)
T smart00180 19 QCECKPNVTGRRCD--------------RCAPGYYG 40 (46)
T ss_pred EEECCCCCCCCCCC--------------cCCCCcCC
Confidence 78888888888887 67888887
No 58
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=79.00 E-value=2 Score=42.87 Aligned_cols=38 Identities=21% Similarity=0.455 Sum_probs=25.3
Q ss_pred CCCCCCCCCCCCCCCCCCCEEeeCCCCeEeecCCCCcc
Q psy17084 219 NPPCDDIDECVSNPCQNGGTCVDLVDGYKCECTQAWEG 256 (721)
Q Consensus 219 ~~~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G 256 (721)
...|.++++|...+......|.+..|+|.|.|++||+.
T Consensus 181 ~~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 181 GKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 45566778886433222347888888888888888864
No 59
>KOG3516|consensus
Probab=77.25 E-value=1.9 Score=51.37 Aligned_cols=44 Identities=27% Similarity=0.778 Sum_probs=36.6
Q ss_pred eccCCCCCCCCCCCCCCCEeccCCCCceeEeCC-CCCccCccCCCCC
Q psy17084 6 LLSDLNYCGTHEPCQNGGTCENTAPDQYLCTCP-EGFSGINCEVVDN 51 (721)
Q Consensus 6 ~~~~~~~C~~~~~C~~~g~C~~~~~~~~~C~C~-~G~~G~~C~~~~~ 51 (721)
++.-+|.|.. ++|+++|.|.. ....|.|.|. .||.|.+|+..+.
T Consensus 541 ~C~i~drClP-N~CehgG~C~Q-s~~~f~C~C~~TGY~GatCHtsi~ 585 (1306)
T KOG3516|consen 541 MCGISDRCLP-NPCEHGGKCSQ-SWDDFECNCELTGYKGATCHTSIY 585 (1306)
T ss_pred ccccccccCC-ccccCCCcccc-cccceeEeccccccccccccCCCc
Confidence 4455678887 99999999988 6788999999 9999999986554
No 60
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=74.89 E-value=1.3 Score=34.28 Aligned_cols=17 Identities=35% Similarity=0.669 Sum_probs=7.7
Q ss_pred CeEeeCCCCCccCcCCC
Q psy17084 196 DYYCHCTEDWEGKNCSF 212 (721)
Q Consensus 196 ~~~C~C~~G~~G~~C~~ 212 (721)
.++-.|.+.|+|..|+.
T Consensus 16 ~~rv~C~~nyyG~~C~~ 32 (63)
T PF01414_consen 16 RIRVVCDENYYGPNCSK 32 (63)
T ss_dssp -------TTEETTTT-E
T ss_pred EEEEECCCCCCCccccC
Confidence 45568999999988863
No 61
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=70.96 E-value=3.8 Score=35.97 Aligned_cols=39 Identities=21% Similarity=0.491 Sum_probs=25.8
Q ss_pred CccCCCCC---CCCCCCCeEeeC--CCceeeecCCCCccCCcccC
Q psy17084 679 GKLNACKS---NPCKNGGTCVNT--GDLYSCICKEGFVHALLFTR 718 (721)
Q Consensus 679 ~~~~~C~~---~~C~~~~~C~~~--~~~~~C~C~~G~~g~~c~~~ 718 (721)
.++.+|.. +-|.++ +|.-. ...+.|.|..||+|..|+..
T Consensus 40 ~~i~~Cp~ey~~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 40 PAIRLCGPEGDGYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred cccccCChhhCCEeECC-EEEeeccCCCceeECCCCcccccccce
Confidence 44555643 226664 78544 35678888888888888754
No 62
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=70.20 E-value=1.3 Score=34.35 Aligned_cols=14 Identities=29% Similarity=0.918 Sum_probs=4.1
Q ss_pred eEeccCCCCCCCcc
Q psy17084 488 FKCSCDAGFSGKYC 501 (721)
Q Consensus 488 ~~C~C~~G~~G~~C 501 (721)
++-+|.+.|+|..|
T Consensus 17 ~rv~C~~nyyG~~C 30 (63)
T PF01414_consen 17 IRVVCDENYYGPNC 30 (63)
T ss_dssp ------TTEETTTT
T ss_pred EEEECCCCCCCccc
Confidence 34445555555444
No 63
>PHA02887 EGF-like protein; Provisional
Probab=70.16 E-value=4.2 Score=35.08 Aligned_cols=37 Identities=32% Similarity=0.909 Sum_probs=25.9
Q ss_pred CCCCCC--CCCCCCCEeccC-CCCceeEeCCCCCccCccCC
Q psy17084 11 NYCGTH--EPCQNGGTCENT-APDQYLCTCPEGFSGINCEV 48 (721)
Q Consensus 11 ~~C~~~--~~C~~~g~C~~~-~~~~~~C~C~~G~~G~~C~~ 48 (721)
++|.+. +=|- ||+|.-. ......|.|++||+|.+|+.
T Consensus 84 ~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 84 EKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred cccChHhhCEee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 445432 3465 6899752 23568999999999999984
No 64
>PHA02887 EGF-like protein; Provisional
Probab=66.39 E-value=5.8 Score=34.26 Aligned_cols=16 Identities=31% Similarity=1.065 Sum_probs=7.8
Q ss_pred CeEEecCCCCcccccc
Q psy17084 624 DFTCLCKNGWKGKTCT 639 (721)
Q Consensus 624 ~~~C~C~~G~~G~~C~ 639 (721)
...|.|.+||+|.+|+
T Consensus 107 epsCrC~~GYtG~RCE 122 (126)
T PHA02887 107 EKFCICNKGYTGIRCD 122 (126)
T ss_pred CceeECCCCcccCCCC
Confidence 3445555555555443
No 65
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=63.97 E-value=5.8 Score=34.87 Aligned_cols=29 Identities=34% Similarity=0.906 Sum_probs=23.5
Q ss_pred CCCCCCCeeeec--CCceEEecCCCCCCCccC
Q psy17084 56 GPCGNGGQCKET--AGQFQCVCAPGWTGPTCK 85 (721)
Q Consensus 56 ~~C~~~g~C~~~--~g~~~C~C~~Gy~G~~C~ 85 (721)
+-|.+ |+|.-. ...+.|.|..||+|..|+
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGeRCE 81 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGIRCQ 81 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCccccccc
Confidence 34776 589764 467999999999999997
No 66
>KOG3516|consensus
Probab=60.30 E-value=7.5 Score=46.71 Aligned_cols=44 Identities=32% Similarity=0.945 Sum_probs=38.7
Q ss_pred CCCCCCCCCCCCCCCCCEEeeCCCCeEeecC-CCCccCCCccCCC
Q psy17084 221 PCDDIDECVSNPCQNGGTCVDLVDGYKCECT-QAWEGSNCQYDAD 264 (721)
Q Consensus 221 ~C~~~~~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~G~~C~~~~~ 264 (721)
.|.-+|.|.+++|.++|.|......|.|.|. .||.|..|...+.
T Consensus 541 ~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~ 585 (1306)
T KOG3516|consen 541 MCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIY 585 (1306)
T ss_pred ccccccccCCccccCCCcccccccceeEeccccccccccccCCCc
Confidence 4667799999999999999998889999998 8999999986553
No 67
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=58.20 E-value=10 Score=32.99 Aligned_cols=33 Identities=24% Similarity=0.627 Sum_probs=24.8
Q ss_pred CCCCCCCCCCCCCCCEeccCCCCceeEeCCCCCcc
Q psy17084 9 DLNYCGTHEPCQNGGTCENTAPDQYLCTCPEGFSG 43 (721)
Q Consensus 9 ~~~~C~~~~~C~~~g~C~~~~~~~~~C~C~~G~~G 43 (721)
..|.|.....|+.+|+|.. .....|.|.+||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~--~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS--NNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCC--CCCCceECCCCcCC
Confidence 4568887788999999964 34567999998863
No 68
>KOG3514|consensus
Probab=57.52 E-value=6.3 Score=46.60 Aligned_cols=35 Identities=43% Similarity=1.197 Sum_probs=30.8
Q ss_pred CCCCCCCCCCCCEeccCCCCceeEeCC-CCCccCccCC
Q psy17084 12 YCGTHEPCQNGGTCENTAPDQYLCTCP-EGFSGINCEV 48 (721)
Q Consensus 12 ~C~~~~~C~~~g~C~~~~~~~~~C~C~-~G~~G~~C~~ 48 (721)
.|.+ +||+|+|+|.. .++.|.|.|. .||.|..||.
T Consensus 625 ~C~~-nPC~N~g~C~e-gwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICES-NPCQNGGKCSE-GWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCC-CcccCCCCccc-cccccccccccCcccCccccc
Confidence 6887 99999999988 6789999998 6799999984
No 69
>KOG3514|consensus
Probab=55.10 E-value=7.2 Score=46.12 Aligned_cols=35 Identities=49% Similarity=1.147 Sum_probs=32.1
Q ss_pred CCCCCCCCCCCEEeecCCCeEEecc-CCCccCcccc
Q psy17084 568 DCKHNPCQNGGTCVDKVNSFQCICR-DGWEGEICAN 602 (721)
Q Consensus 568 ~C~~~~C~~~g~C~~~~~~~~C~C~-~G~~G~~C~~ 602 (721)
.|..+||.|+|+|...++.|.|.|. .+|.|+.|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 6899999999999999999999996 6899999974
No 70
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=54.61 E-value=18 Score=26.63 Aligned_cols=30 Identities=37% Similarity=0.957 Sum_probs=17.1
Q ss_pred CccCccCCCCCCCCCCCCCCCCeeeecCCceEEecCCCCC
Q psy17084 41 FSGINCEVVDNPCMMGPCGNGGQCKETAGQFQCVCAPGWT 80 (721)
Q Consensus 41 ~~G~~C~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~ 80 (721)
..|..|+. ..+| ..++.|++. +|.|++||+
T Consensus 17 ~~g~~C~~-~~qC-----~~~s~C~~g----~C~C~~g~~ 46 (52)
T PF01683_consen 17 QPGESCES-DEQC-----IGGSVCVNG----RCQCPPGYV 46 (52)
T ss_pred CCCCCCCC-cCCC-----CCcCEEcCC----EeECCCCCE
Confidence 34555653 2333 355677664 577777775
No 71
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=49.42 E-value=10 Score=28.03 Aligned_cols=33 Identities=30% Similarity=0.685 Sum_probs=17.2
Q ss_pred CCCCCCEEe----eCCCceEEecCCCccCCCCcccCc
Q psy17084 410 PCYNGGKCV----DGVNWFLCECAPGFAGPDCRININ 442 (721)
Q Consensus 410 ~C~~~g~C~----~~~g~~~C~C~~Gy~G~~C~~~~~ 442 (721)
+|+.||+-. ...|...|+|..-|.|++|++.+.
T Consensus 18 ~CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS~~~~ 54 (56)
T PF04863_consen 18 SCSGHGRAFLDGLIADGSPVCECNSCYGGPDCSTLIP 54 (56)
T ss_dssp --TTSEE--TTS-EETTEE--EE-TTEESTTS-EE-T
T ss_pred CcCCCCeeeeccccccCCccccccCCcCCCCcccCCC
Confidence 466666652 234557899999999999986543
No 72
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=46.89 E-value=12 Score=24.87 Aligned_cols=12 Identities=33% Similarity=0.993 Sum_probs=9.7
Q ss_pred eeeecCCCCccC
Q psy17084 702 YSCICKEGFVHA 713 (721)
Q Consensus 702 ~~C~C~~G~~g~ 713 (721)
+.|.||+||+.+
T Consensus 18 ~~C~CPeGyIld 29 (34)
T PF09064_consen 18 GQCFCPEGYILD 29 (34)
T ss_pred CceeCCCceEec
Confidence 379999999865
No 73
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=43.40 E-value=23 Score=30.81 Aligned_cols=33 Identities=24% Similarity=0.627 Sum_probs=26.3
Q ss_pred CCCCCCCCCCCCCCCeeccCCCCceeeeCCCCCcC
Q psy17084 109 DLNYCGTHEPCQNGGTCENTAPDQYLCTCPEGFSG 143 (721)
Q Consensus 109 ~~~~C~~~~~C~~~g~C~~~~~~~~~C~C~~Gy~G 143 (721)
..+.|.....|..+|.|.. .....|.|++||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~--~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS--NNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCC--CCCCceECCCCcCC
Confidence 3457888899999999964 34567999999964
No 74
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=41.48 E-value=31 Score=25.32 Aligned_cols=27 Identities=37% Similarity=0.987 Sum_probs=21.4
Q ss_pred CCCCCCCCCCCCCeeccCCCCceeeeCCCCCc
Q psy17084 111 NYCGTHEPCQNGGTCENTAPDQYLCTCPEGFS 142 (721)
Q Consensus 111 ~~C~~~~~C~~~g~C~~~~~~~~~C~C~~Gy~ 142 (721)
..|.....|..++.|++ .+|+|++||.
T Consensus 20 ~~C~~~~qC~~~s~C~~-----g~C~C~~g~~ 46 (52)
T PF01683_consen 20 ESCESDEQCIGGSVCVN-----GRCQCPPGYV 46 (52)
T ss_pred CCCCCcCCCCCcCEEcC-----CEeECCCCCE
Confidence 34666677888899976 4899999986
No 75
>KOG0196|consensus
Probab=35.93 E-value=77 Score=37.21 Aligned_cols=78 Identities=22% Similarity=0.599 Sum_probs=47.0
Q ss_pred CCCCCeeeecCCceEEecCCCCC----CCccCccCCCCCcccccccccCccccccCCCCCCCCCCCCCCCeeccCCCCce
Q psy17084 58 CGNGGQCKETAGQFQCVCAPGWT----GPTCKIKHNFFPYLQFIPLTTSLFRLLSDLNYCGTHEPCQNGGTCENTAPDQY 133 (721)
Q Consensus 58 C~~~g~C~~~~g~~~C~C~~Gy~----G~~C~~~~~~~~~~~c~c~~~~~~~~~~~~~~C~~~~~C~~~g~C~~~~~~~~ 133 (721)
|...|.=....| .|.|++||. |..|+ .|. +|+++.......|.. |..+..= ..+++-
T Consensus 248 C~~dGeWlvpiG--~C~C~aGye~~~~~~~C~-----------aCp-~G~yK~~~~~~~C~~---CP~~S~s--~~ega~ 308 (996)
T KOG0196|consen 248 CSGDGEWLVPIG--GCVCKAGYEEAENGKACQ-----------ACP-PGTYKASQGDSLCLP---CPPNSHS--SSEGAT 308 (996)
T ss_pred EcCCCcEEEEcC--ceeecCCCCcccCCCcce-----------eCC-CCcccCCCCCCCCCC---CCCCCCC--CCCCCC
Confidence 555454444455 599999996 44554 233 777776655556654 5554431 145788
Q ss_pred eeeCCCCCcCCcccccCCCCC
Q psy17084 134 LCTCPEGFSGINCEVVDNPCM 154 (721)
Q Consensus 134 ~C~C~~Gy~G~~C~~~~~~C~ 154 (721)
.|.|..||+-..=+-..-+|.
T Consensus 309 ~C~C~~gyyRA~~Dp~~mpCT 329 (996)
T KOG0196|consen 309 SCTCENGYYRADSDPPSMPCT 329 (996)
T ss_pred cccccCCcccCCCCCCCCCCC
Confidence 999999998543333333443
No 76
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=32.80 E-value=36 Score=29.17 Aligned_cols=25 Identities=32% Similarity=0.796 Sum_probs=15.2
Q ss_pred CCCCCCCcEEeeCC-----CCeEEecCCCC
Q psy17084 609 PNPCKNNGTCIDGH-----ADFTCLCKNGW 633 (721)
Q Consensus 609 ~~~C~~~g~C~~~~-----~~~~C~C~~G~ 633 (721)
.+.|+.||.|+... .=|.|.|.+.+
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~ 41 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTV 41 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeeccc
Confidence 45677777777652 22666666644
No 77
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=32.71 E-value=41 Score=28.85 Aligned_cols=27 Identities=33% Similarity=0.792 Sum_probs=19.8
Q ss_pred CCCCCCCCEEeecC-----CCeEEeccCCCcc
Q psy17084 571 HNPCQNGGTCVDKV-----NSFQCICRDGWEG 597 (721)
Q Consensus 571 ~~~C~~~g~C~~~~-----~~~~C~C~~G~~G 597 (721)
.+.|..||.|+... .=|.|.|.+.+..
T Consensus 12 Tn~CsgHG~C~~~~~~~~~~C~~C~C~~T~~~ 43 (103)
T PF12955_consen 12 TNNCSGHGSCVKKYGSGGGDCFACKCKPTVVK 43 (103)
T ss_pred ccCCCCCceEeeccCCCccceEEEEeeccccc
Confidence 35799999999873 2388888886553
No 78
>KOG3509|consensus
Probab=20.64 E-value=1.6e+02 Score=35.69 Aligned_cols=72 Identities=32% Similarity=0.686 Sum_probs=54.5
Q ss_pred CCCCCCCCCCCCCCEeccCCCCceeEeCCCCCccCccCCCCCCCCCCC-CCCCCeeeecCCceEEecCCCCCCCcc
Q psy17084 10 LNYCGTHEPCQNGGTCENTAPDQYLCTCPEGFSGINCEVVDNPCMMGP-CGNGGQCKETAGQFQCVCAPGWTGPTC 84 (721)
Q Consensus 10 ~~~C~~~~~C~~~g~C~~~~~~~~~C~C~~G~~G~~C~~~~~~C~~~~-C~~~g~C~~~~g~~~C~C~~Gy~G~~C 84 (721)
.++|.. -|++..+.|-. .+-...|.|++||+|..|+...+.+...+ =.-.++|....+.....|.+| .|...
T Consensus 406 g~~c~~-~p~~~~g~c~p-~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~y~~t~~~~~~~~~~~c~pg-~g~~~ 478 (964)
T KOG3509|consen 406 GDVCWR-IPCQHDGPCLQ-TLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGSYLGTCVPIQGKRCEYCGPG-AGAPT 478 (964)
T ss_pred CCcccc-ccCCCCccccc-cccccceeccccccCchhhccCccccccCCccccceEeccCCCcceeecCC-CCCcc
Confidence 346665 68888888876 45778999999999999997777775443 334578888777667899999 76654
Done!