Query psy13144
Match_columns 1031
No_of_seqs 386 out of 2779
Neff 8.3
Searched_HMMs 46136
Date Fri Aug 16 20:40:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13144.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13144hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.7 4.7E-17 1E-21 192.8 18.5 91 8-124 1216-1308(2531)
2 KOG4289|consensus 99.6 1.1E-14 2.3E-19 173.2 18.8 90 533-698 1218-1308(2531)
3 KOG1217|consensus 99.6 1.9E-13 4.2E-18 163.5 25.8 333 418-969 95-436 (487)
4 KOG1217|consensus 99.6 1.6E-13 3.5E-18 164.2 24.0 312 45-486 95-421 (487)
5 KOG1214|consensus 99.5 6.4E-14 1.4E-18 160.2 14.2 202 101-336 699-919 (1289)
6 KOG1214|consensus 99.5 4.4E-13 9.5E-18 153.5 14.7 196 791-1022 715-920 (1289)
7 KOG0994|consensus 99.4 5.1E-10 1.1E-14 132.5 31.9 46 117-162 300-345 (1758)
8 KOG1219|consensus 99.3 8.7E-12 1.9E-16 154.0 8.8 120 874-1026 3869-3989(4289)
9 KOG1219|consensus 99.2 1.1E-11 2.4E-16 153.1 8.4 112 39-187 3864-3977(4289)
10 KOG0994|consensus 99.1 2.2E-09 4.8E-14 127.2 16.9 72 163-270 824-898 (1758)
11 KOG1225|consensus 99.0 2.8E-09 6E-14 123.0 12.7 132 748-955 234-365 (525)
12 KOG1225|consensus 98.9 8.4E-09 1.8E-13 119.1 13.1 132 601-827 255-386 (525)
13 KOG4260|consensus 98.7 1.2E-08 2.5E-13 104.3 6.0 128 16-182 131-270 (350)
14 KOG4260|consensus 98.5 8.8E-08 1.9E-12 98.0 5.4 162 796-1009 132-304 (350)
15 KOG1226|consensus 97.9 6.4E-05 1.4E-09 88.7 10.7 140 734-907 468-620 (783)
16 KOG1226|consensus 97.8 0.00017 3.7E-09 85.2 12.5 137 675-844 466-617 (783)
17 KOG1836|consensus 97.7 0.0017 3.6E-08 85.6 20.0 108 118-269 696-809 (1705)
18 PF07645 EGF_CA: Calcium-bindi 97.6 3.8E-05 8.3E-10 58.3 2.3 34 38-71 1-36 (42)
19 PF00008 EGF: EGF-like domain 97.6 3.6E-05 7.8E-10 54.5 1.7 30 42-71 1-31 (32)
20 PF07645 EGF_CA: Calcium-bindi 97.6 5.4E-05 1.2E-09 57.5 2.7 34 964-1009 1-34 (42)
21 PF00008 EGF: EGF-like domain 97.5 4.2E-05 9E-10 54.2 1.2 30 925-954 1-31 (32)
22 smart00179 EGF_CA Calcium-bind 97.4 0.00018 4E-09 53.4 3.7 34 921-954 1-36 (39)
23 KOG1836|consensus 97.4 0.0055 1.2E-07 81.0 19.5 53 117-187 756-812 (1705)
24 smart00179 EGF_CA Calcium-bind 97.4 0.00019 4.2E-09 53.3 3.8 35 150-186 2-38 (39)
25 PF12947 EGF_3: EGF domain; I 97.2 0.00016 3.4E-09 52.5 1.7 30 985-1014 6-35 (36)
26 PF06247 Plasmod_Pvs28: Plasmo 97.1 0.00011 2.3E-09 72.5 -0.2 139 830-1013 17-164 (197)
27 PF12947 EGF_3: EGF domain; I 97.0 0.00027 5.9E-09 51.3 1.4 30 101-130 5-34 (36)
28 cd00054 EGF_CA Calcium-binding 96.9 0.0012 2.6E-08 48.5 3.7 33 922-954 2-35 (38)
29 cd00054 EGF_CA Calcium-binding 96.9 0.0012 2.7E-08 48.4 3.8 35 150-186 2-37 (38)
30 PF06247 Plasmod_Pvs28: Plasmo 96.9 6.6E-05 1.4E-09 74.0 -4.1 144 46-223 7-163 (197)
31 PF12662 cEGF: Complement Clr- 95.9 0.0067 1.4E-07 39.5 2.4 24 59-90 1-24 (24)
32 cd00053 EGF Epidermal growth f 95.9 0.0094 2E-07 42.9 3.5 28 44-71 5-32 (36)
33 cd00053 EGF Epidermal growth f 95.9 0.01 2.2E-07 42.7 3.6 28 927-954 5-32 (36)
34 smart00181 EGF Epidermal growt 95.8 0.01 2.2E-07 42.9 3.4 32 153-186 2-34 (35)
35 smart00181 EGF Epidermal growt 95.7 0.013 2.7E-07 42.3 3.6 28 42-70 2-30 (35)
36 PF12662 cEGF: Complement Clr- 95.7 0.0094 2E-07 38.8 2.3 24 942-967 1-24 (24)
37 PF07974 EGF_2: EGF-like domai 94.2 0.067 1.4E-06 37.8 3.6 26 986-1013 7-32 (32)
38 KOG1218|consensus 94.1 3.3 7.2E-05 46.5 19.3 151 746-941 47-199 (316)
39 PF07974 EGF_2: EGF-like domai 93.6 0.069 1.5E-06 37.7 2.7 23 783-805 8-32 (32)
40 PF14670 FXa_inhibition: Coagu 93.2 0.042 9.1E-07 39.9 1.2 21 51-71 10-30 (36)
41 PF12661 hEGF: Human growth fa 90.8 0.11 2.4E-06 28.7 0.8 13 1001-1013 1-13 (13)
42 KOG1218|consensus 90.5 9.1 0.0002 42.9 16.8 163 747-960 14-177 (316)
43 KOG3512|consensus 90.3 2.4 5.2E-05 48.2 11.2 79 104-183 280-384 (592)
44 PF14670 FXa_inhibition: Coagu 89.7 0.14 3E-06 37.3 0.7 21 162-182 10-30 (36)
45 smart00051 DSL delta serrate l 87.9 0.65 1.4E-05 38.6 3.6 43 794-844 19-61 (63)
46 smart00051 DSL delta serrate l 87.0 0.74 1.6E-05 38.3 3.5 47 943-1013 17-63 (63)
47 PF12946 EGF_MSP1_1: MSP1 EGF 84.8 0.22 4.8E-06 36.1 -0.6 32 925-956 2-34 (37)
48 KOG3512|consensus 82.4 4 8.7E-05 46.5 7.6 118 881-1018 285-432 (592)
49 PF12946 EGF_MSP1_1: MSP1 EGF 81.7 0.73 1.6E-05 33.5 1.0 31 597-627 2-33 (37)
50 cd00055 EGF_Lam Laminin-type e 80.0 1.8 3.8E-05 34.1 2.8 28 992-1021 13-40 (50)
51 PF00053 Laminin_EGF: Laminin 78.6 1.1 2.4E-05 35.0 1.4 26 991-1018 11-36 (49)
52 cd00055 EGF_Lam Laminin-type e 77.3 3.4 7.4E-05 32.5 3.8 25 171-223 19-43 (50)
53 cd01475 vWA_Matrilin VWA_Matri 72.7 3 6.5E-05 44.4 3.2 33 148-182 185-219 (224)
54 PF00053 Laminin_EGF: Laminin 72.3 2 4.4E-05 33.6 1.3 22 162-187 11-32 (49)
55 cd01475 vWA_Matrilin VWA_Matri 71.5 3.4 7.3E-05 44.0 3.2 32 38-71 186-219 (224)
56 smart00180 EGF_Lam Laminin-typ 69.7 4.4 9.5E-05 31.3 2.6 27 992-1020 12-38 (46)
57 smart00180 EGF_Lam Laminin-typ 69.0 5.6 0.00012 30.7 3.1 14 172-187 19-32 (46)
58 PHA03099 epidermal growth fact 58.2 9.6 0.00021 35.8 3.0 30 986-1016 52-83 (139)
59 PF01683 EB: EB module; Inter 57.8 15 0.00033 29.0 3.8 19 783-801 28-46 (52)
60 PF01683 EB: EB module; Inter 57.1 16 0.00034 28.9 3.8 21 986-1010 27-47 (52)
61 PHA02887 EGF-like protein; Pro 49.5 16 0.00034 33.9 2.8 29 986-1015 93-123 (126)
62 PHA02887 EGF-like protein; Pro 48.6 13 0.00029 34.3 2.3 32 816-854 92-123 (126)
63 PF00954 S_locus_glycop: S-loc 47.4 16 0.00034 34.1 2.7 32 38-70 76-108 (110)
64 KOG3516|consensus 46.7 14 0.0003 47.2 2.8 42 144-187 539-581 (1306)
65 PHA03099 epidermal growth fact 42.0 17 0.00036 34.3 1.8 32 816-854 51-82 (139)
66 KOG3516|consensus 40.5 23 0.0005 45.3 3.3 36 234-269 542-578 (1306)
67 PF00954 S_locus_glycop: S-loc 38.4 28 0.0006 32.4 2.9 33 149-182 76-109 (110)
68 PF09064 Tme5_EGF_like: Thromb 37.1 25 0.00055 25.1 1.7 21 5-25 10-30 (34)
69 KOG3514|consensus 36.3 23 0.0005 44.6 2.4 36 924-961 625-661 (1591)
70 KOG3514|consensus 31.4 32 0.00069 43.4 2.4 35 152-188 625-660 (1591)
71 PF01414 DSL: Delta serrate li 26.5 20 0.00043 29.9 -0.3 44 793-844 18-61 (63)
72 PF03302 VSP: Giardia variant- 20.5 3E+02 0.0066 32.0 7.8 41 174-222 95-135 (397)
No 1
>KOG4289|consensus
Probab=99.73 E-value=4.7e-17 Score=192.78 Aligned_cols=91 Identities=33% Similarity=0.859 Sum_probs=78.5
Q ss_pred ccCCCce-eeCCCCCCCCCCCccccccCCCCCCCCCCCCCCCCCCeecccCCceeeeCCCCCcCCCCCCCCCCccCCCCC
Q psy13144 8 INTYEVF-YSCPPGTTGSPFVQCKPIVHEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCP 86 (1031)
Q Consensus 8 ~~~~~~~-C~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~ 86 (1031)
|++...+ |+|||||+|+. |+ +.||+|-+.||++||+|....|+|+|.|.+||+|. .|+...
T Consensus 1216 i~pvnglrCrCPpGFTgd~---Ce------TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe------hCEvs~--- 1277 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDY---CE------TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE------HCEVSA--- 1277 (2531)
T ss_pred ccccCceeEeCCCCCCccc---cc------chhHhhhcCCCCCCCceEEecCceeEEecCCcccc------ceeeec---
Confidence 4666777 99999999994 98 78999999999999999999999999999999998 787643
Q ss_pred CCccccCCCCCCCCCCCCCCCCeeeec-CCCceeeCCCC
Q psy13144 87 LDKSCQNQKCADPCPGTCGQNANCKVI-NHSPICRCKAG 124 (1031)
Q Consensus 87 ~~~~C~~~~C~~~c~~~C~~~~~C~~~-~g~~~C~C~~G 124 (1031)
..++|. ++.|.++|+|++. .|+|.|+|+.|
T Consensus 1278 -----~agrCv---pGvC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1278 -----RAGRCV---PGVCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred -----ccCccc---cceecCCCEEeecCCCceeccCCCc
Confidence 134454 5789999999995 68999999998
No 2
>KOG4289|consensus
Probab=99.62 E-value=1.1e-14 Score=173.24 Aligned_cols=90 Identities=31% Similarity=0.777 Sum_probs=75.2
Q ss_pred cCCCceeecCCCCcCCCcCccccccCcCcchhhccccccccCCCCCCCcccccccCCCCCCCCCCCCCCCCCCCeeeecC
Q psy13144 533 INHSPICTCKPGFTGDALAYCNRIPLSNYVFEKILIQLMYCPGTTGNPFVLCKLVQNEPVYTNPCQPSPCGPNSQCREVN 612 (1031)
Q Consensus 533 ~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~~~~g~~G~~~~~C~~~~~~~~~~~eC~~~~C~~~~~C~~~~ 612 (1031)
..+++.|+|++||+|+.|+ +.||+|.+.||.++++|....
T Consensus 1218 pvnglrCrCPpGFTgd~Ce----------------------------------------TeiDlCYs~pC~nng~C~srE 1257 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCE----------------------------------------TEIDLCYSGPCGNNGRCRSRE 1257 (2531)
T ss_pred ccCceeEeCCCCCCccccc----------------------------------------chhHhhhcCCCCCCCceEEec
Confidence 4577899999999999876 348999999999999999999
Q ss_pred CceeeeCCCCcccCCCCCcCcCccCCCCCCCCcccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCceeec-CCCC
Q psy13144 613 HQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACFNQKCVDPCPDSPPPPLESPPEYVNPCIPSPCGPYSQCRDI-GGSP 691 (1031)
Q Consensus 613 g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~~~~~~~~~~dideC~~~~C~~~g~C~~~-~g~y 691 (1031)
|+|+|+|.+||+|. +||+.. ..-.|.+..|.++|+|++. .|+|
T Consensus 1258 ggYtCeCrpg~tGe------hCEvs~------------------------------~agrCvpGvC~nggtC~~~~nggf 1301 (2531)
T KOG4289|consen 1258 GGYTCECRPGFTGE------HCEVSA------------------------------RAGRCVPGVCKNGGTCVNLLNGGF 1301 (2531)
T ss_pred CceeEEecCCcccc------ceeeec------------------------------ccCccccceecCCCEEeecCCCce
Confidence 99999999999999 687532 1234566778888999876 5788
Q ss_pred eeecCCC
Q psy13144 692 SCSCLPN 698 (1031)
Q Consensus 692 ~C~C~~G 698 (1031)
.|.|+.|
T Consensus 1302 ~c~Cp~g 1308 (2531)
T KOG4289|consen 1302 CCHCPYG 1308 (2531)
T ss_pred eccCCCc
Confidence 8999887
No 3
>KOG1217|consensus
Probab=99.58 E-value=1.9e-13 Score=163.50 Aligned_cols=333 Identities=27% Similarity=0.643 Sum_probs=225.1
Q ss_pred CCCCCCCeeeecCCceEeeCCCCCccCCCccccccCCCCCCCCCCCCCC--CCCCCccccc---CCceeeecCCCccCCC
Q psy13144 418 GTCGEGAICDVINHAVSCNCPAGTTGNPFVLCKPVQNEPVYTNPCHPSP--CGPNSQCREV---NHQAVCSCLPNYFGSP 492 (1031)
Q Consensus 418 ~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~C~~~~--C~~~g~C~~~---~g~~~C~C~~Gy~G~~ 492 (1031)
.+......+......|.|.|.+||.+. .++.. .+|...+ +...+.|... ...+.|.|..||.+.
T Consensus 95 ~~~~~~~~~~~~~~~~~c~c~~g~~~~---~~~~~-------~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~- 163 (487)
T KOG1217|consen 95 PCLLLCGECVDCVGSYECTCPPGYQGT---PCEGE-------CECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGE- 163 (487)
T ss_pred CcccCCccccCCCCCceeeCCCccccC---cCCcc-------eeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccc-
Confidence 334445566677888999999999998 55411 1355444 3455677764 358999999999998
Q ss_pred CCCcCcCcccCCCCCCccccCCCcCCCCCCCCCCCCeeeecCCCceeecCCCCcCCCcCccccccCcCcchhhccccccc
Q psy13144 493 PACRPECTVNTDCPLDKACFNQKCVDPCPGTCGQNANCRVINHSPICTCKPGFTGDALAYCNRIPLSNYVFEKILIQLMY 572 (1031)
Q Consensus 493 ~~C~~~C~~~~~c~~~~~C~~~~C~~~c~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~~~ 572 (1031)
.+.... .+|... ...|.+++.|.+..++|.|.|++||.+..+..
T Consensus 164 -----~~~~~~-----~~C~~~------~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~-------------------- 207 (487)
T KOG1217|consen 164 -----PCETDL-----DECIQY------SSPCQNGGTCVNTGGSYLCSCPPGYTGSTCET-------------------- 207 (487)
T ss_pred -----cccccc-----cccccC------CCCcCCCcccccCCCCeeEeCCCCccCCcCcC--------------------
Confidence 555431 133321 24688889999999999999999999987541
Q ss_pred cCCCCCCCcccccccCCCCCCCCCCCCCCCCCCCeeeecCCceeeeCCCCcccCCCCCcCcCccCCCCCCCCcccCCccC
Q psy13144 573 CPGTTGNPFVLCKLVQNEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACFNQKCV 652 (1031)
Q Consensus 573 ~~g~~G~~~~~C~~~~~~~~~~~eC~~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~ 652 (1031)
. .++++|++. +.|.+.+||.+. .|+
T Consensus 208 ----------------------~-------~~~~~c~~~---~~~~~~~g~~~~------~c~----------------- 232 (487)
T KOG1217|consen 208 ----------------------T-------GNGGTCVDS---VACSCPPGARGP------ECE----------------- 232 (487)
T ss_pred ----------------------C-------CCCceEecc---eeccCCCCCCCC------Ccc-----------------
Confidence 0 344566555 678999999876 233
Q ss_pred CCCCCCCCCCCCCCCCcCCcCCCCCCCCCCceeecCCCCeeecCCCCcCCCCCCCcCcccCCCCCCccccccCcCCCCCC
Q psy13144 653 DPCPDSPPPPLESPPEYVNPCIPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPECVMNSECPSNEACINEKCGDPCP 732 (1031)
Q Consensus 653 ~~C~~~~~~~~~~~~~dideC~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~c~ 732 (1031)
+++.+|... + ++|++..++|+|.|++||.+. .+ ..+.++++|... .
T Consensus 233 ---------------~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~--~~-~~~~~~~~C~~~------------~ 278 (487)
T KOG1217|consen 233 ---------------VSIVECASG---D-GTCVNTVGSYTCRCPEGYTGD--AC-VTCVDVDSCALI------------A 278 (487)
T ss_pred ---------------cccccccCC---C-CcccccCCceeeeCCCCcccc--cc-ceeeeccccCCC------------C
Confidence 234455444 4 899999999999999999998 42 234567777765 1
Q ss_pred CCCCCCCeeeccCCCceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCC
Q psy13144 733 GSCGYNAECKIINHTPICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPEC 812 (1031)
Q Consensus 733 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C 812 (1031)
.|.++++|.+..++|.|.|++||+|.. | .. +..... |..++.
T Consensus 279 -~c~~~~~C~~~~~~~~C~C~~g~~g~~---~-~~---------------~~~~~~-------C~~~~~----------- 320 (487)
T KOG1217|consen 279 -SCPNGGTCVNVPGSYRCTCPPGFTGRL---C-TE---------------CVDVDE-------CSPRNA----------- 320 (487)
T ss_pred -ccCCCCeeecCCCcceeeCCCCCCCCC---C-cc---------------cccccc-------cccccc-----------
Confidence 277789999999999999999999984 3 10 111111 111111
Q ss_pred cCCCCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceee-cCCCee
Q psy13144 813 ILNNDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRV-INHNAV 891 (1031)
Q Consensus 813 ~~~~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~-~~g~~~ 891 (1031)
...|.+++.|......+.++|.|..||.|. .|+... ++|.. ..+..++.|++ ..++|+
T Consensus 321 --~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~------~C~~~~-----~~C~~--------~~~~~~~~c~~~~~~~~~ 379 (487)
T KOG1217|consen 321 --GGPCANGGTCNTLGSFGGFRCACGPGFTGR------RCEDSN-----DECAS--------SPCCPGGTCVNETPGSYR 379 (487)
T ss_pred --CCcCCCCcccccCCCCCCCCcCCCCCCCCC------ccccCC-----ccccC--------CccccCCEeccCCCCCeE
Confidence 134666667733332367889999999888 666542 12332 24677899999 688999
Q ss_pred eEeCCCCccC-C--CccccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCcCCCCCCCCCcccCCCC
Q psy13144 892 CNCKPGFTGE-P--RIRCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFIGAPPNCRPECIQNSEC 968 (1031)
Q Consensus 892 C~C~~Gy~G~-~--~~~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~eC 968 (1031)
|.|+.+|.+. . ...+ .++++|.. .+.|++..++|.|. ++ + ... .+ .|.+++++
T Consensus 380 c~~~~~~~~~~~~~~~~~-------------~~~~~c~~-----~~~c~~~~~~~~c~-~~-~-~~~-~~--~~~~~~~~ 435 (487)
T KOG1217|consen 380 CACPAGFAGKANGDGVGC-------------EDIDECSG-----CGDCVNGPGGGACT-PP-G-LVS-PG--TCDDIDEC 435 (487)
T ss_pred ecCCCccccCCccccccc-------------cccccccC-----CcceeccCCCCccc-cC-c-ccC-Cc--ceeccccc
Confidence 9999999984 1 1123 35666754 57799999999999 88 5 432 22 56677666
Q ss_pred C
Q psy13144 969 P 969 (1031)
Q Consensus 969 ~ 969 (1031)
.
T Consensus 436 ~ 436 (487)
T KOG1217|consen 436 P 436 (487)
T ss_pred c
Confidence 4
No 4
>KOG1217|consensus
Probab=99.57 E-value=1.6e-13 Score=164.15 Aligned_cols=312 Identities=28% Similarity=0.656 Sum_probs=210.5
Q ss_pred CCCCCCCeecccCCceeeeCCCCCcCCCCCCCCCCccCCCCCCCccccCCCCCCCCCCCCCCCCeeee---cCCCceeeC
Q psy13144 45 SPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKV---INHSPICRC 121 (1031)
Q Consensus 45 ~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~---~~g~~~C~C 121 (1031)
.+....+.++....+|.|.|+.||.|.. +.... +|...+ ..+...+.|++ ....|.|+|
T Consensus 95 ~~~~~~~~~~~~~~~~~c~c~~g~~~~~------~~~~~------~C~~~~------~~~~~~~~c~~~~~~~~~~~c~C 156 (487)
T KOG1217|consen 95 PCLLLCGECVDCVGSYECTCPPGYQGTP------CEGEC------ECVTGP------GVCCIDGSCSNGPGSVGPFRCSC 156 (487)
T ss_pred CcccCCccccCCCCCceeeCCCccccCc------CCcce------eecCCC------CCeeCchhhcCCCCCCCceeeee
Confidence 3444456677788899999999999982 22211 222221 11223345555 345899999
Q ss_pred CCCCccCCccccccCCCCCCCCCCCCCCCCCCC--CCCCCCCCceecCCCCceeeCCCCCcCCCCCCcccCCCCCCCCCC
Q psy13144 122 KAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCY--PSPCGPYSQCRDINGSPSCSCLPSYIGSPPNCRPECIQNSECPYD 199 (1031)
Q Consensus 122 ~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~--~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~~C~~~ 199 (1031)
..||.+.. +. ...++|. ..+|.+++.|.+..++|.|.|++||.+. .++.. ...
T Consensus 157 ~~g~~~~~---~~-------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~--~~~~~-------~~~ 211 (487)
T KOG1217|consen 157 TEGYEGEP---CE-------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS--TCETT-------GNG 211 (487)
T ss_pred CCCccccc---cc-------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC--cCcCC-------CCC
Confidence 99999987 32 2347887 4569999999999999999999999999 66521 223
Q ss_pred CcccCCCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCCCCCCCCCcccccCCCeeeecCCCCcCCCCCCCCCCcc
Q psy13144 200 KACINEKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQPSPCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTV 279 (1031)
Q Consensus 200 ~~C~n~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~ 279 (1031)
..|++. +.+ .+.+||.+.. |. +.+.+|... + ++|++..++|+|.|++||.+. .+
T Consensus 212 ~~c~~~---~~~--~~~~g~~~~~---c~------~~~~~~~~~---~-~~c~~~~~~~~C~~~~g~~~~------~~-- 265 (487)
T KOG1217|consen 212 GTCVDS---VAC--SCPPGARGPE---CE------VSIVECASG---D-GTCVNTVGSYTCRCPEGYTGD------AC-- 265 (487)
T ss_pred ceEecc---eec--cCCCCCCCCC---cc------cccccccCC---C-CcccccCCceeeeCCCCcccc------cc--
Confidence 445544 556 8889998763 44 234555443 4 899999999999999999998 21
Q ss_pred CCCCCCCccccCCCcCCCCCCCCCCCCeeeeeCCCceeeCCCCCccCCcCcccccCccccCCCCCCCCCCCCCcccCCcc
Q psy13144 280 NSDCPLDKSCQNQKCADPCPGTCGQNANCKVINHSPICRCKAGFTGDPFTYCNRIPLQYLMPNNAPMNVPPISAVETPVL 359 (1031)
Q Consensus 280 ~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~G~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 359 (1031)
..+.+.+.|+.. ..|.++++|++..++|.|.|++||+|..+..+.+..+|....
T Consensus 266 -~~~~~~~~C~~~-------~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~------------------ 319 (487)
T KOG1217|consen 266 -VTCVDVDSCALI-------ASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRN------------------ 319 (487)
T ss_pred -ceeeeccccCCC-------CccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccc------------------
Confidence 112222344432 238888999999999999999999999752233333332100
Q ss_pred CCCCCCCCCCeecc------ceeecCCCcccCCCCcCCCCCcCCCCCCCCcccccCcccCCCCCCCCCCCCeeee-cCCc
Q psy13144 360 EDTCNCAPNAVCKD------EVCVCLPDFYGDGYVSCRPECVLNNDCPSNKACIKYKCKNPCVSGTCGEGAICDV-INHA 432 (1031)
Q Consensus 360 ~~~~~C~~~g~C~~------~~C~C~~G~~G~~c~~~~~eC~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~-~~g~ 432 (1031)
....|.+++.|.. +.|.|.+||.|..|+... ++|...++..++.|.+ ..++
T Consensus 320 -~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~---------------------~~C~~~~~~~~~~c~~~~~~~ 377 (487)
T KOG1217|consen 320 -AGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSN---------------------DECASSPCCPGGTCVNETPGS 377 (487)
T ss_pred -cCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCC---------------------ccccCCccccCCEeccCCCCC
Confidence 1123666767722 469999999998864211 2566667888999998 7899
Q ss_pred eEeeCCCCCccC-C--CccccccCCCCCCCCCCCCCCCCCCCcccccCCceeeecCC
Q psy13144 433 VSCNCPAGTTGN-P--FVLCKPVQNEPVYTNPCHPSPCGPNSQCREVNHQAVCSCLP 486 (1031)
Q Consensus 433 ~~C~C~~Gy~G~-~--~~~C~~~~~~~~~~~~C~~~~C~~~g~C~~~~g~~~C~C~~ 486 (1031)
|.|.|+.+|.+. . ...+ .++++|.. .+.|++..+++.|. ..
T Consensus 378 ~~c~~~~~~~~~~~~~~~~~-------~~~~~c~~-----~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 378 YRCACPAGFAGKANGDGVGC-------EDIDECSG-----CGDCVNGPGGGACT-PP 421 (487)
T ss_pred eEecCCCccccCCccccccc-------cccccccC-----CcceeccCCCCccc-cC
Confidence 999999999984 1 1122 34555543 46677778888888 77
No 5
>KOG1214|consensus
Probab=99.53 E-value=6.4e-14 Score=160.17 Aligned_cols=202 Identities=25% Similarity=0.592 Sum_probs=149.9
Q ss_pred CCCCCCCCeeeecCC-CceeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCC--CCCCCCCceecCCCCceeeCCC
Q psy13144 101 PGTCGQNANCKVINH-SPICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYP--SPCGPYSQCRDINGSPSCSCLP 177 (1031)
Q Consensus 101 ~~~C~~~~~C~~~~g-~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~ 177 (1031)
+|.|..++.|....+ .|+|.|.+||.|++.. | .++++|+. ..|+.+++|++.+++|+|.|..
T Consensus 699 sh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~-c--------------~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~ 763 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGVDYTCECSSGYQGDGRN-C--------------VDENECATGFHRCGPNSVCINLPGSYRCECRS 763 (1289)
T ss_pred CcccCCCccccCCCCcceEEEEeeccCCCCCC-C--------------CChhhhccCCCCCCCCceeecCCCceeEEEee
Confidence 689999999988665 6999999999998855 5 47889985 5699999999999999999999
Q ss_pred CCc--CCCCCCc--------ccCCCC-CCCCCCCc--ccC-CCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCCC
Q psy13144 178 SYI--GSPPNCR--------PECIQN-SECPYDKA--CIN-EKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQPS 243 (1031)
Q Consensus 178 Gy~--g~~~~C~--------~~C~~~-~~C~~~~~--C~n-~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~ 243 (1031)
||. +++.+|. +.|++. +.|...+. |+. +.+.|.| .|.|||.|++.. | .++|+|+++
T Consensus 764 gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C--~CLPGfsGDG~~-c-------~dvDeC~ps 833 (1289)
T KOG1214|consen 764 GYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSC--ACLPGFSGDGHQ-C-------TDVDECSPS 833 (1289)
T ss_pred cceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEE--eecCCccCCccc-c-------ccccccCcc
Confidence 986 4444675 246655 67776554 443 5678999 999999999753 4 457999999
Q ss_pred CCCCCCcccccCCCeeeecCCCCcCCCCCCCCCCccCCCCCCCccccCCCcCCCCCCCCCCCCeeee--eCCCceeeCCC
Q psy13144 244 PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKV--INHSPICRCKA 321 (1031)
Q Consensus 244 ~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~--~~g~~~C~C~~ 321 (1031)
-|...+.|.+++|+|.|+|.+||+|++..|+|.=.....|.... .+ +-.|+.++.|.. .+..|.+.|.+
T Consensus 834 rChp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er---~h------pl~chg~t~~~~~~Dp~~~e~p~~~ 904 (1289)
T KOG1214|consen 834 RCHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQER---FH------PLQCHGSTGFCWCVDPDGHEVPGTQ 904 (1289)
T ss_pred ccCCCceEecCCCcceeecccCccCCCceecCCCccCCcccccc---cc------ceeeccccceeEeeCCCcccCCCCC
Confidence 99999999999999999999999999999987511112222110 00 123555544432 24567888888
Q ss_pred CCccCCcCcccccCc
Q psy13144 322 GFTGDPFTYCNRIPL 336 (1031)
Q Consensus 322 G~~G~~~~~C~~~~~ 336 (1031)
+--|+.-..|-.+++
T Consensus 905 ~ppG~~~~~c~~~~~ 919 (1289)
T KOG1214|consen 905 TPPGSTPPHCGPSPE 919 (1289)
T ss_pred CCCCCCCCCCCCccc
Confidence 877776555655544
No 6
>KOG1214|consensus
Probab=99.47 E-value=4.4e-13 Score=153.49 Aligned_cols=196 Identities=26% Similarity=0.549 Sum_probs=138.8
Q ss_pred CceeecCCCCccCCcc-cCCCCCcCC-CCCCCCCccccCCCCCceeeecCCCC--cCCCCCCCCCcccCCCCCCcccccC
Q psy13144 791 DGVCVCLPDYYGDGYV-SCGPECILN-NDCPSNKACIRNKFNKQAVCSCLPNY--FGSPPACRPECTVNTDCPLDKACVN 866 (1031)
Q Consensus 791 ~~~C~C~~G~~G~~c~-~~~~~C~~~-~~C~~~g~C~~~~~~~~~~C~C~~G~--~g~g~~C~~~C~~~~~C~~~~~C~~ 866 (1031)
+.+|.|..||.|++-. .++++|... ..|..++.|++.+ ++|+|.|..|| ++++.+|+..=.. ...+.|+.
T Consensus 715 ~~tcecs~g~~gdgr~c~d~~eca~~~~~CGp~s~Cin~p--g~~rceC~~gy~F~dd~~tCV~i~~p----ap~n~Ce~ 788 (1289)
T KOG1214|consen 715 DYTCECSSGYQGDGRNCVDENECATGFHRCGPNSVCINLP--GSYRCECRSGYEFADDRHTCVLITPP----APANPCED 788 (1289)
T ss_pred ceEEEEeeccCCCCCCCCChhhhccCCCCCCCCceeecCC--CceeEEEeecceeccCCcceEEecCC----CCCCcccc
Confidence 3489999999988643 356788865 6899999999997 99999998885 6777788731110 12234444
Q ss_pred CcccCCCCCCCCCCC--ceeecCC-CeeeEeCCCCccCCCccccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCc
Q psy13144 867 QKCVDPCPGSCGQNA--NCRVINH-NAVCNCKPGFTGEPRIRCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSP 943 (1031)
Q Consensus 867 ~~C~~~C~~~C~~~~--~C~~~~g-~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~ 943 (1031)
+ .+.|.-.+ .|+...+ +|.|.|.|||.|++.. | .|+|||.++-|...|.|++++|+|
T Consensus 789 g------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-c-------------~dvDeC~psrChp~A~Cyntpgsf 848 (1289)
T KOG1214|consen 789 G------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-C-------------TDVDECSPSRCHPAATCYNTPGSF 848 (1289)
T ss_pred C------ccccCcCCceEEEecCCceEEEeecCCccCCccc-c-------------ccccccCccccCCCceEecCCCcc
Confidence 3 34565544 4555443 5999999999999864 5 368999999999999999999999
Q ss_pred eeeCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeee---ecCCCceeeCCCCCccCCCCCCcCCC
Q psy13144 944 SCSCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCK---VINHSPICTCPDGFVGDAFSGCYPKP 1020 (1031)
Q Consensus 944 ~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~---~~~g~~~C~C~~G~~G~~c~~c~~~~ 1020 (1031)
.|.|.+||.|++.+|-+-=.....|. .++ .. +..|+.++-|. +. ..|.+.++++=.|++-.+|-+.|
T Consensus 849 sC~C~pGy~GDGf~CVP~~~~~T~C~------~er-~h--pl~chg~t~~~~~~Dp-~~~e~p~~~~ppG~~~~~c~~~~ 918 (1289)
T KOG1214|consen 849 SCRCQPGYYGDGFQCVPDTSSLTPCE------QER-FH--PLQCHGSTGFCWCVDP-DGHEVPGTQTPPGSTPPHCGPSP 918 (1289)
T ss_pred eeecccCccCCCceecCCCccCCccc------ccc-cc--ceeeccccceeEeeCC-CcccCCCCCCCCCCCCCCCCCcc
Confidence 99999999999987753101112222 111 01 23577666433 44 45789999999998888787665
Q ss_pred Cc
Q psy13144 1021 PE 1022 (1031)
Q Consensus 1021 ~~ 1022 (1031)
..
T Consensus 919 ~~ 920 (1289)
T KOG1214|consen 919 EQ 920 (1289)
T ss_pred cc
Confidence 53
No 7
>KOG0994|consensus
Probab=99.37 E-value=5.1e-10 Score=132.49 Aligned_cols=46 Identities=20% Similarity=0.311 Sum_probs=34.2
Q ss_pred ceeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy13144 117 PICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYS 162 (1031)
Q Consensus 117 ~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g 162 (1031)
..|.|.---.|.+++.|.+++.-.||....-.+.++|..-.|..++
T Consensus 300 G~C~C~HNT~G~nCE~C~~fYnDlPWrpAeG~~~neCrkC~CNgHa 345 (1758)
T KOG0994|consen 300 GRCMCKHNTAGLNCEHCAPFYNDLPWRPAEGKTSNECRKCECNGHA 345 (1758)
T ss_pred ceeEeccCCCCCChHHhhHhhcCCCCCccCCCCcccccccCCCCCc
Confidence 4799998888999888888877777655444567888776665544
No 8
>KOG1219|consensus
Probab=99.26 E-value=8.7e-12 Score=154.01 Aligned_cols=120 Identities=24% Similarity=0.628 Sum_probs=105.8
Q ss_pred CCCCCCCCceeecC-CCeeeEeCCCCccCCCccccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCc
Q psy13144 874 PGSCGQNANCRVIN-HNAVCNCKPGFTGEPRIRCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFI 952 (1031)
Q Consensus 874 ~~~C~~~~~C~~~~-g~~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~ 952 (1031)
.++|.++|+|+..+ ++|.|.|++-|+|. .|+ .++..|.++||..|++|+...++|.|.|+.||+
T Consensus 3869 ~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CE------------i~~epC~snPC~~GgtCip~~n~f~CnC~~gyT 3933 (4289)
T KOG1219|consen 3869 DNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCE------------IDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYT 3933 (4289)
T ss_pred cCcccCCCEecCCCCCceEEeCcccccCc---ccc------------cccccccCCCCCCCCEEEecCCCeeEeCCCCcc
Confidence 46899999999986 47999999999999 898 678999999999999999999999999999999
Q ss_pred CCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCccCCCCCCcCCCCccccc
Q psy13144 953 GAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFVGDAFSGCYPKPPERTMW 1026 (1031)
Q Consensus 953 g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~G~~c~~c~~~~~~~~~~ 1026 (1031)
|. +|+.. .++||+. +.|..+|.|+++.|+|.|.|.+||.|..|..=+|....-+.|
T Consensus 3934 G~--~Ce~~--Gi~eCs~--------------n~C~~gg~C~n~~gsf~CncT~g~~gr~c~~~~pni~~~~~~ 3989 (4289)
T KOG1219|consen 3934 GK--RCEAR--GISECSK--------------NVCGTGGQCINIPGSFHCNCTPGILGRTCCAEKPNILSTVLW 3989 (4289)
T ss_pred Cc--eeecc--ccccccc--------------ccccCCceeeccCCceEeccChhHhcccCccccCccccccch
Confidence 99 88842 3777763 479999999999999999999999999996566666666666
No 9
>KOG1219|consensus
Probab=99.24 E-value=1.1e-11 Score=153.09 Aligned_cols=112 Identities=29% Similarity=0.820 Sum_probs=100.1
Q ss_pred CCCCCCCCCCCCCeeccc-CCceeeeCCCCCcCCCCCCCCCCccCCCCCCCccccCCCCCCCCCCCCCCCCeeeecCCCc
Q psy13144 39 TNPCQPSPCGPNSQCREV-NHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKVINHSP 117 (1031)
Q Consensus 39 ~d~C~~~~C~~~g~C~~~-~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~g~~ 117 (1031)
.++|..+||+++|+|+.. .|+|.|.|++.|+|. .|+... ..|. +++|..+|+|+...++|
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~------~CEi~~-----epC~--------snPC~~GgtCip~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN------HCEIDL-----EPCA--------SNPCLTGGTCIPFYNGF 3924 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccCc------cccccc-----cccc--------CCCCCCCCEEEecCCCe
Confidence 389999999999999985 668999999999998 888754 3555 37888999999999999
Q ss_pred eeeCCCCCccCCccccccCCCCCCCCCCCCCC-CCCCCCCCCCCCCceecCCCCceeeCCCCCcCCCCCCc
Q psy13144 118 ICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEP-VNPCYPSPCGPYSQCRDINGSPSCSCLPSYIGSPPNCR 187 (1031)
Q Consensus 118 ~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~-~d~C~~~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C~ 187 (1031)
.|.|+.||+|.. |+ .. +++|+.++|.++|.|+|..|+|.|.|-+||.|. .|.
T Consensus 3925 ~CnC~~gyTG~~---Ce-------------~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr--~c~ 3977 (4289)
T KOG1219|consen 3925 LCNCPNGYTGKR---CE-------------ARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR--TCC 3977 (4289)
T ss_pred eEeCCCCccCce---ee-------------cccccccccccccCCceeeccCCceEeccChhHhcc--cCc
Confidence 999999999997 76 44 899999999999999999999999999999999 664
No 10
>KOG0994|consensus
Probab=99.07 E-value=2.2e-09 Score=127.23 Aligned_cols=72 Identities=35% Similarity=0.949 Sum_probs=43.7
Q ss_pred ceecCCCCceeeCCCCCcCCCCCCcccCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCC
Q psy13144 163 QCRDINGSPSCSCLPSYIGSPPNCRPECIQNSECPYDKACINEKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQP 242 (1031)
Q Consensus 163 ~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~ 242 (1031)
.|....| +|+|.+|-+|. .|. +|.|||+|-+ .|..- ..=-..|.|.+
T Consensus 824 ~Cd~~tG--QC~C~~g~ygr--qCn--------------------------qCqpG~WgFP--eCr~C-qCNgHA~~Cd~ 870 (1758)
T KOG0994|consen 824 YCDKITG--QCQCRPGTYGR--QCN--------------------------QCQPGYWGFP--ECRPC-QCNGHADTCDP 870 (1758)
T ss_pred ccccccc--ceeeccccchh--hcc--------------------------ccCCCccCCC--cCccc-cccCcccccCc
Confidence 3543333 79999998888 554 8999999987 34310 00001223322
Q ss_pred C--CCCCCCcccccCCCeee-ecCCCCcCCC
Q psy13144 243 S--PCGPNSQCREVNHQAVC-SCLPNYFGSP 270 (1031)
Q Consensus 243 ~--~C~~~g~C~~~~g~y~C-~C~~Gy~g~~ 270 (1031)
. -|. .|.+..+++.| .|..||+|++
T Consensus 871 ~tGaCi---~CqD~T~G~~CdrCl~GyyGdP 898 (1758)
T KOG0994|consen 871 ITGACI---DCQDSTTGHSCDRCLDGYYGDP 898 (1758)
T ss_pred cccccc---cccccccccchhhhhccccCCc
Confidence 1 122 25567778888 7999999985
No 11
>KOG1225|consensus
Probab=98.98 E-value=2.8e-09 Score=122.95 Aligned_cols=132 Identities=33% Similarity=0.902 Sum_probs=103.3
Q ss_pred ceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCCcCCCCCCCCCccccC
Q psy13144 748 PICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPECILNNDCPSNKACIRN 827 (1031)
Q Consensus 748 ~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~ 827 (1031)
+.|.|..+|+|. .|+...- . -.|...+.|+++.|+|++||+|..|.. ..|.. .|+.++.+++.
T Consensus 234 ~ic~c~~~~~g~---~c~~~~C--------~--~~c~~~g~c~~G~CIC~~Gf~G~dC~e--~~Cp~--~cs~~g~~~~g 296 (525)
T KOG1225|consen 234 GICECPEGYFGP---LCSTIYC--------P--GGCTGRGQCVEGRCICPPGFTGDDCDE--LVCPV--DCSGGGVCVDG 296 (525)
T ss_pred ceeecCCceeCC---ccccccC--------C--CCCcccceEeCCeEeCCCCCcCCCCCc--ccCCc--ccCCCceecCC
Confidence 489999999999 5663210 0 026667899999999999999999854 45663 48888888863
Q ss_pred CCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecCCCeeeEeCCCCccCCCcccc
Q psy13144 828 KFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHNAVCNCKPGFTGEPRIRCS 907 (1031)
Q Consensus 828 ~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~C~ 907 (1031)
+|+|++||+|. +|+.. .|+..|.++|.|+. | +|.|.+||+|. .|+
T Consensus 297 ------~CiC~~g~~G~------dCs~~----------------~cpadC~g~G~Ci~--G--~C~C~~Gy~G~---~C~ 341 (525)
T KOG1225|consen 297 ------ECICNPGYSGK------DCSIR----------------RCPADCSGHGKCID--G--ECLCDEGYTGE---LCI 341 (525)
T ss_pred ------EeecCCCcccc------ccccc----------------cCCccCCCCCcccC--C--ceEeCCCCcCC---ccc
Confidence 89999999999 66542 23667999999993 3 99999999999 675
Q ss_pred CCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCcCCC
Q psy13144 908 KIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFIGAP 955 (1031)
Q Consensus 908 ~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~ 955 (1031)
. . +|.+++.|++. |.|..||.|..
T Consensus 342 ~--------------~-----~C~~~g~cv~g-----C~C~~Gw~G~d 365 (525)
T KOG1225|consen 342 Q--------------R-----ACSGGGQCVNG-----CKCKKGWRGPD 365 (525)
T ss_pred c--------------c-----ccCCCceeccC-----ceeccCccCCC
Confidence 2 1 38888999863 99999999984
No 12
>KOG1225|consensus
Probab=98.91 E-value=8.4e-09 Score=119.07 Aligned_cols=132 Identities=30% Similarity=0.806 Sum_probs=96.3
Q ss_pred CCCCCCeeeecCCceeeeCCCCcccCCCCCcCcCccCCCCCCCCcccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCC
Q psy13144 601 PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACFNQKCVDPCPDSPPPPLESPPEYVNPCIPSPCGP 680 (1031)
Q Consensus 601 ~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~~~~~~~~~~dideC~~~~C~~ 680 (1031)
.|..+++|++. +|+|++||+|.. |+ +-.|... |+.
T Consensus 255 ~c~~~g~c~~G----~CIC~~Gf~G~d------C~----------------------------------e~~Cp~~-cs~ 289 (525)
T KOG1225|consen 255 GCTGRGQCVEG----RCICPPGFTGDD------CD----------------------------------ELVCPVD-CSG 289 (525)
T ss_pred CCcccceEeCC----eEeCCCCCcCCC------CC----------------------------------cccCCcc-cCC
Confidence 45566778776 799999999993 54 2224333 777
Q ss_pred CCceeecCCCCeeecCCCCcCCCCCCCcCcccCCCCCCccccccCcCCCCCCCCCCCCCeeeccCCCceeeCCCCcccCC
Q psy13144 681 YSQCRDIGGSPSCSCLPNYIGAPPNCRPECVMNSECPSNEACINEKCGDPCPGSCGYNAECKIINHTPICTCPDGFIGDP 760 (1031)
Q Consensus 681 ~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 760 (1031)
++.+++. .|+|++||+|. .|+. ..|. ..|.++|.|+ .| .|.|.+||+|.
T Consensus 290 ~g~~~~g----~CiC~~g~~G~--dCs~-----~~cp---------------adC~g~G~Ci--~G--~C~C~~Gy~G~- 338 (525)
T KOG1225|consen 290 GGVCVDG----ECICNPGYSGK--DCSI-----RRCP---------------ADCSGHGKCI--DG--ECLCDEGYTGE- 338 (525)
T ss_pred CceecCC----EeecCCCcccc--cccc-----ccCC---------------ccCCCCCccc--CC--ceEeCCCCcCC-
Confidence 7877776 79999999999 7763 2233 3588899998 33 89999999999
Q ss_pred CCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCCcCCCCCCCCCccccC
Q psy13144 761 FTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPECILNNDCPSNKACIRN 827 (1031)
Q Consensus 761 ~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~ 827 (1031)
.|... .|++++.|+++ |+|..||.|.. ...+.++....|.....++..
T Consensus 339 --~C~~~--------------~C~~~g~cv~g-C~C~~Gw~G~d--~~~~~~~~~~~cs~~~~~~~~ 386 (525)
T KOG1225|consen 339 --LCIQR--------------ACSGGGQCVNG-CKCKKGWRGPD--VADPSLLLITECSPPSLCIAG 386 (525)
T ss_pred --ccccc--------------ccCCCceeccC-ceeccCccCCC--cCCchhhcccccCCCceeecc
Confidence 57642 28999999999 99999999987 333444443456555556554
No 13
>KOG4260|consensus
Probab=98.74 E-value=1.2e-08 Score=104.34 Aligned_cols=128 Identities=32% Similarity=0.701 Sum_probs=83.7
Q ss_pred eCCCCCCCCCCCccccccCCCCCCCCCC---CCCCCCCCeecc---cCCceeeeCCCCCcCCCCCCCCCCccCCCCCCCc
Q psy13144 16 SCPPGTTGSPFVQCKPIVHEPVYTNPCQ---PSPCGPNSQCRE---VNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDK 89 (1031)
Q Consensus 16 ~C~~G~~g~~~~~C~~~~~~~~~~d~C~---~~~C~~~g~C~~---~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~ 89 (1031)
-|++|.+|++ |. .|. .-||..+|.|.- ..|+..|.|.+||+|.. |. .|.+.--=...+
T Consensus 131 CCp~gtyGpd---Cl----------~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~--C~-~Cg~eyfes~Rn 194 (350)
T KOG4260|consen 131 CCPDGTYGPD---CL----------QCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPL--CR-YCGIEYFESSRN 194 (350)
T ss_pred ccCCCCcCCc---cc----------cCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCcc--cc-ccchHHHHhhcc
Confidence 4899999996 53 342 467999999974 57788999999999982 21 121110000000
Q ss_pred cccCCC---CCCCCCCCCCCCCeeeecCCCcee-eCCCCCccCCccccccCCCCCCCCCCCCCCCCCCC--CCCCCCCCc
Q psy13144 90 SCQNQK---CADPCPGTCGQNANCKVINHSPIC-RCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCY--PSPCGPYSQ 163 (1031)
Q Consensus 90 ~C~~~~---C~~~c~~~C~~~~~C~~~~g~~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~--~~~C~~~g~ 163 (1031)
+ ++.. |-.+|... |.. .++-.| +|+.||..+... | .|||||. +.||.....
T Consensus 195 e-~~lvCt~Ch~~C~~~------Csg-~~~k~C~kCkkGW~lde~g-C--------------vDvnEC~~ep~~c~~~qf 251 (350)
T KOG4260|consen 195 E-QHLVCTACHEGCLGV------CSG-ESSKGCSKCKKGWKLDEEG-C--------------VDVNECQNEPAPCKAHQF 251 (350)
T ss_pred c-ccchhhhhhhhhhcc------cCC-CCCCChhhhcccceecccc-c--------------ccHHHHhcCCCCCChhhe
Confidence 0 0111 21122222 322 223455 799999987533 6 5899997 578999999
Q ss_pred eecCCCCceeeCCCCCcCC
Q psy13144 164 CRDINGSPSCSCLPSYIGS 182 (1031)
Q Consensus 164 C~~~~g~~~C~C~~Gy~g~ 182 (1031)
|+|+.|||.|..++||.+.
T Consensus 252 CvNteGSf~C~dk~Gy~~g 270 (350)
T KOG4260|consen 252 CVNTEGSFKCEDKEGYKKG 270 (350)
T ss_pred eecCCCceEecccccccCC
Confidence 9999999999999999875
No 14
>KOG4260|consensus
Probab=98.53 E-value=8.8e-08 Score=98.03 Aligned_cols=162 Identities=26% Similarity=0.618 Sum_probs=102.9
Q ss_pred cCCCCccCCcccCCCCCcCC--CCCCCCCccccCC-CCCceeeecCCCCcCCCCCCCCCcccCCCCCCc-----ccccCC
Q psy13144 796 CLPDYYGDGYVSCGPECILN--NDCPSNKACIRNK-FNKQAVCSCLPNYFGSPPACRPECTVNTDCPLD-----KACVNQ 867 (1031)
Q Consensus 796 C~~G~~G~~c~~~~~~C~~~--~~C~~~g~C~~~~-~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~-----~~C~~~ 867 (1031)
|++|.+|..|.. |... .+|..+|.|.-.. ..|+..|.|.+||+|. .|.. |-+. ..=...
T Consensus 132 Cp~gtyGpdCl~----Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp------~C~~---Cg~eyfes~Rne~~l 198 (350)
T KOG4260|consen 132 CPDGTYGPDCLQ----CPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGP------LCRY---CGIEYFESSRNEQHL 198 (350)
T ss_pred cCCCCcCCcccc----CCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCc------cccc---cchHHHHhhcccccc
Confidence 789999998753 4322 4688888887643 2378899999999999 4431 1000 000000
Q ss_pred cccCCCCCCCCCCCceeecCCCeee-EeCCCCccCCCccccCCCCCCCCCCCCCcCCCCC--CCCCCCCCceeecCCCce
Q psy13144 868 KCVDPCPGSCGQNANCRVINHNAVC-NCKPGFTGEPRIRCSKIPPPPPPQDVPEYVNPCI--PSPCGPNSQCRDINGSPS 944 (1031)
Q Consensus 868 ~C~~~C~~~C~~~~~C~~~~g~~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~~C~--~~~C~~~~~C~~~~g~~~ 944 (1031)
.|. .|...|. +.|+... +-.| .|..||..+.. .| .|||||. +.||.....|+|+.|||+
T Consensus 199 vCt-~Ch~~C~--~~Csg~~-~k~C~kCkkGW~lde~-gC-------------vDvnEC~~ep~~c~~~qfCvNteGSf~ 260 (350)
T KOG4260|consen 199 VCT-ACHEGCL--GVCSGES-SKGCSKCKKGWKLDEE-GC-------------VDVNECQNEPAPCKAHQFCVNTEGSFK 260 (350)
T ss_pred hhh-hhhhhhh--cccCCCC-CCChhhhcccceeccc-cc-------------ccHHHHhcCCCCCChhheeecCCCceE
Confidence 000 0111222 2343322 2234 58999987732 36 5899995 579999999999999999
Q ss_pred eeCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCc
Q psy13144 945 CSCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFV 1009 (1031)
Q Consensus 945 C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~ 1009 (1031)
|...+||.+..+.|+ .- .+.|. ..+..|.++.++|+|+|..|+.
T Consensus 261 C~dk~Gy~~g~d~C~----------~~--------~d~~~---~kn~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 261 CEDKEGYKKGVDECQ----------FC--------ADVCA---SKNRPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred ecccccccCChHHhh----------hh--------hhhcc---cCCCCcccCCccEEEEecccce
Confidence 999999988533333 10 01111 1367899999999999999974
No 15
>KOG1226|consensus
Probab=97.86 E-value=6.4e-05 Score=88.68 Aligned_cols=140 Identities=24% Similarity=0.542 Sum_probs=93.9
Q ss_pred CCCCCCeeeccCCCceeeCCCCcccCCCCCCCCCCCCCCC-----CCccC-CCCCCCCCCeeeCceeecCCCCc----cC
Q psy13144 734 SCGYNAECKIINHTPICTCPDGFIGDPFTSCSPKPPEPVQ-----PVIQE-DTCNCVPNAECRDGVCVCLPDYY----GD 803 (1031)
Q Consensus 734 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~~~~~~~~~-----~~~~~-~~c~C~~~~~C~~~~C~C~~G~~----G~ 803 (1031)
.|+.+|+.+-. +|.|.+||.|. .|+-....... .|... +.=+|+..|.|+=+.|+|.+... |.
T Consensus 468 ~C~g~G~~~CG----~C~C~~G~~G~---~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~i~G~ 540 (783)
T KOG1226|consen 468 LCHGNGTFVCG----QCRCDEGWLGK---KCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGKIYGK 540 (783)
T ss_pred ccCCCCcEEec----ceecCCCCCCC---cccCCccccCcHhHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCceeee
Confidence 45555554433 89999999999 45532211111 11111 11169999999999999987776 88
Q ss_pred CcccCCCCCcCC--CCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCC
Q psy13144 804 GYVSCGPECILN--NDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNA 881 (1031)
Q Consensus 804 ~c~~~~~~C~~~--~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~ 881 (1031)
.|+-+.-.|... ..|.++|+|.-. +|+|.+||+|+ .|+-.. ..+.|++.. -.+|+..|
T Consensus 541 fCECDnfsC~r~~g~lC~g~G~C~CG------~CvC~~GwtG~------~C~C~~---std~C~~~~-----G~iCSGrG 600 (783)
T KOG1226|consen 541 FCECDNFSCERHKGVLCGGHGRCECG------RCVCNPGWTGS------ACNCPL---STDTCESSD-----GQICSGRG 600 (783)
T ss_pred eeeccCcccccccCcccCCCCeEeCC------cEEcCCCCccC------CCCCCC---CCccccCCC-----CceeCCCc
Confidence 898776677754 479999999864 89999999999 553211 112344321 24688888
Q ss_pred ceeecCCCeeeEeCCC-CccCCCcccc
Q psy13144 882 NCRVINHNAVCNCKPG-FTGEPRIRCS 907 (1031)
Q Consensus 882 ~C~~~~g~~~C~C~~G-y~G~~~~~C~ 907 (1031)
+|.= .+|+|... |.|. .|+
T Consensus 601 ~C~C----g~C~C~~~~~sG~---~CE 620 (783)
T KOG1226|consen 601 TCEC----GRCKCTDPPYSGE---FCE 620 (783)
T ss_pred eeeC----CceEcCCCCcCcc---hhh
Confidence 8866 38999766 9999 786
No 16
>KOG1226|consensus
Probab=97.78 E-value=0.00017 Score=85.17 Aligned_cols=137 Identities=20% Similarity=0.490 Sum_probs=91.9
Q ss_pred CCCCCCCCceeecCCCCeeecCCCCcCCCCCCC--cCcccC----CCCCCccccccCcCCCCCCCCCCCCCeeeccCCCc
Q psy13144 675 PSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCR--PECVMN----SECPSNEACINEKCGDPCPGSCGYNAECKIINHTP 748 (1031)
Q Consensus 675 ~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~--~~C~~~----~~C~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~ 748 (1031)
+..|+.+|+.+-. .|.|.+||.|. .|+ ...... +.|.... + ..+|++.|.|+=.
T Consensus 466 s~~C~g~G~~~CG----~C~C~~G~~G~--~CEC~~~~~ss~~~~~~Cr~~~----~------~~vCSgrG~C~CG---- 525 (783)
T KOG1226|consen 466 SALCHGNGTFVCG----QCRCDEGWLGK--KCECSTDELSSSEEEDKCRENS----D------SPVCSGRGDCVCG---- 525 (783)
T ss_pred ccccCCCCcEEec----ceecCCCCCCC--cccCCccccCcHhHHhhccCCC----C------CCCcCCCCcEeCC----
Confidence 3456655665544 59999999999 775 111111 2333321 0 2378888888765
Q ss_pred eeeCCCCcc----cCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccC--CCCCcCC--CCCCC
Q psy13144 749 ICTCPDGFI----GDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSC--GPECILN--NDCPS 820 (1031)
Q Consensus 749 ~C~C~~G~~----g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~--~~~C~~~--~~C~~ 820 (1031)
+|+|.+... |. .|+-.. +.|....--.|..+|+|.-+.|+|.+||+|..|... .+.|... ..|..
T Consensus 526 qC~C~~~~~~~i~G~---fCECDn----fsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSG 598 (783)
T KOG1226|consen 526 QCVCHKPDNGKIYGK---FCECDN----FSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESSDGQICSG 598 (783)
T ss_pred ceEecCCCCCceeee---eeeccC----cccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCCCCceeCC
Confidence 899988777 55 455221 111111222489999999999999999999998743 5677754 46888
Q ss_pred CCccccCCCCCceeeecCCC-CcCC
Q psy13144 821 NKACIRNKFNKQAVCSCLPN-YFGS 844 (1031)
Q Consensus 821 ~g~C~~~~~~~~~~C~C~~G-~~g~ 844 (1031)
.|+|+=. +|+|.+. |.|.
T Consensus 599 rG~C~Cg------~C~C~~~~~sG~ 617 (783)
T KOG1226|consen 599 RGTCECG------RCKCTDPPYSGE 617 (783)
T ss_pred CceeeCC------ceEcCCCCcCcc
Confidence 8888864 7999887 9998
No 17
>KOG1836|consensus
Probab=97.66 E-value=0.0017 Score=85.63 Aligned_cols=108 Identities=31% Similarity=0.686 Sum_probs=69.8
Q ss_pred eeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCC---ceecCCCCceeeCCCCCcCCCCCCcccCCCCC
Q psy13144 118 ICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYS---QCRDINGSPSCSCLPSYIGSPPNCRPECIQNS 194 (1031)
Q Consensus 118 ~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g---~C~~~~g~~~C~C~~Gy~g~~~~C~~~C~~~~ 194 (1031)
.|.|+.||+|..++.|.+......++. -+. +...+|.-++ +|... +-.|.|.+--.|. .|+
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~---~~~--~~c~~C~cngh~~~Cd~~--tG~C~C~~~t~G~--~C~------- 759 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQL---GPF--CPCIPCDCNGHSNICDPR--TGQCKCKHNTFGG--QCA------- 759 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccC---CCC--CcccccccCCccccccCC--CCceecccCCCCC--chh-------
Confidence 399999999998666653332221111 122 3223444333 45433 3467888777777 665
Q ss_pred CCCCCCcccCCCCCCCCCCCCCCCCCCCCCcccccccCCCCCCCCCCCCCCCCCCcccccC--CCeeee-cCCCCcCC
Q psy13144 195 ECPYDKACINEKCADPCPGFCPPGTTGSPFVQCKPIVHEPVYTNPCQPSPCGPNSQCREVN--HQAVCS-CLPNYFGS 269 (1031)
Q Consensus 195 ~C~~~~~C~n~~~~~~C~~~C~~G~~g~~~~~C~~~~~~~~~~d~C~~~~C~~~g~C~~~~--g~y~C~-C~~Gy~g~ 269 (1031)
+|..||.|.+.. .. .+.|++.+|.+++.|..+. ....|. |++||+|.
T Consensus 760 -------------------~C~~GfYg~~~~--------~~-~~dC~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~ 809 (1705)
T KOG1836|consen 760 -------------------QCVDGFYGLPDL--------GT-SGDCQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL 809 (1705)
T ss_pred -------------------hhcCCCCCcccc--------CC-CCCCccCCCCCChhhcCcCcccceecCCCCCCCccc
Confidence 899999998621 01 1238888999998887764 567897 99999999
No 18
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.59 E-value=3.8e-05 Score=58.27 Aligned_cols=34 Identities=26% Similarity=0.587 Sum_probs=30.1
Q ss_pred CCCCCCC--CCCCCCCeecccCCceeeeCCCCCcCC
Q psy13144 38 YTNPCQP--SPCGPNSQCREVNHQAVCSCLPNYFGS 71 (1031)
Q Consensus 38 ~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~Gy~g~ 71 (1031)
|||||+. ++|..+++|+|+.|+|+|.|++||...
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 5899974 579989999999999999999999843
No 19
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.56 E-value=3.6e-05 Score=54.51 Aligned_cols=30 Identities=30% Similarity=0.892 Sum_probs=27.6
Q ss_pred CCCCCCCCCCeecccC-CceeeeCCCCCcCC
Q psy13144 42 CQPSPCGPNSQCREVN-HQAVCSCLPNYFGS 71 (1031)
Q Consensus 42 C~~~~C~~~g~C~~~~-g~~~C~C~~Gy~g~ 71 (1031)
|.++||.++|+|++.. ++|+|+|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5678999999999988 99999999999996
No 20
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.56 E-value=5.4e-05 Score=57.48 Aligned_cols=34 Identities=32% Similarity=0.711 Sum_probs=30.1
Q ss_pred cCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCc
Q psy13144 964 QNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFV 1009 (1031)
Q Consensus 964 ~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~ 1009 (1031)
|||||... .+.|..++.|+|+.|+|+|.|++||.
T Consensus 1 DidEC~~~------------~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEG------------PHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTT------------SSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCC------------CCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 67888865 35788899999999999999999998
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.49 E-value=4.2e-05 Score=54.19 Aligned_cols=30 Identities=33% Similarity=0.979 Sum_probs=27.5
Q ss_pred CCCCCCCCCCceeecC-CCceeeCCCCCcCC
Q psy13144 925 CIPSPCGPNSQCRDIN-GSPSCSCLPTFIGA 954 (1031)
Q Consensus 925 C~~~~C~~~~~C~~~~-g~~~C~C~~Gy~g~ 954 (1031)
|.++||+++|+|++.. ++|+|+|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4567999999999998 99999999999997
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.39 E-value=0.00018 Score=53.44 Aligned_cols=34 Identities=29% Similarity=0.826 Sum_probs=30.4
Q ss_pred cCCCCCC-CCCCCCCceeecCCCceeeCCCCCc-CC
Q psy13144 921 YVNPCIP-SPCGPNSQCRDINGSPSCSCLPTFI-GA 954 (1031)
Q Consensus 921 ~~~~C~~-~~C~~~~~C~~~~g~~~C~C~~Gy~-g~ 954 (1031)
++++|.. ++|.++++|+++.++|+|.|++||+ |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 3688887 7999999999999999999999999 55
No 23
>KOG1836|consensus
Probab=97.39 E-value=0.0055 Score=80.96 Aligned_cols=53 Identities=32% Similarity=0.746 Sum_probs=39.1
Q ss_pred cee-eCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCceecCC--CCceee-CCCCCcCCCCCCc
Q psy13144 117 PIC-RCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYSQCRDIN--GSPSCS-CLPSYIGSPPNCR 187 (1031)
Q Consensus 117 ~~C-~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g~C~~~~--g~~~C~-C~~Gy~g~~~~C~ 187 (1031)
-+| +|..||.|.... ...-| |.+-+|.+++.|..+. ..+.|+ |++||+|. .|+
T Consensus 756 ~~C~~C~~GfYg~~~~---------------~~~~d-C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~--rCe 812 (1705)
T KOG1836|consen 756 GQCAQCVDGFYGLPDL---------------GTSGD-CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGL--RCE 812 (1705)
T ss_pred CchhhhcCCCCCcccc---------------CCCCC-CccCCCCCChhhcCcCcccceecCCCCCCCccc--ccc
Confidence 344 688999987621 01223 8888999988887654 567899 99999999 776
No 24
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.38 E-value=0.00019 Score=53.32 Aligned_cols=35 Identities=37% Similarity=0.963 Sum_probs=31.2
Q ss_pred CCCCCC-CCCCCCCceecCCCCceeeCCCCCc-CCCCCC
Q psy13144 150 VNPCYP-SPCGPYSQCRDINGSPSCSCLPSYI-GSPPNC 186 (1031)
Q Consensus 150 ~d~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~-g~~~~C 186 (1031)
+++|.. .+|.++++|+++.++|+|.|++||. |. .|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~--~C 38 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR--NC 38 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC--cC
Confidence 678887 7999999999999999999999999 66 55
No 25
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.23 E-value=0.00016 Score=52.52 Aligned_cols=30 Identities=43% Similarity=0.835 Sum_probs=24.8
Q ss_pred CCCCCCCeeeecCCCceeeCCCCCccCCCC
Q psy13144 985 GSCGYNALCKVINHSPICTCPDGFVGDAFS 1014 (1031)
Q Consensus 985 ~~C~~~~~C~~~~g~~~C~C~~G~~G~~c~ 1014 (1031)
+.|+.+|+|+++.++|+|+|++||+|++..
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 469999999999999999999999999864
No 26
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.11 E-value=0.00011 Score=72.54 Aligned_cols=139 Identities=24% Similarity=0.559 Sum_probs=81.1
Q ss_pred CCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecC-----CCeeeEeCCCCccCCCc
Q psy13144 830 NKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVIN-----HNAVCNCKPGFTGEPRI 904 (1031)
Q Consensus 830 ~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~-----g~~~C~C~~Gy~G~~~~ 904 (1031)
++.|+|.|.+||..... +.|+...+|... . -+ ..+|...|+|++.. ..|+|.|.+||.....
T Consensus 17 SNHfEC~Cnegfvl~~E---ntCE~kv~C~~~-----e-~~---~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~- 83 (197)
T PF06247_consen 17 SNHFECKCNEGFVLKNE---NTCEEKVECDKL-----E-NV---NKPCGDYAKCINQANKGEERAYKCDCINGYILKQG- 83 (197)
T ss_dssp SSEEEEEESTTEEEEET---TEEEE----SG------G-GT---TSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-
T ss_pred cCceEEEcCCCcEEccc---cccccceecCcc-----c-cc---CccccchhhhhcCCCcccceeEEEecccCceeeCC-
Confidence 37899999999976531 133333332210 0 01 23678889999876 3599999999987633
Q ss_pred cccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeec---CCCceeeCCCCCc-CCCCCCCCCcccCCCCCCcchhcccccc
Q psy13144 905 RCSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDI---NGSPSCSCLPTFI-GAPPNCRPECIQNSECPFDKACIREKCI 980 (1031)
Q Consensus 905 ~C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~---~g~~~C~C~~Gy~-g~~~~C~~~C~~~~eC~~~~~C~~~~c~ 980 (1031)
.|. .++|....|.. |.|+-. +....|+|..|+. .+...|.. .+ .
T Consensus 84 vCv--------------p~~C~~~~Cg~-GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk---------~G--------~ 131 (197)
T PF06247_consen 84 VCV--------------PNKCNNKDCGS-GKCILDPDNPNNPTCSCNIGKVPDDNKKCTK---------TG--------E 131 (197)
T ss_dssp SEE--------------EGGGSS---TT-EEEEEEEGGGSEEEEEE-TEEETTTTTESEE---------EE---------
T ss_pred eEc--------------hhhcCceecCC-CeEEecCCCCCCceeEeeeceEeccCCcccC---------CC--------c
Confidence 364 35676667884 899743 3355999999999 33222221 01 2
Q ss_pred CCCCCCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13144 981 DPCPGSCGYNALCKVINHSPICTCPDGFVGDAF 1013 (1031)
Q Consensus 981 ~~c~~~C~~~~~C~~~~g~~~C~C~~G~~G~~c 1013 (1031)
.+|..-|..+-+|..+.+.|+|.+..||.+++=
T Consensus 132 T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 132 TKCSLKCKENEECKLVDGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp -------TTTEEEEEETTEEEEEE-TT-EEETT
T ss_pred cceeeecCCCcceeeeCcEEEeecCCCCCCCCC
Confidence 334456778899999999999999999987664
No 27
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.04 E-value=0.00027 Score=51.29 Aligned_cols=30 Identities=43% Similarity=0.912 Sum_probs=24.6
Q ss_pred CCCCCCCCeeeecCCCceeeCCCCCccCCc
Q psy13144 101 PGTCGQNANCKVINHSPICRCKAGFTGDPF 130 (1031)
Q Consensus 101 ~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~ 130 (1031)
++.|+.+|+|+++.++|+|+|++||+|++.
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~ 34 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGDGF 34 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECCST
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccCCc
Confidence 378999999999999999999999999874
No 28
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.87 E-value=0.0012 Score=48.51 Aligned_cols=33 Identities=33% Similarity=0.899 Sum_probs=29.9
Q ss_pred CCCCCC-CCCCCCCceeecCCCceeeCCCCCcCC
Q psy13144 922 VNPCIP-SPCGPNSQCRDINGSPSCSCLPTFIGA 954 (1031)
Q Consensus 922 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~Gy~g~ 954 (1031)
+++|.. .+|.+++.|++..++|+|.|++||.|.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 578877 799988999999999999999999986
No 29
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.87 E-value=0.0012 Score=48.41 Aligned_cols=35 Identities=40% Similarity=0.982 Sum_probs=30.9
Q ss_pred CCCCCC-CCCCCCCceecCCCCceeeCCCCCcCCCCCC
Q psy13144 150 VNPCYP-SPCGPYSQCRDINGSPSCSCLPSYIGSPPNC 186 (1031)
Q Consensus 150 ~d~C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C 186 (1031)
+++|.. .+|.++++|++..++|+|.|++||.|. .|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~--~C 37 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR--NC 37 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC--cC
Confidence 567876 789988999999999999999999997 55
No 30
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.86 E-value=6.6e-05 Score=74.01 Aligned_cols=144 Identities=22% Similarity=0.513 Sum_probs=89.3
Q ss_pred CCCCCCeecccCCceeeeCCCCCcCCCCCCCCCCccCCCCCCCccccCCCCCCCCCCCCCCCCeeeecC-----CCceee
Q psy13144 46 PCGPNSQCREVNHQAVCSCLPNYFGSPPACRPECTVNSDCPLDKSCQNQKCADPCPGTCGQNANCKVIN-----HSPICR 120 (1031)
Q Consensus 46 ~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~C~~~~C~~~c~~~C~~~~~C~~~~-----g~~~C~ 120 (1031)
.|. ||.-+...+.|.|.|.+||.+.. .++|+...+|...+. . .-.|..-|+|++.. ..|+|.
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~---EntCE~kv~C~~~e~------~---~K~Cgdya~C~~~~~~~~~~~~~C~ 73 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLKN---ENTCEEKVECDKLEN------V---NKPCGDYAKCINQANKGEERAYKCD 73 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEEE---TTEEEE----SG-GG------T---TSEEETTEEEEE-SSTTSSTSEEEE
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEcc---ccccccceecCcccc------c---CccccchhhhhcCCCcccceeEEEe
Confidence 344 45778888899999999998853 136776665543211 1 25677889999865 479999
Q ss_pred CCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCceecCC---CCceeeCCCCCc-CCCCCCc----ccCCC
Q psy13144 121 CKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYSQCRDIN---GSPSCSCLPSYI-GSPPNCR----PECIQ 192 (1031)
Q Consensus 121 C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g~C~~~~---g~~~C~C~~Gy~-g~~~~C~----~~C~~ 192 (1031)
|.+||+..... |. .+.|..-.|+ .|.|+-.+ ....|+|.-|+. .+...|. ..|.
T Consensus 74 C~~gY~~~~~v-Cv---------------p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~- 135 (197)
T PF06247_consen 74 CINGYILKQGV-CV---------------PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCS- 135 (197)
T ss_dssp E-TTEEESSSS-EE---------------EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE------
T ss_pred cccCceeeCCe-Ec---------------hhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCcccee-
Confidence 99999876533 53 3566666788 58997433 345999999999 4444565 2343
Q ss_pred CCCCCCCCcccCCCCCCCCCCCCCCCCCCCC
Q psy13144 193 NSECPYDKACINEKCADPCPGFCPPGTTGSP 223 (1031)
Q Consensus 193 ~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~~ 223 (1031)
-.|..+..|....+-|+| .+.+||.++.
T Consensus 136 -LKCk~nE~CK~~~~~Y~C--~~~~~~~~~~ 163 (197)
T PF06247_consen 136 -LKCKENEECKLVDGYYKC--VCKEGFPGDG 163 (197)
T ss_dssp ----TTTEEEEEETTEEEE--EE-TT-EEET
T ss_pred -eecCCCcceeeeCcEEEe--ecCCCCCCCC
Confidence 246678899999999999 9999998764
No 31
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.91 E-value=0.0067 Score=39.53 Aligned_cols=24 Identities=33% Similarity=0.573 Sum_probs=17.4
Q ss_pred ceeeeCCCCCcCCCCCCCCCCccCCCCCCCcc
Q psy13144 59 QAVCSCLPNYFGSPPACRPECTVNSDCPLDKS 90 (1031)
Q Consensus 59 ~~~C~C~~Gy~g~~~~c~~~C~~~~~C~~~~~ 90 (1031)
||+|+|++||++.+ ++..|.++||
T Consensus 1 sy~C~C~~Gy~l~~--------d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP--------DGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCC--------CCCccccCCC
Confidence 69999999999873 4445555553
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.88 E-value=0.0094 Score=42.90 Aligned_cols=28 Identities=29% Similarity=0.776 Sum_probs=25.7
Q ss_pred CCCCCCCCeecccCCceeeeCCCCCcCC
Q psy13144 44 PSPCGPNSQCREVNHQAVCSCLPNYFGS 71 (1031)
Q Consensus 44 ~~~C~~~g~C~~~~g~~~C~C~~Gy~g~ 71 (1031)
..+|.++++|++..++|+|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5788889999999999999999999886
No 33
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=95.88 E-value=0.01 Score=42.70 Aligned_cols=28 Identities=32% Similarity=0.873 Sum_probs=25.7
Q ss_pred CCCCCCCCceeecCCCceeeCCCCCcCC
Q psy13144 927 PSPCGPNSQCRDINGSPSCSCLPTFIGA 954 (1031)
Q Consensus 927 ~~~C~~~~~C~~~~g~~~C~C~~Gy~g~ 954 (1031)
..+|.++++|+++.++|+|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 5689889999999999999999999987
No 34
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.84 E-value=0.01 Score=42.88 Aligned_cols=32 Identities=41% Similarity=1.021 Sum_probs=26.1
Q ss_pred CCC-CCCCCCCceecCCCCceeeCCCCCcCCCCCC
Q psy13144 153 CYP-SPCGPYSQCRDINGSPSCSCLPSYIGSPPNC 186 (1031)
Q Consensus 153 C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~g~~~~C 186 (1031)
|.. .+|.++ +|+++.++|+|.|++||+|. ..|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~-~~C 34 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTGD-KRC 34 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCccC-Ccc
Confidence 444 578887 99999999999999999993 155
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=95.72 E-value=0.013 Score=42.34 Aligned_cols=28 Identities=36% Similarity=0.950 Sum_probs=25.0
Q ss_pred CCC-CCCCCCCeecccCCceeeeCCCCCcC
Q psy13144 42 CQP-SPCGPNSQCREVNHQAVCSCLPNYFG 70 (1031)
Q Consensus 42 C~~-~~C~~~g~C~~~~g~~~C~C~~Gy~g 70 (1031)
|.. .+|.++ +|++..++|+|.|++||+|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence 555 688888 9999999999999999998
No 36
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.67 E-value=0.0094 Score=38.85 Aligned_cols=24 Identities=33% Similarity=0.641 Sum_probs=18.8
Q ss_pred CceeeCCCCCcCCCCCCCCCcccCCC
Q psy13144 942 SPSCSCLPTFIGAPPNCRPECIQNSE 967 (1031)
Q Consensus 942 ~~~C~C~~Gy~g~~~~C~~~C~~~~e 967 (1031)
||+|.|++||+..+. .+.|+||||
T Consensus 1 sy~C~C~~Gy~l~~d--~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPD--GRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCC--CCccccCCC
Confidence 699999999997642 356778876
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.21 E-value=0.067 Score=37.80 Aligned_cols=26 Identities=23% Similarity=0.595 Sum_probs=22.9
Q ss_pred CCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13144 986 SCGYNALCKVINHSPICTCPDGFVGDAF 1013 (1031)
Q Consensus 986 ~C~~~~~C~~~~g~~~C~C~~G~~G~~c 1013 (1031)
.|.++++|+...+ +|+|.+||+|..|
T Consensus 7 ~C~~~G~C~~~~g--~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCG--RCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence 5888999998865 8999999999875
No 38
>KOG1218|consensus
Probab=94.11 E-value=3.3 Score=46.48 Aligned_cols=151 Identities=25% Similarity=0.599 Sum_probs=80.0
Q ss_pred CCceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeec-CCCCccCCcccCCCCCcCCCCCCCCCcc
Q psy13144 746 HTPICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVC-LPDYYGDGYVSCGPECILNNDCPSNKAC 824 (1031)
Q Consensus 746 g~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C-~~G~~G~~c~~~~~~C~~~~~C~~~g~C 824 (1031)
.+..|.+..+|.+. .|................+.|..+.........+ ..+|.|..|... .+|.. .|.. .+|
T Consensus 47 ~~~~~~~~~~~~~~---~c~~~~~~~~~~~~c~~~~~c~~~~~~~~~~~~~~~~~~~g~~C~~~-~~~~~--~c~~-~~C 119 (316)
T KOG1218|consen 47 NSGECGLGYGFVGS---VCRIECVCGNAGGGCSQPCRCKNGGTCVSSTGYCHLNGYEGPQCESP-CPCGD--GCAE-KTC 119 (316)
T ss_pred CceeEecccccCCC---ccccccccCCCCCcccCccccCCCCcccCCCCcccCCCCCcccccCC-CCcCC--cccc-ccc
Confidence 34588889999988 3554433322222223333455555555444455 688998887532 12221 1333 567
Q ss_pred ccCCCCCceeeecCCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecCCCeeeEeCCCCccCCCc
Q psy13144 825 IRNKFNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHNAVCNCKPGFTGEPRI 904 (1031)
Q Consensus 825 ~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~ 904 (1031)
.+.. . .|.+..+|.+. .|... -.....|. ..|.....+.... ..|.|.+||+|.
T Consensus 120 ~~~~---~-~c~~~~~~~~~------~C~~~--~~~g~~C~---------~~c~~~~~~~~~~--~~c~c~~g~~g~--- 173 (316)
T KOG1218|consen 120 ANPR---R-ECRCGGGYIGE------QCGEE--NLVGLKCQ---------RDCQCTGGCDCKN--GICTCQPGFVGV--- 173 (316)
T ss_pred CCCc---c-ceecCCcCccc------ccccc--CCCCCCcc---------CCCCCccccCCCC--CceeccCCcccc---
Confidence 7653 2 68888888777 45430 00011122 1222222222222 378999999999
Q ss_pred cccCCCCCCCCCCCCCcCCCCCC-CCCCCCCceeecCC
Q psy13144 905 RCSKIPPPPPPQDVPEYVNPCIP-SPCGPNSQCRDING 941 (1031)
Q Consensus 905 ~C~~~~~~~~~~~~~~~~~~C~~-~~C~~~~~C~~~~g 941 (1031)
++... ...|.. ..+.+++.|+...+
T Consensus 174 ~~~~~------------~~~c~~~~~~~~g~~C~~~~~ 199 (316)
T KOG1218|consen 174 FCVES------------CSGCSPLTACENGAKCNRSTG 199 (316)
T ss_pred ccccc------------CCCcCCCcccCCCCeeecccc
Confidence 67532 111442 35677677776654
No 39
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=93.59 E-value=0.069 Score=37.72 Aligned_cols=23 Identities=35% Similarity=0.807 Sum_probs=20.8
Q ss_pred CCCCCeee--CceeecCCCCccCCc
Q psy13144 783 CVPNAECR--DGVCVCLPDYYGDGY 805 (1031)
Q Consensus 783 C~~~~~C~--~~~C~C~~G~~G~~c 805 (1031)
|+++|+|+ .++|+|.+||+|..|
T Consensus 8 C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 8 CSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred cCCCCEEeCCCCEEECCCCCcCCCC
Confidence 88999999 679999999999865
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.21 E-value=0.042 Score=39.92 Aligned_cols=21 Identities=29% Similarity=0.542 Sum_probs=17.9
Q ss_pred CeecccCCceeeeCCCCCcCC
Q psy13144 51 SQCREVNHQAVCSCLPNYFGS 71 (1031)
Q Consensus 51 g~C~~~~g~~~C~C~~Gy~g~ 71 (1031)
.+|++.+++|+|+|++||++.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-
T ss_pred CCCccCCCceEeECCCCCEEC
Confidence 389999999999999999997
No 41
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=90.83 E-value=0.11 Score=28.70 Aligned_cols=13 Identities=38% Similarity=1.135 Sum_probs=10.5
Q ss_pred eeeCCCCCccCCC
Q psy13144 1001 ICTCPDGFVGDAF 1013 (1031)
Q Consensus 1001 ~C~C~~G~~G~~c 1013 (1031)
+|.|++||+|..|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999876
No 42
>KOG1218|consensus
Probab=90.47 E-value=9.1 Score=42.91 Aligned_cols=163 Identities=22% Similarity=0.503 Sum_probs=75.5
Q ss_pred CceeeCCCCcccCCCCCCCCCCCCCCCCCccCCCCCCCCCCeeeCceeecCCCCccCCcccCCCCCcCCCCCCCCCcccc
Q psy13144 747 TPICTCPDGFIGDPFTSCSPKPPEPVQPVIQEDTCNCVPNAECRDGVCVCLPDYYGDGYVSCGPECILNNDCPSNKACIR 826 (1031)
Q Consensus 747 ~~~C~C~~G~~g~~~~~C~~~~~~~~~~~~~~~~c~C~~~~~C~~~~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~ 826 (1031)
+..|.|.+||.|. . .+.. ....... ...+++ .=...+|.+..+|.|..|......-.....|.....|..
T Consensus 14 ~~~c~c~~~~~g~-~-~~~~-~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~ 83 (316)
T KOG1218|consen 14 SGQCFCDPGYTGR-L-QCEH-QAVTSAC---SGICPC----EVNSGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKN 83 (316)
T ss_pred CCceecCCCcccc-c-cccC-CCCCccc---cccCCc----cCCceeEecccccCCCccccccccCCCCCcccCccccCC
Confidence 4589999999995 1 2332 1110000 011111 002346888899998876432211111123444444444
Q ss_pred CCCCCceeeec-CCCCcCCCCCCCCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeecCCCeeeEeCCCCccCCCcc
Q psy13144 827 NKFNKQAVCSC-LPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVINHNAVCNCKPGFTGEPRIR 905 (1031)
Q Consensus 827 ~~~~~~~~C~C-~~G~~g~g~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~Gy~G~~~~~ 905 (1031)
......+.-.+ ..+|.|. .|+....+... |.. .+|.+... .|.+..+|.+. +
T Consensus 84 ~~~~~~~~~~~~~~~~~g~------~C~~~~~~~~~---------------c~~-~~C~~~~~--~c~~~~~~~~~---~ 136 (316)
T KOG1218|consen 84 GGTCVSSTGYCHLNGYEGP------QCESPCPCGDG---------------CAE-KTCANPRR--ECRCGGGYIGE---Q 136 (316)
T ss_pred CCcccCCCCcccCCCCCcc------cccCCCCcCCc---------------ccc-cccCCCcc--ceecCCcCccc---c
Confidence 33223334444 5677777 55443221110 222 34444332 57777777777 5
Q ss_pred ccCCCCCCCCCCCCCcCCCCCCCCCCCCCceeecCCCceeeCCCCCcCCCCCCCC
Q psy13144 906 CSKIPPPPPPQDVPEYVNPCIPSPCGPNSQCRDINGSPSCSCLPTFIGAPPNCRP 960 (1031)
Q Consensus 906 C~~~~~~~~~~~~~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~Gy~g~~~~C~~ 960 (1031)
|.. +.+. ...|... |.+...+.-. ...|.|++||.+. .++.
T Consensus 137 C~~--~~~~-------g~~C~~~-c~~~~~~~~~--~~~c~c~~g~~g~--~~~~ 177 (316)
T KOG1218|consen 137 CGE--ENLV-------GLKCQRD-CQCTGGCDCK--NGICTCQPGFVGV--FCVE 177 (316)
T ss_pred ccc--cCCC-------CCCccCC-CCCccccCCC--CCceeccCCcccc--cccc
Confidence 653 1111 1222211 1111122212 2378899999998 5553
No 43
>KOG3512|consensus
Probab=90.31 E-value=2.4 Score=48.15 Aligned_cols=79 Identities=23% Similarity=0.502 Sum_probs=57.0
Q ss_pred CCCCC-eeeecCCC-ceeeCCCCCccCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCC-------------------
Q psy13144 104 CGQNA-NCKVINHS-PICRCKAGFTGDPFTYCNRIPPPPPPQEDVPEPVNPCYPSPCGPYS------------------- 162 (1031)
Q Consensus 104 C~~~~-~C~~~~g~-~~C~C~~Gy~G~~~~~C~~~~~~~~~~~~~~~~~d~C~~~~C~~~g------------------- 162 (1031)
|..++ .|+-...+ ++|.|.-.-+|..+..|.++.--.|++...-.++++|....|..++
T Consensus 280 CNgHAs~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~Sgg 359 (592)
T KOG3512|consen 280 CNGHASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGG 359 (592)
T ss_pred ecCccceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccc
Confidence 43333 58876665 9999999999999888988877788888877889999876665443
Q ss_pred ceec----CCCCcee-eCCCCCcCCC
Q psy13144 163 QCRD----INGSPSC-SCLPSYIGSP 183 (1031)
Q Consensus 163 ~C~~----~~g~~~C-~C~~Gy~g~~ 183 (1031)
+|+| +.|. +| .|.+||.-++
T Consensus 360 vClnCrHnTaGr-hChyCreGyyRd~ 384 (592)
T KOG3512|consen 360 VCLNCRHNTAGR-HCHYCREGYYRDG 384 (592)
T ss_pred eEeecccCCCCc-ccccccCccccCC
Confidence 4443 3333 45 4999998553
No 44
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.67 E-value=0.14 Score=37.27 Aligned_cols=21 Identities=38% Similarity=0.744 Sum_probs=17.7
Q ss_pred CceecCCCCceeeCCCCCcCC
Q psy13144 162 SQCRDINGSPSCSCLPSYIGS 182 (1031)
Q Consensus 162 g~C~~~~g~~~C~C~~Gy~g~ 182 (1031)
.+|++++++|+|.|++||++.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-
T ss_pred CCCccCCCceEeECCCCCEEC
Confidence 589999999999999999987
No 45
>smart00051 DSL delta serrate ligand.
Probab=87.85 E-value=0.65 Score=38.59 Aligned_cols=43 Identities=28% Similarity=0.612 Sum_probs=32.7
Q ss_pred eecCCCCccCCcccCCCCCcCCCCCCCCCccccCCCCCceeeecCCCCcCC
Q psy13144 794 CVCLPDYYGDGYVSCGPECILNNDCPSNKACIRNKFNKQAVCSCLPNYFGS 844 (1031)
Q Consensus 794 C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~~~~~~~~C~C~~G~~g~ 844 (1031)
=.|+++|+|..|. ..|...+....+.+|... ..++|.+||+|.
T Consensus 19 v~C~~~~yG~~C~---~~C~~~~d~~~~~~Cd~~-----G~~~C~~Gw~G~ 61 (63)
T smart00051 19 VTCDENYYGEGCN---KFCRPRDDFFGHYTCDEN-----GNKGCLEGWMGP 61 (63)
T ss_pred eeCCCCCcCCccC---CEeCcCccccCCccCCcC-----CCEecCCCCcCC
Confidence 4688999999885 355544556778888643 489999999998
No 46
>smart00051 DSL delta serrate ligand.
Probab=87.04 E-value=0.74 Score=38.26 Aligned_cols=47 Identities=19% Similarity=0.393 Sum_probs=33.8
Q ss_pred ceeeCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13144 943 PSCSCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGYNALCKVINHSPICTCPDGFVGDAF 1013 (1031)
Q Consensus 943 ~~C~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~~~~C~~~~g~~~C~C~~G~~G~~c 1013 (1031)
+.=.|+++|.|. .|...|...+ ....+..|.. .| .++|++||+|..|
T Consensus 17 ~rv~C~~~~yG~--~C~~~C~~~~-------------------d~~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGE--GCNKFCRPRD-------------------DFFGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCC--ccCCEeCcCc-------------------cccCCccCCc-CC--CEecCCCCcCCCC
Confidence 344799999999 7877664322 2345677865 45 7999999999875
No 47
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=84.77 E-value=0.22 Score=36.06 Aligned_cols=32 Identities=31% Similarity=0.729 Sum_probs=22.4
Q ss_pred CCCCCCCCCCceeecC-CCceeeCCCCCcCCCC
Q psy13144 925 CIPSPCGPNSQCRDIN-GSPSCSCLPTFIGAPP 956 (1031)
Q Consensus 925 C~~~~C~~~~~C~~~~-g~~~C~C~~Gy~g~~~ 956 (1031)
|....|..++.|++.. |++.|.|.+||..++.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETT
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCC
Confidence 4455788889998876 9999999999987753
No 48
>KOG3512|consensus
Probab=82.38 E-value=4 Score=46.46 Aligned_cols=118 Identities=20% Similarity=0.518 Sum_probs=69.3
Q ss_pred CceeecCCC-eeeEeCCCCccCCCccccCCC-CCCCCCCCCCcCCCCCCCCCCCCC-------------------cee--
Q psy13144 881 ANCRVINHN-AVCNCKPGFTGEPRIRCSKIP-PPPPPQDVPEYVNPCIPSPCGPNS-------------------QCR-- 937 (1031)
Q Consensus 881 ~~C~~~~g~-~~C~C~~Gy~G~~~~~C~~~~-~~~~~~~~~~~~~~C~~~~C~~~~-------------------~C~-- 937 (1031)
..|+...++ ++|.|..+-.|..-.+|.+.. .-+|......++++|....|..++ +|+
T Consensus 285 s~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnC 364 (592)
T KOG3512|consen 285 SRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNC 364 (592)
T ss_pred ceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCccccccccchhhhhcccchhhhcccCccccceEeec
Confidence 357666555 999999999998444454321 112333334567777655554333 343
Q ss_pred --ecCCCcee-eCCCCCcCCCCCCCCCcccCCCCCCcchhccccccCCCCCCCCC----CCeeeecCCCceeeCCCCCcc
Q psy13144 938 --DINGSPSC-SCLPTFIGAPPNCRPECIQNSECPFDKACIREKCIDPCPGSCGY----NALCKVINHSPICTCPDGFVG 1010 (1031)
Q Consensus 938 --~~~g~~~C-~C~~Gy~g~~~~C~~~C~~~~eC~~~~~C~~~~c~~~c~~~C~~----~~~C~~~~g~~~C~C~~G~~G 1010 (1031)
|+.|. +| .|.+||.-++.. +. .+-..|.. -.|+. +-+|..+.| .|.|++|.+|
T Consensus 365 rHnTaGr-hChyCreGyyRd~s~--pl-~hrkaCk~--------------CdChpVGs~gktCNq~tG--qCpCkeGvtG 424 (592)
T KOG3512|consen 365 RHNTAGR-HCHYCREGYYRDGSK--PL-THRKACKA--------------CDCHPVGSAGKTCNQTTG--QCPCKEGVTG 424 (592)
T ss_pred ccCCCCc-ccccccCccccCCCC--CC-chhhhhhh--------------cCCcccccccccccccCC--cccCCCCCcc
Confidence 23333 44 689999877521 00 01111110 02333 446888888 8999999999
Q ss_pred CCCCCCcC
Q psy13144 1011 DAFSGCYP 1018 (1031)
Q Consensus 1011 ~~c~~c~~ 1018 (1031)
..|++|.+
T Consensus 425 ~tCnrCa~ 432 (592)
T KOG3512|consen 425 LTCNRCAP 432 (592)
T ss_pred cccccccc
Confidence 99999953
No 49
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=81.68 E-value=0.73 Score=33.46 Aligned_cols=31 Identities=26% Similarity=0.606 Sum_probs=21.5
Q ss_pred CCCCCCCCCCeeeecC-CceeeeCCCCcccCC
Q psy13144 597 CQPSPCGPNSQCREVN-HQAVCSCLPNYFGSP 627 (1031)
Q Consensus 597 C~~~~C~~~~~C~~~~-g~y~C~C~~Gy~g~~ 627 (1031)
|....|..++.|++.. |++.|.|..||..++
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 4456788899998865 999999999997664
No 50
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=79.97 E-value=1.8 Score=34.15 Aligned_cols=28 Identities=21% Similarity=0.419 Sum_probs=23.0
Q ss_pred eeeecCCCceeeCCCCCccCCCCCCcCCCC
Q psy13144 992 LCKVINHSPICTCPDGFVGDAFSGCYPKPP 1021 (1031)
Q Consensus 992 ~C~~~~g~~~C~C~~G~~G~~c~~c~~~~~ 1021 (1031)
.|....| +|.|+++|+|..|++|.+...
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~~ 40 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCDRCAPGYY 40 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCCCCCCCCc
Confidence 4776677 899999999999999976443
No 51
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=78.60 E-value=1.1 Score=35.03 Aligned_cols=26 Identities=27% Similarity=0.657 Sum_probs=21.1
Q ss_pred CeeeecCCCceeeCCCCCccCCCCCCcC
Q psy13144 991 ALCKVINHSPICTCPDGFVGDAFSGCYP 1018 (1031)
Q Consensus 991 ~~C~~~~g~~~C~C~~G~~G~~c~~c~~ 1018 (1031)
+.|....| +|.|+++|+|..|++|.+
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~C~~ 36 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQCKP 36 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-EE-T
T ss_pred CcccCCCC--EEeccccccCCcCcCCCC
Confidence 47888777 999999999999998875
No 52
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=77.31 E-value=3.4 Score=32.51 Aligned_cols=25 Identities=40% Similarity=0.995 Sum_probs=21.2
Q ss_pred ceeeCCCCCcCCCCCCcccCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCC
Q psy13144 171 PSCSCLPSYIGSPPNCRPECIQNSECPYDKACINEKCADPCPGFCPPGTTGSP 223 (1031)
Q Consensus 171 ~~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~~ 223 (1031)
.+|.|+++|+|. .|+ +|.+||.+.+
T Consensus 19 G~C~C~~~~~G~--~C~--------------------------~C~~g~~~~~ 43 (50)
T cd00055 19 GQCECKPNTTGR--RCD--------------------------RCAPGYYGLP 43 (50)
T ss_pred CEEeCCCcCCCC--CCC--------------------------CCCCCCccCC
Confidence 379999999999 776 8889998874
No 53
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=72.74 E-value=3 Score=44.42 Aligned_cols=33 Identities=24% Similarity=0.464 Sum_probs=26.8
Q ss_pred CCCCCCCC--CCCCCCCceecCCCCceeeCCCCCcCC
Q psy13144 148 EPVNPCYP--SPCGPYSQCRDINGSPSCSCLPSYIGS 182 (1031)
Q Consensus 148 ~~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~Gy~g~ 182 (1031)
.++++|.. ++|. ..|.++.|+|.|.|++||++.
T Consensus 185 ~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 185 VVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 46788864 3465 589999999999999999875
No 54
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=72.32 E-value=2 Score=33.60 Aligned_cols=22 Identities=27% Similarity=0.801 Sum_probs=16.4
Q ss_pred CceecCCCCceeeCCCCCcCCCCCCc
Q psy13144 162 SQCRDINGSPSCSCLPSYIGSPPNCR 187 (1031)
Q Consensus 162 g~C~~~~g~~~C~C~~Gy~g~~~~C~ 187 (1031)
.+|... ..+|.|+++|+|. .|+
T Consensus 11 ~~C~~~--~G~C~C~~~~~G~--~C~ 32 (49)
T PF00053_consen 11 QTCDPS--TGQCVCKPGTTGP--RCD 32 (49)
T ss_dssp SSEEET--CEEESBSTTEEST--TS-
T ss_pred CcccCC--CCEEeccccccCC--cCc
Confidence 367653 3489999999999 776
No 55
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=71.50 E-value=3.4 Score=44.02 Aligned_cols=32 Identities=19% Similarity=0.480 Sum_probs=26.8
Q ss_pred CCCCCCC--CCCCCCCeecccCCceeeeCCCCCcCC
Q psy13144 38 YTNPCQP--SPCGPNSQCREVNHQAVCSCLPNYFGS 71 (1031)
Q Consensus 38 ~~d~C~~--~~C~~~g~C~~~~g~~~C~C~~Gy~g~ 71 (1031)
++++|.. ++|. ..|++..|+|.|+|++||++.
T Consensus 186 ~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 186 VPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred CchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 6788863 4565 589999999999999999986
No 56
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=69.65 E-value=4.4 Score=31.30 Aligned_cols=27 Identities=22% Similarity=0.513 Sum_probs=22.1
Q ss_pred eeeecCCCceeeCCCCCccCCCCCCcCCC
Q psy13144 992 LCKVINHSPICTCPDGFVGDAFSGCYPKP 1020 (1031)
Q Consensus 992 ~C~~~~g~~~C~C~~G~~G~~c~~c~~~~ 1020 (1031)
.|....| .|.|+++|+|..|++|.+..
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~ 38 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDRCAPGY 38 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCcCCCCc
Confidence 5666666 89999999999999997643
No 57
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=69.01 E-value=5.6 Score=30.70 Aligned_cols=14 Identities=36% Similarity=1.094 Sum_probs=12.6
Q ss_pred eeeCCCCCcCCCCCCc
Q psy13144 172 SCSCLPSYIGSPPNCR 187 (1031)
Q Consensus 172 ~C~C~~Gy~g~~~~C~ 187 (1031)
+|.|+++|+|. .|+
T Consensus 19 ~C~C~~~~~G~--~C~ 32 (46)
T smart00180 19 QCECKPNVTGR--RCD 32 (46)
T ss_pred EEECCCCCCCC--CCC
Confidence 89999999998 776
No 58
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=58.18 E-value=9.6 Score=35.80 Aligned_cols=30 Identities=20% Similarity=0.440 Sum_probs=23.6
Q ss_pred CCCCCCeeeecC--CCceeeCCCCCccCCCCCC
Q psy13144 986 SCGYNALCKVIN--HSPICTCPDGFVGDAFSGC 1016 (1031)
Q Consensus 986 ~C~~~~~C~~~~--g~~~C~C~~G~~G~~c~~c 1016 (1031)
-|-++ .|.-.. ..+.|.|..||+|.+|++=
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCcccccccce
Confidence 46644 888554 6789999999999999843
No 59
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=57.77 E-value=15 Score=28.95 Aligned_cols=19 Identities=37% Similarity=1.095 Sum_probs=17.2
Q ss_pred CCCCCeeeCceeecCCCCc
Q psy13144 783 CVPNAECRDGVCVCLPDYY 801 (1031)
Q Consensus 783 C~~~~~C~~~~C~C~~G~~ 801 (1031)
|..++.|++++|+|++||.
T Consensus 28 C~~~s~C~~g~C~C~~g~~ 46 (52)
T PF01683_consen 28 CIGGSVCVNGRCQCPPGYV 46 (52)
T ss_pred CCCcCEEcCCEeECCCCCE
Confidence 6688999999999999985
No 60
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=57.10 E-value=16 Score=28.90 Aligned_cols=21 Identities=33% Similarity=0.865 Sum_probs=16.0
Q ss_pred CCCCCCeeeecCCCceeeCCCCCcc
Q psy13144 986 SCGYNALCKVINHSPICTCPDGFVG 1010 (1031)
Q Consensus 986 ~C~~~~~C~~~~g~~~C~C~~G~~G 1010 (1031)
.|..++.|++ | +|.|++||+-
T Consensus 27 qC~~~s~C~~--g--~C~C~~g~~~ 47 (52)
T PF01683_consen 27 QCIGGSVCVN--G--RCQCPPGYVE 47 (52)
T ss_pred CCCCcCEEcC--C--EeECCCCCEe
Confidence 4667888944 4 8999999863
No 61
>PHA02887 EGF-like protein; Provisional
Probab=49.51 E-value=16 Score=33.90 Aligned_cols=29 Identities=28% Similarity=0.582 Sum_probs=22.9
Q ss_pred CCCCCCeeeecC--CCceeeCCCCCccCCCCC
Q psy13144 986 SCGYNALCKVIN--HSPICTCPDGFVGDAFSG 1015 (1031)
Q Consensus 986 ~C~~~~~C~~~~--g~~~C~C~~G~~G~~c~~ 1015 (1031)
-|- +|.|.-.. ..+.|.|+.||+|.+|+.
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 466 57888544 468999999999999983
No 62
>PHA02887 EGF-like protein; Provisional
Probab=48.60 E-value=13 Score=34.30 Aligned_cols=32 Identities=25% Similarity=0.537 Sum_probs=25.8
Q ss_pred CCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCccc
Q psy13144 816 NDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTV 854 (1031)
Q Consensus 816 ~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~ 854 (1031)
+-|. ||+|..........|.|+.||+|. +|+.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~------RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGI------RCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccC------CCCc
Confidence 4466 478988776788999999999999 7764
No 63
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=47.36 E-value=16 Score=34.09 Aligned_cols=32 Identities=34% Similarity=0.828 Sum_probs=25.2
Q ss_pred CCCCCC-CCCCCCCCeecccCCceeeeCCCCCcC
Q psy13144 38 YTNPCQ-PSPCGPNSQCREVNHQAVCSCLPNYFG 70 (1031)
Q Consensus 38 ~~d~C~-~~~C~~~g~C~~~~g~~~C~C~~Gy~g 70 (1031)
..|.|+ ...|+.+|.|... .+..|.|.+||.-
T Consensus 76 p~d~Cd~y~~CG~~g~C~~~-~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNSN-NSPKCSCLPGFEP 108 (110)
T ss_pred cccCCCCccccCCccEeCCC-CCCceECCCCcCC
Confidence 346787 4889999999654 4567999999974
No 64
>KOG3516|consensus
Probab=46.72 E-value=14 Score=47.16 Aligned_cols=42 Identities=26% Similarity=0.666 Sum_probs=36.4
Q ss_pred CCCCCCCCCCCCCCCCCCCceecCCCCceeeCC-CCCcCCCCCCc
Q psy13144 144 EDVPEPVNPCYPSPCGPYSQCRDINGSPSCSCL-PSYIGSPPNCR 187 (1031)
Q Consensus 144 ~~~~~~~d~C~~~~C~~~g~C~~~~g~~~C~C~-~Gy~g~~~~C~ 187 (1031)
.|.+..+|.|.+++|.++|.|......|.|.|. .||.|. +|.
T Consensus 539 id~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH 581 (1306)
T KOG3516|consen 539 IDMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH 581 (1306)
T ss_pred ecccccccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence 344456788899999999999998889999999 999999 887
No 65
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=42.03 E-value=17 Score=34.31 Aligned_cols=32 Identities=25% Similarity=0.583 Sum_probs=25.6
Q ss_pred CCCCCCCccccCCCCCceeeecCCCCcCCCCCCCCCccc
Q psy13144 816 NDCPSNKACIRNKFNKQAVCSCLPNYFGSPPACRPECTV 854 (1031)
Q Consensus 816 ~~C~~~g~C~~~~~~~~~~C~C~~G~~g~g~~C~~~C~~ 854 (1031)
+-|.++ .|....+...+.|.|..||+|. +|+.
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGe------RCEh 82 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGI------RCQH 82 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCcccc------cccc
Confidence 346554 8887766788999999999999 8875
No 66
>KOG3516|consensus
Probab=40.47 E-value=23 Score=45.34 Aligned_cols=36 Identities=25% Similarity=0.675 Sum_probs=32.6
Q ss_pred CCCCCCCCCCCCCCCCcccccCCCeeeecC-CCCcCC
Q psy13144 234 PVYTNPCQPSPCGPNSQCREVNHQAVCSCL-PNYFGS 269 (1031)
Q Consensus 234 ~~~~d~C~~~~C~~~g~C~~~~g~y~C~C~-~Gy~g~ 269 (1031)
|.-+|.|.+++|..+|.|......|.|.|. .||+|.
T Consensus 542 C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 542 CGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred cccccccCCccccCCCcccccccceeEeccccccccc
Confidence 344688999999999999999999999999 899998
No 67
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=38.43 E-value=28 Score=32.41 Aligned_cols=33 Identities=42% Similarity=0.983 Sum_probs=25.6
Q ss_pred CCCCCC-CCCCCCCCceecCCCCceeeCCCCCcCC
Q psy13144 149 PVNPCY-PSPCGPYSQCRDINGSPSCSCLPSYIGS 182 (1031)
Q Consensus 149 ~~d~C~-~~~C~~~g~C~~~~g~~~C~C~~Gy~g~ 182 (1031)
..|.|. ...|+.++.|.. ..+..|.|.+||.-.
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 467887 478999999954 456689999999753
No 68
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=37.08 E-value=25 Score=25.10 Aligned_cols=21 Identities=14% Similarity=-0.007 Sum_probs=15.6
Q ss_pred CccccCCCceeeCCCCCCCCC
Q psy13144 5 DTKINTYEVFYSCPPGTTGSP 25 (1031)
Q Consensus 5 ~~c~~~~~~~C~C~~G~~g~~ 25 (1031)
|.|.....+.|.|+.||..+.
T Consensus 10 A~CDpn~~~~C~CPeGyIlde 30 (34)
T PF09064_consen 10 ADCDPNSPGQCFCPEGYILDE 30 (34)
T ss_pred CccCCCCCCceeCCCceEecC
Confidence 445555566799999999774
No 69
>KOG3514|consensus
Probab=36.33 E-value=23 Score=44.60 Aligned_cols=36 Identities=28% Similarity=0.679 Sum_probs=32.1
Q ss_pred CCCCCCCCCCCceeecCCCceeeCC-CCCcCCCCCCCCC
Q psy13144 924 PCIPSPCGPNSQCRDINGSPSCSCL-PTFIGAPPNCRPE 961 (1031)
Q Consensus 924 ~C~~~~C~~~~~C~~~~g~~~C~C~-~Gy~g~~~~C~~~ 961 (1031)
.|.++||+|+++|....+.|.|.|. .||.|. .|+++
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~CerE 661 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCERE 661 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccce
Confidence 7899999999999999999999998 679888 78753
No 70
>KOG3514|consensus
Probab=31.37 E-value=32 Score=43.45 Aligned_cols=35 Identities=23% Similarity=0.622 Sum_probs=32.1
Q ss_pred CCCCCCCCCCCceecCCCCceeeCC-CCCcCCCCCCcc
Q psy13144 152 PCYPSPCGPYSQCRDINGSPSCSCL-PSYIGSPPNCRP 188 (1031)
Q Consensus 152 ~C~~~~C~~~g~C~~~~g~~~C~C~-~Gy~g~~~~C~~ 188 (1031)
.|.++||.|+|+|.....+|.|.|. .||.|. +|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccc
Confidence 6899999999999999999999996 699999 9984
No 71
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=26.47 E-value=20 Score=29.86 Aligned_cols=44 Identities=30% Similarity=0.577 Sum_probs=20.3
Q ss_pred eeecCCCCccCCcccCCCCCcCCCCCCCCCccccCCCCCceeeecCCCCcCC
Q psy13144 793 VCVCLPDYYGDGYVSCGPECILNNDCPSNKACIRNKFNKQAVCSCLPNYFGS 844 (1031)
Q Consensus 793 ~C~C~~G~~G~~c~~~~~~C~~~~~C~~~g~C~~~~~~~~~~C~C~~G~~g~ 844 (1031)
.-+|.+.|+|..|.. .|...+.=..+-+|... ++=+|.+||+|.
T Consensus 18 rv~C~~nyyG~~C~~---~C~~~~d~~ghy~Cd~~-----G~~~C~~Gw~G~ 61 (63)
T PF01414_consen 18 RVVCDENYYGPNCSK---FCKPRDDSFGHYTCDSN-----GNKVCLPGWTGP 61 (63)
T ss_dssp -----TTEETTTT-E---E---EEETTEEEEE-SS-------EEE-TTEEST
T ss_pred EEECCCCCCCccccC---CcCCCcCCcCCcccCCC-----CCCCCCCCCcCC
Confidence 356889999998852 33322222334456643 255789999998
No 72
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=20.46 E-value=3e+02 Score=32.04 Aligned_cols=41 Identities=24% Similarity=0.703 Sum_probs=22.8
Q ss_pred eCCCCCcCCCCCCcccCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCC
Q psy13144 174 SCLPSYIGSPPNCRPECIQNSECPYDKACINEKCADPCPGFCPPGTTGS 222 (1031)
Q Consensus 174 ~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~n~~~~~~C~~~C~~G~~g~ 222 (1031)
.|++||+..+..|. .|. ..| .+|.+. ....|. .|++||...
T Consensus 95 ~C~~G~y~~~~~C~-~C~--~~C---~~C~~~-~~~~Ct-~C~~g~~L~ 135 (397)
T PF03302_consen 95 ECPDGYYKNGNKCV-PCH--ESC---ATCSGG-APNQCT-SCKPGKVLK 135 (397)
T ss_pred CCCCCccccCCCCC-CCC--ccc---cccCCC-CCCCCc-ccCCCcccc
Confidence 79999997655665 111 111 133322 223343 889998766
Done!