Query psy13146
Match_columns 895
No_of_seqs 525 out of 3019
Neff 8.2
Searched_HMMs 46136
Date Fri Aug 16 20:44:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13146.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13146hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4289|consensus 99.7 2E-17 4.4E-22 193.3 17.2 143 9-219 1172-1316(2531)
2 KOG1217|consensus 99.7 6.4E-16 1.4E-20 180.9 23.1 318 17-476 90-422 (487)
3 KOG1217|consensus 99.7 1.5E-14 3.3E-19 169.3 29.6 312 128-568 90-421 (487)
4 KOG1214|consensus 99.7 1.8E-15 3.9E-20 170.4 16.6 278 33-368 635-947 (1289)
5 KOG4289|consensus 99.6 4.5E-15 9.7E-20 174.1 18.9 99 199-332 1216-1316(2531)
6 KOG1214|consensus 99.5 8.7E-14 1.9E-18 157.0 15.9 197 132-375 699-911 (1289)
7 KOG1219|consensus 99.4 5.3E-13 1.1E-17 162.2 7.6 112 16-164 3864-3977(4289)
8 KOG1219|consensus 99.3 2.4E-12 5.3E-17 156.6 8.1 108 128-272 3865-3974(4289)
9 KOG0994|consensus 99.3 4.8E-11 1E-15 139.0 15.3 205 518-791 878-1113(1758)
10 KOG0994|consensus 99.2 2.9E-10 6.3E-15 132.7 18.0 58 657-716 1078-1146(1758)
11 KOG1225|consensus 99.1 2.5E-10 5.5E-15 129.4 12.5 131 524-714 233-365 (525)
12 KOG1225|consensus 99.0 1.6E-09 3.5E-14 123.0 13.9 216 547-861 150-365 (525)
13 KOG4260|consensus 98.7 2.4E-08 5.3E-13 100.7 6.4 148 79-272 150-306 (350)
14 KOG4260|consensus 98.5 1.4E-07 3E-12 95.4 5.4 134 698-860 123-306 (350)
15 KOG1226|consensus 98.3 5.9E-06 1.3E-10 95.7 12.8 144 545-718 465-622 (783)
16 KOG1836|consensus 98.2 7E-05 1.5E-09 96.5 21.0 63 753-820 953-1019(1705)
17 KOG1226|consensus 98.2 8.7E-06 1.9E-10 94.3 11.7 146 596-777 468-622 (783)
18 KOG1836|consensus 98.0 0.00023 5.1E-09 91.8 19.5 108 206-333 696-813 (1705)
19 PF07645 EGF_CA: Calcium-bindi 97.9 3.2E-06 7E-11 63.2 1.2 34 126-159 1-36 (42)
20 PF07645 EGF_CA: Calcium-bindi 97.9 6.8E-06 1.5E-10 61.5 2.4 34 15-48 1-36 (42)
21 PF00008 EGF: EGF-like domain 97.9 7.8E-06 1.7E-10 57.1 2.0 30 19-48 1-31 (32)
22 smart00179 EGF_CA Calcium-bind 97.7 4.5E-05 9.8E-10 55.8 3.6 34 126-159 1-36 (39)
23 smart00179 EGF_CA Calcium-bind 97.6 5.3E-05 1.1E-09 55.5 3.7 34 15-48 1-36 (39)
24 PF00008 EGF: EGF-like domain 97.6 3.8E-05 8.2E-10 53.6 2.3 31 743-773 1-32 (32)
25 PF06247 Plasmod_Pvs28: Plasmo 97.5 1.2E-05 2.5E-10 78.1 -1.7 148 23-217 7-163 (197)
26 PF12947 EGF_3: EGF domain; I 97.5 4.7E-05 1E-09 54.5 1.3 31 831-864 6-36 (36)
27 PF12947 EGF_3: EGF domain; I 97.3 0.00012 2.5E-09 52.5 2.0 30 22-51 6-35 (36)
28 cd00054 EGF_CA Calcium-binding 97.2 0.00033 7.2E-09 50.6 3.6 33 127-159 2-35 (38)
29 cd00054 EGF_CA Calcium-binding 97.1 0.0005 1.1E-08 49.7 3.7 34 15-48 1-35 (38)
30 PF06247 Plasmod_Pvs28: Plasmo 96.9 0.00024 5.2E-09 69.1 0.4 140 407-575 7-164 (197)
31 PF12662 cEGF: Complement Clr- 96.9 0.00077 1.7E-08 43.3 2.4 24 147-172 1-24 (24)
32 cd00053 EGF Epidermal growth f 96.6 0.0024 5.2E-08 45.3 3.6 30 19-48 2-32 (36)
33 PF12662 cEGF: Complement Clr- 96.6 0.0018 3.9E-08 41.6 2.4 14 93-106 1-14 (24)
34 smart00181 EGF Epidermal growt 96.4 0.0042 9.1E-08 44.2 3.5 30 18-48 1-31 (35)
35 smart00181 EGF Epidermal growt 96.2 0.0046 1E-07 43.9 3.2 28 130-158 2-30 (35)
36 cd00053 EGF Epidermal growth f 96.1 0.0082 1.8E-07 42.5 3.9 30 649-678 5-35 (36)
37 PF07974 EGF_2: EGF-like domai 94.9 0.03 6.6E-07 38.9 3.0 27 650-678 6-32 (32)
38 KOG1218|consensus 94.7 1.7 3.7E-05 47.8 18.4 160 523-716 13-176 (316)
39 KOG1218|consensus 94.5 4.3 9.2E-05 44.7 20.8 160 420-618 14-174 (316)
40 PF14670 FXa_inhibition: Coagu 94.3 0.018 4E-07 41.2 0.9 21 139-159 10-30 (36)
41 PF07974 EGF_2: EGF-like domai 94.2 0.044 9.5E-07 38.2 2.6 24 692-715 7-32 (32)
42 PF14670 FXa_inhibition: Coagu 93.0 0.062 1.4E-06 38.5 1.8 23 24-48 8-30 (36)
43 PF12661 hEGF: Human growth fa 92.7 0.049 1.1E-06 29.7 0.7 13 206-218 1-13 (13)
44 PF12661 hEGF: Human growth fa 92.5 0.047 1E-06 29.8 0.5 11 667-677 2-12 (13)
45 PF12946 EGF_MSP1_1: MSP1 EGF 90.8 0.088 1.9E-06 37.5 0.5 31 19-49 2-33 (37)
46 cd01475 vWA_Matrilin VWA_Matri 88.2 0.38 8.2E-06 50.4 3.2 37 121-159 181-219 (224)
47 KOG3512|consensus 88.1 3.1 6.8E-05 46.6 10.0 158 195-374 284-476 (592)
48 smart00051 DSL delta serrate l 87.7 0.69 1.5E-05 37.8 3.6 47 664-715 16-63 (63)
49 PF12946 EGF_MSP1_1: MSP1 EGF 86.0 0.2 4.4E-06 35.8 -0.3 34 790-823 2-36 (37)
50 smart00051 DSL delta serrate l 85.2 1.1 2.4E-05 36.6 3.6 48 147-218 16-63 (63)
51 cd00055 EGF_Lam Laminin-type e 82.5 1.6 3.5E-05 33.8 3.4 21 657-679 13-33 (50)
52 cd01475 vWA_Matrilin VWA_Matri 81.9 1.2 2.7E-05 46.5 3.4 37 234-272 181-219 (224)
53 PF00053 Laminin_EGF: Laminin 81.1 0.98 2.1E-05 34.8 1.7 22 656-679 11-32 (49)
54 KOG3512|consensus 80.7 11 0.00023 42.6 9.9 93 524-618 370-476 (592)
55 smart00180 EGF_Lam Laminin-typ 77.1 2.7 5.8E-05 32.0 3.0 21 657-679 12-32 (46)
56 smart00180 EGF_Lam Laminin-typ 73.3 5.3 0.00012 30.3 3.8 28 753-782 12-39 (46)
57 cd00055 EGF_Lam Laminin-type e 72.6 4.1 9E-05 31.5 3.1 28 753-782 13-40 (50)
58 PF00053 Laminin_EGF: Laminin 68.9 3.3 7.2E-05 31.8 1.8 29 752-782 11-39 (49)
59 PF01683 EB: EB module; Inter 64.4 8.6 0.00019 29.9 3.4 22 596-617 27-48 (52)
60 PHA02887 EGF-like protein; Pro 62.0 7.2 0.00016 35.5 2.8 29 651-680 93-123 (126)
61 PF01683 EB: EB module; Inter 60.7 7.8 0.00017 30.1 2.5 31 826-863 21-51 (52)
62 PF01414 DSL: Delta serrate li 56.6 3.5 7.6E-05 33.7 -0.1 48 147-218 16-63 (63)
63 KOG3516|consensus 52.8 11 0.00023 47.4 3.0 42 121-164 539-581 (1306)
64 PHA03099 epidermal growth fact 50.7 12 0.00026 34.7 2.3 32 650-682 51-84 (139)
65 PF00954 S_locus_glycop: S-loc 49.9 14 0.00031 33.7 2.8 33 15-48 76-109 (110)
66 PHA02887 EGF-like protein; Pro 47.1 13 0.00029 33.8 2.0 30 303-333 92-123 (126)
67 PHA03099 epidermal growth fact 45.7 16 0.00034 34.0 2.2 30 303-333 51-82 (139)
68 KOG3516|consensus 39.5 21 0.00046 44.9 2.8 43 233-277 538-581 (1306)
69 PF00954 S_locus_glycop: S-loc 38.2 27 0.00059 31.9 2.7 26 79-105 84-109 (110)
70 KOG3514|consensus 32.1 30 0.00066 43.0 2.4 35 129-165 625-660 (1591)
71 KOG3514|consensus 28.9 36 0.00077 42.4 2.3 31 18-48 625-656 (1591)
72 PF12955 DUF3844: Domain of un 21.7 51 0.0011 29.8 1.4 25 79-103 13-42 (103)
73 PF09064 Tme5_EGF_like: Thromb 21.4 59 0.0013 23.0 1.3 13 761-773 18-30 (34)
No 1
>KOG4289|consensus
Probab=99.74 E-value=2e-17 Score=193.30 Aligned_cols=143 Identities=28% Similarity=0.685 Sum_probs=108.3
Q ss_pred cccCCCCCCCCCCCCCCCCCceeecCCceeeecCCCcccCCCCCcCCcccCCCCCCcccccCCcccCCCCCCCCCCCcee
Q psy13146 9 IQYEPVYTNPCQPSPCGPNSQCREVNKQAVCSCLPNYFGSPPACRPECTVNSDCPLNKACFNQKCVDPCPGTCGQNANCK 88 (895)
Q Consensus 9 ~~~~~~~~d~C~~~~C~~~~~C~~~~g~~~C~C~~Gf~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~ 88 (895)
+|..+.|.+.|+..||.|..+|+.+.. |.++.+.=. .... -.+=+
T Consensus 1172 l~VlpfdDniClrEPCenymkCvsvlr---------Fdssapf~~---s~s~-----------------------lfRpi 1216 (2531)
T KOG4289|consen 1172 LRVLPFDDNICLREPCENYMKCVSVLR---------FDSSAPFLA---SDSV-----------------------LFRPI 1216 (2531)
T ss_pred eeeeeccCchhhcchhHHHHhhhhhee---------ecccCcccc---ccce-----------------------eeeec
Confidence 567788999999999999999987532 433321000 0000 01113
Q ss_pred ccCCCCeeecCCCCcCCCCcccccCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccCCCCeEeCCCCCcCCCCCCCCCCC
Q psy13146 89 VQNHNPICNCKPGYTGDPRVYCNKIPPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPECV 168 (895)
Q Consensus 89 ~~~g~y~C~C~~Gy~g~~~~~C~~i~~~~~~~~~~~~dideC~~~~C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~~~C~ 168 (895)
+..++++|+|++||+|+. |+ +.||+|.+.||+++|+|....|+|+|.|.+||+|. +|+.
T Consensus 1217 ~pvnglrCrCPpGFTgd~---Ce-------------TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe--hCEv--- 1275 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDY---CE-------------TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE--HCEV--- 1275 (2531)
T ss_pred cccCceeEeCCCCCCccc---cc-------------chhHhhhcCCCCCCCceEEecCceeEEecCCcccc--ceee---
Confidence 456789999999999997 97 78999999999999999999999999999999999 8872
Q ss_pred CCCCCCCCCccccCCcCCCCCCCCCCCCeeeec-CCCceeeCCCC-CccCCCc
Q psy13146 169 QNNDCSNDKACINEKCQDPCPGSCGYNALCKVI-NHTPICTCPDG-YTGDAFS 219 (895)
Q Consensus 169 d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~-~g~~~C~C~~G-y~G~~c~ 219 (895)
+. ..+.|+ |+.|.++|+|++. .|+|.|+|+.| |++..|+
T Consensus 1276 s~----~agrCv--------pGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1276 SA----RAGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred ec----ccCccc--------cceecCCCEEeecCCCceeccCCCcccCCCceE
Confidence 21 234555 4578889999987 57899999998 5666665
No 2
>KOG1217|consensus
Probab=99.71 E-value=6.4e-16 Score=180.92 Aligned_cols=318 Identities=28% Similarity=0.664 Sum_probs=212.0
Q ss_pred CCCCCCCCCCCCceeecCCceeeecCCCcccCCCCCcCCcccCCCCCCcccccCCcccCCCCCCCCCCCceecc---CCC
Q psy13146 17 NPCQPSPCGPNSQCREVNKQAVCSCLPNYFGSPPACRPECTVNSDCPLNKACFNQKCVDPCPGTCGQNANCKVQ---NHN 93 (895)
Q Consensus 17 d~C~~~~C~~~~~C~~~~g~~~C~C~~Gf~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~~~---~g~ 93 (895)
+++...+....+.+.....+|.|.|++||.|.. ++... .|...+ ..+...+.|.+. ...
T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~~------~~~~~------~C~~~~------~~~~~~~~c~~~~~~~~~ 151 (487)
T KOG1217|consen 90 PPCRSPCLLLCGECVDCVGSYECTCPPGYQGTP------CEGEC------ECVTGP------GVCCIDGSCSNGPGSVGP 151 (487)
T ss_pred ccccCCcccCCccccCCCCCceeeCCCccccCc------CCcce------eecCCC------CCeeCchhhcCCCCCCCc
Confidence 444455555667777788899999999999983 22211 122111 112334566654 468
Q ss_pred CeeecCCCCcCCCCcccccCCCCCCCCCCCCCCCCCCC--CCCCCCCCcccccCCCCeEeCCCCCcCCCCCCCCC-----
Q psy13146 94 PICNCKPGYTGDPRVYCNKIPPRPPPQEDVPEPVNPCY--PSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPE----- 166 (895)
Q Consensus 94 y~C~C~~Gy~g~~~~~C~~i~~~~~~~~~~~~dideC~--~~~C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~~~----- 166 (895)
|.|.|..||.+.. +. .+.++|. .++|.+++.|.+..++|.|.|++||.+. .++..
T Consensus 152 ~~c~C~~g~~~~~---~~-------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~--~~~~~~~~~~ 213 (487)
T KOG1217|consen 152 FRCSCTEGYEGEP---CE-------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS--TCETTGNGGT 213 (487)
T ss_pred eeeeeCCCccccc---cc-------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC--cCcCCCCCce
Confidence 9999999999997 64 3447898 4569999999999999999999999998 55532
Q ss_pred CCCCCCCCCCCccccCCcCCCCCCCCCCC-CeeeecCCCceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy13146 167 CVQNNDCSNDKACINEKCQDPCPGSCGYN-ALCKVINHTPICTCPDGYTGDAFSGCYPKPPEPPPPPQEDIPEPINPCYP 245 (895)
Q Consensus 167 C~d~~~C~~~~~C~~~~C~~~C~~~C~~~-g~C~n~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~dideC~~ 245 (895)
|.+...|.....-....|... ...|... ++|++..++|+|+|++||++..+. .++++++|+.
T Consensus 214 c~~~~~~~~~~g~~~~~c~~~-~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~----------------~~~~~~~C~~ 276 (487)
T KOG1217|consen 214 CVDSVACSCPPGARGPECEVS-IVECASGDGTCVNTVGSYTCRCPEGYTGDACV----------------TCVDVDSCAL 276 (487)
T ss_pred EecceeccCCCCCCCCCcccc-cccccCCCCcccccCCceeeeCCCCccccccc----------------eeeeccccCC
Confidence 221100000000000011100 1223333 899999999999999999998731 0478999997
Q ss_pred CC-CCCCCceecCCCCCeeeCCCCCcCCCCCCCccccCCCCCCCcccccccCcCCCCCCCCCCCCee--eccCCCCcccC
Q psy13146 246 SP-CGPYSQCRDINGSPSCSCLPSYIGAPPNCRPECIQNSECPYDKACINEKCADPCPGSCGYGAVC--TVINHSPICTC 322 (895)
Q Consensus 246 ~~-C~~~g~C~n~~gsy~C~C~~G~~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~g~C--~~~~g~~~C~C 322 (895)
.+ |.++++|++..++|.|.|++||.|. .+ ..+.++.+|..... ..+|.++++| ....+.+.|.|
T Consensus 277 ~~~c~~~~~C~~~~~~~~C~C~~g~~g~--~~-~~~~~~~~C~~~~~----------~~~c~~g~~C~~~~~~~~~~C~c 343 (487)
T KOG1217|consen 277 IASCPNGGTCVNVPGSYRCTCPPGFTGR--LC-TECVDVDECSPRNA----------GGPCANGGTCNTLGSFGGFRCAC 343 (487)
T ss_pred CCccCCCCeeecCCCcceeeCCCCCCCC--CC-cccccccccccccc----------CCcCCCCcccccCCCCCCCCcCC
Confidence 54 9999999999999999999999999 55 34555566642210 3457777788 33445678888
Q ss_pred CCCcccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCeeeCceeeeCCCcccCCcccCCCccccCCCCCCCcccccCCcCCC
Q psy13146 323 PEGYIGDAFSSCYPKPPEPVQPVIQEDTCNCAPNAECRDGVCLCLPDYYGDGYVSCRPECVQNSDCPRNKACIKLKCKNP 402 (895)
Q Consensus 323 ~~Gy~G~~c~~c~~~~~~~~~~c~~~~~C~C~~~g~C~~~~C~C~~Gy~G~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~ 402 (895)
..||.|..|+. . .++
T Consensus 344 ~~~~~g~~C~~--------------~---------------------------------------------------~~~ 358 (487)
T KOG1217|consen 344 GPGFTGRRCED--------------S---------------------------------------------------NDE 358 (487)
T ss_pred CCCCCCCcccc--------------C---------------------------------------------------Ccc
Confidence 88888877652 0 013
Q ss_pred CCCCCCCCCcEEee-cCCceeeeCCCCCcCCCCccccccCCCCCCCCCCCCCCCCCCCcccccCCceeeccCCCC
Q psy13146 403 CVPGTCGEGAICDV-VNHNVMCICPPGTTGSPFIQCKPILQEPVYTNPCQPSPCGPNSQCREVNKQAVCSCLPNY 476 (895)
Q Consensus 403 C~~~~C~~~~~C~~-~~g~y~C~C~~Gy~G~~~~~C~~~~~~~~~~~eC~~~~C~~~g~C~~~~g~y~C~C~~Gy 476 (895)
|...++..++.|++ ..++|.|.|+.+|.+....... ...++++|.. .+.|++..++|.|. ++ +
T Consensus 359 C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~----~~~~~~~c~~-----~~~c~~~~~~~~c~-~~-~ 422 (487)
T KOG1217|consen 359 CASSPCCPGGTCVNETPGSYRCACPAGFAGKANGDGV----GCEDIDECSG-----CGDCVNGPGGGACT-PP-G 422 (487)
T ss_pred ccCCccccCCEeccCCCCCeEecCCCccccCCccccc----cccccccccC-----CcceeccCCCCccc-cC-c
Confidence 44445667788988 7899999999999984100011 1134566654 56788889999999 87 5
No 3
>KOG1217|consensus
Probab=99.68 E-value=1.5e-14 Score=169.33 Aligned_cols=312 Identities=29% Similarity=0.693 Sum_probs=217.7
Q ss_pred CCCCCCCCCCCCcccccCCCCeEeCCCCCcCCCCCCCCCCCCCCCCCCCCccccCCcCCCCCCCCCCCCeeeec---CCC
Q psy13146 128 NPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVI---NHT 204 (895)
Q Consensus 128 deC~~~~C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~~~C~d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~---~g~ 204 (895)
+.+...+....+.+....++|.|.|++||.|. .++.. .+|.... ..+...+.|.+. ...
T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~c~c~~g~~~~--~~~~~----~~C~~~~------------~~~~~~~~c~~~~~~~~~ 151 (487)
T KOG1217|consen 90 PPCRSPCLLLCGECVDCVGSYECTCPPGYQGT--PCEGE----CECVTGP------------GVCCIDGSCSNGPGSVGP 151 (487)
T ss_pred ccccCCcccCCccccCCCCCceeeCCCccccC--cCCcc----eeecCCC------------CCeeCchhhcCCCCCCCc
Confidence 34444444455777888899999999999998 55531 0232221 112234555554 458
Q ss_pred ceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCceecCCCCCeeeCCCCCcCCCCCCCccccC
Q psy13146 205 PICTCPDGYTGDAFSGCYPKPPEPPPPPQEDIPEPINPCY--PSPCGPYSQCRDINGSPSCSCLPSYIGAPPNCRPECIQ 282 (895)
Q Consensus 205 ~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~dideC~--~~~C~~~g~C~n~~gsy~C~C~~G~~g~~~~C~~~C~~ 282 (895)
|.|+|..||.+..+. .+.++|. ..+|.+++.|.+..++|.|.|++||.+. .++..
T Consensus 152 ~~c~C~~g~~~~~~~------------------~~~~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~--~~~~~--- 208 (487)
T KOG1217|consen 152 FRCSCTEGYEGEPCE------------------TDLDECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGS--TCETT--- 208 (487)
T ss_pred eeeeeCCCccccccc------------------ccccccccCCCCcCCCcccccCCCCeeEeCCCCccCC--cCcCC---
Confidence 999999999999884 3447897 4569999999999999999999999998 54411
Q ss_pred CCCCCCcccccccCcCCCCCCCCCCCCeeeccCCCCcccCCCCcccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCeeeC-
Q psy13146 283 NSECPYDKACINEKCADPCPGSCGYGAVCTVINHSPICTCPEGYIGDAFSSCYPKPPEPVQPVIQEDTCNCAPNAECRD- 361 (895)
Q Consensus 283 ~~eC~~~~~C~~~~C~~~C~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~c~~c~~~~~~~~~~c~~~~~C~C~~~g~C~~- 361 (895)
.+++.|++. +.|.+.+||.+..+.. .+.++.-.+ ++|++
T Consensus 209 -----------------------~~~~~c~~~---~~~~~~~g~~~~~c~~-------------~~~~~~~~~-~~c~~~ 248 (487)
T KOG1217|consen 209 -----------------------GNGGTCVDS---VACSCPPGARGPECEV-------------SIVECASGD-GTCVNT 248 (487)
T ss_pred -----------------------CCCceEecc---eeccCCCCCCCCCccc-------------ccccccCCC-Cccccc
Confidence 234567654 7899999999888762 233322223 67765
Q ss_pred ---ceeeeCCCcccCCcccCCCccccCCCCCCCcccccCCcCCCCCCCC-CCCCcEEeecCCceeeeCCCCCcCCCCccc
Q psy13146 362 ---GVCLCLPDYYGDGYVSCRPECVQNSDCPRNKACIKLKCKNPCVPGT-CGEGAICDVVNHNVMCICPPGTTGSPFIQC 437 (895)
Q Consensus 362 ---~~C~C~~Gy~G~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~~-C~~~~~C~~~~g~y~C~C~~Gy~G~~~~~C 437 (895)
++|.|++||.+... ..+++ +++|.... |.++++|++..++|.|.|++||+|.. |
T Consensus 249 ~~~~~C~~~~g~~~~~~----------------~~~~~---~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~---~ 306 (487)
T KOG1217|consen 249 VGSYTCRCPEGYTGDAC----------------VTCVD---VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRL---C 306 (487)
T ss_pred CCceeeeCCCCcccccc----------------ceeee---ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCC---C
Confidence 47999999999852 01111 56777553 88899999999999999999999998 5
Q ss_pred cccCCCCCCCCCCC----CCCCCCCCcc--cccCCceeeccCCCCcCCCCCCCCCCccCCCCCCCccccCCcccCCCCCC
Q psy13146 438 KPILQEPVYTNPCQ----PSPCGPNSQC--REVNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKACVNQKCVDPCPGS 511 (895)
Q Consensus 438 ~~~~~~~~~~~eC~----~~~C~~~g~C--~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~ 511 (895)
. ...+..+|. ..+|.++++| .+..+.|.|.|..||.|. .|+... ..|.. ..
T Consensus 307 ~----~~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~------~C~~~~-----~~C~~--------~~ 363 (487)
T KOG1217|consen 307 T----ECVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGR------RCEDSN-----DECAS--------SP 363 (487)
T ss_pred c----cccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCC------ccccCC-----ccccC--------Cc
Confidence 1 113456774 4578888899 445568899999999888 676543 12322 24
Q ss_pred CCCCCeeee-eCCCeeeeCCCCCcCC-C--CccccCCCCCCCCCCCeeeecCCcccccCCC
Q psy13146 512 CGQNANCRV-INHNAVCNCKPGFTGE-P--RIRCSKIPPRSCGYNAECKVINHTPICTCPQ 568 (895)
Q Consensus 512 C~~~g~C~~-~~g~~~C~C~~Gy~G~-~--~~~C~~~~~~~C~~~g~C~~~~gs~~C~C~~ 568 (895)
+..++.|++ ..++|.|.|+.+|.+. . ...+.+ ...|...+.|++..+++.|. +.
T Consensus 364 ~~~~~~c~~~~~~~~~c~~~~~~~~~~~~~~~~~~~--~~~c~~~~~c~~~~~~~~c~-~~ 421 (487)
T KOG1217|consen 364 CCPGGTCVNETPGSYRCACPAGFAGKANGDGVGCED--IDECSGCGDCVNGPGGGACT-PP 421 (487)
T ss_pred cccCCEeccCCCCCeEecCCCccccCCccccccccc--cccccCCcceeccCCCCccc-cC
Confidence 677899998 7899999999999984 2 122222 23343356788888888888 77
No 4
>KOG1214|consensus
Probab=99.65 E-value=1.8e-15 Score=170.35 Aligned_cols=278 Identities=25% Similarity=0.521 Sum_probs=181.5
Q ss_pred cCCceeeecCCCcc--cCCCCCc-C---------CcccCCCCCCcccccCC----cccCCC---CCCCCCCCceeccC-C
Q psy13146 33 VNKQAVCSCLPNYF--GSPPACR-P---------ECTVNSDCPLNKACFNQ----KCVDPC---PGTCGQNANCKVQN-H 92 (895)
Q Consensus 33 ~~g~~~C~C~~Gf~--g~~~~C~-~---------~C~~~~~C~~~~~C~~~----~C~~~C---~~~C~~~g~C~~~~-g 92 (895)
+.+-++|.+.+-|. +..+++. | |+....++..-..++.. .=++|| ++.|..++.|.... -
T Consensus 635 ~ityq~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~igpV~E~S~~~~~npCy~gsh~cdt~a~C~pg~~~ 714 (1289)
T KOG1214|consen 635 NITYQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQIGPVKEDSDPTPVNPCYDGSHMCDTTARCHPGTGV 714 (1289)
T ss_pred cceeEEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhcccceecCCCCcccccceecCcccCCCccccCCCCc
Confidence 55677899998885 4433332 1 33333223222222211 114555 67888889998754 5
Q ss_pred CCeeecCCCCcCCCCcccccCCCCCCCCCCCCCCCCCCCC--CCCCCCCcccccCCCCeEeCCCCCcCCCCCCCCCCCCC
Q psy13146 93 NPICNCKPGYTGDPRVYCNKIPPRPPPQEDVPEPVNPCYP--SPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCRPECVQN 170 (895)
Q Consensus 93 ~y~C~C~~Gy~g~~~~~C~~i~~~~~~~~~~~~dideC~~--~~C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~~~C~d~ 170 (895)
.|+|.|..||.|+.+. | .|++||++ +.|+.+++|+|.+|+|+|.|..||... ....+|+.+
T Consensus 715 ~~tcecs~g~~gdgr~-c--------------~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~--dd~~tCV~i 777 (1289)
T KOG1214|consen 715 DYTCECSSGYQGDGRN-C--------------VDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFA--DDRHTCVLI 777 (1289)
T ss_pred ceEEEEeeccCCCCCC-C--------------CChhhhccCCCCCCCCceeecCCCceeEEEeecceec--cCCcceEEe
Confidence 6999999999998754 5 68899995 459999999999999999999998765 222234322
Q ss_pred CCCCCCCccccCCcCCCCCCCCCCCCe--eeecC-CCceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy13146 171 NDCSNDKACINEKCQDPCPGSCGYNAL--CKVIN-HTPICTCPDGYTGDAFSGCYPKPPEPPPPPQEDIPEPINPCYPSP 247 (895)
Q Consensus 171 ~~C~~~~~C~~~~C~~~C~~~C~~~g~--C~n~~-g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~dideC~~~~ 247 (895)
-.=...+.|.++ ++.|..++. |+... ++|+|+|.+||.|+.-. +.|+|||.++.
T Consensus 778 ~~pap~n~Ce~g------~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~-----------------c~dvDeC~psr 834 (1289)
T KOG1214|consen 778 TPPAPANPCEDG------SHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQ-----------------CTDVDECSPSR 834 (1289)
T ss_pred cCCCCCCccccC------ccccCcCCceEEEecCCceEEEeecCCccCCccc-----------------cccccccCccc
Confidence 211222333333 577766554 44443 47999999999999863 68899999999
Q ss_pred CCCCCceecCCCCCeeeCCCCCcCCCCCCCccccCCCCCCCcccccccCcCCCCCCCCCCCCee---eccCCCCcccCCC
Q psy13146 248 CGPYSQCRDINGSPSCSCLPSYIGAPPNCRPECIQNSECPYDKACINEKCADPCPGSCGYGAVC---TVINHSPICTCPE 324 (895)
Q Consensus 248 C~~~g~C~n~~gsy~C~C~~G~~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~g~C---~~~~g~~~C~C~~ 324 (895)
|...++|.|++|+|.|+|.+||.|+++.|.+.=.....|..... . +-.|+.+..| ++. ..|.+.+.+
T Consensus 835 Chp~A~CyntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er~---h------pl~chg~t~~~~~~Dp-~~~e~p~~~ 904 (1289)
T KOG1214|consen 835 CHPAATCYNTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQERF---H------PLQCHGSTGFCWCVDP-DGHEVPGTQ 904 (1289)
T ss_pred cCCCceEecCCCcceeecccCccCCCceecCCCccCCccccccc---c------ceeeccccceeEeeCC-CcccCCCCC
Confidence 99999999999999999999999999877642111222221100 0 1235544433 343 468999988
Q ss_pred CcccCCCCcCCCCCCCCCCCCCCCCCCCCCCCCeeeC-------ceeeeCC
Q psy13146 325 GYIGDAFSSCYPKPPEPVQPVIQEDTCNCAPNAECRD-------GVCLCLP 368 (895)
Q Consensus 325 Gy~G~~c~~c~~~~~~~~~~c~~~~~C~C~~~g~C~~-------~~C~C~~ 368 (895)
+-.|+.-..|.+.+..-.. .|..+|.+.. +.|.|..
T Consensus 905 ~ppG~~~~~c~~~~~~~vp--------~Cd~hgh~ap~qchG~~~~CwCvd 947 (1289)
T KOG1214|consen 905 TPPGSTPPHCGPSPEQYVP--------QCDDHGHFAPLQCHGKSDFCWCVD 947 (1289)
T ss_pred CCCCCCCCCCCCcccccCC--------CccccccccccccCCCcceeEEec
Confidence 8888776666554422111 2555666653 3577765
No 5
>KOG4289|consensus
Probab=99.64 E-value=4.5e-15 Score=174.07 Aligned_cols=99 Identities=32% Similarity=0.825 Sum_probs=82.4
Q ss_pred eecCCCceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCceecCCCCCeeeCCCCCcCCCCCCCc
Q psy13146 199 KVINHTPICTCPDGYTGDAFSGCYPKPPEPPPPPQEDIPEPINPCYPSPCGPYSQCRDINGSPSCSCLPSYIGAPPNCRP 278 (895)
Q Consensus 199 ~n~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~dideC~~~~C~~~g~C~n~~gsy~C~C~~G~~g~~~~C~~ 278 (895)
++..++++|.|++||+|+.|+ ..||+|.+.||.++|+|+...|+|+|.|.+||+|. .|+.
T Consensus 1216 i~pvnglrCrCPpGFTgd~Ce------------------TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe--hCEv 1275 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGDYCE------------------TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE--HCEV 1275 (2531)
T ss_pred ccccCceeEeCCCCCCccccc------------------chhHhhhcCCCCCCCceEEecCceeEEecCCcccc--ceee
Confidence 455678999999999999994 88999999999999999999999999999999999 7872
Q ss_pred cccCCCCCCCcccccccCcCCCCCCCCCCCCeeecc-CCCCcccCCCC-cccCCCC
Q psy13146 279 ECIQNSECPYDKACINEKCADPCPGSCGYGAVCTVI-NHSPICTCPEG-YIGDAFS 332 (895)
Q Consensus 279 ~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~g~C~~~-~g~~~C~C~~G-y~G~~c~ 332 (895)
+ .....|+ ++.|.++|+|++. +|+|.|.|+.| |++..|+
T Consensus 1276 ---s----~~agrCv--------pGvC~nggtC~~~~nggf~c~Cp~ge~e~prC~ 1316 (2531)
T KOG4289|consen 1276 ---S----ARAGRCV--------PGVCKNGGTCVNLLNGGFCCHCPYGEFEDPRCE 1316 (2531)
T ss_pred ---e----cccCccc--------cceecCCCEEeecCCCceeccCCCcccCCCceE
Confidence 1 1112333 5689999999986 78999999987 6666665
No 6
>KOG1214|consensus
Probab=99.54 E-value=8.7e-14 Score=157.03 Aligned_cols=197 Identities=30% Similarity=0.666 Sum_probs=138.5
Q ss_pred CCCCCCCCcccccCC-CCeEeCCCCCcCCCCCCCCCCCCCCCCCCCCccccCCcCCCCCCCCCCCCeeeecCCCceeeCC
Q psy13146 132 PSPCGPYSQCRDIGG-SPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCP 210 (895)
Q Consensus 132 ~~~C~~~g~C~n~~g-sy~C~C~~Gy~g~~~~C~~~C~d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~~g~~~C~C~ 210 (895)
++-|..++.|....+ .|+|.|..||.|++.+ |.|+++|+.. .+.|..++.|+|.+|+|+|.|.
T Consensus 699 sh~cdt~a~C~pg~~~~~tcecs~g~~gdgr~----c~d~~eca~~------------~~~CGp~s~Cin~pg~~rceC~ 762 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGVDYTCECSSGYQGDGRN----CVDENECATG------------FHRCGPNSVCINLPGSYRCECR 762 (1289)
T ss_pred CcccCCCccccCCCCcceEEEEeeccCCCCCC----CCChhhhccC------------CCCCCCCceeecCCCceeEEEe
Confidence 344666677887654 6999999999999755 4688888876 6899999999999999999999
Q ss_pred CCCc--cCCCccCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCc--eecC-CCCCeeeCCCCCcCCCCCCCccccCC
Q psy13146 211 DGYT--GDAFSGCYPKPPEPPPPPQEDIPEPINPCY--PSPCGPYSQ--CRDI-NGSPSCSCLPSYIGAPPNCRPECIQN 283 (895)
Q Consensus 211 ~Gy~--G~~c~~C~~~~~~~~~~~~~~~~~dideC~--~~~C~~~g~--C~n~-~gsy~C~C~~G~~g~~~~C~~~C~~~ 283 (895)
.||. ++.- .|++.... ..++.|. .+.|...+. |+.. .++|.|.|.|||.|+++.|. ++
T Consensus 763 ~gy~F~dd~~-tCV~i~~p----------ap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c~----dv 827 (1289)
T KOG1214|consen 763 SGYEFADDRH-TCVLITPP----------APANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGHQCT----DV 827 (1289)
T ss_pred ecceeccCCc-ceEEecCC----------CCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCccccc----cc
Confidence 9985 4422 23332211 2356676 356776554 4543 46799999999999987654 88
Q ss_pred CCCCCcccccccCcCCCCCCCCCCCCeeeccCCCCcccCCCCcccCCCCcCCCCCCCCCCCCCCCCCC--CCCCCC---e
Q psy13146 284 SECPYDKACINEKCADPCPGSCGYGAVCTVINHSPICTCPEGYIGDAFSSCYPKPPEPVQPVIQEDTC--NCAPNA---E 358 (895)
Q Consensus 284 ~eC~~~~~C~~~~C~~~C~~~C~~~g~C~~~~g~~~C~C~~Gy~G~~c~~c~~~~~~~~~~c~~~~~C--~C~~~g---~ 358 (895)
|||. ++.|+.+++|.+++|+|.|.|.+||.|++.. |.|... ...+|....+- .|+.+. .
T Consensus 828 DeC~--------------psrChp~A~CyntpgsfsC~C~pGy~GDGf~-CVP~~~-~~T~C~~er~hpl~chg~t~~~~ 891 (1289)
T KOG1214|consen 828 DECS--------------PSRCHPAATCYNTPGSFSCRCQPGYYGDGFQ-CVPDTS-SLTPCEQERFHPLQCHGSTGFCW 891 (1289)
T ss_pred cccC--------------ccccCCCceEecCCCcceeecccCccCCCce-ecCCCc-cCCccccccccceeeccccceeE
Confidence 9887 5689999999999999999999999999965 665421 12222222111 244444 2
Q ss_pred eeC---ceeeeCCCcccCCc
Q psy13146 359 CRD---GVCLCLPDYYGDGY 375 (895)
Q Consensus 359 C~~---~~C~C~~Gy~G~~~ 375 (895)
|++ +.+.+.++-.|++.
T Consensus 892 ~~Dp~~~e~p~~~~ppG~~~ 911 (1289)
T KOG1214|consen 892 CVDPDGHEVPGTQTPPGSTP 911 (1289)
T ss_pred eeCCCcccCCCCCCCCCCCC
Confidence 333 36777777666653
No 7
>KOG1219|consensus
Probab=99.37 E-value=5.3e-13 Score=162.17 Aligned_cols=112 Identities=31% Similarity=0.816 Sum_probs=99.9
Q ss_pred CCCCCCCCCCCCCceeecC-CceeeecCCCcccCCCCCcCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeccCCCC
Q psy13146 16 TNPCQPSPCGPNSQCREVN-KQAVCSCLPNYFGSPPACRPECTVNSDCPLNKACFNQKCVDPCPGTCGQNANCKVQNHNP 94 (895)
Q Consensus 16 ~d~C~~~~C~~~~~C~~~~-g~~~C~C~~Gf~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~~~~g~y 94 (895)
.|.|..+||+++|+|+..+ |+|+|.|++-|.|. .|+++.+ +|. +++|..+|+|+...++|
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~------~CEi~~e-----pC~--------snPC~~GgtCip~~n~f 3924 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN------HCEIDLE-----PCA--------SNPCLTGGTCIPFYNGF 3924 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccCc------ccccccc-----ccc--------CCCCCCCCEEEecCCCe
Confidence 3899999999999999876 58999999999999 6887642 333 47899999999999999
Q ss_pred eeecCCCCcCCCCcccccCCCCCCCCCCCCCC-CCCCCCCCCCCCCcccccCCCCeEeCCCCCcCCCCCCC
Q psy13146 95 ICNCKPGYTGDPRVYCNKIPPRPPPQEDVPEP-VNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGAPPNCR 164 (895)
Q Consensus 95 ~C~C~~Gy~g~~~~~C~~i~~~~~~~~~~~~d-ideC~~~~C~~~g~C~n~~gsy~C~C~~Gy~g~~~~C~ 164 (895)
.|.|+.||+|.. || ++ |+||+.++|.++|.|+|..|+|.|.|.+||.|. .|.
T Consensus 3925 ~CnC~~gyTG~~---Ce-------------~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr--~c~ 3977 (4289)
T KOG1219|consen 3925 LCNCPNGYTGKR---CE-------------ARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR--TCC 3977 (4289)
T ss_pred eEeCCCCccCce---ee-------------cccccccccccccCCceeeccCCceEeccChhHhcc--cCc
Confidence 999999999998 97 45 999999999999999999999999999999998 553
No 8
>KOG1219|consensus
Probab=99.31 E-value=2.4e-12 Score=156.59 Aligned_cols=108 Identities=31% Similarity=0.803 Sum_probs=99.0
Q ss_pred CCCCCCCCCCCCcccccC-CCCeEeCCCCCcCCCCCCCCCCCCCCCCCCCCccccCCcCCCCCCCCCCCCeeeecCCCce
Q psy13146 128 NPCYPSPCGPYSQCRDIG-GSPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPI 206 (895)
Q Consensus 128 deC~~~~C~~~g~C~n~~-gsy~C~C~~Gy~g~~~~C~~~C~d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~~g~~~ 206 (895)
+.|..+||+++|+|+.++ |+|.|.|++-|+|. +|+ +++.+|.. ++|..+|+|+...++|.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~--~CE---i~~epC~s--------------nPC~~GgtCip~~n~f~ 3925 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN--HCE---IDLEPCAS--------------NPCLTGGTCIPFYNGFL 3925 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCc--ccc---cccccccC--------------CCCCCCCEEEecCCCee
Confidence 899999999999999876 67999999999999 888 36666663 67889999999999999
Q ss_pred eeCCCCCccCCCccCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCceecCCCCCeeeCCCCCcCC
Q psy13146 207 CTCPDGYTGDAFSGCYPKPPEPPPPPQEDIPEP-INPCYPSPCGPYSQCRDINGSPSCSCLPSYIGA 272 (895)
Q Consensus 207 C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~d-ideC~~~~C~~~g~C~n~~gsy~C~C~~G~~g~ 272 (895)
|.|+.||+|+.|+ .+ |+||..++|.++|.|+|..|+|.|.|.+||.|.
T Consensus 3926 CnC~~gyTG~~Ce------------------~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~gr 3974 (4289)
T KOG1219|consen 3926 CNCPNGYTGKRCE------------------ARGISECSKNVCGTGGQCINIPGSFHCNCTPGILGR 3974 (4289)
T ss_pred EeCCCCccCceee------------------cccccccccccccCCceeeccCCceEeccChhHhcc
Confidence 9999999999995 44 999999999999999999999999999999998
No 9
>KOG0994|consensus
Probab=99.27 E-value=4.8e-11 Score=139.03 Aligned_cols=205 Identities=28% Similarity=0.692 Sum_probs=109.1
Q ss_pred eeeeCCCeeee-CCCCCcCCCC----ccccCCCCCCCCCC--------CeeeecC--CcccccCCCCCccCCCCCCCCCC
Q psy13146 518 CRVINHNAVCN-CKPGFTGEPR----IRCSKIPPRSCGYN--------AECKVIN--HTPICTCPQGYVGDAFSGCYPKP 582 (895)
Q Consensus 518 C~~~~g~~~C~-C~~Gy~G~~~----~~C~~~~~~~C~~~--------g~C~~~~--gs~~C~C~~Gy~G~~c~~C~~~~ 582 (895)
|.+...++.|. |..||.|++. +.|. |-||..+ -.|.... ..-.|.|.+||+|.+|+.|.+
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg~g~~Cr---PCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~RCe~CA~-- 952 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLGSGIGCR---PCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGSRCEICAD-- 952 (1758)
T ss_pred ccccccccchhhhhccccCCcccCCCCCCC---CCCCCCCCccchhccccccccccccceeeecccCccccchhhhcc--
Confidence 44555677785 9999999873 2333 2334321 1343222 235788999999998885443
Q ss_pred CCCCCCcccCCCCCCCCCCcccCceeeeCCCcccCCCc--ccCC-CcccCCCCCCCCccccCCccCCCCCCCCCC-CCee
Q psy13146 583 PEPEQPVVQEDTCNCVPNAECRDGVCVCLPEFYGDGYV--SCRP-ECVLNNDCPSNKACIRNKCKNPCVPGTCGE-GAIC 658 (895)
Q Consensus 583 ~~~~~~~~~~~~C~C~~~g~C~~~~C~C~~Gy~G~~~~--~C~~-~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~-~g~C 658 (895)
+|.|+... +|++ +|..+ ||.-.++.|.. .|.|
T Consensus 953 ------------------------------~~fGnP~~GGtCq~CeC~~N--------------iD~~d~~aCD~~TG~C 988 (1758)
T KOG0994|consen 953 ------------------------------NHFGNPSEGGTCQKCECSNN--------------IDLYDPGACDVATGAC 988 (1758)
T ss_pred ------------------------------cccCCcccCCccccccccCC--------------cCccCCCccchhhchh
Confidence 33333210 1111 12211 23333444432 2223
Q ss_pred ---eccCCceee-eCCCCCccCCccccCCCCccCCCCCCC-----CCCeec--CceeeCCCCccCCCCccCCCCcccC-C
Q psy13146 659 ---DVINHAVSC-NCPPGTTGSPFVQSEQPVVQEDTCNCV-----PNAECR--DGVCVCLPEFYGDGYVSCRPECVLN-N 726 (895)
Q Consensus 659 ---~~~~gs~~C-~C~~Gy~G~~c~~~~~~~~~~~~c~C~-----~~g~C~--~~~C~C~~G~~G~~c~~~~~~C~~~-~ 726 (895)
....-+-+| .|.+||.|+.-.+. ...|.|. +.+.|. .++|.|.|...|..|+.+ ..+ -
T Consensus 989 LkCL~hTeG~hCe~Ck~Gf~GdA~~q~------CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CDqC----A~N~w 1058 (1758)
T KOG0994|consen 989 LKCLYHTEGDHCEHCKDGFYGDALRQN------CQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCDQC----AENHW 1058 (1758)
T ss_pred hhhhhcccccchhhccccchhHHHHhh------hhhheccccccCCccccccccCcCCCCccccccccccc----ccchh
Confidence 222223456 49999999853221 1222222 123343 368999999999988643 221 0
Q ss_pred CCCCCcccccCCcCCCCCCCCCCCCCeEeecCCceeeeCCCCCcCCCCccccccccCCCCCCCCC
Q psy13146 727 DCPSNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGTTGSPFVQCKPIQYEPVYTNPCQ 791 (895)
Q Consensus 727 ~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~~C~~~~~~~~~~~~C~ 791 (895)
.=.++..| ++|.-+| ..+-+|....| +|.|+|||-|..+.+|++..|+.-+. .|.
T Consensus 1059 ~laSG~GC------e~C~Cd~-~~~pqCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~-~C~ 1113 (1758)
T KOG0994|consen 1059 NLASGEGC------EPCNCDP-IGGPQCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNE-KCR 1113 (1758)
T ss_pred ccccCCCC------CccCCCc-cCCcccccccc--ceeccCCCCCcchhHHHHhhcCCCCC-Cce
Confidence 00112222 1222122 22346777776 99999999999988898887765443 453
No 10
>KOG0994|consensus
Probab=99.22 E-value=2.9e-10 Score=132.66 Aligned_cols=58 Identities=31% Similarity=0.746 Sum_probs=38.3
Q ss_pred eeeccCCceeeeCCCCCccCCccccCC-----CCccCCCCCCCCCCe----ec--CceeeCCCCccCCCCc
Q psy13146 657 ICDVINHAVSCNCPPGTTGSPFVQSEQ-----PVVQEDTCNCVPNAE----CR--DGVCVCLPEFYGDGYV 716 (895)
Q Consensus 657 ~C~~~~gs~~C~C~~Gy~G~~c~~~~~-----~~~~~~~c~C~~~g~----C~--~~~C~C~~G~~G~~c~ 716 (895)
+|...+| +|.|.|||-|..|+++.. +-.....|.|...|+ |. ++.|+|.+|..|..|+
T Consensus 1078 qCN~ftG--QCqCkpGfGGR~C~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G~rCd 1146 (1758)
T KOG0994|consen 1078 QCNEFTG--QCQCKPGFGGRTCSQCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGGPRCD 1146 (1758)
T ss_pred ccccccc--ceeccCCCCCcchhHHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCCcchh
Confidence 5666666 788888888887765321 222344455665553 53 4689999999999875
No 11
>KOG1225|consensus
Probab=99.14 E-value=2.5e-10 Score=129.40 Aligned_cols=131 Identities=30% Similarity=0.824 Sum_probs=104.3
Q ss_pred CeeeeCCCCCcCCCCccccCCCCCCCCCCCeeeecCCcccccCCCCCccCCCCCCCCCCCCCCCCcccCCCC--CCCCCC
Q psy13146 524 NAVCNCKPGFTGEPRIRCSKIPPRSCGYNAECKVINHTPICTCPQGYVGDAFSGCYPKPPEPEQPVVQEDTC--NCVPNA 601 (895)
Q Consensus 524 ~~~C~C~~Gy~G~~~~~C~~~~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C--~C~~~g 601 (895)
.+.|.|+.+|+|..+. ....+..|...+.|+ ..+|+|++||+|+.|.. ..| .|+.++
T Consensus 233 ~~ic~c~~~~~g~~c~--~~~C~~~c~~~g~c~----~G~CIC~~Gf~G~dC~e---------------~~Cp~~cs~~g 291 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCS--TIYCPGGCTGRGQCV----EGRCICPPGFTGDDCDE---------------LVCPVDCSGGG 291 (525)
T ss_pred CceeecCCceeCCccc--cccCCCCCcccceEe----CCeEeCCCCCcCCCCCc---------------ccCCcccCCCc
Confidence 4589999999998532 122356677778887 56899999999999862 223 377889
Q ss_pred cccCceeeeCCCcccCCCcccCCCcccCCCCCCCCccccCCccCCCCCCCCCCCCeeeccCCceeeeCCCCCccCCcccc
Q psy13146 602 ECRDGVCVCLPEFYGDGYVSCRPECVLNNDCPSNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGTTGSPFVQS 681 (895)
Q Consensus 602 ~C~~~~C~C~~Gy~G~~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~~ 681 (895)
.++++.|+|++||+|.. |++.. | +..|+++|.|+ .| +|.|.+||+|..|...
T Consensus 292 ~~~~g~CiC~~g~~G~d-------Cs~~~----------------c-padC~g~G~Ci--~G--~C~C~~Gy~G~~C~~~ 343 (525)
T KOG1225|consen 292 VCVDGECICNPGYSGKD-------CSIRR----------------C-PADCSGHGKCI--DG--ECLCDEGYTGELCIQR 343 (525)
T ss_pred eecCCEeecCCCccccc-------ccccc----------------C-CccCCCCCccc--CC--ceEeCCCCcCCccccc
Confidence 99999999999999994 55432 2 45788999998 44 8999999999998531
Q ss_pred CCCCccCCCCCCCCCCeecCceeeCCCCccCCC
Q psy13146 682 EQPVVQEDTCNCVPNAECRDGVCVCLPEFYGDG 714 (895)
Q Consensus 682 ~~~~~~~~~c~C~~~g~C~~~~C~C~~G~~G~~ 714 (895)
.|.+++.|+++ |+|..||.|..
T Consensus 344 ----------~C~~~g~cv~g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 344 ----------ACSGGGQCVNG-CKCKKGWRGPD 365 (525)
T ss_pred ----------ccCCCceeccC-ceeccCccCCC
Confidence 38999999999 99999999987
No 12
>KOG1225|consensus
Probab=99.05 E-value=1.6e-09 Score=122.97 Aligned_cols=216 Identities=24% Similarity=0.551 Sum_probs=131.7
Q ss_pred CCCCCCCeeeecCCcccccCCCCCccCCCCCCCCCCCCCCCCcccCCCCCCCCCCcccCceeeeCCCcccCCCcccCCCc
Q psy13146 547 RSCGYNAECKVINHTPICTCPQGYVGDAFSGCYPKPPEPEQPVVQEDTCNCVPNAECRDGVCVCLPEFYGDGYVSCRPEC 626 (895)
Q Consensus 547 ~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C~C~~~g~C~~~~C~C~~Gy~G~~~~~C~~~C 626 (895)
..+...+.+. .++|.+.+++.+..+. ...-...+..++.+..+.+.+..+|+|.... +..+
T Consensus 150 ~~~~~~~~~~----~~~c~~~~~~~~~~~g-------------~~~~~~~~~~hg~~~~~~~l~~~~~s~~~~~--~~~~ 210 (525)
T KOG1225|consen 150 EDCLVRILCK----NGVCSLKPNPFGAECG-------------QYKCPNDGSGHGRYYFGNCLSGISASGETCN--QLGC 210 (525)
T ss_pred hhhcchhhhh----cccccccCCccccccc-------------eecCCcCCCCCccceecccccccCcchhhhh--cccC
Confidence 3344444444 5778888888877654 1111224667788888889999999887411 0001
Q ss_pred ccCCCCCCCCccccCCccCCCCCCCCCCCCeeeccCCceeeeCCCCCccCCccccCCCCccCCCCCCCCCCeecCceeeC
Q psy13146 627 VLNNDCPSNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGTTGSPFVQSEQPVVQEDTCNCVPNAECRDGVCVC 706 (895)
Q Consensus 627 ~~~~~C~~~~~C~~~~C~~~C~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~~~~~~~~~~~c~C~~~g~C~~~~C~C 706 (895)
..++.....+ +.+++ .|....-.+.|.|+.+|+|..|+. . .=+..|.+++.|+++.|+|
T Consensus 211 --~~~~~~~~r~---------~~~~~----~~~~~~~~~ic~c~~~~~g~~c~~--~----~C~~~c~~~g~c~~G~CIC 269 (525)
T KOG1225|consen 211 --NDDCFRTGRC---------REGRC----FCTAGFFDGICECPEGYFGPLCST--I----YCPGGCTGRGQCVEGRCIC 269 (525)
T ss_pred --Cccceecccc---------ccCcc----cccccccCceeecCCceeCCcccc--c----cCCCCCcccceEeCCeEeC
Confidence 0111111111 11111 122222233899999999988741 1 0111466678899999999
Q ss_pred CCCccCCCCccCCCCcccCCCCCCCcccccCCcCCCCCCCCCCCCCeEeecCCceeeeCCCCCcCCCCccccccccCCCC
Q psy13146 707 LPEFYGDGYVSCRPECVLNNDCPSNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGTTGSPFVQCKPIQYEPVY 786 (895)
Q Consensus 707 ~~G~~G~~c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~~C~~~~~~~~~ 786 (895)
++||+|+.|+. ..|.. .|+.++.+++. +|.|++||+|.. |+
T Consensus 270 ~~Gf~G~dC~e-----------------------~~Cp~-~cs~~g~~~~g----~CiC~~g~~G~d---Cs-------- 310 (525)
T KOG1225|consen 270 PPGFTGDDCDE-----------------------LVCPV-DCSGGGVCVDG----ECICNPGYSGKD---CS-------- 310 (525)
T ss_pred CCCCcCCCCCc-----------------------ccCCc-ccCCCceecCC----EeecCCCccccc---cc--------
Confidence 99999987652 11212 26666665543 789999998887 64
Q ss_pred CCCCCCCCCCCCCceecCCCceeccCCCCCcCCCCCCcccCcCCCCCCCCCccccCCCcccceecCCCcceeecC
Q psy13146 787 TNPCQPSPCGPNSQCREVNKQAVCSCLPNYFGSPPACRPECTVNSDCPLNKACFNQKCVYTYSISTFCIWYTVAG 861 (895)
Q Consensus 787 ~~~C~~~~C~~~~~C~~~~g~y~C~C~~Gy~G~~~~C~~eC~~~~~C~~~~~C~n~~g~~~C~C~~~~~g~~~~g 861 (895)
+..| +.+|.++|.|+ .| +|.|.+||+|. .|+... |...+.|+|. |.|.. ||.|.-
T Consensus 311 ~~~c-padC~g~G~Ci--~G--~C~C~~Gy~G~------~C~~~~-C~~~g~cv~g-----C~C~~---Gw~G~d 365 (525)
T KOG1225|consen 311 IRRC-PADCSGHGKCI--DG--ECLCDEGYTGE------LCIQRA-CSGGGQCVNG-----CKCKK---GWRGPD 365 (525)
T ss_pred cccC-CccCCCCCccc--CC--ceEeCCCCcCC------cccccc-cCCCceeccC-----ceecc---CccCCC
Confidence 2334 35688888888 33 88999999988 555443 7777777665 66766 666553
No 13
>KOG4260|consensus
Probab=98.69 E-value=2.4e-08 Score=100.74 Aligned_cols=148 Identities=25% Similarity=0.590 Sum_probs=98.2
Q ss_pred CCCCCCCceec---cCCCCeeecCCCCcCCCCcccccCCCCCCCCCCCCCCCCCCCC--CCCCCCCcccccCCCCeE-eC
Q psy13146 79 GTCGQNANCKV---QNHNPICNCKPGYTGDPRVYCNKIPPRPPPQEDVPEPVNPCYP--SPCGPYSQCRDIGGSPSC-SC 152 (895)
Q Consensus 79 ~~C~~~g~C~~---~~g~y~C~C~~Gy~g~~~~~C~~i~~~~~~~~~~~~dideC~~--~~C~~~g~C~n~~gsy~C-~C 152 (895)
.+|..+|.|.- ..|+-.|.|.+||+|....+|.+ +.....++ +..--|.. .+|. +.|.- .++-.| .|
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~--eyfes~Rn--e~~lvCt~Ch~~C~--~~Csg-~~~k~C~kC 222 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGI--EYFESSRN--EQHLVCTACHEGCL--GVCSG-ESSKGCSKC 222 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccch--HHHHhhcc--cccchhhhhhhhhh--cccCC-CCCCChhhh
Confidence 34666777753 45788999999999998333420 00000000 00001111 1232 24432 334456 59
Q ss_pred CCCCcCCCCCCCCCCCCCCCCCCCCccccCCcCCCCCCCCCCCCeeeecCCCceeeCCCCCccCCCccCCCCCCCCCCCC
Q psy13146 153 LPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAFSGCYPKPPEPPPPP 232 (895)
Q Consensus 153 ~~Gy~g~~~~C~~~C~d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~~g~~~C~C~~Gy~G~~c~~C~~~~~~~~~~~ 232 (895)
..||.++ +..|+|||||.+. +.+|..+..|+|+.|||+|..++||.+.
T Consensus 223 kkGW~ld----e~gCvDvnEC~~e------------p~~c~~~qfCvNteGSf~C~dk~Gy~~g---------------- 270 (350)
T KOG4260|consen 223 KKGWKLD----EEGCVDVNECQNE------------PAPCKAHQFCVNTEGSFKCEDKEGYKKG---------------- 270 (350)
T ss_pred cccceec----ccccccHHHHhcC------------CCCCChhheeecCCCceEecccccccCC----------------
Confidence 9999998 3467899999987 6789889999999999999999999862
Q ss_pred CCCCCCCCCCCCC--CCC-CCCCceecCCCCCeeeCCCCCcCC
Q psy13146 233 QEDIPEPINPCYP--SPC-GPYSQCRDINGSPSCSCLPSYIGA 272 (895)
Q Consensus 233 ~~~~~~dideC~~--~~C-~~~g~C~n~~gsy~C~C~~G~~g~ 272 (895)
+|+|.. ..| ..+..|.|+.++|+|+|..|+.-.
T Consensus 271 -------~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 271 -------VDECQFCADVCASKNRPCMNIDGQYRCVCFSGLIII 306 (350)
T ss_pred -------hHHhhhhhhhcccCCCCcccCCccEEEEecccceee
Confidence 345542 333 245789999999999999998754
No 14
>KOG4260|consensus
Probab=98.49 E-value=1.4e-07 Score=95.35 Aligned_cols=134 Identities=24% Similarity=0.472 Sum_probs=94.2
Q ss_pred eecCc-eeeCCCCccCCCCccCCCCcccCCCCCCCcccccCCcCCCCCCCCCCCCCeEee---cCCceeeeCCCCCcCCC
Q psy13146 698 ECRDG-VCVCLPEFYGDGYVSCRPECVLNNDCPSNKACIRNKCKNPCVPGTCGEGAICDV---INHAVSCNCPPGTTGSP 773 (895)
Q Consensus 698 ~C~~~-~C~C~~G~~G~~c~~~~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~~~C~~---~~g~y~C~C~~Gy~g~~ 773 (895)
.|++. .=-|++|.+|..|..+ +-+ ...+|..+|.|.- ..|+.+|.|.+||+|..
T Consensus 123 lCvdqLkvCCp~gtyGpdCl~C----------pgg------------ser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~ 180 (350)
T KOG4260|consen 123 LCVDQLKVCCPDGTYGPDCLQC----------PGG------------SERPCFGNGSCHGDGSREGSGKCKCETGYTGPL 180 (350)
T ss_pred hhhhhheeccCCCCcCCccccC----------CCC------------CcCCcCCCCcccCCCCCCCCCcccccCCCCCcc
Confidence 34443 2347889899876532 111 0235666677742 45778999999999998
Q ss_pred Ccccccccc------------------------------------------CCCCCCCCC--CCCCCCCCceecCCCcee
Q psy13146 774 FVQCKPIQY------------------------------------------EPVYTNPCQ--PSPCGPNSQCREVNKQAV 809 (895)
Q Consensus 774 ~~~C~~~~~------------------------------------------~~~~~~~C~--~~~C~~~~~C~~~~g~y~ 809 (895)
+..|.+..| ..+|||||. ++||..+..|+|+.|||+
T Consensus 181 C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~ 260 (350)
T KOG4260|consen 181 CRYCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFK 260 (350)
T ss_pred ccccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceE
Confidence 666654321 236899995 578999999999999999
Q ss_pred ccCCCCCcCCCCCCcccCcCC-CCCC-CCCccccCCCcccceecCCCcceeec
Q psy13146 810 CSCLPNYFGSPPACRPECTVN-SDCP-LNKACFNQKCVYTYSISTFCIWYTVA 860 (895)
Q Consensus 810 C~C~~Gy~G~~~~C~~eC~~~-~~C~-~~~~C~n~~g~~~C~C~~~~~g~~~~ 860 (895)
|..++||.+. +|+|+.- ..|. .+..|.|+.++|+|.|.+ |+...
T Consensus 261 C~dk~Gy~~g----~d~C~~~~d~~~~kn~~c~ni~~~~r~v~f~---~~~~~ 306 (350)
T KOG4260|consen 261 CEDKEGYKKG----VDECQFCADVCASKNRPCMNIDGQYRCVCFS---GLIII 306 (350)
T ss_pred ecccccccCC----hHHhhhhhhhcccCCCCcccCCccEEEEecc---cceee
Confidence 9999999984 3344321 1232 488999999999999999 55544
No 15
>KOG1226|consensus
Probab=98.28 E-value=5.9e-06 Score=95.69 Aligned_cols=144 Identities=26% Similarity=0.546 Sum_probs=95.8
Q ss_pred CCCCCCCCCeeeecCCcccccCCCCCccCCCCCCCCCCCCCCCCcccCCCC-------CCCCCCcccCceeeeCCCcc--
Q psy13146 545 PPRSCGYNAECKVINHTPICTCPQGYVGDAFSGCYPKPPEPEQPVVQEDTC-------NCVPNAECRDGVCVCLPEFY-- 615 (895)
Q Consensus 545 ~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~C-------~C~~~g~C~~~~C~C~~Gy~-- 615 (895)
.+..|+.+|+.+ =.+|.|.+||.|..|+ |....-.+. ..++.| .|++.|.|.=|+|+|.+...
T Consensus 465 ~s~~C~g~G~~~----CG~C~C~~G~~G~~CE-C~~~~~ss~---~~~~~Cr~~~~~~vCSgrG~C~CGqC~C~~~~~~~ 536 (783)
T KOG1226|consen 465 NSALCHGNGTFV----CGQCRCDEGWLGKKCE-CSTDELSSS---EEEDKCRENSDSPVCSGRGDCVCGQCVCHKPDNGK 536 (783)
T ss_pred CccccCCCCcEE----ecceecCCCCCCCccc-CCccccCcH---hHHhhccCCCCCCCcCCCCcEeCCceEecCCCCCc
Confidence 455677666665 4679999999999998 432211110 012223 69999999999999998877
Q ss_pred --cCCCcccCCCcccCCCCCCCCccccCCccCCCCCCCCCCCCeeeccCCceeeeCCCCCccCCccccCCC--CccCCCC
Q psy13146 616 --GDGYVSCRPECVLNNDCPSNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGTTGSPFVQSEQP--VVQEDTC 691 (895)
Q Consensus 616 --G~~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~~~~~--~~~~~~c 691 (895)
|..+ ||.+ ..|+.. ....|+.+|.|.=. +|+|.+||+|..|++.... +...+--
T Consensus 537 i~G~fC-----ECDn-fsC~r~------------~g~lC~g~G~C~CG----~CvC~~GwtG~~C~C~~std~C~~~~G~ 594 (783)
T KOG1226|consen 537 IYGKFC-----ECDN-FSCERH------------KGVLCGGHGRCECG----RCVCNPGWTGSACNCPLSTDTCESSDGQ 594 (783)
T ss_pred eeeeee-----eccC-cccccc------------cCcccCCCCeEeCC----cEEcCCCCccCCCCCCCCCccccCCCCc
Confidence 5542 1221 123221 12368888888643 8999999999999864321 1111222
Q ss_pred CCCCCCeecCceeeCCCC-ccCCCCccC
Q psy13146 692 NCVPNAECRDGVCVCLPE-FYGDGYVSC 718 (895)
Q Consensus 692 ~C~~~g~C~~~~C~C~~G-~~G~~c~~~ 718 (895)
.|...|+|.=++|+|... |.|..|+.+
T Consensus 595 iCSGrG~C~Cg~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 595 ICSGRGTCECGRCKCTDPPYSGEFCEKC 622 (783)
T ss_pred eeCCCceeeCCceEcCCCCcCcchhhcC
Confidence 688899999999999887 999988743
No 16
>KOG1836|consensus
Probab=98.19 E-value=7e-05 Score=96.46 Aligned_cols=63 Identities=32% Similarity=0.709 Sum_probs=46.5
Q ss_pred eEeecCCceeeeCCCCCcCCCCccccccccCCCCCCCCCCCCCCCCC----ceecCCCceeccCCCCCcCCC
Q psy13146 753 ICDVINHAVSCNCPPGTTGSPFVQCKPIQYEPVYTNPCQPSPCGPNS----QCREVNKQAVCSCLPNYFGSP 820 (895)
Q Consensus 753 ~C~~~~g~y~C~C~~Gy~g~~~~~C~~~~~~~~~~~~C~~~~C~~~~----~C~~~~g~y~C~C~~Gy~G~~ 820 (895)
.|+...| +|.|.+|.+|..+.+|+..++...- ..|..-.|...| .|....| +|.|.+||.|..
T Consensus 953 ~c~~~tG--qc~c~~gVtgqrc~qc~~~~~~~~~-~gc~~c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~ 1019 (1705)
T KOG1836|consen 953 DCDVGTG--QCYCRPGVTGQRCDQCETYHFGFQT-EGCGLCECDPLGSRGFQCDPEDG--QCPCRPGFEGRR 1019 (1705)
T ss_pred cccccCC--ceeeecCccccccCccccCcccccc-cCCcceecccCCcccceecccCC--eeeecCCCCCcc
Confidence 5655555 8999999999998778877665432 556544566555 5888777 999999999973
No 17
>KOG1226|consensus
Probab=98.19 E-value=8.7e-06 Score=94.33 Aligned_cols=146 Identities=20% Similarity=0.429 Sum_probs=95.7
Q ss_pred CCCCCCcccCceeeeCCCcccCCCcccCCCcccCCCCC--CCCccccCCccCCCCCCCCCCCCeeeccCCceeeeCCCCC
Q psy13146 596 NCVPNAECRDGVCVCLPEFYGDGYVSCRPECVLNNDCP--SNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGT 673 (895)
Q Consensus 596 ~C~~~g~C~~~~C~C~~Gy~G~~~~~C~~~C~~~~~C~--~~~~C~~~~C~~~C~~~~C~~~g~C~~~~gs~~C~C~~Gy 673 (895)
.|+.+|+..-++|.|.+||.|..++ |.....=. ....|... =...+|+..|.|.=. +|+|.+..
T Consensus 468 ~C~g~G~~~CG~C~C~~G~~G~~CE-----C~~~~~ss~~~~~~Cr~~-----~~~~vCSgrG~C~CG----qC~C~~~~ 533 (783)
T KOG1226|consen 468 LCHGNGTFVCGQCRCDEGWLGKKCE-----CSTDELSSSEEEDKCREN-----SDSPVCSGRGDCVCG----QCVCHKPD 533 (783)
T ss_pred ccCCCCcEEecceecCCCCCCCccc-----CCccccCcHhHHhhccCC-----CCCCCcCCCCcEeCC----ceEecCCC
Confidence 4778888888999999999999743 32210000 00111111 012378888888643 79998877
Q ss_pred c----cCCccccCCCCccCCCCCCCCCCeecCceeeCCCCccCCCCccC--CCCcccCCCCCCCcccccCCcCCCCCCCC
Q psy13146 674 T----GSPFVQSEQPVVQEDTCNCVPNAECRDGVCVCLPEFYGDGYVSC--RPECVLNNDCPSNKACIRNKCKNPCVPGT 747 (895)
Q Consensus 674 ~----G~~c~~~~~~~~~~~~c~C~~~g~C~~~~C~C~~G~~G~~c~~~--~~~C~~~~~C~~~~~C~~~~C~~~C~~~~ 747 (895)
. |..|++.+..+...+--.|..+|+|.=++|+|.+||+|..|.-. .+.|+.. + ...
T Consensus 534 ~~~i~G~fCECDnfsC~r~~g~lC~g~G~C~CG~CvC~~GwtG~~C~C~~std~C~~~----~--------------G~i 595 (783)
T KOG1226|consen 534 NGKIYGKFCECDNFSCERHKGVLCGGHGRCECGRCVCNPGWTGSACNCPLSTDTCESS----D--------------GQI 595 (783)
T ss_pred CCceeeeeeeccCcccccccCcccCCCCeEeCCcEEcCCCCccCCCCCCCCCccccCC----C--------------Cce
Confidence 6 89998766555444445799999999999999999999988621 2222221 0 124
Q ss_pred CCCCCeEeecCCceeeeCCCC-CcCCCCccc
Q psy13146 748 CGEGAICDVINHAVSCNCPPG-TTGSPFVQC 777 (895)
Q Consensus 748 C~~~~~C~~~~g~y~C~C~~G-y~g~~~~~C 777 (895)
|+..|+|.=. +|.|... |.|..++.|
T Consensus 596 CSGrG~C~Cg----~C~C~~~~~sG~~CE~c 622 (783)
T KOG1226|consen 596 CSGRGTCECG----RCKCTDPPYSGEFCEKC 622 (783)
T ss_pred eCCCceeeCC----ceEcCCCCcCcchhhcC
Confidence 6666777543 6888766 999984433
No 18
>KOG1836|consensus
Probab=97.97 E-value=0.00023 Score=91.80 Aligned_cols=108 Identities=29% Similarity=0.666 Sum_probs=63.4
Q ss_pred eeeCCCCCccCCCccCCCCCCCCCCCCCC-CCCCCCCCCC--CCCCCC-CC--ceecCCCCCee-eCCCCCcCCCCCCCc
Q psy13146 206 ICTCPDGYTGDAFSGCYPKPPEPPPPPQE-DIPEPINPCY--PSPCGP-YS--QCRDINGSPSC-SCLPSYIGAPPNCRP 278 (895)
Q Consensus 206 ~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~-~~~~dideC~--~~~C~~-~g--~C~n~~gsy~C-~C~~G~~g~~~~C~~ 278 (895)
.|.|+.||+|..|+.|.+.+++..+.... ..+.+.+ |. ++.|.. .| .|.....+-+| +|..||.|....-.
T Consensus 696 ~c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~-cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~- 773 (1705)
T KOG1836|consen 696 QCTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCD-CNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGT- 773 (1705)
T ss_pred hccCCCCcccchhhhcchhhhcccccCCCCCcccccc-cCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCC-
Confidence 39999999999999999988775332111 1111111 00 122221 12 24444444566 79999998843111
Q ss_pred cccCCCCCCCcccccccCcCCCCCCCCCCCCeeecc--CCCCccc-CCCCcccCCCCc
Q psy13146 279 ECIQNSECPYDKACINEKCADPCPGSCGYGAVCTVI--NHSPICT-CPEGYIGDAFSS 333 (895)
Q Consensus 279 ~C~~~~eC~~~~~C~~~~C~~~C~~~C~~~g~C~~~--~g~~~C~-C~~Gy~G~~c~~ 333 (895)
..| |. +=+|.+++.|..+ ..+..|. |++||+|..|+.
T Consensus 774 ---~~d-C~--------------~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~ 813 (1705)
T KOG1836|consen 774 ---SGD-CQ--------------PCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE 813 (1705)
T ss_pred ---CCC-Cc--------------cCCCCCChhhcCcCcccceecCCCCCCCccccccc
Confidence 000 21 1245666666654 3567898 999999999984
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.94 E-value=3.2e-06 Score=63.22 Aligned_cols=34 Identities=32% Similarity=0.683 Sum_probs=30.2
Q ss_pred CCCCCCC--CCCCCCCcccccCCCCeEeCCCCCcCC
Q psy13146 126 PVNPCYP--SPCGPYSQCRDIGGSPSCSCLPNYIGA 159 (895)
Q Consensus 126 dideC~~--~~C~~~g~C~n~~gsy~C~C~~Gy~g~ 159 (895)
|||||+. ++|..+++|+|+.|||+|.|++||+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 5899985 569888999999999999999999954
No 20
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.90 E-value=6.8e-06 Score=61.46 Aligned_cols=34 Identities=26% Similarity=0.590 Sum_probs=30.3
Q ss_pred CCCCCCC--CCCCCCCceeecCCceeeecCCCcccC
Q psy13146 15 YTNPCQP--SPCGPNSQCREVNKQAVCSCLPNYFGS 48 (895)
Q Consensus 15 ~~d~C~~--~~C~~~~~C~~~~g~~~C~C~~Gf~g~ 48 (895)
|||||+. ++|..+++|+|+.|+|+|.|++||+..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 7999975 579989999999999999999999843
No 21
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.86 E-value=7.8e-06 Score=57.08 Aligned_cols=30 Identities=30% Similarity=0.895 Sum_probs=28.1
Q ss_pred CCCCCCCCCCceeecC-CceeeecCCCcccC
Q psy13146 19 CQPSPCGPNSQCREVN-KQAVCSCLPNYFGS 48 (895)
Q Consensus 19 C~~~~C~~~~~C~~~~-g~~~C~C~~Gf~g~ 48 (895)
|.++||+++|+|+++. ++|+|+|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 6788999999999998 99999999999996
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.66 E-value=4.5e-05 Score=55.83 Aligned_cols=34 Identities=32% Similarity=0.820 Sum_probs=30.8
Q ss_pred CCCCCCC-CCCCCCCcccccCCCCeEeCCCCCc-CC
Q psy13146 126 PVNPCYP-SPCGPYSQCRDIGGSPSCSCLPNYI-GA 159 (895)
Q Consensus 126 dideC~~-~~C~~~g~C~n~~gsy~C~C~~Gy~-g~ 159 (895)
|+|+|.. ++|.++++|+++.|+|+|.|++||+ |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 4788987 8999989999999999999999999 65
No 23
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.65 E-value=5.3e-05 Score=55.48 Aligned_cols=34 Identities=26% Similarity=0.715 Sum_probs=31.2
Q ss_pred CCCCCCC-CCCCCCCceeecCCceeeecCCCcc-cC
Q psy13146 15 YTNPCQP-SPCGPNSQCREVNKQAVCSCLPNYF-GS 48 (895)
Q Consensus 15 ~~d~C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~-g~ 48 (895)
|+|+|.. +||.++++|+++.++|+|.|++||+ |.
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 5789988 8999999999999999999999999 65
No 24
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.61 E-value=3.8e-05 Score=53.62 Aligned_cols=31 Identities=35% Similarity=0.941 Sum_probs=27.4
Q ss_pred CCCCCCCCCCeEeecC-CceeeeCCCCCcCCC
Q psy13146 743 CVPGTCGEGAICDVIN-HAVSCNCPPGTTGSP 773 (895)
Q Consensus 743 C~~~~C~~~~~C~~~~-g~y~C~C~~Gy~g~~ 773 (895)
|.+++|+++|+|++.. ++|+|.|++||+|..
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEESTT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccCC
Confidence 4467999999999988 999999999999963
No 25
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.52 E-value=1.2e-05 Score=78.07 Aligned_cols=148 Identities=26% Similarity=0.623 Sum_probs=86.6
Q ss_pred CCCCCCceeecCCceeeecCCCcccCC-CCCcCCcccCCCCCCcccccCCcccCCCCCCCCCCCceeccC-----CCCee
Q psy13146 23 PCGPNSQCREVNKQAVCSCLPNYFGSP-PACRPECTVNSDCPLNKACFNQKCVDPCPGTCGQNANCKVQN-----HNPIC 96 (895)
Q Consensus 23 ~C~~~~~C~~~~g~~~C~C~~Gf~g~~-~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~~~~-----g~y~C 96 (895)
.|. ||.-+.+..-|.|.|.+||.... .+| +...+|... .+ ++ -+|...|+|++.. ..|+|
T Consensus 7 ~CK-NG~LiQMSNHfEC~Cnegfvl~~EntC----E~kv~C~~~----e~--~~---K~Cgdya~C~~~~~~~~~~~~~C 72 (197)
T PF06247_consen 7 ICK-NGYLIQMSNHFECKCNEGFVLKNENTC----EEKVECDKL----EN--VN---KPCGDYAKCINQANKGEERAYKC 72 (197)
T ss_dssp --B-TEEEEEESSEEEEEESTTEEEEETTEE----EE----SG-----GG--TT---SEEETTEEEEE-SSTTSSTSEEE
T ss_pred ccc-CCEEEEccCceEEEcCCCcEEcccccc----ccceecCcc----cc--cC---ccccchhhhhcCCCcccceeEEE
Confidence 454 47788888899999999998654 234 333222110 00 11 4577788998765 57999
Q ss_pred ecCCCCcCCCCcccccCCCCCCCCCCCCCCCCCCCCCCCCCCCccccc---CCCCeEeCCCCCcCCCCCCCCCCCCCCCC
Q psy13146 97 NCKPGYTGDPRVYCNKIPPRPPPQEDVPEPVNPCYPSPCGPYSQCRDI---GGSPSCSCLPNYIGAPPNCRPECVQNNDC 173 (895)
Q Consensus 97 ~C~~Gy~g~~~~~C~~i~~~~~~~~~~~~dideC~~~~C~~~g~C~n~---~gsy~C~C~~Gy~g~~~~C~~~C~d~~~C 173 (895)
.|.+||+..... |. .++|..-.|+ .|.|+-. +....|+|.-|+..+ |.+.|
T Consensus 73 ~C~~gY~~~~~v-Cv---------------p~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~---------dn~kC 126 (197)
T PF06247_consen 73 DCINGYILKQGV-CV---------------PNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPD---------DNKKC 126 (197)
T ss_dssp EE-TTEEESSSS-EE---------------EGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETT---------TTTES
T ss_pred ecccCceeeCCe-Ec---------------hhhcCceecC-CCeEEecCCCCCCceeEeeeceEec---------cCCcc
Confidence 999999976533 53 2566666787 4899843 334599999999933 22222
Q ss_pred CCCCccccCCcCCCCCCCCCCCCeeeecCCCceeeCCCCCccCC
Q psy13146 174 SNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDA 217 (895)
Q Consensus 174 ~~~~~C~~~~C~~~C~~~C~~~g~C~n~~g~~~C~C~~Gy~G~~ 217 (895)
...+ +.+|+-.|..+..|....+-|+|.+..||.++.
T Consensus 127 tk~G-------~T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 127 TKTG-------ETKCSLKCKENEECKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp EEEE---------------TTTEEEEEETTEEEEEE-TT-EEET
T ss_pred cCCC-------ccceeeecCCCcceeeeCcEEEeecCCCCCCCC
Confidence 2211 112234567788999999999999999998654
No 26
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.47 E-value=4.7e-05 Score=54.47 Aligned_cols=31 Identities=29% Similarity=0.477 Sum_probs=26.3
Q ss_pred CCCCCCCccccCCCcccceecCCCcceeecCccc
Q psy13146 831 SDCPLNKACFNQKCVYTYSISTFCIWYTVAGVFL 864 (895)
Q Consensus 831 ~~C~~~~~C~n~~g~~~C~C~~~~~g~~~~g~~c 864 (895)
..|+.+|+|+|+.++|+|+|++ ||+|+|.+|
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~---Gy~GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKP---GYEGDGFFC 36 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-C---EEECCSTCE
T ss_pred CCCCCCcEeecCCCCEEeECCC---CCccCCcCC
Confidence 4799999999999999999999 999999876
No 27
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.33 E-value=0.00012 Score=52.46 Aligned_cols=30 Identities=30% Similarity=0.718 Sum_probs=24.3
Q ss_pred CCCCCCCceeecCCceeeecCCCcccCCCC
Q psy13146 22 SPCGPNSQCREVNKQAVCSCLPNYFGSPPA 51 (895)
Q Consensus 22 ~~C~~~~~C~~~~g~~~C~C~~Gf~g~~~~ 51 (895)
..|+.+|+|+++.++|+|+|++||+|++..
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 469999999999999999999999999744
No 28
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.23 E-value=0.00033 Score=50.64 Aligned_cols=33 Identities=36% Similarity=0.872 Sum_probs=30.2
Q ss_pred CCCCCC-CCCCCCCcccccCCCCeEeCCCCCcCC
Q psy13146 127 VNPCYP-SPCGPYSQCRDIGGSPSCSCLPNYIGA 159 (895)
Q Consensus 127 ideC~~-~~C~~~g~C~n~~gsy~C~C~~Gy~g~ 159 (895)
+|+|.. .+|.++++|+++.++|+|.|++||.|.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 678887 799988999999999999999999996
No 29
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.14 E-value=0.0005 Score=49.70 Aligned_cols=34 Identities=29% Similarity=0.734 Sum_probs=31.1
Q ss_pred CCCCCCC-CCCCCCCceeecCCceeeecCCCcccC
Q psy13146 15 YTNPCQP-SPCGPNSQCREVNKQAVCSCLPNYFGS 48 (895)
Q Consensus 15 ~~d~C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~g~ 48 (895)
++|+|.. .+|.++++|++..++|+|.|++||.|.
T Consensus 1 ~~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CcccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 4788987 899999999999999999999999986
No 30
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.93 E-value=0.00024 Score=69.15 Aligned_cols=140 Identities=25% Similarity=0.633 Sum_probs=87.7
Q ss_pred CCCCCcEEeecCCceeeeCCCCCcCCCCccccccCCCCCCCCCCCC-----CCCCCCCcccccC-----CceeeccCCCC
Q psy13146 407 TCGEGAICDVVNHNVMCICPPGTTGSPFIQCKPILQEPVYTNPCQP-----SPCGPNSQCREVN-----KQAVCSCLPNY 476 (895)
Q Consensus 407 ~C~~~~~C~~~~g~y~C~C~~Gy~G~~~~~C~~~~~~~~~~~eC~~-----~~C~~~g~C~~~~-----g~y~C~C~~Gy 476 (895)
.|.+ |.-+...+-|.|.|.+||......+|++. .+|.. .+|...+.|++.. ..|+|.|.+||
T Consensus 7 ~CKN-G~LiQMSNHfEC~Cnegfvl~~EntCE~k-------v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY 78 (197)
T PF06247_consen 7 ICKN-GYLIQMSNHFECKCNEGFVLKNENTCEEK-------VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGY 78 (197)
T ss_dssp --BT-EEEEEESSEEEEEESTTEEEEETTEEEE-----------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTE
T ss_pred cccC-CEEEEccCceEEEcCCCcEEccccccccc-------eecCcccccCccccchhhhhcCCCcccceeEEEecccCc
Confidence 3443 57778888999999999987765668764 45542 4798899998765 57999999999
Q ss_pred cCCCCCCC-CCCccCCCCCCCccccCCcccCCCCCCCCCCCeeeee---CCCeeeeCCCCCcCCCCccccCCC----CCC
Q psy13146 477 FGSPPACR-PECTVNTDCPLDKACVNQKCVDPCPGSCGQNANCRVI---NHNAVCNCKPGFTGEPRIRCSKIP----PRS 548 (895)
Q Consensus 477 ~g~~~~C~-~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~~~---~g~~~C~C~~Gy~G~~~~~C~~~~----~~~ 548 (895)
+.....|+ ..|. + -.|. .|.|+-. +....|+|.-|+.-+....|..-. .-.
T Consensus 79 ~~~~~vCvp~~C~------------~--------~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LK 137 (197)
T PF06247_consen 79 ILKQGVCVPNKCN------------N--------KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLK 137 (197)
T ss_dssp EESSSSEEEGGGS------------S-----------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE-------
T ss_pred eeeCCeEchhhcC------------c--------eecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeee
Confidence 98754443 2222 1 1344 6788743 234699999999833222333111 234
Q ss_pred CCCCCeeeecCCcccccCCCCCccCCC
Q psy13146 549 CGYNAECKVINHTPICTCPQGYVGDAF 575 (895)
Q Consensus 549 C~~~g~C~~~~gs~~C~C~~Gy~G~~c 575 (895)
|..+-.|....+-|+|.+.+||.++.-
T Consensus 138 Ck~nE~CK~~~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 138 CKENEECKLVDGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp -TTTEEEEEETTEEEEEE-TT-EEETT
T ss_pred cCCCcceeeeCcEEEeecCCCCCCCCC
Confidence 667789999999999999999987653
No 31
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.91 E-value=0.00077 Score=43.26 Aligned_cols=24 Identities=29% Similarity=0.569 Sum_probs=19.8
Q ss_pred CCeEeCCCCCcCCCCCCCCCCCCCCC
Q psy13146 147 SPSCSCLPNYIGAPPNCRPECVQNND 172 (895)
Q Consensus 147 sy~C~C~~Gy~g~~~~C~~~C~d~~~ 172 (895)
||+|+|++||++. .-.+.|+||||
T Consensus 1 sy~C~C~~Gy~l~--~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLS--PDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCC--CCCCccccCCC
Confidence 7999999999987 44567888876
No 32
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.62 E-value=0.0024 Score=45.30 Aligned_cols=30 Identities=30% Similarity=0.853 Sum_probs=27.2
Q ss_pred CC-CCCCCCCCceeecCCceeeecCCCcccC
Q psy13146 19 CQ-PSPCGPNSQCREVNKQAVCSCLPNYFGS 48 (895)
Q Consensus 19 C~-~~~C~~~~~C~~~~g~~~C~C~~Gf~g~ 48 (895)
|. ..+|.++++|+++.++|+|.|++||.|.
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 44 6789999999999999999999999987
No 33
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.59 E-value=0.0018 Score=41.60 Aligned_cols=14 Identities=43% Similarity=1.099 Sum_probs=12.0
Q ss_pred CCeeecCCCCcCCC
Q psy13146 93 NPICNCKPGYTGDP 106 (895)
Q Consensus 93 ~y~C~C~~Gy~g~~ 106 (895)
||+|+|++||+...
T Consensus 1 sy~C~C~~Gy~l~~ 14 (24)
T PF12662_consen 1 SYTCSCPPGYQLSP 14 (24)
T ss_pred CEEeeCCCCCcCCC
Confidence 69999999999654
No 34
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.37 E-value=0.0042 Score=44.16 Aligned_cols=30 Identities=33% Similarity=0.891 Sum_probs=26.5
Q ss_pred CCCC-CCCCCCCceeecCCceeeecCCCcccC
Q psy13146 18 PCQP-SPCGPNSQCREVNKQAVCSCLPNYFGS 48 (895)
Q Consensus 18 ~C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~g~ 48 (895)
+|.. ++|.++ +|+++.++|+|+|++||+|.
T Consensus 1 ~C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCcCCCCCC-EEECCCCCeEeECCCCCccC
Confidence 4666 689988 99999999999999999983
No 35
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.23 E-value=0.0046 Score=43.94 Aligned_cols=28 Identities=43% Similarity=1.019 Sum_probs=24.8
Q ss_pred CCC-CCCCCCCcccccCCCCeEeCCCCCcC
Q psy13146 130 CYP-SPCGPYSQCRDIGGSPSCSCLPNYIG 158 (895)
Q Consensus 130 C~~-~~C~~~g~C~n~~gsy~C~C~~Gy~g 158 (895)
|.. ++|.++ +|+++.++|+|.|++||.|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence 444 688887 9999999999999999999
No 36
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.08 E-value=0.0082 Score=42.46 Aligned_cols=30 Identities=33% Similarity=0.815 Sum_probs=26.2
Q ss_pred CCCCCCCCeeeccCCceeeeCCCCCccC-Cc
Q psy13146 649 PGTCGEGAICDVINHAVSCNCPPGTTGS-PF 678 (895)
Q Consensus 649 ~~~C~~~g~C~~~~gs~~C~C~~Gy~G~-~c 678 (895)
..+|.++++|.+..++|+|.|++||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 4578888999999999999999999998 54
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.87 E-value=0.03 Score=38.94 Aligned_cols=27 Identities=26% Similarity=0.546 Sum_probs=22.8
Q ss_pred CCCCCCCeeeccCCceeeeCCCCCccCCc
Q psy13146 650 GTCGEGAICDVINHAVSCNCPPGTTGSPF 678 (895)
Q Consensus 650 ~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c 678 (895)
..|+++|+|+...+ +|+|++||+|..|
T Consensus 6 ~~C~~~G~C~~~~g--~C~C~~g~~G~~C 32 (32)
T PF07974_consen 6 NICSGHGTCVSPCG--RCVCDSGYTGPDC 32 (32)
T ss_pred CccCCCCEEeCCCC--EEECCCCCcCCCC
Confidence 36889999987744 9999999999865
No 38
>KOG1218|consensus
Probab=94.72 E-value=1.7 Score=47.82 Aligned_cols=160 Identities=24% Similarity=0.585 Sum_probs=79.2
Q ss_pred CCeeeeCCCCCcCC-CCccccCCCCCCCCCCCeeeecCCcccccCCCCCccCCCCC-CCCCCCCCCCCcccCCCCCCCCC
Q psy13146 523 HNAVCNCKPGFTGE-PRIRCSKIPPRSCGYNAECKVINHTPICTCPQGYVGDAFSG-CYPKPPEPEQPVVQEDTCNCVPN 600 (895)
Q Consensus 523 g~~~C~C~~Gy~G~-~~~~C~~~~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~-C~~~~~~~~~~~~~~~~C~C~~~ 600 (895)
.+..|.|.+||+|. .... ... ...+.. .+.....+..|.+..+|.|..+.. +.... ........+.|..+
T Consensus 13 ~~~~c~c~~~~~g~~~~~~-~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~----~~~~c~~~~~c~~~ 84 (316)
T KOG1218|consen 13 GSGQCFCDPGYTGRLQCEH-QAV-TSACSG--ICPCEVNSGECGLGYGFVGSVCRIECVCGN----AGGGCSQPCRCKNG 84 (316)
T ss_pred CCCceecCCCccccccccC-CCC-Cccccc--cCCccCCceeEecccccCCCccccccccCC----CCCcccCccccCCC
Confidence 36789999999995 1111 111 111111 111133456788899999888652 11100 01111222234444
Q ss_pred CcccCceeee-CCCcccCCCcccCCCcccCCCCCCCCccccCCccCCCCCCCCCCCCeeeccCCceeeeCCCCCccCCcc
Q psy13146 601 AECRDGVCVC-LPEFYGDGYVSCRPECVLNNDCPSNKACIRNKCKNPCVPGTCGEGAICDVINHAVSCNCPPGTTGSPFV 679 (895)
Q Consensus 601 g~C~~~~C~C-~~Gy~G~~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~ 679 (895)
.......-.+ ..+|.|.. |+...+|.. . |.. .+|.+... .|.+..+|.+..|.
T Consensus 85 ~~~~~~~~~~~~~~~~g~~-------C~~~~~~~~---------------~-c~~-~~C~~~~~--~c~~~~~~~~~~C~ 138 (316)
T KOG1218|consen 85 GTCVSSTGYCHLNGYEGPQ-------CESPCPCGD---------------G-CAE-KTCANPRR--ECRCGGGYIGEQCG 138 (316)
T ss_pred CcccCCCCcccCCCCCccc-------ccCCCCcCC---------------c-ccc-cccCCCcc--ceecCCcCcccccc
Confidence 4444434444 57777774 333222111 1 222 34444432 57777777777765
Q ss_pred ccCC-CCccCCCCCCCCCCeecCceeeCCCCccCCCCc
Q psy13146 680 QSEQ-PVVQEDTCNCVPNAECRDGVCVCLPEFYGDGYV 716 (895)
Q Consensus 680 ~~~~-~~~~~~~c~C~~~g~C~~~~C~C~~G~~G~~c~ 716 (895)
.... ...-...+.+..+..+.++.|.|++||.|..+.
T Consensus 139 ~~~~~g~~C~~~c~~~~~~~~~~~~c~c~~g~~g~~~~ 176 (316)
T KOG1218|consen 139 EENLVGLKCQRDCQCTGGCDCKNGICTCQPGFVGVFCV 176 (316)
T ss_pred ccCCCCCCccCCCCCccccCCCCCceeccCCccccccc
Confidence 3000 000011112344445567899999999999765
No 39
>KOG1218|consensus
Probab=94.46 E-value=4.3 Score=44.65 Aligned_cols=160 Identities=24% Similarity=0.634 Sum_probs=75.2
Q ss_pred ceeeeCCCCCcCCCCccccccCCCCCCCCCCCCCCCCCCCcccccCCceeeccCCCCcCCCCCCCCCCccCCCCCCCccc
Q psy13146 420 NVMCICPPGTTGSPFIQCKPILQEPVYTNPCQPSPCGPNSQCREVNKQAVCSCLPNYFGSPPACRPECTVNTDCPLDKAC 499 (895)
Q Consensus 420 ~y~C~C~~Gy~G~~~~~C~~~~~~~~~~~eC~~~~C~~~g~C~~~~g~y~C~C~~Gy~g~~~~C~~~C~~~~~C~~~~~C 499 (895)
+..|.|.+||+|. ..+.. ..+.. ++.. .+.......+|.+..+|.+. .|...... ..
T Consensus 14 ~~~c~c~~~~~g~--~~~~~-------~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~~------~c~~~~~~----~~ 70 (316)
T KOG1218|consen 14 SGQCFCDPGYTGR--LQCEH-------QAVTS--ACSG--ICPCEVNSGECGLGYGFVGS------VCRIECVC----GN 70 (316)
T ss_pred CCceecCCCcccc--ccccC-------CCCCc--cccc--cCCccCCceeEecccccCCC------cccccccc----CC
Confidence 4589999999996 11331 11111 1110 11113344568888888887 33322110 00
Q ss_pred cCCcccCCCCCCCCCCCeeeeeCCCeeeeC-CCCCcCCCCccccCCCCCCCCCCCeeeecCCcccccCCCCCccCCCCCC
Q psy13146 500 VNQKCVDPCPGSCGQNANCRVINHNAVCNC-KPGFTGEPRIRCSKIPPRSCGYNAECKVINHTPICTCPQGYVGDAFSGC 578 (895)
Q Consensus 500 ~~~~C~~~C~~~C~~~g~C~~~~g~~~C~C-~~Gy~G~~~~~C~~~~~~~C~~~g~C~~~~gs~~C~C~~Gy~G~~c~~C 578 (895)
....|...+ .|..+.... .+...+ ..+|.|..+..-.++... |.. .+|.+... .|.+..+|.+..|..
T Consensus 71 ~~~~c~~~~--~c~~~~~~~----~~~~~~~~~~~~g~~C~~~~~~~~~-c~~-~~C~~~~~--~c~~~~~~~~~~C~~- 139 (316)
T KOG1218|consen 71 AGGGCSQPC--RCKNGGTCV----SSTGYCHLNGYEGPQCESPCPCGDG-CAE-KTCANPRR--ECRCGGGYIGEQCGE- 139 (316)
T ss_pred CCCcccCcc--ccCCCCccc----CCCCcccCCCCCcccccCCCCcCCc-ccc-cccCCCcc--ceecCCcCccccccc-
Confidence 111111111 133333333 233344 678888753221121111 222 45554332 577777777777652
Q ss_pred CCCCCCCCCCcccCCCCCCCCCCcccCceeeeCCCcccCC
Q psy13146 579 YPKPPEPEQPVVQEDTCNCVPNAECRDGVCVCLPEFYGDG 618 (895)
Q Consensus 579 ~~~~~~~~~~~~~~~~C~C~~~g~C~~~~C~C~~Gy~G~~ 618 (895)
+. .........|.+..+..+.++.|.|.+||.|..
T Consensus 140 -~~----~~g~~C~~~c~~~~~~~~~~~~c~c~~g~~g~~ 174 (316)
T KOG1218|consen 140 -EN----LVGLKCQRDCQCTGGCDCKNGICTCQPGFVGVF 174 (316)
T ss_pred -cC----CCCCCccCCCCCccccCCCCCceeccCCccccc
Confidence 00 000011111223444455678899999999996
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=94.32 E-value=0.018 Score=41.19 Aligned_cols=21 Identities=43% Similarity=0.754 Sum_probs=18.1
Q ss_pred CcccccCCCCeEeCCCCCcCC
Q psy13146 139 SQCRDIGGSPSCSCLPNYIGA 159 (895)
Q Consensus 139 g~C~n~~gsy~C~C~~Gy~g~ 159 (895)
.+|++++++|+|.|++||++.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-
T ss_pred CCCccCCCceEeECCCCCEEC
Confidence 489999999999999999998
No 41
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.19 E-value=0.044 Score=38.17 Aligned_cols=24 Identities=29% Similarity=0.685 Sum_probs=21.3
Q ss_pred CCCCCCeec--CceeeCCCCccCCCC
Q psy13146 692 NCVPNAECR--DGVCVCLPEFYGDGY 715 (895)
Q Consensus 692 ~C~~~g~C~--~~~C~C~~G~~G~~c 715 (895)
.|.++|+|+ .++|+|.+||+|..|
T Consensus 7 ~C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCCCCEEECCCCCcCCCC
Confidence 588999999 689999999999853
No 42
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=93.01 E-value=0.062 Score=38.49 Aligned_cols=23 Identities=30% Similarity=0.658 Sum_probs=19.0
Q ss_pred CCCCCceeecCCceeeecCCCcccC
Q psy13146 24 CGPNSQCREVNKQAVCSCLPNYFGS 48 (895)
Q Consensus 24 C~~~~~C~~~~g~~~C~C~~Gf~g~ 48 (895)
|++ +|++++++|+|+|++||+..
T Consensus 8 C~h--~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 8 CSH--ICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SSS--EEEEETTSEEEE-STTEEE-
T ss_pred cCC--CCccCCCceEeECCCCCEEC
Confidence 555 89999999999999999876
No 43
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=92.71 E-value=0.049 Score=29.68 Aligned_cols=13 Identities=46% Similarity=1.211 Sum_probs=10.5
Q ss_pred eeeCCCCCccCCC
Q psy13146 206 ICTCPDGYTGDAF 218 (895)
Q Consensus 206 ~C~C~~Gy~G~~c 218 (895)
+|+|++||+|+.|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5899999999875
No 44
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=92.53 E-value=0.047 Score=29.75 Aligned_cols=11 Identities=64% Similarity=1.531 Sum_probs=5.6
Q ss_pred eeCCCCCccCC
Q psy13146 667 CNCPPGTTGSP 677 (895)
Q Consensus 667 C~C~~Gy~G~~ 677 (895)
|+|++||+|..
T Consensus 2 C~C~~G~~G~~ 12 (13)
T PF12661_consen 2 CQCPPGWTGPN 12 (13)
T ss_dssp EEE-TTEETTT
T ss_pred ccCcCCCcCCC
Confidence 55555555554
No 45
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=90.78 E-value=0.088 Score=37.53 Aligned_cols=31 Identities=26% Similarity=0.609 Sum_probs=22.3
Q ss_pred CCCCCCCCCCceeecC-CceeeecCCCcccCC
Q psy13146 19 CQPSPCGPNSQCREVN-KQAVCSCLPNYFGSP 49 (895)
Q Consensus 19 C~~~~C~~~~~C~~~~-g~~~C~C~~Gf~g~~ 49 (895)
|...+|..|+.|.+.. |++.|.|.+||...+
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 5567888999999876 999999999998664
No 46
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.24 E-value=0.38 Score=50.37 Aligned_cols=37 Identities=22% Similarity=0.414 Sum_probs=31.1
Q ss_pred CCCCCCCCCCCC--CCCCCCCcccccCCCCeEeCCCCCcCC
Q psy13146 121 EDVPEPVNPCYP--SPCGPYSQCRDIGGSPSCSCLPNYIGA 159 (895)
Q Consensus 121 ~~~~~dideC~~--~~C~~~g~C~n~~gsy~C~C~~Gy~g~ 159 (895)
...|.++++|.. ++|. ..|.++.|+|.|.|++||++.
T Consensus 181 ~~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 456788999974 4565 589999999999999999986
No 47
>KOG3512|consensus
Probab=88.15 E-value=3.1 Score=46.60 Aligned_cols=158 Identities=24% Similarity=0.450 Sum_probs=89.0
Q ss_pred CCeeeecCCC-ceeeCCCCCccCCCccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-------------------Cce
Q psy13146 195 NALCKVINHT-PICTCPDGYTGDAFSGCYPKPPEPPPPPQEDIPEPINPCYPSPCGPY-------------------SQC 254 (895)
Q Consensus 195 ~g~C~n~~g~-~~C~C~~Gy~G~~c~~C~~~~~~~~~~~~~~~~~dideC~~~~C~~~-------------------g~C 254 (895)
...|+-...+ ++|.|..+-+|..|..|.+.+..-+=.+ .+-.++++|....|..+ ++|
T Consensus 284 As~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~r--aT~~~a~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvC 361 (592)
T KOG3512|consen 284 ASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGR--ATALPANECVACNCNGHARRCRFNMELYRLSGRRSGGVC 361 (592)
T ss_pred cceeeeccCCceEEecccCCCCCCcccccccccCCCccc--cccCCCccccccccchhhhhcccchhhhcccCccccceE
Confidence 3478877665 9999999999999998877654321111 11255677765555433 345
Q ss_pred e----cCCCCCee-eCCCCCcCCCCCCCccccCCCCCCCcccccccCcCCCCCCCCC----CCCeeeccCCCCcccCCCC
Q psy13146 255 R----DINGSPSC-SCLPSYIGAPPNCRPECIQNSECPYDKACINEKCADPCPGSCG----YGAVCTVINHSPICTCPEG 325 (895)
Q Consensus 255 ~----n~~gsy~C-~C~~G~~g~~~~C~~~C~~~~eC~~~~~C~~~~C~~~C~~~C~----~~g~C~~~~g~~~C~C~~G 325 (895)
+ |+.|. .| .|.+||.-++.. .++ ..+.|..= .|+ .+-+|..+.| +|.|.+|
T Consensus 362 lnCrHnTaGr-hChyCreGyyRd~s~------pl~---hrkaCk~C--------dChpVGs~gktCNq~tG--qCpCkeG 421 (592)
T KOG3512|consen 362 LNCRHNTAGR-HCHYCREGYYRDGSK------PLT---HRKACKAC--------DCHPVGSAGKTCNQTTG--QCPCKEG 421 (592)
T ss_pred eecccCCCCc-ccccccCccccCCCC------CCc---hhhhhhhc--------CCcccccccccccccCC--cccCCCC
Confidence 4 34443 45 689999866421 000 00111110 122 2345665544 8999999
Q ss_pred cccCCCCcCCCCCCC---CCCCCCCCCCC---CCCCCCeeeCceeeeCCCcccCC
Q psy13146 326 YIGDAFSSCYPKPPE---PVQPVIQEDTC---NCAPNAECRDGVCLCLPDYYGDG 374 (895)
Q Consensus 326 y~G~~c~~c~~~~~~---~~~~c~~~~~C---~C~~~g~C~~~~C~C~~Gy~G~~ 374 (895)
-+|..|+.|.+.-.. ++.+|+-++.= .++++++=.+..+.|+.++.|..
T Consensus 422 vtG~tCnrCa~gyqqsrs~vapcik~p~~~~~~~~s~ve~qd~~s~Ck~~~~~~r 476 (592)
T KOG3512|consen 422 VTGLTCNRCAPGYQQSRSPVAPCIKIPTDAPTLGSSGVEPQDQCSKCKASPGGKR 476 (592)
T ss_pred CcccccccccchhhcccCCCcCceecCCCCccccCCCCcchhccccCCCCCccee
Confidence 999999988764221 11222222111 13444443345578888887764
No 48
>smart00051 DSL delta serrate ligand.
Probab=87.67 E-value=0.69 Score=37.81 Aligned_cols=47 Identities=17% Similarity=0.308 Sum_probs=32.3
Q ss_pred ceeeeCCCCCccCCccccCCCCccCCCCCCCCCCeec-CceeeCCCCccCCCC
Q psy13146 664 AVSCNCPPGTTGSPFVQSEQPVVQEDTCNCVPNAECR-DGVCVCLPEFYGDGY 715 (895)
Q Consensus 664 s~~C~C~~Gy~G~~c~~~~~~~~~~~~c~C~~~g~C~-~~~C~C~~G~~G~~c 715 (895)
.++=.|+++|.|..|+..-.+ ......+.+|. ++.++|++||+|..|
T Consensus 16 ~~rv~C~~~~yG~~C~~~C~~-----~~d~~~~~~Cd~~G~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGEGCNKFCRP-----RDDFFGHYTCDENGNKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCCccCCEeCc-----CccccCCccCCcCCCEecCCCCcCCCC
Confidence 456789999999998532111 11235667775 468999999999753
No 49
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=85.96 E-value=0.2 Score=35.75 Aligned_cols=34 Identities=26% Similarity=0.648 Sum_probs=23.4
Q ss_pred CCCCCCCCCCceecCC-CceeccCCCCCcCCCCCC
Q psy13146 790 CQPSPCGPNSQCREVN-KQAVCSCLPNYFGSPPAC 823 (895)
Q Consensus 790 C~~~~C~~~~~C~~~~-g~y~C~C~~Gy~G~~~~C 823 (895)
|....|..++.|.+.. |+++|.|.+||..++..|
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~~~C 36 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVGGKC 36 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEETTEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccCCCc
Confidence 4445688899999887 999999999998765444
No 50
>smart00051 DSL delta serrate ligand.
Probab=85.17 E-value=1.1 Score=36.58 Aligned_cols=48 Identities=25% Similarity=0.459 Sum_probs=33.0
Q ss_pred CCeEeCCCCCcCCCCCCCCCCCCCCCCCCCCccccCCcCCCCCCCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13146 147 SPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAF 218 (895)
Q Consensus 147 sy~C~C~~Gy~g~~~~C~~~C~d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~~g~~~C~C~~Gy~G~~c 218 (895)
.|+=.|+++|.|. .|.+.|...+.. ..+.+|.. .| .++|.+||+|..|
T Consensus 16 ~~rv~C~~~~yG~--~C~~~C~~~~d~-------------------~~~~~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T smart00051 16 QIRVTCDENYYGE--GCNKFCRPRDDF-------------------FGHYTCDE-NG--NKGCLEGWMGPYC 63 (63)
T ss_pred EEEeeCCCCCcCC--ccCCEeCcCccc-------------------cCCccCCc-CC--CEecCCCCcCCCC
Confidence 4556899999999 888777543322 23455643 23 6889999999875
No 51
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=82.53 E-value=1.6 Score=33.83 Aligned_cols=21 Identities=38% Similarity=0.721 Sum_probs=15.6
Q ss_pred eeeccCCceeeeCCCCCccCCcc
Q psy13146 657 ICDVINHAVSCNCPPGTTGSPFV 679 (895)
Q Consensus 657 ~C~~~~gs~~C~C~~Gy~G~~c~ 679 (895)
.|+...| +|.|.+||+|..|+
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~ 33 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCD 33 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCC
Confidence 3655555 88899999998874
No 52
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=81.94 E-value=1.2 Score=46.50 Aligned_cols=37 Identities=24% Similarity=0.421 Sum_probs=30.4
Q ss_pred CCCCCCCCCCCC--CCCCCCCceecCCCCCeeeCCCCCcCC
Q psy13146 234 EDIPEPINPCYP--SPCGPYSQCRDINGSPSCSCLPSYIGA 272 (895)
Q Consensus 234 ~~~~~dideC~~--~~C~~~g~C~n~~gsy~C~C~~G~~g~ 272 (895)
..+|.++++|.. ++|. ..|.++.|+|.|.|++||+..
T Consensus 181 ~~~C~~~~~C~~~~~~c~--~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICVVPDLCATLSHVCQ--QVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCcCchhhcCCCCCcc--ceEEcCCCCEEeECCCCccCC
Confidence 345788999973 4565 589999999999999999875
No 53
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=81.08 E-value=0.98 Score=34.78 Aligned_cols=22 Identities=41% Similarity=0.801 Sum_probs=16.5
Q ss_pred CeeeccCCceeeeCCCCCccCCcc
Q psy13146 656 AICDVINHAVSCNCPPGTTGSPFV 679 (895)
Q Consensus 656 g~C~~~~gs~~C~C~~Gy~G~~c~ 679 (895)
.+|+...| +|.|.++|+|..|+
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~ 32 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCD 32 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-
T ss_pred CcccCCCC--EEeccccccCCcCc
Confidence 46766555 89999999999884
No 54
>KOG3512|consensus
Probab=80.69 E-value=11 Score=42.61 Aligned_cols=93 Identities=20% Similarity=0.412 Sum_probs=52.5
Q ss_pred Ceeee-CCCCCcCCCC------ccccCCCCCCCC-CCCeeeecCCcccccCCCCCccCCCCCCCCCCCCCCC---CcccC
Q psy13146 524 NAVCN-CKPGFTGEPR------IRCSKIPPRSCG-YNAECKVINHTPICTCPQGYVGDAFSGCYPKPPEPEQ---PVVQE 592 (895)
Q Consensus 524 ~~~C~-C~~Gy~G~~~------~~C~~~~~~~C~-~~g~C~~~~gs~~C~C~~Gy~G~~c~~C~~~~~~~~~---~~~~~ 592 (895)
+-.|. |.+||.-+.. ..|..++=+|=. .+-+|..+ +.+|.|++|-+|..|+.|.+.....-. +++.+
T Consensus 370 GrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~--tGqCpCkeGvtG~tCnrCa~gyqqsrs~vapcik~ 447 (592)
T KOG3512|consen 370 GRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQT--TGQCPCKEGVTGLTCNRCAPGYQQSRSPVAPCIKI 447 (592)
T ss_pred CcccccccCccccCCCCCCchhhhhhhcCCccccccccccccc--CCcccCCCCCcccccccccchhhcccCCCcCceec
Confidence 34554 9999986652 123333211111 12345433 568999999999999999887654322 12111
Q ss_pred CCC---CCCCCCcccCceeeeCCCcccCC
Q psy13146 593 DTC---NCVPNAECRDGVCVCLPEFYGDG 618 (895)
Q Consensus 593 ~~C---~C~~~g~C~~~~C~C~~Gy~G~~ 618 (895)
..= .+.++.+=.+..+.|+.++.|..
T Consensus 448 p~~~~~~~~s~ve~qd~~s~Ck~~~~~~r 476 (592)
T KOG3512|consen 448 PTDAPTLGSSGVEPQDQCSKCKASPGGKR 476 (592)
T ss_pred CCCCccccCCCCcchhccccCCCCCccee
Confidence 111 24444442244578888887764
No 55
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=77.14 E-value=2.7 Score=31.96 Aligned_cols=21 Identities=33% Similarity=0.713 Sum_probs=15.8
Q ss_pred eeeccCCceeeeCCCCCccCCcc
Q psy13146 657 ICDVINHAVSCNCPPGTTGSPFV 679 (895)
Q Consensus 657 ~C~~~~gs~~C~C~~Gy~G~~c~ 679 (895)
.|+...| +|.|+++|+|..|+
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~ 32 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCD 32 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCC
Confidence 4554455 89999999998874
No 56
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=73.33 E-value=5.3 Score=30.33 Aligned_cols=28 Identities=36% Similarity=0.794 Sum_probs=19.4
Q ss_pred eEeecCCceeeeCCCCCcCCCCcccccccc
Q psy13146 753 ICDVINHAVSCNCPPGTTGSPFVQCKPIQY 782 (895)
Q Consensus 753 ~C~~~~g~y~C~C~~Gy~g~~~~~C~~~~~ 782 (895)
.|....| +|.|++||+|..+.+|+++.|
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~~ 39 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDRCAPGYY 39 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCcCCCCcC
Confidence 4544444 899999999988555554444
No 57
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=72.63 E-value=4.1 Score=31.49 Aligned_cols=28 Identities=39% Similarity=0.800 Sum_probs=19.8
Q ss_pred eEeecCCceeeeCCCCCcCCCCcccccccc
Q psy13146 753 ICDVINHAVSCNCPPGTTGSPFVQCKPIQY 782 (895)
Q Consensus 753 ~C~~~~g~y~C~C~~Gy~g~~~~~C~~~~~ 782 (895)
.|....| +|.|++||+|..+.+|+++.+
T Consensus 13 ~C~~~~G--~C~C~~~~~G~~C~~C~~g~~ 40 (50)
T cd00055 13 QCDPGTG--QCECKPNTTGRRCDRCAPGYY 40 (50)
T ss_pred cccCCCC--EEeCCCcCCCCCCCCCCCCCc
Confidence 4655555 899999999998555554443
No 58
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=68.92 E-value=3.3 Score=31.82 Aligned_cols=29 Identities=45% Similarity=0.882 Sum_probs=20.5
Q ss_pred CeEeecCCceeeeCCCCCcCCCCcccccccc
Q psy13146 752 AICDVINHAVSCNCPPGTTGSPFVQCKPIQY 782 (895)
Q Consensus 752 ~~C~~~~g~y~C~C~~Gy~g~~~~~C~~~~~ 782 (895)
++|....| +|.|+++|+|..+.+|.++.|
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~C~~g~~ 39 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQCKPGYF 39 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-EE-TTEE
T ss_pred CcccCCCC--EEeccccccCCcCcCCCCccc
Confidence 57777555 999999999999655655444
No 59
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=64.42 E-value=8.6 Score=29.89 Aligned_cols=22 Identities=27% Similarity=0.847 Sum_probs=18.7
Q ss_pred CCCCCCcccCceeeeCCCcccC
Q psy13146 596 NCVPNAECRDGVCVCLPEFYGD 617 (895)
Q Consensus 596 ~C~~~g~C~~~~C~C~~Gy~G~ 617 (895)
.|..++.|++++|.|++||.-.
T Consensus 27 qC~~~s~C~~g~C~C~~g~~~~ 48 (52)
T PF01683_consen 27 QCIGGSVCVNGRCQCPPGYVEV 48 (52)
T ss_pred CCCCcCEEcCCEeECCCCCEec
Confidence 3668899999999999998754
No 60
>PHA02887 EGF-like protein; Provisional
Probab=62.02 E-value=7.2 Score=35.46 Aligned_cols=29 Identities=24% Similarity=0.363 Sum_probs=21.8
Q ss_pred CCCCCCeee--ccCCceeeeCCCCCccCCccc
Q psy13146 651 TCGEGAICD--VINHAVSCNCPPGTTGSPFVQ 680 (895)
Q Consensus 651 ~C~~~g~C~--~~~gs~~C~C~~Gy~G~~c~~ 680 (895)
-|- +|+|. .......|.|++||+|.+|+.
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 455 46884 334567999999999999964
No 61
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=60.72 E-value=7.8 Score=30.14 Aligned_cols=31 Identities=19% Similarity=0.339 Sum_probs=20.0
Q ss_pred cCcCCCCCCCCCccccCCCcccceecCCCcceeecCcc
Q psy13146 826 ECTVNSDCPLNKACFNQKCVYTYSISTFCIWYTVAGVF 863 (895)
Q Consensus 826 eC~~~~~C~~~~~C~n~~g~~~C~C~~~~~g~~~~g~~ 863 (895)
.|..+..|..++.|++ -+|.|++ ||+..+..
T Consensus 21 ~C~~~~qC~~~s~C~~----g~C~C~~---g~~~~~~~ 51 (52)
T PF01683_consen 21 SCESDEQCIGGSVCVN----GRCQCPP---GYVEVGGR 51 (52)
T ss_pred CCCCcCCCCCcCEEcC----CEeECCC---CCEecCCC
Confidence 4555556666777755 3688888 88766543
No 62
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=56.59 E-value=3.5 Score=33.71 Aligned_cols=48 Identities=29% Similarity=0.511 Sum_probs=20.5
Q ss_pred CCeEeCCCCCcCCCCCCCCCCCCCCCCCCCCccccCCcCCCCCCCCCCCCeeeecCCCceeeCCCCCccCCC
Q psy13146 147 SPSCSCLPNYIGAPPNCRPECVQNNDCSNDKACINEKCQDPCPGSCGYNALCKVINHTPICTCPDGYTGDAF 218 (895)
Q Consensus 147 sy~C~C~~Gy~g~~~~C~~~C~d~~~C~~~~~C~~~~C~~~C~~~C~~~g~C~n~~g~~~C~C~~Gy~G~~c 218 (895)
+++-.|.+.|.|. .|...|...+.=. .+-+|.. .| .=+|.+||+|..|
T Consensus 16 ~~rv~C~~nyyG~--~C~~~C~~~~d~~-------------------ghy~Cd~-~G--~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 16 RIRVVCDENYYGP--NCSKFCKPRDDSF-------------------GHYTCDS-NG--NKVCLPGWTGPNC 63 (63)
T ss_dssp -------TTEETT--TT-EE---EEETT-------------------EEEEE-S-S----EEE-TTEESTTS
T ss_pred EEEEECCCCCCCc--cccCCcCCCcCCc-------------------CCcccCC-CC--CCCCCCCCcCCCC
Confidence 5678899999999 8876665322100 1223332 22 3478999999875
No 63
>KOG3516|consensus
Probab=52.82 E-value=11 Score=47.42 Aligned_cols=42 Identities=29% Similarity=0.673 Sum_probs=37.8
Q ss_pred CCCCCCCCCCCCCCCCCCCcccccCCCCeEeCC-CCCcCCCCCCC
Q psy13146 121 EDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCL-PNYIGAPPNCR 164 (895)
Q Consensus 121 ~~~~~dideC~~~~C~~~g~C~n~~gsy~C~C~-~Gy~g~~~~C~ 164 (895)
.++|..+|.|.++||.++|.|...-..|.|.|. .||.|. .|.
T Consensus 539 id~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH 581 (1306)
T KOG3516|consen 539 IDMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH 581 (1306)
T ss_pred ecccccccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence 467788899999999999999998889999998 899998 776
No 64
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=50.72 E-value=12 Score=34.71 Aligned_cols=32 Identities=28% Similarity=0.464 Sum_probs=23.7
Q ss_pred CCCCCCCeee--ccCCceeeeCCCCCccCCccccC
Q psy13146 650 GTCGEGAICD--VINHAVSCNCPPGTTGSPFVQSE 682 (895)
Q Consensus 650 ~~C~~~g~C~--~~~gs~~C~C~~Gy~G~~c~~~~ 682 (895)
+-|-+| +|. .....+.|.|..||+|.+|+..+
T Consensus 51 ~YClHG-~C~yI~dl~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 51 GYCLHG-DCIHARDIDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred CEeECC-EEEeeccCCCceeECCCCccccccccee
Confidence 356554 784 34467899999999999997543
No 65
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=49.86 E-value=14 Score=33.74 Aligned_cols=33 Identities=36% Similarity=0.873 Sum_probs=25.8
Q ss_pred CCCCCCC-CCCCCCCceeecCCceeeecCCCcccC
Q psy13146 15 YTNPCQP-SPCGPNSQCREVNKQAVCSCLPNYFGS 48 (895)
Q Consensus 15 ~~d~C~~-~~C~~~~~C~~~~g~~~C~C~~Gf~g~ 48 (895)
..|+|+. ..|..+|.|.. ..+-.|.|++||.-.
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcCCC
Confidence 4568874 78999999964 456679999999754
No 66
>PHA02887 EGF-like protein; Provisional
Probab=47.05 E-value=13 Score=33.81 Aligned_cols=30 Identities=27% Similarity=0.585 Sum_probs=23.7
Q ss_pred CCCCCCCeeecc--CCCCcccCCCCcccCCCCc
Q psy13146 303 GSCGYGAVCTVI--NHSPICTCPEGYIGDAFSS 333 (895)
Q Consensus 303 ~~C~~~g~C~~~--~g~~~C~C~~Gy~G~~c~~ 333 (895)
+-|. +|+|.-. ...+.|.|++||+|..|+.
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccCCCCc
Confidence 4566 5799865 3568999999999999873
No 67
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=45.71 E-value=16 Score=33.97 Aligned_cols=30 Identities=27% Similarity=0.545 Sum_probs=24.0
Q ss_pred CCCCCCCeeecc--CCCCcccCCCCcccCCCCc
Q psy13146 303 GSCGYGAVCTVI--NHSPICTCPEGYIGDAFSS 333 (895)
Q Consensus 303 ~~C~~~g~C~~~--~g~~~C~C~~Gy~G~~c~~ 333 (895)
+-|.+ |+|.-. ...+.|.|..||+|..|+.
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGeRCEh 82 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGIRCQH 82 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCcccccccc
Confidence 45776 489765 3679999999999999974
No 68
>KOG3516|consensus
Probab=39.45 E-value=21 Score=44.91 Aligned_cols=43 Identities=30% Similarity=0.682 Sum_probs=37.2
Q ss_pred CCCCCCCCCCCCCCCCCCCCceecCCCCCeeeCC-CCCcCCCCCCC
Q psy13146 233 QEDIPEPINPCYPSPCGPYSQCRDINGSPSCSCL-PSYIGAPPNCR 277 (895)
Q Consensus 233 ~~~~~~dideC~~~~C~~~g~C~n~~gsy~C~C~-~G~~g~~~~C~ 277 (895)
+.+.|.-+|.|.+++|.++|.|.-....|.|.|. .||.|. .|.
T Consensus 538 ~id~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga--tCH 581 (1306)
T KOG3516|consen 538 QIDMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA--TCH 581 (1306)
T ss_pred eecccccccccCCccccCCCcccccccceeEeccccccccc--ccc
Confidence 3445677889999999999999998889999998 899998 775
No 69
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=38.17 E-value=27 Score=31.88 Aligned_cols=26 Identities=42% Similarity=1.106 Sum_probs=20.6
Q ss_pred CCCCCCCceeccCCCCeeecCCCCcCC
Q psy13146 79 GTCGQNANCKVQNHNPICNCKPGYTGD 105 (895)
Q Consensus 79 ~~C~~~g~C~~~~g~y~C~C~~Gy~g~ 105 (895)
+.|..+|.|.. ..+..|.|.+||+-+
T Consensus 84 ~~CG~~g~C~~-~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICNS-NNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeCC-CCCCceECCCCcCCC
Confidence 67899999954 456689999999743
No 70
>KOG3514|consensus
Probab=32.08 E-value=30 Score=42.97 Aligned_cols=35 Identities=23% Similarity=0.615 Sum_probs=31.8
Q ss_pred CCCCCCCCCCCcccccCCCCeEeCC-CCCcCCCCCCCC
Q psy13146 129 PCYPSPCGPYSQCRDIGGSPSCSCL-PNYIGAPPNCRP 165 (895)
Q Consensus 129 eC~~~~C~~~g~C~n~~gsy~C~C~-~Gy~g~~~~C~~ 165 (895)
.|.++||.|+|+|...-..|.|.|. .||.|. .|++
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc--cccc
Confidence 6899999999999999999999996 699998 7874
No 71
>KOG3514|consensus
Probab=28.88 E-value=36 Score=42.41 Aligned_cols=31 Identities=26% Similarity=0.774 Sum_probs=28.5
Q ss_pred CCCCCCCCCCCceeecCCceeeecC-CCcccC
Q psy13146 18 PCQPSPCGPNSQCREVNKQAVCSCL-PNYFGS 48 (895)
Q Consensus 18 ~C~~~~C~~~~~C~~~~g~~~C~C~-~Gf~g~ 48 (895)
.|.++||+|+|+|......|.|.|- .||.|.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~ 656 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGR 656 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCc
Confidence 7999999999999999999999997 478887
No 72
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=21.66 E-value=51 Score=29.80 Aligned_cols=25 Identities=24% Similarity=0.654 Sum_probs=19.6
Q ss_pred CCCCCCCceeccC-----CCCeeecCCCCc
Q psy13146 79 GTCGQNANCKVQN-----HNPICNCKPGYT 103 (895)
Q Consensus 79 ~~C~~~g~C~~~~-----g~y~C~C~~Gy~ 103 (895)
+.|..+|.|+... .=|.|+|.+.+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~ 42 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVV 42 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeecccc
Confidence 6788999999863 448999999543
No 73
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=21.37 E-value=59 Score=22.99 Aligned_cols=13 Identities=31% Similarity=0.606 Sum_probs=11.1
Q ss_pred eeeeCCCCCcCCC
Q psy13146 761 VSCNCPPGTTGSP 773 (895)
Q Consensus 761 y~C~C~~Gy~g~~ 773 (895)
++|.|+.||..+.
T Consensus 18 ~~C~CPeGyIlde 30 (34)
T PF09064_consen 18 GQCFCPEGYILDE 30 (34)
T ss_pred CceeCCCceEecC
Confidence 4999999998765
Done!