Query psy13151
Match_columns 256
No_of_seqs 244 out of 2208
Neff 9.7
Searched_HMMs 46136
Date Fri Aug 16 20:54:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13151.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13151hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.7 8.5E-17 1.8E-21 143.7 13.2 197 27-243 636-866 (1289)
2 KOG1214|consensus 99.7 3E-16 6.5E-21 140.2 15.5 204 15-244 700-918 (1289)
3 KOG1219|consensus 99.4 1.3E-12 2.8E-17 126.7 9.1 107 68-236 3865-3975(4289)
4 KOG1217|consensus 99.3 1.6E-10 3.4E-15 103.3 15.8 206 14-248 176-405 (487)
5 KOG1219|consensus 99.2 1.9E-11 4.1E-16 118.9 8.6 107 11-178 3866-3974(4289)
6 KOG1217|consensus 99.2 3.6E-10 7.9E-15 101.0 14.2 199 18-250 137-363 (487)
7 KOG4260|consensus 99.1 1.9E-10 4.1E-15 91.4 5.7 139 72-234 150-305 (350)
8 KOG4289|consensus 98.9 1.5E-09 3.3E-14 102.6 6.7 68 144-231 1237-1308(2531)
9 KOG4289|consensus 98.9 2.2E-09 4.9E-14 101.5 7.1 60 24-105 1216-1275(2531)
10 KOG4260|consensus 98.8 4.9E-09 1.1E-13 83.5 4.4 145 14-178 149-306 (350)
11 KOG1225|consensus 98.7 3.2E-07 6.9E-12 81.4 11.8 132 30-236 234-365 (525)
12 PF12947 EGF_3: EGF domain; I 98.3 3E-07 6.5E-12 51.3 1.5 30 209-238 6-35 (36)
13 PF07645 EGF_CA: Calcium-bindi 98.3 5.6E-07 1.2E-11 52.4 2.4 26 209-234 10-35 (42)
14 PF07645 EGF_CA: Calcium-bindi 98.2 1E-06 2.2E-11 51.3 2.1 33 145-177 1-35 (42)
15 KOG1225|consensus 98.1 3.7E-05 8E-10 68.5 11.3 102 28-178 263-364 (525)
16 PF12947 EGF_3: EGF domain; I 98.0 1.2E-06 2.6E-11 48.9 0.4 31 15-45 6-36 (36)
17 PF12662 cEGF: Complement Clr- 98.0 6.1E-06 1.3E-10 41.3 2.2 23 223-245 1-24 (24)
18 PF00008 EGF: EGF-like domain 97.9 5.8E-06 1.3E-10 44.9 1.4 27 209-235 4-31 (32)
19 PF06247 Plasmod_Pvs28: Plasmo 97.8 3E-06 6.5E-11 64.3 -0.3 139 77-237 10-164 (197)
20 PF00008 EGF: EGF-like domain 97.7 1.8E-05 3.9E-10 43.0 1.8 30 149-178 1-31 (32)
21 smart00179 EGF_CA Calcium-bind 97.7 6.6E-05 1.4E-09 42.6 3.9 33 146-178 2-36 (39)
22 smart00179 EGF_CA Calcium-bind 97.5 0.00018 3.8E-09 40.7 3.7 27 72-98 9-36 (39)
23 PF12662 cEGF: Complement Clr- 97.3 0.00021 4.5E-09 35.7 2.2 23 86-108 1-24 (24)
24 cd00054 EGF_CA Calcium-binding 97.3 0.00046 1E-08 38.6 3.9 33 146-178 2-35 (38)
25 KOG1226|consensus 97.3 0.0021 4.5E-08 59.1 9.8 57 30-97 478-535 (783)
26 cd00054 EGF_CA Calcium-binding 97.3 0.00046 9.9E-09 38.6 3.7 27 209-235 9-35 (38)
27 cd00053 EGF Epidermal growth f 97.0 0.00092 2E-08 36.7 3.3 27 209-235 6-32 (36)
28 smart00181 EGF Epidermal growt 97.0 0.0013 2.9E-08 36.2 3.7 25 210-235 7-31 (35)
29 PF06247 Plasmod_Pvs28: Plasmo 97.0 0.00024 5.2E-09 54.2 0.5 104 117-241 10-127 (197)
30 smart00181 EGF Epidermal growt 96.8 0.0021 4.5E-08 35.4 3.5 26 72-98 6-31 (35)
31 cd00053 EGF Epidermal growth f 96.7 0.0029 6.4E-08 34.6 3.5 28 151-178 5-32 (36)
32 PF14670 FXa_inhibition: Coagu 96.1 0.0041 8.9E-08 34.5 1.7 22 216-237 11-32 (36)
33 PF07974 EGF_2: EGF-like domai 96.0 0.011 2.4E-07 31.9 3.1 25 210-236 7-31 (32)
34 PF07974 EGF_2: EGF-like domai 95.8 0.011 2.4E-07 31.9 2.7 25 15-41 6-30 (32)
35 KOG0994|consensus 95.8 0.12 2.7E-06 50.0 11.1 57 120-178 878-945 (1758)
36 KOG1226|consensus 95.7 0.083 1.8E-06 49.0 9.5 35 209-248 555-590 (783)
37 KOG0994|consensus 95.7 0.033 7.2E-07 53.6 7.1 69 28-99 932-1011(1758)
38 PF12661 hEGF: Human growth fa 95.7 0.0053 1.1E-07 25.9 0.9 11 31-41 1-11 (13)
39 PF14670 FXa_inhibition: Coagu 95.2 0.019 4.2E-07 31.8 2.3 21 118-138 10-30 (36)
40 PF09064 Tme5_EGF_like: Thromb 94.4 0.04 8.6E-07 29.7 2.1 15 224-238 18-32 (34)
41 PF12946 EGF_MSP1_1: MSP1 EGF 93.6 0.022 4.7E-07 31.6 0.2 29 210-238 6-35 (37)
42 PF12946 EGF_MSP1_1: MSP1 EGF 93.3 0.035 7.5E-07 30.8 0.7 31 149-179 2-33 (37)
43 cd01475 vWA_Matrilin VWA_Matri 90.0 0.36 7.8E-06 38.8 3.5 37 140-178 181-219 (224)
44 PF01683 EB: EB module; Inter 88.9 1.1 2.4E-05 26.9 4.3 24 210-237 27-50 (52)
45 smart00051 DSL delta serrate l 88.1 1 2.2E-05 28.4 3.8 47 166-236 16-62 (63)
46 cd01475 vWA_Matrilin VWA_Matri 87.7 0.57 1.2E-05 37.7 3.2 37 102-138 183-219 (224)
47 PHA02887 EGF-like protein; Pro 87.6 0.61 1.3E-05 32.9 2.8 27 209-236 92-120 (126)
48 smart00051 DSL delta serrate l 87.1 1.3 2.8E-05 28.0 3.8 45 30-98 17-61 (63)
49 PF00954 S_locus_glycop: S-loc 83.4 1.4 3E-05 31.2 3.0 26 72-98 84-109 (110)
50 PHA03099 epidermal growth fact 82.4 1.3 2.9E-05 31.9 2.5 32 209-244 51-84 (139)
51 PF00954 S_locus_glycop: S-loc 81.3 1.9 4E-05 30.5 3.1 26 209-235 84-109 (110)
52 KOG1836|consensus 78.3 5.4 0.00012 41.6 6.2 32 211-242 782-816 (1705)
53 PHA02887 EGF-like protein; Pro 77.4 2.8 6.1E-05 29.7 2.8 30 73-106 93-124 (126)
54 PHA03099 epidermal growth fact 72.4 3.9 8.4E-05 29.5 2.5 31 73-107 52-84 (139)
55 PF00053 Laminin_EGF: Laminin 67.3 4.4 9.5E-05 23.8 1.7 26 215-242 11-36 (49)
56 KOG1836|consensus 65.8 29 0.00063 36.5 8.0 30 73-105 781-813 (1705)
57 cd00055 EGF_Lam Laminin-type e 59.4 12 0.00027 22.1 2.7 19 223-241 18-36 (50)
58 PF01826 TIL: Trypsin Inhibito 57.7 3.2 7E-05 25.1 -0.1 22 225-246 34-55 (55)
59 KOG3607|consensus 54.5 15 0.00032 35.1 3.6 45 190-237 604-655 (716)
60 KOG3516|consensus 52.1 12 0.00027 37.1 2.7 38 141-178 540-578 (1306)
61 smart00180 EGF_Lam Laminin-typ 50.7 18 0.0004 20.9 2.4 19 223-241 17-35 (46)
62 KOG0196|consensus 48.7 59 0.0013 31.5 6.4 19 219-237 303-321 (996)
63 KOG3516|consensus 45.7 15 0.00033 36.5 2.3 31 68-98 546-578 (1306)
64 PF12955 DUF3844: Domain of un 40.2 28 0.0006 24.3 2.3 25 209-233 13-42 (103)
65 PF01414 DSL: Delta serrate li 39.7 8.6 0.00019 24.2 -0.2 20 166-187 16-35 (63)
66 PF04863 EGF_alliinase: Alliin 36.2 18 0.00039 22.0 0.8 28 211-238 19-50 (56)
67 KOG3514|consensus 33.6 30 0.00066 34.3 2.2 35 10-46 624-659 (1591)
68 KOG3607|consensus 31.3 58 0.0013 31.2 3.7 47 53-105 604-657 (716)
69 KOG3509|consensus 24.1 1.8E+02 0.0039 29.0 5.5 32 147-178 407-438 (964)
70 KOG1218|consensus 23.0 4.4E+02 0.0095 21.8 13.3 13 166-178 161-173 (316)
71 KOG1218|consensus 22.8 4.4E+02 0.0096 21.8 12.4 15 27-41 12-26 (316)
72 KOG3512|consensus 21.5 4.4E+02 0.0096 24.0 6.9 54 123-178 368-425 (592)
No 1
>KOG1214|consensus
Probab=99.72 E-value=8.5e-17 Score=143.66 Aligned_cols=197 Identities=22% Similarity=0.434 Sum_probs=143.5
Q ss_pred CCCceeeCCCCCc--cCCCCCc----------cCCCCCCCCCCCCcccCC----CccCCC---CCCCCCCCeeeeCC-CC
Q psy13151 27 GNTPACSCVESYI--GRPPNCR----------PECTINAECAGNLACINE----RCKDPC---PGSCGAHASCVVLN-HT 86 (256)
Q Consensus 27 ~g~~~C~C~~G~~--g~~~~C~----------~~C~~~~~C~~~~~C~~~----~C~~~C---~~~C~~~~~C~n~~-g~ 86 (256)
.+-++|.+.+.|. ++++.+. .++.....++....++.. .=+++| ++.|..++.|.... -.
T Consensus 636 ityq~C~h~~~~p~~p~tqql~vd~vfalyn~ee~~lr~a~Sn~igpV~E~S~~~~~npCy~gsh~cdt~a~C~pg~~~~ 715 (1289)
T KOG1214|consen 636 ITYQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVSNQIGPVKEDSDPTPVNPCYDGSHMCDTTARCHPGTGVD 715 (1289)
T ss_pred ceeEEeecCCCCCCCCCceEeecccceeccCccccchhhhhhhcccceecCCCCcccccceecCcccCCCccccCCCCcc
Confidence 4567899988886 4443332 122222234333333332 112556 67899899998654 46
Q ss_pred ceeeCCCCCccCCCCCCccccccCC--CCCCCCCceEecCCceeeecCCCCcCCCC-CCCCC------CCCCC--CCCCC
Q psy13151 87 PRCTCDPGFTGDPFSTCFYIQERNH--MVMGSRDQMVLVVGQQLLERKSTAAPVAA-EPRPS------GNPCL--PSPCG 155 (256)
Q Consensus 87 ~~C~C~~G~~g~~~~~C~~~~~c~~--~~~~~~~~c~~~~g~~~c~c~~g~~~~~~-~~c~~------~~~C~--~~~C~ 155 (256)
|+|.|..||.|++.. |.++++|+. ..|+.+..|++.+++++|.|..||.+..+ ..|.. ++.|. .+.|.
T Consensus 716 ~tcecs~g~~gdgr~-c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~ 794 (1289)
T KOG1214|consen 716 YTCECSSGYQGDGRN-CVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCA 794 (1289)
T ss_pred eEEEEeeccCCCCCC-CCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccC
Confidence 899999999999864 999999986 44889999999999999999999988754 45543 45565 26676
Q ss_pred CCC--EEeecC-CCCeeecCCCcccCCCCCcccCCcCCCCCCCCeeeCCcccCCCCCCCCCCCeEeeCCCCceeeCCCCC
Q psy13151 156 PNS--ICRVIG-NTPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGF 232 (256)
Q Consensus 156 ~~~--~C~~~~-g~~~C~C~~G~~~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~ 232 (256)
.++ +|+... ++|.|.|.+||.|++..| .+.++|.. +.|+++|+|+|++|+|.|.|.+||
T Consensus 795 i~g~a~c~~hGgs~y~C~CLPGfsGDG~~c----~dvDeC~p--------------srChp~A~CyntpgsfsC~C~pGy 856 (1289)
T KOG1214|consen 795 IAGQARCVHHGGSTYSCACLPGFSGDGHQC----TDVDECSP--------------SRCHPAATCYNTPGSFSCRCQPGY 856 (1289)
T ss_pred cCCceEEEecCCceEEEeecCCccCCcccc----ccccccCc--------------cccCCCceEecCCCcceeecccCc
Confidence 544 455555 459999999999998754 34456653 459999999999999999999999
Q ss_pred ccCCCCCCccc
Q psy13151 233 TGDPFSTCFYI 243 (256)
Q Consensus 233 ~g~~~~~C~~i 243 (256)
.|++.. |++-
T Consensus 857 ~GDGf~-CVP~ 866 (1289)
T KOG1214|consen 857 YGDGFQ-CVPD 866 (1289)
T ss_pred cCCCce-ecCC
Confidence 999998 8763
No 2
>KOG1214|consensus
Probab=99.71 E-value=3e-16 Score=140.22 Aligned_cols=204 Identities=21% Similarity=0.341 Sum_probs=143.3
Q ss_pred cCCCCCCcc-CCCCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCceeeCCC
Q psy13151 15 VAVDIRRTD-PHIGNTPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDP 93 (256)
Q Consensus 15 ~~c~~~~~C-~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~C~C~~ 93 (256)
.-|+..+.| ..+.-+|+|.|..||.|++..|.+ .++|+.. ++.|++++.|+|.+++|+|.|..
T Consensus 700 h~cdt~a~C~pg~~~~~tcecs~g~~gdgr~c~d----~~eca~~------------~~~CGp~s~Cin~pg~~rceC~~ 763 (1289)
T KOG1214|consen 700 HMCDTTARCHPGTGVDYTCECSSGYQGDGRNCVD----ENECATG------------FHRCGPNSVCINLPGSYRCECRS 763 (1289)
T ss_pred cccCCCccccCCCCcceEEEEeeccCCCCCCCCC----hhhhccC------------CCCCCCCceeecCCCceeEEEee
Confidence 334445667 455678999999999999887753 3455443 67899999999999999999999
Q ss_pred CCccCCC-CCCcccc------ccCCC--CCCCCCceEe--c-CCceeeecCCCCcCCCCCCCCCCCCCCCCCCCCCCEEe
Q psy13151 94 GFTGDPF-STCFYIQ------ERNHM--VMGSRDQMVL--V-VGQQLLERKSTAAPVAAEPRPSGNPCLPSPCGPNSICR 161 (256)
Q Consensus 94 G~~g~~~-~~C~~~~------~c~~~--~~~~~~~c~~--~-~g~~~c~c~~g~~~~~~~~c~~~~~C~~~~C~~~~~C~ 161 (256)
||...+. .+|..+. .|... .|.-.+.++. . -++|.|.|.+||.+. +..|.++|+|.++.|.+++.|.
T Consensus 764 gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGD-G~~c~dvDeC~psrChp~A~Cy 842 (1289)
T KOG1214|consen 764 GYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGD-GHQCTDVDECSPSRCHPAATCY 842 (1289)
T ss_pred cceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCC-ccccccccccCccccCCCceEe
Confidence 9864433 3476443 34433 2333444433 3 456899999999986 4778889999999999999999
Q ss_pred ecCCCCeeecCCCcccCCCCCcccCCcCCCCCCCCeeeCCcccCCCCCCCCCCCeE--eeCCCCceeeCCCCCccCCCCC
Q psy13151 162 VIGNTPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASC--VVLNHTPRCTCDPGFTGDPFST 239 (256)
Q Consensus 162 ~~~g~~~C~C~~G~~~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C--~~~~g~~~C~C~~G~~g~~~~~ 239 (256)
|+++++.|.|.+||.|++..|.+.=.....|+... +=+-.|+.++.| ...+..|.+.|.++-.|++...
T Consensus 843 ntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er---------~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~ 913 (1289)
T KOG1214|consen 843 NTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQER---------FHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPH 913 (1289)
T ss_pred cCCCcceeecccCccCCCceecCCCccCCcccccc---------ccceeeccccceeEeeCCCcccCCCCCCCCCCCCCC
Confidence 99999999999999999988874311113444331 002335554433 2235568888888877776655
Q ss_pred Ccccc
Q psy13151 240 CFYIQ 244 (256)
Q Consensus 240 C~~i~ 244 (256)
|-.|.
T Consensus 914 c~~~~ 918 (1289)
T KOG1214|consen 914 CGPSP 918 (1289)
T ss_pred CCCcc
Confidence 65543
No 3
>KOG1219|consensus
Probab=99.38 E-value=1.3e-12 Score=126.69 Aligned_cols=107 Identities=28% Similarity=0.807 Sum_probs=90.7
Q ss_pred CCC-CCCCCCCCeeeeCC-CCceeeCCCCCccCCCCCCccccccCCCCCCCCCceEecCCceeeecCCCCcCCCCCCCCC
Q psy13151 68 DPC-PGSCGAHASCVVLN-HTPRCTCDPGFTGDPFSTCFYIQERNHMVMGSRDQMVLVVGQQLLERKSTAAPVAAEPRPS 145 (256)
Q Consensus 68 ~~C-~~~C~~~~~C~n~~-g~~~C~C~~G~~g~~~~~C~~~~~c~~~~~~~~~~c~~~~g~~~c~c~~g~~~~~~~~c~~ 145 (256)
+.| .++|.++|.|..++ +.|.|.|++-|+|+ +|+. +
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~---~CEi---------------------------------------~ 3902 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN---HCEI---------------------------------------D 3902 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCc---cccc---------------------------------------c
Confidence 556 67899999999876 67899999999998 7863 3
Q ss_pred CCCCCCCCCCCCCEEeecCCCCeeecCCCcccCCCCCcccCCcC--CCCCCCCeeeCCcccCCCCCCCCCCCeEeeCCCC
Q psy13151 146 GNPCLPSPCGPNSICRVIGNTPACSCVESYIGRPPNCRPECTIN--AECAGNLACINERCKDPCPGSCGAHASCVVLNHT 223 (256)
Q Consensus 146 ~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~c~~~C~~~--~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~ 223 (256)
+.+|.++||..+++|+...+.|.|.|+.||+|. .|+.. ++|+. +.|..+|.|+|++|+
T Consensus 3903 ~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~------~Ce~~Gi~eCs~--------------n~C~~gg~C~n~~gs 3962 (4289)
T KOG1219|consen 3903 LEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK------RCEARGISECSK--------------NVCGTGGQCINIPGS 3962 (4289)
T ss_pred cccccCCCCCCCCEEEecCCCeeEeCCCCccCc------eeeccccccccc--------------ccccCCceeeccCCc
Confidence 677889999999999999999999999999998 45432 55654 459999999999999
Q ss_pred ceeeCCCCCccCC
Q psy13151 224 PRCTCDPGFTGDP 236 (256)
Q Consensus 224 ~~C~C~~G~~g~~ 236 (256)
|+|.|.+||.|..
T Consensus 3963 f~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3963 FHCNCTPGILGRT 3975 (4289)
T ss_pred eEeccChhHhccc
Confidence 9999999999983
No 4
>KOG1217|consensus
Probab=99.27 E-value=1.6e-10 Score=103.31 Aligned_cols=206 Identities=24% Similarity=0.477 Sum_probs=143.2
Q ss_pred ccCCCCCCccCCCCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCC-Cc-------cCCC---CCCCCCC-Ceee
Q psy13151 14 VVAVDIRRTDPHIGNTPACSCVESYIGRPPNCRPECTINAECAGNLACINE-RC-------KDPC---PGSCGAH-ASCV 81 (256)
Q Consensus 14 ~~~c~~~~~C~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~-~C-------~~~C---~~~C~~~-~~C~ 81 (256)
..+|...+.|.+..++|.|.|++||.+. .| ... .....|+.. .| .+.| ...|... +.|+
T Consensus 176 ~~~c~~~~~C~~~~~~~~C~c~~~~~~~--~~----~~~---~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~ 246 (487)
T KOG1217|consen 176 SSPCQNGGTCVNTGGSYLCSCPPGYTGS--TC----ETT---GNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCV 246 (487)
T ss_pred CCCcCCCcccccCCCCeeEeCCCCccCC--cC----cCC---CCCceEecceeccCCCCCCCCCcccccccccCCCCccc
Confidence 4457788899999999999999999987 22 211 111111110 00 0111 1223322 8899
Q ss_pred eCCCCceeeCCCCCccCCCCCCccccccCCCC-CCCCCceEecCCceeeecCCCCcCCCCCCCCCCCCCC----CCCCCC
Q psy13151 82 VLNHTPRCTCDPGFTGDPFSTCFYIQERNHMV-MGSRDQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCL----PSPCGP 156 (256)
Q Consensus 82 n~~g~~~C~C~~G~~g~~~~~C~~~~~c~~~~-~~~~~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~----~~~C~~ 156 (256)
++.++++|.|++||.+.....+.++++|.... +...+.|++..+.+.|.|.+||.+.....+.+..+|. ..+|..
T Consensus 247 ~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~~~~~~~C~~~~~~~~c~~ 326 (487)
T KOG1217|consen 247 NTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTECVDVDECSPRNAGGPCAN 326 (487)
T ss_pred ccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCccccccccccccccCCcCCC
Confidence 99999999999999998421367788888764 6778899999999999999999987433355566774 466887
Q ss_pred CCEE--eecCCCCeeecCCCcccCCCCCcccCCcC-CCCCCCCeeeCCcccCCCCCCCCCCCeEee-CCCCceeeCCCCC
Q psy13151 157 NSIC--RVIGNTPACSCVESYIGRPPNCRPECTIN-AECAGNLACINERCKDPCPGSCGAHASCVV-LNHTPRCTCDPGF 232 (256)
Q Consensus 157 ~~~C--~~~~g~~~C~C~~G~~~~~~~c~~~C~~~-~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~-~~g~~~C~C~~G~ 232 (256)
++.| ....+.+.|.|..||.+. .|+.. ++|... .+..++.|++ ..++|.|.|+.+|
T Consensus 327 g~~C~~~~~~~~~~C~c~~~~~g~------~C~~~~~~C~~~--------------~~~~~~~c~~~~~~~~~c~~~~~~ 386 (487)
T KOG1217|consen 327 GGTCNTLGSFGGFRCACGPGFTGR------RCEDSNDECASS--------------PCCPGGTCVNETPGSYRCACPAGF 386 (487)
T ss_pred CcccccCCCCCCCCcCCCCCCCCC------ccccCCccccCC--------------ccccCCEeccCCCCCeEecCCCcc
Confidence 7788 333456889999998777 44443 245432 2667789998 7899999999999
Q ss_pred ccC---CCCCCccccCCCC
Q psy13151 233 TGD---PFSTCFYIQESKH 248 (256)
Q Consensus 233 ~g~---~~~~C~~i~ec~~ 248 (256)
.+. ....+.++++|..
T Consensus 387 ~~~~~~~~~~~~~~~~c~~ 405 (487)
T KOG1217|consen 387 AGKANGDGVGCEDIDECSG 405 (487)
T ss_pred ccCCccccccccccccccC
Confidence 973 2345888888864
No 5
>KOG1219|consensus
Probab=99.25 E-value=1.9e-11 Score=118.89 Aligned_cols=107 Identities=24% Similarity=0.510 Sum_probs=86.4
Q ss_pred eeeccCCCCCCccCCCC-CCceeeCCCCCccCCCCCccCCCC-CCCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCce
Q psy13151 11 YLVVVAVDIRRTDPHIG-NTPACSCVESYIGRPPNCRPECTI-NAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPR 88 (256)
Q Consensus 11 ~~~~~~c~~~~~C~~~~-g~~~C~C~~G~~g~~~~C~~~C~~-~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~ 88 (256)
.+..+||...|+|+..+ +.|+|.|++.|+|. + |+. ...|. ++||..++.|+...+.|.
T Consensus 3866 ~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~--~----CEi~~epC~--------------snPC~~GgtCip~~n~f~ 3925 (4289)
T KOG1219|consen 3866 PCNDNPCQHGGTCISQPKGGYKCKCPSQYSGN--H----CEIDLEPCA--------------SNPCLTGGTCIPFYNGFL 3925 (4289)
T ss_pred ccccCcccCCCEecCCCCCceEEeCcccccCc--c----ccccccccc--------------CCCCCCCCEEEecCCCee
Confidence 34578999999998776 77999999999998 4 443 22354 579999999999999999
Q ss_pred eeCCCCCccCCCCCCccccccCCCCCCCCCceEecCCceeeecCCCCcCCCCCCCCCCCCCCCCCCCCCCEEeecCCCCe
Q psy13151 89 CTCDPGFTGDPFSTCFYIQERNHMVMGSRDQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCLPSPCGPNSICRVIGNTPA 168 (256)
Q Consensus 89 C~C~~G~~g~~~~~C~~~~~c~~~~~~~~~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~~~~C~~~~~C~~~~g~~~ 168 (256)
|.|+.||+|. +|+.. .+++|..++|..++.|+|+.|+|.
T Consensus 3926 CnC~~gyTG~---~Ce~~--------------------------------------Gi~eCs~n~C~~gg~C~n~~gsf~ 3964 (4289)
T KOG1219|consen 3926 CNCPNGYTGK---RCEAR--------------------------------------GISECSKNVCGTGGQCINIPGSFH 3964 (4289)
T ss_pred EeCCCCccCc---eeecc--------------------------------------cccccccccccCCceeeccCCceE
Confidence 9999999999 78742 156777778888888888888888
Q ss_pred eecCCCcccC
Q psy13151 169 CSCVESYIGR 178 (256)
Q Consensus 169 C~C~~G~~~~ 178 (256)
|.|-+||.|.
T Consensus 3965 CncT~g~~gr 3974 (4289)
T KOG1219|consen 3965 CNCTPGILGR 3974 (4289)
T ss_pred eccChhHhcc
Confidence 8888888765
No 6
>KOG1217|consensus
Probab=99.20 E-value=3.6e-10 Score=100.99 Aligned_cols=199 Identities=26% Similarity=0.493 Sum_probs=132.5
Q ss_pred CCCCccCCC---CCCceeeCCCCCccCCCCCccCCCCC-CCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCceeeCCC
Q psy13151 18 DIRRTDPHI---GNTPACSCVESYIGRPPNCRPECTIN-AECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDP 93 (256)
Q Consensus 18 ~~~~~C~~~---~g~~~C~C~~G~~g~~~~C~~~C~~~-~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~C~C~~ 93 (256)
...+.|... ...+.|.|..||.+. .+... +.|... ...|.+.+.|.+..++|.|.|.+
T Consensus 137 ~~~~~c~~~~~~~~~~~c~C~~g~~~~------~~~~~~~~C~~~------------~~~c~~~~~C~~~~~~~~C~c~~ 198 (487)
T KOG1217|consen 137 CIDGSCSNGPGSVGPFRCSCTEGYEGE------PCETDLDECIQY------------SSPCQNGGTCVNTGGSYLCSCPP 198 (487)
T ss_pred eCchhhcCCCCCCCceeeeeCCCcccc------cccccccccccC------------CCCcCCCcccccCCCCeeEeCCC
Confidence 344556553 358999999999997 33322 355533 34588888999999899999999
Q ss_pred CCccCCCC------CCcc-----------ccccCCC--CCCCC-CceEecCCceeeecCCCCcCCCCCCCCCCCCCCCC-
Q psy13151 94 GFTGDPFS------TCFY-----------IQERNHM--VMGSR-DQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCLPS- 152 (256)
Q Consensus 94 G~~g~~~~------~C~~-----------~~~c~~~--~~~~~-~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~~~- 152 (256)
||.+.... .|.. ...+... .+... ..|+++.+++.|.+..||.+.....+.++++|...
T Consensus 199 ~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~ 278 (487)
T KOG1217|consen 199 GYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIA 278 (487)
T ss_pred CccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCccccccceeeeccccCCCC
Confidence 99886311 0111 1111111 11111 78899999999999999988742356678899864
Q ss_pred CCCCCCEEeecCCCCeeecCCCcccCCCCCcccCCcCCCCCCCCeeeCCcccCCCCCCCCCCCeE--eeCCCCceeeCCC
Q psy13151 153 PCGPNSICRVIGNTPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASC--VVLNHTPRCTCDP 230 (256)
Q Consensus 153 ~C~~~~~C~~~~g~~~C~C~~G~~~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C--~~~~g~~~C~C~~ 230 (256)
.|.++++|++..+.|.|.|++||.+.. + ..+.+...|.... -...|.+++.| ....+.+.|.|..
T Consensus 279 ~c~~~~~C~~~~~~~~C~C~~g~~g~~--~-~~~~~~~~C~~~~----------~~~~c~~g~~C~~~~~~~~~~C~c~~ 345 (487)
T KOG1217|consen 279 SCPNGGTCVNVPGSYRCTCPPGFTGRL--C-TECVDVDECSPRN----------AGGPCANGGTCNTLGSFGGFRCACGP 345 (487)
T ss_pred ccCCCCeeecCCCcceeeCCCCCCCCC--C-ccccccccccccc----------cCCcCCCCcccccCCCCCCCCcCCCC
Confidence 388889999999999999999999983 2 1222223443110 01346666677 3445678899999
Q ss_pred CCccCCCCCCccc-cCCCCCC
Q psy13151 231 GFTGDPFSTCFYI-QESKHPN 250 (256)
Q Consensus 231 G~~g~~~~~C~~i-~ec~~~~ 250 (256)
||.|. .|.+. ++|...+
T Consensus 346 ~~~g~---~C~~~~~~C~~~~ 363 (487)
T KOG1217|consen 346 GFTGR---RCEDSNDECASSP 363 (487)
T ss_pred CCCCC---ccccCCccccCCc
Confidence 98888 56666 3776543
No 7
>KOG4260|consensus
Probab=99.08 E-value=1.9e-10 Score=91.40 Aligned_cols=139 Identities=22% Similarity=0.426 Sum_probs=92.2
Q ss_pred CCCCCCCeeee---CCCCceeeCCCCCccCCCCCCcc------ccccCCCCC-CCCCceE----ecCCceeeecCCCCcC
Q psy13151 72 GSCGAHASCVV---LNHTPRCTCDPGFTGDPFSTCFY------IQERNHMVM-GSRDQMV----LVVGQQLLERKSTAAP 137 (256)
Q Consensus 72 ~~C~~~~~C~n---~~g~~~C~C~~G~~g~~~~~C~~------~~~c~~~~~-~~~~~c~----~~~g~~~c~c~~g~~~ 137 (256)
.+|..++.|.- ..|+..|.|.+||+|+.-..|.. .++-. ..| .++..|. ......--.|..||.+
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~-lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~l 228 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQH-LVCTACHEGCLGVCSGESSKGCSKCKKGWKL 228 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhccccc-chhhhhhhhhhcccCCCCCCChhhhccccee
Confidence 46777777763 34788999999999983322321 01100 001 1122222 2222222368889988
Q ss_pred CCCCCCCCCCCCC--CCCCCCCCEEeecCCCCeeecCCCcccCCCCCcccCCcC-CCCCCCCeeeCCcccCCCCCCCCCC
Q psy13151 138 VAAEPRPSGNPCL--PSPCGPNSICRVIGNTPACSCVESYIGRPPNCRPECTIN-AECAGNLACINERCKDPCPGSCGAH 214 (256)
Q Consensus 138 ~~~~~c~~~~~C~--~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~c~~~C~~~-~~C~~~~~C~~~~C~~~C~~~C~~~ 214 (256)
. ...|.|||+|. +.+|..+..|+|+.|+|.|..++||.+... +|+.- +.|.. .+
T Consensus 229 d-e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d----~C~~~~d~~~~------------------kn 285 (350)
T KOG4260|consen 229 D-EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVD----ECQFCADVCAS------------------KN 285 (350)
T ss_pred c-ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChH----Hhhhhhhhccc------------------CC
Confidence 7 57899999998 588999999999999999999999987422 33221 22321 34
Q ss_pred CeEeeCCCCceeeCCCCCcc
Q psy13151 215 ASCVVLNHTPRCTCDPGFTG 234 (256)
Q Consensus 215 ~~C~~~~g~~~C~C~~G~~g 234 (256)
..|+|++++|+|+|..|+.-
T Consensus 286 ~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 286 RPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred CCcccCCccEEEEeccccee
Confidence 67899999999999999863
No 8
>KOG4289|consensus
Probab=98.94 E-value=1.5e-09 Score=102.59 Aligned_cols=68 Identities=31% Similarity=0.779 Sum_probs=55.9
Q ss_pred CCCCCCCCCCCCCCCEEeecCCCCeeecCCCcccCCCCCcccCCcC---CCCCCCCeeeCCcccCCCCCCCCCCCeEeeC
Q psy13151 144 PSGNPCLPSPCGPNSICRVIGNTPACSCVESYIGRPPNCRPECTIN---AECAGNLACINERCKDPCPGSCGAHASCVVL 220 (256)
Q Consensus 144 ~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~c~~~C~~~---~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~ 220 (256)
..+|+|...+|.+++.|....|+|+|.|++||+|. .|+.. ..|. |+.|.++++|++.
T Consensus 1237 TeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGe------hCEvs~~agrCv--------------pGvC~nggtC~~~ 1296 (2531)
T KOG4289|consen 1237 TEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGE------HCEVSARAGRCV--------------PGVCKNGGTCVNL 1296 (2531)
T ss_pred chhHhhhcCCCCCCCceEEecCceeEEecCCcccc------ceeeecccCccc--------------cceecCCCEEeec
Confidence 34788999999999999999999999999999998 44422 2233 3568899999887
Q ss_pred C-CCceeeCCCC
Q psy13151 221 N-HTPRCTCDPG 231 (256)
Q Consensus 221 ~-g~~~C~C~~G 231 (256)
. |++.|+|+.|
T Consensus 1297 ~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYG 1308 (2531)
T ss_pred CCCceeccCCCc
Confidence 5 7899999998
No 9
>KOG4289|consensus
Probab=98.92 E-value=2.2e-09 Score=101.54 Aligned_cols=60 Identities=33% Similarity=0.758 Sum_probs=48.8
Q ss_pred CCCCCCceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCC
Q psy13151 24 PHIGNTPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGFTGDPFSTC 103 (256)
Q Consensus 24 ~~~~g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~C~C~~G~~g~~~~~C 103 (256)
+++.+.+.|.|++||+|+ +|+.+ .+.|- ..+|++++.|....|.|+|.|.+||+|. +|
T Consensus 1216 i~pvnglrCrCPpGFTgd--~CeTe---iDlCY--------------s~pC~nng~C~srEggYtCeCrpg~tGe---hC 1273 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTGD--YCETE---IDLCY--------------SGPCGNNGRCRSREGGYTCECRPGFTGE---HC 1273 (2531)
T ss_pred ccccCceeEeCCCCCCcc--cccch---hHhhh--------------cCCCCCCCceEEecCceeEEecCCcccc---ce
Confidence 345688999999999999 55421 33443 5689999999999999999999999999 78
Q ss_pred cc
Q psy13151 104 FY 105 (256)
Q Consensus 104 ~~ 105 (256)
+-
T Consensus 1274 Ev 1275 (2531)
T KOG4289|consen 1274 EV 1275 (2531)
T ss_pred ee
Confidence 73
No 10
>KOG4260|consensus
Probab=98.80 E-value=4.9e-09 Score=83.47 Aligned_cols=145 Identities=22% Similarity=0.368 Sum_probs=95.2
Q ss_pred ccCCCCCCccCCC---CCCceeeCCCCCccCCCCCccCCCCC-CCCCC---CCcccCCCccCCCCCCCCCCCeeeeCCCC
Q psy13151 14 VVAVDIRRTDPHI---GNTPACSCVESYIGRPPNCRPECTIN-AECAG---NLACINERCKDPCPGSCGAHASCVVLNHT 86 (256)
Q Consensus 14 ~~~c~~~~~C~~~---~g~~~C~C~~G~~g~~~~C~~~C~~~-~~C~~---~~~C~~~~C~~~C~~~C~~~~~C~n~~g~ 86 (256)
..||..+|.|... .|++.|.|.+||+|. .|. .|... .+-.+ +-.|. .|. .+|. +.|... ++
T Consensus 149 er~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp--~C~-~Cg~eyfes~Rne~~lvCt--~Ch----~~C~--~~Csg~-~~ 216 (350)
T KOG4260|consen 149 ERPCFGNGSCHGDGSREGSGKCKCETGYTGP--LCR-YCGIEYFESSRNEQHLVCT--ACH----EGCL--GVCSGE-SS 216 (350)
T ss_pred cCCcCCCCcccCCCCCCCCCcccccCCCCCc--ccc-ccchHHHHhhcccccchhh--hhh----hhhh--cccCCC-CC
Confidence 4678888888533 488999999999998 332 11110 00000 00000 111 1221 134322 23
Q ss_pred ceee-CCCCCccCCCCCCccccccCC--CCCCCCCceEecCCceeeecCCCCcCCCCCCCCCCCCCCC--CCC-CCCCEE
Q psy13151 87 PRCT-CDPGFTGDPFSTCFYIQERNH--MVMGSRDQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCLP--SPC-GPNSIC 160 (256)
Q Consensus 87 ~~C~-C~~G~~g~~~~~C~~~~~c~~--~~~~~~~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~~--~~C-~~~~~C 160 (256)
-.|. |+.||..+.. .|.+|++|+. .++...+.|+|+.|+|.|...+||... +|+|.. ..| ..+..|
T Consensus 217 k~C~kCkkGW~lde~-gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g-------~d~C~~~~d~~~~kn~~c 288 (350)
T KOG4260|consen 217 KGCSKCKKGWKLDEE-GCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG-------VDECQFCADVCASKNRPC 288 (350)
T ss_pred CChhhhcccceeccc-ccccHHHHhcCCCCCChhheeecCCCceEecccccccCC-------hHHhhhhhhhcccCCCCc
Confidence 3454 9999988743 3999999986 457788999999999999999888653 566663 333 346789
Q ss_pred eecCCCCeeecCCCcccC
Q psy13151 161 RVIGNTPACSCVESYIGR 178 (256)
Q Consensus 161 ~~~~g~~~C~C~~G~~~~ 178 (256)
+|+.+.|+|+|..|+.-.
T Consensus 289 ~ni~~~~r~v~f~~~~~~ 306 (350)
T KOG4260|consen 289 MNIDGQYRCVCFSGLIII 306 (350)
T ss_pred ccCCccEEEEecccceee
Confidence 999999999999988643
No 11
>KOG1225|consensus
Probab=98.66 E-value=3.2e-07 Score=81.37 Aligned_cols=132 Identities=33% Similarity=0.708 Sum_probs=85.9
Q ss_pred ceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCCcccccc
Q psy13151 30 PACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGFTGDPFSTCFYIQER 109 (256)
Q Consensus 30 ~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~C~C~~G~~g~~~~~C~~~~~c 109 (256)
+.|.|..+|+|. .|. ...| ++.|..++.|+ ..+|.|++||+|. .|..+ .|
T Consensus 234 ~ic~c~~~~~g~------~c~-~~~C---------------~~~c~~~g~c~----~G~CIC~~Gf~G~---dC~e~-~C 283 (525)
T KOG1225|consen 234 GICECPEGYFGP------LCS-TIYC---------------PGGCTGRGQCV----EGRCICPPGFTGD---DCDEL-VC 283 (525)
T ss_pred ceeecCCceeCC------ccc-cccC---------------CCCCcccceEe----CCeEeCCCCCcCC---CCCcc-cC
Confidence 478888888887 443 2334 34466567787 5589999999998 56542 22
Q ss_pred CCCCCCCCCceEecCCceeeecCCCCcCCCCCCCCCCCCCCCCCCCCCCEEeecCCCCeeecCCCcccCCCCCcccCCcC
Q psy13151 110 NHMVMGSRDQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCLPSPCGPNSICRVIGNTPACSCVESYIGRPPNCRPECTIN 189 (256)
Q Consensus 110 ~~~~~~~~~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~~~~~c~~~C~~~ 189 (256)
... + ++.-.+..| .|.|.+||.+.. |. +..| +..|..++.|+ .-+|.|.+||.|. .|...
T Consensus 284 p~~-c--s~~g~~~~g--~CiC~~g~~G~d---Cs-~~~c-padC~g~G~Ci----~G~C~C~~Gy~G~------~C~~~ 343 (525)
T KOG1225|consen 284 PVD-C--SGGGVCVDG--ECICNPGYSGKD---CS-IRRC-PADCSGHGKCI----DGECLCDEGYTGE------LCIQR 343 (525)
T ss_pred Ccc-c--CCCceecCC--EeecCCCccccc---cc-cccC-CccCCCCCccc----CCceEeCCCCcCC------ccccc
Confidence 221 2 222233334 788999998752 11 1122 36888889996 4579999999998 56543
Q ss_pred CCCCCCCeeeCCcccCCCCCCCCCCCeEeeCCCCceeeCCCCCccCC
Q psy13151 190 AECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGFTGDP 236 (256)
Q Consensus 190 ~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 236 (256)
. |.+++.|++ . |.|..||+|.+
T Consensus 344 ~--------------------C~~~g~cv~----g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 344 A--------------------CSGGGQCVN----G-CKCKKGWRGPD 365 (525)
T ss_pred c--------------------cCCCceecc----C-ceeccCccCCC
Confidence 2 445566664 3 88999998876
No 12
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.30 E-value=3e-07 Score=51.30 Aligned_cols=30 Identities=40% Similarity=0.964 Sum_probs=24.8
Q ss_pred CCCCCCCeEeeCCCCceeeCCCCCccCCCC
Q psy13151 209 GSCGAHASCVVLNHTPRCTCDPGFTGDPFS 238 (256)
Q Consensus 209 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~ 238 (256)
+.|+.+|+|++++++|.|+|++||+|++..
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~ 35 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFF 35 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTC
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcC
Confidence 568999999999999999999999999865
No 13
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.28 E-value=5.6e-07 Score=52.41 Aligned_cols=26 Identities=27% Similarity=0.773 Sum_probs=24.0
Q ss_pred CCCCCCCeEeeCCCCceeeCCCCCcc
Q psy13151 209 GSCGAHASCVVLNHTPRCTCDPGFTG 234 (256)
Q Consensus 209 ~~C~~~~~C~~~~g~~~C~C~~G~~g 234 (256)
+.|..++.|+|+.|+|.|.|++||+.
T Consensus 10 ~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 10 HNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp SSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 67888999999999999999999993
No 14
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=98.19 E-value=1e-06 Score=51.33 Aligned_cols=33 Identities=24% Similarity=0.498 Sum_probs=29.1
Q ss_pred CCCCCCC--CCCCCCCEEeecCCCCeeecCCCccc
Q psy13151 145 SGNPCLP--SPCGPNSICRVIGNTPACSCVESYIG 177 (256)
Q Consensus 145 ~~~~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~~ 177 (256)
|||||.. +.|..++.|+|+.|+|+|.|++||..
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 4788883 67988899999999999999999984
No 15
>KOG1225|consensus
Probab=98.11 E-value=3.7e-05 Score=68.50 Aligned_cols=102 Identities=21% Similarity=0.570 Sum_probs=59.4
Q ss_pred CCceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCCcccc
Q psy13151 28 NTPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGFTGDPFSTCFYIQ 107 (256)
Q Consensus 28 g~~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~C~C~~G~~g~~~~~C~~~~ 107 (256)
...+|+|++||+|. +|+. ..| +..|+.++.+++ ..|.|++||+|. .|+...
T Consensus 263 ~~G~CIC~~Gf~G~--dC~e-----~~C---------------p~~cs~~g~~~~----g~CiC~~g~~G~---dCs~~~ 313 (525)
T KOG1225|consen 263 VEGRCICPPGFTGD--DCDE-----LVC---------------PVDCSGGGVCVD----GECICNPGYSGK---DCSIRR 313 (525)
T ss_pred eCCeEeCCCCCcCC--CCCc-----ccC---------------CcccCCCceecC----CEeecCCCcccc---cccccc
Confidence 44689999999998 5532 223 223555555552 267777777776 454222
Q ss_pred ccCCCCCCCCCceEecCCceeeecCCCCcCCCCCCCCCCCCCCCCCCCCCCEEeecCCCCeeecCCCcccC
Q psy13151 108 ERNHMVMGSRDQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCLPSPCGPNSICRVIGNTPACSCVESYIGR 178 (256)
Q Consensus 108 ~c~~~~~~~~~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~~ 178 (256)
| ...|..++.|+ ...|.|..||.+. .|....|.+++.|++ + |.|..||+|.
T Consensus 314 -c-padC~g~G~Ci----~G~C~C~~Gy~G~---------~C~~~~C~~~g~cv~--g---C~C~~Gw~G~ 364 (525)
T KOG1225|consen 314 -C-PADCSGHGKCI----DGECLCDEGYTGE---------LCIQRACSGGGQCVN--G---CKCKKGWRGP 364 (525)
T ss_pred -C-CccCCCCCccc----CCceEeCCCCcCC---------cccccccCCCceecc--C---ceeccCccCC
Confidence 1 23455555665 3456667777653 233223666666643 1 7777777765
No 16
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=98.04 E-value=1.2e-06 Score=48.89 Aligned_cols=31 Identities=23% Similarity=0.408 Sum_probs=25.1
Q ss_pred cCCCCCCccCCCCCCceeeCCCCCccCCCCC
Q psy13151 15 VAVDIRRTDPHIGNTPACSCVESYIGRPPNC 45 (256)
Q Consensus 15 ~~c~~~~~C~~~~g~~~C~C~~G~~g~~~~C 45 (256)
..|+.+++|+++.++|.|+|++||+|+|..|
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCCcCC
Confidence 4789999999999999999999999997654
No 17
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.97 E-value=6.1e-06 Score=41.28 Aligned_cols=23 Identities=35% Similarity=0.798 Sum_probs=18.6
Q ss_pred CceeeCCCCCccCCC-CCCccccC
Q psy13151 223 TPRCTCDPGFTGDPF-STCFYIQE 245 (256)
Q Consensus 223 ~~~C~C~~G~~g~~~-~~C~~i~e 245 (256)
||+|+|++||+.... ..|+||||
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 699999999995432 35999997
No 18
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.89 E-value=5.8e-06 Score=44.94 Aligned_cols=27 Identities=33% Similarity=0.885 Sum_probs=24.7
Q ss_pred CCCCCCCeEeeCC-CCceeeCCCCCccC
Q psy13151 209 GSCGAHASCVVLN-HTPRCTCDPGFTGD 235 (256)
Q Consensus 209 ~~C~~~~~C~~~~-g~~~C~C~~G~~g~ 235 (256)
++|.++|+|++.. ++|.|.|++||+|+
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 4688999999998 99999999999997
No 19
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.85 E-value=3e-06 Score=64.35 Aligned_cols=139 Identities=21% Similarity=0.411 Sum_probs=85.6
Q ss_pred CCeeeeCCCCceeeCCCCCccCCCCCCccccccCC-----CCCCCCCceEecC-----CceeeecCCCCcCCCCCCCCCC
Q psy13151 77 HASCVVLNHTPRCTCDPGFTGDPFSTCFYIQERNH-----MVMGSRDQMVLVV-----GQQLLERKSTAAPVAAEPRPSG 146 (256)
Q Consensus 77 ~~~C~n~~g~~~C~C~~G~~g~~~~~C~~~~~c~~-----~~~~~~~~c~~~~-----g~~~c~c~~g~~~~~~~~c~~~ 146 (256)
+|..+.+.+.|.|.|.+||......+|+...+|.. .+|+..+.|.+.. ..+.|.|..||.+..+ .|. .
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~-vCv-p 87 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG-VCV-P 87 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS-SEE-E
T ss_pred CCEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC-eEc-h
Confidence 67788888899999999998765446988777764 3467778898865 4689999999998732 111 1
Q ss_pred CCCCCCCCCCCCEEeecC---CCCeeecCCCcccC-CCCCcccCCcC--CCCCCCCeeeCCcccCCCCCCCCCCCeEeeC
Q psy13151 147 NPCLPSPCGPNSICRVIG---NTPACSCVESYIGR-PPNCRPECTIN--AECAGNLACINERCKDPCPGSCGAHASCVVL 220 (256)
Q Consensus 147 ~~C~~~~C~~~~~C~~~~---g~~~C~C~~G~~~~-~~~c~~~C~~~--~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~ 220 (256)
+.|....|+ .+.|+..+ ....|+|.-|+..+ ... |... ..|. -.|..+..|..+
T Consensus 88 ~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~k----Ctk~G~T~C~---------------LKCk~nE~CK~~ 147 (197)
T PF06247_consen 88 NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKK----CTKTGETKCS---------------LKCKENEECKLV 147 (197)
T ss_dssp GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTE----SEEEE-----------------------TTTEEEEEE
T ss_pred hhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCc----ccCCCcccee---------------eecCCCcceeee
Confidence 345556777 48896543 23589999999832 122 2211 2232 336677899999
Q ss_pred CCCceeeCCCCCccCCC
Q psy13151 221 NHTPRCTCDPGFTGDPF 237 (256)
Q Consensus 221 ~g~~~C~C~~G~~g~~~ 237 (256)
.+-|.|.+.+||.+++.
T Consensus 148 ~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 148 DGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp TTEEEEEE-TT-EEETT
T ss_pred CcEEEeecCCCCCCCCC
Confidence 99999999999986543
No 20
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.74 E-value=1.8e-05 Score=42.98 Aligned_cols=30 Identities=27% Similarity=0.779 Sum_probs=26.5
Q ss_pred CCCCCCCCCCEEeecC-CCCeeecCCCcccC
Q psy13151 149 CLPSPCGPNSICRVIG-NTPACSCVESYIGR 178 (256)
Q Consensus 149 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~~~ 178 (256)
|.+++|.++++|++.. +.|+|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 3457899999999999 88999999999985
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.70 E-value=6.6e-05 Score=42.58 Aligned_cols=33 Identities=27% Similarity=0.683 Sum_probs=28.8
Q ss_pred CCCCCC-CCCCCCCEEeecCCCCeeecCCCcc-cC
Q psy13151 146 GNPCLP-SPCGPNSICRVIGNTPACSCVESYI-GR 178 (256)
Q Consensus 146 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~-~~ 178 (256)
+++|.. .+|.+++.|+++.++|.|.|++||. +.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 577876 7898888999999999999999998 54
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.48 E-value=0.00018 Score=40.75 Aligned_cols=27 Identities=37% Similarity=0.898 Sum_probs=24.2
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCc-cC
Q psy13151 72 GSCGAHASCVVLNHTPRCTCDPGFT-GD 98 (256)
Q Consensus 72 ~~C~~~~~C~n~~g~~~C~C~~G~~-g~ 98 (256)
.+|.++++|+++.++|.|.|++||. |.
T Consensus 9 ~~C~~~~~C~~~~g~~~C~C~~g~~~g~ 36 (39)
T smart00179 9 NPCQNGGTCVNTVGSYRCECPPGYTDGR 36 (39)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCccCC
Confidence 5788888999999999999999998 65
No 23
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.32 E-value=0.00021 Score=35.75 Aligned_cols=23 Identities=35% Similarity=0.798 Sum_probs=14.4
Q ss_pred CceeeCCCCCccCCC-CCCccccc
Q psy13151 86 TPRCTCDPGFTGDPF-STCFYIQE 108 (256)
Q Consensus 86 ~~~C~C~~G~~g~~~-~~C~~~~~ 108 (256)
+|+|.|++||+..+. ..|++|+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 577888888875432 23666654
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.30 E-value=0.00046 Score=38.55 Aligned_cols=33 Identities=30% Similarity=0.703 Sum_probs=28.4
Q ss_pred CCCCCC-CCCCCCCEEeecCCCCeeecCCCcccC
Q psy13151 146 GNPCLP-SPCGPNSICRVIGNTPACSCVESYIGR 178 (256)
Q Consensus 146 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~~~ 178 (256)
+++|.. .+|..++.|+++.+.|+|.|++||.|.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 566766 688888899999999999999999885
No 25
>KOG1226|consensus
Probab=97.29 E-value=0.0021 Score=59.10 Aligned_cols=57 Identities=25% Similarity=0.783 Sum_probs=30.2
Q ss_pred ceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCCCccCCC-CCCCCCCCeeeeCCCCceeeCCCCCcc
Q psy13151 30 PACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPC-PGSCGAHASCVVLNHTPRCTCDPGFTG 97 (256)
Q Consensus 30 ~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C-~~~C~~~~~C~n~~g~~~C~C~~G~~g 97 (256)
++|.|.+||.|+ .| +|.. +...... ....|...= ..+|...|.|+ =.+|.|.+...+
T Consensus 478 G~C~C~~G~~G~--~C--EC~~--~~~ss~~-~~~~Cr~~~~~~vCSgrG~C~----CGqC~C~~~~~~ 535 (783)
T KOG1226|consen 478 GQCRCDEGWLGK--KC--ECST--DELSSSE-EEDKCRENSDSPVCSGRGDCV----CGQCVCHKPDNG 535 (783)
T ss_pred cceecCCCCCCC--cc--cCCc--cccCcHh-HHhhccCCCCCCCcCCCCcEe----CCceEecCCCCC
Confidence 457899999998 55 2221 1110000 000111000 34688888887 567888887663
No 26
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.27 E-value=0.00046 Score=38.57 Aligned_cols=27 Identities=37% Similarity=0.918 Sum_probs=24.0
Q ss_pred CCCCCCCeEeeCCCCceeeCCCCCccC
Q psy13151 209 GSCGAHASCVVLNHTPRCTCDPGFTGD 235 (256)
Q Consensus 209 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 235 (256)
.+|.+++.|++..++|.|.|++||.|.
T Consensus 9 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 35 (38)
T cd00054 9 NPCQNGGTCVNTVGSYRCSCPPGYTGR 35 (38)
T ss_pred CCcCCCCEeECCCCCeEeECCCCCcCC
Confidence 358778899999999999999999986
No 27
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=97.03 E-value=0.00092 Score=36.71 Aligned_cols=27 Identities=41% Similarity=0.994 Sum_probs=24.3
Q ss_pred CCCCCCCeEeeCCCCceeeCCCCCccC
Q psy13151 209 GSCGAHASCVVLNHTPRCTCDPGFTGD 235 (256)
Q Consensus 209 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 235 (256)
..|.+++.|++..++|.|.|+.||.|.
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCccc
Confidence 457778999999999999999999988
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.99 E-value=0.0013 Score=36.17 Aligned_cols=25 Identities=36% Similarity=1.012 Sum_probs=22.4
Q ss_pred CCCCCCeEeeCCCCceeeCCCCCccC
Q psy13151 210 SCGAHASCVVLNHTPRCTCDPGFTGD 235 (256)
Q Consensus 210 ~C~~~~~C~~~~g~~~C~C~~G~~g~ 235 (256)
.|.++ +|+++.++|.|.|++||.|.
T Consensus 7 ~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 7 PCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCC-EEECCCCCeEeECCCCCccC
Confidence 57777 89999999999999999994
No 29
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=96.96 E-value=0.00024 Score=54.19 Aligned_cols=104 Identities=19% Similarity=0.467 Sum_probs=68.1
Q ss_pred CCceEecCCceeeecCCCCcCCCCCCCCCCCCCC-----CCCCCCCCEEeecC-----CCCeeecCCCcccCCCCCc-cc
Q psy13151 117 RDQMVLVVGQQLLERKSTAAPVAAEPRPSGNPCL-----PSPCGPNSICRVIG-----NTPACSCVESYIGRPPNCR-PE 185 (256)
Q Consensus 117 ~~~c~~~~g~~~c~c~~g~~~~~~~~c~~~~~C~-----~~~C~~~~~C~~~~-----g~~~C~C~~G~~~~~~~c~-~~ 185 (256)
++..+.+.+.+.|.|.+||.+.+...|....+|. ..+|+.-++|++.. ..|+|.|.+||......|. +.
T Consensus 10 NG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~ 89 (197)
T PF06247_consen 10 NGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNK 89 (197)
T ss_dssp TEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGG
T ss_pred CCEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchhh
Confidence 4566778889999999999998777777777776 36899999998876 4689999999987644444 34
Q ss_pred CCcCCCCCCCCeeeCCcccCCCCCCCCCCCeEeeCC---CCceeeCCCCCccCCCCCCc
Q psy13151 186 CTINAECAGNLACINERCKDPCPGSCGAHASCVVLN---HTPRCTCDPGFTGDPFSTCF 241 (256)
Q Consensus 186 C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~---g~~~C~C~~G~~g~~~~~C~ 241 (256)
|.. ..|+ .|.|+..+ ....|.|.-|+..++...|.
T Consensus 90 C~~--------------------~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCt 127 (197)
T PF06247_consen 90 CNN--------------------KDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCT 127 (197)
T ss_dssp GSS-----------------------T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESE
T ss_pred cCc--------------------eecC-CCeEEecCCCCCCceeEeeeceEeccCCccc
Confidence 432 2354 56785443 24599999999844433354
No 30
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.81 E-value=0.0021 Score=35.36 Aligned_cols=26 Identities=38% Similarity=1.058 Sum_probs=22.9
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13151 72 GSCGAHASCVVLNHTPRCTCDPGFTGD 98 (256)
Q Consensus 72 ~~C~~~~~C~n~~g~~~C~C~~G~~g~ 98 (256)
.+|.++ +|+++.++|.|.|.+||.|.
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccC
Confidence 468777 99999999999999999984
No 31
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.67 E-value=0.0029 Score=34.58 Aligned_cols=28 Identities=25% Similarity=0.679 Sum_probs=24.5
Q ss_pred CCCCCCCCEEeecCCCCeeecCCCcccC
Q psy13151 151 PSPCGPNSICRVIGNTPACSCVESYIGR 178 (256)
Q Consensus 151 ~~~C~~~~~C~~~~g~~~C~C~~G~~~~ 178 (256)
..+|.+++.|+++.+.|.|.|+.||.+.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4678778999999999999999999876
No 32
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.09 E-value=0.0041 Score=34.54 Aligned_cols=22 Identities=32% Similarity=0.665 Sum_probs=17.7
Q ss_pred eEeeCCCCceeeCCCCCccCCC
Q psy13151 216 SCVVLNHTPRCTCDPGFTGDPF 237 (256)
Q Consensus 216 ~C~~~~g~~~C~C~~G~~g~~~ 237 (256)
.|++++++|+|.|++||+....
T Consensus 11 ~C~~~~g~~~C~C~~Gy~L~~D 32 (36)
T PF14670_consen 11 ICVNTPGSYRCSCPPGYKLAED 32 (36)
T ss_dssp EEEEETTSEEEE-STTEEE-TT
T ss_pred CCccCCCceEeECCCCCEECcC
Confidence 6999999999999999996643
No 33
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.97 E-value=0.011 Score=31.88 Aligned_cols=25 Identities=44% Similarity=1.070 Sum_probs=21.1
Q ss_pred CCCCCCeEeeCCCCceeeCCCCCccCC
Q psy13151 210 SCGAHASCVVLNHTPRCTCDPGFTGDP 236 (256)
Q Consensus 210 ~C~~~~~C~~~~g~~~C~C~~G~~g~~ 236 (256)
.|.++++|+.. ..+|+|.+||+|+.
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~ 31 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPD 31 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCC
Confidence 48889999865 47999999999983
No 34
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=95.84 E-value=0.011 Score=31.90 Aligned_cols=25 Identities=20% Similarity=0.231 Sum_probs=21.6
Q ss_pred cCCCCCCccCCCCCCceeeCCCCCccC
Q psy13151 15 VAVDIRRTDPHIGNTPACSCVESYIGR 41 (256)
Q Consensus 15 ~~c~~~~~C~~~~g~~~C~C~~G~~g~ 41 (256)
..|..+|+|+.. .++|+|.+||+|.
T Consensus 6 ~~C~~~G~C~~~--~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP--CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC--CCEEECCCCCcCC
Confidence 468899999876 5689999999997
No 35
>KOG0994|consensus
Probab=95.83 E-value=0.12 Score=50.02 Aligned_cols=57 Identities=21% Similarity=0.290 Sum_probs=30.6
Q ss_pred eEecCCceee-ecCCCCcCCCCCCCCCCCCCCCCCCCCC--------CEEe--ecCCCCeeecCCCcccC
Q psy13151 120 MVLVVGQQLL-ERKSTAAPVAAEPRPSGNPCLPSPCGPN--------SICR--VIGNTPACSCVESYIGR 178 (256)
Q Consensus 120 c~~~~g~~~c-~c~~g~~~~~~~~c~~~~~C~~~~C~~~--------~~C~--~~~g~~~C~C~~G~~~~ 178 (256)
|.+...++.| .|..||.+.+... ....|.+-+|..+ ..|. +......|.|.+||+|.
T Consensus 878 CqD~T~G~~CdrCl~GyyGdP~lg--~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G~ 945 (1758)
T KOG0994|consen 878 CQDSTTGHSCDRCLDGYYGDPRLG--SGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSGS 945 (1758)
T ss_pred ccccccccchhhhhccccCCcccC--CCCCCCCCCCCCCCccchhccccccccccccceeeecccCcccc
Confidence 4445555666 6888887653211 1223443333221 1232 12234679999999997
No 36
>KOG1226|consensus
Probab=95.75 E-value=0.083 Score=49.00 Aligned_cols=35 Identities=34% Similarity=0.879 Sum_probs=22.4
Q ss_pred CCCCCCCeEeeCCCCceeeCCCCCccCCCCCCc-cccCCCC
Q psy13151 209 GSCGAHASCVVLNHTPRCTCDPGFTGDPFSTCF-YIQESKH 248 (256)
Q Consensus 209 ~~C~~~~~C~~~~g~~~C~C~~G~~g~~~~~C~-~i~ec~~ 248 (256)
.+|..+++|.= .+|+|.+||+|..+. |. +.+-|..
T Consensus 555 ~lC~g~G~C~C----G~CvC~~GwtG~~C~-C~~std~C~~ 590 (783)
T KOG1226|consen 555 VLCGGHGRCEC----GRCVCNPGWTGSACN-CPLSTDTCES 590 (783)
T ss_pred cccCCCCeEeC----CcEEcCCCCccCCCC-CCCCCccccC
Confidence 35777777752 578888888888664 32 4444443
No 37
>KOG0994|consensus
Probab=95.73 E-value=0.033 Score=53.64 Aligned_cols=69 Identities=30% Similarity=0.758 Sum_probs=34.3
Q ss_pred CCceeeCCCCCccCCCCCccCCCCCC--CCCCCCcccCCCcc---CCC-CCCCCC-CCe---eeeCCCCceee-CCCCCc
Q psy13151 28 NTPACSCVESYIGRPPNCRPECTINA--ECAGNLACINERCK---DPC-PGSCGA-HAS---CVVLNHTPRCT-CDPGFT 96 (256)
Q Consensus 28 g~~~C~C~~G~~g~~~~C~~~C~~~~--~C~~~~~C~~~~C~---~~C-~~~C~~-~~~---C~n~~g~~~C~-C~~G~~ 96 (256)
.+..|.|.+||+|. .|+ .|.... +=..+++|..-.|. |.- +..|.. -+. |.-.....+|. |+.||.
T Consensus 932 ~~ivC~C~~GY~G~--RCe-~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~ 1008 (1758)
T KOG0994|consen 932 QQIVCHCQEGYSGS--RCE-ICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFY 1008 (1758)
T ss_pred cceeeecccCcccc--chh-hhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccch
Confidence 56789999999998 443 222110 11113344322222 111 233321 122 33333345675 999999
Q ss_pred cCC
Q psy13151 97 GDP 99 (256)
Q Consensus 97 g~~ 99 (256)
|+.
T Consensus 1009 GdA 1011 (1758)
T KOG0994|consen 1009 GDA 1011 (1758)
T ss_pred hHH
Confidence 974
No 38
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.71 E-value=0.0053 Score=25.93 Aligned_cols=11 Identities=27% Similarity=0.942 Sum_probs=9.0
Q ss_pred eeeCCCCCccC
Q psy13151 31 ACSCVESYIGR 41 (256)
Q Consensus 31 ~C~C~~G~~g~ 41 (256)
.|+|++||+|.
T Consensus 1 ~C~C~~G~~G~ 11 (13)
T PF12661_consen 1 TCQCPPGWTGP 11 (13)
T ss_dssp EEEE-TTEETT
T ss_pred CccCcCCCcCC
Confidence 58999999997
No 39
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.18 E-value=0.019 Score=31.82 Aligned_cols=21 Identities=10% Similarity=-0.164 Sum_probs=8.4
Q ss_pred CceEecCCceeeecCCCCcCC
Q psy13151 118 DQMVLVVGQQLLERKSTAAPV 138 (256)
Q Consensus 118 ~~c~~~~g~~~c~c~~g~~~~ 138 (256)
..|+++++++.|.|..||.+.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLA 30 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-
T ss_pred CCCccCCCceEeECCCCCEEC
Confidence 344444444444444444443
No 40
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=94.38 E-value=0.04 Score=29.72 Aligned_cols=15 Identities=33% Similarity=0.609 Sum_probs=12.8
Q ss_pred ceeeCCCCCccCCCC
Q psy13151 224 PRCTCDPGFTGDPFS 238 (256)
Q Consensus 224 ~~C~C~~G~~g~~~~ 238 (256)
+.|.|++||+.+...
T Consensus 18 ~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 18 GQCFCPEGYILDEGS 32 (34)
T ss_pred CceeCCCceEecCCc
Confidence 689999999988655
No 41
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.61 E-value=0.022 Score=31.57 Aligned_cols=29 Identities=24% Similarity=0.527 Sum_probs=21.2
Q ss_pred CCCCCCeEeeCC-CCceeeCCCCCccCCCC
Q psy13151 210 SCGAHASCVVLN-HTPRCTCDPGFTGDPFS 238 (256)
Q Consensus 210 ~C~~~~~C~~~~-g~~~C~C~~G~~g~~~~ 238 (256)
.|..|+.|++.. |++.|.|..||..++..
T Consensus 6 ~cP~NA~C~~~~dG~eecrCllgyk~~~~~ 35 (37)
T PF12946_consen 6 KCPANAGCFRYDDGSEECRCLLGYKKVGGK 35 (37)
T ss_dssp ---TTEEEEEETTSEEEEEE-TTEEEETTE
T ss_pred cCCCCcccEEcCCCCEEEEeeCCccccCCC
Confidence 467789998776 99999999999876554
No 42
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=93.33 E-value=0.035 Score=30.76 Aligned_cols=31 Identities=23% Similarity=0.596 Sum_probs=21.1
Q ss_pred CCCCCCCCCCEEeecC-CCCeeecCCCcccCC
Q psy13151 149 CLPSPCGPNSICRVIG-NTPACSCVESYIGRP 179 (256)
Q Consensus 149 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~~~~ 179 (256)
|....|..++.|++.. |++.|+|..||..++
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 3345677889998877 889999999998653
No 43
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=89.97 E-value=0.36 Score=38.85 Aligned_cols=37 Identities=19% Similarity=0.417 Sum_probs=30.2
Q ss_pred CCCCCCCCCCCC--CCCCCCCEEeecCCCCeeecCCCcccC
Q psy13151 140 AEPRPSGNPCLP--SPCGPNSICRVIGNTPACSCVESYIGR 178 (256)
Q Consensus 140 ~~~c~~~~~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~~~ 178 (256)
...|.++++|.. +.|.. .|.++.|+|.|.|.+||.+.
T Consensus 181 ~~~C~~~~~C~~~~~~c~~--~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 181 GKICVVPDLCATLSHVCQQ--VCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cccCcCchhhcCCCCCccc--eEEcCCCCEEeECCCCccCC
Confidence 366778888973 56764 89999999999999999864
No 44
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=88.94 E-value=1.1 Score=26.89 Aligned_cols=24 Identities=33% Similarity=0.740 Sum_probs=17.2
Q ss_pred CCCCCCeEeeCCCCceeeCCCCCccCCC
Q psy13151 210 SCGAHASCVVLNHTPRCTCDPGFTGDPF 237 (256)
Q Consensus 210 ~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 237 (256)
.|..++.|++ .+|+|++||+..+.
T Consensus 27 qC~~~s~C~~----g~C~C~~g~~~~~~ 50 (52)
T PF01683_consen 27 QCIGGSVCVN----GRCQCPPGYVEVGG 50 (52)
T ss_pred CCCCcCEEcC----CEeECCCCCEecCC
Confidence 3555677854 68999999986544
No 45
>smart00051 DSL delta serrate ligand.
Probab=88.12 E-value=1 Score=28.42 Aligned_cols=47 Identities=23% Similarity=0.452 Sum_probs=28.7
Q ss_pred CCeeecCCCcccCCCCCcccCCcCCCCCCCCeeeCCcccCCCCCCCCCCCeEeeCCCCceeeCCCCCccCC
Q psy13151 166 TPACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGFTGDP 236 (256)
Q Consensus 166 ~~~C~C~~G~~~~~~~c~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~ 236 (256)
.++-.|.++|.|. .|...|...+ ....+.+|.. ...++|.+||+|..
T Consensus 16 ~~rv~C~~~~yG~--~C~~~C~~~~-------------------d~~~~~~Cd~---~G~~~C~~Gw~G~~ 62 (63)
T smart00051 16 QIRVTCDENYYGE--GCNKFCRPRD-------------------DFFGHYTCDE---NGNKGCLEGWMGPY 62 (63)
T ss_pred EEEeeCCCCCcCC--ccCCEeCcCc-------------------cccCCccCCc---CCCEecCCCCcCCC
Confidence 3556799999988 4443443211 1234556632 24678999999973
No 46
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=87.75 E-value=0.57 Score=37.68 Aligned_cols=37 Identities=11% Similarity=0.033 Sum_probs=32.2
Q ss_pred CCccccccCCCCCCCCCceEecCCceeeecCCCCcCC
Q psy13151 102 TCFYIQERNHMVMGSRDQMVLVVGQQLLERKSTAAPV 138 (256)
Q Consensus 102 ~C~~~~~c~~~~~~~~~~c~~~~g~~~c~c~~g~~~~ 138 (256)
.|.++++|...+..+...|.++.|+|.|.|..||.+.
T Consensus 183 ~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~ 219 (224)
T cd01475 183 ICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALL 219 (224)
T ss_pred cCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCC
Confidence 4888889987666677899999999999999999875
No 47
>PHA02887 EGF-like protein; Provisional
Probab=87.64 E-value=0.61 Score=32.94 Aligned_cols=27 Identities=26% Similarity=0.577 Sum_probs=20.5
Q ss_pred CCCCCCCeEee--CCCCceeeCCCCCccCC
Q psy13151 209 GSCGAHASCVV--LNHTPRCTCDPGFTGDP 236 (256)
Q Consensus 209 ~~C~~~~~C~~--~~g~~~C~C~~G~~g~~ 236 (256)
+.|. ||+|.- ....+.|.|+.||+|..
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG~R 120 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTGIR 120 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCcccCC
Confidence 4565 578843 44669999999999994
No 48
>smart00051 DSL delta serrate ligand.
Probab=87.12 E-value=1.3 Score=27.98 Aligned_cols=45 Identities=24% Similarity=0.511 Sum_probs=27.6
Q ss_pred ceeeCCCCCccCCCCCccCCCCCCCCCCCCcccCCCccCCCCCCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13151 30 PACSCVESYIGRPPNCRPECTINAECAGNLACINERCKDPCPGSCGAHASCVVLNHTPRCTCDPGFTGD 98 (256)
Q Consensus 30 ~~C~C~~G~~g~~~~C~~~C~~~~~C~~~~~C~~~~C~~~C~~~C~~~~~C~n~~g~~~C~C~~G~~g~ 98 (256)
+.=.|.++|.|. .|...|...++ ...+..|.. ...++|.+||+|.
T Consensus 17 ~rv~C~~~~yG~--~C~~~C~~~~d-------------------~~~~~~Cd~---~G~~~C~~Gw~G~ 61 (63)
T smart00051 17 IRVTCDENYYGE--GCNKFCRPRDD-------------------FFGHYTCDE---NGNKGCLEGWMGP 61 (63)
T ss_pred EEeeCCCCCcCC--ccCCEeCcCcc-------------------ccCCccCCc---CCCEecCCCCcCC
Confidence 344688999998 55444432222 233555632 3468899999987
No 49
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=83.38 E-value=1.4 Score=31.17 Aligned_cols=26 Identities=42% Similarity=1.109 Sum_probs=20.4
Q ss_pred CCCCCCCeeeeCCCCceeeCCCCCccC
Q psy13151 72 GSCGAHASCVVLNHTPRCTCDPGFTGD 98 (256)
Q Consensus 72 ~~C~~~~~C~n~~g~~~C~C~~G~~g~ 98 (256)
..|+.++.|. ......|.|.+||...
T Consensus 84 ~~CG~~g~C~-~~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICN-SNNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeC-CCCCCceECCCCcCCC
Confidence 5789999994 4456689999999754
No 50
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=82.38 E-value=1.3 Score=31.86 Aligned_cols=32 Identities=28% Similarity=0.627 Sum_probs=22.9
Q ss_pred CCCCCCCeEee--CCCCceeeCCCCCccCCCCCCcccc
Q psy13151 209 GSCGAHASCVV--LNHTPRCTCDPGFTGDPFSTCFYIQ 244 (256)
Q Consensus 209 ~~C~~~~~C~~--~~g~~~C~C~~G~~g~~~~~C~~i~ 244 (256)
+.|.+ |+|.- ....+.|.|..||+|. +|+-.+
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh~d 84 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQHVV 84 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCcccc---ccccee
Confidence 44654 48843 4478999999999999 565444
No 51
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=81.26 E-value=1.9 Score=30.47 Aligned_cols=26 Identities=42% Similarity=1.109 Sum_probs=20.4
Q ss_pred CCCCCCCeEeeCCCCceeeCCCCCccC
Q psy13151 209 GSCGAHASCVVLNHTPRCTCDPGFTGD 235 (256)
Q Consensus 209 ~~C~~~~~C~~~~g~~~C~C~~G~~g~ 235 (256)
..|+.++.|. ...+..|.|.+||...
T Consensus 84 ~~CG~~g~C~-~~~~~~C~Cl~GF~P~ 109 (110)
T PF00954_consen 84 GFCGPNGICN-SNNSPKCSCLPGFEPK 109 (110)
T ss_pred cccCCccEeC-CCCCCceECCCCcCCC
Confidence 5688999994 3456789999999864
No 52
>KOG1836|consensus
Probab=78.26 E-value=5.4 Score=41.55 Aligned_cols=32 Identities=28% Similarity=0.607 Sum_probs=21.1
Q ss_pred CCCCCeEeeC--CCCceee-CCCCCccCCCCCCcc
Q psy13151 211 CGAHASCVVL--NHTPRCT-CDPGFTGDPFSTCFY 242 (256)
Q Consensus 211 C~~~~~C~~~--~g~~~C~-C~~G~~g~~~~~C~~ 242 (256)
|.+++.|..+ .....|. |++||+|..+..|.+
T Consensus 782 Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~c~d 816 (1705)
T KOG1836|consen 782 CPNGGACGQTPEILEVVCKNCPPGYTGLRCEECAD 816 (1705)
T ss_pred CCCChhhcCcCcccceecCCCCCCCcccccccCCC
Confidence 4444555433 3567888 999999996665543
No 53
>PHA02887 EGF-like protein; Provisional
Probab=77.37 E-value=2.8 Score=29.68 Aligned_cols=30 Identities=27% Similarity=0.686 Sum_probs=22.9
Q ss_pred CCCCCCeeeeCC--CCceeeCCCCCccCCCCCCccc
Q psy13151 73 SCGAHASCVVLN--HTPRCTCDPGFTGDPFSTCFYI 106 (256)
Q Consensus 73 ~C~~~~~C~n~~--g~~~C~C~~G~~g~~~~~C~~~ 106 (256)
-|- +|.|.-.. ..+.|.|.+||+|. +|+.+
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG~---RCE~v 124 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTGI---RCDEV 124 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCcccC---CCCcc
Confidence 465 57886543 56899999999998 78754
No 54
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=72.39 E-value=3.9 Score=29.54 Aligned_cols=31 Identities=26% Similarity=0.640 Sum_probs=23.6
Q ss_pred CCCCCCeeeeCC--CCceeeCCCCCccCCCCCCcccc
Q psy13151 73 SCGAHASCVVLN--HTPRCTCDPGFTGDPFSTCFYIQ 107 (256)
Q Consensus 73 ~C~~~~~C~n~~--g~~~C~C~~G~~g~~~~~C~~~~ 107 (256)
-|.+ |.|.-.. ..+.|.|..||+|. +|+..+
T Consensus 52 YClH-G~C~yI~dl~~~~CrC~~GYtGe---RCEh~d 84 (139)
T PHA03099 52 YCLH-GDCIHARDIDGMYCRCSHGYTGI---RCQHVV 84 (139)
T ss_pred EeEC-CEEEeeccCCCceeECCCCcccc---ccccee
Confidence 3654 4786544 68899999999999 898654
No 55
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=67.26 E-value=4.4 Score=23.84 Aligned_cols=26 Identities=35% Similarity=0.745 Sum_probs=17.9
Q ss_pred CeEeeCCCCceeeCCCCCccCCCCCCcc
Q psy13151 215 ASCVVLNHTPRCTCDPGFTGDPFSTCFY 242 (256)
Q Consensus 215 ~~C~~~~g~~~C~C~~G~~g~~~~~C~~ 242 (256)
.+|.. ....|.|+++|+|....+|.+
T Consensus 11 ~~C~~--~~G~C~C~~~~~G~~C~~C~~ 36 (49)
T PF00053_consen 11 QTCDP--STGQCVCKPGTTGPRCDQCKP 36 (49)
T ss_dssp SSEEE--TCEEESBSTTEESTTS-EE-T
T ss_pred CcccC--CCCEEeccccccCCcCcCCCC
Confidence 36655 347899999999997665554
No 56
>KOG1836|consensus
Probab=65.80 E-value=29 Score=36.55 Aligned_cols=30 Identities=30% Similarity=0.671 Sum_probs=20.5
Q ss_pred CCCCCCeeeeCC--CCceee-CCCCCccCCCCCCcc
Q psy13151 73 SCGAHASCVVLN--HTPRCT-CDPGFTGDPFSTCFY 105 (256)
Q Consensus 73 ~C~~~~~C~n~~--g~~~C~-C~~G~~g~~~~~C~~ 105 (256)
+|..++.|..+. ....|. |++||+|. +|+.
T Consensus 781 ~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~---rCe~ 813 (1705)
T KOG1836|consen 781 PCPNGGACGQTPEILEVVCKNCPPGYTGL---RCEE 813 (1705)
T ss_pred CCCCChhhcCcCcccceecCCCCCCCccc---cccc
Confidence 455555565443 466898 99999998 6653
No 57
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=59.38 E-value=12 Score=22.07 Aligned_cols=19 Identities=37% Similarity=0.845 Sum_probs=14.5
Q ss_pred CceeeCCCCCccCCCCCCc
Q psy13151 223 TPRCTCDPGFTGDPFSTCF 241 (256)
Q Consensus 223 ~~~C~C~~G~~g~~~~~C~ 241 (256)
..+|.|+++|+|....+|.
T Consensus 18 ~G~C~C~~~~~G~~C~~C~ 36 (50)
T cd00055 18 TGQCECKPNTTGRRCDRCA 36 (50)
T ss_pred CCEEeCCCcCCCCCCCCCC
Confidence 4678899999998665554
No 58
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=57.69 E-value=3.2 Score=25.10 Aligned_cols=22 Identities=27% Similarity=0.694 Sum_probs=17.5
Q ss_pred eeeCCCCCccCCCCCCccccCC
Q psy13151 225 RCTCDPGFTGDPFSTCFYIQES 246 (256)
Q Consensus 225 ~C~C~~G~~g~~~~~C~~i~ec 246 (256)
-|.|++||..+....|+.+.+|
T Consensus 34 gC~C~~G~v~~~~~~CV~~~~C 55 (55)
T PF01826_consen 34 GCFCPPGYVRNDNGRCVPPSEC 55 (55)
T ss_dssp EEEETTTEEEETTSEEEEGGGC
T ss_pred cCCCCCCeeEcCCCCEEcHHHC
Confidence 4999999997766568887765
No 59
>KOG3607|consensus
Probab=54.52 E-value=15 Score=35.08 Aligned_cols=45 Identities=33% Similarity=0.919 Sum_probs=36.5
Q ss_pred CCCCCCCeeeCCccc-------CCCCCCCCCCCeEeeCCCCceeeCCCCCccCCC
Q psy13151 190 AECAGNLACINERCK-------DPCPGSCGAHASCVVLNHTPRCTCDPGFTGDPF 237 (256)
Q Consensus 190 ~~C~~~~~C~~~~C~-------~~C~~~C~~~~~C~~~~g~~~C~C~~G~~g~~~ 237 (256)
..|..+..|++.+|+ +.|+..|+.+++|.| .+.|+|.+||.+..+
T Consensus 604 t~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn---~~~ChC~~gwapp~C 655 (716)
T KOG3607|consen 604 TSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNN---ELNCHCEPGWAPPFC 655 (716)
T ss_pred CccCCCceecCCcchhhhhhcccccccccCCCcccCC---CcceeeCCCCCCCcc
Confidence 567778888888883 566788999999976 479999999988754
No 60
>KOG3516|consensus
Probab=52.06 E-value=12 Score=37.13 Aligned_cols=38 Identities=26% Similarity=0.638 Sum_probs=33.1
Q ss_pred CCCCCCCCCCCCCCCCCCEEeecCCCCeeecC-CCcccC
Q psy13151 141 EPRPSGNPCLPSPCGPNSICRVIGNTPACSCV-ESYIGR 178 (256)
Q Consensus 141 ~~c~~~~~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~~~ 178 (256)
..|.-++.|.+++|.+++.|.-+-..|.|.|. .||.|.
T Consensus 540 d~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 540 DMCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred cccccccccCCccccCCCcccccccceeEeccccccccc
Confidence 55666788889999999999887788999998 899987
No 61
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=50.73 E-value=18 Score=20.93 Aligned_cols=19 Identities=37% Similarity=0.885 Sum_probs=14.0
Q ss_pred CceeeCCCCCccCCCCCCc
Q psy13151 223 TPRCTCDPGFTGDPFSTCF 241 (256)
Q Consensus 223 ~~~C~C~~G~~g~~~~~C~ 241 (256)
+..|.|+++|+|....+|.
T Consensus 17 ~G~C~C~~~~~G~~C~~C~ 35 (46)
T smart00180 17 TGQCECKPNVTGRRCDRCA 35 (46)
T ss_pred CCEEECCCCCCCCCCCcCC
Confidence 3588899999998555554
No 62
>KOG0196|consensus
Probab=48.70 E-value=59 Score=31.50 Aligned_cols=19 Identities=21% Similarity=0.513 Sum_probs=14.7
Q ss_pred eCCCCceeeCCCCCccCCC
Q psy13151 219 VLNHTPRCTCDPGFTGDPF 237 (256)
Q Consensus 219 ~~~g~~~C~C~~G~~g~~~ 237 (256)
..+|+..|.|..||.....
T Consensus 303 ~~ega~~C~C~~gyyRA~~ 321 (996)
T KOG0196|consen 303 SSEGATSCTCENGYYRADS 321 (996)
T ss_pred CCCCCCcccccCCcccCCC
Confidence 4578899999999975433
No 63
>KOG3516|consensus
Probab=45.73 E-value=15 Score=36.53 Aligned_cols=31 Identities=29% Similarity=0.833 Sum_probs=27.2
Q ss_pred CCC-CCCCCCCCeeeeCCCCceeeCC-CCCccC
Q psy13151 68 DPC-PGSCGAHASCVVLNHTPRCTCD-PGFTGD 98 (256)
Q Consensus 68 ~~C-~~~C~~~~~C~n~~g~~~C~C~-~G~~g~ 98 (256)
+.| |++|.+++.|.-.-..|.|.|. .||.|.
T Consensus 546 drClPN~CehgG~C~Qs~~~f~C~C~~TGY~Ga 578 (1306)
T KOG3516|consen 546 DRCLPNPCEHGGKCSQSWDDFECNCELTGYKGA 578 (1306)
T ss_pred cccCCccccCCCcccccccceeEeccccccccc
Confidence 455 8899999999987789999999 899987
No 64
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=40.21 E-value=28 Score=24.32 Aligned_cols=25 Identities=32% Similarity=0.771 Sum_probs=18.6
Q ss_pred CCCCCCCeEeeCC-----CCceeeCCCCCc
Q psy13151 209 GSCGAHASCVVLN-----HTPRCTCDPGFT 233 (256)
Q Consensus 209 ~~C~~~~~C~~~~-----g~~~C~C~~G~~ 233 (256)
+.|..||.|++.. .=|.|.|.+.+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T~~ 42 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPTVV 42 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeecccc
Confidence 5688889997762 348999998554
No 65
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=39.70 E-value=8.6 Score=24.21 Aligned_cols=20 Identities=35% Similarity=0.728 Sum_probs=7.3
Q ss_pred CCeeecCCCcccCCCCCcccCC
Q psy13151 166 TPACSCVESYIGRPPNCRPECT 187 (256)
Q Consensus 166 ~~~C~C~~G~~~~~~~c~~~C~ 187 (256)
.++-.|.+.|.|. .|...|.
T Consensus 16 ~~rv~C~~nyyG~--~C~~~C~ 35 (63)
T PF01414_consen 16 RIRVVCDENYYGP--NCSKFCK 35 (63)
T ss_dssp -------TTEETT--TT-EE--
T ss_pred EEEEECCCCCCCc--cccCCcC
Confidence 4567889999998 5554443
No 66
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=36.20 E-value=18 Score=21.98 Aligned_cols=28 Identities=25% Similarity=0.752 Sum_probs=15.7
Q ss_pred CCCCCeEe----eCCCCceeeCCCCCccCCCC
Q psy13151 211 CGAHASCV----VLNHTPRCTCDPGFTGDPFS 238 (256)
Q Consensus 211 C~~~~~C~----~~~g~~~C~C~~G~~g~~~~ 238 (256)
|..||+-. ...|...|.|..-|.|.+++
T Consensus 19 CSGHGr~flDg~~~dG~p~CECn~Cy~GpdCS 50 (56)
T PF04863_consen 19 CSGHGRAFLDGLIADGSPVCECNSCYGGPDCS 50 (56)
T ss_dssp -TTSEE--TTS-EETTEE--EE-TTEESTTS-
T ss_pred cCCCCeeeeccccccCCccccccCCcCCCCcc
Confidence 55555552 24677999999999999655
No 67
>KOG3514|consensus
Probab=33.58 E-value=30 Score=34.30 Aligned_cols=35 Identities=17% Similarity=0.230 Sum_probs=29.5
Q ss_pred eeeeccCCCCCCccCCCCCCceeeC-CCCCccCCCCCc
Q psy13151 10 SYLVVVAVDIRRTDPHIGNTPACSC-VESYIGRPPNCR 46 (256)
Q Consensus 10 ~~~~~~~c~~~~~C~~~~g~~~C~C-~~G~~g~~~~C~ 46 (256)
+-+..+||.+.|+|....++|.|-| ..||.|. .|+
T Consensus 624 ~~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~--~Ce 659 (1591)
T KOG3514|consen 624 KICESNPCQNGGKCSEGWNRFICDCSGTGFEGR--TCE 659 (1591)
T ss_pred cccCCCcccCCCCccccccccccccccCcccCc--ccc
Confidence 3456899999999999999999999 4589887 554
No 68
>KOG3607|consensus
Probab=31.33 E-value=58 Score=31.22 Aligned_cols=47 Identities=36% Similarity=1.022 Sum_probs=36.3
Q ss_pred CCCCCCCcccCCCcc-------CCCCCCCCCCCeeeeCCCCceeeCCCCCccCCCCCCcc
Q psy13151 53 AECAGNLACINERCK-------DPCPGSCGAHASCVVLNHTPRCTCDPGFTGDPFSTCFY 105 (256)
Q Consensus 53 ~~C~~~~~C~~~~C~-------~~C~~~C~~~~~C~n~~g~~~C~C~~G~~g~~~~~C~~ 105 (256)
..|.++..|++.+|+ ..|+..|..++.|.| .+.|.|.+||.+. .|..
T Consensus 604 t~Cg~~~vC~~~~C~~~~v~~~~~~~~~C~g~GVCnn---~~~ChC~~gwapp---~C~~ 657 (716)
T KOG3607|consen 604 TSCGPGMICINHRCLSASVLNSSCCPTTCNGHGVCNN---ELNCHCEPGWAPP---FCFI 657 (716)
T ss_pred CccCCCceecCCcchhhhhhcccccccccCCCcccCC---CcceeeCCCCCCC---cccc
Confidence 358888888888885 445666888999974 5689999999987 5764
No 69
>KOG3509|consensus
Probab=24.09 E-value=1.8e+02 Score=29.01 Aligned_cols=32 Identities=25% Similarity=0.623 Sum_probs=26.1
Q ss_pred CCCCCCCCCCCCEEeecCCCCeeecCCCcccC
Q psy13151 147 NPCLPSPCGPNSICRVIGNTPACSCVESYIGR 178 (256)
Q Consensus 147 ~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~~ 178 (256)
++|...++.+.+.|..+.....|.|++||.|+
T Consensus 407 ~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~ 438 (964)
T KOG3509|consen 407 DVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGD 438 (964)
T ss_pred CccccccCCCCccccccccccceeccccccCc
Confidence 45556778777788778788899999999998
No 70
>KOG1218|consensus
Probab=23.01 E-value=4.4e+02 Score=21.84 Aligned_cols=13 Identities=23% Similarity=0.866 Sum_probs=10.4
Q ss_pred CCeeecCCCcccC
Q psy13151 166 TPACSCVESYIGR 178 (256)
Q Consensus 166 ~~~C~C~~G~~~~ 178 (256)
...|.|.+||.+.
T Consensus 161 ~~~c~c~~g~~g~ 173 (316)
T KOG1218|consen 161 NGICTCQPGFVGV 173 (316)
T ss_pred CCceeccCCcccc
Confidence 3467899999987
No 71
>KOG1218|consensus
Probab=22.83 E-value=4.4e+02 Score=21.82 Aligned_cols=15 Identities=40% Similarity=0.942 Sum_probs=12.3
Q ss_pred CCCceeeCCCCCccC
Q psy13151 27 GNTPACSCVESYIGR 41 (256)
Q Consensus 27 ~g~~~C~C~~G~~g~ 41 (256)
.....|.|.++|+|.
T Consensus 12 ~~~~~c~c~~~~~g~ 26 (316)
T KOG1218|consen 12 GGSGQCFCDPGYTGR 26 (316)
T ss_pred CCCCceecCCCcccc
Confidence 356789999999994
No 72
>KOG3512|consensus
Probab=21.49 E-value=4.4e+02 Score=24.01 Aligned_cols=54 Identities=20% Similarity=0.423 Sum_probs=28.5
Q ss_pred cCCceeeecCCCCcCCCCCCCCCCCCCCCCCCCC----CCEEeecCCCCeeecCCCcccC
Q psy13151 123 VVGQQLLERKSTAAPVAAEPRPSGNPCLPSPCGP----NSICRVIGNTPACSCVESYIGR 178 (256)
Q Consensus 123 ~~g~~~c~c~~g~~~~~~~~c~~~~~C~~~~C~~----~~~C~~~~g~~~C~C~~G~~~~ 178 (256)
+.|-+--.|..||.......-.+.+.|..-.|++ +.+|..+ +..|.|++|-+|.
T Consensus 368 TaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~--tGqCpCkeGvtG~ 425 (592)
T KOG3512|consen 368 TAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQT--TGQCPCKEGVTGL 425 (592)
T ss_pred CCCcccccccCccccCCCCCCchhhhhhhcCCccccccccccccc--CCcccCCCCCccc
Confidence 4444444578887665444444444454433332 2234222 3467888887776
Done!