Query psy8491
Match_columns 273
No_of_seqs 215 out of 1559
Neff 7.1
Searched_HMMs 46136
Date Fri Aug 16 20:14:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8491.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8491hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1225|consensus 99.2 9.7E-11 2.1E-15 114.0 10.9 126 26-194 233-364 (525)
2 KOG1225|consensus 98.8 2.4E-08 5.3E-13 97.4 8.7 72 25-103 263-338 (525)
3 KOG1214|consensus 98.0 1.3E-05 2.8E-10 80.9 8.1 135 53-201 705-866 (1289)
4 KOG1214|consensus 97.7 5.9E-05 1.3E-09 76.3 5.6 91 18-111 748-865 (1289)
5 KOG1226|consensus 97.6 0.00028 6E-09 71.2 9.6 118 25-164 476-620 (783)
6 KOG4289|consensus 97.5 6.9E-05 1.5E-09 79.3 4.2 75 22-99 1217-1308(2531)
7 KOG4260|consensus 97.5 7.1E-05 1.5E-09 67.2 3.6 148 30-200 131-312 (350)
8 KOG1226|consensus 97.4 0.00075 1.6E-08 68.1 9.7 47 153-202 569-623 (783)
9 KOG1219|consensus 97.4 0.00028 6E-09 77.7 6.0 94 81-200 3870-3978(4289)
10 KOG1217|consensus 97.2 0.0043 9.2E-08 59.0 11.3 156 22-195 105-305 (487)
11 KOG4289|consensus 96.9 0.00081 1.7E-08 71.6 4.3 73 60-137 1222-1308(2531)
12 KOG1219|consensus 96.7 0.0023 5E-08 70.9 5.5 80 49-137 3870-3970(4289)
13 PF07974 EGF_2: EGF-like domai 96.2 0.0073 1.6E-07 37.0 3.3 20 51-70 8-29 (32)
14 KOG1217|consensus 96.1 0.062 1.3E-06 51.0 11.3 155 19-193 184-388 (487)
15 KOG0994|consensus 96.0 0.021 4.6E-07 60.1 7.8 44 153-200 889-949 (1758)
16 smart00051 DSL delta serrate l 95.6 0.011 2.5E-07 41.9 2.9 43 153-198 20-63 (63)
17 PF06247 Plasmod_Pvs28: Plasmo 94.9 0.01 2.2E-07 50.8 1.1 93 15-109 7-126 (197)
18 PF07974 EGF_2: EGF-like domai 94.7 0.033 7.2E-07 34.1 2.6 21 176-198 11-32 (32)
19 smart00051 DSL delta serrate l 94.5 0.034 7.4E-07 39.4 2.8 42 26-70 16-60 (63)
20 KOG0994|consensus 93.8 0.13 2.9E-06 54.5 6.3 48 148-200 1084-1147(1758)
21 PF12661 hEGF: Human growth fa 93.7 0.033 7.1E-07 27.2 0.9 10 184-193 1-10 (13)
22 PF12947 EGF_3: EGF domain; I 93.6 0.041 8.9E-07 34.6 1.4 23 176-198 11-36 (36)
23 PF00008 EGF: EGF-like domain 93.0 0.044 9.5E-07 33.4 0.8 21 173-193 5-30 (32)
24 KOG4260|consensus 91.4 0.17 3.7E-06 45.9 3.0 46 153-200 221-276 (350)
25 PF00008 EGF: EGF-like domain 90.9 0.086 1.9E-06 32.0 0.4 21 50-70 5-30 (32)
26 KOG1218|consensus 90.8 11 0.00025 33.9 14.6 54 22-79 10-65 (316)
27 PF12947 EGF_3: EGF domain; I 90.4 0.12 2.6E-06 32.5 0.7 20 51-70 8-31 (36)
28 PF12662 cEGF: Complement Clr- 90.1 0.23 5E-06 28.4 1.7 10 61-70 3-12 (24)
29 PF14670 FXa_inhibition: Coagu 89.6 0.23 4.9E-06 31.2 1.6 25 15-39 7-31 (36)
30 PF12662 cEGF: Complement Clr- 88.6 0.34 7.5E-06 27.7 1.7 21 26-46 1-22 (24)
31 PF01414 DSL: Delta serrate li 87.4 0.23 5E-06 35.2 0.6 44 153-198 20-63 (63)
32 PF07645 EGF_CA: Calcium-bindi 86.3 0.51 1.1E-05 30.3 1.8 10 60-69 25-34 (42)
33 smart00179 EGF_CA Calcium-bind 86.0 0.81 1.8E-05 27.9 2.5 19 51-69 11-33 (39)
34 smart00179 EGF_CA Calcium-bind 85.9 1.2 2.6E-05 27.2 3.3 18 19-36 16-33 (39)
35 PF07645 EGF_CA: Calcium-bindi 85.5 0.33 7.1E-06 31.2 0.6 20 18-37 16-35 (42)
36 PF06247 Plasmod_Pvs28: Plasmo 84.2 0.59 1.3E-05 40.2 1.6 82 22-104 65-163 (197)
37 KOG1836|consensus 83.4 1.9 4.2E-05 48.4 5.6 48 149-200 757-813 (1705)
38 cd00053 EGF Epidermal growth f 83.1 1.4 3.1E-05 25.9 2.7 18 53-70 11-31 (36)
39 KOG1218|consensus 80.8 13 0.00029 33.5 9.4 12 91-102 161-172 (316)
40 PF01414 DSL: Delta serrate li 78.9 0.82 1.8E-05 32.3 0.6 43 25-70 15-60 (63)
41 PHA02887 EGF-like protein; Pro 78.6 1.7 3.7E-05 34.6 2.3 27 173-201 93-124 (126)
42 cd00054 EGF_CA Calcium-binding 77.8 2.4 5.1E-05 25.3 2.4 18 53-70 14-34 (38)
43 smart00181 EGF Epidermal growt 77.4 1.9 4.1E-05 25.8 1.9 11 60-70 20-30 (35)
44 PHA03099 epidermal growth fact 74.8 2.2 4.8E-05 34.5 2.1 28 173-202 52-84 (139)
45 PF01683 EB: EB module; Inter 63.2 17 0.00037 24.0 4.3 22 176-198 31-52 (52)
46 PF09064 Tme5_EGF_like: Thromb 61.5 7.6 0.00017 24.0 2.0 16 25-40 16-31 (34)
47 PTZ00214 high cysteine membran 59.4 51 0.0011 34.8 8.9 20 182-201 750-769 (800)
48 PF03302 VSP: Giardia variant- 53.5 69 0.0015 30.8 8.2 18 63-80 41-58 (397)
49 PHA02887 EGF-like protein; Pro 52.7 12 0.00025 29.9 2.3 24 51-77 94-122 (126)
50 PF02363 C_tripleX: Cysteine r 39.2 14 0.0003 18.9 0.6 12 42-53 2-13 (17)
51 KOG1836|consensus 38.7 49 0.0011 37.8 5.1 16 29-44 697-712 (1705)
52 PHA03099 epidermal growth fact 35.6 33 0.00071 27.9 2.4 24 51-77 53-81 (139)
53 PF12946 EGF_MSP1_1: MSP1 EGF 35.6 16 0.00036 23.0 0.6 24 176-199 10-37 (37)
54 PF01826 TIL: Trypsin Inhibito 32.6 32 0.00068 22.9 1.7 16 184-199 34-50 (55)
55 smart00180 EGF_Lam Laminin-typ 30.0 39 0.00083 21.9 1.7 17 182-200 17-33 (46)
56 PF00053 Laminin_EGF: Laminin 29.8 23 0.00051 23.0 0.6 21 178-200 12-33 (49)
57 PF00954 S_locus_glycop: S-loc 29.4 39 0.00085 25.9 1.9 22 49-70 84-108 (110)
58 cd00055 EGF_Lam Laminin-type e 27.4 50 0.0011 21.7 1.9 14 25-38 17-30 (50)
59 KOG1388|consensus 26.0 67 0.0014 28.3 2.9 47 153-201 80-128 (217)
60 KOG0196|consensus 21.6 96 0.0021 32.9 3.5 44 60-103 259-319 (996)
No 1
>KOG1225|consensus
Probab=99.19 E-value=9.7e-11 Score=114.02 Aligned_cols=126 Identities=27% Similarity=0.601 Sum_probs=84.3
Q ss_pred CceeeeCCCCeeCCCCCCCCCCCCCCC-CCEEcCCCeEEcCCCCeeCCCCCcc-cCC-CCCC-CceecCCCceeeCCCcc
Q psy8491 26 RQAVICRFDLKFVNASYCAPVCSSPCI-NGACVFPEQCHCSPGFQFVNETYCE-PYC-ENCQ-HGTCTAPSVCECESGFV 101 (273)
Q Consensus 26 ~~~C~C~~Gy~~~~~~~C~~~C~~~C~-~g~C~~~~~C~C~~Gy~g~~~~~C~-~~C-~~C~-~G~C~~~~~C~C~~Gy~ 101 (273)
.++|.|..+|.+.+.. ...|++.|. +|.|+. +.|+|++||+| ..|. ..| ..|. |+.++. ++|+|++||+
T Consensus 233 ~~ic~c~~~~~g~~c~--~~~C~~~c~~~g~c~~-G~CIC~~Gf~G---~dC~e~~Cp~~cs~~g~~~~-g~CiC~~g~~ 305 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCS--TIYCPGGCTGRGQCVE-GRCICPPGFTG---DDCDELVCPVDCSGGGVCVD-GECICNPGYS 305 (525)
T ss_pred CceeecCCceeCCccc--cccCCCCCcccceEeC-CeEeCCCCCcC---CCCCcccCCcccCCCceecC-CEeecCCCcc
Confidence 3489999999987654 356777777 788886 89999999999 4554 345 3365 455555 5999999999
Q ss_pred cCCCCCcc-ccccCCCCCCCCCCeecCCCCceeeCCCCCcCCCCCCCCcccccCCCCeeeCCCCCCCCCCCCCCCC-Cee
Q psy8491 102 HNNETNAC-ERICEEPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSNPYCIIQCDVGYTLNPRTLYCEPECFNCTR-GYC 179 (273)
Q Consensus 102 g~~~~~~C-~~~C~~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~ng~C~~~C~~Gy~g~~c~~~C~~~C~~C~n-G~C 179 (273)
|. .| ++.|...|.. +|.|+ .++ |. |.+||+|..|..+ + |.+ |.|
T Consensus 306 G~----dCs~~~cpadC~g--~G~Ci-~G~-C~--------------------C~~Gy~G~~C~~~--~----C~~~g~c 351 (525)
T KOG1225|consen 306 GK----DCSIRRCPADCSG--HGKCI-DGE-CL--------------------CDEGYTGELCIQR--A----CSGGGQC 351 (525)
T ss_pred cc----ccccccCCccCCC--CCccc-CCc-eE--------------------eCCCCcCCccccc--c----cCCCcee
Confidence 95 45 2456667766 78877 344 54 5556666655533 1 334 333
Q ss_pred cCCCeeECCCCceeC
Q psy8491 180 LEPNECSCPYNFHLE 194 (273)
Q Consensus 180 ~~~~~C~C~~Gy~G~ 194 (273)
+ .+ |+|..||.|.
T Consensus 352 v-~g-C~C~~Gw~G~ 364 (525)
T KOG1225|consen 352 V-NG-CKCKKGWRGP 364 (525)
T ss_pred c-cC-ceeccCccCC
Confidence 3 34 6677776663
No 2
>KOG1225|consensus
Probab=98.76 E-value=2.4e-08 Score=97.42 Aligned_cols=72 Identities=29% Similarity=0.781 Sum_probs=58.7
Q ss_pred CCceeeeCCCCeeCCCCCCCCCCCCCCC-CCEEcCCCeEEcCCCCeeCCCCCcc-cCC-CCCC-CceecCCCceeeCCCc
Q psy8491 25 LRQAVICRFDLKFVNASYCAPVCSSPCI-NGACVFPEQCHCSPGFQFVNETYCE-PYC-ENCQ-HGTCTAPSVCECESGF 100 (273)
Q Consensus 25 ~~~~C~C~~Gy~~~~~~~C~~~C~~~C~-~g~C~~~~~C~C~~Gy~g~~~~~C~-~~C-~~C~-~G~C~~~~~C~C~~Gy 100 (273)
..+.|+|++||.|.+.+. ..|+.+|+ ++.+++ ++|+|.+||+| ..|. ..| ..|. ||.|+ .++|.|.+||
T Consensus 263 ~~G~CIC~~Gf~G~dC~e--~~Cp~~cs~~g~~~~-g~CiC~~g~~G---~dCs~~~cpadC~g~G~Ci-~G~C~C~~Gy 335 (525)
T KOG1225|consen 263 VEGRCICPPGFTGDDCDE--LVCPVDCSGGGVCVD-GECICNPGYSG---KDCSIRRCPADCSGHGKCI-DGECLCDEGY 335 (525)
T ss_pred eCCeEeCCCCCcCCCCCc--ccCCcccCCCceecC-CEeecCCCccc---cccccccCCccCCCCCccc-CCceEeCCCC
Confidence 467899999999977555 57777787 666766 69999999999 6665 445 5676 79999 7999999999
Q ss_pred ccC
Q psy8491 101 VHN 103 (273)
Q Consensus 101 ~g~ 103 (273)
+|.
T Consensus 336 ~G~ 338 (525)
T KOG1225|consen 336 TGE 338 (525)
T ss_pred cCC
Confidence 995
No 3
>KOG1214|consensus
Probab=98.05 E-value=1.3e-05 Score=80.93 Aligned_cols=135 Identities=25% Similarity=0.594 Sum_probs=83.5
Q ss_pred CCEEcC----CCeEEcCCCCeeCCCCCcc--cCC----CCCC-CceecCC---CceeeCCCcccCCCCCccccccC----
Q psy8491 53 NGACVF----PEQCHCSPGFQFVNETYCE--PYC----ENCQ-HGTCTAP---SVCECESGFVHNNETNACERICE---- 114 (273)
Q Consensus 53 ~g~C~~----~~~C~C~~Gy~g~~~~~C~--~~C----~~C~-~G~C~~~---~~C~C~~Gy~g~~~~~~C~~~C~---- 114 (273)
++.|.. .++|.|..||.|+ ...|. .+| +.|. |..|++. ..|.|..||.-.+....|.+.=.
T Consensus 705 ~a~C~pg~~~~~tcecs~g~~gd-gr~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~ 783 (1289)
T KOG1214|consen 705 TARCHPGTGVDYTCECSSGYQGD-GRNCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPA 783 (1289)
T ss_pred CccccCCCCcceEEEEeeccCCC-CCCCCChhhhccCCCCCCCCceeecCCCceeEEEeecceeccCCcceEEecCCCCC
Confidence 445543 3699999999993 35565 233 5565 7889863 48999999977655566743211
Q ss_pred CCCCCCCC-CeecCCCCceeeCCCCCcCCCCCCCCcccccCCCCeeeCCCCCCCC--CCCC--CC-CCCeec---CCCee
Q psy8491 115 EPKPPLYP-GLGTIPNTERTCLDNKCHCSEGDSNPYCIIQCDVGYTLNPRTLYCE--PECF--NC-TRGYCL---EPNEC 185 (273)
Q Consensus 115 ~~C~~~~~-g~C~~~~~~C~C~~G~C~C~~~C~ng~C~~~C~~Gy~g~~c~~~C~--~~C~--~C-~nG~C~---~~~~C 185 (273)
++|.. + ..|..... +.|... .=..-.| .|.|||.|+. ..|. ++|+ .| .++.|. +++.|
T Consensus 784 n~Ce~--g~h~C~i~g~-a~c~~h------Ggs~y~C--~CLPGfsGDG--~~c~dvDeC~psrChp~A~CyntpgsfsC 850 (1289)
T KOG1214|consen 784 NPCED--GSHTCAIAGQ-ARCVHH------GGSTYSC--ACLPGFSGDG--HQCTDVDECSPSRCHPAATCYNTPGSFSC 850 (1289)
T ss_pred Ccccc--CccccCcCCc-eEEEec------CCceEEE--eecCCccCCc--cccccccccCccccCCCceEecCCCccee
Confidence 23433 1 12322222 332210 0000145 8999999983 4443 4665 56 367886 35899
Q ss_pred ECCCCceeCCCCcccc
Q psy8491 186 SCPYNFHLEGGMCMIA 201 (273)
Q Consensus 186 ~C~~Gy~G~~~~C~~~ 201 (273)
.|.+||.|+|.+|.+.
T Consensus 851 ~C~pGy~GDGf~CVP~ 866 (1289)
T KOG1214|consen 851 RCQPGYYGDGFQCVPD 866 (1289)
T ss_pred ecccCccCCCceecCC
Confidence 9999999999999886
No 4
>KOG1214|consensus
Probab=97.67 E-value=5.9e-05 Score=76.29 Aligned_cols=91 Identities=26% Similarity=0.667 Sum_probs=66.9
Q ss_pred cccccCCCCceeeeCCCCeeCC-CCCCCCC--------CC---CCCC-CC--EEcC----CCeEEcCCCCeeCCCCCcc-
Q psy8491 18 STSMSQDLRQAVICRFDLKFVN-ASYCAPV--------CS---SPCI-NG--ACVF----PEQCHCSPGFQFVNETYCE- 77 (273)
Q Consensus 18 ~~~~~~~~~~~C~C~~Gy~~~~-~~~C~~~--------C~---~~C~-~g--~C~~----~~~C~C~~Gy~g~~~~~C~- 77 (273)
.++++-++++.|+|..||.+.+ +..|+++ |+ ..|. +| +|+. .|.|+|.|||.|+. ..|.
T Consensus 748 s~Cin~pg~~rceC~~gy~F~dd~~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG-~~c~d 826 (1289)
T KOG1214|consen 748 SVCINLPGSYRCECRSGYEFADDRHTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG-HQCTD 826 (1289)
T ss_pred ceeecCCCceeEEEeecceeccCCcceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCc-ccccc
Confidence 5777889999999999999865 5888644 22 2455 44 4442 58999999999943 5555
Q ss_pred -cCC--CCCC-CceecCC---CceeeCCCcccCCCCCcccc
Q psy8491 78 -PYC--ENCQ-HGTCTAP---SVCECESGFVHNNETNACER 111 (273)
Q Consensus 78 -~~C--~~C~-~G~C~~~---~~C~C~~Gy~g~~~~~~C~~ 111 (273)
++| +.|. ++.|.+. ..|.|.+||.|++. .|.|
T Consensus 827 vDeC~psrChp~A~CyntpgsfsC~C~pGy~GDGf--~CVP 865 (1289)
T KOG1214|consen 827 VDECSPSRCHPAATCYNTPGSFSCRCQPGYYGDGF--QCVP 865 (1289)
T ss_pred ccccCccccCCCceEecCCCcceeecccCccCCCc--eecC
Confidence 567 6776 6899753 48999999999753 6654
No 5
>KOG1226|consensus
Probab=97.63 E-value=0.00028 Score=71.16 Aligned_cols=118 Identities=22% Similarity=0.490 Sum_probs=71.4
Q ss_pred CCceeeeCCCCeeCCCCCC----------CCCCC-----CCCC-CCEEcCCCeEEcCCCCee-CCCCCcc-c--CC----
Q psy8491 25 LRQAVICRFDLKFVNASYC----------APVCS-----SPCI-NGACVFPEQCHCSPGFQF-VNETYCE-P--YC---- 80 (273)
Q Consensus 25 ~~~~C~C~~Gy~~~~~~~C----------~~~C~-----~~C~-~g~C~~~~~C~C~~Gy~g-~~~~~C~-~--~C---- 80 (273)
..+.|.|.+||.|.-. +| ...|+ ++|. +|.|+= |+|+|.+...+ ..+..|+ + .|
T Consensus 476 ~CG~C~C~~G~~G~~C-EC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-GqC~C~~~~~~~i~G~fCECDnfsC~r~~ 553 (783)
T KOG1226|consen 476 VCGQCRCDEGWLGKKC-ECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-GQCVCHKPDNGKIYGKFCECDNFSCERHK 553 (783)
T ss_pred EecceecCCCCCCCcc-cCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-CceEecCCCCCceeeeeeeccCccccccc
Confidence 4678999999997421 11 01222 3576 788875 89999888873 2225555 1 12
Q ss_pred -CCCC-CceecCCCceeeCCCcccCCCCCccccccCCCCCCCCCCeecCCCCceeeCCCCCcCCCCCCCCcccccCCCC-
Q psy8491 81 -ENCQ-HGTCTAPSVCECESGFVHNNETNACERICEEPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSNPYCIIQCDVG- 157 (273)
Q Consensus 81 -~~C~-~G~C~~~~~C~C~~Gy~g~~~~~~C~~~C~~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~ng~C~~~C~~G- 157 (273)
..|. ||+|.- +.|+|.+||+|+.+ .|. .=...|...++..|...+. |.|. . | .|...
T Consensus 554 g~lC~g~G~C~C-G~CvC~~GwtG~~C--~C~-~std~C~~~~G~iCSGrG~-C~Cg--~-----------C--~C~~~~ 613 (783)
T KOG1226|consen 554 GVLCGGHGRCEC-GRCVCNPGWTGSAC--NCP-LSTDTCESSDGQICSGRGT-CECG--R-----------C--KCTDPP 613 (783)
T ss_pred CcccCCCCeEeC-CcEEcCCCCccCCC--CCC-CCCccccCCCCceeCCCce-eeCC--c-----------e--EcCCCC
Confidence 2354 688744 78999999999644 221 1112455555666666655 5544 2 2 45444
Q ss_pred eeeCCCC
Q psy8491 158 YTLNPRT 164 (273)
Q Consensus 158 y~g~~c~ 164 (273)
|+|..|+
T Consensus 614 ~sG~~CE 620 (783)
T KOG1226|consen 614 YSGEFCE 620 (783)
T ss_pred cCcchhh
Confidence 8998776
No 6
>KOG4289|consensus
Probab=97.54 E-value=6.9e-05 Score=79.25 Aligned_cols=75 Identities=31% Similarity=0.617 Sum_probs=54.5
Q ss_pred cCCCCceeeeCCCCeeCCCCCCCCCC-CCCCC-CCEEcC---CCeEEcCCCCeeCCCCCcc-----cCC--CCCCC-cee
Q psy8491 22 SQDLRQAVICRFDLKFVNASYCAPVC-SSPCI-NGACVF---PEQCHCSPGFQFVNETYCE-----PYC--ENCQH-GTC 88 (273)
Q Consensus 22 ~~~~~~~C~C~~Gy~~~~~~~C~~~C-~~~C~-~g~C~~---~~~C~C~~Gy~g~~~~~C~-----~~C--~~C~~-G~C 88 (273)
.+.....|.|+|||.|+..+.=...| +.+|+ ||.|.+ .|+|+|.+||+| ..|+ ..| ..|.| |+|
T Consensus 1217 ~pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tG---ehCEvs~~agrCvpGvC~nggtC 1293 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTG---EHCEVSARAGRCVPGVCKNGGTC 1293 (2531)
T ss_pred cccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccc---cceeeecccCccccceecCCCEE
Confidence 45578899999999986433322344 35788 899986 689999999999 7777 234 45665 677
Q ss_pred cCC----CceeeCCC
Q psy8491 89 TAP----SVCECESG 99 (273)
Q Consensus 89 ~~~----~~C~C~~G 99 (273)
++. ..|+|+.|
T Consensus 1294 ~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1294 VNLLNGGFCCHCPYG 1308 (2531)
T ss_pred eecCCCceeccCCCc
Confidence 642 37889988
No 7
>KOG4260|consensus
Probab=97.53 E-value=7.1e-05 Score=67.17 Aligned_cols=148 Identities=22% Similarity=0.542 Sum_probs=82.7
Q ss_pred eeCCCCeeCCCCCCCCCCCCCCC-CCEEcC------CCeEEcCCCCeeCCCCCcc------------cCCCCCC---Cce
Q psy8491 30 ICRFDLKFVNASYCAPVCSSPCI-NGACVF------PEQCHCSPGFQFVNETYCE------------PYCENCQ---HGT 87 (273)
Q Consensus 30 ~C~~Gy~~~~~~~C~~~C~~~C~-~g~C~~------~~~C~C~~Gy~g~~~~~C~------------~~C~~C~---~G~ 87 (273)
-|++|-+|++...|...-+.+|. +|.|.. ++.|.|.+||+|..-.+|. .+|..|. -|+
T Consensus 131 CCp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~ 210 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCLGV 210 (350)
T ss_pred ccCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhhcc
Confidence 48999999888887655556787 888874 5799999999994334443 1343343 135
Q ss_pred ecCC--Ccee-eCCCcccCCCCCccccccCCCCCCCCCCeecCCCCceeeCCCCCcCCCCCCC--C--cccccCCCCeee
Q psy8491 88 CTAP--SVCE-CESGFVHNNETNACERICEEPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSN--P--YCIIQCDVGYTL 160 (273)
Q Consensus 88 C~~~--~~C~-C~~Gy~g~~~~~~C~~~C~~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~n--g--~C~~~C~~Gy~g 160 (273)
|... ..|. |..||..+. ..| +..+. |.=.+..|.-.+.|.| | .| ..++||.+
T Consensus 211 Csg~~~k~C~kCkkGW~lde--~gC----------------vDvnE-C~~ep~~c~~~qfCvNteGSf~C--~dk~Gy~~ 269 (350)
T KOG4260|consen 211 CSGESSKGCSKCKKGWKLDE--EGC----------------VDVNE-CQNEPAPCKAHQFCVNTEGSFKC--EDKEGYKK 269 (350)
T ss_pred cCCCCCCChhhhcccceecc--ccc----------------ccHHH-HhcCCCCCChhheeecCCCceEe--cccccccC
Confidence 6543 2564 999999852 223 22211 1100000100122333 2 34 56788887
Q ss_pred CCCC-CCCCCCCC-CCCCCeec---CCCeeECCCCceeCCCCccc
Q psy8491 161 NPRT-LYCEPECF-NCTRGYCL---EPNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 161 ~~c~-~~C~~~C~-~C~nG~C~---~~~~C~C~~Gy~G~~~~C~~ 200 (273)
...+ ..|.+.|. . |+.|. +.++|+|..|+.-....|..
T Consensus 270 g~d~C~~~~d~~~~k--n~~c~ni~~~~r~v~f~~~~~~~g~cV~ 312 (350)
T KOG4260|consen 270 GVDECQFCADVCASK--NRPCMNIDGQYRCVCFSGLIIIEGFCVW 312 (350)
T ss_pred ChHHhhhhhhhcccC--CCCcccCCccEEEEecccceeeeeeeec
Confidence 4211 12223333 1 45554 35789999988764455554
No 8
>KOG1226|consensus
Probab=97.44 E-value=0.00075 Score=68.14 Aligned_cols=47 Identities=17% Similarity=0.412 Sum_probs=29.0
Q ss_pred cCCCCeeeCCCCCC-CCCCCCCCCC-CeecCC-----CeeECCCC-ceeCCCCccccC
Q psy8491 153 QCDVGYTLNPRTLY-CEPECFNCTR-GYCLEP-----NECSCPYN-FHLEGGMCMIAE 202 (273)
Q Consensus 153 ~C~~Gy~g~~c~~~-C~~~C~~C~n-G~C~~~-----~~C~C~~G-y~G~~~~C~~~~ 202 (273)
.|.+||+|..|+-. -.+.|.. .+ +.|.+- ++|+|... |.| ..|++-+
T Consensus 569 vC~~GwtG~~C~C~~std~C~~-~~G~iCSGrG~C~Cg~C~C~~~~~sG--~~CE~cp 623 (783)
T KOG1226|consen 569 VCNPGWTGSACNCPLSTDTCES-SDGQICSGRGTCECGRCKCTDPPYSG--EFCEKCP 623 (783)
T ss_pred EcCCCCccCCCCCCCCCccccC-CCCceeCCCceeeCCceEcCCCCcCc--chhhcCC
Confidence 89999999988621 1122320 02 255543 56777665 899 8888753
No 9
>KOG1219|consensus
Probab=97.36 E-value=0.00028 Score=77.68 Aligned_cols=94 Identities=30% Similarity=0.611 Sum_probs=67.9
Q ss_pred CCCCC-ceecCC----CceeeCCCcccCCCCCccc---cccC-CCCCCCCCCeecCCCCceeeCCCCCcCCCCCCCCccc
Q psy8491 81 ENCQH-GTCTAP----SVCECESGFVHNNETNACE---RICE-EPKPPLYPGLGTIPNTERTCLDNKCHCSEGDSNPYCI 151 (273)
Q Consensus 81 ~~C~~-G~C~~~----~~C~C~~Gy~g~~~~~~C~---~~C~-~~C~~~~~g~C~~~~~~C~C~~G~C~C~~~C~ng~C~ 151 (273)
.+|+| |.|+.. ++|.|++-|.|. .|+ ..|. .||.. ||+|+..+. ...|
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG~----~CEi~~epC~snPC~~--GgtCip~~n----------------~f~C- 3926 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGN----HCEIDLEPCASNPCLT--GGTCIPFYN----------------GFLC- 3926 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccCc----ccccccccccCCCCCC--CCEEEecCC----------------CeeE-
Confidence 57888 789642 489999999995 553 2343 47876 888876543 0134
Q ss_pred ccCCCCeeeCCCCCCCCCCCC--CCCC-CeecC---CCeeECCCCceeCCCCccc
Q psy8491 152 IQCDVGYTLNPRTLYCEPECF--NCTR-GYCLE---PNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 152 ~~C~~Gy~g~~c~~~C~~~C~--~C~n-G~C~~---~~~C~C~~Gy~G~~~~C~~ 200 (273)
.|+.||+|..|+..=+.+|. .|.+ |.|+. ++.|.|.+||.| ..|..
T Consensus 3927 -nC~~gyTG~~Ce~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~g--r~c~~ 3978 (4289)
T KOG1219|consen 3927 -NCPNGYTGKRCEARGISECSKNVCGTGGQCINIPGSFHCNCTPGILG--RTCCA 3978 (4289)
T ss_pred -eCCCCccCceeecccccccccccccCCceeeccCCceEeccChhHhc--ccCcc
Confidence 78899999988855455676 7866 79974 479999999999 76644
No 10
>KOG1217|consensus
Probab=97.16 E-value=0.0043 Score=58.96 Aligned_cols=156 Identities=26% Similarity=0.585 Sum_probs=90.1
Q ss_pred cCCCCceeeeCCCCeeCCCCC---CCCCCCCCCCCCEEcC------CCeEEcCCCCeeCCCCCcc---cCC----CCCCC
Q psy8491 22 SQDLRQAVICRFDLKFVNASY---CAPVCSSPCINGACVF------PEQCHCSPGFQFVNETYCE---PYC----ENCQH 85 (273)
Q Consensus 22 ~~~~~~~C~C~~Gy~~~~~~~---C~~~C~~~C~~g~C~~------~~~C~C~~Gy~g~~~~~C~---~~C----~~C~~ 85 (273)
.....+.|.|++||.+..... |...-...+.++.|.. .+.|.|.+||.+ ..+. +.| ..|.+
T Consensus 105 ~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~---~~~~~~~~~C~~~~~~c~~ 181 (487)
T KOG1217|consen 105 DCVGSYECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEG---EPCETDLDECIQYSSPCQN 181 (487)
T ss_pred CCCCCceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCccc---ccccccccccccCCCCcCC
Confidence 456788999999999865333 4332110123556653 468999999999 3333 234 24665
Q ss_pred -ceecCC---CceeeCCCcccCCCCCccccccCCCCCCCCCCeecCCCCceeeCCC----CC-----cCC---CCCCC--
Q psy8491 86 -GTCTAP---SVCECESGFVHNNETNACERICEEPKPPLYPGLGTIPNTERTCLDN----KC-----HCS---EGDSN-- 147 (273)
Q Consensus 86 -G~C~~~---~~C~C~~Gy~g~~~~~~C~~~C~~~C~~~~~g~C~~~~~~C~C~~G----~C-----~C~---~~C~n-- 147 (273)
+.|.+. ..|.|.+||.+...... .+++.|+.... |.+.+| .| .+. ..|.+
T Consensus 182 ~~~C~~~~~~~~C~c~~~~~~~~~~~~-----------~~~~~c~~~~~-~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~ 249 (487)
T KOG1217|consen 182 GGTCVNTGGSYLCSCPPGYTGSTCETT-----------GNGGTCVDSVA-CSCPPGARGPECEVSIVECASGDGTCVNTV 249 (487)
T ss_pred CcccccCCCCeeEeCCCCccCCcCcCC-----------CCCceEeccee-ccCCCCCCCCCcccccccccCCCCcccccC
Confidence 678664 37999999998532111 11344443322 444433 22 111 22332
Q ss_pred --CcccccCCCCeeeCCCCCCC--CCCCC---CCCC-CeecCC---CeeECCCCceeCC
Q psy8491 148 --PYCIIQCDVGYTLNPRTLYC--EPECF---NCTR-GYCLEP---NECSCPYNFHLEG 195 (273)
Q Consensus 148 --g~C~~~C~~Gy~g~~c~~~C--~~~C~---~C~n-G~C~~~---~~C~C~~Gy~G~~ 195 (273)
..| .|++||++..+ ..+ ..+|. .|.+ ++|+.. +.|.|++||.|..
T Consensus 250 ~~~~C--~~~~g~~~~~~-~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~ 305 (487)
T KOG1217|consen 250 GSYTC--RCPEGYTGDAC-VTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRL 305 (487)
T ss_pred Cceee--eCCCCcccccc-ceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCC
Confidence 145 78999999862 011 12333 2544 788754 8999999999943
No 11
>KOG4289|consensus
Probab=96.94 E-value=0.00081 Score=71.60 Aligned_cols=73 Identities=27% Similarity=0.585 Sum_probs=51.2
Q ss_pred CeEEcCCCCeeCCCCCcc---cCC--CCCC-CceecC---CCceeeCCCcccCCCCCcc-ccccCC-CCCCCCCCeecCC
Q psy8491 60 EQCHCSPGFQFVNETYCE---PYC--ENCQ-HGTCTA---PSVCECESGFVHNNETNAC-ERICEE-PKPPLYPGLGTIP 128 (273)
Q Consensus 60 ~~C~C~~Gy~g~~~~~C~---~~C--~~C~-~G~C~~---~~~C~C~~Gy~g~~~~~~C-~~~C~~-~C~~~~~g~C~~~ 128 (273)
.+|.|+|||+| ..|+ +.| .+|. ||+|.+ .++|.|.+||+|..+...- .-.|.+ .|.+ +|+|+..
T Consensus 1222 lrCrCPpGFTg---d~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~n--ggtC~~~ 1296 (2531)
T KOG4289|consen 1222 LRCRCPPGFTG---DYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKN--GGTCVNL 1296 (2531)
T ss_pred eeEeCCCCCCc---ccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecC--CCEEeec
Confidence 58999999999 7898 567 6887 599964 4699999999995321110 023433 4666 7888653
Q ss_pred C---CceeeCCC
Q psy8491 129 N---TERTCLDN 137 (273)
Q Consensus 129 ~---~~C~C~~G 137 (273)
. ..|.|++|
T Consensus 1297 ~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1297 LNGGFCCHCPYG 1308 (2531)
T ss_pred CCCceeccCCCc
Confidence 2 23899998
No 12
>KOG1219|consensus
Probab=96.70 E-value=0.0023 Score=70.91 Aligned_cols=80 Identities=26% Similarity=0.676 Sum_probs=57.8
Q ss_pred CCCC-CCEEcC----CCeEEcCCCCeeCCCCCcc---cCC--CCCCC-ceecC---CCceeeCCCcccCCCCCccc----
Q psy8491 49 SPCI-NGACVF----PEQCHCSPGFQFVNETYCE---PYC--ENCQH-GTCTA---PSVCECESGFVHNNETNACE---- 110 (273)
Q Consensus 49 ~~C~-~g~C~~----~~~C~C~~Gy~g~~~~~C~---~~C--~~C~~-G~C~~---~~~C~C~~Gy~g~~~~~~C~---- 110 (273)
.+|. .|.|.. .|.|.|++-|.| ..|+ ..| .+|.+ |+|+- .+.|.|+.||+|. .|+
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG---~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~----~Ce~~Gi 3942 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSG---NHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK----RCEARGI 3942 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccC---cccccccccccCCCCCCCCEEEecCCCeeEeCCCCccCc----eeecccc
Confidence 4677 678986 479999999999 8888 346 67886 78963 4589999999995 453
Q ss_pred cccC-CCCCCCCCCeecCCCC--ceeeCCC
Q psy8491 111 RICE-EPKPPLYPGLGTIPNT--ERTCLDN 137 (273)
Q Consensus 111 ~~C~-~~C~~~~~g~C~~~~~--~C~C~~G 137 (273)
..|+ +.|.+ +|.|++..+ .|.|.+|
T Consensus 3943 ~eCs~n~C~~--gg~C~n~~gsf~CncT~g 3970 (4289)
T KOG1219|consen 3943 SECSKNVCGT--GGQCINIPGSFHCNCTPG 3970 (4289)
T ss_pred cccccccccC--CceeeccCCceEeccChh
Confidence 3366 36777 788876543 3655555
No 13
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.16 E-value=0.0073 Score=37.01 Aligned_cols=20 Identities=40% Similarity=1.114 Sum_probs=13.5
Q ss_pred CC-CCEEcCC-CeEEcCCCCee
Q psy8491 51 CI-NGACVFP-EQCHCSPGFQF 70 (273)
Q Consensus 51 C~-~g~C~~~-~~C~C~~Gy~g 70 (273)
|. +|+|+.+ ++|+|.+||+|
T Consensus 8 C~~~G~C~~~~g~C~C~~g~~G 29 (32)
T PF07974_consen 8 CSGHGTCVSPCGRCVCDSGYTG 29 (32)
T ss_pred cCCCCEEeCCCCEEECCCCCcC
Confidence 54 6777754 67777777776
No 14
>KOG1217|consensus
Probab=96.14 E-value=0.062 Score=50.96 Aligned_cols=155 Identities=25% Similarity=0.543 Sum_probs=84.7
Q ss_pred ccccCCCCceeeeCCCCeeCCCCC------CC-------------CCCCC---CCC-C-CEEcC---CCeEEcCCCCeeC
Q psy8491 19 TSMSQDLRQAVICRFDLKFVNASY------CA-------------PVCSS---PCI-N-GACVF---PEQCHCSPGFQFV 71 (273)
Q Consensus 19 ~~~~~~~~~~C~C~~Gy~~~~~~~------C~-------------~~C~~---~C~-~-g~C~~---~~~C~C~~Gy~g~ 71 (273)
.+.+...++.|.|++||.+..... |. +.|.. .|. . +.|.+ .++|.|.+||.+.
T Consensus 184 ~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~ 263 (487)
T KOG1217|consen 184 TCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGD 263 (487)
T ss_pred ccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCcccc
Confidence 455666779999999998754221 21 11111 122 2 56665 3689999999982
Q ss_pred CCCCcc--cCC--C-CCCC-ceecCC---CceeeCCCcccCCCCCccc--cccCC-----CCCCCCCCeecCCCCceeeC
Q psy8491 72 NETYCE--PYC--E-NCQH-GTCTAP---SVCECESGFVHNNETNACE--RICEE-----PKPPLYPGLGTIPNTERTCL 135 (273)
Q Consensus 72 ~~~~C~--~~C--~-~C~~-G~C~~~---~~C~C~~Gy~g~~~~~~C~--~~C~~-----~C~~~~~g~C~~~~~~C~C~ 135 (273)
....+. ..| . .|.+ ++|+.. ..|.|++||.|..+ ..+. ..|.. .|.+ ++.|.....
T Consensus 264 ~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~-~~~~~~~~C~~~~~~~~c~~--g~~C~~~~~----- 335 (487)
T KOG1217|consen 264 ACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC-TECVDVDECSPRNAGGPCAN--GGTCNTLGS----- 335 (487)
T ss_pred ccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC-ccccccccccccccCCcCCC--CcccccCCC-----
Confidence 201122 334 2 2664 788754 58999999998643 2221 12210 1222 122210000
Q ss_pred CCCCcCCCCCCCCcccccCCCCeeeCCCCCCCCCCCC--CCC-CCeecC----CCeeECCCCcee
Q psy8491 136 DNKCHCSEGDSNPYCIIQCDVGYTLNPRTLYCEPECF--NCT-RGYCLE----PNECSCPYNFHL 193 (273)
Q Consensus 136 ~G~C~C~~~C~ng~C~~~C~~Gy~g~~c~~~C~~~C~--~C~-nG~C~~----~~~C~C~~Gy~G 193 (273)
.....| .|.+||.|..|+... ++|. ++. .+.|.. .+.|.|+.+|.+
T Consensus 336 ---------~~~~~C--~c~~~~~g~~C~~~~-~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 336 ---------FGGFRC--ACGPGFTGRRCEDSN-DECASSPCCPGGTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred ---------CCCCCc--CCCCCCCCCccccCC-ccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence 000124 777888887766331 2343 343 478875 478999999987
No 15
>KOG0994|consensus
Probab=96.02 E-value=0.021 Score=60.14 Aligned_cols=44 Identities=34% Similarity=0.826 Sum_probs=27.1
Q ss_pred cCCCCeeeCCCC---CCCCCCCCCCCC----C-----eecC-----CCeeECCCCceeCCCCccc
Q psy8491 153 QCDVGYTLNPRT---LYCEPECFNCTR----G-----YCLE-----PNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 153 ~C~~Gy~g~~c~---~~C~~~C~~C~n----G-----~C~~-----~~~C~C~~Gy~G~~~~C~~ 200 (273)
.|..||+|++-- ..|.| | +|.. | +|.- .-.|.|.+||+| .+|+.
T Consensus 889 rCl~GyyGdP~lg~g~~CrP-C-pCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY~G--~RCe~ 949 (1758)
T KOG0994|consen 889 RCLDGYYGDPRLGSGIGCRP-C-PCPDGPASGRQHADSCYLDTRTQQIVCHCQEGYSG--SRCEI 949 (1758)
T ss_pred hhhccccCCcccCCCCCCCC-C-CCCCCCccchhccccccccccccceeeecccCccc--cchhh
Confidence 899999998532 23333 1 1211 1 3421 237999999999 77765
No 16
>smart00051 DSL delta serrate ligand.
Probab=95.60 E-value=0.011 Score=41.87 Aligned_cols=43 Identities=21% Similarity=0.345 Sum_probs=32.8
Q ss_pred cCCCCeeeCCCCCCCCCCCCCCC-CCeecCCCeeECCCCceeCCCCc
Q psy8491 153 QCDVGYTLNPRTLYCEPECFNCT-RGYCLEPNECSCPYNFHLEGGMC 198 (273)
Q Consensus 153 ~C~~Gy~g~~c~~~C~~~C~~C~-nG~C~~~~~C~C~~Gy~G~~~~C 198 (273)
.|.++|+|..|...|.+.-. .. +.+|...+.++|.+||+| ..|
T Consensus 20 ~C~~~~yG~~C~~~C~~~~d-~~~~~~Cd~~G~~~C~~Gw~G--~~C 63 (63)
T smart00051 20 TCDENYYGEGCNKFCRPRDD-FFGHYTCDENGNKGCLEGWMG--PYC 63 (63)
T ss_pred eCCCCCcCCccCCEeCcCcc-ccCCccCCcCCCEecCCCCcC--CCC
Confidence 88999999988877765321 22 358888899999999999 544
No 17
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=94.95 E-value=0.01 Score=50.77 Aligned_cols=93 Identities=23% Similarity=0.673 Sum_probs=59.6
Q ss_pred ccccccc-cCCCCceeeeCCCCeeCCCCCCCCC--CC------CCCC-CCEEcC--------CCeEEcCCCCeeCCCCCc
Q psy8491 15 VIQSTSM-SQDLRQAVICRFDLKFVNASYCAPV--CS------SPCI-NGACVF--------PEQCHCSPGFQFVNETYC 76 (273)
Q Consensus 15 ~~~~~~~-~~~~~~~C~C~~Gy~~~~~~~C~~~--C~------~~C~-~g~C~~--------~~~C~C~~Gy~g~~~~~C 76 (273)
.||+.-+ ....-+.|.|.+||.....+.|.+. |. .+|+ .+.|+. .+.|.|.+||+. ....|
T Consensus 7 ~CKNG~LiQMSNHfEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~-~~~vC 85 (197)
T PF06247_consen 7 ICKNGYLIQMSNHFECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL-KQGVC 85 (197)
T ss_dssp --BTEEEEEESSEEEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE-SSSSE
T ss_pred cccCCEEEEccCceEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCcee-eCCeE
Confidence 3454433 4445689999999998888888642 32 2466 578864 368999999998 33678
Q ss_pred c-cCC--CCCCCceecC-C-----CceeeCCCcccCCCCCcc
Q psy8491 77 E-PYC--ENCQHGTCTA-P-----SVCECESGFVHNNETNAC 109 (273)
Q Consensus 77 ~-~~C--~~C~~G~C~~-~-----~~C~C~~Gy~g~~~~~~C 109 (273)
. ..| ..|..|.|+. + ..|+|.-|+.- +....|
T Consensus 86 vp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~-~dn~kC 126 (197)
T PF06247_consen 86 VPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKVP-DDNKKC 126 (197)
T ss_dssp EEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEET-TTTTES
T ss_pred chhhcCceecCCCeEEecCCCCCCceeEeeeceEe-ccCCcc
Confidence 8 457 5788899962 1 28999999983 344556
No 18
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=94.69 E-value=0.033 Score=34.07 Aligned_cols=21 Identities=29% Similarity=0.861 Sum_probs=18.2
Q ss_pred CCeecCC-CeeECCCCceeCCCCc
Q psy8491 176 RGYCLEP-NECSCPYNFHLEGGMC 198 (273)
Q Consensus 176 nG~C~~~-~~C~C~~Gy~G~~~~C 198 (273)
+|+|+.. ++|+|++||+| ..|
T Consensus 11 ~G~C~~~~g~C~C~~g~~G--~~C 32 (32)
T PF07974_consen 11 HGTCVSPCGRCVCDSGYTG--PDC 32 (32)
T ss_pred CCEEeCCCCEEECCCCCcC--CCC
Confidence 6999987 99999999999 544
No 19
>smart00051 DSL delta serrate ligand.
Probab=94.54 E-value=0.034 Score=39.43 Aligned_cols=42 Identities=17% Similarity=0.276 Sum_probs=24.3
Q ss_pred CceeeeCCCCeeCCCCCCCCCCCC--CCC-CCEEcCCCeEEcCCCCee
Q psy8491 26 RQAVICRFDLKFVNASYCAPVCSS--PCI-NGACVFPEQCHCSPGFQF 70 (273)
Q Consensus 26 ~~~C~C~~Gy~~~~~~~C~~~C~~--~C~-~g~C~~~~~C~C~~Gy~g 70 (273)
.+.-.|.++|+|.. |...|.+ +.. +..|...|.++|.+||+|
T Consensus 16 ~~rv~C~~~~yG~~---C~~~C~~~~d~~~~~~Cd~~G~~~C~~Gw~G 60 (63)
T smart00051 16 QIRVTCDENYYGEG---CNKFCRPRDDFFGHYTCDENGNKGCLEGWMG 60 (63)
T ss_pred EEEeeCCCCCcCCc---cCCEeCcCccccCCccCCcCCCEecCCCCcC
Confidence 34455777777643 3334432 233 566766677777777777
No 20
>KOG0994|consensus
Probab=93.80 E-value=0.13 Score=54.50 Aligned_cols=48 Identities=27% Similarity=0.662 Sum_probs=27.5
Q ss_pred CcccccCCCCeeeCCCCCCCC--------CCCC--CC-CCC----eec-CCCeeECCCCceeCCCCccc
Q psy8491 148 PYCIIQCDVGYTLNPRTLYCE--------PECF--NC-TRG----YCL-EPNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 148 g~C~~~C~~Gy~g~~c~~~C~--------~~C~--~C-~nG----~C~-~~~~C~C~~Gy~G~~~~C~~ 200 (273)
|.| .|+|||-|..|. .|+ ..|. .| ..| .|. ..+.|+|.+|-.| .+|..
T Consensus 1084 GQC--qCkpGfGGR~C~-qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr~tG~C~C~~Gv~G--~rCdq 1147 (1758)
T KOG0994|consen 1084 GQC--QCKPGFGGRTCS-QCQELYWGDPNEKCRACDCDPRGIETPQCDRATGRCVCRPGVGG--PRCDQ 1147 (1758)
T ss_pred cce--eccCCCCCcchh-HHHHhhcCCCCCCceecCCCCCCCCCCCccccCCceeecCCCCC--cchhh
Confidence 566 888888776553 222 1233 22 123 343 3478888888877 66644
No 21
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=93.67 E-value=0.033 Score=27.18 Aligned_cols=10 Identities=30% Similarity=0.896 Sum_probs=6.7
Q ss_pred eeECCCCcee
Q psy8491 184 ECSCPYNFHL 193 (273)
Q Consensus 184 ~C~C~~Gy~G 193 (273)
+|+|++||+|
T Consensus 1 ~C~C~~G~~G 10 (13)
T PF12661_consen 1 TCQCPPGWTG 10 (13)
T ss_dssp EEEE-TTEET
T ss_pred CccCcCCCcC
Confidence 4777888877
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=93.57 E-value=0.041 Score=34.57 Aligned_cols=23 Identities=22% Similarity=0.677 Sum_probs=17.7
Q ss_pred CCeec---CCCeeECCCCceeCCCCc
Q psy8491 176 RGYCL---EPNECSCPYNFHLEGGMC 198 (273)
Q Consensus 176 nG~C~---~~~~C~C~~Gy~G~~~~C 198 (273)
+++|. +.++|+|++||.|+|..|
T Consensus 11 nA~C~~~~~~~~C~C~~Gy~GdG~~C 36 (36)
T PF12947_consen 11 NATCTNTGGSYTCTCKPGYEGDGFFC 36 (36)
T ss_dssp TCEEEE-TTSEEEEE-CEEECCSTCE
T ss_pred CcEeecCCCCEEeECCCCCccCCcCC
Confidence 67775 358999999999988765
No 23
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=92.97 E-value=0.044 Score=33.36 Aligned_cols=21 Identities=29% Similarity=0.920 Sum_probs=17.0
Q ss_pred CCCC-CeecC----CCeeECCCCcee
Q psy8491 173 NCTR-GYCLE----PNECSCPYNFHL 193 (273)
Q Consensus 173 ~C~n-G~C~~----~~~C~C~~Gy~G 193 (273)
+|+| |+|+. .+.|+|++||+|
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 5766 79963 479999999999
No 24
>KOG4260|consensus
Probab=91.44 E-value=0.17 Score=45.88 Aligned_cols=46 Identities=22% Similarity=0.602 Sum_probs=30.4
Q ss_pred cCCCCeeeCCCCCCCC--CCCC----CCCC-Ceec---CCCeeECCCCceeCCCCccc
Q psy8491 153 QCDVGYTLNPRTLYCE--PECF----NCTR-GYCL---EPNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 153 ~C~~Gy~g~~c~~~C~--~~C~----~C~n-G~C~---~~~~C~C~~Gy~G~~~~C~~ 200 (273)
.|+.||..+ +..|. ++|. +|.. --|+ ++|+|...+||.+..+.|+.
T Consensus 221 kCkkGW~ld--e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d~C~~ 276 (350)
T KOG4260|consen 221 KCKKGWKLD--EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVDECQF 276 (350)
T ss_pred hhcccceec--ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChHHhhh
Confidence 889999987 33343 3443 3422 3444 46899999999985556765
No 25
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=90.89 E-value=0.086 Score=32.04 Aligned_cols=21 Identities=38% Similarity=1.086 Sum_probs=11.1
Q ss_pred CCC-CCEEcC----CCeEEcCCCCee
Q psy8491 50 PCI-NGACVF----PEQCHCSPGFQF 70 (273)
Q Consensus 50 ~C~-~g~C~~----~~~C~C~~Gy~g 70 (273)
+|. +|+|+. .++|+|.+||+|
T Consensus 5 ~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 5 PCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp SSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred cCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 344 455543 245666666655
No 26
>KOG1218|consensus
Probab=90.82 E-value=11 Score=33.94 Aligned_cols=54 Identities=15% Similarity=0.040 Sum_probs=29.2
Q ss_pred cCCCCceeeeCCCCeeC-CCCCCCCCCCCCCCCCE-EcCCCeEEcCCCCeeCCCCCcccC
Q psy8491 22 SQDLRQAVICRFDLKFV-NASYCAPVCSSPCINGA-CVFPEQCHCSPGFQFVNETYCEPY 79 (273)
Q Consensus 22 ~~~~~~~C~C~~Gy~~~-~~~~C~~~C~~~C~~g~-C~~~~~C~C~~Gy~g~~~~~C~~~ 79 (273)
....+..|.|.++|.+. .... .+........-. -....+|.+..+|.+ ..|.+.
T Consensus 10 ~~~~~~~c~c~~~~~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~c~~~ 65 (316)
T KOG1218|consen 10 CLGGSGQCFCDPGYTGRLQCEH-QAVTSACSGICPCEVNSGECGLGYGFVG---SVCRIE 65 (316)
T ss_pred cCCCCCceecCCCccccccccC-CCCCccccccCCccCCceeEecccccCC---Cccccc
Confidence 34567889999999984 1111 111111111111 123567888899988 445533
No 27
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=90.36 E-value=0.12 Score=32.48 Aligned_cols=20 Identities=35% Similarity=0.981 Sum_probs=11.4
Q ss_pred CC-CCEEcC---CCeEEcCCCCee
Q psy8491 51 CI-NGACVF---PEQCHCSPGFQF 70 (273)
Q Consensus 51 C~-~g~C~~---~~~C~C~~Gy~g 70 (273)
|. ++.|.+ .++|+|++||+|
T Consensus 8 C~~nA~C~~~~~~~~C~C~~Gy~G 31 (36)
T PF12947_consen 8 CHPNATCTNTGGSYTCTCKPGYEG 31 (36)
T ss_dssp S-TTCEEEE-TTSEEEEE-CEEEC
T ss_pred CCCCcEeecCCCCEEeECCCCCcc
Confidence 44 666654 456777777776
No 28
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=90.05 E-value=0.23 Score=28.36 Aligned_cols=10 Identities=50% Similarity=1.570 Sum_probs=4.4
Q ss_pred eEEcCCCCee
Q psy8491 61 QCHCSPGFQF 70 (273)
Q Consensus 61 ~C~C~~Gy~g 70 (273)
+|+|++||+.
T Consensus 3 ~C~C~~Gy~l 12 (24)
T PF12662_consen 3 TCSCPPGYQL 12 (24)
T ss_pred EeeCCCCCcC
Confidence 4444444443
No 29
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=89.62 E-value=0.23 Score=31.22 Aligned_cols=25 Identities=8% Similarity=0.020 Sum_probs=19.6
Q ss_pred ccccccccCCCCceeeeCCCCeeCC
Q psy8491 15 VIQSTSMSQDLRQAVICRFDLKFVN 39 (273)
Q Consensus 15 ~~~~~~~~~~~~~~C~C~~Gy~~~~ 39 (273)
.|.+.++++++++.|.|++||.+.+
T Consensus 7 gC~h~C~~~~g~~~C~C~~Gy~L~~ 31 (36)
T PF14670_consen 7 GCSHICVNTPGSYRCSCPPGYKLAE 31 (36)
T ss_dssp GSSSEEEEETTSEEEE-STTEEE-T
T ss_pred CcCCCCccCCCceEeECCCCCEECc
Confidence 4778888889999999999998754
No 30
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=88.58 E-value=0.34 Score=27.66 Aligned_cols=21 Identities=10% Similarity=0.331 Sum_probs=15.4
Q ss_pred CceeeeCCCCeeC-CCCCCCCC
Q psy8491 26 RQAVICRFDLKFV-NASYCAPV 46 (273)
Q Consensus 26 ~~~C~C~~Gy~~~-~~~~C~~~ 46 (273)
+|.|.|++||+.. +...|+.+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DI 22 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDI 22 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccC
Confidence 5899999999974 35566543
No 31
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=87.37 E-value=0.23 Score=35.18 Aligned_cols=44 Identities=18% Similarity=0.299 Sum_probs=22.8
Q ss_pred cCCCCeeeCCCCCCCCCCCCCCCCCeecCCCeeECCCCceeCCCCc
Q psy8491 153 QCDVGYTLNPRTLYCEPECFNCTRGYCLEPNECSCPYNFHLEGGMC 198 (273)
Q Consensus 153 ~C~~Gy~g~~c~~~C~~~C~~C~nG~C~~~~~C~C~~Gy~G~~~~C 198 (273)
.|.+.|+|..|...|.+.=..=.+=+|...+.=+|.+||+| ..|
T Consensus 20 ~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G--~~C 63 (63)
T PF01414_consen 20 VCDENYYGPNCSKFCKPRDDSFGHYTCDSNGNKVCLPGWTG--PNC 63 (63)
T ss_dssp ---TTEETTTT-EE---EEETTEEEEE-SS--EEE-TTEES--TTS
T ss_pred ECCCCCCCccccCCcCCCcCCcCCcccCCCCCCCCCCCCcC--CCC
Confidence 78899999988777665211100238888889999999999 544
No 32
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=86.34 E-value=0.51 Score=30.28 Aligned_cols=10 Identities=40% Similarity=1.388 Sum_probs=4.5
Q ss_pred CeEEcCCCCe
Q psy8491 60 EQCHCSPGFQ 69 (273)
Q Consensus 60 ~~C~C~~Gy~ 69 (273)
|+|.|++||+
T Consensus 25 y~C~C~~Gy~ 34 (42)
T PF07645_consen 25 YSCSCPPGYE 34 (42)
T ss_dssp EEEEESTTEE
T ss_pred EEeeCCCCcE
Confidence 3444444444
No 33
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=85.96 E-value=0.81 Score=27.92 Aligned_cols=19 Identities=42% Similarity=1.249 Sum_probs=10.6
Q ss_pred CC-CCEEcC---CCeEEcCCCCe
Q psy8491 51 CI-NGACVF---PEQCHCSPGFQ 69 (273)
Q Consensus 51 C~-~g~C~~---~~~C~C~~Gy~ 69 (273)
|. ++.|++ .+.|.|.+||+
T Consensus 11 C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 11 CQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred cCCCCEeECCCCCeEeECCCCCc
Confidence 44 445553 34566666666
No 34
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=85.93 E-value=1.2 Score=27.15 Aligned_cols=18 Identities=11% Similarity=-0.005 Sum_probs=9.5
Q ss_pred ccccCCCCceeeeCCCCe
Q psy8491 19 TSMSQDLRQAVICRFDLK 36 (273)
Q Consensus 19 ~~~~~~~~~~C~C~~Gy~ 36 (273)
+++++.+.+.|.|++||.
T Consensus 16 ~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 16 TCVNTVGSYRCECPPGYT 33 (39)
T ss_pred EeECCCCCeEeECCCCCc
Confidence 444555555555555554
No 35
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=85.55 E-value=0.33 Score=31.20 Aligned_cols=20 Identities=10% Similarity=0.190 Sum_probs=18.4
Q ss_pred cccccCCCCceeeeCCCCee
Q psy8491 18 STSMSQDLRQAVICRFDLKF 37 (273)
Q Consensus 18 ~~~~~~~~~~~C~C~~Gy~~ 37 (273)
..++++.++|.|.|++||+.
T Consensus 16 ~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 16 GTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp SEEEEETTEEEEEESTTEEE
T ss_pred CEEEcCCCCEEeeCCCCcEE
Confidence 77889999999999999994
No 36
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=84.19 E-value=0.59 Score=40.20 Aligned_cols=82 Identities=21% Similarity=0.570 Sum_probs=51.6
Q ss_pred cCCCCceeeeCCCCeeCCCCCCC-CCCCC-CCCCCEEcC------CCeEEcCCCCeeCCCCCcc----cCC-CCCC-Cce
Q psy8491 22 SQDLRQAVICRFDLKFVNASYCA-PVCSS-PCINGACVF------PEQCHCSPGFQFVNETYCE----PYC-ENCQ-HGT 87 (273)
Q Consensus 22 ~~~~~~~C~C~~Gy~~~~~~~C~-~~C~~-~C~~g~C~~------~~~C~C~~Gy~g~~~~~C~----~~C-~~C~-~G~ 87 (273)
..+..+.|.|-+||.... ..|. ..|.. .|++|.|+. ...|+|.=|+..++..+|. ..| ..|. |..
T Consensus 65 ~~~~~~~C~C~~gY~~~~-~vCvp~~C~~~~Cg~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~ 143 (197)
T PF06247_consen 65 GEERAYKCDCINGYILKQ-GVCVPNKCNNKDCGSGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEE 143 (197)
T ss_dssp TSSTSEEEEE-TTEEESS-SSEEEGGGSS---TTEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEE
T ss_pred ccceeEEEecccCceeeC-CeEchhhcCceecCCCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcc
Confidence 566789999999999754 4665 34553 588999973 2389999999955556776 345 5565 577
Q ss_pred ecC---CCceeeCCCcccCC
Q psy8491 88 CTA---PSVCECESGFVHNN 104 (273)
Q Consensus 88 C~~---~~~C~C~~Gy~g~~ 104 (273)
|.. .++|++.+|+.++.
T Consensus 144 CK~~~~~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 144 CKLVDGYYKCVCKEGFPGDG 163 (197)
T ss_dssp EEEETTEEEEEE-TT-EEET
T ss_pred eeeeCcEEEeecCCCCCCCC
Confidence 854 34899999998864
No 37
>KOG1836|consensus
Probab=83.45 E-value=1.9 Score=48.42 Aligned_cols=48 Identities=27% Similarity=0.495 Sum_probs=30.0
Q ss_pred cccccCCCCeeeCCCCCCCCCCCC--CCCC-Ceec-----CCCeeE-CCCCceeCCCCccc
Q psy8491 149 YCIIQCDVGYTLNPRTLYCEPECF--NCTR-GYCL-----EPNECS-CPYNFHLEGGMCMI 200 (273)
Q Consensus 149 ~C~~~C~~Gy~g~~c~~~C~~~C~--~C~n-G~C~-----~~~~C~-C~~Gy~G~~~~C~~ 200 (273)
.|. +|..||+|..-...-.+ |. +|-+ +.|. ....|. ||+||+| .+|+.
T Consensus 757 ~C~-~C~~GfYg~~~~~~~~d-C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG--~rCe~ 813 (1705)
T KOG1836|consen 757 QCA-QCVDGFYGLPDLGTSGD-CQPCPCPNGGACGQTPEILEVVCKNCPPGYTG--LRCEE 813 (1705)
T ss_pred chh-hhcCCCCCccccCCCCC-CccCCCCCChhhcCcCcccceecCCCCCCCcc--ccccc
Confidence 444 89999999753322111 44 3434 4453 236898 9999999 77765
No 38
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=83.15 E-value=1.4 Score=25.85 Aligned_cols=18 Identities=39% Similarity=1.001 Sum_probs=10.9
Q ss_pred CCEEcC---CCeEEcCCCCee
Q psy8491 53 NGACVF---PEQCHCSPGFQF 70 (273)
Q Consensus 53 ~g~C~~---~~~C~C~~Gy~g 70 (273)
++.|++ .++|.|++||.|
T Consensus 11 ~~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 11 GGTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CCEEecCCCCeEeECCCCCcc
Confidence 455554 356666666666
No 39
>KOG1218|consensus
Probab=80.76 E-value=13 Score=33.52 Aligned_cols=12 Identities=42% Similarity=1.198 Sum_probs=6.3
Q ss_pred CCceeeCCCccc
Q psy8491 91 PSVCECESGFVH 102 (273)
Q Consensus 91 ~~~C~C~~Gy~g 102 (273)
.+.|.|.+||.+
T Consensus 161 ~~~c~c~~g~~g 172 (316)
T KOG1218|consen 161 NGICTCQPGFVG 172 (316)
T ss_pred CCceeccCCccc
Confidence 345555555555
No 40
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=78.86 E-value=0.82 Score=32.35 Aligned_cols=43 Identities=23% Similarity=0.443 Sum_probs=13.7
Q ss_pred CCceeeeCCCCeeCCCCCCCCCCCCC--CC-CCEEcCCCeEEcCCCCee
Q psy8491 25 LRQAVICRFDLKFVNASYCAPVCSSP--CI-NGACVFPEQCHCSPGFQF 70 (273)
Q Consensus 25 ~~~~C~C~~Gy~~~~~~~C~~~C~~~--C~-~g~C~~~~~C~C~~Gy~g 70 (273)
.+.+-.|.+.|+|.. |...|.+. =. +.+|...|.-.|.+||+|
T Consensus 15 ~~~rv~C~~nyyG~~---C~~~C~~~~d~~ghy~Cd~~G~~~C~~Gw~G 60 (63)
T PF01414_consen 15 YRIRVVCDENYYGPN---CSKFCKPRDDSFGHYTCDSNGNKVCLPGWTG 60 (63)
T ss_dssp --------TTEETTT---T-EE---EEETTEEEEE-SS--EEE-TTEES
T ss_pred EEEEEECCCCCCCcc---ccCCcCCCcCCcCCcccCCCCCCCCCCCCcC
Confidence 344555666666533 22233221 11 455655566666666666
No 41
>PHA02887 EGF-like protein; Provisional
Probab=78.61 E-value=1.7 Score=34.57 Aligned_cols=27 Identities=22% Similarity=0.545 Sum_probs=21.5
Q ss_pred CCCCCeec-----CCCeeECCCCceeCCCCcccc
Q psy8491 173 NCTRGYCL-----EPNECSCPYNFHLEGGMCMIA 201 (273)
Q Consensus 173 ~C~nG~C~-----~~~~C~C~~Gy~G~~~~C~~~ 201 (273)
-|.||+|. ....|.|++||+| .+|+..
T Consensus 93 YCiHG~C~yI~dL~epsCrC~~GYtG--~RCE~v 124 (126)
T PHA02887 93 FCINGECMNIIDLDEKFCICNKGYTG--IRCDEV 124 (126)
T ss_pred EeeCCEEEccccCCCceeECCCCccc--CCCCcc
Confidence 47789995 2368999999999 888753
No 42
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=77.77 E-value=2.4 Score=25.29 Aligned_cols=18 Identities=39% Similarity=1.034 Sum_probs=10.6
Q ss_pred CCEEcC---CCeEEcCCCCee
Q psy8491 53 NGACVF---PEQCHCSPGFQF 70 (273)
Q Consensus 53 ~g~C~~---~~~C~C~~Gy~g 70 (273)
++.|.+ .+.|.|.+||.|
T Consensus 14 ~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 14 GGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred CCEeECCCCCeEeECCCCCcC
Confidence 445543 346677777766
No 43
>smart00181 EGF Epidermal growth factor-like domain.
Probab=77.40 E-value=1.9 Score=25.81 Aligned_cols=11 Identities=36% Similarity=1.047 Sum_probs=5.6
Q ss_pred CeEEcCCCCee
Q psy8491 60 EQCHCSPGFQF 70 (273)
Q Consensus 60 ~~C~C~~Gy~g 70 (273)
++|.|.+||+|
T Consensus 20 ~~C~C~~g~~g 30 (35)
T smart00181 20 YTCSCPPGYTG 30 (35)
T ss_pred eEeECCCCCcc
Confidence 44555555544
No 44
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=74.78 E-value=2.2 Score=34.45 Aligned_cols=28 Identities=21% Similarity=0.546 Sum_probs=22.7
Q ss_pred CCCCCeec-----CCCeeECCCCceeCCCCccccC
Q psy8491 173 NCTRGYCL-----EPNECSCPYNFHLEGGMCMIAE 202 (273)
Q Consensus 173 ~C~nG~C~-----~~~~C~C~~Gy~G~~~~C~~~~ 202 (273)
-|.||+|. ....|.|+.||+| .+||...
T Consensus 52 YClHG~C~yI~dl~~~~CrC~~GYtG--eRCEh~d 84 (139)
T PHA03099 52 YCLHGDCIHARDIDGMYCRCSHGYTG--IRCQHVV 84 (139)
T ss_pred EeECCEEEeeccCCCceeECCCCccc--cccccee
Confidence 46688885 3478999999999 9998764
No 45
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=63.20 E-value=17 Score=24.00 Aligned_cols=22 Identities=32% Similarity=0.924 Sum_probs=16.6
Q ss_pred CCeecCCCeeECCCCceeCCCCc
Q psy8491 176 RGYCLEPNECSCPYNFHLEGGMC 198 (273)
Q Consensus 176 nG~C~~~~~C~C~~Gy~G~~~~C 198 (273)
+..|+ .+.|.|++||+-.+.+|
T Consensus 31 ~s~C~-~g~C~C~~g~~~~~~~C 52 (52)
T PF01683_consen 31 GSVCV-NGRCQCPPGYVEVGGRC 52 (52)
T ss_pred cCEEc-CCEeECCCCCEecCCCC
Confidence 45774 47999999998766654
No 46
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=61.53 E-value=7.6 Score=24.03 Aligned_cols=16 Identities=6% Similarity=0.044 Sum_probs=9.8
Q ss_pred CCceeeeCCCCeeCCC
Q psy8491 25 LRQAVICRFDLKFVNA 40 (273)
Q Consensus 25 ~~~~C~C~~Gy~~~~~ 40 (273)
..+.|.|++||..+..
T Consensus 16 ~~~~C~CPeGyIlde~ 31 (34)
T PF09064_consen 16 SPGQCFCPEGYILDEG 31 (34)
T ss_pred CCCceeCCCceEecCC
Confidence 4456777777766543
No 47
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=59.37 E-value=51 Score=34.80 Aligned_cols=20 Identities=30% Similarity=0.687 Sum_probs=15.3
Q ss_pred CCeeECCCCceeCCCCcccc
Q psy8491 182 PNECSCPYNFHLEGGMCMIA 201 (273)
Q Consensus 182 ~~~C~C~~Gy~G~~~~C~~~ 201 (273)
.+.|+|..||...+..|...
T Consensus 750 ~~vC~C~~g~~l~~~~c~~~ 769 (800)
T PTZ00214 750 QGVCMCELDAVLTKGVCVPA 769 (800)
T ss_pred CCeEEeCCcceecCCeeEec
Confidence 36889999998877777654
No 48
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=53.50 E-value=69 Score=30.78 Aligned_cols=18 Identities=22% Similarity=0.741 Sum_probs=9.0
Q ss_pred EcCCCCeeCCCCCcccCC
Q psy8491 63 HCSPGFQFVNETYCEPYC 80 (273)
Q Consensus 63 ~C~~Gy~g~~~~~C~~~C 80 (273)
.|.++|.+.....|...|
T Consensus 41 ~C~~~~~lt~t~~Ci~~C 58 (397)
T PF03302_consen 41 ECNSGYYLTPTNQCIEDC 58 (397)
T ss_pred cCCCCCcCCCCCccccCc
Confidence 366666653234454443
No 49
>PHA02887 EGF-like protein; Provisional
Probab=52.69 E-value=12 Score=29.90 Aligned_cols=24 Identities=38% Similarity=1.018 Sum_probs=16.7
Q ss_pred CCCCEEcC-----CCeEEcCCCCeeCCCCCcc
Q psy8491 51 CINGACVF-----PEQCHCSPGFQFVNETYCE 77 (273)
Q Consensus 51 C~~g~C~~-----~~~C~C~~Gy~g~~~~~C~ 77 (273)
|.||.|.- ...|.|.+||+| ..|+
T Consensus 94 CiHG~C~yI~dL~epsCrC~~GYtG---~RCE 122 (126)
T PHA02887 94 CINGECMNIIDLDEKFCICNKGYTG---IRCD 122 (126)
T ss_pred eeCCEEEccccCCCceeECCCCccc---CCCC
Confidence 55777763 357888888888 5554
No 50
>PF02363 C_tripleX: Cysteine rich repeat; InterPro: IPR003341 This signature describes a cysteine repeat C-X3-C-X3-C the function of which is unknown as is the function of the proteins in which they occur.
Probab=39.16 E-value=14 Score=18.86 Aligned_cols=12 Identities=42% Similarity=1.140 Sum_probs=6.2
Q ss_pred CCCCCCCCCCCC
Q psy8491 42 YCAPVCSSPCIN 53 (273)
Q Consensus 42 ~C~~~C~~~C~~ 53 (273)
.|.|.|...|.+
T Consensus 2 ~C~p~C~~~C~~ 13 (17)
T PF02363_consen 2 QCVPQCEPSCEN 13 (17)
T ss_pred cchhhccCcccc
Confidence 355555555544
No 51
>KOG1836|consensus
Probab=38.69 E-value=49 Score=37.82 Aligned_cols=16 Identities=19% Similarity=0.277 Sum_probs=11.8
Q ss_pred eeeCCCCeeCCCCCCC
Q psy8491 29 VICRFDLKFVNASYCA 44 (273)
Q Consensus 29 C~C~~Gy~~~~~~~C~ 44 (273)
|.|+.||.|...+.|.
T Consensus 697 c~C~~g~tG~~Ce~C~ 712 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCESCA 712 (1705)
T ss_pred ccCCCCcccchhhhcc
Confidence 8899999876555553
No 52
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=35.64 E-value=33 Score=27.87 Aligned_cols=24 Identities=33% Similarity=1.013 Sum_probs=14.3
Q ss_pred CCCCEEcC-----CCeEEcCCCCeeCCCCCcc
Q psy8491 51 CINGACVF-----PEQCHCSPGFQFVNETYCE 77 (273)
Q Consensus 51 C~~g~C~~-----~~~C~C~~Gy~g~~~~~C~ 77 (273)
|.||.|.- ...|.|..||+| .+|+
T Consensus 53 ClHG~C~yI~dl~~~~CrC~~GYtG---eRCE 81 (139)
T PHA03099 53 CLHGDCIHARDIDGMYCRCSHGYTG---IRCQ 81 (139)
T ss_pred eECCEEEeeccCCCceeECCCCccc---cccc
Confidence 45666642 345777777777 4554
No 53
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=35.62 E-value=16 Score=23.05 Aligned_cols=24 Identities=29% Similarity=0.859 Sum_probs=16.8
Q ss_pred CCeec----CCCeeECCCCceeCCCCcc
Q psy8491 176 RGYCL----EPNECSCPYNFHLEGGMCM 199 (273)
Q Consensus 176 nG~C~----~~~~C~C~~Gy~G~~~~C~ 199 (273)
|+.|. +..+|.|..||..++..|+
T Consensus 10 NA~C~~~~dG~eecrCllgyk~~~~~C~ 37 (37)
T PF12946_consen 10 NAGCFRYDDGSEECRCLLGYKKVGGKCV 37 (37)
T ss_dssp TEEEEEETTSEEEEEE-TTEEEETTEEE
T ss_pred CcccEEcCCCCEEEEeeCCccccCCCcC
Confidence 66675 3468999999998777663
No 54
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=32.58 E-value=32 Score=22.94 Aligned_cols=16 Identities=25% Similarity=0.791 Sum_probs=11.5
Q ss_pred eeECCCCceeCCC-Ccc
Q psy8491 184 ECSCPYNFHLEGG-MCM 199 (273)
Q Consensus 184 ~C~C~~Gy~G~~~-~C~ 199 (273)
.|.|++||.-++. .|.
T Consensus 34 gC~C~~G~v~~~~~~CV 50 (55)
T PF01826_consen 34 GCFCPPGYVRNDNGRCV 50 (55)
T ss_dssp EEEETTTEEEETTSEEE
T ss_pred cCCCCCCeeEcCCCCEE
Confidence 3999999987544 553
No 55
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=29.95 E-value=39 Score=21.93 Aligned_cols=17 Identities=24% Similarity=0.477 Sum_probs=12.8
Q ss_pred CCeeECCCCceeCCCCccc
Q psy8491 182 PNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 182 ~~~C~C~~Gy~G~~~~C~~ 200 (273)
.+.|.|+++|+| ..|+.
T Consensus 17 ~G~C~C~~~~~G--~~C~~ 33 (46)
T smart00180 17 TGQCECKPNVTG--RRCDR 33 (46)
T ss_pred CCEEECCCCCCC--CCCCc
Confidence 467888888888 77654
No 56
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=29.77 E-value=23 Score=23.04 Aligned_cols=21 Identities=19% Similarity=0.357 Sum_probs=13.5
Q ss_pred eecC-CCeeECCCCceeCCCCccc
Q psy8491 178 YCLE-PNECSCPYNFHLEGGMCMI 200 (273)
Q Consensus 178 ~C~~-~~~C~C~~Gy~G~~~~C~~ 200 (273)
.|.. .+.|.|+++|+| ..|+.
T Consensus 12 ~C~~~~G~C~C~~~~~G--~~C~~ 33 (49)
T PF00053_consen 12 TCDPSTGQCVCKPGTTG--PRCDQ 33 (49)
T ss_dssp SEEETCEEESBSTTEES--TTS-E
T ss_pred cccCCCCEEeccccccC--CcCcC
Confidence 4443 467888888888 76664
No 57
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=29.41 E-value=39 Score=25.93 Aligned_cols=22 Identities=41% Similarity=0.922 Sum_probs=16.8
Q ss_pred CCCC-CCEEcC--CCeEEcCCCCee
Q psy8491 49 SPCI-NGACVF--PEQCHCSPGFQF 70 (273)
Q Consensus 49 ~~C~-~g~C~~--~~~C~C~~Gy~g 70 (273)
..|+ +|.|.. ...|.|.+||.-
T Consensus 84 ~~CG~~g~C~~~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 84 GFCGPNGICNSNNSPKCSCLPGFEP 108 (110)
T ss_pred cccCCccEeCCCCCCceECCCCcCC
Confidence 4587 899975 347999999963
No 58
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=27.38 E-value=50 Score=21.68 Aligned_cols=14 Identities=7% Similarity=-0.100 Sum_probs=8.3
Q ss_pred CCceeeeCCCCeeC
Q psy8491 25 LRQAVICRFDLKFV 38 (273)
Q Consensus 25 ~~~~C~C~~Gy~~~ 38 (273)
..+.|.|.++|.|.
T Consensus 17 ~~G~C~C~~~~~G~ 30 (50)
T cd00055 17 GTGQCECKPNTTGR 30 (50)
T ss_pred CCCEEeCCCcCCCC
Confidence 35566666666653
No 59
>KOG1388|consensus
Probab=26.04 E-value=67 Score=28.34 Aligned_cols=47 Identities=17% Similarity=0.327 Sum_probs=31.0
Q ss_pred cCCCCeeeCCCCCCCCCCCC-CCCCCeecCCCeeEC-CCCceeCCCCcccc
Q psy8491 153 QCDVGYTLNPRTLYCEPECF-NCTRGYCLEPNECSC-PYNFHLEGGMCMIA 201 (273)
Q Consensus 153 ~C~~Gy~g~~c~~~C~~~C~-~C~nG~C~~~~~C~C-~~Gy~G~~~~C~~~ 201 (273)
.|..||+|+..+..|+++.. ....+.....++|.| ..|+.| +.|..-
T Consensus 80 kc~~g~~GdtN~g~c~~~~~~g~~~~~~~~~~~c~c~~kgvvg--d~c~~~ 128 (217)
T KOG1388|consen 80 KCIVGFYGDTNGGKCQPCDCNGGASACVTLTGKCFCTTKGVVG--DLCPKC 128 (217)
T ss_pred ceEEEEEecCCCCccCHhhhcCCeeeeeccCCccccccceEec--ccCccc
Confidence 68899999755667776422 222344456789999 678888 555443
No 60
>KOG0196|consensus
Probab=21.58 E-value=96 Score=32.88 Aligned_cols=44 Identities=25% Similarity=0.782 Sum_probs=25.8
Q ss_pred CeEEcCCCCee-CCCCCcc-------------cCCCCCC-Cceec--CCCceeeCCCcccC
Q psy8491 60 EQCHCSPGFQF-VNETYCE-------------PYCENCQ-HGTCT--APSVCECESGFVHN 103 (273)
Q Consensus 60 ~~C~C~~Gy~g-~~~~~C~-------------~~C~~C~-~G~C~--~~~~C~C~~Gy~g~ 103 (273)
|.|.|.+||.. .....|+ ..|..|. |-.=. +...|.|..||...
T Consensus 259 G~C~C~aGye~~~~~~~C~aCp~G~yK~~~~~~~C~~CP~~S~s~~ega~~C~C~~gyyRA 319 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQACPPGTYKASQGDSLCLPCPPNSHSSSEGATSCTCENGYYRA 319 (996)
T ss_pred CceeecCCCCcccCCCcceeCCCCcccCCCCCCCCCCCCCCCCCCCCCCCcccccCCcccC
Confidence 78999999986 3334554 1233332 22211 22378999998764
Done!