Query psy9427
Match_columns 324
No_of_seqs 289 out of 2381
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 19:04:44 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9427.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9427hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.4 1.3E-12 2.8E-17 128.0 11.2 151 19-217 699-858 (1289)
2 KOG1219|consensus 99.3 1.1E-11 2.3E-16 131.1 8.9 110 15-158 3865-3976(4289)
3 KOG1219|consensus 99.2 2.5E-11 5.5E-16 128.3 8.9 110 77-219 3864-3975(4289)
4 KOG1214|consensus 99.2 4.2E-11 9.1E-16 117.7 9.6 133 5-165 724-866 (1289)
5 KOG1217|consensus 99.0 9.9E-09 2.2E-13 99.7 16.6 190 15-217 170-388 (487)
6 KOG1217|consensus 99.0 2.7E-08 5.9E-13 96.6 18.1 178 16-218 91-304 (487)
7 KOG4289|consensus 98.7 1.3E-08 2.8E-13 105.1 5.5 104 16-152 1181-1308(2531)
8 KOG4260|consensus 98.5 5.2E-08 1.1E-12 85.1 3.7 152 20-217 150-305 (350)
9 KOG4289|consensus 98.5 1.2E-07 2.6E-12 98.1 6.4 92 92-214 1216-1308(2531)
10 KOG1225|consensus 98.2 1.4E-05 3E-10 77.8 11.5 20 194-218 345-364 (525)
11 KOG1225|consensus 98.1 4.2E-05 9.2E-10 74.5 11.8 149 98-313 234-387 (525)
12 KOG4260|consensus 98.0 1.2E-05 2.6E-10 70.6 5.3 105 83-217 150-269 (350)
13 PF06247 Plasmod_Pvs28: Plasmo 97.9 3.4E-06 7.5E-11 70.3 1.2 145 21-217 7-161 (197)
14 PF00008 EGF: EGF-like domain 97.9 1.1E-05 2.4E-10 48.2 2.9 30 17-47 1-31 (32)
15 PF07645 EGF_CA: Calcium-bindi 97.9 8.9E-06 1.9E-10 51.9 2.5 25 193-217 11-35 (42)
16 PF07645 EGF_CA: Calcium-bindi 97.7 2.3E-05 5E-10 50.0 2.4 34 121-157 3-37 (42)
17 PF00008 EGF: EGF-like domain 97.5 5.5E-05 1.2E-09 45.2 1.6 29 80-108 1-30 (32)
18 smart00179 EGF_CA Calcium-bind 97.4 0.00024 5.2E-09 44.0 3.9 32 77-108 2-35 (39)
19 PF12662 cEGF: Complement Clr- 97.4 0.00017 3.7E-09 39.8 2.7 24 144-167 1-24 (24)
20 PF12947 EGF_3: EGF domain; I 97.4 0.00021 4.5E-09 43.9 3.2 28 21-49 7-34 (36)
21 smart00179 EGF_CA Calcium-bind 97.3 0.00043 9.2E-09 42.9 3.7 31 14-45 2-33 (39)
22 PF12947 EGF_3: EGF domain; I 97.2 0.00024 5.1E-09 43.6 2.0 28 83-110 6-33 (36)
23 PF12662 cEGF: Complement Clr- 97.2 0.00038 8.1E-09 38.5 2.3 24 97-122 1-24 (24)
24 cd00054 EGF_CA Calcium-binding 97.0 0.0013 2.8E-08 40.1 3.8 32 77-108 2-34 (38)
25 cd00054 EGF_CA Calcium-binding 96.8 0.0015 3.2E-08 39.9 3.3 33 14-47 2-35 (38)
26 cd00053 EGF Epidermal growth f 96.7 0.0025 5.3E-08 38.2 3.4 28 19-47 5-32 (36)
27 cd00053 EGF Epidermal growth f 96.5 0.0048 1E-07 36.9 3.8 28 82-109 5-32 (36)
28 smart00181 EGF Epidermal growt 96.2 0.0061 1.3E-07 36.7 3.2 29 17-47 2-31 (35)
29 KOG1226|consensus 96.2 0.041 8.9E-07 55.5 10.4 142 99-283 479-636 (783)
30 smart00181 EGF Epidermal growt 96.1 0.0092 2E-07 35.9 3.5 28 80-108 2-30 (35)
31 PF14670 FXa_inhibition: Coagu 95.3 0.013 2.8E-07 35.9 1.9 26 132-160 9-34 (36)
32 PF14670 FXa_inhibition: Coagu 95.2 0.014 3.1E-07 35.7 1.8 23 89-111 10-32 (36)
33 KOG1226|consensus 95.1 0.43 9.4E-06 48.4 13.0 117 84-239 515-635 (783)
34 KOG0994|consensus 92.9 0.48 1E-05 50.1 8.4 63 90-159 877-948 (1758)
35 PF07974 EGF_2: EGF-like domai 92.9 0.17 3.7E-06 30.0 3.3 25 20-47 6-30 (32)
36 PF06247 Plasmod_Pvs28: Plasmo 92.3 0.1 2.2E-06 43.9 2.4 106 20-158 50-164 (197)
37 PF07974 EGF_2: EGF-like domai 91.1 0.25 5.3E-06 29.3 2.5 23 193-217 7-29 (32)
38 PF04706 Dickkopf_N: Dickkopf 91.0 0.41 8.9E-06 31.8 3.8 21 295-315 31-51 (52)
39 PF12946 EGF_MSP1_1: MSP1 EGF 91.0 0.22 4.7E-06 30.5 2.3 30 17-47 2-32 (37)
40 PF12661 hEGF: Human growth fa 90.2 0.27 5.8E-06 22.9 1.7 11 36-47 1-11 (13)
41 PF12946 EGF_MSP1_1: MSP1 EGF 89.3 0.17 3.6E-06 31.0 0.7 30 81-110 3-33 (37)
42 KOG0994|consensus 88.6 2 4.3E-05 45.8 8.2 75 32-112 931-1013(1758)
43 cd01475 vWA_Matrilin VWA_Matri 88.3 0.44 9.6E-06 41.8 3.1 36 161-217 183-218 (224)
44 cd01475 vWA_Matrilin VWA_Matri 85.1 1.2 2.5E-05 39.1 4.1 38 114-157 182-220 (224)
45 PF04706 Dickkopf_N: Dickkopf 84.8 1.4 3E-05 29.3 3.3 15 233-248 2-16 (52)
46 PF00954 S_locus_glycop: S-loc 79.8 1.8 3.9E-05 33.4 2.8 24 193-217 85-108 (110)
47 PF00954 S_locus_glycop: S-loc 78.4 2 4.2E-05 33.2 2.7 31 14-46 77-108 (110)
48 KOG1836|consensus 71.2 7 0.00015 44.2 5.5 51 101-158 760-811 (1705)
49 PF01683 EB: EB module; Inter 68.3 3.6 7.9E-05 27.0 1.7 18 297-314 22-39 (52)
50 PF00095 WAP: WAP-type (Whey A 60.7 3.2 6.9E-05 26.1 0.3 20 291-310 13-32 (43)
51 smart00051 DSL delta serrate l 57.8 17 0.00036 25.2 3.5 47 97-157 16-62 (63)
52 PF09064 Tme5_EGF_like: Thromb 55.6 14 0.00031 22.1 2.4 21 27-49 11-31 (34)
53 PF04885 Stig1: Stigma-specifi 55.1 32 0.0007 27.7 5.2 34 242-275 68-108 (136)
54 PHA02887 EGF-like protein; Pro 53.5 13 0.00028 29.0 2.5 24 193-217 93-118 (126)
55 PHA02887 EGF-like protein; Pro 53.1 15 0.00033 28.6 2.8 31 83-119 92-124 (126)
56 cd00055 EGF_Lam Laminin-type e 49.0 21 0.00044 23.2 2.7 25 27-54 13-37 (50)
57 PF00053 Laminin_EGF: Laminin 48.2 15 0.00032 23.7 1.9 26 26-54 11-36 (49)
58 smart00217 WAP Four-disulfide 48.2 25 0.00054 22.7 2.9 24 287-310 12-35 (47)
59 PHA03099 epidermal growth fact 42.1 22 0.00047 28.3 2.2 24 193-217 52-77 (139)
60 PHA03099 epidermal growth fact 41.8 24 0.00051 28.1 2.4 31 83-119 51-83 (139)
61 KOG3516|consensus 36.4 26 0.00056 37.9 2.4 37 77-118 545-582 (1306)
62 KOG3516|consensus 30.9 34 0.00074 37.1 2.3 33 14-47 545-578 (1306)
63 PF04885 Stig1: Stigma-specifi 30.4 1.1E+02 0.0023 24.8 4.6 54 262-315 67-134 (136)
64 cd00199 WAP whey acidic protei 29.6 64 0.0014 21.9 2.8 21 290-310 28-48 (60)
65 smart00180 EGF_Lam Laminin-typ 28.5 57 0.0012 20.7 2.3 25 27-54 12-36 (46)
66 PF08562 Crisp: Crisp; InterP 27.1 27 0.00058 23.5 0.5 15 306-320 5-19 (55)
67 PF11403 Yeast_MT: Yeast metal 23.1 93 0.002 18.5 2.2 12 293-304 26-37 (40)
68 KOG3514|consensus 23.1 56 0.0012 35.2 2.1 35 79-118 625-660 (1591)
69 cd01328 FSL_SPARC Follistatin- 21.3 1.2E+02 0.0026 22.4 3.1 27 16-42 1-28 (86)
No 1
>KOG1214|consensus
Probab=99.41 E-value=1.3e-12 Score=128.03 Aligned_cols=151 Identities=25% Similarity=0.564 Sum_probs=113.8
Q ss_pred CCCCCCCCEeeeC-CCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCC--CCCCCCCEEeeCC
Q psy9427 19 GVSCGVNAVCVVN-NHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDS--GACGTNAVCIPED 95 (324)
Q Consensus 19 ~~~C~~~~~C~~~-~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~--~~C~~~~~C~~~~ 95 (324)
+.-|..++.|... .-.|+|.|..| |.|++. .|.+ +++|.. ..|+.++.|++.+
T Consensus 699 sh~cdt~a~C~pg~~~~~tcecs~g-~~gdgr-~c~d----------------------~~eca~~~~~CGp~s~Cin~p 754 (1289)
T KOG1214|consen 699 SHMCDTTARCHPGTGVDYTCECSSG-YQGDGR-NCVD----------------------ENECATGFHRCGPNSVCINLP 754 (1289)
T ss_pred CcccCCCccccCCCCcceEEEEeec-cCCCCC-CCCC----------------------hhhhccCCCCCCCCceeecCC
Confidence 4567888889754 45799999999 888654 4555 667753 5699999999999
Q ss_pred CCcEeeCCCCCcCCCCCCCcccccC-----CCCCCC-CCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCC
Q psy9427 96 HRPVCHCPVGTHPSPSPDVHCARES-----NLCRDS-NPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCP 169 (324)
Q Consensus 96 ~~~~C~C~~G~~g~~~~~~~C~~~~-----~~C~~~-~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~ 169 (324)
++|.|.|..||...... .+|..+. +.|... ..|+..++++-+...-+.|+|+|.+||.|++. .|.+++||.
T Consensus 755 g~~rceC~~gy~F~dd~-~tCV~i~~pap~n~Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG~--~c~dvDeC~ 831 (1289)
T KOG1214|consen 755 GSYRCECRSGYEFADDR-HTCVLITPPAPANPCEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDGH--QCTDVDECS 831 (1289)
T ss_pred CceeEEEeecceeccCC-cceEEecCCCCCCccccCccccCcCCceEEEecCCceEEEeecCCccCCcc--ccccccccC
Confidence 99999999998754433 5676542 445332 35655554443222556799999999999984 499999983
Q ss_pred CCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 170 RGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 170 ~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
.. .|+.+|.|.+++|+|.|+|.+||.|
T Consensus 832 --ps-------------------rChp~A~CyntpgsfsC~C~pGy~G 858 (1289)
T KOG1214|consen 832 --PS-------------------RCHPAATCYNTPGSFSCRCQPGYYG 858 (1289)
T ss_pred --cc-------------------ccCCCceEecCCCcceeecccCccC
Confidence 21 5889999999999999999999997
No 2
>KOG1219|consensus
Probab=99.27 E-value=1.1e-11 Score=131.12 Aligned_cols=110 Identities=26% Similarity=0.704 Sum_probs=98.2
Q ss_pred cCCCCCCCCCCCEeeeC-CCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCCCCCCCCCEEee
Q psy9427 15 NSCDGVSCGVNAVCVVN-NHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDSGACGTNAVCIP 93 (324)
Q Consensus 15 ~~C~~~~C~~~~~C~~~-~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~~~C~~~~~C~~ 93 (324)
++|..+||+++|+|... .+.|+|.|++. |+|. .|+.. +++|.++||..+++|+.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsq-ysG~---~CEi~---------------------~epC~snPC~~GgtCip 3919 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQ-YSGN---HCEID---------------------LEPCASNPCLTGGTCIP 3919 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCccc-ccCc---ccccc---------------------cccccCCCCCCCCEEEe
Confidence 78999999999999965 67899999999 9995 34432 78999999999999999
Q ss_pred CCCCcEeeCCCCCcCCCCCCCccccc-CCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCC
Q psy9427 94 EDHRPVCHCPVGTHPSPSPDVHCARE-SNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPY 158 (324)
Q Consensus 94 ~~~~~~C~C~~G~~g~~~~~~~C~~~-~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~ 158 (324)
..+.|.|.|+.||+| .+|+.. +++| ..++|..++.|+| ++|+|.|.|.+||.|..+
T Consensus 3920 ~~n~f~CnC~~gyTG-----~~Ce~~Gi~eC-s~n~C~~gg~C~n---~~gsf~CncT~g~~gr~c 3976 (4289)
T KOG1219|consen 3920 FYNGFLCNCPNGYTG-----KRCEARGISEC-SKNVCGTGGQCIN---IPGSFHCNCTPGILGRTC 3976 (4289)
T ss_pred cCCCeeEeCCCCccC-----ceeeccccccc-ccccccCCceeec---cCCceEeccChhHhcccC
Confidence 999999999999999 889764 5999 8899999999999 999999999999998763
No 3
>KOG1219|consensus
Probab=99.22 E-value=2.5e-11 Score=128.34 Aligned_cols=110 Identities=24% Similarity=0.605 Sum_probs=95.1
Q ss_pred cCCCCCCCCCCCCEEeeC-CCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCcc
Q psy9427 77 IPACDSGACGTNAVCIPE-DHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVG 155 (324)
Q Consensus 77 i~~C~~~~C~~~~~C~~~-~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g 155 (324)
.++|..+||+++++|..+ .++|.|.|++-|.| +.|+..++.| ..+||..+++|+. ..+.|.|.|+.||+|
T Consensus 3864 ~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG-----~~CEi~~epC-~snPC~~GgtCip---~~n~f~CnC~~gyTG 3934 (4289)
T KOG1219|consen 3864 TDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSG-----NHCEIDLEPC-ASNPCLTGGTCIP---FYNGFLCNCPNGYTG 3934 (4289)
T ss_pred ccccccCcccCCCEecCCCCCceEEeCcccccC-----cccccccccc-cCCCCCCCCEEEe---cCCCeeEeCCCCccC
Confidence 378888999999999966 46799999999999 9999766999 9999999999999 899999999999999
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCcc-CCCCCCeeeeCCCCceeeCCCCCccCC
Q psy9427 156 DPYKTGCVPQGQCPRGDADCPPSSRCSGGQCVSVCQG-ACGPNALCRAVGKRPVCACPDGFSAVG 219 (324)
Q Consensus 156 ~~~~~~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~-~C~~~~~C~~~~g~y~C~C~~Gy~g~~ 219 (324)
..++. +.++|| .. +|+++|.|+|.+|+|.|.|-+||.|+.
T Consensus 3935 ~~Ce~--~Gi~eC----------------------s~n~C~~gg~C~n~~gsf~CncT~g~~gr~ 3975 (4289)
T KOG1219|consen 3935 KRCEA--RGISEC----------------------SKNVCGTGGQCINIPGSFHCNCTPGILGRT 3975 (4289)
T ss_pred ceeec--cccccc----------------------ccccccCCceeeccCCceEeccChhHhccc
Confidence 98541 124555 22 699999999999999999999999844
No 4
>KOG1214|consensus
Probab=99.22 E-value=4.2e-11 Score=117.67 Aligned_cols=133 Identities=27% Similarity=0.635 Sum_probs=97.0
Q ss_pred eeecCCcccccC--CCC--CCCCCCCEeeeCCCceeeeeCCCeeec-CCCCCCcccCCCCCCcCCCCCceeCCCCcccCC
Q psy9427 5 FVFYLTLTCWNS--CDG--VSCGVNAVCVVNNHEARCFCPPGSYVG-DPNIECRQVPCVHNHDCSTSHACDLLSHTCIPA 79 (324)
Q Consensus 5 ~~~~~~~~c~~~--C~~--~~C~~~~~C~~~~~~~~C~C~~G~y~g-~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~ 79 (324)
|.+++ +.|+++ |.. ..|++++.|++.+|+|.|.|..| |.- ++...|..+.= +.. +++
T Consensus 724 ~~gdg-r~c~d~~eca~~~~~CGp~s~Cin~pg~~rceC~~g-y~F~dd~~tCV~i~~------pap----------~n~ 785 (1289)
T KOG1214|consen 724 YQGDG-RNCVDENECATGFHRCGPNSVCINLPGSYRCECRSG-YEFADDRHTCVLITP------PAP----------ANP 785 (1289)
T ss_pred cCCCC-CCCCChhhhccCCCCCCCCceeecCCCceeEEEeec-ceeccCCcceEEecC------CCC----------CCc
Confidence 33443 556554 654 46999999999999999999999 653 33344544200 000 444
Q ss_pred CCC--CCCCCCC--EEeeC-CCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCc
Q psy9427 80 CDS--GACGTNA--VCIPE-DHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSV 154 (324)
Q Consensus 80 C~~--~~C~~~~--~C~~~-~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~ 154 (324)
|.. +.|..++ .|+.. .+.|.|.|.|||.|++ ..|.++ |+| .++.|++.+.|.+ ++|+|.|+|.+||.
T Consensus 786 Ce~g~h~C~i~g~a~c~~hGgs~y~C~CLPGfsGDG---~~c~dv-DeC-~psrChp~A~Cyn---tpgsfsC~C~pGy~ 857 (1289)
T KOG1214|consen 786 CEDGSHTCAIAGQARCVHHGGSTYSCACLPGFSGDG---HQCTDV-DEC-SPSRCHPAATCYN---TPGSFSCRCQPGYY 857 (1289)
T ss_pred cccCccccCcCCceEEEecCCceEEEeecCCccCCc---cccccc-ccc-CccccCCCceEec---CCCcceeecccCcc
Confidence 542 3455444 44433 3569999999999998 789998 999 8899999999999 99999999999999
Q ss_pred cCCCCCCCCCC
Q psy9427 155 GDPYKTGCVPQ 165 (324)
Q Consensus 155 g~~~~~~C~~~ 165 (324)
|++.+ |.+.
T Consensus 858 GDGf~--CVP~ 866 (1289)
T KOG1214|consen 858 GDGFQ--CVPD 866 (1289)
T ss_pred CCCce--ecCC
Confidence 99854 7654
No 5
>KOG1217|consensus
Probab=99.02 E-value=9.9e-09 Score=99.65 Aligned_cols=190 Identities=26% Similarity=0.645 Sum_probs=124.8
Q ss_pred cCCC--CCCCCCCCEeeeCCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCC--CCCCCC-C
Q psy9427 15 NSCD--GVSCGVNAVCVVNNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDS--GACGTN-A 89 (324)
Q Consensus 15 ~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~--~~C~~~-~ 89 (324)
+.|. ..+|.+++.|.+..++|.|.|++| |.+.....- .+...|...+.|.+..+.=.+.|.. ..+... +
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~-~~~~~~~~~-----~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~ 243 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPG-YTGSTCETT-----GNGGTCVDSVACSCPPGARGPECEVSIVECASGDG 243 (487)
T ss_pred cccccCCCCcCCCcccccCCCCeeEeCCCC-ccCCcCcCC-----CCCceEecceeccCCCCCCCCCcccccccccCCCC
Confidence 5787 346999999999999999999999 887533211 1122222222222221100111211 122222 8
Q ss_pred EEeeCCCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCC
Q psy9427 90 VCIPEDHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCP 169 (324)
Q Consensus 90 ~C~~~~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~ 169 (324)
+|++..++|.|.+++||++... ..+.++ ++|+...+|..+++|++ ..+.|.|.|++||+|... ..+.+..+|.
T Consensus 244 ~c~~~~~~~~C~~~~g~~~~~~--~~~~~~-~~C~~~~~c~~~~~C~~---~~~~~~C~C~~g~~g~~~-~~~~~~~~C~ 316 (487)
T KOG1217|consen 244 TCVNTVGSYTCRCPEGYTGDAC--VTCVDV-DSCALIASCPNGGTCVN---VPGSYRCTCPPGFTGRLC-TECVDVDECS 316 (487)
T ss_pred cccccCCceeeeCCCCcccccc--ceeeec-cccCCCCccCCCCeeec---CCCcceeeCCCCCCCCCC-cccccccccc
Confidence 8999999999999999999541 245666 88844324888999999 888899999999999886 3455666663
Q ss_pred C--CCCCCCCCCcccC-----------------Ccc--c-CCCcc-CCCCCCeeee-CCCCceeeCCCCCcc
Q psy9427 170 R--GDADCPPSSRCSG-----------------GQC--V-SVCQG-ACGPNALCRA-VGKRPVCACPDGFSA 217 (324)
Q Consensus 170 ~--~~~~C~~~~~C~~-----------------~~C--~-d~C~~-~C~~~~~C~~-~~g~y~C~C~~Gy~g 217 (324)
. ....|.....|.. .+| . ++|.. .+..++.|++ ..++|.|.|+.+|.+
T Consensus 317 ~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~~~~~~~c~~~~~~~~ 388 (487)
T KOG1217|consen 317 PRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNETPGSYRCACPAGFAG 388 (487)
T ss_pred ccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccCCCCCeEecCCCcccc
Confidence 2 2333444334411 234 2 36777 6888999999 789999999999986
No 6
>KOG1217|consensus
Probab=98.99 E-value=2.7e-08 Score=96.57 Aligned_cols=178 Identities=28% Similarity=0.626 Sum_probs=120.7
Q ss_pred CCCCCCCCCCCEeeeCCCceeeeeCCCeeecCCCCC---CcccC--CCCCCcCCC------CCceeCCCCcc-------c
Q psy9427 16 SCDGVSCGVNAVCVVNNHEARCFCPPGSYVGDPNIE---CRQVP--CVHNHDCST------SHACDLLSHTC-------I 77 (324)
Q Consensus 16 ~C~~~~C~~~~~C~~~~~~~~C~C~~G~y~g~~~~~---C~~~~--C~~~~~C~~------~~~C~c~~~~C-------i 77 (324)
.+...+...+..+....++|.|.|++| |.+..... |...+ +...+.|.. .+.|.|..+.. .
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~c~c~~g-~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~ 169 (487)
T KOG1217|consen 91 PCRSPCLLLCGECVDCVGSYECTCPPG-YQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDL 169 (487)
T ss_pred cccCCcccCCccccCCCCCceeeCCCc-cccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccc
Confidence 344434445556667788999999999 88865443 66654 345556642 56677776633 2
Q ss_pred CCCC--CCCCCCCCEEeeCCCCcEeeCCCCCcCCCCCCC----ccccc----------CCCCCCC-CCCCCC-CeeecCC
Q psy9427 78 PACD--SGACGTNAVCIPEDHRPVCHCPVGTHPSPSPDV----HCARE----------SNLCRDS-NPCHPT-ARCEDTG 139 (324)
Q Consensus 78 ~~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~----~C~~~----------~~~C~~~-~~C~~~-~~C~~~~ 139 (324)
++|. ..+|.+++.|.+..++|.|.|++||.+...... .|... ...|... ..+... ++|.+
T Consensus 170 ~~C~~~~~~c~~~~~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~-- 247 (487)
T KOG1217|consen 170 DECIQYSSPCQNGGTCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVN-- 247 (487)
T ss_pred cccccCCCCcCCCcccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccc--
Confidence 5776 345888899999999999999999998542101 12110 0222111 123323 88999
Q ss_pred CCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCccC
Q psy9427 140 ATNGKVVCSCPELSVGDPYKTGCVPQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSAV 218 (324)
Q Consensus 140 ~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g~ 218 (324)
+.++|+|.+++||++... ..+.++++|.... .|.++++|++..+.|.|.|++||.|.
T Consensus 248 -~~~~~~C~~~~g~~~~~~-~~~~~~~~C~~~~--------------------~c~~~~~C~~~~~~~~C~C~~g~~g~ 304 (487)
T KOG1217|consen 248 -TVGSYTCRCPEGYTGDAC-VTCVDVDSCALIA--------------------SCPNGGTCVNVPGSYRCTCPPGFTGR 304 (487)
T ss_pred -cCCceeeeCCCCcccccc-ceeeeccccCCCC--------------------ccCCCCeeecCCCcceeeCCCCCCCC
Confidence 889999999999998872 2366777774332 15578999999999999999999983
No 7
>KOG4289|consensus
Probab=98.72 E-value=1.3e-08 Score=105.09 Aligned_cols=104 Identities=33% Similarity=0.742 Sum_probs=85.0
Q ss_pred CCCCCCCCCCCEeee----------------------CCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCC
Q psy9427 16 SCDGVSCGVNAVCVV----------------------NNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLS 73 (324)
Q Consensus 16 ~C~~~~C~~~~~C~~----------------------~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~ 73 (324)
.|...||.+-++|+. ..+.++|.||+| |+|+. |...
T Consensus 1181 iClrEPCenymkCvsvlrFdssapf~~s~s~lfRpi~pvnglrCrCPpG-FTgd~---CeTe------------------ 1238 (2531)
T KOG4289|consen 1181 ICLREPCENYMKCVSVLRFDSSAPFLASDSVLFRPIHPVNGLRCRCPPG-FTGDY---CETE------------------ 1238 (2531)
T ss_pred hhhcchhHHHHhhhhheeecccCccccccceeeeeccccCceeEeCCCC-CCccc---ccch------------------
Confidence 477778887777742 246788999999 99963 3332
Q ss_pred CcccCCCCCCCCCCCCEEeeCCCCcEeeCCCCCcCCCCCCCccccc--CCCCCCCCCCCCCCeeecCCCCCCCeeeeCCC
Q psy9427 74 HTCIPACDSGACGTNAVCIPEDHRPVCHCPVGTHPSPSPDVHCARE--SNLCRDSNPCHPTARCEDTGATNGKVVCSCPE 151 (324)
Q Consensus 74 ~~Ci~~C~~~~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~C~~~--~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~ 151 (324)
||+|-.+||+++++|....++|.|.|++||+| ..|+.. ..-| .+..|.++++|++. ..|.|.|.|+.
T Consensus 1239 ---iDlCYs~pC~nng~C~srEggYtCeCrpg~tG-----ehCEvs~~agrC-vpGvC~nggtC~~~--~nggf~c~Cp~ 1307 (2531)
T KOG4289|consen 1239 ---IDLCYSGPCGNNGRCRSREGGYTCECRPGFTG-----EHCEVSARAGRC-VPGVCKNGGTCVNL--LNGGFCCHCPY 1307 (2531)
T ss_pred ---hHhhhcCCCCCCCceEEecCceeEEecCCccc-----cceeeecccCcc-ccceecCCCEEeec--CCCceeccCCC
Confidence 89999999999999999999999999999999 888753 1456 66789999999984 67889999998
Q ss_pred C
Q psy9427 152 L 152 (324)
Q Consensus 152 G 152 (324)
|
T Consensus 1308 g 1308 (2531)
T KOG4289|consen 1308 G 1308 (2531)
T ss_pred c
Confidence 8
No 8
>KOG4260|consensus
Probab=98.55 E-value=5.2e-08 Score=85.14 Aligned_cols=152 Identities=22% Similarity=0.458 Sum_probs=95.6
Q ss_pred CCCCCCCEeee---CCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCCCCCCCCCEEeeCCC
Q psy9427 20 VSCGVNAVCVV---NNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDSGACGTNAVCIPEDH 96 (324)
Q Consensus 20 ~~C~~~~~C~~---~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~~~C~~~~~C~~~~~ 96 (324)
.||..+++|.. ..|+-.|.|.+| |+|.....|-..--.. ..=.....|. .|.. +|. +.|.....
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~G-Y~Gp~C~~Cg~eyfes-~Rne~~lvCt----~Ch~-----~C~--~~Csg~~~ 216 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETG-YTGPLCRYCGIEYFES-SRNEQHLVCT----ACHE-----GCL--GVCSGESS 216 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCC-CCCccccccchHHHHh-hcccccchhh----hhhh-----hhh--cccCCCCC
Confidence 57888899974 367779999999 9997554333200000 0000000000 1111 221 23433222
Q ss_pred CcEeeCCCCCcCCCCCCCcccccCCCCCC-CCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCC
Q psy9427 97 RPVCHCPVGTHPSPSPDVHCARESNLCRD-SNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPRGDADC 175 (324)
Q Consensus 97 ~~~C~C~~G~~g~~~~~~~C~~~~~~C~~-~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~~~~C 175 (324)
+---.|+.||..+. ..|.|+ |||.. +.+|..+..|+| +.|+|+|...+||.+. +++|..-...|
T Consensus 217 k~C~kCkkGW~lde---~gCvDv-nEC~~ep~~c~~~qfCvN---teGSf~C~dk~Gy~~g--------~d~C~~~~d~~ 281 (350)
T KOG4260|consen 217 KGCSKCKKGWKLDE---EGCVDV-NECQNEPAPCKAHQFCVN---TEGSFKCEDKEGYKKG--------VDECQFCADVC 281 (350)
T ss_pred CChhhhcccceecc---cccccH-HHHhcCCCCCChhheeec---CCCceEecccccccCC--------hHHhhhhhhhc
Confidence 22236899999886 789999 99954 458999999999 9999999999999863 33442211111
Q ss_pred CCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 176 PPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 176 ~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
. ..+..|.++.++|+|.|++|+.-
T Consensus 282 ~------------------~kn~~c~ni~~~~r~v~f~~~~~ 305 (350)
T KOG4260|consen 282 A------------------SKNRPCMNIDGQYRCVCFSGLII 305 (350)
T ss_pred c------------------cCCCCcccCCccEEEEeccccee
Confidence 1 24567999999999999999863
No 9
>KOG4289|consensus
Probab=98.54 E-value=1.2e-07 Score=98.12 Aligned_cols=92 Identities=29% Similarity=0.673 Sum_probs=72.7
Q ss_pred eeCCCCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCCC
Q psy9427 92 IPEDHRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPRG 171 (324)
Q Consensus 92 ~~~~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~ 171 (324)
++..++++|.|++||+| ..|+..+|+| -..||.+++.|.. ..|.|+|.|.+||+|+.++. .
T Consensus 1216 i~pvnglrCrCPpGFTg-----d~CeTeiDlC-Ys~pC~nng~C~s---rEggYtCeCrpg~tGehCEv----------s 1276 (2531)
T KOG4289|consen 1216 IHPVNGLRCRCPPGFTG-----DYCETEIDLC-YSGPCGNNGRCRS---REGGYTCECRPGFTGEHCEV----------S 1276 (2531)
T ss_pred ccccCceeEeCCCCCCc-----ccccchhHhh-hcCCCCCCCceEE---ecCceeEEecCCccccceee----------e
Confidence 45678899999999999 7887655999 8889999999999 89999999999999988431 1
Q ss_pred CCCCCCCCcccCCcccCCCccCCCCCCeeeeC-CCCceeeCCCC
Q psy9427 172 DADCPPSSRCSGGQCVSVCQGACGPNALCRAV-GKRPVCACPDG 214 (324)
Q Consensus 172 ~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~-~g~y~C~C~~G 214 (324)
.+ ..+|+++ .|.++++|++. .|+|.|.||.|
T Consensus 1277 ~~----agrCvpG--------vC~nggtC~~~~nggf~c~Cp~g 1308 (2531)
T KOG4289|consen 1277 AR----AGRCVPG--------VCKNGGTCVNLLNGGFCCHCPYG 1308 (2531)
T ss_pred cc----cCccccc--------eecCCCEEeecCCCceeccCCCc
Confidence 10 1222222 46689999955 67899999998
No 10
>KOG1225|consensus
Probab=98.20 E-value=1.4e-05 Score=77.80 Aligned_cols=20 Identities=25% Similarity=0.815 Sum_probs=14.2
Q ss_pred CCCCCeeeeCCCCceeeCCCCCccC
Q psy9427 194 CGPNALCRAVGKRPVCACPDGFSAV 218 (324)
Q Consensus 194 C~~~~~C~~~~g~y~C~C~~Gy~g~ 218 (324)
|.+++.|++ | |.|..||+|.
T Consensus 345 C~~~g~cv~--g---C~C~~Gw~G~ 364 (525)
T KOG1225|consen 345 CSGGGQCVN--G---CKCKKGWRGP 364 (525)
T ss_pred cCCCceecc--C---ceeccCccCC
Confidence 556666765 2 8889999873
No 11
>KOG1225|consensus
Probab=98.07 E-value=4.2e-05 Score=74.50 Aligned_cols=149 Identities=30% Similarity=0.704 Sum_probs=89.8
Q ss_pred cEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCCCCCCCCC
Q psy9427 98 PVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPRGDADCPP 177 (324)
Q Consensus 98 ~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~~~~~C~~ 177 (324)
++|.|+.+|.+ ..|+. -.| +..|..++.|+. -+|.|++||+|+.+.. ..|+
T Consensus 234 ~ic~c~~~~~g-----~~c~~--~~C--~~~c~~~g~c~~-------G~CIC~~Gf~G~dC~e------------~~Cp- 284 (525)
T KOG1225|consen 234 GICECPEGYFG-----PLCST--IYC--PGGCTGRGQCVE-------GRCICPPGFTGDDCDE------------LVCP- 284 (525)
T ss_pred ceeecCCceeC-----Ccccc--ccC--CCCCcccceEeC-------CeEeCCCCCcCCCCCc------------ccCC-
Confidence 47899999998 55653 234 234555677866 3899999999988431 1222
Q ss_pred CCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCCccccCCCCCCCCccCCCCCcCCCCcccCCCCC
Q psy9427 178 SSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSAVGGEARVGCFREMSACAGDQECLAGEEKCIGGRCRVACLS 257 (324)
Q Consensus 178 ~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~~~~~C~~~~~~C~~~~~C~~~~~~C~~~~C~~~C~~ 257 (324)
..|+.++.+++. +|.|++||+|....... | + ++|. ..-.|+.|+|+ |..
T Consensus 285 --------------~~cs~~g~~~~g----~CiC~~g~~G~dCs~~~-c---p------adC~-g~G~Ci~G~C~--C~~ 333 (525)
T KOG1225|consen 285 --------------VDCSGGGVCVDG----ECICNPGYSGKDCSIRR-C---P------ADCS-GHGKCIDGECL--CDE 333 (525)
T ss_pred --------------cccCCCceecCC----EeecCCCcccccccccc-C---C------ccCC-CCCcccCCceE--eCC
Confidence 135556666553 69999999984333222 3 3 3444 22245555442 111
Q ss_pred CCCCCCCCcccCCcccCCCCCCCCCCCCCcCCCCcc-----ccccccCCCCCCCCcccCCc
Q psy9427 258 HSDCSEGERCISSLCQSPCTSHSQCPTGQACQAGTC-----ELGCRQNEECSGQMICAGSK 313 (324)
Q Consensus 258 ~~~C~~~~~C~~~~C~~~C~~~~~C~~~~~C~~~~c-----~~~C~~~~~C~~~~~C~~~~ 313 (324)
+ +.|..|... +|.++++|..+..|+.|+- .+.+.-..+|+....++...
T Consensus 334 G---y~G~~C~~~----~C~~~g~cv~gC~C~~Gw~G~d~~~~~~~~~~~cs~~~~~~~~~ 387 (525)
T KOG1225|consen 334 G---YTGELCIQR----ACSGGGQCVNGCKCKKGWRGPDVADPSLLLITECSPPSLCIAGV 387 (525)
T ss_pred C---CcCCccccc----ccCCCceeccCceeccCccCCCcCCchhhcccccCCCceeeccc
Confidence 1 344555554 3777788877777777764 23466677777777776654
No 12
>KOG4260|consensus
Probab=97.96 E-value=1.2e-05 Score=70.63 Aligned_cols=105 Identities=22% Similarity=0.469 Sum_probs=69.7
Q ss_pred CCCCCCCEEe---eCCCCcEeeCCCCCcCCCCCCCccccc--------CCCCC-CCCCCCC--CCeeecCCCCCCCeee-
Q psy9427 83 GACGTNAVCI---PEDHRPVCHCPVGTHPSPSPDVHCARE--------SNLCR-DSNPCHP--TARCEDTGATNGKVVC- 147 (324)
Q Consensus 83 ~~C~~~~~C~---~~~~~~~C~C~~G~~g~~~~~~~C~~~--------~~~C~-~~~~C~~--~~~C~~~~~~~g~~~C- 147 (324)
.+|.-++.|. ...|+..|.|.+||.| ..|..- .++=. .-..|+. .+.|. ..++-.|
T Consensus 150 r~C~GnG~C~GdGsR~GsGkCkC~~GY~G-----p~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Cs----g~~~k~C~ 220 (350)
T KOG4260|consen 150 RPCFGNGSCHGDGSREGSGKCKCETGYTG-----PLCRYCGIEYFESSRNEQHLVCTACHEGCLGVCS----GESSKGCS 220 (350)
T ss_pred CCcCCCCcccCCCCCCCCCcccccCCCCC-----ccccccchHHHHhhcccccchhhhhhhhhhcccC----CCCCCChh
Confidence 3455566665 3457789999999999 444321 01100 0012321 12443 2444556
Q ss_pred eCCCCCccCCCCCCCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 148 SCPELSVGDPYKTGCVPQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 148 ~C~~G~~g~~~~~~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
+|..||..+. ..|.|++||..... +|..+..|+|+.|+|+|..++||.+
T Consensus 221 kCkkGW~lde--~gCvDvnEC~~ep~-------------------~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 221 KCKKGWKLDE--EGCVDVNECQNEPA-------------------PCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred hhcccceecc--cccccHHHHhcCCC-------------------CCChhheeecCCCceEecccccccC
Confidence 6999999986 56999999965543 5777889999999999999999985
No 13
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.92 E-value=3.4e-06 Score=70.25 Aligned_cols=145 Identities=26% Similarity=0.577 Sum_probs=91.4
Q ss_pred CCCCCCEeeeCCCceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCC-----CCCCCCCCEEeeCC
Q psy9427 21 SCGVNAVCVVNNHEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACD-----SGACGTNAVCIPED 95 (324)
Q Consensus 21 ~C~~~~~C~~~~~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~-----~~~C~~~~~C~~~~ 95 (324)
+|.+ |.-+.+...|.|.|.+| |.....+.|.. ..+|. ..+|+..|.|++..
T Consensus 7 ~CKN-G~LiQMSNHfEC~Cneg-fvl~~EntCE~----------------------kv~C~~~e~~~K~Cgdya~C~~~~ 62 (197)
T PF06247_consen 7 ICKN-GYLIQMSNHFECKCNEG-FVLKNENTCEE----------------------KVECDKLENVNKPCGDYAKCINQA 62 (197)
T ss_dssp --BT-EEEEEESSEEEEEESTT-EEEEETTEEEE--------------------------SG-GGTTSEEETTEEEEE-S
T ss_pred cccC-CEEEEccCceEEEcCCC-cEEcccccccc----------------------ceecCcccccCccccchhhhhcCC
Confidence 4654 57888889999999999 87655455665 23332 25788899999765
Q ss_pred -----CCcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCCCCCCCC
Q psy9427 96 -----HRPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVPQGQCPR 170 (324)
Q Consensus 96 -----~~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~~~eC~~ 170 (324)
..|.|.|.+||+... ..|.. +.| ....|+ .+.|+..+..+....|+|.-|++.+. ...|...++
T Consensus 63 ~~~~~~~~~C~C~~gY~~~~---~vCvp--~~C-~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~d-n~kCtk~G~--- 131 (197)
T PF06247_consen 63 NKGEERAYKCDCINGYILKQ---GVCVP--NKC-NNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDD-NKKCTKTGE--- 131 (197)
T ss_dssp STTSSTSEEEEE-TTEEESS---SSEEE--GGG-SS---T-TEEEEEEEGGGSEEEEEE-TEEETTT-TTESEEEE----
T ss_pred CcccceeEEEecccCceeeC---CeEch--hhc-CceecC-CCeEEecCCCCCCceeEeeeceEecc-CCcccCCCc---
Confidence 469999999999865 56765 567 556786 78998521133446999999998333 234543221
Q ss_pred CCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 171 GDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 171 ~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
.+|...|..+.+|..+.+-|+|.+.+||.+
T Consensus 132 -----------------T~C~LKCk~nE~CK~~~~~Y~C~~~~~~~~ 161 (197)
T PF06247_consen 132 -----------------TKCSLKCKENEECKLVDGYYKCVCKEGFPG 161 (197)
T ss_dssp ------------------------TTTEEEEEETTEEEEEE-TT-EE
T ss_pred -----------------cceeeecCCCcceeeeCcEEEeecCCCCCC
Confidence 122235778899999999999999999987
No 14
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.89 E-value=1.1e-05 Score=48.22 Aligned_cols=30 Identities=33% Similarity=0.875 Sum_probs=27.1
Q ss_pred CCCCCCCCCCEeeeCC-CceeeeeCCCeeecC
Q psy9427 17 CDGVSCGVNAVCVVNN-HEARCFCPPGSYVGD 47 (324)
Q Consensus 17 C~~~~C~~~~~C~~~~-~~~~C~C~~G~y~g~ 47 (324)
|..+||.++|+|+... ++|+|.|++| |+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G-~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPG-YTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTT-EEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCC-CccC
Confidence 5677999999999888 9999999999 9884
No 15
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.89 E-value=8.9e-06 Score=51.94 Aligned_cols=25 Identities=28% Similarity=0.757 Sum_probs=22.8
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 193 ACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 193 ~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
.|..+++|+|+.|+|+|.|++||+.
T Consensus 11 ~C~~~~~C~N~~Gsy~C~C~~Gy~~ 35 (42)
T PF07645_consen 11 NCPENGTCVNTEGSYSCSCPPGYEL 35 (42)
T ss_dssp SSSTTSEEEEETTEEEEEESTTEEE
T ss_pred cCCCCCEEEcCCCCEEeeCCCCcEE
Confidence 5777899999999999999999983
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.73 E-value=2.3e-05 Score=50.03 Aligned_cols=34 Identities=26% Similarity=0.621 Sum_probs=28.9
Q ss_pred CCCCCCC-CCCCCCeeecCCCCCCCeeeeCCCCCccCC
Q psy9427 121 NLCRDSN-PCHPTARCEDTGATNGKVVCSCPELSVGDP 157 (324)
Q Consensus 121 ~~C~~~~-~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~ 157 (324)
|||+... .|..++.|+| +.|+|+|.|++||....
T Consensus 3 dEC~~~~~~C~~~~~C~N---~~Gsy~C~C~~Gy~~~~ 37 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVN---TEGSYSCSCPPGYELND 37 (42)
T ss_dssp STTTTTSSSSSTTSEEEE---ETTEEEEEESTTEEECT
T ss_pred cccCCCCCcCCCCCEEEc---CCCCEEeeCCCCcEECC
Confidence 8995533 7988999999 99999999999998443
No 17
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.49 E-value=5.5e-05 Score=45.20 Aligned_cols=29 Identities=31% Similarity=0.742 Sum_probs=26.1
Q ss_pred CCCCCCCCCCEEeeCC-CCcEeeCCCCCcC
Q psy9427 80 CDSGACGTNAVCIPED-HRPVCHCPVGTHP 108 (324)
Q Consensus 80 C~~~~C~~~~~C~~~~-~~~~C~C~~G~~g 108 (324)
|.+++|.++++|+... ++|.|.|++||+|
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEES
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCcc
Confidence 4457899999999888 9999999999998
No 18
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.40 E-value=0.00024 Score=44.02 Aligned_cols=32 Identities=28% Similarity=0.680 Sum_probs=28.4
Q ss_pred cCCCCC-CCCCCCCEEeeCCCCcEeeCCCCCc-C
Q psy9427 77 IPACDS-GACGTNAVCIPEDHRPVCHCPVGTH-P 108 (324)
Q Consensus 77 i~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~-g 108 (324)
+++|.. .+|.++++|+++.++|.|.|++||+ |
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g 35 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDG 35 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccC
Confidence 567776 6898889999999999999999999 6
No 19
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.40 E-value=0.00017 Score=39.85 Aligned_cols=24 Identities=25% Similarity=0.450 Sum_probs=21.3
Q ss_pred CeeeeCCCCCccCCCCCCCCCCCC
Q psy9427 144 KVVCSCPELSVGDPYKTGCVPQGQ 167 (324)
Q Consensus 144 ~~~C~C~~G~~g~~~~~~C~~~~e 167 (324)
+|+|+|++||+....+..|.||+|
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCCC
Confidence 589999999998877788999986
No 20
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.38 E-value=0.00021 Score=43.89 Aligned_cols=28 Identities=39% Similarity=0.814 Sum_probs=22.7
Q ss_pred CCCCCCEeeeCCCceeeeeCCCeeecCCC
Q psy9427 21 SCGVNAVCVVNNHEARCFCPPGSYVGDPN 49 (324)
Q Consensus 21 ~C~~~~~C~~~~~~~~C~C~~G~y~g~~~ 49 (324)
.|..+|+|+++.++|.|+|++| |.|+..
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~G-y~GdG~ 34 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPG-YEGDGF 34 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CE-EECCST
T ss_pred CCCCCcEeecCCCCEEeECCCC-CccCCc
Confidence 5889999999999999999999 999754
No 21
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.25 E-value=0.00043 Score=42.85 Aligned_cols=31 Identities=35% Similarity=0.866 Sum_probs=27.5
Q ss_pred ccCCCC-CCCCCCCEeeeCCCceeeeeCCCeee
Q psy9427 14 WNSCDG-VSCGVNAVCVVNNHEARCFCPPGSYV 45 (324)
Q Consensus 14 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~ 45 (324)
+++|.. .+|.++++|++..++|.|.|++| |+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g-~~ 33 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPG-YT 33 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCC-Cc
Confidence 466776 78999999999999999999999 87
No 22
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.20 E-value=0.00024 Score=43.62 Aligned_cols=28 Identities=29% Similarity=0.613 Sum_probs=23.0
Q ss_pred CCCCCCCEEeeCCCCcEeeCCCCCcCCC
Q psy9427 83 GACGTNAVCIPEDHRPVCHCPVGTHPSP 110 (324)
Q Consensus 83 ~~C~~~~~C~~~~~~~~C~C~~G~~g~~ 110 (324)
..|..+|+|+++.++|.|+|++||.|++
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCCcEeecCCCCEEeECCCCCccCC
Confidence 4688899999999999999999999987
No 23
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.16 E-value=0.00038 Score=38.47 Aligned_cols=24 Identities=29% Similarity=0.602 Sum_probs=20.4
Q ss_pred CcEeeCCCCCcCCCCCCCcccccCCC
Q psy9427 97 RPVCHCPVGTHPSPSPDVHCARESNL 122 (324)
Q Consensus 97 ~~~C~C~~G~~g~~~~~~~C~~~~~~ 122 (324)
+|.|.|++||+.++.. +.|++| ||
T Consensus 1 sy~C~C~~Gy~l~~d~-~~C~DI-dE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDG-RSCEDI-DE 24 (24)
T ss_pred CEEeeCCCCCcCCCCC-CccccC-CC
Confidence 6899999999987765 899998 75
No 24
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.95 E-value=0.0013 Score=40.11 Aligned_cols=32 Identities=28% Similarity=0.674 Sum_probs=28.0
Q ss_pred cCCCCC-CCCCCCCEEeeCCCCcEeeCCCCCcC
Q psy9427 77 IPACDS-GACGTNAVCIPEDHRPVCHCPVGTHP 108 (324)
Q Consensus 77 i~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g 108 (324)
+++|.. .+|.+++.|++..+.|.|.|++||+|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcC
Confidence 466765 68888899999999999999999998
No 25
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.83 E-value=0.0015 Score=39.89 Aligned_cols=33 Identities=36% Similarity=0.888 Sum_probs=28.3
Q ss_pred ccCCCC-CCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427 14 WNSCDG-VSCGVNAVCVVNNHEARCFCPPGSYVGD 47 (324)
Q Consensus 14 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~g~ 47 (324)
+++|.. .+|.+++.|++..+.|+|.|++| |.|.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g-~~g~ 35 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPG-YTGR 35 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCC-CcCC
Confidence 456766 68988899999999999999999 8873
No 26
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.67 E-value=0.0025 Score=38.21 Aligned_cols=28 Identities=43% Similarity=1.032 Sum_probs=25.2
Q ss_pred CCCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427 19 GVSCGVNAVCVVNNHEARCFCPPGSYVGD 47 (324)
Q Consensus 19 ~~~C~~~~~C~~~~~~~~C~C~~G~y~g~ 47 (324)
..+|.++++|++..+.|.|.|++| |.+.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g-~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPG-YTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCC-Cccc
Confidence 567988999999999999999999 8875
No 27
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.48 E-value=0.0048 Score=36.90 Aligned_cols=28 Identities=25% Similarity=0.625 Sum_probs=25.0
Q ss_pred CCCCCCCCEEeeCCCCcEeeCCCCCcCC
Q psy9427 82 SGACGTNAVCIPEDHRPVCHCPVGTHPS 109 (324)
Q Consensus 82 ~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 109 (324)
..+|.++++|++..+.|.|.|+.||.|+
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4678888999999999999999999983
No 28
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.20 E-value=0.0061 Score=36.68 Aligned_cols=29 Identities=38% Similarity=1.014 Sum_probs=24.8
Q ss_pred CCC-CCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427 17 CDG-VSCGVNAVCVVNNHEARCFCPPGSYVGD 47 (324)
Q Consensus 17 C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~g~ 47 (324)
|.. .+|.++ +|++..++|+|+|++| |.++
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g-~~g~ 31 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPG-YTGD 31 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCC-CccC
Confidence 445 578888 9999999999999999 8873
No 29
>KOG1226|consensus
Probab=96.17 E-value=0.041 Score=55.48 Aligned_cols=142 Identities=26% Similarity=0.638 Sum_probs=78.0
Q ss_pred EeeCCCCCcCCCCCCCccccc---------CCCCCCCC---CCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCCCCC-C
Q psy9427 99 VCHCPVGTHPSPSPDVHCARE---------SNLCRDSN---PCHPTARCEDTGATNGKVVCSCPELSVGDPYKTGCVP-Q 165 (324)
Q Consensus 99 ~C~C~~G~~g~~~~~~~C~~~---------~~~C~~~~---~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~C~~-~ 165 (324)
.|.|.+||.| ..|+-. .+.|+..+ +|...+.|.= -+|+|.+...+.-++..|.- .
T Consensus 479 ~C~C~~G~~G-----~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~C-------GqC~C~~~~~~~i~G~fCECDn 546 (783)
T KOG1226|consen 479 QCRCDEGWLG-----KKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCVC-------GQCVCHKPDNGKIYGKFCECDN 546 (783)
T ss_pred ceecCCCCCC-----CcccCCccccCcHhHHhhccCCCCCCCcCCCCcEeC-------CceEecCCCCCceeeeeeeccC
Confidence 5799999999 444321 13443222 5777777754 36788777664333333321 1
Q ss_pred CCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCccCCCCCCCCccccCCCCCC--CCccCCCC
Q psy9427 166 GQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSAVGGEARVGCFREMSACAG--DQECLAGE 243 (324)
Q Consensus 166 ~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g~~~~~~~~C~~~~~~C~~--~~~C~~~~ 243 (324)
-.|....+ ..|+.+++|.=. +|+|.+||+|..... ....+.|.+ +..|+ +.
T Consensus 547 fsC~r~~g------------------~lC~g~G~C~CG----~CvC~~GwtG~~C~C----~~std~C~~~~G~iCS-Gr 599 (783)
T KOG1226|consen 547 FSCERHKG------------------VLCGGHGRCECG----RCVCNPGWTGSACNC----PLSTDTCESSDGQICS-GR 599 (783)
T ss_pred cccccccC------------------cccCCCCeEeCC----cEEcCCCCccCCCCC----CCCCccccCCCCceeC-CC
Confidence 11211111 157778888653 599999999854321 112444532 35666 33
Q ss_pred CcCCCCcccCCCCCCCCCCCCCcccC-CcccCCCCCCCCCC
Q psy9427 244 EKCIGGRCRVACLSHSDCSEGERCIS-SLCQSPCTSHSQCP 283 (324)
Q Consensus 244 ~~C~~~~C~~~C~~~~~C~~~~~C~~-~~C~~~C~~~~~C~ 283 (324)
..|+=|+|+ |.+.. ..|..|.. ..|..+|..+-.|.
T Consensus 600 G~C~Cg~C~--C~~~~--~sG~~CE~cptc~~~C~~~~~Cv 636 (783)
T KOG1226|consen 600 GTCECGRCK--CTDPP--YSGEFCEKCPTCPDPCAENKSCV 636 (783)
T ss_pred ceeeCCceE--cCCCC--cCcchhhcCCCCCCcccccccch
Confidence 356666553 33332 45666644 56666777766665
No 30
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.06 E-value=0.0092 Score=35.88 Aligned_cols=28 Identities=36% Similarity=0.876 Sum_probs=24.0
Q ss_pred CCC-CCCCCCCEEeeCCCCcEeeCCCCCcC
Q psy9427 80 CDS-GACGTNAVCIPEDHRPVCHCPVGTHP 108 (324)
Q Consensus 80 C~~-~~C~~~~~C~~~~~~~~C~C~~G~~g 108 (324)
|.. .+|.++ +|+++.++|.|.|++||.|
T Consensus 2 C~~~~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 2 CASGGPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCcCCCCCC-EEECCCCCeEeECCCCCcc
Confidence 444 578777 9999999999999999998
No 31
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.28 E-value=0.013 Score=35.90 Aligned_cols=26 Identities=27% Similarity=0.390 Sum_probs=20.7
Q ss_pred CCeeecCCCCCCCeeeeCCCCCccCCCCC
Q psy9427 132 TARCEDTGATNGKVVCSCPELSVGDPYKT 160 (324)
Q Consensus 132 ~~~C~~~~~~~g~~~C~C~~G~~g~~~~~ 160 (324)
...|++ ++++|+|.|++||++....+
T Consensus 9 ~h~C~~---~~g~~~C~C~~Gy~L~~D~~ 34 (36)
T PF14670_consen 9 SHICVN---TPGSYRCSCPPGYKLAEDGR 34 (36)
T ss_dssp SSEEEE---ETTSEEEE-STTEEE-TTSS
T ss_pred CCCCcc---CCCceEeECCCCCEECcCCC
Confidence 578999 99999999999999877543
No 32
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=95.15 E-value=0.014 Score=35.68 Aligned_cols=23 Identities=22% Similarity=0.598 Sum_probs=19.1
Q ss_pred CEEeeCCCCcEeeCCCCCcCCCC
Q psy9427 89 AVCIPEDHRPVCHCPVGTHPSPS 111 (324)
Q Consensus 89 ~~C~~~~~~~~C~C~~G~~g~~~ 111 (324)
..|++++++|+|.|++||++.+.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~L~~D 32 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYKLAED 32 (36)
T ss_dssp SEEEEETTSEEEE-STTEEE-TT
T ss_pred CCCccCCCceEeECCCCCEECcC
Confidence 68999999999999999998654
No 33
>KOG1226|consensus
Probab=95.11 E-value=0.43 Score=48.41 Aligned_cols=117 Identities=23% Similarity=0.533 Sum_probs=63.6
Q ss_pred CCCCCCEEeeCCCCcEeeCCCCCcCCCCCCCcccccCCCCC--CCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCCCCC
Q psy9427 84 ACGTNAVCIPEDHRPVCHCPVGTHPSPSPDVHCARESNLCR--DSNPCHPTARCEDTGATNGKVVCSCPELSVGDPYKTG 161 (324)
Q Consensus 84 ~C~~~~~C~~~~~~~~C~C~~G~~g~~~~~~~C~~~~~~C~--~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~~~~ 161 (324)
+|...+.|.= ..|.|.+...+.-++ ..|+-+--.|. ....|..+++|.= -+|+|.+||+|..++
T Consensus 515 vCSgrG~C~C----GqC~C~~~~~~~i~G-~fCECDnfsC~r~~g~lC~g~G~C~C-------G~CvC~~GwtG~~C~-- 580 (783)
T KOG1226|consen 515 VCSGRGDCVC----GQCVCHKPDNGKIYG-KFCECDNFSCERHKGVLCGGHGRCEC-------GRCVCNPGWTGSACN-- 580 (783)
T ss_pred CcCCCCcEeC----CceEecCCCCCceee-eeeeccCcccccccCcccCCCCeEeC-------CcEEcCCCCccCCCC--
Confidence 5666666652 147777766631111 66765312231 1225888888865 389999999999875
Q ss_pred CC-CCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCC-CccCCCCCCCCccccCCCCCCCCcc
Q psy9427 162 CV-PQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDG-FSAVGGEARVGCFREMSACAGDQEC 239 (324)
Q Consensus 162 C~-~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~G-y~g~~~~~~~~C~~~~~~C~~~~~C 239 (324)
|. +.+.|...++. .|...++|.=. +|.|-+. |.|...+.-..| .+.|....+|
T Consensus 581 C~~std~C~~~~G~------------------iCSGrG~C~Cg----~C~C~~~~~sG~~CE~cptc---~~~C~~~~~C 635 (783)
T KOG1226|consen 581 CPLSTDTCESSDGQ------------------ICSGRGTCECG----RCKCTDPPYSGEFCEKCPTC---PDPCAENKSC 635 (783)
T ss_pred CCCCCccccCCCCc------------------eeCCCceeeCC----ceEcCCCCcCcchhhcCCCC---CCcccccccc
Confidence 43 33444322211 34445555432 4777666 876433333333 3445555544
No 34
>KOG0994|consensus
Probab=92.87 E-value=0.48 Score=50.11 Aligned_cols=63 Identities=22% Similarity=0.439 Sum_probs=36.4
Q ss_pred EEeeCCCCcEe-eCCCCCcCCCCCCCcccccCCCCCCCCCCCCC--------CeeecCCCCCCCeeeeCCCCCccCCCC
Q psy9427 90 VCIPEDHRPVC-HCPVGTHPSPSPDVHCARESNLCRDSNPCHPT--------ARCEDTGATNGKVVCSCPELSVGDPYK 159 (324)
Q Consensus 90 ~C~~~~~~~~C-~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~--------~~C~~~~~~~g~~~C~C~~G~~g~~~~ 159 (324)
.|.....++.| +|..||.|+|.- =.. ..| .+-||..+ ..|.- +.......|.|.+||+|..+.
T Consensus 877 ~CqD~T~G~~CdrCl~GyyGdP~l---g~g--~~C-rPCpCP~gp~Sg~~~A~sC~~-d~~t~~ivC~C~~GY~G~RCe 948 (1758)
T KOG0994|consen 877 DCQDSTTGHSCDRCLDGYYGDPRL---GSG--IGC-RPCPCPDGPASGRQHADSCYL-DTRTQQIVCHCQEGYSGSRCE 948 (1758)
T ss_pred cccccccccchhhhhccccCCccc---CCC--CCC-CCCCCCCCCccchhccccccc-cccccceeeecccCccccchh
Confidence 34555666777 799999998852 111 223 22233221 12221 113455789999999998863
No 35
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=92.86 E-value=0.17 Score=30.02 Aligned_cols=25 Identities=36% Similarity=0.852 Sum_probs=20.7
Q ss_pred CCCCCCCEeeeCCCceeeeeCCCeeecC
Q psy9427 20 VSCGVNAVCVVNNHEARCFCPPGSYVGD 47 (324)
Q Consensus 20 ~~C~~~~~C~~~~~~~~C~C~~G~y~g~ 47 (324)
..|.++++|+...+ +|+|.+| |+|+
T Consensus 6 ~~C~~~G~C~~~~g--~C~C~~g-~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSPCG--RCVCDSG-YTGP 30 (32)
T ss_pred CccCCCCEEeCCCC--EEECCCC-CcCC
Confidence 35889999987644 8999999 8885
No 36
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=92.26 E-value=0.1 Score=43.94 Aligned_cols=106 Identities=28% Similarity=0.710 Sum_probs=65.0
Q ss_pred CCCCCCCEeeeCC-----CceeeeeCCCeeecCCCCCCcccCCCCCCcCCCCCceeCCCCcccCCCCCCCCCCCCEEeeC
Q psy9427 20 VSCGVNAVCVVNN-----HEARCFCPPGSYVGDPNIECRQVPCVHNHDCSTSHACDLLSHTCIPACDSGACGTNAVCIPE 94 (324)
Q Consensus 20 ~~C~~~~~C~~~~-----~~~~C~C~~G~y~g~~~~~C~~~~C~~~~~C~~~~~C~c~~~~Ci~~C~~~~C~~~~~C~~~ 94 (324)
.+|+.-|.|++.. ..|.|.|.+| |..... .|.+ +.|....|+ .+.|+..
T Consensus 50 K~Cgdya~C~~~~~~~~~~~~~C~C~~g-Y~~~~~-vCvp-----------------------~~C~~~~Cg-~GKCI~d 103 (197)
T PF06247_consen 50 KPCGDYAKCINQANKGEERAYKCDCING-YILKQG-VCVP-----------------------NKCNNKDCG-SGKCILD 103 (197)
T ss_dssp SEEETTEEEEE-SSTTSSTSEEEEE-TT-EEESSS-SEEE-----------------------GGGSS---T-TEEEEEE
T ss_pred ccccchhhhhcCCCcccceeEEEecccC-ceeeCC-eEch-----------------------hhcCceecC-CCeEEec
Confidence 4799999998654 6799999999 876432 3444 444445676 5788843
Q ss_pred ---CCCcEeeCCCCCcCCCCCCCccccc-CCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCCC
Q psy9427 95 ---DHRPVCHCPVGTHPSPSPDVHCARE-SNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDPY 158 (324)
Q Consensus 95 ---~~~~~C~C~~G~~g~~~~~~~C~~~-~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~~ 158 (324)
+....|+|.-|+..+.. ..|... ..+| .-.|..+..|.. +.+-|+|.+..||.++..
T Consensus 104 ~~~~~~~~CSC~IGkV~~dn--~kCtk~G~T~C--~LKCk~nE~CK~---~~~~Y~C~~~~~~~~~~~ 164 (197)
T PF06247_consen 104 PDNPNNPTCSCNIGKVPDDN--KKCTKTGETKC--SLKCKENEECKL---VDGYYKCVCKEGFPGDGE 164 (197)
T ss_dssp EGGGSEEEEEE-TEEETTTT--TESEEEE----------TTTEEEEE---ETTEEEEEE-TT-EEETT
T ss_pred CCCCCCceeEeeeceEeccC--CcccCCCccce--eeecCCCcceee---eCcEEEeecCCCCCCCCC
Confidence 23459999999983221 566543 1344 224777889999 888899999999987763
No 37
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=91.11 E-value=0.25 Score=29.34 Aligned_cols=23 Identities=22% Similarity=0.717 Sum_probs=19.3
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 193 ACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 193 ~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
.|..+++|+.. ..+|.|.+||+|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G 29 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTG 29 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcC
Confidence 47789999976 458999999997
No 38
>PF04706 Dickkopf_N: Dickkopf N-terminal cysteine-rich region; InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=91.03 E-value=0.41 Score=31.82 Aligned_cols=21 Identities=24% Similarity=0.702 Sum_probs=18.1
Q ss_pred cccccCCCCCCCCcccCCccc
Q psy9427 295 LGCRQNEECSGQMICAGSKCT 315 (324)
Q Consensus 295 ~~C~~~~~C~~~~~C~~~~C~ 315 (324)
..|+.+++|-.++.|++|+|+
T Consensus 31 ~rC~Rd~~CC~g~~CvnG~C~ 51 (52)
T PF04706_consen 31 KRCTRDAMCCPGNLCVNGVCT 51 (52)
T ss_pred CCCCCCcccCCCCeeeCCEec
Confidence 468899999999999999886
No 39
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=91.02 E-value=0.22 Score=30.47 Aligned_cols=30 Identities=30% Similarity=0.604 Sum_probs=20.9
Q ss_pred CCCCCCCCCCEeeeCC-CceeeeeCCCeeecC
Q psy9427 17 CDGVSCGVNAVCVVNN-HEARCFCPPGSYVGD 47 (324)
Q Consensus 17 C~~~~C~~~~~C~~~~-~~~~C~C~~G~y~g~ 47 (324)
|...+|..||.|++.. |++.|.|..| |..+
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllg-yk~~ 32 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLG-YKKV 32 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TT-EEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCC-cccc
Confidence 4556789999999765 9999999999 8763
No 40
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=90.20 E-value=0.27 Score=22.93 Aligned_cols=11 Identities=55% Similarity=1.573 Sum_probs=8.6
Q ss_pred eeeeCCCeeecC
Q psy9427 36 RCFCPPGSYVGD 47 (324)
Q Consensus 36 ~C~C~~G~y~g~ 47 (324)
.|+|++| |+|.
T Consensus 1 ~C~C~~G-~~G~ 11 (13)
T PF12661_consen 1 TCQCPPG-WTGP 11 (13)
T ss_dssp EEEE-TT-EETT
T ss_pred CccCcCC-CcCC
Confidence 4899999 9985
No 41
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=89.25 E-value=0.17 Score=30.97 Aligned_cols=30 Identities=27% Similarity=0.483 Sum_probs=21.1
Q ss_pred CCCCCCCCCEEeeCC-CCcEeeCCCCCcCCC
Q psy9427 81 DSGACGTNAVCIPED-HRPVCHCPVGTHPSP 110 (324)
Q Consensus 81 ~~~~C~~~~~C~~~~-~~~~C~C~~G~~g~~ 110 (324)
....|..+|.|++.. |++.|+|..||...+
T Consensus 3 ~~~~cP~NA~C~~~~dG~eecrCllgyk~~~ 33 (37)
T PF12946_consen 3 IDTKCPANAGCFRYDDGSEECRCLLGYKKVG 33 (37)
T ss_dssp SSS---TTEEEEEETTSEEEEEE-TTEEEET
T ss_pred cCccCCCCcccEEcCCCCEEEEeeCCccccC
Confidence 335677889999766 999999999998644
No 42
>KOG0994|consensus
Probab=88.60 E-value=2 Score=45.77 Aligned_cols=75 Identities=24% Similarity=0.556 Sum_probs=40.7
Q ss_pred CCceeeeeCCCeeecCCCCCCcccCCCC---CCcCCCCCceeCCCCcccCCCCCCCCC-CCCE---EeeCCCCcEe-eCC
Q psy9427 32 NHEARCFCPPGSYVGDPNIECRQVPCVH---NHDCSTSHACDLLSHTCIPACDSGACG-TNAV---CIPEDHRPVC-HCP 103 (324)
Q Consensus 32 ~~~~~C~C~~G~y~g~~~~~C~~~~C~~---~~~C~~~~~C~c~~~~Ci~~C~~~~C~-~~~~---C~~~~~~~~C-~C~ 103 (324)
.....|+|.+| |+|.....|.++--.+ +++|. .|.|.++ ||.-.+..|. ..+. |+....+.+| .|+
T Consensus 931 t~~ivC~C~~G-Y~G~RCe~CA~~~fGnP~~GGtCq---~CeC~~N--iD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck 1004 (1758)
T KOG0994|consen 931 TQQIVCHCQEG-YSGSRCEICADNHFGNPSEGGTCQ---KCECSNN--IDLYDPGACDVATGACLKCLYHTEGDHCEHCK 1004 (1758)
T ss_pred ccceeeecccC-ccccchhhhcccccCCcccCCccc---cccccCC--cCccCCCccchhhchhhhhhhcccccchhhcc
Confidence 34568999999 9998777776631111 33321 0111000 5555555553 1233 4433334455 699
Q ss_pred CCCcCCCCC
Q psy9427 104 VGTHPSPSP 112 (324)
Q Consensus 104 ~G~~g~~~~ 112 (324)
.||.|+...
T Consensus 1005 ~Gf~GdA~~ 1013 (1758)
T KOG0994|consen 1005 DGFYGDALR 1013 (1758)
T ss_pred ccchhHHHH
Confidence 999997653
No 43
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=88.27 E-value=0.44 Score=41.78 Aligned_cols=36 Identities=25% Similarity=0.637 Sum_probs=27.7
Q ss_pred CCCCCCCCCCCCCCCCCCCcccCCcccCCCccCCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 161 GCVPQGQCPRGDADCPPSSRCSGGQCVSVCQGACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 161 ~C~~~~eC~~~~~~C~~~~~C~~~~C~d~C~~~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
.|.+.++|....++ |. ..|.++.|+|.|.|++||+.
T Consensus 183 ~C~~~~~C~~~~~~-------------------c~--~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 183 ICVVPDLCATLSHV-------------------CQ--QVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cCcCchhhcCCCCC-------------------cc--ceEEcCCCCEEeECCCCccC
Confidence 47777778554433 43 46999999999999999985
No 44
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=85.13 E-value=1.2 Score=39.07 Aligned_cols=38 Identities=26% Similarity=0.523 Sum_probs=29.8
Q ss_pred CcccccCCCCCCCC-CCCCCCeeecCCCCCCCeeeeCCCCCccCC
Q psy9427 114 VHCARESNLCRDSN-PCHPTARCEDTGATNGKVVCSCPELSVGDP 157 (324)
Q Consensus 114 ~~C~~~~~~C~~~~-~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~ 157 (324)
..|.+. ++|...+ .| ...|.+ +.|+|.|.|++||++..
T Consensus 182 ~~C~~~-~~C~~~~~~c--~~~C~~---~~g~~~c~c~~g~~~~~ 220 (224)
T cd01475 182 KICVVP-DLCATLSHVC--QQVCIS---TPGSYLCACTEGYALLE 220 (224)
T ss_pred ccCcCc-hhhcCCCCCc--cceEEc---CCCCEEeECCCCccCCC
Confidence 678776 8884332 45 468999 99999999999998765
No 45
>PF04706 Dickkopf_N: Dickkopf N-terminal cysteine-rich region; InterPro: IPR006796 Dickkopf proteins are a class of Wnt antagonists. They possess two conserved cysteine-rich regions. This family represents the N-terminal conserved region []. The C-terminal region has been found to share significant sequence similarity to the colipase fold (IPR001981 from INTERPRO) [].; GO: 0007275 multicellular organismal development, 0030178 negative regulation of Wnt receptor signaling pathway, 0005576 extracellular region
Probab=84.85 E-value=1.4 Score=29.33 Aligned_cols=15 Identities=27% Similarity=0.738 Sum_probs=6.7
Q ss_pred CCCCCccCCCCCcCCC
Q psy9427 233 CAGDQECLAGEEKCIG 248 (324)
Q Consensus 233 C~~~~~C~~~~~~C~~ 248 (324)
|..+.+|. .+..|+.
T Consensus 2 C~~D~dC~-~g~yC~~ 16 (52)
T PF04706_consen 2 CSSDEDCG-YGKYCHS 16 (52)
T ss_pred CcccccCC-CCCCcCC
Confidence 44445554 3444443
No 46
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=79.77 E-value=1.8 Score=33.39 Aligned_cols=24 Identities=42% Similarity=1.186 Sum_probs=19.7
Q ss_pred CCCCCCeeeeCCCCceeeCCCCCcc
Q psy9427 193 ACGPNALCRAVGKRPVCACPDGFSA 217 (324)
Q Consensus 193 ~C~~~~~C~~~~g~y~C~C~~Gy~g 217 (324)
.|+..+.|.. .....|.|.+||.+
T Consensus 85 ~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 85 FCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred ccCCccEeCC-CCCCceECCCCcCC
Confidence 6899999954 45678999999975
No 47
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=78.38 E-value=2 Score=33.18 Aligned_cols=31 Identities=35% Similarity=0.856 Sum_probs=23.9
Q ss_pred ccCCCC-CCCCCCCEeeeCCCceeeeeCCCeeec
Q psy9427 14 WNSCDG-VSCGVNAVCVVNNHEARCFCPPGSYVG 46 (324)
Q Consensus 14 ~~~C~~-~~C~~~~~C~~~~~~~~C~C~~G~y~g 46 (324)
.|.|+. ..|++++.|.. ..+..|.|++| |..
T Consensus 77 ~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~G-F~P 108 (110)
T PF00954_consen 77 KDQCDVYGFCGPNGICNS-NNSPKCSCLPG-FEP 108 (110)
T ss_pred ccCCCCccccCCccEeCC-CCCCceECCCC-cCC
Confidence 356874 68999999954 45678999999 864
No 48
>KOG1836|consensus
Probab=71.15 E-value=7 Score=44.18 Aligned_cols=51 Identities=27% Similarity=0.486 Sum_probs=33.1
Q ss_pred eCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeee-CCCCCccCCC
Q psy9427 101 HCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCS-CPELSVGDPY 158 (324)
Q Consensus 101 ~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~-C~~G~~g~~~ 158 (324)
+|..||.|.+..+. . +.| .+-+|..++.|.... ......|+ |++||+|..+
T Consensus 760 ~C~~GfYg~~~~~~----~-~dC-~~C~Cp~~~~~~~~~-~~~~~iCk~Cp~gytG~rC 811 (1705)
T KOG1836|consen 760 QCVDGFYGLPDLGT----S-GDC-QPCPCPNGGACGQTP-EILEVVCKNCPPGYTGLRC 811 (1705)
T ss_pred hhcCCCCCccccCC----C-CCC-ccCCCCCChhhcCcC-cccceecCCCCCCCccccc
Confidence 79999998765311 1 225 444777777776510 13557788 9999999875
No 49
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=68.30 E-value=3.6 Score=26.96 Aligned_cols=18 Identities=33% Similarity=1.169 Sum_probs=8.9
Q ss_pred cccCCCCCCCCcccCCcc
Q psy9427 297 CRQNEECSGQMICAGSKC 314 (324)
Q Consensus 297 C~~~~~C~~~~~C~~~~C 314 (324)
|..+.+|..+..|++++|
T Consensus 22 C~~~~qC~~~s~C~~g~C 39 (52)
T PF01683_consen 22 CESDEQCIGGSVCVNGRC 39 (52)
T ss_pred CCCcCCCCCcCEEcCCEe
Confidence 444444544555555444
No 50
>PF00095 WAP: WAP-type (Whey Acidic Protein) 'four-disulfide core'; InterPro: IPR008197 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Whey acidic protein (WAP) is a major component of the whey fraction of milk, which contains a significant number of different proteins. WAP proteins share limited sequence identity, except for their conserved cysteine-rich regions, known as 4-disulphide core (4-DSC) domains, and the positional conservation of specific residues []. Several non-milk proteins also contain 4-DSC patterns, such as certain serine protease inhibitors. WAP itself appears to have a protease-inhibitor function, as seen with its inhibitory effect on the progression of cancer cells [].; GO: 0030414 peptidase inhibitor activity, 0005576 extracellular region; PDB: 1FLE_I 2REL_A 2Z7F_I 3NGG_B.
Probab=60.65 E-value=3.2 Score=26.11 Aligned_cols=20 Identities=30% Similarity=0.944 Sum_probs=14.1
Q ss_pred CccccccccCCCCCCCCccc
Q psy9427 291 GTCELGCRQNEECSGQMICA 310 (324)
Q Consensus 291 ~~c~~~C~~~~~C~~~~~C~ 310 (324)
..|...|.+|.||+..+.|=
T Consensus 13 ~~c~~~C~~D~dC~~~~KCC 32 (43)
T PF00095_consen 13 GDCRDECSSDSDCPGGQKCC 32 (43)
T ss_dssp CEESBS-SSGGGSSTT-EEE
T ss_pred CcccccCcCccccCCcCeEC
Confidence 44668899999998888883
No 51
>smart00051 DSL delta serrate ligand.
Probab=57.76 E-value=17 Score=25.16 Aligned_cols=47 Identities=19% Similarity=0.333 Sum_probs=28.7
Q ss_pred CcEeeCCCCCcCCCCCCCcccccCCCCCCCCCCCCCCeeecCCCCCCCeeeeCCCCCccCC
Q psy9427 97 RPVCHCPVGTHPSPSPDVHCARESNLCRDSNPCHPTARCEDTGATNGKVVCSCPELSVGDP 157 (324)
Q Consensus 97 ~~~C~C~~G~~g~~~~~~~C~~~~~~C~~~~~C~~~~~C~~~~~~~g~~~C~C~~G~~g~~ 157 (324)
.+.-.|.++|.| ..|.. .|...+....+.+|.. . -.++|.+||+|..
T Consensus 16 ~~rv~C~~~~yG-----~~C~~---~C~~~~d~~~~~~Cd~----~--G~~~C~~Gw~G~~ 62 (63)
T smart00051 16 QIRVTCDENYYG-----EGCNK---FCRPRDDFFGHYTCDE----N--GNKGCLEGWMGPY 62 (63)
T ss_pred EEEeeCCCCCcC-----CccCC---EeCcCccccCCccCCc----C--CCEecCCCCcCCC
Confidence 345578999999 55542 3422223445566633 2 3578999999865
No 52
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=55.64 E-value=14 Score=22.06 Aligned_cols=21 Identities=43% Similarity=1.012 Sum_probs=13.8
Q ss_pred EeeeCCCceeeeeCCCeeecCCC
Q psy9427 27 VCVVNNHEARCFCPPGSYVGDPN 49 (324)
Q Consensus 27 ~C~~~~~~~~C~C~~G~y~g~~~ 49 (324)
.|... ..+.|.||+| |..+..
T Consensus 11 ~CDpn-~~~~C~CPeG-yIlde~ 31 (34)
T PF09064_consen 11 DCDPN-SPGQCFCPEG-YILDEG 31 (34)
T ss_pred ccCCC-CCCceeCCCc-eEecCC
Confidence 56532 2238999999 876543
No 53
>PF04885 Stig1: Stigma-specific protein, Stig1; InterPro: IPR006969 This family represents the Stig1 cysteine rich plant protein.The tobacco stigma-specific gene, STIG1 is developmentally regulated and expressed specifically in the stigmatic secretory zone. Pistils of transgenic STIG1-barnase tobacco plants undergo normal development, but lack the stigmatic secretory zone and are female sterile. Pollen grains are unable to penetrate the surface of the ablated pistils. Application of stigmatic exudate from wild-type pistils to the ablated surface increases the efficiency of pollen tube germination and growth and restores the capacity of pollen tubes to penetrate the style []. The function of STIG1 is unknown.
Probab=55.13 E-value=32 Score=27.75 Aligned_cols=34 Identities=18% Similarity=0.505 Sum_probs=18.6
Q ss_pred CCCcCCCCcccCCCCC-------CCCCCCCCcccCCcccCC
Q psy9427 242 GEEKCIGGRCRVACLS-------HSDCSEGERCISSLCQSP 275 (324)
Q Consensus 242 ~~~~C~~~~C~~~C~~-------~~~C~~~~~C~~~~C~~~ 275 (324)
.+..|-+++|+++=.+ +..|..++.|-+++|+++
T Consensus 68 ~~~~CC~~~Cvdv~~d~~nCG~Cg~~C~~g~~cC~G~Cvd~ 108 (136)
T PF04885_consen 68 PGPTCCNNKCVDVSSDRNNCGACGNKCPYGQTCCGGQCVDL 108 (136)
T ss_pred CCCcccCCcCCccCCCccccHhhcCCCCCCceecCCEeECC
Confidence 3445656666553221 244566667777777653
No 54
>PHA02887 EGF-like protein; Provisional
Probab=53.51 E-value=13 Score=29.01 Aligned_cols=24 Identities=25% Similarity=0.699 Sum_probs=18.4
Q ss_pred CCCCCCeeeeC--CCCceeeCCCCCcc
Q psy9427 193 ACGPNALCRAV--GKRPVCACPDGFSA 217 (324)
Q Consensus 193 ~C~~~~~C~~~--~g~y~C~C~~Gy~g 217 (324)
-|- +|+|.-. ...+.|.|+.||+|
T Consensus 93 YCi-HG~C~yI~dL~epsCrC~~GYtG 118 (126)
T PHA02887 93 FCI-NGECMNIIDLDEKFCICNKGYTG 118 (126)
T ss_pred Eee-CCEEEccccCCCceeECCCCccc
Confidence 465 4689744 45689999999998
No 55
>PHA02887 EGF-like protein; Provisional
Probab=53.14 E-value=15 Score=28.63 Aligned_cols=31 Identities=23% Similarity=0.578 Sum_probs=23.0
Q ss_pred CCCCCCCEEee--CCCCcEeeCCCCCcCCCCCCCccccc
Q psy9427 83 GACGTNAVCIP--EDHRPVCHCPVGTHPSPSPDVHCARE 119 (324)
Q Consensus 83 ~~C~~~~~C~~--~~~~~~C~C~~G~~g~~~~~~~C~~~ 119 (324)
+-|. +|+|.- ....+.|.|.+||+| .+|+.+
T Consensus 92 ~YCi-HG~C~yI~dL~epsCrC~~GYtG-----~RCE~v 124 (126)
T PHA02887 92 DFCI-NGECMNIIDLDEKFCICNKGYTG-----IRCDEV 124 (126)
T ss_pred CEee-CCEEEccccCCCceeECCCCccc-----CCCCcc
Confidence 3455 468874 345689999999999 788754
No 56
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=48.99 E-value=21 Score=23.18 Aligned_cols=25 Identities=24% Similarity=0.558 Sum_probs=17.2
Q ss_pred EeeeCCCceeeeeCCCeeecCCCCCCcc
Q psy9427 27 VCVVNNHEARCFCPPGSYVGDPNIECRQ 54 (324)
Q Consensus 27 ~C~~~~~~~~C~C~~G~y~g~~~~~C~~ 54 (324)
.|....| .|.|++| |+|...+.|.+
T Consensus 13 ~C~~~~G--~C~C~~~-~~G~~C~~C~~ 37 (50)
T cd00055 13 QCDPGTG--QCECKPN-TTGRRCDRCAP 37 (50)
T ss_pred cccCCCC--EEeCCCc-CCCCCCCCCCC
Confidence 3655445 7999999 89976655543
No 57
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=48.23 E-value=15 Score=23.68 Aligned_cols=26 Identities=27% Similarity=0.744 Sum_probs=17.7
Q ss_pred CEeeeCCCceeeeeCCCeeecCCCCCCcc
Q psy9427 26 AVCVVNNHEARCFCPPGSYVGDPNIECRQ 54 (324)
Q Consensus 26 ~~C~~~~~~~~C~C~~G~y~g~~~~~C~~ 54 (324)
+.|....+ .|.|+++ |+|...+.|.+
T Consensus 11 ~~C~~~~G--~C~C~~~-~~G~~C~~C~~ 36 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPG-TTGPRCDQCKP 36 (49)
T ss_dssp SSEEETCE--EESBSTT-EESTTS-EE-T
T ss_pred CcccCCCC--EEecccc-ccCCcCcCCCC
Confidence 36766444 8999999 99976665554
No 58
>smart00217 WAP Four-disulfide core domains.
Probab=48.19 E-value=25 Score=22.66 Aligned_cols=24 Identities=25% Similarity=0.732 Sum_probs=18.1
Q ss_pred cCCCCccccccccCCCCCCCCccc
Q psy9427 287 ACQAGTCELGCRQNEECSGQMICA 310 (324)
Q Consensus 287 ~C~~~~c~~~C~~~~~C~~~~~C~ 310 (324)
.|..+.|...|.+|.+|+..+.|=
T Consensus 12 ~c~~~~c~~~C~~D~~C~~~~KCC 35 (47)
T smart00217 12 SCPLGNPPNKCSSDSDCPGVKKCC 35 (47)
T ss_pred ccccCCCCccCcCccccCCCCCcC
Confidence 455667777888888888877773
No 59
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=42.12 E-value=22 Score=28.26 Aligned_cols=24 Identities=21% Similarity=0.515 Sum_probs=18.8
Q ss_pred CCCCCCeeeeC--CCCceeeCCCCCcc
Q psy9427 193 ACGPNALCRAV--GKRPVCACPDGFSA 217 (324)
Q Consensus 193 ~C~~~~~C~~~--~g~y~C~C~~Gy~g 217 (324)
-|.+| +|.-. ...+.|.|..||+|
T Consensus 52 YClHG-~C~yI~dl~~~~CrC~~GYtG 77 (139)
T PHA03099 52 YCLHG-DCIHARDIDGMYCRCSHGYTG 77 (139)
T ss_pred EeECC-EEEeeccCCCceeECCCCccc
Confidence 46665 89744 47799999999998
No 60
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=41.83 E-value=24 Score=28.05 Aligned_cols=31 Identities=26% Similarity=0.579 Sum_probs=23.5
Q ss_pred CCCCCCCEEeeC--CCCcEeeCCCCCcCCCCCCCccccc
Q psy9427 83 GACGTNAVCIPE--DHRPVCHCPVGTHPSPSPDVHCARE 119 (324)
Q Consensus 83 ~~C~~~~~C~~~--~~~~~C~C~~G~~g~~~~~~~C~~~ 119 (324)
+-|.+ +.|.-. ...+.|.|..||+| .+|+..
T Consensus 51 ~YClH-G~C~yI~dl~~~~CrC~~GYtG-----eRCEh~ 83 (139)
T PHA03099 51 GYCLH-GDCIHARDIDGMYCRCSHGYTG-----IRCQHV 83 (139)
T ss_pred CEeEC-CEEEeeccCCCceeECCCCccc-----ccccce
Confidence 34655 488743 47899999999999 888765
No 61
>KOG3516|consensus
Probab=36.40 E-value=26 Score=37.95 Aligned_cols=37 Identities=19% Similarity=0.467 Sum_probs=32.5
Q ss_pred cCCCCCCCCCCCCEEeeCCCCcEeeCC-CCCcCCCCCCCcccc
Q psy9427 77 IPACDSGACGTNAVCIPEDHRPVCHCP-VGTHPSPSPDVHCAR 118 (324)
Q Consensus 77 i~~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~~~~~~C~~ 118 (324)
++.|.+++|.+++.|......|+|.|. .||.| .+|..
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~G-----atCHt 582 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKG-----ATCHT 582 (1306)
T ss_pred ccccCCccccCCCcccccccceeEecccccccc-----ccccC
Confidence 788999999999999998899999998 89998 56643
No 62
>KOG3516|consensus
Probab=30.92 E-value=34 Score=37.08 Aligned_cols=33 Identities=24% Similarity=0.573 Sum_probs=29.7
Q ss_pred ccCCCCCCCCCCCEeeeCCCceeeeeC-CCeeecC
Q psy9427 14 WNSCDGVSCGVNAVCVVNNHEARCFCP-PGSYVGD 47 (324)
Q Consensus 14 ~~~C~~~~C~~~~~C~~~~~~~~C~C~-~G~y~g~ 47 (324)
.|.|..++|..++.|.-....|.|.|. .| |.|.
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TG-Y~Ga 578 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTG-YKGA 578 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccc-cccc
Confidence 567999999999999999999999999 89 8774
No 63
>PF04885 Stig1: Stigma-specific protein, Stig1; InterPro: IPR006969 This family represents the Stig1 cysteine rich plant protein.The tobacco stigma-specific gene, STIG1 is developmentally regulated and expressed specifically in the stigmatic secretory zone. Pistils of transgenic STIG1-barnase tobacco plants undergo normal development, but lack the stigmatic secretory zone and are female sterile. Pollen grains are unable to penetrate the surface of the ablated pistils. Application of stigmatic exudate from wild-type pistils to the ablated surface increases the efficiency of pollen tube germination and growth and restores the capacity of pollen tubes to penetrate the style []. The function of STIG1 is unknown.
Probab=30.37 E-value=1.1e+02 Score=24.80 Aligned_cols=54 Identities=30% Similarity=0.749 Sum_probs=32.0
Q ss_pred CCCCcccCCcccCC------CC-CCCCCCCCCcCCCCcccc------cccc-CCCCCCCCcccCCccc
Q psy9427 262 SEGERCISSLCQSP------CT-SHSQCPTGQACQAGTCEL------GCRQ-NEECSGQMICAGSKCT 315 (324)
Q Consensus 262 ~~~~~C~~~~C~~~------C~-~~~~C~~~~~C~~~~c~~------~C~~-~~~C~~~~~C~~~~C~ 315 (324)
..+..|-++.|++. |. =+..|+.++.|-.|.|.. +|-. +..|+.++.|+.|.|.
T Consensus 67 ~~~~~CC~~~Cvdv~~d~~nCG~Cg~~C~~g~~cC~G~Cvd~~~d~~~CG~Cg~~C~~G~~C~~G~C~ 134 (136)
T PF04885_consen 67 SPGPTCCNNKCVDVSSDRNNCGACGNKCPYGQTCCGGQCVDLNSDPRHCGACGNKCPPGQKCVYGMCG 134 (136)
T ss_pred CCCCcccCCcCCccCCCccccHhhcCCCCCCceecCCEeECCCCCccccCCCCCcCCCcCCcCCeECC
Confidence 44556666666642 10 123566677777776642 3432 4578888888888774
No 64
>cd00199 WAP whey acidic protein-type four-disulfide core domains. Members of the family include whey acidic protein, elafin (elastase-specific inhibitor), caltrin-like protein (a calcium transport inhibitor) and other extracellular proteinase inhibitors. A group of proteins containing 8 characteristically-spaced cysteine residuesforming disulphide bonds, have been termed '4-disulphide core' proteins. Protease inhibition occurs by insertion of the inhibitory loop into the active site pocket and interference with the catalytic residues of the protease.
Probab=29.60 E-value=64 Score=21.94 Aligned_cols=21 Identities=24% Similarity=0.662 Sum_probs=15.4
Q ss_pred CCccccccccCCCCCCCCccc
Q psy9427 290 AGTCELGCRQNEECSGQMICA 310 (324)
Q Consensus 290 ~~~c~~~C~~~~~C~~~~~C~ 310 (324)
.+.|...|.+|.+|+..+.|=
T Consensus 28 ~~~c~~~C~~D~~C~g~~KCC 48 (60)
T cd00199 28 LGIPPNRCSSDSDCPGDKKCC 48 (60)
T ss_pred cCcccccCccccccCCCCCcC
Confidence 445667788888888777773
No 65
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=28.49 E-value=57 Score=20.70 Aligned_cols=25 Identities=24% Similarity=0.612 Sum_probs=16.7
Q ss_pred EeeeCCCceeeeeCCCeeecCCCCCCcc
Q psy9427 27 VCVVNNHEARCFCPPGSYVGDPNIECRQ 54 (324)
Q Consensus 27 ~C~~~~~~~~C~C~~G~y~g~~~~~C~~ 54 (324)
.|....| .|+|+++ |+|...+.|.+
T Consensus 12 ~C~~~~G--~C~C~~~-~~G~~C~~C~~ 36 (46)
T smart00180 12 TCDPDTG--QCECKPN-VTGRRCDRCAP 36 (46)
T ss_pred cccCCCC--EEECCCC-CCCCCCCcCCC
Confidence 5654444 8999999 88865554443
No 66
>PF08562 Crisp: Crisp; InterPro: IPR013871 This entry is found on Crisp proteins which contain IPR001283 from INTERPRO and has been termed the Crisp domain. It is found in the mammalian reproductive tract and the venom of reptiles, and has been shown to regulate ryanodine receptor Ca2+ signalling []. It contains 10 conserved cysteines which are all involved in disulphide bonds and is structurally related to the ion channel inhibitor toxins BgK and ShK []. ; PDB: 3MZ8_B 1XX5_B 2GIZ_A 1XTA_A 1RC9_A 2A05_A 2CQ7_A 2DDA_C 2EPF_A 2DDB_C ....
Probab=27.06 E-value=27 Score=23.49 Aligned_cols=15 Identities=20% Similarity=0.262 Sum_probs=10.4
Q ss_pred CCcccCCcccccchh
Q psy9427 306 QMICAGSKCTGWVEE 320 (324)
Q Consensus 306 ~~~C~~~~C~np~~~ 320 (324)
+..|.+|-|.|||.-
T Consensus 5 P~~CdngLCTNpC~y 19 (55)
T PF08562_consen 5 PSNCDNGLCTNPCPY 19 (55)
T ss_dssp TTSEETTEES-B-SS
T ss_pred chhhcCCCcCCCccc
Confidence 457899999999963
No 67
>PF11403 Yeast_MT: Yeast metallothionein; InterPro: IPR022710 Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=23.10 E-value=93 Score=18.47 Aligned_cols=12 Identities=33% Similarity=1.284 Sum_probs=5.4
Q ss_pred cccccccCCCCC
Q psy9427 293 CELGCRQNEECS 304 (324)
Q Consensus 293 c~~~C~~~~~C~ 304 (324)
|..+|.+++.|+
T Consensus 26 cptgcnsddkcp 37 (40)
T PF11403_consen 26 CPTGCNSDDKCP 37 (40)
T ss_dssp S-TTTTSSTT--
T ss_pred CCCCCCCCCcCC
Confidence 344566666665
No 68
>KOG3514|consensus
Probab=23.09 E-value=56 Score=35.24 Aligned_cols=35 Identities=29% Similarity=0.722 Sum_probs=30.0
Q ss_pred CCCCCCCCCCCEEeeCCCCcEeeCC-CCCcCCCCCCCcccc
Q psy9427 79 ACDSGACGTNAVCIPEDHRPVCHCP-VGTHPSPSPDVHCAR 118 (324)
Q Consensus 79 ~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~~~~~~C~~ 118 (324)
.|.++||.+++.|..-.++|.|.|. .||.| +.|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G-----~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEG-----RTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccC-----ccccc
Confidence 6888999999999999999999995 57777 77764
No 69
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=21.34 E-value=1.2e+02 Score=22.37 Aligned_cols=27 Identities=30% Similarity=0.898 Sum_probs=20.8
Q ss_pred CCCCCCCCCCCEeee-CCCceeeeeCCC
Q psy9427 16 SCDGVSCGVNAVCVV-NNHEARCFCPPG 42 (324)
Q Consensus 16 ~C~~~~C~~~~~C~~-~~~~~~C~C~~G 42 (324)
||+...|+.+.+|+. ..+...|+|.+-
T Consensus 1 pC~~v~C~~G~~C~~d~~~~p~CvC~~~ 28 (86)
T cd01328 1 PCENHHCGAGKVCEVDDENTPKCVCIDP 28 (86)
T ss_pred CCCCcCCCCCCEeeECCCCCeEEecCCc
Confidence 577778999999985 467788887654
Done!