Query psy1764
Match_columns 498
No_of_seqs 305 out of 1421
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 19:11:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1764.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1764hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd01328 FSL_SPARC Follistatin- 99.6 2.7E-16 5.8E-21 129.2 4.6 73 59-132 1-85 (86)
2 cd01328 FSL_SPARC Follistatin- 99.5 1E-14 2.2E-19 119.9 5.4 73 134-207 1-84 (86)
3 cd01327 KAZAL_PSTI Kazal-type 99.3 6.3E-13 1.4E-17 96.7 2.8 39 365-403 7-45 (45)
4 smart00280 KAZAL Kazal type se 99.3 9.4E-13 2E-17 96.3 3.0 45 355-403 2-46 (46)
5 cd01327 KAZAL_PSTI Kazal-type 99.2 5.9E-12 1.3E-16 91.6 3.4 43 237-282 3-45 (45)
6 smart00280 KAZAL Kazal type se 99.2 7.6E-12 1.7E-16 91.5 3.5 45 235-282 2-46 (46)
7 PF07648 Kazal_2: Kazal-type s 99.0 6.7E-11 1.4E-15 84.5 1.0 38 366-403 4-42 (42)
8 cd00104 KAZAL_FS Kazal type se 99.0 2E-10 4.2E-15 81.8 2.8 39 365-403 3-41 (41)
9 PF00050 Kazal_1: Kazal-type s 99.0 2.3E-10 5E-15 84.5 2.4 41 289-329 8-48 (48)
10 PF07648 Kazal_2: Kazal-type s 99.0 3.8E-10 8.2E-15 80.6 2.7 39 244-282 3-42 (42)
11 PF00050 Kazal_1: Kazal-type s 98.9 6E-10 1.3E-14 82.3 2.9 41 239-282 8-48 (48)
12 cd00104 KAZAL_FS Kazal type se 98.9 1.1E-09 2.5E-14 77.9 3.2 39 244-282 3-41 (41)
13 KOG4004|consensus 98.7 3.7E-09 8E-14 98.4 2.3 79 54-132 47-132 (259)
14 KOG4004|consensus 98.4 1.2E-07 2.5E-12 88.6 3.4 78 208-285 50-133 (259)
15 KOG4578|consensus 97.9 2.5E-06 5.3E-11 85.2 -0.1 83 82-188 34-116 (421)
16 KOG3555|consensus 97.8 6.6E-06 1.4E-10 82.8 0.6 75 54-132 84-185 (434)
17 KOG4578|consensus 97.6 5.8E-05 1.3E-09 75.7 3.8 74 231-309 30-114 (421)
18 KOG3555|consensus 97.1 0.00013 2.7E-09 73.7 0.3 46 439-488 141-186 (434)
19 cd01330 KAZAL_SLC21 The kazal- 96.6 0.0013 2.7E-08 49.8 2.2 22 244-265 14-35 (54)
20 cd01330 KAZAL_SLC21 The kazal- 96.6 0.0013 2.8E-08 49.7 2.1 22 168-189 14-35 (54)
21 smart00057 FIMAC factor I memb 96.4 0.0025 5.4E-08 50.7 2.7 63 60-131 2-68 (69)
22 smart00057 FIMAC factor I memb 96.3 0.0017 3.6E-08 51.6 1.1 64 411-487 4-69 (69)
23 smart00274 FOLN Follistatin-N- 96.0 0.0047 1E-07 39.5 2.0 23 59-81 1-24 (26)
24 PF09289 FOLN: Follistatin/Ost 95.9 0.0042 9E-08 37.9 1.4 21 60-80 1-22 (22)
25 KOG3509|consensus 94.4 0.26 5.7E-06 57.1 11.0 217 57-329 172-404 (964)
26 smart00274 FOLN Follistatin-N- 94.2 0.044 9.6E-07 35.1 2.4 24 134-157 1-24 (26)
27 KOG3509|consensus 94.1 0.3 6.5E-06 56.7 10.7 299 94-403 59-404 (964)
28 TIGR00805 oat sodium-independe 84.6 0.52 1.1E-05 53.1 2.0 30 159-188 449-478 (633)
29 TIGR00805 oat sodium-independe 84.4 0.54 1.2E-05 53.0 2.0 30 235-264 449-478 (633)
30 PF09289 FOLN: Follistatin/Ost 81.3 1 2.2E-05 27.7 1.4 22 135-156 1-22 (22)
31 KOG3626|consensus 80.1 1 2.2E-05 51.3 2.1 31 159-189 514-544 (735)
32 KOG4597|consensus 80.1 1.3 2.9E-05 47.2 2.8 64 58-126 104-168 (560)
33 KOG3626|consensus 75.0 1.7 3.7E-05 49.5 2.0 30 235-264 514-543 (735)
34 PF00683 TB: TB domain; Inter 69.1 1 2.2E-05 32.3 -1.0 25 5-32 1-25 (42)
35 KOG4597|consensus 52.0 11 0.00024 40.6 2.6 60 335-399 108-168 (560)
36 PF00008 EGF: EGF-like domain 32.1 27 0.00058 23.1 1.2 25 60-84 1-26 (32)
37 PF12946 EGF_MSP1_1: MSP1 EGF 25.4 37 0.0008 23.7 1.0 25 60-84 2-27 (37)
38 cd00053 EGF Epidermal growth f 22.3 66 0.0014 20.3 1.8 23 62-84 5-27 (36)
39 smart00181 EGF Epidermal growt 22.1 64 0.0014 20.9 1.7 21 63-84 6-26 (35)
40 PF12190 amfpi-1: Fungal prote 21.3 61 0.0013 26.3 1.6 42 367-409 8-51 (91)
No 1
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=99.62 E-value=2.7e-16 Score=129.24 Aligned_cols=73 Identities=33% Similarity=0.884 Sum_probs=63.7
Q ss_pred cccceeccCCceeEe-ecCCCceeeCchhhhhcccCCCCcccCCCccceeeehhhhhhhccc-----------ceEEEee
Q psy1764 59 TCENIDCGEGKKCAI-KKGYPKCICAPDCKQRGRHMRNPVCGSDGRNYKTVCRLRKRACRKN-----------EVISVDY 126 (498)
Q Consensus 59 ~C~~~~C~~g~~C~~-~~~~~~C~C~~~C~~~~~~~~~PVCGsDG~TY~n~C~l~~~~C~~~-----------~~l~v~~ 126 (498)
||+++.|++|++|++ ..++|+|+|.+.|+... ..+.|||||||+||.|+|+|++++|..+ ..|+|.|
T Consensus 1 pC~~v~C~~G~~C~~d~~~~p~CvC~~~Cp~~~-~~~~~VCGsDg~TY~s~C~L~r~~C~~~~~~~~~~~~~~~~l~i~Y 79 (86)
T cd01328 1 PCENHHCGAGKVCEVDDENTPKCVCIDPCPEEV-DDRRKVCTNDNETFDSDCELYRTRCLCKGGKKGCRGPKYQHLHLDY 79 (86)
T ss_pred CCCCcCCCCCCEeeECCCCCeEEecCCcCCCCC-CCCCceeCCCCCCcccHhHHhhhHhccCCCCcCCCCCccCeEEEEe
Confidence 689999999999999 58999999999999721 2347899999999999999999999732 4799999
Q ss_pred cccccc
Q psy1764 127 FGTCQT 132 (498)
Q Consensus 127 ~G~C~~ 132 (498)
.|+|+.
T Consensus 80 ~G~Ck~ 85 (86)
T cd01328 80 YGECKE 85 (86)
T ss_pred eccccC
Confidence 999975
No 2
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=99.53 E-value=1e-14 Score=119.94 Aligned_cols=73 Identities=33% Similarity=0.809 Sum_probs=65.7
Q ss_pred CCCceecCCCceeeecCCcccccccceecccCCCCCCCcccCCCCceeehhhhhHHHHhhcC-----------Ccceeee
Q psy1764 134 SCDRVTCPSGKFCILDQNLIPHCNRCIRHCVTPSSDTHQVCGTDGVTYESACHLREAACRRG-----------KSIPIAY 202 (498)
Q Consensus 134 ~C~~~~c~~~~~C~~d~~~~~~C~~C~~~Cp~~~~~~~PVCGsdG~TY~n~C~l~~a~C~~~-----------~~i~~~~ 202 (498)
||.++.|+.++.|++|+++.|+|+ |+..||....++.||||+||+||.|+|+|++++|..+ ..|.|.|
T Consensus 1 pC~~v~C~~G~~C~~d~~~~p~Cv-C~~~Cp~~~~~~~~VCGsDg~TY~s~C~L~r~~C~~~~~~~~~~~~~~~~l~i~Y 79 (86)
T cd01328 1 PCENHHCGAGKVCEVDDENTPKCV-CIDPCPEEVDDRRKVCTNDNETFDSDCELYRTRCLCKGGKKGCRGPKYQHLHLDY 79 (86)
T ss_pred CCCCcCCCCCCEeeECCCCCeEEe-cCCcCCCCCCCCCceeCCCCCCcccHhHHhhhHhccCCCCcCCCCCccCeEEEEe
Confidence 588999999999999999999998 9999998655668899999999999999999999743 3799999
Q ss_pred ccccC
Q psy1764 203 RGHCK 207 (498)
Q Consensus 203 ~G~C~ 207 (498)
+|+|.
T Consensus 80 ~G~Ck 84 (86)
T cd01328 80 YGECK 84 (86)
T ss_pred ecccc
Confidence 99996
No 3
>cd01327 KAZAL_PSTI Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins, including the second domain of the ovomucoid turkey inhibitor and the C-terminal domain of the esophagus cancer-related gene-2 protein (ECRG-2), are members of the superfamily of kazal-type proteinase inhibitors and follistatin-like proteins.
Probab=99.33 E-value=6.3e-13 Score=96.73 Aligned_cols=39 Identities=31% Similarity=0.656 Sum_probs=37.3
Q ss_pred cCCCCccCCCCCcchhhhhhhhhhhccCceEEeeecccc
Q psy1764 365 HMRNPVCGSDGRNYKTVCRLRKRACRKNEVISVDYFGTC 403 (498)
Q Consensus 365 ~~~~PVCGSDG~TY~N~C~l~~aaC~~~~~i~v~~~G~C 403 (498)
..|.|||||||+||.|+|+|+.++|..+..|+|+|.|+|
T Consensus 7 ~~~~PVCGsDg~TY~N~C~l~~~~c~~~~~i~~~~~G~C 45 (45)
T cd01327 7 KDYDPVCGTDGVTYSNECLLCAENLKRQTNIRIKHDGEC 45 (45)
T ss_pred CCCCceeCCCCCEeCCHhHHHHHHhccCCCeEEeecCCC
Confidence 579999999999999999999999999999999999987
No 4
>smart00280 KAZAL Kazal type serine protease inhibitors. Kazal type serine protease inhibitors and follistatin-like domains.
Probab=99.31 E-value=9.4e-13 Score=96.34 Aligned_cols=45 Identities=42% Similarity=0.885 Sum_probs=40.2
Q ss_pred cCcchhcccccCCCCccCCCCCcchhhhhhhhhhhccCceEEeeecccc
Q psy1764 355 CAPDCKQRGRHMRNPVCGSDGRNYKTVCRLRKRACRKNEVISVDYFGTC 403 (498)
Q Consensus 355 C~~~C~~~~~~~~~PVCGSDG~TY~N~C~l~~aaC~~~~~i~v~~~G~C 403 (498)
|+..|+ ..++|||||||+||+|+|+|+.++|..+..|+|+|.|+|
T Consensus 2 C~~~C~----~~~~pVCgsdg~TY~N~C~l~~~~c~~~~~i~~~~~G~C 46 (46)
T smart00280 2 CPEACP----REYDPVCGSDGVTYSNECHLCKAACESGKSIEVKHDGPC 46 (46)
T ss_pred CCCCCC----CCCCccCCCCCCEeCCHhHHHHHHhcCCCCeEEeecCCC
Confidence 444565 479999999999999999999999998999999999987
No 5
>cd01327 KAZAL_PSTI Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins, including the second domain of the ovomucoid turkey inhibitor and the C-terminal domain of the esophagus cancer-related gene-2 protein (ECRG-2), are members of the superfamily of kazal-type proteinase inhibitors and follistatin-like proteins.
Probab=99.23 E-value=5.9e-12 Score=91.61 Aligned_cols=43 Identities=42% Similarity=0.760 Sum_probs=40.6
Q ss_pred ccccCCCCCCCccccCCCceecchhhhHHHHhhcCceeeeeeeeee
Q psy1764 237 RHCVTPSSDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGHC 282 (498)
Q Consensus 237 ~~Cp~~~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 282 (498)
..|| ..+.|||||||+||.|+|+|..++|..+..|+++|.|+|
T Consensus 3 ~~Cp---~~~~PVCGsDg~TY~N~C~l~~~~c~~~~~i~~~~~G~C 45 (45)
T cd01327 3 FGCP---KDYDPVCGTDGVTYSNECLLCAENLKRQTNIRIKHDGEC 45 (45)
T ss_pred CCCC---CCCCceeCCCCCEeCCHhHHHHHHhccCCCeEEeecCCC
Confidence 3688 899999999999999999999999999999999999987
No 6
>smart00280 KAZAL Kazal type serine protease inhibitors. Kazal type serine protease inhibitors and follistatin-like domains.
Probab=99.22 E-value=7.6e-12 Score=91.52 Aligned_cols=45 Identities=49% Similarity=1.020 Sum_probs=42.1
Q ss_pred ccccccCCCCCCCccccCCCceecchhhhHHHHhhcCceeeeeeeeee
Q psy1764 235 CIRHCVTPSSDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGHC 282 (498)
Q Consensus 235 C~~~Cp~~~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 282 (498)
|+..|| .++.||||+||+||.|+|+|..++|..+..|+++|.|+|
T Consensus 2 C~~~C~---~~~~pVCgsdg~TY~N~C~l~~~~c~~~~~i~~~~~G~C 46 (46)
T smart00280 2 CPEACP---REYDPVCGSDGVTYSNECHLCKAACESGKSIEVKHDGPC 46 (46)
T ss_pred CCCCCC---CCCCccCCCCCCEeCCHhHHHHHHhcCCCCeEEeecCCC
Confidence 456798 899999999999999999999999999999999999987
No 7
>PF07648 Kazal_2: Kazal-type serine protease inhibitor domain; InterPro: IPR011497 This domain is usually indicative of serine protease inhibitors that belong to Merops inhibitor families: I1, I2, I17 and I31. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays and have a central alpha-helix, a short two-stranded antiparallel beta-sheet and several disulphide bonds [, , ]. The amino terminal segment of this domain binds to the active site of its target proteases, thus inhibiting their function.; PDB: 2KMP_A 1LDT_L 2KMR_A 2KMO_A 2KMQ_A 1AN1_I 4E0S_B 3T5O_A 4A5W_B 1LR7_A ....
Probab=99.02 E-value=6.7e-11 Score=84.49 Aligned_cols=38 Identities=45% Similarity=0.896 Sum_probs=35.5
Q ss_pred CCC-CccCCCCCcchhhhhhhhhhhccCceEEeeecccc
Q psy1764 366 MRN-PVCGSDGRNYKTVCRLRKRACRKNEVISVDYFGTC 403 (498)
Q Consensus 366 ~~~-PVCGSDG~TY~N~C~l~~aaC~~~~~i~v~~~G~C 403 (498)
.+. ||||+||+||.|+|+|+.++|..+..|+|+|.|+|
T Consensus 4 ~~~~PVCg~dg~ty~n~C~l~~~~c~~~~~~~~~~~g~C 42 (42)
T PF07648_consen 4 EYSSPVCGSDGKTYSNECELRCANCRTNSKIQIVHDGSC 42 (42)
T ss_dssp SSSSTEEETTSEEESSHHHHHHHHHHHTTTCEEEEESSS
T ss_pred CCCCCEEcCCCCEEhhHHHHHHHHHhcCCCeEEEeCCCC
Confidence 444 99999999999999999999999999999999987
No 8
>cd00104 KAZAL_FS Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as, trypsin, chyomotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor.
Probab=99.00 E-value=2e-10 Score=81.82 Aligned_cols=39 Identities=44% Similarity=0.880 Sum_probs=36.3
Q ss_pred cCCCCccCCCCCcchhhhhhhhhhhccCceEEeeecccc
Q psy1764 365 HMRNPVCGSDGRNYKTVCRLRKRACRKNEVISVDYFGTC 403 (498)
Q Consensus 365 ~~~~PVCGSDG~TY~N~C~l~~aaC~~~~~i~v~~~G~C 403 (498)
..+.|||||||+||.|+|+|+.++|..+..|.+.|.|+|
T Consensus 3 ~~~~pVCgsdg~tY~n~C~~~~~~c~~~~~~~~~~~g~C 41 (41)
T cd00104 3 KEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41 (41)
T ss_pred CCCCccCCCCCCEECCHhhhhHHhhcCCCCeEEEeccCC
Confidence 467999999999999999999999998889999999987
No 9
>PF00050 Kazal_1: Kazal-type serine protease inhibitor domain; InterPro: IPR002350 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of Kazal inhibitors, belongs to MEROPS inhibitor family I1, clan IA. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) []. The members are primarily metazoan, but includes exceptions in the alveolata (apicomplexa), stramenopiles, higher plants and bacteria. Kazal inhibitors, which inhibit a number of serine proteases (such as trypsin and elastase), belong to family of proteins that includes pancreatic secretory trypsin inhibitor; avian ovomucoid; acrosin inhibitor; and elastase inhibitor. These proteins contain between 1 and 7 Kazal-type inhibitor repeats [, ]. The structure of the Kazal repeat includes a large quantity of extended chain, 2 short alpha-helices and a 3-stranded anti-parallel beta sheet [].The inhibitor makes 11 contacts with its enzyme substrate: unusually, 8 of these important residues are hypervariable []. Altering the enzyme-contact residues, and especially that of the active site bond, affects the the strength of inhibition and specificity of the inhibitor for particular serine proteases [, ]. The presence of this Pfam domain is usually indicative of serine protease inhibitors, however, Kazal-like domains are also seen in the extracellular part of agrins which are not known to be proteinase inhibitors.; GO: 0005515 protein binding; PDB: 1HPT_A 1CGI_I 1CGJ_I 2P6A_D 3HH2_C 2B0U_D 2BUS_A 1BUS_A 1OVO_A 3OVO_A ....
Probab=98.98 E-value=2.3e-10 Score=84.49 Aligned_cols=41 Identities=41% Similarity=0.660 Sum_probs=33.9
Q ss_pred cceeeeeeeCCCcceeeccchhhHHHhhcCCcceeeecccc
Q psy1764 289 IKTFWGVWSGTDGVTYESACHLREAACRRGKSIPIAYRGHC 329 (498)
Q Consensus 289 c~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 329 (498)
|+..++|||||||+||.|+|+|..+.-+.+..|.|+|.|+|
T Consensus 8 C~~~~~PVCGsdg~TY~N~C~lC~~~~~~~~~i~~~~~G~C 48 (48)
T PF00050_consen 8 CPREYDPVCGSDGVTYSNECELCAANRRSGKNIRIAHDGPC 48 (48)
T ss_dssp EESSCS-EEETTSEEESSHHHHHHHHHHTTTSS-EEEESS-
T ss_pred CCCCCCceECCCCCCccccchhhhhhcccCCCeEEeecCCC
Confidence 67889999999999999999996555578889999999987
No 10
>PF07648 Kazal_2: Kazal-type serine protease inhibitor domain; InterPro: IPR011497 This domain is usually indicative of serine protease inhibitors that belong to Merops inhibitor families: I1, I2, I17 and I31. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays and have a central alpha-helix, a short two-stranded antiparallel beta-sheet and several disulphide bonds [, , ]. The amino terminal segment of this domain binds to the active site of its target proteases, thus inhibiting their function.; PDB: 2KMP_A 1LDT_L 2KMR_A 2KMO_A 2KMQ_A 1AN1_I 4E0S_B 3T5O_A 4A5W_B 1LR7_A ....
Probab=98.95 E-value=3.8e-10 Score=80.56 Aligned_cols=39 Identities=44% Similarity=0.887 Sum_probs=36.2
Q ss_pred CCCC-ccccCCCceecchhhhHHHHhhcCceeeeeeeeee
Q psy1764 244 SDTH-QVCGTDGVTYESACHLREAACRRGKSIPIAYRGHC 282 (498)
Q Consensus 244 ~~~~-PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 282 (498)
..+. ||||+||+||.|+|+|..++|..+..+.++|.|+|
T Consensus 3 ~~~~~PVCg~dg~ty~n~C~l~~~~c~~~~~~~~~~~g~C 42 (42)
T PF07648_consen 3 REYSSPVCGSDGKTYSNECELRCANCRTNSKIQIVHDGSC 42 (42)
T ss_dssp SSSSSTEEETTSEEESSHHHHHHHHHHHTTTCEEEEESSS
T ss_pred CCCCCCEEcCCCCEEhhHHHHHHHHHhcCCCeEEEeCCCC
Confidence 4555 99999999999999999999999999999999987
No 11
>PF00050 Kazal_1: Kazal-type serine protease inhibitor domain; InterPro: IPR002350 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of Kazal inhibitors, belongs to MEROPS inhibitor family I1, clan IA. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) []. The members are primarily metazoan, but includes exceptions in the alveolata (apicomplexa), stramenopiles, higher plants and bacteria. Kazal inhibitors, which inhibit a number of serine proteases (such as trypsin and elastase), belong to family of proteins that includes pancreatic secretory trypsin inhibitor; avian ovomucoid; acrosin inhibitor; and elastase inhibitor. These proteins contain between 1 and 7 Kazal-type inhibitor repeats [, ]. The structure of the Kazal repeat includes a large quantity of extended chain, 2 short alpha-helices and a 3-stranded anti-parallel beta sheet [].The inhibitor makes 11 contacts with its enzyme substrate: unusually, 8 of these important residues are hypervariable []. Altering the enzyme-contact residues, and especially that of the active site bond, affects the the strength of inhibition and specificity of the inhibitor for particular serine proteases [, ]. The presence of this Pfam domain is usually indicative of serine protease inhibitors, however, Kazal-like domains are also seen in the extracellular part of agrins which are not known to be proteinase inhibitors.; GO: 0005515 protein binding; PDB: 1HPT_A 1CGI_I 1CGJ_I 2P6A_D 3HH2_C 2B0U_D 2BUS_A 1BUS_A 1OVO_A 3OVO_A ....
Probab=98.92 E-value=6e-10 Score=82.28 Aligned_cols=41 Identities=49% Similarity=0.913 Sum_probs=34.0
Q ss_pred ccCCCCCCCccccCCCceecchhhhHHHHhhcCceeeeeeeeee
Q psy1764 239 CVTPSSDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGHC 282 (498)
Q Consensus 239 Cp~~~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 282 (498)
|+ .++.|||||||+||.|+|+|-.+.-..+..|.++|.|+|
T Consensus 8 C~---~~~~PVCGsdg~TY~N~C~lC~~~~~~~~~i~~~~~G~C 48 (48)
T PF00050_consen 8 CP---REYDPVCGSDGVTYSNECELCAANRRSGKNIRIAHDGPC 48 (48)
T ss_dssp EE---SSCS-EEETTSEEESSHHHHHHHHHHTTTSS-EEEESS-
T ss_pred CC---CCCCceECCCCCCccccchhhhhhcccCCCeEEeecCCC
Confidence 88 899999999999999999995455578889999999987
No 12
>cd00104 KAZAL_FS Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as, trypsin, chyomotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor.
Probab=98.89 E-value=1.1e-09 Score=77.87 Aligned_cols=39 Identities=51% Similarity=1.021 Sum_probs=36.8
Q ss_pred CCCCccccCCCceecchhhhHHHHhhcCceeeeeeeeee
Q psy1764 244 SDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGHC 282 (498)
Q Consensus 244 ~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 282 (498)
..+.||||+||+||.|+|+|..++|..+..+.+.|.|+|
T Consensus 3 ~~~~pVCgsdg~tY~n~C~~~~~~c~~~~~~~~~~~g~C 41 (41)
T cd00104 3 KEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41 (41)
T ss_pred CCCCccCCCCCCEECCHhhhhHHhhcCCCCeEEEeccCC
Confidence 678999999999999999999999998899999999987
No 13
>KOG4004|consensus
Probab=98.74 E-value=3.7e-09 Score=98.38 Aligned_cols=79 Identities=24% Similarity=0.824 Sum_probs=68.3
Q ss_pred ccccCcccceeccCCceeEe-ecCCCceeeCc-hhhhhcccCCCCcccCCCccceeeehhhhhhhcccce-----EEEee
Q psy1764 54 QPCKKTCENIDCGEGKKCAI-KKGYPKCICAP-DCKQRGRHMRNPVCGSDGRNYKTVCRLRKRACRKNEV-----ISVDY 126 (498)
Q Consensus 54 ~~c~d~C~~~~C~~g~~C~~-~~~~~~C~C~~-~C~~~~~~~~~PVCGsDG~TY~n~C~l~~~~C~~~~~-----l~v~~ 126 (498)
.|.+.||.+.+|++|+.|++ ..+.|+|+|.. .|+......+.-|||.|++||.+.|+|.+.+|..... |.+.|
T Consensus 47 e~a~npC~dh~Cg~gk~C~vd~~~~P~Cvc~~~kCP~~~~~p~~KVC~nnNqTf~S~C~~~~~~C~~~~~~k~~k~hl~y 126 (259)
T KOG4004|consen 47 EEARNPCADHKCGPGKNCLVDLQTQPRCVCCRYKCPRKQQRPVHKVCGNNNQTFNSWCEMHKHSCESRYFIKVHKLHLKY 126 (259)
T ss_pred ccccCccccccCCCCceeeecCCCCceeEEecCCCCcccCCchhhhhcCCCcchhHHHHHHHhhhhhhcccccceeehhc
Confidence 45789999999999999999 68899999987 8987444567779999999999999999999987653 55589
Q ss_pred cccccc
Q psy1764 127 FGTCQT 132 (498)
Q Consensus 127 ~G~C~~ 132 (498)
.|+|..
T Consensus 127 ~G~Ck~ 132 (259)
T KOG4004|consen 127 QGSCKY 132 (259)
T ss_pred cccccc
Confidence 999987
No 14
>KOG4004|consensus
Probab=98.45 E-value=1.2e-07 Score=88.55 Aligned_cols=78 Identities=29% Similarity=0.728 Sum_probs=67.7
Q ss_pred CCCCCeeecCCCceeeccCCCCcccccccccccC-CCCCCCccccCCCceecchhhhHHHHhhcCceee-----eeeeee
Q psy1764 208 PGSCDRVTCPSGKFCILDQNLIPHCNRCIRHCVT-PSSDTHQVCGTDGVTYESACHLREAACRRGKSIP-----IAYRGH 281 (498)
Q Consensus 208 ~~~C~~~~C~~~~~C~~d~~~~~~C~~C~~~Cp~-~~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~-----v~~~G~ 281 (498)
.++|++..|..|+.|++|.++.|+|+.|...||. ...++.-|||.|++||.+.|+|.+.+|.....+. +.|.|+
T Consensus 50 ~npC~dh~Cg~gk~C~vd~~~~P~Cvc~~~kCP~~~~~p~~KVC~nnNqTf~S~C~~~~~~C~~~~~~k~~k~hl~y~G~ 129 (259)
T KOG4004|consen 50 RNPCADHKCGPGKNCLVDLQTQPRCVCCRYKCPRKQQRPVHKVCGNNNQTFNSWCEMHKHSCESRYFIKVHKLHLKYQGS 129 (259)
T ss_pred cCccccccCCCCceeeecCCCCceeEEecCCCCcccCCchhhhhcCCCcchhHHHHHHHhhhhhhcccccceeehhcccc
Confidence 4689999999999999999999999977778995 4567788999999999999999999998766544 489999
Q ss_pred eccc
Q psy1764 282 CKLT 285 (498)
Q Consensus 282 C~~~ 285 (498)
|...
T Consensus 130 Ck~i 133 (259)
T KOG4004|consen 130 CKYI 133 (259)
T ss_pred cccC
Confidence 9644
No 15
>KOG4578|consensus
Probab=97.89 E-value=2.5e-06 Score=85.25 Aligned_cols=83 Identities=28% Similarity=0.682 Sum_probs=58.8
Q ss_pred eCchhhhhcccCCCCcccCCCccceeeehhhhhhhcccceEEEeeccccccCCCCceecCCCceeeecCCccccccccee
Q psy1764 82 CAPDCKQRGRHMRNPVCGSDGRNYKTVCRLRKRACRKNEVISVDYFGTCQTGSCDRVTCPSGKFCILDQNLIPHCNRCIR 161 (498)
Q Consensus 82 C~~~C~~~~~~~~~PVCGsDG~TY~n~C~l~~~~C~~~~~l~v~~~G~C~~~~C~~~~c~~~~~C~~d~~~~~~C~~C~~ 161 (498)
|.-+|.. ...|||||||+||.+.|+|.+|+|.. +.|.|+|.|.|.. | +..+....+ .-..
T Consensus 34 c~l~C~~----~~kPvCasDGrty~srCe~qRAkC~d-pql~~~yrG~Ck~--C-----------~~erk~a~e--Qa~k 93 (421)
T KOG4578|consen 34 CQLECDD----NEKPVCASDGRTYPSRCELQRAKCGD-PQLSLKYRGSCKA--C-----------LEERKFARE--QAEK 93 (421)
T ss_pred hccccCC----CCCCccccCCccchhHHHHHHhhcCC-CceeEEecCcHHH--H-----------HHHHHHHHH--hhhc
Confidence 4556654 45999999999999999999999987 6799999999985 3 221110000 0001
Q ss_pred cccCCCCCCCcccCCCCceeehhhhhH
Q psy1764 162 HCVTPSSDTHQVCGTDGVTYESACHLR 188 (498)
Q Consensus 162 ~Cp~~~~~~~PVCGsdG~TY~n~C~l~ 188 (498)
. | ..+.|-|+.||..-.-.||-.
T Consensus 94 ~-~---~vFvPeC~~DG~f~qvQCh~y 116 (421)
T KOG4578|consen 94 D-P---GVFVPECRKDGNFAQVQCHGY 116 (421)
T ss_pred C-C---ceecccccCCCCeeeEEeccc
Confidence 1 2 368899999999888888754
No 16
>KOG3555|consensus
Probab=97.77 E-value=6.6e-06 Score=82.75 Aligned_cols=75 Identities=33% Similarity=0.721 Sum_probs=60.6
Q ss_pred ccccCcccceeccCCceeEe-ecCCCcee--------------------------eCchhhhhcccCCCCcccCCCccce
Q psy1764 54 QPCKKTCENIDCGEGKKCAI-KKGYPKCI--------------------------CAPDCKQRGRHMRNPVCGSDGRNYK 106 (498)
Q Consensus 54 ~~c~d~C~~~~C~~g~~C~~-~~~~~~C~--------------------------C~~~C~~~~~~~~~PVCGsDG~TY~ 106 (498)
.|-||||..++|...++|+. ...++-|+ |. .|+. ....+||||||+||+
T Consensus 84 D~~kdpc~kvkcs~hkvci~Qd~Q~A~cis~k~l~~r~k~a~v~~~q~~d~~l~~CK-pCpv---aq~a~vCGsDghtYs 159 (434)
T KOG3555|consen 84 DPIKDPCLKVKCSRHKVCIAQDYQTAGCISRKQLQHRQKAAGVSVIQWDDPELDNCK-PCPV---AQPAFVCGSDGHTYS 159 (434)
T ss_pred CcccChHhhhcccccceeeccccchhhhHHHHHHhhhccCCCcceecccCcccccCc-cCCc---ccccceecCCCCeeh
Confidence 45799999999999999996 44444332 22 2554 467789999999999
Q ss_pred eeehhhhhhhcccceEEEeecccccc
Q psy1764 107 TVCRLRKRACRKNEVISVDYFGTCQT 132 (498)
Q Consensus 107 n~C~l~~~~C~~~~~l~v~~~G~C~~ 132 (498)
+.|.|.-.+|.....|.|+-.|+|..
T Consensus 160 s~ckLe~~aC~~sksiav~c~g~cpc 185 (434)
T KOG3555|consen 160 SRCKLEYHACHVSKSIAVICEGPCPC 185 (434)
T ss_pred hhhhHHHHhhhhhhhhhhhhCCCCCC
Confidence 99999999999999999998888864
No 17
>KOG4578|consensus
Probab=97.57 E-value=5.8e-05 Score=75.66 Aligned_cols=74 Identities=31% Similarity=0.622 Sum_probs=57.2
Q ss_pred ccccccccccCCCCCCCccccCCCceecchhhhHHHHhhcCceeeeeeeeeeccc-----------ccccceeeeeeeCC
Q psy1764 231 HCNRCIRHCVTPSSDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGHCKLT-----------ESSIKTFWGVWSGT 299 (498)
Q Consensus 231 ~C~~C~~~Cp~~~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C~~~-----------~~~c~~~~~PVCGs 299 (498)
.+..|...|. ....|||+|||+||.+.|+|.+|.|.. ..+.++|.|.|... +.. +..|-|-|+.
T Consensus 30 ~d~~c~l~C~---~~~kPvCasDGrty~srCe~qRAkC~d-pql~~~yrG~Ck~C~~erk~a~eQa~k~-~~vFvPeC~~ 104 (421)
T KOG4578|consen 30 ADAACQLECD---DNEKPVCASDGRTYPSRCELQRAKCGD-PQLSLKYRGSCKACLEERKFAREQAEKD-PGVFVPECRK 104 (421)
T ss_pred cchhhccccC---CCCCCccccCCccchhHHHHHHhhcCC-CceeEEecCcHHHHHHHHHHHHHhhhcC-CceecccccC
Confidence 3344667888 566999999999999999999999965 46899999999522 122 6788899999
Q ss_pred Ccceeeccch
Q psy1764 300 DGVTYESACH 309 (498)
Q Consensus 300 DG~TY~N~C~ 309 (498)
||.--.-.|+
T Consensus 105 DG~f~qvQCh 114 (421)
T KOG4578|consen 105 DGNFAQVQCH 114 (421)
T ss_pred CCCeeeEEec
Confidence 9975444444
No 18
>KOG3555|consensus
Probab=97.11 E-value=0.00013 Score=73.74 Aligned_cols=46 Identities=33% Similarity=0.788 Sum_probs=41.3
Q ss_pred CCCCCCCCCCCCccccCCCcccccccccccccccceEEEEeeeeccCCcc
Q psy1764 439 SCPHRHQGPLPPMLCGSNHVTYPSWCEMKADECQQRIVIETKYSGPCHSQ 488 (498)
Q Consensus 439 ~Cp~~~~~~~~~pVCGSDG~TY~N~C~l~~a~C~~~~~i~v~~~G~C~~~ 488 (498)
+||..+ ..+||||||.||++.|.|..++|...+.|.|+-.|+|.-+
T Consensus 141 pCpvaq----~a~vCGsDghtYss~ckLe~~aC~~sksiav~c~g~cpcp 186 (434)
T KOG3555|consen 141 PCPVAQ----PAFVCGSDGHTYSSRCKLEYHACHVSKSIAVICEGPCPCP 186 (434)
T ss_pred cCCccc----ccceecCCCCeehhhhhHHHHhhhhhhhhhhhhCCCCCCC
Confidence 699864 4689999999999999999999999999999999999643
No 19
>cd01330 KAZAL_SLC21 The kazal-type serine protease inhibitor domain has been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters) , which may regulate the specificity of anion uptake. The KAZAL_SLC21 domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=96.63 E-value=0.0013 Score=49.79 Aligned_cols=22 Identities=45% Similarity=0.819 Sum_probs=19.9
Q ss_pred CCCCccccCCCceecchhhhHH
Q psy1764 244 SDTHQVCGTDGVTYESACHLRE 265 (498)
Q Consensus 244 ~~~~PVCGsDG~TY~N~C~l~~ 265 (498)
..+.||||+||+||.|.|++-.
T Consensus 14 ~~~~PVCg~~g~tY~SpC~AGC 35 (54)
T cd01330 14 SAYSPVCGENGITYFSPCHAGC 35 (54)
T ss_pred CCcCCccCCCCCEEcCHhHcCC
Confidence 5789999999999999999864
No 20
>cd01330 KAZAL_SLC21 The kazal-type serine protease inhibitor domain has been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters) , which may regulate the specificity of anion uptake. The KAZAL_SLC21 domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=96.60 E-value=0.0013 Score=49.72 Aligned_cols=22 Identities=45% Similarity=0.819 Sum_probs=19.7
Q ss_pred CCCCcccCCCCceeehhhhhHH
Q psy1764 168 SDTHQVCGTDGVTYESACHLRE 189 (498)
Q Consensus 168 ~~~~PVCGsdG~TY~n~C~l~~ 189 (498)
..+.||||+||+||.|.|++..
T Consensus 14 ~~~~PVCg~~g~tY~SpC~AGC 35 (54)
T cd01330 14 SAYSPVCGENGITYFSPCHAGC 35 (54)
T ss_pred CCcCCccCCCCCEEcCHhHcCC
Confidence 4688999999999999999764
No 21
>smart00057 FIMAC factor I membrane attack complex.
Probab=96.40 E-value=0.0025 Score=50.68 Aligned_cols=63 Identities=27% Similarity=0.692 Sum_probs=51.5
Q ss_pred ccceeccCCceeEeecCCCceeeCc--hhhhhcccCCCCcccCCCccc--eeeehhhhhhhcccceEEEeeccccc
Q psy1764 60 CENIDCGEGKKCAIKKGYPKCICAP--DCKQRGRHMRNPVCGSDGRNY--KTVCRLRKRACRKNEVISVDYFGTCQ 131 (498)
Q Consensus 60 C~~~~C~~g~~C~~~~~~~~C~C~~--~C~~~~~~~~~PVCGsDG~TY--~n~C~l~~~~C~~~~~l~v~~~G~C~ 131 (498)
|..+.|.++.+|.. .+|+|.. +|+. ....||..|+++| .+.|+|...+|.+ ..+.+.+.|+|.
T Consensus 2 c~c~~C~pWekc~~----~~CvCk~P~qC~~----~~~~vCv~~~~~~~t~S~C~~~a~~C~g-~~~~~~~~g~C~ 68 (69)
T smart00057 2 CAKVFCQPWQKCSA----STCVCKLPYECPK----AGTDVCVEDGRSEKTLTYCKQKSLECLN-PKYKFLHIGSCT 68 (69)
T ss_pred CcCccCCCcccccC----CeeEeCCHhHCCC----CCCCeeEecCceeeeecHHHHHHHHhcC-CCcEEeccCCCC
Confidence 55668888888876 4588864 7775 3358999999999 9999999999997 688999999885
No 22
>smart00057 FIMAC factor I membrane attack complex.
Probab=96.27 E-value=0.0017 Score=51.63 Aligned_cols=64 Identities=25% Similarity=0.552 Sum_probs=51.3
Q ss_pred ceeecCCCceeccccCCCCccccCCCCCCCCCCCCCCCCCccccCCCccc--ccccccccccccceEEEEeeeeccCCc
Q psy1764 411 NIQCQPGQTCLTDLRNGSPRCQTCSYPPSCPHRHQGPLPPMLCGSNHVTY--PSWCEMKADECQQRIVIETKYSGPCHS 487 (498)
Q Consensus 411 ~~~C~~g~~c~~~~~~~~~~C~~C~~~~~Cp~~~~~~~~~pVCGSDG~TY--~N~C~l~~a~C~~~~~i~v~~~G~C~~ 487 (498)
-+.|++++.|+. ++++ |..|..||.. ...||..|+..| .|+|+|....|..+ .+++.|.|+|.+
T Consensus 4 c~~C~pWekc~~----~~Cv---Ck~P~qC~~~-----~~~vCv~~~~~~~t~S~C~~~a~~C~g~-~~~~~~~g~C~~ 69 (69)
T smart00057 4 KVFCQPWQKCSA----STCV---CKLPYECPKA-----GTDVCVEDGRSEKTLTYCKQKSLECLNP-KYKFLHIGSCTA 69 (69)
T ss_pred CccCCCcccccC----CeeE---eCCHhHCCCC-----CCCeeEecCceeeeecHHHHHHHHhcCC-CcEEeccCCCCC
Confidence 346788888864 4444 4578889973 247999999999 99999999999975 899999999964
No 23
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=96.01 E-value=0.0047 Score=39.49 Aligned_cols=23 Identities=39% Similarity=1.084 Sum_probs=21.0
Q ss_pred cccceeccCCceeEe-ecCCCcee
Q psy1764 59 TCENIDCGEGKKCAI-KKGYPKCI 81 (498)
Q Consensus 59 ~C~~~~C~~g~~C~~-~~~~~~C~ 81 (498)
+|+++.|++|++|++ ..++|+|+
T Consensus 1 ~C~~v~C~~G~~C~~d~~g~p~Cv 24 (26)
T smart00274 1 SCRNVQCPFGKVCVVDKGGNARCV 24 (26)
T ss_pred CCCCEECCCCCEEEeCCCCCEEEe
Confidence 589999999999999 78899985
No 24
>PF09289 FOLN: Follistatin/Osteonectin-like EGF domain; InterPro: IPR015369 This domain is predominantly found in osteonectin and follistatin. They adopt an EGF-like structure [, ]. Follistatin is involved in diverse activities from embryonic development to cell secretion. ; GO: 0005515 protein binding; PDB: 1LR7_A 1LR8_A 1LR9_A 2ARP_F 3B4V_H 2KCX_A 3SEK_C 2P6A_D 3HH2_C 2B0U_D ....
Probab=95.92 E-value=0.0042 Score=37.92 Aligned_cols=21 Identities=43% Similarity=1.259 Sum_probs=16.3
Q ss_pred ccceeccCCceeEe-ecCCCce
Q psy1764 60 CENIDCGEGKKCAI-KKGYPKC 80 (498)
Q Consensus 60 C~~~~C~~g~~C~~-~~~~~~C 80 (498)
|+++.|++|++|.+ ..++|+|
T Consensus 1 C~n~~Ck~GKvC~~d~~~~P~C 22 (22)
T PF09289_consen 1 CDNFHCKRGKVCKVDEQGKPHC 22 (22)
T ss_dssp STT---BTTEEEEEETTTCEEE
T ss_pred CCCcccCCCCEeeeCCCCCcCC
Confidence 78999999999999 7899987
No 25
>KOG3509|consensus
Probab=94.44 E-value=0.26 Score=57.10 Aligned_cols=217 Identities=18% Similarity=0.200 Sum_probs=126.0
Q ss_pred cCcccceeccCCceeEeecCCCceeeCchhhhh---------------cccCCCCcccCCCccceeeehhhhhhhcccce
Q psy1764 57 KKTCENIDCGEGKKCAIKKGYPKCICAPDCKQR---------------GRHMRNPVCGSDGRNYKTVCRLRKRACRKNEV 121 (498)
Q Consensus 57 ~d~C~~~~C~~g~~C~~~~~~~~C~C~~~C~~~---------------~~~~~~PVCGsDG~TY~n~C~l~~~~C~~~~~ 121 (498)
.-.|..+.|..|..|.++.+...+.+.++|... ....--++|..+|++|..++.++..++.....
T Consensus 172 ~~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g~~~~~~~~vp~~~e~S~~~~~~~h~~~ 251 (964)
T KOG3509|consen 172 PIHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEGCLKCFCFGVPRPSESSLHAFRAIHGAT 251 (964)
T ss_pred cccccCcccccceeEEecCCcceeeeccccccceehhhccCceecccccccccccceeecCCCccccchhhhHhhhccch
Confidence 346889999999999997766666555555442 11245578999999999999999988888888
Q ss_pred EEEeeccccccCCCCceecCCCceeeecCCcccccccceecccCCCCCCCcccCCCCceeehhhhhHHHHhhcCCcceee
Q psy1764 122 ISVDYFGTCQTGSCDRVTCPSGKFCILDQNLIPHCNRCIRHCVTPSSDTHQVCGTDGVTYESACHLREAACRRGKSIPIA 201 (498)
Q Consensus 122 l~v~~~G~C~~~~C~~~~c~~~~~C~~d~~~~~~C~~C~~~Cp~~~~~~~PVCGsdG~TY~n~C~l~~a~C~~~~~i~~~ 201 (498)
+.+...+.+.. +..+.+-.+..+..+...-..+. .+++ |++|++|...+..+-..+.....+...
T Consensus 252 ~~~~~~~~~~~--~s~~~~~~~~~~~~~~~~~f~~l-----------t~~~--g~~g~~~~~~~~~~~~~~~~~~~~~E~ 316 (964)
T KOG3509|consen 252 LHVDSLGVFFS--DSEHRDILGNFLFSSFKDGFRAL-----------TLDG--GTDGVRYDCGLPQREDRLDVTSYIGEW 316 (964)
T ss_pred hccchheeecc--ccccccccccccccccccceeee-----------ccCC--CCccccccccCcchhhhhcccccccee
Confidence 88888887776 33333332222222211111110 1222 568999999888887776655555555
Q ss_pred eccccCCCCCCeeecCCCceeeccCCCCcccccccccccCCCCCCCccccCCCceecchhhhHHHHhhcCceeeeeeeee
Q psy1764 202 YRGHCKPGSCDRVTCPSGKFCILDQNLIPHCNRCIRHCVTPSSDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGH 281 (498)
Q Consensus 202 ~~G~C~~~~C~~~~C~~~~~C~~d~~~~~~C~~C~~~Cp~~~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~ 281 (498)
+.+.+... +..-|=|.+++-..|++-+.... ......+. .
T Consensus 317 ~~~~i~r~-----------------------------------s~~~~~g~~~~l~g~~~~~~~~i----~~ee~v~l-g 356 (964)
T KOG3509|consen 317 RFGIIFRG-----------------------------------SGLSVSGHKGVLQGNSNILVSRI----TNEESVFL-G 356 (964)
T ss_pred eeeEeeec-----------------------------------ccccccCcceeecccccccccce----eecccccC-C
Confidence 55554320 01111122333333333332111 00000011 1
Q ss_pred ecccc-cccceeeeeeeCCCcceeeccchhhHHHhhcCCcceeeecccc
Q psy1764 282 CKLTE-SSIKTFWGVWSGTDGVTYESACHLREAACRRGKSIPIAYRGHC 329 (498)
Q Consensus 282 C~~~~-~~c~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C 329 (498)
+ ..+ ..-...+.+++|.+|.+|.=...++......++...|+..+.|
T Consensus 357 ~-i~ni~~l~~~~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~~~c 404 (964)
T KOG3509|consen 357 G-IINIETLQHNLPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQGTC 404 (964)
T ss_pred c-eeeeccccccCCCccCccceehhhhhhcccccccccccccccccccc
Confidence 1 000 0112346789999999999999888877776666666666633
No 26
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=94.19 E-value=0.044 Score=35.06 Aligned_cols=24 Identities=42% Similarity=0.962 Sum_probs=22.0
Q ss_pred CCCceecCCCceeeecCCcccccc
Q psy1764 134 SCDRVTCPSGKFCILDQNLIPHCN 157 (498)
Q Consensus 134 ~C~~~~c~~~~~C~~d~~~~~~C~ 157 (498)
+|.++.|+++.+|++|+.+.|+|+
T Consensus 1 ~C~~v~C~~G~~C~~d~~g~p~Cv 24 (26)
T smart00274 1 SCRNVQCPFGKVCVVDKGGNARCV 24 (26)
T ss_pred CCCCEECCCCCEEEeCCCCCEEEe
Confidence 588999999999999999999886
No 27
>KOG3509|consensus
Probab=94.15 E-value=0.3 Score=56.68 Aligned_cols=299 Identities=18% Similarity=0.147 Sum_probs=171.3
Q ss_pred CCCcccCCCccceeeehhhhh-hhcccceEEEeecccccc-CCCCceecCCCceeeecCCcccccccceecccCCC--CC
Q psy1764 94 RNPVCGSDGRNYKTVCRLRKR-ACRKNEVISVDYFGTCQT-GSCDRVTCPSGKFCILDQNLIPHCNRCIRHCVTPS--SD 169 (498)
Q Consensus 94 ~~PVCGsDG~TY~n~C~l~~~-~C~~~~~l~v~~~G~C~~-~~C~~~~c~~~~~C~~d~~~~~~C~~C~~~Cp~~~--~~ 169 (498)
..++|+-++.+|.+.+-+-.. .|+.-..+. ...+.|.- +.|.+ .+.+ -.+..+....+.| .|+.+. ..
T Consensus 59 ~~~~~~~~~~~~~~s~~~~~~~~c~~~~~~~-~~~~~~~g~~~~~~-~~~~-~~~~~~~~~~~~~-----tc~~~~~~~~ 130 (964)
T KOG3509|consen 59 PNSTPSGCNAKPSASDCKPTETQCRDRLRCN-PQSFQCDGTNDCKD-GSDE-VGCKSDALVDPGV-----TCPQVPAGRA 130 (964)
T ss_pred CCCCcCCccccccccccCCcccccccchhcC-CccccccCCCCCCc-cchh-cccccccccCCCC-----CCCCcccccc
Confidence 345999999999998876332 444321111 11223331 11110 0000 0112222222222 122221 26
Q ss_pred CCcccCCCCceeehhhhhHHHHhh-cCCcceeeeccccC--CCCCCeeecCCCceeeccCCCC-----cc------cccc
Q psy1764 170 THQVCGTDGVTYESACHLREAACR-RGKSIPIAYRGHCK--PGSCDRVTCPSGKFCILDQNLI-----PH------CNRC 235 (498)
Q Consensus 170 ~~PVCGsdG~TY~n~C~l~~a~C~-~~~~i~~~~~G~C~--~~~C~~~~C~~~~~C~~d~~~~-----~~------C~~C 235 (498)
+.++|+-++.+|.+.+.-..+.+. ..+.+--.+.|.+. +..|.++.|.++..|..+.... +. |..|
T Consensus 131 ~~~~c~~~~~~~~t~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~ 210 (964)
T KOG3509|consen 131 SSSTCKADTVGEPTIDWRLNAEPVCECKRCCPTSEGGPGTEPIHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEAC 210 (964)
T ss_pred cccchhhhccCCcchhhhccccccccchhcCcccCCCCccccccccCcccccceeEEecCCcceeeeccccccceehhhc
Confidence 788999999999999887776654 22344456677764 5578888999999998754331 11 1111
Q ss_pred cc--cccCC-CCCCCccccCCCceecchhhhHHHHhhcCceeeeeeeeeeccccc------------cccee--eeeeeC
Q psy1764 236 IR--HCVTP-SSDTHQVCGTDGVTYESACHLREAACRRGKSIPIAYRGHCKLTES------------SIKTF--WGVWSG 298 (498)
Q Consensus 236 ~~--~Cp~~-~~~~~PVCGsDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C~~~~~------------~c~~~--~~PVCG 298 (498)
.. .|+.. +-+..++|..+|++|..++.+...++..+....+.+.+.+..... ....+ ..+==|
T Consensus 211 ~~r~~~~f~~~~~g~~~~~~~~vp~~~e~S~~~~~~~h~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~f~~lt~~~g 290 (964)
T KOG3509|consen 211 KPRAFCPFEKSVEGCLKCFCFGVPRPSESSLHAFRAIHGATLHVDSLGVFFSDSEHRDILGNFLFSSFKDGFRALTLDGG 290 (964)
T ss_pred cCceecccccccccccceeecCCCccccchhhhHhhhccchhccchheeeccccccccccccccccccccceeeeccCCC
Confidence 11 12210 145578999999999999999999999988888888888743210 01111 222235
Q ss_pred CCcceeeccchhhHHHhhcCCcceeeeccccccc-----cc-ccc-----cC-CCceeeeecCCCccccCcchhcccccC
Q psy1764 299 TDGVTYESACHLREAACRRGKSIPIAYRGHCKQT-----CE-NID-----CG-EGKKCAIKKGYPKCICAPDCKQRGRHM 366 (498)
Q Consensus 299 sDG~TY~N~C~l~~~~C~~~~~i~v~~~G~C~~~-----c~-~~~-----C~-~g~~C~~~~g~~~C~C~~~C~~~~~~~ 366 (498)
+||++|.-.+..+...++....+.-.+.+.+... -. .+. +. ++.+-..+.-...++....+-+ ..
T Consensus 291 ~~g~~~~~~~~~~~~~~~~~~~~~E~~~~~i~r~s~~~~~g~~~~l~g~~~~~~~~i~~ee~v~lg~i~ni~~l~---~~ 367 (964)
T KOG3509|consen 291 TDGVRYDCGLPQREDRLDVTSYIGEWRFGIIFRGSGLSVSGHKGVLQGNSNILVSRITNEESVFLGGIINIETLQ---HN 367 (964)
T ss_pred CccccccccCcchhhhhccccccceeeeeEeeecccccccCcceeecccccccccceeecccccCCceeeecccc---cc
Confidence 6999999999888877766544443444444320 00 000 00 0001011100112355544443 56
Q ss_pred CCCccCCCCCcchhhhhhhhhhhccCceEEeeecccc
Q psy1764 367 RNPVCGSDGRNYKTVCRLRKRACRKNEVISVDYFGTC 403 (498)
Q Consensus 367 ~~PVCGSDG~TY~N~C~l~~aaC~~~~~i~v~~~G~C 403 (498)
+.+++|.+|.+|.=...++......+....|+..+.|
T Consensus 368 ~~~~eGf~gci~~~~~~~k~l~~~~~~~~~v~~~~~c 404 (964)
T KOG3509|consen 368 LPLPEGFAGCIRDLVMNLKDLRVTLQRASYVAAQGTC 404 (964)
T ss_pred CCCccCccceehhhhhhcccccccccccccccccccc
Confidence 7889999999999999999998888877788888833
No 28
>TIGR00805 oat sodium-independent organic anion transporter. Proteins of the OAT family catalyze the Na+-independent facilitated transport of organic anions such as bromosulfobromophthalein and prostaglandins as well as conjugated and unconjugated bile acids (taurocholate and cholate, respectively). These transporters have been characterized in mammals, but homologues are present in C. elegans and A. thaliana. Some of the mammalian proteins exhibit a high degree of tissue specificity. For example, the rat OAT is found at high levels in liver and kidney and at lower levels in other tissues. These proteins possess 10-12 putative a-helical transmembrane spanners. They may catalyze electrogenic anion uniport or anion exchange.
Probab=84.61 E-value=0.52 Score=53.14 Aligned_cols=30 Identities=43% Similarity=0.858 Sum_probs=24.8
Q ss_pred ceecccCCCCCCCcccCCCCceeehhhhhH
Q psy1764 159 CIRHCVTPSSDTHQVCGTDGVTYESACHLR 188 (498)
Q Consensus 159 C~~~Cp~~~~~~~PVCGsdG~TY~n~C~l~ 188 (498)
|+..|.-....|.||||.||+||-|.||.-
T Consensus 449 Cn~~C~C~~~~~~PVCg~~~~tY~SpC~AG 478 (633)
T TIGR00805 449 CNRQCSCDSSFFDPVCGDNGLAYLSPCHAG 478 (633)
T ss_pred cCCCCCCCCCCcccccCCCCCEEECccccC
Confidence 566666555689999999999999999965
No 29
>TIGR00805 oat sodium-independent organic anion transporter. Proteins of the OAT family catalyze the Na+-independent facilitated transport of organic anions such as bromosulfobromophthalein and prostaglandins as well as conjugated and unconjugated bile acids (taurocholate and cholate, respectively). These transporters have been characterized in mammals, but homologues are present in C. elegans and A. thaliana. Some of the mammalian proteins exhibit a high degree of tissue specificity. For example, the rat OAT is found at high levels in liver and kidney and at lower levels in other tissues. These proteins possess 10-12 putative a-helical transmembrane spanners. They may catalyze electrogenic anion uniport or anion exchange.
Probab=84.42 E-value=0.54 Score=53.03 Aligned_cols=30 Identities=43% Similarity=0.858 Sum_probs=24.1
Q ss_pred ccccccCCCCCCCccccCCCceecchhhhH
Q psy1764 235 CIRHCVTPSSDTHQVCGTDGVTYESACHLR 264 (498)
Q Consensus 235 C~~~Cp~~~~~~~PVCGsDG~TY~N~C~l~ 264 (498)
|...|.-....++||||.||+||-|.|+.-
T Consensus 449 Cn~~C~C~~~~~~PVCg~~~~tY~SpC~AG 478 (633)
T TIGR00805 449 CNRQCSCDSSFFDPVCGDNGLAYLSPCHAG 478 (633)
T ss_pred cCCCCCCCCCCcccccCCCCCEEECccccC
Confidence 446665444789999999999999999963
No 30
>PF09289 FOLN: Follistatin/Osteonectin-like EGF domain; InterPro: IPR015369 This domain is predominantly found in osteonectin and follistatin. They adopt an EGF-like structure [, ]. Follistatin is involved in diverse activities from embryonic development to cell secretion. ; GO: 0005515 protein binding; PDB: 1LR7_A 1LR8_A 1LR9_A 2ARP_F 3B4V_H 2KCX_A 3SEK_C 2P6A_D 3HH2_C 2B0U_D ....
Probab=81.31 E-value=1 Score=27.66 Aligned_cols=22 Identities=45% Similarity=1.117 Sum_probs=16.3
Q ss_pred CCceecCCCceeeecCCccccc
Q psy1764 135 CDRVTCPSGKFCILDQNLIPHC 156 (498)
Q Consensus 135 C~~~~c~~~~~C~~d~~~~~~C 156 (498)
|.++.|..|..|.+|..+.|+|
T Consensus 1 C~n~~Ck~GKvC~~d~~~~P~C 22 (22)
T PF09289_consen 1 CDNFHCKRGKVCKVDEQGKPHC 22 (22)
T ss_dssp STT---BTTEEEEEETTTCEEE
T ss_pred CCCcccCCCCEeeeCCCCCcCC
Confidence 5678899999999999888865
No 31
>KOG3626|consensus
Probab=80.11 E-value=1 Score=51.28 Aligned_cols=31 Identities=39% Similarity=0.832 Sum_probs=25.1
Q ss_pred ceecccCCCCCCCcccCCCCceeehhhhhHH
Q psy1764 159 CIRHCVTPSSDTHQVCGTDGVTYESACHLRE 189 (498)
Q Consensus 159 C~~~Cp~~~~~~~PVCGsdG~TY~n~C~l~~ 189 (498)
|+..|.-....+.||||.||.||-|.||+-.
T Consensus 514 Cn~~C~C~~~~~~PVCg~~G~tY~SpChAGC 544 (735)
T KOG3626|consen 514 CNSDCSCDTSEYEPVCGENGITYFSPCHAGC 544 (735)
T ss_pred hhcCCCCCCcCcCcccCCCCCEEeChhhhCC
Confidence 5565665556899999999999999999653
No 32
>KOG4597|consensus
Probab=80.10 E-value=1.3 Score=47.25 Aligned_cols=64 Identities=30% Similarity=0.785 Sum_probs=54.7
Q ss_pred CcccceeccC-CceeEeecCCCceeeCchhhhhcccCCCCcccCCCccceeeehhhhhhhcccceEEEee
Q psy1764 58 KTCENIDCGE-GKKCAIKKGYPKCICAPDCKQRGRHMRNPVCGSDGRNYKTVCRLRKRACRKNEVISVDY 126 (498)
Q Consensus 58 d~C~~~~C~~-g~~C~~~~~~~~C~C~~~C~~~~~~~~~PVCGsDG~TY~n~C~l~~~~C~~~~~l~v~~ 126 (498)
-.|+.+.|.+ |.+|.+.++.|+|.|...|.. +..--|.+||-+|-| |.|..++|..+..+.|.|
T Consensus 104 a~~~~~~~~qq~s~~dif~~~~r~~~~~~~~~----eP~~~~~d~~~k~~n-~t~cs~aCgKG~q~~iv~ 168 (560)
T KOG4597|consen 104 ALCAQFPCSQQGSVCDIFDGQPRCTCIDRCEK----EPSFTCADDGLKYYN-CTMCSEACGKGVQLRIVY 168 (560)
T ss_pred chhccCccccccccccccCCCCCccccccccc----CCchhhhhcCceecc-eEehhhhhcCCceeeeEE
Confidence 3688888886 999999888899988877765 555689999999999 999999999998887764
No 33
>KOG3626|consensus
Probab=74.96 E-value=1.7 Score=49.52 Aligned_cols=30 Identities=40% Similarity=0.893 Sum_probs=24.5
Q ss_pred ccccccCCCCCCCccccCCCceecchhhhH
Q psy1764 235 CIRHCVTPSSDTHQVCGTDGVTYESACHLR 264 (498)
Q Consensus 235 C~~~Cp~~~~~~~PVCGsDG~TY~N~C~l~ 264 (498)
|...|.-+...+.||||.||.||-+.||.-
T Consensus 514 Cn~~C~C~~~~~~PVCg~~G~tY~SpChAG 543 (735)
T KOG3626|consen 514 CNSDCSCDTSEYEPVCGENGITYFSPCHAG 543 (735)
T ss_pred hhcCCCCCCcCcCcccCCCCCEEeChhhhC
Confidence 446665445899999999999999999964
No 34
>PF00683 TB: TB domain; InterPro: IPR002212 Transforming growth factor beta (TGF-beta)-binding protein-like (TB) domain comes from human fibrillin-1[]. This domain is found in fibrillins and latent TGF-beta-binding proteins (LTBPs) which are localized to fibrillar structures in the extracellular matrix [].; GO: 0005488 binding; PDB: 2W86_A 1UZJ_B 1UZQ_A 1UZK_A 1UZP_A 1APJ_A 1KSQ_A.
Probab=69.15 E-value=1 Score=32.26 Aligned_cols=25 Identities=40% Similarity=1.018 Sum_probs=20.0
Q ss_pred CchhhhhhhhhChhHhhhccCCCCcccc
Q psy1764 5 GRCTELLSEKVGREECCKEANGLGAAWS 32 (498)
Q Consensus 5 ~~c~~~~~~~~~~~~cc~~~~~~~~~~~ 32 (498)
|+|+..|...|||.+||-+. +.||-
T Consensus 1 g~C~~~l~~~~tk~~CCCs~---G~aWG 25 (42)
T PF00683_consen 1 GQCSNPLPGNVTKSECCCSV---GRAWG 25 (42)
T ss_dssp TCEEEEEEEEEEHHHHHTTT----SEET
T ss_pred CcCCCcCCCCeeccccCCCC---CCcCC
Confidence 67888889999999999764 44785
No 35
>KOG4597|consensus
Probab=52.03 E-value=11 Score=40.59 Aligned_cols=60 Identities=30% Similarity=0.778 Sum_probs=45.8
Q ss_pred ccccC-CCceeeeecCCCccccCcchhcccccCCCCccCCCCCcchhhhhhhhhhhccCceEEeee
Q psy1764 335 NIDCG-EGKKCAIKKGYPKCICAPDCKQRGRHMRNPVCGSDGRNYKTVCRLRKRACRKNEVISVDY 399 (498)
Q Consensus 335 ~~~C~-~g~~C~~~~g~~~C~C~~~C~~~~~~~~~PVCGSDG~TY~N~C~l~~aaC~~~~~i~v~~ 399 (498)
...|. .+..|.+.+|.++|.|...|. .+-.--|.+||-||.| |.|...+|.+...+.|.+
T Consensus 108 ~~~~~qq~s~~dif~~~~r~~~~~~~~----~eP~~~~~d~~~k~~n-~t~cs~aCgKG~q~~iv~ 168 (560)
T KOG4597|consen 108 QFPCSQQGSVCDIFDGQPRCTCIDRCE----KEPSFTCADDGLKYYN-CTMCSEACGKGVQLRIVY 168 (560)
T ss_pred cCccccccccccccCCCCCcccccccc----cCCchhhhhcCceecc-eEehhhhhcCCceeeeEE
Confidence 34444 367788888889988865554 2444469999999999 999999999987777654
No 36
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=32.11 E-value=27 Score=23.11 Aligned_cols=25 Identities=32% Similarity=0.774 Sum_probs=19.8
Q ss_pred ccceeccCCceeEeec-CCCceeeCc
Q psy1764 60 CENIDCGEGKKCAIKK-GYPKCICAP 84 (498)
Q Consensus 60 C~~~~C~~g~~C~~~~-~~~~C~C~~ 84 (498)
|....|..|.+|+... +..+|+|++
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~ 26 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPP 26 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBT
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCC
Confidence 3445788899999965 889999975
No 37
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=25.40 E-value=37 Score=23.67 Aligned_cols=25 Identities=24% Similarity=0.667 Sum_probs=17.5
Q ss_pred ccceeccCCceeEe-ecCCCceeeCc
Q psy1764 60 CENIDCGEGKKCAI-KKGYPKCICAP 84 (498)
Q Consensus 60 C~~~~C~~g~~C~~-~~~~~~C~C~~ 84 (498)
|.++.|+.++.|.. .+|+..|.|-.
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCll 27 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLL 27 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-T
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeC
Confidence 67889999999999 56999999864
No 38
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=22.34 E-value=66 Score=20.32 Aligned_cols=23 Identities=30% Similarity=0.904 Sum_probs=17.5
Q ss_pred ceeccCCceeEeecCCCceeeCc
Q psy1764 62 NIDCGEGKKCAIKKGYPKCICAP 84 (498)
Q Consensus 62 ~~~C~~g~~C~~~~~~~~C~C~~ 84 (498)
...|..+..|+...+..+|.|+.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~ 27 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPP 27 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCC
Confidence 45677788899866778898863
No 39
>smart00181 EGF Epidermal growth factor-like domain.
Probab=22.08 E-value=64 Score=20.92 Aligned_cols=21 Identities=33% Similarity=0.833 Sum_probs=16.8
Q ss_pred eeccCCceeEeecCCCceeeCc
Q psy1764 63 IDCGEGKKCAIKKGYPKCICAP 84 (498)
Q Consensus 63 ~~C~~g~~C~~~~~~~~C~C~~ 84 (498)
..|..+ +|+...+..+|.|++
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~ 26 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPP 26 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCC
Confidence 567788 899887888898864
No 40
>PF12190 amfpi-1: Fungal protease inhibitor; InterPro: IPR021066 This entry represents fungal proteinase inhibitors, which are found primarily in Antheraea mylitta (Tasar silkworm); they belong to MEROPS inhibitor family I83, clan JH. They inhibits trypsin and chymotrypsin proteases from the fungi Aspergillus oryzae and Rhizopus oryzae. However they do not inhibit papain-like proteases from the bacterium Bacillus licheniformis []. ; PDB: 3BT4_A.
Probab=21.29 E-value=61 Score=26.29 Aligned_cols=42 Identities=26% Similarity=0.542 Sum_probs=19.6
Q ss_pred CCCccCCCCCcchhhhhhhhhh--hccCceEEeeecccceeeeee
Q psy1764 367 RNPVCGSDGRNYKTVCRLRKRA--CRKNEVISVDYFGTCQMAVTC 409 (498)
Q Consensus 367 ~~PVCGSDG~TY~N~C~l~~aa--C~~~~~i~v~~~G~C~~~~~C 409 (498)
-+.|||||- --.|.|.-..++ |+.+..-+..|.|.|.--..|
T Consensus 8 GdLvCGt~y-Ck~~PC~~~ia~~sCrsps~yRa~HaGKCACCPAC 51 (91)
T PF12190_consen 8 GDLVCGTNY-CKQNPCSSPIAASSCRSPSVYRANHAGKCACCPAC 51 (91)
T ss_dssp ------TTH-HHHS--SS---GGG--TTSEEEEEEESGGG-EEEE
T ss_pred cceeeccch-hhcCCCCChhHHhcCCCcHHHHhhhCCCccCChHH
Confidence 356888861 225777766554 999999999999998643333
Done!