Query psy11294
Match_columns 111
No_of_seqs 159 out of 807
Neff 6.0
Searched_HMMs 46136
Date Fri Aug 16 23:12:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11294.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11294hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd01328 FSL_SPARC Follistatin- 99.8 2.8E-20 6.1E-25 124.1 5.8 62 30-94 1-62 (86)
2 smart00280 KAZAL Kazal type se 99.5 1E-14 2.2E-19 85.9 4.1 42 54-100 1-46 (46)
3 cd01327 KAZAL_PSTI Kazal-type 99.5 4E-14 8.7E-19 83.8 4.1 39 57-100 3-45 (45)
4 PF07648 Kazal_2: Kazal-type s 99.3 1.7E-12 3.6E-17 74.2 2.3 32 69-100 7-42 (42)
5 cd00104 KAZAL_FS Kazal type se 99.2 6.3E-12 1.4E-16 71.9 3.1 37 59-100 1-41 (41)
6 PF00050 Kazal_1: Kazal-type s 99.1 3.8E-11 8.3E-16 71.7 2.8 37 59-100 8-48 (48)
7 KOG4004|consensus 99.1 4.7E-11 1E-15 91.1 3.8 75 26-102 48-132 (259)
8 KOG3555|consensus 98.7 6.5E-09 1.4E-13 84.6 3.2 80 26-111 85-194 (434)
9 KOG4578|consensus 98.3 4E-07 8.6E-12 74.0 2.0 43 55-102 34-79 (421)
10 smart00057 FIMAC factor I memb 98.2 1.9E-06 4.1E-11 55.3 4.5 61 31-102 2-69 (69)
11 cd01330 KAZAL_SLC21 The kazal- 97.5 6.8E-05 1.5E-09 46.0 2.2 18 69-86 17-34 (54)
12 smart00274 FOLN Follistatin-N- 96.5 0.0017 3.6E-08 34.3 1.5 24 30-54 1-24 (26)
13 PF09289 FOLN: Follistatin/Ost 94.9 0.0091 2E-07 30.4 0.3 22 31-53 1-22 (22)
14 TIGR00805 oat sodium-independe 94.0 0.033 7.2E-07 48.0 2.0 18 68-85 460-477 (633)
15 cd01327 KAZAL_PSTI Kazal-type 93.1 0.027 5.8E-07 32.9 0.0 13 12-24 16-28 (45)
16 KOG3626|consensus 88.8 0.28 6.1E-06 43.8 2.2 18 68-85 525-542 (735)
17 KOG4597|consensus 78.9 2.1 4.6E-05 36.8 3.1 57 30-94 105-162 (560)
18 PF00008 EGF: EGF-like domain 64.9 2.7 5.8E-05 22.4 0.4 26 31-57 1-26 (32)
19 KOG3509|consensus 48.0 27 0.00059 32.5 4.1 61 29-94 173-250 (964)
20 KOG1219|consensus 41.8 31 0.00067 35.9 3.6 28 28-57 3903-3930(4289)
21 cd00053 EGF Epidermal growth f 37.6 37 0.0008 16.7 2.1 22 33-56 5-26 (36)
22 smart00179 EGF_CA Calcium-bind 32.4 55 0.0012 16.7 2.3 26 30-57 4-30 (39)
23 smart00181 EGF Epidermal growt 28.5 65 0.0014 16.3 2.1 21 34-57 6-26 (35)
24 PF12946 EGF_MSP1_1: MSP1 EGF 26.5 21 0.00045 20.2 -0.1 25 31-56 2-26 (37)
25 cd00054 EGF_CA Calcium-binding 24.4 95 0.0021 15.4 2.3 26 30-57 4-30 (38)
26 PF00954 S_locus_glycop: S-loc 22.5 87 0.0019 20.6 2.4 28 27-57 76-104 (110)
No 1
>cd01328 FSL_SPARC Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=99.81 E-value=2.8e-20 Score=124.06 Aligned_cols=62 Identities=31% Similarity=0.726 Sum_probs=55.2
Q ss_pred CCCCCCCCCCCeeccCCCCCCccCcCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC
Q psy11294 30 PCQENTCQYGAKCIPSEDGTSYKCECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM 94 (111)
Q Consensus 30 pC~~~~C~~g~~C~~~~~g~~~~C~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~ 94 (111)
||+++.|+.|++|+++. .++|+|+|+..||...+ +.++||||||+||+|+|+|++++|..++
T Consensus 1 pC~~v~C~~G~~C~~d~-~~~p~CvC~~~Cp~~~~--~~~~VCGsDg~TY~s~C~L~r~~C~~~~ 62 (86)
T cd01328 1 PCENHHCGAGKVCEVDD-ENTPKCVCIDPCPEEVD--DRRKVCTNDNETFDSDCELYRTRCLCKG 62 (86)
T ss_pred CCCCcCCCCCCEeeECC-CCCeEEecCCcCCCCCC--CCCceeCCCCCCcccHhHHhhhHhccCC
Confidence 79999999999999975 45799999999999763 3478999999999999999999998664
No 2
>smart00280 KAZAL Kazal type serine protease inhibitors. Kazal type serine protease inhibitors and follistatin-like domains.
Probab=99.53 E-value=1e-14 Score=85.88 Aligned_cols=42 Identities=38% Similarity=0.903 Sum_probs=35.9
Q ss_pred cCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294 54 ECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL 100 (111)
Q Consensus 54 ~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C 100 (111)
+|+..|+..+ .|||||||+||.|+|+|++++|..+. .+.|+|
T Consensus 1 ~C~~~C~~~~-----~pVCgsdg~TY~N~C~l~~~~c~~~~~i~~~~~G~C 46 (46)
T smart00280 1 DCPEACPREY-----DPVCGSDGVTYSNECHLCKAACESGKSIEVKHDGPC 46 (46)
T ss_pred CCCCCCCCCC-----CccCCCCCCEeCCHhHHHHHHhcCCCCeEEeecCCC
Confidence 3778899887 89999999999999999999998554 466766
No 3
>cd01327 KAZAL_PSTI Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins, including the second domain of the ovomucoid turkey inhibitor and the C-terminal domain of the esophagus cancer-related gene-2 protein (ECRG-2), are members of the superfamily of kazal-type proteinase inhibitors and follistatin-like proteins.
Probab=99.48 E-value=4e-14 Score=83.80 Aligned_cols=39 Identities=28% Similarity=0.565 Sum_probs=33.8
Q ss_pred CCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294 57 QECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL 100 (111)
Q Consensus 57 ~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C 100 (111)
..||..+ .|||||||+||.|+|+|+.++|..+. +++|+|
T Consensus 3 ~~Cp~~~-----~PVCGsDg~TY~N~C~l~~~~c~~~~~i~~~~~G~C 45 (45)
T cd01327 3 FGCPKDY-----DPVCGTDGVTYSNECLLCAENLKRQTNIRIKHDGEC 45 (45)
T ss_pred CCCCCCC-----CceeCCCCCEeCCHhHHHHHHhccCCCeEEeecCCC
Confidence 4678887 79999999999999999999998776 567776
No 4
>PF07648 Kazal_2: Kazal-type serine protease inhibitor domain; InterPro: IPR011497 This domain is usually indicative of serine protease inhibitors that belong to Merops inhibitor families: I1, I2, I17 and I31. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays and have a central alpha-helix, a short two-stranded antiparallel beta-sheet and several disulphide bonds [, , ]. The amino terminal segment of this domain binds to the active site of its target proteases, thus inhibiting their function.; PDB: 2KMP_A 1LDT_L 2KMR_A 2KMO_A 2KMQ_A 1AN1_I 4E0S_B 3T5O_A 4A5W_B 1LR7_A ....
Probab=99.28 E-value=1.7e-12 Score=74.20 Aligned_cols=32 Identities=41% Similarity=0.854 Sum_probs=28.0
Q ss_pred CCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294 69 RPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL 100 (111)
Q Consensus 69 ~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C 100 (111)
.||||+||+||.|+|+|++++|..+. .++|+|
T Consensus 7 ~PVCg~dg~ty~n~C~l~~~~c~~~~~~~~~~~g~C 42 (42)
T PF07648_consen 7 SPVCGSDGKTYSNECELRCANCRTNSKIQIVHDGSC 42 (42)
T ss_dssp STEEETTSEEESSHHHHHHHHHHHTTTCEEEEESSS
T ss_pred CCEEcCCCCEEhhHHHHHHHHHhcCCCeEEEeCCCC
Confidence 49999999999999999999999765 456766
No 5
>cd00104 KAZAL_FS Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as, trypsin, chyomotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor.
Probab=99.24 E-value=6.3e-12 Score=71.90 Aligned_cols=37 Identities=35% Similarity=0.748 Sum_probs=30.1
Q ss_pred CCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294 59 CPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL 100 (111)
Q Consensus 59 C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C 100 (111)
||..+ .||||+||+||.|+|+|+.++|..+. .+.|+|
T Consensus 1 C~~~~-----~pVCgsdg~tY~n~C~~~~~~c~~~~~~~~~~~g~C 41 (41)
T cd00104 1 CPKEY-----DPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41 (41)
T ss_pred CCCCC-----CccCCCCCCEECCHhhhhHHhhcCCCCeEEEeccCC
Confidence 55555 79999999999999999999998554 455655
No 6
>PF00050 Kazal_1: Kazal-type serine protease inhibitor domain; InterPro: IPR002350 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of Kazal inhibitors, belongs to MEROPS inhibitor family I1, clan IA. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) []. The members are primarily metazoan, but includes exceptions in the alveolata (apicomplexa), stramenopiles, higher plants and bacteria. Kazal inhibitors, which inhibit a number of serine proteases (such as trypsin and elastase), belong to family of proteins that includes pancreatic secretory trypsin inhibitor; avian ovomucoid; acrosin inhibitor; and elastase inhibitor. These proteins contain between 1 and 7 Kazal-type inhibitor repeats [, ]. The structure of the Kazal repeat includes a large quantity of extended chain, 2 short alpha-helices and a 3-stranded anti-parallel beta sheet [].The inhibitor makes 11 contacts with its enzyme substrate: unusually, 8 of these important residues are hypervariable []. Altering the enzyme-contact residues, and especially that of the active site bond, affects the the strength of inhibition and specificity of the inhibitor for particular serine proteases [, ]. The presence of this Pfam domain is usually indicative of serine protease inhibitors, however, Kazal-like domains are also seen in the extracellular part of agrins which are not known to be proteinase inhibitors.; GO: 0005515 protein binding; PDB: 1HPT_A 1CGI_I 1CGJ_I 2P6A_D 3HH2_C 2B0U_D 2BUS_A 1BUS_A 1OVO_A 3OVO_A ....
Probab=99.12 E-value=3.8e-11 Score=71.66 Aligned_cols=37 Identities=38% Similarity=0.692 Sum_probs=25.6
Q ss_pred CCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC----CccccC
Q psy11294 59 CPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM----LSRGFL 100 (111)
Q Consensus 59 C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~----~~~G~C 100 (111)
|+..+ .|||||||+||.|+|+|-.+.-+.+. ++.|+|
T Consensus 8 C~~~~-----~PVCGsdg~TY~N~C~lC~~~~~~~~~i~~~~~G~C 48 (48)
T PF00050_consen 8 CPREY-----DPVCGSDGVTYSNECELCAANRRSGKNIRIAHDGPC 48 (48)
T ss_dssp EESSC-----S-EEETTSEEESSHHHHHHHHHHTTTSS-EEEESS-
T ss_pred CCCCC-----CceECCCCCCccccchhhhhhcccCCCeEEeecCCC
Confidence 66665 89999999999999999323334443 467776
No 7
>KOG4004|consensus
Probab=99.12 E-value=4.7e-11 Score=91.09 Aligned_cols=75 Identities=27% Similarity=0.693 Sum_probs=61.6
Q ss_pred CCCCCCCCCCCCCCCeeccCCCCCCccCcCCC-CCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCCC---------
Q psy11294 26 ELTNPCQENTCQYGAKCIPSEDGTSYKCECPQ-ECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKML--------- 95 (111)
Q Consensus 26 ~~~~pC~~~~C~~g~~C~~~~~g~~~~C~C~~-~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~~--------- 95 (111)
+.++||++..|++|..|+++..+ .|+|+|.. .||... ..+...|||.|++||.|.|+|++.+|.....
T Consensus 48 ~a~npC~dh~Cg~gk~C~vd~~~-~P~Cvc~~~kCP~~~-~~p~~KVC~nnNqTf~S~C~~~~~~C~~~~~~k~~k~hl~ 125 (259)
T KOG4004|consen 48 EARNPCADHKCGPGKNCLVDLQT-QPRCVCCRYKCPRKQ-QRPVHKVCGNNNQTFNSWCEMHKHSCESRYFIKVHKLHLK 125 (259)
T ss_pred cccCccccccCCCCceeeecCCC-CceeEEecCCCCccc-CCchhhhhcCCCcchhHHHHHHHhhhhhhcccccceeehh
Confidence 45899999999999999998665 79999885 999842 1123569999999999999999999998763
Q ss_pred ccccCCC
Q psy11294 96 SRGFLLD 102 (111)
Q Consensus 96 ~~G~C~~ 102 (111)
+-|+|+-
T Consensus 126 y~G~Ck~ 132 (259)
T KOG4004|consen 126 YQGSCKY 132 (259)
T ss_pred ccccccc
Confidence 4567764
No 8
>KOG3555|consensus
Probab=98.74 E-value=6.5e-09 Score=84.58 Aligned_cols=80 Identities=26% Similarity=0.563 Sum_probs=62.7
Q ss_pred CCCCCCCCCCCCCCCeeccCCCCCCccCc--------------------------CCCCCCCCCCCCCCCCeecCCCcee
Q psy11294 26 ELTNPCQENTCQYGAKCIPSEDGTSYKCE--------------------------CPQECPNYGDHTGSRPVCGSNGVDY 79 (111)
Q Consensus 26 ~~~~pC~~~~C~~g~~C~~~~~g~~~~C~--------------------------C~~~C~~~~d~~~~~pVCGSDG~TY 79 (111)
...+||..+.|.+..+|+.. +.+++.|+ |. .||... ..+||||||.||
T Consensus 85 ~~kdpc~kvkcs~hkvci~Q-d~Q~A~cis~k~l~~r~k~a~v~~~q~~d~~l~~CK-pCpvaq----~a~vCGsDghtY 158 (434)
T KOG3555|consen 85 PIKDPCLKVKCSRHKVCIAQ-DYQTAGCISRKQLQHRQKAAGVSVIQWDDPELDNCK-PCPVAQ----PAFVCGSDGHTY 158 (434)
T ss_pred cccChHhhhcccccceeecc-ccchhhhHHHHHHhhhccCCCcceecccCcccccCc-cCCccc----ccceecCCCCee
Confidence 45789999999999999874 33344442 32 688776 488999999999
Q ss_pred cCHhHHHHHHhhcCC----CccccCCCccCCCCCCC
Q psy11294 80 KDLCEFQRASCSTKM----LSRGFLLDIWIPPSPKH 111 (111)
Q Consensus 80 ~n~C~L~~aaC~~~~----~~~G~C~~i~~pp~~~~ 111 (111)
++.|.|...||...+ ..+|+|--...++.|+|
T Consensus 159 ss~ckLe~~aC~~sksiav~c~g~cpcps~~d~~~~ 194 (434)
T KOG3555|consen 159 SSRCKLEYHACHVSKSIAVICEGPCPCPSEADKPVH 194 (434)
T ss_pred hhhhhHHHHhhhhhhhhhhhhCCCCCCCCCCCcchh
Confidence 999999999999887 46888876666666654
No 9
>KOG4578|consensus
Probab=98.26 E-value=4e-07 Score=73.96 Aligned_cols=43 Identities=40% Similarity=0.792 Sum_probs=37.4
Q ss_pred CCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC---CccccCCC
Q psy11294 55 CPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM---LSRGFLLD 102 (111)
Q Consensus 55 C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~---~~~G~C~~ 102 (111)
|+..|.... .|||||||+||.+.|+|++|.|...+ +++|.|+.
T Consensus 34 c~l~C~~~~-----kPvCasDGrty~srCe~qRAkC~dpql~~~yrG~Ck~ 79 (421)
T KOG4578|consen 34 CQLECDDNE-----KPVCASDGRTYPSRCELQRAKCGDPQLSLKYRGSCKA 79 (421)
T ss_pred hccccCCCC-----CCccccCCccchhHHHHHHhhcCCCceeEEecCcHHH
Confidence 777777764 89999999999999999999999887 57888864
No 10
>smart00057 FIMAC factor I membrane attack complex.
Probab=98.24 E-value=1.9e-06 Score=55.32 Aligned_cols=61 Identities=30% Similarity=0.585 Sum_probs=51.5
Q ss_pred CCCCCCCCCCeeccCCCCCCccCcCC--CCCCCCCCCCCCCCeecCCCcee--cCHhHHHHHHhhcCC---CccccCCC
Q psy11294 31 CQENTCQYGAKCIPSEDGTSYKCECP--QECPNYGDHTGSRPVCGSNGVDY--KDLCEFQRASCSTKM---LSRGFLLD 102 (111)
Q Consensus 31 C~~~~C~~g~~C~~~~~g~~~~C~C~--~~C~~~~d~~~~~pVCGSDG~TY--~n~C~L~~aaC~~~~---~~~G~C~~ 102 (111)
|..+.|.+|+.|+. .+|+|. .+||... ..||..|+..| .|+|+|....|+.++ .+.|+|.+
T Consensus 2 c~c~~C~pWekc~~------~~CvCk~P~qC~~~~-----~~vCv~~~~~~~t~S~C~~~a~~C~g~~~~~~~~g~C~~ 69 (69)
T smart00057 2 CAKVFCQPWQKCSA------STCVCKLPYECPKAG-----TDVCVEDGRSEKTLTYCKQKSLECLNPKYKFLHIGSCTA 69 (69)
T ss_pred CcCccCCCcccccC------CeeEeCCHhHCCCCC-----CCeeEecCceeeeecHHHHHHHHhcCCCcEEeccCCCCC
Confidence 56678999999986 459988 6899864 58999999999 999999999999876 57888863
No 11
>cd01330 KAZAL_SLC21 The kazal-type serine protease inhibitor domain has been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters) , which may regulate the specificity of anion uptake. The KAZAL_SLC21 domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
Probab=97.51 E-value=6.8e-05 Score=45.97 Aligned_cols=18 Identities=44% Similarity=0.997 Sum_probs=16.7
Q ss_pred CCeecCCCceecCHhHHH
Q psy11294 69 RPVCGSNGVDYKDLCEFQ 86 (111)
Q Consensus 69 ~pVCGSDG~TY~n~C~L~ 86 (111)
.||||+||+||.|.|+..
T Consensus 17 ~PVCg~~g~tY~SpC~AG 34 (54)
T cd01330 17 SPVCGENGITYFSPCHAG 34 (54)
T ss_pred CCccCCCCCEEcCHhHcC
Confidence 799999999999999865
No 12
>smart00274 FOLN Follistatin-N-terminal domain-like. Follistatin-N-terminal domain-like, EGF-like. Region distinct from the kazal-like sequence
Probab=96.49 E-value=0.0017 Score=34.28 Aligned_cols=24 Identities=21% Similarity=0.798 Sum_probs=20.0
Q ss_pred CCCCCCCCCCCeeccCCCCCCccCc
Q psy11294 30 PCQENTCQYGAKCIPSEDGTSYKCE 54 (111)
Q Consensus 30 pC~~~~C~~g~~C~~~~~g~~~~C~ 54 (111)
+|+++.|++|..|++++. ++|+|+
T Consensus 1 ~C~~v~C~~G~~C~~d~~-g~p~Cv 24 (26)
T smart00274 1 SCRNVQCPFGKVCVVDKG-GNARCV 24 (26)
T ss_pred CCCCEECCCCCEEEeCCC-CCEEEe
Confidence 589999999999999844 468775
No 13
>PF09289 FOLN: Follistatin/Osteonectin-like EGF domain; InterPro: IPR015369 This domain is predominantly found in osteonectin and follistatin. They adopt an EGF-like structure [, ]. Follistatin is involved in diverse activities from embryonic development to cell secretion. ; GO: 0005515 protein binding; PDB: 1LR7_A 1LR8_A 1LR9_A 2ARP_F 3B4V_H 2KCX_A 3SEK_C 2P6A_D 3HH2_C 2B0U_D ....
Probab=94.88 E-value=0.0091 Score=30.41 Aligned_cols=22 Identities=32% Similarity=0.816 Sum_probs=14.9
Q ss_pred CCCCCCCCCCeeccCCCCCCccC
Q psy11294 31 CQENTCQYGAKCIPSEDGTSYKC 53 (111)
Q Consensus 31 C~~~~C~~g~~C~~~~~g~~~~C 53 (111)
|.++.|++|.+|.++.++ +|.|
T Consensus 1 C~n~~Ck~GKvC~~d~~~-~P~C 22 (22)
T PF09289_consen 1 CDNFHCKRGKVCKVDEQG-KPHC 22 (22)
T ss_dssp STT---BTTEEEEEETTT-CEEE
T ss_pred CCCcccCCCCEeeeCCCC-CcCC
Confidence 788999999999997654 5654
No 14
>TIGR00805 oat sodium-independent organic anion transporter. Proteins of the OAT family catalyze the Na+-independent facilitated transport of organic anions such as bromosulfobromophthalein and prostaglandins as well as conjugated and unconjugated bile acids (taurocholate and cholate, respectively). These transporters have been characterized in mammals, but homologues are present in C. elegans and A. thaliana. Some of the mammalian proteins exhibit a high degree of tissue specificity. For example, the rat OAT is found at high levels in liver and kidney and at lower levels in other tissues. These proteins possess 10-12 putative a-helical transmembrane spanners. They may catalyze electrogenic anion uniport or anion exchange.
Probab=93.97 E-value=0.033 Score=47.99 Aligned_cols=18 Identities=44% Similarity=0.960 Sum_probs=16.3
Q ss_pred CCCeecCCCceecCHhHH
Q psy11294 68 SRPVCGSNGVDYKDLCEF 85 (111)
Q Consensus 68 ~~pVCGSDG~TY~n~C~L 85 (111)
..||||.||+||-|.|+-
T Consensus 460 ~~PVCg~~~~tY~SpC~A 477 (633)
T TIGR00805 460 FDPVCGDNGLAYLSPCHA 477 (633)
T ss_pred cccccCCCCCEEECcccc
Confidence 389999999999999974
No 15
>cd01327 KAZAL_PSTI Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins, including the second domain of the ovomucoid turkey inhibitor and the C-terminal domain of the esophagus cancer-related gene-2 protein (ECRG-2), are members of the superfamily of kazal-type proteinase inhibitors and follistatin-like proteins.
Probab=93.13 E-value=0.027 Score=32.91 Aligned_cols=13 Identities=8% Similarity=0.225 Sum_probs=11.4
Q ss_pred ecceeCCCcccCC
Q psy11294 12 LILDFLYACYIFP 24 (111)
Q Consensus 12 ~g~TY~NeC~L~~ 24 (111)
+|+||.|+|+|+.
T Consensus 16 Dg~TY~N~C~l~~ 28 (45)
T cd01327 16 DGVTYSNECLLCA 28 (45)
T ss_pred CCCEeCCHhHHHH
Confidence 6999999999963
No 16
>KOG3626|consensus
Probab=88.83 E-value=0.28 Score=43.80 Aligned_cols=18 Identities=44% Similarity=0.992 Sum_probs=16.2
Q ss_pred CCCeecCCCceecCHhHH
Q psy11294 68 SRPVCGSNGVDYKDLCEF 85 (111)
Q Consensus 68 ~~pVCGSDG~TY~n~C~L 85 (111)
-.||||.||.||-|.|+-
T Consensus 525 ~~PVCg~~G~tY~SpChA 542 (735)
T KOG3626|consen 525 YEPVCGENGITYFSPCHA 542 (735)
T ss_pred cCcccCCCCCEEeChhhh
Confidence 389999999999999973
No 17
>KOG4597|consensus
Probab=78.87 E-value=2.1 Score=36.85 Aligned_cols=57 Identities=21% Similarity=0.632 Sum_probs=44.3
Q ss_pred CCCCCCCC-CCCeeccCCCCCCccCcCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhhcCC
Q psy11294 30 PCQENTCQ-YGAKCIPSEDGTSYKCECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCSTKM 94 (111)
Q Consensus 30 pC~~~~C~-~g~~C~~~~~g~~~~C~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~~~~ 94 (111)
.|+...|. .+.+|..-. +.++|.|...|..+. .--|.+||-+|.| |.|-..+|..+.
T Consensus 105 ~~~~~~~~qq~s~~dif~--~~~r~~~~~~~~~eP-----~~~~~d~~~k~~n-~t~cs~aCgKG~ 162 (560)
T KOG4597|consen 105 LCAQFPCSQQGSVCDIFD--GQPRCTCIDRCEKEP-----SFTCADDGLKYYN-CTMCSEACGKGV 162 (560)
T ss_pred hhccCccccccccccccC--CCCCcccccccccCC-----chhhhhcCceecc-eEehhhhhcCCc
Confidence 46666665 455666542 248899999898875 5679999999999 999999998875
No 18
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=64.91 E-value=2.7 Score=22.43 Aligned_cols=26 Identities=50% Similarity=1.227 Sum_probs=19.2
Q ss_pred CCCCCCCCCCeeccCCCCCCccCcCCC
Q psy11294 31 CQENTCQYGAKCIPSEDGTSYKCECPQ 57 (111)
Q Consensus 31 C~~~~C~~g~~C~~~~~g~~~~C~C~~ 57 (111)
|....|..++.|+... .+...|+|+.
T Consensus 1 C~~~~C~n~g~C~~~~-~~~y~C~C~~ 26 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLP-GGGYTCECPP 26 (32)
T ss_dssp TTTTSSTTTEEEEEES-TSEEEEEEBT
T ss_pred CCCCcCCCCeEEEeCC-CCCEEeECCC
Confidence 4455788899999864 3468899884
No 19
>KOG3509|consensus
Probab=47.96 E-value=27 Score=32.48 Aligned_cols=61 Identities=21% Similarity=0.261 Sum_probs=44.6
Q ss_pred CCCCCCCCCCCCeeccCCCCC-----------------CccCcCCCCCCCCCCCCCCCCeecCCCceecCHhHHHHHHhh
Q psy11294 29 NPCQENTCQYGAKCIPSEDGT-----------------SYKCECPQECPNYGDHTGSRPVCGSNGVDYKDLCEFQRASCS 91 (111)
Q Consensus 29 ~pC~~~~C~~g~~C~~~~~g~-----------------~~~C~C~~~C~~~~d~~~~~pVCGSDG~TY~n~C~L~~aaC~ 91 (111)
-+|....|.+|..|.+..+-+ .+.|.|+..+..+. .++|..+|++|..++-++..++.
T Consensus 173 ~~~~~~~~q~g~tC~~~~~~~~~~~~~~~~~~~~c~~~~~r~~~~f~~~~~g-----~~~~~~~~vp~~~e~S~~~~~~~ 247 (964)
T KOG3509|consen 173 IHCAQPVCQGGATCEVRNGKGYSLECPDCKVRVVCEACKPRAFCPFEKSVEG-----CLKCFCFGVPRPSESSLHAFRAI 247 (964)
T ss_pred ccccCcccccceeEEecCCcceeeeccccccceehhhccCceeccccccccc-----ccceeecCCCccccchhhhHhhh
Confidence 358888999999999864311 12233444444443 78999999999999999998888
Q ss_pred cCC
Q psy11294 92 TKM 94 (111)
Q Consensus 92 ~~~ 94 (111)
.+.
T Consensus 248 h~~ 250 (964)
T KOG3509|consen 248 HGA 250 (964)
T ss_pred ccc
Confidence 765
No 20
>KOG1219|consensus
Probab=41.77 E-value=31 Score=35.93 Aligned_cols=28 Identities=39% Similarity=1.088 Sum_probs=21.7
Q ss_pred CCCCCCCCCCCCCeeccCCCCCCccCcCCC
Q psy11294 28 TNPCQENTCQYGAKCIPSEDGTSYKCECPQ 57 (111)
Q Consensus 28 ~~pC~~~~C~~g~~C~~~~~g~~~~C~C~~ 57 (111)
..||...+|..|+.|++.. +...|.|+.
T Consensus 3903 ~epC~snPC~~GgtCip~~--n~f~CnC~~ 3930 (4289)
T KOG1219|consen 3903 LEPCASNPCLTGGTCIPFY--NGFLCNCPN 3930 (4289)
T ss_pred cccccCCCCCCCCEEEecC--CCeeEeCCC
Confidence 4688888899999999864 347888873
No 21
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=37.64 E-value=37 Score=16.67 Aligned_cols=22 Identities=41% Similarity=1.159 Sum_probs=15.4
Q ss_pred CCCCCCCCeeccCCCCCCccCcCC
Q psy11294 33 ENTCQYGAKCIPSEDGTSYKCECP 56 (111)
Q Consensus 33 ~~~C~~g~~C~~~~~g~~~~C~C~ 56 (111)
...|..++.|+... +...|.|+
T Consensus 5 ~~~C~~~~~C~~~~--~~~~C~C~ 26 (36)
T cd00053 5 SNPCSNGGTCVNTP--GSYRCVCP 26 (36)
T ss_pred CCCCCCCCEEecCC--CCeEeECC
Confidence 44677788998754 34788877
No 22
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=32.42 E-value=55 Score=16.71 Aligned_cols=26 Identities=50% Similarity=1.299 Sum_probs=15.8
Q ss_pred CCCC-CCCCCCCeeccCCCCCCccCcCCC
Q psy11294 30 PCQE-NTCQYGAKCIPSEDGTSYKCECPQ 57 (111)
Q Consensus 30 pC~~-~~C~~g~~C~~~~~g~~~~C~C~~ 57 (111)
.|.. ..|..++.|+... + ...|.|+.
T Consensus 4 ~C~~~~~C~~~~~C~~~~-g-~~~C~C~~ 30 (39)
T smart00179 4 ECASGNPCQNGGTCVNTV-G-SYRCECPP 30 (39)
T ss_pred cCcCCCCcCCCCEeECCC-C-CeEeECCC
Confidence 3444 4567777898642 2 46787763
No 23
>smart00181 EGF Epidermal growth factor-like domain.
Probab=28.45 E-value=65 Score=16.31 Aligned_cols=21 Identities=43% Similarity=1.135 Sum_probs=14.5
Q ss_pred CCCCCCCeeccCCCCCCccCcCCC
Q psy11294 34 NTCQYGAKCIPSEDGTSYKCECPQ 57 (111)
Q Consensus 34 ~~C~~g~~C~~~~~g~~~~C~C~~ 57 (111)
..|..+ .|+.. .+...|.|+.
T Consensus 6 ~~C~~~-~C~~~--~~~~~C~C~~ 26 (35)
T smart00181 6 GPCSNG-TCINT--PGSYTCSCPP 26 (35)
T ss_pred CCCCCC-EEECC--CCCeEeECCC
Confidence 467778 89875 3457887774
No 24
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=26.53 E-value=21 Score=20.22 Aligned_cols=25 Identities=36% Similarity=0.922 Sum_probs=16.1
Q ss_pred CCCCCCCCCCeeccCCCCCCccCcCC
Q psy11294 31 CQENTCQYGAKCIPSEDGTSYKCECP 56 (111)
Q Consensus 31 C~~~~C~~g~~C~~~~~g~~~~C~C~ 56 (111)
|.+..|+..+.|+...+| +..|.|.
T Consensus 2 C~~~~cP~NA~C~~~~dG-~eecrCl 26 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDG-SEECRCL 26 (37)
T ss_dssp -SSS---TTEEEEEETTS-EEEEEE-
T ss_pred ccCccCCCCcccEEcCCC-CEEEEee
Confidence 778899999999987555 6778876
No 25
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=24.40 E-value=95 Score=15.37 Aligned_cols=26 Identities=42% Similarity=1.158 Sum_probs=15.5
Q ss_pred CCCC-CCCCCCCeeccCCCCCCccCcCCC
Q psy11294 30 PCQE-NTCQYGAKCIPSEDGTSYKCECPQ 57 (111)
Q Consensus 30 pC~~-~~C~~g~~C~~~~~g~~~~C~C~~ 57 (111)
.|.. ..|..++.|+... +...|.|+.
T Consensus 4 ~C~~~~~C~~~~~C~~~~--~~~~C~C~~ 30 (38)
T cd00054 4 ECASGNPCQNGGTCVNTV--GSYRCSCPP 30 (38)
T ss_pred cCCCCCCcCCCCEeECCC--CCeEeECCC
Confidence 4444 4566677887643 236687763
No 26
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=22.53 E-value=87 Score=20.61 Aligned_cols=28 Identities=25% Similarity=0.574 Sum_probs=21.3
Q ss_pred CCCCCCC-CCCCCCCeeccCCCCCCccCcCCC
Q psy11294 27 LTNPCQE-NTCQYGAKCIPSEDGTSYKCECPQ 57 (111)
Q Consensus 27 ~~~pC~~-~~C~~g~~C~~~~~g~~~~C~C~~ 57 (111)
..+.|+. ..|+..+.|..+ ..+.|.|..
T Consensus 76 p~d~Cd~y~~CG~~g~C~~~---~~~~C~Cl~ 104 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNSN---NSPKCSCLP 104 (110)
T ss_pred cccCCCCccccCCccEeCCC---CCCceECCC
Confidence 4567886 689999999663 357899884
Done!