Query psy10996
Match_columns 192
No_of_seqs 179 out of 1160
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 15:31:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10996.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10996hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00093 VWC: von Willebrand f 99.5 1.3E-14 2.7E-19 92.5 3.7 54 125-178 1-57 (57)
2 smart00214 VWC von Willebrand 99.4 1.3E-13 2.9E-18 88.3 4.9 54 125-178 1-59 (59)
3 PF00093 VWC: von Willebrand f 99.3 2.5E-12 5.5E-17 81.8 4.3 51 2-52 5-57 (57)
4 smart00214 VWC von Willebrand 99.2 2.2E-11 4.7E-16 78.0 5.2 52 1-52 4-59 (59)
5 smart00215 VWC_out von Willebr 98.2 1.9E-06 4.1E-11 56.3 3.1 40 125-165 1-41 (67)
6 smart00215 VWC_out von Willebr 97.8 2.2E-05 4.9E-10 51.3 3.2 39 1-40 4-42 (67)
7 PF12714 TILa: TILa domain 97.6 0.0001 2.2E-09 46.4 4.1 37 124-160 1-40 (56)
8 PF12714 TILa: TILa domain 96.7 0.0051 1.1E-07 38.7 5.1 35 2-36 7-43 (56)
9 PF05375 Pacifastin_I: Pacifas 95.8 0.012 2.6E-07 34.2 2.8 27 133-160 3-30 (39)
10 PF05375 Pacifastin_I: Pacifas 95.7 0.018 3.9E-07 33.4 3.3 29 6-35 3-32 (39)
11 PF05825 PSP94: Beta-microsemi 93.1 0.1 2.2E-06 36.3 2.9 37 121-157 14-51 (94)
12 KOG1544|consensus 90.9 0.28 6E-06 41.6 3.6 37 123-160 110-146 (470)
13 cd00061 FN1 Fibronectin type 1 89.8 0.57 1.2E-05 27.7 3.3 30 1-30 8-42 (43)
14 PF05825 PSP94: Beta-microsemi 89.2 0.45 9.7E-06 33.1 3.0 30 2-31 23-52 (94)
15 smart00058 FN1 Fibronectin typ 84.6 2.1 4.6E-05 25.5 3.7 31 1-31 6-43 (45)
16 cd00061 FN1 Fibronectin type 1 82.2 1.8 3.9E-05 25.6 2.7 29 129-157 9-42 (43)
17 PF00039 fn1: Fibronectin type 75.9 2.9 6.2E-05 24.1 2.2 28 1-28 6-39 (39)
18 PF14828 Amnionless: Amnionles 74.2 1.2 2.6E-05 39.5 0.5 37 54-91 193-234 (437)
19 PF15430 SVWC: Single domain v 72.1 8 0.00017 24.2 3.9 36 125-160 1-37 (65)
20 PF14828 Amnionless: Amnionles 71.8 1.5 3.3E-05 38.9 0.5 38 16-53 191-233 (437)
21 PF09443 CFC: Cripto_Frl-1_Cry 65.1 3.1 6.8E-05 23.6 0.7 23 137-159 8-30 (36)
22 KOG1544|consensus 64.0 4.9 0.00011 34.3 2.0 32 2-34 116-147 (470)
No 1
>PF00093 VWC: von Willebrand factor type C domain; InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.51 E-value=1.3e-14 Score=92.47 Aligned_cols=54 Identities=37% Similarity=1.147 Sum_probs=33.8
Q ss_pred eeeCCEEecCCCeeccCCcceeEECCCCceeecccc-CCCCCCC--ccCCCCCcccc
Q psy10996 125 CTVGNVTYQENQEWRLDACTSCVCKDRSHHCTQRIC-SVTCSNP--MTIPNQCCPLC 178 (192)
Q Consensus 125 C~~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~C-~~~C~~~--~~~~g~CCp~C 178 (192)
|.|+|++|.+|++|+++.|+.|+|.+|.|.|.++.| ++.|+.+ +..+|+|||+|
T Consensus 1 C~~~g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~~~g~CCp~C 57 (57)
T PF00093_consen 1 CSFNGRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVKPPGECCPVC 57 (57)
T ss_dssp EESSS-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S-------SS-S--S-
T ss_pred CeeCCEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceecCCCCcCCCC
Confidence 689999999999999999999999999999999999 7899988 78999999987
No 2
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.45 E-value=1.3e-13 Score=88.34 Aligned_cols=54 Identities=33% Similarity=0.968 Sum_probs=50.0
Q ss_pred eeeCCEEecCCCeeccCCcceeEECCCC-ceeecccc-CC-CCCCCc--cCCCCCcccc
Q psy10996 125 CTVGNVTYQENQEWRLDACTSCVCKDRS-HHCTQRIC-SV-TCSNPM--TIPNQCCPLC 178 (192)
Q Consensus 125 C~~~g~~y~~G~~w~~~~C~~C~C~~g~-v~C~~~~C-~~-~C~~~~--~~~g~CCp~C 178 (192)
|+|+|++|.+|++|++++|++|+|.+|. +.|.++.| ++ .|+++. ..+|+|||+|
T Consensus 1 C~~~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C 59 (59)
T smart00214 1 CVHNGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC 59 (59)
T ss_pred CccCCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence 6799999999999999999999999999 99999999 54 799887 7899999986
No 3
>PF00093 VWC: von Willebrand factor type C domain; InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.30 E-value=2.5e-12 Score=81.80 Aligned_cols=51 Identities=37% Similarity=0.969 Sum_probs=30.6
Q ss_pred CceecCCCEEecCCCeeeEECCCceEEeeecCCCCCCCCc--cccCCCcccCC
Q psy10996 2 NESCENGTSWLSADCEECKCERGIKFCSKVECPKLPEACT--KSEIGQCCPVC 52 (192)
Q Consensus 2 G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~~--~~~~g~CCp~C 52 (192)
|++|++|++|+++.|+.|+|.+|.+.|..+.|+.+.|+.+ +..+|+|||.|
T Consensus 5 g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~~~g~CCp~C 57 (57)
T PF00093_consen 5 GRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVKPPGECCPVC 57 (57)
T ss_dssp S-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S-------SS-S--S-
T ss_pred CEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceecCCCCcCCCC
Confidence 8899999999998899999999999999999999899988 78899999876
No 4
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.22 E-value=2.2e-11 Score=78.00 Aligned_cols=52 Identities=33% Similarity=0.857 Sum_probs=46.4
Q ss_pred CCceecCCCEEecCCCeeeEECCCc-eEEeeecCCCC-CCCCcc--ccCCCcccCC
Q psy10996 1 MNESCENGTSWLSADCEECKCERGI-KFCSKVECPKL-PEACTK--SEIGQCCPVC 52 (192)
Q Consensus 1 ~G~~y~~G~~w~~~~C~~C~C~~g~-v~C~~~~Cp~~-~C~~~~--~~~g~CCp~C 52 (192)
.|++|++|++|++++|++|+|.+|. +.|..+.|+.+ .|+.+. ..+|+|||.|
T Consensus 4 ~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C 59 (59)
T smart00214 4 NGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC 59 (59)
T ss_pred CCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence 3899999999999889999999999 99999999875 788876 7888888876
No 5
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=98.15 E-value=1.9e-06 Score=56.33 Aligned_cols=40 Identities=28% Similarity=0.792 Sum_probs=35.4
Q ss_pred eeeCCEEecCCCeeccCCcceeEECCCCceeecccc-CCCCC
Q psy10996 125 CTVGNVTYQENQEWRLDACTSCVCKDRSHHCTQRIC-SVTCS 165 (192)
Q Consensus 125 C~~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~C-~~~C~ 165 (192)
|+++|++|++|++|. +.|.+|+|.+|.+.|+.+.| +..|.
T Consensus 1 C~~~G~~y~~g~~w~-d~Cn~CtC~~G~v~Ct~~~C~~~~c~ 41 (67)
T smart00215 1 CYNNGSYYPPGAKWD-DDCNRCTCLNGRVSCTKVWCGPKPCL 41 (67)
T ss_pred CeECCEEcCCCCccc-cCCCCCEecCCCEEecCCcCCCchhh
Confidence 678999999999997 58999999999999999999 55543
No 6
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=97.79 E-value=2.2e-05 Score=51.26 Aligned_cols=39 Identities=31% Similarity=0.633 Sum_probs=34.4
Q ss_pred CCceecCCCEEecCCCeeeEECCCceEEeeecCCCCCCCC
Q psy10996 1 MNESCENGTSWLSADCEECKCERGIKFCSKVECPKLPEAC 40 (192)
Q Consensus 1 ~G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~ 40 (192)
.|+.|++|+.|. +.|..|+|.+|.+.|+++.|+...|..
T Consensus 4 ~G~~y~~g~~w~-d~Cn~CtC~~G~v~Ct~~~C~~~~c~l 42 (67)
T smart00215 4 NGSYYPPGAKWD-DDCNRCTCLNGRVSCTKVWCGPKPCLL 42 (67)
T ss_pred CCEEcCCCCccc-cCCCCCEecCCCEEecCCcCCCchhhc
Confidence 388999999998 799999999999999999998766543
No 7
>PF12714 TILa: TILa domain
Probab=97.61 E-value=0.0001 Score=46.45 Aligned_cols=37 Identities=32% Similarity=0.892 Sum_probs=32.4
Q ss_pred cee-eCCEEecCCCeeccCCcc-eeEE-CCCCceeecccc
Q psy10996 124 GCT-VGNVTYQENQEWRLDACT-SCVC-KDRSHHCTQRIC 160 (192)
Q Consensus 124 ~C~-~~g~~y~~G~~w~~~~C~-~C~C-~~g~v~C~~~~C 160 (192)
+|+ ++|+.|+.||+|..+.|+ +|+| .+|.|.|....|
T Consensus 1 GC~d~~G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C 40 (56)
T PF12714_consen 1 GCTDYNGRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSC 40 (56)
T ss_pred CCcCcCCEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCC
Confidence 478 899999999999887666 7999 689999999888
No 8
>PF12714 TILa: TILa domain
Probab=96.70 E-value=0.0051 Score=38.68 Aligned_cols=35 Identities=31% Similarity=0.806 Sum_probs=29.9
Q ss_pred CceecCCCEEecCCCe-eeEE-CCCceEEeeecCCCC
Q psy10996 2 NESCENGTSWLSADCE-ECKC-ERGIKFCSKVECPKL 36 (192)
Q Consensus 2 G~~y~~G~~w~~~~C~-~C~C-~~g~v~C~~~~Cp~~ 36 (192)
|+.|..||+|..+.|+ .|+| .+|.|+|....|++.
T Consensus 7 G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C~~~ 43 (56)
T PF12714_consen 7 GRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSCPPG 43 (56)
T ss_pred CEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCCCCC
Confidence 7889999999986665 8999 788999999999753
No 9
>PF05375 Pacifastin_I: Pacifastin inhibitor (LCMII); InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) []. Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=95.75 E-value=0.012 Score=34.21 Aligned_cols=27 Identities=26% Similarity=0.884 Sum_probs=23.0
Q ss_pred cCCCeeccCCcceeEEC-CCCceeecccc
Q psy10996 133 QENQEWRLDACTSCVCK-DRSHHCTQRIC 160 (192)
Q Consensus 133 ~~G~~w~~~~C~~C~C~-~g~v~C~~~~C 160 (192)
.+|++|+. .|..|+|. +|.+.|+++.|
T Consensus 3 ~pG~~~k~-dCN~C~C~~~G~~~CT~~~C 30 (39)
T PF05375_consen 3 TPGSTFKK-DCNTCTCSSDGKWACTRKAC 30 (39)
T ss_dssp -TTTEEEE-TTEEEEEBTTSSEEEE-SSS
T ss_pred CCCCeeeC-CCCCCCCCCCCcCccCcccC
Confidence 36899997 69999999 99999999888
No 10
>PF05375 Pacifastin_I: Pacifastin inhibitor (LCMII); InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) []. Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=95.67 E-value=0.018 Score=33.44 Aligned_cols=29 Identities=31% Similarity=0.935 Sum_probs=23.9
Q ss_pred cCCCEEecCCCeeeEEC-CCceEEeeecCCC
Q psy10996 6 ENGTSWLSADCEECKCE-RGIKFCSKVECPK 35 (192)
Q Consensus 6 ~~G~~w~~~~C~~C~C~-~g~v~C~~~~Cp~ 35 (192)
.+|++|.. .|..|+|. +|.+.|+.+.|++
T Consensus 3 ~pG~~~k~-dCN~C~C~~~G~~~CT~~~C~~ 32 (39)
T PF05375_consen 3 TPGSTFKK-DCNTCTCSSDGKWACTRKACPP 32 (39)
T ss_dssp -TTTEEEE-TTEEEEEBTTSSEEEE-SSSS-
T ss_pred CCCCeeeC-CCCCCCCCCCCcCccCcccCcC
Confidence 47999994 79999998 9999999999975
No 11
>PF05825 PSP94: Beta-microseminoprotein (PSP-94); InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=93.09 E-value=0.1 Score=36.31 Aligned_cols=37 Identities=24% Similarity=0.708 Sum_probs=30.9
Q ss_pred CCccee-eCCEEecCCCeeccCCcceeEECCCCceeec
Q psy10996 121 VPSGCT-VGNVTYQENQEWRLDACTSCVCKDRSHHCTQ 157 (192)
Q Consensus 121 ~~~~C~-~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~ 157 (192)
...+|. ++|+.|.-|++|+.+.|..|+|.+..+.|-.
T Consensus 14 ~~~~C~D~dG~~h~l~S~W~T~~C~~C~C~~~Gi~CC~ 51 (94)
T PF05825_consen 14 SPNECKDWDGKKHPLNSTWKTDDCEECSCSEDGISCCS 51 (94)
T ss_dssp STTCEEETTTEEEETTEEEEETTSEEEEEESSEEEEEE
T ss_pred CCCccEeCCCcEEeCCCeECCCCCcEEEecCCceEecc
Confidence 356787 5899999999999999999999876676644
No 12
>KOG1544|consensus
Probab=90.89 E-value=0.28 Score=41.64 Aligned_cols=37 Identities=30% Similarity=0.811 Sum_probs=34.4
Q ss_pred cceeeCCEEecCCCeeccCCcceeEECCCCceeecccc
Q psy10996 123 SGCTVGNVTYQENQEWRLDACTSCVCKDRSHHCTQRIC 160 (192)
Q Consensus 123 ~~C~~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~C 160 (192)
.+|..+|+.|.+|.+... .|.+|+|.++.+.|....|
T Consensus 110 ~gc~~gg~~y~~G~t~~~-NCn~CTC~n~qWKCdq~~C 146 (470)
T KOG1544|consen 110 QGCMHGGRIYPVGGTYWD-NCNRCTCQNRQWKCDQEPC 146 (470)
T ss_pred hhcccCceecccCCeeec-cccceeecCCceecCCcee
Confidence 789999999999999777 8999999999999999877
No 13
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=89.82 E-value=0.57 Score=27.71 Aligned_cols=30 Identities=20% Similarity=0.476 Sum_probs=23.8
Q ss_pred CCceecCCCEEec---CCCeeeEECC--CceEEee
Q psy10996 1 MNESCENGTSWLS---ADCEECKCER--GIKFCSK 30 (192)
Q Consensus 1 ~G~~y~~G~~w~~---~~C~~C~C~~--g~v~C~~ 30 (192)
.|+.|+.|++|.. ..-..|+|.. |.+.|++
T Consensus 8 t~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~ 42 (43)
T cd00061 8 TGVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP 42 (43)
T ss_pred cccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence 3788999999977 3355788888 8888875
No 14
>PF05825 PSP94: Beta-microseminoprotein (PSP-94); InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=89.19 E-value=0.45 Score=33.12 Aligned_cols=30 Identities=27% Similarity=0.701 Sum_probs=26.2
Q ss_pred CceecCCCEEecCCCeeeEECCCceEEeee
Q psy10996 2 NESCENGTSWLSADCEECKCERGIKFCSKV 31 (192)
Q Consensus 2 G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~ 31 (192)
|+.|.-|++|..+.|..|.|.+..+.|-..
T Consensus 23 G~~h~l~S~W~T~~C~~C~C~~~Gi~CC~~ 52 (94)
T PF05825_consen 23 GKKHPLNSTWKTDDCEECSCSEDGISCCST 52 (94)
T ss_dssp TEEEETTEEEEETTSEEEEEESSEEEEEES
T ss_pred CcEEeCCCeECCCCCcEEEecCCceEeccC
Confidence 788999999999999999998888887543
No 15
>smart00058 FN1 Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
Probab=84.64 E-value=2.1 Score=25.54 Aligned_cols=31 Identities=19% Similarity=0.432 Sum_probs=23.6
Q ss_pred CCceecCCCEEec--C--CCeeeEECCCc---eEEeee
Q psy10996 1 MNESCENGTSWLS--A--DCEECKCERGI---KFCSKV 31 (192)
Q Consensus 1 ~G~~y~~G~~w~~--~--~C~~C~C~~g~---v~C~~~ 31 (192)
+|..|+.|++|.. + .-..|+|..+. +.|++.
T Consensus 6 t~~~Y~ige~W~r~~~~g~~~~C~C~G~g~G~~~C~~~ 43 (45)
T smart00058 6 TGTTYRIGDTWERPYEGGHVLQCTCLGNGRGEWKCDPV 43 (45)
T ss_pred cccEEecCCEEEeecCCCcEEEeEEcCCCceEEEEEec
Confidence 4789999999987 2 34578997765 888764
No 16
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=82.25 E-value=1.8 Score=25.58 Aligned_cols=29 Identities=21% Similarity=0.609 Sum_probs=23.5
Q ss_pred CEEecCCCeecc---CCcceeEECC--CCceeec
Q psy10996 129 NVTYQENQEWRL---DACTSCVCKD--RSHHCTQ 157 (192)
Q Consensus 129 g~~y~~G~~w~~---~~C~~C~C~~--g~v~C~~ 157 (192)
++.|..|++|.. .--..|+|.. |.+.|..
T Consensus 9 ~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~ 42 (43)
T cd00061 9 GVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP 42 (43)
T ss_pred ccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence 778999999963 3566899998 8888875
No 17
>PF00039 fn1: Fibronectin type I domain; InterPro: IPR000083 Fibronectin type I repeats are one of the three repeats found in the fibronectin protein. Fibronectin is a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin. Type I domain (FN1) is approximately 40 residues in length. Four conserved cysteines are involved in disulphide bonds. The 3D structure of the FN1 domain has been determined [, , ]. It consists of two antiparallel beta-sheets, first a double-stranded one, that is linked by a disulphide bond to a triple-stranded beta-sheet. The second conserved disulphide bridge links the C-terminal adjacent strands of the domain. In human tissue plasminogen activator chain A the FN1 domain together with the following epidermal growth factor (EGF)-like domain are involved in fibrin-binding []. It has been suggested that these two modules form a single structural and functional unit []. The two domains keep their specific tertiary structure, but interact intimately to bury a hydrophobic core; the inter-module linker makes up the third strand of the EGF-module's major beta-sheet.; GO: 0005576 extracellular region; PDB: 2RKY_A 1FBR_A 2RL0_D 3M7P_A 3MQL_A 3ZRZ_A 2RKZ_F 2CKU_A 2CG6_A 3CAL_C ....
Probab=75.86 E-value=2.9 Score=24.12 Aligned_cols=28 Identities=21% Similarity=0.531 Sum_probs=19.1
Q ss_pred CCceecCCCEEec---CCCeeeEECC---CceEE
Q psy10996 1 MNESCENGTSWLS---ADCEECKCER---GIKFC 28 (192)
Q Consensus 1 ~G~~y~~G~~w~~---~~C~~C~C~~---g~v~C 28 (192)
.|..|+.|++|.. ..=..|+|.. |.+.|
T Consensus 6 t~~~Y~vge~W~R~~~g~~~~CtC~G~g~G~~~C 39 (39)
T PF00039_consen 6 TGQFYQVGETWERPYQGHMMQCTCLGNGRGEWKC 39 (39)
T ss_dssp TTEEEETTEEEEEEETSSEEEEEEEETTTTEEEE
T ss_pred CccEEeCCCEEEeecCCeEEEeEECCCCccceEC
Confidence 3788999999986 2334688844 55544
No 18
>PF14828 Amnionless: Amnionless
Probab=74.24 E-value=1.2 Score=39.48 Aligned_cols=37 Identities=35% Similarity=1.076 Sum_probs=26.9
Q ss_pred ccEEcC---CeEEEeec--CCCCCCCCCCCCCCCCCCCCcCCC
Q psy10996 54 DCTCRE---GKVLCYSQ--QCPAAACSNPRPPEPGTCCPRCAG 91 (192)
Q Consensus 54 ~C~C~~---g~v~C~~~--~Cp~~~C~~~~~~~~g~CCp~C~~ 91 (192)
-|.|.+ ....|..+ .||.+.|..+++. .|.||++|-.
T Consensus 193 gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~P-~GhCC~iCGa 234 (437)
T PF14828_consen 193 GCPCGNDEVLEWICSNVLQRCPKPHCRSPLRP-EGHCCPICGA 234 (437)
T ss_pred cCccCcccchhhhhHHhhCcCCCCccCCCCCC-CCCchhhcce
Confidence 455533 34567666 8999999999874 8999999943
No 19
>PF15430 SVWC: Single domain von Willebrand factor type C
Probab=72.06 E-value=8 Score=24.17 Aligned_cols=36 Identities=17% Similarity=0.404 Sum_probs=27.3
Q ss_pred eeeCCEEecCCCeecc-CCcceeEECCCCceeecccc
Q psy10996 125 CTVGNVTYQENQEWRL-DACTSCVCKDRSHHCTQRIC 160 (192)
Q Consensus 125 C~~~g~~y~~G~~w~~-~~C~~C~C~~g~v~C~~~~C 160 (192)
|.|+|..+.+|++|.. ++|..=+|..+.-.=....|
T Consensus 1 C~y~~~~~~~g~~~~~~~pC~~~~C~~~~~~v~v~~C 37 (65)
T PF15430_consen 1 CYYNGRTIPSGESYNPEEPCERWTCDASDGYVTVEGC 37 (65)
T ss_pred CEECCEEcCCCcEecCCCCCceEEEECCCCEEEEEeC
Confidence 6889999999999974 69999999843334445556
No 20
>PF14828 Amnionless: Amnionless
Probab=71.83 E-value=1.5 Score=38.90 Aligned_cols=38 Identities=34% Similarity=0.789 Sum_probs=31.6
Q ss_pred CeeeEECCC---ceEEeee--cCCCCCCCCccccCCCcccCCC
Q psy10996 16 CEECKCERG---IKFCSKV--ECPKLPEACTKSEIGQCCPVCL 53 (192)
Q Consensus 16 C~~C~C~~g---~v~C~~~--~Cp~~~C~~~~~~~g~CCp~C~ 53 (192)
-..|.|.+. ...|..+ .|+.+.|..+..+.|+||+.|-
T Consensus 191 ~~gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~P~GhCC~iCG 233 (437)
T PF14828_consen 191 PSGCPCGNDEVLEWICSNVLQRCPKPHCRSPLRPEGHCCPICG 233 (437)
T ss_pred cccCccCcccchhhhhHHhhCcCCCCccCCCCCCCCCchhhcc
Confidence 347888765 4667777 8999999999999999999993
No 21
>PF09443 CFC: Cripto_Frl-1_Cryptic (CFC); InterPro: IPR019011 This entry represents the CFC domain found in the membrane protein Cripto (or teratocarcinoma-derived growth factor), a protein over expressed in many tumours [, ] and structurally similar to the C-terminal extracellular portions of Jagged 1 and Jagged 2 []. CFC is approx 40-residues long, compacted by three internal disulphide bridges, and binds Alk4 via a hydrophobic patch. CFC is structurally homologous to the VWFC-like domain []. The protein Cripto is the founding member of the extra-cellular EGF-CFC growth factors, which are composed of two adjacent cysteine-rich domains: the EGF-like (IPR006210 from INTERPRO) and the CFC domains. Members of the EGF-CFC family play key roles in embryonic development and are also implicated in tumourigenesis []. The Cripto protein could play a role in the determination of the epiblastic cells that subsequently give rise to the mesoderm. Although both the EGF and CFC domains are involved in the tumourigenic activity of Crispto proteins, the CFC domain appears to play a crucial role, as it is through the CFC domain that Crispto interferes with the onco-suppressive activity of Activins, either by blocking the Activin receptor ALK4 or by antagonising proteins of the TGF-beta family []. The Cryptic protein is involved in the correct establishment of the left-right axis. May play a role in mesoderm and/or neural patterning during gastrulation.; PDB: 2J5H_A.
Probab=65.13 E-value=3.1 Score=23.57 Aligned_cols=23 Identities=30% Similarity=0.774 Sum_probs=7.1
Q ss_pred eeccCCcceeEECCCCceeeccc
Q psy10996 137 EWRLDACTSCVCKDRSHHCTQRI 159 (192)
Q Consensus 137 ~w~~~~C~~C~C~~g~v~C~~~~ 159 (192)
.|....|.+|+|..|...|...+
T Consensus 8 ~W~~~~C~lCrC~~G~LhC~p~~ 30 (36)
T PF09443_consen 8 EWVPKGCSLCRCWYGTLHCFPQQ 30 (36)
T ss_dssp T-SS---------SSS--S--SS
T ss_pred CeeecCceeeEeecCcEEEcCcc
Confidence 58888999999999999988754
No 22
>KOG1544|consensus
Probab=63.98 E-value=4.9 Score=34.35 Aligned_cols=32 Identities=19% Similarity=0.590 Sum_probs=28.7
Q ss_pred CceecCCCEEecCCCeeeEECCCceEEeeecCC
Q psy10996 2 NESCENGTSWLSADCEECKCERGIKFCSKVECP 34 (192)
Q Consensus 2 G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~Cp 34 (192)
|+.|.+|++.. +.|..|+|.++...|+...|-
T Consensus 116 g~~y~~G~t~~-~NCn~CTC~n~qWKCdq~~CL 147 (470)
T KOG1544|consen 116 GRIYPVGGTYW-DNCNRCTCQNRQWKCDQEPCL 147 (470)
T ss_pred ceecccCCeee-ccccceeecCCceecCCceee
Confidence 67788999887 689999999999999999885
Done!