Query         psy10996
Match_columns 192
No_of_seqs    179 out of 1160
Neff          8.1 
Searched_HMMs 46136
Date          Fri Aug 16 15:31:37 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy10996.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10996hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00093 VWC:  von Willebrand f  99.5 1.3E-14 2.7E-19   92.5   3.7   54  125-178     1-57  (57)
  2 smart00214 VWC von Willebrand   99.4 1.3E-13 2.9E-18   88.3   4.9   54  125-178     1-59  (59)
  3 PF00093 VWC:  von Willebrand f  99.3 2.5E-12 5.5E-17   81.8   4.3   51    2-52      5-57  (57)
  4 smart00214 VWC von Willebrand   99.2 2.2E-11 4.7E-16   78.0   5.2   52    1-52      4-59  (59)
  5 smart00215 VWC_out von Willebr  98.2 1.9E-06 4.1E-11   56.3   3.1   40  125-165     1-41  (67)
  6 smart00215 VWC_out von Willebr  97.8 2.2E-05 4.9E-10   51.3   3.2   39    1-40      4-42  (67)
  7 PF12714 TILa:  TILa domain      97.6  0.0001 2.2E-09   46.4   4.1   37  124-160     1-40  (56)
  8 PF12714 TILa:  TILa domain      96.7  0.0051 1.1E-07   38.7   5.1   35    2-36      7-43  (56)
  9 PF05375 Pacifastin_I:  Pacifas  95.8   0.012 2.6E-07   34.2   2.8   27  133-160     3-30  (39)
 10 PF05375 Pacifastin_I:  Pacifas  95.7   0.018 3.9E-07   33.4   3.3   29    6-35      3-32  (39)
 11 PF05825 PSP94:  Beta-microsemi  93.1     0.1 2.2E-06   36.3   2.9   37  121-157    14-51  (94)
 12 KOG1544|consensus               90.9    0.28   6E-06   41.6   3.6   37  123-160   110-146 (470)
 13 cd00061 FN1 Fibronectin type 1  89.8    0.57 1.2E-05   27.7   3.3   30    1-30      8-42  (43)
 14 PF05825 PSP94:  Beta-microsemi  89.2    0.45 9.7E-06   33.1   3.0   30    2-31     23-52  (94)
 15 smart00058 FN1 Fibronectin typ  84.6     2.1 4.6E-05   25.5   3.7   31    1-31      6-43  (45)
 16 cd00061 FN1 Fibronectin type 1  82.2     1.8 3.9E-05   25.6   2.7   29  129-157     9-42  (43)
 17 PF00039 fn1:  Fibronectin type  75.9     2.9 6.2E-05   24.1   2.2   28    1-28      6-39  (39)
 18 PF14828 Amnionless:  Amnionles  74.2     1.2 2.6E-05   39.5   0.5   37   54-91    193-234 (437)
 19 PF15430 SVWC:  Single domain v  72.1       8 0.00017   24.2   3.9   36  125-160     1-37  (65)
 20 PF14828 Amnionless:  Amnionles  71.8     1.5 3.3E-05   38.9   0.5   38   16-53    191-233 (437)
 21 PF09443 CFC:  Cripto_Frl-1_Cry  65.1     3.1 6.8E-05   23.6   0.7   23  137-159     8-30  (36)
 22 KOG1544|consensus               64.0     4.9 0.00011   34.3   2.0   32    2-34    116-147 (470)

No 1  
>PF00093 VWC:  von Willebrand factor type C domain;  InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.51  E-value=1.3e-14  Score=92.47  Aligned_cols=54  Identities=37%  Similarity=1.147  Sum_probs=33.8

Q ss_pred             eeeCCEEecCCCeeccCCcceeEECCCCceeecccc-CCCCCCC--ccCCCCCcccc
Q psy10996        125 CTVGNVTYQENQEWRLDACTSCVCKDRSHHCTQRIC-SVTCSNP--MTIPNQCCPLC  178 (192)
Q Consensus       125 C~~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~C-~~~C~~~--~~~~g~CCp~C  178 (192)
                      |.|+|++|.+|++|+++.|+.|+|.+|.|.|.++.| ++.|+.+  +..+|+|||+|
T Consensus         1 C~~~g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~~~g~CCp~C   57 (57)
T PF00093_consen    1 CSFNGRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVKPPGECCPVC   57 (57)
T ss_dssp             EESSS-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S-------SS-S--S-
T ss_pred             CeeCCEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceecCCCCcCCCC
Confidence            689999999999999999999999999999999999 7899988  78999999987


No 2  
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.45  E-value=1.3e-13  Score=88.34  Aligned_cols=54  Identities=33%  Similarity=0.968  Sum_probs=50.0

Q ss_pred             eeeCCEEecCCCeeccCCcceeEECCCC-ceeecccc-CC-CCCCCc--cCCCCCcccc
Q psy10996        125 CTVGNVTYQENQEWRLDACTSCVCKDRS-HHCTQRIC-SV-TCSNPM--TIPNQCCPLC  178 (192)
Q Consensus       125 C~~~g~~y~~G~~w~~~~C~~C~C~~g~-v~C~~~~C-~~-~C~~~~--~~~g~CCp~C  178 (192)
                      |+|+|++|.+|++|++++|++|+|.+|. +.|.++.| ++ .|+++.  ..+|+|||+|
T Consensus         1 C~~~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C   59 (59)
T smart00214        1 CVHNGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC   59 (59)
T ss_pred             CccCCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence            6799999999999999999999999999 99999999 54 799887  7899999986


No 3  
>PF00093 VWC:  von Willebrand factor type C domain;  InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.30  E-value=2.5e-12  Score=81.80  Aligned_cols=51  Identities=37%  Similarity=0.969  Sum_probs=30.6

Q ss_pred             CceecCCCEEecCCCeeeEECCCceEEeeecCCCCCCCCc--cccCCCcccCC
Q psy10996          2 NESCENGTSWLSADCEECKCERGIKFCSKVECPKLPEACT--KSEIGQCCPVC   52 (192)
Q Consensus         2 G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~~--~~~~g~CCp~C   52 (192)
                      |++|++|++|+++.|+.|+|.+|.+.|..+.|+.+.|+.+  +..+|+|||.|
T Consensus         5 g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~~~g~CCp~C   57 (57)
T PF00093_consen    5 GRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVKPPGECCPVC   57 (57)
T ss_dssp             S-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S-------SS-S--S-
T ss_pred             CEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceecCCCCcCCCC
Confidence            8899999999998899999999999999999999899988  78899999876


No 4  
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.22  E-value=2.2e-11  Score=78.00  Aligned_cols=52  Identities=33%  Similarity=0.857  Sum_probs=46.4

Q ss_pred             CCceecCCCEEecCCCeeeEECCCc-eEEeeecCCCC-CCCCcc--ccCCCcccCC
Q psy10996          1 MNESCENGTSWLSADCEECKCERGI-KFCSKVECPKL-PEACTK--SEIGQCCPVC   52 (192)
Q Consensus         1 ~G~~y~~G~~w~~~~C~~C~C~~g~-v~C~~~~Cp~~-~C~~~~--~~~g~CCp~C   52 (192)
                      .|++|++|++|++++|++|+|.+|. +.|..+.|+.+ .|+.+.  ..+|+|||.|
T Consensus         4 ~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C   59 (59)
T smart00214        4 NGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC   59 (59)
T ss_pred             CCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence            3899999999999889999999999 99999999875 788876  7888888876


No 5  
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=98.15  E-value=1.9e-06  Score=56.33  Aligned_cols=40  Identities=28%  Similarity=0.792  Sum_probs=35.4

Q ss_pred             eeeCCEEecCCCeeccCCcceeEECCCCceeecccc-CCCCC
Q psy10996        125 CTVGNVTYQENQEWRLDACTSCVCKDRSHHCTQRIC-SVTCS  165 (192)
Q Consensus       125 C~~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~C-~~~C~  165 (192)
                      |+++|++|++|++|. +.|.+|+|.+|.+.|+.+.| +..|.
T Consensus         1 C~~~G~~y~~g~~w~-d~Cn~CtC~~G~v~Ct~~~C~~~~c~   41 (67)
T smart00215        1 CYNNGSYYPPGAKWD-DDCNRCTCLNGRVSCTKVWCGPKPCL   41 (67)
T ss_pred             CeECCEEcCCCCccc-cCCCCCEecCCCEEecCCcCCCchhh
Confidence            678999999999997 58999999999999999999 55543


No 6  
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=97.79  E-value=2.2e-05  Score=51.26  Aligned_cols=39  Identities=31%  Similarity=0.633  Sum_probs=34.4

Q ss_pred             CCceecCCCEEecCCCeeeEECCCceEEeeecCCCCCCCC
Q psy10996          1 MNESCENGTSWLSADCEECKCERGIKFCSKVECPKLPEAC   40 (192)
Q Consensus         1 ~G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~   40 (192)
                      .|+.|++|+.|. +.|..|+|.+|.+.|+++.|+...|..
T Consensus         4 ~G~~y~~g~~w~-d~Cn~CtC~~G~v~Ct~~~C~~~~c~l   42 (67)
T smart00215        4 NGSYYPPGAKWD-DDCNRCTCLNGRVSCTKVWCGPKPCLL   42 (67)
T ss_pred             CCEEcCCCCccc-cCCCCCEecCCCEEecCCcCCCchhhc
Confidence            388999999998 799999999999999999998766543


No 7  
>PF12714 TILa:  TILa domain
Probab=97.61  E-value=0.0001  Score=46.45  Aligned_cols=37  Identities=32%  Similarity=0.892  Sum_probs=32.4

Q ss_pred             cee-eCCEEecCCCeeccCCcc-eeEE-CCCCceeecccc
Q psy10996        124 GCT-VGNVTYQENQEWRLDACT-SCVC-KDRSHHCTQRIC  160 (192)
Q Consensus       124 ~C~-~~g~~y~~G~~w~~~~C~-~C~C-~~g~v~C~~~~C  160 (192)
                      +|+ ++|+.|+.||+|..+.|+ +|+| .+|.|.|....|
T Consensus         1 GC~d~~G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C   40 (56)
T PF12714_consen    1 GCTDYNGRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSC   40 (56)
T ss_pred             CCcCcCCEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCC
Confidence            478 899999999999887666 7999 689999999888


No 8  
>PF12714 TILa:  TILa domain
Probab=96.70  E-value=0.0051  Score=38.68  Aligned_cols=35  Identities=31%  Similarity=0.806  Sum_probs=29.9

Q ss_pred             CceecCCCEEecCCCe-eeEE-CCCceEEeeecCCCC
Q psy10996          2 NESCENGTSWLSADCE-ECKC-ERGIKFCSKVECPKL   36 (192)
Q Consensus         2 G~~y~~G~~w~~~~C~-~C~C-~~g~v~C~~~~Cp~~   36 (192)
                      |+.|..||+|..+.|+ .|+| .+|.|+|....|++.
T Consensus         7 G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C~~~   43 (56)
T PF12714_consen    7 GRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSCPPG   43 (56)
T ss_pred             CEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCCCCC
Confidence            7889999999986665 8999 788999999999753


No 9  
>PF05375 Pacifastin_I:  Pacifastin inhibitor (LCMII);  InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) [].  Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=95.75  E-value=0.012  Score=34.21  Aligned_cols=27  Identities=26%  Similarity=0.884  Sum_probs=23.0

Q ss_pred             cCCCeeccCCcceeEEC-CCCceeecccc
Q psy10996        133 QENQEWRLDACTSCVCK-DRSHHCTQRIC  160 (192)
Q Consensus       133 ~~G~~w~~~~C~~C~C~-~g~v~C~~~~C  160 (192)
                      .+|++|+. .|..|+|. +|.+.|+++.|
T Consensus         3 ~pG~~~k~-dCN~C~C~~~G~~~CT~~~C   30 (39)
T PF05375_consen    3 TPGSTFKK-DCNTCTCSSDGKWACTRKAC   30 (39)
T ss_dssp             -TTTEEEE-TTEEEEEBTTSSEEEE-SSS
T ss_pred             CCCCeeeC-CCCCCCCCCCCcCccCcccC
Confidence            36899997 69999999 99999999888


No 10 
>PF05375 Pacifastin_I:  Pacifastin inhibitor (LCMII);  InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) [].  Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=95.67  E-value=0.018  Score=33.44  Aligned_cols=29  Identities=31%  Similarity=0.935  Sum_probs=23.9

Q ss_pred             cCCCEEecCCCeeeEEC-CCceEEeeecCCC
Q psy10996          6 ENGTSWLSADCEECKCE-RGIKFCSKVECPK   35 (192)
Q Consensus         6 ~~G~~w~~~~C~~C~C~-~g~v~C~~~~Cp~   35 (192)
                      .+|++|.. .|..|+|. +|.+.|+.+.|++
T Consensus         3 ~pG~~~k~-dCN~C~C~~~G~~~CT~~~C~~   32 (39)
T PF05375_consen    3 TPGSTFKK-DCNTCTCSSDGKWACTRKACPP   32 (39)
T ss_dssp             -TTTEEEE-TTEEEEEBTTSSEEEE-SSSS-
T ss_pred             CCCCeeeC-CCCCCCCCCCCcCccCcccCcC
Confidence            47999994 79999998 9999999999975


No 11 
>PF05825 PSP94:  Beta-microseminoprotein (PSP-94);  InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=93.09  E-value=0.1  Score=36.31  Aligned_cols=37  Identities=24%  Similarity=0.708  Sum_probs=30.9

Q ss_pred             CCccee-eCCEEecCCCeeccCCcceeEECCCCceeec
Q psy10996        121 VPSGCT-VGNVTYQENQEWRLDACTSCVCKDRSHHCTQ  157 (192)
Q Consensus       121 ~~~~C~-~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~  157 (192)
                      ...+|. ++|+.|.-|++|+.+.|..|+|.+..+.|-.
T Consensus        14 ~~~~C~D~dG~~h~l~S~W~T~~C~~C~C~~~Gi~CC~   51 (94)
T PF05825_consen   14 SPNECKDWDGKKHPLNSTWKTDDCEECSCSEDGISCCS   51 (94)
T ss_dssp             STTCEEETTTEEEETTEEEEETTSEEEEEESSEEEEEE
T ss_pred             CCCccEeCCCcEEeCCCeECCCCCcEEEecCCceEecc
Confidence            356787 5899999999999999999999876676644


No 12 
>KOG1544|consensus
Probab=90.89  E-value=0.28  Score=41.64  Aligned_cols=37  Identities=30%  Similarity=0.811  Sum_probs=34.4

Q ss_pred             cceeeCCEEecCCCeeccCCcceeEECCCCceeecccc
Q psy10996        123 SGCTVGNVTYQENQEWRLDACTSCVCKDRSHHCTQRIC  160 (192)
Q Consensus       123 ~~C~~~g~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~C  160 (192)
                      .+|..+|+.|.+|.+... .|.+|+|.++.+.|....|
T Consensus       110 ~gc~~gg~~y~~G~t~~~-NCn~CTC~n~qWKCdq~~C  146 (470)
T KOG1544|consen  110 QGCMHGGRIYPVGGTYWD-NCNRCTCQNRQWKCDQEPC  146 (470)
T ss_pred             hhcccCceecccCCeeec-cccceeecCCceecCCcee
Confidence            789999999999999777 8999999999999999877


No 13 
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices.  Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=89.82  E-value=0.57  Score=27.71  Aligned_cols=30  Identities=20%  Similarity=0.476  Sum_probs=23.8

Q ss_pred             CCceecCCCEEec---CCCeeeEECC--CceEEee
Q psy10996          1 MNESCENGTSWLS---ADCEECKCER--GIKFCSK   30 (192)
Q Consensus         1 ~G~~y~~G~~w~~---~~C~~C~C~~--g~v~C~~   30 (192)
                      .|+.|+.|++|..   ..-..|+|..  |.+.|++
T Consensus         8 t~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~   42 (43)
T cd00061           8 TGVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP   42 (43)
T ss_pred             cccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence            3788999999977   3355788888  8888875


No 14 
>PF05825 PSP94:  Beta-microseminoprotein (PSP-94);  InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=89.19  E-value=0.45  Score=33.12  Aligned_cols=30  Identities=27%  Similarity=0.701  Sum_probs=26.2

Q ss_pred             CceecCCCEEecCCCeeeEECCCceEEeee
Q psy10996          2 NESCENGTSWLSADCEECKCERGIKFCSKV   31 (192)
Q Consensus         2 G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~   31 (192)
                      |+.|.-|++|..+.|..|.|.+..+.|-..
T Consensus        23 G~~h~l~S~W~T~~C~~C~C~~~Gi~CC~~   52 (94)
T PF05825_consen   23 GKKHPLNSTWKTDDCEECSCSEDGISCCST   52 (94)
T ss_dssp             TEEEETTEEEEETTSEEEEEESSEEEEEES
T ss_pred             CcEEeCCCeECCCCCcEEEecCCceEeccC
Confidence            788999999999999999998888887543


No 15 
>smart00058 FN1 Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type  contributes to fibrin-binding.
Probab=84.64  E-value=2.1  Score=25.54  Aligned_cols=31  Identities=19%  Similarity=0.432  Sum_probs=23.6

Q ss_pred             CCceecCCCEEec--C--CCeeeEECCCc---eEEeee
Q psy10996          1 MNESCENGTSWLS--A--DCEECKCERGI---KFCSKV   31 (192)
Q Consensus         1 ~G~~y~~G~~w~~--~--~C~~C~C~~g~---v~C~~~   31 (192)
                      +|..|+.|++|..  +  .-..|+|..+.   +.|++.
T Consensus         6 t~~~Y~ige~W~r~~~~g~~~~C~C~G~g~G~~~C~~~   43 (45)
T smart00058        6 TGTTYRIGDTWERPYEGGHVLQCTCLGNGRGEWKCDPV   43 (45)
T ss_pred             cccEEecCCEEEeecCCCcEEEeEEcCCCceEEEEEec
Confidence            4789999999987  2  34578997765   888764


No 16 
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices.  Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=82.25  E-value=1.8  Score=25.58  Aligned_cols=29  Identities=21%  Similarity=0.609  Sum_probs=23.5

Q ss_pred             CEEecCCCeecc---CCcceeEECC--CCceeec
Q psy10996        129 NVTYQENQEWRL---DACTSCVCKD--RSHHCTQ  157 (192)
Q Consensus       129 g~~y~~G~~w~~---~~C~~C~C~~--g~v~C~~  157 (192)
                      ++.|..|++|..   .--..|+|..  |.+.|..
T Consensus         9 ~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~   42 (43)
T cd00061           9 GVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP   42 (43)
T ss_pred             ccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence            778999999963   3566899998  8888875


No 17 
>PF00039 fn1:  Fibronectin type I domain;  InterPro: IPR000083 Fibronectin type I repeats are one of the three repeats found in the fibronectin protein. Fibronectin is a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin. Type I domain (FN1) is approximately 40 residues in length. Four conserved cysteines are involved in disulphide bonds. The 3D structure of the FN1 domain has been determined [, , ]. It consists of two antiparallel beta-sheets, first a double-stranded one, that is linked by a disulphide bond to a triple-stranded beta-sheet. The second conserved disulphide bridge links the C-terminal adjacent strands of the domain.  In human tissue plasminogen activator chain A the FN1 domain together with the following epidermal growth factor (EGF)-like domain are involved in fibrin-binding []. It has been suggested that these two modules form a single structural and functional unit []. The two domains keep their specific tertiary structure, but interact intimately to bury a hydrophobic core; the inter-module linker makes up the third strand of the EGF-module's major beta-sheet.; GO: 0005576 extracellular region; PDB: 2RKY_A 1FBR_A 2RL0_D 3M7P_A 3MQL_A 3ZRZ_A 2RKZ_F 2CKU_A 2CG6_A 3CAL_C ....
Probab=75.86  E-value=2.9  Score=24.12  Aligned_cols=28  Identities=21%  Similarity=0.531  Sum_probs=19.1

Q ss_pred             CCceecCCCEEec---CCCeeeEECC---CceEE
Q psy10996          1 MNESCENGTSWLS---ADCEECKCER---GIKFC   28 (192)
Q Consensus         1 ~G~~y~~G~~w~~---~~C~~C~C~~---g~v~C   28 (192)
                      .|..|+.|++|..   ..=..|+|..   |.+.|
T Consensus         6 t~~~Y~vge~W~R~~~g~~~~CtC~G~g~G~~~C   39 (39)
T PF00039_consen    6 TGQFYQVGETWERPYQGHMMQCTCLGNGRGEWKC   39 (39)
T ss_dssp             TTEEEETTEEEEEEETSSEEEEEEEETTTTEEEE
T ss_pred             CccEEeCCCEEEeecCCeEEEeEECCCCccceEC
Confidence            3788999999986   2334688844   55544


No 18 
>PF14828 Amnionless:  Amnionless
Probab=74.24  E-value=1.2  Score=39.48  Aligned_cols=37  Identities=35%  Similarity=1.076  Sum_probs=26.9

Q ss_pred             ccEEcC---CeEEEeec--CCCCCCCCCCCCCCCCCCCCcCCC
Q psy10996         54 DCTCRE---GKVLCYSQ--QCPAAACSNPRPPEPGTCCPRCAG   91 (192)
Q Consensus        54 ~C~C~~---g~v~C~~~--~Cp~~~C~~~~~~~~g~CCp~C~~   91 (192)
                      -|.|.+   ....|..+  .||.+.|..+++. .|.||++|-.
T Consensus       193 gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~P-~GhCC~iCGa  234 (437)
T PF14828_consen  193 GCPCGNDEVLEWICSNVLQRCPKPHCRSPLRP-EGHCCPICGA  234 (437)
T ss_pred             cCccCcccchhhhhHHhhCcCCCCccCCCCCC-CCCchhhcce
Confidence            455533   34567666  8999999999874 8999999943


No 19 
>PF15430 SVWC:  Single domain von Willebrand factor type C
Probab=72.06  E-value=8  Score=24.17  Aligned_cols=36  Identities=17%  Similarity=0.404  Sum_probs=27.3

Q ss_pred             eeeCCEEecCCCeecc-CCcceeEECCCCceeecccc
Q psy10996        125 CTVGNVTYQENQEWRL-DACTSCVCKDRSHHCTQRIC  160 (192)
Q Consensus       125 C~~~g~~y~~G~~w~~-~~C~~C~C~~g~v~C~~~~C  160 (192)
                      |.|+|..+.+|++|.. ++|..=+|..+.-.=....|
T Consensus         1 C~y~~~~~~~g~~~~~~~pC~~~~C~~~~~~v~v~~C   37 (65)
T PF15430_consen    1 CYYNGRTIPSGESYNPEEPCERWTCDASDGYVTVEGC   37 (65)
T ss_pred             CEECCEEcCCCcEecCCCCCceEEEECCCCEEEEEeC
Confidence            6889999999999974 69999999843334445556


No 20 
>PF14828 Amnionless:  Amnionless
Probab=71.83  E-value=1.5  Score=38.90  Aligned_cols=38  Identities=34%  Similarity=0.789  Sum_probs=31.6

Q ss_pred             CeeeEECCC---ceEEeee--cCCCCCCCCccccCCCcccCCC
Q psy10996         16 CEECKCERG---IKFCSKV--ECPKLPEACTKSEIGQCCPVCL   53 (192)
Q Consensus        16 C~~C~C~~g---~v~C~~~--~Cp~~~C~~~~~~~g~CCp~C~   53 (192)
                      -..|.|.+.   ...|..+  .|+.+.|..+..+.|+||+.|-
T Consensus       191 ~~gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~P~GhCC~iCG  233 (437)
T PF14828_consen  191 PSGCPCGNDEVLEWICSNVLQRCPKPHCRSPLRPEGHCCPICG  233 (437)
T ss_pred             cccCccCcccchhhhhHHhhCcCCCCccCCCCCCCCCchhhcc
Confidence            347888765   4667777  8999999999999999999993


No 21 
>PF09443 CFC:  Cripto_Frl-1_Cryptic (CFC);  InterPro: IPR019011 This entry represents the CFC domain found in the membrane protein Cripto (or teratocarcinoma-derived growth factor), a protein over expressed in many tumours [, ] and structurally similar to the C-terminal extracellular portions of Jagged 1 and Jagged 2 []. CFC is approx 40-residues long, compacted by three internal disulphide bridges, and binds Alk4 via a hydrophobic patch. CFC is structurally homologous to the VWFC-like domain [].  The protein Cripto is the founding member of the extra-cellular EGF-CFC growth factors, which are composed of two adjacent cysteine-rich domains: the EGF-like (IPR006210 from INTERPRO) and the CFC domains. Members of the EGF-CFC family play key roles in embryonic development and are also implicated in tumourigenesis []. The Cripto protein could play a role in the determination of the epiblastic cells that subsequently give rise to the mesoderm. Although both the EGF and CFC domains are involved in the tumourigenic activity of Crispto proteins, the CFC domain appears to play a crucial role, as it is through the CFC domain that Crispto interferes with the onco-suppressive activity of Activins, either by blocking the Activin receptor ALK4 or by antagonising proteins of the TGF-beta family [].  The Cryptic protein is involved in the correct establishment of the left-right axis. May play a role in mesoderm and/or neural patterning during gastrulation.; PDB: 2J5H_A.
Probab=65.13  E-value=3.1  Score=23.57  Aligned_cols=23  Identities=30%  Similarity=0.774  Sum_probs=7.1

Q ss_pred             eeccCCcceeEECCCCceeeccc
Q psy10996        137 EWRLDACTSCVCKDRSHHCTQRI  159 (192)
Q Consensus       137 ~w~~~~C~~C~C~~g~v~C~~~~  159 (192)
                      .|....|.+|+|..|...|...+
T Consensus         8 ~W~~~~C~lCrC~~G~LhC~p~~   30 (36)
T PF09443_consen    8 EWVPKGCSLCRCWYGTLHCFPQQ   30 (36)
T ss_dssp             T-SS---------SSS--S--SS
T ss_pred             CeeecCceeeEeecCcEEEcCcc
Confidence            58888999999999999988754


No 22 
>KOG1544|consensus
Probab=63.98  E-value=4.9  Score=34.35  Aligned_cols=32  Identities=19%  Similarity=0.590  Sum_probs=28.7

Q ss_pred             CceecCCCEEecCCCeeeEECCCceEEeeecCC
Q psy10996          2 NESCENGTSWLSADCEECKCERGIKFCSKVECP   34 (192)
Q Consensus         2 G~~y~~G~~w~~~~C~~C~C~~g~v~C~~~~Cp   34 (192)
                      |+.|.+|++.. +.|..|+|.++...|+...|-
T Consensus       116 g~~y~~G~t~~-~NCn~CTC~n~qWKCdq~~CL  147 (470)
T KOG1544|consen  116 GRIYPVGGTYW-DNCNRCTCQNRQWKCDQEPCL  147 (470)
T ss_pred             ceecccCCeee-ccccceeecCCceecCCceee
Confidence            67788999887 689999999999999999885


Done!