RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13151
         (256 letters)



>gnl|CDD|188969 cd09570, SAM_kazrin_repeat3, SAM domain of kazrin proteins repeat
           3.  SAM (sterile alpha motif) domain repeat 3 of kazrin
           proteins is a protein-protein interaction domain. The
           long isoform of kazrins contains three copies (repeats)
           of SAM domain. Kazrin can interact with periplakin. It
           is involved in interplay between desmosomes and in
           adheren junctions. Additionally kazrins play a role in
           regulation of intercellular differentiation, junction
           assembly, and cytoskeletal organization.
          Length = 72

 Score = 29.7 bits (67), Expect = 0.29
 Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 8/44 (18%)

Query: 212 GAHASCVVLNHTPRCTCDPGFTGDPFSTCFYIQESKHPNRRLVT 255
           G H + +VL        +P F  D  +T   I  SK+  RR +T
Sbjct: 30  GVHGALMVL--------EPSFNSDTMATALGIPSSKNIIRRHLT 65


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 27.8 bits (62), Expect = 0.54
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 71 PGSCGAHASCVVLNHTPRCTCDPGFTGD 98
             C    +CV    + RC C PG+TGD
Sbjct: 5  SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 27.8 bits (62), Expect = 0.54
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 208 PGSCGAHASCVVLNHTPRCTCDPGFTGD 235
              C    +CV    + RC C PG+TGD
Sbjct: 5   SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 1/24 (4%)

Query: 72  GSCGAHASCVVLNHTPRCTCDPGF 95
           G CG +  C V N +P+C C  GF
Sbjct: 84  GRCGPYGYCDV-NTSPKCNCIKGF 106



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 1/24 (4%)

Query: 209 GSCGAHASCVVLNHTPRCTCDPGF 232
           G CG +  C V N +P+C C  GF
Sbjct: 84  GRCGPYGYCDV-NTSPKCNCIKGF 106



 Score = 26.9 bits (60), Expect = 4.9
 Identities = 7/19 (36%), Positives = 11/19 (57%), Gaps = 1/19 (5%)

Query: 154 CGPNSICRVIGNTPACSCV 172
           CGP   C  +  +P C+C+
Sbjct: 86  CGPYGYC-DVNTSPKCNCI 103


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 26.2 bits (58), Expect = 1.9
 Identities = 8/25 (32%), Positives = 10/25 (40%)

Query: 74 CGAHASCVVLNHTPRCTCDPGFTGD 98
          C    +CV       C C  G+TG 
Sbjct: 7  CSNGGTCVDTPGGYTCECPEGYTGK 31



 Score = 26.2 bits (58), Expect = 1.9
 Identities = 8/25 (32%), Positives = 10/25 (40%)

Query: 211 CGAHASCVVLNHTPRCTCDPGFTGD 235
           C    +CV       C C  G+TG 
Sbjct: 7   CSNGGTCVDTPGGYTCECPEGYTGK 31


>gnl|CDD|224409 COG1492, CobQ, Cobyric acid synthase [Coenzyme metabolism].
          Length = 486

 Score = 29.5 bits (67), Expect = 1.9
 Identities = 23/83 (27%), Positives = 28/83 (33%), Gaps = 16/83 (19%)

Query: 139 AAEPRPSG--NPCLPSPCGPNS---IC--RVIGNTPACSCVESYIGRP---------PNC 182
           AA   PS   NP L  PC       I   + IG   A    +   G              
Sbjct: 64  AAGIEPSVHMNPVLLKPCSDTGSQVIVMGKDIGRKSAVEYYQEGKGLLWVAVKESLERLD 123

Query: 183 RPECTINAECAGNLACINERCKD 205
           R    +  E AG+ A IN R +D
Sbjct: 124 REYDVVVIEGAGSPAEINLRDRD 146


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
          EGF-like domain homologues. This family includes the
          C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.0 bits (58), Expect = 2.5
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 71 PGSCGAHASCVVLNHTPRCTCDPGFTGD 98
           G C  +A+C     +  CTC  G+TGD
Sbjct: 5  NGGCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 26.0 bits (58), Expect = 2.5
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 208 PGSCGAHASCVVLNHTPRCTCDPGFTGD 235
            G C  +A+C     +  CTC  G+TGD
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTGD 32


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 26.1 bits (58), Expect = 2.5
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 68 DPC--PGSCGAHASCVVLNHTPRCTCDPGFTGD 98
          D C     C    +CV    + RC+C PG+TG 
Sbjct: 3  DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35



 Score = 26.1 bits (58), Expect = 2.5
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 205 DPC--PGSCGAHASCVVLNHTPRCTCDPGFTGD 235
           D C     C    +CV    + RC+C PG+TG 
Sbjct: 3   DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.479 

Gapped
Lambda     K      H
   0.267   0.0656    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,337,921
Number of extensions: 1059979
Number of successful extensions: 780
Number of sequences better than 10.0: 1
Number of HSP's gapped: 755
Number of HSP's successfully gapped: 78
Length of query: 256
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 161
Effective length of database: 6,723,972
Effective search space: 1082559492
Effective search space used: 1082559492
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.3 bits)