RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11059
         (429 letters)



>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 44.9 bits (107), Expect = 8e-07
 Identities = 21/37 (56%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 348 DFDECLS-NPCFNGATCQNKINGYTCVCAPGYSGKEC 383
           D DEC S NPC NG TC N +  Y C C PGY+G+ C
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37



 Score = 40.7 bits (96), Expect = 2e-05
 Identities = 19/38 (50%), Positives = 27/38 (71%), Gaps = 1/38 (2%)

Query: 386 NINECES-SPCLHGATCIDEVATFSCVCPKGLTGRLCE 422
           +I+EC S +PC +G TC++ V ++ C CP G TGR CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 30.7 bits (70), Expect = 0.095
 Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 5/39 (12%)

Query: 294 CVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGKICE 332
           C   + C NGGTC       T   + C+CP GY G+ CE
Sbjct: 5   CASGNPCQNGGTCVN-----TVGSYRCSCPPGYTGRNCE 38



 Score = 29.9 bits (68), Expect = 0.20
 Identities = 15/31 (48%), Positives = 21/31 (67%), Gaps = 1/31 (3%)

Query: 258 TGDPCK-HGSCVDKRAGYFCDCPPTYGGKNC 287
           +G+PC+  G+CV+    Y C CPP Y G+NC
Sbjct: 7   SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37



 Score = 28.4 bits (64), Expect = 0.72
 Identities = 15/30 (50%), Positives = 16/30 (53%), Gaps = 2/30 (6%)

Query: 67 CTTAPPCLNGATCRPQLTEQLYECVCPPGY 96
          C +  PC NG TC    T   Y C CPPGY
Sbjct: 5  CASGNPCQNGGTCVN--TVGSYRCSCPPGY 32


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 41.5 bits (98), Expect = 1e-05
 Identities = 21/38 (55%), Positives = 24/38 (63%), Gaps = 2/38 (5%)

Query: 348 DFDECLS-NPCFNGATCQNKINGYTCVCAPGYS-GKEC 383
           D DEC S NPC NG TC N +  Y C C PGY+ G+ C
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38



 Score = 37.2 bits (87), Expect = 4e-04
 Identities = 19/39 (48%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 386 NINECES-SPCLHGATCIDEVATFSCVCPKGLT-GRLCE 422
           +I+EC S +PC +G TC++ V ++ C CP G T GR CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 29.9 bits (68), Expect = 0.19
 Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 67  CTTAPPCLNGATCRPQLTEQLYECVCPPGYKEIRDCT 103
           C +  PC NG TC    T   Y C CPPGY + R+C 
Sbjct: 5   CASGNPCQNGGTCVN--TVGSYRCECPPGYTDGRNCE 39



 Score = 26.8 bits (60), Expect = 2.1
 Identities = 15/32 (46%), Positives = 22/32 (68%), Gaps = 2/32 (6%)

Query: 258 TGDPCKHGS-CVDKRAGYFCDCPPTY-GGKNC 287
           +G+PC++G  CV+    Y C+CPP Y  G+NC
Sbjct: 7   SGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 35.1 bits (81), Expect = 0.002
 Identities = 17/29 (58%), Positives = 18/29 (62%)

Query: 353 LSNPCFNGATCQNKINGYTCVCAPGYSGK 381
            SNPC NG TC N    Y CVC PGY+G 
Sbjct: 4   ASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 30.1 bits (68), Expect = 0.15
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 391 ESSPCLHGATCIDEVATFSCVCPKGLTGR 419
            S+PC +G TC++   ++ CVCP G TG 
Sbjct: 4   ASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 29.8 bits (67), Expect = 0.21
 Identities = 18/37 (48%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 67  CTTAPPCLNGATCRPQLTEQLYECVCPPGYKEIRDCT 103
           C  + PC NG TC    T   Y CVCPPGY   R C 
Sbjct: 2   CAASNPCSNGGTCVN--TPGSYRCVCPPGYTGDRSCE 36



 Score = 25.1 bits (55), Expect = 9.9
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 5/36 (13%)

Query: 294 CVGPDTCLNGGTCKPYLVDETQHRFNCTCPSGYHGK 329
           C   + C NGGTC       T   + C CP GY G 
Sbjct: 2   CAASNPCSNGGTCVN-----TPGSYRCVCPPGYTGD 32


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 33.6 bits (77), Expect = 0.009
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 355 NPCFNGATCQNKINGYTCVCAPGYSGK 381
           NPC NG TC +   GYTC C  GY+GK
Sbjct: 5   NPCSNGGTCVDTPGGYTCECPEGYTGK 31



 Score = 28.9 bits (65), Expect = 0.37
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 394 PCLHGATCIDEVATFSCVCPKGLTGR 419
           PC +G TC+D    ++C CP+G TG+
Sbjct: 6   PCSNGGTCVDTPGGYTCECPEGYTGK 31



 Score = 27.8 bits (62), Expect = 1.0
 Identities = 15/34 (44%), Positives = 16/34 (47%), Gaps = 2/34 (5%)

Query: 67  CTTAPPCLNGATCRPQLTEQLYECVCPPGYKEIR 100
           C+   PC NG TC    T   Y C CP GY   R
Sbjct: 1   CSPNNPCSNGGTCVD--TPGGYTCECPEGYTGKR 32



 Score = 26.2 bits (58), Expect = 3.0
 Identities = 14/27 (51%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 261 PCKH-GSCVDKRAGYFCDCPPTYGGKN 286
           PC + G+CVD   GY C+CP  Y GK 
Sbjct: 6   PCSNGGTCVDTPGGYTCECPEGYTGKR 32


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 30.4 bits (69), Expect = 0.13
 Identities = 14/37 (37%), Positives = 16/37 (43%), Gaps = 2/37 (5%)

Query: 348 DFDECLS--NPCFNGATCQNKINGYTCVCAPGYSGKE 382
           D DEC    + C     C N I  + CVC  GY   E
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNE 37


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 27.9 bits (62), Expect = 0.94
 Identities = 18/31 (58%), Positives = 19/31 (61%), Gaps = 2/31 (6%)

Query: 351 ECLS-NPCFNGATCQNKINGYTCVCAPGYSG 380
           EC S  PC NG TC N    YTC C PGY+G
Sbjct: 1   ECASGGPCSNG-TCINTPGSYTCSCPPGYTG 30



 Score = 26.3 bits (58), Expect = 3.8
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 258 TGDPCKHGSCVDKRAGYFCDCPPTYGG 284
           +G PC +G+C++    Y C CPP Y G
Sbjct: 4   SGGPCSNGTCINTPGSYTCSCPPGYTG 30


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 27.5 bits (62), Expect = 1.2
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 361 ATCQNKINGYTCVCAPGYSG 380
           ATC N    +TC C  GY+G
Sbjct: 12  ATCTNTGGSFTCTCKSGYTG 31


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.141    0.505 

Gapped
Lambda     K      H
   0.267   0.0667    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,385,196
Number of extensions: 1806472
Number of successful extensions: 1082
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1059
Number of HSP's successfully gapped: 112
Length of query: 429
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 329
Effective length of database: 6,502,202
Effective search space: 2139224458
Effective search space used: 2139224458
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.0 bits)