RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9687
         (141 letters)



>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 44.2 bits (105), Expect = 2e-07
 Identities = 21/37 (56%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 55 NIDDCA-PNPCKNGARCVDKVKDYLCECYPGYTGKTC 90
          +ID+CA  NPC+NG  CV+ V  Y C C PGYTG+ C
Sbjct: 1  DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37



 Score = 40.7 bits (96), Expect = 5e-06
 Identities = 20/39 (51%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 16 NIDDCESG-PCLNGGLCKDLLNNYTCNCQQTGYTGPHCE 53
          +ID+C SG PC NGG C + + +Y C+C   GYTG +CE
Sbjct: 1  DIDECASGNPCQNGGTCVNTVGSYRCSCPP-GYTGRNCE 38


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 40.7 bits (96), Expect = 5e-06
 Identities = 22/38 (57%), Positives = 27/38 (71%), Gaps = 2/38 (5%)

Query: 55 NIDDCA-PNPCKNGARCVDKVKDYLCECYPGYT-GKTC 90
          +ID+CA  NPC+NG  CV+ V  Y CEC PGYT G+ C
Sbjct: 1  DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38



 Score = 36.8 bits (86), Expect = 2e-04
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 16 NIDDCES-GPCLNGGLCKDLLNNYTCNCQQTGYTGPHCE 53
          +ID+C S  PC NGG C + + +Y C C      G +CE
Sbjct: 1  DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 35.5 bits (82), Expect = 5e-04
 Identities = 19/32 (59%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 59 CAPN-PCKNGARCVDKVKDYLCECYPGYTGKT 89
          C+PN PC NG  CVD    Y CEC  GYTGK 
Sbjct: 1  CSPNNPCSNGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 28.6 bits (64), Expect = 0.14
 Identities = 15/28 (53%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 24 PCLNGGLCKDLLNNYTCNCQQTGYTGPH 51
          PC NGG C D    YTC C + GYTG  
Sbjct: 6  PCSNGGTCVDTPGGYTCECPE-GYTGKR 32


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 34.0 bits (78), Expect = 0.002
 Identities = 16/30 (53%), Positives = 17/30 (56%)

Query: 60 APNPCKNGARCVDKVKDYLCECYPGYTGKT 89
          A NPC NG  CV+    Y C C PGYTG  
Sbjct: 4  ASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33



 Score = 30.1 bits (68), Expect = 0.041
 Identities = 16/34 (47%), Positives = 18/34 (52%), Gaps = 2/34 (5%)

Query: 21 ESGPCLNGGLCKDLLNNYTCNCQQTGYTGP-HCE 53
           S PC NGG C +   +Y C C   GYTG   CE
Sbjct: 4  ASNPCSNGGTCVNTPGSYRCVCPP-GYTGDRSCE 36


>gnl|CDD|236205 PRK08259, PRK08259, enoyl-CoA hydratase; Provisional.
          Length = 254

 Score = 28.3 bits (64), Expect = 1.2
 Identities = 12/39 (30%), Positives = 16/39 (41%), Gaps = 5/39 (12%)

Query: 107 TCLEHSNMSLYQRSDLPALPKIFAQEFNYST----AEGL 141
           TCL    +S  ++  LP      A EF +      AE L
Sbjct: 201 TCLRADRLSALEQWGLP-EEAALANEFAHGLAVLAAEAL 238


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 25.9 bits (57), Expect = 1.3
 Identities = 13/27 (48%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 61 PNPCKNGARCVDKVKDYLCECYPGYTG 87
            PC NG  C++    Y C C PGYTG
Sbjct: 5  GGPCSNG-TCINTPGSYTCSCPPGYTG 30


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
          EGF-like domain homologues. This family includes the
          C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 24.4 bits (54), Expect = 5.3
 Identities = 13/36 (36%), Positives = 15/36 (41%), Gaps = 5/36 (13%)

Query: 52 CEINIDDCAPNPCKNGARCVDKVKDYLCECYPGYTG 87
          C  N   C PN     A C +    + C C  GYTG
Sbjct: 1  CAENNGGCHPN-----ATCTNTGGSFTCTCKSGYTG 31


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.138    0.468 

Gapped
Lambda     K      H
   0.267   0.0612    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,616,621
Number of extensions: 521367
Number of successful extensions: 293
Number of sequences better than 10.0: 1
Number of HSP's gapped: 290
Number of HSP's successfully gapped: 41
Length of query: 141
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 54
Effective length of database: 7,078,804
Effective search space: 382255416
Effective search space used: 382255416
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 54 (24.8 bits)