RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9821
         (78 letters)



>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
          EGF-like domain homologues. This family includes the
          C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 45.2 bits (108), Expect = 2e-08
 Identities = 18/35 (51%), Positives = 22/35 (62%)

Query: 39 CGLGLHDCHKDAKCTNTHGSYSCQCKRGFHGDGKT 73
          C      CH +A CTNT GS++C CK G+ GDG T
Sbjct: 1  CAENNGGCHPNATCTNTGGSFTCTCKSGYTGDGVT 35


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 43.9 bits (104), Expect = 6e-08
 Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 35 DVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGF-HGDGKTSC 75
          DVDEC  G H+C  +  C NT GS+ C C  G+ + +  T+C
Sbjct: 1  DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 40.7 bits (96), Expect = 9e-07
 Identities = 16/36 (44%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 35 DVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGFHGD 70
          D+DEC  G + C     C NT GSY C C  G+ G 
Sbjct: 1  DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGR 35


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 39.5 bits (93), Expect = 3e-06
 Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 35 DVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGFH 68
          D+DEC  G + C     C NT GSY C+C  G+ 
Sbjct: 1  DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coil domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 40.8 bits (96), Expect = 1e-05
 Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 2/37 (5%)

Query: 31  SLCPDVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGF 67
            +C   D C    H C +   C +T GSY C C  G+
Sbjct: 182 KICVVPDLCATLSHVCQQ--VCISTPGSYLCACTEGY 216


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) and present in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at least one
          is present in most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 32.8 bits (75), Expect = 0.001
 Identities = 12/28 (42%), Positives = 14/28 (50%)

Query: 44 HDCHKDAKCTNTHGSYSCQCKRGFHGDG 71
          + C     C NT GSY C C  G+ GD 
Sbjct: 6  NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 29.7 bits (67), Expect = 0.019
 Identities = 9/25 (36%), Positives = 13/25 (52%)

Query: 46 CHKDAKCTNTHGSYSCQCKRGFHGD 70
          C     C +T G Y+C+C  G+ G 
Sbjct: 7  CSNGGTCVDTPGGYTCECPEGYTGK 31


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 27.5 bits (61), Expect = 0.16
 Identities = 12/27 (44%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 44 HDCHKDAKCTNTHGSYSCQCKRGFHGD 70
            C     C NT GSY+C C  G+ GD
Sbjct: 6  GPCSNG-TCINTPGSYTCSCPPGYTGD 31


>gnl|CDD|119287 pfam10767, DUF2593, Protein of unknown function (DUF2593).  This
          family of proteins appears to be restricted to
          Enterobacteriaceae. Some members in the family are
          annotated as YbjO; however, there is currently no known
          function.
          Length = 144

 Score = 27.8 bits (62), Expect = 0.36
 Identities = 10/33 (30%), Positives = 12/33 (36%), Gaps = 6/33 (18%)

Query: 1  MLCFISDSCAVAVAARHNISVYEEDARWSYSLC 33
          +L  +   C  AV    N        RW Y LC
Sbjct: 56 VLLCLEIRCGFAVLKGRNW------GRWGYLLC 82


>gnl|CDD|225400 COG2844, GlnD, UTP:GlnB (protein PII) uridylyltransferase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 867

 Score = 28.1 bits (63), Expect = 0.40
 Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 6/33 (18%)

Query: 21  VYEEDARW------SYSLCPDVDECGLGLHDCH 47
           V E+D R        Y+L PD+     GL D H
Sbjct: 178 VEEQDERHARYGDTRYNLEPDIKSGPGGLRDLH 210


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           proteins (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 27.4 bits (61), Expect = 0.56
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 46  CHKDAKCTNTHGSYSCQCKRGFHGDG 71
           C ++ +C    G Y C CK GF GDG
Sbjct: 138 CKENEECKLVGGYYECVCKEGFPGDG 163


>gnl|CDD|225459 COG2907, COG2907, Predicted NAD/FAD-binding protein [General
          function prediction only].
          Length = 447

 Score = 26.3 bits (58), Expect = 1.7
 Identities = 7/20 (35%), Positives = 14/20 (70%)

Query: 8  SCAVAVAARHNISVYEEDAR 27
          S A  ++ RH+++++E D R
Sbjct: 22 SAAWLLSRRHDVTLFEADRR 41


>gnl|CDD|182256 PRK10126, PRK10126, tyrosine phosphatase; Provisional.
          Length = 147

 Score = 25.3 bits (55), Expect = 3.3
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 6  SDSCAVAVAARHNISVYEEDARW-SYSLCPDVDECGLGLHDCHKDAKC 52
          +D  A++VAA H +S+    AR  S  LC + D   L +   H +  C
Sbjct: 45 ADPTAISVAAEHQLSLEGHCARQISRRLCRNYDLI-LTMEKRHIERLC 91


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or
          complement Clr-like EGF, domains have six conserved
          cysteine residues disulfide-bonded into the
          characteristic pattern 'ababcc'. They are found in
          blood coagulation proteins such as fibrillin, Clr and
          Cls, thrombomodulin, and the LDL receptor. The core
          fold of the EGF domain consists of two small
          beta-hairpins packed against each other. Two major
          structural variants have been identified based on the
          structural context of the C-terminal cysteine residue
          of disulfide 'c' in the C-terminal hairpin: hEGFs and
          cEGFs. In cEGFs the C-terminal thiol resides on the
          C-terminal beta-sheet, resulting in long loop-lengths
          between the cysteine residues of disulfide 'c',
          typically C[10+]XC. These longer loop-lengths may have
          arisen by selective cysteine loss from a four-disulfide
          EGF template such as laminin or integrin. Tandem cEGF
          domains have five linking residues between terminal
          cysteines of adjacent domains. cEGF domains may or may
          not bind calcium in the linker region. cEGF domains
          with the consensus motif CXN4X[F,Y]XCXC are
          hydroxylated exclusively on the asparagine residue.
          Length = 24

 Score = 22.8 bits (50), Expect = 6.8
 Identities = 7/20 (35%), Positives = 10/20 (50%), Gaps = 1/20 (5%)

Query: 58 SYSCQCKRGFHGDG-KTSCT 76
          SY+C C  G+   G   +C 
Sbjct: 1  SYTCSCPPGYQLSGDGRTCE 20


>gnl|CDD|233157 TIGR00863, P2X, cation transporter protein.  ATP-gated Cation
           Channel (ACC) Family (TC 1.A.7). Members of the ACC family
           (also called P2X receptors) respond to ATP, a functional
           neurotransmitter released by exocytosis from many types
           of neurons. These channels, which function at
           neuron-neuron and neuron-smooth muscle junctions, may
           play roles in the control of blood pressure and pain
           sensation. They may also function in lymphocyte and
           platelet physiology. They are found only in animals. ACC
           channels are probably hetero- or homomultimers and
           transport small monovalent cations (Me+). Some also
           transport Ca2+; a few also transport small metabolites
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 372

 Score = 24.3 bits (53), Expect = 8.7
 Identities = 13/49 (26%), Positives = 20/49 (40%), Gaps = 9/49 (18%)

Query: 31  SLCPDVDECGLGLHDCHKDAKC------TNTHGSYSCQCKRGFHGDGKT 73
             CP+     L    C  D+ C      T+ +G  + +C   F+G  KT
Sbjct: 116 GRCPEHPSVPLA--ICWSDSDCTAGEAGTHGNGIKTGRCVP-FNGTVKT 161


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.134    0.464 

Gapped
Lambda     K      H
   0.267   0.0632    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,586,744
Number of extensions: 247839
Number of successful extensions: 186
Number of sequences better than 10.0: 1
Number of HSP's gapped: 184
Number of HSP's successfully gapped: 23
Length of query: 78
Length of database: 10,937,602
Length adjustment: 47
Effective length of query: 31
Effective length of database: 8,852,964
Effective search space: 274441884
Effective search space used: 274441884
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 53 (24.4 bits)