RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9688
         (181 letters)



>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 35.3 bits (82), Expect = 8e-04
 Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 42 VDHCLT-DPCLGHGTCINRSDRYQCLCTPRYSGNNCE 77
          +D C + +PC   GTC+N    Y+C C P Y+G NCE
Sbjct: 2  IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 27.6 bits (62), Expect = 0.55
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 113 TSNPCLNDCVCVDDGNLSFYCNCTEDFTGEYCQ 145
           + NPC N   CV+    S+ C+C   +TG  C+
Sbjct: 7   SGNPCQNGGTCVNTVG-SYRCSCPPGYTGRNCE 38


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 29.9 bits (68), Expect = 0.067
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 42 VDHCLT-DPCLGHGTCINRSDRYQCLCTPRYS-GNNCE 77
          +D C + +PC   GTC+N    Y+C C P Y+ G NCE
Sbjct: 2  IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 28.9 bits (65), Expect = 0.16
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 49 PCLGHGTCINRSDRYQCLCTPRYSGNN 75
          PC   GTC++    Y C C   Y+G  
Sbjct: 6  PCSNGGTCVDTPGGYTCECPEGYTGKR 32


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 27.8 bits (62), Expect = 0.47
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 48 DPCLGHGTCINRSDRYQCLCTPRYSGNN 75
          +PC   GTC+N    Y+C+C P Y+G+ 
Sbjct: 6  NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33



 Score = 24.4 bits (53), Expect = 8.4
 Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 113 TSNPCLNDCVCVDDGNLSFYCNCTEDFTGE 142
            SNPC N   CV+    S+ C C   +TG+
Sbjct: 4   ASNPCSNGGTCVNTPG-SYRCVCPPGYTGD 32


>gnl|CDD|221873 pfam12955, DUF3844, Domain of unknown function (DUF3844).  This
          presumed domain is found in fungal species. It contains
          8 largely conserved cysteine residues. This domain is
          found in proteins that are thought to be found in the
          endoplasmic reticulum.
          Length = 103

 Score = 28.4 bits (64), Expect = 0.83
 Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 18/50 (36%)

Query: 47 TDPCLGHGTCIN-----RSDRYQCLCTP-------------RYSGNNCEK 78
          T+ C GHG+C+        D Y C C P             R+ G  C+K
Sbjct: 12 TNSCSGHGSCVKKSKSKGGDCYACKCKPTVVRTGSDKGKTTRWGGPACQK 61


>gnl|CDD|217384 pfam03137, OATP, Organic Anion Transporter Polypeptide (OATP)
           family.  This family consists of several eukaryotic
           Organic-Anion-Transporting Polypeptides (OATPs). Several
           have been identified mostly in human and rat. Different
           OATPs vary in tissue distribution and substrate
           specificity. Since the numbering of different OATPs in
           particular species was based originally on the order of
           discovery, similarly numbered OATPs in humans and rats
           did not necessarily correspond in function, tissue
           distribution and substrate specificity (in spite of the
           name, some OATPs also transport organic cations and
           neutral molecules). Thus, Tamai et al. initiated the
           current scheme of using digits for rat OATPs and letters
           for human ones. Prostaglandin transporter (PGT) proteins
           are also considered to be OATP family members. In
           addition, the methotrexate transporter OATK is closely
           related to OATPs. This family also includes several
           predicted proteins from Caenorhabditis elegans and
           Drosophila melanogaster. This similarity was not
           previously noted. Note: Members of this family are
           described (in the Swiss-Prot database) as belonging to
           the SLC21 family of transporters.
          Length = 582

 Score = 28.8 bits (65), Expect = 1.5
 Identities = 14/70 (20%), Positives = 23/70 (32%), Gaps = 9/70 (12%)

Query: 114 SNPCLNDCVC--------VDDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQNNATCV 165
            + C  DC C          D  L++   C    +      + S +  + N C +  +  
Sbjct: 422 LSSCNEDCSCDTSFFPPVCGDNGLTYLSPCHAGSSSSGTGCDTSCSTWSNN-CSSGNSHS 480

Query: 166 ASPGEKQITC 175
           AS G     C
Sbjct: 481 ASKGYCPSDC 490


>gnl|CDD|149504 pfam08475, Baculo_VP91_N, Viral capsid protein 91 N-terminal.  This
           domain is found in Baculoviridae including the
           nucleopolyhedrovirus at the N-terminus of the viral
           capsid protein 91 (VP91).
          Length = 185

 Score = 27.2 bits (61), Expect = 4.2
 Identities = 12/37 (32%), Positives = 14/37 (37%), Gaps = 6/37 (16%)

Query: 124 VDDGNLSFYCNCTEDFTGEYCQFENSAACVTLNPCQN 160
            DDG +   C   E F G   Q      CV + PC  
Sbjct: 119 GDDGWVEMECPANERFDGNQLQ------CVPIPPCDG 149


>gnl|CDD|239875 cd04412, NDPk7B, Nucleoside diphosphate kinase 7 domain B (NDPk7B):
           The nm23-H7 class of nucleoside diphosphate kinase
           (NDPk7) consists of an N-terminal DM10 domain and two
           functional catalytic NDPk modules, NDPk7A and NDPk7B.
           The function of the DM10 domain, which also occurs in
           multiple copies in other proteins, is unknown. NDPk7 is
           predominantly expressed in testes, although appreciable
           amount are also found in liver, heart, brain, ovary,
           small intestine and spleen. The nm23-H7 gene is located
           in or near the hereditary prostrate cancer
           susceptibility locus. Nm23-H7 may be involved in the
           development of colon and gastric carcinoma, the latter
           possibly in a type-specific manner.
          Length = 134

 Score = 25.7 bits (57), Expect = 9.2
 Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 1/23 (4%)

Query: 97  EVYKGRYWELPEIRDC-TSNPCL 118
           EVYKG   ELP + D  TS PC+
Sbjct: 49  EVYKGVVPELPAMVDELTSGPCI 71


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.137    0.475 

Gapped
Lambda     K      H
   0.267   0.0612    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,172,879
Number of extensions: 663451
Number of successful extensions: 450
Number of sequences better than 10.0: 1
Number of HSP's gapped: 442
Number of HSP's successfully gapped: 47
Length of query: 181
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 90
Effective length of database: 6,901,388
Effective search space: 621124920
Effective search space used: 621124920
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 56 (25.6 bits)