RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5614
         (98 letters)



>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 29.4 bits (66), Expect = 0.27
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 9/35 (25%)

Query: 17  CSCLPNYIGAPPNCRPE---CVMNSECASNLACIN 48
           C+CL         CR E   C+ N +CA+N+AC+ 
Sbjct: 114 CACLLK------ECRLELAKCIANPKCAANVACLQ 142


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
          EGF-like domain homologues. This family includes the
          C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 24.8 bits (55), Expect = 2.0
 Identities = 12/28 (42%), Positives = 13/28 (46%)

Query: 56 PGSCGYNAQCKVINHTPICTCPEGYIGD 83
           G C  NA C     +  CTC  GY GD
Sbjct: 5  NGGCHPNATCTNTGGSFTCTCKSGYTGD 32


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 24.7 bits (54), Expect = 2.6
 Identities = 9/28 (32%), Positives = 10/28 (35%)

Query: 56 PGSCGYNAQCKVINHTPICTCPEGYIGD 83
             C     C     +  C CP GY GD
Sbjct: 5  SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|215663 pfam00026, Asp, Eukaryotic aspartyl protease.  Aspartyl (acid)
          proteases include pepsins, cathepsins, and renins.
          Two-domain structure, probably arising from ancestral
          duplication. This family does not include the
          retroviral nor retrotransposon proteases (pfam00077),
          which are much smaller and appear to be homologous to a
          single domain of the eukaryotic asp proteases.
          Length = 316

 Score = 26.1 bits (58), Expect = 2.8
 Identities = 7/20 (35%), Positives = 10/20 (50%)

Query: 35 VMNSECASNLACINEKCRDP 54
          V +  C S+ AC +    DP
Sbjct: 28 VPSVYCTSSYACKSHGTFDP 47


>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein.  The N-terminal
           end matches very strongly a pfam Mg_chelatase domain
           [Unknown function, General].
          Length = 499

 Score = 26.0 bits (57), Expect = 4.0
 Identities = 6/20 (30%), Positives = 9/20 (45%)

Query: 76  CPEGYIGDPFSSCFPKPPER 95
           CP G+ G   + C   P + 
Sbjct: 348 CPCGHYGGKNTHCRCSPQQI 367


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 23.9 bits (52), Expect = 4.3
 Identities = 9/25 (36%), Positives = 9/25 (36%)

Query: 59 CGYNAQCKVINHTPICTCPEGYIGD 83
          C     C        C CPEGY G 
Sbjct: 7  CSNGGTCVDTPGGYTCECPEGYTGK 31


>gnl|CDD|165621 PLN00043, PLN00043, elongation factor 1-alpha; Provisional.
          Length = 447

 Score = 25.8 bits (56), Expect = 4.3
 Identities = 11/51 (21%), Positives = 19/51 (37%), Gaps = 8/51 (15%)

Query: 53  DPCPGSCGYNAQCKVINH--------TPICTCPEGYIGDPFSSCFPKPPER 95
           DP   +  + +Q  ++NH         P+  C   +I   F+    K   R
Sbjct: 320 DPAKEAANFTSQVIIMNHPGQIGNGYAPVLDCHTSHIAVKFAEILTKIDRR 370


>gnl|CDD|188457 TIGR03942, sulfatase_rSAM, anaerobic sulfatase-maturating enzyme.
           Members of this protein family are radical SAM family
           enzymes, maturases that prepare the oxygen-sensitive
           radical required in the active site of anaerobic
           sulfatases. This maturase role has led to many
           misleading legacy annotations suggesting that this
           enzyme maturase is instead a sulfatase regulatory
           protein. All members of the seed alignment are radical
           SAM enzymes encoded next to or near an anaerobic
           sulfatase. Note that a single genome may encode more
           than one sulfatase/maturase pair [Protein fate, Protein
           modification and repair].
          Length = 363

 Score = 24.9 bits (55), Expect = 9.8
 Identities = 8/28 (28%), Positives = 12/28 (42%), Gaps = 3/28 (10%)

Query: 18  SCLPNYIGAPPNCRPECVMNSECASNLA 45
           + L  ++G P      CV +  C  NL 
Sbjct: 235 NALAIWLGNPS---QSCVHSPTCGQNLV 259


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.137    0.515 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,562,252
Number of extensions: 314419
Number of successful extensions: 274
Number of sequences better than 10.0: 1
Number of HSP's gapped: 269
Number of HSP's successfully gapped: 48
Length of query: 98
Length of database: 10,937,602
Length adjustment: 64
Effective length of query: 34
Effective length of database: 8,098,946
Effective search space: 275364164
Effective search space used: 275364164
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.2 bits)