RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2457
         (189 letters)



>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.3 bits (59), Expect = 1.4
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 142 CGINAQCTARNHVATCSCPAGYQGD 166
           C  NA CT      TC+C +GY GD
Sbjct: 8   CHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 24.4 bits (54), Expect = 7.6
 Identities = 11/33 (33%), Positives = 11/33 (33%), Gaps = 5/33 (15%)

Query: 75  PNPCGENAKCQPGYDKSGKDRPVCTCLPGYVGD 107
              C  NA C             CTC  GY GD
Sbjct: 5   NGGCHPNATCTNTGGSF-----TCTCKSGYTGD 32


>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 26.9 bits (60), Expect = 3.0
 Identities = 11/38 (28%), Positives = 14/38 (36%), Gaps = 7/38 (18%)

Query: 69  PHDLCEP-NPCGENAKCQPGYDKSGKDRPVCTCLPGYV 105
           P D C+    CG    C           P C C+ G+V
Sbjct: 76  PKDQCDVYGRCGPYGYCDV------NTSPKCNCIKGFV 107


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 25.1 bits (55), Expect = 4.6
 Identities = 10/30 (33%), Positives = 12/30 (40%)

Query: 137 ACTSQCGINAQCTARNHVATCSCPAGYQGD 166
           A ++ C     C        C CP GY GD
Sbjct: 3   AASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 26.6 bits (59), Expect = 7.4
 Identities = 33/145 (22%), Positives = 49/145 (33%), Gaps = 32/145 (22%)

Query: 50  CSCPPGY------TGDPLTQCRRFDPHDLCEPNPCGENAKCQPGYDKSGKDRPVCTCLPG 103
           C C  GY      T +   +C + +  +      CGE A C    +K+ +    C C+ G
Sbjct: 22  CKCNEGYVLKNENTCEEKVKCDKLENVN----KVCGEYATCINQANKAEEKALKCGCING 77

Query: 104 YVGDA----LTYCRRGEC-------------QSDAECNYDQVCN-NYNCEK----ACTSQ 141
           Y           C    C              +   CN  +V + N  C K     C+ +
Sbjct: 78  YTLSQGVCVPNKCNNKVCGSGKCIVDPANPNNTTCSCNIGKVPDQNGKCTKTGETKCSLK 137

Query: 142 CGINAQCTARNHVATCSCPAGYQGD 166
           C  N +C        C C  G+ GD
Sbjct: 138 CKENEECKLVGGYYECVCKEGFPGD 162


>gnl|CDD|235531 PRK05605, PRK05605, long-chain-fatty-acid--CoA ligase; Validated.
          Length = 573

 Score = 26.9 bits (60), Expect = 8.2
 Identities = 14/29 (48%), Positives = 15/29 (51%), Gaps = 2/29 (6%)

Query: 54  PGYTGDPL--TQCRRFDPHDLCEPNPCGE 80
           PGY G P   T+ R  DP D  E  P GE
Sbjct: 388 PGYVGVPFPDTEVRIVDPEDPDETMPDGE 416


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.137    0.471 

Gapped
Lambda     K      H
   0.267   0.0700    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,076,481
Number of extensions: 742582
Number of successful extensions: 540
Number of sequences better than 10.0: 1
Number of HSP's gapped: 524
Number of HSP's successfully gapped: 63
Length of query: 189
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 98
Effective length of database: 6,901,388
Effective search space: 676336024
Effective search space used: 676336024
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 56 (25.4 bits)