RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13158
         (289 letters)



>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 29.9 bits (68), Expect = 0.12
 Identities = 12/27 (44%), Positives = 13/27 (48%)

Query: 17 PSPCGPYSECRNINGGPSCSCRPGYIG 43
           +PC     C N  G   CSC PGY G
Sbjct: 8  GNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 26.1 bits (58), Expect = 2.8
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 167 VNPCV-PSPCGLYSQCRDIGGSPSCSCLPNYIG 198
           ++ C   +PC     C +  GS  CSC P Y G
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 28.6 bits (64), Expect = 0.30
 Identities = 11/28 (39%), Positives = 12/28 (42%)

Query: 17 PSPCGPYSECRNINGGPSCSCRPGYIGS 44
           +PC     C N  G   C C PGY G 
Sbjct: 5  SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 25.5 bits (56), Expect = 4.0
 Identities = 11/33 (33%), Positives = 13/33 (39%), Gaps = 2/33 (6%)

Query: 224 CADPCPGSCGYNAQCKVINHTPTCTCPDGFIGD 256
           CA   P  C     C     +  C CP G+ GD
Sbjct: 2   CAASNP--CSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 24.7 bits (54), Expect = 9.0
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 172 PSPCGLYSQCRDIGGSPSCSCLPNYIG 198
            +PC     C +  GS  C C P Y G
Sbjct: 5   SNPCSNGGTCVNTPGSYRCVCPPGYTG 31


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 27.6 bits (62), Expect = 0.95
 Identities = 10/25 (40%), Positives = 11/25 (44%)

Query: 17 PSPCGPYSECRNINGGPSCSCRPGY 41
           +PC     C N  G   C C PGY
Sbjct: 8  GNPCQNGGTCVNTVGSYRCECPPGY 32


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.7 bits (60), Expect = 1.7
 Identities = 12/31 (38%), Positives = 15/31 (48%)

Query: 116 PGSCGYNAECKVINHNPICSCSQGYIGDGFS 146
            G C  NA C     +  C+C  GY GDG +
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTGDGVT 35



 Score = 26.7 bits (60), Expect = 1.8
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 19 PCGPYSECRNINGGPSCSCRPGYIG 43
           C P + C N  G  +C+C+ GY G
Sbjct: 7  GCHPNATCTNTGGSFTCTCKSGYTG 31



 Score = 26.0 bits (58), Expect = 3.1
 Identities = 12/28 (42%), Positives = 14/28 (50%)

Query: 229 PGSCGYNAQCKVINHTPTCTCPDGFIGD 256
            G C  NA C     + TCTC  G+ GD
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTGD 32


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 35/131 (26%), Positives = 45/131 (34%), Gaps = 23/131 (17%)

Query: 19  PCGPYSECRNINGGPS-----CSCRPGYIGSPPNCRPECVMNSECPSHEACIKIPECIQN 73
            CG Y+ C N           C C  GY  S   C P    N  C S        +CI +
Sbjct: 51  VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVCGSG-------KCIVD 103

Query: 74  SECPYDKACIREKCVDPCPGSCGYGAVCTVINHSNEKCQDPCPGSCGYNAECKVINHNPI 133
              P +  C           SC  G V        +  +  C   C  N ECK++     
Sbjct: 104 PANPNNTTC-----------SCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYE 152

Query: 134 CSCSQGYIGDG 144
           C C +G+ GDG
Sbjct: 153 CVCKEGFPGDG 163



 Score = 27.8 bits (62), Expect = 6.2
 Identities = 27/113 (23%), Positives = 37/113 (32%), Gaps = 31/113 (27%)

Query: 174 PCGLYSQCRDIGGSPS-----CSCLPNYIGAPPNCRPECLQNSECPNDKACIRE------ 222
            CG Y+ C +           C C+  Y  +   C P    N  C + K CI +      
Sbjct: 51  VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVCGSGK-CIVDPANPNN 109

Query: 223 ---------------KCADP----CPGSCGYNAQCKVINHTPTCTCPDGFIGD 256
                          KC       C   C  N +CK++     C C +GF GD
Sbjct: 110 TTCSCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYECVCKEGFPGD 162


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 28.2 bits (62), Expect = 5.1
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 150 PKPPEVPPPPQQDVQE 165
           PK P  PPPP+Q  +E
Sbjct: 210 PKAPSSPPPPKQSAKE 225


>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 26.9 bits (60), Expect = 6.9
 Identities = 10/24 (41%), Positives = 12/24 (50%), Gaps = 1/24 (4%)

Query: 230 GSCGYNAQCKVINHTPTCTCPDGF 253
           G CG    C  +N +P C C  GF
Sbjct: 84  GRCGPYGYC-DVNTSPKCNCIKGF 106


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.142    0.513 

Gapped
Lambda     K      H
   0.267   0.0656    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,064,013
Number of extensions: 1222572
Number of successful extensions: 1184
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1161
Number of HSP's successfully gapped: 110
Length of query: 289
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 193
Effective length of database: 6,679,618
Effective search space: 1289166274
Effective search space used: 1289166274
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.3 bits)