RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13148
         (243 letters)



>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 30.3 bits (69), Expect = 0.26
 Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)

Query: 53  CGPYSQCRDINGSPSCSCLP 72
           CGPY  C D+N SP C+C+ 
Sbjct: 86  CGPYGYC-DVNTSPKCNCIK 104



 Score = 27.2 bits (61), Expect = 3.4
 Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 1/24 (4%)

Query: 108 GSCGQGAQCRVINHSPVCYCPDGF 131
           G CG    C  +N SP C C  GF
Sbjct: 84  GRCGPYGYC-DVNTSPKCNCIKGF 106


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 28.0 bits (63), Expect = 0.51
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 45 VNPC-IPSPCGPYSQCRDINGSPSCSCLPNYIG 76
          ++ C   +PC     C +  GS  CSC P Y G
Sbjct: 2  IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34


>gnl|CDD|163554 TIGR03842, F420_CPS_4043, F420-dependent oxidoreductase, CPS_4043
           family.  This model represents a family of putative
           F420-dependent oxidoreductases, fairly closely related
           to 5,10-methylenetetrahydromethanopterin reductase (mer,
           TIGR03555), both within the bacterial luciferase-like
           monoxygenase (LLM) family. A fairly deep split (to about
           40 % sequence identity) in the present family separates
           a strictly Actinobacterial clade from an
           alpha/beta/gamma-proteobacterial clade, in which the
           member is often the only apparent F420-dependent LLM
           family member. The specific function, and whether
           Actinobacterial and Proteobacterial clades differ in
           function, are unknown [Unknown function, Enzymes of
           unknown specificity].
          Length = 330

 Score = 30.8 bits (70), Expect = 0.56
 Identities = 17/50 (34%), Positives = 18/50 (36%), Gaps = 13/50 (26%)

Query: 152 QQADPCIC---------APNAVCRD----NVCVCLPDYYGDGYTVCRPEC 188
           Q ADP I          A     RD     VCV  P Y GD     R +C
Sbjct: 170 QLADPDIAKWMIDAVREAAEDAGRDPDSVKVCVAAPAYVGDDLEHQREQC 219


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.7 bits (60), Expect = 1.4
 Identities = 12/28 (42%), Positives = 13/28 (46%), Gaps = 4/28 (14%)

Query: 159 CAPNAVCRDN----VCVCLPDYYGDGYT 182
           C PNA C +      C C   Y GDG T
Sbjct: 8   CHPNATCTNTGGSFTCTCKSGYTGDGVT 35


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 25.9 bits (57), Expect = 2.5
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 50 PSPCGPYSQCRDINGSPSCSCLPNYIG 76
           +PC     C +  GS  C C P Y G
Sbjct: 5  SNPCSNGGTCVNTPGSYRCVCPPGYTG 31



 Score = 25.9 bits (57), Expect = 3.0
 Identities = 10/28 (35%), Positives = 11/28 (39%)

Query: 107 PGSCGQGAQCRVINHSPVCYCPDGFIGD 134
              C  G  C     S  C CP G+ GD
Sbjct: 5   SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 28.2 bits (63), Expect = 4.1
 Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 3/24 (12%)

Query: 8  VINHNPSCTCKAGFTGD--PRVFC 29
          VI  N S T KAGF G+  P    
Sbjct: 10 VI-DNGSGTTKAGFAGNDTPTTVF 32


>gnl|CDD|214592 smart00268, ACTIN, Actin.  ACTIN subfamily of
          ACTIN/mreB/sugarkinase/Hsp70 superfamily.
          Length = 373

 Score = 28.0 bits (63), Expect = 4.4
 Identities = 10/23 (43%), Positives = 14/23 (60%), Gaps = 2/23 (8%)

Query: 8  VINHNPSCTCKAGFTGD--PRVF 28
          ++  N S T KAGF G+  P+V 
Sbjct: 4  IVIDNGSGTIKAGFAGEDFPQVV 26


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 28.0 bits (62), Expect = 5.3
 Identities = 13/46 (28%), Positives = 17/46 (36%)

Query: 33   PPPPPQESPPEYVNPCIPSPCGPYSQCRDINGSPSCSCLPNYIGAP 78
            PPPP  E  P  +    P P GP +  +     P+    P     P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP 2748


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 24.9 bits (55), Expect = 5.7
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 50 PSPCGPYSQCRDINGSPSCSCLPNY 74
           +PC     C +  GS  C C P Y
Sbjct: 8  GNPCQNGGTCVNTVGSYRCECPPGY 32


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.140    0.499 

Gapped
Lambda     K      H
   0.267   0.0758    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,828,189
Number of extensions: 1015189
Number of successful extensions: 1067
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1038
Number of HSP's successfully gapped: 84
Length of query: 243
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 149
Effective length of database: 6,768,326
Effective search space: 1008480574
Effective search space used: 1008480574
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.1 bits)