RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5613
         (1010 letters)



>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 30.7 bits (70), Expect = 1.0
 Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)

Query: 637 CGPYSQCRDINGSPSCSCLP 656
           CGPY  C D+N SP C+C+ 
Sbjct: 86  CGPYGYC-DVNTSPKCNCIK 104



 Score = 28.8 bits (65), Expect = 5.7
 Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 1/24 (4%)

Query: 900 GSCGQNANCRVINHSPICTCRPGF 923
           G CG    C  +N SP C C  GF
Sbjct: 84  GRCGPYGYC-DVNTSPKCNCIKGF 106


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 27.9 bits (63), Expect = 2.3
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 581 PGTCGQNANCRVINHNPSCTCKAGFTGD 608
            G C  NA C     + +CTCK+G+TGD
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 26.7 bits (60), Expect = 5.9
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 899 PGSCGQNANCRVINHSPICTCRPGFTG 925
            G C  NA C     S  CTC+ G+TG
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTG 31



 Score = 26.3 bits (59), Expect = 8.1
 Identities = 12/28 (42%), Positives = 13/28 (46%), Gaps = 4/28 (14%)

Query: 743 CAPNAVCRDN----VCVCLPDYYGDGYT 766
           C PNA C +      C C   Y GDG T
Sbjct: 8   CHPNATCTNTGGSFTCTCKSGYTGDGVT 35


>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
          Length = 214

 Score = 31.1 bits (70), Expect = 2.4
 Identities = 16/44 (36%), Positives = 19/44 (43%), Gaps = 8/44 (18%)

Query: 213 PVCSCPPGYTGNPFSQCLLPPTPTPTQATPTDPCFPSPCGSNAR 256
           PV    PG           P  PTP +  PTDP   S CG ++R
Sbjct: 176 PVAGGLPGGGA--------PAPPTPPRPGPTDPASESDCGRDSR 211


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 27.6 bits (62), Expect = 2.8
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 629 VNPC-IPSPCGPYSQCRDINGSPSCSCLPNYIG 660
           ++ C   +PC     C +  GS  CSC P Y G
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34


>gnl|CDD|163554 TIGR03842, F420_CPS_4043, F420-dependent oxidoreductase, CPS_4043
           family.  This model represents a family of putative
           F420-dependent oxidoreductases, fairly closely related
           to 5,10-methylenetetrahydromethanopterin reductase (mer,
           TIGR03555), both within the bacterial luciferase-like
           monoxygenase (LLM) family. A fairly deep split (to about
           40 % sequence identity) in the present family separates
           a strictly Actinobacterial clade from an
           alpha/beta/gamma-proteobacterial clade, in which the
           member is often the only apparent F420-dependent LLM
           family member. The specific function, and whether
           Actinobacterial and Proteobacterial clades differ in
           function, are unknown [Unknown function, Enzymes of
           unknown specificity].
          Length = 330

 Score = 31.2 bits (71), Expect = 3.0
 Identities = 17/50 (34%), Positives = 18/50 (36%), Gaps = 13/50 (26%)

Query: 736 QQADPCIC---------APNAVCRD----NVCVCLPDYYGDGYTVCRPEC 772
           Q ADP I          A     RD     VCV  P Y GD     R +C
Sbjct: 170 QLADPDIAKWMIDAVREAAEDAGRDPDSVKVCVAAPAYVGDDLEHQREQC 219


>gnl|CDD|221000 pfam11145, DUF2921, Protein of unknown function (DUF2921).  This
           eukaryotic family of proteins has no known function.
          Length = 871

 Score = 31.3 bits (71), Expect = 3.2
 Identities = 13/60 (21%), Positives = 22/60 (36%), Gaps = 8/60 (13%)

Query: 347 EYRDPCSTTQCGLNAICTVINGAAQCACLLLLQHHIHKNQDMDQYIS--------LGYML 398
             +   S  +  L  I  V++    C    L   H+  + D+  ++S        LGYM 
Sbjct: 582 RRQAKDSIWRMDLEGIMVVVSLTLSCVFAGLQLFHVKSHPDVLPFVSLVMLGVQALGYMT 641


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.139    0.496 

Gapped
Lambda     K      H
   0.267   0.0656    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,829,603
Number of extensions: 4328586
Number of successful extensions: 3830
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3741
Number of HSP's successfully gapped: 220
Length of query: 1010
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 904
Effective length of database: 6,236,078
Effective search space: 5637414512
Effective search space used: 5637414512
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (28.6 bits)