RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17761
         (92 letters)



>gnl|CDD|177669 PLN00035, PLN00035, histone H4; Provisional.
          Length = 103

 Score = 56.8 bits (137), Expect = 4e-12
 Identities = 27/34 (79%), Positives = 31/34 (91%)

Query: 35 AGLIYEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
          +GLIYEETRGVLK+ LENVIRDAVTY  HA+R+T
Sbjct: 48 SGLIYEETRGVLKIFLENVIRDAVTYTEHARRKT 81


>gnl|CDD|128694 smart00417, H4, Histone H4. 
          Length = 74

 Score = 54.1 bits (130), Expect = 2e-11
 Identities = 24/34 (70%), Positives = 29/34 (85%)

Query: 35 AGLIYEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
          +GLIY+ETR VLK  LENV+RDAVTY  HA+R+T
Sbjct: 32 SGLIYDETRNVLKSFLENVVRDAVTYTEHARRKT 65


>gnl|CDD|185397 PTZ00015, PTZ00015, histone H4; Provisional.
          Length = 102

 Score = 53.6 bits (129), Expect = 6e-11
 Identities = 20/34 (58%), Positives = 26/34 (76%)

Query: 35 AGLIYEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
          +G IYEE RGVLK  LENV+RD+  Y  +A+R+T
Sbjct: 49 SGDIYEEVRGVLKAFLENVVRDSTAYTEYARRKT 82


>gnl|CDD|238031 cd00076, H4, Histone H4, one of the four histones, along with
          H2A, H2B and H3, which forms the eukaryotic nucleosome
          core; along with H3, it plays a central role in
          nucleosome formation; histones bind to DNA and wrap the
          genetic material into "beads on a string" in which DNA
          (the string) is wrapped around small blobs of histones
          (the beads) at regular intervals; play a role in the
          inheritance of specialized chromosome structures and
          the control of gene activity; defects in the
          establishment of proper chromosome structure by
          histones may activate or silence genes aberrantly and
          thus lead to disease;  the sequence of histone H4 has
          remained almost invariant in more than 2 billion years
          of evolution.
          Length = 85

 Score = 49.4 bits (118), Expect = 2e-09
 Identities = 22/33 (66%), Positives = 26/33 (78%)

Query: 36 GLIYEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
          G +Y+E R VLK  LE+VIRDAVTY  HAKR+T
Sbjct: 33 GGVYDEVRNVLKSYLEDVIRDAVTYTEHAKRKT 65


>gnl|CDD|224947 COG2036, HHT1, Histones H3 and H4 [Chromatin structure and
          dynamics].
          Length = 91

 Score = 35.4 bits (82), Expect = 5e-04
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 34 FAGLIYEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
           +    EE +  L+  LE +  DAV    HAKR+T
Sbjct: 37 VSSSAIEELQEALEEYLEEIAEDAVELAEHAKRKT 71


>gnl|CDD|201020 pfam00125, Histone, Core histone H2A/H2B/H3/H4. 
          Length = 75

 Score = 29.1 bits (66), Expect = 0.11
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 39 YEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
              +  L+ +LE ++ DA    RHAKR+T
Sbjct: 33 PVALQEALEDLLEEILEDAGLLARHAKRKT 62


>gnl|CDD|165730 PLN00163, PLN00163, histone H4; Provisional.
          Length = 59

 Score = 26.2 bits (57), Expect = 1.1
 Identities = 10/12 (83%), Positives = 11/12 (91%)

Query: 35 AGLIYEETRGVL 46
          +GLIYEETR VL
Sbjct: 48 SGLIYEETRTVL 59


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 26.1 bits (57), Expect = 2.6
 Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 8/69 (11%)

Query: 10  AHGLRT-PVSGPPSVVWRVEVESSVFAGLIYEETRGVLKVVLENVIRDAVTYPRHAKRQT 68
           + G++T P +    V+W        +   ++ ++RG  K V++   +    YPR A  + 
Sbjct: 349 SSGVQTFPDAPGSPVIW-------AYCITVFCQSRGTAKAVIKAQKKYEKKYPRSAHLKA 401

Query: 69  GHQRMDSRL 77
              RM   L
Sbjct: 402 SLVRMSRGL 410


>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
          22A1-like.  Aldehyde dehydrogenase family members
          ALDH15A1 (Saccharomyces cerevisiae YHR039C) and
          ALDH22A1 (Arabidopsis thaliana, EC=1.2.1.3), and
          similar sequences, are in this CD. Significant
          improvement of stress tolerance in tobacco plants was
          observed by overexpressing the ALDH22A1 gene from maize
          (Zea mays) and was accompanied by a reduction of
          malondialdehyde  derived from cellular lipid
          peroxidation.
          Length = 465

 Score = 26.1 bits (58), Expect = 3.0
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 39 YEETRGVLKVVLENVIRDAVTYPRHAKRQTGHQRMDSRL 77
          + E R VL+ +L+ ++ +     R A R TG   +D+ L
Sbjct: 39 FAERRKVLRSLLKYILENQEEICRVACRDTGKTMVDASL 77


>gnl|CDD|224515 COG1599, RFA1, Single-stranded DNA-binding replication protein A
           (RPA), large (70 kD) subunit and related ssDNA-binding
           proteins [DNA replication, recombination, and repair].
          Length = 407

 Score = 25.9 bits (57), Expect = 3.2
 Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 1/41 (2%)

Query: 17  VSGPPSVVWRVEV-ESSVFAGLIYEETRGVLKVVLENVIRD 56
           V G     +  +  ES VF+  + +E RGV+     +  +D
Sbjct: 177 VVGSEIKTFDNQGGESKVFSNELEDEERGVIVFTDWDPSQD 217


>gnl|CDD|129039 smart00803, TAF, TATA box binding protein associated factor.
          TAFs (TATA box binding protein associated factors) are
          part of the transcription initiation factor TFIID
          multimeric protein complex. TFIID is composed of the
          TATA box binding protein (TBP) and a number of TAFs.
          The TAFs provide binding sites for many different
          transcriptional activators and co-activators that
          modulate transcription initiation by Pol II. TAF
          proteins adopt a histone-like fold.
          Length = 65

 Score = 24.1 bits (53), Expect = 5.4
 Identities = 6/19 (31%), Positives = 15/19 (78%)

Query: 50 LENVIRDAVTYPRHAKRQT 68
          ++ ++++A+ + RH+KR T
Sbjct: 36 IKEIVQEALKFMRHSKRTT 54


>gnl|CDD|240426 PTZ00466, PTZ00466, actin-like protein; Provisional.
          Length = 380

 Score = 24.9 bits (54), Expect = 8.1
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 29 VESSVFAGLIYEETRGVLKV 48
          VE ++F G   EE RG+LKV
Sbjct: 55 VEGNIFVGNKAEEYRGLLKV 74


>gnl|CDD|205235 pfam13054, DUF3915, Protein of unknown function (DUF3915).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 120 amino acids in
          length.
          Length = 116

 Score = 24.5 bits (53), Expect = 8.8
 Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 1/31 (3%)

Query: 62 RHAKRQTGHQRMDSRLRAKTS-RPHPCTVLA 91
           H   +   +R   R R K   RP  C VLA
Sbjct: 14 CHHHEREDFEREREREREKEPQRPAVCNVLA 44


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.134    0.405 

Gapped
Lambda     K      H
   0.267   0.0768    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,619,747
Number of extensions: 370599
Number of successful extensions: 294
Number of sequences better than 10.0: 1
Number of HSP's gapped: 294
Number of HSP's successfully gapped: 21
Length of query: 92
Length of database: 10,937,602
Length adjustment: 59
Effective length of query: 33
Effective length of database: 8,320,716
Effective search space: 274583628
Effective search space used: 274583628
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (24.1 bits)