RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15398
         (88 letters)



>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 40.4 bits (94), Expect = 3e-05
 Identities = 20/26 (76%), Positives = 22/26 (84%)

Query: 54  VRGGGGVEGGPGKEEEEEEEEEEEEE 79
           V GGGG +GG  +EEEEEEEEEEEEE
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEE 877



 Score = 27.7 bits (61), Expect = 0.82
 Identities = 13/21 (61%), Positives = 16/21 (76%)

Query: 59  GVEGGPGKEEEEEEEEEEEEE 79
           GV+GG G +  + EEEEEEEE
Sbjct: 851 GVDGGGGSDGGDSEEEEEEEE 871


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 34.3 bits (79), Expect = 0.004
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 58  GGVEGGPGKEEEEEEEEEEEEENGPQ 83
              E    +EE+EEEEEEE+E+ GP 
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPS 409



 Score = 25.1 bits (55), Expect = 6.9
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 52  YIVRGGGGVEGGPGKEEEEEEEEEEEEE 79
           YI+     +E      E ++EEEE+EEE
Sbjct: 371 YILDAFSALEIEDANTERDDEEEEDEEE 398


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
          family includes archaebacterial L12, eukaryotic P0, P1
          and P2.
          Length = 88

 Score = 31.4 bits (72), Expect = 0.014
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                     EEE++EEEEEEEE
Sbjct: 56 AAAAAAAAAAAEEEKKEEEEEEEE 79



 Score = 28.0 bits (63), Expect = 0.28
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                    +EE++EEEEEEEE+
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEED 80



 Score = 26.0 bits (58), Expect = 1.3
 Identities = 9/16 (56%), Positives = 15/16 (93%)

Query: 66 KEEEEEEEEEEEEENG 81
          +++EEEEEEEE+++ G
Sbjct: 69 EKKEEEEEEEEDDDMG 84


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 31.1 bits (70), Expect = 0.047
 Identities = 13/21 (61%), Positives = 16/21 (76%)

Query: 59  GVEGGPGKEEEEEEEEEEEEE 79
           G +    ++EEEEEEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 30.2 bits (68), Expect = 0.086
 Identities = 15/23 (65%), Positives = 16/23 (69%)

Query: 57  GGGVEGGPGKEEEEEEEEEEEEE 79
           G  VE    KE+EEEEE EEEEE
Sbjct: 257 GDYVEEKELKEDEEEEETEEEEE 279


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 29.9 bits (67), Expect = 0.11
 Identities = 6/30 (20%), Positives = 7/30 (23%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEENGPQTT 85
                      E E+E  E E E       
Sbjct: 221 QAPSPNTQQAVEHEDEPTEPEREGPPFPGH 250



 Score = 24.9 bits (54), Expect = 6.9
 Identities = 5/29 (17%), Positives = 10/29 (34%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEENGPQT 84
                +    ++E  E E E     G ++
Sbjct: 224 SPNTQQAVEHEDEPTEPEREGPPFPGHRS 252



 Score = 24.5 bits (53), Expect = 9.3
 Identities = 5/26 (19%), Positives = 5/26 (19%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEENG 81
             G     P    EE       E   
Sbjct: 99  SVGSPTPSPSGSAEELASGLSPENTS 124



 Score = 24.5 bits (53), Expect = 9.3
 Identities = 8/31 (25%), Positives = 10/31 (32%), Gaps = 4/31 (12%)

Query: 56  GGGGVEGGPGKEE----EEEEEEEEEEENGP 82
                +  P        E E+E  E E  GP
Sbjct: 215 QSPTPQQAPSPNTQQAVEHEDEPTEPEREGP 245


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 30.0 bits (68), Expect = 0.12
 Identities = 14/24 (58%), Positives = 16/24 (66%)

Query: 64  PGKEEEEEEEEEEEEENGPQTTFE 87
             +EEEE  EEEEEEE   + TFE
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERTFE 331



 Score = 28.5 bits (64), Expect = 0.40
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 61  EGGPGKEEEEEEEEEEEEE 79
           E    +EE  EEEEEEEEE
Sbjct: 307 EEEEEEEEVPEEEEEEEEE 325



 Score = 26.9 bits (60), Expect = 1.4
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 61  EGGPGKEEEEEEEEEEEEE 79
           E    +  EEEEEEEEEEE
Sbjct: 309 EEEEEEVPEEEEEEEEEEE 327



 Score = 25.0 bits (55), Expect = 7.0
 Identities = 12/19 (63%), Positives = 12/19 (63%)

Query: 61  EGGPGKEEEEEEEEEEEEE 79
           E      EEEEEEEEEEE 
Sbjct: 310 EEEEEVPEEEEEEEEEEER 328



 Score = 24.6 bits (54), Expect = 8.9
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 61  EGGPGKEEEEEEEEEEEEE 79
           E    +E  EEEEEEEEEE
Sbjct: 308 EEEEEEEVPEEEEEEEEEE 326


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.0 bits (67), Expect = 0.14
 Identities = 5/20 (25%), Positives = 15/20 (75%)

Query: 65  GKEEEEEEEEEEEEENGPQT 84
             +E+E+E++++EE++  + 
Sbjct: 155 DDDEDEDEDDDDEEDDEEEE 174



 Score = 29.2 bits (65), Expect = 0.23
 Identities = 7/29 (24%), Positives = 18/29 (62%)

Query: 60  VEGGPGKEEEEEEEEEEEEENGPQTTFEV 88
           V     ++E+E++++EE++E   +   E+
Sbjct: 152 VIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180



 Score = 25.8 bits (56), Expect = 3.3
 Identities = 7/25 (28%), Positives = 16/25 (64%)

Query: 55  RGGGGVEGGPGKEEEEEEEEEEEEE 79
           R    ++     E+E++++EE++EE
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEE 172



 Score = 25.4 bits (55), Expect = 4.6
 Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 7/32 (21%)

Query: 58  GGVEGGPGKEEEEEEEEEE-------EEENGP 82
              +    +++EEEEEEEE       E+E   
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
          represents the eukaryotic large ribosomal protein P1.
          Eukaryotic P1 and P2 are functionally equivalent to the
          bacterial protein L7/L12, but are not homologous to
          L7/L12. P1 is located in the L12 stalk, with proteins
          P2, P0, L11, and 28S rRNA. P1 and P2 are the only
          proteins in the ribosome to occur as multimers, always
          appearing as sets of heterodimers. Recent data indicate
          that eukaryotes have four copies (two heterodimers),
          while most archaeal species contain six copies of L12p
          (three homodimers) and bacteria may have four or six
          copies (two or three homodimers), depending on the
          species. Experiments using S. cerevisiae P1 and P2
          indicate that P1 proteins are positioned more
          internally with limited reactivity in the C-terminal
          domains, while P2 proteins seem to be more externally
          located and are more likely to interact with other
          cellular components. In lower eukaryotes, P1 and P2 are
          further subdivided into P1A, P1B, P2A, and P2B, which
          form P1A/P2B and P1B/P2A heterodimers. Some plant
          species have a third P-protein, called P3, which is not
          homologous to P1 and P2. In humans, P1 and P2 are
          strongly autoimmunogenic. They play a significant role
          in the etiology and pathogenesis of systemic lupus
          erythema (SLE). In addition, the ribosome-inactivating
          protein trichosanthin (TCS) interacts with human P0,
          P1, and P2, with its primary binding site located in
          the C-terminal region of P2. TCS inactivates the
          ribosome by depurinating a specific adenine in the
          sarcin-ricin loop of 28S rRNA.
          Length = 103

 Score = 28.8 bits (65), Expect = 0.15
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
                    K+EE++EEEEEE ++
Sbjct: 72 AAAAAAAAEAKKEEKKEEEEEESDD 96



 Score = 27.3 bits (61), Expect = 0.69
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
                    KEE++EEEEEE +++
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEESDDD 97



 Score = 26.9 bits (60), Expect = 0.74
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                    + ++EE++EEEEEE
Sbjct: 70 AAAAAAAAAAEAKKEEKKEEEEEE 93



 Score = 26.9 bits (60), Expect = 0.76
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
                     ++EE++EEEEEE +
Sbjct: 71 AAAAAAAAAEAKKEEKKEEEEEESD 95


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 29.0 bits (65), Expect = 0.18
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEE 79
           G  G   G G   +EEEEEEEE E
Sbjct: 83  GPAGTTSGTGHTRQEEEEEEEENE 106



 Score = 27.9 bits (62), Expect = 0.48
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 64  PGKEEEEEEEEEEEEENGPQ 83
            G     +EEEEEEEEN  Q
Sbjct: 89  SGTGHTRQEEEEEEEENEKQ 108



 Score = 27.9 bits (62), Expect = 0.52
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 62  GGPGKEEEEEEEEEEEEENGPQT 84
           G   +EEEEEEEE E+++   + 
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEA 114


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 28.9 bits (65), Expect = 0.23
 Identities = 13/33 (39%), Positives = 16/33 (48%)

Query: 50  VIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGP 82
           V    +     +    +EEEEEE EE E E GP
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGP 374



 Score = 28.1 bits (63), Expect = 0.48
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 66  KEEEEEEEEEEEEENGPQT 84
           ++EEEEEEEEE EE  P+ 
Sbjct: 354 EDEEEEEEEEEPEEPEPEE 372



 Score = 26.6 bits (59), Expect = 1.6
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query: 67  EEEEEEEEEEEEENGPQ 83
           +E+EEEEEEEEE   P+
Sbjct: 353 QEDEEEEEEEEEPEEPE 369



 Score = 25.8 bits (57), Expect = 2.9
 Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 66  KEEEEEEEEEEEEENGPQTTFE 87
           + EEEEEEEEEEE        E
Sbjct: 286 QLEEEEEEEEEEEPAERDELEE 307



 Score = 24.6 bits (54), Expect = 7.7
 Identities = 11/16 (68%), Positives = 11/16 (68%)

Query: 67  EEEEEEEEEEEEENGP 82
           EEEEE EE E EE  P
Sbjct: 360 EEEEEPEEPEPEEGPP 375


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 28.8 bits (65), Expect = 0.25
 Identities = 8/18 (44%), Positives = 10/18 (55%)

Query: 64  PGKEEEEEEEEEEEEENG 81
                  EEEEEEE++ G
Sbjct: 288 AAAAAPAEEEEEEEDDMG 305



 Score = 24.6 bits (54), Expect = 7.3
 Identities = 6/16 (37%), Positives = 6/16 (37%)

Query: 66  KEEEEEEEEEEEEENG 81
                   EEEEEE  
Sbjct: 287 AAAAAAPAEEEEEEED 302


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 28.0 bits (63), Expect = 0.30
 Identities = 8/24 (33%), Positives = 12/24 (50%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEE 79
                     K+EE++EE EEE +
Sbjct: 77  AAAAAAAAAAKKEEKKEESEEESD 100



 Score = 26.8 bits (60), Expect = 1.0
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                     ++EE++EE EEE 
Sbjct: 76 AAAAAAAAAAAKKEEKKEESEEES 99



 Score = 25.7 bits (57), Expect = 2.3
 Identities = 8/24 (33%), Positives = 12/24 (50%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEE 79
                     KEE++EE EEE ++
Sbjct: 78  AAAAAAAAAKKEEKKEESEEESDD 101



 Score = 25.7 bits (57), Expect = 2.7
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                      ++EE++EE EEE
Sbjct: 75 AAAAAAAAAAAAKKEEKKEESEEE 98


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 28.5 bits (64), Expect = 0.33
 Identities = 14/18 (77%), Positives = 14/18 (77%)

Query: 65  GKEEEEEEEEEEEEENGP 82
             EEEEEEEEEEEEE  P
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420



 Score = 28.5 bits (64), Expect = 0.41
 Identities = 15/20 (75%), Positives = 16/20 (80%)

Query: 60  VEGGPGKEEEEEEEEEEEEE 79
           VE    +EEEEEEEEEEEEE
Sbjct: 399 VERWAAEEEEEEEEEEEEEE 418



 Score = 28.1 bits (63), Expect = 0.52
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 67  EEEEEEEEEEEEENGPQTTFEV 88
             EEEEEEEEEEE   +   EV
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEV 424



 Score = 26.6 bits (59), Expect = 1.8
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 67  EEEEEEEEEEEEENGPQTTF 86
           EEEEEEEEEEEEE   +   
Sbjct: 407 EEEEEEEEEEEEEPVAEVMM 426



 Score = 25.8 bits (57), Expect = 3.5
 Identities = 12/22 (54%), Positives = 12/22 (54%)

Query: 67  EEEEEEEEEEEEENGPQTTFEV 88
           EEEEEEEEEEEE          
Sbjct: 408 EEEEEEEEEEEEPVAEVMMMPA 429


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 28.3 bits (64), Expect = 0.41
 Identities = 11/18 (61%), Positives = 12/18 (66%)

Query: 64  PGKEEEEEEEEEEEEENG 81
             +EEEEEEEEE  EE  
Sbjct: 304 EEEEEEEEEEEEPSEEEA 321



 Score = 27.1 bits (61), Expect = 0.94
 Identities = 13/14 (92%), Positives = 14/14 (100%)

Query: 66  KEEEEEEEEEEEEE 79
           +EEEEEEEEEEEEE
Sbjct: 302 EEEEEEEEEEEEEE 315



 Score = 26.8 bits (60), Expect = 1.5
 Identities = 13/17 (76%), Positives = 13/17 (76%)

Query: 66  KEEEEEEEEEEEEENGP 82
             EEEEEEEEEEEE  P
Sbjct: 300 AAEEEEEEEEEEEEEEP 316



 Score = 26.4 bits (59), Expect = 2.0
 Identities = 13/16 (81%), Positives = 13/16 (81%)

Query: 64  PGKEEEEEEEEEEEEE 79
              EEEEEEEEEEEEE
Sbjct: 299 AAAEEEEEEEEEEEEE 314



 Score = 26.0 bits (58), Expect = 2.9
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 60  VEGGPGKEEEEEEEEEEEEE 79
                 +EEEEEEEEEEEE 
Sbjct: 297 QAAAAEEEEEEEEEEEEEEP 316



 Score = 25.2 bits (56), Expect = 5.2
 Identities = 10/18 (55%), Positives = 11/18 (61%)

Query: 60  VEGGPGKEEEEEEEEEEE 77
            E    +EEEEEE  EEE
Sbjct: 303 EEEEEEEEEEEEEPSEEE 320


>gnl|CDD|178307 PLN02705, PLN02705, beta-amylase.
          Length = 681

 Score = 28.4 bits (63), Expect = 0.47
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 53 IVRGGGGVEGGPGKEEEEEEEE 74
          I  GGGG  GG GK E E+E+E
Sbjct: 65 ISSGGGGGGGGKGKREREKEKE 86


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
          [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 27.4 bits (61), Expect = 0.56
 Identities = 11/22 (50%), Positives = 12/22 (54%)

Query: 58 GGVEGGPGKEEEEEEEEEEEEE 79
          G        E EEEE+EEE EE
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEE 97



 Score = 25.4 bits (56), Expect = 2.8
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEENG 81
                E    +EEE+EEE EEE ++ 
Sbjct: 77  AEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 25.4 bits (56), Expect = 2.9
 Identities = 11/24 (45%), Positives = 12/24 (50%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
          G          EEEE+EEE EEE 
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEES 99



 Score = 25.4 bits (56), Expect = 3.1
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 62 GGPGKEEEEEEEEEEEEEN 80
          G     E +E EEEE+EE 
Sbjct: 76 GAEAAAEADEAEEEEKEEE 94


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 27.4 bits (61), Expect = 0.84
 Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 4/63 (6%)

Query: 21  NLRKEVIRNKIRAIGKMARVFSV---LRPLLSVIYIVRGGGGVE-GGPGKEEEEEEEEEE 76
            LR+  + + IR + +      +   LR   +    +R            EE +E+E+EE
Sbjct: 318 ELRRRRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEE 377

Query: 77  EEE 79
           EE+
Sbjct: 378 EEQ 380



 Score = 25.5 bits (56), Expect = 3.9
 Identities = 7/28 (25%), Positives = 11/28 (39%)

Query: 57  GGGVEGGPGKEEEEEEEEEEEEENGPQT 84
               E    + E+ EEE  +  E+G   
Sbjct: 380 QRSDEHEEEEGEDSEEEGSQSREDGSSE 407


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 27.4 bits (60), Expect = 0.88
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEE 79
           G    +     E+EEEEEEEEEEE
Sbjct: 126 GKKATKEDESDEDEEEEEEEEEEE 149



 Score = 27.4 bits (60), Expect = 0.99
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEEN 80
           G  G +     E +E+EEEEEEEE 
Sbjct: 123 GNAGKKATKEDESDEDEEEEEEEEE 147



 Score = 27.4 bits (60), Expect = 1.0
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 48  LSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEEN 80
           L+ + + +  G       KE+E +E+EEEEEE 
Sbjct: 113 LAALQLPKKAGNAGKKATKEDESDEDEEEEEEE 145



 Score = 27.0 bits (59), Expect = 1.4
 Identities = 13/28 (46%), Positives = 13/28 (46%)

Query: 54  VRGGGGVEGGPGKEEEEEEEEEEEEENG 81
           V  G G  GG   EE EEE   E E  G
Sbjct: 169 VDHGNGSSGGDNGEEGEEESVTEAEAEG 196



 Score = 25.4 bits (55), Expect = 4.7
 Identities = 12/13 (92%), Positives = 12/13 (92%)

Query: 67  EEEEEEEEEEEEE 79
           EEEEEEEEEEE E
Sbjct: 139 EEEEEEEEEEEAE 151



 Score = 25.4 bits (55), Expect = 4.8
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 55  RGGGGVEGGPGKEEEEEEEEEEEE 78
           +     E    +EEEEEEEEEE E
Sbjct: 128 KATKEDESDEDEEEEEEEEEEEAE 151



 Score = 25.0 bits (54), Expect = 6.6
 Identities = 11/14 (78%), Positives = 13/14 (92%)

Query: 66  KEEEEEEEEEEEEE 79
           ++EEEEEEEEEEE 
Sbjct: 137 EDEEEEEEEEEEEA 150



 Score = 24.7 bits (53), Expect = 8.6
 Identities = 13/19 (68%), Positives = 13/19 (68%)

Query: 67  EEEEEEEEEEEEENGPQTT 85
           EEEEEEEEE E E   Q T
Sbjct: 141 EEEEEEEEEAEVEENEQGT 159


>gnl|CDD|235323 PRK04960, PRK04960, universal stress protein UspB; Provisional.
          Length = 111

 Score = 26.9 bits (60), Expect = 0.98
 Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 16/44 (36%)

Query: 37 MARVFSVLRPLLSVIYIVRG----------GGG---VEGGPGKE 67
          MAR FS LR LL    ++RG          GGG     G P K+
Sbjct: 19 MARYFSSLRALL---VVLRGCDPLLYQYVDGGGFFTSHGQPNKQ 59


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 26.8 bits (59), Expect = 1.0
 Identities = 15/40 (37%), Positives = 19/40 (47%), Gaps = 15/40 (37%)

Query: 57  GGGVEGGPG---------------KEEEEEEEEEEEEENG 81
           GGGV                    KEE++EEEEEEE++ G
Sbjct: 68  GGGVAAAAAPAAGAATAGAKAEAKKEEKKEEEEEEEDDLG 107


>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
           This family consists of several mammalian protein
           phosphatase inhibitor 1 (IPP-1) and dopamine- and
           cAMP-regulated neuronal phosphoprotein (DARPP-32)
           proteins. Protein phosphatase inhibitor-1 is involved in
           signal transduction and is an endogenous inhibitor of
           protein phosphatase-1. It has been demonstrated that
           DARPP-32, if phosphorylated, can inhibit
           protein-phosphatase-1. DARPP-32 has a key role in many
           neurotransmitter pathways throughout the brain and has
           been shown to be involved in controlling receptors, ion
           channels and other physiological factors including the
           brain's response to drugs of abuse, such as cocaine,
           opiates and nicotine. DARPP-32 is reciprocally regulated
           by the two neurotransmitters that are most often
           implicated in schizophrenia - dopamine and glutamate.
           Dopamine activates DARPP-32 through the D1 receptor
           pathway and disables DARPP-32 through the D2 receptor.
           Glutamate, acting through the N-methyl-d-aspartate
           receptor, renders DARPP-32 inactive. A mutant form of
           DARPP-32 has been linked with gastric cancers.
          Length = 170

 Score = 26.8 bits (59), Expect = 1.1
 Identities = 9/30 (30%), Positives = 11/30 (36%)

Query: 56  GGGGVEGGPGKEEEEEEEEEEEEENGPQTT 85
                E G    E E  E ++ EE G   T
Sbjct: 109 LDSPDELGYPTGETESTEAQDSEEPGSCPT 138


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 27.0 bits (60), Expect = 1.1
 Identities = 13/20 (65%), Positives = 16/20 (80%)

Query: 60  VEGGPGKEEEEEEEEEEEEE 79
           V+    K+EEEEEEEEEE+E
Sbjct: 168 VDEEDEKDEEEEEEEEEEDE 187



 Score = 26.6 bits (59), Expect = 1.5
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 60  VEGGPGKEEEEEEEEEEEEE 79
            E     EEEEEEEEEE+E+
Sbjct: 169 DEEDEKDEEEEEEEEEEDED 188



 Score = 25.1 bits (55), Expect = 5.6
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 65  GKEEEEEEEEEEEEEN 80
             E++EEEEEEEEEE+
Sbjct: 171 EDEKDEEEEEEEEEED 186


>gnl|CDD|220587 pfam10127, Nuc-transf, Predicted nucleotidyltransferase.  Members
           of this family of bacterial proteins catalyze the
           transfer of nucleotide residues from nucleoside
           diphosphates or triphosphates into dimer or polymer
           forms.
          Length = 247

 Score = 26.9 bits (60), Expect = 1.2
 Identities = 26/85 (30%), Positives = 35/85 (41%), Gaps = 27/85 (31%)

Query: 18  AAANLRK----EVIRNKIRAIGKMARVFSVLRPLLSVIYIVRGGG-----------GVEG 62
           A    RK    E +R K        + F VLRPLL+  +I+R G            GV  
Sbjct: 133 ARKQFRKYLKGETVRLK--------KYFYVLRPLLAGRWILRTGTPPPMEFQVLLAGVLD 184

Query: 63  GPGKEEEEE----EEEEEEEENGPQ 83
           G    E +E    +   EE+E GP+
Sbjct: 185 GSLLAEIDELLARKRGGEEKEEGPR 209


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 27.0 bits (60), Expect = 1.2
 Identities = 7/19 (36%), Positives = 12/19 (63%)

Query: 64 PGKEEEEEEEEEEEEENGP 82
            K+ E ++++ EE EN P
Sbjct: 79 EEKKPESDDDKTEENENDP 97


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 27.0 bits (60), Expect = 1.2
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 57  GGGVEGGPGKEEEEEEEEEEEEENGP 82
           G  V+  P +EEEEE  EEEEE   P
Sbjct: 154 GSTVDLEPPEEEEEEIAEEEEEVKEP 179


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 26.1 bits (58), Expect = 1.5
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query: 66 KEEEEEEEEEEEEENG 81
          +EEEEEEE+EE EE  
Sbjct: 82 EEEEEEEEKEESEEEA 97



 Score = 25.7 bits (57), Expect = 2.0
 Identities = 12/14 (85%), Positives = 13/14 (92%)

Query: 66 KEEEEEEEEEEEEE 79
          KEEEEEEEE+EE E
Sbjct: 81 KEEEEEEEEKEESE 94



 Score = 25.7 bits (57), Expect = 2.2
 Identities = 10/22 (45%), Positives = 12/22 (54%)

Query: 61 EGGPGKEEEEEEEEEEEEENGP 82
                 EE++EEEEEEEE   
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEE 92



 Score = 25.7 bits (57), Expect = 2.2
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query: 64 PGKEEEEEEEEEEEEE 79
            +EEEEEE+EE EEE
Sbjct: 81 KEEEEEEEEKEESEEE 96



 Score = 25.3 bits (56), Expect = 2.8
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 61 EGGPGKEEEEEEEEEEEEE 79
               K+EEEEEEEE+EE 
Sbjct: 75 AAAEEKKEEEEEEEEKEES 93


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 26.7 bits (59), Expect = 1.6
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 61 EGGPGKEEEEEEEEEEEEENGPQT 84
          E     +EEE +EEEE+EE   +T
Sbjct: 51 EEKTTDKEEEVDEEEEKEEKKKKT 74


>gnl|CDD|215967 pfam00519, PPV_E1_C, Papillomavirus helicase.  This protein is a
           DNA helicase that is required for initiation of viral
           DNA replication. This protein forms a complex with the
           E2 protein pfam00508.
          Length = 432

 Score = 26.7 bits (60), Expect = 1.6
 Identities = 8/16 (50%), Positives = 12/16 (75%)

Query: 71  EEEEEEEEENGPQTTF 86
            ++E+EEE+  PQ TF
Sbjct: 415 SDQEDEEEDGEPQRTF 430


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 26.7 bits (59), Expect = 1.7
 Identities = 13/36 (36%), Positives = 15/36 (41%)

Query: 53  IVRGGGGVEGGPGKEEEEEEEEEEEEENGPQTTFEV 88
           +V      E    +EEEEE  E E     P   FEV
Sbjct: 397 VVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEV 432


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The
          domain is found in the primary vegetative sigma factor.
          The function of this domain is unclear and can be
          removed without loss of function.
          Length = 211

 Score = 26.4 bits (59), Expect = 1.7
 Identities = 8/16 (50%), Positives = 14/16 (87%)

Query: 67 EEEEEEEEEEEEENGP 82
          +E+E+E++EEE + GP
Sbjct: 60 DEDEDEDDEEEADLGP 75


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 26.3 bits (59), Expect = 2.3
 Identities = 15/35 (42%), Positives = 18/35 (51%)

Query: 53  IVRGGGGVEGGPGKEEEEEEEEEEEEENGPQTTFE 87
           I+ G  G      +EEEEE EEEEEEE   +    
Sbjct: 221 ILEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEA 255


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 26.1 bits (58), Expect = 2.4
 Identities = 25/80 (31%), Positives = 35/80 (43%), Gaps = 22/80 (27%)

Query: 23  RKEVIRNKIRA--------IGKMARVFSVLRPLLSVIYIVRGGGGVEGGPG-------KE 67
           +   IR+ IR+        I  MAR+FS+L  +  + Y+ R        P        KE
Sbjct: 176 QNSSIRSNIRSNKYLVSELINSMARIFSILLFITCLYYLGRI-------PSPFFTKKLKE 228

Query: 68  EEEEEEEEEEEENGPQTTFE 87
             E EE EEE +   +TT E
Sbjct: 229 TSETEEREEETDVEIETTSE 248


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 26.2 bits (57), Expect = 2.6
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 66  KEEEEEEEEEEEEENGPQ 83
           K+ +EE+EEEE EE   Q
Sbjct: 218 KQPQEEQEEEEVEEEAKQ 235



 Score = 25.8 bits (56), Expect = 2.8
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 61  EGGPGKEEEEEEEEEEEEENGPQTT 85
           E  P +E+EEEE EEE ++   Q T
Sbjct: 217 EKQPQEEQEEEEVEEEAKQEEGQGT 241


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 25.7 bits (57), Expect = 2.9
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 59  GVEGGPGKEEEEEEEEEEEEE 79
             E     ++EE+EEEE++EE
Sbjct: 108 ASEEDESDDDEEDEEEEDDEE 128



 Score = 25.7 bits (57), Expect = 3.0
 Identities = 8/18 (44%), Positives = 13/18 (72%)

Query: 67  EEEEEEEEEEEEENGPQT 84
           +EE+++E+E EEE  P  
Sbjct: 126 DEEDDDEDESEEEESPVK 143



 Score = 25.0 bits (55), Expect = 5.1
 Identities = 8/13 (61%), Positives = 13/13 (100%)

Query: 67  EEEEEEEEEEEEE 79
           EE+EEEE++EE++
Sbjct: 118 EEDEEEEDDEEDD 130



 Score = 24.6 bits (54), Expect = 6.9
 Identities = 7/13 (53%), Positives = 13/13 (100%)

Query: 67  EEEEEEEEEEEEE 79
           EEE++EE+++E+E
Sbjct: 122 EEEDDEEDDDEDE 134



 Score = 24.6 bits (54), Expect = 7.2
 Identities = 8/13 (61%), Positives = 13/13 (100%)

Query: 67  EEEEEEEEEEEEE 79
           +EE+EEEE++EE+
Sbjct: 117 DEEDEEEEDDEED 129



 Score = 24.6 bits (54), Expect = 7.7
 Identities = 7/13 (53%), Positives = 13/13 (100%)

Query: 67  EEEEEEEEEEEEE 79
           E+EEEE++EE+++
Sbjct: 119 EDEEEEDDEEDDD 131



 Score = 24.6 bits (54), Expect = 8.2
 Identities = 7/13 (53%), Positives = 13/13 (100%)

Query: 67  EEEEEEEEEEEEE 79
           EEEE++EE+++E+
Sbjct: 121 EEEEDDEEDDDED 133



 Score = 24.2 bits (53), Expect = 9.4
 Identities = 7/13 (53%), Positives = 13/13 (100%)

Query: 67  EEEEEEEEEEEEE 79
           +EEEE++EE+++E
Sbjct: 120 DEEEEDDEEDDDE 132


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 26.0 bits (57), Expect = 2.9
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 51  IYIVRGGGGVEGGPGKEEEEEEEEEEEEE 79
           + I   G   E   G+E+EEEE EEE EE
Sbjct: 73  VRISEDGEEEEVEEGEEDEEEEGEEESEE 101



 Score = 24.8 bits (54), Expect = 6.8
 Identities = 13/28 (46%), Positives = 15/28 (53%)

Query: 60  VEGGPGKEEEEEEEEEEEEENGPQTTFE 87
            E G  +E EE EE+EEEE       FE
Sbjct: 76  SEDGEEEEVEEGEEDEEEEGEEESEEFE 103


>gnl|CDD|233307 TIGR01195, oadG_fam, sodium pump decarboxylases, gamma subunit.
          This model finds the subfamily of distantly related,
          low complexity, hydrophobic small subunits of several
          related sodium ion-pumping decarboxylases. These
          include oxaloacetate decarboxylase gamma subunit and
          methylmalonyl-CoA decarboxylase delta subunit. Most
          sequences scoring between the noise and trusted cutoffs
          are eukaryotic sodium channel proteins.
          Length = 82

 Score = 25.1 bits (55), Expect = 3.0
 Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 3/43 (6%)

Query: 37 MARVFSVLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEE 79
          M  VF  L  L   IY VRG G V G     E     ++ ++ 
Sbjct: 15 MGIVFLFLSLL---IYAVRGMGKVVGRKKVPEPPASAKKSKKA 54


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 25.4 bits (56), Expect = 3.0
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENG 81
               E    +EEEEEEEEEE EE  
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 25.4 bits (56), Expect = 3.2
 Identities = 13/26 (50%), Positives = 13/26 (50%)

Query: 57 GGGVEGGPGKEEEEEEEEEEEEENGP 82
                    EEEEEEEEEEEEE   
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEE 91



 Score = 25.0 bits (55), Expect = 3.7
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                    +EEEEEEEEEEEEE
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEE 89



 Score = 24.6 bits (54), Expect = 5.8
 Identities = 12/16 (75%), Positives = 13/16 (81%)

Query: 66 KEEEEEEEEEEEEENG 81
          +EEEEEEEEE EEE  
Sbjct: 82 EEEEEEEEEESEEEAM 97



 Score = 24.2 bits (53), Expect = 7.6
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 57 GGGVEGGPGKEEEEEEEEEEEEE 79
                   +EEEEEEEEEEEE 
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEES 92


>gnl|CDD|212001 TIGR04278, viperin, antiviral radical SAM protein viperin.
          Viperin (Virus Inhibitory Protein, ER-associated,
          Iterferon-inducible) is a radical SAM enzyme found in
          human and other vertebrates. It is both induced by
          interferon and demonstrably active in blocking
          replication by several types of virus, apparently by
          modifying lipid chemistries in lipid droplets and
          membrane rafts.
          Length = 347

 Score = 25.8 bits (57), Expect = 3.1
 Identities = 12/47 (25%), Positives = 19/47 (40%)

Query: 42 SVLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQTTFEV 88
          S+L  L  +   +   G  +       E   +EEEE+ + P T   V
Sbjct: 15 SLLSLLCWLRAALWLAGSEKSRQQLRGEPTRKEEEEDPDQPTTPTSV 61


>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
           His-Domain type N23 protein tyrosine phosphatase and
           related domains.  This family contains the V-shaped (V)
           domain of mammalian His-Domain type N23 protein tyrosine
           phosphatase (HD-PTP, also known as PTPN23) and related
           domains. It belongs to the V_Alix_like superfamily which
           includes the V domains of  Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, mammalian Alix
           (apoptosis-linked gene-2 interacting protein X/ also
           known as apoptosis-linked gene-2 interacting protein 1,
           AIP1), and related domains. HD_PTP interacts with the
           ESCRT (Endosomal Sorting Complexes Required for
           Transport) system, and participates in cell migration
           and endosomal trafficking. The related Alix V-domain
           (belonging to a different family in this superfamily)
           contains a binding site, partially conserved in the
           superfamily, for the retroviral late assembly (L) domain
           YPXnL motif. The Alix V-domain is also a dimerization
           domain. In addition to the V-domain, HD_PTP also has an
           N-terminal Bro1-like domain, a proline-rich region
           (PRR), a catalytically inactive tyrosine phosphatase
           domain, and a region containing a PEST motif. Bro1-like
           domains bind components of the ESCRT-III complex,
           specifically to CHMP4 in the case of HD-PTP. The
           Bro1-like domain of HD-PTP can also bind human
           immunodeficiency virus type 1 (HIV-1) nucleocapsid.
           HD-PTP is encoded by the PTPN23 gene, a tumor suppressor
           gene candidate frequently absent in human kidney,
           breast, lung, and cervical tumors. This family also
           contains Drosophila Myopic, which promotes epidermal
           growth factor receptor (EGFR) signaling, and
           Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
           promotes Notch signaling.
          Length = 337

 Score = 25.7 bits (57), Expect = 3.1
 Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 23/80 (28%)

Query: 8   DLVLNCLISGAAANLRKEVIRNKIRAIGKMARVF----SVLRPLLSVIYIVRGGGGVEGG 63
           DLV  C    AA ++R + I+N + A+G+++ V+    ++L  + S++            
Sbjct: 59  DLVERC----AALSVRPDTIKNLVEAMGELSDVYQDVEAMLNEIESLL------------ 102

Query: 64  PGKEEEEEEEEEEEEENGPQ 83
              EEEE +E+E +E  G +
Sbjct: 103 ---EEEELQEKEFQEAVGKR 119


>gnl|CDD|165509 PHA03250, PHA03250, UL35; Provisional.
          Length = 564

 Score = 25.9 bits (57), Expect = 3.3
 Identities = 10/37 (27%), Positives = 14/37 (37%)

Query: 47  LLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQ 83
           LL  +Y +R      G     E +E  E E +   P 
Sbjct: 342 LLLRLYNLRDATHCAGLDDDLEGQESAELEVDPTSPY 378


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 25.7 bits (56), Expect = 3.4
 Identities = 11/15 (73%), Positives = 14/15 (93%)

Query: 66  KEEEEEEEEEEEEEN 80
           +E++EEEE EEEEEN
Sbjct: 336 QEQQEEEENEEEEEN 350



 Score = 24.9 bits (54), Expect = 7.1
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 67  EEEEEEEEEEEEENG 81
           +E++EEEE EEEE  
Sbjct: 336 QEQQEEEENEEEEEN 350


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 25.4 bits (56), Expect = 3.5
 Identities = 9/29 (31%), Positives = 10/29 (34%)

Query: 57  GGGVEGGPGKEEEEEEEEEEEEENGPQTT 85
           G G   GP  E   E E  E+E       
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEK 145


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 25.7 bits (56), Expect = 3.6
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 55  RGGGGVEGGPGKEEEEEEEEEEEEENGP 82
           RG G  +    +E  EE EE+E +   P
Sbjct: 297 RGRGRSQNRAERERSEESEEKELDRASP 324


>gnl|CDD|237798 PRK14714, PRK14714, DNA polymerase II large subunit; Provisional.
          Length = 1337

 Score = 25.4 bits (56), Expect = 4.2
 Identities = 8/28 (28%), Positives = 14/28 (50%)

Query: 60  VEGGPGKEEEEEEEEEEEEENGPQTTFE 87
           ++G    + +  EE EEEE +     +E
Sbjct: 277 IDGTIEGKSDAGEEGEEEELSTDADEYE 304


>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
          Provisional.
          Length = 134

 Score = 25.2 bits (55), Expect = 4.3
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 13/57 (22%)

Query: 29 NKIRAIGKMARVFS--VLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQ 83
          N IR +  MAR FS  VL  +L    +V            +E  EEEE+++ E   +
Sbjct: 9  NNIRTLRAMAREFSIDVLEEMLEKFRVV-----------TKERREEEEQQQRELAER 54


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This
          family consist of exonuclease VII, large subunit
          EC:3.1.11.6 This enzyme catalyzes exonucleolytic
          cleavage in either 5'->3' or 3'->5' direction to yield
          5'-phosphomononucleotides. This exonuclease VII enzyme
          is composed of one large subunit and 4 small ones.
          Length = 295

 Score = 25.3 bits (56), Expect = 4.5
 Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 9/60 (15%)

Query: 4  FYLTDLVL-NCLISGAAANLRKEVIRNKIRAIGKMARVFSVLRPLLSVIYIVRGGGGVEG 62
            L ++ +   L+ G  A   + ++   +    +             VI I+RGGG  E 
Sbjct: 39 GPLVEIEIYPTLVQGDGA--AESIVSA-LERANERETAKDY-----DVIVIIRGGGSKED 90


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 25.3 bits (56), Expect = 4.5
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 52  YIVRGGGGVEGGPGKEEEEEEEEEEEEENG 81
           +          G G+E+EEEEE+  ++E+ 
Sbjct: 306 FEPDDDDNFGLGQGEEDEEEEEDGVDDEDE 335


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 25.2 bits (55), Expect = 4.6
 Identities = 9/14 (64%), Positives = 12/14 (85%)

Query: 68  EEEEEEEEEEEENG 81
           +EEEEEEE++  NG
Sbjct: 159 QEEEEEEEQDAANG 172


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 25.1 bits (56), Expect = 5.0
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 61  EGGPGKEEEEEEEEEEEEENG 81
             G   EE +++E+EEEEE+ 
Sbjct: 184 HVGSELEELDDDEDEEEEEDE 204



 Score = 24.4 bits (54), Expect = 9.5
 Identities = 8/21 (38%), Positives = 14/21 (66%)

Query: 61  EGGPGKEEEEEEEEEEEEENG 81
            G   +E +++E+EEEEE+  
Sbjct: 185 VGSELEELDDDEDEEEEEDEN 205


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
          domain.  The centromere protein B (CENP-B) dimerisation
          domain is composed of two alpha-helices, which are
          folded into an antiparallel configuration. Dimerisation
          of CENP-B is mediated by this domain, in which monomers
          dimerise to form a symmetrical, antiparallel,
          four-helix bundle structure with a large hydrophobic
          patch in which 23 residues of one monomer form van der
          Waals contacts with the other monomer. This CENP-B
          dimer configuration may be suitable for capturing two
          distant CENP-B boxes during centromeric heterochromatin
          formation.
          Length = 101

 Score = 24.7 bits (53), Expect = 5.0
 Identities = 6/21 (28%), Positives = 15/21 (71%)

Query: 60 VEGGPGKEEEEEEEEEEEEEN 80
          +EG    + + +EEE++++E+
Sbjct: 6  LEGEEDSDSDSDEEEDDDDED 26


>gnl|CDD|218187 pfam04637, Herpes_pp85, Herpesvirus phosphoprotein 85 (HHV6-7
           U14/HCMV UL25).  This family includes UL25 proteins from
           HCMV, as well as U14 proteins from HHV 6 and HHV7. These
           85 kD phosphoproteins appear to act as structural
           antigens, but their precise function is otherwise
           unknown.
          Length = 502

 Score = 25.4 bits (56), Expect = 5.0
 Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 5/44 (11%)

Query: 40  VFSVLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQ 83
           V   LR LL+ +  +R         G + + E ++ E +  GP 
Sbjct: 318 VRFALRLLLNRLSRLRDAEA-----GLDSDSESDDLELDPTGPY 356


>gnl|CDD|234635 PRK00104, scpA, segregation and condensation protein A; Reviewed.
          Length = 242

 Score = 25.2 bits (56), Expect = 5.1
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 64 PGKEEEEEEEEEEEEE 79
          P  EE EEEEE+  +E
Sbjct: 75 PKVEEFEEEEEDPRQE 90


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 25.2 bits (56), Expect = 5.3
 Identities = 8/16 (50%), Positives = 12/16 (75%), Gaps = 1/16 (6%)

Query: 72  EEEEEEEENG-PQTTF 86
            ++E+E E+G PQ TF
Sbjct: 587 SDQEDEGEDGEPQRTF 602


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The
          CAF-1 or chromatin assembly factor-1 consists of three
          subunits, and this is the first, or A. The A domain is
          uniquely required for the progression of S phase in
          mouse cells, independent of its ability to promote
          histone deposition but dependent on its ability to
          interact with HP1 - heterochromatin protein 1-rich
          heterochromatin domains next to centromeres that are
          crucial for chromosome segregation during mitosis. This
          HP1-CAF-1 interaction module functions as a built-in
          replication control for heterochromatin, which, like a
          control barrier, has an impact on S-phase progression
          in addition to DNA-based checkpoints.
          Length = 76

 Score = 24.5 bits (54), Expect = 5.3
 Identities = 6/13 (46%), Positives = 13/13 (100%)

Query: 67 EEEEEEEEEEEEE 79
          E+EE+EEE+++++
Sbjct: 60 EDEEDEEEDDDDD 72



 Score = 24.1 bits (53), Expect = 7.0
 Identities = 6/15 (40%), Positives = 14/15 (93%)

Query: 67 EEEEEEEEEEEEENG 81
          +EE+EEE+++++ +G
Sbjct: 61 DEEDEEEDDDDDMDG 75


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 25.3 bits (56), Expect = 5.5
 Identities = 7/17 (41%), Positives = 10/17 (58%)

Query: 67  EEEEEEEEEEEEENGPQ 83
            + E+EE EEE+   P 
Sbjct: 161 GKSEKEEPEEEKTPAPD 177


>gnl|CDD|226744 COG4294, Uve, UV damage repair endonuclease [DNA replication,
           recombination, and repair].
          Length = 347

 Score = 25.2 bits (55), Expect = 5.5
 Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 3/44 (6%)

Query: 23  RKEVIRNKIRAIGKMARVFSVLR-PLLSVIYIVRGG--GGVEGG 63
           R+EV+ + IR +    R+   +     SV  +  GG  GG +  
Sbjct: 146 REEVVDSSIRDLAYHYRILDGMGLAERSVWNLHLGGTHGGKKER 189


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 25.3 bits (55), Expect = 5.6
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query: 66  KEEEEEEEEEEEEEN 80
            EE E EEE E+ E+
Sbjct: 380 PEESEHEEEVEDYED 394


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
          family is conserved in dsDNA adenoviruses of
          vertebrates. The function is not known.
          Length = 172

 Score = 24.9 bits (54), Expect = 5.7
 Identities = 9/19 (47%), Positives = 10/19 (52%)

Query: 67 EEEEEEEEEEEEENGPQTT 85
          EE+EE EE EEE       
Sbjct: 47 EEDEEAEEVEEETAASSKA 65



 Score = 24.5 bits (53), Expect = 7.5
 Identities = 9/13 (69%), Positives = 11/13 (84%)

Query: 67 EEEEEEEEEEEEE 79
          +EE+EE EE EEE
Sbjct: 46 DEEDEEAEEVEEE 58


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 25.1 bits (55), Expect = 6.0
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 36  KMARVFSVLRPLLSVIYIVRGGGGVEGGPG 65
           K   V SV  P+++ +Y   GGG   G PG
Sbjct: 597 KQKEVESVCNPIMTKMYQAAGGGMPGGMPG 626


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 25.0 bits (54), Expect = 6.9
 Identities = 6/25 (24%), Positives = 15/25 (60%)

Query: 61  EGGPGKEEEEEEEEEEEEENGPQTT 85
           +   G + + EE++++ EE   ++T
Sbjct: 107 QDTKGSKTDSEEDDDDSEEEDNKST 131


>gnl|CDD|114257 pfam05525, Branch_AA_trans, Branched-chain amino acid transport
          protein.  This family consists of several bacterial
          branched-chain amino acid transport proteins which are
          responsible for the transport of leucine, isoleucine
          and valine via proton motive force.
          Length = 428

 Score = 24.9 bits (55), Expect = 7.1
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 43 VLRPLLSVIYIVRGGGGVEG 62
          V  PLL +I + + GGG+E 
Sbjct: 48 VGLPLLGIIAVAKSGGGLES 67


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 24.9 bits (54), Expect = 7.2
 Identities = 11/14 (78%), Positives = 12/14 (85%)

Query: 66  KEEEEEEEEEEEEE 79
            EEEE E+EEEEEE
Sbjct: 462 SEEEEGEDEEEEEE 475



 Score = 24.5 bits (53), Expect = 9.8
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 66  KEEEEEEEEEEEEEN 80
           +E EEEE E+EEEE 
Sbjct: 460 QESEEEEGEDEEEEE 474


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 25.0 bits (54), Expect = 7.2
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 64  PGKEEEEEEEEEEEEENGPQ 83
           P  E EEE+EEEE E   P+
Sbjct: 96  PDTEAEEEDEEEEIEAPDPE 115


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 24.6 bits (54), Expect = 7.3
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 66  KEEEEEEEEEEEEENGPQT 84
           ++++E EE E +  NGP+ 
Sbjct: 153 EKDQEPEESETKLSNGPKV 171


>gnl|CDD|180650 PRK06665, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 627

 Score = 25.0 bits (55), Expect = 7.4
 Identities = 8/22 (36%), Positives = 12/22 (54%)

Query: 17  GAAANLRKEVIRNKIRAIGKMA 38
           GA   LR   IR++I  +  +A
Sbjct: 283 GALIELRDTDIRDEINELDTLA 304


>gnl|CDD|215973 pfam00527, E7, E7 protein, Early protein. 
          Length = 92

 Score = 24.1 bits (53), Expect = 7.7
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 67 EEEEEEEEEEEEENGPQTT 85
          + EEEEE+EE+    P   
Sbjct: 29 DSEEEEEDEEDRGQPPAQQ 47


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 24.5 bits (53), Expect = 7.8
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 58  GGVEGGPGKEEEEEEEEEEEEENGPQTT 85
           GG +   G E + E E  E E+  P+ T
Sbjct: 108 GGEKPDQGPEAKGEGEGHEPEDPPPEDT 135


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 24.7 bits (54), Expect = 8.1
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 65  GKEEEEEEEEEEEEE 79
            K+EEEEEEEE+E  
Sbjct: 180 EKKEEEEEEEEDEAL 194


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This
          is a family of uncharacterized proteins.
          Length = 449

 Score = 24.6 bits (53), Expect = 8.8
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 65 GKEEEEEEEEEEEEE 79
          GKE EEE  EEE+++
Sbjct: 42 GKEAEEEAMEEEDDD 56



 Score = 24.6 bits (53), Expect = 8.9
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 63 GPGKEEEEEEEEEEEEENG 81
          G   EEE  EEE+++EE+ 
Sbjct: 42 GKEAEEEAMEEEDDDEEDD 60


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 24.4 bits (53), Expect = 9.4
 Identities = 7/19 (36%), Positives = 12/19 (63%)

Query: 61 EGGPGKEEEEEEEEEEEEE 79
          E    + +E+EEE+ E+E 
Sbjct: 60 EDAGEEGDEDEEEQGEDEH 78


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 24.2 bits (53), Expect = 9.4
 Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 52  YIVRGGGGVEGGPGKEEEEEEEEEEEEEN 80
            +       E    +E E EE +EEE+ +
Sbjct: 93  LLGLDKKEKEEEEEEEVEVEELDEEEQID 121


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and
          L12p. Ribosomal proteins P1 and P2 are the eukaryotic
          proteins that are functionally equivalent to bacterial
          L7/L12. L12p is the archaeal homolog. Unlike other
          ribosomal proteins, the archaeal L12p and eukaryotic P1
          and P2 do not share sequence similarity with their
          bacterial counterparts. They are part of the ribosomal
          stalk (called the L7/L12 stalk in bacteria), along with
          28S rRNA and the proteins L11 and P0 in eukaryotes (23S
          rRNA, L11, and L10e in archaea). In bacterial
          ribosomes, L7/L12 homodimers bind the extended
          C-terminal helix of L10 to anchor the L7/L12 molecules
          to the ribosome. Eukaryotic P1/P2 heterodimers and
          archaeal L12p homodimers are believed to bind the L10
          equivalent proteins, eukaryotic P0 and archaeal L10e,
          in a similar fashion. P1 and P2 (L12p, L7/L12) are the
          only proteins in the ribosome to occur as multimers,
          always appearing as sets of dimers. Recent data
          indicate that most archaeal species contain six copies
          of L12p (three homodimers), while eukaryotes have two
          copies each of P1 and P2 (two heterodimers). Bacteria
          may have four or six copies (two or three homodimers),
          depending on the species. As in bacteria, the stalk is
          crucial for binding of initiation, elongation, and
          release factors in eukaryotes and archaea.
          Length = 105

 Score = 24.1 bits (52), Expect = 9.8
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
                    K EE +EEEEEEE+
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEED 97


>gnl|CDD|220077 pfam08938, DUF1916, Domain of unknown function (DUF1916).  This
          domain is found in various eukaryotic HBS1-like
          proteins.
          Length = 148

 Score = 24.3 bits (53), Expect = 9.9
 Identities = 8/13 (61%), Positives = 11/13 (84%)

Query: 67 EEEEEEEEEEEEE 79
          EEE+E+E E+E E
Sbjct: 57 EEEDEDEYEDENE 69


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.137    0.386 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,826,499
Number of extensions: 426621
Number of successful extensions: 2817
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2215
Number of HSP's successfully gapped: 336
Length of query: 88
Length of database: 10,937,602
Length adjustment: 56
Effective length of query: 32
Effective length of database: 8,453,778
Effective search space: 270520896
Effective search space used: 270520896
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 53 (24.3 bits)