RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17102
         (412 letters)



>gnl|CDD|144376 pfam00751, DM, DM DNA binding domain.  The DM domain is named after
           dsx and mab-3. dsx contains a single amino-terminal DM
           domain, whereas mab-3 contains two amino-terminal
           domains. The DM domain has a pattern of conserved zinc
           chelating residues C2H2C4. The dsx DM domain has been
           shown to dimerise and bind palindromic DNA.
          Length = 46

 Score =  103 bits (258), Expect = 1e-27
 Identities = 36/46 (78%), Positives = 41/46 (89%)

Query: 56  TPKCARCRNHGVVSALKGHKRYCRWRDCVCAKCTLIAERQRVMAAQ 101
           +PKCARCRNHGV+S LKGHKRYC +RDC C KC L+AERQR+MAAQ
Sbjct: 1   SPKCARCRNHGVLSPLKGHKRYCPYRDCTCEKCALVAERQRLMAAQ 46


>gnl|CDD|214606 smart00301, DM, Doublesex DNA-binding motif. 
          Length = 54

 Score = 96.7 bits (241), Expect = 3e-25
 Identities = 29/54 (53%), Positives = 38/54 (70%)

Query: 55  RTPKCARCRNHGVVSALKGHKRYCRWRDCVCAKCTLIAERQRVMAAQVALRRQQ 108
           R P C +C NHGV   LKGHK  C +RDC C KCTL+ +R+ +MA Q+ L+R+Q
Sbjct: 1   RIPYCQKCENHGVKVPLKGHKPECPFRDCECEKCTLVEKRRALMALQIKLKREQ 54


>gnl|CDD|112299 pfam03474, DMA, DMRTA motif.  This region is found to the
           C-terminus of the pfam00751. DM-domain proteins with
           this motif are known as DMRTA proteins. The function of
           this region is unknown.
          Length = 39

 Score = 60.1 bits (146), Expect = 3e-12
 Identities = 25/37 (67%), Positives = 32/37 (86%)

Query: 293 RSPVDVLRRVFPNRRRGEVETVLQRCKGDVLQAIEMM 329
           RSP+DVL ++FPNR+R  +E VL+RCKGDVLQAIE+ 
Sbjct: 2   RSPIDVLMKIFPNRKRSVLELVLKRCKGDVLQAIELA 38


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 34.9 bits (81), Expect = 0.013
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEE 24
           A  KK+EEEEEEE+E+ EEE   
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 33.4 bits (77), Expect = 0.050
 Identities = 13/21 (61%), Positives = 17/21 (80%)

Query: 1  EARRKKKEEEEEEEEEKQEEE 21
           A  +KKEEEEEEEE+++ EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEE 95



 Score = 31.8 bits (73), Expect = 0.14
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEE 24
           A   ++++EEEEEEE++EE  EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 31.5 bits (72), Expect = 0.24
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 6  KKEEEEEEEEEKQEEEGEEKKDG 28
             EE++EEEE++EE+ E +++ 
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEA 97



 Score = 28.8 bits (65), Expect = 1.7
 Identities = 9/22 (40%), Positives = 16/22 (72%)

Query: 1   EARRKKKEEEEEEEEEKQEEEG 22
           E + +++EEEE+EE E++   G
Sbjct: 79  EKKEEEEEEEEKEESEEEAAAG 100



 Score = 28.8 bits (65), Expect = 1.8
 Identities = 7/20 (35%), Positives = 14/20 (70%)

Query: 8  EEEEEEEEEKQEEEGEEKKD 27
              EE++E++EEE E+++ 
Sbjct: 74 AAAAEEKKEEEEEEEEKEES 93



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 8  EEEEEEEEEKQEEEGEEKKD 27
                EE+K+EEE EE+K+
Sbjct: 72 AAAAAAEEKKEEEEEEEEKE 91



 Score = 28.0 bits (63), Expect = 3.4
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 8  EEEEEEEEEKQEEEGEEKKDGPT 30
                 EEK+EEE EE++   +
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEES 93


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 36.1 bits (83), Expect = 0.035
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 6   KKEEEEEEEEEKQEEEGEEKKDGPTTPKLSL 36
           ++EEEEEEEEE++EEE EE+++      LSL
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896



 Score = 35.7 bits (82), Expect = 0.053
 Identities = 16/29 (55%), Positives = 23/29 (79%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGP 29
           E   +++EEEEEEEEE++EEE EE+ + P
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEP 893



 Score = 35.0 bits (80), Expect = 0.073
 Identities = 15/27 (55%), Positives = 23/27 (85%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKD 27
           E   +++EEEEEEEEE++EEE EE+++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEEN 890



 Score = 29.6 bits (66), Expect = 3.5
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTT 31
           E   +++EEEEEEEEE++E E     + P T
Sbjct: 871 EEEEEEEEEEEEEEEEEEENEEPLSLEWPET 901



 Score = 28.8 bits (64), Expect = 6.2
 Identities = 12/20 (60%), Positives = 18/20 (90%)

Query: 8   EEEEEEEEEKQEEEGEEKKD 27
           + EEEEEEE++EEE EE+++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEE 881


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 33.5 bits (77), Expect = 0.044
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEE 24
           A  +++EEEEEEEEE++EE  EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 33.1 bits (76), Expect = 0.062
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 1  EARRKKKEEEEEEEEEKQEEEG 22
              +++EEEEEEEEE+ EEE 
Sbjct: 75 AEEEEEEEEEEEEEEEESEEEA 96



 Score = 31.6 bits (72), Expect = 0.18
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 7  KEEEEEEEEEKQEEEGEEKKD 27
             EEEEEEE++EEE EE+ +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESE 93



 Score = 30.4 bits (69), Expect = 0.54
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGE 23
          E   +++EEEEEEEEE +EE   
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAMA 98



 Score = 30.0 bits (68), Expect = 0.68
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 5  KKKEEEEEEEEEKQEEEGEEKKDG 28
             EEEEEEEEE++EEE E +++ 
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 29.6 bits (67), Expect = 0.81
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 8  EEEEEEEEEKQEEEGEEKKD 27
               EEEE++EEE EE+++
Sbjct: 71 AAAAAEEEEEEEEEEEEEEE 90



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 13/22 (59%), Positives = 18/22 (81%)

Query: 7  KEEEEEEEEEKQEEEGEEKKDG 28
          +EEEEEEEEE++EEE  E++  
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAM 97



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 9  EEEEEEEEKQEEEGEEKKDGP 29
               EEE++EEE EE+++  
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEE 91



 Score = 28.1 bits (63), Expect = 3.5
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 8  EEEEEEEEEKQEEEGEEKKD 27
              EEEEE++EEE EE+++
Sbjct: 72 AAAAEEEEEEEEEEEEEEEE 91



 Score = 27.7 bits (62), Expect = 4.6
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 1  EARRKKKEEEEEEEEEKQEEEG 22
          E   +++EEEEEEE E++   G
Sbjct: 78 EEEEEEEEEEEEEESEEEAMAG 99


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 33.6 bits (77), Expect = 0.061
 Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 2/27 (7%)

Query: 3   RRKKKEEEEEEEEE--KQEEEGEEKKD 27
             KK+EE+EEE  E  K+EE   E +D
Sbjct: 94  TSKKEEEKEEEIPEPTKEEELLGEIRD 120


>gnl|CDD|202430 pfam02845, CUE, CUE domain.  CUE domains have been shown to bind
           ubiquitin. It has been suggested that CUE domains are
           related to pfam00627 and this has been confirmed by the
           structure of the domain. CUE domains also occur in two
           protein of the IL-1 signal transduction pathway, tollip
           and TAB2.
          Length = 42

 Score = 31.3 bits (72), Expect = 0.061
 Identities = 10/33 (30%), Positives = 18/33 (54%)

Query: 296 VDVLRRVFPNRRRGEVETVLQRCKGDVLQAIEM 328
           ++ L+ +FP+     +  VLQ   G+V  AI+ 
Sbjct: 5   LETLKEMFPDLDEEVIRAVLQASNGNVEAAIDA 37


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 32.2 bits (74), Expect = 0.062
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
          E +RK++EE+E  +E ++ E  E +K G
Sbjct: 53 EKKRKQEEEKERRKEARKAERAEARKRG 80


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 33.1 bits (76), Expect = 0.063
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 5   KKKEEEEEEEEEKQEEEGEEKKDG 28
              E +E EEEEK+EE  EE  D 
Sbjct: 79  AAAEADEAEEEEKEEEAEEESDDD 102



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEE 24
            A    + EEEE+EEE +EE  ++
Sbjct: 79  AAAEADEAEEEEKEEEAEEESDDD 102



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
            A    + +E EEEE+++E E E   D 
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDDDM 103



 Score = 27.4 bits (61), Expect = 6.6
 Identities = 8/28 (28%), Positives = 14/28 (50%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
                + +E EEEE+E++ EE  +    
Sbjct: 77  AEAAAEADEAEEEEKEEEAEEESDDDML 104



 Score = 27.0 bits (60), Expect = 7.9
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 2  ARRKKKEEEEEEEEEKQEEEGEEKKD 27
          A      E   E +E +EEE EE+ +
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAE 96


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
          secreted from the skin of amphibians, including the
          opiate-like dermorphins and deltorphins, and the
          antimicrobial dermoseptins and temporins. The alignment
          for this family includes the signal peptide.
          Length = 46

 Score = 31.2 bits (71), Expect = 0.076
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 5  KKKEEEEEEEEEKQEEEGEEKK 26
          K+++EEE E+EE+ EE+ E K+
Sbjct: 25 KREDEEENEDEEEGEEQSEVKR 46



 Score = 30.4 bits (69), Expect = 0.15
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 5  KKKEEEEEEEEEKQEEEGEEKK 26
          +K+E+EEE E+E++ EE  E K
Sbjct: 24 EKREDEEENEDEEEGEEQSEVK 45


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 33.7 bits (78), Expect = 0.13
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 1   EARRKKKEEEEEEEEEKQEEEG 22
            A  +++EEEEEEEEE  EEE 
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEEA 321



 Score = 32.1 bits (74), Expect = 0.45
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
           +A   ++EEEEEEEEE++E   EE   G
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEEAAAG 324


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.0 bits (78), Expect = 0.13
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
          E  +++K+EEEE+  +K+EE  EE++      K
Sbjct: 41 EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKK 73



 Score = 29.7 bits (67), Expect = 3.4
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDGPTT 31
          E   +K+  +EEEEEEK+E++ EE+K     
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKE 58



 Score = 29.7 bits (67), Expect = 3.6
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
          E   K++++EEEE+   +EEE +E+++     K
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72



 Score = 28.2 bits (63), Expect = 10.0
 Identities = 10/30 (33%), Positives = 21/30 (70%)

Query: 5  KKKEEEEEEEEEKQEEEGEEKKDGPTTPKL 34
          K++EE+  ++EE+ +EE E+++    T K+
Sbjct: 48 KEEEEKTTDKEEEVDEEEEKEEKKKKTKKV 77


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
          includes archaeal L12p, the protein that is
          functionally equivalent to L7/L12 in bacteria and the
          P1 and P2 proteins in eukaryotes. L12p is homologous to
          P1 and P2 but is not homologous to bacterial L7/L12. It
          is located in the L12 stalk, with proteins L10, L11,
          and 23S rRNA. L12p is the only protein in the ribosome
          to occur as multimers, always appearing as sets of
          dimers. Recent data indicate that most archaeal species
          contain six copies of L12p (three homodimers), while
          eukaryotes have four copies (two heterodimers), and
          bacteria may have four or six copies (two or three
          homodimers), depending on the species. The organization
          of proteins within the stalk has been characterized
          primarily in bacteria, where L7/L12 forms either two or
          three homodimers and each homodimer binds to the
          extended C-terminal helix of L10. L7/L12 is attached to
          the ribosome through L10 and is the only ribosomal
          protein that does not directly interact with rRNA.
          Archaeal L12p is believed to function in a similar
          fashion. However, hybrid ribosomes containing the large
          subunit from E. coli with an archaeal stalk are able to
          bind archaeal and eukaryotic elongation factors but not
          bacterial elongation factors. In several mesophilic and
          thermophilic archaeal species, the binding of 23S rRNA
          to protein L11 and to the L10/L12p pentameric complex
          was found to be temperature-dependent and cooperative.
          Length = 106

 Score = 31.7 bits (72), Expect = 0.18
 Identities = 13/23 (56%), Positives = 20/23 (86%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGE 23
          E + ++KEEE+++EEEK+EEE E
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 31.7 bits (72), Expect = 0.19
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEE 24
           A  K +E+EEE+++E+++EE EE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 31.7 bits (72), Expect = 0.21
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
              + +++EEE+++EE++EEE EE   G
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 30.9 bits (70), Expect = 0.33
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 2   ARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLSLWF 38
           A    +E+ EE+EEEK++EE +E+++      L   F
Sbjct: 70  AAAAAEEKAEEKEEEKKKEEEKEEEEEEALAGLGALF 106



 Score = 29.0 bits (65), Expect = 1.4
 Identities = 12/28 (42%), Positives = 22/28 (78%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
           A  ++K EE+EEE++K+EE+ EE+++ 
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 29.0 bits (65), Expect = 1.6
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
           A  +K EE+EEE+++++E+E EE++  
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEEAL 99


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
          family includes archaebacterial L12, eukaryotic P0, P1
          and P2.
          Length = 88

 Score = 31.1 bits (71), Expect = 0.19
 Identities = 9/18 (50%), Positives = 14/18 (77%)

Query: 5  KKKEEEEEEEEEKQEEEG 22
             EEE++EEEE++EE+ 
Sbjct: 64 AAAEEEKKEEEEEEEEDD 81



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 8/18 (44%), Positives = 16/18 (88%)

Query: 5  KKKEEEEEEEEEKQEEEG 22
          +++++EEEEEEE+ ++ G
Sbjct: 67 EEEKKEEEEEEEEDDDMG 84



 Score = 28.0 bits (63), Expect = 2.3
 Identities = 8/18 (44%), Positives = 14/18 (77%)

Query: 7  KEEEEEEEEEKQEEEGEE 24
             EEE++EE++EEE ++
Sbjct: 64 AAAEEEKKEEEEEEEEDD 81



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 1  EARRKKKEEEEEEEEE 16
           A  ++K+EEEEEEEE
Sbjct: 64 AAAEEEKKEEEEEEEE 79



 Score = 27.2 bits (61), Expect = 4.1
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 6  KKEEEEEEEEEKQEEEGEEKKD 27
                 EEE+K+EEE EE+ D
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDD 81



 Score = 27.2 bits (61), Expect = 4.5
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 5  KKKEEEEEEEEEKQEEEGEEKKD 27
                 EEE++++EEE EE  D
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDDD 82



 Score = 27.2 bits (61), Expect = 5.0
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 7  KEEEEEEEEEKQEEEGEEKKDG 28
                 EEEK+EEE EE++D 
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDD 81


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 31.9 bits (73), Expect = 0.26
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKK 26
           E   +K+ E+E  E+ K+EEE  E K
Sbjct: 85  ERELRKRAEKEALEQAKKEEELREAK 110


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 32.7 bits (75), Expect = 0.39
 Identities = 13/25 (52%), Positives = 20/25 (80%)

Query: 5   KKKEEEEEEEEEKQEEEGEEKKDGP 29
           +K EEEE+E+EE++EEE E ++  P
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEP 370



 Score = 29.6 bits (67), Expect = 2.9
 Identities = 12/20 (60%), Positives = 19/20 (95%)

Query: 5   KKKEEEEEEEEEKQEEEGEE 24
           ++K+EEEE+E+E++EEE EE
Sbjct: 345 EQKDEEEEQEDEEEEEEEEE 364



 Score = 28.1 bits (63), Expect = 8.8
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 7   KEEEEEEEEEKQEEEGEEKKDGPTTPK 33
             E+++EEEE+++EE EE+++ P  P+
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPE 369


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.0 bits (73), Expect = 0.41
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 3   RRKKKEEEEEEEEEKQEEEGEEKKD 27
           ++++ EE+E EEEEK  EE + +++
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREE 179



 Score = 27.8 bits (62), Expect = 9.6
 Identities = 9/21 (42%), Positives = 16/21 (76%)

Query: 1   EARRKKKEEEEEEEEEKQEEE 21
           E + +++EE+  EEE+ +EEE
Sbjct: 160 EEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 32.0 bits (73), Expect = 0.59
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 2   ARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLSL 36
            R   +EEEEEEEEE++EEE   +      P++ L
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQL 434


>gnl|CDD|234251 TIGR03544, DivI1A_domain, DivIVA domain.  This model describes a
           domain found in Bacillus subtilis cell division
           initiation protein DivIVA, and homologs, toward the
           N-terminus. It is also found as a repeated domain in
           certain other proteins, including family TIGR03543.
          Length = 34

 Score = 28.2 bits (64), Expect = 0.60
 Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 4/32 (12%)

Query: 294 SPVDVLRRVFPNRRRG----EVETVLQRCKGD 321
           +P D+  + F  + RG    EV+  L R   D
Sbjct: 2   TPEDIRNKRFKKKLRGYDAAEVDAFLDRVADD 33


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 31.9 bits (72), Expect = 0.66
 Identities = 15/30 (50%), Positives = 18/30 (60%)

Query: 5   KKKEEEEEEEEEKQEEEGEEKKDGPTTPKL 34
           ++ E+ EE EEEK EEEG   K G    KL
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKKLKKL 344


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 31.8 bits (73), Expect = 0.75
 Identities = 10/33 (30%), Positives = 21/33 (63%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
             + +KK EEE++E++K+   G++K++     K
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448



 Score = 31.0 bits (71), Expect = 1.2
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
            A +KK+EEEEEE+E+K+EE+ EE+++     +
Sbjct: 435 FAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 30.3 bits (69), Expect = 1.9
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           E  +K+K+++    ++K+EEE EEK+      +
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 29.1 bits (66), Expect = 4.9
 Identities = 8/33 (24%), Positives = 21/33 (63%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           E +++KK++    +++++EEE E++K      +
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 29.1 bits (66), Expect = 5.1
 Identities = 16/29 (55%), Positives = 23/29 (79%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGP 29
           E  ++KKEEE+EEEEE+ EEE EE+++  
Sbjct: 445 EEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 29.1 bits (66), Expect = 5.3
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           E   ++KE++EEE+EE++EE  EEK++     K
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474



 Score = 28.7 bits (65), Expect = 5.8
 Identities = 9/33 (27%), Positives = 21/33 (63%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           + ++    +++EEEEE+++E+ EE+K+      
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 28.7 bits (65), Expect = 6.2
 Identities = 10/35 (28%), Positives = 22/35 (62%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLS 35
           + +    +++EEEEEE++E++ EEK++     +  
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 28.3 bits (64), Expect = 7.7
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 3   RRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           + +++EEE+E++EE++EEE EE ++     +
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 28.3 bits (64), Expect = 7.7
 Identities = 18/33 (54%), Positives = 23/33 (69%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
               KKKEEEEEEE+EK+EEE EE+++     K
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 31.7 bits (72), Expect = 0.83
 Identities = 35/165 (21%), Positives = 51/165 (30%), Gaps = 34/165 (20%)

Query: 124 SGAPLPPGSANPSSPGGQDNNNGALNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVD 183
             A       +PS            +L      G ++ +   SP   RS  +KK  I  D
Sbjct: 173 KPASPSSSYQSPSYSS---------SLGPVNSSGNRS-NLRSSPWALRSSGDKK-DITTD 221

Query: 184 EDSEDSLDDNMNAE-HRTRTSTNNNNTTMPK-------------SSRTPSPNQLVDPTGE 229
           E   ++    ++ E H   +S   N T                  SR  S          
Sbjct: 222 EKYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKKRS 281

Query: 230 VTKSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPM 274
              SP+PVPS +    S  + E   P          MS +S S +
Sbjct: 282 YQLSPSPVPSKQKASTSPKKGEGEPP---------NMSLESASEV 317


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 30.8 bits (70), Expect = 0.96
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEE 24
          E R KKKEEEE +E E+Q  + +E
Sbjct: 50 EEREKKKEEEERKEREEQARKEQE 73



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKK 26
          E  RK++EE EEE E+K+EEE  +++
Sbjct: 39 EGERKEEEELEEEREKKKEEEERKER 64



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEK 25
          E   +++E+++EEEE K+ EE   K
Sbjct: 46 EELEEEREKKKEEEERKEREEQARK 70



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 11/27 (40%), Positives = 22/27 (81%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKD 27
            R++++E EEE E++K+EEE +E+++
Sbjct: 40 GERKEEEELEEEREKKKEEEERKEREE 66



 Score = 28.5 bits (64), Expect = 5.2
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEE 24
          E  +KK+EEE +E EE+  +E EE
Sbjct: 51 EREKKKEEEERKEREEQARKEQEE 74


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 14/28 (50%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 1   EARRKKKEEE-EEEEEEKQEEEGEEKKD 27
            A +K+ EE  + EEEEK+ EE EE+ D
Sbjct: 93  SAYKKQLEENRKLEEEEKEREELEEEND 120


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
          protein SURF-6 is shown to be a component of the
          nucleolar matrix and has a strong binding capacity for
          nucleic acids.
          Length = 206

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 2  ARRKKKEEEEEEEEEKQEEEGEEKKD 27
          AR+K+K +E +++E+ Q+ E EE K+
Sbjct: 20 ARKKQKRKEAKKKEDAQKSEAEEVKN 45



 Score = 28.8 bits (65), Expect = 4.1
 Identities = 7/28 (25%), Positives = 19/28 (67%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
          E RR+K+E+ +  +++K++E  +++   
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQ 36



 Score = 28.4 bits (64), Expect = 5.0
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDGPT 30
          EA++K+  ++ E EE K EE   +KK  P 
Sbjct: 28 EAKKKEDAQKSEAEEVKNEENKSKKKAAPI 57


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 30.9 bits (69), Expect = 1.3
 Identities = 44/238 (18%), Positives = 73/238 (30%), Gaps = 10/238 (4%)

Query: 84  VCAKCTLIAERQRVMAAQVALRRQQAQEENEARELGLIYTSGAPLP-PGSANPSSPGGQD 142
           V    TLI  R    A     +    +  +  + +  +YT     P   +A P++P    
Sbjct: 398 VADAKTLIITRTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPP 457

Query: 143 NNNGALNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDEDSEDSLDDNMNAEHRTRT 202
            + G      +   G    +   +   + SP  +      +  S        NA   T  
Sbjct: 458 ASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQ 517

Query: 203 STNNNNTTMPKSS--RTPSPNQLVDPTGEVTKSPAPVPSPRIQEES-----SDESETVLP 255
            T++       +      +      PTG  T S     SP++ EES     +    T  P
Sbjct: 518 KTSDTPNATSPTPIVIGVTTTATSPPTG--TTSVPNATSPQVTEESPVNNTNTPVVTSAP 575

Query: 256 ENLSLPKNTPMSADSTSPMSAEHAYLYTHPYSPYQNGRSPVDVLRRVFPNRRRGEVET 313
             L+    T      +SP S +     +   +P  N  S   +L    P       E 
Sbjct: 576 SVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEE 633


>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 402

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 13/69 (18%), Positives = 20/69 (28%), Gaps = 2/69 (2%)

Query: 40  HLPNCRFLTTVHRYQRTPKCARCRNHGV--VSALKGHKRYCRWRDCVCAKCTLIAERQRV 97
            LP       +   +R  KC RC       V  +   +R  R      A   +  +    
Sbjct: 61  DLPLFEVPVYLFLRKRRYKCCRCGKRFAEKVDFVDRAQRITRRWAQAAAIALIKGKAFTD 120

Query: 98  MAAQVALRR 106
            A    + R
Sbjct: 121 TAELNLISR 129


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 16/102 (15%), Positives = 28/102 (27%), Gaps = 4/102 (3%)

Query: 176 KKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPA 235
           ++I     ED E++ ++N N      TS    +         P              S A
Sbjct: 370 RQIAEERSEDEEEAAENNDNGIVIDETSEFVRSLQKEPLEEKPENKDESVE----EISDA 425

Query: 236 PVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAE 277
                  ++E  D    +   +    K         S +  E
Sbjct: 426 EEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPSTILEE 467


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
           endocrine neoplasia type 1, is a tumour suppressor gene
           that encodes a protein called Menin which may be an
           atypical GTPase stimulated by nm23.
          Length = 618

 Score = 30.7 bits (69), Expect = 1.4
 Identities = 23/90 (25%), Positives = 34/90 (37%), Gaps = 17/90 (18%)

Query: 151 QGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTT 210
           +G R G  +  S    PSG  SP+ +            + ++N N      ++ NNNN  
Sbjct: 477 EGRRRGPRR-ESKSQEPSGGESPNPEL----------PANNNNSN------SNNNNNNGA 519

Query: 211 MPKSSRTPSPNQLVDPTGEVTKSPAPVPSP 240
             K +   + N      G  T  P PV S 
Sbjct: 520 DRKEAAATTGNATTTSNGSGTSVPLPVSSE 549


>gnl|CDD|183147 PRK11465, PRK11465, putative mechanosensitive channel protein;
           Provisional.
          Length = 741

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 219 SPNQLVDPTGEVTKSPAPVPSPRIQEESSDESETVLPENLS 259
           S  +L+D       +P   P P+I   +  E +TVL +N++
Sbjct: 61  SRKELIDQLRTAAATPPAEPVPKIVPPTLVEEKTVL-QNVT 100


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
          This family consists of several Plasmodium falciparum
          SPAM (secreted polymorphic antigen associated with
          merozoites) proteins. Variation among SPAM alleles is
          the result of deletions and amino acid substitutions in
          non-repetitive sequences within and flanking the
          alanine heptad-repeat domain. Heptad repeats in which
          the a and d position contain hydrophobic residues
          generate amphipathic alpha-helices which give rise to
          helical bundles or coiled-coil structures in proteins.
          SPAM is an example of a P. falciparum antigen in which
          a repetitive sequence has features characteristic of a
          well-defined structural element.
          Length = 164

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 14/27 (51%), Positives = 21/27 (77%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKD 27
          E  + +K+E++EEEEE+ EEE EE +D
Sbjct: 41 EDVKDEKQEDDEEEEEEDEEEIEEPED 67



 Score = 28.3 bits (63), Expect = 5.4
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 7   KEEEEEEEEEKQEEEGEEKKDGP 29
           +EEEEE+EE+  + +  EKK+  
Sbjct: 78  EEEEEEDEEDNVDLKDIEKKNIN 100


>gnl|CDD|234696 PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase;
           Reviewed.
          Length = 332

 Score = 30.1 bits (69), Expect = 2.2
 Identities = 9/56 (16%), Positives = 18/56 (32%), Gaps = 4/56 (7%)

Query: 84  VCAKCTLIAERQRVMAAQVALRRQQAQEENEARELGLIY----TSGAPLPPGSANP 135
           V A   L+  R  ++   + LR        + +E G +          +  G  + 
Sbjct: 39  VIADRALLERRAAILGLPLDLRPYSPDAPAQPQEAGTLTVLPVPLAPAVVAGQLDA 94


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 29.2 bits (66), Expect = 2.3
 Identities = 11/21 (52%), Positives = 18/21 (85%)

Query: 1  EARRKKKEEEEEEEEEKQEEE 21
          E +R++KE E+EE+E K+E+E
Sbjct: 15 EEKREEKEREKEEKERKEEKE 35



 Score = 28.0 bits (63), Expect = 4.4
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKK 26
          E  R+K+E+E +EE+EK+  +G  +K
Sbjct: 20 EKEREKEEKERKEEKEKEWGKGLVQK 45


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 27.7 bits (62), Expect = 2.4
 Identities = 8/21 (38%), Positives = 15/21 (71%)

Query: 1  EARRKKKEEEEEEEEEKQEEE 21
          E +R+++EE  + EE K+ E+
Sbjct: 33 EEKREEEEEARKREERKEREK 53



 Score = 26.1 bits (58), Expect = 7.3
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 6  KKEEEEEEEEEKQEEEGEEKK 26
          KK+E + EEE+++EEE   K+
Sbjct: 25 KKKELKAEEEKREEEEEARKR 45



 Score = 26.1 bits (58), Expect = 9.2
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKD 27
          +A  +K+EEEEE  + ++ +E E+ K 
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNKS 56


>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Flap endonuclease-1 (FEN1) is
           involved in multiple DNA metabolic pathways, including
           DNA replication processes (5' flap DNA endonuclease
           activity and double stranded DNA 5'-exonuclease
           activity) and DNA repair processes (long-patch base
           excision repair) in eukaryotes and archaea. Interaction
           between FEN1 and PCNA (Proliferating cell nuclear
           antigen) is an essential prerequisite to FEN1's DNA
           replication functionality and stimulates FEN1 nuclease
           activity by 10-50 fold. FEN1 belongs to the
           FEN1-EXO1-like family of structure-specific, 5'
           nucleases. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region (I
           domain) of variable length (approximately 45 residues in
           FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
           domain, an atypical helix-hairpin-helix-2-like region.
           Both the H3TH domain (not included here) and the helical
           arch/clamp region are involved in DNA binding. Nucleases
           within this group also have a carboxylate-rich active
           site that is involved in binding essential divalent
           metal ion cofactors (Mg2+/Mn2+).  FEN1 has a C-terminal
           extension containing residues forming the consensus
           PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
           to PCNA.
          Length = 261

 Score = 29.8 bits (68), Expect = 2.7
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEK 25
           E RR+++EE EE+ EE +EE   E+
Sbjct: 93  EKRRERREEAEEKLEEAKEEGDAEE 117


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
           (LMP2) protein.  This family consists of several
           Gammaherpesvirus latent membrane protein (LMP2)
           proteins. Epstein-Barr virus is a human Gammaherpesvirus
           that infects and establishes latency in B lymphocytes in
           vivo. The latent membrane protein 2 (LMP2) gene is
           expressed in latently infected B cells and encodes two
           protein isoforms, LMP2A and LMP2B, that are identical
           except for an additional N-terminal 119 aa cytoplasmic
           domain which is present in the LMP2A isoform. LMP2A is
           thought to play a key role in either the establishment
           or the maintenance of latency and/or the reactivation of
           productive infection from the latent state. The
           significance of LMP2B and its role in pathogenesis
           remain unclear.
          Length = 489

 Score = 29.8 bits (67), Expect = 2.7
 Identities = 14/52 (26%), Positives = 17/52 (32%), Gaps = 7/52 (13%)

Query: 129 PPGSANPSSPGGQDNNNGALNLQGNRVGGEQ-------NHSYPDSPSGNRSP 173
           P G+  P S GG D + G  N       G              D+PS    P
Sbjct: 8   PLGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPP 59


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 30.0 bits (68), Expect = 2.7
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKD 27
           E    + E   E +  KQE+EG  ++D
Sbjct: 237 EETDVEIETTSETKGTKQEQEGSTEED 263


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.0 bits (67), Expect = 2.8
 Identities = 6/22 (27%), Positives = 18/22 (81%)

Query: 8   EEEEEEEEEKQEEEGEEKKDGP 29
           +E+E+E+++ +E++ EE+++  
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEE 178



 Score = 28.5 bits (63), Expect = 7.9
 Identities = 7/21 (33%), Positives = 19/21 (90%)

Query: 7   KEEEEEEEEEKQEEEGEEKKD 27
           ++E+E++++E+ +EE EE+++
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEE 178


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLSL 36
               +++EEEEEEE  + E   EE   G   P++ +
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPEMPM 437



 Score = 29.0 bits (65), Expect = 5.7
 Identities = 12/22 (54%), Positives = 17/22 (77%)

Query: 5   KKKEEEEEEEEEKQEEEGEEKK 26
           ++  EEEEEEEE++EEE  E +
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAE 420


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 10/66 (15%), Positives = 20/66 (30%), Gaps = 1/66 (1%)

Query: 191 DDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQEESSD-E 249
           DD  N +                +S  P+       TG    +    P+   +E + +  
Sbjct: 695 DDGDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETT 754

Query: 250 SETVLP 255
           + +  P
Sbjct: 755 TGSSTP 760


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 28.3 bits (63), Expect = 3.1
 Identities = 10/17 (58%), Positives = 16/17 (94%)

Query: 6   KKEEEEEEEEEKQEEEG 22
           KKEE++EEEEE++++ G
Sbjct: 91  KKEEKKEEEEEEEDDLG 107



 Score = 28.0 bits (62), Expect = 4.6
 Identities = 9/21 (42%), Positives = 18/21 (85%)

Query: 1   EARRKKKEEEEEEEEEKQEEE 21
            A+ + K+EE++EEEE++E++
Sbjct: 85  GAKAEAKKEEKKEEEEEEEDD 105


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 6   KKEEEEEEEEEKQEEEGEE 24
             EEEE+EEEE++E+E E 
Sbjct: 389 DDEEEEDEEEEEEEDEDEG 407



 Score = 29.3 bits (66), Expect = 4.2
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEK 25
             R  ++EE+EEEEEE+ E+EG  K
Sbjct: 386 TERDDEEEEDEEEEEEEDEDEGPSK 410



 Score = 28.9 bits (65), Expect = 6.0
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 4   RKKKEEEEEEEEEKQEEEGEEKKDG 28
           R  +EEE+EEEEE+++E+    K+ 
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEH 412


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 28.8 bits (65), Expect = 3.3
 Identities = 8/26 (30%), Positives = 20/26 (76%)

Query: 8   EEEEEEEEEKQEEEGEEKKDGPTTPK 33
           EE+EEEE+++++++ +E ++  +  K
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVK 143



 Score = 28.4 bits (64), Expect = 3.8
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 8   EEEEEEEEEKQEEEGEEKKDGPTTPK 33
           +EE+EEEE+ +E++ E++ +   +P 
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEESPV 142



 Score = 27.3 bits (61), Expect = 9.2
 Identities = 9/19 (47%), Positives = 15/19 (78%)

Query: 8   EEEEEEEEEKQEEEGEEKK 26
           +EE+++E+E +EEE   KK
Sbjct: 126 DEEDDDEDESEEEESPVKK 144


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 28.0 bits (63), Expect = 3.3
 Identities = 7/16 (43%), Positives = 11/16 (68%)

Query: 1   EARRKKKEEEEEEEEE 16
             + +KKEE EEE ++
Sbjct: 86  AKKEEKKEESEEESDD 101


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 29.2 bits (66), Expect = 3.9
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 2   ARRKKKEEEEEEEEEKQEEEGEE 24
            +R+++ EE EEE EK  EEG++
Sbjct: 100 EKRRERREEAEEELEKAIEEGDD 122


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 29.3 bits (65), Expect = 3.9
 Identities = 22/118 (18%), Positives = 37/118 (31%), Gaps = 16/118 (13%)

Query: 175 EKKIKINVDEDSEDSLDDNMNAEH------RTRTSTNNNNTTMPKSSRTPSPNQLVDPTG 228
           E+   IN+   +  +     N             +TN   T+  K  +   P        
Sbjct: 47  ERLAMINITAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQP-----KQE 101

Query: 229 EVTKSPAPVPSPRIQEESSDESETVLPENLSLPK-NTPMSADSTSPMSAEHAYLYTHP 285
           E             QE+S   +E+  P+     K  TP S ++  PM +  +     P
Sbjct: 102 EQKSLNISATPAPKQEQSQTTTESTTPK----TKVTTPPSTNTPQPMQSTKSDTPQSP 155


>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
          Length = 1250

 Score = 29.6 bits (66), Expect = 4.0
 Identities = 18/56 (32%), Positives = 22/56 (39%), Gaps = 5/56 (8%)

Query: 193 NMNAEHRTRTSTNNN-----NTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQ 243
           NM A   T    NN+         P S   P P    DPT +   +P P P+P  Q
Sbjct: 896 NMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQ 951


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 28.9 bits (64), Expect = 4.1
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKD 27
           E + ++++EEEE EEE ++EEG+   D
Sbjct: 217 EKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEK 25
           EA+R + EE++ +EEE + +  E K
Sbjct: 94  EAQRARDEEKKLDEEEAKRQHEEAK 118


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 29.1 bits (66), Expect = 4.3
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGP 29
             ++ +KEE E+   +K+ E   +K  GP
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449



 Score = 28.7 bits (65), Expect = 5.6
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           E + +K+E E+   ++K E   ++ K      K
Sbjct: 422 EKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
          7 homolog A (Rrp7A) and similar proteins.  The family
          corresponds to the RRP7 domain of Rrp7A, also termed
          gastric cancer antigen Zg14, and similar proteins which
          are yeast ribosomal RNA-processing protein 7 (Rrp7p)
          homologs mainly found in Metazoans. The cellular
          function of Rrp7A remains unclear currently. Rrp7A
          harbors an N-terminal RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain), and a C-terminal RRP7
          domain.
          Length = 129

 Score = 28.0 bits (63), Expect = 4.3
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 5  KKKEEEEEEEEEKQEEEGEEKKDGPTT 31
          ++ ++EEEEE+E++E+E E  +DG  T
Sbjct: 28 EEYDKEEEEEKEEKEKEAEPDEDGWVT 54


>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 451

 Score = 29.4 bits (66), Expect = 4.4
 Identities = 13/67 (19%), Positives = 26/67 (38%), Gaps = 2/67 (2%)

Query: 215 SRTPSPNQLVDPTGEV--TKSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTS 272
           S   +  +L     ++  + S   V SP+ Q  +  E  ++L       K   +  +  S
Sbjct: 364 SPAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASPQTKKETIHQEVPS 423

Query: 273 PMSAEHA 279
           P  ++ A
Sbjct: 424 PSLSKSA 430


>gnl|CDD|225372 COG2815, COG2815, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 303

 Score = 28.9 bits (65), Expect = 4.4
 Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 5/63 (7%)

Query: 215 SRTPSPNQLVDPTGEVT----KSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADS 270
           S++PS    V P   V     K P  +  P +   + DE+ + L +   L  N+     S
Sbjct: 136 SQSPSAGTEVKPGETVKLTVSKGPETITVPDLVGMTYDEASSNL-KAAGLTVNSKEYVSS 194

Query: 271 TSP 273
             P
Sbjct: 195 DRP 197


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 28.4 bits (64), Expect = 4.4
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 4   RKKKEEEEEEEEEKQEEEGEEKK 26
           ++++EEEE E EE  EEE  ++ 
Sbjct: 101 KEEEEEEEVEVEELDEEEQIDEL 123



 Score = 27.6 bits (62), Expect = 7.4
 Identities = 8/23 (34%), Positives = 15/23 (65%)

Query: 5   KKKEEEEEEEEEKQEEEGEEKKD 27
           ++ E EE +EEE+ +E  E++  
Sbjct: 107 EEVEVEELDEEEQIDELLEKELA 129



 Score = 27.6 bits (62), Expect = 7.4
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 5   KKKEEEEEEEEEKQEEEGEEKKD 27
           K+KEEEEEEE E +E + EE+ D
Sbjct: 99  KEKEEEEEEEVEVEELDEEEQID 121


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.
          Trichoplein or mitostatin, was first defined as a
          meiosis-specific nuclear structural protein. It has
          since been linked with mitochondrial movement. It is
          associated with the mitochondrial outer membrane, and
          over-expression leads to reduction in mitochondrial
          motility whereas lack of it enhances mitochondrial
          movement. The activity appears to be mediated through
          binding the mitochondria to the actin intermediate
          filaments (IFs).
          Length = 349

 Score = 29.1 bits (66), Expect = 4.4
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 1  EARRKKKEEEEEEEEEKQEEE 21
          E  R K   EEEE E K++EE
Sbjct: 48 EEERLKALAEEEERERKRKEE 68


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 2   ARRKKKEEEEEEEEEKQEEEGEEKK 26
           AR + +EEE  E+EE++E+   ++K
Sbjct: 82  ARLEDREEERLEKEEEREKRARKRK 106


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 27.6 bits (61), Expect = 4.5
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 1   EARRKKKEEEEEEEEEKQEEEG 22
           E   K +E +EEEEEE+ E+ G
Sbjct: 80  EPAEKAEEAKEEEEEEEDEDFG 101


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 29.0 bits (64), Expect = 4.5
 Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 9/73 (12%)

Query: 125 GAPLPPGSANPSSPGGQDNNNGALNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDE 184
           GA    G + P S G + ++ G  +L+G   G ++N     SP      D++    +VD+
Sbjct: 33  GAQQDGGKSPPESKGLETSSQGTQDLKGGAAGAKEN-----SPPLPTEEDDE----DVDD 83

Query: 185 DSEDSLDDNMNAE 197
           DSE+  DD+  AE
Sbjct: 84  DSEEGDDDDGGAE 96


>gnl|CDD|219086 pfam06553, BNIP3, BNIP3.  This family consists of several mammalian
           specific BCL2/adenovirus E1B 19-kDa protein-interacting
           protein 3 or BNIP3 sequences. BNIP3 belongs to the Bcl-2
           homology 3 (BH3)-only family, a Bcl-2-related family
           possessing an atypical Bcl-2 homology 3 (BH3) domain,
           which regulates PCD from mitochondrial sites by
           selective Bcl-2/Bcl-XL interactions. BNIP3 family
           members contain a C-terminal transmembrane domain that
           is required for their mitochondrial localisation,
           homodimerisation, as well as regulation of their
           pro-apoptotic activities. BNIP3-mediated apoptosis has
           been reported to be independent of caspase activation
           and cytochrome c release and is characterized by early
           plasma membrane and mitochondrial damage, prior to the
           appearance of chromatin condensation or DNA
           fragmentation.
          Length = 197

 Score = 28.7 bits (64), Expect = 5.0
 Identities = 26/134 (19%), Positives = 44/134 (32%), Gaps = 20/134 (14%)

Query: 132 SANPSSPGGQDNNNGA-LNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDEDSEDSL 190
           SA    P  +D  NG+ + L+ +      +         + S +    K+ ++   E S 
Sbjct: 2   SAQQVPPSSEDGLNGSWVELEMSGNNASSSSPGRLEHVPSSSSNGDMEKLLLEAQRESS- 60

Query: 191 DDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQEESSDES 250
                             ++   SSR  SP +   P  E  ++   V       +   +S
Sbjct: 61  -----------------RSSSRTSSRCDSPPRSPTPPQESPQTELDVEMDS-LRDVYIQS 102

Query: 251 ETVLPENLSLPKNT 264
           E    E L   KNT
Sbjct: 103 EEEAGEKLKEVKNT 116


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 28.6 bits (64), Expect = 5.0
 Identities = 10/27 (37%), Positives = 21/27 (77%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKD 27
           E + K+ E E+ +EE++++EE EE+++
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEE 183



 Score = 27.8 bits (62), Expect = 9.2
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKD 27
           +   +K EEEEEEEEE+ E+  ++  D
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDD 195


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
           selection and in elongation by RNA polymerase II
           [Transcription].
          Length = 521

 Score = 28.9 bits (64), Expect = 5.2
 Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 2/73 (2%)

Query: 194 MNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQ-EESSDESET 252
           +N  +R R  T         +    +   +  PT  V   P+P  S  I  + +  ES  
Sbjct: 417 VNERNRRRNHTEIRLAEQRMNEEAATATPMSAPT-SVLTQPSPSLSTSIMPKLNPSESVV 475

Query: 253 VLPENLSLPKNTP 265
           V  E  S P  +P
Sbjct: 476 VASEKASSPDLSP 488


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.9 bits (65), Expect = 5.3
 Identities = 21/104 (20%), Positives = 40/104 (38%), Gaps = 6/104 (5%)

Query: 183 DEDSEDSLDDNMNAEHRTRTSTNNNNTTM---PKSSRTPSPNQLVDPTGEVTKSPAP--- 236
           DEDSED  +DN N       S++    T      +++                 P P   
Sbjct: 213 DEDSEDDEEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDA 272

Query: 237 VPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAEHAY 280
           + S  ++++++ ES +  P +      +P +A ++S  S     
Sbjct: 273 LSSLVVRKKAAPESTSQSPSSAEPTSESPQTAGNSSLSSLGDYS 316


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 29.0 bits (64), Expect = 5.5
 Identities = 41/197 (20%), Positives = 69/197 (35%), Gaps = 13/197 (6%)

Query: 174 DEKKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKS 233
           D   +     E  +DS DD +  +     S+     T PK    P   + VD        
Sbjct: 231 DGNWVWTRELEPQDDSEDDYVIPDAEI-ISSPTLEVTAPKEVEQPLQPEPVDEETVAETK 289

Query: 234 PAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAEHAYLYTHPYSPY-QNG 292
               P P    E+   S    PE+ ++ +N P     T  +      + T P  P  Q  
Sbjct: 290 AEEEPQPTQTVETKPTSA---PES-TVEENLPEINQPTQAVQPTSETISTTPVEPTDQLK 345

Query: 293 RSPVDVLRRVFPNRRRGEVETV------LQRCKGDVLQAIEM-MVYSEQLEEPRSAFSPL 345
              VD ++      +  EVE +      L     D L+ ++  +V + Q +EP  A    
Sbjct: 346 PKEVDQIQEELKKTKEIEVEELPTKKNDLVEINFDDLEELKFELVQTNQEKEPEKAVENW 405

Query: 346 VSAFQRNFASQQANRRF 362
            + +Q +  +Q     +
Sbjct: 406 ATDYQLDEPTQSNIDWY 422


>gnl|CDD|133058 cd06436, GlcNAc-1-P_transferase, N-acetyl-glucosamine transferase
           is involved in the synthesis of
           Poly-beta-1,6-N-acetyl-D-glucosamine.
           N-acetyl-glucosamine transferase is responsible for the
           synthesis of bacteria
           Poly-beta-1,6-N-acetyl-D-glucosamine (PGA).
           Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer
           that serves as an adhesion for the maintenance of
           biofilm structural stability in diverse eubacteria.
           N-acetyl-glucosamine transferase is the product of gene
           pgaC. Genetic analysis indicated that all four genes of
           the pgaABCD locus were required for the PGA production,
           pgaC being a glycosyltransferase.
          Length = 191

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 11/36 (30%)

Query: 290 QNGRSPVDVLRRVFPNRRRGEVETVLQRCKGDVLQA 325
            + R  V +LRR  PN R G         KGD L A
Sbjct: 48  TDSR--VHLLRRHLPNARTG---------KGDALNA 72


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 28.6 bits (64), Expect = 6.0
 Identities = 13/27 (48%), Positives = 19/27 (70%), Gaps = 1/27 (3%)

Query: 1   EARRKKKEEEEEEE-EEKQEEEGEEKK 26
           E +RKKKE+++ EE ++KQ  E E  K
Sbjct: 78  EEQRKKKEQQQAEELQQKQAAEQERLK 104


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.2 bits (63), Expect = 7.0
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
           E   ++ EEE EEE+E+  +E E++ +  T   
Sbjct: 85  EKEDEESEEENEEEDEESSDENEKETEEKTESN 117


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 28.7 bits (64), Expect = 7.2
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKKD 27
           E + KK+E++EE+ EE + EE +E+ +
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFE 304


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.2 bits (63), Expect = 8.3
 Identities = 14/29 (48%), Positives = 21/29 (72%), Gaps = 2/29 (6%)

Query: 1   EARRKKKEEE--EEEEEEKQEEEGEEKKD 27
           EA  K KEE+  +EE E+KQ+E+ +E K+
Sbjct: 397 EAEAKAKEEKLKQEENEKKQKEQADEDKE 425


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 28.4 bits (63), Expect = 8.4
 Identities = 39/187 (20%), Positives = 61/187 (32%), Gaps = 15/187 (8%)

Query: 100 AQVALRRQQAQEENEARELGLIYTSGAPLPPGSANPSSPGGQDNNNGALNLQGNRVGGEQ 159
            Q+ LR  + Q++ + ++        A            GGQ    G LN+QGN+ G   
Sbjct: 341 KQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQGNQGGLGA 400

Query: 160 NHSYPDSPSGNRSPDEKKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTTMPKSSRTPS 219
           N      P    SP            +  S+         +     +           PS
Sbjct: 401 NPMQQGQPGMMSSPSPVP-----QVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPS 455

Query: 220 PNQLVDPTGEVTKSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAEHA 279
           P  L+        SP+P  S     + + + + V P     P NTP  +   SP + +  
Sbjct: 456 PPALM-------PSPSPQMSQSPASQRTIQQDMVSP---GGPLNTPGQSSVNSPANPQEE 505

Query: 280 YLYTHPY 286
            LY   Y
Sbjct: 506 QLYREKY 512


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 28.0 bits (63), Expect = 8.7
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKK 26
           E  +  K  EEE +EE QE++ E+KK
Sbjct: 268 EEEKILKAAEEERQEEAQEKKEEKKK 293


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 25.2 bits (56), Expect = 8.9
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 2  ARRKKKEEEEEEEEEKQEEEGEE 24
           R  K E++ EEE++K EE  +E
Sbjct: 11 KRVWKAEQKAEEEKKKIEERRKE 33


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 28.1 bits (63), Expect = 9.3
 Identities = 15/24 (62%), Positives = 18/24 (75%)

Query: 8   EEEEEEEEEKQEEEGEEKKDGPTT 31
           EEEEEEEEE  EEE EE+++   T
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERT 329


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
          domain.  The centromere protein B (CENP-B) dimerisation
          domain is composed of two alpha-helices, which are
          folded into an antiparallel configuration. Dimerisation
          of CENP-B is mediated by this domain, in which monomers
          dimerise to form a symmetrical, antiparallel,
          four-helix bundle structure with a large hydrophobic
          patch in which 23 residues of one monomer form van der
          Waals contacts with the other monomer. This CENP-B
          dimer configuration may be suitable for capturing two
          distant CENP-B boxes during centromeric heterochromatin
          formation.
          Length = 101

 Score = 26.7 bits (58), Expect = 9.4
 Identities = 8/34 (23%), Positives = 20/34 (58%)

Query: 1  EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKL 34
          ++   ++E++++E+EE  +E+ +E  D    P  
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSF 46


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 28.3 bits (63), Expect = 9.5
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 2   ARRKKKEEEEEEEEEKQEEEGEEKKDGPTTP 32
            +R  +E +EEE EE++E    E+ +     
Sbjct: 396 VKRVVREVDEEEIEEEEEAMQPEEMEMEGFE 426


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 28.4 bits (63), Expect = 9.5
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 4   RKKKEEEEEEEEEKQEEEGEEKKDG 28
           R  +E E EEE E  E+E +  K  
Sbjct: 378 RNPEESEHEEEVEDYEDENDHSKRI 402


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 27.9 bits (62), Expect = 9.5
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 1   EARRKKKEEEEEEEEEKQEEEGEEKK 26
           +A ++ +E++++ EE K ++  E K 
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQAAEAKA 134


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.127    0.371 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,951,719
Number of extensions: 1978423
Number of successful extensions: 6637
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5986
Number of HSP's successfully gapped: 324
Length of query: 412
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 313
Effective length of database: 6,546,556
Effective search space: 2049072028
Effective search space used: 2049072028
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (26.7 bits)