RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4232
         (130 letters)



>gnl|CDD|110260 pfam01246, Ribosomal_L24e, Ribosomal protein L24e. 
          Length = 71

 Score = 90.3 bits (224), Expect = 4e-25
 Identities = 43/71 (60%), Positives = 55/71 (77%)

Query: 1  MKLGLCAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVLYRRKQK 60
          MK+ LC++SGYKIYPGHG   V+ D K F F +SKCE +F M+RNPRK+ WT LYR++ K
Sbjct: 1  MKVELCSFSGYKIYPGHGIMFVRKDSKVFRFCSSKCEKNFHMKRNPRKLAWTKLYRKQHK 60

Query: 61 KGQEEEATRKR 71
          KG+ EEA +KR
Sbjct: 61 KGEAEEAAKKR 71


>gnl|CDD|100103 cd00472, Ribosomal_L24e_L24, Ribosomal protein L24e/L24 is a
          ribosomal protein found in eukaryotes (L24) and in
          archaea (L24e, distinct from archaeal L24). L24e/L24 is
          located on the surface of the large subunit, adjacent
          to proteins L14 and L3, and near the translation factor
          binding site.  L24e/L24 appears to play a role in the
          kinetics of peptide synthesis, and may be involved in
          interactions between the large and small subunits,
          either directly or through other factors. In mouse, a
          deletion mutation in L24 has been identified as the
          cause for the belly spot and tail (Bst) mutation that
          results in disrupted pigmentation, somitogenesis and
          retinal cell fate determination.  L24 may be an
          important protein in eukaryotic reproduction:  in
          shrimp, L24 expression is elevated in the ovary,
          suggesting a role in oogenesis, and in Arabidopsis, L24
          has been proposed to have a specific function in
          gynoecium development. No protein with sequence or
          structural homology to L24e/L24 has been identified in
          bacteria, but a functionally equivalent protein may
          exist.  Bacterial L19 forms an interprotein beta sheet
          with L14 that is similar to the L24e/L14 interprotein
          beta sheet observed in the archaeal L24e structures.
          Some eukaryotic L24 proteins were initially identified
          as L30, and this alignment model contains several
          sequences called L30.
          Length = 54

 Score = 88.8 bits (221), Expect = 9e-25
 Identities = 32/54 (59%), Positives = 39/54 (72%)

Query: 1  MKLGLCAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVL 54
          MK   C++ GYKIYPGHGK  V+ DGK F F +SKCE +F  +RNPRK+ WTV 
Sbjct: 1  MKTEKCSFCGYKIYPGHGKMYVRNDGKVFRFCSSKCEKNFLRKRNPRKLKWTVA 54


>gnl|CDD|224986 COG2075, RPL24A, Ribosomal protein L24E [Translation, ribosomal
          structure and biogenesis].
          Length = 66

 Score = 76.6 bits (189), Expect = 8e-20
 Identities = 33/66 (50%), Positives = 43/66 (65%)

Query: 1  MKLGLCAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVLYRRKQK 60
          MK+ +C++ G KI PG G   V+ DGK   F +SKCE  FK+ RNPRK+ WT  YR+  K
Sbjct: 1  MKVRVCSFCGKKIEPGTGIMYVRNDGKVLRFCSSKCEKLFKLGRNPRKLKWTKKYRKMHK 60

Query: 61 KGQEEE 66
          K  +EE
Sbjct: 61 KEIKEE 66


>gnl|CDD|140068 PTZ00033, PTZ00033, 60S ribosomal protein L24; Provisional.
          Length = 125

 Score = 69.3 bits (169), Expect = 2e-16
 Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 7/128 (5%)

Query: 1   MKLGLCAYSGYKIYPGHGKTLVKV----DGKSFTFINSKCESSFKMRRNPRKVTWTVLYR 56
           M+   C +S + ++PGHG+  V           TF+  KC + +  ++NPR + WT  YR
Sbjct: 1   MRTIACEFSHFAVHPGHGRRYVPFAFLSTKPVLTFLRPKCFALYMRKKNPRFLPWTRTYR 60

Query: 57  RKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQK 116
           R  +K   +   R+R  RT K QRAIVGA ++ I   + V+  ++K  R    KA + +K
Sbjct: 61  RINRKTTTDRVQRRRAARTVKVQRAIVGADLSYI---QEVRAYVQKVDRSAKAKAVRAEK 117

Query: 117 RAQKEAKK 124
             +K AKK
Sbjct: 118 AERKAAKK 125


>gnl|CDD|179131 PRK00807, PRK00807, 50S ribosomal protein L24e; Validated.
          Length = 52

 Score = 60.8 bits (148), Expect = 1e-13
 Identities = 23/47 (48%), Positives = 31/47 (65%)

Query: 6  CAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWT 52
          C++ G +I PG GK  VK DG    F +SKCE ++K+ R PRK+ WT
Sbjct: 4  CSFCGKEIEPGTGKMYVKKDGTILYFCSSKCEKNYKLGRVPRKLKWT 50


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
          fusion protein; Provisional.
          Length = 131

 Score = 53.8 bits (129), Expect = 3e-10
 Identities = 21/63 (33%), Positives = 32/63 (50%)

Query: 6  CAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVLYRRKQKKGQEE 65
          C Y+G +I PG G   V+ DG    F++SKCE ++ + R  R + WT   R ++      
Sbjct: 7  CDYTGEEIEPGTGTMFVRKDGTVLHFVDSKCEKNYDLGREARDLEWTEAGRAEKGPAAAA 66

Query: 66 EAT 68
           A 
Sbjct: 67 TAA 69


>gnl|CDD|214799 smart00746, TRASH, metallochaperone-like domain. 
          Length = 39

 Score = 45.8 bits (109), Expect = 5e-08
 Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 6  CAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRR 44
          C++ G  IY      +V  DGK + F +SKC S FK +R
Sbjct: 1  CSFCGKDIYNPGTGIMVVNDGKVYYFCSSKCLSKFKKKR 39


>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Validated.
          Length = 339

 Score = 29.3 bits (67), Expect = 0.49
 Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 8/46 (17%)

Query: 79  QRAIVGASINDILAKRNVKPEIRKA--QREQAIKAAKEQKRAQKEA 122
           +R   GA IN+      +K  I  A  +R++ ++AA    R   E 
Sbjct: 53  ERKEAGALINE------LKQAIEAALEERKEELEAAALNARLAAET 92


>gnl|CDD|234988 PRK01889, PRK01889, GTPase RsgA; Reviewed.
          Length = 356

 Score = 28.7 bits (65), Expect = 0.71
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 101 RKAQREQAIKAAKEQKRAQKEAKK 124
           RK QREQA  AAK   R  +  +K
Sbjct: 316 RKLQREQAYNAAKLAARLARAERK 339


>gnl|CDD|216327 pfam01145, Band_7, SPFH domain / Band 7 family.  This family has
           been called SPFH, Band 7 or PHB domain. Recent
           phylogenetic analysis has shown this domain to be a
           slipin or Stomatin-like integral membrane domain
           conserved from protozoa to mammals.
          Length = 177

 Score = 28.1 bits (63), Expect = 0.96
 Identities = 11/41 (26%), Positives = 16/41 (39%)

Query: 82  IVGASINDILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEA 122
           I    I DI     +   I + Q  +      E +RA+ EA
Sbjct: 137 IEDVQITDIDPPPEIAEAIEEKQAAEQEAEEAEIERAEAEA 177


>gnl|CDD|234317 TIGR03703, plsB, glycerol-3-phosphate O-acyltransferase.  Members
           of this protein family are PlsB, glycerol-3-phosphate
           O-acyltransferase, present in E. coli and numerous
           related species. In many bacteria, PlsB is not found,
           and appears to be replaced by a two enzyme system for
           1-acyl-glycerol-3-phosphate biosynthesis, the PlsX/Y
           system [Fatty acid and phospholipid metabolism,
           Biosynthesis].
          Length = 799

 Score = 28.4 bits (64), Expect = 1.2
 Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 9/74 (12%)

Query: 59  QKKGQEEEATRK--RTRRTHKFQRAIVGASINDILAKRNV------KPEIRKAQREQAIK 110
            + G +E    K  R  R H F+R  + A   D+  +R +       P +RKA R++A  
Sbjct: 172 DEHGTDESIAHKLARVARVH-FRRQRLAAIGPDLSHRRTLFNKLLASPSVRKAIRDEAKS 230

Query: 111 AAKEQKRAQKEAKK 124
               +++A+K A K
Sbjct: 231 KKISEEKARKRALK 244


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
           [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 8/65 (12%)

Query: 60  KKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKA--QREQAIKAAKEQKR 117
           KKG   +  +K  + +   +R  VGA IN+      +K E+  A  +    ++AA   +R
Sbjct: 37  KKGVLTDLLKKLGKLSPLEERKEVGALINE------LKKEVEDAITELTPELEAAGLWER 90

Query: 118 AQKEA 122
              E 
Sbjct: 91  LAFEK 95


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 13/70 (18%), Positives = 25/70 (35%), Gaps = 6/70 (8%)

Query: 57  RKQKKGQEEEATRKRTRRTH-----KFQRAIVGASINDILAKRNVKPEIRKAQREQAIKA 111
           R+Q +  + E   +           K +     A+  ++         + KAQ EQ  +A
Sbjct: 430 RQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQE-QA 488

Query: 112 AKEQKRAQKE 121
               ++ Q E
Sbjct: 489 EANVEQLQSE 498


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 27.8 bits (62), Expect = 1.8
 Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 3/84 (3%)

Query: 44  RNPRKVTWTVLYRRKQKKGQEEEATRKRTRRTHKFQ--RAIVGASINDILAKRNVKPEIR 101
           R PR V  T   + K+    ++   R++  +   F+   A  G       A +    + +
Sbjct: 729 RAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAK 788

Query: 102 K-AQREQAIKAAKEQKRAQKEAKK 124
           K + + Q I AA + KRA K+   
Sbjct: 789 KPSAKTQKIAAATKAKRAAKKKVA 812


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 27.9 bits (62), Expect = 1.8
 Identities = 10/32 (31%), Positives = 14/32 (43%)

Query: 99   EIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
               K QR  A   +   K+  ++A K P P K
Sbjct: 1178 AREKLQRAAARGESGAAKKVSRQAPKKPAPKK 1209


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 97  KPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
           +   +KA++E+A KAA ++K      K   P  +
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452



 Score = 25.7 bits (57), Expect = 9.9
 Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 3/30 (10%)

Query: 95  NVKPEIRKAQREQAIKAAKEQKRAQKEAKK 124
           N+ P  RK  R+   K  K +K+A+KE  +
Sbjct: 405 NLSPAERKKLRK---KQRKAEKKAEKEEAE 431


>gnl|CDD|165098 PHA02730, PHA02730, ankyrin-like protein; Provisional.
          Length = 672

 Score = 27.7 bits (61), Expect = 1.8
 Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 7/78 (8%)

Query: 46  PRKVTWTVLYRRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQR 105
           P KV + +        G+ +E  R R          I G S+ DIL KRN +  +R A+ 
Sbjct: 558 PSKVKYDI-------AGRHKELKRYRADINRMKNAYISGVSMFDILFKRNRRHRLRYAKN 610

Query: 106 EQAIKAAKEQKRAQKEAK 123
              I      K  +KE  
Sbjct: 611 PTFINFVSNIKWYKKELT 628


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 27.6 bits (62), Expect = 1.9
 Identities = 12/68 (17%), Positives = 30/68 (44%), Gaps = 6/68 (8%)

Query: 63  QEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEA 122
           +E     +   + ++ ++ I  A    +  K       +KA ++   + A  ++  + EA
Sbjct: 68  EELREEYEEELKEYEAEKEIWEAEKKGLEKK------AKKAIKKGKDEEALAEELLELEA 121

Query: 123 KKPPPPTK 130
           ++P PP +
Sbjct: 122 EEPEPPLR 129


>gnl|CDD|235325 PRK04974, PRK04974, glycerol-3-phosphate acyltransferase;
           Validated.
          Length = 818

 Score = 27.5 bits (62), Expect = 2.1
 Identities = 17/77 (22%), Positives = 33/77 (42%), Gaps = 15/77 (19%)

Query: 59  QKKGQEEEATRK--RTRRTHKFQR---AIVGASINDILAKRNV------KPEIRKAQREQ 107
            + G ++   RK  R  R H F+R   A +G  +     ++ +       P I+KA  ++
Sbjct: 182 DEHGTDKRIARKLARVARVH-FRRQRLAAIGPDLPH---RQALFNKLLASPAIKKAIEDE 237

Query: 108 AIKAAKEQKRAQKEAKK 124
           A       ++A++ A  
Sbjct: 238 AKSKKISHEKARQNAIA 254


>gnl|CDD|218745 pfam05783, DLIC, Dynein light intermediate chain (DLIC).  This
           family consists of several eukaryotic dynein light
           intermediate chain proteins. The light intermediate
           chains (LICs) of cytoplasmic dynein consist of multiple
           isoforms, which undergo post-translational modification
           to produce a large number of species. DLIC1 is known to
           be involved in assembly, organisation, and function of
           centrosomes and mitotic spindles when bound to
           pericentrin. DLIC2 is a subunit of cytoplasmic dynein 2
           that may play a role in maintaining Golgi organisation
           by binding cytoplasmic dynein 2 to its Golgi-associated
           cargo.
          Length = 490

 Score = 27.1 bits (60), Expect = 2.7
 Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 4/39 (10%)

Query: 96  VKPEIRKAQREQAIKAAKEQ----KRAQKEAKKPPPPTK 130
           VKP +RK   ++ ++A  EQ    K+    AK+P  PT+
Sbjct: 332 VKPPVRKLVHDKEVQAEDEQVFLMKQQSLLAKQPATPTR 370


>gnl|CDD|236973 PRK11767, PRK11767, SpoVR family protein; Provisional.
          Length = 498

 Score = 27.1 bits (61), Expect = 3.0
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 55  YRRKQKKGQEEEATRKRTRRTH 76
           Y+R QK   +EE  R++ R  +
Sbjct: 179 YKRPQKISLQEEKARQKEREEY 200


>gnl|CDD|148767 pfam07347, CI-B14_5a, NADH:ubiquinone oxidoreductase subunit B14.5a
           (Complex I-B14.5a).  This family contains the eukaryotic
           NADH:ubiquinone oxidoreductase subunit B14.5a (Complex
           I-B14.5a) (EC:1.6.5.3). This is approximately 100
           residues long, and forms part of a multiprotein complex
           that resides on the inner mitochondrial membrane. The
           main function of the complex is the transport of
           electrons from NADH to ubiquinone, accompanied by
           translocation of protons from the mitochondrial matrix
           to the intermembrane space.
          Length = 97

 Score = 25.9 bits (57), Expect = 3.9
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 92  AKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
            +R V P I     ++A+ A KE  +A  ++KK P P K
Sbjct: 54  GRREVTPPIDIVSSQKALLAGKEAAKAAVKSKKLPTPGK 92


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 26.6 bits (58), Expect = 4.4
 Identities = 17/69 (24%), Positives = 35/69 (50%)

Query: 56   RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQ 115
            + ++ K  EE   +    +  + ++  V         ++    E++KA+ E  IKAA+E 
Sbjct: 1608 KAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA 1667

Query: 116  KRAQKEAKK 124
            K+A+++ KK
Sbjct: 1668 KKAEEDKKK 1676


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 26.7 bits (59), Expect = 4.6
 Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 9/69 (13%)

Query: 56  RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQ 115
           R+K+++ Q EE  +K+     + ++         +  +R    E +K   E A +AA +Q
Sbjct: 81  RKKKEQQQAEELQQKQAAEQERLKQ---------LEKERLAAQEQKKQAEEAAKQAALKQ 131

Query: 116 KRAQKEAKK 124
           K+A++ A K
Sbjct: 132 KQAEEAAAK 140


>gnl|CDD|224385 COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family
           exonuclease) [Defense    mechanisms].
          Length = 190

 Score = 26.2 bits (58), Expect = 4.6
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 90  ILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
              K+ V+ E+ +  RE+  +  KE +   +  K PPP  K
Sbjct: 134 PKLKKRVEVELTEELREEVERVLKEIEEILEGGKPPPPKKK 174


>gnl|CDD|237310 PRK13222, PRK13222, phosphoglycolate phosphatase; Provisional.
          Length = 226

 Score = 26.3 bits (59), Expect = 4.9
 Identities = 8/13 (61%), Positives = 8/13 (61%)

Query: 83  VGASINDILAKRN 95
           VG S NDI A R 
Sbjct: 172 VGDSRNDIQAARA 184


>gnl|CDD|140249 PTZ00222, PTZ00222, 60S ribosomal protein L7a; Provisional.
          Length = 263

 Score = 26.2 bits (57), Expect = 5.2
 Identities = 16/54 (29%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 76  HKFQRAIVGASINDILAK-RNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPP 128
           ++F + +  +S N++L   +   PE RKA+R++  K A+E+K+  K+      P
Sbjct: 77  NQFTKVLDRSSRNELLKLIKKYAPETRKARRDRLHKVAEEKKKDPKKTVSTKAP 130


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 26.1 bits (58), Expect = 5.7
 Identities = 14/81 (17%), Positives = 32/81 (39%), Gaps = 20/81 (24%)

Query: 44  RNPRKVTWTVLYRRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKA 103
              R+     + +  +++ QEE   +K  ++                  +R  + ++ K 
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKK----------------KEER--EAKLAKL 303

Query: 104 QREQAIKAAKEQKRAQKEAKK 124
             E+  K   E+K  +K+A+K
Sbjct: 304 SPEEQRK--LEEKERKKQARK 322


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 26.2 bits (58), Expect = 5.8
 Identities = 11/75 (14%), Positives = 25/75 (33%), Gaps = 2/75 (2%)

Query: 56  RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQ 115
             ++KK   +  +     +    +     + + + +A    +    KA+   A   A   
Sbjct: 205 LEERKKTLAQLNSELSADQKKLEELRANESRLKNEIAS--AEAAAAKAREAAAAAEAAAA 262

Query: 116 KRAQKEAKKPPPPTK 130
           +    EAK+     K
Sbjct: 263 RARAAEAKRTGETYK 277


>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
           (DMAP1).  DNA methylation can contribute to
           transcriptional silencing through several
           transcriptionally repressive complexes, which include
           methyl-CpG binding domain proteins (MBDs) and histone
           deacetylases (HDACs). The chief enzyme that maintains
           mammalian DNA methylation, DNMT1, can also establish a
           repressive transcription complex. The non-catalytic
           amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
           DNMT1 associated protein), and can mediate
           transcriptional repression. DMAP1 has intrinsic
           transcription repressive activity, and binds to the
           transcriptional co-repressor TSG101. DMAP1 is targeted
           to replication foci through interaction with the far N
           terminus of DNMT1 throughout S phase, whereas HDAC2
           joins DNMT1 and DMAP1 only during late S phase,
           providing a platform for how histones may become
           deacetylated in heterochromatin following replication.
          Length = 175

 Score = 26.0 bits (57), Expect = 6.2
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 3/64 (4%)

Query: 65  EEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKK 124
           E   ++R ++    Q+ I  A   D  A      E RK +++   K   ++    KE  K
Sbjct: 13  EARKKEREKKAQDLQKLITAA---DQQAAGFSTAEARKREKKLPKKKIPQKPGPSKEDSK 69

Query: 125 PPPP 128
              P
Sbjct: 70  VNVP 73


>gnl|CDD|163696 cd08065, MPN_eIF3h, Mpr1p, Pad1p N-terminal (MPN) domains without
           catalytic isopeptidase activity, found in eIF2h.
           Eukaryotic translation initiation factor 3 (eIF3)
           subunit h (eIF3h; eIF3 subunit 3; eIF3S3; eIF3-gamma;
           eIF3-p40) is an evolutionarily non-conserved subunit of
           the functional core that comprises eIF3a, eIF3b, eIF3c,
           eIF3e, eIF3f, and eIF3h, and contains the MPN domain.
           However, it lacks the canonical JAMM motif, and
           therefore does not show catalytic isopeptidase
           activity.Together with eIF3e and eIF3f, eIF3h stabilizes
           the eIF3 complex. Results suggest that eIF3h regulates
           cell growth and viability, and that over-expression of
           the gene may provide growth advantage to prostate,
           breast, and liver cancer cells. For example, EIF3h gene
           amplification is common in late-stage prostate cancer
           suggesting that it may be functionally involved in the
           progression of the disease. It has been shown that
           coamplification of MYC, a well characterized oncogene
           involved in cell growth, differentiation, and apoptosis,
           and EIF3h in patients with non-small cell lung cancer
           (NSCLC) improves survival if treated with the Epidermal
           Growth Factor Receptor Tyrosine Kinase Inhibitor
           (EGFR-TKI), Gefitinib. Plant eIF3h is implicated in
           translation of specific mRNAs.
          Length = 266

 Score = 26.1 bits (58), Expect = 6.8
 Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 4/31 (12%)

Query: 101 RKAQREQAIKAAKEQKR----AQKEAKKPPP 127
           R   R+QA      QKR    AQ+EA+   P
Sbjct: 231 RNLARQQAQIQQWLQKRKAENAQREARGEEP 261


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 26.1 bits (58), Expect = 7.4
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 91  LAKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
            A+     E R A+ + A+ AA  + +A+K A   P   K
Sbjct: 470 EARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIK 509



 Score = 25.7 bits (57), Expect = 9.9
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query: 101 RKAQREQAIKAAKEQKRAQKEAKKPPPPT 129
           +KA    AI  AK +K AQ+ A       
Sbjct: 546 KKAAVAAAIARAKAKKAAQQAANAEAEEE 574


>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
           Members of this protein family resemble the widely
           distributed ATP-dependent protease La, also called Lon
           and LonA. It resembles even more closely LonB, which is
           a LonA paralog found in genomes if and only if the
           species is capable of endospore formation (as in
           Bacillus subtilis, Clostridium tetani, and select other
           members of the Firmicutes) and expressed specifically in
           the forespore compartment. Members of this family are
           restricted to a subset of spore-forming species, and are
           very likely to participate in the program of endospore
           formation. We propose the designation LonC [Protein
           fate, Degradation of proteins, peptides, and
           glycopeptides, Cellular processes, Sporulation and
           germination].
          Length = 615

 Score = 25.9 bits (57), Expect = 7.5
 Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 3/54 (5%)

Query: 58  KQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKA 111
           K++KG E  +T K+  R  K ++  +  S   +L  R    EI    +E+AIKA
Sbjct: 115 KEEKGPENSSTLKKLERLEKLEKKKLHKSAQSLLRPRAF-SEI--VGQERAIKA 165


>gnl|CDD|233370 TIGR01353, dGTP_triPase, deoxyguanosinetriphosphate
          triphosphohydrolase, putative.  dGTP
          triphosphohydrolase (dgt) releases inorganic
          triphosphate, an unusual activity reaction product,
          from GTP. Its activity has been called limited to the
          Enterobacteriaceae, although homologous sequences are
          detected elsewhere. This finding casts doubt on whether
          the activity is shared in other species. In several of
          these other species, the homologous gene is found in an
          apparent operon with dnaG, the DNA primase gene. The
          enzyme from E. coli was shown to bind coopertatively to
          single stranded DNA. The biological role of dgt is
          unknown [Purines, pyrimidines, nucleosides, and
          nucleotides, Nucleotide and nucleoside
          interconversions].
          Length = 381

 Score = 25.8 bits (57), Expect = 8.2
 Identities = 17/63 (26%), Positives = 24/63 (38%), Gaps = 7/63 (11%)

Query: 43 RRNPRKVTWTVLYRRKQKKGQ---EEEATRKRTRRTHKFQRAIVGASI----NDILAKRN 95
           R+  ++  +  +RR Q K Q     E    RTR TH  + A VG SI            
Sbjct: 6  ERDYDRIIHSSAFRRLQDKTQVFPLAENDFVRTRLTHSLEVAQVGRSIANLIGLRYDLEL 65

Query: 96 VKP 98
           + 
Sbjct: 66 EEL 68


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 25.6 bits (57), Expect = 8.4
 Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 1/29 (3%)

Query: 97  KPEIRKAQREQAIKAAKEQKRAQKEAKKP 125
           K E++   + Q  K  K++ +A K   KP
Sbjct: 63  KKELKAWHKAQK-KKEKQEAKAAKAKSKP 90


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 25.8 bits (58), Expect = 8.5
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 80  RAIVGASINDILAKR-NVKPEIRKAQREQAIKAAKEQKRAQKEAKK 124
           R  V   + D      N  PE+ +   E+AIKAA+ + RA K+ K+
Sbjct: 347 RRFVSGVVKDAFDLWLNQNPELAEKLAEKAIKAAQARLRAAKKVKR 392


>gnl|CDD|224796 COG1884, Sbm, Methylmalonyl-CoA mutase, N-terminal domain/subunit
           [Lipid metabolism].
          Length = 548

 Score = 25.8 bits (57), Expect = 9.1
 Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 9/69 (13%)

Query: 61  KGQEEEATRKRTRRTHKFQRAIVG------ASINDILAKRNVKPEIRKAQREQAIKAAKE 114
           + +  E+  +R  R  + +R IVG          + +    V PE   A RE+ I   K+
Sbjct: 433 QREIAESAARRQARIEEGERVIVGVNEYPEEGPEEPVEILKVDPE---AVRERQIARLKK 489

Query: 115 QKRAQKEAK 123
            +  + E  
Sbjct: 490 VRAERDEEA 498


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
           production and conversion].
          Length = 194

 Score = 25.4 bits (56), Expect = 9.4
 Identities = 11/53 (20%), Positives = 20/53 (37%)

Query: 56  RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQA 108
           +R+ ++  EE   +       + QR I  A +         K EI ++  E  
Sbjct: 38  KREAEEAIEEILRKAEKEAERERQRIISSALLEARRKLLEAKEEILESVFEAV 90


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.129    0.371 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,500,251
Number of extensions: 554728
Number of successful extensions: 1527
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1368
Number of HSP's successfully gapped: 217
Length of query: 130
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 44
Effective length of database: 7,123,158
Effective search space: 313418952
Effective search space used: 313418952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 53 (24.3 bits)