RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4232
(130 letters)
>gnl|CDD|110260 pfam01246, Ribosomal_L24e, Ribosomal protein L24e.
Length = 71
Score = 90.3 bits (224), Expect = 4e-25
Identities = 43/71 (60%), Positives = 55/71 (77%)
Query: 1 MKLGLCAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVLYRRKQK 60
MK+ LC++SGYKIYPGHG V+ D K F F +SKCE +F M+RNPRK+ WT LYR++ K
Sbjct: 1 MKVELCSFSGYKIYPGHGIMFVRKDSKVFRFCSSKCEKNFHMKRNPRKLAWTKLYRKQHK 60
Query: 61 KGQEEEATRKR 71
KG+ EEA +KR
Sbjct: 61 KGEAEEAAKKR 71
>gnl|CDD|100103 cd00472, Ribosomal_L24e_L24, Ribosomal protein L24e/L24 is a
ribosomal protein found in eukaryotes (L24) and in
archaea (L24e, distinct from archaeal L24). L24e/L24 is
located on the surface of the large subunit, adjacent
to proteins L14 and L3, and near the translation factor
binding site. L24e/L24 appears to play a role in the
kinetics of peptide synthesis, and may be involved in
interactions between the large and small subunits,
either directly or through other factors. In mouse, a
deletion mutation in L24 has been identified as the
cause for the belly spot and tail (Bst) mutation that
results in disrupted pigmentation, somitogenesis and
retinal cell fate determination. L24 may be an
important protein in eukaryotic reproduction: in
shrimp, L24 expression is elevated in the ovary,
suggesting a role in oogenesis, and in Arabidopsis, L24
has been proposed to have a specific function in
gynoecium development. No protein with sequence or
structural homology to L24e/L24 has been identified in
bacteria, but a functionally equivalent protein may
exist. Bacterial L19 forms an interprotein beta sheet
with L14 that is similar to the L24e/L14 interprotein
beta sheet observed in the archaeal L24e structures.
Some eukaryotic L24 proteins were initially identified
as L30, and this alignment model contains several
sequences called L30.
Length = 54
Score = 88.8 bits (221), Expect = 9e-25
Identities = 32/54 (59%), Positives = 39/54 (72%)
Query: 1 MKLGLCAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVL 54
MK C++ GYKIYPGHGK V+ DGK F F +SKCE +F +RNPRK+ WTV
Sbjct: 1 MKTEKCSFCGYKIYPGHGKMYVRNDGKVFRFCSSKCEKNFLRKRNPRKLKWTVA 54
>gnl|CDD|224986 COG2075, RPL24A, Ribosomal protein L24E [Translation, ribosomal
structure and biogenesis].
Length = 66
Score = 76.6 bits (189), Expect = 8e-20
Identities = 33/66 (50%), Positives = 43/66 (65%)
Query: 1 MKLGLCAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVLYRRKQK 60
MK+ +C++ G KI PG G V+ DGK F +SKCE FK+ RNPRK+ WT YR+ K
Sbjct: 1 MKVRVCSFCGKKIEPGTGIMYVRNDGKVLRFCSSKCEKLFKLGRNPRKLKWTKKYRKMHK 60
Query: 61 KGQEEE 66
K +EE
Sbjct: 61 KEIKEE 66
>gnl|CDD|140068 PTZ00033, PTZ00033, 60S ribosomal protein L24; Provisional.
Length = 125
Score = 69.3 bits (169), Expect = 2e-16
Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 7/128 (5%)
Query: 1 MKLGLCAYSGYKIYPGHGKTLVKV----DGKSFTFINSKCESSFKMRRNPRKVTWTVLYR 56
M+ C +S + ++PGHG+ V TF+ KC + + ++NPR + WT YR
Sbjct: 1 MRTIACEFSHFAVHPGHGRRYVPFAFLSTKPVLTFLRPKCFALYMRKKNPRFLPWTRTYR 60
Query: 57 RKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQK 116
R +K + R+R RT K QRAIVGA ++ I + V+ ++K R KA + +K
Sbjct: 61 RINRKTTTDRVQRRRAARTVKVQRAIVGADLSYI---QEVRAYVQKVDRSAKAKAVRAEK 117
Query: 117 RAQKEAKK 124
+K AKK
Sbjct: 118 AERKAAKK 125
>gnl|CDD|179131 PRK00807, PRK00807, 50S ribosomal protein L24e; Validated.
Length = 52
Score = 60.8 bits (148), Expect = 1e-13
Identities = 23/47 (48%), Positives = 31/47 (65%)
Query: 6 CAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWT 52
C++ G +I PG GK VK DG F +SKCE ++K+ R PRK+ WT
Sbjct: 4 CSFCGKEIEPGTGKMYVKKDGTILYFCSSKCEKNYKLGRVPRKLKWT 50
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 53.8 bits (129), Expect = 3e-10
Identities = 21/63 (33%), Positives = 32/63 (50%)
Query: 6 CAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRRNPRKVTWTVLYRRKQKKGQEE 65
C Y+G +I PG G V+ DG F++SKCE ++ + R R + WT R ++
Sbjct: 7 CDYTGEEIEPGTGTMFVRKDGTVLHFVDSKCEKNYDLGREARDLEWTEAGRAEKGPAAAA 66
Query: 66 EAT 68
A
Sbjct: 67 TAA 69
>gnl|CDD|214799 smart00746, TRASH, metallochaperone-like domain.
Length = 39
Score = 45.8 bits (109), Expect = 5e-08
Identities = 16/39 (41%), Positives = 22/39 (56%)
Query: 6 CAYSGYKIYPGHGKTLVKVDGKSFTFINSKCESSFKMRR 44
C++ G IY +V DGK + F +SKC S FK +R
Sbjct: 1 CSFCGKDIYNPGTGIMVVNDGKVYYFCSSKCLSKFKKKR 39
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 29.3 bits (67), Expect = 0.49
Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 8/46 (17%)
Query: 79 QRAIVGASINDILAKRNVKPEIRKA--QREQAIKAAKEQKRAQKEA 122
+R GA IN+ +K I A +R++ ++AA R E
Sbjct: 53 ERKEAGALINE------LKQAIEAALEERKEELEAAALNARLAAET 92
>gnl|CDD|234988 PRK01889, PRK01889, GTPase RsgA; Reviewed.
Length = 356
Score = 28.7 bits (65), Expect = 0.71
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 101 RKAQREQAIKAAKEQKRAQKEAKK 124
RK QREQA AAK R + +K
Sbjct: 316 RKLQREQAYNAAKLAARLARAERK 339
>gnl|CDD|216327 pfam01145, Band_7, SPFH domain / Band 7 family. This family has
been called SPFH, Band 7 or PHB domain. Recent
phylogenetic analysis has shown this domain to be a
slipin or Stomatin-like integral membrane domain
conserved from protozoa to mammals.
Length = 177
Score = 28.1 bits (63), Expect = 0.96
Identities = 11/41 (26%), Positives = 16/41 (39%)
Query: 82 IVGASINDILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEA 122
I I DI + I + Q + E +RA+ EA
Sbjct: 137 IEDVQITDIDPPPEIAEAIEEKQAAEQEAEEAEIERAEAEA 177
>gnl|CDD|234317 TIGR03703, plsB, glycerol-3-phosphate O-acyltransferase. Members
of this protein family are PlsB, glycerol-3-phosphate
O-acyltransferase, present in E. coli and numerous
related species. In many bacteria, PlsB is not found,
and appears to be replaced by a two enzyme system for
1-acyl-glycerol-3-phosphate biosynthesis, the PlsX/Y
system [Fatty acid and phospholipid metabolism,
Biosynthesis].
Length = 799
Score = 28.4 bits (64), Expect = 1.2
Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 9/74 (12%)
Query: 59 QKKGQEEEATRK--RTRRTHKFQRAIVGASINDILAKRNV------KPEIRKAQREQAIK 110
+ G +E K R R H F+R + A D+ +R + P +RKA R++A
Sbjct: 172 DEHGTDESIAHKLARVARVH-FRRQRLAAIGPDLSHRRTLFNKLLASPSVRKAIRDEAKS 230
Query: 111 AAKEQKRAQKEAKK 124
+++A+K A K
Sbjct: 231 KKISEEKARKRALK 244
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 28.0 bits (63), Expect = 1.5
Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 8/65 (12%)
Query: 60 KKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKA--QREQAIKAAKEQKR 117
KKG + +K + + +R VGA IN+ +K E+ A + ++AA +R
Sbjct: 37 KKGVLTDLLKKLGKLSPLEERKEVGALINE------LKKEVEDAITELTPELEAAGLWER 90
Query: 118 AQKEA 122
E
Sbjct: 91 LAFEK 95
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 28.1 bits (63), Expect = 1.5
Identities = 13/70 (18%), Positives = 25/70 (35%), Gaps = 6/70 (8%)
Query: 57 RKQKKGQEEEATRKRTRRTH-----KFQRAIVGASINDILAKRNVKPEIRKAQREQAIKA 111
R+Q + + E + K + A+ ++ + KAQ EQ +A
Sbjct: 430 RQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQE-QA 488
Query: 112 AKEQKRAQKE 121
++ Q E
Sbjct: 489 EANVEQLQSE 498
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 27.8 bits (62), Expect = 1.8
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 3/84 (3%)
Query: 44 RNPRKVTWTVLYRRKQKKGQEEEATRKRTRRTHKFQ--RAIVGASINDILAKRNVKPEIR 101
R PR V T + K+ ++ R++ + F+ A G A + + +
Sbjct: 729 RAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAK 788
Query: 102 K-AQREQAIKAAKEQKRAQKEAKK 124
K + + Q I AA + KRA K+
Sbjct: 789 KPSAKTQKIAAATKAKRAAKKKVA 812
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 27.9 bits (62), Expect = 1.8
Identities = 10/32 (31%), Positives = 14/32 (43%)
Query: 99 EIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
K QR A + K+ ++A K P P K
Sbjct: 1178 AREKLQRAAARGESGAAKKVSRQAPKKPAPKK 1209
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 27.6 bits (62), Expect = 1.8
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 97 KPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
+ +KA++E+A KAA ++K K P +
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
Score = 25.7 bits (57), Expect = 9.9
Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 3/30 (10%)
Query: 95 NVKPEIRKAQREQAIKAAKEQKRAQKEAKK 124
N+ P RK R+ K K +K+A+KE +
Sbjct: 405 NLSPAERKKLRK---KQRKAEKKAEKEEAE 431
>gnl|CDD|165098 PHA02730, PHA02730, ankyrin-like protein; Provisional.
Length = 672
Score = 27.7 bits (61), Expect = 1.8
Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 7/78 (8%)
Query: 46 PRKVTWTVLYRRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQR 105
P KV + + G+ +E R R I G S+ DIL KRN + +R A+
Sbjct: 558 PSKVKYDI-------AGRHKELKRYRADINRMKNAYISGVSMFDILFKRNRRHRLRYAKN 610
Query: 106 EQAIKAAKEQKRAQKEAK 123
I K +KE
Sbjct: 611 PTFINFVSNIKWYKKELT 628
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 27.6 bits (62), Expect = 1.9
Identities = 12/68 (17%), Positives = 30/68 (44%), Gaps = 6/68 (8%)
Query: 63 QEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEA 122
+E + + ++ ++ I A + K +KA ++ + A ++ + EA
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKK------AKKAIKKGKDEEALAEELLELEA 121
Query: 123 KKPPPPTK 130
++P PP +
Sbjct: 122 EEPEPPLR 129
>gnl|CDD|235325 PRK04974, PRK04974, glycerol-3-phosphate acyltransferase;
Validated.
Length = 818
Score = 27.5 bits (62), Expect = 2.1
Identities = 17/77 (22%), Positives = 33/77 (42%), Gaps = 15/77 (19%)
Query: 59 QKKGQEEEATRK--RTRRTHKFQR---AIVGASINDILAKRNV------KPEIRKAQREQ 107
+ G ++ RK R R H F+R A +G + ++ + P I+KA ++
Sbjct: 182 DEHGTDKRIARKLARVARVH-FRRQRLAAIGPDLPH---RQALFNKLLASPAIKKAIEDE 237
Query: 108 AIKAAKEQKRAQKEAKK 124
A ++A++ A
Sbjct: 238 AKSKKISHEKARQNAIA 254
>gnl|CDD|218745 pfam05783, DLIC, Dynein light intermediate chain (DLIC). This
family consists of several eukaryotic dynein light
intermediate chain proteins. The light intermediate
chains (LICs) of cytoplasmic dynein consist of multiple
isoforms, which undergo post-translational modification
to produce a large number of species. DLIC1 is known to
be involved in assembly, organisation, and function of
centrosomes and mitotic spindles when bound to
pericentrin. DLIC2 is a subunit of cytoplasmic dynein 2
that may play a role in maintaining Golgi organisation
by binding cytoplasmic dynein 2 to its Golgi-associated
cargo.
Length = 490
Score = 27.1 bits (60), Expect = 2.7
Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 96 VKPEIRKAQREQAIKAAKEQ----KRAQKEAKKPPPPTK 130
VKP +RK ++ ++A EQ K+ AK+P PT+
Sbjct: 332 VKPPVRKLVHDKEVQAEDEQVFLMKQQSLLAKQPATPTR 370
>gnl|CDD|236973 PRK11767, PRK11767, SpoVR family protein; Provisional.
Length = 498
Score = 27.1 bits (61), Expect = 3.0
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 55 YRRKQKKGQEEEATRKRTRRTH 76
Y+R QK +EE R++ R +
Sbjct: 179 YKRPQKISLQEEKARQKEREEY 200
>gnl|CDD|148767 pfam07347, CI-B14_5a, NADH:ubiquinone oxidoreductase subunit B14.5a
(Complex I-B14.5a). This family contains the eukaryotic
NADH:ubiquinone oxidoreductase subunit B14.5a (Complex
I-B14.5a) (EC:1.6.5.3). This is approximately 100
residues long, and forms part of a multiprotein complex
that resides on the inner mitochondrial membrane. The
main function of the complex is the transport of
electrons from NADH to ubiquinone, accompanied by
translocation of protons from the mitochondrial matrix
to the intermembrane space.
Length = 97
Score = 25.9 bits (57), Expect = 3.9
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 92 AKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
+R V P I ++A+ A KE +A ++KK P P K
Sbjct: 54 GRREVTPPIDIVSSQKALLAGKEAAKAAVKSKKLPTPGK 92
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 26.6 bits (58), Expect = 4.4
Identities = 17/69 (24%), Positives = 35/69 (50%)
Query: 56 RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQ 115
+ ++ K EE + + + ++ V ++ E++KA+ E IKAA+E
Sbjct: 1608 KAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA 1667
Query: 116 KRAQKEAKK 124
K+A+++ KK
Sbjct: 1668 KKAEEDKKK 1676
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 26.7 bits (59), Expect = 4.6
Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 9/69 (13%)
Query: 56 RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQ 115
R+K+++ Q EE +K+ + ++ + +R E +K E A +AA +Q
Sbjct: 81 RKKKEQQQAEELQQKQAAEQERLKQ---------LEKERLAAQEQKKQAEEAAKQAALKQ 131
Query: 116 KRAQKEAKK 124
K+A++ A K
Sbjct: 132 KQAEEAAAK 140
>gnl|CDD|224385 COG1468, COG1468, CRISPR-associated protein Cas4 (RecB family
exonuclease) [Defense mechanisms].
Length = 190
Score = 26.2 bits (58), Expect = 4.6
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 90 ILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
K+ V+ E+ + RE+ + KE + + K PPP K
Sbjct: 134 PKLKKRVEVELTEELREEVERVLKEIEEILEGGKPPPPKKK 174
>gnl|CDD|237310 PRK13222, PRK13222, phosphoglycolate phosphatase; Provisional.
Length = 226
Score = 26.3 bits (59), Expect = 4.9
Identities = 8/13 (61%), Positives = 8/13 (61%)
Query: 83 VGASINDILAKRN 95
VG S NDI A R
Sbjct: 172 VGDSRNDIQAARA 184
>gnl|CDD|140249 PTZ00222, PTZ00222, 60S ribosomal protein L7a; Provisional.
Length = 263
Score = 26.2 bits (57), Expect = 5.2
Identities = 16/54 (29%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 76 HKFQRAIVGASINDILAK-RNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPP 128
++F + + +S N++L + PE RKA+R++ K A+E+K+ K+ P
Sbjct: 77 NQFTKVLDRSSRNELLKLIKKYAPETRKARRDRLHKVAEEKKKDPKKTVSTKAP 130
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 26.1 bits (58), Expect = 5.7
Identities = 14/81 (17%), Positives = 32/81 (39%), Gaps = 20/81 (24%)
Query: 44 RNPRKVTWTVLYRRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKA 103
R+ + + +++ QEE +K ++ +R + ++ K
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKK----------------KEER--EAKLAKL 303
Query: 104 QREQAIKAAKEQKRAQKEAKK 124
E+ K E+K +K+A+K
Sbjct: 304 SPEEQRK--LEEKERKKQARK 322
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 26.2 bits (58), Expect = 5.8
Identities = 11/75 (14%), Positives = 25/75 (33%), Gaps = 2/75 (2%)
Query: 56 RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQ 115
++KK + + + + + + + +A + KA+ A A
Sbjct: 205 LEERKKTLAQLNSELSADQKKLEELRANESRLKNEIAS--AEAAAAKAREAAAAAEAAAA 262
Query: 116 KRAQKEAKKPPPPTK 130
+ EAK+ K
Sbjct: 263 RARAAEAKRTGETYK 277
>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
(DMAP1). DNA methylation can contribute to
transcriptional silencing through several
transcriptionally repressive complexes, which include
methyl-CpG binding domain proteins (MBDs) and histone
deacetylases (HDACs). The chief enzyme that maintains
mammalian DNA methylation, DNMT1, can also establish a
repressive transcription complex. The non-catalytic
amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
DNMT1 associated protein), and can mediate
transcriptional repression. DMAP1 has intrinsic
transcription repressive activity, and binds to the
transcriptional co-repressor TSG101. DMAP1 is targeted
to replication foci through interaction with the far N
terminus of DNMT1 throughout S phase, whereas HDAC2
joins DNMT1 and DMAP1 only during late S phase,
providing a platform for how histones may become
deacetylated in heterochromatin following replication.
Length = 175
Score = 26.0 bits (57), Expect = 6.2
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 3/64 (4%)
Query: 65 EEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKK 124
E ++R ++ Q+ I A D A E RK +++ K ++ KE K
Sbjct: 13 EARKKEREKKAQDLQKLITAA---DQQAAGFSTAEARKREKKLPKKKIPQKPGPSKEDSK 69
Query: 125 PPPP 128
P
Sbjct: 70 VNVP 73
>gnl|CDD|163696 cd08065, MPN_eIF3h, Mpr1p, Pad1p N-terminal (MPN) domains without
catalytic isopeptidase activity, found in eIF2h.
Eukaryotic translation initiation factor 3 (eIF3)
subunit h (eIF3h; eIF3 subunit 3; eIF3S3; eIF3-gamma;
eIF3-p40) is an evolutionarily non-conserved subunit of
the functional core that comprises eIF3a, eIF3b, eIF3c,
eIF3e, eIF3f, and eIF3h, and contains the MPN domain.
However, it lacks the canonical JAMM motif, and
therefore does not show catalytic isopeptidase
activity.Together with eIF3e and eIF3f, eIF3h stabilizes
the eIF3 complex. Results suggest that eIF3h regulates
cell growth and viability, and that over-expression of
the gene may provide growth advantage to prostate,
breast, and liver cancer cells. For example, EIF3h gene
amplification is common in late-stage prostate cancer
suggesting that it may be functionally involved in the
progression of the disease. It has been shown that
coamplification of MYC, a well characterized oncogene
involved in cell growth, differentiation, and apoptosis,
and EIF3h in patients with non-small cell lung cancer
(NSCLC) improves survival if treated with the Epidermal
Growth Factor Receptor Tyrosine Kinase Inhibitor
(EGFR-TKI), Gefitinib. Plant eIF3h is implicated in
translation of specific mRNAs.
Length = 266
Score = 26.1 bits (58), Expect = 6.8
Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 4/31 (12%)
Query: 101 RKAQREQAIKAAKEQKR----AQKEAKKPPP 127
R R+QA QKR AQ+EA+ P
Sbjct: 231 RNLARQQAQIQQWLQKRKAENAQREARGEEP 261
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 26.1 bits (58), Expect = 7.4
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 91 LAKRNVKPEIRKAQREQAIKAAKEQKRAQKEAKKPPPPTK 130
A+ E R A+ + A+ AA + +A+K A P K
Sbjct: 470 EARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIK 509
Score = 25.7 bits (57), Expect = 9.9
Identities = 10/29 (34%), Positives = 13/29 (44%)
Query: 101 RKAQREQAIKAAKEQKRAQKEAKKPPPPT 129
+KA AI AK +K AQ+ A
Sbjct: 546 KKAAVAAAIARAKAKKAAQQAANAEAEEE 574
>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
Members of this protein family resemble the widely
distributed ATP-dependent protease La, also called Lon
and LonA. It resembles even more closely LonB, which is
a LonA paralog found in genomes if and only if the
species is capable of endospore formation (as in
Bacillus subtilis, Clostridium tetani, and select other
members of the Firmicutes) and expressed specifically in
the forespore compartment. Members of this family are
restricted to a subset of spore-forming species, and are
very likely to participate in the program of endospore
formation. We propose the designation LonC [Protein
fate, Degradation of proteins, peptides, and
glycopeptides, Cellular processes, Sporulation and
germination].
Length = 615
Score = 25.9 bits (57), Expect = 7.5
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 3/54 (5%)
Query: 58 KQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQAIKA 111
K++KG E +T K+ R K ++ + S +L R EI +E+AIKA
Sbjct: 115 KEEKGPENSSTLKKLERLEKLEKKKLHKSAQSLLRPRAF-SEI--VGQERAIKA 165
>gnl|CDD|233370 TIGR01353, dGTP_triPase, deoxyguanosinetriphosphate
triphosphohydrolase, putative. dGTP
triphosphohydrolase (dgt) releases inorganic
triphosphate, an unusual activity reaction product,
from GTP. Its activity has been called limited to the
Enterobacteriaceae, although homologous sequences are
detected elsewhere. This finding casts doubt on whether
the activity is shared in other species. In several of
these other species, the homologous gene is found in an
apparent operon with dnaG, the DNA primase gene. The
enzyme from E. coli was shown to bind coopertatively to
single stranded DNA. The biological role of dgt is
unknown [Purines, pyrimidines, nucleosides, and
nucleotides, Nucleotide and nucleoside
interconversions].
Length = 381
Score = 25.8 bits (57), Expect = 8.2
Identities = 17/63 (26%), Positives = 24/63 (38%), Gaps = 7/63 (11%)
Query: 43 RRNPRKVTWTVLYRRKQKKGQ---EEEATRKRTRRTHKFQRAIVGASI----NDILAKRN 95
R+ ++ + +RR Q K Q E RTR TH + A VG SI
Sbjct: 6 ERDYDRIIHSSAFRRLQDKTQVFPLAENDFVRTRLTHSLEVAQVGRSIANLIGLRYDLEL 65
Query: 96 VKP 98
+
Sbjct: 66 EEL 68
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 25.6 bits (57), Expect = 8.4
Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 1/29 (3%)
Query: 97 KPEIRKAQREQAIKAAKEQKRAQKEAKKP 125
K E++ + Q K K++ +A K KP
Sbjct: 63 KKELKAWHKAQK-KKEKQEAKAAKAKSKP 90
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 25.8 bits (58), Expect = 8.5
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 80 RAIVGASINDILAKR-NVKPEIRKAQREQAIKAAKEQKRAQKEAKK 124
R V + D N PE+ + E+AIKAA+ + RA K+ K+
Sbjct: 347 RRFVSGVVKDAFDLWLNQNPELAEKLAEKAIKAAQARLRAAKKVKR 392
>gnl|CDD|224796 COG1884, Sbm, Methylmalonyl-CoA mutase, N-terminal domain/subunit
[Lipid metabolism].
Length = 548
Score = 25.8 bits (57), Expect = 9.1
Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 9/69 (13%)
Query: 61 KGQEEEATRKRTRRTHKFQRAIVG------ASINDILAKRNVKPEIRKAQREQAIKAAKE 114
+ + E+ +R R + +R IVG + + V PE A RE+ I K+
Sbjct: 433 QREIAESAARRQARIEEGERVIVGVNEYPEEGPEEPVEILKVDPE---AVRERQIARLKK 489
Query: 115 QKRAQKEAK 123
+ + E
Sbjct: 490 VRAERDEEA 498
>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
production and conversion].
Length = 194
Score = 25.4 bits (56), Expect = 9.4
Identities = 11/53 (20%), Positives = 20/53 (37%)
Query: 56 RRKQKKGQEEEATRKRTRRTHKFQRAIVGASINDILAKRNVKPEIRKAQREQA 108
+R+ ++ EE + + QR I A + K EI ++ E
Sbjct: 38 KREAEEAIEEILRKAEKEAERERQRIISSALLEARRKLLEAKEEILESVFEAV 90
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.129 0.371
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,500,251
Number of extensions: 554728
Number of successful extensions: 1527
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1368
Number of HSP's successfully gapped: 217
Length of query: 130
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 44
Effective length of database: 7,123,158
Effective search space: 313418952
Effective search space used: 313418952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 53 (24.3 bits)