RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12569
(94 letters)
>gnl|CDD|176655 cd06146, mut-7_like_exo, DEDDy 3'-5' exonuclease domain of
Caenorhabditis elegans mut-7 and similar proteins. The
mut-7 subfamily is composed of Caenorhabditis elegans
mut-7 and similar proteins found in plants and
metazoans. Mut-7 is implicated in posttranscriptional
gene silencing. It contains a DEDDy-type DnaQ-like 3'-5'
exonuclease domain possessing three conserved sequence
motifs, termed ExoI, ExoII and ExoIII, with a specific
YX(3)D pattern at ExoIII. These motifs are clustered
around the active site and contain four conserved acidic
residues that serve as ligands for the two metal ions
required for catalysis.
Length = 193
Score = 92.4 bits (230), Expect = 4e-25
Identities = 29/54 (53%), Positives = 38/54 (70%)
Query: 13 PQSLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVLHD 66
+ L+ LV + GK L+K +Q SNWE RPLR QI YAALDA+CLL+V+ L +
Sbjct: 140 TKGLADLVQEVLGKPLDKSEQCSNWERRPLREEQILYAALDAYCLLEVFDKLLE 193
>gnl|CDD|176654 cd06142, RNaseD_exo, DEDDy 3'-5' exonuclease domain of Ribonuclease
D and similar proteins. Ribonuclease (RNase) D is a
bacterial enzyme involved in the maturation of small
stable RNAs and the 3' maturation of tRNA. It contains a
DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing
three conserved sequence motifs termed ExoI, ExoII and
ExoIII, with a specific YX(3)D pattern at ExoIII. These
motifs are clustered around the active site and contain
four conserved acidic residues that serve as ligands for
the two metal ions required for catalysis. In vivo,
RNase D only becomes essential upon removal of other
ribonucleases. Eukaryotic RNase D homologs include yeast
Rrp6p, human PM/Scl-100, and the Drosophila melanogaster
egalitarian protein.
Length = 178
Score = 66.8 bits (164), Expect = 2e-15
Identities = 23/59 (38%), Positives = 34/59 (57%)
Query: 15 SLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVLHDQCARQGI 73
L+ LV + G L+K +Q S+W RPL Q+ YAALD LL +Y+ L ++ +G
Sbjct: 107 GLAALVEELLGVELDKGEQRSDWSKRPLTDEQLEYAALDVRYLLPLYEKLKEELEEEGR 165
>gnl|CDD|176653 cd06141, WRN_exo, DEDDy 3'-5' exonuclease domain of WRN and similar
proteins. WRN is a unique RecQ DNA helicase exhibiting
an exonuclease activity. It contains a DEDDy-type
DnaQ-like 3'-5' exonuclease domain possessing three
conserved sequence motifs termed ExoI, ExoII and ExoIII,
with a specific YX(3)D pattern at ExoIII. These motifs
are clustered around the active site and contain four
conserved acidic residues that serve as ligands for the
two metal ions required for catalysis. Mutations in the
WRN gene cause Werner syndrome, an autosomal recessive
disorder associated with premature aging and increased
susceptibility to cancer and type II diabetes. WRN
interacts with key proteins involved in DNA replication,
recombination, and repair. It is believed to maintain
genomic stability and life span by participating in DNA
processes. WRN is stimulated by Ku70/80, an important
regulator of genomic stability.
Length = 170
Score = 64.1 bits (157), Expect = 2e-14
Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 15 SLSTLVLKCFGKTLNKQD--QFSNWENRPLRPSQISYAALDAFCLLQVYQVL 64
SL+ LV + G L+K + SNWE RPL QI YAA DA+ L++Y+ L
Sbjct: 117 SLARLVEEVLGLPLSKPKKVRCSNWEARPLSKEQILYAATDAYASLELYRKL 168
>gnl|CDD|223426 COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and
biogenesis].
Length = 361
Score = 59.2 bits (144), Expect = 6e-12
Identities = 24/58 (41%), Positives = 35/58 (60%)
Query: 15 SLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVLHDQCARQG 72
L+ LV + G L+K +Q S+W RPL +Q+ YAA D LL +Y L ++ AR+G
Sbjct: 113 GLADLVEELLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLYDKLTEELAREG 170
>gnl|CDD|216604 pfam01612, DNA_pol_A_exo1, 3'-5' exonuclease. This domain is
responsible for the 3'-5' exonuclease proofreading
activity of E. coli DNA polymerase I (polI) and other
enzymes, it catalyzes the hydrolysis of unpaired or
mismatched nucleotides. This domain consists of the
amino-terminal half of the Klenow fragment in E. coli
polI it is also found in the Werner syndrome helicase
(WRN), focus forming activity 1 protein (FFA-1) and
ribonuclease D (RNase D). Werner syndrome is a human
genetic disorder causing premature aging; the WRN
protein has helicase activity in the 3'-5' direction.
The FFA-1 protein is required for formation of a
replication foci and also has helicase activity; it is a
homologue of the WRN protein. RNase D is a 3'-5'
exonuclease involved in tRNA processing. Also found in
this family is the autoantigen PM/Scl thought to be
involved in polymyositis-scleroderma overlap syndrome.
Length = 172
Score = 56.5 bits (137), Expect = 2e-11
Identities = 22/50 (44%), Positives = 29/50 (58%)
Query: 15 SLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVL 64
SL L K G L+K +Q ++W RPL Q+ YAA DA LL++Y L
Sbjct: 118 SLDDLAEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDKL 167
>gnl|CDD|176650 cd06129, RNaseD_like, DEDDy 3'-5' exonuclease domain of RNase D,
WRN, and similar proteins. The RNase D-like group is
composed of RNase D, WRN, and similar proteins. They
contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease
domain that contains three conserved sequence motifs
termed ExoI, ExoII and ExoIII, with a specific YX(3)D
pattern at ExoIII. These motifs are clustered around the
active site and contain four conserved acidic residues
that serve as ligands for the two metal ions required
for catalysis. RNase D is involved in the 3'-end
processing of tRNA precursors. RNase D-like proteins in
eukaryotes include yeast Rrp6p, human PM/Scl-100 and
Drosophila melanogaster egalitarian (Egl) protein. WRN
is a unique DNA helicase possessing exonuclease
activity. Mutation in the WRN gene is implicated in
Werner syndrome, a disease associated with premature
aging and increased predisposition to cancer. Yeast
Rrp6p and the human Polymyositis/scleroderma autoantigen
100kDa (PM/Scl-100) are exosome-associated proteins
involved in the degradation and processing of precursors
to stable RNAs. Egl is a component of an mRNA-binding
complex which is required for oocyte specification. The
Egl subfamily does not possess a completely conserved
YX(3)D pattern at the ExoIII motif.
Length = 161
Score = 48.3 bits (115), Expect = 2e-08
Identities = 22/50 (44%), Positives = 29/50 (58%)
Query: 15 SLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVL 64
SL++LV GKTL+K ++W RPL Q YAA D + LL +Y L
Sbjct: 110 SLASLVEHFLGKTLDKSISCADWSYRPLTEDQKLYAAADVYALLIIYTKL 159
>gnl|CDD|130455 TIGR01388, rnd, ribonuclease D. This model describes ribonuclease
D, a 3'-exonuclease shown to act on tRNA both in vitro
and when overexpressed in vivo. Trusted members of this
family are restricted to the Proteobacteria; Aquifex,
Mycobacterial, and eukaryotic homologs are not
full-length homologs. Ribonuclease D is not essential in
E. coli and is deleterious when overexpressed. Its
precise biological role is still unknown [Transcription,
RNA processing].
Length = 367
Score = 49.0 bits (117), Expect = 2e-08
Identities = 19/58 (32%), Positives = 30/58 (51%)
Query: 15 SLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVLHDQCARQG 72
+ LV + G L+K + ++W RPL +Q+ YAA D LL +Y L ++ G
Sbjct: 113 GYAKLVQEVLGVELDKSESRTDWLARPLTDAQLEYAAADVTYLLPLYAKLMERLEESG 170
>gnl|CDD|99850 cd06147, Rrp6p_like_exo, DEDDy 3'-5' exonuclease domain of yeast
Rrp6p, human polymyositis/scleroderma autoantigen
100kDa, and similar proteins. Yeast Rrp6p and its human
homolog, the polymyositis/scleroderma autoantigen 100kDa
(PM/Scl-100), are exosome-associated proteins involved
in the degradation and processing of precursors to
stable RNAs. Both proteins contain a DEDDy-type
DnaQ-like 3'-5' exonuclease domain possessing three
conserved sequence motifs termed ExoI, ExoII and ExoIII,
with a specific YX(3)D pattern at ExoIII. The motifs are
clustered around the active site and contain four
conserved acidic residues that serve as ligands for the
two metal ions required for catalysis. PM/Scl-100, an
autoantigen present in the nucleolar compartment of the
cell, reacts with autoantibodies produced by about 50%
of patients with polymyositis-scleroderma overlap
syndrome.
Length = 192
Score = 48.0 bits (115), Expect = 4e-08
Identities = 22/72 (30%), Positives = 33/72 (45%)
Query: 12 SPQSLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVLHDQCARQ 71
SL+ L+ K +K+ Q ++W RPL I YA D LL +Y L ++ +
Sbjct: 116 PRHSLAYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNELLER 175
Query: 72 GIELGPLLTEVL 83
L P L E +
Sbjct: 176 ANALAPNLLESV 187
>gnl|CDD|214681 smart00474, 35EXOc, 3'-5' exonuclease. 3\' -5' exonuclease
proofreading domain present in DNA polymerase I, Werner
syndrome helicase, RNase D and other enzymes.
Length = 172
Score = 45.8 bits (109), Expect = 2e-07
Identities = 23/50 (46%), Positives = 33/50 (66%)
Query: 15 SLSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVYQVL 64
L+TL+L G L+K++Q S+W RPL Q+ YAA DA LL++Y+ L
Sbjct: 118 GLATLLLGYLGVELDKEEQKSDWGARPLSEEQLEYAAEDADALLRLYEKL 167
>gnl|CDD|176656 cd09018, DEDDy_polA_RNaseD_like_exo, DEDDy 3'-5' exonuclease domain
of family-A DNA polymerases, RNase D, WRN, and similar
proteins. DEDDy exonucleases, part of the DnaQ-like (or
DEDD) exonuclease superfamily, catalyze the excision of
nucleoside monophosphates at the DNA or RNA termini in
the 3'-5' direction. They contain four invariant acidic
residues in three conserved sequence motifs termed ExoI,
ExoII and ExoIII. DEDDy exonucleases are classified as
such because of the presence of a specific YX(3)D
pattern at ExoIII. The four conserved acidic residues
serve as ligands for the two metal ions required for
catalysis. This family of DEDDy exonucleases includes
the proofreading domains of family A DNA polymerases, as
well as RNases such as RNase D and yeast Rrp6p. The
Egalitarian (Egl) and Bacillus-like DNA Polymerase I
subfamilies do not possess a completely conserved YX(3)D
pattern at the ExoIII motif. In addition, the
Bacillus-like DNA polymerase I subfamily has inactive
3'-5' exonuclease domains which do not possess the
metal-binding residues necessary for activity.
Length = 150
Score = 36.8 bits (85), Expect = 3e-04
Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 2/52 (3%)
Query: 15 SLSTLVLKCFGKTLNKQDQF--SNWENRPLRPSQISYAALDAFCLLQVYQVL 64
+ +LV + G L K + W N+PL Q YAA DA LQ++ L
Sbjct: 97 DMDSLVERWLGHKLIKFESIAGKLWFNQPLTEEQGRYAAEDADVTLQIHLKL 148
>gnl|CDD|99851 cd06148, Egl_like_exo, DEDDy 3'-5' exonuclease domain of Drosophila
Egalitarian (Egl) and similar proteins. The Egalitarian
(Egl) protein subfamily is composed of Drosophila Egl
and similar proteins. Egl is a component of an
mRNA-binding complex which is required for oocyte
specification. Egl contains a DEDDy-type DnaQ-like 3'-5'
exonuclease domain possessing three conserved sequence
motifs termed ExoI, ExoII and ExoIII, with a specific
YX(3)D pattern at ExoIII. The motifs are clustered
around the active site and contain four conserved acidic
residues that serve as ligands for the two metal ions
required for catalysis. The conservation of this
subfamily throughout eukaryotes suggests that its
members may be part of ancient RNA processing complexes
that are likely to participate in the regulated
processing of specific mRNAs. Some members of this
subfamily do not have a completely conserved YX(3)D
pattern at the ExoIII motif.
Length = 197
Score = 36.1 bits (84), Expect = 8e-04
Identities = 17/36 (47%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 26 KTLNKQDQFSNWENRPLRPSQISYAALDAFCLLQVY 61
K L ++D W RPL I YAALD CLL +Y
Sbjct: 136 KKLMREDP-KFWALRPLTEDMIRYAALDVLCLLPLY 170
>gnl|CDD|236771 PRK10829, PRK10829, ribonuclease D; Provisional.
Length = 373
Score = 34.2 bits (79), Expect = 0.004
Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 7 ASTQGSPQS--LSTLVLKCFGKTLNKQDQFSNWENRPLRPSQISYAALDAFCLL 58
A+ G P S +++V + G TL+K + ++W RPL Q YAA D F LL
Sbjct: 107 AAFCGRPLSCGFASMVEEYTGVTLDKSESRTDWLARPLSERQCEYAAADVFYLL 160
>gnl|CDD|181973 PRK09589, celA, 6-phospho-beta-glucosidase; Reviewed.
Length = 476
Score = 27.8 bits (62), Expect = 0.71
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 7/36 (19%)
Query: 58 LQVYQVLHDQCARQGIELGPLLT----EVLSSHAVT 89
LQ Y L D+C +QGIE P++T E + H VT
Sbjct: 107 LQFYDDLFDECLKQGIE--PVVTLSHFE-MPYHLVT 139
>gnl|CDD|213768 TIGR03072, release_prfH, putative peptide chain release factor H.
Members of this protein family are bacterial proteins
homologous to peptide chain release factors 1 (RF-1,
product of the prfA gene), and 2 (RF-2, product of the
prfB gene). The member from Escherichia coli K-12,
designated prfH, appears to be a pseudogene. This class
I release factor is always found as the downstream gene
of a two-gene operon [Protein synthesis, Translation
factors].
Length = 200
Score = 26.3 bits (58), Expect = 2.0
Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 3/40 (7%)
Query: 46 QISYAALDAFCLLQVYQVL---HDQCARQGIELGPLLTEV 82
Q+S A A C L V + L + A +G+ + L E
Sbjct: 4 QLSSAQGPAECCLAVAKALERLTREAAARGVRVEVLEQEP 43
>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
Length = 880
Score = 26.6 bits (60), Expect = 2.3
Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 10/44 (22%)
Query: 49 YAALDAFCLLQVYQVLHDQCARQG--------IELGPLLTEVLS 84
YAA DA L++++VL + + IEL P L VL+
Sbjct: 450 YAAEDADVTLRLHEVLKPKLLEEPGLLELYEEIEL-P-LVPVLA 491
>gnl|CDD|176651 cd06139, DNA_polA_I_Ecoli_like_exo, DEDDy 3'-5' exonuclease domain
of Escherichia coli DNA polymerase I and similar
bacterial family-A DNA polymerases. Escherichia
coli-like Polymerase I (Pol I), a subgroup of family-A
DNA polymerases, contains a DEDDy-type DnaQ-like 3'-5'
exonuclease domain in the same polypeptide chain as the
polymerase domain. The exonuclease domain contains three
conserved sequence motifs termed ExoI, ExoII and ExoIII,
with a specific YX(3)D pattern at ExoIII. These motifs
are clustered around the active site and contain four
conserved acidic residues that serve as ligands for the
two metal ions required for catalysis. The 3'-5'
exonuclease domain of DNA polymerases has a fundamental
role in reducing polymerase errors and is involved in
proofreading activity. E. coli DNA Pol I is involved in
genome replication but is not the main replicating
enzyme. It is also implicated in DNA repair.
Length = 193
Score = 26.3 bits (59), Expect = 2.4
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 49 YAALDAFCLLQVYQVLHDQ 67
YAA DA L++Y++L +
Sbjct: 151 YAAEDADITLRLYELLKPK 169
>gnl|CDD|216166 pfam00872, Transposase_mut, Transposase, Mutator family.
Length = 381
Score = 26.1 bits (58), Expect = 2.9
Identities = 9/19 (47%), Positives = 10/19 (52%)
Query: 36 NWENRPLRPSQISYAALDA 54
W RPL S+ Y LDA
Sbjct: 151 AWRERPLEESRYPYLFLDA 169
>gnl|CDD|176117 cd08426, PBP2_LTTR_like_5, The C-terminal substrate binding domain
of an uncharacterized LysR-type transcriptional
regulator, contains the type 2 periplasmic binding fold.
LysR-transcriptional regulators comprise the largest
family of prokaryotic transcription factor. Homologs of
some of LTTRs with similar domain organizations are also
found in the archaea and eukaryotic organisms. The LTTRs
are composed of two functional domains joined by a
linker helix involved in oligomerization: an N-terminal
HTH (helix-turn-helix) domain, which is responsible for
the DNA-binding specificity, and a C-terminal
substrate-binding domain, which is structurally
homologous to the type 2 periplasmic binding proteins.
As also observed in the periplasmic binding proteins,
the C-terminal domain of the bacterial transcriptional
repressor undergoes a conformational change upon
substrate binding which in turn changes the DNA binding
affinity of the repressor. The genes controlled by the
LTTRs have diverse functional roles including amino acid
biosynthesis, CO2 fixation, antibiotic resistance,
degradation of aromatic compounds, nodule formation of
nitrogen-fixing bacteria, and synthesis of virulence
factors, to a name a few. This substrate-binding domain
shows significant homology to the type 2 periplasmic
binding proteins (PBP2), which are responsible for the
uptake of a variety of substrates such as phosphate,
sulfate, polysaccharides, lysine/arginine/ornithine, and
histidine. The PBP2 bind their ligand in the cleft
between these domains in a manner resembling a Venus
flytrap. After binding their specific ligand with high
affinity, they can interact with a cognate membrane
transport complex comprised of two integral membrane
domains and two cytoplasmically located ATPase domains.
This interaction triggers the ligand translocation
across the cytoplasmic membrane energized by ATP
hydrolysis.
Length = 199
Score = 25.7 bits (57), Expect = 3.5
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 10/58 (17%)
Query: 41 PL-RPSQISYAALDAFCL------LQVYQVLHDQCARQGIELGPLLTEVLSSHAVTTK 91
PL R ++ A L + L + Q+L AR G++L P+L +S+ T K
Sbjct: 83 PLARQPSVTLAQLAGYPLALPPPSFSLRQILDAAFARAGVQLEPVL---ISNSIETLK 137
>gnl|CDD|236976 PRK11773, uvrD, DNA-dependent helicase II; Provisional.
Length = 721
Score = 25.2 bits (56), Expect = 6.0
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 3/33 (9%)
Query: 42 LRPSQISYAALDAF--CLLQVYQVLHDQCARQG 72
LRP I + D L++YQ + C R G
Sbjct: 154 LRPQHIQ-SYGDPVEQTWLKIYQAYQEACDRAG 185
>gnl|CDD|212577 cd11704, DHR2_DOCK3, Dock Homology Region 2, a GEF domain, of
Class B Dedicator of Cytokinesis 3. Dock3, also called
modifier of cell adhesion (MOCA), is an atypical
guanine nucleotide exchange factor (GEF) that lacks the
conventional Dbl homology (DH) domain. As a GEF, it
activates small GTPases by exchanging bound GDP for
free GTP. Dock3 is a specific GEF for Rac. It regulates
N-cadherin dependent cell-cell adhesion, cell polarity,
and neuronal morphology. It promotes axonal growth by
stimulating actin polymerization and microtubule
assembly. DOCK proteins are divided into four classes
(A-D) based on sequence similarity and domain
architecture; class B includes Dock3 and 4. All DOCKs
contain two homology domains: the DHR-1 (Dock homology
region-1), also called CZH1 (CED-5, Dock180, and
MBC-zizimin homology 1), and DHR-2 (also called CZH2 or
Docker). The DHR-1 domain binds
phosphatidylinositol-3,4,5-triphosphate. This alignment
model represents the DHR-2 domain of Dock3, which
contains the catalytic GEF activity for Rac and/or
Cdc42. Class B DOCKs also contain an SH3 domain at the
N-terminal region and a PxxP motif at the C-terminus.
Length = 392
Score = 25.4 bits (55), Expect = 6.1
Identities = 13/45 (28%), Positives = 19/45 (42%), Gaps = 5/45 (11%)
Query: 36 NWENRPLR-----PSQISYAALDAFCLLQVYQVLHDQCARQGIEL 75
WE+RPLR PSQ + + C ++ + GI L
Sbjct: 35 QWEDRPLREFLHYPSQSEWQRKEGLCRKIIHYFNKGKSWEFGIPL 79
>gnl|CDD|165547 PHA03290, PHA03290, envelope glycoprotein I; Provisional.
Length = 357
Score = 25.4 bits (55), Expect = 6.5
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 46 QISYAALDAFCLLQVYQVLHDQCAR 70
++ +A AFC + ++++D+C R
Sbjct: 73 ELLHADQTAFCFQPLQRIMYDECPR 97
>gnl|CDD|150740 pfam10100, DUF2338, Uncharacterized protein conserved in bacteria
(DUF2338). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 429
Score = 25.1 bits (55), Expect = 7.0
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 33 QFSNWENRPLRPSQISYAALDAF 55
+F +N P+RP +S ++ F
Sbjct: 278 RFMVDDNYPVRPEMLSRRDIENF 300
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.129 0.376
Gapped
Lambda K H
0.267 0.0706 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,210,899
Number of extensions: 313480
Number of successful extensions: 247
Number of sequences better than 10.0: 1
Number of HSP's gapped: 246
Number of HSP's successfully gapped: 23
Length of query: 94
Length of database: 10,937,602
Length adjustment: 61
Effective length of query: 33
Effective length of database: 8,232,008
Effective search space: 271656264
Effective search space used: 271656264
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 53 (24.2 bits)