RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17284
         (286 letters)



>gnl|CDD|99996 cd04300, GT1_Glycogen_Phosphorylase, This is a family of
           oligosaccharide phosphorylases. It includes yeast and
           mammalian glycogen phosphorylases, plant starch/glucan
           phosphorylase, as well as the maltodextrin
           phosphorylases of bacteria. The members of this family
           catalyze the breakdown of oligosaccharides into
           glucose-1-phosphate units. They are important allosteric
           enzymes in carbohydrate metabolism. The allosteric
           control mechanisms of yeast and mammalian members of
           this family are different from that of bacterial
           members. The members of this family belong to the GT-B
           structural superfamily of glycoslytransferases, which
           have characteristic N- and C-terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology.  The large cleft that separates the two
           domains includes the catalytic center and permits a high
           degree of flexibility.
          Length = 797

 Score =  245 bits (629), Expect = 8e-76
 Identities = 82/118 (69%), Positives = 96/118 (81%)

Query: 2   KAELIIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIFIF 61
            AE IIPAADLSEQISTAG EASGTGNMKFMLNGALTIGTLDGANVE+AEE+G +NIFIF
Sbjct: 621 LAEKIIPAADLSEQISTAGKEASGTGNMKFMLNGALTIGTLDGANVEIAEEVGEENIFIF 680

Query: 62  GMTVDQVEELKKKGYDAPSYYNANPELKLVVDQITNGFFSPENPDEFKDLSDILLKWD 119
           G+T ++VE L+  GY    YY A+PEL+ V+DQI +GFFSP +P EF+ L D LL  +
Sbjct: 681 GLTAEEVEALRANGYYPADYYEADPELRRVLDQIASGFFSPGDPGEFRPLVDSLLNGN 738



 Score =  206 bits (528), Expect = 2e-61
 Identities = 69/86 (80%), Positives = 72/86 (83%)

Query: 199 KWHTRTIMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELI 258
               RT + GGKAAPGYY AK IIKLI +VA VVNNDPDVGDKLKVVFL NY V+LAE I
Sbjct: 566 DIVPRTFIFGGKAAPGYYMAKLIIKLINAVADVVNNDPDVGDKLKVVFLPNYNVSLAEKI 625

Query: 259 IPAADLSEQISTAGTEASGTGNMKFM 284
           IPAADLSEQISTAG EASGTGNMKFM
Sbjct: 626 IPAADLSEQISTAGKEASGTGNMKFM 651


>gnl|CDD|233722 TIGR02093, P_ylase, glycogen/starch/alpha-glucan phosphorylases.
           This family consists of phosphorylases. Members use
           phosphate to break alpha 1,4 linkages between pairs of
           glucose residues at the end of long glucose polymers,
           releasing alpha-D-glucose 1-phosphate. The nomenclature
           convention is to preface the name according to the
           natural substrate, as in glycogen phosphorylase, starch
           phosphorylase, maltodextrin phosphorylase, etc. Name
           differences among these substrates reflect differences
           in patterns of branching with alpha 1,6 linkages.
           Members include allosterically regulated and unregulated
           forms. A related family, TIGR02094, contains examples
           known to act well on particularly small alpha 1,4
           glucans, as may be found after import from exogenous
           sources [Energy metabolism, Biosynthesis and degradation
           of polysaccharides].
          Length = 794

 Score =  217 bits (556), Expect = 2e-65
 Identities = 82/114 (71%), Positives = 97/114 (85%)

Query: 3   AELIIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIFIFG 62
           AELIIPAADLSEQISTAG EASGTGNMKFMLNGALTIGTLDGANVE+ EE+G +NIFIFG
Sbjct: 619 AELIIPAADLSEQISTAGKEASGTGNMKFMLNGALTIGTLDGANVEIREEVGAENIFIFG 678

Query: 63  MTVDQVEELKKKGYDAPSYYNANPELKLVVDQITNGFFSPENPDEFKDLSDILL 116
           +TV++VE L++KGY+   YY A+PELK V+D I++G FSP +P  F+ L D LL
Sbjct: 679 LTVEEVEALREKGYNPREYYEADPELKRVLDLISSGTFSPGDPGLFRPLYDSLL 732



 Score =  185 bits (473), Expect = 1e-53
 Identities = 68/86 (79%), Positives = 73/86 (84%)

Query: 199 KWHTRTIMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELI 258
               RT++ GGKAAPGY+ AK IIKLI SVA VVNNDP VGDKLKVVF+ NY V+LAELI
Sbjct: 563 DIVPRTVIFGGKAAPGYHMAKLIIKLINSVAEVVNNDPAVGDKLKVVFVPNYNVSLAELI 622

Query: 259 IPAADLSEQISTAGTEASGTGNMKFM 284
           IPAADLSEQISTAG EASGTGNMKFM
Sbjct: 623 IPAADLSEQISTAGKEASGTGNMKFM 648


>gnl|CDD|215868 pfam00343, Phosphorylase, Carbohydrate phosphorylase.  The members
           of this family catalyze the formation of glucose
           1-phosphate from one of the following polyglucoses;
           glycogen, starch, glucan or maltodextrin.
          Length = 712

 Score =  209 bits (535), Expect = 8e-63
 Identities = 80/118 (67%), Positives = 97/118 (82%)

Query: 3   AELIIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIFIFG 62
           AE IIPA+D+SEQISTAGTEASGT NMKF LNGALTIGTLDGANVE+AEE+G +NIFIFG
Sbjct: 536 AEKIIPASDISEQISTAGTEASGTSNMKFALNGALTIGTLDGANVEIAEEVGEENIFIFG 595

Query: 63  MTVDQVEELKKKGYDAPSYYNANPELKLVVDQITNGFFSPENPDEFKDLSDILLKWDR 120
           +T ++VE L+KKGY +  YY  +P L+ V+DQI +GFFSPE+PD F+D+ D L   D 
Sbjct: 596 LTAEEVEALRKKGYRSREYYKKDPRLRQVLDQIISGFFSPEDPDRFRDILDSLQGGDY 653



 Score =  176 bits (449), Expect = 1e-50
 Identities = 66/87 (75%), Positives = 73/87 (83%)

Query: 199 KWHTRTIMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELI 258
               R ++ GGKAAPGYY AK+IIKLI SVA VVNNDP VGDKLKVVFL NYRV+LAE I
Sbjct: 480 DVVPRVVIFGGKAAPGYYMAKRIIKLINSVADVVNNDPAVGDKLKVVFLPNYRVSLAEKI 539

Query: 259 IPAADLSEQISTAGTEASGTGNMKFMI 285
           IPA+D+SEQISTAGTEASGT NMKF +
Sbjct: 540 IPASDISEQISTAGTEASGTSNMKFAL 566


>gnl|CDD|223136 COG0058, GlgP, Glucan phosphorylase [Carbohydrate transport and
           metabolism].
          Length = 750

 Score =  184 bits (469), Expect = 3e-53
 Identities = 67/115 (58%), Positives = 84/115 (73%), Gaps = 1/115 (0%)

Query: 3   AELIIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIFIFG 62
           AEL+IPAAD+ EQI TAG EASGT NMK  LNGALT+GTLDGANVE+ E +G +N +IFG
Sbjct: 570 AELLIPAADVWEQIPTAGKEASGTSNMKAALNGALTLGTLDGANVEIYEHVGGENGWIFG 629

Query: 63  MTVDQVEELKKKGYDAPS-YYNANPELKLVVDQITNGFFSPENPDEFKDLSDILL 116
            TV++V+ L+  GYD  + YY    E+K V+D+I +G FSP     FK+L D LL
Sbjct: 630 ETVEEVDALRADGYDPNALYYELENEVKPVLDEIIDGRFSPGWKSRFKNLIDSLL 684



 Score =  145 bits (368), Expect = 2e-39
 Identities = 54/90 (60%), Positives = 61/90 (67%), Gaps = 7/90 (7%)

Query: 197 KGKWHTRTIMI-GGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLA 255
           K  W  R  +I  GKA P  Y AK+IIKLI  VA V+NN      KLKVVFL NY V+LA
Sbjct: 517 KEDWVPRVQIIFAGKAHPADYAAKEIIKLINDVADVINN------KLKVVFLPNYDVSLA 570

Query: 256 ELIIPAADLSEQISTAGTEASGTGNMKFMI 285
           EL+IPAAD+ EQI TAG EASGT NMK  +
Sbjct: 571 ELLIPAADVWEQIPTAGKEASGTSNMKAAL 600


>gnl|CDD|237881 PRK14985, PRK14985, maltodextrin phosphorylase; Provisional.
          Length = 798

 Score =  182 bits (465), Expect = 1e-52
 Identities = 64/115 (55%), Positives = 83/115 (72%)

Query: 3   AELIIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIFIFG 62
           AEL+IPAAD+SEQISTAG EASGTGNMK  LNGALT+GTLDGANVE+AE++G +NIFIFG
Sbjct: 621 AELLIPAADISEQISTAGKEASGTGNMKLALNGALTVGTLDGANVEIAEQVGEENIFIFG 680

Query: 63  MTVDQVEELKKKGYDAPSYYNANPELKLVVDQITNGFFSPENPDEFKDLSDILLK 117
            TV+QV+ L  KGYD   +   +  L  V+ ++ +G +S  +   F  +   L +
Sbjct: 681 HTVEQVKALLAKGYDPVKWRKKDKVLDAVLKELESGKYSDGDKHAFDQMLHSLKQ 735



 Score =  148 bits (376), Expect = 3e-40
 Identities = 58/82 (70%), Positives = 64/82 (78%)

Query: 201 HTRTIMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELIIP 260
             R  + G KAAPGYY AK II  I  VA V+NNDP VGDKLKVVFL +Y V+ AEL+IP
Sbjct: 567 VPRVFLFGAKAAPGYYLAKNIIFAINKVAEVINNDPLVGDKLKVVFLPDYCVSAAELLIP 626

Query: 261 AADLSEQISTAGTEASGTGNMK 282
           AAD+SEQISTAG EASGTGNMK
Sbjct: 627 AADISEQISTAGKEASGTGNMK 648


>gnl|CDD|184948 PRK14986, PRK14986, glycogen phosphorylase; Provisional.
          Length = 815

 Score =  182 bits (463), Expect = 2e-52
 Identities = 74/114 (64%), Positives = 88/114 (77%)

Query: 3   AELIIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIFIFG 62
           A+LIIPAADLSEQIS AGTEASGT NMKF LNGALTIGTLDGANVEM E +G +NIFIFG
Sbjct: 635 AQLIIPAADLSEQISLAGTEASGTSNMKFALNGALTIGTLDGANVEMLEHVGEENIFIFG 694

Query: 63  MTVDQVEELKKKGYDAPSYYNANPELKLVVDQITNGFFSPENPDEFKDLSDILL 116
            T ++VE L+++GY    YY  + EL  V+ QI +G FSPE P  ++DL D L+
Sbjct: 695 NTAEEVEALRRQGYKPREYYEKDEELHQVLTQIGSGVFSPEEPGRYRDLVDSLI 748



 Score =  141 bits (358), Expect = 6e-38
 Identities = 60/85 (70%), Positives = 67/85 (78%)

Query: 199 KWHTRTIMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELI 258
           KW  R  +  GKAA  YY AK II LI  VA+V+NNDP +GDKLKVVF+ NY V+LA+LI
Sbjct: 579 KWVPRVNIFAGKAASAYYMAKHIIHLINDVAKVINNDPQIGDKLKVVFIPNYSVSLAQLI 638

Query: 259 IPAADLSEQISTAGTEASGTGNMKF 283
           IPAADLSEQIS AGTEASGT NMKF
Sbjct: 639 IPAADLSEQISLAGTEASGTSNMKF 663


>gnl|CDD|233723 TIGR02094, more_P_ylases, alpha-glucan phosphorylases.  This family
           consists of known phosphorylases, and homologs believed
           to share the function of using inorganic phosphate to
           cleave an alpha 1,4 linkage between the terminal glucose
           residue and the rest of the polymer (maltodextrin,
           glycogen, etc.). The name of the glucose storage polymer
           substrate, and therefore the name of this enzyme,
           depends on the chain lengths and branching patterns. A
           number of the members of this family have been shown to
           operate on small maltodextrins, as may be obtained by
           utilization of exogenous sources. This family represents
           a distinct clade from the related family modeled by
           TIGR02093/pfam00343.
          Length = 601

 Score = 46.9 bits (112), Expect = 6e-06
 Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 205 IMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELIIPAADL 264
           I+  GKA P     K+II+ I   ++     P+   +  +VFLENY + LA  ++   D+
Sbjct: 427 IVFAGKAHPADGEGKEIIQRIVEFSK----RPEFRGR--IVFLENYDINLARYLVSGVDV 480

Query: 265 SEQISTAGTEASGTGNMK 282
                    EASGT  MK
Sbjct: 481 WLNNPRRPLEASGTSGMK 498



 Score = 35.4 bits (82), Expect = 0.035
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 22  EASGTGNMKFMLNGALTIGTLDG 44
           EASGT  MK  +NG L +  LDG
Sbjct: 490 EASGTSGMKAAMNGVLNLSILDG 512


>gnl|CDD|99995 cd04299, GT1_Glycogen_Phosphorylase_like, This family is most
           closely related to the oligosaccharide phosphorylase
           domain family and other unidentified sequences.
           Oligosaccharide phosphorylase catalyzes the breakdown of
           oligosaccharides into glucose-1-phosphate units. They
           are important allosteric enzymes in carbohydrate
           metabolism. The members of this family are found in
           bacteria and Archaea.
          Length = 778

 Score = 43.7 bits (104), Expect = 7e-05
 Identities = 25/87 (28%), Positives = 37/87 (42%), Gaps = 24/87 (27%)

Query: 205 IMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELIIPAAD- 263
            +  GKA P     K++I+ I   +R     P+   +  +VFLE+Y + LA  ++   D 
Sbjct: 516 FIFAGKAHPADEPGKELIQEIVEFSR----RPEFRGR--IVFLEDYDMALARHLVQGVDV 569

Query: 264 --------LSEQISTAGTEASGTGNMK 282
                   L         EASGT  MK
Sbjct: 570 WLNTPRRPL---------EASGTSGMK 587



 Score = 36.8 bits (86), Expect = 0.013
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 22  EASGTGNMKFMLNGALTIGTLDG 44
           EASGT  MK  LNG L +  LDG
Sbjct: 579 EASGTSGMKAALNGGLNLSVLDG 601


>gnl|CDD|215970 pfam00523, Fusion_gly, Fusion glycoprotein F0. 
          Length = 468

 Score = 32.7 bits (75), Expect = 0.22
 Identities = 21/70 (30%), Positives = 31/70 (44%), Gaps = 7/70 (10%)

Query: 214 GYYTAKKIIKLICSVARVVNNDPDVGDKLKVVFLENYRVTLAELIIPAADLSEQISTAGT 273
           G  T+  ++KL+        N P + +      LE Y  TL EL+ P AD  + IS   T
Sbjct: 24  GSSTSYIVVKLL-------PNIPPLKNSCIKTELEQYNNTLNELLTPLADALDLISRLIT 76

Query: 274 EASGTGNMKF 283
             +G    +F
Sbjct: 77  TITGRRQKRF 86


>gnl|CDD|99959 cd01635, Glycosyltransferase_GTB_type, Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. The structures of the formed
           glycoconjugates are extremely diverse, reflecting a wide
           range of biological functions. The members of this
           family share a common GTB topology, one of the two
           protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility.
          Length = 229

 Score = 29.9 bits (67), Expect = 1.2
 Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 2/51 (3%)

Query: 9   AADLSEQISTAGTEASGTGNMKFMLNGALTIGTLDGANVEMAEEMGNDNIF 59
           AAD+   +  +  E  G   ++ M  G   I T  G   E+ E+     + 
Sbjct: 181 AADVF--VLPSLREGFGLVVLEAMACGLPVIATDVGGPPEIVEDGLTGLLV 229


>gnl|CDD|183687 PRK12693, flgG, flagellar basal body rod protein FlgG; Provisional.
          Length = 261

 Score = 28.3 bits (64), Expect = 4.1
 Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 7/40 (17%)

Query: 21  TEASG---TGNMKFMLNGALTI--GTLDGANVEMAEEMGN 55
           T ASG    G     LNG  T+  G L+ +NV + EE+ N
Sbjct: 195 TAASGAPIEGTPG--LNGLGTLRQGFLETSNVNVVEELVN 232


>gnl|CDD|217114 pfam02574, S-methyl_trans, Homocysteine S-methyltransferase.  This
           is a family of related homocysteine S-methyltransferases
           enzymes: 5-methyltetrahydrofolate--homocysteine
           S-methyltransferases also known EC:2.1.1.13;
           Betaine--homocysteine S-methyltransferase (vitamin B12
           dependent), EC:2.1.1.5; and Homocysteine
           S-methyltransferase, EC:2.1.1.10.
          Length = 303

 Score = 28.5 bits (64), Expect = 4.6
 Identities = 28/125 (22%), Positives = 40/125 (32%), Gaps = 18/125 (14%)

Query: 9   AADLSEQISTAGTEASGTGNMKFMLNGALTIGT---LDGANVEMAEEMGNDNIFIFGM-- 63
           A    E++     EA G  +     +G L  GT   L GA +E A       I   G+  
Sbjct: 164 AVRAVEEV----FEAPGWISFPVFDSGTLVDGTGLTLSGATIEAALNSVQAEIDAVGVNC 219

Query: 64  -----TVDQVEELKKKGYDAPSYYNANPELKLVVDQITNGFFSPENPDEFKDLSDILLKW 118
                 +D++ EL       P   +  P      D           PDE       L ++
Sbjct: 220 ALGPDIMDEILELLSLTARKP--LSVYPNSGEPYDA--GKKEYDGTPDELAPSYWSLKEF 275

Query: 119 DRPGA 123
              GA
Sbjct: 276 VASGA 280


>gnl|CDD|222397 pfam13814, Replic_Relax, Replication-relaxation.  This family
           includes proteins which are essential for plasmid
           replication and plasmid DNA relaxation.
          Length = 191

 Score = 27.7 bits (62), Expect = 5.2
 Identities = 14/70 (20%), Positives = 19/70 (27%), Gaps = 6/70 (8%)

Query: 96  TNGFF------SPENPDEFKDLSDILLKWDRPGAIPLKGPVPTTQLVLRWASSGRWPVTF 149
            N  F      S +   E  +         +   I   G         RW   GR    F
Sbjct: 93  VNEVFVALLAASRQEGGELTEWWSEPRCTRQYTNIKGAGKKIRPDAYGRWTEGGRTLEFF 152

Query: 150 MEHFRGSSHR 159
           +E  RG+   
Sbjct: 153 LEVDRGTEPL 162


>gnl|CDD|225347 COG2733, COG2733, Predicted membrane protein [Function unknown].
          Length = 415

 Score = 28.1 bits (63), Expect = 5.5
 Identities = 13/79 (16%), Positives = 24/79 (30%), Gaps = 13/79 (16%)

Query: 203 RTIMIGGKAAPGYYTAKKIIKLICSVARVVNNDPDVGDKLK-------VVFLENYRVTLA 255
              +     +      K++ + + SV   +  D  +  KL            E     + 
Sbjct: 303 SEWLKEDYESEDSMLRKRLARAVQSVGEELIADDALRAKLNEHLVQAAERLAEEKHAEIT 362

Query: 256 ELIIP------AADLSEQI 268
           + I        A +LS QI
Sbjct: 363 KHISDTVKRWDAEELSRQI 381


>gnl|CDD|153101 cd01042, DMQH, Demethoxyubiquinone hydroxylase, ferritin-like
           diiron-binding domain.  Demethoxyubiquinone hydroxylases
           (DMQH) are members of the ferritin-like,
           diiron-carboxylate family which are present in
           eukaryotes (the CLK-1/CAT5 family) and prokaryotes (the
           Coq7 family). DMQH participates in one of the last steps
           of ubiquinone biosysnthesis and is responsible for DMQ
           hydroxylation, resulting in the formation of
           hydroxyubiquinone, a precursor of ubiquinone. CLK-1 is a
           mitochondrial inner membrane protein and Coq7 is a
           proposed interfacial integral membrane protein.
           Mutations in the Caenorhabditis elegans gene clk-1
           affect biological timing and extend longevity. The
           conserved residues of a diiron center are present in
           this domain.
          Length = 165

 Score = 27.5 bits (62), Expect = 6.3
 Identities = 10/22 (45%), Positives = 12/22 (54%)

Query: 208 GGKAAPGYYTAKKIIKLICSVA 229
           G + AP Y   K +IK  C VA
Sbjct: 138 GAEKAPLYALLKALIKAGCKVA 159


>gnl|CDD|219024 pfam06429, Flg_bbr_C, Flagellar basal body rod FlgEFG protein
          C-terminal.  This family consists of a number of
          C-terminal domains of unknown function. This domain
          seems to be specific to flagellar basal-body rod and
          flagellar hook proteins in which pfam00460 is often
          present at the extreme N terminus.
          Length = 74

 Score = 26.1 bits (58), Expect = 6.9
 Identities = 12/33 (36%), Positives = 15/33 (45%)

Query: 21 TEASGTGNMKFMLNGALTIGTLDGANVEMAEEM 53
           E  G        NG +  G L+G+NV   EEM
Sbjct: 12 RETDGAPIAGADANGRVAQGYLEGSNVNPVEEM 44


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.133    0.384 

Gapped
Lambda     K      H
   0.267   0.0701    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,818,291
Number of extensions: 1444459
Number of successful extensions: 1279
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1274
Number of HSP's successfully gapped: 35
Length of query: 286
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 190
Effective length of database: 6,679,618
Effective search space: 1269127420
Effective search space used: 1269127420
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (26.2 bits)