RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy6415
         (174 letters)



>3i2n_A WD repeat-containing protein 92; WD40 repeats, structural genomics,
           structural genomic consortium, SGC, apoptosis,
           transcription; 1.95A {Homo sapiens}
          Length = 357

 Score =  135 bits (343), Expect = 3e-39
 Identities = 101/173 (58%), Positives = 127/173 (73%), Gaps = 19/173 (10%)

Query: 2   VVAGYDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIPMNKLVATTLEAKLFVFDLK 61
           V AGYDNGD+K+FDL+ M+L+WETN+KNGVC ++FDR+DI MNKLVAT+LE K  VFD++
Sbjct: 183 VCAGYDNGDIKLFDLRNMALRWETNIKNGVCSLEFDRKDISMNKLVATSLEGKFHVFDMR 242

Query: 62  TLHREKGFAYLSEKAHKATTVWAAKHLPQNRDLFVTCGGSGSLNLWQLVSALRADTNLCS 121
           T H  KGFA +SEKAHK+ TVW  +HLPQNR+LF+T GG+G L+LW+             
Sbjct: 243 TQHPTKGFASVSEKAHKS-TVWQVRHLPQNRELFLTAGGAGGLHLWKY------------ 289

Query: 122 IFRYSSNYPTQRTKPDIDDIPMGVPGSLTLLQEATLSSQPINSLDWSADKLGL 174
                  YP QR+K D + I MGV GS++LLQ  TLS+QPI+SLDWS DK GL
Sbjct: 290 ------EYPIQRSKKDSEGIEMGVAGSVSLLQNVTLSTQPISSLDWSPDKRGL 336


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 44.3 bits (104), Expect = 7e-06
 Identities = 24/109 (22%), Positives = 40/109 (36%), Gaps = 29/109 (26%)

Query: 86  KHLPQNRDL----------FVTCGGSGSLNLWQLVSALR---ADTNLCSIFRYSSNYP-T 131
            HLP  + +           V  G   S  L+ L   LR   A + L       S  P +
Sbjct: 358 SHLPAGKQVEISLVNGAKNLVVSGPPQS--LYGLNLTLRKAKAPSGL-----DQSRIPFS 410

Query: 132 QRTKPDIDD--IPMGVP-GSLTLLQEAT---LSSQPINSLDWSADKLGL 174
           +R K    +  +P+  P  S  LL  A+         N++ ++A  + +
Sbjct: 411 ER-KLKFSNRFLPVASPFHS-HLLVPASDLINKDLVKNNVSFNAKDIQI 457



 Score = 30.0 bits (67), Expect = 0.40
 Identities = 33/168 (19%), Positives = 53/168 (31%), Gaps = 57/168 (33%)

Query: 35  DFDRR-----DIPMNKLVATTLEAKLFVFDLKTLHREKGF--AYLS---------EKAHK 78
           +F+       DI         L AKL   +  TL + K     Y++         +K   
Sbjct: 90  EFENCYLEGNDI-------HALAAKLLQENDTTLVKTKELIKNYITARIMAKRPFDKKSN 142

Query: 79  ATTVWAAKHLPQNRDLFVTCGGSGSLN--------LWQ----LVSAL--RADTNLCSIFR 124
           +    A      N  L    GG G+ +        L+Q    LV  L   +   L  + R
Sbjct: 143 SALFRAVGE--GNAQLVAIFGGQGNTDDYFEELRDLYQTYHVLVGDLIKFSAETLSELIR 200

Query: 125 YSSN----YP----------TQRTKPDID---DIPMGVPGSLTLLQEA 155
            + +    +                PD D    IP+  P  + ++Q A
Sbjct: 201 TTLDAEKVFTQGLNILEWLENPSNTPDKDYLLSIPISCPL-IGVIQLA 247


>3q7m_A Lipoprotein YFGL, BAMB; beta-propeller, BAM complex, outer membrane
           protein folding, negative, BAMA, protein binding; 1.65A
           {Escherichia coli} PDB: 3q7n_A 3q7o_A 3p1l_A 3prw_A
           2yh3_A
          Length = 376

 Score = 33.6 bits (77), Expect = 0.022
 Identities = 13/69 (18%), Positives = 24/69 (34%), Gaps = 7/69 (10%)

Query: 1   MVVAGYDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDR-RDIPM------NKLVATTLEA 53
             V G DNG +    ++   + W+  +       + DR  D+          + A     
Sbjct: 190 AAVVGGDNGRVSAVLMEQGQMIWQQRISQATGSTEIDRLSDVDTTPVVVNGVVFALAYNG 249

Query: 54  KLFVFDLKT 62
            L   DL++
Sbjct: 250 NLTALDLRS 258



 Score = 30.9 bits (70), Expect = 0.19
 Identities = 7/68 (10%), Positives = 20/68 (29%), Gaps = 6/68 (8%)

Query: 1   MVVAGYDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIPM------NKLVATTLEAK 54
           +V A    G +K  +       W  +L         +   +          +   + +A+
Sbjct: 55  VVYAADRAGLVKALNADDGKEIWSVSLAEKDGWFSKEPALLSGGVTVSGGHVYIGSEKAQ 114

Query: 55  LFVFDLKT 62
           ++  +   
Sbjct: 115 VYALNTSD 122



 Score = 27.5 bits (61), Expect = 2.2
 Identities = 7/65 (10%), Positives = 18/65 (27%), Gaps = 10/65 (15%)

Query: 1   MVVAGYDNGDLKMFDLKTMSLKWETNLKNGVCG---VDFDRRDIPMNKLVATTLEAKLFV 57
            V  G +   +   +    ++ W+T +         V           ++  T   +L  
Sbjct: 105 HVYIGSEKAQVYALNTSDGTVAWQTKVAGEALSRPVVSDGL-------VLIHTSNGQLQA 157

Query: 58  FDLKT 62
            +   
Sbjct: 158 LNEAD 162


>2xyi_A Probable histone-binding protein CAF1; transcription, repressor,
           phosphoprotein, WD-repeat; HET: PG4; 1.75A {Drosophila
           melanogaster} PDB: 3c99_A 3c9c_A 2yb8_B 2yba_A 2xu7_A*
           3gfc_A 3cfs_B 3cfv_B
          Length = 430

 Score = 32.0 bits (73), Expect = 0.092
 Identities = 16/122 (13%), Positives = 38/122 (31%), Gaps = 20/122 (16%)

Query: 1   MVVAGYDNGDLKMFDLKTMSLKWETNL-----------KNGVCGVDF--DRRDIPMNKLV 47
           ++     + D+ +FD      K E +            +    G+ +  +        L+
Sbjct: 143 VIATKTPSSDVLVFDYTKHPSKPEPSGECQPDLRLRGHQKEGYGLSWNPNLNGY----LL 198

Query: 48  ATTLEAKLFVFDLKTLHREKGFAYLSE--KAHKATTVWAAKHLPQNRDLFVTCGGSGSLN 105
           + + +  + ++D+    +E            H A  V        +  LF +      L 
Sbjct: 199 SASDDHTICLWDINATPKEHRVIDAKNIFTGHTAV-VEDVAWHLLHESLFGSVADDQKLM 257

Query: 106 LW 107
           +W
Sbjct: 258 IW 259



 Score = 27.8 bits (62), Expect = 2.2
 Identities = 14/96 (14%), Positives = 25/96 (26%), Gaps = 31/96 (32%)

Query: 81  TVWAAKHLPQNRDLFVTCGGSGSLNLWQLVSALRADTNLCSIFRYSSNYPTQRTKPDIDD 140
            V  A+++PQN  +  T   S  + ++                             D   
Sbjct: 130 EVNRARYMPQNACVIATKTPSSDVLVF-----------------------------DYTK 160

Query: 141 IPMGVPGSLTLLQEATLS--SQPINSLDWSADKLGL 174
            P     S     +  L    +    L W+ +  G 
Sbjct: 161 HPSKPEPSGECQPDLRLRGHQKEGYGLSWNPNLNGY 196


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 31.7 bits (71), Expect = 0.11
 Identities = 26/162 (16%), Positives = 49/162 (30%), Gaps = 35/162 (21%)

Query: 14  FDLKTMSLKW----ETNLKNGVCGVDFDRRDIP-MNKLVATTLEAKLFVFDLKTLHREKG 68
           F+      ++           V   +FD +D+  M K + +  E       + +     G
Sbjct: 9   FETGEHQYQYKDILSVFEDAFV--DNFDCKDVQDMPKSILSKEEIDHI---IMSKDAVSG 63

Query: 69  FAYLSEKAHKATTVWAAKHLPQNR-DLFVTCGGSGSLNL---WQLVSALRADTNLCSIFR 124
              L          W      +     FV       L +   + L+S ++ +    S+  
Sbjct: 64  TLRL---------FWTLLSKQEEMVQKFV----EEVLRINYKF-LMSPIKTEQRQPSMMT 109

Query: 125 YSSNYPTQRTKPDIDD---IPMGVP--GSLTLLQEATLSSQP 161
               Y  QR +   D+       V        L++A L  +P
Sbjct: 110 --RMYIEQRDRLYNDNQVFAKYNVSRLQPYLKLRQALLELRP 149



 Score = 31.0 bits (69), Expect = 0.24
 Identities = 23/152 (15%), Positives = 40/152 (26%), Gaps = 40/152 (26%)

Query: 6   YDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIP-MNKLVATTLEAKLFVFDLKTLH 64
            +      F+L               C +    R     + L A T           TL 
Sbjct: 254 QNAKAWNAFNLS--------------CKILLTTRFKQVTDFLSAATTTHISLDHHSMTLT 299

Query: 65  REKG---FA-YLSEKAHKATTVWAAKHLPQN--------RDLFVTCGGSGSLNLWQLVSA 112
            ++       YL  +             P+         RD   T      +N  +L + 
Sbjct: 300 PDEVKSLLLKYLDCRPQDLPRE-VLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTI 358

Query: 113 LRADTNL------------CSIFRYSSNYPTQ 132
           + +  N+             S+F  S++ PT 
Sbjct: 359 IESSLNVLEPAEYRKMFDRLSVFPPSAHIPTI 390


>3lrv_A PRE-mRNA-splicing factor 19; PRP19, WD40, E3 ubiquitin ligase,
           spliceosome, DNA damage, D repair, mRNA processing,
           nucleus; 2.60A {Saccharomyces cerevisiae}
          Length = 343

 Score = 30.1 bits (67), Expect = 0.29
 Identities = 11/66 (16%), Positives = 23/66 (34%), Gaps = 4/66 (6%)

Query: 42  PMNKLVATTLEAKLFVFDLKTLHREKGFAYLSEKAHKATTVWAAKHLPQNRDLFVTCGGS 101
           P N+L+      ++ + D KT    +      E       ++   H   N + F+     
Sbjct: 93  PCNRLLLLYPGNQITILDSKTNKVLREI----EVDSANEIIYMYGHNEVNTEYFIWADNR 148

Query: 102 GSLNLW 107
           G++   
Sbjct: 149 GTIGFQ 154


>2j04_B YDR362CP, TAU91; beta propeller, type 2 promoters, transcription,
           hypothetica protein, preinitiation complex, yeast RNA
           polymerase III; 3.2A {Saccharomyces cerevisiae}
          Length = 524

 Score = 30.2 bits (67), Expect = 0.37
 Identities = 18/109 (16%), Positives = 38/109 (34%), Gaps = 7/109 (6%)

Query: 1   MVVAGYDNGDLKMFDLKT--MSLKWETNLKNGVCGVDFDRRDIPMNKLVATTLEAKLFVF 58
            VV G+ NG +  FDL    +   ++    + +  V     D     +    ++   ++F
Sbjct: 279 TVVCGFKNGFVAEFDLTDPEVPSFYDQVHDSYILSVSTAYSDFEDTVVSTVAVDGYFYIF 338

Query: 59  DLKTLHREKGFAYLSEKAHKATTVWAAKHLPQNRDLFVTCGGSGSLNLW 107
           + K +   K            + +    + PQ    ++   G+ SL   
Sbjct: 339 NPKDIATTKTTVSRFRG----SNLVPVVYCPQIY-SYIYSDGASSLRAV 382


>1l0q_A Surface layer protein; SLP, S-layer, 7-bladed beta-propeller
           superfamily, protein binding; HET: YCM; 2.40A
           {Methanosarcina mazei} SCOP: b.1.3.1 b.69.2.3
          Length = 391

 Score = 29.7 bits (67), Expect = 0.51
 Identities = 19/104 (18%), Positives = 32/104 (30%), Gaps = 13/104 (12%)

Query: 3   VAGYDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIPMNKL--VATTLEAKLFVFDL 60
           VA   + D+ + D  T ++       +   GV       P  K   V     + L V D 
Sbjct: 48  VANAHSNDVSIIDTATNNVIATVPAGSSPQGVAVS----PDGKQVYVTNMASSTLSVIDT 103

Query: 61  KTLHREKGFAYLSEKAHKATTVWAAKHLPQNRDLFVTCGGSGSL 104
            +                A +       P  + L+VT  G  ++
Sbjct: 104 TSNTVAGTVKTGKSPLGLALS-------PDGKKLYVTNNGDKTV 140


>2hz6_A Endoplasmic reticulum to nucleus signalling 1 isoform 1 variant;
           triangular beta-sheet cluster, signaling protein; 3.10A
           {Homo sapiens}
          Length = 369

 Score = 28.1 bits (62), Expect = 1.6
 Identities = 8/65 (12%), Positives = 21/65 (32%), Gaps = 7/65 (10%)

Query: 1   MVVAGYDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIPM---NKLVATTLEAKLFV 57
           ++  G     + M+D KT  L+W     +          +  +        +  +  +  
Sbjct: 132 LLYLGRTEYTITMYDTKTRELRWNATYFDYA----ASLPEDDVDYKMSHFVSNGDGLVVT 187

Query: 58  FDLKT 62
            D ++
Sbjct: 188 VDSES 192


>3mbr_X Glutamine cyclotransferase; beta-propeller; 1.44A {Xanthomonas
           campestris}
          Length = 243

 Score = 27.9 bits (61), Expect = 1.7
 Identities = 14/69 (20%), Positives = 23/69 (33%), Gaps = 2/69 (2%)

Query: 6   YDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIPMNKLVATTLEAKLFVFDLKTLHR 65
                ++  DL+T  +     +     G         +  +  T    + FV+DL TL  
Sbjct: 40  TGRSSVRKVDLETGRILQRAEVPPPYFGAGIVAWRDRL--IQLTWRNHEGFVYDLATLTP 97

Query: 66  EKGFAYLSE 74
              F Y  E
Sbjct: 98  RARFRYPGE 106


>2iyf_A OLED, oleandomycin glycosyltransferase; antibiotic resistance,
           glycosylation, enzyme, macrolide, carbohydrate; HET: ERY
           UDP; 1.7A {Streptomyces antibioticus}
          Length = 430

 Score = 27.5 bits (61), Expect = 2.4
 Identities = 10/29 (34%), Positives = 12/29 (41%), Gaps = 5/29 (17%)

Query: 82  VWAAKHLPQNR-----DLFVTCGGSGSLN 105
           V     +PQ       DLFVT  G+G   
Sbjct: 285 VEVHDWVPQLAILRQADLFVTHAGAGGSQ 313


>2kyg_A CAMP-dependent protein kinase type II-alpha regul subunit;
           protein/protein, homodimer bound to monomer, protein
           binding; NMR {Homo sapiens}
          Length = 50

 Score = 25.2 bits (55), Expect = 2.7
 Identities = 7/30 (23%), Positives = 14/30 (46%)

Query: 141 IPMGVPGSLTLLQEATLSSQPINSLDWSAD 170
           IP G+   L       L  QP + ++++ +
Sbjct: 11  IPPGLTELLQGYTVEVLRQQPPDLVEFAVE 40


>3pfn_A NAD kinase; structural genomics consortium, SNP, SGC, transferase;
           2.70A {Homo sapiens}
          Length = 365

 Score = 27.4 bits (61), Expect = 2.8
 Identities = 17/96 (17%), Positives = 31/96 (32%), Gaps = 6/96 (6%)

Query: 15  DLKTMSLKWETNLKN-GVCGVDFDRRDIPMNKLVATTLEAK-LFVFDLKTLHREKGFAYL 72
           D  +  L W  + K+  V     D   +   K + T L  + + V+  K +  +   A  
Sbjct: 26  DPASQRLTWNKSPKSVLVIKKMRDASLLQPFKELCTHLMEENMIVYVEKKVLEDPAIASD 85

Query: 73  SEKAHKATTVWAAK----HLPQNRDLFVTCGGSGSL 104
                        +     +    D  +  GG G+L
Sbjct: 86  ESFGAVKKKFCTFREDYDDISNQIDFIICLGGDGTL 121


>3a9z_A Selenocysteine lyase; PLP, cytoplasm, pyridoxal phosphate,
           transferase; HET: PLP SLP; 1.55A {Rattus norvegicus}
           PDB: 3a9x_A* 3a9y_A* 3gzd_A* 3gzc_A* 2hdy_A*
          Length = 432

 Score = 27.2 bits (61), Expect = 3.0
 Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 5/50 (10%)

Query: 77  HKATTVWAAKHLPQNRDLFVTC----GGSGSLNLWQLVSALRADTNLCSI 122
           H +      +HL +++   VT       +G + +  +++A+R  T L +I
Sbjct: 133 HDSI-RLPLEHLVEDQVAEVTFVPVSKVNGQVEVEDILAAVRPTTCLVTI 181


>2uvk_A YJHT; unknown function, hypothetical protein, sialic acid
          metabolism, kelch repeat, beta-propeller; HET: MSE;
          1.50A {Escherichia coli}
          Length = 357

 Score = 27.2 bits (60), Expect = 3.2
 Identities = 10/73 (13%), Positives = 21/73 (28%), Gaps = 9/73 (12%)

Query: 2  VVAGYDNGDLKMFDLKTMSLKWETNLKNGVCGVDFDRRDIPMNKLVATTLEAKLFVFDLK 61
          +  G         D +    KW            F     P ++  +  ++  L+VF   
Sbjct: 24 IGLGSAGTAWYKLDTQAKDKKWTA-------LAAFP--GGPRDQATSAFIDGNLYVFGGI 74

Query: 62 TLHREKGFAYLSE 74
            + E      ++
Sbjct: 75 GKNSEGLTQVFND 87


>3ia7_A CALG4; glycosysltransferase, calicheamicin, enediyne, transf; 1.91A
           {Micromonospora echinospora}
          Length = 402

 Score = 26.6 bits (59), Expect = 4.3
 Identities = 8/39 (20%), Positives = 15/39 (38%), Gaps = 10/39 (25%)

Query: 82  VWAAKHLPQNR-----DLFVTCGGSGSLNLWQLVSALRA 115
           V A + +P +         +T G +G++       A  A
Sbjct: 283 VEAHQWIPFHSVLAHARACLTHGTTGAVL-----EAFAA 316


>2iya_A OLEI, oleandomycin glycosyltransferase; carbohydrate,
           glycosylation, enzyme, macrolide; HET: UDP ZIO; 1.7A
           {Streptomyces antibioticus}
          Length = 424

 Score = 26.6 bits (59), Expect = 5.0
 Identities = 10/39 (25%), Positives = 13/39 (33%), Gaps = 10/39 (25%)

Query: 82  VWAAKHLPQNR-----DLFVTCGGSGSLNLWQLVSALRA 115
           V   + +PQ         F+T  G GS        AL  
Sbjct: 307 VEVHQWVPQLDILTKASAFITHAGMGSTM-----EALSN 340


>3rsc_A CALG2; TDP, enediyne, structural genomics, PSI-2, protein structure
           initiative, center for eukaryotic structural genomics;
           HET: TYD C0T; 2.19A {Micromonospora echinospora} PDB:
           3iaa_A*
          Length = 415

 Score = 26.2 bits (58), Expect = 7.1
 Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 10/39 (25%)

Query: 82  VWAAKHLPQNR-----DLFVTCGGSGSLNLWQLVSALRA 115
           V A + +P  +      + VT GG G+L       AL  
Sbjct: 299 VEAHRWVPHVKVLEQATVCVTHGGMGTLM-----EALYW 332


>1jmx_B Amine dehydrogenase; oxidoreductase; HET: TRQ HEC; 1.90A
           {Pseudomonas putida} SCOP: b.69.2.2 PDB: 1jmz_B*
          Length = 349

 Score = 25.9 bits (56), Expect = 8.9
 Identities = 19/118 (16%), Positives = 41/118 (34%), Gaps = 9/118 (7%)

Query: 1   MVVAGYDNGDLKMFDLKTMSLKWETNLKNGV--CGVDFDRRDIPMNKL--VATTLEAKLF 56
           M+V  Y N +L + D+ + ++     + +              P N+   V       ++
Sbjct: 14  MIVTNYPN-NLHVVDVASDTVYKSCVMPDKFGPGTAMMA----PDNRTAYVLNNHYGDIY 68

Query: 57  VFDLKTLHREKGFAYLSEKAHKATTVWAAKHLPQNRDLFVTCGGSGSLNLWQLVSALR 114
             DL T          S       ++++    P  ++++ T   +  LN   +V   R
Sbjct: 69  GIDLDTCKNTFHANLSSVPGEVGRSMYSFAISPDGKEVYATVNPTQRLNDHYVVKPPR 126


>3otg_A CALG1; calicheamicin, TDP, structural genomics, PSI-2, protein STRU
           initiative, center for eukaryotic structural genomics,
           CESG fold; HET: TYD; 2.08A {Micromonospora echinospora}
           PDB: 3oth_A*
          Length = 412

 Score = 25.8 bits (57), Expect = 9.0
 Identities = 13/39 (33%), Positives = 15/39 (38%), Gaps = 10/39 (25%)

Query: 82  VWAAKHLPQNR-----DLFVTCGGSGSLNLWQLVSALRA 115
           V     +PQ       DL V  GGSG+        AL A
Sbjct: 294 VRLESWVPQAALLPHVDLVVHHGGSGTTL-----GALGA 327


>3nl6_A Thiamine biosynthetic bifunctional enzyme; thiamin biosynthesis,
           eukaryoyes, transferase; HET: TPS ACP; 2.61A {Candida
           glabrata} PDB: 3nl2_A* 3nl5_A* 3nl3_A* 3nm3_A* 3nm1_A*
          Length = 540

 Score = 26.0 bits (57), Expect = 9.3
 Identities = 9/33 (27%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 129 YPTQRTKPDIDDIPMGVPGSLTLLQEATLSSQP 161
           +PT  TK +    PMG  G++ +L     ++  
Sbjct: 141 FPTL-TKKNPKKAPMGTAGAIRVLDALERNNAH 172


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.319    0.134    0.408 

Gapped
Lambda     K      H
   0.267   0.0693    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 2,605,881
Number of extensions: 139551
Number of successful extensions: 392
Number of sequences better than 10.0: 1
Number of HSP's gapped: 387
Number of HSP's successfully gapped: 37
Length of query: 174
Length of database: 6,701,793
Length adjustment: 87
Effective length of query: 87
Effective length of database: 4,272,666
Effective search space: 371721942
Effective search space used: 371721942
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (24.7 bits)