RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy16696
         (296 letters)



>3bk6_A PH stomatin; archaea, trimer, coiled- coil, flotillin, SPFH,
           membrane fusion, trafficking, transmembrane, membrane
           protein; 3.20A {Pyrococcus horikoshii}
          Length = 188

 Score =  164 bits (417), Expect = 1e-50
 Identities = 60/161 (37%), Positives = 96/161 (59%), Gaps = 5/161 (3%)

Query: 98  IIDKIKYVQSLKEIAIDVPQQSAITSDNVTLSIDGVLYLKINDPYLASYGVEDPEFAITQ 157
           I +K   V  L+   +DVP Q  IT DNV + ++ V+Y ++ DP  A   V++   A +Q
Sbjct: 2   IFEKAVIV-DLRTQVLDVPVQETITKDNVPVRVNAVVYFRVVDPVKAVTQVKNYIMATSQ 60

Query: 158 LAQTTMRSELGKISLDKVFRERESLNESIVHAINNASEAWGITCLRYEIRDIKLPSRVNE 217
           ++QTT+RS +G+  LD++  ER+ LN  +   I+ A++ WGI     EI+D++LP+ + +
Sbjct: 61  ISQTTLRSVIGQAHLDELLSERDKLNMQLQRIIDEATDPWGIKVTAVEIKDVELPAGMQK 120

Query: 218 AMQMQVEAERKKRAAILESEGIRAADINVAEGKRQAKILEA 258
           AM  Q EAER++RA I  +E  R A    AE  R+A  + +
Sbjct: 121 AMARQAEAERERRARITLAEAERQA----AEKLREAAEIIS 157


>2rpb_A Hypothetical membrane protein; SPFH domain; NMR {Pyrococcus
           horikoshii}
          Length = 113

 Score =  151 bits (383), Expect = 2e-46
 Identities = 39/106 (36%), Positives = 62/106 (58%)

Query: 107 SLKEIAIDVPQQSAITSDNVTLSIDGVLYLKINDPYLASYGVEDPEFAITQLAQTTMRSE 166
            L+E  IDVP Q  I  DNV +++D V+Y ++ DP  A Y V D   AI +LAQT +R+ 
Sbjct: 6   DLREHVIDVPPQEVICKDNVVVTVDAVVYYQVIDPVKAVYNVSDFLMAIVKLAQTNLRAI 65

Query: 167 LGKISLDKVFRERESLNESIVHAINNASEAWGITCLRYEIRDIKLP 212
           +G++ LD+    R+ +N  +   ++  ++ WG+   R EI+ I  P
Sbjct: 66  IGEMELDETLSGRDIINARLREELDKITDRWGVKITRVEIQRIDPP 111


>4fvg_A Stomatin; mixed alpha-beta fold, membrane scaffold, membrane
           protein; 1.80A {Mus musculus} PDB: 4fvj_A 4fvf_A
          Length = 133

 Score =  149 bits (379), Expect = 1e-45
 Identities = 45/132 (34%), Positives = 79/132 (59%), Gaps = 1/132 (0%)

Query: 94  ILIPIIDKIKYVQSLKEIAIDVPQQSAITSDNVTLSIDGVLYLKINDPYLASYGVEDPEF 153
           +  P  D    V  ++ I+ D+P Q  +T D+VT+S+DGV+Y ++ +  LA   + + + 
Sbjct: 3   LGSPSTDSAAKV-DMRTISFDIPPQEVLTKDSVTISVDGVVYYRVQNATLAVANITNADS 61

Query: 154 AITQLAQTTMRSELGKISLDKVFRERESLNESIVHAINNASEAWGITCLRYEIRDIKLPS 213
           A   LAQTT+R+ LG  +L ++  +RE +   +   +++A++ WGI   R EI+D+KLP 
Sbjct: 62  ATRLLAQTTLRNALGTKNLSQILSDREEIAHHMQSTLDDATDDWGIKVERVEIKDVKLPV 121

Query: 214 RVNEAMQMQVEA 225
           ++  AM  + EA
Sbjct: 122 QLQRAMAAEAEA 133


>1win_A Flotillin 2; BAND 7 domain, structural genomics, riken structural
           genomics/proteomics initiative, RSGI, cell adhesion; NMR
           {Mus musculus} SCOP: d.43.2.1
          Length = 143

 Score =  102 bits (255), Expect = 3e-27
 Identities = 22/131 (16%), Positives = 54/131 (41%), Gaps = 9/131 (6%)

Query: 107 SLKEIAIDVPQQSAITSDNVTLSIDGVLYLKINDPY---------LASYGVEDPEFAITQ 157
           SL+ + +    +   T++ V L++ GV  +KI                  V+D +  + Q
Sbjct: 11  SLEIMTLQPRCEDVETAEGVALTVTGVAQVKIMTEKELLAVACEQFLGKNVQDIKNVVLQ 70

Query: 158 LAQTTMRSELGKISLDKVFRERESLNESIVHAINNASEAWGITCLRYEIRDIKLPSRVNE 217
             +  +RS LG +++++++++R+   + +           GI  L + I+D+        
Sbjct: 71  TLEGHLRSILGTLTVEQIYQDRDQFAKLVREVAAPDVGRMGIEILSFTIKDVYDKVDYLS 130

Query: 218 AMQMQVEAERK 228
           ++     +   
Sbjct: 131 SLGKTQTSGPS 141


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 55.6 bits (133), Expect = 7e-09
 Identities = 46/370 (12%), Positives = 106/370 (28%), Gaps = 125/370 (33%)

Query: 2   CTTPVATALLSVAEA-------------------RAKSLDLVAKALETKDGRSAASLSIA 42
           C   + T    V +                      +   L+ K L+ +       L   
Sbjct: 266 CKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCR----PQDLP-- 319

Query: 43  EQYVKAFNELAKTNNTLIVPSDANNIASMV------TQAWVVERMGKYHRILEPGLNILI 96
                       T N   +    + IA  +         W      K   I+E  LN+L 
Sbjct: 320 --------REVLTTNPRRL----SIIAESIRDGLATWDNWKHVNCDKLTTIIESSLNVLE 367

Query: 97  PIIDKIKYVQSLKEIAI---DVPQQSAITSDNVTLSIDGVLYLKINDPYLASYGVEDPEF 153
           P   +  +      +++          I +    LS+     +  +          D   
Sbjct: 368 PAEYRKMF----DRLSVFPPSAH----IPTI--LLSL-----IWFDVIK------SDVMV 406

Query: 154 AITQLAQTTM---RSELGKISLDKVFRE--RESLNESIVHA-------INNASEAWGITC 201
            + +L + ++   + +   IS+  ++ E   +  NE  +H        I    ++  +  
Sbjct: 407 VVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALHRSIVDHYNIPKTFDSDDLIP 466

Query: 202 LR----------YEIRDIKLPSRVNEA---------MQMQVEAE---------------- 226
                       + +++I+ P R+            ++ ++  +                
Sbjct: 467 PYLDQYFYSHIGHHLKNIEHPERMTLFRMVFLDFRFLEQKIRHDSTAWNASGSILNTLQQ 526

Query: 227 -RKKRAAILESEGIRAADINVAEG--KRQAKILEAAKYLCKLLIKRAAILESEGIRAADI 283
            +  +  I +++      +N       +  + L  +KY    L++ A + E E I     
Sbjct: 527 LKFYKPYICDNDPKYERLVNAILDFLPKIEENLICSKYTD--LLRIALMAEDEAI----- 579

Query: 284 NVAEGKRQAK 293
              E  +Q +
Sbjct: 580 -FEEAHKQVQ 588



 Score = 49.1 bits (116), Expect = 1e-06
 Identities = 35/275 (12%), Positives = 76/275 (27%), Gaps = 58/275 (21%)

Query: 21  LDLVAKALETKDGRSAASLSIAEQYVKAFNELAKTNNTLIVPSDANNIASMVTQAWVVER 80
            D      + KD          +   K+     + ++ ++     +    +    W +  
Sbjct: 26  EDAFVDNFDCKD---------VQDMPKSILSKEEIDHIIMSKDAVSGTLRLF---WTLLS 73

Query: 81  MGK--YHRILEPGLNI-------------LIPIIDKIKYVQSLKEIAIDVPQQSAITSDN 125
             +    + +E  L I               P +    Y++    +  D          N
Sbjct: 74  KQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYNDNQ---VFAKYN 130

Query: 126 VT-----LSIDGVLY-LKINDPYLASYGVEDPEFAITQLAQTTMRSELGKISLD-KVF-- 176
           V+     L +   L  L+     +   GV       T +A     S   +  +D K+F  
Sbjct: 131 VSRLQPYLKLRQALLELR-PAKNVLIDGV--LGSGKTWVALDVCLSYKVQCKMDFKIFWL 187

Query: 177 --RERESLNESIVHAINNASEAWGITCLRYEIRDIKLPSRVNEAMQMQVEAERKKRAAIL 234
             +   S  E+++  +            + +          +         + + R  + 
Sbjct: 188 NLKNCNS-PETVLEMLQK-------LLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLK 239

Query: 235 ESEGIRAADI--NVAEGKRQAKILEAAKYLCKLLI 267
                    +  NV    + AK   A    CK+L+
Sbjct: 240 SKPYENCLLVLLNV----QNAKAWNAFNLSCKILL 270


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 49.7 bits (118), Expect = 8e-07
 Identities = 49/337 (14%), Positives = 108/337 (32%), Gaps = 98/337 (29%)

Query: 23  LVAKALETKDGRSAASLSIAEQYVKAFN----ELAKTNNTLIVPSDANNIASMVTQ---- 74
           L AK L+  D     +  + + Y+ A         K +N+ +  +     A +V      
Sbjct: 104 LAAKLLQENDTTLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEGNAQLVAIFGGQ 163

Query: 75  ----AWVVERMGKYHRILEPGLNILIPIIDKIKYVQSLKEIAIDVPQQSAITSDNVTLSI 130
                +  E +   ++       ++  +I      ++L E+         + +      +
Sbjct: 164 GNTDDYFEE-LRDLYQTYHV---LVGDLIKFS--AETLSELIRTTLDAEKVFTQ----GL 213

Query: 131 DGVLYLKI-----NDPYLASYGVEDPEFAITQLAQ--TTMRSELGKISLDKVFRERESL- 182
           + + +L+      +  YL S  +  P   + QLA    T +  LG  +  ++    +   
Sbjct: 214 NILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYVVTAKL-LG-FTPGELRSYLKGAT 271

Query: 183 --NESIVHAI------------NNASEA-----W-GITCLR-YEIRDIKLPSRVNEAMQ- 220
             ++ +V A+             +  +A     + G+ C   Y    +  PS + ++++ 
Sbjct: 272 GHSQGLVTAVAIAETDSWESFFVSVRKAITVLFFIGVRCYEAYPNTSLP-PSILEDSLEN 330

Query: 221 -------M---------QVEAERKKRAAIL-ESEGIRAADINVAEGKRQ------AKILE 257
                  M         QV+    K  + L   + +  + +N   G +        + L 
Sbjct: 331 NEGVPSPMLSISNLTQEQVQDYVNKTNSHLPAGKQVEISLVN---GAKNLVVSGPPQSLY 387

Query: 258 AAKYLCKLLIKRAAILESEGIRAADINVAEGKRQAKI 294
               L   L K  A                G  Q++I
Sbjct: 388 G---LNLTLRKAKA--------------PSGLDQSRI 407



 Score = 41.6 bits (97), Expect = 3e-04
 Identities = 28/159 (17%), Positives = 55/159 (34%), Gaps = 61/159 (38%)

Query: 8    TALLSVAEA------------RAKSLDLVAKALE-TKDGRS----AA------SLSIAEQ 44
             AL S+A+             R      +  A+   + GRS     A      + S +++
Sbjct: 1768 AALASLADVMSIESLVEVVFYRGM---TMQVAVPRDELGRSNYGMIAINPGRVAASFSQE 1824

Query: 45   YVKAFNE-LAKTNNTLIVPSDAN-NIASMVTQAWVVERMGKYHRI---LEPGLNILIPII 99
             ++   E + K    L+     N N+ +   Q   V            L   L+ +  ++
Sbjct: 1825 ALQYVVERVGKRTGWLVEI--VNYNVENQ--Q--YV--------AAGDLR-ALDTVTNVL 1869

Query: 100  DKIKYVQSLKEIAIDVPQQSAITSDNVTLSIDGV---LY 135
            + IK +Q +  I +             +LS++ V   L+
Sbjct: 1870 NFIK-LQKIDIIEL-----------QKSLSLEEVEGHLF 1896



 Score = 30.0 bits (67), Expect = 1.1
 Identities = 30/149 (20%), Positives = 58/149 (38%), Gaps = 31/149 (20%)

Query: 157  QLAQTT-MRSELGKISLDKVFRE---------RESLNESIVHAI-NNASEAWGITCLRYE 205
            Q +Q   M  +L K S  K  ++         +++   SI+  + NN      +T     
Sbjct: 1624 QGSQEQGMGMDLYKTS--KAAQDVWNRADNHFKDTYGFSILDIVINNPVN---LTIHFGG 1678

Query: 206  IRDIKLPSRVN-EAMQMQVEAERKKRA-AILESEGIRAADINVAEGKRQAKILEAAKY-- 261
             +  ++  R N  AM  +   + K +   I +     +          +  +L A ++  
Sbjct: 1679 EKGKRI--RENYSAMIFETIVDGKLKTEKIFKEINEHSTSY-TFRS--EKGLLSATQFTQ 1733

Query: 262  --LCKLLIKRAA--ILESEGIRAADINVA 286
              L   L+++AA   L+S+G+  AD   A
Sbjct: 1734 PAL--TLMEKAAFEDLKSKGLIPADATFA 1760


>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
           vesicle, membrane, Ca structural protein; 7.94A {Bos
           taurus}
          Length = 190

 Score = 31.7 bits (71), Expect = 0.16
 Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 6/48 (12%)

Query: 215 VNEAMQMQVEAE--RKKRAAILESEGIRAADINVAEGKRQAKILEAAK 260
           + +A ++  E E  RK R    E +  R  +++ A    + +  E AK
Sbjct: 74  IAQADRLTQEPESIRKWR----EEQRKRLQELDAASKVMEQEWREKAK 117


>3ihj_A Alanine aminotransferase 2; helix, structural genomics, structural
           genomics consortium, pyridoxal phosphate; HET: PLP;
           2.30A {Homo sapiens}
          Length = 498

 Score = 30.9 bits (70), Expect = 0.47
 Identities = 23/146 (15%), Positives = 48/146 (32%), Gaps = 23/146 (15%)

Query: 110 EIAIDVPQQSAITSDNVTLSIDGVLYLKINDPYLASYGVEDPEFAITQLAQTTMRSELGK 169
            I +   +        +      V+   I D    + G +   F    +A  T  + L  
Sbjct: 42  PIVLKAGEIELELQRGIKKPFTEVIRANIGDA--QAMGQQPITFLRQVMALCTYPNLLDS 99

Query: 170 --ISLDKVFRERESLNESIVHAINNASEAWGITCLRYEI------RDIKLPSRVNE---- 217
                D   R R  L     +++ + S + G+ C+R ++      RD  +P+  +     
Sbjct: 100 PSFPEDAKKRARRILQACGGNSLGSYSASQGVNCIREDVAAYITRRDGGVPADPDNIYLT 159

Query: 218 ---------AMQMQVEAERKKRAAIL 234
                     +++ V    K R  ++
Sbjct: 160 TGASDGISTILKILVSGGGKSRTGVM 185


>3mn8_A LP15968P; catalytic domain of alpha/beta-hydrolase fold,
           C-terminal, A transthyretin-like domain, hydrolase; HET:
           NAG GEM; 2.70A {Drosophila melanogaster}
          Length = 435

 Score = 29.9 bits (67), Expect = 1.1
 Identities = 13/65 (20%), Positives = 21/65 (32%), Gaps = 2/65 (3%)

Query: 82  GKYHRILEPGLNILIPIIDKIKYVQSLKEIAIDVPQQSAITSDNVTLSIDGVLYLKINDP 141
           G+Y R+L PGL  +        Y  S  +              +  L+     +  I+  
Sbjct: 372 GEYWRLLTPGLYSV--HASAFGYQTSAPQQVRVTNDNQEALRLDFKLAPVETNFDGISSF 429

Query: 142 YLASY 146
           Y   Y
Sbjct: 430 YSPYY 434


>2zuo_A MVP, major vault protein; repeat domains, protein-protein complex,
           cytoplasm, ribonucleoprotein, structural protein; 3.50A
           {Rattus norvegicus} PDB: 2zv4_N 2zv5_a 2qzv_A
          Length = 861

 Score = 29.4 bits (65), Expect = 1.7
 Identities = 18/82 (21%), Positives = 35/82 (42%), Gaps = 9/82 (10%)

Query: 217 EAMQMQVEAERKKRAAILESEGIRAADINVAEGKRQAKI--LEAAKYLCKLLIKRAAILE 274
           +A+ ++ EAE ++   + E E I A      E  +  ++  +EA K+       +     
Sbjct: 749 QALAIETEAELERVKKVREMELIYARAQLELEVSKAQQLANVEAKKF-------KEMTEA 801

Query: 275 SEGIRAADINVAEGKRQAKILE 296
                  D+ VA  + Q K+L+
Sbjct: 802 LGPGTIRDLAVAGPEMQVKLLQ 823



 Score = 27.9 bits (61), Expect = 5.7
 Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 5/81 (6%)

Query: 214 RVNEAMQMQVEAERKKRAAILESEGIRAADINVAEGKRQAKILEAAKYLCKLLIKRAAIL 273
            V      + EAE +  AA +E E      +  A+ K QA  +E    L ++   R   L
Sbjct: 715 AVESTGNAKAEAESRAEAARIEGE----GSVLQAKLKAQALAIETEAELERVKKVREMEL 770

Query: 274 ESEGIRAADINVAEGKRQAKI 294
                   ++ V++ ++ A +
Sbjct: 771 IYA-RAQLELEVSKAQQLANV 790


>1n71_A AAC(6')-II; aminoglycoside 6'-N-acetyltransferase, antibiotic
           resistance, coenzyme A; HET: COA; 1.80A {Enterococcus
           faecium} SCOP: d.108.1.1 PDB: 2a4n_A* 1b87_A*
          Length = 180

 Score = 28.2 bits (62), Expect = 2.6
 Identities = 14/59 (23%), Positives = 24/59 (40%)

Query: 6   VATALLSVAEARAKSLDLVAKALETKDGRSAASLSIAEQYVKAFNELAKTNNTLIVPSD 64
           + T L++  E    S   +   L T D     +LS  + Y   F+++A   N    P +
Sbjct: 87  IGTRLVNYLEKEVASRGGITIYLGTDDLDHGTTLSQTDLYEHTFDKVASIQNLREHPYE 145


>3tcm_A Alanine aminotransferase 2; pyridoxal phosphate (PLP)-binding; HET:
           DCS; 2.71A {Hordeum vulgare}
          Length = 500

 Score = 28.6 bits (64), Expect = 2.9
 Identities = 23/95 (24%), Positives = 35/95 (36%), Gaps = 8/95 (8%)

Query: 118 QSAITSDNVTLSIDGVLYLKINDPYLASYGVEDPEFAITQLAQTTMRSELGKISLDKVF- 176
           Q  + +   +L  D +LY  I +P   S G +   F    LA       L +  +  +F 
Sbjct: 51  QEQLKTQPGSLPFDEILYCNIGNP--QSLGQQPVTFFREVLALCDHPDLLQREEIKTLFS 108

Query: 177 -----RERESLNESIVHAINNASEAWGITCLRYEI 206
                R ++ L      A    S + GI  LR  I
Sbjct: 109 ADSISRAKQILAMIPGRATGAYSHSQGIHGLRDAI 143


>1gp6_A Leucoanthocyanidin dioxygenase; 2-oxoglutarate dependent
           dioxygenase, flavonoid biosynthesis; HET: MES QUE DH2;
           1.75A {Arabidopsis thaliana} SCOP: b.82.2.1 PDB: 1gp5_A*
           1gp4_A* 2brt_A*
          Length = 356

 Score = 27.2 bits (61), Expect = 7.2
 Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 170 ISLDKVFRERESLNESIVHAINNASEAWGI 199
           I L  +  + E + E+ +  +  AS  WG+
Sbjct: 50  IDLKNIESDDEKIRENCIEELKKASLDWGV 79


>3sfw_A Dihydropyrimidinase; hydrolase, zinc binding; HET: KCX; 1.73A
           {Brevibacillus agri} PDB: 1yny_A 1k1d_A*
          Length = 461

 Score = 27.1 bits (61), Expect = 7.7
 Identities = 5/26 (19%), Positives = 11/26 (42%)

Query: 264 KLLIKRAAILESEGIRAADINVAEGK 289
           K  I+   ++ +     AD+ +   K
Sbjct: 2   KKWIRNGTVVTASDTYQADVLIDGEK 27


>4ubp_C Protein (urease (chain C)); bacillus pasteurii, nickel,
           acetohydroxamic acid, metalloenzyme, hydrolase; HET:
           KCX; 1.55A {Sporosarcina pasteurii} SCOP: b.92.1.1
           c.1.9.2 PDB: 1s3t_C* 2ubp_C* 3ubp_C* 1ubp_C* 1ie7_C*
          Length = 570

 Score = 27.3 bits (60), Expect = 8.1
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 248 EGKRQAKILEAAKYLCKLLIKRAAILESEGIRAADINVAEGK 289
           EG  +       + +  LL+  A IL+  GI  ADI V +G 
Sbjct: 52  EGMGENGTYTRTENVLDLLLTNALILDYTGIYKADIGVKDGY 93


>3nqb_A Adenine deaminase 2; PSI-II, structural genomics, protein structure
           initiative, N SGX research center for structural
           genomics, nysgxrc; 2.21A {Agrobacterium tumefaciens}
           PDB: 3t81_A 3t8l_A
          Length = 608

 Score = 26.9 bits (59), Expect = 8.5
 Identities = 14/73 (19%), Positives = 26/73 (35%), Gaps = 12/73 (16%)

Query: 219 MQMQVEAERKKRAAILESEGIRAADINVAEGKRQAKILEAAKYLCKLLIKRAAIL--ESE 276
           M +     R    A L  + +RA  +  A G ++            +LI    ++   + 
Sbjct: 1   MSLMTAQIRLAEPADLNDDTLRARAVAAARGDQRF----------DVLITGGTLVDVVTG 50

Query: 277 GIRAADINVAEGK 289
            +R ADI +    
Sbjct: 51  ELRPADIGIVGAL 63


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.315    0.130    0.346 

Gapped
Lambda     K      H
   0.267   0.0538    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,356,019
Number of extensions: 267457
Number of successful extensions: 640
Number of sequences better than 10.0: 1
Number of HSP's gapped: 629
Number of HSP's successfully gapped: 37
Length of query: 296
Length of database: 6,701,793
Length adjustment: 93
Effective length of query: 203
Effective length of database: 4,105,140
Effective search space: 833343420
Effective search space used: 833343420
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (26.2 bits)