RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14093
         (857 letters)



>gnl|CDD|239798 cd04270, ZnMc_TACE_like,  Zinc-dependent metalloprotease; TACE_like
           subfamily. TACE, the tumor-necrosis factor-alpha
           converting enzyme, releases soluble TNF-alpha from
           transmembrane pro-TNF-alpha.
          Length = 244

 Score =  274 bits (702), Expect = 4e-86
 Identities = 101/211 (47%), Positives = 123/211 (58%), Gaps = 22/211 (10%)

Query: 189 ISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDV 248
           IS IDRV  IY +T W       G++G+GF IK+I +H+    V  G   YN     W V
Sbjct: 30  ISHIDRVDDIYRNTDWDGG----GFKGIGFQIKRIRIHTTPDEVDPGNKFYNKSFPNWGV 85

Query: 249 RNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTL 308
              L     E    D CLAHLFT   F+ G LGLAYVGSPR NS GGIC   Y       
Sbjct: 86  EKFLVKLLLEQFSDDVCLAHLFTYRDFDMGTLGLAYVGSPRDNSAGGICEKAY------- 138

Query: 309 YLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDM 368
                     Y+ NG   YLN+GL+++ N YG+RV T+E+DLVTAHE GHN+GS HDPD+
Sbjct: 139 ----------YYSNGKKKYLNTGLTTTVN-YGKRVPTKESDLVTAHELGHNFGSPHDPDI 187

Query: 369 PECSPSASQGGSYLMYTYSVSGYDVNNKTFS 399
            EC+P  SQGG+Y+MY  + SG   NNK FS
Sbjct: 188 AECAPGESQGGNYIMYARATSGDKENNKKFS 218



 Score =  125 bits (317), Expect = 2e-32
 Identities = 45/99 (45%), Positives = 58/99 (58%), Gaps = 9/99 (9%)

Query: 95  TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFV 154
             C LLLVAD+RFY+ MG    +TTINYLIS IDRV  IY +T W       G++G+GF 
Sbjct: 1   NTCKLLLVADHRFYKYMGRGEEETTINYLISHIDRVDDIYRNTDWDGG----GFKGIGFQ 56

Query: 155 IKKIVVHSDATRVRQGEAHY-----NMIRDKWDVRNLLE 188
           IK+I +H+    V  G   Y     N   +K+ V+ LLE
Sbjct: 57  IKRIRIHTTPDEVDPGNKFYNKSFPNWGVEKFLVKLLLE 95



 Score = 93.6 bits (233), Expect = 3e-21
 Identities = 42/105 (40%), Positives = 56/105 (53%), Gaps = 14/105 (13%)

Query: 552 DVNGYRGMGFVIKKIVVHSDATRVRQGEAHY-----NMIRDKWDVRNLLETFSSHVDGTN 606
           D  G++G+GF IK+I +H+    V  G   Y     N   +K+ V+ LLE FS      +
Sbjct: 46  DGGGFKGIGFQIKRIRIHTTPDEVDPGNKFYNKSFPNWGVEKFLVKLLLEQFSD-----D 100

Query: 607 FCLAHLFTHQSFWSRGSSILGLAYIASPRPYSIGGSIHENLKYFD 651
            CLAHLFT++ F       LGLAY+ SPR  S GG I E   Y+ 
Sbjct: 101 VCLAHLFTYRDF---DMGTLGLAYVGSPRDNSAGG-ICEKAYYYS 141



 Score = 45.8 bits (109), Expect = 3e-05
 Identities = 23/75 (30%), Positives = 32/75 (42%), Gaps = 6/75 (8%)

Query: 367 DMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFTHQSFWSRGSSIL 426
             P+     ++  +     + V  + V         D    CLAHLFT++ F       L
Sbjct: 64  TTPDEVDPGNKFYNKSFPNWGVEKFLVKLLLEQFSDD---VCLAHLFTYRDF---DMGTL 117

Query: 427 GLAYIASPRPYSIGG 441
           GLAY+ SPR  S GG
Sbjct: 118 GLAYVGSPRDNSAGG 132



 Score = 38.1 bits (89), Expect = 0.012
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 688 LRSIRKVLEAKSGKCFSEPEESF 710
            +SI KVLE KS  CF E  +SF
Sbjct: 222 KKSISKVLEVKSNSCFVERSQSF 244


>gnl|CDD|240257 PTZ00073, PTZ00073, 60S ribosomal protein L37; Provisional.
          Length = 91

 Score =  140 bits (354), Expect = 2e-39
 Identities = 53/74 (71%), Positives = 61/74 (82%)

Query: 779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLRHYNWSVKAQRRKTTG 838
            TKGT SFGKR  KTHTLCRRCGK S+H+QKK+CA CGYP  ++R YNWSVKA+RR+TTG
Sbjct: 1   MTKGTGSFGKRHGKTHTLCRRCGKRSFHVQKKRCASCGYPSAKMRRYNWSVKAKRRRTTG 60

Query: 839 TGRMRHLKIVRRRF 852
           TGR R+LK V RR 
Sbjct: 61  TGRCRYLKTVARRA 74


>gnl|CDD|145202 pfam01907, Ribosomal_L37e, Ribosomal protein L37e.  This family
           includes ribosomal protein L37 from eukaryotes and
           archaebacteria. The family contains many conserved
           cysteines and histidines suggesting that this protein
           may bind to zinc.
          Length = 54

 Score =  105 bits (263), Expect = 2e-27
 Identities = 38/54 (70%), Positives = 44/54 (81%)

Query: 780 TKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLRHYNWSVKAQR 833
           TKGT SFGKR  KTHTLCRRCG+ SYH+QKK+CA CG+P  + R YNWS KA+R
Sbjct: 1   TKGTPSFGKRNKKTHTLCRRCGRRSYHVQKKRCASCGFPRAKRRRYNWSKKAKR 54


>gnl|CDD|225037 COG2126, RPL37A, Ribosomal protein L37E [Translation, ribosomal
           structure and biogenesis].
          Length = 61

 Score = 89.8 bits (223), Expect = 4e-22
 Identities = 36/60 (60%), Positives = 43/60 (71%), Gaps = 1/60 (1%)

Query: 780 TKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPC-PRLRHYNWSVKAQRRKTTG 838
           +KGT SFGKR  KTH  CRRCG+ SYH++KK CA CG+    R+R YNW  K +RRKT G
Sbjct: 2   SKGTPSFGKRNKKTHIRCRRCGRRSYHVRKKYCAACGFGRSARMRSYNWQAKKKRRKTIG 61


>gnl|CDD|222320 pfam13688, Peptidase_M84, Metallo-peptidase family M84. 
          Length = 193

 Score = 84.8 bits (210), Expect = 1e-18
 Identities = 39/222 (17%), Positives = 62/222 (27%), Gaps = 69/222 (31%)

Query: 189 ISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDV 248
           I+L++    +Y                +   +  + +           +         + 
Sbjct: 30  INLVNTASNVYEREF-----------NISLGLVNLTISDSTCPYTPPASS------SGNA 72

Query: 249 RNLLEVF----SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKN 304
            +LL  F    +      D  LA+         G  GLA++G    +   G  + +   +
Sbjct: 73  SDLLSRFQAFSAWRGRQND-DLAYWTLMTNCSTG--GLAWLGQLCNSGSAGSVSNQVSGS 129

Query: 305 GYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEH 364
                                                 V T     V AHE GHN+G+ H
Sbjct: 130 ----------------------------------ANVVVGTATEWQVFAHEIGHNFGAVH 155

Query: 365 DPDM----PECSPSAS---QGGSYLMYTYSVSGYDVNNKTFS 399
           D D       C  S+S    GG Y+M   S      N   FS
Sbjct: 156 DCDSSTESQCCPLSSSTCPAGGRYIMNPSSSP----NITYFS 193



 Score = 36.6 bits (85), Expect = 0.028
 Identities = 9/67 (13%), Positives = 22/67 (32%), Gaps = 13/67 (19%)

Query: 96  RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVI 155
              LL+ AD  +    GG++       +I+L++    +Y                +   +
Sbjct: 4   TVALLVAADCSYVAAFGGTD--AAQANIINLVNTASNVYEREF-----------NISLGL 50

Query: 156 KKIVVHS 162
             + +  
Sbjct: 51  VNLTISD 57



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 10/73 (13%), Positives = 23/73 (31%), Gaps = 9/73 (12%)

Query: 558 GMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLL---ETFSSHVDGTNFCLAHLFT 614
            +   +  + +           +         +  +LL   + FS+     N  LA+   
Sbjct: 45  NISLGLVNLTISDSTCPYTPPASS------SGNASDLLSRFQAFSAWRGRQNDDLAYWTL 98

Query: 615 HQSFWSRGSSILG 627
             +  + G + LG
Sbjct: 99  MTNCSTGGLAWLG 111


>gnl|CDD|222233 pfam13574, Reprolysin_2, Metallo-peptidase family M12B
           Reprolysin-like.  This zinc-binding metallo-peptidase
           has the characteristic binding motif HExxGHxxGxxH of
           Reprolysin-like peptidases of family M12B.
          Length = 173

 Score = 82.7 bits (205), Expect = 3e-18
 Identities = 43/210 (20%), Positives = 67/210 (31%), Gaps = 64/210 (30%)

Query: 189 ISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDAT-RVRQGEAHYNMIRDK-- 245
           ++ ++RV+ +Y                  F I+ ++V++D           YN   +   
Sbjct: 9   VNTVNRVNGVYER---------------DFNIRLVLVNNDRLIFTDPATDPYNNNCNGGT 53

Query: 246 WDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNG 305
               N     S      ++ + HLF+   F GG LGLA++G        GIC   Y   G
Sbjct: 54  LLSENQNL-LSNLIGEANYDIGHLFST--FGGGGLGLAWLG--------GICQKGYKGTG 102

Query: 306 YTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHD 365
            T                                G      E D V AHE GH +G+ H 
Sbjct: 103 ST-----------------------------TPSGDPF---EID-VVAHEIGHQFGANHT 129

Query: 366 PDM--PECSPSASQGGSYLMYTYSVSGYDV 393
                   S +    GS +M    +    +
Sbjct: 130 FSGGCEGSSATEPGSGSTIMSYAGICNNTL 159



 Score = 32.3 bits (74), Expect = 0.55
 Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 9/92 (9%)

Query: 543 TQNQQSCMCDVNGYRGMGFVIKKIVVHSDAT-RVRQGEAHYNMIRDK--WDVRNLLETFS 599
                + +  VNG     F I+ ++V++D           YN   +       N     S
Sbjct: 5   LAAMVNTVNRVNGVYERDFNIRLVLVNNDRLIFTDPATDPYNNNCNGGTLLSENQN-LLS 63

Query: 600 SHVDGTNFCLAHLFTHQSFWSRGSSILGLAYI 631
           + +   N+ + HLF+     + G   LGLA++
Sbjct: 64  NLIGEANYDIGHLFS-----TFGGGGLGLAWL 90



 Score = 30.7 bits (70), Expect = 1.9
 Identities = 11/38 (28%), Positives = 19/38 (50%), Gaps = 5/38 (13%)

Query: 394 NNKTFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYI 431
           N    S+ +   N+ + HLF+     + G   LGLA++
Sbjct: 58  NQNLLSNLIGEANYDIGHLFS-----TFGGGGLGLAWL 90


>gnl|CDD|235242 PRK04179, rpl37e, 50S ribosomal protein L37e; Reviewed.
          Length = 62

 Score = 74.3 bits (183), Expect = 1e-16
 Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 2/58 (3%)

Query: 780 TKGTSSFGKR-RNKTHTLCRRCGKSSYHIQKKKCAQCGY-PCPRLRHYNWSVKAQRRK 835
           +KGT S GKR + KTH  CRRCG+ SY+++KK CA CG+    R+R Y+W  K    K
Sbjct: 2   SKGTPSMGKRNKGKTHIRCRRCGRHSYNVRKKYCAACGFGRSKRIRRYSWQNKKVNGK 59


>gnl|CDD|239795 cd04267, ZnMc_ADAM_like, Zinc-dependent metalloprotease, ADAM_like
           or reprolysin_like subgroup. The adamalysin_like or ADAM
           family of metalloproteases contains proteolytic domains
           from snake venoms, proteases from the mammalian
           reproductive tract, and the tumor necrosis factor alpha
           convertase, TACE. ADAMs (A Disintegrin And
           Metalloprotease) are glycoproteins, which play roles in
           cell signaling, cell fusion, and cell-cell interactions.
          Length = 192

 Score = 75.5 bits (186), Expect = 2e-15
 Identities = 60/247 (24%), Positives = 87/247 (35%), Gaps = 74/247 (29%)

Query: 157 KIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGM 216
           ++VV +D   V    +  N+++    +       LI+  + IY  T        +   G+
Sbjct: 4   ELVVVADHRMVSYFNSDENILQ--AYI-----TELINIANSIYRST--------NLRLGI 48

Query: 217 GFVIKKIVVHSDATRVRQGEAHYNMIRD-KWDVRNLLEVFS--REYSHKDFCLAHLFTDL 273
              ++ +       ++ +GE           D  N L  FS  R         A L T  
Sbjct: 49  RISLEGL-------QILKGE---QFAPPIDSDASNTLNSFSFWRAEGPIRHDNAVLLTAQ 98

Query: 274 KF-EGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGL 332
            F EG ILGLAYVGS        +C P                           Y + G+
Sbjct: 99  DFIEGDILGLAYVGS--------MCNP---------------------------YSSVGV 123

Query: 333 SSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYD 392
                       T    L  AHE GHN G+EHD    E +     GG+Y+M     SG  
Sbjct: 124 VEDTGF------TLLTALTMAHELGHNLGAEHDGG-DELAFECDGGGNYIMA-PVDSG-- 173

Query: 393 VNNKTFS 399
           +N+  FS
Sbjct: 174 LNSYRFS 180



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 18/59 (30%), Positives = 21/59 (35%), Gaps = 4/59 (6%)

Query: 589 WDVRNLLETFS--SHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPYSIGGSIHE 645
            D  N L +FS            A L T Q F      ILGLAY+ S         + E
Sbjct: 69  SDASNTLNSFSFWRAEGPIRHDNAVLLTAQDF--IEGDILGLAYVGSMCNPYSSVGVVE 125



 Score = 37.4 bits (87), Expect = 0.015
 Identities = 12/64 (18%), Positives = 24/64 (37%), Gaps = 9/64 (14%)

Query: 97  CPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIK 156
             L++VAD+R               Y+  LI+  + IY  T        +   G+   ++
Sbjct: 3   IELVVVADHRMVSYFNSDE-NILQAYITELINIANSIYRST--------NLRLGIRISLE 53

Query: 157 KIVV 160
            + +
Sbjct: 54  GLQI 57



 Score = 35.5 bits (82), Expect = 0.061
 Identities = 15/52 (28%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 392 DVNNKTFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPYSIGGGI 443
            +N+ +F           A L T Q F      ILGLAY+ S        G+
Sbjct: 74  TLNSFSFWRAEGPIRHDNAVLLTAQDF--IEGDILGLAYVGSMCNPYSSVGV 123


>gnl|CDD|215784 pfam00200, Disintegrin, Disintegrin. 
          Length = 76

 Score = 53.0 bits (128), Expect = 7e-09
 Identities = 18/44 (40%), Positives = 20/44 (45%), Gaps = 1/44 (2%)

Query: 477 NSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSSSVCPPSAPM 520
             PCC  CKF   G  CR A    C+    CTG S+ CPP    
Sbjct: 34  TGPCCDQCKFKPAGTVCRPASG-ECDLPEYCTGQSAECPPDVYK 76



 Score = 37.2 bits (87), Expect = 0.002
 Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 16/59 (27%)

Query: 717 EGDEECDAGLLGTEDNDSCCD-KVCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLR 774
           E  EECD G    E  + CCD   CKL+   GA   +C  G         CCD+ CK +
Sbjct: 1   EEGEECDCGSPE-ECQNPCCDATTCKLK--PGA---QCATGP--------CCDQ-CKFK 44


>gnl|CDD|214490 smart00050, DISIN, Homologues of snake disintegrins.  Snake
           disintegrins inhibit the binding of ligands to integrin
           receptors. They contain a 'RGD' sequence, identical to
           the recognition site of many adhesion proteins.
           Molecules containing both disintegrin and
           metalloprotease domains are known as ADAMs.
          Length = 75

 Score = 46.5 bits (111), Expect = 1e-06
 Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 477 NSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSSSVCPP 516
           + PCC NCKF   G  CR +    C+    C G+S+ CPP
Sbjct: 34  SGPCCDNCKFKPAGTLCRPSV-DECDLPEYCNGTSADCPP 72



 Score = 34.2 bits (79), Expect = 0.025
 Identities = 21/60 (35%), Positives = 25/60 (41%), Gaps = 16/60 (26%)

Query: 717 EGDEECDAGLLGTEDNDSCCDKV-CKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRR 775
           E  EECD G    E  D CCD   CKL+        +C +G         CCD  CK + 
Sbjct: 1   EEGEECDCGSPK-ECTDPCCDPATCKLKPG-----AQCASGP--------CCDN-CKFKP 45


>gnl|CDD|238124 cd00203, ZnMc, Zinc-dependent metalloprotease. This super-family of
           metalloproteases contains two major branches, the
           astacin-like proteases and the
           adamalysin/reprolysin-like proteases. Both branches have
           wide phylogenetic distribution, and contain
           sub-families, which are involved in vertebrate
           development and disease.
          Length = 167

 Score = 48.3 bits (115), Expect = 2e-06
 Identities = 30/145 (20%), Positives = 39/145 (26%), Gaps = 49/145 (33%)

Query: 265 CLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGY 324
            +A L T   F+GG  G AY+G    +  G                              
Sbjct: 53  DIAILVTRQDFDGGTGGWAYLGRVCDSLRGVG---------------------------- 84

Query: 325 TLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHD---------PDMPECSPSA 375
                       N  G    T+E     AHE GH  G  HD         P + +   + 
Sbjct: 85  --------VLQDNQSG----TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAE 132

Query: 376 SQGGSYLMYTYSVSGYDVNNKTFSS 400
                 +M     S  D   K FS 
Sbjct: 133 DDDYYSVMSYTKGSFSDGQRKDFSQ 157


>gnl|CDD|239797 cd04269, ZnMc_adamalysin_II_like, Zinc-dependent metalloprotease;
           adamalysin_II_like subfamily. Adamalysin II is a snake
           venom zinc endopeptidase. This subfamily contains other
           snake venom metalloproteinases, as well as
           membrane-anchored metalloproteases belonging to the ADAM
           family. ADAMs (A Disintegrin And Metalloprotease) are
           glycoproteins, which play roles in cell signaling, cell
           fusion, and cell-cell interactions.
          Length = 194

 Score = 44.1 bits (105), Expect = 8e-05
 Identities = 32/134 (23%), Positives = 43/134 (32%), Gaps = 49/134 (36%)

Query: 267 AHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTL 326
           A L T   F+G  +GLAY        VGG+C+P+Y                Q        
Sbjct: 90  AQLLTGRDFDGNTVGLAY--------VGGMCSPKY-----------SGGVVQDH------ 124

Query: 327 YLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTY 386
                      +     +T       AHE GHN G EHD     C+       + +M   
Sbjct: 125 ---------SRNLLLFAVT------MAHELGHNLGMEHDD--GGCT---CGRSTCIMAPS 164

Query: 387 SVSGYDVNNKTFSS 400
             S        FS+
Sbjct: 165 PSSL----TDAFSN 174



 Score = 31.8 bits (73), Expect = 0.90
 Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 11/54 (20%)

Query: 99  LLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYND----------TIWHDR 142
           L++V D   Y++  GSN       +I +++ V  IY             IW D+
Sbjct: 5   LVVVVDNSLYKKY-GSNLSKVRQRVIEIVNIVDSIYRPLNIRVVLVGLEIWTDK 57


>gnl|CDD|222240 pfam13582, Reprolysin_3, Metallo-peptidase family M12B
           Reprolysin-like.  This zinc-binding metallo-peptidase
           has the characteristic binding motif HExxGHxxGxxH of
           Reprolysin-like peptidases of family M12B.
          Length = 123

 Score = 42.7 bits (101), Expect = 8e-05
 Identities = 28/120 (23%), Positives = 36/120 (30%), Gaps = 41/120 (34%)

Query: 246 WDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNG 305
             + NL  VF        + + HLF+     GG  GLAYVG        G+C        
Sbjct: 45  ETLDNLQTVFDARIGTAGYDIGHLFSGYDG-GGGCGLAYVG--------GVCNS------ 89

Query: 306 YTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHD 365
                          K G +                         V AHE GHN+G+ H 
Sbjct: 90  -------------GKKAGVS-------------ASSSPTGDFGIDVVAHEIGHNFGANHT 123



 Score = 30.8 bits (70), Expect = 0.99
 Identities = 11/55 (20%), Positives = 19/55 (34%), Gaps = 4/55 (7%)

Query: 589 WDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPYSIGGSI 643
             + NL   F + +    + + HLF+       G    GLAY+           +
Sbjct: 45  ETLDNLQTVFDARIGTAGYDIGHLFSGYD----GGGGCGLAYVGGVCNSGKKAGV 95


>gnl|CDD|222241 pfam13583, Reprolysin_4, Metallo-peptidase family M12B
           Reprolysin-like.  This zinc-binding metallo-peptidase
           has the characteristic binding motif HExxGHxxGxxH of
           Reprolysin-like peptidases of family M12B.
          Length = 195

 Score = 38.9 bits (91), Expect = 0.005
 Identities = 28/136 (20%), Positives = 46/136 (33%), Gaps = 51/136 (37%)

Query: 251 LLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYL 310
            L  F+     K++ L HLFT         GLA++G+  +N+ GG               
Sbjct: 77  RLATFNSWRGSKNYDLGHLFTM---YTSNCGLAWLGALCQNAKGGGV------------- 120

Query: 311 NSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHD-PDMP 369
                                 +     +           + AHE GH +G+ HD     
Sbjct: 121 ----------------------ARPTKEFD----------IFAHEIGHLFGAAHDCTSSG 148

Query: 370 ECSPSASQ--GGSYLM 383
           E + SA++   G+ +M
Sbjct: 149 ETASSATEPDSGNTIM 164



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 93  TKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTI 138
           T+    L +VADY +Y   GGS  K     + +++ R++++Y   +
Sbjct: 1   TRRTYRLAVVADYSYYSIFGGSVDKVKAF-INNVVARLNEVYGRNV 45


>gnl|CDD|216491 pfam01421, Reprolysin, Reprolysin (M12B) family zinc
           metalloprotease.  The members of this family are enzymes
           that cleave peptides. These proteases require zinc for
           catalysis. Members of this family are also known as
           adamalysins. Most members of this family are snake venom
           endopeptidases, but there are also some mammalian
           proteins such as human ADAM8 and fertilin. Fertilin and
           closely related proteins appear to not have some active
           site residues and may not be active enzymes.
          Length = 198

 Score = 35.7 bits (83), Expect = 0.056
 Identities = 33/147 (22%), Positives = 43/147 (29%), Gaps = 57/147 (38%)

Query: 245 KWDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKN 304
           +W   +LL+        K    A L T + F+G  +G AY        VGG+C+P+    
Sbjct: 75  EWRETDLLK-------RKSHDNAQLLTGIDFDGNTIGAAY--------VGGMCSPKR--- 116

Query: 305 GYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVT-AHEFGHNWGSE 363
                                       S                 VT AHE GHN G  
Sbjct: 117 ----------------------------SVGVVQDHS--PIVLLVAVTMAHELGHNLGMT 146

Query: 364 HDPDM------PEC--SPSASQGGSYL 382
           HD           C  +P AS      
Sbjct: 147 HDDIDGCTCGGGGCIMNPVASSSPGKK 173



 Score = 31.4 bits (72), Expect = 1.4
 Identities = 17/76 (22%), Positives = 29/76 (38%), Gaps = 24/76 (31%)

Query: 99  LLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYND----------TIWHDRSEQDGY 148
           L +V D+  + +  GS+       +  +++ V++IY             IW   S+ D  
Sbjct: 5   LFIVVDHGMFTKY-GSDLNKIRQRVHQIVNLVNEIYRPLNIRVVLVGLEIW---SDGD-- 58

Query: 149 RGMGFVIKKIVVHSDA 164
                   KI V  DA
Sbjct: 59  --------KITVQGDA 66


>gnl|CDD|239801 cd04273, ZnMc_ADAMTS_like, Zinc-dependent metalloprotease,
           ADAMTS_like subgroup. ADAMs (A Disintegrin And
           Metalloprotease) are glycoproteins, which play roles in
           cell signaling, cell fusion, and cell-cell interactions.
           This particular subfamily represents domain
           architectures that combine ADAM-like metalloproteinases
           with thrombospondin type-1 repeats. ADAMTS (a
           disintegrin and metalloproteinase with thrombospondin
           motifs) proteinases are inhibited by TIMPs (tissue
           inhibitors of metalloproteinases), and they play roles
           in coagulation, angiogenesis, development and
           progression of arthritis. They hydrolyze the von
           Willebrand factor precursor and various components of
           the extracellular matrix.
          Length = 207

 Score = 35.3 bits (82), Expect = 0.087
 Identities = 12/28 (42%), Positives = 12/28 (42%)

Query: 351 VTAHEFGHNWGSEHDPDMPECSPSASQG 378
             AHE GH  G  HD D   C P    G
Sbjct: 143 TIAHELGHVLGMPHDGDGNSCGPEGKDG 170



 Score = 30.7 bits (70), Expect = 2.3
 Identities = 10/38 (26%), Positives = 21/38 (55%), Gaps = 3/38 (7%)

Query: 99  LLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYND 136
            L+VAD +  +   G + +    Y+++L++ V  +Y D
Sbjct: 5   TLVVADSKMVEFHHGEDLEH---YILTLMNIVASLYKD 39


>gnl|CDD|239799 cd04271, ZnMc_ADAM_fungal, Zinc-dependent metalloprotease,
           ADAM_fungal subgroup. The adamalysin_like or ADAM (A
           Disintegrin And Metalloprotease) family of
           metalloproteases are integral membrane proteases acting
           on a variety of extracellular targets. They are involved
           in shedding soluble peptides or proteins from the cell
           surface. This subfamily contains fungal ADAMs, whose
           precise function has yet to be determined.
          Length = 228

 Score = 35.1 bits (81), Expect = 0.099
 Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 4/56 (7%)

Query: 329 NSGLSSSRN--HYGQRVITREAD--LVTAHEFGHNWGSEHDPDMPECSPSASQGGS 380
            +G S   N    G  V+ R ++   V AHE GH +G+ HD     CS  +     
Sbjct: 122 RTGASDQGNETVAGTNVVVRTSNEWQVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQ 177


>gnl|CDD|200169 TIGR02232, myxo_disulf_rpt, Myxococcus cysteine-rich repeat.  This
           model represents a sequence region shared between
           several proteins of Myxococcus xanthus DK 1622 and some
           eukaryotic proteins that include human pappalysin-1
           (SP|Q13219). The region of about 40 amino acids contains
           several conserved Cys residues presumed to form
           disulfide bonds. The region appears in up to 13 repeats
           in Myxococcus.
          Length = 38

 Score = 29.6 bits (67), Expect = 0.44
 Identities = 13/34 (38%), Positives = 16/34 (47%), Gaps = 3/34 (8%)

Query: 711 CGNLRVEGDEECDAGLLGTEDNDSCCDKVCKLRR 744
           CG+  +E  EECD G   T   D  C   C+L  
Sbjct: 4   CGDGIIEPGEECDDG--NTTSGDG-CSATCRLEE 34


>gnl|CDD|184418 PRK13958, PRK13958, N-(5'-phosphoribosyl)anthranilate isomerase;
           Provisional.
          Length = 207

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 636 PYSIGGSIH-ENLKYFDTLHSSHISHTIVKRGVQESNHP-FNKIKEV 680
           PY I G I+ EN++  + L  SH  + I   G++ +     NK+  +
Sbjct: 154 PYLIAGGINSENIQTVEQLKLSHQGYDIAS-GIETNGRKDINKMTAI 199


>gnl|CDD|117046 pfam08469, NPHI_C, Nucleoside triphosphatase I C-terminal.  This
           viral domain is found to the C-terminus of Poxvirus
           nucleoside triphosphatase phosphohydrolase I (NPH I)
           together with the helicase conserved C-terminal domain
           (pfam00271).
          Length = 148

 Score = 30.4 bits (69), Expect = 1.9
 Identities = 13/69 (18%), Positives = 26/69 (37%), Gaps = 5/69 (7%)

Query: 642 SIHENLKYFDTLHSSHISHTIVKRGVQESNHP--FNKIKE---VTFKTLGNLRSIRKVLE 696
            IH + K F+ +        +  R + E+N      K+     + +     L +I K  +
Sbjct: 56  WIHSHYKDFEPVDDESGFRALRSRRIDENNKSNVIVKLALGQNIWYSKSYRLVTIHKGFK 115

Query: 697 AKSGKCFSE 705
            K G+ +  
Sbjct: 116 TKDGRIYDS 124


>gnl|CDD|218407 pfam05061, Pox_A11, Poxvirus A11 Protein.  Family of conserved
           Chordopoxvirinae A11 family proteins. Conserved region
           spans entire protein in the majority of family members.
          Length = 309

 Score = 31.3 bits (71), Expect = 2.1
 Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 13/68 (19%)

Query: 669 ESNHPFNKIKEVTFKTLGNLRSIRKVLEAKSGKCFSEPEESFCGNLRVEGDEECDAGLLG 728
           E  +P NK  E+T   L  LR++   L AK+    S P+         + + + D  L+ 
Sbjct: 20  EDGYPSNKNYEITTGQLSILRTVNDKLLAKT----SSPD---------DYESDVDESLIP 66

Query: 729 TEDNDSCC 736
            +D+    
Sbjct: 67  DDDSPRTI 74


>gnl|CDD|237756 PRK14559, PRK14559, putative protein serine/threonine phosphatase;
           Provisional.
          Length = 645

 Score = 31.6 bits (72), Expect = 2.2
 Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 3/27 (11%)

Query: 794 HTLCRRCGKSSYHIQKKKCAQCGYPCP 820
           +  C++CG S  H   K C QCG   P
Sbjct: 15  NRFCQKCGTSLTH---KPCPQCGTEVP 38


>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin.  The members of this family are
           enzymes that cleave peptides. These proteases require
           zinc for catalysis.
          Length = 159

 Score = 30.3 bits (69), Expect = 2.4
 Identities = 16/39 (41%), Positives = 17/39 (43%), Gaps = 12/39 (30%)

Query: 350 LVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMY-TYS 387
           LV AHE GH  G  H  D     P A      +MY  YS
Sbjct: 110 LVAAHEIGHALGLGHSSD-----PDA------IMYPYYS 137


>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
          Length = 1121

 Score = 31.0 bits (71), Expect = 3.4
 Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 11/55 (20%)

Query: 797 CRRCGKSSYHIQKKKCAQCG------YPCPRLRHYNWSVKAQR--RKTTGTGRMR 843
           C  CGK ++    ++C  CG      Y CPR        + ++  R+ T   + +
Sbjct: 629 CPSCGKETF---YRRCPFCGTHTEPVYRCPRCGIEVEEDECEKCGREPTPYSKRK 680


>gnl|CDD|220768 pfam10462, Peptidase_M66, Peptidase M66.  This family of
           metallopeptidases contains StcE, a virulence factor
           found in Shiga toxigenic Escherichia coli organisms.
           StcE peptidase cleaves C1 esterase inhibitor.
          Length = 304

 Score = 30.4 bits (69), Expect = 3.7
 Identities = 18/47 (38%), Positives = 21/47 (44%), Gaps = 9/47 (19%)

Query: 354 HEFGHNWGSEHDPDMPECSP--SASQGGSYLMYTYSVSGYDVNNKTF 398
           HE GHN+G  H P     SP  S S  G       S  G+D N+  F
Sbjct: 198 HEQGHNYGLGHYPAGEAGSPYRSGSLLG-------SEWGWDANHNRF 237


>gnl|CDD|184942 PRK14980, PRK14980, DNA-directed RNA polymerase subunit G;
           Provisional.
          Length = 127

 Score = 29.4 bits (66), Expect = 4.0
 Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 3/41 (7%)

Query: 240 NMIRDKWDVRNLLEVFSREYSHKDFCLAH--LFTDLKFEGG 278
           N+   K  V+ ++      Y++ DFC AH  + T+    G 
Sbjct: 47  NIFSQKEKVKAIISKERPSYTNDDFC-AHGYIVTESSNNGN 86


>gnl|CDD|99898 cd05837, MSH6_like, The PWWP domain is present in MSH6, a mismatch
           repair protein homologous to bacterial MutS.   The PWWP
           domain of histone-lysine N-methyltransferase, also known
           as Nuclear SET domain-containing protein 3, is also
           included. Mutations in MSH6 have been linked to
           increased cancer susceptibility, particularly in
           hereditary nonpolyposis colorectal cancer in humans.
           The role of the PWWP domain in MSH6 is not clear; MSH6
           orthologs found in S. cerevisiae, Caenorhabditis elegans
           and Arabidopsis thaliana lack the PWWP domain.   Histone
           methyltransferases (HMTases) induce the
           posttranslational methylation of lysine residues in
           histones and play a role in apoptosis.  In the HMTase
           Whistle, the PWWP domain is necessary for HMTase
           activity. The PWWP domain, named for a conserved
           Pro-Trp-Trp-Pro motif, is a small domain consisting of
           100-150 amino acids. The PWWP domain is found in
           numerous proteins that are involved in cell division,
           growth and differentiation. Most PWWP-domain proteins
           seem to be nuclear, often DNA-binding, proteins that
           function as transcription factors regulating a variety
           of developmental processes.
          Length = 110

 Score = 28.9 bits (65), Expect = 4.2
 Identities = 9/39 (23%), Positives = 13/39 (33%), Gaps = 3/39 (7%)

Query: 349 DLVTAHEFGHNWGS---EHDPDMPECSPSASQGGSYLMY 384
           DLV A   G+ W       DP +   + +         Y
Sbjct: 7   DLVWAKVSGYPWWPCMVCSDPLLGTYTKTKRNKRKPRQY 45


>gnl|CDD|222658 pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy.
           This family includes O-antigen polysaccharide
           polymerases. These enzymes link O-units via a glycosidic
           linkage to form a long O-antigen. These enzymes vary in
           specificity and sequence.
          Length = 440

 Score = 30.3 bits (69), Expect = 4.7
 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 376 SQGGSYLM--YTYSVSGYDVNNKTFSSHVDGTNFCLAHLFTHQSFW--SRGSSILG---- 427
            QG S  +  Y  +      N K ++         L ++F    F   S  +++LG    
Sbjct: 284 GQGVSLNVLAYAITYVDSLPNGKNYTYGFLLQGI-LGNIFGLSPFIGQSVENALLGNSLA 342

Query: 428 --LAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELF 468
             L+Y+ +P  Y +GGG      +GS          S ++EL+
Sbjct: 343 HRLSYLLNPDTYLLGGG------LGS----------SYIAELY 369


>gnl|CDD|188777 cd09391, LIM1_Lrg1p_like, The first LIM domain of Lrg1p, a LIM and
           RhoGap domain containing protein.  The first LIM domain
           of Lrg1p, a LIM and RhoGap domain containing protein:
           The members of this family contain three tandem repeats
           of LIM domains and a Rho-type GTPase activating protein
           (RhoGap) domain. Lrg1p is a Rho1 GTPase-activating
           protein required for efficient cell fusion in yeast.
           Lrg1p-GAP domain strongly and specifically stimulates
           the GTPase activity of Rho1p, a regulator of beta
           (1-3)-glucan synthase in vitro. The LIM domain is 50-60
           amino acids in size and shares two characteristic zinc
           finger motifs. The two zinc fingers contain eight
           conserved residues, mostly cysteines and histidines,
           which coordinately bond to two zinc atoms. LIM domains
           function as adaptors or scaffolds to support the
           assembly of multimeric protein.
          Length = 57

 Score = 26.9 bits (60), Expect = 6.8
 Identities = 10/34 (29%), Positives = 12/34 (35%), Gaps = 11/34 (32%)

Query: 797 CRRCGK-----------SSYHIQKKKCAQCGYPC 819
           C +CGK             YH+    C  CG P 
Sbjct: 1   CAKCGKPITGQFVRALGDVYHLDCFTCHDCGKPV 34


>gnl|CDD|215548 PLN03042, PLN03042, Lactoylglutathione lyase; Provisional.
          Length = 185

 Score = 29.0 bits (65), Expect = 6.9
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 355 EFGHNWGSEHDPDM 368
           E  HNWG+E DP+ 
Sbjct: 96  ELTHNWGTESDPEF 109


>gnl|CDD|173690 cd05599, STKc_NDR_like, Catalytic domain of Nuclear Dbf2-Related
           kinase-like Protein Serine/Threonine Kinases.
           Serine/Threonine Kinases (STKs), Nuclear Dbf2-Related
           (NDR) kinase subfamily, catalytic (c) domain. STKs
           catalyze the transfer of the gamma-phosphoryl group from
           ATP to serine/threonine residues on protein substrates.
           The NDR subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. NDR kinase contains an N-terminal regulatory
           (NTR) domain and an insert within the catalytic domain
           that contains an auto-inhibitory sequence. Like many
           other AGC kinases, NDR kinase requires phosphorylation
           at two sites, the activation loop (A-loop) and the
           hydrophobic motif (HM), for activity. NDR kinases
           regulate mitosis, cell growth, embryonic development,
           and neurological processes. They are also required for
           proper centrosome duplication. Higher eukaryotes contain
           two NDR isoforms, NDR1 and NDR2. This subfamily also
           contains fungal NDR-like kinases.
          Length = 364

 Score = 29.6 bits (67), Expect = 6.9
 Identities = 14/43 (32%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 41  WDHAHDKPASPCGYVKEEFNTTDFDLDMEEDDPDMPHTRSKRQ 83
           W+H  ++PA     +K   +T++FD D EE D D+P +    +
Sbjct: 304 WEHIRERPAPIIPELKSITDTSNFD-DFEEIDLDVPTSPGPPR 345


>gnl|CDD|237325 PRK13267, PRK13267, archaemetzincin-like protein; Reviewed.
          Length = 179

 Score = 28.8 bits (65), Expect = 7.8
 Identities = 36/153 (23%), Positives = 47/153 (30%), Gaps = 48/153 (31%)

Query: 265 CLAHLFTD-LKFEGGILGLAYVG-------SPRRNSVGGICTPEYFKNGYTLYLNSGLSS 316
             A  F   L   G   G   +G           N V G+  P          L   + S
Sbjct: 53  YRAEKFLPLLSRIGRFNGDKNIGITDCDLYYRGLNFVFGLAYPN---------LRGAVIS 103

Query: 317 SQYFKNGYTLYLNS-GLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSA 375
           +      Y L     G       + +RV  +E      HE GH  G EH  + P C    
Sbjct: 104 T------YRLRPEFYGNKPDSELFEERVR-KEV----THELGHTLGLEHCDN-PRC---- 147

Query: 376 SQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFC 408
                 +M  +S S  DV+ K         NFC
Sbjct: 148 ------VMN-FSNSVRDVDIK-------EPNFC 166


>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
           Aminopeptidase N.  This family contains bacterial
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 435

 Score = 29.2 bits (66), Expect = 9.3
 Identities = 10/23 (43%), Positives = 11/23 (47%), Gaps = 2/23 (8%)

Query: 340 GQRVITREADL--VTAHEFGHNW 360
           G RV+     L  V AHE  H W
Sbjct: 278 GGRVLRLPFILETVLAHEIAHQW 300


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.135    0.431 

Gapped
Lambda     K      H
   0.267   0.0732    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 42,909,640
Number of extensions: 4086230
Number of successful extensions: 2980
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2946
Number of HSP's successfully gapped: 78
Length of query: 857
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 752
Effective length of database: 6,280,432
Effective search space: 4722884864
Effective search space used: 4722884864
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.0 bits)