BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy7399
         (107 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 578

 Score =  128 bits (322), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 60/101 (59%), Positives = 78/101 (77%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+S+GR++ SL + G+L NS++VF+SDNGG T+       N ASNWPLRGTKY+ 
Sbjct: 247 MVARLDDSLGRIVSSLGDRGMLKNSVIVFMSDNGGATIGKFR---NWASNWPLRGTKYTL 303

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR+VAA+WSP LL KG+V E L H+TDWLPTL+ +AG
Sbjct: 304 FEGGVRSVAAMWSPKLLSKGRVSEQLFHVTDWLPTLFQVAG 344


>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 575

 Score =  126 bits (317), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/101 (64%), Positives = 74/101 (73%), Gaps = 1/101 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG++I+SL E  +L NS+VVFISDNG PT D L  H N  SNWPLRG K S 
Sbjct: 263 MVTSLDDSVGKIIESLHEHEMLENSIVVFISDNGAPTDDPLWGHDNFGSNWPLRGKKASV 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VAA+WSP L+ K  V  NL HITDWLPTLY  AG
Sbjct: 323 LEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAG 363


>gi|328726458|ref|XP_001949756.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 402

 Score =  125 bits (315), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/101 (63%), Positives = 74/101 (73%), Gaps = 1/101 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG++I+SL E  +L NS+VVF+SDNG PT D L  H N  SNWPLRG K S 
Sbjct: 90  MVTSLDDSVGKIIESLHEHEMLENSIVVFMSDNGAPTDDPLWGHDNFGSNWPLRGKKASV 149

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VAA+WSP L+ K  V  NL HITDWLPTLY  AG
Sbjct: 150 LEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAG 190


>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 599

 Score =  125 bits (314), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 61/101 (60%), Positives = 77/101 (76%), Gaps = 1/101 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML SLDESVG +I+SL+  G+L +S+V+FISDNG P  D +  +GN+ SNWPLRG K + 
Sbjct: 263 MLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDPIWGYGNSGSNWPLRGEKGAV 322

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            +GGVR VAA+WSP L+K  ++ ENL HITDWLPTLY  AG
Sbjct: 323 LDGGVRGVAAIWSPWLKKKHRIFENLFHITDWLPTLYTAAG 363


>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
          Length = 564

 Score =  118 bits (295), Expect = 5e-25,   Method: Composition-based stats.
 Identities = 57/101 (56%), Positives = 76/101 (75%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGRV+ +L E+G+L +SL++F++DNG  ++     + N  SN+PLRGTKY+ 
Sbjct: 251 MVSKLDESVGRVVHALGEKGMLRDSLILFLTDNGAASIGR---YRNYGSNYPLRGTKYTL 307

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
           +EGGVR VAALWS  L KG +V + L+HITDWLPTLY  AG
Sbjct: 308 YEGGVRGVAALWSSRLEKGARVFKKLIHITDWLPTLYSAAG 348


>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 575

 Score =  116 bits (291), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 56/101 (55%), Positives = 77/101 (76%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR++ +L E+G+L +SL++F++DNG  +V     + N  SN+PLRGTKY+ 
Sbjct: 248 MVSRLDDSVGRIVHALGEKGMLRDSLILFLTDNGAASVGR---YRNWGSNYPLRGTKYTL 304

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           +EGGVR+VAALWSP L+K  +V   L+HITDWLPTLY  AG
Sbjct: 305 YEGGVRSVAALWSPRLQKVARVSNQLIHITDWLPTLYSAAG 345


>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 579

 Score =  115 bits (288), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 55/101 (54%), Positives = 75/101 (74%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGR++ +L E+G+L +SL++F++DNG  ++     + N  SN+PLRGTKY+ 
Sbjct: 251 MVSKLDESVGRIVHALGEKGMLRDSLILFLTDNGAASIGR---YRNYGSNYPLRGTKYTL 307

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           +EGGVR VAALWS  L K  +V + L+HITDWLPTLY  AG
Sbjct: 308 YEGGVRGVAALWSSRLEKAARVFKKLIHITDWLPTLYSAAG 348


>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
          Length = 591

 Score =  115 bits (287), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 56/101 (55%), Positives = 74/101 (73%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD SVGR++Q+L  +G+L NSL++F++DNG   +     +G   SN+PLRG KY+ 
Sbjct: 252 MVLRLDNSVGRIVQALGNKGMLKNSLILFLTDNGAAPIGKFRNYG---SNYPLRGMKYTL 308

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           +EGGVR VA LWSP LRK  +V + LMH+TDWLPTLY IAG
Sbjct: 309 YEGGVRGVALLWSPRLRKTARVSDELMHVTDWLPTLYSIAG 349


>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 532

 Score =  113 bits (283), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 55/100 (55%), Positives = 77/100 (77%), Gaps = 1/100 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            L+ LD+SVG V+++L+E+ +L NS+++ +SDNGG TV S+    N +SNWPL GTK++ 
Sbjct: 261 CLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTV-SVDLPPNWSSNWPLGGTKFTL 319

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR+VA +WSPLL KG + ++ +HITDWLPTLY  AG
Sbjct: 320 FEGGVRSVALIWSPLLPKGVINDDFIHITDWLPTLYSAAG 359


>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 593

 Score =  112 bits (279), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 55/101 (54%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGR++ +L   G+L +SL++F++DNG PT+       N  SN PLRG KY+ 
Sbjct: 255 MVTRLDESVGRIVHTLGNRGMLRDSLILFLTDNGAPTIGKFR---NWGSNLPLRGMKYTL 311

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           +EGGVR VA LWSP LRK  +V   L+HITDWLPT Y  AG
Sbjct: 312 YEGGVRGVAVLWSPRLRKAARVCNELVHITDWLPTFYSAAG 352


>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
          Length = 581

 Score =  111 bits (278), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 56/103 (54%), Positives = 75/103 (72%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGR+  +L E+G+L +SL++F++DNG  +V     + N  SN+PLRGTKY+ 
Sbjct: 252 MVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGR---YRNYGSNYPLRGTKYTL 308

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
           +EGGVR VAALWS  L+K   + N L+HITDWLPTLY  AG +
Sbjct: 309 YEGGVRGVAALWSSRLQKAARVSNELVHITDWLPTLYSAAGGD 351


>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 545

 Score =  111 bits (277), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 57/101 (56%), Positives = 72/101 (71%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG+V+++L ++G+L NS++VF +DNGGP      F+ N ASNWPLRGTK + 
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAG---FNLNAASNWPLRGTKNTL 307

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    LWSP L+R G+V   L HI+DWLPTL   AG
Sbjct: 308 WEGGVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAG 348


>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
          Length = 545

 Score =  111 bits (277), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 57/101 (56%), Positives = 72/101 (71%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG+V+++L ++G+L NS++VF +DNGGP      F+ N ASNWPLRGTK + 
Sbjct: 251 MLSKLDESVGQVVEALRKKGMLKNSVIVFSTDNGGPPAG---FNLNAASNWPLRGTKNTL 307

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    LWSP L+R G+V   L HI+DWLPTL   AG
Sbjct: 308 WEGGVRGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAG 348


>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 579

 Score =  110 bits (276), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 56/101 (55%), Positives = 73/101 (72%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGR+  +L E+G+L +SL++F++DNG  +V     + N  SN+PLRGTKY+ 
Sbjct: 252 MVSKLDESVGRIFHALGEKGMLKDSLILFLTDNGAASVGR---YRNYGSNYPLRGTKYTL 308

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR VAALWS  L R  +V   L+HITDWLPTLY  AG
Sbjct: 309 YEGGVRGVAALWSSRLQRAARVSNELVHITDWLPTLYSAAG 349


>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
          Length = 544

 Score =  110 bits (276), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 73/105 (69%), Gaps = 4/105 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG V+++L+ +G+L NS++VF +DNGGP      F+ N ASNWPLRGTK + 
Sbjct: 250 MLSKLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAG---FNLNAASNWPLRGTKNTL 306

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR    +WSP L+R G+V   L HI+DWLPTL   AG + +
Sbjct: 307 WEGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAGGDPS 351


>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
 gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
          Length = 472

 Score =  108 bits (271), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 71/105 (67%), Gaps = 4/105 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD+SVG +I +L E G+L NS++VF +DNGGP      F+ N ASNWPLRGTK + 
Sbjct: 170 MLTELDQSVGAIIVALSERGMLDNSIIVFSTDNGGPAEG---FNNNAASNWPLRGTKNTL 226

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR    +WSPL+  K +V   +MHI+DWLPTL   AG + N
Sbjct: 227 WEGGVRGAGCIWSPLIEEKRRVSHQIMHISDWLPTLLDAAGYDLN 271


>gi|241708194|ref|XP_002413315.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215507129|gb|EEC16623.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 294

 Score =  108 bits (270), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 55/102 (53%), Positives = 70/102 (68%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG V ++L++ G+L NS+VVF SDNGG   +  +   N+ S WPLRG K S 
Sbjct: 43  MVSALDESVGAVFEALDKRGMLSNSIVVFSSDNGGAPDEETY---NSGSPWPLRGQKMSL 99

Query: 61  HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A LWSPLL   +G + + L HITDWLPT Y +AG
Sbjct: 100 SEGGVRAPALLWSPLLSATEGSIFDGLFHITDWLPTFYELAG 141


>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
          Length = 581

 Score =  108 bits (269), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 52/101 (51%), Positives = 71/101 (70%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVGRV+++L+E  +L NS+++F+SDNG PTV     + NT SN+P+RG K   
Sbjct: 268 MMAALDESVGRVVKALKEAEMLENSIIIFMSDNGAPTVG---LYNNTGSNYPMRGIKGGM 324

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EG  R  A ++SPL++   +V E LMHI DWLPTLY  AG
Sbjct: 325 FEGAARAAACIFSPLIKAHSRVSEELMHIVDWLPTLYTAAG 365


>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 557

 Score =  107 bits (267), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 52/98 (53%), Positives = 68/98 (69%), Gaps = 3/98 (3%)

Query: 4   SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
           ++DES+GRV+ +L    +L NS++VF+SDNG PT D + +  N  SN+PLRG K SF EG
Sbjct: 268 TMDESIGRVVDALRRADMLKNSIIVFMSDNGAPTKDQILY--NFGSNYPLRGMKQSFFEG 325

Query: 64  GVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            VR VA ++SPL+    +V   L HITDWLPTLY  AG
Sbjct: 326 AVRGVACIYSPLIDFPSRVSTQLFHITDWLPTLYAAAG 363


>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 536

 Score =  107 bits (267), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 74/106 (69%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+S+G V ++LE + +L N++VVFISDNG PTV     H N  SN+PLRG K + 
Sbjct: 235 MVSKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGP---HHNWGSNYPLRGIKDTL 291

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR VA +WSPLL +  +V  +L+HITDWLPTL+   G + ++
Sbjct: 292 FEGGVRTVACIWSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSV 337


>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
          Length = 558

 Score =  107 bits (267), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 74/106 (69%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+S+G V ++LE + +L N++VVFISDNG PTV     H N  SN+PLRG K + 
Sbjct: 257 MVSKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGP---HHNWGSNYPLRGIKDTL 313

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR VA +WSPLL +  +V  +L+HITDWLPTL+   G + ++
Sbjct: 314 FEGGVRTVACIWSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSV 359


>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
          Length = 633

 Score =  107 bits (266), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 53/104 (50%), Positives = 76/104 (73%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT-VDSLHFHGNTASNWPLRGTKYS 59
           M+++LD++ G+ + +L+E  +L NS+++F SDNGGP+ VD  +  G  ASNWPLRG K+S
Sbjct: 285 MIRALDDAFGQTVVALKEAEMLQNSVILFTSDNGGPSIVDESYKTG--ASNWPLRGQKFS 342

Query: 60  FHEGGVRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAGKE 102
             EGGVR  + LWSPLL+  + + ++L HITDWLPT Y +AG E
Sbjct: 343 MWEGGVRVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGE 386


>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
          Length = 502

 Score =  107 bits (266), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 52/100 (52%), Positives = 69/100 (69%), Gaps = 4/100 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG ++ +L   G+L +S+++F++DNG   V     +G   SN+PLRG KY+ 
Sbjct: 160 MVSKLDDSVGLIVNTLGSRGMLRDSVILFLTDNGAAPVGKFRNYG---SNFPLRGMKYTL 216

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIA 99
           +EGGVR VA LWSP L K  +V  NLMHITDWLPTLY  A
Sbjct: 217 YEGGVRGVAVLWSPRLSKAARVCNNLMHITDWLPTLYAAA 256


>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 374

 Score =  106 bits (265), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 52/103 (50%), Positives = 72/103 (69%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG + Q+L++ GL  NS++VF +DNGGPT     + GN ASNWPLRG K++ 
Sbjct: 257 MVAGLDDSVGNISQTLQDAGLYNNSIIVFTTDNGGPTNG---YDGNHASNWPLRGCKHTL 313

Query: 61  HEGGVRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAGKE 102
            EGGVR  A + SPL+ K +   + +MH+ DWLPTLY +AG +
Sbjct: 314 WEGGVRGTAFVNSPLIEKPRRFSDRMMHVCDWLPTLYGVAGGD 356


>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
          Length = 512

 Score =  106 bits (265), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 56/104 (53%), Positives = 70/104 (67%), Gaps = 7/104 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG V ++L + G+L N+L VF SDNG    D+   + N AS+WPLRG KY+ 
Sbjct: 217 MVSALDESVGLVFEALAKNGMLSNTLFVFTSDNGA---DAAGPNANYASSWPLRGQKYTP 273

Query: 61  HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A LWS    P +R G   E+L HITDWLPTLY +AG
Sbjct: 274 WEGGVRAAALLWSHALGPRVRAGGDYEHLFHITDWLPTLYQLAG 317


>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  106 bits (265), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 54/101 (53%), Positives = 71/101 (70%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ +LDESVGRVI +L++  +L NS++VFISDNG  T   L  +G   SN+PLRG K+S 
Sbjct: 232 VVATLDESVGRVIDALKKTDMLNNSIIVFISDNGAQTEGFLQNYG---SNYPLRGLKFSL 288

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A ++SPL+ R  +V   L HITDWLPTLY  AG
Sbjct: 289 FEGGIRGAACIYSPLIDRPSRVSNQLFHITDWLPTLYSAAG 329


>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
 gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
          Length = 585

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 72/106 (67%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG+++Q+L++ G+L N++V+F +DNG PT+     H N  SN+PLRG KYS 
Sbjct: 259 MISKLDDGVGQIVQALKKSGMLNNTVVLFYADNGAPTIGK---HSNGGSNFPLRGQKYSP 315

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNI 105
            EG VR  AA+WSPLL   G+V    +H++DWLPTL   AG E N 
Sbjct: 316 WEGAVRTAAAIWSPLLNNTGRVSNQWIHVSDWLPTLARAAGIEANF 361


>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
          Length = 563

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/101 (52%), Positives = 72/101 (71%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD S+GR++  L+E+ +L NS+V+F SDNG  TV     + N+ SNWPLRG K+S 
Sbjct: 270 VVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVG---MYENSGSNWPLRGVKFSD 326

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A ++SPL  +KG V E+L+HI+DWLPTLY  AG
Sbjct: 327 FEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAG 367


>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
          Length = 557

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 74/106 (69%), Gaps = 6/106 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD SVG++++SL +  +L N+++VF +DNGGP      F+ N ASNWPL+G K + 
Sbjct: 250 MVHELDVSVGKIVKSLGDNNMLENTVIVFSTDNGGPAEG---FNQNAASNWPLKGVKNTP 306

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEK 103
            EGGVR    +WSPL+   R+GQV+ NLM I+DWLPTL+  AG+ +
Sbjct: 307 WEGGVRAAGLIWSPLIPKSRRGQVMSNLMDISDWLPTLFEAAGQSR 352


>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
          Length = 531

 Score =  105 bits (263), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 53/101 (52%), Positives = 72/101 (71%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD S+GR++  L+E+ +L NS+V+F SDNG  TV     + N+ SNWPLRG K+S 
Sbjct: 255 VVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVG---MYENSGSNWPLRGVKFSD 311

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A ++SPL  +KG V E+L+HI+DWLPTLY  AG
Sbjct: 312 FEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAG 352


>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
           purpuratus]
          Length = 514

 Score =  105 bits (263), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 51/101 (50%), Positives = 71/101 (70%), Gaps = 3/101 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG + Q+L + GL  NS++VF +DNGG T D   F G  ASNWPLRG K++ 
Sbjct: 236 MVSALDDSVGSISQTLRDAGLYDNSIIVFTTDNGGATYD--FFDGTHASNWPLRGGKHTL 293

Query: 61  HEGGVRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAG 100
            EGGVR  A + SPL++K +   + +MH+ DWLPTL+ +AG
Sbjct: 294 WEGGVRGTAFVNSPLIKKPRRFSDQMMHVCDWLPTLHSVAG 334


>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
          Length = 881

 Score =  105 bits (263), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 53/101 (52%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG+V ++L++ G+L NS++VFISDNGG T      H N+ SN+PLRG K+S 
Sbjct: 261 MMSKLDDSVGQVYRALQQHGMLENSIIVFISDNGGVTRG---MHSNSGSNYPLRGQKHSP 317

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EG VR  A +WSPLL+  Q + N   HI+DWLPTL   AG
Sbjct: 318 WEGAVRTAALIWSPLLKDRQRVSNQWFHISDWLPTLASAAG 358


>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 593

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 52/101 (51%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG+V+++L + G+L N+++VF SDNGG        H +   NWPLRG K + 
Sbjct: 299 MVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGPWGE---HNSRGFNWPLRGAKGTV 355

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +WSPLL R+ +V   LMHITDWLPT Y IAG
Sbjct: 356 WEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIAG 396


>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
          Length = 563

 Score =  105 bits (262), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 53/112 (47%), Positives = 78/112 (69%), Gaps = 9/112 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++DESVGRV+Q+L +  +L NS++VF+SDNG  TV  L+ +G   SN+PLRG K++ 
Sbjct: 267 VVTAMDESVGRVVQALRQTNMLENSIIVFMSDNGAQTVGLLNNYG---SNYPLRGLKFTL 323

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
            EGGVR VA ++SP ++    + N L+H+TDWLPT Y  AG      E+N+D
Sbjct: 324 FEGGVRGVACVYSPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLD 375


>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 554

 Score =  105 bits (262), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 52/101 (51%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG+V+++L + G+L N+++VF SDNGG        H +   NWPLRG K + 
Sbjct: 260 MVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGPWGE---HNSRGFNWPLRGAKGTV 316

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +WSPLL R+ +V   LMHITDWLPT Y IAG
Sbjct: 317 WEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIAG 357


>gi|241779012|ref|XP_002399826.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215508524|gb|EEC17978.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 103

 Score =  105 bits (261), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 53/102 (51%), Positives = 70/102 (68%), Gaps = 4/102 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDES+G V Q+L + G+LGN+++   SDNGG  + +   H N   N+PLRG K + 
Sbjct: 1   MVDALDESIGEVFQALGDAGMLGNTIIALSSDNGGLPIGA---HSNRGFNFPLRGGKGTL 57

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGK 101
            EGG R  A +WSPLL RKG V + +MHITDWLPTLY  AG+
Sbjct: 58  WEGGCRATAFIWSPLLKRKGVVSDQMMHITDWLPTLYSAAGQ 99


>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  105 bits (261), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 54/101 (53%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ +LDESVGR+I +L +  +L NS++VFISDNG  T   L  +G   SN+PLRG K+S 
Sbjct: 244 VVATLDESVGRIIDALRKTDMLKNSIIVFISDNGAQTEGFLQNYG---SNYPLRGLKFSL 300

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A ++SPL+ R  +V   L HITDWLPTLY  AG
Sbjct: 301 FEGGVRGAAYIYSPLIDRLSRVSTQLFHITDWLPTLYSAAG 341


>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
 gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
          Length = 558

 Score =  104 bits (260), Expect = 7e-21,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD S+G+V++ L E  +L NS+++F SDNG PTV     + N+ SNWPLRG K + 
Sbjct: 254 IVAELDRSIGQVVRKLSERQMLENSIILFFSDNGAPTVGP---YTNSGSNWPLRGIKLTN 310

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A ++SPLL+K G V + L+H++DWLPT Y  AG
Sbjct: 311 FEGGIRGTATIFSPLLKKRGYVNKELIHVSDWLPTFYAAAG 351


>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 532

 Score =  104 bits (259), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 55/110 (50%), Positives = 74/110 (67%), Gaps = 7/110 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD+SVG+V++ L ++G+L +S+++F +DNGGP      F+ N ASNWPLRG K + 
Sbjct: 239 ILSKLDKSVGQVVEGLRKKGMLRDSVIIFSTDNGGPAAG---FNLNAASNWPLRGVKNTL 295

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEK---NID 106
            EGGVR    +WSP L R G+V   L+HITDWLPTL   AG +    NID
Sbjct: 296 WEGGVRGAGLIWSPKLTRSGRVSRQLLHITDWLPTLITAAGGDPSNLNID 345


>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
          Length = 528

 Score =  104 bits (259), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 71/105 (67%), Gaps = 4/105 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD+SVG+V+ +L +  +L +S+++F SDNGGP      F+ N ASNWPLRG K + 
Sbjct: 234 MLSKLDQSVGQVVDALRKRDMLRDSIIIFSSDNGGPAAG---FNLNAASNWPLRGVKNTL 290

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR    +WSP L+R G+V   ++HITDWLPTL   AG + +
Sbjct: 291 WEGGVRGTGLIWSPRLVRPGRVSRQMLHITDWLPTLITAAGGDSS 335


>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 583

 Score =  104 bits (259), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 52/104 (50%), Positives = 72/104 (69%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LDESVG+V+++L+   +L NS+++F+SDNG PT     F  N  SN+P RG K + 
Sbjct: 267 VVSKLDESVGKVVEALKNRNMLSNSIILFMSDNGAPTSG---FLSNGGSNYPFRGIKKTL 323

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKEK 103
            EGGVR  AA+WSPL++K + V  +LMH+TDWLPTL   AG  K
Sbjct: 324 WEGGVRGAAAIWSPLIKKRERVSYHLMHVTDWLPTLLSAAGGNK 367


>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 610

 Score =  104 bits (259), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 55/111 (49%), Positives = 75/111 (67%), Gaps = 8/111 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LDESVGR+I +LEE  +L NS++VF +DNG PT+     + N  SN+PLRG K + 
Sbjct: 271 MVQKLDESVGRLIAALEESNMLHNSIIVFSADNGAPTIGG---YPNWGSNYPLRGIKETL 327

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE----KNID 106
            EGG R V+ +WSPL++    + N L+HI+DWLPTLY  AG      KN+D
Sbjct: 328 WEGGTRGVSFIWSPLIKNTPRVSNELIHISDWLPTLYAAAGGNMENVKNLD 378


>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
          Length = 648

 Score =  103 bits (258), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 55/104 (52%), Positives = 72/104 (69%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L ++ +L NS++VF+SDNG    D +H   N  SN+PLRG K S 
Sbjct: 249 MVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKP-DGIH--ANHGSNYPLRGNKNSA 305

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK 103
            EG +R VAA+WSPL++K Q + N LMHI+DWLPT Y  AG  K
Sbjct: 306 WEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNK 349


>gi|350414673|ref|XP_003490384.1| PREDICTED: hypothetical protein LOC100748821, partial [Bombus
           impatiens]
          Length = 413

 Score =  103 bits (258), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 56/111 (50%), Positives = 74/111 (66%), Gaps = 8/111 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+SVG V+ +L   G+L NS+VVF+SDNG PT   L    N  SN+PLRG K S 
Sbjct: 1   VVSKLDQSVGDVMDALRSRGMLENSIVVFMSDNGAPTNGIL---SNQGSNYPLRGIKNSP 57

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK----NID 106
            EGG R VAA+WSPL+RK + + N +M ++DWLPTL   AG ++    NID
Sbjct: 58  WEGGTRGVAAIWSPLIRKSKRVSNQMMFMSDWLPTLLSAAGVDRIQLGNID 108


>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
          Length = 563

 Score =  103 bits (258), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/112 (46%), Positives = 77/112 (68%), Gaps = 9/112 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ +LDESVGRV+Q+L++  +L NS+++F+SDNG  TV  L    N+ SN+PLRG K++ 
Sbjct: 267 VVTALDESVGRVVQALKQANMLENSIILFMSDNGAQTVGLL---DNSGSNYPLRGMKFTL 323

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
            EGGVR  A ++SP ++    + N L+H+TDWLPT Y  AG      E+N+D
Sbjct: 324 FEGGVRGAACVYSPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLD 375


>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 657

 Score =  103 bits (258), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/101 (51%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG + +SL  + ++ NS+++FISDNG P   S   H N+ SN+PLRG K S 
Sbjct: 238 MVSKLDDSVGAIFKSLRAKNMIQNSIILFISDNGAP---SFGLHSNSGSNFPLRGQKNSP 294

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EG  RNVAA+WSPLL + Q + N  +HI+DWLPT   IAG
Sbjct: 295 WEGATRNVAAIWSPLLEERQRVSNQYIHISDWLPTFASIAG 335


>gi|241999506|ref|XP_002434396.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497726|gb|EEC07220.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 458

 Score =  103 bits (258), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 53/102 (51%), Positives = 68/102 (66%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG V +SL   G+L N++ VF SDNG    D+   + NTAS+WP +G KY+ 
Sbjct: 161 MVSALDESVGSVFESLNHRGMLDNTVFVFSSDNGA---DTDSANANTASSWPFKGQKYTP 217

Query: 61  HEGGVRNVAALWSPLLRK--GQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A +WSPL     G V  N+ HI+DWLPTLY +AG
Sbjct: 218 WEGGVRAPAVIWSPLFSGLLGSVYNNMFHISDWLPTLYQLAG 259


>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 646

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 72/104 (69%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L ++ +L NS++VF+SDNG    D +H   N  SN+PLRG K S 
Sbjct: 247 MVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGA-KPDGIH--ANHGSNYPLRGNKNSA 303

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK 103
            EG +R VAA+WSPL++K Q + N LMHI+DWLPT Y  AG  K
Sbjct: 304 WEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNK 347


>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 570

 Score =  103 bits (256), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 53/101 (52%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVG+V+ +LE+  +L NS+++F+SDNG  T      H N  SN+PLRG K S 
Sbjct: 272 MVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYG---LHSNRGSNYPLRGLKESP 328

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGGVR  AA+WSP L K  +V + LMH++DWLPTL   AG
Sbjct: 329 WEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAG 369


>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
 gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
          Length = 577

 Score =  103 bits (256), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG+ +++L + G+L NS+V+  SDNG PTV     H N  SN+P RG K S 
Sbjct: 255 MISSLDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVG---IHSNAGSNYPFRGQKESP 311

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+V ALWSPLL ++G V    +H  DWLPTL   AG
Sbjct: 312 WEGGIRSVGALWSPLLQQRGYVSNQAIHAIDWLPTLAAAAG 352


>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
 gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
          Length = 634

 Score =  103 bits (256), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 53/101 (52%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG V +SL   G+L NS++VF+SDNGG T      H NT SN+P RG K+S 
Sbjct: 261 MMSKLDDSVGAVYESLRANGMLDNSIIVFLSDNGGVTRG---MHSNTGSNYPFRGQKHSP 317

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EG VR  A +WSPLL+  Q + N   HI+DWLPTL   AG
Sbjct: 318 WEGAVRTAALIWSPLLKDTQRVSNQWFHISDWLPTLASAAG 358


>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 234

 Score =  102 bits (255), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG V+ +L++  +L ++++VF +DNG        F  N  SNWPLRGTK++ 
Sbjct: 137 MVDALDQSVGSVVDALQKADMLTDTILVFCTDNGAL---PWGFKSNRGSNWPLRGTKFTL 193

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
           +EGGVR+ A +WSPL+   + V   LMHITDWLPTLY  AG
Sbjct: 194 YEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAAG 234


>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 425

 Score =  102 bits (255), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LDES+GRV+++L   G+L ++++VF SDNGG          N   NWPLRG K++ 
Sbjct: 176 MLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGG---GPYGIESNRGFNWPLRGAKFTV 232

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A +WSP  L+K +V   LMHI+DWLPTLY  AG
Sbjct: 233 WEGGVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAAG 273


>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 552

 Score =  102 bits (255), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 71/101 (70%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG ++++L+ +G+L N+++VF SDNGGP      F+ N ASNWPLRG K + 
Sbjct: 249 MLSELDESVGAIVEALKIKGMLDNTIIVFSSDNGGPAEG---FNSNAASNWPLRGVKNTL 305

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            EGGVR    +WSPL+ +  +V + ++HI+DWLPTL   AG
Sbjct: 306 WEGGVRAAGFIWSPLIAESRRVSQQMIHISDWLPTLLDAAG 346


>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
          Length = 613

 Score =  102 bits (254), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+S+G + QSL+E  L  N+L+VF +DNGGP      F  N ASNWPLRG K++ 
Sbjct: 286 MVAALDDSIGNITQSLKESSLYNNTLIVFTTDNGGPANG---FDFNYASNWPLRGMKHTT 342

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  LW  L+ K G++ + +MH+ DW+PTLY +AG
Sbjct: 343 WEGGLRGVGFLWGALIEKPGRMSDGMMHVCDWVPTLYGLAG 383


>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
          Length = 438

 Score =  102 bits (253), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 71/101 (70%), Gaps = 3/101 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ +LD+S+GRV+ +L ++ +L N++++FISDNG  T   L+   N  SN+PLRG K+S 
Sbjct: 185 VVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQTEGPLY--QNFGSNYPLRGLKFSL 242

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A ++SPL+    +V   L HITDWLPTLY  AG
Sbjct: 243 YEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAG 283


>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
          Length = 519

 Score =  102 bits (253), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD+SVG+V+++L +  +L +S++VF +DNGGP      F+ N ASNWPLRG K + 
Sbjct: 223 ILSKLDQSVGQVVEALYKNNMLQDSVIVFSTDNGGPAAG---FNLNAASNWPLRGVKNTL 279

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    +WSP L+R G+V   ++HITDWLPTL   AG
Sbjct: 280 WEGGVRGAGLIWSPRLVRAGRVSRQMIHITDWLPTLITAAG 320


>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 559

 Score =  102 bits (253), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG + ++L+  GL  N +++F +DNGGP      F  N ASNWPLRG K++ 
Sbjct: 252 MVSALDDSVGNITRALKHSGLYENCIILFSTDNGGPAAG---FDANYASNWPLRGIKHTL 308

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGGVR    + SPLL K G++  +++H+ DWLPT+Y IAG
Sbjct: 309 WEGGVRGDGFIHSPLLEKPGRISTDMIHVCDWLPTIYHIAG 349


>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 548

 Score =  102 bits (253), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 55/101 (54%), Positives = 64/101 (63%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD SVGRV+++L   G+L NS+V F SDNG        F  N   NWPLRG K S 
Sbjct: 263 MVDALDVSVGRVVEALSRAGMLENSIVAFSSDNGA---VPFGFRSNRGFNWPLRGIKASV 319

Query: 61  HEGGVRNVAALWSPLLRKGQVLE-NLMHITDWLPTLYFIAG 100
            EGGVR    +WSPLLRK   L   +MHITDWLPTLY  AG
Sbjct: 320 WEGGVRVPGFVWSPLLRKSARLSTQMMHITDWLPTLYAAAG 360


>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 196

 Score =  102 bits (253), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 52/100 (52%), Positives = 71/100 (71%), Gaps = 4/100 (4%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + +LD+SVG V Q+LE++ +L N+++VF+SDNG           N  SNWPLRG K++  
Sbjct: 95  VDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGAIPWGEW---SNAGSNWPLRGAKFTLW 151

Query: 62  EGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
           EGGVR  A +WSPLLRK Q + N LMH++DW+PTLY +AG
Sbjct: 152 EGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLAG 191


>gi|241143299|ref|XP_002404958.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215493668|gb|EEC03309.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 166

 Score =  102 bits (253), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 52/100 (52%), Positives = 70/100 (70%), Gaps = 4/100 (4%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + +LD+SVG V Q+LE++ +L N+++VFISDNG           N  SNWPLRG K++  
Sbjct: 45  VDALDQSVGIVFQALEKKDMLQNTMLVFISDNGAIPWGEWS---NAGSNWPLRGAKFTLW 101

Query: 62  EGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
           EGGVR  A +W PLLRK Q + N LMH++DW+PTLY +AG
Sbjct: 102 EGGVRVPAFVWGPLLRKPQRVSNQLMHVSDWVPTLYSLAG 141


>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 510

 Score =  101 bits (252), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 51/103 (49%), Positives = 69/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LDESVGRV+++L+  G+L ++++VF SDNGG          N   NWPLRG K++ 
Sbjct: 223 MLDALDESVGRVVEALDNAGMLEDTIIVFSSDNGGAPCGEA---SNQGFNWPLRGAKFTL 279

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EG VR  A +WSP  L++ +V   LMHI+DWLPTLY  AG +
Sbjct: 280 WEGSVRVPAFVWSPKFLKQSRVSNQLMHISDWLPTLYTAAGGD 322


>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 514

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 68/105 (64%), Gaps = 3/105 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG+V+++L +  ++ N +++F +DNGGP      F+ N ASNWPLRG K + 
Sbjct: 256 MLSKLDESVGKVVEALADSNMINNCVILFSTDNGGPAGG---FNLNAASNWPLRGVKDTL 312

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR V  +WSP L   +V   ++HITDWLPTL  +     +I
Sbjct: 313 WEGGVRGVGFIWSPFLPSSKVSNAMIHITDWLPTLLSLTNASNSI 357


>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
 gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
          Length = 542

 Score =  101 bits (252), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++DESVG++++ L +  +L NS++VF +DNGGP   +  F+ N ASN+PLRG K + 
Sbjct: 240 LVTAMDESVGKIVEQLRKSRMLENSIIVFSTDNGGP---AEGFNSNFASNYPLRGVKNTL 296

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A LWSP L ++ +  E LMH+ DWLPTL   AG
Sbjct: 297 WEGGVRGAALLWSPQLTKRPRTAEQLMHMVDWLPTLVEAAG 337


>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 562

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/112 (48%), Positives = 76/112 (67%), Gaps = 9/112 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++DESVGRVI++L +  +L NS++VFISDNG  T   L  +G   SN+PLRG K++ 
Sbjct: 267 VVTAMDESVGRVIKALGQSSMLENSIIVFISDNGAQTEGLLENYG---SNYPLRGLKFTL 323

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
            EGG+R VA ++S L++    + N LMHITDWLPT Y  AG      E+N+D
Sbjct: 324 FEGGIRGVACVYSRLIQNSSRISNELMHITDWLPTFYSAAGGNLENLEENMD 375


>gi|241756057|ref|XP_002401379.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215508427|gb|EEC17881.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 287

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 68/98 (69%), Gaps = 6/98 (6%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSFHEG 63
           +D+S+G VI++L +  +LGNS+V+F +DNGG PT        N   NWPLRG K +  EG
Sbjct: 5   MDQSIGLVIEALHKRNMLGNSIVIFSTDNGGLPT----GLGSNYGFNWPLRGFKATLWEG 60

Query: 64  GVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           GVR  A +WSPLLR+ G++   +MHITDWLPTLY  AG
Sbjct: 61  GVRGAAFIWSPLLRRSGRISSQMMHITDWLPTLYSAAG 98


>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 543

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/105 (51%), Positives = 71/105 (67%), Gaps = 4/105 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LDESVG V+++L    +L NS++VF +DNGG       F+ N ASNWPLRGTK + 
Sbjct: 249 ILSKLDESVGLVVEALRRRKMLENSVIVFSTDNGGAPAG---FNLNAASNWPLRGTKNTL 305

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  LWSP L++ G+V   L+HI+DWLPTL   AG + +
Sbjct: 306 WEGGVRGVGLLWSPKLIKPGRVGRQLIHISDWLPTLLTAAGGDPS 350


>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 263

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG + + L++ G+  N++++F +DNGGP      F+ N ASNWPLRG K + 
Sbjct: 83  MVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAAG---FNNNHASNWPLRGIKATL 139

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGGVR    + SPLL K G+V  +++H+ DWLPT+Y IAG
Sbjct: 140 WEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIYHIAG 180


>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 594

 Score =  101 bits (251), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 71/101 (70%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD SVGRV+++L ++G+L N++++F SDNGG     +    NT +N+PLRG K + 
Sbjct: 305 MLAELDVSVGRVVEALAKKGILDNTIILFSSDNGGQATAPME---NTGTNFPLRGQKRTL 361

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            EGG R  A +WSPL+R+  +V  +++HI DWLPT+Y +AG
Sbjct: 362 FEGGTRVPAFVWSPLIRRPRRVFYDMVHIVDWLPTIYSLAG 402


>gi|241836593|ref|XP_002415110.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509322|gb|EEC18775.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 254

 Score =  101 bits (251), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 53/97 (54%), Positives = 65/97 (67%), Gaps = 4/97 (4%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG V++SL   G+L NS+VVF SDNG      L    N  SNWPLRG K +  EGG
Sbjct: 11  LDQSVGMVMESLHRRGMLENSVVVFSSDNGA---RPLGLGANAGSNWPLRGGKSTLWEGG 67

Query: 65  VRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
           VR  A +WSPLL R G++ + +MHI+DWLPT Y  AG
Sbjct: 68  VRGAAFVWSPLLSRVGRLSDQMMHISDWLPTFYTAAG 104


>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 167

 Score =  100 bits (250), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 71/101 (70%), Gaps = 4/101 (3%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG V+++L + G+L NS++VF +DNGG          N+  NWPLRG+K++  EGG
Sbjct: 65  LDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGK---EPNSGYNWPLRGSKFTLWEGG 121

Query: 65  VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
           VR  A +WSPLL K G++   LMHITDWLPT Y  AG++++
Sbjct: 122 VRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAGEKEH 162


>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 470

 Score =  100 bits (250), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 52/101 (51%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K+LD SV  V  +L    +L N++ VF+SDNG PT      H N  SNWPL+G K + 
Sbjct: 171 MMKNLDNSVAEVFNALHTNNMLDNTIFVFVSDNGAPTNG---IHRNYGSNWPLKGEKATP 227

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG+R  A +WS LL K + L N LMHI+DWLPTLY  AG
Sbjct: 228 WEGGIRTAAFVWSKLLSKKKTLPNPLMHISDWLPTLYQAAG 268


>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 490

 Score =  100 bits (250), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LDES+GRV+++L   G+L ++++VF SDNGG          N   NWPLRG K++ 
Sbjct: 255 MLDALDESMGRVVEALSSAGILEDTIIVFSSDNGG---SPYGLESNRGYNWPLRGAKFTL 311

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG VR  A +W+P LL+K +V   LMHI+DWLPTLY  AG
Sbjct: 312 WEGSVRVPAFVWNPKLLKKSRVSNQLMHISDWLPTLYTAAG 352


>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
 gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
          Length = 675

 Score =  100 bits (250), Expect = 9e-20,   Method: Composition-based stats.
 Identities = 50/101 (49%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG++  SL  + +L NS+++F+SDNG PT      H NT SN+PLRG K   
Sbjct: 259 MVSKLDDSVGQIFNSLRSKNMLDNSIILFMSDNGAPTA---ALHANTGSNYPLRGIKSVP 315

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            E   R VAA+WSPLL++ Q + N  +HI+DWLPTL   AG
Sbjct: 316 WEAATRCVAAIWSPLLQERQRVSNQFIHISDWLPTLASAAG 356


>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 689

 Score =  100 bits (250), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 52/101 (51%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D+SVG V+ +L + G+L NS+V+F+SDNG PT  SL    N  SN+PLRG K S 
Sbjct: 288 VVSKVDQSVGEVMDALRKRGMLENSIVLFMSDNGAPTQGSL---SNQGSNYPLRGIKDSP 344

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG R VAA+WSPL+++ Q + N +M I DWLPTL   AG
Sbjct: 345 WEGGTRGVAAIWSPLIKESQRVSNQMMFIADWLPTLLSAAG 385


>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
 gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
          Length = 607

 Score =  100 bits (249), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 50/101 (49%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG++  SL  + +L NS+++F+SDNG PT      H NT SN+PLRG K   
Sbjct: 259 MVSKLDDSVGQIFNSLRSKNMLDNSIILFMSDNGAPTA---ALHANTGSNYPLRGIKSVP 315

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            E   R VAA+WSPLL++ Q + N  +HI+DWLPTL   AG
Sbjct: 316 WEAATRCVAAIWSPLLQERQRVSNQFIHISDWLPTLASAAG 356


>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
          Length = 513

 Score =  100 bits (249), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 73/101 (72%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+SVG++++ L+E+ LL NS+++F+SDNG  T+     + N+ SNWPLRG K + 
Sbjct: 226 IVSRLDDSVGQIVEKLDEKNLLDNSIILFLSDNGAQTIG---VYENSGSNWPLRGLKLTE 282

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  AA++SPL  +KG V + L+HITD+LPT Y  AG
Sbjct: 283 FEGGIRAPAAIYSPLFHQKGYVSKELIHITDFLPTFYAAAG 323


>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
          Length = 775

 Score =  100 bits (249), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ +LD+SVGRV+ +L+ + +L N++++F++DNG  T   L    N  SN+PLRG K++ 
Sbjct: 464 VVATLDKSVGRVVDALKRKDILNNTIIIFMADNGAQTEGYL---ANYGSNYPLRGVKFTL 520

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EG VR VA ++SPL+ R  +V   L HITDWLPTLY  AG
Sbjct: 521 FEGAVRGVACIYSPLIDRSSRVSTKLFHITDWLPTLYSAAG 561



 Score = 92.0 bits (227), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 65/100 (65%), Gaps = 11/100 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR++ +L   G+L +SL++F++DNG   +       N  SN+PLRG     
Sbjct: 158 MVSKLDDSVGRIVHTLGNRGMLRDSLILFLTDNGAAPIGKFR---NFGSNYPLRG----- 209

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIA 99
             GGVR V+ LWSP L K  +V +NL+HI+DWLPTLY  A
Sbjct: 210 --GGVRGVSVLWSPRLNKPARVCDNLIHISDWLPTLYAAA 247


>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
 gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
          Length = 542

 Score =  100 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 49/103 (47%), Positives = 69/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+SVG+++  L +  +L NS+++F SDNGGP   +  F+ N ASN+PL+G K + 
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 287

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
            EGGVR    +WSPLL+K Q + N  MHI DWLPTL   AG +
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQ 330


>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 567

 Score =  100 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 52/120 (43%), Positives = 75/120 (62%), Gaps = 14/120 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD--------SLHFHGNTASNWP 52
           M+  LDESVGRV+++L E+ +L N+++VF+SDNG P+ D              N  SN+P
Sbjct: 257 MVSKLDESVGRVVEALTEKKMLQNTIIVFMSDNGSPSFDGSGRNFRPEAGVTANWGSNFP 316

Query: 53  LRGTKYSFHEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKE-----KNID 106
            RG K +  EGGV++ + +W+P  ++  +V + LMHITDWLPTLY  AG       KN+D
Sbjct: 317 YRGIKNTLWEGGVKSASFIWAPYFQENPRVSKQLMHITDWLPTLYSAAGGNPGFLPKNLD 376


>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 571

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 74/101 (73%), Gaps = 3/101 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++D+SVGRV+++L++  +L NS+++F+SDNG PT ++   + N  SN+PLRG K + 
Sbjct: 254 VVTAMDDSVGRVVKALKDANMLSNSIIIFMSDNGSPTAEAP--YTNYGSNYPLRGIKATV 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A ++SP L+ + +V + L HITDW PTLY +AG
Sbjct: 312 FEGGVRVPACVFSPRLKDRFRVSDELFHITDWFPTLYKLAG 352


>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
 gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
          Length = 544

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+SVG+++  L +  +L NS+++F SDNGGP   +  F+ N ASN+PL+G K + 
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 287

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGGVR    +WSPLL+K Q + N  MHI DWLPTL   AG
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAG 328


>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
 gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
          Length = 478

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+SVG+++  L +  +L NS+++F SDNGGP   +  F+ N ASN+PL+G K + 
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 287

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGGVR    +WSPLL+K Q + N  MHI DWLPTL   AG
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAG 328


>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
          Length = 543

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 71/110 (64%), Gaps = 7/110 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD+SVG V++SL +  +L NS+++F +DNGG       F+ N ASN+PLRG K + 
Sbjct: 247 MLSKLDDSVGAVVESLSKRKMLKNSIIIFSTDNGGAAAG---FNLNAASNFPLRGVKNTV 303

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKE---KNID 106
            EGGVR    LWSPL+++ Q V +  MHI+DWLPTL    G     KNID
Sbjct: 304 WEGGVRGAGLLWSPLIKQPQRVAQQFMHISDWLPTLLSAVGANSSLKNID 353


>gi|241574524|ref|XP_002403353.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500210|gb|EEC09704.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 255

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/103 (48%), Positives = 68/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDES+G V Q+L + G+L N+++V  SDNGG  +    F  N   N+PLRG K + 
Sbjct: 1   MVDALDESIGEVFQALGDAGILENTIIVLSSDNGGLPI---GFQSNRGFNFPLRGGKGTL 57

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG R    +WSPLL RKG   + +MH+TDWLPTLY  AG++
Sbjct: 58  WEGGCRATGFIWSPLLKRKGVASDQMMHVTDWLPTLYSAAGED 100


>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 479

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 46/101 (45%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+S+G + Q+L+E  L  N+++VF +DNGGP   +  F  N A+NWPLRG K + 
Sbjct: 257 MVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGP---AHGFDANYANNWPLRGVKDTT 313

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R    LW  L+ K G+  + +MH+ DW+PTLY +AG
Sbjct: 314 WEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAG 354


>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
          Length = 354

 Score = 99.4 bits (246), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 71/106 (66%), Gaps = 11/106 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTK 57
           M+ +LDESVG + +SL   G+L N++++F SDNG P       HGNT   +SN+PLRG K
Sbjct: 225 MISALDESVGNLTKSLHANGMLENTIIIFSSDNGAP-------HGNTDVCSSNYPLRGAK 277

Query: 58  YSFHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
            S  EGG R  A +WSPLL+K G V + +MHI+DWLPTL   AG +
Sbjct: 278 SSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLKAAGYD 323


>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
          Length = 418

 Score = 99.4 bits (246), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 53/102 (51%), Positives = 68/102 (66%), Gaps = 7/102 (6%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSF 60
           + +LDESVGR++ +L +  +L NS++VF SDNGG P  D      NT SNWPLRGTK + 
Sbjct: 115 VDALDESVGRIVAALHQRRMLANSVLVFTSDNGGIPWGD----FSNTGSNWPLRGTKGTM 170

Query: 61  HEGGVRNVAALWSPLLRKGQ--VLENLMHITDWLPTLYFIAG 100
            EGGVR  A +WS +L  G   V+ + MH+ DWLPTLY  AG
Sbjct: 171 WEGGVRLPAVVWSSMLAPGSRVVVPHPMHLVDWLPTLYSAAG 212


>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
          Length = 1007

 Score = 99.4 bits (246), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 55/125 (44%), Positives = 71/125 (56%), Gaps = 26/125 (20%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGT---- 56
           M+  +DE VG++IQ+L E  +L NS+V+F +DNG PTV     H N+ SN+PLRG     
Sbjct: 682 MISKVDEGVGQIIQTLAERDMLDNSIVLFYADNGAPTVG---MHSNSGSNFPLRGVCIDN 738

Query: 57  -----------------KYSFHEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYF 97
                            KYS  EG VR VAA+WSPLL    G+V    +H++DWLPTL  
Sbjct: 739 WSTLHIRCTNGCFSFQQKYSPWEGAVRTVAAIWSPLLNLTAGRVSNQWIHVSDWLPTLAH 798

Query: 98  IAGKE 102
            AG E
Sbjct: 799 AAGIE 803


>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 476

 Score = 99.4 bits (246), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 46/101 (45%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+S+G + Q+L+E  L  N+++VF +DNGGP   +  F  N A+NWPLRG K + 
Sbjct: 254 MVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGP---AHGFDANYANNWPLRGVKDTT 310

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R    LW  L+ K G+  + +MH+ DW+PTLY +AG
Sbjct: 311 WEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAG 351


>gi|241861175|ref|XP_002416317.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215510531|gb|EEC19984.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 99

 Score = 99.0 bits (245), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 66/101 (65%), Gaps = 5/101 (4%)

Query: 1  MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
          M+ +LDESVG V Q+L    +L N++ VF SDNGG   D+   + N AS+WPL+G KY+ 
Sbjct: 1  MVSALDESVGAVFQALNRRNMLSNTVFVFTSDNGG---DTESENANFASSWPLKGQKYTP 57

Query: 61 HEGGVRNVAALWSPLLRK--GQVLENLMHITDWLPTLYFIA 99
           EGGV   A +WSP   +  G V  NL HI+DWLPTLY +A
Sbjct: 58 WEGGVHAPAVIWSPRFSRLLGSVYSNLFHISDWLPTLYQLA 98


>gi|241844556|ref|XP_002415496.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509708|gb|EEC19161.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 328

 Score = 99.0 bits (245), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/97 (54%), Positives = 66/97 (68%), Gaps = 4/97 (4%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG V+++L + G+L NS+VVF +DNGG          N+  NWPLRG K +  EGG
Sbjct: 47  LDKSVGLVVEALNKRGMLSNSIVVFSTDNGG---LPWGIDPNSGYNWPLRGAKATLWEGG 103

Query: 65  VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           VR  A +WSPLL K G++   LMHITDWLPTLY  AG
Sbjct: 104 VRGAAFVWSPLLFKSGRLSNQLMHITDWLPTLYSAAG 140


>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 159

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 68/102 (66%), Gaps = 4/102 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+S+G ++++L+E  L+GN++VVF SDNG           N   NWPLRG K++ 
Sbjct: 60  MVDALDQSIGTIMEALQEADLIGNTIVVFTSDNGA---LPWGVKSNRGYNWPLRGGKFTL 116

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK 101
            EGG+R    +WSPLL   + + N +MHITDWLPTLY  AG+
Sbjct: 117 WEGGLRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAGE 158


>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
 gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
          Length = 550

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG+ +++L + G+L NS+++  SDNG PTV     H N  SN+P RG K S 
Sbjct: 254 MISSLDKSVGQTVKALSDNGMLNNSIILLYSDNGAPTVG---IHSNQGSNYPFRGQKESP 310

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R+  A+WSPLL ++G V   ++H  DWLPTL   A  E
Sbjct: 311 WEGGIRSAGAIWSPLLQQRGYVSNQIIHAIDWLPTLAAAANVE 353


>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 546

 Score = 98.6 bits (244), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 52/112 (46%), Positives = 76/112 (67%), Gaps = 9/112 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++DESVGRVI++L +  +L NS++VF+SDNG  T   L  +G   SN+PLRG K++ 
Sbjct: 252 VVTAMDESVGRVIKALGQSSMLENSIIVFMSDNGAQTEGLLENYG---SNYPLRGLKFTL 308

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGK-----EKNID 106
            EGG+R VA ++S L++    + N L+HITDWLPT Y  AG      E+N+D
Sbjct: 309 FEGGIRGVACVYSRLIQNSSRISNELIHITDWLPTFYSAAGGNLENLEENMD 360


>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
 gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score = 98.6 bits (244), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D ++ +V+ SL+   +  NS++VF +DNGGP      F  N ASN+PLRG K + 
Sbjct: 248 MVSSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANG---FDSNMASNFPLRGVKRTL 304

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A + SPL+ K G+V+  LMH++DWLPTLY +AG
Sbjct: 305 WEGGIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAG 345


>gi|241847930|ref|XP_002400856.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509831|gb|EEC19284.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 275

 Score = 98.2 bits (243), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 51/96 (53%), Positives = 66/96 (68%), Gaps = 4/96 (4%)

Query: 5  LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
          LD+SVG V+++L + G+L NS++VF +DNGG          N+  NWPLRG K++  EGG
Sbjct: 5  LDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGE---EPNSGYNWPLRGAKFTLWEGG 61

Query: 65 VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIA 99
          VR  A +WSPLL K G++   LMHITDWLPTLY  A
Sbjct: 62 VRGAAFVWSPLLSKSGRLSNQLMHITDWLPTLYSAA 97


>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
 gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
          Length = 575

 Score = 98.2 bits (243), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 48/103 (46%), Positives = 65/103 (63%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR++ +L   G L NS+V+F SDNG P+V       NT SNWPLRG K + 
Sbjct: 265 MVSKLDQSVGRIVSALNSTGQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLRGQKNTP 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR   A+WS  L+ +G +    +++ DWLP+L   AG E
Sbjct: 322 WEGGVRVAGAIWSAQLQARGNIFTQPIYVADWLPSLAHAAGIE 364


>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 671

 Score = 98.2 bits (243), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 74/111 (66%), Gaps = 8/111 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+SVG V+ +L   G+L NS+VVF+SDNG P+   L    N  SN+PLRG K S 
Sbjct: 257 VVSKLDQSVGDVMDALRNRGMLENSIVVFMSDNGAPSQGLL---SNEGSNYPLRGIKNSP 313

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK----NID 106
            EGG R VAA+WSPL++K + + N +M ++DWLPTL   AG ++    NID
Sbjct: 314 WEGGTRGVAAIWSPLIKKSKRVSNQMMFMSDWLPTLLSAAGVDRRQLGNID 364


>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 713

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 74/111 (66%), Gaps = 8/111 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+SVG V+ +L   G+L NS+V+F+SDNG P+   L+   N  SN+PLRG K S 
Sbjct: 301 VVSKLDQSVGDVMDALRNRGMLENSIVLFMSDNGAPSQGILN---NKGSNYPLRGIKNSP 357

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEK----NID 106
            EGG R VAA+WSPL++K + + N +M I+DWLPTL   AG  +    NID
Sbjct: 358 WEGGTRGVAAIWSPLIKKSKRVSNQMMFISDWLPTLLSAAGVNRKQLGNID 408


>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
 gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
          Length = 545

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DESVG ++  L +  +L NS+++F SDNGGP   +  F+ N ASN+PLRG K + 
Sbjct: 234 MVAKMDESVGVIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLRGVKNTL 290

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
            EGGVR    +WSPLL K Q V++  MHI+DWLPTL   AG
Sbjct: 291 WEGGVRAAGLVWSPLLEKRQRVVDETMHISDWLPTLIEAAG 331


>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
 gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
          Length = 545

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DESVG ++  L +  +L NS+++F SDNGGP   +  F+ N ASN+PLRG K + 
Sbjct: 234 MVAKMDESVGVIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLRGVKNTL 290

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
            EGGVR    +WSPLL K Q V++  MHI+DWLPTL   AG
Sbjct: 291 WEGGVRAAGLVWSPLLEKRQRVVDETMHISDWLPTLIEAAG 331


>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
 gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
          Length = 542

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG+ +++L++  +L N++V+  SDNG PTV     H N  SN+P RG K S 
Sbjct: 218 MISSLDKSVGQTMRALKDNDMLNNTIVLLYSDNGAPTVG---IHSNAGSNYPYRGQKESP 274

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R+V ALWSPLL K G V    +H  DWLPTL   AG
Sbjct: 275 WEGGIRSVGALWSPLLEKQGYVSNQAIHAIDWLPTLAAAAG 315


>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
          Length = 546

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/103 (49%), Positives = 69/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG V+++L +  +L NS++VF +DNGGP      F+ N ASN+PLRGTK + 
Sbjct: 252 MLSKLDESVGLVVEALRKRKMLENSVIVFSTDNGGPPAG---FNLNAASNFPLRGTKNTL 308

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR    +WSP L+  G++   L+HI+DWLPTL    G +
Sbjct: 309 WEGGVRGTGLVWSPKLINPGRISRQLIHISDWLPTLLTAVGGD 351


>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
 gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
          Length = 619

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGRVI +L     L NS+V+F SDNG P+V       NT SNWPL+G K S 
Sbjct: 261 MVSKLDESVGRVITALARTDQLENSIVIFYSDNGAPSVGMF---SNTGSNWPLKGQKNSP 317

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R   A+WSPLL+ +G +    +++ D+LPTL   AG E
Sbjct: 318 WEGGLRVAGAIWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIE 360


>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
 gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
          Length = 633

 Score = 98.2 bits (243), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+SVG+++  L +  +L NS+++F SDNGGP      F+ N ASN+PL+G K + 
Sbjct: 231 MVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQG---FNLNFASNYPLKGVKNTL 287

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
            EGGVR    +WSPLL+K Q + N  MHI DWLPTL   AG +
Sbjct: 288 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQ 330


>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 311

 Score = 97.4 bits (241), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG V+ +L++ G+L ++++VF SDNG        F  N   NWPLRG K++ 
Sbjct: 214 MVDALDQSVGIVVDTLQKAGMLNDTIIVFCSDNG---ALPWGFKSNRGYNWPLRGGKFTL 270

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
           +EGGVR  A LWS  L+  + + N LMHI+DWLPTLY  AG
Sbjct: 271 YEGGVRGTAFLWSRRLKNTKRVSNQLMHISDWLPTLYSAAG 311


>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
 gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
          Length = 575

 Score = 97.4 bits (241), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 64/101 (63%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR++ +L   G L NS+V+F SDNG P+V       NT SNWPLRG K + 
Sbjct: 265 MVSKLDQSVGRIVSALNSTGQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLRGQKNTP 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR   A+WS  L+ +G +    +++ DWLP+L   AG
Sbjct: 322 WEGGVRVAGAIWSAQLQARGNIFTQPIYVADWLPSLAHAAG 362


>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 545

 Score = 97.4 bits (241), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 50/101 (49%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D SVG V+++LE+  +L + L+VF +DNGGP   +  F+ N ASNWPLRG K + 
Sbjct: 251 MASAMDASVGAVVEALEKRNMLRDCLIVFSTDNGGP---AEGFNLNAASNWPLRGVKDTL 307

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  +WS  L R G++ E   HI DWLPTL   AG
Sbjct: 308 FEGGVRGVGLVWSERLARPGRLSEATFHIADWLPTLLRAAG 348


>gi|21483362|gb|AAM52656.1| HL07902p [Drosophila melanogaster]
          Length = 365

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SV + I +L++ G+L NS+++  SDNG PT+     H N  SN+P RG K S 
Sbjct: 41  MISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 97

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+  ALWSPLL+ +G V    +H  DWLPTL   AG
Sbjct: 98  WEGGIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAG 138


>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
          Length = 502

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/104 (51%), Positives = 72/104 (69%), Gaps = 5/104 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV-DSLHFHGNTASNWPLRGTKYS 59
           ML +LDESVG+V ++L  +G+L N+++VF SDNG P + +S H  G   SN+PLRG K S
Sbjct: 245 MLTALDESVGKVTKALHAKGMLENTIIVFSSDNGAPHIGNSDHTVG---SNFPLRGGKPS 301

Query: 60  FHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
             EGG R  A +WSPL +K G + + LMHI+DWLPTL   AG +
Sbjct: 302 IWEGGSRVPAFVWSPLFKKSGYISKQLMHISDWLPTLLQAAGYD 345


>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
 gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
          Length = 525

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 52/101 (51%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGRVI++L    +L NS+V+F+SDNGGP+V     H NTASN+PLRG K++ 
Sbjct: 223 MVSRLDKSVGRVIEALANREMLRNSIVLFLSDNGGPSVGE---HANTASNYPLRGQKHTP 279

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 280 WEGGIRSSAAIWSTEFERLGSVWKQRIYIGDLLPTLCSAAG 320


>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
 gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
 gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
          Length = 562

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D SVG+++  L +  +L NS+++F SDNGGP   +  F+ N ASN+PL+G K + 
Sbjct: 251 MVSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGP---AQGFNLNFASNYPLKGVKNTL 307

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
            EGGVR    +WSPLL+K Q + N  MHI DWLPTL   AG +
Sbjct: 308 WEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGQ 350


>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
 gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
          Length = 560

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVGRVI +L     L NS+V+F SDNG P+V       NT SNWPL+G K S 
Sbjct: 261 MVSKLDESVGRVITALARTDQLENSIVIFYSDNGAPSVG---IFANTGSNWPLKGQKDSP 317

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R   A+WSPLL+ +G +    +++ D+LPTL   AG E
Sbjct: 318 WEGGLRVAGAIWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIE 360


>gi|241237879|ref|XP_002401191.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215496126|gb|EEC05767.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 117

 Score = 97.1 bits (240), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 50/104 (48%), Positives = 67/104 (64%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LDES+GRV++ L   G+LG++++ F SDNGG          N   NWPLRG K + 
Sbjct: 1   MLDALDESIGRVVEVLGNAGMLGDTIIAFSSDNGGSPFGPFS---NRGFNWPLRGAKRTL 57

Query: 61  HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
            EGGVR  A +W S  L+K +V + LMHI+DWLPTLY  A  ++
Sbjct: 58  WEGGVRVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAVDQR 101


>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
          Length = 549

 Score = 96.7 bits (239), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/104 (50%), Positives = 69/104 (66%), Gaps = 11/104 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTK 57
           M+ +LDESVG + +SL   G+L N++++F SDNG P       H NT   +SN+PLRG K
Sbjct: 225 MISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAP-------HENTDVCSSNYPLRGAK 277

Query: 58  YSFHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            S  EGG R  A +WSPLL+K G V + +MHI+DWLPTL   AG
Sbjct: 278 TSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAG 321


>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
 gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
          Length = 579

 Score = 96.7 bits (239), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SV + I +L++ G+L NS+++  SDNG PT+     H N  SN+P RG K S 
Sbjct: 255 MISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+  ALWSPLL+ +G V    +H  DWLPTL   AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAG 352


>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
          Length = 542

 Score = 96.3 bits (238), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/104 (50%), Positives = 69/104 (66%), Gaps = 11/104 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTK 57
           M+ +LDESVG + +SL   G+L N++++F SDNG P       H NT   +SN+PLRG K
Sbjct: 248 MISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAP-------HENTDVCSSNYPLRGAK 300

Query: 58  YSFHEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            S  EGG R  A +WSPLL+K G V + +MHI+DWLPTL   AG
Sbjct: 301 TSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAG 344


>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
 gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
          Length = 579

 Score = 96.3 bits (238), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SV + I +L++ G+L NS+++  SDNG PT+     H N  SN+P RG K S 
Sbjct: 255 MVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+  ALWSPLL+ +G V    +H  DWLPTL   AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352


>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
 gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
          Length = 531

 Score = 96.3 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 46/96 (47%), Positives = 66/96 (68%), Gaps = 4/96 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++DESVG++++ L +  +L  S++VF +DNGGP   +  F+ N ASN+PLRG K + 
Sbjct: 229 LVTAMDESVGKIVEQLRKSRMLETSIIVFSTDNGGP---AEGFNSNFASNYPLRGVKNTL 285

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            EGGVR  A LWSP L ++ +  E  MHI DWLPTL
Sbjct: 286 WEGGVRGAALLWSPQLTKRPRTAEQTMHIVDWLPTL 321


>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
 gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
          Length = 579

 Score = 95.9 bits (237), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SV + I +L++ G+L NS+++  SDNG PT+     H N  SN+P RG K S 
Sbjct: 255 MVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+  ALWSPLL+ +G V    +H  DWLPTL   AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352


>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 451

 Score = 95.9 bits (237), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML SLDES+GRV++ L   G+L ++++ F SDNGG   ++     N   NWPLRG K + 
Sbjct: 164 MLDSLDESIGRVVEILGNAGMLEDTIIAFSSDNGGLPYNAF---SNRGFNWPLRGAKGTL 220

Query: 61  HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A +W S  L+K +V + LMHI+DWLPTLY  AG
Sbjct: 221 WEGGVRVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAG 261


>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
 gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
          Length = 578

 Score = 95.9 bits (237), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SV + + +L++ G+L NS+++  SDNG PT+     H N  SN+P RG K S 
Sbjct: 255 MISSLDKSVAQTVGALKDNGMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+  ALWSPLL+ +G V    +H  DWLPTL   AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352


>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
          Length = 545

 Score = 95.9 bits (237), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 68/103 (66%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG + ++L  +G+L N+++VF +DNGGP   +  F  N ASNWPLRG K + 
Sbjct: 259 MISALDDSVGNITRTLHAKGMLNNTIIVFSTDNGGP---ADRFDDNCASNWPLRGGKRTM 315

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKE 102
            EGG+R    +WSPLL+    V E+ M I DWLPTL   AG +
Sbjct: 316 WEGGLRGNGFVWSPLLKHSNYVSEHFMSIMDWLPTLLDAAGYD 358


>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
          Length = 563

 Score = 95.5 bits (236), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LDESVG V+ +L+ +G+L +S+VVF +DNGG       F+ N ASN+PLRG K + 
Sbjct: 273 VLTKLDESVGEVVAALKAKGVLNDSIVVFTTDNGGAAAG---FNDNAASNYPLRGVKNTL 329

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    LWSP + ++ +V    MH+ DWLPTL   AG
Sbjct: 330 WEGGVRGAGWLWSPFIDKRSRVATQRMHLVDWLPTLLSAAG 370


>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 406

 Score = 95.5 bits (236), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 65/102 (63%), Gaps = 4/102 (3%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L  LD+SVG ++++L   G L N+++ F +DNGG  V    F  NT+ NWPLRGTK +  
Sbjct: 259 LAELDKSVGLIVEALYVSGQLNNTVIAFSTDNGGAPVG---FSANTSPNWPLRGTKGTVA 315

Query: 62  EGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           EGGVR    LW S L  +G+V + L H+TDW+PT Y  AG +
Sbjct: 316 EGGVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPTFYTAAGGQ 357


>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 418

 Score = 95.1 bits (235), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D++ GR++++L   G+L N++VVF SDNG   V  + F  N   NWPLRG K + 
Sbjct: 162 MVDSMDDAFGRLVEALSGAGVLNNTIVVFSSDNGA--VPIIEF-PNRGFNWPLRGAKKTL 218

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EG VR  A +WSPLL   G+V + +MHI DWLPT Y +AG
Sbjct: 219 WEGAVRVPAFIWSPLLESSGRVSDQMMHIVDWLPTFYSVAG 259


>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
          Length = 358

 Score = 95.1 bits (235), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 48/105 (45%), Positives = 70/105 (66%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG +  +L+E GL  N++++F +DNGG T+    + GN   NWPLRG K++ 
Sbjct: 247 MVSAMDEAVGNITHTLQETGLWDNTVLIFSTDNGGQTL----YGGN---NWPLRGRKWTL 299

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR VA +  PL+ K G V   L+HI+DWLPT+  +AG   N
Sbjct: 300 WEGGVRGVAFVSGPLIEKPGAVNRELIHISDWLPTIAGLAGASTN 344


>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 710

 Score = 95.1 bits (235), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG V+ +L  + +L NS+VVF++DNG  T      H N  SN+PLRG K S 
Sbjct: 273 MMSKLDQSVGEVVSALRRKNMLQNSIVVFMADNGAATQG---IHYNRGSNYPLRGIKASA 329

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EG VR  AA+WSPL+++ + + N LM I DWLPTL   +G
Sbjct: 330 WEGAVRGAAAVWSPLIQRPKRIYNELMSIADWLPTLLSASG 370


>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 529

 Score = 94.7 bits (234), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 50/97 (51%), Positives = 66/97 (68%), Gaps = 4/97 (4%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG V+++L + G+L NS+V+F +DNGG          N+  NWPLRG+K +  EGG
Sbjct: 248 LDKSVGLVMEALNKRGMLSNSIVIFSTDNGG---LPWGVEPNSGYNWPLRGSKETNWEGG 304

Query: 65  VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            R  A +WSPLL K G++   +MHITDWLPTLY  AG
Sbjct: 305 ARGAAFVWSPLLFKSGRLSNQMMHITDWLPTLYSAAG 341


>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
 gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
          Length = 573

 Score = 94.4 bits (233), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  VGR+ +SLE   +L N++++F +DNG PT   L  H N+ SN+PLRG K S 
Sbjct: 254 MVSRIDTGVGRIYRSLERRRMLNNTIILFYADNGAPT---LGIHANSGSNYPLRGQKESP 310

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EG VR  A +WS LL RKG V    +H++DWLPTL   AG
Sbjct: 311 WEGAVRGAALIWSRLLPRKGIVSNQWLHVSDWLPTLGHAAG 351


>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
 gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
          Length = 567

 Score = 94.4 bits (233), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR+I +L +   L NS+V+F SDNG P+V       NT SNWPLRG K + 
Sbjct: 256 MVAKLDQSVGRIITTLAQTEQLENSIVIFYSDNGAPSVGQF---ANTGSNWPLRGQKNTP 312

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R   A+WS  L+ +G      +++ DWLPTL   AG
Sbjct: 313 WEGGIRVAGAIWSTQLQARGNKFTQPIYVADWLPTLAHAAG 353


>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 511

 Score = 94.4 bits (233), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LDES+G V ++L +  +L N+++V  +DNGG       F  N   N+PLRG K + 
Sbjct: 215 MTDALDESIGEVFRALFDADMLDNTIIVLSADNGGL---PFGFESNRGYNFPLRGGKGTL 271

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +WSPLL RKG V + +MHITDW+PTLY  AG
Sbjct: 272 WEGGCRASAFVWSPLLKRKGVVSDQMMHITDWVPTLYSAAG 312


>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
 gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
          Length = 576

 Score = 94.0 bits (232), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 50/103 (48%), Positives = 65/103 (63%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGRVI +L     L NS+V+F SDNG P+V       NT SNWPL+G K S 
Sbjct: 263 MVSKLDKSVGRVIGALARAEQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLKGQKNSP 319

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R   A+WSPLL+ +  V    +++ D+LPTL   AG E
Sbjct: 320 WEGGLRVAGAIWSPLLKARSSVFTQPIYVGDFLPTLAHAAGIE 362


>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
 gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
          Length = 577

 Score = 94.0 bits (232), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SV + I +L++  +L NS+++  SDNG PT+     H N  SN+P RG K S 
Sbjct: 255 MISSLDKSVAQTIGALKDNDMLNNSIILLYSDNGAPTIG---IHSNAGSNYPYRGQKESP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+  ALWSPLL+ +G V    +H  DWLPTL   AG
Sbjct: 312 WEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG 352


>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
 gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
          Length = 554

 Score = 94.0 bits (232), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 48/103 (46%), Positives = 64/103 (62%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR+I +L     L NS+V+F SDNG P+V       NT SN+PLRG K + 
Sbjct: 255 MISKLDQSVGRIISTLSSTDQLDNSIVIFYSDNGAPSVGMF---SNTGSNFPLRGQKNTP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR   A+WS  L+ +G +    +++ DWLPTL   AG E
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLSRAAGIE 354


>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
          Length = 519

 Score = 93.6 bits (231), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG +  +L++EGL  N+++VF +DNGG T+          SNWPLRG K+S 
Sbjct: 254 MVSAMDEAVGNITMALQKEGLWNNTVLVFSTDNGGQTLSG-------GSNWPLRGRKWSL 306

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  + SPLL++ G V   L+HI+DWLPTL  +AG
Sbjct: 307 WEGGVRGVGFVASPLLKQPGTVNRELIHISDWLPTLVGLAG 347


>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
 gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
          Length = 556

 Score = 93.2 bits (230), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDESVG ++  L    +L NS++VF +DNGG       F+ N ASN+PLRG K + 
Sbjct: 234 MVSKLDESVGIIVDQLRRSNMLENSIIVFATDNGGAAEG---FNLNFASNYPLRGVKNTL 290

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR V  LWS   L++ +V E  MHITDWLPT   +AG ++ +
Sbjct: 291 WEGGVRGVGLLWSQRYLKRPRVAEQTMHITDWLPTFVEVAGGKEAL 336


>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 433

 Score = 93.2 bits (230), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/106 (46%), Positives = 71/106 (66%), Gaps = 5/106 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG V+++L   G+LG++++VF SDNG    +S   + N AS WP +G K + 
Sbjct: 141 MVSALDESVGAVVEALGRRGMLGDTVLVFSSDNG---ANSRGENPNYASAWPFKGQKITP 197

Query: 61  HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR  A +WSP L   +G+   N+ HI+DWLPTLY +AG + +
Sbjct: 198 WEGGVRAPAIIWSPSLSGTQGRDYSNIFHISDWLPTLYQLAGGDPS 243


>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
 gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
          Length = 528

 Score = 93.2 bits (230), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 50/105 (47%), Positives = 69/105 (65%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG +  +L+E GL  N+++VF +DNGG T+          SNWPLRG K+S 
Sbjct: 253 MVSAMDEAVGNITLALQETGLWDNTVLVFSTDNGGQTLAG-------GSNWPLRGRKWSL 305

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL++ G V   L+HI+DWLPTL  +AG   N
Sbjct: 306 WEGGVRGVGFVTSPLLKQPGTVSRQLIHISDWLPTLVGLAGGSTN 350


>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 526

 Score = 93.2 bits (230), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 67/101 (66%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG + ++L++ GL  NS++VF +DNGGP       H   A N PLRGTK + 
Sbjct: 254 MVSALDDSVGNITRALKKSGLYNNSIIVFSTDNGGP-------HECEAFNTPLRGTKNTL 306

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
            EGG+R  A + S LL K + V E +MH+ DWLPT+Y +AG
Sbjct: 307 WEGGIRGAAFIHSVLLEKPKRVCEGMMHVVDWLPTMYHVAG 347


>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
          Length = 500

 Score = 92.8 bits (229), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SV  V ++L E+G+L +++++F +DNGGP     +   N A NWPLRG+K + 
Sbjct: 252 MVSALDDSVSNVTKALHEKGILNDTIIIFSTDNGGPANRQDY---NDACNWPLRGSKRTM 308

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    +WSPLL   G V E+LM I DW+PT+   AG
Sbjct: 309 WEGGVRGNGFIWSPLLENSGYVSEHLMQIVDWVPTVLEAAG 349


>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
 gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
          Length = 564

 Score = 92.4 bits (228), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD +VG ++ +L    +L N+++VF SDNGGP      F+ N ASNWPLRG K + 
Sbjct: 263 MMVELDRTVGSLVDALHARDMLENTIIVFSSDNGGPADG---FNDNAASNWPLRGVKNTL 319

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R    +WSPLL+   +V   ++ + DWLPTLY  AG
Sbjct: 320 WEGGLRAAGFIWSPLLQNVSRVSHQMVQVCDWLPTLYEAAG 360


>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
 gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
          Length = 554

 Score = 92.4 bits (228), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/103 (46%), Positives = 63/103 (61%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR+I +L     L NS+V+F SDNG P+V       NT SN+PLRG K + 
Sbjct: 255 MISKLDQSVGRIISTLSSTDQLENSIVIFYSDNGAPSVGMF---SNTGSNFPLRGQKNTP 311

Query: 61  HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR   A+W S L  +G +    +++ DWLPTL   AG E
Sbjct: 312 WEGGVRVAGAIWNSKLQARGSIFSQPLYVADWLPTLSRAAGIE 354


>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
 gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
          Length = 570

 Score = 92.4 bits (228), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/103 (46%), Positives = 62/103 (60%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR+I +L     L NS+V+F SDNG P+V       NT SNWPLRG K + 
Sbjct: 258 MVSKLDQSVGRIISTLASTDQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLRGQKNTP 314

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR   A+WS  L  +G +    +++ DWLPTL   A  E
Sbjct: 315 WEGGVRVAGAVWSSKLEARGSIFTQPIYVGDWLPTLAHAADIE 357


>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 336

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 47/106 (44%), Positives = 68/106 (64%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  +D  +G++   L ++ +L +S+++F SDNGGP       + N A+N+PLRGTK + 
Sbjct: 80  MLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANG---LNNNWATNYPLRGTKTTV 136

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKNI 105
           +EGGVR  A +WS  L K  +V  +LMHITDWLPTL   AG +  +
Sbjct: 137 YEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSAAGVDTTL 182


>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
 gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
          Length = 548

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 52/101 (51%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD SVGRVI +L  + +L +S+V+F+SDNGGPTV     H +TASN+PLRG K S 
Sbjct: 225 MVSRLDRSVGRVIDALARQQMLQDSIVLFLSDNGGPTVGE---HSSTASNYPLRGQKNSP 281

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+ AA+WS    R G V +   +I D LPTL   AG
Sbjct: 282 WEGGLRSSAAIWSTEFERLGTVWKQQTYIGDLLPTLAAAAG 322


>gi|323454325|gb|EGB10195.1| hypothetical protein AURANDRAFT_62802 [Aureococcus anophagefferens]
          Length = 2048

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 50/103 (48%), Positives = 62/103 (60%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD +V +V+ +L    LL N++V F SDNGG   D         SNWPLRGTK+S 
Sbjct: 412 MVMNLDNNVRKVVDALAAAKLLQNTVVAFASDNGGCPRDG-------GSNWPLRGTKFSQ 464

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A LWS  L+K   G V E L H+ DW PTL   AG
Sbjct: 465 YEGGVRVPAFLWSSTLQKRGAGTVFEGLFHVADWFPTLVASAG 507


>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
 gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
          Length = 565

 Score = 92.0 bits (227), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR++ SL     L NS+V+F SDNG P+V       NT SN+PLRG K + 
Sbjct: 255 MISKLDQSVGRIMSSLAATDQLENSIVIFYSDNGAPSVGMF---ANTGSNFPLRGQKNTP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR   A+WS  L+ +G +    +++ DWLPTL   AG
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLCRAAG 352


>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
 gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
          Length = 548

 Score = 91.7 bits (226), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 52/101 (51%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD SVGRVI +L  + +L +S+V+F+SDNGGPTV     H  TASN+PLRG K S 
Sbjct: 225 MVSRLDRSVGRVIDALARQQMLQDSIVLFLSDNGGPTVGE---HSTTASNYPLRGQKNSP 281

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R+ AA+WS    R G V +   +I D LPTL   AG
Sbjct: 282 WEGGLRSSAAIWSTEFERLGTVWKQQTYIGDLLPTLAAAAG 322


>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 527

 Score = 91.7 bits (226), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 50/100 (50%), Positives = 66/100 (66%), Gaps = 4/100 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDES G V+++L E G+L N+++V  S NGG    S     N  SN+PLRG K + 
Sbjct: 241 MVDALDESAGDVLEALYEAGMLANTIIVMSSANGG---LSSGVESNAGSNFPLRGGKGAL 297

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGG R  A +WSPLL +K +V + +MHITDWLPTLY  A
Sbjct: 298 WEGGTRASAFIWSPLLYQKNRVSDQMMHITDWLPTLYAAA 337


>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
          Length = 500

 Score = 91.7 bits (226), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 49/102 (48%), Positives = 67/102 (65%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           M+ +LD+SVG +++ LE+ G+L ++++VF +DNGG P   S     N   NWPLRG K +
Sbjct: 249 MVDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRS---DPNRGFNWPLRGIKGT 305

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
             EGG+R  A +WSP L    +V   LMHI+DWLPTLY  AG
Sbjct: 306 VWEGGIRGTAFIWSPRLELSSRVSTQLMHISDWLPTLYSAAG 347


>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
 gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
          Length = 565

 Score = 91.7 bits (226), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 47/103 (45%), Positives = 64/103 (62%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+SVGR++ +L     L NS+V+F SDNG P   SL    NT SN+PLRG K + 
Sbjct: 255 IISKLDQSVGRIMSTLAATDQLENSIVIFYSDNGAP---SLGMFANTGSNFPLRGQKNTP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR   A+WS  L+ +G +    +++ DWLPTL   AG E
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIE 354


>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
 gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
          Length = 498

 Score = 91.3 bits (225), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLDES+ +++ SL+ +GL  NS+++F +DNGG     L    +  SN+PLRG K + 
Sbjct: 224 MVSSLDESMRQLVTSLKRKGLYQNSIIIFTTDNGG-AAGGLDM--SAGSNFPLRGNKNTL 280

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA + SPL+ R G+V + LMH +DWLPTL+ +AG
Sbjct: 281 WEGGVRGVAFVHSPLIKRPGRVYDGLMHASDWLPTLHLLAG 321


>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
          Length = 561

 Score = 90.9 bits (224), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 65/103 (63%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG + ++L  +G+L N+++ F +DNGGP      F  N ASNWPLRG K S 
Sbjct: 259 MVSALDDSVGNITKALHAKGILENTIICFSTDNGGPASG---FDFNDASNWPLRGVKESL 315

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR    +WSPL++  G     +M I D+LPTL   AG +
Sbjct: 316 WEGGVRGTGFVWSPLMKTSGYTSPAMMQIFDFLPTLLKAAGYD 358


>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
 gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
          Length = 565

 Score = 90.9 bits (224), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+SVGR++ +L     L NS+V+F SDNG P+V       NT SN+PLRG K + 
Sbjct: 255 IISKLDQSVGRIMSTLAATDQLENSIVIFYSDNGAPSVGMF---ANTGSNFPLRGQKNTP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR   A+WS  L+ +G +    +++ DWLPTL   AG E
Sbjct: 312 WEGGVRVAGAIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIE 354


>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
          Length = 516

 Score = 90.9 bits (224), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 68/105 (64%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG +  +L++ GL  N+++VF +DNGG T+          SNWPLRG K+S 
Sbjct: 249 MVSAMDEAVGNITLALQKRGLWENTVLVFSTDNGGQTLSG-------GSNWPLRGRKWSL 301

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL + G +   L+HI+DWLPTL  +AG   N
Sbjct: 302 WEGGVRGVGFVASPLLEQPGTISRQLIHISDWLPTLVGLAGGSTN 346


>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
 gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
 gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
          Length = 564

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 45/96 (46%), Positives = 61/96 (63%), Gaps = 4/96 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGR+I +L     L NS+V+F SDNG P+V       NT SN+PLRG K + 
Sbjct: 255 MISKLDQSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMF---SNTGSNFPLRGQKNTP 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTL 95
            EGGVR   A+WS  L+ +G +    +++ DWLPTL
Sbjct: 312 WEGGVRVAGAIWSSGLQARGSIFRQPLYVADWLPTL 347


>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
 gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
          Length = 528

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 42/95 (44%), Positives = 65/95 (68%), Gaps = 4/95 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++D+SVGR+++ L +  +L NS+++F +DNGGP   +  F+ N ASN+PLRG K + 
Sbjct: 226 LVTAMDDSVGRIVEQLRKSHMLDNSIIIFSTDNGGP---AEGFNSNFASNYPLRGVKNTL 282

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPT 94
            EGGVR    +WS  L ++ +V +  MHI DWLPT
Sbjct: 283 WEGGVRGAGLVWSAQLSKRPRVADQTMHIADWLPT 317


>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
 gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
          Length = 574

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 49/103 (47%), Positives = 63/103 (61%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVGRVI +L     L NS+V+F SDNG P+V       NT SNWPL+G K S 
Sbjct: 263 MVSKLDQSVGRVITALARAEQLENSIVIFYSDNGAPSVGMF---ANTGSNWPLKGQKNSP 319

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR    +WS LL  +G +    +++ D+LPTL   AG E
Sbjct: 320 WEGGVRVAGVIWSQLLTARGHLFTQPIYVGDFLPTLAHAAGIE 362


>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
          Length = 531

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 67/102 (65%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           M+ +LD+SVG +++ LE+ G+L ++++VF +DNGG P   +     N   NWPLRG K +
Sbjct: 249 MMDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRA---DPNRGFNWPLRGIKGT 305

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
             EGG+R  A +WSP L    +V   LMHI+DWLPTLY  AG
Sbjct: 306 VWEGGIRGTAFIWSPRLEVSRRVSTQLMHISDWLPTLYSAAG 347


>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
 gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
          Length = 541

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 63/96 (65%), Gaps = 4/96 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DESVG ++  L +  +L NS+++F SDNGGP      F+ N ASN+PL+G K + 
Sbjct: 232 MVTKMDESVGLIVDQLRKSNMLENSIIIFSSDNGGPAQG---FNLNFASNYPLKGVKNTL 288

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTL 95
            EGGVR    +WSPLL+  Q V    +HI+DWLPTL
Sbjct: 289 WEGGVRAAGLIWSPLLKSRQRVSTQTIHISDWLPTL 324


>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
          Length = 429

 Score = 90.1 bits (222), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 50/105 (47%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE+VG V  +LE  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 269 MASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL RKG     L+HI+DWLPTL  +AG   N
Sbjct: 322 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTN 366


>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 502

 Score = 90.1 bits (222), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 66/100 (66%), Gaps = 3/100 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  +D  VGRV  +L ++ +L NS+++F +DNGGP      F  N A+N+PLRG K + 
Sbjct: 244 MLGYMDYGVGRVHDALAKKKMLDNSIIIFTTDNGGPANG---FDYNWANNFPLRGVKATL 300

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR V+ ++S L+   +V   L+HITDWLPTL  +AG
Sbjct: 301 FEGGVRGVSFVYSKLIESPRVSHELIHITDWLPTLVNLAG 340


>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
 gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
          Length = 485

 Score = 90.1 bits (222), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/111 (42%), Positives = 70/111 (63%), Gaps = 8/111 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD++VG V ++L   G+L NS+++F +DNGGP      F  N ASNWPLRG K + 
Sbjct: 222 MVSALDDAVGNVTKALSARGMLENSVIIFTTDNGGPAAG---FDQNYASNWPLRGVKNTL 278

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKE----KNID 106
            EGGV     + SPL+++ +     L+H+ D LPT+Y +AG +    KN+D
Sbjct: 279 WEGGVHGTGFVHSPLIKQPKRTTHELLHVCDLLPTIYELAGGDSTELKNLD 329


>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
 gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
          Length = 380

 Score = 90.1 bits (222), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 72/102 (70%), Gaps = 6/102 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-FHGNTASNWPLRGTKYS 59
           M+  +D+S+G V ++LE+  +L NS+++F +DNGG    + H F+ N  SN+PLRG K  
Sbjct: 230 MVTCVDDSIGEVFRALEKNRMLNNSVILFTTDNGG----APHGFNRNQGSNYPLRGGKDM 285

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
             EGGVR  A ++S L++ KG+V  +L+ +TDW+PTLY++AG
Sbjct: 286 MWEGGVRGTAFIYSDLIKHKGRVSTDLIDVTDWVPTLYYLAG 327


>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
 gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
          Length = 533

 Score = 90.1 bits (222), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE+VG V  +LE  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 269 MASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL RKG     L+HI+DWLPTL  +AG   N
Sbjct: 322 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTN 366


>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
 gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
          Length = 583

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/107 (46%), Positives = 69/107 (64%), Gaps = 4/107 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG V+ +L  + +L NS+++F+SDNGGPT+     H  TASN+PLRG K S 
Sbjct: 264 MVSRLDKSVGHVVDALARQDMLQNSILLFLSDNGGPTLGE---HATTASNYPLRGQKNSP 320

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R+ AA+WS    R G V +  ++I D LPTL   AG   + D
Sbjct: 321 WEGGIRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPD 367


>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 557

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 45/101 (44%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG+V+++L  + LL ++++VF+SDNGG T   +    NT SN+P RG K + 
Sbjct: 305 MVDELDQSVGQVVEALYRKRLLDSTMIVFLSDNGGQTTGVM---NNTGSNYPFRGQKRTL 361

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +WS  +++K ++   LMH+TDWLPT+   AG
Sbjct: 362 FEGGTRVSAFVWSTDIVKKPRIESGLMHVTDWLPTILKRAG 402


>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 571

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 46/105 (43%), Positives = 67/105 (63%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD  V +V+Q L+ +GL  NS++++ SDNGG  +          SNWPLRG K ++
Sbjct: 265 MLSCLDSGVAQVVQELKTQGLYENSVLIYSSDNGGQPLSG-------GSNWPLRGGKGTY 317

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPLL +KG V   L+H++DW PTL  +AG +++
Sbjct: 318 WEGGIRAVGFVHSPLLKKKGIVSRALIHVSDWYPTLLGLAGAQQS 362


>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
          Length = 431

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE+VG V  +LE  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 167 MASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 219

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL RKG     L+HI+DWLPTL  +AG   N
Sbjct: 220 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTN 264


>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
          Length = 496

 Score = 89.4 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE+VG V  +L   GL  NS++++ SDNGG P+           SNWPLRG+K +
Sbjct: 193 MLSCLDEAVGNVTLALRAHGLYNNSILIYSSDNGGQPSAG--------GSNWPLRGSKGT 244

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
           + EGG+R V  + SPLLR +G V   L+HITDW PTL  +A
Sbjct: 245 YWEGGIRAVGFVHSPLLRQRGTVCRELVHITDWYPTLVALA 285


>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 588

 Score = 89.4 bits (220), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 70/106 (66%), Gaps = 5/106 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG V+++L + G+L ++++VF SDNGG   D+   + N AS+WP +G K + 
Sbjct: 292 MVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGG---DTNGENPNYASSWPFKGQKRTL 348

Query: 61  HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+     +WSPL    +G    N+ HI+DWLPTLY +AG + +
Sbjct: 349 WEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAGGDPS 394


>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
          Length = 515

 Score = 89.4 bits (220), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 784

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 65/103 (63%), Gaps = 13/103 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VGRV+ +L+  GL  NS++VF +DNGG              NWPLRG+K S 
Sbjct: 263 MVTCMDEAVGRVVTALKHTGLWENSVLVFSTDNGGI----------LGGNWPLRGSKASL 312

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR V  + SPLL    +G V   L+H+TDWLPTL  +AG
Sbjct: 313 YEGGVRGVGFVTSPLLPSRVRGTVNRELLHMTDWLPTLVRLAG 355


>gi|241619164|ref|XP_002407086.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500932|gb|EEC10426.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 383

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 44/102 (43%), Positives = 66/102 (64%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESVG V+++L + G+L ++++VF SDNG  T      + N AS+WP +G K + 
Sbjct: 126 MVSALDESVGAVVEALGKRGMLNDTVLVFSSDNGADTNSE---NANYASSWPFKGQKMTP 182

Query: 61  HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTLYFIAG 100
            EGGV     +WSPL    +G    ++ H++DWLPTLY +AG
Sbjct: 183 WEGGVHVPGIIWSPLFSGMRGIDYTHIFHVSDWLPTLYQLAG 224


>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
          Length = 599

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 51/108 (47%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +LE  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLACLDEAINNVTLALETYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLITLA--EGQID 393


>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 502

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 63/106 (59%), Gaps = 5/106 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDESVG V ++L   G+L  ++VV  +DNG          G   S WP RG K + 
Sbjct: 254 MLSVLDESVGAVFEALGRRGILNETIVVLTTDNGAGITTQFSSFG---SGWPFRGQKGTA 310

Query: 61  HEGGVRNVAALWSPLL--RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR  A +WSPL    +G V+E+L H++DWLPT Y +AG + +
Sbjct: 311 WEGGVRVPAVVWSPLFSEHRGAVVESLFHVSDWLPTFYELAGGDPS 356


>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
          Length = 523

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG V ++L+  GL  N++++F +DNGG T       GN   NWPLRG K + 
Sbjct: 259 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG----GN---NWPLRGRKGTL 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG
Sbjct: 312 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 352


>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
          Length = 621

 Score = 88.6 bits (218), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 273 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 324

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 325 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 370


>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 355

 Score = 88.6 bits (218), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 48/108 (44%), Positives = 68/108 (62%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++  + ++LEE+ +L NS++VF SDNGG  V+S        +NWPLRG K++ 
Sbjct: 240 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVNS------AGNNWPLRGGKHTS 293

Query: 61  HEGGVRNVAALWSPLL-RKGQVLEN--LMHITDWLPTLYFIAGKEKNI 105
            EGGVR V  ++  LL R  +  EN  LMHITDW PTL  +AG   ++
Sbjct: 294 WEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHV 341


>gi|291238953|ref|XP_002739390.1| PREDICTED: arylsulfatase B-like [Saccoglossus kowalevskii]
          Length = 393

 Score = 88.6 bits (218), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++  + ++LEE+ +L NS++VF SDNGG  V      G+  +NWPLRG K++ 
Sbjct: 133 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALV------GSAGNNWPLRGGKHTS 186

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  ++  LL    +G     LMHITDW PTL  +AG
Sbjct: 187 WEGGVRAVGFVYGDLLPCHSRGTENTGLMHITDWFPTLITLAG 229


>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
          Length = 782

 Score = 88.6 bits (218), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 48/105 (45%), Positives = 67/105 (63%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++++F +DNGG T+      GN   NWPLRG K++ 
Sbjct: 518 MVSIMDEAVGNVTAALKRHGLWDNTVLIFSTDNGGQTLAG----GN---NWPLRGRKWTL 570

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR +  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 571 WEGGVRGIGFVASPLLKQKGVKNRELIHISDWLPTLVNLAGGHVN 615


>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 495

 Score = 88.6 bits (218), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 48/108 (44%), Positives = 68/108 (62%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++  + ++LEE+ +L NS++VF SDNGG  V+S        +NWPLRG K++ 
Sbjct: 240 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVNS------AGNNWPLRGGKHTS 293

Query: 61  HEGGVRNVAALWSPLL-RKGQVLEN--LMHITDWLPTLYFIAGKEKNI 105
            EGGVR V  ++  LL R  +  EN  LMHITDW PTL  +AG   ++
Sbjct: 294 WEGGVRAVGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHV 341


>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
 gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
 gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
 gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
 gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
          Length = 599

 Score = 88.2 bits (217), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
          Length = 596

 Score = 88.2 bits (217), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
          Length = 578

 Score = 88.2 bits (217), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 47/107 (43%), Positives = 64/107 (59%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV--DSLHFHGNTASNWPLRGTKY 58
           ML  LDE +G V ++L+  G   N+L+VF +DNGGP +  DS+      A N+P+RG K+
Sbjct: 237 MLSCLDEGIGNVTRALKRNGFYNNTLIVFTTDNGGPILGGDSV-----GARNYPMRGGKH 291

Query: 59  SFHEGGVRNVAALWSP---LLRKGQVLENLMHITDWLPTLYFIAGKE 102
           + +EGGVR +A +  P    +  G    NLMH  DWLPTL   AG E
Sbjct: 292 AVYEGGVRGIAFVHPPQNTFVETGAGYHNLMHAADWLPTLCDAAGVE 338


>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 533

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 67/106 (63%), Gaps = 8/106 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--HGNTASNWPLRGTKYS 59
           + +LD+SVG V+++L E  +L N+++V  +DNGG     L F    N   N+PLRG K +
Sbjct: 246 VDALDQSVGEVLETLYEADMLDNTIIVLSTDNGG-----LPFGVQSNRGYNFPLRGAKGT 300

Query: 60  FHEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKN 104
             EGG R  A +WSPLL +  +V + ++HITDWLPTLY  AG   N
Sbjct: 301 LWEGGTRGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGN 346


>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
          Length = 599

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
 gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
          Length = 598

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
          Length = 599

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
          Length = 600

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 51/108 (47%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++V+ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIVYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 562

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DESVG ++  L E G   NS++++ SDNGG  +       +   NWPLRG K S+
Sbjct: 249 MVSGVDESVGEIVSELRERGYYDNSVLIYSSDNGGQPL-------SGGCNWPLRGGKGSY 301

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL RKG V   L+HI+DW PTL  +AG  ++
Sbjct: 302 WEGGVRAVGFVHSPLLKRKGVVSHALIHISDWYPTLLSLAGYRES 346


>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
          Length = 596

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
          Length = 596

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
          Length = 597

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 295 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 346

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 347 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 392


>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
          Length = 599

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
 gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
 gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
          Length = 597

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
          Length = 572

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 268 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 319

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 320 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 365


>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
          Length = 614

 Score = 87.8 bits (216), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 500

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++  + ++LEE+ +L NS++VF SDNGG  V+S        +NWPLRG K++ 
Sbjct: 240 MTSCVDEAIANITEALEEKNMLKNSIIVFTSDNGGALVNS------AGNNWPLRGGKHTS 293

Query: 61  HEGGVRNVAALWSPLL-RKGQVLEN--LMHITDWLPTLYFIAGKEKNI 105
            EGGVR +  ++  LL R  +  EN  LMHITDW PTL  +AG   ++
Sbjct: 294 WEGGVRAIGFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHV 341


>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
          Length = 597

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 327

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 48/100 (48%), Positives = 70/100 (70%), Gaps = 3/100 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  +D  +GRV ++L E+ +L NS+++F +DNGGP   S  F+ N A+N+PLRG K + 
Sbjct: 62  MLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGP---SNGFNNNWANNFPLRGVKATL 118

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  ++S L+ K +V  +L+H+TDWLPTL  +AG
Sbjct: 119 FEGGVRGVGFVYSKLIEKSRVSHDLIHVTDWLPTLVSLAG 158


>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
          Length = 601

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 298 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 349

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 350 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 395


>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
          Length = 605

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 302 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 353

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 354 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 399


>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
 gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
          Length = 584

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L  + +L NS+V+F+SDNGGPT      H  TASN+PLRG K S 
Sbjct: 264 MVSKLDQSVGSVIDALARQEMLQNSIVLFLSDNGGPTQGQ---HSTTASNYPLRGQKDSP 320

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG +R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361


>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 437

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 47/102 (46%), Positives = 67/102 (65%), Gaps = 3/102 (2%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L  +D+S+G++ ++L    LL N+++VF+S +GG  V    F  N + N+PLRGTK +  
Sbjct: 247 LTEVDQSLGQIFEALHNRRLLNNTILVFVSASGGMPVGD--FTNNLSFNFPLRGTKGTCF 304

Query: 62  EGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKE 102
           EGGVR  A +WSPLL K  +    LMH+TDWLPTL+  AG +
Sbjct: 305 EGGVRVPAFVWSPLLNKTRRTSSQLMHVTDWLPTLFSAAGGD 346


>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
 gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
          Length = 598

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 294 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
          Length = 598

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
          Length = 600

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
 gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
 gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
 gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
          Length = 598

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
          Length = 564

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 44/103 (42%), Positives = 66/103 (64%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD++V  V  +L++ G   NS++++ SDNGG  +          +NWPLRG+K ++
Sbjct: 267 MLSCLDDAVNNVTTALKKYGFYDNSIIIYSSDNGGQPMAG-------GNNWPLRGSKGTY 319

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R V  + SP+L+ KG + + L+HITDW PTL  +AG E
Sbjct: 320 WEGGIRAVGFVHSPILKMKGYICKELVHITDWFPTLVTLAGGE 362


>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
          Length = 595

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 347

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 348 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 393


>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
          Length = 601

 Score = 87.4 bits (215), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 51/108 (47%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE+V  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 298 MLSCLDEAVSNVTLALKTYGFYNNSILIYSSDNGGQPTAG--------GSNWPLRGSKGT 349

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 350 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 395


>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score = 87.4 bits (215), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 44/101 (43%), Positives = 62/101 (61%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G V ++L+E G+  N+++VF+SDNGG              NWPLRG K S 
Sbjct: 232 MVTCMDEGIGNVTEALKESGMWDNTVLVFVSDNGGEL-------SAGGRNWPLRGGKGSV 284

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLM-HITDWLPTLYFIAG 100
            EGG+R V+ + S L+ + Q + N M HITDW PTL  +AG
Sbjct: 285 FEGGIRTVSFVTSSLIERPQRISNAMIHITDWFPTLTHLAG 325


>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
          Length = 570

 Score = 87.4 bits (215), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
          Length = 555

 Score = 87.4 bits (215), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 345

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 346 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
          Length = 427

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 124 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 175

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 176 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 221


>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 67/103 (65%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V+ +L+  GL  NS++VF +DNGG  + SL        NWPLRG K S 
Sbjct: 243 MVTCMDEAVGGVVATLKHSGLWENSVLVFSTDNGG--IHSL------GGNWPLRGGKASL 294

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR VA + SPLL    +G + + L+H+TDW PTL  +AG
Sbjct: 295 YEGGVRGVAFVTSPLLPSRVRGTINKELLHMTDWFPTLVRLAG 337


>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 533

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 43/100 (43%), Positives = 59/100 (59%), Gaps = 3/100 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG V  +L+      +++VVF+SDNGGPT       G  ++NWPLRG K + 
Sbjct: 268 MVSALDEAVGNVTGALKARRAYDDAVVVFVSDNGGPTNGD---EGTASNNWPLRGGKNTL 324

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG R V  L +P + K       +H  DW PTLY  AG
Sbjct: 325 YEGGTRVVGLLKAPGVAKSSASRAKLHAVDWFPTLYRFAG 364


>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
          Length = 573

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 48/107 (44%), Positives = 67/107 (62%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L++ G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 270 MLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 323 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 367


>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
          Length = 467

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 48/107 (44%), Positives = 67/107 (62%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L++ G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 164 MLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 216

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 217 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 261


>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
          Length = 514

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/105 (47%), Positives = 68/105 (64%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG +  +L+E GL  N+++VF +DNGG T+      GN   NWPLRG K++ 
Sbjct: 248 MVSLMDEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAG----GN---NWPLRGRKWTL 300

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 301 WEGGVRGVGFVASPLLKQKGVESHELIHISDWLPTLVHLAGGHTN 345


>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
          Length = 593

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 290 MLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 341

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 342 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 387


>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
          Length = 599

 Score = 87.0 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 297 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 348

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 349 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 394


>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
          Length = 573

 Score = 86.7 bits (213), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/107 (44%), Positives = 67/107 (62%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L++ G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 270 MLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 323 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 367


>gi|119626709|gb|EAX06304.1| hCG20717, isoform CRA_b [Homo sapiens]
          Length = 354

 Score = 86.7 bits (213), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 51  MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 102

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 103 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 148


>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
 gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
          Length = 535

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ VF +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSHGLWNNTVFVFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  + SPLL RKG     L+HI+DWLPTL  +AG
Sbjct: 324 WEGGVRGVGFVASPLLKRKGVKSRELVHISDWLPTLVGLAG 364


>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
          Length = 569

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359


>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
 gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
          Length = 569

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359


>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
          Length = 300

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L  + +L NS+++F+SDNGGPT      H  TASN+PLRG K S 
Sbjct: 184 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQ---HSTTASNYPLRGQKNSP 240

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG +R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 241 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 281


>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
          Length = 597

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L+  G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 294 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 346

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 347 WEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 391


>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
          Length = 571

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L+  G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 268 MLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 320

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 321 WEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 365


>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
 gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
 gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
 gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
 gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
 gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
          Length = 585

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L  + +L NS+++F+SDNGGPT      H  TASN+PLRG K S 
Sbjct: 264 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQ---HSTTASNYPLRGQKNSP 320

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG +R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361


>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
          Length = 539

 Score = 86.7 bits (213), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 237 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 288

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 289 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 334


>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
          Length = 426

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 123 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 175

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 176 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 216


>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
           caballus]
          Length = 523

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/105 (48%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 259 MVSLLDEAVGNVTAALKSRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 312 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGSTN 356


>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
 gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
 gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
          Length = 599

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 297 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 348

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R +  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 349 YWEGGIRAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 394


>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
          Length = 551

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/105 (47%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 287 MVSLMDEAVGNVTAALKSRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 339

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 340 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 384


>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
          Length = 466

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 164 MLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 215

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R +  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 216 YWEGGIRAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 261


>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
 gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
 gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
          Length = 569

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359


>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
          Length = 569

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359


>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
 gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
          Length = 585

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L  + +L NS+++F+SDNGGPT      H  TASN+PLRG K S 
Sbjct: 264 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTEGQ---HSTTASNYPLRGQKNSP 320

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG +R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361


>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
 gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
          Length = 585

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L  + +L NS+++F+SDNGGPT      H  TASN+PLRG K S 
Sbjct: 264 MVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTEGQ---HSTTASNYPLRGQKNSP 320

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG +R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAG 361


>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
          Length = 543

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/108 (46%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 240 MLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG--------GSNWPLRGSKGT 291

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 292 YWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA--EGQID 337


>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
          Length = 569

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359


>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
          Length = 520

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 64/101 (63%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG +  +L + GL  N+++VF +DNGG T           SNWPLRG K++ 
Sbjct: 253 MVSAMDEAVGNISLALRQAGLWDNTVLVFSTDNGGQTRAG-------GSNWPLRGRKWAL 305

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPLL   G V   LMHI+DWLPTL  +AG
Sbjct: 306 WEGGIRGVGFVSSPLLENPGTVNNELMHISDWLPTLVRLAG 346


>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
          Length = 569

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALMHITDWYPTLVGLAG 359


>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
 gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
 gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
 gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
 gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
          Length = 569

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359


>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
           purpuratus]
          Length = 784

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 44/98 (44%), Positives = 63/98 (64%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG+++ SL++ GL  N++++F +DNGG         GN   NWPLRG K S 
Sbjct: 242 MMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNGGEVAAG----GN---NWPLRGWKRSI 294

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGG+R V  + SPLL    +G V + L+H++DW PTL
Sbjct: 295 WEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTL 332


>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
          Length = 607

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L+  G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 304 MLSCLDEAINNVTLALKRYGFYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 356

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
            EGG+R V  + SPLL+ KG + + L+HITDW PTL  +A
Sbjct: 357 WEGGIRAVGFVHSPLLKNKGTICKELVHITDWYPTLITLA 396


>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
          Length = 522

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 70/105 (66%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L++ GL  N++ +F +DNGG T+      GN   NWPLRG K++ 
Sbjct: 258 MVTLMDEAVGNVTRALQKYGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWTL 310

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPLL++ +V  + L+HI+DWLPTL  +AG   N
Sbjct: 311 WEGGIRGVGFVSSPLLKRKRVKSSELIHISDWLPTLVNLAGGRTN 355


>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
 gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
          Length = 591

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG VI +L  + +L NS+V+F+SDNGGPT      H  TASN+PLRG K S 
Sbjct: 264 MVSKLDQSVGYVIDALARQEMLQNSIVLFLSDNGGPTQGQ---HSTTASNYPLRGQKDSP 320

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EG +R+ AA+WS    R G V +  ++I D LPTL   AG
Sbjct: 321 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAGAG 361


>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
          Length = 481

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 254 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 306

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 307 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 347


>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
          Length = 572

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALMHITDWYPTLVGLAG 359


>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
          Length = 504

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 198 MLSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTAG--------GSNWPLRGSKGT 249

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
           + EGG+R V  + SPLL+ KG V   L+HITDW PTL  +A
Sbjct: 250 YWEGGIRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLA 290


>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
          Length = 426

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 123 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 175

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 176 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 216


>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
 gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 69/108 (63%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K LD+S+G ++ +LE++G+L N+++ F +DNG PTV       N+ SN+PLRG K S 
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVG---LGANSGSNYPLRGVKKSP 306

Query: 61  HEGGVRNVAALWS-PLLR-----KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  A +W+ P +      +G+V +  MH  DW+PTL    G++
Sbjct: 307 WEGGIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEK 354


>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
          Length = 488

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L + G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 185 MLACLDEAINNVTLALRKYGYYENSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 237

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 238 WEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTLITLA--EGQID 282


>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
          Length = 476

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 65/101 (64%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG +  + ++ GL  N++VVF +DNGG        + N+ +NWPLRG K+S 
Sbjct: 227 MVTCMDEAVGNITAAFKDTGLWDNTVVVFTTDNGG------DVNENSGNNWPLRGWKHSL 280

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPLL+  G  +  LMH++DW PTL  +AG
Sbjct: 281 WEGGMRGVGFVNSPLLKSFGYPVNGLMHVSDWFPTLLNLAG 321


>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
          Length = 430

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 166 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 218

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 219 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 263


>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
          Length = 430

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 166 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 218

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 219 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 263


>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
          Length = 426

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/101 (47%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 123 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 175

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    LMHITDW PTL  +AG
Sbjct: 176 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 216


>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
          Length = 520

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +LE  GL  N++ +F +DNGG T+          +NWPLRG K++ 
Sbjct: 256 MVSLMDEAVGNVTAALESHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWTL 308

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +AG
Sbjct: 309 WEGGIRGVGFVASPLLKQKGVRTRELIHISDWLPTLVKLAG 349


>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 69/108 (63%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K LD+S+G ++ +LE++G+L N+++ F +DNG PTV       N+ SN+PLRG K S 
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVG---LGANSGSNYPLRGVKKSP 306

Query: 61  HEGGVRNVAALWS-PLLR-----KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  A +W+ P +      +G+V +  MH  DW+PTL    G++
Sbjct: 307 WEGGIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEK 354


>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
          Length = 547

 Score = 85.5 bits (210), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 69/108 (63%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K LD+S+G ++ +LE++G+L N+++ F +DNG PTV       N+ SN+PLRG K S 
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVG---LGANSGSNYPLRGVKKSP 306

Query: 61  HEGGVRNVAALWS-PLLR-----KGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  A +W+ P +      +G+V +  MH  DW+PTL    G++
Sbjct: 307 WEGGIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEK 354


>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
          Length = 532

 Score = 85.5 bits (210), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE++G V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 268 MVSLMDEAIGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 320

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 321 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 365


>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
          Length = 597

 Score = 85.5 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 297 MVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 349

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 350 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVALAG 390


>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
          Length = 466

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L+  G   NS++++ SDNGG  +          SNWPLRG+K ++
Sbjct: 163 MLSCLDEAINNVTLALKRYGFYDNSIIIYSSDNGGQPMAG-------GSNWPLRGSKGTY 215

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
            EGG+R V  + SPLL+ KG + + L+HITDW PTL  +A
Sbjct: 216 WEGGIRAVGFVHSPLLKNKGTICKELVHITDWYPTLITLA 255


>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
          Length = 564

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 51/108 (47%), Positives = 68/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT       GN   NWPLRG+K +
Sbjct: 263 MLACLDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG-----GN---NWPLRGSKGT 314

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 315 YWEGGIRAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLA--EGQID 360


>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
          Length = 574

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 269 MVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 322 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVALAG 362


>gi|299473382|emb|CBN77780.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 623

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 50/115 (43%), Positives = 68/115 (59%), Gaps = 17/115 (14%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           +L  LD+S+GR++  LE EG + NS++V  SDNGG PT           SN+PLRGTK S
Sbjct: 302 VLMYLDKSIGRLVDYLESEGWMDNSIIVVASDNGGCPTAG--------GSNFPLRGTKAS 353

Query: 60  FHEGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
           + EGG +  A ++SP      R+G   + LMH+TDWLPTL    G      +N+D
Sbjct: 354 YWEGGTKVPALVYSPSHIPKERRGGQYDGLMHVTDWLPTLMAATGSVVESHRNLD 408


>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
          Length = 491

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 50/105 (47%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG +  +L+  GL  N++ VF +DNGG T+      GN   NWPLRG K++ 
Sbjct: 225 MVTLMDEAVGNLTDALKTYGLWDNTVFVFSTDNGGQTLAG----GN---NWPLRGRKWTL 277

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL RKG     L+HI+DWLPTL  +AG   N
Sbjct: 278 WEGGVRGVGFVASPLLKRKGVESHELIHISDWLPTLVHLAGGHTN 322


>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
          Length = 565

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 260 MVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 312

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 313 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVALAG 353


>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
          Length = 613

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 311 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 363

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 364 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 404


>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
          Length = 532

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L++ GL  N++ +F +DNGG T+          +NWPLRG K++ 
Sbjct: 268 MVSLMDEAVGNVTAALKKRGLWDNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWTL 320

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 321 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTN 365


>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 843

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/108 (41%), Positives = 64/108 (59%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++  + ++LEE+ +L NS++VF SDNGG    S        SNWPLRG K++ 
Sbjct: 583 MTSCVDEAIANITEALEEKDMLKNSIIVFTSDNGGAVERS------AGSNWPLRGGKHTI 636

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGV+ V  ++  LL    +G     LMHITDW PTL  +AG   ++
Sbjct: 637 WEGGVKAVGFVYGELLPHHSRGTENTGLMHITDWFPTLVTLAGANPHV 684


>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 507

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 45/103 (43%), Positives = 64/103 (62%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++G V ++L   GLL N+++VF SDNGG  +     +G  ASN+PLRG K + 
Sbjct: 251 MTTCMDEAIGNVTRTLARRGLLDNTIIVFSSDNGGAKL-----YG--ASNYPLRGQKGTL 303

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A ++SPLL    +G     LM++ DW PT   +AG
Sbjct: 304 YEGGVRAPAFVYSPLLSNRVRGTATHELMYVGDWFPTFLHLAG 346


>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
          Length = 418

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 156 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 208

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 209 WEGGIRGVGFVASPLLKQKGIKNRELIHISDWLPTLVTLAGGSTN 253


>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
          Length = 414

 Score = 85.1 bits (209), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHIN 367


>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
          Length = 471

 Score = 85.1 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           +NWPLRG+K +
Sbjct: 170 MLACLDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG--------GNNWPLRGSKGT 221

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ KG V + L+HITDW PTL  +A  E  ID
Sbjct: 222 YWEGGIRAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLA--EGQID 267


>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 524

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 63/98 (64%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE VG V ++L+E GL  N++++F +DNGGP         + A+NWPLRG+K S 
Sbjct: 253 MVSCMDEGVGNVTRALKEAGLYDNTIIIFSTDNGGPI-------QSGANNWPLRGSKGSL 305

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGG+  V  + SPLL    KG V   L+H++DWLPT+
Sbjct: 306 WEGGIHGVGFVHSPLLPTSVKGTVNHELIHVSDWLPTI 343


>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 537

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 42/98 (42%), Positives = 64/98 (65%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG+V+ SL+E GL  ++++VF +DNGG         G + +NWPLRGTK + 
Sbjct: 266 MVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTP-------GKSGNNWPLRGTKNTL 318

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGGV  V  +  P++    +G V ++ MHI+DW PTL
Sbjct: 319 WEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPTL 356


>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
          Length = 481

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 43/101 (42%), Positives = 67/101 (66%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG V ++L + GL+ ++L++F +DNGG   +S    GN   N+PLRG+K++ 
Sbjct: 240 MVSALDEAVGNVTKTLRQRGLMDDTLILFTADNGGGVDES----GN---NYPLRGSKFTV 292

Query: 61  HEGGVRNVAALW-SPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG R V  ++ S L + G V + ++H  DWLPTL   AG
Sbjct: 293 YEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAAAG 333


>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
          Length = 431

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG   N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 367


>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
 gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
          Length = 575

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 269 MVTCMDEAVRNITSALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 322 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 362


>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
          Length = 661

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 47/107 (43%), Positives = 64/107 (59%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE++  V  +L+  G   NS++++ SDNGG             SNWPLRG+K ++
Sbjct: 358 MLSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPAAG-------GSNWPLRGSKGTY 410

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R +  + SPLL+ KG V   L+HITDW PTL  +A  E  ID
Sbjct: 411 WEGGIRAIGFVHSPLLKNKGTVCRELVHITDWYPTLISLA--EGQID 455


>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
          Length = 573

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITSALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 320 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 360


>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
          Length = 506

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 242 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 294

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 295 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHIN 339


>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
          Length = 534

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHIN 367


>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
          Length = 431

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 67/105 (63%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG +  +L+  GL  N+++VF +DNGG T+      GN   NWPLRG K++ 
Sbjct: 165 MVTLMDEAVGNLTDALKRYGLWNNTVLVFSTDNGGQTLAG----GN---NWPLRGRKWTL 217

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG   N
Sbjct: 218 WEGGVRGVGFVASPLLKQKGVESHELIHISDWLPTLVHLAGGHTN 262


>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
 gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
          Length = 533

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 50/103 (48%), Positives = 68/103 (66%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD SVG +I +L++  +L NS+V+F +DNGGP  + L++  N ASNWP+RG K + 
Sbjct: 234 MVSELDTSVGNIISALKDANMLSNSIVIFSTDNGGP-ANGLNY--NYASNWPMRGVKNTL 290

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  +WS   +    G+V   +MHI DWLPTLY  AG
Sbjct: 291 WEGGVRGVGVVWSSKFKPNSGGRVSNRMMHICDWLPTLYAAAG 333


>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
           [Strongylocentrotus purpuratus]
          Length = 401

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 65/98 (66%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K +DE+VG + ++L++ GL  N++++F SDNG     + ++ G   SNWPLRG K S 
Sbjct: 144 MVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSDNGA----NFNYGG---SNWPLRGMKRSL 196

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTL 95
            EGG+++V  + SPLL K   G V  NL H+TDW PTL
Sbjct: 197 WEGGIKSVGFIASPLLPKLVRGTVNNNLFHVTDWFPTL 234


>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 739

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 67/103 (65%), Gaps = 12/103 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G V ++L++ GL  N++++F +DNGGP         + A+NWPLRG+K + 
Sbjct: 486 MVSCMDEGIGNVTRALKDAGLYDNTIIIFSTDNGGPI-------QSGANNWPLRGSKGNM 538

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
            EGG+  V  + SPLL    KG V  +LMH++DWLPT+  +AG
Sbjct: 539 WEGGIHGVGFVHSPLLPTSVKGTVNHDLMHVSDWLPTI--VAG 579


>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
          Length = 460

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 65/101 (64%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ VF +DNGG T+      GN   NWPLRG K++ 
Sbjct: 196 MVSLMDEAVGNVTAALKSHGLWNNTVFVFSTDNGGQTLSG----GN---NWPLRGRKWTL 248

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +AG
Sbjct: 249 WEGGIRGVGFVASPLLKQKGVKSRELIHISDWLPTLVNLAG 289


>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
           melanoleuca]
          Length = 573

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 320 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 360


>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
          Length = 573

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 268 MVTCMDEAVRNITWALKRYGFYNNSIIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 321 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 361


>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
          Length = 473

 Score = 84.7 bits (208), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG V ++L+  GL  N++++F +DNGG T           +NWPLRG K + 
Sbjct: 209 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG-------GNNWPLRGRKGTL 261

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG
Sbjct: 262 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 302


>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score = 84.3 bits (207), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LD+ V +V++ L+  G+  NS++++ SDNGG  +          SNWPLRG K S+
Sbjct: 263 MLSCLDDGVAQVVKELKTTGIYDNSILIYSSDNGGQPLSG-------GSNWPLRGGKGSY 315

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPLL +KG + + ++H++DW PTL  +AG
Sbjct: 316 WEGGIRAVGFVHSPLLKKKGVISKEMIHVSDWYPTLLGLAG 356


>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
          Length = 556

 Score = 84.3 bits (207), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 292 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 344

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG   N
Sbjct: 345 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 389


>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
 gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
 gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
 gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
          Length = 528

 Score = 84.3 bits (207), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG V ++L+  GL  N++++F +DNGG T           +NWPLRG K + 
Sbjct: 264 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG-------GNNWPLRGRKGTL 316

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG
Sbjct: 317 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 357


>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 648

 Score = 84.3 bits (207), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++G V Q+L++ G+  N++VVF +DNGG T       G  ASN+PLRG K   
Sbjct: 250 MTSCMDEAIGNVTQTLKDRGMWDNTVVVFSTDNGGAT-------GFGASNYPLRGQKCGH 302

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A + SPLL+   +G     LM++ DWLPT   +AG
Sbjct: 303 YEGGVRGPAFVSSPLLKPHVRGTKNTELMYVGDWLPTFVHLAG 345


>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
          Length = 517

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG V ++L+  GL  N++++F +DNGG T           +NWPLRG K + 
Sbjct: 255 MVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSG-------GNNWPLRGRKGTL 307

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG
Sbjct: 308 WEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAG 348


>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
          Length = 534

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG   N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 367


>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
           (N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
           intestinalis]
          Length = 522

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 67/101 (66%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G V+ +L+ +G+  ++LVVF +DNGG T+    F GN   NWPLRG K S 
Sbjct: 253 MVTAMDEAIGSVVDALKWKGIWQDTLVVFTTDNGGQTL----FGGN---NWPLRGRKASL 305

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR V  +    +R KG+    L+HI+DW PTL +IAG
Sbjct: 306 WEGGVRGVGLVRGYGIRDKGRSSNELVHISDWFPTLLYIAG 346


>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 516

 Score = 84.3 bits (207), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG + Q+L++ GL  NS+++F +DNGG  +          +NWPLRG K S 
Sbjct: 259 MVSCMDEAVGNITQTLKDTGLYDNSVIIFSTDNGGQVIAG-------GNNWPLRGWKGSL 311

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+  V  + SPL+    +G   + L+H+TDW PT+  +AG   N
Sbjct: 312 WEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGSVN 358


>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
 gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
 gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
          Length = 534

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHITDWLPTL  +AG   N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTN 367


>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
          Length = 571

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 359


>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
          Length = 570

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 264 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 316

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 317 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 357


>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
          Length = 476

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 170 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 222

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 223 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 263


>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
          Length = 602

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 41/102 (40%), Positives = 66/102 (64%), Gaps = 10/102 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + +LD  +GR++ +L+E+    N+++VF SDNGG   +        ASNWPLRG+K + +
Sbjct: 270 ITTLDHHIGRMVTALKEKNYWNNTVLVFTSDNGGQPNEG-------ASNWPLRGSKGTIY 322

Query: 62  EGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
           +GG+++ A + SPLL+   KGQ    L H++DW PTL  ++G
Sbjct: 323 DGGLKSRAFVASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG 364


>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 496

 Score = 84.3 bits (207), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 12/103 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++  + ++LE +G+L NS++V  +DNGG         G+  +NWPLRG K S 
Sbjct: 242 MASCVDEAIANITEALENKGMLKNSVIVLTTDNGG---------GSVGNNWPLRGRKNSH 292

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA ++S LL    +G     LMHITDW PTL  + G
Sbjct: 293 WEGGVRGVAFVYSDLLPMDVRGTENSELMHITDWFPTLVKLGG 335


>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
          Length = 547

 Score = 84.0 bits (206), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 243 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 295

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 296 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 336


>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
          Length = 572

 Score = 84.0 bits (206), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 359


>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
          Length = 521

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 67/105 (63%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG + ++L+  GL  N++++F +DNGG T+      GN   NWPLRG K++ 
Sbjct: 257 MVSILDEAVGNITEALKTRGLWNNTVLIFSTDNGGQTLAG----GN---NWPLRGRKWTL 309

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR    + SPLL+ +G     L+HI+DWLPTL  +AG   N
Sbjct: 310 WEGGVRGAGFVASPLLKQRGVESHELIHISDWLPTLVKLAGGSTN 354


>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 1410

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+++G V+  L E GL  N++++  +DNGG  +        + +NWPL+G+K S 
Sbjct: 242 MVTCMDDAIGNVVDQLREAGLWNNTVLIVSTDNGGAKI--------SGNNWPLKGSKASL 293

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA + SP L    +G     LMH+TDW PT+  +AG
Sbjct: 294 WEGGVRGVAFVTSPFLADHVRGTSNHQLMHVTDWFPTMLHVAG 336


>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 42/103 (40%), Positives = 62/103 (60%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG + Q+LE+ GL  NS+++F +DNGG             +NWPLRG K S 
Sbjct: 233 MVSCMDEAVGNITQTLEDAGLYDNSVIIFSTDNGGQVTAG-------GNNWPLRGWKGSL 285

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
            EGG+  V  + SPL+    +G   + L+H+TDW PT+  +AG
Sbjct: 286 WEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAG 328


>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
          Length = 252

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 134 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 186

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHI+DWLPTL  +AG   N
Sbjct: 187 WEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 231


>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
          Length = 571

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAG 359


>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1818

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 43/95 (45%), Positives = 59/95 (62%), Gaps = 8/95 (8%)

Query: 4   SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
           ++D ++GRV+ +L+  G   N+LVVF SDNG     ++       SNWPLRG K+S  EG
Sbjct: 296 AMDRAIGRVVDALQATGAYENALVVFASDNG-----AIPGQMGGGSNWPLRGGKFSAWEG 350

Query: 64  GVRNVAALWSPLL---RKGQVLENLMHITDWLPTL 95
           GVR  A + SPLL    +G V + L H+ DWLPT+
Sbjct: 351 GVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTI 385



 Score = 78.6 bits (192), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 48/110 (43%), Positives = 64/110 (58%), Gaps = 15/110 (13%)

Query: 5   LDESVGRVIQSLEE-EGLLGNSLVVFISDNGGPTVDSLHFHGNTA--SNWPLRGTKYSFH 61
           +D ++  V+  L+  +G   NS+++F SDNGG         G  A  SNWPLRG K+S  
Sbjct: 869 MDRAMEAVVDELKAVDGAYENSVIMFASDNGG-------LPGQIAGGSNWPLRGQKFSPF 921

Query: 62  EGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG--KEKNID 106
           EGGVR  A + SPLL   R+G     L H+TDWLPTL  +AG    +N+D
Sbjct: 922 EGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLAGGAPPRNVD 971


>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 570

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 43/106 (40%), Positives = 68/106 (64%), Gaps = 8/106 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  +  +L + G   NS+++F +DNGG  +          SNWPLRG K ++
Sbjct: 268 MVSAVDEAVRNITYALRKYGYYQNSVIIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKNI 105
            EGG+R +  + SPLLRK  +V + L+HITDW PTL  +AG ++++
Sbjct: 321 WEGGIRGLGFVHSPLLRKKRRVSKALVHITDWYPTLVGLAGGDESL 366


>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
          Length = 581

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 40/102 (39%), Positives = 66/102 (64%), Gaps = 10/102 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + +LD  +G+++ +L+E+    N+++VF SDNGG   +        ASNWPLRG+K + +
Sbjct: 249 ITTLDHHIGKMVTALKEKNYWNNTVLVFTSDNGGQPKEG-------ASNWPLRGSKGTIY 301

Query: 62  EGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
           +GG+++ A + SPLL+   KGQ    L H++DW PTL  ++G
Sbjct: 302 DGGLKSRAFIASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSG 343


>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
          Length = 584

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 46/101 (45%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F +DNGG T     F G   SNWPLRG K ++
Sbjct: 273 MVTCMDEAVRNITWALKRYGYYDNSVIIFSTDNGGQT-----FSG--GSNWPLRGRKGTY 325

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK +    L+HITDW PTL  +AG
Sbjct: 326 WEGGVRGIGFVHSPLLKRKRRTSRALVHITDWYPTLVTLAG 366


>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
          Length = 431

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHI+DWLPTL  +AG   N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 367


>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
 gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 367


>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
          Length = 415

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 368


>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
          Length = 503

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K++ 
Sbjct: 240 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWTL 292

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +AG + +
Sbjct: 293 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGDTH 337


>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 41/101 (40%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE +G V  +L++ G+  N++++F+SDNG        FH     NWPLRG K S 
Sbjct: 248 MATCMDEGIGNVTSTLKQRGMWDNTVLIFVSDNGA------QFH-TGGRNWPLRGGKGSV 300

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V+ + SPL+ R  ++ E+++H+ DW PTL  +AG
Sbjct: 301 FEGGIRVVSFVTSPLIERPRRISEDIVHLMDWFPTLTHLAG 341


>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
          Length = 383

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 49/108 (45%), Positives = 66/108 (61%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ML  LDE++  V  +L+  G   NS++++ SDNGG PT           SNWPLRG+K +
Sbjct: 80  MLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQPTAG--------GSNWPLRGSKGT 131

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
           + EGG+R V  + SPLL+ +G V   L+HITDW PTL  +A  E  ID
Sbjct: 132 YWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWYPTLISLA--EGQID 177


>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
          Length = 458

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V ++L+  GL  N++ +F +DNGG T           +NWPLRG K + 
Sbjct: 270 MVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSG-------GNNWPLRGRKGTL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + SPLL+ KG     LMHI+DWLPTL  +AG   N
Sbjct: 323 WEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 367


>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
 gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
 gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
 gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
 gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 367


>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
 gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTN 368


>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
          Length = 316

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  +   L + G   NS+++F +DNGG  +          SNWPLRG K ++
Sbjct: 162 MVSAVDEAVRNITYGLRKYGYYENSIMIFSTDNGGQPLSG-------GSNWPLRGRKGTY 214

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIA-GKEKNID 106
            EGGVR +  + SPLLRK + V + L+HITDW PTL  +A GKE +I+
Sbjct: 215 WEGGVRGLGFIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIE 262


>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
          Length = 502

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+V  V  +L+++GL  N++ +F +DNGG T+          +NWPLRG K++ 
Sbjct: 237 MVSVLDEAVANVTNALKKQGLWNNTVFIFSTDNGGQTLSG-------GNNWPLRGRKWTL 289

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + S LL+ KG   + L+HI+DWLPTL  +AG   N
Sbjct: 290 WEGGIRGVGFVTSSLLKQKGVTSKELIHISDWLPTLVNLAGGSTN 334


>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
          Length = 475

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/96 (50%), Positives = 60/96 (62%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE+VG V  +LE  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 216 MASLMDEAVGNVTAALESRGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 268

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTL 95
            EGGVR V  + SPLL RKG     L+HI+DWLP L
Sbjct: 269 WEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPPL 304


>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 574

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 45/101 (44%), Positives = 65/101 (64%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  V  +L + G   NS++++ +DNG        F G   SNWPLRG K ++
Sbjct: 266 MVSTVDEAVRNVTYALRKYGFYRNSVIIYSTDNGAQP-----FTG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R VA + SPLL R+ +V + L+HITDW PTL  +AG
Sbjct: 319 WEGGIRGVAFVHSPLLKRRRRVSKALLHITDWFPTLVGLAG 359


>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
          Length = 408

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368


>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 469

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/98 (42%), Positives = 62/98 (63%), Gaps = 9/98 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDES+G++  +L+EEGL  +++ +F +DNGG          N  +NWPLRG K ++
Sbjct: 207 MVTMLDESIGKLTDTLKEEGLWNDTVFIFSTDNGGVG------KKNAGNNWPLRGVKGNY 260

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGG+R V  +  PLL    +G +  +LMHI+DW PTL
Sbjct: 261 FEGGIRGVGFVAGPLLSTNVQGTISTDLMHISDWYPTL 298


>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
          Length = 415

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368


>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 504

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/103 (44%), Positives = 66/103 (64%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE++  +I++LE + +L NS++VF SDNGG   D L+    + +NWPLRG K S 
Sbjct: 237 MVSCVDEAIANIIETLENKDMLKNSIIVFTSDNGG---DPLY----SGNNWPLRGWKASN 289

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  ++S +L    +G     LMHITDW PTL  I+G
Sbjct: 290 WEGGVRALGFVYSEILPTNARGTDNNELMHITDWFPTLVDISG 332


>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
          Length = 575

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D +V  + ++L++ G   NS++VF SDNGG T     F G   SNWPLRG K ++
Sbjct: 268 MVTCMDAAVKNITRALKKYGYYSNSVIVFSSDNGGQT-----FSG--GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPL++K  +    LMHITDW PTL  +AG
Sbjct: 321 WEGGVRGLGFVHSPLIKKKRRTSRALMHITDWYPTLVKLAG 361


>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
 gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
 gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
          Length = 573

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 46/101 (45%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL+K  +    L+HITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKKKRRTSRALVHITDWYPTLVGLAG 359


>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
          Length = 573

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 45/101 (44%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 268 MVTCMDEAVRNITGALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL++  +    L+HITDW PTL  +AG
Sbjct: 321 WEGGVRGLGFVHSPLLKRTRRTSRALLHITDWYPTLVGLAG 361


>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
          Length = 513

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 249 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 301

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 302 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 346


>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
 gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367


>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
 gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
          Length = 584

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 68/101 (67%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG +  +L +  +L +S+V+F +DNGG        + N  SN+PLRG K   
Sbjct: 224 MVTSLDQSVGDIFAALSKRHMLRDSVVIFTTDNGGA---PYGLNWNRGSNFPLRGGKDML 280

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGGV+ VA ++S L++ KG+V + L+ +TDW+PT+Y +AG
Sbjct: 281 WEGGVKGVAFVYSDLIKQKGRVSKELIDVTDWVPTIYHLAG 321


>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
 gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
 gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
          Length = 413

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366


>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
 gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
 gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
 gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
 gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
          Length = 573

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 46/101 (45%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 266 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 318

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL+K  +    L+HITDW PTL  +AG
Sbjct: 319 WEGGVRGLGFVHSPLLKKKRRTSRALVHITDWYPTLVGLAG 359


>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
          Length = 413

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366


>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
          Length = 572

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 62/101 (61%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL RK      L+HITDW PTL  +AG
Sbjct: 320 WEGGVRGLGFIHSPLLKRKRWTSHALVHITDWYPTLVGLAG 360


>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
 gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLMKLARGHTN 366


>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
          Length = 534

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367


>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
          Length = 492

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 228 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 280

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 281 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 325


>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
 gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
          Length = 455

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 42/101 (41%), Positives = 61/101 (60%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG+++  ++ + +L N+L +F SDNGGP        G    N PLRG K++ 
Sbjct: 240 MVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPE------PGKLTDNGPLRGGKHTL 293

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A A W   +  G  +E  +HI DW PTL  +AG
Sbjct: 294 YEGGVRVCAFASWKGRIAPGSKVEAPLHIVDWYPTLIELAG 334


>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
          Length = 535

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368


>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
          Length = 533

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366


>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
 gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
 gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
 gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
          Length = 533

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 269 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 321

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 366


>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
 gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 368


>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
 gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
 gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
 gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367


>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
          Length = 519

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 255 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 307

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 308 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTN 352


>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 588

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 49/106 (46%), Positives = 65/106 (61%), Gaps = 11/106 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD+SVG +I  LE+EG + NS+VV  SDNGG + +         SN+PLRG K S+
Sbjct: 291 VLMFLDKSVGHLIHYLEDEGWMENSIVVVASDNGGDSNEG-------GSNYPLRGKKASY 343

Query: 61  HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG +  A ++S    P  R+G   + LMH+TDWLPTL   AG E
Sbjct: 344 WEGGSKVPAFVYSTSHIPEARRGSEYDGLMHVTDWLPTLAAGAGLE 389


>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 496

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/101 (42%), Positives = 62/101 (61%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD +VGR++Q++ +E L+ N+L +F SDNGGP    L       SN PLR  K + 
Sbjct: 264 MVTALDAAVGRIVQAVRDEKLVENTLFIFSSDNGGPNPKQL------TSNGPLRAGKGTV 317

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A A W   ++ G  ++  +HI DW PTL  +AG
Sbjct: 318 YEGGVRVCAFATWPGKIKPGTSVQAPIHIADWFPTLLGLAG 358


>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
          Length = 573

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 46/100 (46%), Positives = 62/100 (62%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L+  G   NS+++F SDNGG T     F G   SNWPLRG K ++
Sbjct: 267 MVTCMDEAVRNITWALKRYGFYNNSIIIFSSDNGGQT-----FSG--GSNWPLRGRKGTY 319

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR +  + SPLL RK +    L+HITDW PTL  +A
Sbjct: 320 WEGGVRGLGFIHSPLLKRKRRTSHALVHITDWYPTLVGLA 359


>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
          Length = 534

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 270 MVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 322

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 323 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTN 367


>gi|47228126|emb|CAF97755.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 650

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 45/101 (44%), Positives = 65/101 (64%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  V  +L + G   NS++++ +DNG        F G   SNWPLRG K ++
Sbjct: 278 MVSTVDEAVRNVTYALRKYGYYRNSVIIYSTDNGAQP-----FTG--GSNWPLRGRKGTY 330

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R VA + SPLL R+ +V + L+HITDW PTL  +AG
Sbjct: 331 WEGGIRGVAFVHSPLLRRRRRVSKALLHITDWFPTLVGLAG 371


>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
          Length = 484

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/106 (40%), Positives = 66/106 (62%), Gaps = 8/106 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE++G +   L E G+  ++L+VF SDNGG  +D    +GN   N+PLRG K++ 
Sbjct: 240 MVSALDEAIGNITDQLRESGMYDDTLIVFTSDNGGSPMD----YGN---NYPLRGAKFTV 292

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNI 105
           +EGG R V+ +    L+K G+  E++MH  DW  TL   AG E ++
Sbjct: 293 YEGGTRVVSFVHGAGLQKTGEAYESMMHAVDWSDTLVAAAGGEPDM 338


>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
          Length = 431

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 64/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+          +NWPLRG K+S 
Sbjct: 167 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG-------GNNWPLRGRKWSL 219

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +A    N
Sbjct: 220 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTN 264


>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
 gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
          Length = 520

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++DE+VG +  +L++ GL  N+++VF +DNGG  +          +NWPLRG K S 
Sbjct: 266 MTTAMDEAVGNITDALKQYGLWDNTVLVFSTDNGGQVMAG-------GNNWPLRGWKGSL 318

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R V  + SPL++ KG+  + L+HI+DW PTL  +AG   N
Sbjct: 319 WEGGIRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLAGGSTN 363


>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 570

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 9/108 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  +   L + G   NS+++F +DNGG  +          SNWPLRG K ++
Sbjct: 268 MVSAVDEAVRNITYGLRKYGYYENSIMIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIA-GKEKNID 106
            EGGVR +  + SPLLRK + V + L+HITDW PTL  +A GKE +I+
Sbjct: 321 WEGGVRGLGFIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIE 368


>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
 gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
          Length = 535

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 64/100 (64%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V  +L+  GL  N++ +F +DNGG T+      GN   NWPLRG K+S 
Sbjct: 271 MVSLMDEAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG----GN---NWPLRGRKWSL 323

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
            EGG+R V  + SPLL+ KG     L+HI+DWLPTL  +A
Sbjct: 324 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 363


>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 527

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 66/105 (62%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  +   L + G   NS+++F +DNGG  +          SNWPLRG K ++
Sbjct: 268 MVSAVDEAVRNITYGLRKYGYYQNSIIIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR +  + SPLLRK  +V + L+HITDW PTL  +AG +++
Sbjct: 321 WEGGVRGLGFVHSPLLRKKRRVSKALVHITDWYPTLVGLAGGDES 365


>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 505

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE +G + ++L+  G+L N+++VF SDNG        FH     NWPLRG K + 
Sbjct: 247 MATCMDEGIGNITETLKASGMLNNTILVFSSDNGA------QFH-TGGRNWPLRGGKGTM 299

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGGVR +A + SPLL+K + + + ++H +DW PT   +AG
Sbjct: 300 WEGGVRVLAFVNSPLLKKSRTVNHEMIHTSDWFPTFVRLAG 340


>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 545

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 62/98 (63%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G V Q+L++ GL  N++++F +DNGG     +   GN   NWPLRG+K S 
Sbjct: 292 MVSCMDEGIGNVTQALKDAGLYDNTIIIFSTDNGG----HIDRGGN---NWPLRGSKGSM 344

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGG+  V  + SPLL    KG V   LMH++DWLPT+
Sbjct: 345 WEGGIHGVGFVHSPLLPDAVKGTVNHELMHVSDWLPTI 382


>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
          Length = 382

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 44/104 (42%), Positives = 61/104 (58%), Gaps = 9/104 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+V  +  +LEE GL+ N L+VF +DNGGP      F+G  A+N PLRG K + 
Sbjct: 243 MISALDEAVANITNALEETGLIDNLLLVFTTDNGGP------FNG-AANNLPLRGCKATL 295

Query: 61  HEGGVRNVAALWSPLLRK--GQVLENLMHITDWLPTLYFIAGKE 102
            EGG +    ++S  L K  G +   +MH  DW PT   +AG E
Sbjct: 296 WEGGTKGAGFIYSKTLLKKTGYLNTGMMHAVDWYPTFVELAGGE 339


>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
          Length = 484

 Score = 81.3 bits (199), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 68/110 (61%), Gaps = 11/110 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D ++G  I++L++  +L N+++VF+SDNGG    +  F GN   NWPLRG+K + 
Sbjct: 244 MVSAMDSAIGMTIKALKQRNMLTNTMIVFLSDNGG----TPFFGGN---NWPLRGSKATL 296

Query: 61  HEGGVRNVAALWSPLL--RKGQVLENLMHITDWLPTLYFIAG--KEKNID 106
            EGG R  + +W  +L  + G     ++H  DW PTL  +AG   EK++D
Sbjct: 297 WEGGTRVPSFVWGKMLLQKAGYTSNEMIHAVDWFPTLVSLAGGTPEKDVD 346


>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
 gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
          Length = 333

 Score = 80.9 bits (198), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 9/99 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  VG++++ L E G++ N+ V+F+SDNG    + L        N P RG K S  EGG
Sbjct: 233 LDAMVGKIVEKLIENGMMNNTYVIFVSDNGADPPEGL--------NTPFRGGKSSLFEGG 284

Query: 65  VRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAGKE 102
            ++ + ++SPLL+K +   + LMHITDWLPTL  +AG +
Sbjct: 285 TKSNSFIYSPLLKKTEYENDGLMHITDWLPTLVRLAGGQ 323


>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
           carolinensis]
          Length = 580

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 44/100 (44%), Positives = 64/100 (64%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L++ G   NS++VF +DNGG T     F G   SNWPLRG K ++
Sbjct: 277 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 329

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
            EGGVR +  + SPL++ K ++   L+HITDW PTL  +A
Sbjct: 330 WEGGVRGIGFVHSPLIKHKRRISRALIHITDWYPTLVTLA 369


>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
          Length = 345

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 9/97 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  VG++++ LE  GL+ N+ V+F+SDNG    + L        N P RG K S  EGG
Sbjct: 245 LDAMVGKIVEKLENNGLMDNTYVIFVSDNGADPPEGL--------NTPFRGGKGSLFEGG 296

Query: 65  VRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            ++ + ++SPLL K G   + LMHITDWLPTL  +AG
Sbjct: 297 TKSNSFVYSPLLNKTGYENDGLMHITDWLPTLVKLAG 333


>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
          Length = 873

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 60/96 (62%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE++  +  +L+  G   N+++V+ SDNGG  +          SNWPLRG+K ++
Sbjct: 584 MVSCLDEAIHNLTLALKRYGYYDNTVIVYSSDNGGQPLAG-------GSNWPLRGSKATY 636

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTL 95
            EGG+R V  + SPLL +KG    +L+HITDW PTL
Sbjct: 637 WEGGIRAVGFVHSPLLMKKGTKCRSLIHITDWFPTL 672


>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
          Length = 560

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/96 (44%), Positives = 60/96 (62%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE++  +  +L+  G   N+++V+ SDNGG  +          SNWPLRG+K S+
Sbjct: 259 MVSCLDEAIRNLTLALKRYGYYDNTVLVYSSDNGGQPLLG-------GSNWPLRGSKASY 311

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTL 95
            EGG+R V  + SPLLR KG    +L+HITDW PTL
Sbjct: 312 WEGGIRAVGFVHSPLLRNKGTKCRSLIHITDWFPTL 347


>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE +  +   L+ +GLL +++++  +DNGG T          +SNWPLRG K + 
Sbjct: 236 MMLALDEGISNITSLLQTKGLLDDTIIILTTDNGGQTALG-------SSNWPLRGNKATV 288

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
            EGGVR ++ +WS  LRK     N +MHITDW PT+
Sbjct: 289 FEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTI 324


>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 491

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE +  +   L+ +GLL +++++  +DNGG T          +SNWPLRG K + 
Sbjct: 236 MMLALDEGISNITSLLQTKGLLDDTIIILTTDNGGQTALG-------SSNWPLRGNKATV 288

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
            EGGVR ++ +WS  LRK     N +MHITDW PT+
Sbjct: 289 FEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTI 324


>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
 gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
          Length = 492

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 63/98 (64%), Gaps = 10/98 (10%)

Query: 4   SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
           ++DE+VG + ++L++ G+  N++++F +DNG              SNWPLRG K +  EG
Sbjct: 243 AMDEAVGNITRALKDHGMWENTVLIFSTDNGA---------SKAGSNWPLRGFKNTLWEG 293

Query: 64  GVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
           GVR V  + SPLL+ K ++ + L+HI+DW PT+  IAG
Sbjct: 294 GVRGVGFVSSPLLKVKQRISDALLHISDWFPTIVRIAG 331


>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 539

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 62/103 (60%), Gaps = 12/103 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG--PTVDSLHFHGNTASNWPLRGTKY 58
           M   +DE +G V  +L++ G+  N++ +F+SDNG   PT            NWPLRG K 
Sbjct: 281 MATCMDEGIGNVTNALKQRGMWDNTVFIFVSDNGAQFPT---------GGRNWPLRGGKG 331

Query: 59  SFHEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
           S  EGG+R V+ + SPL+ + Q   N ++H+TDWLPTL  +AG
Sbjct: 332 SVFEGGIRVVSFVTSPLIERPQRSSNEMIHLTDWLPTLTHLAG 374


>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 631

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 63/109 (57%), Gaps = 5/109 (4%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++SLD+S   ++  L+   +L  S+++F SDNGG T    +F    ASNWPLRG K +  
Sbjct: 274 VESLDDSFAHMVDVLQRNNILNESVILFSSDNGGTTFKDPYFK-TGASNWPLRGQKNTQW 332

Query: 62  EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE----KNID 106
           EGGVR  + +WS  + +  V E L H  DWLPT   + G +    K+ID
Sbjct: 333 EGGVRVSSFVWSSQITEPYVSEELYHFVDWLPTFRRLGGGDIGDLKDID 381


>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
          Length = 573

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L++ G   NS++VF +DNGG T     F G   SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR +  + SPL+ RK +    L+HITDW PTL  +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361


>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
          Length = 574

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L++ G   NS++VF +DNGG T     F G   SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR +  + SPL+ RK +    L+HITDW PTL  +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361


>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 568

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L + G   NS+V+F +DNG   +          SNWPLRG K ++
Sbjct: 264 MVSIVDEAVRNITYALRKYGFYRNSVVIFSTDNGAQPLTG-------GSNWPLRGCKGTY 316

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPL+R + ++  +L+HITDW PTL  +AG
Sbjct: 317 WEGGIRGVGFVHSPLIRHRRRISRDLIHITDWYPTLVGLAG 357


>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
          Length = 574

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L++ G   NS++VF +DNGG T     F G   SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR +  + SPL+ RK +    L+HITDW PTL  +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361


>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 732

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  V  +L + G   NS++++ +DNG        F G   +NWPLRG K ++
Sbjct: 269 MVSTVDEAVRNVTYALRKYGYYRNSVIIYSTDNGAQP-----FTG--GNNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR V  + SPLL R+ +V + LMHITDW PTL  +A
Sbjct: 322 WEGGVRGVGFVHSPLLRRRRRVSKALMHITDWFPTLVGLA 361


>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 64/102 (62%), Gaps = 10/102 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           M+ ++DE+V  V  +L + G   NS++++ +DNG  P V          SNWPLRG K +
Sbjct: 264 MVSTVDEAVRNVTYALRKYGYYKNSVIIYSTDNGAQPYVG--------GSNWPLRGRKGT 315

Query: 60  FHEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
           + EGGVR VA + SPLL+ + +V   L+HITDW PTL  +AG
Sbjct: 316 YWEGGVRGVAFVHSPLLKHRRRVSTALLHITDWFPTLVGLAG 357


>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
 gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
          Length = 449

 Score = 79.3 bits (194), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 63/105 (60%), Gaps = 9/105 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG V  +L+  GL  N++++F +DNG           N  SNWPLRG K + 
Sbjct: 196 MVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGA--------RENAGSNWPLRGWKNTL 247

Query: 61  HEGGVRNVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + S LLRKG +  + L+HI+DW PTL  I+    N
Sbjct: 248 WEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLLRISRSVSN 292


>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 673

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 64/101 (63%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  +  +L + G   NS+++F +DNGG  +          SNWPLRG K ++
Sbjct: 268 MVSTVDEAVRNITYALRKYGYYQNSVLIFSTDNGGQPLSG-------GSNWPLRGRKGTY 320

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG+R +  + SPLL+K  +V + L+HITDW PTL  +AG
Sbjct: 321 WEGGIRGLGFVHSPLLKKRKRVSKALVHITDWYPTLVGLAG 361


>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 545

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 62/98 (63%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE+VG + + L+++GL  ++++VF SDNGG  +D        ASNWPLRG+K + 
Sbjct: 286 MTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNGG-NID------QGASNWPLRGSKTTL 338

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGGVR V  + SPLL    KG V   L+ I+DW PTL
Sbjct: 339 WEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTL 376


>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
          Length = 551

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE++  +  +L+  G   N+++V+ SDNGG  +          SNWPLRG+K ++
Sbjct: 245 MVSCLDEAIHNLTLALKHYGYYDNTVIVYSSDNGGQPLAG-------GSNWPLRGSKATY 297

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTL 95
            EGG+R V  + SPLL  KG    +L+HITDW PTL
Sbjct: 298 WEGGIRTVGFVHSPLLMNKGTKCRSLIHITDWFPTL 333


>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 461

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 39/93 (41%), Positives = 58/93 (62%), Gaps = 3/93 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+SVG V ++L+ +G+L +++V+  SD+GG             SNWPLRGTK + 
Sbjct: 226 MVAALDQSVGDVFRALQRKGILEDTIVLLTSDSGGAAGG---VEQGVGSNWPLRGTKKTL 282

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLP 93
            EGGVR V  LWS  L +G+V   L+H + + P
Sbjct: 283 WEGGVRAVGLLWSAKLARGRVASQLIHRSAYFP 315


>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score = 79.0 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 42/101 (41%), Positives = 61/101 (60%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++DE+VG++  +++ +G+  N++ +F SDNGGP        G   SN PLRG K + 
Sbjct: 248 MLAAMDEAVGQIASAIDGKGMRSNTIFLFSSDNGGPA------PGQVTSNGPLRGQKGTL 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A + W   LR G V+   +HI D  PTL  +AG
Sbjct: 302 YEGGVRVPAFISWEGHLRPGTVVNAPLHIVDLFPTLLTLAG 342


>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
          Length = 575

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 45/100 (45%), Positives = 63/100 (63%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  +  +L++ G   NS++VF +DNGG T     F G   SNWPLRG K ++
Sbjct: 269 MVTCMDEAVKNITWALKKYGYYDNSVMVFSTDNGGQT-----FSG--GSNWPLRGRKGTY 321

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR +  + SPL+ RK +    L+HITDW PTL  +A
Sbjct: 322 WEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLVSLA 361


>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
          Length = 328

 Score = 79.0 bits (193), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 42/102 (41%), Positives = 62/102 (60%), Gaps = 11/102 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +   LD  VG+V++ L +  ++ N+ ++F+SDNG    + L        N P RG K S 
Sbjct: 228 LTSGLDSMVGKVVELLNDNHMMDNTYLIFLSDNGADPAEGL--------NAPFRGGKGSL 279

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN--LMHITDWLPTLYFIAG 100
            EGG ++ + ++SPLL+K +  EN  LMHITDWLPTL  +AG
Sbjct: 280 FEGGTKSSSFIYSPLLKKTE-YENDGLMHITDWLPTLVKLAG 320


>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
          Length = 330

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 9/99 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  VG+++  L E  ++ N+ V+F+SDNG   ++          N P RG K S  EGG
Sbjct: 231 LDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPIE--------GRNAPFRGGKGSLFEGG 282

Query: 65  VRNVAALWSPLLRKGQVL-ENLMHITDWLPTLYFIAGKE 102
            ++ + ++SPLL+K +   + LMHITDWLPTL  +AG +
Sbjct: 283 TKSSSFIYSPLLKKTEYENDGLMHITDWLPTLVKLAGGQ 321


>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 473

 Score = 78.6 bits (192), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +DE++G + ++L+++G+  N+++VF +DNGG            ASN+PLRG K S 
Sbjct: 215 MTSCMDEAIGNITRTLKDKGIWNNTVIVFSTDNGGARTFG-------ASNYPLRGQKASN 267

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A + SPLL    +G +   LM+I DW PT   +AG
Sbjct: 268 FEGGIRGPAFVSSPLLDPLVRGTINNELMYIGDWFPTFMNLAG 310


>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
 gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
          Length = 478

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 61/100 (61%), Gaps = 9/100 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG V  +L+   L  N++++F +DNG           N  SNWPLRG K + 
Sbjct: 225 MVSAMDEAVGNVTDALKRTRLWDNTVLIFSTDNGA--------RRNAGSNWPLRGWKNTL 276

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIA 99
            EGGVR V  + S LLRKG+   + L+HI+DW PTL  I+
Sbjct: 277 WEGGVRGVGFVTSKLLRKGKRKCDALIHISDWFPTLLRIS 316


>gi|335058614|gb|AEH26503.1| arylsulphatase [uncultured Acidobacteria bacterium A11]
          Length = 491

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 50/117 (42%), Positives = 67/117 (57%), Gaps = 17/117 (14%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-------------GPTVDS-LHFHGNT 47
           L+ LD SVGR++ +L+E GL   +LVVF+SDNG             GP  D+ LH     
Sbjct: 238 LEHLDNSVGRLMAALDELGLRERTLVVFMSDNGGVSRSYDPKPFTEGPGTDTQLHLANAE 297

Query: 48  ASNWPLRGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
             N PLRG K S +EGG+R V  L  W  ++  G+V+E   H+ DWLPTL  +AG +
Sbjct: 298 FPNAPLRGWKGSPYEGGIR-VPCLVRWPGVVAAGRVVEAPAHVVDWLPTLLEVAGAK 353


>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
 gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
          Length = 488

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 58/101 (57%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE++G V  +L+  GL  N+L+VF +DNGG        H     N+PLRG K + 
Sbjct: 239 MVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNGG-------IHTAGGYNYPLRGEKNTL 291

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    +  P+  R G +   LMH+TDW PTL  +AG
Sbjct: 292 WEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAG 332


>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
          Length = 545

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 44/112 (39%), Positives = 69/112 (61%), Gaps = 13/112 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D+++G V+ +  +   +GN +VVF +DNGG    + H  GN   N+PLRG+K + 
Sbjct: 243 MVTAMDDAIGAVMAAFRKHKFMGNLIVVFTTDNGG----AAHIVGN---NFPLRGSKTTV 295

Query: 61  HEGGVRNVAALWSP-LLRK-GQVLENLMHITDWLPTLYFIAG----KEKNID 106
            EGG R V+ ++S  LL+K G   + L H TDW PT+  +AG    +EK +D
Sbjct: 296 WEGGTRAVSFVYSKNLLKKTGYTHDGLFHATDWFPTILAVAGGKPKQEKQLD 347


>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
          Length = 477

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 43/106 (40%), Positives = 59/106 (55%), Gaps = 9/106 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD++V  +   LE+ G + N L+VF +DNGGP           A+NWPLRG K + 
Sbjct: 243 MVSALDDAVYNITTVLEDLGFMDNLLLVFTTDNGGPPTRG-------ANNWPLRGGKNTL 295

Query: 61  HEGGVRNVAALWSPLL--RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG R  A ++S  L  +KG     L H  DW PTL  IAG + +
Sbjct: 296 WEGGTRGAAFVYSKTLLKKKGYTHPGLFHAVDWYPTLLDIAGGDSS 341


>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
          Length = 328

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 5/103 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG + ++L+  G+  N+++VF SDNGG  V     HG + +N PLRG K ++
Sbjct: 228 MVNVLDDIVGNITRALKANGMWNNTVLVFSSDNGG-AVPGAPQHG-SMNNHPLRGGKVAY 285

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A + SPLL K   G  L  L+HI+DW  T   +AG
Sbjct: 286 FEGGVRTAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLAG 328


>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 42/108 (38%), Positives = 62/108 (57%), Gaps = 13/108 (12%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D  +G V++ L+E+G   ++++V  SDNGG T +        ASNWPLRG K    EGG
Sbjct: 246 VDHFIGEVVKKLKEKGFWKDTILVITSDNGGQTREG-------ASNWPLRGRKGDVFEGG 298

Query: 65  VRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAG---KEKNID 106
           +R+ A + SP L    KG    ++ H+TDW PTL   +G    + N+D
Sbjct: 299 IRSRAFIHSPKLPNSLKGSSFPHVFHVTDWFPTLLRFSGCQQPDSNLD 346


>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 469

 Score = 78.2 bits (191), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            ++S+DE++G++IQSLE + +  N+ V F+SDNGG  +       N   N PLRG K S 
Sbjct: 250 QVQSMDEAIGKIIQSLENQQVCDNTFVFFVSDNGGSVM-----RANRGDNRPLRGGKNSL 304

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGGVR  A + + P L   Q +  +  I D  PTL  +AG E
Sbjct: 305 YEGGVRVPAIVSYPPKLSANQEINQIFSIVDLYPTLAKLAGVE 347


>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
          Length = 1542

 Score = 77.8 bits (190), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 8/99 (8%)

Query: 1    MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            M+  LDE+V  +  +L++ G   N ++V+ SDNGG  +          SNWPLRG+K S+
Sbjct: 1247 MVSCLDEAVRNLTLALKQYGYYDNMVMVYSSDNGGQPMAG-------GSNWPLRGSKGSY 1299

Query: 61   HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFI 98
             EGG+R V  + SPLL +KG   + L+HITDW PTL  +
Sbjct: 1300 WEGGIRAVGFVHSPLLVKKGTKSKALIHITDWYPTLVML 1338


>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
          Length = 514

 Score = 77.4 bits (189), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 63/103 (61%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  +D +VG +++ L+ + +  N+L+VFISDNGG     + + GN   NWPLRG K + 
Sbjct: 247 MLAVMDHAVGLIVKKLKMKDMWRNTLLVFISDNGG----QILYGGN---NWPLRGNKNTL 299

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
            EGG R  A +  PL+R  G+   +++H+ DW PTL   AG E
Sbjct: 300 FEGGTRVPAFVAGPLIRNGGRNSSSIIHVVDWFPTLLSAAGGE 342


>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
 gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
          Length = 598

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +D++VG V ++L++ GL  N++++F +DNGG  VD          NWPLRG K S 
Sbjct: 346 MTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGG-RVD------RGGINWPLRGWKGSL 398

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR V  + SPL++ KG+  + L+HI+DW PTL  +A    N
Sbjct: 399 WEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASGSTN 443


>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 61/102 (59%), Gaps = 10/102 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D  +G V++ L+E+G   ++++V  SDNGG T +        ASNWPLRG K    EGG
Sbjct: 246 VDHFIGEVVKKLKEKGFWKDTILVITSDNGGQTREG-------ASNWPLRGRKGDVFEGG 298

Query: 65  VRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKEK 103
           +R+ A + SP L    KG   +++ H+TDW PTL   +G ++
Sbjct: 299 IRSRAFIHSPKLPDSLKGSSFQHVFHVTDWFPTLLRFSGCQQ 340


>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 530

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 39/98 (39%), Positives = 57/98 (58%), Gaps = 8/98 (8%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFH----GNTASNWPLRGTKYSFH 61
           D SVGRV+++L+  G   +S+++F SDNG         H     NT SNWPLRG K ++ 
Sbjct: 263 DRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESNTGSNWPLRGMKATYW 322

Query: 62  EGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTL 95
           EGG R  A + +P        G + + ++HITDW+PT+
Sbjct: 323 EGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTI 360


>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
 gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
          Length = 495

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 47/119 (39%), Positives = 67/119 (56%), Gaps = 15/119 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT--------ASNWP 52
           M+ ++D+ +G+V+ +LE+ G+  N+L+VF SDNGGP   S  F G          A N P
Sbjct: 260 MITAMDDQIGQVVAALEKRGMRNNTLIVFQSDNGGPR--SAQFTGEVDTSGGTIPADNGP 317

Query: 53  LRGTKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
            R  K S +EGG R VA A W   ++ G V+ + +HI D  PTL  +A    GK K +D
Sbjct: 318 YRDGKASLYEGGTRVVALANWPGKIQPGTVVNHPIHIVDMYPTLTGLASVSVGKNKPLD 376


>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 635

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 42/99 (42%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D +V  +++++   G   NS+V+F SDNG     +        SNWPLRG+K++ +EGG
Sbjct: 363 MDRAVESIVKTMHAVGAYDNSIVLFASDNGATLAQT-----GGGSNWPLRGSKFTPYEGG 417

Query: 65  VRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
           VR  A L SPLL   R+G     L H+TD LPTL   AG
Sbjct: 418 VRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAG 456


>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
          Length = 898

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 65/105 (61%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G + +  +++G+  N+L+VF +DNGG  ++         +N+PLRG K S 
Sbjct: 645 MVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNGGQILEG-------GNNYPLRGWKASL 697

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+  V  +   +L RKG V + LMH+TDW PTL  +AG   N
Sbjct: 698 WEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLAGGNLN 742


>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 500

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 61/98 (62%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+VG+++ SL++ G+  N++++F +DNGG             +NWPLRG K S 
Sbjct: 252 MMTCLDEAVGKIVHSLQQAGVWDNTVLIFSTDNGGEVAAG-------GNNWPLRGWKRSI 304

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
            EGG+R V  + S LL    +G V + L+H++DW PTL
Sbjct: 305 WEGGMRGVGFVNSRLLPASVQGTVNKQLIHVSDWFPTL 342


>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
 gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
          Length = 470

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 62/100 (62%), Gaps = 9/100 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            + ++DE+VG V  +L+  GL  N++++F +DNG           +T SNWPLRG K + 
Sbjct: 218 QVTAMDEAVGNVTNALKNSGLWENTVLIFSTDNGA--------RRHTGSNWPLRGWKNTL 269

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGGVR V  + S LL RKG+  + L+HI+DW PTL  I+
Sbjct: 270 WEGGVRGVGFVNSNLLKRKGKTSDALIHISDWFPTLLRIS 309


>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
 gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
          Length = 478

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 9/100 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG V  +L+  GL  N++++F +DNG           +  SNWPLRG K + 
Sbjct: 227 MVTAMDEAVGNVTDALKGSGLWDNTVLIFSTDNGA--------RRSAGSNWPLRGWKNTL 278

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIA 99
            EGG+R V  + S LL RKG   + L+HI+DW PTL  +A
Sbjct: 279 WEGGIRGVCFVTSNLLERKGTKSDALIHISDWFPTLIHLA 318


>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
          Length = 370

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD  VG ++  L+  GL+ N+ ++F+SDNG    + L        N P RG K S 
Sbjct: 217 LVSGLDRMVGGIVDGLKRNGLMNNTYIIFVSDNGADPEEGL--------NDPFRGGKGSL 268

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG ++ + ++SPLL K     N LMHITDW+PT+  +AG
Sbjct: 269 FEGGTKSASFIYSPLLNKAGYENNGLMHITDWMPTILKLAG 309


>gi|313211935|emb|CBY17782.1| unnamed protein product [Oikopleura dioica]
          Length = 342

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 62/101 (61%), Gaps = 10/101 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D  +G ++ +L+++G   N+++V  SDNGG   +        ASNWPLRG+K +  EGG
Sbjct: 50  VDHYIGLIVNALKKKGFWQNTVMVVTSDNGGAREEG-------ASNWPLRGSKGTILEGG 102

Query: 65  VRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKE 102
           +R  A L SP++    KG+    +MH+TDW PTL  ++G E
Sbjct: 103 IRGRAFLHSPIIPENLKGKSFPYVMHVTDWYPTLLRMSGCE 143


>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
 gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
          Length = 471

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 10/96 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD S+GRV+++LE++GL  N++V+F SDNGG            +  WPL G K   
Sbjct: 257 MVESLDASIGRVLKALEDKGLAENTIVIFTSDNGGERF---------SKTWPLTGQKTEL 307

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R  A + W   +  GQV E +    DWLPTL
Sbjct: 308 LEGGIRVPAIVRWPAKIAAGQVSEQVAISMDWLPTL 343


>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
          Length = 503

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+V  V  + ++ GL  N+++VF +DNGG   +         +NWPLRG K S 
Sbjct: 255 MVSCMDEAVKNVTDAFKKYGLWDNTILVFSTDNGGQVAEG-------GNNWPLRGWKGSL 307

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            +GG+R V  + S LL+ KG V   LMH++DW PTL   AG
Sbjct: 308 WDGGMRGVGFVHSTLLKQKGAVAHQLMHVSDWFPTLLNQAG 348


>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 588

 Score = 76.3 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 10/99 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           ++  LDE+V  V ++L+  GL  NS+++F +DNGG P   S+   GN   NWPLRG K +
Sbjct: 321 LVNILDEAVRNVTEALKYNGLYDNSVIIFTTDNGGRPKPRSV---GN---NWPLRGGKST 374

Query: 60  FHEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTL 95
             EGG+R V  + SPL+    +G V   L+H++DW PT+
Sbjct: 375 LWEGGIRGVGFVHSPLIPWELRGTVNRQLIHVSDWFPTI 413


>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 513

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 10/102 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE++  + ++L + G+  N++++F +DNGG T       GN   NWPLRG K S 
Sbjct: 255 MVTCMDEAIWNITKALMDTGMWDNTVLIFSTDNGGAT----SVGGN---NWPLRGGKRSM 307

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIA 99
            EGG+R V  + SPLL    +G     L+H+TDW PT   +A
Sbjct: 308 FEGGIRGVGFVTSPLLDEAVRGTENNQLIHVTDWFPTFVHLA 349


>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
          Length = 501

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD  V  +  +LE++ LL N+ ++F++DNG              SN P RGTK + 
Sbjct: 237 LITGLDGMVKTIKDTLEKKNLLNNTYIIFLADNGADV--------GIGSNEPFRGTKSTL 288

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGGV++++ ++SPLL+K G   + L+HITDW+PTL  +AG
Sbjct: 289 WEGGVKSLSFIYSPLLKKTGYENDGLIHITDWMPTLVSLAG 329


>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
          Length = 684

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 40/106 (37%), Positives = 65/106 (61%), Gaps = 8/106 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE++G +   L E G+  ++L++F SDNGG       ++GN   N+PLRG K++ 
Sbjct: 440 MVSALDEAIGNITDQLRESGMYDDTLIIFTSDNGGWP----KYYGN---NYPLRGGKFTV 492

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNI 105
           +EGG R V+ +    L+K G+  E++MH  DW  TL   AG + ++
Sbjct: 493 YEGGTRVVSFVHGAGLQKTGEAYESIMHAVDWSDTLVEAAGGKPDL 538


>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score = 75.5 bits (184), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 43/105 (40%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +D+S+  V  +L++ GL  N++ +F +DNG  T      +G    +WPL+G K S 
Sbjct: 252 MATCMDDSIADVTGALKDTGLWDNTIFIFSTDNGAATA-----YG--GCSWPLKGGKTSL 304

Query: 61  HEGGVRNVAALWSPLLRKG-QVLEN--LMHITDWLPTLYFIAGKE 102
            EGG+R V  + SPLL    Q  EN  LMH++DW PTL  +AG E
Sbjct: 305 FEGGIRAVGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAGGE 349


>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
 gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
          Length = 545

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 60/96 (62%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++D+++GRV+ +L++EG+  N++V+F SDNGG       +    A+N PLRG K   
Sbjct: 269 VVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGGAA-----YSYGGANNAPLRGGKGET 323

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R  + + W  +L  GQ+ E +M + D  PTL
Sbjct: 324 FEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTL 359


>gi|156353132|ref|XP_001622929.1| predicted protein [Nematostella vectensis]
 gi|156209565|gb|EDO30829.1| predicted protein [Nematostella vectensis]
          Length = 175

 Score = 75.1 bits (183), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 61/100 (61%), Gaps = 8/100 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G V ++L++ GL  N+L+VF SDNGG  +           NWPLRG K + 
Sbjct: 56  MVSAVDEAIGNVTKTLQQRGLWNNTLLVFSSDNGGVPLGG-------GYNWPLRGYKGNL 108

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIA 99
            EGGVR  A +   +L+ KG   + L+H++DW PT   +A
Sbjct: 109 WEGGVRAAAFVHGKMLKNKGSRSKELLHVSDWYPTFTALA 148


>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
 gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
          Length = 526

 Score = 75.1 bits (183), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 63/111 (56%), Gaps = 12/111 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D+++G+V+ +L+EEG+  N++V F SDNGG       F    + N P RG K   
Sbjct: 268 MVDSMDQAIGKVLTALDEEGVADNTIVFFFSDNGG-------FENYGSDNGPYRGGKLEV 320

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
           +EGG+R  A + W   L  G  ++ ++ + D LPTL   A    G EK ID
Sbjct: 321 YEGGIRVTAVMRWPEKLAAGSEVDEIVSVMDLLPTLTHAAGVENGTEKKID 371


>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 726

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 45/99 (45%), Positives = 58/99 (58%), Gaps = 11/99 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD+S+GR++  LE EG L NS++V  SDNGG             SN+PLRG K+S 
Sbjct: 378 VLMYLDKSIGRLVDYLETEGWLENSIIVVASDNGG-------CPSTGGSNYPLRGLKHSN 430

Query: 61  HEGGVRNVAALWSPL----LRKGQVLENLMHITDWLPTL 95
            EGGV+  A ++SP        G     LMH+TDWLPTL
Sbjct: 431 WEGGVKVPAFVYSPSHIPEEHWGTEYRGLMHVTDWLPTL 469


>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
           obscuriglobus UQM 2246]
          Length = 459

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++DE+VG+++ ++E+ G+  N+L VF+SDNGGP        G   SN   R  K + 
Sbjct: 237 MLAAMDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQ------PGVVTSNGKFRAGKGTL 290

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  A A +   +R G  +   +H+ DW PTL  + G
Sbjct: 291 YEGGVRVAACAAFDGHIRPGTTITEPLHVVDWYPTLLKLGG 331


>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
 gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
          Length = 540

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 60/107 (56%), Gaps = 7/107 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ + D S+GR+I   +  GL  N+L+++ SDNG        +      NWPLRG K S 
Sbjct: 251 MMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGY------NWPLRGYKSSL 304

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGGVR  A +   +L RKG  + +L H+TDW PTL  +AG E   D
Sbjct: 305 FEGGVRVPAFVHGEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEVEPD 351


>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
          Length = 476

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 62/101 (61%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG++    ++  +   +++ F +DNGG     +H  GN   NWPLRG K S 
Sbjct: 227 MVAAMDEAVGKIRDLFKKYEMWDETVLAFSTDNGG----QIHRGGN---NWPLRGWKISL 279

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R V  + SPLL   G   + LMH++DWLPTL  +AG
Sbjct: 280 WEGGMRGVGFVHSPLLPNSGGKSKGLMHVSDWLPTLVHVAG 320


>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 485

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 44/103 (42%), Positives = 60/103 (58%), Gaps = 7/103 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LD+SVG V+Q+LE+  L  N++V+F+SDNGGPT +       T+SN PLRG K   
Sbjct: 295 MLANLDDSVGEVLQTLEQHQLTSNTIVIFLSDNGGPTREL------TSSNAPLRGEKGDM 348

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R  V   W   +  G+V E  +   D  PT    A  E
Sbjct: 349 YEGGIRVPVLMQWPGQIPAGRVEERPIISLDLFPTCLAAAEAE 391


>gi|194919165|ref|XP_001983032.1| GG19813 [Drosophila erecta]
 gi|190647643|gb|EDV45031.1| GG19813 [Drosophila erecta]
          Length = 299

 Score = 74.3 bits (181), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 40/91 (43%), Positives = 54/91 (59%), Gaps = 4/91 (4%)

Query: 13  IQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALW 72
           + +L     L NS+V+F SDNG P+V       NT SN+PLRG K +  EGGVR   A+W
Sbjct: 1   MSTLAATDQLENSIVIFYSDNGAPSVG---MFANTGSNFPLRGQKNTPWEGGVRVAGAIW 57

Query: 73  SPLLR-KGQVLENLMHITDWLPTLYFIAGKE 102
           S  L+ +G +    +++ DWLPTL   AG E
Sbjct: 58  SSKLQARGSIFSQPLYVADWLPTLSHAAGIE 88


>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 513

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 3/100 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D ++G V+ ++   G  GN+LV F +DNGGP   +    G   SN+PLR  K++ 
Sbjct: 260 MVSAVDTAIGNVMGAIRANGFAGNTLVFFTNDNGGPINANDRSPG---SNYPLRAGKFTL 316

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR     W+P + K      L HI D LPT+   AG
Sbjct: 317 WEGGVRGTGIFWAPQVLKPGKFGGLSHIVDVLPTILSAAG 356


>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
          Length = 697

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 42/105 (40%), Positives = 61/105 (58%), Gaps = 11/105 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++  V  +L   GL  ++L++F SDNGGP     H H N   N+P RG K + 
Sbjct: 288 MVSAVDEAINNVTLALRRAGLFDDTLIIFSSDNGGP-----HDHAN---NYPFRGAKTAD 339

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN---LMHITDWLPTLYFIAGKE 102
            EGGVR VA +   ++ K  V  N   LMH+ DW  TL  +AG++
Sbjct: 340 FEGGVRAVAFVSGGVVPKNMVGTNVHGLMHVCDWYSTLARLAGED 384


>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 658

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/103 (44%), Positives = 60/103 (58%), Gaps = 12/103 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           M   +DE++G V  +L+E GL   ++ +F SDNGG PT     F G   SNWPLRG K  
Sbjct: 419 MTSCMDEAIGNVTDTLKETGLWNKTVFIFSSDNGGTPT-----FGG---SNWPLRGRKGL 470

Query: 60  FHEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIA 99
           ++EGG+R V  + S LL     G     L+HITDW PTL  +A
Sbjct: 471 WYEGGIRAVGFVNSYLLSNHVTGSTNHELIHITDWFPTLLRLA 513


>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
          Length = 491

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G +    ++ G+  N+L+VF +DNGG     +H  GN   N+PLRG K S 
Sbjct: 241 MVSAMDEAIGNLTDVFKKRGMWNNTLMVFSTDNGG----QIHAGGN---NFPLRGWKDSL 293

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGV  V  +   +L RKG V  +L+H++DW PTL  + G   N
Sbjct: 294 WEGGVHGVGLVHGQMLKRKGSVSRDLIHVSDWFPTLISMVGGTLN 338


>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
          Length = 503

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 63/105 (60%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE V  + Q+L+++GL  N++++F +DNGG     +H  GN   N+PLRG K S 
Sbjct: 252 MVSALDEGVANLTQALKDKGLWNNTVLIFSTDNGG----QIHAGGN---NYPLRGWKASL 304

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG   V  +    L R G V + L+H++DW PTL  +AG   N
Sbjct: 305 WEGGFHGVGFVSGGALKRSGAVSKGLIHVSDWFPTLVTLAGGNLN 349


>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
          Length = 540

 Score = 73.9 bits (180), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 43/117 (36%), Positives = 66/117 (56%), Gaps = 19/117 (16%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN-------------- 46
           M+  +DE+V  V  +L+  G+  ++L+V+ SDNG PT+ S  F+G               
Sbjct: 257 MISGVDEAVRNVTDALKAAGMWNDTLLVWTSDNGSPTMGS--FNGGCKNVVPDSRGGACM 314

Query: 47  TASNWPLRGTKYSFHEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
             SN+PLRG+K++  EGGVR    +  P+L    +G+ L+ LMH+ DW  TL  +AG
Sbjct: 315 CGSNYPLRGSKHTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVAG 371


>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
          Length = 617

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 46/110 (41%), Positives = 62/110 (56%), Gaps = 13/110 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD ++GR+I+S ++ G   N++  F SDNGG    SL +  N   N P RG K   
Sbjct: 374 MVTGLDRAIGRIIESFKKYGKYKNTIFAFASDNGG----SLTYRAN---NMPFRGGKNDL 426

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGK--EKNI 105
            EGGVR ++ L S L+   + G    NL  ITDW PTL   AGK  +KN+
Sbjct: 427 LEGGVRTLSFLHSSLIPNDKIGSRYSNLFDITDWFPTL-LDAGKCSQKNL 475


>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 476

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 44/111 (39%), Positives = 66/111 (59%), Gaps = 7/111 (6%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV-DSLHFHGNTASNWPLRGTKYSF 60
           ++S+DE++G +IQSL+ + +  N+L+VF SDNGG  V + + F G    N PLRG K + 
Sbjct: 249 VQSMDEAIGSIIQSLQAKQVWNNTLLVFTSDNGGEIVANDIRFTGR-GDNRPLRGGKRNL 307

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
           +EGGVR    +  P  L  G+ +E +  I D  PT   +AG    +E+ ID
Sbjct: 308 YEGGVRVPTIISYPGHLSSGKTVEQMFSIVDLYPTFAKLAGLKINQEQQID 358


>gi|167534419|ref|XP_001748885.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772565|gb|EDQ86215.1| predicted protein [Monosiga brevicollis MX1]
          Length = 690

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 41/104 (39%), Positives = 58/104 (55%), Gaps = 10/104 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT---VDSLHFHG--NTASNWPLRG 55
           M+ ++DE++G +  +L    +  N+L++F +DNGGP     D     G    + N+PLRG
Sbjct: 287 MVAAMDEAIGNITSALHARSMFDNTLIIFTTDNGGPVGSHNDKPQGIGCATGSQNYPLRG 346

Query: 56  TKYSFHEGGVRNVA----ALWSPLLRKGQVLENLMHITDWLPTL 95
            K ++ +GGVR  A    A   P LR G    NLMH  DWLPTL
Sbjct: 347 GKGAYFQGGVRGTAWVHGAQVHPALR-GTTNYNLMHAVDWLPTL 389


>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
          Length = 318

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 60/101 (59%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +  +LD  VG+++Q L++  ++ N+ ++F+SDNG               N P RG K S 
Sbjct: 218 LTSALDSMVGKIVQHLKDNCMMDNTYLIFMSDNGADP--------GEGQNTPFRGGKTSL 269

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG ++ + ++S LL+K +   + LMHITDWLPTL  +AG
Sbjct: 270 FEGGTKSNSFIYSQLLKKTEYENDGLMHITDWLPTLVKLAG 310


>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
          Length = 571

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 41/108 (37%), Positives = 65/108 (60%), Gaps = 10/108 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + ++D  +G+V+ +L++  LL N+L+VF SDNGG T    HF   +A+N PLR  K + +
Sbjct: 261 ITAMDTGIGKVVDALKQHNLLDNTLIVFQSDNGGDT----HF---SANNGPLRDRKTTLY 313

Query: 62  EGGVRNVAALWSP--LLRKGQVLENLMHITDWLPTLYFIAGKEK-NID 106
           EGGV+  +  + P  L    +  ++L HI+DW  T+   AG E  N+D
Sbjct: 314 EGGVKVPSMAYGPGLLTNTPRTSDDLFHISDWFTTILSAAGLEAGNVD 361


>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 467

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/113 (40%), Positives = 62/113 (54%), Gaps = 12/113 (10%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGG--------PTVDSLHFHGNTA--SNWPLR 54
           LD+SVG+++ +LE+ G  GN+LV+F SDNG         P     H+    A  SN PL 
Sbjct: 255 LDDSVGKILVALEKTGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYPPGPAGGSNEPLH 314

Query: 55  GTKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           G K   +EGG+   A A W   L+ G+ L  L HITDW+PT   +AG +   D
Sbjct: 315 GMKGEVYEGGIHTAAVARWPGQLKPGKFL-GLAHITDWMPTFCALAGYKPEKD 366


>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
          Length = 784

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDESV  +  +L ++GLL N+++VF +D GG          ++ASN+PLRG+K + 
Sbjct: 214 MVLALDESVLNITIALAQKGLLDNTVIVFTTDAGGAVGGQ---ELSSASNFPLRGSKLTV 270

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG+R VA ++S L++ KG+V   + H TDWL T   +AG
Sbjct: 271 WEGGIRAVAFVYSDLIKNKGRVSLEMYHSTDWLLTTLGLAG 311


>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
 gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
          Length = 485

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG V  ++++ GL  N+LV+F +DNGG       +   + +N+PLRG K + 
Sbjct: 200 MVSALDEAVGNVTMAMKKAGLWDNTLVIFTTDNGG-------WISASGNNYPLRGGKVTL 252

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG R VA     +L+K G     ++H  DW PT+   AG
Sbjct: 253 WEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTILAAAG 293


>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
 gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
          Length = 474

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 61/101 (60%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D ++GRV+Q+L++EGL  N+LV+F +DNGGP+        N + N PL GTK + 
Sbjct: 261 MMLNMDRAIGRVLQTLQQEGLSENTLVIFTNDNGGPS------DHNGSINLPLSGTKANL 314

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  +   W  + + G   + ++   D LPT Y  AG
Sbjct: 315 LEGGIRVPMIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAAG 355


>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
 gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
          Length = 353

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 61/105 (58%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+++ RV  +L+ +GL  N+L +F+SDNGG  +D         SNWPLRG K + 
Sbjct: 214 MVAALDDAIKRVTDALQGKGLWNNTLTIFMSDNGGDYLDG-------QSNWPLRGAKGTV 266

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG R  A A  + L R G     +MH  D LPTL  +AG  ++
Sbjct: 267 WEGGTRVPAFAHGNMLERTGYTYHGMMHGVDILPTLVSVAGGTED 311


>gi|313212374|emb|CBY36362.1| unnamed protein product [Oikopleura dioica]
          Length = 247

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/99 (38%), Positives = 61/99 (61%), Gaps = 11/99 (11%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D ++GRV+++L+   +  NS++VF +DNGG +          ++NWP +  K S  EGG
Sbjct: 101 MDRAIGRVVETLQRANMWENSIIVFANDNGGKS--------GLSNNWPFQRAKSSILEGG 152

Query: 65  VRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAG 100
           +R  + + S LL K   G+  +NLM +TDWLPT+  +AG
Sbjct: 153 IRVPSFIHSNLLPKSRLGERTDNLMDMTDWLPTMLSMAG 191


>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
 gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
          Length = 494

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG V  ++++ GL  N+L++F +DNGG  + S        +N+PLRG K + 
Sbjct: 242 MVSALDEAVGNVTMAMKKTGLWDNTLLIFSTDNGGMVIQS-------GNNYPLRGGKTTL 294

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG R VA     +L+K G     ++H  DW PTL   AG
Sbjct: 295 WEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTLLAAAG 335


>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
          Length = 446

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 56/106 (52%), Gaps = 9/106 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD +VG+V+ +L+E+GL  N++VVF SDNGG            +  WP  G K   
Sbjct: 234 MVRSLDGAVGQVLHALQEQGLADNTIVVFTSDNGGERF---------SKTWPFTGQKTEL 284

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L  P     QV + +    DWLPTL   AG   + D
Sbjct: 285 LEGGIRVPTLLRWPARIAPQVSDQVTATMDWLPTLLAAAGVAPDAD 330


>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 558

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 44/101 (43%), Positives = 63/101 (62%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  V  +L + G   N+++ F +DNGG  +    F G   SNWPLRG K ++
Sbjct: 258 MVSAVDEAVRNVTYALRKYGYYKNTVIFFSTDNGGQPL----FGG---SNWPLRGRKGTY 310

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SP L R+ +V   L+HITDW PTL  +AG
Sbjct: 311 WEGGVRGIGFVHSPLLRRRRRVSRALIHITDWYPTLMRLAG 351


>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 45/118 (38%), Positives = 68/118 (57%), Gaps = 19/118 (16%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV------DSLH-------FHGNT 47
           M+K LD+++  +   ++++G+  N+L+VF SDNGGP +      D+ H       ++G  
Sbjct: 289 MVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRGYMCYNGEA 348

Query: 48  -ASNWPLRGTKYSFHEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
            A+N+PLRG KYSF EGG+R V A  S    P  ++G     +MHI DW  T   +AG
Sbjct: 349 GANNYPLRGGKYSFFEGGIR-VNAFVSGGLIPADQRGSNRTGIMHIADWYATFCALAG 405


>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
          Length = 506

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 60/95 (63%), Gaps = 8/95 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M KS+D++VGR+I S + +G+  +++V F +DNGG T+    + GN   NWPLRG K + 
Sbjct: 237 MTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGGN---NWPLRGLKNTL 289

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R + ++  P +   +  + LMH+ D  PTL
Sbjct: 290 WEGGIRGIGSIKIPGVSADK-RDQLMHVVDMFPTL 323


>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
          Length = 1683

 Score = 72.4 bits (176), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 60/95 (63%), Gaps = 8/95 (8%)

Query: 1    MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            M KS+D++VGR+I S + +G+  +++V F +DNGG T+    + GN   NWPLRG K + 
Sbjct: 1414 MTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGGN---NWPLRGLKNTL 1466

Query: 61   HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
             EGG+R + ++  P +   +  + LMH+ D  PTL
Sbjct: 1467 WEGGIRGIGSIKIPGVSADK-RDQLMHVVDMFPTL 1500


>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
           caulinodans ORS 571]
 gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
           caulinodans ORS 571]
          Length = 490

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 46/113 (40%), Positives = 67/113 (59%), Gaps = 11/113 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT----VDSLHFHGN-TASNWPLRG 55
           M+ ++D+ +G+V+ +LE+ G+  N+L+VF SDNGG      V    FHG   ASN P R 
Sbjct: 259 MISAMDDQIGKVVAALEQRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELPASNAPYRD 318

Query: 56  TKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            K + +EGG R VAAL  W   +  G   + +MH+ D +PTL  +AG   N+D
Sbjct: 319 GKGTLYEGGTR-VAALANWPGRITPGAA-DGVMHVVDMMPTLARLAG--ANLD 367


>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 539

 Score = 72.0 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 40/103 (38%), Positives = 60/103 (58%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D+S+GRV+ +L+ EGL  N++V+F SDNGG       +    A N PLRG K   
Sbjct: 263 VVDGMDQSIGRVLDTLDAEGLTDNTIVLFFSDNGGAV-----YAIGGADNAPLRGGKGDT 317

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R VA + W   +  G  ++++M + D  PTL   AG E
Sbjct: 318 FEGGIRVVATMRWPEKIAPGGKVDSIMSVMDVFPTLLAAAGIE 360


>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
 gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
          Length = 452

 Score = 72.0 bits (175), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 40/106 (37%), Positives = 57/106 (53%), Gaps = 9/106 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD++VG+V+++L+ +GL  N++VVF SDNGG            +  WP  G K   
Sbjct: 240 MVRSLDKAVGQVLEALKAQGLADNTIVVFTSDNGGERF---------SKTWPFTGQKTEL 290

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L  P     QV + +    DWLPTL   AG   + D
Sbjct: 291 LEGGIRVPTLLRWPARIAPQVSDQVTATMDWLPTLLAAAGVAPHAD 336


>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
 gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
          Length = 500

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 60/101 (59%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDESVGRV+  L+E GL  N++++F SDNGG +       G   SN PLR  K   
Sbjct: 278 MMQSLDESVGRVLDKLKELGLDKNTVIIFTSDNGGLSTA----EGAPTSNMPLRAGKGWP 333

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  +   W  + +   V ++ +  TD+ PTL  IAG
Sbjct: 334 YEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPTLLEIAG 374


>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 60/103 (58%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G +++ L+E+ +  N+++V  +DNGG            ASN+PLRG KY+F
Sbjct: 236 MVSVADNVTGHLVELLKEKEMWDNTIMVISADNGGDQC--------VASNYPLRGCKYTF 287

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR++A +   LL   R+GQ  +  +HI+DW  T   +AG
Sbjct: 288 FEGGVRSLAFVNGGLLPESRRGQSTDGFIHISDWYTTFCKLAG 330


>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 389

 Score = 71.6 bits (174), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 63/107 (58%), Gaps = 11/107 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G V++ L+++G+  N+L+V  +DNGG   +         SN+PL+G+K +F
Sbjct: 236 MVSVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPCE--------GSNYPLKGSKMTF 287

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEKN 104
           +EGGVR++A +   LL   R+G+  E  +HI DW  T   +AG + +
Sbjct: 288 YEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLAGVDSD 334


>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score = 71.2 bits (173), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G +++ L+++G+  N+++V  +DNGG             SN+PL+G K +F
Sbjct: 235 MISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPC--------AGSNYPLKGCKTTF 286

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR+VA +   LL   R+GQ  +  +HI DW PT   +AG
Sbjct: 287 FEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLAG 329


>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
 gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
          Length = 357

 Score = 71.2 bits (173), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LD+++ RV  +L+ +GL  N+L +F+SDNGG  V          SNWPLRG K + 
Sbjct: 223 MATALDDAIKRVTDALKGKGLWDNTLTIFMSDNGGNYVQG-------QSNWPLRGAKGTL 275

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +   +L R G     +MH  D LPTL  +AG
Sbjct: 276 WEGGTRVPAFVHGNMLERTGYTYHGMMHGVDILPTLVSVAG 316


>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 471

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 42/104 (40%), Positives = 58/104 (55%), Gaps = 7/104 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LD + G V+  L+E GL  ++LVVF +DNGGPT        N +SN+PL GTK +F
Sbjct: 257 MTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPT------DKNASSNYPLAGTKSNF 310

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
            EGG+R      W   L  G+V +  +   D LPT +   G E+
Sbjct: 311 LEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKAGGGEE 354


>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 551

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 58/96 (60%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++D+++G+V+ +L+EEGL  N++V+F SDNGG       +    A N PLRG K   
Sbjct: 272 VVDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAA-----YSYGGADNAPLRGGKGET 326

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R V+ + W  +L   Q  + +M + D  PTL
Sbjct: 327 FEGGIRVVSLMRWPGVLEPAQSFDQIMTVMDVFPTL 362


>gi|260824679|ref|XP_002607295.1| hypothetical protein BRAFLDRAFT_88244 [Branchiostoma floridae]
 gi|229292641|gb|EEN63305.1| hypothetical protein BRAFLDRAFT_88244 [Branchiostoma floridae]
          Length = 226

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG V  ++++ GL  N+LV+F +DNGG       +   + +N+PLRG K + 
Sbjct: 16  MVSALDEAVGNVTMAMKKAGLWDNTLVIFTTDNGG-------WISASGNNFPLRGGKTTL 68

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG R VA     +L+K G     ++H  DW PT+   AG
Sbjct: 69  WEGGTRGVAFAHGKMLQKTGYTNNEMIHAVDWFPTILEAAG 109


>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 706

 Score = 70.9 bits (172), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 39/100 (39%), Positives = 60/100 (60%), Gaps = 13/100 (13%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           +L  LD ++G +++ LE  G + NS+++  SDNGG P+           SN+PLRG K+S
Sbjct: 407 VLMYLDYTIGELVEYLETNGWMENSIIIVASDNGGCPSCG--------GSNYPLRGIKHS 458

Query: 60  FHEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTL 95
           + EGG +  + ++S    P  R G   + LMH+TDWLPT+
Sbjct: 459 YWEGGTKVPSFVYSKSHIPEERWGTEYDGLMHVTDWLPTI 498


>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
 gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
          Length = 487

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE+VG V  +++  GL  N+LV+F +DNGG  + S        +N+PLRG K + 
Sbjct: 243 MVSALDEAVGNVTMAMKNAGLWDNTLVIFTTDNGGWIIAS-------GNNYPLRGGKVTL 295

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTL 95
            EGG R VA +   +L+K G     ++H  DW PT+
Sbjct: 296 WEGGTRGVAFVHGKMLQKTGYTNNEMIHAVDWFPTI 331


>gi|149176385|ref|ZP_01854999.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148844737|gb|EDL59086.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 498

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 61/101 (60%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D++VG+V+Q LEE G+  N++V+  +DNGG +       G+  SN PLRG K   
Sbjct: 283 MVESMDKAVGKVLQQLEESGVAENTVVMLTADNGGLSTS----EGSPTSNLPLRGGKGWL 338

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R V  + W      G V +  +  TD+ PT+  +AG
Sbjct: 339 YEGGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTILDLAG 379


>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 461

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD+++GRV+ +L   G L N+LVVF SDNGG       +    A+N   RG K  F
Sbjct: 256 LIEHLDDNIGRVLATLRANGQLANTLVVFTSDNGG-------WGPGKANNGQFRGVKGQF 308

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    A+WS  +  G     L+ + DW PTL   AG
Sbjct: 309 YEGGIRIPTGAMWSGHIEAGSESGALLQLMDWFPTLLQAAG 349


>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 564

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 43/116 (37%), Positives = 60/116 (51%), Gaps = 15/116 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML + DES+G V ++L++ G+  N+LVVF +DNGGPT          +SN+P RG K + 
Sbjct: 296 MLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCA---AQGSSNYPKRGGKCTV 352

Query: 61  HEGGVRNVAALWSPLL------RKGQVLENL------MHITDWLPTLYFIAGKEKN 104
           +EGG      +  P        RK +  E L       H+ DWLPTL  + G   N
Sbjct: 353 YEGGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLARMTGATPN 408


>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 528

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 3/100 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D S+G ++  +   G   N+++   +DNGGPT        + +SN PLRG KY+ 
Sbjct: 252 MVSAMDASIGELMHGISTNGFAENTIIFLTNDNGGPTNG---MASSGSSNHPLRGCKYTL 308

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG R  A  W P        + + HITD LPTL   +G
Sbjct: 309 WEGGTRGSALFWYPKKLGAGTYKGMAHITDVLPTLLTASG 348


>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 519

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 60/107 (56%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G + Q+L+E  L  N++++F SDNGG   D         +NWPLRG K + 
Sbjct: 239 MVTCMDEGIGNITQTLKETELWNNTIIIFSSDNGGDPNDG-------GNNWPLRGEKGTH 291

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAGKEKN 104
            EG +  +  ++S  + K   G V   ++H++DW PT+  +AG   N
Sbjct: 292 WEGAIHGLGFVYSSDIAKDVRGTVNTEMIHVSDWFPTIVNLAGGSLN 338


>gi|149177349|ref|ZP_01855954.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
           8797]
 gi|148843874|gb|EDL58232.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
           8797]
          Length = 472

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 66/109 (60%), Gaps = 9/109 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML S+D+S+G++++ +++ GL   +L+VF+SDNGGPT +       T+SN PLRG K S 
Sbjct: 279 MLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPTREL------TSSNLPLRGEKGSM 332

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
           +EGG+R    + W+  L   Q ++  +   D  PT   +AG    +N+D
Sbjct: 333 YEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPTSVALAGASLPQNLD 381


>gi|260821541|ref|XP_002606091.1| hypothetical protein BRAFLDRAFT_88001 [Branchiostoma floridae]
 gi|229291429|gb|EEN62101.1| hypothetical protein BRAFLDRAFT_88001 [Branchiostoma floridae]
          Length = 311

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 61/101 (60%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G V  ++++ GL  N+L++F +DNGG  +    F GN   N+PLRG K + 
Sbjct: 75  MVSAMDEAIGNVTMAMKKAGLWDNTLLIFTADNGGWPL----FSGN---NFPLRGGKITL 127

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +   +L+K G   + ++H  DW PT+   AG
Sbjct: 128 WEGGTRAAAFVHGKMLQKTGYTSDEMIHAVDWFPTILAAAG 168


>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G + ++L+ +G+  ++L++F +DNGG    ++ F GN   N+PLRG K + 
Sbjct: 238 MVTAMDEAIGNITKALKTKGIFNDTLIIFTTDNGG----AVKFAGN---NYPLRGGKATL 290

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
            EGG R  A +    ++K       L+H  DW+PT+   AG
Sbjct: 291 FEGGTRATAFVTGAGIQKFNYEYSGLIHAVDWMPTVLSAAG 331


>gi|32473691|ref|NP_866685.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
           SH 1]
 gi|32444227|emb|CAD74224.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
           SH 1]
          Length = 480

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LD  VGR+I+ L+++ L  ++LVVF SDNGGPT +       T+SN PLRG K S 
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340

Query: 61  HEGGVRNVAALWS 73
           +EGGVR +  +WS
Sbjct: 341 YEGGVR-IPMIWS 352


>gi|440718712|ref|ZP_20899155.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SWK14]
 gi|436436039|gb|ELP29830.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SWK14]
          Length = 480

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LD  VGR+I+ L+++ L  ++LVVF SDNGGPT +       T+SN PLRG K S 
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340

Query: 61  HEGGVRNVAALWS 73
           +EGGVR +  +WS
Sbjct: 341 YEGGVR-IPMIWS 352


>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 498

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 46/107 (42%), Positives = 66/107 (61%), Gaps = 6/107 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D++V  VI +LE   L  N+L+VF +DNGGP   +     N  +NWPLRG+K + 
Sbjct: 247 MVTAMDDAVKVVIAALERYELWDNTLLVFTTDNGGPANPN-----NAGNNWPLRGSKLTL 301

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R V  +   +L +KG V   +MH+TDW PTL  IAG + + D
Sbjct: 302 WEGGTRGVGFVHGKMLEKKGYVNNEMMHVTDWYPTLLHIAGGKADSD 348


>gi|323448512|gb|EGB04410.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 572

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 41/98 (41%), Positives = 56/98 (57%), Gaps = 8/98 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++  D+ VGR++ +LE  G+L N++V F SDNGG  + +    GN   NWPL G K+S 
Sbjct: 292 MVRYADDEVGRLVDALERRGMLANAVVTFHSDNGGEIMGAGACGGN---NWPLTGGKFSN 348

Query: 61  HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPT 94
            EGG+R V A  S    P  R+G  L  L  + DW  T
Sbjct: 349 FEGGIR-VNAFVSGGAVPEARRGAALSQLSTLWDWYAT 385


>gi|421613374|ref|ZP_16054460.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
 gi|408495968|gb|EKK00541.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
          Length = 480

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LD  VGR+I+ L+++ L  ++LVVF SDNGGPT +       T+SN PLRG K S 
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340

Query: 61  HEGGVRNVAALWS 73
           +EGGVR +  +WS
Sbjct: 341 YEGGVR-IPMIWS 352


>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G + ++L+ +G+  ++L++F +DNGG     + F GN   N+PLRG K + 
Sbjct: 238 MVTAMDEAIGNITKALKTKGMFNDTLIIFTADNGG----WITFAGN---NYPLRGGKATV 290

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
            EGG R  A +    ++K   V   ++H  DW+PT+   AG
Sbjct: 291 FEGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAG 331


>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 459

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 41/100 (41%), Positives = 58/100 (58%), Gaps = 1/100 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD+SVG+++ +L+   L  N+LV+F SDNGG      +F  N +SN PLRG K + 
Sbjct: 263 MIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGGYLTYGKNFQ-NISSNGPLRGQKATL 321

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG R    +  P +    V +   H  D LPTL   AG
Sbjct: 322 YEGGHRVPCLISWPGVITAGVTDQTAHSVDLLPTLAQAAG 361


>gi|417302808|ref|ZP_12089892.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327540882|gb|EGF27442.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 480

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 37/73 (50%), Positives = 50/73 (68%), Gaps = 7/73 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LD  VGR+I+ L+++ L  ++LVVF SDNGGPT +       T+SN PLRG K S 
Sbjct: 287 MLIALDRGVGRIIEKLDQQKLRQDTLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340

Query: 61  HEGGVRNVAALWS 73
           +EGGVR +  +WS
Sbjct: 341 YEGGVR-IPMIWS 352


>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1517

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 40/95 (42%), Positives = 58/95 (61%), Gaps = 8/95 (8%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L+ +D++VG V+ +LEE  +  NS+++F SDNG         HG   SN PLRG+K    
Sbjct: 326 LRLVDDAVGNVVAALEEARMDANSILIFASDNGANPE-----HG--GSNAPLRGSKGYLF 378

Query: 62  EGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
           EGGVR  A + +P  L +G   ++  H+TDW+PTL
Sbjct: 379 EGGVRVPAFVRAPKYLPRGATYDHPFHVTDWVPTL 413


>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
 gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
          Length = 478

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 60/103 (58%), Gaps = 5/103 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD SVG V++ LEE+G+L N+L+VF SDNG   V  +  HG +A +  LR  K   
Sbjct: 248 VIQELDWSVGEVLRKLEEKGILENTLIVFSSDNGPWLV--MEDHGGSAGH--LREGKQFT 303

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR    A+W   +  GQV E+L    DW PT   + G E
Sbjct: 304 FEGGVRVPTVAMWKGKIEPGQVYEDLATQMDWFPTFSQLVGAE 346


>gi|325110760|ref|YP_004271828.1| Cerebroside-sulfatase,N-acetylgalactosamine-6-sulfatase
           [Planctomyces brasiliensis DSM 5305]
 gi|324971028|gb|ADY61806.1| Cerebroside-sulfatase,N-acetylgalactosamine-6-sulfatase
           [Planctomyces brasiliensis DSM 5305]
          Length = 932

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 38/100 (38%), Positives = 58/100 (58%), Gaps = 9/100 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +++LD  VG ++Q+++E G   N+LV+F+SDNGG        H    +N PLRG+K++ +
Sbjct: 708 VETLDHHVGTILQAIDEAGQRENTLVIFLSDNGG--------HPEYCANAPLRGSKWNLY 759

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R    A W   +    V E  +   D LPTL  +AG
Sbjct: 760 EGGIRVPFIARWPKTIAAASVCETPVVGYDLLPTLVDVAG 799



 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 31/96 (32%), Positives = 52/96 (54%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D  +GR++  ++  GL   ++V+F SDNG   +    F      N  LRG+KY  
Sbjct: 249 MIEVMDRGIGRLLDEIDALGLAEETIVLFASDNGPDPLTGERF------NLELRGSKYQV 302

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R  +   W+  +  G V + ++   D LPTL
Sbjct: 303 YEGGLRVPLLIRWTNQVIPG-VRDQMVSFVDLLPTL 337


>gi|319899988|ref|YP_004159716.1| sulfatase [Bacteroides helcogenes P 36-108]
 gi|319415019|gb|ADV42130.1| sulfatase [Bacteroides helcogenes P 36-108]
          Length = 461

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 8/66 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D SVGR++ ++E  GL GN+LVVF SDNGGP           +  WPLR  K SF
Sbjct: 251 MVENMDMSVGRILDAVERNGLTGNTLVVFTSDNGGPY--------EISHQWPLRAGKGSF 302

Query: 61  HEGGVR 66
           +EGG+R
Sbjct: 303 YEGGIR 308


>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
          Length = 558

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G V+  L+ +G+  N+ + F SDNGG          N   N P RG K S 
Sbjct: 286 LVSGMDAVIGNVVAKLKAKGMFDNTYIFFSSDNGGDV--------NEGDNSPWRGAKSSL 337

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
            EGG    +   SPL+R KG   + L+H+TDWLPT+  +AG
Sbjct: 338 FEGGCHAHSWASSPLIRKKGVEADGLLHMTDWLPTILTLAG 378


>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
 gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
          Length = 538

 Score = 69.3 bits (168), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            ++S+DE+VGR++ +L+E+ +  N+L+ F SDNGG + D     G+   N PLRG K + 
Sbjct: 260 QVQSMDEAVGRIVAALKEKQVWDNTLLFFTSDNGGGS-DQPWTRGD---NRPLRGQKGTL 315

Query: 61  HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAG 100
           ++GGVR    +  P  L+ GQV+E +  + D  PTL  +AG
Sbjct: 316 YDGGVRVPTIISFPAQLKGGQVIEQVFSVVDLYPTLAKLAG 356


>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
          Length = 370

 Score = 69.3 bits (168), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 60/103 (58%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD  VG +++ +    +L N+ ++F SDNGG  V++L        N P RG K S 
Sbjct: 228 LVSGLDWMVGELLELVIANNMLDNTYIIFQSDNGGDAVEAL--------NGPYRGGKGSL 279

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
            EGG + V  ++SPLL+K G      MHITDW+PT+  +AG E
Sbjct: 280 FEGGSKVVNFIYSPLLKKVGYENNGWMHITDWMPTIISLAGGE 322


>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 468

 Score = 69.3 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 37/95 (38%), Positives = 55/95 (57%), Gaps = 7/95 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D++VG+V+  LE E LL N+L+ F+SDNGGPT        NT  N PLRG K + 
Sbjct: 245 MMSAMDDAVGQVVAKLEREKLLENTLIFFVSDNGGPTA------ANTGDNTPLRGFKATT 298

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPT 94
            EGG+R      W   +  G+  +  +   D++PT
Sbjct: 299 WEGGIRVPYFVSWKGKIPAGKTYDQPVIQIDFVPT 333


>gi|182677550|ref|YP_001831696.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
 gi|182633433|gb|ACB94207.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
          Length = 486

 Score = 69.3 bits (168), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 36/105 (34%), Positives = 56/105 (53%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D  +GR++Q+L+  G+  N++V+F SDNGG            A  WP  G K   
Sbjct: 260 MIAAMDLQIGRIVQALQANGISENTIVIFTSDNGGERF---------ADTWPFTGRKTEL 310

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R  A + W   ++  Q ++ +    DWLPTL   AG E +
Sbjct: 311 LEGGLRIPALVSWPARIKADQTIDQVSISMDWLPTLLAAAGSEPD 355


>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
          Length = 483

 Score = 69.3 bits (168), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G + ++L+ +G+  ++L+VF +DNGG     + + GN   N+PLRG K + 
Sbjct: 239 MVTAMDEAIGNITKALKTKGMFNDTLIVFTADNGG----WITYAGN---NYPLRGGKTTV 291

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
            EGG R  A +    ++K   V   ++H  DW+PT+   AG
Sbjct: 292 FEGGTRATAFVSGAGIQKSNYVYPGMIHAVDWMPTVLSAAG 332


>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
          Length = 492

 Score = 69.3 bits (168), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 36/79 (45%), Positives = 51/79 (64%), Gaps = 10/79 (12%)

Query: 25  SLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLR---KGQV 81
           SL+VF SDNGG   +        ASNWPLRG+K + ++GG+++ A + SPLL+   KGQ 
Sbjct: 284 SLLVFTSDNGGRPKEG-------ASNWPLRGSKGTIYDGGLKSRAFVASPLLQNRMKGQN 336

Query: 82  LENLMHITDWLPTLYFIAG 100
              L H++DW PTL  ++G
Sbjct: 337 YNYLFHVSDWFPTLLTLSG 355


>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
          Length = 548

 Score = 69.3 bits (168), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G+V+  L+E+ +  N+ + F SDNGG          +   N P RG K S 
Sbjct: 244 LVSGMDAVIGKVVAKLKEKDMFDNTYIFFSSDNGGDV--------HEGDNSPKRGAKSSL 295

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
            +GG    +   SPLL+K G   + L+HITDWLPT+  +AG E
Sbjct: 296 FDGGCHAHSWASSPLLKKKGVEADGLLHITDWLPTIVKLAGGE 338


>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
 gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
          Length = 559

 Score = 69.3 bits (168), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 24/100 (24%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG+ +++L + G+L NS+V+  SDNG PTV     H N  SN+P RG     
Sbjct: 255 MISSLDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVG---IHSNAGSNYPFRG----- 306

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
                           ++G V    +H  DWLPTL   AG
Sbjct: 307 ----------------QRGYVSNQAIHAIDWLPTLAAAAG 330


>gi|408793054|ref|ZP_11204664.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
           meyeri serovar Hardjo str. Went 5]
 gi|408464464|gb|EKJ88189.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
           meyeri serovar Hardjo str. Went 5]
          Length = 583

 Score = 68.9 bits (167), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 62/107 (57%), Gaps = 6/107 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD SVG+++ SL+EEGL  N++V+F SDNG P     ++ G    N P RG K + 
Sbjct: 362 MIRSLDRSVGKILASLKEEGLDENTIVIFTSDNGAP-----NYIGLPDVNSPYRGWKLTL 416

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            +GG+R    A W   ++ G   +N +   D LPT+   AG +  +D
Sbjct: 417 FQGGIRVPYVAKWPGHIKPGTKYQNAITNIDILPTVANAAGIKLPLD 463


>gi|443724469|gb|ELU12468.1| hypothetical protein CAPTEDRAFT_223869 [Capitella teleta]
          Length = 441

 Score = 68.9 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G+V+  L+E+ +  N+ + F SDNGG          +   N P RG K S 
Sbjct: 139 LVSGMDAVIGKVVAKLKEKDMFDNTYIFFSSDNGGDV--------HEGDNSPKRGAKSSL 190

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKE 102
            +GG    +   SPLL+K G   + L+HITDWLPT+  +AG E
Sbjct: 191 FDGGCHAHSWASSPLLKKKGVEADGLLHITDWLPTIVKLAGGE 233


>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
            queenslandica]
          Length = 2318

 Score = 68.9 bits (167), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 38/107 (35%), Positives = 56/107 (52%), Gaps = 11/107 (10%)

Query: 1    MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            M+   D   G V+  L E  +  N+L+V  +DNGG             SN+PL+G+KY+F
Sbjct: 1353 MVSVADNVTGHVVNLLRENDMWDNTLMVVSADNGGADC--------KGSNYPLKGSKYTF 1404

Query: 61   HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAGKEKN 104
             EGGVR +A     L+   R+G+  +  +HI DW  T   +AG + N
Sbjct: 1405 FEGGVRVIAFASGGLIPAGRRGKSTDGFIHIADWYTTFCKMAGVDHN 1451


>gi|398955521|ref|ZP_10676483.1| arylsulfatase A family protein [Pseudomonas sp. GM33]
 gi|398150946|gb|EJM39513.1| arylsulfatase A family protein [Pseudomonas sp. GM33]
          Length = 570

 Score = 68.6 bits (166), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 62/103 (60%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K LD+++G V++ LE+ G L N++VVF +DNG  T+ +    G T    P +G K +  
Sbjct: 305 MKQLDDNIGYVLKKLEDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R    + W  +++ G VL ++    DW+PTL  IAG +K
Sbjct: 360 EGGMRAPLVIRWPGVIKPGTVLTDIFASLDWVPTLVSIAGGKK 402


>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 474

 Score = 68.6 bits (166), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 61/108 (56%), Gaps = 8/108 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   LD + G V+  L+E G+  N+L+VF +DNGGP+        N +SN PL GTK + 
Sbjct: 261 MTLGLDRACGNVLAKLDELGVRENTLIVFTNDNGGPS------GTNASSNKPLGGTKANH 314

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA-GKEKNID 106
            EGG+R    + W   L++  V +  +   D LPT Y IA G EKNI+
Sbjct: 315 LEGGIRVPFLMSWPKQLKENTVYDYPISTLDLLPTFYQIAGGNEKNIN 362


>gi|418207595|gb|AFX62894.1| sulfatase 1A precursor, partial [Haliotis rufescens]
          Length = 143

 Score = 68.6 bits (166), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 47/85 (55%), Gaps = 10/85 (11%)

Query: 20  GLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLL--R 77
           G + N L+VF +DNGGPT        N ASNWPLRG K +  EGG R    ++S  L  +
Sbjct: 3   GFMDNLLLVFTTDNGGPT--------NIASNWPLRGGKGTLWEGGTRGAGFVYSKTLLKK 54

Query: 78  KGQVLENLMHITDWLPTLYFIAGKE 102
           KG     L H  DW PTL  IAG +
Sbjct: 55  KGYTHPGLFHAVDWYPTLLDIAGGD 79


>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1605

 Score = 68.6 bits (166), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 37/98 (37%), Positives = 57/98 (58%), Gaps = 10/98 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L +LD+SVGRV++ ++   +  ++L VF SDNGG             +N+PLRG K+  
Sbjct: 337 LLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNGGCWAQG-------GTNYPLRGGKHFL 389

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTL 95
            EGGV+  A ++S L+    + +    L H+TDW PTL
Sbjct: 390 WEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTL 427


>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
          Length = 470

 Score = 68.6 bits (166), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 62/110 (56%), Gaps = 6/110 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            + S+DE +G +++ ++E GL   ++VVF SD+G  T +  H  G +A  +  RG K+S 
Sbjct: 263 FVSSIDERIGGLLKRVDELGLREKTIVVFQSDHGHSTEERAHLGGGSAGIY--RGAKFSL 320

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG---KEKNID 106
            EGG+R  A + W   L +G+    L H  DWLPTL  +A     E +ID
Sbjct: 321 FEGGIRVPAIISWPGRLPQGEARSQLAHGCDWLPTLAALAAVPLVEPDID 370


>gi|257092922|ref|YP_003166563.1| sulfatase [Candidatus Accumulibacter phosphatis clade IIA str.
           UW-1]
 gi|257045446|gb|ACV34634.1| sulfatase [Candidatus Accumulibacter phosphatis clade IIA str.
           UW-1]
          Length = 574

 Score = 68.6 bits (166), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 39/106 (36%), Positives = 61/106 (57%), Gaps = 6/106 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ LE+ G L N++VVF +DNG   + +    GNT    P +G K +  
Sbjct: 306 MKQMDDNIGYVLKKLEDMGQLDNTIVVFTTDNGAEVI-TYPDGGNT----PFKGGKLTTW 360

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           EGG+R  A + W   ++ G VL ++    DW+PT   IAG  K  D
Sbjct: 361 EGGMRAPAVIRWPGHIKPGTVLNDIFASYDWMPTFVEIAGGAKGND 406


>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
 gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
          Length = 501

 Score = 68.6 bits (166), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 45/115 (39%), Positives = 71/115 (61%), Gaps = 15/115 (13%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
           M+  +D+ VG+V+ +LE++G+  N+++VF SDNGG  V +  F G++       ASN P 
Sbjct: 272 MISVMDDGVGKVVAALEKKGMRDNTIIVFHSDNGG--VKNSLFAGDSKVGGELPASNGPY 329

Query: 54  RGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGK--EKN 104
           R  K + +EGG R VAAL  W   ++ G V+++++H+ D  PTL  +A    EKN
Sbjct: 330 RDGKGTLYEGGTR-VAALANWPGKIKPG-VVDSMVHVVDMYPTLAAVADAKLEKN 382


>gi|426408644|ref|YP_007028743.1| arylsulfatase [Pseudomonas sp. UW4]
 gi|426266861|gb|AFY18938.1| arylsulfatase [Pseudomonas sp. UW4]
          Length = 570

 Score = 68.2 bits (165), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 62/103 (60%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K LD+++G V++ LE+ G L N++VVF +DNG  T+ +    G T    P +G K +  
Sbjct: 305 MKQLDDNIGYVLKKLEDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R    + W  +++ G VL ++    DW+PTL  IAG +K
Sbjct: 360 EGGMRAPLVIRWPGVIKPGTVLTDIFASLDWVPTLVNIAGGKK 402


>gi|336413800|ref|ZP_08594149.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
           3_8_47FAA]
 gi|335934817|gb|EGM96800.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
           3_8_47FAA]
          Length = 465

 Score = 68.2 bits (165), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 66/109 (60%), Gaps = 4/109 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D++VG++IQ L+E+G+L N++++F SDNG   V   H      SN P RG K   
Sbjct: 258 MVSEIDKNVGQIIQLLKEKGILENTIIIFSSDNGVHCVGG-HEPDFFDSNGPYRGYKRDL 316

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
           +EGGVR    + W  +++K + +E++    D+LPT+  + G +  K ID
Sbjct: 317 YEGGVRAPFIVSWPKMIKKKRTVEHITTFWDFLPTVTELVGADMVKGID 365


>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
 gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
          Length = 491

 Score = 68.2 bits (165), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 45/116 (38%), Positives = 61/116 (52%), Gaps = 11/116 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
           M+ ++D+ +G V+ +L   G+  N+L+VF SDNGG         G       ASN P R 
Sbjct: 261 MITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRD 320

Query: 56  TKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
            K S +EGG R VA A W   +  G   E +MH+ D LPTL  +AG    K K +D
Sbjct: 321 GKGSLYEGGTRVVALANWPGRIAPGAA-EGVMHVVDMLPTLAKLAGASLAKSKPLD 375


>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
 gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
          Length = 440

 Score = 68.2 bits (165), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 11/107 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++++LD++VG+V+Q+L++ G   N++V+F SDNGG            +  WP  G K   
Sbjct: 229 IVEALDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L WS  +R  QV E +    DWLPTL   AG   + D
Sbjct: 280 LEGGIRVPTLLRWSARIRP-QVQEQVTASFDWLPTLLAAAGARPHPD 325


>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
          Length = 609

 Score = 68.2 bits (165), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 37/100 (37%), Positives = 58/100 (58%), Gaps = 17/100 (17%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D ++ +V+++ ++     +++++F SDNGG      +  G  ASNWPLRG K S 
Sbjct: 321 MVTAMDTAIYKVVKAYKKFKFWDDTILIFSSDNGG------NLKGG-ASNWPLRGAKGSL 373

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +  + SPLL           +TDW PTL  +AG
Sbjct: 374 FEGGVRTIGFVHSPLLP----------VTDWFPTLLQLAG 403


>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score = 68.2 bits (165), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 3/101 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LDE+ G VI+ L   GL   +L++F SDNGGPT+     +G  + N PLRG+K + 
Sbjct: 251 MLSALDEAAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTTNG--SKNDPLRGSKRTT 308

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    L W   L  G+V +  +   D LPT    AG
Sbjct: 309 LEGGVRVPFVLSWKGTLPAGKVYDQPIIQLDILPTALAAAG 349


>gi|406833200|ref|ZP_11092794.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 476

 Score = 68.2 bits (165), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 41/107 (38%), Positives = 56/107 (52%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE VGR++  L   GL  N+LV+F SDN GP  +S H     + + PLRG K S 
Sbjct: 268 MISRLDEYVGRMLDVLRANGLAENTLVIFTSDN-GPHQESNHDTARFSPSGPLRGMKRSL 326

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W   +  G V  ++ +  DW+ T   +AG     D
Sbjct: 327 TEGGIRVPFIAWWPGRISAGSVSNHVGYFGDWMATAAELAGTAPTAD 373


>gi|323455793|gb|EGB11661.1| hypothetical protein AURANDRAFT_70842 [Aureococcus anophagefferens]
          Length = 1489

 Score = 68.2 bits (165), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 35/94 (37%), Positives = 55/94 (58%), Gaps = 10/94 (10%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD ++ +V+ ++++ G   NS++V  SDNGG +       G+   N+P RG K   +EGG
Sbjct: 308 LDNAMKKVVGAMKKSGFYDNSILVVASDNGGCS-------GSGGYNYPYRGGKQYLYEGG 360

Query: 65  VRNVAALWSPLLRK---GQVLENLMHITDWLPTL 95
           +   A + SPL+ K   G   + L H++DWLPTL
Sbjct: 361 IHVNAFIHSPLIPKSARGTSYQGLFHVSDWLPTL 394


>gi|302535739|ref|ZP_07288081.1| arylsulfatase [Streptomyces sp. C]
 gi|302444634|gb|EFL16450.1| arylsulfatase [Streptomyces sp. C]
          Length = 495

 Score = 67.8 bits (164), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 43/102 (42%), Positives = 53/102 (51%), Gaps = 12/102 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+Q+L   G   N+LVVF SDNGG            + NWPL G K S 
Sbjct: 308 MVEDLDRSVGEVLQALRRSGQEENTLVVFSSDNGGERF---------SYNWPLSGNKASL 358

Query: 61  HEGGVR--NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  N+   W   L  GQV    +   DW  TL  +AG
Sbjct: 359 QEGGIRVPNIVR-WPARLDGGQVSRVPVFSPDWTATLLEVAG 399


>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 480

 Score = 67.8 bits (164), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 54/95 (56%), Gaps = 4/95 (4%)

Query: 9   VGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNV 68
           +G  + SL +     + L++  SDNGG          N   NWPLRG K +  EGGVR  
Sbjct: 222 LGLALVSLVKTKNYVDILLMGFSDNGG---SPFGLFSNRGFNWPLRGAKANLWEGGVRVP 278

Query: 69  AALWS-PLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           A +WS   L+K +V + LMHI+DWLPTLY  AG +
Sbjct: 279 AFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAGGD 313


>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
          Length = 459

 Score = 67.8 bits (164), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 43/108 (39%), Positives = 63/108 (58%), Gaps = 10/108 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD SVGRV+++L+  GL  N+LVVF+SDNGG    +L +     SN PLRG K   +
Sbjct: 263 VEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGG----ALWY---AQSNGPLRGGKQDMY 315

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
           EGG+R  A   W   +  G   +N   + D  PT   +AG++  +N+D
Sbjct: 316 EGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPPENVD 363


>gi|284035516|ref|YP_003385446.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283814809|gb|ADB36647.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 522

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 62/104 (59%), Gaps = 1/104 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G V+  L+E  L GN++++F+SDNGG ++  L        N PLR  K S 
Sbjct: 290 LVEGMDKSLGDVLAFLDERKLTGNTVIIFMSDNGGLSMSPLRGGQAHTHNLPLRVGKGSV 349

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           +EGG+R  +   W  +++ G + E  + I D+ PT+  IAG  K
Sbjct: 350 YEGGIREPMLVRWPGVVKAGSLTEQPVIIEDFFPTILAIAGGPK 393


>gi|343086462|ref|YP_004775757.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354996|gb|AEL27526.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 472

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 61/104 (58%), Gaps = 9/104 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD ++GR+I  L+E+ L  N+ + F++DNGG     L+         PLR  K S+
Sbjct: 251 MVENLDRNIGRLINLLQEKKLFENTFIFFVTDNGG-----LY---KVTKQRPLRAGKGSY 302

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           +EGG+R  A A+W   +R GQV E  +   D  PTL  +AG EK
Sbjct: 303 YEGGIRIPAFAVWPGEIRPGQVSEVPITNLDIFPTLLELAGIEK 346


>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
 gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
          Length = 440

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 39/107 (36%), Positives = 60/107 (56%), Gaps = 11/107 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++++LD++VG+V+Q+L++ G   N++V+F SDNGG            +  WP  G K   
Sbjct: 229 IVEALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L WS  ++  QV E +    DWLPTL   AG  ++ D
Sbjct: 280 LEGGIRVPTLLRWSARIQP-QVQEQITASFDWLPTLLAAAGARQHPD 325


>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 508

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 6/101 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML+++D SVGR+++ LEEEGL  N+++VF SDNGG       + G    N P RG K + 
Sbjct: 285 MLRAVDRSVGRIMEKLEEEGLADNTIIVFSSDNGGAG-----YIGIPEVNAPYRGWKITL 339

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  +   W   ++ G V++  +   D +PTL    G
Sbjct: 340 FEGGIRVPLFMKWPGTIQPGTVIDEPVAHIDLMPTLAAATG 380


>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 454

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 57/96 (59%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D+++GRV  +LE +G+L N+L+VF SDNGG          N + N PL+G K  F
Sbjct: 240 MLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKP------DNNYSLNHPLKGQKTQF 293

Query: 61  HEGGVRNVAALWSP--LLRKGQVLENLMHITDWLPT 94
           +EGG+R  A +  P   +  G+ L+  + + D  P+
Sbjct: 294 YEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPS 329


>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
 gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
          Length = 464

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 55/107 (51%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD  +G+V+ +L   G   N+LVVF SDNGG            + NWPL G K S 
Sbjct: 280 MVENLDARIGQVLNALRRSGQEENTLVVFASDNGGERF---------SYNWPLTGNKGSL 330

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L W   +R GQV    +   DW  TL  IAG   + D
Sbjct: 331 SEGGIRVPTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAPSPD 377


>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 534

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 58/113 (51%), Gaps = 21/113 (18%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFIS-----------DNGGPTVDSLHFHGNTAS 49
           M   +DE++G V  +L+E+GL  N++ +F S           DNG    D         S
Sbjct: 284 MASCMDEAIGNVTATLKEKGLWNNTVFIFSSARLNFKHDFCLDNGATPTDG-------GS 336

Query: 50  NWPLRGTKYSFHEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIA 99
           NWPLRG K  ++EGG+R V  + S LL     G     L+H+TDW PTL  +A
Sbjct: 337 NWPLRGRKGLWYEGGIRAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLA 389


>gi|149200548|ref|ZP_01877558.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
 gi|149136366|gb|EDM24809.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
          Length = 461

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 43/109 (39%), Positives = 60/109 (55%), Gaps = 12/109 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML+SLDE++GR+++ LEE  L  N+L++F +DNGG            + NWPLR  K S+
Sbjct: 242 MLESLDENIGRLVKVLEEFKLDKNTLIIFTTDNGGVW--------KFSKNWPLRAGKGSY 293

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFI---AGKEKNI 105
            EGG+R    A W   ++ G   +  +   D  PTL  I   A KEK I
Sbjct: 294 FEGGIRVPTFAYWPGKIKAGSSSDIPITALDMYPTLLDIAESAKKEKKI 342


>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
 gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
          Length = 500

 Score = 67.8 bits (164), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D ++GR+++ L+   L  N+++VF SDNGG +          ASN PLRG K + 
Sbjct: 278 MLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTS-------NFGASNAPLRGEKDTI 330

Query: 61  HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGG +    + SP+ + +G   E + H+ DW  T+  I G E  IDS
Sbjct: 331 WEGGTKTTTFVHSPMYIEEGGTREMMFHVVDWHATILSITGLE--IDS 376


>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score = 67.4 bits (163), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 40/103 (38%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D+++G+V+  + E G   N+LV FISDNGGPT       G T+ N PLRG K + 
Sbjct: 251 MMSAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPT------QGTTSQNGPLRGFKMTT 304

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG R      W   L  G+  +N +   D LPT+   AG +
Sbjct: 305 FEGGTRVPFLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAGSK 347


>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
 gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
          Length = 440

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 39/107 (36%), Positives = 60/107 (56%), Gaps = 11/107 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++++LD++VG+V+Q+L++ G   N++V+F SDNGG            +  WP  G K   
Sbjct: 229 IVEALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L WS  ++  QV E +    DWLPTL   AG  ++ D
Sbjct: 280 LEGGIRVPTLLRWSARIQP-QVQEQVTASFDWLPTLLAAAGARQHPD 325


>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 478

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 49/85 (57%), Gaps = 2/85 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-FHGNTASNWPLRGTKYS 59
           M+  +D  +G V+ +L+  GL  N+L+ F+SDNGGP     H   GN +SN P RG K  
Sbjct: 242 MVDVMDRGIGEVVTALDRNGLRENTLIFFLSDNGGPVSSKKHPGKGNGSSNGPFRGGKTD 301

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLE 83
           F+EGGV     A W   ++ GQV +
Sbjct: 302 FYEGGVHVPFIASWPAKIQPGQVYD 326


>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
          Length = 515

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 61/109 (55%), Gaps = 7/109 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +D+++G++  +L E  +  ++LVV+ISDNGG +      + N  +N+PLRG K++ 
Sbjct: 255 MSSVMDDAIGKITTALWETTMWEDTLVVYISDNGGAS----GINANNGNNYPLRGGKHTD 310

Query: 61  HEGGVRNVAAL---WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGGVR  A +   + P  ++G   +  MH  DW  TL  +AG     D
Sbjct: 311 FEGGVRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGASPADD 359


>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G +++ L++ G+  N+L+V  +DNGG             SN+PL+G+K +F
Sbjct: 234 MVSVADNVTGHLVKLLQDNGMWDNTLLVISADNGGAPCG--------GSNYPLKGSKGTF 285

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR++A +   LL   RKG+  +  +HI DW  T   +AG
Sbjct: 286 YEGGVRSLAFVNGGLLPTNRKGKSTQGFIHIADWYTTFCQLAG 328


>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
          Length = 478

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D  +GRV+ +L+  G   N++VVF SDNGG            A  WP  G K   
Sbjct: 262 MIEQMDLQIGRVLGALDANGATDNTIVVFTSDNGGERF---------ADTWPFTGRKTEL 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A + W   +R+G   + +    DWLPTL   AG
Sbjct: 313 LEGGLRIPALISWPARIRQGATTDQVAMSMDWLPTLLAAAG 353


>gi|398872055|ref|ZP_10627361.1| arylsulfatase A family protein [Pseudomonas sp. GM74]
 gi|398204372|gb|EJM91174.1| arylsulfatase A family protein [Pseudomonas sp. GM74]
          Length = 570

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 62/103 (60%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K LD+++G V++ L++ G L N++VVF +DNG  T+ +    G T    P +G K +  
Sbjct: 305 MKQLDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R    + W  +++ G VL ++    DW+PTL  IAG +K
Sbjct: 360 EGGMRAPLVIRWPGVIKPGTVLTDIFASLDWVPTLVSIAGGKK 402


>gi|384218661|ref|YP_005609827.1| arylsulfatase protein [Bradyrhizobium japonicum USDA 6]
 gi|354957560|dbj|BAL10239.1| arylsulfatase protein [Bradyrhizobium japonicum USDA 6]
          Length = 591

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 60/103 (58%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K LD+++G V++ L++ G L N++VVF +DNG  T+ +    G T    P +G K S  
Sbjct: 325 MKQLDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLSTW 379

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R  A + W  +++ G +  ++    DWLPT   IAG  K
Sbjct: 380 EGGMRAPALVRWPGVIKPGTIKNDIFASLDWLPTFVSIAGGAK 422


>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
 gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
          Length = 553

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 43/103 (41%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD SVGRV+QSL+E GL  N+LV+F SDNG P      + G    N P RG K +F
Sbjct: 332 MIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPG-----YIGLPDVNKPYRGWKLTF 386

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R    A W   +  G      +   D  PT+   AG E
Sbjct: 387 FEGGIRVPFFAKWPARIPAGTERTTPVAHLDMFPTIVAAAGGE 429


>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
 gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
          Length = 480

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/73 (50%), Positives = 48/73 (65%), Gaps = 7/73 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML +LD  VGR+I+ L+ + L   +LVVF SDNGGPT +       T+SN PLRG K S 
Sbjct: 287 MLIALDRGVGRIIKKLDRQKLRQETLVVFFSDNGGPTAEL------TSSNAPLRGGKGSL 340

Query: 61  HEGGVRNVAALWS 73
           +EGGVR +  +WS
Sbjct: 341 YEGGVR-IPMIWS 352


>gi|325110839|ref|YP_004271907.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324971107|gb|ADY61885.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 494

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG--PTVDSLHFHGNTASNW-PLRGTK 57
           M++  D+ VGR++ S+EE GL  N+L+VF SDNGG     D      +  S+  PL+G K
Sbjct: 248 MIEHCDDMVGRIVDSVEELGLTDNTLIVFTSDNGGLISRYDYREAADDIVSDLSPLKGEK 307

Query: 58  YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            S HEGG+R  +   + PL+  G V +      D+ PT   +AG E
Sbjct: 308 GSLHEGGIRVPLIVKYPPLVNAGSVTDEPAISYDFYPTFVDLAGGE 353


>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
 gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
          Length = 457

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 60/95 (63%), Gaps = 7/95 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD ++GRV ++LEE G L N+L++F SDNGG         G  A+N P+RG+K   
Sbjct: 257 LIEHLDYNIGRVYKALEESGQLDNTLIIFCSDNGGDA-------GAEANNGPVRGSKGDM 309

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
           +EGG++   A++     K + +++L+ ++D  PTL
Sbjct: 310 YEGGIKVPCAVYWRNHLKPRSIDDLVIMSDIFPTL 344


>gi|167515780|ref|XP_001742231.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778855|gb|EDQ92469.1| predicted protein [Monosiga brevicollis MX1]
          Length = 241

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 59/107 (55%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+ VGR+  +L   G L N+++VF+SDNGG  VD L             G K+  
Sbjct: 121 MVAALDDQVGRLYDTLNATGQLDNTVIVFLSDNGG-WVDRLDGLACLLCLPLSAGWKHWV 179

Query: 61  HEGGVRNVAALWS-PLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGGVR+ A +WS  L   G V   L H  DWLPTL  +AG + + +
Sbjct: 180 FEGGVRSAAFVWSRQLTNPGSVHHGLFHSVDWLPTLARLAGADTSAN 226


>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
 gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
          Length = 491

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 60/101 (59%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+++G + ++ ++ GL  N++++F +DNGG          N   ++PLRG K + 
Sbjct: 239 MVTIMDDALGNLTRAFDKAGLWENTILIFSTDNGG-------VPKNGGYDYPLRGRKDTL 291

Query: 61  HEGGVRNVAALWSPLLRKGQV-LENLMHITDWLPTLYFIAG 100
            EGGVR VA +    L +  V  + LMH+TDW PTL  +AG
Sbjct: 292 WEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAG 332


>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
           sphaeroides KD131]
 gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
           KD131]
          Length = 509

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 42/108 (38%), Positives = 58/108 (53%), Gaps = 7/108 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-----NWPLRG 55
           M+  +D+ VG V+Q+LE  G+  ++LV+F SDNGG         G  A      N PLR 
Sbjct: 276 MISCMDDQVGLVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLRE 335

Query: 56  TKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            K + +EGG R VA A W   +  G+    +MH+ D LPTL  +A  E
Sbjct: 336 GKGTLYEGGTRVVALANWPGRIPAGET-HGMMHVVDMLPTLAGLAQAE 382


>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
          Length = 544

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD SVG++I+ L+E  + G +L++F SDNGG     L+       N P RG K SF
Sbjct: 324 MINSLDRSVGKIIEKLKELDIYGKTLIIFTSDNGGANYIELN-----DINKPYRGWKISF 378

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            +GG+R    + W   +  G+  EN +H  D  PT+   AG E
Sbjct: 379 FDGGIRVPYIISWPDEINPGKKSENAVHHFDIFPTILKAAGIE 421


>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
          Length = 637

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 36/78 (46%), Positives = 46/78 (58%), Gaps = 7/78 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  LDE+V  V  +L+  G   NS++V+ SDNGG             SNWPLRG K ++
Sbjct: 272 MLSCLDEAVANVTLALKAHGFYDNSVIVYSSDNGGQPAAG-------GSNWPLRGRKGTY 324

Query: 61  HEGGVRNVAALWSPLLRK 78
            EGGVR V  + SPLLR+
Sbjct: 325 WEGGVRAVGFVHSPLLRQ 342


>gi|430741911|ref|YP_007201040.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013631|gb|AGA25345.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 469

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 44/111 (39%), Positives = 61/111 (54%), Gaps = 5/111 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
           ML S+DESVGRV+++L+E GL  ++LV+F SDNGG      + +   G    N PLRG K
Sbjct: 249 MLASVDESVGRVVKTLDELGLSDDTLVIFSSDNGGVGGYEREGIGKAGAVTDNAPLRGGK 308

Query: 58  YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA-GKEKNID 106
              +EGG+R      W   +  G V +  ++  D  PTL  +A GK    D
Sbjct: 309 GMLYEGGIRAPYLFRWPGKIPAGTVCDRAINSVDLYPTLVELAEGKAPESD 359


>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
          Length = 534

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 62/107 (57%), Gaps = 8/107 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD +V  +  +L  + +  + ++V+  DNGGP ++  H++G       LRG K+++
Sbjct: 279 MVTALDAAVANMTTALASKNM-ADFVLVYSGDNGGPEIEG-HWNGG------LRGGKWTW 330

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGGVR  A + SPLL +       +H+ DWLPT   +AG  ++ D+
Sbjct: 331 FEGGVRPAAFVHSPLLTRRGWHNGTLHLVDWLPTFLALAGAARDDDA 377


>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
 gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
          Length = 498

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 42/108 (38%), Positives = 58/108 (53%), Gaps = 7/108 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-----NWPLRG 55
           M+  +D+ VG V+Q+LE  G+  ++LV+F SDNGG         G  A      N PLR 
Sbjct: 265 MISCMDDQVGPVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLRE 324

Query: 56  TKYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            K + +EGG R VA A W   +  G+    +MH+ D LPTL  +A  E
Sbjct: 325 GKGTLYEGGTRVVALANWPGRIPAGET-HGMMHVVDMLPTLAGLAQAE 371


>gi|171915961|ref|ZP_02931431.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Verrucomicrobium
           spinosum DSM 4136]
          Length = 527

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 59/96 (61%), Gaps = 5/96 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D++VG+V+  L+E GL  N++VVF SDNGG +       G+  +N P RG K   
Sbjct: 292 MVEAMDQAVGKVLAKLDELGLKENTIVVFTSDNGGLSTS----EGSPTANLPWRGGKGWM 347

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  +++ G V+   +  TD+ PTL
Sbjct: 348 YEGGIREALLVRWPGVVQPGSVIHEPVIGTDFYPTL 383


>gi|346992020|ref|ZP_08860092.1| arylsulfatase [Ruegeria sp. TW15]
          Length = 522

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 40/111 (36%), Positives = 59/111 (53%), Gaps = 16/111 (14%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD-----SLHFHGNTASNWPLRG 55
           ++K LD++VG+V   LEE G+  N++V+F SDNG  T+      +  FHG   + W    
Sbjct: 277 VMKELDDNVGKVTAHLEELGIADNTIVIFTSDNGPETMTWPDGGTTPFHGEKGTTW---- 332

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
                 EGG R  A + W   + +GQVL  +    DW+PTL   AG  +N+
Sbjct: 333 ------EGGFRVPAIIRWPGKVAEGQVLNGIFDGMDWMPTLVAAAGGSQNL 377


>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 498

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 54/103 (52%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD  VGR+ + LEE G  GN+L+VF SDNGG T        N + N PLRG K S 
Sbjct: 257 MMYALDRGVGRIREHLEESGQWGNTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  +   W      G   + ++   D LPT    AG E
Sbjct: 310 REGGIRVPMIWTWPAKFPAGVRYDGVVSSLDLLPTFCSAAGAE 352


>gi|255014753|ref|ZP_05286879.1| arylsulfatase A [Bacteroides sp. 2_1_7]
 gi|410102838|ref|ZP_11297763.1| hypothetical protein HMPREF0999_01535 [Parabacteroides sp. D25]
 gi|409237965|gb|EKN30760.1| hypothetical protein HMPREF0999_01535 [Parabacteroides sp. D25]
          Length = 468

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+LV+F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGADE----HFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           GG R    + W   +  GQ  +NL+   D+LPTL    G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337


>gi|423330334|ref|ZP_17308118.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
           CL03T12C09]
 gi|409231950|gb|EKN24798.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
           CL03T12C09]
          Length = 468

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+LV+F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGADE----HFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           GG R    + W   +  GQ  +NL+   D+LPTL    G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337


>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 536

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 8/94 (8%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD +V R ++  +  G   +++  F+SDNG     S        SNWPLRG+K++ +EGG
Sbjct: 265 LDRAVERFVEFTKARGAYDDAVFFFMSDNGATLAQS-----GGGSNWPLRGSKFTAYEGG 319

Query: 65  VRNVAALWS---PLLRKGQVLENLMHITDWLPTL 95
           VR  A L S   P  R+G     L H+ D LPTL
Sbjct: 320 VRVPAFLHSARVPPARRGAAHAGLFHVVDVLPTL 353


>gi|149198310|ref|ZP_01875356.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149138606|gb|EDM27013.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 578

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 34/96 (35%), Positives = 56/96 (58%), Gaps = 1/96 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD++VG +++ LE  GL+ N++++F SDNGG     L       SN+PL G K S 
Sbjct: 260 MVQSLDDAVGALLEGLESRGLMDNTVIIFTSDNGGNIYSQLDEGIVPTSNFPLSGGKASM 319

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGGVR     +W  + + G   + ++  +D+  T+
Sbjct: 320 CEGGVRVPCTVVWPGVTKAGSRSDEIVQTSDFYTTI 355


>gi|323456611|gb|EGB12478.1| hypothetical protein AURANDRAFT_5553, partial [Aureococcus
           anophagefferens]
          Length = 190

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/69 (49%), Positives = 45/69 (65%), Gaps = 4/69 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +D+SVGRV  +L   G+  +SLVVF +DNGG    ++ F  ++ +NWPLRG KYS 
Sbjct: 123 MASFVDDSVGRVEAALRARGMWRDSLVVFATDNGG----AVGFDADSGNNWPLRGGKYSD 178

Query: 61  HEGGVRNVA 69
            EGGVR V 
Sbjct: 179 FEGGVRGVC 187


>gi|119505846|ref|ZP_01627912.1| sulfatase family protein [marine gamma proteobacterium HTCC2080]
 gi|119458344|gb|EAW39453.1| sulfatase family protein [marine gamma proteobacterium HTCC2080]
          Length = 558

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 60/105 (57%), Gaps = 8/105 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D SVGR++ +LEEEG+  N++VVF SDNGG       + G    N P RG K + 
Sbjct: 332 MIRAVDRSVGRILDTLEEEGIANNTVVVFTSDNGG-----AGYIGIPEVNSPFRGFKITM 386

Query: 61  HEGGVR-NVAALWSPLLRKG-QVLENLMHITDWLPTLYFIAGKEK 103
            EGG+R  +   W   +  G  V E + HI D +PTL   AG  +
Sbjct: 387 FEGGLRVPLFVRWPAKIAPGISVNEPVAHI-DVMPTLAAAAGASE 430


>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 453

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD ++G+VIQSLEE G L N+LV+F SDNGG         G+ A+N P RG K   
Sbjct: 251 LIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGD-------RGSMANNGPTRGAKGDM 303

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
            EGG+    AL  P + +G   +N  + + D +PT+
Sbjct: 304 FEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTI 339


>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 514

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 40/107 (37%), Positives = 62/107 (57%), Gaps = 8/107 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D+ VG + Q+L+++G+  ++L VF+SDNGG  +    F G   SN+P RG K + 
Sbjct: 260 MMAAMDQGVGNITQALKDKGMWDDTLFVFLSDNGGDKM----FAG---SNYPYRGNKATM 312

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGGV+  A +   +L+  G V   L H TD   TL  +AG   + D
Sbjct: 313 WEGGVKVPAFIRGSMLKTSGTVNNELYHFTDIFATLLKVAGGTPDED 359


>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 470

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 58/103 (56%), Gaps = 7/103 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD +VGR+++++EE  LL N++ VFISDNGGP +       N + N P  G K + 
Sbjct: 255 MVHRLDINVGRIVEAVEEADLLENTIFVFISDNGGPVMT------NGSINAPYNGKKGTL 308

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R    + W   L  G+V E+ +   D  PT   +AG E
Sbjct: 309 LEGGIRVPFFISWPGHLEPGKVFEHPVSSLDLTPTFVALAGGE 351


>gi|255690671|ref|ZP_05414346.1| twin-arginine translocation pathway signal [Bacteroides finegoldii
           DSM 17565]
 gi|260623695|gb|EEX46566.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 453

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD ++G+VIQSLEE G L N+LV+F SDNGG         G+ A+N P RG K   
Sbjct: 251 LIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGD-------RGSMANNGPTRGAKGDM 303

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
            EGG+    AL  P + +G   +N  + + D +PT+
Sbjct: 304 FEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTI 339


>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
 gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
          Length = 488

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 63/108 (58%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D ++GR+++ L+   L  N+++VF SDNGG    + +F    ASN PLRG K + 
Sbjct: 266 MLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGG----TANF---GASNAPLRGEKDTI 318

Query: 61  HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGG +    + SP+ + +G   + + H+ DW  T+  I G E  IDS
Sbjct: 319 WEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLE--IDS 364


>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 481

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 4/102 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG+++ +L+EEG+  N+LVVF SDNG   V     HG TA    LRG K   
Sbjct: 257 VIEEIDWSVGQILSTLKEEGIAENTLVVFTSDNGPWHV--FKTHGGTAG--LLRGAKGGT 312

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R     W P   K  V+ ++    D LPT   I+G E
Sbjct: 313 FEGGMREPTVFWWPAQIKPGVVMDMGTTMDLLPTFCAISGTE 354


>gi|374619517|ref|ZP_09692051.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302744|gb|EHQ56928.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 556

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 55/96 (57%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD SVGR+ ++LEE G+  N+LVVF +DNGGP      + G    N P RG K S 
Sbjct: 333 MIRSLDRSVGRINKALEELGIADNTLVVFTNDNGGPG-----YIGLPDINKPFRGWKISQ 387

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R  +   W   +  G V E  +   D +PTL
Sbjct: 388 FEGGIRVPLMMKWPARISPGTVSEEPVAHIDVMPTL 423


>gi|27379852|ref|NP_771381.1| arylsulfatase [Bradyrhizobium japonicum USDA 110]
 gi|27353005|dbj|BAC50006.1| bll4741 [Bradyrhizobium japonicum USDA 110]
          Length = 556

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ L++ G L N++VVF +DNG  T     F        P +G+K +  
Sbjct: 290 MKQMDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAET-----FTYPDGGTTPFKGSKMNTW 344

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R    + W  +++ G V  ++    DWLPTL  IAG  K
Sbjct: 345 EGGMRAPCVIRWPGVIKPGTVKNDIFASLDWLPTLVNIAGGAK 387


>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
 gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
          Length = 463

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD+S+G+VI++LE  G L +++V F SDNGG    ++         WP  G K   
Sbjct: 278 MVEDLDQSIGQVIKALERTGQLRDTVVFFASDNGGERFSNV---------WPFTGAKSGL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           +EGG+R    L W  +LR  QV +  +  TDW  T   +AG     D
Sbjct: 329 NEGGIRVPTILSWPGILRGRQVSDEPVVTTDWTATFLELAGATPAED 375


>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
 gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
          Length = 452

 Score = 66.2 bits (160), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 63/108 (58%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D ++GR+++ L+   L  N+++VF SDNGG    + +F    ASN PLRG K + 
Sbjct: 230 MLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGG----TANF---GASNAPLRGEKDTI 282

Query: 61  HEGGVRNVAALWSPL-LRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGG +    + SP+ + +G   + + H+ DW  T+  I G E  IDS
Sbjct: 283 WEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLE--IDS 328


>gi|326431402|gb|EGD76972.1| hypothetical protein PTSG_07315 [Salpingoeca sp. ATCC 50818]
          Length = 439

 Score = 66.2 bits (160), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 17/117 (14%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-------------PTVDSLHFHGNT 47
           M+  LD+++  +   L+ +G+  N+L+V  SDNGG             P+   + ++G  
Sbjct: 168 MVNFLDDNLANITALLKAKGMWENTLMVLTSDNGGYVNPLHGDCDLSDPSRGYMCYNGEA 227

Query: 48  -ASNWPLRGTKYSFHEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            A+N+PLRG KYSF EGG+R    V+  + P   +G + + ++HI DW  T   +AG
Sbjct: 228 GANNYPLRGGKYSFFEGGIRANAFVSGGFVPKHLRGTINDGIVHIADWYATFAALAG 284


>gi|399037695|ref|ZP_10734370.1| arylsulfatase A family protein [Rhizobium sp. CF122]
 gi|398064717|gb|EJL56393.1| arylsulfatase A family protein [Rhizobium sp. CF122]
          Length = 568

 Score = 66.2 bits (160), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ LEE G L N++VVF +DNG  T+ +    G T    P +G K S  
Sbjct: 305 MKQMDDNIGYVLKRLEEMGELDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLSTW 359

Query: 62  EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R  +   W   +  G V  ++    DWLPT   IAG  K
Sbjct: 360 EGGMRAPLVVRWPGHIEPGTVKNDMFSALDWLPTFVNIAGGPK 402


>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 495

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G +++ L+++GL  N+++V  +DNGG      H  G   SN+PLRG K + 
Sbjct: 236 MVSVADNVTGHLVELLKKKGLWDNTIMVISADNGGA-----HCSG---SNYPLRGCKETL 287

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR++A +   LL   R+GQ  +  +HI DW  T   +AG
Sbjct: 288 FEGGVRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAG 330


>gi|224537344|ref|ZP_03677883.1| hypothetical protein BACCELL_02222 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521037|gb|EEF90142.1| hypothetical protein BACCELL_02222 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 500

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 59/101 (58%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+DE++GR+I  L+  GL  N++++F SDNGG +       G+  +N PLRG K   
Sbjct: 291 MIASMDENIGRIIDELKRLGLYENTIIIFTSDNGGLSTS----EGSPTANLPLRGGKGFT 346

Query: 61  HEGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P +   G   ++++  TD+ PT+  + G
Sbjct: 347 YEGGIREPLIVRIPHMENAGNRCDSMVISTDYYPTILEMTG 387


>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 455

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 55/95 (57%), Gaps = 3/95 (3%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + +LDE +G V+ +LEE G   N++VVF SD+G  T +   F G +A   P RG K    
Sbjct: 249 VSTLDEKLGEVLATLEETGQRENTIVVFQSDHGHSTEERAFFGGGSAG--PYRGAKGCLF 306

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           EGG+R  + + W   L +G+V + +    DWLPTL
Sbjct: 307 EGGIRVPSVISWPAGLPQGEVRDQMAVGCDWLPTL 341


>gi|256841081|ref|ZP_05546588.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|256736924|gb|EEU50251.1| arylsulfatase A [Parabacteroides sp. D13]
          Length = 468

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+LV+F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           GG R    + W   +  GQ  +NL+   D+LPTL    G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337


>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
 gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
          Length = 452

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 3/95 (3%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           + ++DE +G+V+Q L+  GL  N++VVF SD G  T D     G      P RG K+S  
Sbjct: 247 VSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTED--RSFGGGGFTGPYRGAKFSLF 304

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           EGG+R  A + W+  L K +V + L    DW PTL
Sbjct: 305 EGGIRVPAIIRWTGHLPKNEVRDQLCVNIDWYPTL 339


>gi|398866290|ref|ZP_10621789.1| arylsulfatase A family protein [Pseudomonas sp. GM78]
 gi|398241341|gb|EJN26995.1| arylsulfatase A family protein [Pseudomonas sp. GM78]
          Length = 570

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 61/103 (59%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K LD+++G V++ L++ G L N++VVF +DNG  T+ +    G T    P +G K +  
Sbjct: 305 MKQLDDNIGYVLKKLQDMGQLDNTIVVFTTDNGAETI-TFPDGGTT----PFKGGKLTTW 359

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R    + W   ++ G VL ++    DW+PTL  IAG +K
Sbjct: 360 EGGMRAPLVIRWPGKIKPGTVLTDIFASLDWVPTLVNIAGGKK 402


>gi|301311963|ref|ZP_07217885.1| sulfatase family protein [Bacteroides sp. 20_3]
 gi|423339481|ref|ZP_17317222.1| hypothetical protein HMPREF1059_03147 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830065|gb|EFK60713.1| sulfatase family protein [Bacteroides sp. 20_3]
 gi|409230862|gb|EKN23723.1| hypothetical protein HMPREF1059_03147 [Parabacteroides distasonis
           CL09T03C24]
          Length = 468

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+LV+F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           GG R    + W   +  GQ  +NL+   D+LPTL    G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337


>gi|262383797|ref|ZP_06076933.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294695|gb|EEY82627.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 468

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+LV+F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLVIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           GG R    + W   +  GQ  +NL+   D+LPTL    G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337


>gi|170727985|ref|YP_001762011.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169813332|gb|ACA87916.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 489

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 15/104 (14%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKY-- 58
           M++SLD SVGR++Q+LE+ G   N++V+F SD GG      +F     +N PLRG K   
Sbjct: 292 MVESLDTSVGRILQALEDSGQADNTVVIFYSDQGG------YF-----TNAPLRGGKIGG 340

Query: 59  -SFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            + +EGG R  +   W  +    Q+ E L+  TD LPT   IAG
Sbjct: 341 RALYEGGARVPLIVRWPGVTPNSQISEQLVLSTDILPTFVDIAG 384


>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
 gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
          Length = 502

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 42/110 (38%), Positives = 61/110 (55%), Gaps = 11/110 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
           M+  +D+ +GRV+++LE  G+  N+L+VF SDNGG  VD+  F G +       A N P 
Sbjct: 273 MISVMDDGIGRVVEALEMRGMRENTLIVFHSDNGG-VVDAF-FAGESNVKGKLPADNGPY 330

Query: 54  RGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           R  K + +EGG R V  + W   +  G     +MH+ D  PTL  +AG E
Sbjct: 331 REGKGTTYEGGTRVVGLVNWPGKIEPGTA-NGMMHVVDMYPTLAALAGAE 379


>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
 gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
          Length = 520

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD  VGR+ + LEE G   N+L+VF SDNGG T        N + N PLRG K S 
Sbjct: 279 MMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 331

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  +   W      G + + ++   D LPT    AG E
Sbjct: 332 REGGIRVPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAAGAE 374


>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
          Length = 1941

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 8/106 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L  +D++VG VI +L+  G+   +L++FISDNG        F     +N PLRG   +  
Sbjct: 266 LSLMDDAVGHVIDALKSRGMWEETLLIFISDNGA-------FDFMEGTNLPLRGAGTTLF 318

Query: 62  EGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGKEKNID 106
           EGG+R  A     +++K G +   L HITDW  T+  +AG +   D
Sbjct: 319 EGGMRVPAIAHGKMIKKTGYINNELNHITDWHSTILSLAGIQPEPD 364


>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
 gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
          Length = 440

 Score = 65.5 bits (158), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 39/107 (36%), Positives = 59/107 (55%), Gaps = 11/107 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++++LD++VG+V+Q+L++ G   N++V+F SDNGG            +  WP  G K   
Sbjct: 229 IVEALDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF---------SKTWPFTGQKTEL 279

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    L WS  ++  QV E +    DWLPTL   AG   + D
Sbjct: 280 LEGGIRVPTLLRWSARIQP-QVQEQVTVSFDWLPTLLAAAGARPHPD 325


>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 508

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 37/102 (36%), Positives = 60/102 (58%), Gaps = 8/102 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG ++ +L+++GL  N L V  SDNGGP      + G  A+N+PL+G K S 
Sbjct: 260 MVNYLDDVVGELVDALKKKGLWENLLFVTSSDNGGPV-----YPGGGANNYPLKGGKASD 314

Query: 61  HEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
            +GG+R+   V+  + P   +G+ L+  +H+ DW  T   +A
Sbjct: 315 WQGGIRSNAFVSGGYLPEKMRGKKLDGYIHLADWYATFCALA 356


>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 489

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D  +GR++ +++  G+  N+LVVF SDNG    D  H      SN PLRG K++ 
Sbjct: 286 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 338

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGGVR VA + W      G   E++    D LPT+   AG +
Sbjct: 339 WEGGVRVVATMRWPREFPAGVTYESVASYVDLLPTMVAAAGGQ 381


>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 35/103 (33%), Positives = 59/103 (57%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G +++ L+++G+  N+++V  +DNGG           + SN+PL+G K +F
Sbjct: 235 MVSVADNVTGHLVELLKKKGMWDNTIMVISADNGGAPC--------SGSNYPLKGCKSTF 286

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R++A +   LL   R+GQ  +  +HI DW  T   +AG
Sbjct: 287 FEGGIRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAG 329


>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 63/104 (60%), Gaps = 7/104 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT----ASNWPLRGT 56
           ML S+D+ VGR+IQ+L++EG+  N+LV+F+SDNG P  ++LH         ASN  LRG 
Sbjct: 245 MLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAP--NNLHEAERAGDYLASNGSLRGR 302

Query: 57  KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
           K   +EGG+R    + W  ++ K    ++ +   D +PTL  I+
Sbjct: 303 KGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHIS 346


>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 512

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 17/114 (14%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LDE+V  V   L++ G+  ++L++F +DNGG        HG   +NWP RGTK + 
Sbjct: 255 MVSVLDETVRNVTNYLKDAGMWEDTLLIFSTDNGGEVK-----HG--GNNWPYRGTKNTL 307

Query: 61  HEGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTLY------FIAGKEKN 104
           +EGGV+ V  +   +L       +V   L+H+TDW PT+       +++G  +N
Sbjct: 308 YEGGVKAVGFVHGKILGDLSPAQRVNRELIHVTDWYPTIMAAADCPYVSGNSRN 361


>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 487

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE++G+V+  LE  G    +LV+FISDNGGPT+  +  +G+   N PLRG+K + 
Sbjct: 242 MMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSI--NTPLRGSKRTT 299

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGG+R    +  P      V ++ +   D   T   +AG EK++ S
Sbjct: 300 LEGGIRVPFVVSWPGKIAPAVFDSPVIQLDLTATALAVAGVEKDVKS 346


>gi|431799800|ref|YP_007226704.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430790565|gb|AGA80694.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 497

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 54/92 (58%), Gaps = 9/92 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D  VGR++++LE  G   N++V+F SDNGG        H   A N P RG+K++ +EGG
Sbjct: 280 MDHYVGRILKALERTGKSKNTIVIFTSDNGG--------HPEIADNGPFRGSKWNLYEGG 331

Query: 65  VRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +R    + W   +++G V + ++   D++PT 
Sbjct: 332 IRVPMIMSWPAHIQEGSVCDQIVSQLDFMPTF 363


>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
 gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 462

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 7/105 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D + G+V+  ++  GL  N+L+VF +DNGGP+        N +SN PL GTK + 
Sbjct: 251 MTIAMDRACGKVLDRIKALGLDKNTLIVFTNDNGGPS------DANESSNLPLSGTKANH 304

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    + W   L+   V EN +   D LPT    AG + N
Sbjct: 305 LEGGIRVPFLMSWPGKLKPNTVYENPISTLDLLPTFVTAAGGDVN 349


>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
          Length = 518

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 11/111 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LDE VG+++  L+E G+  N++ VF SDNGG    S    G + +N+PLRG K S 
Sbjct: 249 MISALDEQVGQLVDKLKEVGMWSNTVFVFYSDNGGTQPQS----GQSGNNFPLRGKKGSL 304

Query: 61  HEGGVRNVAALWSPLLRKGQV------LENLMHITDWLPTLYFIAGKEKNI 105
            EGG R V A      R G +         L HI+D   T   +AG +  +
Sbjct: 305 FEGGYR-VPAFLIARTRAGNLELIASTSSTLFHISDMFATFIALAGGDAAV 354


>gi|298375847|ref|ZP_06985803.1| sulfatase family protein [Bacteroides sp. 3_1_19]
 gi|298266884|gb|EFI08541.1| sulfatase family protein [Bacteroides sp. 3_1_19]
          Length = 468

 Score = 65.5 bits (158), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+L++F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLIIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           GG R    + W   +  GQ  +NL+   D+LPTL    G
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCG 337


>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 517

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 58/99 (58%), Gaps = 11/99 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+V  + + L+  GL  N+++VF +DNGG T+          +NWPLRG K + 
Sbjct: 246 MVAAMDEAVKNITEHLQAAGLWDNTILVFSADNGGQTLSG-------GNNWPLRGRKLTL 298

Query: 61  HEGGVRNVAALWSPLL---RKGQVLEN-LMHITDWLPTL 95
            EGG++ V  +   +L       ++ N ++HI+DW PT+
Sbjct: 299 WEGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTI 337


>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 472

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D  +GR++ +++  G+  N+LVVF SDNG    D  H      SN PLRG K++ 
Sbjct: 269 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA + W   L  G   +++    D LPT+   AG
Sbjct: 322 WEGGVRVVAMMRWPRELPAGIAYDSVTSYVDLLPTMVAAAG 362


>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 494

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G +++ L+++G+  N+++V  +DNGG      H  G   SN+PLRG K + 
Sbjct: 235 MISVADNVTGHLVELLKKKGIWDNTIMVISADNGGA-----HCSG---SNYPLRGCKETL 286

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR++A +   LL   R+GQ  +  +HI DW  T   +AG
Sbjct: 287 FEGGVRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAG 329


>gi|221113984|ref|XP_002154333.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 479

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 61/101 (60%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD +VG +++ L + G+L +++++F +  GG   ++ +      S +P RG   + 
Sbjct: 207 MIMQLDRAVGHLVRILYDTGILHDAILIFTTLTGGTKGNNFY---TWPSTYPFRGGNGTL 263

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG R +  ++S  + R+G+V   L+HI DWLPT++ +AG
Sbjct: 264 WEGGSRALGFVYSNRIARRGRVSYGLVHINDWLPTIFRLAG 304


>gi|32471071|ref|NP_864064.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32396773|emb|CAD71738.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 490

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 44/115 (38%), Positives = 62/115 (53%), Gaps = 11/115 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS--------LHFHGNTASNWP 52
           +++ +D +VGR++ SL E  L  N+ V+F SDNG   V +        L  HG +A   P
Sbjct: 227 VIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRLGDHGGSAG--P 284

Query: 53  LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           LR  K S  EGGVR  A LW+P  +  G V +++    D +PTL  +AG E   D
Sbjct: 285 LRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAEIPTD 339


>gi|375088716|ref|ZP_09735054.1| hypothetical protein HMPREF9703_01136 [Dolosigranulum pigrum ATCC
           51524]
 gi|374561681|gb|EHR33020.1| hypothetical protein HMPREF9703_01136 [Dolosigranulum pigrum ATCC
           51524]
          Length = 483

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 36/102 (35%), Positives = 54/102 (52%), Gaps = 8/102 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L  +D  +GR++  +E  G   N+L+++++DNGG          N  +N PL GTKYS +
Sbjct: 268 LDIMDREIGRILDKIEAAGEQENTLIIYLTDNGGSPC-------NYGNNTPLFGTKYSLY 320

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           EGGVR    A W  ++       NL+   D LPT  ++A  E
Sbjct: 321 EGGVRVPFIASWPGVVEANSQSRNLVSSLDLLPTFAYLASGE 362


>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
 gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
          Length = 472

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 59/101 (58%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD +VG+++ SLEE+GL  N+LVVF SDNGGPT DS     N + N P RG K   
Sbjct: 255 MVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDNGGPT-DS-----NASLNAPYRGQKGIL 308

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+     +  P LL +G + +  +   D +PT   +AG
Sbjct: 309 LEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTFLALAG 349


>gi|343085191|ref|YP_004774486.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353725|gb|AEL26255.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 478

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 3/103 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VGR++Q L+ +GL  N+ V+  SDNGG  + +L+  G   SN PLR  K   
Sbjct: 275 MVAAMDENVGRILQELKAQGLDDNTWVILTSDNGG--LSTLYGEGAPTSNGPLRAGKGWC 332

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTLYFIAGKE 102
           +EGG+R    +  P ++       L  I+ D  PTL  IAG E
Sbjct: 333 YEGGIRVPMLIKGPGIKNPGTAPELPVISMDIFPTLLSIAGAE 375


>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
          Length = 504

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 11/103 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G ++Q L+++G+  N++++  +DNGG             SN+PL+G K SF
Sbjct: 248 MVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG--------GSNYPLKGCKGSF 299

Query: 61  HEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR +A +   LL   R+G+  +  +HI DW  T   +AG
Sbjct: 300 FEGGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLAG 342


>gi|294053904|ref|YP_003547562.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613237|gb|ADE53392.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 566

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 62/103 (60%), Gaps = 7/103 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D++VGR++QSL E   L N+LV+F SDNG  +V +L    N   N P RG+K   
Sbjct: 347 MIDSVDQNVGRLMQSLVETEELDNTLVLFFSDNG--SVSTL----NEYMNAPYRGSKALV 400

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG++ +  A W   ++   +  + + I DW+PT+  +AG E
Sbjct: 401 WEGGLKTHCIAYWKGKIQPASISHDQVWIGDWMPTVLELAGVE 443


>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 472

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D  +GR++ +++  G+  N+LVVF SDNG    D  H      SN PLRG K++ 
Sbjct: 269 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA + W   L  G   +++    D LPT+   AG
Sbjct: 322 WEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPTMVAAAG 362


>gi|340618199|ref|YP_004736652.1| sulfatase [Zobellia galactanivorans]
 gi|339732996|emb|CAZ96371.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 506

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 58/99 (58%), Gaps = 8/99 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE++GR+I  L++  L  N+L++  SDNG        +HG  + N  LRG+K   
Sbjct: 306 MVQSMDENIGRIITYLKKNNLEENTLIIITSDNG--------YHGIASENLKLRGSKGEI 357

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
           +EGG++  A +  P   K +  E  + +TD+ PTL  +A
Sbjct: 358 YEGGIKVPALIHWPGKVKARRTEIPISVTDYFPTLMDVA 396


>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
 gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
          Length = 498

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 11/108 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
           M+  +D+ +GRV+++LE  G+  N+L++F SDNGG  VD+  F G +       A N P 
Sbjct: 269 MISVMDDGIGRVVEALETRGMRENTLILFHSDNGG-VVDAF-FAGESDVKGKLPADNGPY 326

Query: 54  RGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           R  K + +EGG R V  + W   +  G   + +MH+ D  PTL  +AG
Sbjct: 327 RQGKGTTYEGGTRVVGLINWPGKIEPGSA-DGMMHVVDMYPTLAALAG 373


>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG ++ +L+++G+  N L V  SDNGGP      + G  A+N+PL+G K + 
Sbjct: 258 MVNYLDDVVGDLVAALKQKGMWDNLLYVTSSDNGGPI-----YAGGGANNYPLKGGKLTD 312

Query: 61  HEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +GG+R    V+  + P   +GQ  +  +H+ DW  T   IAG
Sbjct: 313 WQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWYGTFCAIAG 355


>gi|218679088|ref|ZP_03526985.1| putative sulfatase protein [Rhizobium etli CIAT 894]
          Length = 295

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 43/109 (39%), Positives = 62/109 (56%), Gaps = 13/109 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
           M+  +D+ +GRV+++LE  G+  N+L+VF SDNGG  VD+  F G +       A N P 
Sbjct: 66  MISVMDDGIGRVVEALETRGMRENTLIVFHSDNGG-VVDAF-FAGESDVKGKLPADNGPY 123

Query: 54  RGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           R  K + +EGG R VA L  W   +  G   + +MH+ D  PTL  +AG
Sbjct: 124 RQGKGTTYEGGTR-VAGLINWPGKIVPGSA-DGMMHVVDMYPTLAALAG 170


>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 585

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 61/107 (57%), Gaps = 8/107 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++DE+VG +  +L+++G+  ++L +F+SDNGG     + + GN   N+P RG K S 
Sbjct: 331 MISAMDEAVGNITNALKDKGMWNDTLFIFLSDNGG----DMFYAGN---NYPFRGNKASL 383

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EG V+    ++  +L +KG     L+H TD   TL   AG   + D
Sbjct: 384 WEGSVKIPGFIYGNMLEKKGYSNNELIHFTDLFATLLSAAGGTPDDD 430


>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
 gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
          Length = 453

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 58/96 (60%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD+++G+V+Q+LEE G L N+LV+F SDNGG         G+ A+N P RG K   
Sbjct: 251 LIEHLDDNIGKVMQALEESGQLSNTLVIFASDNGGD-------RGSMANNGPTRGAKGDM 303

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTL 95
            EGG+    AL  P + +G   +N  + + D +PT+
Sbjct: 304 FEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTI 339


>gi|283781860|ref|YP_003372615.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283440313|gb|ADB18755.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 513

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 56/96 (58%), Gaps = 5/96 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D++VG+V+  L+E GL  N+LV+F SDNGG +       G   SN PLRG K   
Sbjct: 283 MVDAMDQAVGKVLAKLDELGLRENTLVIFTSDNGGLSTS----EGWPTSNLPLRGGKGWM 338

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   ++ G  ++  +   D++ TL
Sbjct: 339 YEGGIREPLVMRWPAKVKAGSTIDTPVSSPDFMATL 374


>gi|392410845|ref|YP_006447452.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
 gi|390623981|gb|AFM25188.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
          Length = 571

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ LE+ G L N++VVF +DNG  T  S    G T    P +G+K +  
Sbjct: 305 MKQMDDNIGLVLKKLEDMGQLDNTIVVFTTDNGAETW-SFPDGGTT----PFKGSKMNTW 359

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R    + W   ++ G V  ++    DWLPT   IAG  K
Sbjct: 360 EGGMRAPCVIRWPGHIKPGTVFNDIFASLDWLPTFVEIAGGPK 402


>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 500

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 11/107 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+   D   G V+  L+E  +  N+++V  +DNGG             SN+PL+G+K++F
Sbjct: 240 MVSVADNVTGHVVNLLKENDMWDNTIMVISADNGGADC--------KGSNYPLKGSKHTF 291

Query: 61  HEGGVRNVAALWS---PLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGGVR +A       P  R+G+  +  +H+ DW  T   +AG + N
Sbjct: 292 FEGGVRVIAFASGGLIPAARRGKSTDGFIHVADWYTTFCKMAGVDHN 338


>gi|365118348|ref|ZP_09336988.1| hypothetical protein HMPREF1033_00334 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649879|gb|EHL88976.1| hypothetical protein HMPREF1033_00334 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 508

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ S+DE+VG+V+  L++EGL  N++V  +SDNGG +       G+  SN+PLRG K   
Sbjct: 296 LIWSMDENVGKVLDYLKKEGLYDNTVVCLVSDNGGLSTA----EGSPTSNFPLRGGKGWL 351

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    + +P ++    + +  +  TD+ PT+  + G
Sbjct: 352 YEGGIRVPYIIKTPYMKGSAGICKTPVSSTDFYPTILELTG 392


>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 40/107 (37%), Positives = 60/107 (56%), Gaps = 7/107 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT----ASNWPLRGT 56
           M+ + D+ VG +  +L+ +G+  N+L+V  SDNGG     L     +    ASNWPLRG 
Sbjct: 282 MVLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGGASNWPLRGG 341

Query: 57  KYSFHEGGVRNVAAL---WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           K S  +GG+R  A +   + P  R+GQV   L+ + DW+ T   +AG
Sbjct: 342 KLSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAG 388


>gi|149196713|ref|ZP_01873767.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
 gi|149140393|gb|EDM28792.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
          Length = 483

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 53/100 (53%), Gaps = 9/100 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDESVGRV+  LEE GL  N+ + F  DNG          GN      L+G K   
Sbjct: 268 MVQSLDESVGRVLAKLEETGLRDNTYIFFTGDNGS---------GNNLYTGGLKGAKAKA 318

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           HEGG R V  +  P ++ G   +  +  TD+ PT+  +AG
Sbjct: 319 HEGGTREVYLISGPGIKAGGKSDVPVIGTDFYPTMLDLAG 358


>gi|329896998|ref|ZP_08271791.1| Arylsulfatase [gamma proteobacterium IMCC3088]
 gi|328921506|gb|EGG28891.1| Arylsulfatase [gamma proteobacterium IMCC3088]
          Length = 560

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 37/96 (38%), Positives = 56/96 (58%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD SVGRV+ +L++ GL  N+LV+F SDNGGP      + G +  N P RG K + 
Sbjct: 334 MIRALDRSVGRVLDALDQRGLSENTLVIFTSDNGGPG-----YIGLSDINKPFRGWKITQ 388

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R    L W  ++  G  +   +   D +PT+
Sbjct: 389 FEGGIRVPLFLRWPRVIESGVRINTPVAHIDMMPTI 424


>gi|402871939|ref|XP_003899903.1| PREDICTED: arylsulfatase B-like [Papio anubis]
          Length = 242

 Score = 64.7 bits (156), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 38/76 (50%), Positives = 47/76 (61%), Gaps = 8/76 (10%)

Query: 30  ISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLR-KGQVLENLMHI 88
           +SDNGG T+          +NWPLRG K+S  EGGVR V  + SPLL+ KG     L+HI
Sbjct: 7   VSDNGGQTLAG-------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHI 59

Query: 89  TDWLPTLYFIAGKEKN 104
           +DWLPTL  +AG   N
Sbjct: 60  SDWLPTLVKLAGGHTN 75


>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 486

 Score = 64.7 bits (156), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 41/109 (37%), Positives = 58/109 (53%), Gaps = 11/109 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA---------SNW 51
           M+  LDE++G+V+ +L+++GL  N+L+ F SDNGG T D + F G  A          N 
Sbjct: 242 MVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGG-TKDKM-FAGQMADMSKVVLPCDNG 299

Query: 52  PLRGTKYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           P R  K S  EGG R  A    P   K Q ++ ++H  D  PT   +AG
Sbjct: 300 PYRNGKGSLFEGGSRVCALANWPGKIKAQTVDGMIHAVDLYPTFAALAG 348


>gi|149178575|ref|ZP_01857162.1| aryl-sulphate sulphohydrolase [Planctomyces maris DSM 8797]
 gi|148842593|gb|EDL56969.1| aryl-sulphate sulphohydrolase [Planctomyces maris DSM 8797]
          Length = 467

 Score = 64.7 bits (156), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 8/66 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D+S+GRV+Q+L E+ L  N++VVF SDNGG        +G   S  PLRG+K   
Sbjct: 250 MIRSMDQSIGRVLQTLREQQLDQNTIVVFTSDNGG--------YGPATSMLPLRGSKGML 301

Query: 61  HEGGVR 66
           +EGG+R
Sbjct: 302 YEGGIR 307


>gi|421738848|ref|ZP_16177186.1| arylsulfatase A family protein [Streptomyces sp. SM8]
 gi|406692707|gb|EKC96390.1| arylsulfatase A family protein [Streptomyces sp. SM8]
          Length = 485

 Score = 64.7 bits (156), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 40/101 (39%), Positives = 54/101 (53%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG+V+++LE  G   N+LV F SDNGG            + NWPL G K S 
Sbjct: 296 MVEDLDRSVGQVLRALERSGQEENTLVFFASDNGGERF---------SYNWPLDGNKGSL 346

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  + L W   +   QV +  ++  DW  TL  I G
Sbjct: 347 KEGGIRVPSVLRWPARIDPRQVSDVPVYSPDWTATLLDIGG 387


>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 498

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 41/103 (39%), Positives = 53/103 (51%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD  VGR+ + LEE G   N+L+VF SDNGG T        N + N PLRG K S 
Sbjct: 257 MMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  +   W      G   + ++   D LPT    AG E
Sbjct: 310 REGGIRVPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAE 352


>gi|408371886|ref|ZP_11169642.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
 gi|407742679|gb|EKF54270.1| arylsulfatase A family protein [Galbibacter sp. ck-I2-15]
          Length = 480

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 9/113 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDN-----GGPTVDSLHFHGNTASNWPLRG 55
           M+ SLD  VG++++ L+ +GL  N+LV+F SDN     GG T + + F  +TA    LRG
Sbjct: 267 MVTSLDSYVGKLLEKLKSKGLEENTLVIFTSDNGTHIEGGRTKEDVKFFNSTAG---LRG 323

Query: 56  TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            K   +EGG+R    A W   + KG +        D LPT   +AG +K +++
Sbjct: 324 VKRDLYEGGIRTPFIAHWPGTIPKGSISSVTAAYWDLLPTFTELAGYKKRLET 376


>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
 gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 483

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 61/102 (59%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D +VG+++  L+E+G+  N++V+F SDNGG +       G   +N PLR  K   
Sbjct: 272 MVESMDMAVGKILGKLQEKGIADNTVVIFFSDNGGLSTSE----GIPTANTPLRAGKGWL 327

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGK 101
           +EGG+R  A + W   ++ G V + ++   D+ PT+  + G+
Sbjct: 328 YEGGIREPAIIRWPGTIKPGTVSDAVITSMDFYPTILEMTGQ 369


>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
 gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
          Length = 452

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD +VGRV ++LE+ G L N++++F SDNGG            A+N P RG K   
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   +R   V +N + ++D  PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336


>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
          Length = 456

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 58/95 (61%), Gaps = 7/95 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+ VGRVI SLE+ G L N+L++F SDNGG          + A+N P+RG K   
Sbjct: 256 LIEHMDDGVGRVINSLEKSGQLDNTLIIFCSDNGGD-------RKSEANNGPVRGDKGDM 308

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
           ++GG++   +L+     + + + NL+ ++D  PTL
Sbjct: 309 YDGGIKVACSLYWKGHLEHRRVNNLVMMSDIFPTL 343


>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
          Length = 486

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 41/108 (37%), Positives = 57/108 (52%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G +   L+E GL  N++ +F +DNGG      +  GN   N PLRG K  +
Sbjct: 214 MVSVMDEGIGNITSHLKESGLWNNTIFIFSTDNGGQA----YVGGN---NLPLRGNKAGY 266

Query: 61  HEGGVRN---VAALWSPLLRKGQVLEN--LMHITDWLPTLYFIAGKEK 103
            EGG+     VA  +    RKG  L N  L+HI+DW PT+      EK
Sbjct: 267 WEGGIHGNGFVAGGYFEDRRKGSELVNNELIHISDWYPTIMEATSCEK 314


>gi|257067328|ref|YP_003153583.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
 gi|256558146|gb|ACU83993.1| arylsulfatase A family protein [Brachybacterium faecium DSM 4810]
          Length = 480

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 40/102 (39%), Positives = 52/102 (50%), Gaps = 8/102 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L+ +D  +GR++  LE  G   N+LVV+++DNGG T        N   N PL GTKYS  
Sbjct: 272 LEIMDREIGRMLDHLEATGRRENTLVVYLTDNGGSTC-------NYGINTPLHGTKYSLF 324

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           EGGVR      W   +  G   + L+   D LPT    AG E
Sbjct: 325 EGGVRVPFMVSWPGEVPAGARCDELVSSMDLLPTFLAAAGAE 366


>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
          Length = 314

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 59/95 (62%), Gaps = 12/95 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           M++++DESV  +  + ++ GL  +++++F +DNGG P            SN+PLRG K +
Sbjct: 228 MMRAVDESVRNITATYKQLGLWNDTVLIFTTDNGGNPETG--------GSNYPLRGQKAT 279

Query: 60  FHEGGVRNVAALWSPLL---RKGQVLENLMHITDW 91
             EGG+R V  + SPLL   ++G + + L+H++DW
Sbjct: 280 TFEGGMRGVGFVNSPLLNESQRGFISDELIHVSDW 314


>gi|150008925|ref|YP_001303668.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937349|gb|ABR44046.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
          Length = 468

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 57/94 (60%), Gaps = 8/94 (8%)

Query: 3   KSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHE 62
           + LD S+G ++++L+EEGL  N+L++F SDNG       HF G   +N PLRG K + +E
Sbjct: 246 EELDWSMGVLLETLKEEGLDQNTLIIFTSDNGA----DEHFGG---TNRPLRGQKGTTYE 298

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           GG R    + W   +  GQ  +NL+   D+LPTL
Sbjct: 299 GGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTL 332


>gi|149174163|ref|ZP_01852791.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
           8797]
 gi|148847143|gb|EDL61478.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
           8797]
          Length = 470

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 8/107 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE++GRV   L+E G+   +L+VF SDNGG           T++N PLRG K   
Sbjct: 263 MIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNGG-------VRRKTSNNDPLRGEKGQH 315

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  LW  +   G V    +   D+ PT+  I G   N +
Sbjct: 316 WEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTILNITGVAGNTE 362


>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
 gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
          Length = 452

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD +VGRV ++LE+ G L N++++F SDNGG            A+N P RG K   
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   +R   V +N + ++D  PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336


>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 614

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 40/99 (40%), Positives = 57/99 (57%), Gaps = 9/99 (9%)

Query: 1   MLKSLDESVGRVI-QSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
           ++ + DES+  +  + L+  GL   +++V  SDNGG  V ++       SN+PLRG K  
Sbjct: 292 LMMNADESLANLTNRGLKATGLYDTAVIVIASDNGG-QVSAM----GGGSNYPLRGEKKY 346

Query: 60  FHEGGVRNVAALWSPLL---RKGQVLENLMHITDWLPTL 95
             EGGVR  A + SPLL    +G     LMH++DWLPTL
Sbjct: 347 LFEGGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTL 385


>gi|149199917|ref|ZP_01876945.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149136986|gb|EDM25411.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 499

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 2/106 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D SVG VI++L+E G+   +++ F SDNG  +  +   H    SN PL+G K S 
Sbjct: 257 MIEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKE-HAEKGSNLPLKGWKGSS 315

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR    L W   L +G+    +  + D  PT   +AG E  +
Sbjct: 316 EEGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEPEV 361


>gi|218131663|ref|ZP_03460467.1| hypothetical protein BACEGG_03284 [Bacteroides eggerthii DSM 20697]
 gi|317474715|ref|ZP_07933989.1| sulfatase [Bacteroides eggerthii 1_2_48FAA]
 gi|217985966|gb|EEC52305.1| arylsulfatase [Bacteroides eggerthii DSM 20697]
 gi|316909396|gb|EFV31076.1| sulfatase [Bacteroides eggerthii 1_2_48FAA]
          Length = 457

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR +  L+++GL  N+LV+F SDNG      L F G   S  PLRG K+S+
Sbjct: 267 VVEEIDWNVGRFLDYLDKKGLAENTLVIFASDNG----PWLSFKGEGGSADPLRGGKFSY 322

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR    + W  ++  G   + ++   D  PT+   AG
Sbjct: 323 YEGGVRVPCIIRWKGVVPAGVTSDAIIASIDLFPTIMHYAG 363


>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 506

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 59/108 (54%), Gaps = 10/108 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L ++D S+G ++++L+  G+  ++L +F SDNG        F   + SNWPLRG   +  
Sbjct: 256 LSAMDNSIGLLVEALKTRGMWDDTLFIFTSDNGA-------FATESGSNWPLRGNIATLF 308

Query: 62  EGGVRNVAALWSPLLRK-GQVLENLMHITDWLPTLYFIAGK--EKNID 106
           EGG R     +  +L+K G V   L+HI D   T+  +AG   E +ID
Sbjct: 309 EGGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELAGATTESDID 356


>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
          Length = 452

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD +VGRV ++LE+ G L N++++F SDNGG            A+N P RG K   
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   +R   V +N + ++D  PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336


>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
 gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
 gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
          Length = 452

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD +VGRV ++LE+ G L N++++F SDNGG            A+N P RG K   
Sbjct: 249 LIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   +R   V +N + ++D  PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336


>gi|254501736|ref|ZP_05113887.1| sulfatase, putative [Labrenzia alexandrii DFL-11]
 gi|222437807|gb|EEE44486.1| sulfatase, putative [Labrenzia alexandrii DFL-11]
          Length = 518

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 16/111 (14%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD-----SLHFHGNTASNWPLRG 55
           ++K LD +VG+V+  L+E G+  N++V+F SDNG  T+      +  FHG   + W    
Sbjct: 273 VMKELDNNVGKVLSRLDELGIADNTIVMFTSDNGPETMTWPDGGTTPFHGEKGTTW---- 328

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
                 EGG R  A + W   + +GQVL  +    DW+PTL   AG   N+
Sbjct: 329 ------EGGFRVPAIIRWPGKIAEGQVLNGIFSGLDWMPTLVAAAGGPDNL 373


>gi|386850106|ref|YP_006268119.1| sulfatase [Actinoplanes sp. SE50/110]
 gi|359837610|gb|AEV86051.1| sulfatase [Actinoplanes sp. SE50/110]
          Length = 462

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD  VG+V+ +L + G   N+LV+F SDNGG            +  WPL GTK S 
Sbjct: 276 MVENLDVRVGQVLSALRQTGQEENTLVIFSSDNGGE---------RYSYQWPLTGTKASL 326

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R      W   +  GQV    +   DW  TL  IAG
Sbjct: 327 NEGGIRVPTIVRWPARIHGGQVSHEPVVTPDWTATLLEIAG 367


>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
 gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
          Length = 470

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 12/107 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D  VGR++Q+L ++ +  N++VVF SDNGG            ++ WP  G K   
Sbjct: 254 IVERMDFQVGRILQALTDKKIRENTIVVFTSDNGGERF---------SNTWPFSGKKSEL 304

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R  + + W   +  GQ  E +    DWLPTL  +   E N+D
Sbjct: 305 LEGGIRIPSLICWPAKIPSGQTSEQVTLSMDWLPTL--LEAAETNVD 349


>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 468

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LD++ G++++ L+EEGL  ++L+VF +DNGGP        G    N+PL G K + 
Sbjct: 252 MTLALDKACGQILEKLKEEGLEDDTLIVFSNDNGGPD-------GTKTCNYPLSGCKSNN 304

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    +  P +++ G   E  +   D LPT   IAG
Sbjct: 305 LEGGIRVPFVMKLPKVIQPGSSYEYPVSTLDLLPTFVNIAG 345


>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
 gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
          Length = 482

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 51/101 (50%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +GRV+ +LE+ G+  +++VVF SDNGG            +  WP  G K   
Sbjct: 260 IVAEMDAQIGRVLDALEQHGIAQDTIVVFTSDNGGERF---------SDTWPFTGRKTEL 310

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A + W   L  G V    M   DWLPTL   AG
Sbjct: 311 LEGGLRIPAIVSWPRRLPAGVVSAQPMITMDWLPTLLAAAG 351


>gi|423220023|ref|ZP_17206519.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
           CL03T12C61]
 gi|392623855|gb|EIY17953.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
           CL03T12C61]
          Length = 455

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 37/112 (33%), Positives = 61/112 (54%), Gaps = 12/112 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
           M+  +D +VG++IQ L+EEG+  N+++ F SDNG      +H  G        SN P RG
Sbjct: 250 MVSEIDRNVGQIIQLLKEEGIWENTIIFFSSDNG------VHLTGGHDYKFFNSNGPFRG 303

Query: 56  TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            K   +EGG+R      W  ++++ +V ++L    D++PT+  I G +   D
Sbjct: 304 HKRDLYEGGIRAPFIVSWPMIIKEKRVTDHLSAFWDFMPTVAEIVGADVKSD 355


>gi|302543697|ref|ZP_07296039.1| LOW QUALITY PROTEIN: N-acetylgalactosamine-6-sulfate sulfatase
           [Streptomyces hygroscopicus ATCC 53653]
 gi|302461315|gb|EFL24408.1| LOW QUALITY PROTEIN: N-acetylgalactosamine-6-sulfate sulfatase
           [Streptomyces himastatinicus ATCC 53653]
          Length = 318

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 50/105 (47%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+ +L   G   ++LV F SDNGG            +  WPL G K+S 
Sbjct: 130 MVEDLDRSVGEVLAALRRSGQERDTLVFFASDNGGE---------RWSYQWPLTGAKFSL 180

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
           HEGG+R      W   L  GQV    +   DW  TL  + G   +
Sbjct: 181 HEGGIRVPTIVRWPARLAPGQVSHEPVFTPDWTATLLELGGARPD 225


>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 889

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 11/104 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DES+  +   L+ E +  N+L+ F+SDNGG     +  HG  +SN  LRG K S+
Sbjct: 223 MVSMVDESLASIHDDLKAEAMWENTLLFFLSDNGG-----ILRHG--SSNLDLRGEKGSY 275

Query: 61  HEGGVRNVA----ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R  A       + LL  G+V E L+HITD   T   +AG
Sbjct: 276 YEGGIRVPAFVSGGYLAKLLSAGRVFEGLVHITDLHATALRVAG 319


>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
 gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
          Length = 454

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 36/95 (37%), Positives = 50/95 (52%), Gaps = 9/95 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++++LD +VG+V+ +L+E G   N+LVVF SDNGG            +  WP  G K   
Sbjct: 235 LVEALDAAVGQVLAALDETGQADNTLVVFTSDNGGERF---------SKTWPFTGQKTEL 285

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R    L  P   + QV   +    DWLPTL
Sbjct: 286 LEGGLRVPTLLRWPARVRPQVQAQVTITADWLPTL 320


>gi|409198451|ref|ZP_11227114.1| arylsulfatase [Marinilabilia salmonicolor JCM 21150]
          Length = 465

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 42/114 (36%), Positives = 59/114 (51%), Gaps = 14/114 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
           M+  +D  +GR++Q L++ GL  N+LVVF SDNG       H  G        SN P RG
Sbjct: 261 MVTHMDRDIGRLMQLLKDLGLDENTLVVFTSDNG------THVEGGNDPDFFDSNGPFRG 314

Query: 56  TKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGK--EKNID 106
            K   +EGG+R    A W  ++  G+  +++    D  PT   +AGK  EKN D
Sbjct: 315 YKRDLYEGGIRVPFIARWKNVIEPGRESDHISAFWDMYPTFCELAGKQPEKNTD 368


>gi|359146688|ref|ZP_09180156.1| sulfatase [Streptomyces sp. S4]
          Length = 508

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG+V+++LE  G   N+LV F SDNGG            + NWPL G K S 
Sbjct: 319 MVEDLDRSVGQVLRALERSGQEENTLVFFASDNGGERF---------SYNWPLDGNKGSL 369

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  + L W   +   QV +  ++  DW  TL  + G
Sbjct: 370 KEGGIRVPSVLRWPARIDPRQVSDVPVYSPDWTATLLELGG 410


>gi|325109217|ref|YP_004270285.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969485|gb|ADY60263.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 485

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 55/103 (53%), Gaps = 5/103 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D S+GRV+  LE+ GL  N+++ F SDNGG +       G+  SN P RG K   
Sbjct: 272 MVETMDTSIGRVLDHLEKLGLTDNTVICFTSDNGGLSTS----EGSPTSNLPFRGGKGWV 327

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R    + W      G V E  +   D  PTL  IA  E
Sbjct: 328 YEGGIREPLLIKWPGTTPAGSVCETPICSIDMHPTLCDIADAE 370


>gi|421612348|ref|ZP_16053456.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408496803|gb|EKK01354.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 482

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 43/115 (37%), Positives = 61/115 (53%), Gaps = 11/115 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNTASNWP 52
           +++ +D +VGR++ SL E  L  N+ V+F SDNG   V +           HG +A   P
Sbjct: 219 VIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRPGDHGGSAG--P 276

Query: 53  LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           LR  K S  EGGVR  A LW+P  +  G V +++    D +PTL  +AG E   D
Sbjct: 277 LRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAEIPTD 331


>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 498

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 54/103 (52%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD  VGR+ + LEE G   N+L+VF SDNGG T        N + N PLRG K S 
Sbjct: 257 MMFALDRGVGRIRKHLEETGQWKNTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  +   W      G+  + ++   D LPT    AG +
Sbjct: 310 REGGIRVPMIWTWPAKFPAGKRYDGVISSLDLLPTFCSAAGAQ 352


>gi|325108394|ref|YP_004269462.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968662|gb|ADY59440.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 475

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 8/108 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG----PTVDSLHFHGNTASNWPLRGT 56
           ML+ LD +VGR++ ++E+     N+ V F +DNGG    P  D+      T +N PL G+
Sbjct: 253 MLEDLDNNVGRLLDAIEKLEAEENTYVFFTADNGGRGTVPGGDT----SRTETNHPLTGS 308

Query: 57  KYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
           K+S +EGG+R       P +++  V    +   D+LPT Y +AG + N
Sbjct: 309 KHSLYEGGIRVPFLARGPGIKQNSVCHQPVVGYDFLPTFYSLAGGKPN 356


>gi|417301290|ref|ZP_12088451.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327542405|gb|EGF28888.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 482

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 43/115 (37%), Positives = 61/115 (53%), Gaps = 11/115 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNTASNWP 52
           +++ +D +VGR++ SL E  L  N+ V+F SDNG   V +           HG +A   P
Sbjct: 219 VIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRPGDHGGSAG--P 276

Query: 53  LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           LR  K S  EGGVR  A LW+P  +  G V +++    D +PTL  +AG E   D
Sbjct: 277 LRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAELPTD 331


>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
 gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
          Length = 482

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  S+D  +G+V+ +L+E GL  N+LVVF +DNGGP+        N ++N PL GTK + 
Sbjct: 263 MTLSMDREIGKVLNTLDELGLSDNTLVVFTNDNGGPS------DTNASNNGPLSGTKANH 316

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAG 100
            EGG+R    +  P  +  Q    N +   D LPT +  AG
Sbjct: 317 LEGGIRVPFIMRWPAGKVSQGEYPNPISTLDLLPTFFSAAG 357


>gi|421612380|ref|ZP_16053488.1| N-acetyl-galactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408496835|gb|EKK01386.1| N-acetyl-galactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 889

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 60/97 (61%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
           M++S+D+++G ++ +L+  GL   +++VF SDNGG   + +   G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGLADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561

Query: 60  FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            +EGGVR  A +  P ++  G   + ++   D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTL 598


>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 498

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 53/103 (51%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD  VGR+ + LEE G   N+L+VF SDNGG T        N + N PLRG K S 
Sbjct: 257 MMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT-------NNGSWNGPLRGVKGSM 309

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R  +   W      G   + ++   D LPT    AG E
Sbjct: 310 REGGIRVPMIWTWPAKFPAGVRYDGVISSLDLLPTFCSAAGAE 352


>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 452

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD +VGRV ++L++ G L N++++F SDNGG            A+N P RG K   
Sbjct: 249 LIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   +R   V EN + ++D  PTL
Sbjct: 302 YEGGIRVAGGIYWKNRIRPA-VRENFVMLSDMFPTL 336


>gi|198275209|ref|ZP_03207740.1| hypothetical protein BACPLE_01368 [Bacteroides plebeius DSM 17135]
 gi|198271792|gb|EDY96062.1| arylsulfatase [Bacteroides plebeius DSM 17135]
          Length = 509

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ S+DE+VGRV+Q+L++ GL  N++V  +SDNGG +       G+   N PLR  K   
Sbjct: 299 LIYSMDENVGRVMQALKDNGLDKNTIVCLLSDNGGLSTA----EGSPTCNAPLRAGKGWL 354

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P ++  G V    +   D+ PTL  +AG
Sbjct: 355 YEGGIREPFIIKYPQMVEAGSVCHTPVVAVDFYPTLLDMAG 395


>gi|255034790|ref|YP_003085411.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254947546|gb|ACT92246.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 517

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 59/101 (58%), Gaps = 1/101 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G V++ L+E+ +  N++V+F+SDNGG +            N PL+  K S 
Sbjct: 288 LVEGMDKSLGDVLRYLDEQKIADNTVVLFMSDNGGLSTSPARGGKAWTHNLPLKAGKGSV 347

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R  +   W  + + G V E  + I D+ PT+  IAG
Sbjct: 348 YEGGIREPMLVRWPGVTKAGSVTEQYVIIEDFFPTILDIAG 388


>gi|372208819|ref|ZP_09496621.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 459

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D+++G V+Q+L+E GL  N++VVF SDNGG        H    SN PLR  K   
Sbjct: 241 MIEHMDDALGAVVQALKETGLENNTIVVFHSDNGG--------HLKPTSNAPLRSGKGDL 292

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTL 95
           +EGGVR       PL  K   + N + I+ D +PTL
Sbjct: 293 YEGGVRVPLVFKWPLKIKAGAVSNELAISADIMPTL 328


>gi|404406436|ref|ZP_10998020.1| sulfatase [Alistipes sp. JC136]
          Length = 468

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 56/101 (55%), Gaps = 11/101 (10%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSF 60
           LK +D+ VG +++ LEE G+  N++V+F SDNGG P V           N PLRG K + 
Sbjct: 273 LKVIDDGVGMIVEKLEELGIADNTIVIFTSDNGGSPQV---------TDNGPLRGGKGTL 323

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG R    +W P  +  G+V E   +  D+ PTL  + G
Sbjct: 324 YEGGTREPMIVWQPGRICGGRVSELPTNNYDFYPTLCQLTG 364


>gi|313231682|emb|CBY08795.1| unnamed protein product [Oikopleura dioica]
          Length = 509

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 54/94 (57%), Gaps = 7/94 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K+LD ++G + ++LE  G L N++V+++ DNG   V++    G+      LRG K    
Sbjct: 220 VKALDIAIGEIFEALENAGKLDNTIVIYVGDNGA-AVENFQSSGD------LRGYKGGAF 272

Query: 62  EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
           EGGV+  A L  P + +G   + +   TDWLPTL
Sbjct: 273 EGGVKTFALLSGPGISQGASYKGIFQSTDWLPTL 306


>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 60/107 (56%), Gaps = 7/107 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D+ V +V+QSL+E  +  N++V F+SDNGGP+      H N + N+PL+G K   
Sbjct: 248 MVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPS------HKNKSDNFPLKGQKSDV 301

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R   A+  P  ++  QV ++ +   D   T+  +A    + D
Sbjct: 302 WEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHAD 348


>gi|311745994|ref|ZP_07719779.1| N-acetylgalactosamine-6-sulfatase [Algoriphagus sp. PR1]
 gi|126576208|gb|EAZ80486.1| N-acetylgalactosamine-6-sulfatase [Algoriphagus sp. PR1]
          Length = 504

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D+S+G ++  LE+  L+ N+++VF+SDNG         H     N PLRG K S 
Sbjct: 290 MIEGMDKSLGDIMDFLEDSKLVENTIIVFMSDNGA--------HKQVPQNSPLRGWKLSP 341

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R  +   W  + + G V+E  + I D  PT   ++G
Sbjct: 342 YEGGIRVPLIVKWPGVTKPGSVVEKQVIIEDIYPTFLDMSG 382


>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 59/108 (54%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLDE VG VI    +  LL N++  F++DNGG T D         +NWPL G K++ 
Sbjct: 249 MVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNGGETRDG-------GNNWPLSGQKWTL 301

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLY--FIAGKEKNID 106
            EGGV+    ++  L+  G+    L H++D LPT+   F    +K ID
Sbjct: 302 LEGGVKATGFMFGGLVPLGK-YNGLFHVSDILPTVMEAFNCPIKKKID 348


>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 4/103 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT--ASNWPLRGTKY 58
           M+  LD++VGR++ +LE  GL  ++LVVF+SD+G  T +S +   +    SN P RG K 
Sbjct: 255 MVTQLDKNVGRIMTALEGLGLAADTLVVFVSDHGA-TFESGNLGASDYHDSNRPFRGQKR 313

Query: 59  SFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +  EGG+R    A W   +  G +   ++H+TD  PTL    G
Sbjct: 314 TLWEGGIRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAATG 356


>gi|410633554|ref|ZP_11344196.1| N-acetylgalactosamine-6-sulfatase [Glaciecola arctica BSs20135]
 gi|410146795|dbj|GAC21063.1| N-acetylgalactosamine-6-sulfatase [Glaciecola arctica BSs20135]
          Length = 440

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 4/100 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD SVG+V+Q+L E+G+  N+LVVF SDNG      L F     S  P RG K   
Sbjct: 226 VMEELDWSVGQVLQTLREQGIAENTLVVFSSDNG----PWLLFETMGGSAGPFRGGKNET 281

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R     W P   K +++ ++    D LPTL  +AG
Sbjct: 282 LEGGLRVPGFFWWPGHIKPELVHDIGSTIDLLPTLVSVAG 321


>gi|383116283|ref|ZP_09937035.1| hypothetical protein BSHG_3339 [Bacteroides sp. 3_2_5]
 gi|251945469|gb|EES85907.1| hypothetical protein BSHG_3339 [Bacteroides sp. 3_2_5]
          Length = 493

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSRHLKGRQVSDTPIIGTDYYPT 363


>gi|149196406|ref|ZP_01873461.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
 gi|149140667|gb|EDM29065.1| aryl-sulphate sulphohydrolase [Lentisphaera araneosa HTCC2155]
          Length = 493

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD +VGR++ +LEE+GL   +L++F SDNGG        H   +   PLR  K S+
Sbjct: 245 MIEALDHNVGRLVAALEEQGLREKTLIMFTSDNGG--------HMKFSRQEPLRAGKGSY 296

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R    A W  ++  G   +  +   D+ PT+  +AG E
Sbjct: 297 YEGGIRVPFFASWPGVIEAGSRSQVPVTGLDFYPTVCELAGVE 339


>gi|423260671|ref|ZP_17241593.1| hypothetical protein HMPREF1055_03870 [Bacteroides fragilis
           CL07T00C01]
 gi|423266807|ref|ZP_17245809.1| hypothetical protein HMPREF1056_03496 [Bacteroides fragilis
           CL07T12C05]
 gi|387775225|gb|EIK37334.1| hypothetical protein HMPREF1055_03870 [Bacteroides fragilis
           CL07T00C01]
 gi|392699582|gb|EIY92757.1| hypothetical protein HMPREF1056_03496 [Bacteroides fragilis
           CL07T12C05]
          Length = 493

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|375092146|ref|ZP_09738431.1| hypothetical protein HMPREF9709_01293 [Helcococcus kunzii ATCC
           51366]
 gi|374561912|gb|EHR33249.1| hypothetical protein HMPREF9709_01293 [Helcococcus kunzii ATCC
           51366]
          Length = 472

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            L+ +D+ +G +I  +EE G   N+L+V+ +DNGG          N  +N PL GTKY+ 
Sbjct: 264 QLELMDKKIGEIIDKVEEMGQKDNTLIVYTTDNGGSPC-------NYGNNAPLTGTKYTL 316

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG R      W  ++ K Q   NL    D L T  ++ G E
Sbjct: 317 YEGGTRVPFIFSWDGVIPKNQSSRNLSSTLDLLTTFAYLGGVE 359


>gi|375360006|ref|YP_005112778.1| putative exported sulfatase [Bacteroides fragilis 638R]
 gi|301164687|emb|CBW24246.1| putative exported uslfatase [Bacteroides fragilis 638R]
          Length = 493

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 467

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 41/110 (37%), Positives = 61/110 (55%), Gaps = 11/110 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D ++GRV+Q L+ +G+  N+LV+F++DNGG T        N + N  LRG K S 
Sbjct: 252 MLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNGGATT-------NYSDNGQLRGMKGSV 304

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG---KEKNID 106
            EGGVR      W+  +    V +  +   D LPT    AG   K+K++D
Sbjct: 305 WEGGVRVGYIMRWNGKIPANVVYDKAVSSLDILPTSLAAAGNTRKDKHLD 354


>gi|60683167|ref|YP_213311.1| sulfatase [Bacteroides fragilis NCTC 9343]
 gi|423270164|ref|ZP_17249135.1| hypothetical protein HMPREF1079_02217 [Bacteroides fragilis
           CL05T00C42]
 gi|423276123|ref|ZP_17255065.1| hypothetical protein HMPREF1080_03718 [Bacteroides fragilis
           CL05T12C13]
 gi|60494601|emb|CAH09402.1| putative exported uslfatase [Bacteroides fragilis NCTC 9343]
 gi|392698088|gb|EIY91270.1| hypothetical protein HMPREF1079_02217 [Bacteroides fragilis
           CL05T00C42]
 gi|392699263|gb|EIY92444.1| hypothetical protein HMPREF1080_03718 [Bacteroides fragilis
           CL05T12C13]
          Length = 493

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|423285640|ref|ZP_17264522.1| hypothetical protein HMPREF1204_04060 [Bacteroides fragilis HMW
           615]
 gi|404579155|gb|EKA83873.1| hypothetical protein HMPREF1204_04060 [Bacteroides fragilis HMW
           615]
          Length = 493

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|423251817|ref|ZP_17232830.1| hypothetical protein HMPREF1066_03840 [Bacteroides fragilis
           CL03T00C08]
 gi|423255138|ref|ZP_17236068.1| hypothetical protein HMPREF1067_02712 [Bacteroides fragilis
           CL03T12C07]
 gi|392649242|gb|EIY42921.1| hypothetical protein HMPREF1066_03840 [Bacteroides fragilis
           CL03T00C08]
 gi|392652579|gb|EIY46238.1| hypothetical protein HMPREF1067_02712 [Bacteroides fragilis
           CL03T12C07]
          Length = 493

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|440713548|ref|ZP_20894148.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436441706|gb|ELP34912.1| N-acetylgalactosamine-6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 889

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 60/97 (61%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
           M++S+D+++G ++ +L+  G+   +++VF SDNGG   + +   G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561

Query: 60  FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            +EGGVR  A +  P ++  G   + ++   D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTL 598


>gi|408676338|ref|YP_006876165.1| Arylsulfatase [Streptomyces venezuelae ATCC 10712]
 gi|328880667|emb|CCA53906.1| Arylsulfatase [Streptomyces venezuelae ATCC 10712]
          Length = 467

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD S+G+V+ +L+  G   ++LV F SDNGG            + NWPL G K S 
Sbjct: 282 MVEDLDRSIGQVLTALKRSGQEQDTLVFFASDNGGERF---------SYNWPLAGNKGSL 332

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  + L W   +  GQV +  +   DW  TL  +AG
Sbjct: 333 QEGGIRVPSILRWPARIDGGQVSDLPVFTPDWTATLLDLAG 373


>gi|53715233|ref|YP_101225.1| sulfatase ydeN [Bacteroides fragilis YCH46]
 gi|52218098|dbj|BAD50691.1| putative secreted sulfatase ydeN precursor [Bacteroides fragilis
           YCH46]
          Length = 493

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|343084295|ref|YP_004773590.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352829|gb|AEL25359.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 499

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 63/103 (61%), Gaps = 2/103 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G ++Q L+E+ L+ N++++F+SDNGG +V S     +T  N PL   K S 
Sbjct: 275 LIEGMDKSLGDIMQHLKEKNLMENTIILFMSDNGGLSVHSRGGEPHT-HNKPLSSGKGSI 333

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           HEGG+R  +   W  +   G   +N + I D+ P++  +AG E
Sbjct: 334 HEGGIRVPMIVKWPGVTIPGSTNDNYLIIEDFYPSILEMAGVE 376


>gi|407940996|ref|YP_006856637.1| sulfatase [Acidovorax sp. KKS102]
 gi|407898790|gb|AFU47999.1| sulfatase [Acidovorax sp. KKS102]
          Length = 444

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GR++  LE +GL  ++LVVF SDNGG            + NWPL G K   
Sbjct: 240 MIHHMDEGIGRIMALLEAKGLARDTLVVFTSDNGGERF---------SDNWPLVGGKMDL 290

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W  ++  G V        DW  T+   AG   + D
Sbjct: 291 TEGGIRVPWIAHWPAVIAPGGVSAQTCMTMDWSATMLDAAGAATHTD 337


>gi|32471039|ref|NP_864032.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32396741|emb|CAD71706.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 889

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 60/97 (61%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
           M++S+D+++G ++ +L+  G+   +++VF SDNGG   + +   G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561

Query: 60  FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            +EGGVR  A +  P ++  G   + ++   D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTL 598


>gi|239816550|ref|YP_002945460.1| sulfatase [Variovorax paradoxus S110]
 gi|239803127|gb|ACS20194.1| sulfatase [Variovorax paradoxus S110]
          Length = 438

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G ++ +L + G+  N+LVVF SDNGG            + NWPL G K   
Sbjct: 234 MIHHMDEGIGWIMAALRKHGMADNTLVVFTSDNGGERF---------SDNWPLVGGKMDL 284

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W  ++ KG     L    DW  T+   AG + + D
Sbjct: 285 TEGGIRVPWIAHWPAVIAKGGESRQLCMTMDWSATMLDAAGVKADAD 331


>gi|265767063|ref|ZP_06094892.1| sulfatase [Bacteroides sp. 2_1_16]
 gi|263253440|gb|EEZ24916.1| sulfatase [Bacteroides sp. 2_1_16]
          Length = 493

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++ +L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 489

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D  +GR++ +++  G+  N+LVVF SDNG    D  H      SN PLRG K++ 
Sbjct: 286 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNG---ADPKH----GGSNEPLRGNKFTT 338

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA + W   L  G   +++    D LP +   AG
Sbjct: 339 WEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPIMVAAAG 379


>gi|167524284|ref|XP_001746478.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775240|gb|EDQ88865.1| predicted protein [Monosiga brevicollis MX1]
          Length = 520

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 38/115 (33%), Positives = 59/115 (51%), Gaps = 15/115 (13%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH------------FHGNTA 48
           M+  LD+++      L+ +G+  N+L+V  SDNGG     L             F+G   
Sbjct: 248 MVAFLDDNLANFTSLLKSKGMWNNTLMVLTSDNGGYVKSFLGDCDNNTSHGYACFNGEAG 307

Query: 49  SNWPLRGTKYSFHEGGVRN---VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +N+PL+G KYS  EGGVR    V+  + P  R+G   + ++H+ DW  T   +AG
Sbjct: 308 ANYPLQGGKYSTLEGGVRVNAFVSGGFVPEERRGTREDEMVHVADWYATFAALAG 362


>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLDE VG V+    +  LL N++  F++DNGG T D         +NWPL G K++ 
Sbjct: 249 MVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGETRDG-------GNNWPLSGQKWTL 301

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLY--FIAGKEKNID 106
            EGGV+    ++  L+  G+    L H++D LPT+   F    +K ID
Sbjct: 302 LEGGVKATGFMFGGLVPLGK-YNGLFHVSDILPTVMEAFNCPIKKKID 348


>gi|313246656|emb|CBY35538.1| unnamed protein product [Oikopleura dioica]
          Length = 351

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLDE VG V+    +  LL N++  F++DNGG T D         +NWPL G K++ 
Sbjct: 110 MVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGETRDG-------GNNWPLSGQKWTL 162

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLY--FIAGKEKNID 106
            EGGV+    ++  L+  G+    L H++D LPT+   F    +K ID
Sbjct: 163 LEGGVKATGFMFGGLVPLGK-YNGLFHVSDILPTVMEAFNCPIKKKID 209


>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
 gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
          Length = 470

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 36/95 (37%), Positives = 55/95 (57%), Gaps = 8/95 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+ VG ++++L+E+ L  N+L+VF++DNG  T       GN A N  LRG K S 
Sbjct: 251 MMVSLDDGVGELVKTLKEQNLYDNTLIVFVNDNGAAT-------GNYADNGKLRGLKGSK 303

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPT 94
            EGG+R    +  P  +  G+V   ++   D +PT
Sbjct: 304 WEGGIRVAYIMQYPDQIPAGKVYSKMVSGLDIMPT 338


>gi|320103547|ref|YP_004179138.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319750829|gb|ADV62589.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 502

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 3/96 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L+ +D S+GR++ +L + GL  N+ VVF+SDNGG   +S  + G   SN PLR  K + 
Sbjct: 265 LLEEMDTSIGRILNALNQRGLDSNTWVVFLSDNGGLERESGGWPG--TSNRPLRDQKGTL 322

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R  + A    L + G V+E  +   D  PTL
Sbjct: 323 YEGGIRIPMIARIPGLTKPGTVVETPVVTHDLAPTL 358


>gi|423239862|ref|ZP_17220977.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
           CL03T12C01]
 gi|392644851|gb|EIY38585.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
           CL03T12C01]
          Length = 464

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|333907388|ref|YP_004480974.1| N-acetylgalactosamine-6-sulfatase [Marinomonas posidonica
           IVIA-Po-181]
 gi|333477394|gb|AEF54055.1| N-acetylgalactosamine-6-sulfatase [Marinomonas posidonica
           IVIA-Po-181]
          Length = 437

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 49/96 (51%), Gaps = 10/96 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GR++ +LEE GL  ++L+VF SDNGG            + NWPL G K   
Sbjct: 233 MIHHMDEGIGRIMAALEEHGLTDDTLIVFTSDNGGERF---------SDNWPLVGGKMDL 283

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R    A W   +  G+V        DW  TL
Sbjct: 284 TEGGIRVPWIARWPKKIDAGKVSTQHCMTMDWSRTL 319


>gi|198274831|ref|ZP_03207363.1| hypothetical protein BACPLE_00990 [Bacteroides plebeius DSM 17135]
 gi|198272278|gb|EDY96547.1| arylsulfatase [Bacteroides plebeius DSM 17135]
          Length = 459

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 5/104 (4%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ +D +VGR++  LE+EGL  N++V+F SDNG      L    N  S  PLR  K+S +
Sbjct: 266 VEEIDWNVGRLLDFLEKEGLSENTMVIFTSDNG----PWLGMKENGGSALPLRDGKFSAY 321

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
           EGGVR    + W   +  G+V ++++   D  PTL  +A  +K+
Sbjct: 322 EGGVRTPCIIKWKNHIPAGKVSDHIIASIDIFPTLLELAQMDKS 365


>gi|431907856|gb|ELK11463.1| Arylsulfatase B [Pteropus alecto]
          Length = 267

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/77 (49%), Positives = 47/77 (61%), Gaps = 8/77 (10%)

Query: 29  FISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLR-KGQVLENLMH 87
            + DNGG T+      GN   NWPLRG K+S  EGG+R V  + SPLL+ KG     L+H
Sbjct: 31  LVHDNGGQTLAG----GN---NWPLRGRKWSLWEGGIRGVGFVTSPLLKQKGVKNRELIH 83

Query: 88  ITDWLPTLYFIAGKEKN 104
           I+DWLPTL  +AG   N
Sbjct: 84  ISDWLPTLVNLAGGSTN 100


>gi|212690903|ref|ZP_03299031.1| hypothetical protein BACDOR_00391 [Bacteroides dorei DSM 17855]
 gi|237712609|ref|ZP_04543090.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
 gi|212666135|gb|EEB26707.1| arylsulfatase [Bacteroides dorei DSM 17855]
 gi|229453930|gb|EEO59651.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
          Length = 464

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|265752292|ref|ZP_06088085.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
 gi|423229021|ref|ZP_17215426.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
           CL02T00C15]
 gi|423244861|ref|ZP_17225935.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
           CL02T12C06]
 gi|263237084|gb|EEZ22554.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
 gi|392634774|gb|EIY28686.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
           CL02T00C15]
 gi|392640902|gb|EIY34693.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
           CL02T12C06]
          Length = 464

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|149198336|ref|ZP_01875382.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149138632|gb|EDM27039.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 512

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 61/107 (57%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D++VG ++ +++E G+   ++++F+SDNGG     +     T  N+PLRG K S 
Sbjct: 268 MIESMDDAVGSLLNAVDEAGIADETVIIFVSDNGGNMYSKIDGVFPT-DNYPLRGGKASM 326

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    + W  + +     E L+  +D+ PT+    G E  +D
Sbjct: 327 CEGGIRVPCVISWPGVTQDETRSEELIQTSDFYPTILNQLGIEIPVD 373


>gi|345512484|ref|ZP_08792010.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
 gi|345456366|gb|EEO45525.2| arylsulfatase [Bacteroides dorei 5_1_36/D4]
          Length = 464

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|421613867|ref|ZP_16054938.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495453|gb|EKK00041.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 501

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 58/107 (54%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  +GR++  LEE  L   +LV+F SDNG P  +  H      S+ PL+G+K S 
Sbjct: 293 MITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNG-PHHEGGHSDLFFNSSGPLQGSKRSM 351

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           HEGG+R    A WS  +  G + ++     D+LPT   +AG E   D
Sbjct: 352 HEGGIRVPFIAKWSGTIEPGTISDHPSAFWDFLPTACELAGAEPPAD 398


>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
 gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
          Length = 496

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D+++GR++  L+ +G+  N++++F SDNGG       F G    N P RG K   
Sbjct: 238 MVDSMDQAIGRIMDELDRQGVSDNTIILFFSDNGG-------FAGFGGVNKPYRGGKLET 290

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
            EGG+R N    W   +  G      + + D  PTL   A      EK ID
Sbjct: 291 FEGGIRVNAIMRWPGAIPAGSKTAQAISVMDVFPTLAAAADIEMHNEKKID 341


>gi|417302348|ref|ZP_12089451.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327541354|gb|EGF27895.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 501

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 58/107 (54%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  +GR++  LEE  L   +LV+F SDNG P  +  H      S+ PL+G+K S 
Sbjct: 293 MITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNG-PHHEGGHSDLFFNSSGPLQGSKRSM 351

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           HEGG+R    A WS  +  G + ++     D+LPT   +AG E   D
Sbjct: 352 HEGGIRVPFIAKWSGTIEPGTISDHPSAFWDFLPTACELAGAEPPAD 398


>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
 gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
          Length = 502

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 59/111 (53%), Gaps = 19/111 (17%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  S+D  +G+V+ SL+E GL  N+L+VF +DNGGP+        N + N PL GTK + 
Sbjct: 285 MTLSMDREIGKVLASLDELGLSENTLLVFTNDNGGPS------DTNASDNTPLSGTKANH 338

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMH----ITDWLPTLYFIAGKEKN 104
            EGG+R       P L +   G +  N  H      D LPT +  AG +K+
Sbjct: 339 LEGGIR------VPFLMRWPAGNIGPNTYHEPISTLDLLPTFFAAAGGKKS 383


>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
 gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
          Length = 579

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D  +GR++ +++  G+  N+LVVF SDNG    D  H      +N PLRG K++ 
Sbjct: 376 MTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---DPKH----GGNNEPLRGNKFTT 428

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR VA + W   L  G   +++    D LP++   AG
Sbjct: 429 WEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPSMVGAAG 469


>gi|410096140|ref|ZP_11291130.1| hypothetical protein HMPREF1076_00308 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409227211|gb|EKN20112.1| hypothetical protein HMPREF1076_00308 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 516

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 8/104 (7%)

Query: 4   SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFH------GNTASNWPLRGTK 57
             D SVG+++  L+  GL  N+L++  SDN GP VD  +        G+     PLRG K
Sbjct: 303 QFDWSVGQILDELDRLGLRENTLIILSSDN-GPVVDDGYADQAEELLGDHKPGGPLRGGK 361

Query: 58  YSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           YS  E G R  A A W   ++KGQV + LM   DW  +L  + G
Sbjct: 362 YSAFEAGTRIPAIASWPKEIKKGQVSDALMSQVDWFASLAALTG 405


>gi|398996733|ref|ZP_10699583.1| arylsulfatase A family protein [Pseudomonas sp. GM21]
 gi|398126037|gb|EJM15485.1| arylsulfatase A family protein [Pseudomonas sp. GM21]
          Length = 571

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ LE  G L N++VVF +DNG    +S  F     +  P +  K +  
Sbjct: 305 MKQMDDNIGLVLKKLESMGQLDNTIVVFTTDNGA---ESQTFPDGGMT--PFKAGKLTTW 359

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R  A + W   ++ G V E+++   DWLPTL  +AG  K
Sbjct: 360 EGGMRVPALIRWPGQIKPGTVYEDIVASYDWLPTLVEVAGGPK 402


>gi|340619477|ref|YP_004737930.1| sulfatase [Zobellia galactanivorans]
 gi|339734274|emb|CAZ97651.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 488

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 41/106 (38%), Positives = 57/106 (53%), Gaps = 8/106 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT---VDSLHFHGNTASNWPLRGTK 57
           M+  LD  VG ++ +LE+ G+  N+LVVF SDNG  T    D  +F+    S+ P RGTK
Sbjct: 276 MVHLLDAQVGEIMATLEKLGIADNTLVVFTSDNGPHTEGGADPEYFN----SSGPFRGTK 331

Query: 58  YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
              +EGG+R  + A W   ++ G   E +    D  PT   IAG E
Sbjct: 332 RDLYEGGIRVPMVAAWPGKIKPGSRTELVSAFWDVFPTFSDIAGIE 377


>gi|294778194|ref|ZP_06743624.1| arylsulfatase [Bacteroides vulgatus PC510]
 gi|294447960|gb|EFG16530.1| arylsulfatase [Bacteroides vulgatus PC510]
          Length = 466

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 263 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 322 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 357


>gi|410636000|ref|ZP_11346606.1| predicted sulfatase [Glaciecola lipolytica E3]
 gi|410144354|dbj|GAC13811.1| predicted sulfatase [Glaciecola lipolytica E3]
          Length = 474

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLDESVG ++  L    L  N+L++F SDNGG   +   F  N A   PLR  K + 
Sbjct: 261 MVASLDESVGTLLNHLTSNNLTHNTLILFYSDNGGLDKNDTGFPTNNA---PLRNGKATL 317

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           +EGG+R    A W   +  G     ++  TD+ PTL  + G +K
Sbjct: 318 YEGGIRVPFIASWPGKIAAGSESNEMITSTDFYPTLAKLVGSKK 361


>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 56/103 (54%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG +  +L+ +G+  N L V  SDNGGP        G  A+N+PL+G K + 
Sbjct: 258 MVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGPV-----HEGTGANNYPLKGGKTTD 312

Query: 61  HEGGVR---NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +GGVR    V+  + P   +GQ  +  +H+ DW  T   IAG
Sbjct: 313 WQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCAIAG 355


>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 466

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 33/95 (34%), Positives = 56/95 (58%), Gaps = 8/95 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++D+ +GR+++ L++ G L N+L++F++DNG  T        N A N PLRG K S 
Sbjct: 249 LMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT-------NNGADNGPLRGLKGSK 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
            EGG+R    + W   L+ G+  + L+   D +PT
Sbjct: 302 WEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPT 336


>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
           anophagefferens]
          Length = 304

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 56/108 (51%), Gaps = 11/108 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ + D ++G +  +L++  L  N+L+V +SDNG              SN PLRG K  +
Sbjct: 188 VVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAGG--------SNAPLRGAKGGY 239

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR  A L   LL    +G   E L+H+ DW PTL   AG   ++
Sbjct: 240 FEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAGLRPDL 287


>gi|423315320|ref|ZP_17293250.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
           CL09T03C04]
 gi|423315328|ref|ZP_17293258.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
           CL09T03C04]
 gi|392679557|gb|EIY72938.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
           CL09T03C04]
 gi|392679565|gb|EIY72946.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
           CL09T03C04]
          Length = 464

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|325109684|ref|YP_004270752.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969952|gb|ADY60730.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 498

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D++VGR++ +L+   L  N+LV+F+SDNGG   +++       SN PLRG K S 
Sbjct: 266 MIESMDDAVGRLLDTLDRLQLTENTLVIFMSDNGGNMYNTVD-KTTPTSNAPLRGGKASM 324

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            EGG+R    +  P  +  G   E ++   D+ PT+
Sbjct: 325 FEGGIRVPCIISYPGHVEAGSRSEAIIQSCDFYPTI 360


>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
           ATCC 8503]
          Length = 452

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 34/96 (35%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD +VGRV ++L++ G L N++++F SDNGG            A+N P RG K   
Sbjct: 249 LIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD-------DAGANNGPFRGAKQDM 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W   +R   V +N + ++D  PTL
Sbjct: 302 YEGGIRVAGGIYWKNQIRPA-VRDNFVMLSDMFPTL 336


>gi|150002770|ref|YP_001297514.1| arylsulfatase [Bacteroides vulgatus ATCC 8482]
 gi|149931194|gb|ABR37892.1| arylsulfatase precursor [Bacteroides vulgatus ATCC 8482]
          Length = 464

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
 gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
          Length = 482

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 42/109 (38%), Positives = 60/109 (55%), Gaps = 4/109 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD  VG+V+Q L+E G+  N+LV+F SDN GP  +  +      SN PLRG K   
Sbjct: 272 MVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDN-GPHKEGGYDPDYFNSNGPLRGIKRDL 330

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG--KEKNID 106
           +EGG+R  + A W   ++ G   + +    D+LPT   +A   K  NID
Sbjct: 331 YEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAELAKALKPANID 379


>gi|319643165|ref|ZP_07997794.1| arylsulfatase [Bacteroides sp. 3_1_40A]
 gi|345520543|ref|ZP_08799930.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
 gi|317385206|gb|EFV66156.1| arylsulfatase [Bacteroides sp. 3_1_40A]
 gi|345456873|gb|EET15372.2| arylsulfatase [Bacteroides sp. 4_3_47FAA]
          Length = 464

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 57/96 (59%), Gaps = 2/96 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+SVG +I+ L+E+GL  ++++VF SDNG  + +  H      SN P RG K   
Sbjct: 261 MVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS-EGGHDPSYFDSNGPFRGQKRDL 319

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R    + W  ++ +G V  ++    D+LPT+
Sbjct: 320 YEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTI 355


>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 548

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 60/103 (58%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML+S+D SV RV+  LE++G+  N++V+  SDNGG    +++       N P RG K +F
Sbjct: 325 MLRSIDRSVERVMAKLEKQGIADNTIVILSSDNGGADYVAIN-----DLNKPYRGWKNTF 379

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R   +  W  ++ +  V+E  ++  D +PT+  +A  +
Sbjct: 380 FEGGIRVPFSVTWPNVIDESTVIEEPVNHIDLMPTIINMANAD 422


>gi|108761533|ref|YP_634630.1| sulfatase [Myxococcus xanthus DK 1622]
 gi|108465413|gb|ABF90598.1| sulfatase family protein [Myxococcus xanthus DK 1622]
          Length = 553

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 38/105 (36%), Positives = 60/105 (57%), Gaps = 6/105 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD+ VG ++  L+E G+  N+LVVF +DNG   V+ + +      N P RG K S  
Sbjct: 268 MRELDDIVGVLLAKLDELGIADNTLVVFSTDNG---VEKMGW--PDGGNSPFRGEKGSTW 322

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
           EGGVR    + W  ++  G+V+ ++    DW+PTL   AG  K++
Sbjct: 323 EGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDL 367


>gi|405345812|ref|ZP_11022551.1| Arylsulfatase [Chondromyces apiculatus DSM 436]
 gi|397093455|gb|EJJ24162.1| Arylsulfatase [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 554

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 38/105 (36%), Positives = 60/105 (57%), Gaps = 6/105 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD+ VG ++  L+E G+  N+LVVF +DNG   V+ + +      N P RG K S  
Sbjct: 269 MRELDDIVGVLLAKLDELGIADNTLVVFSTDNG---VEKMGW--PDGGNSPFRGEKGSTW 323

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
           EGGVR    + W  ++  G+V+ ++    DW+PTL   AG  K++
Sbjct: 324 EGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDL 368


>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
 gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
          Length = 477

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 41/106 (38%), Positives = 57/106 (53%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
           M+ S+DESVGRV+  L+E  L  N+LV+F SDNGG      + +   G+   N PLRG K
Sbjct: 240 MIASVDESVGRVVALLDELKLSDNTLVIFSSDNGGVGGYQREGIKKAGDVTDNNPLRGGK 299

Query: 58  YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
              +EGG R      W   +  G+V +  +   D  PTL  +AG +
Sbjct: 300 GMLYEGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYPTLLELAGAK 345


>gi|417305741|ref|ZP_12092689.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327537968|gb|EGF24664.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 889

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 59/97 (60%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
           M++S+D+++G ++ +L+  G+   +++VF SDNGG   + +   G TA SN PLRG K +
Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 561

Query: 60  FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            +EGGVR  A +  P  +  G   + ++   D+ PTL
Sbjct: 562 MYEGGVRGPAIVVQPGAVESGSRSDAIIQSIDFYPTL 598


>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
          Length = 551

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G ++  ++E G+  N+ ++F SDNGG              N P RG K + 
Sbjct: 243 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 294

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG +  +   SPLL    V  N LMH+TDWLPT+  +AG
Sbjct: 295 WEGGCKANSWTHSPLLGATGVENNGLMHVTDWLPTIVDLAG 335


>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
          Length = 529

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G ++  ++E G+  N+ ++F SDNGG              N P RG K + 
Sbjct: 221 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 272

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG +  +   SPLL    V  N LMH+TDWLPT+  +AG
Sbjct: 273 WEGGCKANSWTHSPLLGATGVENNGLMHVTDWLPTIVDLAG 313


>gi|323455471|gb|EGB11339.1| hypothetical protein AURANDRAFT_2359, partial [Aureococcus
           anophagefferens]
          Length = 367

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 37/103 (35%), Positives = 58/103 (56%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+  G V+ +L   GL   +L+++ SDNGG    ++H  G  ++ WPLRG  Y+ 
Sbjct: 240 MVALMDDVAGDVVGALRTAGLWAKTLLLWSSDNGG----AVHL-GGGSNTWPLRGGYYNN 294

Query: 61  HEGGVRNVAALWS---PLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  A L     P + +G++L   ++I DW  T   +AG
Sbjct: 295 FEGGVRAPAFLAGGALPEVARGKLLTEWIYIADWYATFARLAG 337


>gi|330505678|ref|YP_004382547.1| putative sulfatase [Pseudomonas mendocina NK-01]
 gi|328919964|gb|AEB60795.1| probable sulfatase precursor [Pseudomonas mendocina NK-01]
          Length = 629

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 54/100 (54%), Gaps = 7/100 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD ++ R++++LEE G   N++V+F SDNGG  +       N  +N+PL G K ++
Sbjct: 253 ILSQLDHNMARLLKALEESGQADNTVVIFASDNGGANI-------NMDNNYPLIGKKATY 305

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    L  P   + Q +       D  PTL  IAG
Sbjct: 306 TEGGVRTPLLLRWPGHYENQDIREPTVFMDIFPTLVSIAG 345


>gi|149200261|ref|ZP_01877282.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
 gi|149136625|gb|EDM25057.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
          Length = 472

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 36/100 (36%), Positives = 62/100 (62%), Gaps = 2/100 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE++GR+ + L++ GL  N++V+F SDNGG + +     G T+S +PLRG K   
Sbjct: 261 MVESMDENIGRLNKFLKQSGLDKNTVVIFTSDNGGLSTNKNPKAGPTSS-FPLRGGKAWV 319

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
           +EGG+R  +   W    + G  + + +  TD+ PT+  +A
Sbjct: 320 YEGGIRVPLIIKWPESTQAGSEISSPVISTDFYPTILSMA 359


>gi|313212310|emb|CBY36307.1| unnamed protein product [Oikopleura dioica]
          Length = 661

 Score = 62.0 bits (149), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 53/94 (56%), Gaps = 7/94 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K+LD ++G + ++LE  G L N++V++  DNG   V++    G+      LRG K    
Sbjct: 372 VKALDIAIGEIFEALENAGKLDNTIVIYAGDNGA-AVENFQSSGD------LRGYKGGAF 424

Query: 62  EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
           EGGV+  A L  P + +G   + +   TDWLPTL
Sbjct: 425 EGGVKTFALLSGPGISQGASYKGIFQSTDWLPTL 458


>gi|329956963|ref|ZP_08297531.1| arylsulfatase [Bacteroides clarus YIT 12056]
 gi|328523720|gb|EGF50812.1| arylsulfatase [Bacteroides clarus YIT 12056]
          Length = 460

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 12/101 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
           M+  LD SVG +I  L+E GL  N++VVF SDNG      +H  G        SN P RG
Sbjct: 256 MVTYLDRSVGIIIDKLKELGLYDNTIVVFTSDNG------VHAEGGHDPYFFDSNGPFRG 309

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            K   +EGG+R    + W  ++  G V +++    D+LPT+
Sbjct: 310 YKRDLYEGGIRTPFVIQWPRIIPAGVVTDHVSAFWDFLPTI 350


>gi|410631840|ref|ZP_11342513.1| arylsulfatase B [Glaciecola arctica BSs20135]
 gi|410148741|dbj|GAC19380.1| arylsulfatase B [Glaciecola arctica BSs20135]
          Length = 544

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 57/99 (57%), Gaps = 12/99 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
           ML+SLD SV RV+  LEE+G+  N++V+  SDNGG     +D+L        N P RG K
Sbjct: 321 MLRSLDRSVERVMAKLEEQGIADNTIVILSSDNGGADYVAIDNL--------NKPYRGWK 372

Query: 58  YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            +F EGG+R   +  W   +  G +++  ++  D +PT+
Sbjct: 373 STFFEGGIRVPFSITWPGEIEAGLIIDEPVNHFDLMPTI 411


>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 486

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 41/110 (37%), Positives = 60/110 (54%), Gaps = 3/110 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT-VDSLHFHG-NTASNWPLRGTKY 58
           M K LDE+VGRV++ L+E GL  N++V+F SDNGG   VD +        +N PLR  K 
Sbjct: 260 MNKELDENVGRVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKG 319

Query: 59  SFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
           + +EGG+R  +   W  +   G   +  + +TD L T   I G+    D+
Sbjct: 320 ALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFLHITGQPPATDA 369


>gi|171910116|ref|ZP_02925586.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 480

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 2/101 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L++LD ++GRV+++LEE+ L   ++VVF SDNGG    +    G T SN PLR  K   
Sbjct: 266 LLENLDTNIGRVLKALEEQKLREKTIVVFTSDNGGLCTLAKGRTGPT-SNLPLRCGKGWN 324

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    + W   L+ G V+    +  D  PTL  ++G
Sbjct: 325 YEGGIRTPCYISWPGHLKGGTVIGTPAYTPDLYPTLLELSG 365


>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 453

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 39/96 (40%), Positives = 52/96 (54%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D++VG+V+  L  E L   +LVVFISDNGGP         N ++N PL G K + 
Sbjct: 247 MLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPP-------ANGSTNTPLNGQKATT 299

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGGVR      W   L  G+  E  +   D LPT+
Sbjct: 300 WEGGVRVPFVVSWPGQLAAGKKYEEPVIQLDLLPTI 335


>gi|397643272|gb|EJK75757.1| hypothetical protein THAOC_02512 [Thalassiosira oceanica]
          Length = 870

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 10/110 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D+     +QS+++ G   N+LV+F SDNGG    ++  +G +  N+PLRG+K SF
Sbjct: 343 MMRYIDDLFDGTMQSIKDAGQWENTLVLFTSDNGG----AVFPNGQSNQNYPLRGSKRSF 398

Query: 61  HEGGVRNVAALWSPLL--RKGQVLENLMHI----TDWLPTLYFIAGKEKN 104
            +GG R    L    +  R G+   N  H      DW PT   IAG ++N
Sbjct: 399 FDGGNRVNQFLTGGWIDARLGKDRSNESHTYVFGLDWAPTFVSIAGGDRN 448


>gi|423288214|ref|ZP_17267065.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
           CL02T12C04]
 gi|392671103|gb|EIY64579.1| hypothetical protein HMPREF1069_02108 [Bacteroides ovatus
           CL02T12C04]
          Length = 473

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 35/106 (33%), Positives = 61/106 (57%), Gaps = 15/106 (14%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKY--- 58
           ++++DESVGR+ +++++ G+  N++V++ SD GG             SN+PLRG K    
Sbjct: 274 IEAVDESVGRIRKAIKDRGIEDNTIVIYFSDQGG-----------LYSNYPLRGCKLIDD 322

Query: 59  SFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           +  EGG+R   A++ P + +KG V +  +   D+ PT   IA  +K
Sbjct: 323 TMAEGGIRVPLAIYYPGVTKKGSVCQTPVQALDFFPTFMEIATGKK 368


>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
          Length = 311

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G ++  ++E G+  N+ ++F SDNGG              N P RG K + 
Sbjct: 169 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 220

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG +  +   SPLL    V  N LMH+TDWLPT+  +AG
Sbjct: 221 WEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAG 261


>gi|398810130|ref|ZP_10568960.1| arylsulfatase A family protein [Variovorax sp. CF313]
 gi|398083821|gb|EJL74525.1| arylsulfatase A family protein [Variovorax sp. CF313]
          Length = 438

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G ++ +L++ G+  N+LVVF SDNGG            + NWPL G K   
Sbjct: 234 MIHHMDEGIGWIMAALQKHGMADNTLVVFTSDNGGERF---------SDNWPLVGGKMDL 284

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W  ++ KG     L    DW  T+   AG   + D
Sbjct: 285 TEGGIRVPWIAHWPAVIGKGGESTQLCMTMDWSATMLDAAGVAAHAD 331


>gi|14518340|ref|NP_116823.1| MS135, putative arylsulfatase [Microscilla sp. PRE1]
 gi|14484975|gb|AAK62857.1| MS135, putative arylsulfatase [Microscilla sp. PRE1]
          Length = 249

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +++ + G+++  L+E GL  N+LVVF +DNGGP        G  ASN+PL G K + 
Sbjct: 35  MTLAMNRACGQILDKLKEHGLEENTLVVFSNDNGGPV-------GTMASNYPLSGAKSNN 87

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    +  P ++  G    + + + D LPT   +AG
Sbjct: 88  LEGGIRVPCIMKLPGVIDPGSEYPHPISMLDMLPTFVSVAG 128


>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 474

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 40/103 (38%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LDE +G+V   L  +GL  N+LVVF+SDNGGPT +       T+SN PLRG K   
Sbjct: 283 VLAHLDEDIGKVRAQLRADGLEENTLVVFLSDNGGPTKEL------TSSNLPLRGGKGDL 336

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            +GG+R   A  W   +  G  ++      D   T   +AG E
Sbjct: 337 WDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTATALKLAGAE 379


>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 540

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 61/101 (60%), Gaps = 6/101 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  LD+++G+V+ +L +EG+   ++++F SDNG        + G  A N+PLRG K   
Sbjct: 258 VVDGLDQAIGQVLDTLTKEGIEEETIILFSSDNG-----GAAYAGGGADNFPLRGGKGDT 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R VAA+ W   L  G+  E++M + D  PTL   AG
Sbjct: 313 YEGGIRVVAAMRWKGTLDAGKSFESIMTVMDVFPTLASAAG 353


>gi|320101855|ref|YP_004177446.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319749137|gb|ADV60897.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 493

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 38/104 (36%), Positives = 54/104 (51%), Gaps = 4/104 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP---TVDSLHFHGNTASNWPLRGTK 57
           M+ S+DESVGR++  ++   L   +LV+F SDNGG      + L    +   N PLRG K
Sbjct: 254 MIASVDESVGRIVAQIDRLALDRQTLVIFTSDNGGVGGYAREGLRRSQDVTDNAPLRGGK 313

Query: 58  YSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
              +EGG+R    A W   +  G V +  +   D  PTL  +AG
Sbjct: 314 GMLYEGGIRVPFLARWPGRVPSGVVCDEPIQSIDLFPTLLELAG 357


>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
 gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
          Length = 555

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 55/96 (57%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D SVGR++ +L+EEGL  N++VVF SDNGG       + G    N P RG K + 
Sbjct: 332 MVRAVDRSVGRILATLKEEGLSDNTIVVFSSDNGGAG-----YIGIPDINAPYRGWKLTM 386

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R  +   W   + +   ++      D+LPTL
Sbjct: 387 FEGGIRVPLFMRWPAHIARNTSIDTPAAHIDFLPTL 422


>gi|319794759|ref|YP_004156399.1| sulfatase [Variovorax paradoxus EPS]
 gi|315597222|gb|ADU38288.1| sulfatase [Variovorax paradoxus EPS]
          Length = 439

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G ++ +L++ G+  N+LVVF SDNGG            + NWPL G K   
Sbjct: 235 MIHHMDEGIGWIMAALQKHGMSDNTLVVFTSDNGGERF---------SDNWPLVGGKMDL 285

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    A W  ++ KG     L    DW  T+   AG
Sbjct: 286 TEGGIRVPWIAHWPAVIAKGGESAQLCMTMDWSATMLEAAG 326


>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 58/101 (57%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           MLK LD  VG ++++L++  +  N++++++SDNGG            A+N PLRG K+  
Sbjct: 239 MLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAK-------SMVANNKPLRGVKHDI 291

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           ++GG+R    + W   ++ GQ  ++ +   D LPTL   AG
Sbjct: 292 YDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLDAAG 332


>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 441

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 3/101 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML ++D+++G+V+ SL E     N+L+ FISDNGGPT+       N++ N PLRG+K + 
Sbjct: 236 MLSAMDDAIGKVLASLRETAQDENTLIFFISDNGGPTMPGTTI--NSSINAPLRGSKRTT 293

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R      W   +  G V  + +   D  PT    AG
Sbjct: 294 LEGGIRVPFVVRWPGHVPAGAVYAHPVIQLDIQPTALAAAG 334


>gi|340617020|ref|YP_004735473.1| sulfatase [Zobellia galactanivorans]
 gi|339731817|emb|CAZ95082.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 452

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 41/113 (36%), Positives = 61/113 (53%), Gaps = 13/113 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDN-----GGPTVDSLHFHGNTASNWPLRG 55
           M+  LD  VG+++  L++  L GN+LV+F SDN     GG  VD         SN PLRG
Sbjct: 258 MITRLDSYVGQILDLLKKNNLDGNTLVIFTSDNGPHQEGGWQVDYFD------SNGPLRG 311

Query: 56  TKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            K   +EGG+R    A W   + KG+  + +    D+LPT   ++G++K I +
Sbjct: 312 MKRDLYEGGIRVPFIAKWPGKIEKGET-DQVATFWDFLPTACELSGQDKPIST 363


>gi|333911967|ref|YP_004485699.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
 gi|333742167|gb|AEF87344.1| N-acetylgalactosamine-6-sulfatase [Delftia sp. Cs1-4]
          Length = 442

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G V+ +LE++G+  N+L+VF SDNGG            ++NWP  G K   
Sbjct: 236 MIHHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGERF---------SNNWPFIGQKMDL 286

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R  + A W   +  G V +      DW  T    AG   + D
Sbjct: 287 LEGGIRVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAGVHGHAD 333


>gi|311746014|ref|ZP_07719799.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|126576230|gb|EAZ80508.1| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 489

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 61/106 (57%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++S+D++VG+V+  LE  GL  N+LV+F SDNGG   +   +     +N+PL+  K   
Sbjct: 285 LVESVDQNVGKVLSELERMGLRENTLVIFTSDNGGLIGN---YDNPITNNYPLKSQKGYP 341

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
           +EGG+R    + W   + +G V E  +   DW+PT+    G++  +
Sbjct: 342 YEGGIRIPTIVSWPGKIPQGFVDETPIITMDWIPTILDFMGEDPTL 387


>gi|160895745|ref|YP_001561327.1| sulfatase [Delftia acidovorans SPH-1]
 gi|160361329|gb|ABX32942.1| sulfatase [Delftia acidovorans SPH-1]
          Length = 442

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G V+ +LE++G+  N+L+VF SDNGG            ++NWP  G K   
Sbjct: 236 MIHHMDEGIGWVLDALEKKGMAENTLIVFTSDNGGERF---------SNNWPFIGQKMDL 286

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R  + A W   +  G V +      DW  T    AG   + D
Sbjct: 287 LEGGIRVPLVARWPDRMPAGAVSDTPNLTMDWSATFLAAAGVHGHAD 333


>gi|339503592|ref|YP_004691012.1| arylsulfatase [Roseobacter litoralis Och 149]
 gi|338757585|gb|AEI94049.1| putative arylsulfatase [Roseobacter litoralis Och 149]
          Length = 525

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++K LD++VG+V+  L+E G   N++VVF SDN GP   +    G T    P RG K + 
Sbjct: 280 VMKELDDNVGKVLARLDELGQRDNTIVVFTSDN-GPETMTWPDGGTT----PFRGEKGTT 334

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGG R  A + W   + +G+V   +    DW+PTL   AG   N+
Sbjct: 335 WEGGFRVPAIIRWPGKVPEGRVGNGIFSGMDWMPTLVAAAGGPDNL 380


>gi|264676341|ref|YP_003276247.1| sulfatase [Comamonas testosteroni CNB-2]
 gi|262206853|gb|ACY30951.1| sulfatase [Comamonas testosteroni CNB-2]
          Length = 442

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G ++ +LEE GL  N+L+VF SDNGG            ++ WP  G K   
Sbjct: 236 MIHHMDEGIGWLLDALEENGLSENTLIVFTSDNGGERF---------SNTWPFVGQKMDL 286

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R  + A W   +  G V +      DW  T    AG + + D
Sbjct: 287 LEGGIRVPLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVDAHAD 333


>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 462

 Score = 61.6 bits (148), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 12/101 (11%)

Query: 4   SLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEG 63
            +D+S+G +++ ++E GL  N+LV+F+SDNGG ++         A N+PL+G K  F EG
Sbjct: 272 QMDQSIGTILERIKEMGLEENTLVIFLSDNGGMSL---------ADNYPLKGGKAQFFEG 322

Query: 64  GVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           GVR V  L  W   +  GQ  +  +   +  P +   AG E
Sbjct: 323 GVR-VPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAGLE 362


>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
          Length = 382

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++  +D  +G ++  ++E G+  N+ ++F SDNGG              N P RG K + 
Sbjct: 240 LVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDV--------GEGDNSPHRGGKSTL 291

Query: 61  HEGGVRNVAALWSPLLRKGQVLEN-LMHITDWLPTLYFIAG 100
            EGG +  +   SPLL    V  N LMH+TDWLPT+  +AG
Sbjct: 292 WEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAG 332


>gi|386395695|ref|ZP_10080473.1| arylsulfatase A family protein [Bradyrhizobium sp. WSM1253]
 gi|385736321|gb|EIG56517.1| arylsulfatase A family protein [Bradyrhizobium sp. WSM1253]
          Length = 589

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ LE+ G L N+++VF +DNG   + +    G T      +G K +  
Sbjct: 326 MKQMDDNIGYVLKKLEDMGQLDNTIIVFTTDNGAENI-TFPDGGTTL----FKGGKLTTW 380

Query: 62  EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG+R  +   W   ++ G V   +M   DWLPTL  IAG  K
Sbjct: 381 EGGMRAPMVVRWPGHIKPGTVKSEMMSALDWLPTLVDIAGGPK 423


>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
 gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 489

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 36/106 (33%), Positives = 62/106 (58%), Gaps = 7/106 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD++VG ++++LEE+G++ N+++VF SDNGG  +     HG T  N+PLR  K S 
Sbjct: 267 LVEHLDDAVGAILKALEEKGIIDNTIIVFTSDNGGEIL-----HGIT-DNFPLRDGKGSS 320

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
           +EGG R  +   W  + +   V    +   D+ PT   +AG   ++
Sbjct: 321 YEGGTRVPLLVKWPGVTQVNTVSHERIIGFDYYPTFLSMAGANPDL 366


>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
           variabilis ATCC 29413]
 gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
           ATCC 29413]
          Length = 457

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 9/89 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+KSLD+ VGRV+ +LE  G   N+LV+F SDNGG    +           P RG K S 
Sbjct: 266 MVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERFSNFG---------PFRGQKASL 316

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHIT 89
           +EGG+R  A +  P + +   + N + IT
Sbjct: 317 YEGGIRVPAIIRYPGVTQANQVSNQVIIT 345


>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
          Length = 507

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG ++ +L+++G+  N L V  SDNGGP      F G  A+N+PL+G K S 
Sbjct: 258 MVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNGGPVYP---FGG--ANNYPLKGGKLSD 312

Query: 61  HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
            +GG+R V A  S    P   +GQ  +  +H+ DW  T   IAG
Sbjct: 313 WQGGIR-VNAFVSGGHLPEKMRGQKTDGYIHLADWYGTFCAIAG 355


>gi|291449944|ref|ZP_06589334.1| n-acetylgalactosamine-6-sulfate sulfatase [Streptomyces albus
           J1074]
 gi|291352893|gb|EFE79795.1| n-acetylgalactosamine-6-sulfate sulfatase [Streptomyces albus
           J1074]
          Length = 464

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG+V+++LE  G   N+LV F SDNGG            + NWPL G K S 
Sbjct: 275 MVEDLDRSVGQVLRALERSGQEENTLVFFASDNGGERF---------SYNWPLDGNKGSL 325

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            E G+R  + L W   +   QV +  ++  DW  TL  I G
Sbjct: 326 KERGIRVPSVLRWPARIDPRQVSDVPVYSPDWTATLLDIGG 366


>gi|393786360|ref|ZP_10374496.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
           CL02T12C05]
 gi|392659989|gb|EIY53606.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
           CL02T12C05]
          Length = 467

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 38/109 (34%), Positives = 57/109 (52%), Gaps = 4/109 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD  + RVI  L E+G+L N++++F SDNG    +  H  G   SN P RG K   
Sbjct: 261 MVTRLDRDIQRVIDLLAEKGVLENTIIIFTSDNGTHK-EGGHDPGYFDSNGPFRGIKRDL 319

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
           +EGG+R      W  ++  G V   +    D++PT+  + G +   NID
Sbjct: 320 YEGGIRTPFIVRWPGVIPAGSVSYQISTFWDFMPTMCELIGAKAPANID 368


>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
          Length = 339

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 53/96 (55%), Gaps = 5/96 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   LD++VG +  +L+   +  N++++F SDNGGP    +H     ++N+PLRG K + 
Sbjct: 247 MTSILDDAVGNITDALKRSRIADNTILIFTSDNGGP----VHDENTESNNYPLRGGKNTL 302

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
             GG + V  +    +   G     +MH +DWLP+L
Sbjct: 303 WNGGTQVVGMIAGKGIENPGTDCHGMMHASDWLPSL 338


>gi|149196669|ref|ZP_01873723.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149140349|gb|EDM28748.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 472

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 52/96 (54%), Gaps = 5/96 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++K +D   GR+I+ L+E+ L  N+LV+F SDNG      L +     S+ PLR  K++ 
Sbjct: 269 VIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNG----PWLSYGDEGGSSGPLRDGKFTS 324

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
           +EGGVR     W P L++   V   L    D LPT 
Sbjct: 325 YEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTF 360


>gi|149197139|ref|ZP_01874191.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
 gi|149139685|gb|EDM28086.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
          Length = 493

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 31/100 (31%), Positives = 53/100 (53%), Gaps = 5/100 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD  VG ++  +E++G+  N+ ++F SDNGG       +   +  N PL+G K   
Sbjct: 249 MVEDLDIGVGMILDEVEKQGIKDNTYIIFTSDNGG-----FSYANTSGQNTPLKGGKRWL 303

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P ++ G      +   D+LPT Y + G
Sbjct: 304 YEGGIRVPFVIQGPKIKAGTYCNQPIINWDFLPTFYDLVG 343


>gi|383458406|ref|YP_005372395.1| sulfatase family protein [Corallococcus coralloides DSM 2259]
 gi|380732108|gb|AFE08110.1| sulfatase family protein [Corallococcus coralloides DSM 2259]
          Length = 552

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 56/100 (56%), Gaps = 6/100 (6%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD+ VG ++  LEE G+  N+LVVF +DNG   V+ + +      N P RG K S  
Sbjct: 267 MRELDDIVGVLLDKLEELGIADNTLVVFSTDNG---VEKMGW--PDGGNSPFRGEKGSTW 321

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGGVR      W  ++  G+V+ ++    DW+PT+   AG
Sbjct: 322 EGGVRVPCVVRWPGVVEPGRVINDIFAHEDWMPTMVAAAG 361


>gi|167764178|ref|ZP_02436305.1| hypothetical protein BACSTE_02562 [Bacteroides stercoris ATCC
           43183]
 gi|167698294|gb|EDS14873.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
          Length = 502

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 34/101 (33%), Positives = 59/101 (58%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L+ +D+S+G ++  ++E+G+  N++++F+SDNGG T+      G    N+PL   K S 
Sbjct: 281 LLEGMDKSLGDLMDYVDEKGIADNTVIIFMSDNGGYTI------GRHDKNYPLSEGKGSL 334

Query: 61  HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
            EGG+R    +  P + K   + +  + I D+ PTL  IAG
Sbjct: 335 KEGGIREPMIVCYPHVAKPSTINDTPVIIEDFFPTLLEIAG 375


>gi|336410426|ref|ZP_08590905.1| hypothetical protein HMPREF1018_02922 [Bacteroides sp. 2_1_56FAA]
 gi|335945158|gb|EGN06973.1| hypothetical protein HMPREF1018_02922 [Bacteroides sp. 2_1_56FAA]
          Length = 493

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 56/95 (58%), Gaps = 2/95 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDE++GR++  L   GL   ++VVF SDNGG    +   +  T SN PLR  K   
Sbjct: 270 MVESLDENIGRLLDILHRSGLDERTIVVFTSDNGGMATSNTTRNIPT-SNLPLRAGKGYL 328

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
           +EGG++  A + WS  L+  QV +  +  TD+ PT
Sbjct: 329 YEGGIKVPAIIRWSGHLKGRQVSDTPIIGTDYYPT 363


>gi|116623176|ref|YP_825332.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116226338|gb|ABJ85047.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 560

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V+  LE+ G L N++VVF +DNG  T+ S    G T    P +G K    
Sbjct: 293 MKQMDDNIGVVLAKLEQMGQLDNTIVVFTTDNGAETI-SFPDGGIT----PFKGQKGEAW 347

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG R    + W   ++ G V + L    DWLPT    AG  K
Sbjct: 348 EGGYRAPCVIRWPGHIKPGTVYKELFAALDWLPTFVEFAGGPK 390


>gi|224536571|ref|ZP_03677110.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521827|gb|EEF90932.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 450

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 35/112 (31%), Positives = 60/112 (53%), Gaps = 10/112 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDN----GGPTVDSLH-----FHGNTASNW 51
           ++   DE++G ++  LE   L+ N+++VF+SDN    GGP + +       F        
Sbjct: 241 LIACADENIGILLNYLENRKLIDNTIIVFVSDNGPAKGGPELKTWEGYDPSFEYVFGQMK 300

Query: 52  PLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            LRG K   +EGG+R    + + PLLR+G+V  +++   D+ PT+  + G E
Sbjct: 301 TLRGHKVDLYEGGIRTPMIIAYKPLLREGKVFRDMISTLDFYPTICEMTGSE 352


>gi|254282184|ref|ZP_04957152.1| arylsulfatase A [gamma proteobacterium NOR51-B]
 gi|219678387|gb|EED34736.1| arylsulfatase A [gamma proteobacterium NOR51-B]
          Length = 535

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 59/101 (58%), Gaps = 6/101 (5%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
           D+++ R+ +SL+  G+  N+LV++ISDNG        F+ +T  +W LRG K   +EGGV
Sbjct: 324 DKNMARLFESLDNMGIDENTLVLWISDNGPMN----KFYPSTGFSW-LRGYKSEVYEGGV 378

Query: 66  RNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
           R    A W   +  GQ   +++H++DW  T+  +AG +  I
Sbjct: 379 RTPGIAKWPGSIAPGQTPIDIVHVSDWYTTIANLAGAKAAI 419


>gi|149195594|ref|ZP_01872651.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
 gi|149141056|gb|EDM29452.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
          Length = 481

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 64/111 (57%), Gaps = 5/111 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D++VG V+Q++++ G+  N++++F SDNGG   +  +      SN+PLR  K   
Sbjct: 253 MVEHMDDAVGTVMQAVKDAGIEDNTIIIFNSDNGGLRGNYENNRQKVTSNYPLRSGKGDM 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
           +EGGVR    + WS  ++ GQ   + +   D  PTL  +      K+++ID
Sbjct: 313 YEGGVRVPMIIKWSRKIKAGQTSSSPVISHDIYPTLLDLCKIDVSKKQDID 363


>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
 gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
          Length = 497

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 40/109 (36%), Positives = 61/109 (55%), Gaps = 13/109 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-------ASNWPL 53
           M+  +D+ +G+V+ +LE+ G+  N+L+VF SDNGG  V +  F G +       A N P 
Sbjct: 268 MISVMDDEIGKVVAALEKRGMRDNTLIVFHSDNGG--VVNAFFAGESEVKGKLPADNGPY 325

Query: 54  RGTKYSFHEGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           R  K + +EGG R VA L  W   ++ G + +  MH+ D  PTL  + G
Sbjct: 326 REGKGTTYEGGTR-VAGLVNWPGKIKPGTI-DGPMHVVDMYPTLAGLVG 372


>gi|372209156|ref|ZP_09496958.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 529

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 61/102 (59%), Gaps = 7/102 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++S+D++VGRV+Q L++ GL  N+LVVF SDNGG           + SN PLRG K   
Sbjct: 277 LVESMDDAVGRVLQGLKDLGLDENTLVVFTSDNGGVAAGDAF----STSNLPLRGGKGYQ 332

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHIT--DWLPTLYFIAG 100
           +EGG++    +  P + K QV +  + ++  D+ PTL  + G
Sbjct: 333 YEGGIKEPYFIKVPWM-KSQVKQTNVPVSGVDFYPTLLDLVG 373


>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 562

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 32/97 (32%), Positives = 56/97 (57%), Gaps = 9/97 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D  + ++++ L+   +  N++ +F +DNGG T       GN   NWPLRG K + 
Sbjct: 261 MVYAMDRGIRQLVKHLKRARMWKNTIFIFSTDNGGQTTRG----GN---NWPLRGKKGTL 313

Query: 61  HEGGVRNVAALWSPLLR--KGQVLENLMHITDWLPTL 95
            EGG+R V  +    L+    +V + L+H++DW PT+
Sbjct: 314 WEGGIRGVGFVHGKPLQVTTPRVNKELLHVSDWYPTI 350


>gi|449138581|ref|ZP_21773838.1| arylsulfatase [Rhodopirellula europaea 6C]
 gi|448882843|gb|EMB13400.1| arylsulfatase [Rhodopirellula europaea 6C]
          Length = 459

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 35/100 (35%), Positives = 56/100 (56%), Gaps = 9/100 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L++ D  VG+++ SLEE G    ++VVF SDNGG        H    +N PLRG+K++ +
Sbjct: 243 LETFDHHVGQILDSLEESGQADRTIVVFCSDNGG--------HPEYTANAPLRGSKWNLY 294

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R  +   W  +++    ++  +   D LPT+   AG
Sbjct: 295 EGGIRVPMIVRWPDVVQPQTEIKRPVIGYDLLPTMVDWAG 334


>gi|421611066|ref|ZP_16052221.1| arylsulfatase [Rhodopirellula baltica SH28]
 gi|408498168|gb|EKK02672.1| arylsulfatase [Rhodopirellula baltica SH28]
          Length = 484

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 9/100 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L++ D  VG+++ SLE  G    ++V+F SDNGG        H    +N PLRG+K++ +
Sbjct: 268 LETFDHHVGQILDSLEASGRADRTIVIFYSDNGG--------HPEYTANAPLRGSKWNLY 319

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R  +   W  +++    ++  +   D LPT+  +AG
Sbjct: 320 EGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAG 359


>gi|323450061|gb|EGB05945.1| hypothetical protein AURANDRAFT_66030 [Aureococcus anophagefferens]
          Length = 611

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 13/98 (13%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           +D S+  + Q+L+  G+  ++L+ F SDNGG T       G   SN+PLRG K +  EGG
Sbjct: 312 VDRSLETLFQTLDAAGMTDDTLLFFGSDNGGLT-------GFGGSNYPLRGEKLTLFEGG 364

Query: 65  VRNVAALWS----PLLR--KGQVLENLMHITDWLPTLY 96
           VR  A  W+    P  R   G     ++H++D  PTL+
Sbjct: 365 VRVPAFFWAGASFPRFRATAGAAYGGIVHVSDVAPTLF 402


>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
 gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
          Length = 478

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K++D+ +GRV+ +L+   L  N++VVF SDNGG            +  WP  G K   
Sbjct: 268 MIKAMDDQIGRVLATLDSYHLTENTIVVFTSDNGGERF---------SDTWPFTGMKGEL 318

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A L W   +  G   +  +   DWLP+     G
Sbjct: 319 LEGGLRVPAVLRWPARVAAGSESDQTLITMDWLPSFLAATG 359


>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 484

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 12/111 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D  VG ++ SL++ G   N+L++F+SDNGG T       G+ A+N+PL G K   
Sbjct: 263 MVYAVDRGVGEIVASLKKTGQFDNTLIIFLSDNGGHT-------GHGANNYPLTGRKGDT 315

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA----GKEKNID 106
            EGG R      W   ++KGQ  +  +   D  PT+  I     G +K +D
Sbjct: 316 WEGGFRVPMFFHWPKKIKKGQKFDYPVSALDLYPTIAHIGQAKIGSDKILD 366


>gi|449132692|ref|ZP_21768698.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448888163|gb|EMB18494.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 544

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 33/66 (50%), Positives = 46/66 (69%), Gaps = 3/66 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K++D SVGR++Q+L+E G+  N+LVVF SDNGG  + +L   G T  N PLR  K   
Sbjct: 319 MVKAVDTSVGRIMQALKEHGVDDNTLVVFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 375

Query: 61  HEGGVR 66
           +EGG+R
Sbjct: 376 YEGGIR 381


>gi|346225028|ref|ZP_08846170.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
 gi|346226998|ref|ZP_08848140.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
          Length = 488

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 4/100 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG+++++LEEEGL  N+LVVF SDNG   +     HG TA    LRG K   
Sbjct: 267 VIEEIDWSVGQILKTLEEEGLDENTLVVFTSDNGPWHI--FETHGGTAGL--LRGAKGGT 322

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R     W P   K  V+ ++    D  PT   ++G
Sbjct: 323 FEGGMREPTIFWWPGKLKQGVVMDMATTMDLFPTFCNLSG 362


>gi|283782011|ref|YP_003372766.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283440464|gb|ADB18906.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 516

 Score = 60.8 bits (146), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 40/111 (36%), Positives = 59/111 (53%), Gaps = 10/111 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNW-----PLRG 55
           M+  +D  +GR+++ LE+ GL   ++VVF SDN GP  D L   G T +++      LRG
Sbjct: 283 MITRMDREIGRILERLEKYGLQRRTIVVFSSDN-GPLYDKL---GGTDADFFQSALDLRG 338

Query: 56  TKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            K S +EGG+R    +  P ++  G     L    DW+PTL  +AG    I
Sbjct: 339 RKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDWMPTLLSLAGMSTKI 389


>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
          Length = 659

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/83 (46%), Positives = 49/83 (59%), Gaps = 9/83 (10%)

Query: 24  NSLVVF-ISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGVRNVAALWSPLLRK-GQV 81
           NS   F   DNGG T+      GN   NWPLRG K++  EGG+R V  + SPLL++ G  
Sbjct: 417 NSSSAFPFPDNGGQTLAG----GN---NWPLRGRKWTLWEGGIRGVGFVASPLLKQTGVQ 469

Query: 82  LENLMHITDWLPTLYFIAGKEKN 104
              L+HI+DWLPTL  +AG   N
Sbjct: 470 SRELIHISDWLPTLVNLAGGHTN 492


>gi|374620757|ref|ZP_09693291.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374303984|gb|EHQ58168.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 519

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 54/97 (55%), Gaps = 3/97 (3%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ +D SVG +I +LE+ G+  N+LV F SDNG   + S HF G+T    P +G+K   +
Sbjct: 304 IEEIDWSVGEIIGALEKAGIDDNTLVFFSSDNGPWQLASTHFAGSTG---PFKGSKQEIY 360

Query: 62  EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFI 98
           EGGVR     W P     +V+ ++  + D   T+  I
Sbjct: 361 EGGVRVPGIFWWPGTVAPEVISDMGSVLDLYETIATI 397


>gi|189465112|ref|ZP_03013897.1| hypothetical protein BACINT_01456 [Bacteroides intestinalis DSM
           17393]
 gi|189437386|gb|EDV06371.1| arylsulfatase [Bacteroides intestinalis DSM 17393]
          Length = 467

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR +  L+++GL  N+LV+F SDNG      L +  ++ S  PLRG K+S+
Sbjct: 270 VVEEIDWNVGRFLDYLDQQGLAENTLVIFASDNG----PWLGYKEDSGSADPLRGGKFSY 325

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR    L W   +  G   + ++   D  PT+    G
Sbjct: 326 YEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYVG 366


>gi|423339136|ref|ZP_17316877.1| hypothetical protein HMPREF1059_02802 [Parabacteroides distasonis
           CL09T03C24]
 gi|409231038|gb|EKN23895.1| hypothetical protein HMPREF1059_02802 [Parabacteroides distasonis
           CL09T03C24]
          Length = 515

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
           ++  D SVG+++++L+E GL  N+L++  SDN GP VD      ++   G+     PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPARPLRG 359

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            KYS  EGG R  A + W   +++  V + L+   DW  +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400


>gi|340616998|ref|YP_004735451.1| hypothetical protein zobellia_1003 [Zobellia galactanivorans]
 gi|339731795|emb|CAZ95060.1| Pseudogene of sulfatase [Zobellia galactanivorans]
          Length = 339

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 11/102 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTK-YS 59
           ML S+DES+GR+   L+  G+  N+LV+F SDNGG          N+ SN+PL G K YS
Sbjct: 97  MLVSVDESIGRINTELKRLGIENNTLVIFTSDNGG--------LNNSTSNYPLLGGKSYS 148

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           F EG +R  +   W  +++ G+  +  +   D+ PT   IAG
Sbjct: 149 F-EGAMRVPLIVKWPAMIKPGRKSKKRVIGMDFYPTFLDIAG 189


>gi|150009207|ref|YP_001303950.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|298377006|ref|ZP_06986960.1| arylsulfatase A [Bacteroides sp. 3_1_19]
 gi|149937631|gb|ABR44328.1| putative arylsulfatase precursor [Parabacteroides distasonis ATCC
           8503]
 gi|298265990|gb|EFI07649.1| arylsulfatase A [Bacteroides sp. 3_1_19]
          Length = 515

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
           ++  D SVG+++++L+E GL  N+L++  SDN GP VD      ++   G+     PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            KYS  EGG R  A + W   +++  V + L+   DW  +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400


>gi|262384095|ref|ZP_06077231.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262294993|gb|EEY82925.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 515

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
           ++  D SVG+++++L+E GL  N+L++  SDN GP VD      ++   G+     PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            KYS  EGG R  A + W   +++  V + L+   DW  +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400


>gi|255015814|ref|ZP_05287940.1| putative arylsulfatase precursor [Bacteroides sp. 2_1_7]
 gi|410104728|ref|ZP_11299640.1| hypothetical protein HMPREF0999_03412 [Parabacteroides sp. D25]
 gi|423334479|ref|ZP_17312258.1| hypothetical protein HMPREF1075_03781 [Parabacteroides distasonis
           CL03T12C09]
 gi|409225670|gb|EKN18588.1| hypothetical protein HMPREF1075_03781 [Parabacteroides distasonis
           CL03T12C09]
 gi|409233740|gb|EKN26574.1| hypothetical protein HMPREF0999_03412 [Parabacteroides sp. D25]
          Length = 515

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
           ++  D SVG+++++L+E GL  N+L++  SDN GP VD      ++   G+     PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            KYS  EGG R  A + W   +++  V + L+   DW  +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400


>gi|301311001|ref|ZP_07216930.1| arylsulfatase A [Bacteroides sp. 20_3]
 gi|300831064|gb|EFK61705.1| arylsulfatase A [Bacteroides sp. 20_3]
          Length = 515

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
           ++  D SVG+++++L+E GL  N+L++  SDN GP VD      ++   G+     PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            KYS  EGG R  A + W   +++  V + L+   DW  +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400


>gi|256841763|ref|ZP_05547269.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|256736657|gb|EEU49985.1| conserved hypothetical protein [Parabacteroides sp. D13]
          Length = 515

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVD------SLHFHGNTASNWPLRG 55
           ++  D SVG+++++L+E GL  N+L++  SDN GP VD      ++   G+     PLRG
Sbjct: 301 IEQFDWSVGQLMKTLDEMGLTENTLIILSSDN-GPVVDDGYADRAVELLGDHKPAGPLRG 359

Query: 56  TKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            KYS  EGG R  A + W   +++  V + L+   DW  +L
Sbjct: 360 NKYSAFEGGTRIPAIVHWPKEIKQAAVSDALVSQIDWFASL 400


>gi|149196742|ref|ZP_01873796.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
 gi|149140422|gb|EDM28821.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
          Length = 482

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 37/108 (34%), Positives = 58/108 (53%), Gaps = 20/108 (18%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS-------------LHFHGNTA 48
           ++ LD SVG+ IQ+LE+EGL+ N+L +F SDNGG    S              H++G T 
Sbjct: 264 IQDLDYSVGQFIQALEQEGLMENTLFIFTSDNGGDIPASNKDWPEYQAYDMGFHYNGKT- 322

Query: 49  SNWPLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
                RG K+  +EGG++    + W   ++KGQ  ++L+   D+  T 
Sbjct: 323 -----RGDKHQIYEGGLKVPFIIHWPKKVKKGQESDHLVTTADFFSTF 365


>gi|418528067|ref|ZP_13094017.1| sulfatase [Comamonas testosteroni ATCC 11996]
 gi|371454443|gb|EHN67445.1| sulfatase [Comamonas testosteroni ATCC 11996]
          Length = 442

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 35/107 (32%), Positives = 52/107 (48%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G ++ +LEE+G+  N+L+VF SDNGG            ++ WP  G K   
Sbjct: 236 MIHHMDEGIGWLLDALEEKGMSENTLIVFTSDNGGERF---------SNTWPFVGQKMDL 286

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R  + A W   +  G V +      DW  T    AG   + D
Sbjct: 287 LEGGIRVPLLARWPARMTAGAVCDTPSLTMDWSATFLAAAGVSADAD 333


>gi|357412212|ref|YP_004923948.1| sulfatase [Streptomyces flavogriseus ATCC 33331]
 gi|320009581|gb|ADW04431.1| sulfatase [Streptomyces flavogriseus ATCC 33331]
          Length = 484

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+++L+  G   ++LV F SDNGG            + NWPL G K S 
Sbjct: 295 MVEDLDRSVGEVLRALKRSGQEEDTLVFFASDNGGERF---------SYNWPLSGNKSSL 345

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R  + L W   +   QV +  +   DW  TL  + G   +
Sbjct: 346 QEGGIRVPSVLRWPARIEANQVSDLPVFSPDWTATLLELGGARPD 390


>gi|87309613|ref|ZP_01091747.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87287377|gb|EAQ79277.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 496

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 58/102 (56%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D +VG+V+ +L+   L  N+LV+F SDNGG +       G+  SN PLRG K   
Sbjct: 287 MVEAMDAAVGKVLDALDRLKLTDNTLVIFTSDNGGLSTS----EGHPTSNLPLRGGKGWM 342

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK 101
           +EGG+R    +  P +   G   + L+   D+LPT+  +  K
Sbjct: 343 YEGGIREPLVVRYPGVTSPGSESDALVTSPDFLPTILAVVDK 384


>gi|421609369|ref|ZP_16050565.1| arylsulfatase A [Rhodopirellula baltica SH28]
 gi|408499866|gb|EKK04329.1| arylsulfatase A [Rhodopirellula baltica SH28]
          Length = 491

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 54/101 (53%), Gaps = 2/101 (1%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYSF 60
           + ++D +VGR+++ LEE G   N+L+VF SDNG  T+D       +  S  PLRG K   
Sbjct: 253 VANVDAAVGRLVKGLEEIGKRDNTLIVFTSDNGPETLDRYRSANRSYGSPGPLRGMKLHT 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            E G R    + W   +  GQ L+ ++   D LPT   +AG
Sbjct: 313 TEAGFRVPGIVNWPGEIEPGQTLDTVVSSLDLLPTFCRLAG 353


>gi|384098832|ref|ZP_09999944.1| putative secreted sulfatase ydeN [Imtechella halotolerans K1]
 gi|383834975|gb|EID74406.1| putative secreted sulfatase ydeN [Imtechella halotolerans K1]
          Length = 504

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 63/106 (59%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-NWPLRGTKYS 59
           M++ +D+S+G V+  LE  G+  N++++FISDNGG  + ++   G   S N PL+  K S
Sbjct: 282 MIEGMDKSLGDVMSFLENRGIADNTIILFISDNGG--LSAVGRGGEAHSHNRPLKSGKGS 339

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            +EGG+R  + A W  + +   V+ + + I D+ P++  +A  E N
Sbjct: 340 MYEGGIRVPMLAYWPGVTKPSTVINDYVIIEDFFPSILEMAQIEPN 385


>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 466

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 33/95 (34%), Positives = 55/95 (57%), Gaps = 8/95 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++ ++D+ +GR+++ L++ G L N+L++F++DNG  T        N A N PLRG K S 
Sbjct: 249 LMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT-------NNGADNGPLRGLKGSK 301

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPT 94
            EGG+R    + W   L  G+  + L+   D +PT
Sbjct: 302 WEGGIRVPMIMHWPKQLPAGKDYKGLVSSLDLVPT 336


>gi|149177652|ref|ZP_01856253.1| arylsulfatase [Planctomyces maris DSM 8797]
 gi|148843470|gb|EDL57832.1| arylsulfatase [Planctomyces maris DSM 8797]
          Length = 494

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 40/121 (33%), Positives = 60/121 (49%), Gaps = 19/121 (15%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG----------------GPTVDSLHF- 43
           M+ S+D+++GR++ +L+E G L N+LV+F+SDNG                GP  D +   
Sbjct: 269 MIDSMDQNIGRIMTTLKETGQLDNTLVLFLSDNGGCSEEPGGRNPQERTPGPKDDYVAVG 328

Query: 44  -HGNTASNWPLRGTKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGK 101
                A N P R  K   HEGG+   + A W   +  G +  +  HI D +PT   +AG 
Sbjct: 329 PAWGWAQNSPFRRYKSWVHEGGISTPMIAWWPGKVPAGAINRSPAHIIDLMPTFLEMAGA 388

Query: 102 E 102
           E
Sbjct: 389 E 389


>gi|32475023|ref|NP_868017.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32445563|emb|CAD75564.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 501

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 2/107 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  +GR++  LEE  L   +LV+F SDNG P  +  H      S+ PL+G+K S 
Sbjct: 293 MITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNG-PHHEGGHSDLFFNSSGPLQGSKRSM 351

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           HEGG+R    A W   +  G + ++     D+LPT   +AG E   D
Sbjct: 352 HEGGIRVPFIAKWPGTIEPGTISDHPSAFWDFLPTACELAGAEPPAD 398


>gi|423221012|ref|ZP_17207506.1| hypothetical protein HMPREF1061_04279 [Bacteroides caccae
           CL03T12C61]
 gi|392622251|gb|EIY16383.1| hypothetical protein HMPREF1061_04279 [Bacteroides caccae
           CL03T12C61]
          Length = 520

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 10/104 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-------FHGNTASNWPLRGTK 57
            D  VG ++ +LE+ GL  N+L++  SDN GP VD  +        +G++ +  PLRG K
Sbjct: 307 FDYCVGEILNTLEKLGLRENTLIILSSDN-GPVVDDGYDDKAEELLNGHSPAG-PLRGNK 364

Query: 58  YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           YS  EGG R  A L W   ++KG+  + L+   DWL +L  + G
Sbjct: 365 YSAFEGGTRIPAILSWPKEVKKGKTSDLLVSQVDWLASLASLTG 408


>gi|417301112|ref|ZP_12088282.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327542541|gb|EGF29015.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 484

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 9/100 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L++ D  VG+++ SLE  G    ++V+F SDNGG        H    +N PLRG+K++ +
Sbjct: 268 LETFDHHVGQILDSLEASGRADRTIVIFYSDNGG--------HPEYTANAPLRGSKWNLY 319

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R  +   W  +++    ++  +   D LPT+  +AG
Sbjct: 320 EGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAG 359


>gi|153809423|ref|ZP_01962091.1| hypothetical protein BACCAC_03737 [Bacteroides caccae ATCC 43185]
 gi|149128009|gb|EDM19231.1| arylsulfatase [Bacteroides caccae ATCC 43185]
          Length = 520

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 10/104 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-------FHGNTASNWPLRGTK 57
            D  VG ++ +LE+ GL  N+L++  SDN GP VD  +        +G++ +  PLRG K
Sbjct: 307 FDYCVGEILNTLEKLGLRENTLIILSSDN-GPVVDDGYDDKAEELLNGHSPAG-PLRGNK 364

Query: 58  YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           YS  EGG R  A L W   ++KG+  + L+   DWL +L  + G
Sbjct: 365 YSAFEGGTRIPAILSWPKEVKKGKTSDLLVSQVDWLASLASLTG 408


>gi|260819473|ref|XP_002605061.1| hypothetical protein BRAFLDRAFT_85207 [Branchiostoma floridae]
 gi|229290391|gb|EEN61071.1| hypothetical protein BRAFLDRAFT_85207 [Branchiostoma floridae]
          Length = 289

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D +VGR++ +++  GL  N+L++F +DNGG       +     +N P +G K++ 
Sbjct: 1   MVNAMDLNVGRIVDAMKGAGLWDNTLLIFSTDNGG-------WPDQGGNNDPYKGGKFTL 53

Query: 61  HEGGVRNVAALWSPLL-RKGQVLENLMHITDWLPTLYFIAG 100
            EGG + VA +   +L   G     ++H  DW PT+   AG
Sbjct: 54  WEGGTKGVAFVHGNMLVETGYTNNEMIHAVDWFPTIVAAAG 94


>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 441

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 32/74 (43%), Positives = 45/74 (60%), Gaps = 6/74 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ++K+LD+  G ++Q+L++     N+L+ F+SDNGGPT        N +SNWPLRG K S 
Sbjct: 241 LMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGPTTH------NGSSNWPLRGFKGSE 294

Query: 61  HEGGVRNVAALWSP 74
            EGG R    L  P
Sbjct: 295 FEGGNRTPFLLQWP 308


>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 443

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 56/101 (55%), Gaps = 9/101 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD S G V+ +LE  G   N+LV F+SDNG         HG TA    LRG K+  
Sbjct: 245 VVECLDWSTGEVLDALERTGQAENTLVAFLSDNGPKK-----GHGTTAG---LRGFKHDP 296

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    + +  L+  GQ  + + ++ DW PTL  +AG
Sbjct: 297 YEGGMRVPCIIAAQGLIPAGQTSDEICNVMDWYPTLVNLAG 337


>gi|440716552|ref|ZP_20897057.1| arylsulfatase [Rhodopirellula baltica SWK14]
 gi|436438411|gb|ELP31961.1| arylsulfatase [Rhodopirellula baltica SWK14]
          Length = 459

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 9/100 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L++ D  VG+++ SLE  G    ++V+F SDNGG        H    +N PLRG+K++ +
Sbjct: 243 LETFDHHVGQILDSLEASGRADRTIVIFYSDNGG--------HPEYTANAPLRGSKWNLY 294

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R  +   W  +++    ++  +   D LPT+  +AG
Sbjct: 295 EGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAG 334


>gi|406836223|ref|ZP_11095817.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 456

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 5/103 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  VGR++  L+E  L  N++V F SDNGGP      F  N A    LRG K   
Sbjct: 251 MVTRMDADVGRIVALLDELKLRDNTIVFFCSDNGGPAPFGETFQTNGA----LRGKKGQV 306

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R  + A W   +  G++ +   +  D LPTL  + G E
Sbjct: 307 YEGGLRVPMIARWPGHVPAGKISDVPWYFADVLPTLAELGGGE 349


>gi|338532447|ref|YP_004665781.1| sulfatase family protein [Myxococcus fulvus HW-1]
 gi|337258543|gb|AEI64703.1| sulfatase family protein [Myxococcus fulvus HW-1]
          Length = 554

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 8/101 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD+ VG ++  L+E G+  N+LVVF +DNG   V+ + +      N P RG K S  
Sbjct: 269 MRELDDIVGALLAKLDELGIADNTLVVFSTDNG---VEKMGW--PDGGNCPFRGEKGSTW 323

Query: 62  EGGVRNVAAL--WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGGVR V  L  W  ++  G+V+ ++    DW+PTL   AG
Sbjct: 324 EGGVR-VPCLVRWPGVVEAGRVINDIFAHEDWMPTLVSAAG 363


>gi|427385168|ref|ZP_18881673.1| hypothetical protein HMPREF9447_02706 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727336|gb|EKU90196.1| hypothetical protein HMPREF9447_02706 [Bacteroides oleiciplenus YIT
           12058]
          Length = 455

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 58/101 (57%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR +  L+++GL  ++LV+F SDNG      L +  ++ S  PLRG K+S+
Sbjct: 258 VVEEIDWNVGRFLDYLDQQGLAESTLVIFASDNG----PWLGYKEDSGSADPLRGGKFSY 313

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR    L W   +  G   + ++   D  PT+   AG
Sbjct: 314 YEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYAG 354


>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 453

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+ +G+VIQ+L  +GL  N+L+VF+SDNGG   D        A+N PLR  K   
Sbjct: 249 LIEHMDDGIGKVIQTLRAKGLYENTLIVFVSDNGGKLFDG-------ATNGPLRSGKGHM 301

Query: 61  HEGGVRNVAA-LWSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R  A  +W   +      +  + + D  PTL
Sbjct: 302 YEGGIRIPACVVWPGKVAAQSQSQQPLLLMDIFPTL 337


>gi|196228589|ref|ZP_03127455.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196226870|gb|EDY21374.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 631

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS--NWPLRGTKY 58
           M++SLD+ VGR+++ ++E GL   ++ +F SDNGG  V  L    NT S  N P R  K 
Sbjct: 236 MIESLDDCVGRIMKKVDELGLTEKTIFIFTSDNGGLHVYEL---PNTPSTYNAPFRAGKG 292

Query: 59  SFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
              EGG+R  +   W   ++ G   E  + + D++PTL   AG
Sbjct: 293 YLEEGGLREPLIVRWPGKIKAGATNETPVVLYDFMPTLMTAAG 335


>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 465

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 55/101 (54%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LD + G+V+  L+E G+  N++VVF +DNGGP+        N ++N PL GTK + 
Sbjct: 245 MTLALDRACGKVLNKLKELGIDRNTIVVFSNDNGGPS------DKNASNNKPLSGTKSNH 298

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + W   ++K       +   D LPT Y  AG
Sbjct: 299 LEGGIRVPFLMSWPKEIKKKSRFNYPVVTFDLLPTFYAAAG 339


>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
 gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
          Length = 324

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 3/60 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+SVG+ I++L + G+L NS+V+  SDNG PT   L  H N+ SN P RG    F
Sbjct: 255 MVSSLDKSVGQTIRALSDRGMLNNSIVLLYSDNGAPT---LGIHSNSGSNHPFRGVSTYF 311


>gi|167646636|ref|YP_001684299.1| sulfatase [Caulobacter sp. K31]
 gi|167349066|gb|ABZ71801.1| sulfatase [Caulobacter sp. K31]
          Length = 559

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 46/67 (68%), Gaps = 5/67 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD +VGR++Q+L+EEGL  N+LV+F SDNGG      ++ G    N P RG K +F
Sbjct: 337 MVRNLDRNVGRLLQALKEEGLDQNTLVIFTSDNGGA-----NYIGLPDINRPYRGWKATF 391

Query: 61  HEGGVRN 67
            EGG+ +
Sbjct: 392 FEGGIHS 398


>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 474

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/111 (35%), Positives = 57/111 (51%), Gaps = 13/111 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  +G+VIQ L+E G+  N+L+VF SDNGG      H HG  AS+ P RG K   
Sbjct: 249 MVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGG------HSHG--ASSAPYRGHKGML 300

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE----KNID 106
            EGG+R    + W   ++  +  +  +   D  PT+      +    KNID
Sbjct: 301 FEGGIREPFVISWPNKIKGNRTYKEPIIALDIFPTILAATNTQVPASKNID 351


>gi|295690083|ref|YP_003593776.1| sulfatase [Caulobacter segnis ATCC 21756]
 gi|295431986|gb|ADG11158.1| sulfatase [Caulobacter segnis ATCC 21756]
          Length = 551

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 45/66 (68%), Gaps = 5/66 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD +VGRV+Q+L++ GL  N+LV+F SDNGG      ++ G    N P RG K +F
Sbjct: 329 MVRNLDRNVGRVLQALKDRGLDDNTLVIFTSDNGGA-----NYVGLPDLNRPYRGWKATF 383

Query: 61  HEGGVR 66
            EGG++
Sbjct: 384 FEGGIK 389


>gi|374373039|ref|ZP_09630700.1| N-acetylgalactosamine-6-sulfatase [Niabella soli DSM 19437]
 gi|373235115|gb|EHP54907.1| N-acetylgalactosamine-6-sulfatase [Niabella soli DSM 19437]
          Length = 469

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 58/107 (54%), Gaps = 9/107 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D  VG ++++L++EG+  N++V F SDNGG             +N  LR  K   
Sbjct: 261 MIERIDNGVGLIMETLKKEGIDKNTIVFFFSDNGGVW--------KLGNNGILRAGKSWL 312

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           +EGGVR N+ A W  ++R   V+   +  TD+ PT   +A  +K  D
Sbjct: 313 YEGGVRENLLARWPGVIRPNTVVNTRVMATDFYPTFLELAKTKKKKD 359


>gi|334145289|ref|YP_004538499.1| sulfatase [Novosphingobium sp. PP1Y]
 gi|333937173|emb|CCA90532.1| sulfatase [Novosphingobium sp. PP1Y]
          Length = 461

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 10/96 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  +GR+++ L+  GL  ++++VF SDNGG            +  WP  G K   
Sbjct: 246 MVTRMDMEIGRILERLDRLGLAEDTIIVFTSDNGGE---------RYSKTWPYNGLKTEL 296

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R  A + W   +  GQ  E ++   DW+PT 
Sbjct: 297 LEGGLRVPAVMRWPRRIAAGQESEQVIMSMDWMPTF 332


>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 550

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/124 (31%), Positives = 55/124 (44%), Gaps = 24/124 (19%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA------------ 48
           ML   D+ VG+V+  L+E     N+L++FISDNG P  D  H   N A            
Sbjct: 303 MLDRADQGVGKVLAKLKELKKDDNTLIIFISDNGAPAEDMAHRDKNHAGRNTGPVGTAGS 362

Query: 49  -----------SNWPLRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLY 96
                      SN P R  K   +EGG+ +    W P  ++ G + +   H+ D  PT Y
Sbjct: 363 FESQGKPWSYVSNTPFRSFKSFAYEGGISSPFIAWFPGKIKAGSIAKGTAHLIDLAPTFY 422

Query: 97  FIAG 100
            +AG
Sbjct: 423 ELAG 426


>gi|323454643|gb|EGB10513.1| hypothetical protein AURANDRAFT_62515 [Aureococcus anophagefferens]
          Length = 981

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 10/103 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           LK++D  VG V ++L++       ++V  SDNGG T  + H +GN   N+P RG K+   
Sbjct: 323 LKAVDRGVGAVRRALDDRAR--EYVLVVHSDNGGNTCGT-HCNGN---NYPRRGAKFFEF 376

Query: 62  EGGVRNVAALWSPLL----RKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R    ++SP L    R+G     LMH  DWL T   + G
Sbjct: 377 EGGLRVPGVVYSPTLIPARRRGAAYAGLMHHVDWLATFVALGG 419


>gi|150008689|ref|YP_001303432.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937113|gb|ABR43810.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
          Length = 452

 Score = 60.1 bits (144), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 37/99 (37%), Positives = 54/99 (54%), Gaps = 5/99 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD SVG+++Q+L+ EGL  N+LV+F SDNG      L       S  PL+  K S  
Sbjct: 246 VEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG----PWLLCKQEGGSPGPLKDGKASMF 301

Query: 62  EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG R    +W  +++ G +  ++    D LPT   IAG
Sbjct: 302 EGGFRVPCIMWGAMVKPGYI-TDMASTLDLLPTFCEIAG 339


>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 484

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LD + G V+  L+E GL  N++VVF +DNGGPT        N + N PL GTK + 
Sbjct: 248 MTLALDRACGTVLNKLKELGLDKNTIVVFSNDNGGPT------DKNASLNLPLSGTKSNH 301

Query: 61  HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
            EGG+R    +  P   K + + N  +   D LPT Y   G
Sbjct: 302 LEGGIRVPFLISWPKQIKSKTVYNFPVSTLDLLPTFYAAGG 342


>gi|256841279|ref|ZP_05546786.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|262383570|ref|ZP_06076706.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|423330544|ref|ZP_17308328.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
           CL03T12C09]
 gi|256737122|gb|EEU50449.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|262294468|gb|EEY82400.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|409232160|gb|EKN25008.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
           CL03T12C09]
          Length = 452

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/99 (37%), Positives = 54/99 (54%), Gaps = 5/99 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD SVG+++Q+L+ EGL  N+LV+F SDNG      L       S  PL+  K S  
Sbjct: 246 VEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG----PWLLCKQEGGSPGPLKDGKASMF 301

Query: 62  EGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG R    +W  +++ G +  ++    D LPT   IAG
Sbjct: 302 EGGFRVPCIMWGAMVKPGYI-TDMASTLDLLPTFCEIAG 339


>gi|255533274|ref|YP_003093646.1| sulfatase [Pedobacter heparinus DSM 2366]
 gi|255346258|gb|ACU05584.1| sulfatase [Pedobacter heparinus DSM 2366]
          Length = 513

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 34/103 (33%), Positives = 64/103 (62%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS--NWPLRGTKY 58
           +++ +D+S+G ++  L+++G+  N++++F+SDNGG  +D  H  G   +  N+PLR  K 
Sbjct: 284 LVEGMDKSLGDIMDYLKKKGVDKNTIIIFMSDNGG--LDH-HQRGGALNTHNYPLRSGKG 340

Query: 59  SFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           S +EGG+R  +   W  +   G V +N + I D+ P++  +AG
Sbjct: 341 SVYEGGIREPMIVRWPGVTSAGSVYKNPVIIEDFFPSILEMAG 383


>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
 gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
          Length = 457

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 9/66 (13%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D SVGR++ +L+E  L  N+L VF+SDNGG  + S+          PLRG+K S 
Sbjct: 259 MMETVDNSVGRLMATLDELNLADNTLFVFMSDNGGERITSMA---------PLRGSKGSL 309

Query: 61  HEGGVR 66
           +EGGVR
Sbjct: 310 YEGGVR 315


>gi|90408762|ref|ZP_01216908.1| arylsulfatase [Psychromonas sp. CNPT3]
 gi|90310142|gb|EAS38281.1| arylsulfatase [Psychromonas sp. CNPT3]
          Length = 507

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 56/96 (58%), Gaps = 6/96 (6%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
           DE VGR+++++++ G+  N++V++ +DN GP + +    G T    P RG K +  EG  
Sbjct: 272 DEQVGRILKTIKDLGIEDNTIVIYTTDN-GPMIATWPDAGMT----PFRGEKNTGWEGSF 326

Query: 66  RNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           R  A + W   + KG+ L  +M + DWLPTL   AG
Sbjct: 327 RVPAMVKWPNHIPKGKNLNGIMSLEDWLPTLLAAAG 362


>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 491

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 53/101 (52%), Gaps = 2/101 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D  +G V+ +LE+ G L N+L+ F+SDNGG    +   + + A N P +  K SF
Sbjct: 258 MIDEMDAGIGLVVDALEQSGKLDNTLIFFLSDNGGCVPQAWDSYSDFADNGPFKDGKTSF 317

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG      A W P   K Q  +N++   D   T   +AG
Sbjct: 318 YEGGTHVPFIAHW-PAKLKPQTYDNIISSLDLAATAVALAG 357


>gi|329956451|ref|ZP_08297048.1| putative cerebroside-sulfatase [Bacteroides clarus YIT 12056]
 gi|328524348|gb|EGF51418.1| putative cerebroside-sulfatase [Bacteroides clarus YIT 12056]
          Length = 457

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 36/100 (36%), Positives = 55/100 (55%), Gaps = 5/100 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ +D +VGR +  L+ +GL  N+LV+F SDNG      L F     S  PLRG K+S++
Sbjct: 268 VEEIDWNVGRFLDYLDTKGLAENTLVIFASDNG----PWLSFKEEGGSADPLRGGKFSYY 323

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGGVR    + W  ++  G   + ++   D  PT+   AG
Sbjct: 324 EGGVRVPCIIRWKGVVPAGVTSDAIIASIDLFPTIMHYAG 363


>gi|442317872|ref|YP_007357893.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
 gi|441485514|gb|AGC42209.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
          Length = 559

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 6/100 (6%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +  LD+ VG ++  L+E G+  N++VVF +DNG   V+ + +      N P RG K S  
Sbjct: 275 MTELDDIVGTLLAKLDELGIADNTIVVFSTDNG---VEKMSW--PDGGNAPFRGEKGSTW 329

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGGVR    + W  ++  G V+ +L    DW+PTL   AG
Sbjct: 330 EGGVRVPCVVRWPGVIEPGTVINDLFAHEDWMPTLVAAAG 369


>gi|332662429|ref|YP_004445217.1| Steryl-sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332331243|gb|AEE48344.1| Steryl-sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 506

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 5/96 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML+++D +VGR++  L   GL  N++V+F SDNGG +       G+  +N PLR  K   
Sbjct: 285 MLENMDYNVGRILDQLAASGLTENTIVLFTSDNGGLSTA----EGSPTTNDPLRAGKGWL 340

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R  V   W   +  GQ+ +  ++  D+ PT 
Sbjct: 341 YEGGIRVPVIMRWPAKIVAGQISDVPVNTIDFYPTF 376


>gi|254430517|ref|ZP_05044220.1| sulfatase [Cyanobium sp. PCC 7001]
 gi|197624970|gb|EDY37529.1| sulfatase [Cyanobium sp. PCC 7001]
          Length = 601

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ L++ G L N+L+VF +DNG   + S    G T    P +G K   +
Sbjct: 337 MKQMDDNIGYVLKKLDDMGQLDNTLIVFTTDNGAEKI-SFPDGGVT----PFKGQKGEAY 391

Query: 62  EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG R  +   W   +  G V + L    DW+PTL  I G  K
Sbjct: 392 EGGYRAPMVVRWPGHIEPGTVKDQLFAALDWVPTLVDIGGGAK 434


>gi|424864690|ref|ZP_18288593.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
 gi|400759436|gb|EJP73618.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
          Length = 549

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD SVG++++ L++  + G +L++F SDNGG     L        N P RG K SF
Sbjct: 329 MINALDRSVGKIVEKLKDLNIYGKTLIIFTSDNGGANYIEL-----DDINKPYRGWKISF 383

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+R      W   +  G+  +  +H  D  PT+   AG E
Sbjct: 384 FEGGIRVPYIVSWPSEISPGKKSDIAVHHFDIFPTIAKAAGIE 426


>gi|149197500|ref|ZP_01874551.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139518|gb|EDM27920.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 472

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 56/101 (55%), Gaps = 2/101 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
           M K LD++VGR+++ ++E G+  N++++F SDNGG    S   +G+   SN+P R  K S
Sbjct: 249 MTKHLDDTVGRLVKKVDELGIAENTIIIFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSS 308

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
            +EGG R  +   W  +     +   ++   D  PTL  IA
Sbjct: 309 HYEGGSRVPLIFHWPGVTEANSLSHEVVSGIDIYPTLLKIA 349


>gi|430746124|ref|YP_007205253.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430017844|gb|AGA29558.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 478

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 5/107 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++++D SVGR++ +LEE+GL   +LVVF SDNG      L +  +  S  P R  K + 
Sbjct: 249 VIEAIDGSVGRILATLEEQGLDQKTLVVFTSDNG----PWLSYGNHAGSAGPFREGKGTS 304

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W   +  G +        D LPT   +AG +++ D
Sbjct: 305 FEGGIREPFVARWPGKIPAGSICREPAMTIDLLPTFAELAGADRSGD 351


>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 467

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 7/96 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D ++G V++ LE +GL  N+L+VF +DNGGP+        N + N PL GTK + 
Sbjct: 252 MTLAMDRAIGTVLKELETQGLSNNTLIVFTNDNGGPS------DANDSDNSPLSGTKANH 305

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R    + W   L+K +     +   D  PT 
Sbjct: 306 LEGGIRVPFLMSWPKQLQKNKNYNYPISTLDLFPTF 341


>gi|365860802|ref|ZP_09400595.1| putative sulfatase [Streptomyces sp. W007]
 gi|364009756|gb|EHM30703.1| putative sulfatase [Streptomyces sp. W007]
          Length = 435

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+++L+  G   ++LV F SDNGG            + NWPL G K S 
Sbjct: 246 MVEDLDRSVGEVLKALKRSGQEKDTLVFFASDNGGERF---------SYNWPLAGNKGSL 296

Query: 61  HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
            EGG+R  + L  P   +G  + +L +   DW  TL  + G
Sbjct: 297 QEGGIRVPSVLRWPARVEGHQVSDLPVFSPDWTATLLELGG 337


>gi|340619116|ref|YP_004737569.1| sulfatase [Zobellia galactanivorans]
 gi|339733913|emb|CAZ97290.1| Sulfatase, family S1-17 [Zobellia galactanivorans]
          Length = 596

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 64/113 (56%), Gaps = 9/113 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH--FHGNTASNWPLRGTKY 58
           M+ ++D +VGRV++ L++ G+  N+LV+F+SDNG     + +  + G   SN  LRGTK 
Sbjct: 254 MIDNIDANVGRVLKRLKDLGIADNTLVIFLSDNGPRHRRTKNDVYPGRWVSN--LRGTKT 311

Query: 59  SFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFI----AGKEKNID 106
           S +E G+R      W   L+KGQ    +  + D LPT+  +    A KE  +D
Sbjct: 312 SVYECGIRVPFFVKWPTQLKKGQKSSTMGAVIDILPTILDVCDIKAPKEVKLD 364


>gi|410907337|ref|XP_003967148.1| PREDICTED: arylsulfatase A-like [Takifugu rubripes]
          Length = 501

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 6/102 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L  LD ++G ++++LE  G+L N+L++F SDN GP +  +   GN     PLR  K + +
Sbjct: 253 LLELDNTIGSLLETLEGTGVLNNTLILFTSDN-GPELMRMSRGGNAG---PLRCGKGTTY 308

Query: 62  EGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           EGG+R  A A W  +++ G V   +    D LPT+  +AG +
Sbjct: 309 EGGMREPAIAYWRGIIQPG-VTHEMASTLDILPTIASLAGAK 349


>gi|348029892|ref|YP_004872578.1| sulfatase [Glaciecola nitratireducens FR1064]
 gi|347947235|gb|AEP30585.1| sulfatase [Glaciecola nitratireducens FR1064]
          Length = 617

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 56/97 (57%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSL-HFHGNTASNWPLRGTKYS 59
           M+K  D++VG + ++L E  +  N++++F SDNGG   D L   H    SN+PLRG K +
Sbjct: 155 MVKQFDDAVGILWEALVEAQVEDNTIIIFTSDNGGNMYDVLGTIHAT--SNFPLRGGKAT 212

Query: 60  FHEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTL 95
            ++GG+R   A+  P L +   L N  +   D+ PTL
Sbjct: 213 NYDGGLRVPTAIIWPGLTQPNTLSNAPIQTVDYFPTL 249


>gi|386820648|ref|ZP_10107864.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386425754|gb|EIJ39584.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 512

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 10/108 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS----LHFHGNTASNWPLRGT 56
           M++ +D+S+G ++  L+E  L  N++++F+SDNGG +       LH H     N PL   
Sbjct: 288 MVEGMDKSLGDIMNYLDENDLAENTIILFMSDNGGLSAHKRGGELHTH-----NKPLASG 342

Query: 57  KYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           K S +EGG+R  +   W  +++   V    + I D+ PT+  +AG EK
Sbjct: 343 KGSMYEGGIREPMLVKWPRVIKPESVTNEYLIIEDFFPTILEMAGIEK 390


>gi|374369744|ref|ZP_09627765.1| sulfatase [Cupriavidus basilensis OR16]
 gi|373098668|gb|EHP39768.1| sulfatase [Cupriavidus basilensis OR16]
          Length = 433

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G ++Q+L + G L N+L+VF SDNGG            + NWPL G K   
Sbjct: 229 MISHMDEGIGWLVQALRQLGQLDNTLIVFTSDNGGERF---------SDNWPLVGGKMDL 279

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R    A W  ++  G          DW  TL   AG + +
Sbjct: 280 TEGGIRVPWIAHWPAMIAPGGTSAQHCMTMDWSATLLDAAGVQAD 324


>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
 gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
          Length = 476

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 40/114 (35%), Positives = 58/114 (50%), Gaps = 14/114 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
           M++ +D  VG V+Q LE  GL  N++++F SDNG       H  G        SN   RG
Sbjct: 264 MVERIDRYVGEVLQKLESNGLSENTVIIFTSDNGA------HREGGADPEFFDSNGKFRG 317

Query: 56  TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
            K   +EGGVR  + A W   ++ G   +++    D LPT   I G++  K+ID
Sbjct: 318 FKRDLYEGGVRAPMIAWWPNKIKAGSTTDHVSAFWDLLPTFAAIGGEKITKSID 371


>gi|443734458|gb|ELU18440.1| hypothetical protein CAPTEDRAFT_208445 [Capitella teleta]
          Length = 240

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 8/76 (10%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  VG++++ L E G++ N+ V+F+SDNG    + L        N P RG K S  EGG
Sbjct: 168 LDAMVGKIVEKLIENGMMNNTYVIFVSDNGADPPEGL--------NTPFRGGKSSLFEGG 219

Query: 65  VRNVAALWSPLLRKGQ 80
            ++ + ++SPLL+K +
Sbjct: 220 TKSNSFIYSPLLKKTE 235


>gi|87301451|ref|ZP_01084291.1| putative arylsulfatase protein [Synechococcus sp. WH 5701]
 gi|87283668|gb|EAQ75622.1| putative arylsulfatase protein [Synechococcus sp. WH 5701]
          Length = 374

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ L++ G L N+L+VF +DNG   + S    G T    P +G K   +
Sbjct: 110 MKQMDDNIGYVLKKLDDMGQLDNTLIVFTTDNGAEKI-SFPDGGVT----PFKGQKGEAY 164

Query: 62  EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           EGG R  +   W   +  G V + L    DW+PTL  I G  K
Sbjct: 165 EGGYRAPMVVRWPGHIEPGTVKDQLFAALDWVPTLVDIGGGAK 207


>gi|149173445|ref|ZP_01852075.1| putative arylsulfatase [Planctomyces maris DSM 8797]
 gi|148847627|gb|EDL61960.1| putative arylsulfatase [Planctomyces maris DSM 8797]
          Length = 459

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 1/101 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD+ VG ++  LE+ G+  N+L++F SDNGG              N PLRG K S 
Sbjct: 245 MVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDFFNGNAPLRGHKGSM 304

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R    A W   +  G+  +  +   D LPTL  +AG
Sbjct: 305 YEGGIRVPFIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAG 345


>gi|449136003|ref|ZP_21771428.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448885345|gb|EMB15791.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 480

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 40/111 (36%), Positives = 59/111 (53%), Gaps = 11/111 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNTASNWP 52
           +++ +D +VGR++ +L E  L  N+ V+F SDNG   V +           HG +A   P
Sbjct: 217 VIEEIDFNVGRILDTLSELKLADNTYVLFTSDNGPWLVKNKGHVDGHRPRDHGGSAG--P 274

Query: 53  LRGTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
           LR  K S  EGGVR  A LW P  +  G V +++    D +PTL  +AG +
Sbjct: 275 LRSGKVSSFEGGVRVPAILWGPGKVPAGTVCDSIATTMDVMPTLAALAGAD 325


>gi|255035424|ref|YP_003086045.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254948180|gb|ACT92880.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 457

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA----SNWPLRGTKYSF 60
           LDE+VG++ + L E GL  N++V F SDN GP  +        A    SN  LRG K   
Sbjct: 259 LDENVGKIKKYLTESGLAKNTIVFFCSDN-GPRSEGTPQQTAIAEFFDSNGRLRGYKRDM 317

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R    +W+P +++ G V     +  D +PT   IAG +
Sbjct: 318 YEGGIRVPMVVWAPGIVKPGSVSSEPAYFADIMPTFADIAGSK 360


>gi|325110465|ref|YP_004271533.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970733|gb|ADY61511.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 535

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 19/110 (17%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNS--------LVVFISDNGGPTVDSLHFHGNTASNWPL 53
           + S+DE++GR+   L +    G+S        L++F SDNGG            ASN PL
Sbjct: 303 ISSVDENLGRLRSLLNDPNQDGDSSDSISEQTLILFTSDNGG----------THASNAPL 352

Query: 54  RGTKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           RG K  F+EGG+R  + A W   ++   V   L+H  D+ PT   +AG +
Sbjct: 353 RGEKGEFYEGGIRVPLIACWPGTIQADSVNNRLVHCVDYYPTFLELAGNQ 402


>gi|449136701|ref|ZP_21772070.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448884660|gb|EMB15143.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 890

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 59/97 (60%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA-SNWPLRGTKYS 59
           M++S+D+++G ++ +L+   +   +++VF SDNGG   + +   G TA SN PLRG K +
Sbjct: 505 MIESMDDAIGTLLDTLDRLEIADETIIVFASDNGGNMYNEV--DGTTATSNAPLRGGKAT 562

Query: 60  FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTL 95
            +EGGVR  A +  P ++  G   + ++   D+ PTL
Sbjct: 563 MYEGGVRGPAIVVQPGVVEAGSRSDAIIQSIDFYPTL 599


>gi|149195948|ref|ZP_01873004.1| arylsulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140795|gb|EDM29192.1| arylsulfatase [Lentisphaera araneosa HTCC2155]
          Length = 510

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 36/119 (30%), Positives = 62/119 (52%), Gaps = 19/119 (15%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG----PTVDS--LHFHG--------- 45
           M+ S+D+++GR++++L++ G+  N+LV+FISDNGG    P  D+   H  G         
Sbjct: 292 MVDSMDQNIGRILKTLDQTGVSENTLVLFISDNGGCAEMPGGDNNISHIPGPKEFYTACG 351

Query: 46  ---NTASNWPLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
                A N P +  K + HEGG+     + W   ++     +++ H+ D  PT   +AG
Sbjct: 352 PSWANAQNTPFKRYKVNMHEGGICTPCVVRWPAKIKANSWTDSIAHVIDLQPTCMALAG 410


>gi|149176837|ref|ZP_01855447.1| arylsulfatase (A or B) [Planctomyces maris DSM 8797]
 gi|148844274|gb|EDL58627.1| arylsulfatase (A or B) [Planctomyces maris DSM 8797]
          Length = 476

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 9/99 (9%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +++LD  VG+V+ +L+++GL  ++LVVF SDNGG        H   A N PLRG+K++ +
Sbjct: 256 IETLDAYVGQVLTALDQQGLRESTLVVFTSDNGG--------HPEYADNAPLRGSKWNLY 307

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
           E G+R  +   W   ++ G      +   D  PT   +A
Sbjct: 308 EAGIRVPLLVRWPQHVQAGSGCTKPVSGIDLFPTFCDVA 346


>gi|119477026|ref|ZP_01617307.1| sulfatase family protein [marine gamma proteobacterium HTCC2143]
 gi|119449833|gb|EAW31070.1| sulfatase family protein [marine gamma proteobacterium HTCC2143]
          Length = 551

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/96 (38%), Positives = 51/96 (53%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD SVGRV+QSLE+ G+  N++VVF SDNGG    +L        N P RG K + 
Sbjct: 329 MIRALDRSVGRVLQSLEDNGIADNTIVVFTSDNGGAGYIAL-----PDINKPYRGWKLTQ 383

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG      A W   +  G   ++ +H  D   T 
Sbjct: 384 FEGGTHVPFMAKWPAKIAAGTTFDSPIHHIDLFHTF 419


>gi|423294186|ref|ZP_17272313.1| hypothetical protein HMPREF1070_00978 [Bacteroides ovatus
           CL03T12C18]
 gi|392676443|gb|EIY69879.1| hypothetical protein HMPREF1070_00978 [Bacteroides ovatus
           CL03T12C18]
          Length = 450

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 41/105 (39%), Positives = 62/105 (59%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR+I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRLIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|343086211|ref|YP_004775506.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354745|gb|AEL27275.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 471

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 60/105 (57%), Gaps = 9/105 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT----ASNWPLRGT 56
           M++SLD++VGR+++ LE EGL  N+L+ F SDNG      L++ G+T     SN  LRG 
Sbjct: 272 MIESLDKNVGRLVEYLEAEGLDKNTLIFFTSDNG----QYLNYDGSTWPKVGSNGVLRGE 327

Query: 57  KYSFHEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           K   +EGG R  + A W   + +  + +  +   D LPTL  I G
Sbjct: 328 KGDLYEGGHRVPSIAYWPEKIPENVISDAPLMSMDILPTLLEIIG 372


>gi|332665500|ref|YP_004448288.1| Cerebroside-sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334314|gb|AEE51415.1| Cerebroside-sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 464

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 36/97 (37%), Positives = 52/97 (53%), Gaps = 7/97 (7%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  +G+++  L++EGL   +LVVF+SDNG    D+    G       LRG K   +EGG
Sbjct: 268 LDREIGKLLAQLKKEGLDKKTLVVFVSDNGPKQRDAFESTGG------LRGKKRDMYEGG 321

Query: 65  VR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +R    A W   ++ G V E    + D+LPT   +AG
Sbjct: 322 IRVPFIARWPGKIKAGTVNETPTALWDFLPTACALAG 358


>gi|302554271|ref|ZP_07306613.1| arylsulfatase [Streptomyces viridochromogenes DSM 40736]
 gi|302471889|gb|EFL34982.1| arylsulfatase [Streptomyces viridochromogenes DSM 40736]
          Length = 474

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD +VG V+ +L   G   +++V+F SDNGG     L         WPL G K+S 
Sbjct: 287 MVESLDAAVGEVLAALRRSGQEDDTVVLFASDNGGERWSYL---------WPLSGGKFSL 337

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTLYFIAGKEKN 104
            EGG+R  A L  P    G+ + +  H + DW  TL  +AG   +
Sbjct: 338 QEGGIRVPAILRWPDRIDGRQVSHEPHFSPDWTATLLELAGARHD 382


>gi|262406944|ref|ZP_06083493.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294805693|ref|ZP_06764573.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|345507695|ref|ZP_08787342.1| hypothetical protein BSAG_05002 [Bacteroides sp. D1]
 gi|262355647|gb|EEZ04738.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294447109|gb|EFG15696.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|345455385|gb|EGX26397.1| hypothetical protein BSAG_05002 [Bacteroides sp. D1]
          Length = 853

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 5/103 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR+I  L+++ L  N+LV+F SDNG      L F  +  S  PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGGVR    + W   +  G   + ++   D  PT+   AG +
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAGCQ 364


>gi|340620633|ref|YP_004739086.1| sulfatase [Zobellia galactanivorans]
 gi|339735430|emb|CAZ98807.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 538

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 7/104 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTV-DSLHFHGNTASNWPLRGTKYS 59
           +++++D++VGRV+ +L+  GL  N+LVVF SDNGG +  DS      + SN PLRG K  
Sbjct: 286 LVETMDDAVGRVLNTLDSLGLDKNTLVVFTSDNGGVSAGDSF-----STSNLPLRGGKGY 340

Query: 60  FHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
             EGG+R    + +P L++ G      +   D  PT+  + G +
Sbjct: 341 QFEGGIREPYFIKAPWLVKGGAYTSEPVTGADLYPTILDVVGAD 384


>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
          Length = 913

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M    D+ V     +L+ +G+  ++L+V  SDNGGP+        ++++N+PLRG KYS 
Sbjct: 632 MTAMADDVVRNATAALKAKGMWDDTLIVLTSDNGGPSGTD----ADSSNNFPLRGGKYSD 687

Query: 61  HEGGVRN---VAALWSPLLRKGQVLEN---LMHITDWLPTL 95
            EGGVR    VA  + P   +G+ L+     +HI DW  T 
Sbjct: 688 LEGGVRAAGLVAGGFLPDAARGRTLDGAGAYVHICDWYATF 728


>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 522

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 23/105 (21%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE+VG V   L++  L  NS+V+F +                     LRG K + 
Sbjct: 277 MVTCMDEAVGYVTTVLKQSALWENSVVIFSTX--------------------LRGGKGTL 316

Query: 61  HEGGVRNVAALWSPLLR---KGQVLENLMHITDWLPTLYFIAGKE 102
           +EGGVR V  + SPLL    +G +   L+HI+DW PTL  +AG +
Sbjct: 317 YEGGVRGVGFVTSPLLHEQVRGTINHELIHISDWFPTLTRLAGGD 361


>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
 gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
          Length = 473

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 7/95 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   +D+ +G V+ +L++ G+  N++VVF+SDNGG    +L    N  S   L+G K S 
Sbjct: 252 MTSDVDQDIGEVLAALDKRGIAENTIVVFMSDNGG----NLEAGANNGS---LQGGKSSV 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
           +EGGVR    +  P   + + +   M   DW PTL
Sbjct: 305 YEGGVRVPVLIRWPAQLQSKTMTQPMFTQDWAPTL 339


>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
 gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
          Length = 472

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 30/66 (45%), Positives = 44/66 (66%), Gaps = 7/66 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ SLD+ +G+V+ +L+   L  N+L++FI+DNGG TV       N++ N PLRG K S 
Sbjct: 251 MMTSLDDGIGKVMATLKANQLDKNTLIIFINDNGGATV-------NSSDNGPLRGMKGSK 303

Query: 61  HEGGVR 66
            EGG+R
Sbjct: 304 WEGGIR 309


>gi|149196555|ref|ZP_01873609.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149140235|gb|EDM28634.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 507

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 3/96 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            L  +DE VGR+   L E G+  N+++ F SDNG     +      T S   LRG K S 
Sbjct: 245 CLTEMDEQVGRIRAKLREMGVEKNTVLFFCSDNGPEGKKAKGAKAGTTSG--LRGRKRSL 302

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTL 95
           ++GGVR  A A W   ++ G V++  M   D+LPT+
Sbjct: 303 YDGGVRVPALAEWPGKIQAGSVIDAAMSTLDYLPTV 338


>gi|440718731|ref|ZP_20899174.1| arylsulfatase [Rhodopirellula baltica SWK14]
 gi|436436058|gb|ELP29849.1| arylsulfatase [Rhodopirellula baltica SWK14]
          Length = 490

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 9/107 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE VGR++++L E GL  N+ +VF SDNGG         G   S  PLRG K ++
Sbjct: 255 MIESVDEGVGRMVETLRELGLEENTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           +EGG+R      W  ++  G   +  +   D  PT   + G +   D
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAKLPAD 353


>gi|294644129|ref|ZP_06721904.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|292640494|gb|EFF58737.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
          Length = 743

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR+I  L+++ L  N+LV+F SDNG      L F  +  S  PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR    + W   +  G   + ++   D  PT+   AG
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362


>gi|283781484|ref|YP_003372239.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283439937|gb|ADB18379.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 475

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/107 (33%), Positives = 58/107 (54%), Gaps = 7/107 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLD+SVGR++  L+   L   ++V+F SDNGG  +  +       SN P R  K S 
Sbjct: 269 MVESLDQSVGRIMAKLDALDLTKKTIVIFTSDNGGLVLREI------TSNLPARAGKGSA 322

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           +EGGVR  +   + P+++ G   +      D  PTL  ++G + + D
Sbjct: 323 YEGGVRVPLIVSYPPMIKPGTTCDVPAISMDLFPTLAELSGAKYSHD 369


>gi|421611130|ref|ZP_16052281.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408497944|gb|EKK02452.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 496

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 59/102 (57%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D++VGRV++ +++ GL   ++V+  SDNGG +       G   SN PLRG K   
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLAEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK 101
           +EGG+R    + +P + + G   +  +   D+ PT+  + G+
Sbjct: 337 YEGGIREAFMICAPGVTQPGTTCKEPVCSIDFYPTILDLCGQ 378


>gi|326778304|ref|ZP_08237569.1| N-acetylgalactosamine-4-sulfatase [Streptomyces griseus XylebKG-1]
 gi|326658637|gb|EGE43483.1| N-acetylgalactosamine-4-sulfatase [Streptomyces griseus XylebKG-1]
          Length = 487

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+++L+  G   ++LV F SDNGG            + NWPL G K S 
Sbjct: 298 MVEDLDRSVGEVLKALKRSGQEKDTLVFFASDNGGERF---------SYNWPLAGNKGSL 348

Query: 61  HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
            EGG+R  + L  P    G  + +L +   DW  TL  + G
Sbjct: 349 QEGGIRVPSVLRWPARVDGHQVSDLPVFSPDWTATLLELGG 389


>gi|149175234|ref|ZP_01853856.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148845843|gb|EDL60184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 459

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 38/109 (34%), Positives = 55/109 (50%), Gaps = 7/109 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  LD  VGR++  + E  L   +L++F SDNGG        H    +N PLRG K   
Sbjct: 250 MIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGGHRGVPAQLH----TNGPLRGFKRDL 305

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE--KNID 106
            EGG+R    A W   +  G+V + ++   D LPT   +AG +   N+D
Sbjct: 306 TEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELAGAQVSANLD 354


>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
 gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
          Length = 466

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 54/100 (54%), Gaps = 7/100 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  ++D ++G V+  L+++GL  N+LVVF +DNGGP+        N ++N PL GTK + 
Sbjct: 252 MTLAMDRAIGTVLDELKKQGLDKNTLVVFTNDNGGPS------DANQSNNNPLSGTKANH 305

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
            EGG+R    + W   L+K       +   D  PT   +A
Sbjct: 306 LEGGIRVPFLMSWPGKLKKNTTYNYPISTLDLFPTFLNVA 345


>gi|32473667|ref|NP_866661.1| aryl-sulphate sulphohydrolase [Rhodopirellula baltica SH 1]
 gi|32444203|emb|CAD74200.1| aryl-sulphate sulphohydrolase [Rhodopirellula baltica SH 1]
          Length = 490

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE VGR++++L E GL  N+ +VF SDNGG         G   S  PLRG K ++
Sbjct: 255 MIESVDEGVGRMVETLRELGLEENTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R      W  ++  G   +  +   D  PT   + G +
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAK 349


>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 616

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 37/98 (37%), Positives = 51/98 (52%), Gaps = 15/98 (15%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSFHEG 63
           LD+ + R+   L+  G L N+++V  SDNG  PT           SN+PLRG K++  EG
Sbjct: 362 LDKEMRRLHDELDALGALDNTVLVVASDNGACPTAG--------GSNYPLRGYKHTIFEG 413

Query: 64  GVRNVAALWS------PLLRKGQVLENLMHITDWLPTL 95
           GVR  A ++S      P   +G     +MH TDW PT 
Sbjct: 414 GVRVPAFVYSKSTDLIPEEARGTRYSGMMHSTDWTPTF 451


>gi|440718016|ref|ZP_20898485.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436436817|gb|ELP30517.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 491

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 2/101 (1%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYSF 60
           + ++D +VGR+++ LEE G   ++L+VF SDNG  T+D       +  S  PLRG K   
Sbjct: 253 VANVDAAVGRLVKGLEEIGKRDDTLIVFTSDNGPETLDRYRSANRSYGSPGPLRGMKLHT 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            E G R    + W   +  GQ L+ ++   D LPT   +AG
Sbjct: 313 TEAGFRVPGIVNWPGEIEPGQTLDTVVSSLDLLPTFCRLAG 353


>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
 gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
          Length = 462

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 34/101 (33%), Positives = 54/101 (53%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D++V +V+ +++  G   N++VVF SDNGG            +  WP  G K   
Sbjct: 260 MVETMDQNVAKVLAAIDRSGKADNTVVVFTSDNGGERF---------SDTWPFVGHKGEV 310

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR  +   W   ++ G   E +M   D+LPTL  +AG
Sbjct: 311 LEGGVRVPLMVRWPRRIKAGSRSEQVMVSMDFLPTLLGMAG 351


>gi|32473340|ref|NP_866334.1| arylsulphatase A [Rhodopirellula baltica SH 1]
 gi|32398020|emb|CAD78114.1| arylsulphatase A [Rhodopirellula baltica SH 1]
          Length = 491

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 2/101 (1%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYSF 60
           + ++D +VGR+++ LEE G   ++L+VF SDNG  T+D       +  S  PLRG K   
Sbjct: 253 VANVDAAVGRLVKGLEELGKRDDTLIVFTSDNGPETLDRYRSANRSYGSPGPLRGMKLHT 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            E G R    + W   +  GQ L+ ++   D LPT   +AG
Sbjct: 313 TEAGFRVPGIVNWPGEIEPGQTLDTVVSSLDLLPTFCRLAG 353


>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
          Length = 467

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 43/108 (39%), Positives = 59/108 (54%), Gaps = 11/108 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPT-----VDSLHFHGNTASNWPLRG 55
           M+  +D SVGRV++ L   GL  N+LV+F SDN GPT      DS  F  N+A    LRG
Sbjct: 259 MVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDN-GPTHNVGGADSSFF--NSAGK--LRG 313

Query: 56  TKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
            K S +EGG+R     + P  ++ G   +  ++  D LPTL   AG +
Sbjct: 314 LKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGTK 361


>gi|254483658|ref|ZP_05096879.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
 gi|214036091|gb|EEB76777.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
          Length = 558

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 6/104 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D SV R++ +LE EG+  +++VVF SDNGG       + G    N P RG K + 
Sbjct: 332 MIRAVDRSVSRILDTLEAEGIADDTVVVFTSDNGGAG-----YIGIQDVNSPFRGWKITM 386

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
            EGG+R  +   W   +  G  ++  +   D +PTL   AG  +
Sbjct: 387 FEGGLRVPMFVKWPKRIAAGSSVDTPVAHIDMMPTLAAAAGASQ 430


>gi|182437669|ref|YP_001825388.1| sulfatase [Streptomyces griseus subsp. griseus NBRC 13350]
 gi|178466185|dbj|BAG20705.1| putative sulfatase [Streptomyces griseus subsp. griseus NBRC 13350]
          Length = 449

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+++L+  G   ++LV F SDNGG            + NWPL G K S 
Sbjct: 260 MVEDLDRSVGEVLKALKRSGQEKDTLVFFASDNGGERF---------SYNWPLAGNKGSL 310

Query: 61  HEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAG 100
            EGG+R  + L  P    G  + +L +   DW  TL  + G
Sbjct: 311 QEGGIRVPSVLRWPARVDGHQVSDLPVFSPDWTATLLELGG 351


>gi|299147171|ref|ZP_07040238.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Bacteroides sp. 3_1_23]
 gi|298515056|gb|EFI38938.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Bacteroides sp. 3_1_23]
          Length = 450

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKRIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|374620943|ref|ZP_09693477.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304170|gb|EHQ58354.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 524

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 5/102 (4%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L   D  VG+V+ +L+E GL  N++VV+ SDNG    +  H  G+T    P RG   + H
Sbjct: 262 LAEHDYRVGQVLDALDELGLTENTVVVYASDNGPDRAEFPHI-GDTG---PYRGYLGTVH 317

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           EG VR    L W   +  G+V  +++ I D++PT   I G+E
Sbjct: 318 EGSVRTPMMLRWPGKVEPGRVTNDIVAIHDFMPTFAAIVGEE 359


>gi|296121751|ref|YP_003629529.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014091|gb|ADG67330.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 536

 Score = 58.9 bits (141), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 8/103 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDS-LHFHGNTA------SNWPL 53
           M++ LD  VG VI SLE+ GLL  +L+VF SDNG     +   FH   A      S   L
Sbjct: 292 MIRDLDNHVGDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGADPLFFNSTREL 351

Query: 54  RGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
           +G K S +EGG+R  A + W   +     +    +  DW PTL
Sbjct: 352 KGFKGSIYEGGLRVPAIVSWPGQIPPATTINTPSYFPDWFPTL 394


>gi|423290540|ref|ZP_17269389.1| hypothetical protein HMPREF1069_04432 [Bacteroides ovatus
           CL02T12C04]
 gi|392665927|gb|EIY59450.1| hypothetical protein HMPREF1069_04432 [Bacteroides ovatus
           CL02T12C04]
          Length = 450

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKRIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|32474361|ref|NP_867355.1| arylsulfatase A [Rhodopirellula baltica SH 1]
 gi|32444899|emb|CAD74901.1| arylsulfatase A [Rhodopirellula baltica SH 1]
          Length = 525

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 14/111 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K++D SVGR++++L+E G+  N+LV+F SDNGG  + +L   G T  N PLR  K   
Sbjct: 300 MVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 356

Query: 61  HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P    G           + ++++   TD  PT+  + G
Sbjct: 357 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVG 407


>gi|297180066|gb|ADI16291.1| arylsulfatase a and related enzymes [uncultured bacterium
           HF0010_16H03]
          Length = 543

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 53/96 (55%), Gaps = 6/96 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD S+G+++ +LEE GL  N+L++F SDNGG       + G    N P RG K + 
Sbjct: 321 MIEALDRSIGKIVTTLEENGLTENTLIIFTSDNGGAG-----YIGLDNINKPYRGWKLTH 375

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            EGG+     A W   ++K    +  +H TD   T+
Sbjct: 376 FEGGMHIPFFAKWPAKIKKDMRYDKRIHHTDIFSTI 411


>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 486

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 40/109 (36%), Positives = 57/109 (52%), Gaps = 11/109 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTA---------SNW 51
           M+  LD+++GRV+ +L+++GL  N+L++F SDNGG    +  F G  A          N 
Sbjct: 243 MVACLDDNIGRVVAALDKKGLRENTLILFHSDNGG--THNPMFAGVMADVSKIKIPCDNS 300

Query: 52  PLRGTKYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           P R  K S  EGGVR  A    P   K   ++ L+H TD   T   +AG
Sbjct: 301 PYRDGKGSLFEGGVRVCALANWPGKIKPATVDGLIHATDIFTTFTKLAG 349


>gi|393784539|ref|ZP_10372702.1| hypothetical protein HMPREF1071_03570 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665520|gb|EIY59044.1| hypothetical protein HMPREF1071_03570 [Bacteroides salyersiae
           CL02T12C01]
          Length = 511

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 61/108 (56%), Gaps = 7/108 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G ++  LEE+G+  N++V+F+SDNGG T+      G    N+PL   K S 
Sbjct: 283 LVEGMDKSLGDLMDYLEEKGIAQNTVVIFMSDNGGYTI------GRPDKNFPLSEGKGSL 336

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGG+R  +   W  + +        + I D+ P++  +AG +K I +
Sbjct: 337 KEGGIREPMIVYWPGVTQPSAENATPVIIEDFFPSILELAGVKKGIHT 384


>gi|417302363|ref|ZP_12089465.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327541297|gb|EGF27839.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 544

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 14/111 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K++D SVGR++++L+E G+  N+LV+F SDNGG  + +L   G T  N PLR  K   
Sbjct: 319 MVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 375

Query: 61  HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P    G           + ++++   TD  PT+  + G
Sbjct: 376 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVG 426


>gi|171912351|ref|ZP_02927821.1| aryl-sulphate sulphohydrolase [Verrucomicrobium spinosum DSM 4136]
          Length = 491

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 45/107 (42%), Positives = 59/107 (55%), Gaps = 7/107 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS----NWPLRGT 56
           ML S+DESVGR+++ L++  L  N+LVVF SDNGG  V      G  A     N PLRG 
Sbjct: 248 MLASVDESVGRILKLLDDLKLSQNTLVVFSSDNGG--VGGYEREGIDAKGITDNAPLRGG 305

Query: 57  KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           K   +EGGVR      W   ++ GQV    ++  D  PTL  +AG +
Sbjct: 306 KGMLYEGGVRVPYVFQWKGHIKAGQVNTTPINSVDLYPTLVELAGAQ 352


>gi|170727981|ref|YP_001762007.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169813328|gb|ACA87912.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 482

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 58/104 (55%), Gaps = 7/104 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN----TASNWPLRGT 56
           M+ ++D+SVGR+++SLE++GL  N+LVV  SDNGG    S    G+    + +N PLR  
Sbjct: 275 MIWAVDKSVGRILESLEKQGLDKNTLVVLTSDNGG---FSTRHQGDERVTSTANLPLRSG 331

Query: 57  KYSFHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           K   +EGGVR    +  P  +     + L    D  PTL  +AG
Sbjct: 332 KGWVYEGGVRIPLLIHQPGQQIQSQHDTLTTSADLYPTLANVAG 375


>gi|393783639|ref|ZP_10371811.1| hypothetical protein HMPREF1071_02679 [Bacteroides salyersiae
           CL02T12C01]
 gi|392668564|gb|EIY62059.1| hypothetical protein HMPREF1071_02679 [Bacteroides salyersiae
           CL02T12C01]
          Length = 553

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 59/101 (58%), Gaps = 2/101 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G ++  LEE+ L  N++++F+SDNGG    +     +T  N+PL   K S 
Sbjct: 329 LVEGMDKSLGDLMDYLEEKNLQENTVILFMSDNGGLAAHTRAGQLHT-QNYPLNSGKGSA 387

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR  +   W   ++ G   +  + I D+ P++  IAG
Sbjct: 388 YEGGVREPMIVSWPGTVKPGSKCDRYLMIEDFFPSILEIAG 428


>gi|336415446|ref|ZP_08595786.1| hypothetical protein HMPREF1017_02894 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941042|gb|EGN02904.1| hypothetical protein HMPREF1017_02894 [Bacteroides ovatus
           3_8_47FAA]
          Length = 450

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|160885324|ref|ZP_02066327.1| hypothetical protein BACOVA_03323 [Bacteroides ovatus ATCC 8483]
 gi|156108946|gb|EDO10691.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
          Length = 450

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 460

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 12/108 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  SLD+ +GRV+ SL+  GL  N+LV+F++D+GG  V    + GN   N P RG K + 
Sbjct: 260 MTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGGDYV----YGGN---NQPFRGAKATL 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNIDS 107
            EGG+R    + W   ++ G     +    D  PT+   A    N+D+
Sbjct: 313 FEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFA----NVDT 356


>gi|427388135|ref|ZP_18884018.1| hypothetical protein HMPREF9447_05051 [Bacteroides oleiciplenus YIT
           12058]
 gi|425724718|gb|EKU87592.1| hypothetical protein HMPREF9447_05051 [Bacteroides oleiciplenus YIT
           12058]
          Length = 557

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 1/104 (0%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D+S+G V+  LE +G+  N++++F SDNGG +V +         N PLR  K S 
Sbjct: 303 MVEGMDKSLGDVLDFLETKGVADNTIIIFYSDNGGHSVGNEKGGEPHTQNLPLREGKGSV 362

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
           +EGG+R  +   W   +  G  +   +   D  PT+  +AG +K
Sbjct: 363 YEGGIRVPMLFCWPGKVSAGTRINTPVSTEDMFPTILEMAGVKK 406


>gi|423214930|ref|ZP_17201458.1| hypothetical protein HMPREF1074_02990 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692193|gb|EIY85431.1| hypothetical protein HMPREF1074_02990 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 450

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|390573121|ref|ZP_10253308.1| sulfatase [Burkholderia terrae BS001]
 gi|389934853|gb|EIM96794.1| sulfatase [Burkholderia terrae BS001]
          Length = 563

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 6/96 (6%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
           D+ VG++++SL+E G+  N++VV+ +DN GP      F    A+  P R  K S +EG  
Sbjct: 301 DQDVGKLLKSLDEMGIADNTIVVYTTDN-GPN----QFTWPDAATTPFRNEKDSNYEGAF 355

Query: 66  RNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           R  A + W   ++ GQV   +    DW PTL   AG
Sbjct: 356 RVPAMVRWPGHVKAGQVSNEIFSGMDWFPTLLAAAG 391


>gi|187736242|ref|YP_001878354.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
 gi|187426294|gb|ACD05573.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
          Length = 562

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 3/104 (2%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G + + L++  L  N++++F++DNGG  +      GN  SN+PL   K S 
Sbjct: 326 LIQGMDKSLGDIREYLKKNNLDKNTVIIFMADNGGLAISGR--MGNKESNYPLSFGKGSN 383

Query: 61  HEGGVRNVAALWSPLLRKGQ-VLENLMHITDWLPTLYFIAGKEK 103
            EGG+R    ++ P + K + V    + I D+ PT+  IAG +K
Sbjct: 384 REGGIREPMIVYWPGVTKAESVCTTPVIIEDFFPTILEIAGAKK 427


>gi|32474453|ref|NP_867447.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32444991|emb|CAD74993.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 496

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 63/111 (56%), Gaps = 9/111 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D++VGRV++ +++ GL   ++V+  SDNGG +       G   SN PLRG K   
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLAEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK----EKNID 106
           +EGG+R    + +P + + G   +  +   D+ PT+  + G+    EK +D
Sbjct: 337 YEGGIREAFMIRAPGVTQPGTTCKEPVCSIDFYPTILDLCGQSPPAEKIVD 387


>gi|420253579|ref|ZP_14756627.1| arylsulfatase A family protein [Burkholderia sp. BT03]
 gi|398051875|gb|EJL44184.1| arylsulfatase A family protein [Burkholderia sp. BT03]
          Length = 563

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 6/96 (6%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGGV 65
           D+ VG++++SL+E G+  N++VV+ +DN GP      F    A+  P R  K S +EG  
Sbjct: 301 DQDVGKLLKSLDEMGIADNTIVVYTTDN-GPN----QFTWPDAATTPFRNEKDSNYEGAF 355

Query: 66  RNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           R  A + W   ++ GQV   +    DW PTL   AG
Sbjct: 356 RVPAMVRWPGHVKAGQVSNEIFSGMDWFPTLLAAAG 391


>gi|440716917|ref|ZP_20897419.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436437985|gb|ELP31567.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 496

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 63/111 (56%), Gaps = 9/111 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D++VGRV++ +++ GL   ++V+  SDNGG +       G   SN PLRG K   
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLTEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK----EKNID 106
           +EGG+R    + +P + + G   +  +   D+ PT+  + G+    EK +D
Sbjct: 337 YEGGIREAFMIRAPGVTQPGTTCKEPVCSIDFYPTILDLCGQSPPAEKIVD 387


>gi|421613610|ref|ZP_16054683.1| arylsulfatase A [Rhodopirellula baltica SH28]
 gi|408495569|gb|EKK00155.1| arylsulfatase A [Rhodopirellula baltica SH28]
          Length = 503

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 38/111 (34%), Positives = 61/111 (54%), Gaps = 14/111 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K++D SVGR++++L+E G+  N+LV+F SDNGG  + +L   G T  N PLR  K   
Sbjct: 278 MVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 334

Query: 61  HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P    G           + ++++   TD  PT+  + G
Sbjct: 335 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVG 385


>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 442

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 36/96 (37%), Positives = 53/96 (55%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  SLDE+VG++   LE  G L N+++ FI+DNGG    S H    +A N PL+G K   
Sbjct: 243 MTHSLDENVGKIFNRLEAMGELDNTIIFFINDNGG----SKHI---SADNSPLKGYKGQL 295

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
           +EGG+R   A  W   ++   V  ++    D LP++
Sbjct: 296 YEGGIRVPFAVRWPGKIKGKTVCHSITSSLDILPSI 331


>gi|351732525|ref|ZP_08950216.1| sulfatase [Acidovorax radicis N35]
          Length = 444

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/107 (32%), Positives = 52/107 (48%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GR++ +L+  G+  ++LVVF SDNGG            + +WPL G K   
Sbjct: 240 MIHHMDEGIGRIMAALQRHGIDRDTLVVFTSDNGGERF---------SDSWPLVGGKMDL 290

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W  ++  G V        DW  T+   AG   + D
Sbjct: 291 TEGGIRVPWIAHWPAVIAPGSVSPQTCMTMDWSATMLDAAGAMPHAD 337


>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 471

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M  +LD + GRV+  L++ GL  N+++VF +DNGGPT        N + N PL GTK + 
Sbjct: 247 MTLALDRACGRVLDKLKQLGLDKNTIIVFSNDNGGPT------DKNASLNLPLSGTKSNH 300

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + W   ++        +   D LPT Y   G
Sbjct: 301 LEGGIRVPFLMSWPKHIQPKTTYTFPISTMDLLPTFYAAGG 341


>gi|325108172|ref|YP_004269240.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968440|gb|ADY59218.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 520

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 41/120 (34%), Positives = 65/120 (54%), Gaps = 18/120 (15%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLH-----FHGN-------- 46
           M+  LDE++GR++Q LE++G L N+L++F SDNG  P   S H     + G         
Sbjct: 270 MVDRLDENIGRLLQHLEQKGQLENTLILFCSDNGACPFERSKHIDIAPWKGGSFYLYDAS 329

Query: 47  --TASNWPLRGTKYSFHEGGVRNVAALWSP--LLRKGQVLENLMHITDWLPTLYFIAGKE 102
             T  N PLR  K + HEGG+ ++  +  P  + + G+  ++  H+ D + TL  +AG E
Sbjct: 330 WATVGNTPLRHYKQTQHEGGISSLLIVHWPGKIKQPGRWEDSPGHLVDVMATLLEVAGAE 389


>gi|149176594|ref|ZP_01855206.1| arylsulfatase A [Planctomyces maris DSM 8797]
 gi|148844473|gb|EDL58824.1| arylsulfatase A [Planctomyces maris DSM 8797]
          Length = 510

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/100 (35%), Positives = 51/100 (51%), Gaps = 2/100 (2%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD S G+++  L E G+  N+LV++ SDNG P    ++      +N PL G  Y+  
Sbjct: 275 IEELDWSTGQILDKLVELGIDKNTLVIWTSDNGSPMAKDMN-STERGTNKPLNGRGYTTS 333

Query: 62  EGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
           EG  R    +W P  +  G V E L    D LPT   +AG
Sbjct: 334 EGAFRVPTIVWWPETVPAGTVCEELATTMDLLPTFARLAG 373


>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 473

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 7/100 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   LD + G V+  L+E GL  N++VVF +DNGGP+        N ++N PL GTK + 
Sbjct: 259 MTLGLDRASGYVLDKLKELGLDDNTIVVFSNDNGGPS------DKNASNNAPLAGTKSNQ 312

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
            EGG+R    + W   ++ G   +  +   D LPT Y  A
Sbjct: 313 LEGGIRVPFLISWPKHIKPGSTYDYPVSTLDLLPTFYSAA 352


>gi|417304244|ref|ZP_12091273.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327539456|gb|EGF26071.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 496

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 59/102 (57%), Gaps = 5/102 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++++D++VGRV++ +++ GL   ++V+  SDNGG +       G   SN PLRG K   
Sbjct: 281 MVEAMDQAVGRVLKQIDDLGLTEETIVILTSDNGGLSTS----EGWPTSNLPLRGGKGWV 336

Query: 61  HEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGK 101
           +EGG+R    + +P + + G   +  +   D+ PT+  + G+
Sbjct: 337 YEGGIREAFMIRAPGVTQPGTTCKEPVCSIDFYPTILDLCGQ 378


>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
 gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 525

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 11/106 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF--------HGNT--ASN 50
           M+ +LD+ +GRV++SL + GL  N+L + ISDNG P    +           GNT  +S 
Sbjct: 253 MVLALDDGIGRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSP 312

Query: 51  WPLRGTKYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            P RG K   +EGG+R    + W   L +G V +N +   D  PT+
Sbjct: 313 GPFRGYKADTYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTV 358


>gi|119505013|ref|ZP_01627089.1| Twin-arginine translocation pathway signal [marine gamma
           proteobacterium HTCC2080]
 gi|119458995|gb|EAW40094.1| Twin-arginine translocation pathway signal [marine gamma
           proteobacterium HTCC2080]
          Length = 653

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 52/96 (54%), Gaps = 8/96 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           ML  +D +VGRV+++L+   L  ++LV+ +SDNGG             SN P  G K +F
Sbjct: 271 MLMQVDAAVGRVLETLDASDLTRSTLVIVVSDNGGT-------EKQLPSNQPFIGVKNTF 323

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTL 95
            EGG+R    + W  ++ K  V++  +   D+ PTL
Sbjct: 324 TEGGLRTPLLMRWPEVIPKNMVIDETVSYLDYFPTL 359


>gi|395804316|ref|ZP_10483556.1| sulfatase [Flavobacterium sp. F52]
 gi|395433415|gb|EJF99368.1| sulfatase [Flavobacterium sp. F52]
          Length = 539

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 33/103 (32%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ +LD+++GR+ + +++EGL  N+L++F SDNGG        +    +N PL+G K+S 
Sbjct: 330 MVSALDDAIGRIREKVKKEGLEDNTLIIFASDNGGAD------YTFATTNAPLKGGKFSH 383

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+    AL W   ++   V +N +   D   T+    G +
Sbjct: 384 FEGGINVPFALSWKGKIKPNTVYKNPVSTLDIFTTIATAIGSD 426


>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 826

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 9/105 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GRV+  + E GL  N++V+F SDNGG            A N+P RG K   
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMVIFYSDNGG--------RREHAVNFPYRGHKGML 317

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R      W   +R G   E+ +   D  PT    AG + +
Sbjct: 318 FEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS 362


>gi|421502322|ref|ZP_15949276.1| putative sulfatase [Pseudomonas mendocina DLHK]
 gi|400346754|gb|EJO95110.1| putative sulfatase [Pseudomonas mendocina DLHK]
          Length = 500

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 54/105 (51%), Gaps = 7/105 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD ++ R++ +LEE G   N++V+F SDNGG  +D         +N+PL G K ++
Sbjct: 124 ILSQLDHNIARLLTALEESGQAKNTVVIFASDNGGANID-------MDNNYPLVGKKATY 176

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGGVR    L  P   +   +       D  PTL  +AG +  +
Sbjct: 177 TEGGVRTPLLLRWPGRYENLDIREPTVFMDLFPTLVAMAGGKPPV 221


>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
 gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
          Length = 479

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 56/100 (56%), Gaps = 4/100 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG+V+ +L E+G+  N+LVVF SDNG   +  +  HG +A    L+  K + 
Sbjct: 262 VIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNGPWLI--MGAHGGSAGL--LKSGKGTS 317

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R  A  W P   K  V  N     D  PT+  IAG
Sbjct: 318 YEGGMREPAIFWWPEKIKPAVAHNTASTLDLFPTIMSIAG 357


>gi|449136514|ref|ZP_21771894.1| arylsulfatase [Rhodopirellula europaea 6C]
 gi|448884831|gb|EMB15303.1| arylsulfatase [Rhodopirellula europaea 6C]
          Length = 490

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE VGR++ +L E GL  ++ +VF SDNGG         G   S  PLRG K ++
Sbjct: 255 MIESVDEGVGRMVDTLRELGLEESTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGG+R      W  ++  G   E  +   D  PT   + G +
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKCEIPIIAADLYPTFVEMTGAK 349


>gi|343086300|ref|YP_004775595.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354834|gb|AEL27364.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 454

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 55/101 (54%), Gaps = 8/101 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            ++ LD+ +G+VI +L+E G   N+L++F SDNGG   D        A+N  LR  K S 
Sbjct: 252 FIEHLDDGIGQVINALKETGQYENTLIIFTSDNGGHLPD-------LANNGLLRDGKQSM 304

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG+R   A+ W   ++ G   + +    D  PTL  IAG
Sbjct: 305 YEGGLRVPTAISWPAKIKAGSTSDQVNLSMDIFPTLTEIAG 345


>gi|392969626|ref|ZP_10335041.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387841820|emb|CCH57099.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 477

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 39/97 (40%), Positives = 54/97 (55%), Gaps = 5/97 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTAS-NWPLRGTKYS 59
           ML S DE VGRV+Q L++ GL  N+LVVF SDNGG  +      G T + N PLR  K  
Sbjct: 264 MLLSFDEGVGRVLQKLKDTGLADNTLVVFTSDNGGLGMPEA---GPTPTDNSPLRKWKGH 320

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            +EGG+R  +    S +  +    +  +  TD++PT 
Sbjct: 321 VYEGGIRVPLIMAGSGITARNATCDRYLTGTDFVPTF 357


>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
           siliculosus]
          Length = 597

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 15/98 (15%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYSFHEG 63
           LD+ V R+  +L+  G+L N+++VF SDNG  P+           SN+PLRG K++  +G
Sbjct: 328 LDKEVRRLHTALDALGVLDNAVIVFASDNGACPS--------KGGSNYPLRGFKHTTFQG 379

Query: 64  GVRNVAALWS------PLLRKGQVLENLMHITDWLPTL 95
           GVR    ++S      P   +G     +MH  DW PTL
Sbjct: 380 GVRVPGFVYSKSADLIPEEARGTRYSGMMHSMDWTPTL 417


>gi|325110466|ref|YP_004271534.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324970734|gb|ADY61512.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 536

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 9/111 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D++VG V++ L++ GL  N++V F SDNGG +           SN P RG K   
Sbjct: 285 MIESMDDAVGIVLKKLDDLGLRENTIVCFTSDNGGVSSGDAF----ATSNLPFRGGKGRQ 340

Query: 61  HEGGVRNVAALWSPLLR-KGQVLENLMHITDWLPTLYFIAG----KEKNID 106
            EGG+R    + +P     G  ++   +  DW PTL  +AG    KE+ +D
Sbjct: 341 WEGGIRVPFYISAPSAAVAGTTVDTPANGIDWYPTLLELAGVDQPKEQQVD 391


>gi|298479797|ref|ZP_06997997.1| LOW QUALITY PROTEIN: sulfatase family protein [Bacteroides sp. D22]
 gi|298274187|gb|EFI15748.1| LOW QUALITY PROTEIN: sulfatase family protein [Bacteroides sp. D22]
          Length = 456

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR++  L+++ L  N+LV+F SDNG      L F  +  S  PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLLDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGGVR    + W   +  G   + ++   D  PT+   AG
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362


>gi|168702101|ref|ZP_02734378.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Gemmata
           obscuriglobus UQM 2246]
          Length = 596

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 36/97 (37%), Positives = 58/97 (59%), Gaps = 4/97 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGNTASNWPLRGTKYS 59
           M++S+D +VGRV++ L++  L  N+LV+F SDNGG  T++ + F      N PLR  K  
Sbjct: 250 MVESMDAAVGRVLKRLDDLKLSDNTLVLFTSDNGGLATLEGMPFAPTI--NAPLREGKGY 307

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            +EGGVR  + A W   ++ G V++ +    D+  T+
Sbjct: 308 LYEGGVRVPLIAKWPGKVKPGTVMDQVACSIDFFDTI 344


>gi|293372063|ref|ZP_06618458.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292632967|gb|EFF51552.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 450

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 40/103 (38%), Positives = 61/103 (59%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR+I++L+E GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRLIKALDELGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAG 100
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAG 336


>gi|296123895|ref|YP_003631673.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016235|gb|ADG69474.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 637

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 57/101 (56%), Gaps = 4/101 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN-TASNWPLRGTKYS 59
           M+ +LD+ VG ++ +LE   L   +++VF SDNGG   D++   G+   SN PLRG K S
Sbjct: 282 MIAALDQRVGLIVAALERLQLTQRTIIVFTSDNGGNMTDTI--EGDLLTSNRPLRGGKGS 339

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
            +EGG R  +  +W  +    +  ++ +   D LPTL  +A
Sbjct: 340 MYEGGSRVPLIVVWPGVATPARSCDDAVSAVDLLPTLVDMA 380


>gi|440716652|ref|ZP_20897156.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436438149|gb|ELP31709.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 521

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 14/111 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K++D SVGR++++L+E G+  N+L+VF SDNGG  + +L   G T  N PLR  K   
Sbjct: 296 MVKAVDTSVGRIMKALKEHGVDDNTLIVFFSDNGG--LSTLRKFGPTC-NSPLRAGKGWL 352

Query: 61  HEGGVRNVAALWSPLLRKG-----------QVLENLMHITDWLPTLYFIAG 100
           +EGG+R    +  P    G           + +++    TD  PT+  + G
Sbjct: 353 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSFACSTDLFPTILDVVG 403


>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 472

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 36/102 (35%), Positives = 55/102 (53%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHG-NTASNWPLRGTKYS 59
           M+  +D  +G VIQSL++ G+  N+L+ F+SDNGGP        G N + N P RG K +
Sbjct: 246 MVDVMDRGIGTVIQSLKDSGIYENTLIFFLSDNGGPQSKGGSTKGWNGSDNQPFRGGKGN 305

Query: 60  FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            ++GGV     A W   ++ G V+   ++  D   T   +AG
Sbjct: 306 LYDGGVHVPFFASWPNKIKPGTVVNYPVNSLDIGRTAVALAG 347


>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
          Length = 511

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 50/82 (60%), Gaps = 6/82 (7%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGP-TVDSLHFHGNT-----ASNWPLRG 55
           L ++D ++  + ++ +  G+  N L+V+ +DNGGP TV S    G       +SN+PLRG
Sbjct: 251 LTTVDSALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPNKGKQEDSIGSSNYPLRG 310

Query: 56  TKYSFHEGGVRNVAALWSPLLR 77
            K++ +EGGVR+ A L  P L+
Sbjct: 311 GKHNAYEGGVRSTAFLSGPALK 332


>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
 gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
          Length = 426

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ LD SVG+V+ +L+  G   ++LV F SDNGG            + NWPL G K S 
Sbjct: 240 LVEDLDRSVGKVLAALKRSGQEEDTLVFFASDNGGERF---------SYNWPLSGNKGSL 290

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +GG+R  + L W   +  GQV    ++  DW  TL  + G
Sbjct: 291 LDGGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGG 331


>gi|291227067|ref|XP_002733509.1| PREDICTED: steroid sulfatase-like, partial [Saccoglossus
           kowalevskii]
          Length = 321

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 59/107 (55%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +G + ++L+E  L  N++++F S       D+     +  +NWPLRG K ++
Sbjct: 41  MVTCMDEGIGNITRTLKETELWNNTIIIFSS-------DNGGNPNHGGNNWPLRGGKGTY 93

Query: 61  HEGGVRNVAALWSPLLRK---GQVLENLMHITDWLPTLYFIAGKEKN 104
            EG +  +  ++S  + K   G V   ++H++DW PT+  +AG   N
Sbjct: 94  WEGAIHGLGFVYSSAIAKEMRGTVNTEMIHVSDWFPTIVNLAGGSLN 140


>gi|417302824|ref|ZP_12089908.1| sulfatase [Rhodopirellula baltica WH47]
 gi|327540898|gb|EGF27458.1| sulfatase [Rhodopirellula baltica WH47]
          Length = 490

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 9/107 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+DE VGR++ +L E GL  N+ +VF SDNGG         G   S  PLRG K ++
Sbjct: 255 MIESVDEGVGRMVDALRELGLEENTAIVFTSDNGG--------FGPATSMKPLRGYKGTY 306

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           +EGG+R      W  ++  G   +  +   D  PT   + G +   D
Sbjct: 307 YEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAKLPAD 353


>gi|410638759|ref|ZP_11349312.1| hypothetical protein GLIP_3906 [Glaciecola lipolytica E3]
 gi|410141287|dbj|GAC16517.1| hypothetical protein GLIP_3906 [Glaciecola lipolytica E3]
          Length = 508

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 63/113 (55%), Gaps = 14/113 (12%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
            L  LD+++G ++  LE+  LL N+++V++SDNGG T          A+N   RG+KY+ 
Sbjct: 296 QLYYLDQAIGEIVNELEKNNLLENTIIVYLSDNGGST-------PIYANNGRFRGSKYTL 348

Query: 61  HEGGVRNVAALWSPLLRK-----GQVLENLMHITDWLPTLYFIAGKE--KNID 106
           +EGG+R    ++ P   +      + +++++   D  PTL   AG +  +NID
Sbjct: 349 YEGGIRVPMIVYIPEEFRHNTTMNRRIDDVVSTMDIYPTLLQAAGLQAPENID 401


>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
 gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
          Length = 488

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 28/65 (43%), Positives = 41/65 (63%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D+ +GR++  L++ G   N+L+ F+SDNGG   +S   + N ASN P R  K S 
Sbjct: 253 MVDSMDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYPESWMPNSNWASNKPFRRGKVSL 312

Query: 61  HEGGV 65
            EGGV
Sbjct: 313 TEGGV 317


>gi|126663705|ref|ZP_01734701.1| arylsulfatase A precursor [Flavobacteria bacterium BAL38]
 gi|126624288|gb|EAZ94980.1| arylsulfatase A precursor [Flavobacteria bacterium BAL38]
          Length = 512

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/66 (48%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNG-----GPTVDSLHFHGNTASNWPLRGTKYSF 60
           D  VG +I++L+EEG+L N+L+VF SDNG     G   D++   GN   N PLRG KYS 
Sbjct: 300 DYCVGEIIKTLKEEGVLDNTLIVFSSDNGPVLNDGYYDDAVEKLGNHKPNGPLRGGKYSL 359

Query: 61  HEGGVR 66
            E G R
Sbjct: 360 LEAGTR 365


>gi|440717770|ref|ZP_20898247.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436437072|gb|ELP30746.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 480

 Score = 58.5 bits (140), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 38/109 (34%), Positives = 57/109 (52%), Gaps = 7/109 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG------GPTVDSLHFHGNTASNWPLR 54
           +++ +D +VG+++ +L E  L  N+ V+F SDNG         VD      +  S  PLR
Sbjct: 217 VIEEIDFNVGKILDTLSELKLADNTYVLFTSDNGPWLVKNKGHVDGHRPRDHGGSAGPLR 276

Query: 55  GTKYSFHEGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAGKE 102
             K S  EGGVR  A LW P  +  G V +++    D +PTL  +AG +
Sbjct: 277 SGKVSSFEGGVRVPAVLWGPGRVPAGTVCDSIATTMDVMPTLAALAGAD 325


>gi|393788830|ref|ZP_10376956.1| hypothetical protein HMPREF1068_03236 [Bacteroides nordii
           CL02T12C05]
 gi|392653936|gb|EIY47586.1| hypothetical protein HMPREF1068_03236 [Bacteroides nordii
           CL02T12C05]
          Length = 509

 Score = 58.5 bits (140), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 60/106 (56%), Gaps = 7/106 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D+S+G ++  LE +G+  N++++F+SDNGG T+      G T  N+PL   K S 
Sbjct: 284 LVEGMDKSLGDLMDYLEVKGIAQNTVIIFMSDNGGYTI------GRTNKNYPLSEGKGSL 337

Query: 61  HEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
            EGG+R  +   W  + +   V    + I D+ P++  +AG +  I
Sbjct: 338 KEGGIREPMIVCWPGVTQPATVNVTPVIIEDFFPSILELAGMKGRI 383


>gi|298375848|ref|ZP_06985804.1| sulfatase family protein [Bacteroides sp. 3_1_19]
 gi|298266885|gb|EFI08542.1| sulfatase family protein [Bacteroides sp. 3_1_19]
          Length = 459

 Score = 58.5 bits (140), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG V+++L E GL  N+ V+F SDNG      L  H N  S  PL+  K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTKHENGGSAGPLKDGKGTW 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  W P      + + +M   D  PT   +AG E+  D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350


>gi|256840089|ref|ZP_05545598.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|256739019|gb|EEU52344.1| conserved hypothetical protein [Parabacteroides sp. D13]
          Length = 514

 Score = 58.5 bits (140), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 33/102 (32%), Positives = 60/102 (58%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
           +++ +D+S+G ++  LE+ G + N++V+F+SDNGG + +     G     N+PL   K S
Sbjct: 287 LIEGMDKSLGDLMDWLEKNGEVDNTIVIFMSDNGGLSSEPGWRDGEIHTQNYPLNSGKGS 346

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +EGGVR  +   W  +++ G   +  + I D+ PT+  +AG
Sbjct: 347 AYEGGVREPMIVRWPGVVKPGSKCDKYLIIEDFYPTILEMAG 388


>gi|313232711|emb|CBY19381.1| unnamed protein product [Oikopleura dioica]
          Length = 453

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 10/96 (10%)

Query: 2   LKSLDESVGRVIQSLEEEGL--LGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
           +KS+D  +GR +  L+   +    N++V+F SDNGGP           ASNWPLRG K  
Sbjct: 173 VKSMDLEIGRYLDVLKRIYIPDYNNTIVIFSSDNGGP-------RKLGASNWPLRGGKGC 225

Query: 60  FHEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTL 95
             +G  R+ A +  P + K +   ++ H  DW+PTL
Sbjct: 226 IFDGAFRSRALIHIPGM-KHKNYPHMFHAVDWMPTL 260


>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 494

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 40/64 (62%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ S+D+ +GRV+ +LE+ G L N+L+ F SDNGGP     +   + A N P RG K + 
Sbjct: 255 MIDSMDQGIGRVVDALEKSGKLENTLIFFFSDNGGPRPKPGYEGEDFADNTPYRGGKGAM 314

Query: 61  HEGG 64
            EGG
Sbjct: 315 LEGG 318


>gi|336403079|ref|ZP_08583800.1| hypothetical protein HMPREF0127_01113 [Bacteroides sp. 1_1_30]
 gi|335947153|gb|EGN08947.1| hypothetical protein HMPREF0127_01113 [Bacteroides sp. 1_1_30]
          Length = 853

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 58/103 (56%), Gaps = 5/103 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D +VGR++  L+++ L  N+LV+F SDNG      L F  +  S  PLRG K+S+
Sbjct: 266 VVEEIDWNVGRLLDYLDKKKLAENTLVIFASDNG----PWLSFKEDGGSAEPLRGGKFSY 321

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           +EGGVR    + W   +  G   + ++   D  PT+   AG +
Sbjct: 322 YEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAGCQ 364


>gi|119713775|gb|ABL97823.1| putative secreted sulfatase [uncultured marine bacterium
           HF10_49E08]
          Length = 667

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 60/111 (54%), Gaps = 11/111 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGN----------TASN 50
           ML++ DES+GR+ + L++ GL  N++V+F +DNGG +  + +   N           +SN
Sbjct: 286 MLEATDESLGRIRKKLKDLGLADNTIVIFTADNGGMSASNQYRGINHPIESLDSRFASSN 345

Query: 51  WPLRGTKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            PLRG K   +EGG+R  +   W   ++       L+  TD+ PTL  + G
Sbjct: 346 LPLRGAKGWNYEGGIRVPLVVYWPGRIKPDSTSNALVTGTDFYPTLLEMIG 396


>gi|87308971|ref|ZP_01091109.1| Arylsulfatase [Blastopirellula marina DSM 3645]
 gi|87288314|gb|EAQ80210.1| Arylsulfatase [Blastopirellula marina DSM 3645]
          Length = 482

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 57/101 (56%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNW-----PLRG 55
           M+  +D  +G+++  +E  GL G++LV+F SDNG  T D L   G + S++      LRG
Sbjct: 269 MITHMDRDIGKILSKIETLGLDGDTLVIFTSDNGA-TYDRL---GGSDSDFFESVANLRG 324

Query: 56  TKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTL 95
            K S +EGGVR  + A W   ++ GQV      + D LPT+
Sbjct: 325 LKGSLYEGGVRVPLVARWPGKIKSGQVSHVPSALWDMLPTI 365


>gi|325109525|ref|YP_004270593.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969793|gb|ADY60571.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 453

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 60/101 (59%), Gaps = 6/101 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D  VG ++Q+L++ GL  N+LVVF SDN GP ++ +  HG+     PLR  K++ 
Sbjct: 238 VVQEIDWQVGELMQTLDKLGLTDNTLVVFASDN-GPQLN-VEGHGSAG---PLRDGKWTN 292

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R    + W  ++  G V   ++ I D LPT   IAG
Sbjct: 293 FEGGIRVPCLMRWPSVVPAGSVNNEIVGIIDMLPTFCEIAG 333


>gi|149173056|ref|ZP_01851687.1| arylsulphatase A [Planctomyces maris DSM 8797]
 gi|148847862|gb|EDL62194.1| arylsulphatase A [Planctomyces maris DSM 8797]
          Length = 442

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 36/117 (30%), Positives = 63/117 (53%), Gaps = 12/117 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWP-------- 52
           M+ S+D+ VG+V+ SL E G+  N+L++F +DNG P    L  + N     P        
Sbjct: 236 MIASMDDMVGKVVASLNEMGVRDNTLIIFTTDNGTPAASYLTVNENGKMVRPKVVSVQNG 295

Query: 53  --LRGTKYSFHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG-KEKNI 105
             + G K    + G R  + A W   ++ G  +++++ ++D+LPT+  IAG KE ++
Sbjct: 296 KIVPGGKGKLDDTGTRVPLIANWPGHIKAGTEVDDMVDMSDYLPTVAEIAGLKEADV 352


>gi|423330335|ref|ZP_17308119.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
           CL03T12C09]
 gi|409231951|gb|EKN24799.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
           CL03T12C09]
          Length = 459

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG V+++L E GL  N+ V+F SDNG      L  H N  S  PL+  K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  W P      + + +M   D  PT   +AG E+  D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350


>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 826

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 9/105 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GRV+  + E GL  N++++F SDNGG            A N+P RG K   
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG--------RREHAVNFPYRGHKGML 317

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R      W   +R G   E+ +   D  PT    AG + +
Sbjct: 318 FEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS 362


>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 512

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 12/104 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M   LD+SVG V ++L+++ +  +SL+V+  DNGG       +      + P RG K+S 
Sbjct: 243 MAAFLDDSVGNVTKALKKQNMYDSSLIVYTPDNGG-------YLTQGGDDAPFRGGKFSD 295

Query: 61  HEGGVRNVAALWS----PLLRKGQVLENLMHITDWLPTLYFIAG 100
            +GGVR VAA  S    P   +G     L+H+ DW+ T   + G
Sbjct: 296 FQGGVR-VAAFISGGLVPAGLRGTASAALVHVCDWMATFAALGG 338


>gi|423294501|ref|ZP_17272628.1| hypothetical protein HMPREF1070_01293 [Bacteroides ovatus
           CL03T12C18]
 gi|392675692|gb|EIY69133.1| hypothetical protein HMPREF1070_01293 [Bacteroides ovatus
           CL03T12C18]
          Length = 518

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 59/102 (57%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
           +++ +D+S+G ++  LE+ G + N++++F+SDNGG   +S    G     N+PL   K S
Sbjct: 291 LIEGMDKSLGDLMNWLEKNGEVNNTIIIFMSDNGGLASESGWRDGKLHTQNYPLNSGKGS 350

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +EGG+R  +   W  ++  G    N + I D+ PT+  +AG
Sbjct: 351 TYEGGIREPMIVSWPGVVAPGSKCNNYLLIEDFYPTILEMAG 392


>gi|343086294|ref|YP_004775589.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354828|gb|AEL27358.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 511

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 16/115 (13%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS- 59
           M+  +DE VG +++ L+ +GL  N+L++F SDN GPT     F+G T S W   G  +  
Sbjct: 299 MVSYMDEQVGELVEQLKSKGLYENTLIIFTSDN-GPT-----FNGGTDSPWFNSGGAFRE 352

Query: 60  --------FHEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
                    +EGG+R  + A W   +++G V ++     D +PTL  +AGK   I
Sbjct: 353 EKGFGKGFLYEGGIRVPMIASWPGKIKEGSVSDHASVFYDVMPTLAEVAGKPLEI 407


>gi|398909246|ref|ZP_10654456.1| arylsulfatase A family protein [Pseudomonas sp. GM49]
 gi|398188431|gb|EJM75734.1| arylsulfatase A family protein [Pseudomonas sp. GM49]
          Length = 559

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 6/106 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +  LD+SVG +++ +E+ G   N++VVF +DNG        F        P +GTK +  
Sbjct: 300 MAQLDDSVGAILKCVEDAGEADNTIVVFTTDNGAEV-----FTWPDGGMTPFKGTKGTVM 354

Query: 62  EGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
           EGG R  A + W   ++ G V   +    DW PTL   AG     D
Sbjct: 355 EGGFRAPAIIRWPGKIKPGTVENGIYSALDWFPTLVAAAGNPNITD 400


>gi|294817955|ref|ZP_06776597.1| putative sulfatase [Streptomyces clavuligerus ATCC 27064]
 gi|326446919|ref|ZP_08221653.1| putative sulfatase [Streptomyces clavuligerus ATCC 27064]
 gi|294322770|gb|EFG04905.1| putative sulfatase [Streptomyces clavuligerus ATCC 27064]
          Length = 473

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 50/101 (49%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD SVG V+ +L   G   ++LVVF SDNGG            +  WPL G K S 
Sbjct: 286 MVEDLDRSVGAVLAALRRSGQEDDTLVVFASDNGGERF---------SYTWPLSGGKGSL 336

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +GG+R    L W   +  GQV    ++  DW  TL  + G
Sbjct: 337 LDGGIRVPTILRWPARIEGGQVSRVPVYSPDWTATLLEVGG 377


>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 543

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 51/105 (48%), Gaps = 9/105 (8%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GRV+  + E GL  N++++F SDNGG            A N+P RG K   
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG--------RREHAVNFPYRGHKGML 317

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKN 104
            EGG+R      W   +R G   E+ +   D  PT    AG + +
Sbjct: 318 FEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS 362


>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
 gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 463

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 13/111 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D +VGR+  +L   G+  N+++VF+SDNGG         G  A N P RG K   
Sbjct: 243 MVNAVDANVGRIDSTLVANGMKENTILVFLSDNGG--------RGEHADNRPYRGHKGML 294

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
            EGG++      W   L+ GQ  E ++   D  PT    AG    KE  +D
Sbjct: 295 FEGGIKVPFFITWPKKLKGGQHYEKMVSSLDLFPTFLEAAGGKVKKESQLD 345


>gi|304309759|ref|YP_003809357.1| sulfatase [gamma proteobacterium HdN1]
 gi|301795492|emb|CBL43690.1| probable sulfatase precursor [gamma proteobacterium HdN1]
          Length = 661

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 56/100 (56%), Gaps = 7/100 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +L  LD++V R++ +L+++G+  +++VVF SDNGGP       +    +N PL+G K ++
Sbjct: 280 VLNQLDDNVARLLAALKQKGVDDDTIVVFASDNGGP-------NRARNNNLPLQGLKATY 332

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGGVR    +  P     + +     + D  PTL  +AG
Sbjct: 333 TEGGVRTPVMVRWPARFSSRDVVQPTQLVDLFPTLVALAG 372


>gi|47216038|emb|CAG11369.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 474

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L   D+++G ++ +LE  G++ N+L+ F SDN GP +  +   GN     PLR  K + +
Sbjct: 226 LLEFDQTIGSLLATLERTGVINNTLIFFTSDN-GPELMRMSRGGNAG---PLRCGKGTTY 281

Query: 62  EGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
           EGG+R  A A W  L++ G V   +    D LPT   +AG +
Sbjct: 282 EGGMREPAIAYWQGLIQPG-VTHEMASTLDILPTFASLAGAK 322


>gi|256841082|ref|ZP_05546589.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|256736925|gb|EEU50252.1| arylsulfatase A [Parabacteroides sp. D13]
          Length = 459

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG V+++L E GL  N+ V+F SDNG      L  H N  S  PL+  K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  W P      + + +M   D  PT   +AG E+  D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350


>gi|393784389|ref|ZP_10372554.1| hypothetical protein HMPREF1071_03422 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666165|gb|EIY59682.1| hypothetical protein HMPREF1071_03422 [Bacteroides salyersiae
           CL02T12C01]
          Length = 463

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 35/109 (32%), Positives = 57/109 (52%), Gaps = 12/109 (11%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-----ASNWPLRG 55
           M+  LD +VG+++  L+E+ +L N++++F SDNG      +H  G        SN   RG
Sbjct: 261 MVSCLDHAVGKIVGKLKEKNILENTIIIFSSDNG------VHQEGGHDPEYFNSNGIFRG 314

Query: 56  TKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEK 103
            K   +EGG+       W  ++ KGQ   ++    D+LPT+  I G +K
Sbjct: 315 IKRDLYEGGIHTPFIVQWENVIAKGQTSSHISAFWDFLPTVADIVGCDK 363


>gi|87198499|ref|YP_495756.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
 gi|87134180|gb|ABD24922.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
          Length = 491

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ LD  +GRV+++L+      +++VVF SDNGG            +  WP  G K   
Sbjct: 283 MVRRLDYQIGRVLEALKANRAEQDTIVVFTSDNGGERF---------SDTWPFSGRKTEL 333

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  A + W  + R G   +  +   DWLPT    AG
Sbjct: 334 LEGGLRIPAIVRWPGVTRAGTTSDAQIISMDWLPTFLAAAG 374


>gi|343087660|ref|YP_004776955.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342356194|gb|AEL28724.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 513

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 61/117 (52%), Gaps = 19/117 (16%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSL-----HFHGNTAS---------- 49
           LD  +G ++Q LE  G+  N+L++F SDNG   + +        H N+ S          
Sbjct: 300 LDYYIGELLQELERLGMKENTLIIFASDNGSQFITTHAGQEGESHSNSPSNVNKEINPDA 359

Query: 50  ---NWPLRGTKYSFHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
              N+P +GTK+S +EGGVR  + A W     +G V + L+ + D LP+L  + G++
Sbjct: 360 HQPNYPYKGTKWSVNEGGVRTPLIASWKGNFPEGLVSDQLIALNDVLPSLAALVGEK 416


>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 445

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 55/103 (53%), Gaps = 10/103 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +DE VGR++QSL+E G   N++V+FISDNG           +   N  LRG K   
Sbjct: 253 MVEVMDEGVGRIMQSLKETGQYENTIVIFISDNGA---------NHYGDNGGLRGFKAGP 303

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHIT-DWLPTLYFIAGKE 102
           +EGG R  A    P   KG  + N   ++ D LPTL    G++
Sbjct: 304 YEGGSRVPAIFSFPKEVKGGSVNNQTVLSMDLLPTLLDFIGRK 346


>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
           [Ciona intestinalis]
          Length = 513

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 8/107 (7%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  VG +IQ L++ GL  N+LV+F SDNG   + S   +G   SN P    K +  EGG
Sbjct: 252 LDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMIGSAFENG---SNGPFLCGKQTTFEGG 308

Query: 65  VRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG----KEKNID 106
           +R     W P  +++GQV   +  + DW  T   +A     K++ ID
Sbjct: 309 IREPTIAWGPGFVQQGQVSHQVGSLMDWFATALDLASIAVPKDRIID 355


>gi|16125424|ref|NP_419988.1| sulfatase [Caulobacter crescentus CB15]
 gi|13422494|gb|AAK23156.1| sulfatase family protein [Caulobacter crescentus CB15]
          Length = 521

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 5/66 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD +VG+V+Q+L++ GL  N+LV+F SDNGG      ++ G    N P RG K +F
Sbjct: 299 MVRNLDRNVGKVLQALKDRGLDDNTLVIFTSDNGGA-----NYIGLPDINKPYRGWKATF 353

Query: 61  HEGGVR 66
            EGG++
Sbjct: 354 FEGGIK 359


>gi|149196983|ref|ZP_01874036.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
 gi|149140093|gb|EDM28493.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
          Length = 485

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 9/100 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +D+S+G +++ +EE      +LV+F+SDNGG +            N PLR  K S 
Sbjct: 242 MIDYMDQSIGLLMKGIEEMKSDRETLVIFMSDNGGLS--------RVWENEPLRRGKGSA 293

Query: 61  HEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
           +EGG+R    + W   ++ GQ +   +HITD  P+   IA
Sbjct: 294 YEGGIRVPLIMHWPERIKAGQTISEPVHITDLFPSFLSIA 333


>gi|443325558|ref|ZP_21054248.1| arylsulfatase A family protein [Xenococcus sp. PCC 7305]
 gi|442794839|gb|ELS04236.1| arylsulfatase A family protein [Xenococcus sp. PCC 7305]
          Length = 523

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 35/100 (35%), Positives = 56/100 (56%), Gaps = 12/100 (12%)

Query: 6   DESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT---ASNWPLRGTKYSFHE 62
           D++VG+++ +L+E GL  N++V++ +DNG         H NT   A+  P RG K +  E
Sbjct: 278 DKNVGQILVALDEAGLTNNTIVIYGTDNGP--------HMNTWPDAAMTPFRGEKNTGWE 329

Query: 63  GGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGK 101
           GG R    + W   ++ G V  ++M   DW+PTL   AG+
Sbjct: 330 GGFRTPCMIRWPGHIQAGTVSNDIMSNLDWMPTLVAAAGE 369


>gi|262383796|ref|ZP_06076932.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294694|gb|EEY82626.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 459

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG V+++L E GL  N+ V+F SDNG      L  H N  S  PL+  K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  W P      + + +M   D  PT   +AG E+  D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350


>gi|301310141|ref|ZP_07216080.1| N-acetylgalactosamine-6-sulfatase [Bacteroides sp. 20_3]
 gi|423336382|ref|ZP_17314129.1| hypothetical protein HMPREF1059_00081 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831715|gb|EFK62346.1| N-acetylgalactosamine-6-sulfatase [Bacteroides sp. 20_3]
 gi|409240857|gb|EKN33631.1| hypothetical protein HMPREF1059_00081 [Parabacteroides distasonis
           CL09T03C24]
          Length = 510

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 33/102 (32%), Positives = 59/102 (57%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
           +++ +D+S+G ++  LE+ G   N++V+F+SDNGG + +     G     N+PL   K S
Sbjct: 284 LIEGMDKSLGDLMDWLEKNGEADNTIVIFMSDNGGLSSEPRWRDGEIHTQNYPLNSGKGS 343

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +EGGVR  +   W  +++ G   +  + I D+ PT+  +AG
Sbjct: 344 AYEGGVREPMIVRWPGVVKPGSKCDKYLIIEDFYPTILEMAG 385


>gi|307152367|ref|YP_003887751.1| sulfatase [Cyanothece sp. PCC 7822]
 gi|306982595|gb|ADN14476.1| sulfatase [Cyanothece sp. PCC 7822]
          Length = 570

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 16/110 (14%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-----PTVDSLHFHGNTASNWPLRGT 56
           ++  D+ VG ++  L+E+GL  +++V++ +DNG      P   +  FHG   +NW     
Sbjct: 308 MEEHDQDVGILLDYLDEQGLTDDTIVIYTTDNGAEVFSWPDGGTTPFHGEKNTNW----- 362

Query: 57  KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNI 105
                EGG R  A + W   +  GQ+   ++   DWLPTL   AG   +I
Sbjct: 363 -----EGGFRVPAMIRWPGYIEAGQISNEIISHQDWLPTLLAAAGAPDDI 407


>gi|448409003|ref|ZP_21574483.1| sulfatase [Halosimplex carlsbadense 2-9-1]
 gi|445673783|gb|ELZ26342.1| sulfatase [Halosimplex carlsbadense 2-9-1]
          Length = 496

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 37/105 (35%), Positives = 58/105 (55%), Gaps = 5/105 (4%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGG-PTVDSLHFHGN---TASNWPLRGT 56
           M+ +LD +VGR++ +L+E G+   +LVVF SDNGG    D +    +     S  PLRG 
Sbjct: 275 MVSNLDANVGRLLAALDERGIREETLVVFTSDNGGLGDYDEIGLPRDWKRVTSQGPLRGG 334

Query: 57  KYSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           K + +EGG+R    + W  ++  G+     +  TD+ PT   +AG
Sbjct: 335 KRTLYEGGIRVPTVVSWPGVVPGGRRSSTPIAGTDFFPTFVDLAG 379


>gi|262405065|ref|ZP_06081615.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294643590|ref|ZP_06721394.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294808689|ref|ZP_06767423.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|345511384|ref|ZP_08790927.1| hypothetical protein BSAG_02032 [Bacteroides sp. D1]
 gi|229444530|gb|EEO50321.1| hypothetical protein BSAG_02032 [Bacteroides sp. D1]
 gi|262355940|gb|EEZ05030.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292641085|gb|EFF59299.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294444128|gb|EFG12861.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
          Length = 518

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 33/102 (32%), Positives = 58/102 (56%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
           +++ +D+S+G ++  LE+ G   N++++F+SDNGG   +S    G     N+PL   K S
Sbjct: 291 LIEGMDKSLGDLMNWLEKNGEANNTIIIFMSDNGGLASESGWRDGKLHTQNYPLNSGKGS 350

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +EGG+R  +   W  ++  G    N + I D+ PT+  +AG
Sbjct: 351 TYEGGIREPMIVSWPGVVAPGSKCNNYLLIEDFYPTILEMAG 392


>gi|255014752|ref|ZP_05286878.1| arylsulfatase A [Bacteroides sp. 2_1_7]
 gi|410102839|ref|ZP_11297764.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
 gi|409237966|gb|EKN30761.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
          Length = 459

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG V+++L E GL  N+ V+F SDNG      L  H N  S  PL+  K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  W P      + + +M   D  PT   +AG E+  D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350


>gi|150008924|ref|YP_001303667.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937348|gb|ABR44045.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
          Length = 459

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 55/106 (51%), Gaps = 4/106 (3%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           +++ +D SVG V+++L E GL  N+ V+F SDNG      L  H N  S  PL+  K ++
Sbjct: 249 VVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG----PWLTEHENGGSAGPLKDGKGTW 304

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG R  A  W P      + + +M   D  PT   +AG E+  D
Sbjct: 305 WEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKD 350


>gi|405379201|ref|ZP_11033103.1| arylsulfatase A family protein [Rhizobium sp. CF142]
 gi|397324334|gb|EJJ28697.1| arylsulfatase A family protein [Rhizobium sp. CF142]
          Length = 560

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 35/103 (33%), Positives = 50/103 (48%), Gaps = 6/103 (5%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG +++ +E+ G   N++VVF +DNG        F        P +GTK +  EGG
Sbjct: 304 LDDSVGAILKCVEDAGETDNTIVVFTTDNGAEV-----FTWPDGGMTPFKGTKGTVMEGG 358

Query: 65  VRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            R  A + W   ++ G V   +    DW PTL   AG     D
Sbjct: 359 FRAPAIIRWPGKVKPGTVENGIFSALDWFPTLVAAAGNPNITD 401


>gi|293371631|ref|ZP_06618042.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292633328|gb|EFF51898.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 494

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 33/102 (32%), Positives = 58/102 (56%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT-ASNWPLRGTKYS 59
           +++ +D+S+G ++  LE+ G   N++++F+SDNGG   +S    G     N+PL   K S
Sbjct: 267 LIEGMDKSLGDLMNWLEKNGEANNTIIIFMSDNGGLASESGWRDGKLHTQNYPLNSGKGS 326

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            +EGG+R  +   W  ++  G    N + I D+ PT+  +AG
Sbjct: 327 TYEGGIREPMIVSWPGVVAPGSKCNNYLLIEDFYPTILEMAG 368


>gi|221234167|ref|YP_002516603.1| sulfatase family protein [Caulobacter crescentus NA1000]
 gi|220963339|gb|ACL94695.1| sulfatase family protein [Caulobacter crescentus NA1000]
          Length = 551

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 29/66 (43%), Positives = 45/66 (68%), Gaps = 5/66 (7%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+++LD +VG+V+Q+L++ GL  N+LV+F SDNGG      ++ G    N P RG K +F
Sbjct: 329 MVRNLDRNVGKVLQALKDRGLDDNTLVIFTSDNGGA-----NYIGLPDINKPYRGWKATF 383

Query: 61  HEGGVR 66
            EGG++
Sbjct: 384 FEGGIK 389


>gi|357061552|ref|ZP_09122301.1| hypothetical protein HMPREF9332_01859 [Alloprevotella rava F0323]
 gi|355373735|gb|EHG21044.1| hypothetical protein HMPREF9332_01859 [Alloprevotella rava F0323]
          Length = 513

 Score = 57.8 bits (138), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 57/102 (55%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLG-NSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
           +++ +D+S+G ++  LE+      N++++FISDNGG  V       +   NWP R  K S
Sbjct: 285 LVEGMDKSLGDLMDFLEQHPKTAQNTIIIFISDNGGQGVAVRQGREDYDQNWPARAGKGS 344

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            + GG++  +   W  + + G V +N + I D+ P+L  +AG
Sbjct: 345 AYLGGIKEPMIVYWPKVTKAGTVNDNRVMIEDFFPSLLEMAG 386


>gi|229818890|ref|YP_002880416.1| sulfatase [Beutenbergia cavernae DSM 12333]
 gi|229564803|gb|ACQ78654.1| sulfatase [Beutenbergia cavernae DSM 12333]
          Length = 464

 Score = 57.8 bits (138), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 21/107 (19%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           L+ +D  +  ++ +LEE GL  +++VV+++DNGG T        N  SN PL GTKY+  
Sbjct: 258 LELMDAEIAALLDTLEERGLADDTIVVYLTDNGGSTC-------NYGSNAPLAGTKYTLW 310

Query: 62  EGGVRNVAALWSPLLRK--------GQVLENLMHITDWLPTLYFIAG 100
           EGG+R       P L +        G+    L+   D +PTL   AG
Sbjct: 311 EGGIR------VPFLVRWPGGGWAGGRTTTALVSSLDLVPTLVAAAG 351


>gi|261880234|ref|ZP_06006661.1| N-acetylgalactosamine-6-sulfatase [Prevotella bergensis DSM 17361]
 gi|270333067|gb|EFA43853.1| N-acetylgalactosamine-6-sulfatase [Prevotella bergensis DSM 17361]
          Length = 513

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 31/102 (30%), Positives = 57/102 (55%), Gaps = 2/102 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLG-NSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYS 59
           +++ +D+S+G ++  LE    +  N++++F+SDNGG  ++      N   N+P RG K S
Sbjct: 294 LIEGMDKSLGDIMNYLEARPEVARNTIIIFMSDNGGQAINIRQGRANYDQNYPARGGKGS 353

Query: 60  FHEGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
             EGG+   +   W  +++ G    N + I D+ PT+  +AG
Sbjct: 354 ALEGGIHEPMLVSWPGVIKGGTTNGNRVMIEDFYPTIIEMAG 395


>gi|399043292|ref|ZP_10737678.1| arylsulfatase A family protein [Rhizobium sp. CF122]
 gi|398058225|gb|EJL50129.1| arylsulfatase A family protein [Rhizobium sp. CF122]
          Length = 561

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 6/97 (6%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG +++ +E+ G   N++V+F +DNG        F        P +GTK +  EGG
Sbjct: 305 LDDSVGALMKCVEDAGETDNTIVIFTTDNGAEV-----FTWPDGGMTPFKGTKGTVMEGG 359

Query: 65  VRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            R  A + WS  ++ G V   +    DW PTL   AG
Sbjct: 360 FRAPAIIRWSGKVKPGTVENGIFSALDWFPTLAAAAG 396


>gi|149197012|ref|ZP_01874065.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
 gi|149140122|gb|EDM28522.1| putative secreted sulfatase ydeN precursor [Lentisphaera araneosa
           HTCC2155]
          Length = 463

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 56/100 (56%), Gaps = 10/100 (10%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++SLDESVGRV+  L+E G+  N++V+F  DNG   V + +  G       L+G K   
Sbjct: 257 MVQSLDESVGRVLDKLDELGIADNTIVIFTGDNGA--VGTNYCGG-------LKGAKALS 307

Query: 61  HEGGVRNVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           HEGGVR    +  P +  G     +M  TD+ PT+  +AG
Sbjct: 308 HEGGVREPFFIKGPGIEPGVSSVPVM-ATDFYPTILDLAG 346


>gi|383110969|ref|ZP_09931787.1| hypothetical protein BSGG_2074 [Bacteroides sp. D2]
 gi|313694539|gb|EFS31374.1| hypothetical protein BSGG_2074 [Bacteroides sp. D2]
          Length = 450

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 40/105 (38%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNG-GPTVDSLHFHGNTASNWPLRGTKYS 59
           +L  +D+ +GR I++L++ GL  N++V+F SDNG  P+  S       A    LRGTK S
Sbjct: 241 VLAEMDKQIGRFIKALDDMGLSENTIVIFTSDNGPAPSFQS-------ARAAYLRGTKNS 293

Query: 60  FHEGGVRNVAALWSP-LLRKGQV-LENLMHITDWLPTLYFIAGKE 102
            +EGG+R    +  P  ++ GQV  E+++   D  P+L  IAG E
Sbjct: 294 LYEGGIRMPFLIKYPKKIKAGQVNNESVLCAVDLYPSLCAIAGIE 338


>gi|344339408|ref|ZP_08770337.1| sulfatase [Thiocapsa marina 5811]
 gi|343800712|gb|EGV18657.1| sulfatase [Thiocapsa marina 5811]
          Length = 653

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 6/100 (6%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K +D+++G V++ LE+ G L N++VVF +DNG  T+   +  G      P RG K +  
Sbjct: 387 MKQMDDNIGYVLKKLEDMGELDNTIVVFTTDNGAETIT--YPDGGVT---PFRGGKLTTW 441

Query: 62  EGGVRN-VAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R  +   W   +  G   + +    DW+PT   IAG
Sbjct: 442 EGGMRAPMLVRWPGHVEAGSNPDQIFSSLDWVPTFVNIAG 481


>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
           queenslandica]
          Length = 523

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 36/100 (36%), Positives = 54/100 (54%), Gaps = 5/100 (5%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           +K LD SVG+++ +L+  G+  ++LV+F SDNGG T    +   +  SN P    K +  
Sbjct: 261 VKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGAT----YAKESGGSNSPFLCGKETTF 316

Query: 62  EGGVRNVAALWSP-LLRKGQVLENLMHITDWLPTLYFIAG 100
           EGG+R     W P  ++ GQV   L  + DW  T   +AG
Sbjct: 317 EGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLAG 356


>gi|409437884|ref|ZP_11264988.1| Sulfatase [Rhizobium mesoamericanum STM3625]
 gi|408750579|emb|CCM76148.1| Sulfatase [Rhizobium mesoamericanum STM3625]
          Length = 560

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 6/103 (5%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD+SVG +++ +E+ G   N++V+F +DNG        F        P +GTK +  EGG
Sbjct: 304 LDDSVGALMKCVEDAGEADNTIVIFTTDNGAEV-----FTWPDGGMTPFKGTKGTVMEGG 358

Query: 65  VRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            R  A + W+  ++ G V   +    DW PTL   AG  +  D
Sbjct: 359 FRAPAIIRWTGKVKPGSVENGIFSALDWFPTLAAAAGNTEITD 401


>gi|340619473|ref|YP_004737926.1| sulfatase [Zobellia galactanivorans]
 gi|339734270|emb|CAZ97647.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 477

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 34/100 (34%), Positives = 57/100 (57%), Gaps = 9/100 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+ ++D++VG+++++L++ GL GN+LV+F SDNGG    S           PLR  K S+
Sbjct: 256 MVYAMDQNVGKLLKALKQNGLEGNTLVIFTSDNGGIRAIS--------EQSPLRAGKGSY 307

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIA 99
           +EGG+R  +   W   ++   +    +   D+ PTL  IA
Sbjct: 308 YEGGIRVPLIIKWPGKIQAQSISTAPVSHLDFYPTLQSIA 347


>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 486

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 39/100 (39%), Positives = 53/100 (53%), Gaps = 8/100 (8%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           ++ LD S G ++++L+E GL  N+LV+F SDNG  T       G   S  PL+G K S +
Sbjct: 260 IQELDWSTGEILRALKELGLDSNTLVIFTSDNGPNT-------GKGGSAGPLKGGKGSTY 312

Query: 62  EGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           EGGVR    A W   +  G   +  M   D  PTL  +AG
Sbjct: 313 EGGVRVPFVARWPGAIPAGIESDEAMTGMDLPPTLTRLAG 352


>gi|120613101|ref|YP_972779.1| sulfatase [Acidovorax citrulli AAC00-1]
 gi|120591565|gb|ABM35005.1| sulfatase [Acidovorax citrulli AAC00-1]
          Length = 453

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 35/107 (32%), Positives = 50/107 (46%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+  +DE +GR++ +L   GL  ++LVVF SDNGG            + NWPL G K   
Sbjct: 249 MIHHMDEGIGRIMDTLRAHGLERDTLVVFTSDNGGERF---------SDNWPLVGGKMDL 299

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKEKNID 106
            EGG+R    A W   +  G          DW  T+  + G + + D
Sbjct: 300 TEGGIRVPWIAHWPAAIAPGGTSAQPCLTMDWSATMLELGGAQPHPD 346


>gi|297180722|gb|ADI16930.1| arylsulfatase a and related enzymes [uncultured SAR406 cluster
           bacterium HF0010_18O13]
          Length = 585

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 6/103 (5%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M+K+LD S+GR++ SL++ GL  N+L++  SDNGG +   L     +  N P RG K + 
Sbjct: 363 MIKALDRSIGRILDSLQQNGLSENTLIILTSDNGGASYIEL-----SDINKPYRGWKLTH 417

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAGKE 102
            EGG+     A W   +  G     ++H  D   T+   A  E
Sbjct: 418 FEGGMHIPFMARWPKEIEAGTSFSPVVHHNDIFKTIAAAAKAE 460


>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
 gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 511

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHF-HGNTASNWPLRGTKYS 59
           M+ ++D  VG+++Q+L+E     N+L+VF+SDNGG      +F HG  A+N+PL+GTK  
Sbjct: 283 MVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGG------NFNHG--ANNYPLKGTKGD 334

Query: 60  FHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIA 99
             EGG R      W   ++K Q  +  +   D  PT   +A
Sbjct: 335 TWEGGYRVPMFFHWPKKIKKDQRFDFPVSSLDLYPTFTGLA 375


>gi|109899035|ref|YP_662290.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109701316|gb|ABG41236.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 724

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 35/98 (35%), Positives = 57/98 (58%), Gaps = 6/98 (6%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNT--ASNWPLRGTKY 58
           M+K  D+++G + Q+L E  +  N++++F SDNGG     +   GNT   SN+PL+G K 
Sbjct: 261 MVKQFDDAIGVLWQALVEAKVEKNTIIIFTSDNGGNMYTVV---GNTHATSNFPLKGGKA 317

Query: 59  SFHEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTL 95
           + +EGG++   A+  P L +   L N  +   D+ PTL
Sbjct: 318 TEYEGGLKVPTAVIWPGLTQPNTLSNTPIQTADFFPTL 355


>gi|149199413|ref|ZP_01876449.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149137491|gb|EDM25908.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 608

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 56/101 (55%), Gaps = 2/101 (1%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++S+D+S+G+VI +LE   L+  +++VF SDNGG  + S+       SN P RG K S 
Sbjct: 253 MVESMDDSIGKVIDALETNKLMEKTIIVFFSDNGG-NIHSVVDGTTATSNKPFRGGKASI 311

Query: 61  HEGGVRNVA-ALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           +EGG    A  +W    + G   ++L+   D   ++  +A 
Sbjct: 312 YEGGTHVPAIVVWPNQTKTGVRNDSLIQSEDLYASILEMAA 352


>gi|299148032|ref|ZP_07041095.1| arylsulfatase A [Bacteroides sp. 3_1_23]
 gi|298514215|gb|EFI38101.1| arylsulfatase A [Bacteroides sp. 3_1_23]
          Length = 520

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 10/104 (9%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLH-------FHGNTASNWPLRGTK 57
            D  VG ++ +LE+ GL  N+L++  SDN GP VD  +        +G++ +  PLRG K
Sbjct: 307 FDYCVGEILNTLEKLGLRENTLIILSSDN-GPVVDDGYDDKAEELLNGHSPAG-PLRGNK 364

Query: 58  YSFHEGGVRNVAAL-WSPLLRKGQVLENLMHITDWLPTLYFIAG 100
           YS  EGG R  A + W   ++KG   + L+   DWL +L  + G
Sbjct: 365 YSAFEGGTRIPAIVSWPAGVKKGMTSDLLVSQVDWLASLASLTG 408


>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
          Length = 330

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 8/76 (10%)

Query: 5   LDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFHEGG 64
           LD  VG+++  L E  ++ N+ V+F+SDNG   ++          N P RG K S  EGG
Sbjct: 258 LDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPIE--------GRNAPFRGGKGSLFEGG 309

Query: 65  VRNVAALWSPLLRKGQ 80
            ++ + ++SPLL+K +
Sbjct: 310 TKSSSFIYSPLLKKTE 325


>gi|393782821|ref|ZP_10371003.1| hypothetical protein HMPREF1071_01871 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672206|gb|EIY65676.1| hypothetical protein HMPREF1071_01871 [Bacteroides salyersiae
           CL02T12C01]
          Length = 904

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 30/65 (46%), Positives = 38/65 (58%), Gaps = 8/65 (12%)

Query: 2   LKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSFH 61
           LK +D  VG ++Q L +  LL N+LV+F SDNGG T           +N PLRG K S +
Sbjct: 269 LKHIDTGVGEILQCLRDNDLLNNTLVIFTSDNGGAT--------GVTTNTPLRGGKTSMY 320

Query: 62  EGGVR 66
           EGG R
Sbjct: 321 EGGNR 325


>gi|225013127|ref|ZP_03703540.1| sulfatase [Flavobacteria bacterium MS024-2A]
 gi|225002747|gb|EEG40730.1| sulfatase [Flavobacteria bacterium MS024-2A]
          Length = 509

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 61/107 (57%), Gaps = 10/107 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSL----HFHGNTASNWPLRGT 56
           M++S+DE++G V+  +EEE +   ++V+F+SDNGG +  +     H H     N PL   
Sbjct: 283 MIESMDEALGSVLDFVEEEQIEKETVVLFMSDNGGLSAVARGGRKHIH-----NAPLNSG 337

Query: 57  KYSFHEGGVRNVAALWSPLLRKGQVLENL-MHITDWLPTLYFIAGKE 102
           K S +EGG+R    ++ P + K   +E   + I D+ P+L  +AG E
Sbjct: 338 KGSIYEGGIRVPMMVYWPGVTKANSIEKTPVIIEDFFPSLLSVAGIE 384


>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
 gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
          Length = 468

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 10/101 (9%)

Query: 1   MLKSLDESVGRVIQSLEEEGLLGNSLVVFISDNGGPTVDSLHFHGNTASNWPLRGTKYSF 60
           M++ +D +VGRV+  L   GL  +++V+F SDNGG            + +WPL G K   
Sbjct: 263 MMEGMDANVGRVLAELAALGLQEDTIVIFTSDNGGERF---------SDSWPLTGMKGEL 313

Query: 61  HEGGVR-NVAALWSPLLRKGQVLENLMHITDWLPTLYFIAG 100
            EGG+R  +   W   L KG+    +    D LPTL   AG
Sbjct: 314 LEGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTLLSAAG 354


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.136    0.413 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,801,250,933
Number of Sequences: 23463169
Number of extensions: 69097476
Number of successful extensions: 164576
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3840
Number of HSP's successfully gapped in prelim test: 2499
Number of HSP's that attempted gapping in prelim test: 156458
Number of HSP's gapped (non-prelim): 6628
length of query: 107
length of database: 8,064,228,071
effective HSP length: 75
effective length of query: 32
effective length of database: 6,304,490,396
effective search space: 201743692672
effective search space used: 201743692672
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)