BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy1575
         (242 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 599

 Score =  273 bits (698), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 129/222 (58%), Positives = 159/222 (71%), Gaps = 7/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GWNDVGFHG   IPTPNIDALAYNG++LNRHY  PTCTPSRAA LTGKYP RYG+   P+
Sbjct: 45  GWNDVGFHGSIQIPTPNIDALAYNGVILNRHYVQPTCTPSRAALLTGKYPIRYGLQGFPI 104

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AGV  A+P+ EK+LPQYLK+LGYSTHL+GKWH+G NK +  P  RGFD+H GYWNG+++
Sbjct: 105 IAGVPLALPLNEKILPQYLKDLGYSTHLVGKWHLGANKNQHTPIKRGFDSHFGYWNGFIS 164

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPLFLQITH 200
           Y +S H T   VG DARR  ER   +M  +Y TD FTD++  VIK   NH +P+FL ++H
Sbjct: 165 YRNSTHSTGLMVGKDARRGFERAGDEMVDRYATDIFTDEANKVIKLCKNHDKPMFLMVSH 224

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            AVHTG  G       +L+V +   ND  F +I N +RRL+A
Sbjct: 225 LAVHTGVPG-----PNILEVSNKTHNDIRFDYIENKERRLYA 261


>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 575

 Score =  251 bits (640), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 121/223 (54%), Positives = 153/223 (68%), Gaps = 8/223 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDVGFHG   IPTPNIDALAYNG +LNRHY  PTCTPSRAA LTGKYP RYG+  P +
Sbjct: 44  GWNDVGFHGSIQIPTPNIDALAYNGAILNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPI 103

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +G A A+P  EK+LPQYLKELGYSTHL+GKWH+G  ++   P  RGFD+H GYWNGY++
Sbjct: 104 ASGKASALPTNEKILPQYLKELGYSTHLVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYIS 163

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSR-PLFLQIT 199
           Y +S H T    G+DARR  ER   +M   +Y TD FT+++  +I+S       +FL ++
Sbjct: 164 YRNSTHATRTMSGIDARRGFERAGNEMDRDRYATDVFTEEARKIIESSKRENTEMFLMVS 223

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H AVH+G +G        L+V +   ND  F +I N +RRL+A
Sbjct: 224 HLAVHSGNSG-----PNHLEVLNKTYNDEAFGYIENENRRLYA 261


>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
          Length = 563

 Score =  222 bits (565), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 110/221 (49%), Positives = 148/221 (66%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG N IPTPNID +A++G+ L+ +Y  P CTPSRAA +TGKYP   G+  T +
Sbjct: 60  GWNDVGFHGSNQIPTPNIDIMAWSGVSLHNYYVTPICTPSRAALMTGKYPIHTGMQHTVI 119

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+TEK+LPQYLKELGY THL+GKWH+G  K+E LP NRGFD+H+G+WNG + 
Sbjct: 120 FAAEPRGLPLTEKILPQYLKELGYKTHLVGKWHLGSYKKEYLPLNRGFDSHLGFWNGKID 179

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
             D  ++     G D RR+    A  +  +Y TD +T+++V +IKSHN S PLFL ++H+
Sbjct: 180 MYDHTNQEKGYWGFDFRRDFST-AHDLFGQYATDVYTNEAVKIIKSHNTSSPLFLMLSHS 238

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVHTG       P+  ++ P  E+    F HI +  RR FA
Sbjct: 239 AVHTGN------PSEPIRAP--EKLFVNFTHIQDFQRRKFA 271


>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
          Length = 563

 Score =  216 bits (550), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 104/222 (46%), Positives = 146/222 (65%), Gaps = 8/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG + IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+   P+
Sbjct: 50  GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  +A+P+   LLP+YL++LGY+THL+GKWH+G   +   P  RGFD  +GY++GY+T
Sbjct: 110 KAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGYIT 169

Query: 142 YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y     E +  VG D   ++    + + S +Y+TD  T+++  +I +HN S+PL+LQ++H
Sbjct: 170 YFKHTIEQNLHVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDIIFNHNRSKPLYLQLSH 229

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A H+  A         ++V D EE + T  +I + DRR  A
Sbjct: 230 VAAHSSDA------KANMEVRDEEETNATLGYIEDFDRRKLA 265


>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
 gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
          Length = 533

 Score =  215 bits (548), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 108/221 (48%), Positives = 149/221 (67%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+HG ++IPTPNIDALAYNGI+LNR+Y LP CTPSR+A +TG++P   G+   V 
Sbjct: 21  GWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMTGRHPIHNGMQHRVL 80

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            GV  + +P+TEKLLP+YL++LGYSTH++GKWH+G  K+E  P  RGF++H+G+W G+  
Sbjct: 81  FGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYRGFESHIGFWTGHQD 140

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D   E +   GLD R  M + A  +  +Y T  +T +SV +IK++N ++PLFL + HA
Sbjct: 141 YYDHTAEEERLWGLDMRHGM-KPAWYLHGEYSTHVYTRESVKIIKNYNSTKPLFLYVAHA 199

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+G   N       L  PD   +     HI N +RR +A
Sbjct: 200 AVHSGNKYNP------LPAPDKTVD--KLDHIQNYNRRRYA 232


>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 546

 Score =  214 bits (546), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 102/222 (45%), Positives = 149/222 (67%), Gaps = 8/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG N IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+   P+
Sbjct: 35  GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  +A+P+   LLP+YL++LGY+THL+GKWH+G   +   P  RGFD   GY+NGY++
Sbjct: 95  KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYNGYIS 154

Query: 142 YNDSIHETDFAVGLDAR-RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y +   + +  VG D    N +  +   + +Y+TD  T+++ ++IK+H+  +PL+LQ++H
Sbjct: 155 YFNHTIKQNNHVGYDLHYHNSKNLSVAYNFEYITDLITERAENIIKNHDRRKPLYLQLSH 214

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            AVH+  A        +++V D +E + T  +I + +RR +A
Sbjct: 215 LAVHSSDAKE------VMEVRDEQETNATLEYIEDYNRRKYA 250


>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 567

 Score =  214 bits (546), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 109/226 (48%), Positives = 145/226 (64%), Gaps = 15/226 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWND+ FHG ++IPTPNIDALA+NGIVLN  YT P CTPSR A +TGKYP + G+  P  
Sbjct: 39  GWNDLSFHGSDEIPTPNIDALAFNGIVLNNLYTQPVCTPSRVALMTGKYPIKLGMQGPPT 98

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G     +P++EKLLP+YL+ELGY+T  IGKWH+G  K+   P  RGFD+H GY+ GY++
Sbjct: 99  YGAEPNGLPLSEKLLPEYLRELGYTTRAIGKWHLGFYKQAYTPTRRGFDSHFGYYTGYVS 158

Query: 142 YNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y D + +  +       G D RRN +  A  +  KY TD FTD++V +IK    ++PLF+
Sbjct: 159 YYDYLLQDVYQNFGEFQGFDMRRN-DTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFM 217

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H AVHTG  G        L+ P  E N   F HI +P+RR++A
Sbjct: 218 YLAHVAVHTGNRGK------YLEAPQSEVNK--FNHILDPNRRIYA 255


>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 610

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 109/228 (47%), Positives = 147/228 (64%), Gaps = 18/228 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+ FHG + I TPN+DALAYNG++LN  Y LP CTPSRAA +TG YP   G+   P+
Sbjct: 52  GWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTGMYPIHNGMQGLPL 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   +A+P   KLLP YLK+LGY+T ++GKWH+G  ++E  P  RGFD+H+GYWNG ++
Sbjct: 112 EASEPRALPAG-KLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRGFDSHLGYWNGIVS 170

Query: 142 YNDSIHETD-------FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y D I + D          G D RRN+   A  +  +Y T+ FTD++VH+I+SHN + PL
Sbjct: 171 YYDYILQEDDNRKPRSSLNGFDMRRNITP-AYDLQGRYATEMFTDEAVHLIRSHNKNTPL 229

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           FL ++H AVH G  G        L+ P  +E    F HI++P+RR FA
Sbjct: 230 FLYMSHLAVHAGNPGK------FLEAP--QEAINKFLHIADPNRRTFA 269


>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
          Length = 563

 Score =  214 bits (544), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 102/222 (45%), Positives = 146/222 (65%), Gaps = 8/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GWNDV FHG + IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+   P+
Sbjct: 50  GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGHPL 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  +A+P+   LLP+YL++LGY+THL+GKWH+G   +   P  RGFD  +GY++G++T
Sbjct: 110 DPGEVRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGFMT 169

Query: 142 YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y +   E +  VG D   ++    + + S +Y+TD  T+++  +I +HNHS+PL+LQ++H
Sbjct: 170 YFNHTIEQNHHVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDIILNHNHSKPLYLQLSH 229

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A H+            ++V D EE + T  +I + DRR  A
Sbjct: 230 IAAHSSNINKT------VEVRDEEETNATLGYIEDFDRRKLA 265


>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
          Length = 528

 Score =  212 bits (540), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 111/235 (47%), Positives = 149/235 (63%), Gaps = 10/235 (4%)

Query: 9   VAKAVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLT 68
           V + + +   L   GWNDVGFHG   IPTPNIDALAY+G++L+R+Y  P CTPSR+A +T
Sbjct: 7   VQRTLSLILLLAVHGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMT 66

Query: 69  GKYPFRYGIDTPVGAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
           GKYP   G+   V  G   + +P+ EKLLP+YL+ELGY+TH++GKWH+G  K+E  P  R
Sbjct: 67  GKYPIHTGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYR 126

Query: 128 GFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
           GFD H+G+W G+  Y D     +   GLD RR M+  A  +  +Y TD FT ++V +I +
Sbjct: 127 GFDTHIGFWTGHHDYFDHTAVENPYWGLDIRRGMQP-AWDLHGQYSTDIFTKEAVRLIDN 185

Query: 188 HNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           HN SRP+FL + HAAVH+G       P   L  PD  E    F +I + +RR FA
Sbjct: 186 HNSSRPMFLYLAHAAVHSGN------PYNPLPAPD--EEVAKFNNIFDYNRRRFA 232


>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 571

 Score =  212 bits (540), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 106/223 (47%), Positives = 148/223 (66%), Gaps = 8/223 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
            GWNDVGFHG N+IPTPNIDALAY G++LNRHY LPTCTPSR AFLTG++P R G+   P
Sbjct: 36  MGWNDVGFHGSNEIPTPNIDALAYGGVILNRHYALPTCTPSRTAFLTGRHPIRMGLQGIP 95

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +     + VP+ E+LLP+YL+ELGY T L+GKWH+G   ++  P  RGFD+ VGY+ G +
Sbjct: 96  MNVAEPRGVPLHERLLPEYLRELGYVTRLVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVI 155

Query: 141 TYNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           TY +     D   G+D   +   +  P  + +Y+TDF +DQ+  VIK+H+  +PLFLQ+ 
Sbjct: 156 TYFNHTVTKDKHTGIDYHWDTSGKIEPFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLA 215

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H A H   A   + P   ++V +M E + T ++I + +RR +A
Sbjct: 216 HVAAH---ASENRDP---IEVRNMTEVNDTLSYIPDINRRKYA 252


>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 562

 Score =  210 bits (535), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 146/222 (65%), Gaps = 8/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG N IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+   P+
Sbjct: 50  GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  +A+P+   LLP+YL++LGY+THL+GKWH+G   +   P  RGFD   GY++GY++
Sbjct: 110 KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYSGYIS 169

Query: 142 YNDSIHETDFAVGLDAR-RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y +   + D  +G D    N +  +   + +Y TD  T+++ ++IK+H+  +PL+LQ+ H
Sbjct: 170 YFNHTIKQDDHIGYDLHYDNSKNLSIDYNFEYTTDLITERAENIIKNHDRRKPLYLQLCH 229

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A H+  A        +++V D +E + T  +I + +RR +A
Sbjct: 230 LAAHSSDAKE------VMEVRDEQETNATLKYIEDYNRRKYA 265


>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
          Length = 519

 Score =  210 bits (534), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 108/229 (47%), Positives = 146/229 (63%), Gaps = 10/229 (4%)

Query: 15  VTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR 74
           + + +  +GWNDVGFHG   IPTPNIDALAY+G++L+R+Y    CTPSR+A +TGKYP  
Sbjct: 2   IWQVVFFEGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIH 61

Query: 75  YGIDTPVGAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            G+   +  G   + +P+ EK+LP+YL+ELGYSTH++GKWH+G  K E  P  RGFD H+
Sbjct: 62  TGMQHSILKGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHI 121

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           GYW G+  Y D     +   GLD RR M + A  +  +Y TD FT ++V +I +HN SRP
Sbjct: 122 GYWTGHHDYYDHTAVENPYWGLDMRRGM-KPAWDLHGEYSTDVFTKEAVKLINNHNSSRP 180

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +FL + HAAVH+G       P   L  PD  E    F +I + +RR FA
Sbjct: 181 MFLYLAHAAVHSGN------PYNPLPAPD--EEVAKFNNIFDYNRRRFA 221


>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 545

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 100/189 (52%), Positives = 131/189 (69%), Gaps = 2/189 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG   IPTPNIDALAY+G++LNR+Y  P CTPSR+A +TGKYP   G+   V 
Sbjct: 39  GWNDVGFHGSGQIPTPNIDALAYSGLILNRYYVSPICTPSRSALMTGKYPIHTGMQRGVL 98

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G   + +P+ EKLLP+YL+ELGY TH++GKWH+G   +E  P  RGF++H+GYW G+  
Sbjct: 99  KGAEPRGLPLKEKLLPEYLRELGYRTHIVGKWHLGFYTKEYTPTYRGFESHLGYWTGHQD 158

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D     +   G+D RRNME  A  +  +Y TD FT ++V +IKSHN S+P+FL + HA
Sbjct: 159 YYDHSAVEEPYWGMDMRRNMEP-AWDLHGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHA 217

Query: 202 AVHTGTAGN 210
           AVH+    N
Sbjct: 218 AVHSANPYN 226


>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 545

 Score =  209 bits (531), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 105/221 (47%), Positives = 144/221 (65%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVGFHG   IPTPNIDALAY+G++L+R+Y  P CTPSR+A +TGK+P   G+   V 
Sbjct: 38  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKHPIHTGMQHGVL 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ EKLLPQYL+ELGYSTH++GKWH+G   +E  P  RGFD+H+G+W+G+  
Sbjct: 98  KCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D         GLD RR +   A  +  +Y TD FT ++V +I  HN SRP+FL ++HA
Sbjct: 158 YFDHSAVESPYWGLDMRRGLNS-AWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLSHA 216

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+G + N         +P  +++   F +I N +RR FA
Sbjct: 217 AVHSGNSYNP--------LPAPDQDVAKFTNIFNYERRRFA 249


>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
          Length = 545

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/221 (48%), Positives = 144/221 (65%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVGFHG   IPTPNIDALAY+G++L+R+Y  P CTPSR+A +TGK+P   G+   V 
Sbjct: 38  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKHPIHTGMQHGVL 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ EKLLPQYL+ELGYSTH++GKWH+G   +E  P  RGFD+H+G+W+G+  
Sbjct: 98  KCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D         GLD RR +   A  +  +Y TD FT ++V +I  HN SRP+FL + HA
Sbjct: 158 YFDHSAVESPYWGLDMRRGLNS-AWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLPHA 216

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+G + N       L VPD  ++   F +I N +RR FA
Sbjct: 217 AVHSGNSYNP------LPVPD--QDVAKFTNIFNYERRRFA 249


>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 106/224 (47%), Positives = 147/224 (65%), Gaps = 10/224 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDV FHG ++IPTPNIDALAYNG++LNRHY LP CTPSR AFLTGKYP R G+   V 
Sbjct: 25  GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRTAFLTGKYPIRTGMQGYVL 84

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ + LLP+YL++LGY+THL+GKWH+G + +   P  RGFD  +GY+NGY+ 
Sbjct: 85  QPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYTPTRRGFDTFLGYYNGYIH 144

Query: 142 Y-NDSI-HETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y N +I  E    +G D  R + E    +    Y+TD  TD+  ++I SHN ++PL+LQ+
Sbjct: 145 YFNHTILDEEQKYLGYDFHRIVGENRTIEYRYDYITDIITDEVENIIFSHNPAKPLYLQV 204

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +H A H+G  G        +QV D +E + T  +I + +RR +A
Sbjct: 205 SHDAAHSGGIGIE------MQVRDWKETNATLGYIEDINRRKYA 242


>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
          Length = 544

 Score =  207 bits (528), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 107/221 (48%), Positives = 143/221 (64%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVGFHG   IPTPNIDALAY+G++L+R+Y  P CTPSR+A +TGKYP   G+   V 
Sbjct: 37  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSRSALMTGKYPIHTGMQHGVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ EKLLP+YLKELGY TH++GKWH+G   ++  P  RGFD+H+G+W+G+  
Sbjct: 97  KCAEPRGLPLQEKLLPEYLKELGYRTHIVGKWHLGFYTKQYTPTYRGFDSHIGFWSGHQD 156

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D         GLD RR ME  A  +  +Y TD FT ++V +I +HN S+PLFL + HA
Sbjct: 157 YFDHTAVESPYWGLDMRRGMEA-AWDLHGQYSTDVFTSEAVKLINNHNDSKPLFLYLAHA 215

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+G       P   L  PD++     F +I + +RR FA
Sbjct: 216 AVHSGN------PYDPLPAPDVDV--AKFTNIFDYNRRRFA 248


>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 532

 Score =  206 bits (524), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 107/222 (48%), Positives = 142/222 (63%), Gaps = 10/222 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           +GWNDVGFHG   IPTPNIDALAY+G++L+R+Y  P CTPSR+A +TGKYP   G+   V
Sbjct: 25  EGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGV 84

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
             G   + +P+ EK+LP+YL++LGYSTH++GKWH+G   +E  P  RGF +H G+W G+ 
Sbjct: 85  LKGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQ 144

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D         GLD RR+ME  A  +  +Y TD FT +++ +I  HN SRPLFL + H
Sbjct: 145 DYFDHTAVESPYWGLDMRRDMEP-AWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAH 203

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AAVH+G       P   L  PD  E    F +I + +RR FA
Sbjct: 204 AAVHSGN------PYNPLPAPD--EEVAKFDNIFDYNRRRFA 237


>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  206 bits (523), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 100/223 (44%), Positives = 148/223 (66%), Gaps = 10/223 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG ++IPTPNIDALAYNG++LNRHY LP CTPSR AFLTGKYP R G+   P+
Sbjct: 15  GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTGKYPIRTGMQGYPL 74

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + + +   LLP+YL++LGY+THL+GKWH+G +     P +RGFD   GY+NGY+ 
Sbjct: 75  QGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRGFDTFAGYYNGYIQ 134

Query: 142 Y-NDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           Y N +++E++  +G D  R + + +  +    Y+TD  TD++ ++I SHN ++PL+LQ+ 
Sbjct: 135 YFNHTLYESE-QLGYDLHRIIGDDHKIEYRYDYMTDLITDEAENIISSHNPAKPLYLQVA 193

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H A H+  A         ++V + +E + T  +I + +RR +A
Sbjct: 194 HLAAHSSDAEEE------MEVRNWKETNATLGYIEDINRRKYA 230


>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
          Length = 543

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 103/221 (46%), Positives = 139/221 (62%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG   IPTPNIDALAY+G++L  +Y  P CTPSR+A +TGKYP   G+   V 
Sbjct: 34  GWNDVGFHGSGQIPTPNIDALAYSGLILQNYYVTPICTPSRSALMTGKYPIHTGMQHTVL 93

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G   + +P+TEK+LP+YL+ELGY+  L+GKWH+G   +E  P  RGFD+H+GYW G+  
Sbjct: 94  FGAEPRGLPLTEKILPEYLRELGYTNRLVGKWHLGSYTKEYTPLYRGFDSHLGYWTGHQD 153

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D     +   G D RRNM+  A  +  +Y TD FT ++V +I++HN + PLFL + H 
Sbjct: 154 YYDHTAVENPGWGFDMRRNMD-LAYDLHGQYSTDVFTQEAVKIIENHNTTNPLFLYLAHV 212

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+        P   L  PD  E    F++I +  R+ FA
Sbjct: 213 AVHSAN------PYNPLPAPD--ETVEKFSNIPSYKRQRFA 245


>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 543

 Score =  204 bits (519), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 103/221 (46%), Positives = 144/221 (65%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVGFHG   IPTPNIDALAY+G++L+R+Y  P CTPSR+A +TGK+P   G+   V 
Sbjct: 36  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSRSALMTGKHPIHTGMQHGVL 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ EKLLP+Y ++LGYSTH++GKWH+G   +E  P  RGFD+H+G+W+G+  
Sbjct: 96  KCAEPRGLPLHEKLLPEYFRDLGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D     +   GLD RR +E  A  +  +Y TD FT ++V +I +HN SRP+FL + HA
Sbjct: 156 YFDHSAVEEPYWGLDMRRGLEP-AWDLHGQYSTDVFTKEAVKLIDNHNTSRPMFLYLAHA 214

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+G   N         +P  +++   F +I N +RR FA
Sbjct: 215 AVHSGNPYNP--------LPAHDQDVAKFTNIFNYNRRRFA 247


>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
          Length = 513

 Score =  203 bits (517), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 106/225 (47%), Positives = 140/225 (62%), Gaps = 11/225 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G NDVGF+G   IPTP+IDALAYNGI+L+R YT  +CTPSRAA LTG+YP R G+   P+
Sbjct: 6   GRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGMQGLPI 65

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  K++P+    +PQYLK LGY THL+GKWH+G    E  P  RGFD+H GYWNG++ 
Sbjct: 66  RAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYWNGFVG 125

Query: 142 YND--SIHETDFAVGLDARRNMERYAP--QMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           Y +  +++E      +      +   P  Q   KY TD FT +++ +I  HN  +PLFL 
Sbjct: 126 YFNYTAVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNSEKPLFLV 185

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + H A HTG  G        L VPD+ + +  F+ I NP RRL+A
Sbjct: 186 LAHLAGHTGEDGVE------LGVPDVAQAETRFSFIKNPKRRLYA 224


>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
 gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
          Length = 559

 Score =  202 bits (515), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 113/249 (45%), Positives = 151/249 (60%), Gaps = 27/249 (10%)

Query: 6   GAGVAKAVPVTEKLL--PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSR 63
           G G A A P    +L    G+NDV FHG N I TPNIDALAYNGI+LNRHY    CTPSR
Sbjct: 20  GEGEASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSR 79

Query: 64  AAFLTGKYPFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG 116
           A  LTGKYP   G+       D P G      +P  E+L+P+  ++ GY+THL+GKWH+G
Sbjct: 80  ATLLTGKYPIHTGMQHFVIVTDEPWG------LPRQERLMPELFRDAGYATHLVGKWHLG 133

Query: 117 CNKEELLPFNRGFDNHVGYWNGYLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYL 173
             +++L P  RGFD+H GY+NGY+ Y D    + + +++ GLD RR++E    +    Y 
Sbjct: 134 FWRKDLTPTMRGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEP-CREAEGTYA 192

Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
           T+ FT ++  VI+ H+ SRPLF+ ++H AVHTG   N       +Q P  EE    FAHI
Sbjct: 193 TEAFTTEARKVIERHDKSRPLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFAHI 244

Query: 234 SNPDRRLFA 242
            +P RR +A
Sbjct: 245 RDPKRRTYA 253


>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
          Length = 301

 Score =  202 bits (515), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 108/226 (47%), Positives = 139/226 (61%), Gaps = 15/226 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV FHG + IPTPNIDALAYNGI+LN HY    CTPSR+A +TGK P   G+   V 
Sbjct: 85  GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 144

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P++EKLLPQYL+E+GY TH +GKWH+G  K++  P  RGFD+H GYWNG   
Sbjct: 145 YPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRGFDSHFGYWNGLED 204

Query: 142 YNDSI-HETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y   I  E D       G D RRN+   A   + KY TD FT+++V +I  H+  RP+FL
Sbjct: 205 YYTHIAQEPDSQYNEYKGFDMRRNLT-VAWDTAGKYATDLFTNEAVRLINEHDTERPMFL 263

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H AVH G          LL+ PD  E    F++I +P+RR+ A
Sbjct: 264 YLAHLAVHKGNENQ------LLRAPD--EEIAKFSYILDPERRIQA 301


>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 646

 Score =  202 bits (513), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 109/223 (48%), Positives = 141/223 (63%), Gaps = 14/223 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGFHG N+IPTPNIDALAYNG++LN HYT   CTPSR+AFLTGKYP   G+   V 
Sbjct: 34  GFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAFLTGKYPIHLGMQHLVI 93

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E +LPQYLK  GY+TH IGKWH+G  ++E  P  RGFD+H GYW G   
Sbjct: 94  LEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPTYRGFDSHFGYWQGLQD 153

Query: 142 -YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y  ++H T    G D RRNM   ++ Q   KY T  FTD++V +I+ HN   P+F+ + 
Sbjct: 154 YYKHTVHFTP-EHGYDMRRNMTVDWSAQ--GKYSTTLFTDEAVRLIREHNTENPMFMYLA 210

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H A H+G   +       LQ PD  E    F HI++P+RR++A
Sbjct: 211 HLAPHSGNDDDP------LQAPD--EEIAKFGHIADPERRIYA 245


>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 713

 Score =  202 bits (513), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 107/226 (47%), Positives = 138/226 (61%), Gaps = 15/226 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV FHG + IPTPNIDALAYNGI+LN HY    CTPSR+A +TGK P   G+   V 
Sbjct: 83  GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 142

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P++EKLLP+YL+E+GY TH +GKWH+G  K+E  P  RGFD+H GYWNG   
Sbjct: 143 FPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 202

Query: 142 YNDSI-HETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y   I  E D A     G D RRN+   A     KY TD FT++++ +I  H+  RP+FL
Sbjct: 203 YYTHITQEPDPAFSEFKGFDMRRNLT-VAWDTVGKYSTDLFTNEAIRLINEHDTDRPMFL 261

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H AVH G          L + PD  E    F++I +P+RR+ A
Sbjct: 262 YLAHLAVHKGNEEQ------LFRAPD--EEIAKFSYILDPERRIQA 299


>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
          Length = 546

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 108/237 (45%), Positives = 149/237 (62%), Gaps = 12/237 (5%)

Query: 9   VAKAVPVTEKLLPQ--GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAF 66
           VA A P    +L    GWNDVGFHG + IPTPNIDALAY G++L+R+Y  P CTPSR+A 
Sbjct: 23  VASARPHIVFILADDLGWNDVGFHGLSQIPTPNIDALAYTGLLLDRYYVSPICTPSRSAL 82

Query: 67  LTGKYPFRYGIDTPV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPF 125
           +TGK+P   G+   V      + +P+ EKLLP+YL+ LGYSTH++GKWH+G   +E  P 
Sbjct: 83  MTGKHPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMVGKWHLGFYTKEYTPT 142

Query: 126 NRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
            RGFD+H+G+W+G+  Y D     +   GLD RR +E  A  +  +Y TD FT ++V +I
Sbjct: 143 YRGFDSHLGFWSGHHDYFDHTAVEEPYWGLDMRRGLEP-AWDLHGQYSTDVFTKEAVRLI 201

Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +HN SRP+FL ++HAAVH+G   N         +P  + +   F  I + +RR FA
Sbjct: 202 DNHNTSRPMFLYLSHAAVHSGNPYNP--------LPAHDHDVAKFPKILDYNRRRFA 250


>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
 gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
          Length = 542

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 106/230 (46%), Positives = 144/230 (62%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDV FHG N I TPNIDALAYNG++LN+HY    CTPSRA  LTGKYP   G+     
Sbjct: 2   GMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWVI 61

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P+  +E GYSTHL+GKWH+G  +++L P  RGFD+H GY
Sbjct: 62  ITDEPWG------LPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGY 115

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D    +  T+++ GLD RR+ E   P+ +  Y T+ FT ++  +I+ H+ S+
Sbjct: 116 YNGYIDYYDHQVRLLGTNYSAGLDFRRDFEP-NPKANGTYATEAFTSEAKRIIEEHDKSK 174

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHTG   N       +Q P  EE    F+HI +P RR +A
Sbjct: 175 PLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFSHIKDPKRRTYA 216


>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
          Length = 648

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 108/224 (48%), Positives = 138/224 (61%), Gaps = 14/224 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGFHG N+IPTPNIDALAYNG++LN HYT   CTPSR+AFLTGKYP   G+   V 
Sbjct: 34  GFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAFLTGKYPIHLGMQHLVI 93

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E +LPQYLK  GY+TH IGKWH+G  ++E  P  RGFD+H GYW G   
Sbjct: 94  LEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPTYRGFDSHFGYWQGLQD 153

Query: 142 YNDSIHETDFAV--GLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y        F    G D RRNM   ++ Q   KY T  FTD++V +I+ HN   P+F+ +
Sbjct: 154 YYKHTVHATFTPEHGYDMRRNMTVDWSAQ--GKYSTTLFTDEAVRLIREHNTENPMFMYL 211

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            H A H+G   +       LQ PD  E    F HI++P+RR++A
Sbjct: 212 AHLAPHSGNDDDP------LQAPD--EEIAKFGHIADPERRIYA 247


>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 575

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 102/221 (46%), Positives = 143/221 (64%), Gaps = 13/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDVGFHG N IPTPNIDAL YNGI+LNRHY LP+ TPSR+AFLTG YP R G+    +
Sbjct: 38  GWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLYPIRIGMQGDGI 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  K+LP++L++LGY+T LIGKWH+G +  +  P +RGFD  +G++N +++
Sbjct: 98  RGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFDFFLGFYNSHIS 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +      G D  R  +  A  ++ +Y TD FT ++V +I++H   RPL+LQI+H 
Sbjct: 158 YYDYHYSNQNMSGYDLHRG-DDPAHGINREYATDLFTKEAVRMIETHELPRPLYLQISHL 216

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             L+ P  E ND  F++I  P+RR +A
Sbjct: 217 AVHAP-----------LEQPRDEYNDGRFSYIREPNRRKYA 246


>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
 gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
          Length = 577

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 112/249 (44%), Positives = 151/249 (60%), Gaps = 27/249 (10%)

Query: 6   GAGVAKAVPVTEKLL--PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSR 63
           G G A A P    +L    G+NDV FHG N I TPNIDALAYNGI+LNRHY    CTPSR
Sbjct: 20  GEGEASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSR 79

Query: 64  AAFLTGKYPFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG 116
           A  LTGKYP   G+       D P G      +P  E+L+P+  ++ GY+THL+GKWH+G
Sbjct: 80  ATLLTGKYPIHTGMQHFVIVTDEPWG------LPRQERLMPELFRDAGYATHLVGKWHLG 133

Query: 117 CNKEELLPFNRGFDNHVGYWNGYLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYL 173
             +++L P  RGFD+H GY+NGY+ Y D    + + +++ GLD RR++E    +    Y 
Sbjct: 134 FWRKDLTPTMRGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEP-CREAEGTYA 192

Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
           T+ FT ++  VI+ H+ +RPLF+ ++H AVHTG   N       +Q P  EE    FAHI
Sbjct: 193 TEAFTTEARKVIERHDKNRPLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFAHI 244

Query: 234 SNPDRRLFA 242
            +P RR +A
Sbjct: 245 RDPKRRTYA 253


>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 579

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 100/221 (45%), Positives = 140/221 (63%), Gaps = 13/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG N IPTPNIDAL YNGI+LNRHY LP+ TPSR AF TG+YP R G+    +
Sbjct: 42  GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  K+LP++L+ LGY+T LIGKWH+G    +  P +RGFD  +G++N Y++
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +      G D  R  +  A  M+ +Y TD FT +++++I++H  +RPL+LQ++H 
Sbjct: 162 YYDYNYSNQNMSGYDMHRG-DDPAYGMNREYATDMFTSEAINIIENHELNRPLYLQLSHL 220

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+            L+ P    NDR   HI  P+RR +A
Sbjct: 221 AVHSP-----------LEQPANVYNDREPIHIREPNRRKYA 250


>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 557

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 96/222 (43%), Positives = 142/222 (63%), Gaps = 8/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV F+G ++IPTPNID+LAYNG++LNRHY LP CTPSR AF TG+YP R G+   P+
Sbjct: 48  GWNDVSFNGGDEIPTPNIDSLAYNGVILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPL 107

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                +++P+   LLPQYL++LGY+THL+GKWH+G       P NRGFD  +GY++GY+ 
Sbjct: 108 QGAEPRSIPLNNILLPQYLRKLGYATHLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIE 167

Query: 142 YNDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y       +   G D  R++ + +  +    Y+TD  TD++ ++I SHN ++PL+LQ+ H
Sbjct: 168 YFSHNLVENGQSGYDIHRSVGDNHTIEYRYDYMTDLITDEAENIISSHNPAKPLYLQLAH 227

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A H  T  +      +++V   +  + T  +I + +RR FA
Sbjct: 228 LAPHASTVDD------VIEVRSWKATNDTLGYIRDINRRKFA 263


>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
          Length = 581

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 98/243 (40%), Positives = 155/243 (63%), Gaps = 14/243 (5%)

Query: 10  AKAVPVTEKLL-----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRA 64
            KA+P+   ++       GWNDV FHG N+IPTPNIDALAYNG++LN++YT+P CTPSR+
Sbjct: 28  GKAIPLPPHIVIILADDMGWNDVSFHGANEIPTPNIDALAYNGVILNKYYTMPICTPSRS 87

Query: 65  AFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL 123
           A +TG+YP R G+  TP+     + +P+   L+P+ ++ LGY T L+GKWH+G   E+  
Sbjct: 88  ALMTGRYPIRDGMQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYT 147

Query: 124 PFNRGFDNHVGYWNGYLTYND---SIHETDFAVGLDARRN-MERYAPQMSSKYLTDFFTD 179
           P  RGFD   GY+NG+++Y D     ++T+   G D  R+  + +    SS+Y TD  TD
Sbjct: 148 PVRRGFDTFFGYYNGFISYYDYWIGWNDTNEVTGYDLHRDESDSFELAHSSEYFTDLITD 207

Query: 180 QSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
           ++  +I+++ +++PLFL+I+H AVH G+    K+    L+V   ++ + +F +I +   R
Sbjct: 208 EAEKIIRNNKNAKPLFLEISHLAVHAGS----KVHDDPLEVRRTDDVNASFPYIEDYQHR 263

Query: 240 LFA 242
            +A
Sbjct: 264 KYA 266


>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 710

 Score =  200 bits (509), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 107/226 (47%), Positives = 139/226 (61%), Gaps = 15/226 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG + IPTPNIDALAYNG++LN HY    CTPSR+A LTGKYP   G+    +
Sbjct: 55  GWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGKYPIHTGMQHLVI 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ EK+LPQYLKE GY+TH IGKWH G ++ E  P  RGFD+H GYW G   
Sbjct: 115 LEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFDSHFGYWQGLQD 174

Query: 142 YND----SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
           Y      S +  +  +G D RRNM   A     KY TD FTD++V +I+ H   + P+FL
Sbjct: 175 YYTHEVGSSNPKEGFLGFDMRRNMS-LARDTYGKYSTDLFTDEAVRLIEEHRPEAGPMFL 233

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H A H+   GN   P   LQ PD  E    F+++ +P+RR++A
Sbjct: 234 YLAHLAPHS---GNDNEP---LQAPD--EEVAKFSYVEDPERRIYA 271


>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
          Length = 881

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 114/231 (49%), Positives = 141/231 (61%), Gaps = 27/231 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDVGFHG N IPTPNIDALAY+GI+LNRHY+ P CTPSRAA +TG++P   G+     
Sbjct: 45  GLNDVGFHGSNQIPTPNIDALAYDGIILNRHYSAPMCTPSRAALMTGRHPMNVGMQHYVI 104

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G+       EKLLPQY +E GY THLIGKWH+G   E  LP NRGFD H+GY
Sbjct: 105 DSDEPWGLGLQ------EKLLPQYFREAGYRTHLIGKWHLGFYAEPYLPTNRGFDTHIGY 158

Query: 136 WNGYLTYNDSIHETDFAV--GLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHS- 191
              Y+ Y   I + D A   G D R+N+   Y P  +  Y TD+FT+ +V +I+SHN + 
Sbjct: 159 LGPYIDYWSYISKMDSATFEGYDLRQNLAVNYKP--NGTYATDYFTEAAVEIIRSHNRTG 216

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             + L + H A HT   GN   P   LQ P  EE    FA+I + DRR +A
Sbjct: 217 ERMLLVLNHLAPHT---GNDDAP---LQAP--EETIEKFAYIRDTDRRTYA 259


>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
 gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
          Length = 564

 Score =  199 bits (507), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   IPTPN+DALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 50  GWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P++EKLLPQYLK+LGYS H++GKWH+G  +    P  RGFD+H G+W G+  
Sbjct: 110 YAMEPRGLPLSEKLLPQYLKDLGYSNHIVGKWHLGHYQLRFTPMQRGFDSHTGFWTGHHH 169

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            ND         GLD RR  +  A  +  +Y T     +++ +++ HN S PLFL + HA
Sbjct: 170 MNDHTAVEHGHWGLDMRRGYD-VAYDLHGQYTTHVLGAEAIAIVQGHNKSSPLFLYVAHA 228

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+        P   L  PD  E      HI N  RR FA
Sbjct: 229 AVHSAN------PYDFLPAPD--ETVANLGHIENYRRRKFA 261


>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
 gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
 gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
          Length = 562

 Score =  199 bits (507), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 110/223 (49%), Positives = 141/223 (63%), Gaps = 13/223 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG  +IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL ELGY++H+ GKWH+G  K +  P  RGF +HVG+W+G+  
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQD 156

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
           YND     +   GLD  RN  + A  +   Y TD  TD SV VI +HN ++ PLFL + H
Sbjct: 157 YNDHTAVENNQWGLDM-RNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAH 215

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
           AA H+        P   L VPD   ND    +HI N  RR FA
Sbjct: 216 AACHSSN------PYNPLPVPD---NDVIKMSHIPNYKRRKFA 249


>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
 gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
          Length = 579

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDV FHG N I TPNIDALAYNGI+LN+HY    CTPSRA  LTGKYP   G+     
Sbjct: 39  GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P+  ++ GYSTHL+GKWH+G  +++L P  RGFD+H GY
Sbjct: 99  ITDEPWG------LPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D    + + +++ GLD RR++E   P+ +  Y T+ FT ++  +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHT   GN   P   +Q P  EE    F HI +P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253


>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
 gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
          Length = 578

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 144/230 (62%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDV FHG N I TPNIDALAYNGI+LN+HY    CTPSRA  LTGKYP   G+     
Sbjct: 39  GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P+  +E GYSTHL+GKWH+G   ++L P  RGFD+H GY
Sbjct: 99  ITDEPWG------LPQRERLMPEIFREAGYSTHLVGKWHLGFWHKDLTPTRRGFDHHFGY 152

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D    + + +++ GLD RR++E   P+ +  Y T+ FT ++  +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTAEAKRIIEQHDKSK 211

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHT   GN   P   +Q P  EE    F HI +P RR +A
Sbjct: 212 PLFMVMSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253


>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
 gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
          Length = 579

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDV FHG N I TPNIDALAYNGI+LN+HY    CTPSRA  LTGKYP   G+     
Sbjct: 39  GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P+  ++ GYSTHL+GKWH+G  +++L P  RGFD+H GY
Sbjct: 99  ITDEPWG------LPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D    + + +++ GLD RR++E   P+ +  Y T+ FT ++  +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRLLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHT   GN   P   +Q P  EE    F HI +P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253


>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
 gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
          Length = 579

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDV FHG N I TPNIDALAYNGI+LN+HY    CTPSRA  LTGKYP   G+     
Sbjct: 39  GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIYTGMQHFVI 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P+  ++ GYSTHL+GKWH+G  +++L P  RGFD+H GY
Sbjct: 99  ITDEPWG------LPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D    + + +++ GLD RR++E   P+ +  Y T+ FT ++  +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRLLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHT   GN   P   +Q P  EE    F HI +P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253


>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 671

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 105/226 (46%), Positives = 136/226 (60%), Gaps = 15/226 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDV FHG + IPTPNIDALAYNGI+LN HY    CTPSR+A +TGK P   G+   V 
Sbjct: 39  GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P++EKLLP+YL+E+GY TH +GKWH+G  K+E  P  RGFD+H GYWNG   
Sbjct: 99  FPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 158

Query: 142 YNDSIHETDFAV-----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y   I +    V     G D RRN+   A     KY TD FT+++V +I  HN  +P+FL
Sbjct: 159 YYTHITQEPDPVYSEYKGFDMRRNLT-VAWDTVGKYSTDLFTNEAVRLINEHNIDQPMFL 217

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H A H G          L + PD  E    F++I +P+RR+ A
Sbjct: 218 YLAHLAPHKGNEEQ------LFRAPD--EEIAKFSYILDPERRIQA 255


>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
 gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
          Length = 550

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 146/230 (63%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G+NDV FHG N I TPNIDALAYNG++LN+ Y    CTPSRA  LTGKYP   G+     
Sbjct: 38  GFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRATLLTGKYPIHTGMQHYVI 97

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P++ ++ GYST LIGKWH+G  +++L P  RGFD+H GY
Sbjct: 98  ITDEPWG------LPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLTPTMRGFDHHYGY 151

Query: 136 WNGYLTYND-SIH--ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D ++H    ++  G+D RR+++   PQ +  Y TD FT ++  VI+ H+ SR
Sbjct: 152 YNGYIDYYDHTLHMLTKNYTKGVDFRRDLDP-CPQDNGTYATDAFTAEAKRVIEQHDKSR 210

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHTG   N       +Q P  EE    FAHI++P RR +A
Sbjct: 211 PLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFAHITDPKRRTYA 252


>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
 gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
          Length = 577

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G NDV FHG N I TPNIDALAYNGI+LN+HY    CTPSRA  LTGKYP   G+     
Sbjct: 39  GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P  E+L+P+  ++ GYSTHL+GKWH+G  +++L P  RGFD+H GY
Sbjct: 99  ITDEPWG------LPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D    + + +++ GLD RR++E   P+ +  Y T+ FT ++  +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSHGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHT   GN   P   +Q P  EE    F HI +P RR +A
Sbjct: 212 PLFMVMSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253


>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 583

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 101/223 (45%), Positives = 130/223 (58%), Gaps = 12/223 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG N IPTPNIDALAY G++L  HY    CTPSRAA LTGKYP   G+    +
Sbjct: 52  GWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTGKYPIHLGMQHEAI 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ EKLLPQYLK++ Y TH++GKWH+G  K E  P  RGFD H GYWNG   
Sbjct: 112 FPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRGFDTHFGYWNGLQD 171

Query: 142 Y--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           Y  + +       +G+D RRN    A     KY  D +TD++V +I +HN   P+FL + 
Sbjct: 172 YYSHKTAEPYTLNIGMDMRRNF-TVAWDTMGKYSVDLYTDEAVRLINTHNTDNPMFLYLA 230

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             A H G A   +LP  L       E    F++I +P R+ +A
Sbjct: 231 QIAPHAGNAN--QLPQAL------PEEIEKFSYIIDPKRKRYA 265


>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 689

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 104/226 (46%), Positives = 136/226 (60%), Gaps = 15/226 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV FHG + IPTPNIDA+AYNGI+LN HY    CTPSR A +TGK P   G+   V 
Sbjct: 70  GWNDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTGKNPIHLGMQHSVL 129

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P++EKLLP+YL+E+GY TH +GKWH+G  + E  P  RGFD H GYWNG   
Sbjct: 130 LPSEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRGFDTHFGYWNGLQD 189

Query: 142 YNDSIHET---DFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y   I +     F   +GLD RRN+   A     KY TD FT+++V +I  H+   P+FL
Sbjct: 190 YYTHITQEPDPQFGEFMGLDMRRNLT-AAWDTQGKYSTDLFTEEAVRLINEHDKDDPMFL 248

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H A H G       P  LL+  D  E+   F++I +P+RR+ A
Sbjct: 249 YLAHLAPHKGN------PNRLLRASD--EDIARFSYILDPERRIQA 286


>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 532

 Score =  196 bits (498), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 101/228 (44%), Positives = 141/228 (61%), Gaps = 15/228 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW D GFHG + I TPN+DALAY+G++LNRHY LP+CTPSR+A LTG YP R G+   P+
Sbjct: 39  GWTDTGFHGSDQIKTPNMDALAYSGMILNRHYVLPSCTPSRSALLTGLYPIRTGMQGMPL 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P++ KL P++LK LGY THL+GKWH+G      LP  RGFD+  GY+NGY+ 
Sbjct: 99  KGGDVRNLPLSFKLKPEFLKNLGYRTHLVGKWHLGYRTINHLPNQRGFDSFFGYYNGYVD 158

Query: 142 Y-----NDSI--HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y     N ++   + ++  G D  RN E Y     + Y T  FT ++  +IK+HN S PL
Sbjct: 159 YFKFGHNQTVAGEKIEYFYGYDLHRNGEIYQTDKDT-YATRLFTREAEKIIKNHNESEPL 217

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +L  +H A HTG           ++VP+  + ++T+ HI +  RR FA
Sbjct: 218 YLYFSHLATHTGDDDIG------MEVPEDADVNKTYGHIKHYGRRAFA 259


>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
 gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
          Length = 573

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 109/230 (47%), Positives = 136/230 (59%), Gaps = 26/230 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDV FHG N IPTPNIDALAY+GI+LNRHY  P CTPSRA+ +TGK+P   G+     
Sbjct: 39  GWNDVSFHGSNQIPTPNIDALAYDGIILNRHYVPPLCTPSRASLMTGKHPMNIGMQDHVI 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G      + +KL+PQY +E GY THL+GKWH+G  +    P  RGFD+H GY
Sbjct: 99  ISDEPWGLG------LDQKLMPQYFREAGYRTHLVGKWHLGFFRRAYTPTYRGFDSHFGY 152

Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
              Y+ Y D    ++ET  A GLD RRN        +  Y TD F D++V +I SHN S+
Sbjct: 153 LGPYIDYWDHSLQMNETS-ARGLDMRRNTA-VNYDANGTYATDLFNDEAVRLIDSHNRSK 210

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL +TH A HTG   +       LQ P   +    F +I +P RR  A
Sbjct: 211 PLFLVLTHLAPHTGNEDDP------LQAP--ADEIAKFDYIQDPKRRTLA 252


>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
 gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
          Length = 324

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 104/230 (45%), Positives = 143/230 (62%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G+NDV FHG N I TPNIDA AYNG++LNR+Y    CTPSRAA LTGKYP   G+     
Sbjct: 39  GFNDVSFHGSNQILTPNIDAFAYNGVILNRYYVPNLCTPSRAALLTGKYPIHNGMQHFVQ 98

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E+L+PQ+ ++ GYST L+GKWH+G  +++  P  RGFD+H GY
Sbjct: 99  IPDEPWG------LPLGERLMPQFFRDAGYSTQLVGKWHLGFWRQDHTPIMRGFDHHFGY 152

Query: 136 WNGYLTYNDSIH---ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           +NGY+ Y D  H   + ++  G D RR+++R     +  Y T+ FT ++  +I+ H+ SR
Sbjct: 153 YNGYIDYYDHTHYMLDRNYTAGADFRRDLQRCHSD-NGTYATEAFTKEARRIIEQHDLSR 211

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H AVHT   GN   P   +Q P   E    F HIS+P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNENQP---MQAP--YEEVAKFVHISDPKRRTYA 253


>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 579

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 137/221 (61%), Gaps = 14/221 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDVGFHG N IPTPNIDALAYNGI+LNRHY LP+ TPSR AF TG YP R G+    +
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  K+LP++L+ LGY+T LIGKWH+G +  +  P +RGFD   G++N ++T
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYTTKLIGKWHMGYHTPQYTPLHRGFDTFFGFYNSHIT 161

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +      G D  R  +  A  +  +Y+TD FT +++ +I++H   RPL+LQI+H 
Sbjct: 162 YYDYEYSNQNMTGYDMHRG-DDPAHGIKREYVTDLFTKEAIKIIENHELPRPLYLQISHL 220

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             ++ PD   +D     I  P+RR +A
Sbjct: 221 AVHAP-----------IEQPDDSSSDEII-QIREPNRRKYA 249


>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
          Length = 591

 Score =  193 bits (490), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 96/221 (43%), Positives = 137/221 (61%), Gaps = 12/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVGFHG + I TPNIDAL YNGI+LNRHY LP+ TPSR AF TG+YP R G+    +
Sbjct: 41  GWDDVGFHGSDQIRTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRMGMQGEDI 100

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  ++LP++L++LGY T LIGKWH+G    +  P  RGFD+ +G++N +++
Sbjct: 101 QGGEPRGIPLNVRILPEFLRDLGYMTKLIGKWHLGYYTPQHTPLRRGFDSFLGFYNSHVS 160

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y +  +      G D  R  +  A   + KY+TDFFTD+++ +I+ ++ SRPL+LQI+H 
Sbjct: 161 YYNYKYSFQNMSGYDMHRG-DAPAYGSTDKYVTDFFTDEAIKIIEYYDPSRPLYLQISHL 219

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH    G            D    D  F HI   +RR +A
Sbjct: 220 AVHAPLEGPQ----------DYNHYDSQFLHIREINRRKYA 250


>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 593

 Score =  192 bits (489), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 99/224 (44%), Positives = 138/224 (61%), Gaps = 16/224 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDV FHG N IPTPNIDAL YNGI+LNRHY LP+ TPSRAAF TG YP R G+    +
Sbjct: 42  GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  ++LP+YL+ LGY+T LIGKWH+G +  +  P +RGFD  +G++N +++
Sbjct: 102 QGGEPRGLPLNIRILPEYLRGLGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVS 161

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +      G D  R  +  A  ++++Y TD FTD+++ +I+ H   RPL+LQI+H 
Sbjct: 162 YYDYRYSYQNMSGYDMHRG-DNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHL 220

Query: 202 AVHTGTAGNAKLPTGLLQVPD---MEENDRTFAHISNPDRRLFA 242
           AVH             ++ PD      N   F HI   +RR +A
Sbjct: 221 AVHAP-----------IESPDDDHRNSNRERFKHIPEVNRRNYA 253


>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 514

 Score =  192 bits (487), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 108/223 (48%), Positives = 135/223 (60%), Gaps = 13/223 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG N IPTPNIDALA+ GI+LN +Y  P CTPSR+A LTGKYP   G+   V 
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKYPIHTGLQHGVI 101

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G A   + + EKLLP+YL+ L Y T  +GKWH+G  K++  P  RGFD+H GYW G+  
Sbjct: 102 HGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFDSHYGYWTGHQD 161

Query: 142 YND--SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           Y D  +I    F  G D RR M          Y TD FT+++V VIK H+ ++PLFL + 
Sbjct: 162 YYDHTAIENPGFW-GYDMRRGMNVTRSDFGY-YTTDLFTNEAVKVIKGHDSNKPLFLYLA 219

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H A H+   GN   P   LQ P   E    F +I + +RRLFA
Sbjct: 220 HLATHS---GNKYSP---LQAP--AETVAKFNYIKDKNRRLFA 254


>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
 gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
          Length = 558

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 100/224 (44%), Positives = 140/224 (62%), Gaps = 13/224 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G ND+G    N IPTPNIDAL YNG+VL+R+Y    CTPSRAAFLTG YP R  +   P+
Sbjct: 38  GHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYVQNACTPSRAAFLTGNYPIRSAMQGLPI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  +++P+   L+PQ+LK LGY TH++GKWH+G       P  +GFD+H GYWNG+  
Sbjct: 97  VAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFDSHFGYWNGFTG 156

Query: 142 YNDSIHETDF-AVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y D  + TDF +  ++     +R+  +     +Y T  FT++++ +I+ HN +RPLFL +
Sbjct: 157 YYD--YFTDFNSTAIEGFDLHDRFETERGYQGQYATRVFTERALDIIEGHNTTRPLFLLM 214

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           TH A H G  G        L VP+  E  RT+++I +P RRL+A
Sbjct: 215 THLAAHAGRDGTE------LGVPNEVEAQRTYSYIQDPRRRLYA 252


>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
          Length = 564

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 99/221 (44%), Positives = 134/221 (60%), Gaps = 14/221 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDVGFHG N IPTPNIDALAYNGI+LNRHY LP+ TPSR AF TG YP R G+    +
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  K+LP++L+ LGY T LIGKWH+G +  +  P +RGFD   G++N ++T
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFDTFFGFYNSHIT 161

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +      G D     +  A  M  +Y TD FT++++ +I++H   RPL+LQI+H 
Sbjct: 162 YYDYEYSNQNMTGYDMHCG-DDPAYGMKREYATDLFTNEAIKIIENHELPRPLYLQISHL 220

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             ++ PD    D     I  P+RR +A
Sbjct: 221 AVHAP-----------IEQPDDSSRDE-IVQIREPNRRKYA 249


>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
          Length = 563

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 95/225 (42%), Positives = 141/225 (62%), Gaps = 11/225 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           G+NDV FHG + IPTPN+  +A  GI+L+R YT  TCTPSR A LTG+YP R G+   P+
Sbjct: 50  GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  +++P+    +P + + LGY THL+GKWH+G   +E  P  +GFD+H GYWNG++ 
Sbjct: 110 KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 169

Query: 142 YND--SIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           Y D  S  + D    +      +++ P   S  +Y T+ FT++S+ VI+ H+   PLFL 
Sbjct: 170 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 229

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HTG  G+       L VPD+++ +  F++I +P RRL+A
Sbjct: 230 VSHLAAHTGQNGSE------LGVPDVDQTNHEFSYIQDPRRRLYA 268


>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 536

 Score =  190 bits (483), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 102/228 (44%), Positives = 142/228 (62%), Gaps = 21/228 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDVGFHG N IPTPNIDALAYNGI+LN HY+    TPSRAA LTGKYP + G+  P +
Sbjct: 19  GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 78

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                +++P   K++ +Y K++GY+THL+GKWH+G ++    P  RGFD+  G++NG+ +
Sbjct: 79  TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 137

Query: 142 YND-----SIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y D      I++ +++ G D RR+     P    + KY TD F + +V VI+ HN + PL
Sbjct: 138 YYDYVSNWKINDKEYS-GFDLRRDT---VPSWNDAGKYATDLFAEHAVDVIQKHNVNTPL 193

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           F+ I H AVH G  G        L+ P  +E    F HI +P+RR +A
Sbjct: 194 FMMIAHLAVHVGNEGK------WLEAP--QETVNKFKHIRDPNRRTYA 233


>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
          Length = 531

 Score =  190 bits (483), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 95/225 (42%), Positives = 141/225 (62%), Gaps = 11/225 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           G+NDV FHG + IPTPN+  +A  GI+L+R YT  TCTPSR A LTG+YP R G+   P+
Sbjct: 35  GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG  +++P+    +P + + LGY THL+GKWH+G   +E  P  +GFD+H GYWNG++ 
Sbjct: 95  KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 154

Query: 142 YND--SIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           Y D  S  + D    +      +++ P   S  +Y T+ FT++S+ VI+ H+   PLFL 
Sbjct: 155 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 214

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HTG  G+       L VPD+++ +  F++I +P RRL+A
Sbjct: 215 VSHLAAHTGQNGSE------LGVPDVDQTNHEFSYIQDPRRRLYA 253


>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 578

 Score =  190 bits (483), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 105/226 (46%), Positives = 136/226 (60%), Gaps = 22/226 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG   IPTPNIDAL YNGI+LN+HY LP+C+P+RAAFLTGKYP R G+    G
Sbjct: 36  GWNDVGFHGATQIPTPNIDALGYNGIILNKHYVLPSCSPTRAAFLTGKYPIRMGMQ---G 92

Query: 83  AGVA----KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           AG+A    + +PV  + LP+YL+ LGY T+LIGKWH+G +  + LP  RGFD   G++N 
Sbjct: 93  AGIAGGEPRGLPVHVQTLPEYLQGLGYETNLIGKWHVGYHTPKHLPNRRGFDYFYGFYNS 152

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           ++ Y D  +      G D   N E  A      Y TD FT  ++ VI  H+   P++LQ+
Sbjct: 153 HIGYYDYRYSQGNMSGFDMHINGET-AYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQV 211

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEEN--DRTFAHISNPDRRLFA 242
           +H A H             + VP  E+N  D  F HIS P RR +A
Sbjct: 212 SHLAPHAP-----------MDVP-FEDNPYDDEFRHISEPKRRAYA 245


>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
          Length = 558

 Score =  190 bits (482), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 102/228 (44%), Positives = 142/228 (62%), Gaps = 21/228 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GWNDVGFHG N IPTPNIDALAYNGI+LN HY+    TPSRAA LTGKYP + G+  P +
Sbjct: 41  GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 100

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                +++P   K++ +Y K++GY+THL+GKWH+G ++    P  RGFD+  G++NG+ +
Sbjct: 101 TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 159

Query: 142 YND-----SIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y D      I++ +++ G D RR+     P    + KY TD F + +V VI+ HN + PL
Sbjct: 160 YYDYVSNWKINDKEYS-GFDLRRDT---VPSWNDAGKYATDLFAEHAVDVIQKHNVNTPL 215

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           F+ I H AVH G  G        L+ P  +E    F HI +P+RR +A
Sbjct: 216 FMMIAHLAVHVGNEGK------WLEAP--QETVNKFKHIRDPNRRTYA 255


>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 552

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 102/222 (45%), Positives = 134/222 (60%), Gaps = 12/222 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG   IPTPN+DALAY+GI+LNR+Y  P CTPSRAA +TG+YP   G+   V 
Sbjct: 36  GWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVTPICTPSRAALMTGRYPIHTGMQHAVL 95

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-YL 140
            G+  + +P+ EKLLP+YL+ELGY  H++GKWH+G       P  RGFD+HVG+W G + 
Sbjct: 96  YGMEPRGLPLEEKLLPEYLRELGYKNHIVGKWHLGHYTRRYTPLERGFDSHVGFWTGHHH 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            ++ S  ET+   GLD RR  +  A  +  KY T    D++V  I +H+   PLFL + H
Sbjct: 156 MFDHSAVETE-TWGLDMRRGYD-VAYDLHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAH 213

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AAVH+        P   L  PD+        H+    RR FA
Sbjct: 214 AAVHSAN------PYDFLPAPDVTV--AGLEHVEPYPRRKFA 247


>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 570

 Score =  186 bits (473), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 100/223 (44%), Positives = 133/223 (59%), Gaps = 13/223 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV FHG N I TPNIDALAYNGI+LN HY    CTPSRA+ +TGKYP   G+   V 
Sbjct: 58  GWNDVSFHGSNQIQTPNIDALAYNGIILNSHYVPALCTPSRASLMTGKYPTSLGMQHLVI 117

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E L+P+Y  + GY+TH +GKWH+G  K+E  P  RGFD+H G+WNG+  
Sbjct: 118 LSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEYTPIYRGFDSHFGHWNGFQD 177

Query: 142 YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQIT 199
           Y D    +D   G D RRN E  Y+ Q    Y TD FT +++ +I +HN  + PLFL ++
Sbjct: 178 YYDHTTMSDSLKGYDMRRNFEVDYSYQ--GMYTTDVFTKEAIKIIDNHNSQKGPLFLYLS 235

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H A H+G       P    Q P  E+       I++P R+++A
Sbjct: 236 HLAPHSGN------PDNPFQAP--EDEISKHECINDPGRKIYA 270


>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 657

 Score =  186 bits (471), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 105/227 (46%), Positives = 131/227 (57%), Gaps = 39/227 (17%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDVGFHG N IPTPNIDALAY GI+LNRHYT P CTPSRAA +TG+ P   G+     
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRAAIMTGRNPISVGMQHYVI 101

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G      + +K++P+Y +E GY THL+GKWH+G   ++  P  RGFD+H  Y
Sbjct: 102 DSDEPWGLG------LDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYTPTMRGFDSHTNY 155

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             GY    D  H  + AV  DA           +  Y TD FTD +  +I  HN S PLF
Sbjct: 156 -TGY----DMRH--NLAVDYDA-----------NGTYATDHFTDAASRIIDKHNPSEPLF 197

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L + H A HTG   +       LQ P  EE  + F HIS+ +RR++A
Sbjct: 198 LMVNHLAPHTGNDNDP------LQAP--EERIKKFEHISDENRRIYA 236


>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
          Length = 557

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 127/221 (57%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV FHG   IPTPN+DALA++G++L  +Y  P CTPSR+A +TGK+P   G+   V 
Sbjct: 37  GWNDVSFHGSKQIPTPNLDALAFSGLILQNYYVTPLCTPSRSALMTGKHPIHTGMQHDVL 96

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G ++  +P++E  LP+YLK+LGY  H++GKWH+G  K    P  RGFD+H GYW G+  
Sbjct: 97  YGYSRYGLPLSEITLPEYLKDLGYKNHIVGKWHLGHYKSVYTPLFRGFDSHYGYWTGHQD 156

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D       A G D RRN          KY T   TD++  VI  H+ S PLFL + H 
Sbjct: 157 YYDHTAVEWNAWGYDMRRN-HSVDWSAYGKYTTTLLTDEACDVITKHDVSSPLFLYVAHL 215

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+        P   LQ P  EE    F+ I N  RR +A
Sbjct: 216 AVHSAN------PYSPLQAP--EETVEMFSSIENLQRRRYA 248


>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
 gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
          Length = 675

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 106/232 (45%), Positives = 136/232 (58%), Gaps = 29/232 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDVGFHG N IPTPNIDALAY+GI+LNRHYT P CTPSRA+ +TGK P   G+     
Sbjct: 43  GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G      + +K++P+Y KE GY THL+GKWH+G + ++  P  RGFD HVGY
Sbjct: 103 VSDEPWGLG------LDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGY 156

Query: 136 WNGYLTYNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
              Y+ Y D  +   F+      G D R N+       +  Y TD FT  +  +I+ H+ 
Sbjct: 157 LGPYVDYWD--YTLKFSPPKSFQGYDMRNNLN-VDYDSNGTYATDHFTKAASSIIERHDT 213

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             PLFL + H A H   A N   P   LQ P  EE+ R F +IS+  RR++A
Sbjct: 214 KDPLFLVVNHLAPH---AANDDDP---LQAP--EEDIRKFDYISDERRRIYA 257


>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
          Length = 581

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/221 (45%), Positives = 139/221 (62%), Gaps = 13/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG N IPTPNIDAL YNGI+LNRHY LP+ TPSR AF TG+YP R G+    +
Sbjct: 42  GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P+  K+LP++L+ LGY+T LIGKWH+G    +  P +RGFD  +G++N Y++
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +      G D  R  +  A  M+ +Y TD FT +++++I++H  +RPL+LQ++H 
Sbjct: 162 YYDYSYSNQNMSGYDMHRG-DDPAYGMNREYATDMFTREAINIIENHELNRPLYLQLSHL 220

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             L+ P    NDR   HI  P+RR +A
Sbjct: 221 AVHAP-----------LEQPMNVYNDREPIHIREPNRRKYA 250


>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
 gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
          Length = 607

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 106/232 (45%), Positives = 136/232 (58%), Gaps = 29/232 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDVGFHG N IPTPNIDALAY+GI+LNRHYT P CTPSRA+ +TGK P   G+     
Sbjct: 43  GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G      + +K++P+Y KE GY THL+GKWH+G + ++  P  RGFD HVGY
Sbjct: 103 VSDEPWGLG------LDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGY 156

Query: 136 WNGYLTYNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
              Y+ Y D  +   F+      G D R N+       +  Y TD FT  +  +I+ H+ 
Sbjct: 157 LGPYVDYWD--YTLKFSPPKSFQGYDMRNNLN-VDYDSNGTYATDHFTKAASSIIERHDT 213

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             PLFL + H A H   A N   P   LQ P  EE+ R F +IS+  RR++A
Sbjct: 214 KDPLFLVVNHLAPH---AANDDDP---LQAP--EEDIRKFDYISDERRRIYA 257


>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
 gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
          Length = 585

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 104/232 (44%), Positives = 140/232 (60%), Gaps = 28/232 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG------ 76
           GWNDV FHG   IPTPNIDALAY+GI+LNRHY  P CTPSRA+ +TGK+P   G      
Sbjct: 42  GWNDVSFHGSLQIPTPNIDALAYSGIILNRHYAPPLCTPSRASLMTGKHPINIGMQHHVI 101

Query: 77  -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
            +D P G G      + +KL+P+Y +E GY T L+GKWH+G  ++   P  RGFD+H GY
Sbjct: 102 EVDEPWGLG------LDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTMRGFDSHYGY 155

Query: 136 WNGYLTYND-SIHETDFAV-GLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHS- 191
              Y+ Y D S+  ++ +  GLD RRN++  Y+ +    Y TD FT ++V +I  HN + 
Sbjct: 156 IGPYIDYWDHSLQMSNTSTRGLDMRRNLQVDYSAR--GTYATDLFTREAVRLIHDHNQTS 213

Query: 192 -RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             PLFL +TH A HTG   +       +Q P  EE+   F+ I +P RR+ A
Sbjct: 214 ANPLFLVVTHLAPHTGNEDDP------MQAP--EEDVELFSFIKDPKRRVLA 257


>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 559

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 95/221 (42%), Positives = 133/221 (60%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GW+DV FHG + IPTPNID LAY+G++L+ +Y  P CTP+RAA +TG++P   G+ D  +
Sbjct: 39  GWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQPICTPTRAALMTGRHPIHLGLQDGVI 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +P+ E ++PQYLK LGY TH++GKWH+G    +  P  RGFD H GY+NG   
Sbjct: 99  VASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLGFFAWQYTPLYRGFDTHFGYYNGEEG 158

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D   E    +GLD R N E +      +Y T+ FT  +  +I +HN ++PLFL + H 
Sbjct: 159 YYDHTAEEPKYIGLDFRNNTELFK-SAYGEYSTELFTSYAEKIIHNHNKNKPLFLYLAHQ 217

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+   GN+  P   L+ P   +    F +I +  RR FA
Sbjct: 218 AVHS---GNSYSP---LEAP--YKYTSRFPYIQDERRRTFA 250


>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
 gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
          Length = 634

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 103/231 (44%), Positives = 136/231 (58%), Gaps = 27/231 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDVGFHG N I TP+IDALAY+G++LNRHY+ P CTPSRAA +TG++P   G+     
Sbjct: 45  GWNDVGFHGSNQIATPHIDALAYDGVILNRHYSAPMCTPSRAALMTGRHPINVGMQHYVI 104

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G      + ++++PQY +  GY TH+IGKWH+G   E  +P NRGFD H+GY
Sbjct: 105 DSDEPWGLG------LDQRIMPQYFRAAGYRTHMIGKWHLGFFTEHYIPTNRGFDTHIGY 158

Query: 136 WNGYLTYNDSIHETDFAV--GLDARRN-MERYAPQMSSKYLTDFFTDQSVHVIKSHNHS- 191
              Y+ Y   + + +     G D R+N    YA   +  Y TD+FT  +  +I  H  S 
Sbjct: 159 LGPYVDYWSYVSKMNSGTFEGYDMRQNQFVNYA--ANGTYATDYFTSAARDIIAQHGKSG 216

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +P+ L + H A H   AGN   P   LQ P  E  DR FA+I N DRR +A
Sbjct: 217 QPMLLVMNHLAPH---AGNDDDP---LQAP-QETIDR-FAYIGNRDRRTYA 259


>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
          Length = 633

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 95/222 (42%), Positives = 130/222 (58%), Gaps = 8/222 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GW+DV FH    IPTPNIDA+  + ++LN HY   +CTPSR A LTGKYP + G+ +  +
Sbjct: 68  GWSDVSFHANGQIPTPNIDAMCSDAVLLNSHYVQASCTPSRGALLTGKYPIKIGLQEYVI 127

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  +A+ +  +LLPQYL++LGY+THL+GKWH+G   E+ LP NRGFD+  G++NG  T
Sbjct: 128 QPGRQEALHLKHRLLPQYLRDLGYATHLVGKWHLGFYAEDYLPENRGFDSFYGFYNGAGT 187

Query: 142 -YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            YN S  + D  +G D   N E   P    KY TD  T +  H+I+S +  +P+FL I+H
Sbjct: 188 YYNHSASDADGRIGYDWHLNKES-DPDAHGKYATDIITQRVKHLIQSRDPEKPMFLMISH 246

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A H G   +      L +V      D   AHI    R  +A
Sbjct: 247 MAPHGGDNEDE-----LFEVDRQWIEDPEIAHIMVESRTKYA 283


>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 594

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 97/221 (43%), Positives = 128/221 (57%), Gaps = 12/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV F G   IPTPN+DALA  G++L  HY  P CTPSRAA LTG YP   G+    +
Sbjct: 94  GWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYPIHSGMQHYVI 153

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +     +P+  KLLPQ+LK+LGY THLIGKWH+G  K+E LP  RGFD+H+GY+NGY+ 
Sbjct: 154 RSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDSHLGYYNGYID 213

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    H       LD  ++     P  S +Y T  FTD++  +I+ H+   PLFL   H 
Sbjct: 214 YFTHNHTYKRDSALDFFKDE---VPYHSEEYATRLFTDRAEEIIRDHDVDNPLFLYFAHL 270

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH  T  +        Q P  +E    F+++ + +R  FA
Sbjct: 271 AVHRATDRDP------FQAP--QETIDKFSYVGDRNRTTFA 303


>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
 gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
          Length = 478

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 106/223 (47%), Positives = 130/223 (58%), Gaps = 33/223 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG  DIPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSADIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL ELGY++H+ GKWH+G  K +  P  RGF +H   W     
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYASHIAGKWHLGHWKLKYTPLYRGFSSH---W----- 148

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD R   E  A  +   Y TD  TD SV VI SHN ++ PLFL + H
Sbjct: 149 ------------GLDMRNGTE-VAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAH 195

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
           AA H+        P   L VPD   ND    AHI +  RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVIKMAHIPHYKRRKFA 229


>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
 gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
          Length = 485

 Score =  177 bits (448), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 95/228 (41%), Positives = 134/228 (58%), Gaps = 16/228 (7%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGWNDV FHG + IPTPN+D+LAY+G++L  +Y  P CTP+R+A +TG++P   G+   V
Sbjct: 2   QGWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGV 61

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            +G     +P+ E +LPQYLK LGY+TH++GKWH+G +  E  P  RGFD++ GY  G  
Sbjct: 62  ISGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKD 121

Query: 141 TYNDSIHETDFA------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
            Y D   +   +       GLD R   E    + +  Y T+ F  ++  +I SH+ S+PL
Sbjct: 122 NYYDHTDDESNSPEELGYKGLDLRNGTEPVWTE-NGTYSTELFATEAERIITSHDTSKPL 180

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           FL + H AVH+G   N       LQ P  ++    F HI +P RR FA
Sbjct: 181 FLYLPHQAVHSGNPDNP------LQAP--QKYIDKFPHIQHPGRRTFA 220


>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
 gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
          Length = 542

 Score =  176 bits (447), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 105/223 (47%), Positives = 132/223 (59%), Gaps = 33/223 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG  +IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL ELGY++H+ GKWH+G  K +  P  RGF +H   W     
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSH---W----- 148

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD  RN  + A  +   Y TD  TDQSV VI +HN ++ PLFL + H
Sbjct: 149 ------------GLDM-RNGTQVAYDLHGHYTTDVITDQSVKVIANHNATKGPLFLYVAH 195

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
           AA H+        P   L VPD   ND    +HI N  RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVIKMSHIPNYKRRKFA 229


>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
          Length = 561

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 98/232 (42%), Positives = 132/232 (56%), Gaps = 23/232 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG   + TPN+DALAY+G++L  +Y  P CTPSRAA +TG++P   G+   V 
Sbjct: 37  GWNDVGFHGSEQVLTPNLDALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQQNV- 95

Query: 83  AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  A P    + E + PQYLK++GY TH++GKWH+G   E+  P  RGFD+H GY+ G
Sbjct: 96  --IYSAEPYGLGLNEIIFPQYLKQIGYKTHIVGKWHLGFFAEQYTPIERGFDSHYGYYMG 153

Query: 139 YLTYNDSI----HETDF--AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
              Y   I    HE  F  + GLD RRN E        +Y T+ FT ++ ++I SHN S 
Sbjct: 154 AEDYWVHIAGNAHEVSFNASWGLDFRRNGEVVKTAF-GQYSTELFTTEAENIIASHNQSE 212

Query: 193 PLFLQITHAAVHTGT--AGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ +   AVH+     G+A+L           +    F HI N  RR FA
Sbjct: 213 PLFMYVAQQAVHSANPYTGDAELEAPF-------KYYEKFPHIKNEKRRKFA 257


>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 502

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 97/225 (43%), Positives = 134/225 (59%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV +HG   I TPNIDALA+NGI LNR+YT P CTPSR+AFLTG YP   G+   V 
Sbjct: 37  GWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTGCYPMNTGMQHSVI 96

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+  KLLPQ+L + GY + ++GKWH+G  KEE  P  RGF +HVG W G+  
Sbjct: 97  LTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRGFQSHVGSWEGFSD 156

Query: 142 YNDSIHETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           Y   I +  +      G D RR+M++ + +   +Y T   T++++ +IK H + +PLFL 
Sbjct: 157 YYSHIMDFSWQTWSISGHDFRRDMQK-SKEDDGRYYTHVMTEEALKIIKDHPNEKPLFLY 215

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           I H AVH+   GN   P   L+ P    +   +  I +P R L+A
Sbjct: 216 IAHLAVHS---GNQPEP---LKAPTKYTD--PYMDIGHPSRTLYA 252


>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
 gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
          Length = 544

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 104/223 (46%), Positives = 132/223 (59%), Gaps = 33/223 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG  +IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL ELGY++H+ GKWH+G  K +  P +RGF +H   W     
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLHRGFSSH---W----- 148

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD  RN  + A  +   Y TD  TD SV VI SHN ++ PLFL + H
Sbjct: 149 ------------GLDM-RNGTQVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAH 195

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEEND-RTFAHISNPDRRLFA 242
           AA H+        P   L VPD   ND    +HI +  RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVLKMSHIPHYKRRKFA 229


>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 526

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 131/227 (57%), Gaps = 19/227 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GW+DV FHG + IPTPNID LAY+G++L+ +Y  P CTP+R A LTG+YP   G+     
Sbjct: 38  GWDDVSFHGSHQIPTPNIDELAYSGVLLHNYYVQPVCTPTRGALLTGRYPMHLGLQHFVI 97

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             + PVG      +P+ E  LP YLK+LGYSTH++GKWH+G   +E  P  RGFD+H GY
Sbjct: 98  TPNEPVG------LPLNETTLPTYLKKLGYSTHMVGKWHLGFFAKEYTPTYRGFDSHYGY 151

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           + G+  Y       +   G D R NM+        +Y  + FT Q+  +I  H+H +PLF
Sbjct: 152 FLGHQDYYTHNALWNNQWGFDLRHNMDLQRSTF-GEYGPELFTTQAEKLIYDHDHKKPLF 210

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L   H AVH G +G    P G L     +   R F HI++  RR++A
Sbjct: 211 LYFAHQAVHYGNSG----PNGTLLEAPYKYTSR-FPHIADHQRRIYA 252


>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
 gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
          Length = 633

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 103/223 (46%), Positives = 131/223 (58%), Gaps = 33/223 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG  +IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL ELGY++H+ GKWH+G  K +  P  RGF +H   W     
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSH---W----- 148

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD  RN  + A  +   Y TD  T+ SV VI +HN ++ PLFL + H
Sbjct: 149 ------------GLDM-RNGTQVAYDLHGHYTTDVITEHSVKVIANHNATKGPLFLYVAH 195

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
           AA H+        P   L VPD   ND    +HI N  RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVIKMSHIPNYKRRKFA 229


>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
          Length = 512

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 126/206 (61%), Gaps = 15/206 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV +HG   I TPNIDALA+NGI L R+Y  P CTPSRAA LTG+YP   G+  + +
Sbjct: 1   GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+++ LLPQ+L +LGY TH +GKWHIG  K+E  P  RGF+ HVG+W  Y+ 
Sbjct: 61  YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120

Query: 142 YNDSIHETDF-----AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y    HE  +     + GLD RRN+   A   + +Y+T   T +++ VI++H   +PLFL
Sbjct: 121 YYK--HEKAYLGPTRSPGLDMRRNL-FLARNDTGRYVTQLLTKEALEVIENHPVDKPLFL 177

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPD 222
            + H A H+        P   LQVPD
Sbjct: 178 YLAHLAPHSAG------PQDPLQVPD 197


>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
 gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 93/221 (42%), Positives = 128/221 (57%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV FHG   IPTPNID LA  G++LN +Y  P CTP+R+A +TGKYP   G+  + +
Sbjct: 35  GWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSPICTPTRSAIMTGKYPIHTGMQHSVI 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     + + E L+PQYLK LGY+TH +GKWH+G  K E  P  RGFD++ GYW G   
Sbjct: 95  LAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLGFFKYEYTPIQRGFDSYFGYWCGKGD 154

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D  +   +  GLD   + E+        Y +D F +++V+VI +HN S PLFL +   
Sbjct: 155 YWDHSNNEKYGWGLDL-HDSEQDVWTEWGHYSSDLFAEKAVNVISTHNASVPLFLYLPFQ 213

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+     A     L   PD+ +    F +I +  RR+FA
Sbjct: 214 AVHS-----ANFIQPLQAPPDLIDK---FKNIKDERRRIFA 246


>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 513

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 129/227 (56%), Gaps = 15/227 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW+D+G HG + IPTPNID LA  G+VL+ +YT P CTPSRA+ +TGKYP R G+   V 
Sbjct: 40  GWDDIGLHGSSQIPTPNIDKLAEEGVVLDNYYTQPICTPSRASLMTGKYPVRLGLQHDVI 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
            A     +P   K++PQYL +  Y  H++GKWH+G ++ E LP  RGF +H GY  G   
Sbjct: 100 SAATPFGLPSNFKIMPQYLHDKNYDCHIVGKWHLGHSRSEFLPTRRGFKDHFGYRLGSSD 159

Query: 139 -YLTY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            Y  Y  +DS        GLD   N E  A + + KY TD +T +S  +++ HN SRPLF
Sbjct: 160 HYSHYGADDSDVPGSLFYGLDLWHN-EVPAKEFNGKYSTDIYTHRSTDILRMHNKSRPLF 218

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L + + AVH G       P   LQ P     DR  + I N  RR +A
Sbjct: 219 LYLAYQAVHAGN------PDQALQAP-QSIVDRFSSSIRNDRRRRYA 258


>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 554

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/221 (41%), Positives = 126/221 (57%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV FHG + IPTPN+D LA +G++LN +Y  P CTPSRAA +TG YP R G+   P+
Sbjct: 47  GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P   ++LPQYLKE GY THL+GKWH+G  KE L P  RGFD+  GY+ G   
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y       +   GLD   N +    ++ + Y T  FT ++ ++I++   S+PL L ITH 
Sbjct: 167 YFAHTISYENHTGLDFWLNKKPVWSEIGT-YSTSVFTKRAQYIIENRTKSKPLLLVITHQ 225

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           A H        L    LQ P  +EN   F +I   +R ++A
Sbjct: 226 ATHCA------LERERLQAP--QENIDKFPYIGEKNRTIYA 258


>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
          Length = 500

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 133/227 (58%), Gaps = 19/227 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+D+  HG  +IPTPNID LA +GI+LN +Y  P CTPSRAA +TG++P   G+   V 
Sbjct: 36  GWDDISLHGSQEIPTPNIDLLATDGILLNNYYVQPICTPSRAALMTGRHPVHLGLQHDV- 94

Query: 83  AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  A P    + E LLPQYLK LGYSTH++GKWH+G   +E  P  RGFD+H+GY+ G
Sbjct: 95  --IVWAQPYGLGLNETLLPQYLKTLGYSTHMVGKWHLGFYDKEHTPTKRGFDSHLGYYTG 152

Query: 139 YLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              Y D      + D+ +     R ++R A     +Y T+ FT ++  VI  H+ S+PLF
Sbjct: 153 CEDYYDHTWGFTKQDWGLDFWHDREVDRSA---FGQYSTEVFTSEAERVIAEHDVSKPLF 209

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +   AVH+G  GN       LQ P   +  + F  I + +RR+FA
Sbjct: 210 LYLAQQAVHSGNPGNKV----RLQAP--WKYVKNFMGIKSEERRVFA 250


>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
           purpuratus]
          Length = 514

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 126/227 (55%), Gaps = 21/227 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GW+DV  HG + IPTPNID LA +G+ L  +Y  P CTPSR+A +TG++P   G+     
Sbjct: 22  GWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTGRHPIHTGLQFGVI 81

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             + P G G      + EK + QYLK LGYSTH +GKWH+G   +E  P  RGFD+  G+
Sbjct: 82  SPEAPYGLG------LEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRGFDSFFGF 135

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           +NG   Y           G D  +N + Y P    +Y TD F  ++  +IK+HN S+PLF
Sbjct: 136 YNGRGDYYTHEEVQSEVSGYDLHKNGKVYRPAF-GQYSTDIFNQEAEQIIKAHNASQPLF 194

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L + H AVH G       P  LLQ PD  +  + F HI    RR++A
Sbjct: 195 LYLAHQAVHAGV-----YPDRLLQAPD--KYYQRFPHIETEGRRMYA 234


>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
 gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
          Length = 545

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 99/222 (44%), Positives = 129/222 (58%), Gaps = 28/222 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG   IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL +LGY++H+ GKWH+G  K E  P  RGF +H   W     
Sbjct: 97  YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSHKNLW----- 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD  RN    A  +  +Y TD  T  SV VI +H+ ++ PLFL + H
Sbjct: 152 ------------GLDM-RNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAH 198

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AA H+G       P   L VPD  ++     HI +  RR +A
Sbjct: 199 AAGHSGN------PYNPLPVPD--DDVMKLDHILHYKRRRYA 232


>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
 gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
          Length = 545

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 99/222 (44%), Positives = 129/222 (58%), Gaps = 28/222 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGFHG   IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 37  GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+ EK+LPQYL +LGY++H+ GKWH+G  K E  P  RGF +H   W     
Sbjct: 97  YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSHKNLW----- 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD  RN    A  +  +Y TD  T  SV VI +H+ ++ PLFL + H
Sbjct: 152 ------------GLDM-RNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAH 198

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AA H+G       P   L VPD  ++     HI +  RR +A
Sbjct: 199 AAGHSGN------PYNPLPVPD--DDVMKLDHILHYKRRRYA 232


>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 374

 Score =  170 bits (431), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 134/227 (59%), Gaps = 15/227 (6%)

Query: 20  LPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID- 78
           + QGW+DV  HG + I TPNID LA  G+ L  +Y  P CTP+R+A +TGK+P   G+  
Sbjct: 40  IEQGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGKHPIHTGMQH 99

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +GA     + + EK + Q+LK LGYSTH +GKWH+G   E+ +P  RGFD+  GY+NG
Sbjct: 100 DTIGADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAEDYIPTRRGFDSFFGYYNG 159

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              Y T+ D+  E  F  G D  +N E + P    +Y T+ FT+++  +IK+HN S+PLF
Sbjct: 160 RGDYYTHEDT--EGGFG-GYDLHKNGEVHWPDF-GQYSTEIFTEEAQQIIKTHNASQPLF 215

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L + H AVH G  G       L++ P   +  + F +I    RR+FA
Sbjct: 216 LYLAHQAVHAGVYGKD-----LVEAP--HKYYQMFPNIKTEGRRMFA 255


>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
 gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
          Length = 556

 Score =  169 bits (428), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 89/190 (46%), Positives = 117/190 (61%), Gaps = 20/190 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGFHG   IPTPNIDALAY+G++LNR+Y  P CTPSR++ +TGKY    G+   V 
Sbjct: 37  GFNDVGFHGSAQIPTPNIDALAYSGLILNRYYVAPICTPSRSSLMTGKYAIHTGMQHTVL 96

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G   + +P+ EKLLPQYL +LGY++H+ GKWH+G  K    P  RGF++H G+W     
Sbjct: 97  YGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKWHLGHWKMPYTPLRRGFNSHHGFW----- 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD  RN  R A ++  +Y TD  T  S+ VI +H  S+ PLFL + H
Sbjct: 152 ------------GLDM-RNGSRVAYELHGQYTTDVITQHSIDVIANHPVSKGPLFLYVAH 198

Query: 201 AAVHTGTAGN 210
           AA H+G   N
Sbjct: 199 AAAHSGNPYN 208


>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 311

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 88/221 (39%), Positives = 132/221 (59%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D   HG + IPTPN+DA+A +GI+LN+HY  P CTPSRAA +TG+YPF  G+  + +
Sbjct: 1   GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                 A+P+   L+P+Y + LGY TH++GKWH+G    + +P  RGFD  +G++N  L 
Sbjct: 61  KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y +     +   G D R   + Y  +   +Y T ++T+++V +I+ HN S P+FL ++H 
Sbjct: 121 YYNQNFTGNNHTGHDFRCGDQNYWAE-EKEYATYYYTNKTVEIIRRHNKSAPMFLFLSHQ 179

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           A H  + G       LLQVP      R F++I   +R LFA
Sbjct: 180 APHV-SGGRP-----LLQVP--THGVRNFSYIGENNRTLFA 212


>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
 gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
          Length = 270

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 92/224 (41%), Positives = 128/224 (57%), Gaps = 14/224 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV FHG + IPTP ID LA  G++LN +Y  P CTP+RA+ +TGK+P   G+     
Sbjct: 14  GWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTGKHPMNLGMLIHTH 73

Query: 83  AGV----AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           A V       +P+ E   PQY+K LGY TH IGKWH+G  ++E  P  RGFD+  G+WNG
Sbjct: 74  ATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYRGFDSFYGFWNG 133

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
              Y D   + D   G D R N E+     S  Y T+ F +++  +I  HN ++PL+L +
Sbjct: 134 KEDYWDHSSQED-VWGTDLRDN-EKPVRNESGHYGTELFAERAAQIIHLHNQTKPLYLYL 191

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
               VH   + N   P   LQ P  +   + F+HIS+P RR++A
Sbjct: 192 AQQGVH---SANGNEP---LQAP--KRLIKKFSHISSPKRRIYA 227


>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
 gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
          Length = 542

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/222 (42%), Positives = 126/222 (56%), Gaps = 28/222 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGF G   IPTPNIDALAY+G++LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 43  GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 102

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+  K+LPQYL ELGY++H+ GKWH+G  K    P  RGF +H GYW     
Sbjct: 103 YAAEPRGLPLDLKILPQYLNELGYTSHIAGKWHLGHWKRVYTPLYRGFSSHHGYW----- 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD R   E  A  +  +Y TD  T  S+ VI +H  ++ PLFL + H
Sbjct: 158 ------------GLDMRNGTE-IAYDLHGQYTTDVITQHSLQVIANHKPAKGPLFLYVAH 204

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AAVH+G   N         +P  ++  R    I +  RR +A
Sbjct: 205 AAVHSGNPYNP--------LPASDDAVRRLDKIQHYKRRKYA 238


>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 557

 Score =  166 bits (421), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 91/224 (40%), Positives = 130/224 (58%), Gaps = 15/224 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GWND  FHG  +IPTPN+DALA +G++L  HY  P CTP+RAA LTG YPF  G+    +
Sbjct: 91  GWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALLTGLYPFHTGMQNFVI 150

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    +P+  K+LP YL E  Y +HL+GKWH+G +    LP  R F+ HVGY+NG++ 
Sbjct: 151 RTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTARHFNTHVGYYNGFID 210

Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y    H +   D  +GLD   N E    +    Y T  FT ++V++I++H  + PLF+ +
Sbjct: 211 YFTHEHISPGNDSLIGLDWHINEEN---ENEEGYATHLFTKRAVNLIENHKSTEPLFILL 267

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +H A H G   +        Q P   E+   FAHI + +R+++A
Sbjct: 268 SHLAPHAGCKRDP------FQAP--RESIEKFAHIKDQNRKVYA 303


>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
 gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
          Length = 531

 Score =  166 bits (421), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 94/222 (42%), Positives = 127/222 (57%), Gaps = 28/222 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGF G   IPTPNIDALAY+G++LNR+Y  P CTPSR+A +TGKYP   G+  T +
Sbjct: 32  GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 91

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+  K+LPQYL +LGY++H+ GKWH+G  +    P  RGF +H GYW     
Sbjct: 92  YAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGFSSHHGYW----- 146

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD R   E  A  +  +Y TD  T  S++VI  HN ++ PLFL + H
Sbjct: 147 ------------GLDMRNGTE-VAYDLHGQYSTDVITQHSLNVISKHNATKGPLFLYVAH 193

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AAVH+G   N         +P  ++  R    I +  RR +A
Sbjct: 194 AAVHSGNPYNP--------LPVKDDAVRRLDTIQHYKRRKYA 227


>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
 gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
          Length = 541

 Score =  166 bits (420), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 125/222 (56%), Gaps = 31/222 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G+NDVGFHG   IPTPNIDALAY+GI+LNR+Y  P CTPSR+A +TGKYP   G+   V 
Sbjct: 38  GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVL 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + + + EK+LPQYL +LGY++H+ GKWH+G  K +  P  RGF +H   W     
Sbjct: 98  YAAEPRGLSLKEKILPQYLNDLGYTSHIAGKWHLGHWKLKYTPLFRGFSSH---W----- 149

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQITH 200
                       GLD R   E  A  +  +Y TD  TD +V VI +HN  S PLFL + H
Sbjct: 150 ------------GLDMRNGTE-VAYDLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAH 196

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AA H+        P   L VPD E       HI +  RR FA
Sbjct: 197 AACHSSN------PYNPLPVPDNEV--MKLGHIPHYKRRKFA 230


>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
          Length = 500

 Score =  166 bits (420), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 90/220 (40%), Positives = 123/220 (55%), Gaps = 9/220 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW D   HG   IPTPN+DALA  G++LN +Y  P C+PSRAA +TG YP   GI  P+ 
Sbjct: 37  GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRAALMTGLYPAHNGIRMPLM 96

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 +P+  K+LP++LK+LGY TH++GKWH+G +     P  RGFD   G++NG + Y
Sbjct: 97  GAQVAGLPLQFKILPEHLKDLGYETHMVGKWHLGHSSLNYTPTYRGFDTFFGFYNGPIDY 156

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
              I E +  +GLD   N  R  P     Y T  F D + ++I + N S+PLFL + H A
Sbjct: 157 YHGIMEQEGHIGLDF-WNGTRALPLEERIYATTRFQDHANYIIANRNASKPLFLYLAHQA 215

Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           VH+     A  P   LQ P   EN + F  + +  R+  A
Sbjct: 216 VHS-----AYEPE-FLQAPG--ENTKKFPFLGDASRKSLA 247


>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
 gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
          Length = 619

 Score =  166 bits (420), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 130/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G++D+   G  +  TPNIDALAY+G +L+R Y    CTPSR A L+GKYP   G     +
Sbjct: 44  GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHYVI 103

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G      + +   L+P+  +E GYST+L+GKWH+G ++ E  P +RGFDNH GYW  Y+ 
Sbjct: 104 GNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPTHRGFDNHYGYWGAYID 163

Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y     +    +++VG D RRNM+         Y+TD  T+++  +I+      +PLFL 
Sbjct: 164 YYQRRSQMPLGNYSVGYDFRRNMQVECTDRGV-YVTDLLTNEAERIIREQPAKEQPLFLI 222

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT   GN   P   LQ P  EE  R F HI +P+RRL+A
Sbjct: 223 LSHLAPHT---GNTNKP---LQAP--EEELRKFTHIKDPNRRLYA 259


>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 476

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 132/230 (57%), Gaps = 27/230 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDV FHG + IPTP+IDALA  G++L  +Y  P CTP+R+A +TGK+P   G+     
Sbjct: 40  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G        E ++PQYL+ LGY TH++GKWH+G  KE L P +RGF+++ GY
Sbjct: 100 IADEPYGLG------TNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGY 153

Query: 136 WNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           + G   Y T+  + H      G D   N   Y P +  +Y T+ +T+++  +I++HN   
Sbjct: 154 YGGMQDYFTHESTEHT---LTGFDFHVNGSIYKP-VFGQYSTEIYTEKTQEIIRNHNPQE 209

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PL++ + H AVH+      +     LQ P   +    F +I+N +RR FA
Sbjct: 210 PLYIYLAHQAVHSANYNGQR-----LQAP--YKYYERFPNITNENRRKFA 252


>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
 gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
          Length = 498

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 131/227 (57%), Gaps = 23/227 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
           GW+D+ FHG   IPTPN+DALA +G++LN +Y  P  TPSRA+F+TGKYP   G+     
Sbjct: 12  GWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKYPIHMGVQHDTL 71

Query: 79  ---TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
               P G      VP+TEK LP++L+E+GY TH +GKW +G   +E  P  RGFD+  G+
Sbjct: 72  HNRQPFG------VPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFDSFFGF 125

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           W  +  Y + +   D   G+D RRN++  +   +  Y T+    ++  VI++H+  +PLF
Sbjct: 126 WTSHEDYYNHV-ANDGGYGIDLRRNLD-VSNDHTGVYGTELLAREADEVIENHSGDKPLF 183

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L + H AVH    GN   P   LQ P    +   F +I++  RR FA
Sbjct: 184 LYLAHQAVHV---GNMDEP---LQAPKRHVD--KFKYITDERRRTFA 222


>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
          Length = 784

 Score =  164 bits (415), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 95/225 (42%), Positives = 134/225 (59%), Gaps = 18/225 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV FHG   IPTPNID+LA +G++LN +Y  P+   ++  F+TGKY    G    V 
Sbjct: 1   GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P TEK+LPQYLKE GY+ + +GKW +G  KEE+LP+ RGFD    ++ G LT
Sbjct: 61  HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFD----FFYGGLT 116

Query: 142 YNDSIHET----DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            +   + T    D   GLD RRN E    + +  Y+T+ +T ++V++IK++N ++PLFL 
Sbjct: 117 SSGKDYYTHSAFDENYGLDLRRNNEVIHNE-TGNYITEVYTREAVNIIKNYNDNKPLFLY 175

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + H AVHTG A +       LQ P  E   +   HI N  R+LFA
Sbjct: 176 VAHQAVHTGNADDP------LQAP--ESYLKKLNHIKNIKRKLFA 212


>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
 gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
 gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
          Length = 564

 Score =  162 bits (411), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 130/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+ +   L+P+  KE GYST+L+GKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNME    +    Y+TD  T ++  +IK H +  +PLFL 
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNME-LECRDRGVYVTDLLTAEAERLIKDHADKEQPLFLM 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  + F++I +P+RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDPNRRKYA 253


>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
          Length = 274

 Score =  162 bits (411), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 130/221 (58%), Gaps = 10/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV FHG   IPTPN+DALA +GI+LN+HY    CTPSRAA +TG+YP   G+    +
Sbjct: 59  GWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMTGRYPIYTGMQHFVI 118

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    +P+  +L+P++  +LGY TH++GKWH+G  K++ +P  RGFD+  G++N    
Sbjct: 119 QPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRRGFDSFYGFYNADQD 178

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y +         G D   N E      +++Y T  +T+++V +I+SHN S+PLFL +++ 
Sbjct: 179 YYNKTLTEGEHTGYDFWLN-EDIHIYPNNRYSTHHYTERAVSLIRSHNPSQPLFLYLSYX 237

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
               GT         LL+ P  EEN   F +I   +R ++A
Sbjct: 238 XXXVGTG------PSLLEAP--EENVNKFLYIPEKNRTIYA 270


>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 479

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 128/230 (55%), Gaps = 24/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDV FHG + IPTP+IDALA  G++L  +Y  P CTP+R+A +TGK+P   G+     
Sbjct: 40  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G        E ++PQYL+ LGY TH++GKWH+G   +E  P  RGF++  GY
Sbjct: 100 IADEPYGLG------TNETIMPQYLRSLGYRTHMVGKWHLGFYSKEHTPIERGFESTFGY 153

Query: 136 WNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           + G   Y T+   +       G D   N   Y P +  +Y T+ +T+++  +I++HN   
Sbjct: 154 YLGQQDYFTHETQVKRKHTLTGFDFHVNGSIYKP-VFGQYSTEIYTEKTQEIIRNHNPQE 212

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PL++ + H AVH+      +     LQ P   +    F +I+N +RR FA
Sbjct: 213 PLYIYLAHQAVHSANYNGQR-----LQAP--YKYYERFPNITNENRRKFA 255


>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
 gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
          Length = 565

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+ +   L+P+  K  GYST+L+GKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 98  SNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNME       + Y+TD  T ++  +IK H +  +PLFL 
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMELECRDRGA-YVTDLLTAEAERLIKDHADKEQPLFLM 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  + FA+I +P+RR +A
Sbjct: 217 LSHLAAHTANKDDP------LQAP--EEEIQKFAYIKDPNRRKYA 253


>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
 gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
          Length = 565

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+ +   L+P+  KE GYST+L+GKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNME    +    Y+TD  T ++  +IK H    +PLFL 
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNME-LECRDRGVYVTDLLTAEAERLIKGHAGKEQPLFLM 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  + F++I +P+RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDPNRRKYA 253


>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
 gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
          Length = 565

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+ +   L+P+  K  GYST+L+GKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 98  SNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNME       + Y+TD  T ++  +IK H +  +PLFL 
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMELECRDRGA-YVTDLLTAEAERLIKDHADKEQPLFLM 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  + FA+I +P+RR +A
Sbjct: 217 LSHLAAHTANKDDP------LQAP--EEEIQKFAYIKDPNRRKYA 253


>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
          Length = 542

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 116/190 (61%), Gaps = 17/190 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DVGFHG   IPTPN+DALA +GI+L+ HY+ P CTPSR + LTGK+P + G+   V 
Sbjct: 35  GWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV- 93

Query: 83  AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  A P    + EKLLP+YLK LGY +H++GKWH+G   +E  P  RGFD+H G++  
Sbjct: 94  --IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGA 151

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              Y+T+   +   DF +     R+ + +       Y T  FT ++  ++  HN + P+F
Sbjct: 152 SEDYMTHIGGMGGLDFWLNGQPDRSGQGH-------YSTTLFTTKAEQLLAEHNQTEPMF 204

Query: 196 LQITHAAVHT 205
           L  +H AVHT
Sbjct: 205 LYFSHQAVHT 214


>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
          Length = 549

 Score =  162 bits (409), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 116/190 (61%), Gaps = 17/190 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DVGFHG   IPTPN+DALA +GI+L+ HY+ P CTPSR + LTGK+P + G+   V 
Sbjct: 12  GWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV- 70

Query: 83  AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  A P    + EKLLP+YLK LGY +H++GKWH+G   +E  P  RGFD+H G++  
Sbjct: 71  --IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGA 128

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              Y+T+   +   DF +     R+ + +       Y T  FT ++  ++  HN + P+F
Sbjct: 129 SEDYMTHIGGMGGLDFWLNGQPDRSGQGH-------YSTTLFTTKAEQLLAEHNQTEPMF 181

Query: 196 LQITHAAVHT 205
           L  +H AVHT
Sbjct: 182 LYFSHQAVHT 191


>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
          Length = 354

 Score =  162 bits (409), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 116/190 (61%), Gaps = 17/190 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DVGFHG   IPTPN+DALA +GI+L+ HY+ P CTPSR + LTGK+P + G+   V 
Sbjct: 12  GWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV- 70

Query: 83  AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  A P    + EKLLP+YLK LGY +H++GKWH+G   +E  P  RGFD+H G++  
Sbjct: 71  --IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGA 128

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              Y+T+   +   DF +     R+ + +       Y T  FT ++  ++  HN + P+F
Sbjct: 129 SEDYMTHIGGMGGLDFWLNGQPDRSGQGH-------YSTTLFTTKAEQLLAEHNQTEPMF 181

Query: 196 LQITHAAVHT 205
           L  +H AVHT
Sbjct: 182 LYFSHQAVHT 191


>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 502

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 96/223 (43%), Positives = 129/223 (57%), Gaps = 14/223 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--DTP 80
           GWND+ FHG N+IPTPNID LA NG++L+ +Y LP CTPSR+A +TG+YP   G+  DT 
Sbjct: 31  GWNDISFHGSNEIPTPNIDRLANNGVILDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 90

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G      V + EK LPQYLK+ GY TH +GKWH+G   ++  P  RGFD++ G + G  
Sbjct: 91  FGPN-PYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTPTYRGFDSYYGSYLGKG 149

Query: 141 TY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y N S  ET    GLD   N E         Y T+ +T +++  I +HN S PLFL + 
Sbjct: 150 DYWNHSNTET--YSGLDLHDN-ENGVFSQDGNYSTEMYTAEAISCINNHNSSEPLFLYLA 206

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + AVH+     A      LQ P  +E    F++I +  RR +A
Sbjct: 207 YQAVHS-----ANTEEDPLQAP--QEWIDKFSYIKHEQRRKYA 242


>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
 gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
          Length = 554

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 130/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 38  GFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+ +   L+P+  KE GYST+L+GKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNM+    +    Y+TD  T ++  +IK H +  +PLFL 
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMD-LECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLM 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  + F++I +P+RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDPNRRKYA 253


>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 588

 Score =  160 bits (404), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 80/186 (43%), Positives = 112/186 (60%), Gaps = 5/186 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV ++G   I TPNIDALA+NGI L R+YT P CTPSRAA +TG+YP   G+    +
Sbjct: 76  GWNDVSYNGCPQIRTPNIDALAWNGIRLQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVI 135

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+  KLLPQ+L +LGY + ++GKWH+G  K+E  P  RGF  H+G W G++ 
Sbjct: 136 LQNEPRGLPLKFKLLPQWLGDLGYVSQMLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVD 195

Query: 142 YNDSIHETDFAV---GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y   I          GLD R+ +     +   +Y T+F T+ +  VI++H   +PLFL +
Sbjct: 196 YYSHIRFNKIGFSHSGLDFRQGLSE-GREFDGQYYTEFMTEAATRVIENHPLEKPLFLYL 254

Query: 199 THAAVH 204
            H A H
Sbjct: 255 AHLAPH 260


>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 593

 Score =  159 bits (403), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 97/260 (37%), Positives = 133/260 (51%), Gaps = 49/260 (18%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV FHG + IPTPN+D LA +G++LN +Y  P CTPSRAA +TG YP R G+   P+
Sbjct: 47  GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY------ 135
                  +P   ++LPQYLKE GY THL+GKWH+G  KE L P  RGFD+  GY      
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166

Query: 136 -------------WNGYLTYN------DSIH-----ETDFAV---------GLDARRNME 162
                        W  +LT        DS +     E+D+           GLD   N +
Sbjct: 167 YFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTISYENHTGLDFWLNKK 226

Query: 163 RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
               ++ + Y T  FT ++ ++I++   S+PL L ITH A H        L    LQ P 
Sbjct: 227 PVWSEIGT-YSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCA------LERERLQAP- 278

Query: 223 MEENDRTFAHISNPDRRLFA 242
            +EN   F +I   +R ++A
Sbjct: 279 -QENIDKFPYIGEKNRTIYA 297


>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
 gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
          Length = 528

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/222 (41%), Positives = 124/222 (55%), Gaps = 31/222 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVGF G   IPTPNIDALAY+G++LNR+Y  P CTPSR+A +T KYP   G+  T +
Sbjct: 32  GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTAKYPIHTGMQHTVL 91

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P+  K+LPQYL +LGY++H+ GKWH+G  K    P  RGF +H   W     
Sbjct: 92  YAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPLYRGFSSH---W----- 143

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
                       GLD R   E  A  +  +Y TD  T  S++VI  HN ++ PLFL + H
Sbjct: 144 ------------GLDMRNGTE-LAYDLHGQYTTDVITQHSLNVIAKHNSTKGPLFLYVAH 190

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AAVH+G   N         +P  ++  R    I +  RR +A
Sbjct: 191 AAVHSGNPYNP--------LPAKDDIVRRLGTIQDYKRRKYA 224


>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
 gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
          Length = 554

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 129/225 (57%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+ +   L+P+  KE GYST+L+GKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNM+    +    Y+TD  T ++  +IK H +  +PLFL 
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMD-LECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLM 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  + F++I + +RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDSNRRKYA 253


>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
 gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
          Length = 575

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 128/225 (56%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALA++G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 48  GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +      + +   L+P+  ++ GYST+LIGKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQ 197
           Y      +   ++++G D RRNME    +    Y+TD  T+++  VI+      +PLFL 
Sbjct: 168 YYQRRSKMPARNYSLGYDFRRNME-LECRDRGVYVTDLLTNEAERVIREREGQEQPLFLV 226

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  R FA+I +P+RR +A
Sbjct: 227 LSHLATHTANEDDP------LQAP--EEEIRKFAYIKDPNRRKYA 263


>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
 gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
          Length = 575

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 127/225 (56%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALA++G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 48  GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +      + +   L+P+  ++ GYST+LIGKWH+G ++ E  P  RGFD H GYW  Y+ 
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQ 197
           Y      +   ++++G D RRNME    +    Y+TD  T+++  VI+       PLFL 
Sbjct: 168 YYQRRSKMPARNYSLGYDFRRNME-LECRDRGVYVTDLLTNEAERVIREREGQEEPLFLV 226

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  R FA+I +P+RR +A
Sbjct: 227 LSHLATHTANEDDP------LQAP--EEEIRKFAYIKDPNRRKYA 263


>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
          Length = 531

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 91/221 (41%), Positives = 122/221 (55%), Gaps = 11/221 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
           GW D   HG   IPTPN+DALA  G++LN +Y  P C+PSR A +TG YP   GI  P V
Sbjct: 37  GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRGALMTGLYPAHNGIRMPLV 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           GA VA  +P+  K+LP++LK+LGY TH++GKWH+G       P  RGFD   G+ NG + 
Sbjct: 97  GAQVA-GLPLQFKILPEHLKDLGYETHIVGKWHLGYFNLNYTPTYRGFDTFFGFHNGPID 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   I E +  VGLD   N     P     Y T    + +  +I + N S+PLFL + H 
Sbjct: 156 YYRGIMEQEGHVGLDF-WNGTSALPLKERTYATARLQNHAKSIIANRNTSKPLFLYLAHQ 214

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+  +         LQ P   EN + F +I +  R++ A
Sbjct: 215 AVHSVYSPE------FLQAP--VENTKKFPYIRDSSRKILA 247


>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 517

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 91/222 (40%), Positives = 127/222 (57%), Gaps = 23/222 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV +HG + IPTPNIDALA NG+ LNR Y  P C+P+RA+ LTG + F +GI  P+ 
Sbjct: 30  GWGDVSYHGGH-IPTPNIDALAKNGVELNRFYASPVCSPTRASLLTGLHIFNHGIIRPLA 88

Query: 83  AGVAK--AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
              A+   +PV  K++PQ+ KE GY T L GKWH+G + EE  P NRGFD   G+  G +
Sbjct: 89  NPTAEQYGLPVDLKIMPQFFKEAGYQTALSGKWHLGMHLEEYWPTNRGFDQSYGHMLGGI 148

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D +H +     LD  RN E   P     Y T+   +++V +I++ + +RPLFL +  
Sbjct: 149 GYFDHVHSSR----LDWHRNEE---PLFEDGYSTELIANEAVRIIETKDPNRPLFLYVAF 201

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A HT            +Q PD  +N   F++I +P  R +A
Sbjct: 202 NAPHTP-----------IQAPD--KNIELFSYIEDPLDRAYA 230


>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
 gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
          Length = 570

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 127/225 (56%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+R Y    CTPSR A L+G+YP   G    V 
Sbjct: 41  GFDDVSFRGGREFITPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 100

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     A+     L+P+  +  GYST+L+GKWH+G ++ E  P +RGFD H GYW  Y+ 
Sbjct: 101 SNEEPWALDSNATLMPEIFQRAGYSTNLVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYID 160

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNME       + Y+TD  T ++  +IK      +PLFL 
Sbjct: 161 YYQRRSKMPVANYSLGYDFRRNMELECRDRGT-YVTDLLTTEAERLIKEQAGKDKPLFLM 219

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  R F++I +P+RR +A
Sbjct: 220 LSHLATHTANEDDP------LQAP--EEEIRKFSYIKDPNRRKYA 256


>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
 gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
          Length = 560

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 87/225 (38%), Positives = 127/225 (56%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+   G  +  TPNIDALAY+G +L+R Y    CTPSR A L+GKYP   G    V 
Sbjct: 44  GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHSVI 103

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  + +   L+P+  ++ GYST+L+GKWH+G  + E  P  RGFD H GYW GY+ 
Sbjct: 104 LNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPTYRGFDYHYGYWGGYID 163

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
           Y      +   ++++G D RRNM+         Y+TD  T+++  +I+      +PLFL 
Sbjct: 164 YYQRRSQMPSDNYSMGYDFRRNMQVECTD-RGVYVTDLLTNEAERIIREQPAKEQPLFLI 222

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HTG   +       LQ P  EE  + F HI++P+RRL+A
Sbjct: 223 LSHLAPHTGNEIDP------LQAP--EEELQKFVHINDPNRRLYA 259


>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 511

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 87/222 (39%), Positives = 125/222 (56%), Gaps = 10/222 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW+DV FHG   IPTPN+D LA +GI+LN +Y  P CTPSRAA +TG YP   G+   V
Sbjct: 1   QGWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGV 60

Query: 82  -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                   +P++  ++PQYLK LGY TH++GKW++G  K    P  RGFD+  GY++   
Sbjct: 61  LSPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVE 120

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y +     D   GLD   N +     +S  Y T  +T+++  +I++ + S+P FL + +
Sbjct: 121 DYYNHTVLWDNQTGLDFWLNTQPLR-NVSGIYSTQLYTERTKFLIENRDVSKPFFLYLPY 179

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            AVH G   +       LQ P  +EN   F +I   +R +FA
Sbjct: 180 QAVHCGNFDDP------LQAP--QENIDKFPYIGEENRTIFA 213


>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
          Length = 545

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 93/227 (40%), Positives = 128/227 (56%), Gaps = 14/227 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D+  HG   IPTPNIDALA +GI+LN +Y  P CTPSRAA LTGK+P   G+    +
Sbjct: 38  GWDDISLHGSEQIPTPNIDALAADGILLNNYYVQPICTPSRAALLTGKHPVHLGLQHNTI 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A  A  + + E++LP+YL  LGY +H++GKWH+G    +  P  RGF +H GY NG   
Sbjct: 98  PAPSAYGLGLNERILPEYLNTLGYDSHMVGKWHLGYFTPQHTPTYRGFKSHFGYLNGCED 157

Query: 142 YNDSIHETDFAV------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y D     DF+       GLD   + + +      +Y T+ FT ++  +I+S N   PLF
Sbjct: 158 YLDHTLAYDFSTLGMDGWGLDFWNDTKIHRTSF-GQYSTEIFTTRAEELIRS-NTGEPLF 215

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L ++H AVH+G  G   L    LQ P    N   F +I + +RR  A
Sbjct: 216 LYMSHQAVHSGNPG---LNGSKLQAPWKYFN--KFNYIQSDERRRLA 257


>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 529

 Score =  156 bits (395), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 81/185 (43%), Positives = 108/185 (58%), Gaps = 2/185 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DV F G+  IPTPN+D LA  GI+LN +Y  P C PSR A ++G YP   G+   V 
Sbjct: 31  GWADVSFRGDPQIPTPNLDVLASQGIILNNYYVQPLCAPSRGALMSGLYPIHTGLQHLVP 90

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G G    +P    ++P+YLK LGY+TH+IGKWH+G +KE   P  RGFD+  GY NG   
Sbjct: 91  GPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGYHKESYTPTRRGFDSFYGYLNGGED 150

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y D       A GLD   N      +  + Y T+ FT ++  +IK H+ ++P+FL  +H 
Sbjct: 151 YYDHTILWSNASGLDFWENTTPVRNE-GNHYSTELFTKKAQSLIKHHDPAKPMFLYFSHQ 209

Query: 202 AVHTG 206
           AVH G
Sbjct: 210 AVHCG 214


>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
          Length = 1007

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/226 (38%), Positives = 127/226 (56%), Gaps = 18/226 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV FH    I TPNID LAY+G++LNRHY  P  T S+ A +TG +P   G  +   
Sbjct: 467 GWNDVSFHSSKQIFTPNIDVLAYHGVILNRHYCAPFGTASQVALMTGSHPLSVGTQS--- 523

Query: 83  AGVAKAVPVT----EKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           A      P T     KL+P+Y ++ GY+THLIGKW +G ++++  P  RGFD+H G+   
Sbjct: 524 ASNEPDQPWTLDPELKLMPEYFRDAGYATHLIGKWGLGFSRKDYTPTQRGFDSHFGFLGP 583

Query: 139 YLTYND-SIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y+ Y D S+   + +  GLD RRN++     ++  Y TD F  ++V +I+ H+  +PL L
Sbjct: 584 YIDYWDHSLRLRNTSTRGLDMRRNLD-VDYSVNGSYATDLFNGEAVRLIREHDQKKPLLL 642

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +TH A HTG   +       +Q P   E    F +I +  RR+ A
Sbjct: 643 VLTHLAPHTGNEDDP------MQAP--AEEVEKFDYIRDEKRRVLA 680


>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
 gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
          Length = 380

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/219 (41%), Positives = 121/219 (55%), Gaps = 12/219 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV FHG   IPTPN+D LA  G++LN +Y  P CTP+RA+ +TGKYP   G+    +
Sbjct: 16  GWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLMTGKYPIHLGMQHFVI 75

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +P+ E  LPQYL+  GY T  IGKWH+G   +E  P  RGFD+  G W+    
Sbjct: 76  YAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTYRGFDSFYGMWSAKAD 135

Query: 142 Y-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y N +  E  F  G D R NME        KY T+ FT +++ VI++HN S PLFL I H
Sbjct: 136 YWNHTSFENGFW-GTDMRNNMEPVTTD-KDKYATEVFTREALKVIENHNKSEPLFLYIAH 193

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
            A H+        P   LQ P  E+  + F+ + +   R
Sbjct: 194 QAPHSAN------PHDPLQAP--EDKVKKFSGVIDKIER 224


>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
 gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
          Length = 548

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 93/227 (40%), Positives = 128/227 (56%), Gaps = 16/227 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G N+  TPNIDALAY+G++LN  YT   CTPSRAA LTGKYP   G+   V 
Sbjct: 6   GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P  EK + +  +E GY T L+GKWH+G ++    P  RGFD+H+GY   Y+ 
Sbjct: 66  VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125

Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NHSRPLF 195
           Y D  ++    ++A G D R N+     Q+   Y+TD  +D +V +I+ H   N S+PLF
Sbjct: 126 YYDQTYQQNGKNYARGHDFRLNLNVTHDQV-GHYVTDVLSDAAVELIEQHSGSNSSQPLF 184

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L ++H A H   A NA  P   +Q P   E    F +I N   R +A
Sbjct: 185 LLLSHLAPH---AANADDP---MQAP--AEELAKFEYIRNETHRYYA 223


>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
          Length = 502

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/226 (38%), Positives = 125/226 (55%), Gaps = 23/226 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV FHG   +PTPNIDALA +GI+L+ +Y    C+PSR A +TGK+P + G+   V 
Sbjct: 35  GWDDVSFHGSRQVPTPNIDALASDGIILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGV- 93

Query: 83  AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  A P    + EKLLP+YL  LGY +H++GKWH+G   EE  P +RGF++H G++ G
Sbjct: 94  --IMPAQPSGLGLKEKLLPEYLNTLGYKSHMVGKWHLGMCAEEYTPMHRGFESHFGFYQG 151

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFL 196
             +Y   +       GLD   N E   P  S+  +Y T  FT ++  ++  H+ + P+FL
Sbjct: 152 CESYTTHMCGNS---GLDFWLNEE---PDHSAGGQYSTSLFTAKAEQLLAEHDTASPMFL 205

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + H AVH G              PD   +  +F  IS+  RR  A
Sbjct: 206 YLAHQAVHVGNQDQK------FYAPDKYTDKLSF--ISDDRRRQMA 243


>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
 gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
          Length = 576

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 126/225 (56%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+   G  +  TPNIDALA++G +L+R Y    CTPSR A L+GKYP   G    V 
Sbjct: 46  GFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAMCTPSRGALLSGKYPIHTGTQHFVI 105

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     ++ +   L+P+  +  GYST+L+GKWH+G  + E  P +RGFD H GYW  Y+ 
Sbjct: 106 SNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGYARPEFTPTHRGFDYHYGYWGAYID 165

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
           Y      + +  + +G D RRNME         Y+TD  T+++  VI+ +    +PLFL 
Sbjct: 166 YYQRRSQMPDKTYIMGYDFRRNMEVECAD-RGVYMTDLLTNEAERVIQETAAKQQPLFLM 224

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           I H AVHTG   +       LQVP  EE  + F HI +P+ R +A
Sbjct: 225 INHLAVHTGNDNDP------LQVP--EEELQKFTHIKDPNHRKYA 261


>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 516

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 125/221 (56%), Gaps = 15/221 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G++D+G+H ++ I TPN+D LA  G+ L  +Y  P CTP+R+  ++G+Y    G+    +
Sbjct: 51  GFHDIGYH-DSIIKTPNLDRLASEGVKLENYYVQPICTPTRSQLMSGRYQIHTGLQHGII 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E  +PQ LKE GY+TH++GKWH+G  K+E LP  RGFD   GY  G   
Sbjct: 110 WPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSED 169

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y       D   G+D R NM+   PQ + +Y T  F +++ ++IKSH+   PLFL +   
Sbjct: 170 YYTHNRSYDKFHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQ 229

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH           G LQVPD  E  + +A+I+N  RR +A
Sbjct: 230 AVH-----------GPLQVPDQYE--KPYANITNKQRRTYA 257


>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 528

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 76/194 (39%), Positives = 112/194 (57%), Gaps = 6/194 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+D+  HG + IPTPNID LA  G++L  +YT   CTPSR A +TGKYP   G+   V 
Sbjct: 32  GWDDISLHGSDQIPTPNIDKLAAEGVLLENYYTQAICTPSRGALMTGKYPIHLGLQYDVI 91

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G     +P   K++PQYL    Y +H+IGKWH+G ++ ELLP  RGF +H G+  G+  
Sbjct: 92  QGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKWHLGHSRSELLPTRRGFHSHFGFRLGHSD 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSK-----YLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y +   E    V  +    ++ ++ ++  K     Y  D FT +++ ++++HN + PLFL
Sbjct: 152 YFNHWGEESSPVKNEMYAGLDLWSNEVPIKKYHGTYANDLFTKRAISILETHNKTTPLFL 211

Query: 197 QITHAAVHTGTAGN 210
            + H AVH G   N
Sbjct: 212 YLAHQAVHVGDGEN 225


>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
 gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
          Length = 548

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 93/227 (40%), Positives = 128/227 (56%), Gaps = 16/227 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G N+  TPNIDALAY+G++LN  YT   CTPSRAA LTGKYP   G+   V 
Sbjct: 6   GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P  EK + +  +E GY T L+GKWH+G ++    P  RGFD+H+GY   Y+ 
Sbjct: 66  VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125

Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NHSRPLF 195
           Y D  ++    ++A G D R N+     Q+   Y+TD  +D +V +I+ H   N S+PLF
Sbjct: 126 YYDQTYQQNGKNYARGHDFRLNLNVTHDQV-GHYVTDVLSDAAVELIEQHSGSNSSQPLF 184

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L ++H A H   A NA  P   +Q P   E    F +I N   R +A
Sbjct: 185 LLLSHLAPH---AANADDP---MQAP--AEELAKFEYIRNETHRHYA 223


>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 469

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 93/214 (43%), Positives = 123/214 (57%), Gaps = 16/214 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG ++I TPN+D LA +G+ L R Y    CTP+RAAFLTG++PFRYG+ T V 
Sbjct: 46  GWNDVGFHG-SEIKTPNLDKLAASGVRLERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVI 104

Query: 83  AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
               K  +P+ EK + + LKE GY T ++GKWH+G  +E  LP +RGFD H G++ G   
Sbjct: 105 KPWDKVGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGID 164

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH-SRPLFLQ 197
           Y T+ND     DF   LD  RN      +    Y TD    ++V +I +HN+  +PLFL 
Sbjct: 165 YFTHND-----DFLGALDWHRNRIHLKEE---GYATDLIGQEAVKLINNHNYEQQPLFLY 216

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFA 231
           I   A HT      +     L + D  E  R FA
Sbjct: 217 IAFNAPHTPLHAKTEDIEDYLTIDD--EKRRVFA 248


>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
 gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
          Length = 528

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 124/229 (54%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVG+HG ++I TPN+D L+  G+ L  +Y  P CTPSR   +TG+Y  R G+    +
Sbjct: 40  GWNDVGYHG-SEIKTPNLDKLSAKGVRLENYYVQPLCTPSRNQLMTGRYQIRTGMQHQII 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ ++E GY+TH++GKWH+G  +++ LP  RGFD++ GY  G   
Sbjct: 99  WPCQPYCVPLDEKLLPQLMREAGYATHMVGKWHLGMYRKDCLPTRRGFDSYFGYLTGSED 158

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  +Y   ++ T  AV L   R  E  A   +  Y T  FTD+   +I   N  +P
Sbjct: 159 YFSHQRCSYVPPLNVTRCAVDL---REGEEVATGYTGTYSTQLFTDRVTSIIAKQNSKKP 215

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   AVH             LQVP  E     ++ I +P+RRL+A
Sbjct: 216 LFLYVALQAVHAP-----------LQVP--ERYVAPYSFIKDPNRRLYA 251


>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
 gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
          Length = 567

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 122/225 (54%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DV F G  +  TPNIDALAY+G +L+  Y    CTPSR A L+G+YP   G    V 
Sbjct: 39  GFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPAMCTPSRGALLSGRYPAHTGTQHFVI 98

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +      + +   L+P+  KE GYST+LIGKWH+G    E  P  RGFD H GYW  Y+ 
Sbjct: 99  SNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLGFASPEYTPTRRGFDYHYGYWGAYID 158

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQ 197
           Y      +   ++++G D RRNM+    Q    Y+TD  T ++ HVI+    +    FL 
Sbjct: 159 YYQRRSQMPVANYSMGYDFRRNMDLEC-QNRGVYITDLLTQEAEHVIREKAAANETFFLM 217

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A HT    +       LQ P  EE  R FA+I +P RR +A
Sbjct: 218 LSHLATHTANDNDP------LQAP--EEEIRKFAYIKDPRRRKYA 254


>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
          Length = 484

 Score =  153 bits (386), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 90/221 (40%), Positives = 128/221 (57%), Gaps = 18/221 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVG++   +I TPN+D LA NG++LN  Y L TC+PSR A LTG+YPF+ G+    V
Sbjct: 39  GWNDVGWNNP-EIKTPNLDRLASNGVILNASYALSTCSPSRTALLTGRYPFKLGLQHGVV 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    +P+   LLPQ LK LGYSTH IGKWH+G  + E  P  RGFD+  G+++G   
Sbjct: 98  KKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRGFDSFYGFYSGSED 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y     +T    G D R N + + P++  KY T  +  ++V +I++H  + PLFL +   
Sbjct: 158 YYK--RKTAAIRGYDFRMNTKVFKPKI-KKYSTLDYGRRAVKIIQAHKRTEPLFLYMPFQ 214

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH+            LQVP   E    + +I + +RR+F+
Sbjct: 215 AVHSP-----------LQVPKSFEFK--YRNIVDRNRRIFS 242


>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
 gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
          Length = 525

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/231 (39%), Positives = 131/231 (56%), Gaps = 26/231 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G N+  TPNIDALAY+G++LN  YT   CTPSR+A LTGKYP   G+     
Sbjct: 6   GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGMQHYVI 65

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  + +  ++ GY T L+GKWH+G ++    P  RGFD H+GY
Sbjct: 66  VNDQPWG------LPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGY 119

Query: 136 WNGYLTYNDSIH---ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HS 191
              Y+ Y D  +     +++ G D R N+E    ++   Y+TD  +D +V +I+ HN  +
Sbjct: 120 LGAYIDYYDQTYLQSSQNYSRGHDFRDNLEASHDKV-GHYVTDILSDAAVELIEKHNVTA 178

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +PLFL ++H A H   A N   P   LQ P MEE  + F +I N   R +A
Sbjct: 179 KPLFLLLSHLAPH---AANDNDP---LQAP-MEELSQ-FEYIQNKSHRYYA 221


>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 124/221 (56%), Gaps = 15/221 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G++D+G+H ++ I TPN+D LA  G+ L  +Y  P CTP+R+  ++G+Y    G+    +
Sbjct: 25  GFHDIGYH-DSIIKTPNLDRLASEGVKLENYYVQPKCTPTRSQLMSGRYQIHTGLQHGII 83

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E  +PQ LKE GY+TH++GKWH+G  K+E LP  RGFD   GY  G   
Sbjct: 84  WPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSED 143

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y       D   G+D R NM+   PQ + +Y T  F +++ ++IKSH+   PLFL +   
Sbjct: 144 YYTHNRSYDKFHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLFLYLPLH 203

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH           G LQVPD  E  + + +I+N  RR +A
Sbjct: 204 AVH-----------GPLQVPDQYE--KPYTNITNKQRRTYA 231


>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
          Length = 431

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 57  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      +S++ T  A+ L   R+ E  A + ++ Y T+ FT ++  VI +H   +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268


>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
          Length = 408

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 58  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPREKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269


>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
 gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
          Length = 584

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 93/232 (40%), Positives = 123/232 (53%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G N+  TPNIDALAY+G++LN  Y  P CTPSRAA LTGKYP   G+     
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  + +  +E GY T L+GKWH+G ++    P  RGFD H+GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGY 159

Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
              Y+ Y    +E       G D R N++    Q+   Y+TD  TD +V  I+ H   N 
Sbjct: 160 LGAYVDYYTQSYEQQSKGYNGHDFRDNLKSSHDQV-GHYITDVLTDAAVKEIEDHASKNS 218

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + H A H     N       +Q P  EE  R F +I N   R +A
Sbjct: 219 SQPLFLLLNHLAPHAANDDNP------MQAP-AEEVSR-FEYIRNKTHRYYA 262


>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
          Length = 431

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 57  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      +S++ T  A+ L   R+ E  A + ++ Y T+ FT ++  VI +H   +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268


>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 57  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      +S++ T  A+ L   R+ E  A + ++ Y T+ FT ++  VI +H   +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268


>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 307

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 122/222 (54%), Gaps = 18/222 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H   DI  PN+  LA +G++ N+ YT PTCTPSRAA +TG YPF+ G    +
Sbjct: 1   MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+ LKE+GYSTH++GKWH+G  K+E LP NRGFD+H G W  G 
Sbjct: 60  AFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             Y+        + G D R NM    P+ S  YL     D++ H++ +H    PLFL  T
Sbjct: 120 GDYDKMDGVLSPSAGYDFRDNM-GVVPK-SDDYLALMLGDRAEHIVNTHYPGTPLFLAFT 177

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
                        +P   L++P  EE +  +A I +   R F
Sbjct: 178 -----------LDIPAKHLEIP--EEYEEKYAEIEDDRTRQF 206


>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
          Length = 316

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT ++  +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
 gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
          Length = 413

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 56  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267


>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
          Length = 415

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 58  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269


>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
 gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
 gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
          Length = 534

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 57  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      +S++ T  A+ L   R+ E  A + ++ Y T+ FT ++  VI +H   +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268


>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
 gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
 gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
 gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
          Length = 413

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 56  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267


>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 461

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 123/222 (55%), Gaps = 18/222 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H  +DI  PN+  LA +G++ N+ YT PTCTPSRAA +TG YPF+ G    +
Sbjct: 1   MGWNDVHWH-NSDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+ LKE+GYSTH++GKWH+G  K+E LP NRGFD+H G W  G 
Sbjct: 60  VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             Y+        + G D R NM    P+ S  YL     D++ H++ +H    PLFL  T
Sbjct: 120 SDYDKMNGVLSPSAGYDFRDNM-GVVPK-SDDYLALMLGDRAEHIVNTHYPGTPLFLAFT 177

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
                        +P   L++P  EE +  +A I +   R F
Sbjct: 178 -----------LDIPAKHLEIP--EEYEEKYAEIEDDRTRQF 206


>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 57  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      +S++ T  A+ L   R+ E  A + ++ Y T+ FT ++  VI +H   +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268


>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
          Length = 415

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 58  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269


>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
          Length = 545

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 89/222 (40%), Positives = 126/222 (56%), Gaps = 21/222 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVG+H   DI TPN+D +A  G++LN  Y  P CTPSR +FLTG YPFR G+  T +
Sbjct: 39  GWNDVGWHNP-DIKTPNLDRMAGGGVILNSSYVHPICTPSRNSFLTGVYPFRVGLSGTAI 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               A+ + +    LP+ LK+LGYSTH+IGKWH+G   E   P  RGFD+ +G++ G   
Sbjct: 98  TPHQARFMSLKTPTLPEKLKKLGYSTHMIGKWHLGFCNERYTPTRRGFDSFLGFYTGTQD 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
           Y    H T  A G D R N   + P    +Y T  +  ++V +I  H   + PLFL +  
Sbjct: 158 YYK--HTT--AKGYDFRFNQTVFYPP-KKQYSTKTYAKRAVDIITEHKRKKNPLFLYLAF 212

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +VHT            LQVP  ++ ++ + +I N DRR+++
Sbjct: 213 QSVHTP-----------LQVP--KKYEKQYNNIKNKDRRVYS 241


>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
 gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
 gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
 gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
          Length = 533

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 56  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267


>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
 gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 58  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269


>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
 gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
 gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
 gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
          Length = 502

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 125/229 (54%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG ++I TP +D L+  G+ L  +YT P CTPSR+  L+G+Y    G+   + 
Sbjct: 24  GWNDVGFHG-SEILTPTLDFLSGQGVRLAGYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 82

Query: 83  AGVAKAV-PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                   P+ +KLLP+ LKE GY TH++GKWH+G  K + LP  RGFD++ GYW G   
Sbjct: 83  WPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMYKTDCLPTRRGFDSYFGYWTGGED 142

Query: 142 YNDSIHETDFAV--------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y    HE  + +         LD  R+ E  A     KY T  FTD++V +I +HN  +P
Sbjct: 143 YYS--HERCYLITTLNITRCALDF-RDGEVPATDYQMKYSTHLFTDRAVDLITNHNPEKP 199

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL + + AVH+            LQVPD      T  H  N  RRL+A
Sbjct: 200 LFLYLAYQAVHSP-----------LQVPDQYIEPYTSIHDKN--RRLYA 235


>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
          Length = 429

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG + I TP +DALA  G++L+ +YT P CTPSR+  LTG+Y    G+   + 
Sbjct: 56  GWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +RR +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDRNRRYYA 267


>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
          Length = 533

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 56  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267


>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
 gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
 gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
 gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
 gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
 gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 58  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269


>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
 gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 56  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267


>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
          Length = 535

 Score =  149 bits (376), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 58  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y         D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269


>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
 gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
          Length = 574

 Score =  149 bits (376), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 122/225 (54%), Gaps = 14/225 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+   G  +  TPNIDAL ++G +L+R Y    CTPSR A L+GKYP   G    V 
Sbjct: 46  GFDDISLRGGREFLTPNIDALGFHGRLLDRLYAPAMCTPSRGALLSGKYPIHTGTQHFVI 105

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           +     ++ +   L+P+  +  GYST+L+GKWH+G  + E  P +RGFD H GYW  Y+ 
Sbjct: 106 SNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWHLGFARPEYTPTHRGFDYHYGYWGAYID 165

Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
           Y      + E  + VG D RRNME         Y+TD  T+++  +I+ +    +PLFL 
Sbjct: 166 YYQRRSQMPEKTYIVGYDFRRNMEVECTD-RGVYVTDLLTNEAERIIQETAAKQKPLFLM 224

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           I H A HT    +       LQ P  EE  + F HI +P+ R +A
Sbjct: 225 INHLATHTANDNDP------LQAP--EEEVQKFLHIKDPNHRKYA 261


>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
          Length = 534

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IHTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y         D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
          Length = 414

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT ++  +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
 gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
          Length = 591

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 92/232 (39%), Positives = 122/232 (52%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G N+  TPNIDALAY+G++LN  Y  P CTPSRAA LTGKYP   G+     
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  + +  +E GY T L+GKWH+G ++    P  RGFD H GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHFGY 159

Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
              Y+ Y    +E       G D R N++     +  +Y+TD  TD +V  I+ H   N 
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHV-GRYITDVLTDAAVKEIEDHGSKNS 218

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + H A H     N       +Q P  EE  R F +I N   R +A
Sbjct: 219 SQPLFLLLNHLAPHAANDDNP------MQAP-AEEVSR-FEYIGNKTHRYYA 262


>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
 gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
          Length = 533

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG + I TP +DALA  G++L+ +YT P CTPSR+  LTG+Y    G+   + 
Sbjct: 56  GWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITNHPPEKP 231

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +RR +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDRNRRYYA 267


>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
          Length = 534

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT ++  +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATALITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268


>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
 gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
 gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
 gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
 gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
 gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
          Length = 585

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 125/232 (53%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G N+  TPNIDALAY+G++LN  Y  P CTPSRAA LTGKYP   G+     
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  + +  +E GY T L+GKWH+G ++    P  RGFD H+GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGY 159

Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
              Y+ Y    +E       G D R +++     +   Y+TD  TD +V  I+ H   N 
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHV-GHYVTDLLTDAAVKEIEDHGSKNS 218

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + H A H   A N   P   +Q P  EE  R F +ISN   R +A
Sbjct: 219 SQPLFLLLNHLAPH---AANDDDP---MQAP-AEEVSR-FEYISNKTHRYYA 262


>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
          Length = 532

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 129/229 (56%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG + I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +
Sbjct: 55  GWNDVGFHGSS-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRMGLQHQII 113

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  ++++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +P
Sbjct: 174 YYSHERCTLINALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALITNHPPEKP 230

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 231 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 266


>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
 gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
          Length = 535

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDV FHG N I TP++D LA  G++L+ +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 58  GWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y         DS++ T  A+     R+ E+ A    + Y T+ FT+++  +I SH   +P
Sbjct: 177 YYSHERCALIDSLNVTRCALDF---RDGEQVATGYKNMYSTNIFTERATALITSHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHYYA 269


>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
          Length = 491

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 87/225 (38%), Positives = 121/225 (53%), Gaps = 25/225 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ND+G+HG ++I TPN+D LA  G+ L  +Y  P CTP+R+  ++G    RY I T + 
Sbjct: 35  GYNDIGYHG-SEIKTPNLDKLAGEGVKLENYYVQPICTPTRSQLMSG----RYQIHTGLQ 89

Query: 83  AGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            GV +      +P+   +LPQ LKE+GYSTH +GKWH+G  KEE LP NRGFD+H GY  
Sbjct: 90  HGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEYLPTNRGFDSHFGYLT 149

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           G   Y           G D R NM       +  Y T  F  ++  V+ +HN  +PLFL 
Sbjct: 150 GAEDYFKHDRCFGAMCGTDLRDNMN--PANYTGVYSTHLFAQKAAEVVNNHNTDKPLFLY 207

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +   +VH             LQVP  E+  + + HI +  RR +A
Sbjct: 208 LPFQSVHAP-----------LQVP--EQYTKPYMHIQDKQRRTYA 239


>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
          Length = 506

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 35/254 (13%)

Query: 4   PVGAGVAKAVP------VTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP 57
           P+G+G   + P      + + L   GWNDVGFHG   I TP++DALA  G++L+ +YT P
Sbjct: 7   PLGSGAEASRPPHLVFVLADDL---GWNDVGFHGSC-IRTPHLDALAAGGVLLDNYYTQP 62

Query: 58  TCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG 116
            CTPSR+  LTG+Y  R G+    +       VP+ EKLLPQ LKE GY+TH++GKWH+G
Sbjct: 63  LCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLG 122

Query: 117 CNKEELLPFNRGFDNHVGYWNGYLTYN--------DSIHETDFAVGLDARRNMERYAPQM 168
             ++E LP  RGFD + GY  G   Y         D+++ T  A+     R+ E  A   
Sbjct: 123 MYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDF---RDGEEVATGY 179

Query: 169 SSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDR 228
            + Y T+ FT ++  +I +H   +PLFL +   +VH             LQVP  EE  +
Sbjct: 180 KNMYSTNIFTKRATALITNHPPEKPLFLYLALQSVHEP-----------LQVP--EEYLK 226

Query: 229 TFAHISNPDRRLFA 242
            +  I + +R  +A
Sbjct: 227 PYDFIQDKNRHHYA 240


>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
          Length = 520

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 88/227 (38%), Positives = 125/227 (55%), Gaps = 22/227 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   + TP++DALA  G+ L+ +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 43  GWNDVGFHGSR-LRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 102 WPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 161

Query: 142 YNDSIHETDFAVGLDAR------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y    H   F   L+        R+ E  A +  + Y  + F ++++ +I +H   +PLF
Sbjct: 162 YFSHEHCV-FIKALNVTRCALDFRDGEEVATEYKNMYSANIFANRAISLIANHPPEKPLF 220

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +   +VH             LQVP  EE  + +  I + +RRL+A
Sbjct: 221 LYLALQSVHEP-----------LQVP--EEYLKPYDFIRDKNRRLYA 254


>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
 gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
          Length = 224

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 80/186 (43%), Positives = 110/186 (59%), Gaps = 5/186 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV F G+  IPTPN+D LA  GI+LN +Y L  CTPSR A ++G YP   G+   V 
Sbjct: 13  GWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGLYPIHTGLQHYVQ 72

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+   ++P++LK LGY+TH+IGKW++G  KE   P  RGFD+  G+ NG   
Sbjct: 73  LPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGFDSFYGFLNGGED 132

Query: 142 YNDSIHETDF-AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y D  H   F + GLD          Q S  Y TD FT +++ +IK H+ ++P+FL  +H
Sbjct: 133 YYD--HTILFVSTGLDFWDGTTPVRNQ-SHHYSTDLFTKKALALIKDHDQAKPMFLYFSH 189

Query: 201 AAVHTG 206
            AVH+G
Sbjct: 190 QAVHSG 195


>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
          Length = 503

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 89/226 (39%), Positives = 126/226 (55%), Gaps = 20/226 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   + TP++DALA  G+ L+ +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 27  GWNDVGFHGSR-LRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 85

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ L+E GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 86  WPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRGFDTYFGYLLGSED 145

Query: 139 YLTYNDSIHETDFAVGLDAR--RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y T+   +      V   A   R+ E  A Q  + Y T+ FT+++  +I +H   +PLFL
Sbjct: 146 YYTHEHCVFIKALNVTRCALDFRDGEEVATQYKNLYSTNIFTNRATSLIANHPPEKPLFL 205

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +   +VH             LQVP  EE  + +  I + +R L+A
Sbjct: 206 YLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHLYA 238


>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
 gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
          Length = 585

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 123/232 (53%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G ++  TPNIDALAY+G++LN  Y  P CTPSRAA LTGKYP   G+     
Sbjct: 46  GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  + +  +E GY T L+GKWH+G ++    P  RGFD H GY
Sbjct: 106 VNDQPWG------LPINETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGY 159

Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
              Y+ Y    +E       G D R N+ +        Y+TD  TD +V  I+ H   N 
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDNL-KSTHDYVGHYVTDVLTDAAVKEIEDHGSKNS 218

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + H A H   A N   P   +Q P  EE  R F +ISN   R +A
Sbjct: 219 SQPLFLLLNHLAPH---AANDDDP---MQAP-AEEVSR-FEYISNKTHRYYA 262


>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
 gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
          Length = 585

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 124/232 (53%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G ++  TPNIDALAY+G++LN  Y  P CTPSRAA LTGKYP   G+     
Sbjct: 46  GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  + +  +E GY T L+GKWH+G ++    P  RGFD H GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGY 159

Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
              Y+ Y    +E       G D R N++     +   Y+TD  TD +V  I+ H   N 
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHV-GHYVTDVLTDAAVKEIEDHGSKNS 218

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + H A H   A N   P   +Q P  EE  R F +ISN   R +A
Sbjct: 219 SQPLFLLLNHLAPH---AANDDDP---MQAP-AEEVSR-FEYISNKAHRYYA 262


>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  147 bits (370), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 89/222 (40%), Positives = 122/222 (54%), Gaps = 18/222 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H   DI  PN+  LA +G++ N+ YT PTCTPSRAA +TG YPF+ G    +
Sbjct: 37  MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 95

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+ LKE+GYSTH++GKWH+G  K+E LP NRGFD+H G W  G 
Sbjct: 96  VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 155

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             Y+        + G D R NM    P+ S  YL     D++ H++ +H    PLFL  T
Sbjct: 156 GDYDKMNGVLSPSAGYDFRDNM-GVVPK-SDGYLALMLGDRAEHIVNTHYPGTPLFLAFT 213

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
                        +P   L++P  EE +  ++ I +   R F
Sbjct: 214 -----------LDIPAKHLEIP--EEYEEKYSDIEDDRTRQF 242


>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
 gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
          Length = 535

 Score =  147 bits (370), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 128/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 58  GWHDVGFHGSR-IRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T  D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALISNHPPEKP 233

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +RR +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIHDKNRRYYA 269


>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 286

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 89/222 (40%), Positives = 122/222 (54%), Gaps = 18/222 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H   DI  PN+  LA +G++ N+ YT PTCTPSRAA +TG YPF+ G    +
Sbjct: 1   MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+ LKE+GY+TH++GKWH+G  K+E LP NRGFD+H G W  G 
Sbjct: 60  VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             Y+        + G D R NM    P+ S  YL     D++ H++ +H    PLFL  T
Sbjct: 120 GDYDKLNGVLSPSAGYDFRDNM-GVVPK-SDGYLALMLGDRAEHIVNTHYPGTPLFLTFT 177

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
                        +P   L++P  EE +  +A I +   R F
Sbjct: 178 -----------LDIPAKHLEIP--EEYEEAYADIEDDRTRQF 206


>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
          Length = 534

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 126/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 57  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 115

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y         D+++ T  A+     R+ E  A    + Y T+ FT ++  +I +H   +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATTLITNHPPEKP 232

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHYYA 268


>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 490

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 126/232 (54%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DV FHG   IPTPNID LA +G++LN +Y LP CTPSRAA +TG YP R G+  T +
Sbjct: 37  GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +P+  K+LPQ+ K+LGY  ++IGKWH+G  K   +P  RGFD   G++ G   
Sbjct: 97  QAAGPWGLPLENKILPQHFKDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 156

Query: 142 Y----NDSIHETDFAVGLDARRN------MERYAP-QMSSKYLTDFFTDQSVHVIKSHNH 190
           Y    + S H    AV    + N      +  + P ++S  +L   ++  + ++      
Sbjct: 157 YYNHTSGSSHRKILAVTSSVQVNTLEKGRLSLWGPRELSVCFLHQIYSPLNFYL------ 210

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +P F  I+H AVH   A NA+    + Q P    N   F++I  P+R ++A
Sbjct: 211 -QPFFCYISHQAVH--HALNAE----MFQAP--ARNVLKFSYIGEPNRTIYA 253


>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
 gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 19/227 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D   HG   + TPN+D L  +G+ L+R+YT   C+P+R A LTGKY    G+   P+
Sbjct: 34  GWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVLTGKYAHTVGMQGMPL 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ E+L+ QYL++ GY T ++GKWH+G    E LP  RGF+NH G   G++ 
Sbjct: 94  SNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGVRGGFID 153

Query: 142 YNDSIHETDFAVGLDARRN-----MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLF 195
           Y    +E +    LD R        +   P  +++ Y+TD +T++S  +I++HN S PL+
Sbjct: 154 Y----YEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEPLY 209

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +TH A H G    +      LQ P   E  R   H+    RR+FA
Sbjct: 210 LLLTHHAPHNGNEDAS------LQAP--PEEVRAQRHVELHPRRIFA 248


>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
          Length = 547

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 19/227 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D   HG   + TPN+D L  +G+ L+R+YT   C+P+R A LTGKY    G+   P+
Sbjct: 34  GWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVLTGKYAHTVGMQGMPL 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ E+L+ QYL++ GY T ++GKWH+G    E LP  RGF+NH G   G++ 
Sbjct: 94  SNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGVRGGFID 153

Query: 142 YNDSIHETDFAVGLDARRN-----MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLF 195
           Y    +E +    LD R        +   P  +++ Y+TD +T++S  +I++HN S PL+
Sbjct: 154 Y----YEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEPLY 209

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +TH A H G    +      LQ P   E  R   H+    RR+FA
Sbjct: 210 LLLTHHAPHNGNEDAS------LQAP--PEEVRAQRHVELHPRRIFA 248


>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 19/227 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D   HG   + TPN+D L  +G+ L+R+YT   C+P+R A LTGKY    G+   P+
Sbjct: 34  GWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVLTGKYAHTVGMQGMPL 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +P+ E+L+ QYL++ GY T ++GKWH+G    E LP  RGF+NH G   G++ 
Sbjct: 94  SNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGVRGGFID 153

Query: 142 YNDSIHETDFAVGLDARRN-----MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLF 195
           Y    +E +    LD R        +   P  +++ Y+TD +T++S  +I++HN S PL+
Sbjct: 154 Y----YEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEPLY 209

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +TH A H G    +      LQ P   E  R   H+    RR+FA
Sbjct: 210 LLLTHHAPHNGNEDAS------LQAP--PEEVRAQRHVELHPRRIFA 248


>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
          Length = 520

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVG+H  ++I TPN+D L+  G+ L  +Y  P C+PSR   +TG+Y    G+    +
Sbjct: 40  GWNDVGYH-NSEIKTPNLDLLSAKGVRLQNYYVQPLCSPSRNQLMTGRYQIHTGMQHQII 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  K++ LP +RGFD++ GY+ G   
Sbjct: 99  WPCQPYCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYKKDCLPTHRGFDSYFGYYLGSED 158

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+       +++ T  A+ L   R+ E  A      Y T+ F+ ++V VI  HN S+P
Sbjct: 159 YYTHTRCYPITALNLTRCALDL---RDGEEVATAYKGAYSTELFSQRAVSVIAKHNASQP 215

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   AVH             LQVP  E     ++ I +  RR +A
Sbjct: 216 LFLYVAMQAVHEP-----------LQVP--ERYVTPYSFIKDVSRRKYA 251


>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
 gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
          Length = 538

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 127/224 (56%), Gaps = 23/224 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H  ++I TPN+D LA +   L+R Y   +CTP+RAA +TG++P RYG+ + V 
Sbjct: 54  GWNDVGYHN-SEIKTPNLDKLAESSTRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVI 112

Query: 83  AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               K  +P+ EK + Q LKE GY T ++GKWH+G  KEE LP  RGFD H G++ G + 
Sbjct: 113 WPWDKVGLPLEEKTIAQTLKEAGYYTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSID 172

Query: 142 YNDSIHETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQI 198
           Y    H+ D  +  GLD  RN +   P     Y TD    ++V +I+  ++++ PLFL +
Sbjct: 173 Y--FTHQLDAGIQGGLDWHRNEQ---PVEEEGYATDLLAQEAVKLIRDCDYNKSPLFLYV 227

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +  A H                   E++ + +A+I +  RR+FA
Sbjct: 228 SFNAPHAPLQAK-------------EKDIKNYANIQDEGRRIFA 258


>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 527

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 88/221 (39%), Positives = 120/221 (54%), Gaps = 16/221 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW+DV FHG   IPTPN+DALA +GI+LN +Y  P CTPSRAA +TG YP   G+   V 
Sbjct: 34  GWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAALMTGMYPIHTGLQHGVL 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +P+  K++P+Y K+LGY THLIGKW++G   +E  P  RGFD+  G++N    
Sbjct: 94  LAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPTYRGFDSFYGFYNYEED 153

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    H  +F      + N   + P     YLT    D    ++ +     P FL ++H 
Sbjct: 154 Y--FTHNLEFV----NQSNAMVWRPSSFCVYLT-LSPDTGFDLLSASIERGPFFLYLSHQ 206

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +VH G +GN       LQ P  EEN   F +I +  R  +A
Sbjct: 207 SVH-GASGNDP-----LQAP--EENIAKFPYIGDERRTKYA 239


>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
 gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
 gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
 gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
          Length = 528

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 129/229 (56%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 51  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLI 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LK+ GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 110 MTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 169

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      + ++ T  A+ L   R+ E  A + +  Y T+ FT ++  +I +H   +P
Sbjct: 170 YYTHEACAPIECLNGTRCALDL---RDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKP 226

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 227 LFLYLAFQSVHDP-----------LQVP--EEYMEPYDFIQDKHRRIYA 262


>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 118/205 (57%), Gaps = 16/205 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV +H   DI  P +  LA +G++ N+ YT PTCTPSRAA +TG YPFR G    + 
Sbjct: 38  GWNDVEWHNP-DIKMPVLSKLAADGVIFNQSYTHPTCTPSRAAMMTGMYPFRTGNQHQMI 96

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             +    VP+  KLLP+ LKE+GY TH++GKWH+G  KEE LP +RGFD+H G W   + 
Sbjct: 97  FNLHPSGVPLEFKLLPEKLKEVGYFTHMVGKWHLGFCKEEYLPTSRGFDSHYGLWTLGVG 156

Query: 142 YNDSIHET-DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           + D ++     + G D R N+    P+ S +YLT    +++ H+I  H +  PLFLQ T 
Sbjct: 157 HYDKMNGVLSPSEGYDFRDNI-GVVPK-SDEYLTLMLAERAEHIINGHYNKHPLFLQFT- 213

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEE 225
                       +P   L++PD  E
Sbjct: 214 ----------MDIPAKHLEIPDTFE 228


>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 784

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 121/221 (54%), Gaps = 13/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVG+HG + I TPNIDALA  G+ L+ +Y    CTPSR   L+G+Y    G+    +
Sbjct: 53  GWSDVGYHG-SVIKTPNIDALASEGVKLDNYYMSLLCTPSRGQLLSGRYEIHTGLQHRTI 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E +LPQ LKE GY+TH++GKWH+G  ++E LP  RGFD  +G++ G   
Sbjct: 112 DMMQPLCLPIDETILPQKLKERGYATHMVGKWHLGFYRKECLPNYRGFDTFMGFYQGMAD 171

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y      T    G D RRN +  A + + +Y T  F D++  +I  HN   PLFL ++  
Sbjct: 172 YYYHNISTGIYHGWDFRRNNDVIAQKYAGQYSTHVFADEAQIIIMKHNPEVPLFLFLSFQ 231

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           A+H        LP   LQVP    +       +N DR+ +A
Sbjct: 232 AIH--------LP---LQVPSRYADMYKTLIPNNADRQKYA 261


>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
          Length = 358

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 126/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG ++I TP++D LA  G+ L+ +Y  P CTPSR   +TG+Y  R G+    +
Sbjct: 34  GWNDVGFHG-SEIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQLMTGRYQIRTGLQHQII 92

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ L+E GY TH++GKWH+G  +++ LP +RGF +  GY  G   
Sbjct: 93  WPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRGFQSFFGYLTGSED 152

Query: 139 YLTYNDS-----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+        ++ T  A+ L   R+ +  A   S +Y T+  T+++ H+I  H   +P
Sbjct: 153 YYTHKRCSPIAPLNVTRCALDL---RDGDAVALNYSGRYSTELLTERATHIITQHTPDQP 209

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   AVH             LQVPD      +F  I +P RR +A
Sbjct: 210 LFLYVALQAVHAP-----------LQVPDHYIAPYSF--IQDPHRRRYA 245


>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
          Length = 532

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG + I TP++DALA  G+ L+ +Y  P CTPSR+  L+G+Y    G+    +
Sbjct: 55  GWNDVGFHGSS-IRTPHLDALAAGGVRLDNYYVQPLCTPSRSQLLSGRYQIHTGLQHQII 113

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y         D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +P
Sbjct: 174 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATALIANHPPEKP 230

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +RR +A
Sbjct: 231 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIEDKNRRHYA 266


>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
           (N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
           intestinalis]
          Length = 522

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 125/228 (54%), Gaps = 22/228 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVG+H   DI TPNI+ LA +G++L  +Y  P CTPSR+  +TG+Y    G+  + +
Sbjct: 38  GFNDVGYHNP-DIYTPNINKLAKDGVILESYYVQPICTPSRSQLMTGRYQIHTGLQHSVI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +PV E +LPQ LKE GY+TH +GKWH+G  K+E LP +RGFD   GY+ G   
Sbjct: 97  FAPQPNCLPVDEIILPQKLKEAGYTTHAVGKWHLGFYKKECLPTSRGFDTFYGYYCGAED 156

Query: 142 YNDSIHETDFAVGLDARR-------NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y       +F  G   RR       +  R   + +  Y +  + D++V +IKSHN S PL
Sbjct: 157 YYTKQVHANFHFGNKTRRVSGFDFHDNSRTEWEANGTYSSYLYRDRAVRIIKSHNSSIPL 216

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           F+ +   +VH         P   LQVP   +  + + HI +  RR F+
Sbjct: 217 FMYLPFQSVH--------FP---LQVP--AKYIKRYRHIKDRKRRTFS 251


>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
          Length = 438

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 71/189 (37%), Positives = 114/189 (60%), Gaps = 8/189 (4%)

Query: 56  LPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
           +P CTPSR AFL+G++P R G+   P+ A   + + + + LLP+YL++LGY+THL+GKWH
Sbjct: 1   MPVCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWH 60

Query: 115 IGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNM-ERYAPQMSSKYL 173
           +G      +P  RGFD  +GY+NG + Y +   E +  VG D  R + + +  +    Y+
Sbjct: 61  VGYLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEENEQVGYDLHRIVGDNHTVEYRYDYM 120

Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
           T+  T+++ ++I SHN  +P++LQ+ H A H   A  A      ++V D EE + T  +I
Sbjct: 121 TNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEA------MEVYDWEETNATLGYI 174

Query: 234 SNPDRRLFA 242
            + +RR FA
Sbjct: 175 QDVNRRKFA 183


>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
          Length = 518

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/184 (41%), Positives = 108/184 (58%), Gaps = 3/184 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GWNDV +H  + +  PN+  LA  G++L   Y    CTPSRAAFLTG+YP   G+ +  V
Sbjct: 41  GWNDVSWH-NSIVQMPNLQDLAERGVILEHAYAQEKCTPSRAAFLTGRYPINTGMQEEVV 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +P+  KLLP YLK+ GY+TH+IGKWH+G   E   P  RGFD+H G++N  ++
Sbjct: 100 VATQMSGLPIEFKLLPSYLKDQGYATHMIGKWHVGYCDEAYTPTRRGFDSHYGFYNSGIS 159

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y++        VG D R ++         KY T  FTDQ+  +I +H+ + P+FL + + 
Sbjct: 160 YSNYSSTEGTDVGYDYRDDLALNL-AAEGKYTTTDFTDQAKTLIDNHDQTNPMFLYMAYN 218

Query: 202 AVHT 205
           A HT
Sbjct: 219 APHT 222


>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
          Length = 492

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGFHG   I TP++DALA  G++L+ +YT P  TPSR+  LTG+Y  R G+    +
Sbjct: 15  GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTGRYQIRTGLQHQII 73

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 74  WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 133

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y         D+++ T  A+     R+ E  A    + Y T+ FT +++ +I +H   +P
Sbjct: 134 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 190

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 191 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 226


>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 631

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 125/230 (54%), Gaps = 21/230 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           G++DV FHG   IPTPN+DA+A +G++LNRHY   + TPSR AF TGK P R G++  P+
Sbjct: 53  GYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGKLPLRIGLNEGPI 112

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
             GV    + +  ++LP Y+++LGY THLIG+W +G  KE LLP NRGFD H G ++   
Sbjct: 113 LKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRGFDTHYGPYSDSA 172

Query: 141 TYNDSIHETDFAVGL------DARRNMERYAPQMS--SKYLTDFFTDQSVHVIKSHNHSR 192
           +Y+  +   D    L      D  R+ +   P  S    Y+TD +  +   +I+     +
Sbjct: 173 SYSSHLSREDAWKSLSVPPAYDLHRDGK---PDFSGFGSYVTDLYKGRFERIIEQRR--K 227

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLF+ ++H   H  + G       L Q P        F HI +  RR +A
Sbjct: 228 PLFIVLSHQTPHGASFGP------LHQPPPRTNRASQFLHIKDRSRRSYA 271


>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
          Length = 382

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 115/221 (52%), Gaps = 16/221 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GWNDVGF     + TP++D LA  G++LN  Y  P C+PSR  F++G +P+  G+ D  +
Sbjct: 36  GWNDVGFRNPQ-VLTPHLDKLAKAGVILNSSYVQPLCSPSRNCFMSGYFPYHTGLQDGVI 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+    LPQ LKELGYSTH +GKWH+G    +  P  RGFD  VGY+ G   
Sbjct: 95  RPASPGFVPIKFTFLPQKLKELGYSTHAVGKWHLGFCNLKYTPTYRGFDTFVGYYIGAED 154

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y     E D   G D R N   Y  +   KY T  F +++V +IKSH+   PL+L +   
Sbjct: 155 YYKHTREYDKFSGYDLRFNTSVYT-EAKGKYSTRVFAERAVDIIKSHDTDTPLYLYLPFQ 213

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             L+VP   EN   + HI +  RR + 
Sbjct: 214 AVHAP-----------LEVPPEYEN--LYKHIHDLPRRTYC 241


>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
 gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
          Length = 583

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 93/232 (40%), Positives = 125/232 (53%), Gaps = 27/232 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++DV F G N+  TPNIDALAY+G++LN  YT P CTPSRAA LTGKYP   G+     
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G      +P+ E  +    +  GY T LIGKWH+G ++    P  RGFD H+GY
Sbjct: 106 VNDQPWG------LPLNETTMADIFRGNGYRTSLIGKWHLGMSQRNYTPTLRGFDYHLGY 159

Query: 136 WNGYLTYNDSIHE--TDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
              Y+ Y +  +E  +    G D R N++     +  KY+TD  TD +V  I  H   N+
Sbjct: 160 LGAYVDYYNQSYEQVSKGYRGHDFRENLKPNHEHV-DKYVTDILTDAAVREIDDHAAKNN 218

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + H A H   A N   P   +Q P  E +   F +I +   R +A
Sbjct: 219 SKPLFLLLNHLAPH---AANDADP---MQAPADELSG--FEYIRDETHRYYA 262


>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 88/221 (39%), Positives = 120/221 (54%), Gaps = 18/221 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV +H   DI  PN+  LA +G++ N+ YT PTCTPSRAA +TG YPF+ G    + 
Sbjct: 38  GWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQML 96

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GYL 140
             +    +P+  KLLP+ LKE+GYSTH++GKWH+G  K+E LP NRGFD+H G W  G  
Sbjct: 97  FNLHPSGLPLEFKLLPEKLKEIGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGIWTLGVG 156

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y+        + G D R N      Q S+ YL     D++ H++ +H    PLFL  T 
Sbjct: 157 DYDKMNGVLSPSKGYDFRDNTG--VVQKSNGYLALMLGDRAEHIVNTHYPGTPLFLAFT- 213

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
                       +P   L +P  EE +  +A I +   R F
Sbjct: 214 ----------LDIPAKHLAIP--EEYENKYADIEDSRTRHF 242


>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
 gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
          Length = 598

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 22/227 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+G+HG + I TPN+D LA  G+ L  +Y  P C+PSR   +TG+Y  RYG+  + +
Sbjct: 133 GWNDIGYHG-SVIRTPNLDRLAAEGVKLENYYVQPLCSPSRCQLMTGRYQIRYGLQHSLI 191

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ E  LPQ LKE GYSTH++GKWH+G  K++  P +RGFD   GY  G   
Sbjct: 192 WPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKQDYTPTHRGFDTFYGYLTGAED 251

Query: 139 YLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y T+        +     GLD  R+  R     +  Y T  F ++++ +I   + ++P+F
Sbjct: 252 YWTHRQKGGLPGQPQTWSGLDL-RDQNRPVTDQNGTYSTHLFANKAIEIIAQQDKNKPMF 310

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L ++  AVH             LQ P  EE+   ++HIS+ +RR++A
Sbjct: 311 LFLSFQAVHDP-----------LQAP--EEDISRYSHISDTNRRVYA 344


>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
          Length = 515

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 86/222 (38%), Positives = 122/222 (54%), Gaps = 15/222 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV +H    IPTPNID+   NGI LNR Y  PTC+P+RA+ LTG + F +G+  P  
Sbjct: 29  GWGDVSYH-NGFIPTPNIDSFVSNGIELNRFYANPTCSPTRASLLTGLHIFNHGVIRPFM 87

Query: 83  AGVAKAVPVTE--KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
              A+   + E  K++P+Y KE GY T L GKWH+G +KEE LP NRGFD+  G+  G +
Sbjct: 88  NPSAEQTGLPEHLKIMPEYFKEAGYQTALSGKWHLGMHKEEYLPTNRGFDSSYGHMLGGI 147

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y D +H          R +  R    ++   Y T+   D+++++IK+ +  RPLFL + 
Sbjct: 148 GYYDHVHTN--------RMDWHRDGVSLNEDGYSTELIADEAINIIKNKDDDRPLFLYVA 199

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTF-AHISNPDRRL 240
             A HT      +     L + D  E DR + A+IS  D  +
Sbjct: 200 FNAPHTPIEAPEEDVNNFLYIED--ELDRNYAANISKLDIEI 239


>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
          Length = 522

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 87/226 (38%), Positives = 125/226 (55%), Gaps = 20/226 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVG+H  N I TP++DAL+  G+ L  +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 45  GWNDVGYHDSN-IFTPHLDALSAQGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 103

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ EKLLP+ L+E GY TH++GKWH+G  ++E LP  RGFD   GY  G   
Sbjct: 104 WPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 163

Query: 139 YLTYNDSIHETDFAVGLDAR--RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y T+   +H     V   A   R+ E  A    + Y T+ FT+++V++I +H   +PLFL
Sbjct: 164 YYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERAVNLIANHPAQKPLFL 223

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +   +VH             LQVP  EE  + +  I N +R+ +A
Sbjct: 224 YLALQSVHEP-----------LQVP--EEYLQPYDFIQNKNRQHYA 256


>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
          Length = 339

 Score =  143 bits (361), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 89/244 (36%), Positives = 128/244 (52%), Gaps = 31/244 (12%)

Query: 10  AKAVPVTEKLLPQ---------GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCT 60
           A  VP +E   P          GWNDV  HG   IPTP+IDA+A++G+ L  ++  P CT
Sbjct: 22  ADGVPGSEAKRPNIVFIVADDLGWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCT 81

Query: 61  PSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE 120
           P+R+ FL+G++    GI  P   G A  + ++  LLP YLK+LGY T  +GKWH+G N E
Sbjct: 82  PTRSTFLSGRHVIHTGIYMPFAQGTALRLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVE 141

Query: 121 ELLPFNRGFDNHVGYWNGYLTY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFT 178
           + LP  RGFD ++GYW+G   Y  +D+    DF  G +        A + ++ Y T  F 
Sbjct: 142 KALPTGRGFDEYLGYWSGAEDYYTHDTHGGYDFQDGTEC-------AIKYNNTYSTYIFA 194

Query: 179 DQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDR 238
           +++V+ I   +  +PLFL      VH         P   L+ P   E    F+HI N +R
Sbjct: 195 ERAVNTILEADPEQPLFLYTAFQNVH--------WP---LEAP--AEYVARFSHIPNSER 241

Query: 239 RLFA 242
           +  A
Sbjct: 242 QYVA 245


>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
          Length = 481

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 119/221 (53%), Gaps = 20/221 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFH   DI TPNID LA  G++LN HY  P C+PSRAAF++G YPF+ G+  + +
Sbjct: 37  GWNDIGFHNP-DIITPNIDKLAREGLLLNHHYVQPLCSPSRAAFMSGYYPFKTGLQHSVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+   +LPQ LKELGY+TH++GKWH G       P  RGFD+  GY+     
Sbjct: 96  LENQPVCLPLNITILPQKLKELGYATHIVGKWHNGFCSWNCTPTYRGFDSFFGYYGAMED 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y      T    G    RN        +  Y T  FTD +  +I+ HN S+PLFL + + 
Sbjct: 156 Y-----YTHVIRGFLDYRNNTTPVWTDNGTYSTLRFTDVATDIIERHNQSQPLFLYLAYQ 210

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AV+           G ++VP   E    + +I + +RR F+
Sbjct: 211 AVY-----------GPIEVPAKYE--AMYPNIKSENRRKFS 238


>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 87/222 (39%), Positives = 122/222 (54%), Gaps = 18/222 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H   DI  PN+  LA +G++  + YT PTCTPSRAA +TG YPF+ G    +
Sbjct: 37  MGWNDVHWHNP-DIAMPNLMDLADDGVIFEQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 95

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+ LKE+GY+TH++GKWH+G  K+E LP NRGFD+H G W  G 
Sbjct: 96  VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 155

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             Y+        + G D R N+E   P+ S  YL     D++  ++ +H+   PLFL  T
Sbjct: 156 GDYDKLNGVLSPSAGYDFRDNLE-VVPK-SDGYLALMLGDRAEEIVNNHSPETPLFLVFT 213

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
                        +P   L++P  EE +  +A I +   R F
Sbjct: 214 -----------LDIPAKHLEIP--EEYEELYADIEDDRTRQF 242


>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/183 (41%), Positives = 106/183 (57%), Gaps = 8/183 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGF    D+ TPNID LA++G+VLN  Y +P CTPSR +F+TG Y F+ G+    +
Sbjct: 36  GWNDVGFRNP-DVLTPNIDKLAHSGMVLNSSYVMPVCTPSRNSFMTGHYAFKSGLQHLAI 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               A   P+    LPQ LKELGY+TH IGKWH+G  K E  P  RGFD   G++NG   
Sbjct: 95  LPKQAACAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFFGFYNG--- 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
             +  +    A G D R N  R     + +Y T  +  ++  +IK H+ S+P+++ +   
Sbjct: 152 -QEDYYTLSVAGGKDFRDN--RTPVNATGEYSTFLYARRAESIIKEHDASKPMYMYLPFQ 208

Query: 202 AVH 204
           +VH
Sbjct: 209 SVH 211


>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
          Length = 483

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/183 (41%), Positives = 106/183 (57%), Gaps = 8/183 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGF   + + TPNID LA +G++LN  Y +P CTPSR +F+TG+Y F+ G+    +
Sbjct: 36  GWNDVGFRNPS-VLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGQYAFKSGLQHIVI 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               A   P+    LPQ LKELGY+TH IGKWH+G  K E  P  RGFD   GY+NG   
Sbjct: 95  LPQQATCAPLNNTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFYGYYNGAED 154

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    +    A G D R N  R     + +Y T  +  ++  +IK H+ S+P+++ +   
Sbjct: 155 Y----YNLSIAGGKDFRDN--RTPVNATGEYSTILYARRAESIIKDHDASKPMYMYLPFQ 208

Query: 202 AVH 204
           +VH
Sbjct: 209 SVH 211


>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 476

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 89/222 (40%), Positives = 121/222 (54%), Gaps = 23/222 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GWNDVGFHG ++I T N+D LA +G+ L R Y    CTP+RAAFLTG++PFRYG+    V
Sbjct: 46  GWNDVGFHG-SEIKTTNLDKLAVSGVRLERFYVKAMCTPTRAAFLTGRHPFRYGMSAINV 104

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ EK + + LKE GY T ++GKWH+G  +E  LP +RGFD H G++   + 
Sbjct: 105 TPWSETGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGID 164

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQITH 200
           Y    H++    GLD  RN     P     Y TD     +V +I +H+ H  PLFL I  
Sbjct: 165 Y--FTHKS--GDGLDWHRNNN---PVYIEGYSTDLIAQDAVQLINNHDYHKNPLFLYIAF 217

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A H      A+         D+E+    +  I +  RRLFA
Sbjct: 218 NAPHIPLQAKAE---------DLED----YLTIEDEQRRLFA 246


>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 497

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 30/232 (12%)

Query: 23  GWNDVGFHGEND---IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP----FRY 75
           G+NDVG+HG      + TPN+D LA  G+ L ++Y  P C+P+R+  L+G+Y      +Y
Sbjct: 46  GYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQLLSGRYQIHTGLQY 105

Query: 76  GIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           G+  P        +P+ E  LPQ LKE  Y+TH++GKWHIG  K+   P  RGFD++ GY
Sbjct: 106 GVIRPAQP---HCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTERGFDSYFGY 162

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKSHNH 190
            +G   Y    H   F +G    + ++  A      Q   +Y T  FT +++ VI +H  
Sbjct: 163 LSGAEDYYS--HSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKAIDVINNHER 220

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           S+PLFL + + AVH+            LQVP   E    +A+I++  RR +A
Sbjct: 221 SKPLFLYLAYQAVHSP-----------LQVPSKYE--EPYANITSSARRAYA 259


>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 480

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 80/180 (44%), Positives = 107/180 (59%), Gaps = 5/180 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H   DI  PN+  LA +G++ N+ YT PTCTPSRAA +TG YPF+ G    +
Sbjct: 20  MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 78

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+  KE+GYSTH++GKWH+G  K+E LP NRGFD+H G W  G 
Sbjct: 79  VFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGIWTLGV 138

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             Y+        + G D R NM    P+ S+ YL     D++ H++ +H    PLFL  T
Sbjct: 139 GDYDKMNGVLSPSAGYDFRDNM-GVVPK-SNGYLALMLGDRAEHIVNNHYPGTPLFLAFT 196


>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 533

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/184 (42%), Positives = 105/184 (57%), Gaps = 11/184 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGFHG   IPTP +DALA +G+ L  ++T P C+PSRA+ L+G++   +GI  P  
Sbjct: 67  GFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHAIHHGIYMPFA 126

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
            G A  + +  +LLP+ L+ LGY TH +GKWH+G N    LP  RGFD+ +GYW+G   Y
Sbjct: 127 QGTAYHLSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRGFDSFLGYWSGAEDY 186

Query: 143 --NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
             +D     DFA       N E  A      Y    FTD++V V+ S   S P FL +  
Sbjct: 187 FAHDCAGAYDFA-------NNETTAWAYDGVYSAYSFTDRAVDVVAS--ASTPYFLYVAW 237

Query: 201 AAVH 204
             VH
Sbjct: 238 QNVH 241


>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/183 (40%), Positives = 106/183 (57%), Gaps = 8/183 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVGF    D+ TPNID LA +G++LN  Y +P CTPSR +F+TG Y F+ G+    +
Sbjct: 36  GWNDVGFRNP-DVLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGHYAFKSGLQHLAI 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               A   P+    LPQ LKELGY+TH IGKWH+G  K E  P  RGFD   G++NG   
Sbjct: 95  NPQQATCAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFFGFYNGQED 154

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    +    A G D R N  +     + +Y T  ++ ++  +IK H+ S+P+++ +   
Sbjct: 155 Y----YTLSVAGGKDFRDN--KVPVNATGEYSTFLYSRRAESIIKEHDASKPIYMYLPFQ 208

Query: 202 AVH 204
           +VH
Sbjct: 209 SVH 211


>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/221 (37%), Positives = 122/221 (55%), Gaps = 13/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVG+HG + I TP+IDALA  G+ L+ +YT   CTPSR+  +TG+Y    G+    +
Sbjct: 33  GWSDVGYHG-SVIKTPHIDALASEGVKLDNYYTSLLCTPSRSQLMTGRYEIHTGLQHRTI 91

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E +LPQ LK+ GY+TH++GKWH+G  ++E LP NRGFD  +G++     
Sbjct: 92  DMMQPLCLPIDETILPQKLKDRGYATHMVGKWHLGFYRQECLPNNRGFDTFMGFYQAMGD 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y      T    G D RR+ +  A + + +Y T  F D++  +I  HN   PLFL ++  
Sbjct: 152 YYYHNVSTGKFNGWDFRRDNDVIAERYAGQYSTHVFADEARDIISKHNPDVPLFLFLSFQ 211

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           A+H         P   LQVP    +       ++ DRR +A
Sbjct: 212 AIH--------FP---LQVPSRYADIYNTLIPNSADRRTYA 241


>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
          Length = 519

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 126/229 (55%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW D+G+HG ++I TPN+D L+  G+ L  +Y  P CTPSR   +TG+Y    G+    +
Sbjct: 41  GWYDIGYHG-SEIRTPNLDKLSAGGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQII 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ +KE GY+TH++GKWH+G  K++ LP  RGFD ++GY  G   
Sbjct: 100 WPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFDTYLGYLTGSED 159

Query: 139 YLTY-----NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+     + S++ +  A+ L   R+ E  A      Y T+  + +++ +I+ H   +P
Sbjct: 160 YFTHFRCYQSPSLNLSRCALDL---RDGEEVATGYKGVYSTELLSQRAISIIERHISQKP 216

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LF+ +   AVH             LQVP  E     ++ I + +RRL+A
Sbjct: 217 LFMYVALQAVHAP-----------LQVP--ERYVTPYSFIKDTNRRLYA 252


>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
          Length = 176

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 70/140 (50%), Positives = 90/140 (64%), Gaps = 16/140 (11%)

Query: 12  AVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKY 71
            V VT+ L   GWNDV FHG + IPTPNIDALAY GI+LNRHYT P CTPSRA+ ++GK+
Sbjct: 33  VVIVTDDL---GWNDVSFHGSSQIPTPNIDALAYQGIILNRHYTPPLCTPSRASLMSGKH 89

Query: 72  PFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLP 124
           P   G+       + P G G      + +KL+P+Y +E GY T L+GKWH+G  ++   P
Sbjct: 90  PINVGMQHHVIESNEPWGLG------LDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTP 143

Query: 125 FNRGFDNHVGYWNGYLTYND 144
             RGFD+H GY   Y+ Y D
Sbjct: 144 TRRGFDSHFGYIGPYIDYWD 163


>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
          Length = 491

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 28/230 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DVG+HG + I TP +DAL   G+ L R+YT P CTPSR+  L+G+Y    G+    +
Sbjct: 12  GWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 70

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ EKLLP+ L+E GY TH++GKWH+G  ++E LP +RGFD +     GYL 
Sbjct: 71  WPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFDTYF----GYLL 126

Query: 142 YNDSIHETDFAVGLDAR---------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
            ++  +  D  V + A+         R+ E  A    + Y T+ FT++++ VI +H   +
Sbjct: 127 GSEDYYTHDRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVIANHKTEK 186

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL +   +VH             L+VP  E+  + ++ I +  RR +A
Sbjct: 187 PLFLYLAFQSVHEP-----------LEVP--EKYVKPYSSIKDVKRRHYA 223


>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 74/184 (40%), Positives = 108/184 (58%), Gaps = 9/184 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DVG+H +++I TP++D LA +G  L + Y  P C+P+RAA +TG+YP R+G+   V 
Sbjct: 48  GWGDVGWH-DSEIKTPHLDKLAASGTRLEQFYVQPVCSPTRAALMTGRYPMRHGLQVGVV 106

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
              A+  +P+ E+ LPQ LKE+GY T + GKWH+G  + E LP +RGFD+  G++NG L 
Sbjct: 107 RPWAQYGLPLNERTLPQALKEVGYETAICGKWHLGHFQPEYLPTHRGFDHQYGHYNGALD 166

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   I +  F    D R N +         Y T     ++  +I  H+ S+PLFL +   
Sbjct: 167 YFTHIRDGGFDWHRDDRVNRD-------EGYSTHLIGREATRIIGHHDTSKPLFLYVPFN 219

Query: 202 AVHT 205
           AVH 
Sbjct: 220 AVHA 223


>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
 gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
          Length = 491

 Score =  139 bits (351), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 121/221 (54%), Gaps = 20/221 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVGFHG   I TPNID LA NG++L+ +Y  P CTP+RA+ +TGKYP   G+    +
Sbjct: 36  GWSDVGFHGSK-IQTPNIDRLAANGVILDNYYVQPVCTPTRASLMTGKYPIHTGLQHGII 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    +P+   LLPQ L++ GYSTH++GKWH+G    E  P  RGFD   G+++G   
Sbjct: 95  HNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWESTPTYRGFDTFYGFYSG--A 152

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            N   H  D  + L   R+ E      +  Y    FT ++  ++++H+ S PLF+ +   
Sbjct: 153 ENHYTHVQDHYLDL---RDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFMYMAFQ 209

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            VH+            +Q P  E  DR ++ I +P RR +A
Sbjct: 210 NVHSP-----------VQAPK-EYIDR-YSFIKDPLRRTYA 237


>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 533

 Score =  139 bits (351), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 121/227 (53%), Gaps = 27/227 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GW+DV FHG   IPTPN+D LA +G++LN +Y    CTPSRAA +TG YP   G+ D  +
Sbjct: 37  GWDDVSFHGSPQIPTPNMDVLAGDGVILNNYYVQHFCTPSRAALMTGLYPIHNGLQDFVI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+  K++P++ K++GY TH+IGKWH+G  ++E  P  RGFD+  GY+NG   
Sbjct: 97  DVAQPYGLPLYLKVMPEFFKDMGYETHMIGKWHLGYFRKEYTPTYRGFDSFYGYYNGAED 156

Query: 142 -YNDSIHET-----DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            YN SI +      +   G   ++ +E Y        L       S+    S   S+PLF
Sbjct: 157 YYNHSITKVISQSYNIRQGSVTKKRIENYIKNTELVLL-------SLTFYLSILFSQPLF 209

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L + + +VH           G L+ P  EEN   F +I   +R +FA
Sbjct: 210 LYLAYQSVH-----------GPLEAP--EENIMKFPYIGEENRTIFA 243


>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
 gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
          Length = 478

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 75/189 (39%), Positives = 109/189 (57%), Gaps = 9/189 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+G+HG   I TPN+D LA  G+ L  +Y  P C+PSR   +TG+Y   YG+  + +
Sbjct: 14  GWNDIGYHGSF-IKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSVI 72

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E  LPQ LKE+GYSTHL+GKWH+G  ++E LP  RGFD   G+  G   
Sbjct: 73  THDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGFDTFYGFLTGGED 132

Query: 142 Y----NDSIHETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y      +++ TD +   GLD R   E    Q +  Y T  F  +++ +I  H+ ++P+F
Sbjct: 133 YWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQ-NGTYSTHLFQRKAIDIIAHHDRNKPMF 191

Query: 196 LQITHAAVH 204
           L ++  AVH
Sbjct: 192 LYLSFQAVH 200


>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
 gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
          Length = 502

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 87/223 (39%), Positives = 116/223 (52%), Gaps = 25/223 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG +DI TPNID LA  G  L ++Y  P CTP+RAAF+TG+YPFRYG+ T V 
Sbjct: 71  GWKDVGFHG-SDIKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGRYPFRYGMQTAVI 129

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    + + + LLP+ LKE GY+T   GKWH+G  K    P  RGFD+  G   G + 
Sbjct: 130 PQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGFDSFYGALLGEI- 188

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
             D           D  RN +    +     L   F D++V VI  H+ ++PLFL +   
Sbjct: 189 --DHFTHKSANGNADWYRNNDALEEEGFDNVL---FADEAVRVINEHDQAKPLFLYLAFT 243

Query: 202 AVHTGTAGNAKLPTGLLQVPD--MEENDRTFAHISNPDRRLFA 242
           + HT             Q P   +E N    +HI++  RR +A
Sbjct: 244 SPHTP-----------FQAPKEFLERN----SHIADESRRNYA 271


>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
 gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
          Length = 520

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 123/226 (54%), Gaps = 20/226 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+G+HG + I TPN+D LA  G+ L  +Y  P CTPSR+  +TG+Y   +G+  + +
Sbjct: 53  GWNDIGYHG-SVIRTPNLDRLAAEGVKLENYYIQPICTPSRSQLMTGRYQIHFGLQHSII 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ E  LPQ LKE GYSTH++GKWH+G  KEE  P +RGFD   G+  G   
Sbjct: 112 WPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSEN 171

Query: 139 YLTYNDSIHETDFAVGLDA--RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           + ++ +S     F  G +    R+ +R     +  Y T  F  +++ VI   + S+P+FL
Sbjct: 172 HYSHRNSGGMPGFRPGWNGLDLRDQDRPVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMFL 231

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +   AVH             LQ P  ++    + HI++ +RR++A
Sbjct: 232 YLPFQAVHAP-----------LQAP--QKYISMYRHINDYNRRMYA 264


>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
 gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
          Length = 353

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 121/217 (55%), Gaps = 19/217 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV ++  N +  PN+  LA  G++ N+ Y    CTPSR A LTGK+P+R G+  P+ 
Sbjct: 12  GWSDVSWNNPN-VVMPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKFPYRLGMQRPIR 70

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
              A  +P+ E+LLPQ LK+LGY+TH+IGKWH+GC K E  P  RGFD+  GY  G   Y
Sbjct: 71  HKKAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFDSFYGYHRGSQDY 130

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
               H +D   GLD        + Q +  Y T+ F  ++ ++I  H+ + PLFL +    
Sbjct: 131 --YTHMSDG--GLDFWEGKTAISDQ-NGVYSTESFATRAENIISQHDPNTPLFLYLPLQP 185

Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
           VHT      ++P+  LQ         TF+ I + +R+
Sbjct: 186 VHT----PHQVPSSYLQ---------TFSTIQDHNRK 209


>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
 gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
          Length = 495

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 88/223 (39%), Positives = 114/223 (51%), Gaps = 24/223 (10%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW DVGFHG +DI TPN+D LA  G  L ++Y+ P CTPSRAA LTG+YP RYG+ T V
Sbjct: 58  QGWKDVGFHG-SDIRTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLV 116

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                K  +P  E LLPQ LKE GY T ++GKWH+G    +  P  RGFD   G   G +
Sbjct: 117 IPSAGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEI 176

Query: 141 TY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y   S H       +D  RN +         Y+T      +V +I+ HN   PLFL + 
Sbjct: 177 DYFTHSAHGK-----VDWYRNNQLIK---EEGYVTTLLGQDAVKLIEKHNPKTPLFLYLA 228

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             A H              Q P    +   +  I++P+RR +A
Sbjct: 229 FTAPHAP-----------YQAPQKYLDQ--YKTIADPNRRAYA 258


>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 548

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 126/230 (54%), Gaps = 26/230 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG-----I 77
           GW+DV FHG + IPTPNID LA +GI L+ +Y  P CTPSRAA +TG YP R G     I
Sbjct: 48  GWDDVSFHGSSQIPTPNIDVLAADGITLHNYYVQPMCTPSRAALMTGLYPIRTGMQHWVI 107

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            +P   G    +P+  KL+P++LK+LGYSTHL+GK      K+ ++  NR   N      
Sbjct: 108 RSPEPWG----LPLELKLMPEHLKDLGYSTHLVGKVLFDL-KKFIVSVNRLCINEST--E 160

Query: 138 GYLTYNDSIHETDFAV-----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
              T+  ++    + V     GLD R   E +    + +Y T  FTD+++ +I+ HN ++
Sbjct: 161 VCHTFVSAVTLCIYFVYKSHAGLDFRNGEEPFHND-TGQYATTLFTDRAISIIEQHNQTK 219

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL ++H A H  T          LQ PD  EN   F +I   DR ++A
Sbjct: 220 PLFLYLSHLAPHGATHDEP------LQAPD--ENVEKFDYIGEEDRTIYA 261


>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 562

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 130/234 (55%), Gaps = 33/234 (14%)

Query: 23  GWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           G+ND+G+H     +D+ TP +D+LA  G++L  +Y  P C+P+R   LTG    RY I T
Sbjct: 45  GFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSPTRGQLLTG----RYQIHT 100

Query: 80  PVGAGVAKA-----VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            +  G+ +A     +P+   LL Q L++ GY T+++GKWH+G  +EE LP+NRGF N  G
Sbjct: 101 GLAHGIIRAAQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGFFREEYLPWNRGFQNFFG 160

Query: 135 YWNG----YLTYNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           + NG    +  Y+    +T    G D   +  RY P  ++  +Y T+ F  +S  +I  H
Sbjct: 161 FLNGGVNHFTRYHCEPKKTRRFCGYDMIDS--RYGPTNATYGEYSTNLFIRKSKEMIDKH 218

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           N  +P+FL ++  AVH           G LQVP+  +  + F HI + +RR++A
Sbjct: 219 NKQKPMFLYLSLQAVH-----------GPLQVPN--QYLKRFKHIRDKNRRIYA 259


>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 519

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 120/222 (54%), Gaps = 17/222 (7%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
            GWND+G+H  + I +PNI+AL  +G+ L  +Y  P CTPSR+  ++G+Y    G+  + 
Sbjct: 32  HGWNDIGYH-SHIIRSPNINALCNDGVRLENYYIQPGCTPSRSQLMSGRYQIHTGLQHSV 90

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+ E  L + LKE+GY+THL+GKWH+G      LP  RGFD+  GY  G  
Sbjct: 91  IRNDQPNCLPLDEVTLAEKLKEVGYATHLVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQE 150

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y   IH+     G D +R+    + Q    Y T  FT ++ ++IKSH+ S PLFL +++
Sbjct: 151 DYYKHIHDG----GYDLKRHETDVSKQYQGDYTTHVFTSEAQNIIKSHDPSTPLFLYMSY 206

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +VH             LQVP+   N      I + DRR+ A
Sbjct: 207 QSVH----------ANYLQVPEHYSNMYNGV-IDDEDRRIVA 237


>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
          Length = 521

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 85/227 (37%), Positives = 118/227 (51%), Gaps = 22/227 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DVG+HG + I TP +DAL+  G+ L R+Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 44  GWQDVGWHG-SQIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGRYQIHTGLQHEII 102

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLP+ LKE GY TH++GKWH+G  + E LP  RGFD + GY  G   
Sbjct: 103 WPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGFDTYFGYLLGSED 162

Query: 142 YNDSIHETDFA------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y    H             LD  R+ E+ A    + Y T+ FT ++  +I +H   +PLF
Sbjct: 163 YYSHEHCVPIVSKNVTRCALDL-RDGEKIADGFKNMYSTNVFTQRAQDLIANHQPEKPLF 221

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +   +VH             LQVP  E+    ++ I +  RR +A
Sbjct: 222 LYLALQSVHEP-----------LQVP--EKYVEPYSFIKDEKRRKYA 255


>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 537

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 121/229 (52%), Gaps = 24/229 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DVG+H  + I TPN+D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+    +
Sbjct: 51  GWFDVGYH-NSTIKTPNLDLLASRGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHFVI 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A     +P+ E  LPQ LKE GY+THL+GKWH+G  K E +P  RGFD+  GY +G   
Sbjct: 110 IAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGYLSGMQD 169

Query: 142 YNDSIHETDFA--------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y        F         +G+D   N  R A + +  Y    FT+++  VI+ HN ++P
Sbjct: 170 YWTHFRSGSFPGFPEGNHWLGIDFWDN-NRVAWEYTGNYSQFVFTERAQRVIQQHNPNQP 228

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH           G LQVP  E+  + +AH  +  R+ +A
Sbjct: 229 LFLYLPLQSVH-----------GPLQVP--EKYMKPYAHFQDVGRQTYA 264


>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
          Length = 572

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 76/188 (40%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 58  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 117 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SHN  RPLFL 
Sbjct: 177 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHNPQRPLFLY 233

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 234 VAFQAVHT 241


>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
           caulinodans ORS 571]
 gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
           caulinodans ORS 571]
          Length = 490

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 121/221 (54%), Gaps = 22/221 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G+ DVGFHG +DI TPN+D LA  G  L + YT P CTP+RAA +TG+YP RYG+ T V 
Sbjct: 58  GFADVGFHG-SDIKTPNLDKLAATGATLGQFYTQPMCTPTRAALMTGRYPLRYGLQTGVI 116

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +G +  +   E LLPQ LK +GYST LIGKWH+G  K++  P  RGFD   G   G + 
Sbjct: 117 PSGASYGLATDEFLLPQALKSVGYSTALIGKWHLGHAKQDFWPRQRGFDYFYGPLVGEID 176

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           +    HE    V  D  R+ ++    +   Y T+ F   +  +I +H+   PLFL +   
Sbjct: 177 HYK--HEAHGVV--DWYRDNKQV---VEEGYDTELFGTDAARLIGAHDPKTPLFLYLAFT 229

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           A HT             Q P     DR + +I++P RRL+A
Sbjct: 230 APHTP-----------FQAP-QAYVDR-YPNITDPARRLYA 257


>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
          Length = 514

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 73/192 (38%), Positives = 110/192 (57%), Gaps = 15/192 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DVG+HG + I TP +DAL   G+ L R+YT P CTPSR+  L+G+Y    G+    +
Sbjct: 35  GWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ EKLLP+ LK+ GY TH++GKWH+G  ++E LP  RGFD +     GYL 
Sbjct: 94  WPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYF----GYLL 149

Query: 142 YNDSIHETDFAVGLDAR---------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
            ++  +  D  V + A+         R+ E  A    + Y T+ FT++++ +I +H   +
Sbjct: 150 GSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIANHKTEK 209

Query: 193 PLFLQITHAAVH 204
           PLFL +   +VH
Sbjct: 210 PLFLYLAFQSVH 221


>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
 gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
          Length = 519

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 121/221 (54%), Gaps = 13/221 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV ++G   I TPNID LA +G+ +NR Y+ PTC+P+RAA  TG    + GI  P+ 
Sbjct: 31  GWGDVSYNG-GPINTPNIDKLADDGLQMNRFYSAPTCSPTRAALFTGINSLKNGIIRPLN 89

Query: 83  AGVAK--AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
              A+   +P+  K+LP+YLKE+GY T L GKWH+G   +E LP NRGF++  G+  G +
Sbjct: 90  NPTAERYGLPLKHKILPEYLKEIGYQTALSGKWHLGMFSDEYLPRNRGFESTYGHLGGGI 149

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D       +  LD  RN E         Y T    D+++ +I++ N+  PLFL +  
Sbjct: 150 GYFDHA----LSGRLDWHRNGEIL---YEDGYSTTLIADEAIRIIENKNNETPLFLYVAF 202

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTF-AHISNPDRRL 240
            A HT      K+   L  + D +E  R + A+I   DR +
Sbjct: 203 NAPHTPIQAEEKIINNLSDISDKKE--RVYAANIITLDREI 241


>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
 gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
          Length = 377

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/217 (37%), Positives = 124/217 (57%), Gaps = 19/217 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV ++    + TPN+  LA  G++ N+ Y  PTC+PSR A LTGK+PFR G+   + 
Sbjct: 36  GWSDVSWNNPY-VVTPNLHTLATTGVIFNQTYAQPTCSPSRTALLTGKFPFRLGMQRVMD 94

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
           +     +P+ E+LLPQ LK+LGY+TH++GKWH+G  K E  P  RGFD+  GY +G   Y
Sbjct: 95  SKKPHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGFDSFYGYHHGSQDY 154

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
               H++  A GLD        + Q +  Y T+ F  ++ ++I  H+ + PLFL +   +
Sbjct: 155 --YTHKS--ARGLDFWDGKTSISDQ-NGVYSTESFATRAENIISQHDPNTPLFLYLPFQS 209

Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
           VHT      ++P+  LQ         TF+ I + +R+
Sbjct: 210 VHTP----HQVPSSYLQ---------TFSTIQDDNRK 233


>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
          Length = 613

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 87/254 (34%), Positives = 129/254 (50%), Gaps = 42/254 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDV FHG + IPTP+IDALA  G++L  +Y  P CTP+R+A +TGK+P   G++  V 
Sbjct: 39  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLEHGVI 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWH-----IGCN------------------ 118
           G      + + EKL+PQYL+ELGY TH++GK       IG +                  
Sbjct: 99  GVSHPYGLGLEEKLMPQYLRELGYRTHMVGKVSLEHGVIGVSHPYGLGLEEKLMHQYLRE 158

Query: 119 ---------KEELLPFNRGFDNHVGYWNGYLT-YNDSIHETDFAVGLDARRNMERYAPQM 168
                    KE L P +RGF++  GY+ G    Y   I       G D   +   + P +
Sbjct: 159 LGYRTHMVGKESLTPSHRGFESFYGYYAGMGDYYTHEITSDGNMTGFDFHMDGSVHKP-V 217

Query: 169 SSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDR 228
             +Y T+ FT+++  +I  HN   PL++ + H AVH+      +     LQ P   E  +
Sbjct: 218 FGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHSANYDGQR-----LQAP--HEYYK 270

Query: 229 TFAHISNPDRRLFA 242
            F +I++ +RR +A
Sbjct: 271 RFPNITHENRRKYA 284


>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
          Length = 481

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 45  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 103

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 104 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 163

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 164 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 220

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 221 VAFQAVHT 228


>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
          Length = 569

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
 gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
 gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
 gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
 gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
          Length = 569

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
 gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
          Length = 569

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
          Length = 569

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
 gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
 gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
          Length = 569

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
          Length = 572

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
 gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
          Length = 487

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 119/218 (54%), Gaps = 20/218 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H   D+ TP +D LA+ G++LN+ Y    CTPSR AF+TG YP+  G    V 
Sbjct: 40  GWNDVGWHNP-DVKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGYYPYHAGSQHLVF 98

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               A+ +P     LP+ LK+LGY+TH++GKWH+G    +  P  RGFD+  GY+N    
Sbjct: 99  LPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGFDSFYGYYNADED 158

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   +     A GLD R + E    + + +Y T FFTD+ V +I+ H    PLFL +   
Sbjct: 159 YYTHV----VAGGLDLRDDKEVVNTK-NGQYGTYFFTDRMVDIIEKHPADTPLFLYLPFQ 213

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
            VH             L+VP+  EN   + ++ N +RR
Sbjct: 214 NVHEP-----------LEVPERFEN--IYMNVQNENRR 238


>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
          Length = 573

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 59  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 117

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 118 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 177

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 178 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTLLYAQRASHILASHSPRRPLFLY 234

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 235 VAFQAVHT 242


>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
          Length = 571

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPRRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
          Length = 573

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 58  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 117 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 177 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 233

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 234 VAFQAVHT 241


>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 507

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 121/230 (52%), Gaps = 30/230 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW+DVG+H ++ I TPNID LA  G+ L  +Y  P CTP+RA  +TG+Y    G+   V 
Sbjct: 40  GWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPLCTPTRAVLMTGRYQIHTGMQHGVL 98

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A   + +P  E L+PQ LKE GY+TH++GKWH+G  K    P +RGFD   G    YL 
Sbjct: 99  MAQEPRCLPTDEVLMPQKLKESGYTTHMVGKWHLGFYKWACTPNHRGFDTFFGM---YLA 155

Query: 142 YNDSIHETDFAVG-----LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
             D  + T    G      D R   +  AP+   KY T  F +++  +IK H+ + PLFL
Sbjct: 156 GGDYFNHTRLCHGRRLAAWDLRDGDQVVAPEYVGKYSTIVFAEKAQEIIKKHDPTNPLFL 215

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPD----MEENDRTFAHISNPDRRLFA 242
            ++  AVH             LQVP+    M ++D     I +  RR++A
Sbjct: 216 YLSFQAVHAP-----------LQVPERYINMYKDD-----IRDESRRIYA 249


>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
           melanoleuca]
          Length = 573

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 81/214 (37%), Positives = 116/214 (54%), Gaps = 14/214 (6%)

Query: 2   DTPVGAGVAKAVPVTEKLL------PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
           D P  AGV +  P     +       QG++DVG+HG +DI TP +D LA  G+ L  +Y 
Sbjct: 32  DGPGEAGVEQPXPSQPPHIIFILTDDQGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYI 90

Query: 56  LPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
            P CTPSR+  LTG+Y    G+  + +       +P+ +  LPQ L+E GYSTH++GKWH
Sbjct: 91  QPICTPSRSQLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWH 150

Query: 115 IGCNKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSK 171
           +G  ++E LP  RGFD  +G   G   Y TY++   +     G D     E  A  +S +
Sbjct: 151 LGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQ 207

Query: 172 YLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           Y T  +  +  H++ SH+  RPLFL +   AVHT
Sbjct: 208 YSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT 241


>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
 gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
 gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
 gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
 gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
          Length = 573

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SHN   PLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
          Length = 518

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/225 (36%), Positives = 117/225 (52%), Gaps = 22/225 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW D G HG   + TPN+DA+A +GI+L ++YT   C+P+R++ LTG+YP RYG+    +
Sbjct: 29  GWGDFGVHGSK-LETPNLDAIARDGILLEKYYTQQVCSPTRSSLLTGRYPIRYGMQHNVI 87

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            AG    +P    LLPQ LK  G++TH++GKWH G + E +LPFNRGFD++ GY  G   
Sbjct: 88  LAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPFNRGFDSYYGYLQGAED 147

Query: 142 YNDSIH-ETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVI-KSHNHSRPLFLQ 197
           +   I  +     G+D         P  S+   Y T+ ++ Q   V+ K     +P FL 
Sbjct: 148 HYSRIQCQAKEWCGVDF---CTENGPTNSTWGTYGTEIYSAQVAQVLDKVSKEEKPFFLY 204

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
                VH             LQ P  E     F  I +PDRR +A
Sbjct: 205 YAMQNVHDP-----------LQAP--EHYKIKFDWIEDPDRRTYA 236


>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
          Length = 516

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 120/229 (52%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DVG+H  ++I TP +D L+  G+ L  +Y  P CTPSR   +TG+Y    G+    +
Sbjct: 36  GWYDVGYH-LSEIRTPILDKLSSGGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQII 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+ EKLLPQ +KE GY+TH++GKWH+G  K++ LP  RGFD+++GY  G   
Sbjct: 95  WPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSED 154

Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y          +++ T  A+ L   R  E  A      Y T+ F+ ++V +I+ H  + P
Sbjct: 155 YYTHIRCHPISALNLTRCALDL---REAEAVARSYKGTYSTELFSQRAVSIIEKHTSTEP 211

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   AVH             LQVP  E     ++ I +  RR +A
Sbjct: 212 LFLYVAFQAVHAP-----------LQVP--ERYVAPYSFIQDHSRRSYA 247


>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 510

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/213 (36%), Positives = 115/213 (53%), Gaps = 14/213 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV FHG   IPTPNID LA +G++LN +Y LPTCTPSRAA +TG YP   G+ + + 
Sbjct: 14  GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTGLYPIHTGMQSDII 73

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
              A   +P+  K+LPQ+ ++LGY  ++IGKWH+G  K   +P  RGFD   G++ G   
Sbjct: 74  EPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 133

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y  +       +  V L A       A    ++ +    TD  ++V     + +P F   
Sbjct: 134 YYNHTSGSENNEQGVSLVATLTA---AWGRIAQGVEPLLTDSPLNV-----YPQPFFCYF 185

Query: 199 THAAVHTGTAGNA-KLPT-GLLQVPDMEENDRT 229
           +H AVH+       + P   +L+ P + E++RT
Sbjct: 186 SHHAVHSALMAEPFQAPARNVLKFPYIGESNRT 218


>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
          Length = 597

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 88  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 146

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+ELGYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 147 IRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 206

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  +PLFL 
Sbjct: 207 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 263

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 264 VAFQAVHT 271


>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 193

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/152 (48%), Positives = 91/152 (59%), Gaps = 6/152 (3%)

Query: 10  AKAVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTG 69
           A  V    K +  G+NDV FHG   IPTPNID +A  G++LN +Y LP CTPSR+A +TG
Sbjct: 33  ANIVSSNFKEINLGFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTG 92

Query: 70  KYPFRYGI-----DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLP 124
           +YP   GI     DT + A  A  V + EK LPQYLK +GY TH IGKWH+G   +E  P
Sbjct: 93  RYPIHTGIFYHTYDT-IFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTP 151

Query: 125 FNRGFDNHVGYWNGYLTYNDSIHETDFAVGLD 156
             RGFD+  GY+ G   Y D    ++   GLD
Sbjct: 152 TYRGFDSFYGYYGGQADYWDHSLASNGWWGLD 183


>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
          Length = 565

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 51  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 109

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+ELGYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 110 IRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 169

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  +PLFL 
Sbjct: 170 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 226

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 227 VAFQAVHT 234


>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
          Length = 574

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 60  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+ELGYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 178

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  +PLFL 
Sbjct: 179 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 235

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 236 VAFQAVHT 243


>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
 gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
          Length = 575

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 60  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 178

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  RPLFL 
Sbjct: 179 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 235

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 236 VAFQAVHT 243


>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
          Length = 503

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 79/224 (35%), Positives = 128/224 (57%), Gaps = 22/224 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G++DVG+HG ++I TP +DAL+ +G+ L  +Y  P CTP+R+  ++G+Y    G+    +
Sbjct: 45  GFHDVGYHG-SEIHTPTLDALSASGVRLENYYVQPICTPTRSQLMSGRYQIHTGLQHGII 103

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
            +    A+P     L   LKE GY+TH++GKWH+G  K+E LP+NRGFD + GY N    
Sbjct: 104 NSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGYLNAAED 163

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y  +N    +  +   LD R N      + + +Y    FT +++ V++SHN S+PLFL +
Sbjct: 164 YFNHNVPWRQVRY---LDLRDNNGPVRNE-TGQYSAHLFTGKAIDVVQSHNTSKPLFLYL 219

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + +VH             L+VP  E+ +  + +I++ +RR FA
Sbjct: 220 AYQSVHAP-----------LEVP--EKYEHKYRNITDKNRRTFA 250


>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
          Length = 573

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 58  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 117 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  RPLFL 
Sbjct: 177 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 234 VAFQAVHT 241


>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
          Length = 570

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 55  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 113

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 114 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 173

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  RPLFL 
Sbjct: 174 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 230

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 231 VAFQAVHT 238


>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
 gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
 gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
          Length = 573

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  +PLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
          Length = 484

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/205 (39%), Positives = 111/205 (54%), Gaps = 18/205 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H    + TPNID LA  G++LN+ Y  P C+PSR AF+TG YP+R G+   V 
Sbjct: 37  GWNDVGYHNPA-MKTPNIDKLAREGLILNQTYFQPLCSPSRHAFMTGYYPYRAGLQHLVI 95

Query: 83  AGVAKAV-PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                   P+  K LPQ LK+LGY+TH++GKWH+G    +  P  RGFD+  G+++    
Sbjct: 96  MPWQPVCSPLNMKFLPQRLKDLGYATHMVGKWHLGMCNWDCTPTYRGFDSFFGFYHAKAD 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   I        LD R N E+    ++  Y T  FT ++  +IK HN S+PLFL +   
Sbjct: 156 YYSHISYK----YLDYRDN-EKPVKNLNGTYSTFTFTSRAQDIIKKHNSSQPLFLYMAFP 210

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEEN 226
                      +P   LQVP   E+
Sbjct: 211 -----------IPHEPLQVPQQYED 224


>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
          Length = 569

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ +H+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
          Length = 572

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ +H+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
          Length = 613

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 102 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 160

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 161 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 220

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  +PLFL 
Sbjct: 221 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPRQPLFLY 277

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 278 VAFQAVHT 285


>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
          Length = 598

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 224 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 282

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E+GYSTH++GKWH+G  K+  LP  RGFD  +G   G  
Sbjct: 283 IRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNV 342

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A + S +Y T  +  ++  ++ SHN  +PLFL 
Sbjct: 343 DYYTYDNC--DGPGVCGYDLHEG-ESVAWEQSGQYSTLLYAQRASQILASHNPRQPLFLY 399

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 400 VAFQAVHT 407


>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
 gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
          Length = 501

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 77/186 (41%), Positives = 104/186 (55%), Gaps = 11/186 (5%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            G+ D GF+G +DIPTPN+D LA  G  L + Y LP CTP+RAA +TG+YP RYG+   V
Sbjct: 70  MGFGDAGFNG-SDIPTPNLDKLAAEGARLEQFYALPMCTPTRAALMTGRYPLRYGLQVGV 128

Query: 82  -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
             A    ++PV E LLPQ LK+ GY+T ++GKWH+G  K E  P  RGFD   G   G +
Sbjct: 129 IPAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQRGFDYFYGALVGEI 188

Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             +  S H        D  RN +   P   + +    F +++V V++ H    PLFL + 
Sbjct: 189 DHFKHSSHGVK-----DWYRNNK---PLNETGFDNTLFGNEAVRVVERHEGKSPLFLYLA 240

Query: 200 HAAVHT 205
             A HT
Sbjct: 241 FTAPHT 246


>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
          Length = 556

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 126/229 (55%), Gaps = 35/229 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 88  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 146

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 147 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 206

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      +S++ T  A+ L   R+ E  A + ++ Y T+ FT ++  VI +H   + 
Sbjct: 207 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEK- 262

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
                   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 263 --------SVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 290


>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
          Length = 569

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LP  L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPQQPNCLPLDQVTLPHKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+  RPLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
 gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
          Length = 455

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 79/227 (34%), Positives = 119/227 (52%), Gaps = 22/227 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+G+H  + I TPN+D LA  G+ L  +Y  P C+PSR   +TG+Y   YG+  + +
Sbjct: 12  GWNDIGYHN-SFIRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSVI 70

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E  LPQ LKE GYST+++GKWH+G  ++E +P  RGF+   GY  G   
Sbjct: 71  MCDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGFERFFGYLTGGED 130

Query: 142 YNDSIHETDFAV------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y      + F+       GLD  R+ ++     +  Y T  F  +++ +I +H+ S+P+F
Sbjct: 131 YWTHRKPSQFSKDPSEFHGLDL-RDQDKPVLDQNGTYSTHLFARKAIEMILNHDQSKPMF 189

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +   AVH           G L+ P  EE  R +  I N   R +A
Sbjct: 190 LYLPFQAVH-----------GPLEAP--EEYKRIYEDIDNSLVRTYA 223


>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 486

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 85/222 (38%), Positives = 114/222 (51%), Gaps = 23/222 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DVGF+G  DI TP++DALA  G    + Y  P CTP+RAA +TG+YPFRYG+ T V 
Sbjct: 41  GWQDVGFNGCKDIQTPHLDALAKGGARFTQFYVQPMCTPTRAALMTGRYPFRYGLQTAVI 100

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             V+   +   E LLPQ L++ GY+T +IGKWH+G   ++  P  RGF+   G   G L 
Sbjct: 101 PSVSTYGLDTGEYLLPQCLQDAGYTTAIIGKWHLGHADKKFWPKQRGFEYQYGAMIGELD 160

Query: 142 -YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y  S H       LD  R+ E   P     Y T+     +V  ++     RP +L +  
Sbjct: 161 YYTHSEHGV-----LDWFRDNE---PVHEEGYTTNLLGADAVKYLEKQKADRPFYLYLAF 212

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A HT             Q P  E  DR + HI++P RR +A
Sbjct: 213 NAPHTP-----------YQAP-QEYIDR-YTHIADPTRRTYA 241


>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 437

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 83/222 (37%), Positives = 114/222 (51%), Gaps = 12/222 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DVGFHG   IP PNIDALA +G++LN++Y  P    SR   LTG YP+R G+   + 
Sbjct: 33  GWADVGFHGSRQIPVPNIDALAADGVILNKYYAQPWPLSSRIGLLTGIYPYRTGVGRVML 92

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                A+P   ++LP + K LGY TH +G W++G  K+   P NRGFD     W G   Y
Sbjct: 93  PCQPVALPSVFRILPTFFKSLGYRTHFVGVWNLGFYKKRFTPVNRGFDTAYAKWTGPGDY 152

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQS--VHVIKSHNHSRPLFLQITH 200
                +T    G D R N +    Q +  Y T  FT+++     +      +PL L ++H
Sbjct: 153 WTHDMQTKMQ-GFDLRLNDDLMWNQ-TGVYSTRLFTERADPTRKLCFFVLHQPLLLILSH 210

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            A+HT          G LQ P +E  D +F  I +  RR+FA
Sbjct: 211 QALHTANY------HGSLQCP-LEHLD-SFGFIRDRKRRIFA 244


>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
          Length = 560

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 112 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 170

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 171 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNV 230

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+   PLFL 
Sbjct: 231 DYYTYDNC--DGPGVCGFDLHEG-ETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 287

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 288 VAFQAVHT 295


>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
 gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
          Length = 488

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 74/183 (40%), Positives = 103/183 (56%), Gaps = 7/183 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW D+G+HG + I TPNI+ LA +GI+L+ +Y  P CTP+R+A +TGKYP   G    V 
Sbjct: 36  GWFDLGYHG-SVIRTPNINQLAGDGIILDNYYVQPLCTPTRSALMTGKYPIHLGTQHGVI 94

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    +P+    LP+ LK+ GY+TH++GKWH+G  KE+ +P  RGFD+  GY+ G   
Sbjct: 95  LPGQPMGLPLDSSTLPEQLKQQGYATHIVGKWHLGFYKEDFVPTKRGFDSFYGYYCGA-- 152

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
                H T   +G    R+ +         Y T  FT ++V  I  HN S PLFL +   
Sbjct: 153 ---EDHFTHNVLGFLDFRDNDLIVKDQKGTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQ 209

Query: 202 AVH 204
            VH
Sbjct: 210 NVH 212


>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
 gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
          Length = 260

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 17/190 (8%)

Query: 23  GWNDVGFHG-ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG----- 76
           GW+DVG+H   + + TPNID LA  G+ L  +Y+ P CTPSR A +TGKYP   G     
Sbjct: 36  GWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGALMTGKYPIHLGMQHFV 95

Query: 77  IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
           I+     G+ +  P     +PQ L+ LGY T +IGKWH+G    +  P  RGFD+ +G++
Sbjct: 96  INITSPWGMPRRFPT----IPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFLGFF 151

Query: 137 NGYLTYNDSIHETDFAVG-LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            G     +  H     +G LD RR+ E  A +   ++ TD FT +++++   HN S+PLF
Sbjct: 152 AG-----EQDHWRHSKMGFLDFRRD-EEPANEYGGQHSTDVFTQEAINIAMRHNASQPLF 205

Query: 196 LQITHAAVHT 205
           L +++AAVHT
Sbjct: 206 LLLSYAAVHT 215


>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
 gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
          Length = 478

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 107/190 (56%), Gaps = 9/190 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D+G+HG   I TP +D LA  G+ L  +Y  P C+PSR   +TG+Y  RYG+  + +
Sbjct: 12  GWDDIGYHGSF-IQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRYQIRYGLQHSVI 70

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +     +P+ E  LPQ LKE GYST+++GKWH+G  ++E +P  RGFD   GY  G   
Sbjct: 71  TSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFDKFYGYLTGGED 130

Query: 142 YNDSIHETDFAV------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y        +A       GLD  R+ ++     +  Y T  F  +++ +I++H  S+P+F
Sbjct: 131 YWTHRRPNLYAKDPLAFHGLDL-RDQDKPVLDQNGTYSTHLFAKKAIEIIQNHERSKPMF 189

Query: 196 LQITHAAVHT 205
           L +   AVH+
Sbjct: 190 LYLPFQAVHS 199


>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 496

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 74/184 (40%), Positives = 104/184 (56%), Gaps = 9/184 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G +DVG+ G ++I TP++D LA  G  L++ Y  P C+P+RAA LTG+YPFRYG  T V 
Sbjct: 64  GSHDVGWRG-SEIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGRYPFRYGFQTGVV 122

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
              A+  +P+ E+ LPQ LKE GY T + GKWH+G  +   LP  RGFD+  G++NG L 
Sbjct: 123 RPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGFDHQYGHYNGMLD 182

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   I       G D  RN +         Y T+    ++   ++  + SRPLFL +   
Sbjct: 183 YYTHIRHG----GFDWHRNDQE---NHDEGYSTELVGKEAARRVRERDKSRPLFLYVPFN 235

Query: 202 AVHT 205
            VH+
Sbjct: 236 GVHS 239


>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
          Length = 571

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +  H++ SH+  +PLFL 
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLY 232

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 233 VAFQAVHT 240


>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
 gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
          Length = 498

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 112/221 (50%), Gaps = 21/221 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG +DI TPNID LA  G  L + Y  P CTP+RAA +TG+YPFRYG+ T V 
Sbjct: 67  GWKDVGFHG-SDIKTPNIDQLAEKGGRLEQFYAQPMCTPTRAALMTGRYPFRYGMQTAVI 125

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    + + + LLP+ LKE GY+T   GKWH+G       P  RGFD+  G   G + 
Sbjct: 126 PQGGTYGLALDDYLLPEMLKEAGYATAASGKWHLGHADTAFWPRQRGFDSFYGALLGEI- 184

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
             D           D  RN E       + +    F  ++V VI  H+ S+PLFL +   
Sbjct: 185 --DHFTHKSANGNADWYRNNEAIE---EAGFDNILFATEAVRVINEHDQSKPLFLYLAFT 239

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + HT             Q P  E  DR  +HI++  RR +A
Sbjct: 240 SPHTP-----------FQAPK-EYLDRN-SHIADESRRAYA 267


>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
 gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
          Length = 540

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 17/190 (8%)

Query: 23  GWNDVGFHG-ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG----- 76
           GW+DVG+H   + + TPNID LA  G+ L  +Y+ P CTPSR A +TGKYP   G     
Sbjct: 46  GWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGALMTGKYPIHLGMQHFV 105

Query: 77  IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
           I+     G+ +  P     +PQ L+ LGY T +IGKWH+G    +  P  RGFD+ +G++
Sbjct: 106 INITSPWGMPRRFPT----IPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFLGFF 161

Query: 137 NGYLTYNDSIHETDFAVG-LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            G     +  H     +G LD RR+ E  A +   ++ TD FT +++++   HN S+PLF
Sbjct: 162 AG-----EQDHWRHSKMGFLDFRRD-EEPANEYGGQHSTDVFTQEAINIAMRHNASQPLF 215

Query: 196 LQITHAAVHT 205
           L +++AAVHT
Sbjct: 216 LLLSYAAVHT 225


>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
          Length = 484

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 59  QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 117

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 118 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNV 177

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  ++ H++ SH+   PLFL 
Sbjct: 178 DYYTYDNC--DGPGVCGFDLHEG-ETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 234

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 235 VAFQAVHT 242


>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
          Length = 477

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 120/221 (54%), Gaps = 15/221 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GWNDVG+    DI TP +D LA +G++LN  Y  P C+PSR  F++GK+P+  G+ D  V
Sbjct: 35  GWNDVGWINP-DIKTPTLDRLARSGVILNSSYVQPLCSPSRNCFMSGKFPYHTGLQDKVV 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                K +P     +PQ LK LGY TH++GKWH+G    +  P  RGFD  +GY+ G   
Sbjct: 94  FIEQPKYMPANLTTIPQRLKTLGYDTHMVGKWHLGFCNWKYTPTYRGFDTFMGYYAGMED 153

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   + +          RNM         +Y T  F ++++ +I +H+ S+P++L +   
Sbjct: 154 YFTHVRDEVADYNGYDFRNMTDVYKGAQGEYSTYVFANRAIDIIMNHDKSKPMYLYLPFQ 213

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             LQVP  + +DR ++HI + +R++++
Sbjct: 214 AVHVP-----------LQVP-TKYSDR-YSHIHDLNRKVYS 241


>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 530

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 121/235 (51%), Gaps = 29/235 (12%)

Query: 22  QGWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--- 77
            GWND+G+   +    TP +  LA NG+ L ++YT  TCT SRAA L+G  P   GI   
Sbjct: 37  MGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQSTCTVSRAALLSGVLPMHNGISHG 96

Query: 78  ----DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
               D+P+G      +P+  KLLPQYL+E GY T+++GKW IG   EE LP NRGFD+  
Sbjct: 97  TIVMDSPIG------LPLKYKLLPQYLQESGYRTYMVGKWDIGHFNEEYLPHNRGFDHFF 150

Query: 134 GYWNGYLTYNDSIHETDFAVGLDA---RRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-- 188
           G++   +TY   I    +    +     RN +      S +Y TD F +++V  ++ H  
Sbjct: 151 GFYGADITYFSHISSRGYCANPNCFPDLRNEDETMANASMRYTTDLFRERAVGFVEGHAA 210

Query: 189 NHSR-PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           NH+  PLFL ++  A H  T+   +          M       A  +N +RR+FA
Sbjct: 211 NHATDPLFLYLSFNAPHYPTSAPQEF---------MRNEAELLAPFTNRERRVFA 256


>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
          Length = 476

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 121/230 (52%), Gaps = 26/230 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G +DVG+HG   I TPNID LAY G+ L  +Y  P CTPSR+  +TG+Y    G+    +
Sbjct: 12  GHHDVGYHGSV-IKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRYQIHTGLQHNII 70

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                 AVP+   +LP+ LK+ GYSTH++GKWH+G  K+E+LP NRGFD++ GY  G   
Sbjct: 71  NPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFDSYYGYLTGSED 130

Query: 139 YLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS---HNHSR 192
           Y T+              G+D R + E    Q + KY T  F +++  V++    H   +
Sbjct: 131 YFTHRRCGALPGANKTVCGIDLRNDFEVDWNQ-TGKYSTQLFAEKAEDVVRKHAVHQPDQ 189

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL +   AVH       ++P   L+  D          I +P RRL A
Sbjct: 190 PLFLYVAFQAVHAPN----QVPNEYLKPYD----------IDDPKRRLLA 225


>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
          Length = 684

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 85/227 (37%), Positives = 117/227 (51%), Gaps = 32/227 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
           GWNDVGFH    + TPNID LA  G++LN+ Y  P C+PSR A +TG YP+  G+     
Sbjct: 237 GWNDVGFHNPA-MKTPNIDKLAREGLILNQTYLQPLCSPSRHALMTGYYPYHAGLQHLVI 295

Query: 79  ---TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
               PV +      P+  K LPQ LK++GY+TH++GKWH+G       P  RGFD+  GY
Sbjct: 296 LPWQPVCS------PLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNCTPTYRGFDSFFGY 349

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           +N      D    T +   LD R N E+    ++  Y T  F  ++  +IK HN S+PLF
Sbjct: 350 YNA---QGDHYSHTWYNY-LDYRDN-EKPVKNLNGTYSTFTFVSRAQDIIKKHNSSQPLF 404

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L +    VH             +QVP   E+   + +I    RR F+
Sbjct: 405 LYMAFQNVHDP-----------IQVPKQYED--MYPNIKTRGRRQFS 438


>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 181

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 61/127 (48%), Positives = 82/127 (64%), Gaps = 13/127 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           GWNDV FHG + IPTP+IDALA  G++L  +Y  P CTP+R+A +TGK+P   G+     
Sbjct: 40  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D P G G        E ++PQYL+ LGY TH++GKWH+G  KE L P +RGF+++ GY
Sbjct: 100 IADEPYGLG------TNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGY 153

Query: 136 WNGYLTY 142
           + G   Y
Sbjct: 154 YGGMQDY 160


>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
          Length = 513

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/215 (37%), Positives = 118/215 (54%), Gaps = 25/215 (11%)

Query: 37  TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKL 95
           TP++DALA  G++L+ +YT P CTPSR+  LTG+Y  R G+    +       VP+ EKL
Sbjct: 49  TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108

Query: 96  LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------YLTYNDSIH 147
           LPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G          T  D+++
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALN 168

Query: 148 ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGT 207
            T  A+     R+ E  A    + Y T+ FT +++ +I +H   +PLFL +   +VH   
Sbjct: 169 VTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEP- 224

Query: 208 AGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
                     LQVP  EE  + +  I + +R  +A
Sbjct: 225 ----------LQVP--EEYLKPYDFIQDKNRHHYA 247


>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
          Length = 564

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 70/186 (37%), Positives = 106/186 (56%), Gaps = 3/186 (1%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ D+G+HG ++I TP +D LA  G+ L  +Y  P C+PSR+ F+TGKY    G+  + 
Sbjct: 57  QGYRDIGYHG-SEIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGKYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LK+ GY TH++GKWH+G  K+E +P  RGFD+  G   G  
Sbjct: 116 IRPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGFDSFFGSLLGSG 175

Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             YN    ++    G D   N      Q +  Y T+ +T + + ++ SHN ++PLFL I 
Sbjct: 176 DYYNHYKCDSPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSSHNPNKPLFLYIA 235

Query: 200 HAAVHT 205
           + AVH+
Sbjct: 236 YQAVHS 241


>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
 gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
          Length = 472

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 76/177 (42%), Positives = 98/177 (55%), Gaps = 10/177 (5%)

Query: 67  LTGKYPFRYGIDTPVGAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPF 125
           +TGKYP   G+   V  G+  + +P+TEKLLPQYLKELGY  H+ GKWH+G    +  P 
Sbjct: 1   MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60

Query: 126 NRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
            RGFD+HVG+W G+    D       A GLD RR  +  A  +   Y T    D+SV  I
Sbjct: 61  ERGFDSHVGFWTGHHHMFDHTAVETNAWGLDMRRGFD-VAYDLHGYYTTHVIRDESVAAI 119

Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++HN S+P+FL ++HAA H+        P   L  PD  E     A ISN  RR FA
Sbjct: 120 RAHNTSQPMFLYVSHAATHSAN------PYDFLPAPD--ETVERLAGISNYSRRKFA 168


>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
 gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
          Length = 491

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 28/224 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           G+ DVGFHG +DI TPN+D LA  G  L + YT P CTP+RAAFLTG+YP  YG+    +
Sbjct: 60  GFADVGFHG-SDIKTPNLDHLAAQGARLGQFYTQPFCTPTRAAFLTGRYPLHYGLQVGAI 118

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +G    +   E LLPQ LK++GY T L+GKWH+G   ++  P  RGFD+  G   G + 
Sbjct: 119 PSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPRQRGFDSFYGPLVGEID 178

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           +    HE        A    + Y      K   Y T+ F  ++V +I +H+   PLFL +
Sbjct: 179 HFK--HE--------AHGVTDWYHDNTQVKEEGYDTELFGKEAVRLIAAHDPKTPLFLYL 228

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              A HT             Q P    +   +AHI+ P RR +A
Sbjct: 229 AFTAPHT-----------PFQAPQSYLDQ--YAHIAAPQRRAYA 259


>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 120/230 (52%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D+G+HG   + TP++D LA  GI L  +Y  P CTP+R+  ++G+Y    G+  T +
Sbjct: 34  GWHDIGYHGSR-VQTPHLDKLASEGIKLENYYVQPMCTPTRSQLMSGRYQIHTGLQHTVI 92

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN---G 138
                  +P+ E  + Q LKE GYSTH++GKWH+G   +  LP  RGFD+  G++N    
Sbjct: 93  NPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGCLPTKRGFDSFFGFYNCAVD 152

Query: 139 YLTYNDSI-----HETDFAV-GLDARRNMERY-APQMSSKYLTDFFTDQSVHVIKSHNHS 191
           Y TY         +ET   + G D  RN E + AP     Y T     ++  VI+ HN S
Sbjct: 153 YYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHYQTHVLAKEAEDVIRKHNPS 212

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
           +PLFL +   AVH             L+VP + E+   +A + +  RR+ 
Sbjct: 213 KPLFLYLAFGAVHVP-----------LEVPKVYED--MYADVKDNSRRIL 249


>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
          Length = 511

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 73/199 (36%), Positives = 99/199 (49%), Gaps = 18/199 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWND GF G   + TP ID L   GI LN+HY    C+P+RAA +TG+YP RYG+  P  
Sbjct: 25  GWNDCGFAGTR-VKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGRYPHRYGLQFPFC 83

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
            G A A+   E LLPQY+K  GY+T  +GKWH+G  + +  P  RGFD+  G+++    Y
Sbjct: 84  GGAAMALNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGFDSFYGFYSCAEDY 143

Query: 143 --------NDS---IHETDF------AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
                   N S   +   DF      + G D  +            Y T  F  + V ++
Sbjct: 144 FFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAVGTDWQHYSTTLFAGRIVDIV 203

Query: 186 KSHNHSRPLFLQITHAAVH 204
             H+ S+PLFL       H
Sbjct: 204 DGHDPSQPLFLYFASQDTH 222


>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
           caballus]
          Length = 523

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 123/229 (53%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G +DVG H E+   TP +DALA  G++L+ +YT P C PSR+  L+G+Y    G+    +
Sbjct: 46  GXHDVGLH-ESRFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQLLSGRYQIHTGLQHQII 104

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 105 WPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 164

Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
                  T+ D+++ T  A+     R+ E  A    + Y T  FT+++  +I +H   +P
Sbjct: 165 YYSHERCTFIDALNVTRCALDF---RDGEEVATGYKNMYSTSVFTERATALITNHPPEKP 221

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LFL +   +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 222 LFLYLALQSVHQP-----------LQVP--EEYLKPYDFIQDKNRYHYA 257


>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 513

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 71/193 (36%), Positives = 110/193 (56%), Gaps = 13/193 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+D+G+HG + + +P +D LA  G+ L  +Y  P C+P+RA  ++G+Y  RYG+   V 
Sbjct: 37  GWHDIGYHG-SIVRSPYMDFLASEGVKLENYYVQPMCSPTRAQLMSGRYQIRYGLQHLVI 95

Query: 83  AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN---- 137
               +A +P  E  + Q +KE GY+TH++GKWH+G  ++E LP NRGFD   G+ N    
Sbjct: 96  QPDQRACLPPDEVTIAQKMKEAGYATHMVGKWHLGFYRKECLPINRGFDTFFGFLNCLIY 155

Query: 138 ------GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
                 GY    +S ++T    G D  RN +  A +   +Y T  F +++  VI +H+  
Sbjct: 156 HYTYDFGYWHTPESGNKT-IMFGWDLFRNHDCVAKEHKGEYSTILFAEEAQRVIWNHDQE 214

Query: 192 RPLFLQITHAAVH 204
            P+FL +  AAVH
Sbjct: 215 TPMFLYLPFAAVH 227


>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
          Length = 573

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 72/188 (38%), Positives = 107/188 (56%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 59  QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 117

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L++LGY+TH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 118 IRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 177

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A  +S +Y T  +  +   ++  H+  RPLFL 
Sbjct: 178 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLY 234

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 235 VAFQAVHT 242


>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
          Length = 584

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 73/188 (38%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 64  QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 122

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E+GYSTH++GKWH+G  K+  LP  RGFD  +G   G  
Sbjct: 123 IRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNV 182

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A + S +Y T  +  ++  ++ SH+  +PLFL 
Sbjct: 183 DYYTYDNC--DGPGVCGYDLHEG-ENVAWEQSGQYSTLLYAQRASQILASHSPHQPLFLY 239

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 240 VAFQAVHT 247


>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 726

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 73/222 (32%), Positives = 120/222 (54%), Gaps = 44/222 (19%)

Query: 23  GWNDVGFHGENDIP--TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-T 79
           GWND+G+H   D+   TPN+D L+ +G+ ++++Y++  CTP+RAA +TG+YP RYG+   
Sbjct: 123 GWNDIGYHS-TDLANVTPNLDRLSASGVKVSQYYSMSICTPARAALMTGRYPVRYGLQYN 181

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            +  G    +P+TEKLLP+Y+ E GY +H++GKWH+G       P  RGF+ + GY N  
Sbjct: 182 VIQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAHTPHRRGFETYFGYLNDE 241

Query: 140 LTYNDSIHETDFAVGLDARR--------------NMERYAP------------------- 166
             Y    H+T +   ++ R+               +ER+ P                   
Sbjct: 242 EMY--WTHQT-WTATINGRKFFDFGFGNATGFYDVIERFDPPPGDDDLVSTGPTSSVYSS 298

Query: 167 --QMSSKYLTDFFTDQSVHVI--KSHNHSRPLFLQITHAAVH 204
             ++   Y T+ FTD+++ ++  K+ +   PLFL ++H AVH
Sbjct: 299 SLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVH 340


>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
          Length = 515

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G N++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECMPTRRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
             + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 TAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
           obscuriglobus UQM 2246]
          Length = 459

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 70/184 (38%), Positives = 100/184 (54%), Gaps = 8/184 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G  D GF G  +I TPNID +A  G  L+  Y  P C+P+RAAF+TG+YP R+G+   V 
Sbjct: 36  GREDCGFMGGKEIKTPNIDKIAAAGATLDAFYAQPVCSPTRAAFMTGRYPMRHGLQVGVV 95

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
              A+  +P+ E+ + Q LK+ GY+T +IGKWH+G    E LP  RGFD+  G++NG L 
Sbjct: 96  RPWAQYGLPLDERTVAQGLKDAGYTTAVIGKWHLGHFAPEYLPTKRGFDHQYGHYNGALD 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   I +  F    D + N +         Y T      +V  +++H   +P FL +   
Sbjct: 156 YFTHIRDGGFDWHRDDKVNSD-------EGYSTHLVAKDAVQFVQTHAGKKPFFLYVPFN 208

Query: 202 AVHT 205
           AVH 
Sbjct: 209 AVHA 212


>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 570

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 84/226 (37%), Positives = 121/226 (53%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H  +DI TP +D LA +G+ L  +Y  P CTPSR+ F+TG+Y    G+  + 
Sbjct: 58  QGFNDIGYH-SSDIRTPVLDKLAADGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P  +  LPQ L+ELGYSTH++GKWH+G  K+E LP  RGFD + G   G  
Sbjct: 117 IRPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
            Y TY+    +     G D     E  A   S KY T  +T +   ++ +H+  S+PLF+
Sbjct: 177 NYYTYDGC--DGAGLCGFDLHEG-ESVAWGQSGKYSTHLYTQRVRKILATHDPQSQPLFI 233

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            ++  AVHT            LQ PD  E    +  + N  RR +A
Sbjct: 234 FLSFQAVHTP-----------LQYPD--EYIYPYLGLENVARRKYA 266


>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
 gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
          Length = 497

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 93/247 (37%), Positives = 120/247 (48%), Gaps = 32/247 (12%)

Query: 5   VGAGVAKAVPVTEKLL-----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTC 59
            GAG A+A      ++       GW DVG+HG +DI TPNID LA  G  L + Y  P C
Sbjct: 43  AGAGEARAQGAAPNIVYIISDDSGWKDVGYHG-SDIRTPNIDRLAAEGARLEQFYVQPMC 101

Query: 60  TPSRAAFLTGKYPFRYGIDTPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCN 118
           TPSRAAF+TG+YPFRYG+ T V        + + E  LPQ LK+ GY T + GKWH+G +
Sbjct: 102 TPSRAAFMTGRYPFRYGLQTAVIPQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHS 161

Query: 119 KEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTD 175
           K    P  RGFD+  G   G         E D      A  N + Y    + K   Y   
Sbjct: 162 KTAYWPRQRGFDSFYGALLG---------EIDHFTHKAANGNPDWYRNNKALKEEGYDNI 212

Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISN 235
               ++V VI  H+  +PLFL +   A HT             Q P  E  DR  +HI++
Sbjct: 213 LIGAEAVRVINKHDQQKPLFLYLAFTAPHTP-----------YQAPK-EYLDRN-SHIAD 259

Query: 236 PDRRLFA 242
             RR +A
Sbjct: 260 ESRRKYA 266


>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
 gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
          Length = 492

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 117/226 (51%), Gaps = 20/226 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWND+G+H  + I TPN+D LA  G+ L  +Y  P C+PSRA  +TG+Y  RYG+   V 
Sbjct: 27  GWNDIGYH-SSLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTGRYQIRYGMQHSVL 85

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
            +     +P+ E  LPQ LKE GY+TH++GKWH+G  K+E LP  RGFD   G+  G   
Sbjct: 86  MSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRGFDTFFGFLGGGED 145

Query: 139 YLTYN--DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y T+   + I ET          +  +     +  Y T  F  +S+ +I  H+  +P+FL
Sbjct: 146 YFTHRIPNEIVETPETYRAFDFWDGSKPCLSENGSYSTHVFARKSIDLISRHDKDKPMFL 205

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +   AVH             L+ P  EE    + HI + + R +A
Sbjct: 206 YLPFQAVHAP-----------LEAP--EEFINKYTHIRSKNMRTYA 238


>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
          Length = 517

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 85/229 (37%), Positives = 124/229 (54%), Gaps = 35/229 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+GFHG + I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +
Sbjct: 51  GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLI 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  VP+ EKLLPQ LK+ GY+TH++GKWH+G  ++E LP  RGFD + GY  G   
Sbjct: 110 MTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 169

Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y T+      + ++ T  A+ L   R+ E  A + +  Y T+ FT ++  +I +H   + 
Sbjct: 170 YYTHEACAPIECLNGTRCALDL---RDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEK- 225

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
                   +VH             LQVP  EE    +  I +  RR++A
Sbjct: 226 --------SVHDP-----------LQVP--EEYMEPYDFIQDKHRRIYA 253


>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 673

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 72/187 (38%), Positives = 105/187 (56%), Gaps = 5/187 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H  +DI TP +D LA  G+ L  +Y  P CTPSR+ F+TG+Y    G+  + 
Sbjct: 58  QGFNDIGYH-SSDIKTPTLDKLAAKGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P  +  LPQ L++LGYSTH++GKWH+G  K+E LP  RGFD + G   G +
Sbjct: 117 IRPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176

Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQI 198
             Y  S  +     G D     E  A     KY T  FT +   ++  H+  S+PLF+ +
Sbjct: 177 NYYTYSSCDGPELCGFDLHEG-ESVAWDQGGKYSTHLFTQRVRKILARHDPQSQPLFIFL 235

Query: 199 THAAVHT 205
           +  AVH+
Sbjct: 236 SFQAVHS 242


>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
          Length = 601

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 88  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 146

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 147 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 206

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 207 DYYTHYKC--DSPGMCGYDLYENDSAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLY 264

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 296


>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 489

 Score =  129 bits (325), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 119/222 (53%), Gaps = 23/222 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H   DI  P ++ LA +G++ N+ Y  P CTPSRAA LTG YPF+      + 
Sbjct: 34  GWNDVGWHNA-DIKMPILNQLAADGVIFNQSYVQPACTPSRAALLTGYYPFKIQRQHQML 92

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             + A  + +  K LP+ LK++GY THL+GKWH+G  KEE LP  RGFD+    + G+LT
Sbjct: 93  LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDS----FYGWLT 148

Query: 142 YNDSIH--ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
              +++  E   A G D R N      Q S  YL     +++V ++  H    PLFL+ +
Sbjct: 149 LGTTLYSKENIIAPGYDFRDNTG--VVQESDTYLPFMLAERAVDIVMGHYKEYPLFLEFS 206

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
            A           L    L+VP  +E +  ++ I +  +R F
Sbjct: 207 MA-----------LSGKFLEVP--QEYEDLYSDIEDDRQRKF 235


>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
          Length = 601

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 88  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 146

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 147 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 206

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++PLFL 
Sbjct: 207 DYYTHYKC--DSPGMCGYDLYENDNAAWDSDNGIYSTQMYTQRVQQILASHNPTKPLFLY 264

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 LAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 296


>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
 gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
          Length = 584

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 107/222 (48%), Gaps = 39/222 (17%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV FHG   IPTP ID  A  G++LN +Y  P CTPSRA+ +TGKYP          
Sbjct: 38  GWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTGKYPIN-------- 89

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                                      +G WH+G   +E  P  RGFD+  G+WN    Y
Sbjct: 90  ---------------------------LGMWHLGFFTKEYTPVYRGFDSFYGFWNAKTDY 122

Query: 143 -NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            N S +E +F  G+D R NME    +    Y T+ FT ++V VI++H+ S PLFL + H 
Sbjct: 123 WNHSSYENNFW-GVDLRDNMEPVQSE-DGTYGTELFTREAVKVIEAHDTSTPLFLYVAHQ 180

Query: 202 AVHTGTAGN-AKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVHT       + P   + V   +   R    I +  R+++A
Sbjct: 181 AVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYA 222


>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
          Length = 597

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 85  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 143

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 144 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 203

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 204 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 261

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 262 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 293


>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
          Length = 599

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
          Length = 596

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 585

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 80/182 (43%), Positives = 109/182 (59%), Gaps = 11/182 (6%)

Query: 23  GWNDVGFHGEND-IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           GWNDVG++  ND +PTP ++ LA NG++LN  Y+ P C+PSRAA LTGKYP   GI    
Sbjct: 119 GWNDVGWN--NDFMPTPILNELASNGVILNNTYSQPACSPSRAALLTGKYPANAGIQHLV 176

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           V       +P+   LL   LKELGY  H IGKWH+G    +  P  RGFD+  G +NGYL
Sbjct: 177 VQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHLGFCNWKYTPLWRGFDSFYGIFNGYL 236

Query: 141 T-YNDSIHETDF-----AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           + Y+  I  + F     A GLD R N    A + +  ++T  FT+++  +I++HN + PL
Sbjct: 237 SDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHE-NGTHVTYLFTERAERIIRNHNPAAPL 295

Query: 195 FL 196
           FL
Sbjct: 296 FL 297


>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
 gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
 gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
 gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
 gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
          Length = 599

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
          Length = 596

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
          Length = 597

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
          Length = 578

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
          Length = 596

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 648

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DVG+H ++ I TPNID LA  G+ L  +Y  P CTP+RA  +TG    RY I T + 
Sbjct: 39  GWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPICTPTRAVLMTG----RYQIHTTMQ 93

Query: 83  AGV-----AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW- 136
            GV      + +P  E L+PQ LKE GYSTH++GKWH+G  K +  P +RGFD   G++ 
Sbjct: 94  HGVLMAQEQRCLPTDEVLMPQKLKESGYSTHMVGKWHLGFYKWDCTPNHRGFDTFFGFYL 153

Query: 137 --NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
               Y T+    H        D R   +   P+ + +Y T  +  ++  VI+  + + P+
Sbjct: 154 AGGEYFTHTRKCHGHRLD-AWDLRDGDKMVGPEYTGEYSTMLYARKAQEVIRKQDPNVPM 212

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           FL ++  AVH             L+VPD    D     I +  R+L+A
Sbjct: 213 FLYVSFQAVHAP-----------LEVPD-SYADAYGKDIYDQSRKLYA 248


>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
          Length = 661

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 105/188 (55%), Gaps = 7/188 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 148 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 206

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 207 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 266

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 267 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 324

Query: 198 ITHAAVHT 205
           I + AVH+
Sbjct: 325 IAYQAVHS 332


>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
          Length = 599

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
 gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
          Length = 598

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
          Length = 599

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 1410

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 83/225 (36%), Positives = 117/225 (52%), Gaps = 27/225 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+HG + I TP++D LA  G  L  +Y    C+PSR  FLTGK+    G++  + 
Sbjct: 38  GWNDVGYHG-SSISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMIHLGMEGGII 96

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                K +PV E  + Q LK   YSTH IGKWH+G  K+  LP NRGFD   G   G   
Sbjct: 97  MPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTFFGIIGG--C 154

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            +   H+      L   RN    A +    Y TD +  ++ +VI++H+ S+P+F+ +   
Sbjct: 155 ADHYTHKNTHWWEL--YRNNISIAQEYQGHYSTDLYAREATNVIRNHDASKPMFMYLAFQ 212

Query: 202 AVHTGTAGNAKLPTGLLQVP----DMEENDRTFAHISNPDRRLFA 242
           A H        LP   LQ P    DM      +++I +PDRR++A
Sbjct: 213 AAH--------LP---LQAPRKYIDM------YSNIEDPDRRVYA 240


>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
          Length = 551

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 117/230 (50%), Gaps = 29/230 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG +DI TP +D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+  + 
Sbjct: 34  QGFRDVGYHG-SDIKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSV 92

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------ 134
           + A     +P+    LPQ LK  GYSTH++GKWH+G  K   LP  RGFD   G      
Sbjct: 93  IRAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSG 152

Query: 135 -YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSR 192
            Y++ Y     S+   D   G +A    +R        Y T+ FT ++V ++ +HN   +
Sbjct: 153 DYYSHYKCQGPSMCGYDLYEGEEAAWEQDR------GLYSTEMFTQKAVSILANHNPRKQ 206

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL + + AVH+            LQVP        +  I NP RR +A
Sbjct: 207 PLFLYLAYQAVHSP-----------LQVP--ARYLERYKGIPNPYRRKYA 243


>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
          Length = 593

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 105/188 (55%), Gaps = 7/188 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 80  QGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 138

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 139 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 198

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 199 DYYTHYKC--DSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLY 256

Query: 198 ITHAAVHT 205
           I + AVH+
Sbjct: 257 IAYQAVHS 264


>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
          Length = 605

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 92  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 150

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 151 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 210

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN ++P+FL 
Sbjct: 211 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQKILASHNPTKPIFLY 268

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 269 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 300


>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
          Length = 473

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 81/217 (37%), Positives = 118/217 (54%), Gaps = 25/217 (11%)

Query: 35  IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTE 93
           I TP++DALA  G+VL+ +Y  P CTPSR+  LTG+Y    G+    +       VP+ E
Sbjct: 7   IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDE 66

Query: 94  KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG---YLTYN-----DS 145
           KLLPQ LK+ G STH++GKWH+G  ++E LP  RGFD + GY  G   Y T+      + 
Sbjct: 67  KLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIEC 126

Query: 146 IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           ++ T  A+ L   R+ E  A + +  Y T+ FT ++  +I +H   +PLFL +   +VH 
Sbjct: 127 LNGTRCALDL---RDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVHD 183

Query: 206 GTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
                       LQVP  EE    +  I +  RR++A
Sbjct: 184 P-----------LQVP--EEYMEPYDFIQDKHRRIYA 207


>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 570

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/189 (38%), Positives = 107/189 (56%), Gaps = 9/189 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H  +DI TP +D LA +G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 58  QGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P  +  LPQ L+ELGYSTH++GKWH+G  K+E LP  RGFD + G   G  
Sbjct: 117 IRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
            Y TY+    +     G D     E  A     KY T  +T +   ++ +H+  S+PLF+
Sbjct: 177 NYYTYDSC--DGPGMCGFDLHEG-ESVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFI 233

Query: 197 QITHAAVHT 205
            ++  AVHT
Sbjct: 234 FLSFQAVHT 242


>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
          Length = 575

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 106/188 (56%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 59  QGFHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 117

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  K+E LP  RGFD  +G   G  
Sbjct: 118 IRPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 177

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y +Y++   +     G D     E  A   + KY T  +  +   ++ SHN  +P+F+ 
Sbjct: 178 DYYSYDNC--DGPGVCGFDLHEG-ENVAWDQAGKYSTLLYAQRVNQILASHNPQQPIFIY 234

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 235 VAFQAVHT 242


>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
          Length = 564

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/223 (34%), Positives = 114/223 (51%), Gaps = 16/223 (7%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TG+Y    G+  + 
Sbjct: 53  QGFRDVGYHG-SEIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQIHTGLQHSV 111

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 112 IRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDTFFGSLLGSG 171

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN  +P+FL I 
Sbjct: 172 DYYTHYKCDSPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIA 231

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + AVH+            LQ P M      +  I+N +RR +A
Sbjct: 232 YQAVHSP-----------LQAPGMYY--ERYRSINNINRRRYA 261


>gi|291225019|ref|XP_002732502.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 197

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/163 (44%), Positives = 96/163 (58%), Gaps = 5/163 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
            GWNDV +H   DI  PN+  LA +G++ N+ YT PTCTPSRAA + G YPF+ G    +
Sbjct: 37  MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMKGLYPFKTGNQHQM 95

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
              +    VP+  KLLP+ LKE+GYSTH++GKWH+G  K+E  P NRGFD+H G W  G 
Sbjct: 96  VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYQPTNRGFDSHYGLWTLGV 155

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
             Y+        + G D R NM    P+ S  YL     +QS+
Sbjct: 156 GNYDKMNGVLSPSAGYDFRDNM-GVVPK-SDDYLALMLGEQSI 196


>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 527

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/189 (38%), Positives = 107/189 (56%), Gaps = 9/189 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H  +DI TP +D LA +G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 58  QGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P  +  LPQ L+ELGYSTH++GKWH+G  K+E LP  RGFD + G   G  
Sbjct: 117 IRPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
            Y TY+    +     G D     E  A     KY T  +T +   ++ +H+  S+PLF+
Sbjct: 177 NYYTYDSC--DGPGVCGFDLHEG-ESVAWSQRGKYSTHLYTQRVRKILATHDPQSQPLFI 233

Query: 197 QITHAAVHT 205
            ++  AVHT
Sbjct: 234 FLSFQAVHT 242


>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
          Length = 574

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/188 (38%), Positives = 105/188 (55%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 60  QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  K+E LP  RGFD  +G   G  
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A   S KY T  +  +   ++ SH+   P+F+ 
Sbjct: 179 DYYTYDNC--DGPGVCGYDLHEG-ENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 236 VAFQAVHT 243


>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
          Length = 599

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SHN  +P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDRAAWDYDNGIYSTQMYTQRVQQILASHNPRKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 160

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/124 (51%), Positives = 81/124 (65%), Gaps = 10/124 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--DTP 80
           GWND+GFHG  +I TPNID LA NG+VL+ +Y LP CTPSR+A +TG+YP   G+  DT 
Sbjct: 21  GWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 80

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------ 134
            G      V + EK LPQYLK+ GY T+ +GKWH+G   +E  P  RGFD++ G      
Sbjct: 81  YGPN-PYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRGFDSYYGSYLGKG 139

Query: 135 -YWN 137
            YWN
Sbjct: 140 DYWN 143


>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
          Length = 574

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/188 (38%), Positives = 105/188 (55%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 60  QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  K+E LP  RGFD  +G   G  
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A   S KY T  +  +   ++ SH+   P+F+ 
Sbjct: 179 DYYTYDNC--DGPGVCGYDLHEG-ENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 236 VAFQAVHT 243


>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 406

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 118/230 (51%), Gaps = 23/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV +HG + IPTPNID LA +GI+L  +Y  P  TP+RAA LTG YP   G     +
Sbjct: 40  GWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYVQPLSTPTRAALLTGLYPIHTGTQRLDI 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGK---WHIGCNKEELLPFNRGFDNHVGYWNG 138
           G+     +     LLPQ    L  +   +G    WH+G  K+E  P  RGFD   G +NG
Sbjct: 100 GSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHLGFCKDEFKPTKRGFDTFYGIYNG 159

Query: 139 YLTYNDSIHETDFA------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
                DS + T FA      V   A ++ +R   + + +YLT    +Q+V +I +   ++
Sbjct: 160 -----DSDYWTHFARDNNIDVSGHALKDEKRALVEEAGRYLTSLLANQAVQLIHNRPKNK 214

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           P FL     AVH G +       G LQ P  +E    F ++++ DR+LFA
Sbjct: 215 PFFLYFAPTAVHCGGS------NGSLQAP--KEYISKFGYLADYDRQLFA 256


>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
          Length = 573

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 72/188 (38%), Positives = 105/188 (55%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 60  QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  K+E LP  RGFD  +G   G  
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A   S KY T  +  +   ++ SH+   P+F+ 
Sbjct: 179 DYYTYDNC--DGPGVCGYDLHEG-EDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 236 VAFQAVHT 243


>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
          Length = 573

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 60  QGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  + E +P  RGFD   G   G  
Sbjct: 119 IRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSG 178

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN  +P+FL I 
Sbjct: 179 DYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIA 238

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      + F H   I+N +RR +A
Sbjct: 239 YQAVHSP-----------LQAP-----GKYFEHYRSINNINRRRYA 268


>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
          Length = 571

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 58  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 117 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SH+ ++P+FL 
Sbjct: 177 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLY 234

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 235 IAYQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 266


>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
          Length = 597

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 202

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SH+ ++P+FL I 
Sbjct: 203 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLYIA 262

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 292


>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
          Length = 598

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SH+  +P+FL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----SRYFEHYRSIVNINRRRYA 294


>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
          Length = 575

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 105/188 (55%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++D+G+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 60  QGYHDIGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  K+E LP  RGFD  +G   G  
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D     E  A   S KY T  +  +   ++ SH+   P+F+ 
Sbjct: 179 DYYTYDNC--DGPDVCGYDLHEG-EDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 236 VAFQAVHT 243


>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
          Length = 573

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 70/186 (37%), Positives = 101/186 (54%), Gaps = 3/186 (1%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 60  QGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSI 118

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  + E +P  RGFD   G   G  
Sbjct: 119 IRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSG 178

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SHN  +P+FL I 
Sbjct: 179 DYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIA 238

Query: 200 HAAVHT 205
           + AVH+
Sbjct: 239 YQAVHS 244


>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 487

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 115/213 (53%), Gaps = 20/213 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+ DVGFHG  DIPTPN+DALA +G+     Y T P C+P+RA  LTG+Y  R+G +  P
Sbjct: 48  GYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNP 107

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            GA     +P+TE  +   LK++GY+T L+GKWH+G ++  + P  RGF+  +G+  G  
Sbjct: 108 SGANT--GLPLTEVTIADRLKQVGYTTGLVGKWHLG-SQPAMHPQERGFEEFIGFLGGAH 164

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           ++             DA+  +  + P  +  Y TD F  ++V  I+ H   +P FL ++ 
Sbjct: 165 SF------------FDAQGILRGHEPVKTIDYTTDLFGREAVSFIEKH-RDKPWFLYLSF 211

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
            AVHT           L  + D E   RT+A +
Sbjct: 212 NAVHTPMHATEDRMAKLASISDQER--RTYAAM 242


>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
          Length = 595

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SH+  +PLFL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
          Length = 600

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SH+  +PLFL I 
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIA 264

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
          Length = 659

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 70/172 (40%), Positives = 100/172 (58%), Gaps = 7/172 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWNDVG+H  N I TP++DALA  G+ L  +YT P CTPSR+  LTG+Y    G+    +
Sbjct: 185 GWNDVGYHDSN-IFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 243

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ EKLLP+ L+E GY TH++GKWH+G  ++E LP  RGFD   GY  G   
Sbjct: 244 WPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 303

Query: 139 YLTYNDSIHETDFAVGLDAR--RNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
           Y ++   +H     V   A   R+ E  A   ++ Y T+ FT++++ +I  H
Sbjct: 304 YYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAIDLIAKH 355


>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 564

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 68/191 (35%), Positives = 107/191 (56%), Gaps = 9/191 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G +D+G H  + I TP+ D LA +G+ L+++Y LP C+P+RA+ L+G+YP   G  T V 
Sbjct: 75  GSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRASLLSGRYPLHTGCHTIVN 134

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               + +P+ E+ LPQ L+  GY  H +GKWH+G ++    P  RGF +  G++ G   Y
Sbjct: 135 DWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQTPTFRGFQSFFGFYLGAQDY 194

Query: 143 NDSIHETD----FAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHS--RP 193
           N  I + +    + +  DAR    R   ++  +   Y T  FT +++ VI++H      P
Sbjct: 195 NTHIKQGERGNAYEMHWDARGKCGRDCSRLVDERGNYSTHVFTREAIRVIENHPQRPHEP 254

Query: 194 LFLQITHAAVH 204
           LFL + H AVH
Sbjct: 255 LFLYLAHQAVH 265


>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
          Length = 1941

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 69/180 (38%), Positives = 101/180 (56%), Gaps = 9/180 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GWND+G++    I TP +D LAY G++ N+ Y  P CTP+RAA +TG YPFR G+    V
Sbjct: 54  GWNDIGWNNLQ-IKTPVLDKLAYEGVIFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMV 112

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+  K+LPQ LK+ GY  H++GKWH+G    E  P NRGFD+  G ++  + 
Sbjct: 113 LPFQPSGLPLHLKILPQKLKQAGYINHIVGKWHLGYCNWEYTPLNRGFDSFYGSFSNSVN 172

Query: 142 YNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           +N+ I +   +      G D R N      Q   + LT  FT + V +I +H+   P+F+
Sbjct: 173 HNNKISQLPISDHSKYKGYDFRDNTG--VVQNDGQPLTKLFTQRVVDIISNHHKDYPMFM 230


>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
          Length = 599

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 87  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 145

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 146 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 205

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SH+  +P+FL 
Sbjct: 206 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLY 263

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 264 IAYQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 295


>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
 gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
 gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
          Length = 599

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 87  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 145

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 146 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 205

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SH+  +P+FL 
Sbjct: 206 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLY 263

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 264 IAYQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 295


>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
          Length = 607

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 69/186 (37%), Positives = 102/186 (54%), Gaps = 3/186 (1%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 94  QGFRDVGYHG-SEIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 152

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 153 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 212

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SH+  +P+FL I 
Sbjct: 213 DYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHDPRKPIFLYIA 272

Query: 200 HAAVHT 205
           + AVH+
Sbjct: 273 YQAVHS 278


>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
           purpuratus]
          Length = 784

 Score =  126 bits (317), Expect = 7e-27,   Method: Composition-based stats.
 Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVG+HG ++I TPNID LA  G+ L  +Y  P CTP+R+  L+G+Y    G+  + +
Sbjct: 38  GYNDVGYHG-SEIYTPNIDKLAREGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSYI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P         L+E GY+TH +GKWH+G  K+E LP  RGFD++ GY  G   
Sbjct: 97  RPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLGFYKKECLPTQRGFDSYFGYLTGGED 156

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y T+    H     + L   R++++ A +    Y    FT++   ++  H   +P  L +
Sbjct: 157 YWTH----HRKRPXLAL---RHVDKVAWEYGGYYSAFVFTEKIQQIVAQHPVEQPFLLYL 209

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              +VH+            LQVP   E    + +I N +RR++A
Sbjct: 210 PFQSVHSP-----------LQVPSSYE--ERYKNIKNTNRRIYA 240


>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
 gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
 gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
          Length = 597

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 79/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  +++ +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ SH+ ++PLFL + 
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPLFLYVA 262

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292


>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
 gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
          Length = 598

 Score =  126 bits (316), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 202

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SH+  +P+FL 
Sbjct: 203 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLY 260

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 261 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292


>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 506

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 117/222 (52%), Gaps = 23/222 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H   D+  P ++ LA +G++ N+ Y  P CTPSR+A  TG YPF+      + 
Sbjct: 34  GWNDVGWHNP-DLKMPILNQLAADGVIFNQSYVQPACTPSRSALFTGYYPFKIKRQHQML 92

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             + A  + +  K LP+ LK++GY THL+GKWH+G  KEE LP  RGFD+    + G+LT
Sbjct: 93  LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDS----FYGWLT 148

Query: 142 YNDSIH--ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
               ++  E   A G D R N      Q S  YL     +++V ++  H    PLFL+ +
Sbjct: 149 LGTDLYTKENVLAPGYDFRDNTG--VVQESDTYLPFMLAERAVDIVMGHYKEYPLFLEFS 206

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
            A           LP   L+VP  ++ +  ++ I +   R F
Sbjct: 207 MA-----------LPGKFLEVP--QDYEDLYSDIDDDRTRKF 235


>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
 gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
          Length = 357

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 122/221 (55%), Gaps = 25/221 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG--IDTP 80
           GW+DV ++  N +  PN+  LA  G++ N+ Y+   CTPSR A LTGK+P+R G  +   
Sbjct: 19  GWSDVSWNNPN-VVMPNLHTLATTGVIFNQTYSQRLCTPSRTALLTGKFPYRLGMQVQKS 77

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +    +  +P+ E+LLPQ LK+LGY+TH++GKWH+G  K E  P  RGFD+  G+ +G  
Sbjct: 78  MFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGFDSFYGFHHGGE 137

Query: 141 TYNDSIHET--DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
            Y   + E   DF    D R ++       +  Y T+ F  ++ ++I  H+ + PLFL +
Sbjct: 138 DYYTHMSERGLDF---WDGRTSVS----DRNGVYSTESFARRAENIISQHDPNTPLFLYL 190

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
              +VHT      ++P+  LQ         TF+ I + +R+
Sbjct: 191 PFQSVHT----PHQVPSSYLQ---------TFSTIQDDNRK 218


>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
          Length = 600

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 86  QGFRDVGYHG-SEIKTPTLDRLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G  
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ SH+  +P+FL 
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLY 262

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           I + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294


>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 781

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 113/228 (49%), Gaps = 43/228 (18%)

Query: 23  GWNDVGFHGENDIP--TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-T 79
           G+ D+G+    D+   TPN+DALA  G+ L+ +YT+  CTP+RA+ +TG+YP RYG+  +
Sbjct: 228 GFGDMGYQ-STDLSEITPNLDALAAGGVKLSNYYTMTLCTPARASIMTGRYPVRYGMQYS 286

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            +  G    +P +EK+LP+Y+ E GY +H++GKWH+G  ++E LP  RGF   +GY NG 
Sbjct: 287 VIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLGSYRDESLPSQRGFKTFLGYLNGI 346

Query: 140 LTYN---------DSIHETDFAVG----------------------------LDARRNME 162
            TY          D  +  DF  G                             D   N +
Sbjct: 347 ETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNHDENVGGPCTDGGPRWGDVMENED 406

Query: 163 RYAPQMSSKYLTDFFTDQSVHVIKSHN--HSRPLFLQITHAAVHTGTA 208
                 +  Y TD F  ++  ++KS       PLF+ I H +VH+ T 
Sbjct: 407 PADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMYIAHQSVHSPTG 454


>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 522

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 85/250 (34%), Positives = 121/250 (48%), Gaps = 43/250 (17%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVG+HG + + TP IDALA  G+ L  +Y    CTPSR+  LTG+Y    G+    +
Sbjct: 39  GWDDVGYHG-SVMKTPYIDALAAEGVTLENYYMPSLCTPSRSVLLTGRYEIHTGLQHGTI 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+ E  LPQ LKE GY TH++GKWH+G  ++E LP NRGFD+ +G++     
Sbjct: 98  LMMQPLCLPLDEITLPQKLKEEGYDTHMVGKWHLGFYRKECLPNNRGFDSFLGFYQAMGD 157

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH--------------- 183
           +  +N S     F  G D RRN +  A Q + KY T  F    ++               
Sbjct: 158 HFYHNISASPGHFN-GFDFRRNNDVVADQYAGKYSTHIFXXXFINTQTLSFVCVNNVKGV 216

Query: 184 -----------VIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH 232
                      +  S+N  +PLFL ++  AVHT            LQVP           
Sbjct: 217 YFRGYLSSFTPITSSYNPQQPLFLYLSFQAVHTP-----------LQVPSRYAELYNDLI 265

Query: 233 ISNPDRRLFA 242
            ++ DRR++A
Sbjct: 266 PNDEDRRIYA 275


>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 498

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 74/219 (33%), Positives = 119/219 (54%), Gaps = 20/219 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWND+G++  + I TP +D LA  G++LN+ Y LP CTP RA+ ++G Y +R G+   V 
Sbjct: 44  GWNDIGYNNPS-IFTPTLDKLAREGVILNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVL 102

Query: 83  AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                A +P+   L+PQ +KE GY+T+++GKWH+G  K E  P  RGFD+  G++N    
Sbjct: 103 DHAEPAGLPLNFTLIPQRMKEHGYTTYMLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAED 162

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y +  H T   + L   RN +      +  Y T  + +++   I +HN S P+++ +   
Sbjct: 163 YFN--HTTSKYLDL---RNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQ 217

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
           +VH           G+++ P    +  TF H +N  RR+
Sbjct: 218 SVH-----------GVIEAPQKYLDMYTFIHDTN--RRI 243


>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
          Length = 572

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 58  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 116

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  +++ +P  RGFD   G   G  
Sbjct: 117 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 176

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ +H+ ++PLFL 
Sbjct: 177 DYYTHYKC--DSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLY 234

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 235 VAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 266


>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
          Length = 614

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  +++ +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ +H+ ++PLFL + 
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVA 262

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292


>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
 gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
 gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
 gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
          Length = 598

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 118/228 (51%), Gaps = 26/228 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ LKE+GYSTH++GKWH+G  +++ +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y T+     ++    G D   N        +  Y T  +T +   ++ +H+ ++PLFL 
Sbjct: 203 DYYTHYKC--DSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLY 260

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 261 VAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292


>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
          Length = 555

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  +++ +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ +H+ ++PLFL + 
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVA 262

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292


>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
          Length = 570

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + 
Sbjct: 84  QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P+    LPQ LKE+GYSTH++GKWH+G  +++ +P  RGFD   G   G  
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202

Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y      ++    G D   N        +  Y T  +T +   ++ +H+ ++PLFL + 
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVA 262

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
           + AVH+            LQ P      R F H   I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292


>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 562

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 108/188 (57%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+HG ++I TP +D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+  + 
Sbjct: 40  QGYNDIGYHG-SEIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSI 98

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           + A     +P     LP+ L+E GYSTH++GKWH+G    E LP +RGF + +G   G  
Sbjct: 99  IRARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGFQSFLGSLTGSG 158

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            + ++     +   A G D   + +R A ++   Y T  +T++   +++ H+H +PLFL 
Sbjct: 159 DHFSFQSC--DGTEACGFDL-HDGDRPAWELRGNYSTRLYTERVKDILRRHDHRKPLFLY 215

Query: 198 ITHAAVHT 205
           +   AVHT
Sbjct: 216 VALQAVHT 223


>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
          Length = 476

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 74/226 (32%), Positives = 117/226 (51%), Gaps = 25/226 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+D+G+HG   I TP +D LAYNGI L  +Y  P C+P+R+ F++G Y    G+    +
Sbjct: 19  GWHDIGYHGSK-IRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSGVYQIHTGLQHNVI 77

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               A  +P+    +   ++E GY+TH+ GKWH+G  KEE LP NRGFD + GY NG   
Sbjct: 78  WPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRGFDTYYGYLNGCED 137

Query: 142 YNDSIH-----ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y D  +       DF +  D + N+  Y+  +    + +   +  +     ++  +PLFL
Sbjct: 138 YYDKSYCHPYCGYDFRLNDDIQWNLTDYSTYLYVSRVNEILLNHKI-----YSPDKPLFL 192

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +   +VH             L+VP  +E    ++HI + +RR +A
Sbjct: 193 YLPLQSVHEP-----------LEVP--KEYSDKYSHIKDNNRRTYA 225


>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
 gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
          Length = 507

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 76/220 (34%), Positives = 116/220 (52%), Gaps = 21/220 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+HG +DI TP+ID LA  G+ L+R Y    C+P+RAA L+G+     GI +P+ 
Sbjct: 52  GWNDVGYHG-SDIHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQSSQSLGIYSPLS 110

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 + + +K++P Y ++ GY T ++GKWH+G  + E  P  RGFD+  G   G + Y
Sbjct: 111 KLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGFDHFYGNLTGGVGY 170

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
            + +H      GLD +RN +    +    Y T   + +   +I+  +  +PLFL     A
Sbjct: 171 WNHVH----GGGLDWQRNGKTLRQE---GYSTHLQSAEITRLIQQRDPEKPLFLYAAFNA 223

Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            H        LP    + P   +    +AHI NP+RR+ A
Sbjct: 224 PH--------LPN---EAP--ADTLARYAHIENPNRRIHA 250


>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
          Length = 1542

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/187 (37%), Positives = 101/187 (54%), Gaps = 5/187 (2%)

Query: 22   QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
            QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+  + 
Sbjct: 1037 QGFRDVGYHG-SEIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSI 1095

Query: 81   VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            +       +P+    LPQ LK  GYSTH++GKWH+G  K   +P  RGFD   G   G  
Sbjct: 1096 IRPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGFDTFFGSLLGSG 1155

Query: 141  TYNDSIHETDF--AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
             Y  S ++ D     G D     E    Q    Y T  +T ++V+++ SHN  RP+FL +
Sbjct: 1156 DYY-SHYKCDSPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILASHNPKRPIFLYL 1214

Query: 199  THAAVHT 205
               AVH+
Sbjct: 1215 AFQAVHS 1221


>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
 gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 463

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 78/189 (41%), Positives = 109/189 (57%), Gaps = 12/189 (6%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT- 79
           QGW DVGF+G  DIPTPN+D LA  GIV +  Y + P C+PSRA  LTG+Y  R+G D  
Sbjct: 36  QGWADVGFNGATDIPTPNLDRLASEGIVFDNAYVSHPYCSPSRAGLLTGRYQARFGHDCN 95

Query: 80  -PVGAGVAKAV--PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
            P  +     V  P++EK++P+ LKE GY T  IGKWH+G +   L P ++GFD+  G+ 
Sbjct: 96  MPYDSENDDTVGTPLSEKMIPEALKEHGYRTSAIGKWHLG-DHPSLHPIHQGFDHWFGFA 154

Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            G + Y   I +      +   RN E   PQ   +YLTD FTD+++  I   +  +P F+
Sbjct: 155 GGGMNYW-GIPDGPIKTIV---RNGEP-VPQNELRYLTDDFTDEAIDFITKKD-DKPFFM 208

Query: 197 QITHAAVHT 205
            + + A H 
Sbjct: 209 YLAYNAPHA 217


>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 486

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 111/223 (49%), Gaps = 25/223 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           GW DVGF+G  +I TPNIDALA  G   ++ Y    CTP+RA  +TG++P+RYG+ T   
Sbjct: 40  GWKDVGFNGCTEIKTPNIDALAKGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVI 99

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P  AG    +  +E L+PQ L + GY T +IGKWH+G   ++  P  RGFD   G   G 
Sbjct: 100 PTAAGY--GLDTSEYLMPQCLGDAGYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGE 157

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           L Y       +  V LD  R+ +   P     Y T    D +V  I   +  +P +L +T
Sbjct: 158 LDY---FTHDEHGV-LDWFRDNK---PVHEQGYTTTLIGDDAVKYIHGQDGKKPFYLYLT 210

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             A HT             Q P  +E    + +I+ P RR +A
Sbjct: 211 FNAPHTP-----------YQAP--KEYITKYLNIAEPTRRTYA 240


>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 496

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 68/183 (37%), Positives = 98/183 (53%), Gaps = 5/183 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H    I TP ID LA +G+ LN +Y    C PSR   ++G++    G+     
Sbjct: 36  GWNDVGYHNSY-IKTPTIDMLAKSGVRLNNYYVASHCVPSRNMLISGRHVIDIGLQHGEI 94

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               + +P+ E  +   LKE+GY+THLIGKWH GC     LP NRGFD   GY     + 
Sbjct: 95  GYYPRGLPLDEFTIADKLKEIGYATHLIGKWHCGCYSNHSLPHNRGFDTFFGYLG---SS 151

Query: 143 NDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           +D       + GL D R N E    +    Y T  + +++ ++I  H+ ++PLFL +  +
Sbjct: 152 DDHYTHIIMSNGLADLRLNDECVGYKYFGDYSTIMYANEAKNIIAQHDENKPLFLMLAFS 211

Query: 202 AVH 204
           AVH
Sbjct: 212 AVH 214


>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 539

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 111/206 (53%), Gaps = 22/206 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G+ DVG+   + + TPNID LA  G+ L RHY  P+C PSR+  + G+Y    G +    
Sbjct: 72  GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFNYKCT 131

Query: 82  -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G+G    +      +P  LKE GY+TH++GKWH+G  + E LP  +GFD   GY     
Sbjct: 132 DGSGSQLCMHPDTITIPMKLKENGYATHMVGKWHLGNIRWECLPNAKGFDTFFGYHGASE 191

Query: 141 TYNDSIHETDFA-VGLDAR---RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            Y      T F+  G + R   RN +  A +   +Y T  FT++++++I++H+ S+P+FL
Sbjct: 192 DY-----YTHFSPAGRECRDLWRNRDDVAQEYYGQYSTHIFTNEALNIIENHDVSKPMFL 246

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPD 222
            + + AVH           G LQVP+
Sbjct: 247 YLPYQAVH-----------GPLQVPE 261


>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
 gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
          Length = 498

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 75/185 (40%), Positives = 105/185 (56%), Gaps = 11/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G+ DVG+HG +D+ TPN+D LA  G  L + YT P CTP+RAA +TG YP RYG+ T V 
Sbjct: 64  GYADVGYHG-SDVKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGCYPMRYGLQTGVI 122

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +G    +   E LLPQ LKE GY T L+GKWH+G   ++  P  RGFD   G   G + 
Sbjct: 123 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGFDYFYGPLVGEID 182

Query: 142 YNDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           +    HE   A G+ D  R+ E         Y T+ F   ++ +I+ H+ + PL++ ++ 
Sbjct: 183 HFK--HE---AHGITDWYRDNEMVK---EPGYDTELFGADAIRLIEEHDSATPLYMYLSF 234

Query: 201 AAVHT 205
            A HT
Sbjct: 235 TAPHT 239


>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
          Length = 503

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 122/230 (53%), Gaps = 28/230 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G++DVG+HG + I TPNID LA+ G+ L  +Y  P CTP+R+  L+G+Y    G+  + +
Sbjct: 42  GYHDVGYHG-SAIRTPNIDRLAFEGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSII 100

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A    A+P     L   L+E GY+ H++GKWH+G  KEE +P NRGFD+  GY  G   
Sbjct: 101 WAAQPNALPKELPTLADKLREEGYANHIVGKWHLGFYKEEYVPTNRGFDSFYGYLTGSEF 160

Query: 142 YND------SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS---R 192
           Y +       I+ +D   GLD R N         S+Y T  + +++  ++  H  +   +
Sbjct: 161 YYNKTYCLAQINRSD-VCGLDFRENDRSIN---ESEYSTHLYAERTKQLVADHTSAHPDQ 216

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL +   +VH           G L+VP   +    + HI + +R+++A
Sbjct: 217 PLFLYLALQSVH-----------GPLEVP--AQYRTPYKHIKDENRQIYA 253


>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 504

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 78/221 (35%), Positives = 113/221 (51%), Gaps = 25/221 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H    I TP ID LA NG+ LN +Y    C PSR   ++G++         V 
Sbjct: 39  GWNDVGYHNLY-IKTPTIDRLANNGVKLNNYYAANLCVPSRNMLMSGRHVH------GVI 91

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT- 141
            G  + +P+ E  +   LKE GYSTHL+GKW+ G   +E LP NRGFD   G+ +     
Sbjct: 92  MGYPRGLPLNETTIANKLKEAGYSTHLVGKWNCGFYSKEFLPHNRGFDTFFGFVDSKEDH 151

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y   +H+       D RRN    A +    Y T  + ++   +I +H+ ++PLFL ++ +
Sbjct: 152 YTHMVHDIS-----DLRRNDLCVADKYYGNYSTIMYGNEGTTIIDNHDTNKPLFLFMSFS 206

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           AVH             LQVP + E +     I + DRR++A
Sbjct: 207 AVHEP-----------LQVPSVYEKEY-IPTIDDTDRRIYA 235


>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 514

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 122/230 (53%), Gaps = 14/230 (6%)

Query: 9   VAKAVPVTEKLLPQGWNDVGFHGEND-IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFL 67
           +A    +   +   GWNDVG+H  ND +PTPN++ LA  G++L+  Y+ P CTPSR A +
Sbjct: 36  IAMICIIILTISASGWNDVGWH--NDFMPTPNLNTLAREGVILDNMYSQPICTPSRVALM 93

Query: 68  TGKYPFRYGIDTPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
           TGKYP + G+   V   +    +P     L + LKE GY+ H++GKWH+G    +  P  
Sbjct: 94  TGKYPAKVGMQHFVVLPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMW 153

Query: 127 RGFDNHVGYWNGYLTYNDSIHETDF--AVGLDAR--RNMERYAPQMSSKYLTDFFTDQSV 182
           RGFD+H G   G +T N   H   +   VG+  R  R+        +  + T  F++++ 
Sbjct: 154 RGFDSHYGCHEG-VTSNFETHMLTWPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAE 212

Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEND-RTFA 231
            ++K+HN   PLFL + + A H       + P G  +   +++ D RTFA
Sbjct: 213 RIVKNHNPESPLFLYVPYMAPHFP----LQAPQGFEEAVQLDDTDRRTFA 258


>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
           sphaeroides KD131]
 gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
           KD131]
          Length = 509

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 74/185 (40%), Positives = 105/185 (56%), Gaps = 11/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G+ DVG+HG +D+ TPN+D LA  G  L + YT P CTP+RAA +TG+YP RYG+ T V 
Sbjct: 75  GYADVGYHG-SDVKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRYPMRYGLQTGVI 133

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +G    +   E LLPQ LKE GY T L+GKWH+G   ++  P  RG D   G   G + 
Sbjct: 134 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVDYFYGPLVGEID 193

Query: 142 YNDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           +    HE   A G+ D  R+ E         Y T+ F   ++ +I+ H+ + PL++ ++ 
Sbjct: 194 HFK--HE---AHGITDWYRDNEMVK---EPGYDTELFGADAIRLIEEHDSATPLYMYLSF 245

Query: 201 AAVHT 205
            A HT
Sbjct: 246 TAPHT 250


>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 72/202 (35%), Positives = 104/202 (51%), Gaps = 16/202 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ DVG+   + + TPNID LA  G+ L RHY  P+C PSR+  + G+Y    G D    
Sbjct: 37  GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRCK 96

Query: 83  AGVAKAVPV--TEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G    + +      +P  LKE GY+TH+IGKWH+G  + E LP  +GFD   GY +   
Sbjct: 97  DGKRSQLCMHPDTITMPMKLKENGYATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIE 156

Query: 141 TYNDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y    H T       D  RN +  A     +Y T  FT ++  +IK+H+ ++P+F+ ++
Sbjct: 157 DY--FTHYTPAGANCHDFWRNHDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLS 214

Query: 200 HAAVHTGTAGNAKLPTGLLQVP 221
           + AVH           G LQVP
Sbjct: 215 YQAVH-----------GPLQVP 225


>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
          Length = 427

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 104/188 (55%), Gaps = 8/188 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G +DV  HG  DI TPN+D LA +G++LN +Y  P C+P+R + +TG+YP+R G+     
Sbjct: 31  GKHDVSMHGA-DIYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRYPYRLGLQHENL 89

Query: 83  AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
            G   A +P+ E ++PQY+KE GY T+++GKW +G  K+  LP+ RGFD   G   G   
Sbjct: 90  VGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFDEFFGQLLGGQD 149

Query: 139 YLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y +    +   ++    G D R   +      S KY    + D++     +HN + PL++
Sbjct: 150 YYSRRKCLKLRNYGNLCGYDLRTE-QGPVRDTSMKYQPFLYADKAREKFFAHNKTDPLYM 208

Query: 197 QITHAAVH 204
            +   +VH
Sbjct: 209 YVAFQSVH 216


>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
          Length = 514

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 112/233 (48%), Gaps = 18/233 (7%)

Query: 14  PVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF 73
           PV       GWNDV +H E  + +P ++ LA  G++L++HY LPTCTP+RAA +TG+YP+
Sbjct: 27  PVVPARFKPGWNDVSWHNER-MESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPY 85

Query: 74  RYGIDT-PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
           + GI +  +       +P+    L + LK  GY+TH  GKWH+G   + L P  RGFD  
Sbjct: 86  KLGIQSHGIRTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTF 145

Query: 133 VGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN 189
            G++ G   Y ++  S  +T         RN +         Y T    +  +  I+   
Sbjct: 146 RGFYVGGQDYFSHTLSGGKTSATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQ 205

Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +P+FL +   AVH             LQVP   +  +  +   NP R+L  
Sbjct: 206 PDKPMFLYVAFQAVHAP-----------LQVP--TQYRKMCSIYRNPKRKLLC 245


>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 558

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 70/189 (37%), Positives = 106/189 (56%), Gaps = 9/189 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H   DI TP +D LA  G+ L  +Y  P CTPSR+ F+TG+Y    G+  + 
Sbjct: 48  QGFNDIGYH-NTDIHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSI 106

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           + +     +P   + LPQ L+E GY+TH++GKWH+G  K + LP  RGF+ + G   G  
Sbjct: 107 IRSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGFNTYFGSLTGSV 166

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
            Y TY     +     G D   + ER A     +Y T  +T +   ++ +H+  S+PLF+
Sbjct: 167 DYYTYKSC--DGPKVCGFDL-HDGERVAWGQGGRYSTHLYTQRVRKILAAHDPSSQPLFI 223

Query: 197 QITHAAVHT 205
            ++  AVHT
Sbjct: 224 FLSFQAVHT 232


>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
          Length = 511

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 104/188 (55%), Gaps = 8/188 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G +DV  HG  DI TPN+D LA +G++LN +Y  P C+P+R + +TG+YP+R G+     
Sbjct: 31  GKHDVSMHGA-DIYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRYPYRLGLQHENL 89

Query: 83  AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
            G   A +P+ E ++PQY+KE GY T+++GKW +G  K+  LP+ RGFD   G   G   
Sbjct: 90  VGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFDEFFGQLLGGQD 149

Query: 139 YLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y +    +   ++    G D R   +      S KY    + D++     +HN + PL++
Sbjct: 150 YYSRRKCLKLRNYGNLCGYDLRTE-QGPVRDTSMKYQPFLYADKAREKFFAHNKTDPLYM 208

Query: 197 QITHAAVH 204
            +   +VH
Sbjct: 209 YVAFQSVH 216


>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 490

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/216 (36%), Positives = 111/216 (51%), Gaps = 9/216 (4%)

Query: 26  DVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGV 85
           D+G+ G + I TP+IDALA  G+ L  +Y LP CTP+RAA +TG+YP R G+ T V    
Sbjct: 40  DLGYRG-SKIKTPHIDALAKGGVRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVIFPS 98

Query: 86  AK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYND 144
            +  +P  EK LPQ LKE+GY T ++GKWH+G   ++  P NRGFD+  G   G + Y  
Sbjct: 99  HRYGLPTDEKTLPQALKEVGYHTAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVDY-- 156

Query: 145 SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
                +    +D +RN E         Y  D    ++V +I  H+ ++PLFL     A H
Sbjct: 157 --FTRERGGVVDWQRNGEFL---REDGYYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPH 211

Query: 205 TGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
                          + D E +      ISN DR++
Sbjct: 212 APYQAPKADIDAYNDIFDNEMHRTYAGMISNLDRQV 247


>gi|313216787|emb|CBY38029.1| unnamed protein product [Oikopleura dioica]
          Length = 383

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 110/198 (55%), Gaps = 18/198 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G+H + +I TP +D LA  G+ L ++Y  P CTP+R   +TG+Y  RYG+     
Sbjct: 187 GYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRYQIRYGMQ---- 241

Query: 83  AGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            GV +      VP+ EKLLP+ L++ GY+T +IGKWH+G   E+ LP NRGFD+ +G++ 
Sbjct: 242 HGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFDHFMGFYT 301

Query: 138 G---YLTYNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
           G   + ++N         DF      +  + R+   +++ Y T  F D+    +   N S
Sbjct: 302 GSQDFYSHNKCFSGMCGYDFREATAGQPEVIRW--DLNNTYSTGVFADELEKRLSKMNPS 359

Query: 192 RPLFLQITHAAVHTGTAG 209
            P F  ++  AVH+   G
Sbjct: 360 EPSFTYLSFQAVHSPLQG 377


>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
           carolinensis]
          Length = 580

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 104/188 (55%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++D G+HG +DI TP +D LA  G+ L  +Y  P CT SR+  +TG+Y    G+  + 
Sbjct: 68  QGFHDXGYHG-SDIXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQIHTGLQHSI 126

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P  +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD  +G   G  
Sbjct: 127 IRPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 186

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y TY++   +     G D   + E  A + S KY T  +  +   ++ +HN   P+F+ 
Sbjct: 187 DYYTYDNC--DGPGVCGYDL-HDGENVAWEQSGKYSTFLYAQRVNKILAAHNPKEPIFIY 243

Query: 198 ITHAAVHT 205
           I   AVHT
Sbjct: 244 IAFQAVHT 251


>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 505

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 101/188 (53%), Gaps = 11/188 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ DVG+   + + TPNID LA  G+ L R+Y   +C PSR+  + G+Y    G D    
Sbjct: 37  GYFDVGYREGSIVKTPNIDKLAAEGVKLERYYAQSSCMPSRSCLMMGRYQIHTGFDYRCL 96

Query: 83  AGVAKAVPVTEKL--LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G    + +      LP  L++ GY+TH+IGKWH+G  ++E +P ++GFD   GY     
Sbjct: 97  DGQLTRLCMAPDTVTLPMKLRQYGYATHMIGKWHLGHERKECVPTHKGFDTFFGYHGAAE 156

Query: 141 TYNDSIHETDFAVGL----DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            Y      T  A+G     D  RNME  A     +Y T F+T ++  +IK+H+  +P F+
Sbjct: 157 NYY-----THTALGRPRCHDLWRNMENVAEDYDGQYSTLFYTKEAQDIIKNHDKKKPFFM 211

Query: 197 QITHAAVH 204
            +++ AVH
Sbjct: 212 YLSYQAVH 219


>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
          Length = 571

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 69/192 (35%), Positives = 103/192 (53%), Gaps = 12/192 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           G+NDV +H    + TPN+  +A NG++L   Y+   CTPSRA+++TG YPFR G+ ++ V
Sbjct: 43  GYNDVSWHNPQ-VLTPNLGKMAKNGVILTESYSQAACTPSRASYMTGYYPFRIGVQNSVV 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G+   VP+    LP+ LKE GY +HL+GKWH+G  + ++ P  RGFD  +G  NGY  
Sbjct: 102 REGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRDVTPPGRGFDTFLGLLNGYND 161

Query: 142 -YNDSIHETDFAVGLDARRNMERY--------APQMSSKYLTDFFTDQSVHVIKSHNHSR 192
            Y   I         D       Y         P   + Y TD FT++++ +I+    + 
Sbjct: 162 YYTKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDYTTDIFTNRAIELIQQSKDT- 220

Query: 193 PLFLQITHAAVH 204
           P FL + + A H
Sbjct: 221 PFFLALHYTAPH 232


>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 458

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/233 (34%), Positives = 121/233 (51%), Gaps = 26/233 (11%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ DVGF+G  DIPTP++D++A NG+  ++ H + P C PSRA  LTG+Y  R+G  T 
Sbjct: 30  QGYQDVGFNGCKDIPTPHLDSIAQNGVNCIDAHVSYPVCGPSRAGLLTGRYQDRFGFTTN 89

Query: 81  VGAGVAKAV---PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
                   +   P+ EK + + LKE+GYS+ +IGKWH+G +     P NRGFD+  G+ +
Sbjct: 90  PTVNPENPIAGLPLEEKNIAEVLKEVGYSSSIIGKWHMGTHPIH-HPLNRGFDHFFGFLS 148

Query: 138 GYLTYNDSIHE----TDFAVGLDARRN---MERYAPQMSSKYLTDFFTDQSVHVI-KSHN 189
           G   Y  + +     ++     D  R     +R   Q+S  YLTD  TD +V  I K  +
Sbjct: 149 GGHDYFPAKYNLKDLSEVKRIWDWYRTHLIRDRERIQVSEGYLTDILTDAAVDFIDKKAS 208

Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +P  L +++ A HT    +             E+  + F HI +  RR +A
Sbjct: 209 EKKPFMLYLSYNAPHTPLQAS-------------EKYLKRFTHIKDSKRRTYA 248


>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
 gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
          Length = 470

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 114/222 (51%), Gaps = 19/222 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW+D+G+H    I TPN+D LA  G+ L  +Y  P C+PSR   +TG+Y   YG+   V 
Sbjct: 12  GWDDIGYHNHF-IHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRYQIHYGLQHGVI 70

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+ E  LPQ LK+ GY T+++GKWH+G  K+E  P  RGFD   G+  G   
Sbjct: 71  RNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFDKFYGFLTGSED 130

Query: 142 YNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y    H     V GLD R   E    + +  Y T  F  ++  +I  H+ ++P+FL +  
Sbjct: 131 Y--WTHRRYKGVRGLDLRDQDEPVLDE-NGTYSTHLFARKATDMILKHDQNQPMFLYLPF 187

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            AVH           G LQVP  E+  + + HI+    R++A
Sbjct: 188 QAVH-----------GPLQVP--EKYLQEYMHINFTVDRIYA 216


>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 500

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 119/228 (52%), Gaps = 23/228 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+NDVG+HG ++I TPNID LA  G+ L  +Y  P CTP+R+  L+G+Y    G+  + +
Sbjct: 38  GYNDVGYHG-SEIYTPNIDKLAREGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSYI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P         L+E GY+TH +GKWH+G  K+E LP  RGFD++ GY  G   
Sbjct: 97  RPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLGFYKKECLPTQRGFDSYFGYLTGGED 156

Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y T++ +    +  ++  +G+D   N E+ A +    Y    FT++   ++  H   +P 
Sbjct: 157 YWTHHRAGDGLLPNSNHWLGMDLWDN-EKVAWEYVGNYSAFVFTEKIQQIVAQHPVEQPF 215

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            L +   +VH+            LQVP   E    + +I N +RR++A
Sbjct: 216 LLYLPFQSVHSP-----------LQVPSSYE--ERYKNIKNTNRRIYA 250


>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
          Length = 609

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/194 (35%), Positives = 109/194 (56%), Gaps = 18/194 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G+H + +I TP +D LA  G+ L ++Y  P CTP+R   +TG+Y  RYG+     
Sbjct: 109 GYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRYQIRYGMQ---- 163

Query: 83  AGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            GV +      VP+ EKLLP+ L++ GY+T +IGKWH+G   E+ LP NRGFD+ +G++ 
Sbjct: 164 HGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFDHFMGFYT 223

Query: 138 G---YLTYNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
           G   + ++N         DF      +  + R+   +++ Y T  F D+    +   N S
Sbjct: 224 GSQDFYSHNKCFSGMCGYDFREATAGQPEVIRW--DLNNTYSTGVFADELEKRLSKMNPS 281

Query: 192 RPLFLQITHAAVHT 205
            P F  ++  AVH+
Sbjct: 282 EPSFTYLSFQAVHS 295


>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 490

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 118/231 (51%), Gaps = 24/231 (10%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ D  F G  DI TPN+DALA +G+   R Y T P C+PSRA  +TG+Y  R+G    
Sbjct: 49  QGYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNN 108

Query: 81  VGAGVAKAV---PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           + A  A  +   P  E LLPQ L + GY T ++GKWH+G  ++   P+ RGFD   G   
Sbjct: 109 IVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGL-QDGCRPYERGFDEFFGIIT 167

Query: 138 GYLTYNDSIHETDFAVGLDA-RRNMERYAP--QMSSKYLTDFFTDQSVHVIKSHNHSR-- 192
           G   Y  + H  + AVG  + +  +ER  P  +    YLTD F   +V +I+  +  R  
Sbjct: 168 GGHDYFVN-HPEERAVGDQSYKARIERNGPVGEAVPGYLTDAFGADAVRIIRESHTKRPD 226

Query: 193 -PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            PLFL +   A HT T    + P  L+        D   A + + DRR +A
Sbjct: 227 QPLFLYLAFNAPHTPT----QAPKDLV--------DTMPATLESKDRRTYA 265


>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 588

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 12/144 (8%)

Query: 23  GWNDVGFHGEN-DIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--DT 79
           GW+D+G+   +    TP +D LA  G+ +  +YT+ TCTP+RA+ +TG+Y  RYG+  + 
Sbjct: 6   GWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYGMQYNV 65

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            +  G    VP++EK+LP+Y KE GY THL+GKWH+G +  E +P  RGFD ++GY  G+
Sbjct: 66  AINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMGYVGGF 125

Query: 140 LTY---------NDSIHETDFAVG 154
             Y         +D  H  DF  G
Sbjct: 126 EAYWTHETVGVISDGRHVCDFGFG 149


>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 843

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/216 (32%), Positives = 118/216 (54%), Gaps = 10/216 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG++      TP ID LA +G+ L  +Y    C PSR   +TG++  + GI     
Sbjct: 377 GWNDVGYNNPV-FKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQRHGF 435

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               +++P+ E  + Q LK++GYSTH+IGKWH G   +  LP NRGFD   G+    + +
Sbjct: 436 GYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNCLPHNRGFDTFFGFVGAGIEH 495

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
               H   F    + R+N +  A Q   KY T  + ++  ++I +H+ ++P FL ++ +A
Sbjct: 496 --YTHSDHFNHMHNLRKNDDCIAKQYIGKYSTTIYANEGKNIINAHDQNKPFFLYLSFSA 553

Query: 203 VHTGTAGNAKLPTGLLQVPD---MEENDRTFAHISN 235
           VHT      ++P+  L+  +    +E+ RT+A +++
Sbjct: 554 VHTP----LEVPSSYLKQYESTIYDEDRRTYAAMTS 585


>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 115/224 (51%), Gaps = 18/224 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H    I TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 54  QGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGRYQIHTGLQHSI 112

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           + +     +P     LP+ L+  GYSTH++GKWH+G  ++  LP  +GFD   G   G +
Sbjct: 113 IRSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGFDTFFGSLTGSV 172

Query: 141 TYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQI 198
            Y          V G D   N ER A     KY T  FT ++  ++KSH+ + RPLFL +
Sbjct: 173 DYYSYGSCNGPGVCGYDLHDN-ERVAWGHEGKYSTTLFTQRAHKILKSHDPADRPLFLLL 231

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +  AVH           G LQ P  +     +  ++N DRR FA
Sbjct: 232 SLQAVH-----------GPLQPP--KSFVYLYRDMANVDRRKFA 262


>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 121/230 (52%), Gaps = 25/230 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+NDVGF+G  +IPTP ID++A NG+     YT    C PSRA F+TG+Y  R+G +   
Sbjct: 32  GYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNP 91

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
              +     A+P +E  + + L ++GY   +IGKWH+G  +  L P  RGFD   G+  G
Sbjct: 92  QWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGA-EPSLRPNKRGFDEFFGHLGG 150

Query: 139 ---YLTYNDSI-HETDFAVGLDARRN--MERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
              ++  +  I H  +    LD+ R+       P  ++KYLT+ F+D++V  IK  NH +
Sbjct: 151 GHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFIK-RNHQK 209

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           P FL +++ A H             L +   E+    F HI +P R+ +A
Sbjct: 210 PFFLFLSYNAPH-------------LPLQATEKYLARFPHIKDPKRKTYA 246


>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
          Length = 560

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 112/224 (50%), Gaps = 17/224 (7%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ DVG+HG ++I TP +D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+  + 
Sbjct: 48  QGFRDVGYHG-SEIKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSI 106

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           + A     +P+    LP  LK+ GY+TH++GKWH+G  K   LP  RGFD   G   G  
Sbjct: 107 IRATQPNCLPLENVTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSG 166

Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQI 198
             Y+    E     G D     E    Q    Y T  FT +++ ++  H+ H +PLFL +
Sbjct: 167 DHYSHYKCEAPGMCGYDLYEGEEAAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLFLYL 226

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + AVH+            LQVP        +  ISN  RR +A
Sbjct: 227 AYQAVHSP-----------LQVP--SRYLERYKGISNVHRRKYA 257


>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
          Length = 370

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/184 (40%), Positives = 98/184 (53%), Gaps = 5/184 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGF    DI TPNID LA  G+V+   Y+   CTPSR A +TG+YP++ G+   V 
Sbjct: 22  GYNDVGFRNP-DIITPNIDKLARKGVVMTNSYSTHVCTPSRHALMTGRYPYKTGMQNFVI 80

Query: 83  AGVAKAVPVTE-KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G A      E K LPQYLK LGY+TH +GKWH+G  ++E LP  RGFD+  G   G   
Sbjct: 81  PGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDECLPTERGFDSFYGLLLGGGG 140

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y +  +    A      ++++  A    S+   D   D+   V  SHN   P+FL     
Sbjct: 141 YWNHTYTLFGAYDWFNNKDLDLSANGTHSQ---DLMVDRLSAVFASHNREEPMFLYFAPQ 197

Query: 202 AVHT 205
             HT
Sbjct: 198 NPHT 201


>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
 gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
          Length = 473

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 100/183 (54%), Gaps = 10/183 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ DVG+ G + I TPNID LA  G+ L   Y  P C+P+RAA +TGK P  +GID P+ 
Sbjct: 54  GYGDVGYLG-SQIQTPNIDNLASQGVTLKHGYAYPICSPTRAALMTGKNPLNFGIDGPME 112

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 +P     +P+  +E GY T ++GKWH+G  K   +P NRGFD+  G+  G++ Y
Sbjct: 113 NDA--MLPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSAMPHNRGFDDFYGFLGGFVDY 170

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
              +    +  GLD + N      +    ++T+  T +++  I      +P F+ ++++A
Sbjct: 171 YTHV----YFGGLDWQNNDTSLREE---GFVTELLTAKAIDKITHFKGDKPFFMYLSYSA 223

Query: 203 VHT 205
            HT
Sbjct: 224 PHT 226


>gi|313225802|emb|CBY07276.1| unnamed protein product [Oikopleura dioica]
          Length = 207

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 62/130 (47%), Positives = 82/130 (63%), Gaps = 4/130 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+ND+G++      TPNI+ LA NGI+L+ HY+ P C+PSR+ FLTG+Y FRYG+    +
Sbjct: 6   GYNDIGYNSVEAF-TPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRYSFRYGMQHRNI 64

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+TEK+LP+  KE GYST   GKWH G   E  LP +RGFD  VG ++G  +
Sbjct: 65  LPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFDKFVGSYSG--S 122

Query: 142 YNDSIHETDF 151
              S HE  F
Sbjct: 123 SQHSTHEKCF 132


>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
          Length = 617

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 62/130 (47%), Positives = 82/130 (63%), Gaps = 4/130 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+ND+G++      TPNI+ LA NGI+L+ HY+ P C+PSR+ FLTG+Y FRYG+    +
Sbjct: 169 GYNDIGYNSIEAF-TPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRYSFRYGMQHRNI 227

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  VP+TEK+LP+  KE GYST   GKWH G   E  LP +RGFD  VG ++G  +
Sbjct: 228 LPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFDKFVGSYSG--S 285

Query: 142 YNDSIHETDF 151
              S HE  F
Sbjct: 286 SQHSTHEKCF 295


>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
 gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
          Length = 494

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 116/221 (52%), Gaps = 21/221 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVG+H   D+ TP +D LA+ G++LN+ Y    CTPSR AF+TG +P+  G    V 
Sbjct: 34  GWNDVGWHNP-DVRTPVLDQLAHEGVILNQSYVNYVCTPSRTAFMTGYFPYHVGSQHLVF 92

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                 A+      LP+ LK LGY+TH++GKWH+G    +  P  RGFD++ GY++G   
Sbjct: 93  RPDQPSAILSNFTFLPEKLKSLGYATHMVGKWHLGFCNWKFTPTFRGFDSYYGYYSGAED 152

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y T+  SI       G+D   N +    Q +  Y    F+ ++  ++  H+ + PLFL +
Sbjct: 153 YFTHFRSIRNG--TGGIDFHDNKDVVTDQ-NGTYSAYLFSQRAADIVNKHDPNTPLFLYL 209

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
               VH             L+VP   E+   +A++ + +RR
Sbjct: 210 PFQNVHAP-----------LEVPKRFED--MYANVQDENRR 237


>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 103/191 (53%), Gaps = 17/191 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ DVG+   + I TPNID LA  G+ L RHY  P+C PSRA  + G+Y    G      
Sbjct: 23  GYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACLMMGRYQIHTGYRDECM 82

Query: 83  AGVAKAVPVTEKL--LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                   +   +  LP  +K+ GY TH+IGKWH+G N  + LP  +GFD + GY     
Sbjct: 83  NDTRYQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCLPNAKGFDTYFGY----- 137

Query: 141 TYNDSIHETDFAVGLDARRN---MERYAPQMSSKYL----TDFFTDQSVHVIKSHNHSRP 193
              ++  E  +   L  R+N   + R    ++ KY+    T  FT+++V++I++H+ S+P
Sbjct: 138 ---NAAAEDYYTHMLSGRQNCSDLWRDRMDVADKYIGQYSTRIFTEEAVNIIENHDISQP 194

Query: 194 LFLQITHAAVH 204
           +F+ + H AVH
Sbjct: 195 MFMYLAHQAVH 205


>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
          Length = 328

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/191 (35%), Positives = 98/191 (51%), Gaps = 9/191 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           G+ND+GF    D+ TPN+D LA  G++L  +Y    CTPSR A +TG+YP+R  +    +
Sbjct: 12  GYNDIGFRNP-DVQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRYPYRSAMQNFVI 70

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               AK   +  K LPQYLKELGY  HLIGKW++G  +EE LP +RGFD+  G  +G   
Sbjct: 71  NPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFDSFFGLLDGAGD 130

Query: 142 YNDSIHETDFAVGLDARRNM-------ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           Y +      +     +   M       ++ +  M      D   D+   +   H++ +PL
Sbjct: 131 YWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVICFQDLELDRLDKIFTEHDNKQPL 190

Query: 195 FLQITHAAVHT 205
           FL       HT
Sbjct: 191 FLYFAPQNPHT 201


>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 524

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 80/249 (32%), Positives = 128/249 (51%), Gaps = 30/249 (12%)

Query: 6   GAGVAKAVPVTEKLLPQ--GWNDVGFHGEND---IPTPNIDALAYNGIVLNRHYTLPTCT 60
           G+  A+ +P    +L    G+NDVG+HG +    I TPN+D LA  G+ L  +Y  P C+
Sbjct: 21  GSSYAEQLPNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGEGVKLENYYVQPICS 80

Query: 61  PSRAAFLTGKYPFRYGIDTPVGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHI 115
           P+R+  ++G    RY I T +  GV +      +P+ E  LPQ LKE GY+T+++GKWHI
Sbjct: 81  PTRSQLMSG----RYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGYATNMVGKWHI 136

Query: 116 GCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP--QMSSKYL 173
           G   +  LP  RGFD++   W  + +                        P  Q   +Y 
Sbjct: 137 GFYLDACLPTERGFDSYFA-WEDHFSCLPXXXXXXXXXXXXXXXXXANKTPVFQYEGQYS 195

Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
           T  FT++++ VI+ H+ ++PLF+ + + AVH         P   L+VPD   +   + +I
Sbjct: 196 THLFTNKTIDVIERHDKTKPLFIYLAYQAVH--------FP---LEVPDSYMD--PYMNI 242

Query: 234 SNPDRRLFA 242
           ++ +RR +A
Sbjct: 243 TDKNRRTYA 251


>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 434

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/190 (38%), Positives = 107/190 (56%), Gaps = 14/190 (7%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW DVGF+G  DIPTPN+D LA  G++ +  Y + P C+PSRA  LTG+Y  R+G D  
Sbjct: 7   QGWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCN 66

Query: 81  V---GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
           +   G   A    P++EK++ + LKE GY T  IGKWH+G +  +L P  +GFD+  G+ 
Sbjct: 67  MPYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLG-DHPDLYPPAQGFDHWFGFP 125

Query: 137 NGYLTY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            G + Y  +S +E          RN  +  P+    YLTD FT +++  I +    +P F
Sbjct: 126 GGGMNYWGESKNEIQTIY-----RN-RKVVPEEELTYLTDDFTTEAIRFI-TQKDEKPFF 178

Query: 196 LQITHAAVHT 205
           + + + A H 
Sbjct: 179 MYLAYNAPHA 188


>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/178 (37%), Positives = 100/178 (56%), Gaps = 4/178 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+H   ++  P ++ LA +G++ N+ Y  PTCTP+RAA ++G YPF+ G    + 
Sbjct: 42  GWNDVGWHNP-EVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSGYYPFKTGNQHQLL 100

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             +    +P+  K LPQ LK++GY TH++GKWH+G  KE  LP NRGFD+  G      +
Sbjct: 101 LNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRGFDSFYGGLTLGTS 160

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           +   ++      G D   N     PQ ++ YL     D++V +I  H    PLF+  +
Sbjct: 161 HFSKMNGILSTPGYDFYDN-SGVVPQ-TNDYLAFMLADRAVKIINGHYQEYPLFMYFS 216


>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 462

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 119/221 (53%), Gaps = 25/221 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVGF+G  DIPTP ID++A NG+  +  YT    C PSRA F+TG+Y  R+G +   
Sbjct: 34  GYADVGFNGCKDIPTPGIDSIANNGVKFSSGYTSYSVCGPSRAGFITGRYQQRFGFERNP 93

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
              +     A+P +E  + + L+++GY   +IGKWH+G  +  L P  RGF+   G+  G
Sbjct: 94  QWNLTDPNSALPKSEMTIAESLQQVGYHCGIIGKWHLGA-EPSLRPNQRGFNEFFGHLGG 152

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYA--------PQMSSKYLTDFFTDQSVHVIKS 187
              Y      I +T+     D +  M+ YA        P  ++KYLTD F+D+++  ++ 
Sbjct: 153 GHAYFPEKLRIIKTE-----DVKNEMDSYASYITRNDTPVKTTKYLTDEFSDEAIRFVEK 207

Query: 188 HNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDR 228
            N+ +P FL +++ A HT      K    L + P +E+ +R
Sbjct: 208 -NYEQPFFLFLSYNAPHTPLQATQKY---LDRFPHIEDQNR 244


>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 588

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 67/190 (35%), Positives = 107/190 (56%), Gaps = 7/190 (3%)

Query: 23  GWNDVGFH---GENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID- 78
           G+NDVG+H   G + I TPNID +AY+G+ L  +Y  P CTP+R+  +TG+Y    G+  
Sbjct: 105 GYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLITGRYQIHTGMQH 164

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +  G    +P+ E  L Q LK+ GYSTH +GKWH+G   ++ LP  RGF++  G   G
Sbjct: 165 LNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRRGFESFFGNIMG 224

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              + ++N +    D  V   +    ER   +    + T  +T+++  +I+    ++PLF
Sbjct: 225 SADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNKPLF 284

Query: 196 LQITHAAVHT 205
           L +++ AVHT
Sbjct: 285 LYLSYEAVHT 294


>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
 gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
          Length = 622

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 108/202 (53%), Gaps = 5/202 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GW DVG++ +    TP +D L  NG    + Y+   C+PSRA  LTG+Y FR G+ + P+
Sbjct: 40  GWADVGWNNKGLESTPFMDKLVKNGTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPI 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
              V   +   +K LP+YLKE+GY TH +GKWH+G      LP +RGFD   G+++G + 
Sbjct: 100 AREVPFGMNTQDKTLPEYLKEVGYDTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVD 159

Query: 142 YNDSIHETDFAVGLDARRN-MERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSR-PLFLQ 197
           Y     +       D   N +E++   + S  ++ TD F D+++ ++K    S+ P ++ 
Sbjct: 160 YRGHFIKRSKNFYHDFFDNTIEQHKLDLESDGQWTTDLFRDRTIDILKEAKRSKTPAYVY 219

Query: 198 ITHAAVHTGTAGNAKLPTGLLQ 219
           +   A H  T   A L   +L+
Sbjct: 220 LAFNAPHEPTRAPADLIARILE 241


>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
 gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
          Length = 157

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 55/117 (47%), Positives = 75/117 (64%), Gaps = 1/117 (0%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DVG+H   D+ TPNID LA  G+VL  +Y  P CTP+R  FL+G+YP   G+  + +
Sbjct: 33  GWSDVGYHNITDLKTPNIDRLAGEGVVLENYYVQPICTPARGTFLSGRYPIHTGLQHSNI 92

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                  +P+   LLPQ LK+ GYSTH +GKWH+G  ++E  P  RGFD   GY++G
Sbjct: 93  HETEPFGLPLDFTLLPQKLKKAGYSTHAVGKWHLGFFEKEYTPLYRGFDTFFGYYSG 149


>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 732

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/188 (36%), Positives = 103/188 (54%), Gaps = 7/188 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H    I TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 59  QGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLITGRYQIHTGLQHSI 117

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P     LP+ L+E GY+TH++GKWH+G  ++  LP  +GFD   G   G +
Sbjct: 118 IRPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRKACLPTRKGFDTFFGSLTGSV 177

Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQ 197
            Y   +S    D   G D   + E  A     KY T  FT ++  +++SH+ + RPLFL 
Sbjct: 178 DYYSYESCDGKDLC-GYDLHDD-EGVAWGQEGKYSTTLFTQRARKILESHDPAERPLFLL 235

Query: 198 ITHAAVHT 205
           ++  AVHT
Sbjct: 236 LSFQAVHT 243


>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
 gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
          Length = 455

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/184 (36%), Positives = 99/184 (53%), Gaps = 9/184 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G  DV + G + I TP +DALA +G  L + Y  P C+P+R+A LTG+YP RYG+   V 
Sbjct: 40  GGADVSWRG-SPIKTPQLDALANSGAKLEQFYVQPVCSPTRSALLTGRYPMRYGLQVGVV 98

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
              A   +P+ E+ L + L++ GY T ++GKWH+G      LP  RGFD+  G++NG L 
Sbjct: 99  RPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAYLPMARGFDHQYGHYNGALD 158

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    H+ D         ++ R        Y T     ++V VI+  +  +PLFL +   
Sbjct: 159 Y--FTHDRDGGHDWHKDDHVNR-----DEGYATHLIAQEAVRVIQDRDKKKPLFLYVPFN 211

Query: 202 AVHT 205
           AVH+
Sbjct: 212 AVHS 215


>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 574

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/187 (36%), Positives = 102/187 (54%), Gaps = 5/187 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H    I TP +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 56  QGFNDIGYHNPT-IKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTGRYQIHTGLQHSI 114

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +P     LP+ L++ GYSTHL+GKWH+G  ++  LP  +GFD   G   G +
Sbjct: 115 IRPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKGFDTFFGSLTGSV 174

Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQI 198
             YN    +     G D   + E  A     KY T  FT ++  +++SHN + +PLFL +
Sbjct: 175 DHYNYLSCDGPGVCGYDL-HDGEGVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFLLL 233

Query: 199 THAAVHT 205
           +  AVHT
Sbjct: 234 SLQAVHT 240


>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 454

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 114/229 (49%), Gaps = 30/229 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVG+HG  +IPTPNID +A  G+  +  Y+  + C P+RAA ++G Y  R G +   
Sbjct: 31  GYADVGYHGLEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEGIC 90

Query: 82  GAG-----VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK---EELLPFNRGFDNHV 133
           G       V   +P   K L QY +E GY+T L GKWH+G  +   + L+P +RGFD   
Sbjct: 91  GGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKTLMPTSRGFDEFF 150

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           G   G   Y+D+++     +  D   + E        +Y TD    ++V  I +    +P
Sbjct: 151 GILEGASLYDDTVNRERKYIRQDTVIDYE-------GEYFTDAIGREAVSFI-TRKGDKP 202

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            FL +   AVH     +             E+  + FAHI++P+RR+FA
Sbjct: 203 FFLYLPFTAVHAPMQAS-------------EKYMQRFAHIADPNRRVFA 238


>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 453

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 73/187 (39%), Positives = 101/187 (54%), Gaps = 15/187 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+GF G  DIPTP ID LA +G+  +  Y + P C+P+RA  LTG+Y  R+G +   
Sbjct: 44  GYGDLGFQGGRDIPTPRIDGLARSGVTCSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 103

Query: 82  G--AGVAK--AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           G  A V +   + + E+ LPQ LK+ GY+T ++GKWH+G    +  P  RGFD   G+  
Sbjct: 104 GNAARVTETFGLSLEERTLPQRLKQAGYATGIVGKWHLGF-APQFQPLERGFDEFFGFLG 162

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           G   Y    +  D       RR  E     + S+YLTD F  +SV  I   N +RP FL 
Sbjct: 163 GAHPYFPDANSND-----PIRRGREAV---VESEYLTDAFARESVAYI-DRNKNRPFFLY 213

Query: 198 ITHAAVH 204
           +   AVH
Sbjct: 214 LAFNAVH 220


>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 517

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/226 (29%), Positives = 118/226 (52%), Gaps = 22/226 (9%)

Query: 23  GWNDVGFHG---ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           G+ND+G+H     +D+ TP +D+LA  G+ L  +Y  P C+PSR+  ++G+Y    G+  
Sbjct: 35  GFNDIGYHAVEHHSDMKTPFLDSLAMAGVRLENYYIQPICSPSRSVLMSGRYQIHTGLQH 94

Query: 80  PVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            V +   +  +P+   +LP+ L + GY TH++GKWH+G  K+E LP+ RGF+++ GY  G
Sbjct: 95  YVISPQQRNGLPLDNIILPEQLHKCGYDTHMVGKWHLGFYKDEYLPWKRGFNSYFGYLTG 154

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFL 196
              Y           G D         P  ++  +Y  + F +++   I  H+ ++PLFL
Sbjct: 155 GEDYYTKWRCDGKLCGYDMTSEK---GPTNATYGQYSANLFANKANEAIDKHDKTKPLFL 211

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +   +VH+            ++VP  E   + F +I N +R+++ 
Sbjct: 212 YVAFQSVHSP-----------MEVP--ESYAKPFDYIKNHNRKMYG 244


>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
 gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
          Length = 452

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 120/218 (55%), Gaps = 22/218 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
           G++DV +  ++ + TPN+  LA+  N  +L+  Y    CTP+R+AF+TG YPFR G    
Sbjct: 6   GFSDVDWK-DSTLHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 64

Query: 81  VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
           V   +  A VP     L + +++L YST+L+GKWH+G  K+E LP NRGFD   G++   
Sbjct: 65  VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 124

Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
            GY  ++ D  H     V  GLD    +   +  P  S    Y TD FTD ++ V+ +HN
Sbjct: 125 TGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 184

Query: 190 HSRPLFLQITHAAVH--------TGTAGNAKLPTGLLQ 219
           +S+P F+ +++ AVH        + T G  K  T +L+
Sbjct: 185 NSKPFFMFLSYQAVHPPLQVSQQSKTIGQGKEATFILR 222


>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
 gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
          Length = 488

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 120/218 (55%), Gaps = 22/218 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
           G++DV +  ++ + TPN+  LA+  N  +L+  Y    CTP+R+AF+TG YPFR G    
Sbjct: 42  GFSDVDWK-DSTLHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100

Query: 81  VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
           V   +  A VP     L + +++L YST+L+GKWH+G  K+E LP NRGFD   G++   
Sbjct: 101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160

Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
            GY  ++ D  H     V  GLD    +   +  P  S    Y TD FTD ++ V+ +HN
Sbjct: 161 TGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 220

Query: 190 HSRPLFLQITHAAVH--------TGTAGNAKLPTGLLQ 219
           +S+P F+ +++ AVH        + T G  K  T +L+
Sbjct: 221 NSKPFFMFLSYQAVHPPLQVSQQSKTIGQGKEATFILR 258


>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 500

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 116/216 (53%), Gaps = 10/216 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG++      TP ID LA +G+ L  +Y    C PSR   +TG++  + GI     
Sbjct: 34  GWNDVGYNNPV-FKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQHDDY 92

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               +++P+ E  + + LK +GYSTH++GKWH G   +  LP NRGFD   G+    + +
Sbjct: 93  GFHPRSLPLNETTIAEPLKHVGYSTHIVGKWHCGFYSDNCLPHNRGFDTFFGFVGAGIDH 152

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
               H   F    + R+N +  A +   KY T  F ++   +I +H+ ++PLFL ++ +A
Sbjct: 153 --YTHSDHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSA 210

Query: 203 VHTGTAGNAKLPTGLLQVPDM---EENDRTFAHISN 235
           VH       ++P+  L+  +    +E+ RT+A +++
Sbjct: 211 VH----APLEVPSSYLKQYESTIHDEDRRTYAAMTS 242


>gi|299473382|emb|CBN77780.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 623

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 83/122 (68%), Gaps = 2/122 (1%)

Query: 23  GWNDVGFHGEN-DIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           GWND+G+   +    TPN++ +A +G+ L+++Y++  CTP+RAA +TG+YP RYG    V
Sbjct: 6   GWNDIGYQSTDMHAVTPNLNRIAESGVKLSQYYSMSICTPARAALMTGRYPVRYGFQYKV 65

Query: 82  -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
              G    +P+TEKL PQ++ + GY++H++GKWH+G +  + +P  RGF+ ++GY  G  
Sbjct: 66  INVGAPWGLPLTEKLFPQFMNDAGYTSHMVGKWHLGSHTFDHMPHLRGFETYLGYTQGRE 125

Query: 141 TY 142
           TY
Sbjct: 126 TY 127


>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 355

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 70/216 (32%), Positives = 115/216 (53%), Gaps = 10/216 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG++      TP ID LA NG+ L  +Y    C PSR   +TG++  + GI     
Sbjct: 34  GWNDVGYNNP-VFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPQDGF 92

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               +++P+ E  + + LK  GYSTH++GKWH G   +  LP NRGFD   G+    + +
Sbjct: 93  GYHPRSLPLDETTIAEPLKHAGYSTHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGIDH 152

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
               H   F    + R+N +  A +   KY T  F ++   +I +H+ ++PLFL ++ +A
Sbjct: 153 --YTHSDHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSA 210

Query: 203 VHTGTAGNAKLPTGLLQVPDM---EENDRTFAHISN 235
           VH       ++P+  L+  +    +E+ RT+A +++
Sbjct: 211 VHAPL----EVPSSYLKQYESTIHDEDRRTYAAMTS 242


>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 465

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 111/225 (49%), Gaps = 25/225 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG---ID 78
           G+ D GF G   + TPN+D LA +G+   + Y + PTC PSRA  +TGKY  R+G   I+
Sbjct: 34  GYMDFGFQGSKVMKTPNLDKLAKSGVTFTQGYVSDPTCGPSRAGMMTGKYQARFGYEEIN 93

Query: 79  TP-------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
            P          G    +P+ +KL+  YLK+LGY T + GKWH+G N +   P NRGFD 
Sbjct: 94  VPGYMSSHSALKGDEMGLPLDQKLMSNYLKDLGYKTAVYGKWHLG-NADRFHPLNRGFDE 152

Query: 132 HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS--SKYLTDFFTDQSVHVIKSHN 189
             G+  G  +Y    + T         R ME    Q    + Y TD F +++VH I+  N
Sbjct: 153 FYGFRGGARSYFAYKNPT-------GDRKMETNFGQYEEPNHYATDVFAEKAVHFIE-RN 204

Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
              P F+ ++  AVHT           L Q P++    +  A ++
Sbjct: 205 KEHPFFIYLSFNAVHTPMEATE---ADLAQFPNLTGKRQQLAAMT 246


>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 571

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 70/205 (34%), Positives = 107/205 (52%), Gaps = 19/205 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ D+G+HG +DI TP +D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+  + 
Sbjct: 56  QGYGDIGYHG-SDIHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSI 114

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P     LP+ L E GY+TH++GKWH+G  +   LP  RGF + +G   G  
Sbjct: 115 IRPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGRGFQSFLGTLTGSG 174

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            + +Y     +   A G D   + +R A +M+  Y T  + D+   ++K H+   PLFL 
Sbjct: 175 DHFSYQSC--DGAEACGFDL-HDGDRPAWEMAGNYSTLLYIDRVKQILKRHDPHTPLFLY 231

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPD 222
           ++  A HT            LQVPD
Sbjct: 232 LSLQAAHTP-----------LQVPD 245


>gi|341889947|gb|EGT45882.1| CBN-SUL-3 protein [Caenorhabditis brenneri]
          Length = 432

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 110/195 (56%), Gaps = 14/195 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
           G+ND+ +  ++ + TPN+  LA+  N  +L   Y    CTP+R+AF+TG YPFR G    
Sbjct: 44  GFNDLDWK-DSTLHTPNLRNLAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQAG 102

Query: 81  VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
           V   +  A VP     L + +++L YST+L+GKWH+G  K+E LP NRGFD   G++   
Sbjct: 103 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 162

Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
            GY  ++ D  H     V  GLD    +   +  P  S    Y TD FTD ++ VI +HN
Sbjct: 163 TGYFNHSADQYHRELRRVVKGLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVIDNHN 222

Query: 190 HSRPLFLQITHAAVH 204
            ++P F+ +++ AVH
Sbjct: 223 TTKPFFMFLSYQAVH 237


>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
 gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
          Length = 458

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 70/185 (37%), Positives = 99/185 (53%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP ++ALA  G+VL+R YT P  TP+RA  +TG+YP R+GI T V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLNALAAEGVVLDRFYTAPISTPTRAGLMTGRYPNRFGIRTTVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ L   L   GYS   +IGKWH+G  ++   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYPINRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D + E +    LD   + E         Y T+  T ++V  I ++    P  L + +
Sbjct: 156 DYFDHMREGE----LDWHNDWETC---YDKGYSTELITQEAVRCINTYEKEGPFLLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
          Length = 370

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 68/192 (35%), Positives = 100/192 (52%), Gaps = 4/192 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+GF    D+ TPNIDALA  G++   +Y    CTPSR A LTG+YP R  +   V 
Sbjct: 10  GYHDLGFRNP-DVITPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRSAMQNLVI 68

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +  A+   +  K LP+YLK+LGYSTH +GKWH+G  +EE LP +RGFD+  G ++G   
Sbjct: 69  MSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFGLYDGDGY 128

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y +  H +    G     N      +    +  D   ++   ++   N   P FL  +  
Sbjct: 129 YWN--HTSTVIPGAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNAKEPFFLYFSPQ 186

Query: 202 AVHTGTAGNAKL 213
             HT +   A+ 
Sbjct: 187 NPHTPSQPQAEF 198


>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
          Length = 318

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 67/175 (38%), Positives = 96/175 (54%), Gaps = 7/175 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G     D+ TPN+DALA  G++L  +Y    C+PSR A +TG+YP++  + + V 
Sbjct: 14  GYHDIGLRNP-DVITPNLDALASKGVILTNNYVQALCSPSRHALMTGRYPYKSAMQSFVV 72

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
               AK   +  KLLPQYLKELGY  HLIGKWH+G  +EE LP +RGFD+  G  +G   
Sbjct: 73  LPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRGFDSFYGLLDGAGD 132

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           Y +      +   L+     E Y       +  D   D+   +   H++  PLFL
Sbjct: 133 YWEHTTSGVYDWHLNDEVFHEAYG-----NHSQDLELDRLDKLFAEHDNKDPLFL 182


>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 441

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 72/185 (38%), Positives = 104/185 (56%), Gaps = 16/185 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVGFHG  DIPTP+ID+LA +G   +  Y + P C+P+RA  LTG+Y  R+G +   
Sbjct: 40  GYADVGFHGGKDIPTPHIDSLAASGTRFSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G G  K +P+TE  +   L+  GY+T L+GKWH+G +  +  P  RGF    G+  G+ T
Sbjct: 100 G-GANKGLPLTETTIADRLQAAGYATGLVGKWHLGTDP-KFHPLKRGFGEFFGFLAGHHT 157

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSS-KYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y D   E D          ++R   +++   YLTD F  ++V  I+ H  + P FL +  
Sbjct: 158 YFDK-QEAD----------IQRGTTKVTEPGYLTDAFGREAVSFIERH-QNHPFFLYLAF 205

Query: 201 AAVHT 205
            AVHT
Sbjct: 206 NAVHT 210


>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
 gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
          Length = 333

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 98/190 (51%), Gaps = 8/190 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G     D+ TPN+DALA  G++L  +Y    CTPSR A +TG+YP    + T V 
Sbjct: 14  GYHDIGLRNP-DVITPNLDALASKGVILTNNYVQALCTPSRHALMTGRYPSASAMQTSVI 72

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             + AK   +  KLLPQYLK+LGY  H++GKWH+G  ++E LP +RGFD   G + G   
Sbjct: 73  LPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGTGD 132

Query: 142 Y--NDSIHETDFAVGLDARRNMERYAPQMSSKY----LTDFFTDQSVHVIKSHNHSRPLF 195
           Y  +    + D+    D          Q+ S Y    L D   ++   V   H+   PLF
Sbjct: 133 YWSHTFFGKYDWHTNADIDFEANSTHSQVRSSYMNFVLQDLEMERLDKVFDEHDSKDPLF 192

Query: 196 LQITHAAVHT 205
           L       HT
Sbjct: 193 LYFAPQNPHT 202


>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
          Length = 330

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 73/194 (37%), Positives = 104/194 (53%), Gaps = 18/194 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ND+G   + D+ TPN+DALA  G++L  +Y    C+PSR A +TG+YP    + + V 
Sbjct: 14  GYNDLGLR-DPDVITPNMDALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVI 72

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------- 134
             + AK   +  K LPQYLK+LGY  H+IGKWH+G  +EE LP +RGFD   G       
Sbjct: 73  QPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGD 132

Query: 135 YW-NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKY--LTDFFTDQSVHVIKSHNHS 191
           YW +G +   D    T+  V  +AR        Q+ S+Y  + D   ++   V   H++ 
Sbjct: 133 YWEHGIMGMYD--WHTEAGVDFEARGT----HAQVGSRYWHIYDLEMERLDKVFDEHDNK 186

Query: 192 RPLFLQITHAAVHT 205
            PLFL       HT
Sbjct: 187 DPLFLYFAPQNSHT 200


>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 568

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 67/189 (35%), Positives = 104/189 (55%), Gaps = 9/189 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ND+G+H   +I +P +D LA  G+ L  +Y  P CTPSR+  +TG+Y    G+  + 
Sbjct: 54  QGFNDIGYH-SGEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLITGRYQIHTGLQHSI 112

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+    LPQ L+E+GYSTH++GKWH+G  +++ LP  RGF  + G   G  
Sbjct: 113 IRPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTRRGFHTYFGSLTGSV 172

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
            Y TY     +     G D     E  A   + KY T  +T +   ++ +H+  S+PLF+
Sbjct: 173 DYYTYGSC--DGKSLCGFDLHEG-ESVAWGRAGKYSTHLYTQRVRKILATHDPTSQPLFI 229

Query: 197 QITHAAVHT 205
            ++  AVHT
Sbjct: 230 FLSLQAVHT 238


>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
 gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
          Length = 500

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 73/195 (37%), Positives = 110/195 (56%), Gaps = 14/195 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
           G+ND+ +  ++ + TPN+  LA+  N  +L   Y    CTP+R+AF+TG YPFR G    
Sbjct: 42  GFNDLDWK-DSTLHTPNLRNLAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100

Query: 81  VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
           V   +  A VP     L + +++L YST+L+GKWH+G  K+E LP NRGFD   G++   
Sbjct: 101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160

Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
            GY  ++ D  H     V  GLD    +   +  P  S    Y TD FTD ++ V+ +HN
Sbjct: 161 TGYFNHSADQYHRELKRVVKGLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 220

Query: 190 HSRPLFLQITHAAVH 204
            ++P F+ +++ AVH
Sbjct: 221 TTKPFFMFLSYQAVH 235


>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 706

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 51/117 (43%), Positives = 77/117 (65%), Gaps = 2/117 (1%)

Query: 23  GWNDVGFHGEN-DIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           GWND+G+   +    TP++D LA  G+ +  +YT+  CTP+RA+ +TG+Y  RYG+  + 
Sbjct: 142 GWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARASLMTGRYVMRYGLQYSV 201

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           +  G    +P+TEK+ P+Y+K+ GY TH+IGKWHIG      +P  RGFD ++GY N
Sbjct: 202 IQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIPSQRGFDTYLGYLN 258


>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 471

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 70/191 (36%), Positives = 108/191 (56%), Gaps = 12/191 (6%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW+DVGF+G  +IPTP++DALA +G+  +  Y + P C+PSRA  LTG+Y  R+G +  
Sbjct: 38  QGWSDVGFNGCKEIPTPHLDALAKSGVAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECN 97

Query: 81  VGAG------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            GA         + +P++E LL    +  GY T  IGKWH+G ++ +  P  RGF+   G
Sbjct: 98  PGAHGNDDAIEMEGLPLSETLLSTVFRNAGYRTGAIGKWHLG-DEPQFWPTERGFEEWFG 156

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           +  G L+Y   + +     G+   RN +   P+    YLTD F+ ++V  ++  N +RP 
Sbjct: 157 FSGGGLSYWGDLGKKPPLHGV--LRNGD-VVPKDELTYLTDDFSTEAVKFVE-ENRARPF 212

Query: 195 FLQITHAAVHT 205
           FL + + A H 
Sbjct: 213 FLYLAYNAPHA 223


>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 495

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 114/216 (52%), Gaps = 10/216 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG++      TP ID LA NG+ L  +Y    C PSR   +TG++  + GI     
Sbjct: 34  GWNDVGYNNPV-FKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPNDGF 92

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               +++P+ E  + + LK  GYS H++GKWH G   +  LP NRGFD   G+    + +
Sbjct: 93  GYHPRSLPLDETTIAEPLKHAGYSNHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGIDH 152

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
               H   F    + R+N +  A +   KY T  F ++   +I +H+ ++PLFL ++ +A
Sbjct: 153 --YTHSDHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSA 210

Query: 203 VHTGTAGNAKLPTGLLQVPDM---EENDRTFAHISN 235
           VH       ++P+  L+  +    +E+ RT+A +++
Sbjct: 211 VHAPL----EVPSSYLKQYESTIHDEDRRTYAAMTS 242


>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
 gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
          Length = 496

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 73/188 (38%), Positives = 102/188 (54%), Gaps = 16/188 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ D+GF G +DI TPN+D LA  GIVLNR Y+LP CTP+R+A +T + P + G      
Sbjct: 34  GYADLGFRG-SDIQTPNLDRLAAEGIVLNRFYSLPICTPTRSALMTARDPIKLGT---AY 89

Query: 83  AGVA----KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           AG+       V   E  +P+  K+ GY T +IGKWHIG   E L+P +RGFD+  G+ N 
Sbjct: 90  AGLQPWENGGVSPDEHFMPESFKKAGYQTAMIGKWHIGRQYESLVPHHRGFDHFFGHLNT 149

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRPLFLQ 197
            + Y    H +  A G D + N +         Y TD   D+SV  +K   + S+P  L 
Sbjct: 150 QVDY--YTHAS--AGGHDLQENGKSLK---RDAYATDIHGDESVRYLKEIRDPSKPFLLY 202

Query: 198 ITHAAVHT 205
           +   A H+
Sbjct: 203 VPFLAPHS 210


>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 468

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/226 (34%), Positives = 114/226 (50%), Gaps = 29/226 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
           G++D+G HG  DIPTP++DALA +G+     Y + P C+P+RA  LTG+Y  R+G +   
Sbjct: 41  GYHDLGVHGCKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 100

Query: 79  --TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
             TP G      +P++E  L   LK++GY T ++GKWH+G N E+  P +RGFD   G+ 
Sbjct: 101 GPTPTG---EIGLPLSETTLADRLKKVGYKTGMVGKWHLG-NDEKRHPLSRGFDEFFGFL 156

Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            G  TY  +         L   R +         +YLTD F  ++V  I     S P FL
Sbjct: 157 GGARTYFATPGNASAGTKLLRGREVVD-----EKEYLTDAFAREAVAYIDRSKAS-PFFL 210

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +T  AVHT    + K              DR F  +S+P R+ + 
Sbjct: 211 YLTFNAVHTPMEASQKY------------LDR-FTAVSDPKRQKYC 243


>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
          Length = 502

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 98/162 (60%), Gaps = 11/162 (6%)

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +  G  + +P++ ++LP++L+ LGY+T +IGKWH+G +  +  P +RGFD+ +G++N ++
Sbjct: 8   IQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFLGFYNSHV 67

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           +Y D  +      G D  R  +  A  ++ KY+TD FTD++V +I++H+ SRPL+LQI+H
Sbjct: 68  SYYDYKYSYQNMSGYDMHRG-DAPAYGLTDKYVTDLFTDEAVRIIQTHDPSRPLYLQISH 126

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            AVH            L    D +  D+ F HI   +RR +A
Sbjct: 127 LAVH----------APLENPQDYDHYDKRFMHIVEQNRRKYA 158


>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 80/233 (34%), Positives = 117/233 (50%), Gaps = 31/233 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+ND+G+     + TPN+D LA  GI L+ +Y  P CTPSRA  ++GKY    G+  + +
Sbjct: 70  GFNDIGYRNPA-MRTPNLDYLAAEGIKLDNYYVQPICTPSRAQLMSGKYQIHTGLQHSII 128

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+    LPQ LKE GY+TH+ GKWH+G  K+E  P NRGFD+ +G     L 
Sbjct: 129 WPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKECWPTNRGFDSFLGI---LLG 185

Query: 142 YNDSIHETDFA-----------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
             D    T+              GLD R  ++      S  Y T    ++  ++I+ H+ 
Sbjct: 186 KGDHFLHTEEGGGGPYPSTWPWEGLDFRDGLQS-TNAYSGIYSTHVIAERVENIIEKHDK 244

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTF-AHISNPDRRLFA 242
            +PLFL ++  AVHT            LQVP  E   + F + I +  RR++A
Sbjct: 245 DKPLFLYVSFQAVHTP-----------LQVP--ESYLQPFESSIQDEKRRIYA 284


>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
          Length = 467

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/185 (38%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GW DVG+HG + IPTPNIDALA  GI +NR YT P  +P+RA  +TG+YP R+GI  T +
Sbjct: 43  GWGDVGYHG-SVIPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTGRYPSRFGIRKTVI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYS-THLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ L   L   GY+   ++GKWH+G  +    P NRGF +  G  NG L
Sbjct: 102 PPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNRGFTHFYGCLNGAL 161

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y     E +    LD   + E         Y TD   D++V  I  +    P FL +  
Sbjct: 162 DYFTHEREGE----LDWHNDWESC---RDEGYSTDLIADEAVRCIGGYASEGPFFLYVAF 214

Query: 201 AAVHT 205
            A HT
Sbjct: 215 NAPHT 219


>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 113/227 (49%), Gaps = 32/227 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+ D+GF G  DIPTP++DALA  G+     Y + P C+P+RA  LTG+Y  R+G +  P
Sbjct: 48  GYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 107

Query: 81  VGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
            G G A       +PVTE  L   LK  GY+T L+GKWH+G ++ +  P  RGFD   G+
Sbjct: 108 GGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLG-SEAKFHPQKRGFDEFFGF 166

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G  TY  S          D  R  E    +    YLTD F+ +++  I  H    P F
Sbjct: 167 LGGQHTYFASKSG-------DVYRGTEVVKEEA---YLTDAFSREALSFIDRHK-DHPFF 215

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           LQ++  AVHT              +   E+    F+ I +P RR +A
Sbjct: 216 LQLSFNAVHT-------------PMDATEDRVARFSSIEDPKRRTYA 249


>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
          Length = 775

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 97/165 (58%), Gaps = 9/165 (5%)

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P+     + +P+   LLP+YL+ LGY+THL+GKWH+G + +   P  RGFD  +GY+ G 
Sbjct: 305 PLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGFDTFLGYYTGM 364

Query: 140 LTY-NDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           + Y N +++E+   +G D  R + E +  +    Y+TD  TD+   +I SHN  +P++LQ
Sbjct: 365 IQYFNHTLYESG-QLGYDLHRIVGENHTVEYRYDYMTDLLTDEVESIISSHNTEKPMYLQ 423

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           ++H A H   A        +++V D +E + TF +I + +RR +A
Sbjct: 424 LSHLAPHASDAEE------VMEVRDWKETNDTFGYIKDLNRRKYA 462



 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 94/162 (58%), Gaps = 11/162 (6%)

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +  G  + +P+  K+LP++L+ LGY+T+LIGKWH+G +  +  P  RGFD   G++N ++
Sbjct: 6   IQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGFYNSHV 65

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           +Y+D  +      G D     +  A  ++ KY+TD FTD++V +I++H+  RPL+LQI+H
Sbjct: 66  SYHDYKYSYQNMSGYDMHCG-DAPAYGLNDKYVTDLFTDKAVKIIENHDSFRPLYLQISH 124

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            AVH            L    D + +DR F HI    RR +A
Sbjct: 125 LAVH----------APLENPQDYDHSDRRFIHIREQHRRKYA 156


>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 460

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 72/188 (38%), Positives = 106/188 (56%), Gaps = 12/188 (6%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW DVGF+G  DIPTPN+D +A  G++ +  Y + P C+PSRA  LTG+Y  R+G D  
Sbjct: 36  QGWADVGFNGATDIPTPNLDRIASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCN 95

Query: 81  V---GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
           +   G   A    P++EKL+ + LKE GY T  IGKWHIG +   L P  +GFD+  G+ 
Sbjct: 96  MPYEGENDATVGTPLSEKLISEALKEQGYRTSAIGKWHIG-DHPNLHPPAQGFDHWFGFP 154

Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            G + Y          +     RN +  A +  + YLTD FT+++++ I   + + P F+
Sbjct: 155 GGSMNYWGKATSKIQTI----YRNTKPVAEEELT-YLTDDFTNEAINFINKKDKN-PFFI 208

Query: 197 QITHAAVH 204
            + + A H
Sbjct: 209 YLAYNAPH 216


>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 64/183 (34%), Positives = 102/183 (55%), Gaps = 4/183 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G+   +D+ +PNID LA N + +  +Y  P+CTPSRAAF+TG+Y  RYG+ + V 
Sbjct: 35  GFDDLGYVN-DDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                +A+P++E LLPQ  K+ GY+T + GKWH+G   E+  P NRGFD   G++ G   
Sbjct: 94  KPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    H++     L      E+    ++  Y T    +  +  +  ++   PLF  ++  
Sbjct: 154 Y--FYHDSGNNCYLYEPNGTEKVRLDLNGTYSTKAIAEDFIAKLDEYDPETPLFEFLSFQ 211

Query: 202 AVH 204
            VH
Sbjct: 212 EVH 214


>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 474

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 73/222 (32%), Positives = 110/222 (49%), Gaps = 16/222 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
           G+ D GF G ++  TP++D LA   I  ++ Y +   C PSRA  LTGKY  R+G     
Sbjct: 47  GYADFGFQGSSEFKTPHLDQLASQSIRFSQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 106

Query: 77  ----IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
               +      G    +P+ +KLLP+YLKE GY T L GKWH+G N ++  P  RGFD  
Sbjct: 107 VPGYMSASATTGDEMGLPLDQKLLPEYLKEQGYKTALFGKWHMG-NADKFHPTKRGFDTF 165

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
            G+  G  +Y +  +E +     + R        + S  YLTD   + +   I+  N  +
Sbjct: 166 YGFRGGARSYYE-FNENNKNNRQEDRLERGFGNFEESKLYLTDALAEATTDFIEK-NQKQ 223

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
           P F+ ++  AVHT        P  L Q P+++   +T A ++
Sbjct: 224 PFFVYLSFNAVHTPMEAR---PDDLKQFPNLKGKRKTLAAMT 262


>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 103/183 (56%), Gaps = 4/183 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G+   +D+ +PNID LA N + +  +Y  P+CTPSRAAF+TG+Y  RYG+ + V 
Sbjct: 35  GFDDLGY-VNDDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                +A+P++E LLPQ LK+ GY+T + GKWH+G   E+  P NRGFD   G++ G   
Sbjct: 94  KPDEPEAIPLSETLLPQALKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y    H++     L      E+    ++  Y T    +  +  +  ++   PLF  ++  
Sbjct: 154 Y--FYHDSGNNCYLYEPCYREKVRLDLNGTYSTKAIAEDFIAKLDEYDPETPLFEFLSFQ 211

Query: 202 AVH 204
            VH
Sbjct: 212 EVH 214


>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 441

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 105/195 (53%), Gaps = 24/195 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYG--IDT 79
           G +D   +G   + TP+ID++A+NGI   + YT  + C+PSRA  LTG+Y   +G   + 
Sbjct: 32  GSSDFSCYGSKQLLTPHIDSIAHNGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANI 91

Query: 80  PVGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           P     A       +PVTE  L   LKELGYSTH IGKWH+G   +   P  RGFDN  G
Sbjct: 92  PHSKHSANDPELLGLPVTEITLADSLKELGYSTHCIGKWHLG-EADHFHPNARGFDNFYG 150

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYA-----PQMSSKYLTDFFTDQSVHVIKSHN 189
           + +G  TY          +G + R +M+R        + SS Y T+ FT +++ +I+   
Sbjct: 151 FLSGARTY---------FLGGELRGDMDRIMRNKEFAEPSSGYTTEVFTQEAIRIIQ-EE 200

Query: 190 HSRPLFLQITHAAVH 204
             +P F+ ++H AVH
Sbjct: 201 QDKPFFIYLSHNAVH 215


>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 473

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 56/117 (47%), Positives = 76/117 (64%), Gaps = 2/117 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW+DVG+H +++I TPNID LA  G+ L  +Y  P CTPSRA  +TG+Y    G+   V 
Sbjct: 35  GWHDVGYH-DSEIQTPNIDMLAAEGVKLENYYVTPLCTPSRAVLMTGRYLIHSGMQHGVL 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            A   + +P  E LLPQ LK+ GYSTH++GKWH+G  K +  P +RGFD   G++N 
Sbjct: 94  VAQNPRCLPTDEILLPQMLKDSGYSTHMVGKWHLGFCKFQCTPNHRGFDTFFGWYNA 150


>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
 gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
          Length = 461

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 68/183 (37%), Positives = 98/183 (53%), Gaps = 10/183 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWND  FHG ++I TPN+D LA  G+ L+R YT PTC+P+RA+ LTG+   R GI  P+ 
Sbjct: 50  GWNDFSFHG-SEIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTGRPASRMGIVAPIS 108

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 +P +   LPQ L +L Y T L+GKWH+G  K E  P   GFD   G+ +G L  
Sbjct: 109 GRSELNLPDSITTLPQALSKLNYKTALMGKWHLGL-KPESGPEVYGFDFSYGFLHGQLDQ 167

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
               ++       +      R    +S K ++TD  T  +VH I +    +  +LQ+ ++
Sbjct: 168 YAHTYK-------NGDSTWYRNGKFISEKGHVTDLLTQSAVHYIDTLQTDQNFYLQVAYS 220

Query: 202 AVH 204
           A H
Sbjct: 221 APH 223


>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 474

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 119/234 (50%), Gaps = 36/234 (15%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG---- 76
           QGW DVGF+G  DIPTPN++ALA +G++ ++ Y+  P C+PSRA  LTG+Y  ++G    
Sbjct: 37  QGWGDVGFNGATDIPTPNLNALAKDGVIFSQGYSSHPYCSPSRAGLLTGRYQQKFGHENN 96

Query: 77  -------IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
                   DT +G      +P+ E ++ + L++  Y T  IGKWH+G N  + LP  RGF
Sbjct: 97  PENEKQNEDTVIG------LPLNELMISEVLQQNNYHTCAIGKWHLG-NAHKFLPNQRGF 149

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN 189
            +  G+  G   Y       +  +G+       +  P+ +  YLTD F++Q+++ I  ++
Sbjct: 150 KDWFGFSGGGFNYWGKTTPKNKELGV---MKNGKPVPENTLTYLTDDFSNQAINYIDQYS 206

Query: 190 HS-RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            + +P F+ + + A H               +   +E      HI N +R  +A
Sbjct: 207 KTEQPFFMYLAYNAPHA-------------PIQATKEYTNLVTHIENGERAAYA 247


>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
          Length = 330

 Score =  113 bits (282), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 70/192 (36%), Positives = 99/192 (51%), Gaps = 23/192 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ND+G   + D+ TPN+DALA  G++L  +Y    C+PSR A +TG+YP    + + V 
Sbjct: 50  GYNDLGLR-DPDVITPNMDALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVI 108

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------- 134
             + AK   +  K LPQYLK+LGY  H+IGKWH+G  +EE LP +RGFD   G       
Sbjct: 109 QPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGD 168

Query: 135 YW-NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           YW +G +   D    T+  V  +AR             +  D   ++   V   H++  P
Sbjct: 169 YWEHGIMGMYD--WHTEAGVDFEAR-----------GTHAQDLEIERLDKVFDEHDNKDP 215

Query: 194 LFLQITHAAVHT 205
           LFL       HT
Sbjct: 216 LFLYFAPQNSHT 227


>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 491

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 110/221 (49%), Gaps = 22/221 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWND  + G +DI TPNID LA  GI L ++Y  P C+PSR+A L GKYP+  G+   V 
Sbjct: 35  GWNDTSYQG-SDIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    + + E  +  +LK+ GYSTH +GKW +G +K E  P  RGFD   GY++    
Sbjct: 94  TNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDA--- 150

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            ++  +       LD R N +    +    Y T  FT      I + + S P F+   + 
Sbjct: 151 -DEDYYTHKVGGYLDFRNNTDPVKDE-DGTYSTFLFTKAIEDAINAKSDS-PFFIYGAYQ 207

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +VH           G L+ PD+  N     +I  P+R++F 
Sbjct: 208 SVH-----------GPLEAPDIYLNK---CNIPYPNRKIFC 234


>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
          Length = 602

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 100/185 (54%), Gaps = 4/185 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G+    D+ +PNIDALA + + L +HY  P+CTPSRAAFLTG+Y  R G+ + V 
Sbjct: 56  GFDDLGYVNR-DVISPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVI 114

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY-- 139
            A   + +P+ E LL +  K+ GY T L GKWH+G    +  P NRGFD   G++ G   
Sbjct: 115 RAPEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFDRFYGFYLGSQD 174

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             ++DS     +    D   +        +  Y T  F D  ++ +  H+ + PLF  ++
Sbjct: 175 FYFHDSGRLEAYPGNGDVENDTILDDFHTNGTYSTKLFVDDFINDLAKHDPAVPLFNYVS 234

Query: 200 HAAVH 204
              VH
Sbjct: 235 FQDVH 239


>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
          Length = 500

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/221 (33%), Positives = 113/221 (51%), Gaps = 28/221 (12%)

Query: 1   IDTPVGAGVAKAVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCT 60
           I   +GA   K   +   +   G ND+G++   ++ TPN+D LA NG++L  +Y  P C+
Sbjct: 19  IQLSLGAANQKPNIIFIAVDDMGNNDIGYNNP-EVDTPNLDNLANNGVILESNYVYPVCS 77

Query: 61  PSRAAFLTGKYPFRYGIDT-PVGAGVAKAVP-----VTEKLLPQYLKELGYSTHLIGKWH 114
           PSRAAF+TG+Y  + G    PV       +      V EKL   Y    GY+ H+IGKWH
Sbjct: 78  PSRAAFMTGRYAHKIGFQRGPVEHKQPAYIESNYKTVAEKLTTNY----GYAAHMIGKWH 133

Query: 115 IGCNKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSS- 170
           +G  K+ + P NRGFD+  G++ G   Y TY  + ++       D R N+    PQ  + 
Sbjct: 134 LGYCKDAVTPTNRGFDSFYGFYGGQENYYTYTSARYK-------DFRDNLTAVTPQNPNY 186

Query: 171 ------KYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
                  Y T  +  +++ ++ +H+ S PLFL +   A H+
Sbjct: 187 PREDVDGYSTFEYKKRAIEIVGNHDKSVPLFLYLAFQAPHS 227


>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 478

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 115/232 (49%), Gaps = 36/232 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--T 79
           G+ D GF G  DI TPN+D LA +G++ N+ Y T   C PSRA FL G+Y  R+G +  T
Sbjct: 33  GYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHAFCGPSRAGFLAGRYQHRFGFEHNT 92

Query: 80  PVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           P   A     + V E L P  L+++GY+T +IGKWH+G +     P NRGFD    Y+ G
Sbjct: 93  PYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWHLGAS-SPFYPLNRGFD----YFYG 147

Query: 139 YLTYNDSIHETD--------FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
           +LT      E D        +  GL   + +  +       YLT   +  +V  + + N 
Sbjct: 148 FLTGGHDYFEIDVTQPVKSAYLQGLFRNKRVANF-----EGYLTTALSRDAVQFV-NDNK 201

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             P FL +++ A H             LQ P  +E+   +AHI +  RR++A
Sbjct: 202 ENPFFLFLSYNAPHQP-----------LQAP--QEDIARYAHIKDKKRRVYA 240


>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 489

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 69/185 (37%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG +DI TPNID LA   +VL+R Y  P C+P+RA  LTG YPFR+GI   V 
Sbjct: 57  GWNDVGFHG-SDIRTPNIDRLARESVVLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +   K  +P   +  P++L +LGY    + GKWH+G       P   G     G++NG +
Sbjct: 116 SPTKKHGLPSELETTPEHLAKLGYDHRAMFGKWHLGLASTLFHPLRHGMTEFYGHYNGAI 175

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y        F   LD  RN +    +    Y T+   +  V  I  H   +PL+  +  
Sbjct: 176 DY---FSRERFGQ-LDWHRNHDSVHEE---GYSTELVGNAVVDFIDRHAGQQPLYAYVAF 228

Query: 201 AAVHT 205
            A H+
Sbjct: 229 NAPHS 233


>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1517

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 82/259 (31%), Positives = 120/259 (46%), Gaps = 56/259 (21%)

Query: 23  GWNDVGF-HGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
           G++DVG+ + +  + TP+IDALA  G+ L+R+Y+  +CTP+R A LTG  P R G+    
Sbjct: 82  GFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFSCTPARGALLTGLSPHRLGLQHGQ 141

Query: 78  ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              + P G      +P    +LPQ+L +LGY +HL+GKWH+G    E LP  RGFD+  G
Sbjct: 142 VFPEQPWG------LPSKFSILPQHLAKLGYRSHLVGKWHLGHFSAERLPTARGFDSFFG 195

Query: 135 YWNGYLTYNDSIHETD-----------FAVG---------------LDARRNMERYAPQM 168
             +G   Y   I   D           F VG                D R N +R     
Sbjct: 196 GLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEHGCYFDLRENNDRVEDLF 255

Query: 169 SSKYLTDFFTDQSVHVIKSHNH-----SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDM 223
            S Y T  F  ++  +I +H+       +PLFL ++  AVH            +    D 
Sbjct: 256 GS-YSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVH----------APVWAPEDT 304

Query: 224 EENDRTFAHISNPDRRLFA 242
            E      +++N +RR FA
Sbjct: 305 YETHPDLLNVTNGNRRKFA 323


>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 653

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 52/117 (44%), Positives = 73/117 (62%), Gaps = 2/117 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GW DVGFH +    TPN+DA+   G+ L+  YT PTCTPSRA  +TG+Y +R G+ D+ +
Sbjct: 48  GWKDVGFH-DTTFSTPNLDAMVAEGVELSTFYTAPTCTPSRAQLMTGRYSYRIGMQDSVL 106

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                + VP+TE  + + L+  GYST  +GKWH+G +  + LP  RGFD+  G   G
Sbjct: 107 HTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMPQFLPVERGFDDFYGILTG 163


>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
 gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
          Length = 474

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 72/198 (36%), Positives = 99/198 (50%), Gaps = 25/198 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
           G+ D GF G  +IPTPN+D LA  G+V  + Y +   C PSRA  LTGKYP R+G +   
Sbjct: 47  GYADFGFQGSTEIPTPNLDQLAQEGVVFKQAYVSASVCGPSRAGLLTGKYPQRFGFEENN 106

Query: 79  -----TPVGA-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
                +  GA G    + + +  +  YL E GY T LIGKWH G N++   P  RGFD  
Sbjct: 107 VPGYMSSSGATGDDMGMRLDQLTMANYLAERGYRTSLIGKWHQG-NEDRFHPLKRGFDEF 165

Query: 133 VGYWNGYLTY------NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
            G+  G  +Y      + S    DF       RN   Y    S  YLTD   ++++  IK
Sbjct: 166 FGFRGGARSYFPFTQAHPSSRREDF-----LERNFNNYGE--SPLYLTDALANETIEFIK 218

Query: 187 SHNHSRPLFLQITHAAVH 204
            + H +P F  ++ +A H
Sbjct: 219 RNKH-QPFFTFLSLSAPH 235


>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 512

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 107/209 (51%), Gaps = 31/209 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGI-DTP 80
           G++DVG+HG   I TPNID++A  G+  ++ Y +   C PSRA  LTG Y  R+G  + P
Sbjct: 32  GYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRAGLLTGVYQQRFGCGENP 91

Query: 81  VGAGV-------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            G+G           +P ++ ++ + LK LGY+  +IGKWH+G +   L P  RG+D   
Sbjct: 92  NGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHMGFDM-SLRPNQRGYDFFY 150

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNME-------RYAPQMSSK--------YLTD 175
           G+ NG   Y +   E  FA G       RN E       +Y      K        YLTD
Sbjct: 151 GFINGSHDYTEWTQE--FAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDENYLTD 208

Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
            FTD++V+ I   N  +P FL + + AVH
Sbjct: 209 LFTDEAVNFI-DRNADKPFFLYLAYNAVH 236


>gi|326801926|ref|YP_004319745.1| Cerebroside-sulfatase [Sphingobacterium sp. 21]
 gi|326552690|gb|ADZ81075.1| Cerebroside-sulfatase [Sphingobacterium sp. 21]
          Length = 454

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 106/206 (51%), Gaps = 6/206 (2%)

Query: 6   GAGVAKAVPVTEKLLP--QGWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPS 62
           G+G A+  P    +L    G++D+G +G   I TP +D++A NG+   +   T P+CTPS
Sbjct: 18  GSGSAQERPNIILVLADDMGYSDLGCYGSPSISTPFLDSMAANGVRATDFMVTSPSCTPS 77

Query: 63  RAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
           RA+ LTG+Y  RY +  P+G G    +P  E  + + LKE+GY T L+GKWH+G   +  
Sbjct: 78  RASLLTGRYASRYNLPDPIGPGSTLGLPDEEITIAEMLKEVGYRTALVGKWHLGDKHDFN 137

Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
            P  +GFD+  G    +  Y D   +TD  + +   RN +      +   L+  ++++ +
Sbjct: 138 YPTGQGFDSFFGMLYSH-DYRDPYVKTDTTIKI--FRNPKPAIQGPADSNLSRIYSEEVI 194

Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTA 208
             IK     +P FL   H   H   A
Sbjct: 195 RFIKEQRKDQPFFLYYAHNMPHLPVA 220


>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
 gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
          Length = 458

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETVADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++H I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIHCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 536

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 68/199 (34%), Positives = 103/199 (51%), Gaps = 11/199 (5%)

Query: 23  GWNDVGFHGE----NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
           G+ DV ++G+    N + TP +D LA +GI L R Y+   CTP+RAA LTG+YP   G+ 
Sbjct: 44  GYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAALLTGRYPSNTGMQ 103

Query: 79  TPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
             V    ++ ++P    LLP  L E GY  H IGKW +G  +    P  RGFD+H+GY+ 
Sbjct: 104 HEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPTARGFDSHLGYYG 162

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSS---KYLTDFFTDQSVHVIKSHNHSRPL 194
             +TY++  H    +      R+M      +++   +Y T  F D ++ ++        L
Sbjct: 163 AEITYDE--HAALRSCSNGTIRDMNHDGATLAATEDRYSTHLFADHAMALVDREADEYKL 220

Query: 195 FLQITHAAVHTGTAGNAKL 213
           FL +   AVH   A +A L
Sbjct: 221 FLYLCFQAVHQPLAADAAL 239


>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 614

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 115/231 (49%), Gaps = 25/231 (10%)

Query: 23  GWNDVGFHGENDIP--TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-T 79
           G+NDVG+   +D+   TP +D L  +G+ ++R Y    CTPSRAA LTGK P    +   
Sbjct: 74  GFNDVGY-ASSDLGEMTPFLDGLMADGVRVDRLYGQQVCTPSRAAMLTGKLPIHLELQHW 132

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            V       +P  E  L QYLK LGYSTH++GKWH+G       P NRGFD+  G+++G 
Sbjct: 133 QVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYNNASTPLNRGFDSFYGFYSGG 192

Query: 140 LTYNDSIHETDFAVGLDARRNM---ERYAPQMSSKYLTDFFTDQSVHVIKSH---NHSRP 193
           + Y    H+          R++   ER       ++ T    ++++ V++ H     S P
Sbjct: 193 VDY--LTHDPSTGYVWRCYRDLWDDERPVTDAHGQHQTSLMNERAIAVLERHAVEKKSEP 250

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPD--MEENDRTFAHISNPDRRLFA 242
           +F  +++         NA LP   LQ P   +E  + T   I N DR+ FA
Sbjct: 251 VFAYVSYP--------NAHLP---LQPPTELLERRNATLLDIPNHDRKNFA 290


>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
 gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
          Length = 449

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 92/170 (54%), Gaps = 8/170 (4%)

Query: 43  LAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLK 101
           LA  G+ L  +Y  P C+PSR   +TG+Y  RYG+  + + +     +P+ E  LPQ L+
Sbjct: 2   LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61

Query: 102 ELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG---YLTY---NDSIHETDFAVGL 155
           E GY ++++GKWH+G  ++E +P  RGFD   GY  G   Y T+   N    +     GL
Sbjct: 62  ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121

Query: 156 DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           D  R+ ++     +  Y T  F  +++  I SH  S+P+FL +   AVH+
Sbjct: 122 DL-RDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHS 170


>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 826

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 105/190 (55%), Gaps = 12/190 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
           G++DVGF+G  +IPTP++D LA +G+V    Y + P C+PSRA  LTG++  R+G     
Sbjct: 56  GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHESNP 115

Query: 77  -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             DT         +P++E  L   LKE GY T  IGKWH+G + +   P +RGFD   G+
Sbjct: 116 EPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNHRGFDEWFGF 174

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G  +Y   + + D  +G+   R  E   P+  + +LTD F+ ++V  I+ H  S P F
Sbjct: 175 SGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLT-HLTDDFSTEAVKFIQRHE-SEPFF 230

Query: 196 LQITHAAVHT 205
           L + + A H 
Sbjct: 231 LYLAYNAPHA 240


>gi|441597518|ref|XP_003266414.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Nomascus
           leucogenys]
          Length = 431

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 5/128 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG++DVG+HG +DI TP +D LA  G+ L  +Y  P CTPSR+  LTG+Y    G+  + 
Sbjct: 57  QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P+ +  LPQ L+E GYSTH++GKWH+G  ++E LP  RGFD   G   G  
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFXGSLTGNV 175

Query: 139 -YLTYNDS 145
            Y TY++ 
Sbjct: 176 DYYTYDNC 183


>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 826

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 104/190 (54%), Gaps = 12/190 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
           G++DVGF+G  +IPTP++D LA +G+V    Y + P C+PSRA  LTG++  R+G     
Sbjct: 56  GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHESNP 115

Query: 77  -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             DT         +P+TE  L   LKE GY T  IGKWH+G + +   P  RGFD   G+
Sbjct: 116 EPDTQWHGEDTPGMPLTETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 174

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G  +Y   + + D  +G+   R  E   P+  + +LTD F+ ++V  I+ H  + P F
Sbjct: 175 SGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLT-HLTDDFSTEAVKFIQRHE-TEPFF 230

Query: 196 LQITHAAVHT 205
           L + + A H 
Sbjct: 231 LYLAYNAPHA 240


>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
          Length = 581

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 99/185 (53%), Gaps = 4/185 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G+    D+ +PNIDALA + + L +HY  P+CTPSRAAFLTG+Y  R G+ + V 
Sbjct: 35  GFDDLGYVNR-DVISPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY-- 139
            A   + +P+ E LL +  K+ GY T L GKWH+G    +  P  RGFD   G++ G   
Sbjct: 94  RATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQIRGFDRFYGFYLGSQD 153

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             ++DS     +    D   +        +  Y T  F D  ++ +  H+ + PLF  ++
Sbjct: 154 FYFHDSGRLKAYPGNGDVENDTILDDLHTNGTYSTKLFVDDFINDLAKHDPAVPLFNYVS 213

Query: 200 HAAVH 204
              VH
Sbjct: 214 FQDVH 218


>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
          Length = 529

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/184 (34%), Positives = 102/184 (55%), Gaps = 5/184 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGF   N I +PN+DALA +GI+L   YT P C+PSR +FL+G+Y ++  +   V 
Sbjct: 15  GFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGVI 73

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + + +   +LP YLKELGY TH  GKWH+G  ++E  P +RGFD+  G ++G   
Sbjct: 74  LDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGE 133

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           + +    T           ++  A    S+ L  ++ +++   +  ++ S PLF+ +   
Sbjct: 134 FYEHTTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKT---LDEYDQSSPLFMYVAFH 190

Query: 202 AVHT 205
            VH+
Sbjct: 191 NVHS 194


>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
          Length = 382

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/184 (34%), Positives = 102/184 (55%), Gaps = 5/184 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGF   N I +PN+DALA +GI+L   YT P C+PSR +FL+G+Y ++  +   V 
Sbjct: 32  GFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGVI 90

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + + +   +LP YLKELGY TH  GKWH+G  ++E  P +RGFD+  G ++G   
Sbjct: 91  LDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGE 150

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           + +    T           ++  A    S+ L  ++ +++   +  ++ S PLF+ +   
Sbjct: 151 FYEHTTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKT---LDEYDQSSPLFMYVAFH 207

Query: 202 AVHT 205
            VH+
Sbjct: 208 NVHS 211


>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
           anophagefferens]
          Length = 351

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/209 (33%), Positives = 99/209 (47%), Gaps = 27/209 (12%)

Query: 23  GWNDVGFH----GENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
           G NDVG+     G   I +P IDALA   +VL+R Y  P CTP+RAA +TG++ +R G+ 
Sbjct: 22  GRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAALMTGRHAYRTGLA 81

Query: 79  TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
             V  A     +P  E  + + L++ GYSTH+IGKWH+G  K+ +LP +RGFD   GY  
Sbjct: 82  YFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLPTSRGFDRFFGYCL 141

Query: 138 GYLTY--------NDSIHETDFAVG-------------LDARRNMERYAPQMSSKYLTDF 176
           G   Y           +  TD A G              D    + R  P+  + +  D 
Sbjct: 142 GSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPR-TPKTENVHSADL 200

Query: 177 FTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           F  ++     +    +PLFL     A H 
Sbjct: 201 FAARATETFATAPRDKPLFLYYASQAPHA 229


>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 467

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/200 (34%), Positives = 102/200 (51%), Gaps = 29/200 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT-P 80
           G+ D   +G  +IPTPNI+ LA  G + ++ Y +   C PSRA  LTG+Y  R+G +  P
Sbjct: 39  GYADFECYGNKEIPTPNINRLAKEGTLFSKAYVSASVCAPSRAGLLTGRYQQRFGFENNP 98

Query: 81  VGA---GVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            G    G  K    + ++EK +   +KE GY T  +GKWH+G N  +  P  RGFD   G
Sbjct: 99  TGKPREGFKKEDMGLALSEKTIGDRMKEEGYRTLAVGKWHLG-NDAKFFPLKRGFDEFYG 157

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYA--------PQMSSKYLTDFFTDQSVHVIK 186
           +  G        H   F+     ++  E+YA        P+    YLTD FTD+++  I 
Sbjct: 158 FQEG--------HRDFFSF---KKKRAEKYALWDNDKIIPEEEITYLTDMFTDKALKFID 206

Query: 187 SH-NHSRPLFLQITHAAVHT 205
            + +  +P F+ + + AVHT
Sbjct: 207 ENADKKQPFFIYLAYNAVHT 226


>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
           XB1A]
          Length = 458

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 96/185 (51%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP++DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E    +    Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETCHDK---GYSTELITQEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
 gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
          Length = 458

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP++DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITQEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 810

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 99/190 (52%), Gaps = 12/190 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
           G++DVGF+G  +IPTP +D LA  G+V    Y + P C+PSRA  LTG+Y  R+G     
Sbjct: 40  GYSDVGFNGCKEIPTPRLDELAGEGVVFTNGYASHPYCSPSRAGLLTGRYQQRFGHEGNP 99

Query: 77  -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D          +P++E  L   LKE GY T  IGKWH+G + +   P  RGFD   G+
Sbjct: 100 EPDPQWHGDDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 158

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G L+Y   +   D  +G+      +    + S  YLTD F+ ++V  I+ H  + P F
Sbjct: 159 SGGGLSYWGDLGRMDPLLGV---HRGDEPVDRKSLTYLTDDFSTEAVKFIQRH-ETDPFF 214

Query: 196 LQITHAAVHT 205
           L + + A H 
Sbjct: 215 LYLAYNAPHA 224


>gi|313241546|emb|CBY33792.1| unnamed protein product [Oikopleura dioica]
          Length = 336

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/184 (36%), Positives = 96/184 (52%), Gaps = 26/184 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GW+DV ++ +    TP +  L  + I L+  Y+   CTPSRA+ LTGKY +R+G+ T P+
Sbjct: 41  GWSDVSWNNKKIKATPFLGQLEKHSITLSSSYSTHRCTPSRASLLTGKYAWRFGLGTDPI 100

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A  A  + + EKLLP+ L++ GYSTH +GKWH+G      LP NRGFD   G+  G L 
Sbjct: 101 DANTAAGLDLKEKLLPEILRKNGYSTHHVGKWHLGHCNSSYLPHNRGFDTFYGHTGGVLN 160

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH-------VIKSHNHSRPL 194
           Y     + + AVG              + KYL  F  D  +H            +H+R L
Sbjct: 161 Y----FQHNRAVG--------------NCKYLDYFENDTPIHEKTGVYSTFDFGDHARKL 202

Query: 195 FLQI 198
           + +I
Sbjct: 203 YNKI 206


>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
 gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
          Length = 446

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 101/187 (54%), Gaps = 6/187 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G + + TPN+D+LA  GI     Y+  P C+PSRA+ LTGKYP R G+   +
Sbjct: 22  GYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEIL 81

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           GA      +P TE  L + LK  GY T L GKWH+G + EE  P   GFD   G+  G +
Sbjct: 82  GAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGVS-EETSPNAHGFDEFFGFKAGCI 140

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
            +   I       G++   ++     ++  + +Y+T+  T++SV  IK S     P FL 
Sbjct: 141 DFYSHIFYWGQGHGVNPLHDLWENETEVWENGRYMTELITERSVDFIKRSREQEDPFFLF 200

Query: 198 ITHAAVH 204
           +++ A H
Sbjct: 201 VSYNAPH 207


>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
          Length = 621

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 101/208 (48%), Gaps = 21/208 (10%)

Query: 40  IDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQ 98
           +D LA  G+ L  +Y  P CTPSR+ F+TGKY    G+  + +       +P+    LPQ
Sbjct: 80  LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139

Query: 99  YLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH-ETDFAVGLDA 157
            LKE+GYSTH++GKWH+G  ++E +P  RGFD   G   G   Y      ++    G D 
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDL 199

Query: 158 RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGL 217
             N        +  Y T  +T +   ++ SH+  +P+FL I + AVH+            
Sbjct: 200 YENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----------P 248

Query: 218 LQVPDMEENDRTFAH---ISNPDRRLFA 242
           LQ P      R F H   I N +RR +A
Sbjct: 249 LQAP-----GRYFEHYRSIININRRRYA 271


>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
 gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
          Length = 498

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP++DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 77  GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 135

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 136 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 195

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 196 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITQEAIRCIDAYEKEGPFMLYVAY 248

Query: 201 AAVHT 205
            A HT
Sbjct: 249 NAPHT 253


>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 826

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 105/190 (55%), Gaps = 12/190 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
           G++DVGF+G  +IPTP++D LA +G+V    Y + P C+PSRA  LTG++  R+G     
Sbjct: 56  GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHESNP 115

Query: 77  -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             DT         +P++E  L   LKE GY T  IGKWH+G + +   P +RGFD   G+
Sbjct: 116 EPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNHRGFDEWFGF 174

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G  +Y   + + D  +G+   R  E   P+  + +LTD F+ ++V  I+  N + P F
Sbjct: 175 SGGGFSYWGDLGKKDPLLGV--HRGDEPVEPKTLT-HLTDDFSTEAVKFIQ-RNETEPFF 230

Query: 196 LQITHAAVHT 205
           L + + A H 
Sbjct: 231 LYLAYNAPHA 240


>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
          Length = 551

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 102/184 (55%), Gaps = 5/184 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+NDVGF   N I +PN+DALA +GI+L   YT P C+PSR +F++G+Y ++  +   V 
Sbjct: 37  GFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGRYSYKSAMQHGVI 95

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + + +   +LP YLKELGY TH  GKWH+G  ++E  P +RGFD+  G ++G   
Sbjct: 96  LDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGE 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           + +    T           ++  A    S+ L  ++ +++   +  ++ S PLF+ +   
Sbjct: 156 FYEHTTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKT---LDEYDQSSPLFMYVAFH 212

Query: 202 AVHT 205
            VH+
Sbjct: 213 NVHS 216


>gi|365877064|ref|ZP_09416570.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
 gi|442586911|ref|ZP_21005733.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
 gi|365755338|gb|EHM97271.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
 gi|442563318|gb|ELR80531.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
          Length = 454

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 96/183 (52%), Gaps = 4/183 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIV-LNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G   I TP +D ++ NG++  N   + PTCTPSRA+ LTG+Y  RY +  P+
Sbjct: 37  GYADIGAYGNPVIKTPFLDQMSRNGLMATNYVVSSPTCTPSRASMLTGRYSSRYDLPWPI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G  + +P  E  + + LK  GY+T ++GKWH+G  K E  P  +GFD + G    +  
Sbjct: 97  APGSKQGLPDDEVTIAEMLKANGYNTGMVGKWHLGDQKAENKPNGQGFDFYYGILYSH-D 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y      TD  + +     +E   P  +   LT  +T +S++ I+     +P FL + H 
Sbjct: 156 YKAPYVNTDIPIRMFRNTKVEIEKP--ADSLLTRLYTKESINYIRQQKKDKPFFLYLAHN 213

Query: 202 AVH 204
             H
Sbjct: 214 MPH 216


>gi|440716880|ref|ZP_20897383.1| arylsulfatase [Rhodopirellula baltica SWK14]
 gi|436438073|gb|ELP31649.1| arylsulfatase [Rhodopirellula baltica SWK14]
          Length = 616

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/187 (36%), Positives = 93/187 (49%), Gaps = 24/187 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  HG   I TP +DALA     L+R Y  P C P+RAA LTG+YP R G+    
Sbjct: 72  QGWGDLAAHGNPKISTPTLDALANESARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 127

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            AGV    + +   E  L +  +  GY T   GKWH G  +  L P  +GFD   G+  G
Sbjct: 128 -AGVTGRREVMRAEETTLAELYRSAGYVTGCFGKWHNGA-QMPLHPNGQGFDEFFGFCGG 185

Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           +   Y+D++ E          RN     P  ++ Y+TD  TD +V  I++H H RP F  
Sbjct: 186 HFNLYDDALLE----------RN---GTPVQTNGYITDVLTDAAVDFIQNH-HDRPFFCY 231

Query: 198 ITHAAVH 204
           +   A H
Sbjct: 232 VPFNAPH 238


>gi|417303299|ref|ZP_12090357.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327540271|gb|EGF26857.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 616

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/187 (36%), Positives = 93/187 (49%), Gaps = 24/187 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  HG   I TP +DALA     L+R Y  P C P+RAA LTG+YP R G+    
Sbjct: 72  QGWGDLAAHGNPKISTPTLDALANKSARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 127

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            AGV    + +   E  L +  +  GY T   GKWH G  +  L P  +GFD   G+  G
Sbjct: 128 -AGVTGRREVMRAEEITLAELYRSAGYVTGCFGKWHNGA-QMPLHPNGQGFDEFFGFCGG 185

Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           +   Y+D++ E          RN     P  ++ Y+TD  TD +V  I++H H RP F  
Sbjct: 186 HFNLYDDALLE----------RN---GTPVQTNGYITDVLTDAAVEFIQNH-HDRPFFCY 231

Query: 198 ITHAAVH 204
           +   A H
Sbjct: 232 VPFNAPH 238


>gi|300773469|ref|ZP_07083338.1| cerebroside-sulfatase [Sphingobacterium spiritivorum ATCC 33861]
 gi|300759640|gb|EFK56467.1| cerebroside-sulfatase [Sphingobacterium spiritivorum ATCC 33861]
          Length = 449

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 98/194 (50%), Gaps = 11/194 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G++D+G +G   I TP +D +A  G+   +   T P+CTPSRA+ LTG+Y  RY +  P+
Sbjct: 23  GYSDLGCYGNPSIATPFLDKMAAKGVRATDYMVTSPSCTPSRASLLTGRYASRYNLPDPI 82

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
           G G    +P  E  + + LKE GY T LIGKWH+G +  E LP  +GFD    Y+ G L 
Sbjct: 83  GPGAKNGLPAQEVTIAEMLKEKGYRTALIGKWHLG-DHGEYLPNKQGFD----YFYGMLY 137

Query: 141 --TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
              Y D   +TD  + +   RN      + +   L+  +T++    I       P FL  
Sbjct: 138 SHDYRDPYVKTDTTIKI--FRNQTPVVTRPADSALSRIYTEEVKQYISQQKKGEPFFLYY 195

Query: 199 THAAVHTGTAGNAK 212
            H   H   A +A+
Sbjct: 196 AHNMPHLPVAFSAE 209


>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 110/223 (49%), Gaps = 26/223 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWND  + G +DI TPNID LA  GI L ++Y  P C+PSR+A L GKYP+  G+   V 
Sbjct: 35  GWNDTSYQG-SDIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             G    + + E  +  +LK+ GYSTH +GKW +G +K E  P  RGFD   GY++    
Sbjct: 94  TNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDA--- 150

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            ++  +       LD R N +    +    Y T  FT      I + + S P F+   + 
Sbjct: 151 -DEDYYTHKVGGYLDFRNNTDPVKDE-DGTYSTFLFTKAIEDAINAKSDS-PFFIYGAYQ 207

Query: 202 AVHTGTAGNAKLPTGLLQVPD--MEENDRTFAHISNPDRRLFA 242
           +VH+            L+ PD  +E+      H   P+R++F 
Sbjct: 208 SVHSP-----------LEAPDTYLEK-----CHSPYPNRKIFC 234


>gi|410611985|ref|ZP_11323071.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
 gi|410168398|dbj|GAC36960.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
          Length = 508

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 95/184 (51%), Gaps = 14/184 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G   I TPNID +A  G   +  Y   P CTPSRA  LTG+YP R GI    
Sbjct: 61  GYGDIGAYGSTTINTPNIDKMAAQGAKFDEFYAASPVCTPSRAGLLTGRYPIRQGIHNVF 120

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                + +   E  + + LK  GY+T L+GKWH+G + E+ +P+N+GFD   G     L 
Sbjct: 121 FPESFQGMDPEEITIAEVLKGAGYATGLVGKWHLG-HHEQYMPWNQGFDEFFG-----LP 174

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y++ +       GL    N +    ++  +Y+T  +TDQ++  I  H   +P FL + H 
Sbjct: 175 YSNDMG------GLYYFNNKDIDFEEVDQRYMTKTYTDQALQFIDKH-QEQPFFLYLAHN 227

Query: 202 AVHT 205
             H 
Sbjct: 228 MPHV 231


>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 534

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 76/224 (33%), Positives = 109/224 (48%), Gaps = 25/224 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DVG+H ++ I TPNID LA  G+ L  +Y    C PSR   +TG+Y  R G+     
Sbjct: 80  GWHDVGYH-DSVIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGRYRIRTGL----Y 134

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-GCNKEELLPFNRGFDNHVGYWNG--- 138
             V   + + E  L   L E GY T ++GKWH+ G    E  P +RGF   +GY  G   
Sbjct: 135 GDVCDFMGIHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRGFQTFLGYHGGSQN 194

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y T+        +    D   N    A +   +Y T  F D++  +I+ HN  +PLFL +
Sbjct: 195 YFTHRRGGPHAPY----DFWANDTSIAVKYEGQYSTMIFADEAQRIIRQHNTKQPLFLYL 250

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +  AVH        +P  LL  P  E+  R+   I +  RR++A
Sbjct: 251 SFQAVH--------VP--LLVPPSYEDQYRSL--IEDDKRRVYA 282


>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 473

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/226 (29%), Positives = 105/226 (46%), Gaps = 19/226 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGI---- 77
           G+ D GF G   + TPN+D LA +G+   + Y +  TC PSRA  +TGKY  R+G     
Sbjct: 40  GFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAGLMTGKYQQRFGYEEIN 99

Query: 78  ------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
                 D     G    +P+ +K +  YLKE GY T + GKWH+G + +   P  RGFD 
Sbjct: 100 VPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHLG-DADRFHPLKRGFDT 158

Query: 132 HVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
            +G+  G   Y  Y++   +       D +   +    +   +YLTD    ++   I+  
Sbjct: 159 FLGFRGGDRSYFNYSEQEMKNGNKHFFDKKLERDFGNYEEPKEYLTDVLGKEAAKYIE-Q 217

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
           N   P F+ +   AVHT    +   P  L + P++    +  A ++
Sbjct: 218 NKDEPFFIYLAFNAVHTPLESD---PKDLAKFPNLTGKRKELAAMT 260


>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 484

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 77/234 (32%), Positives = 112/234 (47%), Gaps = 28/234 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG--IDT 79
           G+NDVGF+G  DI TPN+D LA +G +    Y   P C PSRAA LTG+YP   G   + 
Sbjct: 42  GYNDVGFNGSTDITTPNLDQLAQDGTIFTSAYVAHPFCGPSRAALLTGRYPHTLGSQFNL 101

Query: 80  PV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           P  GA   K + V EK +   +++ GY T  IGKWH+G    E  P  RGF++  G+  G
Sbjct: 102 PANGASTGKGISVEEKFMGVPMQKAGYYTGAIGKWHLG-ETAEYHPNKRGFNDFYGFLGG 160

Query: 139 YLTYNDSIHETDFAVGLD-ARRNMERY--------APQMSSKYLTDFFTDQSVHVIK-SH 188
              Y    ++  +    +   +N+  Y        A    + YLTD  + + +   K +H
Sbjct: 161 GHKYFPEEYKLQYKHQKEMGTKNINDYVLPLEHNGAIVEENDYLTDVLSREGIRFTKEAH 220

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +  +P FL + + A H       K         D+E+    F  I + DRR +A
Sbjct: 221 DKKKPFFLYLAYNAPHVPLEAKEK---------DLEK----FKDIEDIDRRTYA 261


>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
 gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
          Length = 458

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E    +    Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETCHDK---GYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|227536643|ref|ZP_03966692.1| sulfatase family protein [Sphingobacterium spiritivorum ATCC 33300]
 gi|227243444|gb|EEI93459.1| sulfatase family protein [Sphingobacterium spiritivorum ATCC 33300]
          Length = 461

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 98/194 (50%), Gaps = 11/194 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G++D+G +G   I TP +D +A  G+   +   T P+CTPSRA+ LTG+Y  RY +  P+
Sbjct: 35  GYSDLGCYGNPSISTPFLDKMAAKGVRATDYMVTSPSCTPSRASLLTGRYASRYNLPDPI 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
           G G    +P  E  + + LKE GY T LIGKWH+G +  E LP  +GFD    Y+ G L 
Sbjct: 95  GPGAKNGLPAQEVTIAEMLKEKGYHTALIGKWHLG-DHGEYLPNKQGFD----YFYGMLY 149

Query: 141 --TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
              Y D   +TD  + +   RN      + +   L+  +T++    I       P FL  
Sbjct: 150 SHDYRDPYVKTDTTIKI--FRNQTPVVTRPADSALSRIYTEEVKQYISQQKKGEPFFLYY 207

Query: 199 THAAVHTGTAGNAK 212
            H   H   A +A+
Sbjct: 208 AHNMPHLPVAFSAE 221


>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
 gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
          Length = 458

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
          Length = 506

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 111/223 (49%), Gaps = 20/223 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWND+  H    + TPN+D L    + L  +Y  P CTP+R+  ++G+Y    G+   V 
Sbjct: 30  GWNDISLHNSY-LSTPNVDGLIQESLHLQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVI 88

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            G     +P+T+ + P+  K+ GY TH++GKWH+G   E+++P NRGF++H GY  G   
Sbjct: 89  LGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKMVPENRGFESHYGYLIGAEG 148

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           + +         G+D R   +  A   SS  +Y  D F  +   ++++H+    L++ + 
Sbjct: 149 HYNHSQFMQGQNGVDFR---DGGASTNSSWGQYSADLFAKRVEDLVEAHDVEESLYMYVG 205

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              VH             L+ P    +   F+ I + DRR++A
Sbjct: 206 LQNVHYP-----------LEAPQHYVD--QFSWIKDRDRRVYA 235


>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 458

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 472

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 81/232 (34%), Positives = 111/232 (47%), Gaps = 36/232 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D GF G  DI TPN+D LA NG    + Y     C PSRAA L+G+Y  R+G +T  
Sbjct: 37  GYADTGFTGATDIQTPNLDNLAKNGAFFKQGYANHAYCGPSRAALLSGRYQHRFGFETNP 96

Query: 82  GAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
               A     + V EKL P+ L+E GY T  IGKWH+G    E  P NRGFD    Y+ G
Sbjct: 97  AYDPANPHMGIDVGEKLFPKRLQEAGYKTGAIGKWHLGA-AAEFHPLNRGFD----YFYG 151

Query: 139 YLTYNDSIHETDFAVGLDARRNMERY-APQMSSK-------YLTDFFTDQSVHVIKSHNH 190
           +L         D       ++  E Y  P + +K       YLT   ++ +   +K  N 
Sbjct: 152 FLGGGHDYFRID-----GTKKVWEAYLQPLVRNKRADNFEGYLTTALSNDAAQFVKD-NK 205

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             P FL + + A H        +P   LQ P  +E+   +AHI +  RR++A
Sbjct: 206 ENPFFLYVAYNAPH--------MP---LQAP--KEDIARYAHIKDNKRRVYA 244


>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
 gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
          Length = 458

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
          Length = 458

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 458

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
 gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
          Length = 458

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP +DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRGF +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 97/187 (51%), Gaps = 16/187 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP---FRYGIDT 79
           G+ DVGF     I +PN D LA  G+VLNRHY    C PSRA+ LTG++P   +++ + T
Sbjct: 35  GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTGRWPHHVYQWNLAT 93

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
              AG      +   + P  LK   YSTH++GKWH G      LP NRGFD   GY  G+
Sbjct: 94  DATAGTN----LNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRGFDTSSGYLCGW 149

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           + + +          +D  +N    AP   +  Y + ++ D   +++ +H+ + PLF+ +
Sbjct: 150 VDHFNQKQ----GCAVDCWKNN---APDPRNGTYDSYYYRDDLTNIVNNHDANNPLFIYL 202

Query: 199 THAAVHT 205
               VHT
Sbjct: 203 PLHNVHT 209


>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 470

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 63/159 (39%), Positives = 89/159 (55%), Gaps = 15/159 (9%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH 147
            +P+ E LLP++L +LGY++H +GKWH+G  K+   P  RGF +  G+WNGY  Y   + 
Sbjct: 22  GLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGFKSFYGFWNGYQDYYTHMV 81

Query: 148 ETDFAV--GLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAV 203
           +  FA   G D RR++    P  SS  KY T  FT ++  +I  HN+S PLFL + H A 
Sbjct: 82  QATFASFEGFDMRRDLN---PDWSSVGKYSTHLFTKEATDIITKHNNSVPLFLYLAHLAP 138

Query: 204 HTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           H GT  N       LQ P  +E+  +F  I +  RR +A
Sbjct: 139 HAGTYENP------LQAP--QEDINSFQSIKDKYRRKYA 169


>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 543

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 103/190 (54%), Gaps = 12/190 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
           G++DVGF+G  +IPTP++D LA +G+V    Y + P C+PSRA  LTG++  R+G     
Sbjct: 56  GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHGSNP 115

Query: 77  -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             DT         +P++E  L   LKE GY T  IGKWH+G + +   P  RGFD   G+
Sbjct: 116 EPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 174

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G  +Y   +   D  +G+   R  E   P+  + +LTD F+ ++V  I+ H  + P F
Sbjct: 175 SGGGFSYWGDLGMKDPLLGV--HRGDEPVDPKTLT-HLTDDFSTEAVKFIQRHE-TEPFF 230

Query: 196 LQITHAAVHT 205
           L + + A H 
Sbjct: 231 LYLAYNAPHA 240


>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 461

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 67/183 (36%), Positives = 97/183 (53%), Gaps = 9/183 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVG+HG + I TP +D+LA NG  L R Y  PTC+PSRAA LTG    R GI  P+ 
Sbjct: 47  GWNDVGYHG-SKIKTPVLDSLANNGAKLERFYVAPTCSPSRAALLTGIPASRLGIVAPIA 105

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                A+P +   LP+ +K+LGY T L GKWH+G   E   P   GFD   G+ +G +  
Sbjct: 106 GKSKIALPDSLVTLPKAMKKLGYRTALFGKWHLGLTPEN-GPQAYGFDTSYGFLHGQI-- 162

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITHA 201
           +   HE  +  G  +     ++  +    ++TD  TD ++        +  P F+ + ++
Sbjct: 163 DQYTHE--YKNGDPSWHKNGKFLKE--DGHVTDLLTDAAIAYFNQETKTETPSFVTLAYS 218

Query: 202 AVH 204
           A H
Sbjct: 219 APH 221


>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
           variabilis ATCC 29413]
 gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
           ATCC 29413]
          Length = 457

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 95/190 (50%), Gaps = 16/190 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRY--GIDT 79
           GW D+  +G  D  TPN+D LA  G+     Y   T CTP+R AFLTG+Y  R   G+  
Sbjct: 53  GWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGLRE 112

Query: 80  PVGAGVAKA-----VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           P+GA    A     +P  +  +   LK  GY T L+GKWH G       P  +GFD + G
Sbjct: 113 PLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAGY-PPNFGPLQKGFDEYFG 171

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           + +G + Y      TD  + L      E   P   S Y+TD FTD++V  I+   HSRP 
Sbjct: 172 HLSGGIEYFTHTG-TDRILDL-----YENDVPVQRSGYVTDLFTDRAVEFIQ-RPHSRPF 224

Query: 195 FLQITHAAVH 204
           +L + + A H
Sbjct: 225 YLSLHYNAPH 234


>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
          Length = 458

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 94/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GW DVGFHG ++I TP++DAL   G+ L R YT P  TP+RA  +TG+YP R+G+ + V 
Sbjct: 37  GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                  +   E+ +   L   GY    +IGKWH+G  K+   P NRG  +  G+ NG +
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAI 155

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +    LD   + E         Y T+  T +++  I ++    P  L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITQEAIRCIDAYEKEGPFMLYVAY 208

Query: 201 AAVHT 205
            A HT
Sbjct: 209 NAPHT 213


>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 480

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 106/224 (47%), Gaps = 35/224 (15%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW DV FHG   IPTPNID LA +G++L+ +Y LP CTPSRAA +TG YP   G+   V
Sbjct: 4   QGWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGV 63

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKW-HIGCNKEELLPFNRGFD--------- 130
               A   + +  K++PQ+ ++LGY  ++IGK  H GC+  +      G D         
Sbjct: 64  IQDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSHHDGCHNFD--STKSGIDLLHTPLISS 121

Query: 131 ---NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
               H   W    TYN   + T     L  R        Q    YL  F  D        
Sbjct: 122 IPGQHDNTWMYGKTYN--FYRTRAICRLQKR------VIQGVVVYLIRFLLDL------- 166

Query: 188 HNHSRPLFLQITHAAVHTGTAGNA-KLPT-GLLQVPDMEENDRT 229
             HS+P F  ++H AVH+    +  + P   LL+ P + E +RT
Sbjct: 167 --HSQPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRT 208


>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 100/188 (53%), Gaps = 8/188 (4%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           QG+ D+G+HG +D+ TP +D LA  G+ L  +Y  P C+PSR+  +TG+Y    G+  + 
Sbjct: 54  QGYADIGYHG-SDVHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSI 112

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           +       +P     LP+ L + GY TH++GKWH+G  +   LP  RGF +  G   G  
Sbjct: 113 IRPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGFQSFFGTLTGSG 172

Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            + +Y     +   A G D   +  R A +M   Y T  + ++   ++++H+ + PLFL 
Sbjct: 173 DHFSYQSC--DGAEACGFDL-HDGSRPAWEMRGNYSTLLYIERVKQILRNHDPNTPLFLY 229

Query: 198 ITHAAVHT 205
           ++  A HT
Sbjct: 230 LSLQAAHT 237


>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
           siliculosus]
          Length = 597

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 100/205 (48%), Gaps = 23/205 (11%)

Query: 23  GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G NDVG+   +    TP ++ LA  G++L+ +Y+   CTPSRA+ +TG+  FR G+    
Sbjct: 85  GTNDVGYESTDLWQLTPFMNTLAAEGVILDDYYSNEICTPSRASLMTGRDSFRTGMQFGV 144

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------ 134
           V    A  +P+ E  L +  +  GYSTH+ GKWH+G       PF+RGFD  +G      
Sbjct: 145 VEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANYPFSRGFDTFLGYTGGGE 204

Query: 135 ------------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
                       +  G  T        D    ++   N  R  P+M+ KY T   TD+++
Sbjct: 205 GYYTHRECVTPEFEGGQYTCYQDFGYGDKDGYINFTTNTTRKGPEMTGKYSTTVITDRAI 264

Query: 183 HVIKSH---NHSRPLFLQITHAAVH 204
            V + H   + S PLFL + H AVH
Sbjct: 265 EVAREHVEKSPSDPLFLYVAHQAVH 289


>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
 gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
          Length = 465

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 71/228 (31%), Positives = 109/228 (47%), Gaps = 25/228 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G   I TP++D LA  G+  +R Y T P C+PSR   LTG++P RYGI T  
Sbjct: 40  GYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLLTGRFPKRYGITTNP 99

Query: 82  GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
              +         +P TEKL+P+YL   GY + + GKWH+G  K    P  RGF +  G+
Sbjct: 100 NIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHLGHTK-GYTPPERGFTHWWGF 158

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPL 194
             G   Y     E   A GL+    +  +  +    YLTD  TD++V  ++ +    +P 
Sbjct: 159 LGGSRHYFPVKKE---AEGLNPSMIVSNFTDKTDITYLTDDITDRAVEFLQEAGKDKKPF 215

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           F+ +++ A H                    E+   F ++ N +RR++ 
Sbjct: 216 FMFVSYNAPHWPNEAKP-------------EDIAKFRNVQNGERRVYC 250


>gi|301308937|ref|ZP_07214882.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
 gi|423338414|ref|ZP_17316156.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
           CL09T03C24]
 gi|300832963|gb|EFK63588.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
 gi|409233843|gb|EKN26675.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
           CL09T03C24]
          Length = 589

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 57/248 (22%)

Query: 5   VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
           VGAG   A    +K LP         QGW D+GF G   + TPNID +A+ G +L   Y 
Sbjct: 13  VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70

Query: 56  LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            P  +P+RA FLTG+Y  R G+++  G G  +   + EK + +Y +E GY+T L GKWH 
Sbjct: 71  CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNLGEKTIAEYFREAGYATSLFGKWHS 128

Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
           G  +    P  RGF+   G        YWN  L +N  I   +                 
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170

Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
               ++ D  TD+++  I+ H    P F+ +++   H+            +QVPD   N 
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215

Query: 227 --DRTFAH 232
             DRT + 
Sbjct: 216 VKDRTLSQ 223


>gi|298377639|ref|ZP_06987590.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|298265342|gb|EFI07004.1| arylsulfatase [Bacteroides sp. 3_1_19]
          Length = 589

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 57/248 (22%)

Query: 5   VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
           VGAG   A    +K LP         QGW D+GF G   + TPNID +A+ G +L   Y 
Sbjct: 13  VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70

Query: 56  LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            P  +P+RA FLTG+Y  R G+++  G G  +   + EK + +Y +E GY+T L GKWH 
Sbjct: 71  CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNLGEKTIAEYFREAGYATSLFGKWHS 128

Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
           G  +    P  RGF+   G        YWN  L +N  I   +                 
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170

Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
               ++ D  TD+++  I+ H    P F+ +++   H+            +QVPD   N 
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215

Query: 227 --DRTFAH 232
             DRT + 
Sbjct: 216 VKDRTLSQ 223


>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
          Length = 300

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 74/196 (37%), Positives = 99/196 (50%), Gaps = 27/196 (13%)

Query: 59  CTPSRAAFLTGKYPFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIG 111
           CTPSRAA LTGKYP   G+       D P G      +P+ E  + +  +E GY T L+G
Sbjct: 2   CTPSRAALLTGKYPINTGMQHYVIVNDQPWG------LPLNETTMAEIFRENGYRTSLLG 55

Query: 112 KWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMS 169
           KWH+G ++    P  RGFD H+GY   Y+ Y    +E       G D R +++     + 
Sbjct: 56  KWHLGLSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHV- 114

Query: 170 SKYLTDFFTDQSVHVIKSH---NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN 226
             Y+TD  TD +V  I+ H   N S+PLFL + H A H   A N   P   +Q P  EE 
Sbjct: 115 GHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPH---AANDDDP---MQAP-AEEV 167

Query: 227 DRTFAHISNPDRRLFA 242
            R F +ISN   R +A
Sbjct: 168 SR-FEYISNKTHRYYA 182


>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
          Length = 345

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/186 (33%), Positives = 96/186 (51%), Gaps = 11/186 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G     D+ TPN+DALA  G++L  +Y    CTPSR A ++G+YP    + + V 
Sbjct: 37  GYHDIGLRNP-DVITPNLDALASKGVILTNNYVQALCTPSRHALMSGRYPSASAMQSMVI 95

Query: 83  AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
             + A+   +  K LPQYLKELGY  H++GKWH+G  ++E LP +RGFD   G + G   
Sbjct: 96  QPMEARCAGLEYKFLPQYLKELGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGAGD 155

Query: 142 Y--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           Y  ++   + D+ +  D          + +  +  D   +    V   H+   PLFL   
Sbjct: 156 YWSHEIFGKYDWHINGDIHF-------EANGTHSQDLEMEGLDKVFDEHDSKDPLFLYFA 208

Query: 200 HAAVHT 205
               HT
Sbjct: 209 PQNPHT 214


>gi|150010519|ref|YP_001305262.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
           8503]
 gi|149938943|gb|ABR45640.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
           8503]
          Length = 589

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 57/248 (22%)

Query: 5   VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
           VGAG   A    +K LP         QGW D+GF G   + TPNID +A+ G +L   Y 
Sbjct: 13  VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70

Query: 56  LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            P  +P+RA FLTG+Y  R G+++  G G  +   + EK + +Y +E GY+T L GKWH 
Sbjct: 71  CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNLGEKTIAEYFREAGYATSLFGKWHS 128

Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
           G  +    P  RGF+   G        YWN  L +N  I   +                 
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170

Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
               ++ D  TD+++  I+ H    P F+ +++   H+            +QVPD   N 
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215

Query: 227 --DRTFAH 232
             DRT + 
Sbjct: 216 VKDRTLSQ 223


>gi|374373208|ref|ZP_09630868.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
 gi|373234181|gb|EHP53974.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
          Length = 454

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 60/183 (32%), Positives = 95/183 (51%), Gaps = 4/183 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIV-LNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G++D G +G   I TP +D +A +G +  N   + P+CTPSRA+ LTG+Y  RY +  P+
Sbjct: 37  GYSDPGCYGNPVIQTPFLDKIARSGFMSTNYIVSSPSCTPSRASLLTGRYASRYNLPDPI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G G    +P  E  + + LK  GY T ++GKWH+G   +   P  +GFD   G       
Sbjct: 97  GPGSKLGLPDAEVTMAEMLKAAGYKTAMVGKWHLGDQHDYNYPTGQGFDRFYGMLYSQ-D 155

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y     +TD  + +   R  E + P+ S+  LT  +T +S+ +I+     +P FL + + 
Sbjct: 156 YRAPYVKTDTVIKIFRNRTPEIFKPEDST--LTQLYTKESIKIIREQRPGQPFFLYLAYN 213

Query: 202 AVH 204
             H
Sbjct: 214 MPH 216


>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
 gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
          Length = 470

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 77/234 (32%), Positives = 113/234 (48%), Gaps = 36/234 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D G +G  +IPTPNIDALA NG +    Y +   C PSRA  LTG Y  R+G +  +
Sbjct: 38  GYADFGCYGGKEIPTPNIDALAKNGTLFTDAYVSASVCAPSRAGILTGMYQQRFGFEHNI 97

Query: 82  GAGVAKAVPVTE-------KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
                K   + +       K +   +K  GY T  IGKWH G +  +  P  RGFD   G
Sbjct: 98  SELPVKPYTLNDVGMDPKIKTIGDQMKHNGYRTIAIGKWHQG-DLPQYFPLKRGFDEFYG 156

Query: 135 YWNGYLTY------NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
           +  G+ ++          HE        A  + ++  P+ +  YLTD FTD+++  +K  
Sbjct: 157 FVGGHRSFFGYPGGKAPSHEL-------ALFDNDKIVPENTIGYLTDMFTDKAISFVK-E 208

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           N S+P F+ + + AVH     NAK           E  DR F +I++P R+ +A
Sbjct: 209 NKSKPFFMYLAYNAVHVPM--NAK----------KELMDR-FPNITDPGRKAYA 249


>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 498

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 77/232 (33%), Positives = 112/232 (48%), Gaps = 33/232 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G   + TPN+D LA +G++ ++ Y     C+PSRA  LTG+ P R+G +  +
Sbjct: 45  GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104

Query: 82  GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            A   +         +PV+EK L  +L   GY+T LIGKWH+G   E   P  RGFD+  
Sbjct: 105 NASDERYATRPELLGLPVSEKTLGDHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR- 192
           G   G      S H     +     RN +R     S++YLTDFFTD+ +  I  H  ++ 
Sbjct: 164 GMLTG------SHHYFPTTMNHVIERNGQR-VEDFSNEYLTDFFTDEGLRFIDQHEAAKP 216

Query: 193 --PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             P F+  ++ A HT                  E +   FA+I N  RR +A
Sbjct: 217 DQPWFVFYSYNAPHTPMHAT-------------EADLARFANIQNKKRRTYA 255


>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 474

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 82/260 (31%), Positives = 122/260 (46%), Gaps = 57/260 (21%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+ + G +G  DIPTPNID L  +G+  +  Y + P C  SRAA +TG+Y  R+G +  P
Sbjct: 39  GYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNP 98

Query: 81  VGAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
           +GA  A     +PV EK +   L+++GY+T L+GKWH+G       P  RGFD   G+  
Sbjct: 99  IGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLG-GTAPFHPQRRGFDEFFGFLH 157

Query: 136 ---------WNGYLT-----------------------YNDSIHETDFAVGLDARRNMER 163
                    W+G  T                       ++  +HE + A   DA   + R
Sbjct: 158 EGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWSTDLHENEPAY--DADNPLLR 215

Query: 164 YAPQMSSKY-LTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
            +  +  K  LTD FT ++   I  H  ++P FL + + AVH+   G             
Sbjct: 216 NSQPVEEKANLTDAFTREACSFIDRH-QAQPWFLYLAYNAVHSPLQGEDTY--------- 265

Query: 223 MEENDRTFAHISNPDRRLFA 242
           ME+    F+HI +  RR+FA
Sbjct: 266 MEK----FSHIGDIQRRIFA 281


>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 471

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 69/200 (34%), Positives = 101/200 (50%), Gaps = 27/200 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D GF G  ++ TPN+D LA +G+   + Y T  TC PSRA  +TGKY  R+G +   
Sbjct: 33  GFADFGFQGSTEMKTPNLDKLANSGVKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 92

Query: 82  GAGVAK----------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
             G              +P+ +  +  YLK+LGY+T + GKWH+G N +   P NRGFD 
Sbjct: 93  VPGYMSENSKFLADDMGLPLDQLTIGDYLKKLGYNTAMYGKWHLG-NADRFHPMNRGFDE 151

Query: 132 HVGYWNGYLTY------NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
             G+  G  +Y      + + H+T    G     N E   PQ   +Y+TD   D+++  I
Sbjct: 152 FYGFRGGARSYFGYDVASSAHHDTKMERGFG---NFEE--PQ---EYVTDALADEAISFI 203

Query: 186 KSHNHSRPLFLQITHAAVHT 205
           +  N   P F+ +   AVHT
Sbjct: 204 EK-NKKNPFFIYLAFNAVHT 222


>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 494

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 75/234 (32%), Positives = 111/234 (47%), Gaps = 28/234 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG--IDT 79
           G++DVGF+G  DI TP +D LA  G +    Y   P C PSRAA LTGKYP   G   + 
Sbjct: 53  GYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAALLTGKYPHTIGSQFNL 112

Query: 80  PV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           P  G  + K +   E+ + + L+E GY T  IGKWH+G   EE  P  RGF +  G+  G
Sbjct: 113 PANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLGAT-EEFHPNQRGFTDFYGFLGG 171

Query: 139 YLTY--------NDSIHETDFAVGLDARRNMERYAPQM-SSKYLTDFFTDQSVHVIK-SH 188
              Y             +    +  D    +E    ++  ++YLTD F+ ++   +K + 
Sbjct: 172 GHNYFPEQYQAQYQKQKKAKKKIIRDYILPLEHNGKEVKETEYLTDAFSREASRFVKEAS 231

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           N  +P FL + + A H             + +   EE+   F+ I + DRR +A
Sbjct: 232 NKKKPFFLYLAYNAPH-------------VPLEAKEEDLEKFSVIKDKDRRTYA 272


>gi|313215712|emb|CBY16310.1| unnamed protein product [Oikopleura dioica]
          Length = 350

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 77/121 (63%), Gaps = 2/121 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G+    D+ +PNID LA N +   ++Y  P+CTPSRAA +TG+Y  RYG+ + V 
Sbjct: 130 GYDDLGYVNP-DVKSPNIDYLANNALHFEKYYNQPSCTPSRAALMTGRYNIRYGLQSGVI 188

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                +A+P++E LLP+  K+ GY+T + GKWH+G   EE  P  RGFD   G++ G   
Sbjct: 189 KPDEPEAIPLSETLLPKAFKKCGYNTSMHGKWHLGYYTEEHCPQKRGFDRFFGFYLGSQD 248

Query: 142 Y 142
           Y
Sbjct: 249 Y 249


>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
          Length = 501

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/187 (32%), Positives = 96/187 (51%), Gaps = 13/187 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G     D+ TP +D LA  G+    +Y +  C+PSR +F++G+YPF   +   V 
Sbjct: 33  GYHDIGLRNP-DLHTPTLDKLATKGVQFKNNYVMHACSPSRHSFMSGRYPFTSQMQKDVI 91

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
             V+    P+  K LP+YLKELGY TH +GKWH+G  +E+ +P +RGFD+  G  +G   
Sbjct: 92  FPVSPDCSPLKLKFLPEYLKELGYGTHAVGKWHLGYCREDCMPTSRGFDSFYGTLDGEGD 151

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           YLT+  +        G   R    +++  + +  L D        +I+     +P FL  
Sbjct: 152 YLTHMSAGFYDWHTNGTVDRSKSGQHSQDLHTAALAD--------IIERQTEEKPFFLYF 203

Query: 199 THAAVHT 205
                HT
Sbjct: 204 AAQNPHT 210


>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 498

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 112/232 (48%), Gaps = 33/232 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G   + TPN+D LA +G++ ++ Y     C+PSRA  LTG+ P R+G +  +
Sbjct: 45  GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104

Query: 82  GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            A             +P +EK L  +L   GY+T LIGKWH+G   E   P  RGFD+  
Sbjct: 105 NASDENYATRPELLGLPKSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHNH 190
           G   G      S H     +     RN +R     SS+YLTDFFTD+ +  I   KS N 
Sbjct: 164 GMLTG------SHHYFPTTMNHVIERNGKR-VENFSSEYLTDFFTDEGLRFIDQHKSANP 216

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +P F+  ++ A HT              +   E +   FA+I N  RR +A
Sbjct: 217 DQPWFVFFSYNAPHT-------------PMHATEADLARFANIQNQKRRTYA 255


>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1605

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/209 (33%), Positives = 105/209 (50%), Gaps = 39/209 (18%)

Query: 23  GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
           G NDVG+   + +  TP ID LA  G+ L  +Y++  CTP+RAA ++G YP R G+    
Sbjct: 103 GSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCTPARAALMSGHYPMRIGMQLEN 162

Query: 78  ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              D+P G      +P     + + LK LGY+TH +GKW +G ++   LP NRGFD   G
Sbjct: 163 IKPDSPWG------MPRELTTMAETLKNLGYNTHGVGKWGLGHSQHGFLPVNRGFDTWYG 216

Query: 135 YWNGYLTYNDSIHE------------------TDFAVGLDARRNMERYAPQMSSKYLTDF 176
           Y +  + Y    HE                  TD+     +R    +Y P ++  + ++ 
Sbjct: 217 YLSDEIDYYS--HEYPAPFETVEDGATVMASFTDYVFMERSRPYDLQYMPDLNGTHSSEL 274

Query: 177 FTDQSVHVIKSHNHSR-PLFL----QITH 200
           +T +   ++KS N SR PLF+    Q+TH
Sbjct: 275 YTQRVQQIVKSANASREPLFVYYASQMTH 303


>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
          Length = 482

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 73/209 (34%), Positives = 100/209 (47%), Gaps = 33/209 (15%)

Query: 25  NDVGFHGENDIP-------TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI 77
           +D+GF   ND+P        PN+ +LA NG +L+  Y  P CTPSR+A +T +YP R G+
Sbjct: 91  DDLGF---NDMPWNNPAIIAPNLHSLAKNGTILSNFYVQPVCTPSRSALMTSRYPIRLGL 147

Query: 78  DTPV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
            T V  A     +P+ E  +    +  GY+TH++GKWH+G    + LP NRGFD   GY 
Sbjct: 148 QTDVITAPQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLGHYCPQCLPNNRGFDTFRGYL 207

Query: 137 NGYLTYNDSI-------HETDFAVGLDARRNMERYAPQMSSKYLTD-------------F 176
            G   Y           ++   A G D   N  R  P+ +  Y T               
Sbjct: 208 TGAEDYYKKTFCIPLVPNQRPAACGFDFYDNENR-MPKANGTYSTYQVLIYLFIHTIILK 266

Query: 177 FTDQSVHVIKSHNHSR-PLFLQITHAAVH 204
           F D S  VIKSH  S+ P FL +   +VH
Sbjct: 267 FADASREVIKSHEGSKTPFFLYLPFQSVH 295


>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 485

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 79/256 (30%), Positives = 112/256 (43%), Gaps = 51/256 (19%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+ ++G  G   IPTP+ID+LA NG+     Y T   C+PSRA  LTG+Y  R+G +  P
Sbjct: 53  GYGELGCQGNPQIPTPHIDSLAANGVRFRCGYVTAAYCSPSRAGLLTGRYQSRFGYEQNP 112

Query: 81  VGAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            GA        +P+ EK L + L + GY+T L+GKWH+G       P  RGFD   G+ +
Sbjct: 113 TGARNEDPELGLPLEEKTLARRLHDAGYATGLVGKWHLG-GTARFHPLRRGFDEFFGFLH 171

Query: 138 G--------YLTYNDSIHETDFAVGLDARRNMERY-----------------------AP 166
                    Y   +  +       G   R   ER                         P
Sbjct: 172 EGHFFVPPPYEGVSTFLRRRALPNGKTGRWGDERLMLSTHMGHDEPAYDANNPILRGGQP 231

Query: 167 QMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN 226
              + YLTD FT ++   I + N  RP FL + + AVH+   G              +  
Sbjct: 232 VEEAAYLTDAFTREACDFI-ARNQDRPFFLYLAYNAVHSPLQG-------------ADAY 277

Query: 227 DRTFAHISNPDRRLFA 242
            + FAHI++  RR+FA
Sbjct: 278 MQQFAHIADQQRRIFA 293


>gi|262384881|ref|ZP_06078013.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
 gi|262293597|gb|EEY81533.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
          Length = 589

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 111/248 (44%), Gaps = 57/248 (22%)

Query: 5   VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
           VGAG   A    +K LP         QGW D+GF G   + TPNID +A+ G +L   Y 
Sbjct: 13  VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70

Query: 56  LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            P  +P+RA FLTG+Y  R G+++  G G  +     EK + +Y +E GY+T L GKWH 
Sbjct: 71  CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNQGEKTIAEYFREAGYATSLFGKWHS 128

Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
           G  +    P  RGF+   G        YWN  L +N  I   +                 
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170

Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
               ++ D  TD+++  I+ H    P F+ +++   H+            +QVPD   N 
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215

Query: 227 --DRTFAH 232
             DRT + 
Sbjct: 216 VKDRTLSQ 223


>gi|296122626|ref|YP_003630404.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014966|gb|ADG68205.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/190 (35%), Positives = 95/190 (50%), Gaps = 15/190 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G  G +   TP IDALA +G    + Y+  P C+P+RAA +TGK P R GI   +
Sbjct: 40  GKTDIGIEGSSFYETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWI 99

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD-----NHVGYW 136
                 A+P +E  + Q  +E GY T  +GKWH+G +K +  P  RGFD     NH G  
Sbjct: 100 RPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLG-HKPQQHPAARGFDWTKGVNHGGQP 158

Query: 137 NG-YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           +  Y  Y +           DA  N+  +       YLTD  T  ++  ++  + +RP F
Sbjct: 159 SSYYFPYKNPQKP-------DAPNNVPDFEKCQPEDYLTDVLTSSAIEHLQQRDRTRPFF 211

Query: 196 LQITHAAVHT 205
           L + H AVHT
Sbjct: 212 LCLAHYAVHT 221


>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 97/190 (51%), Gaps = 9/190 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV FH    I TPNID L   G+ L  +YT   CTP+R+A LTG+YP   G+ T V 
Sbjct: 34  GWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSALLTGRYPIHTGLQTNVI 93

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYST-HLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           A   +  +   E LLP+YL+       H++GKWH+G     + P+ RGF+   GY  G  
Sbjct: 94  AISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMTPWKRGFETFSGYLAGAE 153

Query: 139 -YLTYNDSIHETDFAVGLD-ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPL 194
            + T    + +TD+  G+D +  +     P  SS  KY  D +  +   ++KS + ++  
Sbjct: 154 DHYTREWCMEQTDWC-GVDYSEHSASLSGPTNSSWGKYSGDLYLQKMSEIVKSIDPTKDS 212

Query: 195 FLQITHAAVH 204
           F+      VH
Sbjct: 213 FIYFAPQHVH 222


>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 97/190 (51%), Gaps = 9/190 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DV FH    I TPNID L   G+ L  +YT   CTP+R+A LTG+YP   G+ T V 
Sbjct: 34  GWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSALLTGRYPIHTGLQTNVI 93

Query: 83  A-GVAKAVPVTEKLLPQYLKELGYST-HLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
           A   +  +   E LLP+YL+       H++GKWH+G     + P+ RGF+   GY  G  
Sbjct: 94  AISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMTPWKRGFETFSGYLAGAE 153

Query: 139 -YLTYNDSIHETDFAVGLD-ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPL 194
            + T    + +TD+  G+D +  +     P  SS  KY  D +  +   ++KS + ++  
Sbjct: 154 DHYTREWCMEQTDWC-GVDYSEHSASLSGPTNSSWGKYSGDLYLQKMSEIVKSIDPTKDS 212

Query: 195 FLQITHAAVH 204
           F+      VH
Sbjct: 213 FIYFAPQHVH 222


>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 72/225 (32%), Positives = 113/225 (50%), Gaps = 23/225 (10%)

Query: 23  GWNDVGFHGE-NDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           G+ DV +HG   +  TP+ID++A +G      Y+  P C PSRA  L+G+Y  R+G    
Sbjct: 34  GYGDVSYHGTLKETTTPHIDSIAQSGAWFQNGYSAAPVCGPSRAGLLSGRYQQRFGYYDN 93

Query: 81  VG-----AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           +G       V   +P+++KL+P+ L + GY+T ++GKWH G ++ +  P+NRGF    G+
Sbjct: 94  IGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVGKWHDG-DQHKFWPYNRGFQEFYGF 152

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            NG +  N  +   +  V      + E    + S +Y+T+ F  ++V  I  H  + P F
Sbjct: 153 NNGAIN-NWVLKGENHTVDEWGAVHRENKRVENSGEYMTEAFGREAVEFIDRHK-TEPFF 210

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
           L ++  AVH           G LQ P    N   F HI   +R L
Sbjct: 211 LYLSFNAVH-----------GPLQAPKSYTN--QFKHIKPENRAL 242


>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 498

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 78/232 (33%), Positives = 110/232 (47%), Gaps = 33/232 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G   + TPN+D LA +G++ ++ Y     C+PSRA  LTG+ P R+G +  +
Sbjct: 45  GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104

Query: 82  GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            A             +P +EK L  +L   GY+T LIGKWH+G   E   P  RGFD+  
Sbjct: 105 NASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
           G   G      S H     +     RN +R     SS+YLTDFFTD+ +  I  H   N 
Sbjct: 164 GMLTG------SHHYFPTTMKHVIERNGKR-VDGFSSEYLTDFFTDEGLRFIDQHESANP 216

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +P F+  ++ A HT                  E +   FA+I N  RR +A
Sbjct: 217 DQPWFVFFSYNAPHTPMHAT-------------EADLARFANIQNQKRRTYA 255


>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 551

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 95/194 (48%), Gaps = 28/194 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVG+HG N I TP++D LA  G+ LNR YT P C+P+RAA +TG+ P R GI   V 
Sbjct: 47  GWNDVGYHGGN-IDTPSLDKLAEQGVQLNRFYTTPICSPTRAALMTGRDPMRLGIAYGVI 105

Query: 82  ----GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
                 GV  A    E  +PQ  +  GY T ++GKWH+G  +    P  RGF++  G+ +
Sbjct: 106 LPWDNIGVNPA----EHFMPQSFQAAGYQTAMVGKWHLGHAQMTYHPNQRGFEHFYGHLH 161

Query: 138 ---GYLTYNDSIHETDF---AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
              G+     ++   DF    V +D               Y T    D+    I+  +  
Sbjct: 162 TEVGFYPPFANVGGKDFQENGVSID------------DEGYETYLLADEVSRYIRDRDEE 209

Query: 192 RPLFLQITHAAVHT 205
           +P F+ +   A HT
Sbjct: 210 KPFFIYMPFIAPHT 223


>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
 gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
          Length = 526

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/185 (36%), Positives = 98/185 (52%), Gaps = 9/185 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DVG+HG +DI TP+ID LA  G  LNR Y  P C+P+RAA +TG+ P + G+   V 
Sbjct: 62  GWGDVGYHG-SDIQTPHIDRLAKEGAKLNRFYATPFCSPTRAALMTGRDPLKLGVAYSVL 120

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  V + E  LPQ  +  GY+T ++GKWH+G   E+  P  RGFD   G+ +  ++
Sbjct: 121 MPWENGGVSLDEHFLPQSFQAAGYNTAMVGKWHLGHTIEQHTPNARGFDLFYGHMHTQVS 180

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRPLFLQITH 200
           Y D  H+   A G D + N +      + +Y TD    Q+   I    + ++P  L +  
Sbjct: 181 YFD--HQ--IANGHDFQENGK--PVDHNGEYATDVHGAQAARFITDLRDKTKPFLLYVPF 234

Query: 201 AAVHT 205
            A H+
Sbjct: 235 LAPHS 239


>gi|291231158|ref|XP_002735532.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 191

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/127 (40%), Positives = 75/127 (59%), Gaps = 5/127 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG-IDTPV 81
           GWND+G+H      TPN+++LA +GI L  +Y  P CTPSR   LTG+Y  RYG +   +
Sbjct: 35  GWNDIGYHNP-IFQTPNLNSLAADGIKLENYYVAPVCTPSRGQLLTGRYAMRYGLVHRNI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW---NG 138
                  +P+ E  LP+ +K+ GY+TH++GKWH G      +P  RGFD+  G++     
Sbjct: 94  RPAQRMCLPLDEVTLPEKMKQAGYATHMVGKWHQGFYTPACIPTQRGFDSFFGFYICTED 153

Query: 139 YLTYNDS 145
           Y T++ S
Sbjct: 154 YFTHSAS 160


>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
 gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
          Length = 482

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 106/223 (47%), Gaps = 18/223 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
           G+ D GF G + + TPN+D LA  G V  + Y +   C PSRA  LTGKY  R+G +   
Sbjct: 49  GYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 108

Query: 79  -----TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
                +  G  G    +P+ +K +  YL+E GY T LIGKWH G N +   P  RGFD  
Sbjct: 109 VPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQG-NADRFHPTKRGFDEF 167

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDA-RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
            G+  G  +Y     +   +   D   R    +  Q S +YLT+    ++V  IK  N  
Sbjct: 168 YGFRGGARSYFGFGAQNPVSYPEDKLERGFAHF--QESKRYLTEALATETVEFIK-RNQK 224

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
            P F+ ++  AVHT        P  L Q  +++   +  A ++
Sbjct: 225 HPFFVFLSFNAVHTPMEAK---PADLAQFSNLKGKRQQLAAMT 264


>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 481

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 113/215 (52%), Gaps = 28/215 (13%)

Query: 5   VGAGVAKAVP-----VTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNR-HYTLPT 58
           +G+G   A+P     +T+ L   G+ D+G +G  DI TP ID+LA +G  L+  H   P 
Sbjct: 56  LGSGTNDALPHIVLIMTDDL---GYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPV 112

Query: 59  CTPSRAAFLTGKYPFRYGIDTPVGAG-VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGC 117
           CTP+RAA LTG++P R G++  + A      +PV E +L + LKE+GY T ++GKWH+G 
Sbjct: 113 CTPTRAALLTGRWPQRVGLEWALSASDTEPGLPVEEPILSRPLKEVGYRTVMVGKWHLG- 171

Query: 118 NKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLT 174
            + E  P   GFD   G  +G   + ++ +   + D+          E   P     Y T
Sbjct: 172 YRPEFGPNAHGFDEFFGLLSGNVDHYSHREINGKEDW---------YENTKPVRVEGYST 222

Query: 175 DFFTDQSVHVIKSH-----NHSRPLFLQITHAAVH 204
           D  +D++V  I+       +  +PL+L + + AVH
Sbjct: 223 DLLSDRAVAAIQKTAAQPPDQRQPLWLYVAYNAVH 257


>gi|313212372|emb|CBY36360.1| unnamed protein product [Oikopleura dioica]
 gi|313214813|emb|CBY41065.1| unnamed protein product [Oikopleura dioica]
          Length = 174

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 74/119 (62%), Gaps = 5/119 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+ D+G++ ++    PN+D LA NGI+ +  YT P CTPSRA F+TG+Y  R G+    +
Sbjct: 53  GYADIGYNSDHAF-MPNMDFLANNGIIFDSFYTQPVCTPSRAQFMTGRYTNRLGLQHRNI 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            +     +P+ EK +P+Y +E GYST +IGKWH+G      LP NRGF   V  + GY+
Sbjct: 112 LSAQPSGIPLDEKTVPEYFRECGYSTEMIGKWHLGLFTSNFLPHNRGF---VSGFQGYI 167


>gi|313214045|emb|CBY42606.1| unnamed protein product [Oikopleura dioica]
          Length = 191

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/185 (32%), Positives = 98/185 (52%), Gaps = 5/185 (2%)

Query: 40  IDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PVGAGVAKAVPVTEKLLPQ 98
           +D L  NG    + Y+   C+PSRA  LTG+Y FR G+ + P+   V   +   +K LP+
Sbjct: 1   MDKLVKNGTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPIAREVPFGMNTQDKTLPE 60

Query: 99  YLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR 158
           YLKE+GY TH +GKWH+G      LP +RGFD   G+++G + Y     +       D  
Sbjct: 61  YLKEVGYDTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVDYRGHFIKRSKNFYHDFF 120

Query: 159 RN-MERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSR-PLFLQITHAAVHTGTAGNAKLP 214
            N +E++   + S  ++ TD F D+++ ++K    S+ P ++ +   A H  T   A L 
Sbjct: 121 DNTIEQHKLDLESDGQWTTDLFRDRTIDILKEAKRSKTPAYVYLAFNAPHEPTRAPADLI 180

Query: 215 TGLLQ 219
             +L+
Sbjct: 181 ARILE 185


>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
          Length = 511

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 97/196 (49%), Gaps = 15/196 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  GE +  TP +D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 31  GWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTGRLPIRNGFYTTN 90

Query: 82  G--------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P  E LLP+ L++ GY + L+GKWH+G ++ + LP   GFD   
Sbjct: 91  GHARNAYTPQNIVGGIPDEEILLPELLQKAGYKSKLVGKWHLG-HQAKYLPLKHGFDEWF 149

Query: 134 GYWNGYLTYNDSIHETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSH 188
           G  N +    D++H  +  V     +  R   +    +     LT  +T ++V  I + H
Sbjct: 150 GAPNCHFGPYDNVHTPNIPVYRNEEMAGRYYQDFKIEKNGESNLTQLYTKEAVEFITRMH 209

Query: 189 NHSRPLFLQITHAAVH 204
           N S+P FL     A H
Sbjct: 210 NKSKPFFLYWAVDATH 225


>gi|421613763|ref|ZP_16054834.1| arylsulfatase A [Rhodopirellula baltica SH28]
 gi|408495349|gb|EKJ99936.1| arylsulfatase A [Rhodopirellula baltica SH28]
          Length = 616

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 92/187 (49%), Gaps = 24/187 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  H    I TP +DALA     L+R Y  P C P+RAA LTG+YP R G+    
Sbjct: 72  QGWGDLAAHRNPKISTPTLDALANESARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 127

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            AGV    + +   E  L +  +  GY+T   GKWH G  +  L P  +GF+   G+  G
Sbjct: 128 -AGVTGRREVMRAEETTLAELYRAAGYATGCFGKWHNGA-QMPLHPNGQGFNEFFGFCGG 185

Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           +   Y+D++ E          RN     P  +  Y+TD  TD +V  I++H H RP F  
Sbjct: 186 HFNLYDDALLE----------RN---GTPVQTKGYITDVLTDAAVEFIQNH-HDRPFFCY 231

Query: 198 ITHAAVH 204
           +   A H
Sbjct: 232 VPFNAPH 238


>gi|32475139|ref|NP_868133.1| arylsulfatase [Rhodopirellula baltica SH 1]
 gi|32445680|emb|CAD78411.1| arylsulfatase homolog b1498 [Rhodopirellula baltica SH 1]
          Length = 656

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 92/187 (49%), Gaps = 24/187 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  H    I TP +DALA     L+R Y  P C P+RAA LTG+YP R G+    
Sbjct: 112 QGWGDLAAHRNPKISTPTLDALANESARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 167

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            AGV    + +   E  L +  +  GY+T   GKWH G  +  L P  +GF+   G+  G
Sbjct: 168 -AGVTGRREVMRAEETTLAELYRSAGYATGCFGKWHNGA-QMPLHPNGQGFNEFFGFCGG 225

Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           +   Y+D++ E          RN     P  +  Y+TD  TD +V  I++H H RP F  
Sbjct: 226 HFNLYDDALLE----------RN---GTPVQTKGYITDVLTDAAVEFIQNH-HDRPFFCY 271

Query: 198 ITHAAVH 204
           +   A H
Sbjct: 272 VPFNAPH 278


>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
 gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 525

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 76/229 (33%), Positives = 111/229 (48%), Gaps = 29/229 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCT---PSRAAFLTGKYPFRYGIDT 79
           GW DVG+ G  D+ TPNIDALA  G+  ++ Y   +C+   PSRA  LTG Y  R+G   
Sbjct: 43  GWGDVGYQGAVDVSTPNIDALARRGVQFSQGYV--SCSISGPSRAGILTGVYQQRFGFYN 100

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            +       +P  +  L + +++ GY+T  +GKWH+  + E+  P  RGFD   G+W+  
Sbjct: 101 NLHPWA--KIPEGQSTLGEMVRDCGYATGFVGKWHMADSPEQ-SPNRRGFDQFYGFWSDT 157

Query: 140 LTYNDS-----IHETDFAVGLDARRNMERYAP-QMSSKYLTDFFTDQSVHVIKSHNHSRP 193
             Y  S     +   DF       RN E   P   S +Y+TD FT ++V  I  H  S P
Sbjct: 158 HDYYRSTDKPGVELYDFC---PLYRNGEIQPPLHESGEYITDCFTREAVEFIDKHASS-P 213

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             L +++ AVH+             QVP+   N        + DR++FA
Sbjct: 214 FLLCLSYNAVHSP-----------WQVPEHYVNRLEGRRFHHEDRKVFA 251


>gi|313237610|emb|CBY12754.1| unnamed protein product [Oikopleura dioica]
          Length = 168

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/109 (44%), Positives = 70/109 (64%), Gaps = 2/109 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+ D+G++ ++    PN+D LA NGI+ +  YT P CTPSRA F+TG+Y  R G+    +
Sbjct: 53  GYADIGYNSDHAF-MPNMDFLANNGIIFDSFYTQPVCTPSRAQFMTGRYTNRLGLQHRNI 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
            +     +P+ EK +P+Y +E GYST +IGKWH+G      LP NRGF+
Sbjct: 112 LSAQPSGIPLDEKTVPEYFRECGYSTEMIGKWHLGLFTSNFLPHNRGFN 160


>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
 gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
          Length = 460

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 66/183 (36%), Positives = 97/183 (53%), Gaps = 9/183 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDV +HG + I TPNID LA NG+ LNR Y  PTC+PSRA+  TG+   R GI  P+ 
Sbjct: 46  GWNDVEYHG-SVIQTPNIDFLAKNGVELNRFYANPTCSPSRASLFTGRPASRMGIVAPIS 104

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 +P +   LP+ L +  Y T LIGKWH+G  ++   P   GFD   G+ +G +  
Sbjct: 105 DKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLGL-QQSSGPKAYGFDYSYGFLHGQIDQ 163

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQITHA 201
              +++          RN E    Q    + TD  T++++H +     S +  FL++ ++
Sbjct: 164 YTHLYKNG---DKSWYRNGEFIDEQ---GHATDLITNEAIHWLSEKRDSNKNFFLEVAYS 217

Query: 202 AVH 204
           A H
Sbjct: 218 APH 220


>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 495

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 70/187 (37%), Positives = 92/187 (49%), Gaps = 18/187 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP---FRYGIDT 79
           G+ DVGF     I +PN D LA  G+VLNRHY    C PSRA+ LTG++P   +++ + T
Sbjct: 35  GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTGRWPHHVYQWNLAT 93

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
              AG      +   + P  LK   YSTH++GKWH G      LP NRGFD   G+  G 
Sbjct: 94  DATAGTN----LNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRGFDTSSGFLCG- 148

Query: 140 LTYNDSIHETDFAV-GLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
                  H T  A+  +D  +N    AP   +  Y    + D    +I SHN   PLFL 
Sbjct: 149 ----SEDHMTQNAICAIDYWKNN---APDPRNGTYDAYIYRDDLTDIINSHNTDEPLFLY 201

Query: 198 ITHAAVH 204
           +    VH
Sbjct: 202 LPLHNVH 208


>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 539

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/187 (33%), Positives = 93/187 (49%), Gaps = 13/187 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DVGFHG   I TP++D +A  G  LNR YT P C+P+RAA +TG+ P R G+  + +
Sbjct: 37  GWADVGFHGNQIIETPSLDRIAAEGTQLNRFYTTPICSPTRAALMTGRDPIRLGVAYSTI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN---G 138
                  +   E  LP+     GY T ++GKWH+G  ++   P  RGF++  G+ +   G
Sbjct: 97  MPWHNNGIHPEETFLPELFAGAGYQTAMVGKWHLGHAQQTYHPNARGFEHFYGHLHTEVG 156

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           +     S+   DF      +RN      Q    YL     D+    I+  + ++P F+ +
Sbjct: 157 FFPPFASLGGKDF------QRNGVSIDDQGYESYL---LADEVSRYIRERDAAKPFFIYM 207

Query: 199 THAAVHT 205
              A HT
Sbjct: 208 PFIAPHT 214


>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 489

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 86/159 (54%), Gaps = 6/159 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG ++I TPNID LA   + L+R Y  P C+P+RA  LTG YPFR+GI   V 
Sbjct: 57  GWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +   K  +P   +  P++L +LGY    + GKWH+G       P + G     G++NG +
Sbjct: 116 SPTKKHGLPTLLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 175

Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFF 177
            Y   +   + D+    D+    E Y+ ++    + DF 
Sbjct: 176 DYFSRERFGQLDWHRDFDSVHE-EGYSTELVGNAVVDFI 213


>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
          Length = 534

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 65/190 (34%), Positives = 91/190 (47%), Gaps = 10/190 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ DVG+    D+ +P +D LA  G+ L RHY    C PSRAA LTG YP   G+ +  G
Sbjct: 38  GFGDVGYS-SPDVISPTLDRLAAEGLKLGRHYAYMWCAPSRAALLTGYYPSTTGVYSTSG 96

Query: 83  AGVAKAVPVTEKLLPQYLKE-LGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
           A    A+P+   LLP  L++  GY T  +GKWH+G   E  LP  RGFD   G+ +G   
Sbjct: 97  A--QNALPLEFALLPGLLRDRAGYRTAAVGKWHLGFMSEADLPERRGFDGFFGFLDGGED 154

Query: 139 -YLTYNDSIHETD--FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
            Y          D  F +    RR      P    +Y  + + + +   I+ H+ + PLF
Sbjct: 155 HYSRVGAGAPGCDRVFDLWDSRRRGPATDDPSAFGRYSAELYGEAAADAIRGHDAAEPLF 214

Query: 196 LQITHAAVHT 205
           L       H+
Sbjct: 215 LYAAFQVAHS 224


>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 980

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 74/224 (33%), Positives = 103/224 (45%), Gaps = 18/224 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+   G +G+    TP++DALA  G+   N + +   CTPSRA  LTG+Y  R+G++T  
Sbjct: 41  GFQGGGINGDFANLTPHLDALAEGGVRFTNGYVSAAVCTPSRAGMLTGRYQHRFGVETVY 100

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G      +P +E  +   L++ GY T+ IGKWH+G +  E LP  RGFD   G   G  T
Sbjct: 101 GRIPEAGLPASEITMADTLRKAGYRTYAIGKWHLGEHLHEHLPNQRGFDEFYGALTGART 160

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--NHSR-PLFLQI 198
           +           G   +RN       +   Y TD    Q+V  I  H  NH+  P FL +
Sbjct: 161 F---FPYRGNNPGSKLQRNGVFLPEPLDQPYFTDLLARQTVAYIDDHVANHANAPFFLYL 217

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              AVHT    + K             +DR    IS P R+  A
Sbjct: 218 AFTAVHTPLEADPK-----------RLDDRRIQDISPPQRKTLA 250


>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
 gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
          Length = 510

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 117/243 (48%), Gaps = 30/243 (12%)

Query: 9   VAKAVPVTEKLLPQ--GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAA 65
           VAK  P    +L    G+ D+GF G  +I TPNIDALA NG+V    Y T P C PSRA 
Sbjct: 52  VAKERPNIVVILADDLGYADLGFTGSKEIFTPNIDALANNGVVFKNGYVTHPYCGPSRAG 111

Query: 66  FLTGKYPFRYGIDTPVGAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
            LTG+Y  R+G++             +PV E    + +++ GY T ++GKWH+G +    
Sbjct: 112 LLTGRYQARFGMEVNAAHSPDDPYMGLPVEELTFAKRMQQAGYKTAVMGKWHMGSHP-NF 170

Query: 123 LPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
            P NRGFD   G+  G   Y   +  +   ++++ L   RN +   P   ++YLT   + 
Sbjct: 171 HPNNRGFDEFFGFLGGGHDYFPESVKVSSAEYSIALS--RNGK---PAQLNEYLTTAISK 225

Query: 180 QSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
           ++   + +    +P  + + + A H+                  E++   + HI++ DRR
Sbjct: 226 EAARFVSA--TEQPFMMYVAYNAPHSPLQAT-------------EQDLAKYQHIADLDRR 270

Query: 240 LFA 242
            +A
Sbjct: 271 TYA 273


>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
          Length = 913

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 98/192 (51%), Gaps = 15/192 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DV +H  + + TP + ALA +G+VL+R Y    C+PSR++ L+G+YP         G
Sbjct: 422 GWHDVPWHNPS-LKTPTLAALAADGVVLDRFYAYRFCSPSRSSLLSGRYPMHVNQYNMAG 480

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
             +   V V    + + LK  GY+TH +GKWH G +  +L+P  RGFD  +GY NG    
Sbjct: 481 DALGGGVHVNMTTIAKKLKGAGYATHQLGKWHAGQSSADLVPAARGFDTSLGYLNG---A 537

Query: 143 NDSIHETDFAVGLDARRNMERYAPQ-----MSSKYLTDFFTDQSVHVIKSHNHSRPLFL- 196
            D   +   A G+     ++ YA        +  Y    + D ++ +I  H+ S PLF+ 
Sbjct: 538 EDHWTQARPACGVG--NFVDLYATDGPAFGKNGTYGAQIYHDAALDIIADHDASVPLFVY 595

Query: 197 ---QITHAAVHT 205
              QI HA +  
Sbjct: 596 FAFQINHAPMQV 607


>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 540

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/210 (32%), Positives = 101/210 (48%), Gaps = 8/210 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW D+   G   I TP ID+L   G+ L+R YT P C+P+RAA +TG+ P R GI  + V
Sbjct: 35  GWADISLRGA-PIDTPAIDSLFSEGLTLDRFYTTPICSPTRAALMTGRDPLRLGISYSVV 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
              +   V   E  +P+  K  GY T ++GKWH+G ++E   P  RGFD+  G+ +  + 
Sbjct: 94  MPWMNNGVHPDEHFMPESFKAAGYQTAMVGKWHLGHSQEIFHPNARGFDDFYGHLHTEVG 153

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y           G+D +RN    A +    Y T    D++   IK+ +  +P FL +   
Sbjct: 154 YFLPFANQG---GVDFQRNGVTIADE---GYETFLLADEASRWIKARDKDKPFFLYMPFI 207

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFA 231
           A H+       L      + D  E  R+ A
Sbjct: 208 APHSPLEAPDDLVKKYENLEDTRELTRSAA 237


>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
 gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
          Length = 443

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 99/187 (52%), Gaps = 6/187 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G + + TP++D LA  GI     Y+  P C+PSRA+ LTGKYP R G+   +
Sbjct: 19  GYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEIL 78

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           GA      +P  E  L + LK  GY T L GKWH+G + EE  P   GFD   G+  G +
Sbjct: 79  GAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGLS-EETSPNAHGFDEFFGFKAGCV 137

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVI-KSHNHSRPLFLQ 197
            +   I     A G++   ++     ++  + +Y+T+  T++SV  I +S     P FL 
Sbjct: 138 DFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFLF 197

Query: 198 ITHAAVH 204
            ++ A H
Sbjct: 198 ASYNAPH 204


>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 610

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 23  GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G ND+G+   +    TP +D+LA +G+VL+ +YT   CTPSRA+ +TG+  FR G+    
Sbjct: 81  GTNDMGYRSTDLWELTPFLDSLASSGVVLDNYYTNQLCTPSRASLMTGRDSFRTGMQHGI 140

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD---------- 130
           V       +P  E  L    K  GYSTH+ GKWH+G   +   PF+RGFD          
Sbjct: 141 VDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYPFSRGFDTFLGYTGGGE 200

Query: 131 ---NHVGYWNGYLTYNDSIHETDFAVG-----LDARRNMERYAPQMSSKYLTDFFTDQSV 182
              NH   +       +     DF  G     +D   N  +  P M   Y T   TD+++
Sbjct: 201 GYYNHSTCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKEGPAMVDNYSTTIMTDRAI 260

Query: 183 HVIKSH----NHSRPLFLQITHAAVHT 205
            V + H    +   PLFL + + A HT
Sbjct: 261 DVAREHTGTASSDDPLFLYVAYQAAHT 287


>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 479

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 70/189 (37%), Positives = 101/189 (53%), Gaps = 10/189 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYG--IDT 79
           G+ D+G  G  +IPTP++D LA +GI   N + + P C+PSRA FLTGKY  R+G   + 
Sbjct: 49  GYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFLTGKYQTRFGHEFNP 108

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG- 138
            VG      +P+ E  +   L+  GY T LIGKWH G +K+   P +RGFD   G+  G 
Sbjct: 109 HVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGFSKDH-HPQSRGFDEFFGFLVGG 167

Query: 139 --YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
             YL + +       A   D         PQ    Y TD FT++++  + S   ++P FL
Sbjct: 168 HNYLLHKEVKARFGTAHSHDMIYRGREVEPQ--EGYATDLFTNEALRWM-SGPPNKPWFL 224

Query: 197 QITHAAVHT 205
            +++ AVHT
Sbjct: 225 YLSYNAVHT 233


>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 658

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 107/224 (47%), Gaps = 25/224 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW+DVG+HG + I TPNID LA  G+ L  +Y    C PSR   LTG+Y  R G+     
Sbjct: 215 GWHDVGYHG-SIIDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTGRYRIRTGL----Y 269

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-GCNKEELLPFNRGFDNHVGYWNG--- 138
             V   + + E  L   L E GY T ++GKWH+ G    E  P +RGF   +G+  G   
Sbjct: 270 GDVCDFMGIHEITLADKLYEAGYYTAMVGKWHLSGFQHRECYPAHRGFQTFLGFHGGSQN 329

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y T+      +++    D   N      +   +Y T  + +++  +I+ H   +PLFL +
Sbjct: 330 YFTHRRGGSNSEY----DFWANDTSIGREYDGRYSTMVYAEEAQRIIRHHRTEQPLFLYL 385

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +  AVH+            L VP   E D+    I +  RR++A
Sbjct: 386 SFQAVHSP-----------LLVPSAYE-DKYRTGIEDDKRRVYA 417


>gi|330505678|ref|YP_004382547.1| putative sulfatase [Pseudomonas mendocina NK-01]
 gi|328919964|gb|AEB60795.1| probable sulfatase precursor [Pseudomonas mendocina NK-01]
          Length = 629

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 112/229 (48%), Gaps = 26/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G ND+   G+   PTP +DAL+ + + L RHYT  TC+PSRA+ ++G++P   G     G
Sbjct: 48  GNNDIASWGDGRAPTPTLDALSASAVRLRRHYTDSTCSPSRASLISGRHPVSVGFQAD-G 106

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELLPFNRGFDNHVGYWNGYL 140
            G++  + VT   LP+ L+ LGY T  +GKWH+G     EE+ P  +GFD    YW G L
Sbjct: 107 LGLSPDL-VT---LPKSLRSLGYRTLHVGKWHLGEALEYEEIQPGQQGFD----YWFGML 158

Query: 141 TYNDSIHETDFAVGLDARRNMERY---------APQMSSKYLTDFFTDQSVHVIKSHNHS 191
             N  + +     G   RR              AP     YL D  TD++V ++KS    
Sbjct: 159 --NHFVLQGPGPDGRPVRRQPTHINPWLQDNGSAPAQHQGYLDDILTDKAVELVKSGVGE 216

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
           +P F+ +   + HT    +    T   Q PD  E  + FA +S  D  +
Sbjct: 217 KPWFINLWLFSPHTPYQPSPAFST---QFPDTPEG-KYFAILSQLDHNM 261


>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
 gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
          Length = 520

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 78/232 (33%), Positives = 110/232 (47%), Gaps = 33/232 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G   + TPN+D LA +G++ ++ Y     C+PSRA  LT + P R+G +  +
Sbjct: 67  GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNL 126

Query: 82  GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            A             +P +EK L  +L   GY+T LIGKWH+G   E   P  RGFD+  
Sbjct: 127 NASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 185

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHNH 190
           G   G      S H     +     RN +R     SS+YLTDFFTD+ +  I   KS N 
Sbjct: 186 GMLTG------SHHYFPATMKHVIERNGKR-VDDFSSEYLTDFFTDEGLRFIDQHKSANP 238

Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +P F+  ++ A HT                  E +   FA+I N  RR +A
Sbjct: 239 DQPWFVFFSYNAPHTPMHAT-------------EADLARFANIQNQKRRTYA 277


>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
          Length = 266

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 34/215 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           G+ D+G+HG  +  TPN+D LA  G+ L  +Y  P C+P+R+  +TG+Y    G+    +
Sbjct: 39  GYRDIGYHGA-EFATPNLDKLAAEGVKLENYYVQPICSPTRSQLMTGRYQIHTGLQHDII 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
                  +P+    +   LK +GYSTH IGKWH+G  K+E  P  RGFD++ GY  G   
Sbjct: 98  WPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRGFDSYYGYLEGGED 157

Query: 139 -YLTYN-DSIH-------------------------ETDFAVGLDARRNMERYAPQMSSK 171
            Y  YN D+ H                         + +   G D  R+M      M+  
Sbjct: 158 YYTYYNCDTFHNRTTPADTSILESYSPKNILLGKHEDENKWCGYDL-RDMNEPVTDMNGT 216

Query: 172 YLTDFFTDQSVHVIK-SHNHSRPLFLQITHAAVHT 205
           Y T  +T +++ +I  +    +P  L + + AVH+
Sbjct: 217 YSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHS 251


>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 472

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 14/187 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG ++I TPNID LA   + L+R Y  P C+P+RA  LTG YPFR+GI   V 
Sbjct: 40  GWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 98

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +   K  +P   +  P++L +LGY    + GKWH+G       P + G     G++NG +
Sbjct: 99  SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 158

Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
            Y   +   + D+    D+    E Y+ ++    + DF        I  + ++ P++  +
Sbjct: 159 DYFSRERFGQLDWHRDFDSVHE-EGYSTELVGNAVVDF--------IDRNANAGPVYAYV 209

Query: 199 THAAVHT 205
              A H+
Sbjct: 210 AFNAPHS 216


>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
          Length = 503

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 100/202 (49%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 23  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 82

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P  E+LLP+ LK  GY++ ++GKWH+G ++ +  P  RGFD   
Sbjct: 83  GHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKRGFDEWF 141

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVI-KS 187
           G  N +    D+    +  V  D     R  E +   + +    LT  +T +++  + + 
Sbjct: 142 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQLYTQEALDFVQRQ 201

Query: 188 HNHSRPLFL----QITHAAVHT 205
           H   RP FL      THA V+ 
Sbjct: 202 HAARRPFFLYWAIDATHAPVYA 223


>gi|313212712|emb|CBY36647.1| unnamed protein product [Oikopleura dioica]
          Length = 260

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 91/158 (57%), Gaps = 6/158 (3%)

Query: 59  CTPSRAAFLTGKYPFRYGIDT-PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGC 117
           C+PSRA FLTG+Y FRYG+ + P+       +   EKLLP+YLKE+GY TH +GKWH+G 
Sbjct: 38  CSPSRAQFLTGRYAFRYGLGSDPISFENPIGMSTKEKLLPEYLKEVGYETHAVGKWHLGY 97

Query: 118 NKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVG--LDARRNMERYAPQMSSKYLTD 175
             E   P NRGFD  +G++ G + Y+   H T  A+G  L+   N E + P+   ++ + 
Sbjct: 98  CNESFQPHNRGFDTFLGHYGGGVDYH--THATQGALGSYLNHFLNGEPHIPEDGFEFASY 155

Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKL 213
            +++++  V++  N  +P F+ +   A H   A    L
Sbjct: 156 AWSNRTRKVLRE-NTDKPNFVYLAFNAPHEKVAAPQDL 192


>gi|300773187|ref|ZP_07083056.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300759358|gb|EFK56185.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 443

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 96/185 (51%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVG +G  +I TPN+D +A  G+  + +Y+  P CT SR A LTGKYP R G    +
Sbjct: 37  GYGDVGINGNPNIETPNLDRMAMEGMRFSNYYSASPACTASRYALLTGKYPSRAGFRWVL 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +   E  + + LKE GY T + GKWH+G  ++E LP   GFD +VG     L 
Sbjct: 97  NPTDQIGIHQQESTIAERLKEKGYRTAIYGKWHLGSTRKEFLPLANGFDEYVG-----LP 151

Query: 142 Y-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y ND I      + L +  +     P  S   LT  +T++++  I + N  +P F+ + +
Sbjct: 152 YSNDMIPPKYPDIALLSGYDTLELNPDQSK--LTRLYTEKAIAFI-TKNAKQPFFIYLPY 208

Query: 201 AAVHT 205
           A  HT
Sbjct: 209 AMPHT 213


>gi|291227581|ref|XP_002733761.1| PREDICTED: arylsulfatase A-like, partial [Saccoglossus kowalevskii]
          Length = 158

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 52/113 (46%), Positives = 70/113 (61%), Gaps = 2/113 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
           GWNDVG+H  + I TP++D +A +G+ L  +Y    CTP+R  FLTGK+     + +  +
Sbjct: 36  GWNDVGYH-NSSISTPHMDTIANDGVKLESYYVGHVCTPTRGMFLTGKHMINLRLYNGII 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           G    K +PV E  + Q L+E  Y+TH IGKWH+G  KEE LP NRGFD   G
Sbjct: 95  GGHDPKCLPVNEVTVAQKLREYNYATHAIGKWHLGYYKEECLPINRGFDTFFG 147


>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 467

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 69/185 (37%), Positives = 96/185 (51%), Gaps = 16/185 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DVGFH  N +PTPN+D LA  G+ L +HY  P C+P+R AFL+G+Y  R+ + TP  
Sbjct: 52  GWGDVGFHHGN-VPTPNLDHLAGEGLELMQHYVYPVCSPTRCAFLSGRYASRFSVTTPQN 110

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
               +   VT   L + LK +GY T L GKWH+G +K E  P   GFD+  G   G +  
Sbjct: 111 PRAFRWDTVT---LARALKSVGYDTALCGKWHLG-SKPEWGPQKFGFDHSYGSLAGGVGP 166

Query: 143 ND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            D    I E       D +   E+        ++TD  T ++V  ++S    +P FL + 
Sbjct: 167 WDHHYKIGEFTQTWHRDGKLIEEQ-------GHVTDLITKEAVEWLESRT-DKPFFLYVP 218

Query: 200 HAAVH 204
             AVH
Sbjct: 219 FTAVH 223


>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
 gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 511

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 75/237 (31%), Positives = 110/237 (46%), Gaps = 31/237 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVGF+G  DI TP +D LA NG +    Y   P C PSR+A LTG+YP   G    +
Sbjct: 59  GYADVGFNGSTDILTPELDNLAQNGSIFTSAYVAHPFCGPSRSAILTGRYPHLTGTAYNL 118

Query: 82  GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
               ++       VPV E  + + L+  GY T  IGKWH+G    +  P  RGFD+  G+
Sbjct: 119 FHNSSEDDKDNMGVPVEETYMSKVLQNAGYYTSAIGKWHLGA-APKFHPNKRGFDDFYGF 177

Query: 136 WNGYLTYNDSIHETDFAVGLDARR-NMERYA--------PQMSSKYLTDFFTDQSVHVIK 186
             G   Y  S ++  +     A   N+  Y         P   ++Y+TD F+ +++  IK
Sbjct: 178 LGGGHDYFPSEYQKTYKAQKKAGNPNIRDYVFPMEHNGKPANETEYITDGFSREAIKNIK 237

Query: 187 -SHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +    +P F+ + + A H      A             E+   FAHI + DRR +A
Sbjct: 238 IAAAKKQPFFIYLAYNAPHVPLQAKA-------------EDVAKFAHIKDKDRRTYA 281


>gi|260824685|ref|XP_002607298.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
 gi|229292644|gb|EEN63308.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
          Length = 178

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 9/124 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTP-------SRAAFLTGKYPFRY 75
           GWNDVG+H   D+ TP +D LA  G++LN+ Y    CTP       SR AF+TG +P+  
Sbjct: 39  GWNDVGWHNP-DVKTPVLDQLANEGVILNQSYVNYVCTPFPVVKSRSRTAFMTGYFPYHV 97

Query: 76  GIDTPVGAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           G    V     A+ +P     LP+ LK+LGY+TH++GKWH+G       P  RGFD+  G
Sbjct: 98  GTQHQVFFPFQAQGIPSNFSFLPEKLKDLGYATHMVGKWHLGFCNWNYTPTYRGFDSFFG 157

Query: 135 YWNG 138
           Y+NG
Sbjct: 158 YYNG 161


>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
 gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
          Length = 452

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 99/187 (52%), Gaps = 6/187 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G + + TP++D LA  GI     Y+  P C+PSRA+ LTGKYP R G+   +
Sbjct: 28  GYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEIL 87

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           GA      +P  E  L + LK  GY T L GKWH+G + EE  P   GFD   G+  G +
Sbjct: 88  GAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLS-EETSPNAHGFDEFFGFKAGCV 146

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVI-KSHNHSRPLFLQ 197
            +   I     A G++   ++     ++  + +Y+T+  T++SV  I +S     P FL 
Sbjct: 147 DFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFLF 206

Query: 198 ITHAAVH 204
            ++ A H
Sbjct: 207 ASYNAPH 213


>gi|410097286|ref|ZP_11292268.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224604|gb|EKN17536.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 446

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 63/186 (33%), Positives = 93/186 (50%), Gaps = 10/186 (5%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
            G+ D+G  G  DI TPN+D +A +G+    +Y+  P  T SR + LTG+YP R G    
Sbjct: 39  MGYGDIGVTGHPDIKTPNLDRMALDGMRFTNYYSASPASTASRYSLLTGRYPVRAGFRWV 98

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +     + +   E  + + LKE GY+T + GKWH+G  K+E LP   GFD +VG     L
Sbjct: 99  LSPDAERGIHPRELTIAELLKEQGYATAIYGKWHLGSTKKEYLPLQNGFDEYVG-----L 153

Query: 141 TY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            Y ND I      + L    +  R  P  S   LT  +T++++  IK H      F+ + 
Sbjct: 154 PYSNDMIPPKYPDIALMCGNDTLRMNPDQSE--LTALYTEKAISFIKKHKKEN-FFVYVP 210

Query: 200 HAAVHT 205
           +A  H 
Sbjct: 211 YAMPHV 216


>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 498

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 70/197 (35%), Positives = 100/197 (50%), Gaps = 24/197 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G   + TPN+D LA +G++ ++ Y     C+PSRA  LTG+ P R+G +  +
Sbjct: 45  GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104

Query: 82  GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            A             +P +EK L  +L   GY+T LIGKWH+G   E   P  RGFD+  
Sbjct: 105 NASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163

Query: 134 GYWNGYLTYNDSIHETDFAVGLD--ARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--- 188
           G   G   Y        F   ++    RN +R     SS+YLTDFFTD+ +  I  H   
Sbjct: 164 GMLTGGHHY--------FPTTMNHVIERNGKR-VENFSSEYLTDFFTDEGLRFIDQHESA 214

Query: 189 NHSRPLFLQITHAAVHT 205
           N  +P F+  ++ A HT
Sbjct: 215 NPDQPWFVFFSYNAPHT 231


>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
 gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 462

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 92/198 (46%), Gaps = 23/198 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D GFHG  +  TP +D  A N +  ++ Y +   C PSRA  LTGKY  ++G +   
Sbjct: 37  GYADFGFHGSKEFKTPELDKFAKNAVRFSQAYVSAAVCGPSRAGLLTGKYQQKFGFEENN 96

Query: 82  GAGVAK---------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
             G+            +P+ +K +  YLKE GY T L GKWH G N +   P  RGFD  
Sbjct: 97  VPGLMSKNGLTGDDMGLPLDQKTIADYLKEQGYRTALFGKWHQG-NADRFHPTKRGFDEF 155

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKS 187
            G+  G  +Y        F    +  RN +R        Q    YLT+   D+++  I+ 
Sbjct: 156 YGFRGGARSY------MPFGADNELTRNEDRLERGFGGFQEHEGYLTEELADEAIAFIE- 208

Query: 188 HNHSRPLFLQITHAAVHT 205
            N   P F+ +   AVHT
Sbjct: 209 RNQKNPFFVYLAFNAVHT 226


>gi|334139745|ref|YP_004532943.1| sulfatase [Novosphingobium sp. PP1Y]
 gi|333937767|emb|CCA91125.1| sulfatase [Novosphingobium sp. PP1Y]
          Length = 472

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 74/236 (31%), Positives = 108/236 (45%), Gaps = 32/236 (13%)

Query: 24  WNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPVG 82
           W DV  +G  D+PTPNID +A  G+  +  Y   + C  SRA  +TG+ P R+G    + 
Sbjct: 39  WADVSTYGRTDVPTPNIDRIAKTGVAFSSGYVAASVCAVSRAGLMTGRMPQRFGFTYNIN 98

Query: 83  --AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
               V   +PV +K +   L+ LGY T   GKWH+G ++ +  P NRGFD   G+  G  
Sbjct: 99  DKGDVGAGLPVGQKTIADRLQPLGYRTAAFGKWHLGADR-QFYPTNRGFDEFFGFLAGET 157

Query: 141 TYND---------SIHETDFAVGLDARRNMERYAPQMS-----SKYLTDFFTDQSVHVI- 185
            Y D               + +G     +     P        SKYLT+  TD++V  I 
Sbjct: 158 NYVDPKTPGIVTTPTKVDKYEIGPGEGNHAMVEGPDARPADDFSKYLTNQITDRAVDFIN 217

Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
           +S +  +P F  + + A H             LQVP     DR FA++ +P RR +
Sbjct: 218 RSADAKQPFFSYVAYNAPHWP-----------LQVP-QAYYDR-FANVKDPVRRTY 260


>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
 gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
          Length = 439

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 98/187 (52%), Gaps = 6/187 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G + + TP++D LA  G+     Y+  P C+PSRA+ LTGKYP R G+   +
Sbjct: 15  GYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLTGKYPVRAGVGEIL 74

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           GA      +P  E  L + LK  GY T L GKWH+G +K E  P   GFD   G+  G +
Sbjct: 75  GAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGLSK-ETSPNAHGFDEFFGFKAGCV 133

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
            +   I       G++   ++     ++  + +Y+T+  T++SV  IK S     P FL 
Sbjct: 134 DFYSHIFYWGQGHGVNPLHDLWENETEVWENGRYMTELITERSVDFIKRSREQEAPFFLF 193

Query: 198 ITHAAVH 204
            ++ A H
Sbjct: 194 ASYNAPH 200


>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
          Length = 492

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 2/117 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G+    D+ +PNIDALA + + L +HY  P+CTPSRAAFLTG+Y  R G+ + V 
Sbjct: 35  GFDDLGYVNR-DVISPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVI 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                + +P+ E LL +  K+ GY T L GKWH+G    +  P NRGFD   G++ G
Sbjct: 94  RPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFDRFYGFYLG 150


>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 472

 Score =  103 bits (256), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 93/185 (50%), Gaps = 10/185 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG ++I TPNID LA   + L+R Y  P C+P+RA  LTG YPFR+G    V 
Sbjct: 40  GWNDVGFHG-SEIRTPNIDRLANESVTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVV 98

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +   K  +P   +  P++L +LGY    + GKWH+G       P   G     G++NG +
Sbjct: 99  SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAI 158

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y        F   LD  RN +    +    Y T+   +  V  I  + ++ P++  +  
Sbjct: 159 DY---FSRERFGQ-LDWHRNFDSVHEE---GYSTELVGNAVVDFIDRNANAGPVYAYVAF 211

Query: 201 AAVHT 205
            A H+
Sbjct: 212 NAPHS 216


>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 512

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 71/225 (31%), Positives = 111/225 (49%), Gaps = 34/225 (15%)

Query: 23  GWNDVGFHGEND---IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           G+NDVG+ G+N      TP +D+LA NG+ L  +YT   C+P+R A +TG    R  ID 
Sbjct: 40  GYNDVGYWGQNHGSAAKTPFLDSLAENGVRLENYYTHSVCSPTRGALMTG----RNRIDI 95

Query: 80  PVGAGVA-----KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            +  G+      + +P+   LLP+ L   GY+T +IGKWH+G +  +  P+NRGF    G
Sbjct: 96  GLAHGIIHTTQIEGLPLDNVLLPEQLSNCGYNTQMIGKWHLGFSSSKYAPWNRGFHGFYG 155

Query: 135 -------YWNGYL--TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
                  YW+ +L    + +I   DF        N      +   +Y    +  ++ +VI
Sbjct: 156 FLAGSENYWSKWLPMARHSNIGGVDFTDSTTGPTN------ETWGQYSAHVYASRARYVI 209

Query: 186 KSHNHSRPLFLQITHAAVHT--GTAGNAKLPTGLLQVPDMEENDR 228
           + H+ S+PLFL +     HT  G   +   P       D+E++DR
Sbjct: 210 QHHDQSKPLFLYLPLQTPHTPLGAPSHYYEP-----FKDIEDDDR 249


>gi|149177349|ref|ZP_01855954.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
           8797]
 gi|148843874|gb|EDL58232.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
           8797]
          Length = 472

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 75/237 (31%), Positives = 109/237 (45%), Gaps = 38/237 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+ ++G  G   IPTP+ID+LA +GI   + Y T P C+PSRA  LTG+ P R+G +  P
Sbjct: 37  GYGELGCQGNPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEFNP 96

Query: 81  VGA---GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW- 136
           +GA        +P  E+ + + L + GY+T LIGKWH+G    +  PF  GFD   G+  
Sbjct: 97  IGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLG-GTADYHPFRHGFDEFFGFMH 155

Query: 137 --------------------------NGYLTYNDSIHETDFAV---GLDARRNMERYA-P 166
                                      G     + I+ T         DA   + R   P
Sbjct: 156 EGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHMGYDEPDYDANNPIIRGGQP 215

Query: 167 QMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDM 223
              ++YLTD FT ++V  I  H   +P FL + + AVH+   G  K      Q+ D+
Sbjct: 216 VNETEYLTDAFTREAVSFINRH-QDKPFFLYLAYNAVHSPLQGKKKDIQHFTQIEDI 271


>gi|313246966|emb|CBY35811.1| unnamed protein product [Oikopleura dioica]
          Length = 388

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 91/158 (57%), Gaps = 6/158 (3%)

Query: 59  CTPSRAAFLTGKYPFRYGIDT-PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGC 117
           C+PSRA FLTG+Y FRYG+ + P+       +   EKLLP+YLKE+GY TH +GKWH+G 
Sbjct: 21  CSPSRAQFLTGRYAFRYGLGSDPISFENPIGMSTKEKLLPEYLKEVGYETHAVGKWHLGY 80

Query: 118 NKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVG--LDARRNMERYAPQMSSKYLTD 175
             E   P NRGFD  +G++ G + Y+   H T  A+G  L+   N E + P+   ++ + 
Sbjct: 81  CNESFQPHNRGFDTFLGHYGGGVDYH--THATQGALGSYLNHFLNGEPHIPEDGFEFASY 138

Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKL 213
            +++++  V++  N  +P F+ +   A H   A    L
Sbjct: 139 AWSNRTRKVLRE-NTDKPNFVYLAFNAPHEKVAAPQDL 175


>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
 gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
          Length = 463

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 100/185 (54%), Gaps = 9/185 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ND GF G  ++ TPNIDAL   G+V  + H      +PSRA  +TG+Y  R+G +  +
Sbjct: 43  GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECNL 102

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +     +P+ E+ + +  K  GY T  IGKWH+G +++E  P NRGFD   G   G   
Sbjct: 103 -SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGFDLFYGMKAGGRD 160

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y  +  ++D        RN+     Q+   KYLTD F++++V  I  +  S+P  + + +
Sbjct: 161 YFYNEKKSDRP---GDERNLLLNDRQVKFEKYLTDAFSEKAVEFI--NESSQPFMMYLAY 215

Query: 201 AAVHT 205
            AVHT
Sbjct: 216 NAVHT 220


>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
 gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
          Length = 545

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 91/184 (49%), Gaps = 8/184 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW DVG+HG  DI TP++D LA  G+ LNR YT P C+P+RAA +TG+ P R G+   V 
Sbjct: 44  GWADVGYHG-GDIDTPSLDRLAQQGVRLNRFYTTPICSPTRAALMTGRDPIRLGVTYGVI 102

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  V   E  +P+  +  GY T +IGKWH+G  +    P NRGF++  G+ +  + 
Sbjct: 103 FPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYHPNNRGFEHFYGHLHTEVG 162

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           +           G D +RN      Q    YL     D+    I+  +  RP  + +   
Sbjct: 163 FYPPFSNQG---GKDFQRNGVSIDDQGYETYL---LADEVSRYIRERDRDRPFLVYMPFI 216

Query: 202 AVHT 205
           A HT
Sbjct: 217 APHT 220


>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 461

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/192 (35%), Positives = 93/192 (48%), Gaps = 16/192 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G + I TPNID LA  G      Y+  P C+PSRAA +TG+YP R  +   +
Sbjct: 39  GYGDLGCYG-SPIATPNIDRLAEEGARFTSFYSASPVCSPSRAALMTGRYPTRVEVPVVL 97

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G G A  +P +E  + Q LK  GY T  IGKWHIG +    LP NRGFD   G     + 
Sbjct: 98  GPGDA-GLPDSEITMAQVLKSAGYRTSCIGKWHIG-STPGYLPTNRGFDEFFG-----VP 150

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y+  I            R     AP +    LT  FT +++  ++      P FL + H 
Sbjct: 151 YSADITPCPLM------RGSSVVAPAVDCSTLTSSFTQEALDFMR-RAQDNPFFLYLAHT 203

Query: 202 AVHTGTAGNAKL 213
           A H   A + + 
Sbjct: 204 APHLPLAASPRF 215


>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
 gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
          Length = 502

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 76/230 (33%), Positives = 108/230 (46%), Gaps = 19/230 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
           G+ D GF G + I TPN+D LA    V  + Y +   C PSRA  LTGKY  R+G +   
Sbjct: 71  GYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAVCGPSRAGILTGKYQQRFGFEENN 130

Query: 79  -----TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
                +  G  G    +P+ ++ +  YL   GYST LIGKWH G N ++  P  RGF++ 
Sbjct: 131 VPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTALIGKWHQG-NADKFHPTKRGFEHF 189

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDA-RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
            G+  G  +Y +       +   D   R    Y  + S  YLT    D+++  IK  N  
Sbjct: 190 YGFRGGARSYFEFGPNNPVSYPEDRLERGFAHY--KESPHYLTQALADEAIKFIK-QNQR 246

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDME-ENDRTFAHISNPDRRL 240
            P FL ++  AVHT    N +    L Q P +  +  R  A   + DR +
Sbjct: 247 EPFFLFLSFNAVHTPMDANKE---DLAQFPQLSGKRQRVAAMTLSMDREI 293


>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 106/221 (47%), Gaps = 24/221 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           GW + G  G   IPTP+ID++A NG+   + +   T C+PSRA  LTG+YP R+G +   
Sbjct: 52  GWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRYPTRFGHEFNR 111

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A V+  + + E  L   L  LGY T  +GKWH+G +  E  P  RGFD       G L 
Sbjct: 112 IANVS-GLDLQETTLADRLHGLGYKTACVGKWHLG-DGPEYRPTKRGFDEFF----GTLA 165

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
                H T F   +D+R + +       + Y TD +  +SV  I     S P FL +   
Sbjct: 166 NTPFFHPTKF---VDSRVSNDVAEVSDENFYTTDEYAKRSVEWIGQQQQS-PWFLYLPFN 221

Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           A H             LQ P  +  DR F  I++P R+LFA
Sbjct: 222 AQHAP-----------LQAP-QKYLDR-FESIADPKRKLFA 249


>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
 gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
          Length = 463

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 100/185 (54%), Gaps = 9/185 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ND GF G  ++ TPNIDAL   G+V  + H      +PSRA  +TG+Y  R+G +  +
Sbjct: 43  GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECNL 102

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            +     +P+ E+ + +  K  GY T  IGKWH+G +++E  P NRGFD   G   G   
Sbjct: 103 -SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGFDLFYGMKAGGRD 160

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y  +  ++D        RN+     Q+   KYLTD F++++V  I  +  S+P  + + +
Sbjct: 161 YFYNEKKSDRP---GDERNLLLNDRQVKFEKYLTDAFSEKAVEFI--NESSQPFMMYLAY 215

Query: 201 AAVHT 205
            AVHT
Sbjct: 216 NAVHT 220


>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
 gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
          Length = 579

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 14/187 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWNDVGFHG ++I TPNID LA   + L+R Y  P C+P+RA  LTG YPFR+GI   V 
Sbjct: 147 GWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 205

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +   K  +P   +  P++L +LGY    + GKWH+G       P + G     G++NG +
Sbjct: 206 SPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 265

Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
            Y   +   + D+    D+    E Y+ ++    + DF        I  + ++ P++  +
Sbjct: 266 DYFSRERFGQLDWHRDFDSVHE-EGYSTELVGNAVVDF--------IDRNANAGPVYAYV 316

Query: 199 THAAVHT 205
              A H+
Sbjct: 317 AFNAPHS 323


>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 67/184 (36%), Positives = 92/184 (50%), Gaps = 10/184 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ DVGF     I +PN D LA  G+VLNRHY    C+PSRA+FLTG++P       P+ 
Sbjct: 34  GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCSPSRASFLTGRWPHHAHQWNPLM 92

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
             +     +   +LP  LK   Y+TH++GKWH+G      LP NRGFD   G+  G    
Sbjct: 93  DNMI-GTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRYLPINRGFDTSTGFLGG---G 148

Query: 143 NDSIHETDFAVGLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
            D ++E      +D  +N    AP   +  Y    + D    V+ +HN   PLF  +   
Sbjct: 149 EDHMNEKS-GCSIDYWKNN---APDPRNGTYDAYNYRDDLTDVMNNHNADNPLFFYLPLH 204

Query: 202 AVHT 205
            VHT
Sbjct: 205 NVHT 208


>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 471

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 105/226 (46%), Gaps = 19/226 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG---ID 78
           G+ D GF G   + TPN+D LA  G+   + Y +  TC PSRA  +TG+Y  ++G   I+
Sbjct: 38  GYADFGFQGSETMKTPNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGRYQQKFGYEEIN 97

Query: 79  TP-------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
            P          G    +P+ E  +  Y+K LGY T   GKWH+G   +EL P +RGFD 
Sbjct: 98  VPGYMSEHSAIKGAEMGIPLDEVTMGDYMKSLGYRTAFYGKWHLGGT-DELHPMHRGFDE 156

Query: 132 HVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
             G+  G   Y  Y  +  E   AV  D +        Q    YLTD   +++   I+  
Sbjct: 157 FYGFRGGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEGYLTDVLAEKANQFIEKA 216

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
              +P F+ ++  AVHT        P  L + P ++   +  A ++
Sbjct: 217 -PDKPFFIFLSFNAVHTPMEAT---PEDLAKFPQLKGKRKEVAAMT 258


>gi|449138311|ref|ZP_21773581.1| arylsulfatase [Rhodopirellula europaea 6C]
 gi|448883084|gb|EMB13628.1| arylsulfatase [Rhodopirellula europaea 6C]
          Length = 585

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 90/187 (48%), Gaps = 24/187 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  HG + I TP +DALA     L+R Y  P C P+RAA LTG+YP R G+    
Sbjct: 40  QGWGDLASHGNSKISTPTLDALANQSARLDRFYVSPVCAPTRAALLTGRYPERTGV---- 95

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            AGV    + +   E  L +  +  GY+T   GKWH G  +  L P  +GFD   G+  G
Sbjct: 96  -AGVTGRREVMRAEETTLAEMFQAAGYATGCFGKWHNGA-QMPLHPNGQGFDEFFGFCGG 153

Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           +   Y+D++ E          RN     P  +  Y+TD  TD ++  +  H    P F  
Sbjct: 154 HFNLYDDALLE----------RN---GTPVQTKGYITDVLTDAAIEFVNVH-RDHPFFCY 199

Query: 198 ITHAAVH 204
           +   A H
Sbjct: 200 VPLNAPH 206


>gi|87307004|ref|ZP_01089150.1| arylsulfatase [Blastopirellula marina DSM 3645]
 gi|87290377|gb|EAQ82265.1| arylsulfatase [Blastopirellula marina DSM 3645]
          Length = 542

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 102/199 (51%), Gaps = 23/199 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----D 78
           G++D+G+HG  +I TPNIDALA++G+  ++ Y    C P+RA  +TG YP + GI    +
Sbjct: 40  GFSDLGYHG-GEIATPNIDALAHSGVRFSQFYNNGRCCPTRATLMTGLYPHQTGIGHMTE 98

Query: 79  TPVGAGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
           +P  A      P T +         + + L++ GY+T + GKWH+G N +   P  RGF+
Sbjct: 99  SPGEANYGSGKPPTYQGYLNRNCVTIAEALQQQGYATLMSGKWHLGENDKSRWPLQRGFE 158

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKS 187
            + G  +G   Y     +    +G     N +   P+ ++    Y TD FTD ++  +K 
Sbjct: 159 KYFGCLSGATLYFFPDGDRKMTLG-----NQQIAEPESTTDQPFYTTDAFTDYAIRFLKE 213

Query: 188 HN--HSRPLFLQITHAAVH 204
                 RP+FL + + A H
Sbjct: 214 EQAGQQRPMFLYLAYTAPH 232


>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1818

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 107/222 (48%), Gaps = 54/222 (24%)

Query: 23  GWNDVGFHGE----NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR---- 74
           G+ DVG++G+    N + TP IDALA  G+ L+R+YT P CTPSRAA L+GKYP      
Sbjct: 55  GFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAALLSGKYPATTGTY 114

Query: 75  YGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           +G+  P        +P+   LLP+ L    Y +H +GKW +G +  + LP  RGFD+ +G
Sbjct: 115 HGVLNPQS---TWGLPLEHALLPEALPG-AYRSHAVGKWDVGHSSAKRLPEARGFDSFLG 170

Query: 135 YWNGYLTY-----NDSIHE---------------------------TDFAVGLDARRNME 162
           +   YL +     + S HE                            DF+ GL  +R   
Sbjct: 171 F---YLCFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFS-GLGGQRR-- 224

Query: 163 RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
                    Y TD F D++V +I++     PLFL +   AVH
Sbjct: 225 ----DYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVH 262



 Score = 90.1 bits (222), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 67/209 (32%), Positives = 101/209 (48%), Gaps = 29/209 (13%)

Query: 23  GWNDVGFHGE----NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI- 77
           G++DVG++ +    N + TP +D+LA  G+ L R+YT P CTPSRAA L+G YP   G+ 
Sbjct: 630 GFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCTPSRAALLSGMYPASSGMY 689

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
              + A     + +  +L+PQ L    Y +H +GKW +G      +P  RGF + +G+++
Sbjct: 690 HKMITAQSNWGLDLDLELIPQRLPA-AYRSHAVGKWDVGHYTWSHVPQFRGFRSFLGFYS 748

Query: 138 GYLTYNDSIHET-DFAVGLDARRNME---RYAPQMSSK-----------------YLTDF 176
             + Y    HET D    L+     E   R A + SS                  Y TD 
Sbjct: 749 PIIDYY--THETFDTLQCLEEMELTECEARLASECSSSIKDFNFDGDPLPLADGTYSTDV 806

Query: 177 FTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           F  ++  +I+      PLFL +   AVH 
Sbjct: 807 FAARARDLIRKEAPKHPLFLYVAFNAVHA 835


>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
          Length = 507

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 70/236 (29%), Positives = 110/236 (46%), Gaps = 32/236 (13%)

Query: 23  GWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF------ 73
           GW +VG+H      ++ TPNID L   G+ LN+HY    C+PSR++ ++G+ P       
Sbjct: 35  GWANVGYHRNPPTREVVTPNIDDLVKQGLELNQHYAYRCCSPSRSSLISGRLPIHVSDQN 94

Query: 74  ----RYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
                Y  + P+      A+P     + + +KE GY+TH +GKW  G    +  P  RGF
Sbjct: 95  IAPTNYNPNDPISG--FSAIPRNMTGIAEKMKEAGYATHQVGKWDAGMATPDHTPKGRGF 152

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS----SKYLTDFFTDQSVHVI 185
           D   GY++ Y  Y   + ++    G+    N ++ A  ++     KY    F ++ + ++
Sbjct: 153 DTSFGYFHHYNDYYTEVVDSCNGTGVVDLWNTDQPAHGINGTGPDKYEEALFRERLLDIV 212

Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
             H+ S PLFL      VHT            LQVPD   N   F+ I + DR  +
Sbjct: 213 SKHDPSTPLFLYYAPHIVHT-----------PLQVPDEYLN--KFSFIDDKDRMYY 255


>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 117/229 (51%), Gaps = 27/229 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYP--FRYGIDT 79
           G+ D+GF G  DI TP+IDALA +G+   + Y +   C PSRA  LTG+Y   F  G + 
Sbjct: 33  GYEDLGFLGAPDIKTPHIDALARSGMNFTQGYQSASVCGPSRAGLLTGRYQQLFGSGENP 92

Query: 80  PVGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           P    ++K      +P+ E+++   LK   Y+T +IGKWH+G + E+  P  R  D + G
Sbjct: 93  PETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMGLSHEQ-RPTQRSVDYYYG 151

Query: 135 YWNGYLTYNDSIHETDFA-VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           + NG  +Y ++  +   A +     RN E   P   S Y T+ F D+ V+ IK  N  +P
Sbjct: 152 FLNGAHSYREAKMDMKGAPMTWPIFRNNE---PVPFSGYTTEVFNDEGVNFIK-RNKDKP 207

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            FL +++ +VH       K         D++ +D    HI    RR+++
Sbjct: 208 FFLYMSYNSVHGPWEAQPK---------DLQRSD----HIKKKWRRIYS 243


>gi|114326210|ref|NP_001041587.1| arylsulfatase E precursor [Canis lupus familiaris]
 gi|81158056|tpe|CAI85002.1| TPA: arylsulfatase E [Canis lupus familiaris]
          Length = 585

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/137 (43%), Positives = 77/137 (56%), Gaps = 14/137 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N I TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGRYPLRSGMVSSN 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
           G       GV+  +P  E    + LK+ GY+T LIGKWH+G N E        P N GFD
Sbjct: 105 GYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCESSNDHCHHPLNHGFD 164

Query: 131 NHVGYWNGYLTYNDSIH 147
           +  G    +    D IH
Sbjct: 165 HFYGM--PFSMMGDCIH 179


>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 469

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/114 (43%), Positives = 70/114 (61%), Gaps = 2/114 (1%)

Query: 23  GWNDVGFHGEND-IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+NDVG+H +   I T NIDALA  G+ L  +Y  P CTPSR+ FL+GKY    G+    
Sbjct: 40  GYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQFLSGKYLIHNGMQHLV 99

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           +   V + +P+ +  +   L + GY+THL+GKWH+G  K+E  P NRGF +  G
Sbjct: 100 IDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPLNRGFQSFFG 153


>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 468

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 14/194 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID--- 78
           G+ D GF G    PTPN+D LA  G+V  + YT    C PSRA  LTG+Y  R+G +   
Sbjct: 37  GYFDFGFQGSKTFPTPNLDQLAKEGMVFKQAYTTAAVCGPSRAGLLTGRYQQRFGFEENN 96

Query: 79  -------TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
                  +    G    +P+ EK +  YL +LGY + ++GKWH+G N +   P  RGF  
Sbjct: 97  VPGYMSKSSKLLGDDMGLPLDEKTMADYLGKLGYQSIVLGKWHMG-NADRYHPLKRGFTE 155

Query: 132 HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
             G+  G  ++   + +   A   + R  +     Q   KYLT    D +   I   N  
Sbjct: 156 FYGFRGGARSFY-PLTQKQAADKPEDRLEIGYKKYQEPKKYLTYDLADAACDFI-DRNKK 213

Query: 192 RPLFLQITHAAVHT 205
           +P F+ ++  AVH+
Sbjct: 214 KPFFMYVSFNAVHS 227


>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 501

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/224 (33%), Positives = 105/224 (46%), Gaps = 27/224 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGI--- 77
           QG+ D+G  G  +I TP++D LA  G  L   Y T P CTPSR + LTG+YP R GI   
Sbjct: 48  QGYRDLGSFGSEEIMTPHLDRLAKEGAKLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDM 107

Query: 78  ---------------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
                          +  V       + V EKLLP  LK  GY + + GKW +G +K   
Sbjct: 108 IRNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPAGYVSAIYGKWDLGIHK-RF 166

Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
           LP  RGFD+  G+ N  + Y    HE     G+ +     +   +    Y T  F  ++V
Sbjct: 167 LPLARGFDDFYGFTNTGIDY--FTHER---YGVPSMYRNNQPTEEDKGTYCTYLFQREAV 221

Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN 226
             IK  NH +P FL +   A H  ++ + ++  G  Q P+  +N
Sbjct: 222 RFIK-ENHQKPFFLYLPFNAPHGASSLDPRIRGG-AQAPEKYKN 263


>gi|291231643|ref|XP_002735773.1| PREDICTED: steroid sulfatase-like [Saccoglossus kowalevskii]
          Length = 572

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 53/120 (44%), Positives = 73/120 (60%), Gaps = 8/120 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP- 80
           G  D+G +G + I TPNID LA  G+ L  +    P CTPSRAAFLTG+YP R G+ T  
Sbjct: 40  GIGDLGCYGNDTIRTPNIDLLASEGVKLTHNIVPTPICTPSRAAFLTGRYPIRSGLGTSS 99

Query: 81  --VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE----ELLPFNRGFDNHVG 134
             + AG +  +P  E  + + LK++GY+T ++GKWH+G + E    E  P N+GFD   G
Sbjct: 100 AFICAGCSAGMPTQEVTIAEMLKDVGYATAILGKWHLGIHSEEQNNEFHPLNQGFDYFYG 159


>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 443

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/227 (32%), Positives = 101/227 (44%), Gaps = 26/227 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT-- 79
           G+ DVGF G + I TP+ID LA +G++ ++ Y +   C PSRA  +TGK   R+G D   
Sbjct: 20  GYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGLMTGKNQVRFGFDNNL 79

Query: 80  ----PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
               P        +P++EK L   L E GY   L+GKWH+G +KE+  P  RGF    GY
Sbjct: 80  TNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLG-DKEQYHPLKRGFHEFWGY 138

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G   Y           G D         PQ  S Y+TD   D+ V  I+ H    P F
Sbjct: 139 LGGGHHY---FRSKPNGKGYDCPIECNYKTPQPIS-YITDDKGDECVDFIRRHK-DEPFF 193

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L  +  A H                   EE+ + ++HI    RR + 
Sbjct: 194 LFASFNAPHAPMHAK-------------EEDLKLYSHIEGEKRRAYC 227


>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
          Length = 486

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 89/177 (50%), Gaps = 6/177 (3%)

Query: 35  IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTE 93
           I TPNIDA++  G+ L  +Y  P CTPSR+  L+G+Y    G+    +  G+  A+P+  
Sbjct: 11  IKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPSALPLDT 70

Query: 94  KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETD-FA 152
           +LLP+ ++  GY T   GKWH+G  K    P+ RGF N  GY  G   Y       D   
Sbjct: 71  ELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTRCIDHHK 130

Query: 153 VGLDARRNMERYAPQM----SSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
            G+D   + E +  ++    +S+Y    +  Q+ + I   +  +P FL +   +VH 
Sbjct: 131 CGIDQNTDGEIFGERVYNADASEYSAFKYIRQAKNYIDGRDKDKPFFLYLPMQSVHA 187


>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 651

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 101/208 (48%), Gaps = 25/208 (12%)

Query: 23  GWNDVGFHGEND-------IPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFR 74
           G+ DVGF+ + +       IPTP +D LA NGI+  N H   P C PSRAA +TG  P R
Sbjct: 39  GYADVGFNRDANFPAEKGVIPTPELDQLANNGIICTNGHVAHPFCGPSRAALMTGVQPSR 98

Query: 75  YGI--DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
            G+  + P     +  +P+ E   P+ L++  Y T   GKWH+G  + +  P +RGFD  
Sbjct: 99  IGVQYNLPNDINTSLGIPLEETYFPKILQQNNYHTAAFGKWHLGFTQGKYQPLDRGFDYF 158

Query: 133 VGYWNGYLTYNDSIHE--------------TDFAVGLDARRNMERYAPQMSSKYLTDFFT 178
            G+  G   Y +  +E               ++   L  +R+          +YLTD  T
Sbjct: 159 FGFLGGGKAYFEREYEDLYYRRLGGSNPVTNEYQDPLQRQRDYVAKDEFNQDEYLTDILT 218

Query: 179 DQSVHVI-KSHNHSRPLFLQITHAAVHT 205
           D++++ I ++   S P F+ + + A HT
Sbjct: 219 DEAINYIAENKTKSDPFFMYVAYNAPHT 246


>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
           boliviensis]
          Length = 482

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 96/200 (48%), Gaps = 19/200 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G +    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LKE GY T ++GKWH+G ++ +  P   GFD   
Sbjct: 62  AHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFDEWF 120

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180

Query: 189 NHSRPLFL----QITHAAVH 204
              RP FL      THA V+
Sbjct: 181 ARRRPFFLYWAVDATHAPVY 200


>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 470

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 103/206 (50%), Gaps = 20/206 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
           G++D GF G   + TP ID LA   +V  + Y T   C PSRA   TGKY  R+G +   
Sbjct: 38  GYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYTGKYQQRFGFEENN 97

Query: 79  -----TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
                +  G  G    +P T+  + ++LKELGY T L GKWH G N ++  P  RGFDN 
Sbjct: 98  VPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQG-NHDDYHPTKRGFDNF 156

Query: 133 VGYWN---GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSH 188
            G+     GY  Y++   +   +  ++  R+ + Y       YLTD    Q+ H I +S 
Sbjct: 157 YGFREGARGYFAYSNEEQQAYPSQKME--RDFKHYIEH--EGYLTDALATQTSHFIGQSV 212

Query: 189 NHSRPLFLQITHAAVHT-GTAGNAKL 213
            + +P F  ++ +AVH    A NA L
Sbjct: 213 VNKQPFFAVLSFSAVHAPMQATNADL 238


>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
 gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
          Length = 462

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 96/183 (52%), Gaps = 14/183 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ND+  +G  DI TP ID+L   GI     Y+  P C+PSRAA LTG+YP R GI    
Sbjct: 46  GYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRAALLTGRYPIRQGITGVF 105

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +   E  + + L+E GY T L+GKWH+G +++  LP   GF ++ G     + 
Sbjct: 106 WPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQKH-LPLQNGFHSYFG-----IP 159

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y++ +    +  G D    +E Y  ++   Y T  +T+++V  I+  N  +P FL + H+
Sbjct: 160 YSNDMDMVVYMRGND----VESY--EVDQHYTTRRYTEEAVQFIE-QNKDQPFFLYLAHS 212

Query: 202 AVH 204
             H
Sbjct: 213 MPH 215


>gi|323452509|gb|EGB08383.1| hypothetical protein AURANDRAFT_37517, partial [Aureococcus
           anophagefferens]
          Length = 235

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/148 (37%), Positives = 81/148 (54%), Gaps = 10/148 (6%)

Query: 23  GWNDVGFHG-----ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI 77
           GWND G+H      E    TP +D LA +G+ L  +YT P C+PSRA  +TG+Y  R GI
Sbjct: 38  GWNDAGYHNGGRPNEGWTSTPTLDRLAASGVKLESYYTAPICSPSRAQIMTGRYQIRVGI 97

Query: 78  DTPV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
                GA     +P+ E  +   L  L Y T + GKWH+G ++   LP +RGFD H G++
Sbjct: 98  QHGCYGASQGTGLPLGEVTIADALSRLDYETWMFGKWHLGFDEAAFLPTSRGFDYHYGHY 157

Query: 137 NGYL-TYNDSIHETDFA---VGLDARRN 160
           +  +  +N ++ +T      VGLD  R+
Sbjct: 158 DACVNAWNHTVGKTGTEKPRVGLDWHRD 185


>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
 gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
          Length = 484

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/226 (32%), Positives = 108/226 (47%), Gaps = 22/226 (9%)

Query: 23  GWNDVGF-HGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           G+NDVGF +G+ +I TP +DALA  G+V    Y T P C PSRA  +TG+Y  R+G++  
Sbjct: 39  GYNDVGFTNGQTEIKTPRLDALANEGVVFENGYVTHPYCGPSRAGLITGRYQARFGMENN 98

Query: 81  VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           V          +P+TEK  P  L+E+GY T + GKWH+G       P  RGFD   G+ +
Sbjct: 99  VTYSPDDKYMGLPLTEKTFPARLQEVGYKTAIFGKWHLG-GAPHFQPNERGFDYFYGFLD 157

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFT-DQSVHVIKSHNHSRPLFL 196
           G   +N    E     G      M         +YLT   + D + ++ ++     P F+
Sbjct: 158 G--GHNYMPGEVHLGAGGYLLPIMRNKGVAEFDEYLTTALSRDAARYIERTSKEQAPFFI 215

Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            +++ A H             LQ P  +     +AHI +  RR +A
Sbjct: 216 YMSYNAPHAP-----------LQAP--QNYLEKYAHIKDEKRRTYA 248


>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 635

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 103/210 (49%), Gaps = 36/210 (17%)

Query: 22  QGWNDVGF----HGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI 77
            G+ND+G+    H  N + TP +D LA  G+ L R+YT   C+PSR A LTG YP   G+
Sbjct: 128 MGYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGALLTGLYPASTGL 187

Query: 78  DTPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
              V    +   +P+   L+PQ+L    Y +H IGKW +G      +P  RGF ++VG++
Sbjct: 188 YHGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPTGRGFHSYVGFY 246

Query: 137 NGYLTY----------------------NDSIHETDFAVGLDARRNMERYAPQMSSKYLT 174
              + Y                      NDSI + ++    D     + Y     ++Y T
Sbjct: 247 GTDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNY----DGAATGDEY----YNRYST 298

Query: 175 DFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
           D FTD++V ++++ +   PLFL +   AVH
Sbjct: 299 DIFTDRAVELLRTESARNPLFLYVAFNAVH 328


>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
          Length = 491

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 67/207 (32%), Positives = 101/207 (48%), Gaps = 20/207 (9%)

Query: 18  KLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG 76
           K +  GW D+G +GE    TPN+D +A  G++    YT  P C+PSRAA LTG+ P R G
Sbjct: 5   KSVNMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNG 64

Query: 77  IDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
             T  G          +   +P +E+LLP+ LKE GY + ++GKWH+G ++ +  P   G
Sbjct: 65  FYTTSGHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHG 123

Query: 129 FDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVH 183
           FD   G  N +    D+    +  V  D     R  E +   + +    LT  +  +++ 
Sbjct: 124 FDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALD 183

Query: 184 VIKSHNHS-RPLFL----QITHAAVHT 205
            I+    + RP FL      THA V+ 
Sbjct: 184 FIRRQQAARRPFFLYWAVDATHAPVYA 210


>gi|323454643|gb|EGB10513.1| hypothetical protein AURANDRAFT_62515 [Aureococcus anophagefferens]
          Length = 981

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 97/204 (47%), Gaps = 27/204 (13%)

Query: 25  NDVGFHG---ENDIPTP-NIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
           +DVG +     +D+P P NI  L   G+ L  +Y    C+P+RAA L+GK+  + G    
Sbjct: 83  DDVGMNDLWQSSDLPVPENIATLVAEGVELTAYYGQSMCSPARAALLSGKFVHKIGFSDK 142

Query: 81  VG------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            G      A    +VP+   L+P+ LK  GY TH IGKW+IG   E  LP+ RGFD  VG
Sbjct: 143 WGPKREVTAFSNYSVPLGHVLMPEALKRNGYGTHGIGKWNIGHCNEAYLPWMRGFDTFVG 202

Query: 135 YW--------------NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQ 180
           Y               + YL  ND++   DF   +   R + +     +  Y T+ F  +
Sbjct: 203 YLTDGIGYTDHVADGPSSYLYDNDALDLYDF---VSHERGVTKNGSAYAGAYTTEIFNAR 259

Query: 181 SVHVIKSHNHSRPLFLQITHAAVH 204
           +  +++      PLFL + H  VH
Sbjct: 260 AETILREEPSDAPLFLWLAHHGVH 283


>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
 gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
          Length = 472

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 74/228 (32%), Positives = 108/228 (47%), Gaps = 26/228 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D G +G   IPTPNIDA+A  G      Y +   C PSRA  LTG+Y  R+G +   
Sbjct: 40  GYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFGFEHNT 99

Query: 82  GAGVAKAVPVT-------EKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              +A    +T       E+ +   ++  GY T  IGKWH G ++ +  P NRGF+   G
Sbjct: 100 SNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQG-DEPKHFPLNRGFNEFYG 158

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
           +  G   + D            A  N +   P+    YLTD FTD++   I + N  +P 
Sbjct: 159 FTGG---HRDFFAYKGKRTNEHALYNNKEIVPENEITYLTDMFTDKATSFITA-NKDKPF 214

Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           F+ +++ AVHT    NAK         D+ E    +A I++  RR +A
Sbjct: 215 FMYLSYNAVHTPM--NAK--------KDLMER---YASIADTGRRAYA 249


>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 100/188 (53%), Gaps = 11/188 (5%)

Query: 22  QGWNDVGFHGEND--IPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID 78
           QG+ DV ++ E+D  I TP+ DALA +G++ +R YT    C+ +R+  +TG+Y  RYGI 
Sbjct: 35  QGYADVSYNPEHDDYISTPHTDALAKSGVIFHRGYTSGSVCSTTRSGLMTGRYQQRYGIY 94

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW-N 137
           T    G      +  K +P YLKE GY +   GKWH+G ++ +  P +RGFD+  G+   
Sbjct: 95  TAGEGGTG--TDLNAKFIPNYLKEAGYKSMAFGKWHLG-HEMKYHPLHRGFDDFYGFMGR 151

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           G   +     E D   G    R +E   P     YLT   T+++V  I+  N  +P F  
Sbjct: 152 GAHDFFRLEKEYDGKFGGPIYRGLE---PIDDKGYLTTRITEETVKFIE-ENKDKPFFAY 207

Query: 198 ITHAAVHT 205
           + + AVHT
Sbjct: 208 VAYNAVHT 215


>gi|323144144|ref|ZP_08078781.1| arylsulfatase [Succinatimonas hippei YIT 12066]
 gi|322416091|gb|EFY06788.1| arylsulfatase [Succinatimonas hippei YIT 12066]
          Length = 472

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/211 (34%), Positives = 105/211 (49%), Gaps = 36/211 (17%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D G +G +   TP IDALA  G      Y + P C+P+RA+ LTGKYP R G+   +
Sbjct: 16  GWTDTGCYGSSFYETPRIDALALEGARFTDAYASCPVCSPTRASILTGKYPARLGLTQWI 75

Query: 82  GA---GVAKAVPVTEKL------LPQYLKELGYSTHLIGKWHIGCNKEELL---PFNRGF 129
           G    G    VP  + L      L + LK+ GY T  +GKWH+  + EE     P   GF
Sbjct: 76  GGHSEGKLADVPYIDHLSTDEISLAKALKQGGYKTWHVGKWHLSKHNEERFDTYPDKHGF 135

Query: 130 DNHVGY------WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH 183
           D ++G       +NGY +            G++   +     P+   +YLTD  TD+++ 
Sbjct: 136 DVNIGGCHFGHPFNGYFS----------PYGIETLED----GPE--GEYLTDRLTDEAIK 179

Query: 184 VIK-SHNHSRPLFLQITHAAVHTGTAGNAKL 213
           +IK S N  +P F+ ++H AVHT    + +L
Sbjct: 180 LIKGSKNDDKPWFMYLSHYAVHTPIECHEEL 210


>gi|313239626|emb|CBY14523.1| unnamed protein product [Oikopleura dioica]
 gi|313245438|emb|CBY40171.1| unnamed protein product [Oikopleura dioica]
          Length = 309

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 97/189 (51%), Gaps = 10/189 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DV ++ E  + TPN++ +   G      Y+  TC+PSRAA LTG + +R G+D  P 
Sbjct: 89  GWADVSWNNEF-VKTPNLERIRKQGRTFTNLYSHSTCSPSRAALLTGIFAWRLGLDGAPF 147

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+  +L+P   K+L Y  H IGKWH G   + L P  RGFD+  G+++G + 
Sbjct: 148 NPTKVNGIPLGVELIPAKFKKLNYENHFIGKWHGGFCHQNLTPTERGFDSFYGFYSGAVN 207

Query: 142 YNDSIHETDF---AVGLDARR---NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           Y    HE+ +      LD R      E+   + +  Y T  FT++++  I + + +    
Sbjct: 208 Y--LTHESKYDAKGAALDYREVKDGKEKILKEKNGVYTTADFTERALEKIDNFDENGGNL 265

Query: 196 LQITHAAVH 204
           L +++ A H
Sbjct: 266 LFVSYNAPH 274


>gi|443734044|gb|ELU18180.1| hypothetical protein CAPTEDRAFT_89708, partial [Capitella teleta]
          Length = 113

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 48/108 (44%), Positives = 70/108 (64%), Gaps = 4/108 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           GWNDVGFHG   + TPN+DALAY+G++L  +Y  P CTPSRAA +TG++P   G+   V 
Sbjct: 1   GWNDVGFHGSEQVLTPNLDALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQHGVI 60

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
            +     + + E++LP+YL+++GY TH +GK    C  +    F+ GF
Sbjct: 61  ISSQPYGLDLKERILPEYLRDIGYKTHAVGKVCFICIADC---FDWGF 105


>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
          Length = 526

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 92/188 (48%), Gaps = 7/188 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           QG++DVG +G  DIPTPN+DA+A +G++L   Y   P C+ SRA  LTG YP R GI   
Sbjct: 84  QGYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTGCYPNRVGIHNA 143

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNG 138
           +       +   E+ L + L++ GY T + GKWH+G +  + LP   GFD   G  Y N 
Sbjct: 144 LMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLG-DHPDFLPTRHGFDEFFGIPYSND 202

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
               +  +    F  G       ER    +   + LT   T++SV  I  H    P FL 
Sbjct: 203 MWPLH-PLQGPVFDFGPLPLYEQERVVDTLEDQRLLTRQITERSVDFINRHKEE-PFFLY 260

Query: 198 ITHAAVHT 205
           + H   H 
Sbjct: 261 VPHPQPHV 268


>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 494

 Score =  100 bits (248), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 69/189 (36%), Positives = 94/189 (49%), Gaps = 22/189 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP-----FRYGI 77
           G+ DVGF     I +PN D LA  G+VLNRHY    C+PSRA+ LTG++P     +  G 
Sbjct: 34  GFADVGFKNPA-ISSPNFDHLAKTGLVLNRHYVYMYCSPSRASLLTGRWPHHTHQWNLGN 92

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           ++  G  +A        ++P  LK   Y+TH++GKWH G      LP NRGFD   G+  
Sbjct: 93  NSTAGTNLAMT------MIPAKLKAANYATHMVGKWHQGFFDPRYLPINRGFDTSSGFLC 146

Query: 138 GYLTYNDSIHETDFAV-GLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           G        H T  A+  +D  +N    AP   +  Y    + D    +I SHN + PLF
Sbjct: 147 G-----SEDHMTQNAICAIDYWKNN---APDPRNGTYDAYIYRDDLTDIINSHNTNEPLF 198

Query: 196 LQITHAAVH 204
           L +    VH
Sbjct: 199 LYLPLHNVH 207


>gi|380791197|gb|AFE67474.1| arylsulfatase E precursor, partial [Macaca mulatta]
          Length = 232

 Score =  100 bits (248), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 111/237 (46%), Gaps = 32/237 (13%)

Query: 23  GWNDVGFHG----ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
           GW DVG+H     ++DI TP ID L   GI L RHY    C+P+RA+F +G+ P  +GID
Sbjct: 64  GWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTRASFQSGRLPV-HGID 122

Query: 79  TPVGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
             V     +A +P     + Q+L + GY++H +GKW +G       P  RG++  + Y  
Sbjct: 123 GQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSHTPHGRGYNTSLNYFG 182

Query: 136 -----WNG--YLTYNDSIHETDFAVGLDARRNM---ERYAPQMS-SKYLTDFFTDQSVHV 184
                WN   +    +++         D  ++    +R A  +S + Y    F  +   +
Sbjct: 183 HANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDFWDTDRPAHNLSGTLYEEQLFVQRITDI 242

Query: 185 IKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
           I++H+ S+PLFL       H             LQ P   E  + FA+I  P RR++
Sbjct: 243 IEAHDPSQPLFLTYASKVAHYP-----------LQAP--IEYQQQFANIEPPSRRVY 286


>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 96/188 (51%), Gaps = 18/188 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR-YGIDTPV 81
           G+ DV F     I +PN + LA  G++L+RHY    C+PSRA+FLTG++P   +  + P 
Sbjct: 33  GYADVSFRNPA-IHSPNFEKLAKEGLILDRHYVFKYCSPSRASFLTGRWPHHAHQWNPPE 91

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
            A V   + +T  ++P  LK   Y TH+IGKWH G  KE  LP NRGFD   G+  G   
Sbjct: 92  DALVGANLKMT--MIPAKLKLARYKTHMIGKWHEGLYKEAYLPINRGFDTMSGFLGGGEN 149

Query: 142 Y-NDSI-HETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
           + N  +   TDF    G D+R          +  Y    + D    +I +HN S P FL 
Sbjct: 150 HMNQQVGCATDFWKNDGPDSR----------NGSYDAYTYRDDLTDIITNHNPSDPFFLY 199

Query: 198 ITHAAVHT 205
           +    VHT
Sbjct: 200 LPLHNVHT 207


>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score = 99.8 bits (247), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 94/188 (50%), Gaps = 20/188 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP-----FRYGI 77
           G+ DVGF     I +PN D LA  G+VLNRHY    C+PSRA+ LTG++P     +   +
Sbjct: 34  GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCSPSRASLLTGRWPHHAHQWNPLM 92

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           D+ +G  +         +LP  LK   YSTH++GKWH+G      LP NRGFD   G+  
Sbjct: 93  DSTIGTNI------NMTMLPAKLKAANYSTHMVGKWHLGFFDPRYLPINRGFDTSTGF-- 144

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            +    D ++E      +D  +N    AP   +  Y    + D    V+ ++N   PLFL
Sbjct: 145 -FGCCEDHMNEKS-GCSIDYWKNN---APDPRNGTYDAYNYRDDLTDVMSNYNTENPLFL 199

Query: 197 QITHAAVH 204
            +    VH
Sbjct: 200 YLPLHNVH 207


>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 470

 Score = 99.8 bits (247), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 70/227 (30%), Positives = 105/227 (46%), Gaps = 26/227 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+ F G   + TP+ID LA +G+     Y +   C+PSRA  LTG+    +G D  +
Sbjct: 46  GYGDLSFTGSTQVKTPHIDELAASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNNL 105

Query: 82  GAG------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
                        +PV  K +  +LK+LGY T L+GKWH+G  +++  P NRGFD   GY
Sbjct: 106 ANSQPGFDPAFLGLPVNVKTVGDHLKKLGYVTGLVGKWHLGY-EDQFSPLNRGFDEFWGY 164

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G    +D    ++   G  A+       PQ  + Y+TD   D+ ++ I+ H    P F
Sbjct: 165 LGG---GHDYFEASEAKRGYKAKIKCNYKTPQEIT-YITDDKGDECINFIQRHK-DEPFF 219

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           L  +  A HT     A             E+   + HI +  RR +A
Sbjct: 220 LYASFNAPHTPMQATA-------------EDLAIYQHIEDRKRRTYA 253


>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 479

 Score = 99.8 bits (247), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 67/234 (28%), Positives = 108/234 (46%), Gaps = 28/234 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+NDVGF+G  DI TPN+D LA NG+++   Y   P C PSR + +TGKY    G    +
Sbjct: 39  GYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMTGKYAHTMGAQFNI 98

Query: 82  ---GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                G    +P+  K + + L+E GY T   GKWH+G +     P  RGFD   G+  G
Sbjct: 99  PSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLGAD-TPFHPNKRGFDEFYGFLGG 157

Query: 139 YLTYNDSIHETDFA-VGLDARRNMERYAPQM--------SSKYLTDFFTDQSVH-VIKSH 188
              Y    ++  +  +     +N+  Y   +          +Y+TD  + ++V+ V K+ 
Sbjct: 158 GHDYIPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEKEYITDGLSREAVNFVYKAS 217

Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              +P F+ + + A H             + +   +E+   F  I +  RR +A
Sbjct: 218 EKKQPFFMYLAYNAPH-------------VPLQAKKEDMAVFKSIKDEKRRTYA 258


>gi|406833280|ref|ZP_11092874.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 1053

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 71/200 (35%), Positives = 96/200 (48%), Gaps = 23/200 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +G     TPNID LA  GI   + Y   P   P+RAA LTG+YP R  +   V
Sbjct: 118 GWADLGCYGSKFHKTPNIDRLAQRGIRFTQAYAAAPIGQPTRAAILTGRYPQRMNLTASV 177

Query: 82  GAG------------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
            A             VAKA+P+ E  + + LK  GY+T  IGKWH+G   E   P  +GF
Sbjct: 178 AADPHDSKRRLTPPDVAKALPLEEVTIAEALKAAGYATGCIGKWHLG--GEGFGPKEQGF 235

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDAR-RNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
           D  V        Y      +DFA  +D   + +       + +YLTD    ++   +KSH
Sbjct: 236 DVSVAAAATGAIY------SDFAPYVDVDGKPIPGLEQAPAGEYLTDRLALEAAKFVKSH 289

Query: 189 NHSRPLFLQITHAAVHTGTA 208
             ++P FL + H AVH   A
Sbjct: 290 -QAKPFFLYLPHFAVHLPAA 308


>gi|71280931|ref|YP_269082.1| sulfatase [Colwellia psychrerythraea 34H]
 gi|71146671|gb|AAZ27144.1| sulfatase family protein [Colwellia psychrerythraea 34H]
          Length = 492

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 97/199 (48%), Gaps = 21/199 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+  +G N   TPNID LA +G+  +  Y   P C PSR A  +G YP RYG+  P 
Sbjct: 42  GRQDLSTYGSNFYETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGV--PQ 99

Query: 82  GAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV--GYWNG 138
           G  V K  +P++     ++LKE GY T  IGKWH+G  KE   P  +GFD+ +  G+W  
Sbjct: 100 GERVGKHHLPLSAVTFGEHLKEAGYQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHWGA 157

Query: 139 ----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
               Y  Y   + ++    G       E        +YLTD  TD+++  I+     +P 
Sbjct: 158 PPSYYFPYT-KMSKSGKNKGFAKVEGSEE-------EYLTDRLTDEALTFIE-QKKDQPF 208

Query: 195 FLQITHAAVHTGTAGNAKL 213
            L + H AVHT   G   L
Sbjct: 209 LLVLAHYAVHTPIEGKPAL 227


>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 500

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 86/171 (50%), Gaps = 9/171 (5%)

Query: 35  IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEK 94
           I TP+   L  NG++LNRHY    C+PSRA+FLTG++P       P   G+     +   
Sbjct: 50  IKTPSFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHHVHQWNPTPPGMV-GTNINMT 108

Query: 95  LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVG 154
           +LP  LK  GYSTH++GKWH G      LP NRGFD      +G+L   +          
Sbjct: 109 MLPAKLKTAGYSTHMVGKWHQGLYDPAYLPVNRGFDTS----SGFLQAGEGHFNQTIGCA 164

Query: 155 LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRPLFLQITHAAVH 204
           +D  +N   +AP   +     +  ++ +  I S H+ S+PLFL +    VH
Sbjct: 165 VDFWKN---HAPDTRNGTYDSYIYNKDLTTIFSKHDASKPLFLYLPLHNVH 212


>gi|332223751|ref|XP_003261032.1| PREDICTED: arylsulfatase E isoform 2 [Nomascus leucogenys]
          Length = 614

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPMRSGMVSSI 133

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query: 131 NHVG 134
           +  G
Sbjct: 194 HFYG 197


>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
 gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
          Length = 488

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 108/218 (49%), Gaps = 17/218 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+GF G  +I TPNIDALA+ G+V +  Y T P C PSRA  LTG+Y  R+G++   
Sbjct: 46  GYGDLGFTGSREIKTPNIDALAHKGVVFSNAYVTHPYCGPSRAGLLTGRYQARFGMEINA 105

Query: 82  GAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                     +PV E    + +++ GY T +IGKWH+G +     P NRGFD   G+  G
Sbjct: 106 AHSPDDPFMGLPVDEPTFAKRMQKAGYKTAVIGKWHMGSHP-NFHPNNRGFDYFYGFLGG 164

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              Y   +  +   ++++ L   RN +   P   ++YLT   + ++     +   S+P  
Sbjct: 165 GHDYFPESVKVSNEEYSIPLS--RNGK---PAQLNEYLTTAISKEAAEF--AMTTSQPFM 217

Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
           + + + A H       K       + D+  N RT+A +
Sbjct: 218 MYVAYNAPHQPLEATQKDLAKYQHIEDI--NRRTYAAM 253


>gi|332223749|ref|XP_003261031.1| PREDICTED: arylsulfatase E isoform 1 [Nomascus leucogenys]
          Length = 589

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPMRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|410988042|ref|XP_004000297.1| PREDICTED: arylsulfatase E [Felis catus]
          Length = 585

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G + I TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDIGCYGNDTIRTPNIDRLARDGVMLTQHLAAASVCTPSRAAFLTGRYPLRSGMVSSN 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
           G       GV+  +P  E    + LK+ GY+T LIGKWH+G N E        P N GFD
Sbjct: 105 GYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCESSNDHCHHPLNHGFD 164

Query: 131 NHVG 134
           +  G
Sbjct: 165 HFYG 168


>gi|410340669|gb|JAA39281.1| arylsulfatase E (chondrodysplasia punctata 1) [Pan troglodytes]
          Length = 599

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 58  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 117

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 118 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 177

Query: 131 NHVG 134
           +  G
Sbjct: 178 HFYG 181


>gi|194227646|ref|XP_001495573.2| PREDICTED: arylsulfatase E [Equus caballus]
          Length = 623

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 101/195 (51%), Gaps = 22/195 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G   I TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 83  GVGDVGCYGNTTIRTPNIDRLAKDGVMLTQHIAAASVCTPSRAAFLTGRYPVRSGMVSSN 142

Query: 82  GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
           G  V +       +P  E    + LK+ GY+T LIGKWH+G N +        P N GFD
Sbjct: 143 GYRVLQWTAASGGLPTNETTFAKILKDTGYATGLIGKWHLGLNCQSSNDHCHHPLNHGFD 202

Query: 131 NHVGYWNGYLTYNDSIHE--TDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
           +  G    +    D +H   ++  VGL+ + N   +  Q+ +     F T + +H++   
Sbjct: 203 HFYGM--PFSMMGDCVHWELSEKRVGLENKLN---FCSQIMAIAALTFTTGKLIHLMAG- 256

Query: 189 NHSRPLFLQITHAAV 203
             S  L +  T AA+
Sbjct: 257 --SWALVIWSTVAAI 269


>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 469

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 74/225 (32%), Positives = 104/225 (46%), Gaps = 23/225 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+GF G  +I TPNIDALA NG    N + T P C PSR   LTG+Y  R G++  V
Sbjct: 28  GYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTHPYCGPSRVGLLTGRYQARLGMENNV 87

Query: 82  G---AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                     +P++E      L+++GY T + GKWH+G       P  RGFD   G+ +G
Sbjct: 88  SYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKWHLG-GAPHFQPNKRGFDYFYGFLDG 146

Query: 139 YLTYN-DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
              Y  D +        L   RN +        +YLT   +  +V  I     S P F+ 
Sbjct: 147 GHNYMPDQVTVGGDGYSLPLMRNTQVTE---FDEYLTTALSRDAVKYIHRQQES-PFFMY 202

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +++ A HT            LQ P   E    + HI + DRR++A
Sbjct: 203 LSYNAPHTP-----------LQAP--AEYIEKYKHIEDEDRRVYA 234


>gi|426395032|ref|XP_004063784.1| PREDICTED: arylsulfatase E isoform 2 [Gorilla gorilla gorilla]
          Length = 614

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query: 131 NHVG 134
           +  G
Sbjct: 194 HFYG 197


>gi|120659872|gb|AAI30439.1| Arylsulfatase E (chondrodysplasia punctata 1) [Homo sapiens]
 gi|313883184|gb|ADR83078.1| arylsulfatase E (chondrodysplasia punctata 1) [synthetic construct]
          Length = 589

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDVGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
            queenslandica]
          Length = 2318

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 10/183 (5%)

Query: 23   GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
            G+ D  F     I TPN   L  NG++LNRHY    C+PSRA+FLTG++P       P  
Sbjct: 1152 GFADASFRNP-AIKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHHVHQWNPTP 1210

Query: 83   AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
             G+     +   +LP  LK  GY+TH++GKWH G      LP NRGFD      +G+L  
Sbjct: 1211 LGMV-GTNINMTMLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGFDTS----SGFLQA 1265

Query: 143  NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV-HVIKSHNHSRPLFLQITHA 201
             +          +D  +N    AP   +     +  ++ +  V   H+ S+PLFL +   
Sbjct: 1266 EEGHFNQTIGCAVDFWKND---APDTRNGTCDSYIYNKDLTTVFNEHDASKPLFLYLPLH 1322

Query: 202  AVH 204
             VH
Sbjct: 1323 NVH 1325


>gi|297709347|ref|XP_002831396.1| PREDICTED: arylsulfatase E isoform 2 [Pongo abelii]
          Length = 614

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query: 131 NHVG 134
           +  G
Sbjct: 194 HFYG 197


>gi|297303268|ref|XP_002806170.1| PREDICTED: arylsulfatase E isoform 2 [Macaca mulatta]
          Length = 614

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query: 131 NHVG 134
           +  G
Sbjct: 194 HFYG 197


>gi|426395030|ref|XP_004063783.1| PREDICTED: arylsulfatase E isoform 1 [Gorilla gorilla gorilla]
          Length = 589

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
 gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
          Length = 443

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 96/199 (48%), Gaps = 33/199 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+  +G  D  TPNID LA  GI  +N +   P CTP+R AF+TG+YP R    TPV
Sbjct: 42  GYGDLSGYGRKDFLTPNIDKLAAQGIKFVNAYSAAPLCTPTRTAFMTGRYPAR----TPV 97

Query: 82  G-------------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
           G              G+  A P    L    ++  GY T LIGKWH+G   +   P   G
Sbjct: 98  GLMEPLTPSKRDSTVGLTAAFPSVATL----MRASGYETALIGKWHLGFLPQN-SPVKNG 152

Query: 129 FDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS---SKYLTDFFTDQSVHVI 185
           FD   G  +G   Y    H+T    G   RR  + Y    +     YLTD FT ++V  +
Sbjct: 153 FDYFFGIHSGAADYIS--HKT----GPAGRRIHDLYENDQAVYPEGYLTDLFTQKAVTFL 206

Query: 186 KSHNHSRPLFLQITHAAVH 204
           K   H++P FL +T+ A H
Sbjct: 207 K-QKHNKPFFLTLTYNAAH 224


>gi|297709345|ref|XP_002831395.1| PREDICTED: arylsulfatase E isoform 1 [Pongo abelii]
          Length = 589

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|301770871|ref|XP_002920858.1| PREDICTED: arylsulfatase E-like [Ailuropoda melanoleuca]
          Length = 592

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 72/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N I TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHVAAASVCTPSRAAFLTGRYPLRSGMVSSN 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       GV   +P  E    + LK+ GY+T LIGKWH+G N +        P N GFD
Sbjct: 105 GYRVLQWTGVPGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCDSSSDHCHHPLNHGFD 164

Query: 131 NHVG 134
           +  G
Sbjct: 165 HFYG 168


>gi|444722183|gb|ELW62881.1| N-acetylgalactosamine-6-sulfatase [Tupaia chinensis]
          Length = 764

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 95/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 73  GWGDLGVYGEPSRETPNLDQMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRTGFYTTN 132

Query: 81  -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK  GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 133 AHARNAYTPQEIVGGIPSSEHLLPELLKGAGYVSKIVGKWHLG-HRPQFHPLRHGFDEWF 191

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 192 GAPNCHFGPYDNKARPNIPVYRDWEMVGRFYEEFPISLKTGEANLTQIYLQEALDFIKRQ 251

Query: 189 NHSRPLFLQ----ITHAAVHT 205
              RP FL      THA V+ 
Sbjct: 252 AGRRPFFLHWAIDATHAPVYA 272


>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 445

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/189 (33%), Positives = 97/189 (51%), Gaps = 13/189 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+  +G   I TPN+D LA  G++  + H     C+P+RAA +TGKY  R G++  V
Sbjct: 39  GYGDLSCYGNEYINTPNLDLLASEGVLFTDYHSNGSVCSPTRAALMTGKYQQRTGVEGVV 98

Query: 82  GAGVAKAV--PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            A   + V   + E  L + LK+LGY+T + GKWH+G +K    P  +GFD  VG+ +G 
Sbjct: 99  TAKSHRDVGLALAEVTLAEELKQLGYNTGMFGKWHLGYDK-AFNPTLQGFDEFVGFVSGN 157

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN---HSRPLFL 196
           + Y+  I +  +    D  +       +    Y TD  ++  V  I+ HN      P FL
Sbjct: 158 VDYHGHIDQEGYLDWWDGVK------IKNEKGYTTDLISEYGVKFIQEHNPEVKRAPFFL 211

Query: 197 QITHAAVHT 205
            + H A H+
Sbjct: 212 YLPHEAPHS 220


>gi|284039849|ref|YP_003389779.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283819142|gb|ADB40980.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 533

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/195 (32%), Positives = 100/195 (51%), Gaps = 19/195 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G  ++ TPN+D LA  GI L   Y    C P+RA+ LTG+YP   G+   V 
Sbjct: 52  GFSDIGCYG-GEVNTPNLDKLAAGGIKLRSFYNNARCCPTRASLLTGQYPHTVGMGLMVT 110

Query: 83  AGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              A   P + +         + + LKE GYST+++GKWH+G  + E  P  RGF+++ G
Sbjct: 111 MPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHVG-ERPEHWPLKRGFEHYFG 169

Query: 135 YWNGYLTYNDSI--HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHN 189
             +G  +Y + I   +    + LD +     + P     Y+TD FTD +V  +   K   
Sbjct: 170 LISGASSYYEIIPAEKGKRFIVLDDK----EFTPPADGFYMTDAFTDYAVQYLNQQKQEQ 225

Query: 190 HSRPLFLQITHAAVH 204
             +P F+ + + A H
Sbjct: 226 ADKPFFMYLAYTAPH 240


>gi|62510430|sp|Q60HH5.1|ARSE_MACFA RecName: Full=Arylsulfatase E; Short=ASE; Flags: Precursor
 gi|52782187|dbj|BAD51940.1| arylsulfatase E [Macaca fascicularis]
          Length = 588

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|62897927|dbj|BAD96903.1| arylsulfatase E precursor variant [Homo sapiens]
          Length = 589

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 480

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 76/233 (32%), Positives = 111/233 (47%), Gaps = 28/233 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG--IDT 79
           G+ DVGF+G  DI TPNID LA +G   +  Y   P C PSRAA +TG+YP + G   + 
Sbjct: 41  GYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAALMTGRYPHKIGSQFNL 100

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P   G    VP   K + + L E  Y T  +GKWH+G +  +  P  RGFD + G+  G 
Sbjct: 101 PT-RGSNVGVPTDAKFISKLLNENNYFTGALGKWHMG-DAPQYHPNKRGFDEYYGFLGGG 158

Query: 140 LTY-NDSIHETDFAVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVI-KSHN 189
             Y  D              +N+  Y   +         ++Y+TD  + ++V+ + K+ N
Sbjct: 159 HNYFPDQYQPQYKKQQAQGLKNIFEYITPLEHNGKEVKETQYITDALSREAVNFVDKAVN 218

Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              P FL + + A H        +P   LQ  D  E+   F +I N DR+ +A
Sbjct: 219 KKNPFFLYLAYNAPH--------VP---LQAKD--EDMAMFPNIKNKDRKTYA 258


>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 484

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 94/197 (47%), Gaps = 21/197 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D GF G   + TPN+D LA +G    + Y T  TC PSRA  +TGKY  R+G +   
Sbjct: 34  GYADFGFQGSKIMKTPNLDKLAKSGAKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 93

Query: 82  GAGVAKA----------VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
             G   A          +P+ +  +  +LK+LGY T + GKWH+G + +   P  RGFD 
Sbjct: 94  VPGYMSANSKFLADDMGLPLDQLTIADHLKKLGYKTAMYGKWHLG-DADRYHPTKRGFDE 152

Query: 132 HVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
             G+  G   Y  YND          LD R        Q  ++Y+TD    ++V  I+  
Sbjct: 153 FYGFRGGARNYFGYNDVSK-----ANLDNRMERGFGNYQEPTEYVTDALAKEAVSFIEK- 206

Query: 189 NHSRPLFLQITHAAVHT 205
           N   P F+ +   AVHT
Sbjct: 207 NKGNPFFIYLAFNAVHT 223


>gi|157266309|ref|NP_000038.2| arylsulfatase E precursor [Homo sapiens]
 gi|77416850|sp|P51690.2|ARSE_HUMAN RecName: Full=Arylsulfatase E; Short=ASE; Flags: Precursor
 gi|62897959|dbj|BAD96919.1| arylsulfatase E precursor variant [Homo sapiens]
 gi|119619123|gb|EAW98717.1| arylsulfatase E (chondrodysplasia punctata 1), isoform CRA_a [Homo
           sapiens]
          Length = 589

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|355757152|gb|EHH60677.1| Arylsulfatase E [Macaca fascicularis]
          Length = 589

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|109129828|ref|XP_001116129.1| PREDICTED: arylsulfatase E isoform 1 [Macaca mulatta]
 gi|355704585|gb|EHH30510.1| Arylsulfatase E [Macaca mulatta]
          Length = 589

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 500

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 75/235 (31%), Positives = 111/235 (47%), Gaps = 32/235 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+NDVGF+G  DI TPN+D LA NG+  +  Y   P C PSRAA +TG+YP + G    +
Sbjct: 51  GYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNL 110

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
               +   V   E  + Q +K  GY T  +GKWH+G    E  P   GFD   G+  G  
Sbjct: 111 PEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLG-EASEYHPNKHGFDEFYGFLGGGH 169

Query: 141 TYNDSIHETDF----AVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVI-KS 187
            Y     E  +    A G+    N+  Y   +         ++Y+TD  + ++V+ + K+
Sbjct: 170 NYFPEQFEAAYNKRVAQGM---TNINMYLTPLEHNGKEVRETEYITDGLSREAVNFVDKA 226

Query: 188 HNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
               +P FL + + A H        +P   LQ    EE+   F+ I +  RR +A
Sbjct: 227 AAKKKPFFLYLAYNAPH--------VP---LQAK--EEDMAMFSQIKDKKRRTYA 268


>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
          Length = 502

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 100/202 (49%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 21  GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 80

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P +E+LLPQ LKE GY+T ++GKWH+G ++ +  P   GFD   
Sbjct: 81  GHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFDEWF 139

Query: 134 GYWNGYLTYNDSIHETDFAVGLD---ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  +     R  E +   + +    LT  +  +++  I+  
Sbjct: 140 GSPNCHFGPYDNKARPNIPVYRNWDMVGRFYEEFPINVKTGESNLTQIYLQEALDFIRQQ 199

Query: 189 NHSR-PLFL----QITHAAVHT 205
             ++ P FL      THA V+ 
Sbjct: 200 QAAQHPFFLYWAVDATHAPVYA 221


>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 590

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 96/202 (47%), Gaps = 29/202 (14%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  HG  ++ TPNID+LA +G +  R Y  P C P+RA FLTG+Y  R G+    
Sbjct: 34  QGWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFLTGRYHPRGGVRGVT 93

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
             G  + + + EK + +  K  GY+T   GKWH G  +    P  RGFD + G+ +G+  
Sbjct: 94  SGG--ERLDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNARGFDEYYGFTSGHWG 150

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D   E +               P   + ++TD  TD ++  IK+ +  RP F  +  
Sbjct: 151 EYFDPPLEHN-------------GRPVQGNGFITDDLTDHAISFIKA-SKDRPFFCYLPF 196

Query: 201 AAVHTGTAGNAKLPTGLLQVPD 222
              H+            +QVPD
Sbjct: 197 NTPHSP-----------MQVPD 207


>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
           griseus]
          Length = 493

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/208 (31%), Positives = 102/208 (49%), Gaps = 20/208 (9%)

Query: 16  TEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFR 74
           TE+    GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R
Sbjct: 5   TERAEVMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIR 64

Query: 75  YGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
            G  T  G          +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P  
Sbjct: 65  NGFYTSNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLK 123

Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQS 181
            GFD   G  N +    D+  + +  V  D     R  E +   + +    LT  +  ++
Sbjct: 124 HGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEA 183

Query: 182 VHVIKS-HNHSRPLFL----QITHAAVH 204
           +  I++ H    P FL      THA V+
Sbjct: 184 LDFIRTQHARQSPFFLYWAIDATHAPVY 211


>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
          Length = 525

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/221 (32%), Positives = 105/221 (47%), Gaps = 24/221 (10%)

Query: 8   GVAKAVPVTEKLL----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPS 62
           GVA+A+     LL      GW D+G +GE    TPN+D +A  G++    YT  P C+PS
Sbjct: 26  GVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPS 85

Query: 63  RAAFLTGKYPFRYGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
           RAA LTG+ P R G  T  G          +   +P +E LLP  LK  GY++ ++GKWH
Sbjct: 86  RAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWH 145

Query: 115 IGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS- 170
           +G ++ +  P   GFD   G  N +    D+    +  V  D     R  E +   + + 
Sbjct: 146 LG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTG 204

Query: 171 -KYLTDFFTDQSVHVIKSHNHS-RPLFL----QITHAAVHT 205
              LT  +  +++  I+    + RP FL      THA V+ 
Sbjct: 205 EANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHAPVYA 245


>gi|338213632|ref|YP_004657687.1| arylsulfatase [Runella slithyformis DSM 19594]
 gi|336307453|gb|AEI50555.1| Arylsulfatase [Runella slithyformis DSM 19594]
          Length = 535

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 99/196 (50%), Gaps = 21/196 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G  ++ TPNID +A NGI L   Y    C P+RA+ LTG+YP   G+   V 
Sbjct: 54  GFSDIGCYG-GEVNTPNIDQMAANGIKLRSFYNNARCCPTRASLLTGQYPHTVGMGLMVT 112

Query: 83  AGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              A   P + +         + + LK+ GY T+++GKWH+G  + +  P  RGFDN+ G
Sbjct: 113 MPNAAIQPGSYQGFLDDRYPTIAEQLKKTGYHTYMLGKWHVG-ERPQHWPLKRGFDNYFG 171

Query: 135 YWNGYLTYNDSI---HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSH 188
             +G  +Y + I       F V  D     + + P     Y+TD FTD +V  +   K  
Sbjct: 172 LISGASSYYEIIPAEKGKRFMVLDD-----KEFTPPSDGFYVTDAFTDYAVQYLNKQKQE 226

Query: 189 NHSRPLFLQITHAAVH 204
              +P F+ + + A H
Sbjct: 227 AADKPFFMYLAYTAPH 242


>gi|410056148|ref|XP_003317386.2| PREDICTED: arylsulfatase E [Pan troglodytes]
          Length = 750

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 72/121 (59%), Gaps = 12/121 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 129 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 188

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 189 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 248

Query: 131 N 131
           +
Sbjct: 249 H 249


>gi|116251005|ref|YP_766843.1| arylsulfatase [Rhizobium leguminosarum bv. viciae 3841]
 gi|115255653|emb|CAK06734.1| putative arylsulfatase [Rhizobium leguminosarum bv. viciae 3841]
          Length = 503

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 72/119 (60%), Gaps = 6/119 (5%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW D G +G  +     TPN+D LA  G++L   Y+ PTCTP+R+A LTG+ P R G+  
Sbjct: 51  GWGDPGLYGGGEAVGAATPNMDRLAREGLMLTSTYSQPTCTPTRSAILTGRLPVRTGLTR 110

Query: 80  PVGAG--VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
           P+ AG  + K     E  LP+ L E+GY+T L GKWH+G + E + P + GFD   G++
Sbjct: 111 PILAGDKITKNPWAEEASLPKLLGEVGYATVLCGKWHVG-DVEGMRPHDVGFDEFYGFY 168


>gi|323452121|gb|EGB07996.1| hypothetical protein AURANDRAFT_64538 [Aureococcus anophagefferens]
          Length = 1591

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 58/191 (30%), Positives = 95/191 (49%), Gaps = 13/191 (6%)

Query: 25   NDVGFHG---ENDIPTPN-IDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
            +DVG +      D+P P  +  L  +G+ L  +Y    C+P+RA  +TGK+  + G    
Sbjct: 1103 DDVGLNDLWRSTDLPKPTEMSKLVRDGVELTSYYGQSLCSPARATLMTGKFAHKIGFSDQ 1162

Query: 81   VGAGVAK-------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
             G GV +       +VP+   +LPQ +K LGY TH IGKW+IG    + +P+ RGFD  V
Sbjct: 1163 QG-GVREVTAYSNFSVPLGHDMLPQGMKRLGYQTHAIGKWNIGHCNVKYMPWQRGFDTFV 1221

Query: 134  GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
            GY+   + Y D + +T     ++    +     +    Y T  FT+++  V+       P
Sbjct: 1222 GYFTDGIGYTDHVSDTANTYTVN-DGGLAFNGSEYEGTYTTALFTERAEKVLHDAPEDAP 1280

Query: 194  LFLQITHAAVH 204
            LF+ + +  +H
Sbjct: 1281 LFMWLAYHGMH 1291


>gi|440717770|ref|ZP_20898247.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436437072|gb|ELP30746.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 480

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 94/193 (48%), Gaps = 27/193 (13%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QG+ DVG  G  DI TP +DA+A  G+ L   Y  P C PSRAA +TG YP R       
Sbjct: 23  QGYQDVGCFGSPDIRTPRLDAMAKEGMKLTSFYAQPICGPSRAALMTGCYPLRV-----A 77

Query: 82  GAGVAKAV-PVT---EKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNH 132
             G  K + P+    E  + + LK  GY+T   GKW +  + +     +LLP  +GFD  
Sbjct: 78  ERGHTKQIHPILHEGEITIAEVLKTKGYATACFGKWDLAKHAQSGFFPDLLPTGQGFD-- 135

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
             Y+ G  T ND +         +  RN E   P+     LT  +TD+++  I+  N ++
Sbjct: 136 --YFYGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQ 184

Query: 193 PLFLQITHAAVHT 205
           P F+ I H   HT
Sbjct: 185 PFFVYIPHTMPHT 197


>gi|254516321|ref|ZP_05128380.1| steryl-sulfatase [gamma proteobacterium NOR5-3]
 gi|219674744|gb|EED31111.1| steryl-sulfatase [gamma proteobacterium NOR5-3]
          Length = 500

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/205 (31%), Positives = 99/205 (48%), Gaps = 25/205 (12%)

Query: 23  GWNDVGFHGEND----IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
           G+ D+G +G       +PTP ID LA  G++L + +  P CTP+RAA LTG+Y  R G+ 
Sbjct: 49  GYGDLGVYGSGGELRGMPTPRIDQLASEGMMLTQFFVEPGCTPTRAALLTGRYSQRAGLG 108

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-HVGYW- 136
           + + AG    +  +E  L +  K  GY+T + GKWH+G  K+  LP N+GFD  HVG   
Sbjct: 109 SIIIAGTPSTLQDSEVTLAELFKSQGYATAMTGKWHLGGEKQS-LPINQGFDEWHVGILQ 167

Query: 137 --NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSS--------------KYLTDFFTDQ 180
             +G L Y D +  + F+    A+     +  +                 +++     + 
Sbjct: 168 TTDGVL-YPDGMRRSGFSEAAIAKSQTAIWESEPGKDVVKKVRPYDLEYRRHIEGDIAEA 226

Query: 181 SVHVIKSHNHSR-PLFLQITHAAVH 204
           SV  IK     + P FL +  + VH
Sbjct: 227 SVKYIKEQAKEKEPFFLYVGWSHVH 251


>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
 gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
          Length = 522

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/217 (31%), Positives = 102/217 (47%), Gaps = 20/217 (9%)

Query: 8   GVAKAVPVTEKLL----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPS 62
           GVA+A+     LL      GW D+G +GE    TPN+D +A  G++    YT  P C+PS
Sbjct: 22  GVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPS 81

Query: 63  RAAFLTGKYPFRYGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
           RAA LTG+ P R G  T  G          +   +P +E LLP  LK  GY++ ++GKWH
Sbjct: 82  RAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWH 141

Query: 115 IGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS- 170
           +G ++ +  P   GFD   G  N +    D+    +  V  D     R  E +   + + 
Sbjct: 142 LG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTG 200

Query: 171 -KYLTDFFTDQSVHVIKSHNHS-RPLFLQITHAAVHT 205
              LT  +  +++  I+    + RP FL     A H 
Sbjct: 201 EANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA 237


>gi|417302808|ref|ZP_12089892.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327540882|gb|EGF27442.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 480

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 62/261 (23%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
           G+ + G  G  +IPTP IDALA +G+     Y   + C+PSRA FL+G+Y  R+G D  P
Sbjct: 46  GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105

Query: 81  VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
            G         +P  +K   ++L+  GY T LIGKWH+G    + +P ++GFD   G+  
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYQTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164

Query: 136 --------------W---------NGYLTYND--------SIHETDFAVG---LDARRNM 161
                         W          G    N          I+E D+  G   LD    +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPTGQFETNQRTIRGNYARINEPDYDAGNPMLDGSEPI 224

Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
           + +       YLTD  TD+++  I +   S P  + +++ AVH+    +           
Sbjct: 225 DHW------NYLTDTITDKAIDAI-TQTASNPFAMVVSYNAVHSPMQASL---------- 267

Query: 222 DMEENDRTFAHISNPDRRLFA 242
              E+     HI +P RR+FA
Sbjct: 268 ---EDHAAMEHIDDPQRRIFA 285


>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
          Length = 328

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/189 (34%), Positives = 92/189 (48%), Gaps = 16/189 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALA-YNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G+   + I TPNID LA   G++L+  Y    C+PSRA+FLTG+ P       P 
Sbjct: 11  GYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRVPIHVHQGNP- 68

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           G   A    +   ++P  L+  GY T ++GKWH G +  E LP NRGFD   GY +G   
Sbjct: 69  GLAAAGCTNLNYTMIPAQLRRAGYRTAMVGKWHQGASLPECLPVNRGFDTSFGYLSGEED 128

Query: 142 YND------SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
           + D        + TDF   LD+   + R     S +Y      D  V +I+ H   +PL 
Sbjct: 129 HMDQTTNGGQCNVTDFW--LDSGPAIRRNGTYSSFQY-----NDAIVDIIQQHAPEQPLM 181

Query: 196 LQITHAAVH 204
           L      VH
Sbjct: 182 LYAALQNVH 190


>gi|32473691|ref|NP_866685.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
           SH 1]
 gi|32444227|emb|CAD74224.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
           SH 1]
          Length = 480

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 62/261 (23%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
           G+ + G  G  +IPTP IDALA +G+     Y   + C+PSRA FL+G+Y  R+G D  P
Sbjct: 46  GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105

Query: 81  VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
            G         +P  +K   ++L+  GY T LIGKWH+G    + +P ++GFD   G+  
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYQTSLIGKWHLGTRPSQ-VPTSKGFDRFFGFLH 164

Query: 136 --------------W---------NGYLTYNDS--------IHETDFAVG---LDARRNM 161
                         W          G    N          I+E D+  G   LD    +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPTGRFETNQKTIRGNYARINEPDYDAGNPMLDGSEPI 224

Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
           E +       YLTD  TD+++  I +   S+P  + +++ AVH+    +           
Sbjct: 225 EHW------NYLTDSITDKAIDAI-TQTASKPFAMVVSYNAVHSPMQASL---------- 267

Query: 222 DMEENDRTFAHISNPDRRLFA 242
              E+      I +P RR+FA
Sbjct: 268 ---EDHAAMELIDDPQRRIFA 285


>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 484

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/220 (31%), Positives = 104/220 (47%), Gaps = 27/220 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI--- 77
           QG+ND+G    +D+ TP++D LA  G  L   Y   P CTPSRA+ LTG+YP R GI   
Sbjct: 38  QGYNDLGVLN-SDLITPHLDRLAAEGTRLTDFYVAWPACTPSRASLLTGRYPQRNGIYDM 96

Query: 78  ---------------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
                          +  V       +   EKLLP+YLK+LGY++ + GKW +G  K   
Sbjct: 97  IRNEAPDYGYKYKPAEYEVSFERIGGMDQREKLLPEYLKKLGYTSAIFGKWDLGSLK-RF 155

Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
           LP NRGFD   G+ N  + Y    HE     G+ +         +   +Y T+ F  +++
Sbjct: 156 LPTNRGFDEFYGFVNTGIDY--FTHER---YGVPSMFRQTSLTEEDRGEYATELFKREAL 210

Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
             +     S P  L +   A H  ++ + ++    +Q P+
Sbjct: 211 AFLDRAEASEPFLLYLPFNAPHNSSSLDPRI-RSTVQAPE 249


>gi|47230520|emb|CAF99713.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 554

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 99/202 (49%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G+    TPN+DA+A  G++    YT  P C+PSRAA LTG+ P R G  T  
Sbjct: 18  GWGDLGVFGQPSKETPNLDAMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRNGFYTTN 77

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +   E LLPQ LK+ GY + ++GKWH+G ++ + LP   GFD  +
Sbjct: 78  GHARNAYTPQEIVGGISKDEILLPQMLKKRGYISKIVGKWHLG-HRPQYLPLEHGFDEWL 136

Query: 134 GYWNGYL-TYNDSIHET----DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
           G  N +   YN+S+       + +  L       R   +M    LT  +  +S+  ++  
Sbjct: 137 GAPNCHFGPYNNSVKPNIPVYNNSEMLGRYYEEFRIDRKMGESNLTQMYLLESLDFVRRQ 196

Query: 189 NHS-RPLFL----QITHAAVHT 205
             + RP FL      THA V+ 
Sbjct: 197 AEAQRPFFLYWAPDATHAPVYA 218


>gi|300114943|ref|YP_003761518.1| sulfatase [Nitrosococcus watsonii C-113]
 gi|299540880|gb|ADJ29197.1| sulfatase [Nitrosococcus watsonii C-113]
          Length = 463

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 93/187 (49%), Gaps = 12/187 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYG---ID 78
           G+ DVG +G   I TPN+DALA  G    + H   P CTP+RAA LTG Y  R G   I 
Sbjct: 53  GYGDVGCYGNQHIKTPNLDALAKRGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLQIIP 112

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                 +AKA+P+ E    + LK +GYST LIGKWH+G ++    P  +GFD + G    
Sbjct: 113 KDQRYAMAKAMPLAEITFAEALKAVGYSTALIGKWHLG-DRPSFSPSRQGFDEYFG---- 167

Query: 139 YLTYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            + Y+  +H    +   L   RN E          +T + T+++V  I  H  S P  L 
Sbjct: 168 -IPYSHDMHPWRKSFPPLPLMRNEEIIELNPDLDDMTQYCTEEAVQFISKHK-SNPFLLY 225

Query: 198 ITHAAVH 204
           + H   H
Sbjct: 226 MPHPMPH 232


>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 446

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 89/185 (48%), Gaps = 6/185 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           GW DV +HG  D  TP IDA+A  G+   + Y   + C PSRA  LTG+Y   +G+ T  
Sbjct: 31  GWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGILTGRYQQLFGVVT-- 88

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-WNGYL 140
                K +P ++K + + LK  GY +   GKWH+G  K +  P +RGFD   G+ +  + 
Sbjct: 89  NGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQ-FPNDRGFDTFYGFHFGAHD 147

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y           G       +         YLT+  TD +V  I+  N  +P F+ + +
Sbjct: 148 YYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFIEE-NKDQPFFMYVAY 206

Query: 201 AAVHT 205
            +VH+
Sbjct: 207 NSVHS 211


>gi|421613374|ref|ZP_16054460.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
 gi|408495968|gb|EKK00541.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
          Length = 480

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 75/261 (28%), Positives = 115/261 (44%), Gaps = 62/261 (23%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
           G+ + G  G  +IPTP IDALA +G+     Y   + C+PSRA FL+G+Y  R+G D  P
Sbjct: 46  GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105

Query: 81  VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
            G         +P  +K   ++L+  GY T LIGKWH+G    + +P ++GFD   G+  
Sbjct: 106 TGERNNHPIAGLPPQQKTFIEHLQSAGYLTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164

Query: 136 --------------WN---------GYLTYND--------SIHETDFAVG---LDARRNM 161
                         W          G    N          I+E D+  G   LD    +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPAGQFKTNQRTIRGNYARINEPDYDAGNPMLDGSEPI 224

Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
           + +       YLTD  TD+++  I +   S+P  + +++ AVH+    +           
Sbjct: 225 DHW------NYLTDTITDKAIDSI-TQTPSKPFAMVVSYNAVHSPMQASL---------- 267

Query: 222 DMEENDRTFAHISNPDRRLFA 242
              E+     HI +P RR+FA
Sbjct: 268 ---EDHAAMEHIDDPQRRIFA 285


>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 467

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 71/229 (31%), Positives = 104/229 (45%), Gaps = 30/229 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
           G+ D GF G  +  TP +D LA   I  ++ Y +   C PSRA  LTGKY  ++G +   
Sbjct: 38  GYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSAAVCGPSRAGILTGKYQQKFGFEENN 97

Query: 79  ------TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
                 T    G    +P+ +  +  YL++LGY T L GKWH G N +   P  RGFD  
Sbjct: 98  VPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKTALFGKWHQG-NADRFHPTKRGFDEF 156

Query: 133 VGYWNG---YLTYND----SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
            G+  G   Y+ Y+D    S +E     G       E         YLTD    +++  I
Sbjct: 157 YGFRGGARSYMPYDDSNPLSKNEDRLERGFGNFLEHE--------GYLTDELAHEAISFI 208

Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
            + N   P F+ ++  AVHT     A+    L Q P ++   +T A ++
Sbjct: 209 -NRNKKHPFFIYLSFNAVHTPMEATAE---DLEQFPHLKGKRKTLAAMT 253


>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 109/243 (44%), Gaps = 39/243 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF--------- 73
           G+++ G   + +  TPN+D LA +G++L++ Y+   CTPSR++FL+G+ P          
Sbjct: 52  GFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGRLPLHVFHSNRVS 111

Query: 74  --RYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
              +    P  AGV   +P     +P ++++ GY TH++GKW  G    +  P  RGFD+
Sbjct: 112 SASWDSQHPDTAGV--GIPRNMTTIPAFMRKAGYKTHMVGKWDAGIATPQHSPLGRGFDS 169

Query: 132 HVGYW---NGYLTYN--DSI--------HETDFAVGLDARRNMERYAPQMSSKYLTDFFT 178
            + Y+   N Y  YN  D++            +  G     N    A      Y    F 
Sbjct: 170 SLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPADAPIGTYEEHVFR 229

Query: 179 DQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDR 238
           D ++ VI  H+ S PLFL       H             LQVP     DR F HI +P R
Sbjct: 230 DHALDVISKHDASTPLFLYYASHIAHAP-----------LQVPQAYL-DR-FQHIPDPIR 276

Query: 239 RLF 241
           R +
Sbjct: 277 RTY 279


>gi|332263239|ref|XP_003280658.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Nomascus leucogenys]
          Length = 528

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 48  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 107

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   VP +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 108 AHARNAYTPQEIVGGVPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 166

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 167 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 226

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 227 ARRHPFFLYWAVDATHAPVYA 247


>gi|351712929|gb|EHB15848.1| N-acetylgalactosamine-6-sulfatase [Heterocephalus glaber]
          Length = 482

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 99/202 (49%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA L+G+ P R G  T  
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLSGRPPIRSGFYTTN 61

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP  LKE GY+T ++GKWH+G ++ +  P   GFD   
Sbjct: 62  AHARNAYTPQEIVGGIPDSERLLPSLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFDEWF 120

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+  + +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 121 GSPNCHFGPYDNKAKPNIPVYKDWEMVGRFYEEFPINVKTGESNLTQIYLQEALDFIKRQ 180

Query: 189 NHS-RPLFL----QITHAAVHT 205
             + RP FL      THA V+ 
Sbjct: 181 QAARRPFFLYWAVDATHAPVYA 202


>gi|87309449|ref|ZP_01091584.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
 gi|87287757|gb|EAQ79656.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
          Length = 478

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 95/201 (47%), Gaps = 28/201 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G  D PTP++D LA  G +    Y T   C+ SRA  LTG Y  R GI   +
Sbjct: 36  GYADIGPFGAKDYPTPHLDQLAQEGTICTDFYVTQAVCSASRAGLLTGCYNNRIGILGAL 95

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNGY 139
           G      +   E  L +  K+ GY+T   GKWH+G + EE LP   GFD++VG  Y N  
Sbjct: 96  GPQSKIGISAEETTLAEICKQKGYATACYGKWHLG-HHEEFLPLQHGFDDYVGLPYSNDM 154

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQM----------------SSKYLTDFFTDQSVH 183
             Y+  +        L   +  +RY P +                  + LT  +T+++V 
Sbjct: 155 WPYHPELRH------LTKDQQQKRY-PDLPLYEKNEIIDTEVTPEDQRNLTTLYTEKAVK 207

Query: 184 VIKSHNHSRPLFLQITHAAVH 204
            I   NH++P FL + H+ VH
Sbjct: 208 FIDD-NHAQPFFLYVPHSMVH 227


>gi|449136003|ref|ZP_21771428.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448885345|gb|EMB15791.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 480

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 91/192 (47%), Gaps = 25/192 (13%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QG+ DVG  G  DI TP +DA+A +G+     Y  P C PSRAA +TG YP R       
Sbjct: 23  QGYQDVGCFGSPDIRTPRLDAMAKDGMKFTSFYAQPICGPSRAALMTGCYPLRVA----E 78

Query: 82  GAGVAKAVPV---TEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNHV 133
              + +  P+    E  + + LK  GY+T   GKW +  + +     +LLP  +GFD   
Sbjct: 79  RGHIKQIHPILHEDEITIAEVLKTKGYATACFGKWDLAKHTQTDFFPDLLPTGQGFD--- 135

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
            Y+ G  T ND +         +  RN E   P      LT  +TD+++  I+  N  +P
Sbjct: 136 -YFYGTPTSNDRV--------ANLYRNKELIEPDSDMATLTQRYTDEAISFIE-QNQDQP 185

Query: 194 LFLQITHAAVHT 205
            F+ I H   HT
Sbjct: 186 FFVYIPHTMPHT 197


>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
           [Oryzias latipes]
          Length = 525

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 93/202 (46%), Gaps = 24/202 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G+    TPN+DA+A  GI+    YT  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-- 131
           G          +   +   E LLPQ LKE GY   ++GKWH+G ++ + LP   GFD   
Sbjct: 102 GHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWF 160

Query: 132 -----HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYA--PQMSSKYLTDFFTDQSVHV 184
                H G +N  +  N  ++     +G    R  E +    +     LT  + +  +  
Sbjct: 161 GAPNCHFGPYNNTVRPNIPVYNNSEMLG----RYFEEFKIDKKTGESNLTQMYLEAGLDF 216

Query: 185 IKSHNHS-RPLFLQITHAAVHT 205
           I     + RP FL     A H+
Sbjct: 217 ISRQAEAKRPFFLYWAADATHS 238


>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
           africana]
          Length = 513

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 99/202 (49%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P +E LLP+ LK+  Y+T ++GKWH+G ++ +  P   GFD   
Sbjct: 102 GHARNAYTPQDIVGGIPDSEHLLPELLKKANYATKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIK-S 187
           G  N +    D+  + +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 161 GSPNCHFGPYDNRAKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220

Query: 188 HNHSRPLFL----QITHAAVHT 205
            +  RP FL      THA V+ 
Sbjct: 221 QSQQRPFFLYWAIDATHAPVYA 242


>gi|403255190|ref|XP_003920329.1| PREDICTED: arylsulfatase E isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 614

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 71/124 (57%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA  G+ L +H +  + CTPSRAAFLTG+YP R G+ +  
Sbjct: 74  GIGDIGCYGNNTMRTPNIDHLAEFGVKLTQHVSAASLCTPSRAAFLTGRYPIRSGMVSST 133

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       GV   +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 134 GHRVLQWTGVPGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query: 131 NHVG 134
           +  G
Sbjct: 194 SFYG 197


>gi|254482499|ref|ZP_05095738.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
 gi|214037190|gb|EEB77858.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
          Length = 602

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 64/186 (34%), Positives = 94/186 (50%), Gaps = 10/186 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ND+  +  +D PTP +DA+A  G+   RHY   +CT SR A LTG+YP R G   P  
Sbjct: 27  GYNDLAINNGSDSPTPRLDAIAAQGVRFTRHYAESSCTASRVALLTGRYPARVGAH-PYL 85

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-YLT 141
            G+   +      LP  L   GY  H++GKWH G +  E  P  +GFD+  G+ N  YL 
Sbjct: 86  NGIDHEL----MTLPDALGSEGYIRHMVGKWHTGDSHRESRPEYQGFDHWFGFINQLYLR 141

Query: 142 --YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
             +  + +       ++     E    Q    +LTD  TD+++ VIK   +  P FL ++
Sbjct: 142 GPHRSANYRRGKPTYINPWLENELGDLQQYEGHLTDILTDRALDVIKREQN--PWFLYLS 199

Query: 200 HAAVHT 205
           + A HT
Sbjct: 200 YYAPHT 205


>gi|403255188|ref|XP_003920328.1| PREDICTED: arylsulfatase E isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 589

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 71/124 (57%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA  G+ L +H +  + CTPSRAAFLTG+YP R G+ +  
Sbjct: 49  GIGDIGCYGNNTMRTPNIDHLAEFGVKLTQHVSAASLCTPSRAAFLTGRYPIRSGMVSST 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       GV   +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GHRVLQWTGVPGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 SFYG 172


>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
           [Oryzias latipes]
          Length = 523

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 93/202 (46%), Gaps = 24/202 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G+    TPN+DA+A  GI+    YT  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-- 131
           G          +   +   E LLPQ LKE GY   ++GKWH+G ++ + LP   GFD   
Sbjct: 102 GHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWF 160

Query: 132 -----HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYA--PQMSSKYLTDFFTDQSVHV 184
                H G +N  +  N  ++     +G    R  E +    +     LT  + +  +  
Sbjct: 161 GAPNCHFGPYNNTVRPNIPVYNNSEMLG----RYFEEFKIDKKTGESNLTQMYLEAGLDF 216

Query: 185 IKSHNHS-RPLFLQITHAAVHT 205
           I     + RP FL     A H+
Sbjct: 217 ISRQAEAKRPFFLYWAADATHS 238


>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 434

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 59/186 (31%), Positives = 96/186 (51%), Gaps = 5/186 (2%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G + + TP++D LA  GI     Y+  P C+PSRA+ LTGKYP + G+ + +
Sbjct: 15  GYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVTSIL 74

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           G     K + + +  L   LKE GY T L GKWH+G +  E  P   GFD   G+  G +
Sbjct: 75  GGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASA-EYGPNAHGFDQFYGFRAGCI 133

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
            Y   I       G++   ++ R   ++  + +Y+T+  T ++   I +     P F+ +
Sbjct: 134 DYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREATSYIDAAPDDEPYFMYV 193

Query: 199 THAAVH 204
            + A H
Sbjct: 194 AYNAPH 199


>gi|189053665|dbj|BAG35917.1| unnamed protein product [Homo sapiens]
          Length = 589

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 72/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +   P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGPPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
           domestica]
          Length = 520

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 73/231 (31%), Positives = 103/231 (44%), Gaps = 30/231 (12%)

Query: 3   TPVGAGVAKAVPVTEKLL--PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTC 59
            P+GAG     P    LL    GW D+G  GE    TP++D +A  G++    YT  P C
Sbjct: 17  APLGAGATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAAEGMLFPNFYTANPLC 76

Query: 60  TPSRAAFLTGKYPFRYGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIG 111
           +PSRAA LTG+ P R G  T  G          +   +P +E LLP+ LK+ GY   ++G
Sbjct: 77  SPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVG 136

Query: 112 KWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP----- 166
           KWH+G ++ +  P   GFD   G  N +    D+    +  V     RN E         
Sbjct: 137 KWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKAMPNIPV----YRNWEMVGRFYEDF 191

Query: 167 ----QMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL----QITHAAVHTGTA 208
               +     LT  +  ++V  IK    H +P FL      THA V+   +
Sbjct: 192 PINHKTGEANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDATHAPVYASKS 242


>gi|791004|emb|CAA58556.1| ARSE [Homo sapiens]
          Length = 589

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 73/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GF+
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFE 168

Query: 131 NHVG 134
           +  G
Sbjct: 169 HFYG 172


>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 476

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 95/194 (48%), Gaps = 24/194 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---- 77
           G+ D+  +G  D+ TPNIDAL  +G+  +R Y   P C+P+RAA LTG YP   G+    
Sbjct: 42  GYGDLSSYGAADLKTPNIDALVASGVRFDRFYANSPVCSPTRAALLTGCYPDLVGVPGVI 101

Query: 78  ----DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
               D   G    +AV     LLPQ LK  GY T L+GKWH+G +    LP  RGFD   
Sbjct: 102 RTHPDDSWGVLSPQAV-----LLPQVLKGAGYHTALVGKWHLGLSGAS-LPSRRGFDLFH 155

Query: 134 GYWNGYLT--YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
           G+    +   +N   H  ++    D   + + +A  + S++  DF  +       S    
Sbjct: 156 GFLGDMMDDYHNHRRHGINYMRRDDREIDPKGHATDLFSQWAIDFLNE-------SKGQD 208

Query: 192 RPLFLQITHAAVHT 205
           RP FL++ +   HT
Sbjct: 209 RPFFLELAYNVPHT 222


>gi|254435647|ref|ZP_05049154.1| sulfatase, putative [Nitrosococcus oceani AFC27]
 gi|207088758|gb|EDZ66030.1| sulfatase, putative [Nitrosococcus oceani AFC27]
          Length = 463

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 94/187 (50%), Gaps = 12/187 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYG---ID 78
           G+ DVG +G   I TPN+DALA  G    + H   P CTP+RAA LTG Y  R G   I 
Sbjct: 53  GYGDVGCYGNQHIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIP 112

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                 +AKA+ + E    + LK +GYST L+GKWH+G ++   LP  +GFD + G    
Sbjct: 113 KDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLG-DRPAFLPPRQGFDEYFG---- 167

Query: 139 YLTYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            + Y+  +H    +   L   R  E         +LT + T+++V  I S N  RP  L 
Sbjct: 168 -IPYSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAVKFI-SKNKDRPFLLY 225

Query: 198 ITHAAVH 204
           + H   H
Sbjct: 226 MPHPMPH 232


>gi|77164258|ref|YP_342783.1| sulfatase [Nitrosococcus oceani ATCC 19707]
 gi|76882572|gb|ABA57253.1| Sulfatase [Nitrosococcus oceani ATCC 19707]
          Length = 440

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 94/187 (50%), Gaps = 12/187 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYG---ID 78
           G+ DVG +G   I TPN+DALA  G    + H   P CTP+RAA LTG Y  R G   I 
Sbjct: 30  GYGDVGCYGNQHIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIP 89

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                 +AKA+ + E    + LK +GYST L+GKWH+G ++   LP  +GFD + G    
Sbjct: 90  KDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLG-DRPAFLPPRQGFDEYFG---- 144

Query: 139 YLTYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            + Y+  +H    +   L   R  E         +LT + T+++V  I S N  RP  L 
Sbjct: 145 -IPYSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAVKFI-SKNKDRPFLLY 202

Query: 198 ITHAAVH 204
           + H   H
Sbjct: 203 MPHPMPH 209


>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
 gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
          Length = 472

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 72/229 (31%), Positives = 107/229 (46%), Gaps = 30/229 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+GF G   I TP++D LA NG+   + Y +   C+PSRA F+TG     +G D  +
Sbjct: 46  GYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQVEFGHDNNL 105

Query: 82  GAGVA-------KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            AGV          +P+++K +  +L +LGY   LIGKWH+G  + +  P  RGFD   G
Sbjct: 106 -AGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLG-KEPQFHPLKRGFDEFWG 163

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
           Y  G   Y +S+       G   +  +E  +       Y+TD   ++SV  I+ H    P
Sbjct: 164 YTGGGHDYFESLPN-----GKGYKEPLESNFKTPDPITYITDDVGNESVDFIERHK-DEP 217

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            FL     A HT                 +EE+   + HI +  RR +A
Sbjct: 218 FFLFAAFNAPHTPMQA-------------LEEDLALYQHIEDKKRRTYA 253


>gi|402909420|ref|XP_003917419.1| PREDICTED: arylsulfatase E [Papio anubis]
          Length = 687

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 72/124 (58%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G   + TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 147 GIGDIGCYGNTTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 206

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 207 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 266

Query: 131 NHVG 134
           +  G
Sbjct: 267 HFYG 270


>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_a [Homo
           sapiens]
          Length = 708

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241


>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
 gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
          Length = 703

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 72/229 (31%), Positives = 107/229 (46%), Gaps = 25/229 (10%)

Query: 23  GWNDVGFHGENDIP----TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
           G  DVG++   + P    TP +D+LA   + L  +Y  P CTP+RAA LTG+Y    G+ 
Sbjct: 73  GVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCTPTRAALLTGRYAVNVGMP 132

Query: 79  TPVGAGVAKAVPVTEKLLPQYLK-ELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            P+       +  +   LP+ LK E  YSTHL+GKWH+G  K +  P  RGFD+  G   
Sbjct: 133 FPLIGDAISGLDGSIPTLPEMLKSEANYSTHLVGKWHLGAAKAKNRPLARGFDSFYGLLG 192

Query: 138 GYLT-YNDSIHETDFAVGLDARRN-MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHS--- 191
                Y   + E       D  +N  E  A ++  K + T  F+ ++V VI+ H+     
Sbjct: 193 ASFDHYTKKMGEVR-----DLWKNEAEVPAKEVDEKEHATTLFSREAVKVIEEHSARGHA 247

Query: 192 -------RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
                   PLFL + ++A H     + K       VP+   + RTF  +
Sbjct: 248 GAKDGDMDPLFLYLAYSAPHAPLQADEKFMKLCSDVPN--RHRRTFCAM 294


>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
          Length = 522

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 97/202 (48%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    YT  P C+PSRAA LTG+ P R G  T  
Sbjct: 41  GWGDLGVYGEPSRETPNLDQMATEGMLFPNFYTANPLCSPSRAALLTGRLPIRSGFYTTN 100

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P +E LLP  LK  GY++ ++GKWH+G ++ +  P   GFD   
Sbjct: 101 GHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 159

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  I+  
Sbjct: 160 GSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANLTQIYLQEALEFIQRQ 219

Query: 189 NHS-RPLFL----QITHAAVHT 205
             + RP FL      THA V+ 
Sbjct: 220 QAAHRPFFLYWAVDATHAPVYA 241


>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
           mulatta]
          Length = 503

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 23  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 82

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 83  AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 141

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 142 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 201

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 202 ARHHPFFLYWAVDATHAPVYA 222


>gi|441523101|ref|ZP_21004735.1| arylsulfatase [Gordonia sihwensis NBRC 108236]
 gi|441457320|dbj|GAC62696.1| arylsulfatase [Gordonia sihwensis NBRC 108236]
          Length = 783

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 64/197 (32%), Positives = 103/197 (52%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++DVG  G  +IPTPNID LA +G  L+ ++T P C+P+RAA LTG  P R G  +   
Sbjct: 56  GYSDVGPFGA-EIPTPNIDRLARSGFRLSNYHTTPVCSPARAALLTGVNPHRAGYGSVAN 114

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-------ELLPFNRGFD 130
             P   G+   +      LP+ L+E GY+T  +GKWH+  + +       +  P  RGFD
Sbjct: 115 SDPGFPGLRLELADDVLTLPEILRESGYATFAVGKWHLVRDADMSPGRSRKSWPLQRGFD 174

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KS 187
           ++ G   G     +S    +  +  ++  +++ Y       Y+TD  TD+++  I   ++
Sbjct: 175 SYYGSLEGL----NSFFNPNQLIADNSVVDVDEYP---DGYYVTDDLTDRAIDQITALRA 227

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL   H A+H
Sbjct: 228 HDSDKPFFLYFAHIAMH 244


>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
 gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
 gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
 gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_c [Homo
           sapiens]
          Length = 522

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241


>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 460

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 87/184 (47%), Gaps = 16/184 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG-IDTP 80
           G+ D G   +    TPN+D L   G  ++  Y   P C P+RAA LTG+YP R G +DT 
Sbjct: 19  GYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLTGRYPHRTGAVDTY 78

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
              G    + + E  L    +  GY T LIGKWH+G   +E  P  RGFD  +G+  G+ 
Sbjct: 79  EAIG-GDRMALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHPCRRGFDTFIGFRGGWS 137

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y        +   LD    +E         Y+TD  T++S+  I+  N  +P FL   +
Sbjct: 138 DY--------YQYKLDRNGILE----ASDGTYMTDVITEESIRFIRE-NREQPFFLHAAY 184

Query: 201 AAVH 204
            A H
Sbjct: 185 NAPH 188


>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 477

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 63/186 (33%), Positives = 94/186 (50%), Gaps = 4/186 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           QG++DVG +G +DI TP++D LA  G+     Y     C+ SRAA LTG YP R GI   
Sbjct: 41  QGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAVCSASRAALLTGTYPNRLGIHGA 100

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +   E  +   LK LGY+T + GKWH+G +  E LP N+GFD + G      
Sbjct: 101 LDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLG-HHPEFLPTNQGFDEYFGIPYSND 159

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYL-TDFFTDQSVHVIKSHNHSRPLFLQIT 199
            + +     D+   L   +N +      + + + T +FT++S+  I+  N  RP FL + 
Sbjct: 160 MWPNHPQTKDYYPPLPIYQNDKVVDTIWNDQSMFTTWFTEKSIDFIE-RNKDRPFFLYLA 218

Query: 200 HAAVHT 205
           H   H 
Sbjct: 219 HPMPHV 224


>gi|395508489|ref|XP_003758543.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Sarcophilus harrisii]
          Length = 492

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 19/207 (9%)

Query: 20  LPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID 78
           L  GW D+G  GE    TP++D +A  G++    YT  P C+PSRAA LTG+ P R G  
Sbjct: 31  LQMGWGDLGVFGEPSKETPHLDQMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFY 90

Query: 79  TPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
           T             +   +P +E LLP+ LK+ GY   ++GKWH+G ++ +  P   GFD
Sbjct: 91  TTNAHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLG-HRPQFHPLKHGFD 149

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVI 185
              G  N +    D+    +  V  +     R  E +   + +    LT  +  ++V  I
Sbjct: 150 EWFGAPNCHFGPYDNKARPNIPVYRNWEMVGRFFEDFPINLKTGEANLTQIYLQEAVDFI 209

Query: 186 KSHNHSRPLFL----QITHAAVHTGTA 208
           K   H +P FL      THA V+   +
Sbjct: 210 KQQAHQQPFFLYWAVDATHAPVYASKS 236


>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
          Length = 522

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241


>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
           gorilla]
          Length = 528

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 48  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 107

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 108 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 166

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 167 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 226

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 227 ARHHPFFLYWAVDATHAPVYA 247


>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
          Length = 522

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVYGEPSRETPNLDRMAAGGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241


>gi|149199717|ref|ZP_01876749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
 gi|149137234|gb|EDM25655.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
          Length = 486

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 93/197 (47%), Gaps = 13/197 (6%)

Query: 27  VGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----DTP-- 80
           +  +G  DI TPNIDALA  G++ N  Y++P+CTPSR   LTGKYPFR G     D P  
Sbjct: 48  ISCYGAEDIKTPNIDALAAGGMIFNNAYSMPSCTPSRTTLLTGKYPFRTGYVNHWDVPRW 107

Query: 81  -VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR-GFDNHVGYWNG 138
            +G    K  P T     + +K+LGY T   GKW +   + E L   + GFD+    W G
Sbjct: 108 GIGYFDWKQKPNTT--FARLMKDLGYRTFATGKWQLNDFRLEPLAMQKHGFDDWA-MWTG 164

Query: 139 YLTYNDSIHETDFAVG-LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
             T  D  HE        +A  N +  +     ++  D +TD  ++ ++  N  +P+ + 
Sbjct: 165 CETSKDKTHEKKSTQRYWNAHINTKEGSKTYKGQFGPDLYTDHLINFMRK-NKDKPMCIY 223

Query: 198 ITHAAVHTGTAGNAKLP 214
                 HT  A     P
Sbjct: 224 YPMVLPHTPVAATPDEP 240


>gi|296234833|ref|XP_002762635.1| PREDICTED: arylsulfatase E [Callithrix jacchus]
          Length = 449

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 71/124 (57%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA  G+ L +H +  + CTPSRAAFLTG+YP R G+ + V
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEFGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSV 108

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
           G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESAGDHCHHPLHHGFD 168

Query: 131 NHVG 134
              G
Sbjct: 169 YFYG 172


>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
          Length = 482

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 62  AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 120

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 181 ARHHPFFLYWAVDATHAPVYA 201


>gi|344237970|gb|EGV94073.1| N-acetylgalactosamine-6-sulfatase [Cricetulus griseus]
          Length = 483

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 99/201 (49%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 2   GWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTSN 61

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 62  GHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 120

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +   + +    LT  +  +++  I++ 
Sbjct: 121 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQ 180

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 181 HARQSPFFLYWAIDATHAPVY 201


>gi|167523060|ref|XP_001745867.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775668|gb|EDQ89291.1| predicted protein [Monosiga brevicollis MX1]
          Length = 221

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 97/185 (52%), Gaps = 14/185 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+ F G   I TPNIDAL   G++  + Y    C+PSRA+ L+G+Y   +G+   + 
Sbjct: 41  GYDDLYFRGHQ-IRTPNIDALQEEGLLFTQMYMQDVCSPSRASILSGRYAMHHGVTDWIP 99

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
              +  + + +  L   ++E GY T  +GKWH+G  K    P  RGF++ +GY++G   Y
Sbjct: 100 PRDSYGLMLNDTTLADKMREAGYDTRAVGKWHMGFYKWAYTPTFRGFNSFLGYYSGGEDY 159

Query: 143 NDSIHETDFAVGLDARRNMERY--------APQMSSKYLTDFFTDQSVHVIKSHNHSR-P 193
               HETD A   D  R+  R+        A  +  +Y T  F+++++ +I     +  P
Sbjct: 160 --FTHETDNAY--DMHRDEGRHCGPNCSIPAWDLKGQYSTTIFSEEAIRIINQRQAADPP 215

Query: 194 LFLQI 198
           LFL +
Sbjct: 216 LFLYL 220


>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
           paniscus]
          Length = 482

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 62  AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 120

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 181 ARHHPFFLYWAVDATHAPVYA 201


>gi|371776857|ref|ZP_09483179.1| sulfatase [Anaerophaga sp. HS1]
          Length = 542

 Score = 96.7 bits (239), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 70/206 (33%), Positives = 101/206 (49%), Gaps = 39/206 (18%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G  +I TPNID LA  GI   + Y    C PSRA+ LTG YP R GI+    
Sbjct: 41  GYSDLGCYG-GEIHTPNIDQLASQGIRFTQMYNTARCCPSRASLLTGHYPHRAGIN---- 95

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK------EEL-------------- 122
            G+   + +    + + LKE GY T + GKWH+   K      E+L              
Sbjct: 96  -GMGVNLSMNTATIAEVLKENGYHTGMTGKWHLSETKPLDDPTEQLRWLAHRVDYGSFSP 154

Query: 123 ---LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
               P NRGFD H G   G + Y D      F++  + +   E   P+    Y+TDF T+
Sbjct: 155 LENYPCNRGFDEHWGVIWGVVNYFDP-----FSLVHNEKPIKE--VPE--DFYMTDFITE 205

Query: 180 QSVHVIKSHNH-SRPLFLQITHAAVH 204
           +S+ +I S++   +P FL + H A H
Sbjct: 206 KSIELIDSYSKDDKPFFLYVAHTAPH 231


>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
 gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
          Length = 478

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 60/184 (32%), Positives = 92/184 (50%), Gaps = 14/184 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G +DI TPNID +A  GI      +  P C+PSRA  LTG+ P R GI+T  
Sbjct: 58  GYGDLGCFGASDIATPNIDRIAAEGIKFTSFLSASPVCSPSRAGLLTGRMPQRMGINTVF 117

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +   E  + + LK  GY T ++GKWH+G + E  LP N+GF  + G     + 
Sbjct: 118 FPESLTGMDPEEITIAEILKTKGYRTGIVGKWHLG-HLERFLPLNQGFYEYFG-----IP 171

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y++ +    +       R  E  A  +  +Y+T  +T++S+  I +    +P FL + H 
Sbjct: 172 YSNDMASVVYM------RGNEVEAYHVDQRYMTRTYTEESLKFIDASG-DQPFFLYLAHN 224

Query: 202 AVHT 205
             H 
Sbjct: 225 MPHV 228


>gi|340619482|ref|YP_004737935.1| sulfatase [Zobellia galactanivorans]
 gi|339734279|emb|CAZ97656.1| Sulfatase, family S1-17 [Zobellia galactanivorans]
          Length = 586

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 97/192 (50%), Gaps = 21/192 (10%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTP 80
           QG+ D+G++G   + TP IDA A   +  +     P C P+RAA +TG++P R G+ DT 
Sbjct: 39  QGFGDLGYYGNPHVKTPTIDAFARESVRFDEFIVSPVCAPTRAALMTGRHPLRTGVRDTY 98

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G  +     +T   L + LK+ GY+T ++GKWH+G N     P ++GFD  + + +G +
Sbjct: 99  RGGAIMSTNEIT---LAEMLKQEGYATGMVGKWHLGDNYPS-RPQDQGFDFTLRHLSGGI 154

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVIKSHNHSR 192
                   T       A+R+   + P +        S  Y +D FTD ++  I   N ++
Sbjct: 155 GQPGDWPNT-------AKRDSSYFNPVLWKNGEMFQSEGYCSDVFTDVAIDFI-DQNKAK 206

Query: 193 PLFLQITHAAVH 204
           P FL + + A H
Sbjct: 207 PFFLYLAYNAPH 218


>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
 gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
          Length = 480

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 114/257 (44%), Gaps = 54/257 (21%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
           G+ + G  G  +IPTP IDALA +G+     Y   + C+PSRA F++G+Y  R+G D  P
Sbjct: 46  GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFMSGRYQSRFGYDLNP 105

Query: 81  VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
            G         +P  +K   ++L+  GY T LIGKWH+G    + +P ++GFD   G+  
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYHTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164

Query: 136 --------------WNGYLTYNDSIHETDFAV------GLDARRNMERY----------A 165
                         W   +  ++S+    F        G  AR N   Y           
Sbjct: 165 EGHFYVPGPPYENVWT--MLRDNSLPAGQFETNQRTIRGNYARINEPAYDTGNPVLDGGE 222

Query: 166 PQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEE 225
           P     YLTD  TD++V  I S   S P  + +++ AVH+    +              E
Sbjct: 223 PIDDWNYLTDTITDKAVDTI-SQAASNPFAMVVSYNAVHSPMQASL-------------E 268

Query: 226 NDRTFAHISNPDRRLFA 242
           +     HI++P RR+FA
Sbjct: 269 DHAAMDHIADPQRRIFA 285


>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
 gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
          Length = 457

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 93/201 (46%), Gaps = 31/201 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRY------- 75
           G  D G +G  +  TP+ID LA  G+     Y  P C+P+RA+ +TGK+P R        
Sbjct: 43  GCKDAGCYGATNFSTPHIDRLANQGMRFTDAYAAPVCSPTRASLMTGKHPARLHLTNFIP 102

Query: 76  --GIDTPVGA----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK-EELLPFNRG 128
             G   P G     G    +P+ EK + Q L   GY   +IGKWH+G     E  P NRG
Sbjct: 103 QIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRG 162

Query: 129 FD-----NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH 183
           FD      H G +N +  + D   +  +A  L          P     YL D  TD+++ 
Sbjct: 163 FDRVVLSEHHGIFNYFYPFVDQ-QKWPYAGPL----------PGNPGDYLPDRLTDEAID 211

Query: 184 VIKSHNHSRPLFLQITHAAVH 204
            ++  N  RP FL ++H +VH
Sbjct: 212 FVRE-NRERPFFLYLSHWSVH 231


>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 466

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/231 (29%), Positives = 111/231 (48%), Gaps = 32/231 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
           G+ D G +G  DIPTP+IDALA  G+   N + T   C PSRA  L G+Y  R G     
Sbjct: 38  GYEDFGCYGSQDIPTPHIDALAKGGVRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNV 97

Query: 78  -DTPVGAGVAKAVPVTE--KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
            D P      + + +++  + +   ++  GY T  IGKWH G N+ +  P ++GF++  G
Sbjct: 98  SDLPADGYQMQDIGLSDTVRTIADQMQSNGYETMAIGKWHQG-NETKHHPLHKGFNHFFG 156

Query: 135 YWNGYLTY---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
           +  G+ ++     +I + +  +      N      +    YLTD FTD+++  ++     
Sbjct: 157 FIGGHRSFFPIRTAIKQEEKIL------NDYTEVDEKDVYYLTDMFTDKAISYMR-QKRD 209

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +P F+ +++ AVHT        P  L Q          FAH+ +  RR +A
Sbjct: 210 KPYFIYLSYNAVHTPVEAT---PQKLAQ----------FAHLKDAQRRSYA 247


>gi|374619517|ref|ZP_09692051.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302744|gb|EHQ56928.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 556

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 113/237 (47%), Gaps = 59/237 (24%)

Query: 23  GWNDVG-FHGEND---IPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI 77
           G+ND+  F G  D   + TP+ID LA +G+V  + Y+   TC PSRA  +TG+YP R G 
Sbjct: 76  GYNDISTFGGGLDGGRVKTPHIDQLAADGVVFTQSYSGAGTCAPSRAMLMTGRYPTRTGF 135

Query: 78  D-TPVGAGVA--------------------------------KAVPVTEKLLPQYLKELG 104
           + TP  +G+A                                + +P  E  + + LKE G
Sbjct: 136 EFTPTPSGMAPMLSRISAEMGRGTPSMIYDAALDESKPPYEQQGLPPEEVTIAEILKERG 195

Query: 105 YSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLD-------A 157
           Y+T  IGKWH+G  ++ + P  +GFD  +   +G     D  +  +  +  D       A
Sbjct: 196 YATFHIGKWHLG-RQDGMAPHEQGFDQSLLMASGLFLPEDDPNVVNAKLDFDPIDQFLWA 254

Query: 158 RR---------NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           R          + +R+ P     YLTD++TD+S+++I + N +RP FL + H  VHT
Sbjct: 255 RMAFANSFNSGDQDRFEP---GGYLTDYWTDESINIINA-NKNRPFFLYLGHWGVHT 307


>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 550

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 98/191 (51%), Gaps = 14/191 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
           G++D+G +G ++I TPNID LAY G+ L   Y    C P+RA+ +TG+YP + G+   D 
Sbjct: 41  GYSDLGAYG-SEIQTPNIDKLAYEGLRLREFYNNSICAPTRASLITGQYPHKAGVGYFDV 99

Query: 80  PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            +G    +     E L   + L++ GYST L GKWH+G N     P  RGFD   G   G
Sbjct: 100 NLGIPPYQGYLNKESLTFGEVLRQAGYSTLLSGKWHVG-NDSLHWPKQRGFDRFFGVIGG 158

Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH-SRP 193
              Y D+    +      V L+   + +R  P+ +S Y TD  T+ +V  +   N   +P
Sbjct: 159 GSNYFDAEPMPLGRQYPVVILE---DNQRQKPKANSYYFTDEITNHAVQFLDEQNKMDKP 215

Query: 194 LFLQITHAAVH 204
            FL + + A H
Sbjct: 216 FFLYLAYTAPH 226


>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
 gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
          Length = 474

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 22/200 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDT-- 79
           G+ D+G  G   I TP +D +A  G+   + Y+  P CTPSRAA LTG+YP R G+ +  
Sbjct: 48  GYGDLGCFGHPTIKTPALDQMAAEGMKFTQFYSAAPVCTPSRAALLTGRYPIRSGMCSDK 107

Query: 80  -----PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
                P   G    +P +E  L + LK  GY T  +GKWH+G +  + LP N GFD++ G
Sbjct: 108 RRVLFPNSGG---GIPASEVTLAEALKAAGYKTACVGKWHLG-HLPQFLPTNNGFDSYFG 163

Query: 135 --YWNGYLTYNDSIHETDFAVGLDAR-------RNMERYAPQMSSKYLTDFFTDQSVHVI 185
             Y N      D  H     +  + +       RN E          +T  +T++++ +I
Sbjct: 164 IPYSNDMDRVADRKHGRSIFLKPEVKFWNVPLMRNTEVVELPADQTTITKRYTEEAIKLI 223

Query: 186 KSHNHSRPLFLQITHAAVHT 205
           +  N  +P F+ + H   H 
Sbjct: 224 Q-QNKQQPFFIYLAHNMPHV 242


>gi|423226077|ref|ZP_17212543.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392630595|gb|EIY24583.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 483

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 105/210 (50%), Gaps = 22/210 (10%)

Query: 5   VGAGVAKAVPV-TEK-------LLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL 56
           VG    +A P  +EK       +   G++DV  +GE    TPNIDALA  GI     Y  
Sbjct: 19  VGVSCTEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAA 78

Query: 57  -PTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            P  +PSRA FLTG+YP R GI           +   E  + + LK  GY+T  IGKWH+
Sbjct: 79  SPISSPSRAGFLTGRYPARMGIQGVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHL 138

Query: 116 GCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTD 175
           G ++E+ LP  +GFD + G     + Y++ +    +  G      +E +   +++  +T 
Sbjct: 139 G-SREKYLPLQQGFDEYFG-----IPYSNDMSAQVYLRG----NEVEEFHIDINN--VTK 186

Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
            +T+++V  I+     +P FL + H+ +H 
Sbjct: 187 KYTEEAVDYIR-RKADQPFFLFLAHSMMHV 215


>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 444

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 17/193 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
           QG+ D+G +G  D  TP++D LA  GI     Y   T CTPSRA  LTG+YP R  +   
Sbjct: 44  QGYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSRAGLLTGQYPKRSNLHEA 103

Query: 81  VGAGVAKAVPVTE-KLLPQ------YLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           V        P +E  L PQ       LK  GYST  IGKWH+G +K+E +P+N+GFD   
Sbjct: 104 V------LFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEYMPYNQGFDTFY 156

Query: 134 GYWNGYLTYNDSIHETDF-AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           G        N      DF +  L    N +        +YLT  +T+++V  IK+    +
Sbjct: 157 GVPYSNDMDNYYYKNIDFQSPPLPFYENTKVIENGSDQRYLTKRYTEETVKRIKNRGE-K 215

Query: 193 PLFLQITHAAVHT 205
           P F+ + H   HT
Sbjct: 216 PFFIYLAHNMPHT 228


>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
           rubripes]
          Length = 519

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 99/206 (48%), Gaps = 28/206 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G+    TPN+DA+A  G++L   YT  P C+PSRAA LTG+ P R G  T  
Sbjct: 38  GWGDLGAFGQPSKETPNLDAMAAQGMLLLNFYTANPLCSPSRAALLTGRLPVRNGFYTTN 97

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-- 131
           G          +   +   E LLPQ LK+ GY   ++GKWH+G ++ + LP   GFD   
Sbjct: 98  GHARNAYTPQEIVGGISKDEILLPQMLKKRGYFNKIVGKWHLG-HRPQYLPLEHGFDEWF 156

Query: 132 -----HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYA--PQMSSKYLTDFFTDQSVHV 184
                H G +N  +  N  ++   + +G    R  E +    +     LT  +  + +  
Sbjct: 157 GAPNCHFGPYNNSVRPNIPVYRNSWMLG----RYYEEFKIDKKTGESNLTQMYLLEGLDF 212

Query: 185 IKSHNHS-RPLFL----QITHAAVHT 205
           I+S   + +P FL      THA V+ 
Sbjct: 213 IQSQAEAQKPFFLYWAPDATHAPVYA 238


>gi|297284666|ref|XP_002802639.1| PREDICTED: hypothetical protein LOC697850 [Macaca mulatta]
          Length = 1113

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 95/200 (47%), Gaps = 19/200 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
           GW D+G +GE    TPN+D +A  G +    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 81  -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GYWNGYLTYNDSIHETDFAVGLD---ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220

Query: 189 NHSRPLFL----QITHAAVH 204
               P FL      THA V+
Sbjct: 221 ARHHPFFLYWAVDATHAPVY 240


>gi|440718712|ref|ZP_20899155.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SWK14]
 gi|436436039|gb|ELP29830.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SWK14]
          Length = 480

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 74/261 (28%), Positives = 114/261 (43%), Gaps = 62/261 (23%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
           G+ + G  G  +IPTP IDALA +G+     Y   + C+PSRA FL+G+Y  R+G D  P
Sbjct: 46  GYGETGMMGNAEIPTPAIDALAQSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105

Query: 81  VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
            G         +P  +K   ++L+  GY T LIGKWH+G    + +P ++GFD   G+  
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYQTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164

Query: 136 --------------W---------NGYLTYND--------SIHETDFAVG---LDARRNM 161
                         W          G    N          I+E D+  G   LD    +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPTGRFETNQRTIRGNYARINEPDYDAGNPMLDDSEPI 224

Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
           + +       YLTD  T +++  I +   S+P  + +++ AVH+    +           
Sbjct: 225 DHW------NYLTDTITAKAIDAI-TQTASKPFAMVVSYNAVHSPMQASL---------- 267

Query: 222 DMEENDRTFAHISNPDRRLFA 242
              E+     HI +P RR+FA
Sbjct: 268 ---EDHAAMEHIGDPQRRIFA 285


>gi|223940482|ref|ZP_03632332.1| sulfatase [bacterium Ellin514]
 gi|223890844|gb|EEF57355.1| sulfatase [bacterium Ellin514]
          Length = 635

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 94/195 (48%), Gaps = 19/195 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ D+G  G     TPN+D +A  G+ L   Y  P CTPSRA  LTG Y  R  +   + 
Sbjct: 36  GYGDIGPFGSTLNRTPNLDRMAKEGMKLTSFYAAPLCTPSRAQILTGCYAKRVSLPKVLS 95

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 +   E+ + + LK  GY+T  IGKWH+G +  E LP   GFD+++G     L Y
Sbjct: 96  PRSEVGLNTNEQTVAKLLKRQGYATMAIGKWHVG-DAPENLPTRHGFDHYLG-----LPY 149

Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKY------------LTDFFTDQSVHVIKSHNH 190
           ++ +   +      A+R      P +  +             LT+ +TD++V  I++ N 
Sbjct: 150 SNDMGGEEPGKDQPAKRGARPPLPLVRDEQVIEVVKPADQDRLTERYTDEAVKFIRA-ND 208

Query: 191 SRPLFLQITHAAVHT 205
            +P FL + H AVH 
Sbjct: 209 KQPFFLYLAHTAVHA 223


>gi|344308474|ref|XP_003422902.1| PREDICTED: arylsulfatase E [Loxodonta africana]
          Length = 626

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G   + TPNI+ LA +G+ L +H    PTCTPSRAAFLTG+YP R G+ +  
Sbjct: 86  GIGDVGCYGNTTLRTPNINRLAEDGVTLTQHIAAAPTCTPSRAAFLTGRYPLRSGMVSSR 145

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G        V+  +P +E    + LK  GY+T LIGKWH+G N E        P N GFD
Sbjct: 146 GNRVLQWTAVSGGLPESETTFAKILKNEGYATGLIGKWHLGLNCESPSDHCHHPLNHGFD 205

Query: 131 NHVG 134
              G
Sbjct: 206 YFYG 209


>gi|326384383|ref|ZP_08206064.1| arylsulfatase [Gordonia neofelifaecis NRRL B-59395]
 gi|326196981|gb|EGD54174.1| arylsulfatase [Gordonia neofelifaecis NRRL B-59395]
          Length = 784

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 104/198 (52%), Gaps = 25/198 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+G  G  +IPTPNID +A +G  L+ ++T P C+P+RAA LTG  P R G  +   
Sbjct: 57  GYSDIGPFGA-EIPTPNIDRIAASGYRLSNYHTTPVCSPARAALLTGVNPHRAGYGSVAN 115

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-------ELLPFNRGFD 130
             P   G+   +      LP+ L+E GY+T  +GKWH+  + +       +  P  RGFD
Sbjct: 116 SDPGFPGLRLELADDVLTLPEILRESGYATFAVGKWHLVRDADMSPGRSRKSWPLQRGFD 175

Query: 131 NHVGYWNGYLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK--- 186
           ++ G   G  + +N +    D +V +D     E Y       Y+TD  TD+++  IK   
Sbjct: 176 SYYGSLEGLNSFFNPNQLIADNSV-VDVDEYPEGY-------YVTDDLTDRAIDQIKALR 227

Query: 187 SHNHSRPLFLQITHAAVH 204
           +H+  +P FL   H A+H
Sbjct: 228 AHDADKPFFLYFAHIAMH 245


>gi|313217411|emb|CBY38513.1| unnamed protein product [Oikopleura dioica]
          Length = 449

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 52/140 (37%), Positives = 75/140 (53%), Gaps = 7/140 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW DV ++ E  + TPN++ +   G      Y+  TC+PSRAA LTG Y +R G+D  P 
Sbjct: 89  GWADVSWNNEF-VKTPNLERIRKQGRTFTNLYSHSTCSPSRAALLTGIYAWRLGLDGAPF 147

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +P+  +L+P   K+L Y  H IGKWH G   + L P  RGFD+  G+++G + 
Sbjct: 148 NPTKVNGIPLGVELIPAKFKKLNYENHFIGKWHGGFCHQNLTPTERGFDSFYGFYSGAVN 207

Query: 142 YNDSIHETDF---AVGLDAR 158
           Y    HE+ +      LD R
Sbjct: 208 Y--LTHESKYDAKGAALDYR 225


>gi|223985528|ref|ZP_03635584.1| hypothetical protein HOLDEFILI_02890 [Holdemania filiformis DSM
           12042]
 gi|223962505|gb|EEF66961.1| hypothetical protein HOLDEFILI_02890 [Holdemania filiformis DSM
           12042]
          Length = 470

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 28/205 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGID--- 78
           GW D+   G +   TP+ID L   G+  ++ Y   P C+PSRA+ L+GKYP R  +    
Sbjct: 16  GWMDLSCQGSSFYETPHIDQLRREGMAFDQAYAACPVCSPSRASILSGKYPARLKVTDWI 75

Query: 79  ----------TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
                       + A   K + V+E  + +  +E GY T  +GKWH+G  KE   P + G
Sbjct: 76  DHENYHPCRGKLIDAPYIKELSVSEFSMAKAFQEAGYQTWHVGKWHLG--KEATYPEHHG 133

Query: 129 FD-NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
           FD N  G W G+              G  +  +ME  +     +YLTD    ++  +I+S
Sbjct: 134 FDVNLGGSWWGHPK-----------KGYFSPYHMENLSDGPEGEYLTDRIGAEAAALIRS 182

Query: 188 HNHSRPLFLQITHAAVHTGTAGNAK 212
            +  RP FL + H AVHT     A+
Sbjct: 183 RDPQRPFFLNLWHYAVHTPLQAKAE 207


>gi|224537481|ref|ZP_03678020.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520919|gb|EEF90024.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 525

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/184 (33%), Positives = 96/184 (52%), Gaps = 14/184 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G++DV  +GE    TPNIDALA  GI     Y   P  +PSRA FLTG+YP R GI    
Sbjct: 87  GYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAGFLTGRYPARMGIQGVF 146

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +   E  + + LK  GY+T  IGKWH+G ++E+ LP  +GFD + G     + 
Sbjct: 147 YPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SREKYLPLQQGFDEYFG-----IP 200

Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
           Y++ +    +  G      +E +   +++  +T  +T+++V  I+     +P FL + H+
Sbjct: 201 YSNDMSAQVYLRG----NEVEEFHIDINN--VTKKYTEEAVDYIR-RKADQPFFLFLAHS 253

Query: 202 AVHT 205
            +H 
Sbjct: 254 MMHV 257


>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
           musculus]
          Length = 462

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 98/202 (48%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 61  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 120

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 121 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 179

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +    +     LT  +T +++  I++ 
Sbjct: 180 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 239

Query: 188 HNHSRPLFL----QITHAAVHT 205
           H    P FL      THA V+ 
Sbjct: 240 HARQSPFFLYWAIDATHAPVYA 261


>gi|409196554|ref|ZP_11225217.1| sulfatase [Marinilabilia salmonicolor JCM 21150]
          Length = 542

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 97/206 (47%), Gaps = 39/206 (18%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G  +I TPNIDALA  G+   + +    C PSRA+ LTG YP + GID    
Sbjct: 41  GYSDLGCYG-GEIQTPNIDALATGGVRFTQMHNTARCCPSRASLLTGHYPHKAGID---- 95

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK------EEL-------------- 122
            G+   + +    + + LKE GY T + GKWH+   K      E+L              
Sbjct: 96  -GMGVNLSMNTATIAEVLKENGYHTGMTGKWHLSETKPVNDPDEQLRWMAHQVNYGPFSP 154

Query: 123 ---LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
               P NRGFD H G   G + + D               N E         Y+TDF T+
Sbjct: 155 LENYPCNRGFDEHWGVIWGVVNFFDP---------FSLVHNEEPIKEVPDDFYMTDFVTE 205

Query: 180 QSVHVIKSHNH-SRPLFLQITHAAVH 204
           +SV++I +++   +P FL + H A H
Sbjct: 206 KSVNLIDTYSKDDKPFFLYVAHTAPH 231


>gi|288870334|ref|ZP_06113738.2| sulfatase family protein [Clostridium hathewayi DSM 13479]
 gi|288867589|gb|EFC99887.1| sulfatase family protein [Clostridium hathewayi DSM 13479]
          Length = 471

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 95/203 (46%), Gaps = 34/203 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
           GW D+   G     TPNID L   G+V  N + + P C+PSRA+ LTGKYP R G+    
Sbjct: 17  GWRDLACTGSTFYETPNIDRLCRQGMVFANSYASCPVCSPSRASCLTGKYPARLGVTDWI 76

Query: 78  ----------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
                        + A   K +P  E  + Q LK+ GY T  +GKWH+G    E  P + 
Sbjct: 77  DMEGTSHPLKGKLIDAPYIKHLPEGEYTIAQALKDAGYDTWHVGKWHLG--GREFYPEHF 134

Query: 128 GFDNHVG--YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
           GFD ++G   W          H  D   G  +   +E  +     +YLTD  TD++V ++
Sbjct: 135 GFDVNIGGCSWG---------HPHD---GYFSPYGIETLSEGPEGEYLTDRITDEAVRLL 182

Query: 186 KSHNHS---RPLFLQITHAAVHT 205
           +        +P ++ + H AVHT
Sbjct: 183 RKRQACGSRKPFYMNLCHYAVHT 205


>gi|146302379|ref|YP_001196970.1| sulfatase [Flavobacterium johnsoniae UW101]
 gi|146156797|gb|ABQ07651.1| sulfatase [Flavobacterium johnsoniae UW101]
          Length = 551

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/191 (34%), Positives = 94/191 (49%), Gaps = 12/191 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
           G++D+G +G ++I TPN+D LA  G+ L   Y    C P+RA+ LTG+Y  + G+   D 
Sbjct: 42  GYSDLGNYG-SEIKTPNLDKLASEGLRLREFYNNSICAPTRASLLTGQYQHKAGVGFFDV 100

Query: 80  PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            +G    +     E L L +  +  GYST L GKWH+G   +   P  RGFD   G   G
Sbjct: 101 NLGLPAYQGYLNKESLTLGEVFRSGGYSTLLSGKWHVGSEDQAQWPNQRGFDKFYGILKG 160

Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
              Y D+      +T + V L   RN E   P+  S Y TD   + +V  +   N  ++P
Sbjct: 161 ASNYFDTKPLPFGKTPYPVKL--IRNNEELHPKDDSYYFTDEIGNNAVTFLDEQNKENKP 218

Query: 194 LFLQITHAAVH 204
            FL +   A H
Sbjct: 219 FFLYLAFTAPH 229


>gi|421612348|ref|ZP_16053456.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408496803|gb|EKK01354.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 482

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 27/193 (13%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QG+ DVG  G  DI TP +DA+A  G+     Y  P C PSRAA +TG YP R       
Sbjct: 25  QGYQDVGCFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAALMTGCYPMRV-----A 79

Query: 82  GAGVAKAV-PV---TEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNH 132
             G  K + P+    E  + + LK  GY++   GKW +  + +     +LLP  +GFD  
Sbjct: 80  ERGHTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFPDLLPTGQGFD-- 137

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
             Y+ G  T ND +         +  RN E   P+     LT  +TD+++  I+  N ++
Sbjct: 138 --YFYGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQ 186

Query: 193 PLFLQITHAAVHT 205
           P F+ I H   HT
Sbjct: 187 PFFVYIPHTMPHT 199


>gi|171910116|ref|ZP_02925586.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 480

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 87/186 (46%), Gaps = 7/186 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G  G     TP IDALA +G+  +  Y+  P C+P+RAA +TGK P R GI   +
Sbjct: 37  GSQDLGVEGSKFYETPAIDALAASGVRFSSFYSAHPVCSPTRAALMTGKVPQRVGITDYI 96

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY-- 139
                 A+P  E  + +     GY T  +GKWH+G   +   P   GF        G   
Sbjct: 97  KPKSGVALPTAETTIGEAFAAQGYQTGYVGKWHLG-EADADQPAQHGFQWTAAVNRGGQP 155

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            +Y     + D   G D   ++    P     YLTD  T +S+  +K  + ++P FL  +
Sbjct: 156 ASYYYPYRKKD---GKDTLWDVPDLEPGTEGDYLTDALTGKSLEFLKQRDTTKPFFLCFS 212

Query: 200 HAAVHT 205
           H AVHT
Sbjct: 213 HYAVHT 218


>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
           musculus]
 gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
          Length = 440

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 39  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 99  AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 157

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +    +     LT  +T +++  I++ 
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 217

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 218 HARQSPFFLYWAIDATHAPVY 238


>gi|355757044|gb|EHH60652.1| hypothetical protein EGM_12064 [Macaca fascicularis]
          Length = 482

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 95/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G +    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 62  AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKVVGKWHLG-HRPQFHPLKHGFDEWF 120

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 181 ARHHPFFLYWAVDATHAPVYA 201


>gi|403069089|ref|ZP_10910421.1| arylsulfatase [Oceanobacillus sp. Ndiop]
          Length = 513

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/190 (35%), Positives = 96/190 (50%), Gaps = 22/190 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++D+  +G  +I TPN+D LA NG+   + Y    C PSRA+ LTG YP + GI     
Sbjct: 16  GFSDLSSYG-GEISTPNLDQLANNGLRFTQFYNSARCCPSRASLLTGLYPHQAGIGEMTE 74

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             +TP   G  K   VT   L + LKE GY T+L GKWH+G    E +P  RGFD+  G 
Sbjct: 75  DRETPGYRGYLKNQCVT---LAEVLKEGGYHTYLSGKWHVG----ERMPTERGFDDFYGL 127

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSHNHSRPL 194
             G+ ++ D  +      G   R   +         Y TD  TD ++  I +S +  +P 
Sbjct: 128 LGGFASFWDKENYVRLPEGRPERSYSD------GEFYATDAITDHALDFIEESRSDEQPY 181

Query: 195 FLQITHAAVH 204
           FL +++ A H
Sbjct: 182 FLYLSYNAPH 191


>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
          Length = 537

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 56  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 115

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 116 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 174

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +    +     LT  +T +++  I++ 
Sbjct: 175 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 234

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 235 HARQSPFFLYWAIDATHAPVY 255


>gi|344338189|ref|ZP_08769122.1| sulfatase [Thiocapsa marina 5811]
 gi|343802243|gb|EGV20184.1| sulfatase [Thiocapsa marina 5811]
          Length = 531

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/119 (45%), Positives = 70/119 (58%), Gaps = 6/119 (5%)

Query: 23  GWNDVGFHGENDIP---TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW D G +G        TPNID LA  G+ L   Y+ PTCTP+R+A LTG+ P R G+  
Sbjct: 79  GWGDPGVYGGGAAIGAATPNIDRLAGEGLTLTSTYSQPTCTPTRSAILTGRLPVRTGLTR 138

Query: 80  PVGAG--VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
           P+ AG  +AK     E  LP+ L E GY+T L GKWH+G   E + P + GFD + GY+
Sbjct: 139 PILAGDTLAKNPWADEISLPKLLGEAGYTTVLTGKWHVG-EAEGMRPQDIGFDEYYGYY 196


>gi|305665652|ref|YP_003861939.1| arylsulfatase [Maribacter sp. HTCC2170]
 gi|88710408|gb|EAR02640.1| arylsulfatase [Maribacter sp. HTCC2170]
          Length = 589

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 99/211 (46%), Gaps = 34/211 (16%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTP 80
           QG+ D+G+ G   + TPNID+ A   I +N  Y  P C P+RA+ +TG+Y  R GI DT 
Sbjct: 42  QGYGDLGYTGNPHVKTPNIDSFASESIRMNNFYVSPVCAPTRASLMTGRYSLRTGIRDTY 101

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G  +  +  VT   + + LK+  Y T + GKWH+G N     P ++GFD  + + +G +
Sbjct: 102 NGGAIMASNEVT---IAEMLKQANYKTGVFGKWHLGDNYPS-RPNDQGFDESLIHLSGGM 157

Query: 141 TYNDSIHETDFAVGLDARR---------NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
                    DF       R         N ER   +    Y +D F + ++  I+  NH 
Sbjct: 158 G-----QVGDFTTYFQKERSYFDPVLWHNGER---ESYEGYCSDIFAENAIDFIEK-NHD 208

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
           +P F  ++  A HT            LQVPD
Sbjct: 209 QPFFCYLSFNAPHTP-----------LQVPD 228


>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
 gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
          Length = 470

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/186 (33%), Positives = 96/186 (51%), Gaps = 17/186 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRY--GIDT 79
           G+ D+  +G   I TP ID +  +G+ L + Y+  P C+ +R A +TG+Y +R   G++ 
Sbjct: 47  GYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNSPVCSATRTAIMTGQYQYRLALGLEE 106

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P+ AG    +P ++  LP  LK+ GY T LIGKWH+G    +  P   G+D+  G+    
Sbjct: 107 PL-AGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHLGA-YPKYGPLKSGYDHFYGFRGSA 164

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSHNHSRPLFLQI 198
           L+Y +  H  DF          E  AP   + Y TD   D++V +I KS +  RP F  +
Sbjct: 165 LSYYN--HGKDF---------WEDDAPVEKAGYFTDLLGDKTVELIQKSDSCERPFFASV 213

Query: 199 THAAVH 204
              A H
Sbjct: 214 HFNAPH 219


>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
           musculus]
 gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
 gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
           musculus]
          Length = 520

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 39  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 99  AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 157

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +    +     LT  +T +++  I++ 
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 217

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 218 HARQSPFFLYWAIDATHAPVY 238


>gi|221043426|dbj|BAH13390.1| unnamed protein product [Homo sapiens]
          Length = 614

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 71/124 (57%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID LA  G+ L +H +  + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74  GIGDIGCYGNNTMRTPNIDRLAEAGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       G +  +P  E    + LK  GY+T LIGKWH+G N E        P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKGKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query: 131 NHVG 134
           +  G
Sbjct: 194 HFYG 197


>gi|32471071|ref|NP_864064.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32396773|emb|CAD71738.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 490

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 27/193 (13%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QG+ DVG  G  DI TP +DA+A  G+     Y  P C PSRAA +TG YP R       
Sbjct: 33  QGYEDVGCFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAALMTGCYPMRV-----A 87

Query: 82  GAGVAKAV-PV---TEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNH 132
             G  K + P+    E  + + LK  GY++   GKW +  + +     +LLP  +GFD  
Sbjct: 88  ERGHTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSDLLPTGQGFD-- 145

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
             Y+ G  T ND +         +  RN E   P+     LT  +TD+++  I+  N ++
Sbjct: 146 --YFYGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQ 194

Query: 193 PLFLQITHAAVHT 205
           P F+ I H   HT
Sbjct: 195 PFFVYIPHTMPHT 207


>gi|431796258|ref|YP_007223162.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430787023|gb|AGA77152.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 603

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/244 (29%), Positives = 113/244 (46%), Gaps = 31/244 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTP 80
           QG+ D GF G   + TP +D LA   +  ++ Y  P C P+RA+ +TG+Y  R GI DT 
Sbjct: 50  QGYGDFGFTGNPHVQTPVLDGLAEESMFFDQFYVSPVCAPTRASLMTGRYSLRTGIRDTY 109

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
            G  +     VT   + + LK+ GY T + GKWH+G N     P ++GFD  V + +G +
Sbjct: 110 NGGAIMATEEVT---IAEMLKDAGYRTGIFGKWHLGDNYPS-RPMDQGFDESVIHLSGGM 165

Query: 141 TYNDSIHETDFAVGLDARRN--MERYAPQMSSK-YLTDFFTDQSVHVIKSHN---HSRPL 194
                I  T +  G  +  +  +     Q S K Y TD FT +++  +  H+     +P 
Sbjct: 166 GQVGDI--TTYYQGDSSYFDPVLWHNGQQESYKGYCTDIFTQEAIAFVSDHDGGEKRQPF 223

Query: 195 FLQITHAAVHT----------------GTAG--NAKLPTGLLQVPDMEENDRTFAHISNP 236
           F+ ++  A HT                 T+G   A +P+  +   D E   R +A + N 
Sbjct: 224 FVYLSLNAPHTPLQVPDEYYQKYKDIDPTSGLDEAMMPSQEMTESDKEHARRVYAMVENI 283

Query: 237 DRRL 240
           D  L
Sbjct: 284 DDNL 287


>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 466

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 70/231 (30%), Positives = 108/231 (46%), Gaps = 32/231 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D G +G  DIPTP+IDALA  GI   N + T   C PSRA  L G+Y  R G +  V
Sbjct: 38  GYEDFGCYGSQDIPTPHIDALAKGGIRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNV 97

Query: 82  GAGVAKAVPV-------TEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
               A    +       T + +   ++  GY T  IGKWH G N+ +  P ++GF++  G
Sbjct: 98  SDLPADGYQIQDIGLSDTVRTIADQMQSNGYETMAIGKWHQG-NETKHHPLHKGFNHFFG 156

Query: 135 YWNGYLTY---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
           +  G+ ++     +I + +  +      N      +    YLTD FTD+++  ++     
Sbjct: 157 FIGGHRSFFPIRTAIKQEEKIL------NDYTEVDEKDVYYLTDMFTDKAISYMR-QKRD 209

Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           +P F+ +++ AVHT        P  L Q          FA + N  RR +A
Sbjct: 210 KPYFIYLSYNAVHTPVEAT---PQKLAQ----------FARLKNAHRRSYA 247


>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
 gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
          Length = 476

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 96/201 (47%), Gaps = 29/201 (14%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ D+   G   + TP ID +A  G  L   Y   P CTPSRAA +TG YP R  ID  
Sbjct: 42  QGYADLSCFGGTHVSTPRIDQMAAEGAKLTSFYVAAPVCTPSRAALMTGTYPKR--IDMA 99

Query: 81  VG-------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
            G       AG  K +   E  + + LK +GY T + GKWH+G ++ E LP  +GFD   
Sbjct: 100 RGSNFVVLLAGDKKGLNPKEITIAEVLKAVGYKTGMFGKWHLG-DQPEFLPTRQGFDEFF 158

Query: 134 GYWNGYLTYNDSIH-----ETDFAVG----LDARRNMERYAPQMSSKYLTDFFTDQSVHV 184
           G     L Y+  IH     ++ F       LD    +E       + YLT  FT+++V  
Sbjct: 159 G-----LPYSHDIHPYHPQQSHFKFPSLPLLDGEEVIEM---DPDADYLTKRFTERAVQF 210

Query: 185 IKSHNHSRPLFLQITHAAVHT 205
           I+  N  +P FL + H   HT
Sbjct: 211 IEK-NKDQPFFLYMPHPIPHT 230


>gi|408674712|ref|YP_006874460.1| sulfatase [Emticicia oligotrophica DSM 17448]
 gi|387856336|gb|AFK04433.1| sulfatase [Emticicia oligotrophica DSM 17448]
          Length = 518

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 70/225 (31%), Positives = 107/225 (47%), Gaps = 30/225 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G ++I TPN+D LA NG+ L   Y    C P+RA+ LTGKY    G+   V 
Sbjct: 43  GFSDIGCYG-SEISTPNLDKLAANGLKLRNFYNAGRCCPTRASLLTGKYSHAVGMGNMVS 101

Query: 83  AGVAK------------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
               K            +VP     + + LK++GY T++ GKWH+G    +  P  RGF+
Sbjct: 102 FEDQKVPKDNYQGYLEPSVPT----IAEDLKKVGYHTYMTGKWHVG-ESPDYWPLKRGFE 156

Query: 131 NHVGYWNGYLTYNDSIHE--TDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
            + G  +G  +Y + + E    F V  D     + Y       Y TD FTD+++  ++S 
Sbjct: 157 RYFGLISGASSYFEVLQEKRKRFVVQDD-----KEYVLPKDGYYATDAFTDKAIEFLESS 211

Query: 189 N-HSRPLFLQITHAA----VHTGTAGNAKLPTGLLQVPDMEENDR 228
           +  + P FL + + A    +H      AK     LQ  D    DR
Sbjct: 212 DKQNNPFFLYLAYTAPHFPLHAYEEDIAKYENFYLQGWDKTRTDR 256


>gi|340368306|ref|XP_003382693.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
          Length = 230

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 92/189 (48%), Gaps = 22/189 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF---RYGIDT 79
           G+ DV F     I +PN   LA  G++L+RHY    C+P+R +FLTG++P    +Y I  
Sbjct: 36  GFADVSFRNPA-IKSPNFQKLAETGLILDRHYVYRYCSPTRVSFLTGRWPHHAHQYNIKP 94

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG- 138
               G      +   +LP  LK +GY TH++GKWH G  + + LP NRGFD   G+ +G 
Sbjct: 95  NFQIGTN----INMTMLPAKLKTVGYKTHMVGKWHQGFFQPKFLPINRGFDTSSGFLSGA 150

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
              +  Y D          +D  RN + Y  + +  Y    + D    +  +H   +P+F
Sbjct: 151 EDHFTQYRD--------CAIDYWRN-DTYDTR-NGTYDAYTYKDDLTKIFDAHETQKPMF 200

Query: 196 LQITHAAVH 204
           L +    VH
Sbjct: 201 LYLPLHNVH 209


>gi|119619126|gb|EAW98720.1| arylsulfatase E (chondrodysplasia punctata 1), isoform CRA_d [Homo
           sapiens]
          Length = 599

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 73/125 (58%), Gaps = 13/125 (10%)

Query: 23  GWNDVGFHGENDI-PTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
           G  D+G +G N +  TPNID LA +G+ L +H +  + CTPSRAAFLTG+YP R G+ + 
Sbjct: 58  GIGDIGCYGNNTMRQTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSS 117

Query: 81  VG------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGF 129
           +G       G +  +P  E    + LKE GY+T LIGKWH+G N E        P + GF
Sbjct: 118 IGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGF 177

Query: 130 DNHVG 134
           D+  G
Sbjct: 178 DHFYG 182


>gi|281349830|gb|EFB25414.1| hypothetical protein PANDA_009654 [Ailuropoda melanoleuca]
          Length = 581

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/125 (44%), Positives = 72/125 (57%), Gaps = 13/125 (10%)

Query: 23  GWNDVGFHGENDI-PTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
           G  D+G +G N I  TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ + 
Sbjct: 41  GIGDIGCYGNNSIRQTPNIDRLAEDGVMLTQHVAAASVCTPSRAAFLTGRYPLRSGMVSS 100

Query: 81  VG------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGF 129
            G       GV   +P  E    + LK+ GY+T LIGKWH+G N +        P N GF
Sbjct: 101 NGYRVLQWTGVPGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCDSSSDHCHHPLNHGF 160

Query: 130 DNHVG 134
           D+  G
Sbjct: 161 DHFYG 165


>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 466

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/209 (33%), Positives = 100/209 (47%), Gaps = 21/209 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGI---- 77
           G+ D+G +G   + TP +D LA  G+ L   YT  PTCT SRA  LTG+YP R G+    
Sbjct: 46  GYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNHQL 105

Query: 78  --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
             D   G G+ K    +E L+P+YLK+ GY T   GKW++G +     P  RGFD   G+
Sbjct: 106 SADENYGDGLRK----SEVLIPEYLKQQGYRTACFGKWNVGFSPGS-RPTERGFDEFFGF 160

Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
             G + Y    +   +A   D  R ++         Y TD F D +   I S    +P F
Sbjct: 161 AAGNIDY----YHHYYAGRHDLWRGLKEV---FVEGYSTDLFADAACQYI-SAESDQPFF 212

Query: 196 LQITHAAVHTGTAGNAKLPTG-LLQVPDM 223
           + +   A H  +  N +   G   Q PD+
Sbjct: 213 IYLPFNAPHFPSQRNKQPGQGNEWQAPDL 241


>gi|417301290|ref|ZP_12088451.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327542405|gb|EGF28888.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 482

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 91/189 (48%), Gaps = 19/189 (10%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QG+ DVG  G  DI TP +DA+A  G+     Y  P C PSRAA +TG YP R      +
Sbjct: 25  QGYQDVGCFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAALMTGCYPMRVAERGHI 84

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNHVGYW 136
              +   +   E  + + LK  GY++   GKW +  + +     +LLP  +GFD    Y+
Sbjct: 85  KQ-IHPILHEDEVTIAEVLKTNGYASACFGKWDLAKHAQSGFFPDLLPTGQGFD----YF 139

Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
            G  T ND +         +  RN E   P+     LT  +TD+++  I+  N ++P F+
Sbjct: 140 YGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQPFFV 190

Query: 197 QITHAAVHT 205
            I H   HT
Sbjct: 191 YIPHTMPHT 199


>gi|421612351|ref|ZP_16053459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408496806|gb|EKK01357.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 474

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW DVGF+G   + TPN+DA+A  G+  +R Y   P C+P+R + LTG+YPFR+GI   
Sbjct: 43  QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
              G+     V E  + + L++ GY+T + GKWHIG  K + +        P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158

Query: 133 VGYWNGYLTYNDSIHETDF 151
               +   T++ +I   D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177


>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
           familiaris]
 gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
           familiaris]
          Length = 522

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 41  GWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 100

Query: 81  -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P  E +LP+ LKE GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 101 RHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 159

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQVYLQEALDFIKRQ 219

Query: 189 NHS-RPLFL----QITHAAVH 204
             + RP FL      THA V+
Sbjct: 220 QAAQRPFFLYWAIDATHAPVY 240


>gi|32471068|ref|NP_864061.1| N-acetylgalactosamine-6-sulfatase [Rhodopirellula baltica SH 1]
 gi|32396770|emb|CAD71735.1| N-acetylgalactosamine-6-sulfatase [Rhodopirellula baltica SH 1]
          Length = 474

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW DVGF+G   + TPN+DA+A  G+  +R Y   P C+P+R + LTG+YPFR+GI   
Sbjct: 43  QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
              G+     V E  + + L++ GY+T + GKWHIG  K + +        P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158

Query: 133 VGYWNGYLTYNDSIHETDF 151
               +   T++ +I   D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177


>gi|417301293|ref|ZP_12088454.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327542408|gb|EGF28891.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 474

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW DVGF+G   + TPN+DA+A  G+  +R Y   P C+P+R + LTG+YPFR+GI   
Sbjct: 43  QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
              G+     V E  + + L++ GY+T + GKWHIG  K + +        P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158

Query: 133 VGYWNGYLTYNDSIHETDF 151
               +   T++ +I   D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177


>gi|313226814|emb|CBY21959.1| unnamed protein product [Oikopleura dioica]
          Length = 582

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/125 (44%), Positives = 73/125 (58%), Gaps = 13/125 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---D 78
           G  DVGF+G + I T NID LA +G++L++H      CTPSRAAFLTG+ P RYGI    
Sbjct: 32  GSGDVGFNGNSTIGTTNIDQLAEDGVILDQHLAPASVCTPSRAAFLTGRLPIRYGIAANG 91

Query: 79  TPVGAGVAKA----VPVTEKLLPQYLKELGYSTHLIGKWHIGCN-----KEELLPFNRGF 129
           T V   +  A    +P +E    + L++ GY T L+GKWH+G N      +   PFN GF
Sbjct: 92  TRVRVNIWNATPNGLPRSELTFAKVLQKEGYKTALVGKWHLGMNHNNNHDQNYHPFNHGF 151

Query: 130 DNHVG 134
           D+  G
Sbjct: 152 DSWFG 156


>gi|440717773|ref|ZP_20898250.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436437075|gb|ELP30749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 474

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QGW DVGF+G   + TPN+DA+A  G+  +R Y   P C+P+R + LTG+YPFR+GI   
Sbjct: 43  QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
              G+     V E  + + L++ GY+T + GKWHIG  K + +        P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158

Query: 133 VGYWNGYLTYNDSIHETDF 151
               +   T++ +I   D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177


>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
          Length = 526

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 95/205 (46%), Gaps = 20/205 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G +    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 45  GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 104

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LKE GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 105 AHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKIVGKWHLG-HRPQFHPLKHGFDEWF 163

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 164 GSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQ 223

Query: 189 N-HSRPLFL----QITHAAVHTGTA 208
                P FL      THA V+   A
Sbjct: 224 QAQQHPFFLYWAIDATHAPVYASKA 248


>gi|45383412|ref|NP_989703.1| arylsulfatase H precursor [Gallus gallus]
 gi|33330173|gb|AAQ10453.1| arylsulfatase [Gallus gallus]
          Length = 590

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 56/124 (45%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID--- 78
           G  DVG +G + I TPNID LA  G+ L +H T  P CTPSRAAFLTG+YP R G+D   
Sbjct: 46  GIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAAFLTGRYPIRSGMDAVN 105

Query: 79  ---TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
                   G +  +P  E    + L++ GYST LIGKWH+G N E        P N GF+
Sbjct: 106 NYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRNDHCHHPLNHGFE 165

Query: 131 NHVG 134
              G
Sbjct: 166 YFYG 169


>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 463

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 18/191 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYG----I 77
           G+ D+  +G  D+ +PNID L   G+     Y   P C+P+RAA L+GKYP R G    I
Sbjct: 45  GYGDLSCYGATDLQSPNIDKLVSRGLKFTNFYANCPVCSPTRAAILSGKYPDRVGVPGVI 104

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            T          P  E LLP  L+  GY + +IGKWH+G       P +RGFD+  GY  
Sbjct: 105 RTHADNSWGYLAPEAE-LLPSLLQPAGYHSAIIGKWHLGLEAPN-RPNDRGFDHFKGYLG 162

Query: 138 GYLT--YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPL 194
             +   Y+   H  ++      R N +   P+    + TD FT+ S   +K   ++ +P 
Sbjct: 163 DMMDDYYDHRRHGINY-----MRENEQEIDPE---GHATDLFTEWSCDYLKEQADNEQPF 214

Query: 195 FLQITHAAVHT 205
           FL + + A HT
Sbjct: 215 FLYLAYNAPHT 225


>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
 gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 489

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 100/198 (50%), Gaps = 25/198 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG-IDTP 80
           G+ D+GF G +   TPN+D LA   +  +R Y+  PTC PSR + +TGKYP R G +   
Sbjct: 50  GFADLGFTGSDTHLTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGKYPARLGAVSHG 109

Query: 81  VGAGVA------KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              GVA        +P+TE  + + LK+ GY+T  IGKWHIG  K E  P  RGFD  + 
Sbjct: 110 KLGGVAHPGPNDNGLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENNPGTRGFDVDIA 167

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYA---PQMSSK----YLTDFFTDQSVHVIKS 187
                   N+      +    ++    +R A   P +  +    +LTD   +++V  I S
Sbjct: 168 -------SNEFCCPGSYMYPFESNNEKQRVASKIPDLEDRKPGDFLTDALAEEAVKFIHS 220

Query: 188 HNHSRPLFLQITHAAVHT 205
            +  +P FL ++  AVHT
Sbjct: 221 TD-EKPFFLNMSFYAVHT 237


>gi|87306992|ref|ZP_01089138.1| arylsulfatase [Blastopirellula marina DSM 3645]
 gi|87290365|gb|EAQ82253.1| arylsulfatase [Blastopirellula marina DSM 3645]
          Length = 710

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 94/191 (49%), Gaps = 17/191 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++D+G +G  +IPTPNIDALA  G    + Y    C PSRA+ +TG YP + GI     
Sbjct: 40  GYSDLGCYG-GEIPTPNIDALAKRGARFTQVYNSARCCPSRASLMTGLYPTQAGIGDFTT 98

Query: 78  DTPV---GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           D P    G G    +      L + LK  GY  + +GKWH+     E  P  RGFD   G
Sbjct: 99  DRPSPDRGPGYLGRLNEQCVTLAEVLKPAGYGCYYVGKWHM---HPETGPIRRGFDEFYG 155

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RP 193
           Y      ++   ++ ++ + L A R  E   PQ    Y TD F D ++  I+    S +P
Sbjct: 156 Y---ARDHSHDQYDAEYYIRLPAGREKEIDPPQ-QDYYATDVFNDYALEFIRQGQQSDKP 211

Query: 194 LFLQITHAAVH 204
            FL + H++ H
Sbjct: 212 WFLFLGHSSPH 222


>gi|313242390|emb|CBY34540.1| unnamed protein product [Oikopleura dioica]
          Length = 582

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/125 (44%), Positives = 73/125 (58%), Gaps = 13/125 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---D 78
           G  DVGF+G + I T NID LA +G++L++H      CTPSRAAFLTG+ P RYGI    
Sbjct: 32  GSGDVGFNGNSTIGTTNIDQLAEDGVILDQHLAPASVCTPSRAAFLTGRLPIRYGIAANG 91

Query: 79  TPVGAGVAKA----VPVTEKLLPQYLKELGYSTHLIGKWHIGCN-----KEELLPFNRGF 129
           T V   +  A    +P +E    + L++ GY T L+GKWH+G N      +   PFN GF
Sbjct: 92  TRVRVNIWNATPNGLPRSELTFAKVLQKEGYKTALVGKWHLGMNHNNNHDQNYHPFNHGF 151

Query: 130 DNHVG 134
           D+  G
Sbjct: 152 DSWFG 156


>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 585

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 67/118 (56%), Gaps = 3/118 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+  +G  DI TPNID+LA++G +    Y  P C+P+RA  LTG+Y FR G+ +  
Sbjct: 32  QGWGDLSINGNKDISTPNIDSLAHDGALFENFYVQPVCSPTRAELLTGRYAFRSGVRSTS 91

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
             G  +   + E+ +    K+ GY+T   GKWH G  +    P  RGFD   G+ +G+
Sbjct: 92  EGG--ERFNLDEQTIADVFKKAGYATGAFGKWHSGM-QYPYHPNGRGFDEFYGFCSGH 146


>gi|304309759|ref|YP_003809357.1| sulfatase [gamma proteobacterium HdN1]
 gi|301795492|emb|CBL43690.1| probable sulfatase precursor [gamma proteobacterium HdN1]
          Length = 661

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 96/193 (49%), Gaps = 21/193 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G ND+   G+    TPN+D  +   + L R Y   TC+PSRA+ LTG+YP R G   P+ 
Sbjct: 76  GVNDIASWGDGSAQTPNLDKFSSESVRLRRDYGDSTCSPSRASLLTGQYPARVGF-LPIA 134

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE--ELLPFNRGFDNHVGYWNGYL 140
            G++  +P     LP  LK LGYST  +GKWH+G   E  E+ P   GFD    YW G+L
Sbjct: 135 LGLSPDLPT----LPGSLKSLGYSTFHVGKWHLGEALEYPEIQPSYHGFD----YWMGFL 186

Query: 141 TYNDSIHETDFAVGLDARRN-------MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSR 192
            +   +   D    L +R          E  AP +    YL D   D++V +I+ +   +
Sbjct: 187 NHF-VLQGPDETGKLVSRVPTHINPWLQENGAPPVQRMGYLDDLLVDKAVELIE-NTGEK 244

Query: 193 PLFLQITHAAVHT 205
           P F+ +   + HT
Sbjct: 245 PWFINLWLYSPHT 257


>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
 gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
          Length = 553

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 100/227 (44%), Gaps = 46/227 (20%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGID-TP 80
           G+ND+   G   +PTPNID++A  G      Y+    C PSRA  +TG+Y  R G + TP
Sbjct: 82  GFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTP 141

Query: 81  VGAGV------------------------AKAVPVTEKLLP-------QYLKELGYSTHL 109
              G+                        AKA P  E+ LP       + LK  GY    
Sbjct: 142 TPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSEITLAEALKPKGYHNIH 201

Query: 110 IGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS 169
           IGKWH+G N  E LP  +GFD  V   +G     DS    +  +  D          Q +
Sbjct: 202 IGKWHLG-NAPEFLPNAQGFDESVMLESGLFLPEDSPDVVNAKLPFDPIDQFLWARMQYA 260

Query: 170 SK-----------YLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
           +            YLTDF+TD+++  I++ N +RP FL + H  VHT
Sbjct: 261 TSYNGSAWFEPKGYLTDFYTDEAIKAIEA-NRNRPFFLYLAHWGVHT 306


>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
 gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
          Length = 413

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 93/189 (49%), Gaps = 14/189 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+  +G  +  TP++D LA NGI   + H +   C+P+RA  LTG+Y  R GID  V
Sbjct: 6   GYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAGIDGVV 65

Query: 82  GAGVAK----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
            A   K     +   E  L Q L++ GY T + GKWH+G  + +  P  RGF   VGY +
Sbjct: 66  YANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQR-QYNPTFRGFQQFVGYVS 124

Query: 138 GYLTYNDSIHETD-FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           G + Y   +  T  F    +A  N E         Y+T    D ++  I+     +P F+
Sbjct: 125 GNVDYFAHLDGTGVFDWWHNAELNREEQG------YVTHLINDHALEFIRQQ-QEKPFFV 177

Query: 197 QITHAAVHT 205
            I H AVH+
Sbjct: 178 YIAHEAVHS 186


>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 616

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 93/205 (45%), Gaps = 33/205 (16%)

Query: 23  GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G ND+G+   +    TP +D+L+  G+ L ++YT   CTPSRA+ +TG+  FR G+   V
Sbjct: 131 GTNDIGYQSTDLWELTPFMDSLSSEGVRLTKYYTNQLCTPSRASLMTGRDTFRTGMQYEV 190

Query: 82  GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
                A  +P+ E  L +  K         GKWH+G   +   PF RGFD  +GY     
Sbjct: 191 VEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFARGFDTFLGYMGAVR 242

Query: 141 TYNDSIHE---------------TDFAVG-----LDARRNMERYAPQMSSKYLTDFFTDQ 180
            Y  S HE                DF  G     ++   N  R  P     Y T   TD+
Sbjct: 243 GY--SSHEGCNTPTFEGGEYSCFKDFGYGDKDGYINHITNTTRQGPSFVGNYSTTIITDR 300

Query: 181 SVHVIKSHNHSRPLFLQITHAAVHT 205
           ++ V K H    P FL ++H AVH+
Sbjct: 301 AIEVAKEHGED-PFFLYVSHQAVHS 324


>gi|321446094|gb|EFX60811.1| hypothetical protein DAPPUDRAFT_17868 [Daphnia pulex]
          Length = 125

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 52/115 (45%), Positives = 68/115 (59%), Gaps = 4/115 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNR-HYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+  +G   I TP++DALA  G+     H     CTPSRA  LTG+YP R G+D P+
Sbjct: 12  GWGDLACYGGTAIKTPHLDALAGRGVRFTESHACDSVCTPSRAGLLTGRYPKRMGLDFPL 71

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELLPFNRGFDNHVG 134
            AG A  +   E  LPQ LK  GY T ++GKWH+G     + L P N GFD+++G
Sbjct: 72  NAG-ATGLNAFETTLPQALKLRGYHTAMVGKWHLGDYTKDKGLNPTNFGFDSYLG 125


>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
 gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
          Length = 462

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 68/214 (31%), Positives = 101/214 (47%), Gaps = 27/214 (12%)

Query: 11  KAVPVTEKLLPQ------------GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LP 57
           +A+ VT K  P+            G+ D    G   I TP ID++   G++L + Y+  P
Sbjct: 22  QALAVTRKAAPERPNIVFIMADDLGYADTSATGSRHIRTPAIDSIGAGGVMLRQGYSSTP 81

Query: 58  TCTPSRAAFLTGKYPFRY--GIDTPVG--AGVAKAVPVTEKLLPQYLKELGYSTHLIGKW 113
            C+P+R A LTG Y  R+  G++ P+G  A     VP+    +   +K LGY T L+GKW
Sbjct: 82  ICSPTRTALLTGCYAQRFAIGVEEPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKW 141

Query: 114 HIGCNKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSS 170
           H+G       P   G+D+ +G   G   Y  +   +      VGL      E  A    +
Sbjct: 142 HLG-EPPAHGPLKHGYDHFLGIVEGGADYFVHRMVMSGKPAGVGL-----AEDDAQTDRT 195

Query: 171 KYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
            YLTD F D++V VI+    ++P FL +   A H
Sbjct: 196 GYLTDIFGDEAVRVIE-EGGNQPFFLSLHFTAPH 228


>gi|453364754|dbj|GAC79720.1| putative arylsulfatase [Gordonia malaquae NBRC 108250]
          Length = 783

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 102/197 (51%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+G  G  +IPTPNID +A  G  L+ ++T P C+P+RAA LTG  P R G  +   
Sbjct: 56  GYSDIGPFGA-EIPTPNIDRIAATGYRLSNYHTTPVCSPARAALLTGVNPHRAGYGSVAN 114

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-------ELLPFNRGFD 130
             P   G+   +      LP+ L+E GY+T  +GKWH+  + +       +  P  RGFD
Sbjct: 115 SDPGFPGLRLELADDVLTLPEILRESGYATFAVGKWHLVRDADMSPGRSRKSWPLQRGFD 174

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQS---VHVIKS 187
           ++ G   G     +S    +  +  ++  +++ Y       Y+TD  TD++   V  +++
Sbjct: 175 SYYGSLEGL----NSFFHPNQLIADNSVVDVDEYP---EGYYVTDDLTDRAIGQVKALRA 227

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL   H A+H
Sbjct: 228 HDADKPFFLYFAHIAMH 244


>gi|453077746|ref|ZP_21980484.1| arylsulfatase [Rhodococcus triatomae BKS 15-14]
 gi|452758328|gb|EME16720.1| arylsulfatase [Rhodococcus triatomae BKS 15-14]
          Length = 769

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 101/198 (51%), Gaps = 25/198 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
           G++D+G  G ++I TPN+D LA +G+ L  ++T P C+PSRAA LTG  P R G      
Sbjct: 50  GYSDIGPFG-SEIETPNLDRLAASGVRLTNYHTTPLCSPSRAALLTGVNPHRAGYGFVAN 108

Query: 79  -TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI--------GCNKEELLPFNRGF 129
             P   G+   +    + LP+ L+  GY+T+ +GKWH+        G  ++   P  RGF
Sbjct: 109 ADPGFPGLRLELSDDTQTLPEILRAGGYATYAVGKWHLVRDANIRPGSGRDS-WPTQRGF 167

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK--- 186
           D + G   G     +S    +  V  ++  +++ Y       YLTD  TD++V  +K   
Sbjct: 168 DRYYGSLEGL----NSFFHPNQLVSDNSAVDVDEYP---EGYYLTDDLTDKAVTYLKDLR 220

Query: 187 SHNHSRPLFLQITHAAVH 204
           +H   +P FL   H A+H
Sbjct: 221 AHEPDKPFFLYFAHVAMH 238


>gi|323454250|gb|EGB10120.1| hypothetical protein AURANDRAFT_62683 [Aureococcus anophagefferens]
          Length = 555

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/144 (38%), Positives = 78/144 (54%), Gaps = 21/144 (14%)

Query: 23  GWNDVGFHGENDIP----TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
           GW+D+      D+P    +P I  LA  G+ L  +Y    CTP+RAA LTGK+  R G  
Sbjct: 13  GWDDL--WESRDLPPAVVSPTIFRLAKEGVKLTSYYGQSYCTPARAALLTGKFVHRLGFA 70

Query: 79  TP---------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
           +P         V      +VP+  +LLP +L+ LGY+TH +GKW++G    E LP+ RGF
Sbjct: 71  SPEADYWGPLEVVGDANYSVPLGHELLPAHLRNLGYATHGVGKWNVGHCATEYLPWKRGF 130

Query: 130 DNHVGYWNGYLTYNDSIHETDFAV 153
           D  +GY      ++D IH T  AV
Sbjct: 131 DTFLGY------FSDGIHYTTHAV 148


>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 104/218 (47%), Gaps = 37/218 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++DVGF   + I TPNID                     +A+ L+G+Y   +GI   + 
Sbjct: 46  GFDDVGFK-SHQIKTPNID---------------------QASILSGRYAMHHGIVNWIP 83

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
              +  +P+    LPQ LK  GY TH IGKWH+G  K +  P  RGF++ +GY++G   Y
Sbjct: 84  PKDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPTFRGFNSFLGYYSGGENY 143

Query: 143 NDSIHETDFAVGLD----ARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
               +   + +  D      +N  + A  +  +Y T  F+D++V VI  H   +PLFL +
Sbjct: 144 FTHKNGPAYDMHRDPLPSCGQNCSQIAFDLQGQYSTTIFSDEAVRVIDDHIGPKPLFLYL 203

Query: 199 THAAVHTGTAGNAKLPTGLLQ-----VPDMEENDRTFA 231
            + AVH      A+ P   +      +PD +   RTFA
Sbjct: 204 AYQAVHE----PAQAPQSYIDPYTDLIPDAQR--RTFA 235


>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
 gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
           norvegicus]
          Length = 524

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 43  GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 102

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 103 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 161

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +   + +    LT  +  +++  I++ 
Sbjct: 162 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQ 221

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 222 HARQSPFFLYWAIDATHAPVY 242


>gi|301623486|ref|XP_002941046.1| PREDICTED: arylsulfatase D-like [Xenopus (Silurana) tropicalis]
          Length = 569

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/165 (36%), Positives = 82/165 (49%), Gaps = 15/165 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  +VG +G N + TPNID LA  G+ L  H    + CTPSRAAFLTG+YP R G+    
Sbjct: 35  GIGEVGCYGNNTLRTPNIDRLAREGVRLTHHIAAASLCTPSRAAFLTGRYPIRSGMTGHE 94

Query: 82  G-------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN---KEELL--PFNRGF 129
           G       + V+  +P  E    + L+E GY+T +IGKWH+G N   K +    P N GF
Sbjct: 95  GGYLVLMWSAVSGGLPTNETTFAKILQEQGYTTGIIGKWHLGVNCRSKNDFCYHPLNHGF 154

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLT 174
           D   G    Y   ND        + +  R  ++ YA   +   LT
Sbjct: 155 DYFYGL--TYTLINDCEESMPSEIHVPFRAKLQFYAQLFAMTLLT 197


>gi|399029424|ref|ZP_10730306.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
 gi|398072706|gb|EJL63910.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
          Length = 546

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 65/191 (34%), Positives = 94/191 (49%), Gaps = 12/191 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
           G++D+G +G ++I TPN+D LA  G  L   Y    C P+RA+ LTG+Y  + G+   D 
Sbjct: 38  GYSDLGNYG-SEIKTPNLDKLAAEGTRLREFYNNSICAPTRASLLTGQYQHKAGVGYFDV 96

Query: 80  PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            +G    +     E L L +  +  GYST L GKWH+G   +   P  RGFD   G   G
Sbjct: 97  NLGLPAYQGYLNKESLTLGEVFRSGGYSTILSGKWHVGSEDKSQWPNQRGFDKFYGILKG 156

Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
              Y D+      +T + V L   RN E   P+  S Y TD   + +V  ++  N  ++P
Sbjct: 157 ASNYFDTKPLPFGKTPYPVSL--IRNNEVLHPKDDSYYFTDEIGNNAVTFLEEQNKENKP 214

Query: 194 LFLQITHAAVH 204
            FL +   A H
Sbjct: 215 FFLYLAFTAPH 225


>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
           [Ailuropoda melanoleuca]
          Length = 519

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 38  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 97

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P  E LLP+ LK  GY++ ++GKWH+G ++ +  P   GFD   
Sbjct: 98  GHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 156

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D       Y       Q     LT  +  +++  +K  
Sbjct: 157 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQ 216

Query: 189 N-HSRPLFLQI----THAAVHT 205
               RP FL      THA V+ 
Sbjct: 217 QVAQRPFFLYWAIDGTHAPVYA 238


>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
          Length = 520

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 20/202 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 37  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 96

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P  E LLP+ LK  GY++ ++GKWH+G ++ +  P   GFD   
Sbjct: 97  GHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 155

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D       Y       Q     LT  +  +++  +K  
Sbjct: 156 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQ 215

Query: 189 N-HSRPLFLQI----THAAVHT 205
               RP FL      THA V+ 
Sbjct: 216 QVAQRPFFLYWAIDGTHAPVYA 237


>gi|229822462|ref|YP_002883988.1| sulfatase [Beutenbergia cavernae DSM 12333]
 gi|229568375|gb|ACQ82226.1| sulfatase [Beutenbergia cavernae DSM 12333]
          Length = 478

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 100/201 (49%), Gaps = 24/201 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G     TP+IDALA +G      Y   P C+P+RA+ LTGKYP R G+   +
Sbjct: 27  GWRDLGCFGSTFYETPHIDALAASGTRFTHSYAAAPVCSPTRASLLTGKYPARVGVTNWI 86

Query: 82  GAGVAKA---------VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
           G     A         +P  E  L + L+  GY T  +GKWH+G  +   LP + GFD +
Sbjct: 87  GGHAIGALRDVPYFHGLPQDEYALARALRAGGYRTWHVGKWHLGGGRH--LPEHHGFDLN 144

Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
           VG   G  + +   +   + +G      +E  AP    ++LTD  TD +V +++S + + 
Sbjct: 145 VG---GSASGSPVSYYAPYGIG-----ALED-APD--GEFLTDRLTDVAVDLVRSSDDA- 192

Query: 193 PLFLQITHAAVHTGTAGNAKL 213
           P  L + H AVHT     A L
Sbjct: 193 PFLLNLWHYAVHTPIEAPAHL 213


>gi|254511428|ref|ZP_05123495.1| arylsulfatase [Rhodobacteraceae bacterium KLH11]
 gi|221535139|gb|EEE38127.1| arylsulfatase [Rhodobacteraceae bacterium KLH11]
          Length = 545

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 45/96 (46%), Positives = 65/96 (67%), Gaps = 1/96 (1%)

Query: 35  IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEK 94
           I TP+I+ LA  G+ L R YT P+CTP+R A LTG++P R G++    A V + +P +E 
Sbjct: 90  IETPSINQLATEGMSLMRMYTEPSCTPTRTAMLTGRHPIRAGVEEVKVALVGEGLPASEV 149

Query: 95  LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
            LP+ LK++GY+T  +GKWH G + E+  P N+GFD
Sbjct: 150 TLPEILKQVGYNTAHVGKWHQG-DIEQSYPHNQGFD 184


>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
 gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
          Length = 522

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 20/205 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y   P C+PSRAA LTG+ P R G  T  
Sbjct: 41  GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIRTGFYTTN 100

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          +   +P  E LLP+ LK  GY++ ++GKWH+G ++ +  P   GFD   
Sbjct: 101 GHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 159

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQ 219

Query: 189 NHS-RPLFL----QITHAAVHTGTA 208
             +  P FL      THA V+   A
Sbjct: 220 QATHHPFFLYWAIDATHAPVYASRA 244


>gi|149196937|ref|ZP_01873990.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149140047|gb|EDM28447.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 462

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 103/219 (47%), Gaps = 41/219 (18%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ND+  +G   I +P ID LA  G+ L  +Y   P C+ SRAA LTG+YP   G+   
Sbjct: 33  QGYNDLSCYGSKTIKSPRIDQLAEEGLKLTSYYVASPVCSASRAALLTGRYPKLVGV--- 89

Query: 81  VGAGVA------KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              GV       K +    + + + LK +GY+T  +GKWH+G ++ E LP N+GFD++ G
Sbjct: 90  --PGVFFPNRGHKGLDPKHQTIAKLLKSVGYATKAVGKWHLG-DELEFLPTNQGFDSYYG 146

Query: 135 Y-------------WNGYLTYNDSIHETDFAVGLDARR----NMERYAPQM--------- 168
                         ++    Y + + +       +A +     M+   P M         
Sbjct: 147 IPYSNDMTPAFSMKYSENCLYREGVDQEALKKAFEANKIKPVGMKDKVPLMRNDECIEMP 206

Query: 169 -SSKYLTDFFTDQSVHVI-KSHNHSRPLFLQITHAAVHT 205
                +T  FTD+S+  I +S   ++P FL + H+  HT
Sbjct: 207 ADQSTITKRFTDESIKFIDESTASNKPFFLYLAHSMPHT 245


>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
          Length = 782

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 60/170 (35%), Positives = 88/170 (51%), Gaps = 24/170 (14%)

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           + A     VP+ EKLLP+ LKE GY+TH++GKWH+G  ++E LP  RGFD++ GY  G  
Sbjct: 363 IWACQPNCVPLDEKLLPELLKEAGYATHMVGKWHLGMYRKECLPTRRGFDSYFGYLLGSE 422

Query: 141 TYND--------SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
            Y          S++ T  A+     R+ E  A    + Y T+ F  ++V +I +H   +
Sbjct: 423 DYYSHERCVLIRSLNVTRCALDF---RDGEEVAVGYKNMYSTNVFAKRAVDLIANHPPDK 479

Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           PLFL +   +VH             LQVP  EE  + ++ I N  RR +A
Sbjct: 480 PLFLYLAFQSVHEP-----------LQVP--EEYVKPYSFIQNKKRRNYA 516


>gi|332665095|ref|YP_004447883.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332333909|gb|AEE51010.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 531

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 96/191 (50%), Gaps = 14/191 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G  +  TPN+D LA  GI L   Y    C P+RA+ LTG Y    G+   V 
Sbjct: 54  GYSDIGCYG-GEAQTPNLDKLATKGIKLRSFYNAGRCCPTRASLLTGNYSHAAGMGNMVS 112

Query: 83  AGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
               K  P   +         + ++L+++GY T++ GKWH+G  + E  P  RGFD + G
Sbjct: 113 FDDQKVTPGPYQGYLDPNTPTIAEHLRQVGYHTYMTGKWHVG-ERPEHWPLKRGFDRYFG 171

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
             +G  ++ + + E      +   ++ E   PQ    Y TD FTD+++  I+     S+P
Sbjct: 172 LISGASSFFEILQEKRKRYMV--LQDQEWVLPQ-EGFYATDAFTDRAIEFIQGQAPQSKP 228

Query: 194 LFLQITHAAVH 204
            FL + + A H
Sbjct: 229 FFLYLAYTAPH 239


>gi|225012438|ref|ZP_03702874.1| sulfatase [Flavobacteria bacterium MS024-2A]
 gi|225003415|gb|EEG41389.1| sulfatase [Flavobacteria bacterium MS024-2A]
          Length = 471

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 90/192 (46%), Gaps = 15/192 (7%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ D+G +G  DI TPN+D LA  G     +Y T P C+ SRA+ LTG YP R GI   
Sbjct: 35  QGFGDLGVYGATDIKTPNLDRLAGEGARFTSYYATQPVCSASRASILTGCYPDRIGIHNA 94

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
              G    +   E  L + LKE GY+T + GKWH+G +  E  P   GFD + G     +
Sbjct: 95  YSPGSKVGLNPEETTLAELLKEKGYATGIFGKWHLG-DAPEFQPRKHGFDEYYG-----I 148

Query: 141 TYNDSI---HETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
            Y++ +   H    AV     +    N           +LT   TD+++  IK  N   P
Sbjct: 149 LYSNDMWPKHPQQGAVFNFPDIKLYENETPLRVLEDQTFLTGALTDRAIDFIKK-NKENP 207

Query: 194 LFLQITHAAVHT 205
            F+ + H   H 
Sbjct: 208 FFVYLPHPQPHV 219


>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
          Length = 540

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 98/207 (47%), Gaps = 26/207 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GWN   FH  N+I TP +  L  NG+ L  HYT   C+P+RA+FLTG++P+++ +     
Sbjct: 41  GWNAPSFH-NNEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTGRFPYKHEMTN--- 96

Query: 83  AGVAKAVPVTEKL--------LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
                 +P T  L        L   LK+  YSTH IGKWH+G  K+E  P  RGFD   G
Sbjct: 97  ---TNLLPPTRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYRGFDTTFG 153

Query: 135 YWNGYLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR- 192
           +  G    Y      +  AV L    + E  A  M+  +    FTD ++ +I+++   R 
Sbjct: 154 FLTGGENHYTQRAFVSPPAVDL---WDEEAPAYGMNGTWTGKMFTDAALDIIRNNAQLRN 210

Query: 193 ------PLFLQITHAAVHTGTAGNAKL 213
                 PLF+      VH  T    +L
Sbjct: 211 ATGDAPPLFIYFALHDVHAPTQSPVRL 237


>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
          Length = 339

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 51/118 (43%), Positives = 69/118 (58%), Gaps = 4/118 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G+ D G+   +DI TPNID L  +GI     Y+   C+PSR+AFL+G+Y +  G+   V 
Sbjct: 43  GYQDAGYR-NSDIHTPNIDKLVADGISFTNAYSAQQCSPSRSAFLSGRYAYTSGMQHGVI 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNG 138
           G   A  + +    +  YLKEL Y+TH  GKWH+G CNK E  P  RGFD   G ++G
Sbjct: 102 GDTKAHCMDLKYNFISDYLKELKYNTHASGKWHLGYCNK-ECTPTYRGFDTFSGGYSG 158


>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
          Length = 478

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 91/186 (48%), Gaps = 11/186 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRY--GIDT 79
           G+ DV  +G  D+ TPN+D +A  G+  L  +     C+ +R A +TG+Y +R   G++ 
Sbjct: 50  GYADVSCYGRPDLNTPNVDRVALKGVRFLQAYANSAVCSATRTALITGRYQYRLPIGLEE 109

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P+G G    +P     LP  L++ GY T L+GKWH+G    +  P   G+D+  G+  G 
Sbjct: 110 PLGIGRDVGLPPEHPTLPSLLRKAGYRTTLLGKWHLGA-LPKFGPLQSGYDHFYGFRGGS 168

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQI 198
           + Y         A         +   P   S YLTD    ++V VI  ++HS RP  + +
Sbjct: 169 VDY------YTHAGPDQRDDLWDDDVPLRQSGYLTDLLGSRAVDVINGYSHSDRPFLVSL 222

Query: 199 THAAVH 204
             +A H
Sbjct: 223 HFSAPH 228


>gi|296140673|ref|YP_003647916.1| sulfatase [Tsukamurella paurometabola DSM 20162]
 gi|296028807|gb|ADG79577.1| sulfatase [Tsukamurella paurometabola DSM 20162]
          Length = 766

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 97/203 (47%), Gaps = 35/203 (17%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G +G ++IPTP++DALA  GI    H+T P C+P+RAA LTG  P R G      
Sbjct: 50  GFSDIGPYG-SEIPTPHLDALAARGIRSVNHHTTPVCSPARAALLTGINPHRAGY----- 103

Query: 83  AGVAKAVPVTEKL----------LPQYLKELGYSTHLIGKWHIGCNK-------EELLPF 125
           A VA + P    L          LP+ L+E GY+T+ +GKWH+  +            P 
Sbjct: 104 ASVANSDPGYPNLRLSLADDVLTLPEILREAGYATYAVGKWHLAKDSRLGPDADRGSWPL 163

Query: 126 NRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHV 184
            RGFD++ G   G    N   H          R N      +     Y+TD  TD +   
Sbjct: 164 QRGFDHYYGSLEG---LNSFFHPNQL-----VRDNTADPVTEYPDDFYVTDALTDTATSW 215

Query: 185 IK---SHNHSRPLFLQITHAAVH 204
           +K   +H+  +P FL   H A+H
Sbjct: 216 LKDLRAHDADKPFFLYFAHIAMH 238


>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
          Length = 466

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 43  GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 102

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 103 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 161

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +   + +    LT  +  +++  I++ 
Sbjct: 162 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQ 221

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 222 HARQSPFFLYWAIDATHAPVY 242


>gi|241267368|ref|XP_002406367.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215496881|gb|EEC06521.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 158

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 83/145 (57%), Gaps = 3/145 (2%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH 147
           A+P+   L+P+Y + LGY TH++GKWH+G    + +P  RGFD  +G++N  L Y +   
Sbjct: 13  ALPLDYTLMPEYFRRLGYKTHMVGKWHLGYYDRKYVPLKRGFDTFIGFYNPSLDYYNQNF 72

Query: 148 ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH-TG 206
             +   G D R     Y  +   +Y T ++T ++V +I+ H+ S P+FL ++H A H +G
Sbjct: 73  TGNNHTGHDFRCGDRNYWAE-EKEYATYYYTRKTVEIIRCHDKSTPMFLFLSHQAPHVSG 131

Query: 207 TAGNAKLPT-GLLQVPDMEENDRTF 230
                ++PT G+  V  + EN+RT 
Sbjct: 132 GRPLLQVPTHGVRNVSYIGENNRTL 156


>gi|422371415|ref|ZP_16451795.1| arylsulfatase [Escherichia coli MS 16-3]
 gi|315296831|gb|EFU56120.1| arylsulfatase [Escherichia coli MS 16-3]
          Length = 551

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 52/116 (44%), Positives = 68/116 (58%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+YP  +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYPIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|148223780|ref|NP_001086368.1| MGC82105 protein precursor [Xenopus laevis]
 gi|49522125|gb|AAH75173.1| MGC82105 protein [Xenopus laevis]
          Length = 569

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 85/171 (49%), Gaps = 15/171 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  +VG +G N + TPNID LA  G+ L  H    + CTPSRAAFLTG+YP R G+    
Sbjct: 35  GIGEVGCYGNNTLRTPNIDRLAREGVKLTHHIAASSLCTPSRAAFLTGRYPIRSGMTGHD 94

Query: 82  G-------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN---KEELL--PFNRGF 129
           G       + V+  +P  E    + L+E GY+T +IGKWH+G N   +++    P N GF
Sbjct: 95  GGYLVLMWSAVSGGLPTNETTFAKILQEQGYTTGIIGKWHLGVNCRSRDDFCHHPLNHGF 154

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQ 180
           D + G    Y   ND        + +  R  +  YA   +   LT   T +
Sbjct: 155 DYYYGLL--YTLINDCQASMPSEIHVAFRAQLLFYAQLFAVTLLTAMVTKR 203


>gi|118084193|ref|XP_416855.2| PREDICTED: arylsulfatase D [Gallus gallus]
          Length = 596

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 68/127 (53%), Gaps = 18/127 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
           G  DVG +G N I TPNID LA  G+ L +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 53  GIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLCTPSRAAFLTGRYPIRSGMASSN 112

Query: 81  --------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNR 127
                    G+G    +P  E    + L++ GY+T LIGKWH G N E        P N 
Sbjct: 113 RYRALQWNAGSG---GLPANETTFARLLQQQGYTTGLIGKWHQGVNCESFSDHCHHPLNH 169

Query: 128 GFDNHVG 134
           GFD   G
Sbjct: 170 GFDYFYG 176


>gi|149196006|ref|ZP_01873062.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
 gi|149140853|gb|EDM29250.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
          Length = 713

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 68/206 (33%), Positives = 92/206 (44%), Gaps = 33/206 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDT-- 79
           GWND+  +G     TP++D +A  G      Y   P C+P+RA+ L GKYP R G+    
Sbjct: 251 GWNDIACYGSQFYETPHLDKMAKEGFRFTDAYAANPVCSPTRASILLGKYPSRVGLSNHS 310

Query: 80  ----PVGAG-------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL---LPF 125
               P G G       V   +P+ +  L + LKE+GY T  IGKWH+  + +      P 
Sbjct: 311 GSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHLQAHHDTSRNHFPE 370

Query: 126 NRGFD-NHVGYWNG-----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
             GFD N  G+  G     Y  Y    H +          N+   A      YLTD  TD
Sbjct: 371 KHGFDLNIAGHRMGQPGSFYFPYKSKQHPS---------TNVPDMADGQEGDYLTDKLTD 421

Query: 180 QSVHVIKSHNHSRPLFLQITHAAVHT 205
           +++H IK  N   P FL   +  VHT
Sbjct: 422 KAIHYIKE-NKDTPFFLNFWYYTVHT 446


>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
 gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 39  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GF+   
Sbjct: 99  AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFNEWF 157

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +    +     LT  +T +++  I++ 
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 217

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 218 HARQSPFFLYWAIDATHAPVY 238


>gi|33601723|ref|NP_889283.1| sulfatase [Bordetella bronchiseptica RB50]
 gi|412337890|ref|YP_006966645.1| sulfatase [Bordetella bronchiseptica 253]
 gi|33576160|emb|CAE33239.1| probable sulfatase [Bordetella bronchiseptica RB50]
 gi|408767724|emb|CCJ52480.1| probable sulfatase [Bordetella bronchiseptica 253]
          Length = 464

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW ++G +G   I   PTP IDALA  G           C P+R+A +TG++P R G   
Sbjct: 19  GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            V AG+ + +   E+ L Q   E GY+T + GKWH+G +KE   P +RGFD     W G 
Sbjct: 79  SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133

Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
             T N+S+     AVG D                 A R  ERY  +M  + + +  T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189

Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
              I  H    P FL +    +H
Sbjct: 190 CEFIGRHAGKAPFFLYVPLTQLH 212


>gi|453072694|ref|ZP_21975742.1| arylsulfatase [Rhodococcus qingshengii BKS 20-40]
 gi|452757342|gb|EME15747.1| arylsulfatase [Rhodococcus qingshengii BKS 20-40]
          Length = 773

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 100/197 (50%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G  G ++I TP +D LA  GI +  ++T P C+PSRAA LTG  P R G      
Sbjct: 55  GYSDIGPFG-SEIETPTLDRLAAQGIRMTNYHTTPLCSPSRAALLTGLNPHRAGYGFVAN 113

Query: 83  A-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
           A     G+   +    + LP+ L+  GY+T+ +GKWH+  +         +  P  RGFD
Sbjct: 114 ADPGYPGLRLELADDVQTLPEILRGAGYATYAVGKWHLVRDANLAPGRSRDSWPTQRGFD 173

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK---S 187
            + G   G     +S +  +  +  ++  +++ Y       YLTD  TD++V  IK   +
Sbjct: 174 RYYGSLEGL----NSFYYPNQLISDNSVVDVDEYP---EGYYLTDDLTDKAVGYIKDLRA 226

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL   H A+H
Sbjct: 227 HDQDKPFFLYFAHVAMH 243


>gi|410420178|ref|YP_006900627.1| sulfatase [Bordetella bronchiseptica MO149]
 gi|408447473|emb|CCJ59148.1| probable sulfatase [Bordetella bronchiseptica MO149]
          Length = 464

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW ++G +G   I   PTP IDALA  G           C P+R+A +TG++P R G   
Sbjct: 19  GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            V AG+ + +   E+ L Q   E GY+T + GKWH+G +KE   P +RGFD     W G 
Sbjct: 79  SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133

Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
             T N+S+     AVG D                 A R  ERY  +M  + + +  T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189

Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
              I  H    P FL +    +H
Sbjct: 190 CEFIGRHAGKAPFFLYVPLTQLH 212


>gi|260804443|ref|XP_002597097.1| hypothetical protein BRAFLDRAFT_215750 [Branchiostoma floridae]
 gi|229282360|gb|EEN53109.1| hypothetical protein BRAFLDRAFT_215750 [Branchiostoma floridae]
          Length = 577

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 50/103 (48%), Positives = 62/103 (60%), Gaps = 7/103 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGI---D 78
           G  DVG  G + I TPNID++A  G  L +H    P CTPSRAAFLTG+YP RYG+   D
Sbjct: 34  GIGDVGCFGNDTIRTPNIDSIAAKGAKLTQHLAAAPVCTPSRAAFLTGRYPIRYGMAGRD 93

Query: 79  TP---VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN 118
            P   V   +   +P +E   PQ  K+ GY T L+GKWH+G N
Sbjct: 94  LPMAFVQLAIPSGLPRSEVTFPQLAKDHGYQTALLGKWHLGLN 136


>gi|33597308|ref|NP_884951.1| sulfatase [Bordetella parapertussis 12822]
 gi|427814649|ref|ZP_18981713.1| probable sulfatase [Bordetella bronchiseptica 1289]
 gi|33573735|emb|CAE38032.1| probable sulfatase [Bordetella parapertussis]
 gi|410565649|emb|CCN23207.1| probable sulfatase [Bordetella bronchiseptica 1289]
          Length = 464

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW ++G +G   I   PTP IDALA  G           C P+R+A +TG++P R G   
Sbjct: 19  GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            V AG+ + +   E+ L Q   E GY+T + GKWH+G +KE   P +RGFD     W G 
Sbjct: 79  SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133

Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
             T N+S+     AVG D                 A R  ERY  +M  + + +  T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189

Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
              I  H    P FL +    +H
Sbjct: 190 CEFIGRHAGKAPFFLYVPLTQLH 212


>gi|404216649|ref|YP_006670870.1| Arylsulfatase A-related enzyme [Gordonia sp. KTR9]
 gi|403647448|gb|AFR50688.1| Arylsulfatase A-related enzyme [Gordonia sp. KTR9]
          Length = 797

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 103/197 (52%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+   G  +I TPN+D LA NGI L+ ++T P C+P+RAA LTG  P R G  +   
Sbjct: 72  GYSDIAPFGA-EIDTPNLDRLARNGIRLSNYHTTPVCSPARAALLTGLNPHRAGYGSVAN 130

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI------GCNKEE-LLPFNRGFD 130
             P   G+   +      LP+ L+E GY+T+ +GKWH+      G  ++    P  RGFD
Sbjct: 131 SDPGFPGLRLELADDVLALPEILRESGYATYAVGKWHLVRDANMGPGRDRGSWPLQRGFD 190

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
           ++ G   G     +S    +  +  ++  ++E Y       Y+TD  TD+++  IKS   
Sbjct: 191 SYYGSLEGL----NSFFYPNQLIADNSVVDVETYP---EDYYVTDDLTDRAIGQIKSLRA 243

Query: 188 HNHSRPLFLQITHAAVH 204
            + ++P FL   H A+H
Sbjct: 244 QDPTKPFFLYFAHIAMH 260


>gi|296231792|ref|XP_002761310.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Callithrix jacchus]
          Length = 458

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 77/162 (47%), Gaps = 31/162 (19%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G +    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LKE GY T ++GKWH+G ++ +  P   GFD   
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFDEWF 160

Query: 134 GY---------------------WNGYLTYNDSIHETDFAVG 154
           G                      W     Y D++ E D ++G
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYGDAVREMDDSIG 202


>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 223

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 89/165 (53%), Gaps = 10/165 (6%)

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           + A  A  V + EK LPQYLK +GY TH IGKWH+G   +E  P  RGFD+  GY+ G  
Sbjct: 6   IFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYYGGQA 65

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHSRPLFLQ 197
            Y D    ++   GLD   +    +  + ++   Y T  ++ +++  I++HN ++P+FL 
Sbjct: 66  DYWDHSLASNGWWGLDLHYDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLY 125

Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + + AVH+     A L    LQ P  +E    F+HI +  R+ +A
Sbjct: 126 LAYQAVHS-----ANLREYPLQAP--QEWVDKFSHIKHKGRQNYA 163


>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
 gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
          Length = 471

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 60/184 (32%), Positives = 91/184 (49%), Gaps = 10/184 (5%)

Query: 26  DVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRY--GIDTPV- 81
           D+G +G+ DI TPN+D LA  GI   + Y     C+ +R A +TG+Y +R   G++ P+ 
Sbjct: 47  DLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATRFALITGRYQYRLRGGLEEPIA 106

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
           GA     +P T   LP  LK+ GY T LIGKWH+G       P   G+D+  G + G + 
Sbjct: 107 GASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLGY-LPTFGPLKSGYDSFFGNYGGAID 165

Query: 142 YNDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
           Y    H+    VG   + ++ E   P     Y TD    ++V  ++     +P  L + +
Sbjct: 166 Y--FTHKP--GVGPQVKEDLYEGEVPVHQIGYYTDLLGARAVDFVQKQQAGKPFLLSLHY 221

Query: 201 AAVH 204
            A H
Sbjct: 222 TAPH 225


>gi|395840581|ref|XP_003793133.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase E [Otolemur
           garnettii]
          Length = 811

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 65/106 (61%), Gaps = 7/106 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G   I TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDVGCYGNRTIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGRYPVRSGMVSSD 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEE 121
           G      AGV+  +P  E    + L++ GY T LIGKWH+G N E 
Sbjct: 105 GDRVLQWAGVSGGLPTNETTFAKILQDKGYVTGLIGKWHLGLNCES 150


>gi|226184211|dbj|BAH32315.1| probable arylsulfatase [Rhodococcus erythropolis PR4]
          Length = 773

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 100/197 (50%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
           G++D+G  G ++I TP +D LA  GI +  ++T P C+PSRAA LTG  P R G      
Sbjct: 55  GYSDIGPFG-SEIETPTLDRLASQGIRMTNYHTTPLCSPSRAALLTGLNPHRAGYGFVAN 113

Query: 79  -TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
             P   G+   +    + LP+ L+  GY+T+ +GKWH+  +         +  P  RGFD
Sbjct: 114 ADPGYPGLRLELADDVQTLPEILRGAGYATYAVGKWHLVRDANLAPGRSRDSWPTQRGFD 173

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK---S 187
            + G   G     +S +  +  +  ++  +++ Y       YLTD  TD++V  IK   +
Sbjct: 174 RYYGSLEGL----NSFYYPNQLISDNSVVDVDEYP---EGYYLTDDLTDKAVGYIKDLRA 226

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL   H A+H
Sbjct: 227 HDQDKPFFLYFAHVAMH 243


>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 465

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 27/202 (13%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFRYGIDT- 79
           QG+ DVG  G   I TPN+D +A  G      Y     CTPSR+A +TG+ P R G+ + 
Sbjct: 39  QGYGDVGTFGHPTIKTPNLDQMAMEGQKWTNFYVAANVCTPSRSAIMTGRLPVRTGMYSN 98

Query: 80  ------PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                 P   G    +P TE  + + LK  GYST  IGKWH+G +  E LP + GFD + 
Sbjct: 99  TRRVLFPDSGG---GLPATENTIAKLLKTSGYSTAAIGKWHLG-HLPEYLPTSHGFDTYF 154

Query: 134 G--YWNGYLTYNDSIHETDFAVG---------LDARRNMERYAPQMSSKYLTDFFTDQSV 182
           G  Y N     ND   +  FA           +  +  +ER A Q +   +T  +T+++V
Sbjct: 155 GIPYSNDMDRINDVTAQEAFASPKPEYFNVPLMRDKEIIERPADQTT---ITKRYTEEAV 211

Query: 183 HVIKSHNHSRPLFLQITHAAVH 204
             IK+ N  +P F+ + H+  H
Sbjct: 212 SYIKA-NKDQPFFIYLAHSLPH 232


>gi|348555459|ref|XP_003463541.1| PREDICTED: steryl-sulfatase-like [Cavia porcellus]
          Length = 580

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/125 (42%), Positives = 68/125 (54%), Gaps = 12/125 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G   + TPNID LA  G+ L +H    P CTPSRAAF+TG+YP R G+ +  
Sbjct: 33  GIGDLGCYGNQTLRTPNIDRLAGGGVKLTQHLAASPLCTPSRAAFMTGRYPIRLGMASHS 92

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
             GV      +  +P +E    + LKE GYST LIGKWH+G N          P   GFD
Sbjct: 93  RMGVYLFTASSGGLPTSEVTFARLLKEQGYSTALIGKWHLGINCYNTTDFCHHPLRHGFD 152

Query: 131 NHVGY 135
              G+
Sbjct: 153 YFYGF 157


>gi|345327068|ref|XP_001514429.2| PREDICTED: arylsulfatase E [Ornithorhynchus anatinus]
          Length = 629

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G + + TPNID LA  G+ L +H +  + CTPSRAAFLTG+YP R G+ +  
Sbjct: 85  GIGDLGCYGNDTLRTPNIDRLAQEGVRLTQHISAASVCTPSRAAFLTGRYPIRSGMVSSD 144

Query: 82  GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G  V +       +P  E    + L+E GYST LIGKWH G N E        P N GFD
Sbjct: 145 GYRVLRWTACSGGLPANETTFGEILQEQGYSTGLIGKWHQGLNCERSWDHCHHPLNHGFD 204

Query: 131 NHVG 134
              G
Sbjct: 205 YFFG 208


>gi|410635289|ref|ZP_11345904.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
 gi|410145262|dbj|GAC13109.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
          Length = 493

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 95/194 (48%), Gaps = 15/194 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP--TCTPSRAAFLTGKYPFRYGIDTP 80
           G+ D+   G + I TPNID++   G   +R + +P   C+PSRA+ LTG+YP R G+   
Sbjct: 53  GYGDISSFGADGIRTPNIDSIGQEGFT-SRDFFIPANVCSPSRASLLTGRYPMRNGMPVA 111

Query: 81  VGAGVAKAVPV------TEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
           V     K V         E  +P+ LK  GY + ++GKWH+G  ++   P + GFD H+G
Sbjct: 112 VNPLSEKHVSSHFGLHPDEITIPEMLKPAGYRSLMVGKWHLGFQQKGSHPLDAGFDEHLG 171

Query: 135 YWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
               Y  Y    ++  + +  D +   R  E    ++  + +T  +TD+ +  I+     
Sbjct: 172 LLGNY--YKARENDPRYPILKDNQTLYRGHEAVKEEIELEEVTQRYTDEVISFIEREKDG 229

Query: 192 RPLFLQITHAAVHT 205
            P F+   H  VH+
Sbjct: 230 -PFFVYFAHNIVHS 242


>gi|326913667|ref|XP_003203156.1| PREDICTED: arylsulfatase D-like [Meleagris gallopavo]
          Length = 576

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 68/127 (53%), Gaps = 18/127 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
           G  DVG +G N I TPNID LA  G+ L +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 33  GIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLCTPSRAAFLTGRYPIRSGMASSN 92

Query: 81  --------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNR 127
                    G+G    +P  E    + L++ GY+T LIGKWH G N E        P N 
Sbjct: 93  QYRALQWNAGSG---GLPANETTFARILQQQGYTTGLIGKWHQGVNCESFNDHCHHPLNH 149

Query: 128 GFDNHVG 134
           GFD   G
Sbjct: 150 GFDYFYG 156


>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
          Length = 431

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 87/163 (53%), Gaps = 24/163 (14%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYN---- 143
            +P+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   Y     
Sbjct: 19  CIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 78

Query: 144 ----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
               D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +PLFL + 
Sbjct: 79  CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITNHPPEKPLFLYLA 135

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +VH             LQVP  EE  + +  I + +RR +A
Sbjct: 136 LQSVHEP-----------LQVP--EEYLKPYDFIQDRNRRYYA 165


>gi|149178470|ref|ZP_01857059.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
 gi|148842683|gb|EDL57057.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
          Length = 491

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 94/193 (48%), Gaps = 16/193 (8%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ DVG  G  +I TPN+D +A  GI     Y     C+ SR A LTG YP R GI   
Sbjct: 56  QGYQDVGVFGSPNIKTPNLDQMAKEGIRFTDFYAAQAVCSASRVALLTGCYPNRVGIRGA 115

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +G      +   E  + + +K  GY+T + GKWH+G +  E LP   GFD + G     L
Sbjct: 116 LGPQSKIGINAEETTIAEVVKPQGYATAIYGKWHLG-HLPEFLPTRHGFDEYFG-----L 169

Query: 141 TYNDSIHETDFAVG-----LDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHSR 192
            Y++ +       G     L    N     P+++ K    L+ ++T+++V  I + NH +
Sbjct: 170 PYSNDMWPFHPTAGKRFPDLPLIENETVINPKVTGKEQAQLSTWYTERAVSFI-NKNHDK 228

Query: 193 PLFLQITHAAVHT 205
           P FL + H+  H 
Sbjct: 229 PFFLYVPHSMPHV 241


>gi|421612498|ref|ZP_16053605.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
 gi|408496794|gb|EKK01346.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
          Length = 487

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 19/200 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ND+G +G  +I TPN+D LA  G      Y+    C+PSRAA LTG YP R G+   
Sbjct: 57  QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 116

Query: 81  VGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
           V    +K  +   E  +  +LK  GY+T  +GKWH+G +K E LP + GFD++ G  Y N
Sbjct: 117 VLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 175

Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
                     G ++ +D   +   AV L      ++ E     +  + +T  +TD+++  
Sbjct: 176 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDRAIEF 235

Query: 185 IKSHNHSRPLFLQITHAAVH 204
           +++ N  +P FL + H+  H
Sbjct: 236 VEA-NQDKPFFLYLPHSMPH 254


>gi|336118326|ref|YP_004573095.1| arylsulfatase [Microlunatus phosphovorus NM-1]
 gi|334686107|dbj|BAK35692.1| arylsulfatase [Microlunatus phosphovorus NM-1]
          Length = 785

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 114/230 (49%), Gaps = 38/230 (16%)

Query: 5   VGAGVAKAVPV--TEKLLPQG-------------WNDVGFHGENDIPTPNIDALAYNGIV 49
           +G  ++++VP    E+  PQG             + D+G +G ++I TP++D LA +G+ 
Sbjct: 25  IGRTISESVPAWPAERTAPQGSPNVIVIVVDDLGYADLGPYG-SEIATPHLDRLAADGVR 83

Query: 50  LNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGA-----GVAKAVPVTEKLLPQYLKELG 104
              ++T P C+PSRAA LTG  P + G   P  +          +P     L + L+E G
Sbjct: 84  FTNYHTTPLCSPSRAALLTGLNPHKAGFAFPANSDPGYPAYTFTLPDNAPTLAETLRERG 143

Query: 105 YSTHLIGKWHIGCNK-------EELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDA 157
           Y+T  +GKWH+  ++       +   P  RGFD + G   G+     S+H     V  ++
Sbjct: 144 YATFALGKWHLTGDRLQHDGASKASWPCQRGFDRYFGALEGFT----SLHAPHRLVWDNS 199

Query: 158 RRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHNHSRPLFLQITHAAVH 204
              ++ +    +  YLTD  T++++ +I   ++ +  +P FL + HAAVH
Sbjct: 200 PYPVQEFP---ADYYLTDDLTERAIEMISTLRAADADKPFFLYLAHAAVH 246


>gi|334145238|ref|YP_004538448.1| sulfatase family protein [Novosphingobium sp. PP1Y]
 gi|333937122|emb|CCA90481.1| sulfatase family protein [Novosphingobium sp. PP1Y]
          Length = 425

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/184 (32%), Positives = 89/184 (48%), Gaps = 11/184 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGI-DTP 80
           G+ D+   G   I TPNID +A  G   ++ Y     CTPSRA  LTG+YP R G+ D  
Sbjct: 20  GYGDLSITGARGIKTPNIDRMAREGRTFSQFYAAANLCTPSRAGLLTGRYPVRTGLGDKV 79

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +     + +P +E  +P  LK   Y+T L GKWH+G    + LP + GFD  VG     +
Sbjct: 80  ILYNDDRVLPTSEVTIPTALKTAEYATGLFGKWHLGHRGPDWLPTHHGFDRFVG-----I 134

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y+   H+    V + A    E +        L   F +++   I + N  RP F+++  
Sbjct: 135 PYS---HDMSPLVLVRAEAGKEAHQVPTEITPLQQIFCEEAEQFI-TENAERPFFVELAL 190

Query: 201 AAVH 204
           +A H
Sbjct: 191 SAPH 194


>gi|427822349|ref|ZP_18989411.1| probable sulfatase [Bordetella bronchiseptica Bbr77]
 gi|410587614|emb|CCN02660.1| probable sulfatase [Bordetella bronchiseptica Bbr77]
          Length = 464

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW ++G +G   I   PTP IDALA  G           C P+R+A +TG++P R G   
Sbjct: 19  GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            V AG+ + +   E+ L Q   E GY+T + GKWH+G +KE   P +RGFD     W G 
Sbjct: 79  SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133

Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
             T N+S+     AVG D                 A R  ERY  +M  + + +  T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189

Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
              I  H    P FL +    +H
Sbjct: 190 CEFIGRHAGKVPFFLYVPLTQLH 212


>gi|392969626|ref|ZP_10335041.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387841820|emb|CCH57099.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 477

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 97/197 (49%), Gaps = 26/197 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+  +G     TP++D+LA  GI   + Y+  P C+PSRAA LTGK+P R  +   +
Sbjct: 48  GYMDLRCYGNPYNETPHLDSLARRGIRFTQAYSACPVCSPSRAAILTGKHPARLHLTNFI 107

Query: 82  G------------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
           G            A   + +P +E  L + LK+ GY T ++GKWH+G   + L P  +GF
Sbjct: 108 GGERVDTTSSLLPAEWRRYLPASETTLAELLKQQGYVTGMVGKWHLGNTGDSLTPTAQGF 167

Query: 130 DNHVGYW-NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
           D       NG   YN SI   +  V  D  +           +YLTD  TD ++  I  +
Sbjct: 168 DYERQISKNGLDYYNYSIASNNKTVFEDTGK-----------EYLTDKLTDYALEFIDQN 216

Query: 189 NH-SRPLFLQITHAAVH 204
               +PLFL + ++A H
Sbjct: 217 KAGQKPLFLYLAYSAPH 233


>gi|345330079|ref|XP_001507106.2| PREDICTED: arylsulfatase D-like [Ornithorhynchus anatinus]
          Length = 607

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 66/125 (52%), Gaps = 13/125 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG----- 76
           G  D+G +G   I TPNID LA  G+ L +H    P CTPSRA+FLTG+YP R G     
Sbjct: 42  GIGDLGCYGNTTIRTPNIDRLAKEGVRLTQHLAAAPLCTPSRASFLTGRYPIRSGRMESE 101

Query: 77  --IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGF 129
             I   V  G +  +P  E    + L++ GY+T LIGKWH G N E        P N GF
Sbjct: 102 ELIRVIVWNGASGGLPANETTFARILQQQGYTTGLIGKWHQGVNCESRTDYCHHPLNHGF 161

Query: 130 DNHVG 134
           D   G
Sbjct: 162 DYFFG 166


>gi|429202673|ref|ZP_19194044.1| arylsulfatase [Streptomyces ipomoeae 91-03]
 gi|428661782|gb|EKX61267.1| arylsulfatase [Streptomyces ipomoeae 91-03]
          Length = 769

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 92/197 (46%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G  G  ++PTP +DALA  G+ L  ++TLP C+PSRAA LTG  P R G      
Sbjct: 49  GYSDIGPFGA-EVPTPVLDALAEQGVRLTNYHTLPLCSPSRAALLTGANPHRVGYAMVAN 107

Query: 83  A-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-------LPFNRGFD 130
           A     G    +      L + L+  GY+T+ +GKWH+  +            P  +GFD
Sbjct: 108 ADPGFPGYGMEIADDFPTLAETLRGAGYATYAVGKWHLARDASSSAAADRSNWPLQKGFD 167

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
            + G   G                LD     E Y       Y TD  TDQ++ ++KS   
Sbjct: 168 QYYGVLEGLTNLFHPHQLVRDNSPLDIDEFPEGY-------YYTDDITDQAIAMVKSLRA 220

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL + H AVH
Sbjct: 221 HDPDKPFFLYLAHNAVH 237


>gi|427819008|ref|ZP_18986071.1| probable sulfatase [Bordetella bronchiseptica D445]
 gi|410570008|emb|CCN18144.1| probable sulfatase [Bordetella bronchiseptica D445]
          Length = 464

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW ++G +G   I   PTP IDALA  G           C P+R+A +TG++P R G   
Sbjct: 19  GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
            V AG+ + +   E+ L Q   E GY+T + GKWH+G +KE   P +RGFD     W G 
Sbjct: 79  SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133

Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
             T N+S+     AVG D                 A R  ERY  +M  + + +  T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189

Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
              I  H    P FL +    +H
Sbjct: 190 CEFIGRHAGKVPFFLYVPLTQLH 212


>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
          Length = 475

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 87/163 (53%), Gaps = 24/163 (14%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYN---- 143
            +P+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G   Y     
Sbjct: 68  CIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 127

Query: 144 ----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
               D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +PLFL + 
Sbjct: 128 CTVIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITTHPPEKPLFLYLA 184

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +VH             LQVP  EE  + +  I + +RR +A
Sbjct: 185 LQSVHEP-----------LQVP--EEYLKPYDFIQDKNRRHYA 214


>gi|395804314|ref|ZP_10483554.1| sulfatase [Flavobacterium sp. F52]
 gi|395433413|gb|EJF99366.1| sulfatase [Flavobacterium sp. F52]
          Length = 550

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 93/191 (48%), Gaps = 12/191 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
           G++D+G +G ++I TPN+D LA  G  L   Y    C P+RA+ LTG+Y  + G+   D 
Sbjct: 42  GYSDLGNYG-SEIKTPNLDRLAKEGTRLREFYNNSICAPTRASLLTGQYQHKAGVGYFDV 100

Query: 80  PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            +G    +     E L L +  +  GYST + GKWH+G   +   P  RGFD   G   G
Sbjct: 101 NLGLPAYQGYLNKESLTLGEVFRSGGYSTLMSGKWHVGSEDQSQWPNQRGFDKFYGILKG 160

Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
              Y D+       T + V +   RN E   P+  S Y TD   + +V  ++  N  ++P
Sbjct: 161 ASNYFDTKPLPFGTTPYPVKM--IRNNEELHPKDDSYYFTDEIGNNAVTFLEEQNKENKP 218

Query: 194 LFLQITHAAVH 204
            FL +   A H
Sbjct: 219 FFLYLAFTAPH 229


>gi|357393955|ref|YP_004908796.1| putative arylsulfatase [Kitasatospora setae KM-6054]
 gi|311900432|dbj|BAJ32840.1| putative arylsulfatase [Kitasatospora setae KM-6054]
          Length = 778

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 93/197 (47%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
           G++D+G  G +++PTP +D LA  G+ L  ++T+P C+P+RAA LTG  P R G      
Sbjct: 54  GYSDIGPFG-SEVPTPTLDGLAERGVKLANYHTMPLCSPARAALLTGLNPHRVGYSFVAN 112

Query: 79  -TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-------GCNKEELLPFNRGFD 130
             P   G    +      L Q L + GY+T+ +GKWH+         +     P  +GFD
Sbjct: 113 ADPGFPGYGMEIAGDIPTLAQTLHDAGYATYAVGKWHLTRDSASSAADNRANWPLQKGFD 172

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
            + G   G  +             LD     + Y       Y TD  TDQ++ ++KS   
Sbjct: 173 QYYGVLEGLTSLFHPHQLVRDNSPLDIDEFPDGY-------YYTDDITDQAIGMVKSLRA 225

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL + H AVH
Sbjct: 226 HDADKPFFLYLAHNAVH 242


>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 97/201 (48%), Gaps = 20/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 39  GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E LLP+ LK+ GY+  ++GKWH+G ++ +  P   GFD   
Sbjct: 99  AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 157

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
           G  N +    D+  + +  V  D     R  E +    +     LT  +  +++  I++ 
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYLQEALDFIRTQ 217

Query: 188 HNHSRPLFL----QITHAAVH 204
           H    P FL      THA V+
Sbjct: 218 HARQGPFFLYWAIDATHAPVY 238


>gi|317479852|ref|ZP_07938971.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|316903981|gb|EFV25816.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 541

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 23/198 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++DVG +G  +IPTPNID LA  G+   + Y      P+RA+ LTG YP + GI     
Sbjct: 48  GYSDVGCYG-GEIPTPNIDRLAQKGVRYTQFYNSGRSCPTRASLLTGLYPQQAGIGAMSE 106

Query: 80  ----------PVGAGVAKAVPVTEK---LLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
                     P   GV   +    +    + + LKE GY T++ GKWH+G + +E  P  
Sbjct: 107 DPGIKKGEKHPENRGVHGYMGFLNRNCVTIAEVLKEAGYHTYMTGKWHVGMHGKEKWPLQ 166

Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
           RGF++  G   G  +Y     +    + LD   N    APQ S  Y TD FTD ++  I 
Sbjct: 167 RGFEHFYGILAGASSYLKP--QGGRGLTLD---NTNLPAPQ-SPYYTTDAFTDYAIRFID 220

Query: 187 SHNHSRPLFLQITHAAVH 204
                 P FL + + A H
Sbjct: 221 EQTDDNPFFLYLAYNAPH 238


>gi|417301368|ref|ZP_12088525.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327542298|gb|EGF28785.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 470

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 19/200 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ND+G +G  +I TPN+D LA  G      Y+    C+PSRAA LTG YP R G+   
Sbjct: 38  QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 97

Query: 81  VGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
           V    +K  +   E  +  +LK  GY+T  +GKWH+G +K E LP + GFD++ G  Y N
Sbjct: 98  VLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 156

Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
                     G ++ +D   +   AV L      ++ E     +  + +T  +TD+++  
Sbjct: 157 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEF 216

Query: 185 IKSHNHSRPLFLQITHAAVH 204
           +++ N  +P FL + H+  H
Sbjct: 217 VEA-NQDKPFFLYLPHSMPH 235


>gi|149198650|ref|ZP_01875694.1| arylsulfatase (aryl-sulfate sulphohydrolase) [Lentisphaera araneosa
           HTCC2155]
 gi|149138365|gb|EDM26774.1| arylsulfatase (aryl-sulfate sulphohydrolase) [Lentisphaera araneosa
           HTCC2155]
          Length = 569

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 19/195 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ D+G +G  +I TPN+D LA  G+   + Y    C P+RA+ LTG YP + GI   + 
Sbjct: 34  GYTDIGSYG-GEIDTPNLDGLAKEGLRFTQFYNTGRCCPTRASLLTGLYPHQAGIGHMMS 92

Query: 83  ----AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-----KEEL--LPFNRGFDN 131
                G    +  T   + + LK   YST+++GKWH+  N     KE     P NRGFD+
Sbjct: 93  DRGTDGYRGDLNKTSVTIAEVLKPAAYSTYMVGKWHVTKNLLNDDKESQYNWPLNRGFDH 152

Query: 132 HVGYWNGYLTYND--SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN 189
             G  +G  ++ D  S+   D  +      N   Y P+  + Y TD  +D ++  I  H+
Sbjct: 153 FYGTIHGAGSFFDPNSLTRDDKYI---TPENDPEYQPE--TYYYTDAISDNAIKYINEHD 207

Query: 190 HSRPLFLQITHAAVH 204
             +P F+ + + A H
Sbjct: 208 SQKPFFMYVAYTAAH 222


>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
 gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
          Length = 468

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 95/190 (50%), Gaps = 15/190 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYP--FRYGIDT 79
           G  D+G +G  DI TP IDA+A  G+     Y     C+P+R A LTG+Y   FR G++ 
Sbjct: 49  GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108

Query: 80  PVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           P+   G   ++P   + LP  L++LGY+T L+GKWH+G       P + G+D   G  +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVG-ELPASSPLDHGYDYFFGIASG 167

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPL 194
              Y  +  +I+  +     + R  ++R        YLTD    +++  ++ +    RP 
Sbjct: 168 GTDYFAHATTINGHEMGKLFENRTEIQR------PGYLTDLLGAKAIDRMRLAARQDRPF 221

Query: 195 FLQITHAAVH 204
           F+ +   A H
Sbjct: 222 FISLHFTAPH 231


>gi|323451705|gb|EGB07581.1| hypothetical protein AURANDRAFT_27261 [Aureococcus anophagefferens]
          Length = 614

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 7/116 (6%)

Query: 24  WNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPVG 82
           +ND   H      TP +  ++ +G VL+  Y  PTCTPSRA  +TG+Y  R G+ D+ + 
Sbjct: 61  YNDAALH------TPELQRMSEHGFVLDNFYAAPTCTPSRAMLMTGRYNIRNGMQDSVIH 114

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           +   + VP+ E+ L Q L + GY T  IGKWH+G +++   P  RGFD   G   G
Sbjct: 115 STEPRGVPLDERFLSQKLSDAGYRTAAIGKWHLGMHRDAYTPLKRGFDLFYGILTG 170


>gi|346644762|ref|NP_001231049.1| arylsulfatase E precursor [Sus scrofa]
          Length = 585

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/120 (43%), Positives = 70/120 (58%), Gaps = 12/120 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G + I TPNID LA +G++L +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDLGCYGNHTIRTPNIDRLAADGVMLTQHLAAASLCTPSRAAFLTGRYPLRSGMVSST 104

Query: 82  GAGVAKAV------PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G+ V + V      P  E    + LK+ GY T L+GKWH+G N +        P N GFD
Sbjct: 105 GSRVLQWVAASGGLPPNETTFAKILKDKGYVTGLVGKWHLGLNCDSSEDHCHHPLNHGFD 164


>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 580

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 99/188 (52%), Gaps = 12/188 (6%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
           G++D G +G ++I TPNID LAY G+ L   Y    C P+RA+ +TG+YP + G+   +T
Sbjct: 44  GYSDFGAYG-SEIQTPNIDKLAYGGLRLKEFYNNSICAPTRASLITGQYPHKAGLGYFNT 102

Query: 80  PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
            +G    +     E L   + L++ GY+T+L GKWH+G N     P  RGF+   G+  G
Sbjct: 103 NLGLPAYQGWLNQESLTFGEVLQQGGYNTYLTGKWHVG-NDSLYWPNQRGFNKFYGFIGG 161

Query: 139 YLTYNDSIHETDFAVGLDARRNMER--YAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
              Y D     + A  ++   N +R   AP    KYLTD  T+ ++  I + +  +P FL
Sbjct: 162 ASNYYDISPYPEKAPPVELVENNQRINLAP---GKYLTDEITNHALSYI-NESKDKPFFL 217

Query: 197 QITHAAVH 204
            +   A H
Sbjct: 218 YLAFNAPH 225


>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
 gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
          Length = 502

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 95/201 (47%), Gaps = 19/201 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +GE    TPN+D +A  G++    Y+  P  +PSRAA LTG+ P R G  T  
Sbjct: 16  GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTGRLPIRNGFYTTN 75

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E+LLP+ LK+ GY + ++GKWH+G ++ +  P   GFD   
Sbjct: 76  AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 134

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
           G  N +    D+    +  V  D     R  E +   + +    LT  +  +++  IK  
Sbjct: 135 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 194

Query: 189 NHSRPLFL----QITHAAVHT 205
               P FL      THA V+ 
Sbjct: 195 ARHHPFFLYWAVDATHAPVYA 215


>gi|160890611|ref|ZP_02071614.1| hypothetical protein BACUNI_03056 [Bacteroides uniformis ATCC 8492]
 gi|156859610|gb|EDO53041.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
          Length = 520

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 23/198 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++DVG +G  +IPTPNID LA  G+   + Y      P+RA+ LTG YP + GI     
Sbjct: 27  GYSDVGCYG-GEIPTPNIDRLAQKGVRYTQFYNSGRSCPTRASLLTGLYPQQAGIGAMSE 85

Query: 80  ----------PVGAGVAKAVPVTEK---LLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
                     P   GV   +    +    + + LKE GY T++ GKWH+G + +E  P  
Sbjct: 86  DPGIKKGEKHPENRGVHGYMGFLNRNCVTIAEVLKEAGYHTYMTGKWHVGMHGKEKWPLQ 145

Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
           RGF++  G   G  +Y     +    + LD   N    APQ S  Y TD FTD ++  I 
Sbjct: 146 RGFEHFYGILAGASSYLKP--QGGRGLTLD---NTNLPAPQ-SPYYTTDAFTDYAIRFID 199

Query: 187 SHNHSRPLFLQITHAAVH 204
                 P FL + + A H
Sbjct: 200 EQTDDNPFFLYLAYNAPH 217


>gi|291231637|ref|XP_002735770.1| PREDICTED: steroid sulfatase-like [Saccoglossus kowalevskii]
          Length = 584

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/125 (44%), Positives = 71/125 (56%), Gaps = 13/125 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG  G + I TPNID LA  G+ LN +    PTCTPSRAAFLTG+YP R G+ + +
Sbjct: 35  GIGDVGCFGNDTIRTPNIDRLAAEGVKLNHNMVPAPTCTPSRAAFLTGRYPIRMGLASRL 94

Query: 82  GA---GVAKAV---PVTEKLLPQYLKELGYSTHLIGKWHIGCNK------EELLPFNRGF 129
                G   A+   P +E    + LKE GY+T ++GKWH+G +        E  P N+GF
Sbjct: 95  AGTMFGYNSAIGGMPSSEITFAELLKEAGYTTAVLGKWHLGLHSFSFGRNFEFHPLNQGF 154

Query: 130 DNHVG 134
           D   G
Sbjct: 155 DFFYG 159


>gi|443716274|gb|ELU07882.1| hypothetical protein CAPTEDRAFT_217757 [Capitella teleta]
          Length = 324

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 72/118 (61%), Gaps = 6/118 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G  G + + TP++D++  NG+ L+      + CTPSRAA +T +Y  R G+ + +
Sbjct: 34  GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRTGMSSVI 93

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
            + ++ + +P +E  LPQ L+E GY+T LIGKWH+G N++ L     P  RGFD   G
Sbjct: 94  TSLMSPQGLPTSEHTLPQMLQEKGYATALIGKWHLGWNRQLLDQYYSPLKRGFDYFFG 151


>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 508

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 57/249 (22%)

Query: 23  GWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR----- 74
           GW +VG+H      ++ TPNID+L   G+ L++HY    C+PSR++ ++G+ P       
Sbjct: 36  GWANVGYHRNPPTKEVVTPNIDSLVRQGLELDQHYVFNVCSPSRSSLMSGRLPIHVNDLN 95

Query: 75  -----YGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
                Y  D PV      A+P     + Q +K  GY TH +GKW  G       P  RGF
Sbjct: 96  IEPDYYNPDDPVSG--FSAIPRNMTGIAQKMKLGGYDTHQVGKWDAGMATHTHTPKGRGF 153

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTD-------------- 175
           D+  GY++         H  DF   +D +   +    ++   ++TD              
Sbjct: 154 DSSFGYFH---------HANDFYTEIDGKPCNKT---KIVDIWVTDKPGYGLNGTGPDNY 201

Query: 176 ---FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH 232
               F +Q + V+  H+  +PLFL      VH             LQVP   ++   F+ 
Sbjct: 202 EEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----------PLQVPQRYQD--KFSF 248

Query: 233 ISNPDRRLF 241
           I + DR+++
Sbjct: 249 IDDHDRQIY 257


>gi|406833313|ref|ZP_11092907.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 613

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/233 (30%), Positives = 102/233 (43%), Gaps = 35/233 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           GW D    G   + TPNID++A  G+ L+R +  P C P+RA FLTG+Y  R G+   V 
Sbjct: 38  GWGDYSHSGNQQVSTPNIDSIAKGGVSLDRFFVCPVCAPTRAEFLTGRYHPRGGVRG-VS 96

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY----WNG 138
            G+ + + + EK L    +  GY+T   GKWH G ++    P  RGFD + GY    W  
Sbjct: 97  TGLER-LDLDEKTLADAFQAAGYATGAFGKWHNG-SQWPYHPTARGFDEYFGYTAGHWGE 154

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
           Y           F   L+    M R     +  Y+ D  TD+++  I +H   +P    +
Sbjct: 155 Y-----------FDAPLEDHGEMVR-----TKGYIVDVCTDRALQFIDAHQQ-KPFLCYV 197

Query: 199 THAAVHTGTAG--------NAKLPTGLLQVPDMEENDRT---FAHISNPDRRL 240
                H+  A           +  + L   PD E  + T    A I N DR +
Sbjct: 198 PFTTPHSPWAAPESDWMRFRDRPLSQLASEPDQEVPEHTRCALAMIENQDRNV 250


>gi|87312329|ref|ZP_01094424.1| arylsulfatase A [Blastopirellula marina DSM 3645]
 gi|87284951|gb|EAQ76890.1| arylsulfatase A [Blastopirellula marina DSM 3645]
          Length = 477

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/199 (30%), Positives = 105/199 (52%), Gaps = 21/199 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR-YGIDTPV 81
           G+ D+   G     TPN++A+A  G+ L   Y  P C+PSRAA +TG YP R   I   +
Sbjct: 42  GYADIEPFGSEVNRTPNLNAMADEGMKLTCFYAAPVCSPSRAALMTGCYPKRALTIPHVL 101

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNGY 139
             G A+ +   E  + + +KE GY+T +IGKWH+G ++ + LP  +GFD + G  Y N  
Sbjct: 102 FPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG-DQPDFLPTRQGFDYYYGLPYSNDM 160

Query: 140 LTYNDSIHETDFAVGLDARRN--------------MERYAPQMSSKYLTDFFTDQSVHVI 185
               D + ++++   +  R+               ++R   +  ++ +T+ +T++++  I
Sbjct: 161 GPAADGV-KSNYGAPIPQRKGKGQPPLPLLRNETVLQRVLAKDQTELVTN-YTEEAIQFI 218

Query: 186 KSHNHSRPLFLQITHAAVH 204
           + H   +P FL + H+AVH
Sbjct: 219 RDH-QEKPFFLYLPHSAVH 236


>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
 gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
          Length = 468

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 95/190 (50%), Gaps = 15/190 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYP--FRYGIDT 79
           G  D+G +G  DI TP IDA+A  G+     Y     C+P+R A LTG+Y   FR G++ 
Sbjct: 49  GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108

Query: 80  PVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           P+   G   ++P   + LP  L++LGY+T L+GKWH+G       P + G+D   G  +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVG-ELPASSPLDHGYDYFFGIASG 167

Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV-HVIKSHNHSRPL 194
              Y  +  +I+  +     + R  ++R        YLTD    +++  + ++    RP 
Sbjct: 168 GTDYFAHATTINGHEMGKLFENRTEIQR------PGYLTDLLGAKAIDRMQQAARQDRPF 221

Query: 195 FLQITHAAVH 204
           F+ +   A H
Sbjct: 222 FISLHFTAPH 231


>gi|443704175|gb|ELU01350.1| hypothetical protein CAPTEDRAFT_214223 [Capitella teleta]
          Length = 336

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/118 (42%), Positives = 69/118 (58%), Gaps = 4/118 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ D G+   +DI TPNID L  +GI     Y+   C+PSR++FL+G+Y +  G+   V 
Sbjct: 105 GYQDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRYAYTSGMQHGVI 163

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNG 138
           +  A   + +    L  YLKEL Y+TH  GKWH+G CNK E  P  RGFD   G ++G
Sbjct: 164 SDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNK-ECTPTYRGFDTFSGGYSG 220


>gi|346992478|ref|ZP_08860550.1| sulfatase [Ruegeria sp. TW15]
          Length = 546

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 45/96 (46%), Positives = 63/96 (65%), Gaps = 1/96 (1%)

Query: 35  IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEK 94
           I TP+I+  A  G+ L R YT P+CTP+R A LTG++P R G+     A V + +P +E 
Sbjct: 90  IETPSINQFATEGLSLMRMYTEPSCTPTRTAMLTGRHPVRAGVSEVKVALVGEGLPASEV 149

Query: 95  LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
            LP+ LKE+GY+T  +GKWH G + E+  P N+GFD
Sbjct: 150 TLPEILKEVGYNTVHVGKWHQG-DIEQAYPHNQGFD 184


>gi|126337083|ref|XP_001362844.1| PREDICTED: arylsulfatase E [Monodelphis domestica]
          Length = 583

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 71/124 (57%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N + TPNID+LA+ G+ L +H    + CTPSRAA LTG+YP R G+ +  
Sbjct: 43  GIGDIGCYGNNTMRTPNIDSLAHEGVKLTQHLAAASVCTPSRAALLTGRYPIRSGMVSDN 102

Query: 82  GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G  V +       +P  E    + L++ GY+T LIGKWH+G N E  +     P N GFD
Sbjct: 103 GYRVLQWTAASGGLPSNETTFAKILQKEGYATGLIGKWHLGLNCESSIDHCHHPLNHGFD 162

Query: 131 NHVG 134
              G
Sbjct: 163 FFYG 166


>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
          Length = 551

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
            VP+ EKLLPQ+LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G         
Sbjct: 139 CVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 198

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            T  D+++ T  A+     R+ E  A    + Y T+ FT ++  +I +H   +PLFL + 
Sbjct: 199 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAATLITNHPPEKPLFLYLA 255

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 256 LQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 285


>gi|395527024|ref|XP_003765652.1| PREDICTED: arylsulfatase E [Sarcophilus harrisii]
          Length = 585

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G N I TPNID LA  G+   +H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDIGCYGNNTIRTPNIDRLAKEGVKFTQHIAAASVCTPSRAAFLTGRYPIRSGMTSYN 104

Query: 82  GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G  V +       +P  E    + L++ GY+T LIGKWH+G N E  +     P N GFD
Sbjct: 105 GLPVLQWTATSGGLPSNETTFAKILQKEGYTTGLIGKWHLGLNCESRIDHCHHPLNHGFD 164

Query: 131 NHVG 134
              G
Sbjct: 165 FFYG 168


>gi|115906036|ref|XP_797340.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 162

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
           GW+DV  HG + I TPNID LA  G+ L  +Y  P CTP+R+A +TG++P   G+    +
Sbjct: 44  GWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGRHPIHTGMQHDTI 103

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGK 112
           GA     + + EK + Q+LK LGYSTH +GK
Sbjct: 104 GAAEPWGLGLDEKTMAQHLKSLGYSTHAVGK 134


>gi|296470448|tpg|DAA12563.1| TPA: arylsulfatase E [Bos taurus]
          Length = 583

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G   I TPNID LA +G+ L +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFLTGRYPLRSGMVSSQ 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEE---LLPFNRGFD 130
           G        V+  +P +E    + LK  GY+T LIGKWH+G  C   +     P N GFD
Sbjct: 105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLGLSCASPDDHCHHPLNHGFD 164

Query: 131 NHVG 134
           +  G
Sbjct: 165 HFYG 168


>gi|147901243|ref|NP_001091457.1| arylsulfatase E precursor [Bos taurus]
 gi|146186636|gb|AAI40584.1| ARSE protein [Bos taurus]
 gi|152941128|gb|ABS45001.1| arylsulfatase E precursor [Bos taurus]
          Length = 583

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G   I TPNID LA +G+ L +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFLTGRYPLRSGMVSSQ 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEE---LLPFNRGFD 130
           G        V+  +P +E    + LK  GY+T LIGKWH+G  C   +     P N GFD
Sbjct: 105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLGLSCASPDDHCHHPLNHGFD 164

Query: 131 NHVG 134
           +  G
Sbjct: 165 HFYG 168


>gi|433607608|ref|YP_007039977.1| Sulfatase [Saccharothrix espanaensis DSM 44229]
 gi|407885461|emb|CCH33104.1| Sulfatase [Saccharothrix espanaensis DSM 44229]
          Length = 760

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 98/206 (47%), Gaps = 42/206 (20%)

Query: 23  GWNDVG-FHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G F GE  + TPN+DALA  G+ L  ++T P C+PSRAA LTG  P R G   P 
Sbjct: 48  GYADIGPFGGE--VATPNLDALAAGGLRLTNYHTTPLCSPSRAALLTGLNPHRAGFAFPA 105

Query: 82  GA-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-------GCNKEELLPFNRGF 129
            A       +  +P     L + L++ GY+T  +GKWH+               P  RGF
Sbjct: 106 NADPGYPAYSFQLPDDAPSLAESLRDAGYATFAVGKWHLTRDAASHDAADRSSWPVQRGF 165

Query: 130 DNHVGYWNGY--------LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQS 181
           D + G   G         L +++S ++ D+  G                 +LTD  TD++
Sbjct: 166 DRYFGSLEGLTNLHHPHRLVWDNSPYDGDYPDGY----------------FLTDDLTDRA 209

Query: 182 VHVI---KSHNHSRPLFLQITHAAVH 204
           V +I   ++++  +P FL   H A+H
Sbjct: 210 VRMIDTLRANDPDKPFFLYFAHHAMH 235


>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
          Length = 548

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 71/122 (58%), Gaps = 4/122 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D G+   +DI TPNID L  +GI     Y+   C+PSR++FL+G+Y +  G+   V 
Sbjct: 40  GYHDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRYAYTSGMQHGVI 98

Query: 83  AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNGYL 140
           +  A   + +    L  YLKEL Y+TH  GKWH+G CNK E  P  RGFD   G ++G  
Sbjct: 99  SDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNK-ECTPTYRGFDTFSGGYSGEG 157

Query: 141 TY 142
            Y
Sbjct: 158 KY 159


>gi|325106428|ref|YP_004276082.1| sulfatase [Pedobacter saltans DSM 12145]
 gi|324975276|gb|ADY54260.1| sulfatase [Pedobacter saltans DSM 12145]
          Length = 535

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/195 (32%), Positives = 102/195 (52%), Gaps = 25/195 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G  G +D+ TPN+D +A  G+ +   Y    C PSRA+ LTG Y  + G+   V 
Sbjct: 37  GYSDIGCFG-SDVQTPNLDEMASKGLKMANFYNASRCCPSRASLLTGLYAHQAGVGDMVN 95

Query: 83  A-------GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           A       G      VT   + + L++ GY+T + GKWH+G NKE   P  RGFD + G 
Sbjct: 96  ARPYPAYQGYLNKTSVT---IAEVLQKNGYNTIMGGKWHVGQNKEN-WPLQRGFDKYFGL 151

Query: 136 WNGYLTYNDSI-----HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSHN 189
            +G  +Y ++       +   A+G       E + P  ++ Y TD +TD ++  I ++ N
Sbjct: 152 IDGANSYFENRPYRPNQKLTIALG------NEEFTPG-ANYYSTDAYTDYALRFIEETKN 204

Query: 190 HSRPLFLQITHAAVH 204
           +++P FL + + A H
Sbjct: 205 NNKPFFLYLAYQAPH 219


>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
          Length = 544

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 75/250 (30%), Positives = 119/250 (47%), Gaps = 51/250 (20%)

Query: 2   DTPVGAGVAKAVPVTEKLLPQGWNDVGFH----GENDIPTPNIDALAYNGIVLNRHYTL- 56
           DTPV       + V    +  G+ND+  H     +  + T NIDALA +GI+  R Y   
Sbjct: 52  DTPVDDNRPNIILVLADDM--GYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAAN 109

Query: 57  PTCTPSRAAFLTGKYPFRYGID-TPVGA------------------------GVAKAVPV 91
            TC PSRA+ +TGKYP R+G + TP+ A                         V+   P 
Sbjct: 110 ATCAPSRASIMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPF 169

Query: 92  TEKLLP-------QYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYND 144
            E+ +P       + L++ GY T  IGKWH+G ++  + P ++GF + +G         D
Sbjct: 170 MEQGMPTEQITIAEVLRDAGYYTAHIGKWHLG-HEYGMDPMSQGFQDSLGLVGPLYLPED 228

Query: 145 --SIHETDFAVGLDAR-RNMERYAPQMS-------SKYLTDFFTDQSVHVIKSHNHSRPL 194
              +    F   +D     M +Y+   +        KY+TD++TD+++ VI+ +N +RP 
Sbjct: 229 HPDVVNAKFDTRIDKMIWGMGQYSANFNGGDLFAPDKYVTDYYTDEALKVIE-NNKNRPF 287

Query: 195 FLQITHAAVH 204
           FL ++H A+H
Sbjct: 288 FLYLSHWAIH 297


>gi|126337066|ref|XP_001381279.1| PREDICTED: steryl-sulfatase-like [Monodelphis domestica]
          Length = 813

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 67/124 (54%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D G +G   + TPNID +A  G+   +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 266 GIGDPGCYGNTTLRTPNIDRIAKGGVKFTQHLAASPLCTPSRAAFLTGRYPIRSGMASRS 325

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELL---PFNRGFD 130
             GV      +  +P  E    + LK  GYST LIGKWH+G  CN  +     P N GFD
Sbjct: 326 KVGVFLFSASSGGLPTNEITFAKLLKNQGYSTALIGKWHLGINCNSRDDFCHHPLNHGFD 385

Query: 131 NHVG 134
           +  G
Sbjct: 386 HFYG 389


>gi|392941987|ref|ZP_10307629.1| arylsulfatase A family protein [Frankia sp. QA3]
 gi|392285281|gb|EIV91305.1| arylsulfatase A family protein [Frankia sp. QA3]
          Length = 796

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 98/202 (48%), Gaps = 33/202 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
           G++D+G  G  +IPTP +D LA  G+ L  ++T+P C+P+RAA LTG  P R G      
Sbjct: 72  GYSDIGPFGA-EIPTPALDRLAERGVRLTNYHTMPLCSPARAALLTGLNPHRVGYAMVAN 130

Query: 82  --------GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-------GCNKEELLPFN 126
                   G  +A  VP     L Q L + GY+T+ +GKWH+         +     P  
Sbjct: 131 ADPGFPGYGMEIADDVPT----LAQLLHDAGYATYAVGKWHLTRDSASNAADDRRNWPLQ 186

Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQM-SSKYLTDFFTDQSVHVI 185
           +GFD + G   G LT     H+         R N      ++ +  Y TD  TDQ++ ++
Sbjct: 187 KGFDQYYGVLEG-LTSLFHPHQL-------VRDNSPLQVDELPAGYYYTDDITDQAISMV 238

Query: 186 ---KSHNHSRPLFLQITHAAVH 204
              ++H+  +P FL + H AVH
Sbjct: 239 TSLRAHDPEKPFFLYLAHNAVH 260


>gi|325107642|ref|YP_004268710.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324967910|gb|ADY58688.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 749

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 12/188 (6%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ D+G +G  ++ TP IDALA  G     +Y   P C+PSRA  LTG YP R G    
Sbjct: 39  QGYYDLGCYGATEVETPEIDALAAEGTRFTDYYAAAPICSPSRAGLLTGCYPRRVGNHIW 98

Query: 81  V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           V  A     +   E  L +   + GY+T  IGKWH+G + E  LP N+GFD++ G     
Sbjct: 99  VHRADSDTGIHPNELTLAELFHQNGYATACIGKWHLGFH-EPFLPQNQGFDHYFG----- 152

Query: 140 LTYNDSIHETDF---AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
           L +N    ET +     G+   RN +          LT  +TD+++  ++ H   +P FL
Sbjct: 153 LLHNLDPVETVYFEEQGGVPLLRNDQVVQRPADPAELTKQYTDEAISWMEQH-RDQPFFL 211

Query: 197 QITHAAVH 204
            + H  +H
Sbjct: 212 YLPHTMLH 219


>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 468

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 69/219 (31%), Positives = 99/219 (45%), Gaps = 27/219 (12%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG---- 76
           QG+ D+   G+N   TP +D LA +G  L   Y + P CTPSRA+ +TG+YP R G    
Sbjct: 42  QGFADLSCIGDNGCRTPRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDM 101

Query: 77  ----------IDTP----VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
                     + TP    V A       + E  L   LK+ GY + + GKW  G   +  
Sbjct: 102 IRNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWD-GGQLKRY 160

Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
           LP  RGFD + G+ N  + Y    HE     G+ +     +   +    YLTD F  +++
Sbjct: 161 LPLQRGFDQYYGFANTGVDY--FTHER---YGVPSMFRDNQPTEEDKGTYLTDLFEREAI 215

Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
             I   NH RP FL +   A H+ +  +  +  G  Q P
Sbjct: 216 RFI-DENHDRPFFLYLPFNAPHSASNLDRSI-RGFAQAP 252


>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
          Length = 476

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 92/186 (49%), Gaps = 4/186 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
           QG++DVG  G +DI TP++D LA  G+     Y     C+ SRAA LTG Y  R GI   
Sbjct: 40  QGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAVCSASRAALLTGVYSNRLGIHGA 99

Query: 81  VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
           +       +   E  +   LK LGY+T + GKWH+G +  E LP N+GFD ++G      
Sbjct: 100 LDHMSRYGLHPEEATIADILKPLGYATAMFGKWHLG-HYPEFLPTNQGFDEYLGIPYSND 158

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYL-TDFFTDQSVHVIKSHNHSRPLFLQIT 199
            + +     D+   L   +N +      + + + T  FT++S+  I+  N  RP FL + 
Sbjct: 159 MWPNHPQTKDYYPPLPLYQNDKVIDTIWNDQSMFTTLFTEKSIDFIE-RNKDRPFFLYLA 217

Query: 200 HAAVHT 205
           H   H 
Sbjct: 218 HPMPHV 223


>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
          Length = 431

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 87/164 (53%), Gaps = 26/164 (15%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH 147
            +P+ EKLLP+ L+E GY TH++GKWH+G  K+E LP +RGFD +     GYL  ++  +
Sbjct: 17  CLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDTYF----GYLLGSEDYY 72

Query: 148 ETDFAVGLDAR---------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
             D  V + A+         R+ E  A    + Y T+ FT++++ +I  H   +PLFL +
Sbjct: 73  SHDRCVLIKAKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIAHHKTEKPLFLYL 132

Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              +VH             L+VP  EE  + ++ I +  RR +A
Sbjct: 133 AFQSVHEP-----------LEVP--EEYMKPYSSIKDAKRRHYA 163


>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
           queenslandica]
          Length = 523

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 94/198 (47%), Gaps = 16/198 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
           GW D+G +G     TPN+D +A  G++L   Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 44  GWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAAMLTGRLPIRNGFYTTN 103

Query: 81  -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E L P+ L++ GY+T +IGKWH+G  +    P   GFD   
Sbjct: 104 AHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWHLG-QQTHYHPLKHGFDEFF 162

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVI-KS 187
           G  N +    D   + +  V  +A      Y       +     LT  +T +++  I K+
Sbjct: 163 GSTNCHFGPFDGKEQPNMPVYRNATMAGRYYQDFPINHKTGESNLTVEYTQEAIKFINKN 222

Query: 188 HNHSRPLFLQITHAAVHT 205
             + +P FL  T  A HT
Sbjct: 223 AANKKPFFLYWTPDATHT 240


>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 486

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 69/214 (32%), Positives = 95/214 (44%), Gaps = 39/214 (18%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
           GW+D+G +G +   TPNID  A   +     Y +  C+PSR+  +TGK+  R        
Sbjct: 37  GWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMTGKHAARLHFTIWAE 96

Query: 82  GAGVAKA-------------VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
           GA    A             +P +EK +  YLK  GY T LIGKWH+G    E  P   G
Sbjct: 97  GAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWHLG--DWEHYPEAHG 154

Query: 129 FDNHVG--YWNGYLTY-----NDSIHETDFAVGLDARRNMERYAPQMS----SKYLTDFF 177
           FD ++G   W    T+         H  +F           RY P +      +YLTD  
Sbjct: 155 FDINIGGTNWGAPQTFWWPYSGSGTHGPEF-----------RYIPHLEYGHPGEYLTDRL 203

Query: 178 TDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNA 211
           TD+++ VI  H   +P F+ + H AVHT     A
Sbjct: 204 TDEAIKVI-DHAGDQPFFVYLAHHAVHTPIEAKA 236


>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
          Length = 519

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
            VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G         
Sbjct: 107 CVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 166

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            T  D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +PLFL + 
Sbjct: 167 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALITNHPPEKPLFLYLA 223

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 224 LQSVHEP-----------LQVP--EEYLKPYNFIQDKNRHYYA 253


>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
 gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
          Length = 499

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 97/192 (50%), Gaps = 13/192 (6%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDT- 79
           QG+ DVG  G + + TPN+D +A  G++L   Y   P C+PSRA  +TG YP R  + T 
Sbjct: 55  QGYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMTGSYPRRVDMATG 114

Query: 80  ---PV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
              PV  A   K +   E  + + LK +GY+T + GKWH+G ++ E LP  +GFD   G 
Sbjct: 115 SNFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLG-DQPEFLPTRQGFDEFFGL 173

Query: 136 WNGY---LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
              +    T+    H     + L    N+    P  + +YLT   T++++  I+ H  + 
Sbjct: 174 PYSHDIAPTHKRQAHFKFPDLPLMENENVIELNP--NPEYLTRRITERAIDFIERHQDA- 230

Query: 193 PLFLQITHAAVH 204
           P FL + H   H
Sbjct: 231 PFFLYLPHPMPH 242


>gi|426256630|ref|XP_004021940.1| PREDICTED: arylsulfatase E [Ovis aries]
          Length = 583

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 71/124 (57%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G + + TPNID LA +G+ L +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 45  GIGDVGCYGNSTLRTPNIDRLAADGVRLTQHLAAAPVCTPSRAAFLTGRYPLRSGMVSSQ 104

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEE---LLPFNRGFD 130
           G        V+  +P +E    + LK  GY+T L+GKWH+G  C   +     P N GFD
Sbjct: 105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLVGKWHLGLSCASPDDHCHHPLNHGFD 164

Query: 131 NHVG 134
           +  G
Sbjct: 165 HFYG 168


>gi|187735071|ref|YP_001877183.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
 gi|187425123|gb|ACD04402.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
          Length = 542

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 94/188 (50%), Gaps = 11/188 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----D 78
           GW+D G +G ++IPTP +D LA  G++  R YT   C+PSRA+ +TG  P +  +    D
Sbjct: 46  GWSDPGCYG-SEIPTPALDTLARQGMLATRLYTASRCSPSRASIMTGCEPHKVDVGLLDD 104

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                G    +      LP+ LK+ GY T+L GKWH+G  +    P++RGFD   G   G
Sbjct: 105 DSGRPGYRGRLNPGIPTLPELLKKAGYRTYLSGKWHLGKVRGS-YPWDRGFDRSRGLLGG 163

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQ 197
              Y   + ++ F  G + +       P+    Y+TD  T  ++  I     SR P FL 
Sbjct: 164 AADYYRPMPDSPF--GENGKLLRPEDLPE--DFYMTDDITKTALAYIGDAAKSRQPFFLY 219

Query: 198 ITHAAVHT 205
           + + A HT
Sbjct: 220 VAYTAPHT 227


>gi|344297991|ref|XP_003420678.1| PREDICTED: steryl-sulfatase [Loxodonta africana]
          Length = 578

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 68/124 (54%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D G +G   + TPNID LA  G+ L +H    P CTPSRAAF+TG+YP R G+ +  
Sbjct: 33  GIGDPGCYGNKTLRTPNIDRLAQGGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASSS 92

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELL---PFNRGFD 130
             GV      +  +P +E    + LK  GYST LIGKWH+G  CN +      P + GFD
Sbjct: 93  RIGVYIVTASSGGLPTSEITFARLLKNQGYSTALIGKWHLGSNCNSKSDFCHHPLSHGFD 152

Query: 131 NHVG 134
              G
Sbjct: 153 YFYG 156


>gi|332662522|ref|YP_004445310.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
 gi|332331336|gb|AEE48437.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
          Length = 449

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 90/197 (45%), Gaps = 27/197 (13%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
            G+ D+  +G  D  TPN+D LA  GI  +N +   P C P+RAAF+TG+YP +    TP
Sbjct: 41  MGYGDLSCYGRKDYTTPNLDKLASQGIKFVNAYSAAPVCNPTRAAFMTGRYPAK----TP 96

Query: 81  VGA-------------GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
           +G              G+    P    L    +   GY T LIGKWH+G   +   P   
Sbjct: 97  IGLIEPLTQSKRDSTFGLTAEFPSIATL----MSASGYETALIGKWHLGFLPQH-SPVKN 151

Query: 128 GFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
           GFD   G    +    D I       G  A    E   P     YLT+ F+ ++V  IK 
Sbjct: 152 GFDYFFGI---HSAAADYISHKSGLPGNRAHDLYENDTPVYPEGYLTNLFSQKAVAYIK- 207

Query: 188 HNHSRPLFLQITHAAVH 204
             H++P FL IT+ AVH
Sbjct: 208 QKHNKPFFLTITYNAVH 224


>gi|229490602|ref|ZP_04384440.1| arylsulfatase [Rhodococcus erythropolis SK121]
 gi|229322422|gb|EEN88205.1| arylsulfatase [Rhodococcus erythropolis SK121]
          Length = 773

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 100/197 (50%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G  G ++I TP +D LA  GI +  ++T P C+PSRAA LTG  P R G      
Sbjct: 55  GYSDIGPFG-SEIETPTLDRLAAQGIRMTNYHTTPLCSPSRAALLTGLNPHRAGYGFVAN 113

Query: 83  A-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
           A     G+   +    + LP+ L+  GY+T+ +GKWH+  +         +  P  RGFD
Sbjct: 114 ADPGYPGLRLELADDVQTLPEILRGAGYATYAVGKWHLVRDANLAPGRSRDSWPTQRGFD 173

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK---S 187
            + G   G     +S +  +  +  ++  +++ Y       YLT+  TD++V  IK   +
Sbjct: 174 RYYGSLEGL----NSFYYPNQLISDNSVVDVDEYP---EGYYLTEDLTDKAVGYIKDLRA 226

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL   H A+H
Sbjct: 227 HDQDKPFFLYFAHVAMH 243


>gi|410988054|ref|XP_004000303.1| PREDICTED: steryl-sulfatase [Felis catus]
          Length = 578

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/120 (44%), Positives = 69/120 (57%), Gaps = 12/120 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G   + TPNID LA  G+ L +H    P CTPSRAAF+TG+YP R G+ +  
Sbjct: 33  GIGDLGCYGNKTLRTPNIDRLAEGGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASEF 92

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--C-NKEELL--PFNRGFD 130
             GV      +  +P +E    + LK  GYST LIGKWH+G  C NK +    P + GFD
Sbjct: 93  LVGVYLFSASSGGLPTSEITFAKLLKGQGYSTALIGKWHLGTNCHNKSDFCHHPLSHGFD 152


>gi|145391|gb|AAC32036.1| putative arylsulfatase [Escherichia coli]
          Length = 475

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/116 (44%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWHIG NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGLTT-LPQLLHDQGYVTQAIGKWHIGENKES-QPQNVGFDDFRGF 210


>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
 gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
          Length = 479

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 102/202 (50%), Gaps = 27/202 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID--- 78
           G+ D+G +G   I +PN+D +A  GI     Y   + CTPSRA  LTG+ P R G+    
Sbjct: 49  GYGDIGAYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQ 108

Query: 79  ----TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
                P   G    +P TE  + + LKE  Y T L+GKWH+G +     P + GFD + G
Sbjct: 109 IRVLFPTSTG---GLPTTEITIAKALKEKDYRTALVGKWHLG-HLPGFQPLDHGFDEYFG 164

Query: 135 --YWNGY-----LTYNDSI---HETDFAVGLDARRN-MERYAPQMSSKYLTDFFTDQSVH 183
             Y N +     L+Y  +I    + DF V L   R+ +ER A Q +   +T  +T ++V 
Sbjct: 165 IPYSNDHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNT---ITKRYTQEAVS 221

Query: 184 VIKSHNHSRPLFLQITHAAVHT 205
            IK  N ++P FL + H+  H 
Sbjct: 222 FIKK-NSNQPFFLYLAHSMPHV 242


>gi|198275209|ref|ZP_03207740.1| hypothetical protein BACPLE_01368 [Bacteroides plebeius DSM 17135]
 gi|198271792|gb|EDY96062.1| arylsulfatase [Bacteroides plebeius DSM 17135]
          Length = 509

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 96/204 (47%), Gaps = 25/204 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           GW DVG++G     TPNID LA  G++    Y   +  +PSR + +TGKYP R GI   +
Sbjct: 42  GWADVGYNGSRFYETPNIDRLASEGMIFTDGYAAASISSPSRVSLMTGKYPARTGITDWI 101

Query: 82  ------------------GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL- 122
                                +   +P+ E  + +  KE GY+T+ +GKWH  C ++ L 
Sbjct: 102 PGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHVGKWH--CAEDSLY 159

Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQM-SSKYLTDFFTDQS 181
            P  +GFD ++G W       + I  +    G         Y P     ++LTD   D+S
Sbjct: 160 YPQYQGFDVNIGGW--LKGSPNGIRRSQGGKGAYCSPYRNPYLPDGPEGEFLTDRLGDES 217

Query: 182 VHVIKSHNHSRPLFLQITHAAVHT 205
           + +IK+ +  +P FL +   AVHT
Sbjct: 218 IKLIKNSSADKPFFLYLAFYAVHT 241


>gi|422831041|ref|ZP_16879191.1| hypothetical protein ESNG_03696 [Escherichia coli B093]
 gi|371602932|gb|EHN91614.1| hypothetical protein ESNG_03696 [Escherichia coli B093]
          Length = 270

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 486

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/205 (29%), Positives = 93/205 (45%), Gaps = 30/205 (14%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ D+  +G  DI TP ID +A  G+  N  Y    C+P+RA+ +TG +  R GI   + 
Sbjct: 41  GYGDLACYGAKDIATPAIDRMATEGVKCNSFYVSAVCSPTRASLMTGSHSIRVGIGGVMF 100

Query: 83  AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
                 +   E  LP+ LK+ GY+T +IGKWH+G N++   P N GFD    YW G    
Sbjct: 101 PRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NEDMFQPMNHGFD----YWYGTPAS 155

Query: 143 NDS-----------------------IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
           N+                        I   + A     R N+    P   S++ T  +T 
Sbjct: 156 NNQFYYPTIKKYAADCVFREGYTRNGILTRETAACPLIRDNVVIEVPADQSQF-TQRYTR 214

Query: 180 QSVHVIKSHNHSRPLFLQITHAAVH 204
           +++  I + NH +P F+ + H   H
Sbjct: 215 ETIRFI-TENHEQPFFIYLAHNMPH 238


>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 469

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 109/233 (46%), Gaps = 38/233 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT-- 79
           G+ D+G+ G   IPTPNID LA  G+     Y T   C PSRA FLTG+Y  R+G +T  
Sbjct: 43  GYGDLGYTGSKHIPTPNIDRLANEGVECTYGYVTHQYCGPSRAGFLTGRYQQRFGFETNP 102

Query: 80  PVGA-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
           P         VP +E+L  + L+ +GY T ++GKWHIG +     P NR          G
Sbjct: 103 PYDRHNTIAGVPASERLFAERLQAVGYKTGIVGKWHIGSHSIH-HPNNR----------G 151

Query: 139 YLTYNDSIHETDFAVGLDARRNMER--YAPQMSS-------KYLTDFFTDQSVHVIKSHN 189
           +  +   +        +D R  M+     P M +        YLT   TD+++  I+  N
Sbjct: 152 FDFFFGFLGGGHDFFRVDTREPMDEGYLDPMMRNGSSVDVEGYLTTQLTDEAIGFIE-RN 210

Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              P FL +++ A           P   LQ P  EE+   F+H+   +RR+++
Sbjct: 211 EKDPFFLFLSYNA-----------PHAPLQAP--EESIAKFSHVEGKERRVYS 250


>gi|449134034|ref|ZP_21769542.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448887354|gb|EMB17735.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 728

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/185 (32%), Positives = 95/185 (51%), Gaps = 6/185 (3%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ D+G +G  ++ TP ID +A  GI    +Y   P C+PSRA  LTG YP R G    
Sbjct: 16  QGYYDLGCYGATEVKTPRIDEMAGGGIRFTDYYAAAPICSPSRAGLLTGCYPRRVGNHVW 75

Query: 81  V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           V  A     +   E  L +  K+ GY T  IGKWH+G + E  LP N+GFD++ G  +  
Sbjct: 76  VHRADSNTGIHSDELTLAELFKDNGYKTACIGKWHLGFH-EPFLPQNQGFDHYFGLLHN- 133

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           L   ++++  D   G+  +R+ +          LT  +T++++  I++ N   P  L + 
Sbjct: 134 LDPVETVYFEDVG-GVPLQRDRDVVKRPADPDELTKLYTNEAIDFIEA-NKEGPFLLYLP 191

Query: 200 HAAVH 204
           H  +H
Sbjct: 192 HTMLH 196


>gi|390368732|ref|XP_784356.2| PREDICTED: N-acetylgalactosamine-6-sulfatase-like
           [Strongylocentrotus purpuratus]
          Length = 482

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/195 (33%), Positives = 96/195 (49%), Gaps = 18/195 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +G     TPN+D +A  GI+L   Y   P  +PSRAA LTG+ P R G  T  
Sbjct: 7   GWGDLGIYGNPAKETPNLDQMAAEGILLPDFYAANPLGSPSRAALLTGRLPIRNGFYTTN 66

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           G          V   +P +E LLP+ LK  GY + ++GKWH+G +  + LP   GFD   
Sbjct: 67  GHAHNAWSQQIVKGGIPDSEILLPKLLKLSGYKSKIVGKWHLG-HLPQYLPLKHGFDEWF 125

Query: 134 GYWNGYLTY--NDSIHETDFAVGLDARRNMERYAPQMSSKY-LTDFFTDQSVHVI-KSHN 189
           G  N ++    N  ++     +G    R  E++  + + +  LT  +  + ++ I KS  
Sbjct: 126 GAPNCHIKSLPNIPVYRDSEMIG----RYFEQFIIEKNGESNLTQLYIKEGLNFIEKSAE 181

Query: 190 HSRPLFLQITHAAVH 204
             +P FL  T  A H
Sbjct: 182 AKQPFFLYWTPDATH 196


>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
          Length = 461

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 89/190 (46%), Gaps = 19/190 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNG-IVLNRHYTLPTCTPSRAAFLTGKYPFRYG----I 77
           G+ D GF G  DI TPNID LA  G I  + H      +PSR+  LTG+Y  RYG    +
Sbjct: 42  GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECNL 101

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
           D P        +P  E+LLP  LK  GY T  IGKWH+G    +  P  +GFD   G   
Sbjct: 102 DKP-----GDGIPDDEELLPALLKRYGYRTGCIGKWHLGSEPSQ-RPNAKGFDTFYGLLA 155

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--NHSRPLF 195
           G+ +Y    ++ + +   D   N+++Y           +FTD+     +       +P  
Sbjct: 156 GHRSY---FYDPETS---DKDGNLQQYQYNGQKLSFDGYFTDELASKARQFVAESEQPFM 209

Query: 196 LQITHAAVHT 205
           L ++  A H+
Sbjct: 210 LYMSFTAPHS 219


>gi|397733173|ref|ZP_10499895.1| sulfatase family protein [Rhodococcus sp. JVH1]
 gi|396930984|gb|EJI98171.1| sulfatase family protein [Rhodococcus sp. JVH1]
          Length = 790

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+G  G ++I TPN++ LA +G  L+ ++T   C+P+RAA LTG  P R G  +   
Sbjct: 65  GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 123

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
             P   G+   +      L + L+  GY+TH +GKWH+  +         +  P  RGFD
Sbjct: 124 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 183

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
           ++ G   G     +S +  +  +  ++  ++E Y    S  Y+TD  TD++V  IKS   
Sbjct: 184 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAVSRIKSLRA 236

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL  +H A+H
Sbjct: 237 HDADKPFFLYFSHIAMH 253


>gi|111020297|ref|YP_703269.1| arylsulfatase [Rhodococcus jostii RHA1]
 gi|110819827|gb|ABG95111.1| arylsulfatase [Rhodococcus jostii RHA1]
          Length = 790

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+G  G ++I TPN++ LA +G  L+ ++T   C+P+RAA LTG  P R G  +   
Sbjct: 65  GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 123

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
             P   G+   +      L + L+  GY+TH +GKWH+  +         +  P  RGFD
Sbjct: 124 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 183

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
           ++ G   G     +S +  +  +  ++  ++E Y    S  Y+TD  TD++V  IKS   
Sbjct: 184 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAVSRIKSLRA 236

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL  +H A+H
Sbjct: 237 HDADKPFFLYFSHIAMH 253


>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
          Length = 431

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
            VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G         
Sbjct: 19  CVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 78

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            T  D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +PLFL + 
Sbjct: 79  CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALITNHPPEKPLFLYLA 135

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 136 LQSVHEP-----------LQVP--EEYLKPYNFIQDKNRHYYA 165


>gi|395527008|ref|XP_003765645.1| PREDICTED: steryl-sulfatase [Sarcophilus harrisii]
          Length = 585

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 67/124 (54%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D G +G   + TPNID +A  G+   +H    P CTPSRAAFLTG+YP R G+ +  
Sbjct: 38  GIGDPGCYGNTTLRTPNIDRIAKGGVKFTQHLAASPLCTPSRAAFLTGRYPVRSGMASRS 97

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELL---PFNRGFD 130
             GV      +  +P  E    + LK  GYST LIGKWH+G  CN  +     P N GFD
Sbjct: 98  KVGVFLFSASSGGLPANEITFAKLLKNQGYSTALIGKWHLGINCNSRDDFCHHPLNHGFD 157

Query: 131 NHVG 134
           +  G
Sbjct: 158 HFYG 161


>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
           niloticus]
          Length = 525

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 42/212 (19%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G+    TPN+DA+A  G++L   YT  P C+PSRAA LTG+ P R G  T  
Sbjct: 42  GWGDLGVFGQPSKETPNLDAMAAEGMLLPNFYTANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +   E LLPQ LK  GY + ++GKWH+G ++ + LP   GFD   
Sbjct: 102 AHARNAYTPQEIVGGISKDEILLPQLLKTKGYVSKIVGKWHLG-HRPQYLPLKNGFDEWF 160

Query: 134 GYWNGYL-TYNDSIHETDFAVGLDARRNMERY-APQMSSKYLTDFFTDQSV--------- 182
           G  N +   YND            ++ N+  Y   +M  ++  DF  D++          
Sbjct: 161 GSPNCHFGPYNDQ-----------SKPNIPVYNNSEMLGRFYEDFKIDRNTGESNLTQIY 209

Query: 183 ------HVIKSHNHSRPLFL----QITHAAVH 204
                  +++     +P FL      THA V+
Sbjct: 210 LMEGLDFILRQTKAQQPFFLYWAVDATHAPVY 241


>gi|196229912|ref|ZP_03128776.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196226238|gb|EDY20744.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 588

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/222 (31%), Positives = 103/222 (46%), Gaps = 34/222 (15%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           Q W D+  +G  ++ TPNID+LA  G +L+  +  P C+P+RA FLTG+Y  R G+    
Sbjct: 37  QAWGDLSINGNTNLSTPNIDSLATTGALLDHFFVCPVCSPTRAEFLTGRYHLRGGVHG-- 94

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
            +   + + + E+ + +  K  GY+T   GKWH G  +    P  RGFD + G+ +G+  
Sbjct: 95  VSSGGERLNLDERTIAEAFKAAGYATGAFGKWHNGM-QYPYHPNARGFDEYYGFCSGHWG 153

Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
            Y D+  E +  +               S  +L D FT  ++  I+  N  RP F  +  
Sbjct: 154 DYFDAPIEHNGQI-------------VQSHGFLIDDFTQHAMDFIE-QNKDRPFFCYVPF 199

Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
              HT            LQVP     DR F   SN D +L A
Sbjct: 200 NTPHTP-----------LQVP-----DRWFDKFSNMDLKLRA 225


>gi|147906969|ref|NP_001086084.1| arylsulfatase D precursor [Xenopus laevis]
 gi|49257838|gb|AAH74170.1| MGC81982 protein [Xenopus laevis]
          Length = 586

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/117 (43%), Positives = 67/117 (57%), Gaps = 7/117 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G + I TPNID LA  G+ L +H +  P CTPSRAAF+TG+YP R G++   
Sbjct: 40  GIGDIGCYGNDTIRTPNIDRLAKEGLKLKQHISAAPLCTPSRAAFVTGRYPIRSGMELGS 99

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
           G      AG +  +P  E      L++ GYST LIGKWH+G N      F    +NH
Sbjct: 100 GGRIIFWAGSSAGLPPNETTFATILQQQGYSTGLIGKWHLGVNCASRNDFCHHPNNH 156


>gi|336429765|ref|ZP_08609725.1| hypothetical protein HMPREF0994_05731 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336002095|gb|EGN32220.1| hypothetical protein HMPREF0994_05731 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 472

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 92/199 (46%), Gaps = 31/199 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---- 77
           GW D+G  G     TPNID +   G+  +  Y   P C+PSRA+FL+G+YP R G+    
Sbjct: 15  GWRDLGCSGSTFYETPNIDQMCREGMRFDCAYAACPVCSPSRASFLSGQYPARIGVTDWI 74

Query: 78  ----------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
                        + A   K +P     + + L+  GY T  +GKWH+G      LP N 
Sbjct: 75  DESGTFHPLKGKLIDAPYLKHMPENTITVAERLRNAGYQTWHVGKWHLGGGN--YLPENF 132

Query: 128 GFDNHVG--YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
           GFD ++G   W G+ +Y           G  +  ++         +YLTD  TD+++ +I
Sbjct: 133 GFDVNIGGCEW-GHPSY-----------GYFSPYHIPTLEDGPEGEYLTDRLTDEAIDLI 180

Query: 186 KSHNHSRPLFLQITHAAVH 204
           +     +P FL   H AVH
Sbjct: 181 RKAPDDKPFFLNFCHYAVH 199


>gi|449275706|gb|EMC84474.1| Steryl-sulfatase, partial [Columba livia]
          Length = 552

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 67/124 (54%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G   + TPNID LA  G+ L +H    P CTPSRAAFLTG+YP R G+    
Sbjct: 14  GIGDLGCYGNRTLRTPNIDRLAEEGVTLTQHIAASPLCTPSRAAFLTGRYPIRSGMAAFS 73

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
             GV      +  +P  E    + LK+ GY+T LIGKWH+G N E        P + GFD
Sbjct: 74  RVGVFLFSASSGGLPSEEITFTKLLKQRGYATALIGKWHLGMNCESSNDFCHHPLSHGFD 133

Query: 131 NHVG 134
              G
Sbjct: 134 YFYG 137


>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 460

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 71/232 (30%), Positives = 112/232 (48%), Gaps = 22/232 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
           QG NDVG +G ++IPTP+ID LA  G++  ++Y+    CTPSR   LTG+ P R   D  
Sbjct: 38  QGINDVGCYG-SEIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNPTR-SQDQL 95

Query: 81  VGAGV-------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
           +GA +        + +   E  +   L++ GY T L+GKWH+G   E  LP   GFD   
Sbjct: 96  LGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTESFLPTAHGFDLFR 155

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-R 192
           G+  G + Y        +    D   N    +    + Y TD  T+++ H +K    + +
Sbjct: 156 GHTGGCIDY----FTMTYGNIPDWYHNQRHVS---ENGYATDLITEEAEHFLKDQQTTDK 208

Query: 193 PLFLQITHAAVHTGTAGN--AKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           P FL +++ A H G   +   + P  ++Q     ++ +    I +  RR FA
Sbjct: 209 PFFLFLSYNAPHFGKGWSPGDQSPVNIMQA--RGDDLKRVGTIKDKVRREFA 258


>gi|229587773|ref|YP_002869892.1| arylsulfatase [Pseudomonas fluorescens SBW25]
 gi|229359639|emb|CAY46482.1| arylsulfatase (ec 3.1.6.1) (aryl-sulfate sulphohydrolase)
           [Pseudomonas fluorescens SBW25]
          Length = 536

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 103/201 (51%), Gaps = 24/201 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G++D+G  G  +I TP++DALA NG+ L   +T PTC+P+R+  LTG      GI T   
Sbjct: 16  GFSDLGAFG-GEISTPHLDALALNGLRLTDFHTAPTCSPTRSMLLTGTDHHIAGIGTMAE 74

Query: 83  AGVAKAVP-------VTEKL--LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD--- 130
           A   + +        + +K+  LP+ L+E GY T + GKWH+G    EL P  RGF+   
Sbjct: 75  ALTPELIGKPGYEGYLNDKVVALPELLREAGYQTLMSGKWHLGLTA-ELAPHARGFERSF 133

Query: 131 -------NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH 183
                  NH G+   Y  +   + ++  A+ ++  R +E+        Y +D F D+ +H
Sbjct: 134 SLLPGAANHYGFEPTYDEHTPGLLKSTPALYIEDDRFVEQLPKDF---YSSDAFGDKLLH 190

Query: 184 VIKSHNHSRPLFLQITHAAVH 204
            +K  + +RP F  +  +A H
Sbjct: 191 YLKERDQARPFFAYLPFSAPH 211


>gi|348549768|ref|XP_003460705.1| PREDICTED: arylsulfatase E-like, partial [Cavia porcellus]
          Length = 613

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 78/151 (51%), Gaps = 12/151 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G   + TPNID LA +G+ L  H    + CTPSRAAFLTG+YP R G+ +  
Sbjct: 73  GIGDLGCYGNGTLRTPNIDRLAEHGVKLTHHIAAASVCTPSRAAFLTGRYPIRSGMVSYN 132

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
           G       GV   +P +E    + LK+ GY+T LIGKWH+G N E        P + GFD
Sbjct: 133 GYRVLQWTGVPGGLPASEVTFAKLLKDSGYTTGLIGKWHLGLNCETSSDHCHHPLSHGFD 192

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNM 161
           +  G     +        ++  VGL  R  +
Sbjct: 193 HFYGMPFSMMADCQQWALSERRVGLQNRLRL 223


>gi|149638294|ref|XP_001514413.1| PREDICTED: arylsulfatase D-like [Ornithorhynchus anatinus]
          Length = 576

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  DVG +G + + TPNID LA  G+ L +H    P CTPSRAAFLTG++ FR G++   
Sbjct: 32  GIGDVGCYGNDTLRTPNIDRLAKEGVKLTQHLAAAPLCTPSRAAFLTGRHAFRSGMEASN 91

Query: 82  G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
           G       G +  +P+ E    + L++ GY+T LIGKWH G N E        P N GFD
Sbjct: 92  GYRALQWNGGSGGLPINETTFAKILQQQGYATGLIGKWHQGVNCESRNDSCHHPLNHGFD 151

Query: 131 NHVG 134
              G
Sbjct: 152 FFYG 155


>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
          Length = 418

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)

Query: 88  AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
            VP+ EKLLPQ LKE GY+TH++GKWH+G  ++E LP  RGFD + GY  G         
Sbjct: 8   CVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSEDYYSHER 67

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            T  D+++ T  A+     R+ E  A    + Y T+ FT+++  +I +H   +PLFL + 
Sbjct: 68  CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALIANHPPEKPLFLYLA 124

Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
             +VH             LQVP  EE  + +  I + +R  +A
Sbjct: 125 LQSVHEP-----------LQVP--EEYLKPYKFIQDKNRHHYA 154


>gi|226362295|ref|YP_002780073.1| arylsulfatase [Rhodococcus opacus B4]
 gi|226240780|dbj|BAH51128.1| arylsulfatase [Rhodococcus opacus B4]
          Length = 787

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+G  G ++I TPN++ LA +G  L+ ++T   C+P+RAA LTG  P R G  +   
Sbjct: 62  GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 120

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
             P   G+   +      L + L+  GY+TH +GKWH+  +         +  P  RGFD
Sbjct: 121 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 180

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
           ++ G   G     +S +  +  +  ++  ++E Y    S  Y+TD  TD++V  IKS   
Sbjct: 181 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAVARIKSLRA 233

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL  +H A+H
Sbjct: 234 HDADKPFFLYFSHIAMH 250


>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 75/277 (27%), Positives = 112/277 (40%), Gaps = 74/277 (26%)

Query: 7   AGVAKAVPVTEK-------LLPQGWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTL 56
           AG+    PV +K       +   GW +VG+H      ++ TPNID L   G+ L++HY  
Sbjct: 11  AGLVAGQPVRQKPHIVLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELDQHYAY 70

Query: 57  PTCTPSRAAFLTGKYPF----------RYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYS 106
             C+PSR+  ++G+ P            Y  + PV      A+P     + + +KE GY+
Sbjct: 71  KFCSPSRSCLMSGRLPIHVNDLNLAPTNYNPNDPVSG--FSAIPRNMTGIAEKMKEAGYA 128

Query: 107 THLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP 166
           TH +GKW  G    +  P  RGFD   GY++         H+ D+          E   P
Sbjct: 129 THQVGKWDAGMATPDHTPKGRGFDTSFGYYH---------HDNDYYT--------EVVGP 171

Query: 167 QMS----------------------SKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
           Q S                       KY    F ++ + V+  H+ + PLFL       H
Sbjct: 172 QCSGSPIVDLWDTDHPAHGINGTGPDKYEEGLFKERLMDVVSKHDPNTPLFLYYAPHIAH 231

Query: 205 TGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
           T            LQVPD   N   F+ I + DR+ +
Sbjct: 232 T-----------PLQVPDDYLN--KFSFIDDSDRKYY 255


>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 461

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 91/189 (48%), Gaps = 16/189 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+   G  D+ TP+ID+L   G+     Y   + C+PSRAA LTG+YP R G+   +
Sbjct: 47  GYGDLSCFGSTDLKTPHIDSLIGAGMRFTNFYANSSVCSPSRAALLTGRYPERVGVPGVI 106

Query: 82  GAGVAKA---VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
              V  +   +  +  LLP YL++ GY +  IGKWH+G      LP  RGF    G   G
Sbjct: 107 RDEVQDSWGYLASSATLLPTYLRKQGYHSANIGKWHLGLESPN-LPNERGFQEFYGLLEG 165

Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSK--YLTDFFTDQSVHVIKSHNHSR-PLF 195
            +         D+ V L   +N  R+  Q+     + TD FTD +V  +      + P F
Sbjct: 166 MM--------DDYVVKLRHGQNFLRHNGQVIDPPGHATDVFTDAAVRYLNDRKAKKDPFF 217

Query: 196 LQITHAAVH 204
           L + + A H
Sbjct: 218 LYLAYTAPH 226


>gi|424860530|ref|ZP_18284476.1| arylsulfatase [Rhodococcus opacus PD630]
 gi|356659002|gb|EHI39366.1| arylsulfatase [Rhodococcus opacus PD630]
          Length = 790

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
           G++D+G  G ++I TPN++ LA +G  L+ ++T   C+P+RAA LTG  P R G  +   
Sbjct: 65  GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 123

Query: 80  --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
             P   G+   +      L + L+  GY+TH +GKWH+  +         +  P  RGFD
Sbjct: 124 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 183

Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
           ++ G   G     +S +  +  +  ++  ++E Y    S  Y+TD  TD+++  IKS   
Sbjct: 184 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAISRIKSLRA 236

Query: 188 HNHSRPLFLQITHAAVH 204
           H+  +P FL  +H A+H
Sbjct: 237 HDADKPFFLYFSHIAMH 253


>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 82/196 (41%), Gaps = 38/196 (19%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFRYGIDTPV 81
           GW DVGF+G  +  TPN+D LA  G    R YT    C PSRAA +TG+Y    G+    
Sbjct: 51  GWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAPSRAALMTGRYTIHDGVSRN- 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG----CNKEELLPFNRGFDNHVGYWN 137
                  +P  E  L +  K  GY T L GKWH G     +K  + P ++GFD   G+  
Sbjct: 110 ----NDDLPAREVTLAEAFKTHGYDTALFGKWHHGQPRDGSKTYVHPMDQGFDEFFGF-- 163

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVIKSHN 189
                             DA+   E+Y  Q+         S Y  D F D ++  +K H 
Sbjct: 164 -----------------TDAKHAWEKYPEQLWHGRELKPVSGYSDDMFADHAIDFLKRHK 206

Query: 190 HS-RPLFLQITHAAVH 204
               P FL +     H
Sbjct: 207 EKPTPFFLYVPFINTH 222


>gi|449138001|ref|ZP_21773306.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448883380|gb|EMB13908.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 470

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 103/200 (51%), Gaps = 19/200 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ND+G +G  +I TPN+D LA  G      Y+    C+PSRAA LTG YP R G+   
Sbjct: 38  QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 97

Query: 81  VGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
           V    +   +   E  +  +LK  GY+T  +GKWH+G +K E LP + GFD++ G  Y N
Sbjct: 98  VLFPQSNYGLHPEEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 156

Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
                     G ++ +D   +   AV L      ++ E     +  + +T  +TD+++  
Sbjct: 157 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDRAIEF 216

Query: 185 IKSHNHSRPLFLQITHAAVH 204
           +++ N  +P FL + H+  H
Sbjct: 217 VEA-NQDKPFFLYLPHSMPH 235


>gi|326798263|ref|YP_004316082.1| sulfatase [Sphingobacterium sp. 21]
 gi|326549027|gb|ADZ77412.1| sulfatase [Sphingobacterium sp. 21]
          Length = 559

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 61/198 (30%), Positives = 94/198 (47%), Gaps = 24/198 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++D+G +G  +I TP++D+LA +G+   + Y    C PSRA+ +TG YP +  I     
Sbjct: 49  GYSDLGCYG-GEIQTPHLDSLAASGLRFTQFYNAARCCPSRASLMTGLYPHQAAIGHMTN 107

Query: 78  --------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
                   D  V     +  P T  L  + LK  GY+T + GKWH+G  ++E  P  RGF
Sbjct: 108 PSEHFTQHDYHVPGYRGELSPQTHTL-AEVLKTAGYTTLMTGKWHLGMERKEQWPLQRGF 166

Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---K 186
           D++ G  +G   Y          +  D  +  +       + Y TD FTD ++  I   K
Sbjct: 167 DHYYGILDGASNYFQPAQPRGITLDNDTLKVDD------PNFYTTDAFTDHAIQFIDQSK 220

Query: 187 SHNHSRPLFLQITHAAVH 204
             +  RP FL + + A H
Sbjct: 221 QQDGERPFFLYLAYTAPH 238


>gi|443709810|gb|ELU04315.1| hypothetical protein CAPTEDRAFT_117141 [Capitella teleta]
          Length = 562

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 72/118 (61%), Gaps = 6/118 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G  G + + TP++D++  NG+ L+      + CTPSRAA +T +Y  R G+ + +
Sbjct: 34  GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRTGMSSVI 93

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
            + ++ + +P +E  LPQ L+E GY+T LIGKWH+G N++ L     P  RGFD   G
Sbjct: 94  TSLMSPQGLPTSEHTLPQMLQEKGYATALIGKWHLGWNRQLLDQYYSPLKRGFDYFFG 151


>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
 gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
          Length = 477

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 95/198 (47%), Gaps = 37/198 (18%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DV  +G     TPNID LA +GI     +T  P C P+RA+ ++G+Y  R G+ T V
Sbjct: 34  GYTDVACYGSKYYETPNIDKLAKDGIKFTDGHTCGPNCQPTRASLMSGQYGPRTGVYT-V 92

Query: 82  GA---------------GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
           G+                V K +P+ +  L Q LK+ GY+T + GKWH+G +KE   P  
Sbjct: 93  GSIDRFAWQTRSLHPVENVTK-LPLDKITLAQSLKKAGYATGMFGKWHLGEDKEH-HPAQ 150

Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
           RGFD  +                   V  D   N +   P+   +YL DF TD+++  IK
Sbjct: 151 RGFDEAL---------------VSMGVHFDFVTNPKVDYPK--DEYLADFLTDKALDFIK 193

Query: 187 SHNHSRPLFLQITHAAVH 204
            H    P FL + H AVH
Sbjct: 194 RHK-DEPFFLYLPHYAVH 210


>gi|443716273|gb|ELU07881.1| hypothetical protein CAPTEDRAFT_43570, partial [Capitella teleta]
          Length = 492

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 72/118 (61%), Gaps = 6/118 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G  G + + TP++D++  NG+ L+      + CTPSRAA +T +Y  R G+ + +
Sbjct: 14  GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRTGMSSVI 73

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
            + ++ + +P +E  LPQ L+E GY+T LIGKWH+G N++ L     P  RGFD   G
Sbjct: 74  TSLMSPQGLPTSEHTLPQMLQEKGYATALIGKWHLGWNRQLLDQYYSPLKRGFDYFFG 131


>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
           [Ciona intestinalis]
          Length = 513

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 26/203 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G +G+    TPN+D +A  G +    Y+  P C+PSRAA LTG+ P R G  T  
Sbjct: 32  GWGDLGINGQPSKETPNLDNMAKEGTLFTDFYSANPLCSPSRAALLTGRLPIRNGFYTSN 91

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF---- 129
             G        +   +P  E L+ + L   GY+  LIGKWH+G  +E+ LP   GF    
Sbjct: 92  YHGHNGYTPQHIVGGIPDHEILVSELLSSAGYTNKLIGKWHLG-QQEQYLPLKHGFHEWF 150

Query: 130 ---DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYL---TDFFTDQSVH 183
              + H G ++   T N  ++     VG    R  E +A + S KYL   T ++  +++ 
Sbjct: 151 GSPNCHFGPYDDKTTPNIPVYNNTEMVG----RYYEEFAIE-SHKYLSNMTQYYIQEALD 205

Query: 184 VI-KSHNHSRPLFLQITHAAVHT 205
            I +   + +P FL     A H+
Sbjct: 206 FIERMERNEKPFFLYWAPDATHS 228


>gi|419117362|ref|ZP_13662369.1| sulfatase family protein [Escherichia coli DEC5A]
 gi|419134020|ref|ZP_13678843.1| sulfatase family protein [Escherichia coli DEC5D]
 gi|377957343|gb|EHV20878.1| sulfatase family protein [Escherichia coli DEC5A]
 gi|377970376|gb|EHV33738.1| sulfatase family protein [Escherichia coli DEC5D]
          Length = 531

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/168 (36%), Positives = 86/168 (51%), Gaps = 15/168 (8%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+    
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF---- 190

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDF-FTDQSVHVIK 186
               +S+ +  +    DA  N E       S+Y+    F+   VH ++
Sbjct: 191 ----NSVSDM-YTEWRDAHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 233


>gi|291285214|ref|YP_003502032.1| Arylsulfatase precursor [Escherichia coli O55:H7 str. CB9615]
 gi|387509248|ref|YP_006161504.1| arylsulfatase [Escherichia coli O55:H7 str. RM12579]
 gi|419128593|ref|ZP_13673461.1| sulfatase family protein [Escherichia coli DEC5C]
 gi|419139161|ref|ZP_13683950.1| sulfatase family protein [Escherichia coli DEC5E]
 gi|209753344|gb|ACI74979.1| HemY protein [Escherichia coli]
 gi|290765087|gb|ADD59048.1| Arylsulfatase precursor [Escherichia coli O55:H7 str. CB9615]
 gi|374361242|gb|AEZ42949.1| arylsulfatase [Escherichia coli O55:H7 str. RM12579]
 gi|377969336|gb|EHV32714.1| sulfatase family protein [Escherichia coli DEC5C]
 gi|377980212|gb|EHV43478.1| sulfatase family protein [Escherichia coli DEC5E]
          Length = 551

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/168 (36%), Positives = 86/168 (51%), Gaps = 15/168 (8%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+    
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF---- 210

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDF-FTDQSVHVIK 186
               +S+ +  +    DA  N E       S+Y+    F+   VH ++
Sbjct: 211 ----NSVSDM-YTEWRDAHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253


>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 460

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 92/201 (45%), Gaps = 25/201 (12%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFR---YGID 78
           G+ D+G +G   I TP++D +A  G+     Y     CTPSRAA LTG+YP R   YG  
Sbjct: 41  GYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSRAALLTGRYPVRSGMYGKR 100

Query: 79  TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
             +       +P  E  LP+ LK  GY+T  +GKWH+G + E   P ++GFD   G    
Sbjct: 101 RVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLGIH-EGSRPLDQGFDQSFG---- 155

Query: 139 YLTY-NDSIHETDFAVGLDAR-------------RNMERYAPQMSSKYLTDFFTDQSVHV 184
            L Y ND     D   G                 RN E         +LT  +T+++V  
Sbjct: 156 -LPYSNDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQVHLTGHYTEEAVKF 214

Query: 185 IKSHNHSRPLFLQITHAAVHT 205
           I+    S+P FL + H+  H 
Sbjct: 215 IQ-QKKSQPFFLYMAHSFPHV 234


>gi|440713850|ref|ZP_20894444.1| arylsulfatase [Rhodopirellula baltica SWK14]
 gi|436441359|gb|ELP34602.1| arylsulfatase [Rhodopirellula baltica SWK14]
          Length = 1571

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 97/191 (50%), Gaps = 17/191 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++D+G +G  +I TPNIDALA +G+ L + Y    C PSRA+ +TG YP + GI     
Sbjct: 25  GYSDLGCYG-GEISTPNIDALAADGVKLTQVYNSARCCPSRASLMTGLYPTQAGIGDFTT 83

Query: 78  ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              +   G G    +      + + LK  GY  + +GKWH+     +  P  RGFD+  G
Sbjct: 84  REPNRTRGQGYLGRLRDDCVTMAEVLKPEGYGCYYVGKWHM---HPKTGPIKRGFDDFYG 140

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRP 193
           Y N    ++   ++ D+ + L   R ++   P     Y TD F D ++  I+   + ++P
Sbjct: 141 YTN---DHSHDQYDADYYIRLPENR-VKEIDPPADQFYATDVFNDYAIEFIRQGQSTNKP 196

Query: 194 LFLQITHAAVH 204
            FL + H++ H
Sbjct: 197 WFLFLGHSSPH 207



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 53/92 (57%), Gaps = 6/92 (6%)

Query: 25  NDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPVGA 83
           +D+  +G   +PTPN++ LA  G+V +  Y T+ +C+PSR + +TG+YP   G       
Sbjct: 721 DDLSVYGNAFVPTPNLERLASKGLVFDNAYLTISSCSPSRCSMITGRYPHNTG-----AP 775

Query: 84  GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            +   +P T++   Q L+E GY T + GK H+
Sbjct: 776 ELHTTLPETQRTFVQSLREAGYHTVISGKNHM 807


>gi|114799529|ref|YP_761144.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114739703|gb|ABI77828.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 459

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 63/113 (55%), Gaps = 2/113 (1%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+  +G   I TPNID +   GI L   Y     C+PSRAA LTG+YP R G+   +
Sbjct: 50  GWGDISLNGAALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAALLTGRYPIRSGMQHVI 109

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
                  +P  E  + + LK  GY T ++GKWH+G ++EE  P N+GFD   G
Sbjct: 110 FPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWPTNQGFDWFYG 161


>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 491

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 71/238 (29%), Positives = 112/238 (47%), Gaps = 36/238 (15%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGID--- 78
           G++D+G+ G  +I +P ID LA NG++  N + T P C PSRA  +TG++  R+G++   
Sbjct: 37  GYSDLGYTGSTEIESPVIDKLANNGVIFANGYVTHPYCGPSRAGLITGRHQARFGMEINA 96

Query: 79  --TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
             +P    +   +PV E    + L+  GY T +IGKWH+G    +  P NRGFD   G+ 
Sbjct: 97  TYSPFDQHM--GLPVDEPTFAKRLQPAGYRTGIIGKWHLGA-APQFHPNNRGFDYFYGFL 153

Query: 137 NGYLTYNDSIHETDFAVGLDARR-----NMERYAPQMSSK-------YLTDFFTDQSVHV 184
           +G   Y      T   + L   +     N     P + +K       YLT   +  +   
Sbjct: 154 SGGHDYFPESVNTHLELVLPNGKPNYGANEGTLLPLLRNKNAAEFDDYLTTALSKDAARF 213

Query: 185 IKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
           + S    +P  L + + A HT            LQ P  +E    ++HI +P RR++A
Sbjct: 214 VTS--SEQPFCLYLAYNAPHTP-----------LQAP--KETIAKYSHIKDPKRRIYA 256


>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
 gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
          Length = 454

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/187 (33%), Positives = 90/187 (48%), Gaps = 11/187 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRY--GIDT 79
           GW D+G +G +D  TPN+D LA  G+   + Y     C+ +R A +TG+Y +R   G++ 
Sbjct: 23  GWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATRIALITGRYQYRLPAGLEE 82

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
           P+ A     +P     LP  L+E GY T LIGKWH+G       P   G+D   G   G 
Sbjct: 83  PI-ARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLG-KPPTYGPLKSGYDRFFGNIGGA 140

Query: 140 LTYNDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQ-SVHVIKSHNHSRPLFLQ 197
           L Y    H+    VG    R++ E   P   + Y T+   D+ S +V    +  +P FL 
Sbjct: 141 LDY--FTHKP--GVGAQVPRDLWEGDVPVERTGYYTNILGDEASAYVRAREDEKKPFFLS 196

Query: 198 ITHAAVH 204
           +   A H
Sbjct: 197 LHFTAPH 203


>gi|82779021|ref|YP_405370.1| arylsulfatase [Shigella dysenteriae Sd197]
 gi|81243169|gb|ABB63879.1| arylsulfatase [Shigella dysenteriae Sd197]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 494

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 73/229 (31%), Positives = 104/229 (45%), Gaps = 18/229 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVGF G  +I TP +D LA  G++ N  Y T   C PSRA  +TG+Y  R+G++   
Sbjct: 34  GYADVGFTGSTEIQTPVLDRLAAGGVIFNNGYVTHAYCGPSRAGLITGRYQARFGVEVNF 93

Query: 82  GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
                     +P  EK     LK+ GY T +IGKWH+G       P NRGFD   G+  G
Sbjct: 94  PYAPFDPHSGLPTDEKTFATRLKQSGYRTAMIGKWHLGA-AYPYHPNNRGFDYFYGFLGG 152

Query: 139 YLTYNDSIHETDFAVGLDARR-----NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
              Y      T   + L+  +     N   Y P M +    +F  D+ +    S + +R 
Sbjct: 153 AHDYMPENTSTTVPLTLENGKVNHMANAGSYLPLMRNNVNAEF--DEYLTTALSRDAAR- 209

Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
            F++ T        + NA  P   LQ P  +     +AHI +  RR +A
Sbjct: 210 -FIEKTEGPFCVYLSYNA--PHTPLQAP--KALIEKYAHIESQKRRTYA 253


>gi|16131653|ref|NP_418245.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
           MG1655]
 gi|170083285|ref|YP_001732605.1| acrylsulfatase-like protein [Escherichia coli str. K-12 substr.
           DH10B]
 gi|238902878|ref|YP_002928674.1| acrylsulfatase-like enzyme [Escherichia coli BW2952]
 gi|300950438|ref|ZP_07164359.1| arylsulfatase [Escherichia coli MS 116-1]
 gi|300955197|ref|ZP_07167593.1| arylsulfatase [Escherichia coli MS 175-1]
 gi|386282536|ref|ZP_10060184.1| arylsulfatase [Escherichia sp. 4_1_40B]
 gi|386597667|ref|YP_006094067.1| sulfatase [Escherichia coli DH1]
 gi|387623453|ref|YP_006131081.1| acrylsulfatase-like protein [Escherichia coli DH1]
 gi|388479449|ref|YP_491641.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
           W3110]
 gi|417265077|ref|ZP_12052456.1| arylsulfatase [Escherichia coli 2.3916]
 gi|417279133|ref|ZP_12066443.1| arylsulfatase [Escherichia coli 3.2303]
 gi|417294264|ref|ZP_12081543.1| arylsulfatase [Escherichia coli B41]
 gi|417636751|ref|ZP_12286956.1| arylsulfatase [Escherichia coli STEC_S1191]
 gi|417945692|ref|ZP_12588922.1| arylsulfatase [Escherichia coli XH140A]
 gi|417977667|ref|ZP_12618448.1| arylsulfatase [Escherichia coli XH001]
 gi|418305431|ref|ZP_12917225.1| arylsulfatase [Escherichia coli UMNF18]
 gi|419150829|ref|ZP_13695474.1| sulfatase family protein [Escherichia coli DEC6B]
 gi|419938621|ref|ZP_14455447.1| arylsulfatase [Escherichia coli 75]
 gi|422818955|ref|ZP_16867167.1| arylsulfatase [Escherichia coli M919]
 gi|423703323|ref|ZP_17677755.1| arylsulfatase [Escherichia coli H730]
 gi|432629424|ref|ZP_19865388.1| arylsulfatase [Escherichia coli KTE77]
 gi|432663050|ref|ZP_19898677.1| arylsulfatase [Escherichia coli KTE111]
 gi|432687632|ref|ZP_19922919.1| arylsulfatase [Escherichia coli KTE156]
 gi|432689129|ref|ZP_19924394.1| arylsulfatase [Escherichia coli KTE161]
 gi|432739299|ref|ZP_19974026.1| arylsulfatase [Escherichia coli KTE42]
 gi|432878167|ref|ZP_20095616.1| arylsulfatase [Escherichia coli KTE154]
 gi|433050271|ref|ZP_20237590.1| arylsulfatase [Escherichia coli KTE120]
 gi|442591326|ref|ZP_21009811.1| Arylsulfatase [Escherichia coli O10:K5(L):H4 str. ATCC 23506]
 gi|450252901|ref|ZP_21902275.1| arylsulfatase [Escherichia coli S17]
 gi|114256|sp|P25549.2|ASLA_ECOLI RecName: Full=Arylsulfatase; Short=AS; AltName: Full=Aryl-sulfate
           sulphohydrolase; Flags: Precursor
 gi|148200|gb|AAA67597.1| unknown [Escherichia coli str. K-12 substr. MG1655]
 gi|1790233|gb|AAC76804.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
           MG1655]
 gi|85676250|dbj|BAE77500.1| acrylsulfatase-like enzyme [Escherichia coli str. K12 substr.
           W3110]
 gi|169891120|gb|ACB04827.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
           DH10B]
 gi|238859923|gb|ACR61921.1| acrylsulfatase-like enzyme [Escherichia coli BW2952]
 gi|260451356|gb|ACX41778.1| sulfatase [Escherichia coli DH1]
 gi|300317879|gb|EFJ67663.1| arylsulfatase [Escherichia coli MS 175-1]
 gi|300450227|gb|EFK13847.1| arylsulfatase [Escherichia coli MS 116-1]
 gi|315138377|dbj|BAJ45536.1| acrylsulfatase-like enzyme [Escherichia coli DH1]
 gi|339417529|gb|AEJ59201.1| arylsulfatase [Escherichia coli UMNF18]
 gi|342362592|gb|EGU26709.1| arylsulfatase [Escherichia coli XH140A]
 gi|344192660|gb|EGV46749.1| arylsulfatase [Escherichia coli XH001]
 gi|345384819|gb|EGX14677.1| arylsulfatase [Escherichia coli STEC_S1191]
 gi|359333933|dbj|BAL40380.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
           MDS42]
 gi|377988755|gb|EHV51930.1| sulfatase family protein [Escherichia coli DEC6B]
 gi|385537513|gb|EIF84384.1| arylsulfatase [Escherichia coli M919]
 gi|385708462|gb|EIG45474.1| arylsulfatase [Escherichia coli H730]
 gi|386120386|gb|EIG69015.1| arylsulfatase [Escherichia sp. 4_1_40B]
 gi|386221259|gb|EII43703.1| arylsulfatase [Escherichia coli 2.3916]
 gi|386237910|gb|EII74850.1| arylsulfatase [Escherichia coli 3.2303]
 gi|386252452|gb|EIJ02144.1| arylsulfatase [Escherichia coli B41]
 gi|388409969|gb|EIL70230.1| arylsulfatase [Escherichia coli 75]
 gi|431160114|gb|ELE60632.1| arylsulfatase [Escherichia coli KTE77]
 gi|431196490|gb|ELE95416.1| arylsulfatase [Escherichia coli KTE111]
 gi|431218879|gb|ELF16304.1| arylsulfatase [Escherichia coli KTE156]
 gi|431234376|gb|ELF29777.1| arylsulfatase [Escherichia coli KTE161]
 gi|431278972|gb|ELF69943.1| arylsulfatase [Escherichia coli KTE42]
 gi|431417407|gb|ELG99870.1| arylsulfatase [Escherichia coli KTE154]
 gi|431561779|gb|ELI35141.1| arylsulfatase [Escherichia coli KTE120]
 gi|441608564|emb|CCP95648.1| Arylsulfatase [Escherichia coli O10:K5(L):H4 str. ATCC 23506]
 gi|449314180|gb|EMD04354.1| arylsulfatase [Escherichia coli S17]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|422836217|ref|ZP_16884265.1| arylsulfatase [Escherichia coli E101]
 gi|371609566|gb|EHN98103.1| arylsulfatase [Escherichia coli E101]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|171912352|ref|ZP_02927822.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 491

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 91/197 (46%), Gaps = 16/197 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G  G     TPN+D +A  G+   R Y   P C+ SR A +TG YP R GI   +
Sbjct: 45  GYGDLGCFGAKGQATPNLDRMAAEGVKFERFYVAQPVCSASRMALMTGCYPNRVGIKGAL 104

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNGY 139
           G G    +   E  L + +K+ GY+T   GKWH+G +  + LP   GFD ++G  Y N  
Sbjct: 105 GPGAKVGISKEETTLAELVKQNGYATAAFGKWHLG-DDPQFLPVRHGFDEYLGLPYSNDM 163

Query: 140 LTY-----NDSIHETDFAVG------LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
             Y     N +  +     G      +D  R +      +    LT ++T ++V  I + 
Sbjct: 164 WPYHPELVNLTPEQRKKRRGFPALPLVDGDRIILPEVTTVEQTRLTTWYTQRAVKFINT- 222

Query: 189 NHSRPLFLQITHAAVHT 205
           N  +P  L + H+  H 
Sbjct: 223 NKDKPFLLYLAHSMPHV 239


>gi|432578071|ref|ZP_19814516.1| arylsulfatase [Escherichia coli KTE56]
 gi|431111494|gb|ELE15393.1| arylsulfatase [Escherichia coli KTE56]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|416812738|ref|ZP_11890780.1| arylsulfatase [Escherichia coli O55:H7 str. 3256-97]
 gi|419123311|ref|ZP_13668247.1| sulfatase family protein [Escherichia coli DEC5B]
 gi|320655339|gb|EFX23281.1| arylsulfatase [Escherichia coli O55:H7 str. 3256-97 TW 07815]
 gi|377960957|gb|EHV24432.1| sulfatase family protein [Escherichia coli DEC5B]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|331655483|ref|ZP_08356476.1| arylsulfatase [Escherichia coli M718]
 gi|331046804|gb|EGI18888.1| arylsulfatase [Escherichia coli M718]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|301646135|ref|ZP_07246034.1| arylsulfatase [Escherichia coli MS 146-1]
 gi|331644533|ref|ZP_08345653.1| arylsulfatase [Escherichia coli H736]
 gi|432634707|ref|ZP_19870604.1| arylsulfatase [Escherichia coli KTE81]
 gi|432706534|ref|ZP_19941627.1| arylsulfatase [Escherichia coli KTE171]
 gi|301075604|gb|EFK90410.1| arylsulfatase [Escherichia coli MS 146-1]
 gi|331036205|gb|EGI08440.1| arylsulfatase [Escherichia coli H736]
 gi|431175847|gb|ELE75834.1| arylsulfatase [Escherichia coli KTE81]
 gi|431239856|gb|ELF34322.1| arylsulfatase [Escherichia coli KTE171]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|386616609|ref|YP_006136275.1| arylsulfatase [Escherichia coli UMNK88]
 gi|404377193|ref|ZP_10982332.1| arylsulfatase [Escherichia sp. 1_1_43]
 gi|419177522|ref|ZP_13721328.1| sulfatase family protein [Escherichia coli DEC7B]
 gi|421777517|ref|ZP_16214112.1| hypothetical protein ECAD30_36210 [Escherichia coli AD30]
 gi|422769204|ref|ZP_16822925.1| sulfatase [Escherichia coli E1520]
 gi|226838702|gb|EEH70730.1| arylsulfatase [Escherichia sp. 1_1_43]
 gi|323934189|gb|EGB30620.1| sulfatase [Escherichia coli E1520]
 gi|332345778|gb|AEE59112.1| arylsulfatase [Escherichia coli UMNK88]
 gi|378028430|gb|EHV91048.1| sulfatase family protein [Escherichia coli DEC7B]
 gi|408457431|gb|EKJ81227.1| hypothetical protein ECAD30_36210 [Escherichia coli AD30]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|149198127|ref|ZP_01875174.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
 gi|149138729|gb|EDM27135.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
          Length = 484

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 92/194 (47%), Gaps = 17/194 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G ND+  +G     TP++D LA +G+     YT  P C P+R A LTGKYP R+ +  P 
Sbjct: 31  GVNDLSCNGSTFYETPHMDQLAADGVKFTNAYTAFPRCLPARQALLTGKYPSRFDVQ-PY 89

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN--HVGYWNGY 139
                + +P  E    + LKE GY T  IGKWH+G   ++  P  +GFD+  H G+    
Sbjct: 90  ---PKQHLPFEEVTFGEALKEEGYETSYIGKWHLGHKGQD--PSKQGFDHIVHTGHAGAT 144

Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
            ++        +   ++   ++E         YLTD   D++   IKS    +P  L + 
Sbjct: 145 KSFF-------YPFPVEKGHSVENPVKGKEGDYLTDILRDEACEFIKS-KADKPFLLVMA 196

Query: 200 HAAVHTGTAGNAKL 213
           H AVHT   G   L
Sbjct: 197 HYAVHTPLEGRPDL 210


>gi|432951060|ref|ZP_20144803.1| arylsulfatase [Escherichia coli KTE197]
 gi|431477526|gb|ELH57294.1| arylsulfatase [Escherichia coli KTE197]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|196230145|ref|ZP_03129008.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225742|gb|EDY20249.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 487

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 67/209 (32%), Positives = 97/209 (46%), Gaps = 34/209 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ DVG +G     TPN D LA+ G    + H     C+ SRAA +TG YP R GI+  +
Sbjct: 44  GYADVGVYGAKGFETPNFDRLAHEGRRFTDFHVAQAVCSASRAAIMTGCYPNRIGIEGAM 103

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +   E  +PQ  K  GY+T ++GKWH+G    E LP +RGFD     W G   
Sbjct: 104 EPWYKFGISDQELTMPQMFKRKGYATGMVGKWHLG-TPTEFLPTHRGFDE----WFGLPY 158

Query: 142 YNDS---------------IHETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHV 184
            ND                ++E D  +  G++  R+ME+         LT  +T+++V+ 
Sbjct: 159 SNDQWPLHPEKPGKFPPLPLYEGDKVINPGIN-HRDMEQ---------LTTQYTERAVNF 208

Query: 185 IKSHNHSRPLFLQITHAAVHTGTAGNAKL 213
           I   NH +P FL +     H   A + K 
Sbjct: 209 I-DRNHDKPFFLYVAQTMPHVPLAVSDKF 236


>gi|432452061|ref|ZP_19694315.1| arylsulfatase [Escherichia coli KTE193]
 gi|433035723|ref|ZP_20223410.1| arylsulfatase [Escherichia coli KTE112]
 gi|430977211|gb|ELC94062.1| arylsulfatase [Escherichia coli KTE193]
 gi|431545828|gb|ELI20473.1| arylsulfatase [Escherichia coli KTE112]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGLTT-LPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|301029014|ref|ZP_07192169.1| arylsulfatase [Escherichia coli MS 196-1]
 gi|299878025|gb|EFI86236.1| arylsulfatase [Escherichia coli MS 196-1]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|293417264|ref|ZP_06659889.1| arylsulfatase [Escherichia coli B185]
 gi|416778747|ref|ZP_11876078.1| arylsulfatase [Escherichia coli O157:H7 str. G5101]
 gi|416790105|ref|ZP_11880971.1| arylsulfatase [Escherichia coli O157:H- str. 493-89]
 gi|416801879|ref|ZP_11885859.1| arylsulfatase [Escherichia coli O157:H- str. H 2687]
 gi|419077994|ref|ZP_13623490.1| sulfatase family protein [Escherichia coli DEC3F]
 gi|420283149|ref|ZP_14785379.1| arylsulfatase [Escherichia coli TW06591]
 gi|425263807|ref|ZP_18655783.1| arylsulfatase [Escherichia coli EC96038]
 gi|445014625|ref|ZP_21330719.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA48]
 gi|209753338|gb|ACI74976.1| HemY protein [Escherichia coli]
 gi|291431032|gb|EFF04027.1| arylsulfatase [Escherichia coli B185]
 gi|320639283|gb|EFX08905.1| arylsulfatase [Escherichia coli O157:H7 str. G5101]
 gi|320644668|gb|EFX13718.1| arylsulfatase [Escherichia coli O157:H- str. 493-89]
 gi|320649993|gb|EFX18496.1| arylsulfatase [Escherichia coli O157:H- str. H 2687]
 gi|377917014|gb|EHU81083.1| sulfatase family protein [Escherichia coli DEC3F]
 gi|390779048|gb|EIO46785.1| arylsulfatase [Escherichia coli TW06591]
 gi|408177243|gb|EKI04058.1| arylsulfatase [Escherichia coli EC96038]
 gi|444620232|gb|ELV94241.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA48]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|255653026|ref|NP_001157425.1| steryl-sulfatase precursor [Equus caballus]
          Length = 578

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 53/120 (44%), Positives = 68/120 (56%), Gaps = 12/120 (10%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D G +G   + TPNID LA  G+ L +H    P CTPSRAAF+TG+YP R G+ +  
Sbjct: 33  GIGDPGCYGNKTLRTPNIDRLAEGGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASQS 92

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--C-NKEELL--PFNRGFD 130
             GV      +  +P +E    + LK  GYST LIGKWH+G  C NK +    P + GFD
Sbjct: 93  KVGVFLFSASSGGLPTSEITFAKLLKNQGYSTALIGKWHLGTNCHNKTDFCHHPLSHGFD 152


>gi|387609603|ref|YP_006098459.1| arylsulfatase [Escherichia coli 042]
 gi|284923903|emb|CBG37002.1| arylsulfatase [Escherichia coli 042]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432943494|ref|ZP_20140329.1| arylsulfatase [Escherichia coli KTE196]
 gi|433045335|ref|ZP_20232807.1| arylsulfatase [Escherichia coli KTE117]
 gi|431466713|gb|ELH46730.1| arylsulfatase [Escherichia coli KTE196]
 gi|431551968|gb|ELI25931.1| arylsulfatase [Escherichia coli KTE117]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|424818211|ref|ZP_18243362.1| arylsulfatase-like enzyme [Escherichia fergusonii ECD227]
 gi|325499231|gb|EGC97090.1| arylsulfatase-like enzyme [Escherichia fergusonii ECD227]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|416833512|ref|ZP_11900392.1| arylsulfatase [Escherichia coli O157:H7 str. LSU-61]
 gi|320666088|gb|EFX33102.1| arylsulfatase [Escherichia coli O157:H7 str. LSU-61]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432866678|ref|ZP_20089015.1| arylsulfatase [Escherichia coli KTE146]
 gi|431400801|gb|ELG84165.1| arylsulfatase [Escherichia coli KTE146]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 461

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 89/194 (45%), Gaps = 27/194 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNG-IVLNRHYTLPTCTPSRAAFLTGKYPFRYG----I 77
           G+ D GF G  DI TPNID LA  G I  + H      +PSR+  LTG+Y  RYG    +
Sbjct: 42  GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECNL 101

Query: 78  DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR----GFDNHV 133
           D P        +P  E+LLP  LK  GY T  IGKWH+G       PF R    GFD   
Sbjct: 102 DKP-----GDGIPDDEELLPALLKRYGYRTGCIGKWHLGSK-----PFQRPNAKGFDTFY 151

Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--NHS 191
           G   G+ +Y    ++ + +   D   N+++Y           +FTD+     +       
Sbjct: 152 GLLAGHRSY---FYDPETS---DKDGNLQQYQYNGQKLSFDGYFTDELASKARQFVAESE 205

Query: 192 RPLFLQITHAAVHT 205
           +P  L ++  A H+
Sbjct: 206 QPFMLYMSFTAPHS 219


>gi|417588940|ref|ZP_12239701.1| arylsulfatase [Escherichia coli STEC_C165-02]
 gi|432491631|ref|ZP_19733489.1| arylsulfatase [Escherichia coli KTE213]
 gi|432841656|ref|ZP_20075110.1| arylsulfatase [Escherichia coli KTE140]
 gi|433205551|ref|ZP_20389292.1| arylsulfatase [Escherichia coli KTE95]
 gi|345331076|gb|EGW63537.1| arylsulfatase [Escherichia coli STEC_C165-02]
 gi|431016987|gb|ELD30504.1| arylsulfatase [Escherichia coli KTE213]
 gi|431384928|gb|ELG68918.1| arylsulfatase [Escherichia coli KTE140]
 gi|431715513|gb|ELJ79661.1| arylsulfatase [Escherichia coli KTE95]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|366158865|ref|ZP_09458727.1| arylsulfatase [Escherichia sp. TW09308]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|331685518|ref|ZP_08386102.1| arylsulfatase [Escherichia coli H299]
 gi|450195629|ref|ZP_21892583.1| arylsulfatase [Escherichia coli SEPT362]
 gi|331077219|gb|EGI48433.1| arylsulfatase [Escherichia coli H299]
 gi|449316170|gb|EMD06291.1| arylsulfatase [Escherichia coli SEPT362]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|416900367|ref|ZP_11929642.1| arylsulfatase [Escherichia coli STEC_7v]
 gi|417116418|ref|ZP_11967279.1| arylsulfatase [Escherichia coli 1.2741]
 gi|422335426|ref|ZP_16416425.1| arylsulfatase [Escherichia coli 4_1_47FAA]
 gi|422784491|ref|ZP_16837271.1| sulfatase [Escherichia coli TW10509]
 gi|422803393|ref|ZP_16851881.1| sulfatase [Escherichia coli M863]
 gi|432768176|ref|ZP_20002565.1| arylsulfatase [Escherichia coli KTE50]
 gi|432964607|ref|ZP_20153677.1| arylsulfatase [Escherichia coli KTE202]
 gi|433065269|ref|ZP_20252170.1| arylsulfatase [Escherichia coli KTE125]
 gi|323964045|gb|EGB59535.1| sulfatase [Escherichia coli M863]
 gi|323974382|gb|EGB69510.1| sulfatase [Escherichia coli TW10509]
 gi|327250650|gb|EGE62356.1| arylsulfatase [Escherichia coli STEC_7v]
 gi|373243576|gb|EHP63078.1| arylsulfatase [Escherichia coli 4_1_47FAA]
 gi|386138962|gb|EIG80117.1| arylsulfatase [Escherichia coli 1.2741]
 gi|431321440|gb|ELG09041.1| arylsulfatase [Escherichia coli KTE50]
 gi|431467324|gb|ELH47334.1| arylsulfatase [Escherichia coli KTE202]
 gi|431577842|gb|ELI50465.1| arylsulfatase [Escherichia coli KTE125]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432891395|ref|ZP_20104113.1| arylsulfatase [Escherichia coli KTE165]
 gi|431429800|gb|ELH11635.1| arylsulfatase [Escherichia coli KTE165]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432817589|ref|ZP_20051339.1| arylsulfatase [Escherichia coli KTE115]
 gi|431360005|gb|ELG46626.1| arylsulfatase [Escherichia coli KTE115]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432394470|ref|ZP_19637286.1| arylsulfatase [Escherichia coli KTE21]
 gi|430913861|gb|ELC34980.1| arylsulfatase [Escherichia coli KTE21]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|331649624|ref|ZP_08350706.1| arylsulfatase [Escherichia coli M605]
 gi|417664426|ref|ZP_12314005.1| arylsulfatase [Escherichia coli AA86]
 gi|432399749|ref|ZP_19642522.1| arylsulfatase [Escherichia coli KTE25]
 gi|432725267|ref|ZP_19960180.1| arylsulfatase [Escherichia coli KTE17]
 gi|432729876|ref|ZP_19964748.1| arylsulfatase [Escherichia coli KTE18]
 gi|432743565|ref|ZP_19978278.1| arylsulfatase [Escherichia coli KTE23]
 gi|432988296|ref|ZP_20176975.1| arylsulfatase [Escherichia coli KTE217]
 gi|433113077|ref|ZP_20298924.1| arylsulfatase [Escherichia coli KTE150]
 gi|330908100|gb|EGH36619.1| arylsulfatase [Escherichia coli AA86]
 gi|331041494|gb|EGI13642.1| arylsulfatase [Escherichia coli M605]
 gi|430912911|gb|ELC34083.1| arylsulfatase [Escherichia coli KTE25]
 gi|431262486|gb|ELF54476.1| arylsulfatase [Escherichia coli KTE17]
 gi|431270646|gb|ELF61808.1| arylsulfatase [Escherichia coli KTE18]
 gi|431280856|gb|ELF71765.1| arylsulfatase [Escherichia coli KTE23]
 gi|431502009|gb|ELH80902.1| arylsulfatase [Escherichia coli KTE217]
 gi|431624566|gb|ELI93182.1| arylsulfatase [Escherichia coli KTE150]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432604635|ref|ZP_19840861.1| arylsulfatase [Escherichia coli KTE66]
 gi|431136569|gb|ELE38427.1| arylsulfatase [Escherichia coli KTE66]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|15804389|ref|NP_290429.1| arylsulfatase [Escherichia coli O157:H7 str. EDL933]
 gi|15833985|ref|NP_312758.1| arylsulfatase [Escherichia coli O157:H7 str. Sakai]
 gi|168750392|ref|ZP_02775414.1| arylsulfatase [Escherichia coli O157:H7 str. EC4113]
 gi|168753693|ref|ZP_02778700.1| arylsulfatase [Escherichia coli O157:H7 str. EC4401]
 gi|168768077|ref|ZP_02793084.1| arylsulfatase [Escherichia coli O157:H7 str. EC4486]
 gi|168775653|ref|ZP_02800660.1| arylsulfatase [Escherichia coli O157:H7 str. EC4196]
 gi|168780695|ref|ZP_02805702.1| arylsulfatase [Escherichia coli O157:H7 str. EC4076]
 gi|168786634|ref|ZP_02811641.1| arylsulfatase [Escherichia coli O157:H7 str. EC869]
 gi|168801140|ref|ZP_02826147.1| arylsulfatase [Escherichia coli O157:H7 str. EC508]
 gi|195938087|ref|ZP_03083469.1| arylsulfatase [Escherichia coli O157:H7 str. EC4024]
 gi|208807165|ref|ZP_03249502.1| arylsulfatase [Escherichia coli O157:H7 str. EC4206]
 gi|208812341|ref|ZP_03253670.1| arylsulfatase [Escherichia coli O157:H7 str. EC4045]
 gi|208818746|ref|ZP_03259066.1| arylsulfatase [Escherichia coli O157:H7 str. EC4042]
 gi|209399246|ref|YP_002273316.1| arylsulfatase [Escherichia coli O157:H7 str. EC4115]
 gi|217324531|ref|ZP_03440615.1| arylsulfatase [Escherichia coli O157:H7 str. TW14588]
 gi|254795796|ref|YP_003080633.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. TW14359]
 gi|261225573|ref|ZP_05939854.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. FRIK2000]
 gi|261255619|ref|ZP_05948152.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. FRIK966]
 gi|387885028|ref|YP_006315330.1| arylsulfatase [Escherichia coli Xuzhou21]
 gi|416307618|ref|ZP_11654659.1| Arylsulfatase [Escherichia coli O157:H7 str. 1044]
 gi|416319752|ref|ZP_11662304.1| Arylsulfatase [Escherichia coli O157:H7 str. EC1212]
 gi|416326910|ref|ZP_11666985.1| Arylsulfatase [Escherichia coli O157:H7 str. 1125]
 gi|419043232|ref|ZP_13590209.1| sulfatase family protein [Escherichia coli DEC3A]
 gi|419053675|ref|ZP_13600540.1| sulfatase family protein [Escherichia coli DEC3B]
 gi|419065757|ref|ZP_13612456.1| sulfatase family protein [Escherichia coli DEC3D]
 gi|419089105|ref|ZP_13634453.1| sulfatase family protein [Escherichia coli DEC4B]
 gi|419094926|ref|ZP_13640200.1| sulfatase family protein [Escherichia coli DEC4C]
 gi|420284129|ref|ZP_14786350.1| arylsulfatase [Escherichia coli TW10246]
 gi|420289833|ref|ZP_14792003.1| arylsulfatase [Escherichia coli TW11039]
 gi|420306826|ref|ZP_14808811.1| arylsulfatase [Escherichia coli TW10119]
 gi|420312194|ref|ZP_14814119.1| arylsulfatase [Escherichia coli EC1738]
 gi|420317848|ref|ZP_14819716.1| arylsulfatase [Escherichia coli EC1734]
 gi|421826575|ref|ZP_16261927.1| arylsulfatase [Escherichia coli FRIK920]
 gi|421833433|ref|ZP_16268710.1| arylsulfatase [Escherichia coli PA7]
 gi|424086526|ref|ZP_17822995.1| arylsulfatase [Escherichia coli FDA517]
 gi|424149999|ref|ZP_17881358.1| arylsulfatase [Escherichia coli PA15]
 gi|424163724|ref|ZP_17886776.1| arylsulfatase [Escherichia coli PA24]
 gi|424257376|ref|ZP_17892318.1| arylsulfatase [Escherichia coli PA25]
 gi|424336064|ref|ZP_17898254.1| arylsulfatase [Escherichia coli PA28]
 gi|424452330|ref|ZP_17903957.1| arylsulfatase [Escherichia coli PA32]
 gi|424465025|ref|ZP_17915332.1| arylsulfatase [Escherichia coli PA39]
 gi|424477749|ref|ZP_17927048.1| arylsulfatase [Escherichia coli PA42]
 gi|424483530|ref|ZP_17932496.1| arylsulfatase [Escherichia coli TW07945]
 gi|424489726|ref|ZP_17938248.1| arylsulfatase [Escherichia coli TW09098]
 gi|424503046|ref|ZP_17949915.1| arylsulfatase [Escherichia coli EC4203]
 gi|424509319|ref|ZP_17955671.1| arylsulfatase [Escherichia coli EC4196]
 gi|424516725|ref|ZP_17961296.1| arylsulfatase [Escherichia coli TW14313]
 gi|424522852|ref|ZP_17966941.1| arylsulfatase [Escherichia coli TW14301]
 gi|424528725|ref|ZP_17972420.1| arylsulfatase [Escherichia coli EC4421]
 gi|424565822|ref|ZP_18006808.1| arylsulfatase [Escherichia coli EC4437]
 gi|425106679|ref|ZP_18508978.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 5.2239]
 gi|425134377|ref|ZP_18535213.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 8.2524]
 gi|425140970|ref|ZP_18541336.1| arylsulfatase [Escherichia coli 10.0833]
 gi|425182830|ref|ZP_18580511.1| arylsulfatase [Escherichia coli FRIK1999]
 gi|425214470|ref|ZP_18609857.1| arylsulfatase [Escherichia coli PA4]
 gi|425220598|ref|ZP_18615545.1| arylsulfatase [Escherichia coli PA23]
 gi|425227243|ref|ZP_18621694.1| arylsulfatase [Escherichia coli PA49]
 gi|425233401|ref|ZP_18627425.1| arylsulfatase [Escherichia coli PA45]
 gi|425239322|ref|ZP_18633027.1| arylsulfatase [Escherichia coli TT12B]
 gi|425245557|ref|ZP_18638849.1| arylsulfatase [Escherichia coli MA6]
 gi|425297274|ref|ZP_18687384.1| arylsulfatase [Escherichia coli PA38]
 gi|425356984|ref|ZP_18743030.1| arylsulfatase [Escherichia coli EC1850]
 gi|425362933|ref|ZP_18748565.1| arylsulfatase [Escherichia coli EC1856]
 gi|425369198|ref|ZP_18754261.1| arylsulfatase [Escherichia coli EC1862]
 gi|425395119|ref|ZP_18778210.1| arylsulfatase [Escherichia coli EC1868]
 gi|425401173|ref|ZP_18783863.1| arylsulfatase [Escherichia coli EC1869]
 gi|425407269|ref|ZP_18789474.1| arylsulfatase [Escherichia coli EC1870]
 gi|425413627|ref|ZP_18795373.1| arylsulfatase [Escherichia coli NE098]
 gi|425419942|ref|ZP_18801197.1| arylsulfatase [Escherichia coli FRIK523]
 gi|425431239|ref|ZP_18811832.1| arylsulfatase [Escherichia coli 0.1304]
 gi|428955719|ref|ZP_19027493.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 88.1042]
 gi|428961741|ref|ZP_19033004.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 89.0511]
 gi|428968345|ref|ZP_19039033.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 90.0091]
 gi|428974127|ref|ZP_19044422.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 90.0039]
 gi|428980562|ref|ZP_19050355.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 90.2281]
 gi|428986322|ref|ZP_19055695.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 93.0055]
 gi|428992434|ref|ZP_19061406.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 93.0056]
 gi|428998330|ref|ZP_19066905.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 94.0618]
 gi|429004718|ref|ZP_19072762.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 95.0183]
 gi|429023077|ref|ZP_19089577.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 96.0428]
 gi|429047250|ref|ZP_19111946.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 96.0107]
 gi|444933149|ref|ZP_21252147.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0814]
 gi|444938616|ref|ZP_21257339.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0815]
 gi|444955357|ref|ZP_21273413.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0848]
 gi|444988025|ref|ZP_21304792.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA11]
 gi|444998582|ref|ZP_21315071.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA13]
 gi|445004127|ref|ZP_21320506.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA2]
 gi|445009545|ref|ZP_21325764.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA47]
 gi|445020547|ref|ZP_21336501.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA8]
 gi|445031363|ref|ZP_21347018.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.1781]
 gi|445061276|ref|ZP_21373782.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0670]
 gi|452967387|ref|ZP_21965614.1| arylsulfatase [Escherichia coli O157:H7 str. EC4009]
 gi|12518665|gb|AAG58993.1|AE005611_3 arylsulfatase [Escherichia coli O157:H7 str. EDL933]
 gi|13364207|dbj|BAB38154.1| arylsulfatase [Escherichia coli O157:H7 str. Sakai]
 gi|187768864|gb|EDU32708.1| arylsulfatase [Escherichia coli O157:H7 str. EC4196]
 gi|188015437|gb|EDU53559.1| arylsulfatase [Escherichia coli O157:H7 str. EC4113]
 gi|189001398|gb|EDU70384.1| arylsulfatase [Escherichia coli O157:H7 str. EC4076]
 gi|189359385|gb|EDU77804.1| arylsulfatase [Escherichia coli O157:H7 str. EC4401]
 gi|189362713|gb|EDU81132.1| arylsulfatase [Escherichia coli O157:H7 str. EC4486]
 gi|189373327|gb|EDU91743.1| arylsulfatase [Escherichia coli O157:H7 str. EC869]
 gi|189376690|gb|EDU95106.1| arylsulfatase [Escherichia coli O157:H7 str. EC508]
 gi|208726966|gb|EDZ76567.1| arylsulfatase [Escherichia coli O157:H7 str. EC4206]
 gi|208733618|gb|EDZ82305.1| arylsulfatase [Escherichia coli O157:H7 str. EC4045]
 gi|208738869|gb|EDZ86551.1| arylsulfatase [Escherichia coli O157:H7 str. EC4042]
 gi|209160646|gb|ACI38079.1| arylsulfatase [Escherichia coli O157:H7 str. EC4115]
 gi|209753340|gb|ACI74977.1| HemY protein [Escherichia coli]
 gi|209753342|gb|ACI74978.1| HemY protein [Escherichia coli]
 gi|209753346|gb|ACI74980.1| HemY protein [Escherichia coli]
 gi|217320752|gb|EEC29176.1| arylsulfatase [Escherichia coli O157:H7 str. TW14588]
 gi|254595196|gb|ACT74557.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. TW14359]
 gi|320191108|gb|EFW65758.1| Arylsulfatase [Escherichia coli O157:H7 str. EC1212]
 gi|326344255|gb|EGD68015.1| Arylsulfatase [Escherichia coli O157:H7 str. 1125]
 gi|326347917|gb|EGD71631.1| Arylsulfatase [Escherichia coli O157:H7 str. 1044]
 gi|377889357|gb|EHU53821.1| sulfatase family protein [Escherichia coli DEC3B]
 gi|377900988|gb|EHU65312.1| sulfatase family protein [Escherichia coli DEC3A]
 gi|377903743|gb|EHU68033.1| sulfatase family protein [Escherichia coli DEC3D]
 gi|377926648|gb|EHU90578.1| sulfatase family protein [Escherichia coli DEC4B]
 gi|377937826|gb|EHV01599.1| sulfatase family protein [Escherichia coli DEC4C]
 gi|386798486|gb|AFJ31520.1| arylsulfatase [Escherichia coli Xuzhou21]
 gi|390638282|gb|EIN17795.1| arylsulfatase [Escherichia coli FDA517]
 gi|390697452|gb|EIN71872.1| arylsulfatase [Escherichia coli PA15]
 gi|390717573|gb|EIN90355.1| arylsulfatase [Escherichia coli PA24]
 gi|390718159|gb|EIN90917.1| arylsulfatase [Escherichia coli PA25]
 gi|390724290|gb|EIN96850.1| arylsulfatase [Escherichia coli PA28]
 gi|390737522|gb|EIO08810.1| arylsulfatase [Escherichia coli PA32]
 gi|390758531|gb|EIO27972.1| arylsulfatase [Escherichia coli PA39]
 gi|390764824|gb|EIO34019.1| arylsulfatase [Escherichia coli PA42]
 gi|390786076|gb|EIO53604.1| arylsulfatase [Escherichia coli TW07945]
 gi|390796617|gb|EIO63888.1| arylsulfatase [Escherichia coli TW10246]
 gi|390800063|gb|EIO67176.1| arylsulfatase [Escherichia coli TW09098]
 gi|390803137|gb|EIO70161.1| arylsulfatase [Escherichia coli TW11039]
 gi|390813562|gb|EIO80172.1| arylsulfatase [Escherichia coli TW10119]
 gi|390822474|gb|EIO88593.1| arylsulfatase [Escherichia coli EC4203]
 gi|390827584|gb|EIO93340.1| arylsulfatase [Escherichia coli EC4196]
 gi|390840744|gb|EIP04747.1| arylsulfatase [Escherichia coli TW14313]
 gi|390842854|gb|EIP06687.1| arylsulfatase [Escherichia coli TW14301]
 gi|390847768|gb|EIP11292.1| arylsulfatase [Escherichia coli EC4421]
 gi|390890102|gb|EIP49788.1| arylsulfatase [Escherichia coli EC4437]
 gi|390897906|gb|EIP57206.1| arylsulfatase [Escherichia coli EC1738]
 gi|390905781|gb|EIP64706.1| arylsulfatase [Escherichia coli EC1734]
 gi|408061394|gb|EKG95913.1| arylsulfatase [Escherichia coli PA7]
 gi|408063893|gb|EKG98380.1| arylsulfatase [Escherichia coli FRIK920]
 gi|408094561|gb|EKH27578.1| arylsulfatase [Escherichia coli FRIK1999]
 gi|408125024|gb|EKH55664.1| arylsulfatase [Escherichia coli PA4]
 gi|408134768|gb|EKH64584.1| arylsulfatase [Escherichia coli PA23]
 gi|408136831|gb|EKH66561.1| arylsulfatase [Escherichia coli PA49]
 gi|408143728|gb|EKH73002.1| arylsulfatase [Escherichia coli PA45]
 gi|408152108|gb|EKH80557.1| arylsulfatase [Escherichia coli TT12B]
 gi|408157151|gb|EKH85317.1| arylsulfatase [Escherichia coli MA6]
 gi|408211269|gb|EKI35821.1| arylsulfatase [Escherichia coli PA38]
 gi|408271069|gb|EKI91218.1| arylsulfatase [Escherichia coli EC1850]
 gi|408274160|gb|EKI94185.1| arylsulfatase [Escherichia coli EC1856]
 gi|408282170|gb|EKJ01508.1| arylsulfatase [Escherichia coli EC1862]
 gi|408303342|gb|EKJ20804.1| arylsulfatase [Escherichia coli EC1868]
 gi|408315829|gb|EKJ32128.1| arylsulfatase [Escherichia coli EC1869]
 gi|408321282|gb|EKJ37321.1| arylsulfatase [Escherichia coli EC1870]
 gi|408323022|gb|EKJ38991.1| arylsulfatase [Escherichia coli NE098]
 gi|408333985|gb|EKJ48893.1| arylsulfatase [Escherichia coli FRIK523]
 gi|408341923|gb|EKJ56359.1| arylsulfatase [Escherichia coli 0.1304]
 gi|408544793|gb|EKK22239.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 5.2239]
 gi|408575638|gb|EKK51291.1| arylsulfatase [Escherichia coli 10.0833]
 gi|408578549|gb|EKK54066.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 8.2524]
 gi|427201292|gb|EKV71685.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 88.1042]
 gi|427201431|gb|EKV71813.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 89.0511]
 gi|427217561|gb|EKV86619.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 90.0091]
 gi|427221289|gb|EKV90150.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 90.2281]
 gi|427224246|gb|EKV92963.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 90.0039]
 gi|427237712|gb|EKW05236.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 93.0056]
 gi|427238127|gb|EKW05647.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 93.0055]
 gi|427242462|gb|EKW09869.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 94.0618]
 gi|427255779|gb|EKW22020.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 95.0183]
 gi|427273038|gb|EKW37738.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 96.0428]
 gi|427295797|gb|EKW58879.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 96.0107]
 gi|444534967|gb|ELV15132.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0814]
 gi|444545275|gb|ELV24202.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0815]
 gi|444559302|gb|ELV36536.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0848]
 gi|444589438|gb|ELV64773.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA11]
 gi|444603250|gb|ELV77960.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA13]
 gi|444612439|gb|ELV86732.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA2]
 gi|444619015|gb|ELV93076.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA47]
 gi|444626740|gb|ELW00530.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli PA8]
 gi|444637079|gb|ELW10455.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.1781]
 gi|444666662|gb|ELW38722.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Escherichia coli 99.0670]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|331675268|ref|ZP_08376019.1| arylsulfatase [Escherichia coli TA280]
 gi|432855803|ref|ZP_20083494.1| arylsulfatase [Escherichia coli KTE144]
 gi|331067554|gb|EGI38958.1| arylsulfatase [Escherichia coli TA280]
 gi|431397088|gb|ELG80549.1| arylsulfatase [Escherichia coli KTE144]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|300930037|ref|ZP_07145468.1| arylsulfatase [Escherichia coli MS 187-1]
 gi|300462052|gb|EFK25545.1| arylsulfatase [Escherichia coli MS 187-1]
          Length = 551

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|386621472|ref|YP_006141052.1| Arysulfatase [Escherichia coli NA114]
 gi|387831691|ref|YP_003351628.1| arylsulfatase [Escherichia coli SE15]
 gi|417285665|ref|ZP_12072956.1| arylsulfatase [Escherichia coli TW07793]
 gi|425302701|ref|ZP_18692579.1| sulfatase [Escherichia coli 07798]
 gi|432424203|ref|ZP_19666739.1| arylsulfatase [Escherichia coli KTE178]
 gi|432502356|ref|ZP_19744104.1| arylsulfatase [Escherichia coli KTE216]
 gi|432561066|ref|ZP_19797718.1| arylsulfatase [Escherichia coli KTE49]
 gi|432696664|ref|ZP_19931854.1| arylsulfatase [Escherichia coli KTE162]
 gi|432708193|ref|ZP_19943267.1| arylsulfatase [Escherichia coli KTE6]
 gi|432923069|ref|ZP_20125775.1| arylsulfatase [Escherichia coli KTE173]
 gi|432929759|ref|ZP_20130711.1| arylsulfatase [Escherichia coli KTE175]
 gi|432983306|ref|ZP_20172072.1| arylsulfatase [Escherichia coli KTE211]
 gi|433098628|ref|ZP_20284793.1| arylsulfatase [Escherichia coli KTE139]
 gi|433108057|ref|ZP_20294015.1| arylsulfatase [Escherichia coli KTE148]
 gi|281180848|dbj|BAI57178.1| arylsulfatase [Escherichia coli SE15]
 gi|333971973|gb|AEG38778.1| Arysulfatase [Escherichia coli NA114]
 gi|386250906|gb|EII97073.1| arylsulfatase [Escherichia coli TW07793]
 gi|408210360|gb|EKI34925.1| sulfatase [Escherichia coli 07798]
 gi|430941426|gb|ELC61573.1| arylsulfatase [Escherichia coli KTE178]
 gi|431025678|gb|ELD38776.1| arylsulfatase [Escherichia coli KTE216]
 gi|431088262|gb|ELD94158.1| arylsulfatase [Escherichia coli KTE49]
 gi|431230664|gb|ELF26439.1| arylsulfatase [Escherichia coli KTE162]
 gi|431254637|gb|ELF47905.1| arylsulfatase [Escherichia coli KTE6]
 gi|431434482|gb|ELH16131.1| arylsulfatase [Escherichia coli KTE173]
 gi|431439906|gb|ELH21237.1| arylsulfatase [Escherichia coli KTE175]
 gi|431487956|gb|ELH67597.1| arylsulfatase [Escherichia coli KTE211]
 gi|431612056|gb|ELI81311.1| arylsulfatase [Escherichia coli KTE139]
 gi|431623625|gb|ELI92256.1| arylsulfatase [Escherichia coli KTE148]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|170681296|ref|YP_001746117.1| arylsulfatase [Escherichia coli SMS-3-5]
 gi|218701288|ref|YP_002408917.1| arylsulfatase-like enzyme [Escherichia coli IAI39]
 gi|218707435|ref|YP_002414954.1| arylsulfatase-like enzyme [Escherichia coli UMN026]
 gi|251787058|ref|YP_003001362.1| arylsulfatase [Escherichia coli BL21(DE3)]
 gi|253775576|ref|YP_003038407.1| sulfatase [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
 gi|254163742|ref|YP_003046850.1| acrylsulfatase-like enzyme [Escherichia coli B str. REL606]
 gi|254290492|ref|YP_003056240.1| acrylsulfatase-like protein [Escherichia coli BL21(DE3)]
 gi|300900653|ref|ZP_07118810.1| arylsulfatase [Escherichia coli MS 198-1]
 gi|300939985|ref|ZP_07154612.1| arylsulfatase [Escherichia coli MS 21-1]
 gi|301025769|ref|ZP_07189282.1| arylsulfatase [Escherichia coli MS 69-1]
 gi|386626692|ref|YP_006146420.1| acrylsulfatase-like protein [Escherichia coli O7:K1 str. CE10]
 gi|417142458|ref|ZP_11985033.1| arylsulfatase [Escherichia coli 97.0259]
 gi|417310376|ref|ZP_12097190.1| Arylsulfatase [Escherichia coli PCN033]
 gi|419918842|ref|ZP_14437018.1| arylsulfatase [Escherichia coli KD2]
 gi|419937453|ref|ZP_14454349.1| arylsulfatase [Escherichia coli 576-1]
 gi|422789242|ref|ZP_16841973.1| sulfatase [Escherichia coli H489]
 gi|422794128|ref|ZP_16846819.1| sulfatase [Escherichia coli TA007]
 gi|422977438|ref|ZP_16977390.1| arylsulfatase [Escherichia coli TA124]
 gi|432355836|ref|ZP_19599096.1| arylsulfatase [Escherichia coli KTE2]
 gi|432404201|ref|ZP_19646943.1| arylsulfatase [Escherichia coli KTE26]
 gi|432428468|ref|ZP_19670947.1| arylsulfatase [Escherichia coli KTE181]
 gi|432463169|ref|ZP_19705299.1| arylsulfatase [Escherichia coli KTE204]
 gi|432478164|ref|ZP_19720148.1| arylsulfatase [Escherichia coli KTE208]
 gi|432520017|ref|ZP_19757195.1| arylsulfatase [Escherichia coli KTE228]
 gi|432540185|ref|ZP_19777075.1| arylsulfatase [Escherichia coli KTE235]
 gi|432545634|ref|ZP_19782456.1| arylsulfatase [Escherichia coli KTE236]
 gi|432551113|ref|ZP_19787861.1| arylsulfatase [Escherichia coli KTE237]
 gi|432619113|ref|ZP_19855210.1| arylsulfatase [Escherichia coli KTE75]
 gi|432624169|ref|ZP_19860181.1| arylsulfatase [Escherichia coli KTE76]
 gi|432633749|ref|ZP_19869665.1| arylsulfatase [Escherichia coli KTE80]
 gi|432643401|ref|ZP_19879221.1| arylsulfatase [Escherichia coli KTE83]
 gi|432668396|ref|ZP_19903964.1| arylsulfatase [Escherichia coli KTE116]
 gi|432682583|ref|ZP_19917933.1| arylsulfatase [Escherichia coli KTE143]
 gi|432716430|ref|ZP_19951443.1| arylsulfatase [Escherichia coli KTE9]
 gi|432772575|ref|ZP_20006886.1| arylsulfatase [Escherichia coli KTE54]
 gi|432795059|ref|ZP_20029130.1| arylsulfatase [Escherichia coli KTE78]
 gi|432796570|ref|ZP_20030603.1| arylsulfatase [Escherichia coli KTE79]
 gi|432889599|ref|ZP_20102871.1| arylsulfatase [Escherichia coli KTE158]
 gi|432915470|ref|ZP_20120725.1| arylsulfatase [Escherichia coli KTE190]
 gi|433021056|ref|ZP_20209132.1| arylsulfatase [Escherichia coli KTE105]
 gi|433055431|ref|ZP_20242583.1| arylsulfatase [Escherichia coli KTE122]
 gi|433070166|ref|ZP_20256927.1| arylsulfatase [Escherichia coli KTE128]
 gi|433160958|ref|ZP_20345771.1| arylsulfatase [Escherichia coli KTE177]
 gi|433180675|ref|ZP_20365046.1| arylsulfatase [Escherichia coli KTE82]
 gi|170519014|gb|ACB17192.1| arylsulfatase [Escherichia coli SMS-3-5]
 gi|218371274|emb|CAR19108.1| arylsulfatase-like enzyme [Escherichia coli IAI39]
 gi|218434532|emb|CAR15458.1| arylsulfatase-like enzyme [Escherichia coli UMN026]
 gi|242379331|emb|CAQ34142.1| arylsulfatase [Escherichia coli BL21(DE3)]
 gi|253326620|gb|ACT31222.1| sulfatase [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
 gi|253975643|gb|ACT41314.1| acrylsulfatase-like enzyme [Escherichia coli B str. REL606]
 gi|253979799|gb|ACT45469.1| acrylsulfatase-like enzyme [Escherichia coli BL21(DE3)]
 gi|300355853|gb|EFJ71723.1| arylsulfatase [Escherichia coli MS 198-1]
 gi|300395824|gb|EFJ79362.1| arylsulfatase [Escherichia coli MS 69-1]
 gi|300455159|gb|EFK18652.1| arylsulfatase [Escherichia coli MS 21-1]
 gi|323959055|gb|EGB54724.1| sulfatase [Escherichia coli H489]
 gi|323969359|gb|EGB64658.1| sulfatase [Escherichia coli TA007]
 gi|338768019|gb|EGP22825.1| Arylsulfatase [Escherichia coli PCN033]
 gi|349740428|gb|AEQ15134.1| acrylsulfatase-like enzyme [Escherichia coli O7:K1 str. CE10]
 gi|371593286|gb|EHN82169.1| arylsulfatase [Escherichia coli TA124]
 gi|386155482|gb|EIH11837.1| arylsulfatase [Escherichia coli 97.0259]
 gi|388389333|gb|EIL50867.1| arylsulfatase [Escherichia coli KD2]
 gi|388397635|gb|EIL58607.1| arylsulfatase [Escherichia coli 576-1]
 gi|430872049|gb|ELB95668.1| arylsulfatase [Escherichia coli KTE2]
 gi|430922521|gb|ELC43273.1| arylsulfatase [Escherichia coli KTE26]
 gi|430950294|gb|ELC69680.1| arylsulfatase [Escherichia coli KTE181]
 gi|430985119|gb|ELD01726.1| arylsulfatase [Escherichia coli KTE204]
 gi|431001673|gb|ELD17249.1| arylsulfatase [Escherichia coli KTE208]
 gi|431047436|gb|ELD57436.1| arylsulfatase [Escherichia coli KTE228]
 gi|431066676|gb|ELD75300.1| arylsulfatase [Escherichia coli KTE235]
 gi|431070527|gb|ELD78830.1| arylsulfatase [Escherichia coli KTE236]
 gi|431075966|gb|ELD83482.1| arylsulfatase [Escherichia coli KTE237]
 gi|431150628|gb|ELE51678.1| arylsulfatase [Escherichia coli KTE75]
 gi|431155700|gb|ELE56446.1| arylsulfatase [Escherichia coli KTE76]
 gi|431166920|gb|ELE67223.1| arylsulfatase [Escherichia coli KTE80]
 gi|431176984|gb|ELE76924.1| arylsulfatase [Escherichia coli KTE83]
 gi|431197016|gb|ELE95883.1| arylsulfatase [Escherichia coli KTE116]
 gi|431216855|gb|ELF14447.1| arylsulfatase [Escherichia coli KTE143]
 gi|431269839|gb|ELF61140.1| arylsulfatase [Escherichia coli KTE9]
 gi|431323462|gb|ELG10960.1| arylsulfatase [Escherichia coli KTE54]
 gi|431335466|gb|ELG22604.1| arylsulfatase [Escherichia coli KTE78]
 gi|431347741|gb|ELG34619.1| arylsulfatase [Escherichia coli KTE79]
 gi|431413193|gb|ELG95987.1| arylsulfatase [Escherichia coli KTE158]
 gi|431435072|gb|ELH16685.1| arylsulfatase [Escherichia coli KTE190]
 gi|431526493|gb|ELI03242.1| arylsulfatase [Escherichia coli KTE105]
 gi|431565331|gb|ELI38466.1| arylsulfatase [Escherichia coli KTE122]
 gi|431578355|gb|ELI50962.1| arylsulfatase [Escherichia coli KTE128]
 gi|431673056|gb|ELJ39287.1| arylsulfatase [Escherichia coli KTE177]
 gi|431697635|gb|ELJ62737.1| arylsulfatase [Escherichia coli KTE82]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|432871692|ref|ZP_20091722.1| arylsulfatase [Escherichia coli KTE147]
 gi|431407654|gb|ELG90863.1| arylsulfatase [Escherichia coli KTE147]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|26250538|ref|NP_756578.1| arylsulfatase [Escherichia coli CFT073]
 gi|91213321|ref|YP_543307.1| arylsulfatase [Escherichia coli UTI89]
 gi|117626058|ref|YP_859381.1| arylsulfatase-like enzyme [Escherichia coli APEC O1]
 gi|218560864|ref|YP_002393777.1| arylsulfatase-like enzyme [Escherichia coli S88]
 gi|222158494|ref|YP_002558633.1| Arylsulfatase [Escherichia coli LF82]
 gi|227888617|ref|ZP_04006422.1| arylsulfatase [Escherichia coli 83972]
 gi|237702808|ref|ZP_04533289.1| arylsulfatase [Escherichia sp. 3_2_53FAA]
 gi|300985749|ref|ZP_07177575.1| arylsulfatase [Escherichia coli MS 45-1]
 gi|331660144|ref|ZP_08361080.1| arylsulfatase [Escherichia coli TA206]
 gi|386601825|ref|YP_006103331.1| arylsulfatase [Escherichia coli IHE3034]
 gi|386606378|ref|YP_006112678.1| arylsulfatase [Escherichia coli UM146]
 gi|386631737|ref|YP_006151457.1| arylsulfatase [Escherichia coli str. 'clone D i2']
 gi|386636657|ref|YP_006156376.1| arylsulfatase [Escherichia coli str. 'clone D i14']
 gi|386641433|ref|YP_006108231.1| arylsulfatase [Escherichia coli ABU 83972]
 gi|387619093|ref|YP_006122115.1| arylsulfatase-like enzyme [Escherichia coli O83:H1 str. NRG 857C]
 gi|417087757|ref|ZP_11954615.1| arylsulfatase [Escherichia coli cloneA_i1]
 gi|419702641|ref|ZP_14230230.1| arylsulfatase [Escherichia coli SCI-07]
 gi|419943286|ref|ZP_14459846.1| arylsulfatase [Escherichia coli HM605]
 gi|422361501|ref|ZP_16442123.1| arylsulfatase [Escherichia coli MS 110-3]
 gi|422364128|ref|ZP_16444656.1| arylsulfatase [Escherichia coli MS 153-1]
 gi|422381318|ref|ZP_16461486.1| arylsulfatase [Escherichia coli MS 57-2]
 gi|422752092|ref|ZP_16805997.1| sulfatase [Escherichia coli H252]
 gi|422757517|ref|ZP_16811335.1| sulfatase [Escherichia coli H263]
 gi|422842088|ref|ZP_16890054.1| arylsulfatase [Escherichia coli H397]
 gi|432360252|ref|ZP_19603463.1| arylsulfatase [Escherichia coli KTE4]
 gi|432365052|ref|ZP_19608205.1| arylsulfatase [Escherichia coli KTE5]
 gi|432408873|ref|ZP_19651574.1| arylsulfatase [Escherichia coli KTE28]
 gi|432414063|ref|ZP_19656715.1| arylsulfatase [Escherichia coli KTE39]
 gi|432434023|ref|ZP_19676445.1| arylsulfatase [Escherichia coli KTE187]
 gi|432438756|ref|ZP_19681132.1| arylsulfatase [Escherichia coli KTE188]
 gi|432458941|ref|ZP_19701114.1| arylsulfatase [Escherichia coli KTE201]
 gi|432493051|ref|ZP_19734879.1| arylsulfatase [Escherichia coli KTE214]
 gi|432506691|ref|ZP_19748408.1| arylsulfatase [Escherichia coli KTE220]
 gi|432526272|ref|ZP_19763383.1| arylsulfatase [Escherichia coli KTE230]
 gi|432555880|ref|ZP_19792596.1| arylsulfatase [Escherichia coli KTE47]
 gi|432571073|ref|ZP_19807577.1| arylsulfatase [Escherichia coli KTE53]
 gi|432576042|ref|ZP_19812509.1| arylsulfatase [Escherichia coli KTE55]
 gi|432590252|ref|ZP_19826602.1| arylsulfatase [Escherichia coli KTE58]
 gi|432595012|ref|ZP_19831322.1| arylsulfatase [Escherichia coli KTE60]
 gi|432600055|ref|ZP_19836323.1| arylsulfatase [Escherichia coli KTE62]
 gi|432605236|ref|ZP_19841445.1| arylsulfatase [Escherichia coli KTE67]
 gi|432653453|ref|ZP_19889189.1| arylsulfatase [Escherichia coli KTE87]
 gi|432756755|ref|ZP_19991298.1| arylsulfatase [Escherichia coli KTE22]
 gi|432780960|ref|ZP_20015175.1| arylsulfatase [Escherichia coli KTE59]
 gi|432785785|ref|ZP_20019960.1| arylsulfatase [Escherichia coli KTE63]
 gi|432789824|ref|ZP_20023950.1| arylsulfatase [Escherichia coli KTE65]
 gi|432818588|ref|ZP_20052309.1| arylsulfatase [Escherichia coli KTE118]
 gi|432824720|ref|ZP_20058383.1| arylsulfatase [Escherichia coli KTE123]
 gi|432847019|ref|ZP_20079530.1| arylsulfatase [Escherichia coli KTE141]
 gi|432901390|ref|ZP_20111476.1| arylsulfatase [Escherichia coli KTE192]
 gi|432976023|ref|ZP_20164854.1| arylsulfatase [Escherichia coli KTE209]
 gi|432997582|ref|ZP_20186161.1| arylsulfatase [Escherichia coli KTE218]
 gi|433002177|ref|ZP_20190694.1| arylsulfatase [Escherichia coli KTE223]
 gi|433010000|ref|ZP_20198410.1| arylsulfatase [Escherichia coli KTE229]
 gi|433030748|ref|ZP_20218592.1| arylsulfatase [Escherichia coli KTE109]
 gi|433060323|ref|ZP_20247353.1| arylsulfatase [Escherichia coli KTE124]
 gi|433089526|ref|ZP_20275883.1| arylsulfatase [Escherichia coli KTE137]
 gi|433117730|ref|ZP_20303508.1| arylsulfatase [Escherichia coli KTE153]
 gi|433127432|ref|ZP_20312972.1| arylsulfatase [Escherichia coli KTE160]
 gi|433141506|ref|ZP_20326742.1| arylsulfatase [Escherichia coli KTE167]
 gi|433151458|ref|ZP_20336453.1| arylsulfatase [Escherichia coli KTE174]
 gi|433165816|ref|ZP_20350540.1| arylsulfatase [Escherichia coli KTE179]
 gi|433170813|ref|ZP_20355427.1| arylsulfatase [Escherichia coli KTE180]
 gi|433209948|ref|ZP_20393610.1| arylsulfatase [Escherichia coli KTE97]
 gi|433214827|ref|ZP_20398400.1| arylsulfatase [Escherichia coli KTE99]
 gi|442603424|ref|ZP_21018314.1| Arylsulfatase [Escherichia coli Nissle 1917]
 gi|26110968|gb|AAN83152.1|AE016769_267 Arylsulfatase [Escherichia coli CFT073]
 gi|91074895|gb|ABE09776.1| arylsulfatase [Escherichia coli UTI89]
 gi|115515182|gb|ABJ03257.1| arylsulfatase-like enzyme [Escherichia coli APEC O1]
 gi|218367633|emb|CAR05416.1| arylsulfatase-like enzyme [Escherichia coli S88]
 gi|222035499|emb|CAP78244.1| Arylsulfatase [Escherichia coli LF82]
 gi|226902979|gb|EEH89238.1| arylsulfatase [Escherichia sp. 3_2_53FAA]
 gi|227834456|gb|EEJ44922.1| arylsulfatase [Escherichia coli 83972]
 gi|294491821|gb|ADE90577.1| arylsulfatase [Escherichia coli IHE3034]
 gi|300407975|gb|EFJ91513.1| arylsulfatase [Escherichia coli MS 45-1]
 gi|307555925|gb|ADN48700.1| arylsulfatase [Escherichia coli ABU 83972]
 gi|307628862|gb|ADN73166.1| arylsulfatase [Escherichia coli UM146]
 gi|312948354|gb|ADR29181.1| arylsulfatase-like enzyme [Escherichia coli O83:H1 str. NRG 857C]
 gi|315284686|gb|EFU44131.1| arylsulfatase [Escherichia coli MS 110-3]
 gi|315293143|gb|EFU52495.1| arylsulfatase [Escherichia coli MS 153-1]
 gi|323949318|gb|EGB45208.1| sulfatase [Escherichia coli H252]
 gi|323954005|gb|EGB49803.1| sulfatase [Escherichia coli H263]
 gi|324007464|gb|EGB76683.1| arylsulfatase [Escherichia coli MS 57-2]
 gi|331052712|gb|EGI24747.1| arylsulfatase [Escherichia coli TA206]
 gi|355349486|gb|EHF98691.1| arylsulfatase [Escherichia coli cloneA_i1]
 gi|355422636|gb|AER86833.1| arylsulfatase [Escherichia coli str. 'clone D i2']
 gi|355427556|gb|AER91752.1| arylsulfatase [Escherichia coli str. 'clone D i14']
 gi|371602152|gb|EHN90863.1| arylsulfatase [Escherichia coli H397]
 gi|380346174|gb|EIA34473.1| arylsulfatase [Escherichia coli SCI-07]
 gi|388421298|gb|EIL80915.1| arylsulfatase [Escherichia coli HM605]
 gi|430873064|gb|ELB96643.1| arylsulfatase [Escherichia coli KTE4]
 gi|430883010|gb|ELC06017.1| arylsulfatase [Escherichia coli KTE5]
 gi|430925914|gb|ELC46510.1| arylsulfatase [Escherichia coli KTE28]
 gi|430932513|gb|ELC52934.1| arylsulfatase [Escherichia coli KTE39]
 gi|430950092|gb|ELC69482.1| arylsulfatase [Escherichia coli KTE187]
 gi|430959635|gb|ELC77946.1| arylsulfatase [Escherichia coli KTE188]
 gi|430978961|gb|ELC95750.1| arylsulfatase [Escherichia coli KTE201]
 gi|431030675|gb|ELD43681.1| arylsulfatase [Escherichia coli KTE214]
 gi|431034586|gb|ELD46512.1| arylsulfatase [Escherichia coli KTE220]
 gi|431047332|gb|ELD57333.1| arylsulfatase [Escherichia coli KTE230]
 gi|431080812|gb|ELD87602.1| arylsulfatase [Escherichia coli KTE47]
 gi|431096853|gb|ELE02308.1| arylsulfatase [Escherichia coli KTE53]
 gi|431104181|gb|ELE08784.1| arylsulfatase [Escherichia coli KTE55]
 gi|431117359|gb|ELE20598.1| arylsulfatase [Escherichia coli KTE58]
 gi|431125512|gb|ELE27914.1| arylsulfatase [Escherichia coli KTE60]
 gi|431127282|gb|ELE29584.1| arylsulfatase [Escherichia coli KTE62]
 gi|431144258|gb|ELE45965.1| arylsulfatase [Escherichia coli KTE67]
 gi|431186570|gb|ELE86110.1| arylsulfatase [Escherichia coli KTE87]
 gi|431299643|gb|ELF89214.1| arylsulfatase [Escherichia coli KTE22]
 gi|431323810|gb|ELG11276.1| arylsulfatase [Escherichia coli KTE59]
 gi|431325691|gb|ELG13072.1| arylsulfatase [Escherichia coli KTE63]
 gi|431334993|gb|ELG22137.1| arylsulfatase [Escherichia coli KTE65]
 gi|431373409|gb|ELG59015.1| arylsulfatase [Escherichia coli KTE118]
 gi|431377662|gb|ELG62788.1| arylsulfatase [Escherichia coli KTE123]
 gi|431392061|gb|ELG75664.1| arylsulfatase [Escherichia coli KTE141]
 gi|431422034|gb|ELH04229.1| arylsulfatase [Escherichia coli KTE192]
 gi|431485157|gb|ELH64821.1| arylsulfatase [Escherichia coli KTE209]
 gi|431501773|gb|ELH80749.1| arylsulfatase [Escherichia coli KTE218]
 gi|431504449|gb|ELH83075.1| arylsulfatase [Escherichia coli KTE223]
 gi|431520843|gb|ELH98162.1| arylsulfatase [Escherichia coli KTE229]
 gi|431540067|gb|ELI15698.1| arylsulfatase [Escherichia coli KTE109]
 gi|431565570|gb|ELI38649.1| arylsulfatase [Escherichia coli KTE124]
 gi|431600472|gb|ELI70142.1| arylsulfatase [Escherichia coli KTE137]
 gi|431630329|gb|ELI98666.1| arylsulfatase [Escherichia coli KTE153]
 gi|431639991|gb|ELJ07757.1| arylsulfatase [Escherichia coli KTE160]
 gi|431655359|gb|ELJ22392.1| arylsulfatase [Escherichia coli KTE167]
 gi|431666969|gb|ELJ33591.1| arylsulfatase [Escherichia coli KTE174]
 gi|431683098|gb|ELJ48737.1| arylsulfatase [Escherichia coli KTE179]
 gi|431683712|gb|ELJ49340.1| arylsulfatase [Escherichia coli KTE180]
 gi|431728000|gb|ELJ91727.1| arylsulfatase [Escherichia coli KTE97]
 gi|431731386|gb|ELJ94889.1| arylsulfatase [Escherichia coli KTE99]
 gi|441715848|emb|CCQ04291.1| Arylsulfatase [Escherichia coli Nissle 1917]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|415773843|ref|ZP_11486390.1| arylsulfatase [Escherichia coli 3431]
 gi|417615460|ref|ZP_12265908.1| arylsulfatase [Escherichia coli STEC_EH250]
 gi|417620470|ref|ZP_12270871.1| arylsulfatase [Escherichia coli G58-1]
 gi|418960318|ref|ZP_13512209.1| arylsulfatase [Escherichia coli J53]
 gi|422773882|ref|ZP_16827563.1| sulfatase [Escherichia coli E482]
 gi|425275090|ref|ZP_18666469.1| sulfatase [Escherichia coli TW15901]
 gi|425285668|ref|ZP_18676680.1| sulfatase [Escherichia coli TW00353]
 gi|315618503|gb|EFU99089.1| arylsulfatase [Escherichia coli 3431]
 gi|323938937|gb|EGB35156.1| sulfatase [Escherichia coli E482]
 gi|345357636|gb|EGW89828.1| arylsulfatase [Escherichia coli STEC_EH250]
 gi|345369687|gb|EGX01669.1| arylsulfatase [Escherichia coli G58-1]
 gi|384376925|gb|EIE34825.1| arylsulfatase [Escherichia coli J53]
 gi|408189606|gb|EKI15317.1| sulfatase [Escherichia coli TW15901]
 gi|408197795|gb|EKI23046.1| sulfatase [Escherichia coli TW00353]
          Length = 531

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|415838432|ref|ZP_11520403.1| arylsulfatase [Escherichia coli RN587/1]
 gi|417282000|ref|ZP_12069300.1| arylsulfatase [Escherichia coli 3003]
 gi|425280249|ref|ZP_18671461.1| sulfatase [Escherichia coli ARS4.2123]
 gi|323189479|gb|EFZ74759.1| arylsulfatase [Escherichia coli RN587/1]
 gi|386246329|gb|EII88059.1| arylsulfatase [Escherichia coli 3003]
 gi|408197402|gb|EKI22665.1| sulfatase [Escherichia coli ARS4.2123]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|309784517|ref|ZP_07679155.1| arylsulfatase [Shigella dysenteriae 1617]
 gi|308927623|gb|EFP73092.1| arylsulfatase [Shigella dysenteriae 1617]
          Length = 531

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|293413242|ref|ZP_06655904.1| conserved hypothetical protein [Escherichia coli B354]
 gi|291468190|gb|EFF10687.1| conserved hypothetical protein [Escherichia coli B354]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|425269797|ref|ZP_18661408.1| arylsulfatase [Escherichia coli 5412]
 gi|408180246|gb|EKI06871.1| arylsulfatase [Escherichia coli 5412]
          Length = 531

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|331665445|ref|ZP_08366344.1| arylsulfatase [Escherichia coli TA143]
 gi|331057343|gb|EGI29332.1| arylsulfatase [Escherichia coli TA143]
          Length = 531

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|218550978|ref|YP_002384769.1| arylsulfatase-like protein [Escherichia fergusonii ATCC 35469]
 gi|218358519|emb|CAQ91166.1| arylsulfatase-like enzyme [Escherichia fergusonii ATCC 35469]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|419912589|ref|ZP_14431039.1| arylsulfatase [Escherichia coli KD1]
 gi|433200564|ref|ZP_20384444.1| arylsulfatase [Escherichia coli KTE94]
 gi|388391448|gb|EIL52915.1| arylsulfatase [Escherichia coli KD1]
 gi|431716610|gb|ELJ80717.1| arylsulfatase [Escherichia coli KTE94]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 472

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 98/205 (47%), Gaps = 33/205 (16%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID--- 78
           GW D+G +G     TPNID L   G+     Y+  P C+PSRA+ LTGK P   G     
Sbjct: 40  GWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPSRASILTGKNPANIGFTGHI 99

Query: 79  TPVGA----GVAKAVP--------VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
           T +G        + +P        + EK++P+ L + GY++  IGKWH+G  +E+  P +
Sbjct: 100 TAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTSASIGKWHVG-EEEKFFPTH 158

Query: 127 RGFD-NHVGYWNG-----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQ 180
           +GF  N  GY +G     +  +            LD R            +YLT+  TD+
Sbjct: 159 QGFAINIAGYEHGSPPTYWGPFESEKSWNPVIKNLDNRE---------EGQYLTNRLTDE 209

Query: 181 SVHVIKSHNHSRPLFLQITHAAVHT 205
           +++ I   N   P FL ++H AVHT
Sbjct: 210 AINFI-DENKEGPFFLYLSHYAVHT 233


>gi|422808164|ref|ZP_16856590.1| sulfatase [Escherichia fergusonii B253]
 gi|324111024|gb|EGC05011.1| sulfatase [Escherichia fergusonii B253]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|433002698|ref|ZP_20191206.1| arylsulfatase [Escherichia coli KTE227]
 gi|433155988|ref|ZP_20340912.1| arylsulfatase [Escherichia coli KTE176]
 gi|431521739|gb|ELH98978.1| arylsulfatase [Escherichia coli KTE227]
 gi|431669827|gb|ELJ36193.1| arylsulfatase [Escherichia coli KTE176]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|416333496|ref|ZP_11670723.1| Arylsulfatase [Escherichia coli WV_060327]
 gi|320197610|gb|EFW72222.1| Arylsulfatase [Escherichia coli WV_060327]
          Length = 551

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|189404399|ref|ZP_03007442.1| arylsulfatase [Escherichia coli O157:H7 str. EC4501]
 gi|419112683|ref|ZP_13657724.1| sulfatase family protein [Escherichia coli DEC4F]
 gi|420272332|ref|ZP_14774678.1| arylsulfatase [Escherichia coli PA22]
 gi|420277929|ref|ZP_14780207.1| arylsulfatase [Escherichia coli PA40]
 gi|420300900|ref|ZP_14802942.1| arylsulfatase [Escherichia coli TW09109]
 gi|423728023|ref|ZP_17701804.1| arylsulfatase [Escherichia coli PA31]
 gi|424080129|ref|ZP_17817068.1| arylsulfatase [Escherichia coli FDA505]
 gi|424092938|ref|ZP_17828845.1| arylsulfatase [Escherichia coli FRIK1996]
 gi|424099629|ref|ZP_17834866.1| arylsulfatase [Escherichia coli FRIK1985]
 gi|424105821|ref|ZP_17840535.1| arylsulfatase [Escherichia coli FRIK1990]
 gi|424112461|ref|ZP_17846671.1| arylsulfatase [Escherichia coli 93-001]
 gi|424118395|ref|ZP_17852214.1| arylsulfatase [Escherichia coli PA3]
 gi|424124595|ref|ZP_17857876.1| arylsulfatase [Escherichia coli PA5]
 gi|424130759|ref|ZP_17863645.1| arylsulfatase [Escherichia coli PA9]
 gi|424137072|ref|ZP_17869492.1| arylsulfatase [Escherichia coli PA10]
 gi|424143628|ref|ZP_17875464.1| arylsulfatase [Escherichia coli PA14]
 gi|424458495|ref|ZP_17909576.1| arylsulfatase [Escherichia coli PA33]
 gi|424471258|ref|ZP_17921040.1| arylsulfatase [Escherichia coli PA41]
 gi|424496418|ref|ZP_17943938.1| arylsulfatase [Escherichia coli TW09195]
 gi|424534867|ref|ZP_17978199.1| arylsulfatase [Escherichia coli EC4422]
 gi|424540955|ref|ZP_17983883.1| arylsulfatase [Escherichia coli EC4013]
 gi|424547101|ref|ZP_17989416.1| arylsulfatase [Escherichia coli EC4402]
 gi|424553297|ref|ZP_17995108.1| arylsulfatase [Escherichia coli EC4439]
 gi|424559500|ref|ZP_18000878.1| arylsulfatase [Escherichia coli EC4436]
 gi|424571948|ref|ZP_18012466.1| arylsulfatase [Escherichia coli EC4448]
 gi|424578107|ref|ZP_18018125.1| arylsulfatase [Escherichia coli EC1845]
 gi|424583930|ref|ZP_18023560.1| arylsulfatase [Escherichia coli EC1863]
 gi|425158660|ref|ZP_18557907.1| arylsulfatase [Escherichia coli PA34]
 gi|425164979|ref|ZP_18563850.1| arylsulfatase [Escherichia coli FDA506]
 gi|425170725|ref|ZP_18569183.1| arylsulfatase [Escherichia coli FDA507]
 gi|425176770|ref|ZP_18574874.1| arylsulfatase [Escherichia coli FDA504]
 gi|425189128|ref|ZP_18586383.1| arylsulfatase [Escherichia coli FRIK1997]
 gi|425195857|ref|ZP_18592612.1| arylsulfatase [Escherichia coli NE1487]
 gi|425202335|ref|ZP_18598528.1| arylsulfatase [Escherichia coli NE037]
 gi|425208713|ref|ZP_18604495.1| arylsulfatase [Escherichia coli FRIK2001]
 gi|425257550|ref|ZP_18650031.1| arylsulfatase [Escherichia coli CB7326]
 gi|425313969|ref|ZP_18703121.1| arylsulfatase [Escherichia coli EC1735]
 gi|425319950|ref|ZP_18708712.1| arylsulfatase [Escherichia coli EC1736]
 gi|425326088|ref|ZP_18714400.1| arylsulfatase [Escherichia coli EC1737]
 gi|425332401|ref|ZP_18720199.1| arylsulfatase [Escherichia coli EC1846]
 gi|425338577|ref|ZP_18725901.1| arylsulfatase [Escherichia coli EC1847]
 gi|425344871|ref|ZP_18731744.1| arylsulfatase [Escherichia coli EC1848]
 gi|425350712|ref|ZP_18737155.1| arylsulfatase [Escherichia coli EC1849]
 gi|425375503|ref|ZP_18760127.1| arylsulfatase [Escherichia coli EC1864]
 gi|425388390|ref|ZP_18771933.1| arylsulfatase [Escherichia coli EC1866]
 gi|189365812|gb|EDU84228.1| arylsulfatase [Escherichia coli O157:H7 str. EC4501]
 gi|377952239|gb|EHV15835.1| sulfatase family protein [Escherichia coli DEC4F]
 gi|390637152|gb|EIN16708.1| arylsulfatase [Escherichia coli FRIK1996]
 gi|390637579|gb|EIN17122.1| arylsulfatase [Escherichia coli FDA505]
 gi|390655840|gb|EIN33752.1| arylsulfatase [Escherichia coli FRIK1985]
 gi|390656638|gb|EIN34498.1| arylsulfatase [Escherichia coli 93-001]
 gi|390659504|gb|EIN37266.1| arylsulfatase [Escherichia coli FRIK1990]
 gi|390674022|gb|EIN50230.1| arylsulfatase [Escherichia coli PA3]
 gi|390677315|gb|EIN53370.1| arylsulfatase [Escherichia coli PA5]
 gi|390680688|gb|EIN56515.1| arylsulfatase [Escherichia coli PA9]
 gi|390691949|gb|EIN66669.1| arylsulfatase [Escherichia coli PA10]
 gi|390696242|gb|EIN70731.1| arylsulfatase [Escherichia coli PA14]
 gi|390711207|gb|EIN84190.1| arylsulfatase [Escherichia coli PA22]
 gi|390736938|gb|EIO08254.1| arylsulfatase [Escherichia coli PA31]
 gi|390741168|gb|EIO12258.1| arylsulfatase [Escherichia coli PA33]
 gi|390755740|gb|EIO25271.1| arylsulfatase [Escherichia coli PA40]
 gi|390761899|gb|EIO31170.1| arylsulfatase [Escherichia coli PA41]
 gi|390804528|gb|EIO71494.1| arylsulfatase [Escherichia coli TW09109]
 gi|390821974|gb|EIO88123.1| arylsulfatase [Escherichia coli TW09195]
 gi|390858190|gb|EIP20598.1| arylsulfatase [Escherichia coli EC4422]
 gi|390862478|gb|EIP24661.1| arylsulfatase [Escherichia coli EC4013]
 gi|390866610|gb|EIP28560.1| arylsulfatase [Escherichia coli EC4402]
 gi|390874882|gb|EIP35966.1| arylsulfatase [Escherichia coli EC4439]
 gi|390880252|gb|EIP40944.1| arylsulfatase [Escherichia coli EC4436]
 gi|390891496|gb|EIP51124.1| arylsulfatase [Escherichia coli EC4448]
 gi|390915602|gb|EIP74111.1| arylsulfatase [Escherichia coli EC1845]
 gi|390915802|gb|EIP74302.1| arylsulfatase [Escherichia coli EC1863]
 gi|408065071|gb|EKG99547.1| arylsulfatase [Escherichia coli PA34]
 gi|408075209|gb|EKH09447.1| arylsulfatase [Escherichia coli FDA506]
 gi|408080203|gb|EKH14287.1| arylsulfatase [Escherichia coli FDA507]
 gi|408088389|gb|EKH21761.1| arylsulfatase [Escherichia coli FDA504]
 gi|408100742|gb|EKH33224.1| arylsulfatase [Escherichia coli FRIK1997]
 gi|408105667|gb|EKH37814.1| arylsulfatase [Escherichia coli NE1487]
 gi|408112477|gb|EKH44127.1| arylsulfatase [Escherichia coli NE037]
 gi|408118660|gb|EKH49779.1| arylsulfatase [Escherichia coli FRIK2001]
 gi|408170353|gb|EKH97562.1| arylsulfatase [Escherichia coli CB7326]
 gi|408223502|gb|EKI47271.1| arylsulfatase [Escherichia coli EC1735]
 gi|408235040|gb|EKI58027.1| arylsulfatase [Escherichia coli EC1736]
 gi|408237773|gb|EKI60619.1| arylsulfatase [Escherichia coli EC1737]
 gi|408243000|gb|EKI65548.1| arylsulfatase [Escherichia coli EC1846]
 gi|408251826|gb|EKI73540.1| arylsulfatase [Escherichia coli EC1847]
 gi|408256119|gb|EKI77512.1| arylsulfatase [Escherichia coli EC1848]
 gi|408262776|gb|EKI83690.1| arylsulfatase [Escherichia coli EC1849]
 gi|408288447|gb|EKJ07270.1| arylsulfatase [Escherichia coli EC1864]
 gi|408304492|gb|EKJ21917.1| arylsulfatase [Escherichia coli EC1866]
          Length = 531

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|305667515|ref|YP_003863802.1| N-acetylgalactosamine 6-sulfatase [Maribacter sp. HTCC2170]
 gi|88709563|gb|EAR01796.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Maribacter sp. HTCC2170]
          Length = 596

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 3/118 (2%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
           QGW D+ F+G  ++ TPNIDA+A NG      Y  P C+P+RA  LTGKY  R G+ +  
Sbjct: 47  QGWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQPVCSPTRAELLTGKYAARLGVYSTS 106

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
             G  +     E  + +  K+ GY T   GKWH G  +    P +RGFD++ G+ +G+
Sbjct: 107 TGG--ERFNSKETTIAEIFKKAGYKTTAYGKWHSGM-QPPYHPNSRGFDDYYGFTSGH 161


>gi|194438593|ref|ZP_03070681.1| arylsulfatase [Escherichia coli 101-1]
 gi|293407428|ref|ZP_06651348.1| arylsulfatase [Escherichia coli FVEC1412]
 gi|298383168|ref|ZP_06992762.1| arylsulfatase [Escherichia coli FVEC1302]
 gi|442596910|ref|ZP_21014711.1| Arylsulfatase [Escherichia coli O5:K4(L):H4 str. ATCC 23502]
 gi|194422397|gb|EDX38396.1| arylsulfatase [Escherichia coli 101-1]
 gi|291425539|gb|EFE98577.1| arylsulfatase [Escherichia coli FVEC1412]
 gi|298276404|gb|EFI17923.1| arylsulfatase [Escherichia coli FVEC1302]
 gi|441654658|emb|CCQ00624.1| Arylsulfatase [Escherichia coli O5:K4(L):H4 str. ATCC 23502]
          Length = 531

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 518

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 95/197 (48%), Gaps = 15/197 (7%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           GW D+G  G     TPN+D +A  G ++   Y   P C+PSRA+ LTG+ P R G  T  
Sbjct: 38  GWGDLGVLGNPAKETPNLDRMASEGALMTDFYAPNPLCSPSRASLLTGRLPIRNGFYTTN 97

Query: 82  GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
                      +   +P +E +LP+ L + GY + +IGKWH+G ++ +  P   GFD + 
Sbjct: 98  DHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRSKIIGKWHLG-HQTQYHPLKHGFDEYF 156

Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSSKY-LTDFFTDQSVHVI-KSH 188
           G  N ++   D+  + +  V  DA    R  E +    S +  LT  F ++++  I K H
Sbjct: 157 GAPNCHVGPYDNKKQPNIPVYRDADMIGRYYEEFKIDKSGESNLTQMFIEEAIAFIEKQH 216

Query: 189 NHSRPLFLQITHAAVHT 205
                 FL  T  A H+
Sbjct: 217 QTGEQFFLYWTPDASHS 233


>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
          Length = 558

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 49/122 (40%), Positives = 72/122 (59%), Gaps = 4/122 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
           G+ D G+   + I TPNID L  +GI     Y+   C+PSR++FL+G+YP++ G+   V 
Sbjct: 82  GFQDAGYR-NSAIHTPNIDKLVGDGISFTNAYSSQQCSPSRSSFLSGRYPYKSGMQHGVI 140

Query: 83  AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNGYL 140
           +      + +  K L  YLK+L Y+TH +GKWH+G CNK +  P  RGFD   G ++G  
Sbjct: 141 SDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNK-KCTPTYRGFDTFSGGYSGEG 199

Query: 141 TY 142
            Y
Sbjct: 200 DY 201


>gi|32476258|ref|NP_869252.1| arylsulfatase A [Rhodopirellula baltica SH 1]
 gi|32446802|emb|CAD76638.1| arylsulfatase A [Rhodopirellula baltica SH 1]
          Length = 489

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 102/200 (51%), Gaps = 19/200 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ND+G +G  +I TPN+D LA  G      Y+    C+PSRAA LTG YP R G+   
Sbjct: 57  QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 116

Query: 81  V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
           V        +   E  +  +LK  GY+T  +GKWH+G +K E LP + GFD++ G  Y N
Sbjct: 117 VLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 175

Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
                     G ++ +D   +   AV L      ++ E     +  + +T  +TD+++  
Sbjct: 176 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEF 235

Query: 185 IKSHNHSRPLFLQITHAAVH 204
           +++ N  +P FL + H+  H
Sbjct: 236 VEA-NQDKPFFLYLPHSMPH 254


>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 443

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 95/194 (48%), Gaps = 8/194 (4%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D+G +G   I TP +D +A +G+ L   Y   P CTP+RAA +TG Y  R G+ TP+
Sbjct: 45  GYGDLGCYGSESIRTPRLDRMAASGMKLTSFYAAAPICTPTRAALMTGCYATRVGLPTPL 104

Query: 82  GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
                  +  +E  L + +++ GY T  +GKWH+G ++    P   GF NH  YW   L 
Sbjct: 105 HVYDEIGINESEFTLGEAMQQCGYETVCVGKWHLG-HQPRFYPTEHGF-NH--YWGTPLG 160

Query: 142 YNDSIHETDFAVG--LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
           +  +      A+G   D   +  R  P      LT+  T+++V  I++    RP FL + 
Sbjct: 161 HMFNRPAVGKAIGDTSDLFLDDTREIPFPEDADLTERLTEKAVEFIEA-KRDRPFFLFLA 219

Query: 200 HAAVHTGTAGNAKL 213
           H   H   A + K 
Sbjct: 220 HPMPHEPLAASEKF 233


>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 389

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 74/137 (54%), Gaps = 5/137 (3%)

Query: 5   VGAGVAKAVPVTEKLLPQGWN--DVGFHGENDIPTPNIDALAYN-GIVLNRHYTLPTCTP 61
           + A    A P    +L   W   DV F     I +P+ ++LA   G++L+RHY    C+P
Sbjct: 14  IAAATVNAKPNLVFVLVDDWGFADVSFRNPA-ISSPHFESLATKEGLILDRHYVFKYCSP 72

Query: 62  SRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEE 121
           SRA+FLTG++P       P  +G+  A  +   ++P  LK  GY TH++GKWH G   ++
Sbjct: 73  SRASFLTGRFPHHAHQWNPPQSGLVGA-NINMTMIPAKLKTAGYKTHMVGKWHEGFYLKK 131

Query: 122 LLPFNRGFDNHVGYWNG 138
            LP NRGFD   G+  G
Sbjct: 132 FLPINRGFDTMSGFLGG 148


>gi|306815164|ref|ZP_07449317.1| arylsulfatase [Escherichia coli NC101]
 gi|432383695|ref|ZP_19626619.1| arylsulfatase [Escherichia coli KTE15]
 gi|432389603|ref|ZP_19632481.1| arylsulfatase [Escherichia coli KTE16]
 gi|432516187|ref|ZP_19753401.1| arylsulfatase [Escherichia coli KTE224]
 gi|432613801|ref|ZP_19849957.1| arylsulfatase [Escherichia coli KTE72]
 gi|432648469|ref|ZP_19884253.1| arylsulfatase [Escherichia coli KTE86]
 gi|432658034|ref|ZP_19893730.1| arylsulfatase [Escherichia coli KTE93]
 gi|432701313|ref|ZP_19936456.1| arylsulfatase [Escherichia coli KTE169]
 gi|432747772|ref|ZP_19982433.1| arylsulfatase [Escherichia coli KTE43]
 gi|432907621|ref|ZP_20116004.1| arylsulfatase [Escherichia coli KTE194]
 gi|432940617|ref|ZP_20138518.1| arylsulfatase [Escherichia coli KTE183]
 gi|432974071|ref|ZP_20162913.1| arylsulfatase [Escherichia coli KTE207]
 gi|432987644|ref|ZP_20176354.1| arylsulfatase [Escherichia coli KTE215]
 gi|433040814|ref|ZP_20228399.1| arylsulfatase [Escherichia coli KTE113]
 gi|433084725|ref|ZP_20271169.1| arylsulfatase [Escherichia coli KTE133]
 gi|433103396|ref|ZP_20289464.1| arylsulfatase [Escherichia coli KTE145]
 gi|433146435|ref|ZP_20331564.1| arylsulfatase [Escherichia coli KTE168]
 gi|433190604|ref|ZP_20374689.1| arylsulfatase [Escherichia coli KTE88]
 gi|305851533|gb|EFM51987.1| arylsulfatase [Escherichia coli NC101]
 gi|430902979|gb|ELC24724.1| arylsulfatase [Escherichia coli KTE16]
 gi|430903083|gb|ELC24827.1| arylsulfatase [Escherichia coli KTE15]
 gi|431037897|gb|ELD48867.1| arylsulfatase [Escherichia coli KTE224]
 gi|431146038|gb|ELE47637.1| arylsulfatase [Escherichia coli KTE72]
 gi|431177479|gb|ELE77403.1| arylsulfatase [Escherichia coli KTE86]
 gi|431188145|gb|ELE87644.1| arylsulfatase [Escherichia coli KTE93]
 gi|431239692|gb|ELF34164.1| arylsulfatase [Escherichia coli KTE169]
 gi|431289672|gb|ELF80413.1| arylsulfatase [Escherichia coli KTE43]
 gi|431427116|gb|ELH09159.1| arylsulfatase [Escherichia coli KTE194]
 gi|431459667|gb|ELH39959.1| arylsulfatase [Escherichia coli KTE183]
 gi|431478375|gb|ELH58123.1| arylsulfatase [Escherichia coli KTE207]
 gi|431493817|gb|ELH73409.1| arylsulfatase [Escherichia coli KTE215]
 gi|431548007|gb|ELI22297.1| arylsulfatase [Escherichia coli KTE113]
 gi|431597311|gb|ELI67218.1| arylsulfatase [Escherichia coli KTE133]
 gi|431615727|gb|ELI84849.1| arylsulfatase [Escherichia coli KTE145]
 gi|431657075|gb|ELJ24043.1| arylsulfatase [Escherichia coli KTE168]
 gi|431701561|gb|ELJ66476.1| arylsulfatase [Escherichia coli KTE88]
          Length = 494

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 40  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 99

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 100 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 153


>gi|218692077|ref|YP_002400289.1| arylsulfatase-like enzyme [Escherichia coli ED1a]
 gi|218429641|emb|CAR10603.2| arylsulfatase-like enzyme [Escherichia coli ED1a]
          Length = 551

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYITQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|440714613|ref|ZP_20895192.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436440809|gb|ELP34113.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 470

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 102/200 (51%), Gaps = 19/200 (9%)

Query: 22  QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
           QG+ND+G +G  +I TPN+D LA  G      Y+    C+PSRAA LTG YP R G+   
Sbjct: 38  QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 97

Query: 81  V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
           V        +   E  +  +LK  GY+T  +GKWH+G +K E LP + GFD++ G  Y N
Sbjct: 98  VLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 156

Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
                     G ++ +D   +   AV L      ++ E     +  + +T  +TD+++  
Sbjct: 157 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEF 216

Query: 185 IKSHNHSRPLFLQITHAAVH 204
           +++ N  +P FL + H+  H
Sbjct: 217 VEA-NQDKPFFLYLPHSMPH 235


>gi|419156315|ref|ZP_13700868.1| sulfatase family protein, partial [Escherichia coli DEC6C]
 gi|377992619|gb|EHV55765.1| sulfatase family protein, partial [Escherichia coli DEC6C]
          Length = 370

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 77  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190


>gi|110644124|ref|YP_671854.1| arylsulfatase [Escherichia coli 536]
 gi|191174275|ref|ZP_03035784.1| arylsulfatase [Escherichia coli F11]
 gi|215489127|ref|YP_002331558.1| acrylsulfatase-like protein [Escherichia coli O127:H6 str.
           E2348/69]
 gi|300979327|ref|ZP_07174511.1| arylsulfatase [Escherichia coli MS 200-1]
 gi|312969473|ref|ZP_07783675.1| arylsulfatase [Escherichia coli 2362-75]
 gi|417758226|ref|ZP_12406286.1| sulfatase family protein [Escherichia coli DEC2B]
 gi|418999243|ref|ZP_13546819.1| sulfatase family protein [Escherichia coli DEC1A]
 gi|419004605|ref|ZP_13552112.1| sulfatase family protein [Escherichia coli DEC1B]
 gi|419010286|ref|ZP_13557693.1| sulfatase family protein [Escherichia coli DEC1C]
 gi|419015988|ref|ZP_13563321.1| sulfatase family protein [Escherichia coli DEC1D]
 gi|419020913|ref|ZP_13568209.1| sulfatase family protein [Escherichia coli DEC1E]
 gi|419031516|ref|ZP_13578655.1| sulfatase family protein [Escherichia coli DEC2C]
 gi|419037120|ref|ZP_13584190.1| sulfatase family protein [Escherichia coli DEC2D]
 gi|419042214|ref|ZP_13589228.1| sulfatase family protein [Escherichia coli DEC2E]
 gi|422373936|ref|ZP_16454231.1| arylsulfatase [Escherichia coli MS 60-1]
 gi|432443330|ref|ZP_19685662.1| arylsulfatase [Escherichia coli KTE189]
 gi|432448474|ref|ZP_19690769.1| arylsulfatase [Escherichia coli KTE191]
 gi|432473153|ref|ZP_19715188.1| arylsulfatase [Escherichia coli KTE206]
 gi|432585327|ref|ZP_19821717.1| arylsulfatase [Escherichia coli KTE57]
 gi|432715659|ref|ZP_19950682.1| arylsulfatase [Escherichia coli KTE8]
 gi|432734553|ref|ZP_19969374.1| arylsulfatase [Escherichia coli KTE45]
 gi|432761638|ref|ZP_19996125.1| arylsulfatase [Escherichia coli KTE46]
 gi|432804034|ref|ZP_20037983.1| arylsulfatase [Escherichia coli KTE84]
 gi|433016118|ref|ZP_20204444.1| arylsulfatase [Escherichia coli KTE104]
 gi|433025709|ref|ZP_20213674.1| arylsulfatase [Escherichia coli KTE106]
 gi|433080012|ref|ZP_20266526.1| arylsulfatase [Escherichia coli KTE131]
 gi|433122417|ref|ZP_20308070.1| arylsulfatase [Escherichia coli KTE157]
 gi|433325279|ref|ZP_20402423.1| arylsulfatase [Escherichia coli J96]
 gi|110345716|gb|ABG71953.1| arylsulfatase [Escherichia coli 536]
 gi|190905458|gb|EDV65088.1| arylsulfatase [Escherichia coli F11]
 gi|215267199|emb|CAS11647.1| acrylsulfatase-like enzyme [Escherichia coli O127:H6 str. E2348/69]
 gi|300308080|gb|EFJ62600.1| arylsulfatase [Escherichia coli MS 200-1]
 gi|312286020|gb|EFR13938.1| arylsulfatase [Escherichia coli 2362-75]
 gi|324014744|gb|EGB83963.1| arylsulfatase [Escherichia coli MS 60-1]
 gi|377838924|gb|EHU04028.1| sulfatase family protein [Escherichia coli DEC1C]
 gi|377838996|gb|EHU04098.1| sulfatase family protein [Escherichia coli DEC1A]
 gi|377841721|gb|EHU06782.1| sulfatase family protein [Escherichia coli DEC1B]
 gi|377852838|gb|EHU17750.1| sulfatase family protein [Escherichia coli DEC1D]
 gi|377855891|gb|EHU20754.1| sulfatase family protein [Escherichia coli DEC1E]
 gi|377870201|gb|EHU34889.1| sulfatase family protein [Escherichia coli DEC2B]
 gi|377872176|gb|EHU36825.1| sulfatase family protein [Escherichia coli DEC2C]
 gi|377874253|gb|EHU38882.1| sulfatase family protein [Escherichia coli DEC2D]
 gi|377885985|gb|EHU50474.1| sulfatase family protein [Escherichia coli DEC2E]
 gi|430962751|gb|ELC80603.1| arylsulfatase [Escherichia coli KTE189]
 gi|430970859|gb|ELC87904.1| arylsulfatase [Escherichia coli KTE191]
 gi|430995319|gb|ELD11616.1| arylsulfatase [Escherichia coli KTE206]
 gi|431114313|gb|ELE17857.1| arylsulfatase [Escherichia coli KTE57]
 gi|431251061|gb|ELF45079.1| arylsulfatase [Escherichia coli KTE8]
 gi|431270540|gb|ELF61703.1| arylsulfatase [Escherichia coli KTE45]
 gi|431305314|gb|ELF93643.1| arylsulfatase [Escherichia coli KTE46]
 gi|431345125|gb|ELG32052.1| arylsulfatase [Escherichia coli KTE84]
 gi|431526204|gb|ELI02963.1| arylsulfatase [Escherichia coli KTE104]
 gi|431530145|gb|ELI06830.1| arylsulfatase [Escherichia coli KTE106]
 gi|431592977|gb|ELI63542.1| arylsulfatase [Escherichia coli KTE131]
 gi|431638384|gb|ELJ06419.1| arylsulfatase [Escherichia coli KTE157]
 gi|432346351|gb|ELL40835.1| arylsulfatase [Escherichia coli J96]
          Length = 551

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)

Query: 23  GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
           GW DVGF+G       PTP+IDA+A  G++L   Y+ P+ +P+RA  LTG+Y   +GI  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query: 80  PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
           P   G    +      LPQ L + GY T  IGKWH+G NKE   P N GFD+  G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYITQAIGKWHMGENKES-QPQNVGFDDFRGF 210


>gi|421611816|ref|ZP_16052946.1| arylsulfatase [Rhodopirellula baltica SH28]
 gi|408497377|gb|EKK01906.1| arylsulfatase [Rhodopirellula baltica SH28]
          Length = 1553

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 96/191 (50%), Gaps = 17/191 (8%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++D+G +G  +I TPNIDALA +G+ L + Y    C PSRA+ +TG YP + GI     
Sbjct: 7   GYSDLGCYG-GEISTPNIDALAADGVKLTQVYNSARCCPSRASLMTGLYPTQAGIGDFTA 65

Query: 78  ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
              +   G G    +      + + LK  GY  + +GKWH+     +  P  RGFD   G
Sbjct: 66  REPNRTRGQGYLGRLRDDCVTMAEVLKPEGYGCYYVGKWHM---HPKTGPIKRGFDEFYG 122

Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRP 193
           Y N    ++   ++ D+ + L   R ++   P     Y TD F D ++  I+   + ++P
Sbjct: 123 YTN---DHSHDQYDADYYIRLPENR-VKEIDPPADQFYATDVFNDYAIEFIRQGQSTNKP 178

Query: 194 LFLQITHAAVH 204
            FL + H++ H
Sbjct: 179 WFLFLGHSSPH 189



 Score = 60.8 bits (146), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 53/92 (57%), Gaps = 6/92 (6%)

Query: 25  NDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPVGA 83
           +D+  +G   +PTPN++ LA  G+V +  Y T+ +C+PSR + +TG+YP   G       
Sbjct: 703 DDLSVYGNAFVPTPNLERLASKGLVFDNAYLTISSCSPSRCSMITGRYPHNTG-----AP 757

Query: 84  GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
            +   +P T++   Q L++ GY T + GK H+
Sbjct: 758 ELHTTLPETQRTFVQSLRDAGYHTVISGKNHM 789


>gi|149179303|ref|ZP_01857864.1| arylsulfatase [Planctomyces maris DSM 8797]
 gi|148841844|gb|EDL56246.1| arylsulfatase [Planctomyces maris DSM 8797]
          Length = 506

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 95/202 (47%), Gaps = 27/202 (13%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
           G++D+G +G  +I TPNIDALA  G+  ++ Y    C P+RA  +TG +P + GI     
Sbjct: 38  GFSDIGCYG-GEIETPNIDALAAGGVRFSQFYNSGRCCPTRATLMTGLHPQQTGIGWMTN 96

Query: 78  ---DT------PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
              DT      P   G      VT   L + LK  GY+T + GKWH+G N ++  P  RG
Sbjct: 97  PPGDTRGYSKPPAYQGYLNRKCVT---LAEVLKPAGYATLMTGKWHLGFNAQDRWPLQRG 153

Query: 129 FDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKS 187
           FD   G  +G   +   +       G     ++E  A     + Y TD +TD ++  +  
Sbjct: 154 FDKFFGCVSGATRFFHPVVPRGMTFG---NEDIETPASTTDRRFYTTDAYTDYAIRFLNE 210

Query: 188 HNHS-----RPLFLQITHAAVH 204
           H  +     +P FL + + A H
Sbjct: 211 HQQAKETQDKPFFLYLAYTAPH 232


>gi|443718583|gb|ELU09136.1| hypothetical protein CAPTEDRAFT_144340 [Capitella teleta]
          Length = 557

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 73/118 (61%), Gaps = 6/118 (5%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G  G + + TP++D++  NG+ L+      + CTPSRAA +T +Y  R G+++ +
Sbjct: 34  GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRSGMESVI 93

Query: 82  GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
            + ++ + +P +E  LPQ L+E GY+T LIGKWH+G N++ +     P  RGFD   G
Sbjct: 94  LSLMSPQGLPASEYTLPQMLQEQGYATALIGKWHLGWNRQLMDHYYSPLKRGFDFFFG 151


>gi|326913618|ref|XP_003203133.1| PREDICTED: steryl-sulfatase-like, partial [Meleagris gallopavo]
          Length = 485

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 66/124 (53%), Gaps = 12/124 (9%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
           G  D+G +G   +  PNID LA  G+ L +H    P CTPSRAAFLTG+YP R G+    
Sbjct: 64  GIGDLGCYGNRTLRLPNIDRLAKEGVTLTQHLAASPLCTPSRAAFLTGRYPIRSGMAAFS 123

Query: 82  GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
             GV      +  +P  E    + LK+ GY+T LIGKWH+G N E        P + GFD
Sbjct: 124 RVGVFLFSASSGGLPSEEITFSKVLKQRGYATALIGKWHLGMNCESSNDFCHHPLSHGFD 183

Query: 131 NHVG 134
              G
Sbjct: 184 YFYG 187


>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 508

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 75/238 (31%), Positives = 105/238 (44%), Gaps = 63/238 (26%)

Query: 23  GWNDVGFHG----ENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI 77
           G ND+   G    +  + TPNID LA  G + +  Y+   TC PSRA  +TG+YP R G 
Sbjct: 30  GINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPSRAMLMTGRYPTRTGF 89

Query: 78  D-TPVGAGVAKAVPV-----------------TEKLLPQY---------------LKELG 104
           + TP   G+++ VP+                  EKL+P +               LK+ G
Sbjct: 90  EYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGLPTEEVTLAEVLKDRG 149

Query: 105 YSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-YLTYNDS----------------IH 147
           Y T  IGKWH+G N     P ++GFD  +   +G +L   D                   
Sbjct: 150 YHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASGLFLPPGDPRGVEARLDFDPIDKFLWA 208

Query: 148 ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
             DFA   +     E         YLTD++TD+S+ VI + N +RP FL + H  VHT
Sbjct: 209 RMDFAASYNGSDWFE------PGGYLTDYWTDESLKVIDA-NKNRPFFLYLAHWGVHT 259


>gi|429093555|ref|ZP_19156139.1| Arylsulfatase [Cronobacter dublinensis 1210]
 gi|426741528|emb|CCJ82252.1| Arylsulfatase [Cronobacter dublinensis 1210]
          Length = 502

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 101/197 (51%), Gaps = 7/197 (3%)

Query: 23  GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
           G+ D G +G   + TPNID+LA  G+    +Y   P C+PSRA  LTG+ PFR GI + +
Sbjct: 47  GYGDTGIYGHPIVKTPNIDSLAQQGMRFTEYYAPAPLCSPSRAGLLTGRTPFRTGIRSWI 106

Query: 82  GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHI--GCNK-EELLPFNRGFDNHVGYWN 137
            +G    A+   EK +  YLKE GY T ++GK H+  G ++ ++    + GFD  +    
Sbjct: 107 PSGGKNVALGRNEKTIASYLKEQGYDTAMMGKLHLNAGADRTDQPQAKDMGFDYSLVNAA 166

Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYA-PQMSSKYLT-DFFTDQSVHVIKSHNHSRPLF 195
           G++T +    +T    G+       R   P  + K ++ +  + +++H + S   ++P F
Sbjct: 167 GFVTSDLDKVKTRPRYGVVYPNGFYRNGQPIGTVKQMSGELVSSEAIHWLDSRKDNKPFF 226

Query: 196 LQITHAAVHTGTAGNAK 212
           L +    VHT  A   K
Sbjct: 227 LYVAFTEVHTPLASPQK 243


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.138    0.431 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,355,981,890
Number of Sequences: 23463169
Number of extensions: 198194790
Number of successful extensions: 386081
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3408
Number of HSP's successfully gapped in prelim test: 5416
Number of HSP's that attempted gapping in prelim test: 369900
Number of HSP's gapped (non-prelim): 9560
length of query: 242
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 104
effective length of database: 9,121,278,045
effective search space: 948612916680
effective search space used: 948612916680
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)